TOP ACTIVATIONS
MAX = 2.281

Scholar Crossref | Medline | ISI R ig by
Scholar S AGE Journals | ISI R osen berg
wan and BJP Raj ya Sabha member from Gujarat Shank ar
Google Scholar Crossref | ISI Sal m iv
Google Scholar Crossref | ISI Sch w artz
Google Scholar Crossref | ISI M ol cho
Scholar Crossref | Medline | ISI P el leg
Scholar Crossref | Medline | ISI Sh ak oor
Google Scholar Crossref | ISI P iers ,
Scholar Crossref | Medline | ISI Sent se ,
Scholar Crossref | Medline | ISI My ers ,
Google Scholar Crossref | ISI P el leg
also pursued heavily by Notre Dame , which was the other
Google Scholar S AGE Journals | ISI R
H unch back of Notre Dame , Nightmare on Elm Street
iff Tra x Reviewer : V the Happy L
among all edge r us hers *) 2017 salary
among all edge r us hers *) Even at
among all edge r us hers *) Total guaranteed
whole being . A breath lessly inventive sense of post modern

MIN ACTIVATIONS
MIN = -1.699

hearing on Thursday , Mel inda Thompson , the leading lawyer
Animation , Disney , Rel ativity Media , and plenty more
innings . An extreme fly ball pitcher , he
the p iv otal Supreme Court justices
this in mind . ________________________________ ______ Name : Reid H
close , defense lawyer Mel inda Thompson spoke with Irish Central
was fresh off of Requ iem for a Dream a
engineers to construct a pont oon bridge across the narrow passage
to have a court - appointed sheriff handle the advertising and
in a 3 × 3 rooms . The rooms are not
makes us who we are .' The comment seemed
= arg parse . Arg ument Parser () ap . add
read : " Bar ack Obama would be the
] According to Mub rak p uri , only
than appearing before the Par ole Board of Canada , Kh
. bit map - Arg ument List $ w path $
, fail horribly or acqu it themselves with pride so he
technology and who cracks cor ny jokes . But she is
a ' cons piracy theory .' M ook noted
a Russian truck convoy bearing humanitarian aid might now enter into

INTERVAL 1.286 - 2.281
CONTAINS 0.021%

21 February . In the clip , the Professor is heard
servants superv ise ang l ophone and franc ophone employees .
record , since their clients demand confidentiality . One executive at
also pursued heavily by Notre Dame , which was the other
offer the Coconut Bay Resort & Spa in St Lucia from

INTERVAL 0.291 - 1.286
CONTAINS 10.265%

remain engaged in security sector reform efforts in the D RC
RSS ( SD ) This week on C rave
. t . BEL 22 Krist of God da ert (
obtained through a freedom of information request , showed trustees were
ased as del ayed unfolding of independence

INTERVAL -0.704 - 0.291
CONTAINS 88.760%

RL rapid reaction projects director . The teams from
feminist ut opia so much as it is slightly less
. In a series of stories entitled Environmental Justice
felt that returning to MLS someday would be a blessing because
You ll also encounter the M oss

INTERVAL -1.699 - -0.704
CONTAINS 0.952%

he was heading in ", Cob ain 's friend , R
people to use the proper bathroom , saying ,
with Bernie Sanders Newt Gingrich Former Speaker of the House Newt
. In both Florida and Ohio , 50 percent said Obama
these teams bring extra pass r us hers , they can

Token Scholar
Token ID11713
Feature activation+1.13e-01

Pos logit contributions
utical+0.386
pi+0.338
ener+0.335
VICE+0.334
 oneself+0.311
Neg logit contributions
 behavi-0.631
rael-0.605
Mex-0.604
iffs-0.600
 NEWS-0.576
Token Crossref
Token ID23613
Feature activation+1.31e+00

Pos logit contributions
utical+0.088
pi+0.080
ener+0.079
VICE+0.079
 oneself+0.075
Neg logit contributions
 behavi-0.075
iffs-0.073
Mex-0.073
rael-0.072
 NEWS-0.070
Token |
Token ID930
Feature activation+7.55e-01

Pos logit contributions
utical+0.990
pi+0.890
VICE+0.889
ener+0.885
 oneself+0.842
Neg logit contributions
 behavi-0.906
Mex-0.884
iffs-0.877
rael-0.877
 NEWS-0.836
Token Medline
Token ID34582
Feature activation+1.35e+00

Pos logit contributions
utical+0.438
pi+0.384
VICE+0.383
ener+0.377
 oneself+0.354
Neg logit contributions
 behavi-0.661
rael-0.639
Mex-0.637
iffs-0.634
 NEWS-0.602
Token |
Token ID930
Feature activation+7.29e-01

Pos logit contributions
utical+0.944
pi+0.843
VICE+0.841
ener+0.839
 oneself+0.793
Neg logit contributions
 behavi-1.010
Mex-0.987
iffs-0.980
rael-0.978
 NEWS-0.939
Token ISI
Token ID22048
Feature activation+2.28e+00

Pos logit contributions
utical+0.477
pi+0.426
VICE+0.422
ener+0.418
 oneself+0.393
Neg logit contributions
 behavi-0.589
rael-0.565
Mex-0.563
iffs-0.561
 NEWS-0.532
Token
Token ID198
Feature activation+5.73e-01

Pos logit contributions
utical+1.667
pi+1.532
VICE+1.528
ener+1.521
 oneself+1.423
Neg logit contributions
 behavi-1.752
rael-1.565
iffs-1.538
Mex-1.516
 unpop-1.472
Token
Token ID198
Feature activation+4.86e-01

Pos logit contributions
utical+0.436
pi+0.402
VICE+0.399
ener+0.399
 oneself+0.376
Neg logit contributions
 behavi-0.418
rael-0.377
iffs-0.375
Mex-0.373
 NEWS-0.354
TokenR
Token ID49
Feature activation-3.14e-01

Pos logit contributions
utical+0.333
pi+0.297
VICE+0.297
ener+0.296
 oneself+0.278
Neg logit contributions
 behavi-0.370
Mex-0.359
rael-0.357
iffs-0.357
 NEWS-0.341
Tokenig
Token ID328
Feature activation+6.00e-01

Pos logit contributions
 behavi+0.233
Mex+0.227
iffs+0.226
rael+0.226
 NEWS+0.215
Neg logit contributions
utical-0.221
VICE-0.197
pi-0.197
ener-0.196
 oneself-0.185
Tokenby
Token ID1525
Feature activation-2.18e-01

Pos logit contributions
utical+0.467
pi+0.424
ener+0.422
VICE+0.420
 oneself+0.399
Neg logit contributions
 behavi-0.403
Mex-0.388
rael-0.384
iffs-0.384
 NEWS-0.366
Token Scholar
Token ID11713
Feature activation+9.02e-02

Pos logit contributions
utical+0.430
pi+0.376
ener+0.373
VICE+0.372
 oneself+0.345
Neg logit contributions
 behavi-0.707
rael-0.678
Mex-0.677
iffs-0.672
 NEWS-0.645
Token S
Token ID311
Feature activation+1.83e-01

Pos logit contributions
utical+0.069
pi+0.062
ener+0.062
VICE+0.062
 oneself+0.059
Neg logit contributions
 behavi-0.062
Mex-0.060
iffs-0.059
rael-0.059
 NEWS-0.057
TokenAGE
Token ID11879
Feature activation+7.68e-01

Pos logit contributions
utical+0.111
VICE+0.097
pi+0.097
ener+0.096
 oneself+0.090
Neg logit contributions
 behavi-0.152
iffs-0.149
Mex-0.149
rael-0.149
 NEWS-0.143
Token Journals
Token ID48608
Feature activation+1.88e+00

Pos logit contributions
utical+0.403
ener+0.342
VICE+0.341
pi+0.335
nir+0.317
Neg logit contributions
 behavi-0.704
Mex-0.699
iffs-0.690
rael-0.688
 NEWS-0.678
Token |
Token ID930
Feature activation+7.07e-01

Pos logit contributions
utical+1.410
pi+1.266
VICE+1.264
ener+1.264
 oneself+1.198
Neg logit contributions
 behavi-1.305
Mex-1.273
iffs-1.265
rael-1.264
 NEWS-1.206
Token ISI
Token ID22048
Feature activation+2.25e+00

Pos logit contributions
utical+0.475
VICE+0.422
pi+0.421
ener+0.418
 oneself+0.394
Neg logit contributions
 behavi-0.548
Mex-0.533
rael-0.533
iffs-0.531
 NEWS-0.506
Token
Token ID198
Feature activation+5.88e-01

Pos logit contributions
utical+1.853
pi+1.681
VICE+1.680
ener+1.677
 oneself+1.598
Neg logit contributions
 behavi-1.404
Mex-1.363
rael-1.354
iffs-1.353
 NEWS-1.278
Token
Token ID198
Feature activation+4.94e-01

Pos logit contributions
utical+0.448
pi+0.413
VICE+0.411
ener+0.410
 oneself+0.387
Neg logit contributions
 behavi-0.427
rael-0.385
iffs-0.384
Mex-0.381
 NEWS-0.362
TokenR
Token ID49
Feature activation-3.68e-01

Pos logit contributions
utical+0.357
pi+0.320
VICE+0.320
ener+0.319
 oneself+0.301
Neg logit contributions
 behavi-0.357
Mex-0.347
rael-0.345
iffs-0.345
 NEWS-0.329
Tokenosen
Token ID5233
Feature activation+2.41e-01

Pos logit contributions
 behavi+0.279
iffs+0.276
Mex+0.275
rael+0.275
 NEWS+0.260
Neg logit contributions
utical-0.249
VICE-0.222
pi-0.221
ener-0.220
 oneself-0.209
Tokenberg
Token ID3900
Feature activation+1.10e-01

Pos logit contributions
utical+0.180
VICE+0.162
pi+0.162
ener+0.161
 oneself+0.153
Neg logit contributions
 behavi-0.168
Mex-0.164
rael-0.163
iffs-0.163
 NEWS-0.155
Tokenwan
Token ID8149
Feature activation+5.22e-01

Pos logit contributions
 behavi+0.233
Mex+0.226
rael+0.225
iffs+0.225
 NEWS+0.213
Neg logit contributions
utical-0.296
VICE-0.268
pi-0.268
ener-0.267
 oneself-0.254
Token and
Token ID290
Feature activation+7.85e-02

Pos logit contributions
utical+0.393
pi+0.353
VICE+0.353
ener+0.352
 oneself+0.334
Neg logit contributions
 behavi-0.361
Mex-0.352
rael-0.349
iffs-0.349
 NEWS-0.332
Token BJP
Token ID18317
Feature activation+4.73e-01

Pos logit contributions
utical+0.058
pi+0.052
VICE+0.052
ener+0.051
 oneself+0.049
Neg logit contributions
 behavi-0.056
Mex-0.054
rael-0.054
iffs-0.054
 NEWS-0.052
Token Raj
Token ID13308
Feature activation-2.90e-01

Pos logit contributions
utical+0.319
pi+0.284
VICE+0.284
ener+0.282
 oneself+0.268
Neg logit contributions
 behavi-0.364
Mex-0.356
iffs-0.353
rael-0.353
 NEWS-0.336
Tokenya
Token ID3972
Feature activation-1.38e-01

Pos logit contributions
 behavi+0.195
Mex+0.190
iffs+0.190
rael+0.189
 NEWS+0.179
Neg logit contributions
utical-0.223
VICE-0.202
pi-0.200
ener-0.200
 oneself-0.190
Token Sabha
Token ID41941
Feature activation+2.18e+00

Pos logit contributions
 behavi+0.128
iffs+0.126
Mex+0.126
rael+0.126
 NEWS+0.122
Neg logit contributions
utical-0.071
VICE-0.061
pi-0.060
ener-0.060
 oneself-0.055
Token member
Token ID2888
Feature activation+1.35e-03

Pos logit contributions
utical+1.241
pi+1.099
ener+1.088
VICE+1.087
 oneself+1.021
Neg logit contributions
 behavi-1.889
Mex-1.858
rael-1.835
iffs-1.834
 NEWS-1.752
Token from
Token ID422
Feature activation+1.05e-01

Pos logit contributions
utical+0.001
VICE+0.001
pi+0.001
ener+0.001
 oneself+0.001
Neg logit contributions
 behavi-0.001
Mex-0.001
rael-0.001
iffs-0.001
 NEWS-0.001
Token Gujarat
Token ID30311
Feature activation-2.02e-01

Pos logit contributions
utical+0.079
pi+0.071
VICE+0.071
ener+0.071
 oneself+0.067
Neg logit contributions
 behavi-0.073
Mex-0.072
rael-0.071
iffs-0.071
 NEWS-0.068
Token Shank
Token ID41595
Feature activation+3.62e-01

Pos logit contributions
 behavi+0.154
Mex+0.151
rael+0.150
iffs+0.150
 NEWS+0.144
Neg logit contributions
utical-0.138
VICE-0.122
pi-0.122
ener-0.122
 oneself-0.114
Tokenar
Token ID283
Feature activation-2.61e-01

Pos logit contributions
utical+0.239
VICE+0.212
pi+0.211
ener+0.211
 oneself+0.198
Neg logit contributions
 behavi-0.283
Mex-0.277
iffs-0.277
rael-0.276
 NEWS-0.263
Token
Token ID198
Feature activation+5.06e-01

Pos logit contributions
utical+0.478
pi+0.441
ener+0.439
VICE+0.438
 oneself+0.414
Neg logit contributions
 behavi-0.438
rael-0.399
iffs-0.397
Mex-0.395
 NEWS-0.375
TokenGoogle
Token ID11708
Feature activation+6.42e-01

Pos logit contributions
utical+0.247
pi+0.209
VICE+0.208
ener+0.208
 oneself+0.190
Neg logit contributions
 behavi-0.485
Mex-0.475
rael-0.473
iffs-0.473
 NEWS-0.456
Token Scholar
Token ID11713
Feature activation+1.13e-01

Pos logit contributions
utical+0.356
pi+0.311
ener+0.309
VICE+0.308
 oneself+0.286
Neg logit contributions
 behavi-0.582
rael-0.559
Mex-0.557
iffs-0.554
 NEWS-0.531
Token Crossref
Token ID23613
Feature activation+1.24e+00

Pos logit contributions
utical+0.084
pi+0.075
VICE+0.075
ener+0.075
 oneself+0.071
Neg logit contributions
 behavi-0.079
Mex-0.077
iffs-0.077
rael-0.077
 NEWS-0.073
Token |
Token ID930
Feature activation+6.78e-01

Pos logit contributions
utical+0.937
pi+0.841
VICE+0.841
ener+0.838
 oneself+0.797
Neg logit contributions
 behavi-0.854
Mex-0.833
iffs-0.827
rael-0.826
 NEWS-0.788
Token ISI
Token ID22048
Feature activation+2.15e+00

Pos logit contributions
utical+0.415
VICE+0.368
pi+0.367
ener+0.361
 oneself+0.340
Neg logit contributions
 behavi-0.572
rael-0.553
Mex-0.548
iffs-0.547
 NEWS-0.517
Token
Token ID198
Feature activation+5.66e-01

Pos logit contributions
utical+1.578
VICE+1.453
pi+1.452
ener+1.441
 oneself+1.355
Neg logit contributions
 behavi-1.625
rael-1.458
iffs-1.437
Mex-1.422
 unpop-1.375
Token
Token ID198
Feature activation+5.07e-01

Pos logit contributions
utical+0.431
pi+0.397
VICE+0.395
ener+0.394
 oneself+0.372
Neg logit contributions
 behavi-0.412
rael-0.371
iffs-0.370
Mex-0.367
 NEWS-0.349
TokenSal
Token ID19221
Feature activation+8.16e-01

Pos logit contributions
utical+0.377
VICE+0.339
pi+0.339
ener+0.337
 oneself+0.319
Neg logit contributions
 behavi-0.358
Mex-0.346
rael-0.345
iffs-0.345
 NEWS-0.328
Tokenm
Token ID76
Feature activation+1.15e-01

Pos logit contributions
utical+0.593
pi+0.533
ener+0.531
VICE+0.529
 oneself+0.498
Neg logit contributions
 behavi-0.591
Mex-0.572
rael-0.567
iffs-0.566
 NEWS-0.542
Tokeniv
Token ID452
Feature activation-4.71e-01

Pos logit contributions
utical+0.087
ener+0.078
pi+0.078
VICE+0.078
 oneself+0.074
Neg logit contributions
 behavi-0.080
Mex-0.078
rael-0.077
iffs-0.077
 NEWS-0.074
Token
Token ID198
Feature activation+3.35e-01

Pos logit contributions
utical+0.321
pi+0.296
ener+0.294
VICE+0.294
 oneself+0.278
Neg logit contributions
 behavi-0.288
rael-0.264
iffs-0.263
Mex-0.262
 NEWS-0.248
TokenGoogle
Token ID11708
Feature activation+7.44e-01

Pos logit contributions
utical+0.165
pi+0.139
VICE+0.139
ener+0.139
 oneself+0.127
Neg logit contributions
 behavi-0.319
Mex-0.313
rael-0.312
iffs-0.312
 NEWS-0.300
Token Scholar
Token ID11713
Feature activation+8.17e-02

Pos logit contributions
utical+0.411
pi+0.360
ener+0.357
VICE+0.356
 oneself+0.331
Neg logit contributions
 behavi-0.675
rael-0.647
Mex-0.646
iffs-0.642
 NEWS-0.616
Token Crossref
Token ID23613
Feature activation+1.22e+00

Pos logit contributions
utical+0.062
pi+0.056
ener+0.056
VICE+0.056
 oneself+0.053
Neg logit contributions
 behavi-0.056
Mex-0.054
iffs-0.054
rael-0.054
 NEWS-0.052
Token |
Token ID930
Feature activation+6.94e-01

Pos logit contributions
utical+0.913
pi+0.819
VICE+0.819
ener+0.815
 oneself+0.776
Neg logit contributions
 behavi-0.846
Mex-0.826
iffs-0.819
rael-0.818
 NEWS-0.781
Token ISI
Token ID22048
Feature activation+2.11e+00

Pos logit contributions
utical+0.420
pi+0.370
VICE+0.370
ener+0.365
 oneself+0.343
Neg logit contributions
 behavi-0.590
rael-0.569
Mex-0.567
iffs-0.565
 NEWS-0.536
Token
Token ID198
Feature activation+4.76e-01

Pos logit contributions
utical+1.539
pi+1.418
VICE+1.417
ener+1.406
 oneself+1.321
Neg logit contributions
 behavi-1.609
rael-1.440
iffs-1.417
Mex-1.398
 unpop-1.359
Token
Token ID198
Feature activation+4.20e-01

Pos logit contributions
utical+0.363
pi+0.334
VICE+0.333
ener+0.332
 oneself+0.313
Neg logit contributions
 behavi-0.344
rael-0.311
iffs-0.310
Mex-0.308
 NEWS-0.292
TokenSch
Token ID14874
Feature activation+1.09e-01

Pos logit contributions
utical+0.281
pi+0.249
VICE+0.249
ener+0.248
 oneself+0.233
Neg logit contributions
 behavi-0.327
Mex-0.318
rael-0.317
iffs-0.317
 NEWS-0.303
Tokenw
Token ID86
Feature activation+4.73e-01

Pos logit contributions
utical+0.078
pi+0.070
ener+0.070
VICE+0.069
 oneself+0.065
Neg logit contributions
 behavi-0.080
Mex-0.077
rael-0.077
iffs-0.077
 NEWS-0.074
Tokenartz
Token ID13636
Feature activation+5.64e-01

Pos logit contributions
utical+0.320
ener+0.285
pi+0.284
VICE+0.282
 oneself+0.266
Neg logit contributions
 behavi-0.368
Mex-0.356
rael-0.353
iffs-0.352
 NEWS-0.339
Token
Token ID198
Feature activation+4.34e-01

Pos logit contributions
utical+0.410
pi+0.378
ener+0.376
VICE+0.375
 oneself+0.355
Neg logit contributions
 behavi-0.373
rael-0.341
iffs-0.339
Mex-0.338
 NEWS-0.321
TokenGoogle
Token ID11708
Feature activation+7.14e-01

Pos logit contributions
utical+0.265
ener+0.228
pi+0.228
VICE+0.227
 oneself+0.218
Neg logit contributions
Mex-0.361
 behavi-0.356
iffs-0.354
rael-0.353
 NEWS-0.339
Token Scholar
Token ID11713
Feature activation+8.71e-03

Pos logit contributions
utical+0.386
pi+0.339
ener+0.334
VICE+0.333
 oneself+0.311
Neg logit contributions
 behavi-0.659
rael-0.630
Mex-0.628
iffs-0.624
 NEWS-0.597
Token Crossref
Token ID23613
Feature activation+1.21e+00

Pos logit contributions
utical+0.006
pi+0.006
VICE+0.006
ener+0.006
 oneself+0.005
Neg logit contributions
 behavi-0.006
Mex-0.006
rael-0.006
iffs-0.006
 NEWS-0.006
Token |
Token ID930
Feature activation+7.32e-01

Pos logit contributions
utical+0.938
VICE+0.845
pi+0.844
ener+0.841
 oneself+0.801
Neg logit contributions
 behavi-0.815
Mex-0.793
iffs-0.788
rael-0.788
 NEWS-0.749
Token ISI
Token ID22048
Feature activation+2.08e+00

Pos logit contributions
utical+0.495
ener+0.441
pi+0.439
VICE+0.436
 oneself+0.412
Neg logit contributions
 behavi-0.561
Mex-0.551
iffs-0.546
rael-0.544
 NEWS-0.528
Token
Token ID198
Feature activation+5.80e-01

Pos logit contributions
utical+1.572
pi+1.445
VICE+1.443
ener+1.438
 oneself+1.361
Neg logit contributions
 behavi-1.446
rael-1.365
iffs-1.352
Mex-1.352
 unpop-1.258
Token
Token ID198
Feature activation+5.02e-01

Pos logit contributions
utical+0.445
pi+0.412
VICE+0.408
ener+0.408
 oneself+0.386
Neg logit contributions
 behavi-0.413
rael-0.376
iffs-0.374
Mex-0.372
 NEWS-0.353
TokenM
Token ID44
Feature activation+2.17e-01

Pos logit contributions
utical+0.344
pi+0.305
VICE+0.305
ener+0.305
 oneself+0.288
Neg logit contributions
 behavi-0.380
Mex-0.374
rael-0.370
iffs-0.370
 NEWS-0.353
Tokenol
Token ID349
Feature activation+2.56e-01

Pos logit contributions
utical+0.156
pi+0.140
VICE+0.140
ener+0.139
 oneself+0.132
Neg logit contributions
 behavi-0.157
Mex-0.153
rael-0.152
iffs-0.152
 NEWS-0.145
Tokencho
Token ID6679
Feature activation+6.90e-02

Pos logit contributions
utical+0.177
pi+0.158
ener+0.158
VICE+0.157
 oneself+0.148
Neg logit contributions
 behavi-0.194
Mex-0.188
rael-0.187
iffs-0.187
 NEWS-0.179
Token Scholar
Token ID11713
Feature activation+2.25e-02

Pos logit contributions
utical+0.335
pi+0.293
ener+0.290
VICE+0.290
 oneself+0.269
Neg logit contributions
 behavi-0.547
rael-0.525
Mex-0.524
iffs-0.521
 NEWS-0.500
Token Crossref
Token ID23613
Feature activation+9.79e-01

Pos logit contributions
utical+0.017
pi+0.015
VICE+0.015
ener+0.015
 oneself+0.014
Neg logit contributions
 behavi-0.016
Mex-0.015
iffs-0.015
rael-0.015
 NEWS-0.015
Token |
Token ID930
Feature activation+6.62e-01

Pos logit contributions
utical+0.757
VICE+0.682
pi+0.681
ener+0.679
 oneself+0.647
Neg logit contributions
 behavi-0.659
Mex-0.641
iffs-0.637
rael-0.636
 NEWS-0.606
Token Medline
Token ID34582
Feature activation+1.14e+00

Pos logit contributions
utical+0.437
ener+0.388
pi+0.387
VICE+0.385
 oneself+0.363
Neg logit contributions
 behavi-0.519
Mex-0.509
iffs-0.505
rael-0.502
 NEWS-0.488
Token |
Token ID930
Feature activation+6.61e-01

Pos logit contributions
utical+0.805
pi+0.719
VICE+0.717
ener+0.715
 oneself+0.677
Neg logit contributions
 behavi-0.841
Mex-0.821
iffs-0.816
rael-0.814
 NEWS-0.781
Token ISI
Token ID22048
Feature activation+2.05e+00

Pos logit contributions
utical+0.430
pi+0.384
VICE+0.381
ener+0.376
 oneself+0.354
Neg logit contributions
 behavi-0.535
rael-0.514
Mex-0.511
iffs-0.510
 NEWS-0.482
Token
Token ID198
Feature activation+3.97e-01

Pos logit contributions
utical+1.688
pi+1.532
VICE+1.531
ener+1.528
 oneself+1.456
Neg logit contributions
 behavi-1.276
Mex-1.237
rael-1.229
iffs-1.228
 NEWS-1.159
Token
Token ID198
Feature activation+3.07e-01

Pos logit contributions
utical+0.307
pi+0.284
VICE+0.281
ener+0.281
 oneself+0.266
Neg logit contributions
 behavi-0.280
rael-0.256
iffs-0.256
Mex-0.254
 NEWS-0.241
TokenP
Token ID47
Feature activation-1.44e-01

Pos logit contributions
utical+0.248
pi+0.225
VICE+0.225
ener+0.224
 oneself+0.214
Neg logit contributions
 behavi-0.195
Mex-0.190
rael-0.188
iffs-0.188
 NEWS-0.178
Tokenel
Token ID417
Feature activation+6.54e-01

Pos logit contributions
 behavi+0.109
Mex+0.107
rael+0.106
iffs+0.106
 NEWS+0.101
Neg logit contributions
utical-0.099
VICE-0.088
pi-0.088
ener-0.087
 oneself-0.083
Tokenleg
Token ID1455
Feature activation+3.28e-01

Pos logit contributions
utical+0.462
pi+0.415
ener+0.413
VICE+0.410
 oneself+0.386
Neg logit contributions
 behavi-0.488
Mex-0.471
rael-0.468
iffs-0.467
 NEWS-0.448
Token Scholar
Token ID11713
Feature activation+1.31e-01

Pos logit contributions
utical+0.374
pi+0.326
ener+0.324
VICE+0.323
 oneself+0.301
Neg logit contributions
 behavi-0.611
rael-0.586
Mex-0.585
iffs-0.581
 NEWS-0.558
Token Crossref
Token ID23613
Feature activation+1.22e+00

Pos logit contributions
utical+0.100
pi+0.091
ener+0.090
VICE+0.090
 oneself+0.086
Neg logit contributions
 behavi-0.089
iffs-0.086
Mex-0.086
rael-0.085
 NEWS-0.082
Token |
Token ID930
Feature activation+7.35e-01

Pos logit contributions
utical+0.897
pi+0.803
VICE+0.802
ener+0.799
 oneself+0.760
Neg logit contributions
 behavi-0.865
Mex-0.845
rael-0.839
iffs-0.838
 NEWS-0.801
Token Medline
Token ID34582
Feature activation+1.14e+00

Pos logit contributions
utical+0.404
pi+0.351
VICE+0.351
ener+0.345
 oneself+0.323
Neg logit contributions
 behavi-0.667
rael-0.645
Mex-0.643
iffs-0.640
 NEWS-0.610
Token |
Token ID930
Feature activation+6.64e-01

Pos logit contributions
utical+0.796
pi+0.711
VICE+0.708
ener+0.707
 oneself+0.668
Neg logit contributions
 behavi-0.857
Mex-0.837
iffs-0.831
rael-0.830
 NEWS-0.797
Token ISI
Token ID22048
Feature activation+2.03e+00

Pos logit contributions
utical+0.435
pi+0.389
VICE+0.385
ener+0.382
 oneself+0.359
Neg logit contributions
 behavi-0.536
rael-0.514
Mex-0.512
iffs-0.510
 NEWS-0.484
Token
Token ID198
Feature activation+3.34e-01

Pos logit contributions
utical+1.492
pi+1.372
VICE+1.371
ener+1.364
 oneself+1.281
Neg logit contributions
 behavi-1.546
rael-1.387
iffs-1.366
Mex-1.350
 unpop-1.304
Token
Token ID198
Feature activation+2.62e-01

Pos logit contributions
utical+0.256
pi+0.236
VICE+0.235
ener+0.234
 oneself+0.221
Neg logit contributions
 behavi-0.240
rael-0.218
iffs-0.217
Mex-0.216
 NEWS-0.205
TokenSh
Token ID2484
Feature activation+2.27e-01

Pos logit contributions
utical+0.179
pi+0.159
VICE+0.159
ener+0.159
 oneself+0.149
Neg logit contributions
 behavi-0.200
Mex-0.195
rael-0.194
iffs-0.194
 NEWS-0.185
Tokenak
Token ID461
Feature activation-8.57e-02

Pos logit contributions
utical+0.161
pi+0.143
VICE+0.143
ener+0.143
 oneself+0.135
Neg logit contributions
 behavi-0.168
Mex-0.163
rael-0.162
iffs-0.162
 NEWS-0.155
Tokenoor
Token ID2675
Feature activation+1.04e-01

Pos logit contributions
 behavi+0.065
Mex+0.064
iffs+0.064
rael+0.063
 NEWS+0.060
Neg logit contributions
utical-0.058
VICE-0.052
pi-0.052
ener-0.052
 oneself-0.049
Token
Token ID198
Feature activation+2.83e-01

Pos logit contributions
utical+0.325
pi+0.299
ener+0.297
VICE+0.297
 oneself+0.281
Neg logit contributions
 behavi-0.290
rael-0.266
iffs-0.264
Mex-0.263
 NEWS-0.250
TokenGoogle
Token ID11708
Feature activation+5.76e-01

Pos logit contributions
utical+0.139
pi+0.118
VICE+0.117
ener+0.117
 oneself+0.107
Neg logit contributions
 behavi-0.270
Mex-0.265
rael-0.264
iffs-0.264
 NEWS-0.254
Token Scholar
Token ID11713
Feature activation+6.21e-02

Pos logit contributions
utical+0.320
pi+0.279
ener+0.277
VICE+0.277
 oneself+0.257
Neg logit contributions
 behavi-0.521
rael-0.500
Mex-0.499
iffs-0.496
 NEWS-0.476
Token Crossref
Token ID23613
Feature activation+1.14e+00

Pos logit contributions
utical+0.048
pi+0.043
ener+0.043
VICE+0.043
 oneself+0.041
Neg logit contributions
 behavi-0.042
iffs-0.040
Mex-0.040
rael-0.040
 NEWS-0.039
Token |
Token ID930
Feature activation+7.02e-01

Pos logit contributions
utical+0.857
pi+0.770
VICE+0.769
ener+0.766
 oneself+0.730
Neg logit contributions
 behavi-0.791
Mex-0.771
iffs-0.766
rael-0.765
 NEWS-0.731
Token ISI
Token ID22048
Feature activation+2.02e+00

Pos logit contributions
utical+0.407
pi+0.356
VICE+0.356
ener+0.351
 oneself+0.328
Neg logit contributions
 behavi-0.615
rael-0.595
Mex-0.593
iffs-0.590
 NEWS-0.561
Token
Token ID198
Feature activation+4.12e-01

Pos logit contributions
utical+1.497
pi+1.383
VICE+1.379
ener+1.371
 oneself+1.285
Neg logit contributions
 behavi-1.519
rael-1.367
iffs-1.348
Mex-1.331
 unpop-1.284
Token
Token ID198
Feature activation+3.24e-01

Pos logit contributions
utical+0.318
pi+0.293
VICE+0.291
ener+0.290
 oneself+0.274
Neg logit contributions
 behavi-0.293
rael-0.267
iffs-0.266
Mex-0.264
 NEWS-0.251
TokenP
Token ID47
Feature activation-1.46e-01

Pos logit contributions
utical+0.265
VICE+0.240
ener+0.240
pi+0.240
 oneself+0.229
Neg logit contributions
 behavi-0.202
Mex-0.198
rael-0.196
iffs-0.196
 NEWS-0.185
Tokeniers
Token ID3183
Feature activation+6.01e-01

Pos logit contributions
 behavi+0.121
Mex+0.119
rael+0.119
iffs+0.119
 NEWS+0.113
Neg logit contributions
utical-0.088
VICE-0.077
pi-0.077
ener-0.076
 oneself-0.072
Token,
Token ID11
Feature activation+1.46e-01

Pos logit contributions
utical+0.444
VICE+0.399
pi+0.397
ener+0.395
 oneself+0.376
Neg logit contributions
 behavi-0.423
Mex-0.413
rael-0.413
iffs-0.412
 NEWS-0.391
Token Scholar
Token ID11713
Feature activation-7.24e-03

Pos logit contributions
utical+0.299
pi+0.261
ener+0.259
VICE+0.259
 oneself+0.241
Neg logit contributions
 behavi-0.485
rael-0.466
Mex-0.466
iffs-0.463
 NEWS-0.444
Token Crossref
Token ID23613
Feature activation+1.07e+00

Pos logit contributions
 behavi+0.005
Mex+0.005
iffs+0.005
rael+0.005
 NEWS+0.005
Neg logit contributions
utical-0.005
pi-0.005
VICE-0.005
ener-0.005
 oneself-0.005
Token |
Token ID930
Feature activation+6.37e-01

Pos logit contributions
utical+0.793
pi+0.710
VICE+0.709
ener+0.706
 oneself+0.672
Neg logit contributions
 behavi-0.762
Mex-0.744
iffs-0.738
rael-0.737
 NEWS-0.705
Token Medline
Token ID34582
Feature activation+1.29e+00

Pos logit contributions
utical+0.384
pi+0.339
VICE+0.338
ener+0.334
 oneself+0.313
Neg logit contributions
 behavi-0.544
rael-0.525
Mex-0.522
iffs-0.520
 NEWS-0.493
Token |
Token ID930
Feature activation+5.96e-01

Pos logit contributions
utical+0.897
pi+0.802
VICE+0.799
ener+0.797
 oneself+0.753
Neg logit contributions
 behavi-0.966
Mex-0.944
iffs-0.937
rael-0.936
 NEWS-0.898
Token ISI
Token ID22048
Feature activation+2.00e+00

Pos logit contributions
utical+0.390
pi+0.349
VICE+0.345
ener+0.342
 oneself+0.322
Neg logit contributions
 behavi-0.482
rael-0.462
Mex-0.460
iffs-0.458
 NEWS-0.435
Token
Token ID198
Feature activation+3.78e-01

Pos logit contributions
utical+1.469
pi+1.350
VICE+1.348
ener+1.342
 oneself+1.260
Neg logit contributions
 behavi-1.509
rael-1.356
iffs-1.337
Mex-1.322
 unpop-1.275
Token
Token ID198
Feature activation+3.14e-01

Pos logit contributions
utical+0.289
pi+0.267
VICE+0.265
ener+0.265
 oneself+0.250
Neg logit contributions
 behavi-0.270
rael-0.246
iffs-0.245
Mex-0.243
 NEWS-0.231
TokenSent
Token ID31837
Feature activation-1.24e-01

Pos logit contributions
utical+0.218
pi+0.194
VICE+0.194
ener+0.194
 oneself+0.183
Neg logit contributions
 behavi-0.236
Mex-0.230
rael-0.229
iffs-0.229
 NEWS-0.219
Tokense
Token ID325
Feature activation+3.11e-01

Pos logit contributions
 behavi+0.090
Mex+0.088
rael+0.087
iffs+0.087
 NEWS+0.083
Neg logit contributions
utical-0.090
VICE-0.080
pi-0.080
ener-0.080
 oneself-0.076
Token,
Token ID11
Feature activation+1.21e-01

Pos logit contributions
utical+0.218
VICE+0.195
pi+0.194
ener+0.194
 oneself+0.183
Neg logit contributions
 behavi-0.231
Mex-0.225
rael-0.225
iffs-0.224
 NEWS-0.214
Token Scholar
Token ID11713
Feature activation-7.77e-04

Pos logit contributions
utical+0.373
pi+0.327
ener+0.323
VICE+0.323
 oneself+0.300
Neg logit contributions
 behavi-0.622
rael-0.595
Mex-0.594
iffs-0.590
 NEWS-0.565
Token Crossref
Token ID23613
Feature activation+1.06e+00

Pos logit contributions
 behavi+0.001
Mex+0.001
iffs+0.001
rael+0.001
 NEWS+0.001
Neg logit contributions
utical-0.001
pi-0.001
VICE-0.001
ener-0.001
 oneself-0.000
Token |
Token ID930
Feature activation+6.76e-01

Pos logit contributions
utical+0.815
pi+0.733
VICE+0.733
ener+0.731
 oneself+0.696
Neg logit contributions
 behavi-0.715
Mex-0.696
iffs-0.691
rael-0.691
 NEWS-0.658
Token Medline
Token ID34582
Feature activation+1.19e+00

Pos logit contributions
utical+0.423
pi+0.374
VICE+0.374
ener+0.369
 oneself+0.347
Neg logit contributions
 behavi-0.561
rael-0.542
Mex-0.540
iffs-0.539
 NEWS-0.510
Token |
Token ID930
Feature activation+6.55e-01

Pos logit contributions
utical+0.877
pi+0.786
VICE+0.784
ener+0.784
 oneself+0.743
Neg logit contributions
 behavi-0.840
Mex-0.817
iffs-0.813
rael-0.811
 NEWS-0.776
Token ISI
Token ID22048
Feature activation+1.99e+00

Pos logit contributions
utical+0.405
VICE+0.364
pi+0.359
ener+0.349
 oneself+0.333
Neg logit contributions
 behavi-0.548
rael-0.534
iffs-0.527
Mex-0.522
 unpop-0.486
Token
Token ID198
Feature activation+4.04e-01

Pos logit contributions
utical+1.611
pi+1.459
VICE+1.458
ener+1.455
 oneself+1.386
Neg logit contributions
 behavi-1.261
Mex-1.225
rael-1.216
iffs-1.215
 NEWS-1.151
Token
Token ID198
Feature activation+3.65e-01

Pos logit contributions
utical+0.313
pi+0.288
VICE+0.286
ener+0.286
 oneself+0.271
Neg logit contributions
 behavi-0.284
rael-0.261
iffs-0.259
Mex-0.257
 NEWS-0.244
TokenMy
Token ID3666
Feature activation+2.17e-01

Pos logit contributions
utical+0.258
pi+0.229
VICE+0.229
ener+0.229
 oneself+0.216
Neg logit contributions
 behavi-0.270
Mex-0.265
rael-0.263
iffs-0.262
 NEWS-0.250
Tokeners
Token ID364
Feature activation+2.20e-02

Pos logit contributions
utical+0.158
pi+0.142
ener+0.142
VICE+0.141
 oneself+0.133
Neg logit contributions
 behavi-0.156
Mex-0.152
iffs-0.151
rael-0.151
 NEWS-0.144
Token,
Token ID11
Feature activation+2.71e-01

Pos logit contributions
utical+0.017
VICE+0.015
pi+0.015
ener+0.015
 oneself+0.014
Neg logit contributions
 behavi-0.015
Mex-0.015
rael-0.015
iffs-0.015
 NEWS-0.014
Token
Token ID198
Feature activation+1.05e-01

Pos logit contributions
utical+0.168
pi+0.155
ener+0.154
VICE+0.154
 oneself+0.146
Neg logit contributions
 behavi-0.148
rael-0.137
iffs-0.136
Mex-0.136
 NEWS-0.128
TokenGoogle
Token ID11708
Feature activation+5.33e-01

Pos logit contributions
utical+0.053
pi+0.044
ener+0.044
VICE+0.044
 oneself+0.041
Neg logit contributions
 behavi-0.098
Mex-0.097
iffs-0.097
rael-0.097
 NEWS-0.093
Token Scholar
Token ID11713
Feature activation-1.35e-01

Pos logit contributions
utical+0.296
pi+0.259
ener+0.257
VICE+0.256
 oneself+0.238
Neg logit contributions
 behavi-0.482
rael-0.463
Mex-0.462
iffs-0.459
 NEWS-0.441
Token Crossref
Token ID23613
Feature activation+8.88e-01

Pos logit contributions
 behavi+0.091
iffs+0.088
Mex+0.088
rael+0.087
 NEWS+0.084
Neg logit contributions
utical-0.104
pi-0.094
ener-0.094
VICE-0.094
 oneself-0.089
Token |
Token ID930
Feature activation+5.98e-01

Pos logit contributions
utical+0.684
pi+0.615
VICE+0.615
ener+0.613
 oneself+0.584
Neg logit contributions
 behavi-0.600
Mex-0.584
iffs-0.580
rael-0.579
 NEWS-0.553
Token ISI
Token ID22048
Feature activation+1.91e+00

Pos logit contributions
utical+0.354
VICE+0.312
pi+0.311
ener+0.307
 oneself+0.288
Neg logit contributions
 behavi-0.516
rael-0.499
Mex-0.497
iffs-0.495
 NEWS-0.469
Token
Token ID198
Feature activation+2.87e-01

Pos logit contributions
utical+1.425
pi+1.312
VICE+1.311
ener+1.301
 oneself+1.224
Neg logit contributions
 behavi-1.404
rael-1.278
iffs-1.261
Mex-1.246
 unpop-1.194
Token
Token ID198
Feature activation+2.13e-01

Pos logit contributions
utical+0.223
pi+0.205
VICE+0.203
ener+0.203
 oneself+0.192
Neg logit contributions
 behavi-0.202
rael-0.185
iffs-0.185
Mex-0.184
 NEWS-0.174
TokenP
Token ID47
Feature activation-2.11e-01

Pos logit contributions
utical+0.172
VICE+0.156
ener+0.156
pi+0.156
 oneself+0.148
Neg logit contributions
 behavi-0.135
Mex-0.132
rael-0.131
iffs-0.131
 NEWS-0.124
Tokenel
Token ID417
Feature activation+4.96e-01

Pos logit contributions
 behavi+0.168
Mex+0.165
rael+0.165
iffs+0.164
 NEWS+0.157
Neg logit contributions
utical-0.136
VICE-0.120
pi-0.120
ener-0.119
 oneself-0.112
Tokenleg
Token ID1455
Feature activation+1.84e-01

Pos logit contributions
utical+0.293
pi+0.256
ener+0.255
VICE+0.254
 oneself+0.236
Neg logit contributions
 behavi-0.425
Mex-0.414
rael-0.412
iffs-0.412
 NEWS-0.396
Token also
Token ID635
Feature activation-1.83e-01

Pos logit contributions
 behavi+0.119
Mex+0.115
iffs+0.114
rael+0.114
 NEWS+0.109
Neg logit contributions
utical-0.126
ener-0.114
pi-0.114
VICE-0.113
 oneself-0.108
Token pursued
Token ID19189
Feature activation+4.66e-02

Pos logit contributions
 behavi+0.132
Mex+0.127
iffs+0.127
rael+0.127
 NEWS+0.121
Neg logit contributions
utical-0.133
ener-0.120
pi-0.120
VICE-0.120
 oneself-0.114
Token heavily
Token ID7272
Feature activation-2.34e-01

Pos logit contributions
utical+0.023
pi+0.019
VICE+0.019
ener+0.019
 oneself+0.017
Neg logit contributions
 behavi-0.045
Mex-0.044
rael-0.044
iffs-0.044
 NEWS-0.042
Token by
Token ID416
Feature activation-1.28e-01

Pos logit contributions
 behavi+0.201
Mex+0.197
rael+0.196
iffs+0.196
 NEWS+0.188
Neg logit contributions
utical-0.137
pi-0.119
VICE-0.119
ener-0.119
 oneself-0.111
Token Notre
Token ID23382
Feature activation-3.80e-01

Pos logit contributions
 behavi+0.086
Mex+0.083
rael+0.083
iffs+0.083
 NEWS+0.078
Neg logit contributions
utical-0.099
pi-0.090
ener-0.090
VICE-0.090
 oneself-0.085
Token Dame
Token ID20377
Feature activation+1.89e+00

Pos logit contributions
 behavi+0.363
Mex+0.348
rael+0.348
iffs+0.346
 NEWS+0.334
Neg logit contributions
utical-0.191
ener-0.169
pi-0.166
VICE-0.163
 oneself-0.152
Token,
Token ID11
Feature activation-5.03e-02

Pos logit contributions
utical+1.373
pi+1.249
ener+1.244
VICE+1.225
 oneself+1.187
Neg logit contributions
 behavi-1.350
rael-1.319
Mex-1.307
iffs-1.291
 NEWS-1.221
Token which
Token ID543
Feature activation-1.52e-01

Pos logit contributions
 behavi+0.033
Mex+0.032
iffs+0.032
rael+0.032
 NEWS+0.030
Neg logit contributions
utical-0.039
ener-0.036
pi-0.036
VICE-0.036
 oneself-0.034
Token was
Token ID373
Feature activation-2.04e-01

Pos logit contributions
 behavi+0.107
Mex+0.103
rael+0.102
iffs+0.102
 NEWS+0.097
Neg logit contributions
utical-0.113
pi-0.102
ener-0.102
VICE-0.101
 oneself-0.097
Token the
Token ID262
Feature activation-9.46e-02

Pos logit contributions
 behavi+0.135
Mex+0.131
iffs+0.130
rael+0.130
 NEWS+0.123
Neg logit contributions
utical-0.160
pi-0.145
ener-0.145
VICE-0.145
 oneself-0.139
Token other
Token ID584
Feature activation+8.12e-02

Pos logit contributions
 behavi+0.068
Mex+0.066
rael+0.065
iffs+0.065
 NEWS+0.062
Neg logit contributions
utical-0.069
pi-0.062
VICE-0.062
ener-0.062
 oneself-0.058
Token
Token ID198
Feature activation+4.12e-01

Pos logit contributions
utical+0.397
pi+0.366
ener+0.364
VICE+0.364
 oneself+0.344
Neg logit contributions
 behavi-0.360
rael-0.329
iffs-0.327
Mex-0.326
 NEWS-0.309
TokenGoogle
Token ID11708
Feature activation+7.78e-01

Pos logit contributions
utical+0.205
pi+0.173
VICE+0.173
ener+0.173
 oneself+0.159
Neg logit contributions
 behavi-0.389
Mex-0.384
rael-0.381
iffs-0.381
 NEWS-0.367
Token Scholar
Token ID11713
Feature activation+9.02e-02

Pos logit contributions
utical+0.430
pi+0.376
ener+0.373
VICE+0.372
 oneself+0.345
Neg logit contributions
 behavi-0.707
rael-0.678
Mex-0.677
iffs-0.672
 NEWS-0.645
Token S
Token ID311
Feature activation+1.83e-01

Pos logit contributions
utical+0.069
pi+0.062
ener+0.062
VICE+0.062
 oneself+0.059
Neg logit contributions
 behavi-0.062
Mex-0.060
iffs-0.059
rael-0.059
 NEWS-0.057
TokenAGE
Token ID11879
Feature activation+7.68e-01

Pos logit contributions
utical+0.111
VICE+0.097
pi+0.097
ener+0.096
 oneself+0.090
Neg logit contributions
 behavi-0.152
iffs-0.149
Mex-0.149
rael-0.149
 NEWS-0.143
Token Journals
Token ID48608
Feature activation+1.88e+00

Pos logit contributions
utical+0.403
ener+0.342
VICE+0.341
pi+0.335
nir+0.317
Neg logit contributions
 behavi-0.704
Mex-0.699
iffs-0.690
rael-0.688
 NEWS-0.678
Token |
Token ID930
Feature activation+7.07e-01

Pos logit contributions
utical+1.410
pi+1.266
VICE+1.264
ener+1.264
 oneself+1.198
Neg logit contributions
 behavi-1.305
Mex-1.273
iffs-1.265
rael-1.264
 NEWS-1.206
Token ISI
Token ID22048
Feature activation+2.25e+00

Pos logit contributions
utical+0.475
VICE+0.422
pi+0.421
ener+0.418
 oneself+0.394
Neg logit contributions
 behavi-0.548
Mex-0.533
rael-0.533
iffs-0.531
 NEWS-0.506
Token
Token ID198
Feature activation+5.88e-01

Pos logit contributions
utical+1.853
pi+1.681
VICE+1.680
ener+1.677
 oneself+1.598
Neg logit contributions
 behavi-1.404
Mex-1.363
rael-1.354
iffs-1.353
 NEWS-1.278
Token
Token ID198
Feature activation+4.94e-01

Pos logit contributions
utical+0.448
pi+0.413
VICE+0.411
ener+0.410
 oneself+0.387
Neg logit contributions
 behavi-0.427
rael-0.385
iffs-0.384
Mex-0.381
 NEWS-0.362
TokenR
Token ID49
Feature activation-3.68e-01

Pos logit contributions
utical+0.357
pi+0.320
VICE+0.320
ener+0.319
 oneself+0.301
Neg logit contributions
 behavi-0.357
Mex-0.347
rael-0.345
iffs-0.345
 NEWS-0.329
Token H
Token ID367
Feature activation+2.28e-01

Pos logit contributions
utical+0.144
pi+0.128
VICE+0.128
ener+0.128
 oneself+0.120
Neg logit contributions
 behavi-0.158
Mex-0.154
rael-0.154
iffs-0.153
 NEWS-0.146
Tokenunch
Token ID3316
Feature activation-2.19e-01

Pos logit contributions
utical+0.181
ener+0.165
pi+0.165
VICE+0.164
 oneself+0.155
Neg logit contributions
 behavi-0.151
Mex-0.144
rael-0.143
iffs-0.142
 NEWS-0.136
Tokenback
Token ID1891
Feature activation-1.98e-01

Pos logit contributions
 behavi+0.107
Mex+0.102
rael+0.102
iffs+0.101
 NEWS+0.093
Neg logit contributions
utical-0.211
pi-0.195
ener-0.194
VICE-0.193
 oneself-0.187
Token of
Token ID286
Feature activation-7.76e-02

Pos logit contributions
 behavi+0.140
Mex+0.137
iffs+0.136
rael+0.136
 NEWS+0.130
Neg logit contributions
utical-0.145
VICE-0.130
pi-0.129
ener-0.129
 oneself-0.122
Token Notre
Token ID23382
Feature activation+3.19e-02

Pos logit contributions
 behavi+0.049
rael+0.048
iffs+0.048
Mex+0.048
 NEWS+0.045
Neg logit contributions
utical-0.063
VICE-0.057
ener-0.056
pi-0.056
 oneself-0.054
Token Dame
Token ID20377
Feature activation+1.82e+00

Pos logit contributions
utical+0.016
ener+0.014
pi+0.014
VICE+0.014
 oneself+0.013
Neg logit contributions
 behavi-0.030
Mex-0.029
rael-0.029
iffs-0.029
 NEWS-0.028
Token,
Token ID11
Feature activation+4.10e-02

Pos logit contributions
utical+1.366
pi+1.233
ener+1.228
VICE+1.222
 oneself+1.163
Neg logit contributions
 behavi-1.265
Mex-1.218
rael-1.212
iffs-1.209
 NEWS-1.145
Token Nightmare
Token ID23951
Feature activation-3.09e-03

Pos logit contributions
utical+0.031
pi+0.028
ener+0.028
VICE+0.028
 oneself+0.026
Neg logit contributions
 behavi-0.029
Mex-0.028
rael-0.028
iffs-0.028
 NEWS-0.026
Token on
Token ID319
Feature activation-8.51e-02

Pos logit contributions
 behavi+0.002
rael+0.002
Mex+0.002
iffs+0.002
 NEWS+0.002
Neg logit contributions
utical-0.002
ener-0.002
pi-0.002
VICE-0.002
 oneself-0.002
Token Elm
Token ID35189
Feature activation-1.29e-01

Pos logit contributions
 behavi+0.053
Mex+0.051
iffs+0.051
rael+0.050
 NEWS+0.048
Neg logit contributions
utical-0.070
pi-0.064
ener-0.064
VICE-0.064
 oneself-0.061
Token Street
Token ID3530
Feature activation+2.44e-01

Pos logit contributions
 behavi+0.074
Mex+0.070
rael+0.068
iffs+0.068
 NEWS+0.065
Neg logit contributions
utical-0.118
ener-0.108
pi-0.104
VICE-0.103
 oneself-0.102
Tokeniff
Token ID733
Feature activation+1.22e-01

Pos logit contributions
 behavi+0.201
Mex+0.172
rael+0.170
 NEWS+0.160
iffs+0.157
Neg logit contributions
utical-0.494
ener-0.463
pi-0.463
VICE-0.461
nir-0.440
TokenTra
Token ID15721
Feature activation+7.70e-02

Pos logit contributions
utical+0.076
VICE+0.068
pi+0.067
ener+0.067
 oneself+0.062
Neg logit contributions
 behavi-0.102
rael-0.097
iffs-0.097
Mex-0.097
 NEWS-0.093
Tokenx
Token ID87
Feature activation+4.81e-01

Pos logit contributions
utical+0.048
pi+0.042
VICE+0.042
ener+0.042
 oneself+0.039
Neg logit contributions
 behavi-0.064
Mex-0.062
rael-0.062
iffs-0.062
 NEWS-0.059
Token
Token ID198
Feature activation+2.11e-01

Pos logit contributions
utical+0.395
pi+0.358
VICE+0.358
ener+0.357
 oneself+0.340
Neg logit contributions
 behavi-0.300
Mex-0.292
rael-0.290
iffs-0.290
 NEWS-0.274
Token
Token ID198
Feature activation+1.63e-01

Pos logit contributions
utical+0.163
pi+0.150
ener+0.150
VICE+0.150
 oneself+0.141
Neg logit contributions
 behavi-0.149
rael-0.136
iffs-0.135
Mex-0.135
 NEWS-0.128
TokenReviewer
Token ID35407
Feature activation+1.75e+00

Pos logit contributions
utical+0.156
pi+0.144
VICE+0.143
ener+0.143
 oneself+0.137
Neg logit contributions
 behavi-0.081
Mex-0.076
rael-0.076
iffs-0.076
 NEWS-0.071
Token:
Token ID25
Feature activation+2.48e-01

Pos logit contributions
utical+1.033
pi+0.930
ener+0.923
VICE+0.917
 oneself+0.851
Neg logit contributions
 behavi-1.575
rael-1.439
Mex-1.434
iffs-1.427
 NEWS-1.375
Token V
Token ID569
Feature activation-9.16e-02

Pos logit contributions
utical+0.193
pi+0.174
ener+0.174
VICE+0.174
 oneself+0.165
Neg logit contributions
 behavi-0.165
Mex-0.160
rael-0.159
iffs-0.159
 NEWS-0.151
Tokenthe
Token ID1169
Feature activation+7.41e-02

Pos logit contributions
 behavi+0.067
Mex+0.065
rael+0.065
iffs+0.065
 NEWS+0.062
Neg logit contributions
utical-0.066
pi-0.059
ener-0.059
VICE-0.059
 oneself-0.055
TokenHappy
Token ID25082
Feature activation+2.73e-02

Pos logit contributions
utical+0.054
pi+0.048
VICE+0.048
ener+0.048
 oneself+0.046
Neg logit contributions
 behavi-0.053
Mex-0.052
rael-0.052
iffs-0.051
 NEWS-0.049
TokenL
Token ID43
Feature activation+2.66e-01

Pos logit contributions
utical+0.020
VICE+0.017
pi+0.017
ener+0.017
 oneself+0.017
Neg logit contributions
 behavi-0.020
Mex-0.019
rael-0.019
iffs-0.019
 NEWS-0.018
Token among
Token ID1871
Feature activation+3.10e-01

Pos logit contributions
 behavi+0.089
Mex+0.087
iffs+0.086
rael+0.086
 NEWS+0.081
Neg logit contributions
utical-0.130
VICE-0.118
ener-0.118
pi-0.118
nir-0.112
Token all
Token ID477
Feature activation+1.62e-01

Pos logit contributions
utical+0.268
pi+0.245
VICE+0.244
ener+0.244
 oneself+0.233
Neg logit contributions
 behavi-0.180
Mex-0.174
rael-0.173
iffs-0.173
 NEWS-0.162
Token edge
Token ID5743
Feature activation-1.06e+00

Pos logit contributions
utical+0.137
pi+0.125
ener+0.124
VICE+0.124
 oneself+0.119
Neg logit contributions
 behavi-0.098
Mex-0.094
rael-0.094
iffs-0.093
 NEWS-0.088
Token r
Token ID374
Feature activation-4.82e-01

Pos logit contributions
 behavi+0.556
Mex+0.538
iffs+0.538
rael+0.534
 NEWS+0.500
Neg logit contributions
utical-0.977
VICE-0.895
pi-0.889
ener-0.888
 oneself-0.854
Tokenus
Token ID385
Feature activation-2.08e-01

Pos logit contributions
 behavi+0.371
Mex+0.360
rael+0.359
iffs+0.355
 NEWS+0.343
Neg logit contributions
utical-0.328
pi-0.294
ener-0.292
VICE-0.289
 oneself-0.272
Tokenhers
Token ID7084
Feature activation+1.73e+00

Pos logit contributions
 behavi+0.200
Mex+0.195
rael+0.194
iffs+0.193
 NEWS+0.188
Neg logit contributions
utical-0.100
ener-0.086
pi-0.086
VICE-0.085
 oneself-0.078
Token*)
Token ID28104
Feature activation-4.95e-02

Pos logit contributions
utical+1.389
ener+1.263
pi+1.261
VICE+1.250
 oneself+1.197
Neg logit contributions
 behavi-1.119
Mex-1.079
rael-1.071
iffs-1.063
 NEWS-1.010
Token
Token ID198
Feature activation-1.75e-01

Pos logit contributions
 behavi+0.030
Mex+0.029
iffs+0.029
rael+0.029
 NEWS+0.027
Neg logit contributions
utical-0.041
VICE-0.038
pi-0.038
ener-0.037
 oneself-0.036
Token
Token ID198
Feature activation-1.99e-01

Pos logit contributions
 behavi+0.121
rael+0.111
iffs+0.111
Mex+0.111
 NEWS+0.104
Neg logit contributions
utical-0.138
pi-0.127
ener-0.126
VICE-0.126
 oneself-0.119
Token2017
Token ID5539
Feature activation-9.03e-02

Pos logit contributions
 behavi+0.144
Mex+0.143
iffs+0.141
rael+0.141
 NEWS+0.134
Neg logit contributions
utical-0.141
pi-0.126
ener-0.125
VICE-0.125
 oneself-0.120
Token salary
Token ID9588
Feature activation+2.49e-01

Pos logit contributions
 behavi+0.074
Mex+0.072
iffs+0.072
rael+0.072
 NEWS+0.069
Neg logit contributions
utical-0.056
VICE-0.049
pi-0.049
ener-0.049
 oneself-0.046
Token among
Token ID1871
Feature activation+2.89e-01

Pos logit contributions
 behavi+0.279
Mex+0.272
iffs+0.269
rael+0.269
 NEWS+0.255
Neg logit contributions
utical-0.408
VICE-0.371
pi-0.369
ener-0.369
nir-0.351
Token all
Token ID477
Feature activation+2.55e-01

Pos logit contributions
utical+0.211
pi+0.190
ener+0.189
VICE+0.189
 oneself+0.179
Neg logit contributions
 behavi-0.206
Mex-0.201
rael-0.199
iffs-0.199
 NEWS-0.190
Token edge
Token ID5743
Feature activation-1.02e+00

Pos logit contributions
utical+0.168
pi+0.150
ener+0.149
VICE+0.149
 oneself+0.140
Neg logit contributions
 behavi-0.200
Mex-0.195
rael-0.194
iffs-0.194
 NEWS-0.186
Token r
Token ID374
Feature activation-3.96e-01

Pos logit contributions
 behavi+0.505
iffs+0.488
rael+0.487
Mex+0.486
 NEWS+0.448
Neg logit contributions
utical-0.965
VICE-0.891
pi-0.882
ener-0.880
 oneself-0.845
Tokenus
Token ID385
Feature activation-1.62e-01

Pos logit contributions
 behavi+0.297
Mex+0.290
rael+0.289
iffs+0.289
 NEWS+0.275
Neg logit contributions
utical-0.274
pi-0.244
VICE-0.244
ener-0.243
 oneself-0.229
Tokenhers
Token ID7084
Feature activation+1.71e+00

Pos logit contributions
 behavi+0.167
Mex+0.158
rael+0.156
iffs+0.155
 NEWS+0.152
Neg logit contributions
utical-0.074
ener-0.064
pi-0.063
VICE-0.060
nir-0.056
Token*)
Token ID28104
Feature activation-1.23e-01

Pos logit contributions
utical+1.465
pi+1.336
ener+1.333
VICE+1.332
 oneself+1.272
Neg logit contributions
 behavi-1.004
Mex-0.972
rael-0.965
iffs-0.964
 NEWS-0.907
Token
Token ID198
Feature activation-1.12e-01

Pos logit contributions
 behavi+0.074
Mex+0.072
iffs+0.071
rael+0.071
 NEWS+0.067
Neg logit contributions
utical-0.104
VICE-0.095
pi-0.094
ener-0.094
 oneself-0.090
Token
Token ID198
Feature activation-1.20e-01

Pos logit contributions
 behavi+0.077
rael+0.071
iffs+0.071
Mex+0.071
 NEWS+0.067
Neg logit contributions
utical-0.088
pi-0.081
ener-0.081
VICE-0.081
 oneself-0.076
TokenEven
Token ID6104
Feature activation+3.75e-01

Pos logit contributions
 behavi+0.085
Mex+0.084
iffs+0.083
rael+0.083
 NEWS+0.079
Neg logit contributions
utical-0.088
pi-0.079
ener-0.079
VICE-0.078
 oneself-0.075
Token at
Token ID379
Feature activation+4.44e-02

Pos logit contributions
utical+0.275
ener+0.247
pi+0.247
VICE+0.246
 oneself+0.233
Neg logit contributions
 behavi-0.267
Mex-0.260
rael-0.259
iffs-0.258
 NEWS-0.245
Token among
Token ID1871
Feature activation+2.24e-01

Pos logit contributions
 behavi+0.146
Mex+0.142
rael+0.141
iffs+0.141
 NEWS+0.133
Neg logit contributions
utical-0.199
VICE-0.181
pi-0.180
ener-0.180
 oneself-0.171
Token all
Token ID477
Feature activation+1.08e-01

Pos logit contributions
utical+0.171
pi+0.155
ener+0.154
VICE+0.154
 oneself+0.146
Neg logit contributions
 behavi-0.153
Mex-0.149
rael-0.148
iffs-0.147
 NEWS-0.140
Token edge
Token ID5743
Feature activation-1.10e+00

Pos logit contributions
utical+0.076
pi+0.069
ener+0.068
VICE+0.068
 oneself+0.064
Neg logit contributions
 behavi-0.080
Mex-0.077
rael-0.077
iffs-0.076
 NEWS-0.073
Token r
Token ID374
Feature activation-4.81e-01

Pos logit contributions
 behavi+0.566
Mex+0.546
iffs+0.545
rael+0.544
 NEWS+0.505
Neg logit contributions
utical-1.029
VICE-0.945
pi-0.941
ener-0.938
 oneself-0.902
Tokenus
Token ID385
Feature activation-1.90e-01

Pos logit contributions
 behavi+0.363
Mex+0.353
rael+0.352
iffs+0.351
 NEWS+0.335
Neg logit contributions
utical-0.332
pi-0.296
VICE-0.295
ener-0.295
 oneself-0.278
Tokenhers
Token ID7084
Feature activation+1.70e+00

Pos logit contributions
 behavi+0.194
Mex+0.184
rael+0.182
iffs+0.180
 NEWS+0.178
Neg logit contributions
utical-0.087
ener-0.075
pi-0.074
VICE-0.071
 oneself-0.065
Token*)
Token ID28104
Feature activation-1.13e-01

Pos logit contributions
utical+1.399
pi+1.274
ener+1.271
VICE+1.264
 oneself+1.211
Neg logit contributions
 behavi-1.053
Mex-1.020
rael-1.012
iffs-1.008
 NEWS-0.953
Token
Token ID198
Feature activation-1.92e-01

Pos logit contributions
 behavi+0.068
Mex+0.066
iffs+0.066
rael+0.065
 NEWS+0.062
Neg logit contributions
utical-0.095
VICE-0.086
pi-0.086
ener-0.086
 oneself-0.082
Token
Token ID198
Feature activation-1.91e-01

Pos logit contributions
 behavi+0.131
rael+0.121
iffs+0.121
Mex+0.121
 NEWS+0.114
Neg logit contributions
utical-0.151
pi-0.139
ener-0.138
VICE-0.138
 oneself-0.130
TokenTotal
Token ID14957
Feature activation-4.57e-02

Pos logit contributions
Mex+0.138
 behavi+0.136
iffs+0.135
rael+0.135
 NEWS+0.127
Neg logit contributions
utical-0.136
pi-0.121
ener-0.120
VICE-0.119
 oneself-0.117
Token guaranteed
Token ID11462
Feature activation-7.70e-01

Pos logit contributions
 behavi+0.040
Mex+0.037
iffs+0.037
rael+0.037
 NEWS+0.036
Neg logit contributions
utical-0.028
pi-0.024
VICE-0.024
ener-0.024
 oneself-0.023
Token whole
Token ID2187
Feature activation+5.88e-02

Pos logit contributions
utical+0.140
pi+0.126
ener+0.126
VICE+0.126
 oneself+0.120
Neg logit contributions
 behavi-0.127
Mex-0.123
rael-0.122
iffs-0.122
 NEWS-0.116
Token being
Token ID852
Feature activation+6.75e-02

Pos logit contributions
utical+0.043
pi+0.039
VICE+0.039
ener+0.039
 oneself+0.037
Neg logit contributions
 behavi-0.042
Mex-0.041
rael-0.040
iffs-0.040
 NEWS-0.039
Token.
Token ID13
Feature activation+6.29e-02

Pos logit contributions
utical+0.048
pi+0.043
VICE+0.043
ener+0.043
 oneself+0.040
Neg logit contributions
 behavi-0.049
Mex-0.048
rael-0.048
iffs-0.048
 NEWS-0.046
Token A
Token ID317
Feature activation-2.80e-02

Pos logit contributions
utical+0.045
pi+0.040
VICE+0.040
ener+0.040
 oneself+0.038
Neg logit contributions
 behavi-0.046
rael-0.045
Mex-0.045
iffs-0.045
 NEWS-0.042
Token breath
Token ID8033
Feature activation-1.76e-01

Pos logit contributions
 behavi+0.018
Mex+0.017
iffs+0.017
rael+0.017
 NEWS+0.016
Neg logit contributions
utical-0.023
pi-0.021
ener-0.021
VICE-0.021
 oneself-0.020
Tokenlessly
Token ID8613
Feature activation+1.61e+00

Pos logit contributions
 behavi+0.126
Mex+0.123
rael+0.122
iffs+0.122
 NEWS+0.116
Neg logit contributions
utical-0.128
pi-0.115
ener-0.114
VICE-0.114
 oneself-0.108
Token inventive
Token ID47602
Feature activation-1.19e-01

Pos logit contributions
utical+1.376
ener+1.271
pi+1.268
VICE+1.262
 oneself+1.214
Neg logit contributions
 behavi-0.949
rael-0.905
Mex-0.899
iffs-0.895
 NEWS-0.851
Token sense
Token ID2565
Feature activation+5.53e-01

Pos logit contributions
 behavi+0.078
Mex+0.075
rael+0.075
iffs+0.075
 NEWS+0.071
Neg logit contributions
utical-0.093
ener-0.085
pi-0.085
VICE-0.085
 oneself-0.081
Token of
Token ID286
Feature activation-3.23e-02

Pos logit contributions
utical+0.410
VICE+0.369
pi+0.367
ener+0.365
 oneself+0.346
Neg logit contributions
 behavi-0.391
Mex-0.380
iffs-0.378
rael-0.377
 NEWS-0.358
Token post
Token ID1281
Feature activation+5.37e-01

Pos logit contributions
 behavi+0.019
Mex+0.019
rael+0.019
iffs+0.018
 NEWS+0.017
Neg logit contributions
utical-0.027
pi-0.025
ener-0.025
VICE-0.025
 oneself-0.024
Tokenmodern
Token ID23922
Feature activation-4.42e-01

Pos logit contributions
utical+0.338
pi+0.298
ener+0.298
VICE+0.297
 oneself+0.278
Neg logit contributions
 behavi-0.439
Mex-0.428
rael-0.426
iffs-0.426
 NEWS-0.408
Token hearing
Token ID4854
Feature activation+1.92e-02

Pos logit contributions
 behavi+0.038
Mex+0.036
rael+0.035
iffs+0.035
 NEWS+0.030
Neg logit contributions
utical-0.127
ener-0.120
VICE-0.119
pi-0.119
 oneself-0.115
Token on
Token ID319
Feature activation-2.40e-01

Pos logit contributions
utical+0.014
pi+0.013
ener+0.013
VICE+0.013
 oneself+0.012
Neg logit contributions
 behavi-0.014
Mex-0.013
rael-0.013
iffs-0.013
 NEWS-0.013
Token Thursday
Token ID3635
Feature activation+2.74e-01

Pos logit contributions
 behavi+0.140
Mex+0.136
rael+0.135
iffs+0.134
 NEWS+0.126
Neg logit contributions
utical-0.207
pi-0.189
VICE-0.188
ener-0.188
 oneself-0.180
Token,
Token ID11
Feature activation-5.45e-02

Pos logit contributions
utical+0.204
pi+0.183
ener+0.183
VICE+0.183
 oneself+0.173
Neg logit contributions
 behavi-0.192
Mex-0.186
rael-0.186
iffs-0.185
 NEWS-0.176
Token Mel
Token ID5616
Feature activation-3.61e-01

Pos logit contributions
 behavi+0.039
Mex+0.038
rael+0.038
iffs+0.038
 NEWS+0.036
Neg logit contributions
utical-0.040
pi-0.036
ener-0.036
VICE-0.036
 oneself-0.034
Tokeninda
Token ID22261
Feature activation-1.70e+00

Pos logit contributions
 behavi+0.213
Mex+0.206
rael+0.203
iffs+0.203
 NEWS+0.191
Neg logit contributions
utical-0.309
pi-0.283
ener-0.283
VICE-0.281
 oneself-0.268
Token Thompson
Token ID11654
Feature activation+1.28e-01

Pos logit contributions
 behavi+1.164
Mex+1.142
iffs+1.135
rael+1.133
 NEWS+1.083
Neg logit contributions
utical-1.282
VICE-1.156
pi-1.147
ener-1.142
 oneself-1.084
Token,
Token ID11
Feature activation-1.40e-02

Pos logit contributions
utical+0.093
ener+0.084
pi+0.084
VICE+0.083
 oneself+0.079
Neg logit contributions
 behavi-0.091
Mex-0.088
rael-0.088
iffs-0.088
 NEWS-0.083
Token the
Token ID262
Feature activation-4.11e-04

Pos logit contributions
 behavi+0.010
Mex+0.010
rael+0.010
iffs+0.010
 NEWS+0.009
Neg logit contributions
utical-0.010
pi-0.009
VICE-0.009
ener-0.009
 oneself-0.009
Token leading
Token ID3756
Feature activation-2.13e-01

Pos logit contributions
 behavi+0.000
Mex+0.000
rael+0.000
iffs+0.000
 NEWS+0.000
Neg logit contributions
utical-0.000
ener-0.000
pi-0.000
VICE-0.000
 oneself-0.000
Token lawyer
Token ID6853
Feature activation-2.97e-01

Pos logit contributions
 behavi+0.146
Mex+0.141
rael+0.141
iffs+0.141
 NEWS+0.133
Neg logit contributions
utical-0.162
pi-0.147
ener-0.147
VICE-0.146
 oneself-0.139
Token Animation
Token ID23535
Feature activation-1.17e-02

Pos logit contributions
 behavi+0.402
Mex+0.387
rael+0.386
iffs+0.384
 NEWS+0.362
Neg logit contributions
utical-0.561
pi-0.515
VICE-0.512
ener-0.511
 oneself-0.488
Token,
Token ID11
Feature activation-1.15e-01

Pos logit contributions
 behavi+0.008
Mex+0.008
rael+0.008
iffs+0.008
 NEWS+0.007
Neg logit contributions
utical-0.009
pi-0.008
VICE-0.008
ener-0.008
 oneself-0.008
Token Disney
Token ID8519
Feature activation-8.59e-02

Pos logit contributions
 behavi+0.089
Mex+0.086
rael+0.086
iffs+0.086
 NEWS+0.082
Neg logit contributions
utical-0.077
pi-0.069
VICE-0.069
ener-0.069
 oneself-0.064
Token,
Token ID11
Feature activation-8.72e-02

Pos logit contributions
 behavi+0.061
Mex+0.059
rael+0.059
iffs+0.058
 NEWS+0.056
Neg logit contributions
utical-0.064
pi-0.057
VICE-0.057
ener-0.057
 oneself-0.054
Token Rel
Token ID4718
Feature activation-4.43e-01

Pos logit contributions
 behavi+0.059
Mex+0.057
rael+0.057
iffs+0.057
 NEWS+0.054
Neg logit contributions
utical-0.067
pi-0.061
VICE-0.061
ener-0.061
 oneself-0.057
Tokenativity
Token ID22055
Feature activation-1.66e+00

Pos logit contributions
 behavi+0.145
Mex+0.137
rael+0.134
iffs+0.134
 NEWS+0.120
Neg logit contributions
utical-0.495
pi-0.462
VICE-0.461
ener-0.461
 oneself-0.445
Token Media
Token ID6343
Feature activation-7.22e-01

Pos logit contributions
 behavi+1.033
rael+0.992
Mex+0.989
iffs+0.986
 NEWS+0.922
Neg logit contributions
utical-1.377
pi-1.258
VICE-1.249
ener-1.248
 oneself-1.192
Token,
Token ID11
Feature activation-5.19e-02

Pos logit contributions
 behavi+0.492
Mex+0.484
iffs+0.481
rael+0.477
 NEWS+0.458
Neg logit contributions
utical-0.546
pi-0.491
ener-0.490
VICE-0.490
 oneself-0.467
Token and
Token ID290
Feature activation+4.55e-02

Pos logit contributions
 behavi+0.034
Mex+0.033
rael+0.033
iffs+0.033
 NEWS+0.031
Neg logit contributions
utical-0.041
pi-0.037
VICE-0.037
ener-0.037
 oneself-0.035
Token plenty
Token ID6088
Feature activation-1.20e-01

Pos logit contributions
utical+0.031
pi+0.028
VICE+0.028
ener+0.028
 oneself+0.026
Neg logit contributions
 behavi-0.034
Mex-0.034
rael-0.033
iffs-0.033
 NEWS-0.032
Token more
Token ID517
Feature activation+1.37e-01

Pos logit contributions
 behavi+0.080
Mex+0.078
rael+0.077
iffs+0.077
 NEWS+0.073
Neg logit contributions
utical-0.094
VICE-0.085
pi-0.085
ener-0.084
 oneself-0.080
Token innings
Token ID16267
Feature activation-1.71e-02

Pos logit contributions
 behavi+0.010
Mex+0.009
rael+0.009
iffs+0.009
 NEWS+0.009
Neg logit contributions
utical-0.016
pi-0.015
VICE-0.015
ener-0.015
 oneself-0.014
Token.
Token ID13
Feature activation-3.44e-01

Pos logit contributions
 behavi+0.013
Mex+0.012
rael+0.012
iffs+0.012
 NEWS+0.012
Neg logit contributions
utical-0.012
pi-0.011
ener-0.011
VICE-0.011
 oneself-0.010
Token An
Token ID1052
Feature activation-8.88e-02

Pos logit contributions
 behavi+0.235
Mex+0.228
rael+0.227
iffs+0.226
 NEWS+0.215
Neg logit contributions
utical-0.262
pi-0.236
VICE-0.236
ener-0.236
 oneself-0.223
Token extreme
Token ID3257
Feature activation-7.28e-01

Pos logit contributions
 behavi+0.059
Mex+0.057
iffs+0.057
rael+0.057
 NEWS+0.054
Neg logit contributions
utical-0.069
pi-0.063
VICE-0.063
ener-0.063
 oneself-0.059
Token fly
Token ID6129
Feature activation-2.01e-01

Pos logit contributions
 behavi+0.473
Mex+0.460
iffs+0.457
rael+0.456
 NEWS+0.433
Neg logit contributions
utical-0.579
VICE-0.523
pi-0.522
ener-0.521
 oneself-0.495
Tokenball
Token ID1894
Feature activation-1.66e+00

Pos logit contributions
 behavi+0.108
Mex+0.105
iffs+0.104
rael+0.104
 NEWS+0.097
Neg logit contributions
utical-0.184
VICE-0.167
pi-0.166
ener-0.165
 oneself-0.159
Token pitcher
Token ID18086
Feature activation-2.47e-01

Pos logit contributions
 behavi+1.157
Mex+1.129
iffs+1.121
rael+1.120
 NEWS+1.066
Neg logit contributions
utical-1.246
VICE-1.115
pi-1.108
ener-1.102
 oneself-1.050
Token,
Token ID11
Feature activation-1.83e-01

Pos logit contributions
 behavi+0.178
Mex+0.173
rael+0.172
iffs+0.172
 NEWS+0.164
Neg logit contributions
utical-0.179
pi-0.161
ener-0.161
VICE-0.161
 oneself-0.152
Token he
Token ID339
Feature activation-1.91e-01

Pos logit contributions
 behavi+0.118
Mex+0.114
rael+0.113
iffs+0.113
 NEWS+0.107
Neg logit contributions
utical-0.147
pi-0.134
VICE-0.133
ener-0.133
 oneself-0.127
Token
Token ID447
Feature activation-1.01e-01

Pos logit contributions
 behavi+0.131
Mex+0.126
rael+0.126
iffs+0.125
 NEWS+0.119
Neg logit contributions
utical-0.146
pi-0.132
ener-0.132
VICE-0.131
 oneself-0.125
Token
Token ID247
Feature activation+1.00e-01

Pos logit contributions
 behavi+0.096
Mex+0.093
iffs+0.093
rael+0.093
 NEWS+0.090
Neg logit contributions
utical-0.051
pi-0.043
VICE-0.043
ener-0.043
 oneself-0.040
Token the
Token ID262
Feature activation-1.16e-01

Pos logit contributions
 behavi+0.150
Mex+0.146
rael+0.145
iffs+0.145
 NEWS+0.138
Neg logit contributions
utical-0.161
pi-0.145
VICE-0.145
ener-0.144
 oneself-0.137
Token
Token ID564
Feature activation-1.23e-01

Pos logit contributions
 behavi+0.074
Mex+0.071
rael+0.071
iffs+0.071
 NEWS+0.067
Neg logit contributions
utical-0.094
ener-0.086
pi-0.086
VICE-0.085
 oneself-0.082
Token
Token ID246
Feature activation-2.45e-01

Pos logit contributions
 behavi+0.109
Mex+0.107
rael+0.106
iffs+0.106
 NEWS+0.102
Neg logit contributions
utical-0.069
pi-0.060
VICE-0.060
ener-0.060
 oneself-0.055
Tokenp
Token ID79
Feature activation-2.18e-01

Pos logit contributions
 behavi+0.177
Mex+0.172
rael+0.171
iffs+0.171
 NEWS+0.163
Neg logit contributions
utical-0.177
pi-0.159
ener-0.159
VICE-0.158
 oneself-0.149
Tokeniv
Token ID452
Feature activation-5.83e-01

Pos logit contributions
 behavi+0.167
Mex+0.162
iffs+0.161
rael+0.160
 NEWS+0.154
Neg logit contributions
utical-0.149
ener-0.133
pi-0.133
VICE-0.132
 oneself-0.125
Tokenotal
Token ID4997
Feature activation-1.65e+00

Pos logit contributions
 behavi+0.439
Mex+0.429
iffs+0.429
rael+0.428
 NEWS+0.408
Neg logit contributions
utical-0.402
VICE-0.357
pi-0.356
ener-0.355
 oneself-0.336
Token
Token ID447
Feature activation+5.59e-02

Pos logit contributions
 behavi+1.081
Mex+1.056
iffs+1.050
rael+1.046
 NEWS+0.993
Neg logit contributions
utical-1.310
pi-1.178
VICE-1.177
ener-1.169
 oneself-1.114
Token
Token ID247
Feature activation-1.67e-01

Pos logit contributions
utical+0.033
pi+0.029
VICE+0.029
ener+0.029
 oneself+0.027
Neg logit contributions
 behavi-0.047
rael-0.046
Mex-0.046
iffs-0.046
 NEWS-0.044
Token Supreme
Token ID5617
Feature activation-3.55e-01

Pos logit contributions
 behavi+0.110
Mex+0.107
rael+0.107
iffs+0.106
 NEWS+0.101
Neg logit contributions
utical-0.131
pi-0.118
VICE-0.118
ener-0.118
 oneself-0.112
Token Court
Token ID3078
Feature activation+6.98e-01

Pos logit contributions
 behavi+0.207
Mex+0.200
rael+0.199
iffs+0.199
 NEWS+0.187
Neg logit contributions
utical-0.306
pi-0.279
VICE-0.279
ener-0.278
 oneself-0.266
Token justices
Token ID27818
Feature activation+2.75e-01

Pos logit contributions
utical+0.601
pi+0.552
ener+0.550
VICE+0.547
 oneself+0.524
Neg logit contributions
 behavi-0.407
rael-0.389
Mex-0.389
iffs-0.384
 NEWS-0.363
Token this
Token ID428
Feature activation+1.91e-01

Pos logit contributions
utical+0.142
pi+0.129
ener+0.129
VICE+0.128
 oneself+0.124
Neg logit contributions
 behavi-0.150
Mex-0.147
rael-0.146
iffs-0.145
 NEWS-0.138
Token in
Token ID287
Feature activation+2.87e-01

Pos logit contributions
utical+0.136
pi+0.123
ener+0.123
VICE+0.123
 oneself+0.116
Neg logit contributions
 behavi-0.139
Mex-0.135
iffs-0.135
rael-0.135
 NEWS-0.128
Token mind
Token ID2000
Feature activation-1.16e-01

Pos logit contributions
utical+0.222
pi+0.202
ener+0.201
VICE+0.201
 oneself+0.193
Neg logit contributions
 behavi-0.192
Mex-0.187
rael-0.186
iffs-0.186
 NEWS-0.175
Token.
Token ID13
Feature activation+2.05e-01

Pos logit contributions
 behavi+0.084
Mex+0.082
rael+0.082
iffs+0.081
 NEWS+0.077
Neg logit contributions
utical-0.083
pi-0.074
ener-0.074
VICE-0.074
 oneself-0.070
Token
Token ID220
Feature activation-2.33e-01

Pos logit contributions
utical+0.144
pi+0.130
VICE+0.130
ener+0.129
 oneself+0.122
Neg logit contributions
 behavi-0.153
rael-0.148
Mex-0.147
iffs-0.147
 NEWS-0.138
Token________________________________
Token ID10221
Feature activation-1.62e+00

Pos logit contributions
 behavi+0.164
Mex+0.160
rael+0.159
iffs+0.159
 NEWS+0.151
Neg logit contributions
utical-0.172
pi-0.155
VICE-0.154
ener-0.154
 oneself-0.146
Token______
Token ID25947
Feature activation+7.92e-02

Pos logit contributions
 behavi+0.924
Mex+0.898
rael+0.890
iffs+0.889
 NEWS+0.838
Neg logit contributions
utical-1.420
pi-1.294
VICE-1.292
ener-1.291
 oneself-1.236
Token Name
Token ID6530
Feature activation+1.72e-01

Pos logit contributions
utical+0.057
VICE+0.051
pi+0.051
ener+0.051
 oneself+0.048
Neg logit contributions
 behavi-0.057
rael-0.056
Mex-0.056
iffs-0.055
 NEWS-0.052
Token:
Token ID25
Feature activation+2.03e-01

Pos logit contributions
utical+0.116
pi+0.103
VICE+0.103
ener+0.103
 oneself+0.097
Neg logit contributions
 behavi-0.132
Mex-0.129
rael-0.128
iffs-0.128
 NEWS-0.122
Token Reid
Token ID16095
Feature activation+1.52e-01

Pos logit contributions
utical+0.146
pi+0.131
VICE+0.131
ener+0.130
 oneself+0.123
Neg logit contributions
 behavi-0.147
Mex-0.143
rael-0.142
iffs-0.142
 NEWS-0.136
Token H
Token ID367
Feature activation+1.05e-02

Pos logit contributions
utical+0.111
pi+0.099
VICE+0.099
ener+0.099
 oneself+0.093
Neg logit contributions
 behavi-0.109
Mex-0.107
rael-0.106
iffs-0.106
 NEWS-0.101
Token close
Token ID1969
Feature activation+1.44e-01

Pos logit contributions
utical+0.014
ener+0.013
pi+0.013
VICE+0.013
 oneself+0.013
Neg logit contributions
 behavi-0.007
Mex-0.007
rael-0.007
iffs-0.007
 NEWS-0.006
Token,
Token ID11
Feature activation-1.77e-02

Pos logit contributions
utical+0.108
pi+0.097
VICE+0.097
ener+0.097
 oneself+0.091
Neg logit contributions
 behavi-0.100
Mex-0.097
rael-0.097
iffs-0.097
 NEWS-0.092
Token defense
Token ID3761
Feature activation+4.86e-01

Pos logit contributions
 behavi+0.012
Mex+0.012
rael+0.012
iffs+0.012
 NEWS+0.011
Neg logit contributions
utical-0.013
pi-0.012
ener-0.012
VICE-0.012
 oneself-0.011
Token lawyer
Token ID6853
Feature activation-3.04e-01

Pos logit contributions
utical+0.379
ener+0.344
pi+0.344
VICE+0.343
 oneself+0.327
Neg logit contributions
 behavi-0.321
Mex-0.311
rael-0.310
iffs-0.308
 NEWS-0.291
Token Mel
Token ID5616
Feature activation-4.44e-01

Pos logit contributions
 behavi+0.219
Mex+0.213
rael+0.212
iffs+0.212
 NEWS+0.202
Neg logit contributions
utical-0.220
pi-0.197
VICE-0.197
ener-0.197
 oneself-0.186
Tokeninda
Token ID22261
Feature activation-1.61e+00

Pos logit contributions
 behavi+0.140
Mex+0.133
rael+0.131
iffs+0.131
 NEWS+0.116
Neg logit contributions
utical-0.501
VICE-0.467
pi-0.467
ener-0.466
 oneself-0.451
Token Thompson
Token ID11654
Feature activation-2.36e-02

Pos logit contributions
 behavi+1.254
Mex+1.233
iffs+1.228
rael+1.225
 NEWS+1.184
Neg logit contributions
utical-1.057
VICE-0.935
pi-0.930
ener-0.923
 oneself-0.871
Token spoke
Token ID5158
Feature activation-5.48e-02

Pos logit contributions
 behavi+0.015
Mex+0.015
iffs+0.015
rael+0.015
 NEWS+0.014
Neg logit contributions
utical-0.019
ener-0.017
pi-0.017
VICE-0.017
 oneself-0.016
Token with
Token ID351
Feature activation+7.91e-03

Pos logit contributions
 behavi+0.040
Mex+0.039
rael+0.039
iffs+0.039
 NEWS+0.037
Neg logit contributions
utical-0.039
pi-0.035
VICE-0.035
ener-0.035
 oneself-0.033
Token Irish
Token ID8685
Feature activation-3.10e-01

Pos logit contributions
utical+0.006
pi+0.005
VICE+0.005
ener+0.005
 oneself+0.005
Neg logit contributions
 behavi-0.005
Mex-0.005
rael-0.005
iffs-0.005
 NEWS-0.005
TokenCentral
Token ID30645
Feature activation+8.36e-02

Pos logit contributions
 behavi+0.199
iffs+0.192
rael+0.192
Mex+0.191
 NEWS+0.177
Neg logit contributions
utical-0.248
pi-0.228
ener-0.228
VICE-0.227
 oneself-0.216
Token was
Token ID373
Feature activation-6.09e-02

Pos logit contributions
utical+0.290
pi+0.262
ener+0.262
VICE+0.261
 oneself+0.248
Neg logit contributions
 behavi-0.268
Mex-0.260
rael-0.259
iffs-0.258
 NEWS-0.245
Token fresh
Token ID4713
Feature activation-2.44e-02

Pos logit contributions
 behavi+0.040
Mex+0.039
iffs+0.038
rael+0.038
 NEWS+0.036
Neg logit contributions
utical-0.048
pi-0.044
ener-0.044
VICE-0.044
 oneself-0.041
Token off
Token ID572
Feature activation-6.36e-02

Pos logit contributions
 behavi+0.018
Mex+0.018
rael+0.018
iffs+0.018
 NEWS+0.017
Neg logit contributions
utical-0.017
pi-0.015
ener-0.015
VICE-0.015
 oneself-0.014
Token of
Token ID286
Feature activation-5.74e-02

Pos logit contributions
 behavi+0.043
Mex+0.042
rael+0.042
iffs+0.042
 NEWS+0.040
Neg logit contributions
utical-0.048
pi-0.044
VICE-0.044
ener-0.044
 oneself-0.041
Token Requ
Token ID9394
Feature activation+2.76e-02

Pos logit contributions
 behavi+0.039
Mex+0.038
rael+0.038
iffs+0.038
 NEWS+0.036
Neg logit contributions
utical-0.044
pi-0.039
ener-0.039
VICE-0.039
 oneself-0.037
Tokeniem
Token ID26597
Feature activation-1.60e+00

Pos logit contributions
utical+0.024
VICE+0.022
pi+0.022
ener+0.022
 oneself+0.021
Neg logit contributions
 behavi-0.016
Mex-0.015
rael-0.015
iffs-0.015
 NEWS-0.014
Token for
Token ID329
Feature activation-6.26e-02

Pos logit contributions
 behavi+1.169
Mex+1.148
iffs+1.138
rael+1.138
 NEWS+1.087
Neg logit contributions
utical-1.131
VICE-1.016
pi-1.012
ener-1.007
 oneself-0.952
Token a
Token ID257
Feature activation-2.11e-02

Pos logit contributions
 behavi+0.047
Mex+0.045
iffs+0.045
rael+0.045
 NEWS+0.043
Neg logit contributions
utical-0.044
ener-0.039
pi-0.039
VICE-0.039
 oneself-0.037
Token Dream
Token ID7610
Feature activation-1.86e-01

Pos logit contributions
 behavi+0.017
Mex+0.017
rael+0.017
iffs+0.016
 NEWS+0.016
Neg logit contributions
utical-0.014
pi-0.012
VICE-0.012
ener-0.012
 oneself-0.011
Token
Token ID1399
Feature activation-3.05e-02

Pos logit contributions
 behavi+0.129
Mex+0.126
rael+0.125
iffs+0.125
 NEWS+0.119
Neg logit contributions
utical-0.140
pi-0.126
VICE-0.126
ener-0.125
 oneself-0.119
Token a
Token ID257
Feature activation-5.56e-02

Pos logit contributions
 behavi+0.021
Mex+0.020
rael+0.020
iffs+0.020
 NEWS+0.019
Neg logit contributions
utical-0.023
ener-0.021
pi-0.021
VICE-0.021
 oneself-0.020
Token engineers
Token ID12037
Feature activation-4.42e-01

Pos logit contributions
 behavi+0.017
Mex+0.017
rael+0.017
iffs+0.017
 NEWS+0.016
Neg logit contributions
utical-0.023
pi-0.021
ener-0.021
VICE-0.021
 oneself-0.020
Token to
Token ID284
Feature activation+2.00e-02

Pos logit contributions
 behavi+0.340
Mex+0.332
rael+0.331
iffs+0.331
 NEWS+0.316
Neg logit contributions
utical-0.297
VICE-0.264
pi-0.264
ener-0.263
 oneself-0.247
Token construct
Token ID5678
Feature activation-7.20e-01

Pos logit contributions
utical+0.015
pi+0.013
VICE+0.013
ener+0.013
 oneself+0.012
Neg logit contributions
 behavi-0.014
Mex-0.014
rael-0.014
iffs-0.014
 NEWS-0.013
Token a
Token ID257
Feature activation-8.83e-02

Pos logit contributions
 behavi+0.491
Mex+0.477
rael+0.474
iffs+0.474
 NEWS+0.450
Neg logit contributions
utical-0.550
pi-0.495
VICE-0.494
ener-0.493
 oneself-0.468
Token pont
Token ID45443
Feature activation-1.49e-01

Pos logit contributions
 behavi+0.053
Mex+0.052
iffs+0.051
rael+0.051
 NEWS+0.048
Neg logit contributions
utical-0.074
pi-0.068
ener-0.067
VICE-0.067
 oneself-0.064
Tokenoon
Token ID2049
Feature activation-1.55e+00

Pos logit contributions
 behavi+0.091
Mex+0.087
rael+0.087
iffs+0.086
 NEWS+0.082
Neg logit contributions
utical-0.125
ener-0.114
pi-0.114
VICE-0.113
 oneself-0.108
Token bridge
Token ID7696
Feature activation+2.60e-01

Pos logit contributions
 behavi+0.925
Mex+0.898
iffs+0.891
rael+0.891
 NEWS+0.839
Neg logit contributions
utical-1.318
VICE-1.198
pi-1.198
ener-1.193
 oneself-1.139
Token across
Token ID1973
Feature activation+8.70e-02

Pos logit contributions
utical+0.186
pi+0.166
ener+0.166
VICE+0.166
 oneself+0.157
Neg logit contributions
 behavi-0.190
Mex-0.184
rael-0.183
iffs-0.183
 NEWS-0.175
Token the
Token ID262
Feature activation+1.06e-01

Pos logit contributions
utical+0.069
VICE+0.062
pi+0.062
ener+0.062
 oneself+0.058
Neg logit contributions
 behavi-0.057
Mex-0.056
rael-0.055
iffs-0.055
 NEWS-0.052
Token narrow
Token ID7135
Feature activation+1.67e-01

Pos logit contributions
utical+0.090
pi+0.082
ener+0.082
VICE+0.082
 oneself+0.078
Neg logit contributions
 behavi-0.063
Mex-0.061
iffs-0.061
rael-0.061
 NEWS-0.057
Token passage
Token ID10066
Feature activation-7.26e-01

Pos logit contributions
utical+0.140
pi+0.128
ener+0.128
VICE+0.128
 oneself+0.122
Neg logit contributions
 behavi-0.100
Mex-0.097
rael-0.096
iffs-0.096
 NEWS-0.091
Token to
Token ID284
Feature activation-3.43e-02

Pos logit contributions
utical+0.033
VICE+0.029
pi+0.029
ener+0.029
 oneself+0.027
Neg logit contributions
 behavi-0.040
Mex-0.039
iffs-0.039
rael-0.039
 NEWS-0.037
Token have
Token ID423
Feature activation+2.11e-02

Pos logit contributions
 behavi+0.023
Mex+0.023
rael+0.022
iffs+0.022
 NEWS+0.021
Neg logit contributions
utical-0.026
pi-0.024
ener-0.024
VICE-0.024
 oneself-0.023
Token a
Token ID257
Feature activation-3.37e-02

Pos logit contributions
utical+0.016
pi+0.014
VICE+0.014
ener+0.014
 oneself+0.013
Neg logit contributions
 behavi-0.015
Mex-0.014
rael-0.014
iffs-0.014
 NEWS-0.013
Token court
Token ID2184
Feature activation+4.14e-01

Pos logit contributions
 behavi+0.023
Mex+0.022
rael+0.022
iffs+0.022
 NEWS+0.020
Neg logit contributions
utical-0.026
pi-0.024
ener-0.024
VICE-0.024
 oneself-0.023
Token-
Token ID12
Feature activation+8.81e-02

Pos logit contributions
utical+0.335
ener+0.305
pi+0.304
VICE+0.304
 oneself+0.290
Neg logit contributions
 behavi-0.265
Mex-0.256
rael-0.254
iffs-0.251
 NEWS-0.240
Tokenappointed
Token ID32924
Feature activation-1.52e+00

Pos logit contributions
utical+0.061
ener+0.054
pi+0.054
VICE+0.054
 oneself+0.051
Neg logit contributions
 behavi-0.067
Mex-0.065
rael-0.064
iffs-0.064
 NEWS-0.062
Token sheriff
Token ID16570
Feature activation-3.80e-02

Pos logit contributions
 behavi+1.048
Mex+1.020
rael+1.014
iffs+1.012
 NEWS+0.962
Neg logit contributions
utical-1.142
pi-1.029
VICE-1.027
ener-1.026
 oneself-0.972
Token handle
Token ID5412
Feature activation+6.05e-01

Pos logit contributions
 behavi+0.026
Mex+0.025
rael+0.025
iffs+0.025
 NEWS+0.024
Neg logit contributions
utical-0.029
pi-0.026
ener-0.026
VICE-0.026
 oneself-0.025
Token the
Token ID262
Feature activation+1.24e-01

Pos logit contributions
utical+0.455
pi+0.413
VICE+0.410
ener+0.409
 oneself+0.390
Neg logit contributions
 behavi-0.418
Mex-0.408
rael-0.405
iffs-0.401
 NEWS-0.383
Token advertising
Token ID8560
Feature activation+9.68e-02

Pos logit contributions
utical+0.104
pi+0.095
ener+0.095
VICE+0.095
 oneself+0.091
Neg logit contributions
 behavi-0.075
Mex-0.073
rael-0.072
iffs-0.072
 NEWS-0.068
Token and
Token ID290
Feature activation-4.03e-02

Pos logit contributions
utical+0.073
pi+0.066
ener+0.065
VICE+0.065
 oneself+0.062
Neg logit contributions
 behavi-0.067
Mex-0.065
rael-0.065
iffs-0.064
 NEWS-0.061
Token in
Token ID287
Feature activation+1.14e-01

Pos logit contributions
utical+0.521
VICE+0.470
pi+0.468
ener+0.468
nir+0.446
Neg logit contributions
 behavi-0.467
Mex-0.454
rael-0.452
iffs-0.449
 NEWS-0.426
Token a
Token ID257
Feature activation+5.10e-02

Pos logit contributions
utical+0.085
VICE+0.077
pi+0.076
ener+0.076
 oneself+0.072
Neg logit contributions
 behavi-0.079
Mex-0.077
rael-0.076
iffs-0.076
 NEWS-0.073
Token 3
Token ID513
Feature activation-2.39e-02

Pos logit contributions
utical+0.038
pi+0.035
VICE+0.034
ener+0.034
 oneself+0.033
Neg logit contributions
 behavi-0.035
Mex-0.034
rael-0.034
iffs-0.034
 NEWS-0.032
Token×
Token ID12906
Feature activation-3.09e-01

Pos logit contributions
 behavi+0.021
Mex+0.021
iffs+0.021
rael+0.021
 NEWS+0.020
Neg logit contributions
utical-0.013
VICE-0.011
pi-0.011
ener-0.011
 oneself-0.010
Token3
Token ID18
Feature activation+1.09e-01

Pos logit contributions
 behavi+0.242
Mex+0.239
rael+0.236
iffs+0.236
 NEWS+0.226
Neg logit contributions
utical-0.205
VICE-0.179
ener-0.178
pi-0.176
 oneself-0.168
Tokenrooms
Token ID9649
Feature activation-1.51e+00

Pos logit contributions
utical+0.096
VICE+0.088
pi+0.088
ener+0.087
 oneself+0.083
Neg logit contributions
 behavi-0.062
Mex-0.060
rael-0.059
iffs-0.059
 NEWS-0.056
Token.
Token ID13
Feature activation-4.89e-02

Pos logit contributions
 behavi+1.077
Mex+1.061
rael+1.056
iffs+1.047
 NEWS+0.994
Neg logit contributions
utical-1.103
VICE-0.984
ener-0.974
pi-0.972
nir-0.924
Token The
Token ID383
Feature activation+2.57e-01

Pos logit contributions
 behavi+0.036
Mex+0.035
rael+0.035
iffs+0.035
 NEWS+0.033
Neg logit contributions
utical-0.035
VICE-0.031
pi-0.031
ener-0.031
 oneself-0.029
Token rooms
Token ID9519
Feature activation-7.62e-01

Pos logit contributions
utical+0.189
VICE+0.169
pi+0.169
ener+0.169
 oneself+0.159
Neg logit contributions
 behavi-0.183
Mex-0.178
rael-0.177
iffs-0.177
 NEWS-0.169
Token are
Token ID389
Feature activation+2.08e-02

Pos logit contributions
 behavi+0.463
Mex+0.452
rael+0.452
iffs+0.449
 NEWS+0.424
Neg logit contributions
utical-0.635
VICE-0.574
pi-0.573
ener-0.572
nir-0.545
Token not
Token ID407
Feature activation+1.13e-01

Pos logit contributions
utical+0.016
pi+0.015
VICE+0.015
ener+0.015
 oneself+0.014
Neg logit contributions
 behavi-0.014
Mex-0.013
rael-0.013
iffs-0.013
 NEWS-0.012
Tokenmakes
Token ID49123
Feature activation-1.02e-01

Pos logit contributions
 behavi+0.303
Mex+0.294
rael+0.293
iffs+0.293
 NEWS+0.279
Neg logit contributions
utical-0.302
pi-0.270
VICE-0.270
ener-0.270
 oneself-0.254
Token us
Token ID514
Feature activation-3.01e-01

Pos logit contributions
 behavi+0.065
Mex+0.062
iffs+0.062
rael+0.062
 NEWS+0.058
Neg logit contributions
utical-0.083
pi-0.076
ener-0.076
VICE-0.076
 oneself-0.073
Token who
Token ID508
Feature activation-1.61e-01

Pos logit contributions
 behavi+0.198
Mex+0.191
iffs+0.190
rael+0.189
 NEWS+0.179
Neg logit contributions
utical-0.237
ener-0.215
pi-0.215
VICE-0.214
 oneself-0.206
Token we
Token ID356
Feature activation-2.53e-01

Pos logit contributions
 behavi+0.121
Mex+0.118
rael+0.117
iffs+0.117
 NEWS+0.112
Neg logit contributions
utical-0.112
pi-0.100
ener-0.099
VICE-0.099
 oneself-0.094
Token are
Token ID389
Feature activation-1.15e-01

Pos logit contributions
 behavi+0.152
Mex+0.147
rael+0.146
iffs+0.146
 NEWS+0.138
Neg logit contributions
utical-0.213
pi-0.194
VICE-0.194
ener-0.193
 oneself-0.184
Token.'
Token ID2637
Feature activation-1.49e+00

Pos logit contributions
 behavi+0.067
Mex+0.065
rael+0.065
iffs+0.065
 NEWS+0.061
Neg logit contributions
utical-0.099
pi-0.091
ener-0.091
VICE-0.091
 oneself-0.087
Token
Token ID198
Feature activation+8.46e-02

Pos logit contributions
 behavi+0.870
Mex+0.853
iffs+0.839
rael+0.837
 NEWS+0.802
Neg logit contributions
utical-1.280
VICE-1.168
pi-1.166
ener-1.153
 oneself-1.114
Token
Token ID198
Feature activation+2.07e-02

Pos logit contributions
utical+0.067
pi+0.061
VICE+0.061
ener+0.061
 oneself+0.058
Neg logit contributions
 behavi-0.058
rael-0.053
iffs-0.053
Mex-0.053
 NEWS-0.050
TokenThe
Token ID464
Feature activation+5.74e-02

Pos logit contributions
utical+0.016
VICE+0.014
pi+0.014
ener+0.014
 oneself+0.013
Neg logit contributions
 behavi-0.014
rael-0.014
Mex-0.014
iffs-0.014
 NEWS-0.013
Token comment
Token ID2912
Feature activation-5.79e-01

Pos logit contributions
utical+0.046
ener+0.042
pi+0.042
VICE+0.042
 oneself+0.040
Neg logit contributions
 behavi-0.037
Mex-0.035
rael-0.035
iffs-0.035
 NEWS-0.033
Token seemed
Token ID3947
Feature activation-9.06e-02

Pos logit contributions
 behavi+0.347
Mex+0.337
rael+0.335
iffs+0.335
 NEWS+0.315
Neg logit contributions
utical-0.489
pi-0.445
VICE-0.444
ener-0.443
 oneself-0.423
Token =
Token ID796
Feature activation-1.07e-01

Pos logit contributions
utical+0.228
VICE+0.205
pi+0.204
ener+0.204
 oneself+0.194
Neg logit contributions
 behavi-0.199
Mex-0.195
rael-0.194
iffs-0.194
 NEWS-0.183
Token arg
Token ID1822
Feature activation-7.62e-01

Pos logit contributions
 behavi+0.077
iffs+0.075
Mex+0.075
rael+0.075
 NEWS+0.072
Neg logit contributions
utical-0.078
VICE-0.069
ener-0.069
pi-0.069
 oneself-0.065
Tokenparse
Token ID29572
Feature activation-9.37e-02

Pos logit contributions
iffs+0.501
Mex+0.501
rael+0.497
 behavi+0.496
 NEWS+0.471
Neg logit contributions
utical-0.582
VICE-0.528
ener-0.520
pi-0.515
 oneself-0.496
Token.
Token ID13
Feature activation-2.53e-02

Pos logit contributions
 behavi+0.068
iffs+0.067
Mex+0.067
rael+0.066
 NEWS+0.063
Neg logit contributions
utical-0.067
VICE-0.060
ener-0.059
pi-0.059
 oneself-0.056
TokenArg
Token ID28100
Feature activation-1.28e-01

Pos logit contributions
 behavi+0.015
Mex+0.014
iffs+0.014
rael+0.014
 NEWS+0.013
Neg logit contributions
utical-0.022
VICE-0.020
ener-0.020
pi-0.020
 oneself-0.019
Tokenument
Token ID1713
Feature activation-1.49e+00

Pos logit contributions
 behavi+0.083
Mex+0.082
iffs+0.081
rael+0.081
 NEWS+0.077
Neg logit contributions
utical-0.101
pi-0.091
VICE-0.091
ener-0.091
 oneself-0.087
TokenParser
Token ID46677
Feature activation-9.69e-02

Pos logit contributions
Mex+0.977
 behavi+0.966
iffs+0.964
rael+0.946
 NEWS+0.902
Neg logit contributions
utical-1.157
VICE-1.038
ener-1.037
pi-1.032
 oneself-0.990
Token()
Token ID3419
Feature activation+1.63e-01

Pos logit contributions
 behavi+0.080
Mex+0.079
iffs+0.079
rael+0.078
 NEWS+0.075
Neg logit contributions
utical-0.060
pi-0.052
ener-0.052
VICE-0.052
 oneself-0.049
Token ap
Token ID2471
Feature activation+2.24e-01

Pos logit contributions
utical+0.127
VICE+0.114
pi+0.114
ener+0.114
 oneself+0.108
Neg logit contributions
 behavi-0.109
Mex-0.106
iffs-0.105
rael-0.105
 NEWS-0.100
Token.
Token ID13
Feature activation+2.22e-02

Pos logit contributions
utical+0.164
VICE+0.147
ener+0.147
pi+0.146
 oneself+0.139
Neg logit contributions
 behavi-0.159
Mex-0.156
iffs-0.156
rael-0.155
 NEWS-0.147
Tokenadd
Token ID2860
Feature activation+2.28e-01

Pos logit contributions
utical+0.016
VICE+0.014
pi+0.014
ener+0.014
 oneself+0.013
Neg logit contributions
 behavi-0.016
Mex-0.016
iffs-0.016
rael-0.016
 NEWS-0.015
Token read
Token ID1100
Feature activation+7.01e-02

Pos logit contributions
utical+0.171
pi+0.153
VICE+0.153
ener+0.153
 oneself+0.145
Neg logit contributions
 behavi-0.165
Mex-0.161
rael-0.160
iffs-0.160
 NEWS-0.152
Token:
Token ID25
Feature activation+1.01e-01

Pos logit contributions
utical+0.048
pi+0.042
VICE+0.042
ener+0.042
 oneself+0.040
Neg logit contributions
 behavi-0.054
Mex-0.052
rael-0.052
iffs-0.052
 NEWS-0.050
Token
Token ID198
Feature activation-2.77e-01

Pos logit contributions
utical+0.076
pi+0.068
VICE+0.068
ener+0.067
 oneself+0.064
Neg logit contributions
 behavi-0.071
Mex-0.069
rael-0.069
iffs-0.069
 NEWS-0.065
Token
Token ID198
Feature activation-2.67e-01

Pos logit contributions
 behavi+0.191
rael+0.176
iffs+0.175
Mex+0.174
 NEWS+0.165
Neg logit contributions
utical-0.217
pi-0.199
VICE-0.199
ener-0.199
 oneself-0.187
Token"
Token ID1
Feature activation+1.81e-02

Pos logit contributions
 behavi+0.185
Mex+0.181
rael+0.180
iffs+0.180
 NEWS+0.171
Neg logit contributions
utical-0.200
pi-0.180
VICE-0.179
ener-0.179
 oneself-0.170
TokenBar
Token ID10374
Feature activation-1.49e+00

Pos logit contributions
utical+0.014
VICE+0.013
pi+0.013
ener+0.013
 oneself+0.012
Neg logit contributions
 behavi-0.012
rael-0.012
iffs-0.012
Mex-0.012
 NEWS-0.011
Tokenack
Token ID441
Feature activation+4.96e-01

Pos logit contributions
 behavi+1.026
Mex+0.994
rael+0.984
iffs+0.981
 NEWS+0.936
Neg logit contributions
utical-1.133
pi-1.022
VICE-1.017
ener-1.017
 oneself-0.969
Token Obama
Token ID2486
Feature activation-3.34e-01

Pos logit contributions
utical+0.507
pi+0.470
VICE+0.469
ener+0.469
 oneself+0.451
Neg logit contributions
 behavi-0.210
Mex-0.199
iffs-0.197
rael-0.197
 NEWS-0.180
Token would
Token ID561
Feature activation-3.15e-01

Pos logit contributions
 behavi+0.246
Mex+0.239
rael+0.237
iffs+0.237
 NEWS+0.226
Neg logit contributions
utical-0.237
pi-0.212
ener-0.212
VICE-0.212
 oneself-0.200
Token be
Token ID307
Feature activation+6.51e-02

Pos logit contributions
 behavi+0.225
Mex+0.217
iffs+0.215
rael+0.215
 NEWS+0.205
Neg logit contributions
utical-0.231
pi-0.208
ener-0.208
VICE-0.208
 oneself-0.198
Token the
Token ID262
Feature activation-1.18e-01

Pos logit contributions
utical+0.049
VICE+0.044
ener+0.044
pi+0.044
 oneself+0.042
Neg logit contributions
 behavi-0.045
Mex-0.044
iffs-0.043
rael-0.043
 NEWS-0.041
Token]
Token ID60
Feature activation-1.84e-01

Pos logit contributions
utical+0.126
VICE+0.113
pi+0.113
ener+0.113
 oneself+0.107
Neg logit contributions
 behavi-0.120
Mex-0.117
iffs-0.116
rael-0.116
 NEWS-0.111
Token
Token ID198
Feature activation-1.12e-01

Pos logit contributions
 behavi+0.128
Mex+0.124
rael+0.124
iffs+0.124
 NEWS+0.117
Neg logit contributions
utical-0.138
pi-0.124
VICE-0.124
ener-0.123
 oneself-0.117
Token
Token ID198
Feature activation-1.44e-01

Pos logit contributions
 behavi+0.077
rael+0.071
iffs+0.070
Mex+0.070
 NEWS+0.067
Neg logit contributions
utical-0.088
pi-0.081
VICE-0.080
ener-0.080
 oneself-0.076
TokenAccording
Token ID4821
Feature activation+1.68e-01

Pos logit contributions
 behavi+0.099
Mex+0.097
rael+0.096
iffs+0.096
 NEWS+0.091
Neg logit contributions
utical-0.109
pi-0.098
VICE-0.098
ener-0.098
 oneself-0.093
Token to
Token ID284
Feature activation+1.32e-01

Pos logit contributions
utical+0.106
pi+0.093
VICE+0.093
ener+0.093
 oneself+0.087
Neg logit contributions
 behavi-0.138
Mex-0.134
rael-0.134
iffs-0.133
 NEWS-0.128
Token Mub
Token ID40052
Feature activation-1.47e+00

Pos logit contributions
utical+0.096
pi+0.086
VICE+0.086
ener+0.086
 oneself+0.081
Neg logit contributions
 behavi-0.094
Mex-0.092
rael-0.091
iffs-0.091
 NEWS-0.087
Tokenrak
Token ID17716
Feature activation+3.28e-01

Pos logit contributions
 behavi+1.058
rael+1.046
Mex+1.042
iffs+1.041
 NEWS+0.985
Neg logit contributions
utical-1.060
VICE-0.951
pi-0.943
ener-0.935
 oneself-0.893
Tokenp
Token ID79
Feature activation-1.69e-01

Pos logit contributions
utical+0.235
pi+0.211
ener+0.210
VICE+0.209
 oneself+0.198
Neg logit contributions
 behavi-0.240
Mex-0.234
rael-0.232
iffs-0.232
 NEWS-0.221
Tokenuri
Token ID9900
Feature activation+3.12e-01

Pos logit contributions
 behavi+0.123
Mex+0.120
rael+0.120
iffs+0.120
 NEWS+0.114
Neg logit contributions
utical-0.121
VICE-0.108
pi-0.108
ener-0.108
 oneself-0.102
Token,
Token ID11
Feature activation+1.18e-01

Pos logit contributions
utical+0.225
pi+0.202
ener+0.201
VICE+0.201
 oneself+0.190
Neg logit contributions
 behavi-0.226
Mex-0.220
rael-0.218
iffs-0.218
 NEWS-0.208
Token only
Token ID691
Feature activation-1.98e-01

Pos logit contributions
utical+0.087
pi+0.078
VICE+0.078
ener+0.077
 oneself+0.073
Neg logit contributions
 behavi-0.083
Mex-0.081
rael-0.081
iffs-0.081
 NEWS-0.077
Token than
Token ID621
Feature activation-2.39e-01

Pos logit contributions
 behavi+0.058
Mex+0.057
iffs+0.056
rael+0.056
 NEWS+0.053
Neg logit contributions
utical-0.070
pi-0.064
VICE-0.063
ener-0.063
nir-0.060
Token appearing
Token ID12655
Feature activation-1.30e-02

Pos logit contributions
 behavi+0.149
Mex+0.145
rael+0.144
iffs+0.143
 NEWS+0.136
Neg logit contributions
utical-0.196
pi-0.178
VICE-0.177
ener-0.177
 oneself-0.169
Token before
Token ID878
Feature activation-2.67e-01

Pos logit contributions
 behavi+0.010
Mex+0.009
rael+0.009
iffs+0.009
 NEWS+0.009
Neg logit contributions
utical-0.009
ener-0.008
pi-0.008
VICE-0.008
 oneself-0.008
Token the
Token ID262
Feature activation+1.90e-02

Pos logit contributions
 behavi+0.186
Mex+0.181
rael+0.180
iffs+0.180
 NEWS+0.171
Neg logit contributions
utical-0.200
pi-0.180
VICE-0.180
ener-0.179
 oneself-0.170
Token Par
Token ID2547
Feature activation+3.56e-01

Pos logit contributions
utical+0.014
pi+0.013
ener+0.013
VICE+0.013
 oneself+0.012
Neg logit contributions
 behavi-0.013
Mex-0.013
rael-0.013
iffs-0.013
 NEWS-0.012
Tokenole
Token ID2305
Feature activation-1.47e+00

Pos logit contributions
utical+0.237
pi+0.210
ener+0.210
VICE+0.209
 oneself+0.196
Neg logit contributions
 behavi-0.279
Mex-0.272
iffs-0.270
rael-0.269
 NEWS-0.258
Token Board
Token ID5926
Feature activation-1.48e-01

Pos logit contributions
 behavi+1.058
Mex+1.031
rael+1.025
iffs+1.025
 NEWS+0.975
Neg logit contributions
utical-1.069
pi-0.957
VICE-0.957
ener-0.955
 oneself-0.903
Token of
Token ID286
Feature activation+2.13e-02

Pos logit contributions
 behavi+0.107
Mex+0.104
rael+0.104
iffs+0.104
 NEWS+0.099
Neg logit contributions
utical-0.107
pi-0.096
ener-0.095
VICE-0.095
 oneself-0.090
Token Canada
Token ID3340
Feature activation+6.38e-01

Pos logit contributions
utical+0.015
pi+0.014
VICE+0.014
ener+0.014
 oneself+0.013
Neg logit contributions
 behavi-0.016
Mex-0.015
iffs-0.015
rael-0.015
 NEWS-0.014
Token,
Token ID11
Feature activation-1.97e-01

Pos logit contributions
utical+0.453
ener+0.405
pi+0.405
VICE+0.401
 oneself+0.383
Neg logit contributions
 behavi-0.470
Mex-0.454
rael-0.454
iffs-0.451
 NEWS-0.432
Token Kh
Token ID5311
Feature activation+1.65e-02

Pos logit contributions
 behavi+0.137
Mex+0.134
iffs+0.133
rael+0.133
 NEWS+0.126
Neg logit contributions
utical-0.147
pi-0.132
VICE-0.132
ener-0.131
 oneself-0.124
Token.
Token ID13
Feature activation+1.89e-01

Pos logit contributions
utical+0.296
pi+0.265
VICE+0.264
ener+0.264
 oneself+0.248
Neg logit contributions
 behavi-0.321
Mex-0.311
rael-0.309
iffs-0.308
 NEWS-0.295
Tokenbit
Token ID2545
Feature activation+9.36e-02

Pos logit contributions
utical+0.146
VICE+0.131
ener+0.131
pi+0.131
 oneself+0.125
Neg logit contributions
 behavi-0.127
Mex-0.125
iffs-0.124
rael-0.124
 NEWS-0.117
Tokenmap
Token ID8899
Feature activation-3.03e-02

Pos logit contributions
utical+0.045
VICE+0.037
pi+0.037
ener+0.037
 oneself+0.034
Neg logit contributions
 behavi-0.091
Mex-0.090
iffs-0.089
rael-0.089
 NEWS-0.086
Token -
Token ID532
Feature activation-2.01e-01

Pos logit contributions
 behavi+0.022
iffs+0.022
Mex+0.022
rael+0.022
 NEWS+0.021
Neg logit contributions
utical-0.021
VICE-0.019
pi-0.019
ener-0.019
 oneself-0.018
TokenArg
Token ID28100
Feature activation+1.32e-01

Pos logit contributions
 behavi+0.153
iffs+0.150
Mex+0.150
rael+0.148
 NEWS+0.142
Neg logit contributions
utical-0.139
VICE-0.122
pi-0.121
ener-0.120
 oneself-0.116
Tokenument
Token ID1713
Feature activation-1.47e+00

Pos logit contributions
utical+0.108
pi+0.097
VICE+0.097
ener+0.096
 oneself+0.092
Neg logit contributions
 behavi-0.083
Mex-0.081
iffs-0.081
rael-0.080
 NEWS-0.076
TokenList
Token ID8053
Feature activation-2.70e-01

Pos logit contributions
iffs+0.892
 behavi+0.882
Mex+0.880
rael+0.857
 NEWS+0.838
Neg logit contributions
utical-1.219
pi-1.094
ener-1.088
VICE-1.083
 oneself-1.045
Token $
Token ID720
Feature activation-9.88e-02

Pos logit contributions
 behavi+0.200
iffs+0.200
Mex+0.198
rael+0.196
 NEWS+0.190
Neg logit contributions
utical-0.187
pi-0.163
VICE-0.163
ener-0.156
nir-0.155
Tokenw
Token ID86
Feature activation-2.28e-01

Pos logit contributions
 behavi+0.067
Mex+0.066
iffs+0.065
rael+0.065
 NEWS+0.062
Neg logit contributions
utical-0.076
ener-0.067
VICE-0.067
pi-0.067
 oneself-0.065
Tokenpath
Token ID6978
Feature activation-3.65e-01

Pos logit contributions
 behavi+0.162
Mex+0.162
iffs+0.162
rael+0.161
 NEWS+0.152
Neg logit contributions
utical-0.165
VICE-0.146
ener-0.144
pi-0.143
 oneself-0.139
Token $
Token ID720
Feature activation+1.13e-01

Pos logit contributions
 behavi+0.269
iffs+0.263
Mex+0.262
rael+0.262
 NEWS+0.251
Neg logit contributions
utical-0.258
VICE-0.228
pi-0.228
ener-0.226
 oneself-0.215
Token,
Token ID11
Feature activation+7.92e-02

Pos logit contributions
utical+0.295
pi+0.261
ener+0.260
VICE+0.260
 oneself+0.245
Neg logit contributions
 behavi-0.370
Mex-0.362
rael-0.360
iffs-0.359
 NEWS-0.344
Token fail
Token ID2038
Feature activation+3.53e-01

Pos logit contributions
utical+0.055
ener+0.049
pi+0.049
VICE+0.049
 oneself+0.047
Neg logit contributions
 behavi-0.059
Mex-0.058
rael-0.057
iffs-0.057
 NEWS-0.055
Token horribly
Token ID33437
Feature activation-4.76e-02

Pos logit contributions
utical+0.215
pi+0.189
VICE+0.188
ener+0.188
 oneself+0.176
Neg logit contributions
 behavi-0.295
Mex-0.288
rael-0.287
iffs-0.286
 NEWS-0.275
Token or
Token ID393
Feature activation+1.20e-01

Pos logit contributions
 behavi+0.035
Mex+0.035
rael+0.034
iffs+0.034
 NEWS+0.033
Neg logit contributions
utical-0.033
pi-0.030
VICE-0.030
ener-0.030
 oneself-0.028
Token acqu
Token ID4078
Feature activation+4.30e-01

Pos logit contributions
utical+0.089
pi+0.080
ener+0.080
VICE+0.080
 oneself+0.076
Neg logit contributions
 behavi-0.084
Mex-0.082
rael-0.081
iffs-0.081
 NEWS-0.077
Tokenit
Token ID270
Feature activation-1.47e+00

Pos logit contributions
utical+0.296
pi+0.267
ener+0.267
VICE+0.263
nir+0.251
Neg logit contributions
 behavi-0.330
Mex-0.314
iffs-0.312
rael-0.310
 NEWS-0.301
Token themselves
Token ID2405
Feature activation-7.60e-02

Pos logit contributions
 behavi+1.175
Mex+1.149
iffs+1.142
rael+1.142
 NEWS+1.091
Neg logit contributions
utical-0.947
pi-0.833
VICE-0.833
ener-0.829
 oneself-0.770
Token with
Token ID351
Feature activation-7.40e-04

Pos logit contributions
 behavi+0.055
Mex+0.053
rael+0.053
iffs+0.053
 NEWS+0.050
Neg logit contributions
utical-0.055
pi-0.049
VICE-0.049
ener-0.049
 oneself-0.047
Token pride
Token ID11293
Feature activation+2.47e-01

Pos logit contributions
 behavi+0.000
Mex+0.000
rael+0.000
iffs+0.000
 NEWS+0.000
Neg logit contributions
utical-0.001
pi-0.001
VICE-0.001
ener-0.001
 oneself-0.000
Token so
Token ID523
Feature activation-9.91e-02

Pos logit contributions
utical+0.176
pi+0.158
ener+0.158
VICE+0.157
 oneself+0.150
Neg logit contributions
 behavi-0.180
Mex-0.176
rael-0.175
iffs-0.174
 NEWS-0.166
Token he
Token ID339
Feature activation+3.50e-03

Pos logit contributions
 behavi+0.063
Mex+0.061
rael+0.060
iffs+0.060
 NEWS+0.057
Neg logit contributions
utical-0.080
pi-0.073
ener-0.073
VICE-0.073
 oneself-0.070
Token technology
Token ID3037
Feature activation+3.56e-01

Pos logit contributions
 behavi+0.003
Mex+0.003
rael+0.003
iffs+0.003
 NEWS+0.003
Neg logit contributions
utical-0.004
pi-0.004
VICE-0.004
ener-0.004
 oneself-0.003
Token and
Token ID290
Feature activation-1.91e-02

Pos logit contributions
utical+0.265
pi+0.240
VICE+0.240
ener+0.239
 oneself+0.228
Neg logit contributions
 behavi-0.246
Mex-0.242
rael-0.241
iffs-0.239
 NEWS-0.227
Token who
Token ID508
Feature activation-4.87e-02

Pos logit contributions
 behavi+0.013
Mex+0.013
iffs+0.013
rael+0.013
 NEWS+0.012
Neg logit contributions
utical-0.014
pi-0.013
VICE-0.013
ener-0.013
 oneself-0.012
Token cracks
Token ID23217
Feature activation+1.36e-01

Pos logit contributions
 behavi+0.036
Mex+0.035
rael+0.035
iffs+0.035
 NEWS+0.033
Neg logit contributions
utical-0.035
pi-0.031
VICE-0.031
ener-0.031
 oneself-0.029
Token cor
Token ID1162
Feature activation+1.28e-01

Pos logit contributions
utical+0.111
pi+0.101
VICE+0.101
ener+0.101
 oneself+0.097
Neg logit contributions
 behavi-0.086
Mex-0.083
rael-0.082
iffs-0.082
 NEWS-0.077
Tokenny
Token ID3281
Feature activation-1.46e+00

Pos logit contributions
utical+0.103
VICE+0.093
ener+0.093
pi+0.093
 oneself+0.089
Neg logit contributions
 behavi-0.081
Mex-0.080
iffs-0.079
rael-0.079
 NEWS-0.075
Token jokes
Token ID14532
Feature activation-2.42e-01

Pos logit contributions
 behavi+0.729
Mex+0.701
rael+0.697
iffs+0.695
 NEWS+0.644
Neg logit contributions
utical-1.379
pi-1.269
VICE-1.269
ener-1.266
 oneself-1.216
Token.
Token ID13
Feature activation+3.12e-02

Pos logit contributions
 behavi+0.168
Mex+0.164
rael+0.163
iffs+0.162
 NEWS+0.154
Neg logit contributions
utical-0.182
pi-0.164
VICE-0.164
ener-0.163
 oneself-0.155
Token But
Token ID887
Feature activation-5.58e-02

Pos logit contributions
utical+0.023
VICE+0.020
pi+0.020
ener+0.020
 oneself+0.019
Neg logit contributions
 behavi-0.023
rael-0.022
iffs-0.022
Mex-0.022
 NEWS-0.020
Token she
Token ID673
Feature activation-1.52e-01

Pos logit contributions
 behavi+0.041
Mex+0.040
rael+0.039
iffs+0.039
 NEWS+0.037
Neg logit contributions
utical-0.040
VICE-0.036
ener-0.036
pi-0.036
 oneself-0.034
Token is
Token ID318
Feature activation-3.13e-02

Pos logit contributions
 behavi+0.113
Mex+0.110
iffs+0.109
rael+0.109
 NEWS+0.104
Neg logit contributions
utical-0.107
pi-0.096
ener-0.095
VICE-0.095
 oneself-0.090
Token a
Token ID257
Feature activation-1.27e-01

Pos logit contributions
utical+0.164
pi+0.149
ener+0.148
VICE+0.147
 oneself+0.141
Neg logit contributions
 behavi-0.175
iffs-0.170
Mex-0.170
rael-0.169
 NEWS-0.160
Token '
Token ID705
Feature activation-2.77e-01

Pos logit contributions
 behavi+0.092
iffs+0.090
Mex+0.089
rael+0.089
 NEWS+0.084
Neg logit contributions
utical-0.091
ener-0.082
pi-0.082
VICE-0.081
 oneself-0.077
Tokencons
Token ID5936
Feature activation-2.23e-01

Pos logit contributions
 behavi+0.231
Mex+0.224
iffs+0.224
rael+0.224
 NEWS+0.214
Neg logit contributions
utical-0.170
pi-0.150
ener-0.150
VICE-0.150
 oneself-0.139
Tokenpiracy
Token ID8703
Feature activation-7.56e-01

Pos logit contributions
 behavi+0.150
Mex+0.144
iffs+0.144
rael+0.143
 NEWS+0.136
Neg logit contributions
utical-0.174
pi-0.158
ener-0.158
VICE-0.157
 oneself-0.148
Token theory
Token ID4583
Feature activation+3.35e-01

Pos logit contributions
 behavi+0.347
Mex+0.332
iffs+0.331
rael+0.330
 NEWS+0.300
Neg logit contributions
utical-0.745
pi-0.689
ener-0.689
VICE-0.687
 oneself-0.662
Token.'
Token ID2637
Feature activation-1.46e+00

Pos logit contributions
utical+0.274
ener+0.250
pi+0.250
VICE+0.249
 oneself+0.239
Neg logit contributions
 behavi-0.208
Mex-0.202
rael-0.201
iffs-0.200
 NEWS-0.188
Token
Token ID198
Feature activation+4.76e-02

Pos logit contributions
 behavi+0.864
Mex+0.852
iffs+0.838
rael+0.833
 NEWS+0.804
Neg logit contributions
utical-1.238
pi-1.122
ener-1.114
VICE-1.113
 oneself-1.072
Token
Token ID198
Feature activation-5.14e-02

Pos logit contributions
utical+0.038
pi+0.035
VICE+0.035
ener+0.035
 oneself+0.033
Neg logit contributions
 behavi-0.032
rael-0.030
iffs-0.030
Mex-0.030
 NEWS-0.028
TokenM
Token ID44
Feature activation+7.31e-02

Pos logit contributions
 behavi+0.032
Mex+0.031
rael+0.031
iffs+0.031
 NEWS+0.029
Neg logit contributions
utical-0.042
VICE-0.038
pi-0.038
ener-0.038
 oneself-0.036
Tokenook
Token ID566
Feature activation+4.91e-01

Pos logit contributions
utical+0.043
pi+0.038
ener+0.038
VICE+0.037
astery+0.034
Neg logit contributions
 behavi-0.066
Mex-0.061
iffs-0.060
rael-0.060
 NEWS-0.058
Token noted
Token ID4367
Feature activation-9.58e-03

Pos logit contributions
utical+0.364
pi+0.328
ener+0.328
VICE+0.327
 oneself+0.311
Neg logit contributions
 behavi-0.346
Mex-0.336
rael-0.333
iffs-0.332
 NEWS-0.316
Token a
Token ID257
Feature activation-1.91e-01

Pos logit contributions
 behavi+0.123
Mex+0.119
iffs+0.118
rael+0.117
 NEWS+0.111
Neg logit contributions
utical-0.151
VICE-0.137
ener-0.137
pi-0.137
 oneself-0.131
Token Russian
Token ID3394
Feature activation+2.92e-01

Pos logit contributions
 behavi+0.120
Mex+0.116
iffs+0.115
rael+0.114
 NEWS+0.108
Neg logit contributions
utical-0.156
pi-0.143
VICE-0.142
ener-0.142
 oneself-0.136
Token truck
Token ID7779
Feature activation-7.76e-01

Pos logit contributions
utical+0.254
ener+0.234
VICE+0.233
pi+0.233
 oneself+0.224
Neg logit contributions
 behavi-0.167
Mex-0.162
iffs-0.160
rael-0.159
 NEWS-0.148
Token convoy
Token ID32860
Feature activation-2.63e-01

Pos logit contributions
 behavi+0.466
Mex+0.449
rael+0.445
iffs+0.444
 NEWS+0.420
Neg logit contributions
utical-0.656
pi-0.601
ener-0.601
VICE-0.598
 oneself-0.572
Token bearing
Token ID14121
Feature activation-6.56e-01

Pos logit contributions
 behavi+0.168
Mex+0.161
rael+0.159
iffs+0.159
 NEWS+0.151
Neg logit contributions
utical-0.212
ener-0.194
pi-0.194
VICE-0.193
 oneself-0.185
Token humanitarian
Token ID15706
Feature activation-1.46e+00

Pos logit contributions
 behavi+0.400
Mex+0.389
rael+0.388
iffs+0.387
 NEWS+0.366
Neg logit contributions
utical-0.547
pi-0.497
VICE-0.496
ener-0.495
 oneself-0.471
Token aid
Token ID6133
Feature activation-4.07e-04

Pos logit contributions
 behavi+0.848
Mex+0.823
iffs+0.818
rael+0.815
 NEWS+0.771
Neg logit contributions
utical-1.255
pi-1.142
VICE-1.140
ener-1.139
 oneself-1.087
Token might
Token ID1244
Feature activation+2.45e-01

Pos logit contributions
 behavi+0.000
Mex+0.000
rael+0.000
iffs+0.000
 NEWS+0.000
Neg logit contributions
utical-0.000
ener-0.000
pi-0.000
VICE-0.000
 oneself-0.000
Token now
Token ID783
Feature activation+9.82e-03

Pos logit contributions
utical+0.189
ener+0.173
pi+0.172
VICE+0.171
 oneself+0.165
Neg logit contributions
 behavi-0.166
Mex-0.159
rael-0.156
iffs-0.156
 NEWS-0.149
Token enter
Token ID3802
Feature activation+6.37e-01

Pos logit contributions
utical+0.007
ener+0.007
pi+0.007
VICE+0.007
 oneself+0.006
Neg logit contributions
 behavi-0.007
Mex-0.006
iffs-0.006
rael-0.006
 NEWS-0.006
Token into
Token ID656
Feature activation+1.78e-01

Pos logit contributions
utical+0.565
ener+0.517
pi+0.517
VICE+0.515
 oneself+0.496
Neg logit contributions
 behavi-0.358
Mex-0.344
iffs-0.341
rael-0.337
 NEWS-0.317
Token 21
Token ID2310
Feature activation-1.75e-02

Pos logit contributions
 behavi+0.107
Mex+0.104
rael+0.104
iffs+0.104
 NEWS+0.098
Neg logit contributions
utical-0.122
pi-0.110
VICE-0.110
ener-0.109
 oneself-0.104
Token February
Token ID3945
Feature activation-1.59e-01

Pos logit contributions
 behavi+0.013
Mex+0.012
rael+0.012
iffs+0.012
 NEWS+0.012
Neg logit contributions
utical-0.013
pi-0.011
VICE-0.011
ener-0.011
 oneself-0.011
Token.
Token ID13
Feature activation-1.14e-01

Pos logit contributions
 behavi+0.112
Mex+0.108
rael+0.107
iffs+0.107
 NEWS+0.102
Neg logit contributions
utical-0.119
pi-0.107
ener-0.107
VICE-0.106
 oneself-0.101
Token In
Token ID554
Feature activation-8.78e-02

Pos logit contributions
 behavi+0.081
Mex+0.079
rael+0.079
iffs+0.078
 NEWS+0.074
Neg logit contributions
utical-0.083
VICE-0.075
pi-0.075
ener-0.074
 oneself-0.070
Token the
Token ID262
Feature activation-4.24e-02

Pos logit contributions
 behavi+0.061
Mex+0.060
iffs+0.059
rael+0.059
 NEWS+0.056
Neg logit contributions
utical-0.065
pi-0.059
ener-0.059
VICE-0.059
 oneself-0.056
Token clip
Token ID10651
Feature activation+1.42e+00

Pos logit contributions
 behavi+0.029
Mex+0.028
rael+0.028
iffs+0.028
 NEWS+0.026
Neg logit contributions
utical-0.032
pi-0.029
ener-0.029
VICE-0.029
 oneself-0.028
Token,
Token ID11
Feature activation-1.34e-01

Pos logit contributions
utical+1.022
ener+0.922
pi+0.918
VICE+0.917
 oneself+0.874
Neg logit contributions
 behavi-1.023
Mex-0.995
rael-0.983
iffs-0.979
 NEWS-0.933
Token the
Token ID262
Feature activation+6.09e-02

Pos logit contributions
 behavi+0.096
Mex+0.093
rael+0.093
iffs+0.093
 NEWS+0.088
Neg logit contributions
utical-0.098
pi-0.088
VICE-0.088
ener-0.088
 oneself-0.083
Token Professor
Token ID8129
Feature activation-1.89e-01

Pos logit contributions
utical+0.049
pi+0.044
VICE+0.044
ener+0.044
 oneself+0.042
Neg logit contributions
 behavi-0.039
Mex-0.038
rael-0.038
iffs-0.038
 NEWS-0.035
Token is
Token ID318
Feature activation+1.11e-01

Pos logit contributions
 behavi+0.137
Mex+0.134
rael+0.134
iffs+0.133
 NEWS+0.127
Neg logit contributions
utical-0.136
VICE-0.122
pi-0.122
ener-0.121
 oneself-0.114
Token heard
Token ID2982
Feature activation-7.81e-02

Pos logit contributions
utical+0.100
VICE+0.093
ener+0.092
pi+0.092
 oneself+0.089
Neg logit contributions
 behavi-0.059
Mex-0.057
iffs-0.056
rael-0.056
 NEWS-0.053
Token servants
Token ID17523
Feature activation-2.62e-01

Pos logit contributions
utical+0.183
pi+0.164
ener+0.163
VICE+0.163
 oneself+0.154
Neg logit contributions
 behavi-0.180
Mex-0.174
iffs-0.174
rael-0.174
 NEWS-0.166
Token superv
Token ID29745
Feature activation-6.07e-01

Pos logit contributions
 behavi+0.178
Mex+0.173
rael+0.172
iffs+0.171
 NEWS+0.162
Neg logit contributions
utical-0.201
pi-0.182
VICE-0.181
ener-0.181
 oneself-0.173
Tokenise
Token ID786
Feature activation-3.87e-01

Pos logit contributions
 behavi+0.481
Mex+0.459
rael+0.452
iffs+0.445
 NEWS+0.438
Neg logit contributions
utical-0.413
pi-0.366
VICE-0.363
ener-0.363
 oneself-0.342
Token ang
Token ID3550
Feature activation+6.24e-01

Pos logit contributions
 behavi+0.257
Mex+0.250
rael+0.249
iffs+0.249
 NEWS+0.236
Neg logit contributions
utical-0.302
pi-0.273
VICE-0.273
ener-0.272
 oneself-0.259
Tokenl
Token ID75
Feature activation-1.42e-01

Pos logit contributions
utical+0.405
pi+0.358
VICE+0.357
ener+0.357
 oneself+0.335
Neg logit contributions
 behavi-0.496
Mex-0.485
rael-0.482
iffs-0.482
 NEWS-0.461
Tokenophone
Token ID38656
Feature activation+1.35e+00

Pos logit contributions
 behavi+0.120
Mex+0.116
rael+0.116
iffs+0.115
 NEWS+0.111
Neg logit contributions
utical-0.086
pi-0.076
ener-0.075
VICE-0.075
 oneself-0.070
Token and
Token ID290
Feature activation-1.01e-01

Pos logit contributions
utical+1.067
pi+0.979
ener+0.971
VICE+0.970
 oneself+0.934
Neg logit contributions
 behavi-0.878
rael-0.858
Mex-0.855
iffs-0.847
 NEWS-0.791
Token franc
Token ID46754
Feature activation+5.10e-01

Pos logit contributions
 behavi+0.066
Mex+0.064
rael+0.064
iffs+0.064
 NEWS+0.060
Neg logit contributions
utical-0.079
pi-0.072
VICE-0.072
ener-0.072
 oneself-0.069
Tokenophone
Token ID38656
Feature activation+1.36e+00

Pos logit contributions
utical+0.318
VICE+0.280
pi+0.278
ener+0.277
 oneself+0.259
Neg logit contributions
 behavi-0.417
Mex-0.411
iffs-0.410
rael-0.408
 NEWS-0.390
Token employees
Token ID4409
Feature activation-4.56e-01

Pos logit contributions
utical+1.076
pi+0.984
VICE+0.981
ener+0.977
 oneself+0.941
Neg logit contributions
 behavi-0.873
rael-0.851
Mex-0.850
iffs-0.842
 NEWS-0.787
Token.
Token ID13
Feature activation-5.16e-02

Pos logit contributions
 behavi+0.316
Mex+0.307
rael+0.305
iffs+0.305
 NEWS+0.290
Neg logit contributions
utical-0.344
pi-0.309
VICE-0.309
ener-0.308
 oneself-0.293
Token record
Token ID1700
Feature activation-3.77e-01

Pos logit contributions
 behavi+0.053
Mex+0.051
rael+0.051
iffs+0.051
 NEWS+0.047
Neg logit contributions
utical-0.100
pi-0.092
VICE-0.092
ener-0.092
 oneself-0.088
Token,
Token ID11
Feature activation+4.29e-02

Pos logit contributions
 behavi+0.266
Mex+0.259
rael+0.258
iffs+0.257
 NEWS+0.245
Neg logit contributions
utical-0.279
pi-0.251
VICE-0.250
ener-0.250
 oneself-0.237
Token since
Token ID1201
Feature activation-2.08e-01

Pos logit contributions
utical+0.033
pi+0.029
VICE+0.029
ener+0.029
 oneself+0.028
Neg logit contributions
 behavi-0.029
Mex-0.029
rael-0.028
iffs-0.028
 NEWS-0.027
Token their
Token ID511
Feature activation+1.34e-01

Pos logit contributions
 behavi+0.142
Mex+0.138
rael+0.137
iffs+0.137
 NEWS+0.130
Neg logit contributions
utical-0.159
pi-0.143
VICE-0.143
ener-0.143
 oneself-0.136
Token clients
Token ID7534
Feature activation-1.42e-01

Pos logit contributions
utical+0.104
pi+0.094
VICE+0.094
ener+0.094
 oneself+0.090
Neg logit contributions
 behavi-0.089
Mex-0.086
rael-0.086
iffs-0.086
 NEWS-0.081
Token demand
Token ID3512
Feature activation+1.37e+00

Pos logit contributions
 behavi+0.094
Mex+0.090
iffs+0.090
rael+0.090
 NEWS+0.085
Neg logit contributions
utical-0.112
pi-0.102
VICE-0.102
ener-0.101
 oneself-0.097
Token confidentiality
Token ID32554
Feature activation+2.17e-01

Pos logit contributions
utical+1.061
pi+0.964
VICE+0.960
ener+0.956
 oneself+0.926
Neg logit contributions
 behavi-0.919
Mex-0.888
rael-0.881
iffs-0.880
 NEWS-0.835
Token.
Token ID13
Feature activation+1.86e-01

Pos logit contributions
utical+0.159
pi+0.143
VICE+0.143
ener+0.142
 oneself+0.136
Neg logit contributions
 behavi-0.154
Mex-0.150
rael-0.150
iffs-0.149
 NEWS-0.142
Token One
Token ID1881
Feature activation-9.82e-02

Pos logit contributions
utical+0.141
pi+0.127
VICE+0.127
ener+0.126
 oneself+0.120
Neg logit contributions
 behavi-0.129
Mex-0.125
rael-0.124
iffs-0.124
 NEWS-0.117
Token executive
Token ID4640
Feature activation+2.59e-01

Pos logit contributions
 behavi+0.067
Mex+0.065
rael+0.065
iffs+0.065
 NEWS+0.062
Neg logit contributions
utical-0.075
pi-0.067
VICE-0.067
ener-0.067
 oneself-0.063
Token at
Token ID379
Feature activation+5.78e-03

Pos logit contributions
utical+0.202
pi+0.182
VICE+0.182
ener+0.182
 oneself+0.172
Neg logit contributions
 behavi-0.172
Mex-0.168
rael-0.167
iffs-0.167
 NEWS-0.158
Token also
Token ID635
Feature activation-1.83e-01

Pos logit contributions
 behavi+0.119
Mex+0.115
iffs+0.114
rael+0.114
 NEWS+0.109
Neg logit contributions
utical-0.126
ener-0.114
pi-0.114
VICE-0.113
 oneself-0.108
Token pursued
Token ID19189
Feature activation+4.66e-02

Pos logit contributions
 behavi+0.132
Mex+0.127
iffs+0.127
rael+0.127
 NEWS+0.121
Neg logit contributions
utical-0.133
ener-0.120
pi-0.120
VICE-0.120
 oneself-0.114
Token heavily
Token ID7272
Feature activation-2.34e-01

Pos logit contributions
utical+0.023
pi+0.019
VICE+0.019
ener+0.019
 oneself+0.017
Neg logit contributions
 behavi-0.045
Mex-0.044
rael-0.044
iffs-0.044
 NEWS-0.042
Token by
Token ID416
Feature activation-1.28e-01

Pos logit contributions
 behavi+0.201
Mex+0.197
rael+0.196
iffs+0.196
 NEWS+0.188
Neg logit contributions
utical-0.137
pi-0.119
VICE-0.119
ener-0.119
 oneself-0.111
Token Notre
Token ID23382
Feature activation-3.80e-01

Pos logit contributions
 behavi+0.086
Mex+0.083
rael+0.083
iffs+0.083
 NEWS+0.078
Neg logit contributions
utical-0.099
pi-0.090
ener-0.090
VICE-0.090
 oneself-0.085
Token Dame
Token ID20377
Feature activation+1.89e+00

Pos logit contributions
 behavi+0.363
Mex+0.348
rael+0.348
iffs+0.346
 NEWS+0.334
Neg logit contributions
utical-0.191
ener-0.169
pi-0.166
VICE-0.163
 oneself-0.152
Token,
Token ID11
Feature activation-5.03e-02

Pos logit contributions
utical+1.373
pi+1.249
ener+1.244
VICE+1.225
 oneself+1.187
Neg logit contributions
 behavi-1.350
rael-1.319
Mex-1.307
iffs-1.291
 NEWS-1.221
Token which
Token ID543
Feature activation-1.52e-01

Pos logit contributions
 behavi+0.033
Mex+0.032
iffs+0.032
rael+0.032
 NEWS+0.030
Neg logit contributions
utical-0.039
ener-0.036
pi-0.036
VICE-0.036
 oneself-0.034
Token was
Token ID373
Feature activation-2.04e-01

Pos logit contributions
 behavi+0.107
Mex+0.103
rael+0.102
iffs+0.102
 NEWS+0.097
Neg logit contributions
utical-0.113
pi-0.102
ener-0.102
VICE-0.101
 oneself-0.097
Token the
Token ID262
Feature activation-9.46e-02

Pos logit contributions
 behavi+0.135
Mex+0.131
iffs+0.130
rael+0.130
 NEWS+0.123
Neg logit contributions
utical-0.160
pi-0.145
ener-0.145
VICE-0.145
 oneself-0.139
Token other
Token ID584
Feature activation+8.12e-02

Pos logit contributions
 behavi+0.068
Mex+0.066
rael+0.065
iffs+0.065
 NEWS+0.062
Neg logit contributions
utical-0.069
pi-0.062
VICE-0.062
ener-0.062
 oneself-0.058
Token offer
Token ID2897
Feature activation-3.09e-01

Pos logit contributions
utical+0.110
VICE+0.098
pi+0.098
ener+0.098
 oneself+0.093
Neg logit contributions
 behavi-0.106
Mex-0.104
iffs-0.104
rael-0.104
 NEWS-0.099
Token the
Token ID262
Feature activation+1.73e-01

Pos logit contributions
 behavi+0.210
Mex+0.206
iffs+0.205
rael+0.204
 NEWS+0.195
Neg logit contributions
utical-0.235
VICE-0.211
pi-0.211
ener-0.210
nir-0.199
Token Coconut
Token ID46943
Feature activation+9.76e-02

Pos logit contributions
utical+0.127
VICE+0.113
pi+0.113
ener+0.113
 oneself+0.106
Neg logit contributions
 behavi-0.124
Mex-0.121
iffs-0.120
rael-0.120
 NEWS-0.115
Token Bay
Token ID4696
Feature activation+3.91e-02

Pos logit contributions
utical+0.072
VICE+0.064
pi+0.064
ener+0.064
 oneself+0.061
Neg logit contributions
 behavi-0.069
Mex-0.068
iffs-0.067
rael-0.067
 NEWS-0.064
Token Resort
Token ID28345
Feature activation-1.58e-01

Pos logit contributions
utical+0.029
VICE+0.026
pi+0.026
ener+0.026
 oneself+0.025
Neg logit contributions
 behavi-0.027
Mex-0.027
iffs-0.027
rael-0.027
 NEWS-0.025
Token &
Token ID1222
Feature activation+1.32e+00

Pos logit contributions
 behavi+0.112
Mex+0.110
iffs+0.109
rael+0.108
 NEWS+0.104
Neg logit contributions
utical-0.116
VICE-0.104
pi-0.104
ener-0.104
 oneself-0.098
Token Spa
Token ID40208
Feature activation+3.95e-01

Pos logit contributions
utical+0.563
pi+0.463
VICE+0.462
ener+0.461
 oneself+0.414
Neg logit contributions
 behavi-1.341
Mex-1.316
rael-1.311
iffs-1.311
 NEWS-1.267
Token in
Token ID287
Feature activation+3.17e-03

Pos logit contributions
utical+0.287
VICE+0.257
pi+0.257
ener+0.256
 oneself+0.242
Neg logit contributions
 behavi-0.283
Mex-0.276
iffs-0.274
rael-0.274
 NEWS-0.262
Token St
Token ID520
Feature activation+2.75e-01

Pos logit contributions
utical+0.002
pi+0.002
VICE+0.002
ener+0.002
 oneself+0.002
Neg logit contributions
 behavi-0.002
Mex-0.002
rael-0.002
iffs-0.002
 NEWS-0.002
Token Lucia
Token ID48158
Feature activation+4.81e-01

Pos logit contributions
utical+0.197
VICE+0.176
pi+0.176
ener+0.175
 oneself+0.166
Neg logit contributions
 behavi-0.200
Mex-0.195
rael-0.195
iffs-0.195
 NEWS-0.185
Token from
Token ID422
Feature activation+6.61e-02

Pos logit contributions
utical+0.336
pi+0.300
VICE+0.299
ener+0.299
 oneself+0.282
Neg logit contributions
 behavi-0.360
Mex-0.350
rael-0.349
iffs-0.348
 NEWS-0.332
Token remain
Token ID3520
Feature activation-5.19e-01

Pos logit contributions
utical+0.042
pi+0.038
ener+0.038
VICE+0.038
 oneself+0.036
Neg logit contributions
 behavi-0.036
Mex-0.035
rael-0.035
iffs-0.035
 NEWS-0.033
Token engaged
Token ID7953
Feature activation-2.56e-01

Pos logit contributions
 behavi+0.330
Mex+0.321
iffs+0.318
rael+0.318
 NEWS+0.302
Neg logit contributions
utical-0.420
ener-0.382
pi-0.381
VICE-0.381
 oneself-0.363
Token in
Token ID287
Feature activation+1.58e-01

Pos logit contributions
 behavi+0.185
Mex+0.181
rael+0.180
iffs+0.179
 NEWS+0.171
Neg logit contributions
utical-0.184
pi-0.165
VICE-0.165
ener-0.164
 oneself-0.155
Token security
Token ID2324
Feature activation+1.84e-01

Pos logit contributions
utical+0.123
pi+0.111
VICE+0.111
ener+0.111
 oneself+0.106
Neg logit contributions
 behavi-0.105
Mex-0.103
iffs-0.102
rael-0.102
 NEWS-0.096
Token sector
Token ID6567
Feature activation-7.40e-01

Pos logit contributions
utical+0.150
VICE+0.135
pi+0.135
ener+0.135
 oneself+0.129
Neg logit contributions
 behavi-0.117
Mex-0.114
rael-0.113
iffs-0.113
 NEWS-0.107
Token reform
Token ID4975
Feature activation+4.16e-01

Pos logit contributions
 behavi+0.498
Mex+0.484
rael+0.481
iffs+0.481
 NEWS+0.456
Neg logit contributions
utical-0.570
pi-0.515
VICE-0.514
ener-0.513
 oneself-0.487
Token efforts
Token ID4040
Feature activation-1.95e-01

Pos logit contributions
utical+0.313
pi+0.283
ener+0.282
VICE+0.281
 oneself+0.267
Neg logit contributions
 behavi-0.287
Mex-0.280
rael-0.278
iffs-0.278
 NEWS-0.264
Token in
Token ID287
Feature activation+3.03e-02

Pos logit contributions
 behavi+0.139
Mex+0.136
rael+0.135
iffs+0.135
 NEWS+0.128
Neg logit contributions
utical-0.143
pi-0.128
ener-0.128
VICE-0.128
 oneself-0.121
Token the
Token ID262
Feature activation+8.80e-02

Pos logit contributions
utical+0.024
pi+0.022
VICE+0.022
ener+0.022
 oneself+0.021
Neg logit contributions
 behavi-0.020
Mex-0.019
iffs-0.019
rael-0.019
 NEWS-0.018
Token D
Token ID360
Feature activation-7.34e-02

Pos logit contributions
utical+0.066
pi+0.059
ener+0.059
VICE+0.059
 oneself+0.056
Neg logit contributions
 behavi-0.061
Mex-0.060
iffs-0.059
rael-0.059
 NEWS-0.056
TokenRC
Token ID7397
Feature activation-1.24e-01

Pos logit contributions
 behavi+0.056
Mex+0.053
iffs+0.052
rael+0.052
 NEWS+0.050
Neg logit contributions
utical-0.052
pi-0.047
ener-0.047
VICE-0.047
nir-0.043
Token RSS
Token ID25012
Feature activation+2.13e-01

Pos logit contributions
utical+0.306
pi+0.270
VICE+0.269
ener+0.269
 oneself+0.251
Neg logit contributions
 behavi-0.397
Mex-0.388
rael-0.386
iffs-0.386
 NEWS-0.368
Token (
Token ID357
Feature activation-2.20e-01

Pos logit contributions
utical+0.149
VICE+0.132
pi+0.132
ener+0.130
 oneself+0.125
Neg logit contributions
 behavi-0.159
Mex-0.158
rael-0.156
iffs-0.155
 NEWS-0.150
TokenSD
Token ID10305
Feature activation+2.03e-01

Pos logit contributions
 behavi+0.157
Mex+0.155
iffs+0.154
rael+0.153
 NEWS+0.146
Neg logit contributions
utical-0.161
ener-0.143
pi-0.143
VICE-0.143
 oneself-0.136
Token)
Token ID8
Feature activation-7.72e-02

Pos logit contributions
utical+0.160
VICE+0.148
pi+0.147
ener+0.146
 oneself+0.139
Neg logit contributions
 behavi-0.136
rael-0.127
iffs-0.126
Mex-0.126
 NEWS-0.119
Token
Token ID198
Feature activation+4.01e-01

Pos logit contributions
 behavi+0.049
Mex+0.048
rael+0.048
iffs+0.048
 NEWS+0.046
Neg logit contributions
utical-0.062
pi-0.056
VICE-0.056
ener-0.056
 oneself-0.054
Token
Token ID198
Feature activation+3.84e-01

Pos logit contributions
utical+0.312
pi+0.287
VICE+0.286
ener+0.286
 oneself+0.269
Neg logit contributions
 behavi-0.281
rael-0.257
iffs-0.256
Mex-0.255
 NEWS-0.241
TokenThis
Token ID1212
Feature activation+1.05e-01

Pos logit contributions
utical+0.275
VICE+0.247
pi+0.247
ener+0.246
 oneself+0.231
Neg logit contributions
 behavi-0.282
Mex-0.272
rael-0.272
iffs-0.271
 NEWS-0.259
Token week
Token ID1285
Feature activation+1.06e-01

Pos logit contributions
utical+0.082
pi+0.075
ener+0.075
VICE+0.075
 oneself+0.071
Neg logit contributions
 behavi-0.069
Mex-0.067
iffs-0.067
rael-0.066
 NEWS-0.063
Token on
Token ID319
Feature activation-3.96e-02

Pos logit contributions
utical+0.076
pi+0.067
VICE+0.067
ener+0.067
 oneself+0.064
Neg logit contributions
 behavi-0.077
Mex-0.075
rael-0.075
iffs-0.075
 NEWS-0.072
Token C
Token ID327
Feature activation+2.00e-01

Pos logit contributions
 behavi+0.028
Mex+0.027
rael+0.027
iffs+0.027
 NEWS+0.026
Neg logit contributions
utical-0.029
pi-0.026
VICE-0.026
ener-0.026
 oneself-0.025
Tokenrave
Token ID5758
Feature activation+4.61e-02

Pos logit contributions
utical+0.155
pi+0.140
VICE+0.140
ener+0.140
 oneself+0.133
Neg logit contributions
 behavi-0.134
Mex-0.130
rael-0.130
iffs-0.129
 NEWS-0.123
Token.
Token ID13
Feature activation-3.30e-02

Pos logit contributions
utical+0.112
pi+0.101
ener+0.101
VICE+0.101
 oneself+0.094
Neg logit contributions
 behavi-0.131
rael-0.123
Mex-0.123
iffs-0.123
 NEWS-0.118
Tokent
Token ID83
Feature activation+1.51e-01

Pos logit contributions
 behavi+0.022
rael+0.020
iffs+0.020
Mex+0.020
 NEWS+0.019
Neg logit contributions
utical-0.027
pi-0.025
ener-0.024
VICE-0.024
nir-0.023
Token.
Token ID13
Feature activation+3.53e-02

Pos logit contributions
utical+0.102
pi+0.092
VICE+0.091
ener+0.091
 oneself+0.085
Neg logit contributions
 behavi-0.120
rael-0.113
iffs-0.113
Mex-0.113
 NEWS-0.107
Token BEL
Token ID29991
Feature activation+4.73e-01

Pos logit contributions
utical+0.026
pi+0.023
VICE+0.023
ener+0.023
 oneself+0.022
Neg logit contributions
 behavi-0.025
Mex-0.025
rael-0.024
iffs-0.024
 NEWS-0.023
Token 22
Token ID2534
Feature activation+7.20e-03

Pos logit contributions
utical+0.379
pi+0.342
VICE+0.342
ener+0.342
 oneself+0.325
Neg logit contributions
 behavi-0.304
rael-0.296
Mex-0.296
iffs-0.295
 NEWS-0.280
Token Krist
Token ID14912
Feature activation+3.85e-01

Pos logit contributions
utical+0.005
VICE+0.004
pi+0.004
ener+0.004
 oneself+0.004
Neg logit contributions
 behavi-0.005
Mex-0.005
rael-0.005
iffs-0.005
 NEWS-0.005
Tokenof
Token ID1659
Feature activation-5.25e-02

Pos logit contributions
utical+0.286
pi+0.257
VICE+0.257
ener+0.256
 oneself+0.243
Neg logit contributions
 behavi-0.270
Mex-0.263
rael-0.262
iffs-0.262
 NEWS-0.249
Token God
Token ID1793
Feature activation+5.16e-01

Pos logit contributions
 behavi+0.039
iffs+0.039
Mex+0.039
rael+0.039
 NEWS+0.037
Neg logit contributions
utical-0.036
VICE-0.032
ener-0.032
pi-0.032
 oneself-0.030
Tokenda
Token ID6814
Feature activation+1.99e-02

Pos logit contributions
utical+0.368
pi+0.330
ener+0.330
VICE+0.329
 oneself+0.310
Neg logit contributions
 behavi-0.379
Mex-0.367
iffs-0.365
rael-0.365
 NEWS-0.348
Tokenert
Token ID861
Feature activation+4.08e-01

Pos logit contributions
utical+0.014
VICE+0.013
pi+0.012
ener+0.012
 oneself+0.012
Neg logit contributions
 behavi-0.015
iffs-0.014
rael-0.014
Mex-0.014
 NEWS-0.014
Token (
Token ID357
Feature activation+1.47e-01

Pos logit contributions
utical+0.279
VICE+0.249
pi+0.245
ener+0.240
 oneself+0.230
Neg logit contributions
 behavi-0.307
rael-0.306
iffs-0.305
Mex-0.304
 NEWS-0.290
Token obtained
Token ID6492
Feature activation+4.64e-01

Pos logit contributions
utical+0.118
pi+0.107
VICE+0.107
ener+0.107
 oneself+0.101
Neg logit contributions
 behavi-0.108
Mex-0.105
rael-0.104
iffs-0.104
 NEWS-0.099
Token through
Token ID832
Feature activation+2.66e-01

Pos logit contributions
utical+0.234
pi+0.200
VICE+0.199
ener+0.199
 oneself+0.183
Neg logit contributions
 behavi-0.436
Mex-0.427
rael-0.425
iffs-0.424
 NEWS-0.409
Token a
Token ID257
Feature activation+8.25e-02

Pos logit contributions
utical+0.201
pi+0.182
VICE+0.181
ener+0.181
 oneself+0.172
Neg logit contributions
 behavi-0.184
Mex-0.179
rael-0.177
iffs-0.176
 NEWS-0.168
Token freedom
Token ID4925
Feature activation+2.91e-02

Pos logit contributions
utical+0.069
pi+0.063
VICE+0.062
ener+0.062
 oneself+0.059
Neg logit contributions
 behavi-0.051
Mex-0.049
iffs-0.049
rael-0.048
 NEWS-0.046
Token of
Token ID286
Feature activation+1.98e-01

Pos logit contributions
utical+0.022
VICE+0.019
pi+0.019
ener+0.019
nir+0.018
Neg logit contributions
 behavi-0.020
Mex-0.020
rael-0.020
iffs-0.020
 NEWS-0.019
Token information
Token ID1321
Feature activation+4.17e-01

Pos logit contributions
utical+0.161
pi+0.146
VICE+0.146
ener+0.146
 oneself+0.139
Neg logit contributions
 behavi-0.125
Mex-0.121
iffs-0.120
rael-0.120
 NEWS-0.113
Token request
Token ID2581
Feature activation+7.30e-01

Pos logit contributions
utical+0.258
pi+0.226
VICE+0.226
ener+0.225
 oneself+0.211
Neg logit contributions
 behavi-0.343
Mex-0.336
rael-0.336
iffs-0.335
 NEWS-0.321
Token,
Token ID11
Feature activation+1.69e-01

Pos logit contributions
utical+0.496
pi+0.441
ener+0.441
VICE+0.440
 oneself+0.415
Neg logit contributions
 behavi-0.561
Mex-0.544
rael-0.541
iffs-0.540
 NEWS-0.517
Token showed
Token ID3751
Feature activation+4.49e-01

Pos logit contributions
utical+0.119
pi+0.107
ener+0.107
VICE+0.106
 oneself+0.101
Neg logit contributions
 behavi-0.126
Mex-0.122
rael-0.121
iffs-0.121
 NEWS-0.115
Token trustees
Token ID36636
Feature activation+4.27e-01

Pos logit contributions
utical+0.352
pi+0.318
VICE+0.318
ener+0.317
 oneself+0.302
Neg logit contributions
 behavi-0.297
Mex-0.289
rael-0.287
iffs-0.287
 NEWS-0.271
Token were
Token ID547
Feature activation+3.42e-01

Pos logit contributions
utical+0.320
VICE+0.289
ener+0.289
pi+0.289
 oneself+0.274
Neg logit contributions
 behavi-0.298
Mex-0.288
iffs-0.287
rael-0.285
 NEWS-0.270
Tokenased
Token ID839
Feature activation-4.12e-02

Pos logit contributions
 behavi+0.131
Mex+0.124
rael+0.123
iffs+0.122
 NEWS+0.115
Neg logit contributions
utical-0.218
pi-0.200
ener-0.200
VICE-0.198
 oneself-0.190
Token as
Token ID355
Feature activation+3.24e-02

Pos logit contributions
 behavi+0.029
Mex+0.028
rael+0.028
iffs+0.028
 NEWS+0.027
Neg logit contributions
utical-0.031
VICE-0.028
pi-0.028
ener-0.027
nir-0.026
Token
Token ID564
Feature activation+5.22e-02

Pos logit contributions
utical+0.023
VICE+0.020
pi+0.020
ener+0.020
 oneself+0.019
Neg logit contributions
 behavi-0.024
Mex-0.023
rael-0.023
iffs-0.023
 NEWS-0.022
Token
Token ID246
Feature activation+6.25e-02

Pos logit contributions
utical+0.033
VICE+0.029
pi+0.029
ener+0.029
 oneself+0.027
Neg logit contributions
 behavi-0.043
Mex-0.042
rael-0.042
iffs-0.042
 NEWS-0.040
Tokendel
Token ID12381
Feature activation+3.02e-01

Pos logit contributions
utical+0.047
pi+0.042
VICE+0.042
ener+0.042
 oneself+0.040
Neg logit contributions
 behavi-0.044
Mex-0.043
rael-0.042
iffs-0.042
 NEWS-0.040
Tokenayed
Token ID16548
Feature activation+6.46e-01

Pos logit contributions
utical+0.215
pi+0.192
VICE+0.192
ener+0.191
 oneself+0.181
Neg logit contributions
 behavi-0.222
Mex-0.217
rael-0.215
iffs-0.215
 NEWS-0.206
Token unfolding
Token ID34502
Feature activation-1.24e-01

Pos logit contributions
utical+0.463
pi+0.415
VICE+0.414
ener+0.413
 oneself+0.390
Neg logit contributions
 behavi-0.471
Mex-0.458
rael-0.456
iffs-0.455
 NEWS-0.434
Token of
Token ID286
Feature activation+5.04e-02

Pos logit contributions
 behavi+0.089
Mex+0.087
rael+0.086
iffs+0.086
 NEWS+0.082
Neg logit contributions
utical-0.091
pi-0.081
VICE-0.081
ener-0.081
 oneself-0.077
Token independence
Token ID10404
Feature activation+4.39e-01

Pos logit contributions
utical+0.039
pi+0.035
VICE+0.035
ener+0.035
 oneself+0.033
Neg logit contributions
 behavi-0.034
Mex-0.033
rael-0.033
iffs-0.033
 NEWS-0.031
Token
Token ID447
Feature activation+2.69e-01

Pos logit contributions
utical+0.318
pi+0.286
ener+0.284
VICE+0.284
 oneself+0.270
Neg logit contributions
 behavi-0.317
Mex-0.308
rael-0.306
iffs-0.306
 NEWS-0.291
Token
Token ID247
Feature activation+8.32e-02

Pos logit contributions
utical+0.134
pi+0.113
ener+0.113
VICE+0.113
 oneself+0.104
Neg logit contributions
 behavi-0.256
Mex-0.250
iffs-0.248
rael-0.248
 NEWS-0.240
TokenRL
Token ID7836
Feature activation+1.70e-01

Pos logit contributions
utical+0.154
pi+0.106
VICE+0.105
ener+0.104
 oneself+0.080
Neg logit contributions
 behavi-0.780
Mex-0.767
rael-0.764
iffs-0.764
 NEWS-0.743
Token rapid
Token ID5801
Feature activation-2.97e-01

Pos logit contributions
utical+0.119
pi+0.107
ener+0.106
VICE+0.106
 oneself+0.100
Neg logit contributions
 behavi-0.127
Mex-0.124
rael-0.123
iffs-0.123
 NEWS-0.117
Token reaction
Token ID6317
Feature activation-1.57e-02

Pos logit contributions
 behavi+0.154
Mex+0.149
iffs+0.148
rael+0.147
 NEWS+0.138
Neg logit contributions
utical-0.275
pi-0.253
VICE-0.252
ener-0.251
 oneself-0.241
Token projects
Token ID4493
Feature activation-2.13e-01

Pos logit contributions
 behavi+0.012
Mex+0.012
rael+0.012
iffs+0.011
 NEWS+0.011
Neg logit contributions
utical-0.011
pi-0.010
ener-0.010
VICE-0.010
 oneself-0.009
Token director
Token ID3437
Feature activation-7.84e-01

Pos logit contributions
 behavi+0.166
Mex+0.162
rael+0.161
iffs+0.161
 NEWS+0.154
Neg logit contributions
utical-0.141
pi-0.125
VICE-0.125
ener-0.125
 oneself-0.117
Token.
Token ID13
Feature activation+1.67e-01

Pos logit contributions
 behavi+0.567
Mex+0.554
rael+0.551
iffs+0.551
 NEWS+0.524
Neg logit contributions
utical-0.565
VICE-0.505
pi-0.505
ener-0.503
 oneself-0.476
Token
Token ID198
Feature activation+1.39e-01

Pos logit contributions
utical+0.131
pi+0.119
ener+0.119
VICE+0.119
 oneself+0.113
Neg logit contributions
 behavi-0.110
Mex-0.106
rael-0.106
iffs-0.106
 NEWS-0.100
Token
Token ID198
Feature activation+7.03e-02

Pos logit contributions
utical+0.107
pi+0.099
ener+0.099
VICE+0.098
 oneself+0.093
Neg logit contributions
 behavi-0.098
rael-0.090
iffs-0.089
Mex-0.089
 NEWS-0.084
TokenThe
Token ID464
Feature activation+1.99e-01

Pos logit contributions
utical+0.054
pi+0.049
VICE+0.049
ener+0.049
 oneself+0.046
Neg logit contributions
 behavi-0.047
Mex-0.046
rael-0.046
iffs-0.046
 NEWS-0.043
Token teams
Token ID3466
Feature activation+1.98e-01

Pos logit contributions
utical+0.154
pi+0.140
ener+0.140
VICE+0.138
 oneself+0.132
Neg logit contributions
 behavi-0.134
Mex-0.129
rael-0.128
iffs-0.128
 NEWS-0.121
Token from
Token ID422
Feature activation+1.52e-01

Pos logit contributions
utical+0.142
pi+0.127
ener+0.127
VICE+0.127
 oneself+0.120
Neg logit contributions
 behavi-0.144
Mex-0.140
iffs-0.139
rael-0.139
 NEWS-0.132
Token feminist
Token ID14314
Feature activation+2.78e-01

Pos logit contributions
 behavi+0.058
Mex+0.056
iffs+0.056
rael+0.056
 NEWS+0.052
Neg logit contributions
utical-0.081
ener-0.074
pi-0.074
VICE-0.073
 oneself-0.070
Token ut
Token ID3384
Feature activation-6.02e-01

Pos logit contributions
utical+0.210
ener+0.190
pi+0.189
VICE+0.188
 oneself+0.180
Neg logit contributions
 behavi-0.192
iffs-0.187
Mex-0.185
rael-0.184
 NEWS-0.174
Tokenopia
Token ID24464
Feature activation+3.03e-02

Pos logit contributions
Mex+0.494
 behavi+0.493
rael+0.490
iffs+0.486
 NEWS+0.465
Neg logit contributions
utical-0.370
VICE-0.321
pi-0.321
ener-0.314
 oneself-0.301
Token so
Token ID523
Feature activation-8.78e-02

Pos logit contributions
utical+0.022
pi+0.020
ener+0.020
VICE+0.020
 oneself+0.019
Neg logit contributions
 behavi-0.022
Mex-0.021
rael-0.021
iffs-0.021
 NEWS-0.020
Token much
Token ID881
Feature activation-7.75e-02

Pos logit contributions
 behavi+0.049
Mex+0.047
rael+0.047
iffs+0.047
 NEWS+0.044
Neg logit contributions
utical-0.078
pi-0.071
VICE-0.071
ener-0.071
 oneself-0.068
Token as
Token ID355
Feature activation-1.14e-01

Pos logit contributions
 behavi+0.048
Mex+0.046
rael+0.046
iffs+0.046
 NEWS+0.043
Neg logit contributions
utical-0.064
pi-0.058
VICE-0.058
ener-0.058
 oneself-0.056
Token it
Token ID340
Feature activation-6.34e-02

Pos logit contributions
 behavi+0.081
Mex+0.079
rael+0.078
iffs+0.078
 NEWS+0.074
Neg logit contributions
utical-0.084
pi-0.075
VICE-0.075
ener-0.075
 oneself-0.071
Token is
Token ID318
Feature activation+2.37e-02

Pos logit contributions
 behavi+0.048
Mex+0.047
rael+0.047
iffs+0.047
 NEWS+0.045
Neg logit contributions
utical-0.043
pi-0.039
ener-0.039
VICE-0.039
 oneself-0.036
Token
Token ID3926
Feature activation+3.00e-02

Pos logit contributions
utical+0.018
pi+0.016
VICE+0.016
ener+0.016
 oneself+0.015
Neg logit contributions
 behavi-0.016
Mex-0.016
rael-0.016
iffs-0.016
 NEWS-0.015
Token slightly
Token ID4622
Feature activation-6.56e-01

Pos logit contributions
utical+0.023
pi+0.021
VICE+0.021
ener+0.021
 oneself+0.020
Neg logit contributions
 behavi-0.020
Mex-0.020
rael-0.020
iffs-0.019
 NEWS-0.018
Token less
Token ID1342
Feature activation+9.56e-03

Pos logit contributions
 behavi+0.407
Mex+0.395
rael+0.392
iffs+0.392
 NEWS+0.371
Neg logit contributions
utical-0.541
pi-0.491
VICE-0.490
ener-0.489
 oneself-0.466
Token.
Token ID13
Feature activation+1.42e-02

Pos logit contributions
utical+0.341
pi+0.303
VICE+0.303
ener+0.303
 oneself+0.285
Neg logit contributions
 behavi-0.374
Mex-0.365
rael-0.363
iffs-0.362
 NEWS-0.346
Token In
Token ID554
Feature activation+4.57e-03

Pos logit contributions
utical+0.010
pi+0.009
VICE+0.009
ener+0.009
 oneself+0.009
Neg logit contributions
 behavi-0.010
rael-0.010
Mex-0.010
iffs-0.010
 NEWS-0.009
Token a
Token ID257
Feature activation-2.83e-02

Pos logit contributions
utical+0.004
pi+0.003
ener+0.003
VICE+0.003
 oneself+0.003
Neg logit contributions
 behavi-0.003
Mex-0.003
rael-0.003
iffs-0.003
 NEWS-0.003
Token series
Token ID2168
Feature activation-1.47e-01

Pos logit contributions
 behavi+0.018
Mex+0.017
iffs+0.017
rael+0.017
 NEWS+0.016
Neg logit contributions
utical-0.023
pi-0.021
ener-0.021
VICE-0.021
 oneself-0.020
Token of
Token ID286
Feature activation-2.80e-02

Pos logit contributions
 behavi+0.112
iffs+0.107
Mex+0.107
rael+0.107
 NEWS+0.102
Neg logit contributions
utical-0.101
pi-0.092
ener-0.092
VICE-0.091
 oneself-0.086
Token stories
Token ID3923
Feature activation+8.02e-02

Pos logit contributions
 behavi+0.017
Mex+0.016
rael+0.016
iffs+0.016
 NEWS+0.015
Neg logit contributions
utical-0.024
pi-0.022
ener-0.022
VICE-0.022
 oneself-0.021
Token entitled
Token ID9080
Feature activation+2.02e-01

Pos logit contributions
utical+0.062
pi+0.056
ener+0.056
VICE+0.056
 oneself+0.053
Neg logit contributions
 behavi-0.054
Mex-0.052
rael-0.052
iffs-0.052
 NEWS-0.049
Token
Token ID564
Feature activation+2.00e-02

Pos logit contributions
utical+0.141
pi+0.126
VICE+0.126
ener+0.125
 oneself+0.118
Neg logit contributions
 behavi-0.150
Mex-0.147
rael-0.146
iffs-0.146
 NEWS-0.139
Token
Token ID250
Feature activation+1.44e-01

Pos logit contributions
utical+0.012
pi+0.010
VICE+0.010
ener+0.010
 oneself+0.010
Neg logit contributions
 behavi-0.017
Mex-0.017
rael-0.017
iffs-0.017
 NEWS-0.016
TokenEnvironmental
Token ID47213
Feature activation-2.34e-01

Pos logit contributions
utical+0.112
pi+0.101
VICE+0.101
ener+0.100
 oneself+0.095
Neg logit contributions
 behavi-0.097
Mex-0.094
rael-0.093
iffs-0.093
 NEWS-0.089
Token Justice
Token ID4796
Feature activation+6.06e-01

Pos logit contributions
 behavi+0.172
Mex+0.167
rael+0.166
iffs+0.166
 NEWS+0.158
Neg logit contributions
utical-0.167
pi-0.149
VICE-0.149
ener-0.149
 oneself-0.141
Token felt
Token ID2936
Feature activation+4.69e-02

Pos logit contributions
 behavi+0.150
Mex+0.145
rael+0.145
iffs+0.144
 NEWS+0.138
Neg logit contributions
utical-0.149
pi-0.134
ener-0.133
VICE-0.133
 oneself-0.126
Token that
Token ID326
Feature activation+1.57e-01

Pos logit contributions
utical+0.034
ener+0.031
pi+0.031
VICE+0.030
 oneself+0.029
Neg logit contributions
 behavi-0.034
Mex-0.033
rael-0.033
iffs-0.033
 NEWS-0.031
Token returning
Token ID8024
Feature activation+1.61e-01

Pos logit contributions
utical+0.118
pi+0.107
ener+0.106
VICE+0.106
 oneself+0.101
Neg logit contributions
 behavi-0.109
Mex-0.106
rael-0.106
iffs-0.105
 NEWS-0.100
Token to
Token ID284
Feature activation+4.72e-02

Pos logit contributions
utical+0.106
pi+0.094
VICE+0.094
ener+0.094
 oneself+0.088
Neg logit contributions
 behavi-0.127
Mex-0.124
rael-0.123
iffs-0.123
 NEWS-0.118
Token MLS
Token ID13981
Feature activation-7.85e-01

Pos logit contributions
utical+0.038
pi+0.034
VICE+0.034
ener+0.034
 oneself+0.032
Neg logit contributions
 behavi-0.031
Mex-0.030
rael-0.030
iffs-0.030
 NEWS-0.028
Token someday
Token ID25580
Feature activation+9.42e-02

Pos logit contributions
 behavi+0.547
Mex+0.532
iffs+0.529
rael+0.529
 NEWS+0.503
Neg logit contributions
utical-0.587
pi-0.529
ener-0.528
VICE-0.527
 oneself-0.500
Token would
Token ID561
Feature activation-1.80e-01

Pos logit contributions
utical+0.071
pi+0.064
ener+0.064
VICE+0.064
 oneself+0.061
Neg logit contributions
 behavi-0.065
Mex-0.063
iffs-0.063
rael-0.062
 NEWS-0.059
Token be
Token ID307
Feature activation+9.76e-02

Pos logit contributions
 behavi+0.129
Mex+0.125
iffs+0.124
rael+0.124
 NEWS+0.118
Neg logit contributions
utical-0.132
pi-0.119
ener-0.119
VICE-0.118
 oneself-0.112
Token a
Token ID257
Feature activation-1.77e-02

Pos logit contributions
utical+0.073
pi+0.066
ener+0.065
VICE+0.065
 oneself+0.062
Neg logit contributions
 behavi-0.068
Mex-0.066
iffs-0.066
rael-0.066
 NEWS-0.063
Token blessing
Token ID20027
Feature activation+1.78e-01

Pos logit contributions
 behavi+0.011
iffs+0.011
Mex+0.011
rael+0.011
 NEWS+0.010
Neg logit contributions
utical-0.014
pi-0.013
ener-0.013
VICE-0.013
 oneself-0.012
Token because
Token ID780
Feature activation+9.61e-02

Pos logit contributions
utical+0.131
ener+0.118
pi+0.118
VICE+0.117
 oneself+0.112
Neg logit contributions
 behavi-0.127
Mex-0.123
rael-0.123
iffs-0.122
 NEWS-0.116
TokenYou
Token ID1639
Feature activation+1.45e-01

Pos logit contributions
 behavi+0.092
Mex+0.089
rael+0.089
iffs+0.089
 NEWS+0.084
Neg logit contributions
utical-0.104
pi-0.094
VICE-0.094
ener-0.093
 oneself-0.089
Token
Token ID447
Feature activation-4.75e-02

Pos logit contributions
utical+0.105
pi+0.095
ener+0.095
VICE+0.094
 oneself+0.090
Neg logit contributions
 behavi-0.104
Mex-0.101
rael-0.101
iffs-0.100
 NEWS-0.095
Token
Token ID247
Feature activation-6.41e-02

Pos logit contributions
 behavi+0.045
Mex+0.044
iffs+0.044
rael+0.044
 NEWS+0.042
Neg logit contributions
utical-0.024
pi-0.020
ener-0.020
VICE-0.020
 oneself-0.018
Tokenll
Token ID297
Feature activation-9.46e-02

Pos logit contributions
 behavi+0.042
Mex+0.040
rael+0.040
iffs+0.040
 NEWS+0.038
Neg logit contributions
utical-0.051
pi-0.046
VICE-0.046
ener-0.046
 oneself-0.044
Token also
Token ID635
Feature activation-1.44e-01

Pos logit contributions
 behavi+0.070
Mex+0.068
rael+0.067
iffs+0.067
 NEWS+0.064
Neg logit contributions
utical-0.067
pi-0.060
ener-0.060
VICE-0.060
 oneself-0.057
Token encounter
Token ID8791
Feature activation-4.94e-01

Pos logit contributions
 behavi+0.107
Mex+0.103
rael+0.102
iffs+0.102
 NEWS+0.097
Neg logit contributions
utical-0.101
pi-0.092
ener-0.092
VICE-0.091
 oneself-0.087
Token the
Token ID262
Feature activation+4.77e-02

Pos logit contributions
 behavi+0.341
Mex+0.332
rael+0.330
iffs+0.329
 NEWS+0.312
Neg logit contributions
utical-0.372
pi-0.335
ener-0.335
VICE-0.335
 oneself-0.318
Token
Token ID564
Feature activation+1.49e-01

Pos logit contributions
utical+0.036
ener+0.033
pi+0.033
VICE+0.033
 oneself+0.031
Neg logit contributions
 behavi-0.032
Mex-0.032
rael-0.031
iffs-0.031
 NEWS-0.030
Token
Token ID250
Feature activation-1.70e-01

Pos logit contributions
utical+0.093
VICE+0.082
pi+0.082
ener+0.081
 oneself+0.077
Neg logit contributions
 behavi-0.122
Mex-0.119
rael-0.119
iffs-0.119
 NEWS-0.114
TokenM
Token ID44
Feature activation+8.88e-02

Pos logit contributions
 behavi+0.121
Mex+0.118
rael+0.117
iffs+0.117
 NEWS+0.111
Neg logit contributions
utical-0.125
pi-0.112
VICE-0.112
ener-0.112
 oneself-0.106
Tokenoss
Token ID793
Feature activation-4.72e-02

Pos logit contributions
utical+0.065
pi+0.059
VICE+0.059
ener+0.059
 oneself+0.055
Neg logit contributions
 behavi-0.063
Mex-0.061
rael-0.061
iffs-0.061
 NEWS-0.058
Token he
Token ID339
Feature activation-1.22e-01

Pos logit contributions
 behavi+0.217
Mex+0.212
rael+0.210
iffs+0.210
 NEWS+0.200
Neg logit contributions
utical-0.225
pi-0.202
VICE-0.201
ener-0.201
 oneself-0.190
Token was
Token ID373
Feature activation+2.01e-01

Pos logit contributions
 behavi+0.085
Mex+0.083
rael+0.083
iffs+0.083
 NEWS+0.078
Neg logit contributions
utical-0.092
pi-0.082
VICE-0.082
ener-0.082
 oneself-0.078
Token heading
Token ID9087
Feature activation-2.51e-01

Pos logit contributions
utical+0.160
pi+0.145
VICE+0.145
ener+0.145
 oneself+0.138
Neg logit contributions
 behavi-0.130
Mex-0.126
iffs-0.126
rael-0.125
 NEWS-0.119
Token in
Token ID287
Feature activation+2.32e-01

Pos logit contributions
 behavi+0.185
Mex+0.180
rael+0.179
iffs+0.179
 NEWS+0.170
Neg logit contributions
utical-0.178
pi-0.159
VICE-0.158
ener-0.158
 oneself-0.149
Token",
Token ID1600
Feature activation+6.66e-01

Pos logit contributions
utical+0.172
pi+0.155
VICE+0.154
ener+0.154
 oneself+0.148
Neg logit contributions
 behavi-0.163
Mex-0.158
iffs-0.157
rael-0.156
 NEWS-0.148
Token Cob
Token ID14828
Feature activation-7.09e-01

Pos logit contributions
utical+0.495
pi+0.445
VICE+0.445
ener+0.444
 oneself+0.421
Neg logit contributions
 behavi-0.468
Mex-0.455
iffs-0.452
rael-0.451
 NEWS-0.428
Tokenain
Token ID391
Feature activation+4.92e-01

Pos logit contributions
 behavi+0.481
Mex+0.471
iffs+0.470
rael+0.469
 NEWS+0.445
Neg logit contributions
utical-0.539
VICE-0.488
pi-0.483
ener-0.483
 oneself-0.460
Token's
Token ID338
Feature activation+2.72e-01

Pos logit contributions
utical+0.373
pi+0.337
ener+0.336
VICE+0.335
 oneself+0.318
Neg logit contributions
 behavi-0.338
Mex-0.328
iffs-0.325
rael-0.325
 NEWS-0.308
Token friend
Token ID1545
Feature activation+3.85e-01

Pos logit contributions
utical+0.211
pi+0.192
VICE+0.192
ener+0.191
 oneself+0.182
Neg logit contributions
 behavi-0.181
Mex-0.175
rael-0.175
iffs-0.174
 NEWS-0.165
Token,
Token ID11
Feature activation+8.65e-02

Pos logit contributions
utical+0.290
pi+0.261
VICE+0.260
ener+0.260
 oneself+0.246
Neg logit contributions
 behavi-0.266
Mex-0.259
rael-0.257
iffs-0.257
 NEWS-0.244
Token R
Token ID371
Feature activation-1.52e-01

Pos logit contributions
utical+0.064
pi+0.058
VICE+0.058
ener+0.058
 oneself+0.055
Neg logit contributions
 behavi-0.061
Mex-0.059
iffs-0.059
rael-0.059
 NEWS-0.056
Token people
Token ID661
Feature activation+5.73e-03

Pos logit contributions
utical+0.162
pi+0.150
VICE+0.149
ener+0.149
 oneself+0.143
Neg logit contributions
 behavi-0.109
rael-0.105
Mex-0.104
iffs-0.103
 NEWS-0.097
Token to
Token ID284
Feature activation+1.26e-01

Pos logit contributions
utical+0.004
ener+0.003
pi+0.003
VICE+0.003
 oneself+0.003
Neg logit contributions
 behavi-0.004
Mex-0.004
iffs-0.004
rael-0.004
 NEWS-0.004
Token use
Token ID779
Feature activation+4.89e-03

Pos logit contributions
utical+0.083
pi+0.074
VICE+0.073
ener+0.073
 oneself+0.069
Neg logit contributions
 behavi-0.099
Mex-0.097
rael-0.096
iffs-0.096
 NEWS-0.092
Token the
Token ID262
Feature activation-6.08e-02

Pos logit contributions
utical+0.004
pi+0.004
VICE+0.004
ener+0.004
 oneself+0.004
Neg logit contributions
 behavi-0.003
Mex-0.003
rael-0.003
iffs-0.003
 NEWS-0.003
Token proper
Token ID1774
Feature activation-1.80e-01

Pos logit contributions
 behavi+0.031
Mex+0.030
iffs+0.030
rael+0.030
 NEWS+0.028
Neg logit contributions
utical-0.057
VICE-0.052
pi-0.052
ener-0.052
 oneself-0.050
Token bathroom
Token ID12436
Feature activation-8.43e-01

Pos logit contributions
 behavi+0.092
Mex+0.089
iffs+0.088
rael+0.088
 NEWS+0.082
Neg logit contributions
utical-0.168
pi-0.155
VICE-0.154
ener-0.154
 oneself-0.148
Token,
Token ID11
Feature activation-2.01e-02

Pos logit contributions
 behavi+0.603
Mex+0.586
iffs+0.585
rael+0.584
 NEWS+0.556
Neg logit contributions
utical-0.617
VICE-0.550
pi-0.550
ener-0.549
 oneself-0.518
Token saying
Token ID2282
Feature activation-1.07e-02

Pos logit contributions
 behavi+0.014
Mex+0.014
rael+0.014
iffs+0.014
 NEWS+0.013
Neg logit contributions
utical-0.015
pi-0.014
VICE-0.014
ener-0.013
 oneself-0.013
Token,
Token ID11
Feature activation-4.26e-02

Pos logit contributions
 behavi+0.008
Mex+0.008
rael+0.008
iffs+0.008
 NEWS+0.007
Neg logit contributions
utical-0.008
pi-0.007
ener-0.007
VICE-0.007
 oneself-0.006
Token
Token ID564
Feature activation-2.45e-02

Pos logit contributions
 behavi+0.034
Mex+0.033
rael+0.033
iffs+0.033
 NEWS+0.031
Neg logit contributions
utical-0.028
pi-0.025
VICE-0.025
ener-0.024
 oneself-0.023
Token
Token ID250
Feature activation-1.20e-01

Pos logit contributions
 behavi+0.022
Mex+0.021
rael+0.021
iffs+0.021
 NEWS+0.020
Neg logit contributions
utical-0.014
pi-0.013
ener-0.013
VICE-0.013
 oneself-0.012
Token with
Token ID351
Feature activation-2.67e-01

Pos logit contributions
 behavi+0.096
Mex+0.093
rael+0.093
iffs+0.093
 NEWS+0.088
Neg logit contributions
utical-0.123
pi-0.111
VICE-0.111
ener-0.111
 oneself-0.105
Token Bernie
Token ID10477
Feature activation-3.74e-01

Pos logit contributions
 behavi+0.133
Mex+0.128
iffs+0.127
rael+0.127
 NEWS+0.118
Neg logit contributions
utical-0.253
VICE-0.233
pi-0.233
ener-0.233
 oneself-0.223
Token Sanders
Token ID5831
Feature activation+9.14e-02

Pos logit contributions
 behavi+0.194
Mex+0.188
iffs+0.186
rael+0.186
 NEWS+0.174
Neg logit contributions
utical-0.346
pi-0.317
VICE-0.317
ener-0.316
 oneself-0.303
Token Newt
Token ID40893
Feature activation-5.61e-01

Pos logit contributions
utical+0.065
pi+0.058
VICE+0.058
ener+0.058
 oneself+0.054
Neg logit contributions
 behavi-0.067
Mex-0.066
rael-0.065
iffs-0.065
 NEWS-0.062
Token Gingrich
Token ID29855
Feature activation-5.98e-01

Pos logit contributions
 behavi+0.308
Mex+0.297
rael+0.295
iffs+0.294
 NEWS+0.275
Neg logit contributions
utical-0.504
pi-0.461
VICE-0.461
ener-0.461
 oneself-0.441
Token Former
Token ID14466
Feature activation-1.18e+00

Pos logit contributions
 behavi+0.419
Mex+0.409
iffs+0.406
rael+0.406
 NEWS+0.388
Neg logit contributions
utical-0.444
pi-0.398
VICE-0.398
ener-0.396
 oneself-0.376
Token Speaker
Token ID14931
Feature activation-3.10e-01

Pos logit contributions
 behavi+0.745
Mex+0.722
rael+0.718
iffs+0.718
 NEWS+0.677
Neg logit contributions
utical-0.961
VICE-0.873
pi-0.872
ener-0.871
 oneself-0.829
Token of
Token ID286
Feature activation-2.05e-01

Pos logit contributions
 behavi+0.215
Mex+0.210
iffs+0.209
rael+0.209
 NEWS+0.200
Neg logit contributions
utical-0.232
pi-0.209
VICE-0.208
ener-0.207
 oneself-0.197
Token the
Token ID262
Feature activation-2.78e-01

Pos logit contributions
 behavi+0.130
Mex+0.127
rael+0.127
iffs+0.126
 NEWS+0.121
Neg logit contributions
utical-0.165
pi-0.150
ener-0.149
VICE-0.148
 oneself-0.142
Token House
Token ID2097
Feature activation-4.20e-01

Pos logit contributions
 behavi+0.149
Mex+0.143
iffs+0.142
rael+0.142
 NEWS+0.132
Neg logit contributions
utical-0.254
VICE-0.233
ener-0.233
pi-0.233
 oneself-0.223
Token Newt
Token ID40893
Feature activation-4.01e-01

Pos logit contributions
 behavi+0.294
Mex+0.288
rael+0.286
iffs+0.286
 NEWS+0.273
Neg logit contributions
utical-0.312
VICE-0.279
pi-0.279
ener-0.278
 oneself-0.263
Token.
Token ID13
Feature activation+4.21e-02

Pos logit contributions
utical+0.048
pi+0.043
VICE+0.043
ener+0.043
 oneself+0.041
Neg logit contributions
 behavi-0.050
Mex-0.048
rael-0.048
iffs-0.048
 NEWS-0.046
Token In
Token ID554
Feature activation-3.45e-02

Pos logit contributions
utical+0.033
VICE+0.030
pi+0.030
ener+0.030
 oneself+0.028
Neg logit contributions
 behavi-0.028
rael-0.027
Mex-0.027
iffs-0.027
 NEWS-0.025
Token both
Token ID1111
Feature activation+1.36e-02

Pos logit contributions
 behavi+0.023
Mex+0.022
iffs+0.022
rael+0.022
 NEWS+0.020
Neg logit contributions
utical-0.027
pi-0.025
VICE-0.025
ener-0.025
 oneself-0.024
Token Florida
Token ID4744
Feature activation-4.18e-01

Pos logit contributions
utical+0.012
VICE+0.011
pi+0.011
ener+0.011
 oneself+0.011
Neg logit contributions
 behavi-0.008
Mex-0.007
iffs-0.007
rael-0.007
 NEWS-0.007
Token and
Token ID290
Feature activation-1.11e-01

Pos logit contributions
 behavi+0.249
Mex+0.241
rael+0.239
iffs+0.239
 NEWS+0.225
Neg logit contributions
utical-0.355
pi-0.324
VICE-0.323
ener-0.323
 oneself-0.309
Token Ohio
Token ID6835
Feature activation-8.38e-01

Pos logit contributions
 behavi+0.065
Mex+0.063
iffs+0.062
rael+0.062
 NEWS+0.058
Neg logit contributions
utical-0.096
VICE-0.087
pi-0.087
ener-0.087
 oneself-0.083
Token,
Token ID11
Feature activation-2.82e-01

Pos logit contributions
 behavi+0.593
Mex+0.577
rael+0.574
iffs+0.574
 NEWS+0.546
Neg logit contributions
utical-0.619
pi-0.555
VICE-0.554
ener-0.553
 oneself-0.524
Token 50
Token ID2026
Feature activation-3.23e-01

Pos logit contributions
 behavi+0.178
Mex+0.172
rael+0.171
iffs+0.171
 NEWS+0.161
Neg logit contributions
utical-0.229
VICE-0.209
pi-0.209
ener-0.208
 oneself-0.199
Token percent
Token ID1411
Feature activation-3.38e-01

Pos logit contributions
 behavi+0.192
Mex+0.186
rael+0.185
iffs+0.184
 NEWS+0.174
Neg logit contributions
utical-0.275
pi-0.251
VICE-0.250
ener-0.250
 oneself-0.239
Token said
Token ID531
Feature activation-2.40e-01

Pos logit contributions
 behavi+0.239
Mex+0.233
rael+0.232
iffs+0.231
 NEWS+0.220
Neg logit contributions
utical-0.249
pi-0.224
VICE-0.223
ener-0.223
 oneself-0.211
Token Obama
Token ID2486
Feature activation-8.11e-01

Pos logit contributions
 behavi+0.159
Mex+0.154
iffs+0.153
rael+0.153
 NEWS+0.145
Neg logit contributions
utical-0.188
VICE-0.170
pi-0.170
ener-0.169
 oneself-0.161
Token these
Token ID777
Feature activation-3.05e-02

Pos logit contributions
 behavi+0.080
Mex+0.077
rael+0.077
iffs+0.077
 NEWS+0.073
Neg logit contributions
utical-0.101
pi-0.092
ener-0.092
VICE-0.091
 oneself-0.087
Token teams
Token ID3466
Feature activation+6.38e-02

Pos logit contributions
 behavi+0.020
Mex+0.020
rael+0.020
iffs+0.020
 NEWS+0.019
Neg logit contributions
utical-0.024
pi-0.022
ener-0.021
VICE-0.021
 oneself-0.020
Token bring
Token ID2222
Feature activation-1.03e-01

Pos logit contributions
utical+0.050
pi+0.046
ener+0.046
VICE+0.046
 oneself+0.043
Neg logit contributions
 behavi-0.042
Mex-0.041
iffs-0.040
rael-0.040
 NEWS-0.038
Token extra
Token ID3131
Feature activation-5.01e-01

Pos logit contributions
 behavi+0.071
Mex+0.069
rael+0.069
iffs+0.069
 NEWS+0.065
Neg logit contributions
utical-0.078
pi-0.070
ener-0.070
VICE-0.070
 oneself-0.067
Token pass
Token ID1208
Feature activation+7.22e-01

Pos logit contributions
 behavi+0.333
Mex+0.325
iffs+0.323
rael+0.322
 NEWS+0.306
Neg logit contributions
utical-0.390
VICE-0.352
pi-0.352
ener-0.351
 oneself-0.333
Token r
Token ID374
Feature activation-7.05e-01

Pos logit contributions
utical+0.537
ener+0.487
pi+0.486
VICE+0.482
 oneself+0.457
Neg logit contributions
 behavi-0.508
Mex-0.493
rael-0.489
iffs-0.485
 NEWS-0.464
Tokenus
Token ID385
Feature activation-4.65e-01

Pos logit contributions
 behavi+0.537
Mex+0.522
rael+0.519
iffs+0.517
 NEWS+0.496
Neg logit contributions
utical-0.484
pi-0.431
ener-0.430
VICE-0.429
 oneself-0.404
Tokenhers
Token ID7084
Feature activation+1.36e+00

Pos logit contributions
 behavi+0.431
Mex+0.425
iffs+0.425
rael+0.424
 NEWS+0.407
Neg logit contributions
utical-0.238
pi-0.202
VICE-0.202
ener-0.200
 oneself-0.184
Token,
Token ID11
Feature activation-9.73e-02

Pos logit contributions
utical+0.901
ener+0.805
pi+0.803
VICE+0.797
 oneself+0.756
Neg logit contributions
 behavi-1.062
Mex-1.032
iffs-1.025
rael-1.025
 NEWS-0.981
Token they
Token ID484
Feature activation+5.97e-02

Pos logit contributions
 behavi+0.066
Mex+0.064
rael+0.063
iffs+0.063
 NEWS+0.060
Neg logit contributions
utical-0.075
pi-0.068
ener-0.068
VICE-0.068
 oneself-0.064
Token can
Token ID460
Feature activation+3.66e-02

Pos logit contributions
utical+0.045
pi+0.041
ener+0.041
VICE+0.041
 oneself+0.039
Neg logit contributions
 behavi-0.041
Mex-0.040
iffs-0.039
rael-0.039
 NEWS-0.037