TOP ACTIVATIONS
MAX = 2.039

screw drivers . It can be a bit of a pain
you removed ). This can be a pain , just take
invited O AS Secretary General Jose Miguel Ins ul za ,
Among those arrested was Jose Fer rer , leader of
a time when Prime Minister Justin Trudeau has promised to build
s prestige to new heights . And
which claim to run universally across Linux distributions , bund ling
being awakened at 6 : 30 a . m . by
35 z 1 GER ON IM O T OC IN IT
Children learn quickly how to try to fit in . However
with honorary tattoo Ch ad " O ch oc
been hard to reach the public with the information . Despite
planning to get Racing Apex on to consoles in the near
. It can be a bit of a pain , but
giving Body conscious a low grade is wondering who it could
is planned for a release on the Xbox 360 and Microsoft
applicable to this article can be found here . Unfortunately it
- a - half - deep " source rock " out
17 Krypt on as dexter ously as Kodi 16 . 1
dominant tactical advantage over the ch asm below . War th

MIN ACTIVATIONS
MIN = -1.993

Example : Your + 1 shield keeps it s
ogen F oul kes , in Geneva , says that while
is relevant ," he said at an interactive session after delivering
ury agreed . Ren aud Lav o ie (@ ren
Th ad ik omb u D ind ig ul
The BBC 's Jonah Fisher in Yang on , also known
the victims group U OK G , said Ike
The No . 4 U . S . mobile network
13 th century Sanskrit , U y gh ur , and
means BW Editor - In - Chief " Watch Bake mon
out how the Dan ie ll cell battery is constructed .
total to 166 . The U . S ., by comparison
dr ab brick building in D ae je on , South
February 2012 report from the U . S . Treasury Department
. 5 ) D han want ri : There
save , your character now receives secret voices in his or
held as ' object ively and universally true ' under some
{ ( 8 s )} R f 8 { ( 206
, Google disclosed that the U . S . Federal Trade
Singh Dal al ( Ret d ) to stir up trouble

INTERVAL 1.031 - 2.039
CONTAINS 0.456%

current 6 . 2 - liter V 8 , which is
votes . Georgia Rep . St acey Evans ( D )
successful strategy to block and blame President Obama . Their anger
going to have the incredible excitement among 18 - to 22
Bl ume , is published on 4 June by Pic ador

INTERVAL 0.023 - 1.031
CONTAINS 46.410%

N . No news here ,
image Path ) image = im utils . res ize (
- track session at Circuit Paul Ric ard .
could pass the Republican - led House by a simple majority
is open to both consumers and beauty industry professionals . Consumers

INTERVAL -0.985 - 0.023
CONTAINS 52.424%

identified itself with . Between 1850 and 1890 , new academic
, and even discussed by director Bryan Singer during the press
and n ines . RF : After working with
According to TRA 's ' Registration Requirements for Mobile
s approach to storytelling . Bio Shock is famous

INTERVAL -1.993 - -0.985
CONTAINS 0.710%

t understand this stuff . If the universe is everything ,
least from a punk ideological standpoint ) one of the most
n wary , to say the least . Some
health care . In the U . S ., minority African
Comcast has started " im posing extra fees on Internet customers
Token screw
Token ID9580
Feature activation-4.09e-01

Pos logit contributions
aline+1.124
iculty+1.119
LOAD+1.050
\":+1.032
olo+0.992
Neg logit contributions
 Kap-0.921
idences-0.902
nings-0.902
rament-0.878
sung-0.871
Tokendrivers
Token ID36702
Feature activation-3.08e-02

Pos logit contributions
 Kap+3.863
nings+3.840
ings+3.763
idences+3.733
sung+3.702
Neg logit contributions
iculty-3.332
aline-3.211
LOAD-3.014
\":-2.973
olo-2.787
Token.
Token ID13
Feature activation+3.87e-01

Pos logit contributions
 Kap+0.272
nings+0.268
idences+0.263
ings+0.257
sung+0.257
Neg logit contributions
iculty-0.270
aline-0.265
LOAD-0.250
\":-0.243
olo-0.232
Token It
Token ID632
Feature activation+7.80e-01

Pos logit contributions
iculty+3.466
aline+3.408
LOAD+3.347
\":+3.131
Published+3.057
Neg logit contributions
nings-3.193
idences-3.144
 Kap-3.136
rament-3.018
sung-2.979
Token can
Token ID460
Feature activation+9.40e-01

Pos logit contributions
aline+6.274
iculty+6.151
LOAD+5.683
\":+5.515
olo+5.416
Neg logit contributions
 Kap-6.593
nings-6.557
rament-6.433
idences-6.420
 Cth-6.312
Token be
Token ID307
Feature activation+2.04e+00

Pos logit contributions
iculty+6.776
aline+6.639
LOAD+5.879
olo+5.702
erate+5.621
Neg logit contributions
 Kap-8.789
nings-8.501
rament-8.365
idences-8.312
 Cth-8.129
Token a
Token ID257
Feature activation+1.35e+00

Pos logit contributions
iculty+12.400
aline+11.859
 anytime+11.582
LOAD+11.400
easy+11.275
Neg logit contributions
idences-15.897
rament-15.000
nings-14.778
 Kap-14.536
pai-13.884
Token bit
Token ID1643
Feature activation+1.70e+00

Pos logit contributions
iculty+9.653
aline+8.985
LOAD+8.669
erate+8.160
olo+8.106
Neg logit contributions
idences-11.349
nings-11.092
pai-10.799
sung-10.719
 Kap-10.646
Token of
Token ID286
Feature activation+1.15e+00

Pos logit contributions
iculty+10.611
aline+10.302
olo+9.236
easy+8.838
Published+8.752
Neg logit contributions
idences-15.073
rament-14.631
 Kap-13.771
sung-13.744
nings-13.732
Token a
Token ID257
Feature activation+1.47e+00

Pos logit contributions
iculty+7.776
aline+7.362
olo+6.879
LOAD+6.656
Published+6.219
Neg logit contributions
idences-10.919
rament-10.662
pps-10.384
nings-10.331
pai-10.248
Token pain
Token ID2356
Feature activation+8.77e-01

Pos logit contributions
iculty+8.845
aline+7.979
LOAD+7.634
olo+7.331
 DOI+7.280
Neg logit contributions
idences-13.589
rament-13.158
pai-13.143
sung-13.014
nings-12.982
Token you
Token ID345
Feature activation+2.23e-02

Pos logit contributions
 Kap+3.739
nings+3.733
ings+3.669
idences+3.595
sung+3.580
Neg logit contributions
iculty-4.950
aline-4.782
LOAD-4.586
\":-4.512
Published-4.278
Token removed
Token ID4615
Feature activation-3.70e-01

Pos logit contributions
iculty+0.212
aline+0.211
LOAD+0.199
\":+0.195
olo+0.188
Neg logit contributions
 Kap-0.177
nings-0.176
idences-0.170
sung-0.168
rament-0.166
Token).
Token ID737
Feature activation+3.08e-01

Pos logit contributions
 Kap+3.080
nings+3.053
idences+2.964
ings+2.962
sung+2.936
Neg logit contributions
iculty-3.429
aline-3.312
LOAD-3.149
\":-3.104
olo-2.933
Token This
Token ID770
Feature activation+4.79e-01

Pos logit contributions
iculty+2.773
aline+2.701
LOAD+2.652
\":+2.508
Published+2.429
Neg logit contributions
nings-2.574
 Kap-2.542
idences-2.537
sung-2.422
rament-2.420
Token can
Token ID460
Feature activation+1.08e+00

Pos logit contributions
iculty+4.062
aline+4.010
LOAD+3.797
\":+3.637
olo+3.559
Neg logit contributions
nings-4.182
 Kap-4.101
idences-4.086
sung-3.936
rament-3.862
Token be
Token ID307
Feature activation+1.95e+00

Pos logit contributions
iculty+7.719
aline+7.439
LOAD+6.621
olo+6.572
erate+6.218
Neg logit contributions
 Kap-9.847
nings-9.482
rament-9.477
idences-9.298
 Cth-9.291
Token a
Token ID257
Feature activation+1.20e+00

Pos logit contributions
iculty+10.861
aline+10.284
 anytime+10.113
LOAD+9.801
easy+9.720
Neg logit contributions
idences-16.592
rament-16.027
nings-15.849
 dstg-15.092
 Kap-15.062
Token pain
Token ID2356
Feature activation+5.26e-01

Pos logit contributions
iculty+9.153
aline+8.531
LOAD+8.248
olo+7.673
 anytime+7.662
Neg logit contributions
nings-9.826
idences-9.807
pai-9.488
pps-9.346
 Kap-9.326
Token,
Token ID11
Feature activation+8.00e-01

Pos logit contributions
iculty+4.140
aline+4.091
LOAD+3.805
olo+3.630
\":+3.486
Neg logit contributions
 Kap-4.889
nings-4.866
idences-4.840
 Cth-4.632
sung-4.631
Token just
Token ID655
Feature activation+1.04e+00

Pos logit contributions
iculty+6.544
aline+6.396
LOAD+6.125
olo+5.797
easy+5.651
Neg logit contributions
nings-6.752
 Kap-6.604
idences-6.508
rament-6.362
pai-6.220
Token take
Token ID1011
Feature activation-3.58e-01

Pos logit contributions
iculty+7.519
aline+7.458
LOAD+7.296
olo+6.802
\":+6.774
Neg logit contributions
nings-8.952
 Kap-8.720
idences-8.648
rament-8.585
 dstg-8.346
Token invited
Token ID9392
Feature activation+1.24e+00

Pos logit contributions
 Kap+1.792
nings+1.777
idences+1.730
ings+1.714
sung+1.707
Neg logit contributions
iculty-2.028
aline-1.979
LOAD-1.870
\":-1.842
olo-1.748
Token O
Token ID440
Feature activation-3.56e-03

Pos logit contributions
aline+8.601
iculty+8.405
LOAD+8.258
\":+8.083
 DOI+8.069
Neg logit contributions
sung-10.424
idences-10.241
nings-10.126
pai-9.968
rament-9.945
TokenAS
Token ID1921
Feature activation+2.56e-01

Pos logit contributions
 Kap+0.030
nings+0.030
idences+0.029
ings+0.029
sung+0.028
Neg logit contributions
iculty-0.033
aline-0.032
LOAD-0.030
\":-0.030
olo-0.028
Token Secretary
Token ID4986
Feature activation+4.15e-01

Pos logit contributions
iculty+2.374
aline+2.316
LOAD+2.213
\":+2.160
olo+2.071
Neg logit contributions
nings-2.079
 Kap-2.067
idences-2.057
sung-2.012
ings-1.972
Token General
Token ID3611
Feature activation+7.87e-01

Pos logit contributions
iculty+3.281
aline+3.275
LOAD+2.983
\":+2.954
olo+2.911
Neg logit contributions
nings-3.864
idences-3.790
 Kap-3.740
sung-3.697
pai-3.606
Token Jose
Token ID5264
Feature activation+1.92e+00

Pos logit contributions
iculty+5.847
aline+5.840
LOAD+5.422
\":+5.419
olo+5.343
Neg logit contributions
idences-6.863
nings-6.813
sung-6.754
rament-6.615
 dstg-6.589
Token Miguel
Token ID29825
Feature activation+1.56e+00

Pos logit contributions
aline+13.139
olo+12.688
 Rodriguez+12.555
iculty+12.508
 Webb+11.846
Neg logit contributions
nings-13.619
idences-13.086
 dstg-12.836
 unwittingly-12.730
hetical-12.373
Token Ins
Token ID7088
Feature activation+4.51e-01

Pos logit contributions
aline+11.617
iculty+11.162
olo+10.939
 Rodriguez+10.601
 Webb+10.499
Neg logit contributions
nings-10.958
 dstg-10.950
idences-10.938
sung-10.523
 unwittingly-10.441
Tokenul
Token ID377
Feature activation+7.18e-01

Pos logit contributions
aline+3.622
iculty+3.557
olo+3.267
\":+3.111
LOAD+3.079
Neg logit contributions
 Kap-4.204
nings-4.131
idences-3.996
 Cth-3.967
 dstg-3.959
Tokenza
Token ID4496
Feature activation+8.82e-01

Pos logit contributions
aline+6.349
iculty+6.176
olo+5.841
\":+5.650
LOAD+5.589
Neg logit contributions
 Kap-5.543
idences-5.389
nings-5.381
ivable-5.192
 Cth-5.181
Token,
Token ID11
Feature activation+1.54e-01

Pos logit contributions
aline+6.557
iculty+6.341
LOAD+6.104
olo+6.064
\":+5.905
Neg logit contributions
idences-7.845
nings-7.485
sung-7.439
rament-7.309
 dstg-7.281
Token
Token ID198
Feature activation+8.06e-01

Pos logit contributions
iculty+3.067
aline+3.002
\":+2.944
LOAD+2.894
Published+2.749
Neg logit contributions
idences-3.158
nings-3.140
 Kap-3.109
rament-3.057
 Cth-3.054
TokenAmong
Token ID14311
Feature activation-8.37e-01

Pos logit contributions
LOAD+6.513
aline+6.197
iculty+6.175
Published+6.137
\":+5.908
Neg logit contributions
 Kap-7.021
idences-6.974
nings-6.853
 unwittingly-6.803
 Cth-6.765
Token those
Token ID883
Feature activation+2.81e-02

Pos logit contributions
 Kap+6.349
nings+6.314
ings+6.174
idences+6.013
sung+5.897
Neg logit contributions
iculty-7.897
aline-7.627
LOAD-7.235
\":-7.228
olo-6.749
Token arrested
Token ID5169
Feature activation+5.75e-01

Pos logit contributions
iculty+0.226
aline+0.221
LOAD+0.208
\":+0.206
olo+0.193
Neg logit contributions
 Kap-0.262
nings-0.261
idences-0.260
sung-0.254
rament-0.252
Token was
Token ID373
Feature activation+5.54e-01

Pos logit contributions
iculty+4.536
aline+4.532
LOAD+4.310
\":+4.268
olo+3.986
Neg logit contributions
 Kap-5.003
idences-5.001
sung-4.940
nings-4.922
rament-4.870
Token Jose
Token ID5264
Feature activation+1.81e+00

Pos logit contributions
iculty+4.334
aline+4.317
LOAD+4.063
\":+4.037
olo+3.860
Neg logit contributions
nings-4.944
idences-4.856
 dstg-4.762
 Kap-4.759
sung-4.755
Token Fer
Token ID12880
Feature activation+7.37e-01

Pos logit contributions
aline+12.465
olo+11.953
iculty+11.690
icio+11.304
 Rodriguez+11.272
Neg logit contributions
nings-12.970
idences-12.498
 dstg-12.360
 unwittingly-12.003
hetical-11.952
Tokenrer
Token ID11751
Feature activation+1.06e+00

Pos logit contributions
aline+6.868
iculty+6.791
LOAD+6.155
olo+6.132
\":+6.118
Neg logit contributions
 Kap-5.652
 dstg-5.424
 Cth-5.398
nings-5.397
idences-5.320
Token,
Token ID11
Feature activation+6.70e-02

Pos logit contributions
aline+7.872
iculty+7.668
olo+7.391
\":+7.149
LOAD+7.081
Neg logit contributions
idences-8.871
nings-8.795
sung-8.705
 dstg-8.496
 Cth-8.361
Token leader
Token ID3554
Feature activation+2.02e-01

Pos logit contributions
iculty+0.563
aline+0.548
LOAD+0.515
\":+0.509
olo+0.480
Neg logit contributions
nings-0.606
 Kap-0.605
idences-0.592
ings-0.584
sung-0.583
Token of
Token ID286
Feature activation+1.65e-01

Pos logit contributions
iculty+1.707
aline+1.661
LOAD+1.559
\":+1.545
olo+1.465
Neg logit contributions
 Kap-1.811
nings-1.807
idences-1.780
sung-1.751
ings-1.731
Token a
Token ID257
Feature activation+2.93e-01

Pos logit contributions
iculty+4.342
aline+4.245
\":+4.014
LOAD+3.950
 DOI+3.801
Neg logit contributions
nings-5.517
 Kap-5.349
sung-5.323
idences-5.319
rament-5.229
Token time
Token ID640
Feature activation+5.02e-01

Pos logit contributions
iculty+2.368
aline+2.305
LOAD+2.158
\":+2.115
 DOI+2.010
Neg logit contributions
nings-2.747
 Kap-2.687
idences-2.636
sung-2.633
ings-2.629
Token when
Token ID618
Feature activation+7.42e-01

Pos logit contributions
aline+4.046
iculty+3.985
\":+3.786
LOAD+3.730
 DOI+3.497
Neg logit contributions
nings-4.429
 Kap-4.422
idences-4.416
sung-4.360
rament-4.306
Token Prime
Token ID5537
Feature activation+3.82e-01

Pos logit contributions
iculty+5.755
aline+5.650
LOAD+5.267
\":+5.245
 DOI+5.135
Neg logit contributions
nings-6.458
 dstg-6.418
sung-6.388
rament-6.364
idences-6.363
Token Minister
Token ID4139
Feature activation+8.42e-01

Pos logit contributions
iculty+3.362
aline+3.318
LOAD+3.114
\":+3.082
olo+2.937
Neg logit contributions
nings-3.261
idences-3.163
 Kap-3.095
sung-3.080
rament-3.049
Token Justin
Token ID10799
Feature activation+1.79e+00

Pos logit contributions
aline+6.023
iculty+5.988
\":+5.636
LOAD+5.409
olo+5.354
Neg logit contributions
nings-7.981
rament-7.694
 dstg-7.541
ğ-7.528
gart-7.418
Token Trudeau
Token ID17524
Feature activation+5.36e-01

Pos logit contributions
aline+5.376
iculty+5.327
 Webb+4.868
olo+4.824
\":+4.620
Neg logit contributions
 dstg-19.642
gart-19.611
rament-19.472
idences-19.461
 glim-19.435
Token has
Token ID468
Feature activation+5.09e-01

Pos logit contributions
iculty+4.451
aline+4.333
\":+4.075
LOAD+4.061
olo+3.926
Neg logit contributions
idences-4.629
 Kap-4.568
nings-4.540
 dstg-4.501
sung-4.470
Token promised
Token ID8072
Feature activation+8.02e-01

Pos logit contributions
iculty+3.986
aline+3.808
\":+3.545
LOAD+3.486
olo+3.353
Neg logit contributions
idences-4.793
nings-4.653
 Kap-4.651
sung-4.550
 dstg-4.470
Token to
Token ID284
Feature activation+5.93e-01

Pos logit contributions
iculty+6.150
aline+5.914
\":+5.679
LOAD+5.493
 DOI+5.183
Neg logit contributions
idences-7.147
 Kap-7.143
sung-7.034
nings-7.015
rament-7.008
Token build
Token ID1382
Feature activation+5.23e-01

Pos logit contributions
iculty+4.787
aline+4.755
\":+4.457
 DOI+4.238
LOAD+4.234
Neg logit contributions
sung-5.131
 Kap-5.100
nings-5.050
idences-5.046
 dstg-4.888
Token
Token ID447
Feature activation+2.29e-02

Pos logit contributions
iculty+0.545
aline+0.528
LOAD+0.502
\":+0.493
olo+0.469
Neg logit contributions
 Kap-0.468
nings-0.458
idences-0.452
sung-0.444
ings-0.441
Token
Token ID247
Feature activation+3.33e-01

Pos logit contributions
iculty+0.210
aline+0.204
LOAD+0.193
\":+0.190
olo+0.181
Neg logit contributions
 Kap-0.195
nings-0.194
idences-0.188
sung-0.185
ings-0.184
Tokens
Token ID82
Feature activation+2.43e-01

Pos logit contributions
iculty+3.409
aline+3.248
LOAD+3.096
\":+3.080
olo+2.956
Neg logit contributions
 Kap-2.406
nings-2.289
idences-2.288
rament-2.201
 dstg-2.201
Token prestige
Token ID29063
Feature activation-7.80e-02

Pos logit contributions
iculty+2.004
aline+1.895
LOAD+1.814
\":+1.753
olo+1.665
Neg logit contributions
 Kap-2.246
nings-2.209
idences-2.202
sung-2.162
 dstg-2.120
Token to
Token ID284
Feature activation+8.54e-01

Pos logit contributions
 Kap+0.695
nings+0.684
idences+0.672
sung+0.662
ings+0.658
Neg logit contributions
iculty-0.680
aline-0.663
LOAD-0.621
\":-0.613
olo-0.579
Token new
Token ID649
Feature activation+1.78e+00

Pos logit contributions
aline+6.112
iculty+6.105
\":+5.850
LOAD+5.761
olo+5.388
Neg logit contributions
nings-7.576
idences-7.483
rament-7.417
ings-7.308
 Kap-7.293
Token heights
Token ID23245
Feature activation+2.14e-01

Pos logit contributions
iculty+9.072
aline+8.633
 heights+7.970
LOAD+7.831
\":+7.667
Neg logit contributions
nings-16.032
idences-15.915
rament-15.903
sung-15.796
ivable-15.501
Token.
Token ID13
Feature activation-1.31e-01

Pos logit contributions
iculty+1.749
aline+1.702
LOAD+1.580
\":+1.568
olo+1.483
Neg logit contributions
 Kap-1.994
nings-1.933
idences-1.920
sung-1.905
rament-1.892
Token
Token ID198
Feature activation-1.08e-01

Pos logit contributions
 Kap+1.011
nings+1.010
idences+0.984
ings+0.968
sung+0.967
Neg logit contributions
iculty-1.284
aline-1.254
LOAD-1.194
\":-1.179
olo-1.116
Token
Token ID198
Feature activation-4.88e-02

Pos logit contributions
 Kap+0.815
nings+0.809
idences+0.787
ings+0.773
sung+0.773
Neg logit contributions
iculty-1.092
aline-1.067
LOAD-1.015
\":-1.004
olo-0.953
TokenAnd
Token ID1870
Feature activation-2.60e-01

Pos logit contributions
 Kap+0.403
nings+0.399
idences+0.391
ings+0.382
rament+0.379
Neg logit contributions
iculty-0.455
aline-0.444
LOAD-0.428
\":-0.418
Published-0.398
Token which
Token ID543
Feature activation+2.22e-01

Pos logit contributions
 Kap+2.367
nings+2.348
idences+2.288
ings+2.274
sung+2.260
Neg logit contributions
iculty-2.223
aline-2.162
LOAD-2.032
\":-1.996
olo-1.883
Token claim
Token ID1624
Feature activation-1.15e-02

Pos logit contributions
iculty+1.967
aline+1.924
LOAD+1.830
\":+1.793
olo+1.702
Neg logit contributions
nings-1.875
 Kap-1.840
sung-1.830
idences-1.809
ings-1.774
Token to
Token ID284
Feature activation+5.99e-01

Pos logit contributions
nings+0.104
 Kap+0.104
idences+0.101
sung+0.100
ings+0.100
Neg logit contributions
iculty-0.098
aline-0.096
LOAD-0.090
\":-0.088
olo-0.084
Token run
Token ID1057
Feature activation+2.31e-01

Pos logit contributions
iculty+4.173
aline+4.061
LOAD+3.792
\":+3.757
olo+3.595
Neg logit contributions
 Kap-5.775
nings-5.766
sung-5.709
idences-5.619
rament-5.589
Token universally
Token ID26208
Feature activation+3.45e-01

Pos logit contributions
iculty+1.790
aline+1.788
LOAD+1.703
\":+1.637
olo+1.557
Neg logit contributions
nings-2.162
idences-2.135
 Kap-2.123
sung-2.103
rament-2.058
Token across
Token ID1973
Feature activation+1.78e+00

Pos logit contributions
iculty+2.554
aline+2.533
LOAD+2.440
\":+2.391
Published+2.247
Neg logit contributions
nings-3.314
idences-3.272
 Kap-3.243
rament-3.179
sung-3.174
Token Linux
Token ID7020
Feature activation+4.64e-01

Pos logit contributions
LOAD+9.455
aline+9.370
 anytime+9.039
iculty+8.948
 DOS+8.894
Neg logit contributions
idences-15.376
nings-15.255
sung-15.109
rament-14.857
pai-14.548
Token distributions
Token ID24570
Feature activation+1.36e-01

Pos logit contributions
iculty+3.030
aline+2.968
LOAD+2.832
\":+2.747
Published+2.611
Neg logit contributions
 Kap-4.823
nings-4.779
idences-4.747
sung-4.697
rament-4.688
Token,
Token ID11
Feature activation-7.99e-02

Pos logit contributions
iculty+1.101
aline+1.079
LOAD+1.016
\":+0.998
olo+0.943
Neg logit contributions
 Kap-1.269
nings-1.259
idences-1.239
sung-1.222
rament-1.207
Token bund
Token ID12207
Feature activation-6.94e-03

Pos logit contributions
nings+0.662
 Kap+0.660
idences+0.643
sung+0.636
ings+0.634
Neg logit contributions
iculty-0.737
aline-0.723
LOAD-0.685
\":-0.674
olo-0.637
Tokenling
Token ID1359
Feature activation+1.21e-01

Pos logit contributions
 Kap+0.030
nings+0.029
idences+0.028
sung+0.027
ings+0.026
Neg logit contributions
iculty-0.092
aline-0.091
LOAD-0.088
\":-0.086
olo-0.084
Token being
Token ID852
Feature activation+2.00e-01

Pos logit contributions
iculty+2.289
aline+2.259
LOAD+2.154
\":+2.112
olo+2.006
Neg logit contributions
nings-2.289
 Kap-2.276
idences-2.241
sung-2.236
 dstg-2.200
Token awakened
Token ID39217
Feature activation-1.95e-01

Pos logit contributions
iculty+1.661
aline+1.630
LOAD+1.555
\":+1.525
olo+1.444
Neg logit contributions
nings-1.799
idences-1.774
 Kap-1.768
sung-1.734
ings-1.709
Token at
Token ID379
Feature activation-1.27e-01

Pos logit contributions
 Kap+1.691
nings+1.677
idences+1.639
ings+1.616
sung+1.615
Neg logit contributions
iculty-1.736
aline-1.694
LOAD-1.596
\":-1.569
olo-1.486
Token 6
Token ID718
Feature activation+9.69e-01

Pos logit contributions
 Kap+1.158
nings+1.154
idences+1.127
sung+1.117
ings+1.113
Neg logit contributions
iculty-1.077
aline-1.045
LOAD-0.982
\":-0.970
olo-0.912
Token:
Token ID25
Feature activation-1.52e-01

Pos logit contributions
aline+6.279
iculty+6.198
\":+6.014
LOAD+5.853
 DOI+5.353
Neg logit contributions
idences-9.354
 Kap-9.208
nings-9.106
sung-9.088
rament-9.019
Token30
Token ID1270
Feature activation+1.77e+00

Pos logit contributions
 Kap+1.423
nings+1.415
idences+1.383
sung+1.364
ings+1.363
Neg logit contributions
iculty-1.241
aline-1.208
LOAD-1.134
\":-1.118
olo-1.049
Token a
Token ID257
Feature activation+1.93e-01

Pos logit contributions
aline+11.099
iculty+10.749
\":+10.442
LOAD+10.359
 DOI+10.021
Neg logit contributions
idences-13.889
sung-13.588
nings-13.210
 Kap-13.201
 dstg-13.194
Token.
Token ID13
Feature activation+3.59e-01

Pos logit contributions
iculty+1.550
aline+1.505
LOAD+1.430
\":+1.424
olo+1.318
Neg logit contributions
 Kap-1.804
nings-1.798
idences-1.764
ings-1.730
sung-1.729
Tokenm
Token ID76
Feature activation+7.31e-01

Pos logit contributions
iculty+2.781
aline+2.735
\":+2.638
LOAD+2.561
olo+2.425
Neg logit contributions
 Kap-3.394
idences-3.324
nings-3.298
 Cth-3.278
sung-3.211
Token.
Token ID13
Feature activation+7.01e-01

Pos logit contributions
iculty+5.198
aline+5.057
LOAD+4.658
\":+4.590
olo+4.378
Neg logit contributions
 Kap-7.111
nings-6.997
idences-6.912
sung-6.881
rament-6.716
Token by
Token ID416
Feature activation-5.47e-01

Pos logit contributions
iculty+5.222
aline+5.196
LOAD+4.973
\":+4.864
olo+4.542
Neg logit contributions
idences-6.247
nings-6.217
 Kap-6.212
sung-6.171
 dstg-6.032
Token35
Token ID2327
Feature activation+5.00e-01

Pos logit contributions
iculty+0.631
aline+0.619
LOAD+0.590
\":+0.579
Published+0.542
Neg logit contributions
 Kap-0.680
nings-0.678
idences-0.670
sung-0.649
ings-0.647
Tokenz
Token ID89
Feature activation+8.18e-02

Pos logit contributions
iculty+4.006
aline+3.971
LOAD+3.704
\":+3.568
olo+3.475
Neg logit contributions
 Kap-4.527
nings-4.514
idences-4.470
ings-4.368
sung-4.367
Token 1
Token ID352
Feature activation+2.26e-01

Pos logit contributions
iculty+0.704
aline+0.689
LOAD+0.657
\":+0.638
olo+0.603
Neg logit contributions
nings-0.727
 Kap-0.720
idences-0.710
sung-0.698
ings-0.696
Token GER
Token ID44186
Feature activation+9.18e-01

Pos logit contributions
iculty+1.933
aline+1.895
LOAD+1.816
\":+1.762
olo+1.669
Neg logit contributions
nings-1.995
 Kap-1.968
idences-1.954
sung-1.915
ings-1.905
TokenON
Token ID1340
Feature activation+8.63e-01

Pos logit contributions
iculty+5.993
LOAD+5.720
aline+5.686
\":+5.191
OND+5.129
Neg logit contributions
idences-9.327
nings-9.242
 Kap-9.100
sung-8.942
ings-8.825
TokenIM
Token ID3955
Feature activation+1.74e+00

Pos logit contributions
iculty+4.877
LOAD+4.874
aline+4.686
\":+4.364
Published+4.240
Neg logit contributions
idences-9.394
 Kap-9.344
nings-9.344
 unwittingly-9.062
pps-9.054
TokenO
Token ID46
Feature activation+6.12e-01

Pos logit contributions
LOAD+5.466
OE+4.317
NOR+4.300
OND+4.291
\":+4.273
Neg logit contributions
 glim-21.852
raints-21.667
idences-21.554
sung-21.277
nings-20.781
Token T
Token ID309
Feature activation-5.67e-01

Pos logit contributions
iculty+5.212
aline+5.087
LOAD+5.015
\":+4.756
 DOI+4.576
Neg logit contributions
nings-5.170
idences-5.057
sung-5.030
 Kap-4.990
rament-4.964
TokenOC
Token ID4503
Feature activation+6.68e-01

Pos logit contributions
 Kap+4.651
nings+4.595
ings+4.529
idences+4.424
sung+4.374
Neg logit contributions
iculty-5.128
aline-5.065
\":-4.570
LOAD-4.493
olo-4.310
Token IN
Token ID3268
Feature activation+1.67e-01

Pos logit contributions
iculty+4.946
LOAD+4.804
aline+4.760
\":+4.456
 DOI+4.338
Neg logit contributions
nings-6.273
idences-6.214
sung-6.142
rament-6.099
 Kap-6.088
TokenIT
Token ID2043
Feature activation+4.40e-02

Pos logit contributions
iculty+1.477
aline+1.431
LOAD+1.394
\":+1.334
olo+1.270
Neg logit contributions
nings-1.448
 Kap-1.434
idences-1.416
sung-1.401
ings-1.389
Token Children
Token ID8990
Feature activation+3.72e-01

Pos logit contributions
iculty+4.580
aline+4.428
LOAD+4.269
\":+4.201
Published+4.094
Neg logit contributions
nings-4.782
idences-4.742
rament-4.726
sung-4.657
 Kap-4.537
Token learn
Token ID2193
Feature activation+1.93e-01

Pos logit contributions
iculty+3.516
aline+3.427
\":+3.203
LOAD+3.166
olo+3.119
Neg logit contributions
 Kap-2.876
rament-2.800
idences-2.789
nings-2.771
 dstg-2.752
Token quickly
Token ID2952
Feature activation+2.69e-01

Pos logit contributions
iculty+1.660
aline+1.605
LOAD+1.514
\":+1.476
olo+1.403
Neg logit contributions
 Kap-1.704
nings-1.692
idences-1.685
sung-1.648
rament-1.634
Token how
Token ID703
Feature activation+8.53e-01

Pos logit contributions
iculty+2.226
aline+2.160
LOAD+2.021
\":+1.977
olo+1.896
Neg logit contributions
 Kap-2.451
idences-2.413
nings-2.401
sung-2.391
rament-2.387
Token to
Token ID284
Feature activation+3.82e-01

Pos logit contributions
iculty+6.290
aline+6.046
LOAD+5.657
\":+5.614
olo+5.187
Neg logit contributions
idences-7.877
rament-7.819
 Kap-7.818
nings-7.712
sung-7.681
Token try
Token ID1949
Feature activation+1.73e+00

Pos logit contributions
iculty+2.918
aline+2.785
LOAD+2.518
\":+2.487
olo+2.434
Neg logit contributions
 Kap-3.733
nings-3.696
rament-3.645
idences-3.627
sung-3.597
Token to
Token ID284
Feature activation+1.05e+00

Pos logit contributions
iculty+10.757
aline+10.452
LOAD+9.788
\":+9.175
 anytime+9.027
Neg logit contributions
idences-14.646
rament-14.632
 Kap-14.528
nings-14.401
 dstg-14.274
Token fit
Token ID4197
Feature activation+2.74e-01

Pos logit contributions
iculty+7.651
aline+7.521
olo+6.663
\":+6.601
 DOI+6.548
Neg logit contributions
 Kap-9.284
rament-9.272
nings-9.151
 dstg-8.995
idences-8.938
Token in
Token ID287
Feature activation+7.16e-01

Pos logit contributions
iculty+2.437
aline+2.420
LOAD+2.273
\":+2.224
olo+2.122
Neg logit contributions
 Kap-2.308
nings-2.267
idences-2.265
sung-2.223
rament-2.183
Token.
Token ID13
Feature activation+6.11e-01

Pos logit contributions
aline+5.211
iculty+5.161
\":+4.654
LOAD+4.654
 anytime+4.493
Neg logit contributions
 Kap-6.711
idences-6.647
sung-6.601
rament-6.562
nings-6.446
Token However
Token ID2102
Feature activation+5.31e-01

Pos logit contributions
iculty+5.007
aline+4.812
LOAD+4.629
\":+4.544
Published+4.428
Neg logit contributions
idences-5.241
nings-5.235
rament-5.227
sung-5.140
 Kap-5.031
Token with
Token ID351
Feature activation-5.75e-01

Pos logit contributions
 Kap+0.766
nings+0.758
idences+0.745
ings+0.733
sung+0.731
Neg logit contributions
iculty-0.783
aline-0.765
LOAD-0.725
\":-0.710
olo-0.674
Token honorary
Token ID44358
Feature activation-5.92e-01

Pos logit contributions
 Kap+5.011
nings+4.881
ings+4.733
sung+4.720
idences+4.695
Neg logit contributions
iculty-5.176
aline-4.962
LOAD-4.660
\":-4.564
olo-4.333
Token tattoo
Token ID18785
Feature activation-5.83e-01

Pos logit contributions
 Kap+5.041
nings+4.924
idences+4.803
ings+4.765
sung+4.737
Neg logit contributions
iculty-5.430
aline-5.257
LOAD-4.963
\":-4.863
olo-4.606
Token
Token ID198
Feature activation-3.30e-01

Pos logit contributions
 Kap+4.343
nings+4.302
ings+4.228
sung+4.078
idences+4.016
Neg logit contributions
iculty-5.869
aline-5.723
LOAD-5.430
\":-5.344
Published-5.006
Token
Token ID198
Feature activation-1.55e-01

Pos logit contributions
 Kap+2.321
nings+2.285
ings+2.217
idences+2.185
sung+2.176
Neg logit contributions
iculty-3.504
aline-3.429
LOAD-3.247
\":-3.178
olo-3.060
TokenCh
Token ID1925
Feature activation+1.71e+00

Pos logit contributions
 Kap+1.324
nings+1.312
idences+1.282
ings+1.265
sung+1.258
Neg logit contributions
iculty-1.410
aline-1.376
LOAD-1.306
\":-1.281
olo-1.213
Tokenad
Token ID324
Feature activation+3.78e-01

Pos logit contributions
aline+13.245
iculty+12.489
olo+12.345
Published+10.688
icio+10.685
Neg logit contributions
nings-13.168
 Cth-12.924
 unwittingly-12.080
 Kap-11.884
 dstg-11.569
Token "
Token ID366
Feature activation-1.79e-01

Pos logit contributions
aline+3.323
iculty+3.276
\":+3.033
LOAD+3.011
olo+2.972
Neg logit contributions
nings-3.183
idences-3.098
sung-3.026
 Kap-3.013
 dstg-2.991
TokenO
Token ID46
Feature activation+2.58e-02

Pos logit contributions
 Kap+1.501
nings+1.489
idences+1.448
ings+1.436
sung+1.428
Neg logit contributions
iculty-1.642
aline-1.601
LOAD-1.513
\":-1.488
olo-1.411
Tokench
Token ID354
Feature activation+1.35e-01

Pos logit contributions
iculty+0.243
aline+0.239
LOAD+0.226
\":+0.222
olo+0.213
Neg logit contributions
 Kap-0.209
nings-0.208
idences-0.204
ings-0.200
sung-0.197
Tokenoc
Token ID420
Feature activation+1.69e-01

Pos logit contributions
aline+1.230
iculty+1.215
\":+1.105
LOAD+1.101
olo+1.085
Neg logit contributions
nings-1.136
 Kap-1.114
idences-1.092
sung-1.080
rament-1.061
Token been
Token ID587
Feature activation+2.96e-01

Pos logit contributions
iculty+1.970
aline+1.903
LOAD+1.806
\":+1.764
Published+1.675
Neg logit contributions
idences-2.132
nings-2.121
 Kap-2.120
sung-2.064
rament-2.047
Token hard
Token ID1327
Feature activation+3.58e-01

Pos logit contributions
iculty+2.295
aline+2.214
LOAD+2.086
\":+2.081
Published+1.967
Neg logit contributions
idences-2.787
nings-2.777
 Kap-2.759
sung-2.722
rament-2.699
Token to
Token ID284
Feature activation+6.12e-01

Pos logit contributions
iculty+2.967
aline+2.902
LOAD+2.692
\":+2.669
olo+2.544
Neg logit contributions
 Kap-3.246
idences-3.189
nings-3.174
sung-3.116
rament-3.075
Token reach
Token ID3151
Feature activation+6.65e-01

Pos logit contributions
iculty+4.603
aline+4.535
\":+4.118
 DOI+3.957
olo+3.920
Neg logit contributions
 Kap-5.689
nings-5.684
sung-5.533
idences-5.520
rament-5.462
Token the
Token ID262
Feature activation+3.42e-01

Pos logit contributions
aline+4.644
iculty+4.615
\":+4.265
LOAD+4.138
 DOI+4.017
Neg logit contributions
nings-6.401
sung-6.348
 Kap-6.328
pai-6.249
idences-6.246
Token public
Token ID1171
Feature activation+1.71e+00

Pos logit contributions
iculty+2.812
aline+2.795
\":+2.603
LOAD+2.543
 DOI+2.452
Neg logit contributions
nings-3.067
 Kap-3.028
sung-2.988
idences-2.975
pai-2.939
Token with
Token ID351
Feature activation+6.06e-01

Pos logit contributions
aline+12.362
iculty+10.937
 anytime+10.875
\":+10.769
 DOI+10.401
Neg logit contributions
nings-12.563
 dstg-12.539
sung-12.495
gart-12.488
pai-12.437
Token the
Token ID262
Feature activation+6.01e-01

Pos logit contributions
aline+4.862
iculty+4.777
\":+4.510
LOAD+4.377
 DOI+4.170
Neg logit contributions
nings-5.318
 Kap-5.267
idences-5.201
sung-5.188
pai-5.144
Token information
Token ID1321
Feature activation+2.10e-01

Pos logit contributions
aline+4.892
iculty+4.844
\":+4.525
LOAD+4.409
 DOI+4.313
Neg logit contributions
nings-5.275
sung-5.112
 Kap-5.100
idences-5.071
pai-5.053
Token.
Token ID13
Feature activation+3.41e-01

Pos logit contributions
aline+1.651
iculty+1.649
LOAD+1.530
\":+1.523
Published+1.436
Neg logit contributions
nings-1.939
 Kap-1.925
idences-1.909
sung-1.895
rament-1.854
Token Despite
Token ID7945
Feature activation-7.28e-02

Pos logit contributions
aline+2.922
iculty+2.922
LOAD+2.756
\":+2.719
 DOI+2.620
Neg logit contributions
nings-2.882
idences-2.831
sung-2.797
 Kap-2.783
rament-2.771
Token planning
Token ID5410
Feature activation+4.12e-01

Pos logit contributions
iculty+2.145
aline+2.103
LOAD+1.992
\":+1.980
olo+1.866
Neg logit contributions
idences-2.320
nings-2.307
 Kap-2.283
sung-2.269
rament-2.224
Token to
Token ID284
Feature activation+5.50e-01

Pos logit contributions
iculty+3.315
aline+3.204
LOAD+3.053
\":+3.004
olo+2.766
Neg logit contributions
idences-3.773
 Kap-3.759
sung-3.728
nings-3.722
rament-3.624
Token get
Token ID651
Feature activation+1.05e+00

Pos logit contributions
iculty+3.797
aline+3.658
\":+3.414
LOAD+3.362
olo+3.184
Neg logit contributions
idences-5.438
 Kap-5.400
sung-5.379
nings-5.345
rament-5.268
Token Racing
Token ID14954
Feature activation+4.76e-01

Pos logit contributions
iculty+7.448
aline+7.332
LOAD+6.873
\":+6.780
Published+6.503
Neg logit contributions
idences-9.073
sung-8.833
nings-8.809
 dstg-8.745
rament-8.642
Token Apex
Token ID49440
Feature activation+1.21e+00

Pos logit contributions
iculty+4.758
aline+4.684
LOAD+4.428
\":+4.372
Published+4.169
Neg logit contributions
idences-3.345
 Kap-3.339
sung-3.325
nings-3.323
ings-3.173
Token on
Token ID319
Feature activation+1.71e+00

Pos logit contributions
aline+8.373
iculty+8.215
LOAD+7.950
 DOS+7.758
 anytime+7.581
Neg logit contributions
idences-10.318
nings-9.848
sung-9.711
 Kap-9.687
rament-9.647
Token to
Token ID284
Feature activation+1.51e+00

Pos logit contributions
aline+10.393
 DOS+10.270
LOAD+9.860
iculty+9.634
\":+9.555
Neg logit contributions
idences-14.025
rament-13.623
sung-13.543
nings-13.329
ğ-13.177
Token consoles
Token ID22629
Feature activation+6.03e-01

Pos logit contributions
aline+9.927
LOAD+9.419
iculty+9.362
 DOS+9.333
\":+9.044
Neg logit contributions
idences-12.278
rament-11.994
sung-11.971
 dstg-11.793
nings-11.688
Token in
Token ID287
Feature activation+4.98e-01

Pos logit contributions
iculty+4.318
aline+4.254
LOAD+4.036
\":+3.972
 anytime+3.758
Neg logit contributions
nings-5.684
 Kap-5.673
idences-5.667
sung-5.562
 dstg-5.483
Token the
Token ID262
Feature activation+5.67e-01

Pos logit contributions
iculty+3.624
aline+3.547
\":+3.394
LOAD+3.341
Published+3.112
Neg logit contributions
nings-4.697
idences-4.650
 Kap-4.593
sung-4.581
 dstg-4.569
Token near
Token ID1474
Feature activation-3.68e-01

Pos logit contributions
iculty+4.104
aline+4.070
LOAD+3.877
\":+3.847
Published+3.589
Neg logit contributions
nings-5.333
idences-5.275
 Kap-5.208
sung-5.138
rament-5.116
Token.
Token ID13
Feature activation+3.87e-01

Pos logit contributions
 Kap+0.272
nings+0.268
idences+0.263
ings+0.257
sung+0.257
Neg logit contributions
iculty-0.270
aline-0.265
LOAD-0.250
\":-0.243
olo-0.232
Token It
Token ID632
Feature activation+7.80e-01

Pos logit contributions
iculty+3.466
aline+3.408
LOAD+3.347
\":+3.131
Published+3.057
Neg logit contributions
nings-3.193
idences-3.144
 Kap-3.136
rament-3.018
sung-2.979
Token can
Token ID460
Feature activation+9.40e-01

Pos logit contributions
aline+6.274
iculty+6.151
LOAD+5.683
\":+5.515
olo+5.416
Neg logit contributions
 Kap-6.593
nings-6.557
rament-6.433
idences-6.420
 Cth-6.312
Token be
Token ID307
Feature activation+2.04e+00

Pos logit contributions
iculty+6.776
aline+6.639
LOAD+5.879
olo+5.702
erate+5.621
Neg logit contributions
 Kap-8.789
nings-8.501
rament-8.365
idences-8.312
 Cth-8.129
Token a
Token ID257
Feature activation+1.35e+00

Pos logit contributions
iculty+12.400
aline+11.859
 anytime+11.582
LOAD+11.400
easy+11.275
Neg logit contributions
idences-15.897
rament-15.000
nings-14.778
 Kap-14.536
pai-13.884
Token bit
Token ID1643
Feature activation+1.70e+00

Pos logit contributions
iculty+9.653
aline+8.985
LOAD+8.669
erate+8.160
olo+8.106
Neg logit contributions
idences-11.349
nings-11.092
pai-10.799
sung-10.719
 Kap-10.646
Token of
Token ID286
Feature activation+1.15e+00

Pos logit contributions
iculty+10.611
aline+10.302
olo+9.236
easy+8.838
Published+8.752
Neg logit contributions
idences-15.073
rament-14.631
 Kap-13.771
sung-13.744
nings-13.732
Token a
Token ID257
Feature activation+1.47e+00

Pos logit contributions
iculty+7.776
aline+7.362
olo+6.879
LOAD+6.656
Published+6.219
Neg logit contributions
idences-10.919
rament-10.662
pps-10.384
nings-10.331
pai-10.248
Token pain
Token ID2356
Feature activation+8.77e-01

Pos logit contributions
iculty+8.845
aline+7.979
LOAD+7.634
olo+7.331
 DOI+7.280
Neg logit contributions
idences-13.589
rament-13.158
pai-13.143
sung-13.014
nings-12.982
Token,
Token ID11
Feature activation+6.86e-01

Pos logit contributions
iculty+6.609
aline+6.431
LOAD+6.109
olo+5.944
easy+5.429
Neg logit contributions
 Kap-7.852
idences-7.839
nings-7.780
rament-7.460
 Cth-7.430
Token but
Token ID475
Feature activation+5.31e-01

Pos logit contributions
iculty+5.600
aline+5.500
LOAD+5.259
olo+4.987
\":+4.832
Neg logit contributions
nings-5.881
 Kap-5.815
idences-5.712
rament-5.552
 Cth-5.462
Token giving
Token ID3501
Feature activation+2.45e-01

Pos logit contributions
iculty+2.427
aline+2.373
LOAD+2.214
\":+2.190
olo+2.045
Neg logit contributions
 Kap-2.721
idences-2.714
nings-2.694
sung-2.658
rament-2.624
Token Body
Token ID12290
Feature activation+5.05e-01

Pos logit contributions
iculty+2.122
aline+2.072
LOAD+1.914
\":+1.902
olo+1.801
Neg logit contributions
nings-2.181
idences-2.151
 Kap-2.135
sung-2.084
 dstg-2.055
Tokenconscious
Token ID16796
Feature activation+7.53e-01

Pos logit contributions
aline+4.528
iculty+4.495
\":+4.191
LOAD+4.176
olo+4.102
Neg logit contributions
nings-4.047
idences-3.963
 Kap-3.918
sung-3.895
pai-3.776
Token a
Token ID257
Feature activation+1.18e+00

Pos logit contributions
iculty+6.445
aline+6.398
\":+5.990
LOAD+5.972
olo+5.801
Neg logit contributions
idences-6.072
nings-5.972
 Kap-5.858
sung-5.727
rament-5.653
Token low
Token ID1877
Feature activation+4.10e-01

Pos logit contributions
iculty+8.471
aline+8.227
LOAD+7.794
\":+7.736
 DOI+7.733
Neg logit contributions
nings-10.116
idences-9.998
 dstg-9.937
sung-9.537
ings-9.445
Token grade
Token ID9559
Feature activation+1.69e+00

Pos logit contributions
iculty+3.161
aline+2.996
LOAD+2.834
\":+2.768
erate+2.629
Neg logit contributions
idences-3.950
nings-3.946
 Kap-3.883
 dstg-3.800
rament-3.793
Token is
Token ID318
Feature activation-7.62e-02

Pos logit contributions
iculty+10.828
aline+9.779
\":+9.220
 anytime+9.106
erate+9.062
Neg logit contributions
idences-14.749
sung-14.150
 dstg-13.704
nings-13.493
 Kap-13.487
Token wondering
Token ID11263
Feature activation-3.57e-01

Pos logit contributions
 Kap+0.668
nings+0.666
idences+0.651
sung+0.640
ings+0.640
Neg logit contributions
iculty-0.674
aline-0.653
LOAD-0.616
\":-0.606
olo-0.572
Token who
Token ID508
Feature activation+9.51e-02

Pos logit contributions
 Kap+2.889
nings+2.864
ings+2.773
idences+2.767
sung+2.736
Neg logit contributions
iculty-3.386
aline-3.299
LOAD-3.127
\":-3.072
olo-2.918
Token it
Token ID340
Feature activation+4.38e-02

Pos logit contributions
iculty+0.869
aline+0.853
LOAD+0.805
\":+0.804
olo+0.752
Neg logit contributions
nings-0.775
 Kap-0.770
idences-0.770
sung-0.754
ings-0.743
Token could
Token ID714
Feature activation+7.22e-01

Pos logit contributions
iculty+0.414
aline+0.408
\":+0.382
LOAD+0.378
olo+0.362
Neg logit contributions
 Kap-0.352
nings-0.348
idences-0.346
sung-0.336
rament-0.333
Token is
Token ID318
Feature activation+5.01e-01

Pos logit contributions
aline+1.838
iculty+1.828
\":+1.703
LOAD+1.685
olo+1.612
Neg logit contributions
 Kap-1.976
idences-1.973
nings-1.961
sung-1.931
rament-1.903
Token planned
Token ID6027
Feature activation+8.21e-02

Pos logit contributions
iculty+3.988
aline+3.975
\":+3.709
LOAD+3.644
Published+3.533
Neg logit contributions
idences-4.458
nings-4.405
sung-4.390
 Kap-4.386
rament-4.285
Token for
Token ID329
Feature activation+8.67e-01

Pos logit contributions
iculty+0.715
aline+0.702
LOAD+0.660
\":+0.651
olo+0.613
Neg logit contributions
 Kap-0.722
nings-0.714
idences-0.704
sung-0.695
ings-0.688
Token a
Token ID257
Feature activation+1.07e+00

Pos logit contributions
aline+6.289
iculty+6.192
\":+5.970
LOAD+5.779
 DOI+5.615
Neg logit contributions
sung-7.714
 dstg-7.683
nings-7.660
idences-7.650
rament-7.578
Token release
Token ID2650
Feature activation+7.52e-01

Pos logit contributions
aline+7.618
iculty+7.582
\":+7.377
LOAD+7.045
 DOI+6.934
Neg logit contributions
 dstg-9.143
nings-9.103
 Kap-9.086
idences-9.069
sung-9.066
Token on
Token ID319
Feature activation+1.69e+00

Pos logit contributions
aline+4.831
iculty+4.773
\":+4.522
LOAD+4.471
 anytime+4.304
Neg logit contributions
idences-7.508
nings-7.296
sung-7.264
 Kap-7.190
 dstg-7.132
Token the
Token ID262
Feature activation+1.33e+00

Pos logit contributions
aline+8.647
 DOI+8.392
\":+8.120
iculty+8.068
 anytime+7.979
Neg logit contributions
idences-15.510
rament-15.492
 dstg-15.467
nings-15.085
ğ-15.033
Token Xbox
Token ID9445
Feature activation+8.84e-01

Pos logit contributions
aline+8.388
iculty+8.064
\":+7.661
 DOI+7.593
Published+7.525
Neg logit contributions
 dstg-12.119
idences-11.778
rament-11.626
nings-11.585
ğ-11.446
Token 360
Token ID11470
Feature activation+9.60e-01

Pos logit contributions
aline+6.293
iculty+6.219
\":+5.793
LOAD+5.786
olo+5.722
Neg logit contributions
nings-8.001
 Kap-7.929
 dstg-7.849
idences-7.832
sung-7.640
Token and
Token ID290
Feature activation+3.74e-01

Pos logit contributions
iculty+5.967
aline+5.777
LOAD+5.603
\":+5.548
Published+5.271
Neg logit contributions
idences-9.277
 Kap-9.219
nings-9.191
 dstg-9.115
sung-9.092
Token Microsoft
Token ID5413
Feature activation+2.59e-01

Pos logit contributions
aline+2.694
iculty+2.692
LOAD+2.496
\":+2.437
Published+2.306
Neg logit contributions
 Kap-3.714
idences-3.653
nings-3.595
sung-3.586
rament-3.566
Token applicable
Token ID9723
Feature activation+5.08e-01

Pos logit contributions
iculty+5.368
aline+5.131
LOAD+5.046
\":+4.848
olo+4.804
Neg logit contributions
nings-6.142
 Kap-6.057
idences-6.024
rament-6.008
sung-5.922
Token to
Token ID284
Feature activation+1.11e+00

Pos logit contributions
iculty+4.045
aline+3.952
LOAD+3.778
\":+3.631
olo+3.471
Neg logit contributions
 Kap-4.598
nings-4.575
idences-4.556
sung-4.484
rament-4.446
Token this
Token ID428
Feature activation+4.38e-01

Pos logit contributions
iculty+8.249
aline+7.841
LOAD+7.675
\":+7.349
 DOI+7.093
Neg logit contributions
nings-9.402
sung-9.217
idences-9.126
 dstg-9.061
pai-9.002
Token article
Token ID2708
Feature activation+5.32e-01

Pos logit contributions
iculty+3.409
aline+3.253
LOAD+3.104
\":+3.001
Published+2.939
Neg logit contributions
nings-4.130
idences-4.077
 Kap-4.069
sung-4.008
ings-3.962
Token can
Token ID460
Feature activation+1.43e+00

Pos logit contributions
iculty+4.269
aline+4.135
LOAD+3.985
\":+3.850
Published+3.703
Neg logit contributions
 Kap-4.713
nings-4.706
idences-4.631
sung-4.563
rament-4.537
Token be
Token ID307
Feature activation+1.69e+00

Pos logit contributions
iculty+8.335
aline+8.140
LOAD+7.742
Published+7.256
 DOI+7.174
Neg logit contributions
 Kap-13.139
nings-12.990
rament-12.964
 Cth-12.957
hetical-12.705
Token found
Token ID1043
Feature activation+7.49e-01

Pos logit contributions
iculty+8.490
aline+8.031
LOAD+7.843
 DOI+7.556
Published+7.267
Neg logit contributions
idences-15.889
nings-15.470
 Cth-15.428
rament-15.284
 Kap-15.240
Token here
Token ID994
Feature activation+1.15e-01

Pos logit contributions
iculty+5.278
aline+5.080
LOAD+5.060
\":+4.725
olo+4.533
Neg logit contributions
idences-7.138
nings-6.996
 Kap-6.969
rament-6.879
sung-6.833
Token.
Token ID13
Feature activation+4.92e-01

Pos logit contributions
iculty+0.964
aline+0.930
LOAD+0.892
\":+0.862
olo+0.821
Neg logit contributions
 Kap-1.053
nings-1.049
idences-1.033
sung-1.012
ings-1.008
Token Unfortunately
Token ID8989
Feature activation+1.10e+00

Pos logit contributions
iculty+4.224
aline+4.073
LOAD+3.954
\":+3.775
Published+3.742
Neg logit contributions
nings-4.163
idences-4.136
 Kap-4.016
rament-4.012
sung-3.980
Token it
Token ID340
Feature activation+9.85e-01

Pos logit contributions
iculty+7.583
aline+7.197
LOAD+6.962
Published+6.457
olo+6.296
Neg logit contributions
nings-10.068
sung-9.936
idences-9.677
 Kap-9.659
pai-9.551
Token-
Token ID12
Feature activation+5.30e-01

Pos logit contributions
iculty+6.356
aline+6.316
\":+5.788
LOAD+5.653
olo+5.328
Neg logit contributions
idences-9.676
nings-9.605
 Kap-9.588
 dstg-9.540
sung-9.331
Tokena
Token ID64
Feature activation+5.04e-01

Pos logit contributions
iculty+2.861
aline+2.727
LOAD+2.471
olo+2.256
\":+2.238
Neg logit contributions
 Kap-6.240
idences-6.139
nings-6.059
 Cth-5.994
 dstg-5.990
Token-
Token ID12
Feature activation+3.28e-01

Pos logit contributions
iculty+3.813
aline+3.673
\":+3.444
LOAD+3.418
Published+3.209
Neg logit contributions
 Kap-4.801
nings-4.799
idences-4.711
sung-4.609
 dstg-4.569
Tokenhalf
Token ID13959
Feature activation+1.39e+00

Pos logit contributions
iculty+3.043
aline+2.949
LOAD+2.823
\":+2.723
Published+2.647
Neg logit contributions
 Kap-2.725
nings-2.608
idences-2.595
 Cth-2.555
ings-2.512
Token-
Token ID12
Feature activation+1.39e+00

Pos logit contributions
iculty+9.552
aline+9.043
LOAD+8.430
Published+7.792
olo+7.725
Neg logit contributions
 dstg-12.010
idences-11.833
rament-11.727
nings-11.663
pps-11.407
Tokendeep
Token ID22089
Feature activation+1.67e+00

Pos logit contributions
iculty+9.935
aline+9.525
easy+8.461
LOAD+8.437
Published+8.178
Neg logit contributions
 dstg-11.866
 Kap-11.764
 Cth-11.335
idences-11.202
nings-11.193
Token "
Token ID366
Feature activation+6.92e-01

Pos logit contributions
aline+11.505
iculty+11.267
\":+10.564
 DOI+10.446
LOAD+10.382
Neg logit contributions
idences-13.113
 dstg-12.717
nings-11.933
pps-11.901
oldown-11.878
Tokensource
Token ID10459
Feature activation-5.29e-02

Pos logit contributions
aline+6.327
iculty+6.307
LOAD+5.917
\":+5.791
Published+5.535
Neg logit contributions
 Kap-5.433
idences-5.308
nings-5.234
 Cth-5.160
 dstg-5.131
Token rock
Token ID3881
Feature activation+2.16e-01

Pos logit contributions
 Kap+0.467
nings+0.463
idences+0.453
sung+0.447
ings+0.444
Neg logit contributions
iculty-0.463
aline-0.453
LOAD-0.425
\":-0.419
olo-0.395
Token"
Token ID1
Feature activation+1.09e+00

Pos logit contributions
iculty+1.935
aline+1.893
LOAD+1.761
\":+1.747
olo+1.650
Neg logit contributions
 Kap-1.833
nings-1.810
idences-1.794
sung-1.764
 dstg-1.749
Token out
Token ID503
Feature activation+2.96e-01

Pos logit contributions
iculty+7.815
aline+7.654
LOAD+6.938
\":+6.828
 DOI+6.609
Neg logit contributions
idences-9.527
 dstg-9.470
nings-9.261
pps-9.164
 Kap-9.109
Token 17
Token ID1596
Feature activation+2.25e-01

Pos logit contributions
iculty+3.958
aline+3.912
LOAD+3.726
\":+3.621
olo+3.410
Neg logit contributions
idences-5.131
nings-5.099
 Kap-5.008
sung-4.944
rament-4.935
Token Krypt
Token ID44679
Feature activation+3.39e-01

Pos logit contributions
iculty+2.079
aline+2.033
LOAD+1.947
\":+1.899
olo+1.813
Neg logit contributions
nings-1.807
 Kap-1.792
idences-1.790
sung-1.749
ings-1.733
Tokenon
Token ID261
Feature activation+2.73e-01

Pos logit contributions
iculty+2.116
aline+2.076
\":+1.836
LOAD+1.820
olo+1.769
Neg logit contributions
 Kap-3.800
nings-3.748
idences-3.720
sung-3.647
 Cth-3.625
Token as
Token ID355
Feature activation+8.24e-01

Pos logit contributions
iculty+2.212
aline+2.160
LOAD+2.054
\":+2.002
olo+1.909
Neg logit contributions
nings-2.504
 Kap-2.498
idences-2.495
sung-2.435
rament-2.424
Token dexter
Token ID44405
Feature activation+9.37e-01

Pos logit contributions
aline+7.023
iculty+6.876
LOAD+6.475
\":+6.397
olo+6.363
Neg logit contributions
nings-6.427
idences-6.378
pai-6.284
sung-6.267
rament-6.241
Tokenously
Token ID3481
Feature activation+1.66e+00

Pos logit contributions
aline+6.616
iculty+6.397
\":+5.959
LOAD+5.874
olo+5.646
Neg logit contributions
 Kap-8.569
nings-8.514
idences-8.465
sung-8.426
rament-8.181
Token as
Token ID355
Feature activation+2.23e-01

Pos logit contributions
aline+10.370
iculty+10.191
LOAD+9.826
Published+9.791
\":+9.722
Neg logit contributions
 dstg-13.337
rament-13.277
idences-12.999
 Kap-12.977
nings-12.961
Token Kodi
Token ID44129
Feature activation+4.92e-01

Pos logit contributions
iculty+2.085
aline+2.077
LOAD+1.947
\":+1.932
olo+1.843
Neg logit contributions
nings-1.754
 Kap-1.734
idences-1.729
sung-1.708
rament-1.668
Token 16
Token ID1467
Feature activation+4.04e-02

Pos logit contributions
aline+4.256
iculty+4.220
LOAD+4.011
\":+3.933
olo+3.809
Neg logit contributions
idences-4.098
nings-4.098
 Kap-4.038
sung-3.993
rament-3.942
Token.
Token ID13
Feature activation+1.36e-01

Pos logit contributions
iculty+0.356
aline+0.350
LOAD+0.331
\":+0.325
olo+0.309
Neg logit contributions
 Kap-0.347
nings-0.346
idences-0.341
sung-0.335
ings-0.334
Token1
Token ID16
Feature activation+3.24e-01

Pos logit contributions
iculty+1.252
aline+1.233
LOAD+1.193
\":+1.158
Published+1.107
Neg logit contributions
nings-1.099
idences-1.081
 Kap-1.076
sung-1.054
ings-1.048
Token dominant
Token ID11410
Feature activation+1.65e-01

Pos logit contributions
iculty+2.988
aline+2.918
LOAD+2.737
\":+2.670
olo+2.548
Neg logit contributions
nings-3.046
 Kap-2.984
idences-2.970
sung-2.938
ings-2.931
Token tactical
Token ID16106
Feature activation-2.88e-02

Pos logit contributions
iculty+1.234
aline+1.217
LOAD+1.127
\":+1.091
olo+1.048
Neg logit contributions
nings-1.647
 Kap-1.621
idences-1.614
sung-1.589
ings-1.568
Token advantage
Token ID4621
Feature activation-1.48e-01

Pos logit contributions
nings+0.283
 Kap+0.281
idences+0.275
sung+0.272
ings+0.271
Neg logit contributions
iculty-0.224
aline-0.218
LOAD-0.204
\":-0.199
olo-0.189
Token over
Token ID625
Feature activation+1.56e-01

Pos logit contributions
 Kap+1.254
nings+1.241
idences+1.214
sung+1.198
ings+1.195
Neg logit contributions
iculty-1.344
aline-1.310
LOAD-1.235
\":-1.217
olo-1.156
Token the
Token ID262
Feature activation+1.47e-01

Pos logit contributions
iculty+1.343
aline+1.318
LOAD+1.229
\":+1.215
olo+1.154
Neg logit contributions
nings-1.360
 Kap-1.355
idences-1.347
sung-1.333
ings-1.308
Token ch
Token ID442
Feature activation+1.66e+00

Pos logit contributions
iculty+1.204
aline+1.180
LOAD+1.096
\":+1.092
olo+1.025
Neg logit contributions
nings-1.368
 Kap-1.354
idences-1.336
sung-1.326
ings-1.314
Tokenasm
Token ID8597
Feature activation+6.16e-01

Pos logit contributions
olo+12.471
aline+12.200
iculty+12.188
asms+10.679
erate+10.285
Neg logit contributions
nings-13.479
 Kap-12.817
 dstg-11.979
 Cth-11.923
 Aadhaar-11.857
Token below
Token ID2174
Feature activation+3.55e-01

Pos logit contributions
iculty+5.218
aline+5.171
LOAD+4.681
olo+4.654
\":+4.511
Neg logit contributions
nings-5.238
idences-5.207
 Kap-5.118
rament-5.081
pps-5.012
Token.
Token ID13
Feature activation+2.82e-01

Pos logit contributions
iculty+2.859
aline+2.827
LOAD+2.636
\":+2.523
olo+2.460
Neg logit contributions
nings-3.256
 Kap-3.201
idences-3.180
sung-3.143
rament-3.105
Token War
Token ID1810
Feature activation-7.69e-02

Pos logit contributions
iculty+2.560
aline+2.495
LOAD+2.372
\":+2.325
Published+2.202
Neg logit contributions
nings-2.352
idences-2.297
 Kap-2.243
rament-2.238
sung-2.230
Tokenth
Token ID400
Feature activation+2.33e-01

Pos logit contributions
 Kap+0.596
nings+0.592
idences+0.575
ings+0.565
sung+0.565
Neg logit contributions
iculty-0.754
aline-0.738
LOAD-0.700
\":-0.686
olo-0.659
TokenExample
Token ID16281
Feature activation+1.21e-01

Pos logit contributions
iculty+1.401
aline+1.344
LOAD+1.259
\":+1.221
olo+1.153
Neg logit contributions
 Kap-2.076
nings-2.055
idences-2.046
sung-2.003
ings-1.989
Token:
Token ID25
Feature activation-8.63e-01

Pos logit contributions
iculty+0.830
aline+0.803
\":+0.758
LOAD+0.747
olo+0.681
Neg logit contributions
 Kap-1.278
nings-1.275
idences-1.269
sung-1.249
rament-1.234
Token Your
Token ID3406
Feature activation-5.84e-01

Pos logit contributions
 Kap+3.557
nings+3.382
ings+3.256
 Cth+3.042
sung+3.016
Neg logit contributions
iculty-10.914
aline-10.913
LOAD-10.500
\":-10.375
Published-10.007
Token +
Token ID1343
Feature activation-3.65e-01

Pos logit contributions
 Kap+4.719
nings+4.568
ings+4.467
sung+4.417
idences+4.389
Neg logit contributions
iculty-5.493
aline-5.251
LOAD-5.062
\":-4.885
olo-4.720
Token1
Token ID16
Feature activation-1.12e+00

Pos logit contributions
 Kap+2.206
nings+2.190
ings+2.099
idences+2.079
sung+2.066
Neg logit contributions
iculty-4.215
aline-4.124
LOAD-3.941
\":-3.869
olo-3.732
Token shield
Token ID7614
Feature activation-1.99e+00

Pos logit contributions
 Kap+5.267
 invis+4.833
ings+4.618
nings+4.561
sung+4.414
Neg logit contributions
iculty-13.701
LOAD-13.062
aline-12.970
Published-12.391
\":-12.363
Token keeps
Token ID7622
Feature activation-2.30e-01

Pos logit contributions
nings+5.880
ings+5.309
 Klan+4.758
idences+4.708
 ling+4.656
Neg logit contributions
iculty-24.949
aline-23.841
LOAD-23.688
erate-22.584
olo-22.352
Token it
Token ID340
Feature activation-2.91e-01

Pos logit contributions
 Kap+2.073
nings+2.053
idences+1.997
ings+1.993
sung+1.975
Neg logit contributions
iculty-1.974
aline-1.918
LOAD-1.806
\":-1.768
olo-1.673
Token
Token ID447
Feature activation-3.98e-01

Pos logit contributions
 Kap+2.491
nings+2.464
idences+2.403
ings+2.375
sung+2.367
Neg logit contributions
iculty-2.641
aline-2.580
LOAD-2.428
\":-2.401
olo-2.267
Token
Token ID247
Feature activation-4.04e-01

Pos logit contributions
 Kap+2.921
nings+2.887
idences+2.800
ings+2.785
sung+2.764
Neg logit contributions
iculty-4.064
aline-3.963
LOAD-3.768
\":-3.709
olo-3.532
Tokens
Token ID82
Feature activation-6.37e-01

Pos logit contributions
nings+2.423
 Kap+2.412
ings+2.330
idences+2.297
sung+2.290
Neg logit contributions
iculty-4.680
aline-4.573
LOAD-4.391
\":-4.309
olo-4.142
Tokenogen
Token ID6644
Feature activation+2.09e-01

Pos logit contributions
aline+1.404
iculty+1.374
LOAD+1.295
\":+1.270
olo+1.264
Neg logit contributions
nings-0.779
 Kap-0.736
idences-0.723
ings-0.704
sung-0.701
Token F
Token ID376
Feature activation+2.39e-02

Pos logit contributions
aline+1.672
iculty+1.672
LOAD+1.548
\":+1.535
olo+1.443
Neg logit contributions
nings-1.951
 Kap-1.942
idences-1.921
ings-1.890
sung-1.878
Tokenoul
Token ID2852
Feature activation+5.60e-01

Pos logit contributions
iculty+0.223
aline+0.220
LOAD+0.205
\":+0.201
olo+0.195
Neg logit contributions
 Kap-0.197
nings-0.194
idences-0.189
sung-0.187
ings-0.186
Tokenkes
Token ID5209
Feature activation+5.77e-01

Pos logit contributions
iculty+5.779
aline+5.549
olo+5.204
LOAD+5.157
\":+5.146
Neg logit contributions
 Kap-3.837
idences-3.811
nings-3.776
 Cth-3.727
rament-3.719
Token,
Token ID11
Feature activation-3.87e-01

Pos logit contributions
iculty+4.490
aline+4.478
LOAD+4.088
\":+4.014
olo+4.012
Neg logit contributions
nings-5.317
 Kap-5.189
idences-5.168
 Cth-5.083
sung-5.043
Token in
Token ID287
Feature activation-1.95e+00

Pos logit contributions
 Kap+3.421
nings+3.360
idences+3.277
ings+3.259
sung+3.245
Neg logit contributions
iculty-3.393
aline-3.296
LOAD-3.114
\":-3.062
olo-2.878
Token Geneva
Token ID20552
Feature activation-8.34e-01

Pos logit contributions
 Kap+12.223
 Bengal+10.624
 Ky+10.195
 Seoul+10.175
nings+10.110
Neg logit contributions
iculty-18.179
aline-17.313
LOAD-17.082
\":-16.113
erate-15.565
Token,
Token ID11
Feature activation+1.28e-01

Pos logit contributions
 Kap+6.116
nings+5.956
ings+5.851
sung+5.672
idences+5.647
Neg logit contributions
iculty-8.123
aline-7.782
\":-7.529
LOAD-7.507
olo-6.875
Token says
Token ID1139
Feature activation-9.87e-02

Pos logit contributions
iculty+1.040
aline+1.026
LOAD+0.952
\":+0.934
olo+0.890
Neg logit contributions
nings-1.191
 Kap-1.185
idences-1.166
sung-1.141
ings-1.140
Token that
Token ID326
Feature activation-2.05e-01

Pos logit contributions
 Kap+0.840
nings+0.839
idences+0.816
sung+0.808
ings+0.805
Neg logit contributions
iculty-0.892
aline-0.871
LOAD-0.819
\":-0.807
olo-0.765
Token while
Token ID981
Feature activation-1.64e-02

Pos logit contributions
 Kap+1.779
nings+1.769
idences+1.722
ings+1.706
sung+1.703
Neg logit contributions
iculty-1.823
aline-1.777
LOAD-1.673
\":-1.645
olo-1.559
Token is
Token ID318
Feature activation-3.14e-02

Pos logit contributions
iculty+0.278
aline+0.272
LOAD+0.256
\":+0.251
olo+0.238
Neg logit contributions
nings-0.283
 Kap-0.281
idences-0.277
sung-0.275
ings-0.270
Token relevant
Token ID5981
Feature activation-1.62e-01

Pos logit contributions
nings+0.265
 Kap+0.264
idences+0.262
sung+0.256
ings+0.253
Neg logit contributions
iculty-0.284
aline-0.276
LOAD-0.262
\":-0.256
olo-0.245
Token,"
Token ID553
Feature activation-7.56e-01

Pos logit contributions
 Kap+1.391
nings+1.382
idences+1.349
sung+1.332
ings+1.331
Neg logit contributions
iculty-1.464
aline-1.430
LOAD-1.347
\":-1.326
olo-1.255
Token he
Token ID339
Feature activation-6.09e-01

Pos logit contributions
 Kap+6.352
nings+6.015
ings+5.940
idences+5.826
sung+5.762
Neg logit contributions
iculty-6.895
aline-6.680
LOAD-6.338
\":-6.196
olo-5.870
Token said
Token ID531
Feature activation-1.02e+00

Pos logit contributions
 Kap+5.503
nings+5.405
ings+5.282
idences+5.205
sung+5.137
Neg logit contributions
iculty-5.257
aline-4.964
LOAD-4.773
\":-4.629
olo-4.312
Token at
Token ID379
Feature activation-1.91e+00

Pos logit contributions
 Kap+7.631
nings+7.348
ings+7.334
 unwittingly+6.946
idences+6.900
Neg logit contributions
iculty-9.941
aline-9.463
LOAD-9.117
\":-8.993
olo-8.423
Token an
Token ID281
Feature activation-1.34e+00

Pos logit contributions
 Kap+13.954
nings+12.084
 Bengal+11.958
 Lok+11.940
ings+11.837
Neg logit contributions
iculty-15.621
aline-14.805
LOAD-14.618
\":-14.122
erate-13.428
Token interactive
Token ID14333
Feature activation-5.92e-01

Pos logit contributions
 Kap+9.533
ings+9.046
 Aadhaar+8.655
idences+8.527
nings+8.526
Neg logit contributions
iculty-12.286
LOAD-11.927
aline-11.482
\":-11.103
erate-10.677
Token session
Token ID6246
Feature activation-4.87e-01

Pos logit contributions
 Kap+4.771
nings+4.589
ings+4.518
idences+4.481
sung+4.422
Neg logit contributions
iculty-5.668
aline-5.444
LOAD-5.172
\":-5.142
olo-4.841
Token after
Token ID706
Feature activation-1.36e+00

Pos logit contributions
 Kap+4.105
nings+4.032
ings+3.941
idences+3.880
sung+3.838
Neg logit contributions
iculty-4.509
aline-4.370
LOAD-4.107
\":-4.080
olo-3.831
Token delivering
Token ID13630
Feature activation-1.51e+00

Pos logit contributions
 Kap+10.655
nings+9.852
ings+9.749
 unwittingly+9.385
idences+9.178
Neg logit contributions
iculty-11.843
aline-11.531
\":-10.839
LOAD-10.828
erate-10.058
Tokenury
Token ID1601
Feature activation-6.20e-01

Pos logit contributions
iculty+0.125
aline+0.122
\":+0.113
LOAD+0.112
olo+0.106
Neg logit contributions
 Kap-0.140
nings-0.137
idences-0.133
ings-0.132
sung-0.132
Token agreed
Token ID4987
Feature activation-6.11e-01

Pos logit contributions
 Kap+4.851
nings+4.788
ings+4.649
idences+4.579
sung+4.532
Neg logit contributions
iculty-6.040
aline-5.819
LOAD-5.539
\":-5.363
olo-5.085
Token.
Token ID13
Feature activation-3.83e-01

Pos logit contributions
 Kap+4.740
nings+4.719
ings+4.525
idences+4.484
sung+4.439
Neg logit contributions
iculty-5.931
aline-5.759
LOAD-5.408
\":-5.318
olo-5.054
Token
Token ID851
Feature activation-4.42e-01

Pos logit contributions
 Kap+2.920
nings+2.865
ings+2.757
idences+2.754
sung+2.724
Neg logit contributions
iculty-3.860
aline-3.757
LOAD-3.552
\":-3.497
olo-3.346
Token Ren
Token ID7152
Feature activation+2.08e-02

Pos logit contributions
 Kap+3.913
nings+3.849
idences+3.719
ings+3.716
sung+3.695
Neg logit contributions
iculty-3.893
aline-3.731
LOAD-3.561
\":-3.458
olo-3.282
Tokenaud
Token ID3885
Feature activation-1.85e+00

Pos logit contributions
iculty+0.200
aline+0.197
LOAD+0.185
\":+0.183
olo+0.174
Neg logit contributions
 Kap-0.165
nings-0.163
idences-0.159
sung-0.157
ings-0.156
Token Lav
Token ID21438
Feature activation+2.87e-02

Pos logit contributions
 Kap+7.827
 Caps+5.186
 Klan+5.077
ings+5.021
 Lok+4.791
Neg logit contributions
iculty-22.928
aline-20.979
LOAD-20.387
aturdays-20.361
erate-20.071
Tokeno
Token ID78
Feature activation+1.44e-01

Pos logit contributions
iculty+0.143
aline+0.138
LOAD+0.121
\":+0.119
olo+0.109
Neg logit contributions
 Kap-0.362
nings-0.359
idences-0.353
sung-0.350
ings-0.350
Tokenie
Token ID494
Feature activation-6.92e-01

Pos logit contributions
iculty+0.688
aline+0.654
\":+0.584
LOAD+0.569
olo+0.518
Neg logit contributions
 Kap-1.839
nings-1.821
idences-1.792
sung-1.775
 Cth-1.772
Token (@
Token ID4275
Feature activation-1.03e+00

Pos logit contributions
 Kap+5.119
nings+4.930
ings+4.807
sung+4.614
idences+4.580
Neg logit contributions
iculty-7.018
aline-6.894
LOAD-6.377
\":-6.009
erate-5.991
Tokenren
Token ID918
Feature activation-1.32e+00

Pos logit contributions
nings+5.648
sung+5.596
ings+5.464
rament+5.236
idences+5.209
Neg logit contributions
iculty-11.447
LOAD-11.131
aline-10.991
\":-10.614
 DOI-10.161
Token Th
Token ID536
Feature activation-2.36e-01

Pos logit contributions
iculty+2.683
aline+2.618
LOAD+2.460
\":+2.457
olo+2.376
Neg logit contributions
nings-2.289
 Kap-2.275
idences-2.252
sung-2.217
rament-2.167
Tokenad
Token ID324
Feature activation+2.79e-02

Pos logit contributions
 Kap+1.952
nings+1.935
idences+1.880
ings+1.863
sung+1.855
Neg logit contributions
iculty-2.211
aline-2.157
LOAD-2.036
\":-2.006
olo-1.906
Tokenik
Token ID1134
Feature activation-2.99e-01

Pos logit contributions
iculty+0.256
aline+0.254
LOAD+0.236
\":+0.234
olo+0.226
Neg logit contributions
nings-0.228
 Kap-0.228
idences-0.224
sung-0.219
ings-0.218
Tokenomb
Token ID2381
Feature activation+1.15e-01

Pos logit contributions
 Kap+2.522
nings+2.485
idences+2.415
ings+2.410
sung+2.386
Neg logit contributions
iculty-2.758
aline-2.673
LOAD-2.541
\":-2.489
olo-2.347
Tokenu
Token ID84
Feature activation-3.98e-01

Pos logit contributions
iculty+0.930
aline+0.911
\":+0.834
LOAD+0.832
olo+0.811
Neg logit contributions
nings-1.080
 Kap-1.072
idences-1.052
sung-1.027
ings-1.025
Token D
Token ID360
Feature activation-1.80e+00

Pos logit contributions
 Kap+3.341
nings+3.273
ings+3.188
idences+3.182
sung+3.148
Neg logit contributions
iculty-3.661
aline-3.572
LOAD-3.380
\":-3.307
olo-3.107
Tokenind
Token ID521
Feature activation-3.89e-01

Pos logit contributions
ings+15.357
 Kap+15.200
sung+14.784
idences+14.667
nings+14.344
Neg logit contributions
iculty-11.520
aline-10.470
\":-10.161
LOAD-10.099
 DOI-9.337
Tokenig
Token ID328
Feature activation-2.66e-01

Pos logit contributions
 Kap+3.533
nings+3.492
idences+3.402
ings+3.400
sung+3.370
Neg logit contributions
iculty-3.305
aline-3.190
LOAD-3.042
\":-2.968
Published-2.765
Tokenul
Token ID377
Feature activation-3.69e-01

Pos logit contributions
 Kap+2.349
nings+2.330
idences+2.269
ings+2.253
sung+2.242
Neg logit contributions
iculty-2.339
aline-2.277
LOAD-2.145
\":-2.110
olo-1.993
Token
Token ID198
Feature activation-5.27e-01

Pos logit contributions
 Kap+3.206
nings+3.151
ings+3.071
idences+3.062
sung+3.037
Neg logit contributions
iculty-3.307
aline-3.207
LOAD-3.048
\":-2.977
olo-2.786
Token
Token ID198
Feature activation-3.66e-01

Pos logit contributions
 Kap+3.647
nings+3.563
ings+3.479
sung+3.438
idences+3.430
Neg logit contributions
iculty-5.499
aline-5.416
LOAD-5.083
\":-5.042
olo-4.832
TokenThe
Token ID464
Feature activation-3.47e-01

Pos logit contributions
iculty+0.567
aline+0.558
LOAD+0.542
\":+0.523
Published+0.505
Neg logit contributions
 Kap-0.525
nings-0.522
idences-0.514
rament-0.499
ings-0.497
Token BBC
Token ID7823
Feature activation-6.22e-01

Pos logit contributions
 Kap+3.020
nings+2.971
idences+2.895
ings+2.883
sung+2.855
Neg logit contributions
iculty-3.098
aline-3.012
LOAD-2.839
\":-2.795
olo-2.639
Token's
Token ID338
Feature activation-5.33e-01

Pos logit contributions
 Kap+4.679
nings+4.538
ings+4.448
idences+4.421
sung+4.357
Neg logit contributions
iculty-6.218
aline-6.031
LOAD-5.738
\":-5.645
olo-5.340
Token Jonah
Token ID45447
Feature activation+9.89e-01

Pos logit contributions
 Kap+4.819
nings+4.671
ings+4.562
idences+4.540
sung+4.494
Neg logit contributions
iculty-4.575
aline-4.404
LOAD-4.172
\":-4.088
olo-3.814
Token Fisher
Token ID14388
Feature activation+6.47e-01

Pos logit contributions
aline+7.902
iculty+7.238
 Webb+7.211
LOAD+7.185
olo+7.152
Neg logit contributions
nings-8.711
idences-8.149
pai-7.926
 dstg-7.882
rament-7.835
Token in
Token ID287
Feature activation-1.78e+00

Pos logit contributions
aline+5.305
iculty+5.240
\":+4.863
LOAD+4.749
olo+4.677
Neg logit contributions
idences-5.537
nings-5.439
 Kap-5.398
 Cth-5.330
sung-5.328
Token Yang
Token ID10998
Feature activation-1.55e-01

Pos logit contributions
 Kap+13.153
 Bengal+11.812
 Ky+11.131
pps+11.063
ğ+10.885
Neg logit contributions
iculty-15.579
aline-14.897
LOAD-14.125
\":-13.462
olo-12.948
Tokenon
Token ID261
Feature activation-2.62e-01

Pos logit contributions
 Kap+1.615
nings+1.605
idences+1.576
ings+1.556
sung+1.548
Neg logit contributions
iculty-1.099
aline-1.069
LOAD-0.993
\":-0.962
olo-0.915
Token,
Token ID11
Feature activation-1.02e-01

Pos logit contributions
 Kap+2.270
nings+2.248
idences+2.186
ings+2.173
sung+2.161
Neg logit contributions
iculty-2.353
aline-2.290
LOAD-2.162
\":-2.125
olo-2.008
Token also
Token ID635
Feature activation-4.55e-01

Pos logit contributions
nings+0.807
 Kap+0.802
idences+0.784
ings+0.774
sung+0.768
Neg logit contributions
iculty-0.983
aline-0.965
LOAD-0.914
\":-0.899
olo-0.858
Token known
Token ID1900
Feature activation-1.33e-02

Pos logit contributions
 Kap+3.974
nings+3.903
ings+3.812
idences+3.767
sung+3.760
Neg logit contributions
iculty-4.051
aline-3.929
LOAD-3.698
\":-3.633
olo-3.411
Token the
Token ID262
Feature activation-7.90e-01

Pos logit contributions
 Kap+7.633
nings+7.157
ings+7.131
idences+6.896
pps+6.849
Neg logit contributions
iculty-9.546
aline-9.239
LOAD-8.841
\":-8.598
Published-8.214
Token victims
Token ID4970
Feature activation-3.93e-01

Pos logit contributions
 Kap+6.600
nings+6.322
ings+6.223
idences+6.087
sung+6.025
Neg logit contributions
iculty-7.197
aline-6.969
LOAD-6.652
\":-6.474
olo-6.125
Token
Token ID447
Feature activation-1.46e-01

Pos logit contributions
 Kap+3.010
nings+2.981
idences+2.885
ings+2.866
sung+2.843
Neg logit contributions
iculty-3.899
aline-3.817
LOAD-3.606
\":-3.545
olo-3.362
Token
Token ID247
Feature activation-2.94e-01

Pos logit contributions
 Kap+1.232
nings+1.224
idences+1.189
ings+1.178
sung+1.173
Neg logit contributions
iculty-1.344
aline-1.310
LOAD-1.237
\":-1.218
olo-1.155
Token group
Token ID1448
Feature activation-8.14e-01

Pos logit contributions
 Kap+2.429
nings+2.408
idences+2.338
ings+2.324
sung+2.309
Neg logit contributions
iculty-2.755
aline-2.683
LOAD-2.538
\":-2.497
olo-2.367
Token U
Token ID471
Feature activation-1.77e+00

Pos logit contributions
 Kap+6.692
nings+6.505
ings+6.391
idences+6.153
sung+6.091
Neg logit contributions
iculty-7.642
aline-7.301
LOAD-6.874
\":-6.741
Published-6.372
TokenOK
Token ID11380
Feature activation-4.01e-01

Pos logit contributions
pps+14.167
nings+13.551
ings+13.300
 Kap+12.983
sung+12.840
Neg logit contributions
iculty-13.847
aline-13.107
\":-12.418
LOAD-11.999
Published-11.657
TokenG
Token ID38
Feature activation-7.66e-01

Pos logit contributions
 Kap+3.465
nings+3.422
idences+3.322
ings+3.311
sung+3.280
Neg logit contributions
iculty-3.603
aline-3.498
LOAD-3.279
\":-3.241
olo-3.050
Token,
Token ID11
Feature activation+1.65e-01

Pos logit contributions
 Kap+6.385
nings+6.310
ings+6.118
idences+5.965
sung+5.930
Neg logit contributions
iculty-7.003
aline-6.745
\":-6.277
LOAD-6.258
Published-5.793
Token said
Token ID531
Feature activation-1.50e-01

Pos logit contributions
iculty+1.231
aline+1.225
LOAD+1.136
\":+1.132
olo+1.052
Neg logit contributions
nings-1.591
 Kap-1.591
idences-1.583
sung-1.557
ings-1.538
Token Ike
Token ID34880
Feature activation-2.37e-01

Pos logit contributions
 Kap+1.315
nings+1.310
idences+1.281
sung+1.262
ings+1.261
Neg logit contributions
iculty-1.309
aline-1.276
LOAD-1.203
\":-1.185
olo-1.118
Token
Token ID198
Feature activation-6.33e-01

Pos logit contributions
 Kap+2.316
nings+2.279
ings+2.211
idences+2.180
sung+2.167
Neg logit contributions
iculty-3.539
aline-3.460
LOAD-3.274
\":-3.201
olo-3.092
TokenThe
Token ID464
Feature activation-4.89e-01

Pos logit contributions
 Kap+4.892
nings+4.876
sung+4.774
ings+4.760
idences+4.614
Neg logit contributions
iculty-6.097
aline-5.883
LOAD-5.442
\":-5.400
olo-5.210
Token No
Token ID1400
Feature activation-5.45e-01

Pos logit contributions
 Kap+4.372
nings+4.277
ings+4.169
idences+4.166
sung+4.116
Neg logit contributions
iculty-4.228
aline-4.095
LOAD-3.880
\":-3.775
olo-3.587
Token.
Token ID13
Feature activation-6.60e-01

Pos logit contributions
 Kap+4.514
nings+4.454
ings+4.387
sung+4.352
idences+4.340
Neg logit contributions
iculty-5.011
aline-4.845
LOAD-4.592
\":-4.491
olo-4.244
Token 4
Token ID604
Feature activation-3.04e-01

Pos logit contributions
 Kap+5.328
nings+5.174
ings+5.089
sung+5.006
idences+4.943
Neg logit contributions
iculty-6.255
aline-6.083
LOAD-5.685
\":-5.616
olo-5.332
Token U
Token ID471
Feature activation-1.76e+00

Pos logit contributions
 Kap+2.207
nings+2.177
idences+2.105
ings+2.101
sung+2.084
Neg logit contributions
iculty-3.141
aline-3.077
LOAD-2.920
\":-2.880
olo-2.746
Token.
Token ID13
Feature activation-3.32e-01

Pos logit contributions
ings+9.997
pps+9.913
sung+9.758
 Kap+9.146
nings+8.909
Neg logit contributions
iculty-16.490
aline-16.253
LOAD-15.163
\":-15.052
erate-14.475
TokenS
Token ID50
Feature activation-1.45e-01

Pos logit contributions
 Kap+2.492
nings+2.450
ings+2.370
idences+2.368
sung+2.354
Neg logit contributions
iculty-3.362
aline-3.282
LOAD-3.102
\":-3.053
olo-2.917
Token.
Token ID13
Feature activation+2.32e-02

Pos logit contributions
 Kap+1.268
nings+1.254
idences+1.221
ings+1.220
sung+1.216
Neg logit contributions
iculty-1.279
aline-1.248
LOAD-1.177
\":-1.158
olo-1.086
Token mobile
Token ID5175
Feature activation-3.87e-01

Pos logit contributions
iculty+0.275
aline+0.269
LOAD+0.259
\":+0.255
olo+0.247
Neg logit contributions
nings-0.134
 Kap-0.132
idences-0.127
ings-0.123
sung-0.122
Token network
Token ID3127
Feature activation-5.13e-01

Pos logit contributions
 Kap+2.539
nings+2.505
idences+2.412
ings+2.407
sung+2.382
Neg logit contributions
iculty-4.260
aline-4.176
LOAD-3.984
\":-3.923
olo-3.750
Token13
Token ID1485
Feature activation+2.87e-01

Pos logit contributions
iculty+1.430
aline+1.410
LOAD+1.336
\":+1.305
olo+1.241
Neg logit contributions
 Kap-1.369
idences-1.353
nings-1.343
rament-1.296
 dstg-1.287
Tokenth
Token ID400
Feature activation+1.58e-01

Pos logit contributions
iculty+2.544
aline+2.448
LOAD+2.311
\":+2.294
olo+2.203
Neg logit contributions
 Kap-2.443
nings-2.383
idences-2.365
rament-2.316
 dstg-2.283
Token century
Token ID4289
Feature activation+1.17e-01

Pos logit contributions
iculty+1.558
aline+1.518
LOAD+1.438
\":+1.429
olo+1.363
Neg logit contributions
 Kap-1.182
nings-1.181
idences-1.156
sung-1.129
rament-1.126
Token Sanskrit
Token ID46178
Feature activation-6.32e-01

Pos logit contributions
iculty+1.006
aline+0.987
LOAD+0.929
\":+0.911
olo+0.874
Neg logit contributions
nings-1.020
idences-1.013
 Kap-1.008
sung-0.985
rament-0.977
Token,
Token ID11
Feature activation-7.43e-01

Pos logit contributions
 Kap+5.663
nings+5.428
ings+5.412
sung+5.269
idences+5.224
Neg logit contributions
iculty-5.527
aline-5.297
LOAD-5.026
\":-4.900
olo-4.583
Token U
Token ID471
Feature activation-1.76e+00

Pos logit contributions
 Kap+6.065
nings+5.623
ings+5.617
sung+5.544
 Cth+5.492
Neg logit contributions
iculty-7.148
aline-6.849
LOAD-6.461
\":-6.396
olo-6.049
Tokeny
Token ID88
Feature activation-5.92e-01

Pos logit contributions
pps+13.645
sung+13.449
nings+13.281
ğ+13.097
ings+13.079
Neg logit contributions
iculty-13.033
aline-12.345
\":-12.140
LOAD-12.099
 DOI-11.607
Tokengh
Token ID456
Feature activation-5.69e-02

Pos logit contributions
 Kap+5.144
nings+5.087
ings+4.975
sung+4.944
idences+4.910
Neg logit contributions
iculty-5.206
aline-4.993
LOAD-4.855
\":-4.697
Published-4.392
Tokenur
Token ID333
Feature activation-6.69e-02

Pos logit contributions
 Kap+0.628
nings+0.625
idences+0.613
ings+0.603
sung+0.602
Neg logit contributions
iculty-0.367
aline-0.362
\":-0.323
LOAD-0.322
olo-0.301
Token,
Token ID11
Feature activation-8.21e-01

Pos logit contributions
nings+0.596
 Kap+0.595
idences+0.582
ings+0.571
sung+0.568
Neg logit contributions
iculty-0.575
aline-0.563
LOAD-0.527
\":-0.519
olo-0.493
Token and
Token ID290
Feature activation-1.09e+00

Pos logit contributions
 Kap+6.609
nings+6.108
ings+6.090
 Cth+6.050
sung+6.024
Neg logit contributions
iculty-7.950
aline-7.517
LOAD-7.217
\":-7.047
olo-6.698
Token means
Token ID1724
Feature activation-5.41e-01

Pos logit contributions
iculty+1.168
aline+1.135
LOAD+1.069
\":+1.053
olo+0.998
Neg logit contributions
nings-1.245
 Kap-1.224
idences-1.218
sung-1.199
rament-1.191
Token BW
Token ID37869
Feature activation-7.89e-01

Pos logit contributions
 Kap+4.507
nings+4.441
ings+4.396
idences+4.299
sung+4.215
Neg logit contributions
iculty-4.979
aline-4.859
LOAD-4.573
\":-4.486
Published-4.230
Token Editor
Token ID12058
Feature activation-1.10e+00

Pos logit contributions
 Kap+6.529
ings+6.411
nings+6.356
sung+6.191
idences+6.156
Neg logit contributions
iculty-7.053
aline-6.823
LOAD-6.385
\":-6.265
Published-5.868
Token-
Token ID12
Feature activation-9.92e-01

Pos logit contributions
 Kap+8.820
nings+8.231
ings+8.216
sung+8.160
idences+7.911
Neg logit contributions
iculty-10.005
aline-9.875
LOAD-9.129
\":-8.860
olo-8.143
TokenIn
Token ID818
Feature activation-8.83e-01

Pos logit contributions
 Kap+7.951
sung+7.492
ings+7.426
nings+7.383
idences+7.318
Neg logit contributions
iculty-8.963
aline-8.838
LOAD-8.154
\":-7.956
 DOI-7.438
Token-
Token ID12
Feature activation-1.75e+00

Pos logit contributions
 Kap+6.146
nings+5.877
ings+5.738
idences+5.613
sung+5.606
Neg logit contributions
aline-8.755
iculty-8.664
LOAD-8.300
\":-7.933
olo-7.729
TokenChief
Token ID23675
Feature activation-9.20e-01

Pos logit contributions
sung+3.501
 Kap+3.272
pps+3.080
Ky+2.983
ring+2.843
Neg logit contributions
aline-22.447
iculty-21.160
\":-20.986
 DOI-20.945
LOAD-20.820
Token "
Token ID366
Feature activation-2.69e-01

Pos logit contributions
 Kap+7.225
nings+6.808
ings+6.670
idences+6.613
pai+6.510
Neg logit contributions
aline-8.718
iculty-8.637
LOAD-7.912
\":-7.801
Published-7.324
TokenWatch
Token ID10723
Feature activation-4.48e-01

Pos logit contributions
 Kap+2.328
nings+2.309
idences+2.248
ings+2.232
sung+2.221
Neg logit contributions
iculty-2.399
aline-2.338
LOAD-2.203
\":-2.167
olo-2.051
Token Bake
Token ID38493
Feature activation-5.44e-01

Pos logit contributions
 Kap+3.917
nings+3.868
ings+3.768
idences+3.739
sung+3.715
Neg logit contributions
iculty-3.983
aline-3.864
LOAD-3.621
\":-3.591
olo-3.382
Tokenmon
Token ID2144
Feature activation+1.19e-01

Pos logit contributions
 Kap+3.566
nings+3.403
ings+3.396
sung+3.316
idences+3.233
Neg logit contributions
iculty-6.017
aline-5.833
LOAD-5.564
\":-5.435
Published-5.225
Token out
Token ID503
Feature activation-8.67e-01

Pos logit contributions
iculty+2.479
aline+2.390
\":+2.227
LOAD+2.177
Published+2.039
Neg logit contributions
 Kap-4.507
nings-4.462
idences-4.424
sung-4.390
rament-4.382
Token how
Token ID703
Feature activation-5.58e-01

Pos logit contributions
 Kap+5.161
ings+4.953
nings+4.855
idences+4.544
 unwittingly+4.494
Neg logit contributions
iculty-9.928
aline-9.604
LOAD-9.153
\":-8.833
erate-8.650
Token the
Token ID262
Feature activation-2.68e-01

Pos logit contributions
 Kap+4.457
nings+4.365
ings+4.266
idences+4.166
sung+4.154
Neg logit contributions
iculty-5.404
aline-5.187
LOAD-4.938
\":-4.759
olo-4.635
Token Dan
Token ID6035
Feature activation+8.19e-01

Pos logit contributions
 Kap+2.385
nings+2.359
idences+2.297
ings+2.281
sung+2.271
Neg logit contributions
iculty-2.348
aline-2.281
LOAD-2.150
\":-2.109
olo-1.997
Tokenie
Token ID494
Feature activation-5.59e-01

Pos logit contributions
aline+3.089
iculty+2.957
\":+2.756
olo+2.356
LOAD+2.231
Neg logit contributions
 Cth-10.657
 Kap-10.621
idences-10.234
nings-10.230
 unwittingly-10.160
Tokenll
Token ID297
Feature activation-1.74e+00

Pos logit contributions
 Kap+4.462
nings+4.421
ings+4.369
idences+4.340
sung+4.314
Neg logit contributions
iculty-5.208
aline-4.922
LOAD-4.775
\":-4.553
Published-4.390
Token cell
Token ID2685
Feature activation-1.23e+00

Pos logit contributions
 Kap+7.666
ings+6.641
 pocket+6.295
nings+6.228
sung+6.115
Neg logit contributions
iculty-20.372
LOAD-18.545
aline-18.012
Published-17.862
\":-17.810
Token battery
Token ID6555
Feature activation-4.32e-01

Pos logit contributions
 Kap+6.789
ings+6.282
nings+6.178
rament+5.892
sung+5.878
Neg logit contributions
iculty-13.891
LOAD-12.692
aline-12.649
\":-12.117
Published-11.747
Token is
Token ID318
Feature activation-2.07e-01

Pos logit contributions
 Kap+3.491
nings+3.445
ings+3.400
idences+3.300
sung+3.283
Neg logit contributions
iculty-4.117
aline-3.997
LOAD-3.803
\":-3.696
olo-3.503
Token constructed
Token ID12006
Feature activation-5.69e-01

Pos logit contributions
 Kap+1.747
nings+1.727
idences+1.677
ings+1.673
sung+1.660
Neg logit contributions
iculty-1.902
aline-1.859
LOAD-1.754
\":-1.714
olo-1.629
Token.
Token ID13
Feature activation-6.12e-01

Pos logit contributions
 Kap+4.409
nings+4.350
ings+4.248
sung+4.118
idences+4.117
Neg logit contributions
iculty-5.546
aline-5.371
LOAD-5.059
\":-4.958
olo-4.727
Token total
Token ID2472
Feature activation-6.10e-01

Pos logit contributions
 Kap+4.960
nings+4.863
ings+4.746
idences+4.636
sung+4.558
Neg logit contributions
iculty-6.239
aline-6.148
\":-5.785
LOAD-5.778
olo-5.491
Token to
Token ID284
Feature activation-7.51e-01

Pos logit contributions
 Kap+4.934
nings+4.839
ings+4.771
idences+4.650
sung+4.588
Neg logit contributions
iculty-5.752
aline-5.679
\":-5.287
LOAD-5.284
olo-4.986
Token 166
Token ID26753
Feature activation-1.37e-01

Pos logit contributions
 Kap+6.052
nings+5.874
ings+5.753
idences+5.647
sung+5.544
Neg logit contributions
iculty-7.103
aline-6.891
LOAD-6.513
\":-6.362
olo-6.030
Token.
Token ID13
Feature activation-7.99e-01

Pos logit contributions
 Kap+1.210
nings+1.202
idences+1.170
ings+1.159
sung+1.157
Neg logit contributions
iculty-1.208
aline-1.176
LOAD-1.109
\":-1.088
olo-1.031
Token The
Token ID383
Feature activation-2.33e-01

Pos logit contributions
 Kap+6.104
nings+5.652
ings+5.618
idences+5.377
sung+5.314
Neg logit contributions
iculty-8.029
aline-7.758
\":-7.293
LOAD-7.243
olo-6.863
Token U
Token ID471
Feature activation-1.73e+00

Pos logit contributions
 Kap+2.017
nings+2.002
idences+1.949
ings+1.933
sung+1.924
Neg logit contributions
iculty-2.088
aline-2.035
LOAD-1.918
\":-1.887
olo-1.786
Token.
Token ID13
Feature activation-2.85e-01

Pos logit contributions
ings+9.210
nings+8.694
pps+8.602
sung+8.451
 Kap+8.125
Neg logit contributions
iculty-17.244
aline-16.724
 destro-16.476
\":-15.915
LOAD-15.464
TokenS
Token ID50
Feature activation-1.02e-01

Pos logit contributions
 Kap+2.149
nings+2.123
idences+2.048
ings+2.045
sung+2.033
Neg logit contributions
iculty-2.882
aline-2.815
LOAD-2.652
\":-2.621
olo-2.506
Token.,
Token ID1539
Feature activation-5.76e-01

Pos logit contributions
 Kap+0.907
nings+0.898
idences+0.877
ings+0.872
sung+0.867
Neg logit contributions
iculty-0.886
aline-0.860
LOAD-0.812
\":-0.798
olo-0.749
Token by
Token ID416
Feature activation-5.69e-02

Pos logit contributions
 Kap+4.721
nings+4.582
ings+4.473
sung+4.381
idences+4.367
Neg logit contributions
iculty-5.451
aline-5.320
LOAD-5.000
\":-4.929
olo-4.656
Token comparison
Token ID7208
Feature activation-2.01e-01

Pos logit contributions
 Kap+0.522
nings+0.520
idences+0.512
sung+0.505
ings+0.501
Neg logit contributions
iculty-0.473
aline-0.458
LOAD-0.435
\":-0.426
olo-0.402
Token dr
Token ID1553
Feature activation+6.23e-01

Pos logit contributions
nings+0.120
 Kap+0.118
idences+0.116
ings+0.115
sung+0.115
Neg logit contributions
iculty-0.131
aline-0.128
LOAD-0.121
\":-0.120
olo-0.113
Tokenab
Token ID397
Feature activation+9.12e-02

Pos logit contributions
aline+3.990
iculty+3.769
olo+3.245
\":+3.186
Published+3.050
Neg logit contributions
 Kap-6.804
nings-6.791
 Cth-6.448
idences-6.387
 unwittingly-6.382
Token brick
Token ID17214
Feature activation-1.04e-01

Pos logit contributions
iculty+0.830
aline+0.823
LOAD+0.770
\":+0.763
olo+0.730
Neg logit contributions
nings-0.755
 Kap-0.742
idences-0.734
sung-0.724
ings-0.714
Token building
Token ID2615
Feature activation+6.44e-02

Pos logit contributions
nings+0.892
 Kap+0.889
idences+0.864
sung+0.858
ings+0.849
Neg logit contributions
iculty-0.937
aline-0.916
LOAD-0.863
\":-0.852
olo-0.809
Token in
Token ID287
Feature activation-8.26e-01

Pos logit contributions
iculty+0.536
aline+0.523
LOAD+0.494
\":+0.484
olo+0.460
Neg logit contributions
nings-0.585
 Kap-0.582
idences-0.573
sung-0.565
ings-0.558
Token D
Token ID360
Feature activation-1.72e+00

Pos logit contributions
 Kap+6.343
nings+5.909
ings+5.775
sung+5.665
 Cth+5.582
Neg logit contributions
iculty-8.269
aline-8.012
LOAD-7.642
\":-7.441
olo-7.056
Tokenae
Token ID3609
Feature activation-1.35e+00

Pos logit contributions
sung+13.908
ings+13.407
 Kap+13.055
nings+12.670
pps+12.513
Neg logit contributions
iculty-13.119
aline-11.752
LOAD-11.696
\":-11.523
erate-10.908
Tokenje
Token ID18015
Feature activation-4.66e-01

Pos logit contributions
 Kap+7.868
sung+7.308
ings+7.099
nings+6.960
idences+6.902
Neg logit contributions
iculty-14.804
LOAD-14.405
aline-13.987
\":-13.343
erate-12.498
Tokenon
Token ID261
Feature activation-4.11e-01

Pos logit contributions
 Kap+4.356
sung+4.235
nings+4.228
ings+4.186
idences+4.178
Neg logit contributions
iculty-3.784
aline-3.657
LOAD-3.496
\":-3.400
Published-3.053
Token,
Token ID11
Feature activation-3.27e-01

Pos logit contributions
 Kap+3.309
nings+3.249
ings+3.151
sung+3.147
idences+3.134
Neg logit contributions
iculty-3.953
aline-3.839
LOAD-3.670
\":-3.599
olo-3.362
Token South
Token ID2520
Feature activation-9.81e-01

Pos logit contributions
 Kap+2.770
nings+2.724
idences+2.645
ings+2.639
sung+2.626
Neg logit contributions
iculty-3.006
aline-2.922
LOAD-2.762
\":-2.715
olo-2.575
Token February
Token ID3945
Feature activation-6.56e-01

Pos logit contributions
 Kap+3.849
nings+3.792
idences+3.663
ings+3.623
sung+3.604
Neg logit contributions
iculty-4.402
aline-4.284
LOAD-4.012
\":-3.987
olo-3.757
Token 2012
Token ID2321
Feature activation-9.53e-01

Pos logit contributions
 Kap+5.109
nings+4.946
ings+4.795
idences+4.720
sung+4.707
Neg logit contributions
iculty-6.441
aline-6.255
LOAD-5.882
\":-5.826
olo-5.463
Token report
Token ID989
Feature activation-2.03e-01

Pos logit contributions
 Kap+7.689
nings+7.364
ings+7.068
 Cth+6.994
idences+6.886
Neg logit contributions
iculty-8.825
aline-8.518
\":-7.992
LOAD-7.949
olo-7.404
Token from
Token ID422
Feature activation-7.52e-01

Pos logit contributions
 Kap+1.742
nings+1.727
idences+1.676
ings+1.669
sung+1.656
Neg logit contributions
iculty-1.825
aline-1.782
LOAD-1.678
\":-1.653
olo-1.562
Token the
Token ID262
Feature activation-8.88e-01

Pos logit contributions
 Kap+5.949
nings+5.710
ings+5.511
idences+5.447
 Cth+5.380
Neg logit contributions
iculty-7.172
aline-7.013
\":-6.608
LOAD-6.602
olo-6.098
Token U
Token ID471
Feature activation-1.71e+00

Pos logit contributions
 Kap+6.993
nings+6.666
ings+6.417
 Cth+6.251
idences+6.244
Neg logit contributions
iculty-8.482
aline-8.211
LOAD-7.801
\":-7.728
olo-7.193
Token.
Token ID13
Feature activation-2.90e-01

Pos logit contributions
pps+9.776
nings+9.434
ings+9.214
sung+9.006
 Kap+8.903
Neg logit contributions
iculty-17.243
aline-16.738
\":-15.806
LOAD-14.848
 destro-14.492
TokenS
Token ID50
Feature activation-3.48e-01

Pos logit contributions
 Kap+2.132
nings+2.100
idences+2.037
ings+2.020
sung+2.010
Neg logit contributions
iculty-2.979
aline-2.916
LOAD-2.761
\":-2.736
olo-2.591
Token.
Token ID13
Feature activation+7.32e-02

Pos logit contributions
 Kap+3.309
nings+3.254
idences+3.212
ings+3.190
sung+3.183
Neg logit contributions
iculty-2.774
aline-2.697
LOAD-2.536
\":-2.500
olo-2.270
Token Treasury
Token ID13419
Feature activation+4.07e-02

Pos logit contributions
iculty+0.660
aline+0.645
LOAD+0.612
\":+0.596
olo+0.580
Neg logit contributions
nings-0.616
 Kap-0.614
idences-0.597
sung-0.588
ings-0.588
Token Department
Token ID2732
Feature activation+4.91e-03

Pos logit contributions
iculty+0.369
aline+0.357
LOAD+0.338
\":+0.331
olo+0.317
Neg logit contributions
nings-0.347
 Kap-0.347
idences-0.337
ings-0.332
sung-0.331
Token.
Token ID13
Feature activation-6.41e-01

Pos logit contributions
 Kap+4.617
nings+4.501
ings+4.421
idences+4.356
sung+4.318
Neg logit contributions
iculty-5.163
aline-4.977
LOAD-4.711
\":-4.669
olo-4.365
Token
Token ID198
Feature activation-5.21e-01

Pos logit contributions
 Kap+4.742
nings+4.446
ings+4.378
sung+4.288
idences+4.265
Neg logit contributions
iculty-6.617
aline-6.425
LOAD-6.096
\":-6.005
olo-5.746
Token
Token ID198
Feature activation-1.50e-01

Pos logit contributions
 Kap+3.363
nings+3.245
ings+3.225
sung+3.070
idences+3.044
Neg logit contributions
iculty-5.798
aline-5.680
LOAD-5.372
\":-5.198
olo-5.082
Token5
Token ID20
Feature activation-1.05e-01

Pos logit contributions
 Kap+1.132
nings+1.123
idences+1.089
ings+1.077
sung+1.071
Neg logit contributions
iculty-1.509
aline-1.475
LOAD-1.401
\":-1.380
olo-1.315
Token)
Token ID8
Feature activation-1.19e+00

Pos logit contributions
 Kap+0.613
nings+0.608
idences+0.588
ings+0.574
sung+0.573
Neg logit contributions
iculty-1.226
aline-1.204
LOAD-1.152
\":-1.139
olo-1.094
Token D
Token ID360
Feature activation-1.70e+00

Pos logit contributions
 Kap+9.837
ings+8.740
 Cth+8.699
nings+8.684
sung+8.479
Neg logit contributions
iculty-10.651
aline-10.058
LOAD-9.717
\":-9.340
olo-8.931
Tokenhan
Token ID7637
Feature activation-1.17e-01

Pos logit contributions
ings+14.469
 Kap+14.267
sung+14.086
nings+13.797
idences+13.675
Neg logit contributions
iculty-11.490
LOAD-10.544
aline-10.249
\":-10.158
 DOI-9.337
Tokenwant
Token ID42949
Feature activation-1.99e-01

Pos logit contributions
nings+1.050
 Kap+1.050
idences+1.024
ings+1.009
sung+1.007
Neg logit contributions
iculty-1.001
aline-0.985
LOAD-0.908
\":-0.902
olo-0.861
Tokenri
Token ID380
Feature activation-7.17e-01

Pos logit contributions
 Kap+1.714
nings+1.710
idences+1.667
ings+1.647
sung+1.640
Neg logit contributions
iculty-1.783
aline-1.742
LOAD-1.633
\":-1.613
olo-1.530
Token :
Token ID1058
Feature activation-1.57e-01

Pos logit contributions
 Kap+5.629
nings+5.361
ings+5.267
sung+5.158
idences+5.115
Neg logit contributions
iculty-6.920
aline-6.710
LOAD-6.551
\":-6.181
Published-5.981
Token There
Token ID1318
Feature activation-4.42e-01

Pos logit contributions
 Kap+1.282
nings+1.281
idences+1.244
ings+1.230
sung+1.226
Neg logit contributions
iculty-1.471
aline-1.435
LOAD-1.355
\":-1.336
olo-1.267
Token save
Token ID3613
Feature activation+5.91e-02

Pos logit contributions
 Kap+1.439
nings+1.424
idences+1.377
ings+1.365
sung+1.355
Neg logit contributions
iculty-2.185
aline-2.137
LOAD-2.034
\":-2.007
olo-1.917
Token,
Token ID11
Feature activation-5.27e-01

Pos logit contributions
iculty+0.487
aline+0.473
LOAD+0.444
\":+0.444
Published+0.414
Neg logit contributions
 Kap-0.545
nings-0.544
idences-0.538
sung-0.528
rament-0.520
Token your
Token ID534
Feature activation-8.63e-01

Pos logit contributions
 Kap+3.122
nings+3.042
ings+2.945
idences+2.852
sung+2.826
Neg logit contributions
iculty-6.117
aline-6.034
LOAD-5.757
\":-5.619
olo-5.430
Token character
Token ID2095
Feature activation-3.28e-01

Pos logit contributions
 Kap+4.203
ings+4.006
nings+3.879
sung+3.621
 unwittingly+3.590
Neg logit contributions
iculty-10.659
aline-10.393
LOAD-10.192
\":-9.768
Published-9.441
Token now
Token ID783
Feature activation-9.97e-01

Pos logit contributions
 Kap+2.912
nings+2.899
idences+2.818
ings+2.805
sung+2.779
Neg logit contributions
iculty-2.860
aline-2.789
LOAD-2.628
\":-2.572
olo-2.431
Token receives
Token ID11583
Feature activation-1.68e+00

Pos logit contributions
 Kap+5.076
ings+4.957
nings+4.883
 unwittingly+4.485
idences+4.477
Neg logit contributions
aline-11.768
iculty-11.696
LOAD-11.349
\":-10.875
Published-10.704
Token secret
Token ID3200
Feature activation-1.49e+00

Pos logit contributions
 invis+7.554
ings+6.976
sung+6.803
nings+6.791
 Kap+6.723
Neg logit contributions
iculty-18.750
LOAD-18.350
aline-18.134
\":-17.459
aturdays-16.929
Token voices
Token ID10839
Feature activation-1.58e-02

Pos logit contributions
ings+6.993
nings+6.704
 invis+6.114
sung+6.097
 Kap+5.987
Neg logit contributions
iculty-17.596
LOAD-17.129
aline-17.078
\":-15.961
erate-15.944
Token in
Token ID287
Feature activation-2.46e-01

Pos logit contributions
 Kap+0.154
nings+0.153
idences+0.151
sung+0.149
ings+0.148
Neg logit contributions
iculty-0.122
aline-0.118
LOAD-0.111
\":-0.110
olo-0.102
Token his
Token ID465
Feature activation-1.64e-01

Pos logit contributions
 Kap+1.742
nings+1.725
idences+1.674
ings+1.649
sung+1.644
Neg logit contributions
iculty-2.594
aline-2.536
LOAD-2.413
\":-2.385
olo-2.277
Token or
Token ID393
Feature activation-6.16e-01

Pos logit contributions
 Kap+1.649
nings+1.639
idences+1.607
sung+1.586
ings+1.586
Neg logit contributions
iculty-1.242
aline-1.204
LOAD-1.118
\":-1.102
olo-1.030
Token held
Token ID2714
Feature activation-9.13e-01

Pos logit contributions
 Kap+0.222
nings+0.221
idences+0.217
sung+0.213
rament+0.210
Neg logit contributions
iculty-0.214
aline-0.209
LOAD-0.195
\":-0.193
olo-0.183
Token as
Token ID355
Feature activation-7.86e-01

Pos logit contributions
 Kap+7.072
ings+7.018
nings+6.867
 unwittingly+6.812
 invis+6.495
Neg logit contributions
iculty-8.568
aline-8.301
LOAD-8.027
\":-7.721
erate-7.312
Token '
Token ID705
Feature activation-1.53e-01

Pos logit contributions
 Kap+5.847
ings+5.710
nings+5.643
idences+5.467
 unwittingly+5.424
Neg logit contributions
iculty-7.812
aline-7.640
LOAD-7.163
\":-7.001
erate-6.666
Tokenobject
Token ID15252
Feature activation-2.36e-01

Pos logit contributions
 Kap+1.251
nings+1.238
idences+1.201
ings+1.184
sung+1.178
Neg logit contributions
iculty-1.452
aline-1.417
LOAD-1.342
\":-1.321
olo-1.257
Tokenively
Token ID2280
Feature activation-2.09e-01

Pos logit contributions
 Kap+1.862
nings+1.845
idences+1.792
ings+1.779
sung+1.767
Neg logit contributions
iculty-2.293
aline-2.238
LOAD-2.123
\":-2.089
olo-1.987
Token and
Token ID290
Feature activation-1.68e+00

Pos logit contributions
 Kap+1.420
nings+1.411
idences+1.362
ings+1.339
sung+1.339
Neg logit contributions
iculty-2.266
aline-2.218
LOAD-2.116
\":-2.089
olo-1.999
Token universally
Token ID26208
Feature activation-7.01e-01

Pos logit contributions
 unwittingly+11.372
ings+11.153
 Kap+10.992
 invis+10.873
 unintentionally+10.491
Neg logit contributions
aline-14.497
iculty-14.265
LOAD-13.968
\":-13.005
erate-12.550
Token true
Token ID2081
Feature activation-3.62e-01

Pos logit contributions
 Kap+4.924
ings+4.834
nings+4.715
sung+4.638
idences+4.604
Neg logit contributions
iculty-7.292
aline-7.115
LOAD-6.705
\":-6.539
olo-6.258
Token'
Token ID6
Feature activation+7.06e-02

Pos logit contributions
 Kap+3.109
nings+3.089
idences+2.999
ings+2.996
sung+2.961
Neg logit contributions
iculty-3.258
aline-3.175
LOAD-2.999
\":-2.924
olo-2.777
Token under
Token ID739
Feature activation-2.84e-02

Pos logit contributions
iculty+0.579
aline+0.571
\":+0.528
LOAD+0.527
olo+0.499
Neg logit contributions
 Kap-0.647
nings-0.642
idences-0.633
sung-0.628
rament-0.620
Token some
Token ID617
Feature activation+2.53e-01

Pos logit contributions
nings+0.261
 Kap+0.261
idences+0.256
sung+0.252
ings+0.250
Neg logit contributions
iculty-0.235
aline-0.229
LOAD-0.217
\":-0.213
olo-0.201
Token {
Token ID1391
Feature activation+2.72e-01

Pos logit contributions
 Kap+4.250
nings+4.194
ings+4.070
idences+4.067
sung+3.998
Neg logit contributions
iculty-3.735
aline-3.650
LOAD-3.409
\":-3.260
olo-3.107
Token(
Token ID7
Feature activation+1.02e-01

Pos logit contributions
iculty+2.258
aline+2.184
LOAD+2.067
\":+2.045
olo+1.929
Neg logit contributions
 Kap-2.503
nings-2.470
idences-2.425
sung-2.388
ings-2.378
Token8
Token ID23
Feature activation-1.04e-01

Pos logit contributions
iculty+0.958
aline+0.937
LOAD+0.893
\":+0.886
olo+0.838
Neg logit contributions
 Kap-0.825
nings-0.813
idences-0.799
sung-0.778
ings-0.774
Tokens
Token ID82
Feature activation-3.46e-01

Pos logit contributions
 Kap+0.681
nings+0.676
idences+0.651
ings+0.646
sung+0.640
Neg logit contributions
iculty-1.155
aline-1.131
LOAD-1.079
\":-1.064
olo-1.018
Token)}
Token ID38165
Feature activation-9.37e-01

Pos logit contributions
 Kap+1.862
nings+1.843
idences+1.764
ings+1.754
sung+1.720
Neg logit contributions
iculty-4.217
aline-4.120
LOAD-3.978
\":-3.926
olo-3.741
Token R
Token ID371
Feature activation-1.68e+00

Pos logit contributions
 Kap+6.980
nings+6.609
ings+6.347
 Cth+6.256
idences+6.250
Neg logit contributions
iculty-9.342
aline-9.205
LOAD-8.637
\":-8.142
Published-7.924
Tokenf
Token ID69
Feature activation-5.39e-01

Pos logit contributions
nings+10.753
ings+10.575
pps+10.150
sung+10.002
Ky+9.876
Neg logit contributions
iculty-15.010
aline-14.607
LOAD-14.165
 DOI-12.929
\":-12.812
Token8
Token ID23
Feature activation-2.80e-01

Pos logit contributions
nings+3.600
 Kap+3.586
ings+3.514
idences+3.469
sung+3.463
Neg logit contributions
iculty-5.748
aline-5.663
LOAD-5.398
\":-5.276
Published-5.013
Token {
Token ID1391
Feature activation+1.95e-01

Pos logit contributions
 Kap+2.728
nings+2.709
idences+2.640
ings+2.628
sung+2.608
Neg logit contributions
iculty-2.204
aline-2.138
LOAD-1.998
\":-1.953
olo-1.830
Token(
Token ID7
Feature activation+1.37e-01

Pos logit contributions
iculty+1.518
aline+1.463
LOAD+1.401
\":+1.399
Published+1.307
Neg logit contributions
 Kap-1.887
nings-1.857
idences-1.848
sung-1.797
rament-1.793
Token206
Token ID22136
Feature activation-5.10e-01

Pos logit contributions
iculty+1.389
aline+1.360
LOAD+1.305
\":+1.292
olo+1.229
Neg logit contributions
 Kap-1.006
nings-0.988
idences-0.976
sung-0.943
ings-0.936
Token,
Token ID11
Feature activation-1.00e+00

Pos logit contributions
 Kap+7.578
nings+7.318
ings+7.049
idences+6.718
 unwittingly+6.616
Neg logit contributions
iculty-10.446
aline-10.068
LOAD-9.593
\":-9.588
olo-8.740
Token Google
Token ID3012
Feature activation-8.40e-01

Pos logit contributions
 Kap+8.053
nings+7.498
ings+7.330
 Cth+7.179
idences+7.090
Neg logit contributions
iculty-9.373
aline-8.906
\":-8.537
LOAD-8.463
olo-7.983
Token disclosed
Token ID16404
Feature activation-3.41e-01

Pos logit contributions
 Kap+7.160
nings+6.921
ings+6.756
sung+6.689
 unwittingly+6.663
Neg logit contributions
iculty-7.527
aline-7.121
\":-6.811
LOAD-6.739
erate-6.254
Token that
Token ID326
Feature activation-4.92e-01

Pos logit contributions
 Kap+2.966
nings+2.929
ings+2.841
idences+2.838
sung+2.812
Neg logit contributions
iculty-3.043
aline-2.963
LOAD-2.796
\":-2.752
olo-2.600
Token the
Token ID262
Feature activation-4.33e-01

Pos logit contributions
 Kap+4.297
nings+4.156
ings+4.053
idences+4.041
sung+4.006
Neg logit contributions
iculty-4.395
aline-4.265
LOAD-4.027
\":-4.002
olo-3.747
Token U
Token ID471
Feature activation-1.67e+00

Pos logit contributions
 Kap+3.796
nings+3.699
idences+3.601
ings+3.601
sung+3.569
Neg logit contributions
iculty-3.863
aline-3.762
LOAD-3.541
\":-3.486
olo-3.304
Token.
Token ID13
Feature activation-3.60e-01

Pos logit contributions
ings+8.870
pps+8.705
nings+8.452
sung+8.451
 Kap+8.309
Neg logit contributions
iculty-17.012
aline-16.564
 destro-16.127
\":-15.283
LOAD-14.964
TokenS
Token ID50
Feature activation+1.22e-01

Pos logit contributions
 Kap+2.500
nings+2.453
sung+2.387
idences+2.368
ings+2.356
Neg logit contributions
iculty-3.834
aline-3.728
LOAD-3.554
\":-3.536
olo-3.340
Token.
Token ID13
Feature activation+2.83e-01

Pos logit contributions
iculty+0.974
aline+0.946
LOAD+0.885
\":+0.869
olo+0.817
Neg logit contributions
 Kap-1.166
nings-1.158
idences-1.130
ings-1.122
sung-1.116
Token Federal
Token ID5618
Feature activation+2.00e-01

Pos logit contributions
iculty+2.216
aline+2.173
LOAD+2.053
\":+2.003
olo+1.915
Neg logit contributions
nings-2.661
 Kap-2.591
idences-2.553
rament-2.519
pai-2.514
Token Trade
Token ID9601
Feature activation+6.02e-01

Pos logit contributions
iculty+1.779
aline+1.747
LOAD+1.643
\":+1.622
olo+1.555
Neg logit contributions
nings-1.677
 Kap-1.656
sung-1.607
idences-1.604
ings-1.589
Token Singh
Token ID14403
Feature activation-2.22e-01

Pos logit contributions
iculty+1.068
aline+1.066
LOAD+0.968
\":+0.964
olo+0.953
Neg logit contributions
nings-1.183
 Kap-1.161
idences-1.147
sung-1.123
ings-1.121
Token Dal
Token ID12348
Feature activation-2.12e-01

Pos logit contributions
 Kap+1.862
nings+1.848
idences+1.798
ings+1.783
sung+1.775
Neg logit contributions
iculty-2.037
aline-1.986
LOAD-1.875
\":-1.846
olo-1.751
Tokenal
Token ID282
Feature activation-3.97e-02

Pos logit contributions
 Kap+1.807
nings+1.792
idences+1.744
ings+1.730
sung+1.721
Neg logit contributions
iculty-1.924
aline-1.875
LOAD-1.769
\":-1.741
olo-1.650
Token (
Token ID357
Feature activation-9.00e-01

Pos logit contributions
nings+0.334
 Kap+0.332
idences+0.327
sung+0.323
ings+0.320
Neg logit contributions
iculty-0.358
aline-0.353
LOAD-0.330
\":-0.326
olo-0.313
TokenRet
Token ID9781
Feature activation-8.65e-01

Pos logit contributions
 Kap+7.683
nings+7.475
sung+7.413
ings+7.350
idences+7.267
Neg logit contributions
iculty-7.698
aline-7.447
LOAD-6.845
\":-6.817
olo-6.464
Tokend
Token ID67
Feature activation-1.67e+00

Pos logit contributions
 Kap+6.227
nings+5.923
idences+5.831
ings+5.821
rament+5.630
Neg logit contributions
iculty-8.739
aline-8.344
LOAD-8.268
\":-8.197
olo-7.475
Token)
Token ID8
Feature activation-1.98e-01

Pos logit contributions
 Kap+11.780
ings+10.935
nings+10.712
idences+10.584
sung+10.238
Neg logit contributions
iculty-14.467
aline-13.909
LOAD-13.709
\":-13.287
easy-12.116
Token to
Token ID284
Feature activation-2.84e-01

Pos logit contributions
 Kap+1.680
nings+1.672
idences+1.629
ings+1.612
sung+1.607
Neg logit contributions
iculty-1.801
aline-1.758
LOAD-1.659
\":-1.636
olo-1.548
Token stir
Token ID11240
Feature activation-6.22e-01

Pos logit contributions
 Kap+2.362
nings+2.341
idences+2.275
ings+2.259
sung+2.246
Neg logit contributions
iculty-2.633
aline-2.567
LOAD-2.425
\":-2.387
olo-2.264
Token up
Token ID510
Feature activation-5.87e-01

Pos logit contributions
 Kap+4.748
nings+4.622
ings+4.540
idences+4.465
sung+4.406
Neg logit contributions
iculty-6.176
aline-5.915
LOAD-5.729
\":-5.587
olo-5.316
Token trouble
Token ID5876
Feature activation+6.14e-02

Pos logit contributions
 Kap+5.145
nings+5.015
ings+4.890
idences+4.852
sung+4.765
Neg logit contributions
iculty-5.201
aline-5.014
LOAD-4.781
\":-4.670
olo-4.432
Token current
Token ID1459
Feature activation+7.28e-01

Pos logit contributions
aline+7.047
iculty+6.701
\":+6.646
LOAD+6.431
 DOI+6.323
Neg logit contributions
idences-8.870
sung-8.778
nings-8.741
 dstg-8.709
 Kap-8.675
Token 6
Token ID718
Feature activation+5.44e-01

Pos logit contributions
aline+5.608
iculty+5.396
\":+5.252
LOAD+5.183
 DOI+4.929
Neg logit contributions
idences-6.432
sung-6.331
nings-6.323
 Kap-6.199
 dstg-6.101
Token.
Token ID13
Feature activation-1.58e-01

Pos logit contributions
iculty+4.162
aline+4.148
\":+3.952
LOAD+3.895
olo+3.576
Neg logit contributions
idences-4.978
 Kap-4.951
sung-4.911
nings-4.903
rament-4.890
Token2
Token ID17
Feature activation+8.62e-01

Pos logit contributions
 Kap+1.124
nings+1.121
idences+1.094
sung+1.074
ings+1.069
Neg logit contributions
iculty-1.655
aline-1.617
LOAD-1.551
\":-1.531
olo-1.456
Token-
Token ID12
Feature activation+6.05e-01

Pos logit contributions
iculty+6.330
aline+6.186
\":+5.927
LOAD+5.909
olo+5.318
Neg logit contributions
 Kap-7.877
sung-7.818
rament-7.808
idences-7.794
nings-7.727
Tokenliter
Token ID17201
Feature activation+1.05e+00

Pos logit contributions
iculty+5.828
aline+5.733
LOAD+5.556
\":+5.441
Published+5.094
Neg logit contributions
 Kap-4.441
idences-4.303
nings-4.262
sung-4.151
 Cth-4.151
Token V
Token ID569
Feature activation+1.58e-01

Pos logit contributions
aline+7.680
iculty+7.317
\":+7.089
LOAD+7.048
 DOI+6.984
Neg logit contributions
idences-9.250
sung-8.922
pps-8.725
 dstg-8.656
nings-8.643
Token8
Token ID23
Feature activation+9.95e-01

Pos logit contributions
iculty+1.550
aline+1.521
LOAD+1.457
\":+1.431
olo+1.365
Neg logit contributions
 Kap-1.216
nings-1.207
idences-1.170
sung-1.169
ings-1.147
Token,
Token ID11
Feature activation+1.73e-01

Pos logit contributions
iculty+6.678
aline+6.615
\":+6.412
LOAD+6.349
 DOI+5.889
Neg logit contributions
idences-9.194
nings-8.907
pai-8.839
 Kap-8.790
 dstg-8.702
Token which
Token ID543
Feature activation+2.05e-01

Pos logit contributions
iculty+1.504
aline+1.496
LOAD+1.408
\":+1.391
olo+1.303
Neg logit contributions
nings-1.474
 Kap-1.459
idences-1.449
sung-1.428
ings-1.413
Token is
Token ID318
Feature activation+3.60e-01

Pos logit contributions
iculty+1.812
aline+1.781
LOAD+1.677
\":+1.666
olo+1.554
Neg logit contributions
nings-1.734
 Kap-1.726
idences-1.719
sung-1.694
ings-1.659
Token votes
Token ID5690
Feature activation+1.05e-01

Pos logit contributions
iculty+1.085
aline+1.033
LOAD+1.005
\":+0.986
olo+0.925
Neg logit contributions
 Kap-0.962
nings-0.952
idences-0.948
sung-0.930
rament-0.908
Token.
Token ID13
Feature activation+5.31e-02

Pos logit contributions
iculty+0.887
aline+0.858
LOAD+0.811
\":+0.805
olo+0.755
Neg logit contributions
 Kap-0.943
nings-0.932
idences-0.914
sung-0.905
ings-0.895
Token Georgia
Token ID7859
Feature activation-4.44e-01

Pos logit contributions
iculty+0.484
aline+0.471
LOAD+0.450
\":+0.444
Published+0.416
Neg logit contributions
nings-0.446
 Kap-0.438
idences-0.432
sung-0.427
ings-0.425
Token Rep
Token ID1432
Feature activation-1.19e-01

Pos logit contributions
 Kap+3.605
nings+3.543
idences+3.469
ings+3.460
sung+3.399
Neg logit contributions
iculty-4.174
aline-4.075
LOAD-3.852
\":-3.783
olo-3.588
Token.
Token ID13
Feature activation+3.52e-01

Pos logit contributions
 Kap+1.004
nings+0.986
idences+0.967
ings+0.965
sung+0.956
Neg logit contributions
iculty-1.099
aline-1.066
LOAD-1.024
\":-0.996
olo-0.937
Token St
Token ID520
Feature activation+1.13e+00

Pos logit contributions
iculty+2.828
aline+2.817
\":+2.635
LOAD+2.627
Published+2.462
Neg logit contributions
nings-3.196
idences-3.146
sung-3.045
 Kap-3.030
rament-3.004
Tokenacey
Token ID25415
Feature activation+2.13e-01

Pos logit contributions
aline+6.149
iculty+5.268
olo+5.029
\":+4.384
LOAD+4.333
Neg logit contributions
 Kap-12.533
nings-12.466
 unwittingly-12.134
 dstg-12.085
ğ-11.913
Token Evans
Token ID13922
Feature activation+5.34e-02

Pos logit contributions
iculty+1.328
aline+1.303
\":+1.193
LOAD+1.160
olo+1.102
Neg logit contributions
nings-2.375
 Kap-2.322
idences-2.316
sung-2.273
ings-2.254
Token (
Token ID357
Feature activation-3.81e-01

Pos logit contributions
iculty+0.473
aline+0.463
LOAD+0.437
\":+0.433
olo+0.411
Neg logit contributions
nings-0.460
 Kap-0.460
idences-0.447
sung-0.442
ings-0.441
TokenD
Token ID35
Feature activation-1.04e+00

Pos logit contributions
 Kap+3.419
nings+3.377
ings+3.307
idences+3.290
sung+3.282
Neg logit contributions
iculty-3.273
aline-3.153
LOAD-2.947
\":-2.918
olo-2.744
Token)
Token ID8
Feature activation-6.24e-01

Pos logit contributions
 Kap+6.905
ings+6.727
nings+6.661
idences+6.449
pps+6.367
Neg logit contributions
iculty-10.498
aline-10.002
LOAD-9.862
\":-9.526
Published-8.956
Token successful
Token ID4388
Feature activation+6.60e-01

Pos logit contributions
iculty+4.208
aline+4.181
LOAD+3.978
\":+3.861
olo+3.667
Neg logit contributions
 Kap-4.068
idences-4.057
nings-4.056
sung-3.992
rament-3.888
Token strategy
Token ID4811
Feature activation+3.54e-01

Pos logit contributions
iculty+5.447
aline+5.421
LOAD+5.167
\":+5.030
olo+4.823
Neg logit contributions
idences-5.464
nings-5.418
sung-5.383
 Kap-5.382
rament-5.244
Token to
Token ID284
Feature activation+7.61e-01

Pos logit contributions
iculty+2.825
aline+2.793
LOAD+2.605
\":+2.599
olo+2.448
Neg logit contributions
 Kap-3.246
idences-3.204
nings-3.186
sung-3.126
rament-3.094
Token block
Token ID2512
Feature activation+1.56e+00

Pos logit contributions
iculty+6.174
aline+6.156
\":+5.613
LOAD+5.561
olo+5.322
Neg logit contributions
idences-6.390
nings-6.278
 Kap-6.241
sung-6.229
rament-6.047
Token and
Token ID290
Feature activation+8.47e-01

Pos logit contributions
aline+10.479
iculty+10.326
LOAD+9.925
 anytime+9.529
\":+9.446
Neg logit contributions
nings-12.337
idences-12.151
sung-12.043
gart-11.875
rament-11.532
Token blame
Token ID8138
Feature activation+1.31e+00

Pos logit contributions
iculty+6.959
aline+6.837
\":+6.282
LOAD+6.143
 DOI+5.894
Neg logit contributions
sung-7.016
idences-7.011
 Kap-6.917
nings-6.913
rament-6.669
Token President
Token ID1992
Feature activation+2.25e-01

Pos logit contributions
aline+8.918
iculty+8.684
LOAD+8.609
\":+8.307
erate+7.974
Neg logit contributions
nings-11.143
idences-10.945
sung-10.784
gart-10.446
rament-10.264
Token Obama
Token ID2486
Feature activation+1.21e-02

Pos logit contributions
iculty+1.749
aline+1.722
LOAD+1.622
\":+1.590
olo+1.501
Neg logit contributions
nings-2.150
 Kap-2.131
idences-2.119
sung-2.075
ings-2.055
Token.
Token ID13
Feature activation+1.06e-01

Pos logit contributions
iculty+0.108
aline+0.106
LOAD+0.100
\":+0.097
olo+0.093
Neg logit contributions
 Kap-0.105
nings-0.103
idences-0.102
sung-0.100
ings-0.099
Token Their
Token ID5334
Feature activation+4.78e-01

Pos logit contributions
iculty+0.976
aline+0.959
LOAD+0.921
\":+0.906
Published+0.851
Neg logit contributions
nings-0.866
idences-0.851
 Kap-0.847
sung-0.840
rament-0.828
Token anger
Token ID8993
Feature activation+6.18e-01

Pos logit contributions
iculty+4.005
aline+3.901
LOAD+3.719
\":+3.594
olo+3.398
Neg logit contributions
 Kap-4.102
idences-4.083
nings-4.079
sung-4.017
 dstg-3.929
Token going
Token ID1016
Feature activation-3.28e-01

Pos logit contributions
 Kap+1.453
nings+1.447
idences+1.411
ings+1.392
sung+1.390
Neg logit contributions
iculty-1.432
aline-1.395
LOAD-1.316
\":-1.303
olo-1.222
Token to
Token ID284
Feature activation+1.34e-01

Pos logit contributions
 Kap+2.595
nings+2.570
ings+2.495
idences+2.493
sung+2.464
Neg logit contributions
iculty-3.166
aline-3.085
LOAD-2.926
\":-2.863
olo-2.754
Token have
Token ID423
Feature activation+2.71e-01

Pos logit contributions
iculty+1.191
aline+1.168
\":+1.096
LOAD+1.086
olo+1.034
Neg logit contributions
 Kap-1.146
nings-1.142
ings-1.109
idences-1.107
sung-1.106
Token the
Token ID262
Feature activation+5.65e-01

Pos logit contributions
iculty+2.219
aline+2.149
\":+2.026
LOAD+2.019
olo+1.860
Neg logit contributions
nings-2.484
 Kap-2.443
idences-2.411
sung-2.392
ings-2.363
Token incredible
Token ID8082
Feature activation+9.79e-01

Pos logit contributions
iculty+4.614
aline+4.392
\":+4.252
LOAD+4.128
olo+3.855
Neg logit contributions
nings-4.945
idences-4.874
 Kap-4.842
ings-4.805
sung-4.789
Token excitement
Token ID14067
Feature activation+1.13e+00

Pos logit contributions
iculty+8.308
aline+8.003
\":+7.805
LOAD+7.407
olo+7.343
Neg logit contributions
nings-7.418
sung-7.344
idences-7.327
pai-7.223
 dstg-7.015
Token among
Token ID1871
Feature activation-3.81e-02

Pos logit contributions
\":+7.698
iculty+7.679
aline+7.639
LOAD+7.326
olo+6.986
Neg logit contributions
sung-9.652
idences-9.394
 Kap-9.370
nings-9.357
pai-9.344
Token 18
Token ID1248
Feature activation+2.88e-01

Pos logit contributions
 Kap+0.323
nings+0.323
idences+0.316
sung+0.311
ings+0.310
Neg logit contributions
iculty-0.343
aline-0.335
LOAD-0.318
\":-0.314
olo-0.295
Token-
Token ID12
Feature activation+1.77e-01

Pos logit contributions
iculty+2.523
aline+2.414
LOAD+2.312
\":+2.304
olo+2.149
Neg logit contributions
nings-2.495
 Kap-2.486
idences-2.449
sung-2.389
rament-2.379
Token to
Token ID284
Feature activation+5.45e-02

Pos logit contributions
iculty+1.576
aline+1.533
LOAD+1.452
\":+1.438
Published+1.344
Neg logit contributions
 Kap-1.518
nings-1.512
idences-1.480
ings-1.435
sung-1.434
Token 22
Token ID2534
Feature activation+9.13e-01

Pos logit contributions
iculty+0.453
aline+0.434
LOAD+0.411
\":+0.409
olo+0.384
Neg logit contributions
nings-0.506
 Kap-0.502
idences-0.494
sung-0.481
ings-0.481
Token Bl
Token ID1086
Feature activation+1.31e-01

Pos logit contributions
iculty+4.698
aline+4.592
\":+4.281
LOAD+4.166
olo+4.024
Neg logit contributions
nings-7.781
idences-7.531
rament-7.404
 dstg-7.372
ings-7.242
Tokenume
Token ID2454
Feature activation+3.26e-01

Pos logit contributions
iculty+0.922
aline+0.898
\":+0.801
LOAD+0.791
olo+0.763
Neg logit contributions
 Kap-1.398
nings-1.374
idences-1.331
sung-1.318
 Cth-1.313
Token,
Token ID11
Feature activation+8.30e-01

Pos logit contributions
iculty+2.750
aline+2.691
LOAD+2.508
\":+2.506
olo+2.414
Neg logit contributions
nings-2.886
idences-2.839
 Kap-2.835
sung-2.772
rament-2.766
Token is
Token ID318
Feature activation+1.39e+00

Pos logit contributions
iculty+6.339
aline+6.324
Published+5.909
 DOI+5.889
\":+5.869
Neg logit contributions
idences-7.008
nings-6.981
rament-6.824
 dstg-6.781
pai-6.776
Token published
Token ID3199
Feature activation+1.06e+00

Pos logit contributions
iculty+7.531
Published+7.084
aline+6.573
 DOI+6.564
olo+6.526
Neg logit contributions
idences-14.200
nings-13.761
rament-13.524
 dstg-13.474
sung-13.150
Token on
Token ID319
Feature activation+1.38e+00

Pos logit contributions
iculty+6.697
aline+6.274
LOAD+6.087
\":+5.969
 DOI+5.846
Neg logit contributions
idences-10.413
 dstg-10.166
nings-10.144
sung-9.911
pai-9.888
Token 4
Token ID604
Feature activation+8.12e-01

Pos logit contributions
aline+9.923
iculty+9.850
 DOI+9.327
LOAD+9.101
Published+9.063
Neg logit contributions
nings-11.555
rament-11.275
 dstg-11.067
idences-10.979
pai-10.854
Token June
Token ID2795
Feature activation+4.73e-01

Pos logit contributions
iculty+4.998
aline+4.938
LOAD+4.501
\":+4.445
olo+4.222
Neg logit contributions
 Kap-8.413
nings-8.382
rament-8.381
idences-8.363
 dstg-8.208
Token by
Token ID416
Feature activation+5.08e-01

Pos logit contributions
iculty+3.888
aline+3.781
LOAD+3.617
\":+3.596
 DOI+3.452
Neg logit contributions
idences-4.164
nings-4.130
 Kap-4.022
sung-4.011
 dstg-3.940
Token Pic
Token ID15085
Feature activation+3.46e-02

Pos logit contributions
iculty+4.382
aline+4.355
\":+4.047
LOAD+4.002
Published+3.871
Neg logit contributions
nings-4.260
idences-4.180
sung-4.114
pai-4.056
rament-4.018
Tokenador
Token ID7079
Feature activation+1.05e+00

Pos logit contributions
iculty+0.187
aline+0.183
LOAD+0.159
\":+0.155
olo+0.147
Neg logit contributions
 Kap-0.420
nings-0.419
idences-0.413
sung-0.411
ings-0.406
TokenN
Token ID45
Feature activation-8.32e-03

Pos logit contributions
iculty+1.013
aline+1.003
LOAD+0.963
\":+0.932
olo+0.899
Neg logit contributions
nings-0.940
 Kap-0.934
idences-0.919
sung-0.894
ings-0.893
Token.
Token ID13
Feature activation+2.59e-01

Pos logit contributions
 Kap+0.074
nings+0.074
idences+0.072
ings+0.071
sung+0.070
Neg logit contributions
iculty-0.072
aline-0.070
LOAD-0.066
\":-0.065
olo-0.062
Token
Token ID198
Feature activation-1.33e-01

Pos logit contributions
iculty+2.422
aline+2.386
LOAD+2.291
\":+2.223
Published+2.133
Neg logit contributions
nings-2.051
 Kap-2.021
idences-2.019
rament-1.969
ings-1.965
Token
Token ID198
Feature activation+1.11e-01

Pos logit contributions
 Kap+0.997
nings+0.990
idences+0.963
ings+0.947
sung+0.946
Neg logit contributions
iculty-1.343
aline-1.313
LOAD-1.248
\":-1.235
olo-1.172
Token
Token ID447
Feature activation+2.88e-01

Pos logit contributions
iculty+0.986
aline+0.972
LOAD+0.944
\":+0.914
Published+0.868
Neg logit contributions
 Kap-0.945
nings-0.935
idences-0.926
rament-0.902
ings-0.898
Token
Token ID250
Feature activation+3.62e-01

Pos logit contributions
iculty+2.448
aline+2.353
LOAD+2.259
\":+2.211
olo+2.063
Neg logit contributions
 Kap-2.593
nings-2.580
idences-2.519
sung-2.491
ings-2.456
TokenNo
Token ID2949
Feature activation+4.69e-01

Pos logit contributions
iculty+3.119
aline+3.015
LOAD+2.960
\":+2.867
Published+2.726
Neg logit contributions
 Kap-3.128
nings-3.125
idences-3.098
rament-3.038
 unwittingly-3.001
Token news
Token ID1705
Feature activation+5.65e-01

Pos logit contributions
iculty+3.755
aline+3.663
LOAD+3.498
\":+3.431
olo+3.291
Neg logit contributions
nings-4.294
 Kap-4.227
idences-4.120
sung-4.107
rament-4.065
Token here
Token ID994
Feature activation+1.20e-01

Pos logit contributions
iculty+4.599
aline+4.551
LOAD+4.460
\":+4.381
olo+4.120
Neg logit contributions
idences-4.787
nings-4.781
 Kap-4.770
sung-4.691
rament-4.549
Token,
Token ID11
Feature activation+2.82e-01

Pos logit contributions
iculty+1.016
aline+0.997
LOAD+0.943
\":+0.928
olo+0.875
Neg logit contributions
nings-1.070
 Kap-1.070
idences-1.050
sung-1.037
ings-1.022
Token
Token ID447
Feature activation+1.67e-01

Pos logit contributions
iculty+2.450
aline+2.405
LOAD+2.299
\":+2.252
olo+2.118
Neg logit contributions
nings-2.421
 Kap-2.350
idences-2.340
sung-2.300
ings-2.283
Tokenimage
Token ID9060
Feature activation-2.59e-01

Pos logit contributions
iculty+1.649
aline+1.648
LOAD+1.452
\":+1.419
olo+1.344
Neg logit contributions
 Kap-3.261
nings-3.233
 Cth-3.183
idences-3.179
rament-3.119
TokenPath
Token ID15235
Feature activation-4.07e-01

Pos logit contributions
 Kap+2.114
nings+2.099
ings+2.033
idences+2.025
sung+2.013
Neg logit contributions
iculty-2.438
aline-2.378
LOAD-2.228
\":-2.202
olo-2.095
Token)
Token ID8
Feature activation-6.41e-02

Pos logit contributions
 Kap+2.010
nings+1.977
ings+1.931
sung+1.869
idences+1.853
Neg logit contributions
iculty-5.112
aline-4.998
LOAD-4.796
\":-4.754
olo-4.551
Token image
Token ID2939
Feature activation-2.78e-01

Pos logit contributions
 Kap+0.614
nings+0.611
idences+0.597
ings+0.591
sung+0.590
Neg logit contributions
iculty-0.512
aline-0.498
LOAD-0.466
\":-0.458
olo-0.430
Token =
Token ID796
Feature activation+7.61e-02

Pos logit contributions
 Kap+2.305
nings+2.284
ings+2.214
sung+2.196
idences+2.194
Neg logit contributions
iculty-2.584
aline-2.515
LOAD-2.354
\":-2.317
olo-2.210
Token im
Token ID545
Feature activation+8.05e-01

Pos logit contributions
iculty+0.868
aline+0.850
LOAD+0.810
\":+0.805
olo+0.766
Neg logit contributions
nings-0.464
 Kap-0.460
idences-0.445
ings-0.435
sung-0.433
Tokenutils
Token ID26791
Feature activation+4.31e-01

Pos logit contributions
iculty+6.615
aline+6.426
LOAD+6.060
\":+5.854
olo+5.713
Neg logit contributions
 Kap-7.105
 Cth-6.801
rament-6.567
nings-6.553
sung-6.518
Token.
Token ID13
Feature activation+3.65e-01

Pos logit contributions
iculty+3.131
aline+3.041
\":+2.899
LOAD+2.888
Published+2.710
Neg logit contributions
nings-4.228
idences-4.218
 Kap-4.216
 Cth-4.109
sung-4.080
Tokenres
Token ID411
Feature activation+1.22e+00

Pos logit contributions
iculty+4.436
aline+4.352
LOAD+4.189
\":+4.086
erate+3.941
Neg logit contributions
 Kap-1.967
nings-1.903
idences-1.867
 Cth-1.835
sung-1.791
Tokenize
Token ID1096
Feature activation+5.12e-01

Pos logit contributions
iculty+5.104
aline+5.055
olo+4.157
LOAD+3.956
\":+3.827
Neg logit contributions
 Cth-15.205
 Kap-14.722
 glim-14.662
 unwittingly-14.239
rament-14.072
Token(
Token ID7
Feature activation+5.39e-01

Pos logit contributions
iculty+3.477
aline+3.335
LOAD+3.302
\":+3.276
Published+3.060
Neg logit contributions
 Kap-5.248
idences-5.184
nings-5.163
rament-5.075
 Cth-5.045
Token-
Token ID12
Feature activation+6.08e-01

Pos logit contributions
aline+6.358
iculty+6.116
\":+5.730
LOAD+5.695
olo+5.638
Neg logit contributions
idences-6.881
nings-6.600
rament-6.559
 Kap-6.489
ğ-6.415
Tokentrack
Token ID11659
Feature activation+6.69e-01

Pos logit contributions
aline+4.805
iculty+4.777
LOAD+4.367
\":+4.353
olo+4.121
Neg logit contributions
 Kap-5.561
idences-5.511
nings-5.465
 Cth-5.403
ğ-5.399
Token session
Token ID6246
Feature activation+5.23e-02

Pos logit contributions
aline+5.430
iculty+5.366
\":+5.036
LOAD+4.943
olo+4.760
Neg logit contributions
idences-5.618
nings-5.515
rament-5.459
sung-5.431
 Kap-5.417
Token at
Token ID379
Feature activation+5.61e-01

Pos logit contributions
iculty+0.431
aline+0.426
LOAD+0.396
\":+0.389
olo+0.370
Neg logit contributions
 Kap-0.477
nings-0.473
idences-0.470
sung-0.460
ings-0.454
Token Circuit
Token ID13588
Feature activation+5.44e-01

Pos logit contributions
aline+4.704
iculty+4.610
\":+4.322
LOAD+4.220
olo+4.130
Neg logit contributions
idences-4.770
nings-4.720
sung-4.603
ings-4.551
 Kap-4.534
Token Paul
Token ID3362
Feature activation+2.48e-01

Pos logit contributions
aline+4.479
iculty+4.461
\":+4.100
olo+4.004
LOAD+3.987
Neg logit contributions
nings-4.745
sung-4.685
idences-4.684
 Kap-4.672
rament-4.496
Token Ric
Token ID15868
Feature activation+6.21e-01

Pos logit contributions
iculty+2.110
aline+2.099
LOAD+1.897
\":+1.884
olo+1.882
Neg logit contributions
nings-2.168
 Kap-2.163
idences-2.110
sung-2.086
 Cth-2.053
Tokenard
Token ID446
Feature activation+1.04e+00

Pos logit contributions
aline+4.306
iculty+4.279
olo+4.026
\":+3.674
LOAD+3.434
Neg logit contributions
 Kap-6.170
idences-6.103
nings-6.018
 Cth-5.935
sung-5.898
Token.
Token ID13
Feature activation+3.61e-01

Pos logit contributions
aline+8.321
iculty+8.155
olo+7.702
\":+7.310
LOAD+7.229
Neg logit contributions
 Kap-8.391
idences-8.369
 Cth-8.255
sung-8.173
nings-8.135
Token
Token ID198
Feature activation+4.63e-02

Pos logit contributions
aline+3.245
iculty+3.211
LOAD+3.018
\":+2.990
olo+2.881
Neg logit contributions
idences-2.938
nings-2.902
 Kap-2.820
rament-2.799
 unwittingly-2.775
Token
Token ID198
Feature activation+3.51e-01

Pos logit contributions
iculty+0.442
aline+0.432
\":+0.412
LOAD+0.412
Published+0.386
Neg logit contributions
nings-0.365
 Kap-0.365
idences-0.359
sung-0.350
rament-0.346
Token could
Token ID714
Feature activation-1.47e-01

Pos logit contributions
 Kap+3.525
nings+3.474
ings+3.384
idences+3.322
sung+3.282
Neg logit contributions
iculty-4.711
aline-4.624
LOAD-4.362
\":-4.288
olo-4.094
Token pass
Token ID1208
Feature activation+1.19e-01

Pos logit contributions
 Kap+1.261
nings+1.246
idences+1.224
sung+1.210
ings+1.201
Neg logit contributions
iculty-1.330
aline-1.293
LOAD-1.214
\":-1.197
olo-1.134
Token the
Token ID262
Feature activation+2.21e-01

Pos logit contributions
iculty+0.994
aline+0.981
LOAD+0.917
\":+0.900
olo+0.844
Neg logit contributions
 Kap-1.075
nings-1.074
idences-1.057
sung-1.043
ings-1.022
Token Republican
Token ID3415
Feature activation+2.79e-01

Pos logit contributions
iculty+2.008
aline+1.967
LOAD+1.859
\":+1.818
olo+1.735
Neg logit contributions
nings-1.854
 Kap-1.836
idences-1.820
sung-1.775
ings-1.773
Token-
Token ID12
Feature activation+4.79e-01

Pos logit contributions
iculty+2.270
aline+2.230
LOAD+2.085
\":+2.064
olo+1.937
Neg logit contributions
 Kap-2.577
nings-2.546
idences-2.508
sung-2.476
rament-2.420
Tokenled
Token ID992
Feature activation+3.38e-01

Pos logit contributions
iculty+2.539
aline+2.536
LOAD+2.252
\":+2.130
Published+2.037
Neg logit contributions
 Kap-5.832
nings-5.663
idences-5.608
 Cth-5.562
rament-5.481
Token House
Token ID2097
Feature activation-7.76e-02

Pos logit contributions
iculty+2.982
aline+2.961
LOAD+2.774
\":+2.717
 DOI+2.633
Neg logit contributions
nings-2.849
 Kap-2.809
idences-2.774
rament-2.729
sung-2.709
Token by
Token ID416
Feature activation+4.96e-01

Pos logit contributions
 Kap+0.698
nings+0.694
idences+0.676
ings+0.668
sung+0.667
Neg logit contributions
iculty-0.669
aline-0.651
LOAD-0.612
\":-0.601
olo-0.568
Token a
Token ID257
Feature activation+5.53e-01

Pos logit contributions
iculty+3.628
aline+3.464
\":+3.315
LOAD+3.282
Published+3.062
Neg logit contributions
nings-4.886
idences-4.869
 Kap-4.792
sung-4.748
rament-4.678
Token simple
Token ID2829
Feature activation+7.37e-01

Pos logit contributions
iculty+4.231
aline+4.111
\":+3.904
LOAD+3.839
 DOI+3.616
Neg logit contributions
nings-5.210
idences-5.115
 Kap-5.090
sung-5.074
ings-5.035
Token majority
Token ID3741
Feature activation+5.61e-01

Pos logit contributions
iculty+2.918
aline+2.718
LOAD+2.516
\":+2.487
 DOI+2.167
Neg logit contributions
idences-9.482
nings-9.400
 Kap-9.308
sung-9.249
pai-9.056
Token is
Token ID318
Feature activation+4.67e-01

Pos logit contributions
 Kap+1.859
nings+1.843
idences+1.797
sung+1.775
ings+1.773
Neg logit contributions
iculty-2.038
aline-1.987
LOAD-1.874
\":-1.848
olo-1.754
Token open
Token ID1280
Feature activation+4.97e-01

Pos logit contributions
iculty+3.630
aline+3.616
\":+3.363
LOAD+3.346
olo+3.213
Neg logit contributions
idences-4.265
nings-4.197
 Kap-4.178
sung-4.096
ğ-4.034
Token to
Token ID284
Feature activation+1.51e+00

Pos logit contributions
iculty+3.974
aline+3.930
LOAD+3.766
\":+3.653
olo+3.379
Neg logit contributions
 Kap-4.457
nings-4.414
idences-4.400
sung-4.369
 Cth-4.260
Token both
Token ID1111
Feature activation+5.37e-01

Pos logit contributions
aline+9.952
\":+9.886
iculty+9.886
LOAD+9.398
 anytime+9.347
Neg logit contributions
sung-11.983
pai-11.983
nings-11.967
idences-11.780
 dstg-11.726
Token consumers
Token ID7008
Feature activation+6.98e-01

Pos logit contributions
iculty+4.354
aline+4.340
\":+4.170
LOAD+4.090
olo+3.889
Neg logit contributions
nings-4.665
idences-4.588
sung-4.509
pai-4.495
 Kap-4.486
Token and
Token ID290
Feature activation+4.48e-01

Pos logit contributions
iculty+4.975
aline+4.905
LOAD+4.581
\":+4.568
olo+4.306
Neg logit contributions
 Kap-6.714
nings-6.659
idences-6.566
sung-6.548
rament-6.489
Token beauty
Token ID8737
Feature activation+2.23e-01

Pos logit contributions
iculty+3.566
aline+3.501
\":+3.348
LOAD+3.255
olo+3.119
Neg logit contributions
nings-4.134
 Kap-3.979
idences-3.966
sung-3.956
pai-3.902
Token industry
Token ID2831
Feature activation+6.33e-01

Pos logit contributions
iculty+1.849
aline+1.844
\":+1.717
LOAD+1.707
olo+1.616
Neg logit contributions
 Kap-1.987
nings-1.983
idences-1.944
sung-1.929
ings-1.898
Token professionals
Token ID11153
Feature activation+1.36e+00

Pos logit contributions
aline+5.080
iculty+4.978
\":+4.831
LOAD+4.557
olo+4.442
Neg logit contributions
nings-5.545
sung-5.414
idences-5.371
 Kap-5.371
pai-5.272
Token.
Token ID13
Feature activation+4.13e-01

Pos logit contributions
iculty+8.592
aline+8.334
 anytime+7.836
LOAD+7.647
\":+7.645
Neg logit contributions
idences-12.029
sung-11.916
nings-11.915
pai-11.745
 Kap-11.575
Token Consumers
Token ID45103
Feature activation+5.93e-01

Pos logit contributions
iculty+3.560
aline+3.484
LOAD+3.400
\":+3.344
Published+3.151
Neg logit contributions
nings-3.499
idences-3.422
sung-3.382
 Kap-3.343
rament-3.324
Token identified
Token ID5174
Feature activation+5.75e-01

Pos logit contributions
 Kap+5.302
nings+5.216
ings+5.144
sung+5.008
idences+4.977
Neg logit contributions
iculty-6.129
aline-5.944
LOAD-5.596
\":-5.541
olo-5.208
Token itself
Token ID2346
Feature activation+1.64e-01

Pos logit contributions
iculty+3.997
aline+3.977
LOAD+3.744
\":+3.731
olo+3.414
Neg logit contributions
nings-5.627
 Kap-5.603
idences-5.583
rament-5.475
sung-5.469
Token with
Token ID351
Feature activation+3.22e-01

Pos logit contributions
iculty+1.327
aline+1.289
LOAD+1.209
\":+1.194
olo+1.119
Neg logit contributions
 Kap-1.552
nings-1.540
idences-1.510
sung-1.492
ings-1.484
Token.
Token ID13
Feature activation+1.29e-02

Pos logit contributions
iculty+2.530
aline+2.485
LOAD+2.329
\":+2.313
olo+2.165
Neg logit contributions
 Kap-3.003
idences-2.985
nings-2.985
sung-2.890
rament-2.871
Token Between
Token ID14307
Feature activation-4.94e-01

Pos logit contributions
iculty+0.122
aline+0.120
LOAD+0.114
\":+0.113
olo+0.107
Neg logit contributions
nings-0.103
 Kap-0.102
idences-0.101
sung-0.099
ings-0.099
Token 1850
Token ID35745
Feature activation-4.33e-01

Pos logit contributions
 Kap+4.201
nings+4.124
ings+4.018
idences+3.976
sung+3.960
Neg logit contributions
iculty-4.530
aline-4.352
LOAD-4.124
\":-4.041
olo-3.820
Token and
Token ID290
Feature activation-7.26e-01

Pos logit contributions
nings+3.486
 Kap+3.466
ings+3.365
idences+3.316
sung+3.311
Neg logit contributions
iculty-4.094
aline-4.002
LOAD-3.820
\":-3.683
olo-3.527
Token 1890
Token ID31982
Feature activation-1.82e-01

Pos logit contributions
 Kap+5.294
nings+5.156
ings+5.018
idences+4.897
sung+4.896
Neg logit contributions
iculty-7.423
aline-7.216
LOAD-6.947
\":-6.680
olo-6.420
Token,
Token ID11
Feature activation-5.08e-01

Pos logit contributions
 Kap+1.560
nings+1.545
idences+1.500
ings+1.493
sung+1.482
Neg logit contributions
iculty-1.649
aline-1.601
LOAD-1.515
\":-1.488
olo-1.405
Token new
Token ID649
Feature activation+5.96e-02

Pos logit contributions
 Kap+4.358
nings+4.256
idences+4.154
ings+4.136
sung+4.101
Neg logit contributions
iculty-4.608
aline-4.448
LOAD-4.207
\":-4.111
olo-3.896
Token academic
Token ID8233
Feature activation-2.79e-01

Pos logit contributions
iculty+0.540
aline+0.525
LOAD+0.498
\":+0.492
olo+0.467
Neg logit contributions
 Kap-0.499
nings-0.499
idences-0.488
sung-0.480
rament-0.478
Token,
Token ID11
Feature activation-1.10e-01

Pos logit contributions
iculty+0.545
aline+0.538
LOAD+0.505
\":+0.499
olo+0.473
Neg logit contributions
 Kap-0.589
nings-0.584
idences-0.571
sung-0.562
ings-0.561
Token and
Token ID290
Feature activation-2.45e-01

Pos logit contributions
 Kap+0.975
nings+0.973
idences+0.949
ings+0.934
sung+0.932
Neg logit contributions
iculty-0.956
aline-0.935
LOAD-0.876
\":-0.867
olo-0.820
Token even
Token ID772
Feature activation-4.50e-01

Pos logit contributions
 Kap+2.100
nings+2.089
idences+2.033
ings+2.010
sung+2.005
Neg logit contributions
iculty-2.204
aline-2.151
LOAD-2.025
\":-1.997
olo-1.891
Token discussed
Token ID6693
Feature activation-3.60e-01

Pos logit contributions
 Kap+3.906
nings+3.849
ings+3.765
idences+3.731
sung+3.722
Neg logit contributions
iculty-4.011
aline-3.883
LOAD-3.677
\":-3.601
olo-3.394
Token by
Token ID416
Feature activation-4.38e-01

Pos logit contributions
 Kap+3.213
nings+3.171
ings+3.086
idences+3.084
sung+3.059
Neg logit contributions
iculty-3.119
aline-3.029
LOAD-2.851
\":-2.800
olo-2.648
Token director
Token ID3437
Feature activation-1.13e-01

Pos logit contributions
 Kap+3.877
nings+3.794
ings+3.701
idences+3.698
sung+3.657
Neg logit contributions
iculty-3.838
aline-3.713
LOAD-3.492
\":-3.416
olo-3.240
Token Bryan
Token ID17857
Feature activation+3.65e-01

Pos logit contributions
nings+1.170
 Kap+1.166
idences+1.154
sung+1.130
ings+1.129
Neg logit contributions
iculty-0.816
aline-0.801
LOAD-0.732
\":-0.730
olo-0.683
Token Singer
Token ID28755
Feature activation-4.24e-02

Pos logit contributions
iculty+2.582
aline+2.579
\":+2.350
olo+2.280
LOAD+2.247
Neg logit contributions
nings-3.715
idences-3.626
rament-3.551
 dstg-3.550
pai-3.530
Token during
Token ID1141
Feature activation-7.55e-01

Pos logit contributions
 Kap+0.372
nings+0.369
idences+0.362
ings+0.356
sung+0.355
Neg logit contributions
iculty-0.371
aline-0.365
LOAD-0.343
\":-0.340
olo-0.321
Token the
Token ID262
Feature activation-3.65e-01

Pos logit contributions
 Kap+6.040
nings+5.941
ings+5.796
sung+5.665
idences+5.615
Neg logit contributions
iculty-7.080
aline-6.802
LOAD-6.557
\":-6.344
olo-6.057
Token press
Token ID1803
Feature activation-1.51e-01

Pos logit contributions
 Kap+3.108
nings+3.068
idences+2.975
ings+2.972
sung+2.944
Neg logit contributions
iculty-3.322
aline-3.227
LOAD-3.055
\":-2.994
olo-2.843
Token and
Token ID290
Feature activation-2.53e-01

Pos logit contributions
iculty+3.600
aline+3.559
LOAD+3.284
\":+3.244
olo+3.187
Neg logit contributions
 Kap-4.210
nings-4.057
idences-4.050
sung-4.014
rament-3.980
Token n
Token ID299
Feature activation+5.02e-01

Pos logit contributions
 Kap+2.235
nings+2.218
idences+2.161
ings+2.143
sung+2.135
Neg logit contributions
iculty-2.211
aline-2.153
LOAD-2.027
\":-1.993
olo-1.885
Tokenines
Token ID1127
Feature activation+8.30e-01

Pos logit contributions
iculty+3.178
aline+3.056
olo+2.846
\":+2.748
LOAD+2.679
Neg logit contributions
 Kap-5.407
nings-5.303
 dstg-5.281
rament-5.159
 Cth-5.150
Token.
Token ID13
Feature activation-7.75e-02

Pos logit contributions
aline+5.863
iculty+5.781
\":+5.312
LOAD+5.309
olo+5.240
Neg logit contributions
 Kap-7.698
idences-7.484
sung-7.384
rament-7.378
 dstg-7.229
Token
Token ID198
Feature activation-3.64e-02

Pos logit contributions
nings+0.637
 Kap+0.632
idences+0.623
sung+0.614
ings+0.608
Neg logit contributions
iculty-0.717
aline-0.701
LOAD-0.668
\":-0.655
olo-0.621
Token
Token ID198
Feature activation-1.97e-01

Pos logit contributions
 Kap+0.280
nings+0.279
idences+0.273
sung+0.267
ings+0.265
Neg logit contributions
iculty-0.357
aline-0.350
LOAD-0.332
\":-0.330
olo-0.312
TokenRF
Token ID32754
Feature activation-3.94e-01

Pos logit contributions
 Kap+1.379
nings+1.365
idences+1.321
ings+1.307
sung+1.299
Neg logit contributions
iculty-2.085
aline-2.040
LOAD-1.942
\":-1.915
olo-1.830
Token:
Token ID25
Feature activation+1.10e-03

Pos logit contributions
 Kap+3.702
nings+3.654
ings+3.592
idences+3.526
sung+3.510
Neg logit contributions
iculty-3.221
aline-3.135
LOAD-2.934
\":-2.805
olo-2.692
Token After
Token ID2293
Feature activation-3.29e-01

Pos logit contributions
iculty+0.010
aline+0.010
LOAD+0.010
\":+0.010
Published+0.009
Neg logit contributions
nings-0.009
idences-0.008
 Kap-0.008
sung-0.008
rament-0.008
Token working
Token ID1762
Feature activation+1.81e-02

Pos logit contributions
 Kap+2.920
nings+2.894
idences+2.814
ings+2.801
sung+2.781
Neg logit contributions
iculty-2.868
aline-2.790
LOAD-2.629
\":-2.585
olo-2.441
Token with
Token ID351
Feature activation-1.90e-02

Pos logit contributions
iculty+0.146
aline+0.143
LOAD+0.135
\":+0.131
olo+0.124
Neg logit contributions
nings-0.169
 Kap-0.167
idences-0.166
sung-0.163
ings-0.161
Token
Token ID198
Feature activation-5.07e-01

Pos logit contributions
 Kap+1.270
nings+1.264
idences+1.225
ings+1.213
sung+1.209
Neg logit contributions
iculty-1.670
aline-1.633
LOAD-1.552
\":-1.528
olo-1.455
Token
Token ID198
Feature activation-4.44e-01

Pos logit contributions
 Kap+3.309
nings+3.196
ings+3.153
idences+3.019
sung+3.017
Neg logit contributions
iculty-5.644
aline-5.508
LOAD-5.201
\":-5.027
olo-4.937
TokenAccording
Token ID4821
Feature activation-5.99e-01

Pos logit contributions
 Kap+3.609
nings+3.566
ings+3.471
idences+3.455
sung+3.438
Neg logit contributions
iculty-4.184
aline-4.032
\":-3.748
LOAD-3.734
olo-3.569
Token to
Token ID284
Feature activation-1.61e-01

Pos logit contributions
 Kap+4.450
nings+4.380
ings+4.249
idences+4.210
sung+4.113
Neg logit contributions
iculty-5.997
aline-5.838
LOAD-5.557
\":-5.407
olo-5.186
Token TRA
Token ID29125
Feature activation-2.80e-01

Pos logit contributions
 Kap+1.356
nings+1.346
idences+1.309
ings+1.298
sung+1.293
Neg logit contributions
iculty-1.471
aline-1.434
LOAD-1.354
\":-1.332
olo-1.263
Token's
Token ID338
Feature activation-3.63e-01

Pos logit contributions
 Kap+2.340
nings+2.303
idences+2.243
ings+2.232
sung+2.205
Neg logit contributions
iculty-2.592
aline-2.520
LOAD-2.382
\":-2.347
olo-2.222
Token '
Token ID705
Feature activation-7.08e-01

Pos logit contributions
 Kap+3.362
nings+3.315
idences+3.227
ings+3.217
sung+3.180
Neg logit contributions
iculty-3.046
aline-2.947
LOAD-2.766
\":-2.725
olo-2.558
TokenRegistration
Token ID47133
Feature activation-2.70e-01

Pos logit contributions
 Kap+5.968
nings+5.938
ings+5.801
idences+5.730
sung+5.720
Neg logit contributions
iculty-6.178
aline-5.978
\":-5.604
LOAD-5.551
olo-5.246
Token Requirements
Token ID24422
Feature activation-5.08e-01

Pos logit contributions
 Kap+2.391
nings+2.370
idences+2.309
ings+2.295
sung+2.279
Neg logit contributions
iculty-2.360
aline-2.297
LOAD-2.164
\":-2.125
olo-2.008
Token for
Token ID329
Feature activation-1.92e-01

Pos logit contributions
 Kap+4.300
nings+4.225
ings+4.121
idences+4.087
sung+4.020
Neg logit contributions
iculty-4.644
aline-4.515
LOAD-4.246
\":-4.110
olo-3.925
Token Mobile
Token ID12173
Feature activation-6.85e-01

Pos logit contributions
 Kap+1.368
nings+1.358
idences+1.314
ings+1.300
sung+1.295
Neg logit contributions
iculty-2.013
aline-1.970
LOAD-1.874
\":-1.848
olo-1.765
Token
Token ID447
Feature activation-1.52e-01

Pos logit contributions
iculty+1.760
aline+1.723
\":+1.641
LOAD+1.635
olo+1.560
Neg logit contributions
nings-1.044
 Kap-1.036
idences-1.023
sung-1.008
ings-0.988
Token
Token ID247
Feature activation-9.34e-02

Pos logit contributions
 Kap+1.206
nings+1.199
idences+1.161
ings+1.148
sung+1.144
Neg logit contributions
iculty-1.467
aline-1.433
LOAD-1.356
\":-1.339
olo-1.273
Tokens
Token ID82
Feature activation+1.87e-01

Pos logit contributions
 Kap+0.619
nings+0.609
idences+0.594
ings+0.580
sung+0.579
Neg logit contributions
iculty-1.026
aline-1.003
LOAD-0.952
\":-0.949
olo-0.905
Token approach
Token ID3164
Feature activation-1.81e-01

Pos logit contributions
iculty+1.679
aline+1.638
\":+1.524
LOAD+1.521
olo+1.445
Neg logit contributions
 Kap-1.589
nings-1.584
idences-1.576
sung-1.531
ings-1.525
Token to
Token ID284
Feature activation+5.16e-01

Pos logit contributions
 Kap+1.483
nings+1.475
idences+1.427
ings+1.425
sung+1.409
Neg logit contributions
iculty-1.705
aline-1.664
LOAD-1.577
\":-1.547
olo-1.470
Token storytelling
Token ID23689
Feature activation-1.49e-02

Pos logit contributions
iculty+4.427
aline+4.284
\":+3.982
LOAD+3.928
olo+3.821
Neg logit contributions
nings-4.369
idences-4.353
 Kap-4.274
 dstg-4.254
rament-4.238
Token.
Token ID13
Feature activation+1.54e-01

Pos logit contributions
 Kap+0.136
nings+0.135
idences+0.132
ings+0.130
sung+0.130
Neg logit contributions
iculty-0.127
aline-0.123
LOAD-0.116
\":-0.114
olo-0.108
Token Bio
Token ID16024
Feature activation-3.88e-01

Pos logit contributions
iculty+1.425
aline+1.386
LOAD+1.311
\":+1.309
Published+1.235
Neg logit contributions
nings-1.265
idences-1.237
 Kap-1.226
sung-1.213
ings-1.206
TokenShock
Token ID31646
Feature activation-2.23e-01

Pos logit contributions
 Kap+2.213
nings+2.159
idences+2.104
ings+2.083
sung+2.064
Neg logit contributions
iculty-4.545
aline-4.471
LOAD-4.279
\":-4.211
olo-4.032
Token is
Token ID318
Feature activation-1.59e-02

Pos logit contributions
 Kap+1.841
nings+1.825
idences+1.775
ings+1.759
sung+1.752
Neg logit contributions
iculty-2.087
aline-2.035
LOAD-1.922
\":-1.895
olo-1.798
Token famous
Token ID5863
Feature activation+2.50e-01

Pos logit contributions
 Kap+0.142
nings+0.141
idences+0.139
ings+0.137
sung+0.136
Neg logit contributions
iculty-0.138
aline-0.135
LOAD-0.126
\":-0.125
olo-0.118
Tokent
Token ID83
Feature activation-2.44e-01

Pos logit contributions
 Kap+1.117
nings+1.104
idences+1.077
ings+1.063
sung+1.059
Neg logit contributions
iculty-1.153
aline-1.121
LOAD-1.056
\":-1.046
olo-0.988
Token understand
Token ID1833
Feature activation-3.63e-01

Pos logit contributions
 Kap+1.880
nings+1.861
idences+1.810
ings+1.788
sung+1.785
Neg logit contributions
iculty-2.425
aline-2.368
LOAD-2.243
\":-2.219
olo-2.111
Token this
Token ID428
Feature activation+3.52e-01

Pos logit contributions
 Kap+3.086
nings+3.049
idences+2.954
ings+2.951
sung+2.918
Neg logit contributions
iculty-3.310
aline-3.220
LOAD-3.038
\":-2.980
olo-2.834
Token stuff
Token ID3404
Feature activation-8.18e-02

Pos logit contributions
iculty+2.521
aline+2.469
LOAD+2.312
\":+2.275
olo+2.074
Neg logit contributions
 Kap-3.522
idences-3.510
nings-3.506
sung-3.485
ings-3.398
Token.
Token ID13
Feature activation+1.48e-01

Pos logit contributions
 Kap+0.710
nings+0.705
idences+0.684
ings+0.682
sung+0.676
Neg logit contributions
iculty-0.732
aline-0.712
LOAD-0.671
\":-0.660
olo-0.625
Token If
Token ID1002
Feature activation-1.03e+00

Pos logit contributions
iculty+1.260
aline+1.231
LOAD+1.153
\":+1.148
olo+1.077
Neg logit contributions
nings-1.326
 Kap-1.311
idences-1.300
sung-1.282
ings-1.276
Token the
Token ID262
Feature activation-8.69e-01

Pos logit contributions
 Kap+7.132
nings+6.897
ings+6.689
 Cth+6.484
idences+6.360
Neg logit contributions
iculty-10.539
aline-10.089
LOAD-9.598
\":-9.130
olo-8.957
Token universe
Token ID6881
Feature activation-2.67e-01

Pos logit contributions
nings+5.231
 Kap+5.189
ings+4.848
idences+4.799
 Cth+4.721
Neg logit contributions
iculty-9.778
aline-9.466
LOAD-9.077
\":-8.742
olo-8.443
Token is
Token ID318
Feature activation+8.72e-02

Pos logit contributions
nings+2.256
 Kap+2.253
ings+2.181
idences+2.178
sung+2.146
Neg logit contributions
iculty-2.443
aline-2.358
LOAD-2.249
\":-2.185
olo-2.082
Token everything
Token ID2279
Feature activation-1.94e-01

Pos logit contributions
iculty+0.430
aline+0.423
\":+0.380
LOAD+0.377
olo+0.325
Neg logit contributions
 Kap-1.085
nings-1.068
idences-1.066
sung-1.064
ings-1.052
Token,
Token ID11
Feature activation-5.86e-01

Pos logit contributions
 Kap+1.655
nings+1.641
idences+1.592
ings+1.586
sung+1.571
Neg logit contributions
iculty-1.758
aline-1.709
LOAD-1.616
\":-1.583
olo-1.503
Token least
Token ID1551
Feature activation-1.20e+00

Pos logit contributions
 Kap+3.291
nings+3.210
idences+3.123
ings+3.121
sung+3.082
Neg logit contributions
iculty-4.302
aline-4.207
LOAD-4.032
\":-3.923
olo-3.706
Token from
Token ID422
Feature activation-5.27e-01

Pos logit contributions
 Kap+9.003
 unwittingly+8.750
nings+8.561
ings+8.510
pps+8.262
Neg logit contributions
iculty-10.989
aline-10.653
LOAD-10.416
\":-10.172
Published-9.497
Token a
Token ID257
Feature activation-2.05e-01

Pos logit contributions
 Kap+4.664
nings+4.513
ings+4.370
idences+4.345
sung+4.322
Neg logit contributions
iculty-4.677
aline-4.543
LOAD-4.316
\":-4.158
olo-3.949
Token punk
Token ID22782
Feature activation-5.13e-01

Pos logit contributions
 Kap+1.657
nings+1.653
idences+1.605
ings+1.591
sung+1.584
Neg logit contributions
iculty-1.947
aline-1.893
LOAD-1.793
\":-1.768
olo-1.682
Token ideological
Token ID15735
Feature activation-2.37e-01

Pos logit contributions
 Kap+4.419
nings+4.349
ings+4.275
idences+4.235
sung+4.181
Neg logit contributions
iculty-4.542
aline-4.406
LOAD-4.185
\":-4.123
olo-3.813
Token standpoint
Token ID22116
Feature activation-1.00e+00

Pos logit contributions
 Kap+2.100
nings+2.081
idences+2.027
ings+2.016
sung+2.002
Neg logit contributions
iculty-2.066
aline-2.011
LOAD-1.894
\":-1.862
olo-1.758
Token)
Token ID8
Feature activation-1.70e-01

Pos logit contributions
ings+4.171
nings+4.165
 Kap+4.031
 unwittingly+3.745
idences+3.652
Neg logit contributions
iculty-12.403
aline-12.135
LOAD-12.102
\":-11.940
Published-11.267
Token one
Token ID530
Feature activation-1.53e-01

Pos logit contributions
 Kap+1.576
nings+1.572
idences+1.536
sung+1.515
ings+1.514
Neg logit contributions
iculty-1.408
aline-1.367
LOAD-1.282
\":-1.262
olo-1.191
Token of
Token ID286
Feature activation-3.98e-01

Pos logit contributions
 Kap+1.284
nings+1.278
idences+1.250
sung+1.231
ings+1.229
Neg logit contributions
iculty-1.412
aline-1.375
LOAD-1.298
\":-1.285
olo-1.218
Token the
Token ID262
Feature activation-5.93e-02

Pos logit contributions
 Kap+3.348
nings+3.305
ings+3.198
idences+3.184
sung+3.155
Neg logit contributions
iculty-3.665
aline-3.574
LOAD-3.367
\":-3.310
olo-3.124
Token most
Token ID749
Feature activation-4.04e-02

Pos logit contributions
nings+0.561
 Kap+0.561
idences+0.550
sung+0.543
ings+0.541
Neg logit contributions
iculty-0.475
aline-0.458
LOAD-0.433
\":-0.425
olo-0.401
Tokenn
Token ID77
Feature activation-1.05e+00

Pos logit contributions
iculty+1.062
aline+1.041
\":+0.972
LOAD+0.958
olo+0.924
Neg logit contributions
 Kap-0.958
nings-0.913
idences-0.905
 Cth-0.890
ings-0.884
Token wary
Token ID23300
Feature activation-1.34e-01

Pos logit contributions
 Kap+8.527
ings+8.265
nings+8.233
 unwittingly+7.925
ğ+7.891
Neg logit contributions
aline-9.191
iculty-9.151
LOAD-8.605
\":-8.210
olo-7.820
Token,
Token ID11
Feature activation-2.99e-01

Pos logit contributions
 Kap+1.171
nings+1.161
idences+1.131
ings+1.122
sung+1.117
Neg logit contributions
iculty-1.193
aline-1.162
LOAD-1.095
\":-1.078
olo-1.019
Token to
Token ID284
Feature activation-3.50e-01

Pos logit contributions
 Kap+2.585
nings+2.552
idences+2.486
ings+2.472
sung+2.459
Neg logit contributions
iculty-2.690
aline-2.618
LOAD-2.473
\":-2.423
olo-2.304
Token say
Token ID910
Feature activation-1.03e+00

Pos logit contributions
 Kap+2.593
nings+2.544
idences+2.470
ings+2.453
sung+2.448
Neg logit contributions
iculty-3.581
aline-3.504
LOAD-3.315
\":-3.265
olo-3.112
Token the
Token ID262
Feature activation-1.09e+00

Pos logit contributions
 Kap+7.391
ings+7.166
nings+6.937
sung+6.821
idences+6.711
Neg logit contributions
iculty-10.465
aline-10.174
LOAD-9.677
\":-9.372
Published-8.897
Token least
Token ID1551
Feature activation-1.46e-01

Pos logit contributions
 Kap+6.239
nings+5.690
ings+5.641
idences+5.509
sung+5.451
Neg logit contributions
iculty-12.559
aline-12.278
LOAD-11.538
\":-11.402
olo-10.964
Token.
Token ID13
Feature activation-1.85e-01

Pos logit contributions
 Kap+1.247
nings+1.236
ings+1.200
idences+1.196
sung+1.186
Neg logit contributions
iculty-1.326
aline-1.295
LOAD-1.220
\":-1.200
olo-1.134
Token
Token ID564
Feature activation+3.14e-01

Pos logit contributions
 Kap+1.417
nings+1.409
idences+1.367
ings+1.353
sung+1.346
Neg logit contributions
iculty-1.840
aline-1.797
LOAD-1.707
\":-1.683
olo-1.600
Token
Token ID250
Feature activation-3.20e-02

Pos logit contributions
iculty+2.651
aline+2.565
LOAD+2.455
\":+2.436
Published+2.287
Neg logit contributions
nings-2.819
 Kap-2.783
idences-2.754
sung-2.683
rament-2.677
TokenSome
Token ID4366
Feature activation-8.49e-02

Pos logit contributions
 Kap+0.267
nings+0.267
idences+0.260
ings+0.255
rament+0.254
Neg logit contributions
iculty-0.295
aline-0.288
LOAD-0.276
\":-0.272
Published-0.259
Token health
Token ID1535
Feature activation+3.36e-01

Pos logit contributions
aline+3.791
iculty+3.790
\":+3.558
LOAD+3.541
olo+3.359
Neg logit contributions
nings-3.913
sung-3.897
idences-3.886
 Kap-3.814
rament-3.779
Token care
Token ID1337
Feature activation-9.45e-02

Pos logit contributions
iculty+2.927
aline+2.910
LOAD+2.712
\":+2.701
olo+2.592
Neg logit contributions
 Kap-2.810
nings-2.793
idences-2.746
sung-2.722
 Cth-2.683
Token.
Token ID13
Feature activation+1.49e-01

Pos logit contributions
 Kap+0.848
nings+0.842
idences+0.822
sung+0.813
ings+0.810
Neg logit contributions
iculty-0.815
aline-0.793
LOAD-0.746
\":-0.734
olo-0.694
Token In
Token ID554
Feature activation-4.13e-03

Pos logit contributions
iculty+1.357
aline+1.335
LOAD+1.280
\":+1.266
Published+1.198
Neg logit contributions
nings-1.219
idences-1.191
sung-1.175
 Kap-1.168
rament-1.163
Token the
Token ID262
Feature activation+1.86e-01

Pos logit contributions
nings+0.035
 Kap+0.035
idences+0.034
sung+0.034
ings+0.034
Neg logit contributions
iculty-0.037
aline-0.037
LOAD-0.034
\":-0.034
olo-0.032
Token U
Token ID471
Feature activation-1.00e+00

Pos logit contributions
iculty+1.627
aline+1.613
\":+1.533
LOAD+1.517
olo+1.428
Neg logit contributions
nings-1.570
idences-1.535
 Kap-1.520
sung-1.518
ings-1.501
Token.
Token ID13
Feature activation+9.90e-02

Pos logit contributions
ings+6.470
nings+6.467
 Kap+6.409
sung+6.347
pps+6.243
Neg logit contributions
iculty-10.358
aline-10.122
LOAD-9.394
\":-9.020
erate-8.834
TokenS
Token ID50
Feature activation+4.40e-02

Pos logit contributions
iculty+0.870
aline+0.850
LOAD+0.817
\":+0.806
Published+0.763
Neg logit contributions
nings-0.842
 Kap-0.839
idences-0.822
ings-0.808
rament-0.805
Token.,
Token ID1539
Feature activation-1.28e-01

Pos logit contributions
iculty+0.348
aline+0.338
LOAD+0.316
\":+0.310
olo+0.290
Neg logit contributions
 Kap-0.427
nings-0.423
idences-0.414
ings-0.411
sung-0.409
Token minority
Token ID9137
Feature activation+8.34e-02

Pos logit contributions
nings+1.124
 Kap+1.117
idences+1.088
sung+1.085
ings+1.079
Neg logit contributions
iculty-1.123
aline-1.099
LOAD-1.035
\":-1.020
olo-0.965
Token African
Token ID5510
Feature activation-1.03e-02

Pos logit contributions
iculty+0.764
aline+0.738
LOAD+0.716
\":+0.697
olo+0.671
Neg logit contributions
nings-0.687
 Kap-0.673
idences-0.668
sung-0.663
rament-0.654
Token Comcast
Token ID21830
Feature activation-3.93e-01

Pos logit contributions
 Kap+2.066
nings+2.049
idences+1.992
ings+1.976
sung+1.966
Neg logit contributions
iculty-2.319
aline-2.261
LOAD-2.137
\":-2.104
olo-1.996
Token has
Token ID468
Feature activation+1.00e-01

Pos logit contributions
 Kap+3.416
nings+3.365
ings+3.269
idences+3.262
sung+3.214
Neg logit contributions
iculty-3.510
aline-3.401
LOAD-3.222
\":-3.160
olo-2.980
Token started
Token ID2067
Feature activation+2.96e-01

Pos logit contributions
iculty+0.818
aline+0.791
LOAD+0.739
\":+0.737
olo+0.698
Neg logit contributions
nings-0.919
 Kap-0.918
idences-0.913
sung-0.892
rament-0.881
Token "
Token ID366
Feature activation-9.91e-02

Pos logit contributions
iculty+2.537
aline+2.449
\":+2.304
LOAD+2.296
olo+2.180
Neg logit contributions
idences-2.541
 Kap-2.532
nings-2.496
sung-2.492
rament-2.442
Tokenim
Token ID320
Feature activation-3.90e-02

Pos logit contributions
 Kap+0.835
nings+0.822
idences+0.808
ings+0.793
sung+0.787
Neg logit contributions
iculty-0.913
aline-0.893
LOAD-0.845
\":-0.828
olo-0.786
Tokenposing
Token ID32927
Feature activation-1.21e+00

Pos logit contributions
 Kap+0.341
nings+0.338
idences+0.331
sung+0.326
ings+0.325
Neg logit contributions
iculty-0.347
aline-0.338
LOAD-0.316
\":-0.309
olo-0.296
Token extra
Token ID3131
Feature activation-6.88e-01

Pos logit contributions
 Kap+11.120
nings+10.908
ings+10.820
sung+10.262
 unwittingly+10.204
Neg logit contributions
iculty-8.933
aline-8.769
LOAD-8.037
\":-7.902
olo-7.397
Token fees
Token ID6642
Feature activation-7.23e-01

Pos logit contributions
 Kap+5.500
nings+5.389
ings+5.268
idences+5.151
sung+5.102
Neg logit contributions
iculty-6.509
aline-6.371
LOAD-6.016
\":-5.964
olo-5.653
Token on
Token ID319
Feature activation-7.07e-02

Pos logit contributions
 Kap+5.484
nings+5.404
ings+5.281
idences+5.105
sung+5.077
Neg logit contributions
iculty-7.102
aline-6.899
LOAD-6.518
\":-6.420
olo-6.039
Token Internet
Token ID4455
Feature activation-1.93e-01

Pos logit contributions
nings+0.596
 Kap+0.591
idences+0.580
sung+0.571
ings+0.569
Neg logit contributions
iculty-0.643
aline-0.628
LOAD-0.595
\":-0.589
olo-0.556
Token customers
Token ID4297
Feature activation-2.68e-01

Pos logit contributions
 Kap+1.717
nings+1.716
idences+1.670
sung+1.652
ings+1.649
Neg logit contributions
iculty-1.673
aline-1.628
LOAD-1.538
\":-1.513
olo-1.429