TOP ACTIVATIONS
MAX = 15.851

Economic Sciences in Memory of Alfred Nobel , founder of the
Economic Sciences in Memory of Alfred Nobel A ward
Access Point : Not - Associated T x - Power =
and music composed by Er na Tau ro . The production
S enth il Kumar / Press Trust of India via AP
. ( Wa ub ges hig Rice / CBC )
ender Singh Tom ar ( Press Trust of India photo )
astronauts Thomas Pes quet and Peggy Wh its on planned to
15 of 15 Photo : Erin Bre th auer , The
. I was saying to Brian , " We cannot hack
. ( Wa ub ges hig Rice / CBC ) Carter
At this pre - trial hearing on Thursday , Mel
paper . He complained to Alfred Kn op f , who
Turkey has a debt to GDP ratio of only 2 .
S enth il Kumar / Press Trust of India via AP
that origin that Batman and Dick have of having gone through
famous author and humanitarian . Anne Sullivan stayed at her side
t . A US 5 Robbie M ce wen ( Green
a result of that , Fred 's right , we look
1989 Fred Savage 1990 Ted D anson 1990

MIN ACTIVATIONS
MIN = -14.495

com ,[ 91 ] NBC Sports . com ,[ 92 ]
four unknown assailants . A lecturer at the School of Journal
Lear y and cornerback Marcus R ios were held out of
com ,[ 96 ] NBC Sports . com ,[ 92 ]
Today ,[ 95 ] CBS Sports . com ,[ 96 ]
's suspension of a male student who argues he was completely
Cre ighton University , the Rev . Timothy L annon ,
More : Judge sides with UC grad student accused of rape
Airport Network , appearing on telev isions in airports across the
3 . Daniels � � Prom inent Family Ran a Chain
Al Az m , a professor of Middle East history at
@ Ann C oul ter HAVE YOU SE EN
W The Nvidia GTX 1080 ships with 25 60
following his junior season .[ 90 ] College career
the box . Invalid email address . Please re - enter
6 and 7 GW of solar generating capacity . Off shore
young evangelicals are abandoning their faith , but that they are
as well as funding for remed ial courses used to prepare
: 47 ] < c erv ant as > lol m
million deficit , cut its athletics budget . It is a

INTERVAL 8.265 - 15.851
CONTAINS 0.177%

, the main point of Gene alogy � � s first
Shadow Demon instead ( where as originally it was going to
economy can present over - exp ansion of the money supply
White Rock by Conservative candidate Dianne Watts , is being called
with water , her skull — upside down , teeth remarkably

INTERVAL 0.678 - 8.265
CONTAINS 38.404%

Memory 4 66 284 4 64 406 Mem c py Reading
TV play ) that have been added to this title .
. But research proves that the foul cal um
to play physical and re - route Baldwin to throw the
a must - read for the basic intuition about how transaction

INTERVAL -6.908 - 0.678
CONTAINS 60.799%

parts of northern Syria in partnership with some Arab tribes there
its similarly configured rival Pro Quest . Since the release of
Q & A session ― refer to the Republican presidential nominee
� � here and now . � � Touch
read well , was extremely successful in similar situations at Ole

INTERVAL -14.495 - -6.908
CONTAINS 0.619%

hearts , and hands ."[ 59 ] The university 's vice
cartridge , O car ina of Time was the last Zelda
and access to information .[ 109 ] In some
all the difference . <|endoftext|> Speaking to reporters in Barcelona today
Hitchcock enlisted his g ifted title designer , Saul Bass ,
Token Economic
Token ID11279
Feature activation+5.33e+00

Pos logit contributions
orld+0.375
iggins+0.363
cessive+0.359
phal+0.347
angel+0.337
Neg logit contributions
 Ratio-0.265
 Spy-0.221
 Lerner-0.218
 UNCLASSIFIED-0.211
 DOI-0.208
Token Sciences
Token ID13473
Feature activation+4.28e-01

Pos logit contributions
 Ratio+3.456
 Spy+2.851
 Lerner+2.802
 Depression+2.716
 DOI+2.711
Neg logit contributions
orld-4.742
iggins-4.600
cessive-4.552
phal-4.389
angel-4.296
Token in
Token ID287
Feature activation-1.87e+00

Pos logit contributions
 Ratio+0.301
 Spy+0.255
 Lerner+0.251
 UNCLASSIFIED+0.244
 DOI+0.243
Neg logit contributions
orld-0.359
iggins-0.347
cessive-0.343
phal-0.331
angel-0.321
Token Memory
Token ID14059
Feature activation+2.03e+00

Pos logit contributions
orld+1.884
cessive+1.828
iggins+1.822
phal+1.770
idi+1.707
Neg logit contributions
 Ratio-0.951
 Spy-0.758
 Lerner-0.741
 UNCLASSIFIED-0.720
 DOI-0.699
Token of
Token ID286
Feature activation+2.64e+00

Pos logit contributions
 Ratio+1.496
 Spy+1.275
 Lerner+1.261
 UNCLASSIFIED+1.228
 DOI+1.216
Neg logit contributions
orld-1.639
iggins-1.578
cessive-1.559
phal-1.507
angel-1.457
Token Alfred
Token ID22044
Feature activation+1.59e+01

Pos logit contributions
 Ratio+1.335
 Spy+1.051
 Lerner+1.029
 UNCLASSIFIED+0.987
 DOI+0.978
Neg logit contributions
orld-2.733
iggins-2.659
cessive-2.631
phal-2.558
angel-2.497
Token Nobel
Token ID20715
Feature activation-2.13e+00

Pos logit contributions
 Ratio+2.546
 Nobel+2.062
 DOI+1.405
 Spy+1.261
 Lerner+1.135
Neg logit contributions
iggins-19.940
orld-19.713
cessive-19.222
phal-18.678
angel-18.397
Token,
Token ID11
Feature activation+3.30e+00

Pos logit contributions
orld+1.706
iggins+1.638
cessive+1.609
phal+1.565
angel+1.508
Neg logit contributions
 Ratio-1.562
 Spy-1.333
 Lerner-1.316
 UNCLASSIFIED-1.274
 DOI-1.263
Token founder
Token ID9119
Feature activation-1.10e+00

Pos logit contributions
 Ratio+2.167
 Spy+1.825
 Lerner+1.780
 DOI+1.733
 UNCLASSIFIED+1.730
Neg logit contributions
orld-2.917
iggins-2.830
cessive-2.796
phal-2.691
angel-2.619
Token of
Token ID286
Feature activation+3.93e+00

Pos logit contributions
orld+0.846
iggins+0.819
cessive+0.811
phal+0.779
angel+0.761
Neg logit contributions
 Ratio-0.835
 Spy-0.705
 Lerner-0.703
 UNCLASSIFIED-0.691
 DOI-0.683
Token the
Token ID262
Feature activation+4.50e+00

Pos logit contributions
 Ratio+2.533
 Spy+2.112
 Lerner+2.078
 Depression+2.003
 UNCLASSIFIED+1.998
Neg logit contributions
orld-3.531
iggins-3.434
cessive-3.378
phal-3.285
angel-3.188
Token Economic
Token ID11279
Feature activation+7.25e+00

Pos logit contributions
 Ratio+0.416
 Spy+0.348
 Lerner+0.342
 UNCLASSIFIED+0.332
 DOI+0.326
Neg logit contributions
orld-0.574
iggins-0.556
cessive-0.550
phal-0.532
angel-0.515
Token Sciences
Token ID13473
Feature activation+3.03e+00

Pos logit contributions
 Ratio+3.989
 Spy+3.083
 Depression+3.028
 Prevention+3.020
 Sachs+2.966
Neg logit contributions
orld-7.164
iggins-7.025
cessive-7.010
phal-6.689
idi-6.593
Token in
Token ID287
Feature activation-7.62e-01

Pos logit contributions
 Ratio+2.118
 Spy+1.788
 Lerner+1.766
 UNCLASSIFIED+1.717
 DOI+1.706
Neg logit contributions
orld-2.553
iggins-2.466
cessive-2.439
phal-2.352
angel-2.285
Token Memory
Token ID14059
Feature activation+3.91e+00

Pos logit contributions
orld+0.842
iggins+0.819
cessive+0.809
phal+0.787
angel+0.772
Neg logit contributions
 Ratio-0.340
 Spy-0.258
 Lerner-0.252
 UNCLASSIFIED-0.239
 DOI-0.236
Token of
Token ID286
Feature activation+4.78e+00

Pos logit contributions
 Ratio+2.346
 Spy+1.987
 Lerner+1.924
 UNCLASSIFIED+1.919
 DOI+1.893
Neg logit contributions
orld-3.667
iggins-3.561
cessive-3.528
phal-3.280
inent-3.252
Token Alfred
Token ID22044
Feature activation+1.54e+01

Pos logit contributions
 Ratio+2.219
 Spy+1.710
 Lerner+1.675
 UNCLASSIFIED+1.606
 DOI+1.591
Neg logit contributions
orld-5.143
iggins-5.018
cessive-4.937
phal-4.801
angel-4.693
Token Nobel
Token ID20715
Feature activation-2.96e+00

Pos logit contributions
 Ratio+1.683
 Nobel+1.369
 DOI+0.687
 Spy+0.443
 UNCLASSIFIED+0.315
Neg logit contributions
iggins-20.483
orld-20.188
cessive-19.773
phal-18.963
angel-18.894
Token
Token ID198
Feature activation-8.97e-01

Pos logit contributions
orld+2.219
iggins+2.114
cessive+2.085
phal+2.024
idi+1.937
Neg logit contributions
 Ratio-2.306
 Spy-1.982
 Lerner-1.976
 UNCLASSIFIED-1.902
 Scythe-1.897
Token
Token ID198
Feature activation-5.06e-02

Pos logit contributions
orld+0.825
iggins+0.805
cessive+0.790
phal+0.761
idi+0.737
Neg logit contributions
 Ratio-0.563
 Spy-0.472
 Lerner-0.463
 UNCLASSIFIED-0.457
 DOI-0.452
TokenA
Token ID32
Feature activation+8.14e-01

Pos logit contributions
orld+0.036
iggins+0.035
cessive+0.034
phal+0.033
angel+0.031
Neg logit contributions
 Ratio-0.042
 Spy-0.036
 Lerner-0.036
 UNCLASSIFIED-0.035
 DOI-0.035
Tokenward
Token ID904
Feature activation+1.69e+00

Pos logit contributions
 Ratio+0.582
 Spy+0.495
 Lerner+0.486
 UNCLASSIFIED+0.474
 DOI+0.472
Neg logit contributions
orld-0.673
iggins-0.651
cessive-0.642
phal-0.617
angel-0.605
Token Access
Token ID8798
Feature activation+5.65e+00

Pos logit contributions
 Ratio+2.654
 Spy+2.129
 UNCLASSIFIED+2.092
 Prevention+1.973
 DOI+1.968
Neg logit contributions
orld-4.956
iggins-4.849
cessive-4.704
phal-4.631
idi-4.438
Token Point
Token ID6252
Feature activation+7.86e+00

Pos logit contributions
 Ratio+3.328
 Spy+2.653
 UNCLASSIFIED+2.525
 Prevention+2.496
 Radius+2.477
Neg logit contributions
orld-5.532
iggins-5.411
cessive-5.342
idi-5.059
phal-5.037
Token:
Token ID25
Feature activation+3.02e+00

Pos logit contributions
 Ratio+4.137
 Spy+3.355
 UNCLASSIFIED+3.253
 Lerner+3.194
 DOI+3.155
Neg logit contributions
orld-7.761
iggins-7.690
cessive-7.522
phal-7.186
inent-6.974
Token Not
Token ID1892
Feature activation+4.51e+00

Pos logit contributions
 Ratio+2.093
 Spy+1.765
 Lerner+1.737
 UNCLASSIFIED+1.698
 DOI+1.673
Neg logit contributions
orld-2.570
iggins-2.478
cessive-2.447
phal-2.369
angel-2.298
Token-
Token ID12
Feature activation+1.01e+00

Pos logit contributions
 Ratio+2.723
 Spy+2.294
 UNCLASSIFIED+2.221
 Lerner+2.214
 Depression+2.120
Neg logit contributions
orld-4.277
iggins-4.203
cessive-4.026
phal-3.869
idi-3.762
TokenAssociated
Token ID29014
Feature activation+1.49e+01

Pos logit contributions
 Ratio+0.860
 Spy+0.754
 UNCLASSIFIED+0.751
 Lerner+0.750
 Prevention+0.726
Neg logit contributions
orld-0.705
iggins-0.685
cessive-0.642
phal-0.620
amph-0.584
Token T
Token ID309
Feature activation+4.54e+00

Pos logit contributions
 Ratio+6.980
 Spy+5.336
 UNCLASSIFIED+4.999
 DOI+4.790
 Depression+4.739
Neg logit contributions
orld-15.127
iggins-14.692
phal-14.394
cessive-14.272
angel-13.969
Tokenx
Token ID87
Feature activation+7.94e+00

Pos logit contributions
 Ratio+2.557
 UNCLASSIFIED+2.085
 Spy+2.077
alty+2.064
 DOI+2.023
Neg logit contributions
orld-4.246
iggins-4.102
cessive-4.089
phal-3.898
inent-3.722
Token-
Token ID12
Feature activation+3.22e+00

Pos logit contributions
 Ratio+4.143
 Spy+3.389
 UNCLASSIFIED+3.246
 Depression+3.142
 DOI+3.087
Neg logit contributions
orld-7.879
iggins-7.810
cessive-7.477
estial-7.153
phal-7.120
TokenPower
Token ID13434
Feature activation+5.77e+00

Pos logit contributions
 Ratio+2.204
 Spy+1.818
 UNCLASSIFIED+1.796
 Prevention+1.754
range+1.751
Neg logit contributions
orld-2.803
iggins-2.733
cessive-2.602
phal-2.530
idi-2.467
Token=
Token ID28
Feature activation-7.22e+00

Pos logit contributions
 Ratio+3.431
 UNCLASSIFIED+2.675
 Spy+2.674
 PID+2.602
 Depression+2.600
Neg logit contributions
orld-5.529
iggins-5.377
cessive-5.203
phal-5.095
idi-4.975
Token and
Token ID290
Feature activation+2.80e+00

Pos logit contributions
 Ratio+0.740
 Spy+0.627
 Lerner+0.623
 UNCLASSIFIED+0.601
 Depression+0.593
Neg logit contributions
orld-0.905
iggins-0.874
cessive-0.864
phal-0.832
angel-0.806
Token music
Token ID2647
Feature activation-4.99e+00

Pos logit contributions
 Ratio+1.847
 Spy+1.569
 Lerner+1.548
 UNCLASSIFIED+1.484
 Depression+1.478
Neg logit contributions
orld-2.493
iggins-2.396
cessive-2.367
phal-2.290
angel-2.204
Token composed
Token ID13160
Feature activation-2.24e+00

Pos logit contributions
orld+3.920
iggins+3.795
cessive+3.745
phal+3.646
angel+3.546
Neg logit contributions
 Ratio-3.639
 Spy-3.078
 Lerner-3.012
 UNCLASSIFIED-2.973
 DOI-2.943
Token by
Token ID416
Feature activation+9.17e-01

Pos logit contributions
orld+1.819
iggins+1.754
cessive+1.733
phal+1.680
angel+1.633
Neg logit contributions
 Ratio-1.620
 Spy-1.364
 Lerner-1.346
 UNCLASSIFIED-1.319
 DOI-1.311
Token Er
Token ID5256
Feature activation+5.41e+00

Pos logit contributions
 Ratio+0.572
 Spy+0.476
 Lerner+0.469
 UNCLASSIFIED+0.450
 DOI+0.446
Neg logit contributions
orld-0.846
iggins-0.818
cessive-0.809
phal-0.784
angel-0.759
Tokenna
Token ID2616
Feature activation+1.44e+01

Pos logit contributions
 Ratio+3.657
 Spy+3.135
 Lerner+3.064
 UNCLASSIFIED+2.978
 Depression+2.962
Neg logit contributions
orld-4.535
iggins-4.361
cessive-4.282
phal-4.103
angel-3.970
Token Tau
Token ID36849
Feature activation-8.59e-01

Pos logit contributions
 Ratio+7.047
 Spy+5.889
 Lerner+5.880
 UNCLASSIFIED+5.397
 Scythe+5.343
Neg logit contributions
orld-13.628
iggins-13.323
cessive-13.219
phal-12.600
inent-12.335
Tokenro
Token ID305
Feature activation-6.29e+00

Pos logit contributions
orld+0.633
iggins+0.618
cessive+0.609
phal+0.575
idi+0.560
Neg logit contributions
 Ratio-0.683
 Lerner-0.584
 Spy-0.584
 UNCLASSIFIED-0.575
 DOI-0.566
Token.
Token ID13
Feature activation-1.26e+00

Pos logit contributions
orld+4.681
iggins+4.471
cessive+4.403
phal+4.331
angel+4.198
Neg logit contributions
 Ratio-4.795
 Spy-4.064
 Lerner-3.987
 UNCLASSIFIED-3.959
 DOI-3.959
Token The
Token ID383
Feature activation-1.78e+00

Pos logit contributions
orld+1.091
iggins+1.053
cessive+1.043
phal+1.008
angel+0.981
Neg logit contributions
 Ratio-0.850
 Spy-0.711
 Lerner-0.703
 UNCLASSIFIED-0.683
 DOI-0.677
Token production
Token ID3227
Feature activation+1.06e-01

Pos logit contributions
orld+1.539
iggins+1.491
cessive+1.474
phal+1.427
angel+1.387
Neg logit contributions
 Ratio-1.198
 Spy-1.000
 Lerner-0.987
 UNCLASSIFIED-0.961
 DOI-0.953
Token S
Token ID311
Feature activation-5.18e+00

Pos logit contributions
orld+0.580
iggins+0.561
cessive+0.555
phal+0.538
angel+0.523
Neg logit contributions
 Ratio-0.404
 Spy-0.335
 Lerner-0.330
 UNCLASSIFIED-0.320
 DOI-0.316
Tokenenth
Token ID7944
Feature activation+7.94e-01

Pos logit contributions
orld+2.761
iggins+2.612
cessive+2.565
phal+2.428
angel+2.304
Neg logit contributions
 Ratio-5.206
 Spy-4.650
 Lerner-4.608
 UNCLASSIFIED-4.515
 DOI-4.494
Tokenil
Token ID346
Feature activation+7.60e-01

Pos logit contributions
 Ratio+0.561
 Spy+0.475
 UNCLASSIFIED+0.470
 Lerner+0.470
 DOI+0.458
Neg logit contributions
orld-0.649
iggins-0.623
cessive-0.622
phal-0.592
inent-0.571
Token Kumar
Token ID26105
Feature activation+9.16e-01

Pos logit contributions
 Ratio+0.395
 Spy+0.319
 Lerner+0.317
 UNCLASSIFIED+0.302
 DOI+0.297
Neg logit contributions
orld-0.786
iggins-0.762
cessive-0.755
phal-0.721
angel-0.712
Token/
Token ID14
Feature activation+4.92e+00

Pos logit contributions
 Ratio+0.668
 Spy+0.567
 Lerner+0.560
 UNCLASSIFIED+0.543
 DOI+0.538
Neg logit contributions
orld-0.744
iggins-0.720
cessive-0.707
phal-0.688
angel-0.664
Token Press
Token ID4332
Feature activation+1.42e+01

Pos logit contributions
 Ratio+3.577
 Spy+3.047
 Lerner+2.995
 UNCLASSIFIED+2.967
 DOI+2.927
Neg logit contributions
orld-3.968
iggins-3.793
cessive-3.761
phal-3.566
angel-3.486
Token Trust
Token ID9870
Feature activation+4.96e+00

Pos logit contributions
 Ratio+5.145
 Spy+4.169
 UNCLASSIFIED+4.036
 Depression+3.235
 DOI+3.206
Neg logit contributions
orld-15.266
iggins-15.031
cessive-15.017
phal-14.502
angel-14.383
Token of
Token ID286
Feature activation+2.98e+00

Pos logit contributions
 Ratio+2.997
 Spy+2.491
 UNCLASSIFIED+2.405
 DOI+2.358
 Lerner+2.341
Neg logit contributions
orld-4.669
iggins-4.592
cessive-4.560
phal-4.247
inent-4.207
Token India
Token ID3794
Feature activation+1.96e+00

Pos logit contributions
 Ratio+1.689
 Spy+1.322
 UNCLASSIFIED+1.291
 DOI+1.290
 Lerner+1.286
Neg logit contributions
orld-2.937
iggins-2.890
cessive-2.882
phal-2.756
amph-2.720
Token via
Token ID2884
Feature activation+4.10e+00

Pos logit contributions
 Ratio+1.476
 Spy+1.264
 Lerner+1.251
 UNCLASSIFIED+1.229
 DOI+1.215
Neg logit contributions
orld-1.553
iggins-1.496
cessive-1.475
phal-1.415
angel-1.382
Token AP
Token ID3486
Feature activation+4.79e+00

Pos logit contributions
 Ratio+2.993
 Spy+2.550
 Lerner+2.510
 UNCLASSIFIED+2.461
 DOI+2.444
Neg logit contributions
orld-3.319
iggins-3.196
cessive-3.166
phal-3.048
angel-2.954
Token.
Token ID13
Feature activation+5.41e-01

Pos logit contributions
orld+0.499
iggins+0.481
cessive+0.475
phal+0.460
angel+0.447
Neg logit contributions
 Ratio-0.406
 Spy-0.341
 Lerner-0.337
 UNCLASSIFIED-0.327
 DOI-0.324
Token (
Token ID357
Feature activation+5.56e+00

Pos logit contributions
 Ratio+0.373
 Spy+0.316
 Lerner+0.312
 UNCLASSIFIED+0.303
 DOI+0.300
Neg logit contributions
orld-0.464
iggins-0.447
cessive-0.442
phal-0.427
angel-0.414
TokenWa
Token ID33484
Feature activation+1.85e+00

Pos logit contributions
 Ratio+4.460
 Spy+3.904
 Lerner+3.901
 UNCLASSIFIED+3.833
 DOI+3.784
Neg logit contributions
orld-3.904
iggins-3.692
cessive-3.598
phal-3.459
inent-3.350
Tokenub
Token ID549
Feature activation-4.41e-01

Pos logit contributions
 Ratio+1.365
 UNCLASSIFIED+1.170
 Spy+1.163
 Lerner+1.150
alty+1.131
Neg logit contributions
orld-1.415
iggins-1.383
cessive-1.365
phal-1.316
idi-1.259
Tokenges
Token ID3212
Feature activation+3.62e+00

Pos logit contributions
orld+0.367
iggins+0.349
cessive+0.348
phal+0.333
idi+0.328
Neg logit contributions
 Ratio-0.308
 Spy-0.261
 UNCLASSIFIED-0.258
 Lerner-0.257
 DOI-0.253
Tokenhig
Token ID25196
Feature activation+1.40e+01

Pos logit contributions
 Ratio+3.277
 UNCLASSIFIED+2.967
 Spy+2.917
 Lerner+2.916
alty+2.909
Neg logit contributions
orld-2.063
iggins-1.937
cessive-1.913
phal-1.766
idi-1.729
Token Rice
Token ID13823
Feature activation-2.18e+00

Pos logit contributions
 Ratio+2.763
 UNCLASSIFIED+1.301
 Spy+1.259
 DOI+1.167
 Lerner+1.159
Neg logit contributions
orld-17.345
cessive-17.263
iggins-16.749
inent-16.728
phal-16.684
Token/
Token ID14
Feature activation+3.08e+00

Pos logit contributions
orld+1.879
iggins+1.814
cessive+1.795
phal+1.726
angel+1.677
Neg logit contributions
 Ratio-1.499
 Spy-1.265
 Lerner-1.246
 UNCLASSIFIED-1.217
 DOI-1.210
TokenCBC
Token ID29208
Feature activation+2.68e+00

Pos logit contributions
 Ratio+2.150
 UNCLASSIFIED+1.872
 Spy+1.838
 DOI+1.822
 Lerner+1.818
Neg logit contributions
orld-2.564
iggins-2.464
cessive-2.424
phal-2.261
inent-2.234
Token)
Token ID8
Feature activation-4.09e+00

Pos logit contributions
 Ratio+1.964
 Spy+1.674
 Lerner+1.658
 UNCLASSIFIED+1.609
 DOI+1.594
Neg logit contributions
orld-2.172
iggins-2.093
cessive-2.068
phal-1.994
angel-1.931
Token
Token ID198
Feature activation-5.60e-01

Pos logit contributions
orld+3.369
iggins+3.268
cessive+3.259
phal+3.150
idi+3.041
Neg logit contributions
 Ratio-2.795
 Spy-2.342
 Lerner-2.295
geries-2.238
 DOI-2.223
Tokenender
Token ID2194
Feature activation+3.89e+00

Pos logit contributions
orld+0.048
iggins+0.047
cessive+0.046
phal+0.045
angel+0.044
Neg logit contributions
 Ratio-0.026
 Spy-0.021
 Lerner-0.020
 UNCLASSIFIED-0.019
 DOI-0.019
Token Singh
Token ID14403
Feature activation+8.18e-01

Pos logit contributions
 Ratio+2.319
 Spy+1.954
 Lerner+1.914
 UNCLASSIFIED+1.807
 Depression+1.798
Neg logit contributions
orld-3.661
iggins-3.572
cessive-3.523
phal-3.331
angel-3.303
Token Tom
Token ID4186
Feature activation+2.38e+00

Pos logit contributions
 Ratio+0.581
 Spy+0.495
 Lerner+0.491
 UNCLASSIFIED+0.473
 DOI+0.470
Neg logit contributions
orld-0.681
iggins-0.660
cessive-0.652
phal-0.622
angel-0.608
Tokenar
Token ID283
Feature activation-3.14e+00

Pos logit contributions
 Ratio+1.831
 Spy+1.590
 Lerner+1.573
 UNCLASSIFIED+1.545
 DOI+1.523
Neg logit contributions
orld-1.826
iggins-1.745
cessive-1.731
phal-1.631
angel-1.575
Token (
Token ID357
Feature activation+6.81e+00

Pos logit contributions
orld+2.479
iggins+2.381
cessive+2.350
phal+2.285
idi+2.213
Neg logit contributions
 Ratio-2.348
 Spy-1.998
 Lerner-1.969
 UNCLASSIFIED-1.921
 DOI-1.906
TokenPress
Token ID13800
Feature activation+1.36e+01

Pos logit contributions
 Ratio+4.729
 Spy+4.030
 Lerner+3.946
 DOI+3.897
 UNCLASSIFIED+3.897
Neg logit contributions
orld-5.497
iggins-5.292
cessive-5.273
phal-4.829
angel-4.749
Token Trust
Token ID9870
Feature activation+2.59e+00

Pos logit contributions
 Ratio+6.855
 Spy+5.505
 DOI+5.129
 UNCLASSIFIED+5.126
 Doors+5.027
Neg logit contributions
orld-13.304
iggins-12.823
cessive-12.761
angel-12.240
inent-12.181
Token of
Token ID286
Feature activation+2.74e+00

Pos logit contributions
 Ratio+1.769
 Spy+1.471
 UNCLASSIFIED+1.425
 Lerner+1.404
 Prevention+1.380
Neg logit contributions
orld-2.256
iggins-2.210
cessive-2.188
phal-2.078
angel-2.057
Token India
Token ID3794
Feature activation+2.07e+00

Pos logit contributions
 Ratio+1.723
 Spy+1.428
 Lerner+1.398
 UNCLASSIFIED+1.377
 DOI+1.341
Neg logit contributions
orld-2.505
iggins-2.433
cessive-2.410
phal-2.309
angel-2.270
Token photo
Token ID4590
Feature activation+2.45e+00

Pos logit contributions
 Ratio+1.457
 Spy+1.232
 Lerner+1.212
 UNCLASSIFIED+1.183
 DOI+1.172
Neg logit contributions
orld-1.735
iggins-1.674
cessive-1.657
phal-1.591
angel-1.550
Token)
Token ID8
Feature activation-5.81e-01

Pos logit contributions
 Ratio+1.715
 Spy+1.445
 Lerner+1.419
 DOI+1.389
 UNCLASSIFIED+1.386
Neg logit contributions
orld-2.070
iggins-1.994
cessive-1.969
phal-1.893
angel-1.846
Token astronauts
Token ID26835
Feature activation+2.91e+00

Pos logit contributions
orld+1.171
iggins+1.130
cessive+1.117
phal+1.078
angel+1.045
Neg logit contributions
 Ratio-1.018
 Spy-0.863
 Lerner-0.853
 UNCLASSIFIED-0.829
 DOI-0.823
Token Thomas
Token ID5658
Feature activation+6.54e+00

Pos logit contributions
 Ratio+1.788
 Spy+1.517
 Lerner+1.474
 UNCLASSIFIED+1.452
 Depression+1.410
Neg logit contributions
orld-2.668
iggins-2.579
cessive-2.554
phal-2.482
inent-2.395
Token Pes
Token ID38434
Feature activation+6.81e-01

Pos logit contributions
 Ratio+3.687
 Lerner+3.082
 Spy+3.037
 UNCLASSIFIED+2.873
 Ples+2.806
Neg logit contributions
orld-6.220
cessive-6.085
iggins-5.997
phal-5.870
amph-5.684
Tokenquet
Token ID21108
Feature activation-1.81e+00

Pos logit contributions
 Ratio+0.529
 Spy+0.454
 Lerner+0.450
 UNCLASSIFIED+0.439
 DOI+0.435
Neg logit contributions
orld-0.519
iggins-0.499
cessive-0.493
phal-0.475
angel-0.456
Token and
Token ID290
Feature activation+3.34e+00

Pos logit contributions
orld+1.548
iggins+1.497
cessive+1.472
phal+1.435
angel+1.397
Neg logit contributions
 Ratio-1.245
 Spy-1.038
 Lerner-1.029
 UNCLASSIFIED-0.996
 DOI-0.993
Token Peggy
Token ID40910
Feature activation+1.34e+01

Pos logit contributions
 Ratio+2.144
 Spy+1.809
 Lerner+1.775
 UNCLASSIFIED+1.704
 Depression+1.687
Neg logit contributions
orld-3.020
iggins-2.907
cessive-2.876
phal-2.794
idi-2.691
Token Wh
Token ID854
Feature activation-5.31e+00

Pos logit contributions
 Ratio+4.699
 Lerner+4.054
 Gou+3.729
 Spy+3.626
 Depression+3.510
Neg logit contributions
orld-14.881
cessive-14.427
iggins-14.282
idi-14.071
phal-13.964
Tokenits
Token ID896
Feature activation-2.37e+00

Pos logit contributions
orld+3.764
iggins+3.660
cessive+3.576
phal+3.398
angel+3.333
Neg logit contributions
 Ratio-4.404
 Spy-3.846
 Lerner-3.811
 UNCLASSIFIED-3.707
 Depression-3.697
Tokenon
Token ID261
Feature activation+1.04e+00

Pos logit contributions
orld+1.866
iggins+1.800
cessive+1.782
phal+1.714
angel+1.666
Neg logit contributions
 Ratio-1.784
 Spy-1.520
 Lerner-1.502
 UNCLASSIFIED-1.466
 DOI-1.456
Token planned
Token ID6027
Feature activation+1.79e+00

Pos logit contributions
 Ratio+0.683
 Spy+0.577
 Lerner+0.565
 UNCLASSIFIED+0.553
 DOI+0.544
Neg logit contributions
orld-0.908
iggins-0.878
cessive-0.870
phal-0.840
angel-0.814
Token to
Token ID284
Feature activation+7.68e-01

Pos logit contributions
 Ratio+1.254
 Spy+1.062
 Lerner+1.047
 UNCLASSIFIED+1.020
 DOI+1.009
Neg logit contributions
orld-1.508
iggins-1.454
cessive-1.438
phal-1.389
angel-1.347
Token 15
Token ID1315
Feature activation-8.42e-01

Pos logit contributions
orld+3.180
iggins+3.062
cessive+3.038
phal+2.937
angel+2.853
Neg logit contributions
 Ratio-2.542
 Spy-2.136
 Lerner-2.125
 UNCLASSIFIED-2.050
 DOI-2.029
Token of
Token ID286
Feature activation+1.93e-02

Pos logit contributions
orld+0.740
iggins+0.725
cessive+0.713
phal+0.677
inent+0.662
Neg logit contributions
 Ratio-0.558
 Spy-0.478
 Lerner-0.467
 UNCLASSIFIED-0.461
 DOI-0.455
Token 15
Token ID1315
Feature activation+1.74e+00

Pos logit contributions
 Ratio+0.012
 Spy+0.010
 UNCLASSIFIED+0.010
 Lerner+0.010
 DOI+0.010
Neg logit contributions
orld-0.018
iggins-0.017
cessive-0.017
phal-0.016
idi-0.016
Token Photo
Token ID5555
Feature activation+3.26e+00

Pos logit contributions
 Ratio+1.087
 Spy+0.900
 Lerner+0.894
 UNCLASSIFIED+0.855
 DOI+0.847
Neg logit contributions
orld-1.585
iggins-1.530
cessive-1.520
phal-1.473
angel-1.436
Token:
Token ID25
Feature activation-6.87e-01

Pos logit contributions
 Ratio+1.887
 Spy+1.587
 UNCLASSIFIED+1.550
 Lerner+1.533
 DOI+1.509
Neg logit contributions
orld-3.117
iggins-3.080
cessive-3.012
phal-2.871
inent-2.821
Token Erin
Token ID28894
Feature activation+1.30e+01

Pos logit contributions
orld+0.612
iggins+0.595
cessive+0.588
phal+0.572
angel+0.554
Neg logit contributions
 Ratio-0.439
 Spy-0.361
 Lerner-0.356
 UNCLASSIFIED-0.347
 DOI-0.346
Token Bre
Token ID3719
Feature activation-3.31e+00

Pos logit contributions
 Ratio+6.881
 Spy+6.051
 UNCLASSIFIED+5.670
 Lerner+5.664
 Gou+5.381
Neg logit contributions
iggins-12.445
orld-12.433
cessive-12.078
inent-11.640
phal-11.285
Tokenth
Token ID400
Feature activation+2.50e+00

Pos logit contributions
orld+2.359
iggins+2.292
cessive+2.242
phal+2.117
angel+2.053
Neg logit contributions
 Ratio-2.728
 Spy-2.370
 Lerner-2.335
 UNCLASSIFIED-2.314
 DOI-2.283
Tokenauer
Token ID16261
Feature activation+2.26e+00

Pos logit contributions
 Ratio+1.014
 Spy+0.754
 UNCLASSIFIED+0.730
 Lerner+0.728
 DOI+0.709
Neg logit contributions
orld-2.750
cessive-2.660
iggins-2.647
phal-2.571
idi-2.512
Token,
Token ID11
Feature activation+3.53e+00

Pos logit contributions
 Ratio+1.314
 Spy+1.100
 Lerner+1.052
 UNCLASSIFIED+1.052
 DOI+1.047
Neg logit contributions
orld-2.198
iggins-2.160
cessive-2.128
phal-2.015
inent-2.001
Token The
Token ID383
Feature activation+5.08e+00

Pos logit contributions
 Ratio+2.160
 Spy+1.814
 UNCLASSIFIED+1.696
 Lerner+1.690
 Prevention+1.674
Neg logit contributions
orld-3.428
iggins-3.384
cessive-3.316
inent-3.118
phal-3.112
Token.
Token ID13
Feature activation+2.71e+00

Pos logit contributions
 Ratio+0.461
 Spy+0.392
 Lerner+0.386
 UNCLASSIFIED+0.373
 DOI+0.369
Neg logit contributions
orld-0.588
iggins-0.566
cessive-0.562
phal-0.542
angel-0.524
Token I
Token ID314
Feature activation+1.90e+00

Pos logit contributions
 Ratio+1.712
 Spy+1.442
 Lerner+1.414
 Depression+1.355
 UNCLASSIFIED+1.354
Neg logit contributions
orld-2.516
iggins-2.441
cessive-2.419
phal-2.329
inent-2.271
Token was
Token ID373
Feature activation-2.30e-01

Pos logit contributions
 Ratio+1.228
 Spy+1.051
 Lerner+1.018
 UNCLASSIFIED+1.005
 Depression+0.977
Neg logit contributions
orld-1.695
iggins-1.638
cessive-1.625
phal-1.565
angel-1.519
Token saying
Token ID2282
Feature activation+2.06e+00

Pos logit contributions
orld+0.198
iggins+0.192
cessive+0.190
phal+0.183
angel+0.177
Neg logit contributions
 Ratio-0.157
 Spy-0.133
 Lerner-0.131
 UNCLASSIFIED-0.127
 DOI-0.125
Token to
Token ID284
Feature activation+4.02e-01

Pos logit contributions
 Ratio+1.367
 Spy+1.151
 Lerner+1.131
 UNCLASSIFIED+1.094
 DOI+1.085
Neg logit contributions
orld-1.813
iggins-1.752
cessive-1.732
phal-1.678
angel-1.628
Token Brian
Token ID8403
Feature activation+1.30e+01

Pos logit contributions
 Ratio+0.268
 Spy+0.225
 Lerner+0.222
 UNCLASSIFIED+0.214
 DOI+0.213
Neg logit contributions
orld-0.353
iggins-0.341
cessive-0.337
phal-0.326
angel-0.316
Token,
Token ID11
Feature activation+7.20e+00

Pos logit contributions
 Ratio+7.225
 Lerner+6.173
 Spy+6.046
 Depression+5.702
 UNCLASSIFIED+5.620
Neg logit contributions
orld-11.420
cessive-11.126
iggins-11.087
phal-10.696
inent-10.480
Token "
Token ID366
Feature activation-7.74e-01

Pos logit contributions
 Ratio+4.421
 Spy+3.735
 Lerner+3.639
 Depression+3.481
 UNCLASSIFIED+3.459
Neg logit contributions
orld-6.559
iggins-6.278
cessive-6.249
phal-6.107
inent-5.899
TokenWe
Token ID1135
Feature activation-1.77e-01

Pos logit contributions
orld+0.554
iggins+0.531
cessive+0.524
phal+0.502
angel+0.482
Neg logit contributions
 Ratio-0.641
 Spy-0.559
 Lerner-0.552
 UNCLASSIFIED-0.538
 DOI-0.534
Token cannot
Token ID2314
Feature activation-2.92e+00

Pos logit contributions
orld+0.164
iggins+0.158
cessive+0.157
phal+0.152
angel+0.147
Neg logit contributions
 Ratio-0.108
 Spy-0.090
 Lerner-0.088
 UNCLASSIFIED-0.086
 DOI-0.085
Token hack
Token ID8156
Feature activation+8.10e-01

Pos logit contributions
orld+2.527
iggins+2.445
cessive+2.423
phal+2.336
angel+2.271
Neg logit contributions
 Ratio-1.956
 Lerner-1.620
 Spy-1.617
 UNCLASSIFIED-1.556
 DOI-1.545
Token.
Token ID13
Feature activation+2.22e+00

Pos logit contributions
 Ratio+2.738
 Spy+2.311
 Lerner+2.288
 DOI+2.226
 UNCLASSIFIED+2.217
Neg logit contributions
orld-3.685
iggins-3.580
cessive-3.536
phal-3.405
angel-3.328
Token (
Token ID357
Feature activation+6.06e+00

Pos logit contributions
 Ratio+1.489
 Spy+1.254
 Lerner+1.244
 UNCLASSIFIED+1.199
 Depression+1.191
Neg logit contributions
orld-1.953
iggins-1.889
cessive-1.857
phal-1.794
angel-1.748
TokenWa
Token ID33484
Feature activation+1.87e+00

Pos logit contributions
 Ratio+4.836
 Lerner+4.228
 Spy+4.221
 UNCLASSIFIED+4.144
 DOI+4.109
Neg logit contributions
orld-4.230
iggins-4.008
cessive-3.898
phal-3.738
inent-3.613
Tokenub
Token ID549
Feature activation-1.18e-01

Pos logit contributions
 Ratio+1.352
 UNCLASSIFIED+1.167
 Spy+1.154
 Lerner+1.138
alty+1.119
Neg logit contributions
orld-1.456
iggins-1.423
cessive-1.399
phal-1.353
idi-1.294
Tokenges
Token ID3212
Feature activation+4.13e+00

Pos logit contributions
orld+0.099
iggins+0.094
cessive+0.094
phal+0.090
idi+0.089
Neg logit contributions
 Ratio-0.080
 Spy-0.068
 UNCLASSIFIED-0.067
 Lerner-0.066
geries-0.066
Tokenhig
Token ID25196
Feature activation+1.29e+01

Pos logit contributions
 Ratio+3.630
 UNCLASSIFIED+3.318
alty+3.266
 DOI+3.257
 Lerner+3.254
Neg logit contributions
orld-2.408
iggins-2.263
cessive-2.219
phal-2.068
izations-2.031
Token Rice
Token ID13823
Feature activation-8.74e-01

Pos logit contributions
 Ratio+2.908
 Spy+1.526
 DOI+1.438
 UNCLASSIFIED+1.418
 Lerner+1.382
Neg logit contributions
orld-15.950
cessive-15.805
iggins-15.466
phal-15.335
inent-15.285
Token/
Token ID14
Feature activation+3.38e+00

Pos logit contributions
orld+0.773
iggins+0.748
cessive+0.740
phal+0.710
angel+0.690
Neg logit contributions
 Ratio-0.584
 Spy-0.491
 Lerner-0.483
 UNCLASSIFIED-0.473
 DOI-0.472
TokenCBC
Token ID29208
Feature activation+2.37e+00

Pos logit contributions
 Ratio+2.318
 UNCLASSIFIED+2.019
 DOI+1.980
 Spy+1.978
 Lerner+1.969
Neg logit contributions
orld-2.852
iggins-2.744
cessive-2.707
phal-2.523
inent-2.494
Token)
Token ID8
Feature activation-3.61e+00

Pos logit contributions
 Ratio+1.545
 Spy+1.277
 UNCLASSIFIED+1.248
 Lerner+1.237
 DOI+1.236
Neg logit contributions
orld-2.119
iggins-2.058
cessive-2.035
phal-1.953
angel-1.901
Token Carter
Token ID10831
Feature activation+1.38e+00

Pos logit contributions
orld+2.872
iggins+2.777
cessive+2.776
phal+2.662
idi+2.580
Neg logit contributions
 Ratio-2.586
 Spy-2.191
 Lerner-2.133
geries-2.101
 UNCLASSIFIED-2.092
Token
Token ID198
Feature activation+1.43e+00

Pos logit contributions
orld+0.289
iggins+0.282
cessive+0.278
phal+0.266
inent+0.259
Neg logit contributions
 Ratio-0.194
 Spy-0.163
 Lerner-0.160
 UNCLASSIFIED-0.158
 DOI-0.157
TokenAt
Token ID2953
Feature activation+3.14e+00

Pos logit contributions
 Ratio+1.197
 Spy+1.048
 Lerner+1.041
 UNCLASSIFIED+1.025
 DOI+1.008
Neg logit contributions
orld-1.003
iggins-0.962
cessive-0.943
phal-0.905
inent-0.864
Token this
Token ID428
Feature activation+3.39e+00

Pos logit contributions
 Ratio+2.073
 Spy+1.749
 Lerner+1.714
 UNCLASSIFIED+1.662
 DOI+1.659
Neg logit contributions
orld-2.749
iggins-2.652
cessive-2.628
phal-2.553
angel-2.463
Token pre
Token ID662
Feature activation-1.45e+00

Pos logit contributions
 Ratio+2.390
 Spy+2.035
 Lerner+2.016
 UNCLASSIFIED+1.964
 Depression+1.943
Neg logit contributions
orld-2.826
iggins-2.717
cessive-2.692
phal-2.604
angel-2.517
Token-
Token ID12
Feature activation+4.24e+00

Pos logit contributions
orld+1.143
cessive+1.112
iggins+1.111
phal+1.069
angel+1.029
Neg logit contributions
 Ratio-1.087
 Spy-0.923
 Lerner-0.906
 UNCLASSIFIED-0.887
 DOI-0.881
Tokentrial
Token ID45994
Feature activation+1.29e+01

Pos logit contributions
 Ratio+2.559
 Spy+2.145
 Lerner+2.142
 UNCLASSIFIED+2.057
 Depression+2.025
Neg logit contributions
orld-3.888
iggins-3.754
cessive-3.667
phal-3.594
idi-3.434
Token hearing
Token ID4854
Feature activation+7.48e+00

Pos logit contributions
 Ratio+6.094
 Lerner+5.173
 Spy+5.077
 UNCLASSIFIED+4.977
 Depression+4.863
Neg logit contributions
orld-12.159
iggins-11.854
cessive-11.802
phal-11.509
amph-11.108
Token on
Token ID319
Feature activation+2.01e+00

Pos logit contributions
 Ratio+4.656
 Spy+3.993
 Lerner+3.954
 UNCLASSIFIED+3.752
 DOI+3.692
Neg logit contributions
orld-6.676
iggins-6.507
cessive-6.441
phal-6.250
inent-6.045
Token Thursday
Token ID3635
Feature activation+3.95e+00

Pos logit contributions
 Ratio+1.517
 Spy+1.312
 Lerner+1.299
 UNCLASSIFIED+1.263
 DOI+1.252
Neg logit contributions
orld-1.583
iggins-1.513
cessive-1.495
phal-1.452
inent-1.390
Token,
Token ID11
Feature activation+1.71e+00

Pos logit contributions
 Ratio+2.750
 Spy+2.357
 Lerner+2.339
 UNCLASSIFIED+2.267
 DOI+2.236
Neg logit contributions
orld-3.305
iggins-3.204
cessive-3.168
phal-3.070
angel-2.946
Token Mel
Token ID5616
Feature activation-3.24e-01

Pos logit contributions
 Ratio+1.016
 Spy+0.834
 Lerner+0.822
 UNCLASSIFIED+0.792
 DOI+0.784
Neg logit contributions
orld-1.623
iggins-1.573
cessive-1.558
phal-1.514
angel-1.471
Token paper
Token ID3348
Feature activation-4.36e-01

Pos logit contributions
orld+0.789
iggins+0.762
cessive+0.753
phal+0.728
angel+0.707
Neg logit contributions
 Ratio-0.630
 Spy-0.530
 Lerner-0.523
 UNCLASSIFIED-0.508
 DOI-0.504
Token.
Token ID13
Feature activation-2.68e+00

Pos logit contributions
orld+0.349
iggins+0.336
cessive+0.332
phal+0.320
angel+0.310
Neg logit contributions
 Ratio-0.323
 Spy-0.275
 Lerner-0.272
 UNCLASSIFIED-0.265
 DOI-0.263
Token He
Token ID679
Feature activation-1.87e+00

Pos logit contributions
orld+2.289
iggins+2.209
cessive+2.188
phal+2.115
angel+2.058
Neg logit contributions
 Ratio-1.812
 Spy-1.517
 Lerner-1.502
 UNCLASSIFIED-1.459
 DOI-1.446
Token complained
Token ID13832
Feature activation-1.48e+00

Pos logit contributions
orld+1.558
iggins+1.503
cessive+1.486
phal+1.434
angel+1.391
Neg logit contributions
 Ratio-1.329
 Spy-1.127
 Lerner-1.113
 UNCLASSIFIED-1.083
 DOI-1.073
Token to
Token ID284
Feature activation-3.90e-01

Pos logit contributions
orld+1.243
iggins+1.200
cessive+1.187
phal+1.146
angel+1.112
Neg logit contributions
 Ratio-1.031
 Spy-0.869
 Lerner-0.858
 UNCLASSIFIED-0.835
 DOI-0.829
Token Alfred
Token ID22044
Feature activation+1.27e+01

Pos logit contributions
orld+0.352
iggins+0.341
cessive+0.337
phal+0.326
angel+0.317
Neg logit contributions
 Ratio-0.249
 Spy-0.207
 Lerner-0.204
 UNCLASSIFIED-0.198
 DOI-0.196
Token Kn
Token ID6102
Feature activation+3.22e+00

Pos logit contributions
 Ratio+7.319
 Lerner+6.549
 Spy+6.437
 Depression+5.913
 Sachs+5.804
Neg logit contributions
orld-11.313
cessive-10.894
iggins-10.716
phal-10.440
angel-10.047
Tokenop
Token ID404
Feature activation+5.21e+00

Pos logit contributions
 Ratio+2.006
 Spy+1.687
 Lerner+1.660
 DOI+1.619
 UNCLASSIFIED+1.612
Neg logit contributions
orld-2.805
cessive-2.752
iggins-2.734
phal-2.643
inent-2.547
Tokenf
Token ID69
Feature activation+5.28e+00

Pos logit contributions
 Ratio+3.338
range+2.918
 Spy+2.902
 UNCLASSIFIED+2.891
 Lerner+2.863
Neg logit contributions
orld-4.209
iggins-4.140
cessive-4.072
angel-3.773
phal-3.754
Token,
Token ID11
Feature activation+3.65e+00

Pos logit contributions
 Ratio+3.266
 Spy+2.764
 Lerner+2.709
 Depression+2.610
 UNCLASSIFIED+2.601
Neg logit contributions
orld-4.730
iggins-4.584
cessive-4.558
phal-4.412
angel-4.268
Token who
Token ID508
Feature activation+1.01e+00

Pos logit contributions
 Ratio+2.357
 Spy+2.014
 Lerner+1.956
 Depression+1.885
 UNCLASSIFIED+1.867
Neg logit contributions
orld-3.288
iggins-3.147
cessive-3.123
phal-3.033
idi-2.927
Token Turkey
Token ID7137
Feature activation+2.29e+00

Pos logit contributions
 Ratio+1.105
 Spy+0.930
 Lerner+0.913
 UNCLASSIFIED+0.888
 DOI+0.879
Neg logit contributions
orld-1.406
iggins-1.360
cessive-1.346
phal-1.301
angel-1.262
Token has
Token ID468
Feature activation-2.17e+00

Pos logit contributions
 Ratio+1.652
 Spy+1.414
 Lerner+1.390
 UNCLASSIFIED+1.357
 Depression+1.334
Neg logit contributions
orld-1.863
iggins-1.803
cessive-1.795
phal-1.720
angel-1.660
Token a
Token ID257
Feature activation-1.13e+00

Pos logit contributions
orld+1.868
iggins+1.803
cessive+1.786
phal+1.723
angel+1.673
Neg logit contributions
 Ratio-1.469
 Spy-1.235
 Lerner-1.219
 UNCLASSIFIED-1.182
 DOI-1.176
Token debt
Token ID5057
Feature activation-4.21e+00

Pos logit contributions
orld+0.877
iggins+0.843
cessive+0.833
phal+0.802
angel+0.776
Neg logit contributions
 Ratio-0.870
 Spy-0.747
 Lerner-0.739
 UNCLASSIFIED-0.720
 DOI-0.715
Token to
Token ID284
Feature activation-9.89e-01

Pos logit contributions
orld+3.216
iggins+3.104
cessive+3.069
phal+2.907
angel+2.863
Neg logit contributions
 Ratio-3.032
 Spy-2.717
 Lerner-2.702
 UNCLASSIFIED-2.624
 DOI-2.591
Token GDP
Token ID12396
Feature activation+1.27e+01

Pos logit contributions
orld+0.862
iggins+0.834
cessive+0.824
phal+0.797
angel+0.774
Neg logit contributions
 Ratio-0.656
 Spy-0.553
 Lerner-0.546
 UNCLASSIFIED-0.527
 DOI-0.523
Token ratio
Token ID8064
Feature activation+2.60e+00

Pos logit contributions
 Ratio+4.468
 Spy+1.974
 Lerner+1.917
 UNCLASSIFIED+1.852
 Depression+1.802
Neg logit contributions
orld-14.945
cessive-14.409
iggins-14.309
phal-14.130
inent-13.881
Token of
Token ID286
Feature activation-4.05e+00

Pos logit contributions
 Ratio+1.864
 Spy+1.573
 Lerner+1.555
 UNCLASSIFIED+1.512
 DOI+1.498
Neg logit contributions
orld-2.145
iggins-2.066
cessive-2.048
phal-1.971
angel-1.908
Token only
Token ID691
Feature activation-8.48e-01

Pos logit contributions
orld+3.162
iggins+3.047
cessive+3.007
phal+2.900
angel+2.816
Neg logit contributions
 Ratio-3.033
 Spy-2.597
 Lerner-2.570
 UNCLASSIFIED-2.488
 DOI-2.478
Token 2
Token ID362
Feature activation+9.15e-01

Pos logit contributions
orld+0.710
iggins+0.685
cessive+0.676
phal+0.653
angel+0.632
Neg logit contributions
 Ratio-0.602
 Spy-0.510
 Lerner-0.504
 UNCLASSIFIED-0.491
 DOI-0.486
Token.
Token ID13
Feature activation+1.24e+00

Pos logit contributions
 Ratio+0.826
 Spy+0.726
 Lerner+0.719
 UNCLASSIFIED+0.705
 DOI+0.700
Neg logit contributions
orld-0.585
iggins-0.557
cessive-0.549
phal-0.524
angel-0.502
Token S
Token ID311
Feature activation-2.35e+00

Pos logit contributions
orld+1.959
iggins+1.880
cessive+1.846
phal+1.773
angel+1.717
Neg logit contributions
 Ratio-2.500
 Spy-2.186
 Lerner-2.160
 UNCLASSIFIED-2.106
 DOI-2.097
Tokenenth
Token ID7944
Feature activation-3.67e+00

Pos logit contributions
orld+2.121
iggins+2.055
cessive+2.027
phal+1.977
angel+1.922
Neg logit contributions
 Ratio-1.486
 Spy-1.228
 Lerner-1.206
 UNCLASSIFIED-1.180
 DOI-1.169
Tokenil
Token ID346
Feature activation-1.73e+00

Pos logit contributions
orld+2.799
iggins+2.702
cessive+2.639
phal+2.571
idi+2.483
Neg logit contributions
 Ratio-2.846
 Spy-2.442
 Lerner-2.431
 UNCLASSIFIED-2.362
 DOI-2.349
Token Kumar
Token ID26105
Feature activation-1.88e+00

Pos logit contributions
orld+1.444
iggins+1.394
cessive+1.375
phal+1.337
angel+1.294
Neg logit contributions
 Ratio-1.217
 Spy-1.024
 Lerner-1.012
 UNCLASSIFIED-0.989
 DOI-0.980
Token/
Token ID14
Feature activation+3.40e+00

Pos logit contributions
orld+1.604
iggins+1.548
cessive+1.530
phal+1.489
angel+1.441
Neg logit contributions
 Ratio-1.286
 Spy-1.077
 Lerner-1.064
 UNCLASSIFIED-1.036
 DOI-1.027
TokenPress
Token ID13800
Feature activation+1.26e+01

Pos logit contributions
 Ratio+2.675
 Spy+2.302
 Lerner+2.252
 Prevention+2.232
 UNCLASSIFIED+2.229
Neg logit contributions
orld-2.530
iggins-2.435
cessive-2.410
phal-2.263
angel-2.226
Token Trust
Token ID9870
Feature activation+2.04e+00

Pos logit contributions
 Ratio+6.618
 Spy+5.450
 UNCLASSIFIED+5.242
 DOI+5.087
 Prevention+5.014
Neg logit contributions
orld-12.052
iggins-11.622
cessive-11.487
angel-11.217
phal-11.001
Token of
Token ID286
Feature activation+1.29e+00

Pos logit contributions
 Ratio+1.405
 Spy+1.170
 UNCLASSIFIED+1.140
 Lerner+1.131
 DOI+1.111
Neg logit contributions
orld-1.753
iggins-1.716
cessive-1.701
phal-1.610
angel-1.596
Token India
Token ID3794
Feature activation+2.50e+00

Pos logit contributions
 Ratio+0.778
 Spy+0.636
 Lerner+0.624
 UNCLASSIFIED+0.618
 DOI+0.600
Neg logit contributions
orld-1.215
cessive-1.182
iggins-1.181
phal-1.131
angel-1.110
Token via
Token ID2884
Feature activation+5.66e+00

Pos logit contributions
 Ratio+1.682
 Spy+1.411
 Lerner+1.381
 UNCLASSIFIED+1.353
 DOI+1.341
Neg logit contributions
orld-2.183
iggins-2.121
cessive-2.093
phal-2.016
angel-1.965
Token AP
Token ID3486
Feature activation+6.43e+00

Pos logit contributions
 Ratio+3.608
 Spy+3.018
 Lerner+2.905
 UNCLASSIFIED+2.890
 DOI+2.865
Neg logit contributions
orld-5.099
iggins-4.882
cessive-4.848
phal-4.676
angel-4.578
Token that
Token ID326
Feature activation+3.54e+00

Pos logit contributions
 Ratio+2.456
 Spy+2.085
 Lerner+2.011
 Depression+1.948
 DOI+1.943
Neg logit contributions
orld-3.404
iggins-3.328
cessive-3.265
phal-3.182
angel-3.091
Token origin
Token ID8159
Feature activation+6.61e-01

Pos logit contributions
 Ratio+2.376
 Spy+2.037
 Lerner+1.961
 Depression+1.920
 UNCLASSIFIED+1.909
Neg logit contributions
orld-3.045
iggins-2.972
cessive-2.947
phal-2.843
amph-2.753
Token that
Token ID326
Feature activation-1.53e+00

Pos logit contributions
 Ratio+0.441
 Spy+0.370
 Lerner+0.364
 UNCLASSIFIED+0.355
 DOI+0.350
Neg logit contributions
orld-0.577
iggins-0.559
cessive-0.553
phal-0.533
angel-0.518
Token Batman
Token ID9827
Feature activation+1.41e+00

Pos logit contributions
orld+1.303
iggins+1.256
cessive+1.243
phal+1.202
angel+1.168
Neg logit contributions
 Ratio-1.047
 Spy-0.880
 Lerner-0.870
 UNCLASSIFIED-0.847
 DOI-0.838
Token and
Token ID290
Feature activation+2.32e+00

Pos logit contributions
 Ratio+1.033
 Spy+0.886
 Lerner+0.874
 UNCLASSIFIED+0.851
 DOI+0.841
Neg logit contributions
orld-1.140
iggins-1.105
cessive-1.100
phal-1.048
angel-1.012
Token Dick
Token ID11740
Feature activation+1.26e+01

Pos logit contributions
 Ratio+1.437
 Spy+1.209
 Lerner+1.157
 Depression+1.121
 DOI+1.120
Neg logit contributions
orld-2.146
iggins-2.095
cessive-2.063
phal-2.002
angel-1.940
Token have
Token ID423
Feature activation+7.26e-01

Pos logit contributions
 Ratio+7.464
 Spy+6.235
 Lerner+6.156
 Raf+5.763
 DOI+5.751
Neg logit contributions
orld-10.607
cessive-10.571
iggins-10.499
phal-9.991
amph-9.747
Token of
Token ID286
Feature activation-1.42e+00

Pos logit contributions
 Ratio+0.499
 Spy+0.422
 Lerner+0.416
 UNCLASSIFIED+0.405
 DOI+0.400
Neg logit contributions
orld-0.619
iggins-0.600
cessive-0.593
phal-0.571
angel-0.555
Token having
Token ID1719
Feature activation-2.01e+00

Pos logit contributions
orld+1.252
iggins+1.207
cessive+1.199
phal+1.159
angel+1.129
Neg logit contributions
 Ratio-0.937
 Spy-0.772
 Lerner-0.770
 UNCLASSIFIED-0.751
 DOI-0.740
Token gone
Token ID3750
Feature activation+4.55e+00

Pos logit contributions
orld+1.692
iggins+1.625
cessive+1.615
phal+1.561
idi+1.515
Neg logit contributions
 Ratio-1.389
 Lerner-1.156
 Spy-1.153
 UNCLASSIFIED-1.126
 DOI-1.109
Token through
Token ID832
Feature activation+5.26e-01

Pos logit contributions
 Ratio+2.953
 Spy+2.537
 Lerner+2.419
 DOI+2.379
 UNCLASSIFIED+2.363
Neg logit contributions
orld-3.992
iggins-3.885
cessive-3.839
phal-3.722
inent-3.578
Tokenfamous
Token ID45143
Feature activation-4.01e-01

Pos logit contributions
orld+1.356
iggins+1.306
cessive+1.301
phal+1.261
angel+1.238
Neg logit contributions
 Ratio-1.090
 Spy-0.899
 Lerner-0.894
 DOI-0.866
 UNCLASSIFIED-0.861
Token author
Token ID1772
Feature activation-2.66e+00

Pos logit contributions
orld+0.342
iggins+0.330
cessive+0.326
phal+0.314
angel+0.304
Neg logit contributions
 Ratio-0.277
 Spy-0.235
 Lerner-0.231
 UNCLASSIFIED-0.225
 DOI-0.223
Token and
Token ID290
Feature activation+2.45e+00

Pos logit contributions
orld+2.234
iggins+2.154
cessive+2.134
phal+2.068
angel+2.019
Neg logit contributions
 Ratio-1.857
 Spy-1.559
 Lerner-1.546
 UNCLASSIFIED-1.500
 DOI-1.489
Token humanitarian
Token ID15706
Feature activation+2.08e-01

Pos logit contributions
 Ratio+1.702
 Spy+1.470
 Lerner+1.432
 UNCLASSIFIED+1.393
 Depression+1.393
Neg logit contributions
orld-2.074
iggins-1.996
cessive-1.981
phal-1.906
idi-1.835
Token.
Token ID13
Feature activation+1.78e+00

Pos logit contributions
 Ratio+0.142
 Spy+0.122
 Lerner+0.119
 UNCLASSIFIED+0.116
 DOI+0.115
Neg logit contributions
orld-0.178
iggins-0.171
cessive-0.170
phal-0.164
idi-0.158
Token Anne
Token ID15397
Feature activation+1.26e+01

Pos logit contributions
 Ratio+1.095
 Spy+0.909
 Lerner+0.894
 UNCLASSIFIED+0.864
 Depression+0.859
Neg logit contributions
orld-1.675
iggins-1.625
cessive-1.601
phal-1.546
idi-1.500
Token Sullivan
Token ID18501
Feature activation+4.90e+00

Pos logit contributions
 Ratio+7.120
 Lerner+6.249
 Spy+6.130
 Depression+5.851
 Gou+5.691
Neg logit contributions
orld-11.270
iggins-11.040
cessive-10.835
phal-10.573
idi-10.037
Token stayed
Token ID9658
Feature activation+6.83e-01

Pos logit contributions
 Ratio+3.237
 Lerner+2.748
 Spy+2.746
 UNCLASSIFIED+2.611
 Depression+2.606
Neg logit contributions
orld-4.294
iggins-4.128
cessive-4.106
phal-3.908
inent-3.806
Token at
Token ID379
Feature activation+5.08e-01

Pos logit contributions
 Ratio+0.464
 Spy+0.391
 Lerner+0.385
 UNCLASSIFIED+0.377
 Depression+0.371
Neg logit contributions
orld-0.588
iggins-0.572
cessive-0.562
phal-0.544
angel-0.527
Token her
Token ID607
Feature activation+1.40e+00

Pos logit contributions
 Ratio+0.320
 Spy+0.264
 Lerner+0.261
 UNCLASSIFIED+0.253
 DOI+0.250
Neg logit contributions
orld-0.463
iggins-0.448
cessive-0.443
phal-0.428
angel-0.418
Token side
Token ID1735
Feature activation+2.26e+00

Pos logit contributions
 Ratio+0.957
 Spy+0.809
 Lerner+0.795
 UNCLASSIFIED+0.772
 Depression+0.769
Neg logit contributions
orld-1.216
iggins-1.169
cessive-1.155
phal-1.118
angel-1.082
Tokent
Token ID83
Feature activation+3.71e+00

Pos logit contributions
 Ratio+1.495
 Spy+1.229
 Lerner+1.197
 UNCLASSIFIED+1.185
 DOI+1.149
Neg logit contributions
orld-2.815
iggins-2.771
cessive-2.711
phal-2.612
inent-2.538
Token.
Token ID13
Feature activation+6.69e+00

Pos logit contributions
 Ratio+2.196
 Spy+1.886
 UNCLASSIFIED+1.886
 DOI+1.839
 Lerner+1.839
Neg logit contributions
orld-3.456
iggins-3.375
cessive-3.350
phal-3.218
inent-3.108
Token A
Token ID317
Feature activation+2.31e+00

Pos logit contributions
 Ratio+4.103
 Spy+3.492
 Lerner+3.354
 UNCLASSIFIED+3.303
 DOI+3.254
Neg logit contributions
orld-6.103
iggins-5.965
cessive-5.954
phal-5.764
inent-5.603
TokenUS
Token ID2937
Feature activation+5.30e-01

Pos logit contributions
 Ratio+1.930
 Spy+1.693
 Lerner+1.661
 UNCLASSIFIED+1.626
 DOI+1.617
Neg logit contributions
orld-1.617
iggins-1.545
cessive-1.539
phal-1.482
angel-1.411
Token 5
Token ID642
Feature activation-2.08e+00

Pos logit contributions
 Ratio+0.372
 Lerner+0.312
 Spy+0.312
 UNCLASSIFIED+0.301
 Depression+0.297
Neg logit contributions
orld-0.445
iggins-0.431
cessive-0.426
phal-0.405
angel-0.396
Token Robbie
Token ID34855
Feature activation+1.25e+01

Pos logit contributions
orld+1.871
iggins+1.811
cessive+1.788
phal+1.735
angel+1.695
Neg logit contributions
 Ratio-1.312
 Spy-1.080
 Lerner-1.068
 UNCLASSIFIED-1.038
 DOI-1.027
Token M
Token ID337
Feature activation-4.79e+00

Pos logit contributions
 Ratio+5.695
 Spy+4.831
 Lerner+4.750
 UNCLASSIFIED+4.344
 Raf+4.114
Neg logit contributions
orld-12.511
cessive-12.053
iggins-11.876
phal-11.813
inent-11.745
Tokence
Token ID344
Feature activation-2.34e+00

Pos logit contributions
iggins+3.424
orld+3.413
cessive+3.228
angel+3.190
phal+3.149
Neg logit contributions
 Ratio-3.795
 Spy-3.329
 Lerner-3.223
 DOI-3.213
 UNCLASSIFIED-3.202
Tokenwen
Token ID21006
Feature activation+5.20e-02

Pos logit contributions
orld+1.852
iggins+1.783
cessive+1.760
phal+1.700
angel+1.644
Neg logit contributions
 Ratio-1.756
 Spy-1.500
 Lerner-1.484
 UNCLASSIFIED-1.445
 DOI-1.433
Token (
Token ID357
Feature activation+8.88e+00

Pos logit contributions
 Ratio+0.039
 Spy+0.033
 Lerner+0.033
 UNCLASSIFIED+0.032
 DOI+0.032
Neg logit contributions
orld-0.041
iggins-0.040
cessive-0.039
phal-0.038
angel-0.037
TokenGreen
Token ID13719
Feature activation+4.50e+00

Pos logit contributions
 Ratio+6.371
 Spy+5.776
 Lerner+5.570
 Scythe+5.446
range+5.314
Neg logit contributions
orld-6.375
cessive-6.232
iggins-6.221
phal-5.931
inent-5.786
Token a
Token ID257
Feature activation-1.07e-01

Pos logit contributions
 Ratio+0.554
 Spy+0.466
 Lerner+0.460
 UNCLASSIFIED+0.443
 Depression+0.440
Neg logit contributions
orld-0.749
iggins-0.723
cessive-0.715
phal-0.688
angel-0.669
Token result
Token ID1255
Feature activation+7.26e-01

Pos logit contributions
orld+0.089
iggins+0.086
cessive+0.085
phal+0.082
angel+0.080
Neg logit contributions
 Ratio-0.076
 Spy-0.064
 Lerner-0.063
 UNCLASSIFIED-0.061
 DOI-0.061
Token of
Token ID286
Feature activation-5.36e-01

Pos logit contributions
 Ratio+0.509
 Spy+0.430
 Lerner+0.425
 UNCLASSIFIED+0.413
 DOI+0.410
Neg logit contributions
orld-0.608
iggins-0.587
cessive-0.580
phal-0.561
angel-0.545
Token that
Token ID326
Feature activation+1.92e+00

Pos logit contributions
orld+0.475
iggins+0.460
cessive+0.455
phal+0.440
angel+0.428
Neg logit contributions
 Ratio-0.351
 Spy-0.293
 Lerner-0.289
 UNCLASSIFIED-0.280
 DOI-0.278
Token,
Token ID11
Feature activation+6.84e-01

Pos logit contributions
 Ratio+1.241
 Spy+1.038
 Lerner+1.028
 UNCLASSIFIED+0.995
 Depression+0.984
Neg logit contributions
orld-1.727
iggins-1.672
cessive-1.653
phal-1.605
angel-1.550
Token Fred
Token ID8559
Feature activation+1.24e+01

Pos logit contributions
 Ratio+0.454
 Spy+0.380
 Lerner+0.375
 UNCLASSIFIED+0.362
 DOI+0.360
Neg logit contributions
orld-0.605
iggins-0.585
cessive-0.577
phal-0.560
angel-0.541
Token's
Token ID338
Feature activation+3.66e+00

Pos logit contributions
 Ratio+7.819
 Spy+6.852
 Lerner+6.699
 UNCLASSIFIED+6.587
 Raf+6.238
Neg logit contributions
orld-10.135
cessive-9.740
iggins-9.649
phal-9.192
amph-8.993
Token right
Token ID826
Feature activation-2.95e+00

Pos logit contributions
 Ratio+2.459
 Spy+2.077
 Lerner+2.040
 UNCLASSIFIED+1.991
 Depression+1.973
Neg logit contributions
orld-3.182
iggins-3.076
cessive-3.032
phal-2.947
angel-2.848
Token,
Token ID11
Feature activation+2.86e-01

Pos logit contributions
orld+2.412
iggins+2.317
cessive+2.290
phal+2.226
angel+2.168
Neg logit contributions
 Ratio-2.108
 Spy-1.772
 Lerner-1.748
 UNCLASSIFIED-1.698
 DOI-1.697
Token we
Token ID356
Feature activation-8.66e-01

Pos logit contributions
 Ratio+0.192
 Spy+0.161
 Lerner+0.159
 UNCLASSIFIED+0.154
 DOI+0.153
Neg logit contributions
orld-0.250
iggins-0.242
cessive-0.239
phal-0.232
angel-0.224
Token look
Token ID804
Feature activation+3.64e+00

Pos logit contributions
orld+0.747
iggins+0.722
cessive+0.714
phal+0.690
angel+0.670
Neg logit contributions
 Ratio-0.588
 Spy-0.494
 Lerner-0.487
 UNCLASSIFIED-0.473
 DOI-0.469
Token
Token ID198
Feature activation-5.19e+00

Pos logit contributions
orld+1.171
iggins+1.142
cessive+1.124
phal+1.080
idi+1.046
Neg logit contributions
 Ratio-0.827
 Spy-0.698
 Lerner-0.684
 UNCLASSIFIED-0.672
 DOI-0.664
Token1989
Token ID25475
Feature activation-2.22e-01

Pos logit contributions
orld+3.672
cessive+3.540
iggins+3.502
phal+3.301
anth+3.245
Neg logit contributions
 Ratio-4.275
 Lerner-3.706
 Spy-3.690
 Scythe-3.590
 UNCLASSIFIED-3.567
Token Fred
Token ID8559
Feature activation+7.38e+00

Pos logit contributions
orld+0.197
iggins+0.191
cessive+0.189
phal+0.183
angel+0.177
Neg logit contributions
 Ratio-0.145
 Spy-0.121
 Lerner-0.120
 UNCLASSIFIED-0.116
 DOI-0.115
Token Savage
Token ID22287
Feature activation-3.97e+00

Pos logit contributions
 Ratio+4.562
 Lerner+3.897
 Spy+3.881
 UNCLASSIFIED+3.619
 Depression+3.559
Neg logit contributions
orld-6.598
iggins-6.338
cessive-6.331
phal-6.093
inent-5.904
Token 1990
Token ID6303
Feature activation-2.46e+00

Pos logit contributions
orld+3.146
iggins+3.027
cessive+2.988
phal+2.885
angel+2.780
Neg logit contributions
 Ratio-2.884
 Spy-2.434
 Lerner-2.401
 UNCLASSIFIED-2.378
 DOI-2.339
Token Ted
Token ID11396
Feature activation+1.24e+01

Pos logit contributions
orld+2.136
iggins+2.067
cessive+2.046
phal+1.987
angel+1.939
Neg logit contributions
 Ratio-1.618
 Spy-1.336
 Lerner-1.325
 UNCLASSIFIED-1.292
 DOI-1.284
Token D
Token ID360
Feature activation-2.27e+00

Pos logit contributions
 Ratio+5.001
 Lerner+4.179
 Spy+4.060
 UNCLASSIFIED+3.921
 DOI+3.782
Neg logit contributions
orld-13.171
iggins-12.903
cessive-12.849
phal-12.336
amph-12.220
Tokenanson
Token ID23103
Feature activation-5.55e-02

Pos logit contributions
orld+1.260
iggins+1.183
cessive+1.170
phal+1.111
idi+1.044
Neg logit contributions
 Ratio-2.231
 Spy-1.984
 Lerner-1.973
 UNCLASSIFIED-1.948
 DOI-1.931
Token
Token ID198
Feature activation-7.77e-01

Pos logit contributions
orld+0.046
iggins+0.044
cessive+0.043
phal+0.042
angel+0.040
Neg logit contributions
 Ratio-0.040
 Spy-0.034
 Lerner-0.034
 UNCLASSIFIED-0.033
 DOI-0.032
Token
Token ID198
Feature activation-4.59e+00

Pos logit contributions
orld+0.712
iggins+0.697
cessive+0.684
phal+0.657
inent+0.637
Neg logit contributions
 Ratio-0.487
 Spy-0.410
 Lerner-0.402
 UNCLASSIFIED-0.396
 DOI-0.392
Token1990
Token ID19891
Feature activation-2.04e+00

Pos logit contributions
orld+3.417
cessive+3.282
iggins+3.266
phal+3.109
angel+3.038
Neg logit contributions
 Ratio-3.633
 Spy-3.155
 Lerner-3.112
 DOI-3.035
 Scythe-3.009
Tokencom
Token ID785
Feature activation-6.58e-01

Pos logit contributions
orld+0.924
iggins+0.868
cessive+0.851
phal+0.799
angel+0.761
Neg logit contributions
 Ratio-1.909
 Spy-1.708
 Lerner-1.698
 UNCLASSIFIED-1.668
 DOI-1.656
Token,[
Token ID17414
Feature activation-4.46e+00

Pos logit contributions
orld+0.551
iggins+0.532
cessive+0.527
phal+0.508
angel+0.493
Neg logit contributions
 Ratio-0.464
 Spy-0.393
 Lerner-0.386
 UNCLASSIFIED-0.375
 DOI-0.372
Token91
Token ID6420
Feature activation-8.78e+00

Pos logit contributions
orld+2.959
iggins+2.834
cessive+2.776
phal+2.671
angel+2.597
Neg logit contributions
 Ratio-3.897
 Spy-3.409
 Lerner-3.401
 UNCLASSIFIED-3.313
 Depression-3.251
Token]
Token ID60
Feature activation-4.56e+00

Pos logit contributions
orld+3.102
iggins+2.847
cessive+2.806
phal+2.579
angel+2.434
Neg logit contributions
 Ratio-9.699
 Lerner-8.821
 Spy-8.727
 UNCLASSIFIED-8.725
geries-8.568
Token NBC
Token ID9716
Feature activation-5.23e+00

Pos logit contributions
orld+3.787
iggins+3.657
cessive+3.591
phal+3.468
idi+3.376
Neg logit contributions
 Ratio-3.172
 Lerner-2.646
 Spy-2.618
 UNCLASSIFIED-2.569
 DOI-2.545
TokenSports
Token ID18153
Feature activation-1.45e+01

Pos logit contributions
orld+3.781
iggins+3.622
cessive+3.541
phal+3.429
angel+3.321
Neg logit contributions
 Ratio-4.238
 Lerner-3.654
 Spy-3.625
 UNCLASSIFIED-3.541
 DOI-3.525
Token.
Token ID13
Feature activation-5.10e+00

Pos logit contributions
orld+9.377
iggins+8.417
cessive+8.154
phal+8.027
angel+7.865
Neg logit contributions
 Ratio-11.602
 Lerner-10.063
 UNCLASSIFIED-9.838
geries-9.814
 Spy-9.763
Tokencom
Token ID785
Feature activation-3.67e+00

Pos logit contributions
orld+2.520
iggins+2.362
cessive+2.318
phal+2.177
angel+2.068
Neg logit contributions
 Ratio-5.328
 Spy-4.770
 Lerner-4.737
 UNCLASSIFIED-4.653
 DOI-4.626
Token,[
Token ID17414
Feature activation-4.29e+00

Pos logit contributions
orld+2.876
iggins+2.745
cessive+2.705
phal+2.617
angel+2.525
Neg logit contributions
 Ratio-2.768
 Lerner-2.356
 Spy-2.348
 UNCLASSIFIED-2.291
 DOI-2.265
Token92
Token ID5892
Feature activation-6.73e+00

Pos logit contributions
orld+2.346
iggins+2.235
cessive+2.166
phal+2.071
angel+2.028
Neg logit contributions
 Ratio-4.239
 Spy-3.729
 Lerner-3.705
 UNCLASSIFIED-3.658
 DOI-3.626
Token]
Token ID60
Feature activation-4.44e+00

Pos logit contributions
orld+2.606
iggins+2.354
cessive+2.338
phal+2.196
angel+2.024
Neg logit contributions
 Ratio-7.406
 Lerner-6.682
 UNCLASSIFIED-6.665
 Spy-6.646
geries-6.538
Token four
Token ID1440
Feature activation+3.35e-01

Pos logit contributions
 Ratio+0.051
 Spy+0.043
 Lerner+0.042
 UNCLASSIFIED+0.041
 DOI+0.041
Neg logit contributions
orld-0.064
iggins-0.062
cessive-0.061
phal-0.059
angel-0.057
Token unknown
Token ID6439
Feature activation-2.49e+00

Pos logit contributions
 Ratio+0.232
 Spy+0.197
 Lerner+0.193
 UNCLASSIFIED+0.189
 DOI+0.188
Neg logit contributions
orld-0.286
iggins-0.275
cessive-0.271
phal-0.261
angel-0.253
Token assailants
Token ID49690
Feature activation-1.56e+00

Pos logit contributions
orld+1.890
iggins+1.820
cessive+1.800
phal+1.737
angel+1.677
Neg logit contributions
 Ratio-1.936
 Spy-1.661
 Lerner-1.648
 UNCLASSIFIED-1.597
 DOI-1.588
Token.
Token ID13
Feature activation-8.08e-01

Pos logit contributions
orld+1.323
iggins+1.280
cessive+1.263
phal+1.220
angel+1.181
Neg logit contributions
 Ratio-1.082
 Spy-0.917
 Lerner-0.902
 UNCLASSIFIED-0.880
 DOI-0.872
Token A
Token ID317
Feature activation-1.79e+00

Pos logit contributions
orld+0.709
iggins+0.686
cessive+0.677
phal+0.655
angel+0.634
Neg logit contributions
 Ratio-0.542
 Spy-0.457
 Lerner-0.449
 UNCLASSIFIED-0.437
 DOI-0.433
Token lecturer
Token ID40228
Feature activation-1.42e+01

Pos logit contributions
orld+1.495
iggins+1.443
cessive+1.427
phal+1.378
angel+1.337
Neg logit contributions
 Ratio-1.257
 Spy-1.062
 Lerner-1.050
 UNCLASSIFIED-1.020
 DOI-1.011
Token at
Token ID379
Feature activation-4.89e+00

Pos logit contributions
orld+9.583
iggins+9.222
cessive+9.170
idi+8.931
phal+8.906
Neg logit contributions
 Ratio-10.499
 DOI-8.626
 Lerner-8.588
 Spy-8.583
geries-8.557
Token the
Token ID262
Feature activation-2.88e-01

Pos logit contributions
orld+4.107
iggins+3.964
cessive+3.929
phal+3.787
angel+3.706
Neg logit contributions
 Ratio-3.292
 Spy-2.719
 Lerner-2.694
 UNCLASSIFIED-2.654
 DOI-2.604
Token School
Token ID3961
Feature activation-3.41e+00

Pos logit contributions
orld+0.253
iggins+0.244
cessive+0.242
phal+0.234
angel+0.227
Neg logit contributions
 Ratio-0.191
 Spy-0.160
 Lerner-0.158
 UNCLASSIFIED-0.153
 DOI-0.152
Token of
Token ID286
Feature activation-3.29e+00

Pos logit contributions
orld+2.666
iggins+2.567
cessive+2.533
phal+2.462
angel+2.376
Neg logit contributions
 Ratio-2.544
 Spy-2.171
 Lerner-2.142
 UNCLASSIFIED-2.099
 DOI-2.073
Token Journal
Token ID4913
Feature activation-1.07e-01

Pos logit contributions
orld+2.945
iggins+2.835
cessive+2.807
phal+2.718
angel+2.648
Neg logit contributions
 Ratio-2.083
 Spy-1.724
 Lerner-1.708
 UNCLASSIFIED-1.655
 DOI-1.641
Token Lear
Token ID8010
Feature activation-1.32e+00

Pos logit contributions
 Ratio+1.031
 Lerner+0.852
 Spy+0.844
 UNCLASSIFIED+0.795
 Depression+0.790
Neg logit contributions
orld-1.425
cessive-1.368
iggins-1.356
phal-1.337
idi-1.276
Tokeny
Token ID88
Feature activation-2.48e+00

Pos logit contributions
orld+0.861
iggins+0.821
cessive+0.805
phal+0.773
angel+0.744
Neg logit contributions
 Ratio-1.170
 Spy-1.023
 Lerner-1.014
 UNCLASSIFIED-0.995
 DOI-0.988
Token and
Token ID290
Feature activation-1.11e+00

Pos logit contributions
orld+2.089
iggins+2.011
cessive+1.974
phal+1.922
angel+1.873
Neg logit contributions
 Ratio-1.738
 Spy-1.460
 Lerner-1.439
 UNCLASSIFIED-1.395
 DOI-1.392
Token cornerback
Token ID26683
Feature activation-2.67e+00

Pos logit contributions
orld+0.890
iggins+0.857
cessive+0.846
phal+0.817
angel+0.791
Neg logit contributions
 Ratio-0.817
 Spy-0.694
 Lerner-0.687
 UNCLASSIFIED-0.670
 DOI-0.664
Token Marcus
Token ID17068
Feature activation-2.67e+00

Pos logit contributions
orld+2.375
iggins+2.299
cessive+2.283
phal+2.222
angel+2.166
Neg logit contributions
 Ratio-1.684
 Spy-1.379
 Lerner-1.370
 UNCLASSIFIED-1.333
 DOI-1.327
Token R
Token ID371
Feature activation-1.42e+01

Pos logit contributions
orld+2.503
iggins+2.428
cessive+2.401
phal+2.325
angel+2.274
Neg logit contributions
 Ratio-1.604
 Spy-1.305
 Lerner-1.287
 UNCLASSIFIED-1.247
 DOI-1.234
Tokenios
Token ID4267
Feature activation-6.64e+00

Pos logit contributions
iggins+9.978
angel+9.250
orld+9.245
amph+8.745
phal+8.618
Neg logit contributions
 Ratio-11.922
 Spy-10.026
 Prevention-9.969
 Lerner-9.840
 ã‚µãƒ¼ãƒ†ã‚£ãƒ¯ãƒ³-9.825
Token were
Token ID547
Feature activation-7.83e-01

Pos logit contributions
orld+5.299
iggins+5.102
cessive+4.988
phal+4.836
idi+4.776
Neg logit contributions
 Ratio-4.827
 Spy-4.063
 Lerner-4.007
 DOI-3.879
 Depression-3.870
Token held
Token ID2714
Feature activation+3.94e-01

Pos logit contributions
orld+0.643
iggins+0.619
cessive+0.612
phal+0.591
angel+0.572
Neg logit contributions
 Ratio-0.565
 Spy-0.478
 Lerner-0.473
 UNCLASSIFIED-0.461
 DOI-0.457
Token out
Token ID503
Feature activation-7.69e-01

Pos logit contributions
 Ratio+0.274
 Spy+0.232
 Lerner+0.228
 UNCLASSIFIED+0.222
 DOI+0.221
Neg logit contributions
orld-0.333
iggins-0.322
cessive-0.318
phal-0.307
angel-0.298
Token of
Token ID286
Feature activation-2.18e-01

Pos logit contributions
orld+0.630
iggins+0.608
cessive+0.601
phal+0.579
angel+0.561
Neg logit contributions
 Ratio-0.554
 Spy-0.472
 Lerner-0.466
 UNCLASSIFIED-0.452
 DOI-0.449
Tokencom
Token ID785
Feature activation-2.64e+00

Pos logit contributions
orld+2.252
iggins+2.122
cessive+2.081
phal+1.957
angel+1.852
Neg logit contributions
 Ratio-4.663
 Spy-4.177
 Lerner-4.143
 UNCLASSIFIED-4.070
 DOI-4.048
Token,[
Token ID17414
Feature activation-4.43e+00

Pos logit contributions
orld+2.071
iggins+1.986
cessive+1.956
phal+1.891
angel+1.829
Neg logit contributions
 Ratio-1.991
 Spy-1.697
 Lerner-1.689
 UNCLASSIFIED-1.646
 DOI-1.631
Token96
Token ID4846
Feature activation-6.92e+00

Pos logit contributions
orld+2.542
iggins+2.448
cessive+2.366
phal+2.276
angel+2.216
Neg logit contributions
 Ratio-4.213
 Lerner-3.735
 Spy-3.684
 UNCLASSIFIED-3.607
 DOI-3.563
Token]
Token ID60
Feature activation-1.71e+00

Pos logit contributions
orld+2.679
iggins+2.423
cessive+2.405
phal+2.226
angel+2.099
Neg logit contributions
 Ratio-7.594
 Lerner-6.867
 UNCLASSIFIED-6.813
 Spy-6.770
 DOI-6.690
Token NBC
Token ID9716
Feature activation-3.99e+00

Pos logit contributions
orld+1.477
iggins+1.427
cessive+1.409
phal+1.363
angel+1.323
Neg logit contributions
 Ratio-1.152
 Spy-0.959
 Lerner-0.953
 UNCLASSIFIED-0.925
 DOI-0.918
TokenSports
Token ID18153
Feature activation-1.37e+01

Pos logit contributions
orld+2.604
iggins+2.485
cessive+2.426
phal+2.337
angel+2.247
Neg logit contributions
 Ratio-3.532
 Spy-3.074
 Lerner-3.073
 UNCLASSIFIED-3.002
 DOI-2.990
Token.
Token ID13
Feature activation-4.80e+00

Pos logit contributions
orld+8.605
iggins+7.725
cessive+7.241
phal+7.213
angel+7.129
Neg logit contributions
 Ratio-11.323
 UNCLASSIFIED-9.672
 Lerner-9.632
geries-9.546
 Scythe-9.506
Tokencom
Token ID785
Feature activation-3.22e+00

Pos logit contributions
orld+2.410
iggins+2.272
cessive+2.228
phal+2.095
angel+1.982
Neg logit contributions
 Ratio-4.990
 Spy-4.471
 Lerner-4.434
 UNCLASSIFIED-4.356
 DOI-4.332
Token,[
Token ID17414
Feature activation-2.63e+00

Pos logit contributions
orld+2.238
iggins+2.126
cessive+2.085
phal+2.009
angel+1.937
Neg logit contributions
 Ratio-2.720
 Spy-2.350
 Lerner-2.348
 UNCLASSIFIED-2.301
 DOI-2.275
Token92
Token ID5892
Feature activation-3.24e+00

Pos logit contributions
orld+1.625
iggins+1.567
cessive+1.514
phal+1.448
angel+1.410
Neg logit contributions
 Ratio-2.427
 Lerner-2.116
 Spy-2.098
 UNCLASSIFIED-2.064
 Depression-2.046
Token]
Token ID60
Feature activation-8.51e-01

Pos logit contributions
orld+1.438
iggins+1.306
cessive+1.304
phal+1.215
angel+1.135
Neg logit contributions
 Ratio-3.511
 Lerner-3.152
 Spy-3.139
 UNCLASSIFIED-3.135
 DOI-3.085
Token Today
Token ID6288
Feature activation-4.38e+00

Pos logit contributions
orld+4.314
iggins+4.143
cessive+4.054
phal+3.979
angel+3.824
Neg logit contributions
 Ratio-3.692
 Lerner-3.085
 Spy-3.060
 UNCLASSIFIED-2.984
 DOI-2.982
Token,[
Token ID17414
Feature activation-2.56e+00

Pos logit contributions
orld+3.456
iggins+3.314
cessive+3.248
phal+3.145
angel+3.040
Neg logit contributions
 Ratio-3.270
 Lerner-2.783
 Spy-2.770
 UNCLASSIFIED-2.718
 DOI-2.696
Token95
Token ID3865
Feature activation-8.27e+00

Pos logit contributions
orld+1.673
iggins+1.601
cessive+1.576
phal+1.515
angel+1.458
Neg logit contributions
 Ratio-2.275
 Spy-1.988
 Lerner-1.980
 UNCLASSIFIED-1.931
 DOI-1.912
Token]
Token ID60
Feature activation-2.94e+00

Pos logit contributions
orld+3.058
iggins+2.772
cessive+2.755
phal+2.506
angel+2.405
Neg logit contributions
 Ratio-9.025
 Lerner-8.208
 UNCLASSIFIED-8.104
 Spy-8.083
 DOI-7.996
Token CBS
Token ID11133
Feature activation-3.38e+00

Pos logit contributions
orld+2.532
iggins+2.448
cessive+2.406
phal+2.331
angel+2.265
Neg logit contributions
 Ratio-1.984
 Lerner-1.648
 Spy-1.642
 UNCLASSIFIED-1.589
 DOI-1.579
TokenSports
Token ID18153
Feature activation-1.32e+01

Pos logit contributions
orld+2.358
iggins+2.255
cessive+2.213
phal+2.130
angel+2.057
Neg logit contributions
 Ratio-2.847
 Spy-2.471
 Lerner-2.460
 UNCLASSIFIED-2.401
 DOI-2.390
Token.
Token ID13
Feature activation-4.49e+00

Pos logit contributions
orld+8.635
iggins+7.807
cessive+7.425
phal+7.398
angel+7.313
Neg logit contributions
 Ratio-10.707
 Lerner-9.267
geries-9.062
 UNCLASSIFIED-9.020
 Spy-8.995
Tokencom
Token ID785
Feature activation-2.64e+00

Pos logit contributions
orld+2.252
iggins+2.122
cessive+2.081
phal+1.957
angel+1.852
Neg logit contributions
 Ratio-4.663
 Spy-4.177
 Lerner-4.143
 UNCLASSIFIED-4.070
 DOI-4.048
Token,[
Token ID17414
Feature activation-4.43e+00

Pos logit contributions
orld+2.071
iggins+1.986
cessive+1.956
phal+1.891
angel+1.829
Neg logit contributions
 Ratio-1.991
 Spy-1.697
 Lerner-1.689
 UNCLASSIFIED-1.646
 DOI-1.631
Token96
Token ID4846
Feature activation-6.92e+00

Pos logit contributions
orld+2.542
iggins+2.448
cessive+2.366
phal+2.276
angel+2.216
Neg logit contributions
 Ratio-4.213
 Lerner-3.735
 Spy-3.684
 UNCLASSIFIED-3.607
 DOI-3.563
Token]
Token ID60
Feature activation-1.71e+00

Pos logit contributions
orld+2.679
iggins+2.423
cessive+2.405
phal+2.226
angel+2.099
Neg logit contributions
 Ratio-7.594
 Lerner-6.867
 UNCLASSIFIED-6.813
 Spy-6.770
 DOI-6.690
Token's
Token ID338
Feature activation-2.74e+00

Pos logit contributions
orld+6.987
iggins+6.832
cessive+6.758
phal+6.485
idi+6.312
Neg logit contributions
 Ratio-7.154
 Spy-6.078
 Lerner-5.961
 UNCLASSIFIED-5.876
 Scythe-5.849
Token suspension
Token ID11461
Feature activation-4.81e+00

Pos logit contributions
orld+2.413
iggins+2.345
cessive+2.322
phal+2.242
angel+2.177
Neg logit contributions
 Ratio-1.787
 Spy-1.488
 Lerner-1.461
 UNCLASSIFIED-1.426
 DOI-1.407
Token of
Token ID286
Feature activation-3.43e+00

Pos logit contributions
orld+3.649
iggins+3.545
cessive+3.512
phal+3.405
angel+3.251
Neg logit contributions
 Ratio-3.660
 Spy-3.099
 Lerner-3.047
 UNCLASSIFIED-3.024
 DOI-2.972
Token a
Token ID257
Feature activation-1.31e+00

Pos logit contributions
orld+2.895
iggins+2.814
cessive+2.788
phal+2.689
angel+2.614
Neg logit contributions
 Ratio-2.321
 Spy-1.940
 Lerner-1.891
 UNCLASSIFIED-1.877
 DOI-1.845
Token male
Token ID4257
Feature activation-7.29e-01

Pos logit contributions
orld+1.075
iggins+1.036
cessive+1.024
phal+0.988
angel+0.958
Neg logit contributions
 Ratio-0.944
 Spy-0.802
 Lerner-0.792
 UNCLASSIFIED-0.771
 DOI-0.765
Token student
Token ID3710
Feature activation-1.32e+01

Pos logit contributions
orld+0.579
iggins+0.555
cessive+0.549
phal+0.527
angel+0.510
Neg logit contributions
 Ratio-0.549
 Spy-0.473
 Lerner-0.467
 UNCLASSIFIED-0.453
 DOI-0.451
Token who
Token ID508
Feature activation-5.24e+00

Pos logit contributions
orld+8.802
cessive+8.592
iggins+8.421
phal+8.325
idi+8.053
Neg logit contributions
 Ratio-9.856
geries-8.100
 Spy-8.076
 DOI-8.013
 Scythe-7.961
Token argues
Token ID11673
Feature activation+4.18e-01

Pos logit contributions
orld+4.178
iggins+4.007
cessive+3.998
phal+3.856
idi+3.738
Neg logit contributions
 Ratio-3.870
 Spy-3.241
 Lerner-3.180
 UNCLASSIFIED-3.115
 DOI-3.103
Token he
Token ID339
Feature activation+3.88e-03

Pos logit contributions
 Ratio+0.252
 Spy+0.207
 Lerner+0.204
 UNCLASSIFIED+0.197
 DOI+0.195
Neg logit contributions
orld-0.392
iggins-0.380
cessive-0.376
phal-0.365
angel-0.355
Token was
Token ID373
Feature activation-1.15e+00

Pos logit contributions
 Ratio+0.003
 Spy+0.002
 Lerner+0.002
 UNCLASSIFIED+0.002
 DOI+0.002
Neg logit contributions
orld-0.003
iggins-0.003
cessive-0.003
phal-0.003
angel-0.003
Token completely
Token ID3190
Feature activation+3.09e-01

Pos logit contributions
orld+0.918
iggins+0.886
cessive+0.874
phal+0.843
angel+0.816
Neg logit contributions
 Ratio-0.850
 Spy-0.727
 Lerner-0.718
 UNCLASSIFIED-0.699
 DOI-0.693
Token Cre
Token ID5844
Feature activation-2.62e+00

Pos logit contributions
orld+4.435
iggins+4.275
cessive+4.238
phal+4.054
angel+3.996
Neg logit contributions
 Ratio-3.858
 Spy-3.306
 Lerner-3.191
 UNCLASSIFIED-3.183
 DOI-3.132
Tokenighton
Token ID42993
Feature activation-4.23e+00

Pos logit contributions
orld+1.784
iggins+1.701
cessive+1.679
phal+1.595
angel+1.533
Neg logit contributions
 Ratio-2.260
 Spy-1.969
 Lerner-1.958
 UNCLASSIFIED-1.921
 DOI-1.905
Token University
Token ID2059
Feature activation-5.79e+00

Pos logit contributions
orld+3.893
iggins+3.758
cessive+3.724
phal+3.620
angel+3.537
Neg logit contributions
 Ratio-2.571
 Spy-2.129
 Lerner-2.071
 UNCLASSIFIED-2.008
 DOI-2.007
Token,
Token ID11
Feature activation+2.63e+00

Pos logit contributions
orld+4.516
iggins+4.366
cessive+4.304
phal+4.160
angel+4.064
Neg logit contributions
 Ratio-4.209
 Spy-3.610
 Lerner-3.484
 DOI-3.415
 UNCLASSIFIED-3.400
Token the
Token ID262
Feature activation+3.25e-01

Pos logit contributions
 Ratio+1.758
 Spy+1.471
 Lerner+1.468
 UNCLASSIFIED+1.402
 Depression+1.393
Neg logit contributions
orld-2.326
iggins-2.230
cessive-2.216
phal-2.142
idi-2.057
Token Rev
Token ID5416
Feature activation-1.31e+01

Pos logit contributions
 Ratio+0.234
 Spy+0.197
 Lerner+0.196
 UNCLASSIFIED+0.190
 DOI+0.188
Neg logit contributions
orld-0.270
iggins-0.259
cessive-0.257
phal-0.249
angel-0.240
Token.
Token ID13
Feature activation-7.88e+00

Pos logit contributions
orld+7.719
iggins+7.683
cessive+7.530
angel+7.501
idi+7.036
Neg logit contributions
 Ratio-10.876
geries-9.318
 Spy-9.276
 DOI-9.241
 ã‚µãƒ¼ãƒ†ã‚£ãƒ¯ãƒ³-9.131
Token Timothy
Token ID22283
Feature activation+9.71e-01

Pos logit contributions
orld+6.285
iggins+6.083
cessive+6.045
angel+5.878
phal+5.859
Neg logit contributions
 Ratio-5.144
 Spy-4.249
geries-4.183
 DOI-4.182
 UNCLASSIFIED-4.147
Token L
Token ID406
Feature activation-4.13e+00

Pos logit contributions
 Ratio+0.572
 Spy+0.467
 Lerner+0.463
 UNCLASSIFIED+0.442
 DOI+0.438
Neg logit contributions
orld-0.931
iggins-0.902
cessive-0.895
phal-0.865
angel-0.841
Tokenannon
Token ID8825
Feature activation-3.38e+00

Pos logit contributions
orld+3.157
iggins+3.092
cessive+2.950
angel+2.874
phal+2.841
Neg logit contributions
 Ratio-3.157
 Spy-2.718
 DOI-2.638
 Lerner-2.629
 UNCLASSIFIED-2.597
Token,
Token ID11
Feature activation-7.28e-01

Pos logit contributions
orld+2.719
iggins+2.630
cessive+2.560
phal+2.527
angel+2.462
Neg logit contributions
 Ratio-2.424
 Spy-2.055
 UNCLASSIFIED-1.980
 DOI-1.974
 Lerner-1.950
TokenMore
Token ID5167
Feature activation+3.08e+00

Pos logit contributions
 Ratio+0.177
 Spy+0.153
 Lerner+0.153
 UNCLASSIFIED+0.150
 DOI+0.148
Neg logit contributions
orld-0.164
iggins-0.157
cessive-0.154
phal-0.148
angel-0.142
Token:
Token ID25
Feature activation-3.49e+00

Pos logit contributions
 Ratio+1.675
 Spy+1.341
 Lerner+1.332
 UNCLASSIFIED+1.292
 DOI+1.271
Neg logit contributions
orld-3.068
iggins-2.940
cessive-2.932
phal-2.829
angel-2.786
Token Judge
Token ID8974
Feature activation-2.96e+00

Pos logit contributions
orld+3.064
iggins+2.953
cessive+2.935
phal+2.847
angel+2.766
Neg logit contributions
 Ratio-2.243
 Spy-1.866
 Lerner-1.837
 UNCLASSIFIED-1.784
 DOI-1.761
Token sides
Token ID5389
Feature activation+2.83e+00

Pos logit contributions
orld+2.265
iggins+2.188
cessive+2.161
phal+2.072
angel+2.003
Neg logit contributions
 Ratio-2.265
 Spy-1.944
 Lerner-1.913
 UNCLASSIFIED-1.867
 DOI-1.857
Token with
Token ID351
Feature activation+3.65e+00

Pos logit contributions
 Ratio+1.866
 Spy+1.558
 Lerner+1.539
 UNCLASSIFIED+1.492
 DOI+1.479
Neg logit contributions
orld-2.487
iggins-2.404
cessive-2.377
phal-2.300
angel-2.235
Token UC
Token ID14417
Feature activation-1.30e+01

Pos logit contributions
 Ratio+2.472
 Lerner+2.092
 Spy+2.077
 DOI+1.993
 Depression+1.975
Neg logit contributions
orld-3.199
iggins-3.041
cessive-3.011
phal-2.910
inent-2.829
Token grad
Token ID3915
Feature activation-9.26e+00

Pos logit contributions
orld+9.318
iggins+9.308
cessive+9.122
phal+8.714
angel+8.660
Neg logit contributions
 Ratio-8.910
 Spy-7.830
geries-7.680
 Lerner-7.634
 UNCLASSIFIED-7.611
Token student
Token ID3710
Feature activation-6.48e+00

Pos logit contributions
orld+6.852
iggins+6.793
cessive+6.753
phal+6.436
angel+6.288
Neg logit contributions
 Ratio-6.606
 Spy-5.492
 Lerner-5.446
 DOI-5.330
 UNCLASSIFIED-5.325
Token accused
Token ID5371
Feature activation-6.50e-01

Pos logit contributions
orld+5.073
iggins+4.941
cessive+4.908
phal+4.728
idi+4.585
Neg logit contributions
 Ratio-4.613
 Spy-3.856
 Lerner-3.801
 UNCLASSIFIED-3.717
 DOI-3.710
Token of
Token ID286
Feature activation-4.69e-01

Pos logit contributions
orld+0.508
iggins+0.487
cessive+0.483
phal+0.466
angel+0.452
Neg logit contributions
 Ratio-0.494
 Spy-0.421
 Lerner-0.416
 UNCLASSIFIED-0.408
 DOI-0.402
Token rape
Token ID7262
Feature activation+1.62e+00

Pos logit contributions
orld+0.384
iggins+0.370
cessive+0.364
phal+0.351
angel+0.342
Neg logit contributions
 Ratio-0.340
 Spy-0.291
 Lerner-0.287
 UNCLASSIFIED-0.279
 DOI-0.277
Token Airport
Token ID12690
Feature activation+4.34e+00

Pos logit contributions
orld+0.054
iggins+0.053
cessive+0.052
phal+0.051
angel+0.049
Neg logit contributions
 Ratio-0.041
 Spy-0.035
 Lerner-0.034
 UNCLASSIFIED-0.033
 DOI-0.033
Token Network
Token ID7311
Feature activation+6.33e-01

Pos logit contributions
 Ratio+2.849
 Spy+2.432
 Lerner+2.340
 DOI+2.309
 UNCLASSIFIED+2.303
Neg logit contributions
orld-3.742
cessive-3.654
iggins-3.640
phal-3.537
angel-3.387
Token,
Token ID11
Feature activation+1.37e+00

Pos logit contributions
 Ratio+0.437
 Spy+0.370
 Lerner+0.364
 UNCLASSIFIED+0.355
 DOI+0.351
Neg logit contributions
orld-0.536
iggins-0.519
cessive-0.515
phal-0.498
angel-0.480
Token appearing
Token ID12655
Feature activation+4.77e+00

Pos logit contributions
 Ratio+0.922
 Spy+0.781
 Lerner+0.765
 UNCLASSIFIED+0.745
 DOI+0.735
Neg logit contributions
orld-1.187
iggins-1.148
cessive-1.141
phal-1.105
angel-1.068
Token on
Token ID319
Feature activation+5.15e-01

Pos logit contributions
 Ratio+2.995
 Spy+2.570
 Lerner+2.500
 UNCLASSIFIED+2.425
 DOI+2.388
Neg logit contributions
orld-4.195
iggins-4.052
cessive-4.032
phal-3.959
idi-3.777
Token telev
Token ID22884
Feature activation-1.29e+01

Pos logit contributions
 Ratio+0.346
 Spy+0.291
 Lerner+0.286
 UNCLASSIFIED+0.279
 DOI+0.275
Neg logit contributions
orld-0.448
iggins-0.433
cessive-0.428
phal-0.415
angel-0.402
Tokenisions
Token ID3279
Feature activation-3.27e+00

Pos logit contributions
orld+7.595
angel+7.326
iggins+6.967
cessive+6.709
inent+6.469
Neg logit contributions
 Ratio-11.706
 Lerner-10.287
 Spy-10.196
 Scythe-9.912
 ã‚µãƒ¼ãƒ†ã‚£ãƒ¯ãƒ³-9.898
Token in
Token ID287
Feature activation+3.54e-01

Pos logit contributions
orld+2.732
iggins+2.629
cessive+2.588
phal+2.501
angel+2.459
Neg logit contributions
 Ratio-2.300
 Spy-1.926
 Lerner-1.912
 DOI-1.844
 UNCLASSIFIED-1.844
Token airports
Token ID22600
Feature activation+3.70e+00

Pos logit contributions
 Ratio+0.252
 Spy+0.215
 Lerner+0.212
 UNCLASSIFIED+0.206
 DOI+0.203
Neg logit contributions
orld-0.294
iggins-0.283
cessive-0.280
phal-0.271
angel-0.261
Token across
Token ID1973
Feature activation-1.17e+00

Pos logit contributions
 Ratio+2.316
 Spy+1.973
 Lerner+1.939
 UNCLASSIFIED+1.888
 DOI+1.870
Neg logit contributions
orld-3.287
cessive-3.213
iggins-3.207
phal-3.087
idi-2.962
Token the
Token ID262
Feature activation+3.92e-01

Pos logit contributions
orld+0.977
iggins+0.942
cessive+0.931
phal+0.899
angel+0.874
Neg logit contributions
 Ratio-0.821
 Spy-0.694
 Lerner-0.686
 UNCLASSIFIED-0.666
 DOI-0.660
Token3
Token ID18
Feature activation+2.18e+00

Pos logit contributions
 Ratio+1.286
 Spy+1.124
 Lerner+1.118
 UNCLASSIFIED+1.104
 DOI+1.099
Neg logit contributions
orld-1.131
iggins-1.085
cessive-1.061
phal-1.028
inent-0.973
Token.
Token ID13
Feature activation-2.24e+00

Pos logit contributions
 Ratio+1.531
 Spy+1.299
 Lerner+1.272
 UNCLASSIFIED+1.261
 DOI+1.242
Neg logit contributions
orld-1.830
iggins-1.759
cessive-1.732
phal-1.684
angel-1.625
Token Daniels
Token ID28162
Feature activation+1.73e+00

Pos logit contributions
orld+1.949
iggins+1.886
cessive+1.865
phal+1.805
angel+1.756
Neg logit contributions
 Ratio-1.488
 Spy-1.241
 Lerner-1.226
 UNCLASSIFIED-1.188
 DOI-1.174
Token�
Token ID447
Feature activation-1.05e+00

Pos logit contributions
 Ratio+1.237
 Spy+1.053
 Lerner+1.041
 UNCLASSIFIED+1.016
 DOI+1.005
Neg logit contributions
orld-1.437
iggins-1.377
cessive-1.364
phal-1.324
angel-1.272
Token�
Token ID247
Feature activation+1.39e+00

Pos logit contributions
orld+0.527
iggins+0.494
cessive+0.482
phal+0.455
angel+0.433
Neg logit contributions
 Ratio-1.089
 Spy-0.978
 Lerner-0.971
 UNCLASSIFIED-0.955
 DOI-0.947
Token Prom
Token ID10335
Feature activation-1.29e+01

Pos logit contributions
 Ratio+0.944
 Spy+0.798
 Lerner+0.781
 UNCLASSIFIED+0.766
 DOI+0.759
Neg logit contributions
orld-1.197
iggins-1.156
cessive-1.139
phal-1.106
angel-1.072
Tokeninent
Token ID7233
Feature activation-8.01e+00

Pos logit contributions
orld+8.375
inent+8.345
cessive+8.227
iggins+8.134
angel+7.806
Neg logit contributions
 Ratio-10.078
 Spy-8.593
 Lerner-8.512
 UNCLASSIFIED-8.451
 DOI-8.403
Token Family
Token ID7884
Feature activation+6.16e+00

Pos logit contributions
orld+6.772
iggins+6.634
cessive+6.611
phal+6.342
idi+6.235
Neg logit contributions
 Ratio-4.912
 Lerner-4.036
 Spy-3.982
 UNCLASSIFIED-3.898
geries-3.882
Token Ran
Token ID23075
Feature activation+5.82e+00

Pos logit contributions
 Ratio+3.960
 Spy+3.382
 Lerner+3.235
 UNCLASSIFIED+3.173
 Depression+3.117
Neg logit contributions
orld-5.593
iggins-5.356
cessive-5.318
phal-5.109
angel-4.938
Token a
Token ID257
Feature activation-6.03e-01

Pos logit contributions
 Ratio+3.809
 Spy+3.256
 Lerner+3.185
 Prevention+3.108
 Depression+3.080
Neg logit contributions
orld-5.087
iggins-4.905
cessive-4.746
phal-4.657
angel-4.553
Token Chain
Token ID21853
Feature activation-2.31e+00

Pos logit contributions
orld+0.548
iggins+0.531
cessive+0.526
phal+0.510
angel+0.495
Neg logit contributions
 Ratio-0.379
 Spy-0.313
 Lerner-0.309
 UNCLASSIFIED-0.300
 DOI-0.297
Token Al
Token ID978
Feature activation-7.42e+00

Pos logit contributions
orld+3.079
iggins+2.983
cessive+2.942
phal+2.848
idi+2.807
Neg logit contributions
 Ratio-2.228
 Spy-1.829
 Lerner-1.788
 DOI-1.748
 UNCLASSIFIED-1.747
Token Az
Token ID7578
Feature activation-3.85e+00

Pos logit contributions
orld+5.962
iggins+5.782
cessive+5.707
phal+5.518
idi+5.432
Neg logit contributions
 Ratio-5.241
 Spy-4.367
 DOI-4.247
 Depression-4.227
 Lerner-4.226
Tokenm
Token ID76
Feature activation-3.24e+00

Pos logit contributions
orld+2.827
iggins+2.736
cessive+2.697
phal+2.589
idi+2.532
Neg logit contributions
 Ratio-3.090
 Spy-2.647
 Lerner-2.611
 DOI-2.547
 UNCLASSIFIED-2.538
Token,
Token ID11
Feature activation-3.38e+00

Pos logit contributions
orld+2.498
iggins+2.408
cessive+2.361
phal+2.293
idi+2.247
Neg logit contributions
 Ratio-2.471
 Spy-2.109
 Lerner-2.081
 UNCLASSIFIED-2.020
 DOI-2.014
Token a
Token ID257
Feature activation-2.59e+00

Pos logit contributions
orld+2.894
iggins+2.818
cessive+2.762
phal+2.666
angel+2.602
Neg logit contributions
 Ratio-2.272
 Spy-1.886
 Lerner-1.870
 Depression-1.806
 UNCLASSIFIED-1.802
Token professor
Token ID6240
Feature activation-1.28e+01

Pos logit contributions
orld+2.174
iggins+2.108
cessive+2.080
phal+2.011
angel+1.951
Neg logit contributions
 Ratio-1.794
 Spy-1.504
 Lerner-1.489
 UNCLASSIFIED-1.442
 DOI-1.435
Token of
Token ID286
Feature activation-6.38e+00

Pos logit contributions
orld+7.844
iggins+7.321
phal+7.226
cessive+7.137
idi+7.111
Neg logit contributions
 Ratio-10.319
 Spy-8.818
 DOI-8.643
 Scythe-8.605
geries-8.599
Token Middle
Token ID6046
Feature activation-2.22e-01

Pos logit contributions
orld+4.885
iggins+4.699
cessive+4.574
phal+4.441
angel+4.312
Neg logit contributions
 Ratio-4.774
 Spy-4.005
 Lerner-3.979
 DOI-3.865
alty-3.852
Token East
Token ID3687
Feature activation+1.69e+00

Pos logit contributions
orld+0.191
iggins+0.184
cessive+0.182
phal+0.177
angel+0.172
Neg logit contributions
 Ratio-0.151
 Spy-0.127
 Lerner-0.125
 UNCLASSIFIED-0.121
 DOI-0.120
Token history
Token ID2106
Feature activation-2.88e+00

Pos logit contributions
 Ratio+1.214
 Spy+1.035
 Lerner+1.015
 UNCLASSIFIED+1.004
 DOI+0.984
Neg logit contributions
orld-1.375
iggins-1.333
cessive-1.326
phal-1.269
angel-1.227
Token at
Token ID379
Feature activation-4.33e+00

Pos logit contributions
orld+2.205
iggins+2.125
cessive+2.093
phal+2.035
angel+1.972
Neg logit contributions
 Ratio-2.205
 Spy-1.882
 Lerner-1.860
 UNCLASSIFIED-1.803
 DOI-1.797
Token
Token ID198
Feature activation-1.59e+00

Pos logit contributions
 Ratio+0.257
 Spy+0.216
 Lerner+0.213
 UNCLASSIFIED+0.207
 DOI+0.205
Neg logit contributions
orld-0.339
iggins-0.328
cessive-0.324
phal-0.314
angel-0.304
Token
Token ID198
Feature activation-1.78e-01

Pos logit contributions
orld+1.441
iggins+1.403
cessive+1.382
phal+1.331
idi+1.286
Neg logit contributions
 Ratio-1.021
 Spy-0.861
 Lerner-0.843
 UNCLASSIFIED-0.830
 DOI-0.822
Token@
Token ID31
Feature activation-4.33e+00

Pos logit contributions
orld+0.127
iggins+0.122
cessive+0.119
phal+0.115
idi+0.110
Neg logit contributions
 Ratio-0.147
 Spy-0.129
 Lerner-0.127
 UNCLASSIFIED-0.125
 DOI-0.123
TokenAnn
Token ID18858
Feature activation+1.71e+00

Pos logit contributions
orld+2.992
iggins+2.880
cessive+2.847
phal+2.733
angel+2.686
Neg logit contributions
 Ratio-3.658
 Spy-3.156
 Lerner-3.144
 DOI-3.044
 UNCLASSIFIED-3.043
TokenC
Token ID34
Feature activation-1.89e+00

Pos logit contributions
 Ratio+1.330
 Spy+1.154
 Lerner+1.141
 UNCLASSIFIED+1.120
 DOI+1.097
Neg logit contributions
orld-1.295
iggins-1.243
cessive-1.226
phal-1.188
idi-1.128
Tokenoul
Token ID2852
Feature activation-1.28e+01

Pos logit contributions
orld+1.438
iggins+1.385
cessive+1.359
phal+1.310
angel+1.280
Neg logit contributions
 Ratio-1.478
 Spy-1.273
 Lerner-1.253
 DOI-1.223
 Depression-1.219
Tokenter
Token ID353
Feature activation-9.27e-01

Pos logit contributions
orld+7.789
iggins+7.776
cessive+7.647
phal+7.342
idi+7.311
Neg logit contributions
 Ratio-10.826
 Spy-9.289
 Lerner-9.155
 Scythe-9.146
 Depression-9.091
Token HAVE
Token ID21515
Feature activation-7.77e-01

Pos logit contributions
orld+0.747
iggins+0.720
cessive+0.711
phal+0.685
angel+0.665
Neg logit contributions
 Ratio-0.682
 Spy-0.580
 Lerner-0.574
 UNCLASSIFIED-0.558
 DOI-0.555
Token YOU
Token ID7013
Feature activation+3.07e+00

Pos logit contributions
orld+0.675
iggins+0.652
cessive+0.645
phal+0.623
angel+0.605
Neg logit contributions
 Ratio-0.522
 Spy-0.436
 Lerner-0.430
 UNCLASSIFIED-0.416
 DOI-0.413
Token SE
Token ID7946
Feature activation+4.12e-01

Pos logit contributions
 Ratio+2.134
 Spy+1.834
 Lerner+1.789
 UNCLASSIFIED+1.786
 DOI+1.748
Neg logit contributions
orld-2.610
iggins-2.504
cessive-2.480
phal-2.393
angel-2.316
TokenEN
Token ID1677
Feature activation+3.70e+00

Pos logit contributions
 Ratio+0.398
 Spy+0.353
 Lerner+0.350
 UNCLASSIFIED+0.344
 DOI+0.341
Neg logit contributions
orld-0.237
iggins-0.225
cessive-0.221
phal-0.209
angel-0.200
Token W
Token ID370
Feature activation-1.04e+00

Pos logit contributions
 Ratio+2.070
 Spy+1.625
 Lerner+1.564
 UNCLASSIFIED+1.525
 DOI+1.495
Neg logit contributions
orld-4.161
iggins-4.054
cessive-3.973
phal-3.893
angel-3.793
Token
Token ID198
Feature activation-6.26e-01

Pos logit contributions
orld+0.958
iggins+0.928
cessive+0.920
phal+0.890
angel+0.867
Neg logit contributions
 Ratio-0.639
 Spy-0.529
 Lerner-0.525
 UNCLASSIFIED-0.506
 DOI-0.501
Token
Token ID198
Feature activation+7.43e-01

Pos logit contributions
orld+0.571
iggins+0.556
cessive+0.548
phal+0.528
idi+0.509
Neg logit contributions
 Ratio-0.401
 Spy-0.332
 Lerner-0.326
 UNCLASSIFIED-0.320
 DOI-0.317
TokenThe
Token ID464
Feature activation-1.27e+00

Pos logit contributions
 Ratio+0.614
 Spy+0.534
 Lerner+0.527
 UNCLASSIFIED+0.515
 DOI+0.512
Neg logit contributions
orld-0.529
iggins-0.509
cessive-0.499
phal-0.480
angel-0.457
Token Nvidia
Token ID27699
Feature activation-4.04e+00

Pos logit contributions
orld+1.116
iggins+1.078
cessive+1.068
phal+1.032
angel+1.003
Neg logit contributions
 Ratio-0.839
 Spy-0.701
 Lerner-0.694
 UNCLASSIFIED-0.678
 DOI-0.666
Token GTX
Token ID12646
Feature activation-1.27e+01

Pos logit contributions
orld+3.321
iggins+3.186
cessive+3.168
phal+3.052
angel+2.979
Neg logit contributions
 Ratio-2.788
 Spy-2.350
 Lerner-2.331
 UNCLASSIFIED-2.319
geries-2.245
Token1080
Token ID24045
Feature activation-1.00e+01

Pos logit contributions
orld+9.353
iggins+9.066
cessive+8.817
angel+8.519
phal+8.345
Neg logit contributions
 Ratio-8.497
 Lerner-7.426
geries-7.355
 UNCLASSIFIED-7.344
alty-7.292
Token ships
Token ID7937
Feature activation-1.51e-01

Pos logit contributions
orld+7.489
iggins+7.122
cessive+7.061
angel+6.823
phal+6.719
Neg logit contributions
 Ratio-7.101
 Lerner-6.116
 UNCLASSIFIED-6.060
 Spy-5.903
alty-5.896
Token with
Token ID351
Feature activation-4.52e-01

Pos logit contributions
orld+0.132
iggins+0.127
cessive+0.126
phal+0.122
angel+0.118
Neg logit contributions
 Ratio-0.100
 Spy-0.084
 Lerner-0.083
 UNCLASSIFIED-0.080
 DOI-0.080
Token 25
Token ID1679
Feature activation+1.53e+00

Pos logit contributions
orld+0.395
iggins+0.382
cessive+0.378
phal+0.366
angel+0.355
Neg logit contributions
 Ratio-0.302
 Spy-0.253
 Lerner-0.249
 UNCLASSIFIED-0.242
 DOI-0.240
Token60
Token ID1899
Feature activation+6.79e-01

Pos logit contributions
 Ratio+1.273
 Spy+1.108
 Lerner+1.093
 UNCLASSIFIED+1.067
 DOI+1.064
Neg logit contributions
orld-1.095
iggins-1.058
cessive-1.032
phal-0.992
angel-0.948
Token following
Token ID1708
Feature activation-1.79e+00

Pos logit contributions
orld+1.786
iggins+1.724
cessive+1.699
phal+1.641
angel+1.590
Neg logit contributions
 Ratio-1.522
 Spy-1.284
 Lerner-1.272
 UNCLASSIFIED-1.237
 DOI-1.228
Token his
Token ID465
Feature activation-1.12e+00

Pos logit contributions
orld+1.575
iggins+1.525
cessive+1.505
phal+1.451
angel+1.410
Neg logit contributions
 Ratio-1.167
 Spy-0.968
 Lerner-0.959
 UNCLASSIFIED-0.937
 DOI-0.928
Token junior
Token ID13430
Feature activation-7.10e+00

Pos logit contributions
orld+0.934
iggins+0.901
cessive+0.891
phal+0.861
angel+0.836
Neg logit contributions
 Ratio-0.790
 Spy-0.671
 Lerner-0.661
 UNCLASSIFIED-0.643
 DOI-0.638
Token season
Token ID1622
Feature activation-2.55e+00

Pos logit contributions
orld+5.019
iggins+4.837
cessive+4.740
phal+4.530
idi+4.414
Neg logit contributions
 Ratio-5.741
 Lerner-4.951
 Spy-4.904
 UNCLASSIFIED-4.789
 DOI-4.766
Token.[
Token ID3693
Feature activation-2.43e+00

Pos logit contributions
orld+2.069
iggins+1.993
cessive+1.968
phal+1.897
angel+1.839
Neg logit contributions
 Ratio-1.863
 Spy-1.578
 Lerner-1.567
 UNCLASSIFIED-1.521
 DOI-1.509
Token90
Token ID3829
Feature activation-1.27e+01

Pos logit contributions
orld+1.583
iggins+1.513
cessive+1.491
phal+1.432
angel+1.378
Neg logit contributions
 Ratio-2.158
 Spy-1.893
 Lerner-1.879
 UNCLASSIFIED-1.834
 DOI-1.814
Token]
Token ID60
Feature activation-3.00e+00

Pos logit contributions
orld+4.320
cessive+4.007
iggins+3.930
phal+3.732
angel+3.478
Neg logit contributions
 Ratio-13.347
 Lerner-12.176
 Spy-12.042
 UNCLASSIFIED-11.865
geries-11.857
Token
Token ID198
Feature activation-3.47e+00

Pos logit contributions
orld+2.550
iggins+2.455
cessive+2.428
phal+2.354
angel+2.291
Neg logit contributions
 Ratio-2.003
 Spy-1.673
 Lerner-1.654
 UNCLASSIFIED-1.612
 DOI-1.604
Token
Token ID198
Feature activation-2.43e+00

Pos logit contributions
orld+2.967
iggins+2.867
cessive+2.833
phal+2.735
angel+2.651
Neg logit contributions
 Ratio-2.384
 Spy-2.010
 Lerner-1.982
 UNCLASSIFIED-1.929
 DOI-1.912
TokenCollege
Token ID38951
Feature activation-8.70e+00

Pos logit contributions
orld+1.708
iggins+1.636
cessive+1.617
phal+1.547
angel+1.499
Neg logit contributions
 Ratio-2.041
 Spy-1.772
 Lerner-1.756
 UNCLASSIFIED-1.715
 DOI-1.705
Token career
Token ID3451
Feature activation+1.03e+00

Pos logit contributions
orld+6.973
iggins+6.703
cessive+6.569
phal+6.241
idi+6.098
Neg logit contributions
 Ratio-5.854
 Lerner-5.028
 Spy-4.911
 Scythe-4.855
 DOI-4.855
Token the
Token ID262
Feature activation-3.32e+00

Pos logit contributions
 Ratio+1.515
 Spy+1.268
 Lerner+1.209
 DOI+1.203
 UNCLASSIFIED+1.198
Neg logit contributions
orld-2.555
iggins-2.493
cessive-2.471
phal-2.369
inent-2.356
Token box
Token ID3091
Feature activation-5.02e+00

Pos logit contributions
orld+2.567
iggins+2.487
cessive+2.458
phal+2.349
idi+2.288
Neg logit contributions
 Ratio-2.551
 Spy-2.222
 Lerner-2.170
 UNCLASSIFIED-2.138
 DOI-2.117
Token.
Token ID13
Feature activation-6.38e+00

Pos logit contributions
orld+4.046
iggins+3.897
cessive+3.851
phal+3.716
angel+3.600
Neg logit contributions
 Ratio-3.686
 Spy-3.137
 Lerner-3.101
 UNCLASSIFIED-3.015
 DOI-2.993
Token Invalid
Token ID17665
Feature activation-6.45e+00

Pos logit contributions
orld+4.358
iggins+4.153
cessive+4.081
phal+3.951
angel+3.805
Neg logit contributions
 Ratio-5.350
 Spy-4.657
 Lerner-4.645
 UNCLASSIFIED-4.496
 DOI-4.485
Token email
Token ID3053
Feature activation-5.47e+00

Pos logit contributions
orld+5.124
iggins+4.963
cessive+4.869
phal+4.717
angel+4.586
Neg logit contributions
 Ratio-4.723
 Spy-4.004
 Lerner-3.995
 UNCLASSIFIED-3.847
 DOI-3.814
Token address
Token ID2209
Feature activation-1.26e+01

Pos logit contributions
orld+5.657
iggins+5.502
cessive+5.428
phal+5.302
angel+5.189
Neg logit contributions
 Ratio-2.718
 Spy-2.098
 Lerner-2.085
 UNCLASSIFIED-1.945
 DOI-1.942
Token.
Token ID13
Feature activation-5.80e+00

Pos logit contributions
orld+7.894
iggins+7.397
phal+7.306
cessive+7.291
angel+7.065
Neg logit contributions
 Ratio-10.246
 Lerner-8.799
geries-8.690
 seiz-8.616
 Spy-8.600
Token Please
Token ID4222
Feature activation-1.09e+01

Pos logit contributions
orld+4.763
iggins+4.599
cessive+4.540
phal+4.385
angel+4.256
Neg logit contributions
 Ratio-4.146
 Spy-3.513
 Lerner-3.481
 UNCLASSIFIED-3.375
 DOI-3.348
Token re
Token ID302
Feature activation-3.08e+00

Pos logit contributions
orld+5.777
iggins+5.617
cessive+5.396
phal+5.145
idi+4.844
Neg logit contributions
 Ratio-10.560
 Spy-9.357
 Lerner-9.306
 Depression-9.072
 Scythe-8.986
Token-
Token ID12
Feature activation+1.57e+00

Pos logit contributions
orld+2.695
iggins+2.606
cessive+2.559
phal+2.468
idi+2.414
Neg logit contributions
 Ratio-2.068
 Spy-1.753
 Lerner-1.736
 UNCLASSIFIED-1.692
 DOI-1.662
Tokenenter
Token ID9255
Feature activation+5.30e-02

Pos logit contributions
 Ratio+1.618
 Lerner+1.490
 UNCLASSIFIED+1.478
 Spy+1.476
alty+1.446
Neg logit contributions
orld-0.756
iggins-0.712
cessive-0.671
phal-0.620
idi-0.590
Token 6
Token ID718
Feature activation+5.65e-01

Pos logit contributions
orld+1.997
iggins+1.928
cessive+1.905
phal+1.841
angel+1.787
Neg logit contributions
 Ratio-1.679
 Spy-1.416
 Lerner-1.398
 UNCLASSIFIED-1.360
 DOI-1.348
Token and
Token ID290
Feature activation-2.43e-01

Pos logit contributions
 Ratio+0.396
 Spy+0.335
 Lerner+0.331
 UNCLASSIFIED+0.321
 DOI+0.318
Neg logit contributions
orld-0.475
iggins-0.458
cessive-0.453
phal-0.438
angel-0.425
Token 7
Token ID767
Feature activation+1.05e+00

Pos logit contributions
orld+0.215
iggins+0.209
cessive+0.206
phal+0.199
idi+0.193
Neg logit contributions
 Ratio-0.161
 Spy-0.135
 Lerner-0.133
 UNCLASSIFIED-0.129
 DOI-0.128
Token GW
Token ID27164
Feature activation-1.93e+00

Pos logit contributions
 Ratio+0.714
 Spy+0.599
 Lerner+0.592
 UNCLASSIFIED+0.574
 DOI+0.569
Neg logit contributions
orld-0.905
iggins-0.874
cessive-0.864
phal-0.836
angel-0.811
Token of
Token ID286
Feature activation-1.31e+00

Pos logit contributions
orld+1.513
iggins+1.462
cessive+1.427
phal+1.392
angel+1.349
Neg logit contributions
 Ratio-1.463
 Spy-1.242
 Lerner-1.231
 UNCLASSIFIED-1.200
 Depression-1.192
Token solar
Token ID6591
Feature activation-1.25e+01

Pos logit contributions
orld+1.326
iggins+1.290
cessive+1.276
phal+1.242
angel+1.211
Neg logit contributions
 Ratio-0.694
 Spy-0.548
 Lerner-0.539
 UNCLASSIFIED-0.517
 DOI-0.508
Token generating
Token ID15453
Feature activation-1.06e+01

Pos logit contributions
orld+8.433
iggins+8.208
cessive+8.065
phal+8.046
angel+7.790
Neg logit contributions
 Ratio-9.465
 Spy-8.068
 Lerner-7.975
geries-7.852
 Scythe-7.832
Token capacity
Token ID5339
Feature activation-3.40e+00

Pos logit contributions
orld+6.992
iggins+6.928
phal+6.713
cessive+6.710
izations+6.366
Neg logit contributions
 Ratio-8.273
 Spy-7.023
 Lerner-6.885
geries-6.729
 Scythe-6.723
Token.
Token ID13
Feature activation+1.43e+00

Pos logit contributions
orld+2.722
iggins+2.616
cessive+2.569
phal+2.506
angel+2.431
Neg logit contributions
 Ratio-2.505
 Spy-2.140
 Lerner-2.109
 UNCLASSIFIED-2.040
 Depression-2.034
Token Off
Token ID3242
Feature activation-1.59e+00

Pos logit contributions
 Ratio+0.977
 Spy+0.819
 Lerner+0.807
 UNCLASSIFIED+0.785
 DOI+0.778
Neg logit contributions
orld-1.270
iggins-1.234
cessive-1.215
phal-1.163
inent-1.136
Tokenshore
Token ID14640
Feature activation+4.13e+00

Pos logit contributions
orld+0.909
iggins+0.866
cessive+0.852
phal+0.812
angel+0.778
Neg logit contributions
 Ratio-1.535
 Spy-1.367
 Lerner-1.353
 UNCLASSIFIED-1.327
 DOI-1.319
Token young
Token ID1862
Feature activation-7.93e-01

Pos logit contributions
orld+0.246
iggins+0.237
cessive+0.234
phal+0.224
angel+0.218
Neg logit contributions
 Ratio-0.215
 Spy-0.183
 Lerner-0.181
 UNCLASSIFIED-0.177
 DOI-0.175
Token evangelicals
Token ID44881
Feature activation-5.35e+00

Pos logit contributions
orld+0.679
iggins+0.656
cessive+0.648
phal+0.627
angel+0.609
Neg logit contributions
 Ratio-0.544
 Spy-0.458
 Lerner-0.451
 UNCLASSIFIED-0.438
 DOI-0.435
Token are
Token ID389
Feature activation-3.10e+00

Pos logit contributions
orld+4.626
iggins+4.445
cessive+4.381
phal+4.281
angel+4.227
Neg logit contributions
 Ratio-3.540
 Spy-2.943
 Lerner-2.828
 DOI-2.761
 Depression-2.723
Token abandoning
Token ID31309
Feature activation-1.77e+00

Pos logit contributions
orld+2.522
iggins+2.429
cessive+2.402
phal+2.322
angel+2.254
Neg logit contributions
 Ratio-2.253
 Spy-1.908
 Lerner-1.882
 UNCLASSIFIED-1.827
 DOI-1.821
Token their
Token ID511
Feature activation-4.01e+00

Pos logit contributions
orld+1.437
iggins+1.386
cessive+1.370
phal+1.321
angel+1.282
Neg logit contributions
 Ratio-1.292
 Spy-1.099
 Lerner-1.085
 UNCLASSIFIED-1.056
 DOI-1.049
Token faith
Token ID4562
Feature activation-1.23e+01

Pos logit contributions
orld+3.081
iggins+2.967
cessive+2.925
phal+2.803
angel+2.745
Neg logit contributions
 Ratio-3.060
 Spy-2.624
 Lerner-2.603
 DOI-2.512
 UNCLASSIFIED-2.509
Token,
Token ID11
Feature activation-3.29e+00

Pos logit contributions
orld+8.599
iggins+8.137
phal+7.914
cessive+7.800
izations+7.636
Neg logit contributions
 Ratio-9.500
 Spy-8.124
 Lerner-7.906
 Prevention-7.677
 DOI-7.509
Token but
Token ID475
Feature activation-3.29e+00

Pos logit contributions
orld+2.601
iggins+2.509
cessive+2.476
phal+2.396
angel+2.330
Neg logit contributions
 Ratio-2.443
 Spy-2.075
 Lerner-2.052
 DOI-1.984
 UNCLASSIFIED-1.983
Token that
Token ID326
Feature activation+3.41e-01

Pos logit contributions
orld+2.745
iggins+2.649
cessive+2.627
phal+2.537
angel+2.474
Neg logit contributions
 Ratio-2.296
 Spy-1.936
 Lerner-1.908
 DOI-1.849
 UNCLASSIFIED-1.842
Token they
Token ID484
Feature activation+1.49e-01

Pos logit contributions
 Ratio+0.241
 Spy+0.203
 Lerner+0.202
 UNCLASSIFIED+0.197
 Depression+0.194
Neg logit contributions
orld-0.286
iggins-0.276
cessive-0.272
phal-0.263
angel-0.253
Token are
Token ID389
Feature activation-3.31e+00

Pos logit contributions
 Ratio+0.091
 Spy+0.075
 Lerner+0.074
 UNCLASSIFIED+0.072
 DOI+0.071
Neg logit contributions
orld-0.139
iggins-0.135
cessive-0.133
phal-0.129
angel-0.126
Token as
Token ID355
Feature activation-1.47e+00

Pos logit contributions
orld+3.744
iggins+3.616
cessive+3.563
phal+3.439
idi+3.358
Neg logit contributions
 Ratio-3.545
 Spy-3.005
 Lerner-2.956
 DOI-2.877
 UNCLASSIFIED-2.870
Token well
Token ID880
Feature activation-3.60e+00

Pos logit contributions
orld+1.365
iggins+1.327
cessive+1.315
phal+1.275
angel+1.239
Neg logit contributions
 Ratio-0.893
 Spy-0.734
 Lerner-0.719
 UNCLASSIFIED-0.696
 DOI-0.692
Token as
Token ID355
Feature activation+1.03e+00

Pos logit contributions
orld+2.636
iggins+2.575
cessive+2.526
phal+2.416
angel+2.348
Neg logit contributions
 Ratio-2.841
 Spy-2.434
 Lerner-2.366
 UNCLASSIFIED-2.330
 DOI-2.315
Token funding
Token ID4918
Feature activation-4.37e+00

Pos logit contributions
 Ratio+0.732
 Spy+0.620
 Lerner+0.612
 UNCLASSIFIED+0.597
 DOI+0.592
Neg logit contributions
orld-0.868
iggins-0.832
cessive-0.822
phal-0.795
angel-0.772
Token for
Token ID329
Feature activation-2.24e+00

Pos logit contributions
orld+3.492
iggins+3.431
cessive+3.346
phal+3.257
idi+3.163
Neg logit contributions
 Ratio-3.119
 Spy-2.622
 Lerner-2.601
 UNCLASSIFIED-2.516
 DOI-2.515
Token remed
Token ID20100
Feature activation-1.23e+01

Pos logit contributions
orld+1.834
iggins+1.778
cessive+1.748
phal+1.689
angel+1.641
Neg logit contributions
 Ratio-1.596
 Spy-1.351
 Lerner-1.336
 UNCLASSIFIED-1.301
 DOI-1.287
Tokenial
Token ID498
Feature activation-7.19e+00

Pos logit contributions
orld+6.592
iggins+6.495
cessive+6.297
angel+6.172
phal+5.839
Neg logit contributions
 Ratio-11.226
 Spy-10.047
 Lerner-10.042
 UNCLASSIFIED-9.752
 Scythe-9.747
Token courses
Token ID10902
Feature activation-7.99e+00

Pos logit contributions
orld+5.341
iggins+5.165
cessive+5.045
phal+4.874
angel+4.775
Neg logit contributions
 Ratio-5.554
 Spy-4.709
 Lerner-4.655
 UNCLASSIFIED-4.595
 Scythe-4.502
Token used
Token ID973
Feature activation+7.53e-01

Pos logit contributions
orld+5.995
iggins+5.840
cessive+5.682
phal+5.561
angel+5.443
Neg logit contributions
 Ratio-5.840
 Spy-4.946
 Lerner-4.754
 DOI-4.743
 UNCLASSIFIED-4.717
Token to
Token ID284
Feature activation-8.59e-01

Pos logit contributions
 Ratio+0.507
 Spy+0.428
 Lerner+0.423
 UNCLASSIFIED+0.413
 DOI+0.407
Neg logit contributions
orld-0.652
iggins-0.630
cessive-0.623
phal-0.602
angel-0.584
Token prepare
Token ID8335
Feature activation-3.84e-01

Pos logit contributions
orld+0.723
iggins+0.696
cessive+0.690
phal+0.666
angel+0.646
Neg logit contributions
 Ratio-0.604
 Spy-0.512
 Lerner-0.504
 UNCLASSIFIED-0.494
 DOI-0.488
Token:
Token ID25
Feature activation-3.83e+00

Pos logit contributions
orld+2.328
iggins+2.259
cessive+2.227
phal+2.140
angel+2.071
Neg logit contributions
 Ratio-1.912
 Spy-1.628
 Lerner-1.598
 UNCLASSIFIED-1.566
 DOI-1.549
Token47
Token ID2857
Feature activation-2.83e-01

Pos logit contributions
orld+2.299
iggins+2.197
cessive+2.148
idi+2.015
phal+2.013
Neg logit contributions
 Ratio-3.572
 Lerner-3.104
 Spy-3.062
 UNCLASSIFIED-3.029
 Prevention-3.023
Token]
Token ID60
Feature activation-1.99e+00

Pos logit contributions
orld+0.168
iggins+0.163
cessive+0.158
phal+0.146
idi+0.141
Neg logit contributions
 Ratio-0.269
 Spy-0.241
 UNCLASSIFIED-0.238
 Lerner-0.238
 DOI-0.236
Token <
Token ID1279
Feature activation-2.20e+00

Pos logit contributions
orld+1.474
iggins+1.422
cessive+1.410
phal+1.359
angel+1.304
Neg logit contributions
 Ratio-1.566
 Lerner-1.342
 Spy-1.333
 Depression-1.275
 DOI-1.273
Tokenc
Token ID66
Feature activation+1.16e+00

Pos logit contributions
orld+1.627
iggins+1.561
cessive+1.545
phal+1.490
angel+1.440
Neg logit contributions
 Ratio-1.763
 Spy-1.523
 Lerner-1.508
 UNCLASSIFIED-1.471
 DOI-1.462
Tokenerv
Token ID712
Feature activation-1.23e+01

Pos logit contributions
 Ratio+0.912
 Spy+0.787
 Lerner+0.778
 UNCLASSIFIED+0.763
 DOI+0.753
Neg logit contributions
orld-0.865
iggins-0.830
cessive-0.821
phal-0.786
angel-0.756
Tokenant
Token ID415
Feature activation-7.07e+00

Pos logit contributions
orld+8.398
iggins+8.162
cessive+8.043
angel+7.871
phal+7.748
Neg logit contributions
 Ratio-9.553
 Spy-8.237
 Lerner-8.065
 Depression-8.046
 Prevention-7.910
Tokenas
Token ID292
Feature activation-1.37e+00

Pos logit contributions
orld+5.301
iggins+5.114
cessive+5.052
phal+4.931
angel+4.854
Neg logit contributions
 Ratio-5.502
 Spy-4.738
 Lerner-4.681
 DOI-4.532
 UNCLASSIFIED-4.529
Token>
Token ID29
Feature activation-3.53e+00

Pos logit contributions
orld+0.995
iggins+0.954
cessive+0.942
phal+0.907
angel+0.875
Neg logit contributions
 Ratio-1.126
 Spy-0.977
 Lerner-0.966
 UNCLASSIFIED-0.943
 DOI-0.938
Token lol
Token ID19462
Feature activation-9.46e-01

Pos logit contributions
orld+2.955
iggins+2.845
cessive+2.833
phal+2.737
angel+2.649
Neg logit contributions
 Ratio-2.420
 Lerner-2.030
 Spy-2.026
 UNCLASSIFIED-1.940
 DOI-1.935
Token m
Token ID285
Feature activation+6.12e-01

Pos logit contributions
orld+0.731
iggins+0.701
cessive+0.694
phal+0.669
angel+0.648
Neg logit contributions
 Ratio-0.727
 Spy-0.623
 Lerner-0.618
 UNCLASSIFIED-0.599
 DOI-0.596
Token million
Token ID1510
Feature activation+5.46e-01

Pos logit contributions
 Ratio+1.437
 Spy+1.207
 Lerner+1.186
 UNCLASSIFIED+1.151
 DOI+1.147
Neg logit contributions
orld-1.899
iggins-1.832
cessive-1.811
phal-1.756
angel-1.706
Token deficit
Token ID11807
Feature activation+2.86e+00

Pos logit contributions
 Ratio+0.364
 Spy+0.305
 Lerner+0.301
 UNCLASSIFIED+0.290
 DOI+0.288
Neg logit contributions
orld-0.481
iggins-0.462
cessive-0.456
phal-0.445
angel-0.431
Token,
Token ID11
Feature activation-1.30e+00

Pos logit contributions
 Ratio+1.904
 Spy+1.599
 Lerner+1.576
 UNCLASSIFIED+1.535
 DOI+1.514
Neg logit contributions
orld-2.477
iggins-2.399
cessive-2.368
phal-2.285
angel-2.220
Token cut
Token ID2005
Feature activation+1.71e+00

Pos logit contributions
orld+1.080
iggins+1.042
cessive+1.030
phal+0.993
angel+0.966
Neg logit contributions
 Ratio-0.928
 Spy-0.791
 Lerner-0.781
 UNCLASSIFIED-0.760
 DOI-0.752
Token its
Token ID663
Feature activation-1.57e+00

Pos logit contributions
 Ratio+1.163
 Spy+0.978
 Lerner+0.978
 DOI+0.933
 UNCLASSIFIED+0.931
Neg logit contributions
orld-1.476
iggins-1.424
cessive-1.403
phal-1.353
angel-1.323
Token athletics
Token ID35691
Feature activation-1.20e+01

Pos logit contributions
orld+1.284
iggins+1.237
cessive+1.222
phal+1.180
angel+1.143
Neg logit contributions
 Ratio-1.137
 Spy-0.965
 Lerner-0.955
 UNCLASSIFIED-0.926
 DOI-0.921
Token budget
Token ID4466
Feature activation-1.59e+00

Pos logit contributions
orld+8.688
iggins+8.321
cessive+8.226
phal+7.840
angel+7.655
Neg logit contributions
 Ratio-8.658
 Lerner-7.353
 Spy-7.311
 UNCLASSIFIED-7.262
 Scythe-7.122
Token.
Token ID13
Feature activation-1.39e+00

Pos logit contributions
orld+1.333
iggins+1.286
cessive+1.272
phal+1.228
angel+1.191
Neg logit contributions
 Ratio-1.122
 Spy-0.949
 Lerner-0.937
 UNCLASSIFIED-0.911
 DOI-0.903
Token It
Token ID632
Feature activation-2.73e-01

Pos logit contributions
orld+1.192
iggins+1.151
cessive+1.139
phal+1.101
angel+1.069
Neg logit contributions
 Ratio-0.945
 Spy-0.795
 Lerner-0.784
 UNCLASSIFIED-0.762
 DOI-0.755
Token is
Token ID318
Feature activation-2.11e+00

Pos logit contributions
orld+0.222
iggins+0.214
cessive+0.211
phal+0.204
angel+0.197
Neg logit contributions
 Ratio-0.199
 Spy-0.170
 Lerner-0.168
 UNCLASSIFIED-0.164
 DOI-0.161
Token a
Token ID257
Feature activation+2.06e+00

Pos logit contributions
orld+1.774
iggins+1.715
cessive+1.700
phal+1.641
angel+1.589
Neg logit contributions
 Ratio-1.465
 Spy-1.232
 Lerner-1.210
 UNCLASSIFIED-1.178
 DOI-1.177
Token,
Token ID11
Feature activation+1.55e+00

Pos logit contributions
 Ratio+1.523
 Spy+1.276
 Lerner+1.256
 UNCLASSIFIED+1.216
 DOI+1.205
Neg logit contributions
orld-2.036
iggins-1.974
cessive-1.948
phal-1.887
angel-1.836
Token the
Token ID262
Feature activation-1.04e+00

Pos logit contributions
 Ratio+1.046
 Spy+0.885
 Lerner+0.872
 UNCLASSIFIED+0.841
 Depression+0.833
Neg logit contributions
orld-1.341
iggins-1.299
cessive-1.278
phal-1.244
angel-1.210
Token main
Token ID1388
Feature activation+1.59e-01

Pos logit contributions
orld+0.869
iggins+0.839
cessive+0.830
phal+0.799
angel+0.776
Neg logit contributions
 Ratio-0.738
 Spy-0.624
 Lerner-0.617
 UNCLASSIFIED-0.600
 DOI-0.595
Token point
Token ID966
Feature activation+2.82e+00

Pos logit contributions
 Ratio+0.105
 Spy+0.088
 Lerner+0.087
 UNCLASSIFIED+0.084
 DOI+0.084
Neg logit contributions
orld-0.140
iggins-0.135
cessive-0.134
phal-0.130
angel-0.126
Token of
Token ID286
Feature activation+8.06e-01

Pos logit contributions
 Ratio+1.956
 Spy+1.667
 Lerner+1.654
 UNCLASSIFIED+1.599
 DOI+1.565
Neg logit contributions
orld-2.377
iggins-2.288
cessive-2.273
phal-2.182
angel-2.131
Token Gene
Token ID13005
Feature activation+8.79e+00

Pos logit contributions
 Ratio+0.526
 Spy+0.440
 Lerner+0.434
 UNCLASSIFIED+0.419
 DOI+0.416
Neg logit contributions
orld-0.716
iggins-0.692
cessive-0.684
phal-0.664
angel-0.645
Tokenalogy
Token ID48909
Feature activation+3.50e+00

Pos logit contributions
 Ratio+5.643
 Lerner+4.686
 Spy+4.613
 UNCLASSIFIED+4.366
 Depression+4.360
Neg logit contributions
orld-7.563
iggins-7.351
cessive-7.113
phal-6.809
angel-6.640
Token�
Token ID447
Feature activation+2.82e-02

Pos logit contributions
 Ratio+2.530
 Spy+2.146
 Lerner+2.120
 UNCLASSIFIED+2.060
 DOI+2.045
Neg logit contributions
orld-2.871
iggins-2.771
cessive-2.741
phal-2.637
angel-2.558
Token�
Token ID247
Feature activation-3.06e-01

Pos logit contributions
 Ratio+0.027
 Spy+0.024
 Lerner+0.024
 UNCLASSIFIED+0.023
 Depression+0.023
Neg logit contributions
orld-0.017
iggins-0.016
cessive-0.016
phal-0.015
idi-0.014
Tokens
Token ID82
Feature activation+2.12e+00

Pos logit contributions
orld+0.217
iggins+0.208
cessive+0.204
phal+0.197
angel+0.190
Neg logit contributions
 Ratio-0.256
 Spy-0.222
 Lerner-0.219
 UNCLASSIFIED-0.215
 DOI-0.214
Token first
Token ID717
Feature activation-1.88e+00

Pos logit contributions
 Ratio+1.433
 Spy+1.201
 Lerner+1.188
 UNCLASSIFIED+1.150
 DOI+1.141
Neg logit contributions
orld-1.832
iggins-1.764
cessive-1.748
phal-1.691
angel-1.641
Token Shadow
Token ID8843
Feature activation-2.63e+00

Pos logit contributions
orld+3.495
cessive+3.361
iggins+3.352
phal+3.206
angel+3.139
Neg logit contributions
 Ratio-3.125
 Lerner-2.646
 Spy-2.615
 UNCLASSIFIED-2.565
 DOI-2.531
Token Demon
Token ID7814
Feature activation-3.19e+00

Pos logit contributions
orld+2.245
iggins+2.178
cessive+2.160
phal+2.103
angel+2.063
Neg logit contributions
 Ratio-1.747
 Lerner-1.464
 Spy-1.443
 DOI-1.426
 UNCLASSIFIED-1.415
Token instead
Token ID2427
Feature activation-2.26e-02

Pos logit contributions
orld+2.577
iggins+2.484
cessive+2.451
phal+2.367
angel+2.307
Neg logit contributions
 Ratio-2.309
 Lerner-1.954
 Spy-1.947
 UNCLASSIFIED-1.898
 DOI-1.895
Token (
Token ID357
Feature activation+4.31e+00

Pos logit contributions
orld+0.019
iggins+0.018
cessive+0.018
phal+0.017
angel+0.017
Neg logit contributions
 Ratio-0.016
 Spy-0.014
 Lerner-0.014
 UNCLASSIFIED-0.013
 DOI-0.013
Tokenwhere
Token ID3003
Feature activation+4.38e+00

Pos logit contributions
 Ratio+3.177
 Spy+2.788
 Lerner+2.730
 UNCLASSIFIED+2.643
 Scythe+2.617
Neg logit contributions
orld-3.343
iggins-3.229
cessive-3.175
phal-3.030
inent-2.917
Tokenas
Token ID292
Feature activation+8.38e+00

Pos logit contributions
 Ratio+2.852
 Spy+2.470
 Lerner+2.348
 Depression+2.291
 UNCLASSIFIED+2.283
Neg logit contributions
orld-3.852
iggins-3.706
cessive-3.665
phal-3.521
inent-3.436
Token originally
Token ID6198
Feature activation+4.98e+00

Pos logit contributions
 Ratio+5.335
 Spy+4.702
 Lerner+4.419
 Scythe+4.378
 Gou+4.302
Neg logit contributions
orld-7.393
iggins-7.112
cessive-6.979
phal-6.699
inent-6.617
Token it
Token ID340
Feature activation+1.63e+00

Pos logit contributions
 Ratio+3.198
 Spy+2.775
 Lerner+2.681
 UNCLASSIFIED+2.602
 Gou+2.544
Neg logit contributions
orld-4.426
iggins-4.328
cessive-4.200
phal-4.021
inent-3.929
Token was
Token ID373
Feature activation-1.79e+00

Pos logit contributions
 Ratio+1.041
 Spy+0.874
 Lerner+0.859
 UNCLASSIFIED+0.837
 DOI+0.822
Neg logit contributions
orld-1.456
iggins-1.411
cessive-1.398
phal-1.350
angel-1.304
Token going
Token ID1016
Feature activation-2.35e-01

Pos logit contributions
orld+1.535
iggins+1.483
cessive+1.471
phal+1.421
angel+1.383
Neg logit contributions
 Ratio-1.217
 Spy-1.016
 Lerner-1.009
 UNCLASSIFIED-0.979
 DOI-0.974
Token to
Token ID284
Feature activation-4.92e+00

Pos logit contributions
orld+0.187
iggins+0.179
cessive+0.178
phal+0.172
angel+0.167
Neg logit contributions
 Ratio-0.174
 Lerner-0.148
 Spy-0.147
 UNCLASSIFIED-0.144
 DOI-0.142
Token economy
Token ID3773
Feature activation+5.22e+00

Pos logit contributions
 Ratio+0.978
 Spy+0.823
 Lerner+0.817
 UNCLASSIFIED+0.797
 DOI+0.789
Neg logit contributions
orld-1.153
iggins-1.106
cessive-1.095
phal-1.064
idi-1.024
Token can
Token ID460
Feature activation+7.82e-01

Pos logit contributions
 Ratio+3.296
 Lerner+2.779
 Spy+2.774
 UNCLASSIFIED+2.728
 Depression+2.680
Neg logit contributions
orld-4.592
iggins-4.500
cessive-4.418
phal-4.215
inent-4.182
Token present
Token ID1944
Feature activation+2.85e+00

Pos logit contributions
 Ratio+0.531
 Spy+0.447
 Lerner+0.437
 UNCLASSIFIED+0.432
 Depression+0.425
Neg logit contributions
orld-0.672
iggins-0.651
cessive-0.640
phal-0.625
angel-0.604
Token over
Token ID625
Feature activation-1.54e+00

Pos logit contributions
 Ratio+1.906
 Spy+1.583
 Lerner+1.576
 Depression+1.548
 UNCLASSIFIED+1.541
Neg logit contributions
orld-2.476
iggins-2.392
cessive-2.352
phal-2.294
angel-2.223
Token-
Token ID12
Feature activation+1.39e+00

Pos logit contributions
orld+1.230
iggins+1.187
cessive+1.178
phal+1.131
angel+1.100
Neg logit contributions
 Ratio-1.139
 Spy-0.974
 Lerner-0.960
 UNCLASSIFIED-0.934
 DOI-0.931
Tokenexp
Token ID11201
Feature activation+1.20e+01

Pos logit contributions
 Ratio+1.056
 Spy+0.903
 Lerner+0.899
 UNCLASSIFIED+0.876
 DOI+0.865
Neg logit contributions
orld-1.083
iggins-1.044
cessive-1.022
phal-0.989
angel-0.949
Tokenansion
Token ID5487
Feature activation+1.66e+00

Pos logit contributions
 Ratio+5.856
 Spy+4.569
range+4.560
alty+4.500
 Lerner+4.420
Neg logit contributions
orld-11.014
phal-10.685
iggins-10.500
cessive-10.102
idi-9.830
Token of
Token ID286
Feature activation-1.31e+00

Pos logit contributions
 Ratio+1.213
 Spy+1.029
 Lerner+1.014
 Depression+0.992
 UNCLASSIFIED+0.992
Neg logit contributions
orld-1.344
iggins-1.290
cessive-1.280
phal-1.229
angel-1.191
Token the
Token ID262
Feature activation-6.95e-01

Pos logit contributions
orld+1.090
iggins+1.052
cessive+1.040
phal+1.004
angel+0.974
Neg logit contributions
 Ratio-0.928
 Spy-0.785
 Lerner-0.776
 UNCLASSIFIED-0.754
 DOI-0.748
Token money
Token ID1637
Feature activation-2.15e+00

Pos logit contributions
orld+0.575
iggins+0.555
cessive+0.549
phal+0.530
angel+0.513
Neg logit contributions
 Ratio-0.497
 Spy-0.421
 Lerner-0.416
 UNCLASSIFIED-0.405
 DOI-0.401
Token supply
Token ID5127
Feature activation+4.85e+00

Pos logit contributions
orld+1.347
iggins+1.284
cessive+1.264
phal+1.206
angel+1.157
Neg logit contributions
 Ratio-1.960
 Spy-1.728
 Lerner-1.712
 UNCLASSIFIED-1.676
 DOI-1.666
TokenWhite
Token ID12256
Feature activation+4.89e-01

Pos logit contributions
 Ratio+2.888
 Spy+2.511
 Lerner+2.478
 UNCLASSIFIED+2.388
 Depression+2.373
Neg logit contributions
orld-3.008
iggins-2.905
cessive-2.840
phal-2.737
angel-2.616
Token Rock
Token ID4631
Feature activation-6.76e+00

Pos logit contributions
 Ratio+0.375
 Spy+0.323
 Lerner+0.318
 UNCLASSIFIED+0.310
 DOI+0.307
Neg logit contributions
orld-0.379
iggins-0.365
cessive-0.360
phal-0.346
angel-0.335
Token by
Token ID416
Feature activation+4.62e+00

Pos logit contributions
orld+5.406
iggins+5.208
cessive+5.080
phal+4.976
angel+4.812
Neg logit contributions
 Ratio-4.704
 Spy-4.019
 Lerner-3.971
 DOI-3.889
 UNCLASSIFIED-3.827
Token Conservative
Token ID11132
Feature activation-7.76e-02

Pos logit contributions
 Ratio+3.043
 Spy+2.548
 Lerner+2.504
 DOI+2.399
 Depression+2.384
Neg logit contributions
orld-4.056
iggins-3.877
cessive-3.867
phal-3.746
idi-3.632
Token candidate
Token ID4540
Feature activation-2.44e+00

Pos logit contributions
orld+0.067
iggins+0.065
cessive+0.064
phal+0.062
angel+0.060
Neg logit contributions
 Ratio-0.053
 Spy-0.044
 Lerner-0.044
 UNCLASSIFIED-0.042
 DOI-0.042
Token Dianne
Token ID49197
Feature activation+9.28e+00

Pos logit contributions
orld+2.160
iggins+2.101
cessive+2.070
phal+2.006
angel+1.959
Neg logit contributions
 Ratio-1.574
 Spy-1.298
 Lerner-1.284
 UNCLASSIFIED-1.252
 DOI-1.246
Token Watts
Token ID27555
Feature activation-6.53e+00

Pos logit contributions
 Ratio+5.547
 Spy+4.778
 Lerner+4.762
 Ples+4.292
 Depression+4.291
Neg logit contributions
orld-8.315
cessive-8.042
iggins-8.024
phal-7.845
angel-7.569
Token,
Token ID11
Feature activation+2.25e+00

Pos logit contributions
orld+5.209
iggins+5.057
cessive+4.916
phal+4.767
angel+4.720
Neg logit contributions
 Ratio-4.616
 Spy-3.895
 Lerner-3.801
 UNCLASSIFIED-3.775
 DOI-3.765
Token is
Token ID318
Feature activation-4.49e-02

Pos logit contributions
 Ratio+1.532
 Spy+1.302
 Lerner+1.279
 UNCLASSIFIED+1.237
 DOI+1.225
Neg logit contributions
orld-1.950
iggins-1.872
cessive-1.860
phal-1.789
angel-1.738
Token being
Token ID852
Feature activation+2.58e+00

Pos logit contributions
orld+0.038
iggins+0.036
cessive+0.036
phal+0.034
angel+0.034
Neg logit contributions
 Ratio-0.031
 Spy-0.027
 Lerner-0.027
 UNCLASSIFIED-0.026
 DOI-0.026
Token called
Token ID1444
Feature activation-3.04e+00

Pos logit contributions
 Ratio+1.765
 Spy+1.537
 Lerner+1.506
 UNCLASSIFIED+1.461
 DOI+1.438
Neg logit contributions
orld-2.181
iggins-2.112
cessive-2.092
phal-2.002
angel-1.975
Token with
Token ID351
Feature activation-6.37e-01

Pos logit contributions
orld+0.988
iggins+0.955
cessive+0.944
phal+0.912
angel+0.886
Neg logit contributions
 Ratio-0.786
 Spy-0.661
 Lerner-0.652
 UNCLASSIFIED-0.634
 DOI-0.628
Token water
Token ID1660
Feature activation-3.29e+00

Pos logit contributions
orld+0.511
iggins+0.492
cessive+0.486
phal+0.468
angel+0.453
Neg logit contributions
 Ratio-0.472
 Spy-0.404
 Lerner-0.398
 UNCLASSIFIED-0.388
 DOI-0.385
Token,
Token ID11
Feature activation+9.92e-02

Pos logit contributions
orld+2.761
iggins+2.659
cessive+2.633
phal+2.560
angel+2.475
Neg logit contributions
 Ratio-2.296
 Spy-1.939
 Lerner-1.917
 UNCLASSIFIED-1.854
 DOI-1.840
Token her
Token ID607
Feature activation+4.36e+00

Pos logit contributions
 Ratio+0.067
 Spy+0.056
 Lerner+0.056
 UNCLASSIFIED+0.054
 DOI+0.053
Neg logit contributions
orld-0.086
iggins-0.083
cessive-0.082
phal-0.079
angel-0.077
Token skull
Token ID14511
Feature activation+7.36e+00

Pos logit contributions
 Ratio+2.935
 Spy+2.513
 Lerner+2.443
 DOI+2.412
 Depression+2.404
Neg logit contributions
orld-3.709
iggins-3.589
cessive-3.557
phal-3.424
angel-3.326
Token —
Token ID851
Feature activation+9.89e+00

Pos logit contributions
 Ratio+4.607
 Spy+3.968
 Lerner+3.934
 UNCLASSIFIED+3.880
 DOI+3.829
Neg logit contributions
orld-6.276
iggins-6.163
cessive-6.162
phal-5.759
angel-5.723
Token upside
Token ID17196
Feature activation+7.83e+00

Pos logit contributions
 Ratio+6.197
 Spy+5.329
 Lerner+5.224
 Depression+5.108
 UNCLASSIFIED+5.047
Neg logit contributions
orld-8.198
iggins-8.121
cessive-7.787
phal-7.395
inent-7.370
Token down
Token ID866
Feature activation+4.15e+00

Pos logit contributions
 Ratio+5.610
 Spy+4.877
 Lerner+4.829
 Depression+4.783
 DOI+4.733
Neg logit contributions
orld-5.930
iggins-5.806
cessive-5.738
phal-5.456
inent-5.289
Token,
Token ID11
Feature activation-5.42e-01

Pos logit contributions
 Ratio+2.632
 Spy+2.206
 Lerner+2.178
 UNCLASSIFIED+2.113
 DOI+2.091
Neg logit contributions
orld-3.688
iggins-3.597
cessive-3.573
phal-3.426
amph-3.328
Token teeth
Token ID9941
Feature activation+2.10e+00

Pos logit contributions
orld+0.467
iggins+0.451
cessive+0.446
phal+0.431
angel+0.419
Neg logit contributions
 Ratio-0.368
 Spy-0.309
 Lerner-0.305
 UNCLASSIFIED-0.296
 DOI-0.293
Token remarkably
Token ID21196
Feature activation+1.87e+00

Pos logit contributions
 Ratio+1.453
 Spy+1.230
 Lerner+1.219
 UNCLASSIFIED+1.190
 DOI+1.168
Neg logit contributions
orld-1.767
iggins-1.727
cessive-1.687
phal-1.614
angel-1.585
TokenMemory
Token ID30871
Feature activation+4.10e+00

Pos logit contributions
 Ratio+0.798
 Spy+0.685
 Lerner+0.671
 UNCLASSIFIED+0.653
 DOI+0.647
Neg logit contributions
orld-0.776
iggins-0.746
cessive-0.732
phal-0.710
angel-0.684
Token 4
Token ID604
Feature activation+4.43e+00

Pos logit contributions
 Ratio+3.034
 Spy+2.545
 Lerner+2.447
 UNCLASSIFIED+2.407
 DOI+2.405
Neg logit contributions
orld-3.312
iggins-3.206
cessive-3.167
phal-3.086
angel-2.956
Token66
Token ID2791
Feature activation+4.16e+00

Pos logit contributions
 Ratio+3.283
 Spy+2.790
 Lerner+2.719
 UNCLASSIFIED+2.693
 DOI+2.673
Neg logit contributions
orld-3.525
iggins-3.402
cessive-3.345
phal-3.253
inent-3.127
Token284
Token ID30336
Feature activation+4.47e+00

Pos logit contributions
 Ratio+3.153
 Spy+2.708
 Lerner+2.651
 UNCLASSIFIED+2.595
 DOI+2.582
Neg logit contributions
orld-3.226
iggins-3.116
cessive-3.061
phal-2.982
inent-2.854
Token 4
Token ID604
Feature activation+3.84e+00

Pos logit contributions
 Ratio+3.226
 Spy+2.738
 Lerner+2.666
 UNCLASSIFIED+2.617
 DOI+2.613
Neg logit contributions
orld-3.655
iggins-3.533
cessive-3.472
phal-3.404
inent-3.259
Token64
Token ID2414
Feature activation+8.83e-01

Pos logit contributions
 Ratio+2.908
 Spy+2.479
 Lerner+2.444
 DOI+2.397
 UNCLASSIFIED+2.392
Neg logit contributions
orld-2.997
iggins-2.883
cessive-2.842
phal-2.753
inent-2.644
Token406
Token ID29703
Feature activation+2.84e+00

Pos logit contributions
 Ratio+0.655
 Spy+0.559
 Lerner+0.552
 UNCLASSIFIED+0.539
 DOI+0.535
Neg logit contributions
orld-0.707
iggins-0.682
cessive-0.673
phal-0.650
angel-0.627
Token Mem
Token ID4942
Feature activation+5.96e+00

Pos logit contributions
 Ratio+1.927
 Spy+1.615
 Lerner+1.569
 UNCLASSIFIED+1.541
 DOI+1.535
Neg logit contributions
orld-2.471
iggins-2.400
cessive-2.367
phal-2.301
inent-2.214
Tokenc
Token ID66
Feature activation+7.42e+00

Pos logit contributions
 Ratio+4.178
 Spy+3.545
 Lerner+3.388
 UNCLASSIFIED+3.324
 DOI+3.323
Neg logit contributions
orld-4.883
iggins-4.699
cessive-4.652
phal-4.489
angel-4.339
Tokenpy
Token ID9078
Feature activation+6.30e+00

Pos logit contributions
 Ratio+3.913
 Spy+3.128
range+3.125
 UNCLASSIFIED+3.058
alty+2.991
Neg logit contributions
orld-6.980
iggins-6.954
cessive-6.618
phal-6.290
estial-6.232
TokenReading
Token ID36120
Feature activation+3.62e-01

Pos logit contributions
 Ratio+5.126
 Spy+4.395
 UNCLASSIFIED+4.224
 DOI+4.216
 Lerner+4.215
Neg logit contributions
orld-4.412
iggins-4.266
cessive-4.182
phal-4.142
angel-3.876
TokenTV
Token ID6849
Feature activation-2.53e+00

Pos logit contributions
 Ratio+0.908
 Spy+0.771
 Lerner+0.771
 UNCLASSIFIED+0.742
 DOI+0.728
Neg logit contributions
orld-0.988
iggins-0.952
cessive-0.939
phal-0.902
inent-0.864
Token play
Token ID711
Feature activation-5.12e+00

Pos logit contributions
orld+2.431
iggins+2.357
cessive+2.334
phal+2.265
angel+2.207
Neg logit contributions
 Ratio-1.470
 Spy-1.195
 Lerner-1.175
 UNCLASSIFIED-1.134
 DOI-1.121
Token)
Token ID8
Feature activation+2.27e+00

Pos logit contributions
orld+3.756
iggins+3.576
cessive+3.510
phal+3.322
angel+3.321
Neg logit contributions
 Ratio-4.054
 Lerner-3.460
 Spy-3.457
 DOI-3.419
 UNCLASSIFIED-3.382
Token that
Token ID326
Feature activation-3.85e+00

Pos logit contributions
 Ratio+1.249
 Spy+1.062
 Lerner+1.026
 UNCLASSIFIED+0.965
 Prevention+0.947
Neg logit contributions
orld-2.233
iggins-2.188
cessive-2.136
phal-2.098
amph-2.027
Token have
Token ID423
Feature activation-1.38e+00

Pos logit contributions
orld+3.267
iggins+3.161
cessive+3.157
phal+3.051
angel+2.967
Neg logit contributions
 Ratio-2.566
 Spy-2.160
 Lerner-2.113
 DOI-2.101
 Scythe-2.068
Token been
Token ID587
Feature activation+2.01e+00

Pos logit contributions
orld+1.213
iggins+1.172
cessive+1.161
phal+1.120
angel+1.095
Neg logit contributions
 Ratio-0.906
 Spy-0.757
 Lerner-0.746
 UNCLASSIFIED-0.723
 DOI-0.718
Token added
Token ID2087
Feature activation+8.02e-02

Pos logit contributions
 Ratio+1.493
 Spy+1.275
 UNCLASSIFIED+1.268
 Lerner+1.235
 Depression+1.206
Neg logit contributions
orld-1.625
iggins-1.610
cessive-1.543
phal-1.485
letes-1.440
Token to
Token ID284
Feature activation+9.76e-01

Pos logit contributions
 Ratio+0.052
 Spy+0.044
 Lerner+0.043
 UNCLASSIFIED+0.042
 DOI+0.042
Neg logit contributions
orld-0.072
iggins-0.070
cessive-0.068
phal-0.066
angel-0.064
Token this
Token ID428
Feature activation+2.53e+00

Pos logit contributions
 Ratio+0.765
 Spy+0.659
 Lerner+0.649
 UNCLASSIFIED+0.634
 DOI+0.628
Neg logit contributions
orld-0.749
iggins-0.719
cessive-0.706
phal-0.681
angel-0.656
Token title
Token ID3670
Feature activation-2.12e+00

Pos logit contributions
 Ratio+1.629
 Spy+1.405
 Lerner+1.344
 UNCLASSIFIED+1.328
 Depression+1.298
Neg logit contributions
orld-2.244
iggins-2.189
cessive-2.163
phal-2.120
angel-2.021
Token.
Token ID13
Feature activation+3.10e+00

Pos logit contributions
orld+1.745
iggins+1.691
cessive+1.672
phal+1.616
angel+1.555
Neg logit contributions
 Ratio-1.521
 Spy-1.292
 Lerner-1.273
 UNCLASSIFIED-1.245
 DOI-1.229
Token.
Token ID13
Feature activation+9.17e-01

Pos logit contributions
orld+2.821
iggins+2.709
cessive+2.679
phal+2.581
angel+2.522
Neg logit contributions
 Ratio-2.675
 Spy-2.278
 Lerner-2.249
 UNCLASSIFIED-2.190
 DOI-2.186
Token
Token ID198
Feature activation-1.30e+00

Pos logit contributions
 Ratio+0.625
 Spy+0.529
 Lerner+0.520
 UNCLASSIFIED+0.503
 DOI+0.498
Neg logit contributions
orld-0.799
iggins-0.775
cessive-0.762
phal-0.734
angel-0.713
Token
Token ID198
Feature activation-5.69e-01

Pos logit contributions
orld+1.178
iggins+1.151
cessive+1.128
phal+1.083
idi+1.049
Neg logit contributions
 Ratio-0.839
 Spy-0.706
 Lerner-0.691
 UNCLASSIFIED-0.678
 DOI-0.672
TokenBut
Token ID1537
Feature activation+4.88e-01

Pos logit contributions
orld+0.387
iggins+0.372
cessive+0.362
phal+0.346
idi+0.332
Neg logit contributions
 Ratio-0.490
 Spy-0.432
 Lerner-0.427
 UNCLASSIFIED-0.419
 DOI-0.414
Token research
Token ID2267
Feature activation-3.46e+00

Pos logit contributions
 Ratio+0.341
 Spy+0.291
 Lerner+0.286
 UNCLASSIFIED+0.277
 DOI+0.276
Neg logit contributions
orld-0.414
iggins-0.399
cessive-0.395
phal-0.380
angel-0.369
Token proves
Token ID17021
Feature activation+1.60e+00

Pos logit contributions
orld+2.713
iggins+2.615
cessive+2.578
phal+2.497
angel+2.410
Neg logit contributions
 Ratio-2.600
 Spy-2.212
 Lerner-2.192
 UNCLASSIFIED-2.130
 DOI-2.118
Token that
Token ID326
Feature activation+1.64e+00

Pos logit contributions
 Ratio+0.998
 Spy+0.832
 Lerner+0.818
 UNCLASSIFIED+0.786
 DOI+0.781
Neg logit contributions
orld-1.479
iggins-1.431
cessive-1.410
phal-1.362
angel-1.332
Token the
Token ID262
Feature activation+1.12e+00

Pos logit contributions
 Ratio+1.105
 Spy+0.935
 Lerner+0.915
 UNCLASSIFIED+0.881
 DOI+0.879
Neg logit contributions
orld-1.438
iggins-1.394
cessive-1.371
phal-1.325
angel-1.288
Token foul
Token ID15626
Feature activation-1.59e+00

Pos logit contributions
 Ratio+0.781
 Spy+0.663
 Lerner+0.651
 UNCLASSIFIED+0.628
 DOI+0.626
Neg logit contributions
orld-0.956
iggins-0.922
cessive-0.909
phal-0.878
angel-0.852
Token cal
Token ID2386
Feature activation-3.69e+00

Pos logit contributions
orld+1.312
iggins+1.265
cessive+1.251
phal+1.208
angel+1.171
Neg logit contributions
 Ratio-1.136
 Spy-0.962
 Lerner-0.952
 UNCLASSIFIED-0.928
 DOI-0.918
Tokenum
Token ID388
Feature activation-6.76e+00

Pos logit contributions
orld+2.606
iggins+2.514
cessive+2.482
phal+2.401
idi+2.300
Neg logit contributions
 Ratio-3.068
 Spy-2.653
 Lerner-2.632
 UNCLASSIFIED-2.566
 Depression-2.545
Token to
Token ID284
Feature activation+1.22e+00

Pos logit contributions
 Ratio+0.831
 Spy+0.700
 Lerner+0.690
 UNCLASSIFIED+0.667
 DOI+0.659
Neg logit contributions
orld-1.066
iggins-1.030
cessive-1.020
phal-0.985
angel-0.961
Token play
Token ID711
Feature activation-9.06e-01

Pos logit contributions
 Ratio+0.791
 Spy+0.664
 Lerner+0.645
 UNCLASSIFIED+0.636
 DOI+0.622
Neg logit contributions
orld-1.093
iggins-1.056
cessive-1.045
phal-1.018
angel-0.990
Token physical
Token ID3518
Feature activation-1.98e+00

Pos logit contributions
orld+0.779
iggins+0.754
cessive+0.746
phal+0.721
angel+0.701
Neg logit contributions
 Ratio-0.617
 Spy-0.521
 Lerner-0.513
 UNCLASSIFIED-0.499
 DOI-0.493
Token and
Token ID290
Feature activation+7.22e-01

Pos logit contributions
orld+1.692
iggins+1.633
cessive+1.615
phal+1.561
angel+1.515
Neg logit contributions
 Ratio-1.365
 Spy-1.149
 Lerner-1.134
 UNCLASSIFIED-1.102
 DOI-1.092
Token re
Token ID302
Feature activation+2.14e+00

Pos logit contributions
 Ratio+0.473
 Spy+0.396
 Lerner+0.390
 UNCLASSIFIED+0.379
 DOI+0.374
Neg logit contributions
orld-0.640
iggins-0.620
cessive-0.613
phal-0.593
angel-0.579
Token-
Token ID12
Feature activation+4.08e+00

Pos logit contributions
 Ratio+1.564
 Spy+1.345
 Lerner+1.318
 UNCLASSIFIED+1.278
 DOI+1.265
Neg logit contributions
orld-1.716
iggins-1.655
cessive-1.632
phal-1.579
angel-1.528
Tokenroute
Token ID38629
Feature activation+3.35e+00

Pos logit contributions
 Ratio+3.271
 Spy+2.885
 Lerner+2.836
range+2.765
 UNCLASSIFIED+2.752
Neg logit contributions
orld-2.855
iggins-2.746
cessive-2.714
phal-2.609
inent-2.496
Token Baldwin
Token ID26402
Feature activation+5.48e+00

Pos logit contributions
 Ratio+2.146
 Spy+1.847
 Lerner+1.782
 Depression+1.710
 DOI+1.684
Neg logit contributions
orld-2.991
iggins-2.897
cessive-2.865
phal-2.754
inent-2.696
Token to
Token ID284
Feature activation-2.40e+00

Pos logit contributions
 Ratio+3.517
 Spy+3.018
 Lerner+2.885
 UNCLASSIFIED+2.849
 Depression+2.804
Neg logit contributions
orld-4.692
iggins-4.654
cessive-4.565
phal-4.309
inent-4.301
Token throw
Token ID3714
Feature activation+3.59e-01

Pos logit contributions
orld+2.101
iggins+2.033
cessive+2.013
phal+1.947
angel+1.891
Neg logit contributions
 Ratio-1.585
 Spy-1.320
 Lerner-1.306
 UNCLASSIFIED-1.274
 DOI-1.258
Token the
Token ID262
Feature activation-1.12e+00

Pos logit contributions
 Ratio+0.248
 Spy+0.211
 Lerner+0.206
 UNCLASSIFIED+0.200
 DOI+0.198
Neg logit contributions
orld-0.306
iggins-0.296
cessive-0.292
phal-0.282
angel-0.275
Token a
Token ID257
Feature activation-8.32e-01

Pos logit contributions
orld+3.237
iggins+3.145
cessive+3.113
phal+3.010
idi+2.914
Neg logit contributions
 Ratio-2.796
 Spy-2.350
 Lerner-2.328
 UNCLASSIFIED-2.245
 DOI-2.240
Token must
Token ID1276
Feature activation-2.93e+00

Pos logit contributions
orld+0.704
iggins+0.674
cessive+0.668
phal+0.644
angel+0.627
Neg logit contributions
 Ratio-0.584
 Spy-0.496
 Lerner-0.487
 UNCLASSIFIED-0.476
 DOI-0.472
Token-
Token ID12
Feature activation+5.21e-01

Pos logit contributions
orld+2.452
iggins+2.370
cessive+2.338
phal+2.260
angel+2.199
Neg logit contributions
 Ratio-2.053
 Spy-1.732
 Lerner-1.719
 UNCLASSIFIED-1.657
 DOI-1.646
Tokenread
Token ID961
Feature activation+5.92e-01

Pos logit contributions
 Ratio+0.347
 Spy+0.293
 Lerner+0.288
 UNCLASSIFIED+0.281
 DOI+0.279
Neg logit contributions
orld-0.452
iggins-0.436
cessive-0.430
phal-0.417
angel-0.401
Token for
Token ID329
Feature activation+2.67e+00

Pos logit contributions
 Ratio+0.391
 Spy+0.332
 Lerner+0.324
 UNCLASSIFIED+0.319
 DOI+0.315
Neg logit contributions
orld-0.523
iggins-0.500
cessive-0.499
phal-0.483
angel-0.467
Token the
Token ID262
Feature activation+9.07e-01

Pos logit contributions
 Ratio+1.795
 Spy+1.503
 Lerner+1.488
 DOI+1.443
 UNCLASSIFIED+1.427
Neg logit contributions
orld-2.342
iggins-2.239
cessive-2.227
phal-2.154
angel-2.092
Token basic
Token ID4096
Feature activation+4.06e+00

Pos logit contributions
 Ratio+0.630
 Spy+0.531
 Lerner+0.524
 DOI+0.510
 UNCLASSIFIED+0.507
Neg logit contributions
orld-0.775
iggins-0.741
cessive-0.736
phal-0.714
angel-0.688
Token intuition
Token ID31563
Feature activation+2.53e+00

Pos logit contributions
 Ratio+2.790
 Spy+2.382
 Lerner+2.361
 DOI+2.291
 UNCLASSIFIED+2.270
Neg logit contributions
orld-3.405
cessive-3.253
iggins-3.236
phal-3.161
angel-3.047
Token about
Token ID546
Feature activation+4.82e-01

Pos logit contributions
 Ratio+1.748
 Spy+1.489
 Lerner+1.477
 UNCLASSIFIED+1.437
 DOI+1.413
Neg logit contributions
orld-2.124
iggins-2.040
cessive-2.036
phal-1.943
angel-1.896
Token how
Token ID703
Feature activation+8.03e-01

Pos logit contributions
 Ratio+0.320
 Spy+0.268
 Lerner+0.264
 UNCLASSIFIED+0.256
 DOI+0.254
Neg logit contributions
orld-0.426
iggins-0.409
cessive-0.405
phal-0.392
angel-0.382
Token transaction
Token ID8611
Feature activation+4.55e+00

Pos logit contributions
 Ratio+0.553
 Spy+0.467
 Lerner+0.461
 UNCLASSIFIED+0.447
 DOI+0.445
Neg logit contributions
orld-0.690
iggins-0.663
cessive-0.655
phal-0.632
angel-0.617
Token parts
Token ID3354
Feature activation-2.82e+00

Pos logit contributions
 Ratio+0.634
 Spy+0.550
 Lerner+0.543
 UNCLASSIFIED+0.532
 DOI+0.527
Neg logit contributions
orld-0.606
iggins-0.582
cessive-0.575
phal-0.552
angel-0.532
Token of
Token ID286
Feature activation+2.58e+00

Pos logit contributions
orld+2.056
iggins+1.946
cessive+1.936
phal+1.878
idi+1.847
Neg logit contributions
 Ratio-2.270
 Spy-1.917
 Lerner-1.884
 DOI-1.872
 Depression-1.865
Token northern
Token ID7840
Feature activation+2.39e+00

Pos logit contributions
 Ratio+1.802
 Spy+1.537
 Lerner+1.526
 UNCLASSIFIED+1.466
 DOI+1.459
Neg logit contributions
orld-2.172
iggins-2.104
cessive-2.081
phal-2.008
angel-1.949
Token Syria
Token ID4392
Feature activation+3.00e+00

Pos logit contributions
 Ratio+1.563
 Spy+1.308
 Lerner+1.300
 UNCLASSIFIED+1.250
 DOI+1.241
Neg logit contributions
orld-2.118
iggins-2.053
cessive-2.034
phal-1.962
angel-1.906
Token in
Token ID287
Feature activation+2.15e+00

Pos logit contributions
 Ratio+2.019
 Spy+1.716
 Lerner+1.703
 UNCLASSIFIED+1.665
 DOI+1.622
Neg logit contributions
orld-2.579
iggins-2.516
cessive-2.481
phal-2.388
angel-2.306
Token partnership
Token ID10413
Feature activation-8.67e-01

Pos logit contributions
 Ratio+1.389
 Spy+1.164
 Lerner+1.155
 UNCLASSIFIED+1.110
 DOI+1.102
Neg logit contributions
orld-1.925
iggins-1.860
cessive-1.843
phal-1.798
angel-1.736
Token with
Token ID351
Feature activation+9.04e-02

Pos logit contributions
orld+0.716
iggins+0.684
cessive+0.679
phal+0.657
idi+0.635
Neg logit contributions
 Ratio-0.620
 Spy-0.523
 Lerner-0.520
 DOI-0.500
 Depression-0.498
Token some
Token ID617
Feature activation-3.24e+00

Pos logit contributions
 Ratio+0.057
 Spy+0.047
 Lerner+0.046
 UNCLASSIFIED+0.045
 DOI+0.045
Neg logit contributions
orld-0.082
iggins-0.080
cessive-0.079
phal-0.076
angel-0.074
Token Arab
Token ID4498
Feature activation+6.54e-01

Pos logit contributions
orld+2.692
iggins+2.611
cessive+2.571
phal+2.507
angel+2.434
Neg logit contributions
 Ratio-2.230
 Spy-1.863
 Lerner-1.840
 DOI-1.779
 Depression-1.762
Token tribes
Token ID15663
Feature activation-1.23e+00

Pos logit contributions
 Ratio+0.554
 Spy+0.487
 Lerner+0.479
 UNCLASSIFIED+0.475
 Depression+0.466
Neg logit contributions
orld-0.452
iggins-0.433
cessive-0.429
phal-0.406
angel-0.391
Token there
Token ID612
Feature activation+1.99e+00

Pos logit contributions
orld+1.013
iggins+0.975
cessive+0.965
phal+0.934
angel+0.905
Neg logit contributions
 Ratio-0.888
 Spy-0.753
 Lerner-0.742
 UNCLASSIFIED-0.721
 DOI-0.718
Token its
Token ID663
Feature activation+4.10e-01

Pos logit contributions
 Ratio+0.679
 Spy+0.565
 Lerner+0.556
 UNCLASSIFIED+0.537
 DOI+0.533
Neg logit contributions
orld-0.971
iggins-0.939
cessive-0.931
phal-0.900
angel-0.874
Token similarly
Token ID12470
Feature activation-2.49e+00

Pos logit contributions
 Ratio+0.276
 Spy+0.233
 Lerner+0.229
 DOI+0.221
 UNCLASSIFIED+0.221
Neg logit contributions
orld-0.357
iggins-0.345
cessive-0.342
phal-0.331
angel-0.320
Token configured
Token ID17839
Feature activation+1.91e+00

Pos logit contributions
orld+2.088
iggins+2.014
cessive+1.989
phal+1.938
angel+1.870
Neg logit contributions
 Ratio-1.732
 Spy-1.457
 Lerner-1.435
 UNCLASSIFIED-1.397
 DOI-1.383
Token rival
Token ID8976
Feature activation-4.14e+00

Pos logit contributions
 Ratio+1.251
 Spy+1.054
 Lerner+1.038
 DOI+1.005
 UNCLASSIFIED+0.992
Neg logit contributions
orld-1.707
cessive-1.645
iggins-1.643
phal-1.587
angel-1.529
Token Pro
Token ID1041
Feature activation-7.37e+00

Pos logit contributions
orld+3.551
iggins+3.432
cessive+3.368
phal+3.280
angel+3.186
Neg logit contributions
 Ratio-2.746
 Spy-2.276
 Lerner-2.263
 UNCLASSIFIED-2.218
 Depression-2.179
TokenQuest
Token ID12166
Feature activation-3.67e-01

Pos logit contributions
orld+5.349
iggins+5.161
cessive+5.098
phal+4.888
idi+4.780
Neg logit contributions
 Ratio-5.712
 Spy-4.897
 Lerner-4.857
 DOI-4.774
 UNCLASSIFIED-4.757
Token.
Token ID13
Feature activation-6.68e-01

Pos logit contributions
orld+0.309
iggins+0.298
cessive+0.296
phal+0.285
angel+0.276
Neg logit contributions
 Ratio-0.257
 Spy-0.217
 Lerner-0.214
 UNCLASSIFIED-0.208
 DOI-0.206
Token Since
Token ID4619
Feature activation+2.20e+00

Pos logit contributions
orld+0.585
iggins+0.566
cessive+0.560
phal+0.541
angel+0.526
Neg logit contributions
 Ratio-0.446
 Spy-0.373
 Lerner-0.368
 UNCLASSIFIED-0.357
 DOI-0.354
Token the
Token ID262
Feature activation+1.09e-01

Pos logit contributions
 Ratio+1.400
 Spy+1.189
 Lerner+1.163
 DOI+1.112
 UNCLASSIFIED+1.106
Neg logit contributions
orld-2.006
iggins-1.929
cessive-1.918
phal-1.860
angel-1.802
Token release
Token ID2650
Feature activation+4.02e+00

Pos logit contributions
 Ratio+0.072
 Spy+0.060
 Lerner+0.059
 DOI+0.057
 UNCLASSIFIED+0.057
Neg logit contributions
orld-0.098
iggins-0.094
cessive-0.093
phal-0.090
angel-0.087
Token of
Token ID286
Feature activation+6.25e-01

Pos logit contributions
 Ratio+2.878
 Spy+2.451
 Lerner+2.411
 UNCLASSIFIED+2.347
 DOI+2.337
Neg logit contributions
orld-3.307
iggins-3.180
cessive-3.151
phal-3.034
angel-2.935
Token Q
Token ID1195
Feature activation+1.65e+00

Pos logit contributions
 Ratio+0.957
 Spy+0.810
 Lerner+0.795
 UNCLASSIFIED+0.775
 Depression+0.770
Neg logit contributions
orld-1.192
iggins-1.152
cessive-1.131
phal-1.091
angel-1.063
Token&
Token ID5
Feature activation-1.55e+00

Pos logit contributions
 Ratio+1.228
 Spy+1.046
 Lerner+1.033
 UNCLASSIFIED+1.011
 DOI+0.996
Neg logit contributions
orld-1.309
iggins-1.262
cessive-1.245
phal-1.202
angel-1.159
TokenA
Token ID32
Feature activation+2.03e+00

Pos logit contributions
orld+1.227
iggins+1.184
cessive+1.167
phal+1.126
angel+1.084
Neg logit contributions
 Ratio-1.167
 Spy-0.997
 Lerner-0.984
 UNCLASSIFIED-0.960
 DOI-0.950
Token session
Token ID6246
Feature activation-1.77e-01

Pos logit contributions
 Ratio+1.329
 Spy+1.110
 Lerner+1.090
 UNCLASSIFIED+1.069
 Depression+1.055
Neg logit contributions
orld-1.790
iggins-1.740
cessive-1.711
phal-1.661
angel-1.611
Token ―
Token ID21096
Feature activation-6.41e-01

Pos logit contributions
orld+0.151
iggins+0.146
cessive+0.144
phal+0.139
angel+0.135
Neg logit contributions
 Ratio-0.123
 Spy-0.104
 Lerner-0.102
 UNCLASSIFIED-0.100
 DOI-0.098
Token refer
Token ID3522
Feature activation-4.09e+00

Pos logit contributions
orld+0.550
iggins+0.532
cessive+0.526
phal+0.508
angel+0.493
Neg logit contributions
 Ratio-0.438
 Spy-0.368
 Lerner-0.364
 UNCLASSIFIED-0.353
 DOI-0.350
Token to
Token ID284
Feature activation-3.80e+00

Pos logit contributions
orld+3.013
cessive+2.894
iggins+2.883
phal+2.811
angel+2.665
Neg logit contributions
 Ratio-3.202
 Spy-2.727
 Lerner-2.711
 UNCLASSIFIED-2.648
 Prevention-2.596
Token the
Token ID262
Feature activation-1.01e+00

Pos logit contributions
orld+3.190
iggins+3.091
cessive+3.070
phal+2.970
angel+2.883
Neg logit contributions
 Ratio-2.583
 Spy-2.123
 Lerner-2.118
 UNCLASSIFIED-2.072
 DOI-2.043
Token Republican
Token ID3415
Feature activation+4.29e-02

Pos logit contributions
orld+0.854
iggins+0.825
cessive+0.816
phal+0.788
angel+0.765
Neg logit contributions
 Ratio-0.694
 Spy-0.584
 Lerner-0.577
 UNCLASSIFIED-0.561
 DOI-0.556
Token presidential
Token ID4787
Feature activation-4.58e+00

Pos logit contributions
 Ratio+0.032
 Spy+0.028
 Lerner+0.028
 UNCLASSIFIED+0.027
 DOI+0.027
Neg logit contributions
orld-0.034
iggins-0.032
cessive-0.032
phal-0.031
angel-0.030
Token nominee
Token ID10429
Feature activation-2.16e+00

Pos logit contributions
orld+3.586
iggins+3.456
cessive+3.351
phal+3.277
angel+3.187
Neg logit contributions
 Ratio-3.416
 Lerner-2.877
 Spy-2.858
 UNCLASSIFIED-2.796
 Prevention-2.769
Token �
Token ID564
Feature activation+1.41e+00

Pos logit contributions
 Ratio+1.466
 Spy+1.252
 Lerner+1.233
 UNCLASSIFIED+1.196
 Depression+1.187
Neg logit contributions
orld-1.684
iggins-1.624
cessive-1.609
phal-1.553
angel-1.498
Token�
Token ID250
Feature activation-6.59e-01

Pos logit contributions
 Ratio+1.069
 Spy+0.917
 Lerner+0.902
 UNCLASSIFIED+0.880
 Depression+0.870
Neg logit contributions
orld-1.095
iggins-1.062
cessive-1.053
phal-1.002
angel-0.976
Tokenhere
Token ID1456
Feature activation-2.04e+00

Pos logit contributions
orld+0.503
iggins+0.483
cessive+0.477
phal+0.459
angel+0.444
Neg logit contributions
 Ratio-0.513
 Spy-0.442
 Lerner-0.437
 UNCLASSIFIED-0.426
 DOI-0.423
Token and
Token ID290
Feature activation-1.26e+00

Pos logit contributions
orld+1.666
iggins+1.593
cessive+1.574
phal+1.518
angel+1.483
Neg logit contributions
 Ratio-1.488
 Spy-1.260
 Lerner-1.247
 UNCLASSIFIED-1.213
 DOI-1.211
Token now
Token ID783
Feature activation+4.48e-02

Pos logit contributions
orld+1.020
iggins+0.980
cessive+0.963
phal+0.931
angel+0.904
Neg logit contributions
 Ratio-0.918
 Spy-0.779
 Lerner-0.771
 UNCLASSIFIED-0.745
 DOI-0.743
Token.
Token ID13
Feature activation-1.42e+00

Pos logit contributions
 Ratio+0.033
 Spy+0.028
 Lerner+0.027
 UNCLASSIFIED+0.027
 DOI+0.027
Neg logit contributions
orld-0.036
iggins-0.035
cessive-0.035
phal-0.033
angel-0.032
Token�
Token ID447
Feature activation+2.40e-01

Pos logit contributions
orld+1.211
iggins+1.169
cessive+1.156
phal+1.117
angel+1.085
Neg logit contributions
 Ratio-0.981
 Spy-0.826
 Lerner-0.816
 UNCLASSIFIED-0.792
 DOI-0.785
Token�
Token ID251
Feature activation-8.69e-01

Pos logit contributions
 Ratio+0.217
 Spy+0.191
 Lerner+0.189
 UNCLASSIFIED+0.185
 DOI+0.184
Neg logit contributions
orld-0.153
iggins-0.146
cessive-0.144
phal-0.136
angel-0.132
Token
Token ID198
Feature activation-2.02e+00

Pos logit contributions
orld+0.733
iggins+0.707
cessive+0.699
phal+0.675
angel+0.655
Neg logit contributions
 Ratio-0.606
 Spy-0.512
 Lerner-0.506
 UNCLASSIFIED-0.491
 DOI-0.487
Token
Token ID198
Feature activation-1.20e+00

Pos logit contributions
orld+1.761
iggins+1.705
cessive+1.684
phal+1.624
angel+1.568
Neg logit contributions
 Ratio-1.371
 Spy-1.150
 Lerner-1.136
 UNCLASSIFIED-1.108
 DOI-1.096
TokenTouch
Token ID35211
Feature activation-3.82e+00

Pos logit contributions
orld+0.864
iggins+0.828
cessive+0.817
phal+0.784
angel+0.755
Neg logit contributions
 Ratio-0.985
 Spy-0.855
 Lerner-0.847
 UNCLASSIFIED-0.827
 DOI-0.821
Tokenread
Token ID961
Feature activation-2.06e+00

Pos logit contributions
 Ratio+0.050
 Spy+0.039
 UNCLASSIFIED+0.038
 Lerner+0.038
 DOI+0.037
Neg logit contributions
orld-0.110
cessive-0.107
iggins-0.106
phal-0.103
angel-0.100
Tokenwell
Token ID4053
Feature activation-3.07e+00

Pos logit contributions
orld+1.540
cessive+1.472
iggins+1.472
phal+1.397
idi+1.350
Neg logit contributions
 Ratio-1.604
 Spy-1.404
 Lerner-1.402
 UNCLASSIFIED-1.376
 DOI-1.347
Token,
Token ID11
Feature activation+9.00e-01

Pos logit contributions
orld+2.552
iggins+2.457
cessive+2.396
phal+2.336
angel+2.274
Neg logit contributions
 Ratio-2.165
 Spy-1.814
 Lerner-1.798
 DOI-1.752
 UNCLASSIFIED-1.749
Token was
Token ID373
Feature activation-7.34e-01

Pos logit contributions
 Ratio+0.641
 Spy+0.551
 Lerner+0.542
 UNCLASSIFIED+0.528
 DOI+0.520
Neg logit contributions
orld-0.744
iggins-0.716
cessive-0.715
phal-0.686
angel-0.664
Token extremely
Token ID4457
Feature activation-1.25e+00

Pos logit contributions
orld+0.616
iggins+0.595
cessive+0.589
phal+0.568
angel+0.551
Neg logit contributions
 Ratio-0.515
 Spy-0.437
 Lerner-0.431
 UNCLASSIFIED-0.419
 DOI-0.414
Token successful
Token ID4388
Feature activation-3.77e-01

Pos logit contributions
orld+1.083
iggins+1.047
cessive+1.035
phal+1.001
angel+0.973
Neg logit contributions
 Ratio-0.838
 Spy-0.704
 Lerner-0.694
 UNCLASSIFIED-0.675
 DOI-0.667
Token in
Token ID287
Feature activation+4.46e-01

Pos logit contributions
orld+0.316
iggins+0.306
cessive+0.303
phal+0.292
angel+0.283
Neg logit contributions
 Ratio-0.264
 Spy-0.225
 Lerner-0.221
 UNCLASSIFIED-0.215
 DOI-0.213
Token similar
Token ID2092
Feature activation+7.61e-01

Pos logit contributions
 Ratio+0.297
 Spy+0.251
 Lerner+0.246
 UNCLASSIFIED+0.238
 DOI+0.235
Neg logit contributions
orld-0.392
iggins-0.378
cessive-0.374
phal-0.363
angel-0.353
Token situations
Token ID7445
Feature activation+3.08e+00

Pos logit contributions
 Ratio+0.481
 Spy+0.404
 Lerner+0.393
 UNCLASSIFIED+0.382
 DOI+0.378
Neg logit contributions
orld-0.690
iggins-0.668
cessive-0.661
phal-0.642
angel-0.625
Token at
Token ID379
Feature activation+2.88e+00

Pos logit contributions
 Ratio+2.052
 Spy+1.764
 Lerner+1.731
 UNCLASSIFIED+1.675
 DOI+1.649
Neg logit contributions
orld-2.641
iggins-2.571
cessive-2.544
phal-2.453
inent-2.374
Token Ole
Token ID30093
Feature activation-1.19e+00

Pos logit contributions
 Ratio+1.793
 Spy+1.505
 Lerner+1.474
 UNCLASSIFIED+1.418
 DOI+1.402
Neg logit contributions
orld-2.642
iggins-2.551
cessive-2.532
phal-2.457
angel-2.385
Token hearts
Token ID11954
Feature activation-3.02e+00

Pos logit contributions
orld+0.491
iggins+0.474
cessive+0.469
phal+0.452
angel+0.438
Neg logit contributions
 Ratio-0.424
 Spy-0.359
 Lerner-0.355
 UNCLASSIFIED-0.345
 DOI-0.343
Token,
Token ID11
Feature activation+2.02e-02

Pos logit contributions
orld+2.426
iggins+2.304
cessive+2.302
phal+2.252
angel+2.171
Neg logit contributions
 Ratio-2.210
 Spy-1.891
 Lerner-1.832
 UNCLASSIFIED-1.775
 DOI-1.770
Token and
Token ID290
Feature activation+1.67e+00

Pos logit contributions
 Ratio+0.015
 Spy+0.013
 Lerner+0.012
 UNCLASSIFIED+0.012
 DOI+0.012
Neg logit contributions
orld-0.016
iggins-0.016
cessive-0.015
phal-0.015
angel-0.014
Token hands
Token ID2832
Feature activation-1.00e+00

Pos logit contributions
 Ratio+1.432
 Spy+1.251
 Lerner+1.242
 UNCLASSIFIED+1.217
 DOI+1.208
Neg logit contributions
orld-1.134
iggins-1.092
cessive-1.073
phal-1.020
angel-0.988
Token."[
Token ID18161
Feature activation+2.45e-01

Pos logit contributions
orld+0.795
iggins+0.763
cessive+0.755
phal+0.729
angel+0.706
Neg logit contributions
 Ratio-0.750
 Spy-0.642
 Lerner-0.633
 UNCLASSIFIED-0.616
 DOI-0.611
Token59
Token ID3270
Feature activation-6.91e+00

Pos logit contributions
 Ratio+0.247
 Spy+0.220
 Lerner+0.219
 UNCLASSIFIED+0.215
 DOI+0.213
Neg logit contributions
orld-0.130
iggins-0.123
cessive-0.121
phal-0.114
angel-0.108
Token]
Token ID60
Feature activation-1.66e+00

Pos logit contributions
orld+3.015
cessive+2.802
iggins+2.780
phal+2.664
angel+2.497
Neg logit contributions
 Ratio-7.393
 Spy-6.621
 Lerner-6.590
geries-6.444
 UNCLASSIFIED-6.431
Token The
Token ID383
Feature activation+6.16e-03

Pos logit contributions
orld+1.404
iggins+1.357
cessive+1.343
phal+1.298
angel+1.262
Neg logit contributions
 Ratio-1.146
 Spy-0.966
 Lerner-0.950
 UNCLASSIFIED-0.927
 DOI-0.916
Token university
Token ID6403
Feature activation-6.28e+00

Pos logit contributions
 Ratio+0.004
 Spy+0.004
 Lerner+0.004
 UNCLASSIFIED+0.004
 DOI+0.004
Neg logit contributions
orld-0.005
iggins-0.005
cessive-0.005
phal-0.005
angel-0.005
Token's
Token ID338
Feature activation-1.69e+00

Pos logit contributions
orld+4.638
iggins+4.457
cessive+4.377
phal+4.238
idi+4.122
Neg logit contributions
 Ratio-4.810
 Spy-4.092
 Lerner-4.028
 UNCLASSIFIED-3.957
 DOI-3.915
Token vice
Token ID7927
Feature activation-4.15e+00

Pos logit contributions
orld+1.466
iggins+1.420
cessive+1.403
phal+1.355
angel+1.317
Neg logit contributions
 Ratio-1.136
 Spy-0.953
 Lerner-0.936
 UNCLASSIFIED-0.914
 DOI-0.903
Token cartridge
Token ID26904
Feature activation-7.48e-01

Pos logit contributions
orld+1.553
iggins+1.499
cessive+1.483
phal+1.434
angel+1.391
Neg logit contributions
 Ratio-1.263
 Spy-1.063
 Lerner-1.050
 UNCLASSIFIED-1.021
 DOI-1.012
Token,
Token ID11
Feature activation-2.80e+00

Pos logit contributions
orld+0.636
iggins+0.614
cessive+0.608
phal+0.586
angel+0.569
Neg logit contributions
 Ratio-0.518
 Spy-0.437
 Lerner-0.431
 UNCLASSIFIED-0.419
 DOI-0.415
Token O
Token ID440
Feature activation-3.10e+00

Pos logit contributions
orld+2.350
iggins+2.262
cessive+2.241
phal+2.168
angel+2.097
Neg logit contributions
 Ratio-1.948
 Spy-1.639
 Lerner-1.623
 UNCLASSIFIED-1.584
 DOI-1.569
Tokencar
Token ID7718
Feature activation-4.32e+00

Pos logit contributions
orld+1.871
iggins+1.746
cessive+1.740
phal+1.680
idi+1.588
Neg logit contributions
 Ratio-2.916
 Spy-2.564
 Lerner-2.563
 DOI-2.518
 UNCLASSIFIED-2.505
Tokenina
Token ID1437
Feature activation+1.57e+00

Pos logit contributions
orld+3.474
iggins+3.346
cessive+3.305
phal+3.180
angel+3.077
Neg logit contributions
 Ratio-3.199
 Spy-2.724
 Lerner-2.690
 UNCLASSIFIED-2.632
 DOI-2.607
Token of
Token ID286
Feature activation-6.97e+00

Pos logit contributions
 Ratio+1.151
 Spy+0.980
 Lerner+0.968
 UNCLASSIFIED+0.943
 DOI+0.935
Neg logit contributions
orld-1.273
iggins-1.226
cessive-1.212
phal-1.169
angel-1.133
Token Time
Token ID3862
Feature activation-2.94e+00

Pos logit contributions
orld+5.488
iggins+5.200
cessive+5.150
phal+4.997
angel+4.846
Neg logit contributions
 Ratio-5.079
 Lerner-4.343
 Spy-4.317
 UNCLASSIFIED-4.220
 DOI-4.177
Token was
Token ID373
Feature activation-8.39e-01

Pos logit contributions
orld+2.383
iggins+2.286
cessive+2.251
phal+2.184
angel+2.118
Neg logit contributions
 Ratio-2.122
 Spy-1.794
 Lerner-1.777
 UNCLASSIFIED-1.735
 DOI-1.728
Token the
Token ID262
Feature activation-2.81e+00

Pos logit contributions
orld+0.715
iggins+0.691
cessive+0.683
phal+0.660
angel+0.640
Neg logit contributions
 Ratio-0.579
 Spy-0.489
 Lerner-0.482
 UNCLASSIFIED-0.468
 DOI-0.464
Token last
Token ID938
Feature activation-1.72e+00

Pos logit contributions
orld+2.414
iggins+2.337
cessive+2.310
phal+2.233
angel+2.170
Neg logit contributions
 Ratio-1.901
 Spy-1.589
 Lerner-1.580
 UNCLASSIFIED-1.532
 DOI-1.518
Token Zelda
Token ID22166
Feature activation+1.08e-01

Pos logit contributions
orld+1.413
iggins+1.362
cessive+1.347
phal+1.302
angel+1.260
Neg logit contributions
 Ratio-1.242
 Spy-1.052
 Lerner-1.041
 UNCLASSIFIED-1.012
 DOI-1.005
Token and
Token ID290
Feature activation-1.15e+00

Pos logit contributions
orld+0.062
iggins+0.060
cessive+0.059
phal+0.057
angel+0.056
Neg logit contributions
 Ratio-0.049
 Spy-0.041
 Lerner-0.040
 UNCLASSIFIED-0.039
 DOI-0.039
Token access
Token ID1895
Feature activation-1.26e+00

Pos logit contributions
orld+0.963
iggins+0.929
cessive+0.919
phal+0.889
angel+0.860
Neg logit contributions
 Ratio-0.805
 Spy-0.680
 Lerner-0.671
 UNCLASSIFIED-0.650
 DOI-0.646
Token to
Token ID284
Feature activation-3.09e+00

Pos logit contributions
orld+0.991
iggins+0.956
cessive+0.940
phal+0.918
idi+0.888
Neg logit contributions
 Ratio-0.947
 Spy-0.814
 Lerner-0.812
 Depression-0.776
 UNCLASSIFIED-0.776
Token information
Token ID1321
Feature activation-5.09e-01

Pos logit contributions
orld+2.465
iggins+2.383
cessive+2.346
phal+2.284
idi+2.206
Neg logit contributions
 Ratio-2.252
 Spy-1.911
 Lerner-1.874
 UNCLASSIFIED-1.825
 DOI-1.812
Token.[
Token ID3693
Feature activation+4.10e-01

Pos logit contributions
orld+0.431
iggins+0.416
cessive+0.411
phal+0.398
angel+0.386
Neg logit contributions
 Ratio-0.354
 Spy-0.298
 Lerner-0.294
 UNCLASSIFIED-0.285
 DOI-0.283
Token109
Token ID14454
Feature activation-7.73e+00

Pos logit contributions
 Ratio+0.255
 Spy+0.210
 Lerner+0.207
 UNCLASSIFIED+0.200
 DOI+0.198
Neg logit contributions
orld-0.377
iggins-0.365
cessive-0.361
phal-0.350
angel-0.340
Token]
Token ID60
Feature activation-1.93e+00

Pos logit contributions
orld+3.032
cessive+2.831
iggins+2.761
phal+2.667
angel+2.471
Neg logit contributions
 Ratio-8.412
 Lerner-7.628
 Spy-7.538
geries-7.437
 UNCLASSIFIED-7.385
Token
Token ID198
Feature activation+3.80e-01

Pos logit contributions
orld+1.611
iggins+1.554
cessive+1.542
phal+1.494
angel+1.447
Neg logit contributions
 Ratio-1.335
 Spy-1.126
 Lerner-1.109
 UNCLASSIFIED-1.079
 DOI-1.070
Token
Token ID198
Feature activation-1.86e-01

Pos logit contributions
 Ratio+0.228
 Spy+0.192
 Lerner+0.186
 UNCLASSIFIED+0.186
 DOI+0.183
Neg logit contributions
orld-0.361
iggins-0.353
cessive-0.345
phal-0.331
inent-0.322
TokenIn
Token ID818
Feature activation+8.03e-01

Pos logit contributions
orld+0.130
iggins+0.125
cessive+0.122
phal+0.117
angel+0.112
Neg logit contributions
 Ratio-0.157
 Spy-0.137
 Lerner-0.136
 UNCLASSIFIED-0.133
 DOI-0.132
Token some
Token ID617
Feature activation-5.10e+00

Pos logit contributions
 Ratio+0.547
 Spy+0.460
 Lerner+0.456
 UNCLASSIFIED+0.442
 Depression+0.438
Neg logit contributions
orld-0.691
iggins-0.665
cessive-0.659
phal-0.636
angel-0.619
Token all
Token ID477
Feature activation+2.52e+00

Pos logit contributions
 Ratio+1.083
 Spy+0.913
 Lerner+0.897
 UNCLASSIFIED+0.858
 DOI+0.857
Neg logit contributions
orld-1.491
iggins-1.435
cessive-1.426
phal-1.379
angel-1.340
Token the
Token ID262
Feature activation+3.08e+00

Pos logit contributions
 Ratio+1.641
 Spy+1.394
 Lerner+1.370
 UNCLASSIFIED+1.298
 DOI+1.295
Neg logit contributions
orld-2.238
iggins-2.161
cessive-2.143
phal-2.070
inent-2.007
Token difference
Token ID3580
Feature activation-5.07e-01

Pos logit contributions
 Ratio+1.723
 Spy+1.454
 Lerner+1.425
 Depression+1.343
 UNCLASSIFIED+1.326
Neg logit contributions
orld-3.022
cessive-2.889
iggins-2.884
phal-2.816
angel-2.714
Token.
Token ID13
Feature activation-1.86e+00

Pos logit contributions
orld+0.421
iggins+0.406
cessive+0.402
phal+0.388
angel+0.376
Neg logit contributions
 Ratio-0.360
 Spy-0.307
 Lerner-0.302
 UNCLASSIFIED-0.294
 DOI-0.291
Token<|endoftext|>
Token ID50256
Feature activation-4.04e+00

Pos logit contributions
orld+1.583
iggins+1.526
cessive+1.510
phal+1.460
angel+1.419
Neg logit contributions
 Ratio-1.265
 Spy-1.063
 Lerner-1.051
 UNCLASSIFIED-1.022
 DOI-1.010
TokenSpeaking
Token ID13887
Feature activation-8.70e+00

Pos logit contributions
orld+2.830
iggins+2.697
cessive+2.691
phal+2.591
angel+2.517
Neg logit contributions
 Ratio-3.380
 Spy-2.921
 Lerner-2.897
 UNCLASSIFIED-2.819
 DOI-2.812
Token to
Token ID284
Feature activation+1.80e+00

Pos logit contributions
orld+6.402
iggins+6.149
cessive+6.075
phal+5.843
angel+5.675
Neg logit contributions
 Ratio-6.452
 Lerner-5.610
 Spy-5.496
 DOI-5.437
 UNCLASSIFIED-5.374
Token reporters
Token ID7638
Feature activation+4.18e+00

Pos logit contributions
 Ratio+1.195
 Spy+1.005
 Lerner+0.985
 UNCLASSIFIED+0.954
 DOI+0.944
Neg logit contributions
orld-1.589
iggins-1.534
cessive-1.521
phal-1.471
angel-1.428
Token in
Token ID287
Feature activation+3.01e+00

Pos logit contributions
 Ratio+2.822
 Spy+2.429
 Lerner+2.384
 UNCLASSIFIED+2.355
 Depression+2.300
Neg logit contributions
orld-3.485
iggins-3.434
cessive-3.382
phal-3.236
amph-3.124
Token Barcelona
Token ID15142
Feature activation+2.41e-01

Pos logit contributions
 Ratio+1.961
 Spy+1.664
 Lerner+1.642
 UNCLASSIFIED+1.576
 Depression+1.564
Neg logit contributions
orld-2.689
iggins-2.578
cessive-2.561
phal-2.475
angel-2.396
Token today
Token ID1909
Feature activation+1.94e+00

Pos logit contributions
 Ratio+0.166
 Spy+0.141
 Lerner+0.139
 UNCLASSIFIED+0.135
 DOI+0.133
Neg logit contributions
orld-0.206
iggins-0.199
cessive-0.197
phal-0.190
angel-0.183
Token Hitchcock
Token ID49817
Feature activation+3.78e+00

Pos logit contributions
 Ratio+0.741
 Spy+0.630
 Lerner+0.615
 Depression+0.591
 UNCLASSIFIED+0.589
Neg logit contributions
orld-0.924
iggins-0.894
cessive-0.878
phal-0.847
angel-0.820
Token enlisted
Token ID30166
Feature activation-2.45e+00

Pos logit contributions
 Ratio+2.581
 Spy+2.230
 Lerner+2.183
 UNCLASSIFIED+2.109
 Depression+2.066
Neg logit contributions
orld-3.142
iggins-3.053
cessive-3.028
phal-2.900
inent-2.791
Token his
Token ID465
Feature activation-3.17e+00

Pos logit contributions
orld+2.061
iggins+1.999
cessive+1.980
phal+1.911
angel+1.864
Neg logit contributions
 Ratio-1.702
 Spy-1.422
 Lerner-1.410
 UNCLASSIFIED-1.383
 DOI-1.373
Tokeng
Token ID70
Feature activation-4.14e+00

Pos logit contributions
orld+2.569
iggins+2.496
cessive+2.471
phal+2.379
angel+2.317
Neg logit contributions
 Ratio-2.282
 Spy-1.919
 Lerner-1.913
 UNCLASSIFIED-1.861
 DOI-1.858
Tokenifted
Token ID21715
Feature activation-3.81e+00

Pos logit contributions
orld+3.048
iggins+2.939
cessive+2.877
phal+2.777
angel+2.726
Neg logit contributions
 Ratio-3.290
 Spy-2.849
 Lerner-2.819
 UNCLASSIFIED-2.750
 DOI-2.742
Token title
Token ID3670
Feature activation-8.06e+00

Pos logit contributions
orld+3.098
iggins+3.004
cessive+2.964
phal+2.859
angel+2.781
Neg logit contributions
 Ratio-2.738
 Spy-2.307
 Lerner-2.286
 UNCLASSIFIED-2.238
 DOI-2.230
Token designer
Token ID11915
Feature activation-3.95e+00

Pos logit contributions
orld+6.271
iggins+6.185
cessive+6.050
phal+5.919
angel+5.723
Neg logit contributions
 Ratio-5.844
 Lerner-4.863
 Spy-4.820
 DOI-4.786
 UNCLASSIFIED-4.773
Token,
Token ID11
Feature activation-8.64e-01

Pos logit contributions
orld+3.354
iggins+3.257
cessive+3.227
phal+3.119
angel+3.054
Neg logit contributions
 Ratio-2.641
 Spy-2.189
 Lerner-2.185
 DOI-2.141
 UNCLASSIFIED-2.136
Token Saul
Token ID31603
Feature activation-1.25e+00

Pos logit contributions
orld+0.755
iggins+0.730
cessive+0.723
phal+0.699
angel+0.680
Neg logit contributions
 Ratio-0.576
 Spy-0.481
 Lerner-0.475
 UNCLASSIFIED-0.462
 DOI-0.458
Token Bass
Token ID19829
Feature activation-3.74e+00

Pos logit contributions
orld+1.176
iggins+1.140
cessive+1.128
phal+1.094
angel+1.066
Neg logit contributions
 Ratio-0.748
 Spy-0.612
 Lerner-0.601
 UNCLASSIFIED-0.583
 DOI-0.578
Token,
Token ID11
Feature activation-2.40e-01

Pos logit contributions
orld+2.873
iggins+2.795
cessive+2.730
phal+2.651
idi+2.589
Neg logit contributions
 Ratio-2.843
 Spy-2.423
 Lerner-2.386
 DOI-2.352
 UNCLASSIFIED-2.352