TOP ACTIVATIONS
MAX = 11.297

64 406 Mem c py Reading 55 80 953 16 27
64 406 Mem c py Reading 55 80 953 16 27
better . <|endoftext|> Facebook Twitter Reddit Google + Go
ON ALL PERSON EL UPDATE : 05 45 z K USH
ally in charge of Cohen probe : report Vietnamese airline takes
| count | | ---- -+ -------------------------------- + ------- | |
ally in charge of Cohen probe : report Vietnamese airline takes
: " 9 - to - 5 Working Women Education Fund
ally in charge of Cohen probe : report Vietnamese airline takes
G ED NET CALL UPDATE : 05 50 z VOL UN
2 3 void vert ( in out app data _ full
G ED NET CALL UPDATE : 05 43 z ST OW
G ED NET CALL UPDATE : 05 41 z BLACK FO
64 406 Mem c py Reading 55 80 953 16 27
. normal * 1 ; } The result pushes
8 5 Estonia 8 . 5 6 Congo , Dem .
G ED NET CALL UPDATE : 05 42 z SH AM
a home for Sword Base . Editor 's Note
G ED NET CALL UPDATE : 0 551 z VOL UN
<|endoftext|> Facebook Twitter Reddit Google + Go to the

MIN ACTIVATIONS
MIN = -8.831

21 , 21 )) # loop over the input image paths
x . x . x . 141 112 08 80 x
with the j agg ies since even though they are there
Own ed < T > { ... } The
him prisoner if he could .[ 28 ] During
: Just about the key w ording of potions .
players from other team such as sc a . GET OUT
at the University of Alabama at Birmingham School of Medicine .
, and perhaps a partial or half
< T > { pub fn new ( t : T
< T > { pub fn load < ' a >
C BA contexts . Industrial due process is that bare minimum
B orm es les Mim os as (@ b orm
x . x . x . 149 110 22 26 x
ICA ( Ar chie ) Here s a
the j agg ies since even though they are there at
x . x . x . 135 106 19 41 x
impl Board { pub fn copy _ board ( &
x . x . x . 150 110 81 12 x
x . x . x . 152 101 8 604 x

INTERVAL 6.265 - 11.297
CONTAINS 0.037%

| | ---- -+ -------------------------------- + ------- | | 1 |
Suzuki , The Chronicle 5 of 15 Photo : Erin Bre
c v 2 . COL OR _ B GR 2 G
file . file and creates corresponding templates . This
( gray , ( 3 , 3 ) , 0 )

INTERVAL 1.233 - 6.265
CONTAINS 20.328%

ON . B ) 37 62 c 5 378
Some researchers ( for example , Victor Su vor
the dram at ization of O . J . Simpson
and I 'm still trying to figure out a few things
rewarded the Bills with two years of excellent play . On

INTERVAL -3.799 - 1.233
CONTAINS 78.739%

's on to Los Angeles . Ramsey is playing at an
were people making up all kinds of stories about pur ges
lesbian , gay , bisexual and transgender people , that harassment
's India follows a protracted legal dispute with CPR L that
on the ticket , the one who might reassure Americans afraid

INTERVAL -8.831 - -3.799
CONTAINS 0.896%

10 , 15 }, {{ 9 , 14 }, { 9
published Dec . 1 , 9 a . m . PT
educational attainment and frequently in rural regions with limited access to
... } pub fn undo _ move ( & mut self
bind ers as well . Mostly , you ll
Token64
Token ID2414
Feature activation+8.16e-01

Pos logit contributions
instein+2.361
oston+2.338
kefeller+2.309
OF+2.294
artney+2.265
Neg logit contributions
 apologizing-1.901
 BG-1.876
 Ange-1.807
 adjourn-1.797
 Ge-1.788
Token406
Token ID29703
Feature activation+4.64e-01

Pos logit contributions
instein+0.778
oston+0.770
kefeller+0.768
artney+0.759
OF+0.750
Neg logit contributions
 apologizing-0.431
 BG-0.429
 Ange-0.409
 Ge-0.408
 adjourn-0.403
Token Mem
Token ID4942
Feature activation+4.07e+00

Pos logit contributions
instein+0.511
oston+0.506
kefeller+0.505
artney+0.500
OF+0.495
Neg logit contributions
 apologizing-0.177
 BG-0.175
 Ange-0.164
 Ge-0.163
 adjourn-0.161
Tokenc
Token ID66
Feature activation+5.14e+00

Pos logit contributions
instein+3.223
oston+3.189
kefeller+3.109
artney+3.082
UTC+3.069
Neg logit contributions
 apologizing-2.814
 BG-2.773
 Ange-2.677
 adjourn-2.630
 Ge-2.621
Tokenpy
Token ID9078
Feature activation+5.59e+00

Pos logit contributions
instein+3.864
oston+3.840
kefeller+3.719
UTC+3.682
artney+3.657
Neg logit contributions
 apologizing-3.768
 BG-3.674
 Ange-3.623
 adjourn-3.590
 Ge-3.511
TokenReading
Token ID36120
Feature activation+1.13e+01

Pos logit contributions
OF+4.005
oston+3.959
instein+3.958
UTC+3.922
kefeller+3.769
Neg logit contributions
 apologizing-4.173
 BG-4.109
 adjourn-4.046
 Ange-4.044
 Ge-3.945
Token 55
Token ID5996
Feature activation+4.89e+00

Pos logit contributions
oston+7.852
instein+7.743
kefeller+7.703
artney+7.588
OF+7.440
Neg logit contributions
 apologizing-8.242
 Ange-7.909
 BG-7.775
 adjourn-7.675
 Ge-7.590
Token80
Token ID1795
Feature activation+1.10e+00

Pos logit contributions
instein+5.043
oston+5.010
OF+4.999
UTC+4.954
kefeller+4.928
Neg logit contributions
 apologizing-2.135
 BG-2.064
 Ange-1.981
 Ge-1.916
 adjourn-1.912
Token953
Token ID49649
Feature activation+3.78e+00

Pos logit contributions
instein+0.714
kefeller+0.702
oston+0.701
artney+0.688
OF+0.677
Neg logit contributions
 apologizing-0.921
 BG-0.915
 Ge-0.893
 Ange-0.892
 adjourn-0.883
Token 16
Token ID1467
Feature activation+2.77e+00

Pos logit contributions
instein+3.374
oston+3.353
kefeller+3.306
OF+3.290
artney+3.271
Neg logit contributions
 apologizing-2.187
 BG-2.105
 Ange-2.029
 Ge-2.022
 adjourn-2.018
Token27
Token ID1983
Feature activation+3.84e-01

Pos logit contributions
instein+2.259
oston+2.233
kefeller+2.184
OF+2.182
artney+2.162
Neg logit contributions
 apologizing-1.848
 BG-1.820
 Ange-1.766
 Ge-1.741
 adjourn-1.740
Token64
Token ID2414
Feature activation+7.54e-01

Pos logit contributions
instein+1.933
oston+1.913
kefeller+1.897
OF+1.868
artney+1.867
Neg logit contributions
 apologizing-1.481
 BG-1.467
 Ange-1.410
 Ge-1.400
 adjourn-1.399
Token406
Token ID29703
Feature activation+1.17e+00

Pos logit contributions
instein+0.719
oston+0.712
kefeller+0.710
artney+0.702
OF+0.693
Neg logit contributions
 apologizing-0.399
 BG-0.397
 Ange-0.379
 Ge-0.378
 adjourn-0.373
Token Mem
Token ID4942
Feature activation+3.74e+00

Pos logit contributions
instein+1.283
oston+1.272
kefeller+1.266
artney+1.255
OF+1.243
Neg logit contributions
 apologizing-0.449
 BG-0.443
 Ange-0.415
 Ge-0.413
 adjourn-0.409
Tokenc
Token ID66
Feature activation+3.82e+00

Pos logit contributions
instein+2.995
oston+2.961
kefeller+2.896
artney+2.871
UTC+2.849
Neg logit contributions
 apologizing-2.562
 BG-2.531
 Ange-2.443
 adjourn-2.397
 Ge-2.395
Tokenpy
Token ID9078
Feature activation+5.32e+00

Pos logit contributions
instein+2.969
oston+2.943
kefeller+2.882
artney+2.839
UTC+2.822
Neg logit contributions
 apologizing-2.710
 BG-2.658
 Ange-2.601
 adjourn-2.576
 Ge-2.541
TokenReading
Token ID36120
Feature activation+1.08e+01

Pos logit contributions
instein+3.857
oston+3.850
OF+3.843
UTC+3.754
kefeller+3.719
Neg logit contributions
 apologizing-3.896
 BG-3.853
 Ange-3.797
 adjourn-3.764
 Ge-3.711
Token 55
Token ID5996
Feature activation+4.62e+00

Pos logit contributions
instein+7.833
oston+7.790
kefeller+7.710
artney+7.573
OF+7.515
Neg logit contributions
 apologizing-7.488
 Ange-7.110
 BG-7.010
 adjourn-6.913
 Ge-6.850
Token80
Token ID1795
Feature activation+1.54e+00

Pos logit contributions
instein+4.791
oston+4.761
OF+4.728
UTC+4.690
kefeller+4.682
Neg logit contributions
 apologizing-1.995
 BG-1.936
 Ange-1.853
 Ge-1.796
 adjourn-1.785
Token953
Token ID49649
Feature activation+3.84e+00

Pos logit contributions
instein+0.988
oston+0.970
kefeller+0.970
artney+0.952
OF+0.935
Neg logit contributions
 apologizing-1.296
 BG-1.290
 Ge-1.256
 Ange-1.254
 adjourn-1.244
Token 16
Token ID1467
Feature activation+2.26e+00

Pos logit contributions
instein+3.438
oston+3.425
kefeller+3.394
artney+3.346
OF+3.335
Neg logit contributions
 apologizing-2.221
 BG-2.149
 Ange-2.071
 Ge-2.067
 adjourn-2.057
Token27
Token ID1983
Feature activation+7.79e-01

Pos logit contributions
instein+1.876
oston+1.854
kefeller+1.824
OF+1.808
artney+1.807
Neg logit contributions
 apologizing-1.468
 BG-1.448
 Ange-1.400
 Ge-1.387
 adjourn-1.381
Token better
Token ID1365
Feature activation+5.17e-01

Pos logit contributions
 apologizing+0.360
 BG+0.358
 Ge+0.336
 Ange+0.335
 adjourn+0.331
Neg logit contributions
instein-0.920
oston-0.911
kefeller-0.911
artney-0.900
OF-0.890
Token.
Token ID13
Feature activation-2.27e+00

Pos logit contributions
instein+0.456
oston+0.450
kefeller+0.449
artney+0.444
OF+0.438
Neg logit contributions
 apologizing-0.310
 BG-0.308
 Ge-0.296
 Ange-0.295
 adjourn-0.292
Token<|endoftext|>
Token ID50256
Feature activation+7.34e-01

Pos logit contributions
 BG+1.388
 apologizing+1.366
 Ge+1.339
 Ange+1.328
Ge+1.298
Neg logit contributions
instein-1.987
kefeller-1.973
oston-1.959
artney-1.942
OF-1.888
TokenFacebook
Token ID12025
Feature activation-5.74e-01

Pos logit contributions
instein+0.568
oston+0.560
kefeller+0.558
artney+0.550
OF+0.543
Neg logit contributions
 apologizing-0.520
 BG-0.518
 Ange-0.499
 Ge-0.499
 adjourn-0.495
Token Twitter
Token ID3009
Feature activation+4.59e+00

Pos logit contributions
 apologizing+0.361
 BG+0.359
 Ge+0.346
 Ange+0.344
 adjourn+0.340
Neg logit contributions
instein-0.491
kefeller-0.487
oston-0.486
artney-0.480
OF-0.472
Token Reddit
Token ID10750
Feature activation+1.04e+01

Pos logit contributions
instein+2.946
oston+2.881
kefeller+2.827
artney+2.770
OF+2.761
Neg logit contributions
 apologizing-3.788
 BG-3.735
 adjourn-3.664
 Ange-3.650
iot-3.614
Token Google
Token ID3012
Feature activation+6.24e+00

Pos logit contributions
oston+6.534
instein+6.491
UTC+6.422
 Levin+6.409
OF+6.396
Neg logit contributions
 apologizing-8.206
 adjourn-7.841
cipline-7.824
iot-7.801
 BG-7.793
Token+
Token ID10
Feature activation+8.75e+00

Pos logit contributions
instein+4.989
oston+4.915
kefeller+4.860
UTC+4.809
artney+4.794
Neg logit contributions
 apologizing-4.158
 BG-4.045
 adjourn-3.993
iot-3.930
 Ange-3.877
Token
Token ID198
Feature activation+6.67e-01

Pos logit contributions
instein+6.243
artney+6.079
 Levin+6.060
UTC+6.053
oston+6.034
Neg logit contributions
 apologizing-6.214
iot-6.001
 adjourn-5.958
 BG-5.949
cipline-5.916
Token
Token ID198
Feature activation-7.17e-01

Pos logit contributions
instein+0.563
oston+0.556
kefeller+0.554
artney+0.547
OF+0.540
Neg logit contributions
 apologizing-0.425
 BG-0.423
 Ange-0.406
 Ge-0.406
 adjourn-0.402
TokenGo
Token ID5247
Feature activation-8.07e-02

Pos logit contributions
 apologizing+0.505
 BG+0.503
 Ge+0.488
 Ange+0.487
Ge+0.484
Neg logit contributions
instein-0.556
kefeller-0.551
oston-0.548
artney-0.542
OF-0.526
Token ON
Token ID6177
Feature activation-1.45e+00

Pos logit contributions
 BG+0.577
 apologizing+0.575
 Ge+0.542
 Ange+0.541
 adjourn+0.532
Neg logit contributions
instein-1.294
kefeller-1.280
oston-1.277
artney-1.265
OF-1.241
Token ALL
Token ID11096
Feature activation-1.86e+00

Pos logit contributions
 BG+0.948
 apologizing+0.945
 Ge+0.909
 Ange+0.905
 adjourn+0.897
Neg logit contributions
instein-1.206
kefeller-1.190
oston-1.188
artney-1.172
OF-1.146
Token PERSON
Token ID46740
Feature activation+2.62e+00

Pos logit contributions
 BG+1.049
 apologizing+1.039
 Ge+0.998
 Ange+0.990
 adjourn+0.978
Neg logit contributions
instein-1.723
kefeller-1.705
oston-1.696
artney-1.675
OF-1.634
TokenEL
Token ID3698
Feature activation-1.07e+00

Pos logit contributions
instein+1.345
oston+1.333
OF+1.328
kefeller+1.309
artney+1.306
Neg logit contributions
 apologizing-2.496
 BG-2.448
 Ange-2.401
 adjourn-2.397
 Ge-2.384
Token UPDATE
Token ID35717
Feature activation+2.65e-01

Pos logit contributions
 BG+0.613
 apologizing+0.612
 Ge+0.584
 Ange+0.584
 adjourn+0.576
Neg logit contributions
instein-0.971
oston-0.959
kefeller-0.959
artney-0.946
OF-0.930
Token:
Token ID25
Feature activation+1.01e+01

Pos logit contributions
instein+0.243
oston+0.241
kefeller+0.239
artney+0.237
OF+0.235
Neg logit contributions
 apologizing-0.149
 BG-0.147
 Ge-0.141
 Ange-0.141
 adjourn-0.140
Token 05
Token ID8870
Feature activation+2.50e+00

Pos logit contributions
instein+6.060
oston+5.983
kefeller+5.851
OF+5.850
 Levin+5.812
Neg logit contributions
 apologizing-8.120
 BG-7.787
Ge-7.714
 Ange-7.707
cipline-7.698
Token45
Token ID2231
Feature activation-1.45e+00

Pos logit contributions
instein+2.415
oston+2.392
OF+2.367
kefeller+2.361
artney+2.346
Neg logit contributions
 apologizing-1.272
 BG-1.235
 adjourn-1.193
 Ange-1.191
 Ge-1.174
Tokenz
Token ID89
Feature activation+2.04e+00

Pos logit contributions
 BG+0.840
 apologizing+0.836
 Ge+0.806
 Ange+0.801
Ge+0.790
Neg logit contributions
instein-1.307
kefeller-1.304
oston-1.291
artney-1.288
OF-1.256
Token K
Token ID509
Feature activation-7.59e-01

Pos logit contributions
instein+1.752
oston+1.738
kefeller+1.710
OF+1.694
artney+1.694
Neg logit contributions
 apologizing-1.260
 BG-1.233
 Ange-1.197
 Ge-1.190
 adjourn-1.188
TokenUSH
Token ID27143
Feature activation-2.08e+00

Pos logit contributions
 BG+0.520
 apologizing+0.519
 Ge+0.503
 Ange+0.498
 adjourn+0.495
Neg logit contributions
instein-0.603
kefeller-0.597
oston-0.591
artney-0.585
OF-0.567
Token ally
Token ID12525
Feature activation+1.37e+00

Pos logit contributions
instein+0.689
oston+0.657
kefeller+0.645
artney+0.630
OF+0.625
Neg logit contributions
 apologizing-1.332
 BG-1.331
 Ge-1.285
 Ange-1.284
 adjourn-1.283
Token in
Token ID287
Feature activation+2.51e-01

Pos logit contributions
instein+1.128
oston+1.112
kefeller+1.104
artney+1.097
OF+1.083
Neg logit contributions
 apologizing-0.903
 BG-0.900
 Ange-0.864
 Ge-0.863
 adjourn-0.858
Token charge
Token ID3877
Feature activation+6.03e+00

Pos logit contributions
instein+0.214
kefeller+0.213
oston+0.212
artney+0.210
OF+0.207
Neg logit contributions
 apologizing-0.162
 BG-0.159
 Ge-0.154
 Ange-0.153
 adjourn-0.151
Token of
Token ID286
Feature activation+2.85e+00

Pos logit contributions
instein+4.826
oston+4.691
kefeller+4.575
OF+4.519
 Levin+4.487
Neg logit contributions
 BG-4.026
 apologizing-3.975
 adjourn-3.846
 Ange-3.830
 Ge-3.815
Token Cohen
Token ID17154
Feature activation+3.51e+00

Pos logit contributions
instein+2.315
oston+2.249
kefeller+2.229
artney+2.170
OF+2.162
Neg logit contributions
 BG-1.933
 apologizing-1.909
 Ange-1.876
 adjourn-1.853
iot-1.820
Token probe
Token ID12774
Feature activation+1.01e+01

Pos logit contributions
instein+2.953
 Levin+2.765
kefeller+2.757
oston+2.739
artney+2.736
Neg logit contributions
 adjourn-2.249
cipline-2.242
 BG-2.230
 apologizing-2.220
iot-2.176
Token:
Token ID25
Feature activation+4.03e+00

Pos logit contributions
oston+6.615
instein+6.508
 Levin+6.265
 Dolphin+6.082
OF+6.075
Neg logit contributions
 apologizing-7.961
 BG-7.877
cipline-7.708
 Ange-7.678
 adjourn-7.651
Token report
Token ID989
Feature activation+3.42e+00

Pos logit contributions
instein+2.856
 Dolphin+2.692
oston+2.653
OF+2.641
 Levin+2.596
Neg logit contributions
 apologizing-3.108
 adjourn-3.075
 BG-3.071
Ge-2.942
 Ge-2.941
Token Vietnamese
Token ID23618
Feature activation+4.29e+00

Pos logit contributions
instein+2.518
oston+2.487
kefeller+2.431
OF+2.407
 Levin+2.400
Neg logit contributions
 apologizing-2.528
 BG-2.489
 adjourn-2.421
iot-2.400
 Ange-2.399
Token airline
Token ID18091
Feature activation+5.41e+00

Pos logit contributions
oston+3.417
instein+3.397
kefeller+3.334
OF+3.297
artney+3.268
Neg logit contributions
 BG-2.896
 apologizing-2.826
 Ange-2.812
cipline-2.734
 Ge-2.713
Token takes
Token ID2753
Feature activation+3.43e+00

Pos logit contributions
instein+4.159
oston+4.091
 Dolphin+3.896
kefeller+3.881
UTC+3.869
Neg logit contributions
 BG-3.632
cipline-3.521
 Ange-3.495
 apologizing-3.486
iot-3.480
Token |
Token ID930
Feature activation+3.36e+00

Pos logit contributions
instein+3.930
oston+3.896
kefeller+3.851
artney+3.812
OF+3.803
Neg logit contributions
 apologizing-3.681
 BG-3.631
 Ange-3.509
 Ge-3.491
iot-3.483
Token count
Token ID954
Feature activation+1.68e+00

Pos logit contributions
instein+2.694
oston+2.652
kefeller+2.639
artney+2.610
OF+2.575
Neg logit contributions
 apologizing-2.271
 BG-2.255
 Ange-2.171
 Ge-2.166
 adjourn-2.146
Token |
Token ID930
Feature activation+3.86e+00

Pos logit contributions
instein+1.346
oston+1.329
kefeller+1.326
artney+1.308
OF+1.285
Neg logit contributions
 apologizing-1.148
 BG-1.145
 Ge-1.104
 Ange-1.103
 adjourn-1.093
Token |
Token ID930
Feature activation+2.17e+00

Pos logit contributions
instein+3.152
oston+3.097
kefeller+3.080
artney+3.048
OF+3.019
Neg logit contributions
 apologizing-2.529
 BG-2.509
 Ange-2.416
 Ge-2.401
iot-2.385
Token ----
Token ID13498
Feature activation-1.81e+00

Pos logit contributions
instein+2.058
oston+2.031
kefeller+2.024
artney+2.004
OF+1.982
Neg logit contributions
 apologizing-1.159
 BG-1.150
 Ange-1.097
 Ge-1.093
 adjourn-1.080
Token-+
Token ID19529
Feature activation+1.01e+01

Pos logit contributions
 BG+1.194
 apologizing+1.190
 Ge+1.157
 Ange+1.151
 adjourn+1.137
Neg logit contributions
instein-1.484
kefeller-1.483
oston-1.469
artney-1.452
OF-1.408
Token--------------------------------
Token ID3880
Feature activation+2.66e+00

Pos logit contributions
instein+5.415
oston+5.327
OF+5.251
UTC+5.137
artney+5.093
Neg logit contributions
 apologizing-8.716
 BG-8.705
cipline-8.364
 Ange-8.353
iot-8.327
Token+
Token ID10
Feature activation+8.32e+00

Pos logit contributions
instein+1.897
oston+1.873
kefeller+1.854
artney+1.832
OF+1.813
Neg logit contributions
 apologizing-2.038
 BG-2.024
 Ange-1.955
 Ge-1.952
 adjourn-1.936
Token-------
Token ID26866
Feature activation+3.76e+00

Pos logit contributions
instein+5.106
oston+5.023
OF+4.892
artney+4.877
UTC+4.852
Neg logit contributions
 apologizing-6.794
 BG-6.751
 Ange-6.496
cipline-6.459
 Ge-6.448
Token|
Token ID91
Feature activation+5.48e+00

Pos logit contributions
instein+2.633
oston+2.596
kefeller+2.566
artney+2.526
OF+2.524
Neg logit contributions
 apologizing-2.904
 BG-2.883
 Ange-2.782
 Ge-2.775
 adjourn-2.765
Token |
Token ID930
Feature activation+8.10e-01

Pos logit contributions
instein+4.156
oston+4.099
kefeller+4.034
artney+4.026
OF+3.996
Neg logit contributions
 apologizing-3.868
 BG-3.823
 Ange-3.684
 Ge-3.671
iot-3.642
Token ally
Token ID12525
Feature activation+1.91e+00

Pos logit contributions
instein+1.532
oston+1.482
kefeller+1.455
OF+1.438
artney+1.434
Neg logit contributions
 apologizing-1.545
 BG-1.539
 adjourn-1.474
 Ange-1.468
 Ge-1.468
Token in
Token ID287
Feature activation+6.19e-01

Pos logit contributions
instein+1.614
oston+1.593
kefeller+1.584
artney+1.570
OF+1.549
Neg logit contributions
 apologizing-1.217
 BG-1.212
 Ange-1.163
 Ge-1.163
 adjourn-1.153
Token charge
Token ID3877
Feature activation+6.01e+00

Pos logit contributions
instein+0.511
kefeller+0.507
oston+0.505
artney+0.500
OF+0.491
Neg logit contributions
 apologizing-0.412
 BG-0.406
 Ge-0.393
 Ange-0.392
 adjourn-0.387
Token of
Token ID286
Feature activation+3.18e+00

Pos logit contributions
instein+4.907
oston+4.789
kefeller+4.685
OF+4.628
 Levin+4.579
Neg logit contributions
 BG-3.943
 apologizing-3.888
 adjourn-3.757
 Ange-3.757
 Ge-3.741
Token Cohen
Token ID17154
Feature activation+3.31e+00

Pos logit contributions
instein+2.555
oston+2.508
kefeller+2.484
artney+2.430
OF+2.430
Neg logit contributions
 BG-2.169
 apologizing-2.147
 Ange-2.093
 adjourn-2.069
cipline-2.033
Token probe
Token ID12774
Feature activation+9.91e+00

Pos logit contributions
instein+2.795
kefeller+2.647
artney+2.647
oston+2.640
OF+2.630
Neg logit contributions
 BG-2.083
 apologizing-2.075
 adjourn-2.070
cipline-2.053
 Ange-2.016
Token:
Token ID25
Feature activation+4.80e+00

Pos logit contributions
oston+6.128
instein+6.076
 Levin+5.846
kefeller+5.642
OF+5.635
Neg logit contributions
 apologizing-8.208
 BG-8.064
cipline-7.939
 Ange-7.883
 adjourn-7.876
Token report
Token ID989
Feature activation+3.98e+00

Pos logit contributions
instein+3.423
 Dolphin+3.263
OF+3.195
oston+3.172
UTC+3.147
Neg logit contributions
 apologizing-3.614
 BG-3.608
 adjourn-3.602
 Ge-3.470
 Ange-3.457
Token Vietnamese
Token ID23618
Feature activation+4.51e+00

Pos logit contributions
instein+2.920
oston+2.871
kefeller+2.844
OF+2.787
 Levin+2.779
Neg logit contributions
 apologizing-2.937
 BG-2.895
 adjourn-2.819
iot-2.806
 Ange-2.791
Token airline
Token ID18091
Feature activation+4.82e+00

Pos logit contributions
instein+3.524
oston+3.519
kefeller+3.462
OF+3.405
artney+3.390
Neg logit contributions
 BG-3.101
 apologizing-3.053
 Ange-3.023
cipline-2.941
 Ge-2.932
Token takes
Token ID2753
Feature activation+3.75e+00

Pos logit contributions
instein+3.795
oston+3.745
kefeller+3.606
 Dolphin+3.557
artney+3.532
Neg logit contributions
 BG-3.164
 apologizing-3.090
cipline-3.063
 Ange-3.040
iot-3.031
Token:
Token ID25
Feature activation-1.41e+00

Pos logit contributions
instein+2.265
oston+2.243
kefeller+2.233
artney+2.200
OF+2.181
Neg logit contributions
 BG-1.449
 apologizing-1.448
 Ange-1.395
 Ge-1.373
 adjourn-1.364
Token "
Token ID366
Feature activation+5.20e-02

Pos logit contributions
 apologizing+0.844
 BG+0.842
 Ge+0.812
 Ange+0.808
 adjourn+0.790
Neg logit contributions
instein-1.249
oston-1.233
kefeller-1.229
artney-1.216
OF-1.201
Token9
Token ID24
Feature activation+3.76e+00

Pos logit contributions
instein+0.041
oston+0.041
kefeller+0.041
artney+0.040
OF+0.039
Neg logit contributions
 apologizing-0.036
 BG-0.036
 Ge-0.034
 Ange-0.034
 adjourn-0.034
Token-
Token ID12
Feature activation+7.72e+00

Pos logit contributions
instein+3.185
kefeller+3.156
oston+3.152
artney+3.076
OF+3.065
Neg logit contributions
 apologizing-2.368
 BG-2.340
 adjourn-2.255
 Ange-2.220
 Ge-2.207
Tokento
Token ID1462
Feature activation+8.36e+00

Pos logit contributions
instein+5.345
kefeller+5.299
oston+5.205
OF+5.161
artney+5.078
Neg logit contributions
 apologizing-5.934
 BG-5.847
 adjourn-5.684
 Ange-5.618
 Ge-5.522
Token-
Token ID12
Feature activation+9.65e+00

Pos logit contributions
instein+6.310
oston+6.297
OF+6.142
UTC+6.139
kefeller+6.056
Neg logit contributions
 apologizing-5.824
 BG-5.542
 adjourn-5.420
 Ange-5.346
iot-5.270
Token5
Token ID20
Feature activation+4.91e+00

Pos logit contributions
instein+5.667
oston+5.590
OF+5.564
UTC+5.520
kefeller+5.443
Neg logit contributions
 apologizing-8.367
 BG-8.106
 adjourn-7.977
 Ange-7.941
 substitutes-7.742
Token Working
Token ID14594
Feature activation+8.84e-01

Pos logit contributions
instein+2.889
oston+2.874
kefeller+2.868
artney+2.766
OF+2.711
Neg logit contributions
 apologizing-4.328
 BG-4.293
 adjourn-4.206
 Ange-4.181
 Ge-4.150
Token Women
Token ID6926
Feature activation-2.08e+00

Pos logit contributions
instein+0.846
oston+0.837
kefeller+0.835
artney+0.825
OF+0.815
Neg logit contributions
 apologizing-0.465
 BG-0.462
 Ge-0.440
 Ange-0.440
 adjourn-0.434
Token Education
Token ID7868
Feature activation+3.69e+00

Pos logit contributions
 BG+1.098
 apologizing+1.082
 Ge+1.060
 Ange+1.043
Ge+1.015
Neg logit contributions
instein-1.982
oston-1.963
kefeller-1.960
artney-1.944
OF-1.915
Token Fund
Token ID7557
Feature activation+4.56e+00

Pos logit contributions
instein+2.712
oston+2.671
kefeller+2.662
artney+2.612
OF+2.607
Neg logit contributions
 apologizing-2.727
 BG-2.683
 adjourn-2.611
 Ange-2.607
 Ge-2.576
Token ally
Token ID12525
Feature activation+1.65e+00

Pos logit contributions
instein+1.085
oston+1.052
kefeller+1.040
artney+1.024
OF+1.016
Neg logit contributions
 apologizing-1.060
 BG-1.056
 adjourn-1.010
 Ange-1.008
 Ge-1.008
Token in
Token ID287
Feature activation+1.75e-01

Pos logit contributions
instein+1.397
oston+1.380
kefeller+1.376
artney+1.359
OF+1.340
Neg logit contributions
 apologizing-1.042
 BG-1.037
 Ge-0.996
 Ange-0.996
 adjourn-0.985
Token charge
Token ID3877
Feature activation+5.63e+00

Pos logit contributions
instein+0.154
kefeller+0.154
oston+0.153
artney+0.152
OF+0.150
Neg logit contributions
 apologizing-0.107
 BG-0.105
 Ge-0.101
 Ange-0.101
 adjourn-0.100
Token of
Token ID286
Feature activation+3.22e+00

Pos logit contributions
instein+4.608
oston+4.495
kefeller+4.414
OF+4.348
artney+4.318
Neg logit contributions
 BG-3.680
 apologizing-3.651
 adjourn-3.510
 Ange-3.509
 Ge-3.489
Token Cohen
Token ID17154
Feature activation+3.33e+00

Pos logit contributions
instein+2.701
oston+2.639
kefeller+2.625
artney+2.570
OF+2.553
Neg logit contributions
 BG-2.078
 apologizing-2.064
 Ange-2.006
 adjourn-1.983
cipline-1.952
Token probe
Token ID12774
Feature activation+9.60e+00

Pos logit contributions
instein+2.796
oston+2.648
kefeller+2.645
artney+2.642
OF+2.622
Neg logit contributions
 BG-2.120
 apologizing-2.113
 adjourn-2.101
cipline-2.084
 Ange-2.051
Token:
Token ID25
Feature activation+4.52e+00

Pos logit contributions
oston+5.685
instein+5.628
 Levin+5.402
artney+5.265
kefeller+5.256
Neg logit contributions
 apologizing-8.237
 BG-8.090
cipline-7.977
 Ange-7.919
 adjourn-7.884
Token report
Token ID989
Feature activation+3.50e+00

Pos logit contributions
instein+3.226
 Dolphin+3.019
oston+2.945
OF+2.933
UTC+2.890
Neg logit contributions
 adjourn-3.486
 apologizing-3.471
 BG-3.451
 Ge-3.328
cipline-3.318
Token Vietnamese
Token ID23618
Feature activation+4.58e+00

Pos logit contributions
instein+2.512
oston+2.474
kefeller+2.432
 Levin+2.391
OF+2.388
Neg logit contributions
 apologizing-2.647
 BG-2.597
 adjourn-2.535
iot-2.528
 Ange-2.514
Token airline
Token ID18091
Feature activation+5.35e+00

Pos logit contributions
oston+3.606
instein+3.592
kefeller+3.523
OF+3.478
artney+3.447
Neg logit contributions
 BG-3.118
 apologizing-3.050
 Ange-3.037
cipline-2.959
 Ge-2.923
Token takes
Token ID2753
Feature activation+3.84e+00

Pos logit contributions
instein+4.364
oston+4.306
kefeller+4.114
 Dolphin+4.110
 Levin+4.071
Neg logit contributions
 BG-3.330
cipline-3.233
 apologizing-3.227
iot-3.205
 Ange-3.203
TokenG
Token ID38
Feature activation-1.43e+00

Pos logit contributions
 BG+1.037
 apologizing+1.024
 Ge+0.987
 Ange+0.972
Ge+0.969
Neg logit contributions
instein-2.145
kefeller-2.124
oston-2.114
artney-2.097
OF-2.034
TokenED
Token ID1961
Feature activation+9.78e-01

Pos logit contributions
 BG+1.001
 apologizing+0.996
 Ge+0.964
 Ange+0.957
 adjourn+0.950
Neg logit contributions
instein-1.119
kefeller-1.106
oston-1.103
artney-1.091
OF-1.057
Token NET
Token ID30502
Feature activation-4.34e+00

Pos logit contributions
instein+0.817
oston+0.812
kefeller+0.802
artney+0.793
OF+0.792
Neg logit contributions
 apologizing-0.625
 BG-0.612
 Ange-0.596
 Ge-0.593
 adjourn-0.590
Token CALL
Token ID42815
Feature activation-2.23e+00

Pos logit contributions
 BG+2.011
 apologizing+1.926
 Ge+1.875
 Ange+1.837
 adjourn+1.794
Neg logit contributions
instein-4.558
kefeller-4.532
oston-4.480
artney-4.462
OF-4.244
Token UPDATE
Token ID35717
Feature activation+1.10e+00

Pos logit contributions
 BG+1.093
 apologizing+1.080
 Ge+1.025
 Ange+1.023
 adjourn+1.005
Neg logit contributions
instein-2.235
kefeller-2.210
oston-2.203
artney-2.186
OF-2.128
Token:
Token ID25
Feature activation+9.56e+00

Pos logit contributions
instein+1.011
oston+1.001
kefeller+0.992
OF+0.981
artney+0.980
Neg logit contributions
 apologizing-0.619
 BG-0.608
 Ange-0.584
 Ge-0.581
 adjourn-0.578
Token 05
Token ID8870
Feature activation+5.38e+00

Pos logit contributions
instein+5.196
oston+5.122
kefeller+5.042
artney+5.010
OF+4.960
Neg logit contributions
 apologizing-8.316
 BG-8.040
Ge-7.965
 Ange-7.959
 Ge-7.953
Token50
Token ID1120
Feature activation-2.88e+00

Pos logit contributions
instein+4.795
OF+4.779
oston+4.748
UTC+4.720
kefeller+4.708
Neg logit contributions
 apologizing-3.054
 BG-2.899
 adjourn-2.878
 Ange-2.870
iot-2.781
Tokenz
Token ID89
Feature activation+2.28e+00

Pos logit contributions
 BG+1.606
 apologizing+1.583
 Ge+1.546
 Ange+1.528
Ge+1.517
Neg logit contributions
kefeller-2.646
instein-2.631
artney-2.619
oston-2.599
OF-2.531
Token VOL
Token ID38570
Feature activation-3.50e+00

Pos logit contributions
instein+1.962
oston+1.946
kefeller+1.916
OF+1.900
artney+1.895
Neg logit contributions
 apologizing-1.391
 BG-1.358
 Ange-1.318
 adjourn-1.312
 Ge-1.308
TokenUN
Token ID4944
Feature activation-4.09e+00

Pos logit contributions
 BG+2.838
 apologizing+2.790
 Ge+2.733
 Ange+2.708
Ge+2.706
Neg logit contributions
instein-2.393
kefeller-2.347
oston-2.332
artney-2.313
OF-2.168
Token 2
Token ID362
Feature activation+3.69e+00

Pos logit contributions
instein+3.267
oston+3.226
OF+3.188
artney+3.156
kefeller+3.151
Neg logit contributions
 apologizing-2.474
 BG-2.431
 Ange-2.340
 Ge-2.330
 adjourn-2.323
Token 3
Token ID513
Feature activation+4.32e+00

Pos logit contributions
instein+2.982
oston+2.964
kefeller+2.889
artney+2.871
OF+2.868
Neg logit contributions
 apologizing-2.465
 BG-2.436
 Ange-2.345
 adjourn-2.334
 Ge-2.331
Token void
Token ID7951
Feature activation-5.86e-01

Pos logit contributions
instein+3.363
oston+3.334
OF+3.223
kefeller+3.214
artney+3.183
Neg logit contributions
 apologizing-2.968
 BG-2.908
 Ange-2.837
 adjourn-2.819
iot-2.816
Token vert
Token ID9421
Feature activation+3.50e+00

Pos logit contributions
 BG+0.237
 apologizing+0.234
 Ge+0.223
 Ange+0.219
 adjourn+0.217
Neg logit contributions
instein-0.632
oston-0.627
kefeller-0.627
artney-0.623
OF-0.613
Token (
Token ID357
Feature activation+6.36e+00

Pos logit contributions
instein+2.884
oston+2.847
kefeller+2.807
artney+2.771
OF+2.762
Neg logit contributions
 apologizing-2.308
 BG-2.254
 Ange-2.179
 adjourn-2.173
 Ge-2.161
Token in
Token ID287
Feature activation+9.38e+00

Pos logit contributions
instein+4.826
oston+4.777
artney+4.583
OF+4.549
kefeller+4.542
Neg logit contributions
 apologizing-4.479
 BG-4.401
 Ange-4.333
cipline-4.254
Ge-4.250
Tokenout
Token ID448
Feature activation+2.04e+00

Pos logit contributions
oston+5.460
instein+5.401
OF+4.990
artney+4.913
kefeller+4.714
Neg logit contributions
 apologizing-8.101
 BG-8.000
 Ange-7.989
Ge-7.893
 Ge-7.862
Token app
Token ID598
Feature activation+3.66e+00

Pos logit contributions
instein+1.853
oston+1.843
kefeller+1.808
artney+1.788
OF+1.782
Neg logit contributions
 apologizing-1.150
 BG-1.139
 Ange-1.094
 Ge-1.087
Ge-1.080
Tokendata
Token ID7890
Feature activation+4.96e+00

Pos logit contributions
oston+2.526
instein+2.524
kefeller+2.429
OF+2.371
artney+2.358
Neg logit contributions
 apologizing-2.932
 BG-2.868
 Ange-2.824
 adjourn-2.793
 Ge-2.773
Token _
Token ID4808
Feature activation+3.09e-01

Pos logit contributions
oston+3.435
instein+3.415
OF+3.400
UTC+3.337
kefeller+3.274
Neg logit contributions
 apologizing-3.827
 BG-3.635
 adjourn-3.580
 Ange-3.577
iot-3.536
Token full
Token ID1336
Feature activation-6.04e-01

Pos logit contributions
instein+0.283
oston+0.280
kefeller+0.279
artney+0.276
OF+0.272
Neg logit contributions
 apologizing-0.175
 BG-0.175
 Ge-0.167
 Ange-0.167
 adjourn-0.165
TokenG
Token ID38
Feature activation-1.14e+00

Pos logit contributions
 BG+0.804
 apologizing+0.798
 Ge+0.765
 Ange+0.757
Ge+0.748
Neg logit contributions
instein-1.625
kefeller-1.608
oston-1.603
artney-1.588
OF-1.550
TokenED
Token ID1961
Feature activation+8.14e-01

Pos logit contributions
 BG+0.803
 apologizing+0.801
 Ge+0.774
 Ange+0.769
 adjourn+0.763
Neg logit contributions
instein-0.891
kefeller-0.880
oston-0.878
artney-0.867
OF-0.842
Token NET
Token ID30502
Feature activation-5.40e+00

Pos logit contributions
instein+0.684
oston+0.678
kefeller+0.671
artney+0.664
OF+0.661
Neg logit contributions
 apologizing-0.517
 BG-0.508
 Ange-0.494
 Ge-0.491
 adjourn-0.489
Token CALL
Token ID42815
Feature activation-3.09e+00

Pos logit contributions
 BG+2.447
 apologizing+2.294
 Ge+2.275
 Ange+2.204
 adjourn+2.136
Neg logit contributions
instein-5.783
kefeller-5.765
artney-5.675
oston-5.669
etsk-5.370
Token UPDATE
Token ID35717
Feature activation+7.75e-01

Pos logit contributions
 BG+1.497
 apologizing+1.463
 Ge+1.392
 Ange+1.389
 adjourn+1.360
Neg logit contributions
instein-3.117
kefeller-3.084
oston-3.064
artney-3.052
OF-2.946
Token:
Token ID25
Feature activation+9.37e+00

Pos logit contributions
instein+0.717
oston+0.710
kefeller+0.705
artney+0.697
OF+0.694
Neg logit contributions
 apologizing-0.430
 BG-0.425
 Ange-0.406
 Ge-0.406
 adjourn-0.402
Token 05
Token ID8870
Feature activation+2.83e-01

Pos logit contributions
instein+5.181
oston+5.105
kefeller+5.048
artney+4.977
OF+4.954
Neg logit contributions
 apologizing-8.093
 BG-7.829
 Ange-7.715
Ge-7.715
 Ge-7.709
Token43
Token ID3559
Feature activation-1.45e+00

Pos logit contributions
instein+0.284
oston+0.281
kefeller+0.280
artney+0.277
OF+0.275
Neg logit contributions
 apologizing-0.135
 BG-0.134
 Ange-0.127
 Ge-0.127
 adjourn-0.125
Tokenz
Token ID89
Feature activation+1.39e+00

Pos logit contributions
 BG+0.835
 apologizing+0.834
 Ge+0.803
 Ange+0.797
Ge+0.787
Neg logit contributions
instein-1.308
kefeller-1.305
oston-1.292
artney-1.290
OF-1.260
Token ST
Token ID3563
Feature activation-8.85e-01

Pos logit contributions
instein+1.211
oston+1.199
kefeller+1.187
artney+1.174
OF+1.168
Neg logit contributions
 apologizing-0.850
 BG-0.836
 Ange-0.809
 Ge-0.805
 adjourn-0.801
TokenOW
Token ID3913
Feature activation+7.56e-01

Pos logit contributions
 BG+0.667
 apologizing+0.667
 Ge+0.644
 Ange+0.643
 adjourn+0.637
Neg logit contributions
instein-0.645
oston-0.635
kefeller-0.634
artney-0.624
OF-0.607
TokenG
Token ID38
Feature activation-1.92e+00

Pos logit contributions
 BG+1.023
 apologizing+1.008
 Ge+0.974
 Ange+0.959
Ge+0.956
Neg logit contributions
instein-2.120
kefeller-2.101
oston-2.089
artney-2.073
OF-2.015
TokenED
Token ID1961
Feature activation-6.72e-01

Pos logit contributions
 BG+1.315
 apologizing+1.298
 Ge+1.265
 Ange+1.248
 adjourn+1.242
Neg logit contributions
instein-1.547
kefeller-1.542
oston-1.529
artney-1.518
OF-1.448
Token NET
Token ID30502
Feature activation-5.45e+00

Pos logit contributions
 apologizing+0.430
 BG+0.430
 Ge+0.412
 Ange+0.411
 adjourn+0.407
Neg logit contributions
instein-0.567
oston-0.560
kefeller-0.560
artney-0.552
OF-0.542
Token CALL
Token ID42815
Feature activation-2.50e+00

Pos logit contributions
 BG+2.586
 apologizing+2.493
 Ge+2.410
 Ange+2.359
 adjourn+2.321
Neg logit contributions
instein-5.598
kefeller-5.518
oston-5.468
artney-5.451
OF-5.209
Token UPDATE
Token ID35717
Feature activation+1.57e+00

Pos logit contributions
 BG+1.489
 apologizing+1.465
 Ge+1.408
 Ange+1.402
 adjourn+1.383
Neg logit contributions
instein-2.243
kefeller-2.224
oston-2.201
artney-2.196
OF-2.113
Token:
Token ID25
Feature activation+9.26e+00

Pos logit contributions
instein+1.413
oston+1.405
kefeller+1.390
OF+1.374
artney+1.368
Neg logit contributions
 apologizing-0.898
 BG-0.879
 Ange-0.851
 Ge-0.848
 adjourn-0.836
Token 05
Token ID8870
Feature activation-4.69e-01

Pos logit contributions
instein+5.526
oston+5.446
kefeller+5.350
OF+5.333
artney+5.283
Neg logit contributions
 apologizing-7.609
 BG-7.356
 Ange-7.258
cipline-7.214
Ge-7.206
Token41
Token ID3901
Feature activation-3.47e+00

Pos logit contributions
 BG+0.237
 apologizing+0.236
 Ge+0.226
 Ange+0.223
Ge+0.221
Neg logit contributions
instein-0.459
kefeller-0.454
oston-0.453
artney-0.449
OF-0.438
Tokenz
Token ID89
Feature activation+1.39e+00

Pos logit contributions
 BG+1.908
 apologizing+1.875
 Ge+1.830
 Ange+1.806
Ge+1.785
Neg logit contributions
kefeller-3.233
instein-3.191
artney-3.186
oston-3.153
OF-3.086
Token BLACK
Token ID31963
Feature activation-1.58e+00

Pos logit contributions
instein+1.174
oston+1.162
kefeller+1.148
artney+1.137
OF+1.132
Neg logit contributions
 apologizing-0.884
 BG-0.870
 Ange-0.842
 Ge-0.839
 adjourn-0.835
TokenFO
Token ID6080
Feature activation+2.19e-01

Pos logit contributions
 BG+0.922
 apologizing+0.922
 Ge+0.882
 Ange+0.880
 adjourn+0.867
Neg logit contributions
instein-1.418
oston-1.399
kefeller-1.399
artney-1.381
OF-1.352
Token64
Token ID2414
Feature activation+2.64e+00

Pos logit contributions
instein+2.045
oston+2.021
kefeller+2.021
artney+1.982
OF+1.970
Neg logit contributions
 apologizing-1.564
 BG-1.558
 Ange-1.492
 Ge-1.485
 adjourn-1.479
Token406
Token ID29703
Feature activation+3.79e+00

Pos logit contributions
instein+2.427
oston+2.415
kefeller+2.403
artney+2.379
OF+2.349
Neg logit contributions
 apologizing-1.478
 BG-1.465
 Ange-1.422
 Ge-1.393
 adjourn-1.388
Token Mem
Token ID4942
Feature activation+4.21e+00

Pos logit contributions
instein+3.977
oston+3.961
kefeller+3.910
artney+3.885
OF+3.863
Neg logit contributions
 apologizing-1.583
 BG-1.522
 adjourn-1.463
 Ange-1.454
cipline-1.421
Tokenc
Token ID66
Feature activation+4.43e+00

Pos logit contributions
instein+3.345
oston+3.305
kefeller+3.231
artney+3.213
UTC+3.182
Neg logit contributions
 apologizing-2.888
 BG-2.858
 Ange-2.761
 Ge-2.703
 adjourn-2.698
Tokenpy
Token ID9078
Feature activation+5.34e+00

Pos logit contributions
instein+3.391
oston+3.363
kefeller+3.283
artney+3.251
UTC+3.237
Neg logit contributions
 apologizing-3.182
 BG-3.114
 Ange-3.051
 adjourn-3.020
 Ge-2.966
TokenReading
Token ID36120
Feature activation+9.16e+00

Pos logit contributions
oston+3.882
instein+3.878
OF+3.876
UTC+3.790
artney+3.743
Neg logit contributions
 apologizing-3.897
 BG-3.838
 Ange-3.788
 adjourn-3.754
 Ge-3.688
Token 55
Token ID5996
Feature activation+5.00e+00

Pos logit contributions
instein+6.859
oston+6.851
kefeller+6.759
artney+6.690
OF+6.666
Neg logit contributions
 apologizing-6.279
 Ange-6.010
 BG-6.006
 adjourn-5.859
 Ge-5.832
Token80
Token ID1795
Feature activation+2.14e+00

Pos logit contributions
instein+5.087
oston+5.064
OF+5.040
UTC+5.014
kefeller+4.960
Neg logit contributions
 apologizing-2.207
 BG-2.140
 Ange-2.054
 Ge-1.992
 adjourn-1.973
Token953
Token ID49649
Feature activation+3.78e+00

Pos logit contributions
instein+1.368
oston+1.343
kefeller+1.341
artney+1.316
OF+1.292
Neg logit contributions
 apologizing-1.811
 BG-1.802
 Ge-1.753
 Ange-1.752
 adjourn-1.737
Token 16
Token ID1467
Feature activation+2.86e+00

Pos logit contributions
oston+3.370
instein+3.366
kefeller+3.341
artney+3.290
OF+3.285
Neg logit contributions
 apologizing-2.194
 BG-2.113
 Ange-2.041
 adjourn-2.032
 Ge-2.027
Token27
Token ID1983
Feature activation+1.51e+00

Pos logit contributions
instein+2.370
oston+2.342
kefeller+2.310
OF+2.287
artney+2.286
Neg logit contributions
 apologizing-1.867
 BG-1.841
 Ange-1.785
 Ge-1.771
 adjourn-1.753
Token.
Token ID764
Feature activation+1.47e+00

Pos logit contributions
instein+2.495
oston+2.481
kefeller+2.420
OF+2.416
artney+2.382
Neg logit contributions
 apologizing-1.941
 BG-1.883
 Ange-1.837
 adjourn-1.825
iot-1.808
Token normal
Token ID3487
Feature activation+2.14e+00

Pos logit contributions
instein+0.788
oston+0.777
kefeller+0.770
artney+0.751
OF+0.738
Neg logit contributions
 apologizing-1.389
 BG-1.382
 Ange-1.351
 Ge-1.345
 adjourn-1.341
Token *
Token ID1635
Feature activation+6.40e+00

Pos logit contributions
instein+2.263
oston+2.248
kefeller+2.230
OF+2.208
artney+2.205
Neg logit contributions
 apologizing-0.903
 BG-0.879
 Ange-0.835
 adjourn-0.828
 Ge-0.821
Token 1
Token ID352
Feature activation+5.04e+00

Pos logit contributions
instein+3.982
oston+3.885
OF+3.776
kefeller+3.757
artney+3.691
Neg logit contributions
 apologizing-5.386
 BG-5.188
 adjourn-5.132
 Ange-5.123
iot-5.110
Token ;
Token ID2162
Feature activation+4.47e+00

Pos logit contributions
instein+3.438
oston+3.386
OF+3.319
kefeller+3.309
UTC+3.249
Neg logit contributions
 apologizing-3.940
 BG-3.850
 Ange-3.761
cipline-3.731
 adjourn-3.718
Token }
Token ID1782
Feature activation+9.13e+00

Pos logit contributions
instein+3.917
oston+3.856
kefeller+3.791
artney+3.765
OF+3.737
Neg logit contributions
 apologizing-2.633
 BG-2.553
cipline-2.480
 Ange-2.478
iot-2.454
Token
Token ID198
Feature activation+2.66e+00

Pos logit contributions
instein+6.243
oston+6.217
OF+6.134
artney+6.049
 Dolphin+6.004
Neg logit contributions
 apologizing-6.804
 adjourn-6.420
 BG-6.387
iot-6.361
 Ange-6.254
Token
Token ID198
Feature activation+2.22e+00

Pos logit contributions
instein+2.096
oston+2.072
OF+2.033
kefeller+2.021
UTC+2.003
Neg logit contributions
 apologizing-1.834
 BG-1.793
 Ange-1.734
 adjourn-1.728
 Ge-1.699
TokenThe
Token ID464
Feature activation+1.61e+00

Pos logit contributions
instein+1.688
oston+1.662
kefeller+1.638
artney+1.623
OF+1.623
Neg logit contributions
 apologizing-1.613
 BG-1.608
 Ange-1.552
 adjourn-1.540
 Ge-1.539
Token result
Token ID1255
Feature activation+1.14e+00

Pos logit contributions
instein+1.350
oston+1.328
kefeller+1.313
artney+1.303
OF+1.293
Neg logit contributions
 apologizing-1.030
 BG-1.018
 Ange-0.985
 adjourn-0.973
 Ge-0.972
Token pushes
Token ID20070
Feature activation-3.20e+00

Pos logit contributions
instein+0.978
oston+0.967
kefeller+0.959
artney+0.947
OF+0.940
Neg logit contributions
 apologizing-0.706
 BG-0.701
 Ange-0.675
 Ge-0.670
 adjourn-0.666
Token8
Token ID23
Feature activation+3.78e+00

Pos logit contributions
instein+1.163
oston+1.150
kefeller+1.142
artney+1.127
OF+1.118
Neg logit contributions
 apologizing-1.058
 BG-1.048
 Ange-1.012
 Ge-1.009
 adjourn-1.008
Token 5
Token ID642
Feature activation+4.84e-01

Pos logit contributions
instein+2.612
oston+2.580
kefeller+2.576
artney+2.515
OF+2.470
Neg logit contributions
 apologizing-2.967
 BG-2.926
 adjourn-2.853
 Ange-2.839
 Ge-2.825
Token Estonia
Token ID30912
Feature activation+4.40e-01

Pos logit contributions
instein+0.367
oston+0.362
kefeller+0.361
artney+0.356
OF+0.350
Neg logit contributions
 apologizing-0.350
 BG-0.349
 Ge-0.337
 Ange-0.336
 adjourn-0.333
Token 8
Token ID807
Feature activation+1.13e+00

Pos logit contributions
instein+0.359
oston+0.355
kefeller+0.354
artney+0.349
OF+0.344
Neg logit contributions
 apologizing-0.293
 BG-0.293
 Ge-0.282
 Ange-0.281
 adjourn-0.278
Token.
Token ID13
Feature activation+1.72e+00

Pos logit contributions
instein+0.979
oston+0.968
kefeller+0.964
artney+0.952
OF+0.939
Neg logit contributions
 apologizing-0.695
 BG-0.691
 Ange-0.662
 Ge-0.662
 adjourn-0.657
Token5
Token ID20
Feature activation+8.92e+00

Pos logit contributions
instein+1.341
oston+1.326
kefeller+1.318
artney+1.299
OF+1.288
Neg logit contributions
 apologizing-1.204
 BG-1.194
 Ange-1.153
 Ge-1.148
 adjourn-1.147
Token 6
Token ID718
Feature activation+1.48e-01

Pos logit contributions
oston+5.722
instein+5.673
kefeller+5.662
artney+5.539
 Dolphin+5.449
Neg logit contributions
 apologizing-7.102
 BG-6.773
 adjourn-6.733
iot-6.686
 Ange-6.569
Token Congo
Token ID25935
Feature activation-1.38e+00

Pos logit contributions
instein+0.114
oston+0.112
kefeller+0.112
artney+0.110
OF+0.108
Neg logit contributions
 apologizing-0.106
 BG-0.106
 Ge-0.103
 Ange-0.103
 adjourn-0.101
Token,
Token ID11
Feature activation+2.41e+00

Pos logit contributions
 BG+0.919
 apologizing+0.915
 Ge+0.885
 Ange+0.879
 adjourn+0.866
Neg logit contributions
instein-1.133
kefeller-1.119
oston-1.116
artney-1.098
OF-1.080
Token Dem
Token ID1897
Feature activation-1.68e+00

Pos logit contributions
instein+1.725
oston+1.704
OF+1.671
artney+1.668
kefeller+1.655
Neg logit contributions
 apologizing-1.809
 BG-1.779
 Ange-1.745
cipline-1.730
 adjourn-1.723
Token.
Token ID13
Feature activation-1.36e-01

Pos logit contributions
 BG+1.061
 apologizing+1.056
 Ge+1.024
 Ange+1.014
 adjourn+0.996
Neg logit contributions
instein-1.431
kefeller-1.423
oston-1.416
artney-1.398
OF-1.373
TokenG
Token ID38
Feature activation-1.38e+00

Pos logit contributions
 BG+0.725
 apologizing+0.720
 Ge+0.689
 Ange+0.683
Ge+0.673
Neg logit contributions
instein-1.457
kefeller-1.442
oston-1.438
artney-1.424
OF-1.392
TokenED
Token ID1961
Feature activation+4.23e-01

Pos logit contributions
 BG+0.965
 apologizing+0.958
 Ge+0.929
 Ange+0.921
 adjourn+0.915
Neg logit contributions
instein-1.089
kefeller-1.077
oston-1.072
artney-1.062
OF-1.026
Token NET
Token ID30502
Feature activation-5.10e+00

Pos logit contributions
instein+0.358
oston+0.355
kefeller+0.351
artney+0.348
OF+0.346
Neg logit contributions
 apologizing-0.266
 BG-0.262
 Ange-0.254
 Ge-0.253
 adjourn-0.252
Token CALL
Token ID42815
Feature activation-3.08e+00

Pos logit contributions
 BG+2.358
 apologizing+2.279
 Ge+2.193
 Ange+2.149
 adjourn+2.112
Neg logit contributions
instein-5.314
kefeller-5.241
oston-5.202
artney-5.177
OF-4.970
Token UPDATE
Token ID35717
Feature activation+1.20e+00

Pos logit contributions
 BG+1.624
 apologizing+1.586
 Ge+1.518
 Ange+1.512
 adjourn+1.484
Neg logit contributions
instein-2.983
kefeller-2.960
oston-2.927
artney-2.925
OF-2.809
Token:
Token ID25
Feature activation+8.91e+00

Pos logit contributions
instein+1.085
oston+1.078
kefeller+1.067
OF+1.053
artney+1.051
Neg logit contributions
 apologizing-0.680
 BG-0.668
 Ange-0.645
 Ge-0.643
 adjourn-0.635
Token 05
Token ID8870
Feature activation+7.17e-01

Pos logit contributions
instein+5.138
oston+5.059
kefeller+4.993
OF+4.933
artney+4.930
Neg logit contributions
 apologizing-7.549
 BG-7.309
 Ange-7.207
 Ge-7.168
Ge-7.166
Token42
Token ID3682
Feature activation-2.30e+00

Pos logit contributions
instein+0.738
oston+0.731
kefeller+0.726
artney+0.720
OF+0.719
Neg logit contributions
 apologizing-0.323
 BG-0.319
 Ange-0.302
 Ge-0.300
 adjourn-0.300
Tokenz
Token ID89
Feature activation+1.52e+00

Pos logit contributions
 BG+1.294
 apologizing+1.283
 Ge+1.243
 Ange+1.231
Ge+1.215
Neg logit contributions
kefeller-2.104
instein-2.093
artney-2.076
oston-2.067
OF-2.017
Token SH
Token ID6006
Feature activation-8.85e-01

Pos logit contributions
instein+1.360
oston+1.348
kefeller+1.330
artney+1.317
OF+1.315
Neg logit contributions
 apologizing-0.880
 BG-0.861
 Ange-0.833
 Ge-0.828
 adjourn-0.826
TokenAM
Token ID2390
Feature activation+1.21e+00

Pos logit contributions
 apologizing+0.595
 BG+0.594
 Ge+0.572
 Ange+0.570
 adjourn+0.564
Neg logit contributions
instein-0.719
kefeller-0.709
oston-0.708
artney-0.697
OF-0.681
Token a
Token ID257
Feature activation-2.88e+00

Pos logit contributions
instein+2.862
oston+2.819
kefeller+2.754
OF+2.722
artney+2.707
Neg logit contributions
 apologizing-2.796
 BG-2.718
 Ange-2.657
iot-2.643
 adjourn-2.640
Token home
Token ID1363
Feature activation-4.38e+00

Pos logit contributions
 BG+2.040
 apologizing+2.028
 Ge+1.976
 Ange+1.950
 adjourn+1.938
Neg logit contributions
instein-2.253
kefeller-2.252
artney-2.235
oston-2.212
OF-2.151
Token for
Token ID329
Feature activation+2.93e-01

Pos logit contributions
 BG+2.241
 apologizing+2.185
 Ge+2.156
 Ange+2.083
Ge+2.054
Neg logit contributions
kefeller-4.297
instein-4.294
artney-4.249
oston-4.225
OF-4.102
Token Sword
Token ID11535
Feature activation+2.87e-01

Pos logit contributions
instein+0.290
oston+0.287
kefeller+0.287
artney+0.284
OF+0.279
Neg logit contributions
 BG-0.144
 apologizing-0.144
 Ge-0.137
 Ange-0.137
 adjourn-0.134
Token Base
Token ID7308
Feature activation+4.26e-02

Pos logit contributions
instein+0.242
oston+0.239
kefeller+0.239
artney+0.236
OF+0.232
Neg logit contributions
 apologizing-0.184
 BG-0.183
 Ge-0.176
 Ange-0.176
 adjourn-0.174
Token.
Token ID13
Feature activation+8.91e+00

Pos logit contributions
instein+0.037
oston+0.037
kefeller+0.037
artney+0.037
OF+0.036
Neg logit contributions
 apologizing-0.026
 BG-0.026
 Ge-0.024
 Ange-0.024
 adjourn-0.024
Token
Token ID198
Feature activation+2.56e-01

Pos logit contributions
oston+5.916
instein+5.916
 Levin+5.859
OF+5.857
UTC+5.765
Neg logit contributions
 apologizing-6.755
 adjourn-6.422
 substitutes-6.352
 looted-6.315
cipline-6.289
Token
Token ID198
Feature activation-1.52e+00

Pos logit contributions
instein+0.217
oston+0.214
kefeller+0.213
artney+0.211
OF+0.208
Neg logit contributions
 apologizing-0.163
 BG-0.162
 Ange-0.156
 Ge-0.156
 adjourn-0.154
TokenEditor
Token ID17171
Feature activation+7.77e-01

Pos logit contributions
 BG+1.052
 apologizing+1.052
 Ge+1.019
Ge+1.017
 Ange+1.016
Neg logit contributions
instein-1.201
kefeller-1.191
oston-1.179
artney-1.170
OF-1.127
Token's
Token ID338
Feature activation-7.16e-01

Pos logit contributions
instein+0.634
oston+0.626
kefeller+0.623
artney+0.614
OF+0.607
Neg logit contributions
 apologizing-0.516
 BG-0.513
 Ge-0.493
 Ange-0.493
 adjourn-0.489
Token Note
Token ID5740
Feature activation-9.99e-01

Pos logit contributions
 apologizing+0.471
 BG+0.468
 Ge+0.452
 Ange+0.451
 adjourn+0.446
Neg logit contributions
instein-0.591
oston-0.586
kefeller-0.586
artney-0.579
OF-0.569
TokenG
Token ID38
Feature activation-1.98e+00

Pos logit contributions
 BG+1.200
 apologizing+1.180
 Ge+1.143
Ge+1.123
 Ange+1.122
Neg logit contributions
instein-2.501
kefeller-2.480
oston-2.463
artney-2.446
OF-2.368
TokenED
Token ID1961
Feature activation-4.95e-02

Pos logit contributions
 BG+1.374
 apologizing+1.360
 Ge+1.323
 Ange+1.308
 adjourn+1.298
Neg logit contributions
instein-1.569
kefeller-1.556
oston-1.546
artney-1.533
OF-1.475
Token NET
Token ID30502
Feature activation-4.69e+00

Pos logit contributions
 apologizing+0.032
 BG+0.031
 Ange+0.030
 Ge+0.030
 adjourn+0.030
Neg logit contributions
instein-0.042
oston-0.041
kefeller-0.041
artney-0.041
OF-0.040
Token CALL
Token ID42815
Feature activation-2.41e+00

Pos logit contributions
 BG+2.151
 apologizing+2.040
 Ge+2.002
 Ange+1.952
 adjourn+1.905
Neg logit contributions
instein-4.973
kefeller-4.949
oston-4.888
artney-4.875
etsk-4.615
Token UPDATE
Token ID35717
Feature activation+1.26e+00

Pos logit contributions
 BG+1.169
 apologizing+1.149
 Ge+1.092
 Ange+1.090
 adjourn+1.069
Neg logit contributions
instein-2.424
kefeller-2.397
oston-2.387
artney-2.371
OF-2.301
Token:
Token ID25
Feature activation+8.80e+00

Pos logit contributions
instein+1.152
oston+1.140
kefeller+1.129
OF+1.119
artney+1.116
Neg logit contributions
 apologizing-0.713
 BG-0.699
 Ange-0.671
 Ge-0.668
 adjourn-0.665
Token 0
Token ID657
Feature activation+2.47e+00

Pos logit contributions
instein+4.961
oston+4.912
kefeller+4.828
artney+4.785
OF+4.777
Neg logit contributions
 apologizing-7.561
 BG-7.313
 Ange-7.222
Ge-7.190
 Ge-7.187
Token551
Token ID43697
Feature activation+5.29e-01

Pos logit contributions
instein+2.042
oston+2.022
kefeller+2.008
artney+1.984
OF+1.977
Neg logit contributions
 apologizing-1.608
 BG-1.586
 Ange-1.532
 Ge-1.525
 adjourn-1.524
Tokenz
Token ID89
Feature activation+1.36e+00

Pos logit contributions
instein+0.462
oston+0.457
kefeller+0.455
artney+0.450
OF+0.444
Neg logit contributions
 apologizing-0.322
 BG-0.320
 Ange-0.307
 Ge-0.307
 adjourn-0.304
Token VOL
Token ID38570
Feature activation-1.45e+00

Pos logit contributions
instein+1.166
oston+1.156
kefeller+1.138
OF+1.130
artney+1.130
Neg logit contributions
 apologizing-0.838
 BG-0.821
 Ange-0.795
 adjourn-0.790
 Ge-0.790
TokenUN
Token ID4944
Feature activation-2.57e+00

Pos logit contributions
 BG+1.448
 apologizing+1.438
 Ge+1.408
Ge+1.400
 Ange+1.397
Neg logit contributions
instein-0.718
kefeller-0.706
artney-0.692
oston-0.691
 Levin-0.636
Token<|endoftext|>
Token ID50256
Feature activation+7.34e-01

Pos logit contributions
 BG+1.388
 apologizing+1.366
 Ge+1.339
 Ange+1.328
Ge+1.298
Neg logit contributions
instein-1.987
kefeller-1.973
oston-1.959
artney-1.942
OF-1.888
TokenFacebook
Token ID12025
Feature activation-5.74e-01

Pos logit contributions
instein+0.568
oston+0.560
kefeller+0.558
artney+0.550
OF+0.543
Neg logit contributions
 apologizing-0.520
 BG-0.518
 Ange-0.499
 Ge-0.499
 adjourn-0.495
Token Twitter
Token ID3009
Feature activation+4.59e+00

Pos logit contributions
 apologizing+0.361
 BG+0.359
 Ge+0.346
 Ange+0.344
 adjourn+0.340
Neg logit contributions
instein-0.491
kefeller-0.487
oston-0.486
artney-0.480
OF-0.472
Token Reddit
Token ID10750
Feature activation+1.04e+01

Pos logit contributions
instein+2.946
oston+2.881
kefeller+2.827
artney+2.770
OF+2.761
Neg logit contributions
 apologizing-3.788
 BG-3.735
 adjourn-3.664
 Ange-3.650
iot-3.614
Token Google
Token ID3012
Feature activation+6.24e+00

Pos logit contributions
oston+6.534
instein+6.491
UTC+6.422
 Levin+6.409
OF+6.396
Neg logit contributions
 apologizing-8.206
 adjourn-7.841
cipline-7.824
iot-7.801
 BG-7.793
Token+
Token ID10
Feature activation+8.75e+00

Pos logit contributions
instein+4.989
oston+4.915
kefeller+4.860
UTC+4.809
artney+4.794
Neg logit contributions
 apologizing-4.158
 BG-4.045
 adjourn-3.993
iot-3.930
 Ange-3.877
Token
Token ID198
Feature activation+6.67e-01

Pos logit contributions
instein+6.243
artney+6.079
 Levin+6.060
UTC+6.053
oston+6.034
Neg logit contributions
 apologizing-6.214
iot-6.001
 adjourn-5.958
 BG-5.949
cipline-5.916
Token
Token ID198
Feature activation-7.17e-01

Pos logit contributions
instein+0.563
oston+0.556
kefeller+0.554
artney+0.547
OF+0.540
Neg logit contributions
 apologizing-0.425
 BG-0.423
 Ange-0.406
 Ge-0.406
 adjourn-0.402
TokenGo
Token ID5247
Feature activation-8.07e-02

Pos logit contributions
 apologizing+0.505
 BG+0.503
 Ge+0.488
 Ange+0.487
Ge+0.484
Neg logit contributions
instein-0.556
kefeller-0.551
oston-0.548
artney-0.542
OF-0.526
Token to
Token ID284
Feature activation+6.10e-01

Pos logit contributions
 apologizing+0.054
 BG+0.054
 Ge+0.052
 Ange+0.052
 adjourn+0.051
Neg logit contributions
instein-0.066
oston-0.065
kefeller-0.065
artney-0.064
OF-0.063
Token the
Token ID262
Feature activation+7.01e-01

Pos logit contributions
instein+0.504
oston+0.498
kefeller+0.497
artney+0.490
OF+0.483
Neg logit contributions
 apologizing-0.400
 BG-0.398
 Ge-0.383
 Ange-0.383
 adjourn-0.379
Token21
Token ID2481
Feature activation-8.42e-01

Pos logit contributions
instein+0.441
oston+0.437
kefeller+0.434
artney+0.431
OF+0.426
Neg logit contributions
 apologizing-0.261
 BG-0.259
 Ange-0.249
 Ge-0.248
 adjourn-0.245
Token,
Token ID11
Feature activation-1.40e+00

Pos logit contributions
 BG+0.491
 apologizing+0.489
 Ge+0.471
 Ange+0.468
 adjourn+0.462
Neg logit contributions
instein-0.758
kefeller-0.752
oston-0.748
artney-0.742
OF-0.725
Token 21
Token ID2310
Feature activation+8.33e-03

Pos logit contributions
 BG+0.713
 apologizing+0.710
 Ge+0.674
 Ange+0.671
 adjourn+0.663
Neg logit contributions
instein-1.364
kefeller-1.352
oston-1.349
artney-1.332
OF-1.312
Token))
Token ID4008
Feature activation+3.53e+00

Pos logit contributions
instein+0.008
oston+0.008
kefeller+0.008
artney+0.008
OF+0.008
Neg logit contributions
 apologizing-0.004
 BG-0.004
 Ange-0.004
 Ge-0.004
 adjourn-0.004
Token #
Token ID1303
Feature activation-2.50e+00

Pos logit contributions
instein+3.213
oston+3.199
kefeller+3.143
OF+3.119
artney+3.098
Neg logit contributions
 apologizing-1.915
 BG-1.914
 Ange-1.838
 adjourn-1.828
iot-1.820
Token loop
Token ID9052
Feature activation-8.83e+00

Pos logit contributions
 BG+1.315
 apologizing+1.306
 Ge+1.259
 adjourn+1.244
 Ange+1.233
Neg logit contributions
instein-2.376
oston-2.363
kefeller-2.357
artney-2.331
OF-2.283
Token over
Token ID625
Feature activation-2.10e+00

Pos logit contributions
 BG+4.396
 Ge+4.231
 apologizing+4.197
 adjourn+4.189
 Ange+3.972
Neg logit contributions
artney-8.614
kefeller-8.512
instein-8.472
oston-8.446
OF-8.078
Token the
Token ID262
Feature activation+6.78e-01

Pos logit contributions
 BG+1.278
 apologizing+1.258
 Ge+1.242
 Ange+1.216
 adjourn+1.208
Neg logit contributions
instein-1.838
kefeller-1.837
oston-1.821
artney-1.821
OF-1.753
Token input
Token ID5128
Feature activation-2.38e+00

Pos logit contributions
instein+0.502
oston+0.495
kefeller+0.493
artney+0.486
OF+0.478
Neg logit contributions
 apologizing-0.503
 BG-0.502
 Ge-0.485
 Ange-0.485
 adjourn-0.480
Token image
Token ID2939
Feature activation-2.75e-01

Pos logit contributions
 BG+1.410
 apologizing+1.398
 Ge+1.355
 Ange+1.340
 adjourn+1.322
Neg logit contributions
instein-2.126
kefeller-2.112
oston-2.105
artney-2.092
OF-2.042
Token paths
Token ID13532
Feature activation-4.11e-01

Pos logit contributions
 apologizing+0.185
 BG+0.185
 Ge+0.180
 Ange+0.178
 adjourn+0.177
Neg logit contributions
instein-0.224
kefeller-0.220
oston-0.219
artney-0.218
OF-0.212
Token x
Token ID2124
Feature activation-1.90e+00

Pos logit contributions
instein+1.095
oston+1.085
kefeller+1.079
artney+1.068
OF+1.061
Neg logit contributions
 apologizing-0.390
 BG-0.383
 Ange-0.359
 Ge-0.357
 adjourn-0.355
Token.
Token ID13
Feature activation-3.69e-02

Pos logit contributions
 BG+1.092
 apologizing+1.083
 Ge+1.054
 Ange+1.043
Ge+1.029
Neg logit contributions
instein-1.726
kefeller-1.718
oston-1.705
artney-1.703
OF-1.651
Tokenx
Token ID87
Feature activation-1.75e+00

Pos logit contributions
 apologizing+0.016
 BG+0.016
 Ange+0.015
 Ge+0.015
 adjourn+0.015
Neg logit contributions
instein-0.038
oston-0.038
kefeller-0.038
artney-0.037
OF-0.037
Token.
Token ID13
Feature activation-3.90e+00

Pos logit contributions
 BG+1.010
 apologizing+1.004
 Ge+0.973
 Ange+0.965
Ge+0.950
Neg logit contributions
instein-1.575
kefeller-1.568
oston-1.556
artney-1.552
OF-1.509
Tokenx
Token ID87
Feature activation-6.50e+00

Pos logit contributions
 BG+1.496
 apologizing+1.459
 Ge+1.414
Ge+1.388
 Ange+1.382
Neg logit contributions
kefeller-4.256
instein-4.233
artney-4.220
oston-4.189
OF-4.080
Token.
Token ID13
Feature activation-8.74e+00

Pos logit contributions
 BG+3.270
 apologizing+3.131
 Ge+3.113
Ge+3.035
iot+3.031
Neg logit contributions
kefeller-6.349
artney-6.254
instein-6.209
oston-6.124
OF-5.925
Token141
Token ID23756
Feature activation-4.68e-01

Pos logit contributions
 BG+3.924
iot+3.879
 apologizing+3.826
capt+3.814
cipline+3.677
Neg logit contributions
kefeller-8.629
instein-8.484
artney-8.306
 Dolphin-8.271
oston-8.268
Token 112
Token ID13539
Feature activation-2.51e-01

Pos logit contributions
 BG+0.242
 apologizing+0.240
 Ge+0.230
 Ange+0.228
 adjourn+0.225
Neg logit contributions
instein-0.453
kefeller-0.449
oston-0.448
artney-0.444
OF-0.436
Token08
Token ID2919
Feature activation-2.38e+00

Pos logit contributions
 apologizing+0.140
 BG+0.140
 Ge+0.133
 Ange+0.133
 adjourn+0.132
Neg logit contributions
instein-0.232
oston-0.229
kefeller-0.229
artney-0.226
OF-0.223
Token80
Token ID1795
Feature activation-3.33e+00

Pos logit contributions
 BG+1.152
 apologizing+1.140
 Ge+1.104
 Ange+1.081
Ge+1.076
Neg logit contributions
instein-2.375
kefeller-2.362
oston-2.344
artney-2.331
OF-2.260
Token x
Token ID2124
Feature activation-2.07e+00

Pos logit contributions
 BG+1.098
 apologizing+1.050
 Ge+1.038
 Ange+1.008
Ge+0.962
Neg logit contributions
kefeller-3.861
instein-3.859
oston-3.831
artney-3.828
OF-3.725
Token with
Token ID351
Feature activation-2.00e+00

Pos logit contributions
 BG+1.446
 apologizing+1.433
 Ge+1.384
 Ange+1.348
 adjourn+1.346
Neg logit contributions
instein-2.691
kefeller-2.680
oston-2.672
artney-2.654
OF-2.589
Token the
Token ID262
Feature activation-4.20e+00

Pos logit contributions
 BG+1.246
 apologizing+1.228
 Ge+1.203
 Ange+1.182
 adjourn+1.169
Neg logit contributions
instein-1.732
kefeller-1.722
oston-1.717
artney-1.708
OF-1.657
Token j
Token ID474
Feature activation-3.40e+00

Pos logit contributions
 apologizing+2.671
 BG+2.660
 Ge+2.584
 adjourn+2.547
 Ange+2.516
Neg logit contributions
instein-3.516
oston-3.512
kefeller-3.504
artney-3.499
OF-3.423
Tokenagg
Token ID9460
Feature activation+1.86e-01

Pos logit contributions
 apologizing+1.994
 BG+1.984
 Ge+1.899
 Ange+1.891
iot+1.887
Neg logit contributions
kefeller-2.996
instein-2.980
oston-2.962
artney-2.932
OF-2.891
Tokenies
Token ID444
Feature activation-4.23e+00

Pos logit contributions
instein+0.148
oston+0.146
kefeller+0.146
artney+0.144
OF+0.142
Neg logit contributions
 apologizing-0.127
 BG-0.126
 Ge-0.122
 Ange-0.122
 adjourn-0.121
Token since
Token ID1201
Feature activation-8.63e+00

Pos logit contributions
 BG+2.236
 apologizing+2.129
 Ge+2.073
 Ange+2.008
 adjourn+1.977
Neg logit contributions
kefeller-4.116
oston-4.088
instein-4.071
artney-4.014
OF-3.944
Token even
Token ID772
Feature activation-7.65e+00

Pos logit contributions
 BG+3.244
 apologizing+2.902
 Ge+2.778
 Ange+2.765
 adjourn+2.765
Neg logit contributions
oston-9.559
instein-9.526
artney-9.492
kefeller-9.436
OF-9.153
Token though
Token ID996
Feature activation-3.57e+00

Pos logit contributions
 BG+2.993
 apologizing+2.855
 Ange+2.688
 Ge+2.655
iot+2.554
Neg logit contributions
kefeller-8.413
oston-8.280
instein-8.273
artney-8.221
UTC-8.014
Token they
Token ID484
Feature activation-3.37e+00

Pos logit contributions
 BG+2.013
 apologizing+2.002
 Ge+1.955
 Ange+1.895
 adjourn+1.894
Neg logit contributions
kefeller-3.274
oston-3.262
artney-3.259
instein-3.259
OF-3.165
Token are
Token ID389
Feature activation-3.36e+00

Pos logit contributions
 BG+2.216
 apologizing+2.198
 Ge+2.143
 Ange+2.118
 adjourn+2.092
Neg logit contributions
kefeller-2.779
instein-2.762
oston-2.748
artney-2.744
OF-2.664
Token there
Token ID612
Feature activation-3.39e+00

Pos logit contributions
 BG+1.698
 apologizing+1.693
 Ge+1.614
 adjourn+1.595
 Ange+1.585
Neg logit contributions
kefeller-3.319
instein-3.265
oston-3.261
artney-3.248
OF-3.174
Token Own
Token ID11744
Feature activation+1.48e+00

Pos logit contributions
instein+2.952
kefeller+2.924
oston+2.919
artney+2.877
OF+2.848
Neg logit contributions
 apologizing-1.609
 BG-1.544
 adjourn-1.482
 Ange-1.476
cipline-1.473
Tokened
Token ID276
Feature activation+9.41e-03

Pos logit contributions
instein+1.102
oston+1.087
kefeller+1.080
artney+1.062
OF+1.052
Neg logit contributions
 apologizing-1.099
 BG-1.090
 Ange-1.053
 Ge-1.050
 adjourn-1.046
Token <
Token ID1279
Feature activation-3.96e+00

Pos logit contributions
instein+0.011
oston+0.011
kefeller+0.011
artney+0.010
OF+0.010
Neg logit contributions
 apologizing-0.003
 BG-0.003
 Ge-0.003
 Ange-0.003
 adjourn-0.003
Token T
Token ID309
Feature activation-3.62e+00

Pos logit contributions
 BG+2.094
 Ge+2.011
 apologizing+2.001
 Ange+1.960
 adjourn+1.911
Neg logit contributions
kefeller-3.824
instein-3.824
artney-3.796
oston-3.770
OF-3.631
Token >
Token ID1875
Feature activation-3.45e+00

Pos logit contributions
 BG+1.969
 apologizing+1.899
 Ge+1.892
 Ange+1.852
Ge+1.818
Neg logit contributions
instein-3.433
kefeller-3.404
artney-3.358
oston-3.356
OF-3.254
Token {
Token ID1391
Feature activation-8.31e+00

Pos logit contributions
 BG+2.022
 apologizing+1.975
 Ge+1.948
 Ange+1.903
 adjourn+1.871
Neg logit contributions
instein-3.122
kefeller-3.114
oston-3.077
artney-3.061
OF-2.974
Token...
Token ID2644
Feature activation-4.63e+00

Pos logit contributions
 BG+5.084
 apologizing+5.021
 Ge+4.938
 Ange+4.911
 adjourn+4.758
Neg logit contributions
instein-6.969
kefeller-6.929
artney-6.915
oston-6.878
OF-6.734
Token }
Token ID1782
Feature activation-1.96e+00

Pos logit contributions
 BG+2.408
 apologizing+2.379
 Ge+2.335
 Ange+2.292
 adjourn+2.261
Neg logit contributions
kefeller-4.435
instein-4.429
oston-4.407
artney-4.377
OF-4.299
Token
Token ID198
Feature activation-4.85e-01

Pos logit contributions
 apologizing+1.209
 BG+1.207
 Ge+1.162
 Ange+1.157
 adjourn+1.146
Neg logit contributions
instein-1.692
kefeller-1.681
oston-1.681
artney-1.660
OF-1.628
Token
Token ID198
Feature activation+1.87e+00

Pos logit contributions
 apologizing+0.302
 BG+0.302
 Ge+0.291
 Ange+0.289
 adjourn+0.286
Neg logit contributions
instein-0.417
kefeller-0.412
oston-0.412
artney-0.407
OF-0.399
TokenThe
Token ID464
Feature activation+9.86e-01

Pos logit contributions
instein+1.426
oston+1.401
kefeller+1.390
artney+1.372
OF+1.371
Neg logit contributions
 apologizing-1.348
 BG-1.343
 Ange-1.294
 Ge-1.288
 adjourn-1.282
Token him
Token ID683
Feature activation-2.17e-01

Pos logit contributions
 apologizing+0.575
 BG+0.574
 Ge+0.554
 Ange+0.553
 adjourn+0.542
Neg logit contributions
instein-0.860
oston-0.851
kefeller-0.851
artney-0.838
OF-0.826
Token prisoner
Token ID17234
Feature activation-1.17e+00

Pos logit contributions
 apologizing+0.131
 BG+0.131
 Ge+0.126
 Ange+0.125
 adjourn+0.124
Neg logit contributions
instein-0.190
oston-0.188
kefeller-0.188
artney-0.185
OF-0.183
Token if
Token ID611
Feature activation+2.82e-01

Pos logit contributions
 apologizing+0.708
 BG+0.699
 Ge+0.674
 Ange+0.670
 adjourn+0.665
Neg logit contributions
instein-1.031
kefeller-1.019
oston-1.018
artney-1.005
OF-0.991
Token he
Token ID339
Feature activation-1.44e+00

Pos logit contributions
instein+0.231
oston+0.228
kefeller+0.227
artney+0.224
OF+0.221
Neg logit contributions
 apologizing-0.187
 BG-0.186
 Ge-0.179
 Ange-0.179
 adjourn-0.178
Token could
Token ID714
Feature activation+1.25e+00

Pos logit contributions
 apologizing+0.884
 BG+0.878
 Ge+0.844
 Ange+0.841
 adjourn+0.838
Neg logit contributions
instein-1.257
kefeller-1.248
oston-1.244
artney-1.229
OF-1.210
Token.[
Token ID3693
Feature activation-8.15e+00

Pos logit contributions
instein+1.046
oston+1.029
kefeller+1.024
artney+1.010
OF+0.999
Neg logit contributions
 apologizing-0.804
 BG-0.800
 Ange-0.769
 Ge-0.767
 adjourn-0.757
Token28
Token ID2078
Feature activation-2.02e+00

Pos logit contributions
 apologizing+2.920
 BG+2.899
Ge+2.844
 Ge+2.809
 Ange+2.761
Neg logit contributions
kefeller-8.767
artney-8.744
instein-8.698
oston-8.559
 Levin-8.384
Token]
Token ID60
Feature activation-2.79e+00

Pos logit contributions
 apologizing+1.020
 BG+1.015
 Ge+0.965
 Ange+0.965
 adjourn+0.951
Neg logit contributions
instein-1.979
oston-1.957
kefeller-1.953
artney-1.932
OF-1.907
Token
Token ID198
Feature activation-2.16e+00

Pos logit contributions
 BG+1.744
 apologizing+1.716
 Ge+1.704
 Ange+1.693
Ge+1.627
Neg logit contributions
kefeller-2.397
instein-2.389
oston-2.353
artney-2.342
OF-2.274
Token
Token ID198
Feature activation-2.38e+00

Pos logit contributions
 BG+1.270
 apologizing+1.248
 Ge+1.238
 Ange+1.217
Ge+1.204
Neg logit contributions
kefeller-1.943
instein-1.939
artney-1.919
oston-1.916
OF-1.841
TokenDuring
Token ID7191
Feature activation-7.03e-01

Pos logit contributions
 apologizing+1.657
 BG+1.655
Ge+1.632
 Ge+1.619
 Ange+1.609
Neg logit contributions
kefeller-1.854
instein-1.847
oston-1.819
artney-1.802
 Levin-1.733
Token:
Token ID25
Feature activation+1.41e+00

Pos logit contributions
 BG+0.597
 apologizing+0.594
 Ge+0.572
 Ange+0.567
 adjourn+0.558
Neg logit contributions
instein-1.091
kefeller-1.083
oston-1.079
artney-1.070
OF-1.047
Token Just
Token ID2329
Feature activation-1.68e-01

Pos logit contributions
instein+1.191
oston+1.177
kefeller+1.164
artney+1.156
OF+1.150
Neg logit contributions
 apologizing-0.892
 BG-0.876
 adjourn-0.846
 Ange-0.842
 Ge-0.838
Token about
Token ID546
Feature activation+8.09e-01

Pos logit contributions
 apologizing+0.103
 BG+0.102
 Ge+0.098
 Ange+0.098
 adjourn+0.097
Neg logit contributions
instein-0.146
kefeller-0.145
oston-0.145
artney-0.142
OF-0.140
Token the
Token ID262
Feature activation+1.87e-01

Pos logit contributions
instein+0.708
oston+0.699
kefeller+0.695
artney+0.687
OF+0.678
Neg logit contributions
 apologizing-0.489
 BG-0.487
 Ange-0.468
 Ge-0.467
 adjourn-0.462
Token key
Token ID1994
Feature activation-4.45e-01

Pos logit contributions
instein+0.163
oston+0.161
kefeller+0.161
artney+0.159
OF+0.157
Neg logit contributions
 apologizing-0.115
 BG-0.114
 Ge-0.110
 Ange-0.110
 adjourn-0.109
Tokenw
Token ID86
Feature activation-8.11e+00

Pos logit contributions
 apologizing+0.278
 BG+0.277
 Ange+0.266
 Ge+0.266
 adjourn+0.263
Neg logit contributions
instein-0.381
oston-0.377
kefeller-0.375
artney-0.371
OF-0.366
Tokenording
Token ID1284
Feature activation-9.77e-01

Pos logit contributions
 BG+4.890
 apologizing+4.819
 Ange+4.782
iot+4.742
 Ge+4.741
Neg logit contributions
kefeller-6.814
instein-6.702
oston-6.608
artney-6.598
OF-6.400
Token of
Token ID286
Feature activation+3.08e-01

Pos logit contributions
 apologizing+0.577
 BG+0.575
 Ge+0.551
 Ange+0.550
 adjourn+0.544
Neg logit contributions
instein-0.873
kefeller-0.866
oston-0.863
artney-0.852
OF-0.838
Token potions
Token ID28074
Feature activation+2.33e-01

Pos logit contributions
instein+0.262
oston+0.259
kefeller+0.258
artney+0.255
OF+0.251
Neg logit contributions
 apologizing-0.195
 BG-0.194
 Ange-0.186
 Ge-0.186
 adjourn-0.184
Token.
Token ID13
Feature activation-1.86e-01

Pos logit contributions
instein+0.205
oston+0.202
kefeller+0.202
artney+0.199
OF+0.197
Neg logit contributions
 apologizing-0.141
 BG-0.141
 Ge-0.135
 Ange-0.135
 adjourn-0.133
Token
Token ID198
Feature activation+1.70e+00

Pos logit contributions
 apologizing+0.118
 BG+0.118
 Ge+0.113
 Ange+0.113
 adjourn+0.112
Neg logit contributions
instein-0.158
oston-0.156
kefeller-0.156
artney-0.154
OF-0.151
Token players
Token ID1938
Feature activation-1.01e+00

Pos logit contributions
instein+2.323
oston+2.321
kefeller+2.298
artney+2.265
OF+2.262
Neg logit contributions
 apologizing-1.191
 BG-1.164
 Ange-1.123
 Ge-1.102
 adjourn-1.100
Token from
Token ID422
Feature activation-2.11e+00

Pos logit contributions
 apologizing+0.500
 BG+0.498
 Ge+0.473
 Ange+0.471
 adjourn+0.465
Neg logit contributions
instein-0.994
oston-0.983
kefeller-0.983
artney-0.972
OF-0.958
Token other
Token ID584
Feature activation-1.89e+00

Pos logit contributions
 BG+1.161
 apologizing+1.154
 Ge+1.118
 Ange+1.101
 adjourn+1.095
Neg logit contributions
instein-1.966
kefeller-1.954
oston-1.949
artney-1.933
OF-1.887
Token team
Token ID1074
Feature activation+4.69e-01

Pos logit contributions
 BG+1.141
 apologizing+1.138
 Ge+1.096
 Ange+1.087
 adjourn+1.074
Neg logit contributions
instein-1.670
kefeller-1.652
oston-1.642
artney-1.633
OF-1.601
Token such
Token ID884
Feature activation+2.44e-01

Pos logit contributions
instein+0.462
oston+0.459
kefeller+0.454
artney+0.450
OF+0.446
Neg logit contributions
 apologizing-0.232
 BG-0.230
 Ange-0.218
 Ge-0.217
 adjourn-0.215
Token as
Token ID355
Feature activation-8.08e+00

Pos logit contributions
instein+0.213
oston+0.210
kefeller+0.209
artney+0.206
OF+0.204
Neg logit contributions
 apologizing-0.149
 BG-0.147
 Ange-0.142
 Ge-0.141
 adjourn-0.140
Token sc
Token ID629
Feature activation-9.95e-01

Pos logit contributions
 BG+4.082
 apologizing+3.938
 Ge+3.886
 adjourn+3.766
 Ange+3.737
Neg logit contributions
artney-7.931
kefeller-7.796
instein-7.736
oston-7.587
OF-7.509
Tokena
Token ID64
Feature activation-1.32e+00

Pos logit contributions
 BG+0.762
 apologizing+0.761
 Ge+0.738
 Ange+0.737
 adjourn+0.729
Neg logit contributions
instein-0.709
kefeller-0.700
oston-0.698
artney-0.688
OF-0.673
Token.
Token ID13
Feature activation+3.12e+00

Pos logit contributions
 BG+0.758
 apologizing+0.749
 Ge+0.731
 Ange+0.720
Ge+0.706
Neg logit contributions
instein-1.208
kefeller-1.202
oston-1.192
artney-1.190
OF-1.153
Token GET
Token ID17151
Feature activation-6.13e-01

Pos logit contributions
instein+2.553
oston+2.522
kefeller+2.505
artney+2.483
OF+2.463
Neg logit contributions
 apologizing-2.035
 BG-2.010
 adjourn-1.934
 Ange-1.934
 Ge-1.927
Token OUT
Token ID16289
Feature activation-2.24e-01

Pos logit contributions
 BG+0.388
 apologizing+0.387
 Ge+0.372
 Ange+0.371
 adjourn+0.366
Neg logit contributions
instein-0.523
kefeller-0.518
oston-0.517
artney-0.510
OF-0.498
Token at
Token ID379
Feature activation-3.13e+00

Pos logit contributions
instein+1.103
oston+1.089
kefeller+1.085
artney+1.071
OF+1.059
Neg logit contributions
 apologizing-0.796
 BG-0.790
 Ange-0.760
 Ge-0.758
 adjourn-0.752
Token the
Token ID262
Feature activation-2.20e+00

Pos logit contributions
 BG+2.219
 apologizing+2.147
 Ge+2.143
 Ange+2.128
Ge+2.056
Neg logit contributions
instein-2.444
oston-2.415
kefeller-2.408
artney-2.392
OF-2.346
Token University
Token ID2059
Feature activation+4.68e-01

Pos logit contributions
 BG+1.480
 apologizing+1.458
 Ge+1.444
 Ange+1.435
Ge+1.386
Neg logit contributions
instein-1.782
oston-1.762
kefeller-1.751
artney-1.739
OF-1.707
Token of
Token ID286
Feature activation-2.04e+00

Pos logit contributions
instein+0.420
kefeller+0.415
oston+0.415
artney+0.411
OF+0.402
Neg logit contributions
 BG-0.274
 apologizing-0.273
 Ge-0.264
 Ange-0.263
 adjourn-0.257
Token Alabama
Token ID9266
Feature activation-3.03e+00

Pos logit contributions
 BG+1.698
 apologizing+1.666
 Ge+1.663
 Ange+1.642
Ge+1.609
Neg logit contributions
instein-1.358
kefeller-1.343
artney-1.324
oston-1.318
OF-1.271
Token at
Token ID379
Feature activation-8.07e+00

Pos logit contributions
 apologizing+1.902
 BG+1.890
 Ge+1.831
 Ange+1.825
Ge+1.769
Neg logit contributions
instein-2.618
kefeller-2.581
artney-2.559
oston-2.548
OF-2.511
Token Birmingham
Token ID18899
Feature activation-4.02e+00

Pos logit contributions
 BG+4.171
 Ange+4.128
 apologizing+4.119
 Ge+4.050
Ge+3.796
Neg logit contributions
instein-7.738
artney-7.608
kefeller-7.576
OF-7.472
oston-7.454
Token School
Token ID3961
Feature activation-4.97e-01

Pos logit contributions
 apologizing+2.384
 BG+2.382
 Ge+2.307
 Ange+2.279
iot+2.218
Neg logit contributions
instein-3.572
kefeller-3.535
oston-3.503
artney-3.492
OF-3.439
Token of
Token ID286
Feature activation-3.08e+00

Pos logit contributions
 BG+0.290
 apologizing+0.289
 Ge+0.279
 Ange+0.277
 adjourn+0.274
Neg logit contributions
instein-0.447
kefeller-0.442
oston-0.442
artney-0.437
OF-0.428
Token Medicine
Token ID11558
Feature activation+3.12e-01

Pos logit contributions
 BG+2.106
 apologizing+2.059
 Ge+2.055
 Ange+2.003
Ge+1.975
Neg logit contributions
instein-2.480
kefeller-2.445
oston-2.437
artney-2.407
OF-2.366
Token.
Token ID13
Feature activation+6.50e-01

Pos logit contributions
instein+0.272
oston+0.270
kefeller+0.268
artney+0.265
OF+0.262
Neg logit contributions
 apologizing-0.190
 BG-0.189
 Ange-0.181
 Ge-0.181
 adjourn-0.180
Token,
Token ID11
Feature activation-3.60e-01

Pos logit contributions
instein+2.005
oston+1.981
kefeller+1.971
artney+1.947
OF+1.926
Neg logit contributions
 apologizing-1.403
 BG-1.396
 Ange-1.343
 Ge-1.336
 adjourn-1.326
Token and
Token ID290
Feature activation-1.78e-01

Pos logit contributions
 apologizing+0.211
 BG+0.210
 Ge+0.202
 Ange+0.201
 adjourn+0.198
Neg logit contributions
instein-0.323
kefeller-0.319
oston-0.319
artney-0.315
OF-0.310
Token perhaps
Token ID3737
Feature activation-2.01e+00

Pos logit contributions
 apologizing+0.110
 BG+0.110
 Ge+0.106
 Ange+0.106
 adjourn+0.105
Neg logit contributions
instein-0.154
kefeller-0.152
oston-0.152
artney-0.150
OF-0.148
Token a
Token ID257
Feature activation-3.95e-01

Pos logit contributions
 apologizing+1.133
 BG+1.130
 Ge+1.091
 Ange+1.078
 adjourn+1.077
Neg logit contributions
instein-1.838
oston-1.829
kefeller-1.827
artney-1.807
OF-1.772
Token partial
Token ID13027
Feature activation-5.51e-01

Pos logit contributions
 apologizing+0.240
 BG+0.240
 Ge+0.231
 Ange+0.228
 adjourn+0.227
Neg logit contributions
instein-0.345
oston-0.342
kefeller-0.341
artney-0.337
OF-0.331
Token or
Token ID393
Feature activation-7.90e+00

Pos logit contributions
 apologizing+0.349
 BG+0.348
 Ge+0.335
 Ange+0.333
 adjourn+0.331
Neg logit contributions
instein-0.467
oston-0.462
kefeller-0.461
artney-0.456
OF-0.448
Token
Token ID564
Feature activation-2.50e+00

Pos logit contributions
 BG+4.257
 apologizing+4.236
 adjourn+4.111
 Ge+4.043
iot+4.038
Neg logit contributions
artney-7.130
oston-7.099
instein-7.098
kefeller-7.033
OF-6.903
Token
Token ID250
Feature activation-2.11e+00

Pos logit contributions
 BG+1.531
 apologizing+1.531
 Ge+1.463
 Ange+1.461
Ge+1.441
Neg logit contributions
instein-2.173
kefeller-2.156
oston-2.141
artney-2.118
OF-2.072
Tokenhalf
Token ID13959
Feature activation-7.93e-01

Pos logit contributions
 apologizing+1.424
 BG+1.421
 Ge+1.372
 Ange+1.363
iot+1.362
Neg logit contributions
instein-1.686
kefeller-1.672
oston-1.668
artney-1.646
OF-1.611
Token
Token ID447
Feature activation-1.00e+00

Pos logit contributions
 apologizing+0.512
 BG+0.509
 Ge+0.491
 Ange+0.489
 adjourn+0.485
Neg logit contributions
instein-0.662
kefeller-0.656
oston-0.655
artney-0.648
OF-0.633
Token
Token ID251
Feature activation-9.33e-01

Pos logit contributions
 apologizing+0.506
 BG+0.502
 Ange+0.478
 Ge+0.478
 adjourn+0.472
Neg logit contributions
instein-0.977
kefeller-0.969
oston-0.964
artney-0.955
OF-0.935
Token <
Token ID1279
Feature activation-2.66e+00

Pos logit contributions
instein+1.143
oston+1.133
kefeller+1.128
artney+1.114
OF+1.110
Neg logit contributions
 apologizing-0.488
 BG-0.481
 Ange-0.456
 Ge-0.452
 adjourn-0.450
Token T
Token ID309
Feature activation-4.27e+00

Pos logit contributions
 BG+1.503
 apologizing+1.495
 Ge+1.446
 Ange+1.427
 adjourn+1.414
Neg logit contributions
instein-2.444
kefeller-2.423
oston-2.415
artney-2.399
OF-2.342
Token >
Token ID1875
Feature activation-3.29e+00

Pos logit contributions
 BG+2.375
 apologizing+2.321
 Ge+2.280
 Ange+2.233
Ge+2.216
Neg logit contributions
instein-3.948
kefeller-3.927
artney-3.880
oston-3.857
OF-3.759
Token {
Token ID1391
Feature activation-3.51e+00

Pos logit contributions
 BG+1.822
 apologizing+1.799
 Ge+1.757
 Ange+1.706
Ge+1.688
Neg logit contributions
instein-3.057
kefeller-3.034
oston-3.029
artney-2.996
OF-2.933
Token pub
Token ID2240
Feature activation-2.26e+00

Pos logit contributions
 apologizing+1.888
 BG+1.885
 Ge+1.820
 Ange+1.788
 adjourn+1.772
Neg logit contributions
instein-3.307
kefeller-3.301
oston-3.264
artney-3.246
OF-3.176
Token fn
Token ID24714
Feature activation-7.86e+00

Pos logit contributions
 BG+1.063
 apologizing+1.061
 Ge+1.013
 Ange+1.006
 adjourn+0.992
Neg logit contributions
instein-2.287
kefeller-2.271
oston-2.270
artney-2.251
OF-2.207
Token new
Token ID649
Feature activation-1.34e+00

Pos logit contributions
 BG+4.687
 apologizing+4.571
 adjourn+4.474
 Ge+4.450
 Ange+4.392
Neg logit contributions
oston-6.770
instein-6.748
kefeller-6.732
artney-6.722
OF-6.551
Token (
Token ID357
Feature activation+2.84e+00

Pos logit contributions
 apologizing+0.813
 BG+0.812
 Ge+0.780
 Ange+0.778
 adjourn+0.769
Neg logit contributions
instein-1.172
oston-1.158
kefeller-1.157
artney-1.146
OF-1.123
Token t
Token ID256
Feature activation+4.76e-01

Pos logit contributions
instein+2.495
oston+2.471
kefeller+2.438
OF+2.417
artney+2.397
Neg logit contributions
 apologizing-1.679
 BG-1.648
 Ange-1.596
 Ge-1.582
 adjourn-1.566
Token :
Token ID1058
Feature activation-1.44e+00

Pos logit contributions
instein+0.437
oston+0.432
kefeller+0.431
artney+0.426
OF+0.420
Neg logit contributions
 apologizing-0.269
 BG-0.267
 Ange-0.255
 Ge-0.255
 adjourn-0.252
Token T
Token ID309
Feature activation+1.14e-01

Pos logit contributions
 apologizing+0.824
 BG+0.823
 Ge+0.788
 Ange+0.785
 adjourn+0.777
Neg logit contributions
instein-1.310
oston-1.295
kefeller-1.294
artney-1.280
OF-1.258
Token <
Token ID1279
Feature activation-1.50e+00

Pos logit contributions
instein+1.338
oston+1.330
kefeller+1.321
artney+1.309
OF+1.305
Neg logit contributions
 apologizing-0.486
 BG-0.478
 Ange-0.450
 Ge-0.445
 adjourn-0.443
Token T
Token ID309
Feature activation-2.54e+00

Pos logit contributions
 BG+0.876
 apologizing+0.874
 Ge+0.839
 Ange+0.834
 adjourn+0.825
Neg logit contributions
instein-1.344
kefeller-1.329
oston-1.326
artney-1.314
OF-1.287
Token >
Token ID1875
Feature activation-5.26e-01

Pos logit contributions
 BG+1.444
 apologizing+1.428
 Ge+1.380
 Ange+1.366
 adjourn+1.351
Neg logit contributions
instein-2.330
kefeller-2.303
oston-2.286
artney-2.273
OF-2.229
Token {
Token ID1391
Feature activation-4.47e+00

Pos logit contributions
 BG+0.322
 apologizing+0.322
 Ge+0.310
 Ange+0.308
 adjourn+0.304
Neg logit contributions
instein-0.460
kefeller-0.455
oston-0.453
artney-0.447
OF-0.439
Token pub
Token ID2240
Feature activation-2.15e+00

Pos logit contributions
 BG+2.605
 apologizing+2.594
 Ge+2.507
 Ange+2.490
 adjourn+2.443
Neg logit contributions
instein-3.993
kefeller-3.987
oston-3.940
artney-3.921
OF-3.841
Token fn
Token ID24714
Feature activation-7.85e+00

Pos logit contributions
 BG+0.937
 apologizing+0.933
 Ge+0.886
 Ange+0.880
 adjourn+0.864
Neg logit contributions
instein-2.247
kefeller-2.228
oston-2.226
artney-2.207
OF-2.169
Token load
Token ID3440
Feature activation-7.19e-01

Pos logit contributions
 BG+4.562
 apologizing+4.347
 Ge+4.276
 adjourn+4.203
 Ange+4.196
Neg logit contributions
kefeller-7.065
instein-7.008
artney-6.961
oston-6.868
OF-6.724
Token <
Token ID1279
Feature activation-1.50e+00

Pos logit contributions
 BG+0.457
 apologizing+0.456
 Ge+0.438
 Ange+0.438
 adjourn+0.432
Neg logit contributions
instein-0.608
kefeller-0.600
oston-0.600
artney-0.593
OF-0.580
Token '
Token ID705
Feature activation-4.90e-01

Pos logit contributions
 apologizing+1.000
 BG+0.997
 Ge+0.959
 Ange+0.958
 adjourn+0.949
Neg logit contributions
instein-1.231
oston-1.215
kefeller-1.212
artney-1.196
OF-1.177
Tokena
Token ID64
Feature activation-2.01e+00

Pos logit contributions
 apologizing+0.379
 BG+0.378
 Ge+0.366
 Ange+0.365
 adjourn+0.362
Neg logit contributions
instein-0.347
oston-0.342
kefeller-0.341
artney-0.336
OF-0.330
Token >
Token ID1875
Feature activation-2.92e+00

Pos logit contributions
 BG+1.327
 apologizing+1.317
 Ge+1.273
 Ange+1.266
 adjourn+1.253
Neg logit contributions
instein-1.659
kefeller-1.641
oston-1.630
artney-1.617
OF-1.582
Token C
Token ID327
Feature activation-7.37e-02

Pos logit contributions
instein+2.040
oston+2.016
kefeller+1.993
artney+1.969
OF+1.958
Neg logit contributions
 apologizing-1.521
 BG-1.506
 Ange-1.454
 Ge-1.448
 adjourn-1.432
TokenBA
Token ID4339
Feature activation-4.73e-01

Pos logit contributions
 apologizing+0.036
 BG+0.035
 Ge+0.034
 Ange+0.034
 adjourn+0.033
Neg logit contributions
instein-0.073
oston-0.073
kefeller-0.073
artney-0.072
OF-0.071
Token contexts
Token ID26307
Feature activation+9.47e-01

Pos logit contributions
 apologizing+0.265
 BG+0.265
 Ge+0.253
 Ange+0.252
 adjourn+0.249
Neg logit contributions
instein-0.436
kefeller-0.433
oston-0.432
artney-0.427
OF-0.419
Token.
Token ID13
Feature activation+4.90e-01

Pos logit contributions
instein+0.832
oston+0.823
kefeller+0.819
artney+0.810
OF+0.800
Neg logit contributions
 apologizing-0.571
 BG-0.568
 Ange-0.545
 Ge-0.544
 adjourn-0.539
Token Industrial
Token ID19034
Feature activation-2.74e+00

Pos logit contributions
instein+0.417
kefeller+0.412
oston+0.412
artney+0.407
OF+0.398
Neg logit contributions
 apologizing-0.309
 BG-0.309
 Ge-0.297
 Ange-0.297
 adjourn-0.292
Token due
Token ID2233
Feature activation-7.68e+00

Pos logit contributions
 apologizing+1.711
 BG+1.700
 Ge+1.634
 Ange+1.628
 adjourn+1.612
Neg logit contributions
kefeller-2.341
instein-2.339
artney-2.308
oston-2.308
OF-2.234
Token process
Token ID1429
Feature activation-1.46e+00

Pos logit contributions
 apologizing+3.363
 BG+3.287
 Ge+3.199
 adjourn+3.174
iot+2.970
Neg logit contributions
kefeller-8.133
artney-7.981
instein-7.890
oston-7.857
OF-7.667
Token is
Token ID318
Feature activation-8.98e-01

Pos logit contributions
 apologizing+0.893
 BG+0.889
 Ge+0.856
 Ange+0.848
 adjourn+0.844
Neg logit contributions
instein-1.274
kefeller-1.272
oston-1.261
artney-1.256
OF-1.222
Token that
Token ID326
Feature activation-2.53e+00

Pos logit contributions
 apologizing+0.558
 BG+0.555
 Ge+0.533
 Ange+0.531
 adjourn+0.527
Neg logit contributions
instein-0.775
kefeller-0.770
oston-0.766
artney-0.762
OF-0.745
Token bare
Token ID6247
Feature activation-2.02e+00

Pos logit contributions
 apologizing+1.465
 BG+1.448
 Ge+1.389
 Ange+1.373
 adjourn+1.372
Neg logit contributions
kefeller-2.297
instein-2.295
artney-2.275
oston-2.273
OF-2.206
Token minimum
Token ID5288
Feature activation-2.28e+00

Pos logit contributions
 apologizing+1.115
 BG+1.110
 Ge+1.063
 Ange+1.055
 adjourn+1.047
Neg logit contributions
instein-1.875
kefeller-1.859
oston-1.855
artney-1.834
OF-1.804
Token
Token ID851
Feature activation+1.43e-01

Pos logit contributions
 BG+0.915
 apologizing+0.896
 Ge+0.867
 Ange+0.844
Ge+0.822
Neg logit contributions
instein-3.018
kefeller-3.008
oston-2.968
artney-2.958
OF-2.891
Token B
Token ID347
Feature activation-1.11e+00

Pos logit contributions
instein+0.123
oston+0.121
kefeller+0.121
artney+0.119
OF+0.118
Neg logit contributions
 apologizing-0.090
 BG-0.090
 Ge-0.086
 Ange-0.086
 adjourn-0.085
Tokenorm
Token ID579
Feature activation-6.17e-01

Pos logit contributions
 apologizing+0.766
 BG+0.763
 Ge+0.741
iot+0.735
 Ange+0.735
Neg logit contributions
instein-0.873
kefeller-0.867
oston-0.858
artney-0.847
OF-0.826
Tokenes
Token ID274
Feature activation-8.74e-01

Pos logit contributions
 apologizing+0.226
 BG+0.224
 Ge+0.208
 Ange+0.208
 adjourn+0.204
Neg logit contributions
instein-0.690
oston-0.683
kefeller-0.682
artney-0.675
OF-0.668
Token les
Token ID10287
Feature activation+6.53e-01

Pos logit contributions
 BG+0.543
 apologizing+0.542
 Ge+0.522
 Ange+0.519
 adjourn+0.513
Neg logit contributions
instein-0.752
kefeller-0.743
oston-0.743
artney-0.731
OF-0.722
Token Mim
Token ID39312
Feature activation-7.68e+00

Pos logit contributions
instein+0.558
oston+0.551
kefeller+0.549
artney+0.543
OF+0.535
Neg logit contributions
 apologizing-0.410
 BG-0.408
 Ange-0.392
 Ge-0.392
 adjourn-0.388
Tokenos
Token ID418
Feature activation-2.15e+00

Pos logit contributions
 Ge+4.072
 BG+4.052
iot+4.044
 apologizing+3.926
 Ange+3.903
Neg logit contributions
kefeller-6.912
instein-6.835
artney-6.808
oston-6.725
OF-6.711
Tokenas
Token ID292
Feature activation-1.64e+00

Pos logit contributions
 BG+1.428
 apologizing+1.424
 Ge+1.379
 Ange+1.375
 adjourn+1.361
Neg logit contributions
instein-1.745
kefeller-1.724
oston-1.722
artney-1.689
OF-1.670
Token (@
Token ID4275
Feature activation-8.96e-01

Pos logit contributions
 BG+0.837
 apologizing+0.827
 Ge+0.799
 Ange+0.791
 adjourn+0.775
Neg logit contributions
instein-1.610
kefeller-1.601
oston-1.585
artney-1.573
OF-1.541
Tokenb
Token ID65
Feature activation-1.19e+00

Pos logit contributions
 BG+0.596
 apologizing+0.595
 Ge+0.573
 Ange+0.571
Ge+0.565
Neg logit contributions
instein-0.730
kefeller-0.720
oston-0.720
artney-0.709
OF-0.695
Tokenorm
Token ID579
Feature activation+4.11e-01

Pos logit contributions
 apologizing+0.940
 BG+0.933
 Ge+0.908
 Ange+0.902
iot+0.901
Neg logit contributions
instein-0.820
kefeller-0.809
oston-0.803
artney-0.792
OF-0.780
Token x
Token ID2124
Feature activation-2.32e+00

Pos logit contributions
 BG+0.818
 apologizing+0.795
 Ge+0.768
 Ange+0.751
Ge+0.723
Neg logit contributions
instein-2.775
kefeller-2.766
oston-2.749
artney-2.741
OF-2.680
Token.
Token ID13
Feature activation+2.43e-01

Pos logit contributions
 BG+1.320
 apologizing+1.305
 Ge+1.276
 Ange+1.260
Ge+1.245
Neg logit contributions
instein-2.120
kefeller-2.115
artney-2.098
oston-2.094
OF-2.026
Tokenx
Token ID87
Feature activation-1.57e+00

Pos logit contributions
instein+0.250
oston+0.248
kefeller+0.246
artney+0.243
OF+0.242
Neg logit contributions
 apologizing-0.110
 BG-0.108
 Ange-0.103
 Ge-0.102
 adjourn-0.102
Token.
Token ID13
Feature activation-3.15e+00

Pos logit contributions
 BG+0.911
 apologizing+0.907
 Ge+0.878
 Ange+0.871
Ge+0.856
Neg logit contributions
instein-1.413
kefeller-1.404
oston-1.395
artney-1.390
OF-1.353
Tokenx
Token ID87
Feature activation-4.52e+00

Pos logit contributions
 BG+1.246
 apologizing+1.225
 Ge+1.182
 Ange+1.165
Ge+1.159
Neg logit contributions
kefeller-3.405
instein-3.399
artney-3.377
oston-3.369
OF-3.279
Token.
Token ID13
Feature activation-7.67e+00

Pos logit contributions
 BG+2.390
 apologizing+2.347
 Ge+2.316
 Ange+2.276
Ge+2.264
Neg logit contributions
kefeller-4.275
artney-4.269
instein-4.244
oston-4.211
OF-4.067
Token149
Token ID19442
Feature activation-6.54e-01

Pos logit contributions
 apologizing+1.982
iot+1.908
 adjourn+1.852
Ge+1.839
 Ge+1.786
Neg logit contributions
oston-9.108
artney-9.097
kefeller-9.085
instein-8.999
 Levin-8.911
Token 110
Token ID9796
Feature activation-1.21e+00

Pos logit contributions
 BG+0.333
 apologizing+0.331
 Ge+0.318
 Ange+0.315
 adjourn+0.310
Neg logit contributions
instein-0.639
kefeller-0.633
oston-0.631
artney-0.625
OF-0.614
Token22
Token ID1828
Feature activation-2.89e+00

Pos logit contributions
 BG+0.676
 apologizing+0.675
 Ge+0.649
 Ange+0.643
Ge+0.635
Neg logit contributions
instein-1.120
kefeller-1.106
oston-1.104
artney-1.093
OF-1.067
Token26
Token ID2075
Feature activation-1.71e+00

Pos logit contributions
 BG+1.396
 apologizing+1.378
 Ge+1.335
 Ange+1.303
Ge+1.293
Neg logit contributions
instein-2.903
kefeller-2.887
oston-2.859
artney-2.849
OF-2.756
Token x
Token ID2124
Feature activation-2.58e+00

Pos logit contributions
 BG+0.602
 apologizing+0.592
 Ge+0.564
 Ange+0.557
 adjourn+0.539
Neg logit contributions
instein-1.941
kefeller-1.931
oston-1.923
artney-1.913
OF-1.876
TokenICA
Token ID25241
Feature activation-2.75e+00

Pos logit contributions
 apologizing+3.766
 BG+3.753
 Ge+3.677
 Ange+3.660
Ge+3.622
Neg logit contributions
instein-2.443
kefeller-2.403
oston-2.385
artney-2.331
OF-2.219
Token (
Token ID357
Feature activation-3.00e+00

Pos logit contributions
 BG+1.748
 apologizing+1.719
 Ge+1.676
 Ange+1.664
 adjourn+1.621
Neg logit contributions
instein-2.360
kefeller-2.349
oston-2.329
artney-2.301
OF-2.207
TokenAr
Token ID3163
Feature activation+1.58e-01

Pos logit contributions
 BG+2.042
 apologizing+2.033
 Ge+1.978
Ge+1.977
 Ange+1.966
Neg logit contributions
instein-2.386
kefeller-2.365
oston-2.340
artney-2.321
 Levin-2.220
Tokenchie
Token ID3043
Feature activation-3.60e+00

Pos logit contributions
instein+0.135
oston+0.133
kefeller+0.133
artney+0.132
OF+0.130
Neg logit contributions
 apologizing-0.098
 BG-0.098
 Ge-0.094
 Ange-0.094
 adjourn-0.093
Token)
Token ID8
Feature activation-3.85e+00

Pos logit contributions
 BG+2.245
 apologizing+2.217
 Ge+2.188
 Ange+2.175
Ge+2.104
Neg logit contributions
instein-3.060
kefeller-3.040
oston-3.026
artney-2.980
OF-2.927
Token
Token ID784
Feature activation-7.65e+00

Pos logit contributions
 BG+2.526
 apologizing+2.505
 Ge+2.437
 Ange+2.431
Ge+2.367
Neg logit contributions
instein-3.170
kefeller-3.136
oston-3.105
artney-3.088
OF-2.997
Token Here
Token ID3423
Feature activation-8.56e-01

Pos logit contributions
 BG+4.851
 Ange+4.651
 apologizing+4.643
 Ge+4.633
 Pie+4.425
Neg logit contributions
kefeller-6.577
instein-6.358
oston-6.351
artney-6.322
UTC-6.020
Token
Token ID447
Feature activation+3.01e+00

Pos logit contributions
 apologizing+0.533
 BG+0.532
 Ge+0.512
 Ange+0.511
 adjourn+0.500
Neg logit contributions
instein-0.732
kefeller-0.730
oston-0.726
artney-0.715
OF-0.707
Token
Token ID247
Feature activation+1.93e+00

Pos logit contributions
instein+2.358
oston+2.322
kefeller+2.288
artney+2.277
OF+2.258
Neg logit contributions
 apologizing-2.070
 BG-2.056
 Ange-1.995
 Ge-1.988
 adjourn-1.985
Tokens
Token ID82
Feature activation+2.56e-01

Pos logit contributions
instein+1.644
oston+1.624
kefeller+1.611
artney+1.599
OF+1.577
Neg logit contributions
 apologizing-1.207
 BG-1.201
 Ge-1.151
 Ange-1.151
 adjourn-1.145
Token a
Token ID257
Feature activation-3.54e-01

Pos logit contributions
instein+0.214
oston+0.211
kefeller+0.211
artney+0.208
OF+0.205
Neg logit contributions
 apologizing-0.166
 BG-0.166
 Ge-0.159
 Ange-0.159
 adjourn-0.158
Token the
Token ID262
Feature activation-4.20e+00

Pos logit contributions
 BG+1.246
 apologizing+1.228
 Ge+1.203
 Ange+1.182
 adjourn+1.169
Neg logit contributions
instein-1.732
kefeller-1.722
oston-1.717
artney-1.708
OF-1.657
Token j
Token ID474
Feature activation-3.40e+00

Pos logit contributions
 apologizing+2.671
 BG+2.660
 Ge+2.584
 adjourn+2.547
 Ange+2.516
Neg logit contributions
instein-3.516
oston-3.512
kefeller-3.504
artney-3.499
OF-3.423
Tokenagg
Token ID9460
Feature activation+1.86e-01

Pos logit contributions
 apologizing+1.994
 BG+1.984
 Ge+1.899
 Ange+1.891
iot+1.887
Neg logit contributions
kefeller-2.996
instein-2.980
oston-2.962
artney-2.932
OF-2.891
Tokenies
Token ID444
Feature activation-4.23e+00

Pos logit contributions
instein+0.148
oston+0.146
kefeller+0.146
artney+0.144
OF+0.142
Neg logit contributions
 apologizing-0.127
 BG-0.126
 Ge-0.122
 Ange-0.122
 adjourn-0.121
Token since
Token ID1201
Feature activation-8.63e+00

Pos logit contributions
 BG+2.236
 apologizing+2.129
 Ge+2.073
 Ange+2.008
 adjourn+1.977
Neg logit contributions
kefeller-4.116
oston-4.088
instein-4.071
artney-4.014
OF-3.944
Token even
Token ID772
Feature activation-7.65e+00

Pos logit contributions
 BG+3.244
 apologizing+2.902
 Ge+2.778
 Ange+2.765
 adjourn+2.765
Neg logit contributions
oston-9.559
instein-9.526
artney-9.492
kefeller-9.436
OF-9.153
Token though
Token ID996
Feature activation-3.57e+00

Pos logit contributions
 BG+2.993
 apologizing+2.855
 Ange+2.688
 Ge+2.655
iot+2.554
Neg logit contributions
kefeller-8.413
oston-8.280
instein-8.273
artney-8.221
UTC-8.014
Token they
Token ID484
Feature activation-3.37e+00

Pos logit contributions
 BG+2.013
 apologizing+2.002
 Ge+1.955
 Ange+1.895
 adjourn+1.894
Neg logit contributions
kefeller-3.274
oston-3.262
artney-3.259
instein-3.259
OF-3.165
Token are
Token ID389
Feature activation-3.36e+00

Pos logit contributions
 BG+2.216
 apologizing+2.198
 Ge+2.143
 Ange+2.118
 adjourn+2.092
Neg logit contributions
kefeller-2.779
instein-2.762
oston-2.748
artney-2.744
OF-2.664
Token there
Token ID612
Feature activation-3.39e+00

Pos logit contributions
 BG+1.698
 apologizing+1.693
 Ge+1.614
 adjourn+1.595
 Ange+1.585
Neg logit contributions
kefeller-3.319
instein-3.265
oston-3.261
artney-3.248
OF-3.174
Token at
Token ID379
Feature activation+1.10e+00

Pos logit contributions
 BG+1.906
 apologizing+1.870
 Ge+1.814
 Ange+1.773
 adjourn+1.763
Neg logit contributions
kefeller-3.161
instein-3.131
oston-3.114
artney-3.111
OF-3.027
Token x
Token ID2124
Feature activation-2.18e+00

Pos logit contributions
 BG+0.260
 apologizing+0.259
 Ge+0.243
 Ange+0.241
 adjourn+0.235
Neg logit contributions
instein-0.808
kefeller-0.802
oston-0.801
artney-0.794
OF-0.783
Token.
Token ID13
Feature activation+1.35e+00

Pos logit contributions
 BG+1.246
 apologizing+1.234
 Ge+1.205
 Ange+1.190
Ge+1.175
Neg logit contributions
instein-1.990
kefeller-1.985
artney-1.967
oston-1.967
OF-1.903
Tokenx
Token ID87
Feature activation-6.41e-01

Pos logit contributions
instein+1.359
oston+1.347
kefeller+1.329
OF+1.318
artney+1.313
Neg logit contributions
 apologizing-0.643
 BG-0.626
 Ange-0.596
 adjourn-0.594
 Ge-0.589
Token.
Token ID13
Feature activation-1.85e+00

Pos logit contributions
 apologizing+0.383
 BG+0.382
 Ge+0.367
 Ange+0.366
 adjourn+0.362
Neg logit contributions
instein-0.567
oston-0.560
kefeller-0.560
artney-0.553
OF-0.544
Tokenx
Token ID87
Feature activation-3.42e+00

Pos logit contributions
 BG+0.765
 apologizing+0.763
 Ge+0.725
 Ange+0.719
Ge+0.706
Neg logit contributions
instein-1.962
kefeller-1.954
oston-1.944
artney-1.935
OF-1.896
Token.
Token ID13
Feature activation-7.61e+00

Pos logit contributions
 BG+1.862
 apologizing+1.839
 Ge+1.815
 Ange+1.779
Ge+1.775
Neg logit contributions
kefeller-3.192
instein-3.168
artney-3.161
oston-3.139
OF-3.030
Token135
Token ID17059
Feature activation+9.91e-01

Pos logit contributions
Ge+2.811
iot+2.759
 Ge+2.748
 apologizing+2.717
ge+2.668
Neg logit contributions
kefeller-8.269
artney-8.087
oston-8.054
instein-8.021
 Dolphin-7.834
Token 106
Token ID15696
Feature activation+1.77e+00

Pos logit contributions
instein+0.939
oston+0.930
kefeller+0.925
artney+0.916
OF+0.906
Neg logit contributions
 apologizing-0.529
 BG-0.524
 Ange-0.501
 Ge-0.500
 adjourn-0.495
Token19
Token ID1129
Feature activation-1.39e+00

Pos logit contributions
instein+1.616
oston+1.606
kefeller+1.582
OF+1.573
artney+1.566
Neg logit contributions
 apologizing-1.001
 BG-0.980
 Ange-0.945
 adjourn-0.938
 Ge-0.930
Token41
Token ID3901
Feature activation-1.61e+00

Pos logit contributions
 BG+0.710
 apologizing+0.708
 Ge+0.678
 Ange+0.672
 adjourn+0.660
Neg logit contributions
instein-1.353
kefeller-1.340
oston-1.335
artney-1.323
OF-1.295
Token x
Token ID2124
Feature activation-3.41e+00

Pos logit contributions
 BG+0.565
 apologizing+0.556
 Ge+0.530
 Ange+0.524
 adjourn+0.507
Neg logit contributions
instein-1.833
kefeller-1.825
oston-1.818
artney-1.807
OF-1.774
Token
Token ID198
Feature activation-2.03e+00

Pos logit contributions
 BG+0.751
 apologizing+0.748
 Ge+0.725
 Ange+0.719
 adjourn+0.709
Neg logit contributions
instein-1.069
kefeller-1.063
oston-1.057
artney-1.049
OF-1.021
Tokenimpl
Token ID23928
Feature activation-1.33e+00

Pos logit contributions
 apologizing+1.335
 BG+1.332
 Ge+1.286
 Ange+1.285
Ge+1.278
Neg logit contributions
instein-1.648
kefeller-1.645
oston-1.636
artney-1.620
OF-1.569
Token Board
Token ID5926
Feature activation-7.62e-02

Pos logit contributions
 apologizing+0.620
 BG+0.617
 Ge+0.584
 Ange+0.584
 adjourn+0.575
Neg logit contributions
instein-1.354
oston-1.340
kefeller-1.338
artney-1.324
OF-1.308
Token {
Token ID1391
Feature activation-2.49e+00

Pos logit contributions
 apologizing+0.046
 BG+0.046
 Ge+0.044
 Ange+0.044
 adjourn+0.044
Neg logit contributions
instein-0.067
oston-0.066
kefeller-0.066
artney-0.065
OF-0.064
Token pub
Token ID2240
Feature activation-2.12e+00

Pos logit contributions
 BG+1.423
 apologizing+1.419
 Ge+1.365
 Ange+1.359
 adjourn+1.341
Neg logit contributions
instein-2.276
kefeller-2.269
oston-2.253
artney-2.234
OF-2.190
Token fn
Token ID24714
Feature activation-7.60e+00

Pos logit contributions
 BG+0.955
 apologizing+0.947
 Ge+0.903
 Ange+0.900
 adjourn+0.886
Neg logit contributions
instein-2.189
oston-2.176
kefeller-2.173
artney-2.155
OF-2.111
Token copy
Token ID4866
Feature activation-1.08e-01

Pos logit contributions
 BG+4.806
 apologizing+4.756
 Ange+4.635
 Ge+4.621
 adjourn+4.606
Neg logit contributions
kefeller-6.307
instein-6.277
artney-6.231
oston-6.222
OF-6.058
Token_
Token ID62
Feature activation-2.80e-01

Pos logit contributions
 apologizing+0.065
 BG+0.064
 Ge+0.062
 Ange+0.062
 adjourn+0.061
Neg logit contributions
instein-0.096
oston-0.094
kefeller-0.094
artney-0.093
OF-0.092
Tokenboard
Token ID3526
Feature activation+2.95e+00

Pos logit contributions
 apologizing+0.176
 BG+0.175
 Ge+0.168
 Ange+0.168
 adjourn+0.166
Neg logit contributions
instein-0.239
oston-0.236
kefeller-0.235
artney-0.232
OF-0.229
Token (
Token ID357
Feature activation+3.09e+00

Pos logit contributions
instein+2.508
oston+2.489
kefeller+2.460
OF+2.420
artney+2.405
Neg logit contributions
 apologizing-1.850
 BG-1.817
 Ge-1.737
 Ange-1.734
 adjourn-1.729
Token &
Token ID1222
Feature activation+2.08e+00

Pos logit contributions
instein+2.701
oston+2.685
kefeller+2.634
OF+2.615
artney+2.602
Neg logit contributions
 apologizing-1.838
 BG-1.787
 Ange-1.740
 Ge-1.725
cipline-1.703
Token x
Token ID2124
Feature activation-2.58e+00

Pos logit contributions
 BG+0.602
 apologizing+0.592
 Ge+0.564
 Ange+0.557
 adjourn+0.539
Neg logit contributions
instein-1.941
kefeller-1.931
oston-1.923
artney-1.913
OF-1.876
Token.
Token ID13
Feature activation+6.86e-02

Pos logit contributions
 BG+1.454
 apologizing+1.435
 Ge+1.409
 Ange+1.388
Ge+1.373
Neg logit contributions
kefeller-2.375
instein-2.374
artney-2.358
oston-2.346
OF-2.268
Tokenx
Token ID87
Feature activation-1.96e+00

Pos logit contributions
instein+0.071
oston+0.070
kefeller+0.070
artney+0.069
OF+0.069
Neg logit contributions
 apologizing-0.031
 BG-0.030
 Ange-0.029
 Ge-0.029
 adjourn-0.028
Token.
Token ID13
Feature activation-2.63e+00

Pos logit contributions
 BG+1.128
 apologizing+1.119
 Ge+1.088
 Ange+1.076
Ge+1.060
Neg logit contributions
instein-1.782
kefeller-1.776
oston-1.760
artney-1.759
OF-1.706
Tokenx
Token ID87
Feature activation-4.18e+00

Pos logit contributions
 BG+1.049
 apologizing+1.031
 Ge+0.995
 Ange+0.979
Ge+0.973
Neg logit contributions
instein-2.834
kefeller-2.833
artney-2.807
oston-2.806
OF-2.732
Token.
Token ID13
Feature activation-7.58e+00

Pos logit contributions
 BG+2.255
 apologizing+2.202
 Ge+2.176
 Ange+2.133
Ge+2.117
Neg logit contributions
kefeller-3.941
instein-3.922
artney-3.920
oston-3.885
OF-3.747
Token150
Token ID8628
Feature activation-9.57e-01

Pos logit contributions
 apologizing+2.033
 BG+1.992
iot+1.952
 adjourn+1.943
 Ge+1.905
Neg logit contributions
oston-8.912
kefeller-8.883
instein-8.818
artney-8.792
 Levin-8.746
Token 110
Token ID9796
Feature activation+8.07e-01

Pos logit contributions
 BG+0.485
 apologizing+0.482
 Ge+0.463
 Ange+0.458
 adjourn+0.451
Neg logit contributions
instein-0.936
kefeller-0.929
oston-0.925
artney-0.915
OF-0.899
Token81
Token ID6659
Feature activation-1.52e+00

Pos logit contributions
instein+0.746
oston+0.738
kefeller+0.734
artney+0.725
OF+0.722
Neg logit contributions
 apologizing-0.449
 BG-0.444
 Ange-0.426
 Ge-0.423
 adjourn-0.422
Token12
Token ID1065
Feature activation-2.26e+00

Pos logit contributions
 BG+0.740
 apologizing+0.733
 Ge+0.706
 Ange+0.696
Ge+0.684
Neg logit contributions
instein-1.518
kefeller-1.509
oston-1.499
artney-1.487
OF-1.451
Token x
Token ID2124
Feature activation-2.26e+00

Pos logit contributions
 BG+0.778
 apologizing+0.761
 Ge+0.731
 Ange+0.717
 adjourn+0.693
Neg logit contributions
instein-2.586
kefeller-2.577
oston-2.566
artney-2.552
OF-2.498
Token x
Token ID2124
Feature activation-3.43e+00

Pos logit contributions
 BG+1.083
 apologizing+1.038
 Ge+1.021
 Ange+0.985
Ge+0.944
Neg logit contributions
instein-3.805
kefeller-3.803
oston-3.778
artney-3.771
OF-3.668
Token.
Token ID13
Feature activation-1.01e+00

Pos logit contributions
 BG+1.886
 apologizing+1.849
 Ge+1.834
 Ange+1.798
Ge+1.787
Neg logit contributions
kefeller-3.204
instein-3.190
artney-3.185
oston-3.151
OF-3.042
Tokenx
Token ID87
Feature activation-3.27e+00

Pos logit contributions
 BG+0.431
 apologizing+0.431
 Ge+0.407
 Ange+0.405
 adjourn+0.397
Neg logit contributions
instein-1.065
kefeller-1.055
oston-1.054
artney-1.045
OF-1.028
Token.
Token ID13
Feature activation-4.14e+00

Pos logit contributions
 BG+1.803
 apologizing+1.765
 Ge+1.748
 Ange+1.712
Ge+1.699
Neg logit contributions
kefeller-3.048
instein-3.038
artney-3.028
oston-2.999
OF-2.900
Tokenx
Token ID87
Feature activation-5.58e+00

Pos logit contributions
 BG+1.534
 apologizing+1.475
 Ge+1.467
Ge+1.444
 Ange+1.416
Neg logit contributions
kefeller-4.582
instein-4.559
artney-4.549
oston-4.511
OF-4.380
Token.
Token ID13
Feature activation-7.52e+00

Pos logit contributions
 BG+2.880
 Ge+2.797
 apologizing+2.763
 Ange+2.727
Ge+2.716
Neg logit contributions
kefeller-5.380
artney-5.355
instein-5.330
oston-5.259
OF-5.079
Token152
Token ID17827
Feature activation-9.79e-01

Pos logit contributions
iot+2.569
 apologizing+2.525
 BG+2.473
 Ge+2.437
 Ange+2.384
Neg logit contributions
kefeller-8.309
instein-8.291
oston-8.256
artney-8.154
 Levin-8.043
Token 101
Token ID8949
Feature activation+1.55e-01

Pos logit contributions
 BG+0.499
 apologizing+0.495
 Ge+0.476
 Ange+0.471
 adjourn+0.463
Neg logit contributions
instein-0.955
kefeller-0.948
oston-0.944
artney-0.935
OF-0.919
Token8
Token ID23
Feature activation-1.95e-01

Pos logit contributions
instein+0.144
oston+0.142
kefeller+0.142
artney+0.140
OF+0.139
Neg logit contributions
 apologizing-0.086
 BG-0.085
 Ange-0.082
 Ge-0.081
 adjourn-0.081
Token604
Token ID31916
Feature activation-8.92e-01

Pos logit contributions
 apologizing+0.115
 BG+0.115
 Ge+0.110
 Ange+0.110
 adjourn+0.109
Neg logit contributions
instein-0.173
oston-0.171
kefeller-0.171
artney-0.169
OF-0.166
Token x
Token ID2124
Feature activation-2.44e+00

Pos logit contributions
 BG+0.320
 apologizing+0.317
 Ge+0.299
 Ange+0.297
 adjourn+0.289
Neg logit contributions
instein-1.004
kefeller-0.996
oston-0.995
artney-0.987
OF-0.971
Token |
Token ID930
Feature activation+3.86e+00

Pos logit contributions
instein+1.346
oston+1.329
kefeller+1.326
artney+1.308
OF+1.285
Neg logit contributions
 apologizing-1.148
 BG-1.145
 Ge-1.104
 Ange-1.103
 adjourn-1.093
Token |
Token ID930
Feature activation+2.17e+00

Pos logit contributions
instein+3.152
oston+3.097
kefeller+3.080
artney+3.048
OF+3.019
Neg logit contributions
 apologizing-2.529
 BG-2.509
 Ange-2.416
 Ge-2.401
iot-2.385
Token ----
Token ID13498
Feature activation-1.81e+00

Pos logit contributions
instein+2.058
oston+2.031
kefeller+2.024
artney+2.004
OF+1.982
Neg logit contributions
 apologizing-1.159
 BG-1.150
 Ange-1.097
 Ge-1.093
 adjourn-1.080
Token-+
Token ID19529
Feature activation+1.01e+01

Pos logit contributions
 BG+1.194
 apologizing+1.190
 Ge+1.157
 Ange+1.151
 adjourn+1.137
Neg logit contributions
instein-1.484
kefeller-1.483
oston-1.469
artney-1.452
OF-1.408
Token--------------------------------
Token ID3880
Feature activation+2.66e+00

Pos logit contributions
instein+5.415
oston+5.327
OF+5.251
UTC+5.137
artney+5.093
Neg logit contributions
 apologizing-8.716
 BG-8.705
cipline-8.364
 Ange-8.353
iot-8.327
Token+
Token ID10
Feature activation+8.32e+00

Pos logit contributions
instein+1.897
oston+1.873
kefeller+1.854
artney+1.832
OF+1.813
Neg logit contributions
 apologizing-2.038
 BG-2.024
 Ange-1.955
 Ge-1.952
 adjourn-1.936
Token-------
Token ID26866
Feature activation+3.76e+00

Pos logit contributions
instein+5.106
oston+5.023
OF+4.892
artney+4.877
UTC+4.852
Neg logit contributions
 apologizing-6.794
 BG-6.751
 Ange-6.496
cipline-6.459
 Ge-6.448
Token|
Token ID91
Feature activation+5.48e+00

Pos logit contributions
instein+2.633
oston+2.596
kefeller+2.566
artney+2.526
OF+2.524
Neg logit contributions
 apologizing-2.904
 BG-2.883
 Ange-2.782
 Ge-2.775
 adjourn-2.765
Token |
Token ID930
Feature activation+8.10e-01

Pos logit contributions
instein+4.156
oston+4.099
kefeller+4.034
artney+4.026
OF+3.996
Neg logit contributions
 apologizing-3.868
 BG-3.823
 Ange-3.684
 Ge-3.671
iot-3.642
Token 1
Token ID352
Feature activation-1.01e+00

Pos logit contributions
instein+0.757
oston+0.749
kefeller+0.748
artney+0.738
OF+0.729
Neg logit contributions
 apologizing-0.445
 BG-0.443
 Ge-0.424
 Ange-0.422
 adjourn-0.418
Token |
Token ID930
Feature activation+1.29e+00

Pos logit contributions
 BG+0.599
 apologizing+0.596
 Ge+0.575
 Ange+0.571
 adjourn+0.565
Neg logit contributions
instein-0.892
kefeller-0.881
oston-0.878
artney-0.871
OF-0.852
Token Suzuki
Token ID35807
Feature activation-4.03e-01

Pos logit contributions
instein+0.925
oston+0.902
kefeller+0.897
artney+0.872
OF+0.842
Neg logit contributions
 apologizing-2.570
 BG-2.565
 Ge-2.507
 Ange-2.506
 adjourn-2.489
Token,
Token ID11
Feature activation+6.79e-01

Pos logit contributions
 BG+0.235
 apologizing+0.234
 Ge+0.226
 Ange+0.225
 adjourn+0.220
Neg logit contributions
instein-0.364
kefeller-0.361
oston-0.359
artney-0.356
OF-0.348
Token The
Token ID383
Feature activation-4.45e+00

Pos logit contributions
instein+0.537
oston+0.529
kefeller+0.529
artney+0.521
OF+0.512
Neg logit contributions
 apologizing-0.469
 BG-0.469
 Ge-0.453
 Ange-0.452
 adjourn-0.446
Token Chronicle
Token ID14160
Feature activation+1.20e+00

Pos logit contributions
 Ge+1.683
 BG+1.668
 apologizing+1.588
Ge+1.565
 Ange+1.537
Neg logit contributions
instein-4.913
artney-4.895
kefeller-4.892
oston-4.819
OF-4.710
Token 5
Token ID642
Feature activation-3.83e-01

Pos logit contributions
instein+0.893
kefeller+0.888
oston+0.876
artney+0.876
OF+0.849
Neg logit contributions
 BG-0.895
 apologizing-0.884
 Ge-0.870
 Ange-0.865
 adjourn-0.844
Token of
Token ID286
Feature activation+6.38e+00

Pos logit contributions
 apologizing+0.225
 BG+0.224
 Ge+0.216
 Ange+0.215
 adjourn+0.212
Neg logit contributions
instein-0.342
kefeller-0.338
oston-0.338
artney-0.335
OF-0.327
Token 15
Token ID1315
Feature activation+3.09e+00

Pos logit contributions
oston+5.180
instein+5.127
OF+4.980
kefeller+4.960
artney+4.923
Neg logit contributions
 apologizing-4.180
 BG-3.977
 adjourn-3.860
cipline-3.807
iot-3.798
Token Photo
Token ID5555
Feature activation-1.96e+00

Pos logit contributions
instein+2.813
oston+2.781
kefeller+2.759
artney+2.730
OF+2.711
Neg logit contributions
 apologizing-1.762
 BG-1.733
 Ange-1.656
 adjourn-1.654
 Ge-1.647
Token:
Token ID25
Feature activation+3.02e+00

Pos logit contributions
 BG+0.996
 apologizing+0.988
 Ge+0.960
 Ange+0.946
Ge+0.931
Neg logit contributions
instein-1.911
kefeller-1.901
oston-1.886
artney-1.884
OF-1.829
Token Erin
Token ID28894
Feature activation+1.14e+00

Pos logit contributions
instein+2.537
oston+2.522
kefeller+2.503
artney+2.466
OF+2.446
Neg logit contributions
 apologizing-1.918
 BG-1.873
 adjourn-1.803
 Ange-1.779
cipline-1.778
Token Bre
Token ID3719
Feature activation-3.58e-01

Pos logit contributions
instein+0.941
oston+0.929
kefeller+0.927
artney+0.914
OF+0.901
Neg logit contributions
 apologizing-0.752
 BG-0.749
 Ge-0.721
 Ange-0.721
 adjourn-0.713
Token c
Token ID269
Feature activation+1.36e+00

Pos logit contributions
instein+2.004
oston+1.981
kefeller+1.944
artney+1.919
OF+1.897
Neg logit contributions
 apologizing-2.339
 BG-2.312
 Ange-2.260
 Ge-2.238
Ge-2.229
Tokenv
Token ID85
Feature activation-3.40e+00

Pos logit contributions
instein+0.870
oston+0.856
kefeller+0.846
artney+0.833
OF+0.820
Neg logit contributions
 apologizing-1.147
 BG-1.139
 Ange-1.108
 Ge-1.104
 adjourn-1.099
Token2
Token ID17
Feature activation-1.72e+00

Pos logit contributions
 BG+1.984
 apologizing+1.955
 Ge+1.912
Ge+1.884
 Ange+1.875
Neg logit contributions
instein-3.054
kefeller-3.044
artney-3.013
oston-3.003
OF-2.893
Token.
Token ID764
Feature activation-5.64e+00

Pos logit contributions
 BG+0.975
 apologizing+0.963
 Ge+0.937
 Ange+0.921
Ge+0.909
Neg logit contributions
instein-1.585
kefeller-1.574
artney-1.559
oston-1.559
OF-1.510
Token COL
Token ID20444
Feature activation-1.66e+00

Pos logit contributions
 BG+3.228
 apologizing+3.078
 Ge+3.049
Ge+2.920
 adjourn+2.897
Neg logit contributions
instein-5.208
kefeller-5.194
oston-5.148
artney-5.095
OF-4.950
TokenOR
Token ID1581
Feature activation+7.71e+00

Pos logit contributions
 BG+1.097
 apologizing+1.091
 Ge+1.057
 Ange+1.047
Ge+1.037
Neg logit contributions
instein-1.362
kefeller-1.345
oston-1.344
artney-1.328
OF-1.287
Token_
Token ID62
Feature activation+2.29e+00

Pos logit contributions
OF+4.948
UTC+4.866
oston+4.713
instein+4.622
 Levin+4.451
Neg logit contributions
 apologizing-6.297
 adjourn-5.993
 looted-5.899
 Ange-5.898
 substitutes-5.881
TokenB
Token ID33
Feature activation-4.84e-01

Pos logit contributions
OF+1.842
instein+1.837
oston+1.817
kefeller+1.791
UTC+1.790
Neg logit contributions
 apologizing-1.531
 Ange-1.472
 BG-1.468
 adjourn-1.463
iot-1.443
TokenGR
Token ID10761
Feature activation+1.39e+00

Pos logit contributions
 apologizing+0.284
 BG+0.281
 Ange+0.270
 Ge+0.269
 adjourn+0.268
Neg logit contributions
instein-0.434
oston-0.430
kefeller-0.427
artney-0.422
OF-0.420
Token2
Token ID17
Feature activation-1.66e+00

Pos logit contributions
instein+1.106
oston+1.104
OF+1.099
kefeller+1.082
UTC+1.071
Neg logit contributions
 apologizing-0.941
 BG-0.911
 adjourn-0.897
 Ange-0.894
iot-0.877
TokenG
Token ID38
Feature activation-8.70e-01

Pos logit contributions
 BG+0.821
 apologizing+0.810
 Ge+0.779
 Ange+0.770
Ge+0.763
Neg logit contributions
instein-1.660
kefeller-1.640
oston-1.636
artney-1.621
OF-1.577
Token file
Token ID2393
Feature activation-1.44e+00

Pos logit contributions
 apologizing+2.506
 BG+2.483
 Ge+2.451
 Ange+2.370
iot+2.358
Neg logit contributions
kefeller-3.138
instein-3.131
oston-3.080
artney-3.060
OF-2.978
Token.
Token ID13
Feature activation-2.68e+00

Pos logit contributions
 apologizing+0.837
 BG+0.831
 Ge+0.802
 Ange+0.796
 adjourn+0.783
Neg logit contributions
instein-1.298
kefeller-1.283
oston-1.280
artney-1.266
OF-1.243
Token
Token ID198
Feature activation-3.34e+00

Pos logit contributions
 apologizing+1.607
 BG+1.605
 Ge+1.575
 Ange+1.544
Ge+1.510
Neg logit contributions
instein-2.372
kefeller-2.366
oston-2.348
artney-2.327
OF-2.249
Token
Token ID198
Feature activation-1.67e+00

Pos logit contributions
 BG+1.896
 Ge+1.861
 apologizing+1.859
 Ange+1.808
Ge+1.807
Neg logit contributions
kefeller-3.079
instein-3.048
artney-3.040
oston-3.018
OF-2.894
Tokenfile
Token ID7753
Feature activation+3.51e+00

Pos logit contributions
 apologizing+1.245
 BG+1.230
 Ge+1.204
Ge+1.201
 Ange+1.191
Neg logit contributions
kefeller-1.225
instein-1.222
oston-1.211
artney-1.190
OF-1.150
Token and
Token ID290
Feature activation+8.32e+00

Pos logit contributions
instein+2.931
oston+2.892
kefeller+2.846
OF+2.807
artney+2.801
Neg logit contributions
 apologizing-2.226
 BG-2.190
 Ange-2.135
 adjourn-2.098
Ge-2.078
Token creates
Token ID8075
Feature activation-9.65e-01

Pos logit contributions
oston+6.246
kefeller+6.208
instein+6.203
artney+6.138
UTC+6.049
Neg logit contributions
 apologizing-5.610
 BG-5.555
cipline-5.396
 Ange-5.340
 adjourn-5.338
Token corresponding
Token ID11188
Feature activation+3.99e+00

Pos logit contributions
 apologizing+0.502
 BG+0.499
 Ange+0.475
 Ge+0.475
 adjourn+0.469
Neg logit contributions
instein-0.930
oston-0.920
kefeller-0.917
artney-0.906
OF-0.896
Token templates
Token ID24019
Feature activation+1.53e+00

Pos logit contributions
instein+2.988
oston+2.946
kefeller+2.914
artney+2.908
OF+2.871
Neg logit contributions
 apologizing-2.855
 BG-2.835
 Ange-2.739
 adjourn-2.716
cipline-2.708
Token.
Token ID13
Feature activation+5.36e-01

Pos logit contributions
instein+1.296
oston+1.282
kefeller+1.266
OF+1.250
artney+1.249
Neg logit contributions
 apologizing-0.968
 BG-0.956
 Ange-0.923
 adjourn-0.917
 Ge-0.911
Token This
Token ID770
Feature activation-9.38e-01

Pos logit contributions
instein+0.428
kefeller+0.423
oston+0.421
artney+0.417
OF+0.407
Neg logit contributions
 BG-0.370
 apologizing-0.368
 Ge-0.358
 Ange-0.355
 adjourn-0.350
Token (
Token ID357
Feature activation+3.19e+00

Pos logit contributions
instein+1.412
oston+1.399
kefeller+1.387
OF+1.369
artney+1.365
Neg logit contributions
 apologizing-1.096
 BG-1.075
 Ange-1.044
 adjourn-1.035
 Ge-1.033
Token gray
Token ID12768
Feature activation+2.39e+00

Pos logit contributions
instein+2.537
oston+2.483
kefeller+2.430
artney+2.391
UTC+2.371
Neg logit contributions
 apologizing-2.132
 BG-2.098
 Ange-2.077
cipline-2.050
iot-2.047
Token,
Token ID837
Feature activation+2.41e+00

Pos logit contributions
instein+1.801
oston+1.788
kefeller+1.749
artney+1.729
OF+1.727
Neg logit contributions
 apologizing-1.735
 BG-1.700
 Ange-1.662
 adjourn-1.648
 Ge-1.644
Token (
Token ID357
Feature activation+5.71e+00

Pos logit contributions
instein+1.964
oston+1.940
kefeller+1.905
artney+1.884
OF+1.875
Neg logit contributions
 apologizing-1.587
 BG-1.567
 Ange-1.518
 Ge-1.502
 adjourn-1.494
Token 3
Token ID513
Feature activation+3.87e+00

Pos logit contributions
instein+3.823
oston+3.787
OF+3.590
kefeller+3.576
artney+3.561
Neg logit contributions
 apologizing-4.590
 BG-4.512
 Ange-4.381
 adjourn-4.377
cipline-4.341
Token,
Token ID837
Feature activation+6.77e+00

Pos logit contributions
oston+2.777
instein+2.772
OF+2.719
kefeller+2.671
UTC+2.661
Neg logit contributions
 apologizing-2.909
 BG-2.819
 Ange-2.773
 adjourn-2.750
iot-2.716
Token 3
Token ID513
Feature activation+4.24e+00

Pos logit contributions
instein+4.280
oston+4.256
OF+4.030
 Levin+4.021
kefeller+3.949
Neg logit contributions
 apologizing-5.593
 BG-5.470
 Ange-5.332
 adjourn-5.319
cipline-5.304
Token )
Token ID1267
Feature activation+1.57e+00

Pos logit contributions
instein+2.943
oston+2.914
kefeller+2.852
OF+2.836
artney+2.822
Neg logit contributions
 apologizing-3.297
 BG-3.223
 Ange-3.177
 adjourn-3.133
 Ge-3.122
Token,
Token ID837
Feature activation+3.29e+00

Pos logit contributions
instein+1.212
oston+1.198
kefeller+1.184
artney+1.172
OF+1.164
Neg logit contributions
 apologizing-1.118
 BG-1.104
 Ange-1.076
 Ge-1.065
 adjourn-1.062
Token 0
Token ID657
Feature activation+1.71e+00

Pos logit contributions
instein+2.689
oston+2.652
kefeller+2.580
OF+2.573
artney+2.553
Neg logit contributions
 apologizing-2.150
 BG-2.100
 Ange-2.060
 Ge-2.011
cipline-2.010
Token )
Token ID1267
Feature activation+2.03e+00

Pos logit contributions
instein+1.345
oston+1.331
kefeller+1.308
OF+1.296
artney+1.294
Neg logit contributions
 apologizing-1.183
 BG-1.165
 Ange-1.135
 Ge-1.121
 adjourn-1.119
TokenON
Token ID1340
Feature activation-5.89e-02

Pos logit contributions
instein+1.175
oston+1.158
kefeller+1.153
artney+1.139
OF+1.130
Neg logit contributions
 apologizing-0.897
 BG-0.891
 Ange-0.859
 Ge-0.856
 adjourn-0.849
Token.
Token ID13
Feature activation+4.95e-01

Pos logit contributions
 apologizing+0.037
 BG+0.037
 Ge+0.035
 Ange+0.035
 adjourn+0.035
Neg logit contributions
instein-0.051
oston-0.050
kefeller-0.050
artney-0.049
OF-0.048
TokenB
Token ID33
Feature activation+6.33e-01

Pos logit contributions
instein+0.396
oston+0.391
kefeller+0.389
artney+0.384
OF+0.379
Neg logit contributions
 apologizing-0.337
 BG-0.336
 Ange-0.324
 Ge-0.323
 adjourn-0.321
Token)
Token ID8
Feature activation-1.93e+00

Pos logit contributions
instein+0.513
oston+0.507
kefeller+0.504
artney+0.497
OF+0.493
Neg logit contributions
 apologizing-0.424
 BG-0.423
 Ange-0.408
 Ge-0.406
 adjourn-0.403
Token
Token ID198
Feature activation+1.12e+00

Pos logit contributions
 apologizing+1.115
 BG+1.113
 Ge+1.070
 Ange+1.059
 adjourn+1.049
Neg logit contributions
instein-1.751
oston-1.733
kefeller-1.730
artney-1.712
OF-1.678
Token
Token ID198
Feature activation+1.77e+00

Pos logit contributions
instein+0.917
oston+0.906
kefeller+0.894
OF+0.883
artney+0.882
Neg logit contributions
 apologizing-0.743
 BG-0.732
 Ange-0.706
 adjourn-0.700
 Ge-0.700
Token37
Token ID2718
Feature activation+1.13e+00

Pos logit contributions
instein+1.515
oston+1.493
kefeller+1.481
artney+1.467
OF+1.455
Neg logit contributions
 apologizing-1.106
 BG-1.102
 Ange-1.058
 Ge-1.055
 adjourn-1.046
Token62
Token ID5237
Feature activation+5.61e-01

Pos logit contributions
instein+0.982
oston+0.970
kefeller+0.964
artney+0.954
OF+0.942
Neg logit contributions
 apologizing-0.693
 BG-0.689
 Ange-0.662
 Ge-0.660
 adjourn-0.654
Tokenc
Token ID66
Feature activation-9.05e-01

Pos logit contributions
instein+0.462
oston+0.456
kefeller+0.455
artney+0.449
OF+0.443
Neg logit contributions
 apologizing-0.370
 BG-0.368
 Ge-0.355
 Ange-0.354
 adjourn-0.351
Token5
Token ID20
Feature activation+2.95e+00

Pos logit contributions
 apologizing+0.550
 BG+0.549
 Ge+0.527
 Ange+0.524
 adjourn+0.519
Neg logit contributions
instein-0.788
kefeller-0.779
oston-0.779
artney-0.768
OF-0.757
Token378
Token ID30695
Feature activation-3.54e-01

Pos logit contributions
instein+2.428
oston+2.393
kefeller+2.359
artney+2.346
OF+2.314
Neg logit contributions
 apologizing-1.946
 BG-1.929
 Ange-1.860
 Ge-1.850
 adjourn-1.841
Token
Token ID198
Feature activation+2.12e-02

Pos logit contributions
instein+0.368
kefeller+0.364
oston+0.363
artney+0.358
OF+0.350
Neg logit contributions
 BG-0.298
 apologizing-0.297
 Ge-0.288
 Ange-0.287
 adjourn-0.281
Token
Token ID198
Feature activation+4.75e-01

Pos logit contributions
instein+0.018
oston+0.018
kefeller+0.018
artney+0.018
OF+0.017
Neg logit contributions
 apologizing-0.013
 BG-0.013
 Ge-0.013
 Ange-0.013
 adjourn-0.013
TokenSome
Token ID4366
Feature activation-4.36e-01

Pos logit contributions
instein+0.359
kefeller+0.356
oston+0.355
artney+0.350
OF+0.340
Neg logit contributions
 apologizing-0.343
 BG-0.342
 Ge-0.332
 Ange-0.331
Ge-0.329
Token researchers
Token ID4837
Feature activation+1.82e+00

Pos logit contributions
 apologizing+0.265
 BG+0.265
 Ge+0.256
 Ange+0.253
 adjourn+0.251
Neg logit contributions
instein-0.381
kefeller-0.379
oston-0.378
artney-0.372
OF-0.366
Token (
Token ID357
Feature activation+1.58e+00

Pos logit contributions
instein+1.574
oston+1.543
kefeller+1.534
artney+1.521
OF+1.501
Neg logit contributions
 apologizing-1.121
 BG-1.119
 Ange-1.073
 Ge-1.067
 adjourn-1.054
Tokenfor
Token ID1640
Feature activation+1.57e+00

Pos logit contributions
instein+1.267
oston+1.249
kefeller+1.239
artney+1.227
OF+1.218
Neg logit contributions
 apologizing-1.077
 BG-1.068
 Ange-1.027
 adjourn-1.025
 Ge-1.024
Token example
Token ID1672
Feature activation-1.97e+00

Pos logit contributions
instein+1.303
oston+1.286
kefeller+1.278
artney+1.263
OF+1.244
Neg logit contributions
 apologizing-1.018
 BG-1.011
 Ange-0.971
 Ge-0.969
 adjourn-0.965
Token,
Token ID11
Feature activation-6.98e-01

Pos logit contributions
 BG+1.224
 apologizing+1.221
 Ge+1.189
 Ange+1.178
 adjourn+1.153
Neg logit contributions
instein-1.699
kefeller-1.684
oston-1.680
artney-1.660
OF-1.627
Token Victor
Token ID12622
Feature activation-1.15e+00

Pos logit contributions
 apologizing+0.460
 BG+0.460
 Ge+0.445
 Ange+0.443
 adjourn+0.436
Neg logit contributions
instein-0.575
oston-0.568
kefeller-0.568
artney-0.560
OF-0.550
Token Su
Token ID1778
Feature activation+7.53e-01

Pos logit contributions
 BG+0.729
 apologizing+0.727
 Ge+0.707
 Ange+0.703
 adjourn+0.690
Neg logit contributions
instein-0.976
oston-0.968
kefeller-0.966
artney-0.953
OF-0.937
Tokenvor
Token ID20867
Feature activation+6.14e-02

Pos logit contributions
instein+0.563
oston+0.553
kefeller+0.551
artney+0.542
OF+0.533
Neg logit contributions
 apologizing-0.558
 BG-0.554
 Ange-0.536
 Ge-0.532
 adjourn-0.531
Token the
Token ID262
Feature activation+1.06e+00

Pos logit contributions
 apologizing+0.581
 BG+0.579
 Ge+0.556
 Ange+0.555
 adjourn+0.548
Neg logit contributions
instein-0.843
oston-0.832
kefeller-0.831
artney-0.821
OF-0.809
Token dram
Token ID4850
Feature activation-4.43e-01

Pos logit contributions
instein+0.768
oston+0.757
kefeller+0.753
artney+0.743
OF+0.732
Neg logit contributions
 apologizing-0.804
 BG-0.800
 Ange-0.775
 Ge-0.773
 adjourn-0.768
Tokenat
Token ID265
Feature activation-6.40e-01

Pos logit contributions
 apologizing+0.337
 BG+0.336
 Ge+0.325
 Ange+0.325
 adjourn+0.322
Neg logit contributions
instein-0.320
oston-0.314
kefeller-0.314
artney-0.309
OF-0.304
Tokenization
Token ID1634
Feature activation+2.37e+00

Pos logit contributions
 apologizing+0.452
 BG+0.450
 Ge+0.433
 Ange+0.430
 adjourn+0.427
Neg logit contributions
instein-0.498
kefeller-0.491
oston-0.489
artney-0.485
OF-0.476
Token of
Token ID286
Feature activation+1.11e+00

Pos logit contributions
instein+2.009
oston+1.983
kefeller+1.972
artney+1.950
OF+1.937
Neg logit contributions
 apologizing-1.488
 BG-1.477
 Ange-1.425
 Ge-1.415
 adjourn-1.409
Token O
Token ID440
Feature activation+2.93e+00

Pos logit contributions
instein+0.969
oston+0.960
kefeller+0.951
artney+0.942
OF+0.932
Neg logit contributions
 apologizing-0.674
 BG-0.669
 Ange-0.644
 Ge-0.640
 adjourn-0.636
Token.
Token ID13
Feature activation-1.89e+00

Pos logit contributions
instein+2.455
oston+2.446
kefeller+2.399
OF+2.377
artney+2.359
Neg logit contributions
 apologizing-1.875
 BG-1.867
 Ange-1.803
 adjourn-1.777
 Ge-1.764
TokenJ
Token ID41
Feature activation+6.01e-01

Pos logit contributions
 BG+1.256
 apologizing+1.254
 Ge+1.222
Ge+1.202
 Ange+1.202
Neg logit contributions
instein-1.546
kefeller-1.532
oston-1.518
artney-1.513
OF-1.455
Token.
Token ID13
Feature activation-1.06e+00

Pos logit contributions
instein+0.511
oston+0.505
kefeller+0.504
artney+0.498
OF+0.491
Neg logit contributions
 apologizing-0.379
 BG-0.377
 Ge-0.363
 Ange-0.362
 adjourn-0.358
Token Simpson
Token ID20531
Feature activation-1.17e+00

Pos logit contributions
 apologizing+0.670
 BG+0.667
 Ge+0.644
 Ange+0.635
 adjourn+0.628
Neg logit contributions
instein-0.912
oston-0.899
kefeller-0.898
artney-0.887
OF-0.873
Token
Token ID447
Feature activation-6.41e-01

Pos logit contributions
 apologizing+0.729
 BG+0.724
 Ge+0.697
 Ange+0.690
 adjourn+0.681
Neg logit contributions
instein-1.005
kefeller-0.997
oston-0.994
artney-0.983
OF-0.968
Token and
Token ID290
Feature activation-1.57e+00

Pos logit contributions
 BG+1.043
 apologizing+1.039
 Ge+0.999
 Ange+0.992
 adjourn+0.981
Neg logit contributions
instein-1.424
kefeller-1.409
oston-1.403
artney-1.386
OF-1.358
Token I
Token ID314
Feature activation+1.85e-01

Pos logit contributions
 BG+0.955
 apologizing+0.947
 Ge+0.910
 Ange+0.905
 adjourn+0.894
Neg logit contributions
instein-1.385
kefeller-1.371
oston-1.370
artney-1.355
OF-1.327
Token'm
Token ID1101
Feature activation+1.05e+00

Pos logit contributions
instein+0.142
oston+0.140
kefeller+0.140
artney+0.138
OF+0.136
Neg logit contributions
 apologizing-0.131
 BG-0.131
 Ge-0.126
 Ange-0.126
 adjourn-0.125
Token still
Token ID991
Feature activation+7.02e-01

Pos logit contributions
instein+0.913
oston+0.902
kefeller+0.896
artney+0.886
OF+0.877
Neg logit contributions
 apologizing-0.635
 BG-0.633
 Ange-0.608
 Ge-0.607
 adjourn-0.600
Token trying
Token ID2111
Feature activation+9.05e-01

Pos logit contributions
instein+0.593
oston+0.586
kefeller+0.586
artney+0.578
OF+0.568
Neg logit contributions
 apologizing-0.450
 BG-0.447
 Ge-0.429
 Ange-0.429
 adjourn-0.425
Token to
Token ID284
Feature activation+1.50e+00

Pos logit contributions
instein+0.749
oston+0.740
kefeller+0.736
artney+0.726
OF+0.717
Neg logit contributions
 apologizing-0.590
 BG-0.585
 Ange-0.564
 Ge-0.563
 adjourn-0.560
Token figure
Token ID3785
Feature activation-6.90e-01

Pos logit contributions
instein+1.297
oston+1.277
kefeller+1.271
artney+1.256
OF+1.243
Neg logit contributions
 apologizing-0.930
 BG-0.921
 Ange-0.887
 Ge-0.884
 adjourn-0.874
Token out
Token ID503
Feature activation+1.19e-01

Pos logit contributions
 apologizing+0.422
 BG+0.422
 Ge+0.404
 Ange+0.404
 adjourn+0.399
Neg logit contributions
instein-0.603
kefeller-0.596
oston-0.595
artney-0.588
OF-0.579
Token a
Token ID257
Feature activation-2.99e-01

Pos logit contributions
instein+0.102
oston+0.101
kefeller+0.101
artney+0.099
OF+0.098
Neg logit contributions
 apologizing-0.075
 BG-0.075
 Ge-0.072
 Ange-0.072
 adjourn-0.071
Token few
Token ID1178
Feature activation+1.13e+00

Pos logit contributions
 apologizing+0.170
 BG+0.170
 Ge+0.163
 Ange+0.162
 adjourn+0.160
Neg logit contributions
instein-0.273
oston-0.270
kefeller-0.270
artney-0.266
OF-0.263
Token things
Token ID1243
Feature activation+4.04e-01

Pos logit contributions
instein+0.964
oston+0.953
kefeller+0.946
artney+0.935
OF+0.926
Neg logit contributions
 apologizing-0.713
 BG-0.707
 Ange-0.681
 Ge-0.679
 adjourn-0.672
Token rewarded
Token ID20945
Feature activation-1.21e+00

Pos logit contributions
instein+0.193
kefeller+0.192
oston+0.191
artney+0.189
OF+0.186
Neg logit contributions
 apologizing-0.130
 BG-0.129
 Ge-0.124
 Ange-0.123
 adjourn-0.122
Token the
Token ID262
Feature activation-1.68e+00

Pos logit contributions
 apologizing+0.695
 BG+0.689
 Ge+0.665
 Ange+0.659
 adjourn+0.652
Neg logit contributions
instein-1.098
kefeller-1.087
oston-1.085
artney-1.076
OF-1.055
Token Bills
Token ID20209
Feature activation+1.17e+00

Pos logit contributions
 apologizing+1.101
 BG+1.095
 Ge+1.063
 Ange+1.049
 adjourn+1.041
Neg logit contributions
instein-1.400
kefeller-1.387
oston-1.375
artney-1.361
OF-1.323
Token with
Token ID351
Feature activation-1.57e+00

Pos logit contributions
instein+1.039
oston+1.028
kefeller+1.024
artney+1.012
OF+1.000
Neg logit contributions
 apologizing-0.688
 BG-0.686
 Ange-0.657
 Ge-0.655
 adjourn-0.649
Token two
Token ID734
Feature activation-1.05e+00

Pos logit contributions
 apologizing+0.975
 BG+0.971
 Ge+0.938
 Ange+0.933
 adjourn+0.921
Neg logit contributions
instein-1.358
kefeller-1.341
oston-1.337
artney-1.328
OF-1.299
Token years
Token ID812
Feature activation+1.56e+00

Pos logit contributions
 BG+0.700
 apologizing+0.700
 Ge+0.676
 Ange+0.674
 adjourn+0.666
Neg logit contributions
instein-0.863
kefeller-0.854
oston-0.849
artney-0.843
OF-0.817
Token of
Token ID286
Feature activation+1.95e+00

Pos logit contributions
instein+1.373
oston+1.357
kefeller+1.353
artney+1.336
OF+1.318
Neg logit contributions
 apologizing-0.935
 BG-0.931
 Ge-0.892
 Ange-0.892
 adjourn-0.882
Token excellent
Token ID6275
Feature activation-1.82e+00

Pos logit contributions
instein+1.663
oston+1.642
kefeller+1.633
artney+1.612
OF+1.598
Neg logit contributions
 apologizing-1.219
 BG-1.213
 Ange-1.164
 Ge-1.163
 adjourn-1.152
Token play
Token ID711
Feature activation+2.24e-01

Pos logit contributions
 apologizing+1.210
 BG+1.208
 Ge+1.166
 Ange+1.163
 adjourn+1.138
Neg logit contributions
instein-1.504
kefeller-1.499
oston-1.482
artney-1.479
OF-1.424
Token.
Token ID13
Feature activation-2.13e+00

Pos logit contributions
instein+0.198
oston+0.196
kefeller+0.196
artney+0.193
OF+0.190
Neg logit contributions
 apologizing-0.135
 BG-0.134
 Ge-0.128
 Ange-0.128
 adjourn-0.127
Token On
Token ID1550
Feature activation-1.59e+00

Pos logit contributions
 BG+1.428
 apologizing+1.402
 Ge+1.383
 Ange+1.379
Ge+1.317
Neg logit contributions
instein-1.749
kefeller-1.745
artney-1.715
oston-1.711
OF-1.649
Token's
Token ID338
Feature activation-5.75e-01

Pos logit contributions
 apologizing+0.153
 BG+0.152
 Ge+0.146
 Ange+0.146
 adjourn+0.145
Neg logit contributions
instein-0.196
oston-0.193
kefeller-0.193
artney-0.191
OF-0.188
Token on
Token ID319
Feature activation+4.05e-01

Pos logit contributions
 apologizing+0.372
 BG+0.369
 Ge+0.355
 Ange+0.355
 adjourn+0.352
Neg logit contributions
instein-0.483
oston-0.477
kefeller-0.477
artney-0.472
OF-0.464
Token to
Token ID284
Feature activation-2.69e+00

Pos logit contributions
instein+0.339
oston+0.335
kefeller+0.334
artney+0.329
OF+0.325
Neg logit contributions
 apologizing-0.261
 BG-0.260
 Ge-0.250
 Ange-0.250
 adjourn-0.247
Token Los
Token ID5401
Feature activation-1.38e-01

Pos logit contributions
 apologizing+1.707
 BG+1.699
 Ge+1.643
 adjourn+1.623
 Ange+1.616
Neg logit contributions
instein-2.300
kefeller-2.286
oston-2.257
artney-2.238
OF-2.182
Token Angeles
Token ID5652
Feature activation+1.60e+00

Pos logit contributions
 BG+0.108
 apologizing+0.107
 Ange+0.105
 Ge+0.105
 adjourn+0.103
Neg logit contributions
instein-0.097
kefeller-0.096
oston-0.095
artney-0.094
OF-0.092
Token.
Token ID13
Feature activation-1.26e+00

Pos logit contributions
instein+1.343
oston+1.326
kefeller+1.323
artney+1.305
OF+1.286
Neg logit contributions
 apologizing-1.030
 BG-1.026
 Ge-0.986
 Ange-0.985
 adjourn-0.975
Token Ramsey
Token ID30552
Feature activation+2.72e+00

Pos logit contributions
 BG+0.800
 apologizing+0.788
 Ge+0.777
 Ange+0.768
Ge+0.746
Neg logit contributions
instein-1.080
kefeller-1.075
oston-1.058
artney-1.056
OF-1.018
Token is
Token ID318
Feature activation-1.31e-01

Pos logit contributions
instein+2.290
oston+2.260
kefeller+2.222
artney+2.209
OF+2.199
Neg logit contributions
 BG-1.710
 apologizing-1.698
 Ange-1.654
 Ge-1.629
Ge-1.618
Token playing
Token ID2712
Feature activation+2.18e+00

Pos logit contributions
 apologizing+0.079
 BG+0.079
 Ge+0.076
 Ange+0.075
 adjourn+0.075
Neg logit contributions
instein-0.116
kefeller-0.114
oston-0.114
artney-0.113
OF-0.111
Token at
Token ID379
Feature activation-1.00e+00

Pos logit contributions
instein+1.883
oston+1.856
kefeller+1.836
artney+1.814
OF+1.807
Neg logit contributions
 apologizing-1.330
 BG-1.329
 Ange-1.281
 Ge-1.269
 adjourn-1.264
Token an
Token ID281
Feature activation-4.91e-01

Pos logit contributions
 apologizing+0.651
 BG+0.650
 Ge+0.627
 Ange+0.624
 adjourn+0.616
Neg logit contributions
instein-0.841
kefeller-0.833
oston-0.830
artney-0.819
OF-0.805
Token were
Token ID547
Feature activation-8.03e-01

Pos logit contributions
instein+0.683
oston+0.675
kefeller+0.672
artney+0.664
OF+0.656
Neg logit contributions
 apologizing-0.465
 BG-0.463
 Ge-0.444
 Ange-0.444
 adjourn-0.439
Token people
Token ID661
Feature activation+1.19e+00

Pos logit contributions
 apologizing+0.489
 BG+0.487
 Ange+0.469
 Ge+0.469
 adjourn+0.461
Neg logit contributions
instein-0.703
kefeller-0.696
oston-0.692
artney-0.687
OF-0.676
Token making
Token ID1642
Feature activation+1.69e+00

Pos logit contributions
instein+1.062
oston+1.050
kefeller+1.042
artney+1.033
OF+1.020
Neg logit contributions
 BG-0.691
 apologizing-0.689
 Ange-0.660
 Ge-0.659
 adjourn-0.653
Token up
Token ID510
Feature activation+6.74e-01

Pos logit contributions
instein+1.433
oston+1.420
kefeller+1.409
artney+1.390
OF+1.376
Neg logit contributions
 apologizing-1.065
 BG-1.060
 Ange-1.013
 Ge-1.013
 adjourn-1.010
Token all
Token ID477
Feature activation-1.47e+00

Pos logit contributions
instein+0.554
oston+0.547
kefeller+0.546
artney+0.538
OF+0.530
Neg logit contributions
 apologizing-0.446
 BG-0.444
 Ge-0.428
 Ange-0.428
 adjourn-0.423
Token kinds
Token ID6982
Feature activation-5.11e-01

Pos logit contributions
 apologizing+0.854
 BG+0.847
 Ge+0.815
 Ange+0.813
 adjourn+0.803
Neg logit contributions
instein-1.327
kefeller-1.307
oston-1.300
artney-1.286
OF-1.267
Token of
Token ID286
Feature activation-7.14e-01

Pos logit contributions
 apologizing+0.297
 BG+0.295
 Ge+0.283
 Ange+0.282
 adjourn+0.279
Neg logit contributions
instein-0.463
kefeller-0.460
oston-0.457
artney-0.453
OF-0.442
Token stories
Token ID3923
Feature activation+7.06e-02

Pos logit contributions
 BG+0.473
 apologizing+0.473
 Ge+0.456
 Ange+0.456
 adjourn+0.449
Neg logit contributions
instein-0.588
kefeller-0.581
oston-0.577
artney-0.573
OF-0.563
Token about
Token ID546
Feature activation+1.33e+00

Pos logit contributions
instein+0.065
kefeller+0.064
oston+0.064
artney+0.063
OF+0.062
Neg logit contributions
 apologizing-0.040
 BG-0.040
 Ge-0.038
 Ange-0.038
 adjourn-0.038
Token pur
Token ID1308
Feature activation-2.45e+00

Pos logit contributions
instein+1.084
oston+1.074
kefeller+1.062
artney+1.052
OF+1.036
Neg logit contributions
 apologizing-0.883
 BG-0.878
 Ange-0.843
 Ge-0.843
 adjourn-0.838
Tokenges
Token ID3212
Feature activation-2.42e+00

Pos logit contributions
 BG+1.118
 apologizing+1.110
 Ge+1.081
 Ange+1.058
Ge+1.046
Neg logit contributions
instein-2.494
kefeller-2.477
oston-2.460
artney-2.444
OF-2.414
Token lesbian
Token ID17834
Feature activation-4.00e+00

Pos logit contributions
 BG+0.840
 apologizing+0.840
 Ge+0.812
 Ange+0.807
 adjourn+0.790
Neg logit contributions
instein-1.137
kefeller-1.132
oston-1.121
artney-1.113
OF-1.098
Token,
Token ID11
Feature activation-2.11e+00

Pos logit contributions
 BG+2.324
 apologizing+2.321
 Ge+2.265
 Ange+2.230
iot+2.175
Neg logit contributions
kefeller-3.615
instein-3.580
artney-3.531
oston-3.523
OF-3.460
Token gay
Token ID5650
Feature activation-1.20e+00

Pos logit contributions
 apologizing+1.270
 BG+1.263
 Ge+1.223
 Ange+1.208
 adjourn+1.192
Neg logit contributions
instein-1.866
kefeller-1.851
oston-1.842
artney-1.829
OF-1.798
Token,
Token ID11
Feature activation-1.30e+00

Pos logit contributions
 apologizing+0.709
 BG+0.705
 Ge+0.676
 Ange+0.675
 adjourn+0.668
Neg logit contributions
instein-1.063
oston-1.050
kefeller-1.048
artney-1.035
OF-1.021
Token bisexual
Token ID24249
Feature activation-1.84e+00

Pos logit contributions
 apologizing+0.952
 BG+0.948
 Ange+0.916
 Ge+0.915
 adjourn+0.908
Neg logit contributions
instein-0.977
oston-0.962
kefeller-0.958
artney-0.945
OF-0.930
Token and
Token ID290
Feature activation-2.27e+00

Pos logit contributions
 apologizing+1.031
 BG+1.027
 Ge+0.985
 Ange+0.980
 adjourn+0.965
Neg logit contributions
instein-1.689
kefeller-1.675
oston-1.667
artney-1.650
OF-1.625
Token transgender
Token ID10637
Feature activation-9.47e-01

Pos logit contributions
 apologizing+1.384
 BG+1.378
 Ge+1.334
 Ange+1.320
 adjourn+1.304
Neg logit contributions
instein-1.974
kefeller-1.958
oston-1.951
artney-1.932
OF-1.898
Token people
Token ID661
Feature activation+1.79e+00

Pos logit contributions
 apologizing+0.586
 BG+0.584
 Ge+0.564
 Ange+0.561
 adjourn+0.550
Neg logit contributions
instein-0.818
kefeller-0.813
oston-0.806
artney-0.799
OF-0.784
Token,
Token ID11
Feature activation+1.22e+00

Pos logit contributions
instein+1.554
oston+1.537
kefeller+1.524
artney+1.512
OF+1.492
Neg logit contributions
 apologizing-1.089
 BG-1.088
 Ange-1.043
 Ge-1.038
 adjourn-1.034
Token that
Token ID326
Feature activation+2.80e+00

Pos logit contributions
instein+1.029
oston+1.016
kefeller+1.012
artney+0.999
OF+0.986
Neg logit contributions
 apologizing-0.784
 BG-0.781
 Ange-0.750
 Ge-0.749
 adjourn-0.743
Token harassment
Token ID10556
Feature activation+1.60e+00

Pos logit contributions
instein+2.343
oston+2.307
kefeller+2.276
artney+2.258
OF+2.236
Neg logit contributions
 apologizing-1.787
 BG-1.782
 Ange-1.718
 adjourn-1.704
 Ge-1.696
Token's
Token ID338
Feature activation+4.50e-02

Pos logit contributions
 apologizing+0.351
 BG+0.349
 Ge+0.337
 Ange+0.335
 adjourn+0.332
Neg logit contributions
instein-0.467
oston-0.460
kefeller-0.459
artney-0.454
OF-0.447
Token India
Token ID3794
Feature activation-8.20e-01

Pos logit contributions
instein+0.041
kefeller+0.040
oston+0.040
artney+0.040
OF+0.039
Neg logit contributions
 apologizing-0.026
 BG-0.026
 Ge-0.025
 Ange-0.025
 adjourn-0.024
Token follows
Token ID5679
Feature activation-8.13e-01

Pos logit contributions
 apologizing+0.492
 BG+0.488
 Ge+0.468
 Ange+0.464
 adjourn+0.461
Neg logit contributions
instein-0.732
kefeller-0.720
oston-0.719
artney-0.710
OF-0.700
Token a
Token ID257
Feature activation-9.30e-01

Pos logit contributions
 BG+0.519
 apologizing+0.519
 Ge+0.498
 Ange+0.496
 adjourn+0.491
Neg logit contributions
instein-0.688
kefeller-0.679
oston-0.678
artney-0.669
OF-0.662
Token protracted
Token ID44466
Feature activation-1.83e+00

Pos logit contributions
 BG+0.582
 apologizing+0.581
 Ge+0.559
 Ange+0.553
 adjourn+0.551
Neg logit contributions
instein-0.799
kefeller-0.791
oston-0.790
artney-0.781
OF-0.769
Token legal
Token ID2742
Feature activation-4.58e-01

Pos logit contributions
 BG+1.219
 apologizing+1.218
 Ge+1.174
 Ange+1.167
 adjourn+1.164
Neg logit contributions
instein-1.491
oston-1.474
kefeller-1.472
artney-1.458
OF-1.428
Token dispute
Token ID11047
Feature activation-6.98e-01

Pos logit contributions
 apologizing+0.297
 BG+0.295
 Ge+0.285
 Ange+0.284
 adjourn+0.282
Neg logit contributions
instein-0.383
oston-0.379
kefeller-0.378
artney-0.374
OF-0.367
Token with
Token ID351
Feature activation-1.71e+00

Pos logit contributions
 apologizing+0.420
 BG+0.418
 Ge+0.402
 Ange+0.400
 adjourn+0.398
Neg logit contributions
instein-0.618
oston-0.610
kefeller-0.610
artney-0.601
OF-0.592
Token CPR
Token ID42920
Feature activation+1.13e+00

Pos logit contributions
 BG+1.147
 apologizing+1.131
 Ge+1.100
 Ange+1.089
 adjourn+1.074
Neg logit contributions
instein-1.404
kefeller-1.392
oston-1.388
artney-1.371
OF-1.349
TokenL
Token ID43
Feature activation-1.14e+00

Pos logit contributions
instein+0.697
oston+0.685
kefeller+0.683
artney+0.670
OF+0.656
Neg logit contributions
 apologizing-0.983
 BG-0.980
 Ge-0.952
 Ange-0.951
 adjourn-0.944
Token that
Token ID326
Feature activation-1.98e+00

Pos logit contributions
 apologizing+0.705
 BG+0.702
 Ge+0.674
 Ange+0.668
 adjourn+0.663
Neg logit contributions
instein-0.997
kefeller-0.984
oston-0.983
artney-0.969
OF-0.955
Token on
Token ID319
Feature activation-2.85e+00

Pos logit contributions
instein+0.635
oston+0.628
kefeller+0.627
artney+0.618
OF+0.607
Neg logit contributions
 apologizing-0.559
 BG-0.556
 Ge-0.537
 Ange-0.536
 adjourn-0.531
Token the
Token ID262
Feature activation-9.86e-01

Pos logit contributions
 apologizing+1.814
 BG+1.787
 Ge+1.740
 Ange+1.714
 adjourn+1.692
Neg logit contributions
instein-2.417
kefeller-2.393
oston-2.385
artney-2.366
OF-2.321
Token ticket
Token ID7846
Feature activation+6.57e-01

Pos logit contributions
 apologizing+0.615
 BG+0.612
 Ge+0.591
 Ange+0.587
 adjourn+0.581
Neg logit contributions
instein-0.847
oston-0.839
kefeller-0.838
artney-0.830
OF-0.815
Token,
Token ID11
Feature activation+1.39e-01

Pos logit contributions
instein+0.583
oston+0.577
kefeller+0.576
artney+0.568
OF+0.560
Neg logit contributions
 apologizing-0.392
 BG-0.389
 Ge-0.374
 Ange-0.373
 adjourn-0.369
Token the
Token ID262
Feature activation+1.06e+00

Pos logit contributions
instein+0.125
kefeller+0.123
oston+0.123
artney+0.122
OF+0.120
Neg logit contributions
 apologizing-0.082
 BG-0.081
 Ge-0.078
 Ange-0.078
 adjourn-0.077
Token one
Token ID530
Feature activation+5.03e-02

Pos logit contributions
instein+0.877
oston+0.867
kefeller+0.865
artney+0.854
OF+0.840
Neg logit contributions
 apologizing-0.693
 BG-0.689
 Ge-0.664
 Ange-0.663
 adjourn-0.656
Token who
Token ID508
Feature activation-2.88e+00

Pos logit contributions
instein+0.044
oston+0.043
kefeller+0.043
artney+0.043
OF+0.042
Neg logit contributions
 apologizing-0.031
 BG-0.031
 Ge-0.029
 Ange-0.029
 adjourn-0.029
Token might
Token ID1244
Feature activation+3.35e-01

Pos logit contributions
 apologizing+1.756
 BG+1.724
 Ge+1.678
 adjourn+1.658
 Ange+1.657
Neg logit contributions
kefeller-2.512
instein-2.510
oston-2.491
artney-2.471
OF-2.416
Token reassure
Token ID37762
Feature activation+1.73e+00

Pos logit contributions
instein+0.280
oston+0.278
kefeller+0.277
artney+0.273
OF+0.269
Neg logit contributions
 apologizing-0.215
 BG-0.214
 Ge-0.206
 Ange-0.206
 adjourn-0.204
Token Americans
Token ID3399
Feature activation+4.67e-01

Pos logit contributions
instein+1.440
oston+1.420
kefeller+1.414
artney+1.392
OF+1.377
Neg logit contributions
 apologizing-1.118
 BG-1.114
 Ange-1.069
 Ge-1.069
 adjourn-1.061
Token afraid
Token ID7787
Feature activation+3.91e-01

Pos logit contributions
instein+0.441
oston+0.436
kefeller+0.436
artney+0.430
OF+0.424
Neg logit contributions
 apologizing-0.252
 BG-0.250
 Ge-0.239
 Ange-0.238
 adjourn-0.235
Token10
Token ID940
Feature activation-3.51e+00

Pos logit contributions
 BG+1.795
 apologizing+1.792
 Ge+1.731
 Ange+1.704
Ge+1.704
Neg logit contributions
kefeller-3.621
instein-3.619
oston-3.567
artney-3.566
OF-3.457
Token,
Token ID11
Feature activation+7.31e-01

Pos logit contributions
 BG+1.892
 apologizing+1.847
 Ge+1.832
 Ange+1.792
Ge+1.774
Neg logit contributions
kefeller-3.343
instein-3.307
artney-3.289
oston-3.265
OF-3.138
Token 15
Token ID1315
Feature activation-3.16e+00

Pos logit contributions
instein+0.670
oston+0.664
kefeller+0.661
artney+0.653
OF+0.647
Neg logit contributions
 apologizing-0.412
 BG-0.408
 Ange-0.390
 Ge-0.390
 adjourn-0.385
Token},
Token ID5512
Feature activation-1.24e+00

Pos logit contributions
 BG+1.862
 apologizing+1.831
 Ge+1.805
 Ange+1.767
Ge+1.763
Neg logit contributions
kefeller-2.792
instein-2.792
oston-2.779
artney-2.760
OF-2.660
Token {{
Token ID22935
Feature activation-1.52e+00

Pos logit contributions
 apologizing+0.685
 BG+0.684
 Ge+0.655
 Ange+0.652
 adjourn+0.645
Neg logit contributions
instein-1.148
oston-1.134
kefeller-1.133
artney-1.121
OF-1.104
Token9
Token ID24
Feature activation-6.10e+00

Pos logit contributions
 BG+0.970
 apologizing+0.967
 Ge+0.935
 Ange+0.932
Ge+0.924
Neg logit contributions
instein-1.271
kefeller-1.266
oston-1.249
artney-1.242
OF-1.204
Token,
Token ID11
Feature activation+1.10e-01

Pos logit contributions
 BG+2.981
 Ge+2.921
 apologizing+2.832
Ge+2.813
 Ange+2.779
Neg logit contributions
kefeller-6.121
artney-5.989
instein-5.966
oston-5.918
OF-5.633
Token 14
Token ID1478
Feature activation-3.43e+00

Pos logit contributions
instein+0.108
oston+0.107
kefeller+0.107
artney+0.105
OF+0.104
Neg logit contributions
 apologizing-0.055
 BG-0.055
 Ge-0.052
 Ange-0.052
 adjourn-0.051
Token},
Token ID5512
Feature activation+5.65e-01

Pos logit contributions
 BG+1.993
 apologizing+1.954
 Ge+1.937
Ge+1.885
 Ange+1.880
Neg logit contributions
instein-3.051
oston-3.043
kefeller-3.033
artney-3.015
OF-2.919
Token {
Token ID1391
Feature activation-2.72e+00

Pos logit contributions
instein+0.473
oston+0.468
kefeller+0.465
artney+0.459
OF+0.455
Neg logit contributions
 apologizing-0.363
 BG-0.360
 Ange-0.347
 Ge-0.346
 adjourn-0.343
Token9
Token ID24
Feature activation-2.28e+00

Pos logit contributions
 apologizing+1.724
 BG+1.719
 Ge+1.665
 Ange+1.659
Ge+1.646
Neg logit contributions
instein-2.287
kefeller-2.269
oston-2.252
artney-2.241
OF-2.173
Token published
Token ID3199
Feature activation+4.76e-01

Pos logit contributions
 BG+0.893
 apologizing+0.891
 Ge+0.858
 Ange+0.854
 adjourn+0.847
Neg logit contributions
instein-1.281
kefeller-1.263
oston-1.263
artney-1.244
OF-1.226
Token Dec
Token ID4280
Feature activation-2.22e+00

Pos logit contributions
instein+0.437
kefeller+0.433
oston+0.432
artney+0.427
OF+0.421
Neg logit contributions
 apologizing-0.269
 BG-0.269
 Ge-0.259
 Ange-0.257
 adjourn-0.253
Token.
Token ID13
Feature activation-8.86e-01

Pos logit contributions
 BG+1.245
 apologizing+1.229
 Ge+1.205
 Ange+1.188
 adjourn+1.160
Neg logit contributions
instein-2.034
kefeller-2.031
artney-2.007
oston-1.993
OF-1.951
Token 1
Token ID352
Feature activation-7.69e-01

Pos logit contributions
 BG+0.523
 apologizing+0.520
 Ge+0.505
 Ange+0.501
 adjourn+0.491
Neg logit contributions
instein-0.794
kefeller-0.789
oston-0.783
artney-0.778
OF-0.759
Token,
Token ID11
Feature activation-1.41e+00

Pos logit contributions
 apologizing+0.452
 BG+0.452
 Ge+0.434
 Ange+0.431
 adjourn+0.426
Neg logit contributions
instein-0.687
oston-0.679
kefeller-0.679
artney-0.670
OF-0.660
Token 9
Token ID860
Feature activation-4.28e+00

Pos logit contributions
 BG+0.782
 apologizing+0.780
 Ge+0.756
 Ange+0.744
 adjourn+0.732
Neg logit contributions
instein-1.308
kefeller-1.300
oston-1.293
artney-1.279
OF-1.258
Token a
Token ID257
Feature activation+1.01e+00

Pos logit contributions
 apologizing+2.234
 BG+2.233
 Ge+2.144
 Ange+2.122
Ge+2.085
Neg logit contributions
kefeller-4.088
instein-4.078
artney-4.017
oston-3.988
OF-3.901
Token.
Token ID13
Feature activation+1.18e+00

Pos logit contributions
instein+0.882
oston+0.872
kefeller+0.868
artney+0.857
OF+0.847
Neg logit contributions
 apologizing-0.613
 BG-0.609
 Ange-0.584
 Ge-0.584
 adjourn-0.578
Tokenm
Token ID76
Feature activation+2.32e+00

Pos logit contributions
instein+0.862
oston+0.853
kefeller+0.833
OF+0.821
artney+0.819
Neg logit contributions
 apologizing-0.889
 BG-0.877
 Ange-0.852
 adjourn-0.845
 Ge-0.841
Token.
Token ID13
Feature activation+1.39e-01

Pos logit contributions
instein+1.933
oston+1.917
kefeller+1.897
artney+1.866
OF+1.845
Neg logit contributions
 apologizing-1.500
 BG-1.491
 Ange-1.435
 adjourn-1.433
 Ge-1.431
Token PT
Token ID19310
Feature activation-1.65e+00

Pos logit contributions
instein+0.086
oston+0.084
kefeller+0.084
artney+0.082
OF+0.081
Neg logit contributions
 apologizing-0.120
 BG-0.119
 Ange-0.116
 Ge-0.115
 adjourn-0.115
Token educational
Token ID9856
Feature activation+7.16e-01

Pos logit contributions
instein+0.829
oston+0.819
kefeller+0.816
artney+0.805
OF+0.794
Neg logit contributions
 apologizing-0.690
 BG-0.686
 Ange-0.661
 Ge-0.660
 adjourn-0.654
Token attainment
Token ID40375
Feature activation+1.66e+00

Pos logit contributions
instein+0.654
oston+0.646
kefeller+0.644
artney+0.636
OF+0.629
Neg logit contributions
 apologizing-0.407
 BG-0.403
 Ange-0.386
 Ge-0.385
 adjourn-0.381
Token and
Token ID290
Feature activation-1.34e+00

Pos logit contributions
instein+1.441
oston+1.429
kefeller+1.416
artney+1.404
OF+1.389
Neg logit contributions
 apologizing-1.015
 BG-1.011
 Ange-0.967
 Ge-0.962
 adjourn-0.960
Token frequently
Token ID6777
Feature activation+1.21e+00

Pos logit contributions
 apologizing+0.867
 BG+0.864
 Ge+0.842
 adjourn+0.830
 Ange+0.829
Neg logit contributions
instein-1.109
kefeller-1.103
oston-1.099
artney-1.087
OF-1.061
Token in
Token ID287
Feature activation-3.20e-02

Pos logit contributions
instein+0.976
oston+0.963
kefeller+0.959
artney+0.946
OF+0.934
Neg logit contributions
 apologizing-0.812
 BG-0.808
 Ange-0.779
 Ge-0.777
 adjourn-0.769
Token rural
Token ID10016
Feature activation-5.40e+00

Pos logit contributions
 apologizing+0.019
 BG+0.019
 Ange+0.018
 Ge+0.018
 adjourn+0.018
Neg logit contributions
instein-0.028
oston-0.028
kefeller-0.028
artney-0.028
OF-0.027
Token regions
Token ID7652
Feature activation-1.54e+00

Pos logit contributions
 BG+2.729
 apologizing+2.695
 Ge+2.617
 Ange+2.561
 adjourn+2.488
Neg logit contributions
instein-5.288
kefeller-5.244
artney-5.194
oston-5.146
OF-5.085
Token with
Token ID351
Feature activation-9.55e-01

Pos logit contributions
 apologizing+0.856
 BG+0.845
 Ge+0.815
 Ange+0.809
 adjourn+0.802
Neg logit contributions
instein-1.432
kefeller-1.413
oston-1.409
artney-1.391
OF-1.371
Token limited
Token ID3614
Feature activation-1.44e+00

Pos logit contributions
 apologizing+0.529
 BG+0.526
 Ge+0.506
 Ange+0.502
 adjourn+0.498
Neg logit contributions
instein-0.889
oston-0.880
kefeller-0.877
artney-0.868
OF-0.855
Token access
Token ID1895
Feature activation+1.68e+00

Pos logit contributions
 apologizing+0.902
 BG+0.900
 Ge+0.876
 Ange+0.861
 adjourn+0.859
Neg logit contributions
instein-1.246
kefeller-1.226
oston-1.226
artney-1.220
OF-1.191
Token to
Token ID284
Feature activation+1.42e+00

Pos logit contributions
instein+1.362
kefeller+1.350
oston+1.346
artney+1.323
OF+1.307
Neg logit contributions
 apologizing-1.128
 BG-1.113
 Ange-1.081
 adjourn-1.073
 Ge-1.069
Token...
Token ID2644
Feature activation+1.53e-01

Pos logit contributions
instein+1.589
oston+1.569
kefeller+1.552
artney+1.533
OF+1.527
Neg logit contributions
 apologizing-1.119
 BG-1.105
 Ange-1.055
 Ge-1.053
iot-1.049
Token }
Token ID1782
Feature activation+1.81e+00

Pos logit contributions
instein+0.142
oston+0.141
kefeller+0.140
artney+0.139
OF+0.137
Neg logit contributions
 apologizing-0.084
 BG-0.083
 Ange-0.079
 Ge-0.079
 adjourn-0.078
Token pub
Token ID2240
Feature activation-1.53e+00

Pos logit contributions
instein+1.572
oston+1.547
kefeller+1.539
artney+1.521
OF+1.510
Neg logit contributions
 apologizing-1.100
 BG-1.090
 Ange-1.045
 Ge-1.043
 adjourn-1.030
Token fn
Token ID24714
Feature activation-4.67e+00

Pos logit contributions
 apologizing+0.691
 BG+0.687
 Ge+0.649
 Ange+0.648
 adjourn+0.639
Neg logit contributions
instein-1.579
oston-1.563
kefeller-1.560
artney-1.544
OF-1.525
Token undo
Token ID23981
Feature activation-6.96e-01

Pos logit contributions
 apologizing+2.394
 BG+2.369
 adjourn+2.283
 Ge+2.265
 Ange+2.254
Neg logit contributions
artney-4.487
kefeller-4.487
oston-4.477
instein-4.459
OF-4.359
Token_
Token ID62
Feature activation-4.67e+00

Pos logit contributions
 apologizing+0.476
 BG+0.474
 Ge+0.457
 Ange+0.457
 adjourn+0.453
Neg logit contributions
instein-0.556
oston-0.548
kefeller-0.547
artney-0.540
OF-0.531
Tokenmove
Token ID21084
Feature activation-3.04e+00

Pos logit contributions
 apologizing+1.981
 BG+1.977
 Ge+1.908
 Ange+1.897
iot+1.892
Neg logit contributions
kefeller-4.831
instein-4.817
artney-4.808
oston-4.772
OF-4.669
Token (
Token ID357
Feature activation-2.66e+00

Pos logit contributions
 BG+1.776
 apologizing+1.768
 Ge+1.713
 Ange+1.707
 adjourn+1.683
Neg logit contributions
instein-2.704
kefeller-2.690
oston-2.676
artney-2.662
OF-2.591
Token &
Token ID1222
Feature activation+1.18e+00

Pos logit contributions
 apologizing+1.418
 BG+1.414
 Ge+1.355
 Ange+1.348
 adjourn+1.335
Neg logit contributions
instein-2.521
oston-2.491
kefeller-2.491
artney-2.467
OF-2.421
Token mut
Token ID4517
Feature activation-5.57e+00

Pos logit contributions
instein+1.108
oston+1.098
kefeller+1.086
artney+1.071
OF+1.066
Neg logit contributions
 apologizing-0.636
 BG-0.633
 Ange-0.606
 Ge-0.600
 adjourn-0.594
Token self
Token ID2116
Feature activation-4.33e+00

Pos logit contributions
 BG+2.879
 apologizing+2.808
 Ge+2.775
 adjourn+2.715
 Ange+2.691
Neg logit contributions
kefeller-5.451
artney-5.373
instein-5.364
oston-5.319
OF-5.213
Token bind
Token ID11007
Feature activation+5.78e-02

Pos logit contributions
 BG+1.318
 apologizing+1.313
 Ge+1.259
 Ange+1.248
 adjourn+1.239
Neg logit contributions
instein-1.774
kefeller-1.756
oston-1.748
artney-1.729
OF-1.697
Tokeners
Token ID364
Feature activation-1.52e+00

Pos logit contributions
instein+0.057
kefeller+0.057
oston+0.056
artney+0.056
OF+0.055
Neg logit contributions
 apologizing-0.029
 BG-0.029
 Ge-0.027
 Ange-0.027
 adjourn-0.027
Token as
Token ID355
Feature activation-1.45e-01

Pos logit contributions
 apologizing+0.865
 BG+0.861
 Ge+0.822
 Ange+0.820
 adjourn+0.808
Neg logit contributions
instein-1.392
kefeller-1.377
oston-1.373
artney-1.354
OF-1.336
Token well
Token ID880
Feature activation-1.68e+00

Pos logit contributions
 apologizing+0.086
 BG+0.086
 Ge+0.082
 Ange+0.082
 adjourn+0.081
Neg logit contributions
instein-0.129
kefeller-0.127
oston-0.127
artney-0.125
OF-0.123
Token.
Token ID13
Feature activation-2.32e+00

Pos logit contributions
 apologizing+0.967
 BG+0.962
 Ge+0.922
 Ange+0.916
 adjourn+0.900
Neg logit contributions
instein-1.530
kefeller-1.514
oston-1.507
artney-1.494
OF-1.469
Token Mostly
Token ID33495
Feature activation-4.00e+00

Pos logit contributions
 BG+1.450
 apologizing+1.414
 Ge+1.404
 Ange+1.391
Ge+1.342
Neg logit contributions
instein-2.029
kefeller-2.022
oston-2.000
artney-1.985
OF-1.912
Token,
Token ID11
Feature activation-5.89e-02

Pos logit contributions
 BG+2.363
 apologizing+2.348
 Ge+2.265
 Ange+2.242
 substitutes+2.192
Neg logit contributions
instein-3.610
kefeller-3.596
artney-3.544
oston-3.530
OF-3.432
Token you
Token ID345
Feature activation+2.55e+00

Pos logit contributions
 apologizing+0.036
 BG+0.036
 Ge+0.034
 Ange+0.034
 adjourn+0.034
Neg logit contributions
instein-0.052
oston-0.051
kefeller-0.051
artney-0.050
OF-0.050
Token
Token ID447
Feature activation-1.37e+00

Pos logit contributions
instein+1.962
oston+1.931
kefeller+1.913
artney+1.888
OF+1.867
Neg logit contributions
 apologizing-1.813
 BG-1.794
 Ange-1.737
 Ge-1.733
 adjourn-1.715
Token
Token ID247
Feature activation+1.68e-01

Pos logit contributions
 apologizing+0.781
 BG+0.780
 Ge+0.746
 Ange+0.744
 adjourn+0.735
Neg logit contributions
instein-1.252
kefeller-1.243
oston-1.240
artney-1.227
OF-1.201
Tokenll
Token ID297
Feature activation-1.59e+00

Pos logit contributions
instein+0.142
oston+0.140
kefeller+0.140
artney+0.138
OF+0.136
Neg logit contributions
 apologizing-0.108
 BG-0.107
 Ange-0.103
 Ge-0.103
 adjourn-0.102