TOP ACTIVATIONS
MAX = 3.569

listening to speeches in Tra fal gar Square following the march
. Car la del P onte , a member
Cutter Capitol Group , was del isted from the NAS DAQ
. Mal ab ar , FL : K rie ger Publishing
a 2014 interview with the Tampa Bay Times , Rubio did
"[ 41 ] and Peterson conc urs that the Muslims wanted
According to Mur ad Do ğ uk an li , writing
wrote in May about the del uge of anti - Semitic
Alternative Facts in
dust ? man del ights not me ; no
the family also del ved into the oil -
sound , something that Phil adel ph ians do more frequently
said too about what Phil adel ph ians do with their
of Virginia . Del . Todd Gilbert ( R
- govern ed organisation " del ivers emergency aid to people
consequences of the del ayed unfolding of independence
manner in which Black Phil adel ph ians use a neutral
But efforts to restore ties fal tered amid new Israeli action
The way Phil adel ph ians say the au
bar re ling through the Tampa Bay Lightning , prevailing through

MIN ACTIVATIONS
MIN = -3.320

45 366 . Topics : community - and -
into them ". Topics : marine - biology ,
News 24 . Topics : beef - c attle
41 - year - old Herm osa Beach Karen K la
Britain to issue a su k uk , a form of
'." He then asks : " Is there really
Rec alling that , Ik ki changed his plan immediately
differences between the biology and sociology of race and gender to
Fel ton , D arian Owens , and now Allow ay
be available only at a laboratory in San Juan Cap ist
, Lost Woods ) are classics
is a h ul king postwar structure that j uts into
Hispanic vote . Topics : Hispanic / Lat ino
22 . S overe ign Lux ury Travel (
: " ann data su k hi b h ava "
bipartisan spirit . Contact us at editors @ time
McLaren 's Champions of the Future k art ing series 1998
, or at least establish contact , with his most determined
now . I 'm glad nan ocl oud wasn 't the
13 years old . The contact had started via Instagram with

INTERVAL 1.847 - 3.569
CONTAINS 0.251%

Chern iv ts i , Lv iv and Iv ano -
actually show a toilet in ab ath room seemed oddly appropriate
amount of inconvenience to the traveler , he told
original , we had no idea . Now , on movie
Mid n ighter doesn 't exactly jump to mind since he

INTERVAL 0.125 - 1.847
CONTAINS 37.499%

what a sewing machine does . Teach them how to thread
measures to keep the government open . While the budget and
working in IT , I 'm sure you 'll find much
post - mobile world Why it s
in the wake of the Republican victory . Sent

INTERVAL -1.597 - 0.125
CONTAINS 61.564%

to show up , and a constantly changing line - up
the 2008 election . Christ ie is widely considered
corner . You Want Some Pictures of That Corner
enough to eat at home . Later in the
(@ m f lynn JR

INTERVAL -3.320 - -1.597
CONTAINS 0.686%

About 30 or 40 policemen took Mr Tom ar away this
oke on a stool , ask him directions to the T
less experienced teachers . Their lifetime incomes sal ary plus
the exception of anti - tank guns .[ 19 ] Bob
not match their body 's biological sex , they wish to
Token listening
Token ID8680
Feature activation+7.83e-01

Pos logit contributions
phe+0.109
rophe+0.096
ths+0.087
riad+0.085
ient+0.084
Neg logit contributions
 Probe-0.101
 Nish-0.096
ersen-0.092
 Rik-0.086
 COURT-0.086
Token to
Token ID284
Feature activation+1.48e-01

Pos logit contributions
phe+1.793
rophe+1.596
ths+1.453
riad+1.438
ient+1.414
Neg logit contributions
 Probe-1.373
 Nish-1.289
ersen-1.231
 Rik-1.146
 COURT-1.138
Token speeches
Token ID18138
Feature activation+7.48e-01

Pos logit contributions
phe+0.326
rophe+0.290
ths+0.262
riad+0.260
ient+0.255
Neg logit contributions
 Probe-0.272
 Nish-0.256
ersen-0.244
 Rik-0.229
 COURT-0.227
Token in
Token ID287
Feature activation-1.63e-01

Pos logit contributions
phe+1.628
rophe+1.439
ths+1.304
riad+1.287
ient+1.266
Neg logit contributions
 Probe-1.396
 Nish-1.314
ersen-1.260
 Rik-1.178
 COURT-1.169
Token Tra
Token ID4759
Feature activation-1.50e-01

Pos logit contributions
 Probe+0.279
 Nish+0.262
ersen+0.247
 Rik+0.232
 Trafford+0.230
Neg logit contributions
phe-0.380
rophe-0.340
ths-0.310
riad-0.307
ient-0.302
Tokenfal
Token ID42932
Feature activation+3.57e+00

Pos logit contributions
 Probe+0.316
 Nish+0.301
ersen+0.290
 Rik+0.274
 Trafford+0.272
Neg logit contributions
phe-0.287
rophe-0.250
ths-0.222
riad-0.220
ient-0.215
Tokengar
Token ID4563
Feature activation+7.39e-01

Pos logit contributions
phe+5.228
rophe+4.142
ient+3.692
riad+3.604
plain+3.566
Neg logit contributions
 Probe-8.968
 Nish-8.555
ersen-8.472
hift-7.928
 COURT-7.838
Token Square
Token ID9276
Feature activation+1.88e-01

Pos logit contributions
phe+1.685
rophe+1.498
ths+1.366
riad+1.347
ient+1.327
Neg logit contributions
 Probe-1.304
 Nish-1.226
ersen-1.172
 Rik-1.089
 COURT-1.080
Token following
Token ID1708
Feature activation-4.49e-01

Pos logit contributions
phe+0.410
rophe+0.363
ths+0.329
riad+0.325
ient+0.320
Neg logit contributions
 Probe-0.351
 Nish-0.331
ersen-0.316
 Rik-0.297
 COURT-0.295
Token the
Token ID262
Feature activation-3.35e-01

Pos logit contributions
 Probe+0.837
 Nish+0.785
ersen+0.749
 Rik+0.702
 Trafford+0.701
Neg logit contributions
phe-0.986
rophe-0.874
ths-0.786
riad-0.782
ient-0.771
Token march
Token ID9960
Feature activation+9.23e-02

Pos logit contributions
 Probe+0.631
 Nish+0.593
ersen+0.565
 Rik+0.530
 NP+0.529
Neg logit contributions
phe-0.729
rophe-0.647
ths-0.583
riad-0.577
ient-0.569
Token.
Token ID13
Feature activation-8.66e-02

Pos logit contributions
 Probe+1.529
 Nish+1.430
ersen+1.368
 Rik+1.279
 COURT+1.272
Neg logit contributions
phe-1.813
rophe-1.600
riad-1.451
ths-1.446
ient-1.407
Token
Token ID198
Feature activation+2.65e-02

Pos logit contributions
 Probe+0.142
 Nish+0.132
ersen+0.125
 Rik+0.117
 COURT+0.116
Neg logit contributions
phe-0.209
rophe-0.188
ths-0.171
riad-0.170
ient-0.168
Token
Token ID198
Feature activation+9.57e-02

Pos logit contributions
phe+0.061
rophe+0.054
ths+0.050
riad+0.049
ient+0.048
Neg logit contributions
 Probe-0.046
 Nish-0.043
ersen-0.041
 Rik-0.038
 Trafford-0.038
TokenCar
Token ID9914
Feature activation+1.93e+00

Pos logit contributions
phe+0.208
rophe+0.184
ths+0.166
riad+0.164
ient+0.162
Neg logit contributions
 Probe-0.178
 Nish-0.168
ersen-0.161
 Rik-0.151
 COURT-0.149
Tokenla
Token ID5031
Feature activation+2.09e+00

Pos logit contributions
phe+3.973
rophe+3.486
ths+3.192
riad+3.115
ient+3.067
Neg logit contributions
 Probe-3.801
 Nish-3.583
ersen-3.450
 Rik-3.236
 NP-3.232
Token del
Token ID1619
Feature activation+3.55e+00

Pos logit contributions
phe+4.861
rophe+4.306
ths+3.936
riad+3.882
ient+3.839
Neg logit contributions
 Probe-3.559
 Nish-3.276
ersen-3.183
 COURT-2.944
 NP-2.922
Token P
Token ID350
Feature activation+4.13e-01

Pos logit contributions
phe+8.772
rophe+7.656
ths+6.944
ient+6.909
riad+6.845
Neg logit contributions
 Probe-5.300
 Nish-4.822
ersen-4.819
 Trafford-4.331
 COURT-4.204
Tokenonte
Token ID38599
Feature activation-1.66e-01

Pos logit contributions
phe+0.871
rophe+0.765
ths+0.688
riad+0.683
ient+0.680
Neg logit contributions
 Probe-0.808
 Nish-0.759
ersen-0.722
 Rik-0.686
 NP-0.681
Token,
Token ID11
Feature activation+3.87e-01

Pos logit contributions
 Probe+0.304
 Nish+0.286
ersen+0.273
 Rik+0.256
 COURT+0.253
Neg logit contributions
phe-0.368
rophe-0.327
ths-0.296
riad-0.293
ient-0.287
Token a
Token ID257
Feature activation+4.71e-01

Pos logit contributions
phe+0.806
rophe+0.707
ths+0.638
riad+0.629
ient+0.618
Neg logit contributions
 Probe-0.759
 Nish-0.717
ersen-0.689
 Rik-0.646
 COURT-0.643
Token member
Token ID2888
Feature activation+7.43e-01

Pos logit contributions
phe+1.084
rophe+0.962
ths+0.881
riad+0.870
ient+0.853
Neg logit contributions
 Probe-0.814
 Nish-0.763
ersen-0.735
 COURT-0.679
 Rik-0.677
Token Cutter
Token ID49160
Feature activation-2.49e-01

Pos logit contributions
phe+0.564
rophe+0.498
ths+0.451
riad+0.445
ient+0.437
Neg logit contributions
 Probe-0.495
 Nish-0.467
ersen-0.448
 Rik-0.419
 COURT-0.416
Token Capitol
Token ID13241
Feature activation+1.23e+00

Pos logit contributions
 Probe+0.448
 Nish+0.420
ersen+0.402
 Rik+0.375
 NP+0.373
Neg logit contributions
phe-0.560
rophe-0.498
ths-0.452
riad-0.448
ient-0.438
Token Group
Token ID4912
Feature activation+4.14e-01

Pos logit contributions
phe+2.719
rophe+2.412
ths+2.184
riad+2.161
ient+2.142
Neg logit contributions
 Probe-2.214
 Nish-2.096
ersen-2.005
 COURT-1.873
 Rik-1.872
Token,
Token ID11
Feature activation+2.59e-01

Pos logit contributions
phe+0.900
rophe+0.796
ths+0.721
riad+0.712
ient+0.700
Neg logit contributions
 Probe-0.776
 Nish-0.731
ersen-0.700
 Rik-0.655
 NP-0.651
Token was
Token ID373
Feature activation+5.79e-01

Pos logit contributions
phe+0.539
rophe+0.473
ths+0.427
riad+0.421
ient+0.414
Neg logit contributions
 Probe-0.506
 Nish-0.479
ersen-0.460
 Rik-0.431
 COURT-0.429
Token del
Token ID1619
Feature activation+3.39e+00

Pos logit contributions
phe+1.160
rophe+1.013
ths+0.909
riad+0.896
ient+0.887
Neg logit contributions
 Probe-1.169
 Nish-1.107
ersen-1.074
 Rik-1.001
 COURT-1.001
Tokenisted
Token ID6347
Feature activation+9.85e-01

Pos logit contributions
phe+6.338
rophe+5.357
ths+4.787
ient+4.738
riad+4.670
Neg logit contributions
 Probe-7.303
 Nish-6.876
ersen-6.618
 Trafford-6.346
 Rik-6.264
Token from
Token ID422
Feature activation+3.79e-01

Pos logit contributions
phe+2.077
rophe+1.834
ths+1.660
riad+1.631
ient+1.611
Neg logit contributions
 Probe-1.892
 Nish-1.785
ersen-1.717
 Rik-1.606
 COURT-1.596
Token the
Token ID262
Feature activation+1.81e-01

Pos logit contributions
phe+0.796
rophe+0.701
ths+0.632
riad+0.623
ient+0.613
Neg logit contributions
 Probe-0.738
 Nish-0.696
ersen-0.668
 Rik-0.627
 NP-0.623
Token NAS
Token ID7210
Feature activation-2.80e-01

Pos logit contributions
phe+0.433
rophe+0.388
ths+0.355
riad+0.350
ient+0.346
Neg logit contributions
 Probe-0.300
 Nish-0.280
ersen-0.266
 Rik-0.247
 NP-0.246
TokenDAQ
Token ID46640
Feature activation+5.40e-01

Pos logit contributions
 Probe+0.554
 Nish+0.520
ersen+0.496
 Rik+0.469
 NP+0.467
Neg logit contributions
phe-0.579
rophe-0.505
ths-0.457
riad-0.451
ient-0.444
Token.
Token ID13
Feature activation+1.26e-01

Pos logit contributions
phe+0.149
rophe+0.131
ths+0.119
riad+0.117
ient+0.115
Neg logit contributions
 Probe-0.128
 Nish-0.120
ersen-0.115
 Rik-0.108
 COURT-0.107
Token Mal
Token ID4434
Feature activation+4.92e-01

Pos logit contributions
phe+0.324
rophe+0.292
ths+0.269
riad+0.267
ient+0.263
Neg logit contributions
 Probe-0.185
 Nish-0.171
ersen-0.164
 Rik-0.149
 COURT-0.147
Tokenab
Token ID397
Feature activation+8.46e-01

Pos logit contributions
phe+1.007
rophe+0.882
ths+0.793
riad+0.783
ient+0.769
Neg logit contributions
 Probe-0.984
 Nish-0.930
ersen-0.894
 Rik-0.840
 COURT-0.835
Tokenar
Token ID283
Feature activation-2.09e-02

Pos logit contributions
phe+1.441
rophe+1.245
riad+1.100
ths+1.092
ient+1.088
Neg logit contributions
 Probe-1.965
 Nish-1.853
ersen-1.770
 Rik-1.713
 COURT-1.686
Token,
Token ID11
Feature activation+3.05e-01

Pos logit contributions
 Probe+0.038
 Nish+0.035
ersen+0.034
 Rik+0.032
 NP+0.031
Neg logit contributions
phe-0.047
rophe-0.041
ths-0.038
riad-0.037
ient-0.037
Token FL
Token ID9977
Feature activation+3.25e+00

Pos logit contributions
phe+0.779
rophe+0.704
ths+0.649
riad+0.644
ient+0.634
Neg logit contributions
 Probe-0.448
 Nish-0.414
ersen-0.393
 Rik-0.357
 Trafford-0.354
Token :
Token ID1058
Feature activation+4.00e-01

Pos logit contributions
phe+6.487
rophe+5.623
ths+5.116
ient+5.071
riad+5.017
Neg logit contributions
 Probe-6.285
 Nish-5.906
ersen-5.739
 Rik-5.372
 Trafford-5.311
Token K
Token ID509
Feature activation-6.48e-01

Pos logit contributions
phe+0.964
rophe+0.859
ths+0.788
riad+0.781
ient+0.765
Neg logit contributions
 Probe-0.650
 Nish-0.607
ersen-0.585
 Rik-0.533
 COURT-0.531
Tokenrie
Token ID5034
Feature activation-3.69e-01

Pos logit contributions
 Probe+1.252
 Nish+1.185
ersen+1.144
 Rik+1.070
 Trafford+1.061
Neg logit contributions
phe-1.360
rophe-1.189
ths-1.081
riad-1.062
ient-1.044
Tokenger
Token ID1362
Feature activation+7.39e-02

Pos logit contributions
 Probe+0.480
 Nish+0.442
ersen+0.415
 Rik+0.374
 COURT+0.373
Neg logit contributions
phe-1.004
rophe-0.917
ths-0.847
riad-0.844
ient-0.830
Token Publishing
Token ID23499
Feature activation+1.26e+00

Pos logit contributions
phe+0.162
rophe+0.143
ths+0.130
riad+0.128
ient+0.126
Neg logit contributions
 Probe-0.137
 Nish-0.129
ersen-0.123
 Rik-0.115
 COURT-0.114
Token a
Token ID257
Feature activation+2.94e-01

Pos logit contributions
 Probe+0.105
 Nish+0.099
ersen+0.095
 Rik+0.089
 COURT+0.088
Neg logit contributions
phe-0.114
rophe-0.101
ths-0.091
riad-0.090
ient-0.088
Token 2014
Token ID1946
Feature activation-1.12e+00

Pos logit contributions
phe+0.604
rophe+0.530
ths+0.478
riad+0.471
ient+0.459
Neg logit contributions
 Probe-0.583
 Nish-0.552
ersen-0.532
 Rik-0.499
 Trafford-0.496
Token interview
Token ID2720
Feature activation-1.10e+00

Pos logit contributions
 Probe+2.481
 Nish+2.342
ersen+2.244
 NP+2.143
 Rik+2.126
Neg logit contributions
phe-2.044
rophe-1.767
ths-1.554
riad-1.548
ient-1.522
Token with
Token ID351
Feature activation-3.75e-01

Pos logit contributions
 Probe+1.879
 Nish+1.766
ersen+1.669
 Rik+1.577
 NP+1.556
Neg logit contributions
phe-2.535
rophe-2.241
riad-2.053
ths-2.051
ient-2.025
Token the
Token ID262
Feature activation+5.37e-02

Pos logit contributions
 Probe+0.665
 Nish+0.619
ersen+0.590
 Rik+0.554
 NP+0.550
Neg logit contributions
phe-0.858
rophe-0.766
ths-0.692
riad-0.691
ient-0.682
Token Tampa
Token ID15528
Feature activation+3.23e+00

Pos logit contributions
phe+0.129
rophe+0.116
ths+0.106
riad+0.105
ient+0.103
Neg logit contributions
 Probe-0.089
 Nish-0.082
ersen-0.078
 Rik-0.073
 NP-0.072
Token Bay
Token ID4696
Feature activation+1.73e+00

Pos logit contributions
phe+5.075
rophe+4.382
ths+3.812
riad+3.641
ient+3.578
Neg logit contributions
 Probe-7.556
 Nish-7.340
ersen-7.074
 Trafford-6.797
 Rik-6.765
Token Times
Token ID3782
Feature activation+6.90e-01

Pos logit contributions
phe+4.416
rophe+3.980
ths+3.682
riad+3.634
ient+3.582
Neg logit contributions
 Probe-2.542
 Nish-2.377
ersen-2.254
 Rik-2.055
 COURT-2.041
Token,
Token ID11
Feature activation+1.88e-01

Pos logit contributions
phe+1.500
rophe+1.329
ths+1.203
riad+1.187
ient+1.167
Neg logit contributions
 Probe-1.286
 Nish-1.213
ersen-1.162
 Rik-1.087
 COURT-1.080
Token Rubio
Token ID15476
Feature activation+5.57e-01

Pos logit contributions
phe+0.398
rophe+0.350
ths+0.316
riad+0.312
ient+0.307
Neg logit contributions
 Probe-0.362
 Nish-0.342
ersen-0.328
 Rik-0.308
 NP-0.305
Token did
Token ID750
Feature activation+8.86e-01

Pos logit contributions
phe+1.206
rophe+1.064
ths+0.967
riad+0.952
ient+0.936
Neg logit contributions
 Probe-1.045
 Nish-0.984
ersen-0.942
 Rik-0.880
 COURT-0.879
Token"[
Token ID17912
Feature activation+1.27e+00

Pos logit contributions
phe+1.352
rophe+1.182
ths+1.057
riad+1.046
ient+1.023
Neg logit contributions
 Probe-1.395
 Nish-1.322
ersen-1.272
 Rik-1.200
 COURT-1.190
Token41
Token ID3901
Feature activation-3.75e-01

Pos logit contributions
phe+3.807
rophe+3.472
ths+3.242
riad+3.205
ient+3.183
Neg logit contributions
 Probe-1.320
 Nish-1.170
ersen-1.082
 Rik-0.944
 Trafford-0.933
Token]
Token ID60
Feature activation+1.04e+00

Pos logit contributions
 Probe+0.738
 Nish+0.699
ersen+0.668
 Rik+0.631
 NP+0.629
Neg logit contributions
phe-0.773
rophe-0.681
ths-0.613
riad-0.607
ient-0.595
Token and
Token ID290
Feature activation+3.82e-01

Pos logit contributions
phe+2.232
rophe+1.968
ths+1.794
riad+1.754
ient+1.731
Neg logit contributions
 Probe-1.939
 Nish-1.812
ersen-1.750
 COURT-1.626
 NP-1.623
Token Peterson
Token ID18914
Feature activation-9.30e-01

Pos logit contributions
phe+0.789
rophe+0.694
ths+0.624
riad+0.614
ient+0.606
Neg logit contributions
 Probe-0.752
 Nish-0.708
ersen-0.682
 Rik-0.639
 COURT-0.637
Token conc
Token ID1673
Feature activation+3.22e+00

Pos logit contributions
 Probe+1.726
 Nish+1.636
ersen+1.564
 Rik+1.464
 NP+1.460
Neg logit contributions
phe-2.024
rophe-1.797
ths-1.614
riad-1.614
ient-1.573
Tokenurs
Token ID1834
Feature activation+5.05e-01

Pos logit contributions
phe+6.099
rophe+5.469
ths+5.069
riad+4.874
ient+4.813
Neg logit contributions
 Probe-6.712
 Nish-6.314
ersen-5.889
 COURT-5.688
 NP-5.672
Token that
Token ID326
Feature activation+2.15e-01

Pos logit contributions
phe+1.056
rophe+0.928
ths+0.835
riad+0.826
ient+0.809
Neg logit contributions
 Probe-0.990
 Nish-0.936
ersen-0.897
 Rik-0.845
 NP-0.837
Token the
Token ID262
Feature activation+7.38e-02

Pos logit contributions
phe+0.463
rophe+0.409
ths+0.370
riad+0.365
ient+0.360
Neg logit contributions
 Probe-0.404
 Nish-0.380
ersen-0.365
 Rik-0.341
 COURT-0.340
Token Muslims
Token ID7045
Feature activation+2.03e+00

Pos logit contributions
phe+0.156
rophe+0.137
ths+0.124
riad+0.122
ient+0.120
Neg logit contributions
 Probe-0.143
 Nish-0.135
ersen-0.129
 Rik-0.121
 COURT-0.120
Token wanted
Token ID2227
Feature activation+2.72e-01

Pos logit contributions
phe+4.165
rophe+3.586
ths+3.268
ient+3.151
riad+3.134
Neg logit contributions
 Probe-3.987
 Nish-3.725
ersen-3.626
 Trafford-3.394
 Rik-3.373
Token According
Token ID4784
Feature activation-2.22e-01

Pos logit contributions
phe+0.100
rophe+0.089
ths+0.081
riad+0.080
ient+0.079
Neg logit contributions
 Probe-0.074
 Nish-0.069
ersen-0.066
 Rik-0.061
 COURT-0.061
Token to
Token ID284
Feature activation+3.25e-01

Pos logit contributions
 Probe+0.399
 Nish+0.373
ersen+0.358
 Rik+0.333
 COURT+0.332
Neg logit contributions
phe-0.498
rophe-0.443
ths-0.405
riad-0.399
ient-0.393
Token Mur
Token ID5921
Feature activation+6.88e-01

Pos logit contributions
phe+0.702
rophe+0.621
ths+0.561
riad+0.554
ient+0.545
Neg logit contributions
 Probe-0.614
 Nish-0.580
ersen-0.554
 Rik-0.520
 COURT-0.516
Tokenad
Token ID324
Feature activation+8.42e-01

Pos logit contributions
phe+1.298
rophe+1.131
ths+1.001
riad+0.987
ient+0.968
Neg logit contributions
 Probe-1.481
 Nish-1.410
ersen-1.355
 Rik-1.283
 COURT-1.274
Token Do
Token ID2141
Feature activation+7.05e-02

Pos logit contributions
phe+1.964
rophe+1.750
ths+1.601
riad+1.584
ient+1.560
Neg logit contributions
 Probe-1.435
 Nish-1.338
ersen-1.285
 Rik-1.186
 COURT-1.185
Tokenğ
Token ID33133
Feature activation+3.18e+00

Pos logit contributions
phe+0.096
rophe+0.079
ths+0.066
riad+0.064
ient+0.061
Neg logit contributions
 Probe-0.188
 Nish-0.182
ersen-0.176
 Rik-0.168
 Trafford-0.167
Tokenuk
Token ID2724
Feature activation+1.94e-01

Pos logit contributions
phe+4.448
rophe+3.561
riad+3.215
ths+3.192
ient+3.043
Neg logit contributions
 Probe-8.184
 Nish-7.736
ersen-7.497
 Trafford-7.267
 COURT-7.243
Tokenan
Token ID272
Feature activation+1.06e+00

Pos logit contributions
phe+0.419
rophe+0.369
ths+0.334
riad+0.329
ient+0.324
Neg logit contributions
 Probe-0.367
 Nish-0.348
ersen-0.332
 Rik-0.312
 COURT-0.309
Tokenli
Token ID4528
Feature activation+1.90e+00

Pos logit contributions
phe+2.296
rophe+2.030
ths+1.842
riad+1.822
ient+1.792
Neg logit contributions
 Probe-1.964
 Nish-1.847
ersen-1.774
 Rik-1.655
 COURT-1.650
Token,
Token ID11
Feature activation+5.35e-01

Pos logit contributions
phe+4.035
rophe+3.554
ths+3.236
riad+3.202
ient+3.162
Neg logit contributions
 Probe-3.586
 Nish-3.366
ersen-3.239
 Trafford-3.041
 COURT-3.035
Token writing
Token ID3597
Feature activation+4.89e-01

Pos logit contributions
phe+1.125
rophe+0.989
ths+0.896
riad+0.883
ient+0.867
Neg logit contributions
 Probe-1.033
 Nish-0.974
ersen-0.939
 Rik-0.878
 COURT-0.875
Token wrote
Token ID2630
Feature activation+5.21e-01

Pos logit contributions
 Probe+1.877
 Nish+1.764
ersen+1.688
 Rik+1.595
 NP+1.579
Neg logit contributions
phe-2.039
rophe-1.804
ths-1.615
riad-1.595
ient-1.563
Token in
Token ID287
Feature activation+1.79e-01

Pos logit contributions
phe+1.031
rophe+0.903
ths+0.811
riad+0.797
ient+0.783
Neg logit contributions
 Probe-1.068
 Nish-1.012
ersen-0.976
 Rik-0.918
 COURT-0.914
Token May
Token ID1737
Feature activation+1.22e+00

Pos logit contributions
phe+0.377
rophe+0.334
ths+0.302
riad+0.297
ient+0.292
Neg logit contributions
 Probe-0.342
 Nish-0.324
ersen-0.312
 Rik-0.292
 COURT-0.290
Token about
Token ID546
Feature activation+5.47e-01

Pos logit contributions
phe+2.436
rophe+2.146
ths+1.929
riad+1.890
ient+1.861
Neg logit contributions
 Probe-2.456
 Nish-2.325
ersen-2.241
 Rik-2.112
 Trafford-2.098
Token the
Token ID262
Feature activation-1.21e-01

Pos logit contributions
phe+1.125
rophe+0.987
ths+0.891
riad+0.877
ient+0.861
Neg logit contributions
 Probe-1.083
 Nish-1.024
ersen-0.984
 Rik-0.925
 COURT-0.920
Token del
Token ID1619
Feature activation+3.14e+00

Pos logit contributions
 Probe+0.232
 Nish+0.219
ersen+0.210
 Rik+0.197
 NP+0.196
Neg logit contributions
phe-0.257
rophe-0.226
ths-0.204
riad-0.201
ient-0.198
Tokenuge
Token ID2217
Feature activation+1.27e+00

Pos logit contributions
phe+4.860
rophe+3.999
ths+3.459
riad+3.342
ient+3.322
Neg logit contributions
 Probe-7.764
 Nish-7.408
ersen-7.126
 Trafford-6.873
 Rik-6.827
Token of
Token ID286
Feature activation+4.49e-02

Pos logit contributions
phe+3.015
rophe+2.698
ths+2.471
riad+2.439
ient+2.405
Neg logit contributions
 Probe-2.117
 Nish-1.976
ersen-1.882
 Rik-1.744
 COURT-1.732
Token anti
Token ID3098
Feature activation-1.19e+00

Pos logit contributions
phe+0.103
rophe+0.092
ths+0.084
riad+0.083
ient+0.082
Neg logit contributions
 Probe-0.078
 Nish-0.073
ersen-0.070
 Rik-0.065
 COURT-0.065
Token-
Token ID12
Feature activation-5.25e-01

Pos logit contributions
 Probe+1.826
 Nish+1.697
ersen+1.602
 NP+1.468
 Rik+1.466
Neg logit contributions
phe-2.955
rophe-2.651
ths-2.443
riad-2.419
ient-2.386
TokenSemitic
Token ID28753
Feature activation+1.73e-01

Pos logit contributions
 Probe+1.136
 Nish+1.080
ersen+1.040
 NP+0.981
 Rik+0.979
Neg logit contributions
phe-0.979
rophe-0.850
ths-0.752
riad-0.741
ient-0.727
Token
Token ID40367
Feature activation-6.18e-01

Pos logit contributions
 Probe+0.593
 Nish+0.553
ersen+0.524
 Rik+0.483
 COURT+0.479
Neg logit contributions
phe-0.946
rophe-0.850
ths-0.781
riad-0.773
ient-0.762
Token
Token ID253
Feature activation+2.94e-01

Pos logit contributions
 Probe+0.952
 Nish+0.885
ersen+0.840
 Rik+0.773
 COURT+0.766
Neg logit contributions
phe-1.546
rophe-1.391
ths-1.279
riad-1.266
ient-1.248
Token
Token ID161
Feature activation+1.29e+00

Pos logit contributions
phe+0.736
rophe+0.662
ths+0.609
riad+0.604
ient+0.594
Neg logit contributions
 Probe-0.451
 Nish-0.415
ersen-0.397
 Rik-0.362
 COURT-0.361
Token
Token ID254
Feature activation+4.13e-01

Pos logit contributions
phe+2.763
rophe+2.451
ths+2.225
riad+2.199
ient+2.160
Neg logit contributions
 Probe-2.423
 Nish-2.269
ersen-2.176
 Rik-2.037
 COURT-2.034
Token
Token ID109
Feature activation+4.80e-01

Pos logit contributions
phe+0.858
rophe+0.753
ths+0.680
riad+0.670
ient+0.660
Neg logit contributions
 Probe-0.811
 Nish-0.766
ersen-0.735
 Rik-0.691
 COURT-0.687
Token
Token ID27332
Feature activation+3.11e+00

Pos logit contributions
phe+1.183
rophe+1.065
ths+0.976
riad+0.969
ient+0.954
Neg logit contributions
 Probe-0.757
 Nish-0.700
ersen-0.666
 Rik-0.612
 Trafford-0.610
Token
Token ID121
Feature activation-5.84e-01

Pos logit contributions
phe+6.315
rophe+5.693
ths+5.091
riad+4.983
ient+4.953
Neg logit contributions
 Probe-5.886
 Nish-5.483
ersen-5.290
 Rik-4.986
 COURT-4.943
Token
Token ID252
Feature activation+1.97e-01

Pos logit contributions
 Probe+1.013
 Nish+0.951
ersen+0.912
 Rik+0.842
 Trafford+0.836
Neg logit contributions
phe-1.352
rophe-1.195
ths-1.087
riad-1.080
ient-1.059
Token Alternative
Token ID27182
Feature activation-1.13e-01

Pos logit contributions
phe+0.495
rophe+0.446
ths+0.410
riad+0.406
ient+0.400
Neg logit contributions
 Probe-0.300
 Nish-0.279
ersen-0.264
 Rik-0.243
 COURT-0.241
Token Facts
Token ID26972
Feature activation+1.18e+00

Pos logit contributions
 Probe+0.215
 Nish+0.203
ersen+0.194
 Rik+0.183
 NP+0.181
Neg logit contributions
phe-0.242
rophe-0.214
ths-0.193
riad-0.191
ient-0.188
Token in
Token ID287
Feature activation-1.41e-02

Pos logit contributions
phe+2.566
rophe+2.276
ths+2.065
riad+2.044
ient+2.011
Neg logit contributions
 Probe-2.189
 Nish-2.055
ersen-1.986
 Rik-1.837
 COURT-1.836
Token dust
Token ID8977
Feature activation-3.94e-01

Pos logit contributions
phe+0.391
rophe+0.344
ths+0.309
riad+0.306
ient+0.300
Neg logit contributions
 Probe-0.366
 Nish-0.346
ersen-0.331
 Rik-0.312
 NP-0.310
Token?
Token ID30
Feature activation+7.22e-02

Pos logit contributions
 Probe+0.692
 Nish+0.649
ersen+0.618
 Rik+0.579
 NP+0.573
Neg logit contributions
phe-0.893
rophe-0.798
ths-0.724
riad-0.717
ient-0.702
Token
Token ID198
Feature activation+3.57e-01

Pos logit contributions
phe+0.170
rophe+0.152
ths+0.139
riad+0.137
ient+0.135
Neg logit contributions
 Probe-0.122
 Nish-0.114
ersen-0.109
 Rik-0.101
 COURT-0.100
Token
Token ID198
Feature activation+2.97e-01

Pos logit contributions
phe+0.816
rophe+0.724
ths+0.670
riad+0.657
ient+0.645
Neg logit contributions
 Probe-0.627
 Nish-0.585
ersen-0.564
 Rik-0.522
 COURT-0.520
Tokenman
Token ID805
Feature activation+1.26e-01

Pos logit contributions
phe+0.629
rophe+0.555
ths+0.501
riad+0.494
ient+0.486
Neg logit contributions
 Probe-0.573
 Nish-0.541
ersen-0.519
 Rik-0.486
 NP-0.483
Token del
Token ID1619
Feature activation+3.09e+00

Pos logit contributions
phe+0.270
rophe+0.239
ths+0.216
riad+0.214
ient+0.210
Neg logit contributions
 Probe-0.237
 Nish-0.224
ersen-0.214
 Rik-0.201
 NP-0.199
Tokenights
Token ID2337
Feature activation+8.81e-01

Pos logit contributions
phe+4.593
rophe+3.618
ths+3.264
ient+3.096
riad+3.065
Neg logit contributions
 Probe-7.836
 Nish-7.466
ersen-7.182
 Trafford-6.959
 Rik-6.948
Token not
Token ID407
Feature activation+1.32e-01

Pos logit contributions
phe+1.912
rophe+1.688
ths+1.536
riad+1.512
ient+1.492
Neg logit contributions
 Probe-1.646
 Nish-1.548
ersen-1.488
 Rik-1.388
 COURT-1.384
Token me
Token ID502
Feature activation+6.06e-01

Pos logit contributions
phe+0.292
rophe+0.259
ths+0.234
riad+0.232
ient+0.228
Neg logit contributions
 Probe-0.241
 Nish-0.227
ersen-0.216
 Rik-0.203
 NP-0.201
Token;
Token ID26
Feature activation+3.45e-01

Pos logit contributions
phe+1.323
rophe+1.172
ths+1.060
riad+1.048
ient+1.029
Neg logit contributions
 Probe-1.127
 Nish-1.061
ersen-1.016
 Rik-0.951
 COURT-0.943
Token no
Token ID645
Feature activation+5.20e-01

Pos logit contributions
phe+0.785
rophe+0.698
ths+0.635
riad+0.628
ient+0.618
Neg logit contributions
 Probe-0.611
 Nish-0.574
ersen-0.548
 Rik-0.511
 COURT-0.507
Token the
Token ID262
Feature activation-3.10e-03

Pos logit contributions
phe+0.524
rophe+0.464
ths+0.421
riad+0.416
ient+0.409
Neg logit contributions
 Probe-0.442
 Nish-0.417
ersen-0.399
 Rik-0.373
 COURT-0.370
Token family
Token ID1641
Feature activation-2.59e-01

Pos logit contributions
 Probe+0.006
 Nish+0.006
ersen+0.006
 Rik+0.005
 NP+0.005
Neg logit contributions
phe-0.006
rophe-0.005
ths-0.005
riad-0.005
ient-0.005
Token also
Token ID635
Feature activation+8.11e-01

Pos logit contributions
 Probe+0.519
 Nish+0.492
ersen+0.473
 Rik+0.444
 NP+0.442
Neg logit contributions
phe-0.526
rophe-0.463
ths-0.414
riad-0.410
ient-0.403
Token
Token ID564
Feature activation+2.19e-01

Pos logit contributions
phe+1.497
rophe+1.288
ths+1.143
riad+1.124
ient+1.104
Neg logit contributions
 Probe-1.781
 Nish-1.688
ersen-1.629
 Rik-1.542
 COURT-1.533
Token
Token ID250
Feature activation-3.33e-01

Pos logit contributions
phe+0.627
rophe+0.574
ths+0.533
riad+0.529
ient+0.523
Neg logit contributions
 Probe-0.259
 Nish-0.236
ersen-0.218
 Rik-0.196
 Trafford-0.193
Tokendel
Token ID12381
Feature activation+3.05e+00

Pos logit contributions
 Probe+0.667
 Nish+0.633
ersen+0.614
 Rik+0.572
 COURT+0.569
Neg logit contributions
phe-0.668
rophe-0.590
ths-0.528
riad-0.520
ient-0.512
Tokenved
Token ID1079
Feature activation+8.08e-01

Pos logit contributions
phe+3.974
rophe+3.116
ths+2.610
ient+2.606
riad+2.582
Neg logit contributions
 Probe-8.316
 Nish-7.876
ersen-7.679
 Trafford-7.376
 Rik-7.368
Token into
Token ID656
Feature activation-9.68e-02

Pos logit contributions
phe+2.004
rophe+1.799
ths+1.656
riad+1.637
ient+1.616
Neg logit contributions
 Probe-1.263
 Nish-1.175
ersen-1.115
 Rik-1.029
 COURT-1.020
Token the
Token ID262
Feature activation+1.82e-01

Pos logit contributions
 Probe+0.187
 Nish+0.177
ersen+0.169
 Rik+0.159
 NP+0.158
Neg logit contributions
phe-0.205
rophe-0.181
ths-0.163
riad-0.161
ient-0.158
Token oil
Token ID3056
Feature activation+4.12e-01

Pos logit contributions
phe+0.386
rophe+0.340
ths+0.307
riad+0.303
ient+0.298
Neg logit contributions
 Probe-0.348
 Nish-0.329
ersen-0.315
 Rik-0.296
 COURT-0.293
Token-
Token ID12
Feature activation+5.80e-01

Pos logit contributions
phe+0.872
rophe+0.771
ths+0.697
riad+0.687
ient+0.675
Neg logit contributions
 Probe-0.792
 Nish-0.747
ersen-0.713
 Rik-0.672
 NP-0.667
Token sound
Token ID2128
Feature activation+7.46e-01

Pos logit contributions
 Probe+0.011
 Nish+0.011
ersen+0.010
 Rik+0.009
 COURT+0.009
Neg logit contributions
phe-0.013
rophe-0.012
ths-0.011
riad-0.011
ient-0.010
Token,
Token ID11
Feature activation-1.02e-02

Pos logit contributions
phe+1.587
rophe+1.400
ths+1.265
riad+1.248
ient+1.228
Neg logit contributions
 Probe-1.431
 Nish-1.349
ersen-1.294
 Rik-1.212
 COURT-1.205
Token something
Token ID1223
Feature activation-1.13e+00

Pos logit contributions
 Probe+0.019
 Nish+0.018
ersen+0.018
 Rik+0.016
 NP+0.016
Neg logit contributions
phe-0.022
rophe-0.019
ths-0.018
riad-0.017
ient-0.017
Token that
Token ID326
Feature activation+3.28e-01

Pos logit contributions
 Probe+2.220
 Nish+2.109
ersen+2.003
 Rik+1.909
 NP+1.896
Neg logit contributions
phe-2.344
rophe-2.052
riad-1.842
ths-1.841
ient-1.784
Token Phil
Token ID4543
Feature activation+1.52e+00

Pos logit contributions
phe+0.683
rophe+0.600
ths+0.541
riad+0.532
ient+0.525
Neg logit contributions
 Probe-0.642
 Nish-0.605
ersen-0.583
 Rik-0.545
 COURT-0.544
Tokenadel
Token ID6959
Feature activation+3.05e+00

Pos logit contributions
phe+3.328
rophe+2.933
ths+2.672
riad+2.626
ient+2.595
Neg logit contributions
 Probe-2.825
 Nish-2.656
ersen-2.548
 Rik-2.386
 COURT-2.373
Tokenph
Token ID746
Feature activation+1.34e+00

Pos logit contributions
phe+3.780
rophe+2.858
ths+2.394
riad+2.289
ient+2.267
Neg logit contributions
 Probe-8.534
 Nish-8.256
ersen-8.043
 COURT-7.704
 Rik-7.700
Tokenians
Token ID1547
Feature activation+1.40e+00

Pos logit contributions
phe+2.446
rophe+2.107
ths+1.861
riad+1.835
ient+1.792
Neg logit contributions
 Probe-2.971
 Nish-2.823
ersen-2.728
 Rik-2.580
 COURT-2.563
Token do
Token ID466
Feature activation+4.16e-01

Pos logit contributions
phe+2.774
rophe+2.431
ths+2.206
ient+2.121
riad+2.115
Neg logit contributions
 Probe-2.856
 Nish-2.687
ersen-2.599
 Rik-2.436
 Trafford-2.427
Token more
Token ID517
Feature activation-7.70e-01

Pos logit contributions
phe+0.880
rophe+0.775
ths+0.698
riad+0.691
ient+0.678
Neg logit contributions
 Probe-0.803
 Nish-0.759
ersen-0.728
 Rik-0.683
 COURT-0.679
Token frequently
Token ID6777
Feature activation+8.22e-01

Pos logit contributions
 Probe+1.614
 Nish+1.544
ersen+1.484
 Rik+1.407
 NP+1.399
Neg logit contributions
phe-1.489
rophe-1.298
riad-1.141
ths-1.141
ovie-1.112
Token said
Token ID531
Feature activation-1.84e-02

Pos logit contributions
phe+0.580
rophe+0.516
ths+0.470
riad+0.465
ient+0.458
Neg logit contributions
 Probe-0.444
 Nish-0.416
ersen-0.398
 Rik-0.370
 COURT-0.368
Token too
Token ID1165
Feature activation-1.19e+00

Pos logit contributions
 Probe+0.033
 Nish+0.031
ersen+0.030
 Rik+0.028
 NP+0.027
Neg logit contributions
phe-0.041
rophe-0.037
ths-0.033
riad-0.033
ient-0.032
Token about
Token ID546
Feature activation+1.88e-01

Pos logit contributions
 Probe+2.185
 Nish+2.071
ersen+1.982
 Rik+1.852
 NP+1.825
Neg logit contributions
phe-2.608
rophe-2.300
riad-2.089
ths-2.083
ient-2.012
Token what
Token ID644
Feature activation-9.30e-01

Pos logit contributions
phe+0.388
rophe+0.341
ths+0.307
riad+0.304
ient+0.297
Neg logit contributions
 Probe-0.372
 Nish-0.352
ersen-0.337
 Rik-0.317
 NP-0.315
Token Phil
Token ID4543
Feature activation+1.70e+00

Pos logit contributions
 Probe+1.783
 Nish+1.687
ersen+1.607
 Rik+1.520
 NP+1.513
Neg logit contributions
phe-1.961
rophe-1.735
ths-1.565
riad-1.551
ient-1.514
Tokenadel
Token ID6959
Feature activation+3.05e+00

Pos logit contributions
phe+2.563
rophe+2.132
ths+1.832
riad+1.787
ient+1.758
Neg logit contributions
 Probe-4.323
 Nish-4.140
ersen-3.995
 Rik-3.826
 COURT-3.814
Tokenph
Token ID746
Feature activation+1.59e+00

Pos logit contributions
phe+3.613
rophe+2.576
ths+2.220
ient+2.083
riad+2.032
Neg logit contributions
 Probe-8.776
 Nish-8.520
ersen-8.164
 Rik-8.017
 COURT-7.937
Tokenians
Token ID1547
Feature activation+1.46e+00

Pos logit contributions
phe+2.833
rophe+2.435
ths+2.150
riad+2.112
ient+2.077
Neg logit contributions
 Probe-3.579
 Nish-3.411
ersen-3.287
 Rik-3.122
 COURT-3.105
Token do
Token ID466
Feature activation+7.49e-01

Pos logit contributions
phe+2.911
rophe+2.564
ths+2.317
ient+2.243
riad+2.226
Neg logit contributions
 Probe-2.959
 Nish-2.771
ersen-2.693
 Trafford-2.520
 Rik-2.516
Token with
Token ID351
Feature activation+1.61e-01

Pos logit contributions
phe+1.601
rophe+1.413
ths+1.277
riad+1.261
ient+1.240
Neg logit contributions
 Probe-1.429
 Nish-1.348
ersen-1.292
 Rik-1.211
 COURT-1.203
Token their
Token ID511
Feature activation+3.99e-01

Pos logit contributions
phe+0.308
rophe+0.268
ths+0.239
riad+0.236
ient+0.230
Neg logit contributions
 Probe-0.344
 Nish-0.327
ersen-0.314
 Rik-0.297
 NP-0.296
Token of
Token ID286
Feature activation+9.93e-02

Pos logit contributions
 Probe+0.108
 Nish+0.102
ersen+0.098
 Rik+0.091
 NP+0.091
Neg logit contributions
phe-0.126
rophe-0.112
ths-0.101
riad-0.100
ient-0.098
Token Virginia
Token ID6025
Feature activation+8.56e-01

Pos logit contributions
phe+0.219
rophe+0.194
ths+0.176
riad+0.174
ient+0.171
Neg logit contributions
 Probe-0.183
 Nish-0.172
ersen-0.164
 Rik-0.154
 COURT-0.153
Token.
Token ID13
Feature activation+7.99e-03

Pos logit contributions
phe+1.845
rophe+1.626
ths+1.473
riad+1.455
ient+1.438
Neg logit contributions
 Probe-1.607
 Nish-1.520
ersen-1.461
 Rik-1.363
 Trafford-1.355
Token
Token ID198
Feature activation-7.66e-02

Pos logit contributions
phe+0.019
rophe+0.017
ths+0.015
riad+0.015
ient+0.015
Neg logit contributions
 Probe-0.014
 Nish-0.013
ersen-0.012
 Rik-0.011
 COURT-0.011
Token
Token ID198
Feature activation-2.55e-02

Pos logit contributions
 Probe+0.131
 Nish+0.122
ersen+0.117
 Rik+0.108
 Trafford+0.108
Neg logit contributions
phe-0.179
rophe-0.159
ths-0.147
riad-0.145
ient-0.142
TokenDel
Token ID13856
Feature activation+3.00e+00

Pos logit contributions
 Probe+0.047
 Nish+0.044
ersen+0.042
 COURT+0.039
 Rik+0.039
Neg logit contributions
phe-0.056
rophe-0.050
ths-0.045
riad-0.044
ient-0.043
Token.
Token ID13
Feature activation+4.65e-02

Pos logit contributions
phe+6.033
rophe+5.223
ths+4.730
ient+4.676
riad+4.643
Neg logit contributions
 Probe-6.008
 Nish-5.651
ersen-5.444
 Rik-5.147
 Trafford-5.128
Token Todd
Token ID14377
Feature activation+6.65e-01

Pos logit contributions
phe+0.104
rophe+0.092
ths+0.084
riad+0.083
ient+0.081
Neg logit contributions
 Probe-0.084
 Nish-0.080
ersen-0.076
 Rik-0.071
 COURT-0.070
Token Gilbert
Token ID24023
Feature activation+8.34e-01

Pos logit contributions
phe+1.551
rophe+1.384
ths+1.263
riad+1.249
ient+1.229
Neg logit contributions
 Probe-1.140
 Nish-1.069
ersen-1.019
 Rik-0.947
 COURT-0.939
Token (
Token ID357
Feature activation+5.01e-01

Pos logit contributions
phe+1.796
rophe+1.581
ths+1.437
riad+1.412
ient+1.400
Neg logit contributions
 Probe-1.566
 Nish-1.475
ersen-1.419
 Rik-1.324
 Trafford-1.318
TokenR
Token ID49
Feature activation+7.09e-01

Pos logit contributions
phe+1.059
rophe+0.935
ths+0.844
riad+0.830
ient+0.817
Neg logit contributions
 Probe-0.966
 Nish-0.911
ersen-0.874
 Rik-0.821
 COURT-0.815
Token-
Token ID12
Feature activation-1.60e-01

Pos logit contributions
 Probe+0.554
 Nish+0.522
ersen+0.486
 Rik+0.458
 NP+0.452
Neg logit contributions
phe-0.861
rophe-0.770
ths-0.705
riad-0.698
ient-0.687
Tokengovern
Token ID47866
Feature activation-1.20e+00

Pos logit contributions
 Probe+0.307
 Nish+0.291
ersen+0.279
 Rik+0.262
 NP+0.260
Neg logit contributions
phe-0.338
rophe-0.298
ths-0.269
riad-0.265
ient-0.260
Tokened
Token ID276
Feature activation+3.62e-01

Pos logit contributions
 Probe+2.232
 Nish+2.100
ersen+2.024
 Rik+1.885
 NP+1.878
Neg logit contributions
phe-2.615
rophe-2.270
ths-2.049
riad-2.031
ovie-1.972
Token organisation
Token ID12684
Feature activation+5.22e-03

Pos logit contributions
phe+0.764
rophe+0.674
ths+0.611
riad+0.602
ient+0.593
Neg logit contributions
 Probe-0.696
 Nish-0.654
ersen-0.631
 Rik-0.589
 COURT-0.588
Token "
Token ID366
Feature activation+1.87e-01

Pos logit contributions
phe+0.011
rophe+0.010
ths+0.009
riad+0.009
ient+0.009
Neg logit contributions
 Probe-0.010
 Nish-0.009
ersen-0.009
 Rik-0.008
 COURT-0.008
Tokendel
Token ID12381
Feature activation+2.96e+00

Pos logit contributions
phe+0.403
rophe+0.356
ths+0.322
riad+0.318
ient+0.313
Neg logit contributions
 Probe-0.353
 Nish-0.333
ersen-0.318
 Rik-0.298
 COURT-0.297
Tokenivers
Token ID1191
Feature activation+7.68e-01

Pos logit contributions
phe+4.928
rophe+4.089
ths+3.676
ient+3.573
riad+3.512
Neg logit contributions
 Probe-7.015
 Nish-6.604
ersen-6.399
 Rik-6.115
 Trafford-6.092
Token emergency
Token ID6334
Feature activation+1.44e+00

Pos logit contributions
phe+1.605
rophe+1.419
ths+1.284
riad+1.261
ient+1.249
Neg logit contributions
 Probe-1.487
 Nish-1.397
ersen-1.351
 Rik-1.261
 COURT-1.257
Token aid
Token ID6133
Feature activation+6.50e-01

Pos logit contributions
phe+2.817
rophe+2.452
ths+2.221
riad+2.163
ient+2.142
Neg logit contributions
 Probe-2.958
 Nish-2.771
ersen-2.703
 Rik-2.531
 COURT-2.525
Token to
Token ID284
Feature activation+1.25e-02

Pos logit contributions
phe+1.419
rophe+1.256
ths+1.142
riad+1.123
ient+1.109
Neg logit contributions
 Probe-1.205
 Nish-1.133
ersen-1.087
 Rik-1.014
 COURT-1.011
Token people
Token ID661
Feature activation+9.43e-01

Pos logit contributions
phe+0.026
rophe+0.023
ths+0.021
riad+0.020
ient+0.020
Neg logit contributions
 Probe-0.025
 Nish-0.023
ersen-0.022
 Rik-0.021
 NP-0.021
Token consequences
Token ID6948
Feature activation-9.75e-01

Pos logit contributions
 Probe+0.263
 Nish+0.248
ersen+0.238
 Rik+0.224
 NP+0.222
Neg logit contributions
phe-0.286
rophe-0.252
ths-0.227
riad-0.224
ient-0.220
Token of
Token ID286
Feature activation+4.32e-02

Pos logit contributions
 Probe+1.492
 Nish+1.407
ersen+1.335
 Rik+1.237
 NP+1.228
Neg logit contributions
phe-2.432
rophe-2.169
ths-2.003
riad-2.000
ient-1.930
Token the
Token ID262
Feature activation-9.76e-02

Pos logit contributions
phe+0.089
rophe+0.078
ths+0.070
riad+0.069
ient+0.068
Neg logit contributions
 Probe-0.086
 Nish-0.082
ersen-0.078
 Rik-0.074
 NP-0.074
Token
Token ID564
Feature activation-1.50e-01

Pos logit contributions
 Probe+0.181
 Nish+0.171
ersen+0.163
 Rik+0.153
 NP+0.152
Neg logit contributions
phe-0.214
rophe-0.189
ths-0.171
riad-0.170
ient-0.167
Token
Token ID246
Feature activation+3.22e-01

Pos logit contributions
 Probe+0.275
 Nish+0.259
ersen+0.248
 Rik+0.232
 COURT+0.230
Neg logit contributions
phe-0.332
rophe-0.294
ths-0.267
riad-0.264
ient-0.259
Tokendel
Token ID12381
Feature activation+2.94e+00

Pos logit contributions
phe+0.682
rophe+0.600
ths+0.542
riad+0.536
ient+0.527
Neg logit contributions
 Probe-0.620
 Nish-0.585
ersen-0.562
 Rik-0.527
 COURT-0.523
Tokenayed
Token ID16548
Feature activation+1.38e+00

Pos logit contributions
phe+3.781
rophe+3.057
ths+2.496
ient+2.446
riad+2.408
Neg logit contributions
 Probe-8.031
 Nish-7.724
ersen-7.414
 Rik-7.217
 Trafford-7.150
Token unfolding
Token ID34502
Feature activation+7.28e-01

Pos logit contributions
phe+2.895
rophe+2.567
ths+2.315
riad+2.269
ient+2.236
Neg logit contributions
 Probe-2.661
 Nish-2.516
ersen-2.414
 Rik-2.273
 COURT-2.243
Token of
Token ID286
Feature activation+2.78e-01

Pos logit contributions
phe+1.750
rophe+1.564
ths+1.432
riad+1.418
ient+1.397
Neg logit contributions
 Probe-1.195
 Nish-1.118
ersen-1.065
 Rik-0.985
 COURT-0.977
Token independence
Token ID10404
Feature activation+1.00e+00

Pos logit contributions
phe+0.614
rophe+0.542
ths+0.491
riad+0.486
ient+0.478
Neg logit contributions
 Probe-0.513
 Nish-0.484
ersen-0.462
 Rik-0.434
 NP-0.431
Token
Token ID447
Feature activation+4.23e-02

Pos logit contributions
phe+2.113
rophe+1.865
ths+1.682
riad+1.656
ient+1.630
Neg logit contributions
 Probe-1.944
 Nish-1.835
ersen-1.760
 Rik-1.650
 COURT-1.639
Token manner
Token ID5642
Feature activation+1.21e-01

Pos logit contributions
phe+0.059
rophe+0.053
ths+0.047
riad+0.047
ient+0.046
Neg logit contributions
 Probe-0.054
 Nish-0.051
ersen-0.048
 Rik-0.046
 NP-0.046
Token in
Token ID287
Feature activation+8.98e-02

Pos logit contributions
phe+0.279
rophe+0.249
ths+0.226
riad+0.225
ient+0.220
Neg logit contributions
 Probe-0.209
 Nish-0.197
ersen-0.187
 Rik-0.174
 NP-0.173
Token which
Token ID543
Feature activation-3.15e-01

Pos logit contributions
phe+0.186
rophe+0.164
ths+0.147
riad+0.146
ient+0.143
Neg logit contributions
 Probe-0.177
 Nish-0.168
ersen-0.160
 Rik-0.151
 NP-0.150
Token Black
Token ID2619
Feature activation-5.67e-01

Pos logit contributions
 Probe+0.569
 Nish+0.539
ersen+0.505
 Rik+0.479
 NP+0.478
Neg logit contributions
phe-0.702
rophe-0.625
ths-0.566
riad-0.563
ovie-0.550
Token Phil
Token ID4543
Feature activation+1.50e+00

Pos logit contributions
 Probe+1.203
 Nish+1.145
ersen+1.089
 Rik+1.036
 NP+1.034
Neg logit contributions
phe-1.078
rophe-0.943
riad-0.830
ths-0.828
ovie-0.808
Tokenadel
Token ID6959
Feature activation+2.94e+00

Pos logit contributions
phe+2.283
rophe+1.905
ths+1.642
riad+1.601
ient+1.566
Neg logit contributions
 Probe-3.777
 Nish-3.615
ersen-3.496
 Rik-3.343
 COURT-3.326
Tokenph
Token ID746
Feature activation+1.39e+00

Pos logit contributions
phe+3.520
rophe+2.516
ths+2.221
ient+2.069
riad+1.992
Neg logit contributions
 Probe-8.419
 Nish-8.141
ersen-7.871
 Rik-7.663
 COURT-7.584
Tokenians
Token ID1547
Feature activation+1.33e+00

Pos logit contributions
phe+2.511
rophe+2.160
ths+1.908
riad+1.878
ient+1.839
Neg logit contributions
 Probe-3.124
 Nish-2.973
ersen-2.869
 Rik-2.719
 COURT-2.703
Token use
Token ID779
Feature activation-1.67e-01

Pos logit contributions
phe+2.379
rophe+2.033
ths+1.820
riad+1.755
ient+1.730
Neg logit contributions
 Probe-2.985
 Nish-2.816
ersen-2.740
 Rik-2.589
 Trafford-2.579
Token a
Token ID257
Feature activation+1.51e-01

Pos logit contributions
 Probe+0.352
 Nish+0.335
ersen+0.320
 Rik+0.305
 NP+0.304
Neg logit contributions
phe-0.321
rophe-0.279
ths-0.248
riad-0.246
ient-0.241
Token neutral
Token ID8500
Feature activation-8.53e-01

Pos logit contributions
phe+0.237
rophe+0.199
ths+0.172
riad+0.168
ient+0.164
Neg logit contributions
 Probe-0.373
 Nish-0.357
ersen-0.345
 Rik-0.329
 NP-0.327
Token But
Token ID887
Feature activation-7.10e-01

Pos logit contributions
 Probe+0.205
 Nish+0.191
ersen+0.181
 Rik+0.167
 COURT+0.166
Neg logit contributions
phe-0.308
rophe-0.276
ths-0.253
riad-0.251
ient-0.247
Token efforts
Token ID4040
Feature activation-5.87e-01

Pos logit contributions
 Probe+1.238
 Nish+1.167
ersen+1.114
 Rik+1.031
 NP+1.025
Neg logit contributions
phe-1.641
rophe-1.454
ths-1.319
riad-1.312
ient-1.288
Token to
Token ID284
Feature activation-1.34e-01

Pos logit contributions
 Probe+1.026
 Nish+0.964
ersen+0.922
 Rik+0.856
 NP+0.847
Neg logit contributions
phe-1.348
rophe-1.200
ths-1.091
riad-1.082
ient-1.061
Token restore
Token ID11169
Feature activation+5.77e-01

Pos logit contributions
 Probe+0.258
 Nish+0.243
ersen+0.233
 Rik+0.220
 Trafford+0.218
Neg logit contributions
phe-0.281
rophe-0.247
ths-0.224
riad-0.220
ient-0.217
Token ties
Token ID8470
Feature activation-8.26e-01

Pos logit contributions
phe+1.301
rophe+1.158
ths+1.058
riad+1.040
ient+1.030
Neg logit contributions
 Probe-1.023
 Nish-0.956
ersen-0.916
 COURT-0.854
 Rik-0.853
Token fal
Token ID24215
Feature activation+2.93e+00

Pos logit contributions
 Probe+1.534
 Nish+1.449
ersen+1.391
 Rik+1.297
 NP+1.290
Neg logit contributions
phe-1.801
rophe-1.590
ths-1.439
riad-1.430
ient-1.396
Tokentered
Token ID4400
Feature activation+8.78e-01

Pos logit contributions
phe+4.355
rophe+3.580
ths+3.228
riad+3.019
ient+2.980
Neg logit contributions
 Probe-7.367
 Nish-7.016
ersen-6.785
 COURT-6.574
 Trafford-6.561
Token amid
Token ID10371
Feature activation-8.41e-03

Pos logit contributions
phe+1.929
rophe+1.708
ths+1.553
riad+1.530
ient+1.508
Neg logit contributions
 Probe-1.618
 Nish-1.522
ersen-1.458
 Rik-1.363
 COURT-1.353
Token new
Token ID649
Feature activation+2.25e-01

Pos logit contributions
 Probe+0.015
 Nish+0.015
ersen+0.014
 Rik+0.013
 COURT+0.013
Neg logit contributions
phe-0.019
rophe-0.016
ths-0.015
riad-0.015
ient-0.014
Token Israeli
Token ID6085
Feature activation-4.73e-01

Pos logit contributions
phe+0.506
rophe+0.451
ths+0.417
riad+0.409
ient+0.401
Neg logit contributions
 Probe-0.394
 Nish-0.369
ersen-0.356
 COURT-0.330
 Rik-0.330
Token action
Token ID2223
Feature activation-1.32e-01

Pos logit contributions
 Probe+0.845
 Nish+0.796
ersen+0.764
 Rik+0.710
 Trafford+0.707
Neg logit contributions
phe-1.059
rophe-0.941
ths-0.862
riad-0.849
ient-0.834
Token
Token ID198
Feature activation-1.87e-01

Pos logit contributions
 Probe+0.187
 Nish+0.175
ersen+0.166
 Rik+0.155
 COURT+0.154
Neg logit contributions
phe-0.251
rophe-0.224
ths-0.204
riad-0.203
ient-0.200
Token
Token ID198
Feature activation-1.27e-01

Pos logit contributions
 Probe+0.317
 Nish+0.296
ersen+0.283
 Rik+0.262
 Trafford+0.260
Neg logit contributions
phe-0.440
rophe-0.392
ths-0.361
riad-0.356
ient-0.350
TokenThe
Token ID464
Feature activation+5.29e-02

Pos logit contributions
 Probe+0.240
 Nish+0.227
ersen+0.217
 Rik+0.203
 NP+0.202
Neg logit contributions
phe-0.272
rophe-0.242
ths-0.217
riad-0.216
ient-0.212
Token way
Token ID835
Feature activation+1.85e-01

Pos logit contributions
phe+0.112
rophe+0.099
ths+0.089
riad+0.088
ient+0.087
Neg logit contributions
 Probe-0.101
 Nish-0.096
ersen-0.091
 Rik-0.086
 NP-0.086
Token Phil
Token ID4543
Feature activation+1.91e+00

Pos logit contributions
phe+0.410
rophe+0.364
ths+0.330
riad+0.326
ient+0.321
Neg logit contributions
 Probe-0.338
 Nish-0.318
ersen-0.304
 Rik-0.285
 COURT-0.282
Tokenadel
Token ID6959
Feature activation+2.92e+00

Pos logit contributions
phe+2.757
rophe+2.260
ths+1.948
riad+1.890
ient+1.866
Neg logit contributions
 Probe-4.945
 Nish-4.741
ersen-4.541
 Rik-4.391
 COURT-4.376
Tokenph
Token ID746
Feature activation+1.48e+00

Pos logit contributions
phe+3.362
rophe+2.362
ths+2.047
ient+1.884
riad+1.820
Neg logit contributions
 Probe-8.580
 Nish-8.329
ersen-7.976
 Rik-7.868
 COURT-7.760
Tokenians
Token ID1547
Feature activation+1.44e+00

Pos logit contributions
phe+2.654
rophe+2.281
ths+2.017
riad+1.982
ient+1.946
Neg logit contributions
 Probe-3.331
 Nish-3.175
ersen-3.057
 Rik-2.903
 COURT-2.886
Token say
Token ID910
Feature activation-4.00e-01

Pos logit contributions
phe+2.814
rophe+2.444
ths+2.209
riad+2.139
ient+2.116
Neg logit contributions
 Probe-2.983
 Nish-2.801
ersen-2.720
 Rik-2.562
 Trafford-2.553
Token the
Token ID262
Feature activation+1.26e-01

Pos logit contributions
 Probe+0.771
 Nish+0.735
ersen+0.698
 Rik+0.661
 NP+0.659
Neg logit contributions
phe-0.840
rophe-0.739
ths-0.664
riad-0.661
ient-0.646
Token au
Token ID35851
Feature activation-3.64e-01

Pos logit contributions
phe+0.245
rophe+0.213
ths+0.190
riad+0.189
ient+0.185
Neg logit contributions
 Probe-0.264
 Nish-0.252
ersen-0.241
 Rik-0.228
 NP-0.227
Token bar
Token ID2318
Feature activation+8.80e-01

Pos logit contributions
 Probe+0.274
 Nish+0.258
ersen+0.248
 Rik+0.232
 COURT+0.231
Neg logit contributions
phe-0.301
rophe-0.265
ths-0.240
riad-0.236
ient-0.233
Tokenre
Token ID260
Feature activation-2.35e-01

Pos logit contributions
phe+1.815
rophe+1.594
ths+1.439
riad+1.418
ient+1.395
Neg logit contributions
 Probe-1.742
 Nish-1.644
ersen-1.580
 Rik-1.483
 COURT-1.479
Tokenling
Token ID1359
Feature activation-1.85e-01

Pos logit contributions
 Probe+0.341
 Nish+0.316
ersen+0.299
 Rik+0.274
 NP+0.273
Neg logit contributions
phe-0.608
rophe-0.547
ths-0.504
riad-0.501
ient-0.493
Token through
Token ID832
Feature activation+6.06e-01

Pos logit contributions
 Probe+0.345
 Nish+0.326
ersen+0.311
 Rik+0.293
 NP+0.290
Neg logit contributions
phe-0.405
rophe-0.358
ths-0.323
riad-0.321
ient-0.314
Token the
Token ID262
Feature activation+5.90e-02

Pos logit contributions
phe+1.258
rophe+1.101
ths+1.002
riad+0.981
ient+0.971
Neg logit contributions
 Probe-1.189
 Nish-1.119
ersen-1.080
 COURT-1.008
 Rik-1.008
Token Tampa
Token ID15528
Feature activation+2.89e+00

Pos logit contributions
phe+0.133
rophe+0.118
ths+0.108
riad+0.106
ient+0.105
Neg logit contributions
 Probe-0.105
 Nish-0.099
ersen-0.095
 Rik-0.088
 COURT-0.088
Token Bay
Token ID4696
Feature activation+1.23e+00

Pos logit contributions
phe+4.203
rophe+3.569
ths+3.214
riad+2.942
ient+2.940
Neg logit contributions
 Probe-7.182
 Nish-6.918
ersen-6.737
 COURT-6.403
 Rik-6.367
Token Lightning
Token ID12469
Feature activation+7.11e-01

Pos logit contributions
phe+2.222
rophe+1.910
ths+1.713
riad+1.671
ient+1.639
Neg logit contributions
 Probe-2.730
 Nish-2.603
ersen-2.520
 COURT-2.368
 Rik-2.367
Token,
Token ID11
Feature activation-5.11e-02

Pos logit contributions
phe+1.535
rophe+1.356
ths+1.233
riad+1.211
ient+1.193
Neg logit contributions
 Probe-1.333
 Nish-1.257
ersen-1.206
 Rik-1.127
 COURT-1.122
Token prevailing
Token ID26602
Feature activation-2.56e-01

Pos logit contributions
 Probe+0.092
 Nish+0.086
ersen+0.083
 Rik+0.077
 COURT+0.076
Neg logit contributions
phe-0.115
rophe-0.102
ths-0.093
riad-0.092
ient-0.090
Token through
Token ID832
Feature activation-1.37e-02

Pos logit contributions
 Probe+0.457
 Nish+0.430
ersen+0.410
 Rik+0.383
 NP+0.380
Neg logit contributions
phe-0.580
rophe-0.516
ths-0.468
riad-0.464
ient-0.455
Token45
Token ID2231
Feature activation-6.08e-01

Pos logit contributions
phe+1.859
rophe+1.661
ths+1.528
riad+1.506
ient+1.486
Neg logit contributions
 Probe-1.220
 Nish-1.136
ersen-1.096
 Rik-0.994
 Trafford-0.989
Token 366
Token ID44856
Feature activation+7.89e-01

Pos logit contributions
 Probe+0.974
 Nish+0.912
ersen+0.856
 Rik+0.799
 NP+0.798
Neg logit contributions
phe-1.484
rophe-1.333
ths-1.221
riad-1.208
ient-1.190
Token.
Token ID13
Feature activation-1.91e-01

Pos logit contributions
phe+1.674
rophe+1.476
ths+1.336
riad+1.316
ient+1.296
Neg logit contributions
 Probe-1.515
 Nish-1.429
ersen-1.375
 Rik-1.284
 COURT-1.276
Token
Token ID198
Feature activation-4.15e-01

Pos logit contributions
 Probe+0.288
 Nish+0.267
ersen+0.250
 Rik+0.232
 COURT+0.231
Neg logit contributions
phe-0.486
rophe-0.438
ths-0.404
riad-0.401
ient-0.396
Token
Token ID198
Feature activation-3.68e-01

Pos logit contributions
 Probe+0.697
 Nish+0.651
ersen+0.621
 Rik+0.575
 COURT+0.570
Neg logit contributions
phe-0.985
rophe-0.879
ths-0.809
riad-0.796
ient-0.785
TokenTopics
Token ID25902
Feature activation-3.32e+00

Pos logit contributions
 Probe+0.615
 Nish+0.577
ersen+0.553
 Rik+0.513
 COURT+0.508
Neg logit contributions
phe-0.858
rophe-0.768
ths-0.699
riad-0.695
ient-0.681
Token:
Token ID25
Feature activation-1.13e+00

Pos logit contributions
 Probe+5.520
 Nish+5.106
ersen+4.851
 NP+4.553
 COURT+4.542
Neg logit contributions
phe-7.733
rophe-6.934
 tremend-6.415
ths-6.226
riad-6.214
Token community
Token ID2055
Feature activation-5.38e-01

Pos logit contributions
 Probe+2.036
 Nish+1.945
ersen+1.850
 COURT+1.729
 NP+1.726
Neg logit contributions
phe-2.511
rophe-2.218
ths-2.007
riad-2.000
ovie-1.957
Token-
Token ID12
Feature activation-7.57e-02

Pos logit contributions
 Probe+0.894
 Nish+0.834
ersen+0.792
 Rik+0.735
 COURT+0.730
Neg logit contributions
phe-1.281
rophe-1.148
ths-1.052
riad-1.039
ient-1.027
Tokenand
Token ID392
Feature activation-9.44e-02

Pos logit contributions
 Probe+0.184
 Nish+0.177
ersen+0.171
 Rik+0.162
 NP+0.162
Neg logit contributions
phe-0.121
rophe-0.102
ths-0.088
riad-0.087
ient-0.084
Token-
Token ID12
Feature activation+1.36e-01

Pos logit contributions
 Probe+0.162
 Nish+0.152
ersen+0.143
 Rik+0.134
 COURT+0.134
Neg logit contributions
phe-0.220
rophe-0.197
ths-0.181
riad-0.178
ient-0.176
Token into
Token ID656
Feature activation-3.77e-01

Pos logit contributions
phe+1.097
rophe+0.971
ths+0.886
riad+0.870
ient+0.863
Neg logit contributions
 Probe-0.905
 Nish-0.849
ersen-0.813
 Rik-0.758
 COURT-0.754
Token them
Token ID606
Feature activation+4.53e-01

Pos logit contributions
 Probe+0.733
 Nish+0.696
ersen+0.668
 Rik+0.628
 NP+0.623
Neg logit contributions
phe-0.792
rophe-0.698
ths-0.626
riad-0.620
ient-0.607
Token".
Token ID1911
Feature activation+3.16e-01

Pos logit contributions
phe+0.966
rophe+0.853
ths+0.770
riad+0.761
ient+0.746
Neg logit contributions
 Probe-0.866
 Nish-0.817
ersen-0.784
 Rik-0.735
 COURT-0.730
Token
Token ID198
Feature activation-4.97e-01

Pos logit contributions
phe+0.752
rophe+0.671
ths+0.616
riad+0.608
ient+0.599
Neg logit contributions
 Probe-0.525
 Nish-0.492
ersen-0.468
 Rik-0.434
 COURT-0.431
Token
Token ID198
Feature activation-3.96e-01

Pos logit contributions
 Probe+0.818
 Nish+0.764
ersen+0.727
 Rik+0.673
 COURT+0.668
Neg logit contributions
phe-1.193
rophe-1.068
ths-0.979
riad-0.967
ient-0.953
TokenTopics
Token ID25902
Feature activation-3.27e+00

Pos logit contributions
 Probe+0.742
 Nish+0.701
ersen+0.677
 Rik+0.631
 NP+0.627
Neg logit contributions
phe-0.845
rophe-0.748
ths-0.672
riad-0.665
ient-0.653
Token:
Token ID25
Feature activation-1.11e+00

Pos logit contributions
 Probe+5.440
 Nish+5.043
ersen+4.779
 NP+4.500
 Rik+4.473
Neg logit contributions
phe-7.607
rophe-6.822
 tremend-6.340
ths-6.118
riad-6.107
Token marine
Token ID16050
Feature activation-1.27e+00

Pos logit contributions
 Probe+1.842
 Nish+1.743
ersen+1.655
 NP+1.536
 Trafford+1.529
Neg logit contributions
phe-2.627
rophe-2.337
ths-2.130
riad-2.122
ovie-2.071
Token-
Token ID12
Feature activation+1.24e-01

Pos logit contributions
 Probe+1.847
 Nish+1.718
ersen+1.680
 NP+1.492
 Trafford+1.488
Neg logit contributions
phe-3.215
rophe-2.914
ths-2.676
riad-2.665
 kidding-2.584
Tokenbiology
Token ID43592
Feature activation-1.58e+00

Pos logit contributions
phe+0.222
rophe+0.191
ths+0.168
riad+0.166
ient+0.161
Neg logit contributions
 Probe-0.276
 Nish-0.263
ersen-0.255
 NP-0.241
 Rik-0.240
Token,
Token ID11
Feature activation-2.05e-01

Pos logit contributions
 Probe+2.780
 Nish+2.603
ersen+2.495
 NP+2.313
 Rik+2.311
Neg logit contributions
phe-3.615
rophe-3.205
ths-2.888
riad-2.888
ient-2.820
Token News
Token ID3000
Feature activation+4.88e-01

Pos logit contributions
 Probe+0.303
 Nish+0.284
ersen+0.272
 Rik+0.253
 COURT+0.251
Neg logit contributions
phe-0.386
rophe-0.343
ths-0.312
riad-0.309
ient-0.304
Token 24
Token ID1987
Feature activation+7.03e-01

Pos logit contributions
phe+1.092
rophe+0.971
ths+0.885
riad+0.874
ient+0.860
Neg logit contributions
 Probe-0.874
 Nish-0.824
ersen-0.791
 Rik-0.733
 COURT-0.730
Token.
Token ID13
Feature activation+1.02e-01

Pos logit contributions
phe+1.528
rophe+1.347
ths+1.234
riad+1.211
ient+1.193
Neg logit contributions
 Probe-1.306
 Nish-1.234
ersen-1.187
 Rik-1.099
 COURT-1.096
Token
Token ID198
Feature activation-1.07e-01

Pos logit contributions
phe+0.253
rophe+0.227
ths+0.209
riad+0.206
ient+0.203
Neg logit contributions
 Probe-0.159
 Nish-0.148
ersen-0.141
 Rik-0.129
 COURT-0.128
Token
Token ID198
Feature activation-4.63e-02

Pos logit contributions
 Probe+0.184
 Nish+0.172
ersen+0.165
 Rik+0.153
 Trafford+0.152
Neg logit contributions
phe-0.247
rophe-0.220
ths-0.203
riad-0.199
ient-0.196
TokenTopics
Token ID25902
Feature activation-3.07e+00

Pos logit contributions
 Probe+0.078
 Nish+0.073
ersen+0.070
 Rik+0.065
 COURT+0.064
Neg logit contributions
phe-0.109
rophe-0.097
ths-0.089
riad-0.088
ient-0.086
Token:
Token ID25
Feature activation-9.22e-01

Pos logit contributions
 Probe+5.175
 Nish+4.792
ersen+4.571
 Rik+4.262
 NP+4.255
Neg logit contributions
phe-7.142
rophe-6.373
 tremend-5.864
riad-5.725
ths-5.725
Token beef
Token ID12023
Feature activation-1.23e+00

Pos logit contributions
 Probe+1.649
 Nish+1.555
ersen+1.483
 NP+1.383
 Rik+1.383
Neg logit contributions
phe-2.071
rophe-1.839
ths-1.667
riad-1.653
ient-1.620
Token-
Token ID12
Feature activation+2.16e-01

Pos logit contributions
 Probe+2.019
 Nish+1.875
ersen+1.807
 Rik+1.653
ega+1.650
Neg logit contributions
phe-2.908
rophe-2.598
ths-2.363
riad-2.362
ovie-2.309
Tokenc
Token ID66
Feature activation+2.51e-01

Pos logit contributions
phe+0.423
rophe+0.369
ths+0.330
riad+0.325
ient+0.319
Neg logit contributions
 Probe-0.450
 Nish-0.426
ersen-0.412
 Rik-0.387
 NP-0.386
Tokenattle
Token ID1999
Feature activation+4.33e-01

Pos logit contributions
phe+0.607
rophe+0.544
ths+0.499
riad+0.493
ient+0.486
Neg logit contributions
 Probe-0.408
 Nish-0.381
ersen-0.361
 Rik-0.335
 COURT-0.332
Token 41
Token ID6073
Feature activation-4.72e-01

Pos logit contributions
 Probe+0.712
 Nish+0.668
ersen+0.636
 Rik+0.592
 COURT+0.588
Neg logit contributions
phe-0.947
rophe-0.844
ths-0.769
riad-0.761
ient-0.749
Token-
Token ID12
Feature activation+1.31e-01

Pos logit contributions
 Probe+0.852
 Nish+0.801
ersen+0.766
 Rik+0.715
 COURT+0.710
Neg logit contributions
phe-1.057
rophe-0.939
ths-0.853
riad-0.843
ient-0.830
Tokenyear
Token ID1941
Feature activation-4.72e-01

Pos logit contributions
phe+0.295
rophe+0.263
ths+0.239
riad+0.236
ient+0.232
Neg logit contributions
 Probe-0.235
 Nish-0.220
ersen-0.210
 Rik-0.196
 COURT-0.195
Token-
Token ID12
Feature activation-2.36e-02

Pos logit contributions
 Probe+0.809
 Nish+0.760
ersen+0.732
 Rik+0.674
 COURT+0.671
Neg logit contributions
phe-1.092
rophe-0.975
ths-0.893
riad-0.879
ient-0.866
Tokenold
Token ID727
Feature activation-3.34e-01

Pos logit contributions
 Probe+0.029
 Nish+0.027
ersen+0.024
 Rik+0.022
 COURT+0.022
Neg logit contributions
phe-0.067
rophe-0.061
ths-0.057
riad-0.056
ient-0.056
Token Herm
Token ID18113
Feature activation-3.03e+00

Pos logit contributions
 Probe+0.608
 Nish+0.572
ersen+0.547
 Rik+0.511
 COURT+0.507
Neg logit contributions
phe-0.743
rophe-0.659
ths-0.599
riad-0.591
ient-0.582
Tokenosa
Token ID8546
Feature activation-1.04e+00

Pos logit contributions
 Probe+4.791
 Nish+4.526
ersen+4.373
 Rik+4.018
 NP+3.954
Neg logit contributions
phe-7.066
rophe-6.393
ths-5.825
 kidding-5.736
riad-5.731
Token Beach
Token ID8511
Feature activation-5.70e-01

Pos logit contributions
 Probe+1.571
 Nish+1.451
ersen+1.363
 Rik+1.260
 COURT+1.253
Neg logit contributions
phe-2.654
rophe-2.383
ths-2.201
riad-2.184
ient-2.133
Token Karen
Token ID18678
Feature activation-4.02e-01

Pos logit contributions
 Probe+1.070
 Nish+1.005
ersen+0.958
 COURT+0.900
 Rik+0.897
Neg logit contributions
phe-1.245
rophe-1.100
ths-0.993
riad-0.984
ient-0.967
Token K
Token ID509
Feature activation-1.01e+00

Pos logit contributions
 Probe+0.709
 Nish+0.666
ersen+0.634
 Rik+0.591
 COURT+0.587
Neg logit contributions
phe-0.919
rophe-0.818
ths-0.744
riad-0.737
ient-0.725
Tokenla
Token ID5031
Feature activation+8.65e-01

Pos logit contributions
 Probe+1.992
 Nish+1.890
ersen+1.833
 Rik+1.703
 COURT+1.686
Neg logit contributions
phe-2.075
rophe-1.802
ths-1.637
riad-1.609
 kidding-1.579
Token Britain
Token ID5491
Feature activation+4.91e-01

Pos logit contributions
phe+0.686
rophe+0.606
ths+0.547
riad+0.541
ient+0.532
Neg logit contributions
 Probe-0.613
 Nish-0.578
ersen-0.555
 Rik-0.519
 COURT-0.516
Token to
Token ID284
Feature activation-5.26e-02

Pos logit contributions
phe+1.126
rophe+1.002
ths+0.912
riad+0.903
ient+0.889
Neg logit contributions
 Probe-0.864
 Nish-0.811
ersen-0.773
 Rik-0.722
 COURT-0.715
Token issue
Token ID2071
Feature activation+5.17e-01

Pos logit contributions
 Probe+0.107
 Nish+0.102
ersen+0.098
 Rik+0.092
 COURT+0.091
Neg logit contributions
phe-0.105
rophe-0.092
ths-0.082
riad-0.081
ient-0.080
Token a
Token ID257
Feature activation+2.14e-02

Pos logit contributions
phe+1.123
rophe+0.995
ths+0.905
riad+0.892
ient+0.876
Neg logit contributions
 Probe-0.962
 Nish-0.903
ersen-0.869
 Rik-0.809
 COURT-0.805
Token su
Token ID424
Feature activation-4.46e-02

Pos logit contributions
phe+0.050
rophe+0.044
ths+0.041
riad+0.040
ient+0.039
Neg logit contributions
 Probe-0.036
 Nish-0.034
ersen-0.033
 Rik-0.030
 COURT-0.030
Tokenk
Token ID74
Feature activation-2.96e+00

Pos logit contributions
 Probe+0.080
 Nish+0.075
ersen+0.071
 Rik+0.067
 COURT+0.066
Neg logit contributions
phe-0.101
rophe-0.090
ths-0.082
riad-0.081
ient-0.079
Tokenuk
Token ID2724
Feature activation-1.16e+00

Pos logit contributions
 Probe+5.554
 Nish+5.488
ersen+5.144
 Rik+4.921
 NP+4.912
Neg logit contributions
phe-5.941
rophe-5.169
ths-4.583
 kidding-4.515
ovie-4.497
Token,
Token ID837
Feature activation+2.59e-01

Pos logit contributions
 Probe+2.125
 Nish+2.051
ersen+1.948
 Rik+1.831
 NP+1.810
Neg logit contributions
phe-2.538
rophe-2.236
ths-2.010
riad-1.995
ient-1.964
Token a
Token ID257
Feature activation+2.13e-02

Pos logit contributions
phe+0.540
rophe+0.475
ths+0.429
riad+0.423
ient+0.415
Neg logit contributions
 Probe-0.508
 Nish-0.480
ersen-0.461
 Rik-0.433
 COURT-0.430
Token form
Token ID1296
Feature activation-1.26e-01

Pos logit contributions
phe+0.048
rophe+0.042
ths+0.039
riad+0.038
ient+0.037
Neg logit contributions
 Probe-0.038
 Nish-0.036
ersen-0.035
 Rik-0.032
 COURT-0.032
Token of
Token ID286
Feature activation+2.22e-01

Pos logit contributions
 Probe+0.202
 Nish+0.190
ersen+0.180
 Rik+0.167
 NP+0.165
Neg logit contributions
phe-0.305
rophe-0.274
ths-0.251
riad-0.249
ient-0.244
Token'."
Token ID30827
Feature activation+5.05e-01

Pos logit contributions
 Probe+2.578
 Nish+2.460
ersen+2.320
 Rik+2.208
 NP+2.177
Neg logit contributions
phe-3.012
rophe-2.678
ths-2.413
riad-2.412
ient-2.349
Token
Token ID198
Feature activation-4.24e-01

Pos logit contributions
phe+1.222
rophe+1.094
ths+1.003
riad+0.992
ient+0.978
Neg logit contributions
 Probe-0.818
 Nish-0.763
ersen-0.728
 Rik-0.672
 COURT-0.667
Token
Token ID198
Feature activation-3.36e-01

Pos logit contributions
 Probe+0.709
 Nish+0.663
ersen+0.633
 Rik+0.586
 COURT+0.581
Neg logit contributions
phe-1.004
rophe-0.897
ths-0.825
riad-0.813
ient-0.800
TokenHe
Token ID1544
Feature activation-3.07e-01

Pos logit contributions
 Probe+0.629
 Nish+0.596
ersen+0.568
 Rik+0.533
 COURT+0.529
Neg logit contributions
phe-0.721
rophe-0.641
ths-0.575
riad-0.571
ient-0.561
Token then
Token ID788
Feature activation-1.13e+00

Pos logit contributions
 Probe+0.577
 Nish+0.544
ersen+0.522
 Rik+0.488
 NP+0.484
Neg logit contributions
phe-0.664
rophe-0.587
ths-0.529
riad-0.525
ient-0.515
Token asks
Token ID7893
Feature activation-2.93e+00

Pos logit contributions
 Probe+2.187
 Nish+2.072
ersen+1.986
 Rik+1.867
 NP+1.853
Neg logit contributions
phe-2.373
rophe-2.090
ths-1.869
riad-1.861
ient-1.820
Token:
Token ID25
Feature activation-1.67e+00

Pos logit contributions
 Probe+5.363
 Nish+5.226
ersen+4.863
 Rik+4.569
 NP+4.542
Neg logit contributions
phe-6.313
rophe-5.571
riad-5.083
ovie-4.975
ths-4.961
Token "
Token ID366
Feature activation-3.13e-01

Pos logit contributions
 Probe+2.640
 Nish+2.491
ersen+2.332
 Rik+2.170
 COURT+2.137
Neg logit contributions
phe-4.033
rophe-3.615
riad-3.320
ths-3.305
ient-3.258
TokenIs
Token ID3792
Feature activation-2.59e-02

Pos logit contributions
 Probe+0.599
 Nish+0.566
ersen+0.541
 Rik+0.508
 NP+0.505
Neg logit contributions
phe-0.665
rophe-0.590
ths-0.530
riad-0.524
ient-0.516
Token there
Token ID612
Feature activation-2.84e-01

Pos logit contributions
 Probe+0.052
 Nish+0.049
ersen+0.047
 Rik+0.044
 NP+0.044
Neg logit contributions
phe-0.053
rophe-0.047
ths-0.042
riad-0.041
ient-0.041
Token really
Token ID1107
Feature activation-2.29e-01

Pos logit contributions
 Probe+0.606
 Nish+0.577
ersen+0.554
 Rik+0.524
 NP+0.521
Neg logit contributions
phe-0.544
rophe-0.471
ths-0.419
riad-0.414
ient-0.406
Token
Token ID198
Feature activation+2.93e-01

Pos logit contributions
phe+0.757
rophe+0.671
ths+0.621
riad+0.608
ient+0.597
Neg logit contributions
 Probe-0.587
 Nish-0.545
ersen-0.526
 Rik-0.486
 Trafford-0.485
TokenRec
Token ID6690
Feature activation+9.89e-01

Pos logit contributions
phe+0.661
rophe+0.587
ths+0.534
riad+0.528
ient+0.520
Neg logit contributions
 Probe-0.525
 Nish-0.494
ersen-0.472
 Rik-0.440
 COURT-0.437
Tokenalling
Token ID9221
Feature activation+1.24e-01

Pos logit contributions
phe+2.217
rophe+1.971
ths+1.791
riad+1.771
ient+1.746
Neg logit contributions
 Probe-1.779
 Nish-1.675
ersen-1.597
 Rik-1.493
 COURT-1.482
Token that
Token ID326
Feature activation+3.27e-01

Pos logit contributions
phe+0.254
rophe+0.222
ths+0.200
riad+0.197
ient+0.194
Neg logit contributions
 Probe-0.248
 Nish-0.235
ersen-0.226
 Rik-0.213
 NP-0.211
Token,
Token ID11
Feature activation-1.92e-01

Pos logit contributions
phe+0.691
rophe+0.609
ths+0.549
riad+0.543
ient+0.535
Neg logit contributions
 Probe-0.629
 Nish-0.593
ersen-0.570
 Rik-0.533
 COURT-0.532
Token Ik
Token ID32840
Feature activation-2.91e+00

Pos logit contributions
 Probe+0.332
 Nish+0.313
ersen+0.297
 Rik+0.278
 NP+0.274
Neg logit contributions
phe-0.444
rophe-0.396
ths-0.361
riad-0.357
ient-0.352
Tokenki
Token ID4106
Feature activation-3.08e-01

Pos logit contributions
 Probe+4.645
 Nish+4.517
ersen+4.244
 Rik+3.979
ega+3.887
Neg logit contributions
phe-6.716
rophe-6.137
ths-5.659
 kidding-5.433
ient-5.415
Token changed
Token ID3421
Feature activation+3.33e-01

Pos logit contributions
 Probe+0.591
 Nish+0.558
ersen+0.534
 Rik+0.502
 COURT+0.497
Neg logit contributions
phe-0.657
rophe-0.580
ths-0.524
riad-0.517
ient-0.508
Token his
Token ID465
Feature activation-4.49e-01

Pos logit contributions
phe+0.712
rophe+0.628
ths+0.567
riad+0.560
ient+0.550
Neg logit contributions
 Probe-0.635
 Nish-0.600
ersen-0.575
 Rik-0.540
 NP-0.535
Token plan
Token ID1410
Feature activation+1.07e+00

Pos logit contributions
 Probe+0.908
 Nish+0.863
ersen+0.825
 Rik+0.781
 NP+0.776
Neg logit contributions
phe-0.911
rophe-0.799
ths-0.714
riad-0.703
ient-0.692
Token immediately
Token ID3393
Feature activation+6.83e-01

Pos logit contributions
phe+2.245
rophe+1.977
ths+1.789
riad+1.765
ient+1.739
Neg logit contributions
 Probe-2.083
 Nish-1.962
ersen-1.893
 Rik-1.764
 COURT-1.759
Token differences
Token ID5400
Feature activation+5.47e-01

Pos logit contributions
 Probe+0.253
 Nish+0.238
ersen+0.229
 Rik+0.214
 COURT+0.213
Neg logit contributions
phe-0.282
rophe-0.249
ths-0.225
riad-0.222
ient-0.218
Token between
Token ID1022
Feature activation-7.17e-01

Pos logit contributions
phe+1.198
rophe+1.059
ths+0.959
riad+0.948
ient+0.931
Neg logit contributions
 Probe-1.017
 Nish-0.958
ersen-0.916
 Rik-0.859
 NP-0.852
Token the
Token ID262
Feature activation-1.50e-01

Pos logit contributions
 Probe+1.373
 Nish+1.308
ersen+1.232
 Rik+1.173
 NP+1.171
Neg logit contributions
phe-1.527
rophe-1.344
ths-1.196
riad-1.191
ient-1.172
Token biology
Token ID17219
Feature activation-9.75e-01

Pos logit contributions
 Probe+0.330
 Nish+0.314
ersen+0.302
 Rik+0.286
 NP+0.285
Neg logit contributions
phe-0.278
rophe-0.241
ths-0.213
riad-0.210
ient-0.206
Token and
Token ID290
Feature activation+8.94e-02

Pos logit contributions
 Probe+1.548
 Nish+1.469
ersen+1.365
 Rik+1.291
 NP+1.285
Neg logit contributions
phe-2.346
rophe-2.108
ths-1.905
ovie-1.890
riad-1.890
Token sociology
Token ID39625
Feature activation-2.91e+00

Pos logit contributions
phe+0.192
rophe+0.170
ths+0.153
riad+0.151
ient+0.149
Neg logit contributions
 Probe-0.170
 Nish-0.160
ersen-0.153
 Rik-0.144
 COURT-0.143
Token of
Token ID286
Feature activation-4.50e-01

Pos logit contributions
 Probe+3.939
 Nish+3.800
ersen+3.460
 NP+3.325
 Rik+3.289
Neg logit contributions
phe-7.327
rophe-6.617
ovie-6.067
riad-5.978
ient-5.918
Token race
Token ID3234
Feature activation-5.35e-01

Pos logit contributions
 Probe+0.852
 Nish+0.812
ersen+0.766
 Rik+0.727
 NP+0.725
Neg logit contributions
phe-0.967
rophe-0.851
ths-0.761
riad-0.756
ient-0.747
Token and
Token ID290
Feature activation-1.02e-01

Pos logit contributions
 Probe+0.958
 Nish+0.905
ersen+0.852
 Rik+0.813
 NP+0.811
Neg logit contributions
phe-1.188
rophe-1.057
riad-0.954
ths-0.951
ovie-0.937
Token gender
Token ID5279
Feature activation-1.93e-02

Pos logit contributions
 Probe+0.170
 Nish+0.159
ersen+0.151
 Rik+0.141
 NP+0.140
Neg logit contributions
phe-0.242
rophe-0.216
ths-0.197
riad-0.196
ient-0.192
Token to
Token ID284
Feature activation-1.53e-01

Pos logit contributions
 Probe+0.036
 Nish+0.034
ersen+0.032
 Rik+0.031
 NP+0.030
Neg logit contributions
phe-0.042
rophe-0.037
ths-0.033
riad-0.033
ient-0.033
Token Fel
Token ID13937
Feature activation-4.41e-01

Pos logit contributions
 Probe+1.529
 Nish+1.448
ersen+1.326
 Rik+1.291
 NP+1.251
Neg logit contributions
phe-2.443
rophe-2.205
ths-2.053
riad-2.000
 kidding-1.984
Tokenton
Token ID1122
Feature activation-8.40e-01

Pos logit contributions
 Probe+0.783
 Nish+0.737
ersen+0.715
 COURT+0.660
ega+0.655
Neg logit contributions
phe-0.983
rophe-0.888
ths-0.788
riad-0.786
 kidding-0.781
Token,
Token ID11
Feature activation+1.18e-03

Pos logit contributions
 Probe+1.476
 Nish+1.391
ersen+1.339
 Rik+1.242
 NP+1.231
Neg logit contributions
phe-1.905
rophe-1.700
ths-1.539
riad-1.538
ient-1.499
Token D
Token ID360
Feature activation-4.77e-02

Pos logit contributions
phe+0.003
rophe+0.002
ths+0.002
riad+0.002
ient+0.002
Neg logit contributions
 Probe-0.002
 Nish-0.002
ersen-0.002
 Rik-0.002
 Trafford-0.002
Tokenarian
Token ID3699
Feature activation+1.93e-01

Pos logit contributions
 Probe+0.087
 Nish+0.082
ersen+0.079
 Rik+0.073
 COURT+0.073
Neg logit contributions
phe-0.106
rophe-0.093
ths-0.085
riad-0.084
ient-0.082
Token Owens
Token ID31005
Feature activation-2.89e+00

Pos logit contributions
phe+0.538
rophe+0.489
ths+0.454
riad+0.450
ient+0.445
Neg logit contributions
 Probe-0.244
 Nish-0.223
ersen-0.209
 Rik-0.188
 COURT-0.186
Token,
Token ID11
Feature activation-9.36e-02

Pos logit contributions
 Probe+4.608
 Nish+4.396
ersen+4.227
 NP+3.856
ega+3.826
Neg logit contributions
phe-6.726
rophe-5.988
riad-5.539
ths-5.502
ient-5.385
Token and
Token ID290
Feature activation+3.04e-02

Pos logit contributions
 Probe+0.164
 Nish+0.154
ersen+0.146
 Rik+0.137
 Trafford+0.135
Neg logit contributions
phe-0.215
rophe-0.192
ths-0.174
riad-0.173
ient-0.170
Token now
Token ID783
Feature activation-1.16e+00

Pos logit contributions
phe+0.071
rophe+0.064
ths+0.058
riad+0.057
ient+0.057
Neg logit contributions
 Probe-0.052
 Nish-0.049
ersen-0.046
 Rik-0.043
 COURT-0.043
Token Allow
Token ID22507
Feature activation-4.79e-01

Pos logit contributions
 Probe+1.984
 Nish+1.880
ersen+1.779
 Rik+1.678
 NP+1.648
Neg logit contributions
phe-2.676
rophe-2.401
riad-2.159
ths-2.155
ient-2.119
Tokenay
Token ID323
Feature activation+3.53e-03

Pos logit contributions
 Probe+1.236
 Nish+1.185
ersen+1.152
 Rik+1.095
 Trafford+1.093
Neg logit contributions
phe-0.699
rophe-0.578
ths-0.491
riad-0.478
ient-0.465
Token be
Token ID307
Feature activation-4.65e-01

Pos logit contributions
 Probe+0.663
 Nish+0.628
ersen+0.604
 Rik+0.570
 COURT+0.566
Neg logit contributions
phe-0.632
rophe-0.552
ths-0.494
riad-0.487
ient-0.478
Token available
Token ID1695
Feature activation-2.51e-01

Pos logit contributions
 Probe+0.882
 Nish+0.831
ersen+0.797
 Rik+0.746
 COURT+0.741
Neg logit contributions
phe-0.999
rophe-0.882
ths-0.798
riad-0.788
ient-0.775
Token only
Token ID691
Feature activation+2.41e-02

Pos logit contributions
 Probe+0.430
 Nish+0.403
ersen+0.384
 Rik+0.358
 NP+0.355
Neg logit contributions
phe-0.584
rophe-0.520
ths-0.474
riad-0.470
ient-0.460
Token at
Token ID379
Feature activation+2.01e-01

Pos logit contributions
phe+0.055
rophe+0.049
ths+0.045
riad+0.044
ient+0.043
Neg logit contributions
 Probe-0.042
 Nish-0.040
ersen-0.038
 Rik-0.036
 COURT-0.035
Token a
Token ID257
Feature activation+7.72e-02

Pos logit contributions
phe+0.399
rophe+0.348
ths+0.312
riad+0.308
ient+0.302
Neg logit contributions
 Probe-0.414
 Nish-0.392
ersen-0.378
 Rik-0.355
 COURT-0.354
Token laboratory
Token ID14010
Feature activation-2.85e+00

Pos logit contributions
phe+0.165
rophe+0.145
ths+0.132
riad+0.130
ient+0.128
Neg logit contributions
 Probe-0.147
 Nish-0.139
ersen-0.133
 Rik-0.124
 COURT-0.124
Token in
Token ID287
Feature activation-6.86e-02

Pos logit contributions
 Probe+4.825
 Nish+4.571
ersen+4.263
 Rik+4.104
 NP+4.102
Neg logit contributions
phe-6.465
rophe-5.628
ths-5.266
ovie-5.178
riad-5.108
Token San
Token ID2986
Feature activation+1.32e+00

Pos logit contributions
 Probe+0.116
 Nish+0.108
ersen+0.103
 Rik+0.096
 NP+0.095
Neg logit contributions
phe-0.162
rophe-0.145
ths-0.132
riad-0.131
ient-0.129
Token Juan
Token ID16852
Feature activation-1.04e+00

Pos logit contributions
phe+2.753
rophe+2.413
ths+2.191
riad+2.159
ient+2.136
Neg logit contributions
 Probe-2.578
 Nish-2.431
ersen-2.343
 COURT-2.188
 NP-2.185
Token Cap
Token ID4476
Feature activation-7.45e-01

Pos logit contributions
 Probe+1.887
 Nish+1.745
ersen+1.668
 NP+1.585
 Rik+1.572
Neg logit contributions
phe-2.318
rophe-2.064
ths-1.890
riad-1.868
ovie-1.827
Tokenist
Token ID396
Feature activation+7.38e-01

Pos logit contributions
 Probe+1.313
 Nish+1.235
ersen+1.189
 COURT+1.107
ega+1.104
Neg logit contributions
phe-1.691
rophe-1.479
ths-1.359
riad-1.343
 kidding-1.312
Token
Token ID247
Feature activation-3.97e-02

Pos logit contributions
phe+0.078
rophe+0.069
ths+0.062
riad+0.061
ient+0.060
Neg logit contributions
 Probe-0.074
 Nish-0.070
ersen-0.067
 Rik-0.063
 Trafford-0.063
Token,
Token ID11
Feature activation+1.38e-01

Pos logit contributions
 Probe+0.068
 Nish+0.064
ersen+0.061
 Rik+0.056
 COURT+0.056
Neg logit contributions
phe-0.092
rophe-0.082
ths-0.075
riad-0.075
ient-0.073
Token
Token ID564
Feature activation+3.63e-01

Pos logit contributions
phe+0.311
rophe+0.276
ths+0.251
riad+0.249
ient+0.245
Neg logit contributions
 Probe-0.248
 Nish-0.233
ersen-0.222
 Rik-0.208
 COURT-0.206
Token
Token ID246
Feature activation-7.52e-02

Pos logit contributions
phe+0.790
rophe+0.701
ths+0.634
riad+0.626
ient+0.616
Neg logit contributions
 Probe-0.676
 Nish-0.637
ersen-0.609
 Rik-0.571
 COURT-0.567
TokenLost
Token ID31042
Feature activation+1.56e-01

Pos logit contributions
 Probe+0.145
 Nish+0.137
ersen+0.131
 Rik+0.124
 NP+0.122
Neg logit contributions
phe-0.158
rophe-0.140
ths-0.126
riad-0.124
ient-0.122
Token Woods
Token ID16483
Feature activation-2.84e+00

Pos logit contributions
phe+0.407
rophe+0.368
ths+0.340
riad+0.336
ient+0.332
Neg logit contributions
 Probe-0.223
 Nish-0.206
ersen-0.195
 Rik-0.178
 COURT-0.177
Token
Token ID447
Feature activation+8.19e-02

Pos logit contributions
 Probe+4.764
 Nish+4.497
ersen+4.296
 NP+4.057
 COURT+4.022
Neg logit contributions
phe-6.265
rophe-5.567
ths-5.078
riad-5.052
ient-4.989
Token
Token ID247
Feature activation+7.16e-04

Pos logit contributions
phe+0.167
rophe+0.148
ths+0.133
riad+0.131
ient+0.129
Neg logit contributions
 Probe-0.164
 Nish-0.154
ersen-0.148
 Rik-0.139
 Trafford-0.139
Token)
Token ID8
Feature activation+2.50e-01

Pos logit contributions
phe+0.002
rophe+0.002
ths+0.001
riad+0.001
ient+0.001
Neg logit contributions
 Probe-0.001
 Nish-0.001
ersen-0.001
 Rik-0.001
 COURT-0.001
Token are
Token ID389
Feature activation-1.16e-01

Pos logit contributions
phe+0.514
rophe+0.451
ths+0.406
riad+0.401
ient+0.393
Neg logit contributions
 Probe-0.498
 Nish-0.470
ersen-0.452
 Rik-0.425
 COURT-0.423
Token classics
Token ID29039
Feature activation-1.02e+00

Pos logit contributions
 Probe+0.229
 Nish+0.217
ersen+0.208
 Rik+0.196
 NP+0.195
Neg logit contributions
phe-0.238
rophe-0.209
ths-0.188
riad-0.185
ient-0.182
Token is
Token ID318
Feature activation-3.43e-01

Pos logit contributions
 Probe+0.259
 Nish+0.244
ersen+0.233
 Rik+0.218
 COURT+0.217
Neg logit contributions
phe-0.317
rophe-0.281
ths-0.254
riad-0.253
ient-0.248
Token a
Token ID257
Feature activation-3.42e-01

Pos logit contributions
 Probe+0.673
 Nish+0.637
ersen+0.608
 Rik+0.574
 COURT+0.571
Neg logit contributions
phe-0.715
rophe-0.629
ths-0.566
riad-0.559
ient-0.548
Token h
Token ID289
Feature activation+1.29e-01

Pos logit contributions
 Probe+0.637
 Nish+0.600
ersen+0.574
 Rik+0.537
 COURT+0.534
Neg logit contributions
phe-0.747
rophe-0.661
ths-0.599
riad-0.591
ient-0.582
Tokenul
Token ID377
Feature activation+3.83e-01

Pos logit contributions
phe+0.262
rophe+0.229
ths+0.207
riad+0.203
ient+0.198
Neg logit contributions
 Probe-0.259
 Nish-0.245
ersen-0.237
 Rik-0.221
 Trafford-0.220
Tokenking
Token ID3364
Feature activation-8.42e-01

Pos logit contributions
phe+0.905
rophe+0.810
ths+0.739
riad+0.730
ient+0.715
Neg logit contributions
 Probe-0.642
 Nish-0.602
ersen-0.574
 Rik-0.534
 COURT-0.527
Token postwar
Token ID44060
Feature activation-2.84e+00

Pos logit contributions
 Probe+1.550
 Nish+1.464
ersen+1.382
 Rik+1.300
 NP+1.294
Neg logit contributions
phe-1.850
rophe-1.642
ths-1.483
riad-1.464
ient-1.444
Token structure
Token ID4645
Feature activation-3.66e-01

Pos logit contributions
 Probe+5.088
 Nish+4.878
ersen+4.504
 Rik+4.254
 Trafford+4.234
Neg logit contributions
phe-6.117
rophe-5.467
ths-4.926
riad-4.854
udder-4.808
Token that
Token ID326
Feature activation-2.32e-01

Pos logit contributions
 Probe+0.686
 Nish+0.652
ersen+0.620
 Rik+0.583
 COURT+0.581
Neg logit contributions
phe-0.792
rophe-0.697
ths-0.631
riad-0.621
ient-0.611
Token j
Token ID474
Feature activation-5.03e-01

Pos logit contributions
 Probe+0.429
 Nish+0.405
ersen+0.386
 Rik+0.362
 COURT+0.359
Neg logit contributions
phe-0.509
rophe-0.452
ths-0.409
riad-0.404
ient-0.398
Tokenuts
Token ID5500
Feature activation-1.76e+00

Pos logit contributions
 Probe+0.747
 Nish+0.692
ersen+0.659
 Rik+0.600
 COURT+0.596
Neg logit contributions
phe-1.286
rophe-1.159
ths-1.067
riad-1.056
ient-1.043
Token into
Token ID656
Feature activation-5.13e-01

Pos logit contributions
 Probe+3.614
 Nish+3.458
ersen+3.263
 Rik+3.147
 NP+3.107
Neg logit contributions
phe-3.434
rophe-3.016
riad-2.637
ths-2.628
ovie-2.566
Token Hispanic
Token ID16949
Feature activation+4.82e-01

Pos logit contributions
phe+0.200
rophe+0.178
ths+0.164
riad+0.162
ient+0.159
Neg logit contributions
 Probe-0.143
 Nish-0.134
ersen-0.128
 Rik-0.118
 COURT-0.117
Token vote
Token ID3015
Feature activation+1.36e-01

Pos logit contributions
phe+1.018
rophe+0.898
ths+0.814
riad+0.802
ient+0.788
Neg logit contributions
 Probe-0.930
 Nish-0.878
ersen-0.843
 Rik-0.789
 Trafford-0.783
Token.
Token ID13
Feature activation-5.77e-03

Pos logit contributions
phe+0.290
rophe+0.255
ths+0.230
riad+0.229
ient+0.224
Neg logit contributions
 Probe-0.260
 Nish-0.245
ersen-0.235
 Rik-0.221
 COURT-0.220
Token
Token ID198
Feature activation-2.47e-01

Pos logit contributions
 Probe+0.009
 Nish+0.009
ersen+0.008
 Rik+0.008
 Trafford+0.008
Neg logit contributions
phe-0.014
rophe-0.012
ths-0.011
riad-0.011
ient-0.011
Token
Token ID198
Feature activation-1.86e-01

Pos logit contributions
 Probe+0.421
 Nish+0.394
ersen+0.377
 Rik+0.349
 Trafford+0.347
Neg logit contributions
phe-0.579
rophe-0.515
ths-0.475
riad-0.467
ient-0.460
TokenTopics
Token ID25902
Feature activation-2.83e+00

Pos logit contributions
 Probe+0.343
 Nish+0.323
ersen+0.310
 Rik+0.289
 NP+0.288
Neg logit contributions
phe-0.408
rophe-0.362
ths-0.327
riad-0.324
ient-0.318
Token:
Token ID25
Feature activation-8.71e-01

Pos logit contributions
 Probe+4.926
 Nish+4.536
ersen+4.237
 Rik+4.070
 NP+4.018
Neg logit contributions
phe-6.470
rophe-5.745
ths-5.205
riad-5.164
ient-5.050
Token Hispanic
Token ID16949
Feature activation-5.42e-02

Pos logit contributions
 Probe+1.464
 Nish+1.365
ersen+1.292
 Rik+1.204
 COURT+1.199
Neg logit contributions
phe-2.067
rophe-1.845
ths-1.682
riad-1.671
ient-1.639
Token/
Token ID14
Feature activation+5.08e-02

Pos logit contributions
 Probe+0.092
 Nish+0.086
ersen+0.082
 Rik+0.076
 COURT+0.076
Neg logit contributions
phe-0.127
rophe-0.113
ths-0.104
riad-0.102
ient-0.101
TokenLat
Token ID24220
Feature activation-7.01e-01

Pos logit contributions
phe+0.103
rophe+0.090
ths+0.081
riad+0.080
ient+0.078
Neg logit contributions
 Probe-0.103
 Nish-0.097
ersen-0.094
 Rik-0.088
 COURT-0.088
Tokenino
Token ID2879
Feature activation-7.41e-02

Pos logit contributions
 Probe+0.838
 Nish+0.761
ersen+0.710
 Rik+0.634
 COURT+0.627
Neg logit contributions
phe-1.995
rophe-1.819
ths-1.692
riad-1.674
ient-1.653
Token 22
Token ID2534
Feature activation+4.34e-02

Pos logit contributions
phe+1.130
rophe+1.010
ths+0.941
riad+0.914
ient+0.908
Neg logit contributions
 Probe-0.760
 Nish-0.709
ersen-0.688
 Rik-0.622
 COURT-0.620
Token.
Token ID13
Feature activation-3.54e-01

Pos logit contributions
phe+0.094
rophe+0.083
ths+0.075
riad+0.074
ient+0.073
Neg logit contributions
 Probe-0.082
 Nish-0.077
ersen-0.074
 Rik-0.069
 COURT-0.069
Token
Token ID198
Feature activation-3.86e-01

Pos logit contributions
 Probe+0.584
 Nish+0.545
ersen+0.514
 Rik+0.481
 COURT+0.476
Neg logit contributions
phe-0.853
rophe-0.764
ths-0.698
riad-0.692
ient-0.684
Token
Token ID198
Feature activation-3.00e-01

Pos logit contributions
 Probe+0.651
 Nish+0.609
ersen+0.583
 Rik+0.538
 COURT+0.534
Neg logit contributions
phe-0.910
rophe-0.813
ths-0.748
riad-0.736
ient-0.724
TokenS
Token ID50
Feature activation-1.03e+00

Pos logit contributions
 Probe+0.551
 Nish+0.519
ersen+0.498
 Rik+0.466
 Trafford+0.463
Neg logit contributions
phe-0.657
rophe-0.583
ths-0.524
riad-0.519
ient-0.511
Tokenovere
Token ID33518
Feature activation-2.82e+00

Pos logit contributions
 Probe+1.987
 Nish+1.874
ersen+1.819
ega+1.698
 Rik+1.693
Neg logit contributions
phe-2.136
rophe-1.865
ths-1.694
riad-1.663
 kidding-1.627
Tokenign
Token ID570
Feature activation-7.51e-01

Pos logit contributions
 Probe+4.408
 Nish+4.115
ersen+3.880
 Rik+3.655
 Trafford+3.632
Neg logit contributions
phe-6.730
rophe-6.012
riad-5.557
ths-5.454
ovie-5.406
Token Lux
Token ID17145
Feature activation+2.15e-02

Pos logit contributions
 Probe+1.375
 Nish+1.295
ersen+1.236
 Rik+1.159
 Trafford+1.151
Neg logit contributions
phe-1.666
rophe-1.474
ths-1.335
riad-1.320
ient-1.292
Tokenury
Token ID1601
Feature activation-1.26e-01

Pos logit contributions
phe+0.050
rophe+0.044
ths+0.041
riad+0.040
ient+0.039
Neg logit contributions
 Probe-0.037
 Nish-0.035
ersen-0.033
 Rik-0.031
 NP-0.031
Token Travel
Token ID13524
Feature activation+1.24e+00

Pos logit contributions
 Probe+0.228
 Nish+0.214
ersen+0.205
 Rik+0.191
 COURT+0.190
Neg logit contributions
phe-0.281
rophe-0.250
ths-0.226
riad-0.224
ient-0.220
Token (
Token ID357
Feature activation-2.02e-01

Pos logit contributions
phe+2.740
rophe+2.425
ths+2.209
riad+2.171
ient+2.148
Neg logit contributions
 Probe-2.259
 Nish-2.132
ersen-2.039
 Rik-1.896
 COURT-1.893
Token:
Token ID25
Feature activation-5.91e-01

Pos logit contributions
phe+0.230
rophe+0.202
ths+0.183
riad+0.180
ient+0.177
Neg logit contributions
 Probe-0.209
 Nish-0.197
ersen-0.189
 Rik-0.178
 COURT-0.176
Token "
Token ID366
Feature activation+1.82e-01

Pos logit contributions
 Probe+0.948
 Nish+0.895
ersen+0.836
 Rik+0.781
 NP+0.777
Neg logit contributions
phe-1.442
rophe-1.289
ths-1.180
riad-1.176
ient-1.160
Tokenann
Token ID1236
Feature activation+8.67e-01

Pos logit contributions
phe+0.395
rophe+0.349
ths+0.316
riad+0.312
ient+0.307
Neg logit contributions
 Probe-0.342
 Nish-0.322
ersen-0.309
 Rik-0.289
 COURT-0.287
Token data
Token ID1366
Feature activation-8.85e-01

Pos logit contributions
phe+1.786
rophe+1.572
ths+1.412
riad+1.394
ient+1.375
Neg logit contributions
 Probe-1.718
 Nish-1.610
ersen-1.558
 Rik-1.458
 Trafford-1.452
Token su
Token ID424
Feature activation-4.33e-03

Pos logit contributions
 Probe+1.601
 Nish+1.516
ersen+1.446
 Rik+1.356
 NP+1.347
Neg logit contributions
phe-1.963
rophe-1.736
ths-1.578
riad-1.559
ovie-1.524
Tokenk
Token ID74
Feature activation-2.81e+00

Pos logit contributions
 Probe+0.008
 Nish+0.008
ersen+0.007
 Rik+0.007
 COURT+0.007
Neg logit contributions
phe-0.009
rophe-0.008
ths-0.007
riad-0.007
ient-0.007
Tokenhi
Token ID5303
Feature activation-2.55e-01

Pos logit contributions
 Probe+5.271
 Nish+5.109
ersen+4.890
 NP+4.599
ega+4.555
Neg logit contributions
phe-5.745
rophe-5.007
ths-4.480
riad-4.403
ovie-4.377
Token b
Token ID275
Feature activation-2.03e-01

Pos logit contributions
 Probe+0.437
 Nish+0.411
ersen+0.391
 Rik+0.364
 COURT+0.361
Neg logit contributions
phe-0.595
rophe-0.531
ths-0.484
riad-0.479
ient-0.471
Tokenh
Token ID71
Feature activation-2.63e-01

Pos logit contributions
 Probe+0.402
 Nish+0.383
ersen+0.369
 Rik+0.345
 NP+0.343
Neg logit contributions
phe-0.414
rophe-0.363
ths-0.326
riad-0.321
ient-0.315
Tokenava
Token ID4170
Feature activation+7.26e-02

Pos logit contributions
 Probe+0.531
 Nish+0.503
ersen+0.484
 Rik+0.455
 COURT+0.452
Neg logit contributions
phe-0.530
rophe-0.463
ths-0.416
riad-0.409
ient-0.401
Token"
Token ID1
Feature activation+2.98e-01

Pos logit contributions
phe+0.167
rophe+0.149
ths+0.136
riad+0.134
ient+0.132
Neg logit contributions
 Probe-0.127
 Nish-0.119
ersen-0.113
 Rik-0.106
 COURT-0.105
Token bipartisan
Token ID20953
Feature activation+6.41e-02

Pos logit contributions
 Probe+0.434
 Nish+0.408
ersen+0.391
 Rik+0.365
 COURT+0.363
Neg logit contributions
phe-0.518
rophe-0.459
ths-0.417
riad-0.412
ient-0.404
Token spirit
Token ID4437
Feature activation-7.53e-01

Pos logit contributions
phe+0.145
rophe+0.129
ths+0.118
riad+0.117
ient+0.114
Neg logit contributions
 Probe-0.113
 Nish-0.106
ersen-0.102
 Rik-0.095
 COURT-0.094
Token.
Token ID13
Feature activation-6.80e-02

Pos logit contributions
 Probe+1.364
 Nish+1.292
ersen+1.233
 Rik+1.155
 NP+1.150
Neg logit contributions
phe-1.668
rophe-1.471
riad-1.333
ths-1.323
ient-1.286
Token
Token ID198
Feature activation-1.12e-01

Pos logit contributions
 Probe+0.112
 Nish+0.104
ersen+0.099
 Rik+0.092
 COURT+0.091
Neg logit contributions
phe-0.163
rophe-0.146
ths-0.134
riad-0.132
ient-0.130
Token
Token ID198
Feature activation-4.21e-02

Pos logit contributions
 Probe+0.191
 Nish+0.179
ersen+0.171
 Rik+0.159
 Trafford+0.158
Neg logit contributions
phe-0.261
rophe-0.232
ths-0.214
riad-0.210
ient-0.207
TokenContact
Token ID17829
Feature activation-2.80e+00

Pos logit contributions
 Probe+0.079
 Nish+0.074
ersen+0.072
 Rik+0.067
 COURT+0.066
Neg logit contributions
phe-0.091
rophe-0.081
ths-0.072
riad-0.072
ient-0.071
Token us
Token ID514
Feature activation-2.68e-01

Pos logit contributions
 Probe+4.799
 Nish+4.456
ersen+4.222
 Rik+4.048
 NP+3.950
Neg logit contributions
phe-6.520
rophe-5.759
ths-5.187
riad-5.144
udder-5.091
Token at
Token ID379
Feature activation-1.87e-01

Pos logit contributions
 Probe+0.510
 Nish+0.481
ersen+0.461
 Rik+0.432
 COURT+0.429
Neg logit contributions
phe-0.572
rophe-0.505
ths-0.456
riad-0.451
ient-0.443
Token editors
Token ID15719
Feature activation+4.74e-01

Pos logit contributions
 Probe+0.354
 Nish+0.334
ersen+0.319
 Rik+0.300
 NP+0.298
Neg logit contributions
phe-0.404
rophe-0.357
ths-0.322
riad-0.319
ient-0.313
Token@
Token ID31
Feature activation-8.27e-01

Pos logit contributions
phe+1.437
rophe+1.318
ths+1.233
riad+1.221
ient+1.208
Neg logit contributions
 Probe-0.481
 Nish-0.430
ersen-0.395
 Rik-0.344
 COURT-0.338
Tokentime
Token ID2435
Feature activation+2.32e-01

Pos logit contributions
 Probe+1.549
 Nish+1.462
ersen+1.411
 NP+1.309
 COURT+1.304
Neg logit contributions
phe-1.755
rophe-1.566
ths-1.390
riad-1.390
ient-1.371
Token McLaren
Token ID33017
Feature activation-1.90e+00

Pos logit contributions
 Probe+0.588
 Nish+0.553
ersen+0.528
 Rik+0.494
 NP+0.490
Neg logit contributions
phe-0.731
rophe-0.648
ths-0.589
riad-0.582
ient-0.573
Token's
Token ID338
Feature activation+6.36e-02

Pos logit contributions
 Probe+3.246
 Nish+3.041
ersen+2.859
 Rik+2.702
 NP+2.686
Neg logit contributions
phe-4.456
rophe-3.956
riad-3.570
ths-3.564
ient-3.541
Token Champions
Token ID6662
Feature activation+2.74e-01

Pos logit contributions
phe+0.148
rophe+0.132
ths+0.121
riad+0.119
ient+0.117
Neg logit contributions
 Probe-0.109
 Nish-0.102
ersen-0.097
 Rik-0.090
 COURT-0.090
Token of
Token ID286
Feature activation-5.04e-01

Pos logit contributions
phe+0.644
rophe+0.575
ths+0.526
riad+0.520
ient+0.512
Neg logit contributions
 Probe-0.464
 Nish-0.434
ersen-0.414
 Rik-0.383
 COURT-0.381
Token the
Token ID262
Feature activation-4.62e-01

Pos logit contributions
 Probe+0.901
 Nish+0.848
ersen+0.794
 Rik+0.756
 NP+0.752
Neg logit contributions
phe-1.146
rophe-1.024
ths-0.924
riad-0.919
ient-0.904
Token Future
Token ID10898
Feature activation-2.79e+00

Pos logit contributions
 Probe+0.793
 Nish+0.745
ersen+0.695
 Rik+0.660
 NP+0.657
Neg logit contributions
phe-1.088
rophe-0.962
ths-0.884
riad-0.866
ient-0.861
Token k
Token ID479
Feature activation-1.36e+00

Pos logit contributions
 Probe+5.158
 Nish+4.937
ersen+4.608
 Rik+4.449
 NP+4.381
Neg logit contributions
phe-6.022
rophe-5.283
riad-4.731
ths-4.649
ient-4.638
Tokenart
Token ID433
Feature activation-1.44e+00

Pos logit contributions
 Probe+2.941
 Nish+2.793
ersen+2.740
 Trafford+2.563
 COURT+2.551
Neg logit contributions
phe-2.509
rophe-2.163
ths-1.905
riad-1.869
 kidding-1.828
Tokening
Token ID278
Feature activation-4.80e-01

Pos logit contributions
 Probe+2.636
 Nish+2.515
ersen+2.399
 Rik+2.244
 NP+2.218
Neg logit contributions
phe-3.178
rophe-2.790
ths-2.525
riad-2.510
ient-2.480
Token series
Token ID2168
Feature activation-1.67e-01

Pos logit contributions
 Probe+0.983
 Nish+0.934
ersen+0.895
 Rik+0.846
 NP+0.839
Neg logit contributions
phe-0.960
rophe-0.837
ths-0.746
riad-0.739
ient-0.727
Token 1998
Token ID7795
Feature activation-7.88e-01

Pos logit contributions
 Probe+0.281
 Nish+0.264
ersen+0.250
 Rik+0.233
 COURT+0.231
Neg logit contributions
phe-0.398
rophe-0.355
ths-0.324
riad-0.321
ient-0.317
Token,
Token ID11
Feature activation-1.37e-01

Pos logit contributions
 Probe+0.305
 Nish+0.287
ersen+0.273
 Rik+0.256
 NP+0.255
Neg logit contributions
phe-0.393
rophe-0.349
ths-0.315
riad-0.314
ient-0.306
Token or
Token ID393
Feature activation+2.65e-01

Pos logit contributions
 Probe+0.267
 Nish+0.252
ersen+0.242
 Rik+0.227
 COURT+0.226
Neg logit contributions
phe-0.287
rophe-0.253
ths-0.228
riad-0.225
ient-0.221
Token at
Token ID379
Feature activation-1.34e-01

Pos logit contributions
phe+0.542
rophe+0.475
ths+0.428
riad+0.421
ient+0.414
Neg logit contributions
 Probe-0.528
 Nish-0.499
ersen-0.480
 Rik-0.451
 COURT-0.449
Token least
Token ID1551
Feature activation-2.71e-02

Pos logit contributions
 Probe+0.263
 Nish+0.248
ersen+0.238
 Rik+0.224
 NP+0.223
Neg logit contributions
phe-0.278
rophe-0.244
ths-0.219
riad-0.216
ient-0.212
Token establish
Token ID4474
Feature activation-1.68e-01

Pos logit contributions
 Probe+0.056
 Nish+0.053
ersen+0.051
 Rik+0.048
 COURT+0.048
Neg logit contributions
phe-0.054
rophe-0.047
ths-0.042
riad-0.041
ient-0.041
Token contact
Token ID2800
Feature activation-2.76e+00

Pos logit contributions
 Probe+0.337
 Nish+0.319
ersen+0.306
 Rik+0.288
 NP+0.287
Neg logit contributions
phe-0.341
rophe-0.299
ths-0.268
riad-0.265
ient-0.260
Token,
Token ID11
Feature activation-2.14e-01

Pos logit contributions
 Probe+4.184
 Nish+4.003
ersen+3.746
 Rik+3.570
 NP+3.520
Neg logit contributions
phe-6.608
rophe-5.930
riad-5.404
ovie-5.262
ths-5.262
Token with
Token ID351
Feature activation-4.43e-01

Pos logit contributions
 Probe+0.387
 Nish+0.365
ersen+0.349
 Rik+0.326
 NP+0.323
Neg logit contributions
phe-0.480
rophe-0.426
ths-0.386
riad-0.383
ient-0.375
Token his
Token ID465
Feature activation-4.40e-01

Pos logit contributions
 Probe+0.834
 Nish+0.788
ersen+0.753
 Rik+0.706
 NP+0.702
Neg logit contributions
phe-0.960
rophe-0.848
ths-0.766
riad-0.757
ient-0.744
Token most
Token ID749
Feature activation-1.34e+00

Pos logit contributions
 Probe+0.829
 Nish+0.780
ersen+0.749
 Rik+0.700
 COURT+0.696
Neg logit contributions
phe-0.949
rophe-0.836
ths-0.761
riad-0.753
ient-0.739
Token determined
Token ID5295
Feature activation-1.84e-02

Pos logit contributions
 Probe+2.387
 Nish+2.265
ersen+2.155
 Rik+2.012
 NP+2.011
Neg logit contributions
phe-3.002
rophe-2.676
ths-2.416
riad-2.384
ient-2.345
Token now
Token ID783
Feature activation-7.74e-01

Pos logit contributions
 Probe+0.214
 Nish+0.201
ersen+0.192
 Rik+0.179
 COURT+0.177
Neg logit contributions
phe-0.274
rophe-0.243
ths-0.221
riad-0.219
ient-0.215
Token.
Token ID13
Feature activation-4.63e-01

Pos logit contributions
 Probe+1.435
 Nish+1.352
ersen+1.287
 Rik+1.212
 NP+1.206
Neg logit contributions
phe-1.702
rophe-1.503
ths-1.355
riad-1.351
ient-1.314
Token I
Token ID314
Feature activation-3.54e-01

Pos logit contributions
 Probe+0.781
 Nish+0.729
ersen+0.691
 Rik+0.644
 NP+0.641
Neg logit contributions
phe-1.094
rophe-0.976
ths-0.892
riad-0.885
ient-0.871
Token'm
Token ID1101
Feature activation+1.21e-01

Pos logit contributions
 Probe+0.753
 Nish+0.711
ersen+0.686
 Rik+0.645
 NP+0.642
Neg logit contributions
phe-0.683
rophe-0.594
ths-0.524
riad-0.520
ient-0.512
Token glad
Token ID9675
Feature activation+1.44e-01

Pos logit contributions
phe+0.239
rophe+0.209
ths+0.187
riad+0.184
ient+0.181
Neg logit contributions
 Probe-0.251
 Nish-0.238
ersen-0.229
 Rik-0.216
 COURT-0.215
Token nan
Token ID15709
Feature activation-2.76e+00

Pos logit contributions
phe+0.308
rophe+0.272
ths+0.247
riad+0.243
ient+0.239
Neg logit contributions
 Probe-0.273
 Nish-0.257
ersen-0.247
 Rik-0.231
 COURT-0.230
Tokenocl
Token ID38679
Feature activation-3.02e-01

Pos logit contributions
 Probe+5.159
 Nish+4.839
ersen+4.638
 Rik+4.323
ega+4.320
Neg logit contributions
phe-5.659
rophe-5.011
riad-4.430
ths-4.394
azeera-4.308
Tokenoud
Token ID2778
Feature activation+1.31e+00

Pos logit contributions
 Probe+0.641
 Nish+0.608
ersen+0.590
 Trafford+0.555
 NP+0.552
Neg logit contributions
phe-0.568
rophe-0.499
riad-0.440
ths-0.439
 kidding-0.424
Token wasn
Token ID2492
Feature activation+5.49e-01

Pos logit contributions
phe+2.718
rophe+2.386
ths+2.165
riad+2.118
ient+2.104
Neg logit contributions
 Probe-2.563
 Nish-2.423
ersen-2.318
 Rik-2.190
 COURT-2.188
Token't
Token ID470
Feature activation+6.49e-01

Pos logit contributions
phe+0.931
rophe+0.786
ths+0.702
riad+0.685
ient+0.674
Neg logit contributions
 Probe-1.290
 Nish-1.232
ersen-1.191
 Rik-1.131
 Trafford-1.127
Token the
Token ID262
Feature activation+1.93e-01

Pos logit contributions
phe+1.300
rophe+1.137
ths+1.019
riad+1.005
ient+0.988
Neg logit contributions
 Probe-1.322
 Nish-1.252
ersen-1.205
 Rik-1.133
 COURT-1.128
Token 13
Token ID1511
Feature activation-2.11e-01

Pos logit contributions
 Probe+0.093
 Nish+0.088
ersen+0.084
 Rik+0.079
 COURT+0.079
Neg logit contributions
phe-0.096
rophe-0.085
ths-0.076
riad-0.075
ient-0.074
Token years
Token ID812
Feature activation-8.60e-01

Pos logit contributions
 Probe+0.382
 Nish+0.358
ersen+0.342
 Rik+0.321
 NP+0.318
Neg logit contributions
phe-0.474
rophe-0.421
riad-0.378
ths-0.377
ient-0.369
Token old
Token ID1468
Feature activation-5.64e-01

Pos logit contributions
 Probe+1.607
 Nish+1.529
ersen+1.450
 Rik+1.373
 NP+1.358
Neg logit contributions
phe-1.854
rophe-1.651
ths-1.476
riad-1.476
ient-1.428
Token.
Token ID13
Feature activation-5.35e-01

Pos logit contributions
 Probe+1.054
 Nish+0.998
ersen+0.953
 Rik+0.899
 Trafford+0.888
Neg logit contributions
phe-1.228
rophe-1.088
ths-0.971
riad-0.970
ient-0.943
Token The
Token ID383
Feature activation-1.40e-01

Pos logit contributions
 Probe+0.942
 Nish+0.885
ersen+0.841
 Rik+0.787
 COURT+0.781
Neg logit contributions
phe-1.223
rophe-1.091
ths-0.991
riad-0.980
ient-0.966
Token contact
Token ID2800
Feature activation-2.73e+00

Pos logit contributions
 Probe+0.228
 Nish+0.213
ersen+0.203
 Rik+0.187
 NP+0.185
Neg logit contributions
phe-0.337
rophe-0.302
ths-0.279
riad-0.275
ient-0.271
Token had
Token ID550
Feature activation+7.89e-01

Pos logit contributions
 Probe+4.967
 Nish+4.730
ersen+4.504
 Rik+4.266
 Trafford+4.254
Neg logit contributions
phe-5.816
rophe-5.179
ths-4.563
riad-4.553
ovie-4.464
Token started
Token ID2067
Feature activation-1.28e-01

Pos logit contributions
phe+1.498
rophe+1.307
ths+1.172
riad+1.151
ient+1.139
Neg logit contributions
 Probe-1.678
 Nish-1.578
ersen-1.530
 COURT-1.434
 Rik-1.432
Token via
Token ID2884
Feature activation+6.57e-01

Pos logit contributions
 Probe+0.240
 Nish+0.226
ersen+0.216
 Rik+0.202
 COURT+0.201
Neg logit contributions
phe-0.278
rophe-0.246
ths-0.223
riad-0.220
ient-0.216
Token Instagram
Token ID10767
Feature activation-3.83e-01

Pos logit contributions
phe+1.302
rophe+1.136
ths+1.024
riad+1.011
ient+0.998
Neg logit contributions
 Probe-1.339
 Nish-1.264
ersen-1.228
 Rik-1.147
 COURT-1.144
Token with
Token ID351
Feature activation-4.08e-01

Pos logit contributions
 Probe+0.719
 Nish+0.678
ersen+0.649
 Rik+0.609
 Trafford+0.604
Neg logit contributions
phe-0.830
rophe-0.733
ths-0.661
riad-0.654
ient-0.643
Token Chern
Token ID37243
Feature activation-1.32e-01

Pos logit contributions
 Probe+0.623
 Nish+0.587
ersen+0.555
 Rik+0.526
 NP+0.522
Neg logit contributions
phe-0.750
rophe-0.667
ths-0.604
riad-0.596
ient-0.586
Tokeniv
Token ID452
Feature activation-3.18e-01

Pos logit contributions
 Probe+0.288
 Nish+0.275
ersen+0.264
 Rik+0.251
 NP+0.249
Neg logit contributions
phe-0.246
rophe-0.212
ths-0.189
riad-0.186
ient-0.181
Tokents
Token ID912
Feature activation-3.96e-01

Pos logit contributions
 Probe+0.627
 Nish+0.591
ersen+0.568
 Rik+0.533
 NP+0.532
Neg logit contributions
phe-0.661
rophe-0.581
ths-0.522
riad-0.518
ient-0.505
Tokeni
Token ID72
Feature activation+1.37e-01

Pos logit contributions
 Probe+0.748
 Nish+0.704
ersen+0.670
 Rik+0.636
 NP+0.629
Neg logit contributions
phe-0.854
rophe-0.754
ths-0.685
riad-0.671
ient-0.658
Token,
Token ID11
Feature activation+7.47e-02

Pos logit contributions
phe+0.303
rophe+0.268
ths+0.243
riad+0.240
ient+0.236
Neg logit contributions
 Probe-0.253
 Nish-0.238
ersen-0.227
 Rik-0.213
 NP-0.212
Token Lv
Token ID24388
Feature activation+1.85e+00

Pos logit contributions
phe+0.165
rophe+0.146
ths+0.133
riad+0.131
ient+0.129
Neg logit contributions
 Probe-0.137
 Nish-0.129
ersen-0.124
 Rik-0.116
 COURT-0.115
Tokeniv
Token ID452
Feature activation-4.97e-01

Pos logit contributions
phe+3.016
rophe+2.546
ths+2.206
riad+2.181
ient+2.123
Neg logit contributions
 Probe-4.467
 Nish-4.250
ersen-4.131
 COURT-3.911
 Rik-3.907
Token and
Token ID290
Feature activation-4.91e-02

Pos logit contributions
 Probe+0.911
 Nish+0.854
ersen+0.811
 Rik+0.766
 NP+0.765
Neg logit contributions
phe-1.102
rophe-0.979
ths-0.886
riad-0.876
ient-0.856
Token Iv
Token ID16975
Feature activation-5.90e-01

Pos logit contributions
 Probe+0.087
 Nish+0.082
ersen+0.078
 Rik+0.073
 NP+0.072
Neg logit contributions
phe-0.111
rophe-0.099
ths-0.090
riad-0.089
ient-0.088
Tokenano
Token ID5733
Feature activation-2.40e-01

Pos logit contributions
 Probe+1.363
 Nish+1.297
ersen+1.252
 Rik+1.191
 COURT+1.188
Neg logit contributions
phe-1.028
rophe-0.880
ths-0.771
riad-0.762
ient-0.745
Token-
Token ID12
Feature activation+4.79e-01

Pos logit contributions
 Probe+0.362
 Nish+0.337
ersen+0.312
 Rik+0.297
 NP+0.294
Neg logit contributions
phe-0.612
rophe-0.551
ths-0.506
riad-0.502
ient-0.489
Token actually
Token ID1682
Feature activation+1.20e+00

Pos logit contributions
phe+0.527
rophe+0.458
ths+0.410
riad+0.404
ient+0.396
Neg logit contributions
 Probe-0.558
 Nish-0.531
ersen-0.511
 Rik-0.482
 Trafford-0.480
Token show
Token ID905
Feature activation+8.90e-01

Pos logit contributions
phe+2.317
rophe+2.000
ths+1.781
riad+1.760
ient+1.723
Neg logit contributions
 Probe-2.492
 Nish-2.370
ersen-2.288
 Trafford-2.168
 Rik-2.156
Token a
Token ID257
Feature activation+3.26e-01

Pos logit contributions
phe+1.831
rophe+1.608
ths+1.444
riad+1.423
ient+1.400
Neg logit contributions
 Probe-1.758
 Nish-1.661
ersen-1.607
 Rik-1.498
 COURT-1.494
Token toilet
Token ID16146
Feature activation-6.93e-01

Pos logit contributions
phe+0.710
rophe+0.626
ths+0.566
riad+0.561
ient+0.549
Neg logit contributions
 Probe-0.608
 Nish-0.573
ersen-0.553
 Rik-0.513
 COURT-0.511
Token in
Token ID287
Feature activation-8.01e-02

Pos logit contributions
 Probe+1.276
 Nish+1.194
ersen+1.144
 Rik+1.074
 NP+1.068
Neg logit contributions
phe-1.515
rophe-1.333
ths-1.228
riad-1.197
ient-1.172
Token ab
Token ID450
Feature activation+1.95e+00

Pos logit contributions
 Probe+0.153
 Nish+0.145
ersen+0.138
 Rik+0.130
 COURT+0.129
Neg logit contributions
phe-0.171
rophe-0.151
ths-0.136
riad-0.134
ient-0.132
Tokenath
Token ID776
Feature activation-2.27e-01

Pos logit contributions
phe+5.068
rophe+4.589
ths+4.259
riad+4.221
ient+4.144
Neg logit contributions
 Probe-2.794
 Nish-2.612
ersen-2.412
 Rik-2.245
 Trafford-2.214
Tokenroom
Token ID3823
Feature activation-4.62e-01

Pos logit contributions
 Probe+0.422
 Nish+0.399
ersen+0.378
 Rik+0.362
 NP+0.359
Neg logit contributions
phe-0.485
rophe-0.427
ths-0.385
riad-0.380
ient-0.372
Token seemed
Token ID3947
Feature activation+2.32e-01

Pos logit contributions
 Probe+0.886
 Nish+0.837
ersen+0.800
 Rik+0.753
 COURT+0.749
Neg logit contributions
phe-0.978
rophe-0.862
ths-0.777
riad-0.771
ient-0.755
Token oddly
Token ID31414
Feature activation+3.34e-01

Pos logit contributions
phe+0.494
rophe+0.436
ths+0.394
riad+0.389
ient+0.382
Neg logit contributions
 Probe-0.444
 Nish-0.419
ersen-0.402
 Rik-0.376
 COURT-0.374
Token appropriate
Token ID5035
Feature activation-2.73e-01

Pos logit contributions
phe+0.736
rophe+0.653
ths+0.593
riad+0.586
ient+0.577
Neg logit contributions
 Probe-0.614
 Nish-0.576
ersen-0.553
 Rik-0.516
 COURT-0.515
Token amount
Token ID2033
Feature activation+1.10e-01

Pos logit contributions
phe+1.571
rophe+1.383
ths+1.253
riad+1.238
ient+1.212
Neg logit contributions
 Probe-1.455
 Nish-1.375
ersen-1.317
 Rik-1.239
 COURT-1.228
Token of
Token ID286
Feature activation+3.76e-01

Pos logit contributions
phe+0.271
rophe+0.244
ths+0.222
riad+0.221
ient+0.217
Neg logit contributions
 Probe-0.173
 Nish-0.162
ersen-0.155
 NP-0.142
 Rik-0.142
Token inconvenience
Token ID31818
Feature activation+1.03e+00

Pos logit contributions
phe+0.772
rophe+0.679
ths+0.613
riad+0.604
ient+0.592
Neg logit contributions
 Probe-0.744
 Nish-0.703
ersen-0.675
 Rik-0.634
 COURT-0.630
Token to
Token ID284
Feature activation+2.55e-01

Pos logit contributions
phe+2.199
rophe+1.939
ths+1.752
riad+1.728
ient+1.702
Neg logit contributions
 Probe-1.981
 Nish-1.869
ersen-1.792
 Rik-1.679
 COURT-1.667
Token the
Token ID262
Feature activation+2.17e-01

Pos logit contributions
phe+0.503
rophe+0.439
ths+0.391
riad+0.386
ient+0.379
Neg logit contributions
 Probe-0.531
 Nish-0.503
ersen-0.485
 Rik-0.457
 NP-0.455
Token traveler
Token ID40168
Feature activation+2.39e+00

Pos logit contributions
phe+0.445
rophe+0.390
ths+0.350
riad+0.345
ient+0.339
Neg logit contributions
 Probe-0.435
 Nish-0.411
ersen-0.395
 Rik-0.372
 NP-0.369
Token,
Token ID11
Feature activation+6.32e-02

Pos logit contributions
phe+4.810
rophe+4.180
ths+3.774
riad+3.715
ient+3.683
Neg logit contributions
 Probe-4.811
 Nish-4.551
ersen-4.333
 Rik-4.085
 Trafford-4.060
Token
Token ID447
Feature activation+1.71e-01

Pos logit contributions
phe+0.135
rophe+0.119
ths+0.108
riad+0.107
ient+0.105
Neg logit contributions
 Probe-0.120
 Nish-0.114
ersen-0.109
 Rik-0.102
 NP-0.102
Token
Token ID251
Feature activation-1.56e-01

Pos logit contributions
phe+0.422
rophe+0.378
ths+0.346
riad+0.344
ient+0.338
Neg logit contributions
 Probe-0.269
 Nish-0.251
ersen-0.238
 Rik-0.219
 COURT-0.218
Token he
Token ID339
Feature activation-1.03e-01

Pos logit contributions
 Probe+0.295
 Nish+0.279
ersen+0.268
 Rik+0.251
 COURT+0.250
Neg logit contributions
phe-0.333
rophe-0.295
ths-0.264
riad-0.263
ient-0.257
Token told
Token ID1297
Feature activation+1.02e-01

Pos logit contributions
 Probe+0.183
 Nish+0.173
ersen+0.165
 Rik+0.154
 NP+0.153
Neg logit contributions
phe-0.232
rophe-0.206
ths-0.187
riad-0.186
ient-0.182
Token original
Token ID2656
Feature activation+6.06e-01

Pos logit contributions
 Probe+0.913
 Nish+0.854
ersen+0.805
 Rik+0.749
 NP+0.744
Neg logit contributions
phe-1.395
rophe-1.252
ths-1.147
riad-1.138
ient-1.120
Token,
Token ID11
Feature activation+5.02e-01

Pos logit contributions
phe+1.303
rophe+1.151
ths+1.040
riad+1.028
ient+1.011
Neg logit contributions
 Probe-1.149
 Nish-1.081
ersen-1.037
 Rik-0.970
 COURT-0.965
Token we
Token ID356
Feature activation+5.44e-01

Pos logit contributions
phe+1.042
rophe+0.916
ths+0.828
riad+0.816
ient+0.805
Neg logit contributions
 Probe-0.977
 Nish-0.919
ersen-0.881
 Trafford-0.829
 COURT-0.826
Token had
Token ID550
Feature activation+9.02e-01

Pos logit contributions
phe+1.083
rophe+0.947
ths+0.853
riad+0.837
ient+0.824
Neg logit contributions
 Probe-1.114
 Nish-1.055
ersen-1.011
 Rik-0.955
 Trafford-0.953
Token no
Token ID645
Feature activation+2.37e-01

Pos logit contributions
phe+2.025
rophe+1.815
ths+1.654
riad+1.636
ient+1.615
Neg logit contributions
 Probe-1.600
 Nish-1.495
ersen-1.421
 Rik-1.329
 Trafford-1.328
Token idea
Token ID2126
Feature activation+2.59e+00

Pos logit contributions
phe+0.368
rophe+0.310
ths+0.268
riad+0.263
ient+0.257
Neg logit contributions
 Probe-0.591
 Nish-0.564
ersen-0.547
 Rik-0.521
 COURT-0.518
Token.
Token ID13
Feature activation+3.44e-01

Pos logit contributions
phe+4.873
rophe+4.263
 kidding+3.748
ths+3.745
ient+3.720
Neg logit contributions
 Probe-5.333
 Nish-5.009
ersen-4.852
 Trafford-4.580
 COURT-4.573
Token Now
Token ID2735
Feature activation-4.77e-01

Pos logit contributions
phe+0.768
rophe+0.678
ths+0.618
riad+0.611
ient+0.596
Neg logit contributions
 Probe-0.615
 Nish-0.578
ersen-0.564
 Rik-0.514
 Trafford-0.513
Token,
Token ID11
Feature activation-1.25e-01

Pos logit contributions
 Probe+0.901
 Nish+0.849
ersen+0.813
 Rik+0.764
 NP+0.757
Neg logit contributions
phe-1.029
rophe-0.909
ths-0.821
riad-0.813
ient-0.800
Token on
Token ID319
Feature activation+2.03e-01

Pos logit contributions
 Probe+0.236
 Nish+0.222
ersen+0.213
 Rik+0.199
 Trafford+0.198
Neg logit contributions
phe-0.270
rophe-0.239
ths-0.217
riad-0.214
ient-0.210
Token movie
Token ID3807
Feature activation+4.92e-02

Pos logit contributions
phe+0.439
rophe+0.388
ths+0.352
riad+0.347
ient+0.341
Neg logit contributions
 Probe-0.377
 Nish-0.357
ersen-0.343
 Rik-0.319
 Trafford-0.318
Token Mid
Token ID7215
Feature activation-3.37e-01

Pos logit contributions
phe+0.103
rophe+0.090
ths+0.080
riad+0.079
ient+0.078
Neg logit contributions
 Probe-0.106
 Nish-0.100
ersen-0.096
 Rik-0.091
 COURT-0.090
Tokenn
Token ID77
Feature activation-5.37e-01

Pos logit contributions
 Probe+0.664
 Nish+0.629
ersen+0.605
 Rik+0.567
 NP+0.566
Neg logit contributions
phe-0.695
rophe-0.610
ths-0.548
riad-0.539
ient-0.531
Tokenighter
Token ID4799
Feature activation+5.46e-01

Pos logit contributions
 Probe+1.253
 Nish+1.194
ersen+1.163
 Rik+1.096
 Trafford+1.092
Neg logit contributions
phe-0.914
rophe-0.775
ths-0.679
riad-0.666
ient-0.648
Token doesn
Token ID1595
Feature activation+2.41e-01

Pos logit contributions
phe+1.153
rophe+1.015
ths+0.919
riad+0.905
ient+0.892
Neg logit contributions
 Probe-1.051
 Nish-0.991
ersen-0.954
 Rik-0.890
 COURT-0.888
Token't
Token ID470
Feature activation+1.11e+00

Pos logit contributions
phe+0.417
rophe+0.355
ths+0.316
riad+0.309
ient+0.301
Neg logit contributions
 Probe-0.559
 Nish-0.532
ersen-0.515
 Rik-0.488
 Trafford-0.486
Token exactly
Token ID3446
Feature activation+1.87e+00

Pos logit contributions
phe+2.020
rophe+1.744
ths+1.567
riad+1.513
 kidding+1.493
Neg logit contributions
 Probe-2.454
 Nish-2.339
ersen-2.265
 Trafford-2.138
 Rik-2.136
Token jump
Token ID4391
Feature activation+7.58e-01

Pos logit contributions
phe+3.576
rophe+3.123
ths+2.819
 kidding+2.711
riad+2.710
Neg logit contributions
 Probe-3.873
 Nish-3.671
ersen-3.553
 Trafford-3.352
 Rik-3.333
Token to
Token ID284
Feature activation+1.95e-02

Pos logit contributions
phe+1.613
rophe+1.422
ths+1.284
riad+1.268
ient+1.247
Neg logit contributions
 Probe-1.451
 Nish-1.370
ersen-1.313
 Rik-1.232
 COURT-1.223
Token mind
Token ID2000
Feature activation+8.19e-01

Pos logit contributions
phe+0.041
rophe+0.037
ths+0.033
riad+0.033
ient+0.032
Neg logit contributions
 Probe-0.037
 Nish-0.035
ersen-0.034
 Rik-0.032
 COURT-0.032
Token since
Token ID1201
Feature activation+6.24e-02

Pos logit contributions
phe+1.691
rophe+1.486
ths+1.338
riad+1.318
ient+1.301
Neg logit contributions
 Probe-1.618
 Nish-1.527
ersen-1.469
 Rik-1.378
 COURT-1.370
Token he
Token ID339
Feature activation-1.23e-01

Pos logit contributions
phe+0.135
rophe+0.119
ths+0.108
riad+0.107
ient+0.105
Neg logit contributions
 Probe-0.117
 Nish-0.110
ersen-0.106
 Rik-0.099
 COURT-0.098
Token what
Token ID644
Feature activation-6.12e-01

Pos logit contributions
phe+0.468
rophe+0.412
ths+0.371
riad+0.366
ient+0.360
Neg logit contributions
 Probe-0.441
 Nish-0.417
ersen-0.400
 Rik-0.376
 COURT-0.374
Token a
Token ID257
Feature activation+2.59e-01

Pos logit contributions
 Probe+1.160
 Nish+1.096
ersen+1.047
 Rik+0.985
 NP+0.976
Neg logit contributions
phe-1.318
rophe-1.163
ths-1.049
riad-1.038
ient-1.021
Token sewing
Token ID35356
Feature activation-3.54e-01

Pos logit contributions
phe+0.581
rophe+0.517
ths+0.471
riad+0.466
ient+0.455
Neg logit contributions
 Probe-0.462
 Nish-0.433
ersen-0.417
 Rik-0.386
 Trafford-0.385
Token machine
Token ID4572
Feature activation-9.56e-01

Pos logit contributions
 Probe+0.608
 Nish+0.571
ersen+0.542
 Rik+0.505
 NP+0.502
Neg logit contributions
phe-0.824
rophe-0.732
ths-0.669
riad-0.662
ient-0.652
Token does
Token ID857
Feature activation+5.01e-01

Pos logit contributions
 Probe+1.782
 Nish+1.679
ersen+1.601
 Rik+1.503
 COURT+1.492
Neg logit contributions
phe-2.090
rophe-1.845
ths-1.665
riad-1.656
ient-1.621
Token.
Token ID13
Feature activation+1.51e-01

Pos logit contributions
phe+1.085
rophe+0.960
ths+0.870
riad+0.857
ient+0.846
Neg logit contributions
 Probe-0.940
 Nish-0.886
ersen-0.848
 Rik-0.794
 COURT-0.789
Token Teach
Token ID42015
Feature activation+4.73e-01

Pos logit contributions
phe+0.341
rophe+0.303
ths+0.278
riad+0.272
ient+0.268
Neg logit contributions
 Probe-0.266
 Nish-0.250
ersen-0.242
 Rik-0.222
 Trafford-0.222
Token them
Token ID606
Feature activation+3.88e-01

Pos logit contributions
phe+0.956
rophe+0.837
ths+0.758
riad+0.743
ient+0.732
Neg logit contributions
 Probe-0.953
 Nish-0.899
ersen-0.866
 Rik-0.813
 COURT-0.809
Token how
Token ID703
Feature activation-2.71e-02

Pos logit contributions
phe+0.744
rophe+0.647
ths+0.577
riad+0.568
ient+0.557
Neg logit contributions
 Probe-0.824
 Nish-0.781
ersen-0.752
 Rik-0.710
 COURT-0.706
Token to
Token ID284
Feature activation+1.16e-01

Pos logit contributions
 Probe+0.047
 Nish+0.044
ersen+0.042
 Rik+0.040
 COURT+0.039
Neg logit contributions
phe-0.062
rophe-0.055
ths-0.050
riad-0.050
ient-0.049
Token thread
Token ID4704
Feature activation-1.23e+00

Pos logit contributions
phe+0.236
rophe+0.207
ths+0.188
riad+0.184
ient+0.180
Neg logit contributions
 Probe-0.232
 Nish-0.219
ersen-0.211
 Rik-0.199
 Trafford-0.198
Token measures
Token ID5260
Feature activation+6.46e-01

Pos logit contributions
phe+1.089
rophe+0.974
ths+0.891
riad+0.883
ient+0.869
Neg logit contributions
 Probe-0.748
 Nish-0.699
ersen-0.665
 Rik-0.617
 NP-0.612
Token to
Token ID284
Feature activation+6.40e-02

Pos logit contributions
phe+1.377
rophe+1.217
ths+1.101
riad+1.082
ient+1.069
Neg logit contributions
 Probe-1.234
 Nish-1.160
ersen-1.117
 Rik-1.043
 COURT-1.038
Token keep
Token ID1394
Feature activation+5.09e-01

Pos logit contributions
phe+0.114
rophe+0.098
ths+0.087
riad+0.085
ient+0.083
Neg logit contributions
 Probe-0.144
 Nish-0.138
ersen-0.133
 Rik-0.126
 COURT-0.125
Token the
Token ID262
Feature activation+2.72e-01

Pos logit contributions
phe+1.072
rophe+0.944
ths+0.851
riad+0.841
ient+0.826
Neg logit contributions
 Probe-0.987
 Nish-0.932
ersen-0.894
 Rik-0.839
 COURT-0.833
Token government
Token ID1230
Feature activation+3.11e-01

Pos logit contributions
phe+0.659
rophe+0.589
ths+0.540
riad+0.534
ient+0.526
Neg logit contributions
 Probe-0.440
 Nish-0.412
ersen-0.390
 Rik-0.362
 NP-0.360
Token open
Token ID1280
Feature activation+1.95e-01

Pos logit contributions
phe+0.627
rophe+0.547
ths+0.490
riad+0.486
ient+0.474
Neg logit contributions
 Probe-0.630
 Nish-0.600
ersen-0.574
 Rik-0.543
 NP-0.542
Token.
Token ID13
Feature activation-1.20e-01

Pos logit contributions
phe+0.419
rophe+0.368
ths+0.332
riad+0.331
ient+0.323
Neg logit contributions
 Probe-0.371
 Nish-0.351
ersen-0.336
 Rik-0.315
 NP-0.313
Token While
Token ID2893
Feature activation-1.02e-01

Pos logit contributions
 Probe+0.205
 Nish+0.192
ersen+0.182
 Rik+0.170
 COURT+0.168
Neg logit contributions
phe-0.283
rophe-0.253
ths-0.230
riad-0.228
ient-0.225
Token the
Token ID262
Feature activation-1.03e-01

Pos logit contributions
 Probe+0.197
 Nish+0.186
ersen+0.178
 Rik+0.168
 NP+0.167
Neg logit contributions
phe-0.215
rophe-0.189
ths-0.171
riad-0.169
ient-0.166
Token budget
Token ID4466
Feature activation+7.25e-01

Pos logit contributions
 Probe+0.184
 Nish+0.173
ersen+0.165
 Rik+0.154
 NP+0.153
Neg logit contributions
phe-0.233
rophe-0.207
ths-0.188
riad-0.186
ient-0.183
Token and
Token ID290
Feature activation+2.81e-01

Pos logit contributions
phe+1.480
rophe+1.306
ths+1.171
riad+1.151
ient+1.141
Neg logit contributions
 Probe-1.443
 Nish-1.366
ersen-1.308
 Rik-1.235
 Trafford-1.229
Token working
Token ID1762
Feature activation-9.16e-01

Pos logit contributions
 Probe+3.422
 Nish+3.228
ersen+3.047
 Rik+2.865
 NP+2.854
Neg logit contributions
phe-4.507
rophe-4.058
riad-3.613
ths-3.604
ovie-3.522
Token in
Token ID287
Feature activation-4.61e-01

Pos logit contributions
 Probe+1.647
 Nish+1.556
ersen+1.466
 NP+1.391
 Rik+1.388
Neg logit contributions
phe-2.052
rophe-1.829
ths-1.646
riad-1.626
ient-1.592
Token IT
Token ID7283
Feature activation-3.51e-01

Pos logit contributions
 Probe+0.870
 Nish+0.822
ersen+0.781
 Rik+0.739
 NP+0.735
Neg logit contributions
phe-0.996
rophe-0.881
ths-0.794
riad-0.781
ient-0.768
Token,
Token ID11
Feature activation-2.22e-01

Pos logit contributions
 Probe+0.649
 Nish+0.610
ersen+0.584
 Rik+0.546
 NP+0.544
Neg logit contributions
phe-0.769
rophe-0.682
ths-0.619
riad-0.610
ient-0.599
Token I
Token ID314
Feature activation-2.46e-01

Pos logit contributions
 Probe+0.435
 Nish+0.411
ersen+0.394
 Rik+0.371
 NP+0.368
Neg logit contributions
phe-0.464
rophe-0.409
ths-0.368
riad-0.364
ient-0.357
Token'm
Token ID1101
Feature activation+2.50e-01

Pos logit contributions
 Probe+0.508
 Nish+0.478
ersen+0.462
 Rik+0.434
 NP+0.433
Neg logit contributions
phe-0.490
rophe-0.428
ths-0.380
riad-0.378
ient-0.370
Token sure
Token ID1654
Feature activation-4.22e-01

Pos logit contributions
phe+0.508
rophe+0.445
ths+0.400
riad+0.395
ient+0.388
Neg logit contributions
 Probe-0.503
 Nish-0.476
ersen-0.458
 Rik-0.431
 COURT-0.428
Token you
Token ID345
Feature activation-3.80e-01

Pos logit contributions
 Probe+0.830
 Nish+0.786
ersen+0.751
 Rik+0.711
 NP+0.705
Neg logit contributions
phe-0.880
rophe-0.774
ths-0.695
riad-0.688
ient-0.674
Token'll
Token ID1183
Feature activation-6.11e-01

Pos logit contributions
 Probe+0.768
 Nish+0.726
ersen+0.698
 Rik+0.657
 COURT+0.652
Neg logit contributions
phe-0.769
rophe-0.674
ths-0.600
riad-0.596
ient-0.584
Token find
Token ID1064
Feature activation-1.12e+00

Pos logit contributions
 Probe+1.236
 Nish+1.169
ersen+1.127
 Rik+1.059
 NP+1.052
Neg logit contributions
phe-1.236
rophe-1.083
ths-0.960
riad-0.958
ient-0.937
Token much
Token ID881
Feature activation-1.04e+00

Pos logit contributions
 Probe+2.259
 Nish+2.164
ersen+2.055
 Rik+1.962
 NP+1.952
Neg logit contributions
phe-2.244
rophe-1.962
ths-1.736
riad-1.727
ovie-1.689
Token post
Token ID1281
Feature activation-1.45e+00

Pos logit contributions
phe+0.131
rophe+0.116
ths+0.105
riad+0.104
ient+0.102
Neg logit contributions
 Probe-0.109
 Nish-0.103
ersen-0.099
 Rik-0.092
 COURT-0.091
Token-
Token ID12
Feature activation-1.61e-01

Pos logit contributions
 Probe+2.298
 Nish+2.162
ersen+2.006
 Rik+1.879
 NP+1.877
Neg logit contributions
phe-3.510
rophe-3.117
ths-2.926
riad-2.860
ient-2.814
Tokenmobile
Token ID24896
Feature activation+5.44e-01

Pos logit contributions
 Probe+0.343
 Nish+0.325
ersen+0.313
 NP+0.296
 Rik+0.296
Neg logit contributions
phe-0.306
rophe-0.265
ths-0.237
riad-0.232
ient-0.228
Token world
Token ID995
Feature activation+2.64e-01

Pos logit contributions
phe+1.031
rophe+0.892
ths+0.797
riad+0.785
ient+0.768
Neg logit contributions
 Probe-1.167
 Nish-1.107
ersen-1.071
 Rik-1.010
 COURT-1.002
Token
Token ID198
Feature activation+4.15e-01

Pos logit contributions
phe+0.634
rophe+0.567
ths+0.519
riad+0.514
ient+0.507
Neg logit contributions
 Probe-0.435
 Nish-0.406
ersen-0.386
 Rik-0.358
 COURT-0.356
Token
Token ID198
Feature activation+3.43e-01

Pos logit contributions
phe+0.942
rophe+0.834
ths+0.774
riad+0.756
ient+0.744
Neg logit contributions
 Probe-0.734
 Nish-0.686
ersen-0.661
 Rik-0.614
 Trafford-0.610
TokenWhy
Token ID5195
Feature activation+3.71e-02

Pos logit contributions
phe+0.741
rophe+0.657
ths+0.592
riad+0.586
ient+0.576
Neg logit contributions
 Probe-0.644
 Nish-0.607
ersen-0.581
 Rik-0.545
 NP-0.541
Token it
Token ID340
Feature activation+1.88e-01

Pos logit contributions
phe+0.080
rophe+0.071
ths+0.064
riad+0.063
ient+0.062
Neg logit contributions
 Probe-0.070
 Nish-0.066
ersen-0.063
 Rik-0.059
 NP-0.059
Token
Token ID447
Feature activation-2.17e-01

Pos logit contributions
phe+0.391
rophe+0.345
ths+0.310
riad+0.306
ient+0.300
Neg logit contributions
 Probe-0.370
 Nish-0.350
ersen-0.336
 Rik-0.316
 NP-0.313
Token
Token ID247
Feature activation+1.28e-01

Pos logit contributions
 Probe+0.412
 Nish+0.389
ersen+0.373
 Rik+0.349
 COURT+0.347
Neg logit contributions
phe-0.466
rophe-0.411
ths-0.372
riad-0.367
ient-0.361
Tokens
Token ID82
Feature activation-6.67e-02

Pos logit contributions
phe+0.285
rophe+0.253
ths+0.229
riad+0.227
ient+0.223
Neg logit contributions
 Probe-0.232
 Nish-0.218
ersen-0.209
 Rik-0.194
 NP-0.193
Token in
Token ID287
Feature activation-2.16e-02

Pos logit contributions
phe+0.714
rophe+0.632
ths+0.572
riad+0.566
ient+0.556
Neg logit contributions
 Probe-0.587
 Nish-0.553
ersen-0.528
 Rik-0.494
 NP-0.490
Token the
Token ID262
Feature activation+2.02e-01

Pos logit contributions
 Probe+0.042
 Nish+0.039
ersen+0.038
 Rik+0.035
 NP+0.035
Neg logit contributions
phe-0.046
rophe-0.040
ths-0.036
riad-0.036
ient-0.035
Token wake
Token ID7765
Feature activation+6.16e-01

Pos logit contributions
phe+0.405
rophe+0.354
ths+0.317
riad+0.313
ient+0.307
Neg logit contributions
 Probe-0.411
 Nish-0.389
ersen-0.374
 Rik-0.352
 COURT-0.350
Token of
Token ID286
Feature activation-1.84e-01

Pos logit contributions
phe+1.402
rophe+1.251
ths+1.154
riad+1.132
ient+1.113
Neg logit contributions
 Probe-1.083
 Nish-1.004
ersen-0.960
 COURT-0.895
 Trafford-0.894
Token the
Token ID262
Feature activation-1.68e-01

Pos logit contributions
 Probe+0.337
 Nish+0.316
ersen+0.304
 Rik+0.284
 NP+0.282
Neg logit contributions
phe-0.410
rophe-0.363
ths-0.328
riad-0.325
ient-0.320
Token Republican
Token ID3415
Feature activation+1.47e+00

Pos logit contributions
 Probe+0.306
 Nish+0.288
ersen+0.275
 Rik+0.257
 NP+0.256
Neg logit contributions
phe-0.375
rophe-0.332
ths-0.301
riad-0.298
ient-0.293
Token victory
Token ID5373
Feature activation+8.21e-01

Pos logit contributions
phe+2.994
rophe+2.645
ths+2.389
riad+2.367
ient+2.306
Neg logit contributions
 Probe-2.872
 Nish-2.735
ersen-2.650
 Trafford-2.459
 Rik-2.458
Token.
Token ID13
Feature activation+7.96e-02

Pos logit contributions
phe+1.747
rophe+1.540
ths+1.390
riad+1.373
ient+1.349
Neg logit contributions
 Probe-1.575
 Nish-1.487
ersen-1.426
 Rik-1.337
 COURT-1.327
Token
Token ID198
Feature activation+3.01e-01

Pos logit contributions
phe+0.186
rophe+0.166
ths+0.152
riad+0.150
ient+0.148
Neg logit contributions
 Probe-0.135
 Nish-0.127
ersen-0.121
 Rik-0.112
 COURT-0.111
Token
Token ID198
Feature activation+2.39e-01

Pos logit contributions
phe+0.681
rophe+0.602
ths+0.558
riad+0.548
ient+0.538
Neg logit contributions
 Probe-0.535
 Nish-0.500
ersen-0.483
 Rik-0.446
 Trafford-0.444
TokenSent
Token ID31837
Feature activation-1.04e+00

Pos logit contributions
phe+0.506
rophe+0.448
ths+0.402
riad+0.397
ient+0.392
Neg logit contributions
 Probe-0.458
 Nish-0.432
ersen-0.414
 Rik-0.389
 COURT-0.386
Token to
Token ID284
Feature activation+6.02e-02

Pos logit contributions
phe+0.328
rophe+0.291
ths+0.266
riad+0.265
ient+0.259
Neg logit contributions
 Probe-0.241
 Nish-0.226
ersen-0.215
 Rik-0.201
 NP-0.200
Token show
Token ID905
Feature activation+7.05e-01

Pos logit contributions
phe+0.117
rophe+0.102
ths+0.091
riad+0.090
ient+0.088
Neg logit contributions
 Probe-0.125
 Nish-0.119
ersen-0.115
 Rik-0.108
 Trafford-0.108
Token up
Token ID510
Feature activation-1.68e-01

Pos logit contributions
phe+1.454
rophe+1.279
ths+1.151
riad+1.136
ient+1.115
Neg logit contributions
 Probe-1.389
 Nish-1.310
ersen-1.263
 Rik-1.180
 Trafford-1.176
Token,
Token ID11
Feature activation-2.22e-02

Pos logit contributions
 Probe+0.305
 Nish+0.288
ersen+0.274
 Rik+0.257
 NP+0.256
Neg logit contributions
phe-0.375
rophe-0.333
ths-0.302
riad-0.299
ient-0.292
Token and
Token ID290
Feature activation+2.93e-01

Pos logit contributions
 Probe+0.041
 Nish+0.038
ersen+0.037
 Rik+0.034
 NP+0.034
Neg logit contributions
phe-0.049
rophe-0.043
ths-0.039
riad-0.039
ient-0.038
Token a
Token ID257
Feature activation+6.34e-02

Pos logit contributions
phe+0.598
rophe+0.524
ths+0.472
riad+0.464
ient+0.457
Neg logit contributions
 Probe-0.586
 Nish-0.553
ersen-0.532
 Rik-0.500
 COURT-0.498
Token constantly
Token ID7558
Feature activation+5.53e-01

Pos logit contributions
phe+0.135
rophe+0.119
ths+0.108
riad+0.106
ient+0.104
Neg logit contributions
 Probe-0.121
 Nish-0.114
ersen-0.110
 Rik-0.103
 COURT-0.102
Token changing
Token ID5609
Feature activation+6.24e-01

Pos logit contributions
phe+1.200
rophe+1.061
ths+0.960
riad+0.949
ient+0.937
Neg logit contributions
 Probe-1.035
 Nish-0.972
ersen-0.937
 Rik-0.873
 COURT-0.869
Token line
Token ID1627
Feature activation-1.81e-02

Pos logit contributions
phe+1.198
rophe+1.042
ths+0.929
riad+0.918
ient+0.901
Neg logit contributions
 Probe-1.323
 Nish-1.250
ersen-1.211
 Rik-1.138
 COURT-1.135
Token-
Token ID12
Feature activation+5.31e-01

Pos logit contributions
 Probe+0.029
 Nish+0.027
ersen+0.026
 Rik+0.024
 NP+0.024
Neg logit contributions
phe-0.044
rophe-0.039
ths-0.036
riad-0.036
ient-0.035
Tokenup
Token ID929
Feature activation+1.53e-01

Pos logit contributions
phe+1.082
rophe+0.948
ths+0.852
riad+0.840
ient+0.825
Neg logit contributions
 Probe-1.065
 Nish-1.008
ersen-0.968
 Rik-0.911
 COURT-0.905
Token the
Token ID262
Feature activation-1.70e-01

Pos logit contributions
 Probe+0.813
 Nish+0.755
ersen+0.716
 Rik+0.661
 NP+0.658
Neg logit contributions
phe-1.297
rophe-1.165
ths-1.068
riad-1.060
ient-1.042
Token 2008
Token ID3648
Feature activation-1.66e+00

Pos logit contributions
 Probe+0.281
 Nish+0.263
ersen+0.251
 Rik+0.232
 COURT+0.230
Neg logit contributions
phe-0.405
rophe-0.362
ths-0.332
riad-0.328
ient-0.324
Token election
Token ID3071
Feature activation-5.27e-01

Pos logit contributions
 Probe+2.618
 Nish+2.409
ersen+2.285
 Rik+2.126
 NP+2.125
Neg logit contributions
phe-4.093
rophe-3.642
ths-3.322
riad-3.312
ient-3.238
Token.
Token ID13
Feature activation-2.44e-01

Pos logit contributions
 Probe+0.992
 Nish+0.929
ersen+0.894
 Rik+0.841
 NP+0.838
Neg logit contributions
phe-1.141
rophe-0.997
ths-0.900
riad-0.900
ient-0.882
Token
Token ID198
Feature activation-2.64e-01

Pos logit contributions
 Probe+0.407
 Nish+0.381
ersen+0.366
 Rik+0.337
 COURT+0.334
Neg logit contributions
phe-0.576
rophe-0.514
ths-0.472
riad-0.465
ient-0.458
Token
Token ID198
Feature activation-2.02e-01

Pos logit contributions
 Probe+0.451
 Nish+0.422
ersen+0.404
 Rik+0.374
 Trafford+0.371
Neg logit contributions
phe-0.619
rophe-0.550
ths-0.507
riad-0.499
ient-0.491
TokenChrist
Token ID10684
Feature activation+9.77e-01

Pos logit contributions
 Probe+0.373
 Nish+0.351
ersen+0.336
 Rik+0.315
 NP+0.313
Neg logit contributions
phe-0.443
rophe-0.393
ths-0.355
riad-0.351
ient-0.345
Tokenie
Token ID494
Feature activation-2.23e-01

Pos logit contributions
phe+2.025
rophe+1.791
ths+1.608
riad+1.588
ient+1.586
Neg logit contributions
 Probe-1.929
 Nish-1.818
ersen-1.708
 Trafford-1.635
 COURT-1.635
Token is
Token ID318
Feature activation-4.06e-01

Pos logit contributions
 Probe+0.425
 Nish+0.401
ersen+0.385
 Rik+0.361
 NP+0.358
Neg logit contributions
phe-0.476
rophe-0.421
ths-0.379
riad-0.376
ient-0.368
Token widely
Token ID6768
Feature activation+4.33e-01

Pos logit contributions
 Probe+0.797
 Nish+0.753
ersen+0.722
 Rik+0.678
 COURT+0.674
Neg logit contributions
phe-0.847
rophe-0.744
ths-0.671
riad-0.662
ient-0.651
Token considered
Token ID3177
Feature activation-7.57e-01

Pos logit contributions
phe+0.911
rophe+0.791
ths+0.725
riad+0.710
ient+0.699
Neg logit contributions
 Probe-0.844
 Nish-0.791
ersen-0.762
 Rik-0.712
 COURT-0.712
Token corner
Token ID5228
Feature activation-1.17e+00

Pos logit contributions
phe+0.477
rophe+0.421
ths+0.379
riad+0.375
ient+0.369
Neg logit contributions
 Probe-0.432
 Nish-0.407
ersen-0.390
 Rik-0.366
 NP-0.364
Token.
Token ID13
Feature activation-2.65e-02

Pos logit contributions
 Probe+2.161
 Nish+2.028
ersen+1.929
 NP+1.831
 Rik+1.806
Neg logit contributions
phe-2.532
rophe-2.249
ths-2.014
riad-2.003
ient-1.967
Token
Token ID198
Feature activation-5.59e-02

Pos logit contributions
 Probe+0.046
 Nish+0.043
ersen+0.042
 Rik+0.038
 COURT+0.038
Neg logit contributions
phe-0.061
rophe-0.054
ths-0.050
riad-0.049
ient-0.048
Token
Token ID198
Feature activation-3.34e-02

Pos logit contributions
 Probe+0.096
 Nish+0.090
ersen+0.086
 Rik+0.080
 Trafford+0.079
Neg logit contributions
phe-0.130
rophe-0.116
ths-0.107
riad-0.105
ient-0.103
TokenYou
Token ID1639
Feature activation+6.94e-02

Pos logit contributions
 Probe+0.063
 Nish+0.059
ersen+0.057
 Rik+0.053
 NP+0.053
Neg logit contributions
phe-0.072
rophe-0.064
ths-0.057
riad-0.057
ient-0.056
Token Want
Token ID16168
Feature activation-5.19e-01

Pos logit contributions
phe+0.139
rophe+0.121
ths+0.109
riad+0.107
ient+0.105
Neg logit contributions
 Probe-0.142
 Nish-0.134
ersen-0.129
 Rik-0.122
 COURT-0.121
Token Some
Token ID2773
Feature activation-1.99e+00

Pos logit contributions
 Probe+0.907
 Nish+0.848
ersen+0.801
 Rik+0.752
 NP+0.748
Neg logit contributions
phe-1.197
rophe-1.069
ths-0.966
riad-0.964
ient-0.948
Token Pictures
Token ID20051
Feature activation-5.29e-01

Pos logit contributions
 Probe+3.371
 Nish+3.121
ersen+2.927
 NP+2.768
 Rik+2.760
Neg logit contributions
phe-4.689
rophe-4.251
riad-3.814
ths-3.779
ient-3.740
Token of
Token ID286
Feature activation-6.47e-02

Pos logit contributions
 Probe+0.840
 Nish+0.781
ersen+0.740
 Rik+0.684
 COURT+0.679
Neg logit contributions
phe-1.302
rophe-1.172
ths-1.071
riad-1.064
ient-1.048
Token That
Token ID1320
Feature activation+1.39e-01

Pos logit contributions
 Probe+0.118
 Nish+0.111
ersen+0.106
 Rik+0.099
 COURT+0.098
Neg logit contributions
phe-0.144
rophe-0.127
ths-0.116
riad-0.114
ient-0.113
Token Corner
Token ID26212
Feature activation-1.98e+00

Pos logit contributions
phe+0.339
rophe+0.303
ths+0.279
riad+0.275
ient+0.271
Neg logit contributions
 Probe-0.223
 Nish-0.209
ersen-0.201
 Rik-0.184
 COURT-0.182
Token enough
Token ID1576
Feature activation+2.56e-01

Pos logit contributions
phe+2.237
rophe+1.970
ths+1.793
riad+1.770
ient+1.741
Neg logit contributions
 Probe-2.031
 Nish-1.901
ersen-1.839
 Rik-1.708
 Trafford-1.707
Token to
Token ID284
Feature activation+1.14e-01

Pos logit contributions
phe+0.560
rophe+0.495
ths+0.452
riad+0.444
ient+0.439
Neg logit contributions
 Probe-0.473
 Nish-0.444
ersen-0.425
 Rik-0.397
 Trafford-0.396
Token eat
Token ID4483
Feature activation+1.59e+00

Pos logit contributions
phe+0.213
rophe+0.184
ths+0.164
riad+0.161
ient+0.158
Neg logit contributions
 Probe-0.247
 Nish-0.234
ersen-0.226
 Rik-0.214
 COURT-0.212
Token at
Token ID379
Feature activation+5.13e-01

Pos logit contributions
phe+3.211
rophe+2.812
ths+2.547
riad+2.492
ient+2.483
Neg logit contributions
 Probe-3.175
 Nish-2.972
ersen-2.877
 Rik-2.693
 Trafford-2.693
Token home
Token ID1363
Feature activation-3.90e-01

Pos logit contributions
phe+1.066
rophe+0.937
ths+0.845
riad+0.833
ient+0.819
Neg logit contributions
 Probe-1.007
 Nish-0.951
ersen-0.914
 Rik-0.858
 COURT-0.852
Token.
Token ID13
Feature activation-5.36e-02

Pos logit contributions
 Probe+0.733
 Nish+0.698
ersen+0.664
 Rik+0.625
 COURT+0.622
Neg logit contributions
phe-0.839
rophe-0.739
ths-0.664
riad-0.659
ient-0.642
Token
Token ID447
Feature activation-1.51e-01

Pos logit contributions
 Probe+0.094
 Nish+0.088
ersen+0.084
 Rik+0.078
 COURT+0.078
Neg logit contributions
phe-0.123
rophe-0.110
ths-0.100
riad-0.099
ient-0.097
Token
Token ID251
Feature activation-2.63e-01

Pos logit contributions
 Probe+0.256
 Nish+0.240
ersen+0.228
 Rik+0.212
 COURT+0.212
Neg logit contributions
phe-0.355
rophe-0.315
ths-0.287
riad-0.286
ient-0.281
Token Later
Token ID11450
Feature activation-1.29e+00

Pos logit contributions
 Probe+0.440
 Nish+0.411
ersen+0.390
 Rik+0.363
 COURT+0.361
Neg logit contributions
phe-0.623
rophe-0.558
ths-0.509
riad-0.504
ient-0.497
Token in
Token ID287
Feature activation-5.30e-01

Pos logit contributions
 Probe+2.305
 Nish+2.159
ersen+2.069
 Rik+1.934
 NP+1.920
Neg logit contributions
phe-2.901
rophe-2.578
riad-2.333
ths-2.291
ient-2.264
Token the
Token ID262
Feature activation-5.44e-02

Pos logit contributions
 Probe+0.982
 Nish+0.925
ersen+0.882
 COURT+0.827
 Rik+0.824
Neg logit contributions
phe-1.161
rophe-1.028
ths-0.925
riad-0.925
ient-0.910
Token
Token ID8582
Feature activation+1.17e+00

Pos logit contributions
phe+1.000
rophe+0.895
ths+0.820
riad+0.811
ient+0.799
Neg logit contributions
 Probe-0.679
 Nish-0.634
ersen-0.604
 Rik-0.559
 COURT-0.554
Token
Token ID229
Feature activation-1.40e-01

Pos logit contributions
phe+2.979
rophe+2.700
ths+2.492
riad+2.471
ient+2.432
Neg logit contributions
 Probe-1.731
 Nish-1.602
ersen-1.510
 Rik-1.389
 Trafford-1.373
Token
Token ID118
Feature activation+2.74e-01

Pos logit contributions
 Probe+0.238
 Nish+0.223
ersen+0.213
 Rik+0.198
 Trafford+0.196
Neg logit contributions
phe-0.327
rophe-0.292
ths-0.266
riad-0.264
ient-0.259
Token
Token ID8582
Feature activation+9.00e-01

Pos logit contributions
phe+0.662
rophe+0.593
ths+0.543
riad+0.538
ient+0.530
Neg logit contributions
 Probe-0.447
 Nish-0.418
ersen-0.397
 Rik-0.367
 COURT-0.365
Token
Token ID229
Feature activation-3.44e-01

Pos logit contributions
phe+2.295
rophe+2.080
ths+1.919
riad+1.906
ient+1.879
Neg logit contributions
 Probe-1.332
 Nish-1.230
ersen-1.160
 Rik-1.067
 COURT-1.057
Token
Token ID116
Feature activation-2.71e-02

Pos logit contributions
 Probe+0.538
 Nish+0.503
ersen+0.478
 Rik+0.440
 Trafford+0.436
Neg logit contributions
phe-0.852
rophe-0.766
ths-0.700
riad-0.696
ient-0.683
Token (@
Token ID4275
Feature activation+4.80e-01

Pos logit contributions
 Probe+0.040
 Nish+0.037
ersen+0.035
 Rik+0.033
 COURT+0.032
Neg logit contributions
phe-0.069
rophe-0.062
ths-0.057
riad-0.057
ient-0.056
Tokenm
Token ID76
Feature activation-3.04e-01

Pos logit contributions
phe+1.025
rophe+0.904
ths+0.818
riad+0.807
ient+0.794
Neg logit contributions
 Probe-0.916
 Nish-0.864
ersen-0.828
 Rik-0.776
 COURT-0.771
Tokenf
Token ID69
Feature activation+5.85e-02

Pos logit contributions
 Probe+0.585
 Nish+0.553
ersen+0.536
 Rik+0.499
 COURT+0.498
Neg logit contributions
phe-0.633
rophe-0.560
ths-0.500
riad-0.495
ient-0.486
Tokenlynn
Token ID12935
Feature activation+5.48e-01

Pos logit contributions
phe+0.129
rophe+0.114
ths+0.104
riad+0.102
ient+0.101
Neg logit contributions
 Probe-0.107
 Nish-0.101
ersen-0.097
 Rik-0.090
 COURT-0.090
TokenJR
Token ID44817
Feature activation+5.74e-02

Pos logit contributions
phe+1.266
rophe+1.128
ths+1.030
riad+1.017
ient+1.001
Neg logit contributions
 Probe-0.952
 Nish-0.893
ersen-0.850
 Rik-0.793
 COURT-0.785
TokenAbout
Token ID8585
Feature activation+1.19e-01

Pos logit contributions
 Probe+0.528
 Nish+0.499
ersen+0.477
 Rik+0.446
 NP+0.444
Neg logit contributions
phe-0.634
rophe-0.562
ths-0.506
riad-0.502
ient-0.493
Token 30
Token ID1542
Feature activation-2.47e-01

Pos logit contributions
phe+0.285
rophe+0.255
ths+0.234
riad+0.231
ient+0.228
Neg logit contributions
 Probe-0.195
 Nish-0.182
ersen-0.174
 Rik-0.160
 COURT-0.159
Token or
Token ID393
Feature activation-2.72e-02

Pos logit contributions
 Probe+0.454
 Nish+0.427
ersen+0.408
 Rik+0.382
 COURT+0.379
Neg logit contributions
phe-0.548
rophe-0.485
ths-0.440
riad-0.435
ient-0.428
Token 40
Token ID2319
Feature activation-1.20e-01

Pos logit contributions
 Probe+0.041
 Nish+0.038
ersen+0.036
 Rik+0.033
 NP+0.033
Neg logit contributions
phe-0.069
rophe-0.062
ths-0.057
riad-0.057
ient-0.056
Token policemen
Token ID33777
Feature activation-1.44e+00

Pos logit contributions
 Probe+0.219
 Nish+0.206
ersen+0.198
 Rik+0.185
 Trafford+0.183
Neg logit contributions
phe-0.264
rophe-0.234
ths-0.213
riad-0.210
ient-0.207
Token took
Token ID1718
Feature activation-1.62e+00

Pos logit contributions
 Probe+2.689
 Nish+2.541
ersen+2.426
 Rik+2.270
 NP+2.262
Neg logit contributions
phe-3.112
rophe-2.777
ths-2.484
riad-2.475
ient-2.407
Token Mr
Token ID1770
Feature activation-1.42e+00

Pos logit contributions
 Probe+2.767
 Nish+2.606
ersen+2.450
 Rik+2.300
 NP+2.286
Neg logit contributions
phe-3.810
rophe-3.399
ths-3.081
riad-3.063
ient-3.015
Token Tom
Token ID4186
Feature activation-4.88e-01

Pos logit contributions
 Probe+1.904
 Nish+1.771
ersen+1.630
 Rik+1.496
 NP+1.482
Neg logit contributions
phe-3.884
rophe-3.498
riad-3.208
ths-3.203
ient-3.160
Tokenar
Token ID283
Feature activation+6.32e-02

Pos logit contributions
 Probe+1.139
 Nish+1.087
ersen+1.046
 Rik+0.996
 COURT+0.992
Neg logit contributions
phe-0.836
rophe-0.714
ths-0.626
riad-0.616
ient-0.603
Token away
Token ID1497
Feature activation+7.94e-01

Pos logit contributions
phe+0.137
rophe+0.121
ths+0.110
riad+0.108
ient+0.107
Neg logit contributions
 Probe-0.118
 Nish-0.110
ersen-0.106
 Rik-0.099
 Trafford-0.099
Token this
Token ID428
Feature activation+1.46e-01

Pos logit contributions
phe+1.639
rophe+1.435
ths+1.303
riad+1.275
ient+1.265
Neg logit contributions
 Probe-1.560
 Nish-1.466
ersen-1.419
 Rik-1.327
 Trafford-1.325
Tokenoke
Token ID2088
Feature activation-5.02e-01

Pos logit contributions
phe+0.400
rophe+0.355
ths+0.324
riad+0.320
ient+0.314
Neg logit contributions
 Probe-0.317
 Nish-0.298
ersen-0.286
 Rik-0.265
 Trafford-0.264
Token on
Token ID319
Feature activation-1.71e-01

Pos logit contributions
 Probe+0.954
 Nish+0.904
ersen+0.863
 Rik+0.813
 Trafford+0.803
Neg logit contributions
phe-1.075
rophe-0.948
ths-0.856
riad-0.846
ient-0.831
Token a
Token ID257
Feature activation-3.96e-01

Pos logit contributions
 Probe+0.335
 Nish+0.316
ersen+0.304
 Rik+0.285
 COURT+0.283
Neg logit contributions
phe-0.356
rophe-0.313
ths-0.282
riad-0.278
ient-0.274
Token stool
Token ID38753
Feature activation-8.72e-01

Pos logit contributions
 Probe+0.731
 Nish+0.687
ersen+0.659
 Rik+0.614
 COURT+0.610
Neg logit contributions
phe-0.869
rophe-0.769
ths-0.699
riad-0.691
ient-0.678
Token,
Token ID11
Feature activation-1.01e-01

Pos logit contributions
 Probe+1.637
 Nish+1.548
ersen+1.480
 Rik+1.394
 COURT+1.380
Neg logit contributions
phe-1.877
rophe-1.660
ths-1.491
riad-1.476
ient-1.456
Token ask
Token ID1265
Feature activation-2.07e+00

Pos logit contributions
 Probe+0.199
 Nish+0.188
ersen+0.180
 Rik+0.169
 COURT+0.168
Neg logit contributions
phe-0.210
rophe-0.184
ths-0.166
riad-0.164
ient-0.161
Token him
Token ID683
Feature activation-2.77e-01

Pos logit contributions
 Probe+3.763
 Nish+3.577
ersen+3.418
 Rik+3.194
 COURT+3.153
Neg logit contributions
phe-4.565
rophe-4.011
riad-3.604
ths-3.603
ient-3.553
Token directions
Token ID11678
Feature activation-5.78e-01

Pos logit contributions
 Probe+0.547
 Nish+0.516
ersen+0.495
 Rik+0.465
 COURT+0.463
Neg logit contributions
phe-0.571
rophe-0.502
ths-0.453
riad-0.446
ient-0.437
Token to
Token ID284
Feature activation-4.02e-01

Pos logit contributions
 Probe+1.042
 Nish+0.987
ersen+0.940
 Rik+0.881
 NP+0.870
Neg logit contributions
phe-1.298
rophe-1.144
ths-1.037
riad-1.029
ient-1.003
Token the
Token ID262
Feature activation-1.11e-01

Pos logit contributions
 Probe+0.789
 Nish+0.747
ersen+0.715
 Rik+0.673
 Trafford+0.669
Neg logit contributions
phe-0.837
rophe-0.736
ths-0.664
riad-0.654
ient-0.643
Token T
Token ID309
Feature activation-1.43e-01

Pos logit contributions
 Probe+0.209
 Nish+0.197
ersen+0.189
 Rik+0.177
 COURT+0.175
Neg logit contributions
phe-0.241
rophe-0.213
ths-0.193
riad-0.190
ient-0.187
Token less
Token ID1342
Feature activation-1.38e-01

Pos logit contributions
 Probe+0.541
 Nish+0.513
ersen+0.492
 Rik+0.463
 NP+0.462
Neg logit contributions
phe-0.561
rophe-0.493
ths-0.442
riad-0.437
ient-0.428
Token experienced
Token ID5924
Feature activation+1.13e+00

Pos logit contributions
 Probe+0.269
 Nish+0.255
ersen+0.244
 Rik+0.229
 NP+0.228
Neg logit contributions
phe-0.290
rophe-0.255
ths-0.230
riad-0.227
ient-0.223
Token teachers
Token ID7799
Feature activation+3.33e-01

Pos logit contributions
phe+2.303
rophe+2.018
ths+1.848
riad+1.802
ient+1.790
Neg logit contributions
 Probe-2.233
 Nish-2.086
ersen-2.022
 COURT-1.886
 Rik-1.882
Token.
Token ID13
Feature activation+1.01e-01

Pos logit contributions
phe+0.707
rophe+0.623
ths+0.563
riad+0.556
ient+0.546
Neg logit contributions
 Probe-0.640
 Nish-0.605
ersen-0.580
 Rik-0.544
 COURT-0.540
Token Their
Token ID5334
Feature activation+2.37e-01

Pos logit contributions
phe+0.230
rophe+0.203
ths+0.187
riad+0.184
ient+0.180
Neg logit contributions
 Probe-0.178
 Nish-0.167
ersen-0.162
 COURT-0.148
 Rik-0.148
Token lifetime
Token ID10869
Feature activation-1.62e+00

Pos logit contributions
phe+0.487
rophe+0.427
ths+0.387
riad+0.380
ient+0.374
Neg logit contributions
 Probe-0.472
 Nish-0.444
ersen-0.427
 Rik-0.401
 COURT-0.399
Token incomes
Token ID18696
Feature activation+1.91e-01

Pos logit contributions
 Probe+3.266
 Nish+3.142
ersen+2.978
 NP+2.870
 Rik+2.833
Neg logit contributions
phe-3.243
rophe-2.869
riad-2.530
ths-2.500
ovie-2.454
Token
Token ID960
Feature activation-7.68e-01

Pos logit contributions
phe+0.404
rophe+0.355
ths+0.321
riad+0.316
ient+0.312
Neg logit contributions
 Probe-0.369
 Nish-0.349
ersen-0.334
 Rik-0.314
 COURT-0.312
Tokensal
Token ID21680
Feature activation+2.46e+00

Pos logit contributions
 Probe+1.572
 Nish+1.500
ersen+1.440
 NP+1.368
 Rik+1.363
Neg logit contributions
phe-1.489
rophe-1.329
ths-1.167
riad-1.151
ient-1.125
Tokenary
Token ID560
Feature activation-1.83e-01

Pos logit contributions
phe+5.462
rophe+4.757
ient+4.434
ths+4.397
riad+4.357
Neg logit contributions
 Probe-4.486
 Nish-4.197
ersen-4.013
 Trafford-3.735
 Rik-3.724
Token plus
Token ID5556
Feature activation+4.17e-01

Pos logit contributions
 Probe+0.339
 Nish+0.319
ersen+0.304
 Rik+0.287
 NP+0.287
Neg logit contributions
phe-0.397
rophe-0.356
riad-0.318
ths-0.316
ient-0.310
Token the
Token ID262
Feature activation-1.60e-01

Pos logit contributions
 Probe+0.835
 Nish+0.791
ersen+0.750
 Rik+0.706
 NP+0.702
Neg logit contributions
phe-1.020
rophe-0.906
ths-0.817
riad-0.808
ient-0.796
Token exception
Token ID6631
Feature activation+6.79e-01

Pos logit contributions
 Probe+0.305
 Nish+0.289
ersen+0.275
 Rik+0.259
 NP+0.257
Neg logit contributions
phe-0.344
rophe-0.304
ths-0.274
riad-0.271
ient-0.267
Token of
Token ID286
Feature activation+3.02e-01

Pos logit contributions
phe+1.624
rophe+1.454
ths+1.329
riad+1.318
ient+1.298
Neg logit contributions
 Probe-1.123
 Nish-1.053
ersen-1.002
 Rik-0.929
 COURT-0.920
Token anti
Token ID3098
Feature activation-1.49e+00

Pos logit contributions
phe+0.672
rophe+0.596
ths+0.541
riad+0.534
ient+0.526
Neg logit contributions
 Probe-0.551
 Nish-0.519
ersen-0.496
 Rik-0.464
 COURT-0.460
Token-
Token ID12
Feature activation-6.98e-01

Pos logit contributions
 Probe+2.243
 Nish+2.108
ersen+1.978
 Rik+1.834
 NP+1.819
Neg logit contributions
phe-3.723
rophe-3.341
ths-3.050
riad-3.049
ient-3.004
Tokentank
Token ID28451
Feature activation-1.66e+00

Pos logit contributions
 Probe+1.193
 Nish+1.126
ersen+1.073
 Rik+0.996
 NP+0.992
Neg logit contributions
phe-1.618
rophe-1.445
ths-1.318
riad-1.296
ient-1.278
Token guns
Token ID6541
Feature activation-6.47e-03

Pos logit contributions
 Probe+3.331
 Nish+3.186
ersen+3.008
 NP+2.880
 Rik+2.863
Neg logit contributions
phe-3.377
rophe-2.938
ths-2.595
riad-2.578
ient-2.531
Token.[
Token ID3693
Feature activation+5.47e-01

Pos logit contributions
 Probe+0.012
 Nish+0.012
ersen+0.011
 Rik+0.011
 NP+0.011
Neg logit contributions
phe-0.014
rophe-0.012
ths-0.011
riad-0.011
ient-0.011
Token19
Token ID1129
Feature activation-5.20e-01

Pos logit contributions
phe+1.496
rophe+1.356
ths+1.259
riad+1.245
ient+1.230
Neg logit contributions
 Probe-0.719
 Nish-0.658
ersen-0.617
 Rik-0.560
 COURT-0.551
Token]
Token ID60
Feature activation+4.56e-01

Pos logit contributions
 Probe+1.010
 Nish+0.953
ersen+0.914
 Rik+0.862
 NP+0.861
Neg logit contributions
phe-1.090
rophe-0.960
ths-0.862
riad-0.861
ient-0.839
Token Bob
Token ID5811
Feature activation+5.25e-01

Pos logit contributions
phe+1.048
rophe+0.926
ths+0.858
riad+0.841
ient+0.824
Neg logit contributions
 Probe-0.781
 Nish-0.730
ersen-0.712
 Rik-0.649
 COURT-0.648
Token not
Token ID407
Feature activation+1.51e-01

Pos logit contributions
 Probe+0.061
 Nish+0.058
ersen+0.056
 NP+0.052
 Rik+0.052
Neg logit contributions
phe-0.063
rophe-0.055
ths-0.049
riad-0.048
ient-0.048
Token match
Token ID2872
Feature activation+1.73e-01

Pos logit contributions
phe+0.323
rophe+0.285
ths+0.257
riad+0.254
ient+0.250
Neg logit contributions
 Probe-0.288
 Nish-0.271
ersen-0.260
 Rik-0.244
 COURT-0.242
Token their
Token ID511
Feature activation-8.85e-02

Pos logit contributions
phe+0.345
rophe+0.302
ths+0.269
riad+0.265
ient+0.261
Neg logit contributions
 Probe-0.354
 Nish-0.336
ersen-0.322
 Rik-0.305
 NP-0.303
Token body
Token ID1767
Feature activation-7.21e-01

Pos logit contributions
 Probe+0.167
 Nish+0.157
ersen+0.151
 Rik+0.141
 COURT+0.140
Neg logit contributions
phe-0.191
rophe-0.169
ths-0.153
riad-0.151
ient-0.148
Token's
Token ID338
Feature activation+3.30e-01

Pos logit contributions
 Probe+1.319
 Nish+1.253
ersen+1.191
 Rik+1.125
 NP+1.123
Neg logit contributions
phe-1.589
rophe-1.410
ths-1.259
riad-1.249
ient-1.226
Token biological
Token ID10685
Feature activation-1.89e+00

Pos logit contributions
phe+0.711
rophe+0.628
ths+0.569
riad+0.562
ient+0.552
Neg logit contributions
 Probe-0.622
 Nish-0.585
ersen-0.562
 Rik-0.525
 COURT-0.522
Token sex
Token ID1714
Feature activation-3.14e-02

Pos logit contributions
 Probe+3.435
 Nish+3.318
ersen+3.088
 NP+3.003
 Rik+2.990
Neg logit contributions
phe-4.085
rophe-3.648
ovie-3.213
ths-3.186
riad-3.138
Token,
Token ID11
Feature activation+2.42e-02

Pos logit contributions
 Probe+0.057
 Nish+0.054
ersen+0.052
 Rik+0.049
 NP+0.049
Neg logit contributions
phe-0.069
rophe-0.062
ths-0.055
riad-0.055
ient-0.054
Token they
Token ID484
Feature activation+4.86e-01

Pos logit contributions
phe+0.050
rophe+0.044
ths+0.039
riad+0.039
ient+0.039
Neg logit contributions
 Probe-0.048
 Nish-0.045
ersen-0.043
 Rik-0.041
 NP-0.040
Token wish
Token ID4601
Feature activation+2.01e-01

Pos logit contributions
phe+1.042
rophe+0.919
ths+0.839
riad+0.820
ient+0.808
Neg logit contributions
 Probe-0.924
 Nish-0.870
ersen-0.833
 Rik-0.782
 Trafford-0.779
Token to
Token ID284
Feature activation-4.04e-01

Pos logit contributions
phe+0.470
rophe+0.419
ths+0.379
riad+0.377
ient+0.370
Neg logit contributions
 Probe-0.344
 Nish-0.325
ersen-0.310
 Rik-0.289
 NP-0.287