TOP ACTIVATIONS
MAX = 7.079

indeed , part of her cac het is that her poems
governor . Advertisement Continue reading the main story
easy bank loans and the cac het of having planes land
rarely . Advertisement Continue reading the main story
Advertisement Continue reading the main story
mg / kg ), a selective serotonin re upt ake inhibitor
Advertisement Continue reading the main story
reported . Advertisement Continue reading the main story
. Available at http :// public . health . ore gon
Newsletter Sign Up Continue reading the main story Please
himself as ' Gu cc ifer 2 . 0 ,' who
was about homosexuality Reviewer : k imp unk rock
Newsletter Sign Up Continue reading the main story Please
that the DPRK has a multip arty system once again :
July . Advertisement Continue reading the main story
Russian Mars probe , Ph obos - Gr unt , last
Journal of Research on Ad olesc ence , 8 , 333
Newsletter Sign Up Continue reading the main story Please
Newsletter Sign Up Continue reading the main story Please
Journal of Youth and Ad olesc ence , 39 , 10

MIN ACTIVATIONS
MIN = -6.209

. area , north of Gerrard St . E . and
fil lets are the same exact size .
<|endoftext|> E . S . Glover 's bird sey e map
on the data with acceptable forecasting errors to give the confidence
of the board to the exact previous state . If we
happen with solid - organ transpl ants , where a patient
real estate magn ate turned philanthrop ist Herbert Hale Williams ,
ement d ans c es dro its en rest re ignant
st aking out is the exact subject which most of the
Vil as ra o Des hm uk h for allot ting
of the mission means the exact date and method of delivery
bees , which she is allergic to . She doesn 't
1994 Jerry Se infeld 1995 Kel sey Gram mer
: E . S . Glover D wn ld : Full
1992 Jerry Se infeld 1993 Ted D anson
provides an unbiased and efficient estim ator of the round -
bad debts . Standing Lord Gan esh and Vish nu D
1993 Jerry Se infeld 1993 Garry Sh
1995 Jerry Se infeld 1995 Garry Sh
. Rel ieved , Par amount jumped at the chance to

INTERVAL 3.757 - 7.079
CONTAINS 0.147%

to ber ating herself for letting this happen , and it
club . Story continues below advertisement He
. Not everyone has the opportunity to spend time by or
" Some satellites take decades to come back after finishing
234 . Google Scholar Crossref | Medline | ISI

INTERVAL 0.435 - 3.757
CONTAINS 31.520%

into the # met oo thing she just writes very
back to haunt you if you don t protect
future economic outlook , according to the venue
case . The smart money would still say that the most
had to take a guess as to what the lasting appeal

INTERVAL -2.887 - 0.435
CONTAINS 67.453%

type of practical effects we do in Horror Nights all the
during Washington s practice May 25 . ( John
s win over Wer rib ee on Saturday .
of game you would point to when someone sne ers and
Mem c py Reading 55 80 953 16 27 145 Flat

INTERVAL -6.209 - -2.887
CONTAINS 0.879%

the high rate of such digestive cancers are due to the
This is his true core compet ency . He is the
price is not the most indicative of underlying trends , which
ab ut protected areas set aside for the region 's extraordinary
the top three queries so far have been for reservations ,
Token indeed
Token ID5600
Feature activation-3.88e-01

Pos logit contributions
ranch+0.040
 Kut+0.034
 Scot+0.034
 Property+0.034
Men+0.033
Neg logit contributions
hift-0.052
Reviewer-0.051
eeper-0.048
REDACTED-0.047
 spiked-0.045
Token,
Token ID11
Feature activation+6.68e-02

Pos logit contributions
ranch+0.254
 Kut+0.214
 Property+0.213
 Scot+0.212
Men+0.211
Neg logit contributions
hift-0.309
Reviewer-0.306
eeper-0.286
REDACTED-0.281
 spiked-0.268
Token part
Token ID636
Feature activation-2.51e+00

Pos logit contributions
hift+0.055
Reviewer+0.055
eeper+0.051
REDACTED+0.051
 certific+0.048
Neg logit contributions
ranch-0.041
 Kut-0.035
 Scot-0.035
 Property-0.035
Men-0.034
Token of
Token ID286
Feature activation-1.47e+00

Pos logit contributions
ranch+1.669
 Kut+1.401
 Property+1.399
 Scot+1.389
Men+1.382
Neg logit contributions
hift-1.961
Reviewer-1.956
eeper-1.810
REDACTED-1.796
 spiked-1.719
Token her
Token ID607
Feature activation-1.03e+00

Pos logit contributions
ranch+0.843
 Scot+0.717
 Kut+0.713
 Property+0.709
Men+0.689
Neg logit contributions
hift-1.266
Reviewer-1.265
eeper-1.185
REDACTED-1.170
 certific-1.109
Token cac
Token ID40428
Feature activation+7.08e+00

Pos logit contributions
ranch+0.679
 Kut+0.582
 Scot+0.581
 Property+0.577
Men+0.567
Neg logit contributions
Reviewer-0.803
hift-0.799
eeper-0.744
REDACTED-0.742
anamo-0.701
Tokenhet
Token ID3202
Feature activation+1.66e+00

Pos logit contributions
hift+6.835
Reviewer+6.774
eeper+6.485
REDACTED+6.328
 Paste+6.185
Neg logit contributions
ranch-3.229
 Kut-2.542
 Scot-2.476
 Property-2.430
Men-2.412
Token is
Token ID318
Feature activation+6.22e-02

Pos logit contributions
Reviewer+1.236
hift+1.230
eeper+1.141
REDACTED+1.115
 certific+1.055
Neg logit contributions
ranch-1.180
 Kut-1.008
 Property-0.998
 Scot-0.996
Men-0.995
Token that
Token ID326
Feature activation+1.49e-01

Pos logit contributions
Reviewer+0.047
hift+0.047
eeper+0.044
REDACTED+0.042
 DOI+0.040
Neg logit contributions
ranch-0.043
 Kut-0.037
 Scot-0.037
 Property-0.037
Men-0.036
Token her
Token ID607
Feature activation-8.11e-01

Pos logit contributions
hift+0.094
Reviewer+0.094
eeper+0.086
REDACTED+0.084
 spiked+0.079
Neg logit contributions
ranch-0.121
 Kut-0.106
 Scot-0.106
 Property-0.106
Men-0.105
Token poems
Token ID31888
Feature activation+9.13e-01

Pos logit contributions
ranch+0.567
 Scot+0.495
 Kut+0.493
 Property+0.490
Men+0.479
Neg logit contributions
Reviewer-0.604
hift-0.603
eeper-0.558
REDACTED-0.550
anamo-0.523
Token governor
Token ID8153
Feature activation-8.49e-01

Pos logit contributions
hift+0.328
Reviewer+0.327
eeper+0.306
REDACTED+0.304
anamo+0.285
Neg logit contributions
ranch-0.241
 Scot-0.200
 Kut-0.199
 Property-0.198
Men-0.195
Token.
Token ID13
Feature activation-1.91e-01

Pos logit contributions
ranch+0.567
 Scot+0.485
 Kut+0.481
Men+0.478
 Property+0.478
Neg logit contributions
hift-0.650
Reviewer-0.648
eeper-0.607
REDACTED-0.604
anamo-0.565
Token
Token ID198
Feature activation-1.46e+00

Pos logit contributions
ranch+0.105
 Property+0.087
 Kut+0.086
 Scot+0.085
Men+0.084
Neg logit contributions
hift-0.173
Reviewer-0.172
eeper-0.163
REDACTED-0.160
 certific-0.154
Token
Token ID198
Feature activation-6.59e-01

Pos logit contributions
ranch+0.698
Men+0.551
 Property+0.551
 Kut+0.542
 Scot+0.541
Neg logit contributions
hift-1.438
Reviewer-1.429
eeper-1.369
 certific-1.337
REDACTED-1.314
TokenAdvertisement
Token ID4723
Feature activation-1.02e+00

Pos logit contributions
ranch+0.429
Men+0.365
 Property+0.355
 Kut+0.354
 Scot+0.350
Neg logit contributions
hift-0.533
Reviewer-0.519
eeper-0.498
REDACTED-0.480
 spiked-0.474
Token Continue
Token ID10054
Feature activation+7.08e+00

Pos logit contributions
ranch+0.933
 Property+0.841
 Scot+0.819
Men+0.816
 Rivals+0.802
Neg logit contributions
Reviewer-0.550
hift-0.545
eeper-0.491
REDACTED-0.479
 certific-0.462
Token reading
Token ID3555
Feature activation+1.07e+00

Pos logit contributions
hift+4.203
Reviewer+4.154
eeper+3.798
REDACTED+3.756
 Paste+3.544
Neg logit contributions
ranch-6.021
 Kut-5.167
Men-5.126
 Property-5.122
 Scot-5.080
Token the
Token ID262
Feature activation+9.91e-01

Pos logit contributions
hift+0.945
Reviewer+0.942
eeper+0.881
REDACTED+0.865
 certific+0.836
Neg logit contributions
ranch-0.619
 Property-0.510
 Kut-0.508
 Scot-0.503
Men-0.501
Token main
Token ID1388
Feature activation+1.62e+00

Pos logit contributions
Reviewer+0.697
hift+0.694
REDACTED+0.631
eeper+0.627
 certific+0.595
Neg logit contributions
ranch-0.756
 Property-0.655
Men-0.646
 Scot-0.646
 Kut-0.642
Token story
Token ID1621
Feature activation+1.22e+00

Pos logit contributions
Reviewer+1.091
hift+1.082
eeper+0.986
REDACTED+0.969
 certific+0.915
Neg logit contributions
ranch-1.280
 Property-1.115
 Scot-1.111
Men-1.109
 Kut-1.103
Token
Token ID198
Feature activation+1.27e+00

Pos logit contributions
hift+1.161
Reviewer+1.154
eeper+1.099
REDACTED+1.067
 certific+1.050
Neg logit contributions
ranch-0.620
 Property-0.502
Men-0.495
 Scot-0.494
 Kut-0.492
Token easy
Token ID2562
Feature activation-4.57e-01

Pos logit contributions
ranch+0.120
 Kut+0.101
 Property+0.100
 Scot+0.100
Men+0.097
Neg logit contributions
Reviewer-0.162
hift-0.162
eeper-0.151
REDACTED-0.150
 spiked-0.141
Token bank
Token ID3331
Feature activation-3.33e+00

Pos logit contributions
ranch+0.350
 Kut+0.298
 Property+0.297
 Scot+0.297
Men+0.295
Neg logit contributions
hift-0.313
Reviewer-0.312
eeper-0.291
REDACTED-0.287
 Paste-0.268
Token loans
Token ID10021
Feature activation-3.16e+00

Pos logit contributions
ranch+2.460
 Kut+2.068
 Scot+2.055
 Property+2.043
 Wong+2.024
Neg logit contributions
Reviewer-2.395
hift-2.383
REDACTED-2.218
eeper-2.167
 spiked-2.045
Token and
Token ID290
Feature activation-6.73e-01

Pos logit contributions
ranch+1.949
 Kut+1.631
 Property+1.623
 Scot+1.600
Men+1.600
Neg logit contributions
Reviewer-2.591
hift-2.564
eeper-2.425
REDACTED-2.418
anamo-2.294
Token the
Token ID262
Feature activation-5.72e-01

Pos logit contributions
ranch+0.461
 Kut+0.392
 Property+0.390
 Scot+0.389
 Wong+0.378
Neg logit contributions
Reviewer-0.520
hift-0.513
eeper-0.478
REDACTED-0.476
anamo-0.444
Token cac
Token ID40428
Feature activation+6.97e+00

Pos logit contributions
ranch+0.400
 Property+0.345
 Kut+0.343
 Scot+0.339
 Wong+0.333
Neg logit contributions
Reviewer-0.437
hift-0.429
eeper-0.401
REDACTED-0.395
anamo-0.370
Tokenhet
Token ID3202
Feature activation+1.48e+00

Pos logit contributions
hift+6.418
Reviewer+6.333
eeper+6.015
REDACTED+5.897
 Paste+5.724
Neg logit contributions
ranch-3.578
 Kut-2.913
 Scot-2.870
Men-2.787
 Property-2.780
Token of
Token ID286
Feature activation-2.55e-02

Pos logit contributions
hift+1.102
Reviewer+1.094
eeper+1.013
REDACTED+0.996
 spiked+0.955
Neg logit contributions
ranch-1.044
 Kut-0.877
 Property-0.877
Men-0.871
 Scot-0.870
Token having
Token ID1719
Feature activation-8.32e-01

Pos logit contributions
ranch+0.016
 Kut+0.013
 Property+0.013
 Scot+0.013
Men+0.013
Neg logit contributions
Reviewer-0.021
hift-0.021
eeper-0.020
REDACTED-0.020
 spiked-0.019
Token planes
Token ID13016
Feature activation+1.62e+00

Pos logit contributions
ranch+0.569
 Kut+0.487
 Scot+0.482
 Property+0.480
Men+0.474
Neg logit contributions
Reviewer-0.638
hift-0.636
eeper-0.591
REDACTED-0.583
 Insight-0.544
Token land
Token ID1956
Feature activation-1.13e-01

Pos logit contributions
Reviewer+1.164
hift+1.136
eeper+1.061
REDACTED+1.055
 Insight+0.972
Neg logit contributions
ranch-1.208
 Scot-1.038
 Kut-1.033
 Property-1.028
Men-1.019
Token rarely
Token ID8365
Feature activation+2.66e-01

Pos logit contributions
hift+1.608
Reviewer+1.607
eeper+1.485
REDACTED+1.463
 spiked+1.383
Neg logit contributions
ranch-1.510
 Kut-1.289
 Property-1.283
 Scot-1.282
Men-1.271
Token.
Token ID13
Feature activation-2.94e-01

Pos logit contributions
Reviewer+0.203
hift+0.202
eeper+0.187
REDACTED+0.184
 Paste+0.175
Neg logit contributions
ranch-0.183
 Kut-0.156
 Scot-0.156
 Property-0.156
Men-0.155
Token
Token ID198
Feature activation-1.29e+00

Pos logit contributions
ranch+0.166
 Property+0.136
 Kut+0.135
 Scot+0.134
Men+0.133
Neg logit contributions
hift-0.261
Reviewer-0.259
eeper-0.245
REDACTED-0.241
 spiked-0.232
Token
Token ID198
Feature activation-4.23e-01

Pos logit contributions
ranch+0.622
Men+0.491
 Property+0.490
 Kut+0.484
 Scot+0.482
Neg logit contributions
hift-1.259
Reviewer-1.254
eeper-1.198
 certific-1.163
REDACTED-1.152
TokenAdvertisement
Token ID4723
Feature activation-9.95e-01

Pos logit contributions
ranch+0.267
Men+0.225
 Kut+0.221
 Property+0.219
 Scot+0.218
Neg logit contributions
hift-0.350
Reviewer-0.342
eeper-0.326
REDACTED-0.316
 spiked-0.311
Token Continue
Token ID10054
Feature activation+6.95e+00

Pos logit contributions
ranch+0.918
 Property+0.825
 Scot+0.811
Men+0.802
 Kut+0.796
Neg logit contributions
Reviewer-0.534
hift-0.529
eeper-0.474
REDACTED-0.463
 certific-0.442
Token reading
Token ID3555
Feature activation+9.25e-01

Pos logit contributions
hift+4.135
Reviewer+4.087
eeper+3.744
REDACTED+3.699
 Paste+3.496
Neg logit contributions
ranch-5.908
 Kut-5.062
Men-5.031
 Property-5.020
 Scot-4.976
Token the
Token ID262
Feature activation+1.10e+00

Pos logit contributions
hift+0.818
Reviewer+0.814
eeper+0.763
REDACTED+0.747
 certific+0.723
Neg logit contributions
ranch-0.528
 Property-0.436
 Kut-0.434
 Scot-0.431
Men-0.428
Token main
Token ID1388
Feature activation+1.35e+00

Pos logit contributions
Reviewer+0.778
hift+0.775
REDACTED+0.702
eeper+0.701
anamo+0.662
Neg logit contributions
ranch-0.841
 Property-0.730
 Scot-0.720
Men-0.719
 Kut-0.715
Token story
Token ID1621
Feature activation+9.79e-01

Pos logit contributions
Reviewer+0.913
hift+0.906
eeper+0.825
REDACTED+0.807
anamo+0.765
Neg logit contributions
ranch-1.073
 Property-0.935
 Scot-0.933
Men-0.929
 Kut-0.929
Token
Token ID198
Feature activation+1.25e+00

Pos logit contributions
hift+0.936
Reviewer+0.930
eeper+0.887
REDACTED+0.858
 certific+0.849
Neg logit contributions
ranch-0.491
 Property-0.398
Men-0.393
 Scot-0.391
 Kut-0.389
Token
Token ID447
Feature activation-2.66e-01

Pos logit contributions
ranch+0.368
 Kut+0.298
 Property+0.298
Men+0.294
 Scot+0.293
Neg logit contributions
hift-0.582
Reviewer-0.578
eeper-0.545
REDACTED-0.537
 spiked-0.517
Token
Token ID251
Feature activation+9.63e-01

Pos logit contributions
ranch+0.156
 Property+0.130
 Kut+0.128
Men+0.127
 Scot+0.127
Neg logit contributions
hift-0.229
Reviewer-0.228
eeper-0.213
REDACTED-0.211
 certific-0.204
Token
Token ID198
Feature activation-6.69e-01

Pos logit contributions
hift+0.856
Reviewer+0.850
eeper+0.800
REDACTED+0.788
 spiked+0.758
Neg logit contributions
ranch-0.545
 Kut-0.441
 Property-0.440
Men-0.435
 Scot-0.434
Token
Token ID198
Feature activation+1.31e-01

Pos logit contributions
ranch+0.328
 Property+0.259
Men+0.258
 Kut+0.256
 Scot+0.255
Neg logit contributions
hift-0.648
Reviewer-0.644
eeper-0.615
 certific-0.596
REDACTED-0.594
TokenAdvertisement
Token ID4723
Feature activation-8.26e-01

Pos logit contributions
hift+0.106
Reviewer+0.104
eeper+0.098
REDACTED+0.096
 spiked+0.093
Neg logit contributions
ranch-0.086
Men-0.072
 Kut-0.071
 Property-0.071
 Scot-0.070
Token Continue
Token ID10054
Feature activation+6.85e+00

Pos logit contributions
ranch+0.748
 Property+0.690
 Scot+0.671
Men+0.664
 Kut+0.660
Neg logit contributions
Reviewer-0.459
hift-0.442
eeper-0.404
REDACTED-0.394
 certific-0.379
Token reading
Token ID3555
Feature activation+8.38e-01

Pos logit contributions
hift+4.045
Reviewer+3.993
eeper+3.668
REDACTED+3.621
 spiked+3.443
Neg logit contributions
ranch-5.844
 Kut-5.012
Men-4.987
 Property-4.981
 Scot-4.928
Token the
Token ID262
Feature activation+1.02e+00

Pos logit contributions
hift+0.734
Reviewer+0.731
eeper+0.685
REDACTED+0.670
 certific+0.651
Neg logit contributions
ranch-0.485
 Kut-0.400
 Property-0.400
 Scot-0.396
Men-0.394
Token main
Token ID1388
Feature activation+8.23e-01

Pos logit contributions
Reviewer+0.710
hift+0.706
eeper+0.640
REDACTED+0.636
 certific+0.603
Neg logit contributions
ranch-0.780
 Property-0.675
 Scot-0.673
 Kut-0.671
Men-0.670
Token story
Token ID1621
Feature activation+1.16e+00

Pos logit contributions
Reviewer+0.550
hift+0.543
eeper+0.495
REDACTED+0.481
 certific+0.458
Neg logit contributions
ranch-0.655
 Scot-0.572
 Property-0.571
Men-0.570
 Kut-0.568
Token
Token ID198
Feature activation+1.21e+00

Pos logit contributions
hift+1.096
Reviewer+1.091
eeper+1.036
REDACTED+1.006
 certific+0.990
Neg logit contributions
ranch-0.592
 Property-0.480
Men-0.473
 Scot-0.471
 Kut-0.471
Tokenmg
Token ID11296
Feature activation+4.58e-01

Pos logit contributions
hift+1.070
Reviewer+1.066
eeper+1.000
REDACTED+0.985
 spiked+0.945
Neg logit contributions
ranch-0.691
 Kut-0.559
 Property-0.557
Men-0.553
 Scot-0.551
Token/
Token ID14
Feature activation+2.45e+00

Pos logit contributions
hift+0.315
Reviewer+0.315
eeper+0.289
REDACTED+0.284
 spiked+0.268
Neg logit contributions
ranch-0.349
 Kut-0.299
 Property-0.299
 Scot-0.297
Men-0.297
Tokenkg
Token ID10025
Feature activation+1.67e+00

Pos logit contributions
hift+2.224
Reviewer+2.192
eeper+2.073
REDACTED+2.059
 spiked+1.971
Neg logit contributions
ranch-1.320
 Property-1.059
 Kut-1.056
 Scot-1.044
Men-1.021
Token),
Token ID828
Feature activation+1.33e+00

Pos logit contributions
hift+0.991
Reviewer+0.985
eeper+0.895
REDACTED+0.875
 spiked+0.823
Neg logit contributions
ranch-1.433
 Kut-1.249
 Property-1.247
Men-1.240
 Scot-1.239
Token a
Token ID257
Feature activation+1.22e+00

Pos logit contributions
Reviewer+0.985
hift+0.968
eeper+0.894
REDACTED+0.874
anamo+0.827
Neg logit contributions
ranch-0.960
 Scot-0.821
 Kut-0.816
Men-0.815
 Property-0.813
Token selective
Token ID21792
Feature activation+6.67e+00

Pos logit contributions
hift+0.895
Reviewer+0.894
eeper+0.823
REDACTED+0.809
anamo+0.755
Neg logit contributions
ranch-0.888
 Property-0.753
 Scot-0.751
 Kut-0.746
Men-0.740
Token serotonin
Token ID32236
Feature activation+1.76e+00

Pos logit contributions
hift+4.629
Reviewer+4.579
eeper+4.240
REDACTED+4.157
 spiked+4.034
Neg logit contributions
ranch-4.985
 Kut-4.257
Men-4.235
 Property-4.218
 Scot-4.170
Token re
Token ID302
Feature activation+2.91e+00

Pos logit contributions
Reviewer+1.363
hift+1.357
eeper+1.261
REDACTED+1.248
anamo+1.182
Neg logit contributions
ranch-1.223
 Property-1.007
 Scot-1.005
 Kut-0.999
Men-0.974
Tokenupt
Token ID37623
Feature activation+3.41e-01

Pos logit contributions
hift+1.858
Reviewer+1.844
eeper+1.685
REDACTED+1.663
 spiked+1.566
Neg logit contributions
ranch-2.385
 Property-2.061
 Scot-2.047
 Kut-2.042
Men-2.040
Tokenake
Token ID539
Feature activation+3.34e+00

Pos logit contributions
Reviewer+0.146
hift+0.139
REDACTED+0.122
eeper+0.117
 certific+0.109
Neg logit contributions
ranch-0.358
 Property-0.316
Men-0.314
 Kut-0.314
 Scot-0.313
Token inhibitor
Token ID41704
Feature activation+1.90e+00

Pos logit contributions
hift+2.240
Reviewer+2.234
eeper+2.056
REDACTED+2.019
 spiked+1.903
Neg logit contributions
ranch-2.614
 Property-2.241
 Kut-2.239
Men-2.221
 Scot-2.221
Token
Token ID447
Feature activation-3.02e-02

Pos logit contributions
hift+0.309
Reviewer+0.307
eeper+0.282
REDACTED+0.277
 spiked+0.264
Neg logit contributions
ranch-0.360
 Kut-0.309
 Property-0.308
Men-0.306
 Scot-0.306
Token
Token ID251
Feature activation+2.81e-01

Pos logit contributions
ranch+0.019
 Kut+0.016
 Property+0.016
Men+0.016
 Scot+0.016
Neg logit contributions
hift-0.024
Reviewer-0.024
eeper-0.023
REDACTED-0.022
 spiked-0.021
Token
Token ID198
Feature activation-9.56e-01

Pos logit contributions
hift+0.250
Reviewer+0.248
eeper+0.234
REDACTED+0.231
 certific+0.222
Neg logit contributions
ranch-0.158
 Kut-0.129
 Property-0.129
Men-0.127
 Scot-0.127
Token
Token ID198
Feature activation-7.98e-02

Pos logit contributions
ranch+0.463
 Property+0.366
Men+0.365
 Kut+0.361
 Scot+0.360
Neg logit contributions
hift-0.933
Reviewer-0.928
eeper-0.886
 certific-0.860
REDACTED-0.853
TokenAdvertisement
Token ID4723
Feature activation-1.08e+00

Pos logit contributions
ranch+0.051
Men+0.043
 Kut+0.043
 Property+0.042
 Scot+0.042
Neg logit contributions
hift-0.065
Reviewer-0.064
eeper-0.061
REDACTED-0.059
 spiked-0.058
Token Continue
Token ID10054
Feature activation+6.63e+00

Pos logit contributions
ranch+1.012
 Property+0.907
 Scot+0.893
Men+0.887
 Kut+0.884
Neg logit contributions
Reviewer-0.557
hift-0.557
eeper-0.495
REDACTED-0.484
 certific-0.458
Token reading
Token ID3555
Feature activation+5.53e-01

Pos logit contributions
hift+4.012
Reviewer+3.972
eeper+3.637
REDACTED+3.592
 spiked+3.399
Neg logit contributions
ranch-5.572
 Kut-4.784
 Property-4.755
Men-4.753
 Scot-4.711
Token the
Token ID262
Feature activation+7.93e-01

Pos logit contributions
hift+0.496
Reviewer+0.494
eeper+0.462
REDACTED+0.453
anamo+0.440
Neg logit contributions
ranch-0.308
 Property-0.255
 Kut-0.253
 Scot-0.252
Men-0.250
Token main
Token ID1388
Feature activation+9.10e-01

Pos logit contributions
Reviewer+0.564
hift+0.563
REDACTED+0.508
eeper+0.508
 certific+0.481
Neg logit contributions
ranch-0.599
 Property-0.519
 Scot-0.512
Men-0.512
 Kut-0.509
Token story
Token ID1621
Feature activation+6.69e-01

Pos logit contributions
Reviewer+0.627
hift+0.620
eeper+0.565
REDACTED+0.553
anamo+0.526
Neg logit contributions
ranch-0.709
 Property-0.617
 Scot-0.616
Men-0.614
 Kut-0.612
Token
Token ID198
Feature activation+1.27e+00

Pos logit contributions
hift+0.643
Reviewer+0.639
eeper+0.610
REDACTED+0.589
 certific+0.583
Neg logit contributions
ranch-0.332
 Property-0.270
 Scot-0.266
Men-0.266
 Kut-0.264
Token reported
Token ID2098
Feature activation+5.97e-01

Pos logit contributions
Reviewer+0.433
hift+0.433
eeper+0.397
REDACTED+0.394
 Paste+0.370
Neg logit contributions
ranch-0.457
 Kut-0.394
 Property-0.393
 Scot-0.393
Men-0.389
Token.
Token ID13
Feature activation-4.15e-01

Pos logit contributions
hift+0.389
Reviewer+0.389
eeper+0.355
REDACTED+0.348
 spiked+0.334
Neg logit contributions
ranch-0.476
 Property-0.408
 Kut-0.406
 Scot-0.403
Men-0.401
Token
Token ID198
Feature activation-1.41e+00

Pos logit contributions
ranch+0.244
 Kut+0.196
 Scot+0.194
 Property+0.194
Men+0.193
Neg logit contributions
hift-0.357
Reviewer-0.355
eeper-0.332
REDACTED-0.328
 spiked-0.316
Token
Token ID198
Feature activation-6.52e-01

Pos logit contributions
ranch+0.680
Men+0.538
 Property+0.538
 Kut+0.530
 Scot+0.528
Neg logit contributions
hift-1.382
Reviewer-1.373
eeper-1.315
 certific-1.283
REDACTED-1.262
TokenAdvertisement
Token ID4723
Feature activation-9.00e-01

Pos logit contributions
ranch+0.415
Men+0.351
 Kut+0.341
 Property+0.341
 Scot+0.335
Neg logit contributions
hift-0.536
Reviewer-0.526
eeper-0.501
REDACTED-0.484
 spiked-0.477
Token Continue
Token ID10054
Feature activation+6.56e+00

Pos logit contributions
ranch+0.832
 Property+0.748
 Scot+0.733
Men+0.732
 Rivals+0.721
Neg logit contributions
Reviewer-0.472
hift-0.470
eeper-0.420
REDACTED-0.415
 certific-0.401
Token reading
Token ID3555
Feature activation+7.05e-01

Pos logit contributions
hift+3.961
Reviewer+3.900
eeper+3.571
REDACTED+3.539
 spiked+3.349
Neg logit contributions
ranch-5.541
 Kut-4.758
 Property-4.730
Men-4.718
 Scot-4.685
Token the
Token ID262
Feature activation+8.12e-01

Pos logit contributions
hift+0.620
Reviewer+0.619
eeper+0.580
REDACTED+0.566
 certific+0.550
Neg logit contributions
ranch-0.404
 Kut-0.334
 Property-0.334
 Scot-0.329
Men-0.329
Token main
Token ID1388
Feature activation+1.35e+00

Pos logit contributions
Reviewer+0.563
hift+0.560
eeper+0.508
REDACTED+0.504
 certific+0.479
Neg logit contributions
ranch-0.624
 Property-0.542
Men-0.536
 Scot-0.534
 Kut-0.534
Token story
Token ID1621
Feature activation+1.02e+00

Pos logit contributions
Reviewer+0.917
hift+0.911
eeper+0.830
REDACTED+0.808
 certific+0.768
Neg logit contributions
ranch-1.061
 Property-0.927
Men-0.921
 Kut-0.921
 Scot-0.921
Token
Token ID198
Feature activation+1.09e+00

Pos logit contributions
hift+0.968
Reviewer+0.963
eeper+0.916
REDACTED+0.888
 certific+0.876
Neg logit contributions
ranch-0.514
 Property-0.415
Men-0.410
 Scot-0.407
 Kut-0.406
Token.
Token ID13
Feature activation+1.90e+00

Pos logit contributions
hift+1.314
Reviewer+1.312
eeper+1.223
REDACTED+1.185
 certific+1.142
Neg logit contributions
ranch-1.277
 Kut-1.088
 Property-1.084
Men-1.078
 Scot-1.077
Token Available
Token ID14898
Feature activation+8.09e-01

Pos logit contributions
hift+1.551
Reviewer+1.543
eeper+1.441
REDACTED+1.421
 spiked+1.362
Neg logit contributions
ranch-1.200
 Kut-0.987
 Property-0.985
Men-0.977
 Scot-0.975
Token at
Token ID379
Feature activation-2.53e-02

Pos logit contributions
hift+0.651
Reviewer+0.650
eeper+0.603
REDACTED+0.588
anamo+0.563
Neg logit contributions
ranch-0.530
 Kut-0.451
 Property-0.450
 Scot-0.450
Men-0.444
Token http
Token ID2638
Feature activation+3.85e-02

Pos logit contributions
ranch+0.016
 Kut+0.013
 Property+0.013
Men+0.013
 Scot+0.013
Neg logit contributions
hift-0.021
Reviewer-0.021
eeper-0.019
REDACTED-0.019
 spiked-0.018
Token://
Token ID1378
Feature activation-7.95e-01

Pos logit contributions
Reviewer+0.024
hift+0.024
eeper+0.022
 certific+0.021
anamo+0.020
Neg logit contributions
ranch-0.032
Men-0.028
 Kut-0.028
 Property-0.028
 Scot-0.028
Tokenpublic
Token ID11377
Feature activation+6.53e+00

Pos logit contributions
ranch+0.502
Men+0.409
 Property+0.407
 Kut+0.407
 Scot+0.402
Neg logit contributions
Reviewer-0.656
hift-0.654
eeper-0.611
REDACTED-0.602
 spiked-0.581
Token.
Token ID13
Feature activation+2.66e+00

Pos logit contributions
hift+4.224
Reviewer+4.034
eeper+3.794
REDACTED+3.639
 Insight+3.477
Neg logit contributions
ranch-5.158
 Kut-4.561
 Scot-4.529
 Property-4.501
Men-4.433
Tokenhealth
Token ID13948
Feature activation-2.84e-01

Pos logit contributions
hift+2.041
Reviewer+2.033
eeper+1.888
REDACTED+1.857
 spiked+1.777
Neg logit contributions
ranch-1.829
 Kut-1.533
 Property-1.530
Men-1.519
 Scot-1.516
Token.
Token ID13
Feature activation+2.36e+00

Pos logit contributions
ranch+0.208
 Property+0.175
 Kut+0.175
Men+0.174
 Scot+0.173
Neg logit contributions
Reviewer-0.205
hift-0.204
eeper-0.189
REDACTED-0.186
 spiked-0.177
Tokenore
Token ID382
Feature activation+1.21e+00

Pos logit contributions
hift+2.244
Reviewer+2.232
eeper+2.107
REDACTED+2.080
 spiked+2.006
Neg logit contributions
ranch-1.177
 Kut-0.922
 Property-0.918
 Scot-0.906
Men-0.906
Tokengon
Token ID14520
Feature activation+1.12e+00

Pos logit contributions
Reviewer+1.089
hift+1.038
REDACTED+0.995
eeper+0.987
 certific+0.954
Neg logit contributions
ranch-0.714
 Property-0.572
Men-0.558
 Scot-0.552
 Kut-0.537
Token
Token ID198
Feature activation-7.00e-01

Pos logit contributions
hift+0.231
Reviewer+0.230
eeper+0.217
REDACTED+0.213
 certific+0.205
Neg logit contributions
ranch-0.143
 Kut-0.118
 Property-0.117
 Scot-0.116
Men-0.115
Token
Token ID198
Feature activation+4.92e-03

Pos logit contributions
ranch+0.348
 Property+0.273
Men+0.272
 Kut+0.271
 Scot+0.269
Neg logit contributions
hift-0.674
Reviewer-0.672
eeper-0.639
REDACTED-0.618
 certific-0.618
TokenNewsletter
Token ID33031
Feature activation+1.80e+00

Pos logit contributions
hift+0.004
Reviewer+0.004
eeper+0.004
REDACTED+0.004
 spiked+0.004
Neg logit contributions
ranch-0.003
Men-0.002
 Kut-0.002
 Property-0.002
 Scot-0.002
Token Sign
Token ID5865
Feature activation+2.19e+00

Pos logit contributions
Reviewer+1.197
hift+1.188
eeper+1.088
REDACTED+1.078
 certific+1.016
Neg logit contributions
ranch-1.421
 Property-1.260
 Scot-1.239
 Kut-1.236
Men-1.230
Token Up
Token ID3205
Feature activation+3.20e+00

Pos logit contributions
hift+1.788
Reviewer+1.776
eeper+1.663
REDACTED+1.639
 spiked+1.571
Neg logit contributions
ranch-1.403
 Property-1.184
 Kut-1.173
 Scot-1.170
Men-1.160
Token Continue
Token ID10054
Feature activation+6.40e+00

Pos logit contributions
hift+1.480
Reviewer+1.467
eeper+1.282
REDACTED+1.243
 spiked+1.145
Neg logit contributions
ranch-3.181
 Property-2.850
 Kut-2.843
 Scot-2.837
Men-2.829
Token reading
Token ID3555
Feature activation+1.83e+00

Pos logit contributions
hift+3.962
Reviewer+3.927
eeper+3.604
REDACTED+3.537
 spiked+3.351
Neg logit contributions
ranch-5.323
 Kut-4.573
 Property-4.539
Men-4.531
 Scot-4.514
Token the
Token ID262
Feature activation+1.46e+00

Pos logit contributions
hift+1.574
Reviewer+1.567
eeper+1.463
REDACTED+1.441
 spiked+1.382
Neg logit contributions
ranch-1.084
 Property-0.892
 Kut-0.889
 Scot-0.880
Men-0.880
Token main
Token ID1388
Feature activation+2.11e+00

Pos logit contributions
hift+1.041
Reviewer+1.034
REDACTED+0.949
eeper+0.928
anamo+0.882
Neg logit contributions
ranch-1.090
 Property-0.958
 Kut-0.941
 Scot-0.939
Men-0.933
Token story
Token ID1621
Feature activation+2.85e+00

Pos logit contributions
Reviewer+1.389
hift+1.388
eeper+1.251
REDACTED+1.225
 certific+1.158
Neg logit contributions
ranch-1.693
 Property-1.484
 Scot-1.471
 Kut-1.469
Men-1.468
Token Please
Token ID4222
Feature activation+3.20e+00

Pos logit contributions
hift+2.045
Reviewer+2.036
eeper+1.884
REDACTED+1.826
 spiked+1.734
Neg logit contributions
ranch-2.113
 Property-1.841
 Kut-1.833
 Scot-1.830
Men-1.815
Token himself
Token ID2241
Feature activation-1.48e+00

Pos logit contributions
ranch+0.753
 Kut+0.650
 Property+0.643
 Scot+0.641
Men+0.632
Neg logit contributions
Reviewer-0.830
hift-0.823
eeper-0.763
REDACTED-0.744
 certific-0.705
Token as
Token ID355
Feature activation+3.52e-01

Pos logit contributions
ranch+0.933
 Kut+0.762
 Property+0.761
Men+0.756
 Scot+0.752
Neg logit contributions
hift-1.227
Reviewer-1.222
eeper-1.139
REDACTED-1.127
 spiked-1.082
Token '
Token ID705
Feature activation-9.10e-01

Pos logit contributions
Reviewer+0.320
hift+0.319
eeper+0.299
REDACTED+0.293
 certific+0.282
Neg logit contributions
ranch-0.191
 Kut-0.156
 Property-0.156
 Scot-0.154
Men-0.152
TokenGu
Token ID8205
Feature activation+3.06e+00

Pos logit contributions
ranch+0.593
Men+0.499
 Kut+0.490
 Property+0.490
 Scot+0.482
Neg logit contributions
hift-0.732
Reviewer-0.724
eeper-0.678
REDACTED-0.651
 certific-0.645
Tokencc
Token ID535
Feature activation+3.47e+00

Pos logit contributions
hift+2.567
Reviewer+2.554
eeper+2.374
REDACTED+2.356
 spiked+2.256
Neg logit contributions
ranch-1.887
 Property-1.545
 Kut-1.539
Men-1.533
 Scot-1.526
Tokenifer
Token ID7087
Feature activation+6.25e+00

Pos logit contributions
hift+1.986
Reviewer+1.974
eeper+1.788
REDACTED+1.748
 spiked+1.639
Neg logit contributions
ranch-3.044
 Kut-2.664
 Property-2.662
Men-2.646
 Scot-2.645
Token 2
Token ID362
Feature activation+3.30e+00

Pos logit contributions
hift+4.312
Reviewer+4.285
eeper+3.953
REDACTED+3.892
 spiked+3.705
Neg logit contributions
ranch-4.723
 Kut-4.013
 Property-3.995
Men-3.995
 Scot-3.966
Token.
Token ID13
Feature activation+4.69e-01

Pos logit contributions
hift+2.339
Reviewer+2.323
eeper+2.147
REDACTED+2.107
 spiked+2.005
Neg logit contributions
ranch-2.452
 Property-2.091
 Kut-2.090
Men-2.075
 Scot-2.072
Token0
Token ID15
Feature activation-2.15e+00

Pos logit contributions
hift+0.341
Reviewer+0.327
eeper+0.305
REDACTED+0.293
 certific+0.288
Neg logit contributions
ranch-0.346
 Property-0.305
 Kut-0.303
Men-0.300
 Scot-0.300
Token,'
Token ID4032
Feature activation+1.30e+00

Pos logit contributions
ranch+1.222
 Kut+1.012
 Property+1.007
 Scot+0.994
Men+0.987
Neg logit contributions
Reviewer-1.880
hift-1.874
eeper-1.736
REDACTED-1.728
 certific-1.659
Token who
Token ID508
Feature activation+1.38e+00

Pos logit contributions
Reviewer+1.039
hift+1.039
eeper+0.957
REDACTED+0.943
 certific+0.896
Neg logit contributions
ranch-0.843
 Kut-0.722
 Property-0.716
 Scot-0.709
Men-0.704
Token was
Token ID373
Feature activation-3.33e-01

Pos logit contributions
hift+0.460
Reviewer+0.459
eeper+0.422
REDACTED+0.414
 certific+0.390
Neg logit contributions
ranch-0.470
 Kut-0.405
 Property-0.403
Men-0.401
 Scot-0.401
Token about
Token ID546
Feature activation+1.22e+00

Pos logit contributions
ranch+0.214
 Kut+0.183
 Scot+0.182
 Property+0.181
Men+0.180
Neg logit contributions
hift-0.267
Reviewer-0.264
eeper-0.249
REDACTED-0.243
anamo-0.230
Token homosexuality
Token ID19822
Feature activation+3.90e-01

Pos logit contributions
hift+0.965
Reviewer+0.963
eeper+0.896
REDACTED+0.879
 spiked+0.829
Neg logit contributions
ranch-0.806
 Kut-0.692
 Scot-0.685
 Property-0.682
Men-0.676
Token
Token ID198
Feature activation+1.44e-01

Pos logit contributions
Reviewer+0.299
hift+0.299
eeper+0.277
REDACTED+0.272
 certific+0.257
Neg logit contributions
ranch-0.264
 Kut-0.226
 Property-0.225
Men-0.225
 Scot-0.224
Token
Token ID198
Feature activation+8.31e-01

Pos logit contributions
hift+0.130
Reviewer+0.129
eeper+0.122
REDACTED+0.120
 spiked+0.116
Neg logit contributions
ranch-0.079
 Kut-0.062
Men-0.062
 Property-0.062
 Scot-0.062
TokenReviewer
Token ID35407
Feature activation+6.23e+00

Pos logit contributions
hift+0.620
Reviewer+0.596
eeper+0.572
REDACTED+0.557
 spiked+0.542
Neg logit contributions
ranch-0.592
Men-0.506
 Kut-0.493
 Property-0.491
 Scot-0.490
Token:
Token ID25
Feature activation+1.25e+00

Pos logit contributions
hift+3.568
Reviewer+3.548
eeper+3.208
REDACTED+3.138
 spiked+2.953
Neg logit contributions
ranch-5.462
 Kut-4.767
 Property-4.754
Men-4.733
 Scot-4.724
Token k
Token ID479
Feature activation+2.21e+00

Pos logit contributions
hift+0.966
Reviewer+0.960
eeper+0.895
REDACTED+0.880
 spiked+0.834
Neg logit contributions
ranch-0.856
 Kut-0.723
 Property-0.723
 Scot-0.719
Men-0.713
Tokenimp
Token ID11011
Feature activation+1.09e+00

Pos logit contributions
hift+1.715
Reviewer+1.706
eeper+1.582
REDACTED+1.566
 spiked+1.496
Neg logit contributions
ranch-1.500
 Kut-1.248
 Property-1.245
Men-1.240
 Scot-1.237
Tokenunk
Token ID2954
Feature activation+7.94e-01

Pos logit contributions
hift+0.826
Reviewer+0.825
eeper+0.763
REDACTED+0.756
 spiked+0.719
Neg logit contributions
ranch-0.759
 Kut-0.632
 Property-0.632
Men-0.628
 Scot-0.627
Tokenrock
Token ID10823
Feature activation+6.88e-01

Pos logit contributions
hift+0.574
Reviewer+0.571
eeper+0.528
REDACTED+0.523
 spiked+0.497
Neg logit contributions
ranch-0.582
 Property-0.490
 Kut-0.489
 Scot-0.487
Men-0.487
Token
Token ID198
Feature activation-1.84e+00

Pos logit contributions
ranch+0.245
 Kut+0.197
 Property+0.196
 Scot+0.194
Men+0.194
Neg logit contributions
hift-0.371
Reviewer-0.370
eeper-0.346
REDACTED-0.342
 spiked-0.329
Token
Token ID198
Feature activation-8.69e-01

Pos logit contributions
ranch+0.866
 Property+0.683
Men+0.682
 Kut+0.672
 Scot+0.670
Neg logit contributions
hift-1.812
Reviewer-1.802
eeper-1.726
 certific-1.686
REDACTED-1.658
TokenNewsletter
Token ID33031
Feature activation+1.72e+00

Pos logit contributions
ranch+0.541
Men+0.457
 Kut+0.446
 Property+0.444
 Scot+0.437
Neg logit contributions
hift-0.726
Reviewer-0.710
eeper-0.678
REDACTED-0.656
 spiked-0.644
Token Sign
Token ID5865
Feature activation+2.29e+00

Pos logit contributions
Reviewer+1.132
hift+1.122
eeper+1.028
REDACTED+1.014
 certific+0.954
Neg logit contributions
ranch-1.367
 Property-1.209
 Kut-1.192
 Scot-1.191
Men-1.187
Token Up
Token ID3205
Feature activation+3.07e+00

Pos logit contributions
hift+1.865
Reviewer+1.854
eeper+1.736
REDACTED+1.710
 spiked+1.639
Neg logit contributions
ranch-1.468
 Property-1.243
 Kut-1.232
 Scot-1.227
Men-1.219
Token Continue
Token ID10054
Feature activation+6.20e+00

Pos logit contributions
hift+1.433
Reviewer+1.425
eeper+1.246
REDACTED+1.208
 spiked+1.114
Neg logit contributions
ranch-3.032
 Property-2.726
 Kut-2.714
 Scot-2.712
Men-2.702
Token reading
Token ID3555
Feature activation+1.06e+00

Pos logit contributions
hift+3.837
Reviewer+3.800
eeper+3.484
REDACTED+3.423
 spiked+3.251
Neg logit contributions
ranch-5.171
 Kut-4.434
 Property-4.410
Men-4.402
 Scot-4.377
Token the
Token ID262
Feature activation+1.26e+00

Pos logit contributions
hift+0.919
Reviewer+0.915
eeper+0.857
REDACTED+0.840
 certific+0.810
Neg logit contributions
ranch-0.622
 Kut-0.514
 Property-0.513
 Scot-0.507
Men-0.506
Token main
Token ID1388
Feature activation+1.45e+00

Pos logit contributions
Reviewer+0.895
hift+0.889
eeper+0.802
REDACTED+0.802
anamo+0.758
Neg logit contributions
ranch-0.950
 Property-0.833
 Kut-0.830
 Scot-0.828
Men-0.817
Token story
Token ID1621
Feature activation+2.55e+00

Pos logit contributions
Reviewer+0.943
hift+0.936
eeper+0.845
REDACTED+0.820
 certific+0.786
Neg logit contributions
ranch-1.176
 Property-1.029
 Kut-1.026
 Scot-1.023
Men-1.021
Token Please
Token ID4222
Feature activation+2.93e+00

Pos logit contributions
hift+1.839
Reviewer+1.833
eeper+1.702
REDACTED+1.647
 certific+1.565
Neg logit contributions
ranch-1.863
 Property-1.636
 Kut-1.625
 Scot-1.619
Men-1.603
Token that
Token ID326
Feature activation-3.28e-01

Pos logit contributions
ranch+0.731
 Kut+0.616
 Property+0.612
 Scot+0.605
Men+0.601
Neg logit contributions
Reviewer-0.867
hift-0.860
eeper-0.802
REDACTED-0.797
 Paste-0.750
Token the
Token ID262
Feature activation+8.58e-02

Pos logit contributions
ranch+0.212
 Kut+0.180
 Property+0.178
 Scot+0.177
Men+0.175
Neg logit contributions
Reviewer-0.265
hift-0.262
eeper-0.245
REDACTED-0.243
 spiked-0.229
Token DPRK
Token ID48363
Feature activation+5.09e-01

Pos logit contributions
Reviewer+0.067
hift+0.067
eeper+0.062
REDACTED+0.062
 spiked+0.058
Neg logit contributions
ranch-0.058
 Kut-0.049
 Property-0.049
 Scot-0.048
Men-0.048
Token has
Token ID468
Feature activation+2.20e-01

Pos logit contributions
Reviewer+0.383
hift+0.381
eeper+0.349
REDACTED+0.344
anamo+0.324
Neg logit contributions
ranch-0.361
 Property-0.309
 Kut-0.306
 Scot-0.303
Men-0.302
Token a
Token ID257
Feature activation+7.77e-01

Pos logit contributions
Reviewer+0.172
hift+0.171
eeper+0.158
REDACTED+0.157
anamo+0.149
Neg logit contributions
ranch-0.149
 Kut-0.126
 Property-0.126
 Scot-0.125
Men-0.123
Token multip
Token ID18540
Feature activation+6.15e+00

Pos logit contributions
Reviewer+0.584
hift+0.576
REDACTED+0.531
eeper+0.531
anamo+0.496
Neg logit contributions
ranch-0.551
 Property-0.472
 Kut-0.472
 Scot-0.463
Men-0.456
Tokenarty
Token ID25494
Feature activation+1.61e+00

Pos logit contributions
hift+1.190
Reviewer+1.083
eeper+0.863
REDACTED+0.682
arty+0.663
Neg logit contributions
ranch-7.559
 Kut-7.028
 Property-6.987
Men-6.933
 Scot-6.925
Token system
Token ID1080
Feature activation+8.12e-01

Pos logit contributions
Reviewer+1.306
hift+1.298
eeper+1.219
REDACTED+1.195
 spiked+1.133
Neg logit contributions
ranch-1.050
 Property-0.878
 Kut-0.876
 Scot-0.866
Men-0.850
Token once
Token ID1752
Feature activation+2.40e-01

Pos logit contributions
Reviewer+0.627
hift+0.623
eeper+0.581
REDACTED+0.569
 Insight+0.541
Neg logit contributions
ranch-0.554
 Property-0.464
 Kut-0.463
 Scot-0.461
Men-0.457
Token again
Token ID757
Feature activation-4.80e-01

Pos logit contributions
Reviewer+0.191
hift+0.190
eeper+0.177
REDACTED+0.174
 Paste+0.165
Neg logit contributions
ranch-0.158
 Kut-0.133
 Property-0.132
 Scot-0.131
Men-0.131
Token:
Token ID25
Feature activation-1.03e+00

Pos logit contributions
ranch+0.337
 Kut+0.287
 Property+0.286
 Scot+0.285
Men+0.284
Neg logit contributions
Reviewer-0.358
hift-0.356
eeper-0.333
REDACTED-0.324
 Insight-0.307
Token July
Token ID2901
Feature activation+3.77e-01

Pos logit contributions
Reviewer+0.227
hift+0.224
eeper+0.211
REDACTED+0.208
 Paste+0.197
Neg logit contributions
ranch-0.152
 Kut-0.126
 Scot-0.126
 Property-0.124
Men-0.124
Token.
Token ID13
Feature activation-2.28e-01

Pos logit contributions
Reviewer+0.289
hift+0.286
eeper+0.267
REDACTED+0.261
 certific+0.248
Neg logit contributions
ranch-0.259
 Kut-0.221
 Property-0.219
 Scot-0.219
Men-0.219
Token
Token ID198
Feature activation-1.55e+00

Pos logit contributions
ranch+0.126
 Property+0.103
 Kut+0.103
Men+0.101
 Scot+0.101
Neg logit contributions
hift-0.205
Reviewer-0.204
eeper-0.192
REDACTED-0.189
 certific-0.182
Token
Token ID198
Feature activation-7.22e-01

Pos logit contributions
ranch+0.737
 Property+0.576
Men+0.575
 Kut+0.569
 Scot+0.566
Neg logit contributions
hift-1.522
Reviewer-1.516
eeper-1.447
 certific-1.412
REDACTED-1.395
TokenAdvertisement
Token ID4723
Feature activation-1.14e+00

Pos logit contributions
ranch+0.447
Men+0.376
 Kut+0.365
 Property+0.364
 Scot+0.358
Neg logit contributions
hift-0.605
Reviewer-0.592
eeper-0.568
REDACTED-0.547
 spiked-0.539
Token Continue
Token ID10054
Feature activation+6.11e+00

Pos logit contributions
ranch+1.046
 Property+0.957
 Scot+0.938
Men+0.925
 Wong+0.924
Neg logit contributions
Reviewer-0.623
hift-0.604
eeper-0.548
REDACTED-0.528
 certific-0.516
Token reading
Token ID3555
Feature activation+7.25e-01

Pos logit contributions
hift+3.753
Reviewer+3.710
eeper+3.407
REDACTED+3.356
 spiked+3.182
Neg logit contributions
ranch-5.092
 Kut-4.385
 Property-4.367
Men-4.352
 Scot-4.321
Token the
Token ID262
Feature activation+9.82e-01

Pos logit contributions
hift+0.636
Reviewer+0.634
eeper+0.595
REDACTED+0.581
 certific+0.565
Neg logit contributions
ranch-0.418
 Kut-0.346
 Property-0.344
 Scot-0.341
Men-0.339
Token main
Token ID1388
Feature activation+1.20e+00

Pos logit contributions
Reviewer+0.692
hift+0.685
eeper+0.624
REDACTED+0.618
anamo+0.590
Neg logit contributions
ranch-0.748
 Scot-0.648
 Kut-0.646
 Property-0.645
Men-0.641
Token story
Token ID1621
Feature activation+1.02e+00

Pos logit contributions
Reviewer+0.761
hift+0.755
eeper+0.685
REDACTED+0.664
 certific+0.633
Neg logit contributions
ranch-0.994
Men-0.861
 Kut-0.861
 Scot-0.860
 Property-0.859
Token
Token ID198
Feature activation+1.07e+00

Pos logit contributions
hift+0.960
Reviewer+0.956
eeper+0.907
REDACTED+0.880
 certific+0.868
Neg logit contributions
ranch-0.525
 Property-0.424
 Kut-0.419
Men-0.419
 Scot-0.417
Token Russian
Token ID3394
Feature activation+1.78e+00

Pos logit contributions
Reviewer+0.265
hift+0.260
eeper+0.245
REDACTED+0.243
 spiked+0.228
Neg logit contributions
ranch-0.219
 Scot-0.191
 Kut-0.188
 Property-0.187
Men-0.184
Token Mars
Token ID8706
Feature activation-1.99e+00

Pos logit contributions
Reviewer+1.360
hift+1.358
eeper+1.264
REDACTED+1.245
 spiked+1.182
Neg logit contributions
ranch-1.215
 Kut-1.036
 Property-1.026
 Scot-1.026
Men-1.014
Token probe
Token ID12774
Feature activation-6.68e-02

Pos logit contributions
ranch+1.487
 Kut+1.290
 Scot+1.279
 Property+1.274
Men+1.264
Neg logit contributions
Reviewer-1.387
hift-1.367
eeper-1.265
REDACTED-1.245
 certific-1.197
Token,
Token ID11
Feature activation+2.80e-01

Pos logit contributions
ranch+0.046
 Kut+0.041
 Scot+0.040
 Property+0.040
Men+0.039
Neg logit contributions
Reviewer-0.050
hift-0.050
eeper-0.046
REDACTED-0.046
 certific-0.043
Token Ph
Token ID1380
Feature activation-7.04e-01

Pos logit contributions
Reviewer+0.218
hift+0.218
eeper+0.201
REDACTED+0.199
 spiked+0.188
Neg logit contributions
ranch-0.189
 Kut-0.163
 Scot-0.160
 Property-0.160
Men-0.158
Tokenobos
Token ID49878
Feature activation+6.02e+00

Pos logit contributions
ranch+0.625
Men+0.549
 Property+0.543
 Kut+0.542
 Scot+0.540
Neg logit contributions
Reviewer-0.396
hift-0.387
eeper-0.352
REDACTED-0.349
 certific-0.336
Token-
Token ID12
Feature activation+1.68e+00

Pos logit contributions
hift+4.141
Reviewer+4.079
eeper+3.772
REDACTED+3.717
 spiked+3.535
Neg logit contributions
ranch-4.598
 Kut-3.919
 Property-3.916
 Scot-3.882
Men-3.878
TokenGr
Token ID8642
Feature activation+2.01e+00

Pos logit contributions
Reviewer+1.353
hift+1.348
eeper+1.275
REDACTED+1.224
 spiked+1.204
Neg logit contributions
ranch-1.080
Men-0.920
 Property-0.881
 Scot-0.879
 Kut-0.874
Tokenunt
Token ID2797
Feature activation+1.46e+00

Pos logit contributions
hift+1.899
Reviewer+1.886
eeper+1.784
REDACTED+1.757
 spiked+1.693
Neg logit contributions
ranch-1.019
 Kut-0.801
 Property-0.798
Men-0.786
 Scot-0.786
Token,
Token ID11
Feature activation+3.27e-01

Pos logit contributions
Reviewer+1.068
hift+1.062
eeper+0.987
REDACTED+0.963
 certific+0.919
Neg logit contributions
ranch-1.053
 Kut-0.900
 Property-0.892
 Scot-0.886
Men-0.886
Token last
Token ID938
Feature activation+1.01e+00

Pos logit contributions
Reviewer+0.243
hift+0.241
eeper+0.222
REDACTED+0.220
anamo+0.205
Neg logit contributions
ranch-0.231
 Kut-0.202
 Scot-0.201
 Property-0.198
Men-0.196
Token Journal
Token ID4913
Feature activation+9.47e-01

Pos logit contributions
Reviewer+0.966
hift+0.966
eeper+0.895
REDACTED+0.879
 certific+0.832
Neg logit contributions
ranch-0.932
 Property-0.813
 Scot-0.810
 Kut-0.809
Men-0.803
Token of
Token ID286
Feature activation+1.80e+00

Pos logit contributions
Reviewer+0.769
hift+0.765
eeper+0.718
REDACTED+0.702
 certific+0.683
Neg logit contributions
ranch-0.610
 Property-0.513
 Scot-0.510
 Kut-0.509
Men-0.506
Token Research
Token ID4992
Feature activation-3.74e-01

Pos logit contributions
Reviewer+1.473
hift+1.468
eeper+1.366
REDACTED+1.352
 certific+1.281
Neg logit contributions
ranch-1.146
 Property-0.969
 Kut-0.963
 Scot-0.956
Men-0.951
Token on
Token ID319
Feature activation+1.53e+00

Pos logit contributions
ranch+0.281
 Property+0.243
 Kut+0.241
Men+0.240
 Scot+0.240
Neg logit contributions
hift-0.262
Reviewer-0.260
eeper-0.241
REDACTED-0.235
 spiked-0.224
Token Ad
Token ID1215
Feature activation+1.00e+00

Pos logit contributions
hift+1.335
Reviewer+1.330
eeper+1.250
REDACTED+1.240
 certific+1.175
Neg logit contributions
ranch-0.895
 Property-0.753
 Kut-0.744
Men-0.738
 Scot-0.737
Tokenolesc
Token ID16850
Feature activation+5.84e+00

Pos logit contributions
hift+0.705
Reviewer+0.699
eeper+0.649
REDACTED+0.642
 spiked+0.604
Neg logit contributions
ranch-0.749
 Property-0.645
 Kut-0.642
Men-0.641
 Scot-0.635
Tokenence
Token ID594
Feature activation+2.77e+00

Pos logit contributions
hift+3.181
Reviewer+3.116
eeper+2.923
REDACTED+2.767
 Insight+2.663
Neg logit contributions
ranch-5.154
 Scot-4.501
 Property-4.474
 Kut-4.461
WC-4.419
Token,
Token ID11
Feature activation+1.46e+00

Pos logit contributions
hift+1.910
Reviewer+1.899
eeper+1.737
REDACTED+1.705
 DOI+1.680
Neg logit contributions
ranch-2.087
 Kut-1.733
 Property-1.716
 Scot-1.711
Men-1.711
Token 8
Token ID807
Feature activation+8.96e-01

Pos logit contributions
Reviewer+1.286
hift+1.280
eeper+1.213
REDACTED+1.177
 certific+1.138
Neg logit contributions
ranch-0.839
 Kut-0.711
 Property-0.709
 Scot-0.704
Men-0.698
Token,
Token ID11
Feature activation+9.67e-01

Pos logit contributions
Reviewer+0.623
hift+0.620
eeper+0.567
REDACTED+0.566
 DOI+0.543
Neg logit contributions
ranch-0.672
 Property-0.568
 Kut-0.565
 Scot-0.564
Men-0.555
Token 333
Token ID23460
Feature activation+6.63e-01

Pos logit contributions
hift+0.770
Reviewer+0.768
eeper+0.720
REDACTED+0.702
 spiked+0.671
Neg logit contributions
ranch-0.633
 Kut-0.535
 Property-0.534
 Scot-0.529
Men-0.526
Token
Token ID198
Feature activation-1.97e+00

Pos logit contributions
ranch+0.318
 Property+0.263
 Kut+0.262
 Scot+0.259
Men+0.258
Neg logit contributions
hift-0.509
Reviewer-0.505
eeper-0.478
REDACTED-0.470
 certific-0.454
Token
Token ID198
Feature activation-1.11e+00

Pos logit contributions
ranch+0.924
Men+0.732
 Property+0.727
 Scot+0.719
 Kut+0.715
Neg logit contributions
hift-1.947
Reviewer-1.936
eeper-1.853
 certific-1.809
anamo-1.776
TokenNewsletter
Token ID33031
Feature activation+8.67e-01

Pos logit contributions
ranch+0.716
Men+0.608
WC+0.584
 Kut+0.583
 Property+0.583
Neg logit contributions
hift-0.898
Reviewer-0.871
eeper-0.840
REDACTED-0.809
 spiked-0.809
Token Sign
Token ID5865
Feature activation+1.87e+00

Pos logit contributions
Reviewer+0.598
hift+0.585
eeper+0.539
REDACTED+0.539
anamo+0.505
Neg logit contributions
ranch-0.659
 Property-0.596
 Scot-0.582
 Kut-0.574
Men-0.574
Token Up
Token ID3205
Feature activation+2.89e+00

Pos logit contributions
hift+1.547
Reviewer+1.539
eeper+1.440
REDACTED+1.425
 spiked+1.364
Neg logit contributions
ranch-1.169
 Property-0.993
 Scot-0.978
 Kut-0.976
Men-0.965
Token Continue
Token ID10054
Feature activation+5.82e+00

Pos logit contributions
hift+1.353
Reviewer+1.346
eeper+1.171
REDACTED+1.138
 spiked+1.050
Neg logit contributions
ranch-2.857
 Property-2.564
 Scot-2.554
 Kut-2.553
Men-2.543
Token reading
Token ID3555
Feature activation+8.13e-01

Pos logit contributions
hift+3.643
Reviewer+3.614
eeper+3.317
REDACTED+3.251
 spiked+3.087
Neg logit contributions
ranch-4.807
 Kut-4.128
 Property-4.103
Men-4.098
 Scot-4.080
Token the
Token ID262
Feature activation+9.12e-01

Pos logit contributions
hift+0.712
Reviewer+0.709
eeper+0.662
REDACTED+0.650
 certific+0.627
Neg logit contributions
ranch-0.469
 Property-0.389
 Kut-0.386
 Scot-0.382
Men-0.381
Token main
Token ID1388
Feature activation+6.48e-01

Pos logit contributions
Reviewer+0.644
hift+0.640
REDACTED+0.583
eeper+0.578
anamo+0.547
Neg logit contributions
ranch-0.685
 Property-0.606
 Kut-0.596
 Scot-0.594
Men-0.588
Token story
Token ID1621
Feature activation+2.30e+00

Pos logit contributions
Reviewer+0.446
hift+0.442
eeper+0.401
REDACTED+0.390
 certific+0.373
Neg logit contributions
ranch-0.500
 Property-0.442
 Scot-0.437
Men-0.437
 Kut-0.435
Token Please
Token ID4222
Feature activation+2.86e+00

Pos logit contributions
hift+1.674
Reviewer+1.665
eeper+1.546
REDACTED+1.494
 certific+1.427
Neg logit contributions
ranch-1.672
 Property-1.472
 Scot-1.460
 Kut-1.455
Men-1.442
Token
Token ID198
Feature activation-6.10e-01

Pos logit contributions
hift+0.485
Reviewer+0.481
eeper+0.454
REDACTED+0.448
 certific+0.430
Neg logit contributions
ranch-0.309
 Kut-0.255
 Property-0.254
Men-0.251
 Scot-0.251
Token
Token ID198
Feature activation+9.27e-02

Pos logit contributions
ranch+0.302
 Property+0.240
Men+0.238
 Kut+0.237
 Scot+0.236
Neg logit contributions
hift-0.587
Reviewer-0.584
eeper-0.557
REDACTED-0.538
 certific-0.538
TokenNewsletter
Token ID33031
Feature activation+1.11e+00

Pos logit contributions
hift+0.079
Reviewer+0.078
eeper+0.074
REDACTED+0.072
 spiked+0.071
Neg logit contributions
ranch-0.056
Men-0.047
 Kut-0.046
 Property-0.045
 Scot-0.045
Token Sign
Token ID5865
Feature activation+1.59e+00

Pos logit contributions
Reviewer+0.738
hift+0.732
eeper+0.672
REDACTED+0.661
 spiked+0.620
Neg logit contributions
ranch-0.881
 Property-0.781
 Scot-0.769
 Kut-0.768
Men-0.764
Token Up
Token ID3205
Feature activation+2.75e+00

Pos logit contributions
hift+1.316
Reviewer+1.309
eeper+1.226
REDACTED+1.212
 certific+1.160
Neg logit contributions
ranch-1.002
 Property-0.854
 Scot-0.842
 Kut-0.840
Men-0.831
Token Continue
Token ID10054
Feature activation+5.76e+00

Pos logit contributions
hift+1.322
Reviewer+1.313
eeper+1.143
REDACTED+1.114
 certific+1.022
Neg logit contributions
ranch-2.698
 Property-2.434
 Scot-2.423
 Kut-2.414
Men-2.404
Token reading
Token ID3555
Feature activation+9.10e-01

Pos logit contributions
hift+3.651
Reviewer+3.627
eeper+3.325
REDACTED+3.262
 spiked+3.091
Neg logit contributions
ranch-4.706
 Kut-4.050
 Property-4.034
Men-4.018
 Scot-4.003
Token the
Token ID262
Feature activation+1.02e+00

Pos logit contributions
hift+0.800
Reviewer+0.797
eeper+0.746
REDACTED+0.730
 certific+0.707
Neg logit contributions
ranch-0.524
 Property-0.433
 Kut-0.432
 Scot-0.428
Men-0.425
Token main
Token ID1388
Feature activation+7.84e-01

Pos logit contributions
Reviewer+0.734
hift+0.728
REDACTED+0.656
eeper+0.653
anamo+0.622
Neg logit contributions
ranch-0.755
 Property-0.663
 Scot-0.661
 Kut-0.660
Men-0.650
Token story
Token ID1621
Feature activation+2.35e+00

Pos logit contributions
Reviewer+0.519
hift+0.514
eeper+0.464
REDACTED+0.450
 certific+0.436
Neg logit contributions
ranch-0.629
 Property-0.548
 Scot-0.548
 Kut-0.546
Men-0.546
Token Please
Token ID4222
Feature activation+2.91e+00

Pos logit contributions
hift+1.708
Reviewer+1.696
eeper+1.575
REDACTED+1.526
 certific+1.450
Neg logit contributions
ranch-1.708
 Property-1.502
 Scot-1.492
 Kut-1.490
Men-1.471
Token Journal
Token ID4913
Feature activation+1.15e+00

Pos logit contributions
hift+1.393
Reviewer+1.390
eeper+1.287
REDACTED+1.267
 certific+1.196
Neg logit contributions
ranch-1.359
 Property-1.176
 Kut-1.171
 Scot-1.171
Men-1.164
Token of
Token ID286
Feature activation+2.00e+00

Pos logit contributions
Reviewer+0.930
hift+0.923
eeper+0.867
REDACTED+0.848
 certific+0.819
Neg logit contributions
ranch-0.738
 Property-0.624
 Kut-0.620
 Scot-0.620
Men-0.614
Token Youth
Token ID17582
Feature activation+4.04e-01

Pos logit contributions
Reviewer+1.615
hift+1.609
eeper+1.495
REDACTED+1.482
 spiked+1.401
Neg logit contributions
ranch-1.293
 Property-1.092
 Kut-1.087
 Scot-1.079
Men-1.075
Token and
Token ID290
Feature activation+9.70e-01

Pos logit contributions
Reviewer+0.325
hift+0.319
eeper+0.297
REDACTED+0.296
anamo+0.278
Neg logit contributions
ranch-0.264
 Property-0.227
 Kut-0.225
Men-0.225
 Scot-0.224
Token Ad
Token ID1215
Feature activation+8.26e-01

Pos logit contributions
Reviewer+0.919
hift+0.914
eeper+0.861
REDACTED+0.851
 spiked+0.816
Neg logit contributions
ranch-0.495
 Property-0.396
 Kut-0.394
 Scot-0.391
Men-0.388
Tokenolesc
Token ID16850
Feature activation+5.52e+00

Pos logit contributions
hift+0.629
Reviewer+0.626
eeper+0.591
REDACTED+0.587
 certific+0.558
Neg logit contributions
ranch-0.566
 Property-0.492
Men-0.488
 Kut-0.486
 Scot-0.479
Tokenence
Token ID594
Feature activation+2.29e+00

Pos logit contributions
hift+3.316
Reviewer+3.288
eeper+3.002
REDACTED+2.934
 spiked+2.766
Neg logit contributions
ranch-4.692
 Kut-4.080
 Property-4.074
 Scot-4.050
Men-4.048
Token,
Token ID11
Feature activation+1.29e+00

Pos logit contributions
hift+1.620
Reviewer+1.603
eeper+1.476
REDACTED+1.446
 DOI+1.425
Neg logit contributions
ranch-1.716
 Kut-1.411
 Scot-1.399
Men-1.397
 Property-1.395
Token 39
Token ID5014
Feature activation+7.71e-01

Pos logit contributions
Reviewer+1.136
hift+1.129
eeper+1.071
REDACTED+1.038
 certific+1.005
Neg logit contributions
ranch-0.744
 Kut-0.635
 Property-0.634
 Scot-0.626
Men-0.621
Token,
Token ID11
Feature activation+8.89e-01

Pos logit contributions
hift+0.527
Reviewer+0.522
eeper+0.479
REDACTED+0.476
 DOI+0.455
Neg logit contributions
ranch-0.588
 Property-0.499
 Kut-0.495
 Scot-0.492
Men-0.485
Token 10
Token ID838
Feature activation+5.43e-01

Pos logit contributions
hift+0.698
Reviewer+0.697
eeper+0.653
REDACTED+0.635
 spiked+0.607
Neg logit contributions
ranch-0.592
 Kut-0.503
 Property-0.501
 Scot-0.497
Men-0.495
Token.
Token ID13
Feature activation-3.31e-01

Pos logit contributions
hift+0.243
Reviewer+0.240
eeper+0.223
REDACTED+0.218
 spiked+0.208
Neg logit contributions
ranch-0.261
 Kut-0.223
 Property-0.222
Men-0.221
 Scot-0.220
Token area
Token ID1989
Feature activation-2.69e-01

Pos logit contributions
ranch+0.213
 Property+0.179
 Kut+0.177
Men+0.175
 Scot+0.175
Neg logit contributions
Reviewer-0.270
hift-0.267
eeper-0.250
REDACTED-0.245
 certific-0.234
Token,
Token ID11
Feature activation-9.08e-01

Pos logit contributions
ranch+0.188
 Property+0.162
 Kut+0.162
 Scot+0.162
Men+0.159
Neg logit contributions
Reviewer-0.206
hift-0.200
eeper-0.188
REDACTED-0.183
 certific-0.174
Token north
Token ID5093
Feature activation-1.51e+00

Pos logit contributions
ranch+0.639
 Property+0.555
 Scot+0.552
 Kut+0.552
Men+0.540
Neg logit contributions
Reviewer-0.698
hift-0.679
eeper-0.625
REDACTED-0.614
 certific-0.581
Token of
Token ID286
Feature activation-1.94e+00

Pos logit contributions
ranch+1.031
 Kut+0.866
 Property+0.863
Men+0.857
 Scot+0.856
Neg logit contributions
hift-1.160
Reviewer-1.154
eeper-1.073
REDACTED-1.056
 spiked-1.009
Token Gerrard
Token ID49314
Feature activation-6.21e+00

Pos logit contributions
ranch+1.179
 Property+0.999
 Scot+0.996
 Kut+0.990
Men+0.957
Neg logit contributions
Reviewer-1.680
hift-1.631
eeper-1.541
REDACTED-1.511
 certific-1.450
Token St
Token ID520
Feature activation-5.38e-01

Pos logit contributions
ranch+3.371
 Property+2.878
 Scot+2.828
 Kut+2.818
 Wong+2.707
Neg logit contributions
Reviewer-5.763
hift-5.396
eeper-5.156
REDACTED-5.120
 certific-4.961
Token.
Token ID13
Feature activation-1.27e+00

Pos logit contributions
ranch+0.393
 Kut+0.335
 Property+0.333
Men+0.332
 Scot+0.330
Neg logit contributions
hift-0.384
Reviewer-0.375
eeper-0.356
REDACTED-0.345
 spiked-0.334
Token E
Token ID412
Feature activation-2.91e+00

Pos logit contributions
ranch+0.885
 Property+0.765
 Kut+0.755
 Scot+0.751
Men+0.745
Neg logit contributions
Reviewer-0.966
hift-0.948
eeper-0.877
REDACTED-0.861
 certific-0.822
Token.
Token ID13
Feature activation-1.44e+00

Pos logit contributions
ranch+1.986
 Property+1.674
 Scot+1.660
 Kut+1.654
Men+1.638
Neg logit contributions
Reviewer-2.276
hift-2.236
eeper-2.065
REDACTED-2.046
 certific-1.941
Token and
Token ID290
Feature activation-9.78e-01

Pos logit contributions
ranch+1.044
 Property+0.906
 Kut+0.897
 Scot+0.891
Men+0.884
Neg logit contributions
Reviewer-1.070
hift-1.046
eeper-0.969
REDACTED-0.947
 certific-0.906
Token fil
Token ID1226
Feature activation+9.49e-01

Pos logit contributions
Reviewer+0.317
hift+0.315
eeper+0.291
REDACTED+0.288
 certific+0.269
Neg logit contributions
ranch-0.299
 Kut-0.253
 Scot-0.250
 Property-0.250
Men-0.248
Tokenlets
Token ID5289
Feature activation+1.68e+00

Pos logit contributions
hift+0.486
Reviewer+0.482
eeper+0.429
REDACTED+0.426
 spiked+0.388
Neg logit contributions
ranch-0.902
 Property-0.792
 Kut-0.791
Men-0.788
 Scot-0.785
Token are
Token ID389
Feature activation+6.62e-01

Pos logit contributions
Reviewer+1.281
hift+1.259
eeper+1.169
REDACTED+1.161
 certific+1.090
Neg logit contributions
ranch-1.180
 Kut-1.000
 Property-0.990
 Scot-0.987
Men-0.980
Token the
Token ID262
Feature activation+1.71e-01

Pos logit contributions
Reviewer+0.487
hift+0.482
eeper+0.446
REDACTED+0.438
 Insight+0.409
Neg logit contributions
ranch-0.480
 Kut-0.411
 Property-0.410
 Scot-0.409
Men-0.404
Token same
Token ID976
Feature activation-2.98e+00

Pos logit contributions
hift+0.163
Reviewer+0.163
eeper+0.153
REDACTED+0.151
 Insight+0.145
Neg logit contributions
ranch-0.086
 Property-0.069
 Kut-0.069
 Scot-0.067
Men-0.066
Token exact
Token ID2748
Feature activation-6.19e+00

Pos logit contributions
ranch+1.788
 Kut+1.496
 Scot+1.474
 Property+1.472
Men+1.461
Neg logit contributions
hift-2.511
Reviewer-2.491
eeper-2.350
REDACTED-2.294
anamo-2.218
Token size
Token ID2546
Feature activation-1.69e+00

Pos logit contributions
ranch+3.520
 Kut+2.871
 Scot+2.861
 Property+2.848
Men+2.773
Neg logit contributions
hift-5.279
Reviewer-5.208
eeper-4.981
REDACTED-4.891
anamo-4.705
Token.
Token ID13
Feature activation-4.21e-01

Pos logit contributions
ranch+1.155
 Kut+0.976
 Scot+0.968
 Property+0.966
Men+0.965
Neg logit contributions
hift-1.300
Reviewer-1.288
eeper-1.202
REDACTED-1.182
 Insight-1.118
Token
Token ID447
Feature activation-3.54e-02

Pos logit contributions
ranch+0.239
 Kut+0.201
 Property+0.199
 Scot+0.199
Men+0.196
Neg logit contributions
hift-0.373
Reviewer-0.371
eeper-0.351
REDACTED-0.343
 certific-0.334
Token
Token ID251
Feature activation+8.22e-01

Pos logit contributions
ranch+0.024
 Kut+0.020
 Property+0.020
Men+0.020
 Scot+0.020
Neg logit contributions
hift-0.027
Reviewer-0.027
eeper-0.025
REDACTED-0.025
 spiked-0.024
Token
Token ID198
Feature activation-5.01e-01

Pos logit contributions
hift+0.706
Reviewer+0.700
eeper+0.660
REDACTED+0.648
 certific+0.622
Neg logit contributions
ranch-0.491
 Kut-0.406
 Property-0.403
 Scot-0.402
Men-0.400
Token<|endoftext|>
Token ID50256
Feature activation-5.27e-01

Pos logit contributions
ranch+0.086
 Kut+0.070
 Property+0.070
Men+0.069
 Scot+0.069
Neg logit contributions
hift-0.125
Reviewer-0.124
eeper-0.116
REDACTED-0.115
 spiked-0.110
TokenE
Token ID36
Feature activation-3.61e-01

Pos logit contributions
ranch+0.341
Men+0.288
 Kut+0.279
 Property+0.279
 Scot+0.277
Neg logit contributions
hift-0.427
Reviewer-0.416
eeper-0.398
REDACTED-0.385
 spiked-0.379
Token.
Token ID13
Feature activation-3.17e-01

Pos logit contributions
ranch+0.254
Men+0.212
 Property+0.212
 Kut+0.211
 Scot+0.211
Neg logit contributions
hift-0.272
Reviewer-0.268
eeper-0.250
REDACTED-0.248
 spiked-0.237
TokenS
Token ID50
Feature activation-8.82e-01

Pos logit contributions
ranch+0.196
 Kut+0.165
 Property+0.164
 Scot+0.163
Men+0.162
Neg logit contributions
hift-0.266
Reviewer-0.262
eeper-0.248
REDACTED-0.242
 spiked-0.232
Token.
Token ID13
Feature activation-1.51e+00

Pos logit contributions
ranch+0.665
 Kut+0.571
 Property+0.568
Men+0.565
 Scot+0.565
Neg logit contributions
hift-0.611
Reviewer-0.608
eeper-0.562
REDACTED-0.549
 spiked-0.523
Token Glover
Token ID43487
Feature activation-5.89e+00

Pos logit contributions
ranch+0.951
 Kut+0.807
 Property+0.802
 Scot+0.797
Men+0.793
Neg logit contributions
hift-1.240
Reviewer-1.233
eeper-1.150
REDACTED-1.127
 spiked-1.076
Token's
Token ID338
Feature activation-2.06e+00

Pos logit contributions
ranch+3.972
 Property+3.519
 Kut+3.513
 Scot+3.489
Men+3.430
Neg logit contributions
Reviewer-4.398
hift-4.379
eeper-4.097
REDACTED-3.939
anamo-3.769
Token bird
Token ID6512
Feature activation-6.78e-01

Pos logit contributions
ranch+1.374
 Property+1.192
 Kut+1.178
 Scot+1.173
Men+1.156
Neg logit contributions
Reviewer-1.606
hift-1.594
eeper-1.483
REDACTED-1.461
anamo-1.372
Tokensey
Token ID4397
Feature activation-1.54e+00

Pos logit contributions
ranch+0.501
 Kut+0.428
 Property+0.426
Men+0.423
 Scot+0.421
Neg logit contributions
hift-0.485
Reviewer-0.479
eeper-0.445
REDACTED-0.445
 spiked-0.413
Tokene
Token ID68
Feature activation-2.53e+00

Pos logit contributions
ranch+0.647
 Kut+0.485
 Property+0.478
 Scot+0.466
Men+0.457
Neg logit contributions
hift-1.585
Reviewer-1.578
eeper-1.495
REDACTED-1.495
 certific-1.439
Token map
Token ID3975
Feature activation-3.86e+00

Pos logit contributions
ranch+1.802
 Property+1.543
 Scot+1.539
 Kut+1.537
Men+1.506
Neg logit contributions
Reviewer-1.860
hift-1.849
eeper-1.735
REDACTED-1.702
anamo-1.622
Token on
Token ID319
Feature activation-4.66e-01

Pos logit contributions
ranch+1.847
 Kut+1.551
 Property+1.548
Men+1.536
 Scot+1.533
Neg logit contributions
hift-2.072
Reviewer-2.063
eeper-1.918
REDACTED-1.881
 spiked-1.802
Token the
Token ID262
Feature activation-3.82e-01

Pos logit contributions
ranch+0.300
 Property+0.255
 Scot+0.253
 Kut+0.253
Men+0.247
Neg logit contributions
Reviewer-0.373
hift-0.372
eeper-0.347
REDACTED-0.346
 spiked-0.324
Token data
Token ID1366
Feature activation-3.53e+00

Pos logit contributions
ranch+0.256
 Property+0.221
 Scot+0.218
 Kut+0.216
Men+0.211
Neg logit contributions
Reviewer-0.298
hift-0.294
eeper-0.275
REDACTED-0.273
anamo-0.256
Token with
Token ID351
Feature activation+1.43e-01

Pos logit contributions
ranch+2.365
 Scot+2.107
 Kut+2.103
 Property+2.064
Men+2.010
Neg logit contributions
Reviewer-2.634
hift-2.579
REDACTED-2.434
eeper-2.408
anamo-2.280
Token acceptable
Token ID10909
Feature activation+1.55e+00

Pos logit contributions
Reviewer+0.103
hift+0.102
REDACTED+0.094
eeper+0.094
 Paste+0.087
Neg logit contributions
ranch-0.106
 Scot-0.091
 Kut-0.090
 Property-0.090
Men-0.089
Token forecasting
Token ID41164
Feature activation-5.71e+00

Pos logit contributions
hift+0.918
Reviewer+0.915
eeper+0.833
REDACTED+0.818
 spiked+0.752
Neg logit contributions
ranch-1.324
 Scot-1.162
 Kut-1.160
 Property-1.160
Men-1.152
Token errors
Token ID8563
Feature activation-8.88e-01

Pos logit contributions
ranch+3.754
 Scot+3.361
 Kut+3.323
 Property+3.299
Men+3.198
Neg logit contributions
hift-4.241
Reviewer-4.238
eeper-3.911
REDACTED-3.875
anamo-3.713
Token to
Token ID284
Feature activation-2.62e-01

Pos logit contributions
ranch+0.583
 Kut+0.494
 Scot+0.492
 Property+0.490
Men+0.483
Neg logit contributions
Reviewer-0.696
hift-0.692
REDACTED-0.645
eeper-0.642
 Paste-0.606
Token give
Token ID1577
Feature activation+1.24e+00

Pos logit contributions
ranch+0.172
 Kut+0.144
 Property+0.144
 Scot+0.143
Men+0.140
Neg logit contributions
Reviewer-0.211
hift-0.208
eeper-0.196
REDACTED-0.195
anamo-0.185
Token the
Token ID262
Feature activation+9.48e-01

Pos logit contributions
Reviewer+0.933
hift+0.928
eeper+0.861
REDACTED+0.859
 Paste+0.802
Neg logit contributions
ranch-0.866
 Scot-0.741
 Property-0.738
 Kut-0.735
Men-0.730
Token confidence
Token ID6628
Feature activation+7.05e-02

Pos logit contributions
hift+0.726
Reviewer+0.725
eeper+0.673
REDACTED+0.666
 Paste+0.622
Neg logit contributions
ranch-0.651
 Property-0.560
 Scot-0.557
 Kut-0.552
Men-0.546
Token of
Token ID286
Feature activation-3.67e-01

Pos logit contributions
ranch+0.186
 Property+0.159
 Scot+0.157
 Kut+0.156
Men+0.155
Neg logit contributions
Reviewer-0.207
hift-0.206
eeper-0.191
REDACTED-0.188
anamo-0.179
Token the
Token ID262
Feature activation-3.60e-01

Pos logit contributions
ranch+0.237
 Kut+0.197
Men+0.196
 Property+0.195
 Scot+0.194
Neg logit contributions
hift-0.295
Reviewer-0.293
eeper-0.275
REDACTED-0.270
 spiked-0.259
Token board
Token ID3096
Feature activation-1.00e+00

Pos logit contributions
ranch+0.215
 Property+0.176
 Kut+0.175
 Scot+0.173
Men+0.172
Neg logit contributions
hift-0.308
Reviewer-0.308
eeper-0.288
REDACTED-0.284
 spiked-0.272
Token to
Token ID284
Feature activation+1.46e-02

Pos logit contributions
ranch+0.702
 Property+0.598
 Scot+0.592
 Kut+0.591
Men+0.585
Neg logit contributions
hift-0.750
Reviewer-0.749
eeper-0.689
REDACTED-0.680
anamo-0.646
Token the
Token ID262
Feature activation-3.81e-01

Pos logit contributions
hift+0.011
Reviewer+0.011
eeper+0.010
REDACTED+0.010
 spiked+0.009
Neg logit contributions
ranch-0.011
 Property-0.009
 Kut-0.009
 Scot-0.009
Men-0.009
Token exact
Token ID2748
Feature activation-5.67e+00

Pos logit contributions
ranch+0.235
 Property+0.196
 Kut+0.194
 Scot+0.192
Men+0.190
Neg logit contributions
hift-0.318
Reviewer-0.317
eeper-0.296
REDACTED-0.291
 spiked-0.279
Token previous
Token ID2180
Feature activation-2.61e-01

Pos logit contributions
ranch+3.242
 Property+2.672
 Scot+2.584
 Kut+2.525
Men+2.511
Neg logit contributions
Reviewer-4.834
hift-4.796
REDACTED-4.470
eeper-4.453
anamo-4.388
Token state
Token ID1181
Feature activation-4.86e-01

Pos logit contributions
ranch+0.155
 Property+0.127
 Kut+0.126
 Scot+0.125
Men+0.124
Neg logit contributions
Reviewer-0.224
hift-0.224
eeper-0.209
REDACTED-0.206
 spiked-0.197
Token.
Token ID13
Feature activation-5.12e-01

Pos logit contributions
ranch+0.335
 Property+0.286
 Scot+0.283
 Kut+0.283
Men+0.279
Neg logit contributions
Reviewer-0.366
hift-0.366
eeper-0.339
REDACTED-0.334
 spiked-0.318
Token If
Token ID1002
Feature activation-1.83e-01

Pos logit contributions
ranch+0.285
 Property+0.233
 Kut+0.230
 Scot+0.228
Men+0.226
Neg logit contributions
hift-0.459
Reviewer-0.457
eeper-0.430
REDACTED-0.425
 spiked-0.407
Token we
Token ID356
Feature activation+8.01e-01

Pos logit contributions
ranch+0.123
 Property+0.103
 Kut+0.103
 Scot+0.102
Men+0.102
Neg logit contributions
hift-0.143
Reviewer-0.142
eeper-0.132
REDACTED-0.130
 spiked-0.124
Token happen
Token ID1645
Feature activation-1.00e+00

Pos logit contributions
ranch+0.486
 Kut+0.405
 Property+0.403
 Scot+0.403
Men+0.400
Neg logit contributions
Reviewer-0.565
hift-0.558
eeper-0.517
REDACTED-0.512
anamo-0.488
Token with
Token ID351
Feature activation+1.37e-01

Pos logit contributions
ranch+0.658
 Kut+0.556
 Scot+0.555
 Property+0.550
Men+0.550
Neg logit contributions
Reviewer-0.799
hift-0.795
eeper-0.733
REDACTED-0.731
 certific-0.690
Token solid
Token ID4735
Feature activation+1.28e+00

Pos logit contributions
hift+0.107
Reviewer+0.106
eeper+0.099
REDACTED+0.097
 certific+0.092
Neg logit contributions
ranch-0.093
 Kut-0.079
 Scot-0.079
 Property-0.079
Men-0.077
Token-
Token ID12
Feature activation-1.67e-01

Pos logit contributions
hift+0.944
Reviewer+0.935
eeper+0.870
REDACTED+0.857
 spiked+0.810
Neg logit contributions
ranch-0.918
 Property-0.778
 Kut-0.776
Men-0.772
 Scot-0.771
Tokenorgan
Token ID9971
Feature activation-4.15e-01

Pos logit contributions
ranch+0.100
 Kut+0.082
 Property+0.081
Men+0.081
 Scot+0.080
Neg logit contributions
hift-0.143
Reviewer-0.142
eeper-0.134
REDACTED-0.132
 spiked-0.126
Token transpl
Token ID37100
Feature activation-5.63e+00

Pos logit contributions
ranch+0.278
 Kut+0.236
 Property+0.236
 Scot+0.235
Men+0.233
Neg logit contributions
hift-0.326
Reviewer-0.317
eeper-0.305
REDACTED-0.296
 certific-0.280
Tokenants
Token ID1187
Feature activation-1.70e+00

Pos logit contributions
ranch+3.334
Men+2.703
 Property+2.670
 Kut+2.619
 Wong+2.607
Neg logit contributions
Reviewer-4.928
hift-4.854
REDACTED-4.558
eeper-4.439
 certific-4.308
Token,
Token ID11
Feature activation+1.60e-03

Pos logit contributions
ranch+1.075
 Kut+0.909
 Scot+0.895
 Property+0.893
Men+0.892
Neg logit contributions
Reviewer-1.380
hift-1.369
eeper-1.281
REDACTED-1.262
 Insight-1.192
Token where
Token ID810
Feature activation-3.92e-01

Pos logit contributions
hift+0.001
Reviewer+0.001
eeper+0.001
REDACTED+0.001
 certific+0.001
Neg logit contributions
ranch-0.001
 Kut-0.001
 Scot-0.001
 Property-0.001
Men-0.001
Token a
Token ID257
Feature activation+5.84e-02

Pos logit contributions
ranch+0.260
 Kut+0.219
 Property+0.218
 Scot+0.218
Men+0.215
Neg logit contributions
hift-0.310
Reviewer-0.307
eeper-0.287
REDACTED-0.282
 certific-0.268
Token patient
Token ID5827
Feature activation+7.65e-01

Pos logit contributions
hift+0.044
Reviewer+0.043
eeper+0.040
REDACTED+0.039
anamo+0.037
Neg logit contributions
ranch-0.042
 Kut-0.036
 Scot-0.036
 Property-0.035
Men-0.035
Token real
Token ID1103
Feature activation+3.48e-02

Pos logit contributions
ranch+0.400
 Property+0.345
 Scot+0.342
 Kut+0.340
Men+0.329
Neg logit contributions
Reviewer-0.433
hift-0.431
REDACTED-0.401
eeper-0.401
 Paste-0.372
Token estate
Token ID7964
Feature activation-1.64e+00

Pos logit contributions
hift+0.033
Reviewer+0.033
eeper+0.031
REDACTED+0.031
 spiked+0.030
Neg logit contributions
ranch-0.017
 Property-0.013
 Kut-0.013
Men-0.013
 Scot-0.013
Token magn
Token ID7842
Feature activation-1.93e-01

Pos logit contributions
ranch+1.145
 Property+0.993
 Kut+0.986
 Scot+0.985
Men+0.964
Neg logit contributions
Reviewer-1.228
hift-1.221
eeper-1.133
REDACTED-1.127
anamo-1.054
Tokenate
Token ID378
Feature activation-1.50e-01

Pos logit contributions
ranch+0.127
 Property+0.106
 Kut+0.105
Men+0.105
 Scot+0.105
Neg logit contributions
Reviewer-0.155
hift-0.153
REDACTED-0.142
eeper-0.141
 spiked-0.135
Token turned
Token ID2900
Feature activation+1.73e+00

Pos logit contributions
ranch+0.083
 Scot+0.069
 Kut+0.069
 Property+0.069
Men+0.066
Neg logit contributions
Reviewer-0.136
hift-0.134
eeper-0.126
REDACTED-0.125
 certific-0.118
Token philanthrop
Token ID28150
Feature activation-5.55e+00

Pos logit contributions
Reviewer+1.483
hift+1.477
eeper+1.383
REDACTED+1.366
 spiked+1.298
Neg logit contributions
ranch-1.035
 Property-0.855
 Kut-0.849
 Scot-0.847
Men-0.828
Tokenist
Token ID396
Feature activation-1.83e+00

Pos logit contributions
ranch+1.452
 Scot+0.859
 Property+0.821
 Kut+0.794
Men+0.760
Neg logit contributions
Reviewer-6.512
hift-6.455
REDACTED-6.265
eeper-6.057
 spiked-5.992
Token Herbert
Token ID28648
Feature activation-1.71e+00

Pos logit contributions
ranch+1.026
 Scot+0.881
 Property+0.869
 Kut+0.868
 Wong+0.842
Neg logit contributions
Reviewer-1.639
hift-1.611
eeper-1.515
REDACTED-1.507
 certific-1.422
Token Hale
Token ID35056
Feature activation-1.72e+00

Pos logit contributions
ranch+0.997
 Property+0.855
 Kut+0.854
 Scot+0.844
 Wong+0.813
Neg logit contributions
Reviewer-1.516
hift-1.494
eeper-1.388
REDACTED-1.387
 certific-1.325
Token Williams
Token ID6484
Feature activation-2.39e+00

Pos logit contributions
ranch+1.156
 Property+0.999
 Kut+0.997
 Scot+0.982
Men+0.963
Neg logit contributions
hift-1.339
Reviewer-1.336
eeper-1.242
REDACTED-1.228
 certific-1.169
Token,
Token ID11
Feature activation-3.91e-01

Pos logit contributions
ranch+1.597
 Property+1.399
 Kut+1.398
 Scot+1.376
Men+1.357
Neg logit contributions
Reviewer-1.860
hift-1.791
eeper-1.710
REDACTED-1.690
 certific-1.599
Tokenement
Token ID972
Feature activation+8.90e-01

Pos logit contributions
ranch+1.778
 Property+1.517
 Kut+1.513
 Scot+1.510
Men+1.499
Neg logit contributions
Reviewer-1.919
hift-1.896
eeper-1.736
REDACTED-1.733
 spiked-1.645
Token d
Token ID288
Feature activation-2.60e-01

Pos logit contributions
hift+0.637
Reviewer+0.637
eeper+0.587
REDACTED+0.575
 spiked+0.547
Neg logit contributions
ranch-0.657
 Kut-0.560
 Property-0.560
 Scot-0.556
Men-0.555
Tokenans
Token ID504
Feature activation-2.06e+00

Pos logit contributions
ranch+0.174
Men+0.144
 Kut+0.144
 Property+0.144
 Scot+0.142
Neg logit contributions
Reviewer-0.207
hift-0.206
REDACTED-0.189
eeper-0.188
 spiked-0.181
Token c
Token ID269
Feature activation+8.69e-01

Pos logit contributions
ranch+1.412
 Property+1.203
 Kut+1.196
 Scot+1.191
Men+1.189
Neg logit contributions
Reviewer-1.590
hift-1.567
eeper-1.461
REDACTED-1.417
 spiked-1.355
Tokenes
Token ID274
Feature activation-1.30e+00

Pos logit contributions
Reviewer+0.618
hift+0.615
eeper+0.563
REDACTED+0.557
 spiked+0.534
Neg logit contributions
ranch-0.654
 Kut-0.551
 Property-0.549
Men-0.548
 Scot-0.545
Token dro
Token ID3102
Feature activation-5.51e+00

Pos logit contributions
ranch+0.907
 Property+0.767
 Kut+0.766
 Scot+0.762
Men+0.760
Neg logit contributions
Reviewer-0.981
hift-0.973
eeper-0.911
REDACTED-0.890
 spiked-0.843
Tokenits
Token ID896
Feature activation+3.01e-01

Pos logit contributions
ranch+3.467
Men+2.765
 Scot+2.752
 Kut+2.711
 Property+2.699
Neg logit contributions
Reviewer-4.472
hift-4.433
REDACTED-4.205
eeper-4.084
 spiked-3.968
Token en
Token ID551
Feature activation-1.60e+00

Pos logit contributions
Reviewer+0.212
hift+0.211
eeper+0.196
REDACTED+0.191
 spiked+0.181
Neg logit contributions
ranch-0.226
 Kut-0.194
 Property-0.194
 Scot-0.192
Men-0.192
Token rest
Token ID1334
Feature activation-2.50e+00

Pos logit contributions
ranch+1.109
 Kut+0.935
 Property+0.933
 Scot+0.926
Men+0.922
Neg logit contributions
Reviewer-1.218
hift-1.210
eeper-1.126
REDACTED-1.095
 spiked-1.047
Tokenre
Token ID260
Feature activation-1.23e+00

Pos logit contributions
ranch+1.414
 Kut+1.105
 Property+1.100
 Scot+1.098
Men+1.097
Neg logit contributions
Reviewer-2.242
hift-2.241
eeper-2.078
REDACTED-2.063
 spiked-1.999
Tokenignant
Token ID25114
Feature activation-4.61e-01

Pos logit contributions
ranch+0.872
 Kut+0.738
 Property+0.738
 Scot+0.736
Men+0.731
Neg logit contributions
Reviewer-0.914
hift-0.911
eeper-0.848
REDACTED-0.828
 spiked-0.786
Token st
Token ID336
Feature activation+9.92e-01

Pos logit contributions
ranch+0.026
 Kut+0.023
 Scot+0.022
 Property+0.022
Men+0.022
Neg logit contributions
Reviewer-0.029
hift-0.029
eeper-0.026
REDACTED-0.026
 Paste-0.025
Tokenaking
Token ID868
Feature activation-1.85e+00

Pos logit contributions
hift+0.670
Reviewer+0.668
REDACTED+0.610
eeper+0.592
 spiked+0.575
Neg logit contributions
ranch-0.775
Men-0.658
 Property-0.656
 Kut-0.656
 Scot-0.649
Token out
Token ID503
Feature activation-8.87e-01

Pos logit contributions
ranch+1.231
 Kut+1.055
 Property+1.048
 Scot+1.040
Men+1.028
Neg logit contributions
Reviewer-1.461
hift-1.429
eeper-1.332
REDACTED-1.309
 Paste-1.244
Token is
Token ID318
Feature activation-2.07e-01

Pos logit contributions
ranch+0.608
 Kut+0.526
 Scot+0.517
 Property+0.515
Men+0.514
Neg logit contributions
Reviewer-0.683
hift-0.673
eeper-0.620
REDACTED-0.615
 Paste-0.582
Token the
Token ID262
Feature activation-5.43e-01

Pos logit contributions
ranch+0.139
 Kut+0.117
 Property+0.116
 Scot+0.116
Men+0.115
Neg logit contributions
hift-0.162
Reviewer-0.162
eeper-0.150
REDACTED-0.148
 certific-0.140
Token exact
Token ID2748
Feature activation-5.49e+00

Pos logit contributions
ranch+0.342
 Kut+0.284
 Property+0.283
 Scot+0.279
Men+0.272
Neg logit contributions
Reviewer-0.450
hift-0.450
eeper-0.418
REDACTED-0.416
 Paste-0.392
Token subject
Token ID2426
Feature activation+3.44e-01

Pos logit contributions
ranch+2.508
 Kut+1.921
 Property+1.880
 Scot+1.848
Men+1.757
Neg logit contributions
Reviewer-5.418
hift-5.341
REDACTED-5.105
eeper-5.099
 Paste-4.814
Token which
Token ID543
Feature activation-2.15e+00

Pos logit contributions
hift+0.265
Reviewer+0.263
eeper+0.245
REDACTED+0.240
 spiked+0.230
Neg logit contributions
ranch-0.236
 Kut-0.199
 Property-0.199
 Scot-0.198
Men-0.197
Token most
Token ID749
Feature activation-1.96e+00

Pos logit contributions
ranch+1.457
 Kut+1.273
 Scot+1.266
 Property+1.258
Men+1.255
Neg logit contributions
Reviewer-1.644
hift-1.635
eeper-1.519
REDACTED-1.464
 certific-1.395
Token of
Token ID286
Feature activation-1.52e+00

Pos logit contributions
ranch+1.223
 Kut+1.044
 Scot+1.037
Men+1.032
 Property+1.028
Neg logit contributions
Reviewer-1.623
hift-1.585
eeper-1.514
REDACTED-1.467
anamo-1.377
Token the
Token ID262
Feature activation-8.93e-01

Pos logit contributions
ranch+0.958
 Kut+0.814
 Scot+0.811
 Property+0.798
Men+0.798
Neg logit contributions
Reviewer-1.246
hift-1.221
eeper-1.164
REDACTED-1.135
 certific-1.079
Token Vil
Token ID34037
Feature activation-2.44e-01

Pos logit contributions
ranch+1.330
 Kut+1.164
 Property+1.140
 Scot+1.139
Men+1.121
Neg logit contributions
Reviewer-1.748
hift-1.738
eeper-1.627
REDACTED-1.620
 certific-1.534
Tokenas
Token ID292
Feature activation-2.32e+00

Pos logit contributions
ranch+0.132
 Property+0.107
 Kut+0.106
 Scot+0.105
Men+0.104
Neg logit contributions
Reviewer-0.223
hift-0.222
REDACTED-0.208
eeper-0.206
 spiked-0.199
Tokenra
Token ID430
Feature activation-3.14e+00

Pos logit contributions
ranch+1.324
 Kut+1.096
 Property+1.087
 Scot+1.082
Men+1.065
Neg logit contributions
Reviewer-2.062
hift-2.044
eeper-1.912
REDACTED-1.907
 certific-1.805
Tokeno
Token ID78
Feature activation-2.84e+00

Pos logit contributions
ranch+1.776
 Property+1.508
 Kut+1.505
Men+1.489
 Scot+1.472
Neg logit contributions
Reviewer-2.792
hift-2.773
REDACTED-2.583
eeper-2.569
 certific-2.449
Token Des
Token ID2935
Feature activation-3.44e+00

Pos logit contributions
ranch+1.707
 Kut+1.509
 Property+1.482
 Scot+1.464
Men+1.428
Neg logit contributions
Reviewer-2.426
hift-2.387
eeper-2.244
REDACTED-2.231
 certific-2.120
Tokenhm
Token ID23940
Feature activation-5.47e+00

Pos logit contributions
ranch+1.919
 Property+1.557
 Kut+1.557
Men+1.527
 Scot+1.505
Neg logit contributions
Reviewer-3.157
hift-3.099
eeper-2.878
REDACTED-2.864
 spiked-2.775
Tokenuk
Token ID2724
Feature activation-2.07e+00

Pos logit contributions
ranch+2.647
 Scot+2.028
 Property+1.987
Men+1.976
 Kut+1.962
Neg logit contributions
Reviewer-5.459
hift-5.407
eeper-4.926
REDACTED-4.911
 certific-4.890
Tokenh
Token ID71
Feature activation-2.74e+00

Pos logit contributions
ranch+1.060
 Kut+0.834
 Property+0.824
Men+0.812
 Scot+0.806
Neg logit contributions
Reviewer-1.962
hift-1.946
REDACTED-1.832
eeper-1.828
 certific-1.757
Token for
Token ID329
Feature activation-7.68e-01

Pos logit contributions
ranch+2.032
 Kut+1.759
 Property+1.748
 Scot+1.745
Men+1.717
Neg logit contributions
hift-1.955
Reviewer-1.914
eeper-1.772
REDACTED-1.752
 certific-1.661
Token allot
Token ID47405
Feature activation-3.75e-01

Pos logit contributions
ranch+0.507
 Kut+0.433
 Property+0.428
 Scot+0.425
Men+0.418
Neg logit contributions
hift-0.609
Reviewer-0.608
eeper-0.566
REDACTED-0.554
 Paste-0.526
Tokenting
Token ID889
Feature activation-5.16e-01

Pos logit contributions
ranch+0.270
 Kut+0.230
 Property+0.230
 Scot+0.226
Men+0.225
Neg logit contributions
hift-0.278
Reviewer-0.274
eeper-0.253
REDACTED-0.253
 spiked-0.239
Token of
Token ID286
Feature activation-2.81e-01

Pos logit contributions
hift+0.999
Reviewer+0.986
eeper+0.910
REDACTED+0.900
 spiked+0.870
Neg logit contributions
ranch-1.056
 Property-0.886
 Kut-0.885
 Scot-0.881
Men-0.879
Token the
Token ID262
Feature activation+2.03e-01

Pos logit contributions
ranch+0.172
 Kut+0.144
 Property+0.143
Men+0.140
 Scot+0.140
Neg logit contributions
Reviewer-0.236
hift-0.235
eeper-0.220
REDACTED-0.216
 certific-0.205
Token mission
Token ID4365
Feature activation+2.45e-01

Pos logit contributions
Reviewer+0.155
hift+0.153
eeper+0.145
REDACTED+0.140
 certific+0.133
Neg logit contributions
ranch-0.139
 Kut-0.120
 Property-0.119
 Scot-0.117
Men-0.117
Token means
Token ID1724
Feature activation+1.21e-01

Pos logit contributions
Reviewer+0.188
hift+0.187
eeper+0.174
REDACTED+0.171
 certific+0.161
Neg logit contributions
ranch-0.169
 Kut-0.145
 Property-0.143
Men-0.141
 Scot-0.141
Token the
Token ID262
Feature activation+1.45e-01

Pos logit contributions
Reviewer+0.088
hift+0.087
eeper+0.081
REDACTED+0.080
 spiked+0.075
Neg logit contributions
ranch-0.087
 Kut-0.075
 Property-0.075
 Scot-0.074
Men-0.074
Token exact
Token ID2748
Feature activation-5.47e+00

Pos logit contributions
Reviewer+0.110
hift+0.108
eeper+0.102
REDACTED+0.099
anamo+0.093
Neg logit contributions
ranch-0.101
 Kut-0.088
 Property-0.087
 Scot-0.087
Men-0.086
Token date
Token ID3128
Feature activation-2.99e+00

Pos logit contributions
ranch+3.442
 Kut+2.985
 Property+2.927
 Scot+2.891
Men+2.874
Neg logit contributions
Reviewer-4.372
hift-4.177
eeper-3.983
REDACTED-3.916
anamo-3.747
Token and
Token ID290
Feature activation-4.16e-01

Pos logit contributions
ranch+1.846
 Kut+1.587
 Property+1.581
 Scot+1.568
Men+1.557
Neg logit contributions
Reviewer-2.438
hift-2.399
eeper-2.249
REDACTED-2.217
 certific-2.120
Token method
Token ID2446
Feature activation-6.50e-01

Pos logit contributions
ranch+0.237
 Kut+0.195
 Property+0.195
 Scot+0.193
Men+0.191
Neg logit contributions
Reviewer-0.366
hift-0.363
eeper-0.341
REDACTED-0.336
anamo-0.322
Token of
Token ID286
Feature activation-7.05e-01

Pos logit contributions
ranch+0.426
 Kut+0.362
 Property+0.361
Men+0.357
 Scot+0.357
Neg logit contributions
Reviewer-0.510
hift-0.509
eeper-0.475
REDACTED-0.467
 certific-0.447
Token delivery
Token ID7585
Feature activation-4.74e-01

Pos logit contributions
ranch+0.563
 Kut+0.491
 Property+0.491
 Scot+0.486
Men+0.483
Neg logit contributions
Reviewer-0.455
hift-0.455
eeper-0.420
REDACTED-0.410
anamo-0.381
Token bees
Token ID17002
Feature activation-5.46e-01

Pos logit contributions
ranch+0.415
 Scot+0.353
 Kut+0.353
 Property+0.350
Men+0.343
Neg logit contributions
Reviewer-0.482
hift-0.481
eeper-0.451
REDACTED-0.444
 Paste-0.415
Token,
Token ID11
Feature activation+3.36e-01

Pos logit contributions
ranch+0.379
 Kut+0.321
 Scot+0.317
 Property+0.316
Men+0.313
Neg logit contributions
Reviewer-0.416
hift-0.411
eeper-0.387
REDACTED-0.382
 Insight-0.358
Token which
Token ID543
Feature activation-1.44e+00

Pos logit contributions
Reviewer+0.255
hift+0.255
eeper+0.237
REDACTED+0.233
 certific+0.216
Neg logit contributions
ranch-0.232
 Kut-0.200
 Scot-0.200
 Property-0.198
Men-0.194
Token she
Token ID673
Feature activation-7.54e-01

Pos logit contributions
ranch+0.993
 Scot+0.884
 Property+0.880
 Kut+0.878
Men+0.842
Neg logit contributions
Reviewer-1.092
hift-1.071
eeper-1.009
REDACTED-0.994
anamo-0.923
Token is
Token ID318
Feature activation-1.23e+00

Pos logit contributions
ranch+0.549
 Kut+0.482
 Scot+0.477
 Property+0.474
Men+0.463
Neg logit contributions
Reviewer-0.547
hift-0.538
eeper-0.503
REDACTED-0.495
anamo-0.461
Token allergic
Token ID31856
Feature activation-5.46e+00

Pos logit contributions
ranch+0.838
 Kut+0.731
 Scot+0.726
 Property+0.724
Men+0.699
Neg logit contributions
Reviewer-0.927
hift-0.919
eeper-0.875
REDACTED-0.855
anamo-0.804
Token to
Token ID284
Feature activation-7.82e-01

Pos logit contributions
ranch+4.014
 Kut+3.423
 Property+3.411
 Scot+3.395
Men+3.388
Neg logit contributions
hift-3.897
Reviewer-3.862
eeper-3.572
REDACTED-3.506
 spiked-3.316
Token.
Token ID13
Feature activation-5.31e-01

Pos logit contributions
ranch+0.541
 Kut+0.458
 Property+0.457
 Scot+0.453
Men+0.451
Neg logit contributions
Reviewer-0.593
hift-0.592
eeper-0.550
REDACTED-0.542
 certific-0.509
Token She
Token ID1375
Feature activation+1.45e-01

Pos logit contributions
ranch+0.289
 Property+0.248
 Scot+0.246
 Kut+0.245
Men+0.237
Neg logit contributions
Reviewer-0.478
hift-0.478
eeper-0.453
REDACTED-0.445
 certific-0.431
Token doesn
Token ID1595
Feature activation-3.19e-01

Pos logit contributions
Reviewer+0.111
hift+0.109
eeper+0.101
REDACTED+0.100
anamo+0.094
Neg logit contributions
ranch-0.101
 Kut-0.088
 Scot-0.087
 Property-0.087
Men-0.085
Token't
Token ID470
Feature activation-7.94e-01

Pos logit contributions
ranch+0.152
Men+0.119
 Kut+0.119
 Property+0.118
 Scot+0.118
Neg logit contributions
hift-0.313
Reviewer-0.309
eeper-0.296
 certific-0.288
REDACTED-0.286
Token
Token ID198
Feature activation+7.33e-01

Pos logit contributions
ranch+0.753
 Kut+0.615
 Property+0.614
 Scot+0.609
Men+0.608
Neg logit contributions
hift-1.066
Reviewer-1.061
eeper-0.994
REDACTED-0.979
 spiked-0.939
Token
Token ID198
Feature activation+5.95e-01

Pos logit contributions
hift+0.685
Reviewer+0.682
eeper+0.646
REDACTED+0.629
 certific+0.617
Neg logit contributions
ranch-0.383
 Kut-0.304
 Property-0.304
Men-0.302
 Scot-0.301
Token1994
Token ID22666
Feature activation-1.06e+00

Pos logit contributions
hift+0.498
Reviewer+0.488
eeper+0.462
REDACTED+0.449
 spiked+0.437
Neg logit contributions
ranch-0.372
Men-0.305
 Property-0.302
 Kut-0.302
 Scot-0.300
Token Jerry
Token ID13075
Feature activation-1.13e+00

Pos logit contributions
ranch+0.583
 Scot+0.503
 Kut+0.499
 Property+0.489
Men+0.476
Neg logit contributions
hift-0.945
Reviewer-0.942
eeper-0.890
REDACTED-0.870
 certific-0.835
Token Se
Token ID1001
Feature activation-2.15e+00

Pos logit contributions
ranch+0.420
 Scot+0.341
 Property+0.337
 Kut+0.336
 Wong+0.324
Neg logit contributions
Reviewer-1.240
hift-1.215
eeper-1.158
REDACTED-1.139
anamo-1.105
Tokeninfeld
Token ID47187
Feature activation-5.40e+00

Pos logit contributions
ranch+1.074
Men+0.835
 Property+0.823
WC+0.802
 Scot+0.781
Neg logit contributions
hift-2.061
Reviewer-2.021
eeper-1.918
REDACTED-1.917
 certific-1.870
Token 1995
Token ID8735
Feature activation-2.13e-01

Pos logit contributions
ranch+3.043
 Property+2.459
 Scot+2.445
 Kut+2.443
Men+2.426
Neg logit contributions
hift-4.763
Reviewer-4.745
eeper-4.450
REDACTED-4.372
 certific-4.176
Token Kel
Token ID15150
Feature activation-9.87e-01

Pos logit contributions
ranch+0.120
 Kut+0.100
 Scot+0.100
 Property+0.099
Men+0.097
Neg logit contributions
hift-0.189
Reviewer-0.188
eeper-0.177
REDACTED-0.174
 spiked-0.167
Tokensey
Token ID4397
Feature activation-2.09e+00

Pos logit contributions
ranch+0.785
 Kut+0.690
 Scot+0.686
 Property+0.679
Men+0.676
Neg logit contributions
hift-0.654
Reviewer-0.648
REDACTED-0.594
eeper-0.579
 certific-0.567
Token Gram
Token ID20159
Feature activation-3.69e+00

Pos logit contributions
ranch+0.892
 Property+0.764
 Scot+0.732
 Kut+0.727
 Wong+0.709
Neg logit contributions
hift-2.127
Reviewer-2.101
eeper-2.018
REDACTED-1.983
anamo-1.937
Tokenmer
Token ID647
Feature activation-1.02e+00

Pos logit contributions
ranch+1.054
Men+0.639
 Property+0.581
 Scot+0.559
 Kut+0.520
Neg logit contributions
Reviewer-4.294
hift-4.273
REDACTED-4.100
 certific-4.081
eeper-4.043
Token:
Token ID25
Feature activation-9.83e-01

Pos logit contributions
ranch+0.550
 Kut+0.480
 Property+0.479
Men+0.476
 Scot+0.476
Neg logit contributions
hift-0.373
Reviewer-0.371
eeper-0.337
REDACTED-0.329
 spiked-0.310
Token E
Token ID412
Feature activation-8.30e-01

Pos logit contributions
ranch+0.579
 Scot+0.492
 Property+0.488
 Kut+0.487
Men+0.475
Neg logit contributions
hift-0.846
Reviewer-0.841
eeper-0.787
REDACTED-0.779
 spiked-0.742
Token.
Token ID13
Feature activation-6.69e-01

Pos logit contributions
ranch+0.641
 Kut+0.557
 Property+0.556
Men+0.548
 Scot+0.544
Neg logit contributions
hift-0.556
Reviewer-0.554
eeper-0.511
REDACTED-0.497
 Paste-0.474
TokenS
Token ID50
Feature activation-1.22e+00

Pos logit contributions
ranch+0.378
 Kut+0.305
 Property+0.304
Men+0.301
 Scot+0.300
Neg logit contributions
hift-0.594
Reviewer-0.591
eeper-0.555
REDACTED-0.548
 spiked-0.527
Token.
Token ID13
Feature activation-1.57e+00

Pos logit contributions
ranch+0.889
 Property+0.754
 Kut+0.752
 Scot+0.748
Men+0.746
Neg logit contributions
hift-0.886
Reviewer-0.883
eeper-0.814
REDACTED-0.803
 spiked-0.765
Token Glover
Token ID43487
Feature activation-5.38e+00

Pos logit contributions
ranch+0.937
 Scot+0.777
 Property+0.777
 Kut+0.777
Men+0.771
Neg logit contributions
hift-1.327
Reviewer-1.327
eeper-1.244
REDACTED-1.216
 spiked-1.173
Token D
Token ID360
Feature activation-1.67e+00

Pos logit contributions
ranch+3.372
 Property+2.958
 Scot+2.914
 Kut+2.879
Men+2.788
Neg logit contributions
hift-4.354
Reviewer-4.289
eeper-4.076
REDACTED-4.001
anamo-3.820
Tokenwn
Token ID675
Feature activation-2.62e+00

Pos logit contributions
ranch+1.131
 Kut+0.931
 Property+0.924
Men+0.921
 Scot+0.917
Neg logit contributions
hift-1.323
Reviewer-1.314
REDACTED-1.207
eeper-1.190
 spiked-1.161
Tokenld
Token ID335
Feature activation-1.11e+00

Pos logit contributions
ranch+1.925
 Property+1.651
 Scot+1.628
 Kut+1.624
Men+1.613
Neg logit contributions
hift-1.878
Reviewer-1.867
REDACTED-1.719
eeper-1.711
 spiked-1.640
Token:
Token ID25
Feature activation-1.58e+00

Pos logit contributions
ranch+0.813
 Property+0.693
 Kut+0.686
 Scot+0.685
Men+0.678
Neg logit contributions
hift-0.806
Reviewer-0.804
REDACTED-0.739
eeper-0.736
 spiked-0.699
Token Full
Token ID6462
Feature activation-2.76e+00

Pos logit contributions
ranch+0.960
 Property+0.814
 Scot+0.812
 Kut+0.805
Men+0.794
Neg logit contributions
hift-1.329
Reviewer-1.324
eeper-1.239
REDACTED-1.215
 spiked-1.164
Token
Token ID198
Feature activation+6.93e-01

Pos logit contributions
ranch+0.416
 Kut+0.338
Men+0.337
 Property+0.335
 Scot+0.332
Neg logit contributions
hift-0.594
Reviewer-0.589
eeper-0.550
REDACTED-0.547
 Paste-0.531
Token
Token ID198
Feature activation+6.80e-01

Pos logit contributions
hift+0.647
Reviewer+0.645
eeper+0.610
REDACTED+0.595
 certific+0.583
Neg logit contributions
ranch-0.363
 Kut-0.288
 Property-0.287
Men-0.285
 Scot-0.284
Token1992
Token ID23847
Feature activation-4.83e-02

Pos logit contributions
Reviewer+0.516
hift+0.500
REDACTED+0.470
eeper+0.463
 Paste+0.431
Neg logit contributions
ranch-0.470
 Kut-0.406
 Property-0.405
 Scot-0.402
 Wong-0.390
Token Jerry
Token ID13075
Feature activation-8.38e-01

Pos logit contributions
ranch+0.027
 Scot+0.023
 Kut+0.023
 Property+0.022
Men+0.022
Neg logit contributions
hift-0.043
Reviewer-0.043
eeper-0.040
REDACTED-0.040
 certific-0.038
Token Se
Token ID1001
Feature activation-1.80e+00

Pos logit contributions
ranch+0.487
 Kut+0.412
 Property+0.412
 Scot+0.411
Men+0.400
Neg logit contributions
Reviewer-0.734
hift-0.727
eeper-0.679
REDACTED-0.674
 certific-0.637
Tokeninfeld
Token ID47187
Feature activation-5.37e+00

Pos logit contributions
ranch+0.925
Men+0.738
 Kut+0.712
 Property+0.705
WC+0.701
Neg logit contributions
hift-1.683
Reviewer-1.678
REDACTED-1.578
eeper-1.575
 certific-1.517
Token 1993
Token ID9656
Feature activation+2.85e-01

Pos logit contributions
ranch+2.850
 Kut+2.268
 Property+2.265
 Scot+2.240
Men+2.226
Neg logit contributions
hift-4.940
Reviewer-4.911
eeper-4.631
REDACTED-4.560
 spiked-4.383
Token Ted
Token ID11396
Feature activation-1.39e+00

Pos logit contributions
hift+0.250
Reviewer+0.249
eeper+0.235
REDACTED+0.231
 spiked+0.221
Neg logit contributions
ranch-0.163
 Scot-0.135
 Kut-0.135
 Property-0.134
Men-0.131
Token D
Token ID360
Feature activation-2.68e+00

Pos logit contributions
ranch+1.004
 Kut+0.917
 Property+0.908
 Scot+0.905
 Wong+0.897
Neg logit contributions
hift-1.008
Reviewer-1.008
eeper-0.935
 certific-0.888
REDACTED-0.887
Tokenanson
Token ID23103
Feature activation-3.25e+00

Pos logit contributions
ranch+1.142
Men+0.790
 Kut+0.771
 Property+0.752
 Wong+0.742
Neg logit contributions
hift-2.815
Reviewer-2.781
REDACTED-2.637
eeper-2.599
 certific-2.591
Token
Token ID198
Feature activation+2.21e-01

Pos logit contributions
ranch+1.698
 Property+1.345
 Kut+1.341
 Scot+1.328
Men+1.323
Neg logit contributions
hift-3.022
Reviewer-3.017
eeper-2.839
REDACTED-2.799
 certific-2.701
Token provides
Token ID3769
Feature activation+1.23e+00

Pos logit contributions
ranch+0.371
 Property+0.322
 Kut+0.317
 Scot+0.316
Men+0.311
Neg logit contributions
Reviewer-0.407
hift-0.404
eeper-0.375
REDACTED-0.369
anamo-0.355
Token an
Token ID281
Feature activation-1.24e-01

Pos logit contributions
hift+0.953
Reviewer+0.950
eeper+0.881
REDACTED+0.875
 spiked+0.826
Neg logit contributions
ranch-0.822
 Kut-0.700
 Property-0.695
 Scot-0.692
Men-0.681
Token unbiased
Token ID46735
Feature activation+2.20e+00

Pos logit contributions
ranch+0.093
 Property+0.081
 Kut+0.081
 Scot+0.080
Men+0.079
Neg logit contributions
hift-0.086
Reviewer-0.085
REDACTED-0.078
eeper-0.077
 Paste-0.073
Token and
Token ID290
Feature activation+3.61e-01

Pos logit contributions
hift+1.785
Reviewer+1.775
eeper+1.659
REDACTED+1.640
 spiked+1.556
Neg logit contributions
ranch-1.404
 Property-1.169
 Kut-1.169
 Scot-1.160
Men-1.149
Token efficient
Token ID6942
Feature activation-2.73e-01

Pos logit contributions
Reviewer+0.253
hift+0.251
eeper+0.229
REDACTED+0.229
anamo+0.217
Neg logit contributions
ranch-0.272
 Property-0.235
 Scot-0.234
 Kut-0.234
Men-0.229
Token estim
Token ID3959
Feature activation-5.35e+00

Pos logit contributions
ranch+0.171
 Property+0.145
 Kut+0.144
 Scot+0.144
Men+0.139
Neg logit contributions
hift-0.223
Reviewer-0.223
eeper-0.209
REDACTED-0.207
anamo-0.198
Tokenator
Token ID1352
Feature activation-2.10e+00

Pos logit contributions
ranch+1.731
 Scot+1.253
 Kut+1.147
 Property+1.125
Men+1.104
Neg logit contributions
hift-5.804
Reviewer-5.798
REDACTED-5.591
 certific-5.473
eeper-5.448
Token of
Token ID286
Feature activation+3.86e-01

Pos logit contributions
ranch+1.347
 Property+1.142
 Scot+1.140
 Kut+1.135
Men+1.119
Neg logit contributions
hift-1.655
Reviewer-1.652
eeper-1.545
REDACTED-1.522
 spiked-1.453
Token the
Token ID262
Feature activation-1.60e-02

Pos logit contributions
hift+0.305
Reviewer+0.304
eeper+0.283
REDACTED+0.281
anamo+0.265
Neg logit contributions
ranch-0.255
 Kut-0.216
 Property-0.215
 Scot-0.214
Men-0.208
Token round
Token ID2835
Feature activation+2.00e-01

Pos logit contributions
ranch+0.012
 Kut+0.010
 Property+0.010
 Scot+0.010
Men+0.010
Neg logit contributions
hift-0.011
Reviewer-0.011
eeper-0.011
REDACTED-0.010
anamo-0.010
Token-
Token ID12
Feature activation+3.78e-01

Pos logit contributions
hift+0.120
Reviewer+0.120
eeper+0.109
REDACTED+0.106
 spiked+0.100
Neg logit contributions
ranch-0.171
 Kut-0.149
 Property-0.148
Men-0.147
 Scot-0.147
Token bad
Token ID2089
Feature activation-1.43e+00

Pos logit contributions
ranch+0.106
 Kut+0.091
 Property+0.090
 Scot+0.090
Men+0.089
Neg logit contributions
Reviewer-0.121
hift-0.118
eeper-0.110
REDACTED-0.110
 spiked-0.102
Token debts
Token ID20250
Feature activation-2.38e+00

Pos logit contributions
ranch+0.923
 Kut+0.782
 Property+0.771
 Scot+0.768
Men+0.768
Neg logit contributions
Reviewer-1.159
hift-1.139
eeper-1.072
REDACTED-1.061
 spiked-0.996
Token.
Token ID13
Feature activation-1.69e-01

Pos logit contributions
ranch+1.597
 Kut+1.376
 Property+1.365
Men+1.360
 Scot+1.359
Neg logit contributions
Reviewer-1.846
hift-1.805
REDACTED-1.678
eeper-1.668
 Paste-1.580
Token Standing
Token ID22493
Feature activation+7.32e-01

Pos logit contributions
ranch+0.095
 Kut+0.079
 Property+0.079
 Scot+0.078
Men+0.078
Neg logit contributions
Reviewer-0.150
hift-0.150
eeper-0.141
REDACTED-0.139
 certific-0.133
Token Lord
Token ID4453
Feature activation-2.83e+00

Pos logit contributions
Reviewer+0.556
hift+0.549
REDACTED+0.511
eeper+0.506
 Paste+0.471
Neg logit contributions
ranch-0.510
 Kut-0.441
 Property-0.435
 Scot-0.433
Men-0.433
Token Gan
Token ID23207
Feature activation-5.30e+00

Pos logit contributions
ranch+1.966
 Kut+1.785
 Property+1.737
 Scot+1.731
Men+1.691
Neg logit contributions
Reviewer-2.112
hift-2.078
REDACTED-1.906
eeper-1.901
 spiked-1.759
Tokenesh
Token ID5069
Feature activation+4.38e-01

Pos logit contributions
ranch+3.615
 Kut+3.095
 Property+2.981
Men+2.976
 Scot+2.914
Neg logit contributions
Reviewer-4.056
hift-3.990
REDACTED-3.796
eeper-3.604
 certific-3.533
Token and
Token ID290
Feature activation-3.66e-01

Pos logit contributions
Reviewer+0.333
hift+0.330
eeper+0.304
REDACTED+0.303
 spiked+0.285
Neg logit contributions
ranch-0.306
 Kut-0.263
 Property-0.259
Men-0.258
 Scot-0.258
Token Vish
Token ID36900
Feature activation-1.61e+00

Pos logit contributions
ranch+0.243
 Kut+0.211
 Scot+0.207
 Property+0.207
Men+0.204
Neg logit contributions
Reviewer-0.289
hift-0.286
eeper-0.265
REDACTED-0.263
 spiked-0.247
Tokennu
Token ID28803
Feature activation-4.28e-01

Pos logit contributions
ranch+0.847
 Kut+0.675
 Property+0.666
Men+0.664
 Scot+0.658
Neg logit contributions
hift-1.486
Reviewer-1.483
eeper-1.386
REDACTED-1.385
 spiked-1.328
Token D
Token ID360
Feature activation-1.79e+00

Pos logit contributions
ranch+0.293
 Kut+0.257
 Property+0.252
 Scot+0.252
Men+0.249
Neg logit contributions
Reviewer-0.334
hift-0.324
REDACTED-0.300
eeper-0.299
 Paste-0.279
Token
Token ID198
Feature activation+8.58e-01

Pos logit contributions
ranch+0.396
 Kut+0.325
 Property+0.322
Men+0.321
 Scot+0.316
Neg logit contributions
hift-0.551
Reviewer-0.549
eeper-0.512
REDACTED-0.507
 spiked-0.494
Token
Token ID198
Feature activation+7.93e-01

Pos logit contributions
hift+0.799
Reviewer+0.796
eeper+0.753
REDACTED+0.734
 certific+0.718
Neg logit contributions
ranch-0.451
 Kut-0.358
 Property-0.357
Men-0.355
 Scot-0.354
Token1993
Token ID24465
Feature activation-1.22e-03

Pos logit contributions
Reviewer+0.644
hift+0.635
REDACTED+0.592
eeper+0.590
 Paste+0.556
Neg logit contributions
ranch-0.506
 Kut-0.426
 Property-0.424
 Scot-0.417
Men-0.410
Token Jerry
Token ID13075
Feature activation-1.20e+00

Pos logit contributions
ranch+0.001
 Scot+0.001
 Kut+0.001
 Property+0.001
Men+0.001
Neg logit contributions
hift-0.001
Reviewer-0.001
eeper-0.001
REDACTED-0.001
 spiked-0.001
Token Se
Token ID1001
Feature activation-1.96e+00

Pos logit contributions
ranch+0.462
 Scot+0.369
 Kut+0.365
 Property+0.365
 Wong+0.350
Neg logit contributions
Reviewer-1.281
hift-1.269
eeper-1.212
REDACTED-1.186
 certific-1.155
Tokeninfeld
Token ID47187
Feature activation-5.28e+00

Pos logit contributions
ranch+0.994
Men+0.772
 Property+0.765
WC+0.744
 Kut+0.729
Neg logit contributions
hift-1.862
Reviewer-1.825
eeper-1.736
REDACTED-1.731
 certific-1.692
Token
Token ID198
Feature activation+6.46e-01

Pos logit contributions
ranch+3.099
 Kut+2.518
 Property+2.514
 Scot+2.493
Men+2.492
Neg logit contributions
hift-4.566
Reviewer-4.541
eeper-4.261
REDACTED-4.195
 spiked-4.024
Token
Token ID198
Feature activation+6.17e-01

Pos logit contributions
hift+0.606
Reviewer+0.604
eeper+0.572
REDACTED+0.557
 certific+0.546
Neg logit contributions
ranch-0.336
 Property-0.266
 Kut-0.266
Men-0.264
 Scot-0.264
Token1993
Token ID24465
Feature activation+2.90e-02

Pos logit contributions
Reviewer+0.490
hift+0.471
REDACTED+0.447
eeper+0.440
 Paste+0.412
Neg logit contributions
ranch-0.407
 Kut-0.351
 Property-0.349
 Scot-0.341
 Wong-0.332
Token Garry
Token ID45354
Feature activation-1.84e+00

Pos logit contributions
hift+0.026
Reviewer+0.026
eeper+0.024
REDACTED+0.024
 certific+0.023
Neg logit contributions
ranch-0.016
 Scot-0.014
 Kut-0.014
 Property-0.013
Men-0.013
Token Sh
Token ID911
Feature activation-9.46e-01

Pos logit contributions
ranch+1.016
 Kut+0.861
 Scot+0.854
 Property+0.846
Men+0.834
Neg logit contributions
hift-1.657
Reviewer-1.643
eeper-1.560
REDACTED-1.543
 certific-1.486
Token
Token ID198
Feature activation+7.21e-01

Pos logit contributions
ranch+0.656
 Property+0.524
 Scot+0.520
 Kut+0.515
Men+0.509
Neg logit contributions
hift-1.248
Reviewer-1.241
eeper-1.180
REDACTED-1.148
 certific-1.123
Token
Token ID198
Feature activation+6.10e-01

Pos logit contributions
hift+0.675
Reviewer+0.673
eeper+0.637
REDACTED+0.620
 certific+0.609
Neg logit contributions
ranch-0.376
 Property-0.298
 Kut-0.298
Men-0.296
 Scot-0.295
Token1995
Token ID21908
Feature activation-4.56e-01

Pos logit contributions
hift+0.526
Reviewer+0.497
eeper+0.488
 certific+0.462
 spiked+0.461
Neg logit contributions
ranch-0.376
Men-0.314
WC-0.306
 Property-0.303
 Scot-0.298
Token Jerry
Token ID13075
Feature activation-1.09e+00

Pos logit contributions
ranch+0.245
 Scot+0.206
 Kut+0.205
 Property+0.201
Men+0.196
Neg logit contributions
hift-0.415
Reviewer-0.413
eeper-0.390
REDACTED-0.383
 certific-0.368
Token Se
Token ID1001
Feature activation-2.22e+00

Pos logit contributions
ranch+0.398
 Scot+0.312
 Property+0.311
 Kut+0.309
 Wong+0.292
Neg logit contributions
Reviewer-1.180
hift-1.173
eeper-1.117
REDACTED-1.098
 certific-1.072
Tokeninfeld
Token ID47187
Feature activation-5.25e+00

Pos logit contributions
ranch+1.127
Men+0.875
 Property+0.860
WC+0.839
 Scot+0.830
Neg logit contributions
hift-2.113
Reviewer-2.068
eeper-1.969
REDACTED-1.958
 certific-1.922
Token
Token ID198
Feature activation+5.91e-01

Pos logit contributions
ranch+2.826
 Property+2.270
 Kut+2.254
 Scot+2.242
Men+2.227
Neg logit contributions
hift-4.781
Reviewer-4.731
eeper-4.473
REDACTED-4.383
 certific-4.210
Token
Token ID198
Feature activation+5.55e-01

Pos logit contributions
hift+0.556
Reviewer+0.553
eeper+0.525
REDACTED+0.510
 certific+0.502
Neg logit contributions
ranch-0.306
 Property-0.242
 Kut-0.242
Men-0.241
 Scot-0.240
Token1995
Token ID21908
Feature activation-4.56e-01

Pos logit contributions
Reviewer+0.421
hift+0.413
eeper+0.384
REDACTED+0.383
 Paste+0.361
Neg logit contributions
ranch-0.388
 Kut-0.330
 Property-0.327
 Scot-0.325
Men-0.320
Token Garry
Token ID45354
Feature activation-1.94e+00

Pos logit contributions
ranch+0.255
 Scot+0.217
 Kut+0.217
 Property+0.213
Men+0.208
Neg logit contributions
hift-0.404
Reviewer-0.402
eeper-0.379
REDACTED-0.371
 certific-0.357
Token Sh
Token ID911
Feature activation-8.57e-01

Pos logit contributions
ranch+0.848
 Scot+0.720
 Property+0.711
 Kut+0.702
Men+0.666
Neg logit contributions
Reviewer-1.965
hift-1.936
eeper-1.881
REDACTED-1.824
 certific-1.821
Token.
Token ID13
Feature activation-8.32e-02

Pos logit contributions
hift+1.692
Reviewer+1.683
eeper+1.559
REDACTED+1.537
 spiked+1.460
Neg logit contributions
ranch-1.649
 Kut-1.398
 Property-1.392
Men-1.383
 Scot-1.382
Token Rel
Token ID4718
Feature activation-1.86e+00

Pos logit contributions
ranch+0.047
 Property+0.038
 Kut+0.038
 Scot+0.038
Men+0.038
Neg logit contributions
hift-0.074
Reviewer-0.074
eeper-0.070
REDACTED-0.068
 certific-0.066
Tokenieved
Token ID39591
Feature activation-9.24e-01

Pos logit contributions
ranch+1.440
 Kut+1.235
 Property+1.229
Men+1.223
 Scot+1.217
Neg logit contributions
hift-1.278
Reviewer-1.268
eeper-1.153
REDACTED-1.140
 spiked-1.088
Token,
Token ID11
Feature activation+3.25e-01

Pos logit contributions
ranch+0.650
 Kut+0.553
 Property+0.551
 Scot+0.548
Men+0.546
Neg logit contributions
hift-0.692
Reviewer-0.687
eeper-0.637
REDACTED-0.624
 spiked-0.597
TokenPar
Token ID10044
Feature activation-3.24e-01

Pos logit contributions
hift+0.273
Reviewer+0.272
eeper+0.255
REDACTED+0.251
 spiked+0.240
Neg logit contributions
ranch-0.199
 Kut-0.163
 Property-0.163
 Scot-0.161
Men-0.161
Tokenamount
Token ID17287
Feature activation-5.24e+00

Pos logit contributions
ranch+0.109
 Property+0.073
Men+0.071
 Kut+0.071
 Scot+0.071
Neg logit contributions
Reviewer-0.365
hift-0.365
eeper-0.344
REDACTED-0.343
 Paste-0.332
Token jumped
Token ID11687
Feature activation-1.33e+00

Pos logit contributions
ranch+3.898
 Property+3.392
 Kut+3.351
 Scot+3.344
WC+3.338
Neg logit contributions
Reviewer-3.604
hift-3.578
eeper-3.279
REDACTED-3.222
anamo-3.063
Token at
Token ID379
Feature activation-6.36e-01

Pos logit contributions
ranch+0.893
 Kut+0.755
 Scot+0.752
 Property+0.750
Men+0.741
Neg logit contributions
Reviewer-1.035
hift-1.025
REDACTED-0.954
eeper-0.947
 Paste-0.899
Token the
Token ID262
Feature activation-2.05e-01

Pos logit contributions
ranch+0.397
 Kut+0.327
 Property+0.326
Men+0.324
 Scot+0.323
Neg logit contributions
hift-0.525
Reviewer-0.523
eeper-0.489
REDACTED-0.481
 spiked-0.462
Token chance
Token ID2863
Feature activation+5.05e-01

Pos logit contributions
ranch+0.148
 Property+0.125
 Kut+0.124
 Scot+0.124
Men+0.123
Neg logit contributions
hift-0.151
Reviewer-0.150
eeper-0.139
REDACTED-0.137
 spiked-0.129
Token to
Token ID284
Feature activation-1.56e-01

Pos logit contributions
hift+0.363
Reviewer+0.361
eeper+0.334
REDACTED+0.328
 spiked+0.313
Neg logit contributions
ranch-0.370
 Kut-0.315
 Property-0.314
Men-0.311
 Scot-0.311
Token to
Token ID284
Feature activation+5.50e-01

Pos logit contributions
hift+0.461
Reviewer+0.457
eeper+0.417
REDACTED+0.413
 spiked+0.387
Neg logit contributions
ranch-0.527
 Kut-0.459
 Property-0.456
 Scot-0.455
Men-0.453
Token ber
Token ID18157
Feature activation+1.47e-01

Pos logit contributions
Reviewer+0.398
hift+0.392
eeper+0.364
REDACTED+0.357
 Paste+0.335
Neg logit contributions
ranch-0.404
 Kut-0.349
 Scot-0.345
 Property-0.344
Men-0.341
Tokenating
Token ID803
Feature activation+7.86e-02

Pos logit contributions
hift+0.116
Reviewer+0.115
eeper+0.106
REDACTED+0.106
 spiked+0.101
Neg logit contributions
ranch-0.098
 Kut-0.082
 Property-0.081
Men-0.081
 Scot-0.080
Token herself
Token ID5223
Feature activation-1.38e+00

Pos logit contributions
Reviewer+0.060
hift+0.059
eeper+0.055
REDACTED+0.054
 Paste+0.051
Neg logit contributions
ranch-0.054
 Kut-0.047
 Scot-0.047
 Property-0.046
Men-0.046
Token for
Token ID329
Feature activation+8.20e-02

Pos logit contributions
ranch+0.843
 Kut+0.730
 Scot+0.725
Men+0.718
 Property+0.716
Neg logit contributions
Reviewer-1.128
hift-1.119
eeper-1.038
REDACTED-1.033
 Paste-0.981
Token letting
Token ID9616
Feature activation+4.27e+00

Pos logit contributions
Reviewer+0.065
hift+0.064
eeper+0.060
REDACTED+0.059
anamo+0.056
Neg logit contributions
ranch-0.054
 Kut-0.047
 Scot-0.047
 Property-0.046
Men-0.045
Token this
Token ID428
Feature activation+1.32e+00

Pos logit contributions
hift+3.308
Reviewer+3.273
eeper+3.057
REDACTED+3.010
 spiked+2.903
Neg logit contributions
ranch-2.896
 Property-2.386
 Kut-2.377
Men-2.377
 Scot-2.350
Token happen
Token ID1645
Feature activation+2.39e-01

Pos logit contributions
Reviewer+1.034
hift+1.033
eeper+0.959
REDACTED+0.939
 Paste+0.894
Neg logit contributions
ranch-0.879
 Kut-0.743
 Scot-0.739
 Property-0.738
Men-0.725
Token,
Token ID11
Feature activation+5.05e-01

Pos logit contributions
Reviewer+0.178
hift+0.176
eeper+0.162
REDACTED+0.161
 Paste+0.152
Neg logit contributions
ranch-0.170
 Kut-0.145
 Property-0.144
Men-0.144
 Scot-0.144
Token and
Token ID290
Feature activation+4.07e-01

Pos logit contributions
Reviewer+0.406
hift+0.405
eeper+0.374
REDACTED+0.370
 spiked+0.348
Neg logit contributions
ranch-0.324
 Kut-0.278
 Scot-0.277
 Property-0.275
Men-0.271
Token it
Token ID340
Feature activation+6.17e-01

Pos logit contributions
Reviewer+0.322
hift+0.321
eeper+0.297
REDACTED+0.294
anamo+0.276
Neg logit contributions
ranch-0.266
 Kut-0.228
 Scot-0.228
 Property-0.226
Men-0.223
Token club
Token ID3430
Feature activation+1.41e+00

Pos logit contributions
ranch+0.053
 Scot+0.046
 Kut+0.046
 Property+0.045
Men+0.044
Neg logit contributions
Reviewer-0.060
hift-0.059
eeper-0.055
REDACTED-0.054
 spiked-0.051
Token.
Token ID13
Feature activation-2.08e-01

Pos logit contributions
hift+1.030
Reviewer+1.029
eeper+0.948
REDACTED+0.935
 spiked+0.882
Neg logit contributions
ranch-1.026
 Kut-0.877
 Scot-0.872
 Property-0.871
Men-0.864
Token
Token ID198
Feature activation-1.12e+00

Pos logit contributions
ranch+0.113
 Kut+0.091
 Property+0.091
 Scot+0.090
Men+0.089
Neg logit contributions
hift-0.190
Reviewer-0.189
eeper-0.179
REDACTED-0.175
 spiked-0.169
Token
Token ID198
Feature activation-5.14e-01

Pos logit contributions
ranch+0.530
 Property+0.417
Men+0.416
 Scot+0.411
 Kut+0.409
Neg logit contributions
hift-1.101
Reviewer-1.100
eeper-1.047
 certific-1.022
REDACTED-1.007
TokenStory
Token ID11605
Feature activation+1.39e+00

Pos logit contributions
ranch+0.320
Men+0.270
 Kut+0.262
 Property+0.260
 Scot+0.260
Neg logit contributions
hift-0.430
Reviewer-0.420
eeper-0.403
REDACTED-0.387
 spiked-0.385
Token continues
Token ID4477
Feature activation+4.17e+00

Pos logit contributions
Reviewer+0.867
hift+0.857
eeper+0.771
REDACTED+0.744
 certific+0.713
Neg logit contributions
ranch-1.184
 Property-1.031
Men-1.028
 Kut-1.023
 Scot-1.022
Token below
Token ID2174
Feature activation-1.52e+00

Pos logit contributions
hift+2.074
Reviewer+2.062
eeper+1.833
REDACTED+1.781
 spiked+1.652
Neg logit contributions
ranch-3.983
 Kut-3.529
 Property-3.524
Men-3.506
 Scot-3.504
Token advertisement
Token ID15422
Feature activation+2.48e+00

Pos logit contributions
ranch+1.061
 Scot+0.922
 Property+0.914
 Kut+0.903
Men+0.894
Neg logit contributions
Reviewer-1.141
hift-1.123
eeper-1.052
REDACTED-1.025
anamo-0.993
Token
Token ID198
Feature activation+1.23e+00

Pos logit contributions
hift+2.239
Reviewer+2.229
eeper+2.097
REDACTED+2.066
 spiked+1.991
Neg logit contributions
ranch-1.368
 Kut-1.095
 Property-1.092
Men-1.081
 Scot-1.079
Token
Token ID198
Feature activation+1.32e+00

Pos logit contributions
hift+1.140
Reviewer+1.135
eeper+1.072
REDACTED+1.048
 certific+1.021
Neg logit contributions
ranch-0.649
 Property-0.516
 Kut-0.515
Men-0.513
 Scot-0.510
TokenHe
Token ID1544
Feature activation+7.42e-01

Pos logit contributions
hift+1.132
Reviewer+1.121
eeper+1.059
REDACTED+1.035
 spiked+1.005
Neg logit contributions
ranch-0.787
Men-0.642
 Kut-0.642
 Property-0.638
 Scot-0.634
Token.
Token ID13
Feature activation+1.36e-02

Pos logit contributions
hift+1.050
Reviewer+1.044
eeper+0.969
REDACTED+0.954
 spiked+0.905
Neg logit contributions
ranch-1.006
 Kut-0.853
 Property-0.849
Men-0.844
 Scot-0.844
Token Not
Token ID1892
Feature activation-1.76e-01

Pos logit contributions
hift+0.012
Reviewer+0.012
eeper+0.012
REDACTED+0.011
 certific+0.011
Neg logit contributions
ranch-0.007
 Property-0.006
 Kut-0.006
 Scot-0.006
Men-0.006
Token everyone
Token ID2506
Feature activation+1.32e+00

Pos logit contributions
ranch+0.113
 Kut+0.094
 Property+0.094
 Scot+0.094
Men+0.093
Neg logit contributions
hift-0.141
Reviewer-0.141
eeper-0.132
REDACTED-0.130
 spiked-0.124
Token has
Token ID468
Feature activation+6.40e-01

Pos logit contributions
Reviewer+1.012
hift+1.008
eeper+0.934
REDACTED+0.925
anamo+0.860
Neg logit contributions
ranch-0.908
 Scot-0.785
 Kut-0.778
 Property-0.770
Men-0.764
Token the
Token ID262
Feature activation+1.41e+00

Pos logit contributions
Reviewer+0.519
hift+0.518
eeper+0.478
REDACTED+0.473
anamo+0.445
Neg logit contributions
ranch-0.413
 Kut-0.349
 Scot-0.349
 Property-0.346
Men-0.339
Token opportunity
Token ID3663
Feature activation+3.79e+00

Pos logit contributions
hift+1.181
Reviewer+1.179
eeper+1.103
REDACTED+1.082
 spiked+1.024
Neg logit contributions
ranch-0.872
 Kut-0.723
 Scot-0.722
 Property-0.721
Men-0.706
Token to
Token ID284
Feature activation+8.73e-01

Pos logit contributions
hift+2.543
Reviewer+2.535
eeper+2.313
REDACTED+2.278
 spiked+2.171
Neg logit contributions
ranch-2.938
 Property-2.499
 Kut-2.497
Men-2.481
 Scot-2.459
Token spend
Token ID4341
Feature activation-9.00e-01

Pos logit contributions
Reviewer+0.670
hift+0.663
eeper+0.611
REDACTED+0.611
anamo+0.576
Neg logit contributions
ranch-0.609
 Kut-0.512
 Scot-0.511
 Property-0.508
Men-0.499
Token time
Token ID640
Feature activation+8.24e-01

Pos logit contributions
ranch+0.557
 Scot+0.479
 Kut+0.475
 Property+0.474
Men+0.461
Neg logit contributions
Reviewer-0.747
hift-0.731
eeper-0.683
REDACTED-0.677
anamo-0.640
Token by
Token ID416
Feature activation+2.05e+00

Pos logit contributions
hift+0.609
Reviewer+0.607
eeper+0.562
REDACTED+0.550
 Paste+0.520
Neg logit contributions
ranch-0.588
 Kut-0.503
 Scot-0.502
 Property-0.496
Men-0.494
Token or
Token ID393
Feature activation+3.49e-01

Pos logit contributions
hift+1.617
Reviewer+1.609
eeper+1.499
REDACTED+1.475
 spiked+1.411
Neg logit contributions
ranch-1.365
 Kut-1.139
 Property-1.136
Men-1.127
 Scot-1.126
Token
Token ID198
Feature activation+1.02e+00

Pos logit contributions
hift+0.454
Reviewer+0.452
eeper+0.429
REDACTED+0.417
 certific+0.412
Neg logit contributions
ranch-0.248
 Property-0.196
 Kut-0.196
Men-0.195
 Scot-0.194
Token"
Token ID1
Feature activation+1.42e+00

Pos logit contributions
hift+0.840
Reviewer+0.823
eeper+0.784
REDACTED+0.761
 spiked+0.747
Neg logit contributions
ranch-0.644
Men-0.542
 Kut-0.533
 Property-0.529
 Scot-0.525
TokenSome
Token ID4366
Feature activation+6.15e-01

Pos logit contributions
hift+1.141
Reviewer+1.127
eeper+1.065
REDACTED+1.034
 spiked+1.010
Neg logit contributions
ranch-0.920
Men-0.777
 Kut-0.769
 Property-0.766
 Scot-0.759
Token satellites
Token ID20372
Feature activation+4.07e-01

Pos logit contributions
Reviewer+0.484
hift+0.478
eeper+0.447
REDACTED+0.446
 spiked+0.419
Neg logit contributions
ranch-0.406
 Kut-0.346
 Scot-0.344
 Property-0.343
Men-0.339
Token take
Token ID1011
Feature activation+4.87e-02

Pos logit contributions
Reviewer+0.330
hift+0.318
REDACTED+0.302
eeper+0.299
anamo+0.281
Neg logit contributions
ranch-0.265
 Scot-0.224
 Kut-0.224
 Property-0.222
Men-0.218
Token decades
Token ID4647
Feature activation+3.77e+00

Pos logit contributions
Reviewer+0.039
hift+0.038
REDACTED+0.036
eeper+0.036
anamo+0.033
Neg logit contributions
ranch-0.031
 Scot-0.027
 Kut-0.027
 Property-0.027
Men-0.026
Token to
Token ID284
Feature activation+1.10e+00

Pos logit contributions
hift+2.677
Reviewer+2.665
eeper+2.461
REDACTED+2.423
 spiked+2.302
Neg logit contributions
ranch-2.788
 Kut-2.377
 Property-2.370
Men-2.355
 Scot-2.354
Token come
Token ID1282
Feature activation+8.14e-01

Pos logit contributions
Reviewer+0.818
hift+0.799
REDACTED+0.747
eeper+0.742
anamo+0.701
Neg logit contributions
ranch-0.788
 Scot-0.675
 Kut-0.670
 Property-0.670
Men-0.651
Token back
Token ID736
Feature activation+2.82e+00

Pos logit contributions
Reviewer+0.657
hift+0.650
eeper+0.618
REDACTED+0.609
anamo+0.573
Neg logit contributions
ranch-0.516
 Scot-0.443
 Kut-0.439
Men-0.432
 Property-0.431
Token after
Token ID706
Feature activation-3.67e-01

Pos logit contributions
hift+2.083
Reviewer+2.074
eeper+1.926
REDACTED+1.903
 spiked+1.788
Neg logit contributions
ranch-2.001
 Kut-1.709
 Scot-1.703
 Property-1.701
Men-1.686
Token finishing
Token ID12848
Feature activation+1.86e+00

Pos logit contributions
ranch+0.242
 Kut+0.210
 Scot+0.210
 Property+0.206
Men+0.203
Neg logit contributions
Reviewer-0.282
hift-0.282
eeper-0.264
REDACTED-0.263
anamo-0.247
Token 234
Token ID34323
Feature activation+3.58e-02

Pos logit contributions
ranch+0.225
 Property+0.188
 Kut+0.188
 Scot+0.185
Men+0.183
Neg logit contributions
hift-0.343
Reviewer-0.343
eeper-0.324
REDACTED-0.314
 certific-0.305
Token.
Token ID764
Feature activation+2.26e+00

Pos logit contributions
Reviewer+0.029
hift+0.029
eeper+0.027
 certific+0.026
REDACTED+0.026
Neg logit contributions
ranch-0.023
 Property-0.020
Men-0.020
 Scot-0.020
 Kut-0.020
Token
Token ID198
Feature activation+5.58e-01

Pos logit contributions
hift+2.071
Reviewer+2.064
eeper+1.944
REDACTED+1.911
 spiked+1.844
Neg logit contributions
ranch-1.216
 Kut-0.974
 Property-0.971
 Scot-0.963
Men-0.962
Token
Token ID198
Feature activation+1.16e+00

Pos logit contributions
hift+0.524
Reviewer+0.522
eeper+0.494
REDACTED+0.482
 certific+0.473
Neg logit contributions
ranch-0.289
 Kut-0.228
 Property-0.228
Men-0.227
 Scot-0.226
TokenGoogle
Token ID11708
Feature activation+1.84e+00

Pos logit contributions
hift+1.009
Reviewer+0.988
eeper+0.950
 spiked+0.912
 certific+0.911
Neg logit contributions
ranch-0.681
Men-0.574
 Kut-0.554
WC-0.550
 Scot-0.547
Token Scholar
Token ID11713
Feature activation+3.76e+00

Pos logit contributions
hift+0.947
Reviewer+0.943
eeper+0.842
REDACTED+0.811
 certific+0.761
Neg logit contributions
ranch-1.733
 Property-1.550
 Kut-1.543
 Scot-1.540
Men-1.530
Token Crossref
Token ID23613
Feature activation+2.73e+00

Pos logit contributions
hift+2.363
Reviewer+2.315
eeper+2.145
REDACTED+2.106
 spiked+2.029
Neg logit contributions
ranch-3.079
 Kut-2.630
 Property-2.621
Men-2.607
 Scot-2.587
Token |
Token ID930
Feature activation-9.15e-01

Pos logit contributions
hift+2.021
Reviewer+1.983
eeper+1.864
REDACTED+1.835
 DOI+1.773
Neg logit contributions
ranch-1.904
 Property-1.600
 Kut-1.599
Men-1.586
 Scot-1.568
Token Medline
Token ID34582
Feature activation+3.89e+00

Pos logit contributions
ranch+0.782
 Property+0.724
 Kut+0.720
 Scot+0.714
Men+0.708
Neg logit contributions
hift-0.535
Reviewer-0.522
eeper-0.493
 certific-0.454
REDACTED-0.452
Token |
Token ID930
Feature activation-4.48e-01

Pos logit contributions
hift+2.707
Reviewer+2.691
eeper+2.519
REDACTED+2.484
 DOI+2.389
Neg logit contributions
ranch-2.832
 Property-2.385
 Kut-2.384
Men-2.346
 Scot-2.345
Token ISI
Token ID22048
Feature activation+4.51e+00

Pos logit contributions
ranch+0.386
 Property+0.349
 Kut+0.349
 Scot+0.348
Men+0.346
Neg logit contributions
hift-0.262
Reviewer-0.256
eeper-0.244
REDACTED-0.223
 certific-0.221
Token into
Token ID656
Feature activation+2.14e+00

Pos logit contributions
hift+0.939
Reviewer+0.933
eeper+0.868
REDACTED+0.854
 spiked+0.810
Neg logit contributions
ranch-0.845
 Kut-0.710
 Property-0.708
 Scot-0.704
Men-0.704
Token the
Token ID262
Feature activation+1.16e-01

Pos logit contributions
hift+1.709
Reviewer+1.700
eeper+1.587
REDACTED+1.561
 spiked+1.497
Neg logit contributions
ranch-1.391
 Kut-1.154
 Property-1.152
Men-1.142
 Scot-1.140
Token #
Token ID1303
Feature activation-1.02e+00

Pos logit contributions
hift+0.091
Reviewer+0.091
eeper+0.084
REDACTED+0.083
 certific+0.078
Neg logit contributions
ranch-0.077
 Kut-0.065
 Scot-0.065
 Property-0.065
Men-0.065
Tokenmet
Token ID4164
Feature activation+6.32e-01

Pos logit contributions
ranch+0.679
Men+0.585
 Kut+0.562
 Property+0.559
WC+0.555
Neg logit contributions
hift-0.802
Reviewer-0.798
eeper-0.752
REDACTED-0.726
 spiked-0.710
Tokenoo
Token ID2238
Feature activation+2.38e-01

Pos logit contributions
hift+0.494
Reviewer+0.494
eeper+0.456
REDACTED+0.453
 Paste+0.430
Neg logit contributions
ranch-0.426
Men-0.355
 Property-0.354
 Kut-0.353
 Scot-0.353
Token thing
Token ID1517
Feature activation+1.93e+00

Pos logit contributions
Reviewer+0.192
hift+0.191
eeper+0.177
REDACTED+0.174
 Paste+0.168
Neg logit contributions
ranch-0.157
Men-0.129
 Kut-0.128
 Property-0.128
 Scot-0.128
Token
Token ID784
Feature activation-2.89e-01

Pos logit contributions
hift+1.402
Reviewer+1.396
eeper+1.291
REDACTED+1.269
 spiked+1.209
Neg logit contributions
ranch-1.394
 Kut-1.184
 Property-1.180
Men-1.173
 Scot-1.173
Token she
Token ID673
Feature activation+2.54e-01

Pos logit contributions
ranch+0.189
 Kut+0.160
 Property+0.160
Men+0.159
 Scot+0.159
Neg logit contributions
hift-0.230
Reviewer-0.228
eeper-0.211
REDACTED-0.210
 certific-0.199
Token just
Token ID655
Feature activation-4.14e-01

Pos logit contributions
Reviewer+0.179
hift+0.179
eeper+0.162
REDACTED+0.161
 Paste+0.150
Neg logit contributions
ranch-0.191
 Kut-0.165
 Scot-0.163
 Property-0.163
Men-0.162
Token writes
Token ID6797
Feature activation-7.26e-01

Pos logit contributions
ranch+0.311
 Kut+0.272
 Scot+0.268
 Property+0.266
Men+0.264
Neg logit contributions
Reviewer-0.296
hift-0.290
eeper-0.265
REDACTED-0.263
 Insight-0.243
Token very
Token ID845
Feature activation+7.49e-01

Pos logit contributions
ranch+0.484
 Kut+0.413
 Property+0.411
 Scot+0.411
Men+0.407
Neg logit contributions
Reviewer-0.569
hift-0.567
eeper-0.523
REDACTED-0.518
anamo-0.492
Token back
Token ID736
Feature activation+2.78e+00

Pos logit contributions
hift+0.732
Reviewer+0.731
eeper+0.679
REDACTED+0.666
 Paste+0.630
Neg logit contributions
ranch-0.654
 Kut-0.560
 Property-0.559
 Scot-0.557
Men-0.550
Token to
Token ID284
Feature activation+4.11e-02

Pos logit contributions
hift+2.020
Reviewer+2.009
eeper+1.859
REDACTED+1.827
 spiked+1.740
Neg logit contributions
ranch-2.019
 Kut-1.714
 Property-1.711
Men-1.698
 Scot-1.697
Token haunt
Token ID36334
Feature activation+1.64e+00

Pos logit contributions
Reviewer+0.026
hift+0.025
eeper+0.023
REDACTED+0.023
 certific+0.021
Neg logit contributions
ranch-0.034
 Property-0.030
 Scot-0.030
 Kut-0.030
Men-0.029
Token you
Token ID345
Feature activation-9.37e-02

Pos logit contributions
hift+0.986
Reviewer+0.981
eeper+0.892
REDACTED+0.874
 spiked+0.823
Neg logit contributions
ranch-1.388
 Kut-1.209
 Property-1.207
Men-1.200
 Scot-1.199
Token if
Token ID611
Feature activation+5.54e-01

Pos logit contributions
ranch+0.064
 Kut+0.055
 Property+0.055
 Scot+0.054
Men+0.054
Neg logit contributions
Reviewer-0.072
hift-0.071
eeper-0.066
REDACTED-0.065
 spiked-0.062
Token you
Token ID345
Feature activation+1.16e+00

Pos logit contributions
hift+0.366
Reviewer+0.365
eeper+0.334
REDACTED+0.327
 spiked+0.308
Neg logit contributions
ranch-0.437
 Kut-0.380
 Property-0.379
 Scot-0.377
Men-0.374
Token don
Token ID836
Feature activation+6.81e-01

Pos logit contributions
Reviewer+0.860
hift+0.858
eeper+0.788
REDACTED+0.784
 spiked+0.735
Neg logit contributions
ranch-0.838
 Property-0.710
 Kut-0.708
 Scot-0.704
Men-0.698
Token
Token ID447
Feature activation+9.86e-01

Pos logit contributions
hift+0.429
Reviewer+0.425
eeper+0.390
REDACTED+0.373
 certific+0.369
Neg logit contributions
ranch-0.566
 Kut-0.492
Men-0.491
 Property-0.489
 Scot-0.487
Token
Token ID247
Feature activation+1.38e-01

Pos logit contributions
Reviewer+0.642
hift+0.629
eeper+0.566
REDACTED+0.562
 spiked+0.543
Neg logit contributions
ranch-0.811
Men-0.692
 Property-0.688
 Kut-0.688
 Scot-0.679
Tokent
Token ID83
Feature activation+1.07e+00

Pos logit contributions
hift+0.122
Reviewer+0.122
eeper+0.112
REDACTED+0.112
 certific+0.108
Neg logit contributions
ranch-0.080
 Kut-0.065
 Property-0.064
Men-0.064
 Scot-0.063
Token protect
Token ID1805
Feature activation+2.71e-01

Pos logit contributions
Reviewer+0.800
hift+0.796
eeper+0.739
REDACTED+0.728
anamo+0.691
Neg logit contributions
ranch-0.753
 Property-0.638
 Kut-0.636
 Scot-0.632
Men-0.624
Token future
Token ID2003
Feature activation-1.04e+00

Pos logit contributions
ranch+0.385
 Scot+0.322
 Property+0.322
 Kut+0.321
Men+0.316
Neg logit contributions
hift-0.444
Reviewer-0.442
eeper-0.414
REDACTED-0.408
 Paste-0.384
Token economic
Token ID3034
Feature activation-1.01e+00

Pos logit contributions
ranch+0.739
 Property+0.636
 Scot+0.632
 Kut+0.631
Men+0.610
Neg logit contributions
Reviewer-0.768
hift-0.764
eeper-0.723
REDACTED-0.722
 spiked-0.661
Token outlook
Token ID19360
Feature activation-3.36e+00

Pos logit contributions
ranch+0.689
 Scot+0.605
 Property+0.597
 Kut+0.596
 Rivals+0.575
Neg logit contributions
Reviewer-0.761
hift-0.760
eeper-0.718
REDACTED-0.713
anamo-0.662
Token,
Token ID11
Feature activation-1.32e-01

Pos logit contributions
ranch+2.258
 Kut+1.915
 Property+1.914
 Scot+1.907
Men+1.881
Neg logit contributions
Reviewer-2.540
hift-2.535
eeper-2.364
REDACTED-2.353
 certific-2.223
Token
Token ID447
Feature activation-9.47e-02

Pos logit contributions
ranch+0.087
 Property+0.073
 Kut+0.073
 Scot+0.073
Men+0.071
Neg logit contributions
hift-0.105
Reviewer-0.105
eeper-0.097
REDACTED-0.096
 spiked-0.091
Token
Token ID251
Feature activation+6.08e-01

Pos logit contributions
ranch+0.055
 Property+0.045
 Kut+0.044
 Scot+0.044
Men+0.043
Neg logit contributions
hift-0.083
Reviewer-0.082
eeper-0.077
REDACTED-0.076
 Paste-0.074
Token according
Token ID1864
Feature activation+2.14e-03

Pos logit contributions
hift+0.526
Reviewer+0.524
eeper+0.490
REDACTED+0.485
 spiked+0.462
Neg logit contributions
ranch-0.359
 Kut-0.294
 Property-0.294
 Scot-0.290
Men-0.287
Token to
Token ID284
Feature activation-6.85e-01

Pos logit contributions
hift+0.002
Reviewer+0.002
eeper+0.002
 certific+0.001
REDACTED+0.001
Neg logit contributions
ranch-0.001
 Property-0.001
 Scot-0.001
 Kut-0.001
Men-0.001
Token the
Token ID262
Feature activation+5.66e-02

Pos logit contributions
ranch+0.378
 Property+0.315
 Scot+0.311
 Kut+0.310
Men+0.297
Neg logit contributions
hift-0.616
Reviewer-0.613
eeper-0.577
REDACTED-0.571
 spiked-0.543
Token venue
Token ID14359
Feature activation+9.78e-01

Pos logit contributions
hift+0.044
Reviewer+0.043
eeper+0.040
REDACTED+0.040
 spiked+0.037
Neg logit contributions
ranch-0.039
 Property-0.034
 Scot-0.034
 Kut-0.033
Men-0.032
Token
Token ID447
Feature activation+3.60e-01

Pos logit contributions
hift+0.661
Reviewer+0.653
eeper+0.602
REDACTED+0.595
 spiked+0.559
Neg logit contributions
ranch-0.761
 Property-0.655
 Kut-0.655
 Scot-0.650
Men-0.648
Token case
Token ID1339
Feature activation-1.36e+00

Pos logit contributions
ranch+0.318
 Property+0.270
 Kut+0.270
 Scot+0.267
Men+0.265
Neg logit contributions
hift-0.335
Reviewer-0.332
eeper-0.308
REDACTED-0.303
 spiked-0.289
Token.
Token ID13
Feature activation-6.13e-01

Pos logit contributions
ranch+0.950
 Kut+0.810
 Property+0.798
 Scot+0.798
Men+0.797
Neg logit contributions
hift-1.017
Reviewer-1.009
eeper-0.934
REDACTED-0.923
 Paste-0.872
Token The
Token ID383
Feature activation-2.84e-01

Pos logit contributions
ranch+0.338
 Property+0.281
 Kut+0.279
 Scot+0.276
Men+0.273
Neg logit contributions
hift-0.553
Reviewer-0.547
eeper-0.519
REDACTED-0.507
 certific-0.493
Token smart
Token ID4451
Feature activation-1.67e+00

Pos logit contributions
ranch+0.188
 Property+0.162
 Kut+0.160
 Scot+0.159
Men+0.156
Neg logit contributions
hift-0.224
Reviewer-0.222
eeper-0.207
REDACTED-0.205
 spiked-0.194
Token money
Token ID1637
Feature activation-2.84e+00

Pos logit contributions
ranch+1.063
 Kut+0.879
 Property+0.877
 Scot+0.876
Men+0.876
Neg logit contributions
hift-1.355
Reviewer-1.343
eeper-1.269
REDACTED-1.248
 Paste-1.198
Token would
Token ID561
Feature activation+6.46e-01

Pos logit contributions
ranch+1.936
 Kut+1.687
Men+1.647
 Scot+1.640
 Property+1.621
Neg logit contributions
hift-2.156
Reviewer-2.137
eeper-1.991
REDACTED-1.963
 Paste-1.855
Token still
Token ID991
Feature activation+1.03e+00

Pos logit contributions
hift+0.495
Reviewer+0.492
eeper+0.457
REDACTED+0.451
 spiked+0.429
Neg logit contributions
ranch-0.444
 Kut-0.374
 Property-0.372
Men-0.369
 Scot-0.369
Token say
Token ID910
Feature activation+1.55e+00

Pos logit contributions
hift+0.786
Reviewer+0.785
eeper+0.727
REDACTED+0.718
 spiked+0.681
Neg logit contributions
ranch-0.714
 Kut-0.602
 Property-0.599
Men-0.594
 Scot-0.594
Token that
Token ID326
Feature activation+3.65e-01

Pos logit contributions
hift+1.079
Reviewer+1.073
eeper+0.991
REDACTED+0.973
 spiked+0.925
Neg logit contributions
ranch-1.169
 Kut-0.998
 Property-0.996
Men-0.990
 Scot-0.987
Token the
Token ID262
Feature activation-1.96e-01

Pos logit contributions
hift+0.307
Reviewer+0.305
eeper+0.286
REDACTED+0.282
 spiked+0.270
Neg logit contributions
ranch-0.223
 Kut-0.184
 Property-0.184
 Scot-0.182
Men-0.181
Token most
Token ID749
Feature activation-8.94e-01

Pos logit contributions
ranch+0.126
 Property+0.107
 Kut+0.106
 Scot+0.105
Men+0.104
Neg logit contributions
hift-0.159
Reviewer-0.158
eeper-0.147
REDACTED-0.145
 spiked-0.138
Token had
Token ID550
Feature activation+1.25e+00

Pos logit contributions
Reviewer+1.743
hift+1.718
eeper+1.588
REDACTED+1.579
 Paste+1.479
Neg logit contributions
ranch-1.662
 Kut-1.408
 Property-1.401
 Scot-1.395
Men-1.386
Token to
Token ID284
Feature activation+2.03e+00

Pos logit contributions
hift+0.927
Reviewer+0.924
eeper+0.851
REDACTED+0.843
 Paste+0.790
Neg logit contributions
ranch-0.892
 Kut-0.765
 Property-0.761
 Scot-0.759
Men-0.752
Token take
Token ID1011
Feature activation+2.89e-01

Pos logit contributions
Reviewer+1.432
hift+1.427
eeper+1.318
REDACTED+1.293
 spiked+1.221
Neg logit contributions
ranch-1.524
 Kut-1.299
 Property-1.298
 Scot-1.290
Men-1.284
Token a
Token ID257
Feature activation+1.48e+00

Pos logit contributions
Reviewer+0.229
hift+0.229
eeper+0.212
REDACTED+0.210
 spiked+0.199
Neg logit contributions
ranch-0.190
 Kut-0.161
 Property-0.160
 Scot-0.160
Men-0.157
Token guess
Token ID4724
Feature activation+8.74e-01

Pos logit contributions
hift+1.122
Reviewer+1.117
eeper+1.031
REDACTED+1.021
 spiked+0.963
Neg logit contributions
ranch-1.037
 Kut-0.877
 Property-0.875
 Scot-0.870
Men-0.863
Token as
Token ID355
Feature activation+2.07e+00

Pos logit contributions
Reviewer+0.685
hift+0.683
eeper+0.633
REDACTED+0.625
 spiked+0.594
Neg logit contributions
ranch-0.584
 Kut-0.491
 Property-0.488
 Scot-0.487
Men-0.485
Token to
Token ID284
Feature activation+1.81e-01

Pos logit contributions
Reviewer+1.319
hift+1.299
eeper+1.199
REDACTED+1.175
 Paste+1.123
Neg logit contributions
ranch-1.690
 Property-1.440
 Kut-1.439
Men-1.436
 Scot-1.408
Token what
Token ID644
Feature activation-1.98e-01

Pos logit contributions
hift+0.152
Reviewer+0.152
eeper+0.141
REDACTED+0.139
 spiked+0.133
Neg logit contributions
ranch-0.112
 Property-0.093
 Kut-0.093
 Scot-0.092
Men-0.091
Token the
Token ID262
Feature activation-1.68e-01

Pos logit contributions
ranch+0.131
 Property+0.111
 Kut+0.110
 Scot+0.110
Men+0.108
Neg logit contributions
Reviewer-0.157
hift-0.157
eeper-0.145
REDACTED-0.143
 certific-0.136
Token lasting
Token ID15727
Feature activation+1.34e+00

Pos logit contributions
ranch+0.108
 Property+0.092
 Scot+0.091
 Kut+0.090
Men+0.088
Neg logit contributions
Reviewer-0.136
hift-0.135
eeper-0.125
REDACTED-0.124
anamo-0.118
Token appeal
Token ID5198
Feature activation-1.83e-01

Pos logit contributions
Reviewer+0.991
hift+0.987
eeper+0.913
REDACTED+0.903
 spiked+0.847
Neg logit contributions
ranch-0.958
 Property-0.820
 Kut-0.811
 Scot-0.811
Men-0.805
Token type
Token ID2099
Feature activation+7.54e-01

Pos logit contributions
Reviewer+0.110
hift+0.109
eeper+0.101
REDACTED+0.100
anamo+0.094
Neg logit contributions
ranch-0.099
 Kut-0.085
 Scot-0.085
 Property-0.084
Men-0.082
Token of
Token ID286
Feature activation-8.03e-01

Pos logit contributions
Reviewer+0.601
hift+0.600
eeper+0.560
REDACTED+0.549
 spiked+0.526
Neg logit contributions
ranch-0.495
 Kut-0.415
 Property-0.413
 Scot-0.413
Men-0.409
Token practical
Token ID8472
Feature activation-6.99e-01

Pos logit contributions
ranch+0.534
 Scot+0.458
 Kut+0.456
 Property+0.450
Men+0.443
Neg logit contributions
Reviewer-0.637
hift-0.623
eeper-0.590
REDACTED-0.575
anamo-0.548
Token effects
Token ID3048
Feature activation-2.32e+00

Pos logit contributions
ranch+0.503
 Scot+0.437
 Kut+0.434
 Property+0.430
Men+0.426
Neg logit contributions
Reviewer-0.511
hift-0.505
eeper-0.479
REDACTED-0.463
anamo-0.438
Token we
Token ID356
Feature activation-1.13e+00

Pos logit contributions
ranch+1.617
 Kut+1.421
 Scot+1.419
 Property+1.393
Men+1.369
Neg logit contributions
Reviewer-1.731
hift-1.680
eeper-1.601
REDACTED-1.595
 certific-1.485
Token do
Token ID466
Feature activation-1.71e+00

Pos logit contributions
ranch+0.796
 Scot+0.674
 Kut+0.673
 Property+0.661
Men+0.649
Neg logit contributions
Reviewer-0.894
hift-0.851
REDACTED-0.814
eeper-0.792
anamo-0.746
Token in
Token ID287
Feature activation-1.73e-01

Pos logit contributions
ranch+1.127
 Scot+0.964
 Kut+0.956
Men+0.937
 Property+0.935
Neg logit contributions
Reviewer-1.376
hift-1.326
REDACTED-1.250
eeper-1.247
anamo-1.175
Token Horror
Token ID22732
Feature activation+2.71e-01

Pos logit contributions
ranch+0.120
 Property+0.102
 Kut+0.102
 Scot+0.101
Men+0.099
Neg logit contributions
Reviewer-0.134
hift-0.131
REDACTED-0.122
eeper-0.122
 certific-0.113
Token Nights
Token ID27760
Feature activation+3.08e+00

Pos logit contributions
hift+0.121
Reviewer+0.120
eeper+0.106
REDACTED+0.103
 spiked+0.094
Neg logit contributions
ranch-0.272
 Kut-0.243
 Property-0.242
Men-0.241
 Scot-0.241
Token all
Token ID477
Feature activation+1.01e+00

Pos logit contributions
hift+2.223
Reviewer+2.211
eeper+2.046
REDACTED+2.012
 spiked+1.915
Neg logit contributions
ranch-2.250
 Kut-1.912
 Property-1.907
Men-1.894
 Scot-1.893
Token the
Token ID262
Feature activation-4.93e-01

Pos logit contributions
hift+0.847
Reviewer+0.844
eeper+0.789
REDACTED+0.778
 spiked+0.745
Neg logit contributions
ranch-0.618
 Kut-0.508
 Property-0.506
 Scot-0.502
Men-0.501
Token during
Token ID1141
Feature activation-1.07e+00

Pos logit contributions
ranch+1.430
 Scot+1.237
 Property+1.209
 Kut+1.204
Men+1.200
Neg logit contributions
Reviewer-1.889
hift-1.822
eeper-1.670
REDACTED-1.669
anamo-1.597
Token Washington
Token ID2669
Feature activation-1.18e+00

Pos logit contributions
ranch+0.660
 Scot+0.557
 Kut+0.549
 Property+0.548
Men+0.542
Neg logit contributions
Reviewer-0.913
hift-0.900
eeper-0.841
REDACTED-0.831
anamo-0.785
Token
Token ID447
Feature activation+1.92e-02

Pos logit contributions
ranch+0.932
 Property+0.809
 Scot+0.807
Men+0.805
 Kut+0.804
Neg logit contributions
hift-0.789
Reviewer-0.784
eeper-0.717
REDACTED-0.707
 spiked-0.659
Token
Token ID247
Feature activation-3.34e-02

Pos logit contributions
hift+0.014
Reviewer+0.014
eeper+0.013
REDACTED+0.013
 spiked+0.012
Neg logit contributions
ranch-0.014
 Property-0.012
 Kut-0.012
 Scot-0.012
Men-0.012
Tokens
Token ID82
Feature activation+7.44e-01

Pos logit contributions
ranch+0.023
 Scot+0.020
 Property+0.020
Men+0.020
 Kut+0.020
Neg logit contributions
hift-0.026
Reviewer-0.026
eeper-0.023
REDACTED-0.023
 spiked-0.022
Token practice
Token ID3357
Feature activation-3.36e-01

Pos logit contributions
Reviewer+0.633
hift+0.622
eeper+0.578
REDACTED+0.574
anamo+0.544
Neg logit contributions
ranch-0.465
 Scot-0.391
 Property-0.387
 Kut-0.385
Men-0.385
Token May
Token ID1737
Feature activation+2.26e-01

Pos logit contributions
ranch+0.204
 Scot+0.169
 Property+0.168
 Kut+0.168
Men+0.167
Neg logit contributions
Reviewer-0.290
hift-0.285
eeper-0.266
REDACTED-0.261
 Paste-0.250
Token 25
Token ID1679
Feature activation-1.33e-01

Pos logit contributions
Reviewer+0.197
hift+0.192
eeper+0.182
REDACTED+0.178
 certific+0.174
Neg logit contributions
ranch-0.134
 Scot-0.112
 Property-0.112
 Kut-0.112
Men-0.111
Token.
Token ID13
Feature activation+3.68e-01

Pos logit contributions
ranch+0.091
 Kut+0.077
 Property+0.077
Men+0.076
 Scot+0.076
Neg logit contributions
hift-0.102
Reviewer-0.101
eeper-0.094
REDACTED-0.092
 spiked-0.088
Token (
Token ID357
Feature activation-2.24e+00

Pos logit contributions
hift+0.328
Reviewer+0.326
eeper+0.307
REDACTED+0.301
 spiked+0.291
Neg logit contributions
ranch-0.208
 Kut-0.170
 Property-0.169
 Scot-0.168
Men-0.167
TokenJohn
Token ID7554
Feature activation+5.59e-01

Pos logit contributions
ranch+1.118
Men+0.916
 Scot+0.898
 Kut+0.878
WC+0.871
Neg logit contributions
hift-2.137
Reviewer-2.080
eeper-2.016
 spiked-1.932
REDACTED-1.921
Token
Token ID447
Feature activation-3.69e-02

Pos logit contributions
Reviewer+0.349
hift+0.346
eeper+0.317
REDACTED+0.312
 certific+0.295
Neg logit contributions
ranch-0.381
 Kut-0.328
 Property-0.328
 Scot-0.325
Men-0.323
Token
Token ID247
Feature activation-4.46e-01

Pos logit contributions
ranch+0.025
 Kut+0.021
 Property+0.021
 Scot+0.020
Men+0.020
Neg logit contributions
Reviewer-0.029
hift-0.029
eeper-0.027
REDACTED-0.027
 spiked-0.025
Tokens
Token ID82
Feature activation+6.86e-01

Pos logit contributions
ranch+0.310
 Kut+0.260
 Property+0.258
 Scot+0.257
Men+0.256
Neg logit contributions
Reviewer-0.346
hift-0.346
eeper-0.317
REDACTED-0.312
 certific-0.302
Token win
Token ID1592
Feature activation+2.02e+00

Pos logit contributions
Reviewer+0.537
hift+0.528
eeper+0.489
REDACTED+0.480
 certific+0.455
Neg logit contributions
ranch-0.473
 Property-0.401
 Kut-0.399
 Scot-0.398
Men-0.395
Token over
Token ID625
Feature activation+1.67e+00

Pos logit contributions
hift+1.555
Reviewer+1.550
eeper+1.437
REDACTED+1.415
 spiked+1.351
Neg logit contributions
ranch-1.387
 Kut-1.164
 Property-1.162
 Scot-1.154
Men-1.153
Token Wer
Token ID27648
Feature activation+4.21e-01

Pos logit contributions
hift+1.459
Reviewer+1.456
eeper+1.365
REDACTED+1.344
 spiked+1.292
Neg logit contributions
ranch-0.968
 Kut-0.785
 Property-0.785
 Scot-0.776
Men-0.776
Tokenrib
Token ID822
Feature activation+5.34e-01

Pos logit contributions
Reviewer+0.281
hift+0.279
eeper+0.254
REDACTED+0.251
 certific+0.237
Neg logit contributions
ranch-0.333
 Property-0.289
 Kut-0.286
Men-0.286
 Scot-0.284
Tokenee
Token ID1453
Feature activation-6.39e-01

Pos logit contributions
Reviewer+0.488
hift+0.486
eeper+0.454
REDACTED+0.450
 spiked+0.434
Neg logit contributions
ranch-0.291
 Kut-0.233
 Property-0.231
 Scot-0.230
Men-0.228
Token on
Token ID319
Feature activation+6.22e-01

Pos logit contributions
ranch+0.420
 Property+0.358
 Scot+0.353
Men+0.351
 Kut+0.351
Neg logit contributions
Reviewer-0.513
hift-0.506
eeper-0.474
REDACTED-0.463
 certific-0.443
Token Saturday
Token ID3909
Feature activation+8.68e-01

Pos logit contributions
Reviewer+0.629
hift+0.624
eeper+0.592
REDACTED+0.583
 certific+0.561
Neg logit contributions
ranch-0.279
 Property-0.214
 Kut-0.212
 Scot-0.210
Men-0.206
Token.
Token ID13
Feature activation-2.39e-01

Pos logit contributions
Reviewer+0.641
hift+0.636
eeper+0.587
REDACTED+0.579
 spiked+0.549
Neg logit contributions
ranch-0.625
 Property-0.530
 Kut-0.528
Men-0.525
 Scot-0.524
Token of
Token ID286
Feature activation-3.51e-01

Pos logit contributions
hift+0.350
Reviewer+0.346
eeper+0.321
REDACTED+0.315
 spiked+0.302
Neg logit contributions
ranch-0.358
 Kut-0.304
 Property-0.304
Men-0.300
 Scot-0.300
Token game
Token ID983
Feature activation+1.79e+00

Pos logit contributions
ranch+0.253
 Kut+0.217
 Scot+0.217
 Property+0.215
Men+0.214
Neg logit contributions
Reviewer-0.255
hift-0.253
eeper-0.237
REDACTED-0.230
anamo-0.217
Token you
Token ID345
Feature activation+3.77e-01

Pos logit contributions
hift+1.202
Reviewer+1.199
eeper+1.102
REDACTED+1.079
 spiked+1.019
Neg logit contributions
ranch-1.396
 Kut-1.203
 Property-1.199
 Scot-1.195
Men-1.193
Token would
Token ID561
Feature activation+7.08e-01

Pos logit contributions
Reviewer+0.307
hift+0.301
eeper+0.281
REDACTED+0.279
anamo+0.264
Neg logit contributions
ranch-0.246
 Kut-0.207
 Scot-0.205
 Property-0.205
Men-0.202
Token point
Token ID966
Feature activation+1.89e+00

Pos logit contributions
Reviewer+0.558
hift+0.557
eeper+0.518
REDACTED+0.513
anamo+0.485
Neg logit contributions
ranch-0.471
 Kut-0.395
 Property-0.393
 Scot-0.392
Men-0.386
Token to
Token ID284
Feature activation+6.55e-02

Pos logit contributions
hift+1.400
Reviewer+1.393
eeper+1.292
REDACTED+1.269
 spiked+1.210
Neg logit contributions
ranch-1.351
 Kut-1.143
 Property-1.140
Men-1.132
 Scot-1.131
Token when
Token ID618
Feature activation-6.11e-01

Pos logit contributions
Reviewer+0.052
hift+0.052
eeper+0.048
REDACTED+0.047
 certific+0.045
Neg logit contributions
ranch-0.043
 Kut-0.037
 Scot-0.036
 Property-0.036
Men-0.036
Token someone
Token ID2130
Feature activation+1.37e+00

Pos logit contributions
ranch+0.463
 Kut+0.406
 Scot+0.405
 Property+0.401
Men+0.397
Neg logit contributions
Reviewer-0.423
hift-0.418
eeper-0.390
REDACTED-0.378
 Paste-0.354
Token sne
Token ID10505
Feature activation-9.10e-01

Pos logit contributions
Reviewer+0.992
hift+0.972
eeper+0.898
REDACTED+0.890
anamo+0.827
Neg logit contributions
ranch-1.013
 Kut-0.891
 Scot-0.888
 Property-0.876
Men-0.862
Tokeners
Token ID364
Feature activation-7.67e-01

Pos logit contributions
ranch+0.581
 Kut+0.478
 Scot+0.476
 Property+0.472
Men+0.469
Neg logit contributions
hift-0.742
Reviewer-0.742
REDACTED-0.689
eeper-0.687
 spiked-0.653
Token and
Token ID290
Feature activation+8.54e-02

Pos logit contributions
ranch+0.504
 Kut+0.432
 Scot+0.430
 Property+0.424
Men+0.417
Neg logit contributions
Reviewer-0.614
hift-0.604
eeper-0.562
REDACTED-0.555
anamo-0.526
Token Mem
Token ID4942
Feature activation-1.27e+00

Pos logit contributions
ranch+0.737
 Kut+0.631
 Property+0.631
Men+0.625
 Scot+0.625
Neg logit contributions
hift-0.685
Reviewer-0.681
eeper-0.629
REDACTED-0.618
 spiked-0.586
Tokenc
Token ID66
Feature activation+8.42e-01

Pos logit contributions
ranch+0.830
Men+0.683
 Property+0.667
 Scot+0.665
 Kut+0.664
Neg logit contributions
Reviewer-1.030
hift-1.024
eeper-0.964
REDACTED-0.942
 certific-0.934
Tokenpy
Token ID9078
Feature activation-1.45e+00

Pos logit contributions
hift+0.621
Reviewer+0.614
eeper+0.574
REDACTED+0.561
 certific+0.554
Neg logit contributions
ranch-0.614
Men-0.516
 Kut-0.513
 Property-0.513
 Scot-0.505
TokenReading
Token ID36120
Feature activation-2.86e+00

Pos logit contributions
ranch+0.991
Men+0.838
 Property+0.835
 Kut+0.830
 Scot+0.819
Neg logit contributions
hift-1.117
Reviewer-1.095
eeper-1.034
REDACTED-1.008
 spiked-0.980
Token 55
Token ID5996
Feature activation+1.42e+00

Pos logit contributions
ranch+2.112
 Property+1.806
 Kut+1.798
Men+1.776
 Scot+1.774
Neg logit contributions
hift-2.029
Reviewer-2.018
eeper-1.865
REDACTED-1.843
 spiked-1.736
Token80
Token ID1795
Feature activation-1.48e+00

Pos logit contributions
hift+1.045
Reviewer+1.044
eeper+0.969
REDACTED+0.937
 certific+0.919
Neg logit contributions
ranch-1.019
Men-0.863
 Property-0.863
 Kut-0.859
 Scot-0.853
Token953
Token ID49649
Feature activation-1.25e+00

Pos logit contributions
ranch+1.022
Men+0.849
 Kut+0.843
 Property+0.843
 Scot+0.832
Neg logit contributions
hift-1.140
Reviewer-1.128
eeper-1.054
REDACTED-1.028
 certific-1.012
Token 16
Token ID1467
Feature activation-2.37e-01

Pos logit contributions
ranch+0.848
 Kut+0.703
Men+0.703
 Property+0.702
 Scot+0.698
Neg logit contributions
hift-0.975
Reviewer-0.968
eeper-0.901
REDACTED-0.887
 spiked-0.850
Token27
Token ID1983
Feature activation-2.43e+00

Pos logit contributions
ranch+0.142
 Property+0.117
Men+0.117
 Kut+0.116
 Scot+0.116
Neg logit contributions
hift-0.201
Reviewer-0.200
eeper-0.190
REDACTED-0.183
 certific-0.183
Token145
Token ID18781
Feature activation-1.22e+00

Pos logit contributions
ranch+1.668
Men+1.375
Aust+1.358
 Property+1.355
 Scot+1.325
Neg logit contributions
hift-1.874
Reviewer-1.856
eeper-1.771
 certific-1.721
REDACTED-1.663
Token Flat
Token ID21939
Feature activation+5.88e-01

Pos logit contributions
ranch+0.733
 Property+0.620
 Kut+0.618
 Scot+0.609
Men+0.599
Neg logit contributions
hift-1.025
Reviewer-1.022
eeper-0.957
REDACTED-0.949
 certific-0.904
Token the
Token ID262
Feature activation+5.69e-02

Pos logit contributions
Reviewer+0.342
hift+0.335
eeper+0.313
REDACTED+0.311
anamo+0.292
Neg logit contributions
ranch-0.275
 Kut-0.238
 Scot-0.233
 Property-0.233
Men-0.232
Token high
Token ID1029
Feature activation+6.36e-01

Pos logit contributions
Reviewer+0.044
hift+0.043
eeper+0.040
REDACTED+0.039
anamo+0.037
Neg logit contributions
ranch-0.039
 Kut-0.035
 Property-0.034
 Scot-0.034
Men-0.033
Token rate
Token ID2494
Feature activation+5.00e-01

Pos logit contributions
Reviewer+0.441
hift+0.435
eeper+0.404
REDACTED+0.395
 Insight+0.370
Neg logit contributions
ranch-0.484
 Kut-0.422
 Property-0.418
 Scot-0.416
Men-0.416
Token of
Token ID286
Feature activation+1.04e+00

Pos logit contributions
hift+0.358
Reviewer+0.352
eeper+0.330
REDACTED+0.321
 spiked+0.319
Neg logit contributions
ranch-0.368
 Property-0.312
 Kut-0.311
Men-0.306
 Scot-0.304
Token such
Token ID884
Feature activation-4.20e-03

Pos logit contributions
Reviewer+0.780
hift+0.773
eeper+0.717
REDACTED+0.702
 certific+0.661
Neg logit contributions
ranch-0.729
 Kut-0.623
 Property-0.620
 Scot-0.616
Men-0.613
Token digestive
Token ID34616
Feature activation-3.62e+00

Pos logit contributions
ranch+0.003
 Kut+0.002
 Scot+0.002
 Property+0.002
Men+0.002
Neg logit contributions
Reviewer-0.003
hift-0.003
eeper-0.003
REDACTED-0.003
anamo-0.003
Token cancers
Token ID25217
Feature activation+1.80e-01

Pos logit contributions
ranch+2.270
 Scot+1.891
 Kut+1.879
 Property+1.870
Men+1.856
Neg logit contributions
Reviewer-3.019
hift-2.994
eeper-2.781
REDACTED-2.722
anamo-2.634
Token are
Token ID389
Feature activation+6.32e-01

Pos logit contributions
Reviewer+0.141
hift+0.138
eeper+0.128
REDACTED+0.128
anamo+0.120
Neg logit contributions
ranch-0.122
 Kut-0.103
Men-0.102
 Scot-0.102
 Property-0.102
Token due
Token ID2233
Feature activation-1.11e+00

Pos logit contributions
Reviewer+0.481
hift+0.477
eeper+0.444
REDACTED+0.439
anamo+0.414
Neg logit contributions
ranch-0.435
 Scot-0.371
 Kut-0.371
 Property-0.370
Men-0.368
Token to
Token ID284
Feature activation-4.98e-01

Pos logit contributions
ranch+0.866
 Kut+0.743
 Property+0.743
Men+0.733
 Scot+0.732
Neg logit contributions
Reviewer-0.751
hift-0.751
eeper-0.689
REDACTED-0.674
 spiked-0.644
Token the
Token ID262
Feature activation+8.46e-02

Pos logit contributions
ranch+0.353
 Kut+0.305
 Scot+0.304
 Property+0.301
Men+0.297
Neg logit contributions
Reviewer-0.371
hift-0.363
eeper-0.338
REDACTED-0.334
anamo-0.315
TokenThis
Token ID1212
Feature activation+5.02e-01

Pos logit contributions
hift+0.086
Reviewer+0.084
eeper+0.081
REDACTED+0.078
 spiked+0.076
Neg logit contributions
ranch-0.068
Men-0.057
 Kut-0.056
 Property-0.056
 Scot-0.055
Token is
Token ID318
Feature activation-2.79e-01

Pos logit contributions
hift+0.388
Reviewer+0.388
eeper+0.360
REDACTED+0.353
 spiked+0.333
Neg logit contributions
ranch-0.341
 Property-0.290
 Kut-0.289
 Scot-0.287
Men-0.284
Token his
Token ID465
Feature activation+1.16e+00

Pos logit contributions
ranch+0.170
 Kut+0.142
 Property+0.142
 Scot+0.141
Men+0.139
Neg logit contributions
hift-0.233
Reviewer-0.233
eeper-0.219
REDACTED-0.215
 spiked-0.204
Token true
Token ID2081
Feature activation-2.25e+00

Pos logit contributions
hift+0.886
Reviewer+0.884
eeper+0.818
REDACTED+0.812
 certific+0.756
Neg logit contributions
ranch-0.803
 Property-0.685
 Kut-0.683
 Scot-0.679
Men-0.667
Token core
Token ID4755
Feature activation+9.42e-01

Pos logit contributions
ranch+1.528
 Scot+1.312
 Property+1.309
 Kut+1.283
Men+1.264
Neg logit contributions
Reviewer-1.727
hift-1.724
eeper-1.611
REDACTED-1.596
 Paste-1.482
Token compet
Token ID2307
Feature activation-3.20e+00

Pos logit contributions
hift+0.722
Reviewer+0.722
eeper+0.675
REDACTED+0.658
 spiked+0.628
Neg logit contributions
ranch-0.647
 Property-0.546
 Kut-0.544
 Scot-0.541
Men-0.538
Tokenency
Token ID1387
Feature activation-1.02e+00

Pos logit contributions
ranch+1.388
 Property+1.033
 Scot+1.026
Men+1.025
 Kut+1.013
Neg logit contributions
Reviewer-3.282
hift-3.222
REDACTED-3.049
eeper-3.021
 spiked-2.960
Token.
Token ID13
Feature activation-2.12e-01

Pos logit contributions
ranch+0.728
 Kut+0.627
 Property+0.627
 Scot+0.624
Men+0.617
Neg logit contributions
Reviewer-0.741
hift-0.740
eeper-0.685
REDACTED-0.665
 Paste-0.634
Token He
Token ID679
Feature activation-1.76e-02

Pos logit contributions
ranch+0.114
 Property+0.096
 Kut+0.095
 Scot+0.094
Men+0.093
Neg logit contributions
hift-0.195
Reviewer-0.194
eeper-0.184
REDACTED-0.179
 certific-0.174
Token is
Token ID318
Feature activation-6.63e-01

Pos logit contributions
ranch+0.012
 Kut+0.010
 Scot+0.010
 Property+0.010
Men+0.010
Neg logit contributions
Reviewer-0.014
hift-0.014
eeper-0.013
REDACTED-0.012
 Insight-0.012
Token the
Token ID262
Feature activation-2.42e-01

Pos logit contributions
ranch+0.427
 Kut+0.360
 Scot+0.360
 Property+0.360
Men+0.354
Neg logit contributions
hift-0.533
Reviewer-0.531
eeper-0.497
REDACTED-0.487
 spiked-0.462
Token price
Token ID2756
Feature activation+1.48e+00

Pos logit contributions
ranch+0.850
 Property+0.731
 Scot+0.728
 Kut+0.713
Men+0.706
Neg logit contributions
Reviewer-1.044
hift-1.033
REDACTED-0.965
eeper-0.958
 Paste-0.904
Token is
Token ID318
Feature activation+1.01e+00

Pos logit contributions
Reviewer+1.087
hift+1.082
eeper+0.996
REDACTED+0.991
 Paste+0.928
Neg logit contributions
ranch-1.058
 Property-0.906
 Scot-0.903
 Kut-0.900
Men-0.892
Token not
Token ID407
Feature activation+9.06e-01

Pos logit contributions
Reviewer+0.807
hift+0.801
eeper+0.744
REDACTED+0.740
 Paste+0.697
Neg logit contributions
ranch-0.659
 Scot-0.558
 Property-0.558
 Kut-0.558
Men-0.548
Token the
Token ID262
Feature activation+8.83e-01

Pos logit contributions
Reviewer+0.735
hift+0.731
eeper+0.679
REDACTED+0.676
anamo+0.636
Neg logit contributions
ranch-0.577
 Scot-0.489
 Kut-0.488
 Property-0.488
Men-0.477
Token most
Token ID749
Feature activation-2.42e-01

Pos logit contributions
Reviewer+0.731
hift+0.729
eeper+0.676
REDACTED+0.675
 Paste+0.635
Neg logit contributions
ranch-0.551
 Property-0.465
 Scot-0.462
 Kut-0.462
Men-0.449
Token indicative
Token ID29105
Feature activation-3.34e+00

Pos logit contributions
ranch+0.184
 Scot+0.163
 Property+0.161
 Kut+0.160
Men+0.158
Neg logit contributions
Reviewer-0.167
hift-0.165
REDACTED-0.152
eeper-0.152
anamo-0.141
Token of
Token ID286
Feature activation-4.45e-01

Pos logit contributions
ranch+1.995
 Scot+1.748
 Property+1.747
 Kut+1.710
 Wong+1.642
Neg logit contributions
Reviewer-2.765
hift-2.717
eeper-2.581
REDACTED-2.571
anamo-2.470
Token underlying
Token ID10238
Feature activation+1.22e-01

Pos logit contributions
ranch+0.277
 Property+0.233
 Scot+0.232
 Kut+0.231
Men+0.225
Neg logit contributions
Reviewer-0.369
hift-0.368
REDACTED-0.342
eeper-0.342
 Paste-0.322
Token trends
Token ID11257
Feature activation-2.07e+00

Pos logit contributions
Reviewer+0.100
hift+0.100
eeper+0.093
REDACTED+0.093
 Paste+0.087
Neg logit contributions
ranch-0.077
 Property-0.066
 Scot-0.065
 Kut-0.065
Men-0.063
Token,
Token ID11
Feature activation-1.96e-01

Pos logit contributions
ranch+1.341
 Kut+1.161
 Scot+1.158
 Property+1.140
Men+1.134
Neg logit contributions
Reviewer-1.622
hift-1.608
eeper-1.499
REDACTED-1.483
anamo-1.410
Token which
Token ID543
Feature activation-1.22e+00

Pos logit contributions
ranch+0.130
 Kut+0.111
 Property+0.111
 Scot+0.110
Men+0.108
Neg logit contributions
hift-0.154
Reviewer-0.154
eeper-0.142
REDACTED-0.141
 Paste-0.133
Token ab
Token ID450
Feature activation+7.78e-01

Pos logit contributions
Reviewer+0.260
hift+0.253
eeper+0.235
REDACTED+0.235
 Insight+0.220
Neg logit contributions
ranch-0.235
 Kut-0.200
 Property-0.198
 Scot-0.198
Men-0.196
Tokenut
Token ID315
Feature activation+1.70e+00

Pos logit contributions
Reviewer+0.847
hift+0.837
REDACTED+0.799
eeper+0.795
 certific+0.771
Neg logit contributions
ranch-0.297
Men-0.201
 Kut-0.198
 Scot-0.195
 Property-0.193
Token protected
Token ID6861
Feature activation+2.23e-01

Pos logit contributions
hift+1.342
Reviewer+1.334
eeper+1.245
REDACTED+1.223
 spiked+1.173
Neg logit contributions
ranch-1.124
 Kut-0.936
 Property-0.934
Men-0.928
 Scot-0.926
Token areas
Token ID3006
Feature activation-3.26e-01

Pos logit contributions
hift+0.163
Reviewer+0.161
eeper+0.149
REDACTED+0.149
 Paste+0.138
Neg logit contributions
ranch-0.162
 Kut-0.139
 Property-0.138
 Scot-0.137
Men-0.136
Token set
Token ID900
Feature activation-2.45e+00

Pos logit contributions
ranch+0.221
 Kut+0.188
 Property+0.187
 Scot+0.185
Men+0.183
Neg logit contributions
Reviewer-0.256
hift-0.249
eeper-0.233
REDACTED-0.232
 Paste-0.217
Token aside
Token ID7263
Feature activation-3.38e+00

Pos logit contributions
ranch+1.648
 Kut+1.418
 Property+1.411
 Scot+1.401
Men+1.384
Neg logit contributions
Reviewer-1.883
hift-1.858
eeper-1.736
REDACTED-1.717
 Paste-1.628
Token for
Token ID329
Feature activation-1.72e+00

Pos logit contributions
ranch+2.134
 Property+1.817
 Kut+1.814
 Scot+1.793
Men+1.739
Neg logit contributions
Reviewer-2.735
hift-2.688
eeper-2.536
REDACTED-2.525
anamo-2.359
Token the
Token ID262
Feature activation+3.79e-01

Pos logit contributions
ranch+1.235
 Kut+1.067
 Property+1.054
 Scot+1.046
Men+1.020
Neg logit contributions
Reviewer-1.271
hift-1.249
eeper-1.175
REDACTED-1.158
anamo-1.078
Token region
Token ID3814
Feature activation-2.68e-01

Pos logit contributions
hift+0.273
Reviewer+0.273
eeper+0.253
REDACTED+0.245
 spiked+0.230
Neg logit contributions
ranch-0.278
 Kut-0.242
 Property-0.240
 Scot-0.237
Men-0.233
Token's
Token ID338
Feature activation-1.00e+00

Pos logit contributions
ranch+0.204
 Kut+0.177
 Property+0.175
 Scot+0.175
Men+0.174
Neg logit contributions
hift-0.185
Reviewer-0.184
REDACTED-0.169
eeper-0.169
 Paste-0.157
Token extraordinary
Token ID11359
Feature activation+1.82e+00

Pos logit contributions
ranch+0.697
 Kut+0.610
 Property+0.603
 Scot+0.600
Men+0.580
Neg logit contributions
Reviewer-0.760
hift-0.755
eeper-0.698
REDACTED-0.682
anamo-0.651
Token the
Token ID262
Feature activation+1.19e-01

Pos logit contributions
Reviewer+0.325
hift+0.320
REDACTED+0.294
eeper+0.294
 certific+0.277
Neg logit contributions
ranch-0.286
 Kut-0.245
 Property-0.244
 Scot-0.241
Men-0.240
Token top
Token ID1353
Feature activation+3.18e-01

Pos logit contributions
Reviewer+0.095
hift+0.094
eeper+0.086
REDACTED+0.086
 spiked+0.081
Neg logit contributions
ranch-0.080
 Kut-0.068
 Property-0.068
 Scot-0.067
Men-0.066
Token three
Token ID1115
Feature activation-1.01e+00

Pos logit contributions
Reviewer+0.233
hift+0.231
eeper+0.217
REDACTED+0.211
 spiked+0.198
Neg logit contributions
ranch-0.229
 Kut-0.198
 Property-0.197
 Scot-0.197
Men-0.193
Token queries
Token ID20743
Feature activation-1.93e+00

Pos logit contributions
ranch+0.671
 Kut+0.568
 Property+0.567
 Scot+0.565
Men+0.555
Neg logit contributions
Reviewer-0.810
hift-0.799
eeper-0.743
REDACTED-0.733
anamo-0.693
Token so
Token ID523
Feature activation-5.34e-01

Pos logit contributions
ranch+1.302
 Kut+1.166
 Property+1.144
 Scot+1.139
Men+1.123
Neg logit contributions
Reviewer-1.506
hift-1.454
eeper-1.350
REDACTED-1.344
anamo-1.278
Token far
Token ID1290
Feature activation-3.04e+00

Pos logit contributions
ranch+0.367
 Kut+0.315
 Property+0.314
 Scot+0.312
Men+0.310
Neg logit contributions
Reviewer-0.408
hift-0.405
eeper-0.376
REDACTED-0.370
 certific-0.347
Token have
Token ID423
Feature activation-2.73e-01

Pos logit contributions
ranch+2.004
 Kut+1.743
 Property+1.735
 Scot+1.728
Men+1.689
Neg logit contributions
Reviewer-2.401
hift-2.327
REDACTED-2.175
eeper-2.169
anamo-2.047
Token been
Token ID587
Feature activation+9.52e-01

Pos logit contributions
ranch+0.189
 Kut+0.162
 Scot+0.161
 Property+0.160
Men+0.159
Neg logit contributions
Reviewer-0.208
hift-0.207
eeper-0.190
REDACTED-0.189
 certific-0.178
Token for
Token ID329
Feature activation+3.13e-01

Pos logit contributions
hift+0.694
Reviewer+0.693
eeper+0.639
REDACTED+0.629
 certific+0.592
Neg logit contributions
ranch-0.691
 Kut-0.593
 Property-0.592
Men-0.588
 Scot-0.588
Token reservations
Token ID24722
Feature activation-1.46e+00

Pos logit contributions
Reviewer+0.242
hift+0.241
eeper+0.223
REDACTED+0.220
 spiked+0.206
Neg logit contributions
ranch-0.215
 Kut-0.183
 Property-0.182
Men-0.181
 Scot-0.181
Token,
Token ID11
Feature activation+1.59e-01

Pos logit contributions
ranch+0.905
 Kut+0.777
 Property+0.774
Men+0.765
 Scot+0.762
Neg logit contributions
Reviewer-1.191
hift-1.189
REDACTED-1.099
eeper-1.097
anamo-1.032