TOP ACTIVATIONS
MAX = 3.573

which is the bread and butter for these guys . They
It warns that B . C . s regulatory
from being mol ested . BT W , Sid Davis updated
attempt at the B . C . Legislature ."
made an is - a / has - a mistake .
pres idents and co - C OO s . Win kel
in 2004 , saying the CM cannot bend or bypass rules
, to turn B . C . into a world -
sa Maj est é , CM DR Kay mel Kan ,
. C . K .: Christian s
ness of a small c ot erie of the super -
MP P grade : C - In an
also discuss possible post - Brexit trade ties .
ings , updates , breaking news , memories , old photos
ying , union - b usting heads of a major energy
As Pope John XX III warned , The Madonna
foreign policy , economics , trade and energy security .
girl , and D . C s Legends of
greenhouse gas emissions to 2 C from the pre - industrial
bundled into a carry c ot and on and off various

MIN ACTIVATIONS
MIN = -3.534

the Donald P ritz ker Entreprene ur ial Law Center
SM ISSION /// ---------------------------------------------------------------- ---------------------------------------------------------------- ------------------------ -- /// L ett re
10 . 13 71 / journal . p one . 00
he said . <|endoftext|> Meg ach urch founder Rev . Ted
). S ensitivity analysis Sens itivity analysis will be
School officials announced that the Entreprene urs hip Law Center will
the Johnson County Republicans Sp aghetti Dinner at Clear Creek Aman
Kodi builds . Follow these steps to install Net fl i
Stephen P aea , Ricky Jean Francois and Zig gy Hood
22 . S overe ign Lux ury Travel (
was awarded to Ragnar Fr isch and Jan Tin berg en
17 ) <|endoftext|> United States Supreme Court case Santa
[ Math ieu ] What kind of projects are done with
this could happen ," said Jean ette Sk j el m
Compl iment with previous ser ach locations *) step New =
and active bystand er training .
, investigators cannot pinpoint the exact mechanism that triggered the stream
theme forces for WAR M ACH INE and H ORD ES
324 . Google Scholar Crossref M asure
using Hel ium makes them very expensive . Materials that achieve

INTERVAL 1.796 - 3.573
CONTAINS 0.460%

supporters , many wearing the colours of FC Barcelona or flying
Last Prison er of War noted that political calculations were first
will prohibit any photograph , video or digital image of a
mann , San i , T omm i L ä nt
being bundled into a carry c ot and on and off

INTERVAL 0.019 - 1.796
CONTAINS 48.468%

killing the Trump marketing myst ique . So those
her enc ore of Love More and
was a big mistake . Hitchcock shot everything on as ho
an elegant upd o and red lipstick very Su zy
AND AN P FAC IL ITIES ARE TO BE SE ARCH

INTERVAL -1.758 - 0.019
CONTAINS 50.585%

V ern acular architecture is a style linked to a particular
ic ] trainers and guys on the team , so it
to say , Red Son ja is the queen of all
write . The horizontal and vertical motion
experiment of the Arab Spring , which has affected the region

INTERVAL -3.534 - -1.758
CONTAINS 0.487%

| , | : mk view | and | : m
/ 8 11 700 . pdf . Z al osh n
ive of Peoples ' Comm iss ariat of Defence No .
( 0 . 0 f ) ; // The grid resulting
ue , along with his prof use apologies to the Motion
Token which
Token ID543
Feature activation-2.98e-02

Pos logit contributions
ucl+0.133
ACA+0.123
UL+0.118
UC+0.116
AMA+0.113
Neg logit contributions
 Spo-0.079
furt-0.077
 reprodu-0.070
irlf-0.069
-0.068
Token is
Token ID318
Feature activation-6.23e-01

Pos logit contributions
 Spo+0.058
furt+0.057
irlf+0.051
 reprodu+0.051
+0.050
Neg logit contributions
ucl-0.101
ACA-0.093
UL-0.090
UC-0.088
AMA-0.086
Token the
Token ID262
Feature activation+2.26e-02

Pos logit contributions
 Spo+1.189
furt+1.141
 reprodu+1.042
irlf+1.020
+0.997
Neg logit contributions
ucl-2.175
ACA-1.983
UL-1.921
UC-1.885
AMA-1.840
Token bread
Token ID8509
Feature activation+7.08e-01

Pos logit contributions
ucl+0.067
ACA+0.061
UL+0.058
UC+0.057
AMA+0.055
Neg logit contributions
 Spo-0.054
furt-0.053
 reprodu-0.049
irlf-0.049
-0.048
Token and
Token ID290
Feature activation+1.93e+00

Pos logit contributions
ucl+1.989
ACA+1.794
UL+1.717
UC+1.683
AMA+1.634
Neg logit contributions
 Spo-1.798
furt-1.733
irlf-1.629
 reprodu-1.628
-1.588
Token butter
Token ID9215
Feature activation+3.57e+00

Pos logit contributions
ucl+4.500
ACA+4.048
UL+3.707
AMA+3.554
UC+3.546
Neg logit contributions
furt-5.522
 Spo-5.384
irlf-5.104
-5.046
blance-5.041
Token for
Token ID329
Feature activation+6.99e-01

Pos logit contributions
ucl+8.836
ACA+8.300
UL+7.806
UC+7.627
AMA+7.518
Neg logit contributions
 Spo-8.729
furt-8.636
irlf-8.391
ptoms-8.034
vironments-7.962
Token these
Token ID777
Feature activation+4.95e-01

Pos logit contributions
ucl+2.243
ACA+2.068
UL+1.984
UC+1.944
AMA+1.900
Neg logit contributions
 Spo-1.464
furt-1.456
irlf-1.303
-1.286
ptoms-1.280
Token guys
Token ID3730
Feature activation-5.13e-01

Pos logit contributions
ucl+1.415
ACA+1.282
UL+1.225
UC+1.198
AMA+1.159
Neg logit contributions
 Spo-1.238
furt-1.217
irlf-1.124
 reprodu-1.108
-1.094
Token.
Token ID13
Feature activation+9.72e-02

Pos logit contributions
 Spo+1.212
furt+1.172
 reprodu+1.093
irlf+1.071
+1.052
Neg logit contributions
ucl-1.557
ACA-1.397
UL-1.347
UC-1.316
AMA-1.279
Token They
Token ID1119
Feature activation+2.84e-01

Pos logit contributions
ucl+0.285
ACA+0.260
UL+0.250
UC+0.244
AMA+0.238
Neg logit contributions
 Spo-0.233
furt-0.231
 reprodu-0.217
irlf-0.215
ptoms-0.208
TokenIt
Token ID1026
Feature activation+9.74e-02

Pos logit contributions
ucl+0.574
ACA+0.514
UL+0.496
UC+0.485
AMA+0.469
Neg logit contributions
 Spo-0.492
furt-0.476
 reprodu-0.446
irlf-0.439
-0.429
Token warns
Token ID22145
Feature activation+1.24e+00

Pos logit contributions
ucl+0.340
ACA+0.306
UL+0.300
UC+0.295
AMA+0.286
Neg logit contributions
 Spo-0.188
furt-0.180
irlf-0.162
 reprodu-0.159
-0.158
Token that
Token ID326
Feature activation-2.29e-01

Pos logit contributions
ucl+3.697
ACA+3.252
UL+3.204
UC+3.105
AMA+2.982
Neg logit contributions
 Spo-2.949
furt-2.921
irlf-2.637
 reprodu-2.626
-2.608
Token B
Token ID347
Feature activation+1.13e+00

Pos logit contributions
 Spo+0.482
furt+0.468
 reprodu+0.426
irlf+0.420
+0.413
Neg logit contributions
ucl-0.757
ACA-0.683
UL-0.664
UC-0.649
AMA-0.630
Token.
Token ID13
Feature activation+1.86e+00

Pos logit contributions
ucl+3.162
ACA+2.847
UL+2.754
UC+2.696
AMA+2.601
Neg logit contributions
 Spo-2.923
furt-2.836
 reprodu-2.659
irlf-2.595
-2.558
TokenC
Token ID34
Feature activation+3.38e+00

Pos logit contributions
ucl+4.390
ACA+4.067
UL+3.889
UC+3.853
AMA+3.614
Neg logit contributions
 Spo-5.356
furt-5.237
irlf-5.097
 reprodu-4.982
ptoms-4.869
Token.
Token ID13
Feature activation+1.25e+00

Pos logit contributions
ucl+7.772
ACA+6.655
UL+6.618
UC+6.480
AMA+5.918
Neg logit contributions
 Spo-9.617
furt-9.587
 reprodu-8.834
irlf-8.773
-8.717
Token
Token ID447
Feature activation+5.18e-01

Pos logit contributions
ucl+4.053
ACA+3.643
UL+3.603
UC+3.537
AMA+3.355
Neg logit contributions
furt-2.608
 Spo-2.603
irlf-2.318
 reprodu-2.306
-2.279
Token
Token ID247
Feature activation+6.21e-01

Pos logit contributions
ucl+1.522
ACA+1.363
UL+1.312
UC+1.284
AMA+1.248
Neg logit contributions
 Spo-1.276
furt-1.232
 reprodu-1.154
irlf-1.122
-1.112
Tokens
Token ID82
Feature activation+4.58e-01

Pos logit contributions
ucl+1.936
ACA+1.729
UL+1.683
UC+1.647
AMA+1.583
Neg logit contributions
 Spo-1.428
furt-1.377
 reprodu-1.278
irlf-1.267
-1.242
Token regulatory
Token ID11344
Feature activation+7.41e-01

Pos logit contributions
ucl+1.372
ACA+1.225
UL+1.192
UC+1.165
AMA+1.118
Neg logit contributions
 Spo-1.097
furt-1.076
 reprodu-0.987
irlf-0.980
-0.963
Token from
Token ID422
Feature activation-2.80e-01

Pos logit contributions
 Spo+1.533
furt+1.478
 reprodu+1.366
irlf+1.340
+1.314
Neg logit contributions
ucl-2.268
ACA-2.050
UL-1.980
UC-1.939
AMA-1.887
Token being
Token ID852
Feature activation-7.02e-02

Pos logit contributions
 Spo+0.569
furt+0.552
irlf+0.497
 reprodu+0.492
+0.488
Neg logit contributions
ucl-0.944
ACA-0.855
UL-0.827
UC-0.812
AMA-0.791
Token mol
Token ID18605
Feature activation-5.39e-01

Pos logit contributions
 Spo+0.153
furt+0.148
 reprodu+0.134
irlf+0.134
+0.133
Neg logit contributions
ucl-0.226
ACA-0.204
UL-0.197
UC-0.193
AMA-0.188
Tokenested
Token ID7287
Feature activation-5.37e-01

Pos logit contributions
 Spo+1.351
furt+1.310
 reprodu+1.222
irlf+1.200
+1.187
Neg logit contributions
ucl-1.566
ACA-1.395
UL-1.343
UC-1.312
AMA-1.270
Token.
Token ID13
Feature activation-1.95e-02

Pos logit contributions
 Spo+1.231
furt+1.188
 reprodu+1.104
irlf+1.083
+1.063
Neg logit contributions
ucl-1.671
ACA-1.504
UL-1.451
UC-1.420
AMA-1.380
TokenBT
Token ID19313
Feature activation+3.35e+00

Pos logit contributions
 Spo+0.045
furt+0.044
 reprodu+0.041
irlf+0.040
+0.039
Neg logit contributions
ucl-0.060
ACA-0.054
UL-0.052
UC-0.051
AMA-0.050
TokenW
Token ID54
Feature activation+1.70e+00

Pos logit contributions
ucl+8.846
ACA+8.176
UL+7.861
UC+7.598
AMA+7.434
Neg logit contributions
furt-8.247
 Spo-8.181
 reprodu-7.716
irlf-7.576
-7.554
Token,
Token ID11
Feature activation+4.89e-01

Pos logit contributions
ucl+4.882
ACA+4.474
UL+4.288
UC+4.178
AMA+4.095
Neg logit contributions
 Spo-4.054
furt-4.021
 reprodu-3.678
irlf-3.667
-3.617
Token Sid
Token ID15686
Feature activation-1.33e+00

Pos logit contributions
ucl+1.591
ACA+1.447
UL+1.400
UC+1.369
AMA+1.339
Neg logit contributions
 Spo-1.021
furt-1.004
 reprodu-0.911
irlf-0.906
-0.898
Token Davis
Token ID7802
Feature activation+2.98e-02

Pos logit contributions
 Spo+2.917
furt+2.791
 reprodu+2.597
irlf+2.506
+2.473
Neg logit contributions
ucl-4.222
ACA-3.841
UL-3.691
UC-3.601
AMA-3.527
Token updated
Token ID6153
Feature activation+8.13e-01

Pos logit contributions
ucl+0.100
ACA+0.090
UL+0.088
UC+0.086
AMA+0.084
Neg logit contributions
 Spo-0.061
furt-0.059
 reprodu-0.054
irlf-0.053
-0.052
Token attempt
Token ID2230
Feature activation-5.03e-01

Pos logit contributions
 Spo+0.271
furt+0.263
 reprodu+0.244
irlf+0.242
+0.238
Neg logit contributions
ucl-0.345
ACA-0.309
UL-0.298
UC-0.291
AMA-0.282
Token at
Token ID379
Feature activation-6.87e-01

Pos logit contributions
 Spo+0.965
furt+0.925
 reprodu+0.843
irlf+0.826
+0.808
Neg logit contributions
ucl-1.755
ACA-1.596
UL-1.549
UC-1.520
AMA-1.481
Token the
Token ID262
Feature activation+4.39e-01

Pos logit contributions
 Spo+1.547
furt+1.498
 reprodu+1.383
irlf+1.362
+1.338
Neg logit contributions
ucl-2.161
ACA-1.947
UL-1.881
UC-1.841
AMA-1.790
Token B
Token ID347
Feature activation+2.09e+00

Pos logit contributions
ucl+1.290
ACA+1.150
UL+1.111
UC+1.092
AMA+1.046
Neg logit contributions
 Spo-1.062
furt-1.043
 reprodu-0.962
irlf-0.958
-0.947
Token.
Token ID13
Feature activation+2.09e+00

Pos logit contributions
ucl+5.515
ACA+5.040
UL+4.919
UC+4.747
AMA+4.598
Neg logit contributions
 Spo-5.551
furt-5.338
 reprodu-5.075
irlf-4.922
-4.884
TokenC
Token ID34
Feature activation+3.30e+00

Pos logit contributions
ucl+4.693
ACA+4.291
UC+4.147
UL+4.114
MT+3.805
Neg logit contributions
 Spo-5.984
furt-5.922
ptoms-5.642
irlf-5.627
 reprodu-5.538
Token.
Token ID13
Feature activation+1.58e+00

Pos logit contributions
ucl+8.266
ACA+7.291
UL+7.144
UC+7.058
AMA+6.606
Neg logit contributions
 Spo-8.705
furt-8.542
 reprodu-8.028
irlf-7.824
ptoms-7.775
Token Legislature
Token ID20815
Feature activation+1.59e-01

Pos logit contributions
ucl+4.570
ACA+4.154
UL+4.044
UC+4.020
AMA+3.760
Neg logit contributions
 Spo-3.733
furt-3.629
ptoms-3.379
 reprodu-3.367
irlf-3.355
Token."
Token ID526
Feature activation-2.08e-01

Pos logit contributions
ucl+0.492
ACA+0.446
UL+0.430
UC+0.424
AMA+0.410
Neg logit contributions
 Spo-0.366
furt-0.352
 reprodu-0.325
irlf-0.324
-0.316
Token\n
Token ID198
Feature activation+7.97e-02

Pos logit contributions
 Spo+0.437
furt+0.418
 reprodu+0.385
irlf+0.378
+0.371
Neg logit contributions
ucl-0.684
ACA-0.620
UL-0.598
UC-0.587
AMA-0.570
Token\n
Token ID198
Feature activation+3.86e-01

Pos logit contributions
ucl+0.242
ACA+0.218
UL+0.212
UC+0.207
AMA+0.201
Neg logit contributions
 Spo-0.189
furt-0.182
irlf-0.173
 reprodu-0.172
ptoms-0.165
Token made
Token ID925
Feature activation-7.97e-01

Pos logit contributions
 Spo+0.329
furt+0.319
irlf+0.288
 reprodu+0.281
+0.280
Neg logit contributions
ucl-0.577
ACA-0.522
UL-0.508
UC-0.498
AMA-0.483
Token an
Token ID281
Feature activation-2.48e-01

Pos logit contributions
 Spo+1.569
furt+1.505
 reprodu+1.382
irlf+1.348
+1.320
Neg logit contributions
ucl-2.735
ACA-2.489
UL-2.408
UC-2.363
AMA-2.304
Token is
Token ID318
Feature activation+7.66e-01

Pos logit contributions
 Spo+0.540
furt+0.523
 reprodu+0.481
irlf+0.479
+0.466
Neg logit contributions
ucl-0.796
ACA-0.719
UL-0.696
UC-0.680
AMA-0.664
Token-
Token ID12
Feature activation+8.99e-01

Pos logit contributions
ucl+2.231
ACA+1.991
UL+1.908
UC+1.872
AMA+1.813
Neg logit contributions
 Spo-1.897
furt-1.828
 reprodu-1.701
irlf-1.681
ptoms-1.671
Tokena
Token ID64
Feature activation+9.85e-01

Pos logit contributions
ucl+2.883
ACA+2.595
UL+2.509
UC+2.475
AMA+2.405
Neg logit contributions
 Spo-1.978
furt-1.851
 reprodu-1.740
irlf-1.721
ptoms-1.700
Token/
Token ID14
Feature activation+3.29e+00

Pos logit contributions
ucl+2.891
ACA+2.574
UL+2.492
UC+2.443
AMA+2.349
Neg logit contributions
 Spo-2.403
furt-2.333
 reprodu-2.159
irlf-2.148
ptoms-2.114
Tokenhas
Token ID10134
Feature activation+1.39e+00

Pos logit contributions
ucl+10.035
ACA+8.954
UL+8.591
UC+8.405
AMA+8.162
Neg logit contributions
 Spo-7.069
furt-6.500
 reprodu-6.223
irlf-6.157
ptoms-5.984
Token-
Token ID12
Feature activation-4.72e-01

Pos logit contributions
ucl+3.989
ACA+3.588
UL+3.432
UC+3.358
AMA+3.271
Neg logit contributions
 Spo-3.450
furt-3.332
 reprodu-3.111
irlf-3.100
-3.016
Tokena
Token ID64
Feature activation+3.50e-01

Pos logit contributions
 Spo+0.949
furt+0.926
 reprodu+0.840
irlf+0.827
+0.813
Neg logit contributions
ucl-1.587
ACA-1.439
UL-1.399
UC-1.367
AMA-1.324
Token mistake
Token ID7457
Feature activation-3.52e-01

Pos logit contributions
ucl+1.068
ACA+0.959
UL+0.926
UC+0.906
AMA+0.878
Neg logit contributions
 Spo-0.822
furt-0.792
 reprodu-0.735
irlf-0.729
-0.713
Token.
Token ID13
Feature activation-7.51e-02

Pos logit contributions
 Spo+0.806
furt+0.781
 reprodu+0.719
irlf+0.713
+0.698
Neg logit contributions
ucl-1.093
ACA-0.985
UL-0.952
UC-0.931
AMA-0.905
Tokenpres
Token ID18302
Feature activation-3.34e-01

Pos logit contributions
ucl+1.244
ACA+1.097
UL+1.049
UC+1.021
AMA+0.983
Neg logit contributions
 Spo-1.452
furt-1.397
 reprodu-1.334
irlf-1.314
ptoms-1.288
Tokenidents
Token ID3231
Feature activation+2.22e-01

Pos logit contributions
 Spo+0.834
furt+0.805
 reprodu+0.754
irlf+0.741
+0.728
Neg logit contributions
ucl-0.975
ACA-0.871
UL-0.840
UC-0.819
AMA-0.795
Token and
Token ID290
Feature activation+4.52e-01

Pos logit contributions
ucl+0.648
ACA+0.584
UL+0.563
UC+0.551
AMA+0.534
Neg logit contributions
 Spo-0.548
furt-0.529
 reprodu-0.495
irlf-0.487
ptoms-0.477
Token co
Token ID763
Feature activation+8.39e-03

Pos logit contributions
ucl+1.295
ACA+1.169
UL+1.125
UC+1.097
AMA+1.065
Neg logit contributions
 Spo-1.107
furt-1.096
irlf-1.006
ptoms-1.005
 reprodu-1.004
Token-
Token ID12
Feature activation+1.39e-01

Pos logit contributions
ucl+0.025
ACA+0.022
UL+0.021
UC+0.021
AMA+0.020
Neg logit contributions
 Spo-0.021
furt-0.020
 reprodu-0.019
irlf-0.018
-0.018
TokenC
Token ID34
Feature activation+3.23e+00

Pos logit contributions
ucl+0.378
ACA+0.336
UL+0.322
UC+0.314
AMA+0.304
Neg logit contributions
 Spo-0.371
furt-0.359
 reprodu-0.338
irlf-0.333
-0.327
TokenOO
Token ID6684
Feature activation+9.81e-01

Pos logit contributions
ucl+6.568
ACA+6.129
UL+5.893
UC+5.583
MT+5.198
Neg logit contributions
 Spo-10.087
 reprodu-9.604
furt-9.388
-9.159
ptoms-9.069
Tokens
Token ID82
Feature activation+4.38e-01

Pos logit contributions
ucl+2.797
ACA+2.535
UL+2.461
UC+2.398
AMA+2.331
Neg logit contributions
 Spo-2.438
furt-2.347
 reprodu-2.201
irlf-2.175
ptoms-2.112
Token.
Token ID13
Feature activation+3.08e-01

Pos logit contributions
ucl+1.265
ACA+1.138
UL+1.100
UC+1.075
AMA+1.040
Neg logit contributions
 Spo-1.088
furt-1.053
 reprodu-0.980
irlf-0.966
ptoms-0.946
Token Win
Token ID7178
Feature activation+7.70e-01

Pos logit contributions
ucl+0.949
ACA+0.868
UL+0.838
UC+0.820
AMA+0.796
Neg logit contributions
 Spo-0.694
furt-0.682
 reprodu-0.646
irlf-0.633
ptoms-0.613
Tokenkel
Token ID7750
Feature activation+4.84e-01

Pos logit contributions
ucl+1.774
ACA+1.512
UL+1.447
UC+1.389
AMA+1.331
Neg logit contributions
 Spo-2.431
furt-2.293
 reprodu-2.256
irlf-2.208
-2.193
Token in
Token ID287
Feature activation+1.89e-01

Pos logit contributions
ucl+0.408
ACA+0.368
UL+0.358
UC+0.350
AMA+0.340
Neg logit contributions
 Spo-0.267
furt-0.261
 reprodu-0.237
irlf-0.236
-0.230
Token 2004
Token ID5472
Feature activation+5.01e-01

Pos logit contributions
ucl+0.585
ACA+0.524
UL+0.507
UC+0.496
AMA+0.481
Neg logit contributions
 Spo-0.432
furt-0.421
 reprodu-0.388
irlf-0.382
-0.377
Token,
Token ID11
Feature activation+1.63e-01

Pos logit contributions
ucl+1.563
ACA+1.411
UL+1.364
UC+1.338
AMA+1.293
Neg logit contributions
 Spo-1.136
furt-1.099
 reprodu-1.015
irlf-1.009
-0.982
Token saying
Token ID2282
Feature activation+5.08e-01

Pos logit contributions
ucl+0.522
ACA+0.474
UL+0.458
UC+0.448
AMA+0.435
Neg logit contributions
 Spo-0.354
furt-0.347
 reprodu-0.313
irlf-0.313
-0.309
Token the
Token ID262
Feature activation-3.33e-02

Pos logit contributions
ucl+1.651
ACA+1.492
UL+1.452
UC+1.418
AMA+1.376
Neg logit contributions
 Spo-1.067
furt-1.048
irlf-0.943
 reprodu-0.942
-0.932
Token CM
Token ID16477
Feature activation+3.22e+00

Pos logit contributions
 Spo+0.080
furt+0.078
 reprodu+0.072
irlf+0.071
+0.070
Neg logit contributions
ucl-0.100
ACA-0.089
UL-0.086
UC-0.084
AMA-0.082
Token cannot
Token ID2314
Feature activation+1.49e+00

Pos logit contributions
ucl+10.049
ACA+9.123
UC+9.039
UL+8.953
AMA+8.472
Neg logit contributions
 Spo-6.252
furt-6.206
irlf-5.754
ptoms-5.517
 reprodu-5.387
Token bend
Token ID19396
Feature activation-2.98e-01

Pos logit contributions
ucl+4.312
ACA+3.867
UL+3.717
UC+3.651
AMA+3.497
Neg logit contributions
furt-3.574
 Spo-3.532
irlf-3.187
-3.157
ptoms-3.133
Token or
Token ID393
Feature activation+7.68e-01

Pos logit contributions
 Spo+0.581
furt+0.556
 reprodu+0.511
irlf+0.498
+0.487
Neg logit contributions
ucl-1.028
ACA-0.935
UL-0.905
UC-0.888
AMA-0.866
Token bypass
Token ID17286
Feature activation+2.79e-01

Pos logit contributions
ucl+2.294
ACA+2.060
UL+1.976
UC+1.937
AMA+1.869
Neg logit contributions
 Spo-1.830
furt-1.805
irlf-1.636
 reprodu-1.620
ptoms-1.606
Token rules
Token ID3173
Feature activation+8.78e-01

Pos logit contributions
ucl+0.911
ACA+0.827
UL+0.802
UC+0.784
AMA+0.762
Neg logit contributions
 Spo-0.594
furt-0.575
 reprodu-0.524
irlf-0.519
ptoms-0.508
Token,
Token ID11
Feature activation+1.96e-01

Pos logit contributions
ucl+0.200
ACA+0.179
UL+0.173
UC+0.169
AMA+0.164
Neg logit contributions
 Spo-0.158
furt-0.153
 reprodu-0.142
irlf-0.140
-0.137
Token to
Token ID284
Feature activation+4.23e-01

Pos logit contributions
ucl+0.610
ACA+0.551
UL+0.530
UC+0.520
AMA+0.505
Neg logit contributions
 Spo-0.449
furt-0.435
 reprodu-0.400
irlf-0.396
-0.388
Token turn
Token ID1210
Feature activation+1.01e-01

Pos logit contributions
ucl+1.283
ACA+1.158
UL+1.117
UC+1.093
AMA+1.064
Neg logit contributions
furt-0.982
 Spo-0.980
 reprodu-0.874
irlf-0.872
-0.863
Token B
Token ID347
Feature activation+1.53e+00

Pos logit contributions
ucl+0.333
ACA+0.302
UL+0.292
UC+0.286
AMA+0.279
Neg logit contributions
 Spo-0.213
furt-0.206
 reprodu-0.189
irlf-0.187
-0.183
Token.
Token ID13
Feature activation+1.63e+00

Pos logit contributions
ucl+4.073
ACA+3.715
UL+3.555
UC+3.501
AMA+3.371
Neg logit contributions
 Spo-4.151
furt-3.953
 reprodu-3.761
irlf-3.675
-3.627
TokenC
Token ID34
Feature activation+3.19e+00

Pos logit contributions
ucl+3.888
ACA+3.622
UL+3.404
UC+3.385
AMA+3.185
Neg logit contributions
 Spo-4.673
furt-4.580
irlf-4.444
 reprodu-4.371
ptoms-4.262
Token.
Token ID13
Feature activation+9.73e-01

Pos logit contributions
ucl+7.797
ACA+6.930
UL+6.695
UC+6.543
AMA+6.245
Neg logit contributions
 Spo-8.722
furt-8.474
 reprodu-7.980
irlf-7.937
-7.781
Token into
Token ID656
Feature activation+6.35e-01

Pos logit contributions
ucl+3.010
ACA+2.724
UL+2.633
UC+2.580
AMA+2.504
Neg logit contributions
 Spo-2.206
furt-2.147
irlf-1.980
 reprodu-1.975
-1.931
Token a
Token ID257
Feature activation+3.10e-01

Pos logit contributions
ucl+2.171
ACA+1.978
UL+1.914
UC+1.878
AMA+1.837
Neg logit contributions
 Spo-1.244
furt-1.201
 reprodu-1.085
irlf-1.080
-1.055
Token world
Token ID995
Feature activation+6.41e-01

Pos logit contributions
ucl+0.904
ACA+0.808
UL+0.777
UC+0.762
AMA+0.740
Neg logit contributions
 Spo-0.764
furt-0.742
irlf-0.691
 reprodu-0.683
ptoms-0.672
Token-
Token ID12
Feature activation+2.37e-01

Pos logit contributions
ucl+1.839
ACA+1.638
UL+1.576
UC+1.545
AMA+1.500
Neg logit contributions
 Spo-1.615
furt-1.559
irlf-1.452
 reprodu-1.448
-1.419
Token sa
Token ID473
Feature activation-5.09e-02

Pos logit contributions
 Spo+0.142
furt+0.138
 reprodu+0.128
irlf+0.126
+0.124
Neg logit contributions
ucl-0.188
ACA-0.168
UL-0.162
UC-0.159
AMA-0.154
Token Maj
Token ID12390
Feature activation-5.34e-01

Pos logit contributions
 Spo+0.118
furt+0.115
 reprodu+0.106
irlf+0.105
+0.103
Neg logit contributions
ucl-0.157
ACA-0.141
UL-0.136
UC-0.133
AMA-0.129
Tokenest
Token ID395
Feature activation-6.51e-01

Pos logit contributions
 Spo+1.625
furt+1.586
 reprodu+1.501
irlf+1.480
+1.457
Neg logit contributions
ucl-1.254
ACA-1.090
UL-1.037
UC-1.006
AMA-0.966
Tokené
Token ID2634
Feature activation-2.15e+00

Pos logit contributions
 Spo+1.494
furt+1.444
 reprodu+1.347
irlf+1.315
+1.297
Neg logit contributions
ucl-2.013
ACA-1.805
UL-1.749
UC-1.708
AMA-1.657
Token,
Token ID11
Feature activation+9.51e-02

Pos logit contributions
 Spo+4.615
furt+4.384
 reprodu+4.142
irlf+3.927
+3.878
Neg logit contributions
ucl-6.819
ACA-6.154
UL-5.929
UC-5.829
AMA-5.646
TokenCM
Token ID24187
Feature activation+3.19e+00

Pos logit contributions
ucl+0.305
ACA+0.276
UL+0.267
UC+0.261
AMA+0.254
Neg logit contributions
 Spo-0.207
furt-0.201
 reprodu-0.185
irlf-0.182
-0.179
TokenDR
Token ID7707
Feature activation+1.68e+00

Pos logit contributions
ucl+4.749
ACA+3.914
UL+3.897
UC+3.500
AMA+3.412
Neg logit contributions
 Spo-11.609
furt-11.410
 reprodu-10.905
irlf-10.858
-10.677
Token Kay
Token ID17356
Feature activation+1.20e+00

Pos logit contributions
ucl+5.871
ACA+5.386
UL+5.267
UC+5.155
AMA+5.046
Neg logit contributions
 Spo-2.901
furt-2.855
irlf-2.561
 reprodu-2.550
ptoms-2.453
Tokenmel
Token ID17694
Feature activation+1.24e+00

Pos logit contributions
ucl+3.400
ACA+3.005
UL+2.913
UC+2.812
AMA+2.718
Neg logit contributions
 Spo-3.090
furt-2.940
 reprodu-2.787
irlf-2.732
-2.710
Token Kan
Token ID14248
Feature activation+8.41e-01

Pos logit contributions
ucl+4.403
ACA+4.033
UL+3.930
UC+3.858
AMA+3.744
Neg logit contributions
 Spo-2.187
furt-2.125
irlf-1.926
 reprodu-1.909
-1.843
Token,
Token ID11
Feature activation+9.49e-01

Pos logit contributions
ucl+2.574
ACA+2.324
UL+2.252
UC+2.196
AMA+2.137
Neg logit contributions
 Spo-1.935
furt-1.871
 reprodu-1.723
irlf-1.709
-1.677
Token.
Token ID13
Feature activation+4.59e-01

Pos logit contributions
 Spo+0.263
furt+0.254
 reprodu+0.235
irlf+0.230
+0.227
Neg logit contributions
ucl-0.373
ACA-0.336
UL-0.325
UC-0.319
AMA-0.310
Token\n
Token ID198
Feature activation+3.97e-01

Pos logit contributions
ucl+1.386
ACA+1.254
UL+1.213
UC+1.189
AMA+1.155
Neg logit contributions
 Spo-1.072
furt-1.053
 reprodu-0.990
irlf-0.971
ptoms-0.954
Token\n
Token ID198
Feature activation+5.74e-01

Pos logit contributions
ucl+1.186
ACA+1.080
UL+1.050
UC+1.028
AMA+0.999
Neg logit contributions
 Spo-0.954
furt-0.914
irlf-0.884
 reprodu-0.879
ptoms-0.842
TokenC
Token ID34
Feature activation+2.15e+00

Pos logit contributions
ucl+1.444
ACA+1.281
UL+1.228
UC+1.196
AMA+1.155
Neg logit contributions
 Spo-1.657
furt-1.605
 reprodu-1.531
irlf-1.507
ptoms-1.473
Token.
Token ID13
Feature activation+1.73e+00

Pos logit contributions
ucl+4.462
ACA+3.866
UL+3.769
UC+3.607
AMA+3.400
Neg logit contributions
 Spo-7.097
furt-6.885
 reprodu-6.586
ptoms-6.414
-6.396
TokenK
Token ID42
Feature activation+3.10e+00

Pos logit contributions
ucl+4.782
ACA+4.394
UL+4.303
UC+4.208
AMA+3.999
Neg logit contributions
furt-4.257
 Spo-4.234
 reprodu-4.005
irlf-3.999
ptoms-3.972
Token.:
Token ID11207
Feature activation+1.19e+00

Pos logit contributions
ucl+6.666
ACA+5.748
UL+5.702
UC+5.450
AMA+5.162
Neg logit contributions
 Spo-9.408
furt-9.162
 reprodu-8.769
irlf-8.605
ptoms-8.520
Token Christian
Token ID4302
Feature activation+4.45e-01

Pos logit contributions
ucl+3.472
ACA+3.121
UL+3.045
UC+2.967
AMA+2.930
Neg logit contributions
 Spo-2.751
furt-2.750
 reprodu-2.602
ptoms-2.561
irlf-2.556
Token
Token ID447
Feature activation-3.55e-01

Pos logit contributions
ucl+1.492
ACA+1.347
UL+1.306
UC+1.284
AMA+1.248
Neg logit contributions
 Spo-0.903
furt-0.874
 reprodu-0.795
irlf-0.790
ptoms-0.779
Token
Token ID247
Feature activation-6.55e-03

Pos logit contributions
 Spo+0.876
furt+0.848
 reprodu+0.793
irlf+0.779
+0.766
Neg logit contributions
ucl-1.040
ACA-0.930
UL-0.895
UC-0.874
AMA-0.848
Tokens
Token ID82
Feature activation-3.86e-01

Pos logit contributions
 Spo+0.015
furt+0.014
 reprodu+0.013
irlf+0.013
+0.013
Neg logit contributions
ucl-0.021
ACA-0.018
UL-0.018
UC-0.017
AMA-0.017
Tokenness
Token ID1108
Feature activation+4.50e-01

Pos logit contributions
 Spo+0.547
furt+0.529
 reprodu+0.494
irlf+0.484
+0.476
Neg logit contributions
ucl-0.678
ACA-0.608
UL-0.585
UC-0.572
AMA-0.556
Token of
Token ID286
Feature activation-5.46e-01

Pos logit contributions
ucl+1.383
ACA+1.243
UL+1.206
UC+1.177
AMA+1.142
Neg logit contributions
 Spo-1.034
furt-1.010
irlf-0.921
 reprodu-0.920
-0.900
Token a
Token ID257
Feature activation-1.92e-01

Pos logit contributions
 Spo+1.174
furt+1.131
 reprodu+1.045
irlf+1.023
+1.003
Neg logit contributions
ucl-1.777
ACA-1.608
UL-1.553
UC-1.522
AMA-1.482
Token small
Token ID1402
Feature activation+7.64e-01

Pos logit contributions
 Spo+0.450
furt+0.437
 reprodu+0.405
irlf+0.400
+0.392
Neg logit contributions
ucl-0.584
ACA-0.524
UL-0.506
UC-0.494
AMA-0.480
Token c
Token ID269
Feature activation+1.64e+00

Pos logit contributions
ucl+2.301
ACA+2.059
UL+1.988
UC+1.948
AMA+1.867
Neg logit contributions
 Spo-1.816
furt-1.768
 reprodu-1.622
irlf-1.620
ptoms-1.590
Tokenot
Token ID313
Feature activation+3.10e+00

Pos logit contributions
ucl+4.354
ACA+3.794
UL+3.670
UC+3.563
AMA+3.412
Neg logit contributions
 Spo-4.526
furt-4.438
 reprodu-4.147
-4.065
ptoms-4.024
Tokenerie
Token ID18287
Feature activation+5.48e-01

Pos logit contributions
ucl+7.968
ACA+7.007
UL+6.759
UC+6.626
AMA+6.351
Neg logit contributions
 Spo-8.108
furt-7.834
ptoms-7.324
irlf-7.293
-7.218
Token of
Token ID286
Feature activation+3.90e-01

Pos logit contributions
ucl+1.704
ACA+1.536
UL+1.484
UC+1.456
AMA+1.411
Neg logit contributions
 Spo-1.243
furt-1.206
 reprodu-1.108
irlf-1.103
-1.073
Token the
Token ID262
Feature activation+6.12e-01

Pos logit contributions
ucl+1.142
ACA+1.023
UL+0.985
UC+0.962
AMA+0.930
Neg logit contributions
 Spo-0.947
furt-0.930
 reprodu-0.854
irlf-0.852
-0.838
Token super
Token ID2208
Feature activation-1.25e+00

Pos logit contributions
ucl+1.765
ACA+1.580
UL+1.525
UC+1.489
AMA+1.431
Neg logit contributions
 Spo-1.512
furt-1.480
 reprodu-1.368
irlf-1.363
-1.338
Token-
Token ID12
Feature activation-8.97e-01

Pos logit contributions
 Spo+3.580
furt+3.482
 reprodu+3.312
irlf+3.215
+3.194
Neg logit contributions
ucl-3.127
ACA-2.744
UL-2.597
UC-2.525
AMA-2.458
Token
Token ID3581
Feature activation-2.58e-02

Pos logit contributions
 Spo+0.186
furt+0.183
 reprodu+0.171
irlf+0.170
+0.169
Neg logit contributions
ucl-0.142
ACA-0.122
UL-0.116
UC-0.112
AMA-0.108
Token MP
Token ID4904
Feature activation+1.51e+00

Pos logit contributions
 Spo+0.062
furt+0.060
 reprodu+0.056
irlf+0.055
+0.054
Neg logit contributions
ucl-0.077
ACA-0.069
UL-0.067
UC-0.065
AMA-0.063
TokenP
Token ID47
Feature activation+1.29e+00

Pos logit contributions
ucl+4.312
ACA+4.030
UL+3.898
UC+3.828
AMA+3.732
Neg logit contributions
 Spo-3.587
furt-3.552
irlf-3.457
 reprodu-3.403
vironments-3.266
Token grade
Token ID9559
Feature activation-1.28e-01

Pos logit contributions
ucl+3.037
ACA+2.645
UL+2.529
UC+2.484
AMA+2.385
Neg logit contributions
 Spo-3.791
furt-3.751
irlf-3.524
-3.464
ptoms-3.457
Token:
Token ID25
Feature activation+9.77e-01

Pos logit contributions
 Spo+0.313
furt+0.303
 reprodu+0.283
irlf+0.282
+0.274
Neg logit contributions
ucl-0.376
ACA-0.336
UL-0.325
UC-0.318
AMA-0.308
Token C
Token ID327
Feature activation+3.10e+00

Pos logit contributions
ucl+3.151
ACA+2.866
UL+2.762
UC+2.727
AMA+2.656
Neg logit contributions
 Spo-2.041
furt-2.002
irlf-1.864
 reprodu-1.841
ptoms-1.794
Token-
Token ID12
Feature activation+1.74e+00

Pos logit contributions
ucl+8.273
ACA+7.551
UL+7.197
UC+7.185
AMA+6.870
Neg logit contributions
 Spo-7.705
furt-7.402
 reprodu-7.019
irlf-6.939
ptoms-6.762
Token\n
Token ID198
Feature activation+8.22e-01

Pos logit contributions
ucl+5.211
ACA+4.763
UL+4.568
UC+4.504
AMA+4.342
Neg logit contributions
 Spo-4.035
furt-3.862
 reprodu-3.647
irlf-3.591
-3.496
Token\n
Token ID198
Feature activation+4.72e-01

Pos logit contributions
ucl+2.354
ACA+2.119
UL+2.060
UC+2.016
AMA+1.948
Neg logit contributions
 Spo-2.054
furt-1.974
irlf-1.945
 reprodu-1.898
ptoms-1.830
TokenIn
Token ID818
Feature activation+4.28e-01

Pos logit contributions
ucl+1.176
ACA+1.042
UL+0.995
UC+0.968
AMA+0.931
Neg logit contributions
 Spo-1.378
furt-1.334
 reprodu-1.274
irlf-1.252
-1.224
Token an
Token ID281
Feature activation-2.12e-02

Pos logit contributions
ucl+1.332
ACA+1.194
UL+1.156
UC+1.130
AMA+1.098
Neg logit contributions
 Spo-0.975
furt-0.941
 reprodu-0.875
irlf-0.865
-0.850
Token also
Token ID635
Feature activation+4.13e-01

Pos logit contributions
 Spo+0.757
furt+0.735
 reprodu+0.691
irlf+0.680
+0.670
Neg logit contributions
ucl-0.748
ACA-0.661
UL-0.634
UC-0.617
AMA-0.597
Token discuss
Token ID2112
Feature activation-8.85e-02

Pos logit contributions
ucl+1.042
ACA+0.913
UL+0.876
UC+0.853
AMA+0.820
Neg logit contributions
 Spo-1.180
furt-1.154
 reprodu-1.081
irlf-1.069
-1.058
Token possible
Token ID1744
Feature activation+6.23e-01

Pos logit contributions
 Spo+0.199
furt+0.193
 reprodu+0.177
irlf+0.176
+0.172
Neg logit contributions
ucl-0.279
ACA-0.251
UL-0.243
UC-0.238
AMA-0.231
Token post
Token ID1281
Feature activation+4.20e-01

Pos logit contributions
ucl+1.815
ACA+1.622
UL+1.563
UC+1.524
AMA+1.480
Neg logit contributions
 Spo-1.531
furt-1.496
irlf-1.381
 reprodu-1.367
-1.358
Token-
Token ID12
Feature activation+1.62e+00

Pos logit contributions
ucl+1.246
ACA+1.115
UL+1.075
UC+1.049
AMA+1.018
Neg logit contributions
 Spo-1.026
furt-0.993
 reprodu-0.926
irlf-0.914
-0.893
TokenBrexit
Token ID40730
Feature activation+3.09e+00

Pos logit contributions
ucl+4.043
ACA+3.615
UL+3.427
UC+3.310
AMA+3.184
Neg logit contributions
 Spo-4.701
furt-4.447
 reprodu-4.311
irlf-4.270
vironments-4.182
Token trade
Token ID3292
Feature activation+2.44e+00

Pos logit contributions
ucl+8.014
ACA+7.011
UL+6.821
UC+6.611
AMA+6.363
Neg logit contributions
 Spo-7.784
furt-7.717
ptoms-7.378
irlf-7.254
vironments-7.103
Token ties
Token ID8470
Feature activation+1.12e+00

Pos logit contributions
ucl+7.995
ACA+7.155
UL+7.074
UC+6.838
AMA+6.637
Neg logit contributions
 Spo-4.747
furt-4.681
irlf-4.405
ptoms-4.295
-4.142
Token.
Token ID13
Feature activation+4.51e-01

Pos logit contributions
ucl+3.264
ACA+2.940
UL+2.844
UC+2.783
AMA+2.674
Neg logit contributions
 Spo-2.685
furt-2.632
irlf-2.446
 reprodu-2.411
ptoms-2.377
Token\n
Token ID198
Feature activation+7.88e-01

Pos logit contributions
ucl+1.442
ACA+1.303
UL+1.260
UC+1.233
AMA+1.200
Neg logit contributions
 Spo-0.988
furt-0.954
 reprodu-0.886
irlf-0.868
-0.849
Token\n
Token ID198
Feature activation+9.00e-01

Pos logit contributions
ucl+2.228
ACA+2.008
UL+1.955
UC+1.917
AMA+1.843
Neg logit contributions
 Spo-1.979
furt-1.900
irlf-1.884
 reprodu-1.836
ptoms-1.775
Tokenings
Token ID654
Feature activation+1.23e+00

Pos logit contributions
ucl+2.972
ACA+2.671
UL+2.607
UC+2.549
AMA+2.488
Neg logit contributions
 Spo-2.001
furt-1.936
irlf-1.760
 reprodu-1.754
-1.713
Token,
Token ID11
Feature activation+1.07e+00

Pos logit contributions
ucl+3.528
ACA+3.120
UL+3.041
UC+2.974
AMA+2.867
Neg logit contributions
 Spo-3.050
furt-2.987
irlf-2.759
 reprodu-2.744
ptoms-2.679
Token updates
Token ID5992
Feature activation+2.46e+00

Pos logit contributions
ucl+3.301
ACA+2.943
UL+2.859
UC+2.815
AMA+2.732
Neg logit contributions
 Spo-2.418
furt-2.360
irlf-2.183
 reprodu-2.130
ptoms-2.121
Token,
Token ID11
Feature activation+8.99e-01

Pos logit contributions
ucl+6.771
ACA+5.932
UL+5.823
UC+5.701
AMA+5.426
Neg logit contributions
furt-6.121
 Spo-6.080
irlf-5.547
ptoms-5.455
 reprodu-5.401
Token breaking
Token ID7163
Feature activation-3.15e-02

Pos logit contributions
ucl+2.769
ACA+2.477
UL+2.402
UC+2.363
AMA+2.290
Neg logit contributions
 Spo-2.048
furt-1.995
irlf-1.844
 reprodu-1.802
ptoms-1.794
Token news
Token ID1705
Feature activation+3.05e+00

Pos logit contributions
 Spo+0.095
furt+0.093
 reprodu+0.088
irlf+0.086
+0.085
Neg logit contributions
ucl-0.075
ACA-0.065
UL-0.062
UC-0.060
AMA-0.058
Token,
Token ID11
Feature activation+1.00e+00

Pos logit contributions
ucl+7.830
ACA+6.813
UL+6.769
UC+6.624
AMA+6.232
Neg logit contributions
 Spo-7.727
furt-7.698
irlf-7.325
ptoms-7.035
ipal-6.950
Token memories
Token ID9846
Feature activation+1.29e+00

Pos logit contributions
ucl+3.097
ACA+2.769
UL+2.694
UC+2.648
AMA+2.563
Neg logit contributions
 Spo-2.276
furt-2.227
irlf-2.051
ptoms-2.006
 reprodu-1.996
Token,
Token ID11
Feature activation+1.13e+00

Pos logit contributions
ucl+3.710
ACA+3.295
UL+3.202
UC+3.145
AMA+3.019
Neg logit contributions
 Spo-3.172
furt-3.126
irlf-2.882
 reprodu-2.834
-2.777
Token old
Token ID1468
Feature activation+8.74e-01

Pos logit contributions
ucl+3.486
ACA+3.116
UL+3.027
UC+2.976
AMA+2.887
Neg logit contributions
 Spo-2.539
furt-2.494
irlf-2.297
ptoms-2.240
 reprodu-2.229
Token photos
Token ID5205
Feature activation+7.89e-01

Pos logit contributions
ucl+2.755
ACA+2.487
UL+2.400
UC+2.357
AMA+2.290
Neg logit contributions
 Spo-1.942
furt-1.889
irlf-1.736
 reprodu-1.723
-1.682
Tokenying
Token ID1112
Feature activation+1.90e+00

Pos logit contributions
ucl+1.145
ACA+0.982
UL+0.932
UC+0.909
AMA+0.867
Neg logit contributions
 Spo-1.545
furt-1.499
 reprodu-1.422
irlf-1.410
-1.382
Token,
Token ID11
Feature activation+1.15e+00

Pos logit contributions
ucl+5.108
ACA+4.488
UL+4.355
UC+4.261
AMA+4.040
Neg logit contributions
 Spo-4.878
furt-4.864
irlf-4.533
ptoms-4.449
 reprodu-4.357
Token union
Token ID6441
Feature activation+1.29e+00

Pos logit contributions
ucl+3.230
ACA+2.873
UL+2.778
UC+2.712
AMA+2.590
Neg logit contributions
 Spo-2.869
furt-2.815
irlf-2.607
ptoms-2.586
 reprodu-2.567
Token-
Token ID12
Feature activation+1.01e+00

Pos logit contributions
ucl+3.668
ACA+3.314
UL+3.175
UC+3.108
AMA+2.983
Neg logit contributions
 Spo-3.228
furt-3.138
irlf-2.877
 reprodu-2.871
ptoms-2.849
Tokenb
Token ID65
Feature activation+1.84e+00

Pos logit contributions
ucl+2.964
ACA+2.675
UL+2.569
UC+2.521
AMA+2.424
Neg logit contributions
 Spo-2.465
furt-2.335
 reprodu-2.185
irlf-2.150
ptoms-2.125
Tokenusting
Token ID32620
Feature activation+3.05e+00

Pos logit contributions
ucl+4.129
ACA+3.567
UL+3.419
UC+3.273
AMA+3.141
Neg logit contributions
 Spo-5.775
furt-5.615
 reprodu-5.326
-5.215
irlf-5.167
Token heads
Token ID6665
Feature activation+8.35e-01

Pos logit contributions
ucl+7.873
ACA+7.006
UL+6.804
UC+6.651
rug+6.237
Neg logit contributions
furt-7.539
 Spo-7.538
irlf-7.110
ptoms-7.027
blance-6.853
Token of
Token ID286
Feature activation+3.47e-01

Pos logit contributions
ucl+2.584
ACA+2.327
UL+2.258
UC+2.210
AMA+2.149
Neg logit contributions
 Spo-1.913
furt-1.842
 reprodu-1.707
irlf-1.688
ptoms-1.641
Token a
Token ID257
Feature activation+8.71e-01

Pos logit contributions
ucl+0.971
ACA+0.864
UL+0.835
UC+0.812
AMA+0.786
Neg logit contributions
 Spo-0.897
furt-0.872
 reprodu-0.811
irlf-0.805
ptoms-0.791
Token major
Token ID1688
Feature activation+1.43e+00

Pos logit contributions
ucl+2.558
ACA+2.279
UL+2.207
UC+2.163
AMA+2.066
Neg logit contributions
 Spo-2.119
furt-2.052
irlf-1.909
 reprodu-1.886
ptoms-1.884
Token energy
Token ID2568
Feature activation+2.11e+00

Pos logit contributions
ucl+3.837
ACA+3.332
UL+3.246
UC+3.184
AMA+3.006
Neg logit contributions
 Spo-3.822
furt-3.724
irlf-3.510
ptoms-3.481
 reprodu-3.426
Token As
Token ID1081
Feature activation+1.92e-01

Pos logit contributions
ucl+0.842
ACA+0.756
UL+0.730
UC+0.717
AMA+0.694
Neg logit contributions
 Spo-0.721
furt-0.711
 reprodu-0.668
irlf-0.665
ptoms-0.647
Token Pope
Token ID13258
Feature activation-5.70e-01

Pos logit contributions
ucl+0.630
ACA+0.568
UL+0.552
UC+0.543
AMA+0.526
Neg logit contributions
 Spo-0.401
furt-0.389
 reprodu-0.355
irlf-0.351
-0.348
Token John
Token ID1757
Feature activation-2.81e-01

Pos logit contributions
 Spo+1.373
furt+1.331
 reprodu+1.241
irlf+1.216
+1.193
Neg logit contributions
ucl-1.707
ACA-1.531
UL-1.472
UC-1.439
AMA-1.396
Token XX
Token ID21044
Feature activation+7.39e-01

Pos logit contributions
 Spo+0.610
furt+0.590
 reprodu+0.543
irlf+0.537
+0.524
Neg logit contributions
ucl-0.910
ACA-0.818
UL-0.793
UC-0.779
AMA-0.758
TokenIII
Token ID10855
Feature activation+1.49e+00

Pos logit contributions
ucl+1.759
ACA+1.533
UL+1.486
UC+1.443
AMA+1.369
Neg logit contributions
 Spo-2.213
furt-2.182
 reprodu-2.038
irlf-2.032
-2.016
Token warned
Token ID7728
Feature activation+3.04e+00

Pos logit contributions
ucl+4.742
ACA+4.316
UL+4.191
UC+4.130
AMA+3.993
Neg logit contributions
 Spo-3.067
furt-3.013
irlf-2.747
ptoms-2.738
-2.695
Token,
Token ID11
Feature activation+1.16e+00

Pos logit contributions
ucl+8.452
ACA+7.533
UL+7.378
UC+7.192
AMA+6.947
Neg logit contributions
 Spo-6.972
furt-6.911
ptoms-6.550
irlf-6.410
-6.398
Token
Token ID564
Feature activation+2.09e-01

Pos logit contributions
ucl+3.693
ACA+3.315
UL+3.221
UC+3.168
AMA+3.060
Neg logit contributions
 Spo-2.487
furt-2.419
irlf-2.240
ptoms-2.206
-2.199
Token
Token ID250
Feature activation+3.16e-01

Pos logit contributions
ucl+0.643
ACA+0.577
UL+0.555
UC+0.547
AMA+0.525
Neg logit contributions
 Spo-0.490
furt-0.464
 reprodu-0.439
irlf-0.437
ptoms-0.423
TokenThe
Token ID464
Feature activation+3.01e-01

Pos logit contributions
ucl+0.923
ACA+0.825
UL+0.794
UC+0.780
AMA+0.752
Neg logit contributions
 Spo-0.789
furt-0.758
 reprodu-0.712
irlf-0.706
-0.690
Token Madonna
Token ID42345
Feature activation+3.35e-01

Pos logit contributions
ucl+0.904
ACA+0.810
UL+0.781
UC+0.766
AMA+0.740
Neg logit contributions
 Spo-0.715
furt-0.696
 reprodu-0.643
irlf-0.639
-0.628
Token foreign
Token ID3215
Feature activation+5.66e-01

Pos logit contributions
 Spo+1.187
furt+1.143
 reprodu+1.058
irlf+1.033
+1.012
Neg logit contributions
ucl-1.798
ACA-1.628
UL-1.572
UC-1.540
AMA-1.499
Token policy
Token ID2450
Feature activation+3.34e-01

Pos logit contributions
ucl+1.573
ACA+1.398
UL+1.342
UC+1.308
AMA+1.267
Neg logit contributions
 Spo-1.483
furt-1.439
 reprodu-1.350
irlf-1.326
-1.306
Token,
Token ID11
Feature activation+3.44e-01

Pos logit contributions
ucl+0.985
ACA+0.879
UL+0.849
UC+0.830
AMA+0.803
Neg logit contributions
 Spo-0.821
furt-0.795
 reprodu-0.740
irlf-0.733
-0.717
Token economics
Token ID12446
Feature activation+1.40e+00

Pos logit contributions
ucl+0.974
ACA+0.863
UL+0.830
UC+0.812
AMA+0.783
Neg logit contributions
 Spo-0.882
furt-0.860
irlf-0.795
 reprodu-0.794
-0.781
Token,
Token ID11
Feature activation+1.42e+00

Pos logit contributions
ucl+4.034
ACA+3.598
UL+3.475
UC+3.402
AMA+3.281
Neg logit contributions
 Spo-3.475
furt-3.363
 reprodu-3.127
irlf-3.122
-3.045
Token trade
Token ID3292
Feature activation+3.02e+00

Pos logit contributions
ucl+3.947
ACA+3.427
UL+3.307
UC+3.247
AMA+3.109
Neg logit contributions
 Spo-3.695
furt-3.619
irlf-3.393
-3.334
ptoms-3.324
Token and
Token ID290
Feature activation+1.33e+00

Pos logit contributions
ucl+7.709
ACA+6.634
UL+6.563
UC+6.404
AMA+5.950
Neg logit contributions
 Spo-7.929
furt-7.867
irlf-7.606
ptoms-7.282
-7.259
Token energy
Token ID2568
Feature activation+1.78e+00

Pos logit contributions
ucl+3.871
ACA+3.379
UL+3.266
UC+3.213
AMA+3.079
Neg logit contributions
 Spo-3.285
furt-3.209
irlf-3.000
-2.956
ptoms-2.943
Token security
Token ID2324
Feature activation+1.22e+00

Pos logit contributions
ucl+4.884
ACA+4.225
UL+4.190
UC+4.093
AMA+3.915
Neg logit contributions
 Spo-4.569
furt-4.484
irlf-4.298
ptoms-4.166
 reprodu-4.100
Token.
Token ID13
Feature activation+5.84e-01

Pos logit contributions
ucl+3.493
ACA+3.081
UL+3.023
UC+2.954
AMA+2.838
Neg logit contributions
 Spo-3.047
furt-2.961
irlf-2.785
 reprodu-2.729
ptoms-2.684
Token
Token ID198
Feature activation+1.61e+00

Pos logit contributions
ucl+1.843
ACA+1.662
UL+1.611
UC+1.574
AMA+1.531
Neg logit contributions
 Spo-1.302
furt-1.262
 reprodu-1.179
irlf-1.156
-1.126
Tokengirl
Token ID15219
Feature activation-4.04e-01

Pos logit contributions
 Spo+1.583
furt+1.518
 reprodu+1.405
irlf+1.378
+1.349
Neg logit contributions
ucl-2.242
ACA-2.032
UL-1.944
UC-1.906
AMA-1.853
Token,
Token ID11
Feature activation+8.94e-01

Pos logit contributions
 Spo+0.952
furt+0.911
 reprodu+0.859
irlf+0.838
+0.826
Neg logit contributions
ucl-1.233
ACA-1.104
UL-1.060
UC-1.033
AMA-1.005
Token and
Token ID290
Feature activation+3.79e-02

Pos logit contributions
ucl+2.621
ACA+2.356
UL+2.283
UC+2.227
AMA+2.161
Neg logit contributions
 Spo-2.141
furt-2.130
 reprodu-1.963
irlf-1.930
ptoms-1.917
Token D
Token ID360
Feature activation+7.71e-01

Pos logit contributions
ucl+0.116
ACA+0.104
UL+0.101
UC+0.098
AMA+0.096
Neg logit contributions
 Spo-0.088
furt-0.086
 reprodu-0.080
irlf-0.078
-0.077
Token.
Token ID13
Feature activation+2.07e+00

Pos logit contributions
ucl+1.668
ACA+1.420
UL+1.346
UC+1.300
AMA+1.244
Neg logit contributions
 Spo-2.500
furt-2.440
 reprodu-2.314
irlf-2.295
-2.254
TokenC
Token ID34
Feature activation+3.00e+00

Pos logit contributions
ucl+5.256
ACA+4.723
UL+4.588
UC+4.464
AMA+4.238
Neg logit contributions
furt-5.646
 Spo-5.537
 reprodu-5.288
irlf-5.283
ptoms-5.164
Token
Token ID447
Feature activation+4.12e-01

Pos logit contributions
ucl+7.736
ACA+6.887
UL+6.658
UC+6.537
AMA+6.238
Neg logit contributions
 Spo-7.795
furt-7.729
 reprodu-7.230
ptoms-7.060
-6.972
Token
Token ID247
Feature activation+1.93e-01

Pos logit contributions
ucl+1.118
ACA+0.992
UL+0.959
UC+0.929
AMA+0.908
Neg logit contributions
 Spo-1.124
furt-1.073
 reprodu-1.011
irlf-0.985
-0.976
Tokens
Token ID82
Feature activation+2.19e-01

Pos logit contributions
ucl+0.604
ACA+0.544
UL+0.527
UC+0.514
AMA+0.500
Neg logit contributions
 Spo-0.440
furt-0.424
 reprodu-0.394
irlf-0.386
-0.380
Token Legends
Token ID14270
Feature activation-2.79e-02

Pos logit contributions
ucl+0.673
ACA+0.608
UL+0.587
UC+0.573
AMA+0.557
Neg logit contributions
 Spo-0.505
furt-0.491
 reprodu-0.455
irlf-0.447
-0.440
Token of
Token ID286
Feature activation-4.49e-01

Pos logit contributions
 Spo+0.061
furt+0.058
 reprodu+0.054
irlf+0.053
+0.052
Neg logit contributions
ucl-0.090
ACA-0.081
UL-0.079
UC-0.077
AMA-0.075
Token greenhouse
Token ID16325
Feature activation-1.25e+00

Pos logit contributions
 Spo+0.988
furt+0.958
 reprodu+0.886
irlf+0.882
+0.866
Neg logit contributions
ucl-1.169
ACA-1.043
UL-1.005
UC-0.982
AMA-0.950
Token gas
Token ID3623
Feature activation+7.69e-01

Pos logit contributions
 Spo+2.609
furt+2.524
 reprodu+2.372
+2.262
irlf+2.239
Neg logit contributions
ucl-4.072
ACA-3.736
UL-3.592
UC-3.525
AMA-3.447
Token emissions
Token ID8971
Feature activation+8.57e-01

Pos logit contributions
ucl+1.934
ACA+1.643
UL+1.611
UC+1.551
AMA+1.480
Neg logit contributions
 Spo-2.217
furt-2.169
irlf-2.064
ipal-2.003
 reprodu-1.988
Token to
Token ID284
Feature activation-2.22e-02

Pos logit contributions
ucl+2.690
ACA+2.412
UL+2.343
UC+2.301
AMA+2.223
Neg logit contributions
 Spo-1.917
furt-1.869
irlf-1.735
 reprodu-1.690
ptoms-1.646
Token 2
Token ID362
Feature activation+1.42e-01

Pos logit contributions
 Spo+0.046
furt+0.044
irlf+0.040
 reprodu+0.040
+0.039
Neg logit contributions
ucl-0.075
ACA-0.067
UL-0.065
UC-0.064
AMA-0.062
TokenC
Token ID34
Feature activation+2.99e+00

Pos logit contributions
ucl+0.376
ACA+0.331
UL+0.318
UC+0.311
AMA+0.298
Neg logit contributions
 Spo-0.393
furt-0.384
 reprodu-0.358
irlf-0.355
-0.349
Token from
Token ID422
Feature activation+1.08e+00

Pos logit contributions
ucl+8.542
ACA+7.594
UL+7.437
UC+7.371
AMA+6.888
Neg logit contributions
furt-6.986
 Spo-6.946
irlf-6.388
ipal-6.230
 reprodu-6.062
Token the
Token ID262
Feature activation+1.87e-01

Pos logit contributions
ucl+3.390
ACA+3.023
UL+3.000
UC+2.916
AMA+2.819
Neg logit contributions
 Spo-2.358
furt-2.326
irlf-2.119
-2.095
 reprodu-2.041
Token pre
Token ID662
Feature activation+2.15e-01

Pos logit contributions
ucl+0.593
ACA+0.526
UL+0.517
UC+0.507
AMA+0.485
Neg logit contributions
 Spo-0.417
furt-0.407
irlf-0.374
 reprodu-0.367
-0.366
Token-
Token ID12
Feature activation+8.84e-01

Pos logit contributions
ucl+0.636
ACA+0.569
UL+0.547
UC+0.536
AMA+0.519
Neg logit contributions
 Spo-0.524
furt-0.507
 reprodu-0.473
irlf-0.466
-0.458
Tokenindustrial
Token ID31130
Feature activation+6.35e-01

Pos logit contributions
ucl+2.266
ACA+1.995
UL+1.895
UC+1.868
AMA+1.768
Neg logit contributions
 Spo-2.547
furt-2.427
 reprodu-2.320
irlf-2.319
-2.247
Token bundled
Token ID34358
Feature activation+1.13e+00

Pos logit contributions
ucl+3.076
ACA+2.772
UL+2.699
UC+2.633
AMA+2.543
Neg logit contributions
 Spo-2.053
furt-2.018
-1.798
irlf-1.788
 reprodu-1.788
Token into
Token ID656
Feature activation-7.83e-03

Pos logit contributions
ucl+3.573
ACA+3.203
UL+3.126
UC+3.046
AMA+2.958
Neg logit contributions
 Spo-2.471
furt-2.452
irlf-2.213
ptoms-2.204
-2.198
Token a
Token ID257
Feature activation-3.17e-03

Pos logit contributions
 Spo+0.016
furt+0.015
 reprodu+0.014
irlf+0.014
+0.013
Neg logit contributions
ucl-0.027
ACA-0.024
UL-0.023
UC-0.023
AMA-0.022
Token carry
Token ID3283
Feature activation+1.59e+00

Pos logit contributions
 Spo+0.008
furt+0.007
 reprodu+0.007
irlf+0.007
+0.007
Neg logit contributions
ucl-0.010
ACA-0.009
UL-0.008
UC-0.008
AMA-0.008
Token c
Token ID269
Feature activation+1.89e+00

Pos logit contributions
ucl+4.618
ACA+4.157
UL+3.962
UC+3.869
AMA+3.765
Neg logit contributions
 Spo-3.905
furt-3.784
 reprodu-3.471
ptoms-3.453
-3.452
Tokenot
Token ID313
Feature activation+2.98e+00

Pos logit contributions
ucl+4.765
ACA+4.175
UL+3.950
UC+3.821
AMA+3.767
Neg logit contributions
 Spo-5.440
furt-5.255
 reprodu-5.018
-4.894
ptoms-4.826
Token and
Token ID290
Feature activation+6.82e-01

Pos logit contributions
ucl+8.759
ACA+7.810
UL+7.565
UC+7.416
AMA+7.162
Neg logit contributions
 Spo-6.739
furt-6.424
ptoms-5.894
 reprodu-5.844
irlf-5.803
Token on
Token ID319
Feature activation-5.15e-01

Pos logit contributions
ucl+2.203
ACA+1.990
UL+1.922
UC+1.882
AMA+1.829
Neg logit contributions
 Spo-1.456
furt-1.426
irlf-1.288
 reprodu-1.281
-1.272
Token and
Token ID290
Feature activation+5.37e-01

Pos logit contributions
 Spo+1.087
furt+1.045
 reprodu+0.967
irlf+0.945
+0.924
Neg logit contributions
ucl-1.691
ACA-1.533
UL-1.481
UC-1.451
AMA-1.417
Token off
Token ID572
Feature activation+3.50e-01

Pos logit contributions
ucl+1.748
ACA+1.586
UL+1.530
UC+1.496
AMA+1.452
Neg logit contributions
 Spo-1.146
furt-1.110
 reprodu-1.013
irlf-0.997
-0.982
Token various
Token ID2972
Feature activation+5.48e-01

Pos logit contributions
ucl+1.134
ACA+1.024
UL+0.994
UC+0.972
AMA+0.940
Neg logit contributions
 Spo-0.760
furt-0.737
 reprodu-0.671
irlf-0.661
-0.655
Token the
Token ID262
Feature activation-8.41e-02

Pos logit contributions
 Spo+0.397
furt+0.383
 reprodu+0.353
irlf+0.345
+0.338
Neg logit contributions
ucl-0.643
ACA-0.583
UL-0.564
UC-0.553
AMA-0.539
Token Donald
Token ID3759
Feature activation-1.09e+00

Pos logit contributions
 Spo+0.185
furt+0.179
 reprodu+0.166
irlf+0.162
+0.159
Neg logit contributions
ucl-0.268
ACA-0.243
UL-0.235
UC-0.230
AMA-0.223
Token P
Token ID350
Feature activation+6.62e-01

Pos logit contributions
 Spo+2.445
furt+2.338
 reprodu+2.160
irlf+2.106
+2.079
Neg logit contributions
ucl-3.433
ACA-3.098
UL-2.987
UC-2.922
AMA-2.833
Tokenritz
Token ID29574
Feature activation-1.52e-01

Pos logit contributions
ucl+1.873
ACA+1.668
UL+1.605
UC+1.566
AMA+1.514
Neg logit contributions
 Spo-1.707
furt-1.655
 reprodu-1.552
irlf-1.523
-1.502
Tokenker
Token ID6122
Feature activation-8.52e-01

Pos logit contributions
 Spo+0.298
furt+0.286
 reprodu+0.262
irlf+0.256
+0.250
Neg logit contributions
ucl-0.520
ACA-0.474
UL-0.459
UC-0.449
AMA-0.439
Token Entreprene
Token ID42071
Feature activation-3.53e+00

Pos logit contributions
 Spo+1.799
furt+1.717
 reprodu+1.591
irlf+1.554
+1.528
Neg logit contributions
ucl-2.807
ACA-2.539
UL-2.448
UC-2.397
AMA-2.339
Tokenur
Token ID333
Feature activation-1.84e+00

Pos logit contributions
 Spo+7.718
furt+7.400
 reprodu+6.780
irlf+6.675
ptoms+6.640
Neg logit contributions
ucl-10.372
ACA-9.409
UL-8.949
UC-8.586
AMA-8.527
Tokenial
Token ID498
Feature activation-8.83e-01

Pos logit contributions
 Spo+3.991
furt+3.809
 reprodu+3.513
irlf+3.451
+3.400
Neg logit contributions
ucl-5.874
ACA-5.311
UL-5.098
UC-4.967
AMA-4.856
Token Law
Token ID3854
Feature activation-8.97e-02

Pos logit contributions
 Spo+1.830
furt+1.744
 reprodu+1.614
irlf+1.576
+1.547
Neg logit contributions
ucl-2.945
ACA-2.669
UL-2.579
UC-2.523
AMA-2.462
Token Center
Token ID3337
Feature activation-7.45e-01

Pos logit contributions
 Spo+0.176
furt+0.170
 reprodu+0.155
irlf+0.152
+0.148
Neg logit contributions
ucl-0.308
ACA-0.281
UL-0.272
UC-0.267
AMA-0.260
Token
Token ID960
Feature activation-1.22e+00

Pos logit contributions
 Spo+1.740
furt+1.676
 reprodu+1.562
irlf+1.532
+1.508
Neg logit contributions
ucl-2.281
ACA-2.045
UL-1.967
UC-1.921
AMA-1.875
TokenSM
Token ID12310
Feature activation-3.03e-01

Pos logit contributions
ucl+0.652
ACA+0.556
UL+0.527
UC+0.508
AMA+0.485
Neg logit contributions
 Spo-1.044
furt-1.020
 reprodu-0.971
irlf-0.959
-0.945
TokenISSION
Token ID40373
Feature activation-5.79e-01

Pos logit contributions
 Spo+0.578
furt+0.556
 reprodu+0.508
irlf+0.489
+0.488
Neg logit contributions
ucl-1.051
ACA-0.954
UL-0.916
UC-0.899
AMA-0.882
Token ///
Token ID34013
Feature activation-1.01e-01

Pos logit contributions
 Spo+1.372
furt+1.323
 reprodu+1.233
irlf+1.207
+1.194
Neg logit contributions
ucl-1.754
ACA-1.569
UL-1.507
UC-1.476
AMA-1.433
Token----------------------------------------------------------------
Token ID10097
Feature activation-1.02e+00

Pos logit contributions
 Spo+0.231
furt+0.223
 reprodu+0.207
irlf+0.204
+0.199
Neg logit contributions
ucl-0.314
ACA-0.283
UL-0.273
UC-0.267
AMA-0.260
Token----------------------------------------------------------------
Token ID10097
Feature activation-1.38e+00

Pos logit contributions
 Spo+2.377
furt+2.289
 reprodu+2.129
irlf+2.082
+2.071
Neg logit contributions
ucl-3.121
ACA-2.802
UL-2.695
UC-2.638
AMA-2.566
Token------------------------
Token ID22369
Feature activation-3.18e+00

Pos logit contributions
 Spo+3.195
furt+3.081
 reprodu+2.849
+2.792
irlf+2.786
Neg logit contributions
ucl-4.210
ACA-3.771
UL-3.623
UC-3.553
AMA-3.456
Token--
Token ID438
Feature activation-2.58e+00

Pos logit contributions
 Spo+7.323
furt+6.964
+6.404
 reprodu+6.377
ptoms+6.291
Neg logit contributions
ucl-9.207
ACA-8.028
UL-7.657
UC-7.506
AMA-7.349
Token///
Token ID20379
Feature activation-8.87e-01

Pos logit contributions
 Spo+6.654
furt+6.382
 reprodu+5.930
+5.917
irlf+5.813
Neg logit contributions
ucl-6.920
ACA-6.019
UL-5.682
UC-5.573
AMA-5.424
Token L
Token ID406
Feature activation+1.88e-02

Pos logit contributions
 Spo+2.081
furt+1.998
 reprodu+1.857
irlf+1.821
+1.808
Neg logit contributions
ucl-2.709
ACA-2.422
UL-2.330
UC-2.281
AMA-2.217
Tokenett
Token ID3087
Feature activation-4.44e-01

Pos logit contributions
ucl+0.043
ACA+0.037
UL+0.035
UC+0.034
AMA+0.033
Neg logit contributions
 Spo-0.058
furt-0.057
 reprodu-0.054
irlf-0.053
-0.052
Tokenre
Token ID260
Feature activation-3.04e-02

Pos logit contributions
 Spo+1.570
furt+1.534
 reprodu+1.462
irlf+1.446
+1.427
Neg logit contributions
ucl-0.826
ACA-0.690
UL-0.645
UC-0.617
AMA-0.588
Token10
Token ID940
Feature activation-5.79e-01

Pos logit contributions
 Spo+0.185
furt+0.178
 reprodu+0.165
irlf+0.163
+0.159
Neg logit contributions
ucl-0.232
ACA-0.205
UL-0.200
UC-0.196
AMA-0.189
Token.
Token ID13
Feature activation-1.28e-01

Pos logit contributions
 Spo+1.476
furt+1.432
 reprodu+1.339
irlf+1.326
+1.289
Neg logit contributions
ucl-1.664
ACA-1.485
UL-1.433
UC-1.401
AMA-1.356
Token13
Token ID1485
Feature activation-1.43e+00

Pos logit contributions
 Spo+0.321
furt+0.313
 reprodu+0.291
irlf+0.290
+0.281
Neg logit contributions
ucl-0.371
ACA-0.329
UL-0.318
UC-0.309
AMA-0.304
Token71
Token ID4869
Feature activation-8.77e-01

Pos logit contributions
 Spo+3.831
furt+3.699
 reprodu+3.491
irlf+3.420
+3.378
Neg logit contributions
ucl-3.888
ACA-3.445
UL-3.308
UC-3.221
AMA-3.119
Token/
Token ID14
Feature activation+2.74e-01

Pos logit contributions
 Spo+2.057
furt+1.987
 reprodu+1.850
irlf+1.815
+1.783
Neg logit contributions
ucl-2.683
ACA-2.411
UL-2.324
UC-2.273
AMA-2.209
Tokenjournal
Token ID24891
Feature activation-3.13e+00

Pos logit contributions
ucl+0.855
ACA+0.770
UL+0.741
UC+0.739
AMA+0.711
Neg logit contributions
 Spo-0.623
furt-0.621
irlf-0.555
 reprodu-0.553
-0.540
Token.
Token ID13
Feature activation-2.30e-01

Pos logit contributions
 Spo+6.199
furt+6.174
 reprodu+5.434
+5.379
ptoms+5.305
Neg logit contributions
ucl-9.931
ACA-9.058
UL-8.599
elaide-8.421
UC-8.404
Tokenp
Token ID79
Feature activation+4.80e-01

Pos logit contributions
 Spo+0.532
furt+0.511
irlf+0.471
 reprodu+0.471
+0.457
Neg logit contributions
ucl-0.722
ACA-0.641
UL-0.622
UC-0.612
AMA-0.592
Tokenone
Token ID505
Feature activation-5.95e-01

Pos logit contributions
ucl+1.180
ACA+1.013
UL+0.954
UC+0.944
AMA+0.910
Neg logit contributions
 Spo-1.425
furt-1.384
 reprodu-1.313
-1.296
irlf-1.279
Token.
Token ID13
Feature activation+2.35e-01

Pos logit contributions
 Spo+1.489
furt+1.439
 reprodu+1.350
irlf+1.328
+1.301
Neg logit contributions
ucl-1.730
ACA-1.545
UL-1.487
UC-1.454
AMA-1.409
Token00
Token ID405
Feature activation+5.17e-01

Pos logit contributions
ucl+0.766
ACA+0.691
UL+0.669
UC+0.659
AMA+0.637
Neg logit contributions
 Spo-0.512
furt-0.490
 reprodu-0.456
irlf-0.446
-0.434
Token he
Token ID339
Feature activation-2.25e-01

Pos logit contributions
 Spo+1.234
furt+1.182
 reprodu+1.089
irlf+1.064
+1.042
Neg logit contributions
ucl-2.050
ACA-1.855
UL-1.796
UC-1.762
AMA-1.717
Token said
Token ID531
Feature activation-6.63e-02

Pos logit contributions
 Spo+0.467
furt+0.449
 reprodu+0.414
irlf+0.405
+0.397
Neg logit contributions
ucl-0.746
ACA-0.677
UL-0.655
UC-0.641
AMA-0.625
Token.
Token ID13
Feature activation-3.70e-01

Pos logit contributions
 Spo+0.160
furt+0.155
 reprodu+0.145
irlf+0.141
+0.139
Neg logit contributions
ucl-0.198
ACA-0.178
UL-0.171
UC-0.167
AMA-0.162
Token<|endoftext|>
Token ID50256
Feature activation+9.21e-01

Pos logit contributions
 Spo+0.804
furt+0.775
 reprodu+0.717
irlf+0.702
+0.689
Neg logit contributions
ucl-1.194
ACA-1.079
UL-1.043
UC-1.021
AMA-0.994
TokenMeg
Token ID42672
Feature activation-3.87e-01

Pos logit contributions
ucl+2.609
ACA+2.351
UL+2.267
UC+2.219
AMA+2.138
Neg logit contributions
 Spo-2.379
furt-2.278
 reprodu-2.166
irlf-2.134
ptoms-2.056
Tokenach
Token ID620
Feature activation-3.13e+00

Pos logit contributions
 Spo+0.982
furt+0.949
 reprodu+0.887
irlf+0.872
+0.857
Neg logit contributions
ucl-1.108
ACA-0.990
UL-0.951
UC-0.929
AMA-0.899
Tokenurch
Token ID2575
Feature activation-9.76e-01

Pos logit contributions
 Spo+7.664
furt+7.446
 reprodu+7.047
irlf+6.897
ipal+6.891
Neg logit contributions
ucl-8.348
ACA-7.548
UL-7.133
UC-6.901
ORPG-6.729
Token founder
Token ID9119
Feature activation-1.93e-01

Pos logit contributions
 Spo+2.249
furt+2.148
 reprodu+2.017
irlf+1.959
+1.926
Neg logit contributions
ucl-3.031
ACA-2.726
UL-2.628
UC-2.569
AMA-2.492
Token Rev
Token ID5416
Feature activation+8.61e-02

Pos logit contributions
 Spo+0.439
furt+0.427
 reprodu+0.394
irlf+0.390
+0.381
Neg logit contributions
ucl-0.601
ACA-0.541
UL-0.524
UC-0.513
AMA-0.498
Token.
Token ID13
Feature activation-1.16e-02

Pos logit contributions
ucl+0.262
ACA+0.235
UL+0.226
UC+0.222
AMA+0.215
Neg logit contributions
 Spo-0.203
furt-0.197
 reprodu-0.183
irlf-0.180
-0.177
Token Ted
Token ID11396
Feature activation+4.22e-01

Pos logit contributions
 Spo+0.026
furt+0.026
 reprodu+0.024
irlf+0.024
+0.023
Neg logit contributions
ucl-0.036
ACA-0.032
UL-0.031
UC-0.031
AMA-0.030
Token).
Token ID737
Feature activation-7.16e-01

Pos logit contributions
 Spo+2.340
furt+2.220
 reprodu+2.097
irlf+1.975
+1.968
Neg logit contributions
ucl-3.997
ACA-3.636
UL-3.502
UC-3.412
AMA-3.357
Token
Token ID198
Feature activation-7.20e-01

Pos logit contributions
 Spo+1.549
furt+1.497
 reprodu+1.381
irlf+1.358
+1.327
Neg logit contributions
ucl-2.320
ACA-2.099
UL-2.029
UC-1.989
AMA-1.936
Token
Token ID198
Feature activation-5.12e-01

Pos logit contributions
 Spo+1.587
furt+1.530
 reprodu+1.419
irlf+1.397
+1.363
Neg logit contributions
ucl-2.304
ACA-2.082
UL-2.012
UC-1.970
AMA-1.917
TokenS
Token ID50
Feature activation-4.19e-01

Pos logit contributions
 Spo+1.189
furt+1.150
 reprodu+1.069
irlf+1.049
+1.031
Neg logit contributions
ucl-1.575
ACA-1.416
UL-1.365
UC-1.335
AMA-1.298
Tokenensitivity
Token ID40545
Feature activation-1.54e+00

Pos logit contributions
 Spo+1.156
furt+1.123
 reprodu+1.058
irlf+1.041
+1.025
Neg logit contributions
ucl-1.107
ACA-0.977
UL-0.935
UC-0.911
AMA-0.880
Token analysis
Token ID3781
Feature activation-3.11e+00

Pos logit contributions
 Spo+3.741
furt+3.602
 reprodu+3.412
irlf+3.292
+3.282
Neg logit contributions
ucl-4.547
ACA-4.068
UL-3.896
UC-3.817
AMA-3.709
Token Sens
Token ID14173
Feature activation-5.88e-01

Pos logit contributions
 Spo+6.133
furt+5.689
 reprodu+5.580
+5.092
irlf+5.034
Neg logit contributions
ucl-10.204
ACA-9.277
UL-8.860
UC-8.716
AMA-8.518
Tokenitivity
Token ID11365
Feature activation-8.97e-01

Pos logit contributions
 Spo+1.000
furt+0.950
 reprodu+0.854
irlf+0.837
+0.814
Neg logit contributions
ucl-2.183
ACA-1.997
UL-1.943
UC-1.912
AMA-1.862
Token analysis
Token ID3781
Feature activation-2.55e+00

Pos logit contributions
 Spo+2.266
furt+2.194
 reprodu+2.068
irlf+2.013
+1.992
Neg logit contributions
ucl-2.565
ACA-2.289
UL-2.197
UC-2.147
AMA-2.084
Token will
Token ID481
Feature activation-1.18e-01

Pos logit contributions
 Spo+4.867
furt+4.550
 reprodu+4.403
irlf+4.039
+4.005
Neg logit contributions
ucl-8.661
ACA-7.945
UL-7.597
UC-7.470
AMA-7.311
Token be
Token ID307
Feature activation-2.37e-01

Pos logit contributions
 Spo+0.275
furt+0.271
irlf+0.246
 reprodu+0.241
+0.240
Neg logit contributions
ucl-0.361
ACA-0.324
UL-0.315
UC-0.308
AMA-0.295
Token School
Token ID3961
Feature activation-5.27e-01

Pos logit contributions
ucl+1.375
ACA+1.243
UL+1.203
UC+1.179
AMA+1.144
Neg logit contributions
 Spo-0.977
furt-0.959
 reprodu-0.889
irlf-0.872
ptoms-0.853
Token officials
Token ID2828
Feature activation+4.88e-01

Pos logit contributions
 Spo+1.331
furt+1.291
 reprodu+1.206
irlf+1.186
+1.168
Neg logit contributions
ucl-1.518
ACA-1.355
UL-1.303
UC-1.273
AMA-1.234
Token announced
Token ID3414
Feature activation+4.27e-01

Pos logit contributions
ucl+1.508
ACA+1.347
UL+1.307
UC+1.280
AMA+1.241
Neg logit contributions
 Spo-1.115
furt-1.090
 reprodu-0.997
irlf-0.994
-0.974
Token that
Token ID326
Feature activation+1.65e-02

Pos logit contributions
ucl+1.393
ACA+1.255
UL+1.226
UC+1.194
AMA+1.163
Neg logit contributions
 Spo-0.902
furt-0.881
 reprodu-0.807
irlf-0.802
-0.786
Token the
Token ID262
Feature activation-4.63e-01

Pos logit contributions
ucl+0.055
ACA+0.049
UL+0.048
UC+0.047
AMA+0.046
Neg logit contributions
 Spo-0.034
furt-0.033
 reprodu-0.030
irlf-0.030
-0.029
Token Entreprene
Token ID42071
Feature activation-3.09e+00

Pos logit contributions
 Spo+1.153
furt+1.117
 reprodu+1.044
irlf+1.026
+1.010
Neg logit contributions
ucl-1.349
ACA-1.204
UL-1.159
UC-1.132
AMA-1.098
Tokenurs
Token ID1834
Feature activation-1.14e+00

Pos logit contributions
 Spo+6.886
furt+6.634
 reprodu+6.059
irlf+5.997
ipal+5.899
Neg logit contributions
ucl-9.208
ACA-8.221
UL-7.906
UC-7.655
AMA-7.460
Tokenhip
Token ID1056
Feature activation-7.76e-01

Pos logit contributions
 Spo+2.633
furt+2.526
 reprodu+2.351
irlf+2.296
+2.270
Neg logit contributions
ucl-3.519
ACA-3.148
UL-3.030
UC-2.962
AMA-2.874
Token Law
Token ID3854
Feature activation+3.20e-01

Pos logit contributions
 Spo+1.658
furt+1.584
 reprodu+1.469
irlf+1.434
+1.410
Neg logit contributions
ucl-2.539
ACA-2.290
UL-2.211
UC-2.163
AMA-2.109
Token Center
Token ID3337
Feature activation-1.96e-01

Pos logit contributions
ucl+1.037
ACA+0.944
UL+0.912
UC+0.894
AMA+0.867
Neg logit contributions
 Spo-0.684
furt-0.665
 reprodu-0.610
irlf-0.603
-0.588
Token will
Token ID481
Feature activation+7.95e-02

Pos logit contributions
 Spo+0.411
furt+0.397
 reprodu+0.365
irlf+0.358
+0.350
Neg logit contributions
ucl-0.645
ACA-0.585
UL-0.566
UC-0.555
AMA-0.540
Token the
Token ID262
Feature activation+2.43e-01

Pos logit contributions
ucl+1.674
ACA+1.522
UL+1.476
UC+1.448
AMA+1.413
Neg logit contributions
 Spo-0.987
furt-0.964
 reprodu-0.878
irlf-0.862
-0.846
Token Johnson
Token ID5030
Feature activation-4.86e-01

Pos logit contributions
ucl+0.681
ACA+0.607
UL+0.585
UC+0.571
AMA+0.552
Neg logit contributions
 Spo-0.622
furt-0.611
 reprodu-0.567
irlf-0.561
-0.552
Token County
Token ID3418
Feature activation+8.73e-01

Pos logit contributions
 Spo+1.014
furt+0.975
 reprodu+0.899
irlf+0.880
+0.862
Neg logit contributions
ucl-1.610
ACA-1.458
UL-1.410
UC-1.382
AMA-1.346
Token Republicans
Token ID4734
Feature activation-1.81e-01

Pos logit contributions
ucl+2.580
ACA+2.332
UL+2.260
UC+2.208
AMA+2.147
Neg logit contributions
 Spo-2.067
furt-2.056
irlf-1.878
 reprodu-1.877
ptoms-1.848
Token Sp
Token ID1338
Feature activation-1.24e+00

Pos logit contributions
 Spo+0.476
furt+0.462
 reprodu+0.433
irlf+0.426
+0.419
Neg logit contributions
ucl-0.502
ACA-0.446
UL-0.429
UC-0.418
AMA-0.405
Tokenaghetti
Token ID35812
Feature activation-3.09e+00

Pos logit contributions
 Spo+3.422
furt+3.317
 reprodu+3.117
irlf+3.102
+3.031
Neg logit contributions
ucl-3.196
ACA-2.807
UL-2.699
UC-2.639
AMA-2.517
Token Dinner
Token ID34584
Feature activation-1.20e+00

Pos logit contributions
 Spo+6.817
furt+6.266
 reprodu+5.961
+5.578
 Portable+5.513
Neg logit contributions
ucl-9.602
ACA-8.604
UL-8.073
UC-7.950
elaide-7.833
Token at
Token ID379
Feature activation+6.17e-01

Pos logit contributions
 Spo+2.434
furt+2.318
 reprodu+2.122
irlf+2.035
+2.020
Neg logit contributions
ucl-4.070
ACA-3.667
UL-3.541
UC-3.471
AMA-3.380
Token Clear
Token ID11459
Feature activation-8.12e-02

Pos logit contributions
ucl+1.982
ACA+1.792
UL+1.735
UC+1.703
AMA+1.661
Neg logit contributions
 Spo-1.311
furt-1.297
 reprodu-1.190
irlf-1.170
ptoms-1.157
Token Creek
Token ID13509
Feature activation+9.19e-01

Pos logit contributions
 Spo+0.179
furt+0.173
 reprodu+0.160
irlf+0.157
+0.154
Neg logit contributions
ucl-0.259
ACA-0.234
UL-0.226
UC-0.222
AMA-0.215
Token Aman
Token ID42614
Feature activation+1.04e-01

Pos logit contributions
ucl+2.891
ACA+2.646
UL+2.571
UC+2.529
AMA+2.466
Neg logit contributions
 Spo-1.954
furt-1.936
 reprodu-1.784
irlf-1.779
ptoms-1.749
Token Kodi
Token ID44129
Feature activation+1.64e+00

Pos logit contributions
ucl+0.521
ACA+0.468
UL+0.452
UC+0.442
AMA+0.429
Neg logit contributions
 Spo-0.415
furt-0.405
 reprodu-0.373
irlf-0.369
-0.361
Token builds
Token ID12188
Feature activation-2.13e-01

Pos logit contributions
ucl+4.966
ACA+4.436
UL+4.354
UC+4.264
AMA+4.118
Neg logit contributions
 Spo-3.670
furt-3.656
irlf-3.326
 reprodu-3.253
-3.229
Token.
Token ID13
Feature activation+2.96e-01

Pos logit contributions
 Spo+0.513
furt+0.497
 reprodu+0.462
irlf+0.455
+0.447
Neg logit contributions
ucl-0.635
ACA-0.570
UL-0.549
UC-0.537
AMA-0.521
Token Follow
Token ID7281
Feature activation-1.06e+00

Pos logit contributions
ucl+0.905
ACA+0.815
UL+0.791
UC+0.774
AMA+0.749
Neg logit contributions
 Spo-0.678
furt-0.670
 reprodu-0.618
irlf-0.611
ptoms-0.596
Token these
Token ID777
Feature activation-1.32e+00

Pos logit contributions
 Spo+1.982
furt+1.886
 reprodu+1.740
irlf+1.679
+1.655
Neg logit contributions
ucl-3.733
ACA-3.408
UL-3.297
UC-3.238
AMA-3.161
Token steps
Token ID4831
Feature activation-3.08e+00

Pos logit contributions
 Spo+2.285
furt+2.165
 reprodu+2.012
+1.887
irlf+1.884
Neg logit contributions
ucl-4.825
ACA-4.415
UL-4.264
UC-4.194
AMA-4.116
Token to
Token ID284
Feature activation-8.77e-01

Pos logit contributions
 Spo+5.764
furt+5.403
 reprodu+5.340
+4.972
irlf+4.879
Neg logit contributions
ucl-10.286
ACA-9.325
UL-8.839
UC-8.664
ORPG-8.526
Token install
Token ID2721
Feature activation-5.22e-01

Pos logit contributions
 Spo+1.855
furt+1.767
 reprodu+1.659
irlf+1.607
+1.580
Neg logit contributions
ucl-2.887
ACA-2.608
UL-2.516
UC-2.467
AMA-2.404
Token Net
Token ID3433
Feature activation+8.69e-01

Pos logit contributions
 Spo+1.120
furt+1.074
 reprodu+0.996
irlf+0.975
+0.957
Neg logit contributions
ucl-1.702
ACA-1.542
UL-1.487
UC-1.458
AMA-1.420
Tokenfl
Token ID2704
Feature activation+4.95e-01

Pos logit contributions
ucl+2.352
ACA+2.061
UL+2.003
UC+1.941
AMA+1.863
Neg logit contributions
 Spo-2.334
furt-2.264
 reprodu-2.145
irlf-2.091
-2.072
Tokeni
Token ID72
Feature activation+2.83e-01

Pos logit contributions
ucl+1.463
ACA+1.270
UL+1.259
UC+1.230
AMA+1.172
Neg logit contributions
 Spo-1.234
furt-1.173
 reprodu-1.110
irlf-1.089
-1.072
Token Stephen
Token ID7970
Feature activation-1.42e+00

Pos logit contributions
ucl+0.803
ACA+0.721
UL+0.699
UC+0.685
AMA+0.661
Neg logit contributions
 Spo-0.589
furt-0.580
 reprodu-0.535
irlf-0.529
-0.519
Token P
Token ID350
Feature activation+1.47e+00

Pos logit contributions
 Spo+3.141
furt+2.937
 reprodu+2.756
irlf+2.632
ipal+2.584
Neg logit contributions
ucl-4.496
ACA-4.158
UL-3.969
UC-3.877
AMA-3.780
Tokenaea
Token ID44705
Feature activation+3.48e-01

Pos logit contributions
ucl+3.675
ACA+3.147
UL+3.050
UC+3.001
AMA+2.831
Neg logit contributions
 Spo-4.249
furt-4.164
 reprodu-3.916
-3.850
irlf-3.755
Token,
Token ID11
Feature activation+8.93e-01

Pos logit contributions
ucl+1.025
ACA+0.916
UL+0.885
UC+0.865
AMA+0.836
Neg logit contributions
 Spo-0.854
furt-0.827
 reprodu-0.771
irlf-0.762
-0.745
Token Ricky
Token ID31668
Feature activation+3.07e-01

Pos logit contributions
ucl+2.736
ACA+2.443
UL+2.381
UC+2.335
AMA+2.246
Neg logit contributions
 Spo-2.013
furt-2.000
 reprodu-1.846
irlf-1.829
-1.792
Token Jean
Token ID11320
Feature activation-3.08e+00

Pos logit contributions
ucl+0.973
ACA+0.877
UL+0.847
UC+0.831
AMA+0.801
Neg logit contributions
 Spo-0.675
furt-0.660
 reprodu-0.612
irlf-0.606
-0.589
Token Francois
Token ID40419
Feature activation-1.03e+00

Pos logit contributions
 Spo+6.055
furt+5.851
 reprodu+5.258
+5.239
ipal+5.092
Neg logit contributions
ucl-10.053
ACA-9.001
UL-8.693
UC-8.458
AMA-8.404
Token and
Token ID290
Feature activation+2.51e-01

Pos logit contributions
 Spo+2.391
furt+2.305
 reprodu+2.150
irlf+2.080
+2.063
Neg logit contributions
ucl-3.164
ACA-2.847
UL-2.733
UC-2.672
AMA-2.612
Token Zig
Token ID24992
Feature activation+3.17e-01

Pos logit contributions
ucl+0.780
ACA+0.700
UL+0.679
UC+0.666
AMA+0.643
Neg logit contributions
 Spo-0.567
furt-0.559
 reprodu-0.514
irlf-0.508
-0.499
Tokengy
Token ID1360
Feature activation+4.10e-01

Pos logit contributions
ucl+0.760
ACA+0.665
UL+0.631
UC+0.612
AMA+0.590
Neg logit contributions
 Spo-0.956
furt-0.930
 reprodu-0.879
irlf-0.865
-0.853
Token Hood
Token ID17233
Feature activation-5.20e-01

Pos logit contributions
ucl+1.283
ACA+1.154
UL+1.117
UC+1.093
AMA+1.065
Neg logit contributions
 Spo-0.923
furt-0.898
 reprodu-0.833
irlf-0.821
-0.804
Token 22
Token ID2534
Feature activation-4.37e-01

Pos logit contributions
 Spo+1.299
furt+1.245
 reprodu+1.144
irlf+1.111
+1.089
Neg logit contributions
ucl-2.230
ACA-2.027
UL-1.959
UC-1.923
AMA-1.875
Token.
Token ID13
Feature activation-1.92e-01

Pos logit contributions
 Spo+1.081
furt+1.046
 reprodu+0.980
irlf+0.955
+0.937
Neg logit contributions
ucl-1.275
ACA-1.140
UL-1.094
UC-1.070
AMA-1.044
Token
Token ID198
Feature activation-1.36e-01

Pos logit contributions
 Spo+0.428
furt+0.414
 reprodu+0.385
irlf+0.377
+0.369
Neg logit contributions
ucl-0.609
ACA-0.550
UL-0.532
UC-0.521
AMA-0.506
Token
Token ID198
Feature activation+1.87e-01

Pos logit contributions
 Spo+0.314
furt+0.303
 reprodu+0.285
irlf+0.284
ptoms+0.272
Neg logit contributions
ucl-0.420
ACA-0.380
UL-0.368
UC-0.361
AMA-0.349
TokenS
Token ID50
Feature activation-5.48e-03

Pos logit contributions
ucl+0.523
ACA+0.466
UL+0.448
UC+0.438
AMA+0.423
Neg logit contributions
 Spo-0.486
furt-0.470
 reprodu-0.443
irlf-0.435
-0.426
Tokenovere
Token ID33518
Feature activation-3.08e+00

Pos logit contributions
 Spo+0.015
furt+0.015
 reprodu+0.014
irlf+0.014
+0.014
Neg logit contributions
ucl-0.014
ACA-0.013
UL-0.012
UC-0.012
AMA-0.011
Tokenign
Token ID570
Feature activation-1.25e+00

Pos logit contributions
 Spo+7.664
furt+7.262
 reprodu+7.017
ptoms+6.725
irlf+6.627
Neg logit contributions
ucl-8.304
ACA-7.427
UL-7.199
UC-6.941
ORPG-6.675
Token Lux
Token ID17145
Feature activation-1.47e+00

Pos logit contributions
 Spo+2.844
furt+2.714
 reprodu+2.536
irlf+2.468
+2.436
Neg logit contributions
ucl-3.879
ACA-3.469
UL-3.358
UC-3.272
AMA-3.174
Tokenury
Token ID1601
Feature activation-4.45e-01

Pos logit contributions
 Spo+3.458
furt+3.324
 reprodu+3.098
irlf+3.004
+2.969
Neg logit contributions
ucl-4.452
ACA-3.982
UL-3.840
UC-3.744
AMA-3.659
Token Travel
Token ID13524
Feature activation-1.19e+00

Pos logit contributions
 Spo+0.942
furt+0.912
 reprodu+0.840
irlf+0.826
+0.807
Neg logit contributions
ucl-1.460
ACA-1.324
UL-1.281
UC-1.256
AMA-1.222
Token (
Token ID357
Feature activation-1.27e-01

Pos logit contributions
 Spo+2.510
furt+2.377
 reprodu+2.208
irlf+2.146
+2.111
Neg logit contributions
ucl-3.935
ACA-3.545
UL-3.412
UC-3.352
AMA-3.274
Token was
Token ID373
Feature activation+3.83e-01

Pos logit contributions
ucl+0.343
ACA+0.310
UL+0.301
UC+0.296
AMA+0.287
Neg logit contributions
 Spo-0.213
furt-0.206
irlf-0.189
 reprodu-0.187
-0.182
Token awarded
Token ID11343
Feature activation+8.82e-02

Pos logit contributions
ucl+1.276
ACA+1.164
UL+1.127
UC+1.111
AMA+1.079
Neg logit contributions
 Spo-0.773
furt-0.754
irlf-0.683
 reprodu-0.673
ptoms-0.672
Token to
Token ID284
Feature activation+6.47e-01

Pos logit contributions
ucl+0.299
ACA+0.270
UL+0.263
UC+0.258
AMA+0.250
Neg logit contributions
 Spo-0.178
furt-0.173
irlf-0.156
 reprodu-0.156
ptoms-0.151
Token Ragnar
Token ID32797
Feature activation-4.32e-01

Pos logit contributions
ucl+2.002
ACA+1.792
UL+1.748
UC+1.711
AMA+1.658
Neg logit contributions
 Spo-1.462
furt-1.427
 reprodu-1.308
irlf-1.306
ptoms-1.284
Token Fr
Token ID1305
Feature activation-7.80e-01

Pos logit contributions
 Spo+0.930
furt+0.896
 reprodu+0.827
irlf+0.809
+0.794
Neg logit contributions
ucl-1.403
ACA-1.271
UL-1.227
UC-1.202
AMA-1.170
Tokenisch
Token ID25308
Feature activation-3.02e+00

Pos logit contributions
 Spo+1.928
furt+1.884
 reprodu+1.743
irlf+1.726
+1.675
Neg logit contributions
ucl-2.264
ACA-2.030
UL-1.944
UC-1.905
AMA-1.846
Token and
Token ID290
Feature activation-1.87e-01

Pos logit contributions
furt+6.475
 Spo+6.440
 reprodu+5.748
+5.557
ipal+5.552
Neg logit contributions
ucl-9.198
ACA-8.556
UL-7.936
UC-7.782
AMA-7.674
Token Jan
Token ID2365
Feature activation-9.52e-01

Pos logit contributions
 Spo+0.411
furt+0.398
 reprodu+0.366
irlf+0.362
+0.354
Neg logit contributions
ucl-0.600
ACA-0.542
UL-0.525
UC-0.513
AMA-0.499
Token Tin
Token ID22894
Feature activation+1.16e+00

Pos logit contributions
 Spo+2.121
furt+2.049
 reprodu+1.879
irlf+1.825
+1.789
Neg logit contributions
ucl-3.021
ACA-2.749
UL-2.638
UC-2.590
AMA-2.510
Tokenberg
Token ID3900
Feature activation+1.84e-01

Pos logit contributions
ucl+3.570
ACA+3.168
UL+3.087
UC+3.016
AMA+2.960
Neg logit contributions
 Spo-2.618
furt-2.479
 reprodu-2.356
irlf-2.332
-2.311
Tokenen
Token ID268
Feature activation-5.20e-01

Pos logit contributions
ucl+0.583
ACA+0.524
UL+0.508
UC+0.497
AMA+0.484
Neg logit contributions
 Spo-0.409
furt-0.392
 reprodu-0.366
irlf-0.360
-0.352
Token17
Token ID1558
Feature activation+1.74e-02

Pos logit contributions
ucl+0.257
ACA+0.231
UL+0.223
UC+0.218
AMA+0.210
Neg logit contributions
 Spo-0.230
furt-0.222
 reprodu-0.210
irlf-0.208
ptoms-0.201
Token)
Token ID8
Feature activation-3.77e-01

Pos logit contributions
ucl+0.050
ACA+0.045
UL+0.043
UC+0.042
AMA+0.041
Neg logit contributions
 Spo-0.044
furt-0.043
 reprodu-0.041
irlf-0.040
ptoms-0.039
Token<|endoftext|>
Token ID50256
Feature activation+8.31e-01

Pos logit contributions
 Spo+0.794
furt+0.758
 reprodu+0.702
irlf+0.680
+0.672
Neg logit contributions
ucl-1.240
ACA-1.125
UL-1.086
UC-1.064
AMA-1.040
TokenUnited
Token ID17013
Feature activation-4.29e-01

Pos logit contributions
ucl+2.351
ACA+2.116
UL+2.040
UC+1.996
AMA+1.924
Neg logit contributions
 Spo-2.149
furt-2.061
 reprodu-1.956
irlf-1.926
-1.859
Token States
Token ID1829
Feature activation-1.65e+00

Pos logit contributions
 Spo+0.921
furt+0.885
 reprodu+0.819
irlf+0.800
+0.786
Neg logit contributions
ucl-1.395
ACA-1.263
UL-1.220
UC-1.195
AMA-1.161
Token Supreme
Token ID5617
Feature activation-3.00e+00

Pos logit contributions
 Spo+3.607
furt+3.386
 reprodu+3.258
+3.066
irlf+3.065
Neg logit contributions
ucl-5.230
ACA-4.672
UL-4.500
UC-4.396
AMA-4.297
Token Court
Token ID3078
Feature activation-1.65e+00

Pos logit contributions
 Spo+3.960
furt+3.580
 reprodu+3.204
irlf+2.867
+2.847
Neg logit contributions
ucl-12.074
ACA-11.117
UL-10.685
UC-10.598
AMA-10.333
Token case
Token ID1339
Feature activation-1.11e+00

Pos logit contributions
 Spo+3.404
furt+3.263
 reprodu+3.056
irlf+2.926
+2.899
Neg logit contributions
ucl-5.408
ACA-4.852
UL-4.620
UC-4.593
AMA-4.434
Token
Token ID198
Feature activation-2.05e-01

Pos logit contributions
 Spo+2.422
furt+2.317
 reprodu+2.177
irlf+2.093
+2.057
Neg logit contributions
ucl-3.568
ACA-3.217
UL-3.096
UC-3.034
AMA-2.963
Token
Token ID198
Feature activation-1.50e-01

Pos logit contributions
 Spo+0.471
furt+0.452
 reprodu+0.426
irlf+0.423
ptoms+0.407
Neg logit contributions
ucl-0.638
ACA-0.581
UL-0.562
UC-0.551
AMA-0.534
TokenSanta
Token ID42694
Feature activation-1.80e-01

Pos logit contributions
 Spo+0.389
furt+0.377
 reprodu+0.353
irlf+0.347
+0.341
Neg logit contributions
ucl-0.423
ACA-0.376
UL-0.361
UC-0.353
AMA-0.342
Token[
Token ID58
Feature activation-2.85e-01

Pos logit contributions
ucl+0.033
ACA+0.029
UL+0.028
UC+0.027
AMA+0.027
Neg logit contributions
 Spo-0.028
furt-0.027
 reprodu-0.026
irlf-0.025
-0.025
TokenMath
Token ID37372
Feature activation+8.53e-02

Pos logit contributions
 Spo+0.722
furt+0.700
 reprodu+0.655
irlf+0.644
+0.634
Neg logit contributions
ucl-0.819
ACA-0.730
UL-0.702
UC-0.685
AMA-0.664
Tokenieu
Token ID22304
Feature activation-8.61e-01

Pos logit contributions
ucl+0.214
ACA+0.188
UL+0.179
UC+0.174
AMA+0.168
Neg logit contributions
 Spo-0.247
furt-0.240
 reprodu-0.227
irlf-0.223
-0.220
Token]
Token ID60
Feature activation-1.22e+00

Pos logit contributions
 Spo+1.781
furt+1.717
 reprodu+1.569
+1.513
irlf+1.508
Neg logit contributions
ucl-2.851
ACA-2.595
UL-2.491
UC-2.448
AMA-2.390
Token What
Token ID1867
Feature activation-8.64e-01

Pos logit contributions
 Spo+2.685
furt+2.543
 reprodu+2.370
irlf+2.281
+2.275
Neg logit contributions
ucl-3.885
ACA-3.517
UL-3.374
UC-3.310
AMA-3.206
Token kind
Token ID1611
Feature activation-2.99e+00

Pos logit contributions
 Spo+1.701
furt+1.621
 reprodu+1.506
irlf+1.455
+1.428
Neg logit contributions
ucl-2.961
ACA-2.699
UL-2.603
UC-2.551
AMA-2.489
Token of
Token ID286
Feature activation-2.46e+00

Pos logit contributions
 Spo+5.247
furt+5.060
 reprodu+4.728
+4.396
ptoms+4.299
Neg logit contributions
ucl-10.161
ACA-9.309
UL-8.924
UC-8.710
elaide-8.608
Token projects
Token ID4493
Feature activation-8.48e-01

Pos logit contributions
 Spo+5.672
furt+5.291
 reprodu+5.256
 exploits+4.950
+4.870
Neg logit contributions
ucl-7.449
ACA-6.616
UL-6.302
UC-6.193
AMA-6.002
Token are
Token ID389
Feature activation-9.38e-01

Pos logit contributions
 Spo+1.759
furt+1.685
 reprodu+1.562
irlf+1.519
+1.491
Neg logit contributions
ucl-2.826
ACA-2.561
UL-2.473
UC-2.423
AMA-2.362
Token done
Token ID1760
Feature activation-8.60e-01

Pos logit contributions
 Spo+1.812
furt+1.734
 reprodu+1.592
irlf+1.549
+1.517
Neg logit contributions
ucl-3.260
ACA-2.968
UL-2.874
UC-2.820
AMA-2.751
Token with
Token ID351
Feature activation-3.38e-02

Pos logit contributions
 Spo+1.760
furt+1.695
 reprodu+1.555
irlf+1.525
+1.492
Neg logit contributions
ucl-2.884
ACA-2.617
UL-2.534
UC-2.484
AMA-2.421
Token this
Token ID428
Feature activation-3.94e-01

Pos logit contributions
 Spo+0.982
furt+0.955
irlf+0.873
 reprodu+0.866
ptoms+0.848
Neg logit contributions
ucl-1.586
ACA-1.441
UL-1.399
UC-1.366
AMA-1.333
Token could
Token ID714
Feature activation+8.17e-01

Pos logit contributions
 Spo+0.917
furt+0.896
irlf+0.834
 reprodu+0.819
ptoms+0.808
Neg logit contributions
ucl-1.206
ACA-1.087
UL-1.047
UC-1.026
AMA-0.994
Token happen
Token ID1645
Feature activation-6.19e-01

Pos logit contributions
ucl+2.132
ACA+1.891
UL+1.799
UC+1.756
AMA+1.670
Neg logit contributions
 Spo-2.226
furt-2.208
irlf-2.044
 reprodu-2.004
-2.001
Token,"
Token ID553
Feature activation+2.96e-01

Pos logit contributions
 Spo+1.322
furt+1.278
 reprodu+1.170
irlf+1.156
+1.133
Neg logit contributions
ucl-2.021
ACA-1.830
UL-1.769
UC-1.733
AMA-1.686
Token said
Token ID531
Feature activation+9.12e-01

Pos logit contributions
ucl+0.960
ACA+0.869
UL+0.846
UC+0.824
AMA+0.805
Neg logit contributions
 Spo-0.623
furt-0.612
 reprodu-0.560
irlf-0.552
-0.540
Token Jean
Token ID11320
Feature activation-2.97e+00

Pos logit contributions
ucl+2.840
ACA+2.550
UL+2.504
UC+2.435
AMA+2.385
Neg logit contributions
 Spo-1.969
furt-1.957
 reprodu-1.789
irlf-1.778
ptoms-1.774
Tokenette
Token ID5857
Feature activation-1.04e-01

Pos logit contributions
 Spo+6.449
furt+6.236
 reprodu+5.513
ipal+5.204
vironments+5.178
Neg logit contributions
ucl-9.105
ACA-8.382
UL-7.993
UC-7.854
AMA-7.600
Token Sk
Token ID3661
Feature activation-1.73e-01

Pos logit contributions
 Spo+0.223
furt+0.216
 reprodu+0.200
irlf+0.197
+0.192
Neg logit contributions
ucl-0.335
ACA-0.303
UL-0.293
UC-0.287
AMA-0.279
Tokenj
Token ID73
Feature activation-1.59e+00

Pos logit contributions
 Spo+0.465
furt+0.452
 reprodu+0.424
irlf+0.418
+0.411
Neg logit contributions
ucl-0.468
ACA-0.415
UL-0.398
UC-0.388
AMA-0.375
Tokenel
Token ID417
Feature activation-1.57e+00

Pos logit contributions
 Spo+4.242
furt+4.164
irlf+3.880
 reprodu+3.809
ptoms+3.754
Neg logit contributions
ucl-4.190
ACA-3.730
UL-3.549
UC-3.460
AMA-3.366
Tokenm
Token ID76
Feature activation-7.72e-01

Pos logit contributions
 Spo+3.945
furt+3.842
 reprodu+3.556
irlf+3.475
ptoms+3.412
Neg logit contributions
ucl-4.536
ACA-4.065
UL-3.893
UC-3.791
AMA-3.686
Token Compl
Token ID20011
Feature activation-4.79e-01

Pos logit contributions
 Spo+0.788
furt+0.764
 reprodu+0.703
irlf+0.691
+0.678
Neg logit contributions
ucl-1.201
ACA-1.085
UL-1.052
UC-1.029
AMA-1.001
Tokeniment
Token ID3681
Feature activation-5.99e-01

Pos logit contributions
 Spo+1.286
furt+1.257
 reprodu+1.178
irlf+1.158
+1.136
Neg logit contributions
ucl-1.301
ACA-1.151
UL-1.096
UC-1.071
AMA-1.041
Token with
Token ID351
Feature activation-3.06e-01

Pos logit contributions
 Spo+1.236
furt+1.191
 reprodu+1.095
irlf+1.073
+1.051
Neg logit contributions
ucl-2.000
ACA-1.814
UL-1.756
UC-1.720
AMA-1.676
Token previous
Token ID2180
Feature activation+3.93e-01

Pos logit contributions
 Spo+0.627
furt+0.606
 reprodu+0.551
irlf+0.547
+0.534
Neg logit contributions
ucl-1.024
ACA-0.929
UL-0.901
UC-0.883
AMA-0.859
Token ser
Token ID1055
Feature activation-1.52e+00

Pos logit contributions
ucl+1.307
ACA+1.180
UL+1.150
UC+1.121
AMA+1.093
Neg logit contributions
 Spo-0.810
furt-0.785
irlf-0.717
 reprodu-0.703
-0.691
Tokenach
Token ID620
Feature activation-2.95e+00

Pos logit contributions
 Spo+4.364
furt+4.306
 reprodu+4.031
irlf+3.988
ptoms+3.931
Neg logit contributions
ucl-3.709
ACA-3.260
UL-3.137
UC-3.043
AMA-2.936
Token locations
Token ID7064
Feature activation+3.15e-01

Pos logit contributions
 Spo+7.621
furt+7.411
 reprodu+7.019
ptoms+6.853
irlf+6.825
Neg logit contributions
ucl-7.561
ACA-6.818
UL-6.466
UC-6.339
AMA-6.201
Token *)
Token ID31936
Feature activation-4.77e-01

Pos logit contributions
ucl+1.013
ACA+0.914
UL+0.888
UC+0.872
AMA+0.844
Neg logit contributions
 Spo-0.677
furt-0.661
irlf-0.602
 reprodu-0.596
ptoms-0.587
Token step
Token ID2239
Feature activation-7.11e-01

Pos logit contributions
 Spo+1.030
furt+0.996
 reprodu+0.919
irlf+0.901
+0.884
Neg logit contributions
ucl-1.549
ACA-1.400
UL-1.355
UC-1.326
AMA-1.290
TokenNew
Token ID3791
Feature activation-1.59e-01

Pos logit contributions
 Spo+2.066
furt+2.008
 reprodu+1.894
irlf+1.868
+1.846
Neg logit contributions
ucl-1.767
ACA-1.545
UL-1.470
UC-1.431
AMA-1.383
Token =
Token ID796
Feature activation+3.50e-01

Pos logit contributions
 Spo+0.354
furt+0.342
 reprodu+0.317
irlf+0.311
+0.304
Neg logit contributions
ucl-0.506
ACA-0.457
UL-0.442
UC-0.433
AMA-0.420
Token
Token ID251
Feature activation+3.21e-01

Pos logit contributions
 Spo+0.144
furt+0.138
 reprodu+0.130
irlf+0.127
+0.124
Neg logit contributions
ucl-0.184
ACA-0.166
UL-0.159
UC-0.156
AMA-0.152
Token and
Token ID290
Feature activation-6.12e-01

Pos logit contributions
ucl+0.987
ACA+0.893
UL+0.861
UC+0.847
AMA+0.816
Neg logit contributions
 Spo-0.741
furt-0.717
 reprodu-0.663
irlf-0.662
-0.645
Token
Token ID564
Feature activation-1.32e+00

Pos logit contributions
 Spo+1.409
furt+1.359
 reprodu+1.266
irlf+1.239
+1.216
Neg logit contributions
ucl-1.900
ACA-1.710
UL-1.649
UC-1.613
AMA-1.568
Token
Token ID250
Feature activation-8.78e-01

Pos logit contributions
 Spo+2.598
furt+2.507
 reprodu+2.271
irlf+2.238
+2.201
Neg logit contributions
ucl-4.478
ACA-4.067
UL-3.948
UC-3.849
AMA-3.773
Tokenactive
Token ID5275
Feature activation-9.10e-01

Pos logit contributions
 Spo+2.051
furt+2.024
 reprodu+1.854
irlf+1.815
+1.796
Neg logit contributions
ucl-2.632
ACA-2.363
UL-2.275
UC-2.210
AMA-2.150
Token bystand
Token ID33326
Feature activation-2.93e+00

Pos logit contributions
 Spo+2.178
furt+2.098
 reprodu+1.977
irlf+1.919
+1.892
Neg logit contributions
ucl-2.731
ACA-2.454
UL-2.354
UC-2.296
AMA-2.241
Tokener
Token ID263
Feature activation-3.73e-01

Pos logit contributions
 Spo+3.639
furt+3.513
 reprodu+3.004
ipal+2.869
irlf+2.774
Neg logit contributions
ucl-11.565
ACA-10.834
UL-10.522
UC-10.271
AMA-10.171
Token training
Token ID3047
Feature activation+2.50e-01

Pos logit contributions
 Spo+0.845
furt+0.816
 reprodu+0.757
irlf+0.743
+0.729
Neg logit contributions
ucl-1.169
ACA-1.053
UL-1.016
UC-0.995
AMA-0.967
Token
Token ID447
Feature activation+2.68e-01

Pos logit contributions
ucl+0.748
ACA+0.671
UL+0.648
UC+0.637
AMA+0.614
Neg logit contributions
 Spo-0.600
furt-0.582
 reprodu-0.540
irlf-0.534
-0.526
Token
Token ID251
Feature activation+5.84e-01

Pos logit contributions
ucl+0.772
ACA+0.694
UL+0.657
UC+0.650
AMA+0.633
Neg logit contributions
 Spo-0.669
furt-0.643
 reprodu-0.610
irlf-0.593
-0.582
Token.
Token ID13
Feature activation+3.49e-01

Pos logit contributions
ucl+1.714
ACA+1.542
UL+1.483
UC+1.459
AMA+1.403
Neg logit contributions
 Spo-1.422
furt-1.381
 reprodu-1.286
irlf-1.281
-1.249
Token,
Token ID11
Feature activation-2.23e-01

Pos logit contributions
ucl+1.686
ACA+1.518
UL+1.476
UC+1.442
AMA+1.394
Neg logit contributions
 Spo-1.236
furt-1.208
irlf-1.100
-1.084
 reprodu-1.082
Token investigators
Token ID10093
Feature activation-8.39e-01

Pos logit contributions
 Spo+0.440
furt+0.424
 reprodu+0.384
irlf+0.381
+0.372
Neg logit contributions
ucl-0.764
ACA-0.699
UL-0.676
UC-0.661
AMA-0.647
Token cannot
Token ID2314
Feature activation+3.79e-01

Pos logit contributions
 Spo+1.658
furt+1.586
 reprodu+1.466
irlf+1.424
+1.390
Neg logit contributions
ucl-2.875
ACA-2.618
UL-2.532
UC-2.483
AMA-2.423
Token pinpoint
Token ID30534
Feature activation-1.11e+00

Pos logit contributions
ucl+1.089
ACA+0.961
UL+0.934
UC+0.909
AMA+0.872
Neg logit contributions
 Spo-0.950
furt-0.943
irlf-0.853
-0.846
 reprodu-0.842
Token the
Token ID262
Feature activation-4.03e-01

Pos logit contributions
 Spo+1.929
furt+1.838
 reprodu+1.682
irlf+1.609
+1.571
Neg logit contributions
ucl-4.058
ACA-3.713
UL-3.591
UC-3.540
AMA-3.454
Token exact
Token ID2748
Feature activation-2.92e+00

Pos logit contributions
 Spo+0.826
furt+0.798
 reprodu+0.724
irlf+0.721
+0.706
Neg logit contributions
ucl-1.348
ACA-1.225
UL-1.185
UC-1.161
AMA-1.130
Token mechanism
Token ID9030
Feature activation-1.24e+00

Pos logit contributions
 Spo+5.416
furt+5.123
 reprodu+5.003
 exploits+4.607
ptoms+4.428
Neg logit contributions
ucl-9.860
ACA-8.954
UL-8.707
UC-8.573
AMA-8.380
Token that
Token ID326
Feature activation-1.22e+00

Pos logit contributions
 Spo+2.660
furt+2.543
 reprodu+2.385
irlf+2.294
+2.262
Neg logit contributions
ucl-4.005
ACA-3.612
UL-3.472
UC-3.423
AMA-3.335
Token triggered
Token ID13973
Feature activation-2.23e-01

Pos logit contributions
 Spo+2.154
furt+2.049
 reprodu+1.872
irlf+1.812
+1.769
Neg logit contributions
ucl-4.416
ACA-4.038
UL-3.915
UC-3.845
AMA-3.757
Token the
Token ID262
Feature activation-4.45e-01

Pos logit contributions
 Spo+0.434
furt+0.417
 reprodu+0.380
irlf+0.373
+0.365
Neg logit contributions
ucl-0.770
ACA-0.702
UL-0.680
UC-0.666
AMA-0.651
Token stream
Token ID4269
Feature activation-5.82e-01

Pos logit contributions
 Spo+1.026
furt+0.995
 reprodu+0.917
irlf+0.907
+0.890
Neg logit contributions
ucl-1.380
ACA-1.241
UL-1.199
UC-1.171
AMA-1.139
Token theme
Token ID7505
Feature activation-1.01e+00

Pos logit contributions
 Spo+0.745
furt+0.718
 reprodu+0.665
irlf+0.652
+0.639
Neg logit contributions
ucl-1.080
ACA-0.976
UL-0.942
UC-0.923
AMA-0.898
Token forces
Token ID3386
Feature activation-1.73e-01

Pos logit contributions
 Spo+2.158
furt+2.065
 reprodu+1.934
irlf+1.849
+1.835
Neg logit contributions
ucl-3.274
ACA-2.968
UL-2.856
UC-2.788
AMA-2.727
Token for
Token ID329
Feature activation-1.29e+00

Pos logit contributions
 Spo+0.383
furt+0.373
 reprodu+0.341
irlf+0.341
+0.332
Neg logit contributions
ucl-0.550
ACA-0.495
UL-0.481
UC-0.471
AMA-0.458
Token WAR
Token ID11837
Feature activation-4.09e-01

Pos logit contributions
 Spo+2.761
furt+2.601
 reprodu+2.449
irlf+2.354
+2.314
Neg logit contributions
ucl-4.207
ACA-3.815
UL-3.667
UC-3.594
AMA-3.497
TokenM
Token ID44
Feature activation-1.73e-01

Pos logit contributions
 Spo+1.281
furt+1.240
 reprodu+1.180
irlf+1.165
+1.151
Neg logit contributions
ucl-0.932
ACA-0.801
UL-0.758
UC-0.738
AMA-0.703
TokenACH
Token ID16219
Feature activation-2.91e+00

Pos logit contributions
 Spo+0.479
furt+0.464
 reprodu+0.438
irlf+0.429
+0.424
Neg logit contributions
ucl-0.459
ACA-0.403
UL-0.385
UC-0.376
AMA-0.362
TokenINE
Token ID8881
Feature activation-5.31e-01

Pos logit contributions
 Spo+6.835
furt+6.307
 reprodu+6.246
+5.943
ptoms+5.834
Neg logit contributions
ucl-8.624
ACA-7.114
UL-6.852
UC-6.790
elaide-6.727
Token and
Token ID290
Feature activation+1.85e-01

Pos logit contributions
 Spo+1.201
furt+1.155
 reprodu+1.075
irlf+1.048
+1.033
Neg logit contributions
ucl-1.672
ACA-1.503
UL-1.448
UC-1.417
AMA-1.377
Token H
Token ID367
Feature activation-6.88e-02

Pos logit contributions
ucl+0.596
ACA+0.542
UL+0.525
UC+0.513
AMA+0.501
Neg logit contributions
 Spo-0.399
furt-0.393
 reprodu-0.358
irlf-0.355
ptoms-0.347
TokenORD
Token ID12532
Feature activation-6.69e-01

Pos logit contributions
 Spo+0.231
furt+0.226
 reprodu+0.214
irlf+0.213
+0.210
Neg logit contributions
ucl-0.140
ACA-0.116
UL-0.108
UC-0.105
AMA-0.100
TokenES
Token ID1546
Feature activation-1.72e-01

Pos logit contributions
 Spo+2.206
furt+2.159
 reprodu+2.051
irlf+2.020
+1.996
Neg logit contributions
ucl-1.393
ACA-1.168
UL-1.104
UC-1.074
AMA-1.024
Token 324
Token ID38595
Feature activation-1.43e-02

Pos logit contributions
 Spo+0.283
furt+0.274
 reprodu+0.252
irlf+0.247
+0.241
Neg logit contributions
ucl-0.448
ACA-0.407
UL-0.393
UC-0.385
AMA-0.375
Token.
Token ID764
Feature activation-8.49e-01

Pos logit contributions
 Spo+0.029
furt+0.028
 reprodu+0.026
irlf+0.026
+0.025
Neg logit contributions
ucl-0.048
ACA-0.043
UL-0.042
UC-0.041
AMA-0.040
Token
Token ID198
Feature activation-4.86e-01

Pos logit contributions
 Spo+1.732
furt+1.663
 reprodu+1.532
irlf+1.491
+1.470
Neg logit contributions
ucl-2.857
ACA-2.582
UL-2.499
UC-2.449
AMA-2.387
Token
Token ID198
Feature activation-3.52e-01

Pos logit contributions
 Spo+1.080
furt+1.041
 reprodu+0.971
irlf+0.956
+0.926
Neg logit contributions
ucl-1.544
ACA-1.400
UL-1.354
UC-1.326
AMA-1.288
TokenGoogle
Token ID11708
Feature activation-1.30e+00

Pos logit contributions
 Spo+0.877
furt+0.850
 reprodu+0.793
irlf+0.777
+0.769
Neg logit contributions
ucl-1.024
ACA-0.911
UL-0.876
UC-0.855
AMA-0.831
Token Scholar
Token ID11713
Feature activation-2.89e+00

Pos logit contributions
 Spo+2.339
furt+2.233
 reprodu+2.039
irlf+1.968
+1.930
Neg logit contributions
ucl-4.677
ACA-4.269
UL-4.131
UC-4.056
AMA-3.967
Token Crossref
Token ID23613
Feature activation-1.12e+00

Pos logit contributions
 Spo+4.668
furt+4.445
 reprodu+4.106
+3.916
irlf+3.909
Neg logit contributions
ucl-10.387
ACA-9.371
UL-9.045
UC-8.901
ORPG-8.782
Token
Token ID198
Feature activation-5.26e-01

Pos logit contributions
 Spo+2.551
furt+2.451
 reprodu+2.295
irlf+2.217
+2.212
Neg logit contributions
ucl-3.444
ACA-3.095
UL-2.960
UC-2.901
AMA-2.826
Token
Token ID198
Feature activation-5.31e-01

Pos logit contributions
 Spo+1.189
furt+1.147
 reprodu+1.072
irlf+1.064
+1.029
Neg logit contributions
ucl-1.653
ACA-1.492
UL-1.444
UC-1.414
AMA-1.373
TokenM
Token ID44
Feature activation+5.44e-01

Pos logit contributions
 Spo+1.362
furt+1.326
 reprodu+1.226
+1.204
irlf+1.203
Neg logit contributions
ucl-1.492
ACA-1.316
UL-1.261
UC-1.228
AMA-1.195
Tokenasure
Token ID5015
Feature activation-1.49e+00

Pos logit contributions
ucl+1.434
ACA+1.264
UL+1.210
UC+1.177
AMA+1.138
Neg logit contributions
 Spo-1.513
furt-1.467
 reprodu-1.389
irlf-1.358
-1.345
Token using
Token ID1262
Feature activation+2.68e-01

Pos logit contributions
ucl+0.253
ACA+0.228
UL+0.222
UC+0.218
AMA+0.212
Neg logit contributions
 Spo-0.157
furt-0.152
irlf-0.139
 reprodu-0.136
-0.135
Token Hel
Token ID5053
Feature activation-1.61e+00

Pos logit contributions
ucl+0.844
ACA+0.759
UL+0.735
UC+0.715
AMA+0.699
Neg logit contributions
 Spo-0.603
furt-0.587
irlf-0.534
 reprodu-0.533
-0.525
Tokenium
Token ID1505
Feature activation-7.80e-01

Pos logit contributions
 Spo+4.338
furt+4.279
 reprodu+4.019
irlf+3.918
ipal+3.905
Neg logit contributions
ucl-4.098
ACA-3.691
UL-3.479
UC-3.407
AMA-3.286
Token makes
Token ID1838
Feature activation-8.99e-02

Pos logit contributions
 Spo+1.804
furt+1.732
 reprodu+1.619
irlf+1.573
+1.554
Neg logit contributions
ucl-2.398
ACA-2.155
UL-2.082
UC-2.043
AMA-1.983
Token them
Token ID606
Feature activation-5.68e-01

Pos logit contributions
 Spo+0.183
furt+0.177
 reprodu+0.160
irlf+0.159
+0.156
Neg logit contributions
ucl-0.303
ACA-0.274
UL-0.266
UC-0.260
AMA-0.254
Token very
Token ID845
Feature activation-2.86e+00

Pos logit contributions
 Spo+1.231
furt+1.186
 reprodu+1.097
irlf+1.074
+1.053
Neg logit contributions
ucl-1.839
ACA-1.663
UL-1.606
UC-1.573
AMA-1.532
Token expensive
Token ID5789
Feature activation-1.91e+00

Pos logit contributions
 Spo+5.163
 reprodu+4.726
furt+4.682
 exploits+4.073
+4.050
Neg logit contributions
ucl-9.939
ACA-9.264
UL-8.799
UC-8.695
AMA-8.566
Token.
Token ID13
Feature activation-4.92e-01

Pos logit contributions
 Spo+4.080
furt+3.879
 reprodu+3.746
+3.475
irlf+3.425
Neg logit contributions
ucl-6.103
ACA-5.567
UL-5.299
UC-5.221
AMA-5.071
Token Materials
Token ID24310
Feature activation+7.51e-01

Pos logit contributions
 Spo+1.094
furt+1.062
 reprodu+0.985
irlf+0.971
+0.944
Neg logit contributions
ucl-1.561
ACA-1.409
UL-1.362
UC-1.334
AMA-1.296
Token that
Token ID326
Feature activation-7.52e-01

Pos logit contributions
ucl+2.367
ACA+2.119
UL+2.075
UC+2.025
AMA+1.954
Neg logit contributions
 Spo-1.652
furt-1.623
irlf-1.491
 reprodu-1.457
ptoms-1.445
Token achieve
Token ID4620
Feature activation-4.98e-01

Pos logit contributions
 Spo+1.690
furt+1.631
 reprodu+1.513
irlf+1.484
+1.456
Neg logit contributions
ucl-2.374
ACA-2.141
UL-2.066
UC-2.023
AMA-1.967
Token supporters
Token ID5941
Feature activation+4.56e-01

Pos logit contributions
ucl+2.241
ACA+2.009
UL+1.930
UC+1.887
AMA+1.821
Neg logit contributions
 Spo-2.094
furt-2.047
 reprodu-1.904
irlf-1.880
-1.854
Token,
Token ID11
Feature activation+3.66e-01

Pos logit contributions
ucl+1.287
ACA+1.151
UL+1.112
UC+1.082
AMA+1.044
Neg logit contributions
 Spo-1.161
furt-1.136
irlf-1.047
 reprodu-1.045
-1.030
Token many
Token ID867
Feature activation+5.84e-01

Pos logit contributions
ucl+1.176
ACA+1.069
UL+1.033
UC+1.015
AMA+0.978
Neg logit contributions
 Spo-0.793
furt-0.767
 reprodu-0.700
irlf-0.697
-0.689
Token wearing
Token ID5762
Feature activation+5.25e-01

Pos logit contributions
ucl+1.795
ACA+1.630
UL+1.571
UC+1.541
AMA+1.487
Neg logit contributions
 Spo-1.331
furt-1.293
irlf-1.187
 reprodu-1.176
-1.160
Token the
Token ID262
Feature activation+7.25e-01

Pos logit contributions
ucl+1.564
ACA+1.410
UL+1.354
UC+1.335
AMA+1.290
Neg logit contributions
 Spo-1.236
furt-1.220
irlf-1.134
ptoms-1.111
-1.106
Token colours
Token ID18915
Feature activation+2.11e+00

Pos logit contributions
ucl+2.044
ACA+1.830
UL+1.767
UC+1.743
AMA+1.667
Neg logit contributions
 Spo-1.813
furt-1.795
irlf-1.669
-1.634
 reprodu-1.634
Token of
Token ID286
Feature activation+5.96e-01

Pos logit contributions
ucl+6.016
ACA+5.450
UL+5.324
UC+5.265
AMA+5.024
Neg logit contributions
 Spo-5.020
furt-4.923
irlf-4.659
 reprodu-4.497
ipal-4.456
Token FC
Token ID10029
Feature activation+6.56e-01

Pos logit contributions
ucl+1.740
ACA+1.553
UL+1.504
UC+1.473
AMA+1.420
Neg logit contributions
 Spo-1.454
furt-1.424
 reprodu-1.308
irlf-1.301
-1.299
Token Barcelona
Token ID15142
Feature activation+8.33e-01

Pos logit contributions
ucl+1.595
ACA+1.399
UL+1.336
UC+1.294
AMA+1.232
Neg logit contributions
 Spo-1.953
furt-1.887
 reprodu-1.800
irlf-1.782
-1.768
Token or
Token ID393
Feature activation+9.86e-01

Pos logit contributions
ucl+2.437
ACA+2.197
UL+2.117
UC+2.062
AMA+1.990
Neg logit contributions
 Spo-2.038
furt-1.968
irlf-1.834
 reprodu-1.831
-1.787
Token flying
Token ID7348
Feature activation+2.09e-01

Pos logit contributions
ucl+3.100
ACA+2.827
UL+2.732
UC+2.682
AMA+2.587
Neg logit contributions
 Spo-2.149
furt-2.085
irlf-1.932
ptoms-1.895
 reprodu-1.890
Token Last
Token ID4586
Feature activation+7.18e-02

Pos logit contributions
ucl+0.153
ACA+0.137
UL+0.132
UC+0.129
AMA+0.126
Neg logit contributions
 Spo-0.117
furt-0.113
 reprodu-0.105
irlf-0.103
-0.101
Token Prison
Token ID16999
Feature activation-1.63e+00

Pos logit contributions
ucl+0.222
ACA+0.200
UL+0.194
UC+0.189
AMA+0.185
Neg logit contributions
 Spo-0.165
furt-0.160
 reprodu-0.148
irlf-0.146
-0.143
Tokener
Token ID263
Feature activation+3.64e-01

Pos logit contributions
 Spo+3.025
furt+2.824
 reprodu+2.575
irlf+2.493
+2.440
Neg logit contributions
ucl-5.815
ACA-5.263
UL-5.100
UC-5.016
AMA-4.865
Token of
Token ID286
Feature activation-5.16e-02

Pos logit contributions
ucl+1.172
ACA+1.064
UL+1.030
UC+1.009
AMA+0.985
Neg logit contributions
 Spo-0.778
furt-0.760
 reprodu-0.692
irlf-0.684
-0.668
Token War
Token ID1810
Feature activation+4.18e-01

Pos logit contributions
 Spo+0.113
furt+0.109
 reprodu+0.101
irlf+0.099
+0.097
Neg logit contributions
ucl-0.166
ACA-0.150
UL-0.145
UC-0.142
AMA-0.138
Token noted
Token ID4367
Feature activation+1.83e+00

Pos logit contributions
ucl+1.296
ACA+1.172
UL+1.131
UC+1.105
AMA+1.079
Neg logit contributions
 Spo-0.944
furt-0.928
 reprodu-0.845
irlf-0.837
-0.820
Token that
Token ID326
Feature activation+2.36e-01

Pos logit contributions
ucl+5.350
ACA+4.807
UL+4.638
UC+4.552
AMA+4.447
Neg logit contributions
 Spo-4.245
furt-4.200
irlf-3.855
-3.823
ptoms-3.819
Token political
Token ID1964
Feature activation+4.35e-01

Pos logit contributions
ucl+0.755
ACA+0.684
UL+0.660
UC+0.648
AMA+0.631
Neg logit contributions
 Spo-0.517
furt-0.503
 reprodu-0.461
irlf-0.455
-0.449
Token calculations
Token ID16765
Feature activation+1.21e+00

Pos logit contributions
ucl+1.237
ACA+1.100
UL+1.059
UC+1.033
AMA+1.004
Neg logit contributions
 Spo-1.103
furt-1.072
irlf-0.997
 reprodu-0.995
-0.976
Token were
Token ID547
Feature activation+9.52e-01

Pos logit contributions
ucl+3.838
ACA+3.458
UL+3.370
UC+3.312
AMA+3.195
Neg logit contributions
 Spo-2.518
furt-2.493
irlf-2.271
 reprodu-2.241
-2.213
Token first
Token ID717
Feature activation+3.07e-01

Pos logit contributions
ucl+3.044
ACA+2.776
UL+2.693
UC+2.665
AMA+2.546
Neg logit contributions
 Spo-1.955
furt-1.901
ptoms-1.774
irlf-1.774
-1.716
Token will
Token ID481
Feature activation-9.22e-02

Pos logit contributions
ucl+2.472
ACA+2.252
UL+2.194
UC+2.139
AMA+2.099
Neg logit contributions
 Spo-1.591
furt-1.564
irlf-1.418
 reprodu-1.411
-1.382
Token prohibit
Token ID20767
Feature activation-7.98e-02

Pos logit contributions
 Spo+0.221
furt+0.216
 reprodu+0.199
irlf+0.197
+0.194
Neg logit contributions
ucl-0.275
ACA-0.247
UL-0.239
UC-0.233
AMA-0.226
Token any
Token ID597
Feature activation+7.84e-01

Pos logit contributions
 Spo+0.159
furt+0.156
irlf+0.141
 reprodu+0.139
+0.138
Neg logit contributions
ucl-0.269
ACA-0.244
UL-0.237
UC-0.232
AMA-0.228
Token photograph
Token ID8408
Feature activation+3.36e-01

Pos logit contributions
ucl+2.536
ACA+2.299
UL+2.237
UC+2.185
AMA+2.134
Neg logit contributions
 Spo-1.652
furt-1.616
irlf-1.492
ptoms-1.451
-1.429
Token,
Token ID11
Feature activation+1.01e+00

Pos logit contributions
ucl+1.033
ACA+0.927
UL+0.899
UC+0.879
AMA+0.855
Neg logit contributions
 Spo-0.773
furt-0.754
irlf-0.696
 reprodu-0.681
ptoms-0.679
Token video
Token ID2008
Feature activation+2.05e+00

Pos logit contributions
ucl+3.007
ACA+2.698
UL+2.641
UC+2.574
AMA+2.513
Neg logit contributions
 Spo-2.356
furt-2.337
irlf-2.171
ptoms-2.130
vironments-2.086
Token or
Token ID393
Feature activation+2.46e+00

Pos logit contributions
ucl+5.554
UL+4.870
ACA+4.839
UC+4.718
AMA+4.501
Neg logit contributions
 Spo-5.175
furt-5.139
irlf-4.832
ptoms-4.710
vironments-4.670
Token digital
Token ID4875
Feature activation+1.33e+00

Pos logit contributions
ucl+7.489
ACA+6.739
UL+6.723
UC+6.523
AMA+6.475
Neg logit contributions
furt-5.123
 Spo-5.007
irlf-4.725
ptoms-4.672
vironments-4.525
Token image
Token ID2939
Feature activation+1.03e+00

Pos logit contributions
ucl+3.936
ACA+3.486
UL+3.406
UC+3.324
AMA+3.234
Neg logit contributions
 Spo-3.142
furt-3.119
irlf-2.860
ptoms-2.855
vironments-2.775
Token of
Token ID286
Feature activation+4.27e-01

Pos logit contributions
ucl+3.278
ACA+2.939
UL+2.878
UC+2.806
AMA+2.756
Neg logit contributions
 Spo-2.157
furt-2.125
irlf-1.981
ptoms-1.961
vironments-1.898
Token a
Token ID257
Feature activation+2.52e-01

Pos logit contributions
ucl+1.458
ACA+1.314
UL+1.285
UC+1.270
AMA+1.230
Neg logit contributions
 Spo-0.830
furt-0.810
irlf-0.723
ptoms-0.716
 reprodu-0.712
Tokenmann
Token ID9038
Feature activation-1.13e-01

Pos logit contributions
 Spo+0.387
furt+0.374
 reprodu+0.349
irlf+0.343
+0.337
Neg logit contributions
ucl-0.506
ACA-0.453
UL-0.438
UC-0.429
AMA-0.417
Token,
Token ID11
Feature activation+2.92e-01

Pos logit contributions
 Spo+0.265
furt+0.257
 reprodu+0.238
irlf+0.235
+0.230
Neg logit contributions
ucl-0.345
ACA-0.309
UL-0.299
UC-0.292
AMA-0.284
Token San
Token ID2986
Feature activation-6.58e-01

Pos logit contributions
ucl+0.921
ACA+0.830
UL+0.809
UC+0.790
AMA+0.765
Neg logit contributions
 Spo-0.640
furt-0.628
irlf-0.570
 reprodu-0.568
ptoms-0.562
Tokeni
Token ID72
Feature activation+3.25e-02

Pos logit contributions
 Spo+1.615
furt+1.567
 reprodu+1.458
irlf+1.433
+1.405
Neg logit contributions
ucl-1.939
ACA-1.742
UL-1.674
UC-1.634
AMA-1.587
Token,
Token ID11
Feature activation+1.15e+00

Pos logit contributions
ucl+0.104
ACA+0.094
UL+0.091
UC+0.089
AMA+0.087
Neg logit contributions
 Spo-0.071
furt-0.069
 reprodu-0.064
irlf-0.062
-0.061
Token T
Token ID309
Feature activation+2.04e+00

Pos logit contributions
ucl+3.527
ACA+3.156
UL+3.097
UC+3.019
AMA+2.921
Neg logit contributions
 Spo-2.523
furt-2.504
irlf-2.295
 reprodu-2.279
ptoms-2.250
Tokenomm
Token ID2002
Feature activation+5.28e-02

Pos logit contributions
ucl+5.520
ACA+4.852
UL+4.775
UC+4.617
AMA+4.497
Neg logit contributions
 Spo-5.319
furt-5.131
 reprodu-4.854
-4.728
irlf-4.690
Tokeni
Token ID72
Feature activation+2.35e-01

Pos logit contributions
ucl+0.171
ACA+0.153
UL+0.149
UC+0.145
AMA+0.142
Neg logit contributions
 Spo-0.115
furt-0.111
 reprodu-0.103
irlf-0.101
-0.099
Token L
Token ID406
Feature activation+1.19e+00

Pos logit contributions
ucl+0.746
ACA+0.672
UL+0.650
UC+0.636
AMA+0.620
Neg logit contributions
 Spo-0.520
furt-0.503
 reprodu-0.467
irlf-0.459
-0.450
Tokenä
Token ID11033
Feature activation-3.88e-01

Pos logit contributions
ucl+3.278
ACA+2.893
UL+2.802
UC+2.742
AMA+2.645
Neg logit contributions
 Spo-3.135
furt-3.023
 reprodu-2.861
-2.794
irlf-2.766
Tokennt
Token ID429
Feature activation+5.68e-01

Pos logit contributions
 Spo+1.010
furt+0.979
 reprodu+0.918
irlf+0.903
+0.889
Neg logit contributions
ucl-1.087
ACA-0.966
UL-0.928
UC-0.906
AMA-0.877
Token being
Token ID852
Feature activation+9.57e-01

Pos logit contributions
ucl+3.690
ACA+3.305
UL+3.221
UC+3.145
AMA+3.032
Neg logit contributions
 Spo-2.544
furt-2.503
irlf-2.273
 reprodu-2.250
ptoms-2.241
Token bundled
Token ID34358
Feature activation+1.13e+00

Pos logit contributions
ucl+3.076
ACA+2.772
UL+2.699
UC+2.633
AMA+2.543
Neg logit contributions
 Spo-2.053
furt-2.018
-1.798
irlf-1.788
 reprodu-1.788
Token into
Token ID656
Feature activation-7.83e-03

Pos logit contributions
ucl+3.573
ACA+3.203
UL+3.126
UC+3.046
AMA+2.958
Neg logit contributions
 Spo-2.471
furt-2.452
irlf-2.213
ptoms-2.204
-2.198
Token a
Token ID257
Feature activation-3.17e-03

Pos logit contributions
 Spo+0.016
furt+0.015
 reprodu+0.014
irlf+0.014
+0.013
Neg logit contributions
ucl-0.027
ACA-0.024
UL-0.023
UC-0.023
AMA-0.022
Token carry
Token ID3283
Feature activation+1.59e+00

Pos logit contributions
 Spo+0.008
furt+0.007
 reprodu+0.007
irlf+0.007
+0.007
Neg logit contributions
ucl-0.010
ACA-0.009
UL-0.008
UC-0.008
AMA-0.008
Token c
Token ID269
Feature activation+1.89e+00

Pos logit contributions
ucl+4.618
ACA+4.157
UL+3.962
UC+3.869
AMA+3.765
Neg logit contributions
 Spo-3.905
furt-3.784
 reprodu-3.471
ptoms-3.453
-3.452
Tokenot
Token ID313
Feature activation+2.98e+00

Pos logit contributions
ucl+4.765
ACA+4.175
UL+3.950
UC+3.821
AMA+3.767
Neg logit contributions
 Spo-5.440
furt-5.255
 reprodu-5.018
-4.894
ptoms-4.826
Token and
Token ID290
Feature activation+6.82e-01

Pos logit contributions
ucl+8.759
ACA+7.810
UL+7.565
UC+7.416
AMA+7.162
Neg logit contributions
 Spo-6.739
furt-6.424
ptoms-5.894
 reprodu-5.844
irlf-5.803
Token on
Token ID319
Feature activation-5.15e-01

Pos logit contributions
ucl+2.203
ACA+1.990
UL+1.922
UC+1.882
AMA+1.829
Neg logit contributions
 Spo-1.456
furt-1.426
irlf-1.288
 reprodu-1.281
-1.272
Token and
Token ID290
Feature activation+5.37e-01

Pos logit contributions
 Spo+1.087
furt+1.045
 reprodu+0.967
irlf+0.945
+0.924
Neg logit contributions
ucl-1.691
ACA-1.533
UL-1.481
UC-1.451
AMA-1.417
Token off
Token ID572
Feature activation+3.50e-01

Pos logit contributions
ucl+1.748
ACA+1.586
UL+1.530
UC+1.496
AMA+1.452
Neg logit contributions
 Spo-1.146
furt-1.110
 reprodu-1.013
irlf-0.997
-0.982
Token killing
Token ID5170
Feature activation+1.67e-01

Pos logit contributions
ucl+2.865
ACA+2.577
UL+2.524
UC+2.451
AMA+2.367
Neg logit contributions
 Spo-2.276
furt-2.253
irlf-2.066
 reprodu-2.051
ptoms-2.032
Token the
Token ID262
Feature activation+3.61e-01

Pos logit contributions
ucl+0.540
ACA+0.490
UL+0.474
UC+0.463
AMA+0.452
Neg logit contributions
 Spo-0.358
furt-0.349
 reprodu-0.316
irlf-0.315
-0.309
Token Trump
Token ID1301
Feature activation-3.25e-02

Pos logit contributions
ucl+1.048
ACA+0.939
UL+0.908
UC+0.884
AMA+0.855
Neg logit contributions
 Spo-0.891
furt-0.871
 reprodu-0.801
irlf-0.799
ptoms-0.786
Token marketing
Token ID7124
Feature activation-1.30e-01

Pos logit contributions
 Spo+0.079
furt+0.077
 reprodu+0.071
irlf+0.070
+0.069
Neg logit contributions
ucl-0.097
ACA-0.087
UL-0.084
UC-0.082
AMA-0.079
Token myst
Token ID21619
Feature activation+1.10e+00

Pos logit contributions
 Spo+0.328
furt+0.318
 reprodu+0.297
irlf+0.293
+0.287
Neg logit contributions
ucl-0.373
ACA-0.334
UL-0.321
UC-0.313
AMA-0.304
Tokenique
Token ID2350
Feature activation+9.27e-01

Pos logit contributions
ucl+2.296
ACA+1.931
UL+1.877
UC+1.790
AMA+1.710
Neg logit contributions
 Spo-3.591
furt-3.529
 reprodu-3.347
irlf-3.285
-3.270
Token.
Token ID13
Feature activation+4.05e-01

Pos logit contributions
ucl+2.625
ACA+2.368
UL+2.291
UC+2.235
AMA+2.162
Neg logit contributions
 Spo-2.318
furt-2.287
irlf-2.092
 reprodu-2.090
-2.036
Token
Token ID198
Feature activation+2.41e-01

Pos logit contributions
ucl+1.257
ACA+1.138
UL+1.103
UC+1.080
AMA+1.051
Neg logit contributions
 Spo-0.913
furt-0.898
 reprodu-0.836
irlf-0.824
ptoms-0.798
Token
Token ID198
Feature activation+3.76e-01

Pos logit contributions
ucl+0.718
ACA+0.650
UL+0.630
UC+0.617
AMA+0.597
Neg logit contributions
 Spo-0.579
furt-0.558
irlf-0.534
 reprodu-0.529
ptoms-0.508
TokenSo
Token ID2396
Feature activation-9.04e-01

Pos logit contributions
ucl+1.063
ACA+0.957
UL+0.922
UC+0.901
AMA+0.871
Neg logit contributions
 Spo-0.973
furt-0.940
 reprodu-0.887
irlf-0.873
ptoms-0.846
Token those
Token ID883
Feature activation-4.17e-01

Pos logit contributions
 Spo+1.808
furt+1.728
 reprodu+1.595
irlf+1.549
+1.518
Neg logit contributions
ucl-3.078
ACA-2.792
UL-2.698
UC-2.651
AMA-2.585
Token her
Token ID607
Feature activation+9.58e-02

Pos logit contributions
 Spo+0.350
furt+0.337
 reprodu+0.308
irlf+0.302
+0.296
Neg logit contributions
ucl-0.605
ACA-0.550
UL-0.533
UC-0.523
AMA-0.510
Token enc
Token ID2207
Feature activation+2.74e-01

Pos logit contributions
ucl+0.291
ACA+0.261
UL+0.253
UC+0.248
AMA+0.240
Neg logit contributions
 Spo-0.225
furt-0.219
 reprodu-0.202
irlf-0.200
-0.197
Tokenore
Token ID382
Feature activation-9.78e-02

Pos logit contributions
ucl+0.526
ACA+0.438
UL+0.410
UC+0.398
AMA+0.377
Neg logit contributions
 Spo-0.959
furt-0.932
 reprodu-0.892
irlf-0.881
-0.873
Token of
Token ID286
Feature activation-6.70e-01

Pos logit contributions
 Spo+0.226
furt+0.218
 reprodu+0.203
irlf+0.199
+0.195
Neg logit contributions
ucl-0.302
ACA-0.272
UL-0.262
UC-0.257
AMA-0.250
Token
Token ID564
Feature activation-2.37e-01

Pos logit contributions
 Spo+1.428
furt+1.360
 reprodu+1.262
irlf+1.233
+1.208
Neg logit contributions
ucl-2.198
ACA-1.990
UL-1.922
UC-1.881
AMA-1.835
Token
Token ID246
Feature activation+3.18e-01

Pos logit contributions
 Spo+0.523
furt+0.505
 reprodu+0.471
irlf+0.460
+0.448
Neg logit contributions
ucl-0.755
ACA-0.686
UL-0.660
UC-0.648
AMA-0.628
TokenLove
Token ID18565
Feature activation-8.48e-01

Pos logit contributions
ucl+0.965
ACA+0.872
UL+0.840
UC+0.823
AMA+0.799
Neg logit contributions
 Spo-0.745
furt-0.723
 reprodu-0.678
irlf-0.663
-0.647
Token More
Token ID3125
Feature activation+2.30e-01

Pos logit contributions
 Spo+1.865
furt+1.791
 reprodu+1.656
irlf+1.624
+1.577
Neg logit contributions
ucl-2.715
ACA-2.457
UL-2.372
UC-2.318
AMA-2.259
Token
Token ID447
Feature activation+3.53e-01

Pos logit contributions
ucl+0.781
ACA+0.709
UL+0.687
UC+0.675
AMA+0.657
Neg logit contributions
 Spo-0.459
furt-0.444
 reprodu-0.407
irlf-0.397
-0.389
Token
Token ID247
Feature activation+2.47e-01

Pos logit contributions
ucl+1.010
ACA+0.906
UL+0.865
UC+0.848
AMA+0.820
Neg logit contributions
 Spo-0.893
furt-0.862
 reprodu-0.816
irlf-0.799
ptoms-0.776
Token and
Token ID290
Feature activation-6.16e-01

Pos logit contributions
ucl+0.782
ACA+0.707
UL+0.683
UC+0.668
AMA+0.651
Neg logit contributions
 Spo-0.551
furt-0.531
 reprodu-0.494
irlf-0.483
-0.474
Token was
Token ID373
Feature activation+2.88e-01

Pos logit contributions
ucl+1.991
ACA+1.829
UL+1.762
UC+1.732
AMA+1.667
Neg logit contributions
 Spo-1.313
furt-1.288
 reprodu-1.171
irlf-1.160
-1.140
Token a
Token ID257
Feature activation-1.01e-01

Pos logit contributions
ucl+0.974
ACA+0.890
UL+0.864
UC+0.846
AMA+0.821
Neg logit contributions
 Spo-0.573
furt-0.553
 reprodu-0.506
irlf-0.497
-0.490
Token big
Token ID1263
Feature activation+7.71e-01

Pos logit contributions
 Spo+0.244
furt+0.236
 reprodu+0.220
irlf+0.216
+0.212
Neg logit contributions
ucl-0.301
ACA-0.270
UL-0.260
UC-0.254
AMA-0.247
Token mistake
Token ID7457
Feature activation-1.17e+00

Pos logit contributions
ucl+2.182
ACA+1.950
UL+1.873
UC+1.832
AMA+1.764
Neg logit contributions
 Spo-1.961
furt-1.906
 reprodu-1.791
irlf-1.772
ptoms-1.747
Token.
Token ID13
Feature activation+1.14e-01

Pos logit contributions
 Spo+2.734
furt+2.623
 reprodu+2.456
irlf+2.387
+2.350
Neg logit contributions
ucl-3.605
ACA-3.211
UL-3.088
UC-3.028
AMA-2.943
Token Hitchcock
Token ID49817
Feature activation+3.55e-01

Pos logit contributions
ucl+0.335
ACA+0.302
UL+0.292
UC+0.285
AMA+0.277
Neg logit contributions
 Spo-0.276
furt-0.268
 reprodu-0.252
irlf-0.247
ptoms-0.241
Token shot
Token ID2823
Feature activation-3.42e-01

Pos logit contributions
ucl+1.209
ACA+1.101
UL+1.066
UC+1.047
AMA+1.019
Neg logit contributions
 Spo-0.700
furt-0.676
 reprodu-0.609
irlf-0.605
ptoms-0.590
Token everything
Token ID2279
Feature activation+6.24e-01

Pos logit contributions
 Spo+0.710
furt+0.684
 reprodu+0.629
irlf+0.616
+0.604
Neg logit contributions
ucl-1.140
ACA-1.034
UL-1.000
UC-0.980
AMA-0.955
Token on
Token ID319
Feature activation-1.56e-01

Pos logit contributions
ucl+1.990
ACA+1.820
UL+1.756
UC+1.716
AMA+1.663
Neg logit contributions
 Spo-1.348
furt-1.319
 reprodu-1.189
ptoms-1.184
irlf-1.181
Token as
Token ID355
Feature activation+1.56e-01

Pos logit contributions
 Spo+0.349
furt+0.337
 reprodu+0.311
irlf+0.307
+0.301
Neg logit contributions
ucl-0.491
ACA-0.443
UL-0.428
UC-0.419
AMA-0.406
Tokenho
Token ID8873
Feature activation+4.60e-01

Pos logit contributions
ucl+0.538
ACA+0.489
UL+0.477
UC+0.466
AMA+0.450
Neg logit contributions
 Spo-0.305
furt-0.293
 reprodu-0.266
irlf-0.262
-0.259
Token an
Token ID281
Feature activation-3.59e-01

Pos logit contributions
 Spo+0.668
furt+0.640
 reprodu+0.594
irlf+0.577
+0.567
Neg logit contributions
ucl-1.037
ACA-0.940
UL-0.909
UC-0.890
AMA-0.868
Token elegant
Token ID19992
Feature activation-3.12e-02

Pos logit contributions
 Spo+0.841
furt+0.810
 reprodu+0.757
irlf+0.739
+0.728
Neg logit contributions
ucl-1.097
ACA-0.986
UL-0.950
UC-0.930
AMA-0.902
Token upd
Token ID2325
Feature activation+6.51e-01

Pos logit contributions
 Spo+0.077
furt+0.074
 reprodu+0.069
irlf+0.068
+0.067
Neg logit contributions
ucl-0.092
ACA-0.082
UL-0.079
UC-0.077
AMA-0.075
Tokeno
Token ID78
Feature activation+6.01e-02

Pos logit contributions
ucl+2.405
ACA+2.205
UL+2.140
UC+2.105
AMA+2.059
Neg logit contributions
 Spo-1.106
furt-1.050
 reprodu-0.952
irlf-0.930
-0.899
Token and
Token ID290
Feature activation-5.22e-01

Pos logit contributions
ucl+0.181
ACA+0.162
UL+0.156
UC+0.152
AMA+0.148
Neg logit contributions
 Spo-0.144
furt-0.139
 reprodu-0.130
irlf-0.128
-0.126
Token red
Token ID2266
Feature activation+1.16e+00

Pos logit contributions
 Spo+0.954
furt+0.915
 reprodu+0.830
irlf+0.812
+0.792
Neg logit contributions
ucl-1.863
ACA-1.701
UL-1.650
UC-1.620
AMA-1.581
Token lipstick
Token ID41429
Feature activation-5.46e-01

Pos logit contributions
ucl+3.305
ACA+2.939
UL+2.839
UC+2.784
AMA+2.665
Neg logit contributions
 Spo-2.924
furt-2.843
 reprodu-2.649
irlf-2.634
-2.587
Token
Token ID851
Feature activation-8.41e-01

Pos logit contributions
 Spo+1.280
furt+1.227
 reprodu+1.157
irlf+1.122
+1.105
Neg logit contributions
ucl-1.669
ACA-1.497
UL-1.441
UC-1.412
AMA-1.369
Token very
Token ID845
Feature activation-2.00e+00

Pos logit contributions
 Spo+1.761
furt+1.687
 reprodu+1.563
irlf+1.525
+1.495
Neg logit contributions
ucl-2.785
ACA-2.523
UL-2.438
UC-2.388
AMA-2.327
Token Su
Token ID1778
Feature activation-8.96e-01

Pos logit contributions
 Spo+4.108
furt+3.837
 reprodu+3.644
+3.443
irlf+3.431
Neg logit contributions
ucl-6.640
ACA-6.043
UL-5.852
UC-5.735
AMA-5.568
Tokenzy
Token ID7357
Feature activation-1.18e+00

Pos logit contributions
 Spo+2.077
furt+2.006
 reprodu+1.854
irlf+1.836
+1.795
Neg logit contributions
ucl-2.745
ACA-2.478
UL-2.380
UC-2.329
AMA-2.269
Token AND
Token ID5357
Feature activation+3.10e-01

Pos logit contributions
ucl+1.789
ACA+1.631
UL+1.586
UC+1.549
AMA+1.506
Neg logit contributions
 Spo-1.259
furt-1.230
 reprodu-1.139
irlf-1.132
ptoms-1.098
Token AN
Token ID3537
Feature activation-1.53e-01

Pos logit contributions
ucl+0.986
ACA+0.900
UL+0.874
UC+0.855
AMA+0.833
Neg logit contributions
 Spo-0.676
furt-0.663
 reprodu-0.612
irlf-0.608
ptoms-0.590
TokenP
Token ID47
Feature activation+2.03e+00

Pos logit contributions
 Spo+0.458
furt+0.445
 reprodu+0.421
irlf+0.414
+0.410
Neg logit contributions
ucl-0.369
ACA-0.320
UL-0.305
UC-0.295
AMA-0.284
Token FAC
Token ID44216
Feature activation+5.30e-01

Pos logit contributions
ucl+4.894
ACA+4.461
UL+4.369
UC+4.160
AMA+3.951
Neg logit contributions
 Spo-5.808
furt-5.675
irlf-5.382
 reprodu-5.306
-5.201
TokenIL
Token ID4146
Feature activation+9.28e-01

Pos logit contributions
ucl+0.960
ACA+0.810
UL+0.778
UC+0.735
AMA+0.689
Neg logit contributions
 Spo-1.898
furt-1.861
 reprodu-1.781
irlf-1.763
-1.724
TokenITIES
Token ID30383
Feature activation+1.24e+00

Pos logit contributions
ucl+2.148
ACA+1.889
UL+1.835
UC+1.753
AMA+1.677
Neg logit contributions
 Spo-2.858
furt-2.801
 reprodu-2.638
irlf-2.633
-2.555
Token ARE
Token ID15986
Feature activation+7.58e-01

Pos logit contributions
ucl+3.851
ACA+3.514
UL+3.465
UC+3.376
AMA+3.276
Neg logit contributions
furt-2.675
 Spo-2.659
irlf-2.457
 reprodu-2.428
ptoms-2.381
Token TO
Token ID5390
Feature activation+1.42e+00

Pos logit contributions
ucl+2.367
ACA+2.159
UL+2.105
UC+2.061
AMA+1.999
Neg logit contributions
 Spo-1.674
furt-1.659
irlf-1.529
 reprodu-1.521
ptoms-1.489
Token BE
Token ID9348
Feature activation+7.52e-01

Pos logit contributions
ucl+3.965
ACA+3.596
UL+3.501
UC+3.398
AMA+3.280
Neg logit contributions
 Spo-3.604
furt-3.542
 reprodu-3.343
irlf-3.309
ptoms-3.219
Token SE
Token ID7946
Feature activation-2.02e+00

Pos logit contributions
ucl+2.389
ACA+2.190
UL+2.136
UC+2.086
AMA+2.038
Neg logit contributions
 Spo-1.626
furt-1.593
 reprodu-1.480
irlf-1.478
ptoms-1.450
TokenARCH
Token ID31315
Feature activation-9.79e-01

Pos logit contributions
 Spo+6.478
furt+6.271
 reprodu+5.847
+5.749
irlf+5.677
Neg logit contributions
ucl-4.240
ACA-3.490
UL-3.208
elaide-3.193
milo-3.114
TokenV
Token ID53
Feature activation+1.38e+00

Pos logit contributions
ucl+2.642
ACA+2.380
UL+2.295
UC+2.246
AMA+2.165
Neg logit contributions
 Spo-2.410
furt-2.308
 reprodu-2.193
irlf-2.161
ptoms-2.082
Tokenern
Token ID1142
Feature activation-7.32e-01

Pos logit contributions
ucl+3.467
ACA+3.074
UL+2.954
UC+2.865
AMA+2.732
Neg logit contributions
 Spo-3.978
furt-3.866
 reprodu-3.657
irlf-3.576
-3.558
Tokenacular
Token ID12754
Feature activation-1.27e+00

Pos logit contributions
 Spo+1.638
furt+1.576
 reprodu+1.458
irlf+1.425
+1.398
Neg logit contributions
ucl-2.316
ACA-2.088
UL-2.013
UC-1.974
AMA-1.914
Token architecture
Token ID10959
Feature activation-5.56e-01

Pos logit contributions
 Spo+2.942
furt+2.795
 reprodu+2.646
irlf+2.547
+2.524
Neg logit contributions
ucl-3.885
ACA-3.504
UL-3.359
UC-3.277
AMA-3.198
Token is
Token ID318
Feature activation+5.85e-01

Pos logit contributions
 Spo+1.152
furt+1.098
 reprodu+1.027
irlf+0.988
+0.976
Neg logit contributions
ucl-1.850
ACA-1.678
UL-1.620
UC-1.586
AMA-1.551
Token a
Token ID257
Feature activation-1.37e-01

Pos logit contributions
ucl+1.972
ACA+1.783
UL+1.737
UC+1.700
AMA+1.658
Neg logit contributions
 Spo-1.162
furt-1.131
irlf-1.035
ptoms-1.004
 reprodu-1.001
Token style
Token ID3918
Feature activation+2.87e-01

Pos logit contributions
 Spo+0.326
furt+0.316
 reprodu+0.294
irlf+0.289
+0.284
Neg logit contributions
ucl-0.412
ACA-0.369
UL-0.356
UC-0.348
AMA-0.338
Token linked
Token ID6692
Feature activation+4.61e-01

Pos logit contributions
ucl+0.912
ACA+0.819
UL+0.796
UC+0.778
AMA+0.755
Neg logit contributions
 Spo-0.638
furt-0.617
 reprodu-0.567
irlf-0.567
-0.552
Token to
Token ID284
Feature activation-9.86e-02

Pos logit contributions
ucl+1.611
ACA+1.464
UL+1.421
UC+1.396
AMA+1.357
Neg logit contributions
 Spo-0.878
furt-0.845
 reprodu-0.763
irlf-0.758
-0.732
Token a
Token ID257
Feature activation-1.20e-01

Pos logit contributions
 Spo+0.220
furt+0.213
 reprodu+0.196
irlf+0.194
+0.190
Neg logit contributions
ucl-0.313
ACA-0.281
UL-0.272
UC-0.266
AMA-0.259
Token particular
Token ID1948
Feature activation+3.66e-01

Pos logit contributions
 Spo+0.278
furt+0.269
 reprodu+0.249
irlf+0.246
+0.241
Neg logit contributions
ucl-0.372
ACA-0.334
UL-0.323
UC-0.316
AMA-0.306
Tokenic
Token ID291
Feature activation+2.79e-02

Pos logit contributions
ucl+1.773
ACA+1.592
UL+1.534
UC+1.509
AMA+1.467
Neg logit contributions
 Spo-1.186
furt-1.142
 reprodu-1.062
irlf-1.049
-1.024
Token]
Token ID60
Feature activation-4.40e-01

Pos logit contributions
ucl+0.075
ACA+0.067
UL+0.064
UC+0.062
AMA+0.061
Neg logit contributions
 Spo-0.076
furt-0.073
 reprodu-0.069
irlf-0.068
-0.067
Token trainers
Token ID28514
Feature activation+7.72e-01

Pos logit contributions
 Spo+1.242
furt+1.200
 reprodu+1.134
irlf+1.115
+1.094
Neg logit contributions
ucl-1.134
ACA-1.005
UL-0.958
UC-0.935
AMA-0.904
Token and
Token ID290
Feature activation+9.29e-01

Pos logit contributions
ucl+2.263
ACA+2.023
UL+1.960
UC+1.913
AMA+1.848
Neg logit contributions
 Spo-1.865
furt-1.827
 reprodu-1.695
irlf-1.680
-1.652
Token guys
Token ID3730
Feature activation+3.50e-01

Pos logit contributions
ucl+2.767
ACA+2.466
UL+2.385
UC+2.335
AMA+2.282
Neg logit contributions
 Spo-2.149
furt-2.148
 reprodu-1.963
irlf-1.949
-1.942
Token on
Token ID319
Feature activation-3.20e-02

Pos logit contributions
ucl+1.082
ACA+0.973
UL+0.942
UC+0.921
AMA+0.895
Neg logit contributions
 Spo-0.794
furt-0.774
 reprodu-0.713
irlf-0.709
-0.696
Token the
Token ID262
Feature activation+2.11e-01

Pos logit contributions
 Spo+0.068
furt+0.066
 reprodu+0.061
irlf+0.060
+0.059
Neg logit contributions
ucl-0.104
ACA-0.094
UL-0.091
UC-0.089
AMA-0.087
Token team
Token ID1074
Feature activation+6.28e-01

Pos logit contributions
ucl+0.587
ACA+0.521
UL+0.501
UC+0.489
AMA+0.473
Neg logit contributions
 Spo-0.549
furt-0.539
 reprodu-0.501
irlf-0.495
-0.489
Token,
Token ID11
Feature activation+2.12e-01

Pos logit contributions
ucl+1.827
ACA+1.637
UL+1.577
UC+1.544
AMA+1.496
Neg logit contributions
 Spo-1.528
furt-1.498
 reprodu-1.381
irlf-1.380
-1.361
Token so
Token ID523
Feature activation-8.09e-01

Pos logit contributions
ucl+0.680
ACA+0.613
UL+0.593
UC+0.580
AMA+0.565
Neg logit contributions
 Spo-0.460
furt-0.451
 reprodu-0.415
irlf-0.409
-0.403
Token it
Token ID340
Feature activation+3.87e-01

Pos logit contributions
 Spo+1.609
furt+1.537
 reprodu+1.415
irlf+1.372
+1.341
Neg logit contributions
ucl-2.766
ACA-2.518
UL-2.435
UC-2.389
AMA-2.328
Token to
Token ID284
Feature activation-9.72e-03

Pos logit contributions
ucl+0.055
ACA+0.050
UL+0.048
UC+0.047
AMA+0.046
Neg logit contributions
 Spo-0.032
furt-0.030
 reprodu-0.028
irlf-0.027
-0.027
Token say
Token ID910
Feature activation+7.86e-02

Pos logit contributions
 Spo+0.029
furt+0.028
 reprodu+0.027
irlf+0.026
+0.026
Neg logit contributions
ucl-0.023
ACA-0.020
UL-0.019
UC-0.019
AMA-0.018
Token,
Token ID11
Feature activation+8.26e-01

Pos logit contributions
ucl+0.253
ACA+0.229
UL+0.221
UC+0.216
AMA+0.211
Neg logit contributions
 Spo-0.171
furt-0.166
 reprodu-0.153
irlf-0.150
-0.147
Token Red
Token ID2297
Feature activation+1.34e+00

Pos logit contributions
ucl+2.654
ACA+2.395
UL+2.341
UC+2.287
AMA+2.227
Neg logit contributions
furt-1.737
 Spo-1.734
 reprodu-1.557
ptoms-1.542
irlf-1.539
Token Son
Token ID6295
Feature activation-9.48e-01

Pos logit contributions
ucl+3.737
ACA+3.313
UL+3.209
UC+3.137
AMA+3.068
Neg logit contributions
 Spo-3.241
furt-3.230
 reprodu-3.017
-2.979
ptoms-2.953
Tokenja
Token ID6592
Feature activation-1.21e+00

Pos logit contributions
 Spo+1.751
furt+1.677
 reprodu+1.521
irlf+1.489
+1.444
Neg logit contributions
ucl-3.361
ACA-3.074
UL-2.988
UC-2.926
AMA-2.844
Token is
Token ID318
Feature activation+1.09e-01

Pos logit contributions
 Spo+2.289
furt+2.148
 reprodu+2.013
irlf+1.933
+1.897
Neg logit contributions
ucl-4.250
ACA-3.868
UL-3.735
UC-3.672
AMA-3.577
Token the
Token ID262
Feature activation+1.22e-01

Pos logit contributions
ucl+0.368
ACA+0.334
UL+0.324
UC+0.317
AMA+0.310
Neg logit contributions
 Spo-0.220
furt-0.213
 reprodu-0.194
irlf-0.190
ptoms-0.187
Token queen
Token ID16599
Feature activation-4.67e-02

Pos logit contributions
ucl+0.372
ACA+0.334
UL+0.322
UC+0.316
AMA+0.306
Neg logit contributions
 Spo-0.284
furt-0.279
 reprodu-0.256
irlf-0.251
-0.248
Token of
Token ID286
Feature activation+2.82e-01

Pos logit contributions
 Spo+0.104
furt+0.100
 reprodu+0.093
irlf+0.091
+0.089
Neg logit contributions
ucl-0.149
ACA-0.134
UL-0.130
UC-0.127
AMA-0.123
Token all
Token ID477
Feature activation+1.42e+00

Pos logit contributions
ucl+0.891
ACA+0.804
UL+0.778
UC+0.759
AMA+0.740
Neg logit contributions
 Spo-0.631
furt-0.614
 reprodu-0.564
irlf-0.554
-0.545
Token write
Token ID3551
Feature activation-8.01e-01

Pos logit contributions
 Spo+2.531
furt+2.448
 reprodu+2.320
irlf+2.254
+2.231
Neg logit contributions
ucl-3.003
ACA-2.689
UL-2.579
UC-2.509
AMA-2.436
Token.
Token ID13
Feature activation+4.47e-01

Pos logit contributions
 Spo+1.849
furt+1.786
 reprodu+1.680
irlf+1.631
+1.606
Neg logit contributions
ucl-2.466
ACA-2.218
UL-2.127
UC-2.088
AMA-2.018
Token
Token ID198
Feature activation+2.47e-01

Pos logit contributions
ucl+1.381
ACA+1.248
UL+1.209
UC+1.183
AMA+1.148
Neg logit contributions
 Spo-1.014
furt-1.001
 reprodu-0.930
irlf-0.917
ptoms-0.887
Token
Token ID198
Feature activation+3.05e-01

Pos logit contributions
ucl+0.735
ACA+0.662
UL+0.643
UC+0.630
AMA+0.610
Neg logit contributions
 Spo-0.598
furt-0.576
irlf-0.553
 reprodu-0.544
ptoms-0.523
Token
Token ID447
Feature activation+5.28e-01

Pos logit contributions
ucl+0.876
ACA+0.786
UL+0.758
UC+0.741
AMA+0.715
Neg logit contributions
 Spo-0.775
furt-0.747
 reprodu-0.704
irlf-0.692
-0.671
Token
Token ID250
Feature activation-4.87e-02

Pos logit contributions
ucl+1.642
ACA+1.479
UL+1.426
UC+1.400
AMA+1.360
Neg logit contributions
 Spo-1.202
furt-1.159
 reprodu-1.080
irlf-1.059
-1.027
TokenThe
Token ID464
Feature activation-1.97e-01

Pos logit contributions
 Spo+0.122
furt+0.118
 reprodu+0.110
irlf+0.108
+0.106
Neg logit contributions
ucl-0.142
ACA-0.127
UL-0.122
UC-0.119
AMA-0.115
Token horizontal
Token ID16021
Feature activation-1.15e+00

Pos logit contributions
 Spo+0.474
furt+0.460
 reprodu+0.426
irlf+0.420
+0.413
Neg logit contributions
ucl-0.590
ACA-0.528
UL-0.509
UC-0.498
AMA-0.482
Token and
Token ID290
Feature activation+3.32e-01

Pos logit contributions
 Spo+2.796
furt+2.688
 reprodu+2.565
irlf+2.466
+2.436
Neg logit contributions
ucl-3.365
ACA-3.036
UL-2.916
UC-2.845
AMA-2.770
Token vertical
Token ID11723
Feature activation-1.39e-01

Pos logit contributions
ucl+1.078
ACA+0.977
UL+0.941
UC+0.922
AMA+0.893
Neg logit contributions
 Spo-0.712
furt-0.692
irlf-0.621
 reprodu-0.617
ptoms-0.603
Token motion
Token ID6268
Feature activation+4.08e-01

Pos logit contributions
 Spo+0.323
furt+0.313
 reprodu+0.287
irlf+0.286
+0.279
Neg logit contributions
ucl-0.430
ACA-0.386
UL-0.373
UC-0.364
AMA-0.354
Token experiment
Token ID6306
Feature activation-6.04e-01

Pos logit contributions
ucl+1.380
ACA+1.222
UL+1.187
UC+1.159
AMA+1.116
Neg logit contributions
 Spo-1.130
furt-1.112
irlf-1.036
 reprodu-1.022
-1.015
Token of
Token ID286
Feature activation-5.49e-02

Pos logit contributions
 Spo+1.333
furt+1.289
 reprodu+1.188
irlf+1.170
+1.146
Neg logit contributions
ucl-1.927
ACA-1.740
UL-1.683
UC-1.647
AMA-1.602
Token the
Token ID262
Feature activation+3.93e-01

Pos logit contributions
 Spo+0.118
furt+0.115
irlf+0.105
 reprodu+0.104
ptoms+0.103
Neg logit contributions
ucl-0.177
ACA-0.160
UL-0.155
UC-0.152
AMA-0.147
Token Arab
Token ID4498
Feature activation-9.86e-03

Pos logit contributions
ucl+1.182
ACA+1.055
UL+1.024
UC+1.003
AMA+0.965
Neg logit contributions
 Spo-0.914
furt-0.904
irlf-0.828
 reprodu-0.825
ptoms-0.816
Token Spring
Token ID8225
Feature activation-4.37e-01

Pos logit contributions
 Spo+0.023
furt+0.022
 reprodu+0.020
irlf+0.020
+0.020
Neg logit contributions
ucl-0.030
ACA-0.027
UL-0.026
UC-0.026
AMA-0.025
Token,
Token ID11
Feature activation-1.16e-02

Pos logit contributions
 Spo+1.021
furt+0.985
 reprodu+0.919
irlf+0.898
+0.883
Neg logit contributions
ucl-1.341
ACA-1.204
UL-1.160
UC-1.135
AMA-1.103
Token which
Token ID543
Feature activation-1.63e-01

Pos logit contributions
 Spo+0.024
furt+0.023
irlf+0.021
 reprodu+0.021
+0.021
Neg logit contributions
ucl-0.038
ACA-0.035
UL-0.034
UC-0.033
AMA-0.032
Token has
Token ID468
Feature activation-5.53e-01

Pos logit contributions
 Spo+0.323
furt+0.314
irlf+0.283
 reprodu+0.280
+0.276
Neg logit contributions
ucl-0.555
ACA-0.506
UL-0.490
UC-0.481
AMA-0.466
Token affected
Token ID5676
Feature activation+4.03e-01

Pos logit contributions
 Spo+1.111
furt+1.074
 reprodu+0.974
irlf+0.966
+0.944
Neg logit contributions
ucl-1.871
ACA-1.701
UL-1.648
UC-1.615
AMA-1.573
Token the
Token ID262
Feature activation-4.96e-01

Pos logit contributions
ucl+1.330
ACA+1.202
UL+1.167
UC+1.143
AMA+1.113
Neg logit contributions
 Spo-0.829
furt-0.813
irlf-0.736
 reprodu-0.728
-0.721
Token region
Token ID3814
Feature activation+6.82e-01

Pos logit contributions
 Spo+1.158
furt+1.116
 reprodu+1.042
irlf+1.017
+1.000
Neg logit contributions
ucl-1.522
ACA-1.370
UL-1.319
UC-1.290
AMA-1.255
Token|
Token ID91
Feature activation-6.03e-01

Pos logit contributions
 Spo+1.121
furt+1.083
 reprodu+1.003
irlf+0.986
+0.966
Neg logit contributions
ucl-1.554
ACA-1.400
UL-1.353
UC-1.323
AMA-1.286
Token,
Token ID11
Feature activation-3.26e-01

Pos logit contributions
 Spo+1.297
furt+1.250
 reprodu+1.154
irlf+1.132
+1.109
Neg logit contributions
ucl-1.960
ACA-1.773
UL-1.714
UC-1.678
AMA-1.633
Token |
Token ID930
Feature activation-1.01e+00

Pos logit contributions
 Spo+0.694
furt+0.668
 reprodu+0.608
irlf+0.605
+0.591
Neg logit contributions
ucl-1.071
ACA-0.971
UL-0.941
UC-0.919
AMA-0.894
Token:
Token ID25
Feature activation+7.09e-02

Pos logit contributions
 Spo+2.320
furt+2.255
 reprodu+2.099
irlf+2.045
+2.018
Neg logit contributions
ucl-3.111
ACA-2.798
UL-2.685
UC-2.627
AMA-2.562
Tokenmk
Token ID28015
Feature activation-1.78e-01

Pos logit contributions
ucl+0.189
ACA+0.167
UL+0.160
UC+0.156
AMA+0.150
Neg logit contributions
 Spo-0.195
furt-0.189
 reprodu-0.178
irlf-0.175
-0.172
Tokenview
Token ID1177
Feature activation-1.78e+00

Pos logit contributions
 Spo+0.411
furt+0.396
 reprodu+0.368
irlf+0.361
+0.355
Neg logit contributions
ucl-0.552
ACA-0.497
UL-0.479
UC-0.469
AMA-0.455
Token|
Token ID91
Feature activation+2.57e-01

Pos logit contributions
 Spo+4.055
furt+3.986
 reprodu+3.678
irlf+3.609
+3.528
Neg logit contributions
ucl-5.398
ACA-4.859
UL-4.671
UC-4.577
AMA-4.470
Token and
Token ID290
Feature activation-5.40e-01

Pos logit contributions
ucl+0.762
ACA+0.683
UL+0.664
UC+0.647
AMA+0.623
Neg logit contributions
 Spo-0.626
furt-0.603
irlf-0.556
 reprodu-0.554
ptoms-0.542
Token |
Token ID930
Feature activation-1.57e+00

Pos logit contributions
 Spo+1.071
furt+1.032
 reprodu+0.931
irlf+0.929
+0.906
Neg logit contributions
ucl-1.847
ACA-1.686
UL-1.631
UC-1.596
AMA-1.556
Token:
Token ID25
Feature activation-7.31e-02

Pos logit contributions
 Spo+3.387
furt+3.352
 reprodu+3.075
irlf+2.982
+2.969
Neg logit contributions
ucl-4.963
ACA-4.468
UL-4.276
UC-4.179
AMA-4.100
Tokenm
Token ID76
Feature activation+1.06e+00

Pos logit contributions
 Spo+0.212
furt+0.206
 reprodu+0.195
irlf+0.192
+0.189
Neg logit contributions
ucl-0.183
ACA-0.160
UL-0.153
UC-0.149
AMA-0.143
Token/
Token ID14
Feature activation-2.96e-02

Pos logit contributions
 Spo+0.187
furt+0.181
 reprodu+0.168
irlf+0.165
+0.162
Neg logit contributions
ucl-0.249
ACA-0.224
UL-0.216
UC-0.211
AMA-0.205
Token8
Token ID23
Feature activation-6.87e-01

Pos logit contributions
 Spo+0.073
furt+0.070
 reprodu+0.066
irlf+0.064
+0.063
Neg logit contributions
ucl-0.088
ACA-0.079
UL-0.076
UC-0.074
AMA-0.072
Token11
Token ID1157
Feature activation-3.27e-01

Pos logit contributions
 Spo+1.664
furt+1.613
 reprodu+1.503
irlf+1.476
+1.452
Neg logit contributions
ucl-2.038
ACA-1.827
UL-1.757
UC-1.716
AMA-1.668
Token700
Token ID9879
Feature activation-7.05e-01

Pos logit contributions
 Spo+0.783
furt+0.757
 reprodu+0.705
irlf+0.693
+0.680
Neg logit contributions
ucl-0.985
ACA-0.884
UL-0.852
UC-0.833
AMA-0.809
Token.
Token ID13
Feature activation-8.52e-01

Pos logit contributions
 Spo+1.763
furt+1.710
 reprodu+1.597
irlf+1.563
+1.548
Neg logit contributions
ucl-2.033
ACA-1.813
UL-1.740
UC-1.701
AMA-1.651
Tokenpdf
Token ID12315
Feature activation-1.80e+00

Pos logit contributions
 Spo+2.141
furt+2.066
 reprodu+1.938
irlf+1.892
+1.870
Neg logit contributions
ucl-2.437
ACA-2.180
UL-2.084
UC-2.036
AMA-1.971
Token.
Token ID13
Feature activation-1.18e+00

Pos logit contributions
 Spo+3.945
furt+3.750
 reprodu+3.479
+3.404
irlf+3.352
Neg logit contributions
ucl-5.531
ACA-5.005
UL-4.763
UC-4.654
AMA-4.567
Token Z
Token ID1168
Feature activation-7.54e-02

Pos logit contributions
 Spo+2.636
furt+2.506
 reprodu+2.337
irlf+2.271
+2.240
Neg logit contributions
ucl-3.711
ACA-3.350
UL-3.216
UC-3.148
AMA-3.062
Tokenal
Token ID282
Feature activation+5.42e-01

Pos logit contributions
 Spo+0.205
furt+0.199
 reprodu+0.187
irlf+0.184
+0.181
Neg logit contributions
ucl-0.203
ACA-0.179
UL-0.172
UC-0.168
AMA-0.162
Tokenosh
Token ID3768
Feature activation+5.05e-01

Pos logit contributions
ucl+1.637
ACA+1.472
UL+1.418
UC+1.385
AMA+1.353
Neg logit contributions
 Spo-1.286
furt-1.233
 reprodu-1.158
irlf-1.142
-1.123
Tokenn
Token ID77
Feature activation-1.36e+00

Pos logit contributions
ucl+1.575
ACA+1.415
UL+1.366
UC+1.337
AMA+1.307
Neg logit contributions
 Spo-1.146
furt-1.112
 reprodu-1.034
irlf-1.023
-0.995
Tokenive
Token ID425
Feature activation-3.29e-02

Pos logit contributions
 Spo+2.313
furt+2.234
 reprodu+2.072
irlf+2.016
+1.994
Neg logit contributions
ucl-3.044
ACA-2.748
UL-2.648
UC-2.579
AMA-2.503
Token of
Token ID286
Feature activation+4.42e-01

Pos logit contributions
 Spo+0.072
furt+0.070
 reprodu+0.064
irlf+0.064
+0.062
Neg logit contributions
ucl-0.106
ACA-0.095
UL-0.092
UC-0.090
AMA-0.088
Token Peoples
Token ID41770
Feature activation-6.80e-01

Pos logit contributions
ucl+1.453
ACA+1.309
UL+1.271
UC+1.245
AMA+1.222
Neg logit contributions
 Spo-0.916
furt-0.897
 reprodu-0.822
irlf-0.808
-0.796
Token'
Token ID6
Feature activation-1.07e+00

Pos logit contributions
 Spo+1.799
furt+1.732
 reprodu+1.634
irlf+1.596
+1.577
Neg logit contributions
ucl-1.874
ACA-1.671
UL-1.598
UC-1.557
AMA-1.506
Token Comm
Token ID1520
Feature activation+4.28e-01

Pos logit contributions
 Spo+2.288
furt+2.172
 reprodu+2.018
irlf+1.957
+1.923
Neg logit contributions
ucl-3.487
ACA-3.168
UL-3.045
UC-2.978
AMA-2.893
Tokeniss
Token ID747
Feature activation-2.04e+00

Pos logit contributions
ucl+1.208
ACA+1.070
UL+1.032
UC+1.007
AMA+0.972
Neg logit contributions
 Spo-1.107
furt-1.076
 reprodu-1.008
irlf-0.989
-0.976
Tokenariat
Token ID21621
Feature activation-7.42e-01

Pos logit contributions
 Spo+5.410
furt+5.292
 reprodu+4.869
irlf+4.797
ipal+4.785
Neg logit contributions
ucl-5.408
ACA-4.879
UL-4.628
UC-4.491
AMA-4.356
Token of
Token ID286
Feature activation-2.33e-01

Pos logit contributions
 Spo+1.622
furt+1.561
 reprodu+1.451
irlf+1.405
+1.387
Neg logit contributions
ucl-2.383
ACA-2.164
UL-2.080
UC-2.038
AMA-1.982
Token Defence
Token ID16721
Feature activation+5.18e-01

Pos logit contributions
 Spo+0.468
furt+0.451
 reprodu+0.414
irlf+0.405
+0.397
Neg logit contributions
ucl-0.787
ACA-0.715
UL-0.692
UC-0.679
AMA-0.662
Token No
Token ID1400
Feature activation-2.10e-02

Pos logit contributions
ucl+1.586
ACA+1.418
UL+1.374
UC+1.350
AMA+1.307
Neg logit contributions
 Spo-1.198
furt-1.171
irlf-1.081
 reprodu-1.077
ptoms-1.047
Token.
Token ID13
Feature activation-5.62e-02

Pos logit contributions
 Spo+0.051
furt+0.049
 reprodu+0.046
irlf+0.045
+0.044
Neg logit contributions
ucl-0.063
ACA-0.056
UL-0.054
UC-0.053
AMA-0.052
Token(
Token ID7
Feature activation+7.18e-01

Pos logit contributions
 Spo+1.946
furt+1.875
 reprodu+1.744
irlf+1.711
+1.697
Neg logit contributions
ucl-2.620
ACA-2.362
UL-2.261
UC-2.213
AMA-2.161
Token 0
Token ID657
Feature activation+1.39e+00

Pos logit contributions
ucl+2.308
ACA+2.065
UL+2.022
UC+1.970
AMA+1.898
Neg logit contributions
 Spo-1.565
furt-1.522
 reprodu-1.391
irlf-1.376
-1.349
Token.
Token ID13
Feature activation+3.55e-01

Pos logit contributions
ucl+4.355
ACA+3.912
UL+3.856
UC+3.748
AMA+3.607
Neg logit contributions
 Spo-3.101
furt-2.991
 reprodu-2.792
irlf-2.733
-2.663
Token0
Token ID15
Feature activation+5.76e-01

Pos logit contributions
ucl+0.927
ACA+0.821
UL+0.790
UC+0.769
AMA+0.738
Neg logit contributions
 Spo-0.988
furt-0.963
 reprodu-0.910
irlf-0.901
ptoms-0.880
Tokenf
Token ID69
Feature activation-9.03e-01

Pos logit contributions
ucl+1.779
ACA+1.601
UL+1.560
UC+1.520
AMA+1.472
Neg logit contributions
 Spo-1.326
furt-1.286
 reprodu-1.196
irlf-1.177
-1.143
Token )
Token ID1267
Feature activation-1.79e+00

Pos logit contributions
 Spo+1.909
furt+1.817
 reprodu+1.685
irlf+1.652
+1.636
Neg logit contributions
ucl-2.945
ACA-2.670
UL-2.534
UC-2.490
AMA-2.442
Token ;
Token ID2162
Feature activation-1.10e+00

Pos logit contributions
 Spo+2.813
furt+2.587
 reprodu+2.405
ptoms+2.298
irlf+2.287
Neg logit contributions
ucl-6.642
ACA-6.185
UL-5.845
UC-5.792
AMA-5.743
Token //
Token ID3373
Feature activation+1.35e-01

Pos logit contributions
 Spo+2.635
furt+2.523
 reprodu+2.382
irlf+2.326
+2.286
Neg logit contributions
ucl-3.271
ACA-2.956
UL-2.821
UC-2.771
AMA-2.702
Token The
Token ID383
Feature activation-1.72e-01

Pos logit contributions
ucl+0.436
ACA+0.393
UL+0.381
UC+0.372
AMA+0.362
Neg logit contributions
 Spo-0.296
furt-0.287
 reprodu-0.262
irlf-0.259
-0.254
Token grid
Token ID10706
Feature activation-2.99e-01

Pos logit contributions
 Spo+0.396
furt+0.385
 reprodu+0.354
irlf+0.350
+0.344
Neg logit contributions
ucl-0.532
ACA-0.476
UL-0.461
UC-0.450
AMA-0.436
Token resulting
Token ID7186
Feature activation-1.00e+00

Pos logit contributions
 Spo+0.608
furt+0.585
 reprodu+0.534
irlf+0.526
+0.514
Neg logit contributions
ucl-1.009
ACA-0.913
UL-0.886
UC-0.868
AMA-0.844
Tokenue
Token ID518
Feature activation-3.66e-01

Pos logit contributions
ucl+0.089
ACA+0.079
UL+0.075
UC+0.073
AMA+0.071
Neg logit contributions
 Spo-0.094
furt-0.091
 reprodu-0.086
irlf-0.085
-0.083
Token,
Token ID11
Feature activation-2.39e-01

Pos logit contributions
 Spo+0.862
furt+0.834
 reprodu+0.775
irlf+0.763
+0.748
Neg logit contributions
ucl-1.112
ACA-0.999
UL-0.964
UC-0.943
AMA-0.915
Token along
Token ID1863
Feature activation-6.05e-01

Pos logit contributions
 Spo+0.535
furt+0.523
 reprodu+0.478
irlf+0.474
+0.466
Neg logit contributions
ucl-0.751
ACA-0.677
UL-0.655
UC-0.643
AMA-0.624
Token with
Token ID351
Feature activation-2.71e-01

Pos logit contributions
 Spo+1.262
furt+1.215
 reprodu+1.122
irlf+1.094
+1.075
Neg logit contributions
ucl-2.004
ACA-1.815
UL-1.757
UC-1.719
AMA-1.677
Token his
Token ID465
Feature activation-4.26e-01

Pos logit contributions
 Spo+0.530
furt+0.510
 reprodu+0.465
irlf+0.458
+0.447
Neg logit contributions
ucl-0.931
ACA-0.846
UL-0.820
UC-0.805
AMA-0.785
Token prof
Token ID1534
Feature activation-2.11e+00

Pos logit contributions
 Spo+1.036
furt+1.003
 reprodu+0.936
irlf+0.918
+0.903
Neg logit contributions
ucl-1.267
ACA-1.135
UL-1.092
UC-1.068
AMA-1.036
Tokenuse
Token ID1904
Feature activation+5.42e-01

Pos logit contributions
 Spo+5.815
furt+5.752
 reprodu+5.327
irlf+5.312
ipal+5.193
Neg logit contributions
ucl-5.295
ACA-4.771
UL-4.521
UC-4.376
AMA-4.185
Token apologies
Token ID32920
Feature activation+3.02e-01

Pos logit contributions
ucl+1.443
ACA+1.260
UL+1.221
UC+1.183
AMA+1.148
Neg logit contributions
 Spo-1.470
furt-1.434
 reprodu-1.342
irlf-1.338
-1.317
Token to
Token ID284
Feature activation+1.03e-01

Pos logit contributions
ucl+0.941
ACA+0.844
UL+0.820
UC+0.803
AMA+0.780
Neg logit contributions
 Spo-0.686
furt-0.666
 reprodu-0.614
irlf-0.608
-0.591
Token the
Token ID262
Feature activation+2.57e-02

Pos logit contributions
ucl+0.321
ACA+0.289
UL+0.279
UC+0.273
AMA+0.265
Neg logit contributions
 Spo-0.235
furt-0.229
 reprodu-0.211
irlf-0.207
-0.204
Token Motion
Token ID20843
Feature activation+2.00e-01

Pos logit contributions
ucl+0.076
ACA+0.068
UL+0.065
UC+0.064
AMA+0.062
Neg logit contributions
 Spo-0.063
furt-0.062
 reprodu-0.057
irlf-0.056
-0.056