TOP ACTIVATIONS
MAX = 1.459

unk rock - favorite favorite favorite favorite - December 13 ,
ler oy - favorite favorite favorite favorite - January 4 ,
EW AN - favorite favorite favorite favorite favorite - September 26
imp unk rock - favorite favorite favorite favorite - December 13
rock - favorite favorite favorite favorite - December 13 , 2010
r Pix - favorite favorite favorite favorite favorite - January 15
oy - favorite favorite favorite favorite - January 4 , 2010
iz r Pix - favorite favorite favorite favorite favorite - January
wt 0001 - favorite favorite favorite favorite - July 6 ,
ur ker - favorite favorite favorite favorite favorite - February 9
AN - favorite favorite favorite favorite favorite - September 26 ,
M anners - favorite favorite favorite favorite favorite - December 5
_ ler oy - favorite favorite favorite favorite - January 4
anners - favorite favorite favorite favorite favorite - December 5 ,
ATCH EW AN - favorite favorite favorite favorite favorite - September
j wt 0001 - favorite favorite favorite favorite - July 6
B iz r Pix - favorite favorite favorite favorite favorite -
L ur ker - favorite favorite favorite favorite favorite - February
Pix - favorite favorite favorite favorite favorite - January 15 ,
ker - favorite favorite favorite favorite favorite - February 9 ,

MIN ACTIVATIONS
MIN = -1.262

Women in politics . Hide Caption 7 of 15 Photos :
, check out our latest issue Travel With The
Target = & ' a T ; ... }
Pasadena , California . Hide Caption 12 of 15 Photos :
7 { ( 17 s )} 27 . R xd 8
20 , 2015 . Hide Caption 8 of 15 Photos :
Le a Suzuki , The Chronicle 7 of 15 Photo :
s invoke local - f test - p data . json
H L D att u and C K Pr as ad
Council House National Historic Site .[ 57 ] School
" P ulse amount " , Range ( 0 , 0
Nation is reader supported : Chip in $ 10 or more
212 x . x . x . 108 10 46 358
- excerpt from The Bh ag av ad G ita
= & ' a T ; ... } Unlike
click below to become an In Daily supporter . Powered by
7 { ( 150 s )} 37 . f xe 6
A bench of justices H L D att u
app data _ full v ) { // Time component const
Path ) image = im utils . resize ( image ,

INTERVAL 0.779 - 1.459
CONTAINS 0.039%

Rated 5 out of 5 by Quake D ude
, the ice sheet contains enough water to raise global sea
unk rock - favorite favorite favorite favorite - December 13 ,
rock - favorite favorite favorite favorite - December 13 , 2010
AN - favorite favorite favorite favorite favorite - September 26 ,

INTERVAL 0.098 - 0.779
CONTAINS 29.033%

, San Francisco , and Seattle , she followed public stair
. Mil burn Stand representative must hold a valid season ticket
. So there can be little doubt that , had Ign
for Tuesday — including a stop at the University of Iowa
State has a beef with Trump it can ’ t

INTERVAL -0.582 - 0.098
CONTAINS 70.661%

standing in the security line : a tall , pale man
. The adrenaline was really pumping at this moment , and
olla has connected development of new products to their strategy since
as one cohesive public arena , the intersection of Y onge
10 ) ( 2 ) 23 . Peter M

INTERVAL -1.262 - -0.582
CONTAINS 0.267%

full v ) { // Time component const float TA U
much hurry ." <|endoftext|> Behind every smart TV show , there
Cut the carrots into thin slices and the broccoli into small
_ m oves : u 16 , // How many moves
and sol es ") is an autos omal dominant ker at
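
The four INTERVAL ranges above appear to be equal-width bins spanning MIN to MAX, and the four CONTAINS percentages appear to partition all sampled tokens. The sketch below checks both readings against the printed numbers. The bin count and the equal-width layout are inferred from the dump rather than stated in it, and the variable names are illustrative only.

```python
# Values copied from the MIN / MAX lines of the dump.
act_min, act_max = -1.262, 1.459

# Assumption: the dashboard splits [MIN, MAX] into four equal-width bins.
n_bins = 4
width = (act_max - act_min) / n_bins
edges = [act_min + i * width for i in range(n_bins + 1)]

# Interval boundaries as printed in the dump, lowest to highest.
printed_edges = [-1.262, -0.582, 0.098, 0.779, 1.459]

# CONTAINS percentages as printed, highest interval first.
shares = [0.039, 29.033, 70.661, 0.267]
total_share = sum(shares)

# Largest gap between a computed edge and its printed (rounded) counterpart.
max_edge_gap = max(abs(e - p) for e, p in zip(edges, printed_edges))

print("computed edges:", [round(e, 4) for e in edges])
print("max gap to printed edges:", max_edge_gap)
print("sum of CONTAINS percentages:", total_share)
```

If the equal-width reading holds, the gap to the printed edges stays within the three-decimal rounding of the dump, and the percentages add up to 100 within floating-point error. Under that reading, about 70.7% of tokens fall in the bin around zero, while the top bin that holds the "favorite" rating rows covers only 0.039%.
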
Tokenunk
Token ID2954
Feature activation-1.88e-01

Pos logit contributions
 eleph+0.555
=\"+0.521
oute+0.511
horn+0.511
aba+0.508
Neg logit contributions
REDACTED-0.631
 Austral-0.533
Austral-0.518
Rus-0.495
SPA-0.489
Tokenrock
Token ID10823
Feature activation-3.57e-01

Pos logit contributions
REDACTED+1.627
 Austral+1.379
Austral+1.340
Rus+1.289
alg+1.288
Neg logit contributions
 eleph-1.415
=\"-1.321
oute-1.279
horn-1.275
aba-1.272
Token -
Token ID532
Feature activation+3.40e-01

Pos logit contributions
REDACTED+3.029
 Austral+2.567
Austral+2.487
Rus+2.402
alg+2.400
Neg logit contributions
 eleph-2.723
=\"-2.540
oute-2.446
horn-2.429
aba-2.428
Token favorite
Token ID4004
Feature activation+1.20e+00

Pos logit contributions
 eleph+1.394
=\"+1.267
horn+1.257
aba+1.223
oute+1.197
Neg logit contributions
REDACTED-4.031
Austral-3.461
 Austral-3.445
SPA-3.387
Rus-3.323
Token favorite
Token ID4004
Feature activation+1.42e+00

Pos logit contributions
=\"+3.519
horn+3.342
oute+3.287
 eleph+3.161
aba+2.903
Neg logit contributions
REDACTED-14.305
Austral-12.753
 Austral-12.428
SPA-12.323
Rus-12.089
Token favorite
Token ID4004
Feature activation+1.46e+00

Pos logit contributions
=\"+4.197
 eleph+4.061
horn+4.012
oute+3.895
aba+3.441
Neg logit contributions
REDACTED-16.241
Austral-14.588
 Austral-14.171
SPA-13.915
Rus-13.707
Token favorite
Token ID4004
Feature activation+1.41e+00

Pos logit contributions
 eleph+5.689
=\"+5.302
horn+5.169
oute+5.025
aba+4.565
Neg logit contributions
REDACTED-15.486
Austral-13.934
 Austral-13.558
SPA-13.050
Rus-12.962
Token -
Token ID532
Feature activation+6.70e-01

Pos logit contributions
 eleph+6.645
=\"+6.219
horn+6.126
oute+5.923
aba+5.519
Neg logit contributions
REDACTED-13.940
Austral-12.476
 Austral-12.171
SPA-11.623
Rus-11.550
Token December
Token ID3426
Feature activation+6.63e-02

Pos logit contributions
 eleph+5.199
=\"+5.148
aba+5.045
horn+5.028
oute+4.886
Neg logit contributions
REDACTED-4.988
Austral-3.987
 Austral-3.911
Rus-3.763
alg-3.755
Token 13
Token ID1511
Feature activation+9.83e-02

Pos logit contributions
 eleph+0.526
=\"+0.508
horn+0.491
oute+0.490
aba+0.489
Neg logit contributions
REDACTED-0.530
 Austral-0.441
Austral-0.429
alg-0.412
Rus-0.411
Token,
Token ID11
Feature activation+4.36e-01

Pos logit contributions
 eleph+0.643
=\"+0.608
horn+0.582
oute+0.582
aba+0.580
Neg logit contributions
REDACTED-0.930
 Austral-0.801
Austral-0.781
alg-0.755
Rus-0.755
Tokenler
Token ID1754
Feature activation+1.65e-01

Pos logit contributions
REDACTED+0.575
 Austral+0.488
Austral+0.471
Rus+0.453
alg+0.451
Neg logit contributions
 eleph-0.521
=\"-0.489
horn-0.476
oute-0.475
aba-0.473
Tokenoy
Token ID726
Feature activation+8.44e-02

Pos logit contributions
 eleph+1.196
=\"+1.138
oute+1.125
horn+1.123
aba+1.111
Neg logit contributions
REDACTED-1.454
 Austral-1.222
Austral-1.187
Rus-1.138
alg-1.135
Token -
Token ID532
Feature activation+5.18e-01

Pos logit contributions
 eleph+0.619
=\"+0.575
horn+0.565
oute+0.565
aba+0.563
Neg logit contributions
REDACTED-0.741
 Austral-0.630
Austral-0.613
Rus-0.587
alg-0.582
Token favorite
Token ID4004
Feature activation+1.27e+00

Pos logit contributions
=\"+2.061
 eleph+2.035
horn+2.035
aba+1.977
oute+1.952
Neg logit contributions
REDACTED-5.933
Austral-5.158
 Austral-5.136
Rus-5.009
SPA-4.980
Token favorite
Token ID4004
Feature activation+1.35e+00

Pos logit contributions
=\"+4.442
horn+4.376
oute+3.978
aba+3.730
 Dover+3.263
Neg logit contributions
REDACTED-14.246
Austral-12.843
 Austral-12.424
Rus-12.422
SPA-12.216
Token favorite
Token ID4004
Feature activation+1.43e+00

Pos logit contributions
=\"+4.568
horn+4.501
oute+4.082
aba+3.784
 Dover+3.335
Neg logit contributions
REDACTED-15.082
Austral-13.646
Rus-13.154
 Austral-13.140
SPA-12.950
Token favorite
Token ID4004
Feature activation+1.39e+00

Pos logit contributions
=\"+4.988
horn+4.921
oute+4.475
aba+4.083
 eleph+3.698
Neg logit contributions
REDACTED-15.740
Austral-14.325
 Austral-13.688
Rus-13.670
SPA-13.488
Token -
Token ID532
Feature activation+5.35e-01

Pos logit contributions
 eleph+5.807
horn+5.396
=\"+5.392
oute+5.014
aba+4.614
Neg logit contributions
REDACTED-14.440
Austral-13.179
 Austral-12.812
Rus-12.333
SPA-12.142
Token January
Token ID3269
Feature activation-1.40e-02

Pos logit contributions
=\"+4.069
horn+3.941
aba+3.919
 eleph+3.843
oute+3.756
Neg logit contributions
REDACTED-4.216
Austral-3.404
 Austral-3.337
alg-3.288
Rus-3.200
Token 4
Token ID604
Feature activation+4.86e-01

Pos logit contributions
REDACTED+0.112
 Austral+0.093
Austral+0.090
alg+0.087
Rus+0.087
Neg logit contributions
 eleph-0.113
=\"-0.107
oute-0.103
horn-0.103
aba-0.103
Token,
Token ID11
Feature activation+7.80e-01

Pos logit contributions
=\"+2.662
horn+2.561
aba+2.514
oute+2.387
acket+2.284
Neg logit contributions
REDACTED-4.832
 Austral-4.241
Austral-4.158
alg-4.063
SPA-4.023
TokenEW
Token ID6217
Feature activation-2.81e-01

Pos logit contributions
REDACTED+2.249
 Austral+1.907
Austral+1.851
Rus+1.784
alg+1.782
Neg logit contributions
 eleph-1.966
=\"-1.833
oute-1.772
horn-1.762
aba-1.758
TokenAN
Token ID1565
Feature activation-2.43e-01

Pos logit contributions
REDACTED+2.417
 Austral+2.047
Austral+1.987
Rus+1.915
alg+1.914
Neg logit contributions
 eleph-2.129
=\"-1.985
oute-1.919
horn-1.909
aba-1.905
Token -
Token ID532
Feature activation+6.29e-01

Pos logit contributions
REDACTED+2.077
 Austral+1.761
Austral+1.710
Rus+1.642
alg+1.641
Neg logit contributions
 eleph-1.848
=\"-1.724
oute-1.671
horn-1.668
aba-1.663
Token favorite
Token ID4004
Feature activation+1.29e+00

Pos logit contributions
=\"+2.469
horn+2.441
aba+2.351
 eleph+2.340
oute+2.326
Neg logit contributions
REDACTED-7.187
Austral-6.276
 Austral-6.209
Rus-6.081
SPA-6.044
Token favorite
Token ID4004
Feature activation+1.34e+00

Pos logit contributions
=\"+4.280
horn+4.226
oute+3.838
aba+3.540
acket+3.111
Neg logit contributions
REDACTED-14.668
Austral-13.221
Rus-12.807
 Austral-12.787
SPA-12.634
Token favorite
Token ID4004
Feature activation+1.43e+00

Pos logit contributions
=\"+4.347
horn+4.303
oute+3.898
aba+3.553
 Dover+3.118
Neg logit contributions
REDACTED-15.241
Austral-13.779
Rus-13.312
 Austral-13.285
SPA-13.118
Token favorite
Token ID4004
Feature activation+1.36e+00

Pos logit contributions
=\"+4.756
horn+4.710
oute+4.318
aba+3.852
 eleph+3.473
Neg logit contributions
REDACTED-15.853
Austral-14.425
 Austral-13.813
Rus-13.794
SPA-13.617
Token favorite
Token ID4004
Feature activation+1.19e+00

Pos logit contributions
horn+5.022
 eleph+4.993
=\"+4.934
oute+4.648
aba+4.167
Neg logit contributions
REDACTED-14.635
Austral-13.374
 Austral-12.948
Rus-12.582
SPA-12.333
Token -
Token ID532
Feature activation+5.33e-01

Pos logit contributions
 eleph+6.188
horn+5.882
=\"+5.843
oute+5.508
aba+5.212
Neg logit contributions
REDACTED-11.479
Austral-10.389
 Austral-10.152
Rus-9.705
SPA-9.516
Token September
Token ID2693
Feature activation+1.17e-01

Pos logit contributions
=\"+3.953
horn+3.829
aba+3.803
 eleph+3.794
oute+3.673
Neg logit contributions
REDACTED-4.307
Austral-3.497
 Austral-3.443
alg-3.351
Rus-3.301
Token 26
Token ID2608
Feature activation+9.79e-02

Pos logit contributions
 eleph+0.917
=\"+0.899
horn+0.871
aba+0.866
oute+0.863
Neg logit contributions
REDACTED-0.939
 Austral-0.780
Austral-0.761
alg-0.731
Rus-0.726
Tokenimp
Token ID11011
Feature activation+7.35e-02

Pos logit contributions
 eleph+3.041
=\"+2.938
aba+2.926
horn+2.914
oute+2.914
Neg logit contributions
REDACTED-3.395
 Austral-2.877
Austral-2.747
Rus-2.624
SPA-2.606
Tokenunk
Token ID2954
Feature activation-1.88e-01

Pos logit contributions
 eleph+0.555
=\"+0.521
oute+0.511
horn+0.511
aba+0.508
Neg logit contributions
REDACTED-0.631
 Austral-0.533
Austral-0.518
Rus-0.495
SPA-0.489
Tokenrock
Token ID10823
Feature activation-3.57e-01

Pos logit contributions
REDACTED+1.627
 Austral+1.379
Austral+1.340
Rus+1.289
alg+1.288
Neg logit contributions
 eleph-1.415
=\"-1.321
oute-1.279
horn-1.275
aba-1.272
Token -
Token ID532
Feature activation+3.40e-01

Pos logit contributions
REDACTED+3.029
 Austral+2.567
Austral+2.487
Rus+2.402
alg+2.400
Neg logit contributions
 eleph-2.723
=\"-2.540
oute-2.446
horn-2.429
aba-2.428
Token favorite
Token ID4004
Feature activation+1.20e+00

Pos logit contributions
 eleph+1.394
=\"+1.267
horn+1.257
aba+1.223
oute+1.197
Neg logit contributions
REDACTED-4.031
Austral-3.461
 Austral-3.445
SPA-3.387
Rus-3.323
Token favorite
Token ID4004
Feature activation+1.42e+00

Pos logit contributions
=\"+3.519
horn+3.342
oute+3.287
 eleph+3.161
aba+2.903
Neg logit contributions
REDACTED-14.305
Austral-12.753
 Austral-12.428
SPA-12.323
Rus-12.089
Token favorite
Token ID4004
Feature activation+1.46e+00

Pos logit contributions
=\"+4.197
 eleph+4.061
horn+4.012
oute+3.895
aba+3.441
Neg logit contributions
REDACTED-16.241
Austral-14.588
 Austral-14.171
SPA-13.915
Rus-13.707
Token favorite
Token ID4004
Feature activation+1.41e+00

Pos logit contributions
 eleph+5.689
=\"+5.302
horn+5.169
oute+5.025
aba+4.565
Neg logit contributions
REDACTED-15.486
Austral-13.934
 Austral-13.558
SPA-13.050
Rus-12.962
Token -
Token ID532
Feature activation+6.70e-01

Pos logit contributions
 eleph+6.645
=\"+6.219
horn+6.126
oute+5.923
aba+5.519
Neg logit contributions
REDACTED-13.940
Austral-12.476
 Austral-12.171
SPA-11.623
Rus-11.550
Token December
Token ID3426
Feature activation+6.63e-02

Pos logit contributions
 eleph+5.199
=\"+5.148
aba+5.045
horn+5.028
oute+4.886
Neg logit contributions
REDACTED-4.988
Austral-3.987
 Austral-3.911
Rus-3.763
alg-3.755
Token 13
Token ID1511
Feature activation+9.83e-02

Pos logit contributions
 eleph+0.526
=\"+0.508
horn+0.491
oute+0.490
aba+0.489
Neg logit contributions
REDACTED-0.530
 Austral-0.441
Austral-0.429
alg-0.412
Rus-0.411
Tokenrock
Token ID10823
Feature activation-3.57e-01

Pos logit contributions
REDACTED+1.627
 Austral+1.379
Austral+1.340
Rus+1.289
alg+1.288
Neg logit contributions
 eleph-1.415
=\"-1.321
oute-1.279
horn-1.275
aba-1.272
Token -
Token ID532
Feature activation+3.40e-01

Pos logit contributions
REDACTED+3.029
 Austral+2.567
Austral+2.487
Rus+2.402
alg+2.400
Neg logit contributions
 eleph-2.723
=\"-2.540
oute-2.446
horn-2.429
aba-2.428
Token favorite
Token ID4004
Feature activation+1.20e+00

Pos logit contributions
 eleph+1.394
=\"+1.267
horn+1.257
aba+1.223
oute+1.197
Neg logit contributions
REDACTED-4.031
Austral-3.461
 Austral-3.445
SPA-3.387
Rus-3.323
Token favorite
Token ID4004
Feature activation+1.42e+00

Pos logit contributions
=\"+3.519
horn+3.342
oute+3.287
 eleph+3.161
aba+2.903
Neg logit contributions
REDACTED-14.305
Austral-12.753
 Austral-12.428
SPA-12.323
Rus-12.089
Token favorite
Token ID4004
Feature activation+1.46e+00

Pos logit contributions
=\"+4.197
 eleph+4.061
horn+4.012
oute+3.895
aba+3.441
Neg logit contributions
REDACTED-16.241
Austral-14.588
 Austral-14.171
SPA-13.915
Rus-13.707
Token favorite
Token ID4004
Feature activation+1.41e+00

Pos logit contributions
 eleph+5.689
=\"+5.302
horn+5.169
oute+5.025
aba+4.565
Neg logit contributions
REDACTED-15.486
Austral-13.934
 Austral-13.558
SPA-13.050
Rus-12.962
Token -
Token ID532
Feature activation+6.70e-01

Pos logit contributions
 eleph+6.645
=\"+6.219
horn+6.126
oute+5.923
aba+5.519
Neg logit contributions
REDACTED-13.940
Austral-12.476
 Austral-12.171
SPA-11.623
Rus-11.550
Token December
Token ID3426
Feature activation+6.63e-02

Pos logit contributions
 eleph+5.199
=\"+5.148
aba+5.045
horn+5.028
oute+4.886
Neg logit contributions
REDACTED-4.988
Austral-3.987
 Austral-3.911
Rus-3.763
alg-3.755
Token 13
Token ID1511
Feature activation+9.83e-02

Pos logit contributions
 eleph+0.526
=\"+0.508
horn+0.491
oute+0.490
aba+0.489
Neg logit contributions
REDACTED-0.530
 Austral-0.441
Austral-0.429
alg-0.412
Rus-0.411
Token,
Token ID11
Feature activation+4.36e-01

Pos logit contributions
 eleph+0.643
=\"+0.608
horn+0.582
oute+0.582
aba+0.580
Neg logit contributions
REDACTED-0.930
 Austral-0.801
Austral-0.781
alg-0.755
Rus-0.755
Token 2010
Token ID3050
Feature activation+1.79e-01

Pos logit contributions
 eleph+2.576
=\"+2.447
horn+2.389
aba+2.294
oute+2.253
Neg logit contributions
REDACTED-4.359
 Austral-3.695
Austral-3.693
Rus-3.547
SPA-3.511
Tokenr
Token ID81
Feature activation-5.31e-03

Pos logit contributions
REDACTED+2.254
 Austral+1.915
Austral+1.861
Rus+1.795
alg+1.793
Neg logit contributions
 eleph-1.922
=\"-1.789
oute-1.728
horn-1.718
aba-1.714
Token Pix
Token ID21642
Feature activation+1.96e-01

Pos logit contributions
REDACTED+0.046
 Austral+0.039
Austral+0.038
Rus+0.036
alg+0.036
Neg logit contributions
 eleph-0.040
=\"-0.037
oute-0.036
horn-0.036
aba-0.036
Token -
Token ID532
Feature activation+6.72e-01

Pos logit contributions
 eleph+1.430
=\"+1.341
aba+1.317
horn+1.316
oute+1.312
Neg logit contributions
REDACTED-1.733
 Austral-1.451
Austral-1.416
SPA-1.352
Rus-1.346
Token favorite
Token ID4004
Feature activation+1.33e+00

Pos logit contributions
horn+2.333
=\"+2.298
aba+2.234
 eleph+2.201
oute+2.175
Neg logit contributions
REDACTED-7.910
Austral-6.953
 Austral-6.815
Rus-6.704
SPA-6.701
Token favorite
Token ID4004
Feature activation+1.38e+00

Pos logit contributions
=\"+4.405
horn+4.381
oute+3.970
aba+3.683
 Dover+3.203
Neg logit contributions
REDACTED-15.073
Austral-13.602
 Austral-13.113
Rus-13.094
SPA-12.950
Token favorite
Token ID4004
Feature activation+1.39e+00

Pos logit contributions
=\"+4.649
horn+4.604
oute+4.194
aba+3.876
 Dover+3.409
Neg logit contributions
REDACTED-15.355
Austral-13.875
 Austral-13.327
Rus-13.297
SPA-13.155
Token favorite
Token ID4004
Feature activation+1.31e+00

Pos logit contributions
horn+4.607
=\"+4.602
 eleph+4.421
oute+4.264
aba+3.891
Neg logit contributions
REDACTED-15.389
Austral-13.994
 Austral-13.515
Rus-13.122
SPA-12.956
Token favorite
Token ID4004
Feature activation+1.16e+00

Pos logit contributions
 eleph+5.866
horn+5.327
=\"+5.315
oute+5.061
aba+4.685
Neg logit contributions
REDACTED-13.577
Austral-12.310
 Austral-12.019
Rus-11.472
SPA-11.304
Token -
Token ID532
Feature activation+4.91e-01

Pos logit contributions
 eleph+6.421
horn+5.944
=\"+5.925
oute+5.604
aba+5.427
Neg logit contributions
REDACTED-10.994
Austral-9.820
 Austral-9.646
Rus-9.098
SPA-9.028
Token January
Token ID3269
Feature activation+6.29e-02

Pos logit contributions
=\"+3.934
horn+3.809
aba+3.781
 eleph+3.705
oute+3.636
Neg logit contributions
REDACTED-3.688
Austral-2.946
 Austral-2.887
alg-2.823
SPA-2.749
Token 15
Token ID1315
Feature activation+2.12e-01

Pos logit contributions
 eleph+0.500
=\"+0.484
horn+0.469
aba+0.467
oute+0.466
Neg logit contributions
REDACTED-0.500
 Austral-0.415
Austral-0.405
alg-0.388
Rus-0.386
Tokenoy
Token ID726
Feature activation+8.44e-02

Pos logit contributions
 eleph+1.196
=\"+1.138
oute+1.125
horn+1.123
aba+1.111
Neg logit contributions
REDACTED-1.454
 Austral-1.222
Austral-1.187
Rus-1.138
alg-1.135
Token -
Token ID532
Feature activation+5.18e-01

Pos logit contributions
 eleph+0.619
=\"+0.575
horn+0.565
oute+0.565
aba+0.563
Neg logit contributions
REDACTED-0.741
 Austral-0.630
Austral-0.613
Rus-0.587
alg-0.582
Token favorite
Token ID4004
Feature activation+1.27e+00

Pos logit contributions
=\"+2.061
 eleph+2.035
horn+2.035
aba+1.977
oute+1.952
Neg logit contributions
REDACTED-5.933
Austral-5.158
 Austral-5.136
Rus-5.009
SPA-4.980
Token favorite
Token ID4004
Feature activation+1.35e+00

Pos logit contributions
=\"+4.442
horn+4.376
oute+3.978
aba+3.730
 Dover+3.263
Neg logit contributions
REDACTED-14.246
Austral-12.843
 Austral-12.424
Rus-12.422
SPA-12.216
Token favorite
Token ID4004
Feature activation+1.43e+00

Pos logit contributions
=\"+4.568
horn+4.501
oute+4.082
aba+3.784
 Dover+3.335
Neg logit contributions
REDACTED-15.082
Austral-13.646
Rus-13.154
 Austral-13.140
SPA-12.950
Token favorite
Token ID4004
Feature activation+1.39e+00

Pos logit contributions
=\"+4.988
horn+4.921
oute+4.475
aba+4.083
 eleph+3.698
Neg logit contributions
REDACTED-15.740
Austral-14.325
 Austral-13.688
Rus-13.670
SPA-13.488
Token -
Token ID532
Feature activation+5.35e-01

Pos logit contributions
 eleph+5.807
horn+5.396
=\"+5.392
oute+5.014
aba+4.614
Neg logit contributions
REDACTED-14.440
Austral-13.179
 Austral-12.812
Rus-12.333
SPA-12.142
Token January
Token ID3269
Feature activation-1.40e-02

Pos logit contributions
=\"+4.069
horn+3.941
aba+3.919
 eleph+3.843
oute+3.756
Neg logit contributions
REDACTED-4.216
Austral-3.404
 Austral-3.337
alg-3.288
Rus-3.200
Token 4
Token ID604
Feature activation+4.86e-01

Pos logit contributions
REDACTED+0.112
 Austral+0.093
Austral+0.090
alg+0.087
Rus+0.087
Neg logit contributions
 eleph-0.113
=\"-0.107
oute-0.103
horn-0.103
aba-0.103
Token,
Token ID11
Feature activation+7.80e-01

Pos logit contributions
=\"+2.662
horn+2.561
aba+2.514
oute+2.387
acket+2.284
Neg logit contributions
REDACTED-4.832
 Austral-4.241
Austral-4.158
alg-4.063
SPA-4.023
Token 2010
Token ID3050
Feature activation+3.04e-01

Pos logit contributions
horn+3.463
=\"+3.453
aba+3.213
oute+3.036
 eleph+2.914
Neg logit contributions
REDACTED-8.391
Austral-7.348
Rus-7.176
alg-7.147
 Austral-7.135
Tokeniz
Token ID528
Feature activation-2.58e-01

Pos logit contributions
 eleph+1.206
=\"+1.147
aba+1.129
oute+1.128
horn+1.125
Neg logit contributions
REDACTED-1.338
 Austral-1.130
Austral-1.089
Rus-1.041
alg-1.036
Tokenr
Token ID81
Feature activation-5.31e-03

Pos logit contributions
REDACTED+2.254
 Austral+1.915
Austral+1.861
Rus+1.795
alg+1.793
Neg logit contributions
 eleph-1.922
=\"-1.789
oute-1.728
horn-1.718
aba-1.714
Token Pix
Token ID21642
Feature activation+1.96e-01

Pos logit contributions
REDACTED+0.046
 Austral+0.039
Austral+0.038
Rus+0.036
alg+0.036
Neg logit contributions
 eleph-0.040
=\"-0.037
oute-0.036
horn-0.036
aba-0.036
Token -
Token ID532
Feature activation+6.72e-01

Pos logit contributions
 eleph+1.430
=\"+1.341
aba+1.317
horn+1.316
oute+1.312
Neg logit contributions
REDACTED-1.733
 Austral-1.451
Austral-1.416
SPA-1.352
Rus-1.346
Token favorite
Token ID4004
Feature activation+1.33e+00

Pos logit contributions
horn+2.333
=\"+2.298
aba+2.234
 eleph+2.201
oute+2.175
Neg logit contributions
REDACTED-7.910
Austral-6.953
 Austral-6.815
Rus-6.704
SPA-6.701
Token favorite
Token ID4004
Feature activation+1.38e+00

Pos logit contributions
=\"+4.405
horn+4.381
oute+3.970
aba+3.683
 Dover+3.203
Neg logit contributions
REDACTED-15.073
Austral-13.602
 Austral-13.113
Rus-13.094
SPA-12.950
Token favorite
Token ID4004
Feature activation+1.39e+00

Pos logit contributions
=\"+4.649
horn+4.604
oute+4.194
aba+3.876
 Dover+3.409
Neg logit contributions
REDACTED-15.355
Austral-13.875
 Austral-13.327
Rus-13.297
SPA-13.155
Token favorite
Token ID4004
Feature activation+1.31e+00

Pos logit contributions
horn+4.607
=\"+4.602
 eleph+4.421
oute+4.264
aba+3.891
Neg logit contributions
REDACTED-15.389
Austral-13.994
 Austral-13.515
Rus-13.122
SPA-12.956
Token favorite
Token ID4004
Feature activation+1.16e+00

Pos logit contributions
 eleph+5.866
horn+5.327
=\"+5.315
oute+5.061
aba+4.685
Neg logit contributions
REDACTED-13.577
Austral-12.310
 Austral-12.019
Rus-11.472
SPA-11.304
Token -
Token ID532
Feature activation+4.91e-01

Pos logit contributions
 eleph+6.421
horn+5.944
=\"+5.925
oute+5.604
aba+5.427
Neg logit contributions
REDACTED-10.994
Austral-9.820
 Austral-9.646
Rus-9.098
SPA-9.028
Token January
Token ID3269
Feature activation+6.29e-02

Pos logit contributions
=\"+3.934
horn+3.809
aba+3.781
 eleph+3.705
oute+3.636
Neg logit contributions
REDACTED-3.688
Austral-2.946
 Austral-2.887
alg-2.823
SPA-2.749
Tokenwt
Token ID46569
Feature activation-1.22e-02

Pos logit contributions
 eleph+3.545
=\"+3.437
oute+3.424
aba+3.414
horn+3.408
Neg logit contributions
REDACTED-4.143
 Austral-3.565
Austral-3.392
Rus-3.225
SPA-3.207
Token0001
Token ID18005
Feature activation-2.20e-01

Pos logit contributions
REDACTED+0.102
 Austral+0.086
Austral+0.084
Rus+0.080
alg+0.080
Neg logit contributions
 eleph-0.095
=\"-0.089
oute-0.087
horn-0.086
aba-0.086
Token -
Token ID532
Feature activation+7.45e-01

Pos logit contributions
REDACTED+1.892
 Austral+1.603
Austral+1.559
Rus+1.497
alg+1.493
Neg logit contributions
 eleph-1.658
=\"-1.548
oute-1.498
horn-1.496
aba-1.491
Token favorite
Token ID4004
Feature activation+1.26e+00

Pos logit contributions
horn+2.618
=\"+2.605
aba+2.488
oute+2.434
 eleph+2.357
Neg logit contributions
REDACTED-8.742
Austral-7.640
 Austral-7.536
SPA-7.443
Rus-7.376
Token favorite
Token ID4004
Feature activation+1.34e+00

Pos logit contributions
=\"+4.195
horn+4.086
oute+3.807
aba+3.448
orf+3.016
Neg logit contributions
REDACTED-14.504
Austral-13.009
 Austral-12.587
Rus-12.488
SPA-12.459
Token favorite
Token ID4004
Feature activation+1.37e+00

Pos logit contributions
=\"+4.451
horn+4.341
oute+4.026
aba+3.640
 Dover+3.225
Neg logit contributions
REDACTED-15.142
Austral-13.624
 Austral-13.111
Rus-13.021
SPA-12.980
Token favorite
Token ID4004
Feature activation+1.29e+00

Pos logit contributions
=\"+4.537
horn+4.503
 eleph+4.340
oute+4.237
aba+3.782
Neg logit contributions
REDACTED-15.207
Austral-13.798
 Austral-13.370
Rus-12.932
SPA-12.864
Token -
Token ID532
Feature activation+6.15e-01

Pos logit contributions
 eleph+6.139
=\"+5.701
horn+5.676
oute+5.428
aba+5.036
Neg logit contributions
REDACTED-12.963
Austral-11.668
 Austral-11.456
Rus-10.838
SPA-10.783
Token July
Token ID2901
Feature activation+4.35e-02

Pos logit contributions
=\"+4.654
horn+4.502
aba+4.464
 eleph+4.368
oute+4.288
Neg logit contributions
REDACTED-4.815
Austral-3.898
 Austral-3.824
alg-3.738
SPA-3.659
Token 6
Token ID718
Feature activation+3.69e-01

Pos logit contributions
 eleph+0.346
=\"+0.334
horn+0.323
aba+0.322
oute+0.322
Neg logit contributions
REDACTED-0.346
 Austral-0.288
Austral-0.280
alg-0.269
Rus-0.268
Token,
Token ID11
Feature activation+6.92e-01

Pos logit contributions
=\"+2.087
horn+2.002
aba+1.971
oute+1.903
 eleph+1.833
Neg logit contributions
REDACTED-3.649
 Austral-3.182
Austral-3.113
alg-3.037
SPA-3.012
Tokenur
Token ID333
Feature activation-1.77e-02

Pos logit contributions
REDACTED+2.361
 Austral+1.994
Austral+1.935
alg+1.862
Rus+1.862
Neg logit contributions
 eleph-2.186
=\"-2.038
oute-1.970
horn-1.960
aba-1.956
Tokenker
Token ID6122
Feature activation-4.27e-01

Pos logit contributions
REDACTED+0.152
 Austral+0.128
Austral+0.124
alg+0.119
Rus+0.119
Neg logit contributions
 eleph-0.134
=\"-0.125
horn-0.122
oute-0.122
aba-0.121
Token -
Token ID532
Feature activation+6.42e-01

Pos logit contributions
REDACTED+3.563
 Austral+3.005
Austral+2.920
Rus+2.822
alg+2.808
Neg logit contributions
 eleph-3.322
=\"-3.110
oute-2.990
aba-2.967
horn-2.955
Token favorite
Token ID4004
Feature activation+1.26e+00

Pos logit contributions
horn+2.381
=\"+2.378
aba+2.283
 eleph+2.260
oute+2.238
Neg logit contributions
REDACTED-7.446
Austral-6.488
 Austral-6.412
SPA-6.283
Rus-6.276
Token favorite
Token ID4004
Feature activation+1.32e+00

Pos logit contributions
=\"+4.320
horn+4.268
oute+3.923
aba+3.654
 Dover+3.175
Neg logit contributions
REDACTED-14.320
Austral-12.857
 Austral-12.421
Rus-12.395
SPA-12.288
Token favorite
Token ID4004
Feature activation+1.37e+00

Pos logit contributions
=\"+4.506
horn+4.451
oute+4.098
aba+3.800
 Dover+3.332
Neg logit contributions
REDACTED-14.797
Austral-13.324
 Austral-12.818
Rus-12.796
SPA-12.685
Token favorite
Token ID4004
Feature activation+1.30e+00

Pos logit contributions
=\"+4.705
horn+4.677
oute+4.343
 eleph+4.050
aba+4.021
Neg logit contributions
REDACTED-15.101
Austral-13.675
 Austral-13.138
Rus-12.861
SPA-12.755
Token favorite
Token ID4004
Feature activation+1.10e+00

Pos logit contributions
 eleph+6.209
horn+5.698
=\"+5.693
oute+5.404
aba+5.067
Neg logit contributions
REDACTED-13.044
Austral-11.778
 Austral-11.521
Rus-10.952
SPA-10.839
Token -
Token ID532
Feature activation+4.53e-01

Pos logit contributions
 eleph+6.242
=\"+5.846
horn+5.800
oute+5.410
aba+5.369
Neg logit contributions
REDACTED-10.398
Austral-9.199
 Austral-9.084
SPA-8.544
Rus-8.531
Token February
Token ID3945
Feature activation-5.99e-02

Pos logit contributions
=\"+3.903
horn+3.776
aba+3.756
 eleph+3.670
oute+3.621
Neg logit contributions
REDACTED-3.142
Austral-2.457
 Austral-2.406
alg-2.347
SPA-2.299
Token 9
Token ID860
Feature activation+3.57e-01

Pos logit contributions
REDACTED+0.470
 Austral+0.392
Austral+0.379
Rus+0.363
alg+0.363
Neg logit contributions
 eleph-0.498
=\"-0.467
oute-0.453
horn-0.451
aba-0.450
TokenAN
Token ID1565
Feature activation-2.43e-01

Pos logit contributions
REDACTED+2.417
 Austral+2.047
Austral+1.987
Rus+1.915
alg+1.914
Neg logit contributions
 eleph-2.129
=\"-1.985
oute-1.919
horn-1.909
aba-1.905
Token -
Token ID532
Feature activation+6.29e-01

Pos logit contributions
REDACTED+2.077
 Austral+1.761
Austral+1.710
Rus+1.642
alg+1.641
Neg logit contributions
 eleph-1.848
=\"-1.724
oute-1.671
horn-1.668
aba-1.663
Token favorite
Token ID4004
Feature activation+1.29e+00

Pos logit contributions
=\"+2.469
horn+2.441
aba+2.351
 eleph+2.340
oute+2.326
Neg logit contributions
REDACTED-7.187
Austral-6.276
 Austral-6.209
Rus-6.081
SPA-6.044
Token favorite
Token ID4004
Feature activation+1.34e+00

Pos logit contributions
=\"+4.280
horn+4.226
oute+3.838
aba+3.540
acket+3.111
Neg logit contributions
REDACTED-14.668
Austral-13.221
Rus-12.807
 Austral-12.787
SPA-12.634
Token favorite
Token ID4004
Feature activation+1.43e+00

Pos logit contributions
=\"+4.347
horn+4.303
oute+3.898
aba+3.553
 Dover+3.118
Neg logit contributions
REDACTED-15.241
Austral-13.779
Rus-13.312
 Austral-13.285
SPA-13.118
Token favorite
Token ID4004
Feature activation+1.36e+00

Pos logit contributions
=\"+4.756
horn+4.710
oute+4.318
aba+3.852
 eleph+3.473
Neg logit contributions
REDACTED-15.853
Austral-14.425
 Austral-13.813
Rus-13.794
SPA-13.617
Token favorite
Token ID4004
Feature activation+1.19e+00

Pos logit contributions
horn+5.022
 eleph+4.993
=\"+4.934
oute+4.648
aba+4.167
Neg logit contributions
REDACTED-14.635
Austral-13.374
 Austral-12.948
Rus-12.582
SPA-12.333
Token -
Token ID532
Feature activation+5.33e-01

Pos logit contributions
 eleph+6.188
horn+5.882
=\"+5.843
oute+5.508
aba+5.212
Neg logit contributions
REDACTED-11.479
Austral-10.389
 Austral-10.152
Rus-9.705
SPA-9.516
Token September
Token ID2693
Feature activation+1.17e-01

Pos logit contributions
=\"+3.953
horn+3.829
aba+3.803
 eleph+3.794
oute+3.673
Neg logit contributions
REDACTED-4.307
Austral-3.497
 Austral-3.443
alg-3.351
Rus-3.301
Token 26
Token ID2608
Feature activation+9.79e-02

Pos logit contributions
 eleph+0.917
=\"+0.899
horn+0.871
aba+0.866
oute+0.863
Neg logit contributions
REDACTED-0.939
 Austral-0.780
Austral-0.761
alg-0.731
Rus-0.726
Token,
Token ID11
Feature activation+6.40e-01

Pos logit contributions
 eleph+0.608
=\"+0.598
horn+0.572
aba+0.568
oute+0.565
Neg logit contributions
REDACTED-0.934
 Austral-0.807
Austral-0.787
alg-0.763
Rus-0.761
TokenM
Token ID44
Feature activation-1.59e-01

Pos logit contributions
REDACTED+0.049
 Austral+0.041
Austral+0.040
alg+0.039
Rus+0.038
Neg logit contributions
 eleph-0.046
=\"-0.043
horn-0.042
oute-0.042
aba-0.042
Tokenanners
Token ID15672
Feature activation-3.92e-01

Pos logit contributions
REDACTED+1.059
 Austral+0.852
Austral+0.816
Rus+0.774
alg+0.773
Neg logit contributions
 eleph-1.509
=\"-1.437
oute-1.402
horn-1.396
aba-1.395
Token -
Token ID532
Feature activation+7.10e-01

Pos logit contributions
REDACTED+3.283
 Austral+2.768
Austral+2.684
Rus+2.590
alg+2.580
Neg logit contributions
 eleph-3.046
=\"-2.849
oute-2.752
aba-2.728
horn-2.727
Token favorite
Token ID4004
Feature activation+1.20e+00

Pos logit contributions
horn+2.657
=\"+2.634
aba+2.518
oute+2.500
 eleph+2.438
Neg logit contributions
REDACTED-8.173
Austral-7.158
 Austral-7.056
Rus-6.962
SPA-6.901
Token favorite
Token ID4004
Feature activation+1.27e+00

Pos logit contributions
=\"+4.164
horn+4.106
oute+3.732
aba+3.485
 eleph+3.150
Neg logit contributions
REDACTED-13.592
Austral-12.251
 Austral-11.876
Rus-11.855
SPA-11.643
Token favorite
Token ID4004
Feature activation+1.36e+00

Pos logit contributions
=\"+4.324
horn+4.271
oute+3.875
aba+3.582
 eleph+3.213
Neg logit contributions
REDACTED-14.335
Austral-12.970
 Austral-12.520
Rus-12.515
SPA-12.290
Token favorite
Token ID4004
Feature activation+1.34e+00

Pos logit contributions
=\"+4.749
horn+4.700
oute+4.293
aba+3.893
 eleph+3.619
Neg logit contributions
REDACTED-15.031
Austral-13.680
 Austral-13.111
Rus-13.080
SPA-12.879
Token favorite
Token ID4004
Feature activation+1.21e+00

Pos logit contributions
 eleph+5.625
horn+5.360
=\"+5.270
oute+4.948
aba+4.559
Neg logit contributions
REDACTED-13.976
Austral-12.755
 Austral-12.397
Rus-11.983
SPA-11.715
Token -
Token ID532
Feature activation+6.10e-01

Pos logit contributions
 eleph+6.633
horn+6.132
=\"+6.073
oute+5.714
aba+5.546
Neg logit contributions
REDACTED-11.348
Austral-10.211
 Austral-10.023
Rus-9.507
SPA-9.383
Token December
Token ID3426
Feature activation+1.27e-01

Pos logit contributions
=\"+4.327
horn+4.216
aba+4.178
 eleph+4.119
oute+3.994
Neg logit contributions
REDACTED-5.062
Austral-4.147
 Austral-4.051
alg-4.023
Rus-3.920
Token 5
Token ID642
Feature activation+3.41e-01

Pos logit contributions
 eleph+0.988
=\"+0.970
horn+0.941
aba+0.935
oute+0.930
Neg logit contributions
REDACTED-1.015
 Austral-0.839
Austral-0.821
alg-0.789
Rus-0.783
Token_
Token ID62
Feature activation-6.80e-02

Pos logit contributions
 eleph+2.898
horn+2.708
=\"+2.701
aba+2.673
oute+2.669
Neg logit contributions
REDACTED-3.025
 Austral-2.561
Austral-2.449
Rus-2.349
SPA-2.331
Tokenler
Token ID1754
Feature activation+1.65e-01

Pos logit contributions
REDACTED+0.575
 Austral+0.488
Austral+0.471
Rus+0.453
alg+0.451
Neg logit contributions
 eleph-0.521
=\"-0.489
horn-0.476
oute-0.475
aba-0.473
Tokenoy
Token ID726
Feature activation+8.44e-02

Pos logit contributions
 eleph+1.196
=\"+1.138
oute+1.125
horn+1.123
aba+1.111
Neg logit contributions
REDACTED-1.454
 Austral-1.222
Austral-1.187
Rus-1.138
alg-1.135
Token -
Token ID532
Feature activation+5.18e-01

Pos logit contributions
 eleph+0.619
=\"+0.575
horn+0.565
oute+0.565
aba+0.563
Neg logit contributions
REDACTED-0.741
 Austral-0.630
Austral-0.613
Rus-0.587
alg-0.582
Token favorite
Token ID4004
Feature activation+1.27e+00

Pos logit contributions
=\"+2.061
 eleph+2.035
horn+2.035
aba+1.977
oute+1.952
Neg logit contributions
REDACTED-5.933
Austral-5.158
 Austral-5.136
Rus-5.009
SPA-4.980
Token favorite
Token ID4004
Feature activation+1.35e+00

Pos logit contributions
=\"+4.442
horn+4.376
oute+3.978
aba+3.730
 Dover+3.263
Neg logit contributions
REDACTED-14.246
Austral-12.843
 Austral-12.424
Rus-12.422
SPA-12.216
Token favorite
Token ID4004
Feature activation+1.43e+00

Pos logit contributions
=\"+4.568
horn+4.501
oute+4.082
aba+3.784
 Dover+3.335
Neg logit contributions
REDACTED-15.082
Austral-13.646
Rus-13.154
 Austral-13.140
SPA-12.950
Token favorite
Token ID4004
Feature activation+1.39e+00

Pos logit contributions
=\"+4.988
horn+4.921
oute+4.475
aba+4.083
 eleph+3.698
Neg logit contributions
REDACTED-15.740
Austral-14.325
 Austral-13.688
Rus-13.670
SPA-13.488
Token -
Token ID532
Feature activation+5.35e-01

Pos logit contributions
 eleph+5.807
horn+5.396
=\"+5.392
oute+5.014
aba+4.614
Neg logit contributions
REDACTED-14.440
Austral-13.179
 Austral-12.812
Rus-12.333
SPA-12.142
Token January
Token ID3269
Feature activation-1.40e-02

Pos logit contributions
=\"+4.069
horn+3.941
aba+3.919
 eleph+3.843
oute+3.756
Neg logit contributions
REDACTED-4.216
Austral-3.404
 Austral-3.337
alg-3.288
Rus-3.200
Token 4
Token ID604
Feature activation+4.86e-01

Pos logit contributions
REDACTED+0.112
 Austral+0.093
Austral+0.090
alg+0.087
Rus+0.087
Neg logit contributions
 eleph-0.113
=\"-0.107
oute-0.103
horn-0.103
aba-0.103
Tokenanners
Token ID15672
Feature activation-3.92e-01

Pos logit contributions
REDACTED+1.059
 Austral+0.852
Austral+0.816
Rus+0.774
alg+0.773
Neg logit contributions
 eleph-1.509
=\"-1.437
oute-1.402
horn-1.396
aba-1.395
Token -
Token ID532
Feature activation+7.10e-01

Pos logit contributions
REDACTED+3.283
 Austral+2.768
Austral+2.684
Rus+2.590
alg+2.580
Neg logit contributions
 eleph-3.046
=\"-2.849
oute-2.752
aba-2.728
horn-2.727
Token favorite
Token ID4004
Feature activation+1.20e+00

Pos logit contributions
horn+2.657
=\"+2.634
aba+2.518
oute+2.500
 eleph+2.438
Neg logit contributions
REDACTED-8.173
Austral-7.158
 Austral-7.056
Rus-6.962
SPA-6.901
Token favorite
Token ID4004
Feature activation+1.27e+00

Pos logit contributions
=\"+4.164
horn+4.106
oute+3.732
aba+3.485
 eleph+3.150
Neg logit contributions
REDACTED-13.592
Austral-12.251
 Austral-11.876
Rus-11.855
SPA-11.643
Token favorite
Token ID4004
Feature activation+1.36e+00

Pos logit contributions
=\"+4.324
horn+4.271
oute+3.875
aba+3.582
 eleph+3.213
Neg logit contributions
REDACTED-14.335
Austral-12.970
 Austral-12.520
Rus-12.515
SPA-12.290
Token favorite
Token ID4004
Feature activation+1.34e+00

Pos logit contributions
=\"+4.749
horn+4.700
oute+4.293
aba+3.893
 eleph+3.619
Neg logit contributions
REDACTED-15.031
Austral-13.680
 Austral-13.111
Rus-13.080
SPA-12.879
Token favorite
Token ID4004
Feature activation+1.21e+00

Pos logit contributions
 eleph+5.625
horn+5.360
=\"+5.270
oute+4.948
aba+4.559
Neg logit contributions
REDACTED-13.976
Austral-12.755
 Austral-12.397
Rus-11.983
SPA-11.715
Token -
Token ID532
Feature activation+6.10e-01

Pos logit contributions
 eleph+6.633
horn+6.132
=\"+6.073
oute+5.714
aba+5.546
Neg logit contributions
REDACTED-11.348
Austral-10.211
 Austral-10.023
Rus-9.507
SPA-9.383
Token December
Token ID3426
Feature activation+1.27e-01

Pos logit contributions
=\"+4.327
horn+4.216
aba+4.178
 eleph+4.119
oute+3.994
Neg logit contributions
REDACTED-5.062
Austral-4.147
 Austral-4.051
alg-4.023
Rus-3.920
Token 5
Token ID642
Feature activation+3.41e-01

Pos logit contributions
 eleph+0.988
=\"+0.970
horn+0.941
aba+0.935
oute+0.930
Neg logit contributions
REDACTED-1.015
 Austral-0.839
Austral-0.821
alg-0.789
Rus-0.783
Token,
Token ID11
Feature activation+6.08e-01

Pos logit contributions
=\"+1.952
horn+1.883
aba+1.853
oute+1.796
 eleph+1.795
Neg logit contributions
REDACTED-3.347
 Austral-2.908
Austral-2.844
alg-2.777
SPA-2.756
TokenATCH
Token ID11417
Feature activation-2.61e-01

Pos logit contributions
REDACTED+0.379
 Austral+0.321
Austral+0.312
alg+0.300
Rus+0.299
Neg logit contributions
 eleph-0.339
=\"-0.318
oute-0.308
horn-0.308
aba-0.307
TokenEW
Token ID6217
Feature activation-2.81e-01

Pos logit contributions
REDACTED+2.249
 Austral+1.907
Austral+1.851
Rus+1.784
alg+1.782
Neg logit contributions
 eleph-1.966
=\"-1.833
oute-1.772
horn-1.762
aba-1.758
TokenAN
Token ID1565
Feature activation-2.43e-01

Pos logit contributions
REDACTED+2.417
 Austral+2.047
Austral+1.987
Rus+1.915
alg+1.914
Neg logit contributions
 eleph-2.129
=\"-1.985
oute-1.919
horn-1.909
aba-1.905
Token -
Token ID532
Feature activation+6.29e-01

Pos logit contributions
REDACTED+2.077
 Austral+1.761
Austral+1.710
Rus+1.642
alg+1.641
Neg logit contributions
 eleph-1.848
=\"-1.724
oute-1.671
horn-1.668
aba-1.663
Token favorite
Token ID4004
Feature activation+1.29e+00

Pos logit contributions
=\"+2.469
horn+2.441
aba+2.351
 eleph+2.340
oute+2.326
Neg logit contributions
REDACTED-7.187
Austral-6.276
 Austral-6.209
Rus-6.081
SPA-6.044
Token favorite
Token ID4004
Feature activation+1.34e+00

Pos logit contributions
=\"+4.280
horn+4.226
oute+3.838
aba+3.540
acket+3.111
Neg logit contributions
REDACTED-14.668
Austral-13.221
Rus-12.807
 Austral-12.787
SPA-12.634
Token favorite
Token ID4004
Feature activation+1.43e+00

Pos logit contributions
=\"+4.347
horn+4.303
oute+3.898
aba+3.553
 Dover+3.118
Neg logit contributions
REDACTED-15.241
Austral-13.779
Rus-13.312
 Austral-13.285
SPA-13.118
Token favorite
Token ID4004
Feature activation+1.36e+00

Pos logit contributions
=\"+4.756
horn+4.710
oute+4.318
aba+3.852
 eleph+3.473
Neg logit contributions
REDACTED-15.853
Austral-14.425
 Austral-13.813
Rus-13.794
SPA-13.617
Token favorite
Token ID4004
Feature activation+1.19e+00

Pos logit contributions
horn+5.022
 eleph+4.993
=\"+4.934
oute+4.648
aba+4.167
Neg logit contributions
REDACTED-14.635
Austral-13.374
 Austral-12.948
Rus-12.582
SPA-12.333
Token -
Token ID532
Feature activation+5.33e-01

Pos logit contributions
 eleph+6.188
horn+5.882
=\"+5.843
oute+5.508
aba+5.212
Neg logit contributions
REDACTED-11.479
Austral-10.389
 Austral-10.152
Rus-9.705
SPA-9.516
Token September
Token ID2693
Feature activation+1.17e-01

Pos logit contributions
=\"+3.953
horn+3.829
aba+3.803
 eleph+3.794
oute+3.673
Neg logit contributions
REDACTED-4.307
Austral-3.497
 Austral-3.443
alg-3.351
Rus-3.301
Token j
Token ID474
Feature activation+4.90e-01

Pos logit contributions
 eleph+4.438
=\"+4.135
horn+4.106
oute+4.051
aba+4.012
Neg logit contributions
REDACTED-4.357
Austral-3.582
 Austral-3.512
Rus-3.374
SPA-3.347
Tokenwt
Token ID46569
Feature activation-1.22e-02

Pos logit contributions
 eleph+3.545
=\"+3.437
oute+3.424
aba+3.414
horn+3.408
Neg logit contributions
REDACTED-4.143
 Austral-3.565
Austral-3.392
Rus-3.225
SPA-3.207
Token0001
Token ID18005
Feature activation-2.20e-01

Pos logit contributions
REDACTED+0.102
 Austral+0.086
Austral+0.084
Rus+0.080
alg+0.080
Neg logit contributions
 eleph-0.095
=\"-0.089
oute-0.087
horn-0.086
aba-0.086
Token -
Token ID532
Feature activation+7.45e-01

Pos logit contributions
REDACTED+1.892
 Austral+1.603
Austral+1.559
Rus+1.497
alg+1.493
Neg logit contributions
 eleph-1.658
=\"-1.548
oute-1.498
horn-1.496
aba-1.491
Token favorite
Token ID4004
Feature activation+1.26e+00

Pos logit contributions
horn+2.618
=\"+2.605
aba+2.488
oute+2.434
 eleph+2.357
Neg logit contributions
REDACTED-8.742
Austral-7.640
 Austral-7.536
SPA-7.443
Rus-7.376
Token favorite
Token ID4004
Feature activation+1.34e+00

Pos logit contributions
=\"+4.195
horn+4.086
oute+3.807
aba+3.448
orf+3.016
Neg logit contributions
REDACTED-14.504
Austral-13.009
 Austral-12.587
Rus-12.488
SPA-12.459
Token favorite
Token ID4004
Feature activation+1.37e+00

Pos logit contributions
=\"+4.451
horn+4.341
oute+4.026
aba+3.640
 Dover+3.225
Neg logit contributions
REDACTED-15.142
Austral-13.624
 Austral-13.111
Rus-13.021
SPA-12.980
Token favorite
Token ID4004
Feature activation+1.29e+00

Pos logit contributions
=\"+4.537
horn+4.503
 eleph+4.340
oute+4.237
aba+3.782
Neg logit contributions
REDACTED-15.207
Austral-13.798
 Austral-13.370
Rus-12.932
SPA-12.864
Token -
Token ID532
Feature activation+6.15e-01

Pos logit contributions
 eleph+6.139
=\"+5.701
horn+5.676
oute+5.428
aba+5.036
Neg logit contributions
REDACTED-12.963
Austral-11.668
 Austral-11.456
Rus-10.838
SPA-10.783
Token July
Token ID2901
Feature activation+4.35e-02

Pos logit contributions
=\"+4.654
horn+4.502
aba+4.464
 eleph+4.368
oute+4.288
Neg logit contributions
REDACTED-4.815
Austral-3.898
 Austral-3.824
alg-3.738
SPA-3.659
Token 6
Token ID718
Feature activation+3.69e-01

Pos logit contributions
 eleph+0.346
=\"+0.334
horn+0.323
aba+0.322
oute+0.322
Neg logit contributions
REDACTED-0.346
 Austral-0.288
Austral-0.280
alg-0.269
Rus-0.268
Token B
Token ID347
Feature activation+1.59e-01

Pos logit contributions
 eleph+4.556
=\"+4.255
horn+4.215
oute+4.171
aba+4.126
Neg logit contributions
REDACTED-4.429
Austral-3.646
 Austral-3.554
Rus-3.421
alg-3.395
Tokeniz
Token ID528
Feature activation-2.58e-01

Pos logit contributions
 eleph+1.206
=\"+1.147
aba+1.129
oute+1.128
horn+1.125
Neg logit contributions
REDACTED-1.338
 Austral-1.130
Austral-1.089
Rus-1.041
alg-1.036
Tokenr
Token ID81
Feature activation-5.31e-03

Pos logit contributions
REDACTED+2.254
 Austral+1.915
Austral+1.861
Rus+1.795
alg+1.793
Neg logit contributions
 eleph-1.922
=\"-1.789
oute-1.728
horn-1.718
aba-1.714
Token Pix
Token ID21642
Feature activation+1.96e-01

Pos logit contributions
REDACTED+0.046
 Austral+0.039
Austral+0.038
Rus+0.036
alg+0.036
Neg logit contributions
 eleph-0.040
=\"-0.037
oute-0.036
horn-0.036
aba-0.036
Token -
Token ID532
Feature activation+6.72e-01

Pos logit contributions
 eleph+1.430
=\"+1.341
aba+1.317
horn+1.316
oute+1.312
Neg logit contributions
REDACTED-1.733
 Austral-1.451
Austral-1.416
SPA-1.352
Rus-1.346
Token favorite
Token ID4004
Feature activation+1.33e+00

Pos logit contributions
horn+2.333
=\"+2.298
aba+2.234
 eleph+2.201
oute+2.175
Neg logit contributions
REDACTED-7.910
Austral-6.953
 Austral-6.815
Rus-6.704
SPA-6.701
Token favorite
Token ID4004
Feature activation+1.38e+00

Pos logit contributions
=\"+4.405
horn+4.381
oute+3.970
aba+3.683
 Dover+3.203
Neg logit contributions
REDACTED-15.073
Austral-13.602
 Austral-13.113
Rus-13.094
SPA-12.950
Token favorite
Token ID4004
Feature activation+1.39e+00

Pos logit contributions
=\"+4.649
horn+4.604
oute+4.194
aba+3.876
 Dover+3.409
Neg logit contributions
REDACTED-15.355
Austral-13.875
 Austral-13.327
Rus-13.297
SPA-13.155
Token favorite
Token ID4004
Feature activation+1.31e+00

Pos logit contributions
horn+4.607
=\"+4.602
 eleph+4.421
oute+4.264
aba+3.891
Neg logit contributions
REDACTED-15.389
Austral-13.994
 Austral-13.515
Rus-13.122
SPA-12.956
Token favorite
Token ID4004
Feature activation+1.16e+00

Pos logit contributions
 eleph+5.866
horn+5.327
=\"+5.315
oute+5.061
aba+4.685
Neg logit contributions
REDACTED-13.577
Austral-12.310
 Austral-12.019
Rus-11.472
SPA-11.304
Token -
Token ID532
Feature activation+4.91e-01

Pos logit contributions
 eleph+6.421
horn+5.944
=\"+5.925
oute+5.604
aba+5.427
Neg logit contributions
REDACTED-10.994
Austral-9.820
 Austral-9.646
Rus-9.098
SPA-9.028
TokenL
Token ID43
Feature activation-2.81e-01

Pos logit contributions
REDACTED+0.560
 Austral+0.473
Austral+0.458
alg+0.441
Rus+0.439
Neg logit contributions
 eleph-0.501
=\"-0.470
horn-0.457
oute-0.455
aba-0.453
Tokenur
Token ID333
Feature activation-1.77e-02

Pos logit contributions
REDACTED+2.361
 Austral+1.994
Austral+1.935
alg+1.862
Rus+1.862
Neg logit contributions
 eleph-2.186
=\"-2.038
oute-1.970
horn-1.960
aba-1.956
Tokenker
Token ID6122
Feature activation-4.27e-01

Pos logit contributions
REDACTED+0.152
 Austral+0.128
Austral+0.124
alg+0.119
Rus+0.119
Neg logit contributions
 eleph-0.134
=\"-0.125
horn-0.122
oute-0.122
aba-0.121
Token -
Token ID532
Feature activation+6.42e-01

Pos logit contributions
REDACTED+3.563
 Austral+3.005
Austral+2.920
Rus+2.822
alg+2.808
Neg logit contributions
 eleph-3.322
=\"-3.110
oute-2.990
aba-2.967
horn-2.955
Token favorite
Token ID4004
Feature activation+1.26e+00

Pos logit contributions
horn+2.381
=\"+2.378
aba+2.283
 eleph+2.260
oute+2.238
Neg logit contributions
REDACTED-7.446
Austral-6.488
 Austral-6.412
SPA-6.283
Rus-6.276
Token favorite
Token ID4004
Feature activation+1.32e+00

Pos logit contributions
=\"+4.320
horn+4.268
oute+3.923
aba+3.654
 Dover+3.175
Neg logit contributions
REDACTED-14.320
Austral-12.857
 Austral-12.421
Rus-12.395
SPA-12.288
Token favorite
Token ID4004
Feature activation+1.37e+00

Pos logit contributions
=\"+4.506
horn+4.451
oute+4.098
aba+3.800
 Dover+3.332
Neg logit contributions
REDACTED-14.797
Austral-13.324
 Austral-12.818
Rus-12.796
SPA-12.685
Token favorite
Token ID4004
Feature activation+1.30e+00

Pos logit contributions
=\"+4.705
horn+4.677
oute+4.343
 eleph+4.050
aba+4.021
Neg logit contributions
REDACTED-15.101
Austral-13.675
 Austral-13.138
Rus-12.861
SPA-12.755
Token favorite
Token ID4004
Feature activation+1.10e+00

Pos logit contributions
 eleph+6.209
horn+5.698
=\"+5.693
oute+5.404
aba+5.067
Neg logit contributions
REDACTED-13.044
Austral-11.778
 Austral-11.521
Rus-10.952
SPA-10.839
Token -
Token ID532
Feature activation+4.53e-01

Pos logit contributions
 eleph+6.242
=\"+5.846
horn+5.800
oute+5.410
aba+5.369
Neg logit contributions
REDACTED-10.398
Austral-9.199
 Austral-9.084
SPA-8.544
Rus-8.531
Token February
Token ID3945
Feature activation-5.99e-02

Pos logit contributions
=\"+3.903
horn+3.776
aba+3.756
 eleph+3.670
oute+3.621
Neg logit contributions
REDACTED-3.142
Austral-2.457
 Austral-2.406
alg-2.347
SPA-2.299
Token Pix
Token ID21642
Feature activation+1.96e-01

Pos logit contributions
REDACTED+0.046
 Austral+0.039
Austral+0.038
Rus+0.036
alg+0.036
Neg logit contributions
 eleph-0.040
=\"-0.037
oute-0.036
horn-0.036
aba-0.036
Token -
Token ID532
Feature activation+6.72e-01

Pos logit contributions
 eleph+1.430
=\"+1.341
aba+1.317
horn+1.316
oute+1.312
Neg logit contributions
REDACTED-1.733
 Austral-1.451
Austral-1.416
SPA-1.352
Rus-1.346
Token favorite
Token ID4004
Feature activation+1.33e+00

Pos logit contributions
horn+2.333
=\"+2.298
aba+2.234
 eleph+2.201
oute+2.175
Neg logit contributions
REDACTED-7.910
Austral-6.953
 Austral-6.815
Rus-6.704
SPA-6.701
Token favorite
Token ID4004
Feature activation+1.38e+00

Pos logit contributions
=\"+4.405
horn+4.381
oute+3.970
aba+3.683
 Dover+3.203
Neg logit contributions
REDACTED-15.073
Austral-13.602
 Austral-13.113
Rus-13.094
SPA-12.950
Token favorite
Token ID4004
Feature activation+1.39e+00

Pos logit contributions
=\"+4.649
horn+4.604
oute+4.194
aba+3.876
 Dover+3.409
Neg logit contributions
REDACTED-15.355
Austral-13.875
 Austral-13.327
Rus-13.297
SPA-13.155
Token favorite
Token ID4004
Feature activation+1.31e+00

Pos logit contributions
horn+4.607
=\"+4.602
 eleph+4.421
oute+4.264
aba+3.891
Neg logit contributions
REDACTED-15.389
Austral-13.994
 Austral-13.515
Rus-13.122
SPA-12.956
Token favorite
Token ID4004
Feature activation+1.16e+00

Pos logit contributions
 eleph+5.866
horn+5.327
=\"+5.315
oute+5.061
aba+4.685
Neg logit contributions
REDACTED-13.577
Austral-12.310
 Austral-12.019
Rus-11.472
SPA-11.304
Token -
Token ID532
Feature activation+4.91e-01

Pos logit contributions
 eleph+6.421
horn+5.944
=\"+5.925
oute+5.604
aba+5.427
Neg logit contributions
REDACTED-10.994
Austral-9.820
 Austral-9.646
Rus-9.098
SPA-9.028
Token January
Token ID3269
Feature activation+6.29e-02

Pos logit contributions
=\"+3.934
horn+3.809
aba+3.781
 eleph+3.705
oute+3.636
Neg logit contributions
REDACTED-3.688
Austral-2.946
 Austral-2.887
alg-2.823
SPA-2.749
Token 15
Token ID1315
Feature activation+2.12e-01

Pos logit contributions
 eleph+0.500
=\"+0.484
horn+0.469
aba+0.467
oute+0.466
Neg logit contributions
REDACTED-0.500
 Austral-0.415
Austral-0.405
alg-0.388
Rus-0.386
Token,
Token ID11
Feature activation+7.70e-01

Pos logit contributions
=\"+1.260
 eleph+1.218
horn+1.210
aba+1.196
oute+1.176
Neg logit contributions
REDACTED-2.052
 Austral-1.783
Austral-1.740
alg-1.689
Rus-1.681
Tokenker
Token ID6122
Feature activation-4.27e-01

Pos logit contributions
REDACTED+0.152
 Austral+0.128
Austral+0.124
alg+0.119
Rus+0.119
Neg logit contributions
 eleph-0.134
=\"-0.125
horn-0.122
oute-0.122
aba-0.121
Token -
Token ID532
Feature activation+6.42e-01

Pos logit contributions
REDACTED+3.563
 Austral+3.005
Austral+2.920
Rus+2.822
alg+2.808
Neg logit contributions
 eleph-3.322
=\"-3.110
oute-2.990
aba-2.967
horn-2.955
Token favorite
Token ID4004
Feature activation+1.26e+00

Pos logit contributions
horn+2.381
=\"+2.378
aba+2.283
 eleph+2.260
oute+2.238
Neg logit contributions
REDACTED-7.446
Austral-6.488
 Austral-6.412
SPA-6.283
Rus-6.276
Token favorite
Token ID4004
Feature activation+1.32e+00

Pos logit contributions
=\"+4.320
horn+4.268
oute+3.923
aba+3.654
 Dover+3.175
Neg logit contributions
REDACTED-14.320
Austral-12.857
 Austral-12.421
Rus-12.395
SPA-12.288
Token favorite
Token ID4004
Feature activation+1.37e+00

Pos logit contributions
=\"+4.506
horn+4.451
oute+4.098
aba+3.800
 Dover+3.332
Neg logit contributions
REDACTED-14.797
Austral-13.324
 Austral-12.818
Rus-12.796
SPA-12.685
Token favorite
Token ID4004
Feature activation+1.30e+00

Pos logit contributions
=\"+4.705
horn+4.677
oute+4.343
 eleph+4.050
aba+4.021
Neg logit contributions
REDACTED-15.101
Austral-13.675
 Austral-13.138
Rus-12.861
SPA-12.755
Token favorite
Token ID4004
Feature activation+1.10e+00

Pos logit contributions
 eleph+6.209
horn+5.698
=\"+5.693
oute+5.404
aba+5.067
Neg logit contributions
REDACTED-13.044
Austral-11.778
 Austral-11.521
Rus-10.952
SPA-10.839
Token -
Token ID532
Feature activation+4.53e-01

Pos logit contributions
 eleph+6.242
=\"+5.846
horn+5.800
oute+5.410
aba+5.369
Neg logit contributions
REDACTED-10.398
Austral-9.199
 Austral-9.084
SPA-8.544
Rus-8.531
Token February
Token ID3945
Feature activation-5.99e-02

Pos logit contributions
=\"+3.903
horn+3.776
aba+3.756
 eleph+3.670
oute+3.621
Neg logit contributions
REDACTED-3.142
Austral-2.457
 Austral-2.406
alg-2.347
SPA-2.299
Token 9
Token ID860
Feature activation+3.57e-01

Pos logit contributions
REDACTED+0.470
 Austral+0.392
Austral+0.379
Rus+0.363
alg+0.363
Neg logit contributions
 eleph-0.498
=\"-0.467
oute-0.453
horn-0.451
aba-0.450
Token,
Token ID11
Feature activation+6.59e-01

Pos logit contributions
=\"+2.038
horn+1.955
aba+1.921
oute+1.861
 eleph+1.821
Neg logit contributions
REDACTED-3.509
 Austral-3.060
Austral-2.993
alg-2.908
SPA-2.886
Token Women
Token ID6926
Feature activation-2.97e-01

Pos logit contributions
REDACTED+1.696
 Austral+1.410
Austral+1.371
Rus+1.315
alg+1.312
Neg logit contributions
 eleph-1.714
=\"-1.615
oute-1.562
horn-1.555
aba-1.553
Token in
Token ID287
Feature activation+2.12e-01

Pos logit contributions
REDACTED+2.618
 Austral+2.249
Austral+2.167
Rus+2.100
alg+2.100
Neg logit contributions
 eleph-2.173
=\"-1.996
oute-1.946
horn-1.921
aba-1.913
Token politics
Token ID4819
Feature activation+1.35e-02

Pos logit contributions
 eleph+1.504
=\"+1.430
aba+1.382
oute+1.377
horn+1.373
Neg logit contributions
REDACTED-1.907
 Austral-1.581
Austral-1.573
Rus-1.523
alg-1.492
Token.
Token ID13
Feature activation+1.40e-02

Pos logit contributions
 eleph+0.095
=\"+0.089
horn+0.086
aba+0.086
oute+0.086
Neg logit contributions
REDACTED-0.123
 Austral-0.104
Austral-0.102
alg-0.099
Rus-0.098
Token Hide
Token ID10415
Feature activation-3.02e-01

Pos logit contributions
 eleph+0.110
=\"+0.106
horn+0.102
aba+0.102
oute+0.102
Neg logit contributions
REDACTED-0.114
 Austral-0.095
Austral-0.093
alg-0.089
Rus-0.089
Token Caption
Token ID11260
Feature activation-1.26e+00

Pos logit contributions
REDACTED+2.080
 Austral+1.692
Austral+1.619
Rus+1.544
alg+1.533
Neg logit contributions
 eleph-2.844
=\"-2.641
oute-2.578
horn-2.558
aba-2.551
Token 7
Token ID767
Feature activation+5.11e-01

Pos logit contributions
REDACTED+6.216
 Austral+5.111
Austral+4.360
Rus+4.335
 Mercury+4.063
Neg logit contributions
 eleph-14.991
oute-12.555
enhagen-12.114
=\"-11.917
yip-11.762
Token of
Token ID286
Feature activation+2.08e-01

Pos logit contributions
=\"+3.033
horn+2.921
aba+2.822
oute+2.732
acket+2.579
Neg logit contributions
REDACTED-4.871
 Austral-4.185
Austral-4.103
alg-3.987
Rus-3.974
Token 15
Token ID1315
Feature activation-4.42e-02

Pos logit contributions
=\"+1.477
 eleph+1.447
horn+1.427
aba+1.393
oute+1.392
Neg logit contributions
REDACTED-1.791
 Austral-1.498
Austral-1.476
alg-1.426
Rus-1.418
Token Photos
Token ID9434
Feature activation-2.52e-01

Pos logit contributions
REDACTED+0.338
 Austral+0.277
Austral+0.269
alg+0.259
Rus+0.257
Neg logit contributions
 eleph-0.361
=\"-0.357
horn-0.344
aba-0.343
oute-0.342
Token:
Token ID25
Feature activation-1.17e-01

Pos logit contributions
REDACTED+2.117
 Austral+1.786
Austral+1.731
Rus+1.673
alg+1.666
Neg logit contributions
 eleph-2.069
=\"-1.810
oute-1.788
aba-1.749
horn-1.742
Token,
Token ID11
Feature activation+1.14e-02

Pos logit contributions
REDACTED+4.456
 Austral+3.772
Austral+3.605
Rus+3.491
alg+3.459
Neg logit contributions
 eleph-4.561
oute-3.731
=\"-3.669
aba-3.570
horn-3.565
Token check
Token ID2198
Feature activation-2.73e-01

Pos logit contributions
 eleph+0.079
=\"+0.075
horn+0.072
oute+0.072
aba+0.072
Neg logit contributions
REDACTED-0.105
 Austral-0.089
Austral-0.087
Rus-0.084
alg-0.084
Token out
Token ID503
Feature activation-2.34e-01

Pos logit contributions
REDACTED+2.292
 Austral+1.950
Austral+1.877
alg+1.813
Rus+1.810
Neg logit contributions
 eleph-2.207
=\"-1.948
oute-1.895
aba-1.884
horn-1.871
Token our
Token ID674
Feature activation-2.29e-02

Pos logit contributions
REDACTED+1.817
 Austral+1.522
Austral+1.465
alg+1.408
Rus+1.404
Neg logit contributions
 eleph-1.988
=\"-1.814
oute-1.772
aba-1.765
horn-1.756
Token latest
Token ID3452
Feature activation-4.63e-02

Pos logit contributions
REDACTED+0.214
 Austral+0.183
Austral+0.178
Rus+0.172
alg+0.172
Neg logit contributions
 eleph-0.155
=\"-0.145
horn-0.139
oute-0.139
aba-0.139
Token issue
Token ID2071
Feature activation-1.23e+00

Pos logit contributions
REDACTED+0.403
 Austral+0.342
Austral+0.332
alg+0.320
Rus+0.320
Neg logit contributions
 eleph-0.348
=\"-0.321
oute-0.311
aba-0.309
horn-0.309
Token\n
Token ID198
Feature activation-6.30e-01

Pos logit contributions
REDACTED+6.636
 Austral+5.326
Austral+5.130
Rus+4.688
SPA+4.505
Neg logit contributions
 eleph-14.334
oute-11.402
�-11.261
enhagen-11.247
aba-11.058
Token\n
Token ID198
Feature activation-3.27e-01

Pos logit contributions
REDACTED+4.166
 Austral+3.302
Austral+3.205
Rus+3.034
alg+2.992
Neg logit contributions
 eleph-6.671
oute-5.539
=\"-5.504
aba-5.397
enhagen-5.352
TokenTravel
Token ID33074
Feature activation-8.95e-02

Pos logit contributions
REDACTED+2.669
 Austral+2.240
Austral+2.178
Rus+2.089
alg+2.087
Neg logit contributions
 eleph-2.633
=\"-2.439
oute-2.370
horn-2.356
aba-2.352
Token With
Token ID2080
Feature activation+1.15e-01

Pos logit contributions
REDACTED+0.789
 Austral+0.657
Austral+0.642
alg+0.622
Rus+0.622
Neg logit contributions
 eleph-0.660
=\"-0.617
oute-0.599
aba-0.596
horn-0.596
Token The
Token ID383
Feature activation+1.80e-01

Pos logit contributions
 eleph+1.076
=\"+1.023
oute+0.997
horn+0.994
aba+0.993
Neg logit contributions
REDACTED-0.775
 Austral-0.619
Austral-0.599
Rus-0.568
alg-0.567
Token Target
Token ID12744
Feature activation-4.81e-01

Pos logit contributions
REDACTED+1.565
 Austral+1.307
Austral+1.268
Rus+1.215
alg+1.213
Neg logit contributions
 eleph-1.595
=\"-1.499
oute-1.451
horn-1.446
aba-1.444
Token =
Token ID796
Feature activation+1.35e-01

Pos logit contributions
REDACTED+4.678
 Austral+3.990
Austral+3.894
Rus+3.825
alg+3.783
Neg logit contributions
 eleph-3.392
oute-2.726
=\"-2.692
aba-2.650
horn-2.647
Token &
Token ID1222
Feature activation-3.29e-01

Pos logit contributions
 eleph+0.962
=\"+0.909
oute+0.867
aba+0.858
horn+0.858
Neg logit contributions
REDACTED-1.199
 Austral-1.030
Austral-1.025
Rus-0.970
alg-0.960
Token '
Token ID705
Feature activation+8.13e-02

Pos logit contributions
REDACTED+2.529
 Austral+2.096
Austral+2.025
Rus+1.940
alg+1.939
Neg logit contributions
 eleph-2.790
=\"-2.623
oute-2.546
horn-2.535
aba-2.530
Tokena
Token ID64
Feature activation-4.42e-01

Pos logit contributions
 eleph+0.520
=\"+0.503
aba+0.486
horn+0.485
oute+0.481
Neg logit contributions
REDACTED-0.768
 Austral-0.667
Austral-0.646
Rus-0.624
alg-0.618
Token T
Token ID309
Feature activation-1.16e+00

Pos logit contributions
REDACTED+3.281
 Austral+2.709
Austral+2.601
alg+2.501
Rus+2.492
Neg logit contributions
 eleph-3.849
=\"-3.620
oute-3.517
horn-3.503
aba-3.494
Token ;
Token ID2162
Feature activation-9.99e-01

Pos logit contributions
REDACTED+5.864
 Austral+4.379
alg+4.021
Austral+3.951
SPA+3.903
Neg logit contributions
 eleph-12.109
=\"-11.461
horn-11.238
oute-11.172
aba-10.991
Token...
Token ID2644
Feature activation-6.01e-01

Pos logit contributions
REDACTED+7.008
 Austral+5.701
alg+5.329
Rus+5.243
Austral+5.240
Neg logit contributions
 eleph-8.623
=\"-8.148
oute-7.899
horn-7.862
aba-7.720
Token }
Token ID1782
Feature activation-2.85e-01

Pos logit contributions
REDACTED+4.055
 Austral+3.241
Austral+3.044
alg+2.996
Rus+2.964
Neg logit contributions
 eleph-5.785
=\"-5.298
oute-5.173
horn-5.106
aba-5.085
Token\n
Token ID198
Feature activation+5.72e-02

Pos logit contributions
REDACTED+2.352
 Austral+1.977
Austral+1.923
Rus+1.841
alg+1.840
Neg logit contributions
 eleph-2.253
=\"-2.109
oute-2.045
horn-2.034
aba-2.034
Token\n
Token ID198
Feature activation-3.90e-02

Pos logit contributions
=\"+0.449
 eleph+0.436
horn+0.435
aba+0.432
oute+0.429
Neg logit contributions
REDACTED-0.447
 Austral-0.373
Austral-0.360
alg-0.348
Rus-0.345
Token Pasadena
Token ID41010
Feature activation-9.51e-02

Pos logit contributions
REDACTED+2.115
 Austral+1.801
Austral+1.757
alg+1.692
Rus+1.691
Neg logit contributions
 eleph-1.668
=\"-1.558
oute-1.503
horn-1.498
aba-1.494
Token,
Token ID11
Feature activation-4.52e-02

Pos logit contributions
REDACTED+0.882
 Austral+0.754
Austral+0.738
alg+0.709
Rus+0.709
Neg logit contributions
 eleph-0.657
=\"-0.610
oute-0.587
horn-0.586
aba-0.585
Token California
Token ID3442
Feature activation-6.02e-01

Pos logit contributions
REDACTED+0.408
 Austral+0.342
Austral+0.336
Rus+0.324
alg+0.324
Neg logit contributions
 eleph-0.327
=\"-0.303
horn-0.293
aba-0.292
oute-0.292
Token.
Token ID13
Feature activation-3.08e-01

Pos logit contributions
REDACTED+4.601
 Austral+3.869
Austral+3.682
alg+3.564
Rus+3.555
Neg logit contributions
 eleph-4.973
=\"-4.671
oute-4.540
horn-4.488
aba-4.482
Token Hide
Token ID10415
Feature activation-5.88e-01

Pos logit contributions
REDACTED+2.355
 Austral+1.974
Austral+1.895
Rus+1.819
alg+1.807
Neg logit contributions
 eleph-2.630
=\"-2.433
oute-2.367
horn-2.347
aba-2.347
Token Caption
Token ID11260
Feature activation-1.15e+00

Pos logit contributions
REDACTED+3.750
 Austral+3.047
Austral+2.848
Rus+2.728
alg+2.664
Neg logit contributions
 eleph-6.087
=\"-5.346
oute-5.275
horn-5.166
aba-5.140
Token 12
Token ID1105
Feature activation+4.41e-01

Pos logit contributions
REDACTED+6.679
 Austral+5.500
Austral+4.867
alg+4.734
Rus+4.715
Neg logit contributions
 eleph-12.841
oute-10.446
enhagen-10.438
=\"-10.230
aba-9.875
Token of
Token ID286
Feature activation+1.08e-01

Pos logit contributions
=\"+2.698
horn+2.608
aba+2.530
oute+2.458
orf+2.311
Neg logit contributions
REDACTED-4.155
 Austral-3.567
Austral-3.498
alg-3.397
SPA-3.395
Token 15
Token ID1315
Feature activation-4.57e-01

Pos logit contributions
 eleph+0.785
=\"+0.779
horn+0.752
oute+0.742
aba+0.740
Neg logit contributions
REDACTED-0.911
 Austral-0.763
Austral-0.748
alg-0.720
Rus-0.720
Token Photos
Token ID9434
Feature activation-3.10e-01

Pos logit contributions
REDACTED+3.168
 Austral+2.609
Austral+2.492
Rus+2.379
alg+2.351
Neg logit contributions
 eleph-4.394
=\"-3.914
oute-3.867
horn-3.810
aba-3.803
Token:
Token ID25
Feature activation-2.46e-01

Pos logit contributions
REDACTED+2.567
 Austral+2.162
Austral+2.094
Rus+2.025
alg+2.014
Neg logit contributions
 eleph-2.614
=\"-2.244
oute-2.227
aba-2.171
horn-2.159
Token7
Token ID22
Feature activation-1.98e-01

Pos logit contributions
REDACTED+2.706
 Austral+2.242
Austral+2.177
alg+2.100
Rus+2.096
Neg logit contributions
 eleph-2.959
=\"-2.696
horn-2.604
oute-2.603
aba-2.575
Token {
Token ID1391
Feature activation-3.29e-01

Pos logit contributions
REDACTED+1.715
 Austral+1.456
Austral+1.413
Rus+1.362
alg+1.360
Neg logit contributions
 eleph-1.483
=\"-1.377
oute-1.332
horn-1.325
aba-1.323
Token(
Token ID7
Feature activation-2.66e-02

Pos logit contributions
REDACTED+2.917
 Austral+2.446
Austral+2.412
Rus+2.331
alg+2.305
Neg logit contributions
 eleph-2.617
=\"-2.218
oute-2.190
aba-2.143
horn-2.132
Token17
Token ID1558
Feature activation-3.14e-01

Pos logit contributions
REDACTED+0.207
 Austral+0.173
Austral+0.167
alg+0.160
Rus+0.159
Neg logit contributions
 eleph-0.220
=\"-0.209
oute-0.203
horn-0.202
aba-0.202
Tokens
Token ID82
Feature activation-1.84e-01

Pos logit contributions
REDACTED+2.436
 Austral+2.023
Austral+1.957
Rus+1.868
alg+1.868
Neg logit contributions
 eleph-2.636
=\"-2.486
oute-2.404
horn-2.403
aba-2.398
Token)}
Token ID38165
Feature activation-1.13e+00

Pos logit contributions
REDACTED+1.981
 Austral+1.736
Austral+1.700
Rus+1.658
alg+1.657
Neg logit contributions
 eleph-1.037
=\"-0.894
oute-0.857
horn-0.849
aba-0.846
Token 27
Token ID2681
Feature activation-1.20e-01

Pos logit contributions
REDACTED+6.351
 Austral+4.995
Rus+4.650
Austral+4.633
SPA+4.489
Neg logit contributions
 eleph-11.424
=\"-10.759
oute-10.600
aba-10.303
horn-10.258
Token.
Token ID13
Feature activation-3.87e-02

Pos logit contributions
REDACTED+0.980
 Austral+0.823
Austral+0.797
Rus+0.766
alg+0.765
Neg logit contributions
 eleph-0.953
=\"-0.891
oute-0.864
horn-0.860
aba-0.858
Token R
Token ID371
Feature activation+1.43e-01

Pos logit contributions
REDACTED+0.287
 Austral+0.236
Austral+0.229
alg+0.218
Rus+0.217
Neg logit contributions
 eleph-0.337
=\"-0.319
oute-0.310
horn-0.309
aba-0.309
Tokenxd
Token ID24954
Feature activation+2.63e-01

Pos logit contributions
 eleph+0.887
=\"+0.853
aba+0.828
oute+0.824
horn+0.819
Neg logit contributions
REDACTED-1.385
 Austral-1.205
Austral-1.166
Rus-1.118
SPA-1.115
Token8
Token ID23
Feature activation+1.87e-01

Pos logit contributions
 eleph+1.970
=\"+1.945
aba+1.876
horn+1.871
oute+1.865
Neg logit contributions
REDACTED-2.182
 Austral-1.847
Austral-1.799
Rus-1.716
alg-1.698
Token 20
Token ID1160
Feature activation+2.06e-02

Pos logit contributions
REDACTED+0.372
 Austral+0.309
Austral+0.298
Rus+0.286
alg+0.285
Neg logit contributions
 eleph-0.412
=\"-0.386
oute-0.375
horn-0.373
aba-0.373
Token,
Token ID11
Feature activation+2.79e-01

Pos logit contributions
 eleph+0.145
=\"+0.133
oute+0.128
horn+0.127
aba+0.127
Neg logit contributions
REDACTED-0.190
 Austral-0.163
Austral-0.158
alg-0.153
Rus-0.152
Token 2015
Token ID1853
Feature activation-2.07e-01

Pos logit contributions
 eleph+1.379
=\"+1.280
horn+1.221
aba+1.196
oute+1.195
Neg logit contributions
REDACTED-3.086
 Austral-2.687
Austral-2.663
SPA-2.578
Rus-2.572
Token.
Token ID13
Feature activation+5.55e-02

Pos logit contributions
REDACTED+1.698
 Austral+1.429
Austral+1.382
Rus+1.329
alg+1.328
Neg logit contributions
 eleph-1.647
=\"-1.542
oute-1.493
horn-1.484
aba-1.481
Token Hide
Token ID10415
Feature activation-2.75e-01

Pos logit contributions
 eleph+0.433
=\"+0.418
horn+0.403
aba+0.402
oute+0.402
Neg logit contributions
REDACTED-0.454
 Austral-0.376
Austral-0.368
alg-0.355
Rus-0.352
Token Caption
Token ID11260
Feature activation-1.12e+00

Pos logit contributions
REDACTED+1.895
 Austral+1.542
Austral+1.475
Rus+1.407
alg+1.398
Neg logit contributions
 eleph-2.595
=\"-2.407
oute-2.350
horn-2.331
aba-2.326
Token 8
Token ID807
Feature activation+4.81e-01

Pos logit contributions
REDACTED+6.670
 Austral+5.499
Austral+5.044
Rus+4.919
Helper+4.738
Neg logit contributions
 eleph-12.342
oute-9.754
enhagen-9.666
=\"-9.404
�-9.392
Token of
Token ID286
Feature activation+1.14e-01

Pos logit contributions
=\"+2.902
horn+2.798
aba+2.709
oute+2.639
orf+2.484
Neg logit contributions
REDACTED-4.551
 Austral-3.911
Austral-3.838
alg-3.729
SPA-3.712
Token 15
Token ID1315
Feature activation-2.26e-01

Pos logit contributions
 eleph+0.834
=\"+0.827
horn+0.800
oute+0.787
aba+0.787
Neg logit contributions
REDACTED-0.969
 Austral-0.811
Austral-0.795
alg-0.766
Rus-0.765
Token Photos
Token ID9434
Feature activation-1.87e-01

Pos logit contributions
REDACTED+1.653
 Austral+1.358
Austral+1.309
Rus+1.250
alg+1.248
Neg logit contributions
 eleph-2.002
=\"-1.879
oute-1.829
horn-1.820
aba-1.817
Token:
Token ID25
Feature activation-1.25e-01

Pos logit contributions
REDACTED+1.596
 Austral+1.350
Austral+1.310
Rus+1.264
alg+1.260
Neg logit contributions
 eleph-1.484
=\"-1.329
oute-1.302
aba-1.281
horn-1.279
Token Le
Token ID1004
Feature activation-2.01e-01

Pos logit contributions
REDACTED+3.317
 Austral+2.808
Austral+2.720
alg+2.618
Rus+2.617
Neg logit contributions
 eleph-3.011
=\"-2.810
oute-2.719
horn-2.705
aba-2.702
Tokena
Token ID64
Feature activation+1.27e-01

Pos logit contributions
REDACTED+1.806
 Austral+1.540
Austral+1.503
alg+1.456
Rus+1.451
Neg logit contributions
 eleph-1.507
=\"-1.341
oute-1.293
horn-1.273
aba-1.272
Token Suzuki
Token ID35807
Feature activation-1.11e-01

Pos logit contributions
 eleph+0.931
=\"+0.918
horn+0.894
aba+0.883
oute+0.877
Neg logit contributions
REDACTED-1.071
 Austral-0.890
Austral-0.878
alg-0.850
Rus-0.835
Token,
Token ID11
Feature activation-3.27e-01

Pos logit contributions
REDACTED+1.049
 Austral+0.901
Austral+0.879
alg+0.851
Rus+0.850
Neg logit contributions
 eleph-0.735
=\"-0.693
horn-0.663
oute-0.663
aba-0.661
Token The
Token ID383
Feature activation-6.10e-03

Pos logit contributions
REDACTED+2.120
 Austral+1.711
Austral+1.625
Rus+1.542
alg+1.533
Neg logit contributions
 eleph-3.216
=\"-2.984
oute-2.920
horn-2.894
aba-2.892
Token Chronicle
Token ID14160
Feature activation-1.08e+00

Pos logit contributions
REDACTED+0.053
 Austral+0.045
Austral+0.044
alg+0.042
Rus+0.042
Neg logit contributions
 eleph-0.043
=\"-0.042
horn-0.041
oute-0.041
aba-0.040
Token 7
Token ID767
Feature activation+4.80e-01

Pos logit contributions
REDACTED+6.361
 Austral+5.407
Austral+4.867
Rus+4.644
alg+4.578
Neg logit contributions
 eleph-11.162
=\"-9.842
oute-9.813
aba-9.432
horn-9.390
Token of
Token ID286
Feature activation+1.06e-02

Pos logit contributions
=\"+2.897
horn+2.790
aba+2.697
oute+2.638
 eleph+2.507
Neg logit contributions
REDACTED-4.570
 Austral-3.906
Austral-3.844
alg-3.721
Rus-3.716
Token 15
Token ID1315
Feature activation-5.63e-02

Pos logit contributions
 eleph+0.080
=\"+0.078
horn+0.075
oute+0.074
aba+0.074
Neg logit contributions
REDACTED-0.089
 Austral-0.074
Austral-0.073
alg-0.070
Rus-0.070
Token Photo
Token ID5555
Feature activation-1.62e-01

Pos logit contributions
REDACTED+0.406
 Austral+0.327
Austral+0.319
alg+0.306
Rus+0.305
Neg logit contributions
 eleph-0.491
=\"-0.479
horn-0.464
oute-0.462
aba-0.461
Token:
Token ID25
Feature activation-3.68e-01

Pos logit contributions
REDACTED+1.399
 Austral+1.185
Austral+1.150
Rus+1.108
alg+1.107
Neg logit contributions
 eleph-1.248
=\"-1.141
oute-1.110
aba-1.098
horn-1.098
Tokens
Token ID82
Feature activation+4.23e-02

Pos logit contributions
REDACTED+2.713
 Austral+2.243
Austral+2.166
alg+2.077
Rus+2.076
Neg logit contributions
 eleph-3.067
=\"-2.867
oute-2.780
horn-2.771
aba-2.766
Token invoke
Token ID26342
Feature activation-2.78e-01

Pos logit contributions
 eleph+0.318
=\"+0.300
oute+0.294
aba+0.290
horn+0.285
Neg logit contributions
REDACTED-0.364
 Austral-0.307
Austral-0.301
Rus-0.286
alg-0.285
Token local
Token ID1957
Feature activation-1.31e-01

Pos logit contributions
REDACTED+2.372
 Austral+2.004
Austral+1.934
alg+1.872
Rus+1.864
Neg logit contributions
 eleph-2.116
=\"-1.971
horn-1.906
oute-1.898
aba-1.897
Token -
Token ID532
Feature activation-6.97e-02

Pos logit contributions
REDACTED+1.044
 Austral+0.872
Austral+0.845
Rus+0.810
alg+0.809
Neg logit contributions
 eleph-1.073
=\"-1.008
oute-0.977
horn-0.972
aba-0.971
Tokenf
Token ID69
Feature activation+2.95e-01

Pos logit contributions
REDACTED+0.654
 Austral+0.563
Austral+0.548
Rus+0.528
alg+0.525
Neg logit contributions
 eleph-0.470
=\"-0.438
oute-0.423
horn-0.421
aba-0.420
Token test
Token ID1332
Feature activation-1.08e+00

Pos logit contributions
 eleph+2.479
=\"+2.355
oute+2.291
aba+2.257
horn+2.247
Neg logit contributions
REDACTED-2.251
 Austral-1.882
Austral-1.843
Rus-1.762
alg-1.728
Token -
Token ID532
Feature activation-6.14e-02

Pos logit contributions
REDACTED+6.986
 Austral+5.483
alg+5.129
Austral+5.086
Rus+4.954
Neg logit contributions
 eleph-9.682
=\"-8.962
horn-8.863
aba-8.810
oute-8.737
Tokenp
Token ID79
Feature activation+2.62e-01

Pos logit contributions
REDACTED+0.534
 Austral+0.454
Austral+0.442
Rus+0.425
alg+0.423
Neg logit contributions
 eleph-0.456
=\"-0.427
oute-0.413
horn-0.411
aba-0.410
Token data
Token ID1366
Feature activation-3.46e-03

Pos logit contributions
 eleph+2.127
=\"+2.012
oute+1.954
aba+1.929
horn+1.918
Neg logit contributions
REDACTED-2.086
 Austral-1.756
Austral-1.719
Rus-1.639
alg-1.614
Token.
Token ID13
Feature activation-4.76e-02

Pos logit contributions
REDACTED+0.027
 Austral+0.023
Austral+0.022
Rus+0.021
SPA+0.021
Neg logit contributions
 eleph-0.028
=\"-0.027
oute-0.026
horn-0.026
aba-0.026
Tokenjson
Token ID17752
Feature activation-8.74e-01

Pos logit contributions
REDACTED+0.343
 Austral+0.284
Austral+0.273
Rus+0.259
alg+0.257
Neg logit contributions
 eleph-0.423
=\"-0.402
oute-0.392
aba-0.390
horn-0.389
Token H
Token ID367
Feature activation-4.93e-01

Pos logit contributions
REDACTED+7.467
 Austral+6.609
Austral+5.997
alg+5.925
Rus+5.904
Neg logit contributions
 eleph-7.837
=\"-7.291
oute-7.091
horn-6.988
aba-6.870
Token L
Token ID406
Feature activation-5.65e-01

Pos logit contributions
REDACTED+3.790
 Austral+3.190
Austral+3.069
alg+2.960
Rus+2.954
Neg logit contributions
 eleph-4.138
=\"-3.880
oute-3.745
horn-3.715
aba-3.686
Token D
Token ID360
Feature activation-2.85e-01

Pos logit contributions
REDACTED+4.210
 Austral+3.561
Austral+3.385
alg+3.312
Rus+3.272
Neg logit contributions
 eleph-4.844
=\"-4.562
oute-4.366
horn-4.366
aba-4.325
Tokenatt
Token ID1078
Feature activation-7.66e-02

Pos logit contributions
REDACTED+2.253
 Austral+1.884
Austral+1.821
alg+1.749
Rus+1.747
Neg logit contributions
 eleph-2.350
=\"-2.205
oute-2.133
horn-2.125
aba-2.121
Tokenu
Token ID84
Feature activation-2.78e-01

Pos logit contributions
REDACTED+0.575
 Austral+0.469
Austral+0.454
Rus+0.434
alg+0.431
Neg logit contributions
 eleph-0.664
=\"-0.626
horn-0.611
oute-0.611
aba-0.611
Token and
Token ID290
Feature activation-1.06e+00

Pos logit contributions
REDACTED+2.561
 Austral+2.194
Austral+2.138
Rus+2.063
alg+2.062
Neg logit contributions
 eleph-1.927
=\"-1.787
oute-1.722
horn-1.715
aba-1.712
Token C
Token ID327
Feature activation-7.03e-01

Pos logit contributions
REDACTED+7.585
 Austral+6.752
Austral+6.075
Rus+5.949
alg+5.905
Neg logit contributions
 eleph-9.158
=\"-8.354
oute-8.167
horn-7.967
aba-7.866
Token K
Token ID509
Feature activation-6.17e-01

Pos logit contributions
REDACTED+5.125
 Austral+4.329
Austral+4.074
alg+3.944
Rus+3.942
Neg logit contributions
 eleph-6.116
=\"-5.793
oute-5.531
horn-5.507
aba-5.456
Token Pr
Token ID1736
Feature activation-2.80e-01

Pos logit contributions
REDACTED+4.574
 Austral+3.884
Austral+3.687
alg+3.587
Rus+3.550
Neg logit contributions
 eleph-5.298
=\"-4.997
oute-4.788
horn-4.765
aba-4.719
Tokenas
Token ID292
Feature activation+3.18e-01

Pos logit contributions
REDACTED+2.221
 Austral+1.865
Austral+1.802
alg+1.743
Rus+1.729
Neg logit contributions
 eleph-2.304
=\"-2.148
oute-2.067
horn-2.065
aba-2.048
Tokenad
Token ID324
Feature activation+1.79e-01

Pos logit contributions
 eleph+1.918
aba+1.888
oute+1.828
horn+1.822
=\"+1.797
Neg logit contributions
REDACTED-3.189
 Austral-2.652
Austral-2.597
Rus-2.534
SPA-2.499
Token Council
Token ID4281
Feature activation-1.59e-01

Pos logit contributions
REDACTED+1.740
 Austral+1.414
Austral+1.364
Rus+1.298
alg+1.298
Neg logit contributions
 eleph-2.240
=\"-2.117
oute-2.059
horn-2.053
aba-2.049
Token House
Token ID2097
Feature activation-2.73e-01

Pos logit contributions
REDACTED+1.653
 Austral+1.443
Austral+1.411
Rus+1.368
alg+1.368
Neg logit contributions
 eleph-0.916
=\"-0.838
oute-0.801
horn-0.798
aba-0.795
Token National
Token ID2351
Feature activation+1.18e-01

Pos logit contributions
REDACTED+2.479
 Austral+2.125
Austral+2.058
Rus+1.989
alg+1.987
Neg logit contributions
 eleph-1.928
=\"-1.786
oute-1.724
horn-1.710
aba-1.707
Token Historic
Token ID30426
Feature activation-5.99e-02

Pos logit contributions
 eleph+0.822
=\"+0.783
horn+0.762
aba+0.754
oute+0.748
Neg logit contributions
REDACTED-1.066
Austral-0.884
 Austral-0.884
Rus-0.843
alg-0.842
Token Site
Token ID14413
Feature activation-3.22e-01

Pos logit contributions
REDACTED+0.467
 Austral+0.385
Austral+0.375
Rus+0.359
alg+0.358
Neg logit contributions
 eleph-0.500
=\"-0.472
oute-0.459
horn-0.457
aba-0.457
Token.[
Token ID3693
Feature activation-1.04e+00

Pos logit contributions
REDACTED+2.846
 Austral+2.436
Austral+2.339
Rus+2.264
alg+2.259
Neg logit contributions
 eleph-2.364
=\"-2.192
oute-2.120
horn-2.099
aba-2.089
Token57
Token ID3553
Feature activation-1.92e-02

Pos logit contributions
REDACTED+5.704
 Austral+4.059
Austral+3.878
alg+3.810
Rus+3.806
Neg logit contributions
 eleph-11.151
=\"-10.289
oute-10.132
horn-9.862
aba-9.839
Token]
Token ID60
Feature activation+1.54e-01

Pos logit contributions
REDACTED+0.147
 Austral+0.122
Austral+0.118
Rus+0.113
alg+0.113
Neg logit contributions
 eleph-0.163
=\"-0.153
oute-0.148
horn-0.148
aba-0.148
Token
Token ID198
Feature activation+1.48e-01

Pos logit contributions
 eleph+1.235
=\"+1.212
horn+1.185
aba+1.180
oute+1.167
Neg logit contributions
REDACTED-1.194
 Austral-0.961
Austral-0.956
alg-0.918
Rus-0.915
Token
Token ID198
Feature activation+5.73e-02

Pos logit contributions
=\"+1.139
horn+1.103
aba+1.092
oute+1.078
 eleph+1.068
Neg logit contributions
REDACTED-1.170
 Austral-0.982
Austral-0.948
alg-0.915
Rus-0.908
TokenSchool
Token ID26130
Feature activation+8.35e-02

Pos logit contributions
 eleph+0.423
=\"+0.423
horn+0.412
aba+0.410
oute+0.406
Neg logit contributions
REDACTED-0.469
 Austral-0.402
Austral-0.378
alg-0.367
Rus-0.364
Token "
Token ID366
Feature activation+1.16e-01

Pos logit contributions
REDACTED+4.547
 Austral+3.871
Austral+3.685
alg+3.617
Rus+3.584
Neg logit contributions
 eleph-4.141
=\"-3.886
oute-3.779
horn-3.737
aba-3.736
TokenP
Token ID47
Feature activation+2.54e-02

Pos logit contributions
 eleph+0.577
=\"+0.520
oute+0.492
horn+0.489
aba+0.487
Neg logit contributions
REDACTED-1.298
 Austral-1.146
Austral-1.121
alg-1.091
Rus-1.090
Tokenulse
Token ID9615
Feature activation+3.26e-01

Pos logit contributions
 eleph+0.141
=\"+0.129
oute+0.123
horn+0.123
aba+0.122
Neg logit contributions
REDACTED-0.268
 Austral-0.235
Austral-0.230
Rus-0.223
alg-0.223
Token amount
Token ID2033
Feature activation-4.96e-01

Pos logit contributions
 eleph+1.459
=\"+1.397
horn+1.305
aba+1.299
oute+1.283
Neg logit contributions
REDACTED-3.682
 Austral-3.260
Austral-3.207
Rus-3.136
SPA-3.134
Token"
Token ID1
Feature activation-7.58e-01

Pos logit contributions
REDACTED+3.872
 Austral+3.296
Austral+3.147
Rus+3.032
SPA+2.978
Neg logit contributions
 eleph-4.329
oute-3.754
=\"-3.724
horn-3.658
aba-3.614
Token,
Token ID837
Feature activation-1.04e+00

Pos logit contributions
REDACTED+3.385
 Austral+2.464
Austral+2.194
Rus+2.145
alg+2.071
Neg logit contributions
 eleph-8.861
=\"-8.210
oute-8.137
aba-7.940
horn-7.935
Token Range
Token ID13667
Feature activation-6.71e-01

Pos logit contributions
REDACTED+8.896
 Austral+7.955
Austral+7.351
Rus+7.310
alg+7.302
Neg logit contributions
 eleph-7.535
=\"-6.590
oute-6.451
aba-6.328
horn-6.258
Token (
Token ID357
Feature activation-8.14e-02

Pos logit contributions
REDACTED+5.212
 Austral+4.279
Austral+4.121
Rus+4.043
SPA+3.969
Neg logit contributions
 eleph-6.148
oute-5.123
=\"-5.109
aba-4.922
horn-4.889
Token 0
Token ID657
Feature activation-4.73e-01

Pos logit contributions
REDACTED+0.664
 Austral+0.558
Austral+0.540
alg+0.518
Rus+0.518
Neg logit contributions
 eleph-0.652
=\"-0.610
oute-0.591
horn-0.588
aba-0.587
Token,
Token ID837
Feature activation-3.39e-01

Pos logit contributions
REDACTED+2.521
 Austral+1.897
Austral+1.762
Rus+1.670
alg+1.632
Neg logit contributions
 eleph-5.369
=\"-4.842
oute-4.802
horn-4.685
aba-4.680
Token 0
Token ID657
Feature activation-1.24e-01

Pos logit contributions
REDACTED+2.646
 Austral+2.206
Austral+2.116
Rus+2.036
alg+2.034
Neg logit contributions
 eleph-2.920
=\"-2.638
oute-2.586
aba-2.545
horn-2.543
Token Nation
Token ID8741
Feature activation-3.77e-01

Pos logit contributions
REDACTED+2.153
 Austral+1.913
Austral+1.870
alg+1.817
Rus+1.817
Neg logit contributions
 eleph-0.868
=\"-0.770
oute-0.718
horn-0.714
aba-0.713
Token is
Token ID318
Feature activation+1.57e-01

Pos logit contributions
REDACTED+3.427
 Austral+2.949
Austral+2.825
Rus+2.759
alg+2.703
Neg logit contributions
 eleph-2.762
=\"-2.414
oute-2.395
aba-2.330
horn-2.309
Token reader
Token ID9173
Feature activation-2.73e-01

Pos logit contributions
 eleph+0.993
=\"+0.925
aba+0.888
horn+0.887
oute+0.877
Neg logit contributions
REDACTED-1.508
 Austral-1.311
Austral-1.278
alg-1.243
SPA-1.243
Token supported
Token ID4855
Feature activation-2.51e-01

Pos logit contributions
REDACTED+2.147
 Austral+1.789
Austral+1.726
alg+1.657
Rus+1.657
Neg logit contributions
 eleph-2.288
=\"-2.129
oute-2.078
horn-2.055
aba-2.048
Token:
Token ID25
Feature activation-3.58e-01

Pos logit contributions
REDACTED+2.109
 Austral+1.767
Austral+1.715
alg+1.654
Rus+1.653
Neg logit contributions
 eleph-2.047
=\"-1.799
oute-1.783
aba-1.750
horn-1.744
Token Chip
Token ID17869
Feature activation-1.03e+00

Pos logit contributions
REDACTED+3.300
 Austral+2.822
Austral+2.710
alg+2.645
Rus+2.636
Neg logit contributions
 eleph-2.592
=\"-2.276
oute-2.232
horn-2.151
aba-2.151
Token in
Token ID287
Feature activation-4.60e-01

Pos logit contributions
REDACTED+7.277
 Austral+6.177
Austral+5.812
Rus+5.537
alg+5.471
Neg logit contributions
 eleph-9.802
=\"-8.078
oute-8.076
enhagen-7.829
aba-7.814
Token $
Token ID720
Feature activation-4.23e-01

Pos logit contributions
REDACTED+3.996
 Austral+3.421
Austral+3.282
alg+3.189
Rus+3.175
Neg logit contributions
 eleph-3.600
=\"-3.166
oute-3.115
aba-3.007
horn-3.006
Token10
Token ID940
Feature activation-2.29e-01

Pos logit contributions
REDACTED+3.069
 Austral+2.469
Austral+2.424
alg+2.319
Rus+2.308
Neg logit contributions
 eleph-4.049
=\"-3.530
oute-3.482
aba-3.380
horn-3.339
Token or
Token ID393
Feature activation-2.63e-01

Pos logit contributions
REDACTED+2.079
 Austral+1.782
Austral+1.727
alg+1.670
Rus+1.666
Neg logit contributions
 eleph-1.677
=\"-1.500
oute-1.463
aba-1.438
horn-1.434
Token more
Token ID517
Feature activation+3.46e-01

Pos logit contributions
REDACTED+2.097
 Austral+1.760
Austral+1.695
alg+1.630
Rus+1.622
Neg logit contributions
 eleph-2.197
=\"-2.004
oute-1.961
aba-1.932
horn-1.928
Token212
Token ID21777
Feature activation+1.44e-01

Pos logit contributions
REDACTED+2.391
 Austral+1.979
Austral+1.915
Rus+1.837
alg+1.835
Neg logit contributions
 eleph-2.587
=\"-2.390
oute-2.325
horn-2.308
aba-2.292
Token x
Token ID2124
Feature activation-1.69e-01

Pos logit contributions
 eleph+0.911
=\"+0.881
horn+0.841
oute+0.841
aba+0.837
Neg logit contributions
REDACTED-1.368
 Austral-1.173
Austral-1.155
alg-1.118
Rus-1.113
Token.
Token ID13
Feature activation-1.78e-01

Pos logit contributions
REDACTED+1.381
 Austral+1.158
Austral+1.122
Rus+1.077
alg+1.077
Neg logit contributions
 eleph-1.361
=\"-1.265
oute-1.227
horn-1.220
aba-1.218
Tokenx
Token ID87
Feature activation-6.39e-01

Pos logit contributions
REDACTED+1.741
 Austral+1.504
Austral+1.469
alg+1.421
Rus+1.421
Neg logit contributions
 eleph-1.165
=\"-1.051
oute-1.012
horn-1.002
aba-1.001
Token.
Token ID13
Feature activation-7.80e-01

Pos logit contributions
REDACTED+4.758
 Austral+3.878
Austral+3.789
Rus+3.605
alg+3.598
Neg logit contributions
 eleph-6.253
=\"-5.076
oute-5.052
enhagen-4.894
aba-4.886
Tokenx
Token ID87
Feature activation-1.01e+00

Pos logit contributions
REDACTED+6.996
Austral+5.754
 Austral+5.743
alg+5.691
Rus+5.603
Neg logit contributions
 eleph-6.330
=\"-5.007
oute-4.900
enhagen-4.738
aba-4.675
Token.
Token ID13
Feature activation-6.21e-01

Pos logit contributions
REDACTED+7.001
 Austral+5.386
Austral+5.341
alg+5.263
Rus+5.211
Neg logit contributions
 eleph-10.357
oute-8.362
enhagen-8.338
=\"-8.316
horn-7.851
Token108
Token ID15711
Feature activation-1.40e-01

Pos logit contributions
REDACTED+3.716
 Austral+2.838
Austral+2.735
alg+2.621
Rus+2.606
Neg logit contributions
 eleph-6.375
=\"-5.994
oute-5.855
horn-5.772
aba-5.766
Token 10
Token ID838
Feature activation+4.16e-02

Pos logit contributions
REDACTED+1.188
 Austral+1.007
Austral+0.975
Rus+0.938
alg+0.937
Neg logit contributions
 eleph-1.082
=\"-1.005
oute-0.973
horn-0.966
aba-0.966
Token46
Token ID3510
Feature activation-1.99e-01

Pos logit contributions
 eleph+0.355
=\"+0.346
aba+0.335
horn+0.335
oute+0.333
Neg logit contributions
REDACTED-0.306
 Austral-0.253
Austral-0.244
Rus-0.233
alg-0.232
Token358
Token ID31128
Feature activation+1.13e-01

Pos logit contributions
REDACTED+1.563
 Austral+1.298
Austral+1.256
Rus+1.205
alg+1.203
Neg logit contributions
 eleph-1.660
=\"-1.547
oute-1.502
horn-1.493
aba-1.488
Token-
Token ID12
Feature activation+4.80e-01

Pos logit contributions
 eleph+3.267
=\"+3.222
horn+3.168
oute+3.121
aba+3.096
Neg logit contributions
REDACTED-4.341
 Austral-3.737
Austral-3.534
alg-3.463
SPA-3.459
Token excerpt
Token ID20911
Feature activation-1.26e-01

Pos logit contributions
 eleph+3.319
=\"+2.996
oute+2.977
horn+2.960
aba+2.938
Neg logit contributions
REDACTED-4.376
 Austral-3.676
Austral-3.618
SPA-3.526
Rus-3.510
Token from
Token ID422
Feature activation+5.62e-02

Pos logit contributions
REDACTED+1.194
 Austral+1.024
Austral+1.004
alg+0.967
Rus+0.967
Neg logit contributions
 eleph-0.846
=\"-0.780
oute-0.753
horn-0.752
aba-0.749
Token The
Token ID383
Feature activation+3.00e-01

Pos logit contributions
 eleph+0.462
=\"+0.438
horn+0.426
oute+0.425
aba+0.424
Neg logit contributions
REDACTED-0.445
 Austral-0.361
Austral-0.355
alg-0.340
Rus-0.339
Token Bh
Token ID16581
Feature activation+2.61e-02

Pos logit contributions
 eleph+2.284
=\"+2.188
horn+2.154
oute+2.142
aba+2.129
Neg logit contributions
REDACTED-2.505
 Austral-2.034
Austral-2.020
SPA-1.921
alg-1.912
Tokenag
Token ID363
Feature activation-1.01e+00

Pos logit contributions
 eleph+0.153
=\"+0.145
aba+0.143
oute+0.141
horn+0.140
Neg logit contributions
REDACTED-0.267
 Austral-0.230
Austral-0.223
Rus-0.215
SPA-0.215
Tokenav
Token ID615
Feature activation-8.50e-02

Pos logit contributions
REDACTED+5.669
alg+4.727
 Austral+4.675
Rus+4.400
Austral+4.290
Neg logit contributions
 eleph-10.009
=\"-9.275
oute-9.063
horn-8.706
aba-8.526
Tokenad
Token ID324
Feature activation+4.85e-02

Pos logit contributions
REDACTED+0.787
 Austral+0.667
Austral+0.649
Rus+0.627
SPA+0.622
Neg logit contributions
 eleph-0.589
=\"-0.553
oute-0.535
aba-0.534
horn-0.533
Token G
Token ID402
Feature activation-6.69e-01

Pos logit contributions
 eleph+0.472
=\"+0.448
oute+0.436
horn+0.435
aba+0.434
Neg logit contributions
REDACTED-0.311
 Austral-0.247
Austral-0.237
Rus-0.224
alg-0.224
Tokenita
Token ID5350
Feature activation-1.14e-01

Pos logit contributions
REDACTED+4.609
 Austral+4.225
alg+4.068
SPA+3.951
Austral+3.918
Neg logit contributions
 eleph-5.859
=\"-5.246
oute-4.923
enhagen-4.871
horn-4.850
Token
Token ID198
Feature activation+1.01e-01

Pos logit contributions
REDACTED+0.890
 Austral+0.734
Austral+0.714
Rus+0.682
alg+0.681
Neg logit contributions
 eleph-0.959
=\"-0.904
oute-0.879
horn-0.875
aba-0.874
Token =
Token ID796
Feature activation+1.35e-01

Pos logit contributions
REDACTED+4.678
 Austral+3.990
Austral+3.894
Rus+3.825
alg+3.783
Neg logit contributions
 eleph-3.392
oute-2.726
=\"-2.692
aba-2.650
horn-2.647
Token &
Token ID1222
Feature activation-3.29e-01

Pos logit contributions
 eleph+0.962
=\"+0.909
oute+0.867
aba+0.858
horn+0.858
Neg logit contributions
REDACTED-1.199
 Austral-1.030
Austral-1.025
Rus-0.970
alg-0.960
Token '
Token ID705
Feature activation+8.13e-02

Pos logit contributions
REDACTED+2.529
 Austral+2.096
Austral+2.025
Rus+1.940
alg+1.939
Neg logit contributions
 eleph-2.790
=\"-2.623
oute-2.546
horn-2.535
aba-2.530
Tokena
Token ID64
Feature activation-4.42e-01

Pos logit contributions
 eleph+0.520
=\"+0.503
aba+0.486
horn+0.485
oute+0.481
Neg logit contributions
REDACTED-0.768
 Austral-0.667
Austral-0.646
Rus-0.624
alg-0.618
Token T
Token ID309
Feature activation-1.16e+00

Pos logit contributions
REDACTED+3.281
 Austral+2.709
Austral+2.601
alg+2.501
Rus+2.492
Neg logit contributions
 eleph-3.849
=\"-3.620
oute-3.517
horn-3.503
aba-3.494
Token ;
Token ID2162
Feature activation-9.99e-01

Pos logit contributions
REDACTED+5.864
 Austral+4.379
alg+4.021
Austral+3.951
SPA+3.903
Neg logit contributions
 eleph-12.109
=\"-11.461
horn-11.238
oute-11.172
aba-10.991
Token...
Token ID2644
Feature activation-6.01e-01

Pos logit contributions
REDACTED+7.008
 Austral+5.701
alg+5.329
Rus+5.243
Austral+5.240
Neg logit contributions
 eleph-8.623
=\"-8.148
oute-7.899
horn-7.862
aba-7.720
Token }
Token ID1782
Feature activation-2.85e-01

Pos logit contributions
REDACTED+4.055
 Austral+3.241
Austral+3.044
alg+2.996
Rus+2.964
Neg logit contributions
 eleph-5.785
=\"-5.298
oute-5.173
horn-5.106
aba-5.085
Token
Token ID198
Feature activation+5.72e-02

Pos logit contributions
REDACTED+2.352
 Austral+1.977
Austral+1.923
Rus+1.841
alg+1.840
Neg logit contributions
 eleph-2.253
=\"-2.109
oute-2.045
horn-2.034
aba-2.034
Token
Token ID198
Feature activation-3.90e-02

Pos logit contributions
=\"+0.449
 eleph+0.436
horn+0.435
aba+0.432
oute+0.429
Neg logit contributions
REDACTED-0.447
 Austral-0.373
Austral-0.360
alg-0.348
Rus-0.345
TokenUnlike
Token ID18521
Feature activation+2.05e-01

Pos logit contributions
REDACTED+0.317
 Austral+0.270
Austral+0.257
alg+0.247
Rus+0.245
Neg logit contributions
 eleph-0.303
=\"-0.293
horn-0.285
oute-0.283
aba-0.283
Token click
Token ID3904
Feature activation-2.67e-01

Pos logit contributions
 eleph+0.045
=\"+0.041
oute+0.040
horn+0.040
aba+0.039
Neg logit contributions
REDACTED-0.065
 Austral-0.056
Austral-0.054
Rus-0.052
alg-0.052
Token below
Token ID2174
Feature activation-4.10e-01

Pos logit contributions
REDACTED+2.001
 Austral+1.650
Austral+1.594
alg+1.520
Rus+1.519
Neg logit contributions
 eleph-2.327
=\"-2.185
oute-2.127
horn-2.114
aba-2.107
Token to
Token ID284
Feature activation+8.76e-02

Pos logit contributions
REDACTED+3.769
 Austral+3.240
Austral+3.150
Rus+3.047
alg+3.041
Neg logit contributions
 eleph-2.842
=\"-2.652
oute-2.551
horn-2.538
aba-2.533
Token become
Token ID1716
Feature activation+2.55e-02

Pos logit contributions
 eleph+0.583
=\"+0.554
aba+0.529
oute+0.528
horn+0.528
Neg logit contributions
REDACTED-0.815
 Austral-0.698
Austral-0.680
SPA-0.664
Rus-0.662
Token an
Token ID281
Feature activation-1.56e-01

Pos logit contributions
 eleph+0.199
=\"+0.186
oute+0.179
horn+0.179
aba+0.178
Neg logit contributions
REDACTED-0.212
 Austral-0.178
Austral-0.173
alg-0.168
Rus-0.167
Token In
Token ID554
Feature activation-9.84e-01

Pos logit contributions
REDACTED+1.123
 Austral+0.914
Austral+0.882
alg+0.842
Rus+0.841
Neg logit contributions
 eleph-1.392
=\"-1.317
oute-1.277
horn-1.273
aba-1.271
TokenDaily
Token ID28545
Feature activation-5.64e-01

Pos logit contributions
REDACTED+3.880
Austral+2.609
 Austral+2.549
Rus+2.207
alg+2.038
Neg logit contributions
 eleph-12.463
=\"-10.920
oute-10.835
horn-10.773
enhagen-10.720
Token supporter
Token ID15525
Feature activation-5.96e-01

Pos logit contributions
REDACTED+4.592
 Austral+3.903
Austral+3.770
Rus+3.600
alg+3.595
Neg logit contributions
 eleph-4.523
=\"-4.214
oute-4.113
horn-4.080
aba-4.041
Token.
Token ID13
Feature activation-3.70e-01

Pos logit contributions
REDACTED+4.665
 Austral+3.907
Austral+3.746
alg+3.614
Rus+3.614
Neg logit contributions
 eleph-4.986
=\"-4.574
oute-4.502
horn-4.470
aba-4.443
Token Powered
Token ID45090
Feature activation-1.95e-01

Pos logit contributions
REDACTED+2.132
 Austral+1.647
Austral+1.568
Rus+1.469
alg+1.468
Neg logit contributions
 eleph-3.844
=\"-3.657
oute-3.570
horn-3.558
aba-3.553
Token by
Token ID416
Feature activation+2.40e-01

Pos logit contributions
REDACTED+1.794
 Austral+1.552
Austral+1.498
alg+1.455
Rus+1.432
Neg logit contributions
 eleph-1.392
=\"-1.249
oute-1.210
horn-1.209
aba-1.206
Token7
Token ID22
Feature activation-2.59e-01

Pos logit contributions
 eleph+1.772
=\"+1.764
aba+1.726
oute+1.720
horn+1.699
Neg logit contributions
REDACTED-2.157
 Austral-1.854
Austral-1.770
SPA-1.686
Rus-1.686
Token {
Token ID1391
Feature activation-1.85e-01

Pos logit contributions
REDACTED+2.187
 Austral+1.859
Austral+1.793
Rus+1.734
alg+1.725
Neg logit contributions
 eleph-2.037
=\"-1.836
oute-1.793
aba-1.772
horn-1.768
Token(
Token ID7
Feature activation-4.76e-03

Pos logit contributions
REDACTED+1.679
 Austral+1.429
Austral+1.396
Rus+1.348
alg+1.343
Neg logit contributions
 eleph-1.355
=\"-1.217
oute-1.184
aba-1.169
horn-1.168
Token150
Token ID8628
Feature activation-1.90e-01

Pos logit contributions
REDACTED+0.037
 Austral+0.031
Austral+0.030
alg+0.029
Rus+0.029
Neg logit contributions
 eleph-0.039
=\"-0.037
oute-0.036
horn-0.036
aba-0.036
Tokens
Token ID82
Feature activation-7.71e-02

Pos logit contributions
REDACTED+1.451
 Austral+1.201
Austral+1.161
Rus+1.111
alg+1.110
Neg logit contributions
 eleph-1.633
=\"-1.527
oute-1.486
horn-1.477
aba-1.475
Token)}
Token ID38165
Feature activation-9.80e-01

Pos logit contributions
REDACTED+0.843
 Austral+0.742
Austral+0.725
Rus+0.705
alg+0.705
Neg logit contributions
 eleph-0.405
=\"-0.363
oute-0.346
horn-0.343
aba-0.342
Token 37
Token ID5214
Feature activation-8.46e-02

Pos logit contributions
REDACTED+5.359
 Austral+4.290
Austral+3.959
Rus+3.882
alg+3.820
Neg logit contributions
 eleph-9.961
=\"-9.391
oute-9.376
aba-8.959
horn-8.921
Token.
Token ID13
Feature activation+9.75e-02

Pos logit contributions
REDACTED+0.699
 Austral+0.588
Austral+0.570
alg+0.547
Rus+0.547
Neg logit contributions
 eleph-0.660
=\"-0.625
oute-0.604
horn-0.603
aba-0.602
Token f
Token ID277
Feature activation+1.36e-01

Pos logit contributions
 eleph+0.820
=\"+0.780
oute+0.757
aba+0.757
horn+0.757
Neg logit contributions
REDACTED-0.746
 Austral-0.614
Austral-0.599
alg-0.567
Rus-0.566
Tokenxe
Token ID27705
Feature activation-1.64e-01

Pos logit contributions
 eleph+0.895
=\"+0.874
oute+0.852
aba+0.850
horn+0.840
Neg logit contributions
REDACTED-1.254
 Austral-1.082
Austral-1.046
Rus-1.000
SPA-0.995
Token6
Token ID21
Feature activation+2.81e-01

Pos logit contributions
REDACTED+1.348
 Austral+1.133
Austral+1.098
Rus+1.055
alg+1.054
Neg logit contributions
 eleph-1.294
=\"-1.214
oute-1.175
horn-1.170
aba-1.168
Token
Token ID198
Feature activation-5.79e-01

Pos logit contributions
REDACTED+3.015
 Austral+2.514
Austral+2.432
Rus+2.331
alg+2.330
Neg logit contributions
 eleph-3.158
=\"-2.964
oute-2.875
horn-2.862
aba-2.857
Token
Token ID198
Feature activation-3.54e-01

Pos logit contributions
REDACTED+3.925
 Austral+3.136
Austral+3.055
Rus+2.917
SPA+2.864
Neg logit contributions
 eleph-5.922
oute-5.042
=\"-5.000
aba-4.897
horn-4.858
TokenA
Token ID32
Feature activation-3.00e-01

Pos logit contributions
REDACTED+2.851
 Austral+2.392
Austral+2.309
alg+2.215
Rus+2.213
Neg logit contributions
 eleph-2.843
=\"-2.687
oute-2.604
horn-2.597
aba-2.592
Token bench
Token ID7624
Feature activation-5.47e-01

Pos logit contributions
REDACTED+2.549
 Austral+2.153
Austral+2.095
Rus+2.013
alg+2.005
Neg logit contributions
 eleph-2.292
=\"-2.143
oute-2.074
aba-2.067
horn-2.066
Token of
Token ID286
Feature activation-7.33e-01

Pos logit contributions
REDACTED+4.404
 Austral+3.805
Austral+3.571
alg+3.508
Rus+3.475
Neg logit contributions
 eleph-4.350
=\"-4.003
oute-3.907
horn-3.869
aba-3.830
Token justices
Token ID27818
Feature activation-9.77e-01

Pos logit contributions
REDACTED+5.210
 Austral+4.379
Austral+4.080
alg+3.999
Rus+3.940
Neg logit contributions
 eleph-6.497
=\"-6.103
oute-5.939
horn-5.938
aba-5.870
Token H
Token ID367
Feature activation-4.93e-01

Pos logit contributions
REDACTED+7.467
 Austral+6.609
Austral+5.997
alg+5.925
Rus+5.904
Neg logit contributions
 eleph-7.837
=\"-7.291
oute-7.091
horn-6.988
aba-6.870
Token L
Token ID406
Feature activation-5.65e-01

Pos logit contributions
REDACTED+3.790
 Austral+3.190
Austral+3.069
alg+2.960
Rus+2.954
Neg logit contributions
 eleph-4.138
=\"-3.880
oute-3.745
horn-3.715
aba-3.686
Token D
Token ID360
Feature activation-2.85e-01

Pos logit contributions
REDACTED+4.210
 Austral+3.561
Austral+3.385
alg+3.312
Rus+3.272
Neg logit contributions
 eleph-4.844
=\"-4.562
oute-4.366
horn-4.366
aba-4.325
Tokenatt
Token ID1078
Feature activation-7.66e-02

Pos logit contributions
REDACTED+2.253
 Austral+1.884
Austral+1.821
alg+1.749
Rus+1.747
Neg logit contributions
 eleph-2.350
=\"-2.205
oute-2.133
horn-2.125
aba-2.121
Tokenu
Token ID84
Feature activation-2.78e-01

Pos logit contributions
REDACTED+0.575
 Austral+0.469
Austral+0.454
Rus+0.434
alg+0.431
Neg logit contributions
 eleph-0.664
=\"-0.626
horn-0.611
oute-0.611
aba-0.611
Token app
Token ID598
Feature activation-5.64e-01

Pos logit contributions
REDACTED+3.104
 Austral+2.618
Austral+2.470
alg+2.402
Rus+2.371
Neg logit contributions
 eleph-3.421
=\"-3.131
oute-3.023
horn-3.012
aba-3.003
Tokendata
Token ID7890
Feature activation-5.32e-01

Pos logit contributions
REDACTED+5.216
 Austral+4.435
Austral+4.353
alg+4.251
Rus+4.159
Neg logit contributions
 eleph-4.123
=\"-3.530
oute-3.408
horn-3.339
aba-3.336
Token _
Token ID4808
Feature activation-4.84e-01

Pos logit contributions
REDACTED+3.621
 Austral+2.961
Austral+2.797
alg+2.680
Rus+2.658
Neg logit contributions
 eleph-5.075
=\"-4.639
oute-4.524
horn-4.493
aba-4.486
Token full
Token ID1336
Feature activation-8.62e-01

Pos logit contributions
REDACTED+3.549
 Austral+2.961
Austral+2.813
alg+2.690
Rus+2.646
Neg logit contributions
 eleph-4.239
=\"-4.002
aba-3.869
oute-3.858
horn-3.845
Token v
Token ID410
Feature activation-7.04e-01

Pos logit contributions
REDACTED+4.536
 Austral+3.597
Austral+3.228
Rus+3.142
alg+3.046
Neg logit contributions
 eleph-9.306
=\"-8.494
oute-8.409
aba-8.408
horn-8.377
Token )
Token ID1267
Feature activation-9.76e-01

Pos logit contributions
REDACTED+4.871
 Austral+4.011
Austral+3.778
Rus+3.690
alg+3.662
Neg logit contributions
 eleph-6.615
=\"-5.962
oute-5.794
horn-5.693
aba-5.675
Token {
Token ID1391
Feature activation-6.73e-01

Pos logit contributions
REDACTED+7.057
 Austral+5.953
Austral+5.535
Rus+5.518
alg+5.289
Neg logit contributions
 eleph-9.100
oute-7.519
=\"-7.492
aba-7.370
horn-7.284
Token //
Token ID3373
Feature activation-2.26e-01

Pos logit contributions
REDACTED+4.629
 Austral+3.771
Austral+3.565
alg+3.463
Rus+3.443
Neg logit contributions
 eleph-6.082
=\"-5.780
horn-5.585
oute-5.573
aba-5.525
Token Time
Token ID3862
Feature activation-6.22e-01

Pos logit contributions
REDACTED+1.888
 Austral+1.590
Austral+1.544
Rus+1.484
alg+1.482
Neg logit contributions
 eleph-1.761
=\"-1.648
oute-1.596
horn-1.589
aba-1.586
Token component
Token ID7515
Feature activation-5.12e-01

Pos logit contributions
REDACTED+4.881
 Austral+4.126
Austral+3.919
alg+3.810
Rus+3.766
Neg logit contributions
 eleph-5.217
=\"-4.735
oute-4.613
aba-4.599
horn-4.571
Token const
Token ID1500
Feature activation-2.77e-02

Pos logit contributions
REDACTED+2.704
 Austral+2.074
Austral+1.890
alg+1.797
Rus+1.780
Neg logit contributions
 eleph-5.598
=\"-5.212
horn-5.127
aba-5.105
oute-5.093
TokenPath
Token ID15235
Feature activation-7.13e-01

Pos logit contributions
REDACTED+4.158
 Austral+3.518
Austral+3.436
Rus+3.338
alg+3.314
Neg logit contributions
 eleph-3.477
=\"-2.951
oute-2.925
aba-2.852
horn-2.812
Token )
Token ID1267
Feature activation-1.19e-01

Pos logit contributions
REDACTED+4.996
 Austral+4.084
Austral+3.876
Rus+3.749
alg+3.739
Neg logit contributions
 eleph-6.968
=\"-5.993
oute-5.927
horn-5.724
aba-5.709
Token image
Token ID2939
Feature activation-1.92e-01

Pos logit contributions
REDACTED+0.962
 Austral+0.805
Austral+0.782
Rus+0.749
alg+0.747
Neg logit contributions
 eleph-0.964
=\"-0.908
oute-0.881
horn-0.876
aba-0.874
Token =
Token ID796
Feature activation-7.08e-02

Pos logit contributions
REDACTED+1.960
 Austral+1.707
Austral+1.663
Rus+1.618
alg+1.613
Neg logit contributions
 eleph-1.177
=\"-1.040
oute-1.006
horn-0.989
aba-0.986
Token im
Token ID545
Feature activation-4.97e-01

Pos logit contributions
REDACTED+0.598
 Austral+0.502
Austral+0.490
alg+0.471
Rus+0.471
Neg logit contributions
 eleph-0.543
=\"-0.513
oute-0.495
horn-0.493
aba-0.493
Tokenutils
Token ID26791
Feature activation-9.75e-01

Pos logit contributions
REDACTED+3.160
 Austral+2.494
Austral+2.396
Rus+2.298
alg+2.282
Neg logit contributions
 eleph-4.990
=\"-4.555
horn-4.434
oute-4.426
aba-4.412
Token.
Token ID764
Feature activation-9.15e-01

Pos logit contributions
REDACTED+3.470
 Austral+2.279
Austral+1.899
Rus+1.816
alg+1.762
Neg logit contributions
 eleph-13.077
=\"-11.263
oute-11.122
enhagen-11.046
horn-10.892
Token resize
Token ID47558
Feature activation-8.06e-01

Pos logit contributions
REDACTED+5.301
 Austral+4.317
Austral+3.902
alg+3.818
Rus+3.816
Neg logit contributions
 eleph-9.435
=\"-8.398
aba-8.218
horn-8.153
oute-8.058
Token (
Token ID357
Feature activation-4.96e-02

Pos logit contributions
REDACTED+6.179
 Austral+5.099
Austral+4.928
Rus+4.808
SPA+4.732
Neg logit contributions
 eleph-7.436
=\"-6.195
oute-6.105
aba-5.886
horn-5.852
Token image
Token ID2939
Feature activation-4.31e-01

Pos logit contributions
REDACTED+0.399
 Austral+0.334
Austral+0.324
Rus+0.311
alg+0.310
Neg logit contributions
 eleph-0.401
=\"-0.377
oute-0.366
horn-0.364
aba-0.364
Token,
Token ID837
Feature activation+7.85e-02

Pos logit contributions
REDACTED+2.141
 Austral+1.541
Austral+1.433
Rus+1.358
alg+1.321
Neg logit contributions
 eleph-5.060
=\"-4.589
oute-4.558
aba-4.461
horn-4.444
Token\n
Token ID198
Feature activation+3.21e-01

Pos logit contributions
REDACTED+1.379
 Austral+1.134
Austral+1.098
alg+1.049
Rus+1.046
Neg logit contributions
 eleph-1.587
=\"-1.501
oute-1.459
horn-1.453
aba-1.451
Token\n
Token ID198
Feature activation+3.98e-01

Pos logit contributions
=\"+2.384
horn+2.314
aba+2.274
oute+2.211
acket+2.112
Neg logit contributions
REDACTED-2.608
 Austral-2.220
Austral-2.142
alg-2.076
Rus-2.059
TokenRated
Token ID15322
Feature activation+3.19e-02

Pos logit contributions
=\"+2.155
horn+2.065
aba+2.045
oute+1.973
acket+1.850
Neg logit contributions
REDACTED-3.983
 Austral-3.607
Austral-3.390
alg-3.307
Rus-3.276
Token 5
Token ID642
Feature activation+5.02e-01

Pos logit contributions
 eleph+0.252
=\"+0.240
oute+0.232
aba+0.232
horn+0.231
Neg logit contributions
REDACTED-0.259
 Austral-0.216
Austral-0.212
alg-0.202
Rus-0.202
Token out
Token ID503
Feature activation+3.51e-01

Pos logit contributions
=\"+2.951
horn+2.850
aba+2.792
oute+2.773
 eleph+2.659
Neg logit contributions
REDACTED-4.830
 Austral-4.161
Austral-4.105
alg-3.975
SPA-3.965
Token of
Token ID286
Feature activation+7.97e-01

Pos logit contributions
=\"+2.229
horn+2.158
aba+2.096
oute+2.046
 eleph+1.982
Neg logit contributions
REDACTED-3.260
 Austral-2.789
Austral-2.734
alg-2.652
Rus-2.637
Token 5
Token ID642
Feature activation+5.38e-01

Pos logit contributions
=\"+4.982
horn+4.823
aba+4.646
oute+4.539
acket+4.333
Neg logit contributions
REDACTED-7.190
Austral-6.131
 Austral-6.046
Rus-5.865
alg-5.834
Token by
Token ID416
Feature activation+4.74e-01

Pos logit contributions
=\"+2.730
horn+2.593
aba+2.558
oute+2.488
acket+2.314
Neg logit contributions
REDACTED-5.614
 Austral-4.852
Austral-4.793
alg-4.702
SPA-4.655
Token Quake
Token ID42901
Feature activation+2.32e-01

Pos logit contributions
 eleph+3.842
=\"+3.614
horn+3.567
oute+3.559
aba+3.535
Neg logit contributions
REDACTED-3.664
Austral-2.954
 Austral-2.924
alg-2.790
Rus-2.775
TokenD
Token ID35
Feature activation+4.73e-02

Pos logit contributions
 eleph+1.682
=\"+1.582
horn+1.567
oute+1.567
aba+1.557
Neg logit contributions
REDACTED-2.007
 Austral-1.702
Austral-1.657
alg-1.590
SPA-1.566
Tokenude
Token ID2507
Feature activation-5.00e-02

Pos logit contributions
 eleph+0.327
=\"+0.304
oute+0.296
horn+0.295
aba+0.295
Neg logit contributions
REDACTED-0.436
 Austral-0.373
Austral-0.362
Rus-0.349
alg-0.347
Token,
Token ID11
Feature activation-4.48e-02

Pos logit contributions
REDACTED+1.482
 Austral+1.267
Austral+1.231
alg+1.188
Rus+1.187
Neg logit contributions
 eleph-1.175
=\"-1.092
oute-1.054
horn-1.048
aba-1.045
Token the
Token ID262
Feature activation-3.15e-02

Pos logit contributions
REDACTED+0.375
 Austral+0.311
Austral+0.305
Rus+0.294
alg+0.293
Neg logit contributions
 eleph-0.350
=\"-0.327
oute-0.317
horn-0.316
aba-0.315
Token ice
Token ID4771
Feature activation+2.13e-02

Pos logit contributions
REDACTED+0.276
 Austral+0.232
Austral+0.227
Rus+0.219
alg+0.218
Neg logit contributions
 eleph-0.232
=\"-0.216
oute-0.209
horn-0.209
aba-0.207
Token sheet
Token ID9629
Feature activation+1.28e-01

Pos logit contributions
 eleph+0.141
=\"+0.130
oute+0.125
horn+0.124
aba+0.124
Neg logit contributions
REDACTED-0.204
 Austral-0.176
Austral-0.171
Rus-0.166
alg-0.166
Token contains
Token ID4909
Feature activation+2.31e-01

Pos logit contributions
 eleph+0.883
=\"+0.827
oute+0.795
horn+0.792
aba+0.789
Neg logit contributions
REDACTED-1.174
 Austral-0.995
Austral-0.974
Rus-0.942
alg-0.938
Token enough
Token ID1576
Feature activation+8.08e-01

Pos logit contributions
 eleph+1.730
=\"+1.617
oute+1.567
aba+1.564
horn+1.557
Neg logit contributions
REDACTED-1.980
 Austral-1.638
Austral-1.616
Rus-1.566
SPA-1.556
Token water
Token ID1660
Feature activation+1.93e-01

Pos logit contributions
 eleph+5.595
=\"+5.349
aba+5.159
horn+5.157
oute+5.073
Neg logit contributions
REDACTED-6.856
Austral-5.725
 Austral-5.617
Rus-5.528
SPA-5.506
Token to
Token ID284
Feature activation-3.20e-03

Pos logit contributions
 eleph+1.309
=\"+1.217
horn+1.169
oute+1.162
aba+1.161
Neg logit contributions
REDACTED-1.809
 Austral-1.545
Austral-1.517
Rus-1.458
SPA-1.450
Token raise
Token ID5298
Feature activation-8.96e-02

Pos logit contributions
REDACTED+0.029
 Austral+0.025
Austral+0.024
Rus+0.023
alg+0.023
Neg logit contributions
 eleph-0.023
=\"-0.021
oute-0.020
aba-0.020
horn-0.020
Token global
Token ID3298
Feature activation+2.97e-01

Pos logit contributions
REDACTED+0.756
 Austral+0.637
Austral+0.620
Rus+0.596
alg+0.595
Neg logit contributions
 eleph-0.691
=\"-0.645
oute-0.626
horn-0.622
aba-0.621
Token sea
Token ID5417
Feature activation+5.14e-02

Pos logit contributions
 eleph+2.128
=\"+1.967
horn+1.928
oute+1.922
aba+1.920
Neg logit contributions
REDACTED-2.654
 Austral-2.223
Austral-2.203
Rus-2.134
alg-2.090
Tokenunk
Token ID2954
Feature activation-1.88e-01

Pos logit contributions
 eleph+0.555
=\"+0.521
oute+0.511
horn+0.511
aba+0.508
Neg logit contributions
REDACTED-0.631
 Austral-0.533
Austral-0.518
Rus-0.495
SPA-0.489
Tokenrock
Token ID10823
Feature activation-3.57e-01

Pos logit contributions
REDACTED+1.627
 Austral+1.379
Austral+1.340
Rus+1.289
alg+1.288
Neg logit contributions
 eleph-1.415
=\"-1.321
oute-1.279
horn-1.275
aba-1.272
Token -
Token ID532
Feature activation+3.40e-01

Pos logit contributions
REDACTED+3.029
 Austral+2.567
Austral+2.487
Rus+2.402
alg+2.400
Neg logit contributions
 eleph-2.723
=\"-2.540
oute-2.446
horn-2.429
aba-2.428
Token favorite
Token ID4004
Feature activation+1.20e+00

Pos logit contributions
 eleph+1.394
=\"+1.267
horn+1.257
aba+1.223
oute+1.197
Neg logit contributions
REDACTED-4.031
Austral-3.461
 Austral-3.445
SPA-3.387
Rus-3.323
Token favorite
Token ID4004
Feature activation+1.42e+00

Pos logit contributions
=\"+3.519
horn+3.342
oute+3.287
 eleph+3.161
aba+2.903
Neg logit contributions
REDACTED-14.305
Austral-12.753
 Austral-12.428
SPA-12.323
Rus-12.089
Token favorite
Token ID4004
Feature activation+1.46e+00

Pos logit contributions
=\"+4.197
 eleph+4.061
horn+4.012
oute+3.895
aba+3.441
Neg logit contributions
REDACTED-16.241
Austral-14.588
 Austral-14.171
SPA-13.915
Rus-13.707
Token favorite
Token ID4004
Feature activation+1.41e+00

Pos logit contributions
 eleph+5.689
=\"+5.302
horn+5.169
oute+5.025
aba+4.565
Neg logit contributions
REDACTED-15.486
Austral-13.934
 Austral-13.558
SPA-13.050
Rus-12.962
Token -
Token ID532
Feature activation+6.70e-01

Pos logit contributions
 eleph+6.645
=\"+6.219
horn+6.126
oute+5.923
aba+5.519
Neg logit contributions
REDACTED-13.940
Austral-12.476
 Austral-12.171
SPA-11.623
Rus-11.550
Token December
Token ID3426
Feature activation+6.63e-02

Pos logit contributions
 eleph+5.199
=\"+5.148
aba+5.045
horn+5.028
oute+4.886
Neg logit contributions
REDACTED-4.988
Austral-3.987
 Austral-3.911
Rus-3.763
alg-3.755
Token 13
Token ID1511
Feature activation+9.83e-02

Pos logit contributions
 eleph+0.526
=\"+0.508
horn+0.491
oute+0.490
aba+0.489
Neg logit contributions
REDACTED-0.530
 Austral-0.441
Austral-0.429
alg-0.412
Rus-0.411
Token,
Token ID11
Feature activation+4.36e-01

Pos logit contributions
 eleph+0.643
=\"+0.608
horn+0.582
oute+0.582
aba+0.580
Neg logit contributions
REDACTED-0.930
 Austral-0.801
Austral-0.781
alg-0.755
Rus-0.755
Tokenrock
Token ID10823
Feature activation-3.57e-01

Pos logit contributions
REDACTED+1.627
 Austral+1.379
Austral+1.340
Rus+1.289
alg+1.288
Neg logit contributions
 eleph-1.415
=\"-1.321
oute-1.279
horn-1.275
aba-1.272
Token -
Token ID532
Feature activation+3.40e-01

Pos logit contributions
REDACTED+3.029
 Austral+2.567
Austral+2.487
Rus+2.402
alg+2.400
Neg logit contributions
 eleph-2.723
=\"-2.540
oute-2.446
horn-2.429
aba-2.428
Token favorite
Token ID4004
Feature activation+1.20e+00

Pos logit contributions
 eleph+1.394
=\"+1.267
horn+1.257
aba+1.223
oute+1.197
Neg logit contributions
REDACTED-4.031
Austral-3.461
 Austral-3.445
SPA-3.387
Rus-3.323
Token favorite
Token ID4004
Feature activation+1.42e+00

Pos logit contributions
=\"+3.519
horn+3.342
oute+3.287
 eleph+3.161
aba+2.903
Neg logit contributions
REDACTED-14.305
Austral-12.753
 Austral-12.428
SPA-12.323
Rus-12.089
Token favorite
Token ID4004
Feature activation+1.46e+00

Pos logit contributions
=\"+4.197
 eleph+4.061
horn+4.012
oute+3.895
aba+3.441
Neg logit contributions
REDACTED-16.241
Austral-14.588
 Austral-14.171
SPA-13.915
Rus-13.707
Token favorite
Token ID4004
Feature activation+1.41e+00

Pos logit contributions
 eleph+5.689
=\"+5.302
horn+5.169
oute+5.025
aba+4.565
Neg logit contributions
REDACTED-15.486
Austral-13.934
 Austral-13.558
SPA-13.050
Rus-12.962
Token -
Token ID532
Feature activation+6.70e-01

Pos logit contributions
 eleph+6.645
=\"+6.219
horn+6.126
oute+5.923
aba+5.519
Neg logit contributions
REDACTED-13.940
Austral-12.476
 Austral-12.171
SPA-11.623
Rus-11.550
Token December
Token ID3426
Feature activation+6.63e-02

Pos logit contributions
 eleph+5.199
=\"+5.148
aba+5.045
horn+5.028
oute+4.886
Neg logit contributions
REDACTED-4.988
Austral-3.987
 Austral-3.911
Rus-3.763
alg-3.755
Token 13
Token ID1511
Feature activation+9.83e-02

Pos logit contributions
 eleph+0.526
=\"+0.508
horn+0.491
oute+0.490
aba+0.489
Neg logit contributions
REDACTED-0.530
 Austral-0.441
Austral-0.429
alg-0.412
Rus-0.411
Token,
Token ID11
Feature activation+4.36e-01

Pos logit contributions
 eleph+0.643
=\"+0.608
horn+0.582
oute+0.582
aba+0.580
Neg logit contributions
REDACTED-0.930
 Austral-0.801
Austral-0.781
alg-0.755
Rus-0.755
Token 2010
Token ID3050
Feature activation+1.79e-01

Pos logit contributions
 eleph+2.576
=\"+2.447
horn+2.389
aba+2.294
oute+2.253
Neg logit contributions
REDACTED-4.359
 Austral-3.695
Austral-3.693
Rus-3.547
SPA-3.511
TokenAN
Token ID1565
Feature activation-2.43e-01

Pos logit contributions
REDACTED+2.417
 Austral+2.047
Austral+1.987
Rus+1.915
alg+1.914
Neg logit contributions
 eleph-2.129
=\"-1.985
oute-1.919
horn-1.909
aba-1.905
Token -
Token ID532
Feature activation+6.29e-01

Pos logit contributions
REDACTED+2.077
 Austral+1.761
Austral+1.710
Rus+1.642
alg+1.641
Neg logit contributions
 eleph-1.848
=\"-1.724
oute-1.671
horn-1.668
aba-1.663
Token favorite
Token ID4004
Feature activation+1.29e+00

Pos logit contributions
=\"+2.469
horn+2.441
aba+2.351
 eleph+2.340
oute+2.326
Neg logit contributions
REDACTED-7.187
Austral-6.276
 Austral-6.209
Rus-6.081
SPA-6.044
Token favorite
Token ID4004
Feature activation+1.34e+00

Pos logit contributions
=\"+4.280
horn+4.226
oute+3.838
aba+3.540
acket+3.111
Neg logit contributions
REDACTED-14.668
Austral-13.221
Rus-12.807
 Austral-12.787
SPA-12.634
Token favorite
Token ID4004
Feature activation+1.43e+00

Pos logit contributions
=\"+4.347
horn+4.303
oute+3.898
aba+3.553
 Dover+3.118
Neg logit contributions
REDACTED-15.241
Austral-13.779
Rus-13.312
 Austral-13.285
SPA-13.118
Token favorite
Token ID4004
Feature activation+1.36e+00

Pos logit contributions
=\"+4.756
horn+4.710
oute+4.318
aba+3.852
 eleph+3.473
Neg logit contributions
REDACTED-15.853
Austral-14.425
 Austral-13.813
Rus-13.794
SPA-13.617
Token favorite
Token ID4004
Feature activation+1.19e+00

Pos logit contributions
horn+5.022
 eleph+4.993
=\"+4.934
oute+4.648
aba+4.167
Neg logit contributions
REDACTED-14.635
Austral-13.374
 Austral-12.948
Rus-12.582
SPA-12.333
Token -
Token ID532
Feature activation+5.33e-01

Pos logit contributions
 eleph+6.188
horn+5.882
=\"+5.843
oute+5.508
aba+5.212
Neg logit contributions
REDACTED-11.479
Austral-10.389
 Austral-10.152
Rus-9.705
SPA-9.516
Token September
Token ID2693
Feature activation+1.17e-01

Pos logit contributions
=\"+3.953
horn+3.829
aba+3.803
 eleph+3.794
oute+3.673
Neg logit contributions
REDACTED-4.307
Austral-3.497
 Austral-3.443
alg-3.351
Rus-3.301
Token 26
Token ID2608
Feature activation+9.79e-02

Pos logit contributions
 eleph+0.917
=\"+0.899
horn+0.871
aba+0.866
oute+0.863
Neg logit contributions
REDACTED-0.939
 Austral-0.780
Austral-0.761
alg-0.731
Rus-0.726
Token,
Token ID11
Feature activation+6.40e-01

Pos logit contributions
 eleph+0.608
=\"+0.598
horn+0.572
aba+0.568
oute+0.565
Neg logit contributions
REDACTED-0.934
 Austral-0.807
Austral-0.787
alg-0.763
Rus-0.761
Token,
Token ID11
Feature activation+4.28e-02

Pos logit contributions
 eleph+0.858
=\"+0.795
oute+0.764
aba+0.764
horn+0.759
Neg logit contributions
REDACTED-1.249
 Austral-1.067
Austral-1.053
alg-1.010
Rus-1.008
Token San
Token ID2986
Feature activation+1.38e-02

Pos logit contributions
 eleph+0.294
=\"+0.273
oute+0.264
aba+0.262
horn+0.261
Neg logit contributions
REDACTED-0.398
 Austral-0.338
Austral-0.332
Rus-0.321
alg-0.318
Token Francisco
Token ID6033
Feature activation-3.65e-02

Pos logit contributions
 eleph+0.099
=\"+0.092
aba+0.090
horn+0.090
oute+0.090
Neg logit contributions
REDACTED-0.123
 Austral-0.105
Austral-0.102
Rus-0.098
SPA-0.098
Token,
Token ID11
Feature activation+3.43e-02

Pos logit contributions
REDACTED+0.345
 Austral+0.296
Austral+0.289
Rus+0.279
alg+0.279
Neg logit contributions
 eleph-0.245
=\"-0.227
oute-0.218
aba-0.217
horn-0.217
Token and
Token ID290
Feature activation+1.73e-01

Pos logit contributions
 eleph+0.243
=\"+0.228
oute+0.220
aba+0.220
horn+0.219
Neg logit contributions
REDACTED-0.310
 Austral-0.261
Austral-0.257
Rus-0.247
alg-0.247
Token Seattle
Token ID7312
Feature activation+1.19e-01

Pos logit contributions
 eleph+1.295
=\"+1.240
aba+1.206
oute+1.199
horn+1.198
Neg logit contributions
REDACTED-1.476
 Austral-1.211
Austral-1.208
alg-1.158
Rus-1.156
Token,
Token ID11
Feature activation-5.77e-02

Pos logit contributions
 eleph+0.783
=\"+0.722
oute+0.696
aba+0.695
horn+0.689
Neg logit contributions
REDACTED-1.145
 Austral-0.977
Austral-0.963
alg-0.924
Rus-0.923
Token she
Token ID673
Feature activation+6.47e-02

Pos logit contributions
REDACTED+0.533
 Austral+0.454
Austral+0.445
Rus+0.429
alg+0.427
Neg logit contributions
 eleph-0.400
=\"-0.371
oute-0.360
horn-0.356
aba-0.356
Token followed
Token ID3940
Feature activation-1.67e-01

Pos logit contributions
 eleph+0.481
=\"+0.450
oute+0.434
horn+0.433
aba+0.432
Neg logit contributions
REDACTED-0.565
 Austral-0.478
Austral-0.465
alg-0.447
Rus-0.446
Token public
Token ID1171
Feature activation+1.55e-01

Pos logit contributions
REDACTED+1.400
 Austral+1.180
Austral+1.147
Rus+1.102
alg+1.100
Neg logit contributions
 eleph-1.294
=\"-1.209
oute-1.172
horn-1.166
aba-1.165
Token stair
Token ID18761
Feature activation+6.04e-02

Pos logit contributions
 eleph+1.080
=\"+0.999
oute+0.973
horn+0.968
aba+0.965
Neg logit contributions
REDACTED-1.427
 Austral-1.215
Austral-1.187
Rus-1.140
SPA-1.134
Token.
Token ID13
Feature activation+3.15e-01

Pos logit contributions
 eleph+2.483
=\"+2.366
aba+2.281
oute+2.253
horn+2.227
Neg logit contributions
REDACTED-3.121
 Austral-2.631
Austral-2.578
Rus-2.525
alg-2.472
Token Mil
Token ID4460
Feature activation+1.73e-01

Pos logit contributions
 eleph+2.264
=\"+2.166
aba+2.075
oute+2.057
horn+2.050
Neg logit contributions
REDACTED-2.751
 Austral-2.290
Austral-2.265
Rus-2.219
alg-2.172
Tokenburn
Token ID10899
Feature activation+2.85e-02

Pos logit contributions
 eleph+1.282
=\"+1.213
horn+1.208
aba+1.197
oute+1.193
Neg logit contributions
REDACTED-1.497
 Austral-1.253
Austral-1.226
Rus-1.172
alg-1.161
Token Stand
Token ID5751
Feature activation-1.13e-01

Pos logit contributions
 eleph+0.203
=\"+0.189
oute+0.182
horn+0.182
aba+0.182
Neg logit contributions
REDACTED-0.258
 Austral-0.219
Austral-0.214
Rus-0.207
alg-0.207
Token representative
Token ID8852
Feature activation+6.94e-02

Pos logit contributions
REDACTED+1.040
 Austral+0.890
Austral+0.866
Rus+0.836
alg+0.836
Neg logit contributions
 eleph-0.785
=\"-0.730
oute-0.703
horn-0.701
aba-0.699
Token must
Token ID1276
Feature activation+4.37e-01

Pos logit contributions
 eleph+0.468
=\"+0.437
oute+0.420
aba+0.418
horn+0.418
Neg logit contributions
REDACTED-0.651
 Austral-0.554
Austral-0.544
Rus-0.527
alg-0.526
Token hold
Token ID1745
Feature activation+2.47e-01

Pos logit contributions
 eleph+3.040
=\"+3.002
aba+2.811
horn+2.810
oute+2.784
Neg logit contributions
REDACTED-3.844
 Austral-3.205
Austral-3.182
Rus-3.100
SPA-3.058
Token a
Token ID257
Feature activation-2.57e-01

Pos logit contributions
 eleph+1.793
=\"+1.721
aba+1.640
horn+1.636
oute+1.631
Neg logit contributions
REDACTED-2.145
 Austral-1.779
Austral-1.762
Rus-1.730
alg-1.687
Token valid
Token ID4938
Feature activation-5.35e-02

Pos logit contributions
REDACTED+2.286
 Austral+1.943
Austral+1.899
Rus+1.836
alg+1.825
Neg logit contributions
 eleph-1.859
=\"-1.742
oute-1.677
horn-1.672
aba-1.670
Token season
Token ID1622
Feature activation+2.09e-02

Pos logit contributions
REDACTED+0.468
 Austral+0.395
Austral+0.388
Rus+0.375
alg+0.371
Neg logit contributions
 eleph-0.393
=\"-0.368
oute-0.355
aba-0.355
horn-0.354
Token ticket
Token ID7846
Feature activation-1.07e-01

Pos logit contributions
 eleph+0.175
=\"+0.166
horn+0.162
oute+0.161
aba+0.161
Neg logit contributions
REDACTED-0.160
 Austral-0.132
Austral-0.129
Rus-0.125
alg-0.123
Token.
Token ID13
Feature activation+8.03e-02

Pos logit contributions
REDACTED+1.590
 Austral+1.354
Austral+1.317
Rus+1.270
alg+1.269
Neg logit contributions
 eleph-1.302
=\"-1.212
oute-1.170
horn-1.164
aba-1.162
Token So
Token ID1406
Feature activation-1.16e-01

Pos logit contributions
 eleph+0.642
=\"+0.630
horn+0.612
aba+0.609
oute+0.607
Neg logit contributions
REDACTED-0.625
 Austral-0.510
Austral-0.497
alg-0.487
SPA-0.480
Token there
Token ID612
Feature activation-2.79e-01

Pos logit contributions
REDACTED+0.999
 Austral+0.847
Austral+0.822
Rus+0.791
alg+0.791
Neg logit contributions
 eleph-0.877
=\"-0.818
oute-0.791
horn-0.787
aba-0.786
Token can
Token ID460
Feature activation-2.30e-01

Pos logit contributions
REDACTED+2.532
 Austral+2.165
Austral+2.105
Rus+2.034
alg+2.032
Neg logit contributions
 eleph-1.971
=\"-1.834
oute-1.765
horn-1.757
aba-1.754
Token be
Token ID307
Feature activation-6.12e-02

Pos logit contributions
REDACTED+1.970
 Austral+1.667
Austral+1.619
Rus+1.558
alg+1.557
Neg logit contributions
 eleph-1.751
=\"-1.636
oute-1.580
horn-1.573
aba-1.571
Token little
Token ID1310
Feature activation+1.04e-01

Pos logit contributions
REDACTED+0.525
 Austral+0.445
Austral+0.432
Rus+0.416
alg+0.415
Neg logit contributions
 eleph-0.463
=\"-0.434
oute-0.419
aba-0.417
horn-0.416
Token doubt
Token ID4719
Feature activation-1.63e-01

Pos logit contributions
 eleph+0.766
=\"+0.720
oute+0.697
aba+0.695
horn+0.692
Neg logit contributions
REDACTED-0.905
 Austral-0.763
Austral-0.745
SPA-0.718
Rus-0.716
Token that
Token ID326
Feature activation-2.06e-01

Pos logit contributions
REDACTED+1.423
 Austral+1.209
Austral+1.176
Rus+1.133
alg+1.133
Neg logit contributions
 eleph-1.202
=\"-1.123
oute-1.084
horn-1.080
aba-1.076
Token,
Token ID11
Feature activation+1.62e-01

Pos logit contributions
REDACTED+1.718
 Austral+1.450
Austral+1.404
Rus+1.349
alg+1.349
Neg logit contributions
 eleph-1.604
=\"-1.499
oute-1.450
horn-1.444
aba-1.441
Token had
Token ID550
Feature activation-3.27e-01

Pos logit contributions
 eleph+1.095
=\"+1.033
aba+1.002
horn+0.999
oute+0.992
Neg logit contributions
REDACTED-1.483
 Austral-1.259
Austral-1.239
alg-1.207
Rus-1.200
Token Ign
Token ID16583
Feature activation+9.52e-02

Pos logit contributions
REDACTED+2.624
 Austral+2.206
Austral+2.126
alg+2.036
Rus+2.032
Neg logit contributions
 eleph-2.660
=\"-2.487
oute-2.408
horn-2.400
aba-2.397
Token for
Token ID329
Feature activation-6.57e-02

Pos logit contributions
 eleph+1.718
=\"+1.625
aba+1.549
horn+1.548
oute+1.541
Neg logit contributions
REDACTED-2.315
 Austral-1.955
Austral-1.939
alg-1.872
Rus-1.866
Token Tuesday
Token ID3431
Feature activation-1.65e-01

Pos logit contributions
REDACTED+0.538
 Austral+0.445
Austral+0.437
alg+0.419
Rus+0.418
Neg logit contributions
 eleph-0.521
=\"-0.495
oute-0.477
aba-0.476
horn-0.475
Token —
Token ID851
Feature activation+3.74e-01

Pos logit contributions
REDACTED+1.444
 Austral+1.227
Austral+1.193
alg+1.149
Rus+1.148
Neg logit contributions
 eleph-1.224
=\"-1.141
oute-1.102
horn-1.097
aba-1.096
Token including
Token ID1390
Feature activation+3.66e-01

Pos logit contributions
 eleph+2.845
=\"+2.687
aba+2.607
oute+2.578
horn+2.575
Neg logit contributions
REDACTED-3.128
 Austral-2.556
Austral-2.550
alg-2.457
SPA-2.444
Token a
Token ID257
Feature activation+3.18e-02

Pos logit contributions
 eleph+2.811
=\"+2.691
aba+2.591
oute+2.561
horn+2.559
Neg logit contributions
REDACTED-3.014
Austral-2.464
 Austral-2.456
alg-2.359
Rus-2.357
Token stop
Token ID2245
Feature activation+1.30e-01

Pos logit contributions
 eleph+0.233
=\"+0.221
oute+0.212
horn+0.211
aba+0.211
Neg logit contributions
REDACTED-0.280
 Austral-0.235
Austral-0.231
Rus-0.223
alg-0.222
Token at
Token ID379
Feature activation-4.29e-02

Pos logit contributions
 eleph+0.868
=\"+0.813
horn+0.783
oute+0.778
aba+0.777
Neg logit contributions
REDACTED-1.222
 Austral-1.050
Austral-1.029
alg-0.985
Rus-0.984
Token the
Token ID262
Feature activation+7.53e-03

Pos logit contributions
REDACTED+0.362
 Austral+0.303
Austral+0.297
Rus+0.284
alg+0.284
Neg logit contributions
 eleph-0.332
=\"-0.312
oute-0.302
aba-0.300
horn-0.300
Token University
Token ID2059
Feature activation+2.13e-01

Pos logit contributions
 eleph+0.053
=\"+0.050
oute+0.048
horn+0.048
aba+0.048
Neg logit contributions
REDACTED-0.068
 Austral-0.058
Austral-0.057
Rus-0.054
alg-0.054
Token of
Token ID286
Feature activation+3.15e-01

Pos logit contributions
 eleph+1.501
=\"+1.410
horn+1.361
aba+1.342
oute+1.332
Neg logit contributions
REDACTED-1.933
 Austral-1.634
Austral-1.622
Rus-1.545
alg-1.537
Token Iowa
Token ID9406
Feature activation+4.62e-01

Pos logit contributions
=\"+1.755
 eleph+1.748
horn+1.673
aba+1.653
oute+1.641
Neg logit contributions
REDACTED-3.234
Austral-2.729
 Austral-2.685
alg-2.641
SPA-2.598
Token State
Token ID1812
Feature activation+6.73e-02

Pos logit contributions
 eleph+2.476
=\"+2.418
horn+2.338
oute+2.318
aba+2.303
Neg logit contributions
REDACTED-3.427
 Austral-2.957
Austral-2.942
alg-2.780
SPA-2.740
Token has
Token ID468
Feature activation-3.29e-02

Pos logit contributions
 eleph+0.481
=\"+0.456
oute+0.436
horn+0.436
aba+0.435
Neg logit contributions
REDACTED-0.599
 Austral-0.511
Austral-0.500
alg-0.480
Rus-0.478
Token a
Token ID257
Feature activation-8.93e-02

Pos logit contributions
REDACTED+0.279
 Austral+0.236
Austral+0.230
Rus+0.221
alg+0.221
Neg logit contributions
 eleph-0.250
=\"-0.236
oute-0.228
horn-0.227
aba-0.226
Token beef
Token ID12023
Feature activation+2.36e-01

Pos logit contributions
REDACTED+0.796
 Austral+0.680
Austral+0.664
Rus+0.638
alg+0.637
Neg logit contributions
 eleph-0.643
=\"-0.601
oute-0.582
horn-0.579
aba-0.575
Token with
Token ID351
Feature activation+6.04e-02

Pos logit contributions
 eleph+1.505
=\"+1.406
horn+1.347
oute+1.345
aba+1.335
Neg logit contributions
REDACTED-2.259
 Austral-1.971
Austral-1.927
alg-1.854
Rus-1.851
Token Trump
Token ID1301
Feature activation+2.02e-01

Pos logit contributions
 eleph+0.439
=\"+0.416
oute+0.400
aba+0.398
horn+0.397
Neg logit contributions
REDACTED-0.530
 Austral-0.449
Austral-0.440
alg-0.423
Rus-0.423
Token it
Token ID340
Feature activation+3.38e-01

Pos logit contributions
 eleph+1.399
=\"+1.320
oute+1.264
aba+1.262
horn+1.261
Neg logit contributions
REDACTED-1.832
 Austral-1.583
Austral-1.540
alg-1.476
Rus-1.470
Token can
Token ID460
Feature activation+1.67e-02

Pos logit contributions
 eleph+2.259
=\"+2.164
horn+2.065
aba+2.054
oute+2.047
Neg logit contributions
REDACTED-3.107
 Austral-2.684
Austral-2.613
alg-2.498
Rus-2.491
Token�
Token ID447
Feature activation-4.12e-02

Pos logit contributions
 eleph+0.124
=\"+0.117
oute+0.113
horn+0.113
aba+0.113
Neg logit contributions
REDACTED-0.144
 Austral-0.123
Austral-0.121
Rus-0.115
alg-0.115
Token�
Token ID247
Feature activation+2.54e-01

Pos logit contributions
REDACTED+0.368
 Austral+0.315
Austral+0.307
alg+0.295
Rus+0.294
Neg logit contributions
 eleph-0.294
=\"-0.278
oute-0.267
horn-0.266
aba-0.265
Tokent
Token ID83
Feature activation+9.13e-02

Pos logit contributions
 eleph+1.356
=\"+1.313
oute+1.282
aba+1.263
horn+1.248
Neg logit contributions
REDACTED-2.665
 Austral-2.342
Austral-2.289
Rus-2.210
SPA-2.194
Token standing
Token ID5055
Feature activation-3.85e-03

Pos logit contributions
REDACTED+0.212
 Austral+0.178
Austral+0.175
alg+0.170
Rus+0.170
Neg logit contributions
 eleph-0.173
=\"-0.162
horn-0.156
oute-0.156
aba-0.155
Token in
Token ID287
Feature activation+1.15e-01

Pos logit contributions
REDACTED+0.036
 Austral+0.030
Austral+0.030
alg+0.029
Rus+0.029
Neg logit contributions
 eleph-0.027
=\"-0.025
horn-0.024
oute-0.024
aba-0.024
Token the
Token ID262
Feature activation+1.23e-01

Pos logit contributions
 eleph+0.818
=\"+0.759
aba+0.740
oute+0.738
horn+0.737
Neg logit contributions
REDACTED-1.037
 Austral-0.868
Austral-0.853
Rus-0.825
SPA-0.824
Token security
Token ID2324
Feature activation+7.56e-02

Pos logit contributions
 eleph+0.885
=\"+0.825
oute+0.805
horn+0.805
aba+0.803
Neg logit contributions
REDACTED-1.091
 Austral-0.913
Austral-0.901
Rus-0.869
SPA-0.864
Token line
Token ID1627
Feature activation+9.90e-02

Pos logit contributions
 eleph+0.531
=\"+0.495
oute+0.480
aba+0.479
horn+0.477
Neg logit contributions
REDACTED-0.689
 Austral-0.582
Austral-0.571
Rus-0.553
alg-0.551
Token:
Token ID25
Feature activation-6.82e-02

Pos logit contributions
 eleph+0.702
=\"+0.655
aba+0.634
horn+0.632
oute+0.631
Neg logit contributions
REDACTED-0.897
 Austral-0.757
Austral-0.745
alg-0.721
Rus-0.719
Token a
Token ID257
Feature activation+9.79e-03

Pos logit contributions
REDACTED+0.562
 Austral+0.466
Austral+0.457
alg+0.441
Rus+0.440
Neg logit contributions
 eleph-0.540
=\"-0.505
horn-0.488
aba-0.488
oute-0.487
Token tall
Token ID7331
Feature activation-1.46e-01

Pos logit contributions
 eleph+0.071
=\"+0.066
oute+0.065
horn+0.065
aba+0.064
Neg logit contributions
REDACTED-0.086
 Austral-0.073
Austral-0.072
Rus-0.069
alg-0.069
Token,
Token ID11
Feature activation-2.09e-01

Pos logit contributions
REDACTED+1.337
 Austral+1.146
Austral+1.115
Rus+1.076
alg+1.076
Neg logit contributions
 eleph-1.021
=\"-0.948
oute-0.914
horn-0.909
aba-0.907
Token pale
Token ID14005
Feature activation-1.06e-01

Pos logit contributions
REDACTED+1.817
 Austral+1.533
Austral+1.497
Rus+1.443
alg+1.440
Neg logit contributions
 eleph-1.562
=\"-1.455
oute-1.409
horn-1.406
aba-1.399
Token man
Token ID582
Feature activation-7.66e-02

Pos logit contributions
REDACTED+0.953
 Austral+0.811
Austral+0.792
Rus+0.763
alg+0.763
Neg logit contributions
 eleph-0.758
=\"-0.702
oute-0.679
horn-0.679
aba-0.676
Token.
Token ID13
Feature activation-1.82e-01

Pos logit contributions
REDACTED+0.239
 Austral+0.202
Austral+0.197
alg+0.189
Rus+0.189
Neg logit contributions
 eleph-0.217
=\"-0.204
oute-0.197
horn-0.197
aba-0.196
Token The
Token ID383
Feature activation-7.81e-02

Pos logit contributions
REDACTED+1.379
 Austral+1.138
Austral+1.101
alg+1.053
Rus+1.052
Neg logit contributions
 eleph-1.558
=\"-1.472
oute-1.428
horn-1.423
aba-1.421
Token adrenaline
Token ID37607
Feature activation-9.50e-02

Pos logit contributions
REDACTED+0.669
 Austral+0.562
Austral+0.552
Rus+0.528
alg+0.526
Neg logit contributions
 eleph-0.589
=\"-0.556
oute-0.537
horn-0.534
aba-0.531
Token was
Token ID373
Feature activation-2.12e-01

Pos logit contributions
REDACTED+0.868
 Austral+0.742
Austral+0.723
Rus+0.697
alg+0.696
Neg logit contributions
 eleph-0.667
=\"-0.621
oute-0.600
horn-0.596
aba-0.595
Token really
Token ID1107
Feature activation-3.08e-01

Pos logit contributions
REDACTED+1.876
 Austral+1.598
Austral+1.550
alg+1.497
Rus+1.495
Neg logit contributions
 eleph-1.553
=\"-1.445
oute-1.395
horn-1.387
aba-1.384
Token pumping
Token ID26916
Feature activation-7.67e-02

Pos logit contributions
REDACTED+2.773
 Austral+2.378
Austral+2.305
alg+2.228
Rus+2.221
Neg logit contributions
 eleph-2.204
=\"-2.041
oute-1.971
horn-1.957
aba-1.948
Token at
Token ID379
Feature activation-4.64e-02

Pos logit contributions
REDACTED+0.677
 Austral+0.575
Austral+0.561
alg+0.539
Rus+0.539
Neg logit contributions
 eleph-0.561
=\"-0.525
oute-0.508
horn-0.505
aba-0.505
Token this
Token ID428
Feature activation-4.89e-02

Pos logit contributions
REDACTED+0.416
 Austral+0.352
Austral+0.346
Rus+0.332
alg+0.328
Neg logit contributions
 eleph-0.333
=\"-0.315
oute-0.304
aba-0.300
horn-0.297
Token moment
Token ID2589
Feature activation-1.74e-01

Pos logit contributions
REDACTED+0.452
 Austral+0.387
Austral+0.381
Rus+0.368
alg+0.365
Neg logit contributions
 eleph-0.332
=\"-0.311
horn-0.300
oute-0.300
aba-0.297
Token,
Token ID11
Feature activation-7.39e-02

Pos logit contributions
REDACTED+1.496
 Austral+1.268
Austral+1.232
Rus+1.186
alg+1.185
Neg logit contributions
 eleph-1.309
=\"-1.222
oute-1.181
horn-1.176
aba-1.173
Token and
Token ID290
Feature activation-1.78e-01

Pos logit contributions
REDACTED+0.685
 Austral+0.586
Austral+0.573
Rus+0.553
alg+0.551
Neg logit contributions
 eleph-0.507
=\"-0.472
oute-0.456
aba-0.453
horn-0.453
Tokenolla
Token ID33011
Feature activation-1.16e-01

Pos logit contributions
REDACTED+1.249
 Austral+0.982
Austral+0.944
alg+0.896
Rus+0.889
Neg logit contributions
 eleph-2.045
=\"-1.910
oute-1.859
horn-1.856
aba-1.848
Token has
Token ID468
Feature activation-1.89e-01

Pos logit contributions
REDACTED+1.059
 Austral+0.904
Austral+0.881
Rus+0.851
alg+0.851
Neg logit contributions
 eleph-0.811
=\"-0.756
oute-0.728
horn-0.725
aba-0.723
Token connected
Token ID5884
Feature activation-8.82e-02

Pos logit contributions
REDACTED+1.659
 Austral+1.408
Austral+1.370
Rus+1.320
alg+1.319
Neg logit contributions
 eleph-1.398
=\"-1.307
oute-1.260
horn-1.255
aba-1.253
Token development
Token ID2478
Feature activation-2.30e-01

Pos logit contributions
REDACTED+0.765
 Austral+0.644
Austral+0.628
Rus+0.606
alg+0.605
Neg logit contributions
 eleph-0.660
=\"-0.618
oute-0.598
aba-0.594
horn-0.594
Token of
Token ID286
Feature activation-1.46e-01

Pos logit contributions
REDACTED+2.017
 Austral+1.718
Austral+1.666
alg+1.608
Rus+1.606
Neg logit contributions
 eleph-1.696
=\"-1.576
oute-1.524
horn-1.513
aba-1.511
Token new
Token ID649
Feature activation-2.14e-01

Pos logit contributions
REDACTED+1.206
 Austral+1.006
Austral+0.979
Rus+0.944
alg+0.937
Neg logit contributions
 eleph-1.158
=\"-1.086
oute-1.053
horn-1.047
aba-1.047
Token products
Token ID3186
Feature activation-1.34e-02

Pos logit contributions
REDACTED+1.767
 Austral+1.488
Austral+1.441
alg+1.385
Rus+1.384
Neg logit contributions
 eleph-1.695
=\"-1.586
oute-1.536
horn-1.529
aba-1.526
Token to
Token ID284
Feature activation-9.47e-02

Pos logit contributions
REDACTED+0.125
 Austral+0.107
Austral+0.104
alg+0.101
Rus+0.101
Neg logit contributions
 eleph-0.091
=\"-0.085
oute-0.081
horn-0.081
aba-0.081
Token their
Token ID511
Feature activation-1.54e-01

Pos logit contributions
REDACTED+0.795
 Austral+0.658
Austral+0.643
Rus+0.622
alg+0.617
Neg logit contributions
 eleph-0.736
=\"-0.696
oute-0.671
aba-0.669
horn-0.667
Token strategy
Token ID4811
Feature activation-2.41e-01

Pos logit contributions
REDACTED+1.374
 Austral+1.164
Austral+1.134
Rus+1.096
alg+1.094
Neg logit contributions
 eleph-1.115
=\"-1.039
oute-1.004
horn-0.998
aba-0.998
Token since
Token ID1201
Feature activation+2.45e-02

Pos logit contributions
REDACTED+2.128
 Austral+1.811
Austral+1.760
Rus+1.696
alg+1.695
Neg logit contributions
 eleph-1.772
=\"-1.652
oute-1.595
horn-1.588
aba-1.584
Token as
Token ID355
Feature activation+2.72e-01

Pos logit contributions
 eleph+0.408
=\"+0.382
oute+0.368
aba+0.367
horn+0.366
Neg logit contributions
REDACTED-0.526
 Austral-0.444
Austral-0.439
Rus-0.421
alg-0.419
Token one
Token ID530
Feature activation+2.40e-02

Pos logit contributions
 eleph+2.040
=\"+1.924
oute+1.856
aba+1.851
horn+1.824
Neg logit contributions
REDACTED-2.328
 Austral-1.935
Austral-1.926
Rus-1.854
alg-1.822
Token cohesive
Token ID42403
Feature activation+2.04e-01

Pos logit contributions
 eleph+0.175
=\"+0.163
oute+0.157
horn+0.157
aba+0.155
Neg logit contributions
REDACTED-0.214
 Austral-0.181
Austral-0.178
Rus-0.171
alg-0.171
Token public
Token ID1171
Feature activation-6.92e-02

Pos logit contributions
 eleph+1.351
=\"+1.263
oute+1.217
horn+1.213
aba+1.210
Neg logit contributions
REDACTED-1.915
 Austral-1.623
Austral-1.611
Rus-1.550
alg-1.546
Token arena
Token ID13478
Feature activation-3.81e-01

Pos logit contributions
REDACTED+0.613
 Austral+0.518
Austral+0.506
Rus+0.485
alg+0.485
Neg logit contributions
 eleph-0.507
=\"-0.473
oute-0.458
aba-0.456
horn-0.455
Token,
Token ID11
Feature activation-3.73e-02

Pos logit contributions
REDACTED+3.390
 Austral+2.920
Austral+2.795
alg+2.719
Rus+2.716
Neg logit contributions
 eleph-2.746
=\"-2.523
oute-2.437
horn-2.418
aba-2.413
Token the
Token ID262
Feature activation-1.81e-01

Pos logit contributions
REDACTED+0.313
 Austral+0.262
Austral+0.257
Rus+0.245
alg+0.245
Neg logit contributions
 eleph-0.289
=\"-0.271
oute-0.263
horn-0.262
aba-0.262
Token intersection
Token ID16246
Feature activation-4.36e-02

Pos logit contributions
REDACTED+1.565
 Austral+1.320
Austral+1.291
Rus+1.238
alg+1.235
Neg logit contributions
 eleph-1.365
=\"-1.277
oute-1.237
horn-1.232
aba-1.229
Token of
Token ID286
Feature activation+6.32e-02

Pos logit contributions
REDACTED+0.390
 Austral+0.331
Austral+0.325
alg+0.312
Rus+0.312
Neg logit contributions
 eleph-0.313
=\"-0.292
oute-0.283
horn-0.281
aba-0.280
Token Y
Token ID575
Feature activation+9.19e-02

Pos logit contributions
 eleph+0.491
=\"+0.465
oute+0.452
aba+0.448
horn+0.448
Neg logit contributions
REDACTED-0.525
 Austral-0.433
Austral-0.431
Rus-0.409
alg-0.408
Tokenonge
Token ID14220
Feature activation-1.01e-01

Pos logit contributions
 eleph+0.527
=\"+0.488
oute+0.482
horn+0.481
aba+0.478
Neg logit contributions
REDACTED-0.947
 Austral-0.827
Austral-0.808
Rus-0.772
SPA-0.770
Token10
Token ID940
Feature activation+1.23e-01

Pos logit contributions
REDACTED+0.273
 Austral+0.228
Austral+0.217
alg+0.209
Rus+0.208
Neg logit contributions
 eleph-0.308
=\"-0.299
horn-0.290
aba-0.289
oute-0.289
Token)
Token ID8
Feature activation+9.23e-03

Pos logit contributions
 eleph+0.638
=\"+0.613
horn+0.580
aba+0.578
oute+0.574
Neg logit contributions
REDACTED-1.312
 Austral-1.151
Austral-1.124
alg-1.096
Rus-1.092
Token (
Token ID357
Feature activation-1.08e-01

Pos logit contributions
 eleph+0.064
=\"+0.060
horn+0.058
oute+0.058
aba+0.057
Neg logit contributions
REDACTED-0.085
 Austral-0.072
Austral-0.070
alg-0.068
Rus-0.068
Token2
Token ID17
Feature activation-5.26e-02

Pos logit contributions
REDACTED+0.885
 Austral+0.749
Austral+0.721
alg+0.694
Rus+0.692
Neg logit contributions
 eleph-0.839
=\"-0.801
horn-0.775
oute-0.774
aba-0.773
Token)
Token ID8
Feature activation-1.91e-01

Pos logit contributions
REDACTED+0.547
 Austral+0.478
Austral+0.467
Rus+0.453
alg+0.453
Neg logit contributions
 eleph-0.308
=\"-0.276
oute-0.265
horn-0.262
aba-0.262
Token\n
Token ID198
Feature activation+6.79e-02

Pos logit contributions
REDACTED+1.466
 Austral+1.209
Austral+1.172
Rus+1.121
alg+1.121
Neg logit contributions
 eleph-1.620
=\"-1.522
oute-1.478
horn-1.473
aba-1.472
Token\n
Token ID198
Feature activation-2.73e-01

Pos logit contributions
=\"+0.536
 eleph+0.524
horn+0.518
aba+0.515
oute+0.511
Neg logit contributions
REDACTED-0.527
 Austral-0.439
Austral-0.423
alg-0.408
Rus-0.406
Token23
Token ID1954
Feature activation+2.25e-02

Pos logit contributions
REDACTED+2.270
 Austral+1.899
Austral+1.859
Rus+1.786
alg+1.772
Neg logit contributions
 eleph-2.161
=\"-1.994
oute-1.937
horn-1.922
aba-1.915
Token.
Token ID13
Feature activation-4.82e-02

Pos logit contributions
 eleph+0.169
=\"+0.164
horn+0.159
aba+0.158
oute+0.158
Neg logit contributions
REDACTED-0.189
 Austral-0.159
Austral-0.154
alg-0.149
Rus-0.148
Token Peter
Token ID5613
Feature activation+1.54e-01

Pos logit contributions
REDACTED+0.388
 Austral+0.321
Austral+0.313
alg+0.301
SPA+0.299
Neg logit contributions
 eleph-0.384
=\"-0.366
horn-0.356
aba-0.355
oute-0.355
Token M
Token ID337
Feature activation+4.80e-01

Pos logit contributions
 eleph+1.202
=\"+1.163
horn+1.152
aba+1.141
oute+1.140
Neg logit contributions
REDACTED-1.251
 Austral-1.017
Austral-1.002
alg-0.965
SPA-0.955
Token full
Token ID1336
Feature activation-8.62e-01

Pos logit contributions
REDACTED+3.549
 Austral+2.961
Austral+2.813
alg+2.690
Rus+2.646
Neg logit contributions
 eleph-4.239
=\"-4.002
aba-3.869
oute-3.858
horn-3.845
Token v
Token ID410
Feature activation-7.04e-01

Pos logit contributions
REDACTED+4.536
 Austral+3.597
Austral+3.228
Rus+3.142
alg+3.046
Neg logit contributions
 eleph-9.306
=\"-8.494
oute-8.409
aba-8.408
horn-8.377
Token )
Token ID1267
Feature activation-9.76e-01

Pos logit contributions
REDACTED+4.871
 Austral+4.011
Austral+3.778
Rus+3.690
alg+3.662
Neg logit contributions
 eleph-6.615
=\"-5.962
oute-5.794
horn-5.693
aba-5.675
Token {
Token ID1391
Feature activation-6.73e-01

Pos logit contributions
REDACTED+7.057
 Austral+5.953
Austral+5.535
Rus+5.518
alg+5.289
Neg logit contributions
 eleph-9.100
oute-7.519
=\"-7.492
aba-7.370
horn-7.284
Token //
Token ID3373
Feature activation-2.26e-01

Pos logit contributions
REDACTED+4.629
 Austral+3.771
Austral+3.565
alg+3.463
Rus+3.443
Neg logit contributions
 eleph-6.082
=\"-5.780
horn-5.585
oute-5.573
aba-5.525
Token Time
Token ID3862
Feature activation-6.22e-01

Pos logit contributions
REDACTED+1.888
 Austral+1.590
Austral+1.544
Rus+1.484
alg+1.482
Neg logit contributions
 eleph-1.761
=\"-1.648
oute-1.596
horn-1.589
aba-1.586
Token component
Token ID7515
Feature activation-5.12e-01

Pos logit contributions
REDACTED+4.881
 Austral+4.126
Austral+3.919
alg+3.810
Rus+3.766
Neg logit contributions
 eleph-5.217
=\"-4.735
oute-4.613
aba-4.599
horn-4.571
Token const
Token ID1500
Feature activation-2.77e-02

Pos logit contributions
REDACTED+2.704
 Austral+2.074
Austral+1.890
alg+1.797
Rus+1.780
Neg logit contributions
 eleph-5.598
=\"-5.212
horn-5.127
aba-5.105
oute-5.093
Token float
Token ID12178
Feature activation-8.85e-02

Pos logit contributions
REDACTED+0.227
 Austral+0.190
Austral+0.186
Rus+0.178
alg+0.177
Neg logit contributions
 eleph-0.218
=\"-0.208
oute-0.200
aba-0.199
horn-0.199
Token TA
Token ID21664
Feature activation+3.61e-01

Pos logit contributions
REDACTED+0.881
 Austral+0.765
Austral+0.744
Rus+0.721
alg+0.721
Neg logit contributions
 eleph-0.550
=\"-0.502
oute-0.482
horn-0.479
aba-0.477
TokenU
Token ID52
Feature activation-5.77e-01

Pos logit contributions
=\"+1.602
aba+1.471
horn+1.468
 eleph+1.460
oute+1.454
Neg logit contributions
REDACTED-4.025
 Austral-3.605
Austral-3.507
alg-3.444
Rus-3.395
Token much
Token ID881
Feature activation-1.01e-01

Pos logit contributions
 eleph+0.228
=\"+0.214
aba+0.209
horn+0.209
oute+0.207
Neg logit contributions
REDACTED-0.246
 Austral-0.206
Austral-0.200
Rus-0.193
alg-0.192
Token hurry
Token ID23290
Feature activation-2.26e-01

Pos logit contributions
REDACTED+0.736
 Austral+0.600
Austral+0.580
Rus+0.555
alg+0.552
Neg logit contributions
 eleph-0.901
=\"-0.851
horn-0.827
oute-0.827
aba-0.826
Token."
Token ID526
Feature activation-3.91e-01

Pos logit contributions
REDACTED+1.954
 Austral+1.659
Austral+1.608
Rus+1.550
alg+1.549
Neg logit contributions
 eleph-1.695
=\"-1.580
oute-1.528
horn-1.519
aba-1.516
Token<|endoftext|>
Token ID50256
Feature activation+9.37e-02

Pos logit contributions
REDACTED+3.109
 Austral+2.598
Austral+2.514
Rus+2.416
alg+2.408
Neg logit contributions
 eleph-3.209
=\"-3.001
oute-2.914
horn-2.896
aba-2.891
TokenBehind
Token ID34163
Feature activation-2.72e-01

Pos logit contributions
=\"+0.687
 eleph+0.685
horn+0.667
aba+0.662
oute+0.662
Neg logit contributions
REDACTED-0.772
 Austral-0.664
Austral-0.621
alg-0.608
Rus-0.598
Token every
Token ID790
Feature activation-6.53e-01

Pos logit contributions
REDACTED+2.231
 Austral+1.873
Austral+1.816
Rus+1.744
alg+1.743
Neg logit contributions
 eleph-2.162
=\"-2.025
oute-1.963
horn-1.954
aba-1.950
Token smart
Token ID4451
Feature activation-5.85e-02

Pos logit contributions
REDACTED+5.627
 Austral+4.895
Austral+4.687
alg+4.526
Rus+4.489
Neg logit contributions
 eleph-4.792
=\"-4.392
oute-4.240
aba-4.226
horn-4.159
Token TV
Token ID3195
Feature activation-2.60e-01

Pos logit contributions
REDACTED+0.523
 Austral+0.446
Austral+0.434
alg+0.418
Rus+0.418
Neg logit contributions
 eleph-0.422
=\"-0.393
oute-0.379
horn-0.378
aba-0.377
Token show
Token ID905
Feature activation-3.50e-01

Pos logit contributions
REDACTED+2.360
 Austral+2.038
Austral+1.976
Rus+1.908
alg+1.897
Neg logit contributions
 eleph-1.834
=\"-1.679
oute-1.616
aba-1.607
horn-1.602
Token,
Token ID11
Feature activation-4.84e-01

Pos logit contributions
REDACTED+3.202
 Austral+2.754
Austral+2.679
Rus+2.581
alg+2.578
Neg logit contributions
 eleph-2.439
=\"-2.252
oute-2.173
aba-2.152
horn-2.140
Token there
Token ID612
Feature activation-4.07e-01

Pos logit contributions
REDACTED+3.926
 Austral+3.337
Austral+3.206
alg+3.073
Rus+3.072
Neg logit contributions
 eleph-3.853
=\"-3.597
oute-3.489
aba-3.452
horn-3.448
Token Cut
Token ID9712
Feature activation-9.23e-02

Pos logit contributions
REDACTED+0.697
 Austral+0.582
Austral+0.566
alg+0.545
Rus+0.542
Neg logit contributions
 eleph-0.680
=\"-0.646
oute-0.627
horn-0.625
aba-0.624
Token the
Token ID262
Feature activation-2.24e-01

Pos logit contributions
REDACTED+0.781
 Austral+0.658
Austral+0.643
Rus+0.620
alg+0.615
Neg logit contributions
 eleph-0.707
=\"-0.662
oute-0.647
horn-0.641
aba-0.640
Token carrots
Token ID34397
Feature activation-1.06e-01

Pos logit contributions
REDACTED+1.966
 Austral+1.671
Austral+1.624
Rus+1.564
alg+1.564
Neg logit contributions
 eleph-1.659
=\"-1.546
oute-1.494
horn-1.487
aba-1.484
Token into
Token ID656
Feature activation-1.13e-01

Pos logit contributions
REDACTED+0.975
 Austral+0.832
Austral+0.816
alg+0.787
Rus+0.786
Neg logit contributions
 eleph-0.727
=\"-0.679
oute-0.658
horn-0.653
aba-0.651
Token thin
Token ID7888
Feature activation-3.91e-01

Pos logit contributions
REDACTED+0.970
 Austral+0.818
Austral+0.798
Rus+0.770
alg+0.767
Neg logit contributions
 eleph-0.850
=\"-0.797
oute-0.770
horn-0.766
aba-0.765
Token slices
Token ID24314
Feature activation-5.94e-01

Pos logit contributions
REDACTED+3.561
 Austral+3.090
Austral+2.969
alg+2.890
Rus+2.854
Neg logit contributions
 eleph-2.750
=\"-2.523
oute-2.424
horn-2.420
aba-2.396
Token and
Token ID290
Feature activation-1.78e-01

Pos logit contributions
REDACTED+4.962
 Austral+4.247
Austral+4.037
alg+3.911
Rus+3.886
Neg logit contributions
 eleph-4.556
=\"-4.176
oute-4.007
horn-4.002
aba-3.990
Token the
Token ID262
Feature activation-1.88e-01

Pos logit contributions
REDACTED+1.634
 Austral+1.400
Austral+1.363
alg+1.315
Rus+1.314
Neg logit contributions
 eleph-1.239
=\"-1.153
oute-1.113
horn-1.106
aba-1.104
Token broccoli
Token ID44653
Feature activation-2.07e-01

Pos logit contributions
REDACTED+1.592
 Austral+1.344
Austral+1.307
Rus+1.257
alg+1.256
Neg logit contributions
 eleph-1.437
=\"-1.344
oute-1.303
horn-1.297
aba-1.294
Token into
Token ID656
Feature activation-1.11e-01

Pos logit contributions
REDACTED+1.896
 Austral+1.621
Austral+1.581
Rus+1.527
alg+1.524
Neg logit contributions
 eleph-1.449
=\"-1.349
oute-1.306
horn-1.297
aba-1.293
Token small
Token ID1402
Feature activation-3.57e-01

Pos logit contributions
REDACTED+0.964
 Austral+0.813
Austral+0.795
Rus+0.768
alg+0.763
Neg logit contributions
 eleph-0.828
=\"-0.778
oute-0.752
horn-0.750
aba-0.747
Token_
Token ID62
Feature activation-3.25e-01

Pos logit contributions
REDACTED+0.714
 Austral+0.601
Austral+0.581
Rus+0.556
alg+0.555
Neg logit contributions
 eleph-0.694
=\"-0.653
horn-0.633
oute-0.631
aba-0.628
Tokenm
Token ID76
Feature activation-3.31e-02

Pos logit contributions
REDACTED+2.775
 Austral+2.333
Austral+2.279
alg+2.206
Rus+2.193
Neg logit contributions
 eleph-2.486
=\"-2.300
oute-2.232
aba-2.215
horn-2.201
Tokenoves
Token ID5241
Feature activation-4.07e-01

Pos logit contributions
REDACTED+0.295
 Austral+0.251
Austral+0.244
Rus+0.235
alg+0.233
Neg logit contributions
 eleph-0.238
=\"-0.224
oute-0.218
horn-0.216
aba-0.215
Token :
Token ID1058
Feature activation-3.10e-01

Pos logit contributions
REDACTED+2.672
 Austral+2.132
Austral+2.047
alg+1.955
Rus+1.948
Neg logit contributions
 eleph-3.892
=\"-3.678
oute-3.587
horn-3.567
aba-3.565
Token u
Token ID334
Feature activation-1.63e-01

Pos logit contributions
REDACTED+2.504
 Austral+2.105
Austral+2.028
alg+1.956
Rus+1.951
Neg logit contributions
 eleph-2.509
=\"-2.341
oute-2.272
horn-2.262
aba-2.257
Token16
Token ID1433
Feature activation-6.44e-01

Pos logit contributions
REDACTED+1.331
 Austral+1.114
Austral+1.079
alg+1.034
Rus+1.033
Neg logit contributions
 eleph-1.307
=\"-1.233
oute-1.196
horn-1.189
aba-1.189
Token,
Token ID837
Feature activation-3.97e-01

Pos logit contributions
REDACTED+4.174
 Austral+3.314
Austral+3.136
alg+3.076
Rus+3.058
Neg logit contributions
 eleph-6.157
=\"-5.800
oute-5.645
horn-5.640
aba-5.572
Token //
Token ID3373
Feature activation+8.43e-02

Pos logit contributions
REDACTED+2.946
 Austral+2.429
Austral+2.333
alg+2.245
Rus+2.236
Neg logit contributions
 eleph-3.468
=\"-3.258
oute-3.168
horn-3.156
aba-3.148
Token How
Token ID1374
Feature activation+2.67e-01

Pos logit contributions
 eleph+0.679
=\"+0.649
oute+0.630
horn+0.627
aba+0.623
Neg logit contributions
REDACTED-0.673
 Austral-0.556
Austral-0.548
Rus-0.519
SPA-0.515
Token many
Token ID867
Feature activation+7.25e-01

Pos logit contributions
 eleph+1.946
=\"+1.872
oute+1.802
horn+1.790
aba+1.782
Neg logit contributions
REDACTED-2.328
 Austral-1.935
Austral-1.917
Rus-1.853
alg-1.815
Token moves
Token ID6100
Feature activation+1.68e-01

Pos logit contributions
 eleph+4.632
=\"+4.599
horn+4.297
oute+4.278
aba+4.197
Neg logit contributions
REDACTED-6.557
Austral-5.495
 Austral-5.453
Rus-5.177
SPA-5.167
Token and
Token ID290
Feature activation-2.04e-01

Pos logit contributions
REDACTED+0.109
 Austral+0.093
Austral+0.091
Rus+0.088
alg+0.088
Neg logit contributions
 eleph-0.077
=\"-0.072
oute-0.069
horn-0.069
aba-0.068
Token sol
Token ID1540
Feature activation-1.87e-01

Pos logit contributions
REDACTED+1.553
 Austral+1.284
Austral+1.242
Rus+1.188
alg+1.186
Neg logit contributions
 eleph-1.734
=\"-1.639
oute-1.590
horn-1.585
aba-1.581
Tokenes
Token ID274
Feature activation+2.44e-01

Pos logit contributions
REDACTED+1.876
 Austral+1.631
Austral+1.594
alg+1.548
Rus+1.542
Neg logit contributions
 eleph-1.180
=\"-1.048
oute-1.006
aba-0.996
horn-0.995
Token")
Token ID4943
Feature activation+8.79e-02

Pos logit contributions
=\"+1.191
 eleph+1.172
aba+1.126
horn+1.124
oute+1.101
Neg logit contributions
REDACTED-2.633
 Austral-2.299
Austral-2.260
Rus-2.186
alg-2.186
Token is
Token ID318
Feature activation+5.27e-02

Pos logit contributions
 eleph+0.533
=\"+0.507
horn+0.484
oute+0.482
aba+0.482
Neg logit contributions
REDACTED-0.869
 Austral-0.752
Austral-0.735
Rus-0.713
alg-0.711
Token an
Token ID281
Feature activation-8.43e-01

Pos logit contributions
 eleph+0.409
=\"+0.387
oute+0.374
horn+0.373
aba+0.373
Neg logit contributions
REDACTED-0.438
 Austral-0.367
Austral-0.357
Rus-0.344
alg-0.343
Token autos
Token ID44619
Feature activation-8.60e-01

Pos logit contributions
REDACTED+3.046
 Austral+2.119
Austral+1.796
alg+1.732
Rus+1.544
Neg logit contributions
 eleph-10.645
=\"-9.887
aba-9.664
oute-9.652
horn-9.646
Tokenomal
Token ID18048
Feature activation-1.42e-01

Pos logit contributions
REDACTED+4.656
 Austral+3.620
Austral+3.498
alg+3.452
SPA+3.141
Neg logit contributions
 eleph-9.814
=\"-8.331
horn-8.216
oute-8.136
aba-8.062
Token dominant
Token ID11410
Feature activation-3.30e-01

Pos logit contributions
REDACTED+1.240
 Austral+1.057
Austral+1.023
Rus+0.984
alg+0.983
Neg logit contributions
 eleph-1.068
=\"-0.986
oute-0.954
aba-0.950
horn-0.949
Token ker
Token ID41927
Feature activation-2.52e-01

Pos logit contributions
REDACTED+3.276
 Austral+2.850
Austral+2.761
alg+2.694
Rus+2.670
Neg logit contributions
 eleph-2.087
=\"-1.882
oute-1.809
aba-1.783
horn-1.782
Tokenat
Token ID265
Feature activation-1.32e-01

Pos logit contributions
REDACTED+2.235
 Austral+1.905
Austral+1.865
alg+1.803
Rus+1.793
Neg logit contributions
 eleph-1.928
=\"-1.687
oute-1.636
horn-1.609
aba-1.602