TOP ACTIVATIONS
MAX = 2.501

<|endoftext|> WASHINGTON A Cy gn us cargo mission to the
Southern Poverty Law Center , ass ailed the move :
rer , leader of the Santiago de Cuba - based Patri
semester ), Beta The ta Pi ($ 400 per semester ),
to the Governor General , ass uring her that the coalition
s largest party abandoned President Dil ma Rousse ff
picked up by ABC affiliate KN X N .
m sorry , sir , but you
worsened of late , with Madrid being accused regularly of politic
B Jade . Ven ue : Poly techn ic
As riel 's d æ mon is a snow le opard
The government faces aboriginal opposition to Kinder Morgan
voters , than the bully pul pit . PK claims that
iber g said . # Bull s Cody Wester lund
fair play . Ven ky s plans
likely voters , according to Real Clear Politics , said a
Mac , Android , Raspberry Pi , Nvidia Shield , Chrom
are this week meeting in Beijing for a high - level
Nat Ames : No , sir . In fact , nobody
back track a bit . Prior to voting to repeal net

MIN ACTIVATIONS
MIN = -2.408

a " Man hun ting Timeline " in 2010 by US
charges as a function of prosecut orial discretion should not be
mism atched heroes , Jack Aub rey , a British naval
Doctor ), around the fifth episode in , when we were
18 . 7 10 United Arab Emirates 19 Having
the ' 2 , 315 polling stations all over the region
oppression , humanity was at last free from the cruel tyranny
my New York adventures on Snapchat today ( I
a Jewish majority city ; thus , Israel s
the April 1986 launch vehicle accident , Per kin - El
up . And at the last minute he called and said
for the lower house ; thus only 40 of the 76
seen the trailer for David F inc her s
the main story Thus Massachusetts is one of the
: 13 PH OTOS Donald Trump 's potential running
Best Buy Is Microsoft a cooler company than it
meaning he can lock the Quin jet or Kr ang down
Israel isation of Arab land serves the overall Zionist
, it moves a plastic tab away from the shif ter
[ Play boy CEO Scott F landers ] said , "

INTERVAL 1.274 - 2.501
CONTAINS 0.410%

David Wise ( SN ES , 1995 )
$ 8 million due to Carlos Quentin . But fact oring
and will likely face much resistance from tobacco companies , which
on his second day in Tokyo . Desc ribing
this season , including last week when Cutler scored 22 .

INTERVAL 0.047 - 1.274
CONTAINS 45.113%

full - length statue of a poet in Toronto , the
has got to music . We 're getting a cross -
The better known is the main Brazilian branch of the As
men s Et iquette , 18 77 .
or , alone ly young man named Norman Bates ( Anthony

INTERVAL -1.180 - 0.047
CONTAINS 53.863%

Pal let Ads - He ath Led ger Dead Naked -
and career would change forever . He took eight months off
Should Dev ial et s Silver Phant oms
Rhodes must stay focused on deep throws and fades .
ill essen , who started very high up the pitch .

INTERVAL -2.408 - -1.180
CONTAINS 0.613%

anything that questions the new State religion of supposedly catastrophic man
Inc ., 213 50 42 Ontario Inc . and 213 50
s no limit . When there s people motivated
5 . 25 million ( 21 st among guards )
make the math simpler , let 's say it 's a
Token<|endoftext|>
Token ID50256
Feature activation+1.41e+00

Pos logit contributions
yip+0.103
ific+0.097
ibaba+0.097
wrapper+0.095
iang+0.084
Neg logit contributions
 ILCS-0.061
ARB-0.058
RAG-0.055
ESH-0.055
ONS-0.053
TokenWASHINGTON
Token ID21793
Feature activation-2.05e-01

Pos logit contributions
yip+4.824
ific+4.490
ibaba+4.458
wrapper+4.438
iang+3.833
Neg logit contributions
 ILCS-3.578
ARB-3.365
RAG-3.225
 Powers-3.201
ESH-3.199
Token
Token ID851
Feature activation-4.45e-01

Pos logit contributions
 ILCS+0.511
ARB+0.493
RAG+0.470
ESH+0.466
ONS+0.454
Neg logit contributions
yip-0.729
ibaba-0.676
ific-0.670
wrapper-0.660
ignt-0.575
Token A
Token ID317
Feature activation+4.93e-02

Pos logit contributions
 ILCS+1.050
ARB+1.008
RAG+0.965
ESH+0.959
 Powers+0.950
Neg logit contributions
yip-1.662
ibaba-1.527
ific-1.506
wrapper-1.492
ciplinary-1.319
Token Cy
Token ID5934
Feature activation-2.66e-01

Pos logit contributions
yip+0.176
ibaba+0.162
ific+0.161
wrapper+0.158
ignt+0.139
Neg logit contributions
 ILCS-0.122
ARB-0.118
RAG-0.114
ESH-0.113
 Powers-0.110
Tokengn
Token ID4593
Feature activation+2.50e+00

Pos logit contributions
 ILCS+0.756
ARB+0.730
RAG+0.699
 Powers+0.688
ESH+0.688
Neg logit contributions
yip-0.852
ibaba-0.779
ific-0.778
wrapper-0.757
ignt-0.645
Tokenus
Token ID385
Feature activation+1.10e+00

Pos logit contributions
yip+6.436
ific+6.129
ibaba+5.846
wrapper+5.745
uador+4.921
Neg logit contributions
 ILCS-8.122
ARB-7.896
RAG-7.738
Wikipedia-7.419
ESH-7.413
Token cargo
Token ID15892
Feature activation-2.72e-01

Pos logit contributions
yip+3.803
ific+3.549
ibaba+3.521
wrapper+3.492
iang+3.022
Neg logit contributions
 ILCS-2.809
ARB-2.685
RAG-2.565
ESH-2.543
Wikipedia-2.466
Token mission
Token ID4365
Feature activation+3.98e-01

Pos logit contributions
 ILCS+0.702
ARB+0.685
RAG+0.660
ESH+0.652
 Powers+0.636
Neg logit contributions
yip-0.942
ibaba-0.860
ific-0.847
wrapper-0.834
ignt-0.730
Token to
Token ID284
Feature activation+1.04e-01

Pos logit contributions
yip+1.402
ibaba+1.298
ific+1.292
wrapper+1.271
ignt+1.106
Neg logit contributions
 ILCS-1.002
ARB-0.968
RAG-0.925
ESH-0.917
ONS-0.891
Token the
Token ID262
Feature activation-5.50e-02

Pos logit contributions
yip+0.354
ific+0.323
ibaba+0.321
wrapper+0.318
ignt+0.273
Neg logit contributions
 ILCS-0.273
ARB-0.267
RAG-0.258
ESH-0.254
 Powers-0.250
Token Southern
Token ID8050
Feature activation+1.64e-01

Pos logit contributions
yip+2.125
ibaba+1.978
ific+1.969
wrapper+1.941
ignt+1.701
Neg logit contributions
 ILCS-1.299
ARB-1.250
RAG-1.188
ESH-1.177
 Powers-1.143
Token Poverty
Token ID30627
Feature activation-1.91e-01

Pos logit contributions
yip+0.517
ibaba+0.474
ific+0.473
wrapper+0.464
ignt+0.395
Neg logit contributions
 ILCS-0.473
ARB-0.458
RAG-0.441
ESH-0.437
ONS-0.427
Token Law
Token ID3854
Feature activation-2.34e-01

Pos logit contributions
 ILCS+0.369
ARB+0.351
RAG+0.326
ESH+0.323
 Powers+0.319
Neg logit contributions
yip-0.787
ibaba-0.745
ific-0.731
wrapper-0.719
ignt-0.646
Token Center
Token ID3337
Feature activation-4.92e-01

Pos logit contributions
 ILCS+0.494
ARB+0.475
RAG+0.450
ESH+0.443
ONS+0.431
Neg logit contributions
yip-0.929
ibaba-0.855
ific-0.847
wrapper-0.842
ignt-0.749
Token,
Token ID11
Feature activation+3.41e-01

Pos logit contributions
 ILCS+0.985
ARB+0.951
RAG+0.899
ESH+0.887
 Powers+0.859
Neg logit contributions
yip-1.974
ibaba-1.843
ific-1.830
wrapper-1.812
ignt-1.602
Token ass
Token ID840
Feature activation+2.38e+00

Pos logit contributions
yip+1.186
ibaba+1.096
ific+1.091
wrapper+1.073
ignt+0.931
Neg logit contributions
 ILCS-0.871
ARB-0.843
RAG-0.806
ESH-0.798
 Powers-0.778
Tokenailed
Token ID6255
Feature activation+8.97e-01

Pos logit contributions
yip+7.026
ific+6.570
ibaba+6.425
wrapper+6.195
ignt+5.446
Neg logit contributions
 ILCS-7.070
ARB-6.648
ESH-6.379
Leary-6.379
RAG-6.320
Token the
Token ID262
Feature activation+4.29e-01

Pos logit contributions
yip+3.046
ific+2.848
ibaba+2.830
wrapper+2.776
ignt+2.424
Neg logit contributions
 ILCS-2.313
ARB-2.211
RAG-2.135
ESH-2.086
ONS-2.057
Token move
Token ID1445
Feature activation+2.48e-01

Pos logit contributions
yip+1.459
ibaba+1.345
ific+1.339
wrapper+1.318
ignt+1.138
Neg logit contributions
 ILCS-1.132
ARB-1.093
RAG-1.045
ESH-1.038
ONS-1.011
Token:
Token ID25
Feature activation-3.59e-01

Pos logit contributions
yip+0.836
ibaba+0.770
ific+0.764
wrapper+0.754
ignt+0.649
Neg logit contributions
 ILCS-0.662
ARB-0.640
RAG-0.614
ESH-0.609
Wikipedia-0.594
Token
Token ID198
Feature activation+9.08e-02

Pos logit contributions
 ILCS+0.832
ARB+0.793
RAG+0.756
ESH+0.746
ONS+0.723
Neg logit contributions
yip-1.331
ific-1.253
ibaba-1.245
wrapper-1.221
ignt-1.076
Tokenrer
Token ID11751
Feature activation+2.77e-01

Pos logit contributions
 ILCS+1.076
ARB+1.050
RAG+1.006
ESH+0.989
ONS+0.960
Neg logit contributions
yip-1.624
ibaba-1.500
wrapper-1.485
ific-1.478
ignt-1.289
Token,
Token ID11
Feature activation+5.81e-02

Pos logit contributions
yip+1.066
ibaba+0.992
ific+0.986
wrapper+0.977
ignt+0.859
Neg logit contributions
 ILCS-0.607
ARB-0.583
RAG-0.553
ESH-0.546
 Powers-0.533
Token leader
Token ID3554
Feature activation+3.68e-01

Pos logit contributions
yip+0.206
ibaba+0.191
ific+0.190
wrapper+0.187
ignt+0.162
Neg logit contributions
 ILCS-0.144
ARB-0.140
RAG-0.134
ESH-0.133
 Powers-0.130
Token of
Token ID286
Feature activation+1.30e-01

Pos logit contributions
yip+1.178
ific+1.083
ibaba+1.081
wrapper+1.058
ignt+0.907
Neg logit contributions
 ILCS-1.042
ARB-1.007
RAG-0.968
ESH-0.960
ONS-0.938
Token the
Token ID262
Feature activation+1.96e-01

Pos logit contributions
yip+0.454
ibaba+0.420
ific+0.418
wrapper+0.411
ignt+0.355
Neg logit contributions
 ILCS-0.331
ARB-0.320
RAG-0.308
ESH-0.305
 Powers-0.298
Token Santiago
Token ID34802
Feature activation+2.29e+00

Pos logit contributions
yip+0.685
ibaba+0.636
ific+0.630
wrapper+0.618
ignt+0.536
Neg logit contributions
 ILCS-0.497
ARB-0.478
RAG-0.459
ESH-0.455
 Powers-0.449
Token de
Token ID390
Feature activation+5.32e-01

Pos logit contributions
yip+7.722
ific+7.247
ibaba+7.168
wrapper+7.058
ignt+6.276
Neg logit contributions
 ILCS-5.770
ARB-5.635
ESH-5.363
RAG-5.315
Wikipedia-5.176
Token Cuba
Token ID14159
Feature activation-9.04e-02

Pos logit contributions
yip+1.942
ific+1.805
ibaba+1.805
wrapper+1.774
ignt+1.551
Neg logit contributions
 ILCS-1.265
ARB-1.216
RAG-1.160
ESH-1.148
ONS-1.114
Token-
Token ID12
Feature activation+2.53e-01

Pos logit contributions
 ILCS+0.240
ARB+0.230
RAG+0.222
ESH+0.220
 Powers+0.216
Neg logit contributions
yip-0.306
ific-0.282
ibaba-0.281
wrapper-0.276
ciplinary-0.237
Tokenbased
Token ID3106
Feature activation+2.00e-01

Pos logit contributions
yip+0.863
ibaba+0.804
ific+0.795
wrapper+0.778
ignt+0.672
Neg logit contributions
 ILCS-0.659
ARB-0.646
RAG-0.617
ESH-0.610
Wikipedia-0.598
Token Patri
Token ID9606
Feature activation+4.86e-01

Pos logit contributions
yip+0.700
ibaba+0.647
ific+0.641
wrapper+0.632
ciplinary+0.547
Neg logit contributions
 ILCS-0.509
ARB-0.489
RAG-0.470
ESH-0.467
 Powers-0.458
Token semester
Token ID24878
Feature activation-3.64e-01

Pos logit contributions
 ILCS+1.338
ARB+1.285
ESH+1.233
RAG+1.232
 Avery+1.210
Neg logit contributions
yip-1.375
ibaba-1.271
ific-1.229
wrapper-1.222
ignt-1.049
Token),
Token ID828
Feature activation+3.93e-01

Pos logit contributions
 ILCS+0.766
ARB+0.729
RAG+0.688
ESH+0.681
ONS+0.660
Neg logit contributions
yip-1.437
ibaba-1.337
ific-1.323
wrapper-1.313
ignt-1.165
Token Beta
Token ID17993
Feature activation-1.51e-01

Pos logit contributions
yip+1.386
ibaba+1.281
ific+1.276
wrapper+1.257
ignt+1.092
Neg logit contributions
 ILCS-0.990
ARB-0.953
RAG-0.911
ESH-0.903
 Powers-0.878
Token The
Token ID383
Feature activation+8.62e-01

Pos logit contributions
 ILCS+0.386
ARB+0.369
RAG+0.355
ESH+0.353
 Powers+0.346
Neg logit contributions
yip-0.527
ibaba-0.486
ific-0.482
wrapper-0.478
ignt-0.412
Tokenta
Token ID8326
Feature activation+6.20e-01

Pos logit contributions
yip+2.590
ibaba+2.365
ific+2.353
wrapper+2.312
ignt+1.948
Neg logit contributions
 ILCS-2.614
ARB-2.535
RAG-2.443
ESH-2.422
ONS-2.369
Token Pi
Token ID13993
Feature activation+2.19e+00

Pos logit contributions
yip+2.083
ibaba+1.919
ific+1.914
wrapper+1.882
ignt+1.621
Neg logit contributions
 ILCS-1.661
ARB-1.603
RAG-1.537
ESH-1.524
ONS-1.485
Token ($
Token ID7198
Feature activation+1.87e-01

Pos logit contributions
yip+6.468
ific+6.085
ibaba+5.922
wrapper+5.862
ignt+5.018
Neg logit contributions
 ILCS-6.286
ARB-6.094
RAG-5.844
ESH-5.770
Wikipedia-5.709
Token400
Token ID7029
Feature activation+6.99e-02

Pos logit contributions
yip+0.719
ibaba+0.670
ific+0.666
wrapper+0.656
ignt+0.580
Neg logit contributions
 ILCS-0.414
ARB-0.399
RAG-0.379
ESH-0.375
Wikipedia-0.364
Token per
Token ID583
Feature activation-8.55e-01

Pos logit contributions
yip+0.247
ibaba+0.228
ific+0.226
wrapper+0.223
ignt+0.194
Neg logit contributions
 ILCS-0.177
ARB-0.170
RAG-0.162
ESH-0.161
ONS-0.157
Token semester
Token ID24878
Feature activation-7.11e-01

Pos logit contributions
 ILCS+2.621
ARB+2.538
RAG+2.438
ESH+2.429
 Avery+2.385
Neg logit contributions
yip-2.527
ibaba-2.322
ific-2.267
wrapper-2.240
ignt-1.912
Token),
Token ID828
Feature activation+4.32e-01

Pos logit contributions
 ILCS+1.481
ARB+1.404
RAG+1.326
ESH+1.310
ONS+1.269
Neg logit contributions
yip-2.822
ibaba-2.623
ific-2.604
wrapper-2.584
ignt-2.291
Token to
Token ID284
Feature activation+3.88e-01

Pos logit contributions
yip+3.093
ific+2.881
ibaba+2.863
wrapper+2.822
ignt+2.470
Neg logit contributions
 ILCS-2.075
ARB-1.993
RAG-1.900
ESH-1.879
ONS-1.828
Token the
Token ID262
Feature activation+1.52e-01

Pos logit contributions
yip+1.324
ibaba+1.222
ific+1.220
wrapper+1.199
ignt+1.036
Neg logit contributions
 ILCS-1.017
ARB-0.982
RAG-0.940
ESH-0.931
ONS-0.908
Token Governor
Token ID10807
Feature activation+3.94e-01

Pos logit contributions
yip+0.547
ibaba+0.509
ific+0.503
wrapper+0.496
ignt+0.432
Neg logit contributions
 ILCS-0.371
ARB-0.359
RAG-0.341
ESH-0.337
 Powers-0.334
Token General
Token ID3611
Feature activation+3.02e-01

Pos logit contributions
yip+1.458
ibaba+1.355
ific+1.353
wrapper+1.331
ignt+1.167
Neg logit contributions
 ILCS-0.917
ARB-0.881
RAG-0.838
ESH-0.830
ONS-0.805
Token,
Token ID11
Feature activation+3.62e-01

Pos logit contributions
yip+1.120
ibaba+1.040
ific+1.032
wrapper+1.020
ignt+0.890
Neg logit contributions
 ILCS-0.708
ARB-0.681
RAG-0.648
ESH-0.640
 Powers-0.628
Token ass
Token ID840
Feature activation+2.15e+00

Pos logit contributions
yip+1.308
ibaba+1.214
ific+1.206
wrapper+1.191
ignt+1.036
Neg logit contributions
 ILCS-0.883
ARB-0.851
RAG-0.811
ESH-0.803
 Powers-0.783
Tokenuring
Token ID870
Feature activation+4.01e-01

Pos logit contributions
yip+6.009
ific+5.629
ibaba+5.459
wrapper+5.333
ignt+4.625
Neg logit contributions
 ILCS-6.714
ARB-6.479
RAG-6.247
ESH-6.184
ONS-6.080
Token her
Token ID607
Feature activation-1.51e-01

Pos logit contributions
yip+1.409
ibaba+1.304
ific+1.301
wrapper+1.279
ignt+1.111
Neg logit contributions
 ILCS-1.010
ARB-0.974
RAG-0.931
ESH-0.921
ONS-0.897
Token that
Token ID326
Feature activation+9.92e-02

Pos logit contributions
 ILCS+0.328
ARB+0.316
RAG+0.301
ESH+0.297
 Powers+0.289
Neg logit contributions
yip-0.584
ibaba-0.545
ific-0.540
wrapper-0.533
iang-0.469
Token the
Token ID262
Feature activation-4.65e-02

Pos logit contributions
yip+0.393
ibaba+0.367
ific+0.366
wrapper+0.361
ignt+0.319
Neg logit contributions
 ILCS-0.206
ARB-0.197
RAG-0.186
ESH-0.184
ONS-0.178
Token coalition
Token ID9906
Feature activation-2.36e-01

Pos logit contributions
 ILCS+0.112
ARB+0.109
RAG+0.103
 Powers+0.102
ESH+0.102
Neg logit contributions
yip-0.169
ibaba-0.157
ific-0.155
wrapper-0.153
iang-0.133
Tokens
Token ID82
Feature activation+1.01e-02

Pos logit contributions
 ILCS+1.583
ARB+1.514
RAG+1.448
ESH+1.432
 Powers+1.403
Neg logit contributions
yip-2.421
ific-2.234
ibaba-2.233
wrapper-2.211
ignt-1.933
Token largest
Token ID4387
Feature activation+4.27e-01

Pos logit contributions
yip+0.036
ibaba+0.033
ific+0.033
wrapper+0.033
ignt+0.028
Neg logit contributions
 ILCS-0.025
ARB-0.024
RAG-0.023
ESH-0.023
 Powers-0.022
Token party
Token ID2151
Feature activation+1.99e-01

Pos logit contributions
yip+1.538
ibaba+1.424
ific+1.421
wrapper+1.399
ignt+1.221
Neg logit contributions
 ILCS-1.041
ARB-1.001
RAG-0.955
ESH-0.945
ONS-0.919
Token abandoned
Token ID9958
Feature activation+6.17e-02

Pos logit contributions
yip+0.729
ibaba+0.679
ific+0.673
wrapper+0.666
ignt+0.582
Neg logit contributions
 ILCS-0.466
ARB-0.454
RAG-0.433
ESH-0.428
 Powers-0.417
Token President
Token ID1992
Feature activation+4.64e-01

Pos logit contributions
yip+0.220
ibaba+0.203
ific+0.202
wrapper+0.200
ignt+0.173
Neg logit contributions
 ILCS-0.151
ARB-0.146
RAG-0.141
ESH-0.140
 Powers-0.137
Token Dil
Token ID29328
Feature activation+2.14e+00

Pos logit contributions
yip+1.369
ibaba+1.248
ific+1.244
wrapper+1.220
ignt+1.025
Neg logit contributions
 ILCS-1.433
ARB-1.390
RAG-1.341
ESH-1.331
ONS-1.302
Tokenma
Token ID2611
Feature activation+7.70e-01

Pos logit contributions
yip+4.299
ibaba+3.927
ific+3.907
wrapper+3.795
iang+2.999
Neg logit contributions
 ILCS-8.414
ARB-8.157
RAG-7.953
ESH-7.885
ONS-7.791
Token Rousse
Token ID42849
Feature activation+1.65e+00

Pos logit contributions
yip+2.774
ibaba+2.531
ific+2.520
wrapper+2.496
ignt+2.126
Neg logit contributions
 ILCS-1.906
ARB-1.861
RAG-1.803
ESH-1.772
 Powers-1.742
Tokenff
Token ID487
Feature activation+5.86e-01

Pos logit contributions
yip+4.192
ibaba+3.751
ific+3.726
wrapper+3.635
ignt+2.926
Neg logit contributions
 ILCS-5.755
ARB-5.623
RAG-5.456
ESH-5.410
Wikipedia-5.315
Token
Token ID447
Feature activation+2.54e-01

Pos logit contributions
yip+2.113
ibaba+1.957
ific+1.951
wrapper+1.922
ignt+1.675
Neg logit contributions
 ILCS-1.424
ARB-1.371
RAG-1.312
ESH-1.297
ONS-1.260
Token
Token ID247
Feature activation-1.96e-01

Pos logit contributions
yip+0.925
ibaba+0.849
ific+0.845
wrapper+0.834
ignt+0.724
Neg logit contributions
 ILCS-0.610
ARB-0.593
RAG-0.569
ESH-0.559
 Powers-0.545
Token picked
Token ID6497
Feature activation+9.61e-02

Pos logit contributions
yip+0.291
ibaba+0.270
ific+0.264
wrapper+0.262
ignt+0.229
Neg logit contributions
 ILCS-0.219
ARB-0.214
RAG-0.204
ESH-0.201
 Powers-0.198
Token up
Token ID510
Feature activation-1.66e-01

Pos logit contributions
yip+0.340
ibaba+0.317
ific+0.311
wrapper+0.309
ignt+0.269
Neg logit contributions
 ILCS-0.239
ARB-0.231
RAG-0.221
ESH-0.219
 Powers-0.213
Token by
Token ID416
Feature activation+3.06e-01

Pos logit contributions
 ILCS+0.411
ARB+0.398
RAG+0.381
ESH+0.378
 Powers+0.369
Neg logit contributions
yip-0.587
ibaba-0.542
ific-0.538
wrapper-0.530
ignt-0.464
TokenABC
Token ID24694
Feature activation-6.86e-01

Pos logit contributions
yip+1.119
ibaba+1.040
ific+1.035
wrapper+1.018
ignt+0.894
Neg logit contributions
 ILCS-0.726
ARB-0.703
RAG-0.671
ESH-0.663
 Powers-0.645
Token affiliate
Token ID17375
Feature activation+2.72e-01

Pos logit contributions
 ILCS+1.707
ARB+1.696
RAG+1.610
ESH+1.604
 Powers+1.566
Neg logit contributions
yip-2.426
ibaba-2.216
ific-2.194
wrapper-2.154
ciplinary-1.884
Token KN
Token ID22466
Feature activation+2.10e+00

Pos logit contributions
yip+0.917
ibaba+0.845
ific+0.833
wrapper+0.819
ciplinary+0.707
Neg logit contributions
 ILCS-0.736
ARB-0.715
RAG-0.689
ESH-0.689
 Powers-0.670
TokenX
Token ID55
Feature activation+1.16e+00

Pos logit contributions
yip+7.514
ific+7.149
wrapper+6.927
ibaba+6.859
iang+6.119
Neg logit contributions
 ILCS-4.766
ARB-4.414
RAG-4.244
Wikipedia-4.219
 Avery-4.154
TokenN
Token ID45
Feature activation+7.42e-01

Pos logit contributions
yip+3.961
ific+3.661
ibaba+3.655
wrapper+3.590
ignt+3.110
Neg logit contributions
 ILCS-3.001
ARB-2.885
RAG-2.758
ESH-2.732
Wikipedia-2.671
Token.
Token ID13
Feature activation+3.23e-01

Pos logit contributions
yip+2.714
ific+2.534
ibaba+2.518
wrapper+2.486
ignt+2.176
Neg logit contributions
 ILCS-1.742
ARB-1.669
RAG-1.587
ESH-1.561
Wikipedia-1.533
Token
Token ID198
Feature activation-2.32e-01

Pos logit contributions
yip+1.274
ibaba+1.189
ific+1.187
wrapper+1.169
ignt+1.035
Neg logit contributions
 ILCS-0.673
ARB-0.644
RAG-0.609
ESH-0.602
ONS-0.582
Token
Token ID198
Feature activation-1.67e-01

Pos logit contributions
 ILCS+0.400
ARB+0.388
ESH+0.361
RAG+0.357
Wikipedia+0.349
Neg logit contributions
yip-1.018
ibaba-0.948
ific-0.919
wrapper-0.915
ignt-0.837
Token
Token ID447
Feature activation+1.49e-01

Pos logit contributions
yip+1.120
ibaba+1.035
ific+1.029
wrapper+1.014
ignt+0.881
Neg logit contributions
 ILCS-0.798
ARB-0.772
RAG-0.736
ESH-0.732
ONS-0.714
Token
Token ID247
Feature activation+2.57e-02

Pos logit contributions
yip+0.585
ibaba+0.544
ific+0.541
wrapper+0.533
ignt+0.471
Neg logit contributions
 ILCS-0.313
ARB-0.302
RAG-0.287
ESH-0.282
Wikipedia-0.273
Tokenm
Token ID76
Feature activation+6.23e-01

Pos logit contributions
yip+0.089
ibaba+0.083
ific+0.082
wrapper+0.081
uador+0.070
Neg logit contributions
 ILCS-0.065
ARB-0.062
RAG-0.060
ESH-0.060
 Powers-0.058
Token sorry
Token ID7926
Feature activation-3.77e-01

Pos logit contributions
yip+2.216
ibaba+2.053
ific+2.050
wrapper+2.016
ignt+1.756
Neg logit contributions
 ILCS-1.540
ARB-1.482
RAG-1.415
ESH-1.402
ONS-1.362
Token,
Token ID11
Feature activation+7.38e-01

Pos logit contributions
 ILCS+0.795
ARB+0.765
RAG+0.721
ESH+0.715
 Powers+0.695
Neg logit contributions
yip-1.474
ibaba-1.378
ific-1.363
wrapper-1.347
ignt-1.194
Token sir
Token ID15967
Feature activation+2.09e+00

Pos logit contributions
yip+2.717
ibaba+2.523
ific+2.518
wrapper+2.478
ignt+2.170
Neg logit contributions
 ILCS-1.735
ARB-1.667
RAG-1.588
ESH-1.572
ONS-1.526
Token,
Token ID11
Feature activation+6.92e-01

Pos logit contributions
yip+7.377
ific+7.094
ibaba+6.853
wrapper+6.847
ignt+5.979
Neg logit contributions
 ILCS-4.789
ARB-4.606
RAG-4.483
ESH-4.398
ONS-4.342
Token but
Token ID475
Feature activation+6.14e-01

Pos logit contributions
yip+2.432
ibaba+2.250
ific+2.248
wrapper+2.209
ignt+1.920
Neg logit contributions
 ILCS-1.740
ARB-1.676
RAG-1.602
ESH-1.587
ONS-1.544
Token you
Token ID345
Feature activation+9.39e-01

Pos logit contributions
yip+2.181
ific+2.022
ibaba+2.021
wrapper+1.985
ignt+1.728
Neg logit contributions
 ILCS-1.518
ARB-1.459
RAG-1.396
ESH-1.380
ONS-1.344
Token
Token ID447
Feature activation+4.14e-01

Pos logit contributions
yip+2.966
ific+2.730
ibaba+2.720
wrapper+2.672
ignt+2.278
Neg logit contributions
 ILCS-2.684
ARB-2.592
RAG-2.490
ESH-2.472
Wikipedia-2.408
Token
Token ID247
Feature activation+1.95e-01

Pos logit contributions
yip+1.598
ibaba+1.475
ific+1.467
wrapper+1.448
ignt+1.270
Neg logit contributions
 ILCS-0.911
ARB-0.885
RAG-0.836
ESH-0.821
 Powers-0.800
Token worsened
Token ID44838
Feature activation+5.92e-01

Pos logit contributions
yip+0.011
ibaba+0.010
ific+0.010
wrapper+0.010
ignt+0.009
Neg logit contributions
 ILCS-0.011
ARB-0.011
RAG-0.010
ESH-0.010
 Powers-0.010
Token of
Token ID286
Feature activation+7.68e-01

Pos logit contributions
yip+1.971
ibaba+1.814
ific+1.801
wrapper+1.773
ignt+1.528
Neg logit contributions
 ILCS-1.603
ARB-1.550
RAG-1.488
ESH-1.477
 Powers-1.440
Token late
Token ID2739
Feature activation+9.34e-01

Pos logit contributions
yip+2.360
ific+2.162
ibaba+2.157
wrapper+2.112
ignt+1.791
Neg logit contributions
 ILCS-2.272
ARB-2.195
RAG-2.113
ESH-2.098
ONS-2.049
Token,
Token ID11
Feature activation+4.63e-01

Pos logit contributions
yip+3.471
ific+3.244
ibaba+3.230
wrapper+3.176
ignt+2.792
Neg logit contributions
 ILCS-2.141
ARB-2.061
RAG-1.962
ESH-1.934
Wikipedia-1.878
Token with
Token ID351
Feature activation+1.01e-01

Pos logit contributions
yip+1.677
ibaba+1.557
ific+1.548
wrapper+1.529
ignt+1.332
Neg logit contributions
 ILCS-1.116
ARB-1.077
RAG-1.028
ESH-1.017
 Powers-0.991
Token Madrid
Token ID14708
Feature activation+2.09e+00

Pos logit contributions
yip+0.344
ibaba+0.320
ific+0.315
wrapper+0.312
ignt+0.269
Neg logit contributions
 ILCS-0.264
ARB-0.256
RAG-0.245
ESH-0.241
 Powers-0.238
Token being
Token ID852
Feature activation-6.92e-03

Pos logit contributions
yip+7.460
ific+7.062
ibaba+6.851
wrapper+6.849
ignt+6.070
Neg logit contributions
 ILCS-4.852
ARB-4.633
RAG-4.402
ESH-4.340
Wikipedia-4.279
Token accused
Token ID5371
Feature activation-8.87e-02

Pos logit contributions
 ILCS+0.017
ARB+0.016
RAG+0.016
ESH+0.016
 Powers+0.015
Neg logit contributions
yip-0.025
ibaba-0.023
ific-0.023
wrapper-0.022
ignt-0.019
Token regularly
Token ID7987
Feature activation+2.95e-01

Pos logit contributions
 ILCS+0.258
ARB+0.248
RAG+0.237
ESH+0.236
Wikipedia+0.231
Neg logit contributions
yip-0.279
ific-0.258
ibaba-0.255
wrapper-0.250
ignt-0.215
Token of
Token ID286
Feature activation+2.16e-01

Pos logit contributions
yip+0.910
ific+0.845
ibaba+0.826
wrapper+0.812
ignt+0.696
Neg logit contributions
 ILCS-0.865
ARB-0.839
RAG-0.801
ESH-0.799
Wikipedia-0.780
Token politic
Token ID31723
Feature activation+8.39e-01

Pos logit contributions
yip+0.809
ibaba+0.754
ific+0.748
wrapper+0.738
ignt+0.647
Neg logit contributions
 ILCS-0.493
ARB-0.473
RAG-0.451
ESH-0.447
 Powers-0.435
TokenB
Token ID33
Feature activation-3.93e-01

Pos logit contributions
 ILCS+0.069
ARB+0.066
RAG+0.063
ESH+0.063
 Powers+0.061
Neg logit contributions
yip-0.100
ific-0.092
ibaba-0.092
wrapper-0.090
ciplinary-0.079
Token Jade
Token ID23386
Feature activation+3.29e-01

Pos logit contributions
 ILCS+0.933
ARB+0.906
RAG+0.866
ESH+0.852
ONS+0.832
Neg logit contributions
yip-1.444
ibaba-1.333
ific-1.319
wrapper-1.300
ciplinary-1.152
Token.
Token ID13
Feature activation+2.21e-01

Pos logit contributions
yip+1.221
ibaba+1.134
ific+1.129
wrapper+1.112
ignt+0.975
Neg logit contributions
 ILCS-0.767
ARB-0.735
RAG-0.703
ESH-0.698
 Powers-0.680
Token
Token ID198
Feature activation+1.85e-01

Pos logit contributions
yip+0.832
ibaba+0.774
ific+0.769
wrapper+0.759
ignt+0.670
Neg logit contributions
 ILCS-0.504
ARB-0.481
RAG-0.459
ESH-0.456
 Powers-0.443
Token
Token ID198
Feature activation+1.95e-01

Pos logit contributions
yip+0.787
ibaba+0.734
ific+0.720
wrapper+0.714
ignt+0.644
Neg logit contributions
 ILCS-0.342
ARB-0.329
RAG-0.307
ESH-0.307
Wikipedia-0.296
TokenVen
Token ID37522
Feature activation+2.07e+00

Pos logit contributions
yip+0.665
ibaba+0.613
ific+0.606
wrapper+0.595
ignt+0.519
Neg logit contributions
 ILCS-0.514
ARB-0.501
RAG-0.480
ESH-0.477
Wikipedia-0.467
Tokenue
Token ID518
Feature activation+8.19e-01

Pos logit contributions
yip+7.597
ific+7.103
ibaba+6.993
wrapper+6.954
iang+6.122
Neg logit contributions
 ILCS-4.808
ARB-4.595
RAG-4.317
ESH-4.270
 Avery-4.169
Token:
Token ID25
Feature activation+3.79e-02

Pos logit contributions
yip+2.980
ibaba+2.775
ific+2.771
wrapper+2.726
iang+2.371
Neg logit contributions
 ILCS-1.944
ARB-1.875
RAG-1.782
ESH-1.754
ONS-1.713
Token Poly
Token ID12280
Feature activation-5.54e-03

Pos logit contributions
yip+0.145
ibaba+0.135
ific+0.135
wrapper+0.133
ignt+0.118
Neg logit contributions
 ILCS-0.084
ARB-0.080
RAG-0.076
ESH-0.075
ONS-0.073
Tokentechn
Token ID23873
Feature activation+6.43e-01

Pos logit contributions
 ILCS+0.015
ARB+0.014
RAG+0.014
ESH+0.013
ONS+0.013
Neg logit contributions
yip-0.019
ific-0.017
ibaba-0.017
wrapper-0.017
ignt-0.015
Tokenic
Token ID291
Feature activation-1.26e-01

Pos logit contributions
yip+1.959
ibaba+1.788
ific+1.783
wrapper+1.751
ignt+1.482
Neg logit contributions
 ILCS-1.921
ARB-1.862
RAG-1.792
ESH-1.779
ONS-1.739
Token As
Token ID1081
Feature activation+9.06e-02

Pos logit contributions
yip+0.280
ibaba+0.260
ific+0.257
wrapper+0.257
ignt+0.221
Neg logit contributions
 ILCS-0.208
ARB-0.200
RAG-0.192
ESH-0.189
 Powers-0.187
Tokenriel
Token ID11719
Feature activation-1.60e-01

Pos logit contributions
yip+0.305
ibaba+0.282
ific+0.280
wrapper+0.275
ignt+0.239
Neg logit contributions
 ILCS-0.237
ARB-0.229
RAG-0.220
ESH-0.216
 Powers-0.214
Token's
Token ID338
Feature activation-6.44e-01

Pos logit contributions
 ILCS+0.374
ARB+0.359
RAG+0.342
ESH+0.339
 Powers+0.334
Neg logit contributions
yip-0.595
ibaba-0.550
ific-0.547
wrapper-0.541
ignt-0.472
Token d
Token ID288
Feature activation-2.36e-01

Pos logit contributions
 ILCS+1.609
ARB+1.539
RAG+1.498
ESH+1.467
 Powers+1.453
Neg logit contributions
yip-2.257
ibaba-2.106
ific-2.074
wrapper-2.031
uador-1.787
Tokenæ
Token ID21241
Feature activation-6.15e-01

Pos logit contributions
 ILCS+0.611
ARB+0.598
RAG+0.573
ESH+0.561
ONS+0.561
Neg logit contributions
yip-0.806
ibaba-0.731
wrapper-0.719
ific-0.715
ignt-0.622
Tokenmon
Token ID2144
Feature activation+2.06e+00

Pos logit contributions
 ILCS+2.060
ARB+2.002
RAG+1.949
ESH+1.918
ONS+1.900
Neg logit contributions
yip-1.615
ibaba-1.447
ific-1.436
wrapper-1.405
uador-1.152
Token is
Token ID318
Feature activation+1.03e-01

Pos logit contributions
yip+7.130
ific+6.675
ibaba+6.582
wrapper+6.415
iang+5.688
Neg logit contributions
 ILCS-5.125
ARB-4.926
RAG-4.687
ESH-4.645
Wikipedia-4.521
Token a
Token ID257
Feature activation+1.56e-01

Pos logit contributions
yip+0.375
ibaba+0.348
ific+0.344
wrapper+0.339
ignt+0.297
Neg logit contributions
 ILCS-0.245
ARB-0.236
RAG-0.227
ESH-0.223
Wikipedia-0.218
Token snow
Token ID6729
Feature activation-4.53e-01

Pos logit contributions
yip+0.542
ibaba+0.501
ific+0.498
wrapper+0.488
ignt+0.427
Neg logit contributions
 ILCS-0.400
ARB-0.385
RAG-0.373
ESH-0.367
Wikipedia-0.358
Token le
Token ID443
Feature activation+1.17e-01

Pos logit contributions
 ILCS+1.086
ARB+1.047
RAG+0.994
ESH+0.986
ONS+0.961
Neg logit contributions
yip-1.650
ific-1.527
ibaba-1.527
wrapper-1.503
ignt-1.311
Tokenopard
Token ID15478
Feature activation-5.45e-01

Pos logit contributions
yip+0.377
ibaba+0.347
ific+0.344
wrapper+0.341
uador+0.294
Neg logit contributions
 ILCS-0.323
ARB-0.311
RAG-0.295
ESH-0.293
 Avery-0.292
Token
Token ID198
Feature activation-2.78e-01

Pos logit contributions
 ILCS+0.575
ARB+0.559
ESH+0.519
RAG+0.514
Wikipedia+0.503
Neg logit contributions
yip-1.516
ibaba-1.410
ific-1.362
wrapper-1.359
ignt-1.248
TokenThe
Token ID464
Feature activation+5.34e-01

Pos logit contributions
 ILCS+0.619
ARB+0.614
RAG+0.579
ESH+0.572
ONS+0.562
Neg logit contributions
yip-1.063
ibaba-0.990
ific-0.975
wrapper-0.958
ignt-0.846
Token government
Token ID1230
Feature activation+7.02e-01

Pos logit contributions
yip+1.939
ibaba+1.798
ific+1.791
wrapper+1.760
ignt+1.539
Neg logit contributions
 ILCS-1.290
ARB-1.242
RAG-1.183
ESH-1.172
 Powers-1.142
Token faces
Token ID6698
Feature activation+2.14e-01

Pos logit contributions
yip+2.498
ific+2.321
ibaba+2.312
wrapper+2.277
ignt+1.984
Neg logit contributions
 ILCS-1.723
ARB-1.658
RAG-1.586
ESH-1.570
Wikipedia-1.525
Token aboriginal
Token ID48328
Feature activation-6.89e-01

Pos logit contributions
yip+0.740
ibaba+0.684
ific+0.681
wrapper+0.670
ignt+0.581
Neg logit contributions
 ILCS-0.548
ARB-0.530
RAG-0.507
ESH-0.503
ONS-0.490
Token opposition
Token ID5471
Feature activation+2.06e+00

Pos logit contributions
 ILCS+1.553
ARB+1.510
RAG+1.436
 Powers+1.432
ESH+1.413
Neg logit contributions
yip-2.591
ibaba-2.407
ific-2.372
wrapper-2.335
ciplinary-2.049
Token to
Token ID284
Feature activation+6.18e-01

Pos logit contributions
yip+6.559
ific+6.144
ibaba+6.057
wrapper+6.027
ignt+5.229
Neg logit contributions
 ILCS-5.597
ARB-5.293
RAG-5.113
Wikipedia-5.071
ESH-5.043
Token Kinder
Token ID46832
Feature activation+9.72e-01

Pos logit contributions
yip+2.191
ibaba+2.028
ific+2.027
wrapper+1.993
ignt+1.735
Neg logit contributions
 ILCS-1.535
ARB-1.477
RAG-1.411
ESH-1.397
ONS-1.359
Token Morgan
Token ID10805
Feature activation+2.43e-01

Pos logit contributions
yip+3.668
ibaba+3.412
ific+3.383
wrapper+3.338
ignt+2.923
Neg logit contributions
 ILCS-2.217
ARB-2.126
RAG-2.026
ESH-2.005
 Powers-1.950
Token
Token ID447
Feature activation+1.40e-02

Pos logit contributions
yip+0.910
ibaba+0.847
ific+0.841
wrapper+0.830
ignt+0.727
Neg logit contributions
 ILCS-0.559
ARB-0.538
RAG-0.512
ESH-0.507
 Powers-0.494
Token
Token ID247
Feature activation-3.07e-01

Pos logit contributions
yip+0.060
ibaba+0.056
ific+0.056
wrapper+0.055
ignt+0.049
Neg logit contributions
 ILCS-0.025
ARB-0.024
RAG-0.022
ESH-0.022
Wikipedia-0.021
Token voters
Token ID4446
Feature activation-8.09e-01

Pos logit contributions
yip+0.053
ibaba+0.049
ific+0.049
wrapper+0.048
ignt+0.042
Neg logit contributions
 ILCS-0.035
ARB-0.034
RAG-0.033
ESH-0.032
 Powers-0.032
Token,
Token ID11
Feature activation+1.83e-01

Pos logit contributions
 ILCS+2.081
ARB+2.013
RAG+1.943
ESH+1.931
 Powers+1.896
Neg logit contributions
yip-2.772
ibaba-2.573
ific-2.506
wrapper-2.497
ignt-2.160
Token than
Token ID621
Feature activation-1.76e-01

Pos logit contributions
yip+0.639
ibaba+0.590
ific+0.586
wrapper+0.581
ignt+0.502
Neg logit contributions
 ILCS-0.468
ARB-0.452
RAG-0.434
ESH-0.429
 Powers-0.422
Token the
Token ID262
Feature activation+1.64e-01

Pos logit contributions
 ILCS+0.436
ARB+0.419
RAG+0.405
ESH+0.401
 Powers+0.392
Neg logit contributions
yip-0.625
ibaba-0.580
ific-0.574
wrapper-0.567
ignt-0.496
Token bully
Token ID27410
Feature activation+5.21e-01

Pos logit contributions
yip+0.555
ibaba+0.514
ific+0.509
wrapper+0.499
ignt+0.433
Neg logit contributions
 ILCS-0.438
ARB-0.422
RAG-0.405
ESH-0.402
 Powers-0.395
Token pul
Token ID17472
Feature activation+2.06e+00

Pos logit contributions
yip+1.580
ibaba+1.444
ific+1.433
wrapper+1.406
ignt+1.192
Neg logit contributions
 ILCS-1.567
ARB-1.517
RAG-1.463
ESH-1.452
Wikipedia-1.420
Tokenpit
Token ID15544
Feature activation+7.86e-01

Pos logit contributions
yip+3.278
ific+2.764
ibaba+2.710
wrapper+2.634
ignt+1.773
Neg logit contributions
 ILCS-9.068
ARB-8.957
RAG-8.715
ESH-8.637
ONS-8.515
Token.
Token ID13
Feature activation+1.67e-01

Pos logit contributions
yip+2.935
ibaba+2.732
ific+2.719
wrapper+2.679
ignt+2.353
Neg logit contributions
 ILCS-1.812
ARB-1.737
RAG-1.654
ESH-1.639
Wikipedia-1.589
Token PK
Token ID29673
Feature activation+1.29e+00

Pos logit contributions
yip+0.604
ibaba+0.559
ific+0.552
wrapper+0.546
ignt+0.479
Neg logit contributions
 ILCS-0.410
ARB-0.393
RAG-0.379
ESH-0.376
 Powers-0.368
Token claims
Token ID3667
Feature activation-1.28e-01

Pos logit contributions
yip+4.551
ific+4.220
ibaba+4.199
wrapper+4.125
iang+3.603
Neg logit contributions
 ILCS-3.194
ARB-3.062
RAG-2.917
ESH-2.895
ONS-2.819
Token that
Token ID326
Feature activation-1.99e-01

Pos logit contributions
 ILCS+0.287
ARB+0.277
RAG+0.264
ESH+0.261
 Powers+0.253
Neg logit contributions
yip-0.482
ibaba-0.449
ific-0.447
wrapper-0.440
ignt-0.387
Tokeniber
Token ID1856
Feature activation-7.65e-01

Pos logit contributions
 ILCS+1.213
ARB+1.213
ESH+1.121
RAG+1.119
 Avery+1.108
Neg logit contributions
yip-1.414
ibaba-1.279
ific-1.272
wrapper-1.245
ciplinary-1.079
Tokeng
Token ID70
Feature activation-1.99e-01

Pos logit contributions
 ILCS+2.564
ARB+2.553
RAG+2.479
ESH+2.444
 Avery+2.414
Neg logit contributions
yip-2.015
ibaba-1.826
ific-1.727
wrapper-1.709
ignt-1.418
Token said
Token ID531
Feature activation-6.22e-01

Pos logit contributions
 ILCS+0.535
ARB+0.518
RAG+0.493
ESH+0.490
 Powers+0.486
Neg logit contributions
yip-0.673
ibaba-0.614
ific-0.600
wrapper-0.596
ignt-0.522
Token.
Token ID13
Feature activation-1.25e-01

Pos logit contributions
 ILCS+1.333
ARB+1.281
RAG+1.212
ESH+1.209
 Avery+1.197
Neg logit contributions
yip-2.423
ibaba-2.258
ific-2.214
wrapper-2.201
ignt-1.942
Token #
Token ID1303
Feature activation-5.66e-01

Pos logit contributions
 ILCS+0.270
ARB+0.258
RAG+0.246
ESH+0.244
 Powers+0.237
Neg logit contributions
yip-0.486
ibaba-0.450
ific-0.446
wrapper-0.442
ignt-0.393
TokenBull
Token ID39549
Feature activation+2.03e+00

Pos logit contributions
 ILCS+1.434
ARB+1.411
RAG+1.352
ESH+1.336
Wikipedia+1.304
Neg logit contributions
yip-1.962
ibaba-1.823
ific-1.801
wrapper-1.771
ignt-1.549
Tokens
Token ID82
Feature activation+8.26e-01

Pos logit contributions
yip+6.854
ific+6.363
ibaba+6.240
wrapper+6.235
ignt+5.446
Neg logit contributions
 ILCS-5.239
ARB-4.918
ESH-4.793
RAG-4.770
ONS-4.625
Token
Token ID851
Feature activation-1.78e-01

Pos logit contributions
yip+2.887
ific+2.675
ibaba+2.667
wrapper+2.624
ignt+2.289
Neg logit contributions
 ILCS-2.084
ARB-1.992
RAG-1.914
ESH-1.895
Wikipedia-1.839
Token Cody
Token ID27035
Feature activation+3.53e-02

Pos logit contributions
 ILCS+0.406
ARB+0.389
RAG+0.369
ESH+0.367
 Powers+0.358
Neg logit contributions
yip-0.674
ibaba-0.626
ific-0.621
wrapper-0.614
ignt-0.541
Token Wester
Token ID32104
Feature activation-1.38e+00

Pos logit contributions
yip+0.130
ibaba+0.122
wrapper+0.119
ific+0.119
ignt+0.104
Neg logit contributions
 ILCS-0.083
ARB-0.080
 Powers-0.076
RAG-0.076
 Avery-0.075
Tokenlund
Token ID37525
Feature activation-2.22e-01

Pos logit contributions
 ILCS+3.045
ARB+2.858
RAG+2.751
ESH+2.728
 Powers+2.681
Neg logit contributions
yip-5.315
ibaba-4.868
wrapper-4.807
ific-4.768
ciplinary-4.222
Token fair
Token ID3148
Feature activation-3.29e-01

Pos logit contributions
yip+0.979
ibaba+0.908
ific+0.908
wrapper+0.892
ignt+0.779
Neg logit contributions
 ILCS-0.659
ARB-0.634
RAG-0.605
ESH-0.599
ONS-0.582
Token play
Token ID711
Feature activation+1.96e-01

Pos logit contributions
 ILCS+0.903
ARB+0.872
RAG+0.838
ESH+0.826
 Powers+0.815
Neg logit contributions
yip-1.083
ibaba-0.994
ific-0.980
wrapper-0.971
ignt-0.835
Token.
Token ID13
Feature activation-1.07e-01

Pos logit contributions
yip+0.747
ibaba+0.695
ific+0.694
wrapper+0.683
ignt+0.601
Neg logit contributions
 ILCS-0.439
ARB-0.420
RAG-0.399
ESH-0.395
ONS-0.383
Token
Token ID198
Feature activation-2.65e-01

Pos logit contributions
 ILCS+0.227
ARB+0.214
RAG+0.205
ESH+0.201
 Powers+0.200
Neg logit contributions
yip-0.426
ibaba-0.395
ific-0.388
wrapper-0.386
ignt-0.343
Token
Token ID198
Feature activation-2.32e-01

Pos logit contributions
 ILCS+0.461
ARB+0.444
ESH+0.414
RAG+0.413
Wikipedia+0.400
Neg logit contributions
yip-1.164
ibaba-1.082
ific-1.051
wrapper-1.045
ignt-0.955
TokenVen
Token ID37522
Feature activation+1.99e+00

Pos logit contributions
 ILCS+0.593
ARB+0.587
RAG+0.562
ESH+0.552
ONS+0.547
Neg logit contributions
yip-0.822
ibaba-0.758
ific-0.736
wrapper-0.726
ignt-0.637
Tokenky
Token ID2584
Feature activation-1.81e-01

Pos logit contributions
yip+5.318
ific+4.799
ibaba+4.792
wrapper+4.657
ignt+3.830
Neg logit contributions
 ILCS-6.683
ARB-6.519
RAG-6.285
ESH-6.251
ONS-6.132
Token
Token ID447
Feature activation+1.41e-01

Pos logit contributions
 ILCS+0.430
ARB+0.412
RAG+0.393
ESH+0.387
 Powers+0.383
Neg logit contributions
yip-0.663
ibaba-0.613
ific-0.609
wrapper-0.602
ignt-0.526
Token
Token ID247
Feature activation-2.81e-01

Pos logit contributions
yip+0.551
ibaba+0.516
ific+0.516
wrapper+0.508
ignt+0.449
Neg logit contributions
 ILCS-0.296
ARB-0.282
RAG-0.267
ESH-0.265
Wikipedia-0.256
Tokens
Token ID82
Feature activation+7.44e-02

Pos logit contributions
 ILCS+1.001
ARB+0.973
RAG+0.944
ESH+0.940
 Powers+0.931
Neg logit contributions
yip-0.696
ibaba-0.619
ific-0.608
wrapper-0.599
ignt-0.484
Token plans
Token ID3352
Feature activation-8.27e-02

Pos logit contributions
yip+0.247
ibaba+0.227
ific+0.226
wrapper+0.223
ignt+0.191
Neg logit contributions
 ILCS-0.202
ARB-0.196
RAG-0.188
ESH-0.186
 Powers-0.182
Token likely
Token ID1884
Feature activation+2.94e-02

Pos logit contributions
yip+1.413
ibaba+1.303
ific+1.299
wrapper+1.280
ignt+1.110
Neg logit contributions
 ILCS-1.015
ARB-0.980
RAG-0.933
ESH-0.928
 Powers-0.900
Token voters
Token ID4446
Feature activation-5.15e-01

Pos logit contributions
yip+0.112
ibaba+0.103
ific+0.103
wrapper+0.102
ignt+0.089
Neg logit contributions
 ILCS-0.067
ARB-0.064
RAG-0.061
ESH-0.060
 Powers-0.058
Token,
Token ID11
Feature activation+1.91e-01

Pos logit contributions
 ILCS+1.129
ARB+1.085
ESH+1.021
RAG+1.021
 Avery+0.981
Neg logit contributions
yip-1.990
ibaba-1.842
ific-1.817
wrapper-1.807
ignt-1.591
Token according
Token ID1864
Feature activation+3.54e-01

Pos logit contributions
yip+0.643
ibaba+0.592
ific+0.586
wrapper+0.577
ignt+0.498
Neg logit contributions
 ILCS-0.511
ARB-0.497
RAG-0.475
ESH-0.472
 Powers-0.462
Token to
Token ID284
Feature activation+1.51e-01

Pos logit contributions
yip+1.273
ibaba+1.163
ific+1.141
wrapper+1.129
ignt+0.997
Neg logit contributions
 ILCS-0.893
ARB-0.871
RAG-0.827
ESH-0.821
 Powers-0.805
Token Real
Token ID6416
Feature activation+1.99e+00

Pos logit contributions
yip+0.531
ific+0.483
ibaba+0.483
wrapper+0.476
ignt+0.415
Neg logit contributions
 ILCS-0.389
ARB-0.374
RAG-0.357
ESH-0.356
 Powers-0.348
TokenClear
Token ID19856
Feature activation+6.57e-01

Pos logit contributions
yip+7.025
ific+6.668
ibaba+6.493
wrapper+6.464
ignt+5.689
Neg logit contributions
 ILCS-4.741
ARB-4.520
RAG-4.262
ESH-4.241
ONS-4.156
TokenPolitics
Token ID43921
Feature activation+9.44e-01

Pos logit contributions
yip+2.287
ibaba+2.104
wrapper+2.038
ific+2.038
uador+1.765
Neg logit contributions
ARB-1.709
 ILCS-1.697
RAG-1.606
ESH-1.592
ONS-1.568
Token,
Token ID11
Feature activation+2.15e-01

Pos logit contributions
yip+3.405
ific+3.231
ibaba+3.161
wrapper+3.154
iang+2.751
Neg logit contributions
 ILCS-2.216
ARB-2.066
RAG-2.023
ESH-1.968
Wikipedia-1.923
Token said
Token ID531
Feature activation-4.01e-01

Pos logit contributions
yip+0.750
ibaba+0.690
ific+0.684
wrapper+0.674
ignt+0.584
Neg logit contributions
 ILCS-0.551
ARB-0.537
RAG-0.512
ESH-0.507
 Avery-0.496
Token a
Token ID257
Feature activation-3.88e-03

Pos logit contributions
 ILCS+0.890
ARB+0.870
RAG+0.822
ESH+0.808
 Powers+0.789
Neg logit contributions
yip-1.528
ibaba-1.420
ific-1.406
wrapper-1.391
ciplinary-1.221
Token Mac
Token ID4100
Feature activation-4.11e-01

Pos logit contributions
yip+0.509
ibaba+0.471
ific+0.470
wrapper+0.461
ciplinary+0.403
Neg logit contributions
 ILCS-0.364
ARB-0.350
RAG-0.335
ESH-0.332
 Powers-0.326
Token,
Token ID11
Feature activation+1.37e-01

Pos logit contributions
 ILCS+0.874
ARB+0.829
RAG+0.786
ESH+0.780
 Powers+0.762
Neg logit contributions
yip-1.603
ibaba-1.498
ific-1.487
wrapper-1.462
ignt-1.305
Token Android
Token ID5565
Feature activation-5.47e-01

Pos logit contributions
yip+0.491
ibaba+0.454
ific+0.454
wrapper+0.446
ciplinary+0.390
Neg logit contributions
 ILCS-0.336
ARB-0.324
RAG-0.309
ESH-0.305
 Powers-0.300
Token,
Token ID11
Feature activation+9.53e-02

Pos logit contributions
 ILCS+1.200
ARB+1.137
RAG+1.079
ESH+1.076
 Powers+1.056
Neg logit contributions
yip-2.110
ibaba-1.965
ific-1.946
wrapper-1.915
ciplinary-1.703
Token Raspberry
Token ID24244
Feature activation+5.40e-01

Pos logit contributions
yip+0.387
ibaba+0.362
ific+0.360
wrapper+0.356
ciplinary+0.317
Neg logit contributions
 ILCS-0.189
ARB-0.181
RAG-0.170
ESH-0.167
 Powers-0.165
Token Pi
Token ID13993
Feature activation+1.97e+00

Pos logit contributions
yip+1.756
ific+1.619
ibaba+1.612
wrapper+1.586
ignt+1.362
Neg logit contributions
 ILCS-1.497
ARB-1.447
RAG-1.386
ESH-1.374
ONS-1.343
Token,
Token ID11
Feature activation+2.02e-01

Pos logit contributions
yip+7.060
ific+6.620
wrapper+6.550
ibaba+6.472
ignt+5.676
Neg logit contributions
 ILCS-4.564
ARB-4.367
RAG-4.135
ESH-4.085
ONS-3.979
Token Nvidia
Token ID27699
Feature activation-1.35e+00

Pos logit contributions
yip+0.765
ibaba+0.711
ific+0.709
wrapper+0.699
ignt+0.615
Neg logit contributions
 ILCS-0.455
ARB-0.435
RAG-0.414
ESH-0.408
 Powers-0.400
Token Shield
Token ID10022
Feature activation-9.82e-02

Pos logit contributions
 ILCS+3.112
ARB+3.018
RAG+2.875
 Powers+2.826
ESH+2.823
Neg logit contributions
yip-4.948
ibaba-4.624
ific-4.509
wrapper-4.445
ignt-3.930
Token,
Token ID11
Feature activation-1.88e-02

Pos logit contributions
 ILCS+0.217
ARB+0.208
RAG+0.199
ESH+0.196
 Powers+0.191
Neg logit contributions
yip-0.376
ibaba-0.351
ific-0.348
wrapper-0.343
ignt-0.302
Token Chrom
Token ID18255
Feature activation-1.96e-01

Pos logit contributions
 ILCS+0.043
ARB+0.041
RAG+0.040
ESH+0.039
 Powers+0.038
Neg logit contributions
yip-0.070
ibaba-0.065
ific-0.065
wrapper-0.064
ignt-0.056
Token are
Token ID389
Feature activation-1.55e-01

Pos logit contributions
 ILCS+0.040
ARB+0.039
RAG+0.037
ESH+0.037
ONS+0.036
Neg logit contributions
yip-0.059
ibaba-0.054
ific-0.054
wrapper-0.053
ignt-0.046
Token this
Token ID428
Feature activation-2.12e-01

Pos logit contributions
 ILCS+0.360
ARB+0.347
RAG+0.332
ESH+0.328
ONS+0.321
Neg logit contributions
yip-0.579
ibaba-0.535
ific-0.531
wrapper-0.522
ignt-0.459
Token week
Token ID1285
Feature activation+6.42e-01

Pos logit contributions
 ILCS+0.620
ARB+0.600
RAG+0.581
ESH+0.580
ONS+0.564
Neg logit contributions
yip-0.658
ific-0.595
ibaba-0.594
wrapper-0.583
ignt-0.499
Token meeting
Token ID3249
Feature activation+8.44e-02

Pos logit contributions
yip+2.463
ific+2.305
ibaba+2.304
wrapper+2.266
ignt+1.998
Neg logit contributions
 ILCS-1.400
ARB-1.338
RAG-1.271
ESH-1.251
Wikipedia-1.214
Token in
Token ID287
Feature activation+5.43e-01

Pos logit contributions
yip+0.314
ibaba+0.292
ific+0.291
wrapper+0.287
ignt+0.251
Neg logit contributions
 ILCS-0.195
ARB-0.188
RAG-0.179
ESH-0.177
ONS-0.172
Token Beijing
Token ID11618
Feature activation+1.95e+00

Pos logit contributions
yip+1.620
ibaba+1.477
ific+1.475
wrapper+1.444
iang+1.215
Neg logit contributions
 ILCS-1.653
ARB-1.605
RAG-1.542
ESH-1.535
ONS-1.499
Token for
Token ID329
Feature activation+3.91e-01

Pos logit contributions
yip+6.320
ific+5.941
wrapper+5.854
ibaba+5.853
iang+5.105
Neg logit contributions
 ILCS-5.156
ARB-4.860
ESH-4.684
RAG-4.642
ONS-4.571
Token a
Token ID257
Feature activation+5.63e-01

Pos logit contributions
yip+1.328
ibaba+1.226
ific+1.224
wrapper+1.203
ignt+1.039
Neg logit contributions
 ILCS-1.031
ARB-0.994
RAG-0.951
ESH-0.944
ONS-0.919
Token high
Token ID1029
Feature activation+6.22e-01

Pos logit contributions
yip+1.914
ibaba+1.764
ific+1.761
wrapper+1.729
ignt+1.495
Neg logit contributions
 ILCS-1.480
ARB-1.433
RAG-1.376
ESH-1.360
 Powers-1.329
Token-
Token ID12
Feature activation+3.12e-01

Pos logit contributions
yip+2.194
ific+2.051
ibaba+2.031
wrapper+1.985
ciplinary+1.748
Neg logit contributions
 ILCS-1.541
ARB-1.486
RAG-1.433
ESH-1.405
ONS-1.374
Tokenlevel
Token ID5715
Feature activation-1.89e-01

Pos logit contributions
yip+1.012
ibaba+0.930
ific+0.928
wrapper+0.910
ignt+0.780
Neg logit contributions
 ILCS-0.873
ARB-0.845
RAG-0.811
ESH-0.804
ONS-0.785
TokenNat
Token ID47849
Feature activation+2.35e-01

Pos logit contributions
yip+0.597
ibaba+0.544
ific+0.542
wrapper+0.531
ignt+0.448
Neg logit contributions
 ILCS-0.612
ARB-0.595
RAG-0.573
ESH-0.569
Wikipedia-0.557
Token Ames
Token ID46843
Feature activation-3.91e-01

Pos logit contributions
yip+1.077
ibaba+1.011
ific+1.009
wrapper+0.999
ignt+0.901
Neg logit contributions
 ILCS-0.347
ARB-0.325
RAG-0.298
ESH-0.294
 Powers-0.283
Token:
Token ID25
Feature activation+1.73e-01

Pos logit contributions
 ILCS+0.808
ARB+0.776
ESH+0.729
 Powers+0.724
RAG+0.723
Neg logit contributions
yip-1.579
ibaba-1.458
ific-1.430
wrapper-1.411
ignt-1.275
Token No
Token ID1400
Feature activation+5.31e-01

Pos logit contributions
yip+0.605
ibaba+0.558
ific+0.553
wrapper+0.547
ignt+0.477
Neg logit contributions
 ILCS-0.444
ARB-0.428
RAG-0.409
ESH-0.407
 Powers-0.399
Token,
Token ID11
Feature activation+2.12e-01

Pos logit contributions
yip+1.998
ific+1.882
ibaba+1.854
wrapper+1.837
iang+1.613
Neg logit contributions
 ILCS-1.195
ARB-1.137
RAG-1.096
ESH-1.063
Wikipedia-1.025
Token sir
Token ID15967
Feature activation+1.95e+00

Pos logit contributions
yip+0.751
ibaba+0.695
ific+0.693
wrapper+0.682
ignt+0.594
Neg logit contributions
 ILCS-0.526
ARB-0.507
RAG-0.484
ESH-0.480
Wikipedia-0.467
Token.
Token ID13
Feature activation+9.71e-01

Pos logit contributions
yip+6.666
ific+6.439
ibaba+6.189
wrapper+6.153
ignt+5.421
Neg logit contributions
 ILCS-4.605
ARB-4.379
RAG-4.313
ESH-4.243
ONS-4.154
Token In
Token ID554
Feature activation+4.96e-01

Pos logit contributions
yip+3.414
ific+3.216
ibaba+3.182
wrapper+3.133
iang+2.735
Neg logit contributions
 ILCS-2.373
ARB-2.280
RAG-2.182
ESH-2.147
ONS-2.081
Token fact
Token ID1109
Feature activation-3.09e-01

Pos logit contributions
yip+1.541
ibaba+1.412
ific+1.411
wrapper+1.383
ignt+1.174
Neg logit contributions
 ILCS-1.447
ARB-1.399
RAG-1.348
ESH-1.335
ONS-1.306
Token,
Token ID11
Feature activation-1.38e-01

Pos logit contributions
 ILCS+0.662
ARB+0.627
RAG+0.601
ESH+0.586
ONS+0.572
Neg logit contributions
yip-1.210
ific-1.132
ibaba-1.126
wrapper-1.109
ignt-0.981
Token nobody
Token ID8168
Feature activation-9.74e-02

Pos logit contributions
 ILCS+0.326
ARB+0.316
ESH+0.298
RAG+0.298
 Powers+0.292
Neg logit contributions
yip-0.506
ibaba-0.470
ific-0.467
wrapper-0.462
ignt-0.405
Token back
Token ID736
Feature activation-4.12e-01

Pos logit contributions
 ILCS+0.204
ARB+0.198
RAG+0.190
ESH+0.187
 Powers+0.185
Neg logit contributions
yip-0.244
ibaba-0.225
ific-0.224
wrapper-0.220
ciplinary-0.189
Tokentrack
Token ID11659
Feature activation+3.49e-01

Pos logit contributions
 ILCS+0.989
ARB+0.973
RAG+0.932
ESH+0.912
 Powers+0.893
Neg logit contributions
yip-1.497
ibaba-1.380
ific-1.353
wrapper-1.337
ciplinary-1.172
Token a
Token ID257
Feature activation+4.36e-01

Pos logit contributions
yip+1.245
ibaba+1.152
ific+1.147
wrapper+1.131
ignt+0.984
Neg logit contributions
 ILCS-0.865
ARB-0.834
RAG-0.795
ESH-0.787
 Powers-0.767
Token bit
Token ID1643
Feature activation+7.46e-01

Pos logit contributions
yip+1.403
ibaba+1.284
ific+1.274
wrapper+1.258
ignt+1.079
Neg logit contributions
 ILCS-1.234
ARB-1.193
RAG-1.146
ESH-1.134
 Powers-1.113
Token.
Token ID13
Feature activation+3.79e-01

Pos logit contributions
yip+2.773
ibaba+2.577
ific+2.574
wrapper+2.532
ignt+2.220
Neg logit contributions
 ILCS-1.726
ARB-1.658
RAG-1.578
ESH-1.562
ONS-1.515
Token Prior
Token ID14481
Feature activation+1.94e+00

Pos logit contributions
yip+1.427
ibaba+1.324
ific+1.308
wrapper+1.296
ignt+1.142
Neg logit contributions
 ILCS-0.876
ARB-0.840
RAG-0.798
ESH-0.790
 Powers-0.772
Token to
Token ID284
Feature activation+1.46e-01

Pos logit contributions
yip+5.391
ific+5.285
ibaba+5.042
wrapper+4.957
uador+4.304
Neg logit contributions
 ILCS-5.920
ARB-5.606
RAG-5.417
ESH-5.354
ONS-5.166
Token voting
Token ID6709
Feature activation-7.40e-01

Pos logit contributions
yip+0.536
ibaba+0.499
ific+0.493
wrapper+0.488
ignt+0.427
Neg logit contributions
 ILCS-0.348
ARB-0.335
RAG-0.319
ESH-0.316
 Powers-0.309
Token to
Token ID284
Feature activation+7.95e-02

Pos logit contributions
 ILCS+1.680
ARB+1.632
RAG+1.578
ESH+1.550
 Powers+1.513
Neg logit contributions
yip-2.764
ibaba-2.574
ific-2.527
wrapper-2.497
ciplinary-2.212
Token repeal
Token ID14634
Feature activation+2.21e-01

Pos logit contributions
yip+0.268
ibaba+0.247
ific+0.245
wrapper+0.242
ignt+0.207
Neg logit contributions
 ILCS-0.213
ARB-0.206
RAG-0.198
ESH-0.197
 Powers-0.192
Token net
Token ID2010
Feature activation-6.45e-01

Pos logit contributions
yip+0.827
ibaba+0.769
ific+0.761
wrapper+0.753
ciplinary+0.658
Neg logit contributions
 ILCS-0.513
ARB-0.495
RAG-0.468
ESH-0.464
Wikipedia-0.450
Token a
Token ID257
Feature activation-2.80e-01

Pos logit contributions
 ILCS+2.405
ARB+2.341
ESH+2.247
RAG+2.221
Wikipedia+2.202
Neg logit contributions
yip-3.845
ific-3.569
ibaba-3.525
wrapper-3.520
ignt-3.086
Token "
Token ID366
Feature activation+3.54e-01

Pos logit contributions
 ILCS+0.661
ARB+0.635
RAG+0.607
ESH+0.604
Wikipedia+0.592
Neg logit contributions
yip-1.018
ibaba-0.946
ific-0.944
wrapper-0.931
ignt-0.821
TokenMan
Token ID5124
Feature activation+5.40e-01

Pos logit contributions
yip+1.155
ibaba+1.061
ific+1.059
wrapper+1.040
ignt+0.891
Neg logit contributions
 ILCS-0.982
ARB-0.949
RAG-0.911
ESH-0.903
ONS-0.881
Tokenhun
Token ID20088
Feature activation-7.24e-01

Pos logit contributions
yip+1.804
ibaba+1.662
ific+1.659
wrapper+1.630
ignt+1.404
Neg logit contributions
 ILCS-1.453
ARB-1.403
RAG-1.345
ESH-1.333
ONS-1.299
Tokenting
Token ID889
Feature activation-5.95e-01

Pos logit contributions
 ILCS+2.187
ARB+2.120
ESH+2.079
RAG+2.073
Wikipedia+2.029
Neg logit contributions
yip-2.094
ific-1.889
ibaba-1.887
wrapper-1.881
ciplinary-1.578
Token Timeline
Token ID43040
Feature activation-2.41e+00

Pos logit contributions
 ILCS+1.398
ARB+1.332
RAG+1.280
ESH+1.272
 Powers+1.248
Neg logit contributions
yip-2.185
ific-2.037
ibaba-2.028
wrapper-2.005
ignt-1.757
Token"
Token ID1
Feature activation+4.39e-01

Pos logit contributions
 ILCS+3.591
ARB+3.452
 Powers+3.286
RAG+3.185
Wikipedia+3.149
Neg logit contributions
yip-10.227
ibaba-9.712
wrapper-9.562
ific-9.549
ignt-8.602
Token in
Token ID287
Feature activation-5.73e-02

Pos logit contributions
yip+1.526
ibaba+1.411
ific+1.409
wrapper+1.384
ignt+1.200
Neg logit contributions
 ILCS-1.123
ARB-1.083
RAG-1.035
ESH-1.026
ONS-0.999
Token 2010
Token ID3050
Feature activation-1.62e-01

Pos logit contributions
 ILCS+0.169
ARB+0.165
RAG+0.158
ESH+0.157
 Powers+0.157
Neg logit contributions
yip-0.175
ibaba-0.162
ific-0.159
wrapper-0.157
ignt-0.133
Token by
Token ID416
Feature activation+8.28e-02

Pos logit contributions
 ILCS+0.367
ARB+0.351
RAG+0.334
ESH+0.331
 Powers+0.322
Neg logit contributions
yip-0.613
ibaba-0.572
ific-0.566
wrapper-0.560
ignt-0.492
Token US
Token ID1294
Feature activation+1.95e-01

Pos logit contributions
yip+0.297
ibaba+0.277
ific+0.275
wrapper+0.271
ignt+0.236
Neg logit contributions
 ILCS-0.201
ARB-0.195
RAG-0.186
ESH-0.185
 Powers-0.181
Token charges
Token ID4530
Feature activation-6.26e-01

Pos logit contributions
 ILCS+1.052
ARB+1.007
RAG+0.947
ESH+0.938
 Powers+0.917
Neg logit contributions
yip-1.951
ibaba-1.806
ific-1.789
wrapper-1.778
ignt-1.573
Token as
Token ID355
Feature activation+6.04e-02

Pos logit contributions
 ILCS+1.371
ARB+1.320
RAG+1.242
ESH+1.234
 Powers+1.223
Neg logit contributions
yip-2.410
ibaba-2.241
ific-2.205
wrapper-2.192
ignt-1.927
Token a
Token ID257
Feature activation+2.54e-01

Pos logit contributions
yip+0.216
ibaba+0.200
ific+0.199
wrapper+0.196
ignt+0.170
Neg logit contributions
 ILCS-0.150
ARB-0.145
RAG-0.137
ESH-0.136
ONS-0.133
Token function
Token ID2163
Feature activation-3.25e-01

Pos logit contributions
yip+0.801
ibaba+0.736
ific+0.731
wrapper+0.716
ignt+0.611
Neg logit contributions
 ILCS-0.738
ARB-0.712
RAG-0.686
ESH-0.679
 Powers-0.664
Token of
Token ID286
Feature activation-2.99e-01

Pos logit contributions
 ILCS+0.941
ARB+0.912
RAG+0.889
ESH+0.883
ONS+0.864
Neg logit contributions
yip-1.012
ific-0.935
ibaba-0.921
wrapper-0.903
ciplinary-0.773
Token prosecut
Token ID6627
Feature activation-2.32e+00

Pos logit contributions
 ILCS+0.685
ARB+0.656
ESH+0.618
RAG+0.617
 Powers+0.604
Neg logit contributions
yip-1.124
ibaba-1.045
ific-1.041
wrapper-1.025
ignt-0.903
Tokenorial
Token ID5132
Feature activation-9.31e-01

Pos logit contributions
 ILCS+4.914
ARB+4.741
 Avery+4.613
ESH+4.552
ONS+4.544
Neg logit contributions
yip-8.557
ibaba-7.756
wrapper-7.624
ific-7.567
uador-6.643
Token discretion
Token ID14130
Feature activation-1.93e-01

Pos logit contributions
 ILCS+1.292
ARB+1.197
 Powers+1.090
ESH+1.081
RAG+1.064
Neg logit contributions
yip-4.374
ibaba-4.103
ific-4.024
wrapper-4.011
ignt-3.629
Token should
Token ID815
Feature activation+1.36e-02

Pos logit contributions
 ILCS+0.455
ARB+0.437
RAG+0.413
ESH+0.410
 Powers+0.402
Neg logit contributions
yip-0.710
ibaba-0.659
ific-0.655
wrapper-0.646
ignt-0.565
Token not
Token ID407
Feature activation-2.91e-01

Pos logit contributions
yip+0.044
ibaba+0.040
ific+0.040
wrapper+0.039
ignt+0.034
Neg logit contributions
 ILCS-0.038
ARB-0.037
RAG-0.035
ESH-0.035
 Powers-0.034
Token be
Token ID307
Feature activation-2.28e-01

Pos logit contributions
 ILCS+0.799
ARB+0.771
RAG+0.737
ESH+0.732
 Powers+0.716
Neg logit contributions
yip-0.948
ibaba-0.881
ific-0.869
wrapper-0.857
ignt-0.735
Token mism
Token ID32691
Feature activation+4.67e-01

Pos logit contributions
 ILCS+0.010
ARB+0.010
RAG+0.009
ESH+0.009
 Powers+0.009
Neg logit contributions
yip-0.016
ibaba-0.015
ific-0.015
wrapper-0.015
ignt-0.013
Tokenatched
Token ID14265
Feature activation-2.72e-01

Pos logit contributions
yip+1.155
ibaba+1.032
ific+1.030
wrapper+1.004
ignt+0.809
Neg logit contributions
 ILCS-1.664
ARB-1.621
RAG-1.571
ESH-1.561
ONS-1.532
Token heroes
Token ID10281
Feature activation-9.54e-01

Pos logit contributions
 ILCS+0.612
ARB+0.585
RAG+0.558
ESH+0.552
 Powers+0.549
Neg logit contributions
yip-1.036
ibaba-0.963
ific-0.949
wrapper-0.941
ignt-0.829
Token,
Token ID11
Feature activation-8.44e-02

Pos logit contributions
 ILCS+2.007
ARB+1.926
RAG+1.841
ESH+1.822
 Powers+1.811
Neg logit contributions
yip-3.724
ibaba-3.468
ific-3.400
wrapper-3.395
uador-2.971
Token Jack
Token ID3619
Feature activation+5.75e-01

Pos logit contributions
 ILCS+0.199
ARB+0.193
RAG+0.184
ESH+0.182
 Powers+0.178
Neg logit contributions
yip-0.311
ibaba-0.288
ific-0.285
wrapper-0.282
ignt-0.246
Token Aub
Token ID19723
Feature activation-2.31e+00

Pos logit contributions
yip+2.085
ibaba+1.937
ific+1.926
wrapper+1.901
ignt+1.658
Neg logit contributions
 ILCS-1.385
ARB-1.337
RAG-1.269
ESH-1.259
 Powers-1.233
Tokenrey
Token ID4364
Feature activation-2.34e-01

Pos logit contributions
ARB+3.795
 Powers+3.679
 ILCS+3.617
RAG+3.571
 Avery+3.570
Neg logit contributions
yip-9.176
ibaba-8.590
wrapper-8.529
ific-8.253
ciplinary-7.580
Token,
Token ID11
Feature activation+3.78e-01

Pos logit contributions
 ILCS+0.570
ARB+0.549
RAG+0.524
ESH+0.518
 Powers+0.504
Neg logit contributions
yip-0.840
ibaba-0.779
ific-0.777
wrapper-0.765
ignt-0.667
Token a
Token ID257
Feature activation+3.24e-01

Pos logit contributions
yip+1.319
ific+1.221
ibaba+1.220
wrapper+1.197
ignt+1.041
Neg logit contributions
 ILCS-0.960
ARB-0.924
RAG-0.884
ESH-0.875
Wikipedia-0.851
Token British
Token ID3517
Feature activation-2.19e-01

Pos logit contributions
yip+1.167
ibaba+1.081
ific+1.075
wrapper+1.057
ignt+0.924
Neg logit contributions
 ILCS-0.789
ARB-0.760
RAG-0.726
ESH-0.720
 Powers-0.702
Token naval
Token ID19014
Feature activation-3.38e-01

Pos logit contributions
 ILCS+0.531
ARB+0.508
RAG+0.484
ESH+0.482
 Powers+0.472
Neg logit contributions
yip-0.798
ibaba-0.738
ific-0.732
wrapper-0.724
ignt-0.630
Token Doctor
Token ID9356
Feature activation+3.40e-01

Pos logit contributions
yip+0.188
ibaba+0.175
ific+0.172
wrapper+0.171
ciplinary+0.149
Neg logit contributions
 ILCS-0.129
ARB-0.124
RAG-0.119
ESH-0.118
 Powers-0.117
Token),
Token ID828
Feature activation+1.87e-01

Pos logit contributions
yip+1.263
ific+1.177
ibaba+1.174
wrapper+1.154
ignt+1.015
Neg logit contributions
 ILCS-0.786
ARB-0.755
RAG-0.719
ESH-0.710
Wikipedia-0.688
Token around
Token ID1088
Feature activation+1.10e+00

Pos logit contributions
yip+0.691
ibaba+0.642
ific+0.638
wrapper+0.630
ignt+0.550
Neg logit contributions
 ILCS-0.441
ARB-0.425
RAG-0.405
ESH-0.401
 Powers-0.391
Token the
Token ID262
Feature activation-2.02e-01

Pos logit contributions
yip+3.775
ific+3.495
ibaba+3.479
wrapper+3.422
iang+2.968
Neg logit contributions
 ILCS-2.819
ARB-2.712
RAG-2.599
ESH-2.563
Wikipedia-2.496
Token fifth
Token ID8150
Feature activation-2.38e-01

Pos logit contributions
 ILCS+0.505
ARB+0.484
RAG+0.464
ESH+0.460
 Powers+0.455
Neg logit contributions
yip-0.713
ibaba-0.662
ific-0.655
wrapper-0.643
ignt-0.563
Token episode
Token ID4471
Feature activation-2.14e+00

Pos logit contributions
 ILCS+0.595
ARB+0.569
RAG+0.545
ESH+0.539
 Powers+0.530
Neg logit contributions
yip-0.846
ibaba-0.787
ific-0.776
wrapper-0.770
ignt-0.669
Token in
Token ID287
Feature activation-4.40e-01

Pos logit contributions
 ILCS+4.087
ARB+3.987
 Avery+3.840
 Powers+3.828
ESH+3.819
Neg logit contributions
yip-8.363
ibaba-7.861
wrapper-7.628
ific-7.568
ignt-6.819
Token,
Token ID11
Feature activation-2.65e-02

Pos logit contributions
 ILCS+1.065
ARB+1.035
RAG+0.984
ESH+0.974
 Powers+0.963
Neg logit contributions
yip-1.582
ibaba-1.462
ific-1.449
wrapper-1.439
ignt-1.248
Token when
Token ID618
Feature activation-3.95e-02

Pos logit contributions
 ILCS+0.064
ARB+0.062
RAG+0.059
ESH+0.059
 Powers+0.058
Neg logit contributions
yip-0.095
ibaba-0.089
ific-0.088
wrapper-0.087
ignt-0.076
Token we
Token ID356
Feature activation-2.46e-01

Pos logit contributions
 ILCS+0.082
ARB+0.079
RAG+0.075
ESH+0.074
 Powers+0.071
Neg logit contributions
yip-0.156
ibaba-0.146
ific-0.145
wrapper-0.143
ignt-0.126
Token were
Token ID547
Feature activation+4.60e-02

Pos logit contributions
 ILCS+0.619
ARB+0.602
RAG+0.572
ESH+0.572
 Powers+0.562
Neg logit contributions
yip-0.862
ibaba-0.796
ific-0.787
wrapper-0.782
ignt-0.676
Token 18
Token ID1248
Feature activation-3.87e-01

Pos logit contributions
 ILCS+0.174
ARB+0.167
RAG+0.160
ESH+0.159
 Powers+0.156
Neg logit contributions
yip-0.238
ibaba-0.220
ific-0.217
wrapper-0.215
ignt-0.187
Token.
Token ID13
Feature activation-5.23e-02

Pos logit contributions
 ILCS+0.867
ARB+0.834
RAG+0.794
ESH+0.784
ONS+0.761
Neg logit contributions
yip-1.463
ibaba-1.364
ific-1.362
wrapper-1.339
ignt-1.177
Token7
Token ID22
Feature activation+3.76e-01

Pos logit contributions
 ILCS+0.124
ARB+0.118
RAG+0.113
ESH+0.111
Wikipedia+0.108
Neg logit contributions
yip-0.192
ific-0.179
ibaba-0.178
wrapper-0.176
ignt-0.154
Token 10
Token ID838
Feature activation-6.16e-01

Pos logit contributions
yip+1.202
ific+1.121
ibaba+1.120
wrapper+1.080
iang+0.945
Neg logit contributions
 ILCS-1.043
ARB-1.021
RAG-0.982
ESH-0.981
Leary-0.947
Token United
Token ID1578
Feature activation-6.33e-01

Pos logit contributions
 ILCS+1.627
ARB+1.547
RAG+1.488
 Powers+1.482
ESH+1.480
Neg logit contributions
yip-2.142
ibaba-1.961
ific-1.929
wrapper-1.907
ignt-1.675
Token Arab
Token ID4498
Feature activation-2.14e+00

Pos logit contributions
 ILCS+1.121
ARB+1.061
RAG+0.993
ESH+0.980
 Powers+0.946
Neg logit contributions
yip-2.690
ibaba-2.525
ific-2.521
wrapper-2.489
ignt-2.222
Token Emirates
Token ID24880
Feature activation+6.76e-02

Pos logit contributions
 ILCS+3.399
 Powers+3.296
ARB+3.059
RAG+2.949
 Avery+2.896
Neg logit contributions
yip-9.367
ibaba-8.535
wrapper-8.404
ific-8.366
ciplinary-7.565
Token 19
Token ID678
Feature activation+6.57e-02

Pos logit contributions
yip+0.238
ibaba+0.219
ific+0.216
wrapper+0.214
ignt+0.187
Neg logit contributions
 ILCS-0.173
ARB-0.166
RAG-0.159
ESH-0.158
 Powers-0.155
Token
Token ID198
Feature activation+4.03e-01

Pos logit contributions
yip+0.243
ific+0.229
ibaba+0.228
wrapper+0.224
iang+0.198
Neg logit contributions
 ILCS-0.151
ARB-0.146
RAG-0.139
ESH-0.137
Wikipedia-0.134
Token
Token ID198
Feature activation+4.91e-01

Pos logit contributions
yip+1.679
ibaba+1.565
ific+1.548
wrapper+1.530
ignt+1.372
Neg logit contributions
 ILCS-0.770
ARB-0.739
RAG-0.693
ESH-0.688
Wikipedia-0.664
TokenHaving
Token ID14698
Feature activation+4.75e-01

Pos logit contributions
yip+1.719
ibaba+1.586
ific+1.582
wrapper+1.554
ignt+1.353
Neg logit contributions
 ILCS-1.248
ARB-1.208
RAG-1.156
ESH-1.144
Wikipedia-1.119
Token the
Token ID262
Feature activation-1.42e-01

Pos logit contributions
yip+0.562
ibaba+0.519
ific+0.517
wrapper+0.508
ignt+0.438
Neg logit contributions
 ILCS-0.446
ARB-0.431
RAG-0.413
ESH-0.409
ONS-0.399
Token '
Token ID705
Feature activation+8.75e-02

Pos logit contributions
 ILCS+0.365
ARB+0.352
RAG+0.336
ESH+0.332
 Powers+0.328
Neg logit contributions
yip-0.496
ibaba-0.459
ific-0.454
wrapper-0.446
ignt-0.388
Token2
Token ID17
Feature activation-2.56e-01

Pos logit contributions
yip+0.290
ibaba+0.268
ific+0.266
wrapper+0.260
ignt+0.225
Neg logit contributions
 ILCS-0.237
ARB-0.230
RAG-0.221
ESH-0.218
Wikipedia-0.214
Token,
Token ID11
Feature activation-4.35e-01

Pos logit contributions
 ILCS+0.594
ARB+0.570
RAG+0.542
ESH+0.535
 Powers+0.524
Neg logit contributions
yip-0.951
ibaba-0.886
ific-0.882
wrapper-0.869
ciplinary-0.763
Token315
Token ID27936
Feature activation-5.13e-01

Pos logit contributions
 ILCS+1.023
ARB+0.988
RAG+0.938
ESH+0.925
 Powers+0.903
Neg logit contributions
yip-1.590
ibaba-1.485
ific-1.469
wrapper-1.452
ciplinary-1.270
Token polling
Token ID13985
Feature activation-2.12e+00

Pos logit contributions
 ILCS+1.227
ARB+1.173
RAG+1.117
ESH+1.102
 Powers+1.080
Neg logit contributions
yip-1.870
ibaba-1.745
ific-1.729
wrapper-1.702
ignt-1.486
Token stations
Token ID8985
Feature activation+4.76e-02

Pos logit contributions
 ILCS+4.424
ARB+4.237
ESH+4.059
 Powers+4.045
RAG+3.980
Neg logit contributions
yip-8.060
ibaba-7.539
ific-7.262
wrapper-7.186
uador-6.478
Token all
Token ID477
Feature activation-1.63e-01

Pos logit contributions
yip+0.175
ibaba+0.163
ific+0.162
wrapper+0.159
ignt+0.140
Neg logit contributions
 ILCS-0.112
ARB-0.108
RAG-0.103
ESH-0.102
ONS-0.099
Token over
Token ID625
Feature activation+2.64e-01

Pos logit contributions
 ILCS+0.396
ARB+0.387
RAG+0.368
ESH+0.362
 Powers+0.358
Neg logit contributions
yip-0.594
ibaba-0.547
wrapper-0.531
ific-0.530
ignt-0.461
Token the
Token ID262
Feature activation-2.10e-01

Pos logit contributions
yip+0.816
ific+0.748
ibaba+0.745
wrapper+0.733
ignt+0.621
Neg logit contributions
 ILCS-0.771
ARB-0.747
RAG-0.718
ESH-0.718
ONS-0.704
Token region
Token ID3814
Feature activation-4.89e-01

Pos logit contributions
 ILCS+0.531
ARB+0.512
RAG+0.489
ESH+0.482
 Powers+0.475
Neg logit contributions
yip-0.735
ibaba-0.683
ific-0.676
wrapper-0.667
ignt-0.583
Token oppression
Token ID18345
Feature activation+5.42e-02

Pos logit contributions
yip+0.510
ibaba+0.473
ific+0.468
wrapper+0.462
ignt+0.403
Neg logit contributions
 ILCS-0.357
ARB-0.344
RAG-0.331
ESH-0.326
 Powers-0.318
Token,
Token ID11
Feature activation-3.40e-01

Pos logit contributions
yip+0.206
ibaba+0.192
ific+0.192
wrapper+0.189
ignt+0.166
Neg logit contributions
 ILCS-0.121
ARB-0.116
RAG-0.110
ESH-0.109
ONS-0.106
Token humanity
Token ID9265
Feature activation-1.00e-01

Pos logit contributions
 ILCS+0.806
ARB+0.774
RAG+0.743
ESH+0.730
 Powers+0.721
Neg logit contributions
yip-1.239
ibaba-1.145
ific-1.140
wrapper-1.132
ignt-0.985
Token was
Token ID373
Feature activation+1.51e-01

Pos logit contributions
 ILCS+0.237
ARB+0.226
RAG+0.215
ESH+0.214
 Powers+0.209
Neg logit contributions
yip-0.369
ibaba-0.342
ific-0.339
wrapper-0.336
ignt-0.294
Token at
Token ID379
Feature activation-1.25e+00

Pos logit contributions
yip+0.575
ibaba+0.533
ific+0.528
wrapper+0.524
ignt+0.460
Neg logit contributions
 ILCS-0.336
ARB-0.324
RAG-0.309
ESH-0.304
ONS-0.297
Token last
Token ID938
Feature activation-2.10e+00

Pos logit contributions
 ILCS+2.941
ARB+2.914
RAG+2.772
 Avery+2.682
ESH+2.674
Neg logit contributions
yip-4.475
ibaba-4.127
wrapper-4.101
ific-4.083
ciplinary-3.565
Token free
Token ID1479
Feature activation-6.68e-01

Pos logit contributions
 ILCS+4.456
ARB+4.392
RAG+4.229
ESH+4.162
ONS+4.143
Neg logit contributions
yip-7.826
ibaba-7.044
wrapper-7.025
ific-6.900
ciplinary-6.204
Token from
Token ID422
Feature activation+9.09e-02

Pos logit contributions
 ILCS+1.648
ARB+1.579
RAG+1.528
ESH+1.513
 Powers+1.471
Neg logit contributions
yip-2.369
ibaba-2.198
ific-2.159
wrapper-2.145
ignt-1.876
Token the
Token ID262
Feature activation+1.29e-01

Pos logit contributions
yip+0.325
ibaba+0.299
ific+0.298
wrapper+0.294
ignt+0.256
Neg logit contributions
 ILCS-0.224
ARB-0.216
RAG-0.208
ESH-0.205
 Powers-0.200
Token cruel
Token ID12177
Feature activation-6.24e-01

Pos logit contributions
yip+0.453
ibaba+0.418
ific+0.414
wrapper+0.408
ignt+0.355
Neg logit contributions
 ILCS-0.327
ARB-0.315
RAG-0.302
ESH-0.297
 Powers-0.293
Token tyranny
Token ID29977
Feature activation+3.31e-01

Pos logit contributions
 ILCS+1.549
ARB+1.496
RAG+1.427
ESH+1.406
 Powers+1.402
Neg logit contributions
yip-2.223
ibaba-2.031
ific-1.988
wrapper-1.981
ciplinary-1.730
Token my
Token ID616
Feature activation+9.75e-02

Pos logit contributions
 ILCS+0.848
ARB+0.816
RAG+0.786
ESH+0.782
 Powers+0.762
Neg logit contributions
yip-1.250
ibaba-1.155
ific-1.131
wrapper-1.122
ignt-0.987
Token New
Token ID968
Feature activation+4.13e-01

Pos logit contributions
yip+0.345
ibaba+0.318
ific+0.313
wrapper+0.309
ignt+0.271
Neg logit contributions
 ILCS-0.246
ARB-0.236
RAG-0.227
ESH-0.225
ONS-0.220
Token York
Token ID1971
Feature activation+3.72e-01

Pos logit contributions
yip+1.391
ibaba+1.283
ific+1.279
wrapper+1.257
ignt+1.086
Neg logit contributions
 ILCS-1.100
ARB-1.061
RAG-1.018
ESH-1.010
ONS-0.984
Token adventures
Token ID17545
Feature activation-6.48e-01

Pos logit contributions
yip+1.238
ibaba+1.140
ific+1.136
wrapper+1.116
ignt+0.961
Neg logit contributions
 ILCS-1.011
ARB-0.976
RAG-0.936
ESH-0.930
 Powers-0.906
Token on
Token ID319
Feature activation-6.39e-02

Pos logit contributions
 ILCS+1.392
ARB+1.308
ESH+1.275
RAG+1.262
ONS+1.232
Neg logit contributions
yip-2.525
ibaba-2.331
ific-2.279
wrapper-2.260
ignt-2.027
Token Snapchat
Token ID31546
Feature activation-2.08e+00

Pos logit contributions
 ILCS+0.154
ARB+0.148
RAG+0.141
ESH+0.141
 Powers+0.137
Neg logit contributions
yip-0.233
ibaba-0.215
ific-0.213
wrapper-0.210
ignt-0.185
Token today
Token ID1909
Feature activation+5.04e-01

Pos logit contributions
 ILCS+4.659
ARB+4.321
ESH+4.195
 Powers+4.136
RAG+4.132
Neg logit contributions
yip-7.658
ibaba-7.064
wrapper-6.832
ific-6.810
ignt-6.161
Token (
Token ID357
Feature activation-5.13e-01

Pos logit contributions
yip+1.880
ibaba+1.747
ific+1.744
wrapper+1.716
ignt+1.506
Neg logit contributions
 ILCS-1.159
ARB-1.113
RAG-1.059
ESH-1.048
ONS-1.017
TokenI
Token ID40
Feature activation-3.27e-01

Pos logit contributions
 ILCS+1.304
ARB+1.285
ESH+1.233
RAG+1.226
Wikipedia+1.208
Neg logit contributions
yip-1.764
ibaba-1.643
ific-1.600
wrapper-1.556
ignt-1.390
Token
Token ID447
Feature activation-2.56e-01

Pos logit contributions
 ILCS+0.810
ARB+0.786
RAG+0.749
ESH+0.746
ONS+0.731
Neg logit contributions
yip-1.166
ibaba-1.076
ific-1.064
wrapper-1.050
ignt-0.921
Token
Token ID247
Feature activation-6.58e-02

Pos logit contributions
 ILCS+0.464
ARB+0.444
RAG+0.423
ESH+0.409
ONS+0.395
Neg logit contributions
yip-1.090
ibaba-1.014
ific-1.006
wrapper-0.985
ignt-0.887
Token a
Token ID257
Feature activation-7.73e-02

Pos logit contributions
 ILCS+1.606
ARB+1.555
RAG+1.489
ESH+1.465
 Powers+1.450
Neg logit contributions
yip-2.315
ibaba-2.129
ific-2.118
wrapper-2.084
ignt-1.827
Token Jewish
Token ID5582
Feature activation-7.16e-01

Pos logit contributions
 ILCS+0.179
ARB+0.172
RAG+0.165
ESH+0.162
 Powers+0.160
Neg logit contributions
yip-0.287
ibaba-0.267
ific-0.265
wrapper-0.260
ignt-0.231
Token majority
Token ID3741
Feature activation-3.90e-01

Pos logit contributions
 ILCS+1.616
ARB+1.545
 Powers+1.456
RAG+1.455
ESH+1.445
Neg logit contributions
yip-2.690
ibaba-2.509
ific-2.459
wrapper-2.446
ignt-2.177
Token city
Token ID1748
Feature activation-1.54e-01

Pos logit contributions
 ILCS+0.966
ARB+0.934
RAG+0.893
ESH+0.882
 Powers+0.864
Neg logit contributions
yip-1.380
ibaba-1.290
ific-1.268
wrapper-1.255
ignt-1.091
Token;
Token ID26
Feature activation+7.50e-02

Pos logit contributions
 ILCS+0.350
ARB+0.336
RAG+0.319
ESH+0.316
Wikipedia+0.307
Neg logit contributions
yip-0.582
ibaba-0.542
ific-0.538
wrapper-0.531
ignt-0.467
Token thus
Token ID4145
Feature activation-2.07e+00

Pos logit contributions
yip+0.267
ibaba+0.247
ific+0.246
wrapper+0.243
ciplinary+0.211
Neg logit contributions
 ILCS-0.186
ARB-0.179
RAG-0.171
ESH-0.169
 Powers-0.165
Token,
Token ID11
Feature activation-3.74e-01

Pos logit contributions
 ILCS+4.294
ARB+4.115
 Powers+3.933
RAG+3.907
ESH+3.903
Neg logit contributions
yip-7.773
ibaba-7.231
ific-7.133
wrapper-7.102
ciplinary-6.350
Token Israel
Token ID2692
Feature activation-1.86e-01

Pos logit contributions
 ILCS+0.887
ARB+0.853
RAG+0.816
ESH+0.807
 Powers+0.792
Neg logit contributions
yip-1.369
ibaba-1.269
ific-1.259
wrapper-1.246
ciplinary-1.088
Token
Token ID447
Feature activation-5.08e-02

Pos logit contributions
 ILCS+0.445
ARB+0.425
RAG+0.405
ESH+0.401
 Powers+0.393
Neg logit contributions
yip-0.681
ibaba-0.627
ific-0.623
wrapper-0.617
ciplinary-0.536
Token
Token ID247
Feature activation-3.37e-01

Pos logit contributions
 ILCS+0.086
ARB+0.081
RAG+0.075
ESH+0.074
Wikipedia+0.071
Neg logit contributions
yip-0.226
ibaba-0.208
ific-0.206
wrapper-0.203
uador-0.182
Tokens
Token ID82
Feature activation-7.13e-02

Pos logit contributions
 ILCS+1.135
ARB+1.089
ESH+1.084
RAG+1.051
Wikipedia+1.051
Neg logit contributions
yip-0.925
ibaba-0.797
ific-0.763
wrapper-0.754
uador-0.636
Token the
Token ID262
Feature activation-1.97e-01

Pos logit contributions
yip+0.531
ibaba+0.489
ific+0.487
wrapper+0.480
ignt+0.419
Neg logit contributions
 ILCS-0.371
ARB-0.360
RAG-0.343
ESH-0.339
 Powers-0.334
Token April
Token ID3035
Feature activation-2.00e-01

Pos logit contributions
 ILCS+0.488
ARB+0.477
RAG+0.452
ESH+0.445
 Powers+0.444
Neg logit contributions
yip-0.704
ibaba-0.651
ific-0.644
wrapper-0.633
ignt-0.556
Token 1986
Token ID12113
Feature activation-1.75e-03

Pos logit contributions
 ILCS+0.463
ARB+0.443
RAG+0.422
ESH+0.421
 Powers+0.416
Neg logit contributions
yip-0.753
ibaba-0.699
ific-0.683
wrapper-0.680
ignt-0.600
Token launch
Token ID4219
Feature activation-6.18e-01

Pos logit contributions
 ILCS+0.005
ARB+0.004
RAG+0.004
ESH+0.004
 Powers+0.004
Neg logit contributions
yip-0.006
ibaba-0.006
ific-0.006
wrapper-0.005
ignt-0.005
Token vehicle
Token ID4038
Feature activation+9.88e-02

Pos logit contributions
 ILCS+1.403
ARB+1.375
RAG+1.286
ESH+1.279
ONS+1.271
Neg logit contributions
yip-2.309
ibaba-2.145
ific-2.099
wrapper-2.051
ignt-1.828
Token accident
Token ID5778
Feature activation-2.06e+00

Pos logit contributions
yip+0.339
ibaba+0.311
ific+0.310
wrapper+0.306
ignt+0.264
Neg logit contributions
 ILCS-0.257
ARB-0.250
RAG-0.239
ESH-0.237
ONS-0.231
Token,
Token ID11
Feature activation-3.85e-01

Pos logit contributions
 ILCS+4.081
ARB+4.014
RAG+3.781
 Powers+3.777
 Avery+3.737
Neg logit contributions
yip-8.063
ibaba-7.401
ific-7.282
wrapper-7.266
ignt-6.528
Token Per
Token ID2448
Feature activation-1.07e-01

Pos logit contributions
 ILCS+0.911
ARB+0.878
RAG+0.837
ESH+0.826
 Powers+0.812
Neg logit contributions
yip-1.421
ibaba-1.313
ific-1.303
wrapper-1.288
ignt-1.129
Tokenkin
Token ID5116
Feature activation+1.39e-01

Pos logit contributions
 ILCS+0.294
ARB+0.284
RAG+0.274
ESH+0.270
Wikipedia+0.264
Neg logit contributions
yip-0.354
ibaba-0.326
ific-0.322
wrapper-0.319
ignt-0.274
Token-
Token ID12
Feature activation+6.35e-02

Pos logit contributions
yip+0.494
ibaba+0.459
ific+0.453
wrapper+0.447
ignt+0.391
Neg logit contributions
 ILCS-0.344
ARB-0.331
RAG-0.315
ESH-0.315
ONS-0.308
TokenEl
Token ID9527
Feature activation+2.18e-02

Pos logit contributions
yip+0.196
ibaba+0.180
ific+0.179
wrapper+0.175
ignt+0.149
Neg logit contributions
 ILCS-0.187
ARB-0.182
RAG-0.175
ESH-0.174
Wikipedia-0.171
Token up
Token ID510
Feature activation-4.30e-01

Pos logit contributions
yip+0.271
ibaba+0.251
ific+0.248
wrapper+0.246
ignt+0.214
Neg logit contributions
 ILCS-0.192
ARB-0.185
RAG-0.177
ESH-0.176
 Powers-0.171
Token.
Token ID13
Feature activation-3.81e-01

Pos logit contributions
 ILCS+0.972
ARB+0.940
RAG+0.895
ESH+0.891
 Powers+0.869
Neg logit contributions
yip-1.612
ibaba-1.508
ific-1.481
wrapper-1.460
ciplinary-1.295
Token And
Token ID843
Feature activation-2.97e-01

Pos logit contributions
 ILCS+0.829
ARB+0.794
RAG+0.758
ESH+0.749
 Powers+0.748
Neg logit contributions
yip-1.496
ibaba-1.383
ific-1.357
wrapper-1.355
ciplinary-1.208
Token at
Token ID379
Feature activation-7.49e-01

Pos logit contributions
 ILCS+0.632
ARB+0.610
RAG+0.578
ESH+0.570
 Powers+0.562
Neg logit contributions
yip-1.160
ibaba-1.077
ific-1.069
wrapper-1.060
ignt-0.939
Token the
Token ID262
Feature activation-7.70e-01

Pos logit contributions
 ILCS+1.714
ARB+1.669
RAG+1.579
ESH+1.568
 Powers+1.545
Neg logit contributions
yip-2.783
ibaba-2.583
ific-2.548
wrapper-2.530
ciplinary-2.233
Token last
Token ID938
Feature activation-2.06e+00

Pos logit contributions
 ILCS+1.764
ARB+1.696
RAG+1.619
ESH+1.611
 Avery+1.606
Neg logit contributions
yip-2.855
ibaba-2.676
ific-2.635
wrapper-2.574
ignt-2.288
Token minute
Token ID5664
Feature activation+2.25e-02

Pos logit contributions
 ILCS+4.492
ARB+4.282
ESH+4.071
RAG+4.042
 Avery+4.042
Neg logit contributions
yip-7.740
ibaba-7.159
wrapper-6.858
ific-6.748
ciplinary-6.228
Token he
Token ID339
Feature activation-6.00e-01

Pos logit contributions
yip+0.092
ific+0.086
ibaba+0.086
wrapper+0.085
ignt+0.075
Neg logit contributions
 ILCS-0.044
ARB-0.042
RAG-0.039
ESH-0.039
ONS-0.037
Token called
Token ID1444
Feature activation-9.06e-01

Pos logit contributions
 ILCS+1.418
ARB+1.386
ESH+1.320
 Powers+1.319
 Avery+1.313
Neg logit contributions
yip-2.181
ibaba-2.019
ific-1.955
wrapper-1.955
ignt-1.726
Token and
Token ID290
Feature activation-4.83e-01

Pos logit contributions
 ILCS+2.219
ARB+2.203
ESH+2.089
RAG+2.086
 Powers+2.073
Neg logit contributions
yip-3.212
ibaba-2.969
ific-2.904
wrapper-2.896
ignt-2.547
Token said
Token ID531
Feature activation-4.64e-01

Pos logit contributions
 ILCS+1.382
ARB+1.357
ESH+1.300
RAG+1.292
 Powers+1.287
Neg logit contributions
yip-1.522
ibaba-1.392
ific-1.357
wrapper-1.350
ignt-1.167
Token for
Token ID329
Feature activation+4.08e-01

Pos logit contributions
yip+1.185
ific+1.108
ibaba+1.107
wrapper+1.091
ignt+0.966
Neg logit contributions
 ILCS-0.601
ARB-0.574
RAG-0.543
ESH-0.536
ONS-0.517
Token the
Token ID262
Feature activation+2.60e-01

Pos logit contributions
yip+1.436
ibaba+1.330
ific+1.323
wrapper+1.302
ignt+1.132
Neg logit contributions
 ILCS-1.030
ARB-0.993
RAG-0.947
ESH-0.940
 Powers-0.915
Token lower
Token ID2793
Feature activation+9.73e-01

Pos logit contributions
yip+0.948
ibaba+0.884
ific+0.876
wrapper+0.863
ignt+0.756
Neg logit contributions
 ILCS-0.622
ARB-0.597
RAG-0.567
ESH-0.563
 Powers-0.551
Token house
Token ID2156
Feature activation+6.67e-02

Pos logit contributions
yip+3.866
ific+3.628
ibaba+3.584
wrapper+3.536
iang+3.151
Neg logit contributions
 ILCS-1.952
ARB-1.876
RAG-1.792
ESH-1.757
Wikipedia-1.710
Token;
Token ID26
Feature activation+1.88e-01

Pos logit contributions
yip+0.272
ibaba+0.254
ific+0.251
wrapper+0.249
ignt+0.221
Neg logit contributions
 ILCS-0.131
ARB-0.126
RAG-0.118
ESH-0.116
 Powers-0.114
Token thus
Token ID4145
Feature activation-2.03e+00

Pos logit contributions
yip+0.679
ibaba+0.632
ific+0.625
wrapper+0.619
ciplinary+0.538
Neg logit contributions
 ILCS-0.459
ARB-0.443
RAG-0.421
ESH-0.416
 Powers-0.409
Token only
Token ID691
Feature activation-4.44e-01

Pos logit contributions
 ILCS+3.907
ARB+3.882
 Powers+3.672
RAG+3.576
ESH+3.562
Neg logit contributions
yip-7.740
ibaba-7.423
wrapper-7.212
ific-7.132
ciplinary-6.474
Token 40
Token ID2319
Feature activation+1.52e-01

Pos logit contributions
 ILCS+1.044
ARB+1.007
RAG+0.957
 Powers+0.952
ESH+0.951
Neg logit contributions
yip-1.623
ibaba-1.532
wrapper-1.489
ific-1.485
ciplinary-1.311
Token of
Token ID286
Feature activation-4.72e-01

Pos logit contributions
yip+0.520
ibaba+0.484
ific+0.479
wrapper+0.473
ignt+0.409
Neg logit contributions
 ILCS-0.396
ARB-0.381
RAG-0.364
ESH-0.361
 Powers-0.352
Token the
Token ID262
Feature activation-3.92e-01

Pos logit contributions
 ILCS+1.147
ARB+1.097
RAG+1.043
ESH+1.032
 Powers+1.017
Neg logit contributions
yip-1.698
ibaba-1.590
ific-1.561
wrapper-1.555
ciplinary-1.363
Token 76
Token ID8684
Feature activation-1.53e-03

Pos logit contributions
 ILCS+0.954
ARB+0.905
RAG+0.868
 Powers+0.858
ESH+0.857
Neg logit contributions
yip-1.408
ibaba-1.330
ific-1.298
wrapper-1.288
ciplinary-1.131
Token seen
Token ID1775
Feature activation+1.61e-01

Pos logit contributions
yip+3.094
ific+2.868
ibaba+2.867
wrapper+2.816
ignt+2.452
Neg logit contributions
 ILCS-2.167
ARB-2.084
RAG-1.989
ESH-1.971
ONS-1.916
Token the
Token ID262
Feature activation+1.82e-01

Pos logit contributions
yip+0.575
ibaba+0.531
ific+0.530
wrapper+0.522
ignt+0.454
Neg logit contributions
 ILCS-0.400
ARB-0.386
RAG-0.368
ESH-0.365
Wikipedia-0.355
Token trailer
Token ID12268
Feature activation-4.30e-01

Pos logit contributions
yip+0.674
ibaba+0.625
ific+0.624
wrapper+0.614
ignt+0.538
Neg logit contributions
 ILCS-0.425
ARB-0.408
RAG-0.388
ESH-0.384
ONS-0.373
Token for
Token ID329
Feature activation+6.63e-01

Pos logit contributions
 ILCS+0.977
ARB+0.949
RAG+0.900
ESH+0.893
 Powers+0.870
Neg logit contributions
yip-1.612
ibaba-1.490
ific-1.482
wrapper-1.465
ignt-1.292
Token David
Token ID3271
Feature activation+1.36e-01

Pos logit contributions
yip+2.370
ific+2.204
ibaba+2.202
wrapper+2.162
ignt+1.886
Neg logit contributions
 ILCS-1.618
ARB-1.552
RAG-1.482
ESH-1.463
ONS-1.427
Token F
Token ID376
Feature activation-2.03e+00

Pos logit contributions
yip+0.489
ibaba+0.451
ific+0.450
wrapper+0.443
ignt+0.385
Neg logit contributions
 ILCS-0.333
ARB-0.322
RAG-0.305
ESH-0.302
 Powers-0.298
Tokeninc
Token ID1939
Feature activation-1.68e-01

Pos logit contributions
ARB+3.351
 ILCS+3.281
RAG+3.153
 Avery+3.117
 Powers+3.101
Neg logit contributions
yip-8.523
ibaba-8.127
wrapper-7.828
ific-7.697
ignt-7.098
Tokenher
Token ID372
Feature activation-7.06e-01

Pos logit contributions
 ILCS+0.522
ARB+0.506
RAG+0.487
ESH+0.486
ONS+0.474
Neg logit contributions
yip-0.493
ibaba-0.444
ific-0.439
wrapper-0.436
ignt-0.367
Token
Token ID447
Feature activation+1.45e-01

Pos logit contributions
 ILCS+1.642
ARB+1.578
RAG+1.499
ESH+1.488
 Powers+1.470
Neg logit contributions
yip-2.630
ibaba-2.423
ific-2.386
wrapper-2.364
ignt-2.080
Token
Token ID247
Feature activation-2.48e-01

Pos logit contributions
yip+0.605
ibaba+0.562
ific+0.561
wrapper+0.553
ignt+0.492
Neg logit contributions
 ILCS-0.275
ARB-0.265
RAG-0.248
ESH-0.243
 Powers-0.235
Tokens
Token ID82
Feature activation-1.90e-01

Pos logit contributions
 ILCS+0.646
ARB+0.626
RAG+0.598
ESH+0.597
 Powers+0.585
Neg logit contributions
yip-0.845
ibaba-0.778
ific-0.770
wrapper-0.760
ignt-0.659
Token the
Token ID262
Feature activation-3.25e-01

Pos logit contributions
 ILCS+0.957
ARB+0.924
ESH+0.889
RAG+0.881
 Avery+0.875
Neg logit contributions
yip-1.687
ibaba-1.545
wrapper-1.494
ific-1.494
ignt-1.348
Token main
Token ID1388
Feature activation+1.40e-01

Pos logit contributions
 ILCS+0.620
ARB+0.586
ESH+0.565
 Avery+0.561
RAG+0.561
Neg logit contributions
yip-1.348
ibaba-1.246
ific-1.213
wrapper-1.197
ignt-1.098
Token story
Token ID1621
Feature activation-6.16e-01

Pos logit contributions
yip+0.465
ibaba+0.424
ific+0.411
wrapper+0.403
ignt+0.362
Neg logit contributions
 ILCS-0.388
ARB-0.374
ESH-0.361
RAG-0.360
 Powers-0.356
Token
Token ID198
Feature activation+1.71e-01

Pos logit contributions
 ILCS+1.030
ARB+0.979
ESH+0.896
RAG+0.894
 Powers+0.883
Neg logit contributions
yip-2.768
ibaba-2.565
ific-2.487
wrapper-2.474
ignt-2.295
Token
Token ID198
Feature activation-1.10e-01

Pos logit contributions
yip+0.728
ibaba+0.678
ific+0.665
wrapper+0.660
ignt+0.595
Neg logit contributions
 ILCS-0.318
ARB-0.305
RAG-0.285
ESH-0.284
Wikipedia-0.274
TokenThus
Token ID19093
Feature activation-2.03e+00

Pos logit contributions
 ILCS+0.272
ARB+0.267
RAG+0.255
ESH+0.251
ONS+0.246
Neg logit contributions
yip-0.403
ibaba-0.371
ific-0.364
wrapper-0.358
ciplinary-0.315
Token Massachusetts
Token ID10140
Feature activation-2.41e-01

Pos logit contributions
 ILCS+3.844
ARB+3.768
ESH+3.533
 Powers+3.524
RAG+3.501
Neg logit contributions
yip-8.145
ibaba-7.456
ific-7.389
wrapper-7.352
ciplinary-6.556
Token is
Token ID318
Feature activation-4.65e-01

Pos logit contributions
 ILCS+0.560
ARB+0.538
RAG+0.511
ESH+0.506
 Powers+0.492
Neg logit contributions
yip-0.901
ibaba-0.834
ific-0.830
wrapper-0.818
ignt-0.715
Token one
Token ID530
Feature activation-5.43e-01

Pos logit contributions
 ILCS+1.126
ARB+1.092
RAG+1.037
ESH+1.029
 Powers+1.005
Neg logit contributions
yip-1.686
ibaba-1.553
ific-1.524
wrapper-1.511
ignt-1.319
Token of
Token ID286
Feature activation-4.28e-01

Pos logit contributions
 ILCS+1.510
ARB+1.460
RAG+1.402
ESH+1.390
ONS+1.355
Neg logit contributions
yip-1.764
ibaba-1.622
ific-1.618
wrapper-1.589
ignt-1.363
Token the
Token ID262
Feature activation-4.56e-01

Pos logit contributions
 ILCS+1.074
ARB+1.038
RAG+0.993
ESH+0.982
 Powers+0.960
Neg logit contributions
yip-1.504
ibaba-1.394
ific-1.375
wrapper-1.362
ignt-1.184
Token:
Token ID25
Feature activation-2.95e-01

Pos logit contributions
yip+0.666
ibaba+0.598
ific+0.594
wrapper+0.581
ignt+0.473
Neg logit contributions
 ILCS-0.934
ARB-0.915
RAG-0.889
ESH-0.879
ONS-0.863
Token
Token ID198
Feature activation-1.43e-01

Pos logit contributions
 ILCS+0.635
ARB+0.614
RAG+0.582
ESH+0.576
 Powers+0.562
Neg logit contributions
yip-1.141
ibaba-1.062
ific-1.059
wrapper-1.046
ciplinary-0.923
Token
Token ID198
Feature activation-1.00e-01

Pos logit contributions
 ILCS+0.249
ARB+0.241
ESH+0.225
RAG+0.222
Wikipedia+0.217
Neg logit contributions
yip-0.625
ibaba-0.582
ific-0.565
wrapper-0.562
ignt-0.513
Token13
Token ID1485
Feature activation-3.11e-01

Pos logit contributions
 ILCS+0.244
ARB+0.242
RAG+0.230
ESH+0.228
Wikipedia+0.225
Neg logit contributions
yip-0.361
ibaba-0.336
ific-0.330
wrapper-0.324
ignt-0.285
Token PH
Token ID9370
Feature activation+2.21e-01

Pos logit contributions
 ILCS+0.727
ARB+0.699
RAG+0.664
ESH+0.656
 Powers+0.643
Neg logit contributions
yip-1.155
ibaba-1.071
ific-1.065
wrapper-1.049
ignt-0.922
TokenOTOS
Token ID33291
Feature activation-2.02e+00

Pos logit contributions
yip+1.070
ific+1.012
ibaba+1.011
wrapper+0.998
ignt+0.906
Neg logit contributions
 ILCS-0.261
ARB-0.240
RAG-0.217
ESH-0.211
Wikipedia-0.198
Token Donald
Token ID3759
Feature activation+6.90e-01

Pos logit contributions
ARB+4.454
 ILCS+4.218
 Powers+4.184
RAG+4.175
 Avery+4.139
Neg logit contributions
yip-7.612
ibaba-7.149
wrapper-6.866
ific-6.741
ciplinary-6.145
Token Trump
Token ID1301
Feature activation-1.26e-02

Pos logit contributions
yip+2.551
ibaba+2.370
ific+2.359
wrapper+2.324
ignt+2.037
Neg logit contributions
 ILCS-1.614
ARB-1.555
RAG-1.481
ESH-1.465
ONS-1.422
Token's
Token ID338
Feature activation-9.17e-01

Pos logit contributions
 ILCS+0.030
ARB+0.029
RAG+0.028
ESH+0.028
 Powers+0.027
Neg logit contributions
yip-0.046
ibaba-0.042
ific-0.042
wrapper-0.041
ignt-0.036
Token potential
Token ID2785
Feature activation-3.61e-01

Pos logit contributions
 ILCS+2.151
ARB+2.116
 Powers+2.031
RAG+2.022
ONS+1.997
Neg logit contributions
yip-3.329
ibaba-3.079
ific-2.987
wrapper-2.945
ignt-2.627
Token running
Token ID2491
Feature activation-3.46e-01

Pos logit contributions
 ILCS+0.940
ARB+0.933
RAG+0.893
 Powers+0.880
ESH+0.876
Neg logit contributions
yip-1.225
ibaba-1.123
ific-1.109
wrapper-1.095
ignt-0.962
TokenBest
Token ID13014
Feature activation-6.61e-01

Pos logit contributions
yip+5.144
ific+4.789
ibaba+4.752
wrapper+4.738
iang+4.093
Neg logit contributions
 ILCS-3.787
ARB-3.561
RAG-3.411
 Powers-3.390
ESH-3.384
Token Buy
Token ID11763
Feature activation+6.73e-01

Pos logit contributions
 ILCS+1.720
ARB+1.650
RAG+1.583
 Powers+1.556
ESH+1.553
Neg logit contributions
yip-2.264
ibaba-2.087
ific-2.049
wrapper-2.040
ignt-1.778
Token
Token ID198
Feature activation-1.87e-01

Pos logit contributions
yip+2.422
ific+2.252
ibaba+2.249
wrapper+2.213
ignt+1.926
Neg logit contributions
 ILCS-1.633
ARB-1.567
RAG-1.499
ESH-1.482
ONS-1.440
Token
Token ID198
Feature activation-2.01e-01

Pos logit contributions
 ILCS+0.324
ARB+0.317
ESH+0.292
RAG+0.291
Wikipedia+0.285
Neg logit contributions
yip-0.823
ibaba-0.763
ific-0.742
wrapper-0.738
ignt-0.674
TokenIs
Token ID3792
Feature activation-4.30e-01

Pos logit contributions
 ILCS+0.515
ARB+0.510
RAG+0.484
ESH+0.482
Wikipedia+0.476
Neg logit contributions
yip-0.697
ibaba-0.640
ific-0.633
wrapper-0.618
ignt-0.543
Token Microsoft
Token ID5413
Feature activation-2.01e+00

Pos logit contributions
 ILCS+1.007
ARB+0.973
RAG+0.928
ESH+0.914
Wikipedia+0.895
Neg logit contributions
yip-1.581
ific-1.456
ibaba-1.453
wrapper-1.435
ignt-1.259
Token a
Token ID257
Feature activation-8.42e-01

Pos logit contributions
 ILCS+4.445
ARB+4.374
 Powers+4.155
RAG+4.133
 Avery+4.105
Neg logit contributions
yip-7.388
ibaba-6.691
wrapper-6.589
ific-6.583
ignt-5.897
Token cooler
Token ID19346
Feature activation+3.01e-01

Pos logit contributions
 ILCS+2.048
ARB+1.945
RAG+1.909
 Powers+1.888
ESH+1.852
Neg logit contributions
yip-2.992
ibaba-2.740
ific-2.731
wrapper-2.680
ignt-2.377
Token company
Token ID1664
Feature activation-2.45e-01

Pos logit contributions
yip+1.089
ibaba+1.008
ific+1.003
wrapper+0.989
ignt+0.866
Neg logit contributions
 ILCS-0.728
ARB-0.701
RAG-0.669
ESH-0.661
 Powers-0.644
Token than
Token ID621
Feature activation-4.52e-01

Pos logit contributions
 ILCS+0.678
ARB+0.660
RAG+0.632
ESH+0.625
 Powers+0.615
Neg logit contributions
yip-0.798
ibaba-0.734
ific-0.724
wrapper-0.717
ignt-0.617
Token it
Token ID340
Feature activation-2.56e-01

Pos logit contributions
 ILCS+0.932
ARB+0.896
RAG+0.845
ESH+0.837
 Powers+0.826
Neg logit contributions
yip-1.793
ibaba-1.663
ific-1.660
wrapper-1.639
ciplinary-1.463
Token meaning
Token ID3616
Feature activation-5.50e-01

Pos logit contributions
yip+0.409
ibaba+0.377
ific+0.374
wrapper+0.368
ignt+0.321
Neg logit contributions
 ILCS-0.293
ARB-0.283
RAG-0.271
ESH-0.269
 Powers-0.265
Token he
Token ID339
Feature activation-3.08e-01

Pos logit contributions
 ILCS+1.107
ARB+1.065
RAG+1.005
ESH+1.002
 Powers+0.983
Neg logit contributions
yip-2.198
ibaba-2.062
ific-2.037
wrapper-2.010
ignt-1.791
Token can
Token ID460
Feature activation-1.04e-01

Pos logit contributions
 ILCS+0.827
ARB+0.800
RAG+0.767
ESH+0.767
 Powers+0.754
Neg logit contributions
yip-1.027
ibaba-0.941
ific-0.932
wrapper-0.919
ignt-0.787
Token lock
Token ID5793
Feature activation-3.42e-01

Pos logit contributions
 ILCS+0.290
ARB+0.281
RAG+0.270
ESH+0.269
 Powers+0.265
Neg logit contributions
yip-0.336
ibaba-0.308
ific-0.306
wrapper-0.301
iang-0.257
Token the
Token ID262
Feature activation-3.28e-01

Pos logit contributions
 ILCS+0.913
ARB+0.884
RAG+0.849
ESH+0.839
 Powers+0.827
Neg logit contributions
yip-1.145
ibaba-1.064
ific-1.051
wrapper-1.032
ciplinary-0.894
Token Quin
Token ID29338
Feature activation-2.01e+00

Pos logit contributions
 ILCS+0.814
ARB+0.786
RAG+0.753
ESH+0.744
 Powers+0.733
Neg logit contributions
yip-1.166
ibaba-1.078
ific-1.069
wrapper-1.050
ignt-0.920
Tokenjet
Token ID31173
Feature activation-6.20e-03

Pos logit contributions
 ILCS+6.008
RAG+5.742
ARB+5.629
 Powers+5.545
ESH+5.528
Neg logit contributions
yip-5.563
ibaba-5.121
wrapper-4.924
ific-4.906
ignt-4.230
Token or
Token ID393
Feature activation+9.35e-02

Pos logit contributions
 ILCS+0.016
ARB+0.016
RAG+0.015
ESH+0.015
 Powers+0.015
Neg logit contributions
yip-0.021
ibaba-0.020
ific-0.019
wrapper-0.019
ignt-0.017
Token Kr
Token ID13685
Feature activation+7.12e-01

Pos logit contributions
yip+0.320
ibaba+0.295
ific+0.293
wrapper+0.288
ignt+0.250
Neg logit contributions
 ILCS-0.245
ARB-0.237
RAG-0.227
ESH-0.225
 Powers-0.220
Tokenang
Token ID648
Feature activation+3.56e-01

Pos logit contributions
yip+2.192
ibaba+1.979
ific+1.961
wrapper+1.933
ignt+1.639
Neg logit contributions
 ILCS-2.121
ARB-2.068
RAG-1.991
ESH-1.981
ONS-1.932
Token down
Token ID866
Feature activation+7.83e-01

Pos logit contributions
yip+1.247
ibaba+1.156
ific+1.149
wrapper+1.130
ignt+0.983
Neg logit contributions
 ILCS-0.899
ARB-0.867
RAG-0.830
ESH-0.822
 Powers-0.801
TokenIsrael
Token ID14040
Feature activation-1.63e-01

Pos logit contributions
yip+0.675
ibaba+0.622
ific+0.619
wrapper+0.609
ignt+0.525
Neg logit contributions
 ILCS-0.534
ARB-0.516
RAG-0.496
ESH-0.492
Wikipedia-0.480
Tokenisation
Token ID5612
Feature activation-3.67e-01

Pos logit contributions
 ILCS+0.348
ARB+0.332
RAG+0.314
ESH+0.311
 Powers+0.302
Neg logit contributions
yip-0.635
ibaba-0.590
ific-0.587
wrapper-0.581
ignt-0.511
Token
Token ID447
Feature activation+3.51e-01

Pos logit contributions
 ILCS+0.737
ARB+0.692
RAG+0.659
ESH+0.648
 Powers+0.637
Neg logit contributions
yip-1.485
ibaba-1.383
ific-1.374
wrapper-1.355
iang-1.203
Token
Token ID251
Feature activation+9.46e-03

Pos logit contributions
yip+1.394
ific+1.284
ibaba+1.284
wrapper+1.271
ignt+1.119
Neg logit contributions
 ILCS-0.752
ARB-0.719
RAG-0.679
ESH-0.668
Wikipedia-0.648
Token of
Token ID286
Feature activation+1.24e-01

Pos logit contributions
yip+0.033
ibaba+0.030
ific+0.030
wrapper+0.029
ignt+0.026
Neg logit contributions
 ILCS-0.024
ARB-0.023
RAG-0.023
ESH-0.022
Wikipedia-0.022
Token Arab
Token ID4498
Feature activation-2.01e+00

Pos logit contributions
yip+0.451
ibaba+0.418
ific+0.418
wrapper+0.411
ignt+0.359
Neg logit contributions
 ILCS-0.298
ARB-0.286
RAG-0.273
ESH-0.270
ONS-0.262
Token land
Token ID1956
Feature activation-5.49e-01

Pos logit contributions
 ILCS+4.448
ARB+4.247
 Powers+4.089
ESH+4.050
RAG+3.980
Neg logit contributions
yip-7.413
ibaba-6.809
wrapper-6.755
ific-6.690
uador-5.960
Token serves
Token ID9179
Feature activation-1.96e-02

Pos logit contributions
 ILCS+1.294
ARB+1.235
RAG+1.183
ESH+1.175
 Powers+1.155
Neg logit contributions
yip-2.028
ibaba-1.864
ific-1.837
wrapper-1.823
uador-1.597
Token the
Token ID262
Feature activation-4.00e-01

Pos logit contributions
 ILCS+0.053
ARB+0.051
RAG+0.049
ESH+0.049
 Powers+0.047
Neg logit contributions
yip-0.065
ibaba-0.060
ific-0.060
wrapper-0.059
ignt-0.051
Token overall
Token ID4045
Feature activation+3.46e-01

Pos logit contributions
 ILCS+0.961
ARB+0.927
RAG+0.884
ESH+0.871
 Powers+0.870
Neg logit contributions
yip-1.454
ibaba-1.347
ific-1.334
wrapper-1.313
iang-1.149
Token Zionist
Token ID29390
Feature activation-3.03e-01

Pos logit contributions
yip+1.183
ibaba+1.092
ific+1.090
wrapper+1.071
ignt+0.926
Neg logit contributions
 ILCS-0.906
ARB-0.874
RAG-0.837
ESH-0.829
ONS-0.808
Token,
Token ID11
Feature activation-2.78e-01

Pos logit contributions
yip+0.873
ibaba+0.812
ific+0.809
wrapper+0.797
ignt+0.699
Neg logit contributions
 ILCS-0.537
ARB-0.516
RAG-0.491
ESH-0.486
ONS-0.472
Token it
Token ID340
Feature activation-1.29e-01

Pos logit contributions
 ILCS+0.655
ARB+0.629
RAG+0.600
ESH+0.594
 Powers+0.581
Neg logit contributions
yip-1.023
ibaba-0.947
ific-0.943
wrapper-0.934
ciplinary-0.815
Token moves
Token ID6100
Feature activation-2.79e-01

Pos logit contributions
 ILCS+0.279
ARB+0.267
RAG+0.253
ESH+0.252
 Powers+0.247
Neg logit contributions
yip-0.503
ibaba-0.467
ific-0.462
wrapper-0.457
ignt-0.403
Token a
Token ID257
Feature activation-4.09e-01

Pos logit contributions
 ILCS+0.683
ARB+0.657
RAG+0.630
ESH+0.627
 Powers+0.611
Neg logit contributions
yip-0.997
ibaba-0.926
ific-0.913
wrapper-0.904
ciplinary-0.790
Token plastic
Token ID7309
Feature activation+3.25e-03

Pos logit contributions
 ILCS+1.031
ARB+0.982
RAG+0.954
ESH+0.936
 Powers+0.913
Neg logit contributions
yip-1.438
ibaba-1.334
ific-1.317
wrapper-1.303
iang-1.133
Token tab
Token ID7400
Feature activation-1.99e+00

Pos logit contributions
yip+0.011
ibaba+0.010
ific+0.010
wrapper+0.010
ignt+0.009
Neg logit contributions
 ILCS-0.008
ARB-0.008
RAG-0.008
ESH-0.008
ONS-0.008
Token away
Token ID1497
Feature activation-5.88e-01

Pos logit contributions
 ILCS+4.584
ARB+4.467
RAG+4.256
ESH+4.216
ONS+4.175
Neg logit contributions
yip-6.875
ibaba-6.504
wrapper-6.297
ific-6.268
ciplinary-5.602
Token from
Token ID422
Feature activation-7.76e-02

Pos logit contributions
 ILCS+1.773
ARB+1.724
RAG+1.660
ESH+1.655
Wikipedia+1.619
Neg logit contributions
yip-1.751
ibaba-1.601
ific-1.583
wrapper-1.562
ciplinary-1.319
Token the
Token ID262
Feature activation-9.19e-02

Pos logit contributions
 ILCS+0.200
ARB+0.193
RAG+0.185
ESH+0.183
ONS+0.178
Neg logit contributions
yip-0.268
ibaba-0.247
ific-0.247
wrapper-0.243
ignt-0.210
Token shif
Token ID7143
Feature activation+1.34e-01

Pos logit contributions
 ILCS+0.224
ARB+0.215
RAG+0.205
ESH+0.204
 Powers+0.199
Neg logit contributions
yip-0.332
ibaba-0.307
ific-0.305
wrapper-0.301
ignt-0.262
Tokenter
Token ID353
Feature activation+3.28e-01

Pos logit contributions
yip+0.538
ibaba+0.502
ific+0.496
wrapper+0.487
uador+0.435
Neg logit contributions
 ILCS-0.269
ARB-0.256
ESH-0.242
RAG-0.242
 Avery-0.235
Token [
Token ID685
Feature activation-6.43e-01

Pos logit contributions
yip+1.262
ibaba+1.172
ific+1.169
wrapper+1.151
ignt+1.008
Neg logit contributions
 ILCS-0.808
ARB-0.777
RAG-0.740
ESH-0.733
ONS-0.711
TokenPlay
Token ID11002
Feature activation-5.80e-01

Pos logit contributions
ARB+1.618
 ILCS+1.578
RAG+1.532
ESH+1.525
Wikipedia+1.518
Neg logit contributions
yip-2.242
ibaba-2.084
ific-2.035
wrapper-1.978
ignt-1.771
Tokenboy
Token ID7081
Feature activation+1.43e+00

Pos logit contributions
ARB+1.356
 ILCS+1.339
RAG+1.280
ESH+1.263
Wikipedia+1.244
Neg logit contributions
yip-2.095
ibaba-1.947
wrapper-1.895
ific-1.864
ignt-1.665
Token CEO
Token ID6123
Feature activation-1.12e-01

Pos logit contributions
yip+5.133
ific+4.847
ibaba+4.766
wrapper+4.734
iang+4.134
Neg logit contributions
 ILCS-3.417
ARB-3.210
RAG-3.094
ESH-3.047
Wikipedia-2.961
Token Scott
Token ID4746
Feature activation-7.98e-01

Pos logit contributions
 ILCS+0.240
ARB+0.234
RAG+0.220
ESH+0.218
 Powers+0.216
Neg logit contributions
yip-0.436
ibaba-0.408
ific-0.402
wrapper-0.400
ignt-0.352
Token F
Token ID376
Feature activation-1.99e+00

Pos logit contributions
 ILCS+1.593
ARB+1.575
 Powers+1.540
 Avery+1.480
RAG+1.465
Neg logit contributions
yip-3.146
ibaba-2.935
wrapper-2.862
ific-2.853
ciplinary-2.533
Tokenlanders
Token ID32358
Feature activation-7.79e-01

Pos logit contributions
ARB+5.388
 ILCS+5.375
RAG+5.221
 Powers+5.143
ESH+5.117
Neg logit contributions
yip-6.274
ibaba-5.749
wrapper-5.470
ific-5.340
ignt-4.739
Token]
Token ID60
Feature activation+3.97e-02

Pos logit contributions
 ILCS+1.676
ARB+1.642
RAG+1.550
ESH+1.534
ONS+1.495
Neg logit contributions
yip-2.986
ibaba-2.793
ific-2.759
wrapper-2.726
ignt-2.398
Token said
Token ID531
Feature activation-2.01e-01

Pos logit contributions
yip+0.121
ibaba+0.111
ific+0.110
wrapper+0.109
ignt+0.092
Neg logit contributions
 ILCS-0.117
ARB-0.115
RAG-0.111
ESH-0.110
 Powers-0.108
Token,
Token ID11
Feature activation+9.27e-03

Pos logit contributions
 ILCS+0.451
ARB+0.439
RAG+0.419
ESH+0.414
 Powers+0.404
Neg logit contributions
yip-0.755
ibaba-0.704
ific-0.694
wrapper-0.687
ignt-0.602
Token "
Token ID366
Feature activation+8.09e-01

Pos logit contributions
yip+0.034
ibaba+0.032
ific+0.032
wrapper+0.031
ignt+0.028
Neg logit contributions
 ILCS-0.021
ARB-0.021
RAG-0.020
ESH-0.020
ONS-0.019
TokenDavid
Token ID11006
Feature activation-1.66e-02

Pos logit contributions
yip+1.866
ibaba+1.723
ific+1.722
wrapper+1.693
ignt+1.465
Neg logit contributions
 ILCS-1.398
ARB-1.346
RAG-1.288
ESH-1.276
ONS-1.243
Token Wise
Token ID28908
Feature activation-6.69e-01

Pos logit contributions
 ILCS+0.040
ARB+0.038
RAG+0.037
ESH+0.036
 Powers+0.036
Neg logit contributions
yip-0.060
ibaba-0.056
ific-0.055
wrapper-0.055
ignt-0.048
Token
Token ID198
Feature activation+6.97e-01

Pos logit contributions
 ILCS+1.323
ARB+1.268
RAG+1.195
ESH+1.178
 Powers+1.170
Neg logit contributions
yip-2.700
ibaba-2.516
ific-2.501
wrapper-2.462
ignt-2.195
Token
Token ID198
Feature activation+6.50e-01

Pos logit contributions
yip+2.816
ibaba+2.632
ific+2.626
wrapper+2.589
ignt+2.299
Neg logit contributions
 ILCS-1.390
ARB-1.327
RAG-1.251
ESH-1.237
ONS-1.193
Token(
Token ID7
Feature activation+6.11e-01

Pos logit contributions
yip+2.331
ific+2.187
wrapper+2.166
ibaba+2.162
ciplinary+1.873
Neg logit contributions
 ILCS-1.560
ARB-1.467
RAG-1.407
ESH-1.383
 Powers-1.374
TokenSN
Token ID15571
Feature activation+1.45e+00

Pos logit contributions
yip+2.329
ific+2.176
ibaba+2.168
wrapper+2.147
iang+1.882
Neg logit contributions
 ILCS-1.344
ARB-1.274
RAG-1.210
ESH-1.197
 Powers-1.163
TokenES
Token ID1546
Feature activation+1.18e+00

Pos logit contributions
yip+5.553
ific+5.242
ibaba+5.203
wrapper+5.146
iang+4.547
Neg logit contributions
 ILCS-3.030
ARB-2.872
RAG-2.744
ESH-2.666
 Powers-2.629
Token,
Token ID11
Feature activation-7.32e-02

Pos logit contributions
yip+4.156
ific+3.956
wrapper+3.917
ibaba+3.856
iang+3.408
Neg logit contributions
 ILCS-2.721
ARB-2.564
RAG-2.474
ESH-2.392
Leary-2.380
Token 1995
Token ID8735
Feature activation+7.89e-01

Pos logit contributions
 ILCS+0.182
ARB+0.175
RAG+0.167
ESH+0.165
 Powers+0.163
Neg logit contributions
yip-0.264
ibaba-0.244
ific-0.240
wrapper-0.236
ignt-0.209
Token)
Token ID8
Feature activation+8.91e-02

Pos logit contributions
yip+2.815
ific+2.667
ibaba+2.620
wrapper+2.617
iang+2.282
Neg logit contributions
 ILCS-1.810
ARB-1.766
ESH-1.684
RAG-1.683
ONS-1.599
Token
Token ID198
Feature activation+4.41e-01

Pos logit contributions
yip+0.381
ibaba+0.353
ific+0.348
wrapper+0.345
ignt+0.311
Neg logit contributions
 ILCS-0.164
ARB-0.157
RAG-0.146
ESH-0.146
 Powers-0.142
Token $
Token ID720
Feature activation-1.81e-01

Pos logit contributions
 ILCS+0.310
ARB+0.299
RAG+0.285
ESH+0.284
 Avery+0.281
Neg logit contributions
yip-0.435
ibaba-0.403
ific-0.397
wrapper-0.394
ignt-0.343
Token8
Token ID23
Feature activation+1.46e-01

Pos logit contributions
 ILCS+0.390
ARB+0.380
RAG+0.361
ESH+0.359
Wikipedia+0.349
Neg logit contributions
yip-0.712
ibaba-0.662
ific-0.648
wrapper-0.637
ignt-0.571
Token million
Token ID1510
Feature activation-2.08e-01

Pos logit contributions
yip+0.563
ific+0.526
ibaba+0.526
wrapper+0.517
iang+0.455
Neg logit contributions
 ILCS-0.312
ARB-0.302
RAG-0.287
ESH-0.282
Wikipedia-0.275
Token due
Token ID2233
Feature activation+1.77e-01

Pos logit contributions
 ILCS+0.504
ARB+0.488
RAG+0.463
ESH+0.462
 Avery+0.457
Neg logit contributions
yip-0.749
ibaba-0.697
ific-0.685
wrapper-0.681
ignt-0.592
Token to
Token ID284
Feature activation-1.38e-01

Pos logit contributions
yip+0.624
ibaba+0.578
ific+0.572
wrapper+0.565
ignt+0.490
Neg logit contributions
 ILCS-0.443
ARB-0.429
RAG-0.408
ESH-0.406
 Powers-0.398
Token Carlos
Token ID17409
Feature activation+1.45e+00

Pos logit contributions
 ILCS+0.331
ARB+0.317
ESH+0.301
RAG+0.301
 Powers+0.297
Neg logit contributions
yip-0.503
ibaba-0.467
ific-0.464
wrapper-0.458
ignt-0.402
Token Quentin
Token ID42447
Feature activation-7.12e-01

Pos logit contributions
yip+4.977
ific+4.626
ibaba+4.610
wrapper+4.546
ignt+3.935
Neg logit contributions
 ILCS-3.701
ARB-3.592
RAG-3.435
ESH-3.424
ONS-3.333
Token.
Token ID13
Feature activation+1.38e-01

Pos logit contributions
 ILCS+1.532
ARB+1.453
RAG+1.384
ESH+1.378
 Powers+1.365
Neg logit contributions
yip-2.764
ibaba-2.550
ific-2.508
wrapper-2.492
ignt-2.194
Token But
Token ID887
Feature activation-1.29e-02

Pos logit contributions
yip+0.508
ibaba+0.469
ific+0.460
wrapper+0.458
ignt+0.404
Neg logit contributions
 ILCS-0.335
ARB-0.319
RAG-0.304
ESH-0.302
 Powers-0.300
Token fact
Token ID1109
Feature activation+8.73e-02

Pos logit contributions
 ILCS+0.030
ARB+0.029
RAG+0.028
ESH+0.028
 Powers+0.027
Neg logit contributions
yip-0.048
ibaba-0.044
ific-0.044
wrapper-0.043
ignt-0.038
Tokenoring
Token ID3255
Feature activation-1.23e+00

Pos logit contributions
yip+0.303
ibaba+0.280
ific+0.278
wrapper+0.274
ignt+0.238
Neg logit contributions
 ILCS-0.224
ARB-0.217
RAG-0.207
ESH-0.205
Wikipedia-0.200
Token and
Token ID290
Feature activation+2.17e-01

Pos logit contributions
yip+0.906
ibaba+0.837
ific+0.831
wrapper+0.821
ignt+0.718
Neg logit contributions
 ILCS-0.618
ARB-0.599
RAG-0.571
ESH-0.566
 Powers-0.550
Token will
Token ID481
Feature activation+3.99e-01

Pos logit contributions
yip+0.756
ibaba+0.696
ific+0.693
wrapper+0.683
ignt+0.595
Neg logit contributions
 ILCS-0.551
ARB-0.536
RAG-0.512
ESH-0.508
 Powers-0.494
Token likely
Token ID1884
Feature activation+6.03e-01

Pos logit contributions
yip+1.299
ibaba+1.196
ific+1.191
wrapper+1.171
ignt+1.003
Neg logit contributions
 ILCS-1.106
ARB-1.071
RAG-1.028
ESH-1.019
ONS-0.995
Token face
Token ID1986
Feature activation+6.55e-01

Pos logit contributions
yip+1.944
ific+1.784
ibaba+1.784
wrapper+1.749
ignt+1.498
Neg logit contributions
 ILCS-1.692
ARB-1.636
RAG-1.571
ESH-1.559
Wikipedia-1.521
Token much
Token ID881
Feature activation+8.65e-01

Pos logit contributions
yip+2.180
ibaba+2.008
ific+2.005
wrapper+1.969
ignt+1.694
Neg logit contributions
 ILCS-1.773
ARB-1.712
RAG-1.642
ESH-1.627
ONS-1.587
Token resistance
Token ID6625
Feature activation+1.58e+00

Pos logit contributions
yip+2.745
ibaba+2.517
ific+2.511
wrapper+2.464
ignt+2.104
Neg logit contributions
 ILCS-2.474
ARB-2.394
RAG-2.302
ESH-2.282
ONS-2.228
Token from
Token ID422
Feature activation+6.16e-01

Pos logit contributions
yip+4.785
ific+4.426
ibaba+4.390
wrapper+4.310
iang+3.629
Neg logit contributions
 ILCS-4.704
ARB-4.548
RAG-4.390
ESH-4.361
Wikipedia-4.271
Token tobacco
Token ID11219
Feature activation+2.62e-01

Pos logit contributions
yip+2.272
ibaba+2.109
ific+2.103
wrapper+2.068
ignt+1.814
Neg logit contributions
 ILCS-1.449
ARB-1.395
RAG-1.328
ESH-1.315
ONS-1.277
Token companies
Token ID2706
Feature activation+4.03e-01

Pos logit contributions
yip+1.063
ibaba+0.992
ific+0.984
wrapper+0.972
ignt+0.864
Neg logit contributions
 ILCS-0.521
ARB-0.500
RAG-0.474
ESH-0.469
 Powers-0.455
Token,
Token ID11
Feature activation+1.35e-01

Pos logit contributions
yip+1.533
ibaba+1.428
ific+1.427
wrapper+1.404
ignt+1.236
Neg logit contributions
 ILCS-0.897
ARB-0.859
RAG-0.816
ESH-0.807
ONS-0.782
Token which
Token ID543
Feature activation+3.72e-01

Pos logit contributions
yip+0.473
ibaba+0.435
ific+0.433
wrapper+0.427
ignt+0.372
Neg logit contributions
 ILCS-0.343
ARB-0.334
RAG-0.317
ESH-0.315
 Powers-0.308
Token on
Token ID319
Feature activation+2.01e-02

Pos logit contributions
 ILCS+0.030
ARB+0.029
RAG+0.028
ESH+0.027
 Powers+0.027
Neg logit contributions
yip-0.052
ibaba-0.049
ific-0.048
wrapper-0.048
ignt-0.042
Token his
Token ID465
Feature activation-3.28e-01

Pos logit contributions
yip+0.075
ific+0.070
ibaba+0.070
wrapper+0.069
ignt+0.060
Neg logit contributions
 ILCS-0.046
ARB-0.044
RAG-0.042
ESH-0.042
 Powers-0.041
Token second
Token ID1218
Feature activation+4.28e-01

Pos logit contributions
 ILCS+0.742
ARB+0.718
RAG+0.679
 Powers+0.674
ESH+0.671
Neg logit contributions
yip-1.219
ibaba-1.147
ific-1.124
wrapper-1.112
ignt-0.980
Token day
Token ID1110
Feature activation+6.40e-01

Pos logit contributions
yip+1.519
ibaba+1.416
ific+1.405
wrapper+1.382
ignt+1.207
Neg logit contributions
 ILCS-1.061
ARB-1.025
RAG-0.979
ESH-0.970
ONS-0.951
Token in
Token ID287
Feature activation+2.36e-01

Pos logit contributions
yip+2.143
ific+1.977
ibaba+1.975
wrapper+1.937
ignt+1.672
Neg logit contributions
 ILCS-1.723
ARB-1.657
RAG-1.585
ESH-1.573
Wikipedia-1.534
Token Tokyo
Token ID11790
Feature activation+1.84e+00

Pos logit contributions
yip+0.794
ibaba+0.732
ific+0.730
wrapper+0.719
ignt+0.618
Neg logit contributions
 ILCS-0.629
ARB-0.609
RAG-0.584
ESH-0.578
 Powers-0.565
Token.
Token ID13
Feature activation+1.55e-01

Pos logit contributions
yip+6.520
ific+6.189
ibaba+6.057
wrapper+6.011
iang+5.380
Neg logit contributions
 ILCS-4.361
ARB-4.096
RAG-3.950
ESH-3.918
Wikipedia-3.852
Token
Token ID198
Feature activation-6.78e-02

Pos logit contributions
yip+0.582
ific+0.548
ibaba+0.546
wrapper+0.535
iang+0.473
Neg logit contributions
 ILCS-0.356
ARB-0.341
RAG-0.320
ESH-0.318
ONS-0.307
Token
Token ID198
Feature activation-3.30e-02

Pos logit contributions
 ILCS+0.120
ARB+0.117
ESH+0.109
RAG+0.108
Wikipedia+0.105
Neg logit contributions
yip-0.294
ibaba-0.274
ific-0.267
wrapper-0.265
ignt-0.242
TokenDesc
Token ID24564
Feature activation+2.90e-01

Pos logit contributions
 ILCS+0.072
ARB+0.071
RAG+0.067
ESH+0.067
Wikipedia+0.065
Neg logit contributions
yip-0.127
ibaba-0.119
ific-0.117
wrapper-0.116
ignt-0.102
Tokenribing
Token ID23098
Feature activation+4.81e-01

Pos logit contributions
yip+0.968
ibaba+0.889
ific+0.885
wrapper+0.875
ignt+0.751
Neg logit contributions
 ILCS-0.781
ARB-0.760
RAG-0.728
ESH-0.719
ONS-0.703
Token this
Token ID428
Feature activation+4.31e-01

Pos logit contributions
yip+2.402
ibaba+2.225
ific+2.215
wrapper+2.183
ignt+1.903
Neg logit contributions
 ILCS-1.676
ARB-1.618
RAG-1.546
ESH-1.534
ONS-1.490
Token season
Token ID1622
Feature activation+5.45e-01

Pos logit contributions
yip+1.436
ific+1.331
ibaba+1.326
wrapper+1.299
ciplinary+1.120
Neg logit contributions
 ILCS-1.167
ARB-1.110
RAG-1.074
ESH-1.055
ONS-1.032
Token,
Token ID11
Feature activation+3.47e-01

Pos logit contributions
yip+2.109
ibaba+1.966
ific+1.965
wrapper+1.933
ignt+1.705
Neg logit contributions
 ILCS-1.181
ARB-1.131
RAG-1.072
ESH-1.060
ONS-1.026
Token including
Token ID1390
Feature activation-5.54e-01

Pos logit contributions
yip+1.282
ibaba+1.178
ific+1.172
wrapper+1.167
ignt+1.016
Neg logit contributions
 ILCS-0.822
ARB-0.794
RAG-0.757
ESH-0.754
 Powers-0.743
Token last
Token ID938
Feature activation+6.69e-03

Pos logit contributions
 ILCS+1.247
ARB+1.229
 Avery+1.199
 Powers+1.194
RAG+1.183
Neg logit contributions
yip-2.041
ibaba-1.890
ific-1.857
wrapper-1.854
ignt-1.629
Token week
Token ID1285
Feature activation+1.50e+00

Pos logit contributions
yip+0.020
ibaba+0.018
ific+0.018
wrapper+0.017
ignt+0.015
Neg logit contributions
 ILCS-0.021
ARB-0.020
RAG-0.020
ESH-0.019
 Powers-0.019
Token when
Token ID618
Feature activation-4.44e-02

Pos logit contributions
yip+5.269
ific+4.952
ibaba+4.906
wrapper+4.795
iang+4.209
Neg logit contributions
 ILCS-3.722
ARB-3.548
RAG-3.368
ESH-3.315
Wikipedia-3.296
Token Cutler
Token ID37194
Feature activation+2.13e-01

Pos logit contributions
 ILCS+0.100
ARB+0.097
RAG+0.092
ESH+0.092
 Powers+0.090
Neg logit contributions
yip-0.167
ibaba-0.155
ific-0.154
wrapper-0.153
ignt-0.134
Token scored
Token ID7781
Feature activation+2.90e-01

Pos logit contributions
yip+0.765
ibaba+0.707
ific+0.701
wrapper+0.690
ignt+0.606
Neg logit contributions
 ILCS-0.522
ARB-0.506
RAG-0.484
ESH-0.481
 Powers-0.469
Token 22
Token ID2534
Feature activation-2.33e-02

Pos logit contributions
yip+0.999
ibaba+0.920
ific+0.911
wrapper+0.902
ignt+0.780
Neg logit contributions
 ILCS-0.751
ARB-0.732
RAG-0.703
ESH-0.698
 Powers-0.684
Token.
Token ID13
Feature activation+2.55e-02

Pos logit contributions
 ILCS+0.068
ARB+0.066
RAG+0.063
ESH+0.063
ONS+0.062
Neg logit contributions
yip-0.073
ibaba-0.067
ific-0.066
wrapper-0.065
ignt-0.056
Token full
Token ID1336
Feature activation-1.08e-01

Pos logit contributions
yip+0.108
ibaba+0.101
ific+0.099
wrapper+0.098
ignt+0.085
Neg logit contributions
 ILCS-0.079
ARB-0.075
RAG-0.073
ESH-0.073
 Avery-0.071
Token-
Token ID12
Feature activation+2.89e-01

Pos logit contributions
 ILCS+0.244
ARB+0.230
ESH+0.219
RAG+0.218
 Avery+0.215
Neg logit contributions
yip-0.413
ibaba-0.381
ific-0.376
wrapper-0.373
uador-0.327
Tokenlength
Token ID13664
Feature activation-3.50e-01

Pos logit contributions
yip+1.005
ibaba+0.928
ific+0.925
wrapper+0.905
ignt+0.789
Neg logit contributions
 ILCS-0.740
ARB-0.714
RAG-0.682
ESH-0.677
Wikipedia-0.662
Token statue
Token ID15207
Feature activation+3.05e-01

Pos logit contributions
 ILCS+0.882
ARB+0.833
ESH+0.802
RAG+0.800
 Avery+0.798
Neg logit contributions
yip-1.246
ibaba-1.150
ific-1.120
wrapper-1.117
uador-0.975
Token of
Token ID286
Feature activation+2.89e-01

Pos logit contributions
yip+1.078
ibaba+0.995
ific+0.979
wrapper+0.970
iang+0.840
Neg logit contributions
 ILCS-0.775
ARB-0.743
RAG-0.710
ESH-0.710
 Avery-0.694
Token a
Token ID257
Feature activation+4.10e-01

Pos logit contributions
yip+1.004
ibaba+0.929
ific+0.922
wrapper+0.909
ignt+0.788
Neg logit contributions
 ILCS-0.745
ARB-0.716
RAG-0.686
ESH-0.681
 Powers-0.665
Token poet
Token ID21810
Feature activation-7.75e-01

Pos logit contributions
yip+1.464
ibaba+1.358
ific+1.342
wrapper+1.324
ignt+1.152
Neg logit contributions
 ILCS-1.023
ARB-0.973
RAG-0.940
ESH-0.930
 Powers-0.910
Token in
Token ID287
Feature activation+3.63e-03

Pos logit contributions
 ILCS+1.837
ARB+1.725
ESH+1.673
RAG+1.660
 Avery+1.625
Neg logit contributions
yip-2.839
ibaba-2.644
wrapper-2.569
ific-2.560
iang-2.236
Token Toronto
Token ID6586
Feature activation-4.27e-01

Pos logit contributions
yip+0.013
ibaba+0.012
ific+0.012
wrapper+0.012
ciplinary+0.010
Neg logit contributions
 ILCS-0.009
ARB-0.008
RAG-0.008
ESH-0.008
 Powers-0.008
Token,
Token ID11
Feature activation+4.97e-02

Pos logit contributions
 ILCS+0.982
ARB+0.925
RAG+0.880
ESH+0.879
 Powers+0.857
Neg logit contributions
yip-1.615
ibaba-1.493
ific-1.474
wrapper-1.458
uador-1.278
Token the
Token ID262
Feature activation+2.63e-01

Pos logit contributions
yip+0.183
ibaba+0.170
ific+0.167
wrapper+0.166
ignt+0.144
Neg logit contributions
 ILCS-0.119
ARB-0.114
RAG-0.109
ESH-0.108
 Powers-0.106
Token has
Token ID468
Feature activation-1.37e-01

Pos logit contributions
 ILCS+0.359
ARB+0.348
RAG+0.332
ESH+0.331
 Powers+0.326
Neg logit contributions
yip-0.518
ibaba-0.481
ific-0.473
wrapper-0.469
ignt-0.409
Token got
Token ID1392
Feature activation-8.35e-01

Pos logit contributions
 ILCS+0.364
ARB+0.356
RAG+0.341
ESH+0.340
 Powers+0.335
Neg logit contributions
yip-0.460
ibaba-0.427
ific-0.413
wrapper-0.411
ignt-0.359
Token to
Token ID284
Feature activation+3.41e-01

Pos logit contributions
 ILCS+1.962
ARB+1.926
RAG+1.861
ESH+1.852
 Powers+1.818
Neg logit contributions
yip-2.977
ibaba-2.770
ific-2.696
wrapper-2.679
ignt-2.380
Token music
Token ID2647
Feature activation+3.28e-01

Pos logit contributions
yip+1.092
ibaba+1.006
ific+0.998
wrapper+0.982
ignt+0.840
Neg logit contributions
 ILCS-0.966
ARB-0.935
RAG-0.899
ESH-0.892
ONS-0.871
Token.
Token ID13
Feature activation+1.60e-01

Pos logit contributions
yip+1.214
ibaba+1.130
ific+1.118
wrapper+1.104
ignt+0.972
Neg logit contributions
 ILCS-0.764
ARB-0.736
RAG-0.700
ESH-0.695
 Powers-0.674
Token We
Token ID775
Feature activation+9.50e-02

Pos logit contributions
yip+0.597
ibaba+0.552
ific+0.545
wrapper+0.539
ignt+0.475
Neg logit contributions
 ILCS-0.376
ARB-0.361
RAG-0.345
ESH-0.342
 Powers-0.337
Token're
Token ID821
Feature activation-8.03e-02

Pos logit contributions
yip+0.300
ibaba+0.275
ific+0.271
wrapper+0.270
ignt+0.230
Neg logit contributions
 ILCS-0.273
ARB-0.266
ESH-0.255
RAG-0.254
 Powers-0.250
Token getting
Token ID1972
Feature activation-3.64e-01

Pos logit contributions
 ILCS+0.197
ARB+0.190
RAG+0.182
ESH+0.181
 Powers+0.178
Neg logit contributions
yip-0.287
ibaba-0.267
ific-0.262
wrapper-0.260
ignt-0.227
Token a
Token ID257
Feature activation+2.55e-01

Pos logit contributions
 ILCS+0.855
ARB+0.835
ESH+0.803
RAG+0.801
 Powers+0.777
Neg logit contributions
yip-1.331
ibaba-1.233
ific-1.213
wrapper-1.192
ignt-1.064
Token cross
Token ID3272
Feature activation+2.01e-01

Pos logit contributions
yip+0.886
ibaba+0.821
ific+0.811
wrapper+0.796
ignt+0.700
Neg logit contributions
 ILCS-0.655
ARB-0.632
RAG-0.607
ESH-0.605
 Powers-0.588
Token-
Token ID12
Feature activation+1.59e-01

Pos logit contributions
yip+0.703
ibaba+0.652
ific+0.649
wrapper+0.638
ignt+0.556
Neg logit contributions
 ILCS-0.509
ARB-0.490
RAG-0.468
ESH-0.464
Wikipedia-0.452
Token The
Token ID383
Feature activation+3.52e-01

Pos logit contributions
 ILCS+0.397
ARB+0.379
RAG+0.363
ESH+0.358
 Powers+0.358
Neg logit contributions
yip-0.615
ibaba-0.567
ific-0.558
wrapper-0.554
ignt-0.488
Token better
Token ID1365
Feature activation+2.67e-01

Pos logit contributions
yip+1.463
ibaba+1.370
ific+1.369
wrapper+1.350
ignt+1.203
Neg logit contributions
 ILCS-0.658
ARB-0.626
RAG-0.589
ESH-0.581
ONS-0.559
Token known
Token ID1900
Feature activation+6.09e-01

Pos logit contributions
yip+0.991
ibaba+0.921
ific+0.918
wrapper+0.904
ignt+0.792
Neg logit contributions
 ILCS-0.619
ARB-0.595
RAG-0.566
ESH-0.561
 Powers-0.544
Token is
Token ID318
Feature activation+1.08e-01

Pos logit contributions
yip+2.252
ibaba+2.091
ific+2.087
wrapper+2.054
ignt+1.800
Neg logit contributions
 ILCS-1.422
ARB-1.367
RAG-1.301
ESH-1.288
ONS-1.250
Token the
Token ID262
Feature activation+1.04e-01

Pos logit contributions
yip+0.380
ibaba+0.351
ific+0.348
wrapper+0.344
ignt+0.299
Neg logit contributions
 ILCS-0.272
ARB-0.263
RAG-0.251
ESH-0.249
 Powers-0.243
Token main
Token ID1388
Feature activation+5.34e-01

Pos logit contributions
yip+0.365
ibaba+0.338
ific+0.335
wrapper+0.329
ignt+0.287
Neg logit contributions
 ILCS-0.265
ARB-0.254
RAG-0.243
ESH-0.241
 Powers-0.238
Token Brazilian
Token ID17036
Feature activation-1.25e-01

Pos logit contributions
yip+1.808
ibaba+1.671
ific+1.664
wrapper+1.634
ignt+1.411
Neg logit contributions
 ILCS-1.416
ARB-1.364
RAG-1.308
ESH-1.297
 Powers-1.268
Token branch
Token ID8478
Feature activation+3.10e-01

Pos logit contributions
 ILCS+0.321
ARB+0.308
RAG+0.293
ESH+0.292
 Powers+0.291
Neg logit contributions
yip-0.435
ibaba-0.403
ific-0.399
wrapper-0.393
uador-0.342
Token of
Token ID286
Feature activation+1.42e-01

Pos logit contributions
yip+1.083
ibaba+1.002
ific+0.999
wrapper+0.983
ignt+0.853
Neg logit contributions
 ILCS-0.788
ARB-0.760
RAG-0.726
ESH-0.720
ONS-0.701
Token the
Token ID262
Feature activation+4.50e-02

Pos logit contributions
yip+0.503
ibaba+0.467
ific+0.463
wrapper+0.455
ignt+0.397
Neg logit contributions
 ILCS-0.355
ARB-0.343
RAG-0.326
ESH-0.324
 Powers-0.317
Token As
Token ID1081
Feature activation+6.11e-02

Pos logit contributions
yip+0.162
ibaba+0.150
ific+0.148
wrapper+0.146
ignt+0.127
Neg logit contributions
 ILCS-0.111
ARB-0.106
RAG-0.101
ESH-0.101
 Powers-0.100
Tokenmen
Token ID3653
Feature activation+4.86e-01

Pos logit contributions
yip+0.010
ibaba+0.010
ific+0.009
wrapper+0.009
uador+0.008
Neg logit contributions
 ILCS-0.007
ARB-0.007
RAG-0.006
ESH-0.006
ONS-0.006
Token
Token ID447
Feature activation+3.41e-01

Pos logit contributions
yip+2.010
ibaba+1.876
ific+1.867
wrapper+1.843
ignt+1.636
Neg logit contributions
 ILCS-0.941
ARB-0.892
RAG-0.840
ESH-0.830
ONS-0.803
Token
Token ID247
Feature activation-3.73e-02

Pos logit contributions
yip+1.452
ibaba+1.343
ific+1.334
wrapper+1.316
ignt+1.181
Neg logit contributions
 ILCS-0.625
ARB-0.600
RAG-0.565
ESH-0.551
ONS-0.532
Tokens
Token ID82
Feature activation+1.27e-01

Pos logit contributions
 ILCS+0.133
ARB+0.129
ESH+0.126
RAG+0.125
 Powers+0.124
Neg logit contributions
yip-0.095
ibaba-0.083
ific-0.079
wrapper-0.079
uador-0.065
Token Et
Token ID17906
Feature activation+1.57e-01

Pos logit contributions
yip+0.411
ific+0.380
ibaba+0.379
wrapper+0.372
ignt+0.320
Neg logit contributions
 ILCS-0.354
ARB-0.344
RAG-0.329
ESH-0.326
ONS-0.318
Tokeniquette
Token ID40387
Feature activation+2.48e-01

Pos logit contributions
yip+0.477
ibaba+0.433
ific+0.425
wrapper+0.423
ignt+0.360
Neg logit contributions
 ILCS-0.471
ARB-0.453
RAG-0.438
ESH-0.437
Wikipedia-0.435
Token,
Token ID11
Feature activation+3.04e-01

Pos logit contributions
yip+0.953
ific+0.900
ibaba+0.888
wrapper+0.879
ignt+0.780
Neg logit contributions
 ILCS-0.533
ARB-0.513
RAG-0.490
ESH-0.486
ONS-0.463
Token 18
Token ID1248
Feature activation+4.76e-01

Pos logit contributions
yip+1.016
ific+0.943
ibaba+0.936
wrapper+0.915
ignt+0.794
Neg logit contributions
 ILCS-0.805
ARB-0.779
RAG-0.750
ESH-0.745
Wikipedia-0.721
Token77
Token ID3324
Feature activation+8.72e-01

Pos logit contributions
yip+1.944
ific+1.830
ibaba+1.819
wrapper+1.800
ignt+1.595
Neg logit contributions
 ILCS-0.913
ARB-0.867
RAG-0.818
ESH-0.807
ONS-0.777
Token.
Token ID13
Feature activation+7.51e-02

Pos logit contributions
yip+3.385
ibaba+3.156
ific+3.153
wrapper+3.105
ignt+2.740
Neg logit contributions
 ILCS-1.875
ARB-1.796
RAG-1.702
ESH-1.683
ONS-1.629
Token
Token ID198
Feature activation+5.70e-01

Pos logit contributions
yip+0.308
ibaba+0.287
ific+0.286
wrapper+0.282
ignt+0.251
Neg logit contributions
 ILCS-0.148
ARB-0.141
RAG-0.133
ESH-0.131
Wikipedia-0.127
Tokenor
Token ID273
Feature activation-1.61e-01

Pos logit contributions
yip+1.810
ibaba+1.668
ific+1.661
wrapper+1.635
ignt+1.407
Neg logit contributions
 ILCS-1.447
ARB-1.401
RAG-1.341
ESH-1.329
ONS-1.295
Token,
Token ID11
Feature activation-7.11e-03

Pos logit contributions
 ILCS+0.353
ARB+0.340
RAG+0.322
ESH+0.319
 Powers+0.309
Neg logit contributions
yip-0.620
ibaba-0.578
ific-0.574
wrapper-0.567
ignt-0.499
Token alone
Token ID3436
Feature activation-6.48e-01

Pos logit contributions
 ILCS+0.017
ARB+0.017
RAG+0.016
ESH+0.016
 Powers+0.015
Neg logit contributions
yip-0.026
ibaba-0.024
ific-0.024
wrapper-0.023
ignt-0.020
Tokenly
Token ID306
Feature activation-8.40e-01

Pos logit contributions
 ILCS+1.435
ARB+1.374
RAG+1.324
ESH+1.294
 Powers+1.264
Neg logit contributions
yip-2.472
ibaba-2.296
ific-2.267
wrapper-2.244
ignt-1.973
Token young
Token ID1862
Feature activation-9.20e-02

Pos logit contributions
 ILCS+1.927
ARB+1.840
RAG+1.756
ESH+1.728
 Powers+1.699
Neg logit contributions
yip-3.160
ibaba-2.912
ific-2.870
wrapper-2.841
ignt-2.486
Token man
Token ID582
Feature activation+4.91e-01

Pos logit contributions
 ILCS+0.218
ARB+0.210
RAG+0.200
ESH+0.198
ONS+0.192
Neg logit contributions
yip-0.337
ibaba-0.313
ific-0.311
wrapper-0.307
ignt-0.269
Token named
Token ID3706
Feature activation-6.90e-01

Pos logit contributions
yip+1.800
ific+1.684
ibaba+1.671
wrapper+1.646
ignt+1.442
Neg logit contributions
 ILCS-1.149
ARB-1.103
RAG-1.052
ESH-1.043
ONS-1.012
Token Norman
Token ID19636
Feature activation+1.65e-01

Pos logit contributions
 ILCS+1.619
ARB+1.560
RAG+1.481
ESH+1.465
 Powers+1.445
Neg logit contributions
yip-2.553
ibaba-2.385
ific-2.340
wrapper-2.316
ignt-2.043
Token Bates
Token ID35591
Feature activation-2.17e-01

Pos logit contributions
yip+0.605
ibaba+0.562
ific+0.560
wrapper+0.552
ignt+0.483
Neg logit contributions
 ILCS-0.389
ARB-0.374
RAG-0.357
ESH-0.353
 Powers-0.344
Token (
Token ID357
Feature activation-6.19e-01

Pos logit contributions
 ILCS+0.493
ARB+0.474
RAG+0.451
ESH+0.446
ONS+0.434
Neg logit contributions
yip-0.814
ibaba-0.756
ific-0.754
wrapper-0.743
ignt-0.651
TokenAnthony
Token ID32697
Feature activation-1.03e+00

Pos logit contributions
 ILCS+1.516
ARB+1.494
ESH+1.410
RAG+1.409
Wikipedia+1.389
Neg logit contributions
yip-2.209
ibaba-2.063
ific-2.015
wrapper-1.981
ignt-1.749
Token Pal
Token ID3175
Feature activation+7.20e-01

Pos logit contributions
 ILCS+0.443
ARB+0.431
RAG+0.409
ESH+0.408
 Powers+0.404
Neg logit contributions
yip-0.585
ibaba-0.540
ific-0.532
wrapper-0.528
ciplinary-0.455
Tokenlet
Token ID1616
Feature activation-4.85e-01

Pos logit contributions
yip+2.562
ific+2.376
ibaba+2.376
wrapper+2.330
ignt+2.032
Neg logit contributions
 ILCS-1.776
ARB-1.718
RAG-1.638
ESH-1.620
ONS-1.576
Token Ads
Token ID47442
Feature activation-7.37e-02

Pos logit contributions
 ILCS+1.167
ARB+1.117
RAG+1.068
ESH+1.064
 Powers+1.055
Neg logit contributions
yip-1.759
ibaba-1.619
ific-1.590
wrapper-1.579
ciplinary-1.389
Token-
Token ID12
Feature activation+7.50e-01

Pos logit contributions
 ILCS+0.182
ARB+0.174
RAG+0.167
ESH+0.165
Wikipedia+0.160
Neg logit contributions
yip-0.262
ific-0.245
ibaba-0.244
wrapper-0.240
iang-0.209
TokenHe
Token ID1544
Feature activation-8.77e-02

Pos logit contributions
yip+2.626
ific+2.440
ibaba+2.428
wrapper+2.400
ignt+2.080
Neg logit contributions
 ILCS-1.882
ARB-1.796
RAG-1.719
ESH-1.699
 Powers-1.656
Tokenath
Token ID776
Feature activation-2.62e-01

Pos logit contributions
 ILCS+0.235
ARB+0.229
RAG+0.219
ESH+0.217
 Powers+0.215
Neg logit contributions
yip-0.294
ibaba-0.268
ific-0.266
wrapper-0.265
ignt-0.226
Token Led
Token ID22964
Feature activation-2.73e-02

Pos logit contributions
 ILCS+0.591
ARB+0.570
RAG+0.536
 Powers+0.528
ESH+0.527
Neg logit contributions
yip-0.999
ibaba-0.921
ific-0.914
wrapper-0.906
ignt-0.793
Tokenger
Token ID1362
Feature activation-4.54e-02

Pos logit contributions
 ILCS+0.083
ARB+0.082
RAG+0.078
ESH+0.078
 Powers+0.077
Neg logit contributions
yip-0.081
ibaba-0.074
ific-0.073
wrapper-0.072
ignt-0.061
Token Dead
Token ID5542
Feature activation+5.88e-01

Pos logit contributions
 ILCS+0.109
ARB+0.105
RAG+0.100
ESH+0.099
 Powers+0.098
Neg logit contributions
yip-0.165
ibaba-0.153
ific-0.151
wrapper-0.149
ignt-0.131
Token Naked
Token ID41619
Feature activation+5.43e-01

Pos logit contributions
yip+2.127
ibaba+1.967
ific+1.959
wrapper+1.930
ignt+1.684
Neg logit contributions
 ILCS-1.430
ARB-1.374
RAG-1.313
ESH-1.299
 Powers-1.265
Token-
Token ID12
Feature activation+1.04e+00

Pos logit contributions
yip+1.896
ific+1.767
ibaba+1.762
wrapper+1.731
iang+1.506
Neg logit contributions
 ILCS-1.371
ARB-1.320
RAG-1.265
ESH-1.247
ONS-1.211
Token and
Token ID290
Feature activation+2.22e-01

Pos logit contributions
yip+1.023
ibaba+0.951
ific+0.947
wrapper+0.933
ignt+0.821
Neg logit contributions
 ILCS-0.623
ARB-0.600
RAG-0.571
ESH-0.567
 Powers-0.551
Token career
Token ID3451
Feature activation+2.00e-01

Pos logit contributions
yip+0.799
ific+0.742
ibaba+0.741
wrapper+0.728
ignt+0.635
Neg logit contributions
 ILCS-0.542
ARB-0.521
RAG-0.498
ESH-0.492
ONS-0.479
Token would
Token ID561
Feature activation+5.12e-01

Pos logit contributions
yip+0.732
ibaba+0.677
ific+0.672
wrapper+0.663
ignt+0.581
Neg logit contributions
 ILCS-0.478
ARB-0.462
RAG-0.441
ESH-0.438
 Powers-0.427
Token change
Token ID1487
Feature activation+8.12e-01

Pos logit contributions
yip+1.635
ibaba+1.500
ific+1.498
wrapper+1.469
ignt+1.255
Neg logit contributions
 ILCS-1.457
ARB-1.409
RAG-1.354
ESH-1.343
ONS-1.311
Token forever
Token ID8097
Feature activation-5.06e-01

Pos logit contributions
yip+2.855
ific+2.654
ibaba+2.644
wrapper+2.603
ignt+2.262
Neg logit contributions
 ILCS-2.031
ARB-1.954
RAG-1.867
ESH-1.847
Wikipedia-1.796
Token.
Token ID13
Feature activation-1.57e-01

Pos logit contributions
 ILCS+1.158
ARB+1.097
RAG+1.046
ESH+1.037
 Avery+1.027
Neg logit contributions
yip-1.914
ibaba-1.775
ific-1.730
wrapper-1.723
ignt-1.511
Token He
Token ID679
Feature activation+2.39e-02

Pos logit contributions
 ILCS+0.350
ARB+0.335
RAG+0.320
ESH+0.316
 Powers+0.310
Neg logit contributions
yip-0.605
ibaba-0.561
ific-0.556
wrapper-0.551
ignt-0.486
Token took
Token ID1718
Feature activation-7.60e-01

Pos logit contributions
yip+0.088
ibaba+0.081
ific+0.080
wrapper+0.079
ignt+0.069
Neg logit contributions
 ILCS-0.057
ARB-0.055
RAG-0.052
ESH-0.052
 Powers-0.051
Token eight
Token ID3624
Feature activation+3.95e-01

Pos logit contributions
 ILCS+1.714
ARB+1.690
RAG+1.589
ESH+1.572
 Powers+1.570
Neg logit contributions
yip-2.860
ibaba-2.635
ific-2.599
wrapper-2.582
ignt-2.259
Token months
Token ID1933
Feature activation+4.98e-01

Pos logit contributions
yip+1.409
ibaba+1.302
ific+1.294
wrapper+1.276
ignt+1.110
Neg logit contributions
 ILCS-0.985
ARB-0.949
RAG-0.903
ESH-0.896
 Powers-0.875
Token off
Token ID572
Feature activation-2.19e-01

Pos logit contributions
yip+1.693
ific+1.574
ibaba+1.570
wrapper+1.541
ignt+1.334
Neg logit contributions
 ILCS-1.304
ARB-1.256
RAG-1.203
ESH-1.188
ONS-1.162
Token
Token ID198
Feature activation-1.17e-01

Pos logit contributions
 ILCS+0.205
ARB+0.199
ESH+0.185
RAG+0.185
Wikipedia+0.178
Neg logit contributions
yip-0.498
ibaba-0.463
ific-0.451
wrapper-0.449
ignt-0.407
TokenShould
Token ID19926
Feature activation+1.14e-01

Pos logit contributions
 ILCS+0.293
ARB+0.291
RAG+0.277
ESH+0.273
Wikipedia+0.270
Neg logit contributions
yip-0.418
ibaba-0.385
ific-0.378
wrapper-0.372
uador-0.327
Token Dev
Token ID6245
Feature activation+4.57e-02

Pos logit contributions
yip+0.407
ibaba+0.376
ific+0.375
wrapper+0.369
ignt+0.322
Neg logit contributions
 ILCS-0.279
ARB-0.270
RAG-0.258
ESH-0.255
 Powers-0.249
Tokenial
Token ID498
Feature activation+1.41e-01

Pos logit contributions
yip+0.169
ibaba+0.154
wrapper+0.153
ific+0.152
uador+0.135
Neg logit contributions
 ILCS-0.104
ARB-0.101
RAG-0.095
ESH-0.093
Wikipedia-0.092
Tokenet
Token ID316
Feature activation-1.34e-01

Pos logit contributions
yip+0.421
ibaba+0.387
ific+0.377
wrapper+0.377
ciplinary+0.315
Neg logit contributions
 ILCS-0.425
ARB-0.411
ESH-0.400
RAG-0.391
Wikipedia-0.389
Token
Token ID447
Feature activation+2.20e-02

Pos logit contributions
 ILCS+0.287
ARB+0.275
RAG+0.261
ESH+0.257
 Powers+0.250
Neg logit contributions
yip-0.521
ibaba-0.485
ific-0.483
wrapper-0.477
ignt-0.420
Token
Token ID247
Feature activation-2.86e-01

Pos logit contributions
yip+0.095
ibaba+0.088
ific+0.087
wrapper+0.086
ignt+0.077
Neg logit contributions
 ILCS-0.039
ARB-0.037
RAG-0.035
ESH-0.034
ONS-0.033
Tokens
Token ID82
Feature activation+3.65e-02

Pos logit contributions
 ILCS+0.967
ARB+0.947
ESH+0.929
RAG+0.914
 Powers+0.903
Neg logit contributions
yip-0.766
ibaba-0.670
ific-0.635
wrapper-0.625
ignt-0.536
Token Silver
Token ID7698
Feature activation+1.07e-01

Pos logit contributions
yip+0.127
ibaba+0.117
ific+0.116
wrapper+0.115
ignt+0.100
Neg logit contributions
 ILCS-0.093
ARB-0.091
RAG-0.087
ESH-0.085
 Powers-0.084
Token Phant
Token ID38699
Feature activation-5.98e-01

Pos logit contributions
yip+0.436
ibaba+0.408
ific+0.406
wrapper+0.401
ignt+0.356
Neg logit contributions
 ILCS-0.208
ARB-0.198
RAG-0.187
ESH-0.184
 Powers-0.179
Tokenoms
Token ID3150
Feature activation+8.66e-02

Pos logit contributions
 ILCS+1.229
ARB+1.204
ONS+1.138
 Powers+1.116
RAG+1.106
Neg logit contributions
yip-2.360
ibaba-2.167
wrapper-2.138
ific-2.100
uador-1.902
Token Rhodes
Token ID25656
Feature activation+2.33e-01

Pos logit contributions
 ILCS+0.733
ARB+0.700
RAG+0.669
ESH+0.666
 Powers+0.661
Neg logit contributions
yip-1.205
ibaba-1.118
ific-1.106
wrapper-1.096
ignt-0.968
Token must
Token ID1276
Feature activation+2.37e-01

Pos logit contributions
yip+0.850
ibaba+0.787
ific+0.783
wrapper+0.773
ignt+0.676
Neg logit contributions
 ILCS-0.555
ARB-0.534
RAG-0.509
ESH-0.506
 Powers-0.491
Token stay
Token ID2652
Feature activation+5.24e-03

Pos logit contributions
yip+0.774
ibaba+0.710
ific+0.707
wrapper+0.695
ignt+0.596
Neg logit contributions
 ILCS-0.655
ARB-0.631
RAG-0.608
ESH-0.604
ONS-0.590
Token focused
Token ID5670
Feature activation-3.25e-01

Pos logit contributions
yip+0.019
ibaba+0.017
ific+0.017
wrapper+0.017
ignt+0.015
Neg logit contributions
 ILCS-0.013
ARB-0.012
RAG-0.012
ESH-0.012
 Avery-0.012
Token on
Token ID319
Feature activation-4.55e-01

Pos logit contributions
 ILCS+0.734
ARB+0.704
RAG+0.674
ESH+0.671
 Powers+0.661
Neg logit contributions
yip-1.225
ibaba-1.139
ific-1.123
wrapper-1.112
ignt-0.979
Token deep
Token ID2769
Feature activation-2.11e-01

Pos logit contributions
 ILCS+0.981
ARB+0.949
RAG+0.905
 Avery+0.900
ESH+0.900
Neg logit contributions
yip-1.744
ibaba-1.617
ific-1.604
wrapper-1.580
ignt-1.400
Token throws
Token ID12542
Feature activation+4.32e-01

Pos logit contributions
 ILCS+0.479
ARB+0.460
RAG+0.439
ESH+0.438
 Powers+0.426
Neg logit contributions
yip-0.800
ibaba-0.739
ific-0.736
wrapper-0.723
ignt-0.638
Token and
Token ID290
Feature activation+2.36e-01

Pos logit contributions
yip+1.582
ibaba+1.469
ific+1.464
wrapper+1.440
ignt+1.263
Neg logit contributions
 ILCS-1.026
ARB-0.988
RAG-0.941
ESH-0.932
 Powers-0.905
Token fades
Token ID41381
Feature activation-3.89e-01

Pos logit contributions
yip+0.816
ibaba+0.751
ific+0.747
wrapper+0.735
ignt+0.640
Neg logit contributions
 ILCS-0.609
ARB-0.588
RAG-0.563
ESH-0.559
 Powers-0.549
Token.
Token ID13
Feature activation+2.64e-01

Pos logit contributions
 ILCS+0.890
ARB+0.853
ESH+0.820
RAG+0.816
 Powers+0.802
Neg logit contributions
yip-1.450
ibaba-1.342
ific-1.331
wrapper-1.314
ignt-1.158
Token
Token ID198
Feature activation+7.53e-02

Pos logit contributions
yip+1.016
ibaba+0.942
ific+0.933
wrapper+0.923
ignt+0.820
Neg logit contributions
 ILCS-0.590
ARB-0.561
RAG-0.537
ESH-0.531
 Powers-0.524
Tokenill
Token ID359
Feature activation-3.97e-01

Pos logit contributions
 ILCS+2.637
ARB+2.571
RAG+2.487
ESH+2.451
ONS+2.402
Neg logit contributions
yip-2.696
ibaba-2.401
wrapper-2.367
ific-2.345
ignt-2.003
Tokenessen
Token ID44483
Feature activation-1.84e-01

Pos logit contributions
 ILCS+0.990
ARB+0.957
RAG+0.925
 Powers+0.921
ESH+0.920
Neg logit contributions
yip-1.394
ibaba-1.259
wrapper-1.255
ific-1.251
ignt-1.069
Token,
Token ID11
Feature activation+4.06e-01

Pos logit contributions
 ILCS+0.428
ARB+0.408
RAG+0.388
ESH+0.384
 Powers+0.378
Neg logit contributions
yip-0.686
ibaba-0.632
ific-0.628
wrapper-0.621
ignt-0.547
Token who
Token ID508
Feature activation+2.98e-01

Pos logit contributions
yip+1.451
ibaba+1.340
ific+1.333
wrapper+1.315
ignt+1.146
Neg logit contributions
 ILCS-1.008
ARB-0.967
RAG-0.924
ESH-0.914
 Powers-0.895
Token started
Token ID2067
Feature activation+3.93e-01

Pos logit contributions
yip+1.064
ibaba+0.985
ific+0.982
wrapper+0.967
ignt+0.843
Neg logit contributions
 ILCS-0.735
ARB-0.707
RAG-0.675
ESH-0.668
ONS-0.650
Token very
Token ID845
Feature activation+3.83e-02

Pos logit contributions
yip+1.391
ibaba+1.285
ific+1.280
wrapper+1.261
ignt+1.096
Neg logit contributions
 ILCS-0.983
ARB-0.947
RAG-0.906
ESH-0.898
 Powers-0.874
Token high
Token ID1029
Feature activation+9.70e-01

Pos logit contributions
yip+0.137
ibaba+0.126
ific+0.125
wrapper+0.124
ignt+0.108
Neg logit contributions
 ILCS-0.095
ARB-0.091
RAG-0.087
ESH-0.087
 Powers-0.085
Token up
Token ID510
Feature activation+8.90e-01

Pos logit contributions
yip+3.319
ific+3.092
ibaba+3.072
wrapper+3.013
iang+2.599
Neg logit contributions
 ILCS-2.499
ARB-2.428
RAG-2.326
ESH-2.296
Wikipedia-2.242
Token the
Token ID262
Feature activation+5.19e-01

Pos logit contributions
yip+3.062
ibaba+2.835
ific+2.834
wrapper+2.779
ignt+2.403
Neg logit contributions
 ILCS-2.297
ARB-2.218
RAG-2.122
ESH-2.104
ONS-2.047
Token pitch
Token ID7078
Feature activation+1.36e+00

Pos logit contributions
yip+1.351
ibaba+1.213
ific+1.209
wrapper+1.183
ignt+0.967
Neg logit contributions
 ILCS-1.785
ARB-1.735
RAG-1.679
ESH-1.669
ONS-1.636
Token.
Token ID13
Feature activation+1.83e-01

Pos logit contributions
yip+4.911
ific+4.646
ibaba+4.582
wrapper+4.520
iang+3.949
Neg logit contributions
 ILCS-3.186
ARB-3.098
RAG-2.953
ESH-2.894
Wikipedia-2.832
Token anything
Token ID1997
Feature activation+1.01e+00

Pos logit contributions
yip+1.438
ibaba+1.328
ific+1.309
wrapper+1.294
ignt+1.128
Neg logit contributions
 ILCS-1.022
ARB-0.992
RAG-0.944
ESH-0.943
Wikipedia-0.918
Token that
Token ID326
Feature activation+5.81e-01

Pos logit contributions
yip+3.542
ific+3.296
ibaba+3.276
wrapper+3.228
ignt+2.803
Neg logit contributions
 ILCS-2.548
ARB-2.454
RAG-2.347
ESH-2.323
ONS-2.255
Token questions
Token ID2683
Feature activation-1.00e-02

Pos logit contributions
yip+2.078
ibaba+1.924
ific+1.921
wrapper+1.889
ignt+1.646
Neg logit contributions
 ILCS-1.429
ARB-1.376
RAG-1.313
ESH-1.301
ONS-1.265
Token the
Token ID262
Feature activation+1.50e-01

Pos logit contributions
 ILCS+0.025
ARB+0.024
RAG+0.023
ESH+0.023
 Powers+0.023
Neg logit contributions
yip-0.036
ibaba-0.033
ific-0.032
wrapper-0.032
ignt-0.028
Token new
Token ID649
Feature activation-5.33e-01

Pos logit contributions
yip+0.523
ibaba+0.483
ific+0.471
wrapper+0.468
ignt+0.408
Neg logit contributions
 ILCS-0.384
ARB-0.373
RAG-0.354
ESH-0.354
 Powers-0.348
Token State
Token ID1812
Feature activation-1.21e+00

Pos logit contributions
 ILCS+1.261
ARB+1.239
ESH+1.177
 Powers+1.167
Wikipedia+1.163
Neg logit contributions
yip-1.957
ibaba-1.801
wrapper-1.739
ific-1.734
ignt-1.534
Token religion
Token ID5737
Feature activation-3.28e-01

Pos logit contributions
 ILCS+3.081
ARB+2.980
RAG+2.800
 Powers+2.777
ESH+2.745
Neg logit contributions
yip-4.265
ibaba-3.940
wrapper-3.840
ific-3.795
iang-3.324
Token of
Token ID286
Feature activation-1.49e-01

Pos logit contributions
 ILCS+0.806
ARB+0.771
RAG+0.726
ESH+0.718
 Powers+0.714
Neg logit contributions
yip-1.190
ibaba-1.103
wrapper-1.073
ific-1.072
ignt-0.937
Token supposedly
Token ID13519
Feature activation-5.93e-01

Pos logit contributions
 ILCS+0.371
ARB+0.364
RAG+0.345
ESH+0.344
 Avery+0.338
Neg logit contributions
yip-0.530
ibaba-0.489
ific-0.476
wrapper-0.474
ignt-0.415
Token catastrophic
Token ID22694
Feature activation+2.78e-01

Pos logit contributions
 ILCS+1.391
ARB+1.341
RAG+1.283
ESH+1.278
 Avery+1.255
Neg logit contributions
yip-2.190
ibaba-2.033
ific-1.968
wrapper-1.955
ignt-1.720
Token man
Token ID582
Feature activation+5.77e-01

Pos logit contributions
yip+0.940
ibaba+0.867
ific+0.852
wrapper+0.845
ignt+0.729
Neg logit contributions
 ILCS-0.743
ARB-0.721
RAG-0.687
ESH-0.684
 Powers-0.670
Token Inc
Token ID3457
Feature activation-6.15e-01

Pos logit contributions
 ILCS+3.422
ARB+3.245
 Powers+3.136
 Avery+3.101
RAG+3.075
Neg logit contributions
yip-5.804
ibaba-5.371
ific-5.305
wrapper-5.298
ciplinary-4.702
Token.,
Token ID1539
Feature activation-6.84e-02

Pos logit contributions
 ILCS+1.399
ARB+1.342
RAG+1.276
ESH+1.263
Wikipedia+1.224
Neg logit contributions
yip-2.315
ibaba-2.154
ific-2.153
wrapper-2.116
ignt-1.861
Token 213
Token ID28658
Feature activation-4.94e-01

Pos logit contributions
 ILCS+0.169
ARB+0.163
RAG+0.156
ESH+0.154
 Powers+0.150
Neg logit contributions
yip-0.245
ibaba-0.227
ific-0.225
wrapper-0.222
iang-0.193
Token50
Token ID1120
Feature activation+1.29e-01

Pos logit contributions
 ILCS+1.054
ARB+1.013
RAG+0.955
ESH+0.947
 Powers+0.916
Neg logit contributions
yip-1.952
ibaba-1.821
ific-1.787
wrapper-1.768
ignt-1.578
Token42
Token ID3682
Feature activation-2.92e-01

Pos logit contributions
yip+0.513
ibaba+0.480
ific+0.476
wrapper+0.469
ignt+0.417
Neg logit contributions
 ILCS-0.271
ARB-0.258
RAG-0.244
ESH-0.241
 Powers-0.234
Token Ontario
Token ID10553
Feature activation-1.24e+00

Pos logit contributions
 ILCS+0.510
ARB+0.485
RAG+0.451
ESH+0.442
 Powers+0.431
Neg logit contributions
yip-1.258
ibaba-1.180
ific-1.173
wrapper-1.159
ignt-1.038
Token Inc
Token ID3457
Feature activation-4.85e-01

Pos logit contributions
 ILCS+2.327
ARB+2.203
RAG+2.070
 Powers+2.038
ESH+2.028
Neg logit contributions
yip-5.083
ibaba-4.755
ific-4.726
wrapper-4.701
ciplinary-4.191
Token.
Token ID13
Feature activation-3.91e-01

Pos logit contributions
 ILCS+1.122
ARB+1.075
RAG+1.024
ESH+1.013
Wikipedia+0.983
Neg logit contributions
yip-1.808
ific-1.684
ibaba-1.683
wrapper-1.653
ignt-1.453
Token and
Token ID290
Feature activation-4.69e-01

Pos logit contributions
 ILCS+0.933
ARB+0.904
RAG+0.861
ESH+0.847
ONS+0.831
Neg logit contributions
yip-1.421
ibaba-1.315
ific-1.312
wrapper-1.297
iang-1.129
Token 213
Token ID28658
Feature activation-5.71e-01

Pos logit contributions
 ILCS+1.160
ARB+1.120
RAG+1.066
ESH+1.052
 Powers+1.045
Neg logit contributions
yip-1.675
ibaba-1.562
ific-1.528
wrapper-1.509
ignt-1.319
Token50
Token ID1120
Feature activation-1.89e-01

Pos logit contributions
 ILCS+1.216
ARB+1.170
RAG+1.103
ESH+1.099
 Powers+1.063
Neg logit contributions
yip-2.261
ibaba-2.114
ific-2.059
wrapper-2.033
ciplinary-1.828
Tokens
Token ID82
Feature activation+1.41e-01

Pos logit contributions
 ILCS+0.102
ARB+0.099
RAG+0.094
ESH+0.093
 Powers+0.091
Neg logit contributions
yip-0.155
ibaba-0.144
ific-0.143
wrapper-0.141
ignt-0.123
Token no
Token ID645
Feature activation-1.81e-01

Pos logit contributions
yip+0.500
ibaba+0.463
ific+0.462
wrapper+0.454
ignt+0.395
Neg logit contributions
 ILCS-0.351
ARB-0.338
RAG-0.323
ESH-0.320
ONS-0.311
Token limit
Token ID4179
Feature activation-9.74e-02

Pos logit contributions
 ILCS+0.494
ARB+0.478
RAG+0.459
ESH+0.454
 Powers+0.445
Neg logit contributions
yip-0.600
ibaba-0.551
ific-0.547
wrapper-0.538
ignt-0.462
Token.
Token ID13
Feature activation-4.13e-01

Pos logit contributions
 ILCS+0.232
ARB+0.223
RAG+0.211
ESH+0.210
ONS+0.205
Neg logit contributions
yip-0.357
ibaba-0.332
ific-0.328
wrapper-0.324
ignt-0.284
Token When
Token ID1649
Feature activation+1.61e-02

Pos logit contributions
 ILCS+0.969
ARB+0.925
RAG+0.886
ESH+0.879
 Powers+0.868
Neg logit contributions
yip-1.538
ibaba-1.422
ific-1.405
wrapper-1.395
ciplinary-1.228
Token there
Token ID612
Feature activation-1.31e+00

Pos logit contributions
yip+0.058
ibaba+0.054
ific+0.054
wrapper+0.053
ignt+0.046
Neg logit contributions
 ILCS-0.039
ARB-0.037
RAG-0.036
ESH-0.035
 Powers-0.034
Token
Token ID447
Feature activation-2.67e-01

Pos logit contributions
 ILCS+3.016
ARB+2.993
RAG+2.796
ESH+2.794
 Powers+2.752
Neg logit contributions
yip-4.671
ibaba-4.288
ific-4.287
wrapper-4.236
ignt-3.717
Token
Token ID247
Feature activation-1.76e-01

Pos logit contributions
 ILCS+0.609
ARB+0.584
RAG+0.560
ESH+0.554
ONS+0.538
Neg logit contributions
yip-1.005
ibaba-0.930
ific-0.930
wrapper-0.911
uador-0.796
Tokens
Token ID82
Feature activation+4.48e-02

Pos logit contributions
 ILCS+0.463
ARB+0.447
RAG+0.429
ESH+0.426
Wikipedia+0.417
Neg logit contributions
yip-0.600
ibaba-0.550
ific-0.549
wrapper-0.540
ignt-0.467
Token people
Token ID661
Feature activation-2.99e-01

Pos logit contributions
yip+0.156
ibaba+0.144
ific+0.144
wrapper+0.141
ignt+0.123
Neg logit contributions
 ILCS-0.114
ARB-0.110
RAG-0.106
ESH-0.104
ONS-0.102
Token motivated
Token ID13338
Feature activation-8.06e-01

Pos logit contributions
 ILCS+0.717
ARB+0.684
RAG+0.656
ESH+0.649
 Powers+0.635
Neg logit contributions
yip-1.086
ibaba-1.004
ific-0.998
wrapper-0.987
ignt-0.862
Token5
Token ID20
Feature activation+1.55e-01

Pos logit contributions
yip+0.053
ibaba+0.049
ific+0.048
wrapper+0.048
ignt+0.042
Neg logit contributions
 ILCS-0.030
ARB-0.029
RAG-0.027
ESH-0.027
ONS-0.026
Token.
Token ID13
Feature activation+1.31e-01

Pos logit contributions
yip+0.592
ific+0.553
ibaba+0.550
wrapper+0.544
iang+0.479
Neg logit contributions
 ILCS-0.339
ARB-0.327
RAG-0.312
ESH-0.306
Wikipedia-0.300
Token25
Token ID1495
Feature activation-2.86e-02

Pos logit contributions
yip+0.509
ibaba+0.474
ific+0.470
wrapper+0.463
ignt+0.411
Neg logit contributions
 ILCS-0.283
ARB-0.272
RAG-0.258
ESH-0.256
ONS-0.248
Token million
Token ID1510
Feature activation-4.48e-01

Pos logit contributions
 ILCS+0.058
ARB+0.055
RAG+0.052
ESH+0.051
Wikipedia+0.050
Neg logit contributions
yip-0.115
ibaba-0.107
ific-0.107
wrapper-0.105
iang-0.094
Token (
Token ID357
Feature activation-1.19e+00

Pos logit contributions
 ILCS+0.986
ARB+0.950
RAG+0.897
ESH+0.891
ONS+0.863
Neg logit contributions
yip-1.717
ibaba-1.599
ific-1.586
wrapper-1.570
ignt-1.385
Token21
Token ID2481
Feature activation-1.33e+00

Pos logit contributions
ARB+2.598
 ILCS+2.586
RAG+2.452
ESH+2.438
Wikipedia+2.391
Neg logit contributions
yip-4.449
ibaba-4.174
ific-4.092
wrapper-3.963
ignt-3.569
Tokenst
Token ID301
Feature activation-3.60e-01

Pos logit contributions
 ILCS+2.462
ARB+2.367
ESH+2.192
RAG+2.180
ONS+2.116
Neg logit contributions
yip-5.549
ibaba-5.134
ific-5.132
wrapper-5.102
ignt-4.536
Token among
Token ID1871
Feature activation-6.98e-01

Pos logit contributions
 ILCS+0.924
ARB+0.887
RAG+0.846
ESH+0.839
ONS+0.818
Neg logit contributions
yip-1.253
ibaba-1.155
ific-1.151
wrapper-1.138
ignt-0.984
Token guards
Token ID10942
Feature activation-1.35e-01

Pos logit contributions
 ILCS+1.515
ARB+1.453
ESH+1.378
RAG+1.365
 Powers+1.365
Neg logit contributions
yip-2.717
ibaba-2.498
wrapper-2.457
ific-2.453
ciplinary-2.169
Token)
Token ID8
Feature activation+4.16e-01

Pos logit contributions
 ILCS+0.282
ARB+0.270
RAG+0.255
ESH+0.252
Wikipedia+0.244
Neg logit contributions
yip-0.533
ific-0.499
ibaba-0.498
wrapper-0.490
ignt-0.434
Token
Token ID198
Feature activation+2.90e-01

Pos logit contributions
yip+1.553
ific+1.478
ibaba+1.473
wrapper+1.434
ciplinary+1.262
Neg logit contributions
 ILCS-0.928
ARB-0.872
RAG-0.844
ESH-0.831
Wikipedia-0.811
Token make
Token ID787
Feature activation+1.84e-01

Pos logit contributions
yip+2.837
ibaba+2.610
ific+2.605
wrapper+2.558
ignt+2.196
Neg logit contributions
 ILCS-2.371
ARB-2.292
RAG-2.199
ESH-2.180
ONS-2.127
Token the
Token ID262
Feature activation+3.53e-01

Pos logit contributions
yip+0.667
ibaba+0.617
ific+0.612
wrapper+0.604
ignt+0.528
Neg logit contributions
 ILCS-0.444
ARB-0.430
RAG-0.409
ESH-0.406
 Powers-0.395
Token math
Token ID10688
Feature activation+1.42e-01

Pos logit contributions
yip+1.246
ibaba+1.155
ific+1.142
wrapper+1.124
ignt+0.984
Neg logit contributions
 ILCS-0.887
ARB-0.856
RAG-0.815
ESH-0.809
 Powers-0.791
Token simpler
Token ID18599
Feature activation+6.43e-02

Pos logit contributions
yip+0.497
ibaba+0.459
ific+0.450
wrapper+0.447
ignt+0.389
Neg logit contributions
 ILCS-0.361
ARB-0.351
RAG-0.336
ESH-0.334
 Avery-0.325
Token,
Token ID11
Feature activation-4.07e-01

Pos logit contributions
yip+0.260
ibaba+0.243
ific+0.242
wrapper+0.239
ignt+0.213
Neg logit contributions
 ILCS-0.128
ARB-0.122
RAG-0.115
ESH-0.114
ONS-0.110
Token let
Token ID1309
Feature activation-1.38e+00

Pos logit contributions
 ILCS+0.911
ARB+0.880
RAG+0.829
ESH+0.822
 Powers+0.814
Neg logit contributions
yip-1.546
ibaba-1.432
ific-1.422
wrapper-1.401
ignt-1.241
Token's
Token ID338
Feature activation-6.96e-01

Pos logit contributions
ARB+3.026
 ILCS+3.014
RAG+2.815
 Powers+2.782
ESH+2.777
Neg logit contributions
yip-5.148
ibaba-4.746
ific-4.740
wrapper-4.652
ignt-4.161
Token say
Token ID910
Feature activation-1.07e-01

Pos logit contributions
 ILCS+1.818
ARB+1.765
 Powers+1.673
RAG+1.662
ESH+1.662
Neg logit contributions
yip-2.356
ibaba-2.176
ific-2.131
wrapper-2.098
ciplinary-1.815
Token it
Token ID340
Feature activation+1.19e-01

Pos logit contributions
 ILCS+0.249
ARB+0.242
RAG+0.229
ESH+0.227
 Powers+0.223
Neg logit contributions
yip-0.394
ibaba-0.365
ific-0.363
wrapper-0.357
ciplinary-0.313
Token's
Token ID338
Feature activation+2.40e-01

Pos logit contributions
yip+0.429
ibaba+0.397
ific+0.392
wrapper+0.386
ignt+0.338
Neg logit contributions
 ILCS-0.291
ARB-0.282
RAG-0.268
ESH-0.267
 Powers-0.260
Token a
Token ID257
Feature activation+4.71e-01

Pos logit contributions
yip+0.837
ibaba+0.774
ific+0.761
wrapper+0.750
ignt+0.653
Neg logit contributions
 ILCS-0.612
ARB-0.597
RAG-0.567
ESH-0.564
 Powers-0.555