TOP ACTIVATIONS
MAX = 2.238

unto thy hands , thou shalt sm ite every male therein
all the spoil thereof , shalt thou take unto thy self
A : Fort y years ago , 40 percent
how to remove the center console and pin back the troublesome
made captive ." 1 The Prophet said to him : "
pur ak ushi Profile Joined August 2012 United States 3
increases combined with big new tax cuts without vastly increasing
limit . E ighty professionals , including four who
www . cd c . gov / in jury / w
we are here to take care of one another .
in favor of banning marriage equality . No heterosexual 's right
president must take care that the laws be faithfully
be looking to g ank immedi at ly after 6 .
dots on the upper left front corner of the new polymer
but by now , you either buy into these rumors with
du makes sure he takes care of the defensive side of
. <|endoftext|> Abstract Background Depression presents a significant burden
a bit easier . Take care with focusing , though :
www . cd c . gov / home andre cre ational
ial Name (' system . windows . forms ') | out

MIN ACTIVATIONS
MIN = -2.263

buy you ? While AT & T 's powerhouse FISA lobbyist
Press is reporting that AT & T , the defendant in
. For 2007 , AT & T spent over $ 16
chief campaign strategist , AT & T now has former United
. Because of the T & A , this title is
Russia . Image copyright AFP Image caption The Kurds
two hits and 12 ( yes , TW EL VE )
a licensing division of AT & T / Western Electric ,
this into perspective , AT & T 's spending for three
went missing , and a cro c is a prime suspect
CD - ROM Rating : E SR B :
, a former executive at Bear St ear ns , a
historian of culture , he summed up Z t oh oven
S . soldier sue Omar Kh adr for nearly $ 50
Foundation President Donna Shal ala summed up the complicated unt angling
, who had started at Goldman eight years before Cohn ,
happens in Greater Cincinnati . Please try again soon , or
ranking every Halo map -- yes , all 110 of them
There could be more , yes ." No announcement has been
cost property taxpayers paying M ow att Trust for their loss

INTERVAL 1.113 - 2.238
CONTAINS 0.339%

fried - rice dish with esc olar and Chinese sausage that
times . The drummer looked like Tim Y oh annon with
2010 United States 21 50 Posts # 16 Glad to hear
ager _ bi ography . html Chuck
paid and what you are paying for at the supermarket ,"

INTERVAL -0.012 - 1.113
CONTAINS 52.241%

variety of health and recovery purposes . At most , it
must be tough for the little guy ( which , come
? and how does it look from the perspective
NC ) people , such as gender flu id and gender
aro ( 117 ) ( 25 ) 21 .

INTERVAL -1.138 - -0.012
CONTAINS 47.097%

, even Iran . Our job on the international cinema stage
says in the ad . After we completed the
of government , a senior clergy man has warned .
.... I have never said that in my life to her
I gave up I probably would 've been the manager at

INTERVAL -2.263 - -1.138
CONTAINS 0.323%

map in time for Kat ow ice ? We will know
and back ." Let ting teenagers sleep later typically
"… Sure please do ." She
cons cript ed into forced labour in Germany and few wanted
age depicting the principal M upp ets being reunited ( Row
Token unto
Token ID12722
Feature activation+7.40e-01

Pos logit contributions
ignty+0.808
acted+0.796
lette+0.782
hedral+0.781
 Okinawa+0.747
Neg logit contributions
 synd-1.012
dylib-0.941
ulner-0.930
ater-0.927
aters-0.921
Token thy
Token ID11906
Feature activation-3.32e-01

Pos logit contributions
ignty+0.941
acted+0.922
lette+0.902
hedral+0.900
 Okinawa+0.846
Neg logit contributions
 synd-1.832
dylib-1.722
ulner-1.704
ater-1.702
aters-1.692
Token hands
Token ID2832
Feature activation+3.53e-01

Pos logit contributions
 synd+0.633
dylib+0.579
ulner+0.575
ater+0.574
aters+0.570
Neg logit contributions
ignty-0.612
acted-0.604
hedral-0.596
lette-0.593
 Okinawa-0.571
Token,
Token ID11
Feature activation+4.06e-01

Pos logit contributions
ignty+0.623
acted+0.615
lette+0.605
hedral+0.605
 Okinawa+0.579
Neg logit contributions
 synd-0.699
dylib-0.647
ulner-0.640
ater-0.638
aters-0.633
Token thou
Token ID14210
Feature activation+8.43e-02

Pos logit contributions
ignty+0.592
acted+0.581
lette+0.570
hedral+0.570
 Okinawa+0.540
Neg logit contributions
 synd-0.931
dylib-0.871
ulner-0.862
ater-0.860
aters-0.855
Token shalt
Token ID36258
Feature activation+2.24e+00

Pos logit contributions
ignty+0.088
acted+0.086
hedral+0.084
lette+0.083
 Okinawa+0.078
Neg logit contributions
 synd-0.228
dylib-0.215
ulner-0.213
ater-0.213
aters-0.212
Token sm
Token ID895
Feature activation+3.23e-01

Pos logit contributions
ignty+3.509
acted+3.485
 thou+3.485
lette+3.469
 wedge+3.447
Neg logit contributions
 synd-4.540
dylib-4.352
culosis-4.329
ulner-4.302
ater-4.253
Tokenite
Token ID578
Feature activation-1.29e-01

Pos logit contributions
ignty+0.448
acted+0.440
lette+0.430
hedral+0.430
 Okinawa+0.408
Neg logit contributions
 synd-0.760
dylib-0.714
ulner-0.707
ater-0.705
aters-0.701
Token every
Token ID790
Feature activation+2.20e-01

Pos logit contributions
 synd+0.286
dylib+0.264
ater+0.264
ulner+0.263
aters+0.262
Neg logit contributions
ignty-0.199
acted-0.196
hedral-0.192
lette-0.191
 Okinawa-0.181
Token male
Token ID4257
Feature activation+4.02e-01

Pos logit contributions
ignty+0.375
acted+0.369
lette+0.363
hedral+0.362
 wedge+0.347
Neg logit contributions
 synd-0.449
dylib-0.418
ulner-0.413
ater-0.411
aters-0.409
Token therein
Token ID27258
Feature activation+5.10e-01

Pos logit contributions
ignty+0.699
acted+0.688
lette+0.681
hedral+0.671
 wedge+0.649
Neg logit contributions
 synd-0.803
dylib-0.751
ulner-0.739
ater-0.732
aters-0.731
Token all
Token ID477
Feature activation+3.32e-01

Pos logit contributions
ignty+0.411
acted+0.405
lette+0.397
hedral+0.395
 Okinawa+0.378
Neg logit contributions
 synd-0.589
dylib-0.551
ulner-0.545
ater-0.542
aters-0.539
Token the
Token ID262
Feature activation+2.11e-01

Pos logit contributions
ignty+0.589
acted+0.581
lette+0.570
hedral+0.568
 Okinawa+0.547
Neg logit contributions
 synd-0.652
dylib-0.605
ulner-0.597
ater-0.594
aters-0.589
Token spoil
Token ID20851
Feature activation+1.39e-01

Pos logit contributions
ignty+0.392
acted+0.387
lette+0.381
hedral+0.379
 wedge+0.366
Neg logit contributions
 synd-0.398
dylib-0.368
ulner-0.363
ater-0.361
aters-0.358
Token thereof
Token ID15370
Feature activation+6.79e-01

Pos logit contributions
ignty+0.216
acted+0.212
hedral+0.211
lette+0.209
 Okinawa+0.201
Neg logit contributions
 synd-0.305
dylib-0.284
ater-0.282
ulner-0.280
aters-0.279
Token,
Token ID11
Feature activation+5.81e-01

Pos logit contributions
ignty+1.119
acted+1.103
lette+1.083
hedral+1.080
 Okinawa+1.033
Neg logit contributions
 synd-1.422
dylib-1.324
ulner-1.308
ater-1.303
aters-1.295
Token shalt
Token ID36258
Feature activation+2.21e+00

Pos logit contributions
ignty+0.827
acted+0.813
lette+0.796
hedral+0.794
 Okinawa+0.755
Neg logit contributions
 synd-1.347
dylib-1.264
ulner-1.251
ater-1.246
aters-1.239
Token thou
Token ID14210
Feature activation+2.06e-01

Pos logit contributions
ignty+3.106
 thou+3.075
acted+3.040
lette+3.005
hedral+2.937
Neg logit contributions
 synd-4.972
dylib-4.727
ulner-4.691
culosis-4.682
ater-4.654
Token take
Token ID1011
Feature activation+4.75e-01

Pos logit contributions
ignty+0.358
acted+0.352
hedral+0.346
lette+0.346
 Okinawa+0.332
Neg logit contributions
 synd-0.416
dylib-0.386
ulner-0.381
ater-0.380
aters-0.377
Token unto
Token ID12722
Feature activation+1.50e+00

Pos logit contributions
ignty+0.839
acted+0.827
lette+0.814
hedral+0.813
 Okinawa+0.780
Neg logit contributions
 synd-0.938
dylib-0.869
ulner-0.858
ater-0.855
aters-0.849
Token thy
Token ID11906
Feature activation+3.60e-01

Pos logit contributions
ignty+1.035
acted+0.994
lette+0.954
hedral+0.952
 wedge+0.850
Neg logit contributions
 synd-4.534
dylib-4.332
ulner-4.288
ater-4.270
culosis-4.259
Tokenself
Token ID944
Feature activation+1.54e+00

Pos logit contributions
ignty+0.605
acted+0.595
lette+0.588
hedral+0.585
 wedge+0.562
Neg logit contributions
 synd-0.741
dylib-0.692
ulner-0.682
ater-0.679
aters-0.674
Token
Token ID198
Feature activation+2.30e-01

Pos logit contributions
 synd+0.808
dylib+0.754
ulner+0.745
ater+0.744
aters+0.739
Neg logit contributions
ignty-0.616
acted-0.603
hedral-0.594
lette-0.593
 wedge-0.565
Token
Token ID198
Feature activation+1.64e-01

Pos logit contributions
ignty+0.338
acted+0.336
lette+0.329
hedral+0.327
 Okinawa+0.312
Neg logit contributions
 synd-0.519
dylib-0.484
ulner-0.482
ater-0.478
culosis-0.476
TokenA
Token ID32
Feature activation-3.06e-01

Pos logit contributions
ignty+0.350
acted+0.346
lette+0.341
hedral+0.341
 Okinawa+0.330
Neg logit contributions
 synd-0.264
dylib-0.241
ulner-0.237
ater-0.236
aters-0.234
Token:
Token ID25
Feature activation+3.85e-02

Pos logit contributions
 synd+0.680
dylib+0.637
ater+0.628
ulner+0.627
aters+0.623
Neg logit contributions
ignty-0.467
acted-0.456
hedral-0.448
lette-0.448
 Okinawa-0.427
TokenFort
Token ID21926
Feature activation+9.38e-01

Pos logit contributions
ignty+0.074
acted+0.073
lette+0.072
hedral+0.072
 Okinawa+0.069
Neg logit contributions
 synd-0.070
dylib-0.064
ulner-0.064
ater-0.063
aters-0.063
Tokeny
Token ID88
Feature activation+2.14e+00

Pos logit contributions
ignty+1.554
acted+1.537
lette+1.514
hedral+1.488
 Okinawa+1.440
Neg logit contributions
 synd-1.944
dylib-1.818
ulner-1.796
ater-1.782
aters-1.776
Token years
Token ID812
Feature activation+6.17e-01

Pos logit contributions
acted+3.419
lette+3.410
ignty+3.392
hedral+3.307
 thou+3.253
Neg logit contributions
 synd-4.294
dylib-4.159
ulner-4.107
culosis-4.061
ater-3.972
Token ago
Token ID2084
Feature activation-1.31e-01

Pos logit contributions
ignty+0.804
acted+0.789
lette+0.769
hedral+0.768
 Okinawa+0.726
Neg logit contributions
 synd-1.513
dylib-1.422
ulner-1.405
ater-1.403
aters-1.394
Token,
Token ID11
Feature activation-2.46e-02

Pos logit contributions
 synd+0.257
dylib+0.238
ulner+0.235
ater+0.234
aters+0.233
Neg logit contributions
ignty-0.235
acted-0.231
lette-0.228
hedral-0.227
 Okinawa-0.218
Token 40
Token ID2319
Feature activation-2.36e-01

Pos logit contributions
 synd+0.051
dylib+0.047
ulner+0.047
ater+0.047
aters+0.046
Neg logit contributions
ignty-0.041
acted-0.041
lette-0.040
hedral-0.040
 Okinawa-0.038
Token percent
Token ID1411
Feature activation-2.63e-01

Pos logit contributions
 synd+0.487
dylib+0.453
ulner+0.448
ater+0.446
aters+0.443
Neg logit contributions
ignty-0.396
acted-0.391
lette-0.384
hedral-0.384
 Okinawa-0.367
Token how
Token ID703
Feature activation-4.31e-02

Pos logit contributions
 synd+0.308
dylib+0.283
ater+0.279
ulner+0.279
aters+0.277
Neg logit contributions
ignty-0.310
acted-0.305
lette-0.300
hedral-0.300
 Okinawa-0.288
Token to
Token ID284
Feature activation-6.04e-02

Pos logit contributions
 synd+0.088
dylib+0.082
ater+0.081
ulner+0.081
aters+0.080
Neg logit contributions
ignty-0.074
acted-0.072
hedral-0.071
lette-0.071
 Okinawa-0.068
Token remove
Token ID4781
Feature activation+6.07e-01

Pos logit contributions
 synd+0.124
dylib+0.116
ulner+0.114
ater+0.114
aters+0.113
Neg logit contributions
ignty-0.101
acted-0.100
lette-0.099
hedral-0.098
 wedge-0.095
Token the
Token ID262
Feature activation+2.30e-01

Pos logit contributions
ignty+0.906
acted+0.895
lette+0.876
hedral+0.874
 wedge+0.838
Neg logit contributions
 synd-1.357
dylib-1.274
ulner-1.258
culosis-1.248
ater-1.247
Token center
Token ID3641
Feature activation+8.31e-01

Pos logit contributions
ignty+0.456
acted+0.451
lette+0.445
hedral+0.444
 Okinawa+0.428
Neg logit contributions
 synd-0.404
dylib-0.372
ulner-0.366
ater-0.364
aters-0.362
Token console
Token ID8624
Feature activation+2.06e+00

Pos logit contributions
ignty+1.444
acted+1.426
lette+1.410
hedral+1.402
 wedge+1.354
Neg logit contributions
 synd-1.657
dylib-1.542
ulner-1.522
ater-1.509
aters-1.507
Token and
Token ID290
Feature activation+2.60e-01

Pos logit contributions
ignty+3.371
acted+3.333
lette+3.320
hedral+3.260
 wedge+3.248
Neg logit contributions
 synd-4.197
dylib-3.957
culosis-3.914
ulner-3.887
aters-3.848
Token pin
Token ID6757
Feature activation+6.66e-03

Pos logit contributions
ignty+0.449
acted+0.442
lette+0.436
hedral+0.435
 wedge+0.417
Neg logit contributions
 synd-0.524
dylib-0.487
ulner-0.482
ater-0.478
aters-0.476
Token back
Token ID736
Feature activation+2.34e-01

Pos logit contributions
ignty+0.012
acted+0.011
lette+0.011
hedral+0.011
 Okinawa+0.011
Neg logit contributions
 synd-0.013
dylib-0.012
ulner-0.012
ater-0.012
aters-0.012
Token the
Token ID262
Feature activation+1.16e-01

Pos logit contributions
ignty+0.391
acted+0.385
lette+0.379
hedral+0.378
 Okinawa+0.362
Neg logit contributions
 synd-0.484
dylib-0.451
ulner-0.446
ater-0.443
aters-0.441
Token troublesome
Token ID35778
Feature activation+7.94e-01

Pos logit contributions
ignty+0.211
acted+0.208
lette+0.205
hedral+0.205
 wedge+0.197
Neg logit contributions
 synd-0.221
dylib-0.205
ulner-0.202
ater-0.201
aters-0.200
Token made
Token ID925
Feature activation+4.53e-01

Pos logit contributions
ignty+1.052
acted+1.038
lette+1.020
hedral+1.014
 wedge+0.972
Neg logit contributions
 synd-1.390
dylib-1.302
ulner-1.287
ater-1.278
culosis-1.271
Token captive
Token ID25798
Feature activation-2.04e-01

Pos logit contributions
ignty+0.771
acted+0.763
lette+0.751
hedral+0.744
 wedge+0.719
Neg logit contributions
 synd-0.918
dylib-0.856
ulner-0.846
ater-0.841
culosis-0.837
Token."
Token ID526
Feature activation+3.93e-01

Pos logit contributions
 synd+0.422
dylib+0.390
ater+0.386
ulner+0.385
aters+0.383
Neg logit contributions
ignty-0.344
acted-0.337
hedral-0.335
lette-0.332
 Okinawa-0.319
Token1
Token ID16
Feature activation+4.11e-01

Pos logit contributions
ignty+0.604
acted+0.592
lette+0.582
hedral+0.581
 Okinawa+0.552
Neg logit contributions
 synd-0.868
dylib-0.811
ulner-0.801
ater-0.799
aters-0.794
Token The
Token ID383
Feature activation+7.46e-01

Pos logit contributions
ignty+0.634
acted+0.625
lette+0.613
hedral+0.611
 Okinawa+0.584
Neg logit contributions
 synd-0.902
dylib-0.842
ulner-0.833
ater-0.830
aters-0.825
Token Prophet
Token ID13583
Feature activation+2.05e+00

Pos logit contributions
ignty+1.252
acted+1.239
lette+1.213
hedral+1.213
 Okinawa+1.164
Neg logit contributions
 synd-1.529
dylib-1.429
ulner-1.413
ater-1.396
culosis-1.389
Token said
Token ID531
Feature activation+1.26e+00

Pos logit contributions
ignty+2.822
acted+2.815
hedral+2.730
lette+2.726
 Okinawa+2.598
Neg logit contributions
 synd-4.698
dylib-4.485
ulner-4.430
culosis-4.368
ater-4.347
Token to
Token ID284
Feature activation+4.85e-01

Pos logit contributions
ignty+2.107
acted+2.082
lette+2.037
hedral+2.030
 Okinawa+1.948
Neg logit contributions
 synd-2.598
dylib-2.432
ulner-2.400
ater-2.379
aters-2.363
Token him
Token ID683
Feature activation-1.15e-01

Pos logit contributions
ignty+0.737
acted+0.720
lette+0.708
hedral+0.705
 Okinawa+0.669
Neg logit contributions
 synd-1.089
dylib-1.012
ater-1.002
ulner-1.000
aters-0.997
Token:
Token ID25
Feature activation+8.16e-01

Pos logit contributions
 synd+0.224
dylib+0.205
ater+0.204
ulner+0.204
aters+0.203
Neg logit contributions
ignty-0.207
acted-0.203
hedral-0.203
lette-0.202
 wedge-0.193
Token "
Token ID366
Feature activation-1.82e-02

Pos logit contributions
ignty+1.225
acted+1.208
lette+1.183
hedral+1.180
 Okinawa+1.126
Neg logit contributions
 synd-1.824
dylib-1.706
ulner-1.687
ater-1.681
aters-1.671
Token
Token ID198
Feature activation-3.80e-02

Pos logit contributions
ignty+0.124
acted+0.122
lette+0.119
hedral+0.119
 Okinawa+0.113
Neg logit contributions
 synd-0.185
dylib-0.173
ulner-0.172
ater-0.170
aters-0.170
Tokenpur
Token ID14225
Feature activation-1.82e-02

Pos logit contributions
 synd+0.075
dylib+0.069
ater+0.068
ulner+0.068
aters+0.068
Neg logit contributions
ignty-0.068
acted-0.066
hedral-0.065
lette-0.065
 Okinawa-0.063
Tokenak
Token ID461
Feature activation-7.08e-02

Pos logit contributions
 synd+0.038
dylib+0.035
ulner+0.034
ater+0.034
aters+0.034
Neg logit contributions
ignty-0.031
acted-0.030
hedral-0.030
lette-0.030
 Okinawa-0.028
Tokenushi
Token ID17731
Feature activation+4.59e-01

Pos logit contributions
 synd+0.151
dylib+0.141
ater+0.139
ulner+0.139
aters+0.138
Neg logit contributions
ignty-0.114
acted-0.112
hedral-0.110
lette-0.110
 Okinawa-0.105
Token Profile
Token ID13118
Feature activation+5.91e-01

Pos logit contributions
ignty+0.766
acted+0.757
lette+0.745
hedral+0.743
 Okinawa+0.712
Neg logit contributions
 synd-0.948
dylib-0.886
ulner-0.875
ater-0.867
aters-0.863
Token Joined
Token ID16709
Feature activation+2.04e+00

Pos logit contributions
ignty+0.480
acted+0.463
hedral+0.446
lette+0.445
 Okinawa+0.402
Neg logit contributions
 synd-1.736
dylib-1.648
ulner-1.633
ater-1.631
aters-1.624
Token August
Token ID2932
Feature activation+9.24e-01

Pos logit contributions
acted+2.763
ignty+2.761
lette+2.746
 Okinawa+2.713
hedral+2.654
Neg logit contributions
 synd-4.562
ulner-4.420
culosis-4.411
dylib-4.396
ater-4.298
Token 2012
Token ID2321
Feature activation+2.23e-01

Pos logit contributions
ignty+1.000
acted+0.985
lette+0.959
hedral+0.945
 Okinawa+0.901
Neg logit contributions
 synd-2.445
dylib-2.313
ulner-2.303
ater-2.281
culosis-2.276
Token United
Token ID1578
Feature activation+4.45e-01

Pos logit contributions
ignty+0.431
acted+0.426
lette+0.419
hedral+0.419
 Okinawa+0.403
Neg logit contributions
 synd-0.405
dylib-0.373
ulner-0.368
ater-0.366
aters-0.363
Token States
Token ID1829
Feature activation+4.93e-01

Pos logit contributions
ignty+1.035
acted+1.029
lette+1.011
hedral+1.009
 Okinawa+0.985
Neg logit contributions
 synd-0.624
dylib-0.566
ulner-0.558
ater-0.543
aters-0.539
Token 3
Token ID513
Feature activation-3.13e-01

Pos logit contributions
ignty+0.718
acted+0.711
lette+0.698
hedral+0.690
 Okinawa+0.672
Neg logit contributions
 synd-1.113
dylib-1.044
ulner-1.039
ater-1.028
culosis-1.025
Token increases
Token ID5732
Feature activation+4.52e-01

Pos logit contributions
ignty+0.356
acted+0.352
lette+0.348
hedral+0.348
 Okinawa+0.337
Neg logit contributions
 synd-0.214
dylib-0.191
ulner-0.188
ater-0.187
aters-0.185
Token combined
Token ID5929
Feature activation-7.56e-01

Pos logit contributions
ignty+0.786
acted+0.777
lette+0.762
hedral+0.761
 Okinawa+0.731
Neg logit contributions
 synd-0.903
dylib-0.839
ulner-0.830
ater-0.825
aters-0.820
Token with
Token ID351
Feature activation-2.87e-02

Pos logit contributions
 synd+1.335
dylib+1.217
ater+1.198
ulner+1.197
aters+1.183
Neg logit contributions
ignty-1.504
acted-1.476
lette-1.455
hedral-1.455
 Okinawa-1.400
Token big
Token ID1263
Feature activation-5.03e-01

Pos logit contributions
 synd+0.059
dylib+0.054
ulner+0.054
ater+0.054
aters+0.053
Neg logit contributions
ignty-0.049
acted-0.048
lette-0.047
hedral-0.047
 Okinawa-0.045
Token new
Token ID649
Feature activation+4.29e-01

Pos logit contributions
 synd+0.895
dylib+0.817
ulner+0.808
ater+0.806
aters+0.799
Neg logit contributions
ignty-0.993
acted-0.979
lette-0.966
hedral-0.965
 Okinawa-0.928
Token tax
Token ID1687
Feature activation+2.02e+00

Pos logit contributions
ignty+0.810
acted+0.796
lette+0.791
hedral+0.787
 wedge+0.771
Neg logit contributions
 synd-0.780
dylib-0.733
ulner-0.722
ater-0.711
culosis-0.706
Token cuts
Token ID6630
Feature activation+1.47e+00

Pos logit contributions
acted+4.316
ignty+4.284
 wedge+4.260
hedral+4.251
front+4.232
Neg logit contributions
 synd-2.931
dylib-2.838
ulner-2.785
ater-2.724
aters-2.664
Token
Token ID960
Feature activation+2.38e-01

Pos logit contributions
acted+2.427
ignty+2.420
lette+2.364
hedral+2.362
 wedge+2.271
Neg logit contributions
 synd-2.991
ulner-2.797
dylib-2.796
culosis-2.751
ater-2.749
Tokenwithout
Token ID19419
Feature activation-3.51e-01

Pos logit contributions
ignty+0.331
acted+0.325
lette+0.318
hedral+0.318
 Okinawa+0.301
Neg logit contributions
 synd-0.560
dylib-0.526
ulner-0.520
ater-0.519
aters-0.516
Token vastly
Token ID22056
Feature activation+4.80e-01

Pos logit contributions
 synd+0.746
dylib+0.681
ater+0.676
ulner+0.674
aters+0.671
Neg logit contributions
ignty-0.579
acted-0.566
hedral-0.557
lette-0.557
 Okinawa-0.527
Token increasing
Token ID3649
Feature activation+5.39e-01

Pos logit contributions
ignty+0.839
acted+0.829
lette+0.813
hedral+0.812
 Okinawa+0.780
Neg logit contributions
 synd-0.951
dylib-0.887
ulner-0.877
ater-0.869
aters-0.864
Token limit
Token ID4179
Feature activation+5.22e-01

Pos logit contributions
ignty+0.066
acted+0.064
hedral+0.063
lette+0.062
 Okinawa+0.061
Neg logit contributions
 synd-0.083
dylib-0.077
ater-0.076
ulner-0.075
aters-0.075
Token.
Token ID13
Feature activation-9.94e-03

Pos logit contributions
ignty+0.809
acted+0.795
hedral+0.780
lette+0.780
 Okinawa+0.743
Neg logit contributions
 synd-1.146
dylib-1.069
ulner-1.056
ater-1.054
aters-1.047
Token
Token ID198
Feature activation-1.75e-01

Pos logit contributions
 synd+0.022
dylib+0.020
ulner+0.020
ater+0.020
aters+0.020
Neg logit contributions
ignty-0.016
acted-0.015
lette-0.015
hedral-0.015
 Okinawa-0.014
Token
Token ID198
Feature activation-7.33e-02

Pos logit contributions
 synd+0.385
dylib+0.360
ulner+0.355
ater+0.355
aters+0.352
Neg logit contributions
ignty-0.270
acted-0.265
hedral-0.260
lette-0.260
 Okinawa-0.248
TokenE
Token ID36
Feature activation+6.87e-02

Pos logit contributions
 synd+0.141
dylib+0.130
ulner+0.129
ater+0.128
aters+0.127
Neg logit contributions
ignty-0.134
acted-0.132
hedral-0.129
lette-0.129
 Okinawa-0.124
Tokenighty
Token ID14400
Feature activation+1.99e+00

Pos logit contributions
ignty+0.127
acted+0.125
lette+0.123
hedral+0.123
 Okinawa+0.118
Neg logit contributions
 synd-0.131
dylib-0.121
ulner-0.119
ater-0.119
aters-0.118
Token professionals
Token ID11153
Feature activation+4.37e-01

Pos logit contributions
acted+2.625
lette+2.619
ignty+2.615
hedral+2.529
 wedge+2.481
Neg logit contributions
 synd-4.578
dylib-4.405
ulner-4.344
ater-4.280
culosis-4.275
Token,
Token ID11
Feature activation-1.22e-01

Pos logit contributions
ignty+0.730
acted+0.720
lette+0.708
hedral+0.706
 Okinawa+0.676
Neg logit contributions
 synd-0.904
dylib-0.843
ulner-0.833
ater-0.829
aters-0.824
Token including
Token ID1390
Feature activation-2.08e-01

Pos logit contributions
 synd+0.248
dylib+0.229
ulner+0.226
ater+0.226
aters+0.224
Neg logit contributions
ignty-0.211
acted-0.207
hedral-0.203
lette-0.203
 Okinawa-0.194
Token four
Token ID1440
Feature activation-2.27e-01

Pos logit contributions
 synd+0.430
dylib+0.400
ulner+0.395
ater+0.394
aters+0.391
Neg logit contributions
ignty-0.347
acted-0.342
lette-0.336
hedral-0.335
 Okinawa-0.321
Token who
Token ID508
Feature activation-3.90e-02

Pos logit contributions
 synd+0.431
dylib+0.402
ulner+0.398
ater+0.394
aters+0.391
Neg logit contributions
ignty-0.414
acted-0.411
lette-0.406
hedral-0.405
 Okinawa-0.390
Tokenwww
Token ID2503
Feature activation+1.04e+00

Pos logit contributions
ignty+0.367
acted+0.362
lette+0.355
hedral+0.354
 Okinawa+0.337
Neg logit contributions
 synd-0.475
dylib-0.442
ulner-0.435
ater-0.434
aters-0.431
Token.
Token ID13
Feature activation+5.79e-01

Pos logit contributions
acted+1.411
ignty+1.398
lette+1.376
hedral+1.354
front+1.300
Neg logit contributions
 synd-2.482
ulner-2.318
dylib-2.306
culosis-2.304
ater-2.283
Tokencd
Token ID10210
Feature activation+1.70e-01

Pos logit contributions
ignty+0.652
acted+0.641
lette+0.620
hedral+0.619
front+0.576
Neg logit contributions
 synd-1.518
dylib-1.434
ulner-1.415
ater-1.410
aters-1.406
Tokenc
Token ID66
Feature activation+1.05e+00

Pos logit contributions
ignty+0.225
acted+0.225
hedral+0.219
lette+0.218
front+0.205
Neg logit contributions
 synd-0.410
dylib-0.385
ulner-0.382
ater-0.378
aters-0.377
Token.
Token ID13
Feature activation+5.24e-01

Pos logit contributions
ignty+1.420
acted+1.410
lette+1.381
hedral+1.377
front+1.301
Neg logit contributions
 synd-2.500
dylib-2.351
ulner-2.319
culosis-2.297
ater-2.285
Tokengov
Token ID9567
Feature activation+1.98e+00

Pos logit contributions
ignty+0.613
acted+0.602
lette+0.587
hedral+0.585
 Okinawa+0.546
Neg logit contributions
 synd-1.348
dylib-1.272
ulner-1.259
ater-1.256
aters-1.250
Token/
Token ID14
Feature activation+5.73e-01

Pos logit contributions
acted+3.141
ignty+3.078
lette+3.061
hedral+3.007
front+2.946
Neg logit contributions
 synd-4.130
dylib-3.854
ulner-3.815
culosis-3.776
ater-3.730
Tokenin
Token ID259
Feature activation+5.30e-01

Pos logit contributions
ignty+0.914
acted+0.910
lette+0.883
hedral+0.874
front+0.846
Neg logit contributions
 synd-1.228
dylib-1.143
ater-1.117
aters-1.112
ulner-1.111
Tokenjury
Token ID21871
Feature activation+6.09e-02

Pos logit contributions
acted+1.074
ignty+1.071
hedral+1.047
lette+1.045
front+1.020
Neg logit contributions
 synd-0.909
dylib-0.825
ulner-0.805
ater-0.794
culosis-0.794
Token/
Token ID14
Feature activation+2.82e-01

Pos logit contributions
ignty+0.106
acted+0.104
lette+0.103
hedral+0.102
 Okinawa+0.098
Neg logit contributions
 synd-0.122
dylib-0.113
ulner-0.112
ater-0.111
aters-0.111
Tokenw
Token ID86
Feature activation+5.72e-01

Pos logit contributions
ignty+0.452
acted+0.445
lette+0.435
hedral+0.432
 Okinawa+0.415
Neg logit contributions
 synd-0.601
dylib-0.561
ater-0.550
ulner-0.550
aters-0.548
Token we
Token ID356
Feature activation+1.23e-02

Pos logit contributions
ignty+0.595
acted+0.586
lette+0.579
hedral+0.574
 Okinawa+0.553
Neg logit contributions
 synd-0.713
dylib-0.668
ulner-0.659
ater-0.654
aters-0.650
Token are
Token ID389
Feature activation+1.13e-01

Pos logit contributions
ignty+0.019
acted+0.019
lette+0.018
hedral+0.018
 Okinawa+0.018
Neg logit contributions
 synd-0.027
dylib-0.025
ulner-0.025
ater-0.025
aters-0.025
Token here
Token ID994
Feature activation-2.83e-01

Pos logit contributions
ignty+0.178
acted+0.177
hedral+0.173
lette+0.173
 Okinawa+0.165
Neg logit contributions
 synd-0.241
dylib-0.228
ulner-0.226
culosis-0.223
ater-0.223
Token to
Token ID284
Feature activation+2.39e-01

Pos logit contributions
 synd+0.555
dylib+0.513
ulner+0.506
ater+0.506
aters+0.502
Neg logit contributions
ignty-0.506
acted-0.498
hedral-0.490
lette-0.490
 Okinawa-0.471
Token take
Token ID1011
Feature activation-2.98e-02

Pos logit contributions
ignty+0.420
acted+0.414
lette+0.409
hedral+0.407
 wedge+0.395
Neg logit contributions
 synd-0.471
dylib-0.439
ulner-0.434
ater-0.431
culosis-0.428
Token care
Token ID1337
Feature activation+1.96e+00

Pos logit contributions
 synd+0.058
dylib+0.054
ulner+0.053
ater+0.053
aters+0.052
Neg logit contributions
ignty-0.054
acted-0.053
lette-0.052
hedral-0.052
 Okinawa-0.050
Token of
Token ID286
Feature activation+2.32e-01

Pos logit contributions
ignty+2.805
acted+2.744
lette+2.720
hedral+2.643
 Okinawa+2.539
Neg logit contributions
 synd-4.479
dylib-4.175
ulner-4.137
ater-4.106
culosis-4.092
Token one
Token ID530
Feature activation+1.62e-01

Pos logit contributions
ignty+0.394
acted+0.388
lette+0.381
hedral+0.381
 Okinawa+0.365
Neg logit contributions
 synd-0.474
dylib-0.440
ulner-0.435
ater-0.433
aters-0.430
Token another
Token ID1194
Feature activation-1.19e-01

Pos logit contributions
ignty+0.183
acted+0.179
lette+0.174
hedral+0.174
 Okinawa+0.163
Neg logit contributions
 synd-0.424
dylib-0.401
ulner-0.397
ater-0.396
aters-0.394
Token.
Token ID13
Feature activation+1.98e-01

Pos logit contributions
 synd+0.275
dylib+0.258
ulner+0.255
ater+0.254
aters+0.252
Neg logit contributions
ignty-0.172
acted-0.169
lette-0.166
hedral-0.165
 Okinawa-0.157
Token
Token ID447
Feature activation+1.85e-01

Pos logit contributions
ignty+0.311
acted+0.306
lette+0.300
hedral+0.300
 Okinawa+0.286
Neg logit contributions
 synd-0.431
dylib-0.403
ulner-0.398
ater-0.397
aters-0.394
Token in
Token ID287
Feature activation-6.61e-02

Pos logit contributions
 synd+1.090
dylib+1.007
ater+0.998
ulner+0.989
aters+0.988
Neg logit contributions
ignty-0.973
acted-0.957
hedral-0.948
lette-0.940
 Okinawa-0.908
Token favor
Token ID2661
Feature activation-7.88e-01

Pos logit contributions
 synd+0.144
dylib+0.133
ater+0.132
ulner+0.131
aters+0.131
Neg logit contributions
ignty-0.106
acted-0.103
hedral-0.101
lette-0.101
 Okinawa-0.096
Token of
Token ID286
Feature activation-1.19e-01

Pos logit contributions
 synd+1.636
dylib+1.505
ater+1.475
ulner+1.462
aters+1.461
Neg logit contributions
ignty-1.331
acted-1.307
hedral-1.290
lette-1.279
 Okinawa-1.217
Token banning
Token ID19479
Feature activation+1.17e+00

Pos logit contributions
 synd+0.247
dylib+0.229
ulner+0.226
ater+0.225
aters+0.224
Neg logit contributions
ignty-0.200
acted-0.197
lette-0.194
hedral-0.194
 Okinawa-0.184
Token marriage
Token ID4845
Feature activation+6.92e-01

Pos logit contributions
ignty+1.984
acted+1.955
lette+1.931
hedral+1.922
 Okinawa+1.856
Neg logit contributions
 synd-2.356
dylib-2.198
ulner-2.181
ater-2.164
aters-2.146
Token equality
Token ID10537
Feature activation+1.96e+00

Pos logit contributions
ignty+1.151
acted+1.132
lette+1.112
hedral+1.111
 wedge+1.065
Neg logit contributions
 synd-1.427
dylib-1.335
ulner-1.324
ater-1.315
aters-1.305
Token.
Token ID13
Feature activation+6.32e-02

Pos logit contributions
ignty+3.008
acted+2.949
lette+2.896
hedral+2.857
 wedge+2.764
Neg logit contributions
 synd-4.188
ulner-3.988
dylib-3.976
ater-3.899
culosis-3.893
Token No
Token ID1400
Feature activation-5.45e-01

Pos logit contributions
ignty+0.100
acted+0.098
hedral+0.097
lette+0.096
 wedge+0.091
Neg logit contributions
 synd-0.137
dylib-0.128
ater-0.126
ulner-0.126
aters-0.125
Token heterosexual
Token ID24026
Feature activation-9.64e-02

Pos logit contributions
 synd+1.122
dylib+1.031
ater+1.027
ulner+1.015
aters+1.014
Neg logit contributions
ignty-0.932
acted-0.914
hedral-0.899
lette-0.891
 Okinawa-0.860
Token's
Token ID338
Feature activation+8.03e-01

Pos logit contributions
 synd+0.184
dylib+0.169
ulner+0.167
ater+0.167
aters+0.165
Neg logit contributions
ignty-0.178
acted-0.175
hedral-0.172
lette-0.172
 Okinawa-0.166
Token right
Token ID826
Feature activation+1.31e-01

Pos logit contributions
ignty+1.365
acted+1.348
lette+1.329
hedral+1.324
 wedge+1.276
Neg logit contributions
 synd-1.623
dylib-1.522
ulner-1.506
ater-1.494
aters-1.486
Token president
Token ID1893
Feature activation+3.37e-01

Pos logit contributions
ignty+0.538
acted+0.530
lette+0.522
hedral+0.521
 Okinawa+0.500
Neg logit contributions
 synd-0.613
dylib-0.568
ulner-0.561
ater-0.559
aters-0.555
Token
Token ID564
Feature activation+2.16e-01

Pos logit contributions
ignty+0.486
acted+0.478
lette+0.468
hedral+0.468
 Okinawa+0.444
Neg logit contributions
 synd-0.775
dylib-0.725
ulner-0.717
ater-0.716
aters-0.712
Token
Token ID250
Feature activation-6.58e-02

Pos logit contributions
ignty+0.386
acted+0.380
lette+0.376
hedral+0.375
 Okinawa+0.360
Neg logit contributions
 synd-0.422
dylib-0.391
ater-0.386
ulner-0.385
aters-0.383
Tokenmust
Token ID27238
Feature activation+6.04e-01

Pos logit contributions
 synd+0.147
dylib+0.137
ater+0.135
ulner+0.135
aters+0.134
Neg logit contributions
ignty-0.100
acted-0.098
hedral-0.096
lette-0.096
 Okinawa-0.092
Token take
Token ID1011
Feature activation-3.18e-02

Pos logit contributions
ignty+0.782
acted+0.766
lette+0.749
hedral+0.741
 Okinawa+0.704
Neg logit contributions
 synd-1.478
dylib-1.396
ulner-1.379
ater-1.369
aters-1.365
Token care
Token ID1337
Feature activation+1.95e+00

Pos logit contributions
 synd+0.062
dylib+0.057
ater+0.056
ulner+0.056
aters+0.056
Neg logit contributions
ignty-0.057
acted-0.056
hedral-0.056
lette-0.056
 Okinawa-0.053
Token that
Token ID326
Feature activation-3.30e-02

Pos logit contributions
ignty+3.003
acted+2.938
lette+2.860
hedral+2.771
 thou+2.729
Neg logit contributions
 synd-4.282
dylib-4.035
culosis-3.964
ulner-3.939
ater-3.919
Token the
Token ID262
Feature activation+1.89e-01

Pos logit contributions
 synd+0.072
dylib+0.067
ulner+0.066
ater+0.066
aters+0.066
Neg logit contributions
ignty-0.052
acted-0.051
lette-0.050
hedral-0.050
 Okinawa-0.047
Token laws
Token ID3657
Feature activation+7.73e-01

Pos logit contributions
ignty+0.358
acted+0.354
lette+0.348
hedral+0.348
 Okinawa+0.335
Neg logit contributions
 synd-0.351
dylib-0.323
ulner-0.319
ater-0.317
aters-0.315
Token be
Token ID307
Feature activation+1.12e-01

Pos logit contributions
ignty+1.195
acted+1.166
lette+1.139
hedral+1.123
 Okinawa+1.091
Neg logit contributions
 synd-1.694
dylib-1.596
ulner-1.576
culosis-1.575
ater-1.563
Token faithfully
Token ID42353
Feature activation-3.32e-01

Pos logit contributions
ignty+0.165
acted+0.163
lette+0.159
hedral+0.158
 Okinawa+0.151
Neg logit contributions
 synd-0.255
dylib-0.240
ulner-0.237
ater-0.236
culosis-0.235
Token be
Token ID307
Feature activation+2.60e-01

Pos logit contributions
ignty+0.735
acted+0.727
lette+0.717
hedral+0.715
 wedge+0.689
Neg logit contributions
 synd-0.926
dylib-0.866
ulner-0.855
ater-0.849
aters-0.843
Token looking
Token ID2045
Feature activation+4.42e-01

Pos logit contributions
ignty+0.419
acted+0.415
lette+0.408
hedral+0.407
 Okinawa+0.390
Neg logit contributions
 synd-0.549
dylib-0.514
ulner-0.508
ater-0.504
aters-0.501
Token to
Token ID284
Feature activation-1.42e-01

Pos logit contributions
ignty+0.824
acted+0.815
lette+0.803
hedral+0.801
 Okinawa+0.772
Neg logit contributions
 synd-0.827
dylib-0.766
ulner-0.754
ater-0.748
aters-0.744
Token g
Token ID308
Feature activation-2.51e-01

Pos logit contributions
 synd+0.283
dylib+0.262
ulner+0.259
ater+0.258
aters+0.256
Neg logit contributions
ignty-0.249
acted-0.245
lette-0.241
hedral-0.241
 Okinawa-0.231
Tokenank
Token ID962
Feature activation+9.95e-02

Pos logit contributions
 synd+0.559
dylib+0.519
ater+0.519
ulner+0.516
aters+0.515
Neg logit contributions
ignty-0.382
acted-0.374
hedral-0.367
lette-0.366
 Okinawa-0.352
Token immedi
Token ID2729
Feature activation+1.95e+00

Pos logit contributions
ignty+0.166
acted+0.163
lette+0.161
hedral+0.160
 Okinawa+0.153
Neg logit contributions
 synd-0.206
dylib-0.192
ulner-0.190
ater-0.189
aters-0.188
Tokenat
Token ID265
Feature activation+4.30e-01

Pos logit contributions
ignty+3.966
acted+3.961
lette+3.877
hedral+3.770
onso+3.674
Neg logit contributions
 synd-3.295
dylib-2.979
ulner-2.925
culosis-2.891
 creditor-2.786
Tokenly
Token ID306
Feature activation+5.85e-01

Pos logit contributions
ignty+0.794
acted+0.781
lette+0.772
hedral+0.768
 wedge+0.733
Neg logit contributions
 synd-0.816
dylib-0.757
ulner-0.743
ater-0.737
aters-0.733
Token after
Token ID706
Feature activation-2.98e-01

Pos logit contributions
ignty+0.963
acted+0.951
lette+0.933
hedral+0.931
 wedge+0.893
Neg logit contributions
 synd-1.216
dylib-1.137
ulner-1.126
ater-1.116
aters-1.109
Token 6
Token ID718
Feature activation-4.17e-01

Pos logit contributions
 synd+0.643
dylib+0.596
ater+0.591
ulner+0.590
aters+0.586
Neg logit contributions
ignty-0.478
acted-0.467
hedral-0.459
lette-0.458
 Okinawa-0.437
Token.
Token ID13
Feature activation-3.24e-01

Pos logit contributions
 synd+0.940
dylib+0.876
ater+0.869
ulner+0.866
aters+0.863
Neg logit contributions
ignty-0.623
acted-0.612
lette-0.599
hedral-0.598
 Okinawa-0.568
Token dots
Token ID22969
Feature activation+5.45e-01

Pos logit contributions
ignty+0.968
acted+0.954
lette+0.937
hedral+0.935
 Okinawa+0.894
Neg logit contributions
 synd-1.248
dylib-1.162
ulner-1.149
ater-1.145
aters-1.137
Token on
Token ID319
Feature activation+2.29e-01

Pos logit contributions
ignty+0.858
acted+0.847
lette+0.835
hedral+0.834
 wedge+0.797
Neg logit contributions
 synd-1.167
dylib-1.099
ulner-1.088
ater-1.076
aters-1.069
Token the
Token ID262
Feature activation+1.98e-01

Pos logit contributions
ignty+0.362
acted+0.354
lette+0.348
hedral+0.347
 Okinawa+0.330
Neg logit contributions
 synd-0.500
dylib-0.465
ulner-0.459
ater-0.459
aters-0.456
Token upper
Token ID6727
Feature activation+2.79e-01

Pos logit contributions
ignty+0.343
acted+0.338
lette+0.332
hedral+0.332
 Okinawa+0.318
Neg logit contributions
 synd-0.401
dylib-0.371
ulner-0.367
ater-0.366
aters-0.364
Token left
Token ID1364
Feature activation+6.02e-01

Pos logit contributions
ignty+0.478
acted+0.469
lette+0.460
hedral+0.460
 Okinawa+0.440
Neg logit contributions
 synd-0.572
dylib-0.529
ulner-0.522
ater-0.521
aters-0.518
Token front
Token ID2166
Feature activation+1.93e+00

Pos logit contributions
ignty+0.989
acted+0.974
lette+0.957
hedral+0.956
 Okinawa+0.914
Neg logit contributions
 synd-1.265
dylib-1.178
ulner-1.164
ater-1.160
aters-1.153
Token corner
Token ID5228
Feature activation+1.23e+00

Pos logit contributions
acted+3.079
ignty+3.059
lette+3.041
hedral+3.021
 wedge+2.948
Neg logit contributions
 synd-4.018
dylib-3.807
ater-3.703
ulner-3.701
culosis-3.691
Token of
Token ID286
Feature activation+3.61e-01

Pos logit contributions
ignty+1.835
acted+1.826
lette+1.795
hedral+1.794
 wedge+1.716
Neg logit contributions
 synd-2.726
dylib-2.583
ulner-2.551
ater-2.534
aters-2.514
Token the
Token ID262
Feature activation+3.01e-01

Pos logit contributions
ignty+0.564
acted+0.552
lette+0.542
hedral+0.541
 Okinawa+0.516
Neg logit contributions
 synd-0.790
dylib-0.735
ulner-0.727
ater-0.726
aters-0.722
Token new
Token ID649
Feature activation+3.44e-01

Pos logit contributions
ignty+0.518
acted+0.509
lette+0.501
hedral+0.500
 Okinawa+0.479
Neg logit contributions
 synd-0.610
dylib-0.565
ulner-0.558
ater-0.557
aters-0.554
Token polymer
Token ID30528
Feature activation+7.39e-01

Pos logit contributions
ignty+0.587
acted+0.579
lette+0.570
hedral+0.569
 Okinawa+0.545
Neg logit contributions
 synd-0.701
dylib-0.651
ulner-0.644
ater-0.641
aters-0.637
Token but
Token ID475
Feature activation-4.11e-02

Pos logit contributions
ignty+0.123
acted+0.121
lette+0.119
hedral+0.119
 Okinawa+0.114
Neg logit contributions
 synd-0.153
dylib-0.142
ulner-0.140
ater-0.140
aters-0.139
Token by
Token ID416
Feature activation+6.11e-02

Pos logit contributions
 synd+0.084
dylib+0.078
ulner+0.077
ater+0.077
aters+0.076
Neg logit contributions
ignty-0.070
acted-0.069
lette-0.068
hedral-0.068
 Okinawa-0.065
Token now
Token ID783
Feature activation-4.75e-01

Pos logit contributions
ignty+0.098
acted+0.096
lette+0.094
hedral+0.094
 Okinawa+0.090
Neg logit contributions
 synd-0.132
dylib-0.122
ater-0.121
ulner-0.121
aters-0.120
Token,
Token ID11
Feature activation-5.09e-02

Pos logit contributions
 synd+0.981
dylib+0.900
ater+0.894
ulner+0.889
aters+0.888
Neg logit contributions
ignty-0.807
acted-0.792
hedral-0.780
lette-0.775
 Okinawa-0.742
Token you
Token ID345
Feature activation-1.07e-01

Pos logit contributions
 synd+0.107
dylib+0.100
ulner+0.099
ater+0.098
aters+0.098
Neg logit contributions
ignty-0.083
acted-0.082
lette-0.081
hedral-0.081
 Okinawa-0.077
Token either
Token ID2035
Feature activation+1.91e+00

Pos logit contributions
 synd+0.251
dylib+0.235
ater+0.233
ulner+0.232
aters+0.231
Neg logit contributions
ignty-0.152
acted-0.149
hedral-0.146
lette-0.146
 Okinawa-0.138
Token buy
Token ID2822
Feature activation-1.37e-02

Pos logit contributions
acted+3.036
ignty+3.027
lette+3.001
hedral+2.935
 thou+2.839
Neg logit contributions
 synd-3.898
dylib-3.756
ulner-3.703
culosis-3.569
ater-3.559
Token into
Token ID656
Feature activation-4.41e-01

Pos logit contributions
 synd+0.030
dylib+0.028
ulner+0.028
ater+0.028
aters+0.028
Neg logit contributions
ignty-0.021
acted-0.021
lette-0.021
hedral-0.021
 Okinawa-0.020
Token these
Token ID777
Feature activation-1.41e-01

Pos logit contributions
 synd+0.943
dylib+0.875
ulner+0.865
ater+0.865
aters+0.860
Neg logit contributions
ignty-0.712
acted-0.696
lette-0.687
hedral-0.685
 Okinawa-0.653
Token rumors
Token ID14923
Feature activation+2.43e-01

Pos logit contributions
 synd+0.275
dylib+0.254
ulner+0.251
ater+0.250
aters+0.248
Neg logit contributions
ignty-0.253
acted-0.249
lette-0.245
hedral-0.245
 Okinawa-0.234
Token with
Token ID351
Feature activation-1.78e-01

Pos logit contributions
ignty+0.413
acted+0.408
lette+0.401
hedral+0.400
 Okinawa+0.384
Neg logit contributions
 synd-0.493
dylib-0.459
ulner-0.454
ater-0.451
aters-0.448
Tokendu
Token ID646
Feature activation+9.05e-02

Pos logit contributions
ignty+0.231
acted+0.226
hedral+0.219
lette+0.217
 Okinawa+0.208
Neg logit contributions
 synd-0.536
dylib-0.509
ater-0.503
ulner-0.501
aters-0.500
Token makes
Token ID1838
Feature activation-6.82e-02

Pos logit contributions
ignty+0.138
acted+0.135
hedral+0.133
lette+0.132
 Okinawa+0.126
Neg logit contributions
 synd-0.202
dylib-0.188
ulner-0.186
ater-0.186
aters-0.185
Token sure
Token ID1654
Feature activation+1.27e-01

Pos logit contributions
 synd+0.142
dylib+0.131
ater+0.130
ulner+0.130
aters+0.128
Neg logit contributions
ignty-0.115
acted-0.112
lette-0.110
hedral-0.110
 Okinawa-0.105
Token he
Token ID339
Feature activation-1.11e-01

Pos logit contributions
ignty+0.205
acted+0.202
lette+0.199
hedral+0.198
 Okinawa+0.190
Neg logit contributions
 synd-0.270
dylib-0.252
ulner-0.249
ater-0.248
aters-0.247
Token takes
Token ID2753
Feature activation-4.69e-01

Pos logit contributions
 synd+0.236
dylib+0.220
ulner+0.217
ater+0.216
aters+0.215
Neg logit contributions
ignty-0.178
acted-0.175
lette-0.172
hedral-0.172
 Okinawa-0.164
Token care
Token ID1337
Feature activation+1.90e+00

Pos logit contributions
 synd+0.970
dylib+0.893
ater+0.886
ulner+0.885
aters+0.879
Neg logit contributions
ignty-0.794
acted-0.778
hedral-0.763
lette-0.763
 Okinawa-0.726
Token of
Token ID286
Feature activation+1.30e-01

Pos logit contributions
ignty+2.729
acted+2.670
lette+2.663
hedral+2.595
 Okinawa+2.490
Neg logit contributions
 synd-4.330
dylib-4.055
ater-3.986
culosis-3.981
ulner-3.973
Token the
Token ID262
Feature activation-6.17e-02

Pos logit contributions
ignty+0.216
acted+0.212
lette+0.209
hedral+0.209
 Okinawa+0.200
Neg logit contributions
 synd-0.272
dylib-0.253
ulner-0.250
ater-0.249
aters-0.247
Token defensive
Token ID6110
Feature activation-1.21e-01

Pos logit contributions
 synd+0.121
dylib+0.112
ulner+0.111
ater+0.110
aters+0.109
Neg logit contributions
ignty-0.110
acted-0.108
lette-0.107
hedral-0.106
 Okinawa-0.102
Token side
Token ID1735
Feature activation+6.43e-03

Pos logit contributions
 synd+0.220
dylib+0.202
ulner+0.200
ater+0.199
aters+0.197
Neg logit contributions
ignty-0.233
acted-0.230
lette-0.226
hedral-0.226
 Okinawa-0.218
Token of
Token ID286
Feature activation-1.33e-01

Pos logit contributions
ignty+0.011
acted+0.011
lette+0.010
hedral+0.010
 Okinawa+0.010
Neg logit contributions
 synd-0.013
dylib-0.012
ulner-0.012
ater-0.012
aters-0.012
Token.
Token ID13
Feature activation-1.64e-02

Pos logit contributions
 synd+0.531
dylib+0.494
ater+0.490
ulner+0.487
aters+0.486
Neg logit contributions
ignty-0.399
acted-0.392
hedral-0.385
lette-0.382
 Okinawa-0.369
Token<|endoftext|>
Token ID50256
Feature activation-4.95e-01

Pos logit contributions
 synd+0.036
dylib+0.034
ater+0.033
ulner+0.033
aters+0.033
Neg logit contributions
ignty-0.025
acted-0.025
hedral-0.024
lette-0.024
 wedge-0.023
TokenAbstract
Token ID23839
Feature activation+1.19e+00

Pos logit contributions
 synd+0.937
dylib+0.875
ater+0.860
ulner+0.858
aters+0.851
Neg logit contributions
ignty-0.914
acted-0.893
hedral-0.882
lette-0.880
 Okinawa-0.853
Token
Token ID198
Feature activation+7.54e-01

Pos logit contributions
ignty+1.807
acted+1.778
lette+1.748
hedral+1.729
 Okinawa+1.691
Neg logit contributions
 synd-2.617
dylib-2.467
ulner-2.432
ater-2.411
aters-2.408
Token
Token ID198
Feature activation+6.72e-01

Pos logit contributions
acted+1.043
ignty+1.033
lette+1.018
hedral+1.002
front+0.969
Neg logit contributions
 synd-1.756
ulner-1.654
culosis-1.644
dylib-1.634
ater-1.622
TokenBackground
Token ID21756
Feature activation+1.87e+00

Pos logit contributions
ignty+1.268
acted+1.263
lette+1.243
hedral+1.232
front+1.191
Neg logit contributions
 synd-1.241
dylib-1.136
ulner-1.125
ater-1.114
aters-1.112
Token Depression
Token ID22483
Feature activation+1.09e+00

Pos logit contributions
ignty+2.909
acted+2.899
lette+2.891
 Okinawa+2.837
hedral+2.817
Neg logit contributions
 synd-3.871
dylib-3.674
ulner-3.651
aters-3.586
culosis-3.578
Token presents
Token ID10969
Feature activation+6.21e-01

Pos logit contributions
ignty+1.637
acted+1.620
lette+1.600
hedral+1.586
 wedge+1.526
Neg logit contributions
 synd-2.325
dylib-2.241
ulner-2.228
ater-2.201
aters-2.196
Token a
Token ID257
Feature activation+1.33e-01

Pos logit contributions
ignty+0.969
acted+0.956
hedral+0.944
lette+0.939
 Okinawa+0.901
Neg logit contributions
 synd-1.322
dylib-1.262
ulner-1.238
ater-1.236
culosis-1.232
Token significant
Token ID2383
Feature activation+4.42e-01

Pos logit contributions
ignty+0.227
acted+0.222
lette+0.218
hedral+0.218
 Okinawa+0.209
Neg logit contributions
 synd-0.274
dylib-0.253
ulner-0.250
ater-0.249
aters-0.248
Token burden
Token ID10538
Feature activation+2.43e-01

Pos logit contributions
ignty+0.948
acted+0.946
hedral+0.933
lette+0.932
 wedge+0.908
Neg logit contributions
 synd-0.683
dylib-0.635
ulner-0.622
ater-0.616
aters-0.613
Token a
Token ID257
Feature activation+1.10e-01

Pos logit contributions
 synd+1.353
dylib+1.246
ater+1.235
ulner+1.227
aters+1.221
Neg logit contributions
ignty-0.932
acted-0.913
lette-0.896
hedral-0.890
 Okinawa-0.844
Token bit
Token ID1643
Feature activation+6.45e-01

Pos logit contributions
ignty+0.191
acted+0.188
lette+0.185
hedral+0.185
 Okinawa+0.177
Neg logit contributions
 synd-0.220
dylib-0.203
ater-0.201
ulner-0.200
aters-0.199
Token easier
Token ID4577
Feature activation-7.62e-03

Pos logit contributions
ignty+1.153
acted+1.139
lette+1.122
hedral+1.121
 Okinawa+1.078
Neg logit contributions
 synd-1.251
dylib-1.161
ulner-1.149
ater-1.136
aters-1.132
Token.
Token ID13
Feature activation+2.01e-01

Pos logit contributions
 synd+0.016
dylib+0.015
ulner+0.014
ater+0.014
aters+0.014
Neg logit contributions
ignty-0.013
acted-0.013
lette-0.012
hedral-0.012
 Okinawa-0.012
Token Take
Token ID7214
Feature activation-6.52e-01

Pos logit contributions
ignty+0.312
acted+0.307
lette+0.301
hedral+0.301
 Okinawa+0.287
Neg logit contributions
 synd-0.441
dylib-0.412
ulner-0.408
ater-0.406
aters-0.404
Token care
Token ID1337
Feature activation+1.86e+00

Pos logit contributions
 synd+1.422
dylib+1.319
ater+1.314
ulner+1.309
aters+1.302
Neg logit contributions
ignty-1.020
acted-1.000
hedral-0.985
lette-0.979
 Okinawa-0.933
Token with
Token ID351
Feature activation+8.70e-02

Pos logit contributions
ignty+3.068
lette+3.009
acted+3.002
hedral+2.927
 thou+2.865
Neg logit contributions
 synd-3.802
dylib-3.562
ulner-3.508
culosis-3.497
ater-3.463
Token focusing
Token ID10759
Feature activation-4.26e-01

Pos logit contributions
ignty+0.143
acted+0.141
lette+0.139
hedral+0.138
 Okinawa+0.132
Neg logit contributions
 synd-0.182
dylib-0.170
ulner-0.168
ater-0.167
aters-0.166
Token,
Token ID11
Feature activation-4.13e-02

Pos logit contributions
 synd+0.850
dylib+0.780
ulner+0.772
ater+0.769
aters+0.764
Neg logit contributions
ignty-0.746
acted-0.742
hedral-0.723
lette-0.723
 Okinawa-0.692
Token though
Token ID996
Feature activation+4.06e-01

Pos logit contributions
 synd+0.080
dylib+0.073
ulner+0.072
ater+0.072
aters+0.072
Neg logit contributions
ignty-0.075
acted-0.075
lette-0.073
hedral-0.073
 Okinawa-0.070
Token:
Token ID25
Feature activation+1.76e-03

Pos logit contributions
ignty+0.656
acted+0.645
lette+0.634
hedral+0.633
 Okinawa+0.605
Neg logit contributions
 synd-0.865
dylib-0.806
ulner-0.797
ater-0.794
aters-0.789
Tokenwww
Token ID2503
Feature activation+1.00e+00

Pos logit contributions
ignty+0.278
acted+0.274
lette+0.269
hedral+0.268
 Okinawa+0.256
Neg logit contributions
 synd-0.356
dylib-0.332
ulner-0.328
ater-0.327
aters-0.325
Token.
Token ID13
Feature activation+5.86e-01

Pos logit contributions
acted+1.357
ignty+1.347
lette+1.326
hedral+1.305
front+1.252
Neg logit contributions
 synd-2.379
ulner-2.224
dylib-2.212
culosis-2.211
ater-2.190
Tokencd
Token ID10210
Feature activation+1.93e-01

Pos logit contributions
ignty+0.650
acted+0.638
lette+0.618
hedral+0.617
front+0.572
Neg logit contributions
 synd-1.545
dylib-1.461
ulner-1.443
ater-1.439
aters-1.433
Tokenc
Token ID66
Feature activation+1.05e+00

Pos logit contributions
ignty+0.263
acted+0.261
hedral+0.254
lette+0.254
front+0.238
Neg logit contributions
 synd-0.461
dylib-0.432
ulner-0.429
ater-0.425
aters-0.423
Token.
Token ID13
Feature activation+5.27e-01

Pos logit contributions
acted+1.415
ignty+1.413
lette+1.388
hedral+1.375
front+1.307
Neg logit contributions
 synd-2.481
ulner-2.319
dylib-2.317
culosis-2.302
ater-2.279
Tokengov
Token ID9567
Feature activation+1.86e+00

Pos logit contributions
ignty+0.593
acted+0.586
lette+0.569
hedral+0.564
front+0.527
Neg logit contributions
 synd-1.382
dylib-1.301
ulner-1.289
ater-1.279
aters-1.273
Token/
Token ID14
Feature activation+5.40e-01

Pos logit contributions
acted+2.959
lette+2.868
ignty+2.834
hedral+2.802
front+2.787
Neg logit contributions
 synd-3.889
culosis-3.650
ulner-3.644
dylib-3.558
 corrid-3.526
Tokenhome
Token ID11195
Feature activation+1.03e+00

Pos logit contributions
ignty+0.817
acted+0.814
lette+0.788
hedral+0.782
front+0.757
Neg logit contributions
 synd-1.202
dylib-1.123
ater-1.099
ulner-1.098
aters-1.094
Tokenandre
Token ID49078
Feature activation+1.09e-01

Pos logit contributions
ignty+1.812
acted+1.801
hedral+1.760
lette+1.757
front+1.705
Neg logit contributions
 synd-2.043
dylib-1.894
ulner-1.857
ater-1.852
aters-1.841
Tokencre
Token ID7513
Feature activation+1.31e-01

Pos logit contributions
ignty+0.187
acted+0.184
lette+0.181
hedral+0.180
 Okinawa+0.173
Neg logit contributions
 synd-0.221
dylib-0.205
ulner-0.202
ater-0.201
aters-0.200
Tokenational
Token ID864
Feature activation+6.00e-01

Pos logit contributions
ignty+0.222
acted+0.219
lette+0.216
hedral+0.215
 Okinawa+0.204
Neg logit contributions
 synd-0.269
dylib-0.250
ulner-0.246
ater-0.244
aters-0.243
Tokenial
Token ID498
Feature activation+2.71e-01

Pos logit contributions
 synd+0.425
dylib+0.393
ater+0.387
ulner+0.387
aters+0.383
Neg logit contributions
ignty-0.437
acted-0.430
lette-0.424
hedral-0.423
 Okinawa-0.409
TokenName
Token ID5376
Feature activation-9.34e-02

Pos logit contributions
ignty+0.452
acted+0.446
lette+0.438
hedral+0.437
 Okinawa+0.417
Neg logit contributions
 synd-0.562
dylib-0.519
ulner-0.516
ater-0.512
aters-0.510
Token('
Token ID10786
Feature activation+2.94e-01

Pos logit contributions
 synd+0.218
dylib+0.205
ater+0.203
ulner+0.203
aters+0.201
Neg logit contributions
ignty-0.132
acted-0.129
hedral-0.126
lette-0.126
 Okinawa-0.120
Tokensystem
Token ID10057
Feature activation+3.13e-01

Pos logit contributions
ignty+0.508
acted+0.501
lette+0.492
hedral+0.492
 Okinawa+0.470
Neg logit contributions
 synd-0.594
dylib-0.549
ulner-0.544
ater-0.541
aters-0.538
Token.
Token ID13
Feature activation+7.17e-02

Pos logit contributions
ignty+0.456
acted+0.449
lette+0.440
hedral+0.439
 Okinawa+0.416
Neg logit contributions
 synd-0.717
dylib-0.670
ulner-0.664
ater-0.661
aters-0.658
Tokenwindows
Token ID28457
Feature activation+1.84e+00

Pos logit contributions
ignty+0.115
acted+0.113
lette+0.111
hedral+0.111
 Okinawa+0.106
Neg logit contributions
 synd-0.153
dylib-0.144
ulner-0.141
ater-0.141
aters-0.140
Token.
Token ID13
Feature activation+2.81e-01

Pos logit contributions
ignty+2.550
acted+2.504
hedral+2.455
lette+2.448
front+2.357
Neg logit contributions
 synd-4.310
dylib-4.000
culosis-3.999
ulner-3.969
aters-3.944
Tokenforms
Token ID23914
Feature activation+5.51e-01

Pos logit contributions
ignty+0.415
acted+0.407
lette+0.399
hedral+0.399
 Okinawa+0.379
Neg logit contributions
 synd-0.638
dylib-0.598
ulner-0.591
ater-0.589
aters-0.586
Token')
Token ID11537
Feature activation+8.13e-01

Pos logit contributions
ignty+0.782
acted+0.767
lette+0.753
hedral+0.749
 Okinawa+0.709
Neg logit contributions
 synd-1.284
dylib-1.202
ulner-1.191
ater-1.183
aters-1.178
Token |
Token ID930
Feature activation-5.37e-02

Pos logit contributions
ignty+1.072
acted+1.060
lette+1.035
hedral+1.026
 Okinawa+0.976
Neg logit contributions
 synd-1.953
dylib-1.840
ulner-1.823
ater-1.815
aters-1.810
Token out
Token ID503
Feature activation+8.69e-02

Pos logit contributions
 synd+0.078
dylib+0.070
ulner+0.068
ater+0.068
aters+0.067
Neg logit contributions
ignty-0.124
acted-0.122
lette-0.121
hedral-0.121
 Okinawa-0.117
Token buy
Token ID2822
Feature activation-7.12e-02

Pos logit contributions
ignty+0.458
acted+0.451
lette+0.443
hedral+0.443
 Okinawa+0.424
Neg logit contributions
 synd-0.564
dylib-0.524
ulner-0.517
ater-0.516
aters-0.513
Token you
Token ID345
Feature activation+1.18e-01

Pos logit contributions
 synd+0.161
dylib+0.150
ulner+0.148
ater+0.148
aters+0.147
Neg logit contributions
ignty-0.106
acted-0.104
lette-0.102
hedral-0.102
 Okinawa-0.097
Token?
Token ID30
Feature activation-3.66e-01

Pos logit contributions
ignty+0.168
acted+0.165
lette+0.161
hedral+0.161
 Okinawa+0.153
Neg logit contributions
 synd-0.275
dylib-0.258
ulner-0.255
ater-0.254
aters-0.253
Token While
Token ID2893
Feature activation+1.17e-01

Pos logit contributions
 synd+0.756
dylib+0.703
ater+0.694
ulner+0.691
aters+0.689
Neg logit contributions
ignty-0.621
acted-0.605
hedral-0.601
lette-0.596
 wedge-0.567
Token AT
Token ID5161
Feature activation+2.40e-03

Pos logit contributions
ignty+0.218
acted+0.215
lette+0.212
hedral+0.212
 Okinawa+0.203
Neg logit contributions
 synd-0.219
dylib-0.202
ulner-0.199
ater-0.199
aters-0.197
Token&
Token ID5
Feature activation-2.26e+00

Pos logit contributions
acted+0.003
ignty+0.003
lette+0.003
hedral+0.003
front+0.003
Neg logit contributions
 synd-0.006
dylib-0.006
ulner-0.006
culosis-0.005
ater-0.005
TokenT
Token ID51
Feature activation-1.40e+00

Pos logit contributions
 synd+4.167
dylib+3.892
ater+3.872
aters+3.806
itis+3.742
Neg logit contributions
ignty-4.306
hedral-4.116
acted-4.077
lette-4.038
 wedge-3.901
Token's
Token ID338
Feature activation+5.03e-01

Pos logit contributions
 synd+2.928
ater+2.668
aters+2.658
dylib+2.641
ulner+2.619
Neg logit contributions
ignty-2.316
acted-2.245
hedral-2.242
lette-2.208
 Okinawa-2.104
Token powerhouse
Token ID34886
Feature activation+1.74e-01

Pos logit contributions
ignty+0.919
acted+0.910
lette+0.894
hedral+0.890
 Okinawa+0.863
Neg logit contributions
 synd-0.953
dylib-0.890
ulner-0.875
ater-0.869
aters-0.862
Token FISA
Token ID44073
Feature activation-4.52e-02

Pos logit contributions
ignty+0.292
acted+0.289
lette+0.284
hedral+0.283
 Okinawa+0.273
Neg logit contributions
 synd-0.356
dylib-0.334
ulner-0.329
ater-0.327
aters-0.325
Token lobbyist
Token ID35196
Feature activation-3.03e-02

Pos logit contributions
 synd+0.090
dylib+0.083
ulner+0.082
ater+0.082
aters+0.081
Neg logit contributions
ignty-0.079
acted-0.078
lette-0.077
hedral-0.077
 Okinawa-0.074
Token Press
Token ID4332
Feature activation-1.11e-02

Pos logit contributions
 synd+0.782
dylib+0.738
ulner+0.733
ater+0.728
culosis+0.724
Neg logit contributions
ignty-0.394
acted-0.391
lette-0.381
hedral-0.380
 Okinawa-0.362
Token is
Token ID318
Feature activation+1.59e-01

Pos logit contributions
 synd+0.027
dylib+0.025
ulner+0.025
ater+0.025
aters+0.025
Neg logit contributions
ignty-0.015
acted-0.014
lette-0.014
hedral-0.014
 Okinawa-0.013
Token reporting
Token ID6447
Feature activation+1.04e+00

Pos logit contributions
ignty+0.251
acted+0.249
lette+0.247
hedral+0.245
 Okinawa+0.240
Neg logit contributions
 synd-0.332
dylib-0.319
ulner-0.311
culosis-0.311
ater-0.309
Token that
Token ID326
Feature activation+2.24e-01

Pos logit contributions
ignty+1.827
acted+1.815
lette+1.784
hedral+1.773
 Okinawa+1.735
Neg logit contributions
 synd-2.019
dylib-1.886
ulner-1.871
ater-1.860
aters-1.837
Token AT
Token ID5161
Feature activation+2.22e-01

Pos logit contributions
ignty+0.370
acted+0.365
lette+0.359
hedral+0.358
 Okinawa+0.343
Neg logit contributions
 synd-0.466
dylib-0.435
ulner-0.430
ater-0.428
aters-0.425
Token&
Token ID5
Feature activation-2.20e+00

Pos logit contributions
acted+0.284
ignty+0.283
lette+0.277
hedral+0.275
 Okinawa+0.262
Neg logit contributions
 synd-0.539
dylib-0.512
ulner-0.503
ater-0.499
culosis-0.497
TokenT
Token ID51
Feature activation-1.13e+00

Pos logit contributions
 synd+3.993
dylib+3.732
ater+3.710
aters+3.648
 iT+3.626
Neg logit contributions
ignty-4.270
hedral-4.101
acted-4.004
lette-3.947
 wedge-3.854
Token,
Token ID11
Feature activation+1.34e-02

Pos logit contributions
 synd+2.469
dylib+2.256
ater+2.256
ulner+2.249
aters+2.246
Neg logit contributions
ignty-1.793
acted-1.725
hedral-1.724
lette-1.694
 Okinawa-1.609
Token the
Token ID262
Feature activation-4.46e-02

Pos logit contributions
ignty+0.022
acted+0.022
lette+0.021
hedral+0.021
 Okinawa+0.020
Neg logit contributions
 synd-0.028
dylib-0.026
ulner-0.026
ater-0.026
aters-0.026
Token defendant
Token ID11304
Feature activation-1.09e-01

Pos logit contributions
 synd+0.084
dylib+0.078
ulner+0.077
ater+0.076
aters+0.076
Neg logit contributions
ignty-0.082
acted-0.081
lette-0.080
hedral-0.080
 Okinawa-0.077
Token in
Token ID287
Feature activation+1.38e-02

Pos logit contributions
 synd+0.205
dylib+0.189
ulner+0.187
ater+0.186
aters+0.185
Neg logit contributions
ignty-0.202
acted-0.200
lette-0.197
hedral-0.196
 Okinawa-0.189
Token.
Token ID13
Feature activation+3.68e-02

Pos logit contributions
 synd+0.228
dylib+0.210
ulner+0.207
ater+0.207
aters+0.206
Neg logit contributions
ignty-0.196
acted-0.192
hedral-0.189
lette-0.189
 Okinawa-0.181
Token For
Token ID1114
Feature activation+1.35e-01

Pos logit contributions
ignty+0.058
acted+0.057
lette+0.056
hedral+0.056
 Okinawa+0.053
Neg logit contributions
 synd-0.080
dylib-0.075
ulner-0.074
ater-0.073
aters-0.073
Token 2007
Token ID4343
Feature activation-8.41e-02

Pos logit contributions
ignty+0.222
acted+0.218
hedral+0.214
lette+0.214
 Okinawa+0.204
Neg logit contributions
 synd-0.283
dylib-0.263
ulner-0.260
ater-0.259
aters-0.257
Token,
Token ID11
Feature activation+3.69e-03

Pos logit contributions
 synd+0.167
dylib+0.155
ulner+0.153
ater+0.153
aters+0.151
Neg logit contributions
ignty-0.148
acted-0.146
hedral-0.143
lette-0.143
 Okinawa-0.137
Token AT
Token ID5161
Feature activation+1.14e-01

Pos logit contributions
ignty+0.007
acted+0.007
lette+0.007
hedral+0.007
 Okinawa+0.007
Neg logit contributions
 synd-0.006
dylib-0.006
ulner-0.006
ater-0.006
aters-0.006
Token&
Token ID5
Feature activation-2.17e+00

Pos logit contributions
acted+0.144
ignty+0.141
lette+0.139
hedral+0.136
front+0.131
Neg logit contributions
 synd-0.283
dylib-0.265
ulner-0.264
culosis-0.264
ater-0.261
TokenT
Token ID51
Feature activation-1.38e+00

Pos logit contributions
 synd+4.027
dylib+3.766
ater+3.739
aters+3.678
itis+3.611
Neg logit contributions
ignty-4.095
hedral-3.915
acted-3.882
lette-3.846
 Okinawa-3.713
Token spent
Token ID3377
Feature activation-5.71e-01

Pos logit contributions
 synd+2.689
ater+2.426
aters+2.414
dylib+2.392
ulner+2.383
Neg logit contributions
ignty-2.532
hedral-2.459
acted-2.448
lette-2.420
 Okinawa-2.307
Token over
Token ID625
Feature activation-4.63e-02

Pos logit contributions
 synd+1.142
dylib+1.044
ater+1.036
ulner+1.034
aters+1.027
Neg logit contributions
ignty-1.008
acted-0.988
hedral-0.978
lette-0.973
 Okinawa-0.926
Token $
Token ID720
Feature activation-1.27e-02

Pos logit contributions
 synd+0.091
dylib+0.084
ater+0.083
ulner+0.083
aters+0.082
Neg logit contributions
ignty-0.082
acted-0.081
hedral-0.080
lette-0.080
 Okinawa-0.076
Token16
Token ID1433
Feature activation+4.49e-01

Pos logit contributions
 synd+0.028
dylib+0.026
ulner+0.026
ater+0.026
aters+0.026
Neg logit contributions
ignty-0.020
acted-0.019
hedral-0.019
lette-0.019
 Okinawa-0.018
Token chief
Token ID4039
Feature activation-3.88e-02

Pos logit contributions
ignty+1.642
acted+1.639
lette+1.605
hedral+1.604
 Okinawa+1.564
Neg logit contributions
 synd-1.779
dylib-1.676
ulner-1.650
culosis-1.629
ater-1.625
Token campaign
Token ID1923
Feature activation-3.63e-01

Pos logit contributions
 synd+0.080
dylib+0.074
ulner+0.073
ater+0.073
aters+0.073
Neg logit contributions
ignty-0.065
acted-0.064
lette-0.063
hedral-0.063
 Okinawa-0.060
Token strategist
Token ID25651
Feature activation-2.55e-01

Pos logit contributions
 synd+0.614
dylib+0.555
ater+0.547
ulner+0.546
aters+0.540
Neg logit contributions
ignty-0.755
acted-0.742
hedral-0.733
lette-0.731
 Okinawa-0.704
Token,
Token ID11
Feature activation+5.41e-02

Pos logit contributions
 synd+0.506
dylib+0.467
ater+0.460
ulner+0.460
aters+0.457
Neg logit contributions
ignty-0.452
acted-0.445
hedral-0.437
lette-0.436
 Okinawa-0.419
Token AT
Token ID5161
Feature activation-6.26e-02

Pos logit contributions
ignty+0.093
acted+0.091
lette+0.090
hedral+0.089
 Okinawa+0.086
Neg logit contributions
 synd-0.110
dylib-0.102
ulner-0.101
ater-0.100
aters-0.100
Token&
Token ID5
Feature activation-2.12e+00

Pos logit contributions
 synd+0.152
dylib+0.143
ulner+0.142
ater+0.141
culosis+0.141
Neg logit contributions
acted-0.081
ignty-0.081
lette-0.078
hedral-0.077
 Okinawa-0.074
TokenT
Token ID51
Feature activation-1.15e+00

Pos logit contributions
 synd+3.929
dylib+3.688
ater+3.642
aters+3.577
 iT+3.517
Neg logit contributions
ignty-4.034
hedral-3.852
acted-3.824
lette-3.782
 wedge-3.645
Token now
Token ID783
Feature activation-7.23e-01

Pos logit contributions
 synd+2.436
dylib+2.221
ater+2.221
aters+2.207
ulner+2.199
Neg logit contributions
ignty-1.886
acted-1.831
hedral-1.828
lette-1.808
 Okinawa-1.721
Token has
Token ID468
Feature activation-7.01e-02

Pos logit contributions
 synd+1.550
dylib+1.421
ater+1.412
ulner+1.405
aters+1.401
Neg logit contributions
ignty-1.172
acted-1.155
hedral-1.140
lette-1.127
 Okinawa-1.077
Token former
Token ID1966
Feature activation-1.29e-01

Pos logit contributions
 synd+0.153
dylib+0.143
ulner+0.141
ater+0.141
aters+0.140
Neg logit contributions
ignty-0.109
acted-0.108
lette-0.106
hedral-0.105
 Okinawa-0.101
Token United
Token ID1578
Feature activation-2.47e-01

Pos logit contributions
 synd+0.250
dylib+0.232
ulner+0.229
ater+0.228
aters+0.226
Neg logit contributions
ignty-0.233
acted-0.230
lette-0.227
hedral-0.226
 Okinawa-0.218
Token.
Token ID13
Feature activation+1.32e-02

Pos logit contributions
ignty+0.408
acted+0.400
lette+0.392
hedral+0.392
 Okinawa+0.377
Neg logit contributions
 synd-0.528
dylib-0.490
ater-0.485
ulner-0.484
aters-0.480
Token Because
Token ID4362
Feature activation+2.40e-01

Pos logit contributions
ignty+0.021
acted+0.021
hedral+0.020
lette+0.020
 wedge+0.019
Neg logit contributions
 synd-0.028
dylib-0.027
ater-0.026
ulner-0.026
aters-0.026
Token of
Token ID286
Feature activation-4.92e-01

Pos logit contributions
ignty+0.383
acted+0.377
lette+0.370
hedral+0.369
 Okinawa+0.353
Neg logit contributions
 synd-0.515
dylib-0.480
ulner-0.475
ater-0.473
aters-0.470
Token the
Token ID262
Feature activation-3.76e-02

Pos logit contributions
 synd+1.014
dylib+0.935
ulner+0.922
ater+0.921
aters+0.912
Neg logit contributions
ignty-0.842
acted-0.816
lette-0.810
hedral-0.805
 Okinawa-0.769
Token T
Token ID309
Feature activation+1.93e-01

Pos logit contributions
 synd+0.077
dylib+0.071
ulner+0.070
ater+0.070
aters+0.069
Neg logit contributions
ignty-0.065
acted-0.063
lette-0.062
hedral-0.062
 Okinawa-0.060
Token&
Token ID5
Feature activation-2.12e+00

Pos logit contributions
ignty+0.343
acted+0.338
lette+0.332
hedral+0.332
 Okinawa+0.318
Neg logit contributions
 synd-0.380
dylib-0.352
ulner-0.347
ater-0.346
aters-0.343
TokenA
Token ID32
Feature activation-1.06e+00

Pos logit contributions
 synd+3.315
dylib+3.071
ater+3.061
aters+3.042
ulner+2.960
Neg logit contributions
ignty-4.608
acted-4.406
hedral-4.345
lette-4.305
 Okinawa-4.175
Token,
Token ID11
Feature activation-8.51e-02

Pos logit contributions
 synd+2.147
dylib+1.964
ater+1.955
aters+1.938
ulner+1.938
Neg logit contributions
ignty-1.830
acted-1.803
lette-1.757
hedral-1.754
 Okinawa-1.677
Token this
Token ID428
Feature activation-1.75e-02

Pos logit contributions
 synd+0.181
dylib+0.168
ulner+0.166
ater+0.166
aters+0.165
Neg logit contributions
ignty-0.139
acted-0.136
lette-0.134
hedral-0.134
 Okinawa-0.127
Token title
Token ID3670
Feature activation+3.49e-02

Pos logit contributions
 synd+0.038
dylib+0.035
ater+0.035
ulner+0.035
aters+0.035
Neg logit contributions
ignty-0.028
acted-0.027
hedral-0.027
lette-0.026
 Okinawa-0.025
Token is
Token ID318
Feature activation+3.15e-01

Pos logit contributions
ignty+0.051
acted+0.050
lette+0.049
hedral+0.049
 Okinawa+0.047
Neg logit contributions
 synd-0.080
dylib-0.074
ulner-0.074
ater-0.073
aters-0.073
Token Russia
Token ID3284
Feature activation+9.89e-02

Pos logit contributions
 synd+0.386
dylib+0.359
ulner+0.354
ater+0.353
aters+0.351
Neg logit contributions
ignty-0.325
acted-0.320
lette-0.315
hedral-0.314
 Okinawa-0.301
Token.
Token ID13
Feature activation-9.07e-02

Pos logit contributions
ignty+0.167
acted+0.164
hedral+0.162
lette+0.162
 Okinawa+0.154
Neg logit contributions
 synd-0.203
dylib-0.189
ulner-0.187
ater-0.186
aters-0.185
Token
Token ID198
Feature activation-1.39e-01

Pos logit contributions
 synd+0.196
dylib+0.184
ater+0.181
ulner+0.181
aters+0.180
Neg logit contributions
ignty-0.143
acted-0.140
hedral-0.139
lette-0.138
 wedge-0.132
Token
Token ID198
Feature activation-4.50e-02

Pos logit contributions
 synd+0.305
dylib+0.285
ater+0.281
ulner+0.281
aters+0.279
Neg logit contributions
ignty-0.216
acted-0.212
hedral-0.208
lette-0.208
 Okinawa-0.198
TokenImage
Token ID5159
Feature activation-6.23e-01

Pos logit contributions
 synd+0.087
dylib+0.081
ulner+0.079
ater+0.079
aters+0.079
Neg logit contributions
ignty-0.081
acted-0.080
hedral-0.079
lette-0.079
 Okinawa-0.076
Token copyright
Token ID6634
Feature activation-2.11e+00

Pos logit contributions
 synd+0.796
dylib+0.688
ulner+0.668
ater+0.667
aters+0.665
Neg logit contributions
ignty-1.554
acted-1.537
lette-1.514
hedral-1.506
 Okinawa-1.467
Token AFP
Token ID8916
Feature activation-1.57e+00

Pos logit contributions
 synd+4.582
ater+4.156
aters+4.147
dylib+4.111
ulner+4.094
Neg logit contributions
ignty-3.353
acted-3.258
hedral-3.229
lette-3.221
 wedge-2.997
Token Image
Token ID7412
Feature activation-4.12e-01

Pos logit contributions
 synd+3.242
dylib+2.972
ulner+2.968
ater+2.961
aters+2.941
Neg logit contributions
ignty-2.595
acted-2.550
hedral-2.518
lette-2.509
 wedge-2.386
Token caption
Token ID8305
Feature activation-1.12e+00

Pos logit contributions
 synd+0.639
dylib+0.580
ulner+0.574
ater+0.569
aters+0.564
Neg logit contributions
ignty-0.902
acted-0.892
lette-0.882
hedral-0.880
 Okinawa-0.850
Token The
Token ID383
Feature activation+5.78e-01

Pos logit contributions
 synd+2.324
dylib+2.148
ater+2.133
aters+2.132
ulner+2.125
Neg logit contributions
ignty-1.855
hedral-1.811
acted-1.811
lette-1.796
 wedge-1.711
Token Kurds
Token ID23880
Feature activation+6.96e-01

Pos logit contributions
ignty+0.966
acted+0.950
lette+0.928
hedral+0.926
 wedge+0.905
Neg logit contributions
 synd-1.174
dylib-1.114
ulner-1.102
culosis-1.100
ater-1.082
Token two
Token ID734
Feature activation+1.84e-01

Pos logit contributions
 synd+0.148
dylib+0.137
ater+0.135
ulner+0.135
aters+0.134
Neg logit contributions
ignty-0.103
acted-0.100
hedral-0.099
lette-0.098
 Okinawa-0.093
Token hits
Token ID7127
Feature activation-4.16e-01

Pos logit contributions
ignty+0.305
acted+0.298
hedral+0.293
lette+0.292
 Okinawa+0.279
Neg logit contributions
 synd-0.388
dylib-0.358
ater-0.355
ulner-0.354
aters-0.352
Token and
Token ID290
Feature activation-1.26e-01

Pos logit contributions
 synd+0.818
dylib+0.749
ater+0.742
ulner+0.739
aters+0.733
Neg logit contributions
ignty-0.745
acted-0.733
hedral-0.723
lette-0.718
 Okinawa-0.690
Token 12
Token ID1105
Feature activation-4.06e-01

Pos logit contributions
 synd+0.287
dylib+0.265
ater+0.264
ulner+0.263
aters+0.261
Neg logit contributions
ignty-0.187
acted-0.181
hedral-0.179
lette-0.177
 Okinawa-0.167
Token (
Token ID357
Feature activation+3.08e-01

Pos logit contributions
 synd+0.800
dylib+0.734
ater+0.730
ulner+0.727
aters+0.723
Neg logit contributions
ignty-0.729
acted-0.711
hedral-0.701
lette-0.701
 Okinawa-0.668
Tokenyes
Token ID8505
Feature activation-2.11e+00

Pos logit contributions
ignty+0.458
acted+0.450
lette+0.441
hedral+0.441
 Okinawa+0.419
Neg logit contributions
 synd-0.697
dylib-0.652
ulner-0.645
ater-0.643
aters-0.639
Token,
Token ID11
Feature activation-1.18e-01

Pos logit contributions
 synd+3.796
ater+3.395
aters+3.376
dylib+3.308
ulner+3.286
Neg logit contributions
ignty-4.029
acted-3.948
hedral-3.937
lette-3.911
 wedge-3.783
Token TW
Token ID17306
Feature activation+3.42e-01

Pos logit contributions
 synd+0.253
dylib+0.235
ulner+0.233
ater+0.233
aters+0.231
Neg logit contributions
ignty-0.188
acted-0.184
lette-0.181
hedral-0.181
 Okinawa-0.172
TokenEL
Token ID3698
Feature activation-1.72e-01

Pos logit contributions
ignty+0.595
acted+0.589
lette+0.580
hedral+0.575
 Okinawa+0.555
Neg logit contributions
 synd-0.682
dylib-0.634
ulner-0.625
ater-0.621
aters-0.615
TokenVE
Token ID6089
Feature activation+9.30e-02

Pos logit contributions
 synd+0.346
dylib+0.321
ulner+0.317
ater+0.316
aters+0.314
Neg logit contributions
ignty-0.297
acted-0.293
lette-0.288
hedral-0.288
 Okinawa-0.276
Token)
Token ID8
Feature activation-2.77e-01

Pos logit contributions
ignty+0.144
acted+0.142
lette+0.139
hedral+0.139
 Okinawa+0.133
Neg logit contributions
 synd-0.204
dylib-0.190
ulner-0.188
ater-0.188
aters-0.186
Token a
Token ID257
Feature activation-1.42e-01

Pos logit contributions
 synd+0.745
dylib+0.693
ater+0.683
ulner+0.683
aters+0.680
Neg logit contributions
ignty-0.514
acted-0.506
lette-0.496
hedral-0.495
 wedge-0.469
Token licensing
Token ID15665
Feature activation-9.24e-01

Pos logit contributions
 synd+0.281
dylib+0.261
ulner+0.259
ater+0.257
aters+0.256
Neg logit contributions
ignty-0.249
acted-0.246
lette-0.243
hedral-0.242
 Okinawa-0.234
Token division
Token ID7297
Feature activation-3.60e-01

Pos logit contributions
 synd+1.676
dylib+1.503
ater+1.486
ulner+1.473
aters+1.466
Neg logit contributions
ignty-1.824
acted-1.791
hedral-1.770
lette-1.754
 Okinawa-1.687
Token of
Token ID286
Feature activation-2.43e-03

Pos logit contributions
 synd+0.759
dylib+0.704
ulner+0.696
ater+0.695
aters+0.689
Neg logit contributions
ignty-0.591
acted-0.581
hedral-0.570
lette-0.570
 Okinawa-0.545
Token AT
Token ID5161
Feature activation-8.02e-02

Pos logit contributions
 synd+0.005
dylib+0.005
ulner+0.005
ater+0.005
aters+0.005
Neg logit contributions
ignty-0.004
acted-0.004
lette-0.004
hedral-0.004
 Okinawa-0.004
Token&
Token ID5
Feature activation-2.09e+00

Pos logit contributions
 synd+0.190
dylib+0.180
ulner+0.178
ater+0.177
aters+0.176
Neg logit contributions
ignty-0.107
acted-0.107
lette-0.105
hedral-0.104
 Okinawa-0.100
TokenT
Token ID51
Feature activation-1.04e+00

Pos logit contributions
 synd+3.736
dylib+3.555
ater+3.453
 iT+3.417
aters+3.398
Neg logit contributions
ignty-4.090
hedral-3.912
acted-3.849
lette-3.791
alsh-3.700
Token/
Token ID14
Feature activation-1.44e+00

Pos logit contributions
 synd+2.130
dylib+1.979
ulner+1.955
ater+1.949
aters+1.936
Neg logit contributions
ignty-1.753
acted-1.726
lette-1.697
hedral-1.694
 Okinawa-1.622
TokenWestern
Token ID24227
Feature activation+3.67e-02

Pos logit contributions
 synd+2.701
dylib+2.478
ater+2.454
aters+2.440
ulner+2.430
Neg logit contributions
ignty-2.691
acted-2.623
hedral-2.616
lette-2.576
 wedge-2.501
Token Electric
Token ID13944
Feature activation-5.14e-01

Pos logit contributions
ignty+0.074
acted+0.073
lette+0.072
hedral+0.072
 Okinawa+0.070
Neg logit contributions
 synd-0.063
dylib-0.059
ulner-0.057
ater-0.057
aters-0.057
Token,
Token ID11
Feature activation-7.23e-02

Pos logit contributions
 synd+1.067
dylib+1.000
ulner+0.987
ater+0.982
aters+0.977
Neg logit contributions
ignty-0.850
acted-0.839
lette-0.828
hedral-0.820
 Okinawa-0.791
Token this
Token ID428
Feature activation+2.88e-01

Pos logit contributions
 synd+0.440
dylib+0.408
ulner+0.403
ater+0.402
aters+0.399
Neg logit contributions
ignty-0.377
acted-0.370
hedral-0.364
lette-0.364
 Okinawa-0.348
Token into
Token ID656
Feature activation+2.15e-02

Pos logit contributions
ignty+0.503
acted+0.497
lette+0.490
hedral+0.487
 Okinawa+0.469
Neg logit contributions
 synd-0.573
dylib-0.535
ulner-0.529
ater-0.524
aters-0.522
Token perspective
Token ID6650
Feature activation+4.03e-01

Pos logit contributions
ignty+0.027
acted+0.026
lette+0.026
hedral+0.026
 Okinawa+0.024
Neg logit contributions
 synd-0.054
dylib-0.050
ulner-0.050
ater-0.050
aters-0.049
Token,
Token ID11
Feature activation+1.58e-01

Pos logit contributions
ignty+0.694
acted+0.684
lette+0.673
hedral+0.670
 Okinawa+0.644
Neg logit contributions
 synd-0.812
dylib-0.756
ulner-0.747
ater-0.744
aters-0.739
Token AT
Token ID5161
Feature activation+3.95e-01

Pos logit contributions
ignty+0.282
acted+0.279
lette+0.275
hedral+0.273
 Okinawa+0.264
Neg logit contributions
 synd-0.309
dylib-0.286
ulner-0.284
ater-0.282
aters-0.279
Token&
Token ID5
Feature activation-2.07e+00

Pos logit contributions
acted+0.481
lette+0.468
ignty+0.467
hedral+0.451
front+0.439
Neg logit contributions
 synd-0.986
dylib-0.929
culosis-0.926
ulner-0.925
ater-0.911
TokenT
Token ID51
Feature activation-1.24e+00

Pos logit contributions
 synd+3.883
dylib+3.632
ater+3.600
aters+3.544
itis+3.473
Neg logit contributions
ignty-3.879
hedral-3.718
acted-3.682
lette-3.635
 wedge-3.523
Token's
Token ID338
Feature activation+6.88e-01

Pos logit contributions
 synd+2.384
ater+2.154
aters+2.145
dylib+2.142
ulner+2.118
Neg logit contributions
ignty-2.278
hedral-2.217
acted-2.205
lette-2.179
 Okinawa-2.093
Token spending
Token ID4581
Feature activation-4.34e-01

Pos logit contributions
ignty+1.207
acted+1.201
lette+1.177
hedral+1.166
 wedge+1.137
Neg logit contributions
 synd-1.344
dylib-1.262
ulner-1.242
ater-1.228
culosis-1.222
Token for
Token ID329
Feature activation+6.28e-02

Pos logit contributions
 synd+0.847
dylib+0.775
ater+0.763
ulner+0.761
aters+0.758
Neg logit contributions
ignty-0.789
acted-0.770
hedral-0.763
lette-0.759
 Okinawa-0.726
Token three
Token ID1115
Feature activation-3.11e-01

Pos logit contributions
ignty+0.099
acted+0.097
hedral+0.095
lette+0.095
 Okinawa+0.090
Neg logit contributions
 synd-0.137
dylib-0.128
ulner-0.126
ater-0.126
aters-0.125
Token went
Token ID1816
Feature activation+1.08e-01

Pos logit contributions
 synd+0.290
dylib+0.270
ulner+0.267
ater+0.266
aters+0.265
Neg logit contributions
ignty-0.206
acted-0.202
hedral-0.198
lette-0.198
 Okinawa-0.189
Token missing
Token ID4814
Feature activation+3.22e-03

Pos logit contributions
ignty+0.184
acted+0.182
lette+0.179
hedral+0.178
 Okinawa+0.171
Neg logit contributions
 synd-0.221
dylib-0.206
ulner-0.203
ater-0.202
aters-0.201
Token,
Token ID11
Feature activation+7.80e-02

Pos logit contributions
ignty+0.006
acted+0.006
hedral+0.005
lette+0.005
 wedge+0.005
Neg logit contributions
 synd-0.006
dylib-0.006
ulner-0.006
ater-0.006
aters-0.006
Token and
Token ID290
Feature activation-1.47e-02

Pos logit contributions
ignty+0.133
acted+0.130
lette+0.128
hedral+0.128
 Okinawa+0.122
Neg logit contributions
 synd-0.160
dylib-0.148
ulner-0.146
ater-0.146
aters-0.145
Token a
Token ID257
Feature activation-8.14e-02

Pos logit contributions
 synd+0.032
dylib+0.029
ulner+0.029
ater+0.029
aters+0.029
Neg logit contributions
ignty-0.024
acted-0.023
lette-0.023
hedral-0.023
 Okinawa-0.022
Token cro
Token ID6763
Feature activation-2.05e+00

Pos logit contributions
 synd+0.165
dylib+0.153
ulner+0.151
ater+0.150
aters+0.149
Neg logit contributions
ignty-0.141
acted-0.138
hedral-0.136
lette-0.136
 Okinawa-0.130
Tokenc
Token ID66
Feature activation+3.00e-02

Pos logit contributions
 synd+4.150
ater+3.999
aters+3.843
dylib+3.838
ulner+3.741
Neg logit contributions
ignty-3.505
hedral-3.301
acted-3.280
lette-3.265
 Okinawa-3.240
Token is
Token ID318
Feature activation+1.18e-01

Pos logit contributions
ignty+0.046
acted+0.045
lette+0.045
hedral+0.045
 wedge+0.043
Neg logit contributions
 synd-0.066
dylib-0.062
ulner-0.061
ater-0.060
aters-0.060
Token a
Token ID257
Feature activation+2.54e-01

Pos logit contributions
ignty+0.185
acted+0.183
lette+0.179
hedral+0.179
 Okinawa+0.172
Neg logit contributions
 synd-0.254
dylib-0.240
ulner-0.236
ater-0.235
culosis-0.234
Token prime
Token ID6994
Feature activation-3.78e-01

Pos logit contributions
ignty+0.426
acted+0.421
lette+0.413
hedral+0.412
 Okinawa+0.396
Neg logit contributions
 synd-0.521
dylib-0.486
ulner-0.480
ater-0.478
aters-0.475
Token suspect
Token ID4099
Feature activation+2.70e-01

Pos logit contributions
 synd+0.730
dylib+0.666
ater+0.662
ulner+0.658
aters+0.654
Neg logit contributions
ignty-0.695
acted-0.680
lette-0.670
hedral-0.669
 Okinawa-0.640
Token CD
Token ID6458
Feature activation-2.57e-01

Pos logit contributions
 synd+1.133
dylib+1.044
ater+1.029
ulner+1.028
aters+1.020
Neg logit contributions
ignty-1.063
acted-1.044
hedral-1.026
lette-1.025
 Okinawa-0.979
Token-
Token ID12
Feature activation-3.18e-01

Pos logit contributions
 synd+0.586
dylib+0.548
ater+0.541
ulner+0.540
aters+0.537
Neg logit contributions
ignty-0.378
acted-0.371
hedral-0.363
lette-0.362
 Okinawa-0.345
TokenROM
Token ID33676
Feature activation-3.44e-01

Pos logit contributions
 synd+0.764
dylib+0.717
ater+0.710
ulner+0.709
aters+0.707
Neg logit contributions
ignty-0.427
acted-0.416
hedral-0.407
lette-0.406
 Okinawa-0.386
Token
Token ID198
Feature activation-1.31e-01

Pos logit contributions
 synd+0.710
dylib+0.658
ulner+0.650
ater+0.649
aters+0.644
Neg logit contributions
ignty-0.582
acted-0.571
hedral-0.561
lette-0.560
 Okinawa-0.536
Token
Token ID198
Feature activation-9.92e-02

Pos logit contributions
 synd+0.287
dylib+0.269
ater+0.265
ulner+0.264
aters+0.263
Neg logit contributions
ignty-0.204
acted-0.200
hedral-0.196
lette-0.196
 Okinawa-0.187
TokenRating
Token ID29321
Feature activation-2.04e+00

Pos logit contributions
 synd+0.163
dylib+0.149
ulner+0.146
ater+0.146
aters+0.144
Neg logit contributions
ignty-0.209
acted-0.206
lette-0.203
hedral-0.203
 Okinawa-0.196
Token:
Token ID25
Feature activation-5.15e-01

Pos logit contributions
 synd+4.203
dylib+3.879
ater+3.865
ulner+3.825
aters+3.823
Neg logit contributions
ignty-3.399
acted-3.326
hedral-3.293
lette-3.256
judicial-3.130
Token E
Token ID412
Feature activation-3.22e-01

Pos logit contributions
 synd+1.035
dylib+0.959
ater+0.945
ulner+0.942
aters+0.938
Neg logit contributions
ignty-0.898
acted-0.882
hedral-0.867
lette-0.865
 Okinawa-0.827
TokenSR
Token ID12562
Feature activation+1.38e-01

Pos logit contributions
 synd+0.701
dylib+0.653
ulner+0.647
ater+0.646
aters+0.642
Neg logit contributions
ignty-0.505
acted-0.495
hedral-0.486
lette-0.485
 Okinawa-0.463
TokenB
Token ID33
Feature activation+2.29e-01

Pos logit contributions
ignty+0.267
acted+0.263
hedral+0.259
lette+0.259
 Okinawa+0.250
Neg logit contributions
 synd-0.249
dylib-0.229
ater-0.226
ulner-0.226
aters-0.223
Token:
Token ID25
Feature activation-3.14e-01

Pos logit contributions
ignty+0.355
acted+0.349
lette+0.342
hedral+0.342
 Okinawa+0.326
Neg logit contributions
 synd-0.505
dylib-0.471
ulner-0.466
ater-0.465
aters-0.462
Token,
Token ID11
Feature activation+1.54e-01

Pos logit contributions
ignty+0.783
acted+0.778
lette+0.772
hedral+0.763
 wedge+0.731
Neg logit contributions
 synd-0.892
dylib-0.832
ulner-0.814
culosis-0.812
aters-0.807
Token a
Token ID257
Feature activation-2.09e-01

Pos logit contributions
ignty+0.260
acted+0.257
lette+0.253
hedral+0.252
 Okinawa+0.242
Neg logit contributions
 synd-0.315
dylib-0.294
ulner-0.291
ater-0.289
aters-0.288
Token former
Token ID1966
Feature activation-2.71e-01

Pos logit contributions
 synd+0.363
dylib+0.335
ulner+0.332
ater+0.329
aters+0.328
Neg logit contributions
ignty-0.414
acted-0.410
lette-0.405
hedral-0.405
 Okinawa-0.391
Token executive
Token ID4640
Feature activation-6.61e-01

Pos logit contributions
 synd+0.446
dylib+0.411
ulner+0.407
ater+0.402
aters+0.400
Neg logit contributions
ignty-0.559
acted-0.555
lette-0.548
hedral-0.547
 Okinawa-0.531
Token at
Token ID379
Feature activation+3.69e-01

Pos logit contributions
 synd+1.101
dylib+0.996
ulner+0.983
ater+0.980
aters+0.970
Neg logit contributions
ignty-1.383
acted-1.363
lette-1.342
hedral-1.342
 Okinawa-1.296
Token Bear
Token ID14732
Feature activation-2.03e+00

Pos logit contributions
acted+0.720
ignty+0.720
lette+0.707
hedral+0.707
 Okinawa+0.684
Neg logit contributions
 synd-0.646
dylib-0.599
ulner-0.593
ater-0.587
culosis-0.582
Token St
Token ID520
Feature activation-6.48e-01

Pos logit contributions
 synd+3.691
ater+3.306
dylib+3.273
ulner+3.266
aters+3.242
Neg logit contributions
ignty-4.099
acted-3.857
hedral-3.847
lette-3.770
 wedge-3.713
Tokenear
Token ID451
Feature activation+4.36e-01

Pos logit contributions
 synd+1.317
dylib+1.219
ater+1.213
ulner+1.209
aters+1.204
Neg logit contributions
ignty-1.111
acted-1.086
hedral-1.076
lette-1.069
 Okinawa-1.028
Tokenns
Token ID5907
Feature activation-2.31e-01

Pos logit contributions
ignty+0.728
acted+0.728
lette+0.722
hedral+0.695
front+0.680
Neg logit contributions
 synd-0.886
dylib-0.842
ulner-0.817
culosis-0.813
ater-0.807
Token,
Token ID11
Feature activation-1.42e-01

Pos logit contributions
 synd+0.449
dylib+0.417
ulner+0.411
ater+0.409
aters+0.407
Neg logit contributions
ignty-0.414
acted-0.409
lette-0.403
hedral-0.402
 Okinawa-0.386
Token a
Token ID257
Feature activation-2.96e-01

Pos logit contributions
 synd+0.315
dylib+0.293
ulner+0.290
ater+0.290
aters+0.287
Neg logit contributions
ignty-0.217
acted-0.212
lette-0.209
hedral-0.209
 Okinawa-0.198
Token historian
Token ID18026
Feature activation+4.18e-01

Pos logit contributions
 synd+0.075
dylib+0.069
ulner+0.069
ater+0.068
aters+0.068
Neg logit contributions
ignty-0.069
acted-0.068
lette-0.067
hedral-0.067
 Okinawa-0.064
Token of
Token ID286
Feature activation+1.62e-03

Pos logit contributions
ignty+0.719
acted+0.708
lette+0.696
hedral+0.695
 Okinawa+0.666
Neg logit contributions
 synd-0.848
dylib-0.786
ulner-0.776
ater-0.775
aters-0.768
Token culture
Token ID3968
Feature activation+5.86e-01

Pos logit contributions
ignty+0.003
acted+0.003
lette+0.003
hedral+0.003
 Okinawa+0.003
Neg logit contributions
 synd-0.003
dylib-0.003
ulner-0.003
ater-0.003
aters-0.003
Token,
Token ID11
Feature activation-6.37e-02

Pos logit contributions
ignty+1.050
acted+1.035
lette+1.018
hedral+1.017
 Okinawa+0.976
Neg logit contributions
 synd-1.143
dylib-1.058
ulner-1.045
ater-1.041
aters-1.034
Token he
Token ID339
Feature activation-2.09e-01

Pos logit contributions
 synd+0.143
dylib+0.133
ater+0.131
ulner+0.131
aters+0.130
Neg logit contributions
ignty-0.097
acted-0.095
lette-0.093
hedral-0.093
 Okinawa-0.088
Token summed
Token ID32794
Feature activation-2.01e+00

Pos logit contributions
 synd+0.498
dylib+0.463
ater+0.461
ulner+0.460
aters+0.456
Neg logit contributions
ignty-0.289
acted-0.283
hedral-0.279
lette-0.275
 Okinawa-0.261
Token up
Token ID510
Feature activation-3.82e-01

Pos logit contributions
 synd+3.323
ater+2.910
dylib+2.903
ulner+2.866
aters+2.838
Neg logit contributions
ignty-4.243
acted-4.113
lette-4.071
hedral-4.064
 wedge-3.913
Token Z
Token ID1168
Feature activation+1.17e-01

Pos logit contributions
 synd+0.798
dylib+0.739
ulner+0.730
ater+0.729
aters+0.723
Neg logit contributions
ignty-0.637
acted-0.626
lette-0.616
hedral-0.614
 Okinawa-0.587
Tokent
Token ID83
Feature activation-2.93e-01

Pos logit contributions
ignty+0.186
acted+0.183
lette+0.180
hedral+0.179
 Okinawa+0.171
Neg logit contributions
 synd-0.252
dylib-0.235
ulner-0.232
ater-0.232
aters-0.230
Tokenoh
Token ID1219
Feature activation-1.18e+00

Pos logit contributions
 synd+0.608
dylib+0.565
ulner+0.559
ater+0.557
aters+0.554
Neg logit contributions
ignty-0.488
acted-0.480
lette-0.471
hedral-0.471
 Okinawa-0.451
Tokenoven
Token ID16206
Feature activation-1.39e-01

Pos logit contributions
 synd+2.450
dylib+2.268
ater+2.254
ulner+2.248
aters+2.226
Neg logit contributions
ignty-1.966
acted-1.931
hedral-1.888
lette-1.877
 Okinawa-1.827
TokenS
Token ID50
Feature activation+2.95e-01

Pos logit contributions
ignty+0.187
acted+0.184
hedral+0.181
lette+0.181
 Okinawa+0.173
Neg logit contributions
 synd-0.221
dylib-0.205
ulner-0.202
ater-0.202
aters-0.200
Token.
Token ID13
Feature activation-7.67e-02

Pos logit contributions
ignty+0.428
acted+0.421
lette+0.413
hedral+0.411
 Okinawa+0.392
Neg logit contributions
 synd-0.674
dylib-0.633
ulner-0.624
ater-0.622
aters-0.618
Token soldier
Token ID10686
Feature activation+1.74e-02

Pos logit contributions
 synd+0.154
dylib+0.143
ulner+0.141
ater+0.140
aters+0.140
Neg logit contributions
ignty-0.133
acted-0.131
lette-0.129
hedral-0.129
 Okinawa-0.124
Token sue
Token ID20889
Feature activation+6.11e-01

Pos logit contributions
ignty+0.029
acted+0.028
lette+0.028
hedral+0.028
 Okinawa+0.027
Neg logit contributions
 synd-0.036
dylib-0.034
ulner-0.033
ater-0.033
aters-0.033
Token Omar
Token ID24980
Feature activation+4.67e-02

Pos logit contributions
ignty+1.109
acted+1.100
lette+1.087
hedral+1.084
 Okinawa+1.058
Neg logit contributions
 synd-1.158
dylib-1.086
ulner-1.077
ater-1.061
culosis-1.057
Token Kh
Token ID5311
Feature activation-1.99e+00

Pos logit contributions
ignty+0.085
acted+0.084
lette+0.082
hedral+0.082
 Okinawa+0.079
Neg logit contributions
 synd-0.090
dylib-0.083
ulner-0.082
ater-0.082
aters-0.081
Tokenadr
Token ID41909
Feature activation-7.58e-02

Pos logit contributions
 synd+4.360
ater+4.201
dylib+4.114
aters+4.103
ulner+4.012
Neg logit contributions
ignty-3.000
acted-2.843
lette-2.827
hedral-2.823
 Okinawa-2.819
Token for
Token ID329
Feature activation+7.71e-02

Pos logit contributions
 synd+0.151
dylib+0.139
ulner+0.138
ater+0.137
aters+0.136
Neg logit contributions
ignty-0.133
acted-0.131
lette-0.129
hedral-0.129
 Okinawa-0.124
Token nearly
Token ID3016
Feature activation-5.62e-01

Pos logit contributions
ignty+0.141
acted+0.139
lette+0.137
hedral+0.136
 Okinawa+0.131
Neg logit contributions
 synd-0.148
dylib-0.137
ulner-0.135
ater-0.134
aters-0.134
Token $
Token ID720
Feature activation-6.78e-02

Pos logit contributions
 synd+1.110
ater+1.009
dylib+1.006
aters+0.995
ulner+0.983
Neg logit contributions
ignty-1.013
acted-0.999
hedral-0.976
lette-0.969
onso-0.928
Token50
Token ID1120
Feature activation+7.02e-01

Pos logit contributions
 synd+0.150
dylib+0.141
ater+0.140
ulner+0.139
aters+0.138
Neg logit contributions
ignty-0.104
acted-0.102
hedral-0.100
lette-0.100
 Okinawa-0.095
Token Foundation
Token ID5693
Feature activation-4.52e-01

Pos logit contributions
 synd+0.253
dylib+0.236
ulner+0.234
ater+0.234
aters+0.233
Neg logit contributions
ignty-0.148
acted-0.144
lette-0.141
hedral-0.141
 Okinawa-0.133
Token President
Token ID1992
Feature activation+4.78e-02

Pos logit contributions
 synd+0.833
dylib+0.753
ulner+0.751
ater+0.748
aters+0.743
Neg logit contributions
ignty-0.871
acted-0.853
hedral-0.842
lette-0.839
 Okinawa-0.808
Token Donna
Token ID29216
Feature activation-2.60e-01

Pos logit contributions
ignty+0.079
acted+0.078
hedral+0.076
lette+0.076
 Okinawa+0.073
Neg logit contributions
 synd-0.100
dylib-0.092
ater-0.092
ulner-0.092
aters-0.091
Token Shal
Token ID47974
Feature activation-3.00e-01

Pos logit contributions
 synd+0.567
ater+0.520
aters+0.518
ulner+0.518
dylib+0.516
Neg logit contributions
ignty-0.414
acted-0.398
hedral-0.397
lette-0.395
 Okinawa-0.377
Tokenala
Token ID6081
Feature activation-3.13e-01

Pos logit contributions
 synd+0.833
ater+0.787
dylib+0.780
ulner+0.778
aters+0.777
Neg logit contributions
ignty-0.293
hedral-0.281
acted-0.280
lette-0.273
 Okinawa-0.260
Token summed
Token ID32794
Feature activation-1.99e+00

Pos logit contributions
 synd+0.751
dylib+0.697
ulner+0.694
ater+0.693
aters+0.687
Neg logit contributions
ignty-0.428
acted-0.415
hedral-0.409
lette-0.406
 Okinawa-0.385
Token up
Token ID510
Feature activation-4.07e-01

Pos logit contributions
 synd+3.373
ater+2.967
dylib+2.953
ulner+2.933
aters+2.913
Neg logit contributions
ignty-4.166
acted-3.991
hedral-3.957
lette-3.953
 Okinawa-3.800
Token the
Token ID262
Feature activation+1.83e-01

Pos logit contributions
 synd+0.896
dylib+0.835
ulner+0.826
ater+0.823
aters+0.818
Neg logit contributions
ignty-0.630
acted-0.617
lette-0.606
hedral-0.605
 Okinawa-0.577
Token complicated
Token ID8253
Feature activation+6.70e-01

Pos logit contributions
ignty+0.339
acted+0.335
hedral+0.330
lette+0.330
 wedge+0.318
Neg logit contributions
 synd-0.342
dylib-0.318
ulner-0.314
ater-0.311
aters-0.309
Token unt
Token ID1418
Feature activation+5.45e-01

Pos logit contributions
ignty+1.301
acted+1.300
lette+1.289
hedral+1.285
 wedge+1.268
Neg logit contributions
 synd-1.151
dylib-1.090
ulner-1.072
ater-1.062
culosis-1.062
Tokenangling
Token ID27499
Feature activation-5.99e-01

Pos logit contributions
acted+1.064
ignty+1.047
lette+1.034
hedral+1.026
front+0.992
Neg logit contributions
 synd-0.990
dylib-0.895
culosis-0.891
ulner-0.878
ater-0.869
Token,
Token ID11
Feature activation+2.58e-01

Pos logit contributions
ignty+0.596
acted+0.592
lette+0.581
hedral+0.576
 Okinawa+0.549
Neg logit contributions
 synd-0.886
dylib-0.837
ulner-0.826
ater-0.819
aters-0.815
Token who
Token ID508
Feature activation+1.99e-01

Pos logit contributions
ignty+0.453
acted+0.448
lette+0.440
hedral+0.439
 Okinawa+0.423
Neg logit contributions
 synd-0.510
dylib-0.474
ulner-0.469
ater-0.466
aters-0.463
Token had
Token ID550
Feature activation+5.88e-01

Pos logit contributions
ignty+0.285
acted+0.282
lette+0.276
hedral+0.274
 Okinawa+0.262
Neg logit contributions
 synd-0.455
dylib-0.429
ulner-0.424
ater-0.422
aters-0.419
Token started
Token ID2067
Feature activation+3.90e-01

Pos logit contributions
acted+0.880
ignty+0.880
lette+0.862
hedral+0.859
 Okinawa+0.823
Neg logit contributions
 synd-1.287
dylib-1.228
ater-1.210
ulner-1.210
aters-1.204
Token at
Token ID379
Feature activation+2.81e-01

Pos logit contributions
acted+0.707
ignty+0.706
hedral+0.694
lette+0.693
 Okinawa+0.667
Neg logit contributions
 synd-0.734
dylib-0.688
ulner-0.683
ater-0.675
aters-0.671
Token Goldman
Token ID18740
Feature activation-1.97e+00

Pos logit contributions
ignty+0.547
acted+0.546
lette+0.536
hedral+0.536
 Okinawa+0.518
Neg logit contributions
 synd-0.496
dylib-0.458
ulner-0.453
ater-0.449
aters-0.446
Token eight
Token ID3624
Feature activation+1.17e-02

Pos logit contributions
 synd+3.315
dylib+2.955
ulner+2.933
ater+2.927
aters+2.891
Neg logit contributions
ignty-4.088
acted-4.008
hedral-3.926
lette-3.907
 wedge-3.790
Token years
Token ID812
Feature activation+3.50e-02

Pos logit contributions
ignty+0.023
acted+0.023
lette+0.023
hedral+0.023
 Okinawa+0.022
Neg logit contributions
 synd-0.020
dylib-0.019
ulner-0.018
ater-0.018
aters-0.018
Token before
Token ID878
Feature activation-4.19e-01

Pos logit contributions
ignty+0.057
acted+0.056
lette+0.055
hedral+0.055
 Okinawa+0.052
Neg logit contributions
 synd-0.074
dylib-0.069
ulner-0.068
ater-0.068
aters-0.068
Token Cohn
Token ID45005
Feature activation-3.45e-01

Pos logit contributions
 synd+0.843
dylib+0.781
ulner+0.770
ater+0.768
aters+0.762
Neg logit contributions
ignty-0.729
acted-0.716
hedral-0.705
lette-0.704
 Okinawa-0.674
Token,
Token ID11
Feature activation+1.92e-01

Pos logit contributions
 synd+0.706
dylib+0.654
ulner+0.647
ater+0.646
aters+0.642
Neg logit contributions
ignty-0.588
acted-0.578
lette-0.568
hedral-0.568
 Okinawa-0.544
Token happens
Token ID4325
Feature activation-3.65e-02

Pos logit contributions
 synd+0.199
dylib+0.183
ulner+0.180
ater+0.179
aters+0.178
Neg logit contributions
ignty-0.220
acted-0.217
lette-0.214
hedral-0.214
 Okinawa-0.206
Token in
Token ID287
Feature activation-9.55e-02

Pos logit contributions
 synd+0.080
dylib+0.075
ulner+0.074
ater+0.074
aters+0.073
Neg logit contributions
ignty-0.057
acted-0.056
hedral-0.055
lette-0.055
 Okinawa-0.052
Token Greater
Token ID18169
Feature activation+1.87e-02

Pos logit contributions
 synd+0.161
dylib+0.145
ater+0.143
ulner+0.143
aters+0.141
Neg logit contributions
ignty-0.200
acted-0.196
lette-0.194
hedral-0.192
 Okinawa-0.185
Token Cincinnati
Token ID16137
Feature activation+5.75e-01

Pos logit contributions
ignty+0.042
acted+0.041
lette+0.041
hedral+0.041
 Okinawa+0.039
Neg logit contributions
 synd-0.028
dylib-0.026
ulner-0.025
ater-0.025
aters-0.025
Token.
Token ID13
Feature activation+1.27e-01

Pos logit contributions
ignty+0.914
acted+0.900
lette+0.883
hedral+0.882
 Okinawa+0.842
Neg logit contributions
 synd-1.237
dylib-1.153
ulner-1.140
ater-1.136
aters-1.129
Token Please
Token ID4222
Feature activation-1.92e+00

Pos logit contributions
ignty+0.227
acted+0.224
hedral+0.220
lette+0.220
 Okinawa+0.210
Neg logit contributions
 synd-0.251
dylib-0.232
ater-0.228
ulner-0.228
aters-0.226
Token try
Token ID1949
Feature activation-4.87e-01

Pos logit contributions
 synd+3.460
ater+3.078
dylib+3.052
aters+3.043
ulner+3.025
Neg logit contributions
ignty-3.945
acted-3.646
hedral-3.642
lette-3.536
 Okinawa-3.491
Token again
Token ID757
Feature activation-2.49e-01

Pos logit contributions
 synd+1.075
dylib+0.997
ater+0.989
ulner+0.984
aters+0.982
Neg logit contributions
ignty-0.756
acted-0.740
hedral-0.724
lette-0.723
 Okinawa-0.689
Token soon
Token ID2582
Feature activation+9.32e-02

Pos logit contributions
 synd+0.592
dylib+0.555
ater+0.548
ulner+0.547
aters+0.545
Neg logit contributions
ignty-0.343
acted-0.336
hedral-0.329
lette-0.328
 Okinawa-0.311
Token,
Token ID11
Feature activation+9.92e-02

Pos logit contributions
ignty+0.145
acted+0.141
hedral+0.139
lette+0.137
 wedge+0.131
Neg logit contributions
 synd-0.206
dylib-0.192
ater-0.191
ulner-0.190
aters-0.189
Token or
Token ID393
Feature activation-3.74e-01

Pos logit contributions
ignty+0.174
acted+0.172
lette+0.169
hedral+0.169
 Okinawa+0.163
Neg logit contributions
 synd-0.195
dylib-0.182
ulner-0.180
ater-0.178
culosis-0.178
Token ranking
Token ID12759
Feature activation+3.18e-01

Pos logit contributions
ignty+0.199
acted+0.196
lette+0.192
hedral+0.192
 Okinawa+0.182
Neg logit contributions
 synd-0.291
dylib-0.271
ulner-0.268
ater-0.268
aters-0.266
Token every
Token ID790
Feature activation-1.60e-01

Pos logit contributions
ignty+0.445
acted+0.437
lette+0.428
hedral+0.427
 Okinawa+0.405
Neg logit contributions
 synd-0.746
dylib-0.698
ulner-0.691
ater-0.690
aters-0.686
Token Halo
Token ID18597
Feature activation+1.35e+00

Pos logit contributions
 synd+0.351
dylib+0.323
ater+0.323
ulner+0.320
aters+0.320
Neg logit contributions
ignty-0.249
acted-0.244
hedral-0.239
lette-0.239
 Okinawa-0.228
Token map
Token ID3975
Feature activation+1.02e+00

Pos logit contributions
ignty+2.410
acted+2.380
hedral+2.351
lette+2.349
 wedge+2.258
Neg logit contributions
 synd-2.633
dylib-2.451
ulner-2.426
culosis-2.392
ater-2.385
Token --
Token ID1377
Feature activation-2.05e-02

Pos logit contributions
ignty+1.726
acted+1.700
lette+1.688
hedral+1.670
 wedge+1.613
Neg logit contributions
 synd-2.037
dylib-1.918
ulner-1.904
culosis-1.884
ater-1.877
Token yes
Token ID3763
Feature activation-1.91e+00

Pos logit contributions
 synd+0.042
dylib+0.039
ater+0.039
ulner+0.039
aters+0.039
Neg logit contributions
ignty-0.034
acted-0.034
hedral-0.033
lette-0.033
 Okinawa-0.032
Token,
Token ID11
Feature activation-1.43e-02

Pos logit contributions
 synd+3.462
ater+3.059
dylib+3.049
aters+3.031
ulner+2.986
Neg logit contributions
acted-3.679
ignty-3.668
hedral-3.608
lette-3.509
 Okinawa-3.439
Token all
Token ID477
Feature activation+2.60e-01

Pos logit contributions
 synd+0.032
dylib+0.030
ater+0.029
ulner+0.029
aters+0.029
Neg logit contributions
ignty-0.022
acted-0.022
hedral-0.021
lette-0.021
 Okinawa-0.020
Token 110
Token ID9796
Feature activation-2.99e-01

Pos logit contributions
ignty+0.405
acted+0.399
lette+0.391
hedral+0.391
 Okinawa+0.372
Neg logit contributions
 synd-0.568
dylib-0.529
ulner-0.523
ater-0.522
aters-0.519
Token of
Token ID286
Feature activation+2.16e-01

Pos logit contributions
 synd+0.670
dylib+0.621
ater+0.617
aters+0.614
ulner+0.612
Neg logit contributions
ignty-0.453
acted-0.445
hedral-0.435
lette-0.434
 Okinawa-0.416
Token them
Token ID606
Feature activation+4.79e-01

Pos logit contributions
ignty+0.291
acted+0.283
lette+0.278
hedral+0.278
 Okinawa+0.262
Neg logit contributions
 synd-0.520
dylib-0.487
ater-0.482
ulner-0.480
aters-0.479
Token There
Token ID1318
Feature activation+1.47e-01

Pos logit contributions
 synd+0.051
dylib+0.048
ulner+0.047
ater+0.047
aters+0.047
Neg logit contributions
ignty-0.043
acted-0.042
hedral-0.042
lette-0.042
 wedge-0.040
Token could
Token ID714
Feature activation+3.89e-01

Pos logit contributions
ignty+0.234
acted+0.231
hedral+0.227
lette+0.226
 Okinawa+0.216
Neg logit contributions
 synd-0.318
dylib-0.296
ater-0.293
ulner-0.293
aters-0.291
Token be
Token ID307
Feature activation+4.60e-02

Pos logit contributions
ignty+0.599
acted+0.585
hedral+0.576
lette+0.575
 wedge+0.549
Neg logit contributions
 synd-0.855
dylib-0.801
ulner-0.791
ater-0.788
aters-0.785
Token more
Token ID517
Feature activation-3.25e-01

Pos logit contributions
ignty+0.075
acted+0.073
hedral+0.072
lette+0.072
 Okinawa+0.069
Neg logit contributions
 synd-0.098
dylib-0.091
ulner-0.090
ater-0.090
aters-0.089
Token,
Token ID11
Feature activation+5.55e-03

Pos logit contributions
 synd+0.622
dylib+0.568
ater+0.560
ulner+0.559
aters+0.557
Neg logit contributions
ignty-0.600
acted-0.589
hedral-0.579
lette-0.578
 Okinawa-0.554
Token yes
Token ID3763
Feature activation-1.90e+00

Pos logit contributions
ignty+0.009
acted+0.009
lette+0.009
hedral+0.009
 Okinawa+0.009
Neg logit contributions
 synd-0.011
dylib-0.010
ater-0.010
ulner-0.010
aters-0.010
Token."
Token ID526
Feature activation+3.55e-01

Pos logit contributions
 synd+3.799
ater+3.460
dylib+3.450
aters+3.409
ulner+3.375
Neg logit contributions
acted-3.267
ignty-3.263
hedral-3.212
lette-3.156
 wedge-3.065
TokenNo
Token ID2949
Feature activation-1.85e-01

Pos logit contributions
ignty+0.628
acted+0.619
lette+0.609
hedral+0.608
 Okinawa+0.583
Neg logit contributions
 synd-0.702
dylib-0.650
ulner-0.642
ater-0.640
aters-0.635
Token announcement
Token ID8009
Feature activation-2.38e-02

Pos logit contributions
 synd+0.354
dylib+0.325
ater+0.322
ulner+0.321
aters+0.319
Neg logit contributions
ignty-0.342
acted-0.337
hedral-0.331
lette-0.330
 Okinawa-0.319
Token has
Token ID468
Feature activation+3.69e-05

Pos logit contributions
 synd+0.054
dylib+0.050
ater+0.050
ulner+0.049
aters+0.049
Neg logit contributions
ignty-0.035
acted-0.035
hedral-0.034
lette-0.034
 Okinawa-0.032
Token been
Token ID587
Feature activation+3.35e-01

Pos logit contributions
ignty+0.000
acted+0.000
hedral+0.000
lette+0.000
 Okinawa+0.000
Neg logit contributions
 synd-0.000
dylib-0.000
ulner-0.000
ater-0.000
aters-0.000
Token cost
Token ID1575
Feature activation-1.63e-01

Pos logit contributions
ignty+0.342
acted+0.337
lette+0.331
hedral+0.331
 Okinawa+0.316
Neg logit contributions
 synd-0.443
dylib-0.413
ulner-0.408
ater-0.406
aters-0.404
Token property
Token ID3119
Feature activation-3.36e-01

Pos logit contributions
 synd+0.334
dylib+0.309
ater+0.305
ulner+0.305
aters+0.303
Neg logit contributions
ignty-0.277
acted-0.273
hedral-0.268
lette-0.267
 Okinawa-0.256
Token taxpayers
Token ID14086
Feature activation+1.03e+00

Pos logit contributions
 synd+0.592
dylib+0.538
ater+0.530
ulner+0.529
aters+0.527
Neg logit contributions
ignty-0.668
acted-0.662
hedral-0.652
lette-0.650
 Okinawa-0.625
Token paying
Token ID5989
Feature activation+6.71e-01

Pos logit contributions
ignty+1.668
acted+1.639
lette+1.621
hedral+1.613
 wedge+1.553
Neg logit contributions
 synd-2.153
dylib-2.021
ulner-2.006
ater-1.978
aters-1.974
Token M
Token ID337
Feature activation+2.24e-01

Pos logit contributions
ignty+1.241
acted+1.228
lette+1.215
hedral+1.207
 wedge+1.172
Neg logit contributions
 synd-1.252
dylib-1.165
ulner-1.157
ater-1.141
culosis-1.134
Tokenow
Token ID322
Feature activation-1.90e+00

Pos logit contributions
ignty+0.399
acted+0.392
hedral+0.384
lette+0.384
 Okinawa+0.370
Neg logit contributions
 synd-0.440
dylib-0.407
ater-0.406
ulner-0.403
aters-0.400
Tokenatt
Token ID1078
Feature activation+6.25e-01

Pos logit contributions
 synd+4.221
ater+4.012
aters+3.981
dylib+3.954
itis+3.816
Neg logit contributions
ignty-2.848
acted-2.763
hedral-2.716
lette-2.628
 Okinawa-2.580
Token Trust
Token ID9870
Feature activation+2.29e-01

Pos logit contributions
ignty+1.138
acted+1.124
lette+1.110
hedral+1.104
 Okinawa+1.065
Neg logit contributions
 synd-1.193
dylib-1.105
ulner-1.097
ater-1.082
aters-1.074
Token for
Token ID329
Feature activation-8.81e-02

Pos logit contributions
ignty+0.412
acted+0.406
lette+0.400
hedral+0.398
 Okinawa+0.383
Neg logit contributions
 synd-0.446
dylib-0.414
ulner-0.409
ater-0.406
aters-0.404
Token their
Token ID511
Feature activation-3.32e-02

Pos logit contributions
 synd+0.178
dylib+0.165
ater+0.163
ulner+0.162
aters+0.162
Neg logit contributions
ignty-0.152
acted-0.150
hedral-0.148
lette-0.147
 Okinawa-0.140
Token loss
Token ID2994
Feature activation+5.81e-01

Pos logit contributions
 synd+0.063
dylib+0.058
ulner+0.057
ater+0.057
aters+0.057
Neg logit contributions
ignty-0.062
acted-0.061
hedral-0.060
lette-0.060
 Okinawa-0.057
Tokenfried
Token ID25520
Feature activation+8.99e-01

Pos logit contributions
ignty+0.535
acted+0.526
lette+0.516
hedral+0.513
 Okinawa+0.488
Neg logit contributions
 synd-0.901
dylib-0.845
ulner-0.837
ater-0.833
aters-0.829
Token-
Token ID12
Feature activation+5.11e-01

Pos logit contributions
ignty+1.480
acted+1.468
lette+1.456
hedral+1.441
 wedge+1.407
Neg logit contributions
 synd-1.846
dylib-1.737
ulner-1.725
aters-1.697
ater-1.694
Tokenrice
Token ID20970
Feature activation+5.44e-01

Pos logit contributions
ignty+0.784
acted+0.773
lette+0.760
hedral+0.756
 Okinawa+0.720
Neg logit contributions
 synd-1.130
dylib-1.054
ulner-1.045
ater-1.037
aters-1.033
Token dish
Token ID9433
Feature activation+2.10e-01

Pos logit contributions
ignty+0.850
acted+0.838
lette+0.825
hedral+0.821
 Okinawa+0.784
Neg logit contributions
 synd-1.182
dylib-1.106
ulner-1.095
ater-1.088
aters-1.083
Token with
Token ID351
Feature activation+2.75e-01

Pos logit contributions
ignty+0.373
acted+0.367
hedral+0.360
lette+0.360
 Okinawa+0.345
Neg logit contributions
 synd-0.415
dylib-0.384
ulner-0.379
ater-0.378
aters-0.375
Token esc
Token ID3671
Feature activation+1.68e+00

Pos logit contributions
ignty+0.429
acted+0.421
hedral+0.413
lette+0.413
 Okinawa+0.392
Neg logit contributions
 synd-0.603
dylib-0.561
ater-0.554
ulner-0.554
aters-0.550
Tokenolar
Token ID6192
Feature activation+7.08e-01

Pos logit contributions
ignty+2.851
acted+2.843
lette+2.824
hedral+2.771
onso+2.665
Neg logit contributions
 synd-3.430
dylib-3.161
ulner-3.117
culosis-3.086
 creditor-3.004
Token and
Token ID290
Feature activation+2.17e-01

Pos logit contributions
ignty+1.166
acted+1.145
hedral+1.124
lette+1.116
 Okinawa+1.073
Neg logit contributions
 synd-1.493
dylib-1.385
ater-1.369
ulner-1.366
aters-1.357
Token Chinese
Token ID3999
Feature activation+7.88e-01

Pos logit contributions
ignty+0.349
acted+0.343
hedral+0.336
lette+0.335
 Okinawa+0.317
Neg logit contributions
 synd-0.466
dylib-0.430
ater-0.426
ulner-0.424
aters-0.422
Token sausage
Token ID31889
Feature activation+1.06e+00

Pos logit contributions
ignty+1.236
acted+1.218
lette+1.197
hedral+1.196
 Okinawa+1.142
Neg logit contributions
 synd-1.704
dylib-1.595
ulner-1.579
ater-1.568
aters-1.560
Token that
Token ID326
Feature activation+4.55e-02

Pos logit contributions
ignty+1.756
acted+1.739
lette+1.722
hedral+1.710
 wedge+1.648
Neg logit contributions
 synd-2.172
dylib-2.035
ulner-2.021
ater-1.997
culosis-1.991
Token times
Token ID1661
Feature activation+5.54e-01

Pos logit contributions
ignty+0.096
acted+0.095
lette+0.093
hedral+0.093
 Okinawa+0.089
Neg logit contributions
 synd-0.113
dylib-0.105
ulner-0.103
ater-0.103
aters-0.102
Token.
Token ID13
Feature activation+8.48e-02

Pos logit contributions
ignty+0.869
acted+0.856
lette+0.843
hedral+0.840
 Okinawa+0.803
Neg logit contributions
 synd-1.197
dylib-1.121
ulner-1.112
ater-1.102
aters-1.097
Token The
Token ID383
Feature activation-1.32e-01

Pos logit contributions
ignty+0.133
acted+0.130
hedral+0.128
lette+0.128
 wedge+0.121
Neg logit contributions
 synd-0.185
dylib-0.173
ater-0.170
ulner-0.170
aters-0.169
Token drummer
Token ID34269
Feature activation+1.78e-01

Pos logit contributions
 synd+0.222
dylib+0.203
ulner+0.200
ater+0.199
aters+0.197
Neg logit contributions
ignty-0.272
acted-0.269
lette-0.265
hedral-0.265
 Okinawa-0.256
Token looked
Token ID3114
Feature activation+8.65e-01

Pos logit contributions
ignty+0.216
acted+0.209
lette+0.204
hedral+0.204
 Okinawa+0.192
Neg logit contributions
 synd-0.453
dylib-0.426
ater-0.423
ulner-0.422
aters-0.419
Token like
Token ID588
Feature activation+1.12e+00

Pos logit contributions
ignty+1.170
acted+1.160
lette+1.135
hedral+1.131
 Okinawa+1.077
Neg logit contributions
 synd-2.049
dylib-1.931
ulner-1.912
ater-1.894
aters-1.891
Token Tim
Token ID5045
Feature activation-1.02e-01

Pos logit contributions
ignty+1.630
acted+1.615
lette+1.584
hedral+1.580
 Okinawa+1.509
Neg logit contributions
 synd-2.549
dylib-2.392
ulner-2.373
ater-2.348
aters-2.342
Token Y
Token ID575
Feature activation+1.20e-01

Pos logit contributions
 synd+0.213
dylib+0.198
ulner+0.196
ater+0.195
aters+0.194
Neg logit contributions
ignty-0.167
acted-0.165
lette-0.162
hedral-0.162
 Okinawa-0.155
Tokenoh
Token ID1219
Feature activation-1.11e+00

Pos logit contributions
ignty+0.204
acted+0.201
lette+0.198
hedral+0.197
 Okinawa+0.188
Neg logit contributions
 synd-0.247
dylib-0.229
ulner-0.226
ater-0.225
aters-0.224
Tokenannon
Token ID8825
Feature activation+1.33e-01

Pos logit contributions
 synd+2.425
ater+2.233
dylib+2.222
ulner+2.221
aters+2.208
Neg logit contributions
ignty-1.735
acted-1.696
hedral-1.665
lette-1.636
 Okinawa-1.599
Token with
Token ID351
Feature activation-7.25e-02

Pos logit contributions
ignty+0.213
acted+0.209
lette+0.206
hedral+0.205
 Okinawa+0.196
Neg logit contributions
 synd-0.285
dylib-0.266
ulner-0.263
ater-0.262
aters-0.260
Token 2010
Token ID3050
Feature activation-5.25e-02

Pos logit contributions
ignty+0.494
acted+0.477
hedral+0.469
lette+0.464
 wedge+0.434
Neg logit contributions
 synd-1.075
dylib-1.009
ater-0.997
ulner-0.992
aters-0.991
Token United
Token ID1578
Feature activation+4.62e-01

Pos logit contributions
 synd+0.094
dylib+0.086
ater+0.085
ulner+0.085
aters+0.084
Neg logit contributions
ignty-0.103
acted-0.101
hedral-0.100
lette-0.100
 wedge-0.096
Token States
Token ID1829
Feature activation+4.21e-01

Pos logit contributions
ignty+1.082
acted+1.074
lette+1.057
hedral+1.054
 Okinawa+1.029
Neg logit contributions
 synd-0.641
dylib-0.579
ulner-0.568
ater-0.560
aters-0.553
Token 21
Token ID2310
Feature activation-3.74e-01

Pos logit contributions
ignty+0.621
acted+0.612
lette+0.600
hedral+0.597
 Okinawa+0.573
Neg logit contributions
 synd-0.951
dylib-0.891
ulner-0.884
ater-0.878
aters-0.873
Token50
Token ID1120
Feature activation+3.19e-01

Pos logit contributions
 synd+0.837
dylib+0.776
ater+0.765
ulner+0.764
aters+0.761
Neg logit contributions
ignty-0.575
acted-0.559
hedral-0.554
lette-0.550
 wedge-0.521
Token Posts
Token ID12043
Feature activation+1.43e+00

Pos logit contributions
ignty+0.441
acted+0.436
lette+0.428
hedral+0.423
 Okinawa+0.408
Neg logit contributions
 synd-0.746
dylib-0.700
ulner-0.696
ater-0.691
culosis-0.687
Token #
Token ID1303
Feature activation-1.16e-01

Pos logit contributions
ignty+2.794
acted+2.738
lette+2.726
hedral+2.694
 Okinawa+2.622
Neg logit contributions
 synd-2.516
dylib-2.334
ulner-2.310
ater-2.284
aters-2.277
Token16
Token ID1433
Feature activation+8.35e-01

Pos logit contributions
 synd+0.230
ulner+0.213
dylib+0.210
aters+0.207
ater+0.207
Neg logit contributions
acted-0.205
ignty-0.204
lette-0.202
hedral-0.197
front-0.193
Token Glad
Token ID28442
Feature activation+7.43e-01

Pos logit contributions
ignty+1.327
acted+1.313
hedral+1.289
lette+1.289
 Okinawa+1.233
Neg logit contributions
 synd-1.772
dylib-1.669
ulner-1.654
ater-1.635
aters-1.625
Token to
Token ID284
Feature activation+2.70e-01

Pos logit contributions
ignty+1.211
acted+1.198
lette+1.177
hedral+1.172
 Okinawa+1.126
Neg logit contributions
 synd-1.543
dylib-1.459
ulner-1.445
ater-1.421
culosis-1.413
Token hear
Token ID3285
Feature activation-1.91e-01

Pos logit contributions
ignty+0.553
acted+0.549
hedral+0.544
lette+0.544
 wedge+0.529
Neg logit contributions
 synd-0.444
dylib-0.410
ulner-0.405
ater-0.399
aters-0.397
Tokenager
Token ID3536
Feature activation+9.04e-01

Pos logit contributions
 synd+0.797
ater+0.751
dylib+0.751
ulner+0.744
aters+0.743
Neg logit contributions
ignty-0.485
acted-0.469
hedral-0.465
lette-0.462
 Okinawa-0.447
Token_
Token ID62
Feature activation+7.09e-01

Pos logit contributions
ignty+1.424
acted+1.409
lette+1.378
hedral+1.376
 Okinawa+1.311
Neg logit contributions
 synd-1.961
dylib-1.820
ulner-1.801
ater-1.789
aters-1.783
Tokenbi
Token ID8482
Feature activation-2.24e-01

Pos logit contributions
ignty+1.122
acted+1.106
lette+1.085
hedral+1.082
 Okinawa+1.033
Neg logit contributions
 synd-1.533
dylib-1.426
ulner-1.410
ater-1.402
aters-1.395
Tokenography
Token ID4867
Feature activation-1.12e-02

Pos logit contributions
 synd+0.485
dylib+0.454
ulner+0.451
ater+0.451
aters+0.446
Neg logit contributions
ignty-0.351
acted-0.343
lette-0.339
hedral-0.336
 wedge-0.324
Token.
Token ID13
Feature activation+2.82e-01

Pos logit contributions
 synd+0.025
dylib+0.023
ulner+0.023
ater+0.023
aters+0.023
Neg logit contributions
ignty-0.017
acted-0.017
lette-0.017
hedral-0.017
 wedge-0.016
Tokenhtml
Token ID6494
Feature activation+1.45e+00

Pos logit contributions
ignty+0.336
acted+0.328
lette+0.321
hedral+0.320
 Okinawa+0.300
Neg logit contributions
 synd-0.720
dylib-0.680
ulner-0.673
ater-0.672
aters-0.668
Token
Token ID198
Feature activation+3.28e-01

Pos logit contributions
ignty+2.138
acted+2.128
lette+2.095
hedral+2.080
 Okinawa+2.003
Neg logit contributions
 synd-3.233
dylib-3.026
ulner-3.003
ater-2.973
aters-2.968
Token
Token ID198
Feature activation+2.61e-01

Pos logit contributions
ignty+0.483
acted+0.478
lette+0.468
hedral+0.466
 Okinawa+0.445
Neg logit contributions
 synd-0.740
dylib-0.690
ulner-0.688
ater-0.681
culosis-0.679
Token
Token ID447
Feature activation-1.55e-01

Pos logit contributions
ignty+0.418
acted+0.409
hedral+0.402
lette+0.401
 wedge+0.384
Neg logit contributions
 synd-0.559
dylib-0.524
ater-0.517
ulner-0.517
aters-0.513
Token
Token ID250
Feature activation+2.72e-01

Pos logit contributions
 synd+0.222
dylib+0.201
ater+0.197
ulner+0.197
aters+0.195
Neg logit contributions
ignty-0.357
acted-0.353
lette-0.349
hedral-0.348
 Okinawa-0.338
TokenChuck
Token ID44324
Feature activation-3.88e-01

Pos logit contributions
ignty+0.496
acted+0.487
lette+0.479
hedral+0.479
 Okinawa+0.461
Neg logit contributions
 synd-0.524
dylib-0.485
ulner-0.478
ater-0.477
aters-0.473
Token paid
Token ID3432
Feature activation+7.05e-01

Pos logit contributions
ignty+0.363
acted+0.361
lette+0.355
hedral+0.353
 wedge+0.339
Neg logit contributions
 synd-0.448
dylib-0.423
ulner-0.419
culosis-0.414
ater-0.414
Token and
Token ID290
Feature activation+5.02e-02

Pos logit contributions
ignty+1.223
acted+1.214
lette+1.187
hedral+1.179
 wedge+1.138
Neg logit contributions
 synd-1.396
dylib-1.315
ulner-1.298
ater-1.282
culosis-1.280
Token what
Token ID644
Feature activation+1.58e-01

Pos logit contributions
ignty+0.092
acted+0.091
lette+0.090
hedral+0.089
 Okinawa+0.086
Neg logit contributions
 synd-0.095
dylib-0.088
ulner-0.088
ater-0.086
culosis-0.086
Token you
Token ID345
Feature activation+3.32e-01

Pos logit contributions
ignty+0.248
acted+0.245
lette+0.240
hedral+0.238
 Okinawa+0.229
Neg logit contributions
 synd-0.342
dylib-0.322
ulner-0.319
ater-0.316
aters-0.314
Token are
Token ID389
Feature activation+4.05e-01

Pos logit contributions
ignty+0.540
acted+0.534
lette+0.525
hedral+0.523
 wedge+0.503
Neg logit contributions
 synd-0.696
dylib-0.657
ulner-0.650
ater-0.642
culosis-0.638
Token paying
Token ID5989
Feature activation+1.18e+00

Pos logit contributions
ignty+0.698
acted+0.692
lette+0.681
hedral+0.677
 wedge+0.653
Neg logit contributions
 synd-0.808
dylib-0.760
ulner-0.754
ater-0.743
culosis-0.743
Token for
Token ID329
Feature activation+8.94e-02

Pos logit contributions
acted+1.933
ignty+1.928
lette+1.897
hedral+1.866
 Okinawa+1.833
Neg logit contributions
 synd-2.384
dylib-2.289
ulner-2.275
culosis-2.238
ater-2.224
Token at
Token ID379
Feature activation+2.32e-01

Pos logit contributions
ignty+0.145
acted+0.142
lette+0.140
hedral+0.140
 Okinawa+0.133
Neg logit contributions
 synd-0.190
dylib-0.177
ulner-0.175
ater-0.175
aters-0.174
Token the
Token ID262
Feature activation+2.20e-02

Pos logit contributions
ignty+0.359
acted+0.355
lette+0.349
hedral+0.347
 Okinawa+0.333
Neg logit contributions
 synd-0.505
dylib-0.476
ulner-0.471
ater-0.467
aters-0.465
Token supermarket
Token ID26454
Feature activation-1.31e-01

Pos logit contributions
ignty+0.038
acted+0.038
lette+0.037
hedral+0.037
 wedge+0.036
Neg logit contributions
 synd-0.044
dylib-0.041
ulner-0.041
ater-0.040
aters-0.040
Token,"
Token ID553
Feature activation-1.47e-01

Pos logit contributions
 synd+0.298
dylib+0.278
ulner+0.275
ater+0.274
aters+0.273
Neg logit contributions
ignty-0.194
acted-0.190
hedral-0.186
lette-0.186
 Okinawa-0.177
Token variety
Token ID4996
Feature activation-5.54e-02

Pos logit contributions
 synd+0.576
dylib+0.532
ater+0.525
ulner+0.522
aters+0.518
Neg logit contributions
ignty-0.435
acted-0.419
hedral-0.412
lette-0.411
 Okinawa-0.394
Token of
Token ID286
Feature activation+1.67e-01

Pos logit contributions
 synd+0.119
dylib+0.111
ulner+0.109
ater+0.109
aters+0.108
Neg logit contributions
ignty-0.089
acted-0.087
lette-0.086
hedral-0.086
 Okinawa-0.082
Token health
Token ID1535
Feature activation-2.55e-01

Pos logit contributions
ignty+0.325
acted+0.321
lette+0.316
hedral+0.316
 Okinawa+0.304
Neg logit contributions
 synd-0.302
dylib-0.277
ulner-0.273
ater-0.272
aters-0.270
Token and
Token ID290
Feature activation+5.48e-02

Pos logit contributions
 synd+0.448
dylib+0.406
ulner+0.402
ater+0.401
aters+0.397
Neg logit contributions
ignty-0.511
acted-0.503
lette-0.496
hedral-0.496
 Okinawa-0.479
Token recovery
Token ID7628
Feature activation-1.11e-01

Pos logit contributions
ignty+0.097
acted+0.095
lette+0.093
hedral+0.093
 Okinawa+0.089
Neg logit contributions
 synd-0.109
dylib-0.101
ater-0.099
ulner-0.099
aters-0.098
Token purposes
Token ID4959
Feature activation+1.26e-01

Pos logit contributions
 synd+0.188
dylib+0.170
ulner+0.167
ater+0.167
aters+0.165
Neg logit contributions
ignty-0.229
acted-0.225
hedral-0.222
lette-0.222
 Okinawa-0.215
Token.
Token ID13
Feature activation-1.23e-01

Pos logit contributions
ignty+0.196
acted+0.192
lette+0.189
hedral+0.189
 Okinawa+0.180
Neg logit contributions
 synd-0.275
dylib-0.257
ulner-0.254
ater-0.253
aters-0.251
Token At
Token ID1629
Feature activation+2.84e-01

Pos logit contributions
 synd+0.264
dylib+0.246
ater+0.242
ulner+0.242
aters+0.241
Neg logit contributions
ignty-0.197
acted-0.193
hedral-0.191
lette-0.189
 wedge-0.180
Token most
Token ID749
Feature activation-1.20e-01

Pos logit contributions
ignty+0.437
acted+0.431
lette+0.423
hedral+0.422
 Okinawa+0.403
Neg logit contributions
 synd-0.624
dylib-0.585
ulner-0.577
ater-0.575
aters-0.572
Token,
Token ID11
Feature activation-2.09e-01

Pos logit contributions
 synd+0.241
dylib+0.221
ater+0.219
ulner+0.218
aters+0.217
Neg logit contributions
ignty-0.210
acted-0.207
hedral-0.203
lette-0.202
 Okinawa-0.193
Token it
Token ID340
Feature activation+7.79e-03

Pos logit contributions
 synd+0.450
dylib+0.415
ater+0.411
ulner+0.410
aters+0.409
Neg logit contributions
ignty-0.337
acted-0.330
hedral-0.325
lette-0.324
 Okinawa-0.307
Token must
Token ID1276
Feature activation-1.71e-01

Pos logit contributions
 synd+0.747
dylib+0.694
ater+0.688
ulner+0.685
aters+0.683
Neg logit contributions
ignty-0.523
acted-0.515
lette-0.503
hedral-0.503
 Okinawa-0.479
Token be
Token ID307
Feature activation+2.02e-02

Pos logit contributions
 synd+0.364
dylib+0.339
ater+0.335
ulner+0.335
aters+0.332
Neg logit contributions
ignty-0.278
acted-0.273
hedral-0.268
lette-0.268
 Okinawa-0.257
Token tough
Token ID5802
Feature activation+3.63e-01

Pos logit contributions
ignty+0.031
acted+0.031
lette+0.030
hedral+0.030
 Okinawa+0.029
Neg logit contributions
 synd-0.045
dylib-0.042
ulner-0.041
ater-0.041
aters-0.041
Token for
Token ID329
Feature activation+9.46e-02

Pos logit contributions
ignty+0.647
acted+0.641
lette+0.630
hedral+0.629
 Okinawa+0.604
Neg logit contributions
 synd-0.708
dylib-0.659
ulner-0.650
ater-0.646
aters-0.642
Token the
Token ID262
Feature activation-2.79e-01

Pos logit contributions
ignty+0.156
acted+0.154
lette+0.151
hedral+0.151
 Okinawa+0.144
Neg logit contributions
 synd-0.198
dylib-0.184
ulner-0.182
ater-0.182
aters-0.181
Token little
Token ID1310
Feature activation+1.09e-02

Pos logit contributions
 synd+0.599
dylib+0.555
ater+0.551
ulner+0.548
aters+0.547
Neg logit contributions
ignty-0.448
acted-0.441
lette-0.433
hedral-0.433
 Okinawa-0.411
Token guy
Token ID3516
Feature activation-2.52e-01

Pos logit contributions
ignty+0.017
acted+0.017
lette+0.017
hedral+0.017
 Okinawa+0.016
Neg logit contributions
 synd-0.023
dylib-0.022
ulner-0.021
ater-0.021
aters-0.021
Token (
Token ID357
Feature activation+1.38e-01

Pos logit contributions
 synd+0.533
dylib+0.494
ater+0.489
ulner+0.488
aters+0.486
Neg logit contributions
ignty-0.413
acted-0.407
lette-0.399
hedral-0.398
 Okinawa-0.380
Tokenwhich
Token ID4758
Feature activation-2.84e-01

Pos logit contributions
ignty+0.207
acted+0.203
hedral+0.199
lette+0.199
 Okinawa+0.190
Neg logit contributions
 synd-0.311
dylib-0.291
ulner-0.287
ater-0.287
aters-0.285
Token,
Token ID11
Feature activation-1.60e-03

Pos logit contributions
 synd+0.630
dylib+0.584
ater+0.580
ulner+0.577
aters+0.575
Neg logit contributions
ignty-0.436
acted-0.428
hedral-0.420
lette-0.420
 wedge-0.398
Token come
Token ID1282
Feature activation-7.98e-02

Pos logit contributions
 synd+0.003
dylib+0.003
ulner+0.003
ater+0.003
aters+0.003
Neg logit contributions
ignty-0.003
acted-0.003
lette-0.003
hedral-0.003
 Okinawa-0.003
Token?
Token ID30
Feature activation-7.71e-01

Pos logit contributions
 synd+0.856
dylib+0.802
ulner+0.793
ater+0.792
aters+0.788
Neg logit contributions
ignty-0.510
acted-0.500
lette-0.490
hedral-0.489
 Okinawa-0.465
Token
Token ID198
Feature activation-2.01e-01

Pos logit contributions
 synd+1.658
dylib+1.536
ulner+1.519
ater+1.517
aters+1.507
Neg logit contributions
ignty-1.239
acted-1.220
lette-1.201
hedral-1.196
 Okinawa-1.138
Token
Token ID198
Feature activation-9.65e-02

Pos logit contributions
 synd+0.441
dylib+0.413
ater+0.406
ulner+0.406
aters+0.403
Neg logit contributions
ignty-0.313
acted-0.306
hedral-0.301
lette-0.301
 Okinawa-0.286
Tokenand
Token ID392
Feature activation+1.91e-01

Pos logit contributions
 synd+0.205
dylib+0.191
ulner+0.189
ater+0.189
aters+0.187
Neg logit contributions
ignty-0.156
acted-0.153
hedral-0.151
lette-0.151
 Okinawa-0.145
Token how
Token ID703
Feature activation+2.52e-01

Pos logit contributions
ignty+0.384
acted+0.379
lette+0.373
hedral+0.373
 Okinawa+0.359
Neg logit contributions
 synd-0.331
dylib-0.303
ulner-0.298
ater-0.297
aters-0.295
Token does
Token ID857
Feature activation+5.80e-01

Pos logit contributions
ignty+0.430
acted+0.423
lette+0.418
hedral+0.415
 Okinawa+0.400
Neg logit contributions
 synd-0.508
dylib-0.473
ulner-0.470
ater-0.467
aters-0.464
Token it
Token ID340
Feature activation+1.44e-01

Pos logit contributions
ignty+0.760
acted+0.759
lette+0.741
hedral+0.739
 wedge+0.704
Neg logit contributions
 synd-1.381
ulner-1.308
dylib-1.307
culosis-1.300
ater-1.289
Token look
Token ID804
Feature activation+4.89e-01

Pos logit contributions
ignty+0.150
acted+0.146
hedral+0.143
lette+0.143
 wedge+0.133
Neg logit contributions
 synd-0.387
dylib-0.367
ulner-0.366
ater-0.363
aters-0.362
Token from
Token ID422
Feature activation+1.77e-01

Pos logit contributions
ignty+0.735
acted+0.725
lette+0.711
hedral+0.711
 wedge+0.676
Neg logit contributions
 synd-1.090
dylib-1.021
ulner-1.013
ater-1.007
aters-1.003
Token the
Token ID262
Feature activation+1.03e-02

Pos logit contributions
ignty+0.270
acted+0.266
lette+0.261
hedral+0.261
 Okinawa+0.249
Neg logit contributions
 synd-0.393
dylib-0.368
ulner-0.364
ater-0.363
aters-0.361
Token perspective
Token ID6650
Feature activation+6.14e-01

Pos logit contributions
ignty+0.012
acted+0.012
lette+0.011
hedral+0.011
 Okinawa+0.011
Neg logit contributions
 synd-0.027
dylib-0.025
ater-0.025
ulner-0.025
aters-0.025
TokenNC
Token ID7792
Feature activation-5.77e-01

Pos logit contributions
 synd+1.002
dylib+0.906
ater+0.897
ulner+0.891
aters+0.889
Neg logit contributions
ignty-1.235
acted-1.220
hedral-1.198
lette-1.192
 Okinawa-1.155
Token)
Token ID8
Feature activation-4.66e-01

Pos logit contributions
 synd+1.179
dylib+1.094
ulner+1.081
ater+1.078
aters+1.071
Neg logit contributions
ignty-0.984
acted-0.969
lette-0.952
hedral-0.951
 Okinawa-0.911
Token people
Token ID661
Feature activation-1.42e-01

Pos logit contributions
 synd+0.881
dylib+0.810
ulner+0.799
ater+0.797
aters+0.792
Neg logit contributions
ignty-0.866
acted-0.853
lette-0.840
hedral-0.839
 Okinawa-0.807
Token,
Token ID11
Feature activation-2.48e-01

Pos logit contributions
 synd+0.301
dylib+0.279
ulner+0.276
ater+0.276
aters+0.274
Neg logit contributions
ignty-0.231
acted-0.227
hedral-0.223
lette-0.223
 Okinawa-0.214
Token such
Token ID884
Feature activation-2.41e-01

Pos logit contributions
 synd+0.489
dylib+0.453
ulner+0.446
ater+0.445
aters+0.442
Neg logit contributions
ignty-0.439
acted-0.432
lette-0.425
hedral-0.425
 Okinawa-0.407
Token as
Token ID355
Feature activation+2.35e-02

Pos logit contributions
 synd+0.419
dylib+0.387
ulner+0.385
ater+0.380
aters+0.377
Neg logit contributions
ignty-0.481
acted-0.475
lette-0.470
hedral-0.468
 Okinawa-0.453
Token gender
Token ID5279
Feature activation+2.47e-01

Pos logit contributions
ignty+0.043
acted+0.043
lette+0.042
hedral+0.042
 Okinawa+0.041
Neg logit contributions
 synd-0.044
dylib-0.041
ulner-0.041
ater-0.040
aters-0.040
Tokenflu
Token ID35522
Feature activation+1.91e-01

Pos logit contributions
ignty+0.395
acted+0.389
lette+0.382
hedral+0.381
 Okinawa+0.364
Neg logit contributions
 synd-0.531
dylib-0.494
ulner-0.488
ater-0.487
aters-0.484
Tokenid
Token ID312
Feature activation+7.02e-01

Pos logit contributions
ignty+0.382
acted+0.377
lette+0.371
hedral+0.371
 Okinawa+0.358
Neg logit contributions
 synd-0.335
dylib-0.307
ulner-0.303
ater-0.301
aters-0.299
Token and
Token ID290
Feature activation+2.15e-01

Pos logit contributions
ignty+1.231
acted+1.224
lette+1.215
hedral+1.201
 Okinawa+1.160
Neg logit contributions
 synd-1.366
dylib-1.278
ulner-1.271
ater-1.248
culosis-1.241
Token gender
Token ID5279
Feature activation+3.22e-01

Pos logit contributions
ignty+0.401
acted+0.396
lette+0.391
hedral+0.391
 Okinawa+0.378
Neg logit contributions
 synd-0.400
dylib-0.372
ulner-0.370
ater-0.365
aters-0.363
Tokenaro
Token ID12022
Feature activation-1.66e-01

Pos logit contributions
 synd+1.794
dylib+1.707
ater+1.701
ulner+1.666
aters+1.664
Neg logit contributions
ignty-1.566
acted-1.535
hedral-1.506
lette-1.476
 Okinawa-1.457
Token (
Token ID357
Feature activation+3.25e-02

Pos logit contributions
 synd+0.376
dylib+0.351
ulner+0.349
ater+0.348
aters+0.345
Neg logit contributions
ignty-0.246
acted-0.241
hedral-0.237
lette-0.235
 Okinawa-0.224
Token117
Token ID17657
Feature activation-5.08e-01

Pos logit contributions
ignty+0.053
acted+0.053
lette+0.052
hedral+0.052
 Okinawa+0.049
Neg logit contributions
 synd-0.068
dylib-0.064
ulner-0.063
ater-0.063
aters-0.062
Token)
Token ID8
Feature activation-4.49e-01

Pos logit contributions
 synd+1.122
dylib+1.056
ater+1.042
aters+1.031
ulner+1.029
Neg logit contributions
ignty-0.784
acted-0.753
hedral-0.751
lette-0.747
 Okinawa-0.712
Token (
Token ID357
Feature activation-9.10e-02

Pos logit contributions
 synd+1.009
dylib+0.944
ater+0.931
ulner+0.926
aters+0.923
Neg logit contributions
ignty-0.679
acted-0.660
hedral-0.651
lette-0.649
 wedge-0.613
Token25
Token ID1495
Feature activation+3.71e-02

Pos logit contributions
 synd+0.193
dylib+0.181
ulner+0.178
ater+0.178
aters+0.176
Neg logit contributions
ignty-0.148
acted-0.145
hedral-0.142
lette-0.142
 Okinawa-0.136
Token)
Token ID8
Feature activation-4.34e-01

Pos logit contributions
ignty+0.055
acted+0.053
hedral+0.052
lette+0.052
 Okinawa+0.050
Neg logit contributions
 synd-0.085
dylib-0.079
ater-0.078
ulner-0.078
aters-0.078
Token
Token ID198
Feature activation-5.16e-02

Pos logit contributions
 synd+0.930
dylib+0.868
ulner+0.858
ater+0.855
aters+0.848
Neg logit contributions
ignty-0.695
acted-0.682
hedral-0.672
lette-0.671
 wedge-0.637
Token
Token ID198
Feature activation-4.49e-02

Pos logit contributions
 synd+0.114
dylib+0.106
ulner+0.105
ater+0.105
aters+0.104
Neg logit contributions
ignty-0.079
acted-0.078
lette-0.076
hedral-0.076
 Okinawa-0.073
Token21
Token ID2481
Feature activation-2.18e-01

Pos logit contributions
 synd+0.095
dylib+0.089
ulner+0.088
ater+0.088
aters+0.087
Neg logit contributions
ignty-0.073
acted-0.071
lette-0.070
hedral-0.070
 Okinawa-0.067
Token.
Token ID13
Feature activation+7.71e-02

Pos logit contributions
 synd+0.487
dylib+0.457
ater+0.450
ulner+0.449
aters+0.447
Neg logit contributions
ignty-0.332
acted-0.324
hedral-0.319
lette-0.318
 wedge-0.301
Token,
Token ID11
Feature activation+1.27e-01

Pos logit contributions
 synd+1.385
dylib+1.272
ater+1.260
ulner+1.251
aters+1.246
Neg logit contributions
ignty-1.291
acted-1.272
hedral-1.253
lette-1.243
 wedge-1.198
Token even
Token ID772
Feature activation-3.47e-01

Pos logit contributions
ignty+0.227
acted+0.225
lette+0.221
hedral+0.220
 Okinawa+0.214
Neg logit contributions
 synd-0.247
dylib-0.231
ulner-0.229
ater-0.226
aters-0.224
Token Iran
Token ID4068
Feature activation-8.70e-01

Pos logit contributions
 synd+0.702
dylib+0.650
ulner+0.642
ater+0.641
aters+0.637
Neg logit contributions
ignty-0.599
acted-0.589
lette-0.579
hedral-0.579
 Okinawa-0.553
Token.
Token ID13
Feature activation+4.31e-02

Pos logit contributions
 synd+1.721
dylib+1.582
ater+1.574
aters+1.561
ulner+1.560
Neg logit contributions
ignty-1.543
acted-1.516
hedral-1.490
lette-1.479
 wedge-1.430
Token Our
Token ID3954
Feature activation-2.71e-01

Pos logit contributions
ignty+0.068
acted+0.066
lette+0.065
hedral+0.065
 Okinawa+0.062
Neg logit contributions
 synd-0.094
dylib-0.088
ulner-0.087
ater-0.086
aters-0.086
Token job
Token ID1693
Feature activation-2.45e-01

Pos logit contributions
 synd+0.552
dylib+0.507
ater+0.504
ulner+0.502
aters+0.500
Neg logit contributions
ignty-0.470
acted-0.459
hedral-0.453
lette-0.453
 Okinawa-0.432
Token on
Token ID319
Feature activation+2.58e-01

Pos logit contributions
 synd+0.537
dylib+0.500
ulner+0.495
ater+0.494
aters+0.491
Neg logit contributions
ignty-0.382
acted-0.375
lette-0.368
hedral-0.368
 Okinawa-0.351
Token the
Token ID262
Feature activation+1.37e-01

Pos logit contributions
ignty+0.425
acted+0.419
lette+0.411
hedral+0.411
 Okinawa+0.394
Neg logit contributions
 synd-0.538
dylib-0.501
ulner-0.496
ater-0.493
aters-0.490
Token international
Token ID3230
Feature activation+1.89e-01

Pos logit contributions
ignty+0.243
acted+0.240
lette+0.236
hedral+0.235
 Okinawa+0.227
Neg logit contributions
 synd-0.269
dylib-0.250
ulner-0.247
ater-0.245
aters-0.243
Token cinema
Token ID22041
Feature activation+1.42e-01

Pos logit contributions
ignty+0.406
acted+0.404
lette+0.398
hedral+0.397
 Okinawa+0.386
Neg logit contributions
 synd-0.296
dylib-0.273
ulner-0.269
ater-0.265
aters-0.263
Token stage
Token ID3800
Feature activation-3.06e-01

Pos logit contributions
ignty+0.269
acted+0.266
lette+0.262
hedral+0.261
 Okinawa+0.252
Neg logit contributions
 synd-0.261
dylib-0.242
ulner-0.239
ater-0.237
aters-0.235
Token says
Token ID1139
Feature activation-1.17e-01

Pos logit contributions
 synd+0.189
dylib+0.176
ulner+0.174
ater+0.174
aters+0.173
Neg logit contributions
ignty-0.139
acted-0.136
hedral-0.134
lette-0.133
 Okinawa-0.128
Token in
Token ID287
Feature activation+1.65e-01

Pos logit contributions
 synd+0.252
dylib+0.235
ulner+0.232
ater+0.232
aters+0.230
Neg logit contributions
ignty-0.188
acted-0.184
lette-0.180
hedral-0.180
 Okinawa-0.172
Token the
Token ID262
Feature activation+2.16e-01

Pos logit contributions
ignty+0.246
acted+0.242
lette+0.237
hedral+0.236
 Okinawa+0.225
Neg logit contributions
 synd-0.374
dylib-0.350
ulner-0.346
ater-0.345
aters-0.343
Token ad
Token ID512
Feature activation+2.93e-01

Pos logit contributions
ignty+0.387
acted+0.385
lette+0.378
hedral+0.377
 Okinawa+0.364
Neg logit contributions
 synd-0.414
dylib-0.388
ulner-0.384
ater-0.378
aters-0.377
Token.
Token ID13
Feature activation+3.00e-01

Pos logit contributions
ignty+0.443
acted+0.438
lette+0.431
hedral+0.428
 Okinawa+0.408
Neg logit contributions
 synd-0.649
dylib-0.609
ulner-0.602
ater-0.598
aters-0.596
Token
Token ID564
Feature activation-1.99e-01

Pos logit contributions
ignty+0.459
acted+0.451
lette+0.442
hedral+0.442
 Okinawa+0.420
Neg logit contributions
 synd-0.664
dylib-0.621
ulner-0.613
ater-0.612
aters-0.608
Token
Token ID250
Feature activation+1.61e-01

Pos logit contributions
 synd+0.285
dylib+0.257
ulner+0.252
ater+0.251
aters+0.249
Neg logit contributions
ignty-0.458
acted-0.452
lette-0.447
hedral-0.447
 Okinawa-0.433
TokenAfter
Token ID3260
Feature activation-2.01e-03

Pos logit contributions
ignty+0.286
acted+0.282
lette+0.277
hedral+0.277
 Okinawa+0.266
Neg logit contributions
 synd-0.317
dylib-0.294
ulner-0.290
ater-0.289
aters-0.287
Token we
Token ID356
Feature activation-2.38e-01

Pos logit contributions
 synd+0.004
dylib+0.004
ulner+0.004
ater+0.004
aters+0.004
Neg logit contributions
ignty-0.003
acted-0.003
lette-0.003
hedral-0.003
 Okinawa-0.003
Token completed
Token ID5668
Feature activation-3.41e-01

Pos logit contributions
 synd+0.511
dylib+0.477
ulner+0.471
ater+0.469
aters+0.466
Neg logit contributions
ignty-0.381
acted-0.376
lette-0.369
hedral-0.368
 Okinawa-0.352
Token the
Token ID262
Feature activation-6.58e-02

Pos logit contributions
 synd+0.694
dylib+0.645
ulner+0.637
ater+0.635
aters+0.631
Neg logit contributions
ignty-0.583
acted-0.575
lette-0.565
hedral-0.564
 Okinawa-0.540
Token of
Token ID286
Feature activation-2.11e-01

Pos logit contributions
 synd+2.840
dylib+2.633
ater+2.603
aters+2.580
ulner+2.559
Neg logit contributions
ignty-2.482
acted-2.405
hedral-2.351
lette-2.344
 Okinawa-2.265
Token government
Token ID1230
Feature activation+2.60e-01

Pos logit contributions
 synd+0.373
dylib+0.343
ulner+0.338
ater+0.336
aters+0.334
Neg logit contributions
ignty-0.416
acted-0.410
lette-0.404
hedral-0.404
 Okinawa-0.389
Token,
Token ID11
Feature activation-2.00e-02

Pos logit contributions
ignty+0.465
acted+0.457
lette+0.452
hedral+0.451
 Okinawa+0.432
Neg logit contributions
 synd-0.508
dylib-0.471
ulner-0.465
ater-0.463
aters-0.460
Token a
Token ID257
Feature activation-3.92e-02

Pos logit contributions
 synd+0.042
dylib+0.039
ulner+0.038
ater+0.038
aters+0.038
Neg logit contributions
ignty-0.033
acted-0.033
lette-0.032
hedral-0.032
 Okinawa-0.031
Token senior
Token ID4664
Feature activation+2.53e-02

Pos logit contributions
 synd+0.077
dylib+0.072
ulner+0.071
ater+0.071
aters+0.070
Neg logit contributions
ignty-0.069
acted-0.068
lette-0.067
hedral-0.067
 wedge-0.064
Token clergy
Token ID26014
Feature activation-1.08e-01

Pos logit contributions
ignty+0.048
acted+0.047
hedral+0.047
lette+0.047
 Okinawa+0.045
Neg logit contributions
 synd-0.046
dylib-0.043
ulner-0.042
ater-0.042
aters-0.042
Tokenman
Token ID805
Feature activation+1.41e-01

Pos logit contributions
 synd+0.222
dylib+0.204
ater+0.202
ulner+0.202
aters+0.199
Neg logit contributions
ignty-0.185
acted-0.182
lette-0.178
hedral-0.178
 Okinawa-0.171
Token has
Token ID468
Feature activation+7.96e-02

Pos logit contributions
ignty+0.185
acted+0.181
lette+0.177
hedral+0.177
 Okinawa+0.167
Neg logit contributions
 synd-0.346
dylib-0.324
ulner-0.322
ater-0.321
aters-0.319
Token warned
Token ID7728
Feature activation+7.36e-01

Pos logit contributions
ignty+0.093
acted+0.091
lette+0.089
hedral+0.089
 Okinawa+0.083
Neg logit contributions
 synd-0.205
dylib-0.193
ulner-0.191
ater-0.191
aters-0.190
Token.
Token ID13
Feature activation+1.79e-01

Pos logit contributions
ignty+1.100
acted+1.078
lette+1.062
hedral+1.062
 Okinawa+1.015
Neg logit contributions
 synd-1.638
dylib-1.541
ulner-1.522
ater-1.519
aters-1.508
Token
Token ID198
Feature activation+6.34e-02

Pos logit contributions
ignty+0.278
acted+0.273
lette+0.268
hedral+0.268
 wedge+0.254
Neg logit contributions
 synd-0.392
dylib-0.367
ulner-0.362
ater-0.361
aters-0.359
Token....
Token ID1106
Feature activation+4.35e-01

Pos logit contributions
ignty+0.863
acted+0.850
lette+0.835
hedral+0.832
 Okinawa+0.791
Neg logit contributions
 synd-1.270
dylib-1.188
ulner-1.176
ater-1.169
aters-1.163
Token I
Token ID314
Feature activation+2.12e-01

Pos logit contributions
ignty+0.677
acted+0.664
hedral+0.652
lette+0.652
 Okinawa+0.621
Neg logit contributions
 synd-0.953
dylib-0.891
ulner-0.880
ater-0.879
aters-0.873
Token have
Token ID423
Feature activation+1.33e-01

Pos logit contributions
ignty+0.301
acted+0.292
hedral+0.288
lette+0.286
 Okinawa+0.273
Neg logit contributions
 synd-0.495
dylib-0.462
ater-0.458
ulner-0.458
aters-0.455
Token never
Token ID1239
Feature activation+7.63e-02

Pos logit contributions
ignty+0.209
acted+0.204
hedral+0.201
lette+0.201
 Okinawa+0.192
Neg logit contributions
 synd-0.290
dylib-0.270
ulner-0.267
ater-0.267
aters-0.265
Token said
Token ID531
Feature activation+7.01e-02

Pos logit contributions
ignty+0.113
acted+0.109
hedral+0.108
lette+0.108
 Okinawa+0.103
Neg logit contributions
 synd-0.174
dylib-0.162
ulner-0.161
ater-0.161
aters-0.159
Token that
Token ID326
Feature activation-2.40e-02

Pos logit contributions
ignty+0.120
acted+0.119
lette+0.117
hedral+0.116
 Okinawa+0.112
Neg logit contributions
 synd-0.142
dylib-0.132
ulner-0.130
ater-0.130
aters-0.129
Token in
Token ID287
Feature activation+2.85e-01

Pos logit contributions
 synd+0.052
dylib+0.049
ater+0.048
ulner+0.048
aters+0.048
Neg logit contributions
ignty-0.037
acted-0.037
hedral-0.036
lette-0.036
 Okinawa-0.034
Token my
Token ID616
Feature activation+4.87e-02

Pos logit contributions
ignty+0.519
acted+0.510
hedral+0.503
lette+0.502
 Okinawa+0.481
Neg logit contributions
 synd-0.551
dylib-0.505
ater-0.501
ulner-0.500
aters-0.496
Token life
Token ID1204
Feature activation+3.91e-01

Pos logit contributions
ignty+0.082
acted+0.081
lette+0.079
hedral+0.079
 Okinawa+0.076
Neg logit contributions
 synd-0.100
dylib-0.093
ater-0.092
ulner-0.092
aters-0.091
Token to
Token ID284
Feature activation-1.62e-01

Pos logit contributions
ignty+0.570
acted+0.560
lette+0.549
hedral+0.548
 Okinawa+0.521
Neg logit contributions
 synd-0.895
dylib-0.838
ulner-0.829
ater-0.827
aters-0.822
Token her
Token ID607
Feature activation-1.08e-01

Pos logit contributions
 synd+0.379
dylib+0.353
ater+0.350
ulner+0.350
aters+0.348
Neg logit contributions
ignty-0.230
acted-0.225
hedral-0.219
lette-0.218
 Okinawa-0.207
Token I
Token ID314
Feature activation-2.67e-01

Pos logit contributions
 synd+0.263
dylib+0.246
ulner+0.243
ater+0.243
aters+0.241
Neg logit contributions
ignty-0.167
acted-0.165
lette-0.161
hedral-0.161
 Okinawa-0.153
Token gave
Token ID2921
Feature activation+3.52e-01

Pos logit contributions
 synd+0.617
dylib+0.578
ulner+0.572
ater+0.570
aters+0.567
Neg logit contributions
ignty-0.384
acted-0.377
lette-0.369
hedral-0.369
 Okinawa-0.350
Token up
Token ID510
Feature activation-2.76e-02

Pos logit contributions
ignty+0.691
acted+0.683
lette+0.674
hedral+0.672
 Okinawa+0.648
Neg logit contributions
 synd-0.623
dylib-0.573
ulner-0.566
ater-0.562
aters-0.559
Token I
Token ID314
Feature activation+2.69e-01

Pos logit contributions
 synd+0.055
dylib+0.051
ater+0.050
ulner+0.050
aters+0.050
Neg logit contributions
ignty-0.049
acted-0.048
hedral-0.047
lette-0.047
 Okinawa-0.045
Token probably
Token ID2192
Feature activation+1.04e-03

Pos logit contributions
ignty+0.331
acted+0.325
lette+0.317
hedral+0.317
 Okinawa+0.298
Neg logit contributions
 synd-0.676
dylib-0.638
ulner-0.632
ater-0.629
aters-0.625
Token would
Token ID561
Feature activation-3.26e-01

Pos logit contributions
ignty+0.001
acted+0.001
lette+0.001
hedral+0.001
 Okinawa+0.001
Neg logit contributions
 synd-0.003
dylib-0.002
ulner-0.002
ater-0.002
aters-0.002
Token've
Token ID1053
Feature activation-2.20e-01

Pos logit contributions
 synd+0.755
dylib+0.707
ater+0.700
ulner+0.696
aters+0.693
Neg logit contributions
ignty-0.469
acted-0.459
lette-0.447
hedral-0.446
 Okinawa-0.428
Token been
Token ID587
Feature activation+5.95e-02

Pos logit contributions
 synd+0.469
dylib+0.434
ater+0.428
ulner+0.427
aters+0.424
Neg logit contributions
ignty-0.357
acted-0.348
hedral-0.343
lette-0.342
 Okinawa-0.327
Token the
Token ID262
Feature activation-1.70e-01

Pos logit contributions
ignty+0.099
acted+0.097
lette+0.095
hedral+0.095
 Okinawa+0.091
Neg logit contributions
 synd-0.124
dylib-0.116
ulner-0.114
ater-0.114
aters-0.113
Token manager
Token ID4706
Feature activation-6.39e-02

Pos logit contributions
 synd+0.356
dylib+0.330
ater+0.327
ulner+0.326
aters+0.324
Neg logit contributions
ignty-0.284
acted-0.278
hedral-0.273
lette-0.273
 Okinawa-0.261
Token at
Token ID379
Feature activation+2.37e-02

Pos logit contributions
 synd+0.134
dylib+0.124
ater+0.123
ulner+0.123
aters+0.122
Neg logit contributions
ignty-0.106
acted-0.104
hedral-0.102
lette-0.102
 Okinawa-0.098
Token map
Token ID3975
Feature activation+6.61e-01

Pos logit contributions
ignty+0.961
acted+0.949
lette+0.935
hedral+0.934
 Okinawa+0.902
Neg logit contributions
 synd-0.828
dylib-0.760
ulner-0.749
ater-0.745
aters-0.740
Token in
Token ID287
Feature activation+1.32e-01

Pos logit contributions
ignty+1.056
lette+1.028
acted+1.026
hedral+1.016
front+0.982
Neg logit contributions
 synd-1.399
dylib-1.320
ulner-1.311
culosis-1.295
ater-1.286
Token time
Token ID640
Feature activation+2.41e-01

Pos logit contributions
ignty+0.209
acted+0.205
lette+0.202
hedral+0.201
 Okinawa+0.192
Neg logit contributions
 synd-0.286
dylib-0.267
ulner-0.264
ater-0.263
aters-0.261
Token for
Token ID329
Feature activation-1.71e-01

Pos logit contributions
ignty+0.430
acted+0.423
lette+0.418
hedral+0.416
 Okinawa+0.401
Neg logit contributions
 synd-0.470
dylib-0.439
ulner-0.434
ater-0.430
aters-0.427
Token Kat
Token ID8595
Feature activation+3.47e-01

Pos logit contributions
 synd+0.360
dylib+0.332
ater+0.329
ulner+0.329
aters+0.327
Neg logit contributions
ignty-0.284
acted-0.279
lette-0.274
hedral-0.274
 Okinawa-0.260
Tokenow
Token ID322
Feature activation-1.36e+00

Pos logit contributions
ignty+0.728
acted+0.719
hedral+0.711
lette+0.710
 Okinawa+0.684
Neg logit contributions
 synd-0.572
dylib-0.524
ulner-0.514
culosis-0.508
ater-0.506
Tokenice
Token ID501
Feature activation-5.20e-01

Pos logit contributions
 synd+3.006
ater+2.854
aters+2.836
dylib+2.831
ulner+2.790
Neg logit contributions
ignty-2.006
acted-1.973
hedral-1.941
lette-1.906
 Okinawa-1.873
Token?
Token ID30
Feature activation-5.54e-01

Pos logit contributions
 synd+1.228
dylib+1.143
ater+1.136
ulner+1.133
aters+1.129
Neg logit contributions
ignty-0.725
acted-0.715
hedral-0.696
lette-0.694
 Okinawa-0.654
Token We
Token ID775
Feature activation-6.95e-02

Pos logit contributions
 synd+1.160
dylib+1.078
ater+1.064
aters+1.057
ulner+1.053
Neg logit contributions
ignty-0.928
acted-0.905
hedral-0.893
lette-0.889
 wedge-0.846
Token will
Token ID481
Feature activation+3.58e-02

Pos logit contributions
 synd+0.152
dylib+0.142
ulner+0.141
ater+0.140
aters+0.140
Neg logit contributions
ignty-0.108
acted-0.106
lette-0.104
hedral-0.104
 Okinawa-0.099
Token know
Token ID760
Feature activation-1.92e-01

Pos logit contributions
ignty+0.061
acted+0.060
lette+0.059
hedral+0.059
 Okinawa+0.056
Neg logit contributions
 synd-0.074
dylib-0.068
ater-0.067
ulner-0.067
aters-0.067
Token and
Token ID290
Feature activation+3.86e-02

Pos logit contributions
 synd+0.370
dylib+0.339
ater+0.338
ulner+0.336
aters+0.335
Neg logit contributions
ignty-0.310
acted-0.304
lette-0.299
hedral-0.299
 Okinawa-0.285
Token back
Token ID736
Feature activation+1.59e-01

Pos logit contributions
ignty+0.067
acted+0.066
lette+0.065
hedral+0.065
 Okinawa+0.063
Neg logit contributions
 synd-0.077
dylib-0.071
ulner-0.071
ater-0.070
aters-0.070
Token."
Token ID526
Feature activation+1.16e-01

Pos logit contributions
ignty+0.262
acted+0.257
hedral+0.253
lette+0.253
 Okinawa+0.241
Neg logit contributions
 synd-0.333
dylib-0.308
ulner-0.305
ater-0.305
aters-0.303
Token
Token ID198
Feature activation-2.25e-01

Pos logit contributions
ignty+0.183
acted+0.179
hedral+0.176
lette+0.175
 wedge+0.167
Neg logit contributions
 synd-0.252
dylib-0.236
ater-0.233
ulner-0.233
aters-0.232
Token
Token ID198
Feature activation-1.40e-01

Pos logit contributions
 synd+0.492
dylib+0.461
ater+0.454
ulner+0.453
aters+0.450
Neg logit contributions
ignty-0.352
acted-0.344
hedral-0.338
lette-0.338
 Okinawa-0.322
TokenLet
Token ID5756
Feature activation-1.53e+00

Pos logit contributions
 synd+0.265
dylib+0.248
ater+0.244
ulner+0.243
aters+0.243
Neg logit contributions
ignty-0.259
acted-0.251
hedral-0.250
lette-0.248
 Okinawa-0.240
Tokenting
Token ID889
Feature activation-1.91e-01

Pos logit contributions
 synd+2.912
ater+2.642
dylib+2.637
aters+2.635
culosis+2.624
Neg logit contributions
ignty-2.832
acted-2.723
hedral-2.703
lette-2.703
 wedge-2.593
Token teenagers
Token ID17366
Feature activation+5.34e-01

Pos logit contributions
 synd+0.382
dylib+0.353
ater+0.348
ulner+0.348
aters+0.346
Neg logit contributions
ignty-0.335
acted-0.329
lette-0.323
hedral-0.323
 Okinawa-0.309
Token sleep
Token ID3993
Feature activation-5.22e-01

Pos logit contributions
ignty+0.897
acted+0.888
lette+0.877
hedral+0.874
 wedge+0.847
Neg logit contributions
 synd-1.086
dylib-1.019
ulner-1.008
ater-0.994
culosis-0.989
Token later
Token ID1568
Feature activation-2.52e-01

Pos logit contributions
 synd+1.056
dylib+0.972
ater+0.966
ulner+0.963
aters+0.958
Neg logit contributions
ignty-0.903
acted-0.886
lette-0.875
hedral-0.872
 Okinawa-0.833
Token typically
Token ID6032
Feature activation+1.45e-02

Pos logit contributions
 synd+0.548
dylib+0.510
ater+0.504
ulner+0.504
aters+0.500
Neg logit contributions
ignty-0.396
acted-0.388
hedral-0.382
lette-0.381
 Okinawa-0.363
Token
Token ID198
Feature activation+4.57e-01

Pos logit contributions
ignty+0.721
acted+0.713
lette+0.698
hedral+0.694
 Okinawa+0.666
Neg logit contributions
 synd-1.063
dylib-0.995
ulner-0.983
ater-0.977
aters-0.972
Token
Token ID198
Feature activation+4.19e-01

Pos logit contributions
ignty+0.661
acted+0.659
lette+0.644
hedral+0.637
 Okinawa+0.611
Neg logit contributions
 synd-1.042
ulner-0.971
dylib-0.970
culosis-0.961
ater-0.960
Token"…
Token ID24426
Feature activation+2.13e-01

Pos logit contributions
ignty+0.680
acted+0.671
lette+0.659
hedral+0.657
 Okinawa+0.628
Neg logit contributions
 synd-0.889
dylib-0.827
ulner-0.818
ater-0.815
aters-0.809
TokenSure
Token ID19457
Feature activation+4.38e-01

Pos logit contributions
ignty+0.393
acted+0.390
lette+0.384
hedral+0.382
 Okinawa+0.369
Neg logit contributions
 synd-0.404
dylib-0.372
ulner-0.368
ater-0.365
aters-0.363
Token
Token ID1399
Feature activation+2.05e-01

Pos logit contributions
ignty+0.639
acted+0.630
lette+0.624
hedral+0.610
 Okinawa+0.591
Neg logit contributions
 synd-0.997
dylib-0.935
ulner-0.927
culosis-0.915
ater-0.914
Token please
Token ID3387
Feature activation-1.26e+00

Pos logit contributions
ignty+0.330
acted+0.326
lette+0.320
hedral+0.319
 Okinawa+0.306
Neg logit contributions
 synd-0.435
dylib-0.405
ulner-0.401
ater-0.398
aters-0.396
Token do
Token ID466
Feature activation+4.63e-01

Pos logit contributions
 synd+2.577
ater+2.341
ulner+2.330
dylib+2.322
aters+2.312
Neg logit contributions
ignty-2.203
acted-2.124
hedral-2.100
lette-2.023
 Okinawa-1.962
Token."
Token ID526
Feature activation-1.81e-01

Pos logit contributions
ignty+0.689
acted+0.682
lette+0.677
hedral+0.663
 wedge+0.642
Neg logit contributions
 synd-1.031
dylib-0.971
ulner-0.960
culosis-0.952
ater-0.952
Token
Token ID198
Feature activation+4.71e-01

Pos logit contributions
 synd+0.395
dylib+0.370
ater+0.364
ulner+0.363
aters+0.361
Neg logit contributions
ignty-0.285
acted-0.278
hedral-0.275
lette-0.273
 wedge-0.260
Token
Token ID198
Feature activation+4.35e-01

Pos logit contributions
acted+0.673
ignty+0.672
lette+0.658
hedral+0.649
 Okinawa+0.623
Neg logit contributions
 synd-1.079
ulner-1.008
dylib-1.004
culosis-0.998
ater-0.994
TokenShe
Token ID3347
Feature activation+8.12e-02

Pos logit contributions
ignty+0.711
acted+0.704
lette+0.693
hedral+0.688
 Okinawa+0.659
Neg logit contributions
 synd-0.917
dylib-0.850
ulner-0.843
ater-0.837
aters-0.832
Token cons
Token ID762
Feature activation-4.91e-01

Pos logit contributions
ignty+0.148
acted+0.146
lette+0.144
hedral+0.143
 Okinawa+0.138
Neg logit contributions
 synd-0.207
dylib-0.194
ulner-0.192
ater-0.191
culosis-0.190
Tokencript
Token ID6519
Feature activation+7.85e-01

Pos logit contributions
 synd+0.659
dylib+0.583
ulner+0.582
ater+0.575
aters+0.565
Neg logit contributions
ignty-1.175
acted-1.161
hedral-1.156
lette-1.154
 wedge-1.121
Tokened
Token ID276
Feature activation-3.60e-01

Pos logit contributions
acted+0.599
ignty+0.595
lette+0.565
hedral+0.562
 Okinawa+0.523
Neg logit contributions
 synd-2.310
dylib-2.219
ulner-2.185
ater-2.184
aters-2.170
Token into
Token ID656
Feature activation-8.86e-01

Pos logit contributions
 synd+0.707
dylib+0.650
ulner+0.641
ater+0.641
aters+0.636
Neg logit contributions
ignty-0.643
acted-0.632
hedral-0.626
lette-0.623
 wedge-0.595
Token forced
Token ID4137
Feature activation+5.71e-01

Pos logit contributions
 synd+1.852
dylib+1.696
ater+1.690
ulner+1.672
aters+1.668
Neg logit contributions
ignty-1.499
acted-1.450
hedral-1.443
lette-1.428
 wedge-1.354
Token labour
Token ID10515
Feature activation-1.19e+00

Pos logit contributions
ignty+0.895
acted+0.881
lette+0.870
hedral+0.862
 Okinawa+0.830
Neg logit contributions
 synd-1.233
dylib-1.157
ulner-1.147
ater-1.136
aters-1.132
Token in
Token ID287
Feature activation-2.73e-01

Pos logit contributions
 synd+2.284
dylib+2.066
ater+2.041
ulner+2.023
aters+2.022
Neg logit contributions
ignty-2.183
acted-2.155
hedral-2.144
lette-2.125
 Okinawa-2.028
Token Germany
Token ID4486
Feature activation-4.89e-01

Pos logit contributions
 synd+0.554
dylib+0.513
ulner+0.507
ater+0.506
aters+0.502
Neg logit contributions
ignty-0.468
acted-0.461
lette-0.453
hedral-0.453
 Okinawa-0.432
Token and
Token ID290
Feature activation-1.16e-01

Pos logit contributions
 synd+0.950
dylib+0.875
ater+0.861
ulner+0.857
aters+0.855
Neg logit contributions
ignty-0.883
acted-0.869
hedral-0.858
lette-0.857
 wedge-0.819
Token few
Token ID1178
Feature activation-6.13e-01

Pos logit contributions
 synd+0.259
dylib+0.241
ulner+0.238
ater+0.238
aters+0.236
Neg logit contributions
ignty-0.177
acted-0.174
hedral-0.171
lette-0.170
 Okinawa-0.161
Token wanted
Token ID2227
Feature activation-6.43e-01

Pos logit contributions
 synd+1.333
dylib+1.237
ater+1.221
ulner+1.220
aters+1.215
Neg logit contributions
ignty-0.972
acted-0.951
hedral-0.936
lette-0.934
 wedge-0.885
Tokenage
Token ID496
Feature activation-6.88e-02

Pos logit contributions
 synd+1.332
dylib+1.239
ulner+1.225
ater+1.208
culosis+1.207
Neg logit contributions
ignty-1.101
acted-1.090
lette-1.076
hedral-1.069
 Okinawa-1.018
Token depicting
Token ID27561
Feature activation+4.75e-01

Pos logit contributions
 synd+0.140
dylib+0.131
ulner+0.129
ater+0.129
aters+0.128
Neg logit contributions
ignty-0.117
acted-0.115
lette-0.113
hedral-0.113
 Okinawa-0.108
Token the
Token ID262
Feature activation+2.04e-01

Pos logit contributions
ignty+0.695
acted+0.689
lette+0.675
hedral+0.673
 Okinawa+0.645
Neg logit contributions
 synd-1.070
dylib-1.006
ulner-1.000
ater-0.989
aters-0.985
Token principal
Token ID10033
Feature activation+1.05e-01

Pos logit contributions
ignty+0.361
acted+0.360
lette+0.354
hedral+0.353
 wedge+0.342
Neg logit contributions
 synd-0.392
dylib-0.368
ulner-0.365
culosis-0.358
ater-0.356
Token M
Token ID337
Feature activation+2.88e-01

Pos logit contributions
ignty+0.177
acted+0.175
lette+0.172
hedral+0.172
 wedge+0.166
Neg logit contributions
 synd-0.214
dylib-0.200
ulner-0.198
ater-0.196
culosis-0.195
Tokenupp
Token ID7211
Feature activation-1.41e+00

Pos logit contributions
ignty+0.421
acted+0.416
lette+0.409
hedral+0.409
 wedge+0.385
Neg logit contributions
 synd-0.653
dylib-0.613
ulner-0.603
culosis-0.600
aters-0.593
Tokenets
Token ID1039
Feature activation-1.10e-01

Pos logit contributions
 synd+1.930
dylib+1.745
ater+1.721
ulner+1.700
aters+1.692
Neg logit contributions
ignty-3.321
acted-3.272
hedral-3.235
lette-3.226
 Okinawa-3.151
Token being
Token ID852
Feature activation+1.21e-01

Pos logit contributions
 synd+0.220
dylib+0.205
ulner+0.202
ater+0.201
aters+0.200
Neg logit contributions
ignty-0.190
acted-0.188
lette-0.185
hedral-0.185
 Okinawa-0.177
Token reunited
Token ID35440
Feature activation+1.78e-02

Pos logit contributions
ignty+0.194
acted+0.191
lette+0.188
hedral+0.188
 Okinawa+0.179
Neg logit contributions
 synd-0.256
dylib-0.239
ulner-0.237
ater-0.236
aters-0.234
Token (
Token ID357
Feature activation+2.26e-01

Pos logit contributions
ignty+0.032
acted+0.032
lette+0.031
hedral+0.031
 Okinawa+0.030
Neg logit contributions
 synd-0.035
dylib-0.032
ulner-0.032
ater-0.032
aters-0.031
TokenRow
Token ID25166
Feature activation-2.70e-01

Pos logit contributions
ignty+0.327
acted+0.321
lette+0.315
hedral+0.314
 Okinawa+0.299
Neg logit contributions
 synd-0.519
dylib-0.486
ulner-0.481
ater-0.480
aters-0.477