TOP ACTIVATIONS
MAX = 10.348

, someone had left a partly eaten bag el as an
to protest against the violent eviction of the squats of Villa
of the argument offer , partly , too , because frequent
he is likely to face eviction from the Embassy of Ecuador
Bo ass on H agen finished close by in third
it is technically interesting , partly the other Democracy Club volunteers
with its Indian viewership , mainly because I imagine they are
over 1 billion people , mainly very poor , relay on
arg est Planes // Global Business Editor Marcus We is ger
ald Bo ass on H agen ( Sky Pro cy cling
articles , and arguments , partly also because those things that
proper -- finally has a tenant that people will brave the
most of the 1980 s partly due to spl inter ing
The non - profit , self - govern ed organisation "
eventual occupation by two major tenants . Google has signed on
ANS BUR G , N . J . Hundreds of
MORE INFO : Our sub - 2 marathon page has
, Rubio told Fusion in 2014 . " And
suggested to Joseph Stalin a preempt ive attack against Germany through
written by other authors , partly on account of the multiplication

MIN ACTIVATIONS
MIN = -10.313

valuable , the 1000 Swiss franc note ( USD 10 63
Brend a Lee Mar quez McC ool , 49
statement on their positive case starts by
Ass assin s Creed ® Liberation HD , is
class ( in the wrong language !) to becoming a Junior
s . All told , Trump s
idea here is that any distribution which provides the Snap framework
International Monetary Fund chief Domin ique Strauss - K ahn .
s attorney , Joey McC all , told WIN Z
Back in the Days Before Positive Rein forcement , you would
made easy in some other dist ros . In
experiment , 1 , 000 low - income families will be
death of his young daughter Olivia and the struggles of his
apartment , and jumping out on top of Ann al
. End ing negative gearing won 't matter .
Over on Reddit Oculus Rift creator Palmer Luc key has
. The best way to avoid this type of
who desire physical transition (" trans sexual " people ) and
Deb ate 2016 pic . twitter . com / F c
oth through the portal , laying waste to everything in sight

INTERVAL 5.182 - 10.348
CONTAINS 0.516%

parties . Si le d és acc ord persist e ,
16 81 , is U . S . Federal Government legislation
Frances con , a migrant originally from Italy . Though it
mention those tapes or their destruction . <|endoftext|> Now playing :
ocking stability . As for thermal solution , MSI R 78

INTERVAL 0.017 - 5.182
CONTAINS 49.079%

Haiti does not participate in their leftist , anti - American
super enthusiastic about this starch , but it s
Mil itant faction which would emerge triumphant . It was this
micro - trans actions ." In an hour -
However , the United States determination to undercut these

INTERVAL -5.148 - 0.017
CONTAINS 49.892%

and thus sought to portray me as a r aving
relationship with Jennifer , and then Bradley Cooper starts to fall
Lap eral of Behavioral Dynamics , said she had developed a
years and older , by sex , United States , 1991
conference in Paris in December . French President Francois

INTERVAL -10.313 - -5.148
CONTAINS 0.512%

extr usion v . ver tex . xy z += v
after the student appealed his University of Cincinnati suspension . He
fan G K ert es z , MD , is a
. tv / d anks h rine Weekly SC 2 Podcast
desire physical transition (" trans sexual " people ) and other
Token,
Token ID11
Feature activation+2.97e-01

Pos logit contributions
 plural+0.106
hin+0.088
unct+0.087
 spoiled+0.085
 hinge+0.084
Neg logit contributions
jac-0.101
arc-0.101
Birth-0.100
acion-0.096
ivia-0.096
Token someone
Token ID2130
Feature activation+2.63e+00

Pos logit contributions
 plural+0.201
unct+0.163
hin+0.163
 spoiled+0.158
 hinge+0.157
Neg logit contributions
jac-0.230
arc-0.229
Birth-0.228
acion-0.220
ivia-0.219
Token had
Token ID550
Feature activation+2.74e-01

Pos logit contributions
 plural+1.871
hin+1.518
 spoiled+1.514
unct+1.491
 hinge+1.468
Neg logit contributions
Birth-1.955
jac-1.949
arc-1.938
acion-1.881
 Spending-1.845
Token left
Token ID1364
Feature activation+2.81e+00

Pos logit contributions
 plural+0.194
hin+0.159
unct+0.158
 spoiled+0.156
 hinge+0.154
Neg logit contributions
jac-0.203
arc-0.202
Birth-0.202
acion-0.195
ivia-0.193
Token a
Token ID257
Feature activation-2.62e-01

Pos logit contributions
 plural+1.932
hin+1.556
unct+1.551
 spoiled+1.540
 hinge+1.501
Neg logit contributions
Birth-2.153
jac-2.144
arc-2.141
acion-2.070
ivia-2.058
Token partly
Token ID11476
Feature activation+1.03e+01

Pos logit contributions
jac+0.184
arc+0.183
Birth+0.183
acion+0.176
ivia+0.175
Neg logit contributions
 plural-0.196
hin-0.162
unct-0.162
 spoiled-0.158
 hinge-0.157
Token eaten
Token ID17065
Feature activation+1.23e+00

Pos logit contributions
 plural+6.656
 spoiled+5.558
hin+5.375
 hinge+5.289
 hers+5.171
Neg logit contributions
Birth-8.008
jac-7.758
acion-7.739
arc-7.718
ivia-7.550
Token bag
Token ID6131
Feature activation-3.55e-01

Pos logit contributions
 plural+0.900
unct+0.738
hin+0.735
 spoiled+0.728
 hinge+0.719
Neg logit contributions
jac-0.890
Birth-0.888
arc-0.882
acion-0.853
ivia-0.848
Tokenel
Token ID417
Feature activation-3.40e+00

Pos logit contributions
jac+0.263
arc+0.263
Birth+0.262
acion+0.253
ivia+0.251
Neg logit contributions
 plural-0.250
hin-0.205
unct-0.205
 spoiled-0.199
 hinge-0.197
Token as
Token ID355
Feature activation-1.07e+00

Pos logit contributions
jac+2.418
arc+2.409
Birth+2.366
acion+2.316
 Spending+2.266
Neg logit contributions
 plural-2.454
hin-2.057
unct-2.045
 hinge-1.939
 spoiled-1.938
Token an
Token ID281
Feature activation+1.66e-01

Pos logit contributions
jac+0.822
arc+0.821
Birth+0.817
acion+0.789
ivia+0.782
Neg logit contributions
 plural-0.723
unct-0.590
hin-0.589
 spoiled-0.569
 hinge-0.566
Token to
Token ID284
Feature activation+1.37e+00

Pos logit contributions
jac+0.441
arc+0.439
Birth+0.437
acion+0.421
ivia+0.417
Neg logit contributions
 plural-0.484
hin-0.403
unct-0.403
 spoiled-0.392
 hinge-0.389
Token protest
Token ID5402
Feature activation+2.32e+00

Pos logit contributions
 plural+0.945
unct+0.767
hin+0.765
 spoiled+0.746
 hinge+0.743
Neg logit contributions
jac-1.043
arc-1.039
Birth-1.035
acion-1.001
ivia-0.995
Token against
Token ID1028
Feature activation-1.93e-01

Pos logit contributions
 plural+1.587
hin+1.279
unct+1.278
 spoiled+1.260
 hinge+1.236
Neg logit contributions
jac-1.783
arc-1.772
Birth-1.769
acion-1.704
ivia-1.693
Token the
Token ID262
Feature activation+7.00e-01

Pos logit contributions
jac+0.143
arc+0.143
Birth+0.142
acion+0.137
ivia+0.136
Neg logit contributions
 plural-0.136
unct-0.111
hin-0.111
 spoiled-0.108
 hinge-0.107
Token violent
Token ID6590
Feature activation+5.46e-01

Pos logit contributions
 plural+0.511
unct+0.418
hin+0.415
 spoiled+0.413
 hinge+0.406
Neg logit contributions
jac-0.504
arc-0.501
Birth-0.500
acion-0.482
ivia-0.480
Token eviction
Token ID38738
Feature activation+1.03e+01

Pos logit contributions
 plural+0.385
hin+0.314
unct+0.313
 spoiled+0.307
 hinge+0.303
Neg logit contributions
jac-0.407
arc-0.405
Birth-0.404
acion-0.389
ivia-0.387
Token of
Token ID286
Feature activation+1.58e-01

Pos logit contributions
 plural+5.821
 spoiled+4.695
 hinge+4.465
hin+4.425
 Jin+4.323
Neg logit contributions
jac-8.512
Birth-8.361
arc-8.265
 Spending-8.070
ivia-8.000
Token the
Token ID262
Feature activation+9.24e-01

Pos logit contributions
 plural+0.115
hin+0.095
unct+0.095
 spoiled+0.092
 hinge+0.091
Neg logit contributions
jac-0.114
arc-0.113
Birth-0.113
acion-0.109
ivia-0.108
Token squats
Token ID48871
Feature activation+6.57e-01

Pos logit contributions
 plural+0.655
unct+0.536
hin+0.535
 spoiled+0.523
 hinge+0.518
Neg logit contributions
jac-0.684
arc-0.681
Birth-0.678
acion-0.653
ivia-0.650
Token of
Token ID286
Feature activation-1.32e+00

Pos logit contributions
 plural+0.473
hin+0.390
unct+0.389
 spoiled+0.378
 hinge+0.375
Neg logit contributions
jac-0.477
arc-0.475
Birth-0.474
acion-0.456
ivia-0.453
Token Villa
Token ID25018
Feature activation+2.51e+00

Pos logit contributions
jac+0.964
arc+0.961
Birth+0.954
acion+0.925
ivia+0.915
Neg logit contributions
 plural-0.935
hin-0.775
unct-0.773
 spoiled-0.746
 hinge-0.743
Token of
Token ID286
Feature activation+1.25e+00

Pos logit contributions
 plural+2.090
hin+1.692
unct+1.686
 spoiled+1.680
 hinge+1.641
Neg logit contributions
jac-2.279
Birth-2.245
arc-2.244
acion-2.167
 Spending-2.147
Token the
Token ID262
Feature activation+1.18e+00

Pos logit contributions
 plural+0.872
unct+0.707
hin+0.707
 spoiled+0.690
 hinge+0.683
Neg logit contributions
jac-0.941
arc-0.941
Birth-0.935
acion-0.906
ivia-0.897
Token argument
Token ID4578
Feature activation+3.99e+00

Pos logit contributions
 plural+0.833
unct+0.675
hin+0.674
 spoiled+0.660
 hinge+0.656
Neg logit contributions
jac-0.877
arc-0.875
Birth-0.870
acion-0.843
ivia-0.834
Token offer
Token ID2897
Feature activation+4.20e+00

Pos logit contributions
 plural+2.756
unct+2.265
 spoiled+2.229
hin+2.229
 hinge+2.182
Neg logit contributions
jac-3.019
Birth-3.007
arc-2.953
 Spending-2.867
acion-2.862
Token,
Token ID11
Feature activation+2.45e+00

Pos logit contributions
 plural+2.932
unct+2.429
hin+2.420
 spoiled+2.370
 hinge+2.329
Neg logit contributions
jac-3.093
Birth-3.086
arc-3.061
acion-2.948
 Spending-2.944
Token partly
Token ID11476
Feature activation+1.03e+01

Pos logit contributions
 plural+1.856
hin+1.519
unct+1.510
 spoiled+1.495
 hinge+1.486
Neg logit contributions
jac-1.720
Birth-1.705
arc-1.694
acion-1.636
ivia-1.614
Token,
Token ID11
Feature activation+1.01e+00

Pos logit contributions
 plural+5.888
 spoiled+4.601
hin+4.507
 hinge+4.412
unct+4.364
Neg logit contributions
jac-8.700
Birth-8.677
arc-8.494
 Spending-8.257
acion-8.231
Token too
Token ID1165
Feature activation+3.15e-01

Pos logit contributions
 plural+0.687
hin+0.555
unct+0.554
 spoiled+0.541
 hinge+0.536
Neg logit contributions
jac-0.783
arc-0.779
Birth-0.777
acion-0.751
ivia-0.744
Token,
Token ID11
Feature activation-8.36e-02

Pos logit contributions
 plural+0.243
unct+0.204
hin+0.204
 spoiled+0.198
 hinge+0.197
Neg logit contributions
arc-0.211
jac-0.211
Birth-0.210
acion-0.202
ivia-0.200
Token because
Token ID780
Feature activation+3.12e+00

Pos logit contributions
arc+0.068
jac+0.068
Birth+0.068
acion+0.066
ivia+0.065
Neg logit contributions
 plural-0.052
unct-0.042
hin-0.042
 spoiled-0.040
 hinge-0.040
Token frequent
Token ID10792
Feature activation+1.90e+00

Pos logit contributions
 plural+2.179
hin+1.740
 spoiled+1.721
unct+1.716
 hinge+1.687
Neg logit contributions
jac-2.373
arc-2.366
Birth-2.357
acion-2.274
ivia-2.264
Token he
Token ID339
Feature activation+1.30e+00

Pos logit contributions
 plural+1.908
hin+1.522
 spoiled+1.504
unct+1.481
 hinge+1.464
Neg logit contributions
arc-2.064
jac-2.060
Birth-2.041
acion-1.970
ivia-1.958
Token is
Token ID318
Feature activation-4.27e-01

Pos logit contributions
 plural+0.921
hin+0.750
 spoiled+0.742
unct+0.737
 hinge+0.721
Neg logit contributions
Birth-0.965
jac-0.964
arc-0.961
acion-0.921
 Spending-0.920
Token likely
Token ID1884
Feature activation+2.97e+00

Pos logit contributions
jac+0.314
arc+0.312
Birth+0.312
acion+0.302
ivia+0.299
Neg logit contributions
 plural-0.306
hin-0.250
unct-0.249
 spoiled-0.245
 hinge-0.241
Token to
Token ID284
Feature activation+1.96e+00

Pos logit contributions
 plural+2.129
hin+1.725
 spoiled+1.716
unct+1.709
 hinge+1.674
Neg logit contributions
jac-2.183
Birth-2.170
arc-2.168
acion-2.096
ivia-2.079
Token face
Token ID1986
Feature activation+1.17e+00

Pos logit contributions
 plural+1.282
hin+1.017
 hinge+1.000
unct+0.995
 spoiled+0.993
Neg logit contributions
jac-1.573
Birth-1.570
arc-1.569
acion-1.526
ivia-1.509
Token eviction
Token ID38738
Feature activation+9.85e+00

Pos logit contributions
 plural+0.905
hin+0.742
unct+0.738
 spoiled+0.730
 hinge+0.716
Neg logit contributions
jac-0.811
Birth-0.808
arc-0.807
acion-0.774
ivia-0.765
Token from
Token ID422
Feature activation+1.25e+00

Pos logit contributions
 plural+5.852
 spoiled+4.593
hin+4.586
 hinge+4.454
unct+4.281
Neg logit contributions
jac-8.212
Birth-7.977
arc-7.889
 Spending-7.633
ivia-7.610
Token the
Token ID262
Feature activation+7.75e-01

Pos logit contributions
 plural+0.994
hin+0.829
unct+0.827
 spoiled+0.814
 hinge+0.806
Neg logit contributions
arc-0.818
jac-0.817
Birth-0.814
acion-0.780
ivia-0.775
Token Embassy
Token ID23233
Feature activation+6.51e-01

Pos logit contributions
 plural+0.695
hin+0.595
unct+0.594
 spoiled+0.584
 hinge+0.580
Neg logit contributions
jac-0.428
arc-0.427
Birth-0.424
acion-0.404
ivia-0.401
Token of
Token ID286
Feature activation-8.04e-01

Pos logit contributions
 plural+0.461
unct+0.380
hin+0.380
 spoiled+0.368
 hinge+0.365
Neg logit contributions
jac-0.479
arc-0.478
Birth-0.476
acion-0.460
ivia-0.455
Token Ecuador
Token ID25794
Feature activation-8.54e-01

Pos logit contributions
jac+0.479
arc+0.478
Birth+0.475
acion+0.455
ivia+0.451
Neg logit contributions
 plural-0.684
hin-0.582
unct-0.582
 spoiled-0.569
 hinge-0.565
Token
Token ID198
Feature activation+1.20e+00

Pos logit contributions
 plural+0.604
hin+0.507
unct+0.506
 spoiled+0.492
 hinge+0.489
Neg logit contributions
jac-0.513
arc-0.511
Birth-0.509
acion-0.490
ivia-0.486
TokenBo
Token ID16635
Feature activation+6.30e-01

Pos logit contributions
 plural+0.844
hin+0.706
unct+0.701
 spoiled+0.672
 hinge+0.661
Neg logit contributions
arc-0.864
jac-0.864
Birth-0.851
acion-0.838
ivia-0.830
Tokenass
Token ID562
Feature activation+4.49e+00

Pos logit contributions
 plural+0.404
hin+0.334
unct+0.329
 spoiled+0.320
 hinge+0.314
Neg logit contributions
jac-0.496
Birth-0.496
arc-0.490
acion-0.485
 Spending-0.475
Tokenon
Token ID261
Feature activation-1.05e+00

Pos logit contributions
 plural+2.923
hin+2.579
unct+2.414
 spoiled+2.339
 Jin+2.281
Neg logit contributions
Birth-3.452
acion-3.372
arc-3.356
jac-3.342
 Spending-3.225
Token H
Token ID367
Feature activation-2.88e+00

Pos logit contributions
jac+0.789
arc+0.784
Birth+0.775
acion+0.742
 Spending+0.742
Neg logit contributions
 plural-0.736
unct-0.597
hin-0.587
 spoiled-0.575
 hinge-0.568
Tokenagen
Token ID11286
Feature activation+9.83e+00

Pos logit contributions
jac+1.915
arc+1.901
Birth+1.901
acion+1.825
ivia+1.804
Neg logit contributions
 plural-2.252
unct-1.893
hin-1.890
 spoiled-1.835
 hinge-1.822
Token finished
Token ID5201
Feature activation+2.51e-01

Pos logit contributions
 plural+6.350
hin+5.355
 spoiled+5.293
unct+5.132
 Jin+4.903
Neg logit contributions
Birth-7.333
arc-7.206
jac-7.084
acion-7.028
ivia-6.846
Token close
Token ID1969
Feature activation+6.46e-01

Pos logit contributions
 plural+0.169
hin+0.138
unct+0.137
 spoiled+0.134
 hinge+0.132
Neg logit contributions
jac-0.194
arc-0.193
Birth-0.192
acion-0.186
ivia-0.184
Token by
Token ID416
Feature activation+7.52e-01

Pos logit contributions
 plural+0.415
hin+0.333
unct+0.332
 spoiled+0.322
 hinge+0.319
Neg logit contributions
jac-0.520
arc-0.519
Birth-0.517
acion-0.500
ivia-0.496
Token in
Token ID287
Feature activation-2.45e-01

Pos logit contributions
 plural+0.550
hin+0.455
unct+0.452
 spoiled+0.443
 hinge+0.437
Neg logit contributions
jac-0.538
arc-0.538
Birth-0.536
acion-0.517
ivia-0.512
Token third
Token ID2368
Feature activation-4.66e-01

Pos logit contributions
jac+0.187
arc+0.186
Birth+0.185
acion+0.179
ivia+0.178
Neg logit contributions
 plural-0.167
hin-0.137
unct-0.137
 spoiled-0.132
 hinge-0.131
Token it
Token ID340
Feature activation-1.47e-01

Pos logit contributions
 plural+4.969
 spoiled+3.977
hin+3.959
unct+3.901
 hinge+3.787
Neg logit contributions
jac-6.827
arc-6.776
Birth-6.710
ivia-6.538
acion-6.534
Token is
Token ID318
Feature activation-3.91e-01

Pos logit contributions
arc+0.115
jac+0.115
Birth+0.114
acion+0.110
ivia+0.110
Neg logit contributions
 plural-0.098
hin-0.079
unct-0.079
 spoiled-0.076
 hinge-0.076
Token technically
Token ID14497
Feature activation+4.79e+00

Pos logit contributions
jac+0.327
arc+0.327
Birth+0.326
acion+0.316
 Spending+0.314
Neg logit contributions
 plural-0.236
unct-0.188
hin-0.187
 spoiled-0.179
 hinge-0.178
Token interesting
Token ID3499
Feature activation-1.47e-01

Pos logit contributions
 plural+3.270
unct+2.626
hin+2.623
 spoiled+2.577
 hinge+2.520
Neg logit contributions
jac-3.710
Birth-3.651
arc-3.615
acion-3.539
ivia-3.508
Token,
Token ID11
Feature activation+3.48e-01

Pos logit contributions
jac+0.107
arc+0.107
Birth+0.106
acion+0.102
ivia+0.102
Neg logit contributions
 plural-0.105
unct-0.087
hin-0.087
 spoiled-0.084
 hinge-0.083
Token partly
Token ID11476
Feature activation+9.68e+00

Pos logit contributions
 plural+0.201
hin+0.156
unct+0.156
 spoiled+0.151
 hinge+0.149
Neg logit contributions
jac-0.304
arc-0.302
Birth-0.301
acion-0.293
ivia-0.291
Token the
Token ID262
Feature activation-6.37e-01

Pos logit contributions
 plural+5.592
 spoiled+4.360
unct+4.288
hin+4.272
 analytic+4.195
Neg logit contributions
jac-8.158
Birth-7.989
arc-7.791
acion-7.773
ivia-7.753
Token other
Token ID584
Feature activation-9.68e-01

Pos logit contributions
jac+0.455
arc+0.453
Birth+0.451
acion+0.435
ivia+0.432
Neg logit contributions
 plural-0.469
unct-0.387
hin-0.386
 spoiled-0.376
 hinge-0.374
Token Democracy
Token ID20265
Feature activation+1.60e+00

Pos logit contributions
jac+0.628
arc+0.626
Birth+0.623
acion+0.598
ivia+0.593
Neg logit contributions
 plural-0.771
hin-0.650
unct-0.649
 spoiled-0.632
 hinge-0.628
Token Club
Token ID6289
Feature activation-8.54e-01

Pos logit contributions
 plural+1.144
unct+0.942
hin+0.937
 spoiled+0.911
 hinge+0.909
Neg logit contributions
jac-1.166
arc-1.155
Birth-1.154
acion-1.112
ivia-1.109
Token volunteers
Token ID11661
Feature activation+2.04e+00

Pos logit contributions
jac+0.647
arc+0.646
Birth+0.642
acion+0.621
ivia+0.616
Neg logit contributions
 plural-0.587
hin-0.480
unct-0.480
 spoiled-0.465
 hinge-0.461
Token with
Token ID351
Feature activation+3.49e+00

Pos logit contributions
 plural+0.766
hin+0.626
unct+0.626
 spoiled+0.610
 hinge+0.605
Neg logit contributions
jac-0.805
arc-0.799
Birth-0.799
acion-0.770
ivia-0.763
Token its
Token ID663
Feature activation+2.36e+00

Pos logit contributions
 plural+2.420
unct+1.946
hin+1.932
 spoiled+1.918
 hinge+1.866
Neg logit contributions
jac-2.660
arc-2.634
Birth-2.632
acion-2.552
ivia-2.524
Token Indian
Token ID3942
Feature activation+1.20e+00

Pos logit contributions
 plural+1.699
unct+1.376
hin+1.372
 spoiled+1.359
 hinge+1.340
Neg logit contributions
jac-1.739
Birth-1.721
arc-1.719
acion-1.668
ivia-1.653
Token viewership
Token ID48347
Feature activation+3.80e+00

Pos logit contributions
 plural+0.853
unct+0.696
hin+0.692
 spoiled+0.678
 hinge+0.674
Neg logit contributions
jac-0.881
arc-0.876
Birth-0.875
acion-0.844
ivia-0.840
Token,
Token ID11
Feature activation+1.91e+00

Pos logit contributions
 plural+2.561
hin+2.043
unct+2.021
 spoiled+2.017
 hinge+2.006
Neg logit contributions
jac-2.944
arc-2.933
Birth-2.930
acion-2.838
ivia-2.803
Token mainly
Token ID8384
Feature activation+9.65e+00

Pos logit contributions
 plural+1.322
hin+1.061
unct+1.055
 spoiled+1.047
 hinge+1.029
Neg logit contributions
jac-1.456
arc-1.444
Birth-1.443
acion-1.399
ivia-1.386
Token because
Token ID780
Feature activation+1.67e+00

Pos logit contributions
 plural+4.962
 spoiled+3.671
hin+3.569
 hinge+3.536
unct+3.486
Neg logit contributions
arc-8.795
jac-8.781
Birth-8.718
ivia-8.448
acion-8.443
Token I
Token ID314
Feature activation+1.07e+00

Pos logit contributions
 plural+1.154
hin+0.933
unct+0.929
 spoiled+0.911
 hinge+0.896
Neg logit contributions
jac-1.269
arc-1.265
Birth-1.260
acion-1.221
ivia-1.213
Token imagine
Token ID5967
Feature activation+8.96e-01

Pos logit contributions
 plural+0.688
hin+0.553
unct+0.551
 spoiled+0.536
 hinge+0.530
Neg logit contributions
jac-0.862
Birth-0.859
arc-0.857
acion-0.831
 Spending-0.821
Token they
Token ID484
Feature activation-7.00e-01

Pos logit contributions
 plural+0.620
hin+0.503
unct+0.502
 spoiled+0.491
 hinge+0.483
Neg logit contributions
jac-0.679
arc-0.678
Birth-0.675
acion-0.653
ivia-0.648
Token are
Token ID389
Feature activation-1.70e+00

Pos logit contributions
jac+0.522
arc+0.521
Birth+0.519
acion+0.501
ivia+0.497
Neg logit contributions
 plural-0.489
hin-0.401
unct-0.401
 spoiled-0.389
 hinge-0.386
Token over
Token ID625
Feature activation-8.84e-01

Pos logit contributions
jac+1.607
arc+1.602
Birth+1.596
acion+1.537
 Spending+1.522
Neg logit contributions
 plural-1.640
unct-1.373
hin-1.367
 spoiled-1.317
 hinge-1.314
Token 1
Token ID352
Feature activation+5.39e-01

Pos logit contributions
jac+0.638
arc+0.636
Birth+0.633
acion+0.611
ivia+0.606
Neg logit contributions
 plural-0.641
hin-0.530
unct-0.529
 spoiled-0.514
 hinge-0.509
Token billion
Token ID2997
Feature activation+1.61e+00

Pos logit contributions
 plural+0.434
hin+0.365
unct+0.364
 spoiled+0.356
 hinge+0.353
Neg logit contributions
jac-0.345
Birth-0.345
arc-0.343
acion-0.329
ivia-0.326
Token people
Token ID661
Feature activation+8.71e-01

Pos logit contributions
 plural+1.237
hin+1.027
unct+1.020
 spoiled+1.008
 hinge+0.998
Neg logit contributions
arc-1.099
jac-1.098
Birth-1.096
acion-1.050
ivia-1.048
Token,
Token ID11
Feature activation+6.20e-01

Pos logit contributions
 plural+0.516
hin+0.405
unct+0.404
 spoiled+0.391
 hinge+0.388
Neg logit contributions
jac-0.745
arc-0.743
Birth-0.741
acion-0.719
ivia-0.713
Token mainly
Token ID8384
Feature activation+9.62e+00

Pos logit contributions
 plural+0.453
hin+0.371
unct+0.369
 spoiled+0.363
 hinge+0.362
Neg logit contributions
jac-0.449
arc-0.445
Birth-0.444
acion-0.429
ivia-0.425
Token very
Token ID845
Feature activation+4.74e-01

Pos logit contributions
 plural+6.473
 spoiled+5.252
 hinge+5.169
hin+5.080
 hers+5.031
Neg logit contributions
arc-7.151
jac-7.109
Birth-7.061
acion-6.974
ivia-6.918
Token poor
Token ID3595
Feature activation-1.50e+00

Pos logit contributions
 plural+0.361
hin+0.299
unct+0.299
 spoiled+0.293
 hinge+0.290
Neg logit contributions
jac-0.327
arc-0.325
Birth-0.324
acion-0.313
ivia-0.310
Token,
Token ID11
Feature activation-8.20e-01

Pos logit contributions
jac+0.960
arc+0.957
Birth+0.952
acion+0.914
ivia+0.905
Neg logit contributions
 plural-1.197
unct-1.012
hin-1.012
 spoiled-0.982
 hinge-0.977
Token relay
Token ID24248
Feature activation-2.59e+00

Pos logit contributions
jac+0.722
arc+0.720
Birth+0.718
acion+0.697
ivia+0.692
Neg logit contributions
 plural-0.461
hin-0.360
unct-0.360
 spoiled-0.344
 hinge-0.339
Token on
Token ID319
Feature activation-6.33e-01

Pos logit contributions
jac+1.861
arc+1.856
Birth+1.849
acion+1.784
ivia+1.768
Neg logit contributions
 plural-1.873
unct-1.556
hin-1.553
 spoiled-1.503
 hinge-1.496
Tokenarg
Token ID853
Feature activation+3.73e-01

Pos logit contributions
jac+0.918
arc+0.911
Birth+0.910
acion+0.864
ivia+0.852
Neg logit contributions
 plural-1.672
hin-1.450
unct-1.450
 spoiled-1.414
 hinge-1.408
Tokenest
Token ID395
Feature activation-1.05e+00

Pos logit contributions
 plural+0.310
hin+0.266
unct+0.265
 spoiled+0.257
 hinge+0.252
Neg logit contributions
Birth-0.228
jac-0.226
arc-0.225
acion-0.217
ivia-0.213
Token Planes
Token ID42788
Feature activation+1.20e+00

Pos logit contributions
jac+0.788
arc+0.786
Birth+0.782
acion+0.755
ivia+0.749
Neg logit contributions
 plural-0.736
hin-0.603
unct-0.603
 spoiled-0.585
 hinge-0.580
Token //
Token ID3373
Feature activation-2.45e+00

Pos logit contributions
 plural+0.897
hin+0.755
unct+0.748
 spoiled+0.728
 hinge+0.719
Neg logit contributions
jac-0.832
arc-0.825
Birth-0.823
acion-0.799
ivia-0.789
Token Global
Token ID8060
Feature activation+4.01e+00

Pos logit contributions
jac+1.808
arc+1.807
Birth+1.798
acion+1.727
 Spending+1.718
Neg logit contributions
 plural-1.727
hin-1.419
unct-1.419
 hinge-1.378
 spoiled-1.377
Token Business
Token ID7320
Feature activation+9.54e+00

Pos logit contributions
 plural+3.046
hin+2.555
unct+2.554
 spoiled+2.471
 hinge+2.443
Neg logit contributions
jac-2.746
arc-2.729
Birth-2.712
acion-2.620
ivia-2.611
Token Editor
Token ID12058
Feature activation-5.06e-01

Pos logit contributions
 plural+7.031
unct+6.070
hin+5.840
 spoiled+5.747
 Jin+5.683
Neg logit contributions
jac-6.134
arc-6.060
Birth-6.044
ivia-5.864
acion-5.809
Token Marcus
Token ID17068
Feature activation-8.43e-01

Pos logit contributions
jac+0.356
arc+0.355
Birth+0.353
acion+0.340
ivia+0.338
Neg logit contributions
 plural-0.375
hin-0.311
unct-0.311
 spoiled-0.302
 hinge-0.300
Token We
Token ID775
Feature activation-4.39e-01

Pos logit contributions
jac+0.602
arc+0.601
Birth+0.599
acion+0.577
ivia+0.571
Neg logit contributions
 plural-0.615
unct-0.509
hin-0.508
 spoiled-0.494
 hinge-0.491
Tokenis
Token ID271
Feature activation-8.55e-01

Pos logit contributions
jac+0.300
arc+0.298
Birth+0.298
acion+0.287
 Spending+0.284
Neg logit contributions
 plural-0.334
hin-0.280
unct-0.279
 spoiled-0.270
 hinge-0.269
Tokenger
Token ID1362
Feature activation+1.58e+00

Pos logit contributions
jac+0.611
arc+0.609
Birth+0.607
acion+0.585
ivia+0.580
Neg logit contributions
 plural-0.625
hin-0.517
unct-0.517
 spoiled-0.502
 hinge-0.498
Tokenald
Token ID1940
Feature activation-7.32e-01

Pos logit contributions
 plural+1.136
hin+0.978
unct+0.965
 Jin+0.927
 hinge+0.906
Neg logit contributions
Birth-1.323
jac-1.308
arc-1.278
 Spending-1.254
76561-1.247
Token Bo
Token ID3248
Feature activation+3.00e-01

Pos logit contributions
jac+0.592
arc+0.588
Birth+0.587
ivia+0.570
acion+0.569
Neg logit contributions
 plural-0.465
unct-0.375
hin-0.369
 spoiled-0.361
 hinge-0.357
Tokenass
Token ID562
Feature activation+5.56e+00

Pos logit contributions
 plural+0.190
hin+0.158
unct+0.156
 spoiled+0.150
 hinge+0.147
Neg logit contributions
Birth-0.239
jac-0.238
arc-0.236
acion-0.231
 Spending-0.229
Tokenon
Token ID261
Feature activation-7.33e-01

Pos logit contributions
 plural+3.289
hin+2.817
unct+2.733
 spoiled+2.540
 Jin+2.502
Neg logit contributions
Birth-4.574
jac-4.438
arc-4.410
acion-4.362
 Spending-4.300
Token H
Token ID367
Feature activation-2.94e-01

Pos logit contributions
jac+0.539
arc+0.536
Birth+0.535
 Spending+0.509
ivia+0.501
Neg logit contributions
 plural-0.539
unct-0.440
 spoiled-0.420
hin-0.418
zbollah-0.409
Tokenagen
Token ID11286
Feature activation+9.53e+00

Pos logit contributions
jac+0.200
Birth+0.199
arc+0.198
acion+0.191
 Spending+0.189
Neg logit contributions
 plural-0.223
unct-0.188
hin-0.188
 spoiled-0.182
 hinge-0.180
Token (
Token ID357
Feature activation+1.41e+00

Pos logit contributions
 plural+5.642
hin+4.717
unct+4.561
 spoiled+4.495
 Jin+4.374
Neg logit contributions
Birth-7.530
arc-7.499
jac-7.498
acion-7.302
ivia-7.235
TokenSky
Token ID22308
Feature activation+1.41e+00

Pos logit contributions
 plural+1.099
hin+0.928
unct+0.921
 spoiled+0.888
 hinge+0.883
Neg logit contributions
jac-0.914
arc-0.907
acion-0.890
Birth-0.882
ivia-0.880
Token Pro
Token ID1041
Feature activation+7.56e-01

Pos logit contributions
 plural+1.134
hin+0.961
unct+0.959
 spoiled+0.931
 hinge+0.921
Neg logit contributions
jac-0.902
arc-0.897
Birth-0.896
acion-0.871
ivia-0.860
Tokency
Token ID948
Feature activation+4.94e-01

Pos logit contributions
 plural+0.851
hin+0.760
unct+0.758
 spoiled+0.741
 hinge+0.739
Neg logit contributions
jac-0.237
arc-0.237
Birth-0.234
acion-0.216
ivia-0.213
Tokencling
Token ID8493
Feature activation-7.37e-01

Pos logit contributions
 plural+0.417
hin+0.355
unct+0.355
 spoiled+0.347
 hinge+0.345
Neg logit contributions
jac-0.297
arc-0.295
Birth-0.294
acion-0.281
ivia-0.279
Token articles
Token ID6685
Feature activation+2.95e+00

Pos logit contributions
 plural+0.392
hin+0.311
unct+0.309
 spoiled+0.303
 hinge+0.298
Neg logit contributions
jac-0.500
arc-0.497
Birth-0.496
acion-0.481
ivia-0.477
Token,
Token ID11
Feature activation+8.15e-01

Pos logit contributions
 plural+2.110
 spoiled+1.713
hin+1.709
unct+1.691
 hinge+1.649
Neg logit contributions
jac-2.198
Birth-2.186
arc-2.172
acion-2.092
 Spending-2.084
Token and
Token ID290
Feature activation+1.49e+00

Pos logit contributions
 plural+0.634
unct+0.525
hin+0.522
 spoiled+0.515
 hinge+0.510
Neg logit contributions
jac-0.551
Birth-0.548
arc-0.546
acion-0.525
ivia-0.520
Token arguments
Token ID7159
Feature activation+3.48e+00

Pos logit contributions
 plural+1.073
unct+0.871
hin+0.870
 spoiled+0.856
 hinge+0.846
Neg logit contributions
jac-1.093
Birth-1.089
arc-1.087
acion-1.047
ivia-1.039
Token,
Token ID11
Feature activation+1.13e+00

Pos logit contributions
 plural+2.445
 spoiled+1.974
hin+1.974
unct+1.966
 hinge+1.907
Neg logit contributions
jac-2.614
Birth-2.599
arc-2.573
acion-2.490
ivia-2.479
Token partly
Token ID11476
Feature activation+9.36e+00

Pos logit contributions
 plural+0.834
hin+0.683
unct+0.680
 spoiled+0.671
 hinge+0.661
Neg logit contributions
jac-0.809
Birth-0.802
arc-0.801
acion-0.770
ivia-0.766
Token also
Token ID635
Feature activation+2.37e-01

Pos logit contributions
 plural+5.408
 spoiled+4.141
hin+4.092
unct+4.034
 hinge+3.957
Neg logit contributions
jac-7.955
Birth-7.909
arc-7.827
ivia-7.540
 Spending-7.523
Token because
Token ID780
Feature activation+1.16e+00

Pos logit contributions
 plural+0.148
unct+0.119
hin+0.119
 spoiled+0.113
 hinge+0.113
Neg logit contributions
arc-0.193
jac-0.193
Birth-0.192
acion-0.187
ivia-0.185
Token those
Token ID883
Feature activation-5.15e-01

Pos logit contributions
 plural+0.823
hin+0.669
unct+0.664
 spoiled+0.655
 hinge+0.645
Neg logit contributions
jac-0.863
arc-0.860
Birth-0.856
acion-0.826
ivia-0.822
Token things
Token ID1243
Feature activation-1.37e+00

Pos logit contributions
jac+0.387
arc+0.386
Birth+0.385
acion+0.372
ivia+0.369
Neg logit contributions
 plural-0.356
hin-0.292
unct-0.292
 spoiled-0.282
 hinge-0.280
Token that
Token ID326
Feature activation-1.51e+00

Pos logit contributions
jac+1.009
arc+1.008
Birth+1.002
acion+0.968
ivia+0.961
Neg logit contributions
 plural-0.969
unct-0.800
hin-0.799
 spoiled-0.770
 hinge-0.766
Token proper
Token ID1774
Feature activation+2.27e+00

Pos logit contributions
 plural+0.705
unct+0.578
hin+0.577
 spoiled+0.564
 hinge+0.560
Neg logit contributions
jac-0.697
Birth-0.694
arc-0.689
ivia-0.667
acion-0.665
Token --
Token ID1377
Feature activation+1.59e+00

Pos logit contributions
 plural+1.795
hin+1.489
unct+1.479
 spoiled+1.446
 hinge+1.441
Neg logit contributions
jac-1.500
Birth-1.499
arc-1.481
acion-1.427
ivia-1.425
Token finally
Token ID3443
Feature activation-9.44e-01

Pos logit contributions
 plural+1.092
unct+0.891
hin+0.890
 spoiled+0.871
 hinge+0.862
Neg logit contributions
jac-1.203
Birth-1.197
arc-1.195
acion-1.153
 Spending-1.144
Token has
Token ID468
Feature activation-6.24e-01

Pos logit contributions
jac+0.681
arc+0.680
Birth+0.677
acion+0.653
ivia+0.648
Neg logit contributions
 plural-0.681
unct-0.564
hin-0.562
 spoiled-0.544
 hinge-0.541
Token a
Token ID257
Feature activation+1.42e-01

Pos logit contributions
jac+0.468
arc+0.466
Birth+0.464
acion+0.449
ivia+0.445
Neg logit contributions
 plural-0.434
unct-0.356
hin-0.356
 spoiled-0.345
 hinge-0.342
Token tenant
Token ID18285
Feature activation+9.34e+00

Pos logit contributions
 plural+0.100
unct+0.082
hin+0.082
 spoiled+0.080
 hinge+0.079
Neg logit contributions
jac-0.106
arc-0.105
Birth-0.105
acion-0.101
ivia-0.100
Token that
Token ID326
Feature activation-5.24e-01

Pos logit contributions
 plural+6.027
 spoiled+4.847
hin+4.819
 hinge+4.714
 Jin+4.575
Neg logit contributions
Birth-7.096
jac-7.011
arc-6.898
 Spending-6.858
acion-6.844
Token people
Token ID661
Feature activation+1.40e+00

Pos logit contributions
jac+0.392
arc+0.391
Birth+0.389
acion+0.376
ivia+0.373
Neg logit contributions
 plural-0.365
unct-0.299
hin-0.299
 spoiled-0.290
 hinge-0.288
Token will
Token ID481
Feature activation+6.70e-01

Pos logit contributions
 plural+0.968
hin+0.787
unct+0.779
 spoiled+0.769
 hinge+0.761
Neg logit contributions
jac-1.069
Birth-1.069
arc-1.064
acion-1.029
ivia-1.015
Token brave
Token ID14802
Feature activation-1.00e+00

Pos logit contributions
 plural+0.434
hin+0.348
unct+0.346
 spoiled+0.339
 hinge+0.338
Neg logit contributions
jac-0.537
Birth-0.536
arc-0.534
acion-0.518
ivia-0.512
Token the
Token ID262
Feature activation-9.67e-01

Pos logit contributions
jac+0.717
arc+0.715
Birth+0.712
acion+0.686
ivia+0.680
Neg logit contributions
 plural-0.737
hin-0.610
unct-0.609
 spoiled-0.592
 hinge-0.588
Token most
Token ID749
Feature activation-7.87e-02

Pos logit contributions
jac+0.758
arc+0.755
Birth+0.752
acion+0.723
ivia+0.717
Neg logit contributions
 plural-0.872
hin-0.730
unct-0.730
 spoiled-0.710
 hinge-0.705
Token of
Token ID286
Feature activation+1.04e+00

Pos logit contributions
jac+0.059
arc+0.059
Birth+0.059
acion+0.057
ivia+0.056
Neg logit contributions
 plural-0.054
unct-0.044
hin-0.044
 spoiled-0.043
 hinge-0.043
Token the
Token ID262
Feature activation+8.14e-01

Pos logit contributions
 plural+0.699
unct+0.565
hin+0.561
 spoiled+0.553
 hinge+0.542
Neg logit contributions
jac-0.803
arc-0.797
Birth-0.797
acion-0.768
ivia-0.764
Token 1980
Token ID7169
Feature activation+2.65e+00

Pos logit contributions
 plural+0.565
unct+0.457
hin+0.455
 spoiled+0.448
 hinge+0.441
Neg logit contributions
jac-0.619
arc-0.614
Birth-0.614
ivia-0.589
acion-0.588
Tokens
Token ID82
Feature activation+1.07e+00

Pos logit contributions
 plural+1.746
unct+1.423
hin+1.409
 spoiled+1.360
 hinge+1.339
Neg logit contributions
jac-2.075
Birth-2.065
arc-2.060
acion-1.994
ivia-1.988
Token partly
Token ID11476
Feature activation+9.24e+00

Pos logit contributions
 plural+0.772
hin+0.635
unct+0.633
 spoiled+0.619
 hinge+0.612
Neg logit contributions
jac-0.778
arc-0.773
Birth-0.771
acion-0.743
ivia-0.740
Token due
Token ID2233
Feature activation-6.76e-01

Pos logit contributions
 plural+4.702
 spoiled+3.658
hin+3.531
 hinge+3.474
unct+3.384
Neg logit contributions
jac-8.454
Birth-8.358
arc-8.326
acion-8.120
ivia-8.073
Token to
Token ID284
Feature activation+2.02e+00

Pos logit contributions
jac+0.441
arc+0.439
Birth+0.438
acion+0.422
ivia+0.415
Neg logit contributions
 plural-0.528
unct-0.449
hin-0.447
 spoiled-0.431
 hinge-0.428
Token spl
Token ID4328
Feature activation+5.32e-02

Pos logit contributions
 plural+1.397
hin+1.129
unct+1.126
 spoiled+1.113
 hinge+1.087
Neg logit contributions
jac-1.541
arc-1.535
Birth-1.529
acion-1.476
ivia-1.472
Tokeninter
Token ID3849
Feature activation+4.11e+00

Pos logit contributions
 plural+0.042
hin+0.036
unct+0.036
 spoiled+0.035
 hinge+0.034
Neg logit contributions
jac-0.035
Birth-0.034
arc-0.034
acion-0.033
ivia-0.033
Tokening
Token ID278
Feature activation+1.14e+00

Pos logit contributions
 plural+3.443
hin+2.903
unct+2.885
 spoiled+2.862
 hinge+2.798
Neg logit contributions
jac-2.502
Birth-2.455
arc-2.425
ivia-2.351
acion-2.316
TokenThe
Token ID464
Feature activation+1.28e-01

Pos logit contributions
jac+0.488
arc+0.486
Birth+0.483
acion+0.467
ivia+0.464
Neg logit contributions
 plural-0.547
hin-0.459
unct-0.458
 spoiled-0.442
 hinge-0.439
Token non
Token ID1729
Feature activation-1.74e+00

Pos logit contributions
 plural+0.090
hin+0.073
unct+0.073
 spoiled+0.070
 hinge+0.070
Neg logit contributions
jac-0.097
arc-0.097
Birth-0.096
acion-0.093
ivia-0.092
Token-
Token ID12
Feature activation+1.77e+00

Pos logit contributions
jac+1.229
arc+1.227
Birth+1.227
acion+1.176
 Spending+1.163
Neg logit contributions
 plural-1.252
unct-1.049
hin-1.043
 spoiled-1.029
 hinge-1.012
Tokenprofit
Token ID9183
Feature activation+2.57e+00

Pos logit contributions
 plural+1.040
hin+0.823
unct+0.818
 spoiled+0.768
 hinge+0.767
Neg logit contributions
jac-1.515
arc-1.504
Birth-1.495
ivia-1.465
acion-1.464
Token,
Token ID11
Feature activation+3.30e+00

Pos logit contributions
 plural+1.700
hin+1.360
unct+1.343
 hinge+1.320
 spoiled+1.297
Neg logit contributions
jac-2.041
arc-2.011
Birth-2.004
ivia-1.942
acion-1.934
Token self
Token ID2116
Feature activation+9.24e+00

Pos logit contributions
 plural+2.314
hin+1.845
unct+1.838
 hinge+1.806
 spoiled+1.789
Neg logit contributions
jac-2.526
arc-2.474
Birth-2.463
acion-2.373
ivia-2.373
Token-
Token ID12
Feature activation+2.37e+00

Pos logit contributions
 plural+5.790
unct+4.528
hin+4.391
 spoiled+4.347
 hinge+4.319
Neg logit contributions
jac-7.565
arc-7.434
Birth-7.387
acion-7.231
ivia-7.212
Tokengovern
Token ID47866
Feature activation-9.81e-01

Pos logit contributions
 plural+1.773
unct+1.483
hin+1.476
 spoiled+1.425
 hinge+1.411
Neg logit contributions
jac-1.651
arc-1.639
Birth-1.635
acion-1.581
ivia-1.572
Tokened
Token ID276
Feature activation-1.10e+00

Pos logit contributions
jac+0.641
arc+0.641
Birth+0.635
acion+0.613
ivia+0.606
Neg logit contributions
 plural-0.776
hin-0.652
unct-0.650
 spoiled-0.635
 hinge-0.631
Token organisation
Token ID12684
Feature activation+9.79e-01

Pos logit contributions
jac+0.832
arc+0.830
Birth+0.827
acion+0.799
ivia+0.792
Neg logit contributions
 plural-0.759
hin-0.620
unct-0.620
 spoiled-0.601
 hinge-0.596
Token "
Token ID366
Feature activation-5.79e-01

Pos logit contributions
 plural+0.705
hin+0.578
unct+0.575
 spoiled+0.561
 hinge+0.558
Neg logit contributions
jac-0.718
arc-0.710
Birth-0.710
acion-0.681
ivia-0.678
Token eventual
Token ID19657
Feature activation+2.86e-01

Pos logit contributions
 plural+0.261
unct+0.211
hin+0.211
 spoiled+0.204
 hinge+0.202
Neg logit contributions
jac-0.310
arc-0.310
Birth-0.308
acion-0.298
ivia-0.296
Token occupation
Token ID13755
Feature activation+3.01e+00

Pos logit contributions
 plural+0.192
hin+0.156
unct+0.156
 spoiled+0.151
 hinge+0.149
Neg logit contributions
jac-0.222
arc-0.221
Birth-0.220
acion-0.213
ivia-0.211
Token by
Token ID416
Feature activation-3.57e-01

Pos logit contributions
 plural+2.144
hin+1.760
unct+1.739
 spoiled+1.701
 hinge+1.699
Neg logit contributions
jac-2.222
arc-2.197
Birth-2.184
acion-2.102
ivia-2.085
Token two
Token ID734
Feature activation+3.97e+00

Pos logit contributions
jac+0.260
arc+0.260
Birth+0.258
acion+0.249
ivia+0.247
Neg logit contributions
 plural-0.256
hin-0.211
unct-0.211
 spoiled-0.205
 hinge-0.203
Token major
Token ID1688
Feature activation+1.43e+00

Pos logit contributions
 plural+2.934
hin+2.436
unct+2.424
 spoiled+2.382
 hinge+2.359
Neg logit contributions
jac-2.810
Birth-2.785
arc-2.758
acion-2.677
ivia-2.652
Token tenants
Token ID21445
Feature activation+9.22e+00

Pos logit contributions
 plural+1.082
hin+0.901
unct+0.899
 spoiled+0.876
 hinge+0.870
Neg logit contributions
jac-0.998
Birth-0.989
arc-0.987
acion-0.952
ivia-0.943
Token.
Token ID13
Feature activation+8.77e-01

Pos logit contributions
 plural+5.795
hin+4.760
 spoiled+4.513
 hinge+4.494
unct+4.483
Neg logit contributions
jac-7.138
Birth-7.076
arc-6.965
ivia-6.809
acion-6.769
Token Google
Token ID3012
Feature activation+5.01e+00

Pos logit contributions
 plural+0.654
hin+0.546
unct+0.545
 spoiled+0.529
 hinge+0.524
Neg logit contributions
jac-0.612
arc-0.608
Birth-0.605
acion-0.585
ivia-0.581
Token has
Token ID468
Feature activation+1.27e+00

Pos logit contributions
 plural+3.533
unct+2.896
hin+2.862
 spoiled+2.828
 hinge+2.763
Neg logit contributions
jac-3.671
Birth-3.659
arc-3.609
acion-3.440
ivia-3.436
Token signed
Token ID4488
Feature activation-3.46e-02

Pos logit contributions
 plural+0.905
hin+0.741
unct+0.740
 spoiled+0.727
 hinge+0.712
Neg logit contributions
jac-0.943
Birth-0.937
arc-0.936
acion-0.898
ivia-0.891
Token on
Token ID319
Feature activation+5.86e-01

Pos logit contributions
jac+0.026
arc+0.026
Birth+0.026
acion+0.025
ivia+0.025
Neg logit contributions
 plural-0.024
unct-0.020
hin-0.020
 spoiled-0.019
 hinge-0.019
TokenANS
Token ID15037
Feature activation+4.57e+00

Pos logit contributions
 plural+0.400
hin+0.333
unct+0.329
 spoiled+0.322
 hinge+0.320
Neg logit contributions
jac-0.376
arc-0.375
Birth-0.374
acion-0.361
ivia-0.358
TokenBUR
Token ID38926
Feature activation+1.09e+00

Pos logit contributions
 plural+3.028
hin+2.550
 spoiled+2.443
unct+2.404
 hinge+2.376
Neg logit contributions
jac-3.480
arc-3.404
Birth-3.397
acion-3.337
ivia-3.330
TokenG
Token ID38
Feature activation-2.85e-02

Pos logit contributions
 plural+0.763
hin+0.639
 spoiled+0.621
unct+0.611
 hinge+0.607
Neg logit contributions
jac-0.795
arc-0.788
Birth-0.779
acion-0.763
 Spending-0.759
Token,
Token ID11
Feature activation+5.17e+00

Pos logit contributions
jac+0.019
arc+0.019
Birth+0.019
acion+0.018
ivia+0.018
Neg logit contributions
 plural-0.022
hin-0.019
unct-0.019
 spoiled-0.018
 hinge-0.018
Token N
Token ID399
Feature activation+2.94e+00

Pos logit contributions
 plural+3.763
 spoiled+3.017
hin+3.016
 hinge+2.993
unct+2.972
Neg logit contributions
jac-3.743
arc-3.649
acion-3.613
Birth-3.587
ivia-3.507
Token.
Token ID13
Feature activation+9.21e+00

Pos logit contributions
 plural+1.946
hin+1.631
unct+1.604
 spoiled+1.539
 hinge+1.527
Neg logit contributions
jac-2.230
Birth-2.220
arc-2.212
acion-2.154
ivia-2.149
TokenJ
Token ID41
Feature activation+4.86e+00

Pos logit contributions
 plural+4.933
hin+3.839
 Jin+3.775
Dur+3.708
 spoiled+3.616
Neg logit contributions
jac-7.751
ivia-7.719
arc-7.698
acion-7.692
ivo-7.511
Token.
Token ID13
Feature activation+6.49e+00

Pos logit contributions
 plural+3.160
unct+2.617
hin+2.606
 spoiled+2.481
 hinge+2.432
Neg logit contributions
jac-3.729
Birth-3.640
arc-3.636
acion-3.580
ivia-3.519
Token
Token ID851
Feature activation+2.93e+00

Pos logit contributions
 plural+4.783
hin+3.944
unct+3.886
 spoiled+3.857
 Jin+3.842
Neg logit contributions
jac-4.460
arc-4.302
acion-4.249
Birth-4.177
ivia-4.143
Token Hundreds
Token ID32636
Feature activation+2.25e+00

Pos logit contributions
 plural+2.093
hin+1.744
unct+1.732
 spoiled+1.666
 Jin+1.665
Neg logit contributions
arc-2.115
jac-2.112
Birth-2.070
acion-2.035
ivia-2.034
Token of
Token ID286
Feature activation-1.77e+00

Pos logit contributions
 plural+1.513
hin+1.224
unct+1.213
 spoiled+1.197
 hinge+1.175
Neg logit contributions
jac-1.742
arc-1.734
Birth-1.731
ivia-1.659
acion-1.659
Token
Token ID198
Feature activation+4.62e-01

Pos logit contributions
 plural+0.300
hin+0.252
unct+0.252
 spoiled+0.245
 hinge+0.243
Neg logit contributions
jac-0.252
arc-0.251
Birth-0.249
acion-0.240
ivia-0.238
TokenMORE
Token ID23346
Feature activation+1.90e+00

Pos logit contributions
 plural+0.347
hin+0.291
unct+0.290
 spoiled+0.281
 hinge+0.278
Neg logit contributions
jac-0.318
arc-0.316
Birth-0.314
acion-0.304
ivia-0.302
Token INFO
Token ID24890
Feature activation+3.15e+00

Pos logit contributions
 plural+1.230
hin+0.996
unct+0.982
 spoiled+0.960
 hinge+0.943
Neg logit contributions
jac-1.515
arc-1.507
Birth-1.501
acion-1.460
ivia-1.446
Token:
Token ID25
Feature activation-5.98e-01

Pos logit contributions
 plural+2.040
hin+1.647
unct+1.626
 spoiled+1.595
 hinge+1.556
Neg logit contributions
jac-2.476
arc-2.459
Birth-2.426
acion-2.379
ivia-2.361
Token Our
Token ID3954
Feature activation+4.04e+00

Pos logit contributions
jac+0.402
arc+0.401
Birth+0.399
acion+0.384
ivia+0.381
Neg logit contributions
 plural-0.461
unct-0.386
hin-0.386
 spoiled-0.376
 hinge-0.373
Token sub
Token ID850
Feature activation+9.19e+00

Pos logit contributions
 plural+2.834
hin+2.316
unct+2.287
 spoiled+2.258
 hinge+2.218
Neg logit contributions
jac-2.990
Birth-2.970
arc-2.967
acion-2.850
ivia-2.820
Token-
Token ID12
Feature activation+3.26e+00

Pos logit contributions
 plural+5.835
unct+4.685
hin+4.540
 hinge+4.476
 spoiled+4.421
Neg logit contributions
Birth-7.000
jac-6.976
arc-6.847
acion-6.825
ivia-6.780
Token2
Token ID17
Feature activation+2.31e+00

Pos logit contributions
 plural+2.407
unct+2.004
hin+1.994
 spoiled+1.914
 hinge+1.895
Neg logit contributions
jac-2.263
Birth-2.251
arc-2.234
acion-2.186
 Spending-2.164
Token marathon
Token ID22336
Feature activation-2.20e+00

Pos logit contributions
 plural+1.544
hin+1.248
unct+1.247
 spoiled+1.205
 hinge+1.197
Neg logit contributions
jac-1.792
arc-1.784
Birth-1.778
acion-1.721
ivia-1.705
Token page
Token ID2443
Feature activation+3.51e+00

Pos logit contributions
jac+1.638
arc+1.635
Birth+1.624
acion+1.570
ivia+1.562
Neg logit contributions
 plural-1.528
unct-1.259
hin-1.254
 spoiled-1.213
 hinge-1.203
Token has
Token ID468
Feature activation+4.04e+00

Pos logit contributions
 plural+2.463
hin+2.028
unct+2.016
 spoiled+1.973
 hinge+1.940
Neg logit contributions
jac-2.601
Birth-2.566
arc-2.558
acion-2.444
ivia-2.432
Token,
Token ID11
Feature activation+1.82e+00

Pos logit contributions
 plural+1.791
hin+1.474
unct+1.446
 spoiled+1.415
 hinge+1.405
Neg logit contributions
jac-1.874
arc-1.866
Birth-1.864
acion-1.808
ivia-1.785
Token
Token ID447
Feature activation+2.05e+00

Pos logit contributions
 plural+1.177
hin+0.943
unct+0.934
 spoiled+0.912
 hinge+0.901
Neg logit contributions
jac-1.456
arc-1.450
Birth-1.447
acion-1.402
ivia-1.395
Token
Token ID251
Feature activation+2.44e+00

Pos logit contributions
 plural+1.816
hin+1.562
unct+1.549
 spoiled+1.522
 hinge+1.513
Neg logit contributions
jac-1.153
arc-1.138
Birth-1.129
acion-1.080
ivia-1.073
Token Rubio
Token ID15476
Feature activation+1.48e+00

Pos logit contributions
 plural+1.632
hin+1.317
unct+1.298
 spoiled+1.274
 hinge+1.259
Neg logit contributions
jac-1.899
arc-1.892
Birth-1.891
acion-1.831
ivia-1.821
Token told
Token ID1297
Feature activation-4.52e+00

Pos logit contributions
 plural+1.293
hin+1.105
unct+1.101
 spoiled+1.082
 hinge+1.069
Neg logit contributions
jac-0.847
arc-0.840
Birth-0.836
acion-0.801
ivia-0.794
Token Fusion
Token ID21278
Feature activation+9.11e+00

Pos logit contributions
jac+3.272
arc+3.266
Birth+3.255
acion+3.137
 Spending+3.125
Neg logit contributions
 plural-3.167
hin-2.630
unct-2.604
 spoiled-2.541
 hinge-2.525
Token in
Token ID287
Feature activation+7.01e-01

Pos logit contributions
 plural+5.649
unct+4.574
 spoiled+4.522
hin+4.515
 hinge+4.350
Neg logit contributions
jac-7.107
arc-6.885
Birth-6.847
ivia-6.728
acion-6.675
Token 2014
Token ID1946
Feature activation-1.61e+00

Pos logit contributions
 plural+0.490
hin+0.399
unct+0.399
 spoiled+0.388
 hinge+0.385
Neg logit contributions
jac-0.525
arc-0.523
Birth-0.521
acion-0.503
ivia-0.501
Token.
Token ID13
Feature activation-4.79e-02

Pos logit contributions
arc+1.101
Birth+1.101
jac+1.100
acion+1.056
ivia+1.045
Neg logit contributions
 plural-1.214
unct-1.015
hin-1.011
 spoiled-0.985
 hinge-0.977
Token "
Token ID366
Feature activation-8.52e-01

Pos logit contributions
jac+0.032
arc+0.032
Birth+0.032
acion+0.030
ivia+0.030
Neg logit contributions
 plural-0.038
hin-0.031
unct-0.031
 spoiled-0.031
 hinge-0.030
TokenAnd
Token ID1870
Feature activation+1.32e+00

Pos logit contributions
jac+0.626
arc+0.624
Birth+0.621
acion+0.600
ivia+0.595
Neg logit contributions
 plural-0.606
hin-0.499
unct-0.498
 spoiled-0.483
 hinge-0.480
Token suggested
Token ID5220
Feature activation+6.33e-01

Pos logit contributions
 plural+0.844
hin+0.695
unct+0.686
 spoiled+0.676
 hinge+0.667
Neg logit contributions
jac-0.870
arc-0.866
Birth-0.864
acion-0.833
ivia-0.824
Token to
Token ID284
Feature activation-3.79e+00

Pos logit contributions
 plural+0.441
hin+0.358
unct+0.354
 spoiled+0.348
 hinge+0.346
Neg logit contributions
arc-0.478
jac-0.478
Birth-0.474
acion-0.460
ivia-0.456
Token Joseph
Token ID7212
Feature activation+2.58e+00

Pos logit contributions
jac+2.735
arc+2.730
Birth+2.729
acion+2.618
 Spending+2.611
Neg logit contributions
 plural-2.653
unct-2.216
hin-2.183
 spoiled-2.119
 hinge-2.075
Token Stalin
Token ID17482
Feature activation+4.08e+00

Pos logit contributions
 plural+1.537
hin+1.236
unct+1.213
 spoiled+1.178
 hinge+1.163
Neg logit contributions
jac-2.173
Birth-2.168
arc-2.147
acion-2.112
ivia-2.077
Token a
Token ID257
Feature activation+8.23e-01

Pos logit contributions
 plural+2.724
hin+2.202
unct+2.138
 spoiled+2.123
 hinge+2.104
Neg logit contributions
jac-3.168
arc-3.158
Birth-3.143
acion-3.058
ivia-3.012
Token preempt
Token ID38726
Feature activation+9.05e+00

Pos logit contributions
 plural+0.551
hin+0.444
unct+0.443
 spoiled+0.432
 hinge+0.429
Neg logit contributions
jac-0.642
arc-0.642
Birth-0.639
acion-0.620
ivia-0.614
Tokenive
Token ID425
Feature activation+3.61e+00

Pos logit contributions
 plural+6.970
unct+5.918
hin+5.876
 spoiled+5.672
 hinge+5.629
Neg logit contributions
jac-5.580
arc-5.522
Birth-5.503
acion-5.355
 Spending-5.164
Token attack
Token ID1368
Feature activation+1.62e+00

Pos logit contributions
 plural+2.294
hin+1.796
 hinge+1.771
unct+1.762
 spoiled+1.760
Neg logit contributions
jac-2.948
arc-2.936
Birth-2.928
acion-2.859
ivia-2.825
Token against
Token ID1028
Feature activation+1.12e+00

Pos logit contributions
 plural+1.099
hin+0.891
unct+0.884
 spoiled+0.868
 hinge+0.860
Neg logit contributions
jac-1.254
arc-1.245
Birth-1.242
acion-1.202
ivia-1.186
Token Germany
Token ID4486
Feature activation-9.28e-01

Pos logit contributions
 plural+0.776
hin+0.631
unct+0.627
 spoiled+0.614
 hinge+0.608
Neg logit contributions
jac-0.851
arc-0.847
Birth-0.842
acion-0.816
ivia-0.806
Token through
Token ID832
Feature activation-7.31e-01

Pos logit contributions
arc+0.670
jac+0.669
Birth+0.666
acion+0.643
ivia+0.639
Neg logit contributions
 plural-0.667
unct-0.553
hin-0.551
 spoiled-0.533
 hinge-0.531
Token written
Token ID3194
Feature activation-1.92e+00

Pos logit contributions
jac+0.416
arc+0.416
Birth+0.414
acion+0.399
ivia+0.396
Neg logit contributions
 plural-0.395
hin-0.324
unct-0.324
 spoiled-0.314
 hinge-0.312
Token by
Token ID416
Feature activation-1.17e+00

Pos logit contributions
arc+1.370
jac+1.359
Birth+1.354
acion+1.304
ivia+1.296
Neg logit contributions
 plural-1.388
hin-1.160
unct-1.158
 spoiled-1.115
 hinge-1.113
Token other
Token ID584
Feature activation-1.42e+00

Pos logit contributions
jac+0.813
arc+0.811
Birth+0.807
acion+0.778
ivia+0.771
Neg logit contributions
 plural-0.873
hin-0.726
unct-0.726
 spoiled-0.706
 hinge-0.701
Token authors
Token ID7035
Feature activation-2.75e+00

Pos logit contributions
jac+1.010
arc+1.008
Birth+1.003
acion+0.966
ivia+0.958
Neg logit contributions
 plural-1.037
hin-0.861
unct-0.860
 spoiled-0.834
 hinge-0.829
Token,
Token ID11
Feature activation-6.01e-01

Pos logit contributions
arc+1.890
jac+1.880
Birth+1.867
acion+1.794
ivia+1.785
Neg logit contributions
 plural-2.062
unct-1.729
hin-1.723
 spoiled-1.673
 hinge-1.671
Token partly
Token ID11476
Feature activation+9.04e+00

Pos logit contributions
jac+0.435
arc+0.433
Birth+0.432
acion+0.416
ivia+0.413
Neg logit contributions
 plural-0.435
hin-0.359
unct-0.358
 spoiled-0.349
 hinge-0.346
Token on
Token ID319
Feature activation+2.42e+00

Pos logit contributions
 plural+5.281
 spoiled+4.136
hin+4.008
unct+3.924
 hinge+3.883
Neg logit contributions
jac-7.630
Birth-7.630
arc-7.529
acion-7.310
ivia-7.273
Token account
Token ID1848
Feature activation+2.37e+00

Pos logit contributions
 plural+1.572
unct+1.239
hin+1.238
 spoiled+1.217
 hinge+1.206
Neg logit contributions
jac-1.950
arc-1.931
Birth-1.928
acion-1.869
ivia-1.853
Token of
Token ID286
Feature activation-1.85e-01

Pos logit contributions
 plural+1.619
hin+1.316
unct+1.305
 spoiled+1.275
 hinge+1.262
Neg logit contributions
jac-1.823
arc-1.813
Birth-1.807
acion-1.748
ivia-1.736
Token the
Token ID262
Feature activation+1.23e+00

Pos logit contributions
jac+0.135
arc+0.134
Birth+0.134
acion+0.129
ivia+0.128
Neg logit contributions
 plural-0.134
hin-0.110
unct-0.110
 spoiled-0.107
 hinge-0.106
Token multiplication
Token ID48473
Feature activation+7.86e-01

Pos logit contributions
 plural+0.901
hin+0.729
unct+0.728
 spoiled+0.719
 hinge+0.708
Neg logit contributions
jac-0.903
arc-0.897
Birth-0.895
acion-0.865
ivia-0.858
Token valuable
Token ID8119
Feature activation-2.10e-01

Pos logit contributions
arc+1.645
jac+1.639
Birth+1.633
acion+1.588
ivia+1.580
Neg logit contributions
 plural-0.870
unct-0.662
hin-0.657
 hinge-0.618
 spoiled-0.614
Token,
Token ID11
Feature activation+9.03e-01

Pos logit contributions
jac+0.137
arc+0.136
Birth+0.136
acion+0.130
ivia+0.130
Neg logit contributions
 plural-0.168
hin-0.140
unct-0.140
 spoiled-0.137
 hinge-0.136
Token the
Token ID262
Feature activation-1.26e+00

Pos logit contributions
 plural+0.640
hin+0.522
unct+0.521
 spoiled+0.512
 hinge+0.503
Neg logit contributions
jac-0.671
arc-0.665
Birth-0.663
acion-0.643
ivia-0.638
Token 1000
Token ID8576
Feature activation+2.12e-01

Pos logit contributions
jac+0.915
arc+0.914
Birth+0.910
acion+0.878
ivia+0.870
Neg logit contributions
 plural-0.900
hin-0.743
unct-0.742
 spoiled-0.720
 hinge-0.714
Token Swiss
Token ID14780
Feature activation-1.96e+00

Pos logit contributions
 plural+0.156
hin+0.129
unct+0.129
 spoiled+0.126
 hinge+0.124
Neg logit contributions
jac-0.150
arc-0.149
Birth-0.149
acion-0.143
ivia-0.143
Token franc
Token ID46754
Feature activation-1.03e+01

Pos logit contributions
arc+1.072
jac+1.067
Birth+1.059
acion+1.012
ivia+0.992
Neg logit contributions
 plural-1.753
unct-1.513
hin-1.512
 spoiled-1.474
 hinge-1.466
Token note
Token ID3465
Feature activation-3.30e+00

Pos logit contributions
arc+6.625
Birth+6.400
jac+6.362
acion+6.159
ivia+6.048
Neg logit contributions
 plural-7.508
unct-6.575
hin-6.514
 hinge-6.167
zbollah-6.128
Token (
Token ID357
Feature activation-2.08e-01

Pos logit contributions
arc+2.325
jac+2.309
Birth+2.303
acion+2.221
ivia+2.197
Neg logit contributions
 plural-2.413
hin-2.021
unct-2.020
 hinge-1.934
 spoiled-1.933
TokenUSD
Token ID29072
Feature activation-4.53e+00

Pos logit contributions
jac+0.144
arc+0.144
Birth+0.143
acion+0.138
ivia+0.137
Neg logit contributions
 plural-0.155
unct-0.129
hin-0.129
 spoiled-0.125
 hinge-0.123
Token 10
Token ID838
Feature activation-5.67e-01

Pos logit contributions
arc+3.256
jac+3.253
Birth+3.249
acion+3.123
 Spending+3.100
Neg logit contributions
 plural-3.225
unct-2.678
hin-2.671
 spoiled-2.595
 hinge-2.572
Token63
Token ID5066
Feature activation-2.54e+00

Pos logit contributions
jac+0.394
arc+0.392
Birth+0.391
acion+0.376
ivia+0.373
Neg logit contributions
 plural-0.426
hin-0.354
unct-0.354
 spoiled-0.345
 hinge-0.342
TokenBrend
Token ID48015
Feature activation-3.45e+00

Pos logit contributions
jac+0.235
arc+0.234
Birth+0.232
acion+0.225
ivia+0.224
Neg logit contributions
 plural-0.247
hin-0.206
unct-0.205
 spoiled-0.199
 hinge-0.197
Tokena
Token ID64
Feature activation-4.80e+00

Pos logit contributions
arc+2.516
jac+2.513
Birth+2.463
acion+2.417
ivia+2.391
Neg logit contributions
 plural-2.483
unct-2.020
hin-1.996
 spoiled-1.996
 hinge-1.984
Token Lee
Token ID5741
Feature activation-5.61e+00

Pos logit contributions
arc+3.398
jac+3.390
Birth+3.354
acion+3.270
ivia+3.229
Neg logit contributions
 plural-3.499
unct-2.905
hin-2.889
 hinge-2.814
 spoiled-2.813
Token Mar
Token ID1526
Feature activation-5.16e+00

Pos logit contributions
arc+3.968
jac+3.960
Birth+3.912
acion+3.826
ivia+3.773
Neg logit contributions
 plural-4.037
unct-3.386
hin-3.289
 hinge-3.244
 spoiled-3.238
Tokenquez
Token ID22281
Feature activation-1.36e+00

Pos logit contributions
arc+2.594
jac+2.581
Birth+2.499
acion+2.458
ivia+2.402
Neg logit contributions
 plural-4.899
unct-4.196
hin-4.172
 spoiled-4.139
 hinge-4.138
Token McC
Token ID5108
Feature activation-9.86e+00

Pos logit contributions
jac+0.939
arc+0.936
Birth+0.932
acion+0.898
ivia+0.890
Neg logit contributions
 plural-1.032
unct-0.861
hin-0.860
 spoiled-0.836
 hinge-0.831
Tokenool
Token ID970
Feature activation+1.22e-01

Pos logit contributions
arc+6.506
jac+6.321
Birth+6.180
ivia+6.173
acion+6.140
Neg logit contributions
 plural-7.714
 hinge-6.289
 spoiled-6.251
hin-6.056
unct-6.049
Token,
Token ID11
Feature activation-2.16e-01

Pos logit contributions
 plural+0.092
hin+0.076
unct+0.076
 spoiled+0.074
 hinge+0.073
Neg logit contributions
jac-0.085
arc-0.085
Birth-0.084
acion-0.081
ivia-0.081
Token 49
Token ID5125
Feature activation-9.09e-01

Pos logit contributions
jac+0.171
arc+0.170
Birth+0.169
acion+0.164
ivia+0.163
Neg logit contributions
 plural-0.142
hin-0.115
unct-0.114
 spoiled-0.111
 hinge-0.110
Token
Token ID198
Feature activation-2.15e+00

Pos logit contributions
jac+0.589
arc+0.587
Birth+0.586
acion+0.562
ivia+0.556
Neg logit contributions
 plural-0.724
unct-0.611
hin-0.611
 spoiled-0.595
 hinge-0.591
Token
Token ID198
Feature activation-2.10e+00

Pos logit contributions
jac+1.323
arc+1.322
Birth+1.319
acion+1.249
 Spending+1.237
Neg logit contributions
 plural-1.779
unct-1.511
hin-1.505
 spoiled-1.481
 hinge-1.477
Token statement
Token ID2643
Feature activation-5.74e-03

Pos logit contributions
 plural+0.087
unct+0.071
hin+0.071
 spoiled+0.069
 hinge+0.068
Neg logit contributions
jac-0.096
arc-0.096
Birth-0.096
acion-0.092
ivia-0.092
Token on
Token ID319
Feature activation+8.00e-01

Pos logit contributions
jac+0.004
arc+0.004
Birth+0.004
acion+0.004
ivia+0.004
Neg logit contributions
 plural-0.004
hin-0.003
unct-0.003
 spoiled-0.003
 hinge-0.003
Token their
Token ID511
Feature activation+2.84e+00

Pos logit contributions
 plural+0.560
unct+0.453
hin+0.452
 spoiled+0.441
 hinge+0.438
Neg logit contributions
jac-0.603
arc-0.602
Birth-0.597
acion-0.580
ivia-0.575
Token
Token ID564
Feature activation-2.26e+00

Pos logit contributions
 plural+2.016
unct+1.626
hin+1.609
 spoiled+1.596
 hinge+1.585
Neg logit contributions
jac-2.118
arc-2.095
Birth-2.082
acion-2.022
ivia-2.010
Token
Token ID246
Feature activation+6.36e-01

Pos logit contributions
arc+1.091
jac+1.085
Birth+1.078
ivia+1.010
acion+1.010
Neg logit contributions
 plural-2.172
hin-1.893
unct-1.887
 spoiled-1.852
 hinge-1.841
Tokenpositive
Token ID24561
Feature activation-9.28e+00

Pos logit contributions
 plural+0.477
unct+0.398
hin+0.396
 spoiled+0.385
 hinge+0.381
Neg logit contributions
jac-0.441
arc-0.439
Birth-0.437
acion-0.423
ivia-0.419
Token case
Token ID1339
Feature activation+9.54e-01

Pos logit contributions
jac+6.967
Birth+6.960
arc+6.927
 Spending+6.671
ivia+6.666
Neg logit contributions
 plural-5.610
hin-4.883
unct-4.794
 spoiled-4.658
 hers-4.507
Token
Token ID447
Feature activation-1.01e+00

Pos logit contributions
 plural+0.654
unct+0.530
hin+0.527
 spoiled+0.514
 hinge+0.513
Neg logit contributions
jac-0.730
arc-0.727
Birth-0.725
acion-0.701
ivia-0.695
Token
Token ID247
Feature activation+1.31e-02

Pos logit contributions
jac+0.554
arc+0.553
Birth+0.549
acion+0.524
ivia+0.518
Neg logit contributions
 plural-0.904
hin-0.778
unct-0.777
 spoiled-0.760
 hinge-0.755
Token starts
Token ID4940
Feature activation-2.85e-01

Pos logit contributions
 plural+0.009
hin+0.007
unct+0.007
 spoiled+0.007
 hinge+0.007
Neg logit contributions
jac-0.010
arc-0.010
Birth-0.010
acion-0.010
ivia-0.009
Token by
Token ID416
Feature activation-2.29e+00

Pos logit contributions
jac+0.199
arc+0.197
Birth+0.196
acion+0.189
ivia+0.188
Neg logit contributions
 plural-0.215
hin-0.178
unct-0.178
 spoiled-0.174
 hinge-0.173
TokenAss
Token ID8021
Feature activation+2.09e+00

Pos logit contributions
jac+0.513
arc+0.512
Birth+0.508
acion+0.491
ivia+0.487
Neg logit contributions
 plural-0.601
hin-0.506
unct-0.506
 spoiled-0.490
 hinge-0.487
Tokenassin
Token ID44961
Feature activation-8.83e-01

Pos logit contributions
 plural+1.547
hin+1.315
unct+1.311
 spoiled+1.250
 hinge+1.245
Neg logit contributions
Birth-1.447
jac-1.438
arc-1.432
acion-1.378
 Spending-1.371
Token
Token ID447
Feature activation-4.37e-01

Pos logit contributions
jac+0.691
arc+0.689
Birth+0.686
acion+0.663
ivia+0.658
Neg logit contributions
 plural-0.586
hin-0.475
unct-0.475
 spoiled-0.460
 hinge-0.456
Token
Token ID247
Feature activation-4.22e-01

Pos logit contributions
jac+0.231
arc+0.230
Birth+0.229
acion+0.217
ivia+0.215
Neg logit contributions
 plural-0.400
hin-0.346
unct-0.346
 spoiled-0.338
 hinge-0.336
Tokens
Token ID82
Feature activation-2.28e-01

Pos logit contributions
jac+0.291
arc+0.290
Birth+0.288
acion+0.279
ivia+0.276
Neg logit contributions
 plural-0.318
hin-0.266
unct-0.265
 spoiled-0.258
 hinge-0.255
Token Creed
Token ID24299
Feature activation-9.27e+00

Pos logit contributions
jac+0.156
arc+0.156
Birth+0.155
acion+0.150
ivia+0.148
Neg logit contributions
 plural-0.174
hin-0.145
unct-0.145
 spoiled-0.141
 hinge-0.140
Token®
Token ID7461
Feature activation-5.28e+00

Pos logit contributions
arc+5.903
Birth+5.901
jac+5.759
 Spending+5.591
ivia+5.492
Neg logit contributions
 plural-7.043
hin-6.034
unct-5.967
 hinge-5.777
 spoiled-5.733
Token Liberation
Token ID29235
Feature activation-5.39e+00

Pos logit contributions
arc+3.539
Birth+3.538
jac+3.512
 Spending+3.345
acion+3.331
Neg logit contributions
 plural-4.031
unct-3.395
hin-3.392
 hinge-3.279
 spoiled-3.277
Token HD
Token ID5572
Feature activation-4.94e+00

Pos logit contributions
arc+3.600
Birth+3.595
jac+3.567
acion+3.432
ivia+3.415
Neg logit contributions
 plural-4.143
hin-3.484
unct-3.468
 hinge-3.341
 spoiled-3.339
Token,
Token ID11
Feature activation+8.01e-02

Pos logit contributions
Birth+3.356
arc+3.344
jac+3.333
 Spending+3.178
ivia+3.165
Neg logit contributions
 plural-3.746
unct-3.147
hin-3.121
 hinge-3.012
 spoiled-2.998
Token is
Token ID318
Feature activation-2.88e-01

Pos logit contributions
 plural+0.061
hin+0.051
unct+0.051
 spoiled+0.050
 hinge+0.049
Neg logit contributions
jac-0.055
arc-0.055
Birth-0.054
acion-0.053
ivia-0.052
Token class
Token ID1398
Feature activation+9.63e-01

Pos logit contributions
jac+3.032
arc+3.021
Birth+3.001
acion+2.889
ivia+2.888
Neg logit contributions
 plural-2.595
hin-2.159
unct-2.143
 spoiled-2.057
 hinge-2.041
Token (
Token ID357
Feature activation-8.84e-01

Pos logit contributions
 plural+0.685
hin+0.561
unct+0.559
 spoiled+0.548
 hinge+0.539
Neg logit contributions
jac-0.712
Birth-0.707
arc-0.706
acion-0.683
ivia-0.671
Tokenin
Token ID259
Feature activation-1.82e+00

Pos logit contributions
jac+0.623
arc+0.621
Birth+0.619
acion+0.597
ivia+0.592
Neg logit contributions
 plural-0.654
hin-0.544
unct-0.543
 spoiled-0.527
 hinge-0.523
Token the
Token ID262
Feature activation-4.97e-01

Pos logit contributions
jac+1.369
arc+1.366
Birth+1.358
acion+1.311
ivia+1.302
Neg logit contributions
 plural-1.261
hin-1.034
unct-1.032
 spoiled-1.002
 hinge-0.994
Token wrong
Token ID2642
Feature activation-3.94e+00

Pos logit contributions
jac+0.376
arc+0.375
Birth+0.373
acion+0.361
ivia+0.358
Neg logit contributions
 plural-0.342
hin-0.280
unct-0.280
 spoiled-0.271
 hinge-0.269
Token language
Token ID3303
Feature activation-9.18e+00

Pos logit contributions
jac+2.751
arc+2.749
Birth+2.739
acion+2.632
ivia+2.613
Neg logit contributions
 plural-2.872
hin-2.417
unct-2.405
 spoiled-2.338
 hinge-2.315
Token!)
Token ID8133
Feature activation-2.54e+00

Pos logit contributions
arc+5.015
jac+4.922
Birth+4.811
ivia+4.712
acion+4.598
Neg logit contributions
 plural-7.547
unct-6.756
hin-6.712
zbollah-6.507
 hinge-6.428
Token to
Token ID284
Feature activation-6.99e-01

Pos logit contributions
arc+1.719
jac+1.719
Birth+1.709
acion+1.644
ivia+1.636
Neg logit contributions
 plural-1.933
unct-1.622
hin-1.615
 spoiled-1.566
 hinge-1.566
Token becoming
Token ID5033
Feature activation-3.39e-01

Pos logit contributions
jac+0.508
arc+0.506
Birth+0.504
acion+0.487
ivia+0.482
Neg logit contributions
 plural-0.504
hin-0.416
unct-0.415
 spoiled-0.403
 hinge-0.400
Token a
Token ID257
Feature activation+1.65e+00

Pos logit contributions
jac+0.259
arc+0.258
Birth+0.257
acion+0.248
ivia+0.247
Neg logit contributions
 plural-0.230
hin-0.188
unct-0.188
 spoiled-0.182
 hinge-0.181
Token Junior
Token ID20000
Feature activation+3.29e+00

Pos logit contributions
 plural+0.901
unct+0.688
hin+0.684
 spoiled+0.662
 hinge+0.650
Neg logit contributions
jac-1.493
arc-1.486
Birth-1.485
acion-1.459
ivia-1.437
Tokens
Token ID82
Feature activation-9.24e-01

Pos logit contributions
 plural+4.033
unct+3.222
hin+3.192
 spoiled+3.135
 hinge+3.093
Neg logit contributions
Birth-4.104
jac-4.066
arc-3.972
acion-3.857
ivia-3.845
Token.
Token ID13
Feature activation-9.66e-02

Pos logit contributions
jac+0.643
arc+0.643
Birth+0.638
acion+0.616
ivia+0.611
Neg logit contributions
 plural-0.690
unct-0.575
hin-0.574
 spoiled-0.559
 hinge-0.554
Token
Token ID198
Feature activation+1.36e+00

Pos logit contributions
jac+0.066
arc+0.066
Birth+0.066
acion+0.063
ivia+0.063
Neg logit contributions
 plural-0.073
hin-0.061
unct-0.061
 spoiled-0.059
 hinge-0.059
Token
Token ID198
Feature activation+1.06e+00

Pos logit contributions
 plural+1.051
hin+0.881
unct+0.878
 spoiled+0.852
 hinge+0.846
Neg logit contributions
jac-0.917
arc-0.913
Birth-0.907
acion-0.877
ivia-0.871
TokenAll
Token ID3237
Feature activation+3.08e+00

Pos logit contributions
 plural+0.772
hin+0.646
unct+0.643
 spoiled+0.616
 hinge+0.610
Neg logit contributions
jac-0.751
arc-0.745
Birth-0.735
acion-0.722
ivia-0.720
Token told
Token ID1297
Feature activation-9.15e+00

Pos logit contributions
 plural+2.106
hin+1.700
unct+1.697
 spoiled+1.662
 hinge+1.649
Neg logit contributions
jac-2.364
Birth-2.342
arc-2.328
acion-2.246
ivia-2.244
Token,
Token ID11
Feature activation+5.90e-01

Pos logit contributions
Birth+5.306
arc+5.250
 Spending+5.092
jac+5.017
acion+5.001
Neg logit contributions
 plural-7.222
unct-6.267
hin-6.230
zbollah-6.210
 hinge-6.162
Token Trump
Token ID1301
Feature activation+1.72e+00

Pos logit contributions
 plural+0.357
hin+0.281
unct+0.280
 spoiled+0.271
 hinge+0.268
Neg logit contributions
jac-0.499
arc-0.496
Birth-0.494
acion-0.480
ivia-0.477
Token
Token ID447
Feature activation-5.32e-01

Pos logit contributions
 plural+1.232
hin+1.010
unct+1.002
 spoiled+0.985
 hinge+0.967
Neg logit contributions
jac-1.265
arc-1.253
Birth-1.251
acion-1.194
ivia-1.193
Token
Token ID247
Feature activation-1.50e+00

Pos logit contributions
arc+0.289
jac+0.289
Birth+0.288
acion+0.275
ivia+0.271
Neg logit contributions
 plural-0.479
unct-0.412
hin-0.412
 spoiled-0.403
 hinge-0.400
Tokens
Token ID82
Feature activation-8.22e-02

Pos logit contributions
jac+1.086
arc+1.085
Birth+1.074
acion+1.034
ivia+1.031
Neg logit contributions
 plural-1.087
unct-0.898
hin-0.891
 spoiled-0.871
 hinge-0.870
Token idea
Token ID2126
Feature activation-1.48e+00

Pos logit contributions
jac+0.114
arc+0.113
Birth+0.113
acion+0.109
ivia+0.108
Neg logit contributions
 plural-0.107
hin-0.088
unct-0.088
 spoiled-0.085
 hinge-0.084
Token here
Token ID994
Feature activation+7.90e-01

Pos logit contributions
arc+1.164
jac+1.164
Birth+1.155
acion+1.119
ivia+1.108
Neg logit contributions
 plural-0.964
hin-0.781
unct-0.780
 spoiled-0.749
 hinge-0.743
Token is
Token ID318
Feature activation+2.87e-01

Pos logit contributions
 plural+0.527
unct+0.426
hin+0.425
 spoiled+0.414
 hinge+0.413
Neg logit contributions
jac-0.617
Birth-0.612
arc-0.612
acion-0.594
ivia-0.588
Token that
Token ID326
Feature activation-3.60e+00

Pos logit contributions
 plural+0.199
hin+0.161
unct+0.161
 spoiled+0.157
 hinge+0.156
Neg logit contributions
jac-0.218
arc-0.217
Birth-0.216
acion-0.209
ivia-0.208
Token any
Token ID597
Feature activation-2.49e+00

Pos logit contributions
jac+2.503
arc+2.494
Birth+2.472
acion+2.376
 Spending+2.376
Neg logit contributions
 plural-2.636
unct-2.228
hin-2.225
 spoiled-2.141
 hinge-2.121
Token distribution
Token ID6082
Feature activation-9.11e+00

Pos logit contributions
jac+1.803
arc+1.797
Birth+1.786
acion+1.724
ivia+1.708
Neg logit contributions
 plural-1.782
unct-1.479
hin-1.478
 spoiled-1.427
 hinge-1.416
Token which
Token ID543
Feature activation-2.04e-01

Pos logit contributions
arc+5.801
jac+5.762
Birth+5.690
 Spending+5.518
ivia+5.430
Neg logit contributions
 plural-6.616
unct-5.890
hin-5.760
zbollah-5.667
 spoiled-5.356
Token provides
Token ID3769
Feature activation+1.45e+00

Pos logit contributions
jac+0.151
arc+0.151
Birth+0.150
acion+0.145
ivia+0.144
Neg logit contributions
 plural-0.145
hin-0.119
unct-0.119
 spoiled-0.115
 hinge-0.114
Token the
Token ID262
Feature activation+1.23e-01

Pos logit contributions
 plural+1.014
hin+0.819
unct+0.815
 spoiled+0.798
 hinge+0.795
Neg logit contributions
jac-1.089
arc-1.087
Birth-1.086
acion-1.046
ivia-1.044
Token Snap
Token ID16026
Feature activation+6.66e-01

Pos logit contributions
 plural+0.087
hin+0.071
unct+0.071
 spoiled+0.069
 hinge+0.069
Neg logit contributions
jac-0.091
arc-0.091
Birth-0.091
acion-0.088
ivia-0.087
Token framework
Token ID9355
Feature activation-1.08e+00

Pos logit contributions
 plural+0.474
hin+0.387
unct+0.385
 spoiled+0.376
 hinge+0.376
Neg logit contributions
Birth-0.489
jac-0.489
arc-0.485
acion-0.471
ivia-0.468
Token International
Token ID4037
Feature activation-2.01e+00

Pos logit contributions
 plural+0.702
hin+0.582
unct+0.580
 spoiled+0.572
 hinge+0.566
Neg logit contributions
jac-0.597
arc-0.593
Birth-0.591
acion-0.566
ivia-0.563
Token Monetary
Token ID34112
Feature activation-3.66e+00

Pos logit contributions
arc+1.246
jac+1.243
Birth+1.243
acion+1.192
 Spending+1.177
Neg logit contributions
 plural-1.650
unct-1.404
hin-1.397
 spoiled-1.364
 hinge-1.358
Token Fund
Token ID7557
Feature activation-7.31e+00

Pos logit contributions
Birth+1.637
arc+1.636
jac+1.612
acion+1.538
 Spending+1.525
Neg logit contributions
 plural-3.641
unct-3.182
hin-3.169
 spoiled-3.109
 hinge-3.089
Token chief
Token ID4039
Feature activation-5.10e-01

Pos logit contributions
jac+5.147
Birth+5.093
arc+5.053
 Spending+5.049
acion+4.957
Neg logit contributions
 plural-4.970
hin-4.246
unct-4.217
zbollah-3.999
 spoiled-3.995
Token Domin
Token ID11817
Feature activation-2.76e+00

Pos logit contributions
jac+0.270
arc+0.269
Birth+0.268
acion+0.255
ivia+0.252
Neg logit contributions
 plural-0.467
hin-0.402
unct-0.402
 spoiled-0.393
 hinge-0.391
Tokenique
Token ID2350
Feature activation-9.07e+00

Pos logit contributions
arc+1.507
jac+1.500
Birth+1.462
acion+1.424
ivia+1.401
Neg logit contributions
 plural-2.500
unct-2.142
hin-2.137
 hinge-2.101
 spoiled-2.100
Token Strauss
Token ID42971
Feature activation-1.84e+00

Pos logit contributions
jac+5.072
Birth+5.047
arc+5.040
 Spending+4.796
acion+4.695
Neg logit contributions
 plural-7.702
unct-6.747
hin-6.654
 spoiled-6.424
 hinge-6.378
Token-
Token ID12
Feature activation-4.37e+00

Pos logit contributions
arc+1.316
jac+1.307
Birth+1.290
 Spending+1.251
acion+1.220
Neg logit contributions
 plural-1.332
unct-1.125
hin-1.106
 hinge-1.074
 spoiled-1.063
TokenK
Token ID42
Feature activation-2.04e+00

Pos logit contributions
jac+3.492
Birth+3.458
arc+3.415
ivia+3.280
acion+3.266
Neg logit contributions
 plural-2.853
unct-2.340
hin-2.315
 hinge-2.229
 spoiled-2.208
Tokenahn
Token ID15386
Feature activation-1.93e+00

Pos logit contributions
jac+1.688
arc+1.688
ivia+1.602
Birth+1.597
ivo+1.537
Neg logit contributions
 plural-1.342
unct-1.056
 hinge-1.040
 spoiled-1.033
hin-1.017
Token.
Token ID13
Feature activation-1.41e-02

Pos logit contributions
jac+1.364
arc+1.359
Birth+1.351
acion+1.303
ivia+1.292
Neg logit contributions
 plural-1.425
unct-1.184
hin-1.181
 spoiled-1.146
 hinge-1.140
Token
Token ID247
Feature activation-1.13e+00

Pos logit contributions
jac+0.361
arc+0.359
Birth+0.356
acion+0.341
ivia+0.338
Neg logit contributions
 plural-0.542
hin-0.465
unct-0.464
 spoiled-0.453
 hinge-0.451
Tokens
Token ID82
Feature activation-7.66e-01

Pos logit contributions
jac+0.853
arc+0.851
Birth+0.848
acion+0.819
ivia+0.812
Neg logit contributions
 plural-0.787
hin-0.644
unct-0.643
 spoiled-0.624
 hinge-0.619
Token attorney
Token ID6136
Feature activation-1.56e+00

Pos logit contributions
jac+0.555
arc+0.553
Birth+0.550
acion+0.531
ivia+0.527
Neg logit contributions
 plural-0.556
hin-0.457
unct-0.456
 spoiled-0.445
 hinge-0.441
Token,
Token ID11
Feature activation+2.02e-01

Pos logit contributions
jac+1.095
arc+1.093
Birth+1.087
acion+1.049
ivia+1.039
Neg logit contributions
 plural-1.162
hin-0.967
unct-0.966
 spoiled-0.938
 hinge-0.931
Token Joey
Token ID26154
Feature activation+5.78e-02

Pos logit contributions
 plural+0.146
hin+0.120
unct+0.120
 spoiled+0.116
 hinge+0.115
Neg logit contributions
jac-0.147
arc-0.147
Birth-0.146
acion-0.141
ivia-0.140
Token McC
Token ID5108
Feature activation-9.07e+00

Pos logit contributions
 plural+0.043
hin+0.035
unct+0.035
 spoiled+0.034
 hinge+0.034
Neg logit contributions
jac-0.041
arc-0.041
Birth-0.041
acion-0.039
ivia-0.039
Tokenall
Token ID439
Feature activation-6.32e-02

Pos logit contributions
arc+6.121
jac+5.959
ivia+5.754
Birth+5.743
acion+5.734
Neg logit contributions
 plural-7.050
 hinge-5.727
 spoiled-5.694
hin-5.535
unct-5.484
Token,
Token ID11
Feature activation-4.50e-02

Pos logit contributions
jac+0.047
arc+0.047
Birth+0.047
acion+0.045
ivia+0.045
Neg logit contributions
 plural-0.044
hin-0.036
unct-0.036
 spoiled-0.035
 hinge-0.035
Token told
Token ID1297
Feature activation-4.04e+00

Pos logit contributions
jac+0.031
Birth+0.031
arc+0.031
acion+0.029
ivia+0.029
Neg logit contributions
 plural-0.034
hin-0.029
unct-0.029
 spoiled-0.028
 hinge-0.028
Token WIN
Token ID25779
Feature activation+3.70e-01

Pos logit contributions
jac+2.792
arc+2.784
Birth+2.766
acion+2.678
 Spending+2.641
Neg logit contributions
 plural-2.987
unct-2.515
hin-2.488
 spoiled-2.427
 hinge-2.411
TokenZ
Token ID57
Feature activation+1.85e-01

Pos logit contributions
 plural+0.251
hin+0.205
unct+0.204
 spoiled+0.198
 hinge+0.196
Neg logit contributions
jac-0.284
arc-0.284
Birth-0.282
acion-0.273
ivia-0.271
TokenBack
Token ID7282
Feature activation+1.04e+00

Pos logit contributions
jac+0.593
arc+0.591
Birth+0.586
acion+0.567
ivia+0.563
Neg logit contributions
 plural-0.706
hin-0.596
unct-0.594
 spoiled-0.576
 hinge-0.574
Token in
Token ID287
Feature activation-6.55e-01

Pos logit contributions
 plural+0.765
unct+0.633
hin+0.632
 hinge+0.615
 spoiled+0.613
Neg logit contributions
Birth-0.734
jac-0.732
arc-0.728
acion-0.702
ivia-0.702
Token the
Token ID262
Feature activation+1.13e+00

Pos logit contributions
jac+0.488
arc+0.487
Birth+0.485
acion+0.469
ivia+0.465
Neg logit contributions
 plural-0.459
hin-0.376
unct-0.376
 spoiled-0.365
 hinge-0.362
Token Days
Token ID12579
Feature activation+2.41e+00

Pos logit contributions
 plural+0.854
hin+0.708
unct+0.702
 spoiled+0.692
 hinge+0.689
Neg logit contributions
jac-0.783
arc-0.777
Birth-0.772
acion-0.748
ivia-0.741
Token Before
Token ID7413
Feature activation+1.34e+00

Pos logit contributions
 plural+1.716
hin+1.435
unct+1.405
 spoiled+1.373
 hinge+1.359
Neg logit contributions
jac-1.747
arc-1.745
Birth-1.732
acion-1.674
ivia-1.667
Token Positive
Token ID33733
Feature activation-9.01e+00

Pos logit contributions
 plural+1.009
hin+0.846
unct+0.833
 spoiled+0.816
 hinge+0.809
Neg logit contributions
arc-0.923
jac-0.922
Birth-0.916
acion-0.885
ivia-0.884
Token Rein
Token ID22299
Feature activation-2.50e+00

Pos logit contributions
jac+5.746
Birth+5.678
arc+5.599
 Spending+5.551
acion+5.437
Neg logit contributions
 plural-6.928
unct-5.978
hin-5.906
 spoiled-5.793
 hinge-5.652
Tokenforcement
Token ID13442
Feature activation-1.04e+00

Pos logit contributions
jac+1.676
arc+1.669
Birth+1.662
acion+1.598
ivia+1.583
Neg logit contributions
 plural-1.940
unct-1.626
hin-1.625
 spoiled-1.583
 hinge-1.571
Token,
Token ID11
Feature activation-1.25e+00

Pos logit contributions
jac+0.693
arc+0.691
Birth+0.688
acion+0.662
ivia+0.656
Neg logit contributions
 plural-0.810
hin-0.678
unct-0.677
 spoiled-0.661
 hinge-0.657
Token you
Token ID345
Feature activation+1.15e-01

Pos logit contributions
jac+0.893
arc+0.890
Birth+0.887
acion+0.855
ivia+0.848
Neg logit contributions
 plural-0.910
unct-0.754
hin-0.753
 spoiled-0.731
 hinge-0.726
Token would
Token ID561
Feature activation-2.10e+00

Pos logit contributions
 plural+0.079
hin+0.064
unct+0.063
 spoiled+0.063
 hinge+0.062
Neg logit contributions
jac-0.087
arc-0.087
Birth-0.087
acion-0.084
ivia-0.083
Token made
Token ID925
Feature activation-3.70e+00

Pos logit contributions
 plural+0.525
hin+0.424
unct+0.424
 spoiled+0.417
 hinge+0.410
Neg logit contributions
jac-0.600
Birth-0.593
arc-0.592
acion-0.574
ivia-0.568
Token easy
Token ID2562
Feature activation+2.45e+00

Pos logit contributions
arc+2.541
jac+2.529
Birth+2.516
acion+2.416
ivia+2.392
Neg logit contributions
 plural-2.771
hin-2.331
unct-2.323
 spoiled-2.242
 hinge-2.239
Token in
Token ID287
Feature activation+1.22e-01

Pos logit contributions
 plural+1.733
hin+1.408
unct+1.396
 spoiled+1.380
 hinge+1.362
Neg logit contributions
jac-1.825
Birth-1.814
arc-1.808
acion-1.761
ivia-1.734
Token some
Token ID617
Feature activation-1.15e+00

Pos logit contributions
 plural+0.094
hin+0.078
unct+0.078
 spoiled+0.077
 hinge+0.076
Neg logit contributions
jac-0.083
arc-0.082
Birth-0.082
acion-0.080
ivia-0.079
Token other
Token ID584
Feature activation-3.94e+00

Pos logit contributions
jac+0.765
arc+0.759
Birth+0.758
acion+0.729
ivia+0.722
Neg logit contributions
 plural-0.899
hin-0.751
unct-0.749
 spoiled-0.735
 hinge-0.728
Token dist
Token ID1233
Feature activation-8.96e+00

Pos logit contributions
arc+2.632
jac+2.607
Birth+2.596
 Spending+2.490
acion+2.489
Neg logit contributions
 plural-3.011
unct-2.547
hin-2.535
 hinge-2.436
 spoiled-2.433
Tokenros
Token ID4951
Feature activation-5.16e+00

Pos logit contributions
arc+5.319
jac+5.066
Birth+4.835
acion+4.797
ivo+4.788
Neg logit contributions
 plural-7.685
hin-6.411
unct-6.360
 hinge-6.301
 spoiled-6.261
Token.
Token ID13
Feature activation-2.50e-01

Pos logit contributions
arc+3.425
jac+3.389
Birth+3.356
ivia+3.225
 Spending+3.207
Neg logit contributions
 plural-3.954
unct-3.352
hin-3.307
 hinge-3.198
 spoiled-3.183
Token
Token ID198
Feature activation+1.85e+00

Pos logit contributions
jac+0.169
arc+0.169
Birth+0.167
acion+0.162
ivia+0.161
Neg logit contributions
 plural-0.193
hin-0.162
unct-0.161
 spoiled-0.157
 hinge-0.156
Token
Token ID198
Feature activation+1.33e+00

Pos logit contributions
 plural+1.399
hin+1.170
unct+1.164
 spoiled+1.128
 hinge+1.120
Neg logit contributions
jac-1.263
arc-1.258
Birth-1.247
acion-1.213
ivia-1.205
TokenIn
Token ID818
Feature activation+3.15e-01

Pos logit contributions
 plural+0.974
hin+0.824
unct+0.820
 spoiled+0.787
 hinge+0.779
Neg logit contributions
jac-0.924
arc-0.921
Birth-0.906
acion-0.898
ivia-0.888
Token experiment
Token ID6306
Feature activation+7.04e-01

Pos logit contributions
 plural+3.243
 hinge+2.604
hin+2.602
 spoiled+2.580
unct+2.578
Neg logit contributions
jac-3.317
Birth-3.272
arc-3.271
ivia-3.166
acion-3.137
Token,
Token ID11
Feature activation+3.60e-01

Pos logit contributions
 plural+0.525
unct+0.433
hin+0.431
 spoiled+0.423
 hinge+0.421
Neg logit contributions
jac-0.497
arc-0.492
Birth-0.492
acion-0.472
ivia-0.469
Token 1
Token ID352
Feature activation-3.21e-01

Pos logit contributions
 plural+0.260
hin+0.210
unct+0.210
 spoiled+0.208
 hinge+0.206
Neg logit contributions
jac-0.264
arc-0.261
Birth-0.261
acion-0.252
ivia-0.251
Token,
Token ID11
Feature activation+2.10e+00

Pos logit contributions
jac+0.223
arc+0.221
Birth+0.221
acion+0.213
ivia+0.211
Neg logit contributions
 plural-0.243
unct-0.201
hin-0.201
 spoiled-0.197
 hinge-0.195
Token000
Token ID830
Feature activation+6.18e-02

Pos logit contributions
 plural+1.422
hin+1.151
unct+1.149
 spoiled+1.120
 hinge+1.114
Neg logit contributions
jac-1.619
Birth-1.601
arc-1.600
acion-1.542
ivia-1.539
Token low
Token ID1877
Feature activation-8.92e+00

Pos logit contributions
 plural+0.048
hin+0.040
unct+0.040
 spoiled+0.039
 hinge+0.039
Neg logit contributions
jac-0.042
arc-0.041
Birth-0.041
acion-0.040
ivia-0.039
Token-
Token ID12
Feature activation-4.33e+00

Pos logit contributions
Birth+5.569
arc+5.519
jac+5.472
ivia+5.313
acion+5.277
Neg logit contributions
 plural-6.814
hin-5.985
unct-5.926
zbollah-5.647
 hinge-5.491
Tokenincome
Token ID12519
Feature activation-3.49e+00

Pos logit contributions
arc+2.424
Birth+2.415
jac+2.414
acion+2.254
ivia+2.252
Neg logit contributions
 plural-3.828
hin-3.283
unct-3.277
 hinge-3.189
 spoiled-3.173
Token families
Token ID4172
Feature activation-2.41e+00

Pos logit contributions
arc+2.199
jac+2.173
Birth+2.165
acion+2.098
 Spending+2.047
Neg logit contributions
 plural-2.773
unct-2.413
hin-2.402
zbollah-2.268
 hinge-2.262
Token will
Token ID481
Feature activation+4.50e-03

Pos logit contributions
jac+1.767
arc+1.764
Birth+1.748
acion+1.700
ivia+1.682
Neg logit contributions
 plural-1.689
unct-1.408
hin-1.399
 spoiled-1.337
 hinge-1.329
Token be
Token ID307
Feature activation-7.46e-02

Pos logit contributions
 plural+0.003
hin+0.002
unct+0.002
 hinge+0.002
 spoiled+0.002
Neg logit contributions
jac-0.004
arc-0.004
Birth-0.004
acion-0.003
ivia-0.003
Token death
Token ID1918
Feature activation+1.84e+00

Pos logit contributions
jac+1.049
arc+1.045
Birth+1.040
acion+1.004
ivia+0.996
Neg logit contributions
 plural-1.059
hin-0.874
unct-0.874
 spoiled-0.850
 hinge-0.843
Token of
Token ID286
Feature activation-7.15e-01

Pos logit contributions
 plural+1.264
hin+1.031
unct+1.027
 spoiled+1.010
 hinge+0.996
Neg logit contributions
jac-1.401
arc-1.387
Birth-1.384
acion-1.335
ivia-1.335
Token his
Token ID465
Feature activation+1.83e+00

Pos logit contributions
jac+0.510
arc+0.509
Birth+0.506
acion+0.489
ivia+0.485
Neg logit contributions
 plural-0.524
hin-0.434
unct-0.433
 spoiled-0.422
 hinge-0.418
Token young
Token ID1862
Feature activation-9.79e-01

Pos logit contributions
 plural+1.265
hin+1.013
 spoiled+1.003
unct+1.000
 hinge+0.983
Neg logit contributions
jac-1.411
arc-1.398
Birth-1.385
acion-1.348
ivia-1.342
Token daughter
Token ID4957
Feature activation-1.77e+00

Pos logit contributions
jac+0.760
arc+0.758
Birth+0.755
acion+0.730
ivia+0.724
Neg logit contributions
 plural-0.654
hin-0.531
unct-0.531
 spoiled-0.514
 hinge-0.510
Token Olivia
Token ID33911
Feature activation-8.72e+00

Pos logit contributions
jac+1.283
arc+1.281
Birth+1.275
acion+1.230
ivia+1.220
Neg logit contributions
 plural-1.269
unct-1.053
hin-1.047
 hinge-1.008
 spoiled-1.005
Token and
Token ID290
Feature activation-1.71e+00

Pos logit contributions
jac+6.075
arc+6.054
Birth+6.002
ivia+5.811
acion+5.774
Neg logit contributions
 plural-5.997
unct-5.082
hin-4.943
zbollah-4.767
 hinge-4.749
Token the
Token ID262
Feature activation-1.27e+00

Pos logit contributions
jac+1.224
arc+1.223
Birth+1.222
acion+1.175
ivia+1.163
Neg logit contributions
 plural-1.236
unct-1.026
hin-1.025
 spoiled-0.989
 hinge-0.987
Token struggles
Token ID12766
Feature activation-3.30e+00

Pos logit contributions
jac+0.855
arc+0.853
Birth+0.850
acion+0.817
ivia+0.809
Neg logit contributions
 plural-0.979
hin-0.821
unct-0.821
 spoiled-0.796
 hinge-0.791
Token of
Token ID286
Feature activation-2.53e+00

Pos logit contributions
jac+2.309
arc+2.306
Birth+2.303
acion+2.224
ivia+2.188
Neg logit contributions
 plural-2.422
unct-2.016
hin-1.993
 hinge-1.935
 spoiled-1.925
Token his
Token ID465
Feature activation+7.46e-01

Pos logit contributions
jac+1.766
arc+1.761
Birth+1.757
acion+1.687
ivia+1.673
Neg logit contributions
 plural-1.878
unct-1.568
hin-1.567
 spoiled-1.515
 hinge-1.512
Token
Token ID247
Feature activation+1.15e-01

Pos logit contributions
 plural+0.060
unct+0.052
hin+0.052
 spoiled+0.050
 hinge+0.050
Neg logit contributions
jac-0.041
arc-0.040
Birth-0.040
acion-0.038
ivia-0.038
Token apartment
Token ID7962
Feature activation-9.29e-01

Pos logit contributions
 plural+0.079
hin+0.065
unct+0.064
 spoiled+0.062
 hinge+0.061
Neg logit contributions
jac-0.088
Birth-0.087
arc-0.087
acion-0.084
ivia-0.083
Token,
Token ID11
Feature activation-1.95e+00

Pos logit contributions
jac+0.656
arc+0.654
Birth+0.651
acion+0.627
ivia+0.622
Neg logit contributions
 plural-0.687
hin-0.570
unct-0.570
 spoiled-0.553
 hinge-0.549
Token and
Token ID290
Feature activation-1.41e+00

Pos logit contributions
jac+1.382
arc+1.378
Birth+1.372
acion+1.322
ivia+1.311
Neg logit contributions
 plural-1.432
unct-1.188
hin-1.188
 spoiled-1.152
 hinge-1.145
Token jumping
Token ID14284
Feature activation-3.95e+00

Pos logit contributions
jac+1.032
arc+1.026
Birth+1.023
acion+0.988
ivia+0.980
Neg logit contributions
 plural-1.010
hin-0.832
unct-0.829
 spoiled-0.809
 hinge-0.800
Token out
Token ID503
Feature activation-8.69e+00

Pos logit contributions
jac+2.652
arc+2.647
Birth+2.626
acion+2.525
ivia+2.515
Neg logit contributions
 plural-3.015
unct-2.542
hin-2.523
 hinge-2.440
 spoiled-2.430
Token on
Token ID319
Feature activation+3.86e-02

Pos logit contributions
arc+5.595
jac+5.560
Birth+5.506
acion+5.412
ivia+5.277
Neg logit contributions
 plural-6.465
unct-5.550
hin-5.395
zbollah-5.257
 hinge-5.211
Token top
Token ID1353
Feature activation-8.72e-01

Pos logit contributions
 plural+0.028
hin+0.023
unct+0.023
 spoiled+0.022
 hinge+0.022
Neg logit contributions
jac-0.028
arc-0.028
Birth-0.028
acion-0.027
ivia-0.027
Token of
Token ID286
Feature activation-4.25e+00

Pos logit contributions
jac+0.645
arc+0.643
Birth+0.640
acion+0.619
ivia+0.613
Neg logit contributions
 plural-0.616
unct-0.506
hin-0.506
 spoiled-0.490
 hinge-0.487
Token Ann
Token ID5506
Feature activation-2.99e+00

Pos logit contributions
jac+2.974
arc+2.973
Birth+2.931
acion+2.833
ivia+2.812
Neg logit contributions
 plural-3.102
unct-2.613
hin-2.596
 hinge-2.503
 spoiled-2.493
Tokenal
Token ID282
Feature activation-1.52e+00

Pos logit contributions
arc+1.754
jac+1.750
Birth+1.722
acion+1.666
ivia+1.642
Neg logit contributions
 plural-2.573
unct-2.196
hin-2.190
 spoiled-2.154
 hinge-2.141
Token.
Token ID13
Feature activation-5.42e-01

Pos logit contributions
 plural+1.871
hin+1.515
unct+1.501
 hinge+1.495
 spoiled+1.490
Neg logit contributions
jac-1.906
arc-1.890
Birth-1.889
acion-1.830
ivia-1.795
Token
Token ID198
Feature activation+2.64e-01

Pos logit contributions
jac+0.362
arc+0.361
Birth+0.359
acion+0.347
ivia+0.343
Neg logit contributions
 plural-0.422
hin-0.353
unct-0.352
 spoiled-0.342
 hinge-0.340
Token
Token ID198
Feature activation+1.51e-01

Pos logit contributions
 plural+0.211
unct+0.178
hin+0.178
 spoiled+0.173
 hinge+0.172
Neg logit contributions
jac-0.171
arc-0.171
Birth-0.170
acion-0.163
ivia-0.161
TokenEnd
Token ID12915
Feature activation-1.25e+00

Pos logit contributions
 plural+0.115
hin+0.096
unct+0.096
 spoiled+0.093
 hinge+0.092
Neg logit contributions
jac-0.103
arc-0.103
Birth-0.102
acion-0.099
ivia-0.098
Tokening
Token ID278
Feature activation-1.56e+00

Pos logit contributions
jac+0.827
arc+0.825
Birth+0.821
acion+0.790
ivia+0.782
Neg logit contributions
 plural-0.975
hin-0.820
unct-0.818
 spoiled-0.796
 hinge-0.792
Token negative
Token ID4633
Feature activation-8.67e+00

Pos logit contributions
jac+1.102
arc+1.099
Birth+1.095
acion+1.053
ivia+1.048
Neg logit contributions
 plural-1.140
unct-0.952
hin-0.950
 spoiled-0.918
 hinge-0.913
Token gearing
Token ID40592
Feature activation-3.37e+00

Pos logit contributions
jac+5.873
arc+5.835
Birth+5.816
 Spending+5.720
ivia+5.624
Neg logit contributions
 plural-5.973
unct-5.308
hin-4.982
 spoiled-4.932
zbollah-4.875
Token won
Token ID1839
Feature activation-5.12e-01

Pos logit contributions
jac+2.352
arc+2.349
Birth+2.344
 Spending+2.255
ivia+2.241
Neg logit contributions
 plural-2.442
unct-2.068
hin-2.058
 spoiled-1.967
 hinge-1.944
Token't
Token ID470
Feature activation-1.12e+00

Pos logit contributions
jac+0.411
arc+0.409
Birth+0.408
acion+0.396
ivia+0.392
Neg logit contributions
 plural-0.329
hin-0.265
unct-0.264
 spoiled-0.257
 hinge-0.254
Token matter
Token ID2300
Feature activation+1.48e+00

Pos logit contributions
jac+0.871
arc+0.869
Birth+0.865
acion+0.834
ivia+0.831
Neg logit contributions
 plural-0.745
unct-0.608
hin-0.605
 spoiled-0.585
 hinge-0.575
Token.
Token ID13
Feature activation-3.47e-01

Pos logit contributions
 plural+1.035
hin+0.850
unct+0.837
 spoiled+0.823
 hinge+0.818
Neg logit contributions
jac-1.124
arc-1.114
Birth-1.111
acion-1.076
ivia-1.064
Token
Token ID198
Feature activation+2.04e+00

Pos logit contributions
 plural+2.299
hin+1.905
unct+1.897
 spoiled+1.815
 Jin+1.793
Neg logit contributions
jac-2.279
arc-2.261
acion-2.216
Birth-2.212
ivia-2.201
TokenOver
Token ID5886
Feature activation+2.36e-02

Pos logit contributions
 plural+1.456
hin+1.233
unct+1.222
 spoiled+1.159
 hinge+1.146
Neg logit contributions
jac-1.443
arc-1.434
Birth-1.414
acion-1.413
ivia-1.396
Token on
Token ID319
Feature activation+1.21e+00

Pos logit contributions
 plural+0.016
hin+0.013
unct+0.013
 spoiled+0.013
 hinge+0.013
Neg logit contributions
jac-0.018
arc-0.018
Birth-0.018
acion-0.017
ivia-0.017
Token Reddit
Token ID10750
Feature activation+1.45e-01

Pos logit contributions
 plural+0.886
hin+0.731
unct+0.730
 spoiled+0.711
 hinge+0.704
Neg logit contributions
jac-0.866
arc-0.863
Birth-0.859
acion-0.833
ivia-0.824
Token Oculus
Token ID22334
Feature activation-5.72e+00

Pos logit contributions
 plural+0.105
hin+0.087
unct+0.087
 spoiled+0.084
 hinge+0.083
Neg logit contributions
jac-0.105
arc-0.104
Birth-0.104
acion-0.100
ivia-0.099
Token Rift
Token ID21222
Feature activation-8.60e+00

Pos logit contributions
arc+4.029
jac+4.025
Birth+4.013
acion+3.868
 Spending+3.844
Neg logit contributions
 plural-4.051
unct-3.428
hin-3.392
 hinge-3.254
 spoiled-3.242
Token creator
Token ID13172
Feature activation-5.02e+00

Pos logit contributions
Birth+5.841
arc+5.806
jac+5.754
 Spending+5.598
ivo+5.545
Neg logit contributions
 plural-6.075
unct-5.152
hin-5.096
zbollah-4.861
 hinge-4.828
Token Palmer
Token ID18918
Feature activation-2.81e+00

Pos logit contributions
jac+3.354
arc+3.343
Birth+3.285
 Spending+3.149
acion+3.127
Neg logit contributions
 plural-3.783
unct-3.225
hin-3.169
 spoiled-3.082
 hinge-3.069
Token Luc
Token ID7598
Feature activation-6.79e+00

Pos logit contributions
jac+1.688
arc+1.684
Birth+1.654
acion+1.576
 Spending+1.573
Neg logit contributions
 plural-2.335
unct-2.025
hin-2.018
 spoiled-1.938
 hinge-1.935
Tokenkey
Token ID2539
Feature activation-2.98e+00

Pos logit contributions
arc+3.232
jac+3.143
Birth+3.078
acion+2.977
ivo+2.939
Neg logit contributions
 plural-6.425
unct-5.747
hin-5.635
 spoiled-5.537
 hinge-5.521
Token has
Token ID468
Feature activation-2.56e+00

Pos logit contributions
arc+2.083
jac+2.082
Birth+2.065
acion+1.986
ivia+1.977
Neg logit contributions
 plural-2.189
unct-1.826
hin-1.822
 hinge-1.762
 spoiled-1.751
Token.
Token ID13
Feature activation-4.87e-01

Pos logit contributions
arc+1.476
jac+1.472
Birth+1.459
acion+1.405
ivia+1.386
Neg logit contributions
 plural-1.665
unct-1.393
hin-1.391
 spoiled-1.349
 hinge-1.336
Token
Token ID198
Feature activation+1.00e+00

Pos logit contributions
jac+0.332
arc+0.330
Birth+0.329
acion+0.318
ivia+0.317
Neg logit contributions
 plural-0.371
hin-0.311
unct-0.308
 spoiled-0.300
 hinge-0.297
Token
Token ID198
Feature activation+5.82e-01

Pos logit contributions
 plural+0.772
hin+0.647
unct+0.645
 spoiled+0.626
 hinge+0.622
Neg logit contributions
jac-0.674
arc-0.671
Birth-0.667
acion-0.645
ivia-0.641
TokenThe
Token ID464
Feature activation-3.08e-01

Pos logit contributions
 plural+0.437
hin+0.372
unct+0.368
 spoiled+0.353
 hinge+0.349
Neg logit contributions
jac-0.393
arc-0.391
Birth-0.386
ivia-0.381
acion-0.381
Token best
Token ID1266
Feature activation-3.15e+00

Pos logit contributions
jac+0.230
arc+0.229
Birth+0.229
acion+0.221
ivia+0.220
Neg logit contributions
 plural-0.217
hin-0.177
unct-0.175
 spoiled-0.172
 hinge-0.171
Token way
Token ID835
Feature activation-8.59e+00

Pos logit contributions
jac+1.986
arc+1.982
Birth+1.963
acion+1.893
ivia+1.872
Neg logit contributions
 plural-2.531
unct-2.167
hin-2.153
 spoiled-2.069
 hinge-2.067
Token to
Token ID284
Feature activation-4.17e+00

Pos logit contributions
arc+5.189
jac+5.082
Birth+4.909
acion+4.867
 Spending+4.783
Neg logit contributions
 plural-6.818
unct-5.970
hin-5.811
zbollah-5.694
 hinge-5.606
Token avoid
Token ID3368
Feature activation-2.53e+00

Pos logit contributions
arc+3.224
jac+3.216
Birth+3.167
ivia+3.083
 Spending+3.063
Neg logit contributions
 plural-2.687
unct-2.232
hin-2.198
zbollah-2.094
 spoiled-2.091
Token this
Token ID428
Feature activation-1.23e+00

Pos logit contributions
jac+1.821
arc+1.817
Birth+1.803
acion+1.742
ivia+1.725
Neg logit contributions
 plural-1.821
unct-1.512
hin-1.504
 spoiled-1.456
 hinge-1.450
Token type
Token ID2099
Feature activation-3.03e+00

Pos logit contributions
jac+0.832
arc+0.827
Birth+0.826
acion+0.795
ivia+0.788
Neg logit contributions
 plural-0.945
hin-0.790
unct-0.788
 spoiled-0.770
 hinge-0.764
Token of
Token ID286
Feature activation-1.95e+00

Pos logit contributions
arc+2.112
jac+2.094
Birth+2.071
acion+2.001
 Spending+1.986
Neg logit contributions
 plural-2.236
unct-1.892
hin-1.865
 spoiled-1.799
 hinge-1.790
Token who
Token ID508
Feature activation+1.26e+00

Pos logit contributions
jac+1.255
arc+1.251
Birth+1.245
acion+1.203
ivia+1.193
Neg logit contributions
 plural-1.188
hin-0.974
unct-0.973
 spoiled-0.945
 hinge-0.938
Token desire
Token ID6227
Feature activation+1.66e-01

Pos logit contributions
 plural+0.849
hin+0.684
unct+0.680
 hinge+0.664
 spoiled+0.663
Neg logit contributions
jac-0.977
arc-0.974
Birth-0.970
acion-0.940
ivia-0.931
Token physical
Token ID3518
Feature activation+2.13e+00

Pos logit contributions
 plural+0.121
hin+0.099
unct+0.097
 spoiled+0.096
 hinge+0.096
Neg logit contributions
jac-0.120
arc-0.120
Birth-0.119
acion-0.115
ivia-0.115
Token transition
Token ID6801
Feature activation-1.32e+00

Pos logit contributions
 plural+1.633
hin+1.303
 hinge+1.279
 spoiled+1.273
unct+1.256
Neg logit contributions
arc-1.515
jac-1.514
Birth-1.478
ivia-1.445
acion-1.444
Token ("
Token ID5855
Feature activation-1.51e+00

Pos logit contributions
jac+0.941
arc+0.937
Birth+0.932
acion+0.900
ivia+0.892
Neg logit contributions
 plural-0.972
hin-0.804
unct-0.799
 spoiled-0.779
 hinge-0.773
Tokentrans
Token ID7645
Feature activation-8.58e+00

Pos logit contributions
jac+1.030
arc+1.028
Birth+1.023
acion+0.986
ivia+0.978
Neg logit contributions
 plural-1.150
hin-0.961
unct-0.959
 spoiled-0.932
 hinge-0.926
Tokensexual
Token ID18338
Feature activation-7.34e+00

Pos logit contributions
Birth+4.666
arc+4.657
jac+4.630
acion+4.461
ivia+4.376
Neg logit contributions
 plural-7.336
hin-6.409
unct-6.383
 spoiled-6.275
 hinge-6.199
Token"
Token ID1
Feature activation-3.40e+00

Pos logit contributions
jac+4.656
arc+4.650
Birth+4.645
acion+4.418
ivia+4.403
Neg logit contributions
 plural-5.629
unct-4.934
hin-4.871
 spoiled-4.714
 hinge-4.639
Token people
Token ID661
Feature activation-2.41e+00

Pos logit contributions
arc+2.453
jac+2.442
Birth+2.437
acion+2.340
ivia+2.321
Neg logit contributions
 plural-2.399
unct-2.029
hin-2.019
 spoiled-1.946
 hinge-1.930
Token)
Token ID8
Feature activation-8.65e-01

Pos logit contributions
arc+1.752
jac+1.752
Birth+1.740
acion+1.674
ivia+1.666
Neg logit contributions
 plural-1.692
unct-1.419
hin-1.407
 spoiled-1.363
 hinge-1.348
Token and
Token ID290
Feature activation-8.97e-01

Pos logit contributions
jac+0.598
arc+0.593
Birth+0.592
acion+0.572
ivia+0.564
Neg logit contributions
 plural-0.659
hin-0.546
unct-0.541
 spoiled-0.531
 hinge-0.526
TokenDeb
Token ID16587
Feature activation-4.20e+00

Pos logit contributions
jac+3.768
Birth+3.730
arc+3.722
acion+3.443
 Spending+3.441
Neg logit contributions
 plural-4.862
unct-4.057
hin-4.047
 spoiled-4.035
 hinge-3.999
Tokenate
Token ID378
Feature activation-2.50e+00

Pos logit contributions
arc+2.387
jac+2.372
Birth+2.345
acion+2.273
ivia+2.246
Neg logit contributions
 plural-3.715
hin-3.135
unct-3.130
 spoiled-3.104
 hinge-3.088
Token2016
Token ID5304
Feature activation-1.68e+00

Pos logit contributions
jac+1.789
arc+1.786
Birth+1.783
acion+1.711
ivia+1.697
Neg logit contributions
 plural-1.820
hin-1.504
unct-1.501
 spoiled-1.457
 hinge-1.453
Token pic
Token ID8301
Feature activation-4.99e+00

Pos logit contributions
jac+1.009
arc+1.008
Birth+1.003
acion+0.959
ivia+0.948
Neg logit contributions
 plural-1.416
hin-1.205
unct-1.204
 spoiled-1.172
 hinge-1.167
Token.
Token ID13
Feature activation-1.47e+00

Pos logit contributions
jac+3.056
arc+3.053
Birth+3.039
acion+2.854
 Spending+2.850
Neg logit contributions
 plural-4.088
unct-3.475
zbollah-3.449
hin-3.437
 hinge-3.415
Tokentwitter
Token ID6956
Feature activation-8.55e+00

Pos logit contributions
jac+1.114
arc+1.112
Birth+1.104
acion+1.065
ivia+1.046
Neg logit contributions
 plural-1.032
unct-0.834
hin-0.833
 spoiled-0.822
 hinge-0.817
Token.
Token ID13
Feature activation-4.75e+00

Pos logit contributions
arc+4.644
jac+4.595
Birth+4.584
acion+4.193
 Spending+4.139
Neg logit contributions
 plural-7.372
 millenn-6.839
zbollah-6.506
 hinge-6.376
unct-6.305
Tokencom
Token ID785
Feature activation-4.37e+00

Pos logit contributions
arc+2.621
jac+2.612
Birth+2.570
acion+2.451
ivo+2.343
Neg logit contributions
 plural-4.273
 spoiled-3.641
 hinge-3.639
unct-3.603
zbollah-3.537
Token/
Token ID14
Feature activation-3.03e+00

Pos logit contributions
arc+2.488
jac+2.487
Birth+2.484
acion+2.336
 Spending+2.294
Neg logit contributions
 plural-3.764
unct-3.249
hin-3.196
zbollah-3.189
 hinge-3.180
TokenF
Token ID37
Feature activation+2.48e+00

Pos logit contributions
jac+2.142
arc+2.135
Birth+2.108
acion+2.033
ivia+2.016
Neg logit contributions
 plural-2.271
unct-1.866
hin-1.846
 spoiled-1.835
 hinge-1.819
Tokenc
Token ID66
Feature activation-1.30e+00

Pos logit contributions
 plural+1.832
hin+1.570
unct+1.552
 spoiled+1.483
 hinge+1.474
Neg logit contributions
Birth-1.694
jac-1.687
arc-1.680
 Spending-1.629
acion-1.623
Tokenoth
Token ID849
Feature activation-6.34e-01

Pos logit contributions
Birth+0.197
jac+0.197
arc+0.195
acion+0.189
 Spending+0.187
Neg logit contributions
 plural-0.225
hin-0.190
unct-0.189
 spoiled-0.183
 hinge-0.181
Token through
Token ID832
Feature activation-1.28e+00

Pos logit contributions
jac+0.447
Birth+0.447
arc+0.445
acion+0.431
ivia+0.424
Neg logit contributions
 plural-0.466
hin-0.392
unct-0.385
 spoiled-0.376
 hinge-0.372
Token the
Token ID262
Feature activation-8.16e-01

Pos logit contributions
jac+0.944
arc+0.942
Birth+0.938
acion+0.907
ivia+0.898
Neg logit contributions
 plural-0.913
hin-0.752
unct-0.749
 spoiled-0.728
 hinge-0.722
Token portal
Token ID17898
Feature activation+2.78e+00

Pos logit contributions
jac+0.610
arc+0.607
Birth+0.606
acion+0.586
ivia+0.581
Neg logit contributions
 plural-0.572
hin-0.468
unct-0.466
 spoiled-0.454
 hinge-0.451
Token,
Token ID11
Feature activation-1.73e+00

Pos logit contributions
 plural+2.009
hin+1.667
 spoiled+1.613
unct+1.610
 hinge+1.599
Neg logit contributions
jac-1.999
Birth-1.996
arc-1.984
acion-1.937
ivia-1.903
Token laying
Token ID16299
Feature activation-8.55e+00

Pos logit contributions
jac+1.196
arc+1.192
Birth+1.187
acion+1.144
ivia+1.132
Neg logit contributions
 plural-1.309
hin-1.090
unct-1.086
 spoiled-1.061
 hinge-1.051
Token waste
Token ID7030
Feature activation-1.50e+00

Pos logit contributions
jac+5.698
arc+5.678
Birth+5.659
 Spending+5.466
acion+5.388
Neg logit contributions
 plural-6.093
unct-5.332
hin-5.055
zbollah-4.962
 hinge-4.841
Token to
Token ID284
Feature activation+1.42e+00

Pos logit contributions
jac+1.003
arc+1.002
Birth+0.996
acion+0.959
ivia+0.950
Neg logit contributions
 plural-1.155
unct-0.979
hin-0.969
 hinge-0.943
 spoiled-0.940
Token everything
Token ID2279
Feature activation+2.42e+00

Pos logit contributions
 plural+1.027
hin+0.845
unct+0.832
 spoiled+0.819
 hinge+0.805
Neg logit contributions
jac-1.038
arc-1.033
Birth-1.031
acion-1.004
ivia-0.980
Token in
Token ID287
Feature activation-1.27e+00

Pos logit contributions
 plural+1.718
hin+1.400
unct+1.382
 spoiled+1.366
 hinge+1.343
Neg logit contributions
jac-1.795
arc-1.783
Birth-1.783
acion-1.742
ivia-1.694
Token sight
Token ID6504
Feature activation-5.18e+00

Pos logit contributions
jac+0.921
arc+0.918
Birth+0.914
acion+0.887
ivia+0.876
Neg logit contributions
 plural-0.918
hin-0.756
unct-0.756
 spoiled-0.733
 hinge-0.729
Token parties
Token ID4671
Feature activation+1.51e+00

Pos logit contributions
 plural+2.212
hin+1.792
unct+1.770
 spoiled+1.761
 hinge+1.735
Neg logit contributions
jac-2.265
Birth-2.255
arc-2.236
ivia-2.180
acion-2.178
Token.
Token ID13
Feature activation+1.26e-01

Pos logit contributions
 plural+1.064
hin+0.871
unct+0.863
 spoiled+0.847
 hinge+0.836
Neg logit contributions
jac-1.118
Birth-1.115
arc-1.112
ivia-1.073
acion-1.070
Token Si
Token ID15638
Feature activation-5.86e-01

Pos logit contributions
 plural+0.093
hin+0.077
unct+0.077
 spoiled+0.075
 hinge+0.074
Neg logit contributions
jac-0.089
arc-0.089
Birth-0.088
acion-0.085
ivia-0.085
Token le
Token ID443
Feature activation+2.45e+00

Pos logit contributions
jac+0.458
arc+0.456
Birth+0.455
acion+0.440
ivia+0.438
Neg logit contributions
 plural-0.390
hin-0.315
unct-0.315
 spoiled-0.306
 hinge-0.303
Token d
Token ID288
Feature activation+5.06e+00

Pos logit contributions
 plural+1.888
unct+1.564
hin+1.556
 spoiled+1.527
 hinge+1.500
Neg logit contributions
Birth-1.652
jac-1.651
arc-1.620
acion-1.592
ivia-1.587
Tokenés
Token ID20954
Feature activation+7.55e+00

Pos logit contributions
 plural+3.447
unct+2.920
hin+2.912
 spoiled+2.724
 hinge+2.660
Neg logit contributions
Birth-3.739
jac-3.664
 Spending-3.633
arc-3.611
acion-3.518
Tokenacc
Token ID4134
Feature activation+1.93e+00

Pos logit contributions
 plural+5.957
unct+5.229
hin+5.193
 spoiled+4.920
 hinge+4.860
Neg logit contributions
Birth-4.522
jac-4.392
 Spending-4.385
ivia-4.342
arc-4.228
Tokenord
Token ID585
Feature activation+8.92e-01

Pos logit contributions
 plural+1.534
unct+1.316
hin+1.314
 spoiled+1.259
 hinge+1.254
Neg logit contributions
Birth-1.241
jac-1.231
arc-1.212
 Spending-1.175
acion-1.162
Token persist
Token ID21160
Feature activation+2.51e+00

Pos logit contributions
 plural+0.622
hin+0.508
unct+0.507
 spoiled+0.493
 hinge+0.487
Neg logit contributions
jac-0.670
Birth-0.669
arc-0.666
ivia-0.640
acion-0.639
Tokene
Token ID68
Feature activation-9.22e-02

Pos logit contributions
 plural+1.954
unct+1.658
hin+1.655
 spoiled+1.584
 hinge+1.566
Neg logit contributions
Birth-1.658
jac-1.648
arc-1.638
 Spending-1.576
ivia-1.542
Token,
Token ID11
Feature activation+5.90e-01

Pos logit contributions
jac+0.066
arc+0.066
Birth+0.066
acion+0.064
ivia+0.063
Neg logit contributions
 plural-0.067
unct-0.055
hin-0.055
 spoiled-0.054
 hinge-0.053
Token 16
Token ID1467
Feature activation+1.00e+00

Pos logit contributions
 plural+2.343
unct+1.918
hin+1.874
 spoiled+1.825
 hinge+1.805
Neg logit contributions
jac-2.858
Birth-2.850
arc-2.832
acion-2.800
ivia-2.789
Token81
Token ID6659
Feature activation-8.13e-02

Pos logit contributions
 plural+0.669
unct+0.544
hin+0.544
 spoiled+0.526
 hinge+0.521
Neg logit contributions
jac-0.781
arc-0.778
Birth-0.776
acion-0.749
ivia-0.747
Token,
Token ID11
Feature activation+1.76e+00

Pos logit contributions
jac+0.057
arc+0.057
Birth+0.057
acion+0.055
ivia+0.055
Neg logit contributions
 plural-0.060
hin-0.050
unct-0.050
 spoiled-0.048
 hinge-0.048
Token is
Token ID318
Feature activation+4.30e-01

Pos logit contributions
 plural+1.186
unct+0.959
hin+0.953
 spoiled+0.936
 hinge+0.923
Neg logit contributions
jac-1.365
arc-1.352
Birth-1.351
acion-1.305
ivia-1.299
Token U
Token ID471
Feature activation+4.48e-01

Pos logit contributions
 plural+0.292
hin+0.238
unct+0.237
 spoiled+0.231
 hinge+0.229
Neg logit contributions
jac-0.331
arc-0.328
Birth-0.327
acion-0.318
ivia-0.315
Token.
Token ID13
Feature activation+5.37e+00

Pos logit contributions
 plural+0.325
hin+0.269
unct+0.269
 spoiled+0.261
 hinge+0.259
Neg logit contributions
jac-0.322
arc-0.320
Birth-0.319
acion-0.308
ivia-0.306
TokenS
Token ID50
Feature activation-1.45e+00

Pos logit contributions
 plural+3.574
hin+2.933
unct+2.886
 spoiled+2.787
 Jin+2.768
Neg logit contributions
jac-4.049
arc-4.047
Birth-3.951
acion-3.948
ivia-3.945
Token.
Token ID13
Feature activation-2.88e-01

Pos logit contributions
jac+1.011
arc+1.008
Birth+1.003
acion+0.964
ivia+0.955
Neg logit contributions
 plural-1.086
unct-0.903
hin-0.902
 spoiled-0.882
 hinge-0.876
Token Federal
Token ID5618
Feature activation-1.32e+00

Pos logit contributions
jac+0.185
arc+0.185
Birth+0.184
acion+0.177
ivia+0.176
Neg logit contributions
 plural-0.232
hin-0.196
unct-0.194
 spoiled-0.190
 hinge-0.188
Token Government
Token ID5070
Feature activation-8.38e-01

Pos logit contributions
jac+0.837
arc+0.834
Birth+0.830
acion+0.797
ivia+0.789
Neg logit contributions
 plural-1.075
hin-0.908
unct-0.908
 spoiled-0.885
 hinge-0.879
Token legislation
Token ID5520
Feature activation-2.75e+00

Pos logit contributions
jac+0.578
arc+0.576
Birth+0.574
acion+0.553
ivia+0.549
Neg logit contributions
 plural-0.635
hin-0.528
unct-0.528
 spoiled-0.514
 hinge-0.510
Token Frances
Token ID27025
Feature activation-4.38e+00

Pos logit contributions
jac+1.330
arc+1.329
Birth+1.318
acion+1.273
ivia+1.260
Neg logit contributions
 plural-1.471
hin-1.226
unct-1.226
 spoiled-1.194
 hinge-1.187
Tokencon
Token ID1102
Feature activation-9.16e-01

Pos logit contributions
jac+2.981
arc+2.977
Birth+2.918
acion+2.826
ivia+2.806
Neg logit contributions
 plural-3.346
unct-2.797
hin-2.743
 hinge-2.718
 spoiled-2.718
Token,
Token ID11
Feature activation+6.83e-01

Pos logit contributions
jac+0.634
Birth+0.632
arc+0.632
acion+0.607
ivia+0.601
Neg logit contributions
 plural-0.688
hin-0.574
unct-0.573
 spoiled-0.556
 hinge-0.552
Token a
Token ID257
Feature activation+1.75e+00

Pos logit contributions
 plural+0.472
hin+0.382
unct+0.382
 spoiled+0.372
 hinge+0.368
Neg logit contributions
jac-0.519
arc-0.516
Birth-0.515
acion-0.498
ivia-0.495
Token migrant
Token ID23088
Feature activation+2.29e+00

Pos logit contributions
 plural+1.227
unct+0.990
hin+0.984
 spoiled+0.975
 hinge+0.962
Neg logit contributions
jac-1.322
arc-1.321
Birth-1.317
acion-1.284
ivia-1.276
Token originally
Token ID6198
Feature activation+5.21e+00

Pos logit contributions
 plural+1.549
hin+1.247
unct+1.229
 spoiled+1.225
 hinge+1.204
Neg logit contributions
jac-1.791
arc-1.770
Birth-1.761
acion-1.713
ivia-1.705
Token from
Token ID422
Feature activation+1.22e+00

Pos logit contributions
 plural+3.292
hin+2.565
 spoiled+2.551
unct+2.551
 hinge+2.469
Neg logit contributions
jac-4.303
arc-4.267
Birth-4.248
ivia-4.096
acion-4.085
Token Italy
Token ID8031
Feature activation-2.57e-01

Pos logit contributions
 plural+0.922
hin+0.763
unct+0.762
 spoiled+0.746
 hinge+0.741
Neg logit contributions
jac-0.851
arc-0.846
Birth-0.844
acion-0.811
ivia-0.804
Token.
Token ID13
Feature activation+6.11e-01

Pos logit contributions
jac+0.187
arc+0.186
Birth+0.185
acion+0.179
ivia+0.177
Neg logit contributions
 plural-0.186
hin-0.153
unct-0.153
 spoiled-0.149
 hinge-0.148
Token Though
Token ID7486
Feature activation+1.46e-01

Pos logit contributions
 plural+0.450
unct+0.375
hin+0.374
 spoiled+0.358
 Jin+0.355
Neg logit contributions
jac-0.431
arc-0.429
Birth-0.424
ivia-0.413
acion-0.411
Token it
Token ID340
Feature activation+1.72e+00

Pos logit contributions
 plural+0.101
unct+0.082
hin+0.082
 spoiled+0.080
 hinge+0.079
Neg logit contributions
jac-0.111
arc-0.111
Birth-0.110
acion-0.106
ivia-0.106
Token mention
Token ID3068
Feature activation+4.69e+00

Pos logit contributions
 plural+3.299
 hinge+2.666
hin+2.635
unct+2.571
 spoiled+2.561
Neg logit contributions
Birth-3.909
jac-3.899
arc-3.832
acion-3.783
ivia-3.738
Token those
Token ID883
Feature activation+5.01e-01

Pos logit contributions
 plural+3.277
hin+2.576
 spoiled+2.562
unct+2.546
 hinge+2.518
Neg logit contributions
Birth-3.563
arc-3.559
jac-3.547
ivia-3.463
acion-3.425
Token tapes
Token ID23695
Feature activation-1.17e+00

Pos logit contributions
 plural+0.415
hin+0.351
unct+0.350
 spoiled+0.343
 hinge+0.340
Neg logit contributions
jac-0.311
arc-0.309
Birth-0.308
acion-0.296
ivia-0.293
Token or
Token ID393
Feature activation+7.67e-01

Pos logit contributions
arc+0.825
jac+0.824
Birth+0.819
acion+0.789
ivia+0.784
Neg logit contributions
 plural-0.855
unct-0.714
hin-0.710
 spoiled-0.687
 hinge-0.684
Token their
Token ID511
Feature activation+1.53e+00

Pos logit contributions
 plural+0.547
hin+0.448
unct+0.445
 spoiled+0.435
 hinge+0.432
Neg logit contributions
jac-0.565
arc-0.563
Birth-0.563
acion-0.542
ivia-0.540
Token destruction
Token ID8166
Feature activation+5.95e+00

Pos logit contributions
 plural+1.175
hin+0.961
unct+0.959
 spoiled+0.952
 hinge+0.940
Neg logit contributions
jac-1.053
Birth-1.045
arc-1.044
acion-1.003
ivia-0.998
Token.
Token ID13
Feature activation+1.71e+00

Pos logit contributions
 plural+3.947
hin+3.135
 spoiled+3.107
 hinge+3.085
unct+3.016
Neg logit contributions
jac-4.711
arc-4.606
Birth-4.594
acion-4.467
 Spending-4.422
Token<|endoftext|>
Token ID50256
Feature activation+6.75e-01

Pos logit contributions
 plural+1.274
hin+1.062
unct+1.054
 Jin+1.017
 spoiled+1.015
Neg logit contributions
jac-1.189
arc-1.183
Birth-1.178
ivia-1.148
acion-1.147
TokenNow
Token ID3844
Feature activation+5.29e+00

Pos logit contributions
 plural+0.515
hin+0.435
unct+0.434
 spoiled+0.418
 hinge+0.414
Neg logit contributions
jac-0.453
arc-0.451
Birth-0.447
acion-0.437
ivia-0.433
Token playing
Token ID2712
Feature activation-6.36e-01

Pos logit contributions
 plural+3.520
hin+2.912
unct+2.829
 spoiled+2.810
 hinge+2.770
Neg logit contributions
jac-4.065
arc-3.973
Birth-3.962
acion-3.847
ivia-3.815
Token:
Token ID25
Feature activation+9.45e-01

Pos logit contributions
jac+0.458
arc+0.455
Birth+0.453
 Spending+0.436
acion+0.436
Neg logit contributions
 plural-0.456
unct-0.380
hin-0.379
 hinge-0.368
 spoiled-0.365
Tokenocking
Token ID8629
Feature activation+6.64e-01

Pos logit contributions
jac+0.231
Birth+0.229
arc+0.229
acion+0.218
ivia+0.216
Neg logit contributions
 plural-0.319
unct-0.274
hin-0.273
 spoiled-0.266
 hinge-0.264
Token stability
Token ID10159
Feature activation+6.10e-01

Pos logit contributions
 plural+0.512
hin+0.426
unct+0.423
 spoiled+0.418
 hinge+0.416
Neg logit contributions
jac-0.451
arc-0.450
Birth-0.448
acion-0.431
ivia-0.425
Token.
Token ID13
Feature activation+6.49e-01

Pos logit contributions
 plural+0.441
hin+0.365
unct+0.361
 spoiled+0.356
 hinge+0.356
Neg logit contributions
jac-0.441
arc-0.438
Birth-0.438
acion-0.422
ivia-0.415
Token As
Token ID1081
Feature activation-1.26e-01

Pos logit contributions
 plural+0.503
hin+0.428
unct+0.425
 spoiled+0.414
 hinge+0.411
Neg logit contributions
jac-0.431
arc-0.428
Birth-0.424
acion-0.413
ivia-0.405
Token for
Token ID329
Feature activation+1.56e-01

Pos logit contributions
jac+0.098
Birth+0.097
arc+0.097
acion+0.094
ivia+0.093
Neg logit contributions
 plural-0.085
hin-0.069
unct-0.069
 spoiled-0.067
 hinge-0.066
Token thermal
Token ID18411
Feature activation+5.91e+00

Pos logit contributions
 plural+0.109
hin+0.089
unct+0.089
 spoiled+0.087
 hinge+0.086
Neg logit contributions
jac-0.118
arc-0.117
Birth-0.117
acion-0.113
ivia-0.111
Token solution
Token ID4610
Feature activation+1.69e+00

Pos logit contributions
 plural+3.804
 spoiled+3.082
 hinge+3.066
hin+3.036
unct+2.982
Neg logit contributions
jac-4.694
Birth-4.661
arc-4.614
acion-4.428
ivia-4.406
Token,
Token ID11
Feature activation+1.51e+00

Pos logit contributions
 plural+1.219
hin+1.008
unct+0.998
 spoiled+0.990
 hinge+0.986
Neg logit contributions
jac-1.215
Birth-1.215
arc-1.208
acion-1.158
ivia-1.147
Token MSI
Token ID30781
Feature activation+4.72e-01

Pos logit contributions
 plural+1.273
hin+1.088
unct+1.085
 spoiled+1.067
 hinge+1.061
Neg logit contributions
jac-0.911
Birth-0.906
arc-0.905
acion-0.867
ivia-0.849
Token R
Token ID371
Feature activation+1.76e+00

Pos logit contributions
 plural+0.333
hin+0.274
unct+0.273
 spoiled+0.267
 hinge+0.265
Neg logit contributions
jac-0.350
arc-0.349
Birth-0.348
acion-0.336
ivia-0.332
Token78
Token ID3695
Feature activation-5.92e-01

Pos logit contributions
 plural+1.314
unct+1.096
hin+1.095
 spoiled+1.065
 hinge+1.061
Neg logit contributions
jac-1.222
arc-1.221
Birth-1.220
acion-1.176
ivia-1.158
Token Haiti
Token ID25051
Feature activation-1.67e+00

Pos logit contributions
 plural+0.107
hin+0.088
unct+0.088
 spoiled+0.086
 hinge+0.085
Neg logit contributions
jac-0.104
arc-0.103
Birth-0.103
acion-0.099
ivia-0.098
Token does
Token ID857
Feature activation+1.51e+00

Pos logit contributions
jac+1.277
arc+1.275
Birth+1.270
acion+1.228
ivia+1.218
Neg logit contributions
 plural-1.132
hin-0.925
unct-0.924
 spoiled-0.894
 hinge-0.886
Token not
Token ID407
Feature activation+1.28e+00

Pos logit contributions
 plural+0.972
hin+0.774
unct+0.768
 hinge+0.751
 spoiled+0.743
Neg logit contributions
jac-1.230
Birth-1.220
arc-1.219
acion-1.180
ivia-1.175
Token participate
Token ID8277
Feature activation-8.58e-02

Pos logit contributions
 plural+0.833
hin+0.671
unct+0.664
 hinge+0.648
 spoiled+0.646
Neg logit contributions
jac-1.021
arc-1.013
Birth-1.013
acion-0.980
ivia-0.971
Token in
Token ID287
Feature activation-4.07e-01

Pos logit contributions
jac+0.064
arc+0.064
Birth+0.064
acion+0.061
ivia+0.061
Neg logit contributions
 plural-0.060
hin-0.049
unct-0.049
 spoiled-0.048
 hinge-0.047
Token their
Token ID511
Feature activation+6.67e-01

Pos logit contributions
jac+0.303
arc+0.301
Birth+0.300
acion+0.289
ivia+0.288
Neg logit contributions
 plural-0.287
hin-0.235
unct-0.234
 spoiled-0.228
 hinge-0.226
Token leftist
Token ID28318
Feature activation+9.80e-01

Pos logit contributions
 plural+0.477
hin+0.387
unct+0.387
 spoiled+0.380
 hinge+0.375
Neg logit contributions
jac-0.494
Birth-0.490
arc-0.489
acion-0.470
ivia-0.468
Token,
Token ID11
Feature activation+2.73e+00

Pos logit contributions
 plural+0.691
unct+0.559
hin+0.557
 spoiled+0.545
 hinge+0.541
Neg logit contributions
jac-0.735
Birth-0.733
arc-0.729
acion-0.701
ivia-0.700
Token anti
Token ID3098
Feature activation+1.44e+00

Pos logit contributions
 plural+1.754
hin+1.364
unct+1.354
 spoiled+1.343
 hinge+1.317
Neg logit contributions
jac-2.246
Birth-2.230
arc-2.212
ivia-2.151
acion-2.148
Token-
Token ID12
Feature activation+4.20e+00

Pos logit contributions
 plural+0.979
hin+0.796
unct+0.795
 spoiled+0.766
 hinge+0.757
Neg logit contributions
jac-1.094
arc-1.091
Birth-1.091
acion-1.056
ivia-1.048
TokenAmerican
Token ID7437
Feature activation+4.23e+00

Pos logit contributions
 plural+2.890
hin+2.344
unct+2.329
 spoiled+2.230
 hinge+2.158
Neg logit contributions
jac-3.099
acion-3.076
Birth-3.069
arc-3.059
ivia-3.033
Token super
Token ID2208
Feature activation+2.15e-01

Pos logit contributions
 plural+0.696
hin+0.579
unct+0.571
 spoiled+0.568
 hinge+0.554
Neg logit contributions
jac-0.655
arc-0.653
Birth-0.653
acion-0.627
ivia-0.622
Token enthusiastic
Token ID21848
Feature activation+2.30e+00

Pos logit contributions
 plural+0.153
unct+0.126
hin+0.125
 spoiled+0.123
 hinge+0.120
Neg logit contributions
jac-0.159
Birth-0.158
arc-0.158
acion-0.153
ivia-0.151
Token about
Token ID546
Feature activation+8.26e-01

Pos logit contributions
 plural+1.275
hin+0.972
unct+0.971
 spoiled+0.948
 hinge+0.928
Neg logit contributions
jac-2.054
Birth-2.047
arc-2.044
acion-1.989
ivia-1.970
Token this
Token ID428
Feature activation+2.01e+00

Pos logit contributions
 plural+0.631
hin+0.518
unct+0.517
 spoiled+0.511
 hinge+0.504
Neg logit contributions
jac-0.569
Birth-0.568
arc-0.567
acion-0.549
ivia-0.546
Token starch
Token ID39515
Feature activation-2.43e+00

Pos logit contributions
 plural+1.437
hin+1.156
 spoiled+1.150
unct+1.146
 hinge+1.138
Neg logit contributions
jac-1.501
Birth-1.488
arc-1.485
acion-1.450
ivia-1.435
Token,
Token ID11
Feature activation+1.21e+00

Pos logit contributions
arc+1.781
jac+1.778
Birth+1.763
acion+1.699
ivia+1.687
Neg logit contributions
 plural-1.707
unct-1.420
hin-1.416
 spoiled-1.357
 hinge-1.349
Token but
Token ID475
Feature activation+2.34e+00

Pos logit contributions
 plural+0.778
hin+0.618
unct+0.617
 spoiled+0.604
 hinge+0.596
Neg logit contributions
jac-0.986
Birth-0.979
arc-0.978
acion-0.949
ivia-0.941
Token it
Token ID340
Feature activation-1.99e-01

Pos logit contributions
 plural+1.501
hin+1.184
unct+1.176
 spoiled+1.169
 hinge+1.150
Neg logit contributions
Birth-1.887
jac-1.886
arc-1.878
acion-1.824
ivia-1.817
Token
Token ID447
Feature activation-3.34e-01

Pos logit contributions
jac+0.151
arc+0.151
Birth+0.151
acion+0.145
ivia+0.144
Neg logit contributions
 plural-0.136
hin-0.111
unct-0.111
 spoiled-0.108
 hinge-0.107
Token
Token ID247
Feature activation-2.49e-01

Pos logit contributions
jac+0.188
arc+0.187
Birth+0.186
acion+0.178
ivia+0.176
Neg logit contributions
 plural-0.295
hin-0.253
unct-0.253
 spoiled-0.247
 hinge-0.245
Tokens
Token ID82
Feature activation-4.75e-01

Pos logit contributions
jac+0.191
Birth+0.190
arc+0.190
acion+0.184
 Spending+0.182
Neg logit contributions
 plural-0.168
hin-0.138
unct-0.137
 spoiled-0.132
 hinge-0.131
Token Mil
Token ID4460
Feature activation-7.90e-01

Pos logit contributions
 plural+0.797
unct+0.646
hin+0.645
 spoiled+0.630
 hinge+0.622
Neg logit contributions
jac-0.895
arc-0.892
Birth-0.889
acion-0.860
ivia-0.854
Tokenitant
Token ID23737
Feature activation+3.78e+00

Pos logit contributions
jac+0.715
arc+0.714
Birth+0.710
acion+0.692
ivia+0.687
Neg logit contributions
 plural-0.428
unct-0.326
hin-0.325
 spoiled-0.315
 hinge-0.313
Token faction
Token ID17710
Feature activation+2.97e+00

Pos logit contributions
 plural+2.623
hin+2.101
unct+2.080
 spoiled+2.059
 hinge+2.024
Neg logit contributions
Birth-2.847
jac-2.846
arc-2.823
acion-2.732
ivia-2.723
Token which
Token ID543
Feature activation+1.48e+00

Pos logit contributions
 plural+2.043
hin+1.652
unct+1.627
 spoiled+1.617
 hinge+1.590
Neg logit contributions
jac-2.256
Birth-2.255
arc-2.247
acion-2.181
ivia-2.154
Token would
Token ID561
Feature activation+2.11e-01

Pos logit contributions
 plural+0.993
hin+0.803
unct+0.800
 spoiled+0.783
 hinge+0.772
Neg logit contributions
jac-1.151
arc-1.146
Birth-1.143
acion-1.105
ivia-1.095
Token emerge
Token ID14740
Feature activation+5.80e-01

Pos logit contributions
 plural+0.136
hin+0.110
unct+0.109
 spoiled+0.106
 hinge+0.105
Neg logit contributions
jac-0.169
arc-0.169
Birth-0.168
acion-0.163
ivia-0.161
Token triumphant
Token ID44005
Feature activation+5.19e-01

Pos logit contributions
 plural+0.413
hin+0.339
unct+0.339
 spoiled+0.330
 hinge+0.327
Neg logit contributions
jac-0.427
arc-0.425
Birth-0.423
acion-0.409
ivia-0.405
Token.
Token ID13
Feature activation-2.44e-01

Pos logit contributions
 plural+0.367
hin+0.301
unct+0.300
 spoiled+0.292
 hinge+0.289
Neg logit contributions
jac-0.384
arc-0.383
Birth-0.382
acion-0.369
ivia-0.365
Token It
Token ID632
Feature activation-3.23e-01

Pos logit contributions
jac+0.168
arc+0.167
Birth+0.166
acion+0.160
ivia+0.159
Neg logit contributions
 plural-0.186
hin-0.155
unct-0.155
 spoiled-0.150
 hinge-0.149
Token was
Token ID373
Feature activation-1.39e+00

Pos logit contributions
jac+0.247
arc+0.247
Birth+0.245
acion+0.237
ivia+0.236
Neg logit contributions
 plural-0.220
hin-0.179
unct-0.179
 spoiled-0.173
 hinge-0.172
Token this
Token ID428
Feature activation-1.29e+00

Pos logit contributions
jac+1.028
arc+1.025
Birth+1.022
acion+0.984
ivia+0.976
Neg logit contributions
 plural-0.970
unct-0.802
hin-0.801
 spoiled-0.772
 hinge-0.770
Token micro
Token ID4580
Feature activation+1.25e-02

Pos logit contributions
jac+0.408
arc+0.406
Birth+0.405
acion+0.390
ivia+0.387
Neg logit contributions
 plural-0.409
hin-0.338
unct-0.338
 spoiled-0.328
 hinge-0.325
Token-
Token ID12
Feature activation+1.31e+00

Pos logit contributions
 plural+0.010
hin+0.008
unct+0.008
 spoiled+0.008
 hinge+0.008
Neg logit contributions
jac-0.008
arc-0.008
Birth-0.008
acion-0.008
ivia-0.008
Tokentrans
Token ID7645
Feature activation-1.85e+00

Pos logit contributions
 plural+1.012
hin+0.849
unct+0.847
 spoiled+0.821
 hinge+0.812
Neg logit contributions
jac-0.868
Birth-0.863
arc-0.857
acion-0.830
ivia-0.829
Tokenactions
Token ID4658
Feature activation-7.61e-01

Pos logit contributions
arc+1.184
jac+1.175
Birth+1.162
acion+1.133
ivia+1.110
Neg logit contributions
 plural-1.495
hin-1.257
unct-1.252
 spoiled-1.226
 hinge-1.219
Token."
Token ID526
Feature activation-3.88e-01

Pos logit contributions
jac+0.561
arc+0.559
Birth+0.557
acion+0.538
ivia+0.533
Neg logit contributions
 plural-0.541
hin-0.444
unct-0.442
 spoiled-0.431
 hinge-0.428
Token
Token ID198
Feature activation+4.60e-01

Pos logit contributions
jac+0.260
arc+0.259
Birth+0.258
acion+0.248
ivia+0.246
Neg logit contributions
 plural-0.301
hin-0.252
unct-0.252
 spoiled-0.245
 hinge-0.243
Token
Token ID198
Feature activation+1.91e-01

Pos logit contributions
 plural+0.362
hin+0.304
unct+0.304
 spoiled+0.296
 hinge+0.294
Neg logit contributions
jac-0.303
arc-0.302
Birth-0.301
acion-0.289
ivia-0.287
TokenIn
Token ID818
Feature activation-1.04e+00

Pos logit contributions
 plural+0.146
unct+0.123
hin+0.122
 spoiled+0.118
 hinge+0.117
Neg logit contributions
jac-0.130
arc-0.129
Birth-0.128
acion-0.124
ivia-0.123
Token an
Token ID281
Feature activation-1.47e+00

Pos logit contributions
jac+0.729
arc+0.725
Birth+0.724
acion+0.697
ivia+0.692
Neg logit contributions
 plural-0.776
hin-0.642
unct-0.642
 spoiled-0.625
 hinge-0.621
Token hour
Token ID1711
Feature activation-2.35e+00

Pos logit contributions
jac+1.213
arc+1.206
Birth+1.204
acion+1.165
ivia+1.156
Neg logit contributions
 plural-0.922
hin-0.732
unct-0.731
 spoiled-0.708
 hinge-0.702
Token-
Token ID12
Feature activation+1.32e+00

Pos logit contributions
arc+1.682
jac+1.672
Birth+1.664
acion+1.602
ivia+1.589
Neg logit contributions
 plural-1.709
hin-1.424
unct-1.416
 spoiled-1.372
 hinge-1.365
TokenHowever
Token ID4864
Feature activation+8.35e-01

Pos logit contributions
 plural+0.717
hin+0.608
unct+0.608
 spoiled+0.581
 hinge+0.574
Neg logit contributions
jac-0.637
arc-0.636
Birth-0.624
acion-0.616
ivia-0.613
Token,
Token ID11
Feature activation+1.26e-01

Pos logit contributions
 plural+0.623
hin+0.516
unct+0.516
 spoiled+0.505
 hinge+0.500
Neg logit contributions
jac-0.588
arc-0.585
Birth-0.583
acion-0.562
ivia-0.559
Token the
Token ID262
Feature activation-5.47e-01

Pos logit contributions
 plural+0.090
hin+0.073
unct+0.073
 spoiled+0.072
 hinge+0.071
Neg logit contributions
jac-0.093
arc-0.093
Birth-0.093
acion-0.089
ivia-0.089
Token United
Token ID1578
Feature activation+1.76e+00

Pos logit contributions
jac+0.416
arc+0.414
Birth+0.414
acion+0.399
ivia+0.397
Neg logit contributions
 plural-0.377
unct-0.306
hin-0.306
 spoiled-0.299
 hinge-0.296
Token States
Token ID1829
Feature activation-5.85e-01

Pos logit contributions
 plural+1.385
hin+1.163
unct+1.157
 spoiled+1.131
 hinge+1.119
Neg logit contributions
jac-1.158
arc-1.155
Birth-1.148
acion-1.104
ivia-1.097
Token
Token ID447
Feature activation+1.33e+00

Pos logit contributions
jac+0.441
arc+0.440
Birth+0.438
acion+0.423
ivia+0.420
Neg logit contributions
 plural-0.404
hin-0.330
unct-0.330
 spoiled-0.320
 hinge-0.317
Token
Token ID247
Feature activation+4.47e-01

Pos logit contributions
 plural+1.143
hin+0.979
unct+0.977
 spoiled+0.952
 hinge+0.947
Neg logit contributions
jac-0.783
arc-0.776
Birth-0.771
acion-0.738
ivia-0.732
Token determination
Token ID12123
Feature activation+1.29e-01

Pos logit contributions
 plural+0.305
hin+0.250
unct+0.249
 spoiled+0.242
 hinge+0.239
Neg logit contributions
jac-0.340
arc-0.339
Birth-0.339
acion-0.327
ivia-0.324
Token to
Token ID284
Feature activation+8.79e-01

Pos logit contributions
 plural+0.097
hin+0.081
unct+0.081
 spoiled+0.078
 hinge+0.078
Neg logit contributions
jac-0.090
arc-0.089
Birth-0.089
acion-0.086
ivia-0.085
Token undercut
Token ID43726
Feature activation+1.17e+00

Pos logit contributions
 plural+0.552
hin+0.438
unct+0.435
 spoiled+0.424
 hinge+0.422
Neg logit contributions
jac-0.724
Birth-0.720
arc-0.720
acion-0.699
ivia-0.694
Token these
Token ID777
Feature activation-1.33e+00

Pos logit contributions
 plural+0.950
hin+0.796
unct+0.794
 spoiled+0.782
 hinge+0.774
Neg logit contributions
jac-0.750
Birth-0.746
arc-0.744
acion-0.710
ivia-0.703
Token
Token ID784
Feature activation+1.24e+00

Pos logit contributions
jac+1.551
arc+1.547
Birth+1.534
ivia+1.475
acion+1.473
Neg logit contributions
 plural-1.586
unct-1.328
hin-1.319
 spoiled-1.280
 hinge-1.274
Token and
Token ID290
Feature activation+8.68e-01

Pos logit contributions
 plural+0.851
hin+0.692
unct+0.688
 spoiled+0.671
 hinge+0.664
Neg logit contributions
jac-0.944
Birth-0.938
arc-0.938
acion-0.908
ivia-0.897
Token thus
Token ID4145
Feature activation-1.18e+00

Pos logit contributions
 plural+0.628
hin+0.516
unct+0.514
 spoiled+0.503
 hinge+0.499
Neg logit contributions
jac-0.631
arc-0.628
Birth-0.627
acion-0.605
ivia-0.600
Token sought
Token ID7194
Feature activation+6.56e-01

Pos logit contributions
jac+0.893
arc+0.891
Birth+0.886
acion+0.854
ivia+0.849
Neg logit contributions
 plural-0.806
unct-0.665
hin-0.663
 spoiled-0.634
 hinge-0.633
Token to
Token ID284
Feature activation-4.27e-01

Pos logit contributions
 plural+0.490
hin+0.406
unct+0.406
 spoiled+0.395
 hinge+0.392
Neg logit contributions
jac-0.460
arc-0.458
Birth-0.456
acion-0.440
ivia-0.436
Token portray
Token ID10993
Feature activation-3.11e+00

Pos logit contributions
jac+0.326
arc+0.325
Birth+0.324
acion+0.314
ivia+0.311
Neg logit contributions
 plural-0.291
hin-0.237
unct-0.237
 spoiled-0.230
 hinge-0.229
Token me
Token ID502
Feature activation+1.68e-01

Pos logit contributions
jac+2.329
arc+2.317
Birth+2.294
acion+2.211
ivia+2.209
Neg logit contributions
 plural-2.109
unct-1.771
hin-1.763
 spoiled-1.681
 hinge-1.678
Token as
Token ID355
Feature activation-7.88e-01

Pos logit contributions
 plural+0.117
hin+0.096
unct+0.096
 spoiled+0.093
 hinge+0.092
Neg logit contributions
jac-0.126
arc-0.125
Birth-0.125
acion-0.121
ivia-0.120
Token a
Token ID257
Feature activation+1.65e+00

Pos logit contributions
jac+0.582
arc+0.581
Birth+0.577
acion+0.556
ivia+0.552
Neg logit contributions
 plural-0.554
unct-0.458
hin-0.456
 spoiled-0.441
 hinge-0.439
Token r
Token ID374
Feature activation+3.39e+00

Pos logit contributions
 plural+1.146
hin+0.928
unct+0.922
 spoiled+0.918
 hinge+0.897
Neg logit contributions
jac-1.255
arc-1.253
Birth-1.251
acion-1.221
ivia-1.204
Tokenaving
Token ID2703
Feature activation+5.17e+00

Pos logit contributions
 plural+2.850
hin+2.492
unct+2.482
 spoiled+2.374
 hinge+2.340
Neg logit contributions
jac-1.989
Birth-1.982
arc-1.929
acion-1.892
 Spending-1.887
Token relationship
Token ID2776
Feature activation-1.63e+00

Pos logit contributions
arc+1.645
jac+1.640
Birth+1.634
acion+1.571
ivia+1.556
Neg logit contributions
 plural-1.904
unct-1.606
hin-1.602
 spoiled-1.553
 hinge-1.543
Token with
Token ID351
Feature activation+9.75e-01

Pos logit contributions
arc+1.092
jac+1.092
Birth+1.086
acion+1.044
ivia+1.035
Neg logit contributions
 plural-1.259
hin-1.058
unct-1.057
 spoiled-1.024
 hinge-1.017
Token Jennifer
Token ID16348
Feature activation-1.89e-01

Pos logit contributions
 plural+0.692
hin+0.561
unct+0.559
 spoiled+0.549
 hinge+0.541
Neg logit contributions
jac-0.728
arc-0.722
Birth-0.720
acion-0.696
ivia-0.690
Token,
Token ID11
Feature activation+5.64e-01

Pos logit contributions
jac+0.135
arc+0.134
Birth+0.134
acion+0.129
ivia+0.128
Neg logit contributions
 plural-0.138
hin-0.114
unct-0.114
 spoiled-0.111
 hinge-0.110
Token and
Token ID290
Feature activation-3.64e-02

Pos logit contributions
 plural+0.403
hin+0.329
unct+0.328
 spoiled+0.322
 hinge+0.317
Neg logit contributions
jac-0.416
arc-0.413
Birth-0.412
acion-0.399
ivia-0.395
Token then
Token ID788
Feature activation-1.71e+00

Pos logit contributions
jac+0.028
arc+0.028
Birth+0.027
acion+0.027
ivia+0.026
Neg logit contributions
 plural-0.025
hin-0.020
unct-0.020
 spoiled-0.020
 hinge-0.020
Token Bradley
Token ID16182
Feature activation+8.73e-01

Pos logit contributions
jac+1.261
arc+1.260
Birth+1.253
acion+1.209
ivia+1.199
Neg logit contributions
 plural-1.195
unct-0.988
hin-0.985
 spoiled-0.952
 hinge-0.946
Token Cooper
Token ID10382
Feature activation-2.40e+00

Pos logit contributions
 plural+0.619
hin+0.507
unct+0.507
 spoiled+0.494
 hinge+0.488
Neg logit contributions
jac-0.645
arc-0.642
Birth-0.639
acion-0.618
ivia-0.612
Token starts
Token ID4940
Feature activation-2.21e+00

Pos logit contributions
jac+1.792
arc+1.789
Birth+1.780
acion+1.719
ivia+1.706
Neg logit contributions
 plural-1.667
unct-1.371
hin-1.369
 spoiled-1.323
 hinge-1.320
Token to
Token ID284
Feature activation-7.52e-01

Pos logit contributions
jac+1.576
arc+1.571
Birth+1.564
acion+1.507
ivia+1.494
Neg logit contributions
 plural-1.621
unct-1.345
hin-1.343
 spoiled-1.302
 hinge-1.295
Token fall
Token ID2121
Feature activation-2.42e+00

Pos logit contributions
jac+0.598
arc+0.596
Birth+0.594
acion+0.576
ivia+0.571
Neg logit contributions
 plural-0.491
hin-0.396
unct-0.394
 spoiled-0.383
 hinge-0.381
Token Lap
Token ID26944
Feature activation+2.88e+00

Pos logit contributions
 plural+1.751
hin+1.467
unct+1.454
 spoiled+1.413
 Jin+1.407
Neg logit contributions
jac-1.675
Birth-1.663
arc-1.661
acion-1.593
ivia-1.589
Tokeneral
Token ID1691
Feature activation+1.44e+00

Pos logit contributions
 plural+2.160
hin+1.868
unct+1.865
 spoiled+1.762
 hinge+1.752
Neg logit contributions
Birth-1.950
jac-1.930
arc-1.896
 Spending-1.845
ivia-1.820
Token of
Token ID286
Feature activation+3.82e-01

Pos logit contributions
 plural+0.999
hin+0.842
unct+0.823
 spoiled+0.799
 hinge+0.786
Neg logit contributions
Birth-1.067
jac-1.056
arc-1.049
 Spending-1.012
ivia-1.008
Token Behavioral
Token ID38483
Feature activation+9.29e-01

Pos logit contributions
 plural+0.266
unct+0.218
hin+0.217
 spoiled+0.211
 hinge+0.209
Neg logit contributions
jac-0.286
arc-0.285
Birth-0.284
acion-0.274
ivia-0.272
Token Dynamics
Token ID33806
Feature activation+9.92e-01

Pos logit contributions
 plural+0.769
hin+0.654
unct+0.654
 spoiled+0.632
 hinge+0.628
Neg logit contributions
jac-0.572
arc-0.570
Birth-0.567
acion-0.539
ivia-0.537
Token,
Token ID11
Feature activation-1.44e-01

Pos logit contributions
 plural+0.748
hin+0.625
unct+0.622
 spoiled+0.603
 hinge+0.598
Neg logit contributions
jac-0.688
arc-0.685
Birth-0.681
acion-0.652
ivia-0.648
Token said
Token ID531
Feature activation+1.84e-01

Pos logit contributions
jac+0.105
arc+0.105
Birth+0.104
acion+0.101
ivia+0.100
Neg logit contributions
 plural-0.104
hin-0.086
unct-0.086
 spoiled-0.083
 hinge-0.082
Token she
Token ID673
Feature activation-9.86e-01

Pos logit contributions
 plural+0.122
hin+0.098
unct+0.098
 spoiled+0.095
 hinge+0.094
Neg logit contributions
jac-0.145
arc-0.144
Birth-0.144
acion-0.139
ivia-0.138
Token had
Token ID550
Feature activation-4.32e-01

Pos logit contributions
jac+0.719
arc+0.718
Birth+0.713
acion+0.690
ivia+0.685
Neg logit contributions
 plural-0.703
unct-0.581
hin-0.580
 spoiled-0.561
 hinge-0.558
Token developed
Token ID4166
Feature activation-1.66e+00

Pos logit contributions
jac+0.324
arc+0.323
Birth+0.322
acion+0.310
ivia+0.308
Neg logit contributions
 plural-0.303
hin-0.248
unct-0.247
 spoiled-0.241
 hinge-0.238
Token a
Token ID257
Feature activation+9.27e-01

Pos logit contributions
jac+1.230
arc+1.228
Birth+1.221
acion+1.178
ivia+1.171
Neg logit contributions
 plural-1.166
hin-0.961
unct-0.960
 spoiled-0.930
 hinge-0.922
Token years
Token ID812
Feature activation+6.47e-01

Pos logit contributions
jac+1.363
arc+1.361
Birth+1.360
acion+1.307
ivia+1.295
Neg logit contributions
 plural-1.381
unct-1.147
hin-1.146
 spoiled-1.109
 hinge-1.102
Token and
Token ID290
Feature activation-1.99e+00

Pos logit contributions
 plural+0.509
hin+0.426
unct+0.426
 spoiled+0.417
 hinge+0.414
Neg logit contributions
jac-0.428
arc-0.425
Birth-0.421
acion-0.408
ivia-0.403
Token older
Token ID4697
Feature activation+1.37e+00

Pos logit contributions
arc+1.314
Birth+1.313
jac+1.310
acion+1.254
 Spending+1.242
Neg logit contributions
 plural-1.540
unct-1.306
hin-1.304
 hinge-1.253
 spoiled-1.253
Token,
Token ID11
Feature activation+2.32e-01

Pos logit contributions
 plural+0.952
hin+0.782
unct+0.776
 spoiled+0.760
 hinge+0.752
Neg logit contributions
jac-1.022
arc-1.016
Birth-1.015
acion-0.986
ivia-0.972
Token by
Token ID416
Feature activation+6.24e-01

Pos logit contributions
 plural+0.160
hin+0.130
unct+0.130
 spoiled+0.127
 hinge+0.126
Neg logit contributions
jac-0.176
arc-0.175
Birth-0.174
acion-0.169
ivia-0.168
Token sex
Token ID1714
Feature activation-4.31e+00

Pos logit contributions
 plural+0.477
unct+0.391
hin+0.390
 spoiled+0.384
 hinge+0.383
Neg logit contributions
jac-0.435
arc-0.430
Birth-0.425
acion-0.414
ivia-0.411
Token,
Token ID11
Feature activation+3.73e-01

Pos logit contributions
arc+2.797
jac+2.773
Birth+2.771
acion+2.659
 Spending+2.639
Neg logit contributions
 plural-3.375
unct-2.877
hin-2.867
 spoiled-2.786
 hinge-2.752
Token United
Token ID1578
Feature activation+2.24e+00

Pos logit contributions
 plural+0.263
hin+0.214
unct+0.213
 spoiled+0.209
 hinge+0.209
Neg logit contributions
jac-0.278
arc-0.276
Birth-0.274
acion-0.267
ivia-0.265
Token States
Token ID1829
Feature activation-1.93e+00

Pos logit contributions
 plural+1.730
hin+1.448
unct+1.436
 spoiled+1.407
 hinge+1.390
Neg logit contributions
jac-1.500
arc-1.485
Birth-1.484
acion-1.431
ivia-1.426
Token,
Token ID11
Feature activation+6.88e-01

Pos logit contributions
jac+1.358
arc+1.355
Birth+1.349
acion+1.299
ivia+1.288
Neg logit contributions
 plural-1.436
unct-1.193
hin-1.193
 spoiled-1.159
 hinge-1.151
Token 1991
Token ID10249
Feature activation+2.63e+00

Pos logit contributions
 plural+0.497
hin+0.407
unct+0.405
 spoiled+0.397
 hinge+0.396
Neg logit contributions
jac-0.500
arc-0.497
Birth-0.493
acion-0.480
ivia-0.477
Token conference
Token ID4495
Feature activation-2.01e-01

Pos logit contributions
jac+0.220
arc+0.219
Birth+0.219
acion+0.212
ivia+0.210
Neg logit contributions
 plural-0.174
hin-0.139
unct-0.139
 spoiled-0.135
 hinge-0.134
Token in
Token ID287
Feature activation+1.60e+00

Pos logit contributions
jac+0.149
arc+0.148
Birth+0.147
acion+0.143
ivia+0.141
Neg logit contributions
 plural-0.141
hin-0.116
unct-0.116
 spoiled-0.113
 hinge-0.112
Token Paris
Token ID6342
Feature activation+2.24e+00

Pos logit contributions
 plural+1.084
unct+0.880
hin+0.877
 spoiled+0.853
 hinge+0.846
Neg logit contributions
arc-1.235
jac-1.235
Birth-1.231
acion-1.187
ivia-1.177
Token in
Token ID287
Feature activation+1.57e+00

Pos logit contributions
 plural+1.458
hin+1.172
unct+1.168
 spoiled+1.136
 hinge+1.132
Neg logit contributions
jac-1.781
Birth-1.775
arc-1.768
acion-1.706
ivia-1.700
Token December
Token ID3426
Feature activation-8.76e-01

Pos logit contributions
 plural+1.077
unct+0.876
hin+0.872
 spoiled+0.850
 hinge+0.844
Neg logit contributions
jac-1.199
arc-1.198
Birth-1.195
acion-1.152
ivia-1.143
Token.
Token ID13
Feature activation-1.72e-01

Pos logit contributions
jac+0.621
arc+0.619
Birth+0.616
acion+0.594
ivia+0.589
Neg logit contributions
 plural-0.645
hin-0.535
unct-0.535
 spoiled-0.519
 hinge-0.515
Token
Token ID198
Feature activation+7.84e-01

Pos logit contributions
jac+0.115
arc+0.115
Birth+0.114
acion+0.110
ivia+0.109
Neg logit contributions
 plural-0.133
unct-0.111
hin-0.111
 spoiled-0.108
 hinge-0.107
Token
Token ID198
Feature activation+8.41e-01

Pos logit contributions
 plural+0.615
hin+0.516
unct+0.516
 spoiled+0.502
 hinge+0.498
Neg logit contributions
jac-0.517
arc-0.516
Birth-0.513
acion-0.494
ivia-0.490
TokenFrench
Token ID24111
Feature activation+5.53e-01

Pos logit contributions
 plural+0.613
unct+0.512
hin+0.509
 spoiled+0.488
 hinge+0.484
Neg logit contributions
arc-0.594
jac-0.593
Birth-0.587
acion-0.572
ivia-0.571
Token President
Token ID1992
Feature activation+5.37e-01

Pos logit contributions
 plural+0.450
unct+0.374
hin+0.372
 spoiled+0.367
 hinge+0.365
Neg logit contributions
Birth-0.354
jac-0.353
arc-0.352
acion-0.338
ivia-0.335
Token Francois
Token ID40419
Feature activation-5.29e+00

Pos logit contributions
 plural+0.461
hin+0.388
unct+0.388
 spoiled+0.382
 hinge+0.378
Neg logit contributions
jac-0.319
Birth-0.319
arc-0.318
acion-0.305
ivia-0.302
Token extr
Token ID22820
Feature activation-2.25e+00

Pos logit contributions
 plural+0.252
unct+0.208
hin+0.207
 hinge+0.203
 spoiled+0.203
Neg logit contributions
jac-0.229
arc-0.226
Birth-0.224
ivia-0.216
acion-0.216
Tokenusion
Token ID4241
Feature activation+4.07e-01

Pos logit contributions
arc+1.608
jac+1.607
Birth+1.597
acion+1.548
ivia+1.524
Neg logit contributions
 plural-1.641
hin-1.354
unct-1.348
 spoiled-1.324
 hinge-1.307
Token v
Token ID410
Feature activation-2.84e-01

Pos logit contributions
 plural+0.304
unct+0.250
hin+0.249
 hinge+0.245
 spoiled+0.243
Neg logit contributions
jac-0.288
Birth-0.285
arc-0.284
acion-0.275
ivia-0.274
Token.
Token ID13
Feature activation+2.56e+00

Pos logit contributions
jac+0.204
arc+0.203
Birth+0.202
acion+0.195
ivia+0.194
Neg logit contributions
 plural-0.207
unct-0.171
hin-0.171
 spoiled-0.165
 hinge-0.165
Tokenver
Token ID332
Feature activation-8.94e-01

Pos logit contributions
 plural+1.981
unct+1.692
hin+1.670
 hinge+1.600
 spoiled+1.586
Neg logit contributions
jac-1.676
Birth-1.650
arc-1.619
 Spending-1.595
ivia-1.595
Tokentex
Token ID16886
Feature activation-5.24e+00

Pos logit contributions
jac+0.615
arc+0.613
Birth+0.609
acion+0.588
ivia+0.583
Neg logit contributions
 plural-0.678
hin-0.565
unct-0.564
 spoiled-0.551
 hinge-0.546
Token.
Token ID13
Feature activation+1.45e+00

Pos logit contributions
arc+3.462
Birth+3.460
jac+3.414
acion+3.285
ivia+3.212
Neg logit contributions
 plural-4.040
hin-3.410
unct-3.382
 spoiled-3.346
 hinge-3.273
Tokenxy
Token ID5431
Feature activation-9.12e-01

Pos logit contributions
 plural+0.898
hin+0.736
unct+0.733
 hinge+0.687
 spoiled+0.683
Neg logit contributions
jac-1.171
Birth-1.163
arc-1.150
acion-1.128
ivia-1.128
Tokenz
Token ID89
Feature activation-3.20e+00

Pos logit contributions
jac+0.484
arc+0.481
Birth+0.480
acion+0.457
ivia+0.453
Neg logit contributions
 plural-0.833
hin-0.720
unct-0.719
 spoiled-0.701
 hinge-0.698
Token +=
Token ID15853
Feature activation+9.92e-02

Pos logit contributions
arc+1.652
jac+1.640
Birth+1.629
acion+1.545
ivia+1.521
Neg logit contributions
 plural-2.970
hin-2.568
unct-2.563
 spoiled-2.525
 hinge-2.499
Token v
Token ID410
Feature activation-7.70e-01

Pos logit contributions
 plural+0.071
unct+0.058
hin+0.058
 hinge+0.056
 spoiled+0.056
Neg logit contributions
jac-0.073
arc-0.073
Birth-0.072
acion-0.070
ivia-0.070
Token after
Token ID706
Feature activation+1.07e+00

Pos logit contributions
 plural+2.049
hin+1.651
 spoiled+1.641
unct+1.639
 hinge+1.633
Neg logit contributions
jac-2.325
arc-2.307
Birth-2.305
acion-2.226
ivia-2.219
Token the
Token ID262
Feature activation-2.89e-02

Pos logit contributions
 plural+0.753
unct+0.614
hin+0.612
 spoiled+0.595
 hinge+0.591
Neg logit contributions
jac-0.794
arc-0.794
Birth-0.790
acion-0.763
ivia-0.759
Token student
Token ID3710
Feature activation-3.27e-01

Pos logit contributions
jac+0.022
arc+0.022
Birth+0.022
acion+0.021
ivia+0.021
Neg logit contributions
 plural-0.020
unct-0.016
hin-0.016
 spoiled-0.016
 hinge-0.016
Token appealed
Token ID22280
Feature activation+3.63e-01

Pos logit contributions
jac+0.240
arc+0.239
Birth+0.238
acion+0.230
ivia+0.228
Neg logit contributions
 plural-0.233
unct-0.193
hin-0.192
 spoiled-0.186
 hinge-0.185
Token his
Token ID465
Feature activation-7.02e-01

Pos logit contributions
 plural+0.257
unct+0.211
hin+0.211
 spoiled+0.205
 hinge+0.203
Neg logit contributions
jac-0.268
arc-0.267
Birth-0.266
acion-0.256
ivia-0.255
Token University
Token ID2059
Feature activation-5.49e+00

Pos logit contributions
jac+0.505
arc+0.503
Birth+0.502
acion+0.484
ivia+0.479
Neg logit contributions
 plural-0.508
hin-0.421
unct-0.420
 spoiled-0.407
 hinge-0.404
Token of
Token ID286
Feature activation-7.95e-01

Pos logit contributions
Birth+3.654
jac+3.637
arc+3.605
acion+3.555
 Spending+3.522
Neg logit contributions
 plural-4.101
hin-3.506
unct-3.449
zbollah-3.410
 spoiled-3.380
Token Cincinnati
Token ID16137
Feature activation-1.60e+00

Pos logit contributions
jac+0.614
arc+0.612
Birth+0.610
acion+0.590
ivia+0.585
Neg logit contributions
 plural-0.535
hin-0.435
unct-0.435
 spoiled-0.421
 hinge-0.418
Token suspension
Token ID11461
Feature activation-2.71e+00

Pos logit contributions
jac+1.154
arc+1.147
Birth+1.145
acion+1.104
ivia+1.091
Neg logit contributions
 plural-1.149
hin-0.953
unct-0.950
 spoiled-0.921
 hinge-0.914
Token.
Token ID13
Feature activation-5.10e-01

Pos logit contributions
jac+1.860
arc+1.855
Birth+1.847
acion+1.785
ivia+1.760
Neg logit contributions
 plural-2.031
unct-1.702
hin-1.691
 spoiled-1.633
 hinge-1.610
Token He
Token ID679
Feature activation-5.76e-01

Pos logit contributions
jac+0.347
arc+0.346
Birth+0.345
acion+0.332
ivia+0.329
Neg logit contributions
 plural-0.391
hin-0.326
unct-0.326
 spoiled-0.317
 hinge-0.315
Tokenfan
Token ID24408
Feature activation-2.29e-01

Pos logit contributions
jac+3.221
arc+3.212
Birth+3.130
acion+3.056
ivia+3.017
Neg logit contributions
 plural-4.125
unct-3.449
 spoiled-3.421
hin-3.416
 hinge-3.416
Token G
Token ID402
Feature activation-2.73e+00

Pos logit contributions
jac+0.167
arc+0.167
Birth+0.166
acion+0.160
ivia+0.159
Neg logit contributions
 plural-0.164
hin-0.136
unct-0.136
 spoiled-0.131
 hinge-0.130
Token K
Token ID509
Feature activation-5.88e-01

Pos logit contributions
arc+1.839
jac+1.836
Birth+1.814
acion+1.753
ivia+1.743
Neg logit contributions
 plural-2.111
unct-1.755
hin-1.752
 spoiled-1.720
 hinge-1.705
Tokenert
Token ID861
Feature activation+1.13e+00

Pos logit contributions
jac+0.389
arc+0.387
Birth+0.387
acion+0.371
ivia+0.367
Neg logit contributions
 plural-0.460
hin-0.388
unct-0.387
 spoiled-0.376
 hinge-0.373
Tokenes
Token ID274
Feature activation-3.97e+00

Pos logit contributions
 plural+0.847
hin+0.717
unct+0.711
 spoiled+0.688
 hinge+0.682
Neg logit contributions
Birth-0.773
jac-0.768
arc-0.765
acion-0.738
ivia-0.731
Tokenz
Token ID89
Feature activation-5.29e+00

Pos logit contributions
jac+2.289
arc+2.283
Birth+2.240
acion+2.155
 Spending+2.123
Neg logit contributions
 plural-3.454
unct-2.947
hin-2.921
 spoiled-2.891
 hinge-2.872
Token,
Token ID11
Feature activation+7.67e-01

Pos logit contributions
jac+3.752
arc+3.694
Birth+3.649
acion+3.511
 Spending+3.482
Neg logit contributions
 plural-3.846
unct-3.160
hin-3.137
 spoiled-3.116
 hinge-3.091
Token MD
Token ID10670
Feature activation+1.54e+00

Pos logit contributions
 plural+0.547
unct+0.450
hin+0.449
 spoiled+0.436
 hinge+0.432
Neg logit contributions
jac-0.563
arc-0.561
Birth-0.559
acion-0.540
ivia-0.537
Token,
Token ID11
Feature activation+3.08e+00

Pos logit contributions
 plural+1.104
hin+0.918
unct+0.917
 spoiled+0.878
 hinge+0.872
Neg logit contributions
jac-1.115
arc-1.109
Birth-1.108
acion-1.076
ivia-1.067
Token is
Token ID318
Feature activation+1.14e+00

Pos logit contributions
 plural+2.212
unct+1.826
hin+1.804
 spoiled+1.761
 hinge+1.741
Neg logit contributions
jac-2.233
Birth-2.218
arc-2.214
acion-2.147
ivia-2.140
Token a
Token ID257
Feature activation+1.78e+00

Pos logit contributions
 plural+0.786
unct+0.644
hin+0.638
 spoiled+0.619
 hinge+0.612
Neg logit contributions
jac-0.860
arc-0.857
Birth-0.855
acion-0.828
ivia-0.823
Token.
Token ID13
Feature activation+6.80e-01

Pos logit contributions
jac+0.578
arc+0.577
Birth+0.573
acion+0.553
ivia+0.548
Neg logit contributions
 plural-0.595
hin-0.493
unct-0.493
 spoiled-0.478
 hinge-0.475
Tokentv
Token ID14981
Feature activation-1.96e+00

Pos logit contributions
 plural+0.509
hin+0.425
unct+0.423
 spoiled+0.409
 hinge+0.406
Neg logit contributions
jac-0.472
Birth-0.468
arc-0.468
acion-0.452
ivia-0.449
Token/
Token ID14
Feature activation-1.36e-01

Pos logit contributions
arc+1.275
jac+1.274
Birth+1.265
acion+1.215
ivia+1.205
Neg logit contributions
 plural-1.549
unct-1.308
hin-1.305
 spoiled-1.270
 hinge-1.262
Tokend
Token ID67
Feature activation+8.09e-01

Pos logit contributions
jac+0.095
arc+0.094
Birth+0.094
acion+0.091
ivia+0.090
Neg logit contributions
 plural-0.102
hin-0.085
unct-0.085
 spoiled-0.082
 hinge-0.082
Tokenanks
Token ID2283
Feature activation+2.42e+00

Pos logit contributions
 plural+0.640
hin+0.547
unct+0.541
 spoiled+0.522
 hinge+0.520
Neg logit contributions
Birth-0.521
jac-0.519
arc-0.515
acion-0.497
 Spending-0.496
Tokenh
Token ID71
Feature activation-6.38e+00

Pos logit contributions
 plural+1.872
hin+1.606
unct+1.572
 spoiled+1.529
 hinge+1.501
Neg logit contributions
Birth-1.599
jac-1.594
arc-1.580
acion-1.542
 Spending-1.517
Tokenrine
Token ID7640
Feature activation-6.26e-01

Pos logit contributions
arc+4.183
jac+4.069
Birth+4.057
acion+3.981
ivia+3.929
Neg logit contributions
 plural-5.052
unct-4.132
hin-4.126
 hinge-4.116
 spoiled-4.109
Token Weekly
Token ID18168
Feature activation+2.20e+00

Pos logit contributions
jac+0.427
arc+0.426
Birth+0.424
acion+0.409
ivia+0.405
Neg logit contributions
 plural-0.478
hin-0.399
unct-0.398
 spoiled-0.388
 hinge-0.384
Token SC
Token ID6374
Feature activation+1.04e+00

Pos logit contributions
 plural+1.646
hin+1.363
unct+1.352
 spoiled+1.322
 hinge+1.304
Neg logit contributions
jac-1.550
arc-1.528
Birth-1.517
acion-1.483
ivia-1.472
Token2
Token ID17
Feature activation-3.26e-01

Pos logit contributions
 plural+0.779
hin+0.643
unct+0.633
 spoiled+0.627
 hinge+0.622
Neg logit contributions
jac-0.729
Birth-0.718
arc-0.718
acion-0.699
ivia-0.691
Token Podcast
Token ID16036
Feature activation+7.30e-01

Pos logit contributions
jac+0.230
arc+0.230
Birth+0.229
acion+0.220
ivia+0.219
Neg logit contributions
 plural-0.241
unct-0.200
hin-0.200
 spoiled-0.194
 hinge-0.193
Token desire
Token ID6227
Feature activation+1.66e-01

Pos logit contributions
 plural+0.849
hin+0.684
unct+0.680
 hinge+0.664
 spoiled+0.663
Neg logit contributions
jac-0.977
arc-0.974
Birth-0.970
acion-0.940
ivia-0.931
Token physical
Token ID3518
Feature activation+2.13e+00

Pos logit contributions
 plural+0.121
hin+0.099
unct+0.097
 spoiled+0.096
 hinge+0.096
Neg logit contributions
jac-0.120
arc-0.120
Birth-0.119
acion-0.115
ivia-0.115
Token transition
Token ID6801
Feature activation-1.32e+00

Pos logit contributions
 plural+1.633
hin+1.303
 hinge+1.279
 spoiled+1.273
unct+1.256
Neg logit contributions
arc-1.515
jac-1.514
Birth-1.478
ivia-1.445
acion-1.444
Token ("
Token ID5855
Feature activation-1.51e+00

Pos logit contributions
jac+0.941
arc+0.937
Birth+0.932
acion+0.900
ivia+0.892
Neg logit contributions
 plural-0.972
hin-0.804
unct-0.799
 spoiled-0.779
 hinge-0.773
Tokentrans
Token ID7645
Feature activation-8.58e+00

Pos logit contributions
jac+1.030
arc+1.028
Birth+1.023
acion+0.986
ivia+0.978
Neg logit contributions
 plural-1.150
hin-0.961
unct-0.959
 spoiled-0.932
 hinge-0.926
Tokensexual
Token ID18338
Feature activation-7.34e+00

Pos logit contributions
Birth+4.666
arc+4.657
jac+4.630
acion+4.461
ivia+4.376
Neg logit contributions
 plural-7.336
hin-6.409
unct-6.383
 spoiled-6.275
 hinge-6.199
Token"
Token ID1
Feature activation-3.40e+00

Pos logit contributions
jac+4.656
arc+4.650
Birth+4.645
acion+4.418
ivia+4.403
Neg logit contributions
 plural-5.629
unct-4.934
hin-4.871
 spoiled-4.714
 hinge-4.639
Token people
Token ID661
Feature activation-2.41e+00

Pos logit contributions
arc+2.453
jac+2.442
Birth+2.437
acion+2.340
ivia+2.321
Neg logit contributions
 plural-2.399
unct-2.029
hin-2.019
 spoiled-1.946
 hinge-1.930
Token)
Token ID8
Feature activation-8.65e-01

Pos logit contributions
arc+1.752
jac+1.752
Birth+1.740
acion+1.674
ivia+1.666
Neg logit contributions
 plural-1.692
unct-1.419
hin-1.407
 spoiled-1.363
 hinge-1.348
Token and
Token ID290
Feature activation-8.97e-01

Pos logit contributions
jac+0.598
arc+0.593
Birth+0.592
acion+0.572
ivia+0.564
Neg logit contributions
 plural-0.659
hin-0.546
unct-0.541
 spoiled-0.531
 hinge-0.526
Token other
Token ID584
Feature activation-3.56e-01

Pos logit contributions
jac+0.622
arc+0.619
Birth+0.617
acion+0.594
ivia+0.589
Neg logit contributions
 plural-0.680
hin-0.564
unct-0.562
 spoiled-0.549
 hinge-0.545