TOP ACTIVATIONS
MAX = 17.700

house on the r ue du Vie ux - Col omb
10 35 ) Per form 2 aerial gestures before
about which player out - per forms the other . Norman
Pro Marshal Y anda per Pro Football Focus .
voters that they will not dev iate an inch from the
954 - 12 76 or ed raper @ den ver post
Amount ; } 1 2 3 4 5 6 7 8
targets for 23 yards , per Next Gen Stats . Sherman
. . @ dev on ms ayers reports that
in DV OA ( value per play ), and fourth in
r ags - to - ric hes story " <|endoftext|> (
h entry in ` ser ach Space `, find its N
ulse Amount ; } 1 2 3 4 5 6 7
lement ser te ur du programme C . O .
version of the Rift would employ such a thing ( as
For example , the Buddhist monk , Hai y un ,
wrong language !) to becoming a Junior Developer within one year
( t ) = exp (- ( t - t 0
in the aftermath of 9 / 11 , when 19 guys
. com / politics / c nn - drops - re

MIN ACTIVATIONS
MIN = -18.750

the spoil thereof , shalt thou take unto thy self .
od , brother - in - law Raj nik anth Singh
­ on to this pres ­ id ­ ent ,
his new brother - in - law , E ad gar
contemporary society , are not uncont roversial or immune from any
away . Note that on some radios , when you un
In arm ad illo le pro sy , lesions did not
. Cha vel ( New York : Sh ilo Publishing House
Defense and commander - in - chief of the German Army
well as any sense of inev itability about what happens in
We must commit ourselves very res olutely because there isn
, he says j okingly . One of the
in fond ness and in j ib es , identify accents
at 16 80 x 10 50 the pixels are small enough
Alpha Part icle X - Ray Spect rom eter ( AP
it unto thy hands , thou shalt sm ite every male
thrust of his 2012 pres ­ id ­ en ­ t
the two different types of block syntax besides the fact that
ve had as pres ­ id ­ ent since at
not persist at the site of entry in animals , M

INTERVAL 8.587 - 17.700
CONTAINS 0.422%

1 $ g db -- quiet ./ rop as
office tied to power - bro k ering with the far
relief from business problems and bad debts . Standing Lord Gan
, 2 , .. *) graph List = Table [ {
privacy - centric variant called Focus and speed improvements transferred from

INTERVAL -0.525 - 8.587
CONTAINS 56.772%

that the valves are supposed to satisfy conflicting demands
Clintons outside # Deb ates 2016 pic . twitter . com
. It happened in 2015 ; it s happening
taste or better ; otherwise they won t go
's home farm , and some from another farm .

INTERVAL -9.637 - -0.525
CONTAINS 42.605%

money and their families have good benefits , everybody gains from
was twice as high at charter schools ( St uit and
Mer ck x said just after he set the mark .
The opening ceremony featured a model wearing a dress created from
self - lo athing , fear and despair that ultimately gets

INTERVAL -18.750 - -9.637
CONTAINS 0.201%

or pushing shadows at higher IS Os . Color is academic
contemporary society , are not uncont roversial or immune from any
6 { ( 105 s )} Ne 7 { ( 69
in public ' for viewing South Korean movies and owning B
t improved seemingly at all since those days . They have
Token house
Token ID2156
Feature activation+4.44e+00

Pos logit contributions
unks+0.121
oppers+0.118
imir+0.114
arn+0.103
avis+0.100
Neg logit contributions
Maker-0.102
gently-0.101
breaks-0.098
 Decoder-0.094
yon-0.094
Token on
Token ID319
Feature activation+1.98e+00

Pos logit contributions
unks+3.051
oppers+2.940
imir+2.856
arn+2.587
avis+2.493
Neg logit contributions
Maker-3.000
gently-2.938
breaks-2.814
 Decoder-2.811
ahime-2.743
Token the
Token ID262
Feature activation-4.48e+00

Pos logit contributions
unks+1.488
oppers+1.459
imir+1.416
arn+1.285
avis+1.246
Neg logit contributions
Maker-1.206
gently-1.202
breaks-1.163
 Decoder-1.107
yon-1.103
Token r
Token ID374
Feature activation+2.26e+00

Pos logit contributions
Maker+2.730
gently+2.715
breaks+2.617
 Decoder+2.527
yon+2.516
Neg logit contributions
unks-3.377
oppers-3.281
imir-3.171
arn-2.830
=-=-=-=-=-=-=-=--2.766
Tokenue
Token ID518
Feature activation+2.31e+00

Pos logit contributions
unks+1.321
oppers+1.272
imir+1.212
arn+1.072
avis+1.024
Neg logit contributions
Maker-1.787
gently-1.773
breaks-1.732
 Decoder-1.680
ahime-1.651
Token du
Token ID7043
Feature activation+1.77e+01

Pos logit contributions
unks+1.724
oppers+1.688
imir+1.642
arn+1.489
avis+1.436
Neg logit contributions
Maker-1.424
gently-1.417
breaks-1.364
 Decoder-1.309
yon-1.306
Token Vie
Token ID47154
Feature activation+1.18e+00

Pos logit contributions
oppers+11.315
unks+11.310
imir+11.049
 Continent+10.452
arn+10.287
Neg logit contributions
gently-9.817
breaks-9.395
ahime-9.162
Maker-9.054
topic-8.640
Tokenux
Token ID2821
Feature activation+3.39e+00

Pos logit contributions
unks+0.829
oppers+0.808
imir+0.781
arn+0.703
avis+0.677
Neg logit contributions
Maker-0.786
gently-0.782
breaks-0.756
 Decoder-0.729
yon-0.723
Token-
Token ID12
Feature activation+2.80e+00

Pos logit contributions
unks+2.465
oppers+2.415
imir+2.352
arn+2.143
avis+2.066
Neg logit contributions
Maker-2.139
gently-2.134
breaks-2.053
 Decoder-1.969
ahime-1.956
TokenCol
Token ID5216
Feature activation-4.06e+00

Pos logit contributions
unks+1.904
oppers+1.860
imir+1.800
arn+1.633
avis+1.566
Neg logit contributions
gently-1.897
Maker-1.871
breaks-1.828
 Decoder-1.777
yon-1.747
Tokenomb
Token ID2381
Feature activation-6.37e-01

Pos logit contributions
Maker+2.747
gently+2.734
breaks+2.670
yon+2.578
ona+2.573
Neg logit contributions
unks-2.790
oppers-2.692
imir-2.575
arn-2.266
=-=-=-=-=-=-=-=--2.261
Token10
Token ID940
Feature activation-4.24e-01

Pos logit contributions
Maker+1.285
gently+1.274
breaks+1.240
yon+1.183
 Decoder+1.177
Neg logit contributions
unks-1.461
oppers-1.427
imir-1.373
arn-1.228
avis-1.197
Token35
Token ID2327
Feature activation-3.47e+00

Pos logit contributions
Maker+0.259
gently+0.257
breaks+0.249
 Decoder+0.238
yon+0.237
Neg logit contributions
unks-0.322
oppers-0.315
imir-0.304
arn-0.275
avis-0.267
Token)
Token ID8
Feature activation+1.21e+00

Pos logit contributions
Maker+2.056
gently+2.037
breaks+1.956
 Decoder+1.878
yon+1.857
Neg logit contributions
unks-2.692
oppers-2.627
imir-2.555
arn-2.285
avis-2.246
Token
Token ID198
Feature activation+7.31e-01

Pos logit contributions
unks+0.926
oppers+0.904
imir+0.872
arn+0.792
avis+0.768
Neg logit contributions
Maker-0.737
gently-0.732
breaks-0.707
yon-0.677
 Decoder-0.676
Token
Token ID198
Feature activation+2.89e+00

Pos logit contributions
unks+0.558
oppers+0.544
imir+0.527
arn+0.478
avis+0.463
Neg logit contributions
Maker-0.444
gently-0.442
breaks-0.426
 Decoder-0.409
yon-0.407
TokenPer
Token ID5990
Feature activation+1.70e+01

Pos logit contributions
unks+1.824
oppers+1.763
imir+1.694
arn+1.531
=-=-=-=-=-=-=-=-+1.461
Neg logit contributions
gently-2.091
Maker-2.085
breaks-2.034
 Decoder-1.985
ahime-1.965
Tokenform
Token ID687
Feature activation+3.95e+00

Pos logit contributions
unks+9.075
oppers+8.782
imir+8.773
arn+7.821
ores+7.245
Neg logit contributions
breaks-11.312
Maker-11.232
gently-10.980
 Decoder-10.866
ahime-10.805
Token 2
Token ID362
Feature activation-9.62e-02

Pos logit contributions
unks+2.813
oppers+2.749
imir+2.664
arn+2.392
avis+2.328
Neg logit contributions
Maker-2.540
gently-2.514
breaks-2.430
yon-2.330
ahime-2.320
Token aerial
Token ID17789
Feature activation-1.39e+00

Pos logit contributions
Maker+0.060
gently+0.060
breaks+0.058
 Decoder+0.055
yon+0.055
Neg logit contributions
unks-0.072
oppers-0.070
imir-0.068
arn-0.061
avis-0.060
Token gestures
Token ID24621
Feature activation+4.76e+00

Pos logit contributions
Maker+0.885
gently+0.880
breaks+0.857
 Decoder+0.820
yon+0.816
Neg logit contributions
unks-1.023
oppers-1.002
imir-0.966
arn-0.871
avis-0.841
Token before
Token ID878
Feature activation+2.65e+00

Pos logit contributions
unks+3.220
oppers+3.136
imir+3.031
arn+2.754
avis+2.639
Neg logit contributions
Maker-3.199
gently-3.150
breaks-3.056
ahime-2.970
yon-2.970
Token about
Token ID546
Feature activation-1.17e+00

Pos logit contributions
Maker+2.552
gently+2.492
breaks+2.428
 Decoder+2.357
yon+2.317
Neg logit contributions
unks-3.191
oppers-3.098
imir-2.987
arn-2.670
=-=-=-=-=-=-=-=--2.644
Token which
Token ID543
Feature activation+4.07e+00

Pos logit contributions
Maker+0.721
gently+0.714
breaks+0.691
 Decoder+0.666
yon+0.660
Neg logit contributions
unks-0.879
oppers-0.856
imir-0.828
arn-0.746
avis-0.727
Token player
Token ID2137
Feature activation+2.92e+00

Pos logit contributions
unks+2.946
oppers+2.862
imir+2.770
arn+2.536
avis+2.436
Neg logit contributions
gently-2.580
Maker-2.567
breaks-2.463
yon-2.374
ahime-2.350
Token out
Token ID503
Feature activation+6.92e+00

Pos logit contributions
unks+2.064
oppers+1.996
imir+1.935
arn+1.752
avis+1.669
Neg logit contributions
Maker-1.939
gently-1.927
breaks-1.844
yon-1.798
 Decoder-1.787
Token-
Token ID12
Feature activation+3.40e+00

Pos logit contributions
unks+4.478
oppers+4.285
imir+4.160
arn+3.860
avis+3.707
Neg logit contributions
Maker-4.803
gently-4.783
 Decoder-4.550
ona-4.447
ahime-4.435
Tokenper
Token ID525
Feature activation+1.65e+01

Pos logit contributions
unks+2.323
oppers+2.253
imir+2.174
arn+1.996
avis+1.914
Neg logit contributions
Maker-2.292
gently-2.266
 Decoder-2.170
breaks-2.168
yon-2.107
Tokenforms
Token ID23914
Feature activation-1.74e+00

Pos logit contributions
unks+7.431
oppers+7.328
imir+6.966
arn+6.273
ores+5.844
Neg logit contributions
Maker-13.181
 Decoder-12.833
gently-12.360
ahime-12.133
yon-12.029
Token the
Token ID262
Feature activation+4.81e+00

Pos logit contributions
Maker+1.047
gently+1.038
breaks+1.004
 Decoder+0.966
yon+0.958
Neg logit contributions
unks-1.340
oppers-1.306
imir-1.260
arn-1.135
avis-1.112
Token other
Token ID584
Feature activation+6.31e+00

Pos logit contributions
unks+3.182
oppers+3.114
imir+2.974
arn+2.717
avis+2.578
Neg logit contributions
gently-3.319
Maker-3.304
breaks-3.195
yon-3.115
ahime-3.094
Token.
Token ID13
Feature activation-4.98e-01

Pos logit contributions
unks+4.261
oppers+4.071
imir+3.971
arn+3.690
avis+3.465
Neg logit contributions
gently-4.263
Maker-4.181
breaks-3.992
yon-3.947
ahime-3.942
Token Norman
Token ID19636
Feature activation-3.22e+00

Pos logit contributions
Maker+0.306
gently+0.303
breaks+0.293
 Decoder+0.282
yon+0.281
Neg logit contributions
unks-0.378
oppers-0.369
imir-0.357
arn-0.322
avis-0.313
TokenPro
Token ID2964
Feature activation+6.28e-01

Pos logit contributions
Maker+1.202
gently+1.176
breaks+1.132
 Decoder+1.079
yon+1.071
Neg logit contributions
unks-1.843
oppers-1.803
imir-1.751
arn-1.590
avis-1.553
Token Marshal
Token ID37899
Feature activation+3.76e+00

Pos logit contributions
unks+0.482
oppers+0.470
imir+0.456
arn+0.417
avis+0.402
Neg logit contributions
gently-0.378
Maker-0.375
breaks-0.362
 Decoder-0.346
ahime-0.344
Token Y
Token ID575
Feature activation+4.80e+00

Pos logit contributions
unks+2.818
oppers+2.781
imir+2.724
arn+2.477
avis+2.374
Neg logit contributions
gently-2.301
Maker-2.283
breaks-2.224
ahime-2.135
 Decoder-2.124
Tokenanda
Token ID5282
Feature activation-1.93e+00

Pos logit contributions
unks+1.794
oppers+1.732
imir+1.676
arn+1.413
avis+1.238
Neg logit contributions
Maker-4.776
gently-4.761
breaks-4.747
ahime-4.531
 Decoder-4.521
Token
Token ID851
Feature activation-5.98e-01

Pos logit contributions
Maker+1.270
gently+1.253
breaks+1.217
yon+1.175
 Decoder+1.175
Neg logit contributions
unks-1.373
oppers-1.336
imir-1.281
arn-1.145
avis-1.114
Token per
Token ID583
Feature activation+1.64e+01

Pos logit contributions
Maker+0.374
gently+0.372
breaks+0.360
 Decoder+0.345
yon+0.344
Neg logit contributions
unks-0.446
oppers-0.435
imir-0.420
arn-0.380
avis-0.368
Token Pro
Token ID1041
Feature activation-1.82e+00

Pos logit contributions
unks+10.485
oppers+10.083
imir+9.583
arn+9.317
avis+8.713
Neg logit contributions
Maker-9.710
gently-9.476
yon-9.191
breaks-9.146
ahime-9.081
Token Football
Token ID9957
Feature activation+3.99e+00

Pos logit contributions
Maker+1.027
gently+0.987
breaks+0.975
 Decoder+0.938
yon+0.914
Neg logit contributions
unks-1.485
oppers-1.452
imir-1.414
arn-1.279
avis-1.245
Token Focus
Token ID17061
Feature activation+1.53e+00

Pos logit contributions
unks+2.070
oppers+1.962
imir+1.838
arn+1.676
avis+1.513
Neg logit contributions
gently-3.320
Maker-3.281
breaks-3.241
yon-3.222
ahime-3.204
Token.
Token ID13
Feature activation+5.34e+00

Pos logit contributions
unks+1.116
oppers+1.084
imir+1.048
arn+0.948
avis+0.915
Neg logit contributions
Maker-0.985
gently-0.983
breaks-0.943
yon-0.906
 Decoder-0.903
Token
Token ID198
Feature activation+1.18e+00

Pos logit contributions
unks+3.693
oppers+3.541
imir+3.525
arn+3.147
avis+3.079
Neg logit contributions
gently-3.544
Maker-3.434
breaks-3.339
ahime-3.238
yon-3.205
Token voters
Token ID4446
Feature activation+5.89e-01

Pos logit contributions
Maker+1.241
gently+1.226
breaks+1.181
 Decoder+1.139
yon+1.132
Neg logit contributions
unks-1.677
oppers-1.640
imir-1.587
arn-1.442
avis-1.410
Token that
Token ID326
Feature activation-6.91e-01

Pos logit contributions
unks+0.423
oppers+0.413
imir+0.399
arn+0.360
avis+0.348
Neg logit contributions
Maker-0.384
gently-0.382
breaks-0.370
 Decoder-0.355
yon-0.355
Token they
Token ID484
Feature activation+2.95e+00

Pos logit contributions
Maker+0.432
gently+0.429
breaks+0.415
 Decoder+0.398
yon+0.397
Neg logit contributions
unks-0.515
oppers-0.503
imir-0.486
arn-0.438
avis-0.427
Token will
Token ID481
Feature activation-1.62e-01

Pos logit contributions
unks+2.054
oppers+2.007
imir+1.950
arn+1.751
avis+1.682
Neg logit contributions
Maker-1.979
gently-1.946
breaks-1.898
 Decoder-1.819
yon-1.808
Token not
Token ID407
Feature activation-4.71e+00

Pos logit contributions
Maker+0.105
gently+0.105
breaks+0.101
 Decoder+0.097
yon+0.097
Neg logit contributions
unks-0.117
oppers-0.114
imir-0.110
arn-0.099
avis-0.096
Token dev
Token ID1614
Feature activation+1.61e+01

Pos logit contributions
gently+2.843
Maker+2.787
breaks+2.711
ona+2.622
yon+2.620
Neg logit contributions
unks-3.506
oppers-3.477
imir-3.310
arn-2.939
=-=-=-=-=-=-=-=--2.899
Tokeniate
Token ID9386
Feature activation+9.04e-01

Pos logit contributions
unks+6.065
oppers+5.556
imir+5.014
arn+4.969
avis+4.479
Neg logit contributions
Maker-14.213
breaks-13.859
gently-13.854
 Decoder-13.768
WAYS-13.284
Token an
Token ID281
Feature activation-2.89e+00

Pos logit contributions
unks+0.632
oppers+0.616
imir+0.593
arn+0.533
avis+0.515
Neg logit contributions
Maker-0.608
gently-0.604
breaks-0.585
 Decoder-0.563
yon-0.563
Token inch
Token ID11111
Feature activation-1.28e-01

Pos logit contributions
Maker+1.808
gently+1.804
breaks+1.740
 Decoder+1.681
ona+1.661
Neg logit contributions
unks-2.128
oppers-2.082
imir-2.009
arn-1.808
avis-1.738
Token from
Token ID422
Feature activation+4.33e-01

Pos logit contributions
Maker+0.085
gently+0.084
breaks+0.082
 Decoder+0.078
yon+0.078
Neg logit contributions
unks-0.091
oppers-0.089
imir-0.086
arn-0.077
avis-0.074
Token the
Token ID262
Feature activation+4.43e+00

Pos logit contributions
unks+0.327
oppers+0.319
imir+0.308
arn+0.279
avis+0.270
Neg logit contributions
Maker-0.267
gently-0.265
breaks-0.256
 Decoder-0.245
yon-0.245
Token954
Token ID48372
Feature activation+2.48e+00

Pos logit contributions
unks+4.029
oppers+3.905
imir+3.746
arn+3.491
avis+3.250
Neg logit contributions
Maker-5.404
gently-5.249
ahime-5.129
breaks-5.120
 Decoder-5.092
Token-
Token ID12
Feature activation+3.86e+00

Pos logit contributions
unks+1.703
oppers+1.650
imir+1.605
arn+1.450
avis+1.390
Neg logit contributions
gently-1.686
Maker-1.659
breaks-1.619
yon-1.561
 Decoder-1.560
Token12
Token ID1065
Feature activation+6.87e+00

Pos logit contributions
unks+2.469
oppers+2.412
imir+2.337
arn+2.093
=-=-=-=-=-=-=-=-+1.992
Neg logit contributions
gently-2.771
Maker-2.730
breaks-2.653
ahime-2.588
 Decoder-2.569
Token76
Token ID4304
Feature activation-2.33e+00

Pos logit contributions
unks+3.930
oppers+3.804
imir+3.649
arn+3.375
=-=-=-=-=-=-=-=-+3.146
Neg logit contributions
gently-5.214
ahime-5.076
Maker-5.029
��-4.996
breaks-4.915
Token or
Token ID393
Feature activation+7.95e+00

Pos logit contributions
Maker+1.408
gently+1.392
breaks+1.346
 Decoder+1.297
ona+1.279
Neg logit contributions
unks-1.790
oppers-1.740
imir-1.684
arn-1.516
avis-1.477
Token ed
Token ID1225
Feature activation+1.60e+01

Pos logit contributions
imir+5.477
unks+5.417
oppers+5.344
avis+4.789
arn+4.734
Neg logit contributions
Maker-5.052
breaks-4.842
gently-4.829
yon-4.637
roller-4.480
Tokenraper
Token ID38545
Feature activation+8.00e+00

Pos logit contributions
imir+7.056
avis+6.599
unks+6.446
oppers+6.169
arn+5.461
Neg logit contributions
 Committees-13.864
 Decoder-13.370
Maker-13.304
 Khe-13.298
 Xander-13.259
Token@
Token ID31
Feature activation+7.23e-01

Pos logit contributions
unks+4.968
imir+4.849
oppers+4.846
arn+4.285
avis+4.098
Neg logit contributions
Maker-5.703
gently-5.573
breaks-5.558
 Decoder-5.327
ahime-5.287
Tokenden
Token ID6559
Feature activation+4.99e+00

Pos logit contributions
unks+0.333
oppers+0.317
imir+0.303
arn+0.253
avis+0.238
Neg logit contributions
Maker-0.659
gently-0.658
breaks-0.640
 Decoder-0.624
yon-0.620
Tokenver
Token ID332
Feature activation-1.65e-01

Pos logit contributions
unks+2.370
oppers+2.310
imir+2.247
arn+1.917
avis+1.802
Neg logit contributions
gently-4.420
Maker-4.413
breaks-4.275
 Decoder-4.232
ahime-4.163
Tokenpost
Token ID7353
Feature activation-4.76e+00

Pos logit contributions
gently+0.090
Maker+0.090
breaks+0.086
yon+0.082
 Decoder+0.082
Neg logit contributions
unks-0.136
oppers-0.133
imir-0.129
arn-0.117
avis-0.115
TokenAmount
Token ID31264
Feature activation+6.85e+00

Pos logit contributions
unks+1.681
oppers+1.634
imir+1.588
arn+1.452
=-=-=-=-=-=-=-=-+1.402
Neg logit contributions
gently-1.382
Maker-1.338
breaks-1.317
ahime-1.263
yon-1.255
Token;
Token ID26
Feature activation-7.10e-01

Pos logit contributions
unks+4.456
oppers+4.217
imir+4.136
arn+3.736
=-=-=-=-=-=-=-=-+3.626
Neg logit contributions
gently-4.762
Maker-4.678
breaks-4.515
 Decoder-4.411
ahime-4.411
Token }
Token ID1782
Feature activation+4.71e+00

Pos logit contributions
Maker+0.434
gently+0.431
breaks+0.417
 Decoder+0.402
yon+0.399
Neg logit contributions
unks-0.539
oppers-0.528
imir-0.509
arn-0.458
avis-0.447
Token 1
Token ID352
Feature activation+4.89e+00

Pos logit contributions
unks+3.394
oppers+3.280
imir+3.206
arn+2.922
=-=-=-=-=-=-=-=-+2.797
Neg logit contributions
gently-3.000
Maker-2.991
breaks-2.890
ahime-2.765
roller-2.747
Token 2
Token ID362
Feature activation+1.48e+01

Pos logit contributions
unks+3.296
oppers+3.169
imir+3.113
arn+2.765
=-=-=-=-=-=-=-=-+2.612
Neg logit contributions
Maker-3.366
gently-3.343
breaks-3.228
yon-3.094
ahime-3.074
Token 3
Token ID513
Feature activation+1.60e+01

Pos logit contributions
unks+7.499
imir+7.122
oppers+6.914
arn+6.283
 Edge+5.575
Neg logit contributions
ahime-11.102
gently-10.959
breaks-10.949
Maker-10.835
ona-10.363
Token 4
Token ID604
Feature activation+9.94e+00

Pos logit contributions
unks+7.518
oppers+6.977
imir+6.895
arn+5.386
ores+5.268
Neg logit contributions
Maker-12.360
breaks-12.220
gently-12.035
ahime-12.031
ona-11.457
Token 5
Token ID642
Feature activation+7.41e+00

Pos logit contributions
unks+5.397
oppers+5.125
imir+4.928
arn+4.111
=-=-=-=-=-=-=-=-+3.904
Neg logit contributions
Maker-7.863
gently-7.758
breaks-7.596
ahime-7.579
 Decoder-7.248
Token 6
Token ID718
Feature activation+6.77e+00

Pos logit contributions
unks+5.241
oppers+5.100
imir+4.931
arn+4.341
avis+4.239
Neg logit contributions
Maker-4.732
gently-4.682
breaks-4.637
ahime-4.347
 Decoder-4.302
Token 7
Token ID767
Feature activation+7.84e+00

Pos logit contributions
unks+4.830
oppers+4.667
imir+4.607
arn+4.109
=-=-=-=-=-=-=-=-+3.935
Neg logit contributions
gently-4.337
Maker-4.279
breaks-4.194
ahime-3.985
 Decoder-3.952
Token 8
Token ID807
Feature activation+4.85e+00

Pos logit contributions
unks+5.421
imir+5.197
oppers+5.119
arn+4.541
avis+4.339
Neg logit contributions
gently-5.215
Maker-5.050
breaks-5.005
ahime-4.722
yon-4.622
Token targets
Token ID6670
Feature activation+1.32e+00

Pos logit contributions
unks+1.128
oppers+1.106
imir+1.068
arn+0.990
avis+0.963
Neg logit contributions
Maker-0.628
gently-0.620
breaks-0.594
 Decoder-0.567
yon-0.561
Token for
Token ID329
Feature activation-1.95e+00

Pos logit contributions
unks+0.945
oppers+0.923
imir+0.892
arn+0.807
avis+0.781
Neg logit contributions
Maker-0.865
gently-0.860
breaks-0.832
 Decoder-0.801
yon-0.799
Token 23
Token ID2242
Feature activation-1.06e+00

Pos logit contributions
Maker+1.114
gently+1.111
breaks+1.072
 Decoder+1.022
yon+1.013
Neg logit contributions
unks-1.562
oppers-1.520
imir-1.472
arn-1.334
avis-1.303
Token yards
Token ID5695
Feature activation-8.36e+00

Pos logit contributions
Maker+0.799
gently+0.797
breaks+0.775
yon+0.749
 Decoder+0.748
Neg logit contributions
unks-0.652
oppers-0.628
imir-0.607
arn-0.528
avis-0.512
Token,
Token ID11
Feature activation-4.09e+00

Pos logit contributions
gently+4.215
Maker+4.187
breaks+4.106
 Decoder+3.935
ona+3.928
Neg logit contributions
unks-6.868
oppers-6.673
imir-6.524
=-=-=-=-=-=-=-=--5.871
arn-5.730
Token per
Token ID583
Feature activation+1.57e+01

Pos logit contributions
gently+2.407
Maker+2.399
breaks+2.314
 Decoder+2.226
yon+2.196
Neg logit contributions
unks-3.185
oppers-3.100
imir-2.980
arn-2.680
=-=-=-=-=-=-=-=--2.645
Token Next
Token ID7406
Feature activation+6.94e+00

Pos logit contributions
oppers+10.060
unks+9.949
imir+9.387
arn+8.807
avis+8.626
Neg logit contributions
Maker-9.682
gently-9.214
ahime-9.020
breaks-8.933
yon-8.573
Token Gen
Token ID5215
Feature activation+3.53e+00

Pos logit contributions
unks+4.683
oppers+4.581
imir+4.322
arn+3.986
avis+3.898
Neg logit contributions
gently-4.666
Maker-4.551
breaks-4.378
ahime-4.248
yon-4.197
Token Stats
Token ID20595
Feature activation+8.79e-01

Pos logit contributions
unks+2.415
oppers+2.350
imir+2.254
arn+2.052
avis+1.975
Neg logit contributions
gently-2.402
Maker-2.402
breaks-2.329
yon-2.221
ahime-2.211
Token.
Token ID13
Feature activation+5.02e-01

Pos logit contributions
unks+0.650
oppers+0.634
imir+0.611
arn+0.553
avis+0.536
Neg logit contributions
Maker-0.556
gently-0.555
breaks-0.535
yon-0.511
 Decoder-0.510
Token Sherman
Token ID23198
Feature activation-3.65e+00

Pos logit contributions
unks+0.382
oppers+0.372
imir+0.361
arn+0.327
avis+0.317
Neg logit contributions
Maker-0.307
gently-0.306
breaks-0.295
yon-0.282
 Decoder-0.281
Token.
Token ID13
Feature activation+4.86e-01

Pos logit contributions
unks+2.579
oppers+2.508
imir+2.399
arn+2.221
avis+2.113
Neg logit contributions
Maker-2.557
gently-2.543
breaks-2.447
 Decoder-2.358
yon-2.333
Token
Token ID198
Feature activation+1.77e+00

Pos logit contributions
unks+0.368
oppers+0.359
imir+0.347
arn+0.314
avis+0.305
Neg logit contributions
Maker-0.298
gently-0.297
breaks-0.286
 Decoder-0.273
yon-0.273
Token
Token ID198
Feature activation+2.59e+00

Pos logit contributions
unks+1.306
oppers+1.263
imir+1.229
arn+1.118
avis+1.078
Neg logit contributions
gently-1.110
Maker-1.106
breaks-1.066
 Decoder-1.027
ahime-1.026
Token.
Token ID13
Feature activation+3.57e+00

Pos logit contributions
unks+1.743
oppers+1.690
imir+1.633
arn+1.478
avis+1.409
Neg logit contributions
gently-1.778
Maker-1.758
breaks-1.716
 Decoder-1.677
ahime-1.668
Token@
Token ID31
Feature activation-3.29e+00

Pos logit contributions
unks+2.544
oppers+2.486
imir+2.397
arn+2.176
avis+2.104
Neg logit contributions
Maker-2.320
gently-2.312
breaks-2.238
 Decoder-2.142
ahime-2.134
Tokendev
Token ID7959
Feature activation+1.54e+01

Pos logit contributions
Maker+2.371
gently+2.340
breaks+2.282
yon+2.237
ona+2.194
Neg logit contributions
unks-2.133
oppers-2.084
imir-1.995
arn-1.738
=-=-=-=-=-=-=-=--1.704
Tokenon
Token ID261
Feature activation+5.68e+00

Pos logit contributions
unks+10.763
oppers+10.551
imir+9.875
arn+9.568
avis+9.209
Neg logit contributions
gently-8.578
Maker-8.513
 Decoder-8.485
 Wim-7.994
ahime-7.990
Tokenms
Token ID907
Feature activation+2.41e+00

Pos logit contributions
unks+4.164
oppers+4.023
imir+3.983
arn+3.660
avis+3.516
Neg logit contributions
Maker-3.515
gently-3.465
breaks-3.323
 Decoder-3.268
ahime-3.170
Tokenayers
Token ID6962
Feature activation+2.64e+00

Pos logit contributions
unks+1.663
oppers+1.615
imir+1.570
arn+1.410
avis+1.360
Neg logit contributions
Maker-1.649
gently-1.633
breaks-1.586
 Decoder-1.522
yon-1.502
Token reports
Token ID3136
Feature activation+4.83e+00

Pos logit contributions
unks+1.903
oppers+1.850
imir+1.792
arn+1.620
avis+1.566
Neg logit contributions
Maker-1.706
gently-1.693
breaks-1.632
 Decoder-1.564
ahime-1.560
Token that
Token ID326
Feature activation-7.24e-01

Pos logit contributions
unks+3.332
oppers+3.251
imir+3.125
arn+2.796
avis+2.738
Neg logit contributions
Maker-3.236
gently-3.232
breaks-3.114
ahime-2.985
roller-2.936
Token in
Token ID287
Feature activation+1.51e+00

Pos logit contributions
unks+1.322
oppers+1.282
imir+1.239
arn+1.129
avis+1.081
Neg logit contributions
Maker-1.241
gently-1.227
breaks-1.186
 Decoder-1.153
yon-1.145
Token DV
Token ID29854
Feature activation+1.01e+01

Pos logit contributions
unks+1.128
oppers+1.101
imir+1.061
arn+0.967
avis+0.935
Neg logit contributions
Maker-0.937
gently-0.932
breaks-0.897
yon-0.859
 Decoder-0.859
TokenOA
Token ID23621
Feature activation+4.95e+00

Pos logit contributions
unks+5.000
oppers+4.932
arn+4.383
imir+4.317
ores+3.942
Neg logit contributions
gently-8.190
ahime-8.040
yon-7.951
breaks-7.915
anamo-7.894
Token (
Token ID357
Feature activation+8.51e+00

Pos logit contributions
unks+3.311
oppers+3.222
imir+3.123
arn+2.812
avis+2.722
Neg logit contributions
gently-3.413
Maker-3.379
breaks-3.235
yon-3.181
 Decoder-3.129
Tokenvalue
Token ID8367
Feature activation+9.00e+00

Pos logit contributions
unks+5.705
oppers+5.613
imir+5.330
arn+5.068
avis+4.861
Neg logit contributions
Maker-5.377
gently-5.372
 Decoder-5.204
ahime-5.108
breaks-5.059
Token per
Token ID583
Feature activation+1.54e+01

Pos logit contributions
unks+5.807
oppers+5.712
imir+5.658
arn+5.073
avis+5.000
Neg logit contributions
gently-6.088
Maker-5.949
breaks-5.714
ahime-5.539
yon-5.501
Token play
Token ID711
Feature activation+1.99e+00

Pos logit contributions
unks+8.867
oppers+8.835
imir+8.182
arn+7.962
avis+7.482
Neg logit contributions
Maker-10.164
gently-10.088
breaks-9.658
anamo-9.486
yon-9.455
Token),
Token ID828
Feature activation+7.13e+00

Pos logit contributions
unks+1.447
oppers+1.409
imir+1.363
arn+1.232
avis+1.195
Neg logit contributions
Maker-1.280
gently-1.279
breaks-1.230
 Decoder-1.182
yon-1.182
Token and
Token ID290
Feature activation+6.93e+00

Pos logit contributions
unks+4.813
oppers+4.664
imir+4.567
arn+4.207
avis+3.966
Neg logit contributions
Maker-4.714
gently-4.581
breaks-4.435
 Decoder-4.299
roller-4.285
Token fourth
Token ID5544
Feature activation+4.01e+00

Pos logit contributions
unks+4.977
oppers+4.831
imir+4.749
arn+4.405
 merits+4.149
Neg logit contributions
Maker-4.238
gently-4.127
breaks-3.992
ahime-3.913
 Decoder-3.875
Token in
Token ID287
Feature activation+3.20e+00

Pos logit contributions
unks+2.668
oppers+2.576
imir+2.494
arn+2.302
avis+2.167
Neg logit contributions
Maker-2.797
gently-2.738
breaks-2.661
 Decoder-2.604
ahime-2.582
Token r
Token ID374
Feature activation+6.24e+00

Pos logit contributions
Maker+1.185
gently+1.172
breaks+1.143
 Decoder+1.102
yon+1.098
Neg logit contributions
unks-1.141
oppers-1.115
imir-1.076
arn-0.950
avis-0.929
Tokenags
Token ID3775
Feature activation-8.04e-01

Pos logit contributions
unks+4.137
oppers+3.930
imir+3.801
arn+3.484
avis+3.300
Neg logit contributions
Maker-4.358
gently-4.324
breaks-4.231
 Decoder-4.152
ahime-4.047
Token-
Token ID12
Feature activation-6.90e-01

Pos logit contributions
Maker+0.519
gently+0.516
breaks+0.499
yon+0.479
 Decoder+0.478
Neg logit contributions
unks-0.583
oppers-0.570
imir-0.549
arn-0.494
avis-0.479
Tokento
Token ID1462
Feature activation+6.81e-01

Pos logit contributions
Maker+0.528
gently+0.527
breaks+0.512
yon+0.494
 Decoder+0.493
Neg logit contributions
unks-0.418
oppers-0.405
imir-0.388
arn-0.340
avis-0.327
Token-
Token ID12
Feature activation+1.26e+00

Pos logit contributions
unks+0.489
oppers+0.477
imir+0.460
arn+0.414
avis+0.401
Neg logit contributions
Maker-0.446
gently-0.443
breaks-0.429
 Decoder-0.412
yon-0.411
Tokenric
Token ID1173
Feature activation+1.50e+01

Pos logit contributions
unks+0.565
oppers+0.542
imir+0.511
arn+0.427
avis+0.402
Neg logit contributions
Maker-1.166
gently-1.159
breaks-1.134
 Decoder-1.103
yon-1.101
Tokenhes
Token ID956
Feature activation+2.43e+00

Pos logit contributions
unks+3.176
oppers+2.626
arn+2.381
imir+2.257
inia+1.750
Neg logit contributions
gently-16.637
Maker-16.614
 Decoder-16.447
ahime-16.082
��-15.792
Token story
Token ID1621
Feature activation+1.62e+00

Pos logit contributions
unks+1.932
oppers+1.886
imir+1.834
arn+1.690
avis+1.615
Neg logit contributions
gently-1.380
Maker-1.380
breaks-1.320
 Decoder-1.253
yon-1.249
Token"
Token ID1
Feature activation+3.28e-01

Pos logit contributions
unks+1.128
oppers+1.094
imir+1.062
arn+0.950
avis+0.914
Neg logit contributions
Maker-1.090
gently-1.089
breaks-1.049
 Decoder-1.015
yon-1.014
Token<|endoftext|>
Token ID50256
Feature activation-8.99e-01

Pos logit contributions
unks+0.251
oppers+0.245
imir+0.237
arn+0.215
avis+0.208
Neg logit contributions
Maker-0.199
gently-0.198
breaks-0.191
 Decoder-0.183
yon-0.183
Token(
Token ID7
Feature activation+4.83e+00

Pos logit contributions
Maker+0.597
gently+0.591
breaks+0.572
yon+0.549
 Decoder+0.547
Neg logit contributions
unks-0.639
oppers-0.624
imir-0.601
arn-0.538
avis-0.522
Tokenh
Token ID71
Feature activation+1.40e+00

Pos logit contributions
unks+2.901
oppers+2.839
imir+2.719
arn+2.483
avis+2.368
Neg logit contributions
gently-2.734
Maker-2.698
breaks-2.604
 Decoder-2.522
yon-2.499
Token entry
Token ID5726
Feature activation-1.30e-01

Pos logit contributions
unks+1.018
oppers+0.993
imir+0.955
arn+0.865
avis+0.834
Neg logit contributions
Maker-0.903
gently-0.900
breaks-0.870
 Decoder-0.835
yon-0.832
Token in
Token ID287
Feature activation+2.66e+00

Pos logit contributions
Maker+0.092
gently+0.091
breaks+0.088
 Decoder+0.085
yon+0.085
Neg logit contributions
unks-0.087
oppers-0.085
imir-0.082
arn-0.073
avis-0.070
Token `
Token ID4600
Feature activation+4.26e+00

Pos logit contributions
unks+1.928
oppers+1.882
imir+1.812
arn+1.640
avis+1.582
Neg logit contributions
gently-1.699
Maker-1.692
breaks-1.629
yon-1.565
roller-1.545
Tokenser
Token ID2655
Feature activation+8.94e+00

Pos logit contributions
unks+2.938
oppers+2.870
imir+2.749
arn+2.524
avis+2.410
Neg logit contributions
gently-2.799
Maker-2.777
breaks-2.685
 Decoder-2.608
yon-2.605
Tokenach
Token ID620
Feature activation+1.48e+01

Pos logit contributions
unks+5.090
oppers+4.771
imir+4.586
arn+4.216
avis+4.090
Neg logit contributions
Maker-6.780
gently-6.620
breaks-6.589
 Decoder-6.445
yon-6.375
TokenSpace
Token ID14106
Feature activation+6.77e+00

Pos logit contributions
unks+9.808
oppers+9.508
imir+8.872
arn+8.498
avis+7.916
Neg logit contributions
gently-9.010
Maker-8.148
breaks-8.132
ahime-8.050
 Xander-7.989
Token`,
Token ID47671
Feature activation+1.93e+00

Pos logit contributions
unks+4.744
oppers+4.616
imir+4.449
arn+4.077
avis+3.873
Neg logit contributions
gently-4.361
Maker-4.135
breaks-4.057
yon-3.974
ahime-3.931
Token find
Token ID1064
Feature activation+6.56e+00

Pos logit contributions
unks+1.396
oppers+1.360
imir+1.313
arn+1.188
avis+1.142
Neg logit contributions
Maker-1.249
gently-1.244
breaks-1.200
yon-1.157
 Decoder-1.144
Token its
Token ID663
Feature activation+3.88e+00

Pos logit contributions
unks+4.575
oppers+4.471
imir+4.244
arn+3.889
avis+3.737
Neg logit contributions
gently-4.190
Maker-4.088
breaks-4.001
roller-3.905
yon-3.893
Token N
Token ID399
Feature activation-2.89e+00

Pos logit contributions
unks+2.886
oppers+2.799
imir+2.678
arn+2.444
avis+2.336
Neg logit contributions
Maker-2.416
gently-2.413
breaks-2.319
yon-2.242
roller-2.212
Tokenulse
Token ID9615
Feature activation+2.24e+00

Pos logit contributions
unks+5.594
oppers+5.457
imir+5.344
arn+4.736
avis+4.590
Neg logit contributions
gently-5.482
Maker-5.342
breaks-5.318
ahime-5.139
 Xander-5.093
TokenAmount
Token ID31264
Feature activation+6.85e+00

Pos logit contributions
unks+1.681
oppers+1.634
imir+1.588
arn+1.452
=-=-=-=-=-=-=-=-+1.402
Neg logit contributions
gently-1.382
Maker-1.338
breaks-1.317
ahime-1.263
yon-1.255
Token;
Token ID26
Feature activation-7.10e-01

Pos logit contributions
unks+4.456
oppers+4.217
imir+4.136
arn+3.736
=-=-=-=-=-=-=-=-+3.626
Neg logit contributions
gently-4.762
Maker-4.678
breaks-4.515
 Decoder-4.411
ahime-4.411
Token }
Token ID1782
Feature activation+4.71e+00

Pos logit contributions
Maker+0.434
gently+0.431
breaks+0.417
 Decoder+0.402
yon+0.399
Neg logit contributions
unks-0.539
oppers-0.528
imir-0.509
arn-0.458
avis-0.447
Token 1
Token ID352
Feature activation+4.89e+00

Pos logit contributions
unks+3.394
oppers+3.280
imir+3.206
arn+2.922
=-=-=-=-=-=-=-=-+2.797
Neg logit contributions
gently-3.000
Maker-2.991
breaks-2.890
ahime-2.765
roller-2.747
Token 2
Token ID362
Feature activation+1.48e+01

Pos logit contributions
unks+3.296
oppers+3.169
imir+3.113
arn+2.765
=-=-=-=-=-=-=-=-+2.612
Neg logit contributions
Maker-3.366
gently-3.343
breaks-3.228
yon-3.094
ahime-3.074
Token 3
Token ID513
Feature activation+1.60e+01

Pos logit contributions
unks+7.499
imir+7.122
oppers+6.914
arn+6.283
 Edge+5.575
Neg logit contributions
ahime-11.102
gently-10.959
breaks-10.949
Maker-10.835
ona-10.363
Token 4
Token ID604
Feature activation+9.94e+00

Pos logit contributions
unks+7.518
oppers+6.977
imir+6.895
arn+5.386
ores+5.268
Neg logit contributions
Maker-12.360
breaks-12.220
gently-12.035
ahime-12.031
ona-11.457
Token 5
Token ID642
Feature activation+7.41e+00

Pos logit contributions
unks+5.397
oppers+5.125
imir+4.928
arn+4.111
=-=-=-=-=-=-=-=-+3.904
Neg logit contributions
Maker-7.863
gently-7.758
breaks-7.596
ahime-7.579
 Decoder-7.248
Token 6
Token ID718
Feature activation+6.77e+00

Pos logit contributions
unks+5.241
oppers+5.100
imir+4.931
arn+4.341
avis+4.239
Neg logit contributions
Maker-4.732
gently-4.682
breaks-4.637
ahime-4.347
 Decoder-4.302
Token 7
Token ID767
Feature activation+7.84e+00

Pos logit contributions
unks+4.830
oppers+4.667
imir+4.607
arn+4.109
=-=-=-=-=-=-=-=-+3.935
Neg logit contributions
gently-4.337
Maker-4.279
breaks-4.194
ahime-3.985
 Decoder-3.952
Tokenlement
Token ID1732
Feature activation+6.26e+00

Pos logit contributions
Maker+1.814
gently+1.802
breaks+1.773
yon+1.693
 Decoder+1.669
Neg logit contributions
unks-1.990
oppers-1.925
imir-1.880
arn-1.664
avis-1.618
Token
Token ID39073
Feature activation+8.37e+00

Pos logit contributions
unks+4.459
oppers+4.417
imir+4.217
arn+3.884
avis+3.776
Neg logit contributions
gently-3.934
Maker-3.907
breaks-3.693
 Decoder-3.614
roller-3.558
Tokenser
Token ID2655
Feature activation-1.24e-01

Pos logit contributions
unks+5.709
oppers+5.647
imir+5.351
arn+4.950
avis+4.885
Neg logit contributions
Maker-5.342
gently-5.188
breaks-4.977
 Decoder-4.956
ahime-4.869
Tokente
Token ID660
Feature activation+3.75e+00

Pos logit contributions
Maker+0.086
gently+0.085
breaks+0.083
 Decoder+0.080
yon+0.080
Neg logit contributions
unks-0.084
oppers-0.082
imir-0.079
arn-0.071
avis-0.068
Tokenur
Token ID333
Feature activation+3.12e+00

Pos logit contributions
unks+2.653
oppers+2.595
imir+2.493
arn+2.269
avis+2.197
Neg logit contributions
Maker-2.447
gently-2.439
breaks-2.348
 Decoder-2.279
yon-2.237
Token du
Token ID7043
Feature activation+1.48e+01

Pos logit contributions
unks+2.301
oppers+2.253
imir+2.166
arn+1.972
avis+1.918
Neg logit contributions
Maker-1.953
gently-1.943
breaks-1.869
 Decoder-1.799
yon-1.786
Token programme
Token ID11383
Feature activation+5.35e+00

Pos logit contributions
oppers+9.796
unks+9.577
imir+9.344
avis+8.712
arn+8.536
Neg logit contributions
gently-8.359
Maker-8.106
breaks-8.041
roller-7.520
ahime-7.503
Token C
Token ID327
Feature activation+4.29e+00

Pos logit contributions
unks+3.799
oppers+3.747
imir+3.631
arn+3.313
avis+3.222
Neg logit contributions
gently-3.378
Maker-3.307
breaks-3.221
yon-3.052
 Decoder-3.050
Token.
Token ID13
Feature activation+6.94e-01

Pos logit contributions
unks+2.925
oppers+2.822
imir+2.775
arn+2.508
avis+2.399
Neg logit contributions
gently-2.936
Maker-2.869
breaks-2.798
ahime-2.715
 Decoder-2.686
TokenO
Token ID46
Feature activation+1.66e+00

Pos logit contributions
unks+0.406
oppers+0.394
imir+0.376
arn+0.330
avis+0.316
Neg logit contributions
gently-0.544
Maker-0.544
breaks-0.529
 Decoder-0.511
yon-0.511
Token.
Token ID13
Feature activation+2.05e+00

Pos logit contributions
unks+1.192
oppers+1.156
imir+1.123
arn+1.017
avis+0.981
Neg logit contributions
gently-1.075
Maker-1.068
breaks-1.033
 Decoder-0.987
ahime-0.986
Token version
Token ID2196
Feature activation+4.14e+00

Pos logit contributions
unks+2.424
imir+2.267
oppers+2.190
arn+1.769
=-=-=-=-=-=-=-=-+1.623
Neg logit contributions
gently-5.675
Maker-5.604
breaks-5.563
ahime-5.417
roller-5.348
Token of
Token ID286
Feature activation+6.77e+00

Pos logit contributions
unks+2.666
oppers+2.544
imir+2.514
arn+2.230
avis+2.149
Neg logit contributions
gently-2.941
Maker-2.887
breaks-2.799
ahime-2.784
 Decoder-2.690
Token the
Token ID262
Feature activation+3.35e+00

Pos logit contributions
unks+4.168
imir+3.988
oppers+3.938
arn+3.514
=-=-=-=-=-=-=-=-+3.367
Neg logit contributions
gently-4.814
Maker-4.723
ahime-4.717
breaks-4.634
yon-4.449
Token Rift
Token ID21222
Feature activation+7.80e+00

Pos logit contributions
unks+2.076
oppers+2.022
imir+1.951
arn+1.739
=-=-=-=-=-=-=-=-+1.647
Neg logit contributions
gently-2.472
Maker-2.443
breaks-2.385
ahime-2.312
yon-2.291
Token would
Token ID561
Feature activation+2.09e+00

Pos logit contributions
unks+4.298
imir+4.196
oppers+3.959
avis+3.463
arn+3.420
Neg logit contributions
gently-6.035
Maker-5.973
breaks-5.861
ahime-5.831
 Decoder-5.608
Token employ
Token ID1873
Feature activation+1.46e+01

Pos logit contributions
unks+1.360
oppers+1.317
imir+1.299
arn+1.140
avis+1.113
Neg logit contributions
Maker-1.504
gently-1.472
breaks-1.436
yon-1.392
 Decoder-1.383
Token such
Token ID884
Feature activation+7.22e+00

Pos logit contributions
unks+4.511
imir+4.402
oppers+3.809
arn+3.646
 merits+3.428
Neg logit contributions
Maker-13.764
ahime-13.361
roller-13.344
anamo-13.310
gently-13.232
Token a
Token ID257
Feature activation+2.27e-01

Pos logit contributions
unks+4.257
imir+4.052
oppers+3.981
arn+3.597
avis+3.500
Neg logit contributions
gently-5.336
Maker-5.189
ahime-5.031
breaks-5.028
roller-4.942
Token thing
Token ID1517
Feature activation+2.59e+00

Pos logit contributions
unks+0.209
oppers+0.205
imir+0.199
arn+0.184
avis+0.179
Neg logit contributions
Maker-0.103
gently-0.102
breaks-0.098
 Decoder-0.092
yon-0.092
Token (
Token ID357
Feature activation+1.80e+00

Pos logit contributions
unks+1.738
oppers+1.670
imir+1.623
arn+1.455
=-=-=-=-=-=-=-=-+1.396
Neg logit contributions
gently-1.800
Maker-1.785
breaks-1.734
ahime-1.668
yon-1.655
Tokenas
Token ID292
Feature activation+3.89e+00

Pos logit contributions
unks+0.740
oppers+0.698
imir+0.668
arn+0.560
avis+0.516
Neg logit contributions
Maker-1.723
gently-1.708
breaks-1.672
 Decoder-1.646
yon-1.626
Token For
Token ID1114
Feature activation-1.26e+00

Pos logit contributions
unks+0.080
oppers+0.078
imir+0.075
arn+0.068
avis+0.066
Neg logit contributions
Maker-0.066
gently-0.065
breaks-0.063
 Decoder-0.061
yon-0.061
Token example
Token ID1672
Feature activation+4.98e+00

Pos logit contributions
Maker+0.861
gently+0.857
breaks+0.832
yon+0.799
 Decoder+0.797
Neg logit contributions
unks-0.868
oppers-0.848
imir-0.813
arn-0.727
avis-0.703
Token,
Token ID11
Feature activation+2.94e+00

Pos logit contributions
unks+3.412
oppers+3.299
imir+3.259
arn+2.921
avis+2.789
Neg logit contributions
gently-3.301
Maker-3.298
breaks-3.200
 Decoder-3.068
roller-3.053
Token the
Token ID262
Feature activation+4.94e-01

Pos logit contributions
unks+2.196
oppers+2.133
imir+2.085
arn+1.885
avis+1.822
Neg logit contributions
gently-1.805
Maker-1.805
breaks-1.743
 Decoder-1.667
yon-1.654
Token Buddhist
Token ID20838
Feature activation+4.87e+00

Pos logit contributions
unks+0.409
oppers+0.399
imir+0.388
arn+0.354
avis+0.345
Neg logit contributions
Maker-0.269
gently-0.267
breaks-0.257
 Decoder-0.244
yon-0.243
Token monk
Token ID27266
Feature activation+1.44e+01

Pos logit contributions
unks+4.200
oppers+4.099
imir+4.027
arn+3.732
avis+3.597
Neg logit contributions
gently-2.390
Maker-2.355
breaks-2.294
 Decoder-2.150
roller-2.115
Token,
Token ID11
Feature activation+1.33e+01

Pos logit contributions
unks+9.995
imir+9.573
oppers+9.293
arn+8.728
avis+8.277
Neg logit contributions
gently-7.932
Maker-7.774
breaks-7.660
ahime-7.516
 Decoder-7.366
Token Hai
Token ID36876
Feature activation+6.62e+00

Pos logit contributions
unks+9.444
imir+9.103
oppers+8.893
arn+8.188
avis+7.918
Neg logit contributions
gently-7.280
Maker-7.186
breaks-7.133
ahime-6.886
roller-6.737
Tokeny
Token ID88
Feature activation+5.16e+00

Pos logit contributions
unks+4.833
oppers+4.628
imir+4.602
arn+4.192
avis+3.957
Neg logit contributions
Maker-4.110
gently-4.067
breaks-3.934
 Decoder-3.860
ahime-3.729
Tokenun
Token ID403
Feature activation+5.28e+00

Pos logit contributions
unks+3.718
oppers+3.584
imir+3.551
arn+3.228
avis+3.079
Neg logit contributions
Maker-3.345
gently-3.303
breaks-3.201
 Decoder-3.125
roller-2.985
Token,
Token ID11
Feature activation+4.38e+00

Pos logit contributions
unks+3.747
oppers+3.617
imir+3.554
arn+3.227
avis+3.062
Neg logit contributions
Maker-3.403
gently-3.358
breaks-3.291
 Decoder-3.191
roller-3.076
Token wrong
Token ID2642
Feature activation+2.81e+00

Pos logit contributions
Maker+0.764
gently+0.754
breaks+0.727
 Decoder+0.701
yon+0.697
Neg logit contributions
unks-0.933
oppers-0.912
imir-0.883
arn-0.796
avis-0.774
Token language
Token ID3303
Feature activation+3.93e+00

Pos logit contributions
unks+1.969
oppers+1.907
imir+1.839
arn+1.664
avis+1.598
Neg logit contributions
gently-1.856
Maker-1.850
breaks-1.796
yon-1.725
 Decoder-1.720
Token!)
Token ID8133
Feature activation-3.74e+00

Pos logit contributions
unks+2.733
oppers+2.649
imir+2.580
arn+2.334
avis+2.231
Neg logit contributions
gently-2.577
Maker-2.574
breaks-2.479
yon-2.428
 Decoder-2.392
Token to
Token ID284
Feature activation+8.80e+00

Pos logit contributions
Maker+2.262
gently+2.234
breaks+2.158
 Decoder+2.050
yon+2.049
Neg logit contributions
unks-2.875
oppers-2.796
imir-2.694
arn-2.437
=-=-=-=-=-=-=-=--2.384
Token becoming
Token ID5033
Feature activation+1.03e+01

Pos logit contributions
unks+5.676
oppers+5.533
imir+5.377
arn+4.925
avis+4.661
Neg logit contributions
gently-5.741
breaks-5.573
Maker-5.548
yon-5.444
ahime-5.422
Token a
Token ID257
Feature activation+1.43e+01

Pos logit contributions
unks+6.263
oppers+6.152
imir+5.896
arn+5.425
avis+5.169
Neg logit contributions
gently-7.040
breaks-6.733
Maker-6.692
ahime-6.521
yon-6.514
Token Junior
Token ID20000
Feature activation+6.26e+00

Pos logit contributions
unks+7.906
oppers+7.772
imir+7.255
arn+6.829
 merits+6.416
Neg logit contributions
gently-9.719
breaks-9.464
ahime-9.305
Maker-9.000
aundering-8.887
Token Developer
Token ID23836
Feature activation+5.29e+00

Pos logit contributions
unks+4.550
oppers+4.431
imir+4.291
arn+3.964
avis+3.760
Neg logit contributions
gently-3.887
breaks-3.760
Maker-3.702
ahime-3.550
yon-3.514
Token within
Token ID1626
Feature activation+1.06e+01

Pos logit contributions
unks+3.572
oppers+3.498
imir+3.346
arn+3.073
avis+2.888
Neg logit contributions
gently-3.543
Maker-3.434
breaks-3.419
yon-3.298
ahime-3.230
Token one
Token ID530
Feature activation+3.60e+00

Pos logit contributions
unks+6.605
oppers+6.414
imir+5.980
arn+5.575
avis+5.255
Neg logit contributions
gently-7.153
Maker-6.865
ahime-6.729
breaks-6.669
yon-6.653
Token year
Token ID614
Feature activation+6.87e-01

Pos logit contributions
unks+2.418
oppers+2.317
imir+2.247
arn+2.028
=-=-=-=-=-=-=-=-+1.912
Neg logit contributions
gently-2.480
Maker-2.469
breaks-2.392
 Decoder-2.298
yon-2.294
Token(
Token ID7
Feature activation+9.91e+00

Pos logit contributions
unks+4.537
oppers+4.419
imir+4.336
arn+3.943
avis+3.765
Neg logit contributions
gently-4.383
Maker-4.361
breaks-4.155
ahime-4.014
yon-3.996
Tokent
Token ID83
Feature activation+3.57e+00

Pos logit contributions
unks+6.714
oppers+6.476
imir+6.449
arn+5.824
avis+5.579
Neg logit contributions
Maker-6.205
gently-6.178
breaks-5.847
 Decoder-5.823
 Xander-5.716
Token)
Token ID8
Feature activation+7.30e+00

Pos logit contributions
unks+2.582
oppers+2.486
imir+2.437
arn+2.221
avis+2.127
Neg logit contributions
Maker-2.291
gently-2.287
breaks-2.199
 Decoder-2.108
yon-2.100
Token =
Token ID796
Feature activation+6.43e+00

Pos logit contributions
unks+5.269
oppers+5.132
imir+5.034
arn+4.615
avis+4.410
Neg logit contributions
gently-4.469
Maker-4.454
breaks-4.227
ahime-4.057
yon-4.049
Token exp
Token ID1033
Feature activation+9.95e+00

Pos logit contributions
unks+4.386
oppers+4.250
imir+4.181
arn+3.760
avis+3.606
Neg logit contributions
Maker-4.248
gently-4.242
breaks-4.079
ahime-3.922
roller-3.896
Token(-
Token ID32590
Feature activation+1.43e+01

Pos logit contributions
unks+5.435
oppers+5.315
imir+5.256
arn+4.698
avis+4.345
Neg logit contributions
gently-7.523
Maker-7.467
breaks-7.187
ahime-7.023
 Decoder-7.004
Token (
Token ID357
Feature activation+1.28e+01

Pos logit contributions
unks+8.327
oppers+8.058
imir+7.979
arn+7.119
avis+6.878
Neg logit contributions
gently-9.450
Maker-9.396
 Decoder-8.924
breaks-8.844
 Xander-8.840
Tokent
Token ID83
Feature activation+5.15e+00

Pos logit contributions
unks+7.913
oppers+7.632
imir+7.606
arn+6.789
avis+6.468
Neg logit contributions
Maker-8.309
gently-8.277
 Decoder-7.844
breaks-7.823
 Xander-7.738
Token-
Token ID12
Feature activation+7.61e+00

Pos logit contributions
unks+3.691
oppers+3.557
imir+3.491
arn+3.180
avis+3.039
Neg logit contributions
Maker-3.302
gently-3.300
breaks-3.146
 Decoder-3.039
yon-3.011
Tokent
Token ID83
Feature activation+7.64e+00

Pos logit contributions
unks+5.129
oppers+5.002
imir+4.916
arn+4.395
avis+4.250
Neg logit contributions
Maker-5.001
gently-4.986
breaks-4.786
 Decoder-4.673
ahime-4.587
Token 0
Token ID657
Feature activation+4.99e+00

Pos logit contributions
unks+5.003
oppers+4.780
imir+4.725
arn+4.324
avis+4.112
Neg logit contributions
gently-5.245
Maker-5.239
breaks-5.021
 Decoder-4.883
ahime-4.827
Token in
Token ID287
Feature activation+7.44e+00

Pos logit contributions
unks+2.105
oppers+2.025
imir+1.981
arn+1.788
avis+1.722
Neg logit contributions
Maker-2.211
gently-2.189
breaks-2.111
 Decoder-2.072
ahime-2.053
Token the
Token ID262
Feature activation+4.25e+00

Pos logit contributions
unks+4.848
oppers+4.614
imir+4.518
arn+4.107
avis+3.939
Neg logit contributions
Maker-5.171
gently-5.126
breaks-4.924
 Decoder-4.821
ahime-4.758
Token aftermath
Token ID16280
Feature activation+1.08e+00

Pos logit contributions
unks+3.018
oppers+2.913
imir+2.836
arn+2.564
avis+2.467
Neg logit contributions
Maker-2.794
gently-2.755
breaks-2.667
 Decoder-2.593
yon-2.574
Token of
Token ID286
Feature activation+1.01e+00

Pos logit contributions
unks+0.776
oppers+0.752
imir+0.727
arn+0.656
avis+0.633
Neg logit contributions
Maker-0.704
gently-0.701
breaks-0.674
 Decoder-0.652
yon-0.646
Token 9
Token ID860
Feature activation+1.69e+00

Pos logit contributions
unks+0.768
oppers+0.747
imir+0.723
arn+0.655
avis+0.635
Neg logit contributions
Maker-0.624
gently-0.619
breaks-0.597
 Decoder-0.573
yon-0.571
Token/
Token ID14
Feature activation+1.42e+01

Pos logit contributions
unks+1.263
oppers+1.230
imir+1.194
arn+1.086
avis+1.040
Neg logit contributions
Maker-1.055
gently-1.052
breaks-1.015
 Decoder-0.972
yon-0.968
Token11
Token ID1157
Feature activation+3.00e+00

Pos logit contributions
unks+6.307
imir+5.733
oppers+5.357
arn+5.274
avis+4.357
Neg logit contributions
gently-12.356
ahime-12.340
��-11.768
 Decoder-11.674
breaks-11.487
Token,
Token ID11
Feature activation+7.41e-01

Pos logit contributions
unks+2.099
oppers+2.017
imir+1.963
arn+1.776
avis+1.692
Neg logit contributions
gently-2.023
Maker-2.021
breaks-1.934
 Decoder-1.879
yon-1.864
Token when
Token ID618
Feature activation-1.83e+00

Pos logit contributions
unks+0.550
oppers+0.535
imir+0.518
arn+0.468
avis+0.452
Neg logit contributions
Maker-0.468
gently-0.465
breaks-0.449
 Decoder-0.431
yon-0.430
Token 19
Token ID678
Feature activation-2.61e+00

Pos logit contributions
Maker+1.115
gently+1.105
breaks+1.075
yon+1.030
 Decoder+1.024
Neg logit contributions
unks-1.385
oppers-1.359
imir-1.309
arn-1.183
avis-1.157
Token guys
Token ID3730
Feature activation-1.26e+00

Pos logit contributions
Maker+1.670
gently+1.656
breaks+1.601
yon+1.548
 Decoder+1.537
Neg logit contributions
unks-1.886
oppers-1.850
imir-1.787
arn-1.594
avis-1.574
Token.
Token ID13
Feature activation+3.78e+00

Pos logit contributions
unks+4.353
oppers+4.273
imir+4.014
arn+3.749
avis+3.583
Neg logit contributions
Maker-4.251
gently-4.095
 Decoder-3.981
ahime-3.946
breaks-3.905
Tokencom
Token ID785
Feature activation+3.99e+00

Pos logit contributions
unks+2.754
oppers+2.651
imir+2.578
arn+2.354
avis+2.278
Neg logit contributions
Maker-2.426
gently-2.395
breaks-2.307
 Decoder-2.244
ahime-2.223
Token/
Token ID14
Feature activation+5.58e+00

Pos logit contributions
unks+2.875
oppers+2.780
imir+2.715
arn+2.434
avis+2.367
Neg logit contributions
Maker-2.592
gently-2.560
breaks-2.457
yon-2.374
 Decoder-2.359
Tokenpolitics
Token ID34127
Feature activation+5.31e+00

Pos logit contributions
unks+3.893
oppers+3.758
imir+3.661
arn+3.376
avis+3.204
Neg logit contributions
Maker-3.618
gently-3.606
breaks-3.424
 Decoder-3.381
ahime-3.375
Token/
Token ID14
Feature activation+6.18e+00

Pos logit contributions
unks+3.719
oppers+3.615
imir+3.533
arn+3.161
avis+3.081
Neg logit contributions
gently-3.471
Maker-3.465
breaks-3.274
ahime-3.231
yon-3.224
Tokenc
Token ID66
Feature activation+1.42e+01

Pos logit contributions
unks+4.258
oppers+4.129
imir+4.043
arn+3.715
avis+3.509
Neg logit contributions
Maker-4.060
gently-4.018
breaks-3.802
ahime-3.787
 Decoder-3.775
Tokennn
Token ID20471
Feature activation+5.45e+00

Pos logit contributions
unks+8.250
oppers+7.548
arn+7.369
imir+7.348
ores+6.414
Neg logit contributions
Maker-10.870
gently-10.127
 Decoder-9.981
ahime-9.818
breaks-9.755
Token-
Token ID12
Feature activation+7.10e+00

Pos logit contributions
unks+3.627
oppers+3.531
imir+3.456
arn+3.122
avis+3.047
Neg logit contributions
Maker-3.709
gently-3.683
breaks-3.506
 Decoder-3.469
ahime-3.461
Tokendrops
Token ID49253
Feature activation+6.82e+00

Pos logit contributions
unks+4.762
oppers+4.593
imir+4.532
arn+4.191
avis+4.020
Neg logit contributions
Maker-4.693
gently-4.546
 Decoder-4.459
ahime-4.404
breaks-4.303
Token-
Token ID12
Feature activation+4.56e+00

Pos logit contributions
unks+4.299
oppers+4.086
imir+4.054
arn+3.654
avis+3.517
Neg logit contributions
Maker-4.819
gently-4.713
breaks-4.580
 Decoder-4.517
ahime-4.455
Tokenre
Token ID260
Feature activation-1.04e+00

Pos logit contributions
unks+3.104
oppers+2.986
imir+2.926
arn+2.674
avis+2.565
Neg logit contributions
Maker-3.037
gently-3.011
 Decoder-2.898
breaks-2.887
ahime-2.844
Token the
Token ID262
Feature activation+1.74e+00

Pos logit contributions
Maker+1.751
gently+1.727
breaks+1.678
yon+1.610
ona+1.585
Neg logit contributions
unks-2.222
oppers-2.175
imir-2.111
arn-1.881
avis-1.843
Token spoil
Token ID20851
Feature activation-4.36e+00

Pos logit contributions
unks+1.252
oppers+1.216
imir+1.169
arn+1.062
avis+1.026
Neg logit contributions
Maker-1.133
gently-1.131
breaks-1.090
 Decoder-1.054
yon-1.044
Token thereof
Token ID15370
Feature activation-9.52e+00

Pos logit contributions
Maker+2.240
gently+2.180
breaks+2.173
yon+2.056
ona+2.015
Neg logit contributions
unks-3.687
oppers-3.621
imir-3.503
=-=-=-=-=-=-=-=--3.130
arn-3.123
Token,
Token ID11
Feature activation-1.11e+01

Pos logit contributions
Maker+4.805
breaks+4.757
gently+4.734
yon+4.571
ona+4.475
Neg logit contributions
unks-7.692
oppers-7.585
imir-7.236
=-=-=-=-=-=-=-=--6.561
avis-6.421
Token shalt
Token ID36258
Feature activation-5.26e+00

Pos logit contributions
Maker+5.573
breaks+5.517
gently+5.475
yon+5.252
ona+5.146
Neg logit contributions
unks-8.927
oppers-8.749
imir-8.331
=-=-=-=-=-=-=-=--7.539
avis-7.393
Token thou
Token ID14210
Feature activation-1.87e+01

Pos logit contributions
Maker+3.053
gently+3.041
breaks+2.957
yon+2.860
ona+2.832
Neg logit contributions
unks-4.084
oppers-4.018
imir-3.832
=-=-=-=-=-=-=-=--3.407
arn-3.369
Token take
Token ID1011
Feature activation-9.85e+00

Pos logit contributions
gently+8.933
Maker+8.437
breaks+8.233
 prize+7.976
yon+7.937
Neg logit contributions
unks-13.681
oppers-13.522
imir-12.883
=-=-=-=-=-=-=-=--11.710
inia-11.307
Token unto
Token ID12722
Feature activation-4.66e+00

Pos logit contributions
Maker+5.324
gently+5.204
breaks+5.100
yon+4.959
ona+4.752
Neg logit contributions
unks-7.630
oppers-7.387
imir-7.322
=-=-=-=-=-=-=-=--6.431
avis-6.419
Token thy
Token ID11906
Feature activation-6.67e+00

Pos logit contributions
Maker+1.882
gently+1.833
breaks+1.758
yon+1.646
ona+1.638
Neg logit contributions
unks-4.465
oppers-4.391
imir-4.255
arn-3.868
=-=-=-=-=-=-=-=--3.843
Tokenself
Token ID944
Feature activation-5.01e+00

Pos logit contributions
Maker+4.182
gently+4.032
breaks+4.002
 Decoder+3.744
ona+3.720
Neg logit contributions
unks-4.835
oppers-4.702
imir-4.662
arn-4.024
avis-4.001
Token.
Token ID13
Feature activation-9.64e-01

Pos logit contributions
Maker+2.910
gently+2.876
breaks+2.813
yon+2.663
ona+2.629
Neg logit contributions
unks-3.878
oppers-3.820
imir-3.710
avis-3.264
arn-3.261
Tokenod
Token ID375
Feature activation+2.98e+00

Pos logit contributions
Maker+1.736
gently+1.719
breaks+1.685
yon+1.637
 Decoder+1.626
Neg logit contributions
unks-1.357
oppers-1.326
imir-1.255
arn-1.097
=-=-=-=-=-=-=-=--1.061
Token,
Token ID11
Feature activation+1.90e+00

Pos logit contributions
unks+2.375
oppers+2.318
imir+2.279
arn+2.072
avis+2.026
Neg logit contributions
gently-1.698
Maker-1.697
breaks-1.629
 Decoder-1.540
roller-1.530
Token brother
Token ID3956
Feature activation+3.08e+00

Pos logit contributions
unks+1.389
oppers+1.357
imir+1.311
arn+1.188
avis+1.153
Neg logit contributions
Maker-1.204
gently-1.201
breaks-1.159
 Decoder-1.107
yon-1.107
Token-
Token ID12
Feature activation-5.94e+00

Pos logit contributions
unks+2.168
oppers+2.098
imir+2.047
arn+1.849
avis+1.784
Neg logit contributions
Maker-2.031
gently-2.023
breaks-1.952
 Decoder-1.888
ahime-1.860
Tokenin
Token ID259
Feature activation-8.68e+00

Pos logit contributions
Maker+4.160
gently+4.064
breaks+4.014
yon+3.802
roller+3.764
Neg logit contributions
oppers-3.904
unks-3.898
imir-3.690
=-=-=-=-=-=-=-=--3.228
arn-3.167
Token-
Token ID12
Feature activation-1.84e+01

Pos logit contributions
Maker+4.503
gently+4.466
breaks+4.457
ona+4.099
yon+4.094
Neg logit contributions
unks-7.027
oppers-7.005
imir-6.581
 disg-5.899
=-=-=-=-=-=-=-=--5.886
Tokenlaw
Token ID6270
Feature activation+1.99e+00

Pos logit contributions
Maker+4.319
gently+3.982
breaks+3.874
yon+3.570
roller+2.968
Neg logit contributions
oppers-19.065
unks-18.637
imir-17.772
=-=-=-=-=-=-=-=--16.956
OPLE-16.593
Token Raj
Token ID13308
Feature activation-9.80e-01

Pos logit contributions
unks+1.488
oppers+1.446
imir+1.409
arn+1.280
avis+1.237
Neg logit contributions
gently-1.237
Maker-1.233
breaks-1.187
 Decoder-1.134
yon-1.126
Tokennik
Token ID17187
Feature activation+1.94e+00

Pos logit contributions
Maker+0.672
gently+0.667
breaks+0.648
yon+0.626
 Decoder+0.622
Neg logit contributions
unks-0.670
oppers-0.655
imir-0.629
arn-0.560
avis-0.541
Tokenanth
Token ID29313
Feature activation+1.17e+00

Pos logit contributions
unks+1.452
oppers+1.410
imir+1.376
arn+1.249
avis+1.212
Neg logit contributions
gently-1.207
Maker-1.202
breaks-1.157
 Decoder-1.110
yon-1.097
Token Singh
Token ID14403
Feature activation+3.68e+00

Pos logit contributions
unks+0.870
oppers+0.848
imir+0.822
arn+0.745
avis+0.723
Neg logit contributions
Maker-0.727
gently-0.725
breaks-0.700
 Decoder-0.669
yon-0.667
Token­
Token ID3907
Feature activation-1.13e+01

Pos logit contributions
Maker+0.057
gently+0.056
breaks+0.054
yon+0.052
 Decoder+0.052
Neg logit contributions
unks-0.083
oppers-0.081
imir-0.079
arn-0.072
avis-0.070
Tokenon
Token ID261
Feature activation+9.20e-01

Pos logit contributions
gently+6.978
breaks+6.706
Maker+6.658
ona+6.574
yon+6.474
Neg logit contributions
unks-7.835
oppers-7.624
imir-7.270
=-=-=-=-=-=-=-=--6.731
 Continent-6.433
Token to
Token ID284
Feature activation-1.90e+00

Pos logit contributions
unks+0.673
oppers+0.654
imir+0.635
arn+0.573
avis+0.553
Neg logit contributions
Maker-0.589
gently-0.585
breaks-0.567
 Decoder-0.545
yon-0.542
Token this
Token ID428
Feature activation+2.23e-01

Pos logit contributions
Maker+1.119
gently+1.113
breaks+1.072
 Decoder+1.032
yon+1.027
Neg logit contributions
unks-1.474
oppers-1.443
imir-1.394
arn-1.264
avis-1.232
Token pres
Token ID906
Feature activation-3.15e+00

Pos logit contributions
unks+0.161
oppers+0.157
imir+0.151
arn+0.136
avis+0.132
Neg logit contributions
Maker-0.145
gently-0.144
breaks-0.139
 Decoder-0.134
yon-0.133
Token­
Token ID3907
Feature activation-1.75e+01

Pos logit contributions
gently+1.626
Maker+1.619
breaks+1.556
yon+1.512
ona+1.491
Neg logit contributions
unks-2.652
oppers-2.593
imir-2.511
=-=-=-=-=-=-=-=--2.261
arn-2.256
Tokenid
Token ID312
Feature activation-6.38e+00

Pos logit contributions
gently+9.658
yon+9.491
ona+9.321
topic+9.194
roller+9.097
Neg logit contributions
unks-11.574
oppers-10.572
=-=-=-=-=-=-=-=--10.502
OPLE-9.711
 Continent-9.695
Token­
Token ID3907
Feature activation-1.07e+01

Pos logit contributions
gently+3.074
Maker+3.026
ona+2.846
yon+2.843
breaks+2.774
Neg logit contributions
unks-5.503
oppers-5.413
imir-5.185
=-=-=-=-=-=-=-=--4.808
arn-4.680
Tokenent
Token ID298
Feature activation-1.30e-01

Pos logit contributions
gently+6.140
ona+5.872
breaks+5.811
roller+5.634
yon+5.628
Neg logit contributions
unks-7.918
oppers-7.739
=-=-=-=-=-=-=-=--7.139
imir-6.957
 Continent-6.678
Token,
Token ID11
Feature activation+2.82e-01

Pos logit contributions
Maker+0.081
gently+0.081
breaks+0.078
 Decoder+0.075
yon+0.075
Neg logit contributions
unks-0.097
oppers-0.095
imir-0.092
arn-0.083
avis-0.080
Token
Token ID447
Feature activation+1.49e+00

Pos logit contributions
unks+0.212
oppers+0.207
imir+0.200
arn+0.181
avis+0.175
Neg logit contributions
Maker-0.176
gently-0.174
breaks-0.169
 Decoder-0.162
yon-0.161
Token his
Token ID465
Feature activation-9.69e-02

Pos logit contributions
Maker+0.280
gently+0.278
breaks+0.268
 Decoder+0.256
yon+0.256
Neg logit contributions
unks-0.379
oppers-0.371
imir-0.358
arn-0.326
avis-0.317
Token new
Token ID649
Feature activation+1.10e+00

Pos logit contributions
Maker+0.059
gently+0.059
breaks+0.057
 Decoder+0.054
yon+0.054
Neg logit contributions
unks-0.074
oppers-0.072
imir-0.070
arn-0.063
avis-0.061
Token brother
Token ID3956
Feature activation+2.67e+00

Pos logit contributions
unks+0.861
oppers+0.839
imir+0.814
arn+0.741
avis+0.719
Neg logit contributions
Maker-0.651
gently-0.647
breaks-0.625
 Decoder-0.597
yon-0.597
Token-
Token ID12
Feature activation-4.85e+00

Pos logit contributions
unks+2.009
oppers+1.951
imir+1.905
arn+1.730
avis+1.667
Neg logit contributions
gently-1.627
Maker-1.624
breaks-1.554
 Decoder-1.498
ahime-1.489
Tokenin
Token ID259
Feature activation-8.59e+00

Pos logit contributions
Maker+3.524
gently+3.467
breaks+3.365
yon+3.189
roller+3.161
Neg logit contributions
unks-3.133
oppers-3.091
imir-2.923
=-=-=-=-=-=-=-=--2.538
arn-2.527
Token-
Token ID12
Feature activation-1.66e+01

Pos logit contributions
Maker+4.523
gently+4.419
breaks+4.365
yon+4.050
ona+4.038
Neg logit contributions
unks-7.012
oppers-7.000
imir-6.520
=-=-=-=-=-=-=-=--5.867
avis-5.836
Tokenlaw
Token ID6270
Feature activation+5.41e-01

Pos logit contributions
Maker+5.466
breaks+5.084
gently+5.032
yon+4.785
ona+4.257
Neg logit contributions
oppers-16.128
unks-15.436
imir-14.766
=-=-=-=-=-=-=-=--14.445
 disg-13.832
Token,
Token ID11
Feature activation+1.63e-01

Pos logit contributions
unks+0.411
oppers+0.401
imir+0.389
arn+0.352
avis+0.341
Neg logit contributions
Maker-0.330
gently-0.328
breaks-0.317
 Decoder-0.303
yon-0.302
Token E
Token ID412
Feature activation+2.06e+00

Pos logit contributions
unks+0.128
oppers+0.125
imir+0.121
arn+0.110
avis+0.107
Neg logit contributions
Maker-0.095
gently-0.095
breaks-0.091
 Decoder-0.087
yon-0.087
Tokenad
Token ID324
Feature activation+4.51e-01

Pos logit contributions
unks+0.993
oppers+0.951
imir+0.917
arn+0.791
avis+0.748
Neg logit contributions
Maker-1.829
gently-1.814
breaks-1.775
 Decoder-1.734
yon-1.711
Tokengar
Token ID4563
Feature activation-5.69e+00

Pos logit contributions
unks+0.211
oppers+0.202
imir+0.192
arn+0.161
avis+0.153
Neg logit contributions
Maker-0.408
gently-0.405
breaks-0.396
 Decoder-0.385
yon-0.384
Token contemporary
Token ID11811
Feature activation-1.33e+00

Pos logit contributions
Maker+1.132
gently+1.132
breaks+1.092
 Decoder+1.048
yon+1.048
Neg logit contributions
unks-1.414
oppers-1.382
imir-1.334
arn-1.201
avis-1.168
Token society
Token ID3592
Feature activation-6.37e+00

Pos logit contributions
Maker+0.815
gently+0.812
breaks+0.786
yon+0.754
 Decoder+0.750
Neg logit contributions
unks-1.000
oppers-0.979
imir-0.944
arn-0.850
avis-0.828
Token,
Token ID11
Feature activation-1.44e+00

Pos logit contributions
Maker+3.482
gently+3.471
breaks+3.402
 Decoder+3.222
ona+3.211
Neg logit contributions
unks-5.043
oppers-4.950
imir-4.800
arn-4.301
avis-4.180
Token are
Token ID389
Feature activation-1.31e+00

Pos logit contributions
Maker+0.891
gently+0.890
breaks+0.859
yon+0.826
 Decoder+0.825
Neg logit contributions
unks-1.077
oppers-1.054
imir-1.010
arn-0.910
avis-0.885
Token not
Token ID407
Feature activation-2.88e+00

Pos logit contributions
Maker+0.857
gently+0.851
breaks+0.823
 Decoder+0.791
yon+0.790
Neg logit contributions
unks-0.937
oppers-0.916
imir-0.876
arn-0.786
avis-0.760
Token uncont
Token ID28605
Feature activation-1.62e+01

Pos logit contributions
Maker+1.834
gently+1.831
breaks+1.772
 Decoder+1.707
yon+1.704
Neg logit contributions
unks-2.111
oppers-2.065
imir-1.960
arn-1.744
avis-1.709
Tokenroversial
Token ID46927
Feature activation-2.69e+00

Pos logit contributions
gently+6.613
breaks+6.305
ona+5.932
roller+5.833
yon+5.725
Neg logit contributions
unks-13.795
oppers-13.453
imir-13.009
OME-12.360
 disg-12.357
Token or
Token ID393
Feature activation-5.16e+00

Pos logit contributions
Maker+1.603
gently+1.603
breaks+1.555
yon+1.484
 Decoder+1.471
Neg logit contributions
unks-2.063
oppers-2.045
imir-1.959
arn-1.737
avis-1.704
Token immune
Token ID10900
Feature activation-2.86e+00

Pos logit contributions
gently+3.167
Maker+3.162
breaks+3.062
yon+2.953
 Decoder+2.947
Neg logit contributions
unks-3.848
oppers-3.760
imir-3.567
arn-3.148
avis-3.111
Token from
Token ID422
Feature activation-4.09e+00

Pos logit contributions
gently+1.754
Maker+1.744
breaks+1.678
yon+1.622
ona+1.598
Neg logit contributions
unks-2.157
oppers-2.103
imir-2.040
arn-1.822
=-=-=-=-=-=-=-=--1.777
Token any
Token ID597
Feature activation-2.05e-01

Pos logit contributions
gently+2.479
Maker+2.462
breaks+2.423
yon+2.324
 Decoder+2.319
Neg logit contributions
unks-3.086
oppers-3.020
imir-2.888
arn-2.575
avis-2.508
Token away
Token ID1497
Feature activation+1.93e-01

Pos logit contributions
unks+0.713
oppers+0.693
imir+0.666
arn+0.591
avis+0.568
Neg logit contributions
Maker-0.826
gently-0.819
breaks-0.797
 Decoder-0.768
yon-0.766
Token.
Token ID13
Feature activation-3.21e+00

Pos logit contributions
unks+0.144
oppers+0.140
imir+0.136
arn+0.122
avis+0.119
Neg logit contributions
Maker-0.121
gently-0.120
breaks-0.116
 Decoder-0.112
yon-0.111
Token Note
Token ID5740
Feature activation+3.89e+00

Pos logit contributions
Maker+1.890
gently+1.873
breaks+1.808
 Decoder+1.776
yon+1.726
Neg logit contributions
unks-2.506
oppers-2.470
imir-2.366
arn-2.130
avis-2.076
Token that
Token ID326
Feature activation-1.79e+00

Pos logit contributions
unks+2.711
oppers+2.620
imir+2.561
arn+2.322
avis+2.214
Neg logit contributions
Maker-2.587
gently-2.527
breaks-2.463
ahime-2.388
yon-2.386
Token on
Token ID319
Feature activation-1.07e+01

Pos logit contributions
Maker+1.109
gently+1.105
breaks+1.068
 Decoder+1.031
yon+1.016
Neg logit contributions
unks-1.348
oppers-1.317
imir-1.268
arn-1.138
avis-1.116
Token some
Token ID617
Feature activation-1.60e+01

Pos logit contributions
gently+5.494
Maker+5.476
breaks+5.302
 Decoder+5.216
ona+5.057
Neg logit contributions
unks-8.361
oppers-8.306
imir-7.896
estial-7.093
arn-7.032
Token radios
Token ID40023
Feature activation-4.87e+00

Pos logit contributions
Maker+7.836
breaks+7.675
 Decoder+7.367
gently+7.349
FX+7.185
Neg logit contributions
unks-12.000
oppers-11.885
imir-11.291
estial-10.233
avis-10.074
Token,
Token ID11
Feature activation-5.31e+00

Pos logit contributions
Maker+2.818
gently+2.766
breaks+2.734
 Decoder+2.614
yon+2.563
Neg logit contributions
unks-3.818
oppers-3.726
imir-3.592
arn-3.243
avis-3.209
Token when
Token ID618
Feature activation-1.48e+00

Pos logit contributions
Maker+3.027
gently+3.023
breaks+2.946
 Decoder+2.842
yon+2.767
Neg logit contributions
unks-4.169
oppers-4.071
imir-3.916
arn-3.533
avis-3.493
Token you
Token ID345
Feature activation-2.07e+00

Pos logit contributions
Maker+0.898
gently+0.894
breaks+0.864
 Decoder+0.831
yon+0.822
Neg logit contributions
unks-1.133
oppers-1.108
imir-1.071
arn-0.966
avis-0.943
Token un
Token ID555
Feature activation-7.07e+00

Pos logit contributions
gently+1.313
Maker+1.294
breaks+1.253
 Decoder+1.219
yon+1.202
Neg logit contributions
unks-1.525
oppers-1.492
imir-1.426
arn-1.285
avis-1.263
Token In
Token ID554
Feature activation-3.30e+00

Pos logit contributions
Maker+1.473
gently+1.458
breaks+1.423
 Decoder+1.360
yon+1.346
Neg logit contributions
unks-1.912
oppers-1.872
imir-1.816
arn-1.630
avis-1.582
Token arm
Token ID3211
Feature activation-7.64e-01

Pos logit contributions
gently+2.003
Maker+1.995
breaks+1.961
yon+1.849
ona+1.820
Neg logit contributions
unks-2.494
oppers-2.437
imir-2.366
arn-2.118
avis-2.057
Tokenad
Token ID324
Feature activation+2.28e+00

Pos logit contributions
Maker+0.665
gently+0.663
breaks+0.649
yon+0.628
 Decoder+0.627
Neg logit contributions
unks-0.380
oppers-0.367
imir-0.348
arn-0.294
avis-0.279
Tokenillo
Token ID16111
Feature activation-1.92e+00

Pos logit contributions
unks+0.709
oppers+0.657
imir+0.598
arn+0.451
avis+0.410
Neg logit contributions
Maker-2.440
gently-2.413
breaks-2.365
 Decoder-2.330
yon-2.301
Token le
Token ID443
Feature activation-5.93e+00

Pos logit contributions
Maker+1.237
gently+1.234
breaks+1.211
yon+1.156
ona+1.141
Neg logit contributions
unks-1.372
oppers-1.339
imir-1.300
arn-1.159
=-=-=-=-=-=-=-=--1.127
Tokenpro
Token ID1676
Feature activation-1.57e+01

Pos logit contributions
Maker+3.546
gently+3.538
breaks+3.475
yon+3.360
ona+3.339
Neg logit contributions
unks-4.401
oppers-4.245
imir-4.204
arn-3.768
=-=-=-=-=-=-=-=--3.746
Tokensy
Token ID1837
Feature activation-3.44e+00

Pos logit contributions
gently+6.166
yon+5.914
breaks+5.754
topic+5.442
ona+5.258
Neg logit contributions
unks-13.260
oppers-13.048
imir-12.958
=-=-=-=-=-=-=-=--12.134
 disg-12.028
Token,
Token ID11
Feature activation-5.24e+00

Pos logit contributions
Maker+2.095
gently+2.092
breaks+2.068
ona+1.952
yon+1.947
Neg logit contributions
unks-2.567
oppers-2.503
imir-2.451
arn-2.197
=-=-=-=-=-=-=-=--2.140
Token lesions
Token ID35258
Feature activation+2.75e-01

Pos logit contributions
gently+3.121
Maker+3.110
breaks+3.083
 Decoder+2.870
ona+2.859
Neg logit contributions
unks-3.978
oppers-3.858
imir-3.759
arn-3.384
=-=-=-=-=-=-=-=--3.304
Token did
Token ID750
Feature activation+2.75e+00

Pos logit contributions
unks+0.198
oppers+0.193
imir+0.186
arn+0.167
avis+0.162
Neg logit contributions
Maker-0.179
gently-0.178
breaks-0.173
 Decoder-0.166
yon-0.165
Token not
Token ID407
Feature activation+3.26e+00

Pos logit contributions
unks+1.776
oppers+1.722
imir+1.663
arn+1.479
avis+1.438
Neg logit contributions
Maker-1.974
gently-1.951
breaks-1.902
 Decoder-1.855
yon-1.841
Token.
Token ID13
Feature activation+1.89e+00

Pos logit contributions
unks+1.546
oppers+1.492
imir+1.442
arn+1.312
avis+1.272
Neg logit contributions
Maker-1.390
gently-1.376
breaks-1.336
 Decoder-1.284
yon-1.268
Token Cha
Token ID20703
Feature activation+3.29e+00

Pos logit contributions
unks+1.468
oppers+1.426
imir+1.387
arn+1.262
avis+1.226
Neg logit contributions
gently-1.127
Maker-1.113
breaks-1.081
ahime-1.027
 Decoder-1.021
Tokenvel
Token ID626
Feature activation-4.13e+00

Pos logit contributions
unks+2.467
oppers+2.401
imir+2.310
arn+2.112
avis+2.060
Neg logit contributions
Maker-2.071
gently-2.068
breaks-1.991
 Decoder-1.899
yon-1.877
Token (
Token ID357
Feature activation-5.86e+00

Pos logit contributions
Maker+2.654
gently+2.615
breaks+2.549
yon+2.483
 Decoder+2.440
Neg logit contributions
unks-2.957
oppers-2.894
imir-2.775
arn-2.474
=-=-=-=-=-=-=-=--2.447
TokenNew
Token ID3791
Feature activation-4.19e+00

Pos logit contributions
Maker+3.655
gently+3.522
breaks+3.408
yon+3.362
ona+3.237
Neg logit contributions
unks-4.298
oppers-4.284
imir-4.140
=-=-=-=-=-=-=-=--3.615
arn-3.589
Token York
Token ID1971
Feature activation-1.57e+01

Pos logit contributions
Maker+2.140
gently+2.109
breaks+2.018
yon+1.967
 Decoder+1.966
Neg logit contributions
unks-3.562
oppers-3.499
imir-3.381
arn-3.066
=-=-=-=-=-=-=-=--3.064
Token:
Token ID25
Feature activation-6.89e+00

Pos logit contributions
Maker+6.177
gently+5.699
 Decoder+5.698
ona+5.643
 Petty+5.395
Neg logit contributions
oppers-13.148
unks-13.024
imir-12.505
=-=-=-=-=-=-=-=--11.766
avis-11.358
Token Sh
Token ID911
Feature activation+2.26e+00

Pos logit contributions
Maker+3.828
 Decoder+3.541
breaks+3.492
gently+3.492
yon+3.414
Neg logit contributions
oppers-5.550
unks-5.506
imir-5.327
arn-4.719
=-=-=-=-=-=-=-=--4.681
Tokenilo
Token ID18526
Feature activation-3.72e+00

Pos logit contributions
unks+1.620
oppers+1.601
imir+1.510
arn+1.384
avis+1.318
Neg logit contributions
gently-1.492
Maker-1.491
breaks-1.432
 Decoder-1.392
ahime-1.362
Token Publishing
Token ID23499
Feature activation+5.37e+00

Pos logit contributions
Maker+2.246
gently+2.156
breaks+2.103
yon+2.089
 Decoder+2.061
Neg logit contributions
unks-2.820
oppers-2.781
imir-2.703
arn-2.410
avis-2.356
Token House
Token ID2097
Feature activation-2.38e+00

Pos logit contributions
unks+3.943
oppers+3.764
imir+3.649
arn+3.371
avis+3.192
Neg logit contributions
gently-3.396
Maker-3.283
breaks-3.257
ahime-3.124
yon-3.084
Token Defense
Token ID5947
Feature activation-2.05e+00

Pos logit contributions
unks+2.657
oppers+2.588
imir+2.509
arn+2.290
avis+2.221
Neg logit contributions
Maker-2.173
gently-2.153
breaks-2.079
yon-1.972
 Decoder-1.960
Token and
Token ID290
Feature activation+7.60e-01

Pos logit contributions
Maker+1.192
gently+1.179
breaks+1.138
 Decoder+1.097
yon+1.091
Neg logit contributions
unks-1.624
oppers-1.594
imir-1.529
arn-1.381
avis-1.350
Token commander
Token ID11561
Feature activation+2.22e+00

Pos logit contributions
unks+0.587
oppers+0.573
imir+0.556
arn+0.505
avis+0.490
Neg logit contributions
Maker-0.454
gently-0.451
breaks-0.436
yon-0.415
 Decoder-0.415
Token-
Token ID12
Feature activation-5.18e+00

Pos logit contributions
unks+1.578
oppers+1.531
imir+1.489
arn+1.346
avis+1.297
Neg logit contributions
Maker-1.455
gently-1.450
breaks-1.407
 Decoder-1.350
yon-1.349
Tokenin
Token ID259
Feature activation-5.34e+00

Pos logit contributions
Maker+3.642
gently+3.618
breaks+3.520
roller+3.398
yon+3.369
Neg logit contributions
unks-3.378
oppers-3.332
imir-3.152
arn-2.742
=-=-=-=-=-=-=-=--2.710
Token-
Token ID12
Feature activation-1.55e+01

Pos logit contributions
Maker+2.994
gently+2.979
breaks+2.876
yon+2.727
 Decoder+2.712
Neg logit contributions
unks-4.290
oppers-4.246
imir-3.957
arn-3.574
avis-3.530
Tokenchief
Token ID17351
Feature activation-4.71e+00

Pos logit contributions
gently+5.892
Maker+5.565
breaks+5.478
roller+5.374
topic+5.154
Neg logit contributions
oppers-14.209
unks-13.808
imir-12.966
=-=-=-=-=-=-=-=--12.272
OME-11.995
Token of
Token ID286
Feature activation+3.34e+00

Pos logit contributions
Maker+2.568
gently+2.539
breaks+2.442
 Decoder+2.366
yon+2.342
Neg logit contributions
unks-3.840
oppers-3.792
imir-3.613
arn-3.266
avis-3.210
Token the
Token ID262
Feature activation+3.78e+00

Pos logit contributions
unks+2.371
oppers+2.325
imir+2.257
arn+2.038
avis+1.970
Neg logit contributions
Maker-2.187
gently-2.154
breaks-2.087
yon-2.008
 Decoder-1.977
Token German
Token ID2679
Feature activation+5.29e+00

Pos logit contributions
unks+2.747
oppers+2.682
imir+2.618
arn+2.385
avis+2.288
Neg logit contributions
Maker-2.411
gently-2.373
breaks-2.309
yon-2.201
ahime-2.179
Token Army
Token ID5407
Feature activation-1.38e+00

Pos logit contributions
unks+4.034
oppers+3.977
imir+3.845
arn+3.560
avis+3.405
Neg logit contributions
Maker-3.148
gently-3.102
breaks-2.993
yon-2.846
ahime-2.838
Token well
Token ID880
Feature activation-1.68e+00

Pos logit contributions
unks+0.392
oppers+0.381
imir+0.367
arn+0.329
avis+0.317
Neg logit contributions
Maker-0.405
gently-0.403
breaks-0.390
 Decoder-0.376
yon-0.375
Token as
Token ID355
Feature activation+1.83e+00

Pos logit contributions
Maker+1.064
gently+1.051
breaks+1.023
 Decoder+0.981
yon+0.970
Neg logit contributions
unks-1.247
oppers-1.225
imir-1.178
arn-1.048
avis-1.027
Token any
Token ID597
Feature activation+1.44e+00

Pos logit contributions
unks+1.342
oppers+1.303
imir+1.265
arn+1.146
avis+1.103
Neg logit contributions
Maker-1.169
gently-1.159
breaks-1.116
 Decoder-1.075
yon-1.074
Token sense
Token ID2565
Feature activation+6.75e-01

Pos logit contributions
unks+1.039
oppers+1.009
imir+0.978
arn+0.883
avis+0.852
Neg logit contributions
Maker-0.939
gently-0.931
breaks-0.900
 Decoder-0.866
yon-0.863
Token of
Token ID286
Feature activation-1.39e+00

Pos logit contributions
unks+0.490
oppers+0.476
imir+0.461
arn+0.416
avis+0.403
Neg logit contributions
Maker-0.435
gently-0.433
breaks-0.419
 Decoder-0.402
yon-0.401
Token inev
Token ID9026
Feature activation-1.53e+01

Pos logit contributions
Maker+0.851
gently+0.845
breaks+0.817
 Decoder+0.785
yon+0.782
Neg logit contributions
unks-1.053
oppers-1.031
imir-0.990
arn-0.895
avis-0.873
Tokenitability
Token ID34147
Feature activation-5.16e+00

Pos logit contributions
ona+8.948
gently+8.900
yon+8.710
Maker+8.643
breaks+8.557
Neg logit contributions
oppers-9.975
unks-9.570
imir-9.129
=-=-=-=-=-=-=-=--9.129
 Continent-8.316
Token about
Token ID546
Feature activation-2.51e+00

Pos logit contributions
Maker+2.991
gently+2.949
breaks+2.865
 Decoder+2.726
yon+2.718
Neg logit contributions
unks-4.000
oppers-3.954
imir-3.756
arn-3.342
avis-3.333
Token what
Token ID644
Feature activation-4.14e+00

Pos logit contributions
Maker+1.475
gently+1.459
breaks+1.414
 Decoder+1.355
yon+1.347
Neg logit contributions
unks-1.964
oppers-1.932
imir-1.847
arn-1.671
avis-1.645
Token happens
Token ID4325
Feature activation-3.25e+00

Pos logit contributions
Maker+2.532
gently+2.484
breaks+2.455
 Decoder+2.319
yon+2.309
Neg logit contributions
unks-3.120
oppers-3.075
imir-2.893
arn-2.619
avis-2.615
Token in
Token ID287
Feature activation-3.16e+00

Pos logit contributions
Maker+1.952
gently+1.943
breaks+1.903
 Decoder+1.799
yon+1.788
Neg logit contributions
unks-2.486
oppers-2.442
imir-2.326
arn-2.087
avis-2.054
TokenWe
Token ID1135
Feature activation+2.46e+00

Pos logit contributions
unks+2.168
oppers+2.111
imir+2.035
arn+1.849
avis+1.761
Neg logit contributions
gently-2.122
Maker-2.098
breaks-2.032
 Decoder-1.995
ahime-1.973
Token must
Token ID1276
Feature activation-9.80e-01

Pos logit contributions
unks+1.784
oppers+1.752
imir+1.691
arn+1.541
avis+1.486
Neg logit contributions
Maker-1.583
gently-1.560
breaks-1.514
 Decoder-1.449
yon-1.443
Token commit
Token ID4589
Feature activation-2.75e+00

Pos logit contributions
Maker+0.582
gently+0.581
breaks+0.561
 Decoder+0.536
yon+0.535
Neg logit contributions
unks-0.760
oppers-0.742
imir-0.717
arn-0.648
avis-0.631
Token ourselves
Token ID6731
Feature activation-3.33e+00

Pos logit contributions
gently+1.574
Maker+1.573
breaks+1.515
 Decoder+1.440
yon+1.436
Neg logit contributions
unks-2.181
oppers-2.125
imir-2.049
arn-1.853
avis-1.816
Token very
Token ID845
Feature activation-7.79e+00

Pos logit contributions
gently+2.050
Maker+2.027
breaks+1.971
yon+1.888
 Decoder+1.867
Neg logit contributions
unks-2.488
oppers-2.425
imir-2.341
arn-2.094
=-=-=-=-=-=-=-=--2.057
Token res
Token ID581
Feature activation-1.46e+01

Pos logit contributions
gently+4.673
Maker+4.387
breaks+4.336
yon+4.204
ona+4.039
Neg logit contributions
unks-5.809
oppers-5.786
imir-5.566
=-=-=-=-=-=-=-=--4.948
avis-4.842
Tokenolutely
Token ID16780
Feature activation-3.82e+00

Pos logit contributions
gently+5.919
breaks+5.067
ona+4.970
yon+4.629
roller+4.359
Neg logit contributions
unks-12.987
oppers-12.485
imir-12.068
=-=-=-=-=-=-=-=--11.661
 disg-11.328
Token because
Token ID780
Feature activation+7.97e+00

Pos logit contributions
Maker+1.997
gently+1.996
breaks+1.917
yon+1.826
 Decoder+1.825
Neg logit contributions
unks-3.190
oppers-3.128
imir-3.020
arn-2.718
avis-2.688
Token there
Token ID612
Feature activation+9.35e+00

Pos logit contributions
unks+5.165
oppers+4.994
imir+4.849
arn+4.464
=-=-=-=-=-=-=-=-+4.148
Neg logit contributions
gently-5.511
Maker-5.398
breaks-5.168
 Decoder-5.072
roller-5.048
Token isn
Token ID2125
Feature activation+3.48e+00

Pos logit contributions
unks+5.594
oppers+5.478
imir+5.421
arn+4.761
inia+4.354
Neg logit contributions
Maker-6.740
gently-6.737
breaks-6.390
 Decoder-6.245
yon-6.231
Token
Token ID447
Feature activation-5.34e-01

Pos logit contributions
unks+2.377
oppers+2.301
imir+2.264
arn+2.013
=-=-=-=-=-=-=-=-+1.931
Neg logit contributions
gently-2.356
Maker-2.354
breaks-2.265
 Decoder-2.200
ahime-2.181
Token,
Token ID11
Feature activation-1.83e-01

Pos logit contributions
Maker+0.883
gently+0.875
breaks+0.850
 Decoder+0.814
yon+0.811
Neg logit contributions
unks-1.079
oppers-1.056
imir-1.019
arn-0.920
avis-0.894
Token
Token ID447
Feature activation+4.11e-01

Pos logit contributions
Maker+0.115
gently+0.115
breaks+0.111
 Decoder+0.106
yon+0.106
Neg logit contributions
unks-0.135
oppers-0.132
imir-0.127
arn-0.115
avis-0.112
Token
Token ID251
Feature activation+1.24e+00

Pos logit contributions
unks+0.248
oppers+0.240
imir+0.231
arn+0.203
avis+0.195
Neg logit contributions
Maker-0.317
gently-0.314
breaks-0.304
 Decoder-0.296
yon-0.295
Token he
Token ID339
Feature activation-2.41e-01

Pos logit contributions
unks+0.910
oppers+0.885
imir+0.859
arn+0.773
avis+0.749
Neg logit contributions
Maker-0.793
gently-0.786
breaks-0.760
 Decoder-0.732
yon-0.731
Token says
Token ID1139
Feature activation+7.28e-01

Pos logit contributions
Maker+0.155
gently+0.154
breaks+0.149
 Decoder+0.143
yon+0.143
Neg logit contributions
unks-0.176
oppers-0.172
imir-0.166
arn-0.150
avis-0.145
Token j
Token ID474
Feature activation-1.46e+01

Pos logit contributions
unks+0.533
oppers+0.519
imir+0.503
arn+0.454
avis+0.439
Neg logit contributions
Maker-0.465
gently-0.461
breaks-0.446
 Decoder-0.429
yon-0.429
Tokenokingly
Token ID48343
Feature activation-1.71e+00

Pos logit contributions
gently+5.584
ona+5.380
breaks+5.061
yon+4.999
Maker+4.976
Neg logit contributions
oppers-12.711
unks-12.551
imir-12.116
=-=-=-=-=-=-=-=--11.502
OME-11.335
Token.
Token ID13
Feature activation-9.00e-01

Pos logit contributions
gently+1.035
Maker+1.032
breaks+0.997
 Decoder+0.948
yon+0.945
Neg logit contributions
unks-1.309
oppers-1.285
imir-1.232
arn-1.112
avis-1.086
Token One
Token ID1881
Feature activation+8.77e-02

Pos logit contributions
Maker+0.555
gently+0.551
breaks+0.534
 Decoder+0.512
yon+0.509
Neg logit contributions
unks-0.679
oppers-0.665
imir-0.639
arn-0.579
avis-0.562
Token of
Token ID286
Feature activation+2.15e+00

Pos logit contributions
unks+0.065
oppers+0.064
imir+0.062
arn+0.056
avis+0.054
Neg logit contributions
Maker-0.055
gently-0.055
breaks-0.053
 Decoder-0.051
yon-0.050
Token the
Token ID262
Feature activation+4.93e+00

Pos logit contributions
unks+1.608
oppers+1.560
imir+1.525
arn+1.372
avis+1.318
Neg logit contributions
Maker-1.342
gently-1.332
breaks-1.287
yon-1.236
 Decoder-1.229
Token in
Token ID287
Feature activation-3.00e+00

Pos logit contributions
Maker+1.500
gently+1.486
breaks+1.434
 Decoder+1.380
yon+1.374
Neg logit contributions
unks-1.800
oppers-1.762
imir-1.699
arn-1.533
avis-1.490
Token fond
Token ID16245
Feature activation+3.15e+00

Pos logit contributions
Maker+1.859
gently+1.836
breaks+1.780
 Decoder+1.713
yon+1.709
Neg logit contributions
unks-2.243
oppers-2.191
imir-2.121
arn-1.899
avis-1.846
Tokenness
Token ID1108
Feature activation-6.63e-01

Pos logit contributions
unks+2.280
oppers+2.206
imir+2.127
arn+1.947
avis+1.857
Neg logit contributions
Maker-2.020
gently-2.006
breaks-1.936
 Decoder-1.889
yon-1.871
Token and
Token ID290
Feature activation+1.71e-01

Pos logit contributions
Maker+0.416
gently+0.414
breaks+0.400
 Decoder+0.383
yon+0.383
Neg logit contributions
unks-0.493
oppers-0.481
imir-0.464
arn-0.419
avis-0.407
Token in
Token ID287
Feature activation+1.25e-01

Pos logit contributions
unks+0.121
oppers+0.118
imir+0.114
arn+0.102
avis+0.099
Neg logit contributions
Maker-0.113
gently-0.112
breaks-0.108
 Decoder-0.104
yon-0.104
Token j
Token ID474
Feature activation-1.45e+01

Pos logit contributions
unks+0.089
oppers+0.087
imir+0.084
arn+0.076
avis+0.073
Neg logit contributions
Maker-0.082
gently-0.082
breaks-0.079
 Decoder-0.076
yon-0.076
Tokenib
Token ID571
Feature activation-1.42e+00

Pos logit contributions
breaks+7.560
gently+7.366
ona+7.306
efe+7.279
Maker+7.197
Neg logit contributions
oppers-10.167
imir-9.824
unks-9.814
=-=-=-=-=-=-=-=--9.008
 Continent-8.790
Tokenes
Token ID274
Feature activation-3.38e+00

Pos logit contributions
Maker+0.836
gently+0.835
breaks+0.807
yon+0.772
 Decoder+0.767
Neg logit contributions
unks-1.105
oppers-1.083
imir-1.047
arn-0.946
avis-0.923
Token,
Token ID11
Feature activation+3.01e+00

Pos logit contributions
Maker+2.022
gently+2.019
breaks+1.940
yon+1.864
 Decoder+1.855
Neg logit contributions
unks-2.596
oppers-2.533
imir-2.426
arn-2.186
avis-2.141
Token identify
Token ID5911
Feature activation+2.00e+00

Pos logit contributions
unks+2.045
oppers+1.988
imir+1.929
arn+1.730
avis+1.676
Neg logit contributions
Maker-2.047
gently-2.025
breaks-1.987
yon-1.905
 Decoder-1.899
Token accents
Token ID39271
Feature activation+3.10e+00

Pos logit contributions
unks+1.449
oppers+1.411
imir+1.369
arn+1.235
avis+1.198
Neg logit contributions
Maker-1.289
gently-1.275
breaks-1.239
yon-1.190
 Decoder-1.188
Token at
Token ID379
Feature activation-2.14e+00

Pos logit contributions
Maker+3.390
gently+3.356
breaks+3.268
 Decoder+3.149
roller+3.068
Neg logit contributions
unks-4.495
oppers-4.382
imir-4.205
arn-3.800
avis-3.729
Token 16
Token ID1467
Feature activation-4.28e+00

Pos logit contributions
Maker+1.267
gently+1.250
breaks+1.215
 Decoder+1.168
yon+1.160
Neg logit contributions
unks-1.670
oppers-1.626
imir-1.577
arn-1.420
avis-1.392
Token80
Token ID1795
Feature activation-7.75e+00

Pos logit contributions
gently+2.492
Maker+2.486
breaks+2.381
 Decoder+2.314
ona+2.287
Neg logit contributions
unks-3.326
oppers-3.238
imir-3.158
arn-2.814
avis-2.762
Tokenx
Token ID87
Feature activation+5.78e+00

Pos logit contributions
Maker+4.079
gently+3.999
breaks+3.898
FX+3.829
ona+3.777
Neg logit contributions
unks-6.377
oppers-6.153
imir-6.040
arn-5.296
=-=-=-=-=-=-=-=--5.255
Token10
Token ID940
Feature activation+3.76e+00

Pos logit contributions
unks+3.898
oppers+3.839
imir+3.684
arn+3.407
avis+3.301
Neg logit contributions
gently-3.871
Maker-3.855
breaks-3.754
ahime-3.559
 Decoder-3.531
Token50
Token ID1120
Feature activation-1.44e+01

Pos logit contributions
unks+2.707
oppers+2.694
imir+2.580
arn+2.402
avis+2.267
Neg logit contributions
Maker-2.328
gently-2.321
breaks-2.255
yon-2.204
roller-2.147
Token the
Token ID262
Feature activation-5.02e+00

Pos logit contributions
Maker+7.185
gently+7.013
FX+6.996
breaks+6.891
 Decoder+6.718
Neg logit contributions
unks-10.999
oppers-10.546
imir-10.410
=-=-=-=-=-=-=-=--9.285
avis-9.175
Token pixels
Token ID17848
Feature activation-5.62e+00

Pos logit contributions
Maker+2.954
gently+2.895
breaks+2.853
 Decoder+2.791
yon+2.714
Neg logit contributions
unks-3.869
oppers-3.748
imir-3.609
arn-3.269
avis-3.200
Token are
Token ID389
Feature activation-4.77e-01

Pos logit contributions
Maker+3.353
gently+3.273
breaks+3.260
 Decoder+3.121
roller+3.110
Neg logit contributions
unks-4.199
oppers-4.159
imir-3.929
arn-3.564
avis-3.543
Token small
Token ID1402
Feature activation-5.40e-01

Pos logit contributions
Maker+0.302
gently+0.300
breaks+0.290
 Decoder+0.279
yon+0.277
Neg logit contributions
unks-0.352
oppers-0.343
imir-0.331
arn-0.299
avis-0.290
Token enough
Token ID1576
Feature activation-3.65e+00

Pos logit contributions
Maker+0.375
gently+0.373
breaks+0.362
 Decoder+0.349
yon+0.348
Neg logit contributions
unks-0.365
oppers-0.355
imir-0.342
arn-0.305
avis-0.295
Token Alpha
Token ID12995
Feature activation+5.18e-01

Pos logit contributions
Maker+2.954
gently+2.911
breaks+2.810
 Decoder+2.740
yon+2.702
Neg logit contributions
unks-3.587
oppers-3.501
imir-3.398
arn-3.042
avis-2.939
Token Part
Token ID2142
Feature activation+7.92e-01

Pos logit contributions
unks+0.385
oppers+0.376
imir+0.363
arn+0.327
avis+0.316
Neg logit contributions
Maker-0.327
gently-0.324
breaks-0.312
 Decoder-0.302
yon-0.301
Tokenicle
Token ID1548
Feature activation-3.16e+00

Pos logit contributions
unks+0.277
oppers+0.262
imir+0.247
arn+0.192
avis+0.177
Neg logit contributions
gently-0.812
Maker-0.808
breaks-0.790
 Decoder-0.766
yon-0.765
Token X
Token ID1395
Feature activation+1.39e+00

Pos logit contributions
Maker+1.952
gently+1.902
breaks+1.834
 Decoder+1.799
yon+1.761
Neg logit contributions
unks-2.424
oppers-2.351
imir-2.270
arn-2.055
avis-1.981
Token-
Token ID12
Feature activation+1.86e-01

Pos logit contributions
unks+0.950
oppers+0.927
imir+0.901
arn+0.810
avis+0.783
Neg logit contributions
Maker-0.940
gently-0.938
breaks-0.909
yon-0.876
 Decoder-0.873
TokenRay
Token ID19591
Feature activation-1.44e+01

Pos logit contributions
unks+0.095
oppers+0.092
imir+0.087
arn+0.075
avis+0.072
Neg logit contributions
Maker-0.160
gently-0.160
breaks-0.155
 Decoder-0.151
yon-0.151
Token Spect
Token ID13058
Feature activation-6.22e+00

Pos logit contributions
Maker+7.226
 Decoder+6.912
breaks+6.548
yon+6.475
 Terra+6.470
Neg logit contributions
unks-11.656
oppers-11.519
imir-10.700
arn-9.806
=-=-=-=-=-=-=-=--9.792
Tokenrom
Token ID398
Feature activation-7.40e+00

Pos logit contributions
Maker+3.246
gently+3.145
roller+3.084
 Decoder+3.034
ona+3.024
Neg logit contributions
unks-5.169
oppers-5.013
imir-4.910
arn-4.404
=-=-=-=-=-=-=-=--4.353
Tokeneter
Token ID2357
Feature activation-4.80e+00

Pos logit contributions
Maker+3.010
gently+2.820
 Decoder+2.698
yon+2.641
ona+2.622
Neg logit contributions
unks-7.073
oppers-6.769
imir-6.661
arn-6.100
=-=-=-=-=-=-=-=--6.059
Token (
Token ID357
Feature activation+1.16e+00

Pos logit contributions
Maker+2.852
gently+2.811
breaks+2.687
 Decoder+2.646
yon+2.586
Neg logit contributions
unks-3.696
oppers-3.633
imir-3.530
arn-3.158
=-=-=-=-=-=-=-=--3.073
TokenAP
Token ID2969
Feature activation-3.08e+00

Pos logit contributions
unks+0.722
oppers+0.704
imir+0.671
arn+0.603
avis+0.582
Neg logit contributions
Maker-0.858
gently-0.856
breaks-0.837
 Decoder-0.810
yon-0.804
Token it
Token ID340
Feature activation-1.96e-01

Pos logit contributions
Maker+2.328
gently+2.309
breaks+2.214
yon+2.163
ona+2.107
Neg logit contributions
unks-3.227
oppers-3.141
imir-3.070
arn-2.723
=-=-=-=-=-=-=-=--2.699
Token unto
Token ID12722
Feature activation-1.18e+00

Pos logit contributions
Maker+0.129
gently+0.128
breaks+0.124
yon+0.119
 Decoder+0.119
Neg logit contributions
unks-0.140
oppers-0.137
imir-0.132
arn-0.118
avis-0.115
Token thy
Token ID11906
Feature activation-9.91e+00

Pos logit contributions
Maker+0.670
gently+0.665
breaks+0.641
yon+0.612
 Decoder+0.611
Neg logit contributions
unks-0.946
oppers-0.924
imir-0.895
arn-0.812
avis-0.791
Token hands
Token ID2832
Feature activation-4.36e+00

Pos logit contributions
Maker+5.869
gently+5.645
breaks+5.447
ona+5.281
 Decoder+5.244
Neg logit contributions
unks-7.396
oppers-7.066
imir-6.975
=-=-=-=-=-=-=-=--6.225
arn-6.015
Token,
Token ID11
Feature activation-6.70e+00

Pos logit contributions
Maker+2.570
gently+2.537
breaks+2.478
yon+2.374
 Decoder+2.333
Neg logit contributions
unks-3.387
oppers-3.290
imir-3.182
arn-2.833
=-=-=-=-=-=-=-=--2.814
Token thou
Token ID14210
Feature activation-1.43e+01

Pos logit contributions
Maker+3.787
gently+3.757
breaks+3.653
yon+3.485
 Decoder+3.450
Neg logit contributions
unks-5.304
oppers-5.133
imir-4.942
arn-4.411
=-=-=-=-=-=-=-=--4.402
Token shalt
Token ID36258
Feature activation-7.33e+00

Pos logit contributions
gently+5.681
Maker+5.474
breaks+5.181
 Decoder+5.075
ona+5.009
Neg logit contributions
unks-12.573
oppers-12.098
imir-11.909
=-=-=-=-=-=-=-=--10.863
arn-10.342
Token sm
Token ID895
Feature activation-2.10e+00

Pos logit contributions
gently+4.315
Maker+4.258
breaks+4.128
ona+3.984
yon+3.961
Neg logit contributions
unks-5.571
oppers-5.472
imir-5.198
=-=-=-=-=-=-=-=--4.675
arn-4.521
Tokenite
Token ID578
Feature activation-5.87e-01

Pos logit contributions
gently+1.196
Maker+1.194
breaks+1.150
yon+1.112
ona+1.094
Neg logit contributions
unks-1.677
oppers-1.640
imir-1.581
arn-1.419
=-=-=-=-=-=-=-=--1.400
Token every
Token ID790
Feature activation-2.04e+00

Pos logit contributions
Maker+0.362
gently+0.360
breaks+0.348
 Decoder+0.333
yon+0.333
Neg logit contributions
unks-0.443
oppers-0.432
imir-0.418
arn-0.377
avis-0.367
Token male
Token ID4257
Feature activation+6.47e-01

Pos logit contributions
Maker+1.322
gently+1.307
breaks+1.267
 Decoder+1.215
yon+1.214
Neg logit contributions
unks-1.485
oppers-1.450
imir-1.400
arn-1.247
avis-1.215
Token thrust
Token ID14613
Feature activation-1.84e-01

Pos logit contributions
unks+0.148
oppers+0.144
imir+0.139
arn+0.126
avis+0.122
Neg logit contributions
Maker-0.127
gently-0.126
breaks-0.122
 Decoder-0.117
yon-0.116
Token of
Token ID286
Feature activation-3.07e+00

Pos logit contributions
Maker+0.117
gently+0.116
breaks+0.113
 Decoder+0.108
yon+0.108
Neg logit contributions
unks-0.136
oppers-0.132
imir-0.128
arn-0.115
avis-0.112
Token his
Token ID465
Feature activation-6.48e-01

Pos logit contributions
Maker+1.849
gently+1.834
breaks+1.770
yon+1.709
 Decoder+1.702
Neg logit contributions
unks-2.325
oppers-2.285
imir-2.199
arn-1.980
avis-1.932
Token 2012
Token ID2321
Feature activation-2.80e+00

Pos logit contributions
Maker+0.418
gently+0.415
breaks+0.401
 Decoder+0.386
yon+0.386
Neg logit contributions
unks-0.471
oppers-0.460
imir-0.443
arn-0.398
avis-0.387
Token pres
Token ID906
Feature activation-2.64e+00

Pos logit contributions
Maker+1.729
gently+1.708
breaks+1.667
yon+1.617
 Decoder+1.596
Neg logit contributions
unks-2.091
oppers-2.046
imir-1.975
arn-1.756
=-=-=-=-=-=-=-=--1.729
Token­
Token ID3907
Feature activation-1.43e+01

Pos logit contributions
gently+1.365
Maker+1.362
breaks+1.307
yon+1.262
ona+1.237
Neg logit contributions
unks-2.233
oppers-2.203
imir-2.120
arn-1.913
=-=-=-=-=-=-=-=--1.910
Tokenid
Token ID312
Feature activation-2.65e+00

Pos logit contributions
gently+8.092
breaks+7.966
yon+7.944
ona+7.777
topic+7.775
Neg logit contributions
unks-9.597
oppers-9.190
=-=-=-=-=-=-=-=--8.807
imir-8.118
 disg-8.090
Token­
Token ID3907
Feature activation-1.23e+01

Pos logit contributions
gently+1.366
Maker+1.356
breaks+1.293
yon+1.260
ona+1.243
Neg logit contributions
unks-2.251
oppers-2.219
imir-2.122
=-=-=-=-=-=-=-=--1.930
arn-1.929
Tokenen
Token ID268
Feature activation-4.84e+00

Pos logit contributions
gently+6.369
ona+6.139
breaks+6.043
yon+5.892
efe+5.841
Neg logit contributions
unks-9.605
oppers-9.486
=-=-=-=-=-=-=-=--8.641
imir-8.493
 disg-8.417
Token­
Token ID3907
Feature activation-8.62e+00

Pos logit contributions
gently+2.345
Maker+2.339
breaks+2.205
yon+2.170
ona+2.162
Neg logit contributions
unks-4.201
oppers-4.116
imir-3.994
arn-3.607
=-=-=-=-=-=-=-=--3.584
Tokent
Token ID83
Feature activation-3.19e+00

Pos logit contributions
gently+6.543
yon+6.237
breaks+6.204
Maker+6.188
ona+6.049
Neg logit contributions
unks-5.067
oppers-4.906
imir-4.626
=-=-=-=-=-=-=-=--4.077
arn-4.052
Token the
Token ID262
Feature activation-6.20e+00

Pos logit contributions
Maker+4.709
breaks+4.586
gently+4.570
 Decoder+4.553
ona+4.242
Neg logit contributions
oppers-6.728
unks-6.660
imir-6.417
avis-5.852
=-=-=-=-=-=-=-=--5.780
Token two
Token ID734
Feature activation-9.31e+00

Pos logit contributions
Maker+3.605
gently+3.575
breaks+3.478
 Decoder+3.431
yon+3.294
Neg logit contributions
unks-4.660
oppers-4.659
imir-4.440
avis-3.975
arn-3.969
Token different
Token ID1180
Feature activation-3.89e+00

Pos logit contributions
Maker+4.719
gently+4.640
breaks+4.576
 Decoder+4.415
yon+4.275
Neg logit contributions
oppers-7.484
unks-7.374
imir-7.196
avis-6.464
=-=-=-=-=-=-=-=--6.350
Token types
Token ID3858
Feature activation-3.38e+00

Pos logit contributions
Maker+2.222
gently+2.192
breaks+2.140
 Decoder+2.064
yon+2.034
Neg logit contributions
unks-3.042
oppers-3.018
imir-2.902
arn-2.608
avis-2.585
Token of
Token ID286
Feature activation-6.03e+00

Pos logit contributions
Maker+1.952
gently+1.942
breaks+1.882
 Decoder+1.823
yon+1.783
Neg logit contributions
unks-2.628
oppers-2.580
imir-2.506
arn-2.253
avis-2.202
Token block
Token ID2512
Feature activation-1.41e+01

Pos logit contributions
Maker+3.142
gently+3.093
breaks+3.062
 Decoder+2.977
ona+2.842
Neg logit contributions
oppers-4.896
unks-4.888
imir-4.693
avis-4.230
arn-4.220
Token syntax
Token ID15582
Feature activation-5.00e+00

Pos logit contributions
Maker+5.619
breaks+5.555
gently+5.204
 Decoder+5.037
yon+4.831
Neg logit contributions
oppers-11.965
unks-11.954
imir-11.563
inia-10.481
estial-10.462
Token besides
Token ID13769
Feature activation-1.28e+01

Pos logit contributions
gently+2.751
Maker+2.736
breaks+2.665
 Decoder+2.566
yon+2.515
Neg logit contributions
unks-3.976
oppers-3.931
imir-3.814
arn-3.420
avis-3.370
Token the
Token ID262
Feature activation-4.83e+00

Pos logit contributions
Maker+6.642
 Decoder+6.549
breaks+6.498
gently+6.460
ona+5.988
Neg logit contributions
oppers-9.550
unks-9.241
imir-8.822
avis-8.254
=-=-=-=-=-=-=-=--8.201
Token fact
Token ID1109
Feature activation-3.89e+00

Pos logit contributions
Maker+2.930
gently+2.877
breaks+2.835
 Decoder+2.771
yon+2.664
Neg logit contributions
unks-3.566
oppers-3.549
imir-3.384
arn-3.041
avis-2.989
Token that
Token ID326
Feature activation-3.93e+00

Pos logit contributions
Maker+2.285
gently+2.245
breaks+2.210
 Decoder+2.123
yon+2.079
Neg logit contributions
unks-2.972
oppers-2.954
imir-2.843
arn-2.574
avis-2.510
Token
Token ID247
Feature activation+2.03e+00

Pos logit contributions
unks+0.093
oppers+0.091
imir+0.087
arn+0.077
avis+0.075
Neg logit contributions
Maker-0.105
gently-0.105
breaks-0.102
 Decoder-0.098
yon-0.098
Tokenve
Token ID303
Feature activation-4.60e+00

Pos logit contributions
unks+1.403
oppers+1.375
imir+1.328
arn+1.202
avis+1.150
Neg logit contributions
Maker-1.380
gently-1.354
breaks-1.322
 Decoder-1.283
yon-1.262
Token had
Token ID550
Feature activation-3.49e+00

Pos logit contributions
Maker+2.994
gently+2.965
breaks+2.872
yon+2.801
roller+2.772
Neg logit contributions
unks-3.246
oppers-3.188
imir-3.002
arn-2.636
avis-2.619
Token as
Token ID355
Feature activation+1.26e+00

Pos logit contributions
Maker+2.126
gently+2.092
breaks+2.038
yon+1.961
 Decoder+1.939
Neg logit contributions
unks-2.633
oppers-2.573
imir-2.474
arn-2.213
avis-2.171
Token pres
Token ID906
Feature activation-5.27e+00

Pos logit contributions
unks+0.915
oppers+0.892
imir+0.864
arn+0.780
avis+0.755
Neg logit contributions
Maker-0.806
gently-0.803
breaks-0.776
 Decoder-0.745
yon-0.739
Token­
Token ID3907
Feature activation-1.41e+01

Pos logit contributions
gently+2.579
Maker+2.565
breaks+2.462
yon+2.396
ona+2.366
Neg logit contributions
unks-4.520
oppers-4.461
imir-4.255
=-=-=-=-=-=-=-=--3.929
arn-3.828
Tokenid
Token ID312
Feature activation-4.79e+00

Pos logit contributions
gently+8.174
roller+7.908
yon+7.888
breaks+7.776
ona+7.739
Neg logit contributions
unks-9.538
oppers-9.145
=-=-=-=-=-=-=-=--8.827
imir-8.156
 disg-8.020
Token­
Token ID3907
Feature activation-1.18e+01

Pos logit contributions
gently+2.381
Maker+2.341
breaks+2.200
yon+2.175
ona+2.171
Neg logit contributions
unks-4.101
oppers-4.064
imir-3.844
=-=-=-=-=-=-=-=--3.589
arn-3.509
Tokenent
Token ID298
Feature activation-4.54e-02

Pos logit contributions
gently+6.363
ona+5.970
breaks+5.873
roller+5.842
yon+5.724
Neg logit contributions
unks-9.067
oppers-8.927
=-=-=-=-=-=-=-=--8.173
 disg-7.942
imir-7.922
Token since
Token ID1201
Feature activation-4.04e+00

Pos logit contributions
Maker+0.029
gently+0.029
breaks+0.028
 Decoder+0.026
yon+0.026
Neg logit contributions
unks-0.034
oppers-0.033
imir-0.032
arn-0.029
avis-0.028
Token at
Token ID379
Feature activation-2.40e+00

Pos logit contributions
Maker+2.299
gently+2.279
breaks+2.210
yon+2.128
 Decoder+2.118
Neg logit contributions
unks-3.173
oppers-3.136
imir-3.008
arn-2.740
=-=-=-=-=-=-=-=--2.680
Token not
Token ID407
Feature activation+3.26e+00

Pos logit contributions
unks+1.776
oppers+1.722
imir+1.663
arn+1.479
avis+1.438
Neg logit contributions
Maker-1.974
gently-1.951
breaks-1.902
 Decoder-1.855
yon-1.841
Token persist
Token ID21160
Feature activation+4.47e+00

Pos logit contributions
unks+2.199
oppers+2.137
imir+2.065
arn+1.858
avis+1.780
Neg logit contributions
Maker-2.263
gently-2.207
breaks-2.155
 Decoder-2.101
ahime-2.087
Token at
Token ID379
Feature activation-3.41e+00

Pos logit contributions
unks+2.993
oppers+2.874
imir+2.793
arn+2.531
avis+2.426
Neg logit contributions
Maker-3.126
gently-3.067
breaks-2.962
 Decoder-2.918
ahime-2.892
Token the
Token ID262
Feature activation-2.34e+00

Pos logit contributions
Maker+2.053
gently+2.047
breaks+2.009
yon+1.908
 Decoder+1.901
Neg logit contributions
unks-2.582
oppers-2.510
imir-2.437
arn-2.192
avis-2.140
Token site
Token ID2524
Feature activation-6.28e+00

Pos logit contributions
Maker+1.559
gently+1.557
breaks+1.516
yon+1.452
 Decoder+1.447
Neg logit contributions
unks-1.644
oppers-1.600
imir-1.548
arn-1.381
avis-1.338
Token of
Token ID286
Feature activation-1.41e+01

Pos logit contributions
gently+3.460
Maker+3.415
breaks+3.406
ona+3.193
yon+3.139
Neg logit contributions
unks-4.993
oppers-4.897
imir-4.762
arn-4.283
=-=-=-=-=-=-=-=--4.247
Token entry
Token ID5726
Feature activation-1.28e-01

Pos logit contributions
breaks+6.811
gently+6.770
Maker+6.362
 Xander+6.153
 Decoder+6.058
Neg logit contributions
unks-11.214
oppers-11.148
imir-10.868
=-=-=-=-=-=-=-=--9.548
estial-9.545
Token in
Token ID287
Feature activation-5.33e+00

Pos logit contributions
Maker+0.080
gently+0.080
breaks+0.077
 Decoder+0.074
yon+0.074
Neg logit contributions
unks-0.095
oppers-0.093
imir-0.090
arn-0.081
avis-0.079
Token animals
Token ID4695
Feature activation-3.10e+00

Pos logit contributions
gently+3.173
Maker+3.148
breaks+3.091
 Decoder+2.886
ona+2.885
Neg logit contributions
unks-4.061
oppers-3.966
imir-3.852
arn-3.464
=-=-=-=-=-=-=-=--3.365
Token,
Token ID11
Feature activation-1.54e+00

Pos logit contributions
gently+1.897
Maker+1.892
breaks+1.844
ona+1.743
yon+1.742
Neg logit contributions
unks-2.324
oppers-2.293
imir-2.209
arn-1.977
=-=-=-=-=-=-=-=--1.927
Token M
Token ID337
Feature activation+6.58e-01

Pos logit contributions
Maker+0.959
gently+0.954
breaks+0.925
 Decoder+0.883
yon+0.879
Neg logit contributions
unks-1.146
oppers-1.119
imir-1.084
arn-0.976
avis-0.948
Token
Token ID198
Feature activation+1.37e+00

Pos logit contributions
unks+0.843
oppers+0.820
imir+0.792
arn+0.718
avis+0.695
Neg logit contributions
Maker-0.705
gently-0.701
breaks-0.677
yon-0.648
 Decoder-0.647
Token
Token ID198
Feature activation+1.87e+00

Pos logit contributions
unks+1.028
oppers+0.996
imir+0.969
arn+0.879
avis+0.851
Neg logit contributions
Maker-0.848
gently-0.847
breaks-0.815
 Decoder-0.788
ahime-0.779
Token1
Token ID16
Feature activation+4.79e+00

Pos logit contributions
unks+1.329
oppers+1.288
imir+1.249
arn+1.131
avis+1.086
Neg logit contributions
Maker-1.229
gently-1.219
breaks-1.183
 Decoder-1.152
yon-1.132
Token $
Token ID720
Feature activation+1.24e+00

Pos logit contributions
unks+3.270
oppers+3.145
imir+3.079
arn+2.817
avis+2.649
Neg logit contributions
gently-3.195
Maker-3.188
breaks-3.088
ahime-2.974
roller-2.943
Token g
Token ID308
Feature activation-1.87e+00

Pos logit contributions
unks+0.800
oppers+0.776
imir+0.747
arn+0.667
avis+0.640
Neg logit contributions
Maker-0.896
gently-0.892
breaks-0.865
 Decoder-0.835
yon-0.832
Tokendb
Token ID9945
Feature activation+1.17e+01

Pos logit contributions
Maker+1.231
gently+1.219
breaks+1.184
yon+1.136
 Decoder+1.136
Neg logit contributions
unks-1.340
oppers-1.311
imir-1.261
arn-1.125
avis-1.098
Token --
Token ID1377
Feature activation+1.48e+00

Pos logit contributions
unks+7.558
oppers+7.036
imir+6.778
arn+6.498
avis+6.081
Neg logit contributions
gently-7.483
Maker-7.426
breaks-7.406
ahime-7.310
 Decoder-6.917
Tokenquiet
Token ID39624
Feature activation+8.34e+00

Pos logit contributions
unks+1.039
oppers+1.010
imir+0.970
arn+0.881
avis+0.848
Neg logit contributions
Maker-0.994
gently-0.986
breaks-0.952
 Decoder-0.922
yon-0.917
Token./
Token ID24457
Feature activation+5.23e-01

Pos logit contributions
unks+5.563
oppers+5.410
imir+5.208
arn+4.785
avis+4.532
Neg logit contributions
gently-5.386
Maker-5.337
breaks-5.189
ahime-5.059
yon-4.999
Tokenrop
Token ID1773
Feature activation+6.98e+00

Pos logit contributions
unks+0.361
oppers+0.351
imir+0.338
arn+0.304
avis+0.293
Neg logit contributions
Maker-0.357
gently-0.354
breaks-0.343
 Decoder-0.331
yon-0.330
Tokenas
Token ID292
Feature activation+4.45e+00

Pos logit contributions
unks+4.573
oppers+4.425
imir+4.171
arn+3.778
avis+3.698
Neg logit contributions
Maker-4.932
gently-4.925
breaks-4.748
 Decoder-4.640
roller-4.521
Token office
Token ID2607
Feature activation+3.83e+00

Pos logit contributions
unks+0.128
oppers+0.124
imir+0.119
arn+0.106
avis+0.103
Neg logit contributions
Maker-0.137
gently-0.136
breaks-0.132
 Decoder-0.127
yon-0.127
Token tied
Token ID8165
Feature activation+4.09e+00

Pos logit contributions
unks+2.652
oppers+2.539
imir+2.488
arn+2.247
avis+2.161
Neg logit contributions
Maker-2.569
gently-2.555
breaks-2.454
 Decoder-2.398
yon-2.386
Token to
Token ID284
Feature activation+4.06e+00

Pos logit contributions
unks+2.660
oppers+2.602
imir+2.555
arn+2.278
avis+2.197
Neg logit contributions
Maker-2.872
gently-2.821
breaks-2.742
 Decoder-2.715
ahime-2.689
Token power
Token ID1176
Feature activation+4.66e+00

Pos logit contributions
unks+2.873
oppers+2.821
imir+2.703
arn+2.453
avis+2.367
Neg logit contributions
Maker-2.636
gently-2.629
breaks-2.534
yon-2.424
ahime-2.421
Token-
Token ID12
Feature activation+5.54e+00

Pos logit contributions
unks+3.205
oppers+3.052
imir+2.959
arn+2.723
avis+2.580
Neg logit contributions
gently-3.171
Maker-3.084
breaks-2.969
 Decoder-2.926
yon-2.907
Tokenbro
Token ID7957
Feature activation+1.19e+01

Pos logit contributions
unks+4.330
oppers+4.220
imir+4.050
arn+3.785
avis+3.665
Neg logit contributions
gently-3.115
Maker-3.066
breaks-2.962
 Decoder-2.928
yon-2.868
Tokenk
Token ID74
Feature activation+9.34e-01

Pos logit contributions
unks+5.789
oppers+5.310
imir+5.103
arn+4.546
avis+4.208
Neg logit contributions
gently-9.779
Maker-9.547
breaks-9.523
ahime-9.406
 Decoder-9.357
Tokenering
Token ID1586
Feature activation+3.99e+00

Pos logit contributions
unks+0.342
oppers+0.326
imir+0.303
arn+0.239
avis+0.221
Neg logit contributions
Maker-0.940
gently-0.936
breaks-0.917
 Decoder-0.894
yon-0.892
Token with
Token ID351
Feature activation+1.25e+00

Pos logit contributions
unks+2.796
oppers+2.696
imir+2.614
arn+2.376
avis+2.281
Neg logit contributions
gently-2.645
Maker-2.619
breaks-2.533
 Decoder-2.453
yon-2.448
Token the
Token ID262
Feature activation+3.72e+00

Pos logit contributions
unks+0.901
oppers+0.879
imir+0.847
arn+0.762
avis+0.740
Neg logit contributions
Maker-0.817
gently-0.812
breaks-0.785
 Decoder-0.752
yon-0.752
Token far
Token ID1290
Feature activation+3.45e+00

Pos logit contributions
unks+2.550
oppers+2.497
imir+2.382
arn+2.155
avis+2.086
Neg logit contributions
gently-2.519
Maker-2.512
breaks-2.442
ahime-2.335
yon-2.316
Token relief
Token ID8259
Feature activation+2.66e-01

Pos logit contributions
unks+3.289
oppers+3.199
imir+3.073
arn+2.797
avis+2.687
Neg logit contributions
Maker-2.924
gently-2.854
breaks-2.767
ahime-2.699
yon-2.677
Token from
Token ID422
Feature activation-3.91e+00

Pos logit contributions
unks+0.198
oppers+0.193
imir+0.186
arn+0.168
avis+0.163
Neg logit contributions
Maker-0.167
gently-0.166
breaks-0.161
 Decoder-0.154
yon-0.154
Token business
Token ID1597
Feature activation+4.67e+00

Pos logit contributions
gently+2.449
Maker+2.446
breaks+2.387
yon+2.278
 Decoder+2.268
Neg logit contributions
unks-2.850
oppers-2.787
imir-2.694
arn-2.386
=-=-=-=-=-=-=-=--2.331
Token problems
Token ID2761
Feature activation+8.03e+00

Pos logit contributions
unks+3.330
oppers+3.212
imir+3.074
arn+2.835
avis+2.744
Neg logit contributions
gently-3.023
Maker-3.015
 Decoder-2.844
breaks-2.831
yon-2.803
Token and
Token ID290
Feature activation+2.66e+00

Pos logit contributions
unks+5.174
oppers+5.045
imir+4.759
arn+4.418
avis+4.221
Neg logit contributions
Maker-5.520
gently-5.476
breaks-5.221
 Decoder-5.164
yon-5.116
Token bad
Token ID2089
Feature activation+8.85e+00

Pos logit contributions
unks+1.911
oppers+1.866
imir+1.794
arn+1.639
avis+1.577
Neg logit contributions
Maker-1.726
gently-1.709
breaks-1.644
 Decoder-1.593
ahime-1.589
Token debts
Token ID20250
Feature activation+4.36e+00

Pos logit contributions
unks+5.759
oppers+5.514
imir+5.351
arn+4.859
avis+4.733
Neg logit contributions
Maker-6.092
gently-6.041
 Decoder-5.787
breaks-5.671
yon-5.608
Token.
Token ID13
Feature activation+2.64e+00

Pos logit contributions
unks+3.020
oppers+2.953
imir+2.825
arn+2.584
avis+2.487
Neg logit contributions
Maker-2.903
gently-2.892
breaks-2.756
 Decoder-2.705
yon-2.672
Token Standing
Token ID22493
Feature activation-9.36e-01

Pos logit contributions
unks+1.939
oppers+1.881
imir+1.816
arn+1.651
avis+1.598
Neg logit contributions
gently-1.670
Maker-1.669
breaks-1.596
ahime-1.538
yon-1.535
Token Lord
Token ID4453
Feature activation+4.00e-01

Pos logit contributions
Maker+0.592
gently+0.586
breaks+0.570
 Decoder+0.547
yon+0.544
Neg logit contributions
unks-0.691
oppers-0.674
imir-0.652
arn-0.586
avis-0.569
Token Gan
Token ID23207
Feature activation-1.52e+00

Pos logit contributions
unks+0.311
oppers+0.304
imir+0.294
arn+0.267
avis+0.259
Neg logit contributions
Maker-0.238
gently-0.237
breaks-0.228
 Decoder-0.218
yon-0.218
Token,
Token ID11
Feature activation+5.46e+00

Pos logit contributions
unks+2.455
oppers+2.390
imir+2.292
arn+2.109
avis+1.998
Neg logit contributions
Maker-2.308
gently-2.297
breaks-2.204
yon-2.138
roller-2.115
Token2
Token ID17
Feature activation+4.56e+00

Pos logit contributions
unks+3.883
oppers+3.820
imir+3.599
arn+3.294
avis+3.170
Neg logit contributions
gently-3.517
Maker-3.510
breaks-3.304
yon-3.259
roller-3.213
Token,
Token ID11
Feature activation+5.14e+00

Pos logit contributions
unks+3.233
oppers+3.140
imir+2.995
arn+2.780
avis+2.630
Neg logit contributions
gently-2.986
Maker-2.966
breaks-2.831
yon-2.759
roller-2.742
Token..
Token ID492
Feature activation+2.25e+00

Pos logit contributions
unks+3.291
oppers+3.203
imir+3.039
arn+2.784
avis+2.630
Neg logit contributions
gently-3.660
Maker-3.642
breaks-3.463
ahime-3.421
yon-3.407
Token *)
Token ID31936
Feature activation+5.97e+00

Pos logit contributions
unks+1.598
oppers+1.568
imir+1.496
arn+1.363
avis+1.307
Neg logit contributions
Maker-1.483
gently-1.481
breaks-1.415
yon-1.370
 Decoder-1.363
Token graph
Token ID4823
Feature activation+1.01e+01

Pos logit contributions
unks+4.273
oppers+4.235
imir+4.047
arn+3.679
avis+3.551
Neg logit contributions
gently-3.716
Maker-3.670
breaks-3.534
ahime-3.448
yon-3.439
TokenList
Token ID8053
Feature activation+3.23e+00

Pos logit contributions
unks+6.742
oppers+6.419
imir+6.058
arn+5.730
avis+5.439
Neg logit contributions
gently-6.523
breaks-6.194
Maker-6.052
ahime-5.949
yon-5.912
Token =
Token ID796
Feature activation+6.46e+00

Pos logit contributions
unks+2.344
oppers+2.277
imir+2.177
arn+1.996
avis+1.920
Neg logit contributions
gently-2.070
Maker-2.034
breaks-1.984
yon-1.900
 Decoder-1.894
Token Table
Token ID8655
Feature activation+3.13e+00

Pos logit contributions
unks+4.410
oppers+4.296
imir+4.141
arn+3.763
avis+3.592
Neg logit contributions
gently-4.229
Maker-4.182
breaks-4.130
ahime-3.979
ona-3.932
Token[
Token ID58
Feature activation+1.43e+00

Pos logit contributions
unks+2.140
oppers+2.063
imir+1.987
arn+1.805
avis+1.743
Neg logit contributions
gently-2.123
Maker-2.093
breaks-2.044
 Decoder-1.972
yon-1.966
Token {
Token ID1391
Feature activation+1.80e-01

Pos logit contributions
unks+0.993
oppers+0.970
imir+0.931
arn+0.840
avis+0.807
Neg logit contributions
Maker-0.959
gently-0.956
breaks-0.923
yon-0.889
 Decoder-0.889
Token privacy
Token ID6782
Feature activation+7.39e+00

Pos logit contributions
unks+1.698
oppers+1.663
imir+1.594
arn+1.436
avis+1.394
Neg logit contributions
gently-1.518
Maker-1.515
breaks-1.470
 Decoder-1.400
yon-1.398
Token-
Token ID12
Feature activation+3.02e+00

Pos logit contributions
oppers+4.938
unks+4.938
imir+4.636
arn+4.207
=-=-=-=-=-=-=-=-+4.086
Neg logit contributions
Maker-4.847
gently-4.801
breaks-4.589
ahime-4.503
yon-4.473
Tokencentric
Token ID28577
Feature activation+6.97e+00

Pos logit contributions
unks+2.063
oppers+2.021
imir+1.931
arn+1.744
avis+1.684
Neg logit contributions
Maker-2.056
gently-2.012
breaks-1.944
 Decoder-1.920
yon-1.891
Token variant
Token ID15304
Feature activation-5.47e-01

Pos logit contributions
unks+4.804
oppers+4.725
imir+4.432
arn+4.053
avis+3.853
Neg logit contributions
gently-4.567
Maker-4.495
breaks-4.395
yon-4.187
ahime-4.146
Token called
Token ID1444
Feature activation+6.23e+00

Pos logit contributions
Maker+0.339
gently+0.336
breaks+0.325
 Decoder+0.312
yon+0.311
Neg logit contributions
unks-0.412
oppers-0.402
imir-0.389
arn-0.351
avis-0.341
Token Focus
Token ID17061
Feature activation+9.09e+00

Pos logit contributions
unks+4.421
oppers+4.279
imir+4.092
arn+3.694
avis+3.601
Neg logit contributions
gently-4.000
Maker-3.960
breaks-3.875
yon-3.683
ahime-3.666
Token and
Token ID290
Feature activation+4.29e+00

Pos logit contributions
unks+6.175
oppers+5.951
imir+5.737
arn+5.191
avis+5.139
Neg logit contributions
gently-5.902
breaks-5.679
Maker-5.577
ahime-5.414
 Decoder-5.262
Token speed
Token ID2866
Feature activation+1.27e+00

Pos logit contributions
unks+3.090
oppers+3.023
imir+2.900
arn+2.604
avis+2.541
Neg logit contributions
gently-2.751
Maker-2.713
breaks-2.643
yon-2.532
ahime-2.508
Token improvements
Token ID8561
Feature activation+2.97e+00

Pos logit contributions
unks+0.883
oppers+0.861
imir+0.829
arn+0.743
avis+0.719
Neg logit contributions
gently-0.852
Maker-0.851
breaks-0.820
 Decoder-0.791
yon-0.787
Token transferred
Token ID11172
Feature activation-3.27e+00

Pos logit contributions
unks+2.094
oppers+2.037
imir+1.960
arn+1.768
avis+1.701
Neg logit contributions
gently-1.961
Maker-1.956
breaks-1.899
yon-1.811
 Decoder-1.804
Token from
Token ID422
Feature activation+1.70e+00

Pos logit contributions
gently+1.998
Maker+1.997
breaks+1.905
yon+1.836
 Decoder+1.824
Neg logit contributions
unks-2.476
oppers-2.417
imir-2.316
arn-2.096
=-=-=-=-=-=-=-=--2.054
Token
Token ID198
Feature activation-1.44e+00

Pos logit contributions
unks+2.438
oppers+2.380
imir+2.302
arn+2.067
avis+1.995
Neg logit contributions
Maker-2.218
gently-2.201
breaks-2.130
yon-2.051
 Decoder-2.049
Token
Token ID198
Feature activation+2.81e-01

Pos logit contributions
Maker+0.819
gently+0.809
breaks+0.782
yon+0.744
 Decoder+0.744
Neg logit contributions
unks-1.162
oppers-1.145
imir-1.102
arn-0.998
avis-0.974
Tokenthat
Token ID5562
Feature activation-2.15e+00

Pos logit contributions
unks+0.209
oppers+0.204
imir+0.197
arn+0.178
avis+0.172
Neg logit contributions
Maker-0.177
gently-0.175
breaks-0.170
 Decoder-0.163
yon-0.162
Token the
Token ID262
Feature activation-4.10e+00

Pos logit contributions
Maker+1.321
gently+1.301
breaks+1.262
 Decoder+1.207
yon+1.202
Neg logit contributions
unks-1.636
oppers-1.597
imir-1.543
arn-1.397
avis-1.357
Token valves
Token ID41843
Feature activation+2.81e+00

Pos logit contributions
Maker+2.587
gently+2.525
breaks+2.456
 Decoder+2.391
yon+2.354
Neg logit contributions
unks-3.020
oppers-2.940
imir-2.826
arn-2.560
avis-2.472
Token are
Token ID389
Feature activation-3.59e-02

Pos logit contributions
unks+1.925
oppers+1.873
imir+1.822
arn+1.633
avis+1.567
Neg logit contributions
Maker-1.911
gently-1.899
breaks-1.835
 Decoder-1.773
yon-1.773
Token supposed
Token ID4385
Feature activation-2.62e+00

Pos logit contributions
Maker+0.024
gently+0.024
breaks+0.023
 Decoder+0.022
yon+0.022
Neg logit contributions
unks-0.026
oppers-0.025
imir-0.024
arn-0.022
avis-0.021
Token to
Token ID284
Feature activation+3.58e+00

Pos logit contributions
gently+1.579
Maker+1.576
breaks+1.510
yon+1.448
 Decoder+1.444
Neg logit contributions
unks-2.012
oppers-1.977
imir-1.885
arn-1.699
avis-1.665
Token satisfy
Token ID15959
Feature activation-3.08e+00

Pos logit contributions
unks+2.579
oppers+2.495
imir+2.454
arn+2.216
avis+2.115
Neg logit contributions
Maker-2.314
gently-2.268
breaks-2.236
 Decoder-2.141
yon-2.132
Token conflicting
Token ID24916
Feature activation+2.13e+00

Pos logit contributions
Maker+1.880
gently+1.870
breaks+1.798
 Decoder+1.715
yon+1.709
Neg logit contributions
unks-2.338
oppers-2.272
imir-2.197
arn-1.985
avis-1.942
Token demands
Token ID8665
Feature activation-1.78e+00

Pos logit contributions
unks+1.545
oppers+1.499
imir+1.449
arn+1.312
avis+1.260
Neg logit contributions
Maker-1.369
gently-1.361
breaks-1.323
 Decoder-1.273
yon-1.271
Token Clintons
Token ID30234
Feature activation-1.74e+00

Pos logit contributions
unks+1.853
oppers+1.817
imir+1.747
arn+1.582
avis+1.528
Neg logit contributions
Maker-1.647
gently-1.642
breaks-1.583
yon-1.521
 Decoder-1.513
Token outside
Token ID2354
Feature activation+2.87e+00

Pos logit contributions
Maker+1.116
gently+1.111
breaks+1.071
 Decoder+1.037
yon+1.030
Neg logit contributions
unks-1.263
oppers-1.229
imir-1.186
arn-1.068
avis-1.037
Token #
Token ID1303
Feature activation+2.69e+00

Pos logit contributions
unks+2.028
oppers+1.989
imir+1.911
arn+1.728
avis+1.666
Neg logit contributions
gently-1.895
Maker-1.891
breaks-1.814
yon-1.731
 Decoder-1.723
TokenDeb
Token ID16587
Feature activation+1.11e+01

Pos logit contributions
unks+1.701
oppers+1.648
imir+1.587
arn+1.421
avis+1.356
Neg logit contributions
gently-1.962
Maker-1.961
breaks-1.894
 Decoder-1.849
yon-1.826
Tokenates
Token ID689
Feature activation+2.64e+00

Pos logit contributions
unks+6.392
oppers+5.985
imir+5.695
arn+5.196
avis+4.978
Neg logit contributions
gently-8.410
Maker-8.391
ahime-8.173
 Decoder-8.136
 Xander-8.022
Token2016
Token ID5304
Feature activation+1.49e-01

Pos logit contributions
unks+1.959
oppers+1.911
imir+1.848
arn+1.678
avis+1.624
Neg logit contributions
gently-1.644
Maker-1.635
breaks-1.573
yon-1.509
 Decoder-1.508
Token pic
Token ID8301
Feature activation-9.08e-01

Pos logit contributions
unks+0.115
oppers+0.112
imir+0.108
arn+0.098
avis+0.095
Neg logit contributions
Maker-0.089
gently-0.089
breaks-0.086
 Decoder-0.082
yon-0.082
Token.
Token ID13
Feature activation-9.22e-01

Pos logit contributions
Maker+0.546
gently+0.537
breaks+0.521
yon+0.500
 Decoder+0.499
Neg logit contributions
unks-0.701
oppers-0.687
imir-0.663
arn-0.598
avis-0.584
Tokentwitter
Token ID6956
Feature activation-2.09e+00

Pos logit contributions
Maker+0.786
gently+0.782
breaks+0.764
yon+0.740
 Decoder+0.737
Neg logit contributions
unks-0.479
oppers-0.463
imir-0.440
arn-0.376
avis-0.361
Token.
Token ID13
Feature activation-9.49e-01

Pos logit contributions
Maker+1.215
gently+1.186
breaks+1.157
yon+1.110
 Decoder+1.100
Neg logit contributions
unks-1.653
oppers-1.634
imir-1.568
arn-1.411
avis-1.382
Tokencom
Token ID785
Feature activation-3.19e+00

Pos logit contributions
Maker+0.523
gently+0.517
breaks+0.500
yon+0.478
 Decoder+0.472
Neg logit contributions
unks-0.779
oppers-0.764
imir-0.738
arn-0.670
avis-0.655
Token.
Token ID13
Feature activation-1.57e+00

Pos logit contributions
Maker+1.458
gently+1.455
breaks+1.408
yon+1.342
 Decoder+1.338
Neg logit contributions
unks-1.863
oppers-1.834
imir-1.780
arn-1.592
avis-1.556
Token It
Token ID632
Feature activation-2.97e-01

Pos logit contributions
Maker+0.900
gently+0.884
breaks+0.857
 Decoder+0.828
yon+0.819
Neg logit contributions
unks-1.256
oppers-1.232
imir-1.191
arn-1.078
avis-1.052
Token happened
Token ID3022
Feature activation+2.62e-01

Pos logit contributions
Maker+0.192
gently+0.191
breaks+0.185
 Decoder+0.177
yon+0.177
Neg logit contributions
unks-0.215
oppers-0.210
imir-0.202
arn-0.182
avis-0.177
Token in
Token ID287
Feature activation+2.70e+00

Pos logit contributions
unks+0.190
oppers+0.185
imir+0.179
arn+0.161
avis+0.156
Neg logit contributions
Maker-0.170
gently-0.168
breaks-0.163
 Decoder-0.157
yon-0.156
Token 2015
Token ID1853
Feature activation+1.81e+00

Pos logit contributions
unks+1.944
oppers+1.890
imir+1.818
arn+1.650
avis+1.608
Neg logit contributions
Maker-1.749
gently-1.742
breaks-1.679
 Decoder-1.597
yon-1.596
Token;
Token ID26
Feature activation+1.57e-01

Pos logit contributions
unks+1.303
oppers+1.269
imir+1.228
arn+1.109
avis+1.071
Neg logit contributions
Maker-1.174
gently-1.160
breaks-1.123
yon-1.079
 Decoder-1.077
Token it
Token ID340
Feature activation+2.72e+00

Pos logit contributions
unks+0.115
oppers+0.112
imir+0.108
arn+0.098
avis+0.095
Neg logit contributions
Maker-0.100
gently-0.099
breaks-0.096
 Decoder-0.092
yon-0.092
Token
Token ID447
Feature activation+3.29e+00

Pos logit contributions
unks+1.875
oppers+1.826
imir+1.780
arn+1.598
avis+1.541
Neg logit contributions
Maker-1.842
gently-1.822
breaks-1.750
 Decoder-1.716
yon-1.697
Token
Token ID247
Feature activation+2.28e+00

Pos logit contributions
unks+2.689
oppers+2.630
imir+2.574
arn+2.350
avis+2.293
Neg logit contributions
Maker-1.782
gently-1.737
breaks-1.666
yon-1.625
 Decoder-1.609
Tokens
Token ID82
Feature activation+4.18e+00

Pos logit contributions
unks+1.828
oppers+1.789
imir+1.734
arn+1.583
avis+1.539
Neg logit contributions
Maker-1.302
gently-1.284
breaks-1.234
 Decoder-1.195
yon-1.183
Token happening
Token ID5836
Feature activation+4.87e-01

Pos logit contributions
unks+2.878
oppers+2.813
imir+2.764
arn+2.501
avis+2.406
Neg logit contributions
Maker-2.754
gently-2.750
breaks-2.654
 Decoder-2.567
ahime-2.542
Token taste
Token ID6938
Feature activation-6.04e-01

Pos logit contributions
unks+1.603
oppers+1.544
imir+1.501
arn+1.352
avis+1.293
Neg logit contributions
Maker-1.542
gently-1.532
breaks-1.487
 Decoder-1.433
yon-1.427
Token or
Token ID393
Feature activation-6.52e-01

Pos logit contributions
Maker+0.381
gently+0.377
breaks+0.366
 Decoder+0.350
yon+0.350
Neg logit contributions
unks-0.447
oppers-0.437
imir-0.421
arn-0.379
avis-0.369
Token better
Token ID1365
Feature activation+4.07e+00

Pos logit contributions
Maker+0.408
gently+0.405
breaks+0.392
yon+0.376
 Decoder+0.376
Neg logit contributions
unks-0.486
oppers-0.474
imir-0.457
arn-0.412
avis-0.401
Token;
Token ID26
Feature activation-3.75e+00

Pos logit contributions
unks+2.820
oppers+2.696
imir+2.645
arn+2.396
=-=-=-=-=-=-=-=-+2.284
Neg logit contributions
Maker-2.747
gently-2.727
breaks-2.624
 Decoder-2.542
yon-2.530
Token otherwise
Token ID4306
Feature activation-1.76e-01

Pos logit contributions
gently+2.256
Maker+2.255
breaks+2.173
 Decoder+2.055
ona+2.051
Neg logit contributions
unks-2.850
oppers-2.795
imir-2.685
arn-2.415
avis-2.361
Token they
Token ID484
Feature activation-2.25e-01

Pos logit contributions
Maker+0.112
gently+0.111
breaks+0.107
 Decoder+0.103
yon+0.103
Neg logit contributions
unks-0.130
oppers-0.127
imir-0.123
arn-0.111
avis-0.107
Token won
Token ID1839
Feature activation-2.10e+00

Pos logit contributions
Maker+0.142
gently+0.141
breaks+0.137
yon+0.131
 Decoder+0.131
Neg logit contributions
unks-0.166
oppers-0.162
imir-0.156
arn-0.141
avis-0.137
Token
Token ID447
Feature activation-3.06e-01

Pos logit contributions
Maker+1.188
gently+1.171
breaks+1.139
yon+1.085
 Decoder+1.076
Neg logit contributions
unks-1.697
oppers-1.674
imir-1.599
arn-1.458
avis-1.428
Token
Token ID247
Feature activation+3.48e-01

Pos logit contributions
Maker+0.150
gently+0.148
breaks+0.142
 Decoder+0.134
yon+0.134
Neg logit contributions
unks-0.270
oppers-0.264
imir-0.257
arn-0.236
avis-0.230
Tokent
Token ID83
Feature activation+2.09e+00

Pos logit contributions
unks+0.198
oppers+0.192
imir+0.183
arn+0.160
avis+0.153
Neg logit contributions
Maker-0.279
gently-0.277
breaks-0.270
 Decoder-0.261
yon-0.261
Token go
Token ID467
Feature activation+3.06e-01

Pos logit contributions
unks+1.488
oppers+1.445
imir+1.397
arn+1.266
avis+1.218
Neg logit contributions
Maker-1.384
gently-1.371
breaks-1.336
 Decoder-1.281
yon-1.273
Token's
Token ID338
Feature activation+6.80e+00

Pos logit contributions
unks+3.664
oppers+3.536
imir+3.451
arn+3.140
avis+2.989
Neg logit contributions
Maker-3.411
gently-3.376
breaks-3.228
 Decoder-3.198
yon-3.150
Token home
Token ID1363
Feature activation+4.40e+00

Pos logit contributions
unks+4.536
oppers+4.335
imir+4.202
arn+3.828
avis+3.616
Neg logit contributions
Maker-4.637
gently-4.537
breaks-4.362
ahime-4.322
 Decoder-4.297
Token farm
Token ID5318
Feature activation-7.31e-01

Pos logit contributions
unks+2.987
oppers+2.865
imir+2.783
arn+2.539
avis+2.407
Neg logit contributions
Maker-2.998
gently-2.936
 Decoder-2.815
breaks-2.803
yon-2.763
Token,
Token ID11
Feature activation+2.53e+00

Pos logit contributions
Maker+0.461
gently+0.458
breaks+0.444
 Decoder+0.424
yon+0.424
Neg logit contributions
unks-0.541
oppers-0.528
imir-0.510
arn-0.460
avis-0.446
Token and
Token ID290
Feature activation+4.30e+00

Pos logit contributions
unks+1.824
oppers+1.776
imir+1.719
arn+1.553
avis+1.500
Neg logit contributions
Maker-1.627
gently-1.611
breaks-1.561
yon-1.505
 Decoder-1.505
Token some
Token ID617
Feature activation+3.42e+00

Pos logit contributions
unks+2.961
oppers+2.880
imir+2.794
arn+2.515
=-=-=-=-=-=-=-=-+2.400
Neg logit contributions
Maker-2.860
gently-2.840
breaks-2.754
ahime-2.676
yon-2.666
Token from
Token ID422
Feature activation+3.83e+00

Pos logit contributions
unks+2.325
oppers+2.246
imir+2.184
arn+1.940
avis+1.865
Neg logit contributions
Maker-2.341
gently-2.321
breaks-2.252
 Decoder-2.191
ahime-2.170
Token another
Token ID1194
Feature activation+6.25e-01

Pos logit contributions
unks+2.785
oppers+2.710
imir+2.627
arn+2.375
avis+2.287
Neg logit contributions
Maker-2.418
gently-2.408
breaks-2.316
yon-2.249
ahime-2.244
Token farm
Token ID5318
Feature activation-2.25e+00

Pos logit contributions
unks+0.454
oppers+0.441
imir+0.427
arn+0.385
avis+0.372
Neg logit contributions
Maker-0.403
gently-0.401
breaks-0.388
 Decoder-0.373
yon-0.372
Token.
Token ID13
Feature activation+1.93e+00

Pos logit contributions
Maker+1.401
gently+1.392
breaks+1.354
yon+1.282
 Decoder+1.276
Neg logit contributions
unks-1.678
oppers-1.654
imir-1.588
arn-1.430
avis-1.389
Token
Token ID198
Feature activation-1.34e+00

Pos logit contributions
unks+1.415
oppers+1.371
imir+1.333
arn+1.202
avis+1.160
Neg logit contributions
gently-1.226
Maker-1.219
breaks-1.179
yon-1.128
ahime-1.124
Token money
Token ID1637
Feature activation-6.04e+00

Pos logit contributions
unks+1.055
oppers+1.024
imir+0.982
arn+0.885
avis+0.852
Neg logit contributions
Maker-1.109
gently-1.105
breaks-1.062
 Decoder-1.035
yon-1.029
Token and
Token ID290
Feature activation-2.31e+00

Pos logit contributions
Maker+3.478
gently+3.349
breaks+3.344
ona+3.180
roller+3.128
Neg logit contributions
unks-4.749
oppers-4.654
imir-4.494
arn-3.975
avis-3.917
Token their
Token ID511
Feature activation-2.76e+00

Pos logit contributions
Maker+1.439
gently+1.422
breaks+1.380
 Decoder+1.324
yon+1.317
Neg logit contributions
unks-1.731
oppers-1.691
imir-1.638
arn-1.466
avis-1.432
Token families
Token ID4172
Feature activation+2.50e+00

Pos logit contributions
Maker+1.789
gently+1.770
breaks+1.719
 Decoder+1.639
yon+1.637
Neg logit contributions
unks-1.993
oppers-1.942
imir-1.885
arn-1.694
avis-1.648
Token have
Token ID423
Feature activation+1.72e+00

Pos logit contributions
unks+1.776
oppers+1.710
imir+1.667
arn+1.505
avis+1.439
Neg logit contributions
Maker-1.647
gently-1.641
breaks-1.578
 Decoder-1.533
ahime-1.518
Token good
Token ID922
Feature activation-2.80e+00

Pos logit contributions
unks+1.243
oppers+1.206
imir+1.168
arn+1.051
avis+1.017
Neg logit contributions
Maker-1.118
gently-1.115
breaks-1.074
 Decoder-1.039
yon-1.033
Token benefits
Token ID4034
Feature activation-2.13e+00

Pos logit contributions
Maker+1.752
gently+1.734
breaks+1.704
ona+1.604
yon+1.603
Neg logit contributions
unks-2.059
oppers-2.030
imir-1.952
arn-1.765
avis-1.711
Token,
Token ID11
Feature activation-1.09e+00

Pos logit contributions
Maker+1.297
gently+1.278
breaks+1.249
 Decoder+1.185
yon+1.184
Neg logit contributions
unks-1.620
oppers-1.588
imir-1.525
arn-1.372
avis-1.340
Token everybody
Token ID7288
Feature activation-2.74e+00

Pos logit contributions
Maker+0.684
gently+0.679
breaks+0.657
 Decoder+0.629
yon+0.627
Neg logit contributions
unks-0.812
oppers-0.792
imir-0.764
arn-0.689
avis-0.670
Token gains
Token ID8810
Feature activation-1.21e+00

Pos logit contributions
Maker+1.717
gently+1.696
breaks+1.665
yon+1.579
 Decoder+1.578
Neg logit contributions
unks-2.022
oppers-1.984
imir-1.903
arn-1.700
avis-1.676
Token from
Token ID422
Feature activation+2.48e+00

Pos logit contributions
Maker+0.744
gently+0.737
breaks+0.714
 Decoder+0.684
yon+0.679
Neg logit contributions
unks-0.919
oppers-0.898
imir-0.867
arn-0.781
avis-0.761
Token was
Token ID373
Feature activation+1.59e+00

Pos logit contributions
Maker+0.860
gently+0.853
breaks+0.829
 Decoder+0.790
yon+0.790
Neg logit contributions
unks-1.031
oppers-1.008
imir-0.975
arn-0.877
avis-0.854
Token twice
Token ID5403
Feature activation+3.24e+00

Pos logit contributions
unks+1.105
oppers+1.077
imir+1.039
arn+0.932
avis+0.904
Neg logit contributions
Maker-1.077
gently-1.068
breaks-1.039
yon-0.998
 Decoder-0.996
Token as
Token ID355
Feature activation+1.07e+00

Pos logit contributions
unks+2.193
oppers+2.146
imir+2.050
arn+1.858
avis+1.813
Neg logit contributions
gently-2.229
Maker-2.222
breaks-2.138
yon-2.074
 Decoder-2.069
Token high
Token ID1029
Feature activation-1.62e+00

Pos logit contributions
unks+0.788
oppers+0.770
imir+0.743
arn+0.675
avis+0.654
Neg logit contributions
Maker-0.673
gently-0.669
breaks-0.647
 Decoder-0.622
yon-0.621
Token at
Token ID379
Feature activation-1.90e+00

Pos logit contributions
Maker+0.993
gently+0.986
breaks+0.958
 Decoder+0.914
yon+0.911
Neg logit contributions
unks-1.229
oppers-1.201
imir-1.162
arn-1.042
avis-1.015
Token charter
Token ID17422
Feature activation-8.45e+00

Pos logit contributions
Maker+1.236
gently+1.228
breaks+1.190
yon+1.143
 Decoder+1.142
Neg logit contributions
unks-1.360
oppers-1.330
imir-1.281
arn-1.152
avis-1.117
Token schools
Token ID4266
Feature activation-8.66e-01

Pos logit contributions
Maker+4.108
gently+4.045
breaks+3.947
yon+3.800
ona+3.786
Neg logit contributions
oppers-7.008
unks-6.970
imir-6.730
arn-6.073
avis-6.031
Token (
Token ID357
Feature activation+7.33e-01

Pos logit contributions
Maker+0.558
gently+0.553
breaks+0.537
 Decoder+0.515
yon+0.514
Neg logit contributions
unks-0.630
oppers-0.615
imir-0.594
arn-0.533
avis-0.518
TokenSt
Token ID1273
Feature activation+4.12e+00

Pos logit contributions
unks+0.520
oppers+0.506
imir+0.489
arn+0.439
avis+0.425
Neg logit contributions
Maker-0.484
gently-0.482
breaks-0.467
 Decoder-0.449
yon-0.447
Tokenuit
Token ID5013
Feature activation+3.14e+00

Pos logit contributions
unks+2.276
oppers+2.200
imir+2.130
arn+1.864
avis+1.787
Neg logit contributions
Maker-3.388
gently-3.341
breaks-3.278
 Decoder-3.167
ahime-3.135
Token and
Token ID290
Feature activation-7.50e-01

Pos logit contributions
unks+2.260
oppers+2.191
imir+2.126
arn+1.944
avis+1.862
Neg logit contributions
Maker-2.043
gently-2.037
breaks-1.953
 Decoder-1.863
yon-1.862
Token Mer
Token ID4638
Feature activation+1.39e+00

Pos logit contributions
Maker+0.336
gently+0.333
breaks+0.323
 Decoder+0.310
yon+0.309
Neg logit contributions
unks-0.380
oppers-0.371
imir-0.357
arn-0.322
avis-0.312
Tokenck
Token ID694
Feature activation-5.82e-01

Pos logit contributions
unks+0.956
oppers+0.927
imir+0.898
arn+0.805
avis+0.778
Neg logit contributions
Maker-0.946
gently-0.939
breaks-0.915
 Decoder-0.883
yon-0.872
Tokenx
Token ID87
Feature activation+1.90e+00

Pos logit contributions
Maker+0.338
gently+0.335
breaks+0.323
yon+0.309
 Decoder+0.309
Neg logit contributions
unks-0.461
oppers-0.450
imir-0.435
arn-0.395
avis-0.385
Token said
Token ID531
Feature activation+1.89e+00

Pos logit contributions
unks+1.359
oppers+1.327
imir+1.287
arn+1.163
avis+1.121
Neg logit contributions
gently-1.230
Maker-1.224
breaks-1.184
yon-1.140
 Decoder-1.136
Token just
Token ID655
Feature activation-1.78e+00

Pos logit contributions
unks+1.356
oppers+1.320
imir+1.280
arn+1.156
avis+1.115
Neg logit contributions
Maker-1.227
gently-1.219
breaks-1.179
yon-1.138
 Decoder-1.135
Token after
Token ID706
Feature activation-4.50e+00

Pos logit contributions
Maker+1.040
gently+1.037
breaks+1.003
yon+0.955
 Decoder+0.953
Neg logit contributions
unks-1.392
oppers-1.359
imir-1.308
arn-1.187
=-=-=-=-=-=-=-=--1.157
Token he
Token ID339
Feature activation-1.80e-01

Pos logit contributions
Maker+2.623
gently+2.564
breaks+2.523
 Decoder+2.412
ona+2.398
Neg logit contributions
unks-3.486
oppers-3.449
imir-3.299
arn-2.971
avis-2.923
Token set
Token ID900
Feature activation-2.06e+00

Pos logit contributions
Maker+0.129
gently+0.128
breaks+0.124
 Decoder+0.120
yon+0.120
Neg logit contributions
unks-0.118
oppers-0.114
imir-0.110
arn-0.098
avis-0.094
Token the
Token ID262
Feature activation-6.61e+00

Pos logit contributions
Maker+1.292
gently+1.279
breaks+1.250
yon+1.182
 Decoder+1.179
Neg logit contributions
unks-1.536
oppers-1.496
imir-1.441
arn-1.299
avis-1.268
Token mark
Token ID1317
Feature activation-6.97e-01

Pos logit contributions
Maker+4.074
gently+4.042
breaks+4.016
 Decoder+3.761
 Wim+3.725
Neg logit contributions
unks-4.797
oppers-4.665
imir-4.509
arn-4.003
avis-3.978
Token.
Token ID13
Feature activation+5.48e-01

Pos logit contributions
Maker+0.437
gently+0.434
breaks+0.421
 Decoder+0.402
yon+0.401
Neg logit contributions
unks-0.520
oppers-0.507
imir-0.490
arn-0.442
avis-0.429
TokenThe
Token ID464
Feature activation-2.45e+00

Pos logit contributions
unks+0.094
oppers+0.092
imir+0.089
arn+0.080
avis+0.077
Neg logit contributions
Maker-0.089
gently-0.089
breaks-0.086
 Decoder-0.083
yon-0.083
Token opening
Token ID4756
Feature activation+5.22e+00

Pos logit contributions
Maker+1.552
gently+1.542
breaks+1.479
 Decoder+1.428
yon+1.410
Neg logit contributions
unks-1.798
oppers-1.749
imir-1.702
arn-1.523
avis-1.485
Token ceremony
Token ID10542
Feature activation-2.52e+00

Pos logit contributions
unks+3.336
oppers+3.268
imir+3.180
arn+2.766
 merits+2.701
Neg logit contributions
gently-3.667
Maker-3.657
breaks-3.525
 Decoder-3.463
ahime-3.460
Token featured
Token ID8096
Feature activation+2.31e+00

Pos logit contributions
Maker+1.598
gently+1.578
breaks+1.534
 Decoder+1.464
yon+1.455
Neg logit contributions
unks-1.868
oppers-1.821
imir-1.748
arn-1.581
avis-1.539
Token a
Token ID257
Feature activation+8.93e-01

Pos logit contributions
unks+1.686
oppers+1.648
imir+1.576
arn+1.426
avis+1.375
Neg logit contributions
Maker-1.471
gently-1.460
breaks-1.415
yon-1.371
 Decoder-1.365
Token model
Token ID2746
Feature activation-7.24e-01

Pos logit contributions
unks+0.647
oppers+0.631
imir+0.608
arn+0.548
avis+0.530
Neg logit contributions
Maker-0.577
gently-0.574
breaks-0.557
 Decoder-0.533
yon-0.533
Token wearing
Token ID5762
Feature activation-9.81e-01

Pos logit contributions
Maker+0.467
gently+0.463
breaks+0.448
 Decoder+0.430
yon+0.429
Neg logit contributions
unks-0.527
oppers-0.514
imir-0.496
arn-0.446
avis-0.433
Token a
Token ID257
Feature activation+1.98e+00

Pos logit contributions
Maker+0.604
gently+0.600
breaks+0.579
 Decoder+0.555
yon+0.553
Neg logit contributions
unks-0.741
oppers-0.723
imir-0.699
arn-0.632
avis-0.614
Token dress
Token ID6576
Feature activation+4.84e+00

Pos logit contributions
unks+1.414
oppers+1.378
imir+1.328
arn+1.198
avis+1.157
Neg logit contributions
Maker-1.285
gently-1.277
breaks-1.248
yon-1.191
 Decoder-1.191
Token created
Token ID2727
Feature activation+1.18e+00

Pos logit contributions
unks+3.231
oppers+3.148
imir+3.081
arn+2.764
avis+2.649
Neg logit contributions
gently-3.254
Maker-3.224
breaks-3.173
 Decoder-3.069
yon-3.037
Token from
Token ID422
Feature activation+2.59e+00

Pos logit contributions
unks+0.835
oppers+0.814
imir+0.783
arn+0.704
avis+0.682
Neg logit contributions
Maker-0.785
gently-0.779
breaks-0.757
 Decoder-0.729
yon-0.724
Token self
Token ID2116
Feature activation-6.07e-01

Pos logit contributions
Maker+0.938
gently+0.935
breaks+0.904
yon+0.871
 Decoder+0.871
Neg logit contributions
unks-0.979
oppers-0.956
imir-0.919
arn-0.820
avis-0.796
Token-
Token ID12
Feature activation-8.46e-01

Pos logit contributions
Maker+0.391
gently+0.389
breaks+0.377
yon+0.361
 Decoder+0.361
Neg logit contributions
unks-0.441
oppers-0.431
imir-0.415
arn-0.373
avis-0.362
Tokenlo
Token ID5439
Feature activation+4.47e+00

Pos logit contributions
Maker+0.591
gently+0.589
breaks+0.572
yon+0.550
 Decoder+0.548
Neg logit contributions
unks-0.569
oppers-0.555
imir-0.533
arn-0.473
avis-0.458
Tokenathing
Token ID26927
Feature activation-2.92e+00

Pos logit contributions
unks+2.153
oppers+2.084
imir+1.960
arn+1.716
avis+1.660
Neg logit contributions
Maker-3.984
gently-3.925
breaks-3.801
 Decoder-3.778
ahime-3.709
Token,
Token ID11
Feature activation-5.18e+00

Pos logit contributions
Maker+1.722
gently+1.712
breaks+1.669
yon+1.580
 Decoder+1.572
Neg logit contributions
unks-2.260
oppers-2.237
imir-2.149
arn-1.931
=-=-=-=-=-=-=-=--1.893
Token fear
Token ID3252
Feature activation-2.40e+00

Pos logit contributions
gently+3.046
Maker+3.045
breaks+2.954
yon+2.833
 Decoder+2.794
Neg logit contributions
unks-3.953
oppers-3.914
imir-3.742
arn-3.341
avis-3.279
Token and
Token ID290
Feature activation-6.04e+00

Pos logit contributions
Maker+1.434
gently+1.419
breaks+1.386
yon+1.317
 Decoder+1.307
Neg logit contributions
unks-1.846
oppers-1.812
imir-1.746
arn-1.571
avis-1.535
Token despair
Token ID20234
Feature activation-7.61e+00

Pos logit contributions
Maker+3.802
gently+3.766
breaks+3.684
yon+3.585
 Decoder+3.509
Neg logit contributions
unks-4.348
oppers-4.304
imir-4.068
arn-3.580
avis-3.554
Token that
Token ID326
Feature activation-3.55e+00

Pos logit contributions
Maker+4.013
gently+3.979
breaks+3.952
yon+3.710
ona+3.705
Neg logit contributions
oppers-6.084
unks-6.074
imir-5.840
=-=-=-=-=-=-=-=--5.230
arn-5.178
Token ultimately
Token ID6165
Feature activation-2.92e+00

Pos logit contributions
Maker+2.215
gently+2.197
breaks+2.144
yon+2.044
 Decoder+2.043
Neg logit contributions
unks-2.618
oppers-2.574
imir-2.468
arn-2.207
avis-2.178
Token gets
Token ID3011
Feature activation-1.98e+00

Pos logit contributions
Maker+1.802
gently+1.794
breaks+1.770
yon+1.669
 Decoder+1.657
Neg logit contributions
unks-2.160
oppers-2.132
imir-2.048
arn-1.825
avis-1.799
Token or
Token ID393
Feature activation-4.15e+00

Pos logit contributions
Maker+0.438
gently+0.436
breaks+0.422
 Decoder+0.405
yon+0.402
Neg logit contributions
unks-0.527
oppers-0.514
imir-0.496
arn-0.447
avis-0.435
Token pushing
Token ID7796
Feature activation+2.63e-01

Pos logit contributions
gently+2.541
Maker+2.530
breaks+2.466
 Decoder+2.386
yon+2.324
Neg logit contributions
unks-3.120
oppers-3.013
imir-2.904
arn-2.600
avis-2.560
Token shadows
Token ID16187
Feature activation+1.49e+00

Pos logit contributions
unks+0.195
oppers+0.190
imir+0.184
arn+0.166
avis+0.161
Neg logit contributions
Maker-0.166
gently-0.165
breaks-0.160
 Decoder-0.153
yon-0.152
Token at
Token ID379
Feature activation+1.35e+00

Pos logit contributions
unks+1.055
oppers+1.030
imir+0.997
arn+0.897
avis+0.866
Neg logit contributions
Maker-0.985
gently-0.975
breaks-0.942
 Decoder-0.910
yon-0.908
Token higher
Token ID2440
Feature activation-6.36e+00

Pos logit contributions
unks+0.976
oppers+0.953
imir+0.919
arn+0.829
avis+0.802
Neg logit contributions
Maker-0.876
gently-0.870
breaks-0.840
yon-0.809
 Decoder-0.807
Token IS
Token ID3180
Feature activation-1.11e+01

Pos logit contributions
Maker+3.495
gently+3.482
breaks+3.435
 Decoder+3.277
yon+3.218
Neg logit contributions
unks-5.020
oppers-4.899
imir-4.779
arn-4.283
avis-4.278
TokenOs
Token ID16748
Feature activation-3.08e+00

Pos logit contributions
Maker+5.801
breaks+5.698
gently+5.474
FX+5.363
 Decoder+5.328
Neg logit contributions
unks-8.781
oppers-8.327
imir-8.019
arn-7.357
avis-7.352
Token.
Token ID13
Feature activation-1.14e+00

Pos logit contributions
Maker+1.816
gently+1.816
breaks+1.774
 Decoder+1.675
yon+1.672
Neg logit contributions
unks-2.379
oppers-2.319
imir-2.217
arn-2.009
avis-1.968
Token Color
Token ID5315
Feature activation-2.09e+00

Pos logit contributions
Maker+0.704
gently+0.692
breaks+0.672
 Decoder+0.651
yon+0.642
Neg logit contributions
unks-0.864
oppers-0.846
imir-0.814
arn-0.734
avis-0.715
Token is
Token ID318
Feature activation-2.85e-01

Pos logit contributions
Maker+1.313
gently+1.298
breaks+1.266
 Decoder+1.210
yon+1.200
Neg logit contributions
unks-1.558
oppers-1.523
imir-1.457
arn-1.317
avis-1.287
Token academic
Token ID8233
Feature activation-2.90e-01

Pos logit contributions
Maker+0.182
gently+0.181
breaks+0.175
 Decoder+0.169
yon+0.168
Neg logit contributions
unks-0.209
oppers-0.204
imir-0.197
arn-0.177
avis-0.172
Token contemporary
Token ID11811
Feature activation-1.33e+00

Pos logit contributions
Maker+1.132
gently+1.132
breaks+1.092
 Decoder+1.048
yon+1.048
Neg logit contributions
unks-1.414
oppers-1.382
imir-1.334
arn-1.201
avis-1.168
Token society
Token ID3592
Feature activation-6.37e+00

Pos logit contributions
Maker+0.815
gently+0.812
breaks+0.786
yon+0.754
 Decoder+0.750
Neg logit contributions
unks-1.000
oppers-0.979
imir-0.944
arn-0.850
avis-0.828
Token,
Token ID11
Feature activation-1.44e+00

Pos logit contributions
Maker+3.482
gently+3.471
breaks+3.402
 Decoder+3.222
ona+3.211
Neg logit contributions
unks-5.043
oppers-4.950
imir-4.800
arn-4.301
avis-4.180
Token are
Token ID389
Feature activation-1.31e+00

Pos logit contributions
Maker+0.891
gently+0.890
breaks+0.859
yon+0.826
 Decoder+0.825
Neg logit contributions
unks-1.077
oppers-1.054
imir-1.010
arn-0.910
avis-0.885
Token not
Token ID407
Feature activation-2.88e+00

Pos logit contributions
Maker+0.857
gently+0.851
breaks+0.823
 Decoder+0.791
yon+0.790
Neg logit contributions
unks-0.937
oppers-0.916
imir-0.876
arn-0.786
avis-0.760
Token uncont
Token ID28605
Feature activation-1.62e+01

Pos logit contributions
Maker+1.834
gently+1.831
breaks+1.772
 Decoder+1.707
yon+1.704
Neg logit contributions
unks-2.111
oppers-2.065
imir-1.960
arn-1.744
avis-1.709
Tokenroversial
Token ID46927
Feature activation-2.69e+00

Pos logit contributions
gently+6.613
breaks+6.305
ona+5.932
roller+5.833
yon+5.725
Neg logit contributions
unks-13.795
oppers-13.453
imir-13.009
OME-12.360
 disg-12.357
Token or
Token ID393
Feature activation-5.16e+00

Pos logit contributions
Maker+1.603
gently+1.603
breaks+1.555
yon+1.484
 Decoder+1.471
Neg logit contributions
unks-2.063
oppers-2.045
imir-1.959
arn-1.737
avis-1.704
Token immune
Token ID10900
Feature activation-2.86e+00

Pos logit contributions
gently+3.167
Maker+3.162
breaks+3.062
yon+2.953
 Decoder+2.947
Neg logit contributions
unks-3.848
oppers-3.760
imir-3.567
arn-3.148
avis-3.111
Token from
Token ID422
Feature activation-4.09e+00

Pos logit contributions
gently+1.754
Maker+1.744
breaks+1.678
yon+1.622
ona+1.598
Neg logit contributions
unks-2.157
oppers-2.103
imir-2.040
arn-1.822
=-=-=-=-=-=-=-=--1.777
Token any
Token ID597
Feature activation-2.05e-01

Pos logit contributions
gently+2.479
Maker+2.462
breaks+2.423
yon+2.324
 Decoder+2.319
Neg logit contributions
unks-3.086
oppers-3.020
imir-2.888
arn-2.575
avis-2.508
Token6
Token ID21
Feature activation-7.48e+00

Pos logit contributions
Maker+1.926
gently+1.923
breaks+1.852
 Decoder+1.790
yon+1.785
Neg logit contributions
unks-2.384
oppers-2.332
imir-2.248
arn-2.020
=-=-=-=-=-=-=-=--1.978
Token {
Token ID1391
Feature activation+3.11e+00

Pos logit contributions
Maker+3.502
gently+3.410
breaks+3.349
 Decoder+3.122
yon+3.078
Neg logit contributions
unks-6.528
oppers-6.394
imir-6.226
=-=-=-=-=-=-=-=--5.666
arn-5.631
Token(
Token ID7
Feature activation-5.71e+00

Pos logit contributions
unks+1.799
oppers+1.714
imir+1.660
arn+1.482
avis+1.395
Neg logit contributions
gently-2.427
Maker-2.395
breaks-2.346
ahime-2.306
 Decoder-2.300
Token105
Token ID13348
Feature activation-6.76e+00

Pos logit contributions
Maker+3.566
gently+3.520
breaks+3.449
yon+3.285
 Decoder+3.280
Neg logit contributions
unks-4.154
oppers-4.060
imir-3.902
arn-3.464
avis-3.413
Tokens
Token ID82
Feature activation-5.53e+00

Pos logit contributions
Maker+3.027
breaks+2.959
gently+2.922
yon+2.676
ona+2.599
Neg logit contributions
oppers-6.029
unks-5.999
imir-5.827
=-=-=-=-=-=-=-=--5.253
arn-5.245
Token)}
Token ID38165
Feature activation-1.07e+01

Pos logit contributions
Maker+3.144
gently+3.118
breaks+2.995
yon+2.837
ona+2.830
Neg logit contributions
unks-4.353
oppers-4.269
imir-4.148
arn-3.782
avis-3.667
Token Ne
Token ID3169
Feature activation-2.71e+00

Pos logit contributions
breaks+5.519
Maker+5.439
gently+5.256
 Decoder+5.218
 Wim+5.044
Neg logit contributions
unks-8.295
oppers-8.242
imir-7.979
avis-7.172
=-=-=-=-=-=-=-=--7.116
Token7
Token ID22
Feature activation-3.36e+00

Pos logit contributions
Maker+1.645
gently+1.634
breaks+1.585
yon+1.520
ona+1.515
Neg logit contributions
unks-2.032
oppers-2.003
imir-1.932
arn-1.736
=-=-=-=-=-=-=-=--1.700
Token {
Token ID1391
Feature activation+4.22e+00

Pos logit contributions
Maker+1.792
gently+1.729
breaks+1.689
 Decoder+1.619
yon+1.595
Neg logit contributions
unks-2.834
oppers-2.783
imir-2.681
arn-2.426
avis-2.378
Token(
Token ID7
Feature activation-4.04e+00

Pos logit contributions
unks+2.351
oppers+2.225
imir+2.156
arn+1.936
=-=-=-=-=-=-=-=-+1.804
Neg logit contributions
gently-3.359
Maker-3.285
breaks-3.241
ahime-3.222
 Decoder-3.190
Token69
Token ID3388
Feature activation-7.75e+00

Pos logit contributions
Maker+2.599
gently+2.567
breaks+2.509
 Decoder+2.404
yon+2.396
Neg logit contributions
unks-2.907
oppers-2.834
imir-2.730
arn-2.429
avis-2.378
Token in
Token ID287
Feature activation+3.46e+00

Pos logit contributions
unks+0.806
oppers+0.786
imir+0.757
arn+0.685
avis+0.663
Neg logit contributions
Maker-0.739
gently-0.736
breaks-0.707
 Decoder-0.681
yon-0.679
Token public
Token ID1171
Feature activation+8.93e-01

Pos logit contributions
unks+2.502
oppers+2.438
imir+2.343
arn+2.133
avis+2.046
Neg logit contributions
Maker-2.235
gently-2.216
breaks-2.123
 Decoder-2.043
ahime-2.032
Token '
Token ID705
Feature activation+6.09e-01

Pos logit contributions
unks+0.661
oppers+0.641
imir+0.623
arn+0.563
avis+0.545
Neg logit contributions
Maker-0.565
gently-0.561
breaks-0.538
 Decoder-0.520
yon-0.517
Tokenfor
Token ID1640
Feature activation+4.05e+00

Pos logit contributions
unks+0.429
oppers+0.417
imir+0.403
arn+0.363
avis+0.350
Neg logit contributions
Maker-0.405
gently-0.402
breaks-0.389
 Decoder-0.377
yon-0.373
Token viewing
Token ID11681
Feature activation+1.21e+00

Pos logit contributions
unks+2.829
oppers+2.778
imir+2.683
arn+2.426
avis+2.334
Neg logit contributions
Maker-2.685
gently-2.631
breaks-2.547
 Decoder-2.459
ahime-2.458
Token South
Token ID2520
Feature activation-9.96e+00

Pos logit contributions
unks+0.893
oppers+0.872
imir+0.843
arn+0.763
avis+0.734
Neg logit contributions
Maker-0.764
gently-0.760
breaks-0.732
 Decoder-0.702
yon-0.701
Token Korean
Token ID6983
Feature activation-4.83e+00

Pos logit contributions
Maker+4.472
gently+4.407
breaks+4.335
yon+4.243
 Decoder+4.134
Neg logit contributions
unks-8.422
oppers-8.311
imir-8.089
=-=-=-=-=-=-=-=--7.363
avis-7.340
Token movies
Token ID6918
Feature activation-2.22e+00

Pos logit contributions
Maker+2.806
gently+2.787
breaks+2.709
 Decoder+2.621
yon+2.587
Neg logit contributions
unks-3.735
oppers-3.636
imir-3.525
avis-3.149
arn-3.144
Token and
Token ID290
Feature activation-2.70e+00

Pos logit contributions
Maker+1.309
gently+1.294
breaks+1.257
yon+1.206
 Decoder+1.199
Neg logit contributions
unks-1.728
oppers-1.686
imir-1.626
arn-1.468
avis-1.444
Token owning
Token ID23107
Feature activation-4.87e-01

Pos logit contributions
Maker+1.621
gently+1.603
breaks+1.556
 Decoder+1.492
yon+1.488
Neg logit contributions
unks-2.066
oppers-2.009
imir-1.960
arn-1.753
avis-1.738
Token B
Token ID347
Feature activation-5.48e+00

Pos logit contributions
Maker+0.303
gently+0.301
breaks+0.291
 Decoder+0.279
yon+0.278
Neg logit contributions
unks-0.365
oppers-0.356
imir-0.344
arn-0.311
avis-0.302
Tokent
Token ID83
Feature activation-4.69e+00

Pos logit contributions
Maker+0.122
gently+0.122
breaks+0.119
 Decoder+0.115
yon+0.115
Neg logit contributions
unks-0.075
oppers-0.072
imir-0.069
arn-0.059
avis-0.056
Token improved
Token ID6596
Feature activation-1.69e+00

Pos logit contributions
Maker+2.990
gently+2.988
breaks+2.908
yon+2.806
 Decoder+2.773
Neg logit contributions
unks-3.364
oppers-3.304
imir-3.127
avis-2.758
=-=-=-=-=-=-=-=--2.755
Token seemingly
Token ID9775
Feature activation-5.15e+00

Pos logit contributions
Maker+1.065
gently+1.055
breaks+1.025
 Decoder+0.984
yon+0.974
Neg logit contributions
unks-1.258
oppers-1.228
imir-1.178
arn-1.059
avis-1.031
Token at
Token ID379
Feature activation-2.33e+00

Pos logit contributions
Maker+3.103
gently+3.066
breaks+3.028
 Decoder+2.837
ona+2.836
Neg logit contributions
unks-3.925
oppers-3.829
imir-3.620
arn-3.218
=-=-=-=-=-=-=-=--3.195
Token all
Token ID477
Feature activation-4.42e+00

Pos logit contributions
Maker+1.427
gently+1.409
breaks+1.378
 Decoder+1.310
yon+1.309
Neg logit contributions
unks-1.760
oppers-1.733
imir-1.656
arn-1.489
avis-1.466
Token since
Token ID1201
Feature activation-9.96e+00

Pos logit contributions
Maker+2.656
gently+2.606
breaks+2.560
 Decoder+2.433
ona+2.419
Neg logit contributions
unks-3.373
oppers-3.325
imir-3.158
arn-2.804
avis-2.777
Token those
Token ID883
Feature activation-4.14e+00

Pos logit contributions
Maker+5.247
breaks+5.114
gently+5.092
ona+4.993
 Wim+4.968
Neg logit contributions
oppers-7.752
unks-7.600
imir-7.251
avis-6.747
arn-6.556
Token days
Token ID1528
Feature activation-3.94e+00

Pos logit contributions
gently+2.439
Maker+2.424
breaks+2.409
ona+2.282
yon+2.274
Neg logit contributions
oppers-3.124
unks-3.123
imir-2.991
arn-2.678
avis-2.677
Token.
Token ID13
Feature activation-6.07e+00

Pos logit contributions
Maker+2.259
gently+2.231
breaks+2.193
ona+2.083
 Decoder+2.077
Neg logit contributions
unks-3.077
oppers-3.021
imir-2.916
arn-2.619
avis-2.575
Token They
Token ID1119
Feature activation-1.08e+00

Pos logit contributions
Maker+3.514
gently+3.341
breaks+3.312
 Decoder+3.252
ona+3.169
Neg logit contributions
unks-4.724
oppers-4.697
imir-4.390
arn-4.035
avis-3.925
Token have
Token ID423
Feature activation-4.60e+00

Pos logit contributions
Maker+0.693
gently+0.690
breaks+0.668
 Decoder+0.641
yon+0.640
Neg logit contributions
unks-0.780
oppers-0.761
imir-0.732
arn-0.659
avis-0.639