TOP ACTIVATIONS
MAX = 1.247

M FL Y NN JR
( 0 , 0 , ($ SR . Width ), ($
, r ' \ [ +( \ S + )\ s
angle F ( 0 , ($ s z 1 . Height
angle F ( 0 , ($ i * $ FontSize *
a molecule expressed as exp (- t / u ) where
for a diplomatic resolution . Rather , he is signaling that
perception of potential conflicts of interest , and we have outlined
($ SR . Width ), ($ SR . Height ))) )
up the technical capability a hobby ist can buy for under
with them . Using they rather than their , as Thompson
are not exempt from mer gers and partnership deals . Bib
cold on the Bruins . Instead , Allow ay committed ,
algae within a body of water , to spread and intensify
And , second , rather than pushing a program he
indicated . All salary - cap figures courtesy of Over The
Mac ri a beneficiary , rather than a victim . Again
? OK , fair enough , if de vious )
day , so that the impact point would have the optimum
, try incorporating the following gimmick ( St olen from the

MIN ACTIVATIONS
MIN = -2.292

2012 United States 5 667 Posts # 12 I 'm glad
pur ak ushi Profile Joined August 2012 United States 3
2010 United States 10 53 Posts # 15 honestly if we
2012 United States 3 240 Posts # 10 Happy it worked
2002 United States 3 472 Posts # 11 On September 20
heart l xp Profile Joined September 2010 United States 10
June 2012 Canada 16 10 Posts # 14 On September 20
2011 United States 6 12 Posts # 17 Good to hear
2010 United States 21 50 Posts # 16 Glad to hear
2011 United States 228 65 Posts # 13 On September 20
re i Profile Blog Joined October 2002 United States 3
Dark horse Profile Blog Joined December 2011 United States 228
2010 United States 6 83 Posts Last Edited : 2013 -
was about homosexuality Reviewer : k imp unk rock
J aaaa asper Profile Blog Joined April 2012 United States 5
June 2009 Germany 247 34 Posts # 20 Nice , happy
, I think Reviewer : B iz r Pix
K ier an Profile Blog Joined August 2011 United States 6
2010 boys beware Reviewer : j wt 0001 -
TL AD T 235 21 Posts # 18 Good stuff !

INTERVAL 0.363 - 1.247
CONTAINS 4.822%

iler up as the presumptive franchise savior a title he
her support for off - shore drilling and fracking . It
all three trends . Cur iously , ne ol iber als
, and you return the backer survey , I will place
search Space ][ [ i ]] ], Keys [ search Space

INTERVAL -0.522 - 0.363
CONTAINS 93.891%

I was a pre teen . With all the
House of Representatives ( the lower house ) has 150 members
kilometers . And Hel heim is just one of many Greenland
like , Oh my god , he thinks I
about the series ' future . COM IC V

INTERVAL -1.407 - -0.522
CONTAINS 1.278%

readers don t love Arduino . As a long
he d been sent by central casting to play
or with your friends or roomm ates , for example .
Both of their lives have undergone a massive upheaval , and
. In some cases Hydro has difficulty retaining workers

INTERVAL -2.292 - -1.407
CONTAINS 0.008%

United States 6 12 Posts # 17 Good to hear .
2012 Canada 16 10 Posts # 14 On September 20 2013
2002 United States 3 472 Posts # 11 On September 20
United States 10 53 Posts # 15 honestly if we just
2010 United States 10 53 Posts # 15 honestly if we
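
The four INTERVAL ranges above appear to be equal-width bins spanning MIN to MAX. The sketch below checks that reading. It assumes four bins and uses only the MIN, MAX, and CONTAINS values listed in this dump; the helper name `equal_width_edges` is hypothetical and not part of whatever tool produced the output.

```python
def equal_width_edges(lo, hi, n_bins):
    """Return n_bins + 1 evenly spaced edges from lo to hi."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]


# MIN and MAX exactly as listed in the dump.
edges = equal_width_edges(-2.292, 1.247, 4)

# Shares from the CONTAINS lines, highest interval first.
shares = [4.822, 93.891, 1.278, 0.008]
total_share = sum(shares)

# Print each bin as "low .. high", lowest bin first.
for low, high in zip(edges[:-1], edges[1:]):
    print(f"{low:.3f} .. {high:.3f}")
print(f"total = {total_share:.3f}%")
```

Under this assumption the computed edges agree with the listed interval bounds to within about 0.001, which is what rounding of the displayed MIN and MAX would produce, and the listed shares sum to just under 100%.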
Token
Token ID116
Feature activation-2.28e-02

Pos logit contributions
 etiquette+1.145
atar+1.124
 aquatic+1.123
 atmosp+1.087
edia+1.083
Neg logit contributions
���-0.589
龍�-0.547
 fixme-0.472
 Written-0.464
-0.388
TokenM
Token ID44
Feature activation+1.33e-01

Pos logit contributions
���+0.066
龍�+0.063
 fixme+0.058
 Written+0.057
+0.052
Neg logit contributions
 etiquette-0.058
 aquatic-0.057
atar-0.057
 atmosp-0.054
edia-0.054
TokenFL
Token ID3697
Feature activation+3.90e-01

Pos logit contributions
 etiquette+0.327
 aquatic+0.318
atar+0.316
 atmosp+0.303
edia+0.298
Neg logit contributions
���-0.400
龍�-0.384
 fixme-0.352
 Written-0.349
-0.317
TokenY
Token ID56
Feature activation+2.42e-01

Pos logit contributions
 etiquette+1.017
atar+0.990
 aquatic+0.990
 atmosp+0.943
edia+0.938
Neg logit contributions
���-1.120
龍�-1.064
 fixme-0.973
 Written-0.961
-0.867
TokenNN
Token ID6144
Feature activation+5.47e-01

Pos logit contributions
 etiquette+0.627
 aquatic+0.611
atar+0.609
 atmosp+0.581
edia+0.578
Neg logit contributions
���-0.697
龍�-0.662
 fixme-0.605
 Written-0.598
-0.540
TokenJR
Token ID44817
Feature activation+1.25e+00

Pos logit contributions
 etiquette+1.416
 aquatic+1.379
atar+1.378
 atmosp+1.312
edia+1.307
Neg logit contributions
���-1.579
龍�-1.499
 fixme-1.372
 Written-1.355
-1.224
Token
Token ID8582
Feature activation+1.59e-01

Pos logit contributions
 etiquette+3.208
atar+3.124
 aquatic+3.114
edia+2.978
 atmosp+2.939
Neg logit contributions
���-3.605
龍�-3.409
 fixme-3.117
 Written-3.052
-2.783
Token
Token ID229
Feature activation+4.98e-01

Pos logit contributions
 etiquette+0.387
atar+0.376
 aquatic+0.376
 atmosp+0.357
edia+0.355
Neg logit contributions
���-0.484
龍�-0.463
 fixme-0.425
 Written-0.421
-0.383
Token
Token ID118
Feature activation+2.99e-01

Pos logit contributions
 etiquette+1.770
atar+1.741
 aquatic+1.737
edia+1.681
 atmosp+1.679
Neg logit contributions
���-0.932
龍�-0.878
 fixme-0.762
 Written-0.748
-0.634
Token
Token ID8582
Feature activation+2.64e-02

Pos logit contributions
 etiquette+0.673
 aquatic+0.652
atar+0.652
 atmosp+0.615
edia+0.611
Neg logit contributions
���-0.964
龍�-0.919
 fixme-0.851
 Written-0.841
-0.769
Token
Token ID229
Feature activation+4.82e-01

Pos logit contributions
 etiquette+0.065
 aquatic+0.063
atar+0.063
 atmosp+0.060
edia+0.059
Neg logit contributions
���-0.080
龍�-0.076
 fixme-0.070
 Written-0.069
-0.063
Token(
Token ID7
Feature activation+3.04e-01

Pos logit contributions
���+1.787
龍�+1.698
 fixme+1.570
 Written+1.558
+1.403
Neg logit contributions
 etiquette-1.517
 aquatic-1.472
atar-1.465
 atmosp-1.405
edia-1.387
Token0
Token ID15
Feature activation-3.15e-01

Pos logit contributions
 etiquette+0.730
atar+0.713
 aquatic+0.712
 atmosp+0.674
edia+0.672
Neg logit contributions
���-0.931
龍�-0.888
 fixme-0.817
 Written-0.806
-0.736
Token,
Token ID11
Feature activation-2.41e-01

Pos logit contributions
���+1.007
龍�+0.966
 fixme+0.900
 Written+0.891
+0.812
Neg logit contributions
 etiquette-0.709
atar-0.686
 aquatic-0.686
 atmosp-0.647
edia-0.641
Token0
Token ID15
Feature activation+4.78e-02

Pos logit contributions
���+0.632
龍�+0.600
 fixme+0.544
 Written+0.538
+0.479
Neg logit contributions
 etiquette-0.685
 aquatic-0.668
atar-0.668
 atmosp-0.641
edia-0.634
Token,
Token ID11
Feature activation-2.42e-01

Pos logit contributions
 etiquette+0.107
atar+0.103
 aquatic+0.103
 atmosp+0.098
edia+0.097
Neg logit contributions
���-0.155
龍�-0.148
 fixme-0.137
 Written-0.136
-0.124
Token($
Token ID16763
Feature activation+1.25e+00

Pos logit contributions
���+0.625
龍�+0.592
 fixme+0.535
 Written+0.529
+0.470
Neg logit contributions
 etiquette-0.700
 aquatic-0.683
atar-0.682
 atmosp-0.655
edia-0.649
TokenSR
Token ID12562
Feature activation+2.47e-01

Pos logit contributions
atar+3.285
 etiquette+3.272
 aquatic+3.201
edia+3.146
 atmosp+3.038
Neg logit contributions
���-3.505
龍�-3.223
 fixme-2.967
 Written-2.953
-2.609
Token.
Token ID13
Feature activation-8.88e-02

Pos logit contributions
 etiquette+0.509
atar+0.493
 aquatic+0.493
 atmosp+0.464
edia+0.459
Neg logit contributions
���-0.843
龍�-0.808
 fixme-0.750
 Written-0.744
-0.684
TokenWidth
Token ID30916
Feature activation-4.96e-01

Pos logit contributions
���+0.214
龍�+0.202
 fixme+0.181
 Written+0.179
+0.158
Neg logit contributions
 etiquette-0.271
 aquatic-0.266
atar-0.265
 atmosp-0.255
edia-0.253
Token),
Token ID828
Feature activation+4.38e-02

Pos logit contributions
���+1.562
龍�+1.502
 fixme+1.403
 Written+1.393
+1.261
Neg logit contributions
 etiquette-1.125
atar-1.089
 aquatic-1.087
 atmosp-1.031
edia-1.021
Token($
Token ID16763
Feature activation+1.13e+00

Pos logit contributions
 etiquette+0.115
atar+0.112
 aquatic+0.112
 atmosp+0.107
edia+0.107
Neg logit contributions
���-0.124
龍�-0.118
 fixme-0.108
 Written-0.106
-0.096
Token,
Token ID837
Feature activation-6.81e-03

Pos logit contributions
 etiquette+0.191
 aquatic+0.185
atar+0.185
 atmosp+0.176
edia+0.174
Neg logit contributions
���-0.268
龍�-0.256
 fixme-0.237
 Written-0.235
-0.214
Token r
Token ID374
Feature activation+2.96e-01

Pos logit contributions
���+0.021
龍�+0.020
 fixme+0.018
 Written+0.018
+0.016
Neg logit contributions
 etiquette-0.016
atar-0.016
 aquatic-0.016
 atmosp-0.015
edia-0.015
Token '
Token ID705
Feature activation+6.38e-01

Pos logit contributions
 etiquette+0.772
 aquatic+0.752
atar+0.751
 atmosp+0.717
edia+0.711
Neg logit contributions
���-0.848
龍�-0.807
 fixme-0.738
 Written-0.730
-0.658
Token\
Token ID59
Feature activation+9.82e-02

Pos logit contributions
 etiquette+1.605
atar+1.567
 aquatic+1.565
 atmosp+1.486
edia+1.477
Neg logit contributions
���-1.889
龍�-1.790
 fixme-1.639
 Written-1.614
-1.462
Token[
Token ID58
Feature activation+7.64e-01

Pos logit contributions
 etiquette+0.261
 aquatic+0.254
atar+0.253
 atmosp+0.243
edia+0.240
Neg logit contributions
���-0.276
龍�-0.263
 fixme-0.240
 Written-0.238
-0.214
Token+(
Token ID33747
Feature activation+1.24e+00

Pos logit contributions
 etiquette+1.865
atar+1.830
 aquatic+1.823
 atmosp+1.730
edia+1.722
Neg logit contributions
���-2.307
龍�-2.169
 fixme-2.002
 Written-1.981
-1.790
Token\
Token ID59
Feature activation+1.84e-01

Pos logit contributions
 etiquette+2.960
atar+2.914
 aquatic+2.901
 atmosp+2.753
edia+2.735
Neg logit contributions
���-3.777
龍�-3.537
 fixme-3.255
 Written-3.226
EStream-2.910
TokenS
Token ID50
Feature activation+4.60e-01

Pos logit contributions
 etiquette+0.453
atar+0.441
 aquatic+0.440
 atmosp+0.419
edia+0.417
Neg logit contributions
���-0.556
龍�-0.529
 fixme-0.486
 Written-0.481
-0.437
Token+
Token ID10
Feature activation+3.78e-01

Pos logit contributions
 etiquette+1.000
 aquatic+0.969
atar+0.968
 atmosp+0.915
edia+0.906
Neg logit contributions
���-1.514
龍�-1.450
 fixme-1.343
 Written-1.331
-1.219
Token)\
Token ID19415
Feature activation+4.45e-01

Pos logit contributions
 etiquette+0.816
atar+0.793
 aquatic+0.791
 atmosp+0.747
edia+0.741
Neg logit contributions
���-1.254
龍�-1.199
 fixme-1.110
 Written-1.100
-1.008
Tokens
Token ID82
Feature activation+2.88e-01

Pos logit contributions
 etiquette+0.983
atar+0.957
 aquatic+0.953
 atmosp+0.902
edia+0.897
Neg logit contributions
���-1.448
龍�-1.383
 fixme-1.278
 Written-1.266
-1.160
Tokenangle
Token ID9248
Feature activation-5.86e-01

Pos logit contributions
���+1.655
龍�+1.602
 fixme+1.516
 Written+1.512
+1.418
Neg logit contributions
 etiquette-0.367
 aquatic-0.342
atar-0.334
 atmosp-0.301
edia-0.286
TokenF
Token ID37
Feature activation-4.83e-01

Pos logit contributions
���+1.905
龍�+1.817
 fixme+1.681
 Written+1.678
+1.520
Neg logit contributions
 etiquette-1.302
 aquatic-1.262
atar-1.256
 atmosp-1.197
edia-1.166
Token (
Token ID357
Feature activation-7.59e-02

Pos logit contributions
���+1.485
龍�+1.417
 fixme+1.305
 Written+1.295
+1.175
Neg logit contributions
 etiquette-1.153
 aquatic-1.120
atar-1.116
 atmosp-1.066
edia-1.051
Token0
Token ID15
Feature activation-3.15e-01

Pos logit contributions
���+0.262
龍�+0.252
 fixme+0.234
 Written+0.232
+0.214
Neg logit contributions
 etiquette-0.153
 aquatic-0.148
atar-0.148
 atmosp-0.139
edia-0.137
Token,
Token ID11
Feature activation-2.78e-01

Pos logit contributions
���+1.028
龍�+0.984
 fixme+0.912
 Written+0.904
+0.826
Neg logit contributions
 etiquette-0.693
 aquatic-0.672
atar-0.672
 atmosp-0.634
edia-0.628
Token($
Token ID16763
Feature activation+1.20e+00

Pos logit contributions
���+0.718
龍�+0.680
 fixme+0.615
 Written+0.608
+0.540
Neg logit contributions
 etiquette-0.803
 aquatic-0.784
atar-0.783
 atmosp-0.752
edia-0.744
Tokens
Token ID82
Feature activation+2.13e-01

Pos logit contributions
 etiquette+3.266
atar+3.259
 aquatic+3.170
edia+3.106
 atmosp+3.040
Neg logit contributions
���-3.274
龍�-3.018
 fixme-2.750
 Written-2.725
-2.408
Tokenz
Token ID89
Feature activation-7.13e-02

Pos logit contributions
 etiquette+0.364
 aquatic+0.349
atar+0.342
 atmosp+0.323
edia+0.315
Neg logit contributions
���-0.791
龍�-0.773
 fixme-0.724
 Written-0.720
-0.665
Token1
Token ID16
Feature activation-4.60e-02

Pos logit contributions
���+0.181
龍�+0.170
 fixme+0.154
 Written+0.152
+0.135
Neg logit contributions
 etiquette-0.210
atar-0.205
 aquatic-0.205
 atmosp-0.196
edia-0.195
Token.
Token ID13
Feature activation-3.33e-01

Pos logit contributions
���+0.149
龍�+0.143
 fixme+0.133
 Written+0.132
+0.120
Neg logit contributions
 etiquette-0.102
 aquatic-0.098
atar-0.098
 atmosp-0.093
edia-0.092
TokenHeight
Token ID23106
Feature activation-5.62e-01

Pos logit contributions
���+0.849
龍�+0.813
 fixme+0.739
 Written+0.732
+0.649
Neg logit contributions
 etiquette-0.965
 aquatic-0.943
atar-0.935
 atmosp-0.903
edia-0.891
Tokenangle
Token ID9248
Feature activation-5.46e-01

Pos logit contributions
���+1.301
龍�+1.291
 fixme+1.232
 Written+1.225
+1.146
Neg logit contributions
 etiquette-0.300
 aquatic-0.280
atar-0.256
 therap-0.253
 atmosp-0.245
TokenF
Token ID37
Feature activation-4.84e-01

Pos logit contributions
���+1.684
龍�+1.610
 fixme+1.479
 Written+1.474
+1.333
Neg logit contributions
 etiquette-1.302
 aquatic-1.266
atar-1.260
 atmosp-1.200
edia-1.185
Token (
Token ID357
Feature activation-7.81e-02

Pos logit contributions
龍�+1.409
���+1.384
 fixme+1.330
 Written+1.324
+1.167
Neg logit contributions
 etiquette-1.167
 aquatic-1.127
 therap-1.116
atar-1.106
 atmosp-1.065
Token0
Token ID15
Feature activation-3.29e-01

Pos logit contributions
���+0.196
龍�+0.185
 fixme+0.166
 Written+0.164
+0.146
Neg logit contributions
 etiquette-0.232
atar-0.227
 aquatic-0.226
 atmosp-0.217
edia-0.216
Token,
Token ID11
Feature activation-2.86e-01

Pos logit contributions
龍�+0.972
���+0.953
 fixme+0.912
 Written+0.898
+0.809
Neg logit contributions
 etiquette-0.784
 aquatic-0.758
 therap-0.746
atar-0.746
 atmosp-0.713
Token($
Token ID16763
Feature activation+1.16e+00

Pos logit contributions
���+0.975
龍�+0.936
 fixme+0.868
 Written+0.861
+0.790
Neg logit contributions
 etiquette-0.589
 aquatic-0.571
atar-0.570
 atmosp-0.537
edia-0.529
Tokeni
Token ID72
Feature activation+2.81e-02

Pos logit contributions
 etiquette+2.436
atar+2.432
 aquatic+2.355
edia+2.286
 atmosp+2.224
Neg logit contributions
���-3.905
龍�-3.684
 fixme-3.399
 Written-3.390
-3.091
Token*
Token ID9
Feature activation+5.16e-01

Pos logit contributions
 etiquette+0.066
atar+0.064
 aquatic+0.064
 atmosp+0.061
edia+0.061
Neg logit contributions
���-0.087
龍�-0.083
 fixme-0.077
 Written-0.076
-0.069
Token$
Token ID3
Feature activation+1.84e-02

Pos logit contributions
 etiquette+1.402
atar+1.381
 aquatic+1.366
 atmosp+1.308
edia+1.307
Neg logit contributions
���-1.419
龍�-1.333
 fixme-1.217
 Written-1.199
-1.073
TokenFontSize
Token ID38160
Feature activation-4.02e-01

Pos logit contributions
 etiquette+0.055
atar+0.054
 aquatic+0.054
 atmosp+0.052
edia+0.051
Neg logit contributions
���-0.046
龍�-0.043
 fixme-0.039
 Written-0.038
-0.034
Token*
Token ID9
Feature activation+9.71e-02

Pos logit contributions
���+1.275
龍�+1.235
 fixme+1.143
 Written+1.142
+1.036
Neg logit contributions
 etiquette-0.905
 aquatic-0.877
atar-0.874
 atmosp-0.832
edia-0.822
Token a
Token ID257
Feature activation-1.98e-01

Pos logit contributions
���+0.765
龍�+0.731
 fixme+0.681
 Written+0.680
+0.609
Neg logit contributions
 etiquette-0.567
atar-0.554
 aquatic-0.532
edia-0.525
 atmosp-0.508
Token molecule
Token ID27756
Feature activation-6.85e-01

Pos logit contributions
���+0.621
龍�+0.597
 fixme+0.553
 Written+0.552
+0.498
Neg logit contributions
 etiquette-0.451
atar-0.437
 aquatic-0.425
edia-0.412
 atmosp-0.405
Token expressed
Token ID6241
Feature activation+3.29e-01

Pos logit contributions
���+2.039
龍�+1.962
 Written+1.828
 fixme+1.821
+1.629
Neg logit contributions
 etiquette-1.634
atar-1.585
 aquatic-1.547
edia-1.508
 atmosp-1.477
Token as
Token ID355
Feature activation+1.50e-02

Pos logit contributions
 etiquette+0.793
 aquatic+0.771
atar+0.771
 atmosp+0.733
edia+0.726
Neg logit contributions
���-1.008
龍�-0.962
 fixme-0.884
 Written-0.876
-0.796
Token exp
Token ID1033
Feature activation-1.25e-01

Pos logit contributions
 etiquette+0.035
atar+0.034
 aquatic+0.034
edia+0.032
 atmosp+0.032
Neg logit contributions
���-0.046
龍�-0.045
 fixme-0.041
 Written-0.041
-0.037
Token(-
Token ID32590
Feature activation+1.15e+00

Pos logit contributions
���+0.422
龍�+0.405
 fixme+0.378
 Written+0.375
+0.344
Neg logit contributions
 etiquette-0.257
 aquatic-0.248
atar-0.247
 atmosp-0.233
edia-0.229
Tokent
Token ID83
Feature activation+8.06e-02

Pos logit contributions
 etiquette+3.055
atar+3.013
 aquatic+3.001
 atmosp+2.871
edia+2.851
Neg logit contributions
���-3.250
龍�-3.048
 fixme-2.788
 Written-2.751
-2.465
Token/
Token ID14
Feature activation+2.75e-01

Pos logit contributions
 etiquette+0.191
 aquatic+0.184
atar+0.184
 atmosp+0.175
edia+0.174
Neg logit contributions
���-0.250
龍�-0.239
 fixme-0.220
 Written-0.219
-0.198
Tokenu
Token ID84
Feature activation+4.21e-01

Pos logit contributions
 etiquette+0.726
 aquatic+0.702
atar+0.702
 atmosp+0.669
edia+0.665
Neg logit contributions
���-0.778
龍�-0.746
 fixme-0.681
 Written-0.673
-0.608
Token)
Token ID8
Feature activation+1.18e-01

Pos logit contributions
 etiquette+0.998
 aquatic+0.971
atar+0.970
 atmosp+0.922
edia+0.912
Neg logit contributions
���-1.306
龍�-1.246
 fixme-1.148
 Written-1.136
-1.035
Token where
Token ID810
Feature activation-1.57e-01

Pos logit contributions
 etiquette+0.283
atar+0.275
 aquatic+0.275
 atmosp+0.261
edia+0.259
Neg logit contributions
���-0.364
龍�-0.348
 fixme-0.321
 Written-0.318
-0.288
Token for
Token ID329
Feature activation+3.38e-02

Pos logit contributions
 etiquette+0.097
atar+0.094
 aquatic+0.094
 atmosp+0.089
edia+0.089
Neg logit contributions
���-0.128
龍�-0.122
 fixme-0.113
 Written-0.112
-0.102
Token a
Token ID257
Feature activation-4.98e-02

Pos logit contributions
 etiquette+0.074
atar+0.073
 aquatic+0.073
edia+0.068
 atmosp+0.068
Neg logit contributions
���-0.108
龍�-0.104
 fixme-0.097
 Written-0.096
-0.087
Token diplomatic
Token ID13093
Feature activation-3.44e-01

Pos logit contributions
���+0.185
龍�+0.178
 Written+0.168
 fixme+0.168
+0.153
Neg logit contributions
 etiquette-0.085
atar-0.082
 aquatic-0.082
edia-0.076
 atmosp-0.075
Token resolution
Token ID6323
Feature activation+3.13e-01

Pos logit contributions
���+1.059
龍�+1.021
 fixme+0.957
 Written+0.949
+0.849
Neg logit contributions
 etiquette-0.780
atar-0.769
 aquatic-0.761
edia-0.730
 atmosp-0.719
Token.
Token ID13
Feature activation+2.03e-01

Pos logit contributions
 etiquette+0.700
 aquatic+0.679
atar+0.678
 atmosp+0.641
edia+0.640
Neg logit contributions
���-1.003
龍�-0.963
 fixme-0.894
 Written-0.890
-0.811
Token Rather
Token ID11317
Feature activation+1.15e+00

Pos logit contributions
 etiquette+0.512
 aquatic+0.499
atar+0.496
 atmosp+0.474
edia+0.470
Neg logit contributions
���-0.596
龍�-0.570
 fixme-0.524
 Written-0.521
-0.468
Token,
Token ID11
Feature activation+1.26e-01

Pos logit contributions
 etiquette+2.382
 aquatic+2.301
atar+2.289
 atmosp+2.177
edia+2.128
Neg logit contributions
���-3.876
龍�-3.718
 fixme-3.438
 Written-3.415
-3.139
Token he
Token ID339
Feature activation+1.00e-01

Pos logit contributions
 etiquette+0.301
atar+0.294
 aquatic+0.294
 atmosp+0.278
edia+0.276
Neg logit contributions
���-0.386
龍�-0.370
 fixme-0.342
 Written-0.338
-0.307
Token is
Token ID318
Feature activation+5.59e-02

Pos logit contributions
 etiquette+0.229
 aquatic+0.222
atar+0.222
 atmosp+0.210
edia+0.209
Neg logit contributions
���-0.319
龍�-0.305
 fixme-0.282
 Written-0.280
-0.255
Token signaling
Token ID22049
Feature activation+1.87e-01

Pos logit contributions
 etiquette+0.135
atar+0.131
 aquatic+0.130
edia+0.124
 atmosp+0.124
Neg logit contributions
���-0.169
龍�-0.161
 fixme-0.149
 Written-0.148
-0.134
Token that
Token ID326
Feature activation+2.09e-01

Pos logit contributions
 etiquette+0.457
atar+0.448
 aquatic+0.447
edia+0.425
 atmosp+0.423
Neg logit contributions
���-0.554
龍�-0.529
 fixme-0.491
 Written-0.486
-0.438
Token perception
Token ID11202
Feature activation+2.06e-01

Pos logit contributions
���+0.660
龍�+0.631
 fixme+0.588
 Written+0.583
+0.527
Neg logit contributions
 etiquette-0.468
atar-0.460
 aquatic-0.456
edia-0.431
 atmosp-0.430
Token of
Token ID286
Feature activation-2.19e-01

Pos logit contributions
 etiquette+0.525
atar+0.511
 aquatic+0.510
 atmosp+0.487
edia+0.482
Neg logit contributions
���-0.603
龍�-0.574
 fixme-0.525
 Written-0.521
-0.472
Token potential
Token ID2785
Feature activation-4.52e-01

Pos logit contributions
���+0.680
龍�+0.651
 fixme+0.603
 Written+0.599
+0.543
Neg logit contributions
 etiquette-0.508
atar-0.497
 aquatic-0.495
 atmosp-0.469
edia-0.467
Token conflicts
Token ID12333
Feature activation+9.00e-02

Pos logit contributions
���+1.389
龍�+1.342
 fixme+1.251
 Written+1.236
+1.123
Neg logit contributions
 etiquette-1.040
atar-1.017
 aquatic-1.014
edia-0.960
 atmosp-0.958
Token of
Token ID286
Feature activation+8.14e-02

Pos logit contributions
 etiquette+0.221
 aquatic+0.215
atar+0.214
 atmosp+0.204
edia+0.202
Neg logit contributions
���-0.270
龍�-0.258
 fixme-0.238
 Written-0.236
-0.214
Token interest
Token ID1393
Feature activation+1.14e+00

Pos logit contributions
 etiquette+0.105
atar+0.103
 aquatic+0.102
edia+0.092
 atmosp+0.092
Neg logit contributions
���-0.334
龍�-0.322
 fixme-0.307
 Written-0.304
-0.282
Token,
Token ID11
Feature activation-1.44e-01

Pos logit contributions
 etiquette+2.419
 aquatic+2.334
atar+2.330
 atmosp+2.208
edia+2.178
Neg logit contributions
���-3.828
龍�-3.651
 fixme-3.366
 Written-3.310
-3.054
Token and
Token ID290
Feature activation-5.89e-02

Pos logit contributions
���+0.447
龍�+0.428
 fixme+0.398
 Written+0.395
+0.359
Neg logit contributions
 etiquette-0.334
 aquatic-0.326
atar-0.325
 atmosp-0.309
edia-0.308
Token we
Token ID356
Feature activation-7.60e-02

Pos logit contributions
���+0.175
龍�+0.167
 fixme+0.154
 Written+0.152
+0.138
Neg logit contributions
 etiquette-0.145
atar-0.142
 aquatic-0.142
 atmosp-0.135
edia-0.134
Token have
Token ID423
Feature activation+1.99e-03

Pos logit contributions
���+0.241
龍�+0.230
 fixme+0.215
 Written+0.213
+0.192
Neg logit contributions
 etiquette-0.172
atar-0.167
 aquatic-0.167
edia-0.158
 atmosp-0.158
Token outlined
Token ID16493
Feature activation+1.61e-01

Pos logit contributions
 etiquette+0.004
atar+0.004
 aquatic+0.004
edia+0.004
 atmosp+0.004
Neg logit contributions
���-0.006
龍�-0.006
 fixme-0.006
 Written-0.006
-0.005
Token($
Token ID16763
Feature activation+1.25e+00

Pos logit contributions
���+0.625
龍�+0.592
 fixme+0.535
 Written+0.529
+0.470
Neg logit contributions
 etiquette-0.700
 aquatic-0.683
atar-0.682
 atmosp-0.655
edia-0.649
TokenSR
Token ID12562
Feature activation+2.47e-01

Pos logit contributions
atar+3.285
 etiquette+3.272
 aquatic+3.201
edia+3.146
 atmosp+3.038
Neg logit contributions
���-3.505
龍�-3.223
 fixme-2.967
 Written-2.953
-2.609
Token.
Token ID13
Feature activation-8.88e-02

Pos logit contributions
 etiquette+0.509
atar+0.493
 aquatic+0.493
 atmosp+0.464
edia+0.459
Neg logit contributions
���-0.843
龍�-0.808
 fixme-0.750
 Written-0.744
-0.684
TokenWidth
Token ID30916
Feature activation-4.96e-01

Pos logit contributions
���+0.214
龍�+0.202
 fixme+0.181
 Written+0.179
+0.158
Neg logit contributions
 etiquette-0.271
 aquatic-0.266
atar-0.265
 atmosp-0.255
edia-0.253
Token),
Token ID828
Feature activation+4.38e-02

Pos logit contributions
���+1.562
龍�+1.502
 fixme+1.403
 Written+1.393
+1.261
Neg logit contributions
 etiquette-1.125
atar-1.089
 aquatic-1.087
 atmosp-1.031
edia-1.021
Token($
Token ID16763
Feature activation+1.13e+00

Pos logit contributions
 etiquette+0.115
atar+0.112
 aquatic+0.112
 atmosp+0.107
edia+0.107
Neg logit contributions
���-0.124
龍�-0.118
 fixme-0.108
 Written-0.106
-0.096
TokenSR
Token ID12562
Feature activation+1.85e-01

Pos logit contributions
atar+2.404
 etiquette+2.370
 aquatic+2.297
edia+2.254
 atmosp+2.164
Neg logit contributions
���-3.790
龍�-3.505
 fixme-3.273
 Written-3.259
-2.951
Token.
Token ID13
Feature activation-1.56e-01

Pos logit contributions
 etiquette+0.402
 aquatic+0.389
atar+0.386
 atmosp+0.366
edia+0.361
Neg logit contributions
���-0.604
龍�-0.588
 fixme-0.546
 Written-0.541
-0.495
TokenHeight
Token ID23106
Feature activation-5.44e-01

Pos logit contributions
���+0.406
龍�+0.383
 fixme+0.347
 Written+0.342
+0.304
Neg logit contributions
 etiquette-0.450
atar-0.440
 aquatic-0.439
 atmosp-0.421
edia-0.419
Token)))
Token ID22305
Feature activation-1.08e-01

Pos logit contributions
���+1.649
龍�+1.578
 fixme+1.461
 Written+1.449
+1.312
Neg logit contributions
 etiquette-1.311
atar-1.275
 aquatic-1.272
 atmosp-1.211
edia-1.199
Token )
Token ID1267
Feature activation+5.91e-02

Pos logit contributions
���+0.315
龍�+0.300
 fixme+0.275
 Written+0.273
+0.246
Neg logit contributions
 etiquette-0.275
 aquatic-0.267
atar-0.267
 atmosp-0.255
edia-0.253
Token up
Token ID510
Feature activation+5.01e-01

Pos logit contributions
���+0.444
龍�+0.424
 fixme+0.391
 Written+0.387
+0.351
Neg logit contributions
 etiquette-0.354
atar-0.346
 aquatic-0.344
 atmosp-0.328
edia-0.325
Token the
Token ID262
Feature activation+2.00e-01

Pos logit contributions
 etiquette+1.198
atar+1.167
 aquatic+1.164
 atmosp+1.106
edia+1.096
Neg logit contributions
���-1.538
龍�-1.468
 fixme-1.354
 Written-1.341
-1.219
Token technical
Token ID6276
Feature activation+2.39e-01

Pos logit contributions
 etiquette+0.481
atar+0.471
 aquatic+0.467
 atmosp+0.445
edia+0.444
Neg logit contributions
���-0.605
龍�-0.580
 fixme-0.537
 Written-0.530
-0.480
Token capability
Token ID12971
Feature activation+3.40e-02

Pos logit contributions
 etiquette+0.549
atar+0.539
 aquatic+0.535
edia+0.507
 atmosp+0.506
Neg logit contributions
���-0.747
龍�-0.718
 fixme-0.666
 Written-0.658
-0.599
Token a
Token ID257
Feature activation+9.16e-02

Pos logit contributions
 etiquette+0.082
atar+0.080
 aquatic+0.080
 atmosp+0.076
edia+0.076
Neg logit contributions
���-0.102
龍�-0.098
 fixme-0.090
 Written-0.090
-0.081
Token hobby
Token ID20005
Feature activation+1.12e+00

Pos logit contributions
 etiquette+0.200
atar+0.194
 aquatic+0.193
 atmosp+0.183
edia+0.181
Neg logit contributions
���-0.300
龍�-0.288
 fixme-0.267
 Written-0.265
-0.243
Tokenist
Token ID396
Feature activation+1.01e-01

Pos logit contributions
 etiquette+1.191
 aquatic+1.125
atar+1.121
 atmosp+0.987
edia+0.973
Neg logit contributions
���-4.953
龍�-4.791
 fixme-4.513
 Written-4.489
-4.222
Token can
Token ID460
Feature activation+2.01e-02

Pos logit contributions
 etiquette+0.241
atar+0.236
 aquatic+0.233
edia+0.223
 atmosp+0.222
Neg logit contributions
���-0.310
龍�-0.298
 fixme-0.275
 Written-0.273
-0.247
Token buy
Token ID2822
Feature activation-8.74e-02

Pos logit contributions
 etiquette+0.046
 aquatic+0.044
atar+0.043
 atmosp+0.041
edia+0.041
Neg logit contributions
���-0.064
龍�-0.061
 fixme-0.057
 Written-0.056
-0.052
Token for
Token ID329
Feature activation+2.84e-02

Pos logit contributions
���+0.270
龍�+0.259
 fixme+0.243
 Written+0.240
+0.217
Neg logit contributions
 etiquette-0.202
atar-0.199
 aquatic-0.195
edia-0.186
 atmosp-0.186
Token under
Token ID739
Feature activation-3.56e-01

Pos logit contributions
 etiquette+0.068
atar+0.067
 aquatic+0.066
 atmosp+0.063
edia+0.062
Neg logit contributions
���-0.087
龍�-0.083
 fixme-0.076
 Written-0.076
-0.068
Token with
Token ID351
Feature activation+1.73e-01

Pos logit contributions
 etiquette+0.621
 aquatic+0.603
atar+0.602
 atmosp+0.570
edia+0.565
Neg logit contributions
���-0.898
龍�-0.861
 fixme-0.797
 Written-0.791
-0.723
Token them
Token ID606
Feature activation-4.88e-02

Pos logit contributions
 etiquette+0.415
atar+0.406
 aquatic+0.404
 atmosp+0.383
edia+0.382
Neg logit contributions
���-0.528
龍�-0.505
 fixme-0.465
 Written-0.461
-0.418
Token.
Token ID13
Feature activation+1.67e-01

Pos logit contributions
���+0.157
龍�+0.151
 fixme+0.140
 Written+0.139
+0.127
Neg logit contributions
 etiquette-0.108
 aquatic-0.105
atar-0.105
 atmosp-0.099
edia-0.099
Token Using
Token ID8554
Feature activation-2.12e-01

Pos logit contributions
 etiquette+0.428
 aquatic+0.417
atar+0.416
 atmosp+0.397
edia+0.394
Neg logit contributions
���-0.486
龍�-0.463
 fixme-0.424
 Written-0.420
-0.379
Token they
Token ID484
Feature activation+1.79e-01

Pos logit contributions
���+0.646
龍�+0.617
 fixme+0.569
 Written+0.567
+0.510
Neg logit contributions
 etiquette-0.505
atar-0.498
 aquatic-0.493
edia-0.469
 atmosp-0.467
Token rather
Token ID2138
Feature activation+1.11e+00

Pos logit contributions
 etiquette+0.415
 aquatic+0.404
atar+0.403
 atmosp+0.382
edia+0.379
Neg logit contributions
���-0.561
龍�-0.537
 fixme-0.495
 Written-0.491
-0.447
Token than
Token ID621
Feature activation+1.74e-01

Pos logit contributions
 etiquette+2.571
 aquatic+2.493
atar+2.477
 atmosp+2.369
edia+2.315
Neg logit contributions
���-3.535
龍�-3.371
 fixme-3.110
 Written-3.067
-2.817
Token their
Token ID511
Feature activation+1.26e-01

Pos logit contributions
 etiquette+0.419
atar+0.409
 aquatic+0.408
 atmosp+0.386
edia+0.385
Neg logit contributions
���-0.531
龍�-0.508
 fixme-0.468
 Written-0.465
-0.420
Token,
Token ID11
Feature activation-7.98e-02

Pos logit contributions
 etiquette+0.293
atar+0.286
 aquatic+0.285
edia+0.270
 atmosp+0.270
Neg logit contributions
���-0.393
龍�-0.377
 fixme-0.348
 Written-0.347
-0.313
Token as
Token ID355
Feature activation+6.28e-02

Pos logit contributions
���+0.247
龍�+0.237
 fixme+0.219
 Written+0.218
+0.197
Neg logit contributions
 etiquette-0.186
atar-0.182
 aquatic-0.181
edia-0.171
 atmosp-0.171
Token Thompson
Token ID11654
Feature activation+4.28e-01

Pos logit contributions
 etiquette+0.151
atar+0.147
 aquatic+0.147
 atmosp+0.140
edia+0.139
Neg logit contributions
���-0.191
龍�-0.183
 fixme-0.168
 Written-0.167
-0.151
Token are
Token ID389
Feature activation-3.23e-01

Pos logit contributions
 etiquette+0.919
 aquatic+0.893
atar+0.892
 atmosp+0.847
edia+0.839
Neg logit contributions
���-1.209
龍�-1.156
 fixme-1.063
 Written-1.053
-0.960
Token not
Token ID407
Feature activation-6.78e-02

Pos logit contributions
���+0.982
龍�+0.935
 fixme+0.865
 Written+0.863
+0.773
Neg logit contributions
 etiquette-0.769
atar-0.752
 aquatic-0.747
 atmosp-0.712
edia-0.711
Token exempt
Token ID13068
Feature activation-2.28e-01

Pos logit contributions
���+0.213
龍�+0.204
 fixme+0.189
 Written+0.188
+0.170
Neg logit contributions
 etiquette-0.155
atar-0.150
 aquatic-0.150
 atmosp-0.143
edia-0.143
Token from
Token ID422
Feature activation+2.21e-02

Pos logit contributions
���+0.696
龍�+0.671
 fixme+0.622
 Written+0.616
+0.560
Neg logit contributions
 etiquette-0.541
 aquatic-0.525
atar-0.524
 atmosp-0.498
edia-0.495
Token mer
Token ID4017
Feature activation-9.92e-02

Pos logit contributions
 etiquette+0.052
atar+0.051
 aquatic+0.051
 atmosp+0.048
edia+0.048
Neg logit contributions
���-0.069
龍�-0.066
 fixme-0.060
 Written-0.060
-0.054
Tokengers
Token ID5355
Feature activation+1.11e+00

Pos logit contributions
���+0.260
龍�+0.245
 fixme+0.223
 Written+0.221
+0.196
Neg logit contributions
 etiquette-0.282
 aquatic-0.276
atar-0.274
 atmosp-0.264
edia-0.261
Token and
Token ID290
Feature activation+2.28e-01

Pos logit contributions
 etiquette+2.250
atar+2.174
 aquatic+2.158
 atmosp+2.045
edia+2.012
Neg logit contributions
���-3.839
龍�-3.660
 fixme-3.375
 Written-3.320
-3.079
Token partnership
Token ID10413
Feature activation-2.18e-01

Pos logit contributions
 etiquette+0.405
 aquatic+0.390
atar+0.390
 atmosp+0.362
edia+0.356
Neg logit contributions
���-0.844
龍�-0.811
 fixme-0.757
 Written-0.751
-0.695
Token deals
Token ID7529
Feature activation+8.71e-02

Pos logit contributions
���+0.676
龍�+0.651
 fixme+0.607
 Written+0.599
+0.543
Neg logit contributions
 etiquette-0.500
atar-0.493
 aquatic-0.490
edia-0.465
 atmosp-0.461
Token.
Token ID13
Feature activation+1.03e-01

Pos logit contributions
 etiquette+0.195
 aquatic+0.190
atar+0.190
 atmosp+0.179
edia+0.178
Neg logit contributions
���-0.279
龍�-0.268
 fixme-0.248
 Written-0.247
-0.225
Token Bib
Token ID43278
Feature activation-1.57e-01

Pos logit contributions
 etiquette+0.270
 aquatic+0.263
atar+0.263
 atmosp+0.251
edia+0.249
Neg logit contributions
���-0.292
龍�-0.278
 fixme-0.255
 Written-0.252
-0.227
Token cold
Token ID4692
Feature activation+5.58e-01

Pos logit contributions
 etiquette+0.678
atar+0.658
 aquatic+0.656
 atmosp+0.621
edia+0.617
Neg logit contributions
���-0.990
龍�-0.947
 fixme-0.877
 Written-0.870
-0.795
Token on
Token ID319
Feature activation+1.90e-01

Pos logit contributions
 etiquette+1.262
 aquatic+1.224
atar+1.223
 atmosp+1.158
edia+1.148
Neg logit contributions
���-1.792
龍�-1.714
 fixme-1.584
 Written-1.569
-1.434
Token the
Token ID262
Feature activation+1.09e-01

Pos logit contributions
 etiquette+0.472
 aquatic+0.459
atar+0.457
 atmosp+0.436
edia+0.432
Neg logit contributions
���-0.569
龍�-0.541
 fixme-0.497
 Written-0.491
-0.447
Token Bruins
Token ID27607
Feature activation+2.36e-01

Pos logit contributions
 etiquette+0.279
atar+0.271
 aquatic+0.271
 atmosp+0.258
edia+0.257
Neg logit contributions
���-0.319
龍�-0.304
 fixme-0.279
 Written-0.276
-0.249
Token.
Token ID13
Feature activation+1.02e-01

Pos logit contributions
 etiquette+0.528
 aquatic+0.512
atar+0.511
 atmosp+0.484
edia+0.478
Neg logit contributions
���-0.764
龍�-0.730
 fixme-0.675
 Written-0.668
-0.612
Token Instead
Token ID5455
Feature activation+1.11e+00

Pos logit contributions
 etiquette+0.264
atar+0.258
 aquatic+0.257
 atmosp+0.245
edia+0.244
Neg logit contributions
���-0.292
龍�-0.277
 fixme-0.252
 Written-0.249
-0.226
Token,
Token ID11
Feature activation+1.22e-01

Pos logit contributions
 etiquette+2.337
 aquatic+2.250
atar+2.227
 atmosp+2.086
edia+2.073
Neg logit contributions
���-3.720
龍�-3.548
 fixme-3.297
 Written-3.232
-3.022
Token Allow
Token ID22507
Feature activation+2.40e-01

Pos logit contributions
 etiquette+0.296
 aquatic+0.288
atar+0.287
 atmosp+0.273
edia+0.271
Neg logit contributions
���-0.373
龍�-0.356
 fixme-0.327
 Written-0.324
-0.294
Tokenay
Token ID323
Feature activation+1.10e-01

Pos logit contributions
 etiquette+0.616
 aquatic+0.599
atar+0.599
 atmosp+0.571
edia+0.567
Neg logit contributions
���-0.698
龍�-0.665
 fixme-0.609
 Written-0.602
-0.544
Token committed
Token ID5364
Feature activation+1.50e-01

Pos logit contributions
 etiquette+0.245
atar+0.238
 aquatic+0.238
 atmosp+0.225
edia+0.223
Neg logit contributions
���-0.353
龍�-0.337
 fixme-0.313
 Written-0.310
-0.282
Token,
Token ID11
Feature activation+2.20e-01

Pos logit contributions
 etiquette+0.344
atar+0.336
 aquatic+0.334
 atmosp+0.317
edia+0.315
Neg logit contributions
���-0.473
龍�-0.453
 fixme-0.420
 Written-0.418
-0.378
Token algae
Token ID33773
Feature activation+3.20e-01

Pos logit contributions
���+0.129
龍�+0.124
 fixme+0.115
 Written+0.114
+0.105
Neg logit contributions
 etiquette-0.087
atar-0.085
 aquatic-0.082
edia-0.079
 atmosp-0.079
Token within
Token ID1626
Feature activation+5.62e-02

Pos logit contributions
 etiquette+0.626
atar+0.604
 aquatic+0.604
 atmosp+0.567
edia+0.561
Neg logit contributions
���-1.122
龍�-1.078
 fixme-1.004
 Written-0.995
-0.918
Token a
Token ID257
Feature activation-3.51e-02

Pos logit contributions
 etiquette+0.133
atar+0.130
 aquatic+0.126
edia+0.122
 atmosp+0.121
Neg logit contributions
���-0.171
龍�-0.164
 fixme-0.152
 Written-0.150
-0.136
Token body
Token ID1767
Feature activation-2.10e-01

Pos logit contributions
���+0.110
龍�+0.106
 fixme+0.098
 Written+0.097
+0.088
Neg logit contributions
 etiquette-0.081
atar-0.078
 aquatic-0.076
 atmosp-0.073
edia-0.073
Token of
Token ID286
Feature activation-3.78e-02

Pos logit contributions
���+0.631
龍�+0.614
 fixme+0.562
 Written+0.555
+0.507
Neg logit contributions
 etiquette-0.504
atar-0.490
 aquatic-0.480
edia-0.463
 atmosp-0.460
Token water
Token ID1660
Feature activation+1.11e+00

Pos logit contributions
���+0.134
龍�+0.130
 fixme+0.122
 Written+0.121
+0.111
Neg logit contributions
 etiquette-0.070
atar-0.068
 aquatic-0.065
edia-0.063
 atmosp-0.062
Token,
Token ID11
Feature activation-1.96e-02

Pos logit contributions
 etiquette+2.484
 aquatic+2.434
atar+2.397
 atmosp+2.287
edia+2.237
Neg logit contributions
���-3.606
龍�-3.415
 fixme-3.157
 Written-3.117
-2.866
Token to
Token ID284
Feature activation+1.02e-01

Pos logit contributions
���+0.061
龍�+0.058
 fixme+0.054
 Written+0.053
+0.049
Neg logit contributions
 etiquette-0.046
atar-0.045
 aquatic-0.044
 atmosp-0.042
edia-0.042
Token spread
Token ID4104
Feature activation+1.20e-01

Pos logit contributions
 etiquette+0.239
atar+0.232
 aquatic+0.230
 atmosp+0.219
edia+0.218
Neg logit contributions
���-0.317
龍�-0.303
 fixme-0.280
 Written-0.277
-0.252
Token and
Token ID290
Feature activation-2.73e-01

Pos logit contributions
 etiquette+0.268
atar+0.260
 aquatic+0.258
 atmosp+0.246
edia+0.244
Neg logit contributions
���-0.385
龍�-0.369
 fixme-0.341
 Written-0.338
-0.309
Token intensify
Token ID47697
Feature activation-3.09e-01

Pos logit contributions
���+0.876
龍�+0.845
 fixme+0.788
 Written+0.774
+0.707
Neg logit contributions
 etiquette-0.603
atar-0.589
 aquatic-0.576
edia-0.551
 atmosp-0.549
Token\n
Token ID198
Feature activation-9.26e-02

Pos logit contributions
���+0.386
龍�+0.378
 fixme+0.346
 Written+0.343
+0.303
Neg logit contributions
 etiquette-0.421
 aquatic-0.410
atar-0.407
 atmosp-0.391
 therap-0.391
TokenAnd
Token ID1870
Feature activation+3.04e-01

Pos logit contributions
���+0.269
龍�+0.259
 fixme+0.238
 Written+0.236
+0.213
Neg logit contributions
 etiquette-0.235
 aquatic-0.229
atar-0.227
 atmosp-0.218
edia-0.215
Token,
Token ID11
Feature activation+1.03e-02

Pos logit contributions
 etiquette+0.782
 aquatic+0.761
atar+0.760
 atmosp+0.725
edia+0.719
Neg logit contributions
���-0.883
龍�-0.840
 fixme-0.768
 Written-0.760
-0.688
Token second
Token ID1218
Feature activation+4.18e-01

Pos logit contributions
 etiquette+0.026
atar+0.025
 aquatic+0.025
 atmosp+0.024
edia+0.024
Neg logit contributions
���-0.031
龍�-0.029
 fixme-0.027
 Written-0.027
-0.024
Token,
Token ID11
Feature activation+1.54e-01

Pos logit contributions
 etiquette+0.904
 aquatic+0.875
atar+0.875
 atmosp+0.826
edia+0.817
Neg logit contributions
���-1.385
龍�-1.327
 fixme-1.228
 Written-1.217
-1.117
Token rather
Token ID2138
Feature activation+1.10e+00

Pos logit contributions
 etiquette+0.365
 aquatic+0.355
atar+0.355
 atmosp+0.337
edia+0.334
Neg logit contributions
���-0.478
龍�-0.457
 fixme-0.421
 Written-0.417
-0.379
Token than
Token ID621
Feature activation+3.04e-01

Pos logit contributions
 etiquette+2.459
 aquatic+2.371
atar+2.319
 atmosp+2.253
edia+2.174
Neg logit contributions
���-3.546
龍�-3.414
 fixme-3.126
 Written-3.089
-2.877
Token pushing
Token ID7796
Feature activation+4.08e-01

Pos logit contributions
 etiquette+0.731
 aquatic+0.710
atar+0.709
 atmosp+0.675
edia+0.668
Neg logit contributions
���-0.934
龍�-0.891
 fixme-0.820
 Written-0.812
-0.739
Token a
Token ID257
Feature activation+1.28e-01

Pos logit contributions
 etiquette+0.980
atar+0.955
 aquatic+0.954
 atmosp+0.905
edia+0.899
Neg logit contributions
���-1.249
龍�-1.192
 fixme-1.099
 Written-1.088
-0.987
Token program
Token ID1430
Feature activation-9.20e-02

Pos logit contributions
 etiquette+0.295
atar+0.288
 aquatic+0.287
 atmosp+0.271
edia+0.271
Neg logit contributions
���-0.403
龍�-0.384
 fixme-0.355
 Written-0.352
-0.320
Token he
Token ID339
Feature activation-5.06e-02

Pos logit contributions
���+0.271
龍�+0.258
 fixme+0.237
 Written+0.236
+0.213
Neg logit contributions
 etiquette-0.231
 aquatic-0.225
atar-0.225
 atmosp-0.214
edia-0.212
Token indicated
Token ID8203
Feature activation+3.49e-01

Pos logit contributions
���+1.177
龍�+1.112
 fixme+1.039
 Written+1.039
+0.922
Neg logit contributions
 etiquette-0.967
 aquatic-0.941
atar-0.941
 atmosp-0.895
edia-0.891
Token.
Token ID13
Feature activation+3.41e-01

Pos logit contributions
 etiquette+0.747
atar+0.723
 aquatic+0.723
 atmosp+0.682
edia+0.675
Neg logit contributions
���-1.167
龍�-1.118
 fixme-1.034
 Written-1.021
-0.941
Token All
Token ID1439
Feature activation+2.94e-01

Pos logit contributions
 etiquette+0.891
atar+0.871
 aquatic+0.868
 atmosp+0.832
edia+0.825
Neg logit contributions
���-0.976
龍�-0.920
 fixme-0.837
 Written-0.820
-0.752
Token salary
Token ID9588
Feature activation+3.82e-01

Pos logit contributions
 etiquette+0.677
atar+0.659
 aquatic+0.658
 atmosp+0.621
edia+0.620
Neg logit contributions
���-0.924
龍�-0.884
 fixme-0.819
 Written-0.816
-0.738
Token-
Token ID12
Feature activation+1.19e-01

Pos logit contributions
 etiquette+0.832
atar+0.807
 aquatic+0.807
 atmosp+0.762
edia+0.755
Neg logit contributions
���-1.258
龍�-1.204
 fixme-1.116
 Written-1.107
-1.013
Tokencap
Token ID11128
Feature activation+1.10e+00

Pos logit contributions
 etiquette+0.103
 aquatic+0.096
atar+0.094
 atmosp+0.081
edia+0.078
Neg logit contributions
���-0.549
龍�-0.533
 fixme-0.505
 Written-0.503
-0.474
Token figures
Token ID5538
Feature activation+3.53e-01

Pos logit contributions
 etiquette+2.436
 aquatic+2.337
atar+2.317
 atmosp+2.219
edia+2.167
Neg logit contributions
���-3.625
龍�-3.451
 fixme-3.169
 Written-3.128
-2.872
Token courtesy
Token ID12537
Feature activation-1.74e-01

Pos logit contributions
 etiquette+0.816
 aquatic+0.794
atar+0.793
 atmosp+0.752
edia+0.743
Neg logit contributions
���-1.115
龍�-1.065
 fixme-0.980
 Written-0.968
-0.887
Token of
Token ID286
Feature activation+9.60e-02

Pos logit contributions
���+0.527
龍�+0.503
 fixme+0.462
 Written+0.457
+0.415
Neg logit contributions
 etiquette-0.424
 aquatic-0.412
atar-0.412
 atmosp-0.392
edia-0.389
Token Over
Token ID3827
Feature activation+6.15e-01

Pos logit contributions
 etiquette+0.260
 aquatic+0.253
atar+0.253
 atmosp+0.241
edia+0.240
Neg logit contributions
���-0.264
龍�-0.251
 fixme-0.230
 Written-0.228
-0.203
TokenThe
Token ID464
Feature activation+3.07e-01

Pos logit contributions
 etiquette+1.556
 aquatic+1.513
atar+1.511
 atmosp+1.442
edia+1.429
Neg logit contributions
���-1.810
龍�-1.725
 fixme-1.580
 Written-1.565
-1.416
Token Mac
Token ID4100
Feature activation-2.45e-01

Pos logit contributions
 etiquette+0.917
atar+0.893
 aquatic+0.891
edia+0.848
 atmosp+0.847
Neg logit contributions
���-1.069
龍�-1.023
 fixme-0.933
 Written-0.926
-0.836
Tokenri
Token ID380
Feature activation+9.99e-02

Pos logit contributions
���+0.585
龍�+0.572
 fixme+0.526
 Written+0.517
+0.451
Neg logit contributions
 etiquette-0.729
 aquatic-0.711
atar-0.698
 atmosp-0.681
 therap-0.677
Token a
Token ID257
Feature activation+5.06e-02

Pos logit contributions
 etiquette+0.237
atar+0.229
 aquatic+0.228
 atmosp+0.216
edia+0.216
Neg logit contributions
���-0.310
龍�-0.297
 fixme-0.272
 Written-0.270
-0.247
Token beneficiary
Token ID33867
Feature activation-2.42e-01

Pos logit contributions
 etiquette+0.117
atar+0.113
 aquatic+0.113
 atmosp+0.107
edia+0.106
Neg logit contributions
���-0.160
龍�-0.153
 fixme-0.142
 Written-0.140
-0.128
Token,
Token ID11
Feature activation+2.01e-01

Pos logit contributions
���+0.718
龍�+0.693
 fixme+0.637
 Written+0.632
+0.571
Neg logit contributions
 etiquette-0.590
 aquatic-0.571
atar-0.571
 atmosp-0.546
edia-0.545
Token rather
Token ID2138
Feature activation+1.09e+00

Pos logit contributions
 etiquette+0.489
atar+0.476
 aquatic+0.474
edia+0.453
 atmosp+0.450
Neg logit contributions
���-0.600
龍�-0.578
 fixme-0.533
 Written-0.532
-0.476
Token than
Token ID621
Feature activation+3.97e-01

Pos logit contributions
 etiquette+2.514
 aquatic+2.446
atar+2.401
 atmosp+2.318
edia+2.250
Neg logit contributions
���-3.475
龍�-3.325
 fixme-3.061
 Written-3.031
-2.775
Token a
Token ID257
Feature activation-8.09e-02

Pos logit contributions
 etiquette+0.892
atar+0.864
 aquatic+0.863
 atmosp+0.818
edia+0.814
Neg logit contributions
���-1.276
龍�-1.225
 fixme-1.131
 Written-1.123
-1.024
Token victim
Token ID3117
Feature activation+8.38e-02

Pos logit contributions
���+0.255
龍�+0.245
 fixme+0.227
 Written+0.226
+0.205
Neg logit contributions
 etiquette-0.184
atar-0.179
 aquatic-0.178
 atmosp-0.169
edia-0.168
Token.
Token ID13
Feature activation+2.32e-01

Pos logit contributions
 etiquette+0.184
 aquatic+0.178
atar+0.178
 atmosp+0.169
edia+0.167
Neg logit contributions
���-0.274
龍�-0.263
 fixme-0.243
 Written-0.241
-0.221
Token Again
Token ID6521
Feature activation+4.27e-01

Pos logit contributions
 etiquette+0.608
 aquatic+0.592
atar+0.589
 atmosp+0.563
edia+0.559
Neg logit contributions
���-0.659
龍�-0.631
 fixme-0.578
 Written-0.575
-0.514
Token
Token ID247
Feature activation+3.33e-02

Pos logit contributions
���+0.264
龍�+0.248
 fixme+0.220
 Written+0.217
+0.188
Neg logit contributions
 etiquette-0.388
 aquatic-0.380
atar-0.379
 atmosp-0.367
edia-0.364
Token?
Token ID30
Feature activation+8.87e-02

Pos logit contributions
 etiquette+0.078
 aquatic+0.076
atar+0.075
 atmosp+0.072
edia+0.071
Neg logit contributions
���-0.103
龍�-0.100
 fixme-0.092
 Written-0.091
-0.083
Token OK
Token ID7477
Feature activation-3.93e-01

Pos logit contributions
 etiquette+0.213
 aquatic+0.207
atar+0.207
 atmosp+0.197
edia+0.195
Neg logit contributions
���-0.271
龍�-0.260
 fixme-0.239
 Written-0.238
-0.215
Token,
Token ID11
Feature activation+1.47e-02

Pos logit contributions
���+1.218
龍�+1.186
 fixme+1.100
 Written+1.091
+0.982
Neg logit contributions
 etiquette-0.890
 aquatic-0.864
atar-0.863
 atmosp-0.829
edia-0.815
Token fair
Token ID3148
Feature activation+3.83e-01

Pos logit contributions
 etiquette+0.034
atar+0.033
 aquatic+0.033
 atmosp+0.032
edia+0.032
Neg logit contributions
���-0.045
龍�-0.044
 fixme-0.041
 Written-0.040
-0.036
Token enough
Token ID1576
Feature activation+1.09e+00

Pos logit contributions
 etiquette+0.882
 aquatic+0.858
atar+0.857
 atmosp+0.813
edia+0.807
Neg logit contributions
���-1.209
龍�-1.158
 fixme-1.070
 Written-1.060
-0.966
Token,
Token ID11
Feature activation+2.68e-04

Pos logit contributions
 etiquette+2.357
atar+2.276
 aquatic+2.270
 atmosp+2.141
edia+2.111
Neg logit contributions
���-3.629
龍�-3.452
 fixme-3.185
 Written-3.152
-2.909
Token if
Token ID611
Feature activation+2.29e-01

Pos logit contributions
 etiquette+0.001
atar+0.001
 aquatic+0.001
edia+0.001
 atmosp+0.001
Neg logit contributions
���-0.001
龍�-0.001
 fixme-0.001
 Written-0.001
-0.001
Token de
Token ID390
Feature activation-1.44e-01

Pos logit contributions
 etiquette+0.540
atar+0.525
 aquatic+0.525
 atmosp+0.498
edia+0.494
Neg logit contributions
���-0.711
龍�-0.681
 fixme-0.627
 Written-0.622
-0.564
Tokenvious
Token ID1442
Feature activation+1.57e-01

Pos logit contributions
���+0.445
龍�+0.425
 fixme+0.391
 Written+0.387
+0.352
Neg logit contributions
 etiquette-0.344
 aquatic-0.335
atar-0.335
 atmosp-0.318
edia-0.315
Token)
Token ID8
Feature activation+4.64e-01

Pos logit contributions
 etiquette+0.352
atar+0.343
 aquatic+0.342
 atmosp+0.325
edia+0.325
Neg logit contributions
���-0.500
龍�-0.482
 fixme-0.447
 Written-0.444
-0.402
Token day
Token ID1110
Feature activation-2.56e-01

Pos logit contributions
 etiquette+0.612
 aquatic+0.594
atar+0.592
 atmosp+0.563
edia+0.556
Neg logit contributions
���-0.909
龍�-0.869
 fixme-0.803
 Written-0.797
-0.730
Token,
Token ID11
Feature activation-6.77e-02

Pos logit contributions
���+0.774
龍�+0.748
 fixme+0.699
 Written+0.697
+0.622
Neg logit contributions
 etiquette-0.602
atar-0.583
 aquatic-0.582
edia-0.553
 atmosp-0.550
Token so
Token ID523
Feature activation-2.77e-01

Pos logit contributions
���+0.205
龍�+0.197
 fixme+0.183
 Written+0.182
+0.163
Neg logit contributions
 etiquette-0.160
atar-0.157
 aquatic-0.155
edia-0.149
 atmosp-0.147
Token that
Token ID326
Feature activation-8.78e-02

Pos logit contributions
���+0.795
龍�+0.760
 fixme+0.700
 Written+0.689
+0.616
Neg logit contributions
 etiquette-0.707
atar-0.695
 aquatic-0.686
 atmosp-0.652
edia-0.652
Token the
Token ID262
Feature activation-6.79e-02

Pos logit contributions
���+0.265
龍�+0.253
 fixme+0.235
 Written+0.232
+0.209
Neg logit contributions
 etiquette-0.211
atar-0.207
 aquatic-0.204
edia-0.194
 atmosp-0.194
Token impact
Token ID2928
Feature activation+1.09e+00

Pos logit contributions
���+0.216
龍�+0.207
 fixme+0.193
 Written+0.192
+0.173
Neg logit contributions
 etiquette-0.152
atar-0.149
 aquatic-0.147
edia-0.139
 atmosp-0.139
Token point
Token ID966
Feature activation+8.67e-02

Pos logit contributions
 etiquette+2.247
 aquatic+2.171
atar+2.170
 atmosp+2.052
edia+2.026
Neg logit contributions
���-3.701
龍�-3.545
 fixme-3.272
 Written-3.253
-2.982
Token would
Token ID561
Feature activation+1.62e-01

Pos logit contributions
 etiquette+0.186
atar+0.182
 aquatic+0.180
edia+0.170
 atmosp+0.169
Neg logit contributions
���-0.283
龍�-0.271
 fixme-0.254
 Written-0.252
-0.228
Token have
Token ID423
Feature activation-3.92e-01

Pos logit contributions
 etiquette+0.382
atar+0.372
 aquatic+0.371
 atmosp+0.351
edia+0.351
Neg logit contributions
���-0.500
龍�-0.477
 fixme-0.443
 Written-0.438
-0.397
Token the
Token ID262
Feature activation-6.84e-03

Pos logit contributions
���+1.241
龍�+1.168
 Written+1.104
 fixme+1.097
+0.984
Neg logit contributions
 etiquette-0.874
atar-0.862
 aquatic-0.836
edia-0.815
 therap-0.784
Token optimum
Token ID39475
Feature activation+3.37e-01

Pos logit contributions
���+0.021
龍�+0.020
 fixme+0.018
 Written+0.018
+0.016
Neg logit contributions
 etiquette-0.017
atar-0.016
 aquatic-0.016
edia-0.015
 atmosp-0.015
Token,
Token ID11
Feature activation-6.12e-02

Pos logit contributions
���+1.073
龍�+1.029
 fixme+0.963
 Written+0.961
+0.869
Neg logit contributions
 etiquette-0.800
 aquatic-0.784
atar-0.783
edia-0.748
 atmosp-0.730
Token try
Token ID1949
Feature activation+1.00e-01

Pos logit contributions
���+0.189
龍�+0.181
 fixme+0.168
 Written+0.166
+0.150
Neg logit contributions
 etiquette-0.143
atar-0.140
 aquatic-0.140
 atmosp-0.132
edia-0.132
Token incorporating
Token ID29927
Feature activation+1.13e-01

Pos logit contributions
 etiquette+0.241
atar+0.235
 aquatic+0.234
 atmosp+0.223
edia+0.221
Neg logit contributions
���-0.306
龍�-0.292
 fixme-0.269
 Written-0.267
-0.242
Token the
Token ID262
Feature activation+1.19e-01

Pos logit contributions
 etiquette+0.251
atar+0.245
 aquatic+0.243
edia+0.230
 atmosp+0.230
Neg logit contributions
���-0.364
龍�-0.349
 fixme-0.324
 Written-0.322
-0.292
Token following
Token ID1708
Feature activation-2.73e-01

Pos logit contributions
 etiquette+0.279
atar+0.272
 aquatic+0.271
edia+0.257
 atmosp+0.257
Neg logit contributions
���-0.370
龍�-0.354
 fixme-0.328
 Written-0.326
-0.294
Token gimmick
Token ID46029
Feature activation+1.09e+00

Pos logit contributions
���+0.831
龍�+0.796
 fixme+0.746
 Written+0.739
+0.661
Neg logit contributions
 etiquette-0.632
atar-0.624
 aquatic-0.621
edia-0.595
 atmosp-0.584
Token (
Token ID357
Feature activation-9.26e-02

Pos logit contributions
 etiquette+2.508
atar+2.420
 aquatic+2.417
 atmosp+2.304
edia+2.268
Neg logit contributions
���-3.467
龍�-3.311
 fixme-3.037
 Written-3.005
-2.746
TokenSt
Token ID1273
Feature activation-2.84e-01

Pos logit contributions
���+0.263
龍�+0.252
 fixme+0.231
 Written+0.229
+0.207
Neg logit contributions
 etiquette-0.240
 aquatic-0.235
atar-0.233
 atmosp-0.224
edia-0.222
Tokenolen
Token ID8622
Feature activation-2.88e-01

Pos logit contributions
���+0.824
龍�+0.789
 fixme+0.726
 Written+0.723
+0.645
Neg logit contributions
 etiquette-0.717
 aquatic-0.697
atar-0.688
 atmosp-0.661
edia-0.658
Token from
Token ID422
Feature activation+3.20e-03

Pos logit contributions
���+0.833
龍�+0.800
 fixme+0.737
 Written+0.736
+0.658
Neg logit contributions
 etiquette-0.725
 aquatic-0.707
atar-0.704
 atmosp-0.673
edia-0.670
Token the
Token ID262
Feature activation-1.15e-01

Pos logit contributions
 etiquette+0.008
atar+0.008
 aquatic+0.008
 atmosp+0.007
edia+0.007
Neg logit contributions
���-0.009
龍�-0.009
 fixme-0.008
 Written-0.008
-0.007
Token 2012
Token ID2321
Feature activation+3.10e-01

Pos logit contributions
���+0.418
龍�+0.407
 Written+0.378
 fixme+0.373
+0.324
Neg logit contributions
 etiquette-0.501
 aquatic-0.488
atar-0.487
 therap-0.471
 atmosp-0.466
Token United
Token ID1578
Feature activation+3.38e-01

Pos logit contributions
 etiquette+0.536
 aquatic+0.516
atar+0.514
 atmosp+0.479
edia+0.472
Neg logit contributions
���-1.159
龍�-1.119
 fixme-1.047
 Written-1.041
-0.963
Token States
Token ID1829
Feature activation+1.99e-01

Pos logit contributions
 etiquette+0.561
atar+0.539
 aquatic+0.538
 atmosp+0.498
edia+0.492
Neg logit contributions
���-1.291
龍�-1.243
 fixme-1.166
 Written-1.155
-1.075
Token 5
Token ID642
Feature activation+1.62e-01

Pos logit contributions
 etiquette+0.489
 aquatic+0.476
atar+0.475
 atmosp+0.452
edia+0.448
Neg logit contributions
���-0.595
龍�-0.571
 fixme-0.526
 Written-0.524
-0.471
Token667
Token ID28933
Feature activation+4.90e-02

Pos logit contributions
 etiquette+0.425
 aquatic+0.414
atar+0.414
 atmosp+0.395
edia+0.392
Neg logit contributions
���-0.462
龍�-0.439
 fixme-0.401
 Written-0.397
-0.358
Token Posts
Token ID12043
Feature activation-2.29e+00

Pos logit contributions
 etiquette+0.152
atar+0.149
 aquatic+0.149
 atmosp+0.143
edia+0.142
Neg logit contributions
���-0.118
龍�-0.109
 fixme-0.097
 Written-0.095
-0.084
Token #
Token ID1303
Feature activation-1.54e+00

Pos logit contributions
���+5.368
龍�+5.017
 Written+4.755
 fixme+4.483
EStream+4.032
Neg logit contributions
 etiquette-6.618
atar-6.523
 aquatic-6.509
 atmosp-6.286
edia-6.248
Token12
Token ID1065
Feature activation-1.63e-01

Pos logit contributions
龍�+3.083
 fixme+2.897
 Written+2.843
���+2.637
written+2.400
Neg logit contributions
 therap-5.049
 etiquette-5.037
 aquatic-4.856
 charism-4.737
 atmosp-4.717
Token I
Token ID314
Feature activation-1.15e-01

Pos logit contributions
���+0.480
龍�+0.461
 fixme+0.428
 Written+0.426
+0.380
Neg logit contributions
 etiquette-0.402
atar-0.391
 aquatic-0.390
 atmosp-0.373
edia-0.371
Token'm
Token ID1101
Feature activation+1.46e-01

Pos logit contributions
���+0.358
龍�+0.343
 fixme+0.318
 Written+0.316
+0.284
Neg logit contributions
 etiquette-0.268
atar-0.259
 aquatic-0.259
 atmosp-0.246
edia-0.245
Token glad
Token ID9675
Feature activation-1.89e-01

Pos logit contributions
 etiquette+0.329
atar+0.319
 aquatic+0.318
 atmosp+0.302
edia+0.300
Neg logit contributions
���-0.467
龍�-0.446
 fixme-0.413
 Written-0.410
-0.373
Token\n
Token ID198
Feature activation-7.62e-02

Pos logit contributions
���+0.406
龍�+0.401
 fixme+0.368
 Written+0.366
+0.323
Neg logit contributions
 etiquette-0.453
 aquatic-0.441
atar-0.436
 therap-0.422
 atmosp-0.421
Tokenpur
Token ID14225
Feature activation+1.73e-01

Pos logit contributions
���+0.215
龍�+0.207
 fixme+0.189
 Written+0.189
+0.170
Neg logit contributions
 etiquette-0.198
 aquatic-0.194
atar-0.192
 atmosp-0.185
edia-0.182
Tokenak
Token ID461
Feature activation-5.58e-01

Pos logit contributions
 etiquette+0.504
 aquatic+0.492
atar+0.492
 atmosp+0.472
edia+0.468
Neg logit contributions
���-0.444
龍�-0.420
 fixme-0.380
 Written-0.376
-0.333
Tokenushi
Token ID17731
Feature activation-7.90e-02

Pos logit contributions
���+1.674
龍�+1.609
 fixme+1.482
 Written+1.479
+1.333
Neg logit contributions
 etiquette-1.348
 aquatic-1.310
atar-1.296
 atmosp-1.253
edia-1.225
Token Profile
Token ID13118
Feature activation-8.06e-01

Pos logit contributions
���+0.231
龍�+0.221
 Written+0.203
 fixme+0.203
+0.182
Neg logit contributions
 etiquette-0.199
 aquatic-0.194
atar-0.193
 atmosp-0.185
edia-0.182
Token Joined
Token ID16709
Feature activation-2.29e+00

Pos logit contributions
���+0.853
龍�+0.753
 Written+0.591
 fixme+0.581
+0.366
Neg logit contributions
 etiquette-3.506
atar-3.450
 aquatic-3.446
 atmosp-3.355
edia-3.351
Token August
Token ID2932
Feature activation-4.54e-01

Pos logit contributions
 Written+4.708
龍�+4.678
���+4.587
 fixme+4.426
 Joined+3.964
Neg logit contributions
 etiquette-6.910
 therap-6.808
 aquatic-6.651
atar-6.630
edia-6.457
Token 2012
Token ID2321
Feature activation+1.02e-02

Pos logit contributions
���+1.135
龍�+1.101
 Written+1.009
 fixme+1.006
+0.881
Neg logit contributions
 etiquette-1.315
 aquatic-1.279
atar-1.272
 atmosp-1.223
 therap-1.215
Token United
Token ID1578
Feature activation+5.99e-02

Pos logit contributions
 etiquette+0.018
 aquatic+0.018
atar+0.017
 atmosp+0.016
edia+0.016
Neg logit contributions
���-0.037
龍�-0.036
 fixme-0.034
 Written-0.034
-0.031
Token States
Token ID1829
Feature activation-8.50e-02

Pos logit contributions
 etiquette+0.104
 aquatic+0.100
atar+0.099
 atmosp+0.092
edia+0.091
Neg logit contributions
���-0.223
龍�-0.215
 fixme-0.201
 Written-0.201
-0.184
Token 3
Token ID513
Feature activation+1.93e-01

Pos logit contributions
���+0.250
龍�+0.241
 Written+0.223
 fixme+0.222
+0.198
Neg logit contributions
 etiquette-0.212
 aquatic-0.206
atar-0.205
 atmosp-0.196
edia-0.193
Token 2010
Token ID3050
Feature activation-4.49e-03

Pos logit contributions
���+0.994
龍�+0.968
 Written+0.897
 fixme+0.891
+0.774
Neg logit contributions
 etiquette-1.188
atar-1.155
 aquatic-1.154
 therap-1.110
 atmosp-1.105
Token United
Token ID1578
Feature activation+5.77e-02

Pos logit contributions
���+0.016
龍�+0.016
 fixme+0.015
 Written+0.015
+0.014
Neg logit contributions
 etiquette-0.008
 aquatic-0.008
atar-0.008
 atmosp-0.007
edia-0.007
Token States
Token ID1829
Feature activation+3.57e-02

Pos logit contributions
 etiquette+0.096
 aquatic+0.093
atar+0.092
 atmosp+0.086
edia+0.085
Neg logit contributions
���-0.219
龍�-0.211
 fixme-0.197
 Written-0.196
-0.182
Token 10
Token ID838
Feature activation-7.93e-02

Pos logit contributions
 etiquette+0.088
 aquatic+0.086
atar+0.086
 atmosp+0.082
edia+0.081
Neg logit contributions
���-0.106
龍�-0.102
 fixme-0.094
 Written-0.094
-0.084
Token53
Token ID4310
Feature activation-1.02e-01

Pos logit contributions
���+0.221
龍�+0.211
 fixme+0.193
 Written+0.192
+0.172
Neg logit contributions
 etiquette-0.212
 aquatic-0.207
atar-0.206
 atmosp-0.197
edia-0.195
Token Posts
Token ID12043
Feature activation-2.21e+00

Pos logit contributions
���+0.246
龍�+0.226
 fixme+0.202
 Written+0.198
+0.175
Neg logit contributions
 etiquette-0.315
atar-0.310
 aquatic-0.309
 atmosp-0.297
edia-0.295
Token #
Token ID1303
Feature activation-1.61e+00

Pos logit contributions
���+5.272
龍�+4.944
 Written+4.656
 fixme+4.436
EStream+3.992
Neg logit contributions
 etiquette-6.330
atar-6.229
 aquatic-6.200
 atmosp-6.018
edia-5.968
Token15
Token ID1314
Feature activation-2.43e-01

Pos logit contributions
龍�+3.570
 Written+3.308
 fixme+3.238
���+3.116
written+2.936
Neg logit contributions
 therap-4.974
 etiquette-4.896
 aquatic-4.746
 charism-4.670
 atmosp-4.635
Token honestly
Token ID12698
Feature activation-2.04e-01

Pos logit contributions
���+0.717
龍�+0.687
 fixme+0.638
 Written+0.636
+0.567
Neg logit contributions
 etiquette-0.600
atar-0.584
 aquatic-0.582
 atmosp-0.557
edia-0.554
Token if
Token ID611
Feature activation-2.54e-01

Pos logit contributions
���+0.622
龍�+0.597
 fixme+0.554
 Written+0.550
+0.498
Neg logit contributions
 etiquette-0.482
atar-0.470
 aquatic-0.467
 atmosp-0.447
edia-0.442
Token we
Token ID356
Feature activation+9.00e-02

Pos logit contributions
���+0.750
龍�+0.721
 fixme+0.663
 Written+0.659
+0.594
Neg logit contributions
 etiquette-0.623
atar-0.610
 aquatic-0.605
 atmosp-0.579
edia-0.574
Token 2012
Token ID2321
Feature activation+1.02e-02

Pos logit contributions
���+1.135
龍�+1.101
 Written+1.009
 fixme+1.006
+0.881
Neg logit contributions
 etiquette-1.315
 aquatic-1.279
atar-1.272
 atmosp-1.223
 therap-1.215
Token United
Token ID1578
Feature activation+5.99e-02

Pos logit contributions
 etiquette+0.018
 aquatic+0.018
atar+0.017
 atmosp+0.016
edia+0.016
Neg logit contributions
���-0.037
龍�-0.036
 fixme-0.034
 Written-0.034
-0.031
Token States
Token ID1829
Feature activation-8.50e-02

Pos logit contributions
 etiquette+0.104
 aquatic+0.100
atar+0.099
 atmosp+0.092
edia+0.091
Neg logit contributions
���-0.223
龍�-0.215
 fixme-0.201
 Written-0.201
-0.184
Token 3
Token ID513
Feature activation+1.93e-01

Pos logit contributions
���+0.250
龍�+0.241
 Written+0.223
 fixme+0.222
+0.198
Neg logit contributions
 etiquette-0.212
 aquatic-0.206
atar-0.205
 atmosp-0.196
edia-0.193
Token240
Token ID16102
Feature activation-3.45e-03

Pos logit contributions
 etiquette+0.580
 aquatic+0.567
atar+0.567
 atmosp+0.544
edia+0.540
Neg logit contributions
���-0.476
龍�-0.449
 fixme-0.404
 Written-0.400
-0.352
Token Posts
Token ID12043
Feature activation-2.18e+00

Pos logit contributions
���+0.008
龍�+0.007
 fixme+0.007
 Written+0.007
+0.006
Neg logit contributions
 etiquette-0.011
 aquatic-0.011
atar-0.011
 atmosp-0.010
edia-0.010
Token #
Token ID1303
Feature activation-1.60e+00

Pos logit contributions
���+5.040
龍�+4.801
 Written+4.588
 fixme+4.334
 Joined+3.829
Neg logit contributions
 etiquette-6.279
atar-6.225
 aquatic-6.160
edia-5.953
 atmosp-5.938
Token10
Token ID940
Feature activation-3.09e-01

Pos logit contributions
龍�+4.029
���+3.743
 Written+3.640
 fixme+3.630
+3.209
Neg logit contributions
 etiquette-4.532
 aquatic-4.434
 therap-4.355
atar-4.237
 atmosp-4.234
Token Happy
Token ID14628
Feature activation-7.52e-01

Pos logit contributions
���+0.901
龍�+0.863
 fixme+0.799
 Written+0.798
+0.711
Neg logit contributions
 etiquette-0.770
 aquatic-0.749
atar-0.749
 atmosp-0.715
edia-0.710
Token it
Token ID340
Feature activation-8.17e-02

Pos logit contributions
���+2.244
龍�+2.147
 fixme+2.003
 Written+2.000
+1.788
Neg logit contributions
 etiquette-1.787
atar-1.783
 aquatic-1.743
edia-1.672
 atmosp-1.660
Token worked
Token ID3111
Feature activation+1.13e-01

Pos logit contributions
���+0.259
龍�+0.248
 fixme+0.230
 Written+0.228
+0.208
Neg logit contributions
 etiquette-0.186
atar-0.181
 aquatic-0.181
 atmosp-0.172
edia-0.170
Token 2002
Token ID6244
Feature activation+3.47e-01

Pos logit contributions
���+0.765
龍�+0.739
 fixme+0.677
 Written+0.676
+0.591
Neg logit contributions
 etiquette-0.894
atar-0.868
 aquatic-0.866
 atmosp-0.831
edia-0.828
Token United
Token ID1578
Feature activation+1.81e-01

Pos logit contributions
 etiquette+0.615
 aquatic+0.591
atar+0.591
 atmosp+0.550
edia+0.544
Neg logit contributions
���-1.286
龍�-1.237
 fixme-1.156
 Written-1.147
-1.063
Token States
Token ID1829
Feature activation+1.07e-01

Pos logit contributions
 etiquette+0.303
 aquatic+0.291
atar+0.291
 atmosp+0.270
edia+0.266
Neg logit contributions
���-0.686
龍�-0.660
 fixme-0.618
 Written-0.613
-0.570
Token 3
Token ID513
Feature activation+2.79e-01

Pos logit contributions
 etiquette+0.265
 aquatic+0.258
atar+0.257
 atmosp+0.245
edia+0.243
Neg logit contributions
���-0.317
龍�-0.305
 fixme-0.280
 Written-0.279
-0.251
Token472
Token ID37856
Feature activation+3.73e-02

Pos logit contributions
 etiquette+0.773
atar+0.756
 aquatic+0.754
 atmosp+0.722
edia+0.718
Neg logit contributions
���-0.752
龍�-0.709
 fixme-0.645
 Written-0.637
-0.569
Token Posts
Token ID12043
Feature activation-2.17e+00

Pos logit contributions
 etiquette+0.116
atar+0.114
 aquatic+0.114
 atmosp+0.109
edia+0.109
Neg logit contributions
���-0.089
龍�-0.082
 fixme-0.073
 Written-0.072
-0.064
Token #
Token ID1303
Feature activation-1.53e+00

Pos logit contributions
���+5.077
龍�+4.749
 Written+4.523
 fixme+4.282
EStream+3.757
Neg logit contributions
 etiquette-6.264
atar-6.208
 aquatic-6.162
edia-5.966
 atmosp-5.954
Token11
Token ID1157
Feature activation-7.44e-02

Pos logit contributions
龍�+3.772
���+3.530
 Written+3.474
 fixme+3.426
+3.018
Neg logit contributions
 etiquette-4.367
 therap-4.307
 aquatic-4.272
 atmosp-4.150
 charism-4.063
Token On
Token ID1550
Feature activation-6.37e-01

Pos logit contributions
���+0.219
龍�+0.209
 fixme+0.194
 Written+0.193
+0.172
Neg logit contributions
 etiquette-0.185
atar-0.180
 aquatic-0.180
 atmosp-0.172
edia-0.171
Token September
Token ID2693
Feature activation-5.00e-01

Pos logit contributions
���+1.591
龍�+1.529
 Written+1.434
 fixme+1.397
+1.239
Neg logit contributions
 etiquette-1.795
atar-1.771
 aquatic-1.754
 atmosp-1.689
edia-1.677
Token 20
Token ID1160
Feature activation-2.22e-01

Pos logit contributions
���+1.344
龍�+1.309
 Written+1.224
 fixme+1.223
+1.077
Neg logit contributions
 etiquette-1.337
 aquatic-1.299
atar-1.297
 therap-1.238
 atmosp-1.236
Token
Token ID198
Feature activation-5.67e-02

Pos logit contributions
���+0.156
龍�+0.152
 fixme+0.140
 Written+0.138
+0.122
Neg logit contributions
 etiquette-0.168
 aquatic-0.164
atar-0.163
 atmosp-0.157
 therap-0.156
Tokenheart
Token ID11499
Feature activation-1.16e-01

Pos logit contributions
���+0.160
龍�+0.154
 fixme+0.140
 Written+0.139
+0.126
Neg logit contributions
 etiquette-0.148
 aquatic-0.144
atar-0.143
 atmosp-0.138
edia-0.136
Tokenl
Token ID75
Feature activation-2.42e-01

Pos logit contributions
���+0.342
龍�+0.326
 fixme+0.299
 Written+0.297
+0.269
Neg logit contributions
 etiquette-0.290
 aquatic-0.282
atar-0.281
 atmosp-0.268
edia-0.266
Tokenxp
Token ID42372
Feature activation-9.93e-02

Pos logit contributions
���+0.720
龍�+0.694
 fixme+0.636
 Written+0.633
+0.571
Neg logit contributions
 etiquette-0.596
 aquatic-0.577
atar-0.572
 atmosp-0.550
edia-0.542
Token Profile
Token ID13118
Feature activation-9.60e-01

Pos logit contributions
���+0.287
龍�+0.273
 fixme+0.250
 Written+0.249
+0.224
Neg logit contributions
 etiquette-0.255
 aquatic-0.248
atar-0.247
 atmosp-0.237
edia-0.233
Token Joined
Token ID16709
Feature activation-2.15e+00

Pos logit contributions
���+1.131
龍�+1.017
 Written+0.813
 fixme+0.794
+0.551
Neg logit contributions
 etiquette-4.058
atar-3.989
 aquatic-3.979
 atmosp-3.875
edia-3.873
Token September
Token ID2693
Feature activation-4.08e-01

Pos logit contributions
 Written+4.467
龍�+4.454
���+4.324
 fixme+4.212
 Joined+3.710
Neg logit contributions
 etiquette-6.449
 therap-6.364
 aquatic-6.212
atar-6.202
edia-6.022
Token 2010
Token ID3050
Feature activation-4.49e-03

Pos logit contributions
���+0.994
龍�+0.968
 Written+0.897
 fixme+0.891
+0.774
Neg logit contributions
 etiquette-1.188
atar-1.155
 aquatic-1.154
 therap-1.110
 atmosp-1.105
Token United
Token ID1578
Feature activation+5.77e-02

Pos logit contributions
���+0.016
龍�+0.016
 fixme+0.015
 Written+0.015
+0.014
Neg logit contributions
 etiquette-0.008
 aquatic-0.008
atar-0.008
 atmosp-0.007
edia-0.007
Token States
Token ID1829
Feature activation+3.57e-02

Pos logit contributions
 etiquette+0.096
 aquatic+0.093
atar+0.092
 atmosp+0.086
edia+0.085
Neg logit contributions
���-0.219
龍�-0.211
 fixme-0.197
 Written-0.196
-0.182
Token 10
Token ID838
Feature activation-7.93e-02

Pos logit contributions
 etiquette+0.088
 aquatic+0.086
atar+0.086
 atmosp+0.082
edia+0.081
Neg logit contributions
���-0.106
龍�-0.102
 fixme-0.094
 Written-0.094
-0.084
Token June
Token ID2795
Feature activation-1.52e-01

Pos logit contributions
龍�+3.929
���+3.914
 Written+3.871
 fixme+3.686
 Joined+3.196
Neg logit contributions
 etiquette-5.530
 therap-5.384
atar-5.378
 aquatic-5.343
edia-5.179
Token 2012
Token ID2321
Feature activation+2.00e-01

Pos logit contributions
���+0.379
龍�+0.369
 Written+0.339
 fixme+0.337
+0.293
Neg logit contributions
 etiquette-0.440
 aquatic-0.429
atar-0.429
 therap-0.411
 atmosp-0.410
Token Canada
Token ID3340
Feature activation+2.94e-01

Pos logit contributions
 etiquette+0.348
 aquatic+0.335
atar+0.332
 atmosp+0.311
edia+0.306
Neg logit contributions
���-0.741
龍�-0.717
 fixme-0.671
 Written-0.668
-0.616
Token 16
Token ID1467
Feature activation+3.59e-01

Pos logit contributions
 etiquette+0.720
 aquatic+0.700
atar+0.699
 atmosp+0.665
edia+0.659
Neg logit contributions
���-0.888
龍�-0.850
 fixme-0.782
 Written-0.776
-0.702
Token10
Token ID940
Feature activation-6.77e-02

Pos logit contributions
 etiquette+0.927
atar+0.906
 aquatic+0.903
 atmosp+0.862
edia+0.856
Neg logit contributions
���-1.043
龍�-0.985
 fixme-0.901
 Written-0.890
-0.804
Token Posts
Token ID12043
Feature activation-2.14e+00

Pos logit contributions
���+0.161
龍�+0.150
 fixme+0.134
 Written+0.131
+0.116
Neg logit contributions
 etiquette-0.211
atar-0.207
 aquatic-0.206
 atmosp-0.198
edia-0.197
Token #
Token ID1303
Feature activation-1.49e+00

Pos logit contributions
���+5.107
龍�+4.761
 Written+4.492
 fixme+4.268
EStream+3.840
Neg logit contributions
 etiquette-6.184
atar-6.092
 aquatic-6.065
 atmosp-5.882
edia-5.839
Token14
Token ID1415
Feature activation-6.51e-02

Pos logit contributions
龍�+3.651
 Written+3.485
 fixme+3.415
���+3.158
written+3.097
Neg logit contributions
 therap-4.163
 etiquette-4.162
 aquatic-4.042
 advoc-3.911
 charism-3.909
Token On
Token ID1550
Feature activation-5.40e-01

Pos logit contributions
���+0.193
龍�+0.184
 fixme+0.171
 Written+0.170
+0.152
Neg logit contributions
 etiquette-0.161
atar-0.156
 aquatic-0.156
 atmosp-0.149
edia-0.148
Token September
Token ID2693
Feature activation-4.63e-01

Pos logit contributions
���+1.394
龍�+1.369
 Written+1.286
 fixme+1.278
+1.098
Neg logit contributions
 etiquette-1.463
atar-1.431
 aquatic-1.422
 therap-1.378
 atmosp-1.357
Token 20
Token ID1160
Feature activation-2.17e-01

Pos logit contributions
���+1.399
龍�+1.333
 fixme+1.227
 Written+1.218
+1.103
Neg logit contributions
 etiquette-1.131
atar-1.101
 aquatic-1.100
 atmosp-1.046
edia-1.036
Token 2011
Token ID2813
Feature activation+1.47e-02

Pos logit contributions
���+0.510
龍�+0.491
 Written+0.449
 fixme+0.448
+0.392
Neg logit contributions
 etiquette-0.583
 aquatic-0.568
atar-0.568
 atmosp-0.543
edia-0.541
Token United
Token ID1578
Feature activation+2.93e-01

Pos logit contributions
 etiquette+0.026
 aquatic+0.025
atar+0.025
 atmosp+0.023
edia+0.023
Neg logit contributions
���-0.054
龍�-0.053
 fixme-0.049
 Written-0.049
-0.045
Token States
Token ID1829
Feature activation+1.76e-01

Pos logit contributions
 etiquette+0.482
atar+0.463
 aquatic+0.462
 atmosp+0.427
edia+0.422
Neg logit contributions
���-1.124
龍�-1.082
 fixme-1.015
 Written-1.006
-0.936
Token 6
Token ID718
Feature activation+1.49e-01

Pos logit contributions
 etiquette+0.433
 aquatic+0.421
atar+0.421
 atmosp+0.401
edia+0.397
Neg logit contributions
���-0.526
龍�-0.506
 fixme-0.466
 Written-0.463
-0.418
Token12
Token ID1065
Feature activation-1.60e-01

Pos logit contributions
 etiquette+0.391
atar+0.381
 aquatic+0.381
 atmosp+0.363
edia+0.361
Neg logit contributions
���-0.427
龍�-0.405
 fixme-0.371
 Written-0.366
-0.330
Token Posts
Token ID12043
Feature activation-2.04e+00

Pos logit contributions
���+0.385
龍�+0.359
 fixme+0.321
 Written+0.315
+0.278
Neg logit contributions
 etiquette-0.493
atar-0.484
 aquatic-0.483
 atmosp-0.464
edia-0.461
Token #
Token ID1303
Feature activation-1.48e+00

Pos logit contributions
���+4.921
龍�+4.608
 Written+4.313
 fixme+4.146
EStream+3.708
Neg logit contributions
 etiquette-5.873
atar-5.772
 aquatic-5.754
 atmosp-5.567
edia-5.530
Token17
Token ID1558
Feature activation-1.58e-01

Pos logit contributions
龍�+3.400
���+3.133
 Written+3.112
 fixme+3.105
written+2.679
Neg logit contributions
 etiquette-4.469
 therap-4.418
 aquatic-4.309
 atmosp-4.213
atar-4.182
Token Good
Token ID4599
Feature activation+2.69e-01

Pos logit contributions
���+0.467
龍�+0.446
 fixme+0.414
 Written+0.412
+0.368
Neg logit contributions
 etiquette-0.392
atar-0.381
 aquatic-0.380
 atmosp-0.363
edia-0.361
Token to
Token ID284
Feature activation+9.81e-02

Pos logit contributions
 etiquette+0.709
 aquatic+0.690
atar+0.689
 atmosp+0.659
edia+0.653
Neg logit contributions
���-0.762
龍�-0.724
 fixme-0.661
 Written-0.653
-0.589
Token hear
Token ID3285
Feature activation+2.15e-01

Pos logit contributions
 etiquette+0.233
atar+0.226
 aquatic+0.226
 atmosp+0.215
edia+0.213
Neg logit contributions
���-0.304
龍�-0.290
 fixme-0.268
 Written-0.265
-0.241
Token 2010
Token ID3050
Feature activation+2.27e-01

Pos logit contributions
 etiquette+0.064
 aquatic+0.062
atar+0.062
 atmosp+0.059
edia+0.059
Neg logit contributions
���-0.057
龍�-0.055
 fixme-0.050
 Written-0.050
-0.044
Token United
Token ID1578
Feature activation+3.32e-01

Pos logit contributions
 etiquette+0.385
 aquatic+0.369
atar+0.368
 atmosp+0.343
edia+0.337
Neg logit contributions
���-0.857
龍�-0.827
 fixme-0.774
 Written-0.770
-0.713
Token States
Token ID1829
Feature activation+2.07e-01

Pos logit contributions
 etiquette+0.543
atar+0.522
 aquatic+0.520
 atmosp+0.482
edia+0.476
Neg logit contributions
���-1.272
龍�-1.224
 fixme-1.149
 Written-1.138
-1.060
Token 21
Token ID2310
Feature activation+2.89e-01

Pos logit contributions
 etiquette+0.507
 aquatic+0.493
atar+0.492
 atmosp+0.469
edia+0.464
Neg logit contributions
���-0.620
龍�-0.595
 fixme-0.548
 Written-0.544
-0.491
Token50
Token ID1120
Feature activation-2.56e-01

Pos logit contributions
 etiquette+0.829
atar+0.810
 aquatic+0.810
 atmosp+0.776
edia+0.771
Neg logit contributions
���-0.754
龍�-0.711
 fixme-0.643
 Written-0.634
-0.565
Token Posts
Token ID12043
Feature activation-1.99e+00

Pos logit contributions
���+0.569
龍�+0.545
 fixme+0.486
 Written+0.483
+0.416
Neg logit contributions
 etiquette-0.822
 aquatic-0.804
atar-0.800
 atmosp-0.773
edia-0.768
Token #
Token ID1303
Feature activation-1.40e+00

Pos logit contributions
���+4.817
龍�+4.513
 Written+4.225
 fixme+4.070
EStream+3.636
Neg logit contributions
 etiquette-5.705
atar-5.613
 aquatic-5.590
 atmosp-5.422
edia-5.379
Token16
Token ID1433
Feature activation+1.22e-02

Pos logit contributions
龍�+3.228
���+3.038
 fixme+2.935
 Written+2.912
+2.582
Neg logit contributions
 etiquette-4.195
 aquatic-4.114
 therap-4.097
 atmosp-3.983
atar-3.953
Token Glad
Token ID28442
Feature activation-3.96e-01

Pos logit contributions
 etiquette+0.030
atar+0.029
 aquatic+0.029
 atmosp+0.028
edia+0.028
Neg logit contributions
���-0.036
龍�-0.035
 fixme-0.032
 Written-0.032
-0.029
Token to
Token ID284
Feature activation-4.48e-02

Pos logit contributions
���+1.163
龍�+1.121
 fixme+1.031
 Written+1.029
+0.923
Neg logit contributions
 etiquette-0.973
atar-0.955
 aquatic-0.949
 atmosp-0.905
edia-0.899
Token hear
Token ID3285
Feature activation-1.12e-01

Pos logit contributions
���+0.142
龍�+0.136
 fixme+0.126
 Written+0.125
+0.113
Neg logit contributions
 etiquette-0.102
atar-0.100
 aquatic-0.099
 atmosp-0.094
edia-0.094
Token 2011
Token ID2813
Feature activation+1.58e-01

Pos logit contributions
���+0.193
龍�+0.188
 Written+0.173
 fixme+0.172
+0.149
Neg logit contributions
 etiquette-0.224
 aquatic-0.218
atar-0.217
 atmosp-0.208
 therap-0.208
Token United
Token ID1578
Feature activation+3.61e-01

Pos logit contributions
 etiquette+0.277
 aquatic+0.267
atar+0.265
 atmosp+0.248
edia+0.244
Neg logit contributions
���-0.585
龍�-0.566
 fixme-0.529
 Written-0.528
-0.486
Token States
Token ID1829
Feature activation+2.03e-01

Pos logit contributions
 etiquette+0.588
atar+0.566
 aquatic+0.563
 atmosp+0.521
edia+0.516
Neg logit contributions
���-1.385
龍�-1.333
 fixme-1.253
 Written-1.238
-1.156
Token 228
Token ID29041
Feature activation+2.47e-01

Pos logit contributions
 etiquette+0.498
 aquatic+0.484
atar+0.484
 atmosp+0.460
edia+0.456
Neg logit contributions
���-0.607
龍�-0.582
 fixme-0.537
 Written-0.534
-0.481
Token65
Token ID2996
Feature activation-1.44e-01

Pos logit contributions
 etiquette+0.652
atar+0.642
 aquatic+0.636
 atmosp+0.608
edia+0.607
Neg logit contributions
���-0.707
龍�-0.660
 fixme-0.600
 Written-0.587
-0.536
Token Posts
Token ID12043
Feature activation-1.98e+00

Pos logit contributions
���+0.336
龍�+0.314
 fixme+0.280
 Written+0.276
+0.242
Neg logit contributions
 etiquette-0.452
atar-0.442
 aquatic-0.442
 atmosp-0.425
edia-0.423
Token #
Token ID1303
Feature activation-1.39e+00

Pos logit contributions
���+4.772
龍�+4.457
 Written+4.189
 fixme+4.002
EStream+3.577
Neg logit contributions
 etiquette-5.694
atar-5.621
 aquatic-5.592
 atmosp-5.409
edia-5.375
Token13
Token ID1485
Feature activation-1.80e-01

Pos logit contributions
龍�+3.506
 Written+3.354
 fixme+3.298
���+3.186
written+2.948
Neg logit contributions
 etiquette-3.831
 therap-3.797
 aquatic-3.665
 atmosp-3.583
atar-3.566
Token On
Token ID1550
Feature activation-5.39e-01

Pos logit contributions
���+0.527
龍�+0.506
 fixme+0.470
 Written+0.468
+0.417
Neg logit contributions
 etiquette-0.445
atar-0.433
 aquatic-0.431
 atmosp-0.413
edia-0.410
Token September
Token ID2693
Feature activation-3.92e-01

Pos logit contributions
���+1.379
龍�+1.354
 Written+1.276
 fixme+1.261
+1.085
Neg logit contributions
 etiquette-1.468
atar-1.439
 aquatic-1.426
 therap-1.379
 atmosp-1.361
Token 20
Token ID1160
Feature activation-1.83e-01

Pos logit contributions
���+1.158
龍�+1.104
 fixme+1.018
 Written+1.014
+0.913
Neg logit contributions
 etiquette-0.980
atar-0.955
 aquatic-0.954
 atmosp-0.906
edia-0.899
Token
Token ID198
Feature activation-1.20e-01

Pos logit contributions
���+0.474
龍�+0.469
 fixme+0.431
 Written+0.428
+0.375
Neg logit contributions
 etiquette-0.534
 aquatic-0.519
atar-0.516
 therap-0.500
 atmosp-0.497
Tokenre
Token ID260
Feature activation-2.50e-01

Pos logit contributions
���+0.340
龍�+0.329
 fixme+0.299
 Written+0.297
+0.270
Neg logit contributions
 etiquette-0.310
 aquatic-0.302
atar-0.299
 atmosp-0.289
edia-0.283
Tokeni
Token ID72
Feature activation-5.43e-01

Pos logit contributions
���+0.742
龍�+0.720
 fixme+0.656
 Written+0.654
+0.594
Neg logit contributions
 etiquette-0.612
 aquatic-0.596
atar-0.588
 atmosp-0.569
 therap-0.554
Token Profile
Token ID13118
Feature activation-8.23e-01

Pos logit contributions
���+1.579
龍�+1.523
 Written+1.385
 fixme+1.383
+1.255
Neg logit contributions
 etiquette-1.367
 aquatic-1.329
atar-1.315
 atmosp-1.271
edia-1.245
Token Blog
Token ID14001
Feature activation-5.98e-01

Pos logit contributions
���+1.010
龍�+0.902
 Written+0.713
 fixme+0.708
+0.505
Neg logit contributions
 etiquette-3.464
atar-3.402
 aquatic-3.400
 atmosp-3.306
edia-3.302
Token Joined
Token ID16709
Feature activation-1.95e+00

Pos logit contributions
���+0.438
龍�+0.381
 fixme+0.245
 Written+0.239
+0.085
Neg logit contributions
 etiquette-2.805
 aquatic-2.756
atar-2.756
 atmosp-2.691
edia-2.672
Token October
Token ID3267
Feature activation-3.07e-01

Pos logit contributions
龍�+4.227
���+4.178
 Written+4.144
 fixme+3.943
 Joined+3.459
Neg logit contributions
 etiquette-5.817
 therap-5.626
atar-5.621
 aquatic-5.598
edia-5.419
Token 2002
Token ID6244
Feature activation+3.47e-01

Pos logit contributions
���+0.765
龍�+0.739
 fixme+0.677
 Written+0.676
+0.591
Neg logit contributions
 etiquette-0.894
atar-0.868
 aquatic-0.866
 atmosp-0.831
edia-0.828
Token United
Token ID1578
Feature activation+1.81e-01

Pos logit contributions
 etiquette+0.615
 aquatic+0.591
atar+0.591
 atmosp+0.550
edia+0.544
Neg logit contributions
���-1.286
龍�-1.237
 fixme-1.156
 Written-1.147
-1.063
Token States
Token ID1829
Feature activation+1.07e-01

Pos logit contributions
 etiquette+0.303
 aquatic+0.291
atar+0.291
 atmosp+0.270
edia+0.266
Neg logit contributions
���-0.686
龍�-0.660
 fixme-0.618
 Written-0.613
-0.570
Token 3
Token ID513
Feature activation+2.79e-01

Pos logit contributions
 etiquette+0.265
 aquatic+0.258
atar+0.257
 atmosp+0.245
edia+0.243
Neg logit contributions
���-0.317
龍�-0.305
 fixme-0.280
 Written-0.279
-0.251
Token
Token ID198
Feature activation-4.85e-02

Pos logit contributions
���+0.352
龍�+0.346
 fixme+0.318
 Written+0.315
+0.278
Neg logit contributions
 etiquette-0.389
 aquatic-0.378
atar-0.377
 atmosp-0.362
 therap-0.362
TokenDark
Token ID17367
Feature activation-7.05e-02

Pos logit contributions
���+0.138
龍�+0.132
 fixme+0.121
 Written+0.120
+0.108
Neg logit contributions
 etiquette-0.126
 aquatic-0.123
atar-0.122
 atmosp-0.118
edia-0.116
Tokenhorse
Token ID30527
Feature activation-4.33e-01

Pos logit contributions
���+0.208
龍�+0.200
 fixme+0.182
 Written+0.182
+0.165
Neg logit contributions
 etiquette-0.176
 aquatic-0.172
atar-0.171
 atmosp-0.163
edia-0.162
Token Profile
Token ID13118
Feature activation-9.86e-01

Pos logit contributions
���+1.232
龍�+1.183
 Written+1.076
 fixme+1.075
+0.969
Neg logit contributions
 etiquette-1.123
 aquatic-1.092
atar-1.089
 atmosp-1.045
edia-1.027
Token Blog
Token ID14001
Feature activation-6.31e-01

Pos logit contributions
���+1.357
龍�+1.218
 Written+1.009
 fixme+0.994
+0.747
Neg logit contributions
 etiquette-3.991
atar-3.921
 aquatic-3.911
 atmosp-3.807
edia-3.791
Token Joined
Token ID16709
Feature activation-1.94e+00

Pos logit contributions
���+0.417
龍�+0.374
 Written+0.242
 fixme+0.238
+0.063
Neg logit contributions
 etiquette-2.980
 aquatic-2.929
atar-2.920
 therap-2.860
 atmosp-2.856
Token December
Token ID3426
Feature activation-7.75e-02

Pos logit contributions
龍�+4.094
���+4.055
 Written+4.055
 fixme+3.836
 Joined+3.355
Neg logit contributions
 etiquette-5.811
 therap-5.671
atar-5.650
 aquatic-5.604
edia-5.444
Token 2011
Token ID2813
Feature activation+1.58e-01

Pos logit contributions
���+0.193
龍�+0.188
 Written+0.173
 fixme+0.172
+0.149
Neg logit contributions
 etiquette-0.224
 aquatic-0.218
atar-0.217
 atmosp-0.208
 therap-0.208
Token United
Token ID1578
Feature activation+3.61e-01

Pos logit contributions
 etiquette+0.277
 aquatic+0.267
atar+0.265
 atmosp+0.248
edia+0.244
Neg logit contributions
���-0.585
龍�-0.566
 fixme-0.529
 Written-0.528
-0.486
Token States
Token ID1829
Feature activation+2.03e-01

Pos logit contributions
 etiquette+0.588
atar+0.566
 aquatic+0.563
 atmosp+0.521
edia+0.516
Neg logit contributions
���-1.385
龍�-1.333
 fixme-1.253
 Written-1.238
-1.156
Token 228
Token ID29041
Feature activation+2.47e-01

Pos logit contributions
 etiquette+0.498
 aquatic+0.484
atar+0.484
 atmosp+0.460
edia+0.456
Neg logit contributions
���-0.607
龍�-0.582
 fixme-0.537
 Written-0.534
-0.481
Token 2010
Token ID3050
Feature activation+1.30e-01

Pos logit contributions
���+0.477
龍�+0.465
 Written+0.426
 fixme+0.425
+0.369
Neg logit contributions
 etiquette-0.564
 aquatic-0.550
atar-0.547
 atmosp-0.525
 therap-0.525
Token United
Token ID1578
Feature activation+3.13e-01

Pos logit contributions
 etiquette+0.231
 aquatic+0.222
atar+0.221
 atmosp+0.207
edia+0.203
Neg logit contributions
���-0.479
龍�-0.463
 fixme-0.433
 Written-0.432
-0.398
Token States
Token ID1829
Feature activation+1.70e-01

Pos logit contributions
 etiquette+0.535
 aquatic+0.514
atar+0.509
 atmosp+0.475
edia+0.467
Neg logit contributions
���-1.169
龍�-1.137
 fixme-1.063
 Written-1.059
-0.977
Token 6
Token ID718
Feature activation+3.00e-01

Pos logit contributions
 etiquette+0.420
 aquatic+0.408
atar+0.408
 atmosp+0.388
edia+0.384
Neg logit contributions
���-0.503
龍�-0.484
 fixme-0.445
 Written-0.443
-0.399
Token83
Token ID5999
Feature activation-2.78e-03

Pos logit contributions
 etiquette+0.782
atar+0.765
 aquatic+0.762
 atmosp+0.728
edia+0.723
Neg logit contributions
���-0.863
龍�-0.815
 fixme-0.745
 Written-0.734
-0.663
Token Posts
Token ID12043
Feature activation-1.94e+00

Pos logit contributions
���+0.007
龍�+0.006
 fixme+0.005
 Written+0.005
+0.005
Neg logit contributions
 etiquette-0.009
atar-0.008
 aquatic-0.008
 atmosp-0.008
edia-0.008
Token Last
Token ID4586
Feature activation-5.24e-01

Pos logit contributions
���+4.733
龍�+4.440
 Written+4.153
 fixme+4.013
EStream+3.587
Neg logit contributions
 etiquette-5.537
atar-5.448
 aquatic-5.427
 atmosp-5.253
edia-5.204
Token Edited
Token ID34212
Feature activation-1.79e+00

Pos logit contributions
���+0.568
龍�+0.555
 Written+0.445
 fixme+0.433
+0.280
Neg logit contributions
 etiquette-2.248
 aquatic-2.200
atar-2.197
 atmosp-2.140
 therap-2.139
Token:
Token ID25
Feature activation-6.93e-02

Pos logit contributions
龍�+4.038
 Written+3.847
 fixme+3.809
���+3.603
written+3.208
Neg logit contributions
 therap-5.404
 etiquette-5.388
 aquatic-5.139
 advoc-5.111
 charism-5.050
Token 2013
Token ID2211
Feature activation-2.09e-01

Pos logit contributions
���+0.187
龍�+0.183
 Written+0.172
 fixme+0.171
+0.148
Neg logit contributions
 etiquette-0.185
 aquatic-0.179
atar-0.178
 therap-0.173
 atmosp-0.171
Token-
Token ID12
Feature activation-1.91e-01

Pos logit contributions
���+0.621
龍�+0.600
 fixme+0.554
 Written+0.550
+0.497
Neg logit contributions
 etiquette-0.512
 aquatic-0.498
atar-0.496
 atmosp-0.472
edia-0.469
Token was
Token ID373
Feature activation-8.02e-02

Pos logit contributions
 etiquette+0.178
 aquatic+0.173
atar+0.173
 atmosp+0.164
edia+0.162
Neg logit contributions
���-0.249
龍�-0.238
 fixme-0.221
 Written-0.220
-0.199
Token about
Token ID546
Feature activation+1.33e-01

Pos logit contributions
���+0.244
龍�+0.232
 Written+0.218
 fixme+0.216
+0.193
Neg logit contributions
 etiquette-0.190
atar-0.186
 aquatic-0.185
 atmosp-0.177
edia-0.175
Token homosexuality
Token ID19822
Feature activation+6.28e-02

Pos logit contributions
 etiquette+0.315
atar+0.309
 aquatic+0.307
 atmosp+0.292
edia+0.291
Neg logit contributions
���-0.405
龍�-0.386
 fixme-0.357
 Written-0.356
-0.321
Token
Token ID198
Feature activation-4.36e-02

Pos logit contributions
 etiquette+0.145
 aquatic+0.141
atar+0.141
 atmosp+0.134
edia+0.132
Neg logit contributions
���-0.196
龍�-0.188
 fixme-0.174
 Written-0.174
-0.157
Token
Token ID198
Feature activation+2.92e-02

Pos logit contributions
���+0.118
龍�+0.113
 fixme+0.103
 Written+0.102
+0.091
Neg logit contributions
 etiquette-0.120
 aquatic-0.117
atar-0.116
 atmosp-0.112
edia-0.110
TokenReviewer
Token ID35407
Feature activation-1.92e+00

Pos logit contributions
 etiquette+0.077
 aquatic+0.075
atar+0.075
 atmosp+0.072
edia+0.070
Neg logit contributions
���-0.082
龍�-0.079
 fixme-0.072
 Written-0.071
-0.064
Token:
Token ID25
Feature activation-2.79e-01

Pos logit contributions
龍�+4.458
���+4.377
 Written+4.221
 fixme+4.211
+3.543
Neg logit contributions
 etiquette-5.403
 therap-5.259
 aquatic-5.250
atar-5.209
 atmosp-5.035
Token k
Token ID479
Feature activation-1.22e-01

Pos logit contributions
���+0.852
龍�+0.814
 fixme+0.753
 Written+0.749
+0.675
Neg logit contributions
 etiquette-0.664
atar-0.650
 aquatic-0.645
 atmosp-0.613
edia-0.613
Tokenimp
Token ID11011
Feature activation+3.30e-02

Pos logit contributions
���+0.352
龍�+0.338
 fixme+0.309
 Written+0.307
+0.276
Neg logit contributions
 etiquette-0.314
 aquatic-0.306
atar-0.303
 atmosp-0.292
edia-0.288
Tokenunk
Token ID2954
Feature activation-2.53e-01

Pos logit contributions
 etiquette+0.081
 aquatic+0.079
atar+0.079
 atmosp+0.075
edia+0.074
Neg logit contributions
���-0.100
龍�-0.095
 fixme-0.087
 Written-0.087
-0.079
Tokenrock
Token ID10823
Feature activation+1.05e-01

Pos logit contributions
���+0.812
龍�+0.780
 fixme+0.721
 Written+0.718
+0.654
Neg logit contributions
 etiquette-0.564
 aquatic-0.547
atar-0.544
 atmosp-0.519
edia-0.509
TokenJ
Token ID41
Feature activation-1.56e-01

Pos logit contributions
 etiquette+0.111
 aquatic+0.108
atar+0.107
 atmosp+0.103
edia+0.101
Neg logit contributions
���-0.120
龍�-0.116
 fixme-0.105
 Written-0.104
-0.094
Tokenaaaa
Token ID24794
Feature activation+2.77e-01

Pos logit contributions
���+0.465
龍�+0.453
 fixme+0.413
 Written+0.411
+0.373
Neg logit contributions
 etiquette-0.378
 aquatic-0.368
atar-0.361
 atmosp-0.350
edia-0.340
Tokenasper
Token ID32981
Feature activation+1.21e-03

Pos logit contributions
 etiquette+0.629
 aquatic+0.610
atar+0.606
 atmosp+0.579
edia+0.568
Neg logit contributions
���-0.886
龍�-0.851
 fixme-0.785
 Written-0.779
-0.712
Token Profile
Token ID13118
Feature activation-9.22e-01

Pos logit contributions
 etiquette+0.003
 aquatic+0.003
atar+0.003
 atmosp+0.003
edia+0.003
Neg logit contributions
���-0.003
龍�-0.003
 fixme-0.003
 Written-0.003
-0.003
Token Blog
Token ID14001
Feature activation-5.85e-01

Pos logit contributions
���+1.141
龍�+1.022
 Written+0.827
 fixme+0.813
+0.583
Neg logit contributions
 etiquette-3.850
 aquatic-3.781
atar-3.777
 atmosp-3.677
edia-3.668
Token Joined
Token ID16709
Feature activation-1.91e+00

Pos logit contributions
���+0.393
龍�+0.350
 fixme+0.226
 Written+0.223
+0.063
Neg logit contributions
 etiquette-2.759
 aquatic-2.712
atar-2.707
 atmosp-2.646
 therap-2.641
Token April
Token ID3035
Feature activation-1.72e-01

Pos logit contributions
龍�+4.043
 Written+4.007
���+3.986
 fixme+3.791
 Joined+3.321
Neg logit contributions
 etiquette-5.712
 therap-5.573
atar-5.539
 aquatic-5.511
edia-5.341
Token 2012
Token ID2321
Feature activation+3.10e-01

Pos logit contributions
���+0.418
龍�+0.407
 Written+0.378
 fixme+0.373
+0.324
Neg logit contributions
 etiquette-0.501
 aquatic-0.488
atar-0.487
 therap-0.471
 atmosp-0.466
Token United
Token ID1578
Feature activation+3.38e-01

Pos logit contributions
 etiquette+0.536
 aquatic+0.516
atar+0.514
 atmosp+0.479
edia+0.472
Neg logit contributions
���-1.159
龍�-1.119
 fixme-1.047
 Written-1.041
-0.963
Token States
Token ID1829
Feature activation+1.99e-01

Pos logit contributions
 etiquette+0.561
atar+0.539
 aquatic+0.538
 atmosp+0.498
edia+0.492
Neg logit contributions
���-1.291
龍�-1.243
 fixme-1.166
 Written-1.155
-1.075
Token 5
Token ID642
Feature activation+1.62e-01

Pos logit contributions
 etiquette+0.489
 aquatic+0.476
atar+0.475
 atmosp+0.452
edia+0.448
Neg logit contributions
���-0.595
龍�-0.571
 fixme-0.526
 Written-0.524
-0.471
Token June
Token ID2795
Feature activation-3.58e-02

Pos logit contributions
���+3.573
龍�+3.558
 Written+3.467
 fixme+3.333
+2.862
Neg logit contributions
 etiquette-4.872
atar-4.746
 aquatic-4.706
 therap-4.701
edia-4.547
Token 2009
Token ID3717
Feature activation+2.51e-01

Pos logit contributions
���+0.091
龍�+0.088
 fixme+0.080
 Written+0.080
+0.070
Neg logit contributions
 etiquette-0.103
atar-0.100
 aquatic-0.100
 atmosp-0.096
 therap-0.095
Token Germany
Token ID4486
Feature activation+2.39e-01

Pos logit contributions
 etiquette+0.400
 aquatic+0.384
atar+0.382
 atmosp+0.354
edia+0.348
Neg logit contributions
���-0.973
龍�-0.941
 fixme-0.882
 Written-0.878
-0.814
Token 247
Token ID30179
Feature activation-9.16e-02

Pos logit contributions
 etiquette+0.589
 aquatic+0.573
atar+0.572
 atmosp+0.544
edia+0.539
Neg logit contributions
���-0.713
龍�-0.686
 fixme-0.631
 Written-0.627
-0.565
Token34
Token ID2682
Feature activation-2.99e-02

Pos logit contributions
���+0.237
龍�+0.223
 fixme+0.202
 Written+0.199
+0.177
Neg logit contributions
 etiquette-0.264
atar-0.259
 aquatic-0.258
 atmosp-0.248
edia-0.246
Token Posts
Token ID12043
Feature activation-1.90e+00

Pos logit contributions
���+0.070
龍�+0.065
 fixme+0.058
 Written+0.057
+0.050
Neg logit contributions
 etiquette-0.094
atar-0.092
 aquatic-0.092
 atmosp-0.088
edia-0.088
Token #
Token ID1303
Feature activation-1.27e+00

Pos logit contributions
���+4.623
龍�+4.344
 Written+4.071
 fixme+3.924
EStream+3.500
Neg logit contributions
 etiquette-5.430
atar-5.341
 aquatic-5.324
 atmosp-5.144
edia-5.099
Token20
Token ID1238
Feature activation-3.28e-01

Pos logit contributions
龍�+2.443
���+2.298
 fixme+2.144
 Written+2.139
+1.830
Neg logit contributions
 etiquette-4.328
 aquatic-4.210
 therap-4.174
atar-4.165
 atmosp-4.061
Token Nice
Token ID18460
Feature activation+3.43e-02

Pos logit contributions
���+0.962
龍�+0.919
 fixme+0.857
 Written+0.853
+0.759
Neg logit contributions
 etiquette-0.813
atar-0.792
 aquatic-0.790
 atmosp-0.754
edia-0.750
Token,
Token ID11
Feature activation+1.69e-01

Pos logit contributions
 etiquette+0.081
atar+0.080
 aquatic+0.079
 atmosp+0.075
edia+0.075
Neg logit contributions
���-0.105
龍�-0.101
 fixme-0.093
 Written-0.093
-0.084
Token happy
Token ID3772
Feature activation-4.70e-01

Pos logit contributions
 etiquette+0.395
atar+0.385
 aquatic+0.384
 atmosp+0.365
edia+0.362
Neg logit contributions
���-0.525
龍�-0.502
 fixme-0.464
 Written-0.460
-0.418
Token,
Token ID11
Feature activation-3.71e-01

Pos logit contributions
龍�+1.369
���+1.369
 fixme+1.286
 Written+1.275
+1.131
Neg logit contributions
 etiquette-1.119
 aquatic-1.080
atar-1.072
 therap-1.041
 atmosp-1.024
Token I
Token ID314
Feature activation-1.99e-01

Pos logit contributions
���+1.034
龍�+0.982
 fixme+0.898
 Written+0.894
+0.792
Neg logit contributions
 etiquette-0.988
atar-0.967
 aquatic-0.965
 atmosp-0.922
edia-0.913
Token think
Token ID892
Feature activation-2.89e-02

Pos logit contributions
���+0.562
龍�+0.551
 fixme+0.512
 Written+0.508
+0.448
Neg logit contributions
 etiquette-0.506
atar-0.491
 aquatic-0.487
 atmosp-0.469
edia-0.466
Token
Token ID198
Feature activation-2.09e-01

Pos logit contributions
���+0.080
龍�+0.076
 fixme+0.069
 Written+0.069
+0.061
Neg logit contributions
 etiquette-0.078
 aquatic-0.076
atar-0.076
 atmosp-0.073
edia-0.072
Token
Token ID198
Feature activation-1.53e-01

Pos logit contributions
���+0.520
龍�+0.516
 fixme+0.475
 Written+0.472
+0.412
Neg logit contributions
 etiquette-0.596
 aquatic-0.579
atar-0.575
 therap-0.559
 atmosp-0.554
TokenReviewer
Token ID35407
Feature activation-1.90e+00

Pos logit contributions
���+0.287
龍�+0.270
 fixme+0.233
 Written+0.231
+0.193
Neg logit contributions
 etiquette-0.547
 aquatic-0.537
atar-0.534
 atmosp-0.519
edia-0.514
Token:
Token ID25
Feature activation-2.84e-01

Pos logit contributions
龍�+4.197
 fixme+3.917
 Written+3.911
���+3.551
written+3.387
Neg logit contributions
 therap-5.873
 etiquette-5.700
 aquatic-5.522
 advoc-5.474
atar-5.326
Token B
Token ID347
Feature activation-2.12e-01

Pos logit contributions
���+0.859
龍�+0.821
 fixme+0.759
 Written+0.755
+0.679
Neg logit contributions
 etiquette-0.683
atar-0.668
 aquatic-0.663
 atmosp-0.631
edia-0.630
Tokeniz
Token ID528
Feature activation-5.32e-01

Pos logit contributions
���+0.646
龍�+0.621
 fixme+0.569
 Written+0.568
+0.515
Neg logit contributions
 etiquette-0.508
 aquatic-0.493
atar-0.489
 atmosp-0.469
edia-0.460
Tokenr
Token ID81
Feature activation-2.19e-01

Pos logit contributions
���+1.544
龍�+1.482
 fixme+1.352
 Written+1.349
+1.214
Neg logit contributions
 etiquette-1.351
 aquatic-1.311
atar-1.302
 atmosp-1.259
edia-1.225
Token Pix
Token ID21642
Feature activation-2.82e-03

Pos logit contributions
���+0.654
龍�+0.630
 fixme+0.577
 Written+0.576
+0.520
Neg logit contributions
 etiquette-0.533
 aquatic-0.519
atar-0.514
 atmosp-0.496
edia-0.487
TokenK
Token ID42
Feature activation-2.87e-01

Pos logit contributions
���+1.744
龍�+1.681
 fixme+1.529
 Written+1.525
+1.383
Neg logit contributions
 etiquette-1.513
 aquatic-1.472
atar-1.463
 atmosp-1.406
edia-1.384
Tokenier
Token ID959
Feature activation-4.16e-01

Pos logit contributions
���+0.837
龍�+0.805
 fixme+0.736
 Written+0.731
+0.660
Neg logit contributions
 etiquette-0.723
 aquatic-0.704
atar-0.693
 atmosp-0.672
edia-0.657
Tokenan
Token ID272
Feature activation-4.44e-01

Pos logit contributions
���+1.229
龍�+1.175
 fixme+1.079
 Written+1.069
+0.969
Neg logit contributions
 etiquette-1.040
 aquatic-1.014
atar-1.008
 atmosp-0.965
edia-0.951
Token Profile
Token ID13118
Feature activation-7.39e-01

Pos logit contributions
���+1.222
龍�+1.171
 fixme+1.059
 Written+1.059
+0.949
Neg logit contributions
 etiquette-1.193
 aquatic-1.164
atar-1.154
 atmosp-1.111
edia-1.093
Token Blog
Token ID14001
Feature activation-5.72e-01

Pos logit contributions
���+1.112
龍�+1.015
 Written+0.847
 fixme+0.843
+0.652
Neg logit contributions
 etiquette-2.904
 aquatic-2.851
atar-2.850
 atmosp-2.766
edia-2.753
Token Joined
Token ID16709
Feature activation-1.88e+00

Pos logit contributions
���+0.386
龍�+0.343
 Written+0.221
 fixme+0.219
+0.062
Neg logit contributions
 etiquette-2.700
 aquatic-2.656
atar-2.648
 atmosp-2.588
 therap-2.584
Token August
Token ID2932
Feature activation-2.02e-01

Pos logit contributions
龍�+3.971
���+3.967
 Written+3.900
 fixme+3.716
 Joined+3.213
Neg logit contributions
 etiquette-5.646
atar-5.491
 therap-5.484
 aquatic-5.452
edia-5.281
Token 2011
Token ID2813
Feature activation+1.47e-02

Pos logit contributions
���+0.510
龍�+0.491
 Written+0.449
 fixme+0.448
+0.392
Neg logit contributions
 etiquette-0.583
 aquatic-0.568
atar-0.568
 atmosp-0.543
edia-0.541
Token United
Token ID1578
Feature activation+2.93e-01

Pos logit contributions
 etiquette+0.026
 aquatic+0.025
atar+0.025
 atmosp+0.023
edia+0.023
Neg logit contributions
���-0.054
龍�-0.053
 fixme-0.049
 Written-0.049
-0.045
Token States
Token ID1829
Feature activation+1.76e-01

Pos logit contributions
 etiquette+0.482
atar+0.463
 aquatic+0.462
 atmosp+0.427
edia+0.422
Neg logit contributions
���-1.124
龍�-1.082
 fixme-1.015
 Written-1.006
-0.936
Token 6
Token ID718
Feature activation+1.49e-01

Pos logit contributions
 etiquette+0.433
 aquatic+0.421
atar+0.421
 atmosp+0.401
edia+0.397
Neg logit contributions
���-0.526
龍�-0.506
 fixme-0.466
 Written-0.463
-0.418
Token 2010
Token ID3050
Feature activation-1.44e-01

Pos logit contributions
���+0.155
龍�+0.148
 fixme+0.135
 Written+0.134
+0.119
Neg logit contributions
 etiquette-0.167
atar-0.164
 aquatic-0.163
 atmosp-0.156
edia-0.155
Tokenboys
Token ID13202
Feature activation+5.24e-01

Pos logit contributions
���+0.389
龍�+0.371
 fixme+0.337
 Written+0.334
+0.299
Neg logit contributions
 etiquette-0.397
 aquatic-0.388
atar-0.386
 atmosp-0.371
edia-0.367
Token beware
Token ID40600
Feature activation-3.39e-01

Pos logit contributions
 etiquette+1.572
 aquatic+1.536
atar+1.531
 atmosp+1.472
edia+1.461
Neg logit contributions
���-1.300
龍�-1.225
 fixme-1.099
 Written-1.085
-0.964
Token
Token ID198
Feature activation-3.00e-01

Pos logit contributions
���+0.926
龍�+0.878
 fixme+0.803
 Written+0.798
+0.709
Neg logit contributions
 etiquette-0.925
 aquatic-0.903
atar-0.903
 atmosp-0.863
edia-0.856
Token
Token ID198
Feature activation-2.43e-01

Pos logit contributions
���+0.751
龍�+0.743
 fixme+0.685
 Written+0.680
+0.595
Neg logit contributions
 etiquette-0.855
 aquatic-0.833
atar-0.826
 therap-0.801
 atmosp-0.796
TokenReviewer
Token ID35407
Feature activation-1.88e+00

Pos logit contributions
���+0.446
龍�+0.419
 fixme+0.362
 Written+0.359
+0.300
Neg logit contributions
 etiquette-0.876
 aquatic-0.862
atar-0.856
 atmosp-0.833
edia-0.825
Token:
Token ID25
Feature activation-2.35e-01

Pos logit contributions
龍�+4.170
 fixme+3.887
 Written+3.867
���+3.534
written+3.368
Neg logit contributions
 therap-5.834
 etiquette-5.639
 aquatic-5.475
 advoc-5.427
atar-5.281
Token j
Token ID474
Feature activation+2.30e-01

Pos logit contributions
���+0.716
龍�+0.685
 fixme+0.633
 Written+0.630
+0.567
Neg logit contributions
 etiquette-0.562
atar-0.549
 aquatic-0.545
 atmosp-0.519
edia-0.518
Tokenwt
Token ID46569
Feature activation+6.26e-02

Pos logit contributions
 etiquette+0.577
 aquatic+0.561
atar+0.559
 atmosp+0.534
edia+0.528
Neg logit contributions
���-0.679
龍�-0.650
 fixme-0.596
 Written-0.591
-0.534
Token0001
Token ID18005
Feature activation+4.13e-02

Pos logit contributions
 etiquette+0.155
 aquatic+0.151
atar+0.150
 atmosp+0.143
edia+0.142
Neg logit contributions
���-0.187
龍�-0.179
 fixme-0.164
 Written-0.163
-0.147
Token -
Token ID532
Feature activation-7.30e-01

Pos logit contributions
 etiquette+0.083
atar+0.081
 aquatic+0.081
 atmosp+0.076
edia+0.075
Neg logit contributions
���-0.142
龍�-0.137
 fixme-0.127
 Written-0.126
-0.116
Token TL
Token ID24811
Feature activation+2.39e-01

Pos logit contributions
 etiquette+0.220
 aquatic+0.211
atar+0.209
 atmosp+0.195
edia+0.192
Neg logit contributions
���-0.503
龍�-0.486
 fixme-0.455
 Written-0.453
-0.419
TokenAD
Token ID2885
Feature activation+5.98e-01

Pos logit contributions
 etiquette+0.610
 aquatic+0.594
atar+0.593
 atmosp+0.567
edia+0.559
Neg logit contributions
���-0.698
龍�-0.666
 fixme-0.611
 Written-0.606
-0.546
TokenT
Token ID51
Feature activation-1.00e-01

Pos logit contributions
 etiquette+1.486
atar+1.449
 aquatic+1.445
 atmosp+1.374
edia+1.366
Neg logit contributions
���-1.791
龍�-1.701
 fixme-1.564
 Written-1.544
-1.403
Token 235
Token ID28878
Feature activation+1.95e-02

Pos logit contributions
���+0.296
龍�+0.284
 fixme+0.259
 Written+0.259
+0.232
Neg logit contributions
 etiquette-0.250
 aquatic-0.243
atar-0.243
 atmosp-0.233
edia-0.229
Token21
Token ID2481
Feature activation+9.05e-02

Pos logit contributions
 etiquette+0.053
 aquatic+0.052
atar+0.052
 atmosp+0.050
edia+0.049
Neg logit contributions
���-0.053
龍�-0.051
 fixme-0.046
 Written-0.045
-0.041
Token Posts
Token ID12043
Feature activation-1.87e+00

Pos logit contributions
 etiquette+0.285
 aquatic+0.279
atar+0.279
 atmosp+0.268
edia+0.267
Neg logit contributions
���-0.210
龍�-0.197
 fixme-0.176
 Written-0.174
-0.152
Token #
Token ID1303
Feature activation-1.34e+00

Pos logit contributions
���+4.575
龍�+4.281
 Written+4.000
 fixme+3.871
EStream+3.445
Neg logit contributions
 etiquette-5.364
atar-5.277
 aquatic-5.252
 atmosp-5.088
edia-5.048
Token18
Token ID1507
Feature activation-1.90e-02

Pos logit contributions
龍�+3.170
���+2.941
 Written+2.897
 fixme+2.883
+2.471
Neg logit contributions
 etiquette-3.938
 aquatic-3.802
 therap-3.798
atar-3.734
 atmosp-3.675
Token Good
Token ID4599
Feature activation+3.14e-01

Pos logit contributions
���+0.056
龍�+0.054
 fixme+0.050
 Written+0.049
+0.044
Neg logit contributions
 etiquette-0.047
atar-0.046
 aquatic-0.046
 atmosp-0.043
edia-0.043
Token stuff
Token ID3404
Feature activation+1.56e-01

Pos logit contributions
 etiquette+0.814
 aquatic+0.793
atar+0.792
 atmosp+0.756
edia+0.749
Neg logit contributions
���-0.905
龍�-0.861
 fixme-0.787
 Written-0.779
-0.703
Token!
Token ID0
Feature activation-8.56e-02

Pos logit contributions
 etiquette+0.352
 aquatic+0.341
atar+0.341
 atmosp+0.323
edia+0.320
Neg logit contributions
���-0.504
龍�-0.482
 fixme-0.445
 Written-0.441
-0.403
Tokeniler
Token ID5329
Feature activation-9.11e-03

Pos logit contributions
 etiquette+0.907
 aquatic+0.889
atar+0.880
 atmosp+0.851
 therap+0.843
Neg logit contributions
���-0.676
龍�-0.655
 fixme-0.592
 Written-0.582
-0.502
Token up
Token ID510
Feature activation+1.69e-01

Pos logit contributions
���+0.028
龍�+0.026
 fixme+0.024
 Written+0.024
+0.022
Neg logit contributions
 etiquette-0.022
 aquatic-0.021
atar-0.021
 atmosp-0.020
edia-0.020
Token as
Token ID355
Feature activation+1.61e-01

Pos logit contributions
 etiquette+0.390
 aquatic+0.379
atar+0.378
 atmosp+0.358
edia+0.356
Neg logit contributions
���-0.535
龍�-0.512
 fixme-0.473
 Written-0.469
-0.428
Token the
Token ID262
Feature activation+5.88e-02

Pos logit contributions
 etiquette+0.362
 aquatic+0.351
atar+0.351
 atmosp+0.332
edia+0.329
Neg logit contributions
���-0.519
龍�-0.496
 fixme-0.459
 Written-0.455
-0.415
Token presumptive
Token ID39715
Feature activation-3.87e-01

Pos logit contributions
 etiquette+0.146
 aquatic+0.142
atar+0.142
 atmosp+0.135
edia+0.134
Neg logit contributions
���-0.176
龍�-0.168
 fixme-0.154
 Written-0.152
-0.138
Token franchise
Token ID8663
Feature activation+4.66e-01

Pos logit contributions
���+1.077
龍�+1.026
 fixme+0.942
 Written+0.938
+0.831
Neg logit contributions
 etiquette-1.020
atar-1.002
 aquatic-0.993
 atmosp-0.951
edia-0.951
Token savior
Token ID47921
Feature activation+3.07e-01

Pos logit contributions
 etiquette+1.282
 aquatic+1.244
atar+1.240
 atmosp+1.189
edia+1.178
Neg logit contributions
���-1.288
龍�-1.219
 fixme-1.107
 Written-1.089
-0.975
Token
Token ID960
Feature activation+1.02e-03

Pos logit contributions
 etiquette+0.682
 aquatic+0.661
atar+0.661
 atmosp+0.625
edia+0.620
Neg logit contributions
���-0.994
龍�-0.954
 fixme-0.883
 Written-0.876
-0.800
Tokena
Token ID64
Feature activation+9.29e-02

Pos logit contributions
 etiquette+0.003
 aquatic+0.003
atar+0.002
 atmosp+0.002
edia+0.002
Neg logit contributions
���-0.003
龍�-0.003
 fixme-0.003
 Written-0.003
-0.002
Token title
Token ID3670
Feature activation-3.55e-03

Pos logit contributions
 etiquette+0.215
atar+0.209
 aquatic+0.208
 atmosp+0.198
edia+0.196
Neg logit contributions
���-0.292
龍�-0.279
 fixme-0.258
 Written-0.256
-0.233
Token he
Token ID339
Feature activation-2.14e-01

Pos logit contributions
���+0.011
龍�+0.010
 fixme+0.009
 Written+0.009
+0.008
Neg logit contributions
 etiquette-0.009
atar-0.009
 aquatic-0.009
 atmosp-0.008
edia-0.008
Token her
Token ID607
Feature activation-7.53e-03

Pos logit contributions
���+0.845
龍�+0.806
 fixme+0.747
 Written+0.742
+0.669
Neg logit contributions
 etiquette-0.629
atar-0.620
 aquatic-0.605
edia-0.579
 atmosp-0.575
Token support
Token ID1104
Feature activation-1.40e-02

Pos logit contributions
���+0.025
龍�+0.024
 fixme+0.022
 Written+0.022
+0.020
Neg logit contributions
 etiquette-0.016
atar-0.015
 aquatic-0.015
edia-0.014
 atmosp-0.014
Token for
Token ID329
Feature activation-1.37e-01

Pos logit contributions
���+0.043
龍�+0.041
 fixme+0.038
 Written+0.037
+0.034
Neg logit contributions
 etiquette-0.033
atar-0.032
 aquatic-0.032
 atmosp-0.031
edia-0.030
Token off
Token ID572
Feature activation-2.27e-01

Pos logit contributions
���+0.425
龍�+0.408
 fixme+0.377
 Written+0.374
+0.339
Neg logit contributions
 etiquette-0.320
atar-0.311
 aquatic-0.307
edia-0.294
 atmosp-0.292
Token-
Token ID12
Feature activation+1.28e-01

Pos logit contributions
���+0.653
龍�+0.623
 fixme+0.570
 Written+0.564
+0.512
Neg logit contributions
 etiquette-0.583
 aquatic-0.566
atar-0.563
 atmosp-0.539
edia-0.532
Tokenshore
Token ID14640
Feature activation+4.08e-01

Pos logit contributions
 etiquette+0.303
 aquatic+0.291
atar+0.289
 atmosp+0.279
 therap+0.273
Neg logit contributions
���-0.397
龍�-0.382
 fixme-0.351
 Written-0.349
-0.320
Token drilling
Token ID19747
Feature activation+1.57e-01

Pos logit contributions
 etiquette+0.802
atar+0.777
 aquatic+0.768
 atmosp+0.720
edia+0.720
Neg logit contributions
���-1.418
龍�-1.366
 fixme-1.278
 Written-1.264
-1.165
Token and
Token ID290
Feature activation+1.30e-01

Pos logit contributions
 etiquette+0.368
atar+0.360
 aquatic+0.358
edia+0.342
 atmosp+0.339
Neg logit contributions
���-0.476
龍�-0.459
 fixme-0.428
 Written-0.422
-0.387
Token fracking
Token ID25441
Feature activation+3.33e-01

Pos logit contributions
 etiquette+0.294
atar+0.288
 aquatic+0.283
edia+0.270
 atmosp+0.269
Neg logit contributions
���-0.414
龍�-0.396
 fixme-0.370
 Written-0.363
-0.331
Token.
Token ID13
Feature activation+1.71e-01

Pos logit contributions
 etiquette+0.753
atar+0.732
 aquatic+0.731
edia+0.693
 atmosp+0.690
Neg logit contributions
���-1.054
龍�-1.013
 fixme-0.943
 Written-0.930
-0.852
Token It
Token ID632
Feature activation+9.02e-02

Pos logit contributions
 etiquette+0.431
 aquatic+0.418
atar+0.417
 atmosp+0.397
edia+0.394
Neg logit contributions
���-0.503
龍�-0.482
 fixme-0.443
 Written-0.440
-0.396
Token all
Token ID477
Feature activation+2.55e-01

Pos logit contributions
 etiquette+0.201
atar+0.195
 aquatic+0.194
edia+0.186
 atmosp+0.184
Neg logit contributions
���-0.240
龍�-0.231
 fixme-0.212
 Written-0.211
-0.191
Token three
Token ID1115
Feature activation+4.28e-01

Pos logit contributions
 etiquette+0.649
 aquatic+0.630
atar+0.630
 atmosp+0.600
edia+0.599
Neg logit contributions
���-0.742
龍�-0.708
 fixme-0.648
 Written-0.642
-0.580
Token trends
Token ID11257
Feature activation-4.00e-03

Pos logit contributions
 etiquette+0.947
atar+0.916
 aquatic+0.914
edia+0.868
 atmosp+0.861
Neg logit contributions
���-1.378
龍�-1.330
 fixme-1.228
 Written-1.220
-1.116
Token.
Token ID13
Feature activation+1.33e-01

Pos logit contributions
���+0.012
龍�+0.012
 Written+0.011
 fixme+0.011
+0.010
Neg logit contributions
 etiquette-0.009
 aquatic-0.009
atar-0.009
edia-0.009
 atmosp-0.009
Token Cur
Token ID4424
Feature activation+1.06e-01

Pos logit contributions
 etiquette+0.346
 aquatic+0.336
atar+0.333
 atmosp+0.319
edia+0.316
Neg logit contributions
���-0.381
龍�-0.367
 fixme-0.336
 Written-0.336
-0.299
Tokeniously
Token ID6819
Feature activation+5.34e-01

Pos logit contributions
 etiquette+0.233
atar+0.225
 aquatic+0.225
 atmosp+0.213
edia+0.211
Neg logit contributions
���-0.345
龍�-0.330
 fixme-0.306
 Written-0.303
-0.277
Token,
Token ID11
Feature activation-1.54e-01

Pos logit contributions
 etiquette+1.144
 aquatic+1.112
atar+1.111
 atmosp+1.050
edia+1.031
Neg logit contributions
���-1.773
龍�-1.696
 fixme-1.578
 Written-1.558
-1.432
Token ne
Token ID497
Feature activation-1.08e-01

Pos logit contributions
���+0.457
龍�+0.441
 fixme+0.405
 Written+0.403
+0.361
Neg logit contributions
 etiquette-0.372
atar-0.364
 aquatic-0.362
edia-0.344
 atmosp-0.343
Tokenol
Token ID349
Feature activation-2.32e-02

Pos logit contributions
���+0.224
龍�+0.217
 fixme+0.188
 Written+0.187
+0.163
Neg logit contributions
 etiquette-0.359
 aquatic-0.353
atar-0.347
 atmosp-0.337
edia-0.336
Tokeniber
Token ID1856
Feature activation+1.40e-01

Pos logit contributions
���+0.079
龍�+0.076
 fixme+0.071
 Written+0.070
+0.064
Neg logit contributions
 etiquette-0.048
 aquatic-0.046
atar-0.045
 atmosp-0.043
edia-0.042
Tokenals
Token ID874
Feature activation-7.32e-02

Pos logit contributions
 etiquette+0.162
atar+0.153
 aquatic+0.152
 atmosp+0.135
edia+0.134
Neg logit contributions
���-0.603
龍�-0.583
 fixme-0.551
 Written-0.547
-0.513
Token,
Token ID11
Feature activation+1.79e-01

Pos logit contributions
 etiquette+0.547
 aquatic+0.531
atar+0.530
 atmosp+0.503
edia+0.496
Neg logit contributions
���-0.788
龍�-0.754
 fixme-0.701
 Written-0.697
-0.634
Token and
Token ID290
Feature activation+2.49e-01

Pos logit contributions
 etiquette+0.430
atar+0.418
 aquatic+0.418
 atmosp+0.398
edia+0.394
Neg logit contributions
���-0.544
龍�-0.520
 fixme-0.481
 Written-0.480
-0.433
Token you
Token ID345
Feature activation+6.78e-02

Pos logit contributions
 etiquette+0.589
atar+0.573
 aquatic+0.573
 atmosp+0.544
edia+0.540
Neg logit contributions
���-0.768
龍�-0.733
 fixme-0.678
 Written-0.674
-0.610
Token return
Token ID1441
Feature activation+2.63e-01

Pos logit contributions
 etiquette+0.158
 aquatic+0.152
atar+0.152
 atmosp+0.145
edia+0.144
Neg logit contributions
���-0.211
龍�-0.202
 fixme-0.188
 Written-0.187
-0.168
Token the
Token ID262
Feature activation-1.34e-02

Pos logit contributions
 etiquette+0.651
atar+0.636
 aquatic+0.631
 atmosp+0.604
edia+0.601
Neg logit contributions
���-0.784
龍�-0.743
 Written-0.690
 fixme-0.689
-0.619
Token backer
Token ID37617
Feature activation+5.73e-01

Pos logit contributions
���+0.039
龍�+0.037
 fixme+0.034
 Written+0.034
+0.031
Neg logit contributions
 etiquette-0.034
atar-0.033
 aquatic-0.033
edia-0.032
 atmosp-0.032
Token survey
Token ID5526
Feature activation+2.32e-01

Pos logit contributions
 etiquette+1.360
 aquatic+1.321
atar+1.321
 atmosp+1.254
edia+1.243
Neg logit contributions
���-1.772
龍�-1.693
 fixme-1.560
 Written-1.545
-1.406
Token,
Token ID11
Feature activation+2.16e-01

Pos logit contributions
 etiquette+0.529
 aquatic+0.516
atar+0.513
 atmosp+0.489
edia+0.483
Neg logit contributions
���-0.733
龍�-0.701
 fixme-0.650
 Written-0.646
-0.588
Token I
Token ID314
Feature activation+1.65e-01

Pos logit contributions
 etiquette+0.516
 aquatic+0.503
atar+0.503
 atmosp+0.479
edia+0.473
Neg logit contributions
���-0.660
龍�-0.629
 fixme-0.583
 Written-0.581
-0.526
Token will
Token ID481
Feature activation+4.43e-01

Pos logit contributions
 etiquette+0.314
 aquatic+0.301
atar+0.300
 atmosp+0.283
edia+0.282
Neg logit contributions
���-0.587
龍�-0.562
 fixme-0.529
 Written-0.524
-0.481
Token place
Token ID1295
Feature activation-3.63e-01

Pos logit contributions
 etiquette+1.018
 aquatic+0.989
atar+0.989
 atmosp+0.940
edia+0.933
Neg logit contributions
���-1.397
龍�-1.332
 fixme-1.238
 Written-1.227
-1.115
Tokensearch
Token ID12947
Feature activation+7.53e-01

Pos logit contributions
 etiquette+1.455
atar+1.428
 aquatic+1.413
edia+1.343
 atmosp+1.341
Neg logit contributions
���-1.660
龍�-1.576
 fixme-1.445
 Written-1.426
-1.290
TokenSpace
Token ID14106
Feature activation+5.20e-02

Pos logit contributions
 etiquette+1.261
 aquatic+1.210
atar+1.205
 atmosp+1.121
edia+1.103
Neg logit contributions
���-2.844
龍�-2.755
 fixme-2.583
 Written-2.561
-2.379
Token][
Token ID7131
Feature activation-7.05e-02

Pos logit contributions
 etiquette+0.150
 aquatic+0.147
atar+0.147
 atmosp+0.141
edia+0.140
Neg logit contributions
���-0.133
龍�-0.126
 fixme-0.115
 Written-0.113
-0.101
Token[
Token ID58
Feature activation+6.02e-01

Pos logit contributions
���+0.268
龍�+0.258
 fixme+0.242
 Written+0.240
+0.223
Neg logit contributions
 etiquette-0.118
 aquatic-0.113
atar-0.113
 atmosp-0.105
edia-0.103
Tokeni
Token ID72
Feature activation-6.64e-03

Pos logit contributions
 etiquette+1.607
atar+1.577
 aquatic+1.562
edia+1.494
 atmosp+1.493
Neg logit contributions
���-1.689
龍�-1.583
 fixme-1.447
 Written-1.429
-1.290
Token]]
Token ID11907
Feature activation+4.53e-01

Pos logit contributions
���+0.019
龍�+0.019
 fixme+0.017
 Written+0.017
+0.015
Neg logit contributions
 etiquette-0.017
 aquatic-0.016
atar-0.016
 atmosp-0.016
edia-0.015
Token],
Token ID4357
Feature activation+1.61e-01

Pos logit contributions
 etiquette+1.016
atar+0.985
 aquatic+0.985
 atmosp+0.932
edia+0.923
Neg logit contributions
���-1.463
龍�-1.399
 fixme-1.291
 Written-1.280
-1.170
Token Keys
Token ID26363
Feature activation-4.97e-03

Pos logit contributions
 etiquette+0.422
 aquatic+0.411
atar+0.411
 atmosp+0.392
edia+0.389
Neg logit contributions
���-0.461
龍�-0.439
 fixme-0.401
 Written-0.397
-0.357
Token[
Token ID58
Feature activation+3.79e-01

Pos logit contributions
���+0.020
龍�+0.019
 fixme+0.018
 Written+0.018
+0.016
Neg logit contributions
 etiquette-0.008
 aquatic-0.007
atar-0.007
 atmosp-0.007
edia-0.006
Tokensearch
Token ID12947
Feature activation+6.00e-01

Pos logit contributions
 etiquette+1.024
atar+1.005
 aquatic+0.998
 atmosp+0.951
edia+0.949
Neg logit contributions
���-1.050
龍�-0.995
 fixme-0.907
 Written-0.897
-0.807
TokenSpace
Token ID14106
Feature activation-8.65e-02

Pos logit contributions
 etiquette+0.964
 aquatic+0.924
atar+0.919
 atmosp+0.853
edia+0.838
Neg logit contributions
���-2.310
龍�-2.239
 fixme-2.102
 Written-2.084
-1.939
Token I
Token ID314
Feature activation-2.19e-01

Pos logit contributions
���+0.450
龍�+0.434
 fixme+0.398
 Written+0.397
+0.359
Neg logit contributions
 etiquette-0.360
 aquatic-0.350
atar-0.350
 atmosp-0.331
edia-0.330
Token was
Token ID373
Feature activation+4.34e-02

Pos logit contributions
���+0.607
龍�+0.581
 fixme+0.530
 Written+0.527
+0.466
Neg logit contributions
 etiquette-0.589
 aquatic-0.572
atar-0.569
 atmosp-0.545
edia-0.544
Token a
Token ID257
Feature activation+9.77e-02

Pos logit contributions
 etiquette+0.103
atar+0.099
 aquatic+0.099
 atmosp+0.094
edia+0.094
Neg logit contributions
���-0.133
龍�-0.128
 Written-0.118
 fixme-0.118
-0.105
Token pre
Token ID662
Feature activation+2.10e-02

Pos logit contributions
 etiquette+0.224
atar+0.217
 aquatic+0.216
 atmosp+0.204
edia+0.204
Neg logit contributions
���-0.307
龍�-0.298
 Written-0.274
 fixme-0.273
-0.247
Tokenteen
Token ID7821
Feature activation+2.78e-01

Pos logit contributions
 etiquette+0.051
atar+0.050
 aquatic+0.049
 atmosp+0.047
edia+0.047
Neg logit contributions
���-0.064
龍�-0.061
 fixme-0.056
 Written-0.056
-0.051
Token.
Token ID13
Feature activation-5.02e-02

Pos logit contributions
 etiquette+0.616
 aquatic+0.595
atar+0.593
 atmosp+0.564
edia+0.561
Neg logit contributions
���-0.894
龍�-0.866
 fixme-0.799
 Written-0.795
-0.724
Token
Token ID447
Feature activation+4.24e-02

Pos logit contributions
���+0.164
龍�+0.159
 fixme+0.147
 Written+0.147
+0.133
Neg logit contributions
 etiquette-0.110
 aquatic-0.107
atar-0.105
 atmosp-0.100
edia-0.099
Token
Token ID251
Feature activation+1.21e-01

Pos logit contributions
 etiquette+0.158
atar+0.155
 aquatic+0.155
 atmosp+0.150
edia+0.149
Neg logit contributions
���-0.073
龍�-0.068
 fixme-0.058
 Written-0.057
-0.046
TokenWith
Token ID3152
Feature activation-1.69e-01

Pos logit contributions
 etiquette+0.290
 aquatic+0.282
atar+0.281
 atmosp+0.268
edia+0.266
Neg logit contributions
���-0.370
龍�-0.354
 fixme-0.326
 Written-0.324
-0.294
Token all
Token ID477
Feature activation-8.82e-02

Pos logit contributions
���+0.496
龍�+0.475
 Written+0.435
 fixme+0.434
+0.392
Neg logit contributions
 etiquette-0.420
atar-0.411
 aquatic-0.410
edia-0.388
 atmosp-0.387
Token the
Token ID262
Feature activation+1.79e-01

Pos logit contributions
���+0.254
龍�+0.242
 fixme+0.222
 Written+0.220
+0.198
Neg logit contributions
 etiquette-0.226
 aquatic-0.220
atar-0.220
edia-0.210
 atmosp-0.210
Token House
Token ID2097
Feature activation-3.90e-01

Pos logit contributions
���+0.231
龍�+0.220
 fixme+0.204
 Written+0.203
+0.182
Neg logit contributions
 etiquette-0.180
atar-0.176
 aquatic-0.175
edia-0.166
 atmosp-0.166
Token of
Token ID286
Feature activation-1.68e-01

Pos logit contributions
���+1.172
龍�+1.118
 fixme+1.030
 Written+1.021
+0.922
Neg logit contributions
 etiquette-0.952
atar-0.929
 aquatic-0.929
 atmosp-0.882
edia-0.878
Token Representatives
Token ID17132
Feature activation-9.20e-02

Pos logit contributions
���+0.314
龍�+0.291
 fixme+0.248
 Written+0.242
+0.206
Neg logit contributions
 etiquette-0.607
 aquatic-0.596
atar-0.595
 atmosp-0.577
edia-0.570
Token (
Token ID357
Feature activation+1.03e-01

Pos logit contributions
���+0.286
龍�+0.273
 fixme+0.252
 Written+0.250
+0.227
Neg logit contributions
 etiquette-0.216
atar-0.210
 aquatic-0.210
 atmosp-0.199
edia-0.198
Tokenthe
Token ID1169
Feature activation-3.29e-04

Pos logit contributions
 etiquette+0.263
 aquatic+0.257
atar+0.256
 atmosp+0.245
edia+0.242
Neg logit contributions
���-0.299
龍�-0.286
 fixme-0.262
 Written-0.260
-0.234
Token lower
Token ID2793
Feature activation-1.88e-01

Pos logit contributions
���+0.001
龍�+0.001
 fixme+0.001
 Written+0.001
+0.001
Neg logit contributions
 etiquette-0.001
atar-0.001
 aquatic-0.001
 atmosp-0.001
edia-0.001
Token house
Token ID2156
Feature activation-5.13e-01

Pos logit contributions
���+0.581
龍�+0.555
 fixme+0.512
 Written+0.507
+0.461
Neg logit contributions
 etiquette-0.444
atar-0.432
 aquatic-0.431
 atmosp-0.409
edia-0.406
Token)
Token ID8
Feature activation+2.55e-02

Pos logit contributions
���+1.613
龍�+1.547
 fixme+1.433
 Written+1.423
+1.289
Neg logit contributions
 etiquette-1.166
atar-1.140
 aquatic-1.135
 atmosp-1.080
edia-1.071
Token has
Token ID468
Feature activation+1.08e-01

Pos logit contributions
 etiquette+0.061
atar+0.059
 aquatic+0.059
 atmosp+0.056
edia+0.056
Neg logit contributions
���-0.079
龍�-0.075
 fixme-0.069
 Written-0.069
-0.062
Token 150
Token ID6640
Feature activation-2.06e-01

Pos logit contributions
 etiquette+0.266
atar+0.261
 aquatic+0.259
edia+0.247
 atmosp+0.246
Neg logit contributions
���-0.322
龍�-0.307
 fixme-0.285
 Written-0.283
-0.252
Token members
Token ID1866
Feature activation-3.21e-01

Pos logit contributions
���+0.673
龍�+0.649
 fixme+0.603
 Written+0.597
+0.545
Neg logit contributions
 etiquette-0.444
atar-0.431
 aquatic-0.431
 atmosp-0.405
edia-0.405
Token kilometers
Token ID18212
Feature activation-9.63e-02

Pos logit contributions
 etiquette+0.472
atar+0.459
 aquatic+0.455
edia+0.432
 atmosp+0.430
Neg logit contributions
���-0.612
龍�-0.588
 fixme-0.540
 Written-0.536
-0.489
Token.
Token ID13
Feature activation-1.39e-01

Pos logit contributions
���+0.295
龍�+0.284
 fixme+0.264
 Written+0.263
+0.237
Neg logit contributions
 etiquette-0.228
atar-0.219
 aquatic-0.219
 atmosp-0.209
edia-0.207
Token And
Token ID843
Feature activation+1.04e-01

Pos logit contributions
���+0.388
龍�+0.375
 fixme+0.344
 Written+0.344
+0.305
Neg logit contributions
 etiquette-0.370
 aquatic-0.357
atar-0.355
 atmosp-0.340
edia-0.338
Token Hel
Token ID5053
Feature activation-1.34e-01

Pos logit contributions
 etiquette+0.259
atar+0.252
 aquatic+0.251
 atmosp+0.239
edia+0.238
Neg logit contributions
���-0.306
龍�-0.293
 fixme-0.270
 Written-0.268
-0.241
Tokenheim
Token ID9096
Feature activation-5.40e-02

Pos logit contributions
���+0.305
龍�+0.296
 fixme+0.266
 Written+0.261
+0.230
Neg logit contributions
 etiquette-0.420
 aquatic-0.410
atar-0.406
 atmosp-0.394
 therap-0.390
Token is
Token ID318
Feature activation+1.44e-01

Pos logit contributions
���+0.159
龍�+0.154
 fixme+0.141
 Written+0.140
+0.127
Neg logit contributions
 etiquette-0.134
atar-0.129
 aquatic-0.129
 atmosp-0.123
edia-0.123
Token just
Token ID655
Feature activation+3.23e-02

Pos logit contributions
 etiquette+0.332
atar+0.321
 aquatic+0.320
 atmosp+0.305
edia+0.304
Neg logit contributions
���-0.451
龍�-0.432
 fixme-0.399
 Written-0.397
-0.361
Token one
Token ID530
Feature activation+3.35e-01

Pos logit contributions
 etiquette+0.075
atar+0.073
 aquatic+0.072
edia+0.069
 atmosp+0.069
Neg logit contributions
���-0.101
龍�-0.097
 fixme-0.089
 Written-0.089
-0.081
Token of
Token ID286
Feature activation+1.07e-01

Pos logit contributions
 etiquette+0.811
atar+0.789
 aquatic+0.786
 atmosp+0.749
edia+0.745
Neg logit contributions
���-1.019
龍�-0.976
 fixme-0.900
 Written-0.889
-0.809
Token many
Token ID867
Feature activation+2.40e-01

Pos logit contributions
 etiquette+0.250
atar+0.242
 aquatic+0.242
 atmosp+0.230
edia+0.229
Neg logit contributions
���-0.332
龍�-0.318
 fixme-0.294
 Written-0.291
-0.265
Token Greenland
Token ID30155
Feature activation+4.31e-02

Pos logit contributions
 etiquette+0.510
atar+0.493
 aquatic+0.490
 atmosp+0.463
edia+0.463
Neg logit contributions
���-0.794
龍�-0.767
 fixme-0.712
 Written-0.705
-0.647
Token like
Token ID588
Feature activation+1.82e-01

Pos logit contributions
���+0.038
龍�+0.036
 Written+0.034
 fixme+0.034
+0.030
Neg logit contributions
 etiquette-0.030
atar-0.030
 aquatic-0.029
edia-0.028
 atmosp-0.028
Token,
Token ID11
Feature activation+1.05e-01

Pos logit contributions
 etiquette+0.392
atar+0.379
 aquatic+0.379
 atmosp+0.359
edia+0.357
Neg logit contributions
���-0.602
龍�-0.578
 fixme-0.537
 Written-0.534
-0.488
Token
Token ID564
Feature activation+2.95e-01

Pos logit contributions
 etiquette+0.253
atar+0.246
 aquatic+0.245
 atmosp+0.234
edia+0.233
Neg logit contributions
���-0.321
龍�-0.307
 fixme-0.284
 Written-0.282
-0.255
Token
Token ID250
Feature activation+7.07e-02

Pos logit contributions
 etiquette+1.101
 aquatic+1.082
atar+1.081
 atmosp+1.044
edia+1.040
Neg logit contributions
���-0.511
龍�-0.469
 fixme-0.400
 Written-0.392
-0.321
TokenOh
Token ID5812
Feature activation+2.21e-01

Pos logit contributions
 etiquette+0.180
 aquatic+0.176
atar+0.175
 atmosp+0.168
edia+0.166
Neg logit contributions
���-0.204
龍�-0.196
 fixme-0.180
 Written-0.179
-0.161
Token my
Token ID616
Feature activation-1.16e-01

Pos logit contributions
 etiquette+0.501
atar+0.486
 aquatic+0.485
 atmosp+0.461
edia+0.457
Neg logit contributions
���-0.699
龍�-0.670
 fixme-0.622
 Written-0.618
-0.564
Token god
Token ID5770
Feature activation+1.11e-01

Pos logit contributions
���+0.377
龍�+0.363
 fixme+0.339
 Written+0.339
+0.308
Neg logit contributions
 etiquette-0.249
atar-0.244
 aquatic-0.241
edia-0.231
 atmosp-0.229
Token,
Token ID11
Feature activation-1.94e-02

Pos logit contributions
 etiquette+0.247
 aquatic+0.240
atar+0.239
 atmosp+0.227
edia+0.225
Neg logit contributions
���-0.354
龍�-0.339
 fixme-0.315
 Written-0.313
-0.286
Token he
Token ID339
Feature activation-7.02e-02

Pos logit contributions
���+0.057
龍�+0.055
 fixme+0.051
 Written+0.051
+0.046
Neg logit contributions
 etiquette-0.048
atar-0.046
 aquatic-0.046
 atmosp-0.044
edia-0.044
Token thinks
Token ID6834
Feature activation-7.65e-02

Pos logit contributions
���+0.228
龍�+0.220
 fixme+0.204
 Written+0.203
+0.185
Neg logit contributions
 etiquette-0.153
 aquatic-0.148
atar-0.148
 atmosp-0.140
edia-0.139
Token I
Token ID314
Feature activation-3.05e-01

Pos logit contributions
���+0.232
龍�+0.221
 fixme+0.204
 Written+0.203
+0.184
Neg logit contributions
 etiquette-0.185
 aquatic-0.180
atar-0.180
 atmosp-0.171
edia-0.170
Token about
Token ID546
Feature activation+1.87e-02

Pos logit contributions
���+0.831
龍�+0.795
 fixme+0.732
 Written+0.731
+0.655
Neg logit contributions
 etiquette-0.709
atar-0.691
 aquatic-0.690
 atmosp-0.659
edia-0.656
Token the
Token ID262
Feature activation-3.27e-02

Pos logit contributions
 etiquette+0.049
atar+0.048
 aquatic+0.048
 atmosp+0.045
edia+0.045
Neg logit contributions
���-0.053
龍�-0.050
 fixme-0.046
 Written-0.046
-0.041
Token series
Token ID2168
Feature activation-1.96e-01

Pos logit contributions
���+0.101
龍�+0.097
 fixme+0.089
 Written+0.089
+0.080
Neg logit contributions
 etiquette-0.076
atar-0.075
 aquatic-0.074
edia-0.070
 atmosp-0.070
Token'
Token ID6
Feature activation-4.47e-01

Pos logit contributions
���+0.575
龍�+0.550
 Written+0.505
 fixme+0.505
+0.453
Neg logit contributions
 etiquette-0.486
atar-0.476
 aquatic-0.475
edia-0.451
 atmosp-0.450
Token future
Token ID2003
Feature activation-4.18e-01

Pos logit contributions
���+1.380
龍�+1.319
 Written+1.248
 fixme+1.235
+1.107
Neg logit contributions
atar-1.007
 etiquette-1.003
 aquatic-0.979
edia-0.945
 atmosp-0.920
Token.
Token ID13
Feature activation-2.24e-01

Pos logit contributions
���+1.303
龍�+1.259
 Written+1.179
 fixme+1.171
+1.058
Neg logit contributions
 etiquette-0.930
atar-0.917
 aquatic-0.912
edia-0.861
 atmosp-0.850
Token
Token ID198
Feature activation-5.06e-02

Pos logit contributions
���+0.617
龍�+0.591
 fixme+0.542
 Written+0.541
+0.479
Neg logit contributions
 etiquette-0.605
 aquatic-0.589
atar-0.588
 atmosp-0.563
edia-0.557
Token
Token ID198
Feature activation+1.17e-02

Pos logit contributions
���+0.137
龍�+0.132
 fixme+0.120
 Written+0.119
+0.106
Neg logit contributions
 etiquette-0.138
 aquatic-0.135
atar-0.135
 atmosp-0.129
edia-0.128
TokenCOM
Token ID9858
Feature activation+5.63e-02

Pos logit contributions
 etiquette+0.030
 aquatic+0.029
atar+0.029
 atmosp+0.028
edia+0.027
Neg logit contributions
���-0.034
龍�-0.033
 fixme-0.030
 Written-0.030
-0.027
TokenIC
Token ID2149
Feature activation+1.65e-01

Pos logit contributions
 etiquette+0.129
 aquatic+0.125
atar+0.125
 atmosp+0.118
edia+0.116
Neg logit contributions
���-0.178
龍�-0.172
 fixme-0.158
 Written-0.158
-0.143
Token V
Token ID569
Feature activation+1.29e-01

Pos logit contributions
 etiquette+0.426
 aquatic+0.414
atar+0.413
 atmosp+0.395
edia+0.392
Neg logit contributions
���-0.479
龍�-0.453
 fixme-0.415
 Written-0.408
-0.371
Token readers
Token ID7183
Feature activation+4.47e-01

Pos logit contributions
���+1.482
龍�+1.412
 fixme+1.320
 Written+1.314
+1.190
Neg logit contributions
 etiquette-1.048
atar-1.030
 aquatic-1.021
edia-0.967
 atmosp-0.965
Token don
Token ID836
Feature activation-3.38e-01

Pos logit contributions
 etiquette+0.969
 aquatic+0.939
atar+0.939
 atmosp+0.887
edia+0.878
Neg logit contributions
���-1.473
龍�-1.411
 fixme-1.306
 Written-1.295
-1.186
Token
Token ID447
Feature activation+1.59e-01

Pos logit contributions
龍�+1.159
���+1.150
 fixme+1.082
 Written+1.070
+0.984
Neg logit contributions
 etiquette-0.655
 aquatic-0.625
atar-0.609
 therap-0.598
 atmosp-0.584
Token
Token ID247
Feature activation+1.23e-01

Pos logit contributions
 etiquette+0.520
 aquatic+0.509
atar+0.508
 atmosp+0.490
edia+0.487
Neg logit contributions
���-0.349
龍�-0.327
 fixme-0.291
 Written-0.287
-0.248
Tokent
Token ID83
Feature activation+3.60e-02

Pos logit contributions
 etiquette+0.335
 aquatic+0.327
atar+0.325
 atmosp+0.314
edia+0.309
Neg logit contributions
���-0.334
龍�-0.319
 fixme-0.291
 Written-0.288
-0.258
Token love
Token ID1842
Feature activation-6.25e-01

Pos logit contributions
 etiquette+0.082
atar+0.080
 aquatic+0.080
 atmosp+0.076
edia+0.076
Neg logit contributions
���-0.113
龍�-0.108
 fixme-0.101
 Written-0.100
-0.090
Token Arduino
Token ID27634
Feature activation+1.43e-01

Pos logit contributions
���+1.716
龍�+1.639
 fixme+1.513
 Written+1.502
+1.330
Neg logit contributions
 etiquette-1.656
atar-1.641
 aquatic-1.610
edia-1.552
 atmosp-1.545
Token.
Token ID13
Feature activation-9.09e-02

Pos logit contributions
 etiquette+0.324
 aquatic+0.316
atar+0.316
 atmosp+0.298
edia+0.298
Neg logit contributions
���-0.452
龍�-0.435
 fixme-0.403
 Written-0.400
-0.364
Token As
Token ID1081
Feature activation-1.54e-01

Pos logit contributions
���+0.262
龍�+0.251
 fixme+0.232
 Written+0.231
+0.206
Neg logit contributions
 etiquette-0.233
 aquatic-0.227
atar-0.226
 atmosp-0.216
edia-0.214
Token a
Token ID257
Feature activation+5.23e-02

Pos logit contributions
���+0.471
龍�+0.450
 fixme+0.418
 Written+0.417
+0.375
Neg logit contributions
 etiquette-0.365
atar-0.356
 aquatic-0.354
 atmosp-0.339
edia-0.338
Token long
Token ID890
Feature activation+1.64e-02

Pos logit contributions
 etiquette+0.115
atar+0.114
 aquatic+0.112
 atmosp+0.107
edia+0.106
Neg logit contributions
���-0.169
龍�-0.161
 fixme-0.151
 Written-0.150
-0.136
Token he
Token ID339
Feature activation-2.19e-01

Pos logit contributions
 etiquette+0.136
atar+0.133
 aquatic+0.132
 atmosp+0.126
edia+0.125
Neg logit contributions
���-0.179
龍�-0.171
 fixme-0.158
 Written-0.157
-0.142
Token
Token ID447
Feature activation-7.20e-02

Pos logit contributions
���+0.710
龍�+0.681
 fixme+0.630
 Written+0.627
+0.571
Neg logit contributions
 etiquette-0.482
 aquatic-0.465
atar-0.463
edia-0.438
 atmosp-0.438
Token
Token ID247
Feature activation-1.27e-01

Pos logit contributions
���+0.158
龍�+0.149
 fixme+0.132
 Written+0.130
+0.113
Neg logit contributions
 etiquette-0.235
 aquatic-0.230
atar-0.230
 atmosp-0.222
edia-0.220
Tokend
Token ID67
Feature activation-1.40e-01

Pos logit contributions
���+0.388
龍�+0.374
 fixme+0.343
 Written+0.341
+0.309
Neg logit contributions
 etiquette-0.304
 aquatic-0.296
atar-0.293
 atmosp-0.280
edia-0.277
Token been
Token ID587
Feature activation-1.66e-01

Pos logit contributions
���+0.444
龍�+0.425
 fixme+0.392
 Written+0.391
+0.354
Neg logit contributions
 etiquette-0.320
 aquatic-0.310
atar-0.309
 atmosp-0.294
edia-0.293
Token sent
Token ID1908
Feature activation-5.87e-01

Pos logit contributions
���+0.517
龍�+0.492
 fixme+0.455
 Written+0.454
+0.411
Neg logit contributions
 etiquette-0.386
atar-0.375
 aquatic-0.374
 atmosp-0.355
edia-0.354
Token by
Token ID416
Feature activation-3.60e-01

Pos logit contributions
���+1.762
龍�+1.674
 Written+1.563
 fixme+1.558
+1.413
Neg logit contributions
 etiquette-1.395
atar-1.365
 aquatic-1.355
 atmosp-1.300
edia-1.298
Token central
Token ID4318
Feature activation+8.70e-02

Pos logit contributions
���+1.083
龍�+1.028
 fixme+0.950
 Written+0.948
+0.858
Neg logit contributions
 etiquette-0.861
atar-0.850
 aquatic-0.842
 atmosp-0.805
edia-0.796
Token casting
Token ID13092
Feature activation+6.67e-03

Pos logit contributions
 etiquette+0.200
atar+0.194
 aquatic+0.193
 atmosp+0.184
edia+0.183
Neg logit contributions
���-0.271
龍�-0.260
 fixme-0.242
 Written-0.239
-0.219
Token to
Token ID284
Feature activation-8.48e-02

Pos logit contributions
 etiquette+0.015
 aquatic+0.015
atar+0.015
 atmosp+0.014
edia+0.014
Neg logit contributions
���-0.021
龍�-0.020
 fixme-0.019
 Written-0.019
-0.017
Token play
Token ID711
Feature activation+1.22e-01

Pos logit contributions
���+0.262
龍�+0.249
 fixme+0.230
 Written+0.229
+0.208
Neg logit contributions
 etiquette-0.201
atar-0.196
 aquatic-0.195
 atmosp-0.186
edia-0.183
Token or
Token ID393
Feature activation+8.77e-02

Pos logit contributions
���+0.045
龍�+0.043
 fixme+0.039
 Written+0.039
+0.035
Neg logit contributions
 etiquette-0.036
atar-0.035
 aquatic-0.035
 atmosp-0.033
edia-0.033
Token with
Token ID351
Feature activation+1.08e-01

Pos logit contributions
 etiquette+0.211
atar+0.205
 aquatic+0.205
 atmosp+0.195
edia+0.193
Neg logit contributions
���-0.268
龍�-0.256
 fixme-0.236
 Written-0.234
-0.211
Token your
Token ID534
Feature activation-1.66e-01

Pos logit contributions
 etiquette+0.255
atar+0.248
 aquatic+0.247
 atmosp+0.235
edia+0.233
Neg logit contributions
���-0.334
龍�-0.319
 fixme-0.294
 Written-0.292
-0.265
Token friends
Token ID2460
Feature activation+3.52e-02

Pos logit contributions
���+0.510
龍�+0.488
 fixme+0.450
 Written+0.448
+0.404
Neg logit contributions
 etiquette-0.392
atar-0.383
 aquatic-0.380
 atmosp-0.363
edia-0.361
Token or
Token ID393
Feature activation-1.69e-02

Pos logit contributions
 etiquette+0.077
 aquatic+0.074
atar+0.074
 atmosp+0.070
edia+0.070
Neg logit contributions
���-0.115
龍�-0.111
 fixme-0.102
 Written-0.102
-0.093
Token roomm
Token ID26255
Feature activation-6.47e-01

Pos logit contributions
���+0.049
龍�+0.047
 fixme+0.043
 Written+0.043
+0.038
Neg logit contributions
 etiquette-0.043
atar-0.042
 aquatic-0.042
 atmosp-0.040
edia-0.040
Tokenates
Token ID689
Feature activation-3.79e-02

Pos logit contributions
���+1.674
龍�+1.614
 fixme+1.470
 Written+1.454
+1.278
Neg logit contributions
 etiquette-1.822
 aquatic-1.762
atar-1.716
 therap-1.692
 atmosp-1.689
Token,
Token ID11
Feature activation+1.07e-01

Pos logit contributions
���+0.122
龍�+0.117
 fixme+0.108
 Written+0.108
+0.098
Neg logit contributions
 etiquette-0.085
 aquatic-0.082
atar-0.082
 atmosp-0.078
edia-0.077
Token for
Token ID329
Feature activation+1.60e-01

Pos logit contributions
 etiquette+0.240
atar+0.234
 aquatic+0.233
 atmosp+0.221
edia+0.219
Neg logit contributions
���-0.342
龍�-0.328
 fixme-0.304
 Written-0.302
-0.274
Token example
Token ID1672
Feature activation-2.12e-01

Pos logit contributions
 etiquette+0.365
atar+0.356
 aquatic+0.355
 atmosp+0.336
edia+0.333
Neg logit contributions
���-0.509
龍�-0.487
 fixme-0.450
 Written-0.447
-0.407
Token.
Token ID13
Feature activation+7.17e-02

Pos logit contributions
���+0.692
龍�+0.664
 fixme+0.614
 Written+0.612
+0.558
Neg logit contributions
 etiquette-0.463
 aquatic-0.450
atar-0.447
 atmosp-0.424
edia-0.420
Token Both
Token ID5747
Feature activation+4.19e-01

Pos logit contributions
���+0.131
龍�+0.125
 fixme+0.115
 Written+0.114
+0.102
Neg logit contributions
 etiquette-0.124
 aquatic-0.121
atar-0.121
 atmosp-0.116
edia-0.115
Token of
Token ID286
Feature activation+2.45e-01

Pos logit contributions
 etiquette+1.084
 aquatic+1.052
atar+1.050
 atmosp+1.006
edia+0.991
Neg logit contributions
���-1.215
龍�-1.157
 fixme-1.054
 Written-1.038
-0.944
Token their
Token ID511
Feature activation+1.24e-01

Pos logit contributions
 etiquette+0.593
 aquatic+0.576
atar+0.576
 atmosp+0.548
edia+0.543
Neg logit contributions
���-0.747
龍�-0.713
 fixme-0.656
 Written-0.649
-0.590
Token lives
Token ID3160
Feature activation+1.52e-01

Pos logit contributions
 etiquette+0.261
atar+0.255
 aquatic+0.254
edia+0.239
 atmosp+0.238
Neg logit contributions
���-0.413
龍�-0.396
 fixme-0.368
 Written-0.366
-0.334
Token have
Token ID423
Feature activation+2.54e-02

Pos logit contributions
 etiquette+0.340
atar+0.330
 aquatic+0.329
 atmosp+0.311
edia+0.309
Neg logit contributions
���-0.490
龍�-0.469
 fixme-0.434
 Written-0.430
-0.393
Token undergone
Token ID29900
Feature activation-7.05e-01

Pos logit contributions
 etiquette+0.054
atar+0.053
 aquatic+0.053
edia+0.050
 atmosp+0.049
Neg logit contributions
���-0.084
龍�-0.080
 fixme-0.074
 Written-0.074
-0.068
Token a
Token ID257
Feature activation-1.24e-01

Pos logit contributions
���+2.172
龍�+2.071
 fixme+1.923
 Written+1.921
+1.724
Neg logit contributions
 etiquette-1.626
atar-1.596
 aquatic-1.587
edia-1.532
 atmosp-1.493
Token massive
Token ID4858
Feature activation+6.95e-02

Pos logit contributions
���+0.388
龍�+0.369
 fixme+0.342
 Written+0.340
+0.308
Neg logit contributions
 etiquette-0.288
atar-0.282
 aquatic-0.281
 atmosp-0.266
edia-0.265
Token upheaval
Token ID46907
Feature activation+1.17e-02

Pos logit contributions
 etiquette+0.142
atar+0.138
 aquatic+0.137
 atmosp+0.129
edia+0.128
Neg logit contributions
���-0.237
龍�-0.227
 fixme-0.212
 Written-0.211
-0.193
Token,
Token ID11
Feature activation-5.23e-02

Pos logit contributions
 etiquette+0.027
 aquatic+0.027
atar+0.027
 atmosp+0.025
edia+0.025
Neg logit contributions
���-0.036
龍�-0.035
 fixme-0.032
 Written-0.032
-0.029
Token and
Token ID290
Feature activation-4.38e-02

Pos logit contributions
���+0.164
龍�+0.157
 Written+0.146
 fixme+0.146
+0.132
Neg logit contributions
 etiquette-0.119
atar-0.116
 aquatic-0.116
edia-0.110
 atmosp-0.109
Token.
Token ID13
Feature activation+1.93e-02

Pos logit contributions
 etiquette+0.207
atar+0.203
 aquatic+0.201
 atmosp+0.191
edia+0.191
Neg logit contributions
���-0.262
龍�-0.251
 fixme-0.231
 Written-0.228
-0.207
Token
Token ID198
Feature activation-1.66e-02

Pos logit contributions
 etiquette+0.051
atar+0.049
 aquatic+0.049
 atmosp+0.047
edia+0.047
Neg logit contributions
���-0.055
龍�-0.053
 fixme-0.048
 Written-0.047
-0.043
Token
Token ID198
Feature activation-1.17e-02

Pos logit contributions
���+0.044
龍�+0.042
 fixme+0.039
 Written+0.038
+0.034
Neg logit contributions
 etiquette-0.046
 aquatic-0.045
atar-0.045
 atmosp-0.043
edia-0.042
TokenIn
Token ID818
Feature activation+2.35e-01

Pos logit contributions
���+0.034
龍�+0.033
 fixme+0.030
 Written+0.030
+0.027
Neg logit contributions
 etiquette-0.029
 aquatic-0.029
atar-0.028
 atmosp-0.027
edia-0.027
Token some
Token ID617
Feature activation-1.65e-01

Pos logit contributions
 etiquette+0.563
atar+0.547
 aquatic+0.546
 atmosp+0.519
edia+0.515
Neg logit contributions
���-0.723
龍�-0.690
 fixme-0.636
 Written-0.630
-0.572
Token cases
Token ID2663
Feature activation-5.56e-01

Pos logit contributions
���+0.564
龍�+0.544
 fixme+0.510
 Written+0.505
+0.461
Neg logit contributions
 etiquette-0.328
atar-0.321
 aquatic-0.313
edia-0.300
 atmosp-0.293
Token Hydro
Token ID32116
Feature activation-1.24e-01

Pos logit contributions
���+1.717
龍�+1.640
 fixme+1.522
 Written+1.503
+1.350
Neg logit contributions
 etiquette-1.290
atar-1.261
 aquatic-1.241
edia-1.187
 atmosp-1.179
Token has
Token ID468
Feature activation+3.75e-02

Pos logit contributions
���+0.376
龍�+0.358
 fixme+0.331
 Written+0.328
+0.296
Neg logit contributions
 etiquette-0.300
atar-0.292
 aquatic-0.291
 atmosp-0.277
edia-0.275
Token difficulty
Token ID8722
Feature activation+1.77e-01

Pos logit contributions
 etiquette+0.084
atar+0.083
 aquatic+0.081
edia+0.078
 atmosp+0.077
Neg logit contributions
���-0.119
龍�-0.113
 fixme-0.106
 Written-0.105
-0.094
Token retaining
Token ID26645
Feature activation-3.51e-02

Pos logit contributions
 etiquette+0.445
atar+0.434
 aquatic+0.432
edia+0.412
 atmosp+0.411
Neg logit contributions
���-0.518
龍�-0.493
 fixme-0.455
 Written-0.450
-0.405
Token workers
Token ID3259
Feature activation-5.18e-03

Pos logit contributions
���+0.105
龍�+0.099
 fixme+0.092
 Written+0.092
+0.082
Neg logit contributions
 etiquette-0.085
atar-0.083
 aquatic-0.082
edia-0.079
 atmosp-0.079
Token United
Token ID1578
Feature activation+2.93e-01

Pos logit contributions
 etiquette+0.026
 aquatic+0.025
atar+0.025
 atmosp+0.023
edia+0.023
Neg logit contributions
���-0.054
龍�-0.053
 fixme-0.049
 Written-0.049
-0.045
Token States
Token ID1829
Feature activation+1.76e-01

Pos logit contributions
 etiquette+0.482
atar+0.463
 aquatic+0.462
 atmosp+0.427
edia+0.422
Neg logit contributions
���-1.124
龍�-1.082
 fixme-1.015
 Written-1.006
-0.936
Token 6
Token ID718
Feature activation+1.49e-01

Pos logit contributions
 etiquette+0.433
 aquatic+0.421
atar+0.421
 atmosp+0.401
edia+0.397
Neg logit contributions
���-0.526
龍�-0.506
 fixme-0.466
 Written-0.463
-0.418
Token12
Token ID1065
Feature activation-1.60e-01

Pos logit contributions
 etiquette+0.391
atar+0.381
 aquatic+0.381
 atmosp+0.363
edia+0.361
Neg logit contributions
���-0.427
龍�-0.405
 fixme-0.371
 Written-0.366
-0.330
Token Posts
Token ID12043
Feature activation-2.04e+00

Pos logit contributions
���+0.385
龍�+0.359
 fixme+0.321
 Written+0.315
+0.278
Neg logit contributions
 etiquette-0.493
atar-0.484
 aquatic-0.483
 atmosp-0.464
edia-0.461
Token #
Token ID1303
Feature activation-1.48e+00

Pos logit contributions
���+4.921
龍�+4.608
 Written+4.313
 fixme+4.146
EStream+3.708
Neg logit contributions
 etiquette-5.873
atar-5.772
 aquatic-5.754
 atmosp-5.567
edia-5.530
Token17
Token ID1558
Feature activation-1.58e-01

Pos logit contributions
龍�+3.400
���+3.133
 Written+3.112
 fixme+3.105
written+2.679
Neg logit contributions
 etiquette-4.469
 therap-4.418
 aquatic-4.309
 atmosp-4.213
atar-4.182
Token Good
Token ID4599
Feature activation+2.69e-01

Pos logit contributions
���+0.467
龍�+0.446
 fixme+0.414
 Written+0.412
+0.368
Neg logit contributions
 etiquette-0.392
atar-0.381
 aquatic-0.380
 atmosp-0.363
edia-0.361
Token to
Token ID284
Feature activation+9.81e-02

Pos logit contributions
 etiquette+0.709
 aquatic+0.690
atar+0.689
 atmosp+0.659
edia+0.653
Neg logit contributions
���-0.762
龍�-0.724
 fixme-0.661
 Written-0.653
-0.589
Token hear
Token ID3285
Feature activation+2.15e-01

Pos logit contributions
 etiquette+0.233
atar+0.226
 aquatic+0.226
 atmosp+0.215
edia+0.213
Neg logit contributions
���-0.304
龍�-0.290
 fixme-0.268
 Written-0.265
-0.241
Token.
Token ID13
Feature activation-1.21e-01

Pos logit contributions
 etiquette+0.523
 aquatic+0.508
atar+0.508
 atmosp+0.483
edia+0.479
Neg logit contributions
���-0.654
龍�-0.624
 fixme-0.574
 Written-0.568
-0.516
Token 2012
Token ID2321
Feature activation+2.00e-01

Pos logit contributions
���+0.379
龍�+0.369
 Written+0.339
 fixme+0.337
+0.293
Neg logit contributions
 etiquette-0.440
 aquatic-0.429
atar-0.429
 therap-0.411
 atmosp-0.410
Token Canada
Token ID3340
Feature activation+2.94e-01

Pos logit contributions
 etiquette+0.348
 aquatic+0.335
atar+0.332
 atmosp+0.311
edia+0.306
Neg logit contributions
���-0.741
龍�-0.717
 fixme-0.671
 Written-0.668
-0.616
Token 16
Token ID1467
Feature activation+3.59e-01

Pos logit contributions
 etiquette+0.720
 aquatic+0.700
atar+0.699
 atmosp+0.665
edia+0.659
Neg logit contributions
���-0.888
龍�-0.850
 fixme-0.782
 Written-0.776
-0.702
Token10
Token ID940
Feature activation-6.77e-02

Pos logit contributions
 etiquette+0.927
atar+0.906
 aquatic+0.903
 atmosp+0.862
edia+0.856
Neg logit contributions
���-1.043
龍�-0.985
 fixme-0.901
 Written-0.890
-0.804
Token Posts
Token ID12043
Feature activation-2.14e+00

Pos logit contributions
���+0.161
龍�+0.150
 fixme+0.134
 Written+0.131
+0.116
Neg logit contributions
 etiquette-0.211
atar-0.207
 aquatic-0.206
 atmosp-0.198
edia-0.197
Token #
Token ID1303
Feature activation-1.49e+00

Pos logit contributions
���+5.107
龍�+4.761
 Written+4.492
 fixme+4.268
EStream+3.840
Neg logit contributions
 etiquette-6.184
atar-6.092
 aquatic-6.065
 atmosp-5.882
edia-5.839
Token14
Token ID1415
Feature activation-6.51e-02

Pos logit contributions
龍�+3.651
 Written+3.485
 fixme+3.415
���+3.158
written+3.097
Neg logit contributions
 therap-4.163
 etiquette-4.162
 aquatic-4.042
 advoc-3.911
 charism-3.909
Token On
Token ID1550
Feature activation-5.40e-01

Pos logit contributions
���+0.193
龍�+0.184
 fixme+0.171
 Written+0.170
+0.152
Neg logit contributions
 etiquette-0.161
atar-0.156
 aquatic-0.156
 atmosp-0.149
edia-0.148
Token September
Token ID2693
Feature activation-4.63e-01

Pos logit contributions
���+1.394
龍�+1.369
 Written+1.286
 fixme+1.278
+1.098
Neg logit contributions
 etiquette-1.463
atar-1.431
 aquatic-1.422
 therap-1.378
 atmosp-1.357
Token 20
Token ID1160
Feature activation-2.17e-01

Pos logit contributions
���+1.399
龍�+1.333
 fixme+1.227
 Written+1.218
+1.103
Neg logit contributions
 etiquette-1.131
atar-1.101
 aquatic-1.100
 atmosp-1.046
edia-1.036
Token 2013
Token ID2211
Feature activation-3.31e-01

Pos logit contributions
���+0.580
龍�+0.571
 Written+0.540
 fixme+0.539
+0.467
Neg logit contributions
 etiquette-0.574
 aquatic-0.557
atar-0.556
 therap-0.539
 atmosp-0.531
Token 2002
Token ID6244
Feature activation+3.47e-01

Pos logit contributions
���+0.765
龍�+0.739
 fixme+0.677
 Written+0.676
+0.591
Neg logit contributions
 etiquette-0.894
atar-0.868
 aquatic-0.866
 atmosp-0.831
edia-0.828
Token United
Token ID1578
Feature activation+1.81e-01

Pos logit contributions
 etiquette+0.615
 aquatic+0.591
atar+0.591
 atmosp+0.550
edia+0.544
Neg logit contributions
���-1.286
龍�-1.237
 fixme-1.156
 Written-1.147
-1.063
Token States
Token ID1829
Feature activation+1.07e-01

Pos logit contributions
 etiquette+0.303
 aquatic+0.291
atar+0.291
 atmosp+0.270
edia+0.266
Neg logit contributions
���-0.686
龍�-0.660
 fixme-0.618
 Written-0.613
-0.570
Token 3
Token ID513
Feature activation+2.79e-01

Pos logit contributions
 etiquette+0.265
 aquatic+0.258
atar+0.257
 atmosp+0.245
edia+0.243
Neg logit contributions
���-0.317
龍�-0.305
 fixme-0.280
 Written-0.279
-0.251
Token472
Token ID37856
Feature activation+3.73e-02

Pos logit contributions
 etiquette+0.773
atar+0.756
 aquatic+0.754
 atmosp+0.722
edia+0.718
Neg logit contributions
���-0.752
龍�-0.709
 fixme-0.645
 Written-0.637
-0.569
Token Posts
Token ID12043
Feature activation-2.17e+00

Pos logit contributions
 etiquette+0.116
atar+0.114
 aquatic+0.114
 atmosp+0.109
edia+0.109
Neg logit contributions
���-0.089
龍�-0.082
 fixme-0.073
 Written-0.072
-0.064
Token #
Token ID1303
Feature activation-1.53e+00

Pos logit contributions
���+5.077
龍�+4.749
 Written+4.523
 fixme+4.282
EStream+3.757
Neg logit contributions
 etiquette-6.264
atar-6.208
 aquatic-6.162
edia-5.966
 atmosp-5.954
Token11
Token ID1157
Feature activation-7.44e-02

Pos logit contributions
龍�+3.772
���+3.530
 Written+3.474
 fixme+3.426
+3.018
Neg logit contributions
 etiquette-4.367
 therap-4.307
 aquatic-4.272
 atmosp-4.150
 charism-4.063
Token On
Token ID1550
Feature activation-6.37e-01

Pos logit contributions
���+0.219
龍�+0.209
 fixme+0.194
 Written+0.193
+0.172
Neg logit contributions
 etiquette-0.185
atar-0.180
 aquatic-0.180
 atmosp-0.172
edia-0.171
Token September
Token ID2693
Feature activation-5.00e-01

Pos logit contributions
���+1.591
龍�+1.529
 Written+1.434
 fixme+1.397
+1.239
Neg logit contributions
 etiquette-1.795
atar-1.771
 aquatic-1.754
 atmosp-1.689
edia-1.677
Token 20
Token ID1160
Feature activation-2.22e-01

Pos logit contributions
���+1.344
龍�+1.309
 Written+1.224
 fixme+1.223
+1.077
Neg logit contributions
 etiquette-1.337
 aquatic-1.299
atar-1.297
 therap-1.238
 atmosp-1.236
Token United
Token ID1578
Feature activation+5.77e-02

Pos logit contributions
���+0.016
龍�+0.016
 fixme+0.015
 Written+0.015
+0.014
Neg logit contributions
 etiquette-0.008
 aquatic-0.008
atar-0.008
 atmosp-0.007
edia-0.007
Token States
Token ID1829
Feature activation+3.57e-02

Pos logit contributions
 etiquette+0.096
 aquatic+0.093
atar+0.092
 atmosp+0.086
edia+0.085
Neg logit contributions
���-0.219
龍�-0.211
 fixme-0.197
 Written-0.196
-0.182
Token 10
Token ID838
Feature activation-7.93e-02

Pos logit contributions
 etiquette+0.088
 aquatic+0.086
atar+0.086
 atmosp+0.082
edia+0.081
Neg logit contributions
���-0.106
龍�-0.102
 fixme-0.094
 Written-0.094
-0.084
Token53
Token ID4310
Feature activation-1.02e-01

Pos logit contributions
���+0.221
龍�+0.211
 fixme+0.193
 Written+0.192
+0.172
Neg logit contributions
 etiquette-0.212
 aquatic-0.207
atar-0.206
 atmosp-0.197
edia-0.195
Token Posts
Token ID12043
Feature activation-2.21e+00

Pos logit contributions
���+0.246
龍�+0.226
 fixme+0.202
 Written+0.198
+0.175
Neg logit contributions
 etiquette-0.315
atar-0.310
 aquatic-0.309
 atmosp-0.297
edia-0.295
Token #
Token ID1303
Feature activation-1.61e+00

Pos logit contributions
���+5.272
龍�+4.944
 Written+4.656
 fixme+4.436
EStream+3.992
Neg logit contributions
 etiquette-6.330
atar-6.229
 aquatic-6.200
 atmosp-6.018
edia-5.968
Token15
Token ID1314
Feature activation-2.43e-01

Pos logit contributions
龍�+3.570
 Written+3.308
 fixme+3.238
���+3.116
written+2.936
Neg logit contributions
 therap-4.974
 etiquette-4.896
 aquatic-4.746
 charism-4.670
 atmosp-4.635
Token honestly
Token ID12698
Feature activation-2.04e-01

Pos logit contributions
���+0.717
龍�+0.687
 fixme+0.638
 Written+0.636
+0.567
Neg logit contributions
 etiquette-0.600
atar-0.584
 aquatic-0.582
 atmosp-0.557
edia-0.554
Token if
Token ID611
Feature activation-2.54e-01

Pos logit contributions
���+0.622
龍�+0.597
 fixme+0.554
 Written+0.550
+0.498
Neg logit contributions
 etiquette-0.482
atar-0.470
 aquatic-0.467
 atmosp-0.447
edia-0.442
Token we
Token ID356
Feature activation+9.00e-02

Pos logit contributions
���+0.750
龍�+0.721
 fixme+0.663
 Written+0.659
+0.594
Neg logit contributions
 etiquette-0.623
atar-0.610
 aquatic-0.605
 atmosp-0.579
edia-0.574
Token just
Token ID655
Feature activation+2.16e-01

Pos logit contributions
 etiquette+0.205
 aquatic+0.198
atar+0.198
 atmosp+0.188
edia+0.187
Neg logit contributions
���-0.286
龍�-0.274
 fixme-0.254
 Written-0.252
-0.229
Token 2010
Token ID3050
Feature activation-4.49e-03

Pos logit contributions
���+0.994
龍�+0.968
 Written+0.897
 fixme+0.891
+0.774
Neg logit contributions
 etiquette-1.188
atar-1.155
 aquatic-1.154
 therap-1.110
 atmosp-1.105
Token United
Token ID1578
Feature activation+5.77e-02

Pos logit contributions
���+0.016
龍�+0.016
 fixme+0.015
 Written+0.015
+0.014
Neg logit contributions
 etiquette-0.008
 aquatic-0.008
atar-0.008
 atmosp-0.007
edia-0.007
Token States
Token ID1829
Feature activation+3.57e-02

Pos logit contributions
 etiquette+0.096
 aquatic+0.093
atar+0.092
 atmosp+0.086
edia+0.085
Neg logit contributions
���-0.219
龍�-0.211
 fixme-0.197
 Written-0.196
-0.182
Token 10
Token ID838
Feature activation-7.93e-02

Pos logit contributions
 etiquette+0.088
 aquatic+0.086
atar+0.086
 atmosp+0.082
edia+0.081
Neg logit contributions
���-0.106
龍�-0.102
 fixme-0.094
 Written-0.094
-0.084
Token53
Token ID4310
Feature activation-1.02e-01

Pos logit contributions
���+0.221
龍�+0.211
 fixme+0.193
 Written+0.192
+0.172
Neg logit contributions
 etiquette-0.212
 aquatic-0.207
atar-0.206
 atmosp-0.197
edia-0.195
Token Posts
Token ID12043
Feature activation-2.21e+00

Pos logit contributions
���+0.246
龍�+0.226
 fixme+0.202
 Written+0.198
+0.175
Neg logit contributions
 etiquette-0.315
atar-0.310
 aquatic-0.309
 atmosp-0.297
edia-0.295
Token #
Token ID1303
Feature activation-1.61e+00

Pos logit contributions
���+5.272
龍�+4.944
 Written+4.656
 fixme+4.436
EStream+3.992
Neg logit contributions
 etiquette-6.330
atar-6.229
 aquatic-6.200
 atmosp-6.018
edia-5.968
Token15
Token ID1314
Feature activation-2.43e-01

Pos logit contributions
龍�+3.570
 Written+3.308
 fixme+3.238
���+3.116
written+2.936
Neg logit contributions
 therap-4.974
 etiquette-4.896
 aquatic-4.746
 charism-4.670
 atmosp-4.635
Token honestly
Token ID12698
Feature activation-2.04e-01

Pos logit contributions
���+0.717
龍�+0.687
 fixme+0.638
 Written+0.636
+0.567
Neg logit contributions
 etiquette-0.600
atar-0.584
 aquatic-0.582
 atmosp-0.557
edia-0.554
Token if
Token ID611
Feature activation-2.54e-01

Pos logit contributions
���+0.622
龍�+0.597
 fixme+0.554
 Written+0.550
+0.498
Neg logit contributions
 etiquette-0.482
atar-0.470
 aquatic-0.467
 atmosp-0.447
edia-0.442
Token we
Token ID356
Feature activation+9.00e-02

Pos logit contributions
���+0.750
龍�+0.721
 fixme+0.663
 Written+0.659
+0.594
Neg logit contributions
 etiquette-0.623
atar-0.610
 aquatic-0.605
 atmosp-0.579
edia-0.574