TOP ACTIVATIONS
MAX = 3.868

2012 United States 3 240 Posts # 10 Happy it worked
2012 United States 5 667 Posts # 12 I 'm glad
2011 United States 228 65 Posts # 13 On September 20
controller =" Phone List Ctrl "> < ul > < li
pub struct Own ed < T > { ... }
controller =" Phone List Ctrl "> < ul > < li
June 2009 Germany 247 34 Posts # 20 Nice , happy
independence movement further down the line . The truth
2010 United States 10 53 Posts # 15 honestly if we
TL AD T 235 21 Posts # 18 Good stuff !
June 2012 Canada 16 10 Posts # 14 On September 20
counsel investigating Russia not to ... Trump on Wednesday
2002 United States 3 472 Posts # 11 On September 20
t weet / <|endoftext|> Story highlights Secretary of Defense Chuck Hag
inept offense and OU 's front eventually wearing out against the
OU SE An Av ond ale man returned home last
. Though shows like Or phan Black and The Vampire Di
2011 United States 6 12 Posts # 17 Good to hear
exit his Edmonton - Str ath con a seat . By
View more <|endoftext|> About R acing Apex

MIN ACTIVATIONS
MIN = -3.530

don 't want you t iring yourself babe , now be
to Go Back There Som eday ." In 1984
v ene er of moral rect itude for wars of plunder
. parse _ args ()) # initialize a rectangular and square
-- /// L ett re du bureau de l ' Amb
led immortal eld ritch mon stros ities year after year and
hold out for an obscene sum of money , as they
Sab ine , Chef G .' s , Ep och Fusion
, and D endi st rolled off to entertain ( or
handed Libya over to a patch work of tribal and Al
{ ( 270 s )} Rac 8 { ( 193 s
inv ade our communities pre ying on unsuspecting homeowners , often
args [" images "] ): # load the image , resize
utils import c v 2 # construct the argument parse and
in a more inclusive character isation of reason . He ikes
Piece > ; 64 ], // Where are the pieces ?
. Israel se izes every opportunity to reiterate its
Times of Israel <|endoftext|> St ress . We all know it
court documents as " Jane Roe " through a dating app
in 54 1 / 3 IP in 2014 D

INTERVAL 2.019 - 3.868
CONTAINS 0.173%

NHL sources one very close to the situation have
2010 United States 21 50 Posts # 16 Glad to hear
litter ing the battle - front s " Ent
release - that 's something slightly different . The
wrong . Like a new bie O O programmer violating the

INTERVAL 0.169 - 2.019
CONTAINS 37.100%

Even more interesting is that when chickens sleep ,
in clothes from three AS F OUR and Ch ado Ralph
Students were told to take shelter as the incident broke
comments on not being drug into another long and costly war
that reached 185 degrees F and 85 percent relative humidity for

INTERVAL -1.681 - 0.169
CONTAINS 62.163%

able to honor our ancestors by making this gift to Northwestern
be a res ounding endorsement , it topped the cool ness
or food in a state or weight ier subjects like
The occupational radiation doses to crew of nuclear vessels in very
re not sure how they would have voted had

INTERVAL -3.530 - -1.681
CONTAINS 0.564%

From the start , Hitchcock seemed intent on creating a
Ch lor op rene rubber goes into all sorts of
+ -------------------- --+ | ... | - doesn 't matter ,
be part of a DACA deal , Trump said
0 ) / 2 . 0 ; // [ 0 ,
Token 2012
Token ID2321
Feature activation+1.33e+00

Pos logit contributions
appropriate+1.028
 Bliss+1.008
uba+0.990
achment+0.986
 Klaus+0.971
Neg logit contributions
Gall-0.943
tera-0.901
 Rak-0.888
horse-0.882
 Serv-0.838
Token United
Token ID1578
Feature activation+1.84e+00

Pos logit contributions
appropriate+0.881
 Bliss+0.863
uba+0.857
achment+0.844
 Cohn+0.830
Neg logit contributions
Gall-0.909
tera-0.877
horse-0.867
 Rak-0.866
 Serv-0.827
Token States
Token ID1829
Feature activation+1.42e+00

Pos logit contributions
appropriate+1.089
uba+1.051
 Bliss+1.040
achment+1.037
��+1.032
Neg logit contributions
Gall-1.403
 Rak-1.357
horse-1.344
tera-1.342
 Serv-1.313
Token 3
Token ID513
Feature activation+1.05e+00

Pos logit contributions
appropriate+0.902
 Bliss+0.888
uba+0.877
achment+0.867
abouts+0.859
Neg logit contributions
Gall-1.001
tera-0.972
horse-0.960
 Rak-0.958
 Serv-0.915
Token240
Token ID16102
Feature activation-7.92e-02

Pos logit contributions
appropriate+0.404
uba+0.382
 Bliss+0.377
achment+0.376
��+0.369
Neg logit contributions
Gall-1.011
 Rak-0.985
horse-0.978
tera-0.977
 Serv-0.960
Token Posts
Token ID12043
Feature activation+3.87e+00

Pos logit contributions
Gall+0.086
 Rak+0.084
horse+0.083
tera+0.083
 Serv+0.081
Neg logit contributions
appropriate-0.021
uba-0.020
 Bliss-0.019
achment-0.019
��-0.019
Token #
Token ID1303
Feature activation+1.21e+00

Pos logit contributions
appropriate+2.141
uba+2.081
achment+2.069
 Bliss+2.063
��+2.010
Neg logit contributions
Gall-3.042
 Rak-2.917
tera-2.893
horse-2.882
 Serv-2.785
Token10
Token ID940
Feature activation+5.28e-01

Pos logit contributions
appropriate+0.809
uba+0.782
achment+0.779
 Bliss+0.773
reviewed+0.764
Neg logit contributions
Gall-0.818
 Rak-0.797
tera-0.787
horse-0.783
 Serv-0.766
Token Happy
Token ID14628
Feature activation+6.37e-01

Pos logit contributions
appropriate+0.347
uba+0.338
 Bliss+0.336
achment+0.334
��+0.330
Neg logit contributions
Gall-0.365
 Rak-0.348
horse-0.347
tera-0.347
 Serv-0.337
Token it
Token ID340
Feature activation+1.65e-01

Pos logit contributions
appropriate+0.423
 Bliss+0.411
uba+0.411
achment+0.407
��+0.403
Neg logit contributions
Gall-0.434
tera-0.415
 Rak-0.414
horse-0.413
 Serv-0.401
Token worked
Token ID3111
Feature activation-4.06e-01

Pos logit contributions
appropriate+0.113
uba+0.110
achment+0.109
 Bliss+0.109
��+0.107
Neg logit contributions
Gall-0.110
 Rak-0.105
tera-0.105
horse-0.104
 Serv-0.102
Token 2012
Token ID2321
Feature activation+8.01e-01

Pos logit contributions
appropriate+0.589
 Bliss+0.578
uba+0.570
achment+0.564
 Klaus+0.555
Neg logit contributions
Gall-0.570
tera-0.541
 Rak-0.534
horse-0.534
 Serv-0.506
Token United
Token ID1578
Feature activation+2.08e+00

Pos logit contributions
appropriate+0.539
 Bliss+0.528
uba+0.523
achment+0.517
 Cohn+0.507
Neg logit contributions
Gall-0.538
tera-0.518
 Rak-0.514
horse-0.513
 Serv-0.491
Token States
Token ID1829
Feature activation+1.20e+00

Pos logit contributions
appropriate+1.309
uba+1.254
achment+1.246
��+1.243
 Bliss+1.228
Neg logit contributions
Gall-1.527
 Rak-1.480
horse-1.457
 Serv-1.444
tera-1.441
Token 5
Token ID642
Feature activation+3.99e-01

Pos logit contributions
appropriate+0.753
 Bliss+0.744
uba+0.730
achment+0.724
abouts+0.718
Neg logit contributions
Gall-0.849
tera-0.826
horse-0.811
 Rak-0.810
 Serv-0.778
Token667
Token ID28933
Feature activation-5.49e-02

Pos logit contributions
appropriate+0.232
uba+0.224
 Bliss+0.223
achment+0.222
��+0.217
Neg logit contributions
Gall-0.307
 Rak-0.297
tera-0.294
horse-0.293
 Serv-0.287
Token Posts
Token ID12043
Feature activation+3.84e+00

Pos logit contributions
Gall+0.060
tera+0.059
 Rak+0.059
horse+0.058
 Serv+0.057
Neg logit contributions
appropriate-0.014
 Bliss-0.013
uba-0.013
achment-0.012
reviewed-0.012
Token #
Token ID1303
Feature activation+8.84e-01

Pos logit contributions
appropriate+2.247
uba+2.174
achment+2.151
 Bliss+2.147
��+2.135
Neg logit contributions
Gall-2.937
 Rak-2.836
tera-2.800
horse-2.798
 Serv-2.736
Token12
Token ID1065
Feature activation-4.05e-01

Pos logit contributions
appropriate+0.615
uba+0.589
achment+0.581
reviewed+0.576
 Bliss+0.573
Neg logit contributions
Gall-0.581
 Rak-0.569
tera-0.557
horse-0.546
 Serv-0.545
Token I
Token ID314
Feature activation+5.29e-01

Pos logit contributions
Gall+0.285
 Rak+0.274
horse+0.272
tera+0.272
 Serv+0.265
Neg logit contributions
appropriate-0.262
uba-0.253
 Bliss-0.252
achment-0.251
��-0.249
Token'm
Token ID1101
Feature activation+7.59e-03

Pos logit contributions
appropriate+0.313
uba+0.305
 Bliss+0.301
achment+0.297
reviewed+0.291
Neg logit contributions
Gall-0.403
 Rak-0.386
tera-0.385
horse-0.384
 Serv-0.373
Token glad
Token ID9675
Feature activation+2.18e-01

Pos logit contributions
appropriate+0.005
uba+0.005
 Bliss+0.005
��+0.005
achment+0.005
Neg logit contributions
Gall-0.005
 Rak-0.005
tera-0.005
horse-0.005
 Serv-0.005
Token 2011
Token ID2813
Feature activation+1.15e+00

Pos logit contributions
appropriate+0.478
 Bliss+0.474
uba+0.462
achment+0.455
 Klaus+0.455
Neg logit contributions
Gall-0.465
tera-0.448
horse-0.436
 Rak-0.436
ascript-0.420
Token United
Token ID1578
Feature activation+1.89e+00

Pos logit contributions
appropriate+0.765
 Bliss+0.750
uba+0.744
achment+0.734
��+0.720
Neg logit contributions
Gall-0.776
tera-0.749
 Rak-0.740
horse-0.739
 Serv-0.708
Token States
Token ID1829
Feature activation+1.10e+00

Pos logit contributions
appropriate+1.185
uba+1.137
achment+1.128
��+1.124
 Bliss+1.115
Neg logit contributions
Gall-1.388
 Rak-1.347
horse-1.325
tera-1.312
 Serv-1.311
Token 228
Token ID29041
Feature activation+5.30e-01

Pos logit contributions
appropriate+0.686
 Bliss+0.677
uba+0.663
achment+0.658
abouts+0.654
Neg logit contributions
Gall-0.781
tera-0.758
horse-0.744
 Rak-0.744
 Serv-0.715
Token65
Token ID2996
Feature activation+1.01e+00

Pos logit contributions
appropriate+0.319
 Bliss+0.311
uba+0.308
achment+0.306
reviewed+0.300
Neg logit contributions
Gall-0.395
tera-0.380
 Rak-0.380
horse-0.377
 Serv-0.366
Token Posts
Token ID12043
Feature activation+3.69e+00

Pos logit contributions
appropriate+0.235
 Bliss+0.227
uba+0.216
achment+0.215
abouts+0.200
Neg logit contributions
Gall-1.115
tera-1.094
horse-1.086
 Rak-1.085
 Serv-1.054
Token #
Token ID1303
Feature activation+1.25e+00

Pos logit contributions
appropriate+2.159
uba+2.090
achment+2.067
 Bliss+2.064
��+2.052
Neg logit contributions
Gall-2.820
 Rak-2.723
tera-2.690
horse-2.688
 Serv-2.630
Token13
Token ID1485
Feature activation-5.08e-01

Pos logit contributions
appropriate+0.802
reviewed+0.770
uba+0.761
achment+0.752
 Bliss+0.732
Neg logit contributions
 Rak-0.873
Gall-0.872
tera-0.848
 Serv-0.833
horse-0.820
Token On
Token ID1550
Feature activation+1.68e+00

Pos logit contributions
Gall+0.356
 Rak+0.341
horse+0.339
tera+0.339
 Serv+0.330
Neg logit contributions
appropriate-0.330
uba-0.320
 Bliss-0.319
achment-0.317
��-0.315
Token September
Token ID2693
Feature activation+2.01e-01

Pos logit contributions
appropriate+1.044
 Bliss+0.988
achment+0.986
abouts+0.983
uba+0.982
Neg logit contributions
Gall-1.235
tera-1.169
horse-1.163
 Rak-1.157
 Serv-1.116
Token 20
Token ID1160
Feature activation+1.67e-01

Pos logit contributions
appropriate+0.152
uba+0.147
achment+0.146
��+0.146
 Bliss+0.145
Neg logit contributions
Gall-0.121
 Rak-0.116
horse-0.114
tera-0.113
 Serv-0.112
Tokencontroller
Token ID36500
Feature activation+3.29e-01

Pos logit contributions
appropriate+0.345
uba+0.332
achment+0.329
��+0.326
 Bliss+0.324
Neg logit contributions
Gall-0.397
 Rak-0.382
horse-0.374
tera-0.374
 Serv-0.370
Token="
Token ID2625
Feature activation+1.15e+00

Pos logit contributions
appropriate+0.239
uba+0.231
 Bliss+0.230
achment+0.228
��+0.227
Neg logit contributions
Gall-0.206
 Rak-0.200
horse-0.197
tera-0.195
 Serv-0.193
TokenPhone
Token ID6132
Feature activation+1.75e+00

Pos logit contributions
appropriate+0.985
uba+0.956
achment+0.951
 Bliss+0.950
��+0.948
Neg logit contributions
Gall-0.563
 Rak-0.547
tera-0.532
horse-0.528
 Serv-0.521
TokenList
Token ID8053
Feature activation+4.15e-03

Pos logit contributions
appropriate+1.292
uba+1.253
 Bliss+1.252
achment+1.249
��+1.245
Neg logit contributions
Gall-1.055
 Rak-1.022
tera-1.010
horse-1.009
 Serv-0.978
TokenCtrl
Token ID40069
Feature activation+8.90e-02

Pos logit contributions
appropriate+0.002
uba+0.002
 Bliss+0.002
achment+0.002
��+0.002
Neg logit contributions
Gall-0.004
 Rak-0.004
horse-0.003
tera-0.003
 Serv-0.003
Token">
Token ID5320
Feature activation+3.61e+00

Pos logit contributions
appropriate+0.060
uba+0.058
achment+0.057
 Bliss+0.057
��+0.057
Neg logit contributions
Gall-0.061
 Rak-0.058
horse-0.058
tera-0.057
 Serv-0.057
Token <
Token ID1279
Feature activation+2.86e+00

Pos logit contributions
appropriate+3.321
uba+3.237
achment+3.223
��+3.221
 Bliss+3.211
Neg logit contributions
Gall-1.544
 Rak-1.423
tera-1.408
horse-1.399
 Serv-1.334
Tokenul
Token ID377
Feature activation-8.00e-01

Pos logit contributions
appropriate+2.391
uba+2.324
achment+2.324
��+2.295
reviewed+2.281
Neg logit contributions
Gall-1.462
 Rak-1.383
tera-1.338
 Serv-1.336
horse-1.332
Token>
Token ID29
Feature activation+6.09e-02

Pos logit contributions
Gall+0.412
 Rak+0.396
horse+0.388
tera+0.385
 Serv+0.378
Neg logit contributions
appropriate-0.669
uba-0.653
 Bliss-0.647
achment-0.646
��-0.644
Token <
Token ID1279
Feature activation+1.89e+00

Pos logit contributions
appropriate+0.060
uba+0.058
achment+0.058
 Bliss+0.058
��+0.058
Neg logit contributions
Gall-0.023
 Rak-0.022
horse-0.021
tera-0.021
 Serv-0.020
Tokenli
Token ID4528
Feature activation-5.64e-01

Pos logit contributions
appropriate+1.647
uba+1.600
achment+1.598
��+1.583
reviewed+1.573
Neg logit contributions
Gall-0.904
 Rak-0.857
tera-0.831
 Serv-0.825
horse-0.817
Token
Token ID198
Feature activation-6.98e-02

Pos logit contributions
Gall+0.362
 Rak+0.350
tera+0.348
horse+0.344
 Serv+0.339
Neg logit contributions
appropriate-0.309
uba-0.293
achment-0.293
 Bliss-0.292
reviewed-0.287
Tokenpub
Token ID12984
Feature activation+9.01e-02

Pos logit contributions
Gall+0.044
 Rak+0.043
horse+0.042
tera+0.042
 Serv+0.042
Neg logit contributions
appropriate-0.051
uba-0.048
achment-0.048
��-0.047
 Bliss-0.047
Token struct
Token ID2878
Feature activation-2.92e-01

Pos logit contributions
appropriate+0.052
uba+0.050
achment+0.050
 Bliss+0.050
��+0.049
Neg logit contributions
Gall-0.070
 Rak-0.067
horse-0.067
tera-0.066
 Serv-0.065
Token Own
Token ID11744
Feature activation+1.16e+00

Pos logit contributions
Gall+0.200
 Rak+0.191
horse+0.191
tera+0.190
 Serv+0.183
Neg logit contributions
appropriate-0.195
 Bliss-0.189
uba-0.188
achment-0.186
��-0.184
Tokened
Token ID276
Feature activation+1.63e+00

Pos logit contributions
appropriate+0.682
uba+0.653
achment+0.652
 Bliss+0.651
��+0.645
Neg logit contributions
Gall-0.871
 Rak-0.856
tera-0.847
horse-0.837
 Serv-0.821
Token <
Token ID1279
Feature activation+3.59e+00

Pos logit contributions
appropriate+1.334
achment+1.289
uba+1.281
 Bliss+1.280
��+1.278
Neg logit contributions
Gall-0.841
 Rak-0.839
tera-0.833
horse-0.814
 Serv-0.782
Token T
Token ID309
Feature activation+1.15e+00

Pos logit contributions
appropriate+2.420
achment+2.351
 Bliss+2.347
��+2.314
uba+2.308
Neg logit contributions
Gall-2.303
 Rak-2.250
horse-2.230
tera-2.218
 Serv-2.117
Token >
Token ID1875
Feature activation+2.06e+00

Pos logit contributions
appropriate+0.767
uba+0.746
achment+0.745
��+0.737
 Bliss+0.734
Neg logit contributions
Gall-0.785
 Rak-0.753
horse-0.743
tera-0.743
 Serv-0.721
Token {
Token ID1391
Feature activation+3.19e-01

Pos logit contributions
appropriate+1.596
 Bliss+1.555
achment+1.542
uba+1.541
��+1.535
Neg logit contributions
Gall-1.166
tera-1.132
 Rak-1.105
horse-1.104
 Serv-1.038
Token...
Token ID2644
Feature activation+1.21e+00

Pos logit contributions
appropriate+0.221
uba+0.213
 Bliss+0.212
achment+0.211
��+0.210
Neg logit contributions
Gall-0.212
 Rak-0.203
horse-0.201
tera-0.201
 Serv-0.195
Token }
Token ID1782
Feature activation-9.54e-01

Pos logit contributions
appropriate+0.804
uba+0.774
achment+0.772
 Bliss+0.770
��+0.766
Neg logit contributions
Gall-0.826
 Rak-0.790
horse-0.789
tera-0.786
 Serv-0.759
Tokencontroller
Token ID36500
Feature activation+4.62e-01

Pos logit contributions
appropriate+0.462
uba+0.443
achment+0.441
reviewed+0.435
��+0.433
Neg logit contributions
Gall-0.525
 Rak-0.497
tera-0.481
 Serv-0.481
horse-0.480
Token="
Token ID2625
Feature activation+1.32e+00

Pos logit contributions
appropriate+0.314
uba+0.305
achment+0.304
 Bliss+0.303
��+0.303
Neg logit contributions
Gall-0.311
 Rak-0.298
tera-0.297
horse-0.295
 Serv-0.285
TokenPhone
Token ID6132
Feature activation+8.89e-01

Pos logit contributions
appropriate+0.898
uba+0.868
achment+0.859
 Bliss+0.854
��+0.854
Neg logit contributions
Gall-0.879
 Rak-0.852
tera-0.837
horse-0.829
 Serv-0.826
TokenList
Token ID8053
Feature activation+1.46e-02

Pos logit contributions
appropriate+0.587
 Bliss+0.569
achment+0.564
uba+0.563
��+0.563
Neg logit contributions
Gall-0.604
 Rak-0.592
tera-0.580
horse-0.579
 Serv-0.565
TokenCtrl
Token ID40069
Feature activation-4.25e-01

Pos logit contributions
appropriate+0.008
uba+0.008
achment+0.008
 Bliss+0.008
��+0.007
Neg logit contributions
Gall-0.012
 Rak-0.011
tera-0.011
horse-0.011
 Serv-0.011
Token">
Token ID5320
Feature activation+3.54e+00

Pos logit contributions
Gall+0.286
 Rak+0.277
horse+0.273
tera+0.273
 Serv+0.266
Neg logit contributions
appropriate-0.287
uba-0.279
achment-0.277
 Bliss-0.277
��-0.275
Token <
Token ID1279
Feature activation+3.00e+00

Pos logit contributions
appropriate+3.016
 Bliss+2.945
��+2.928
achment+2.924
uba+2.914
Neg logit contributions
Gall-1.674
tera-1.581
 Rak-1.573
horse-1.547
 Serv-1.466
Tokenul
Token ID377
Feature activation-1.06e+00

Pos logit contributions
appropriate+2.067
achment+1.996
reviewed+1.977
uba+1.970
��+1.948
Neg logit contributions
Gall-1.949
 Rak-1.894
tera-1.834
 Serv-1.833
horse-1.812
Token>
Token ID29
Feature activation-7.36e-03

Pos logit contributions
Gall+0.592
 Rak+0.561
horse+0.554
tera+0.554
 Serv+0.536
Neg logit contributions
appropriate-0.848
uba-0.821
achment-0.818
 Bliss-0.817
��-0.814
Token <
Token ID1279
Feature activation+1.81e+00

Pos logit contributions
Gall+0.003
 Rak+0.003
tera+0.003
horse+0.003
 Serv+0.003
Neg logit contributions
appropriate-0.007
uba-0.007
 Bliss-0.007
achment-0.007
��-0.007
Tokenli
Token ID4528
Feature activation-6.44e-01

Pos logit contributions
appropriate+1.529
uba+1.486
achment+1.486
 Cohn+1.460
reviewed+1.459
Neg logit contributions
Gall-0.907
 Rak-0.876
tera-0.850
 Serv-0.838
horse-0.813
Token June
Token ID2795
Feature activation+1.17e+00

Pos logit contributions
 Bliss+1.697
appropriate+1.690
 Cheap+1.625
 Klaus+1.616
uba+1.615
Neg logit contributions
Gall-1.875
tera-1.799
horse-1.780
 Rak-1.762
ascript-1.706
Token 2009
Token ID3717
Feature activation+1.32e+00

Pos logit contributions
appropriate+0.788
 Bliss+0.785
uba+0.757
achment+0.752
 Klaus+0.749
Neg logit contributions
Gall-0.785
tera-0.752
horse-0.738
 Rak-0.734
ascript-0.709
Token Germany
Token ID4486
Feature activation+6.12e-01

Pos logit contributions
appropriate+0.899
 Bliss+0.875
uba+0.872
achment+0.864
��+0.850
Neg logit contributions
Gall-0.871
tera-0.834
 Rak-0.832
horse-0.831
 Serv-0.797
Token 247
Token ID30179
Feature activation-2.24e-02

Pos logit contributions
appropriate+0.386
 Bliss+0.380
uba+0.373
achment+0.369
abouts+0.366
Neg logit contributions
Gall-0.436
tera-0.422
 Rak-0.416
horse-0.416
 Serv-0.398
Token34
Token ID2682
Feature activation+3.68e-01

Pos logit contributions
Gall+0.019
tera+0.019
 Rak+0.019
horse+0.019
 Serv+0.018
Neg logit contributions
appropriate-0.011
 Bliss-0.010
uba-0.010
achment-0.010
reviewed-0.010
Token Posts
Token ID12043
Feature activation+3.46e+00

Pos logit contributions
appropriate+0.088
 Bliss+0.085
achment+0.080
uba+0.080
reviewed+0.075
Neg logit contributions
Gall-0.406
tera-0.398
 Rak-0.395
horse-0.394
 Serv-0.384
Token #
Token ID1303
Feature activation+1.04e+00

Pos logit contributions
appropriate+2.033
uba+1.965
achment+1.944
 Bliss+1.941
��+1.932
Neg logit contributions
Gall-2.644
 Rak-2.555
horse-2.523
tera-2.522
 Serv-2.471
Token20
Token ID1238
Feature activation-4.34e-01

Pos logit contributions
appropriate+0.708
uba+0.682
achment+0.664
reviewed+0.656
 Bliss+0.647
Neg logit contributions
 Rak-0.698
Gall-0.697
tera-0.682
 Serv-0.672
horse-0.668
Token Nice
Token ID18460
Feature activation+9.86e-01

Pos logit contributions
Gall+0.318
 Rak+0.306
horse+0.304
tera+0.303
 Serv+0.296
Neg logit contributions
appropriate-0.269
uba-0.260
 Bliss-0.259
achment-0.257
��-0.255
Token,
Token ID11
Feature activation+1.19e-01

Pos logit contributions
appropriate+0.655
uba+0.634
 Bliss+0.632
achment+0.629
��+0.623
Neg logit contributions
Gall-0.674
 Rak-0.643
tera-0.641
horse-0.638
 Serv-0.621
Token happy
Token ID3772
Feature activation+1.02e+00

Pos logit contributions
appropriate+0.080
uba+0.077
 Bliss+0.077
achment+0.077
��+0.076
Neg logit contributions
Gall-0.081
 Rak-0.078
tera-0.077
horse-0.077
 Serv-0.075
Token independence
Token ID10404
Feature activation+7.32e-01

Pos logit contributions
Gall+0.415
 Rak+0.399
horse+0.396
tera+0.395
 Serv+0.386
Neg logit contributions
appropriate-0.340
uba-0.327
 Bliss-0.326
achment-0.325
��-0.323
Token movement
Token ID3356
Feature activation+6.09e-02

Pos logit contributions
appropriate+0.499
achment+0.487
uba+0.487
 Bliss+0.486
��+0.479
Neg logit contributions
Gall-0.481
 Rak-0.464
tera-0.463
horse-0.455
 Serv-0.442
Token further
Token ID2252
Feature activation+4.23e-01

Pos logit contributions
appropriate+0.042
achment+0.041
uba+0.041
 Bliss+0.041
��+0.040
Neg logit contributions
Gall-0.040
 Rak-0.039
tera-0.038
horse-0.038
 Serv-0.037
Token down
Token ID866
Feature activation-4.62e-01

Pos logit contributions
appropriate+0.292
achment+0.288
uba+0.285
 Bliss+0.282
abouts+0.279
Neg logit contributions
Gall-0.277
 Rak-0.265
tera-0.262
horse-0.262
 Serv-0.256
Token the
Token ID262
Feature activation+4.12e-01

Pos logit contributions
Gall+0.327
 Rak+0.315
tera+0.312
horse+0.311
 Serv+0.305
Neg logit contributions
appropriate-0.300
uba-0.290
achment-0.288
 Bliss-0.285
��-0.284
Token line
Token ID1627
Feature activation+3.40e+00

Pos logit contributions
appropriate+0.317
achment+0.308
uba+0.307
 Bliss+0.306
��+0.301
Neg logit contributions
Gall-0.240
tera-0.229
 Rak-0.227
horse-0.226
 Serv-0.219
Token.
Token ID13
Feature activation-1.15e-01

Pos logit contributions
appropriate+2.178
achment+2.112
 Bliss+2.096
abouts+2.075
uba+2.073
Neg logit contributions
Gall-2.370
 Rak-2.283
tera-2.272
horse-2.236
 Serv-2.196
Token
Token ID198
Feature activation-1.74e-01

Pos logit contributions
Gall+0.083
tera+0.081
 Rak+0.080
horse+0.080
 Serv+0.077
Neg logit contributions
appropriate-0.070
 Bliss-0.069
achment-0.068
uba-0.068
��-0.067
Token
Token ID198
Feature activation+1.48e-01

Pos logit contributions
Gall+0.129
 Rak+0.125
tera+0.125
horse+0.124
 Serv+0.122
Neg logit contributions
appropriate-0.105
achment-0.100
uba-0.100
 Bliss-0.100
reviewed-0.098
TokenThe
Token ID464
Feature activation-1.81e-01

Pos logit contributions
appropriate+0.100
achment+0.096
uba+0.095
 Bliss+0.094
reviewed+0.094
Neg logit contributions
 Rak-0.098
Gall-0.097
tera-0.095
horse-0.095
 Serv-0.094
Token truth
Token ID3872
Feature activation+1.01e+00

Pos logit contributions
Gall+0.124
 Rak+0.119
horse+0.118
tera+0.118
 Serv+0.115
Neg logit contributions
appropriate-0.120
uba-0.116
 Bliss-0.115
achment-0.115
��-0.115
Token 2010
Token ID3050
Feature activation+1.81e+00

Pos logit contributions
appropriate+1.120
 Bliss+1.103
uba+1.082
achment+1.072
 Klaus+1.061
Neg logit contributions
Gall-1.049
tera-0.999
 Rak-0.988
horse-0.987
ascript-0.942
Token United
Token ID1578
Feature activation+1.75e+00

Pos logit contributions
appropriate+1.198
 Bliss+1.172
uba+1.163
achment+1.145
abouts+1.126
Neg logit contributions
Gall-1.234
tera-1.189
horse-1.180
 Rak-1.176
 Serv-1.127
Token States
Token ID1829
Feature activation+8.57e-01

Pos logit contributions
appropriate+1.109
uba+1.064
achment+1.056
��+1.049
 Bliss+1.040
Neg logit contributions
Gall-1.273
 Rak-1.236
horse-1.213
 Serv-1.205
tera-1.202
Token 10
Token ID838
Feature activation+7.50e-01

Pos logit contributions
appropriate+0.537
 Bliss+0.527
uba+0.520
achment+0.515
abouts+0.510
Neg logit contributions
Gall-0.611
tera-0.591
 Rak-0.584
horse-0.584
 Serv-0.561
Token53
Token ID4310
Feature activation-5.11e-01

Pos logit contributions
appropriate+0.440
uba+0.425
 Bliss+0.425
achment+0.422
��+0.413
Neg logit contributions
Gall-0.569
 Rak-0.550
tera-0.547
horse-0.547
 Serv-0.532
Token Posts
Token ID12043
Feature activation+3.39e+00

Pos logit contributions
Gall+0.561
tera+0.549
 Rak+0.546
horse+0.546
 Serv+0.533
Neg logit contributions
appropriate-0.125
 Bliss-0.119
uba-0.115
achment-0.114
reviewed-0.107
Token #
Token ID1303
Feature activation+1.11e+00

Pos logit contributions
appropriate+2.032
uba+1.961
achment+1.941
 Bliss+1.938
��+1.931
Neg logit contributions
Gall-2.553
 Rak-2.470
horse-2.439
tera-2.433
 Serv-2.392
Token15
Token ID1314
Feature activation-7.92e-01

Pos logit contributions
appropriate+0.746
achment+0.706
reviewed+0.699
uba+0.695
abouts+0.664
Neg logit contributions
Gall-0.774
 Rak-0.765
tera-0.757
 Serv-0.726
horse-0.698
Token honestly
Token ID12698
Feature activation+2.60e-02

Pos logit contributions
Gall+0.555
 Rak+0.534
horse+0.529
tera+0.528
 Serv+0.517
Neg logit contributions
appropriate-0.515
uba-0.499
 Bliss-0.495
achment-0.494
��-0.491
Token if
Token ID611
Feature activation+6.25e-01

Pos logit contributions
appropriate+0.019
uba+0.018
 Bliss+0.018
achment+0.018
��+0.018
Neg logit contributions
Gall-0.016
 Rak-0.015
tera-0.015
horse-0.015
 Serv-0.015
Token we
Token ID356
Feature activation-9.96e-02

Pos logit contributions
appropriate+0.453
 Bliss+0.441
uba+0.437
achment+0.435
��+0.430
Neg logit contributions
Gall-0.389
horse-0.370
tera-0.369
 Rak-0.369
 Serv-0.357
Token TL
Token ID24811
Feature activation+1.35e+00

Pos logit contributions
appropriate+1.003
 Bliss+0.975
uba+0.974
achment+0.964
��+0.949
Neg logit contributions
Gall-0.983
tera-0.941
 Rak-0.940
horse-0.936
 Serv-0.904
TokenAD
Token ID2885
Feature activation+4.77e-01

Pos logit contributions
appropriate+0.907
 Bliss+0.890
uba+0.881
achment+0.866
 Cohn+0.856
Neg logit contributions
Gall-0.900
tera-0.879
horse-0.875
 Rak-0.871
 Serv-0.826
TokenT
Token ID51
Feature activation+5.46e-01

Pos logit contributions
appropriate+0.329
uba+0.320
 Bliss+0.316
achment+0.315
��+0.312
Neg logit contributions
Gall-0.316
 Rak-0.305
horse-0.301
tera-0.300
 Serv-0.293
Token 235
Token ID28878
Feature activation+8.17e-01

Pos logit contributions
appropriate+0.333
uba+0.323
 Bliss+0.321
achment+0.320
��+0.313
Neg logit contributions
Gall-0.403
 Rak-0.390
horse-0.387
tera-0.386
 Serv-0.376
Token21
Token ID2481
Feature activation-7.76e-02

Pos logit contributions
appropriate+0.510
 Bliss+0.497
uba+0.493
achment+0.490
��+0.480
Neg logit contributions
Gall-0.589
 Rak-0.565
tera-0.563
horse-0.560
 Serv-0.543
Token Posts
Token ID12043
Feature activation+3.37e+00

Pos logit contributions
Gall+0.085
tera+0.083
 Rak+0.083
horse+0.082
 Serv+0.080
Neg logit contributions
appropriate-0.019
 Bliss-0.018
uba-0.017
achment-0.017
reviewed-0.016
Token #
Token ID1303
Feature activation+1.04e+00

Pos logit contributions
appropriate+1.982
uba+1.914
achment+1.893
 Bliss+1.891
��+1.883
Neg logit contributions
Gall-2.567
 Rak-2.482
horse-2.451
tera-2.449
 Serv-2.403
Token18
Token ID1507
Feature activation-5.80e-01

Pos logit contributions
appropriate+0.684
uba+0.656
reviewed+0.640
achment+0.639
abouts+0.621
Neg logit contributions
Gall-0.736
 Rak-0.731
tera-0.697
 Serv-0.694
horse-0.692
Token Good
Token ID4599
Feature activation+9.89e-01

Pos logit contributions
Gall+0.423
 Rak+0.406
horse+0.404
tera+0.403
 Serv+0.394
Neg logit contributions
appropriate-0.360
uba-0.348
 Bliss-0.347
achment-0.345
��-0.343
Token stuff
Token ID3404
Feature activation-3.46e-01

Pos logit contributions
appropriate+0.645
 Bliss+0.630
uba+0.625
achment+0.621
��+0.614
Neg logit contributions
Gall-0.681
tera-0.651
 Rak-0.648
horse-0.643
 Serv-0.629
Token!
Token ID0
Feature activation-2.62e-01

Pos logit contributions
Gall+0.231
 Rak+0.221
horse+0.219
tera+0.219
 Serv+0.214
Neg logit contributions
appropriate-0.236
uba-0.230
 Bliss-0.229
achment-0.227
��-0.226
Token June
Token ID2795
Feature activation+1.10e+00

Pos logit contributions
 Bliss+1.695
appropriate+1.691
 Cheap+1.625
uba+1.617
 Klaus+1.615
Neg logit contributions
Gall-1.853
tera-1.783
horse-1.770
 Rak-1.747
ascript-1.686
Token 2012
Token ID2321
Feature activation+4.53e-01

Pos logit contributions
appropriate+0.740
 Bliss+0.736
uba+0.712
achment+0.708
 Klaus+0.705
Neg logit contributions
Gall-0.736
tera-0.706
horse-0.693
 Rak-0.690
ascript-0.666
Token Canada
Token ID3340
Feature activation+5.15e-01

Pos logit contributions
appropriate+0.306
 Bliss+0.298
uba+0.297
achment+0.293
��+0.289
Neg logit contributions
Gall-0.304
tera-0.292
 Rak-0.291
horse-0.290
 Serv-0.279
Token 16
Token ID1467
Feature activation-4.55e-02

Pos logit contributions
 Bliss+0.311
appropriate+0.311
uba+0.301
 Cohn+0.298
abouts+0.298
Neg logit contributions
Gall-0.378
tera-0.367
horse-0.361
 Rak-0.360
 Serv-0.343
Token10
Token ID940
Feature activation-6.22e-02

Pos logit contributions
Gall+0.035
 Rak+0.034
tera+0.033
horse+0.033
 Serv+0.032
Neg logit contributions
appropriate-0.027
uba-0.026
 Bliss-0.026
achment-0.026
��-0.025
Token Posts
Token ID12043
Feature activation+3.24e+00

Pos logit contributions
Gall+0.068
tera+0.067
 Rak+0.066
horse+0.066
 Serv+0.064
Neg logit contributions
appropriate-0.016
 Bliss-0.015
uba-0.014
achment-0.014
reviewed-0.013
Token #
Token ID1303
Feature activation+1.10e+00

Pos logit contributions
appropriate+1.946
uba+1.875
achment+1.856
 Bliss+1.852
��+1.847
Neg logit contributions
Gall-2.443
 Rak-2.365
horse-2.336
tera-2.329
 Serv-2.294
Token14
Token ID1415
Feature activation-5.64e-01

Pos logit contributions
appropriate+0.704
achment+0.661
reviewed+0.660
uba+0.657
abouts+0.632
Neg logit contributions
Gall-0.796
 Rak-0.788
tera-0.759
 Serv-0.751
horse-0.747
Token On
Token ID1550
Feature activation+1.47e+00

Pos logit contributions
Gall+0.394
 Rak+0.378
horse+0.375
tera+0.375
 Serv+0.366
Neg logit contributions
appropriate-0.368
uba-0.356
 Bliss-0.355
achment-0.353
��-0.351
Token September
Token ID2693
Feature activation-1.65e-01

Pos logit contributions
appropriate+0.915
 Bliss+0.869
achment+0.866
abouts+0.861
uba+0.861
Neg logit contributions
Gall-1.080
tera-1.026
horse-1.018
 Rak-1.014
 Serv-0.977
Token 20
Token ID1160
Feature activation-1.05e-01

Pos logit contributions
Gall+0.095
 Rak+0.091
tera+0.089
horse+0.088
 Serv+0.088
Neg logit contributions
appropriate-0.129
uba-0.125
��-0.125
achment-0.124
 Bliss-0.122
Token counsel
Token ID7739
Feature activation+3.08e-01

Pos logit contributions
appropriate+0.541
achment+0.524
uba+0.523
 Cohn+0.515
 Bliss+0.514
Neg logit contributions
Gall-0.506
tera-0.486
 Rak-0.481
horse-0.475
 Serv-0.460
Token investigating
Token ID10240
Feature activation+7.78e-02

Pos logit contributions
appropriate+0.207
achment+0.200
uba+0.199
 Bliss+0.197
��+0.195
Neg logit contributions
Gall-0.211
 Rak-0.201
horse-0.199
tera-0.198
 Serv-0.194
Token Russia
Token ID3284
Feature activation-5.62e-01

Pos logit contributions
appropriate+0.056
uba+0.055
achment+0.055
 Bliss+0.054
��+0.054
Neg logit contributions
Gall-0.049
 Rak-0.047
tera-0.046
horse-0.046
 Serv-0.045
Token not
Token ID407
Feature activation+4.94e-01

Pos logit contributions
Gall+0.370
 Rak+0.356
horse+0.350
tera+0.350
 Serv+0.343
Neg logit contributions
appropriate-0.392
uba-0.380
achment-0.378
 Bliss-0.374
��-0.373
Token to
Token ID284
Feature activation+4.31e-01

Pos logit contributions
appropriate+0.323
uba+0.307
achment+0.307
reviewed+0.297
abouts+0.296
Neg logit contributions
Gall-0.357
 Rak-0.342
horse-0.334
tera-0.330
 Serv-0.330
Token...
Token ID986
Feature activation+3.23e+00

Pos logit contributions
appropriate+0.253
uba+0.239
achment+0.237
 Bliss+0.234
 Cohn+0.230
Neg logit contributions
Gall-0.336
 Rak-0.322
horse-0.321
tera-0.318
 Serv-0.314
Token
Token ID198
Feature activation+7.68e-01

Pos logit contributions
appropriate+1.858
uba+1.790
achment+1.776
 Bliss+1.768
 Cohn+1.739
Neg logit contributions
Gall-2.428
tera-2.423
horse-2.369
 Rak-2.348
 Serv-2.288
Token
Token ID198
Feature activation+1.03e+00

Pos logit contributions
appropriate+0.455
achment+0.431
 Bliss+0.430
uba+0.429
reviewed+0.427
Neg logit contributions
Gall-0.576
tera-0.561
 Rak-0.558
horse-0.549
 Serv-0.541
TokenTrump
Token ID6170
Feature activation+5.98e-03

Pos logit contributions
appropriate+0.704
uba+0.670
achment+0.669
reviewed+0.657
��+0.655
Neg logit contributions
Gall-0.679
 Rak-0.676
tera-0.658
horse-0.657
 Serv-0.654
Token on
Token ID319
Feature activation+2.54e-02

Pos logit contributions
appropriate+0.004
uba+0.004
 Bliss+0.004
achment+0.004
��+0.004
Neg logit contributions
Gall-0.004
 Rak-0.004
horse-0.004
tera-0.004
 Serv-0.003
Token Wednesday
Token ID3583
Feature activation+1.53e-01

Pos logit contributions
appropriate+0.016
uba+0.016
 Bliss+0.016
achment+0.015
��+0.015
Neg logit contributions
Gall-0.018
 Rak-0.017
horse-0.017
tera-0.017
 Serv-0.017
Token 2002
Token ID6244
Feature activation+1.18e+00

Pos logit contributions
appropriate+0.877
 Bliss+0.857
uba+0.847
achment+0.838
 Cheap+0.824
Neg logit contributions
Gall-0.854
tera-0.813
 Rak-0.810
horse-0.797
ascript-0.758
Token United
Token ID1578
Feature activation+1.46e+00

Pos logit contributions
appropriate+0.793
 Bliss+0.774
uba+0.771
achment+0.760
abouts+0.747
Neg logit contributions
Gall-0.799
tera-0.769
horse-0.763
 Rak-0.762
 Serv-0.727
Token States
Token ID1829
Feature activation+9.10e-01

Pos logit contributions
appropriate+0.919
uba+0.881
achment+0.875
��+0.871
 Bliss+0.862
Neg logit contributions
Gall-1.067
 Rak-1.034
horse-1.016
 Serv-1.009
tera-1.006
Token 3
Token ID513
Feature activation+3.36e-01

Pos logit contributions
appropriate+0.564
 Bliss+0.555
uba+0.548
achment+0.543
abouts+0.537
Neg logit contributions
Gall-0.654
tera-0.634
 Rak-0.626
horse-0.625
 Serv-0.600
Token472
Token ID37856
Feature activation+2.55e-01

Pos logit contributions
appropriate+0.170
uba+0.163
 Bliss+0.161
achment+0.161
��+0.158
Neg logit contributions
Gall-0.284
 Rak-0.276
tera-0.274
horse-0.274
 Serv-0.268
Token Posts
Token ID12043
Feature activation+3.22e+00

Pos logit contributions
appropriate+0.059
 Bliss+0.058
uba+0.055
achment+0.054
 Guests+0.051
Neg logit contributions
Gall-0.282
tera-0.277
horse-0.275
 Rak-0.275
 Serv-0.267
Token #
Token ID1303
Feature activation+6.94e-01

Pos logit contributions
appropriate+1.859
uba+1.802
achment+1.781
 Bliss+1.781
��+1.763
Neg logit contributions
Gall-2.488
 Rak-2.398
tera-2.374
horse-2.370
 Serv-2.313
Token11
Token ID1157
Feature activation-5.45e-02

Pos logit contributions
appropriate+0.536
uba+0.511
achment+0.508
reviewed+0.506
 Bliss+0.505
Neg logit contributions
Gall-0.402
 Rak-0.397
tera-0.391
 Serv-0.380
horse-0.378
Token On
Token ID1550
Feature activation+1.94e+00

Pos logit contributions
Gall+0.039
 Rak+0.037
horse+0.037
tera+0.037
 Serv+0.036
Neg logit contributions
appropriate-0.035
uba-0.034
 Bliss-0.034
achment-0.033
��-0.033
Token September
Token ID2693
Feature activation+2.62e-01

Pos logit contributions
appropriate+1.322
 Bliss+1.280
achment+1.265
uba+1.260
 Cheap+1.228
Neg logit contributions
Gall-1.284
 Rak-1.227
horse-1.227
tera-1.211
 Serv-1.181
Token 20
Token ID1160
Feature activation+4.61e-01

Pos logit contributions
appropriate+0.153
 Bliss+0.150
uba+0.145
achment+0.145
abouts+0.143
Neg logit contributions
Gall-0.199
tera-0.193
horse-0.191
 Rak-0.190
 Serv-0.183
Tokent
Token ID83
Feature activation-2.55e-01

Pos logit contributions
appropriate+0.204
uba+0.195
achment+0.193
��+0.191
 Bliss+0.190
Neg logit contributions
Gall-0.197
 Rak-0.192
tera-0.187
 Serv-0.186
horse-0.186
Tokenweet
Token ID7277
Feature activation-5.49e-01

Pos logit contributions
Gall+0.209
 Rak+0.203
horse+0.200
tera+0.198
 Serv+0.197
Neg logit contributions
appropriate-0.139
uba-0.133
achment-0.132
 Bliss-0.128
��-0.128
Token/
Token ID14
Feature activation+1.14e+00

Pos logit contributions
Gall+0.317
 Rak+0.304
horse+0.300
tera+0.298
 Serv+0.292
Neg logit contributions
appropriate-0.426
uba-0.414
 Bliss-0.410
achment-0.410
��-0.410
Token<|endoftext|>
Token ID50256
Feature activation-1.68e-01

Pos logit contributions
appropriate+0.781
uba+0.759
achment+0.756
 Bliss+0.749
��+0.746
Neg logit contributions
Gall-0.749
 Rak-0.722
horse-0.711
tera-0.710
 Serv-0.699
TokenStory
Token ID11605
Feature activation+2.71e-01

Pos logit contributions
Gall+0.111
 Rak+0.109
tera+0.107
horse+0.106
 Serv+0.105
Neg logit contributions
appropriate-0.116
uba-0.112
achment-0.110
 Bliss-0.110
��-0.109
Token highlights
Token ID11330
Feature activation+3.21e+00

Pos logit contributions
appropriate+0.155
uba+0.148
 Bliss+0.147
achment+0.147
��+0.146
Neg logit contributions
Gall-0.212
 Rak-0.205
tera-0.204
horse-0.201
 Serv-0.197
Token Secretary
Token ID4986
Feature activation+3.37e-01

Pos logit contributions
appropriate+2.084
 Bliss+2.050
uba+2.035
 Cohn+2.008
achment+2.001
Neg logit contributions
Gall-2.205
tera-2.117
horse-2.104
 Rak-2.093
 Serv-2.017
Token of
Token ID286
Feature activation+3.14e-01

Pos logit contributions
appropriate+0.249
uba+0.243
 Bliss+0.242
achment+0.240
��+0.238
Neg logit contributions
Gall-0.207
 Rak-0.198
horse-0.196
tera-0.194
 Serv-0.190
Token Defense
Token ID5947
Feature activation-5.37e-01

Pos logit contributions
appropriate+0.235
uba+0.229
 Bliss+0.227
��+0.226
achment+0.226
Neg logit contributions
Gall-0.190
 Rak-0.181
horse-0.180
tera-0.178
 Serv-0.173
Token Chuck
Token ID14663
Feature activation-2.63e-02

Pos logit contributions
Gall+0.366
 Rak+0.349
horse+0.347
tera+0.346
 Serv+0.337
Neg logit contributions
appropriate-0.360
uba-0.350
 Bliss-0.350
achment-0.345
��-0.340
Token Hag
Token ID21375
Feature activation+6.83e-01

Pos logit contributions
Gall+0.014
tera+0.013
horse+0.013
 Rak+0.013
 Serv+0.012
Neg logit contributions
appropriate-0.022
 Bliss-0.021
uba-0.021
achment-0.021
 Cohn-0.021
Token inept
Token ID43891
Feature activation-1.82e-01

Pos logit contributions
Gall+0.239
 Rak+0.229
tera+0.228
horse+0.227
 Serv+0.223
Neg logit contributions
appropriate-0.218
uba-0.210
 Bliss-0.209
achment-0.208
��-0.206
Token offense
Token ID6907
Feature activation+4.39e-01

Pos logit contributions
Gall+0.114
 Rak+0.110
horse+0.108
tera+0.108
 Serv+0.105
Neg logit contributions
appropriate-0.132
uba-0.128
achment-0.127
 Bliss-0.126
��-0.126
Token and
Token ID290
Feature activation-5.06e-03

Pos logit contributions
appropriate+0.307
uba+0.297
achment+0.295
 Bliss+0.294
��+0.291
Neg logit contributions
Gall-0.287
 Rak-0.275
tera-0.273
horse-0.270
 Serv-0.266
Token OU
Token ID47070
Feature activation-1.25e+00

Pos logit contributions
Gall+0.003
 Rak+0.003
tera+0.003
horse+0.003
 Serv+0.003
Neg logit contributions
appropriate-0.003
uba-0.003
 Bliss-0.003
achment-0.003
��-0.003
Token's
Token ID338
Feature activation+4.25e-01

Pos logit contributions
Gall+0.752
 Rak+0.724
horse+0.716
tera+0.703
 Serv+0.696
Neg logit contributions
appropriate-0.940
uba-0.916
achment-0.906
��-0.906
 Bliss-0.904
Token front
Token ID2166
Feature activation+3.21e+00

Pos logit contributions
appropriate+0.274
 Bliss+0.264
uba+0.263
achment+0.261
��+0.257
Neg logit contributions
Gall-0.301
tera-0.288
 Rak-0.287
horse-0.284
 Serv-0.279
Token eventually
Token ID4191
Feature activation+4.47e-01

Pos logit contributions
appropriate+2.161
achment+2.148
uba+2.097
��+2.064
 Bliss+2.061
Neg logit contributions
Gall-2.122
tera-2.025
 Rak-2.000
horse-1.934
 Serv-1.920
Token wearing
Token ID5762
Feature activation+1.38e+00

Pos logit contributions
appropriate+0.302
uba+0.291
 Bliss+0.291
achment+0.290
��+0.284
Neg logit contributions
Gall-0.302
 Rak-0.289
tera-0.289
horse-0.284
 Serv-0.281
Token out
Token ID503
Feature activation+1.12e+00

Pos logit contributions
appropriate+0.964
 Bliss+0.935
uba+0.933
achment+0.927
��+0.908
Neg logit contributions
Gall-0.899
 Rak-0.856
tera-0.854
horse-0.843
 Serv-0.829
Token against
Token ID1028
Feature activation+7.91e-01

Pos logit contributions
appropriate+0.750
uba+0.727
 Bliss+0.726
achment+0.717
��+0.712
Neg logit contributions
Gall-0.757
 Rak-0.725
tera-0.723
horse-0.716
 Serv-0.701
Token the
Token ID262
Feature activation+5.52e-02

Pos logit contributions
appropriate+0.538
 Bliss+0.524
uba+0.522
achment+0.516
��+0.512
Neg logit contributions
Gall-0.529
 Rak-0.504
tera-0.504
horse-0.500
 Serv-0.487
TokenOU
Token ID2606
Feature activation-1.72e+00

Pos logit contributions
Gall+0.020
tera+0.019
 Rak+0.019
horse+0.019
 Serv+0.018
Neg logit contributions
appropriate-0.051
uba-0.051
 Bliss-0.050
achment-0.050
��-0.050
TokenSE
Token ID5188
Feature activation-2.85e-02

Pos logit contributions
Gall+0.983
 Rak+0.938
horse+0.915
tera+0.906
 Serv+0.906
Neg logit contributions
appropriate-1.347
uba-1.310
achment-1.298
��-1.296
 Bliss-1.288
Token
Token ID851
Feature activation-4.49e-02

Pos logit contributions
Gall+0.019
 Rak+0.018
tera+0.018
horse+0.018
 Serv+0.018
Neg logit contributions
appropriate-0.019
uba-0.019
 Bliss-0.019
achment-0.019
��-0.018
Token An
Token ID1052
Feature activation+5.24e-01

Pos logit contributions
Gall+0.032
 Rak+0.031
horse+0.030
tera+0.030
 Serv+0.030
Neg logit contributions
appropriate-0.029
uba-0.028
 Bliss-0.028
achment-0.028
��-0.027
Token Av
Token ID5184
Feature activation+5.99e-01

Pos logit contributions
appropriate+0.362
uba+0.355
achment+0.349
 Bliss+0.345
��+0.341
Neg logit contributions
Gall-0.349
 Rak-0.334
horse-0.330
tera-0.328
 Serv-0.322
Tokenond
Token ID623
Feature activation+3.20e+00

Pos logit contributions
appropriate+0.403
uba+0.392
achment+0.388
 Bliss+0.384
��+0.378
Neg logit contributions
Gall-0.409
 Rak-0.394
horse-0.386
tera-0.385
 Serv-0.380
Tokenale
Token ID1000
Feature activation+2.57e-02

Pos logit contributions
appropriate+1.718
uba+1.715
achment+1.703
 Bliss+1.675
reviewed+1.620
Neg logit contributions
Gall-2.523
tera-2.441
 Rak-2.440
horse-2.414
 Serv-2.357
Token man
Token ID582
Feature activation-6.81e-01

Pos logit contributions
appropriate+0.016
uba+0.015
achment+0.015
 Bliss+0.015
��+0.015
Neg logit contributions
Gall-0.019
 Rak-0.019
horse-0.018
tera-0.018
 Serv-0.018
Token returned
Token ID4504
Feature activation-1.28e-01

Pos logit contributions
Gall+0.443
 Rak+0.423
tera+0.421
horse+0.415
 Serv+0.408
Neg logit contributions
appropriate-0.477
uba-0.468
achment-0.461
 Bliss-0.461
��-0.451
Token home
Token ID1363
Feature activation-2.34e-01

Pos logit contributions
Gall+0.077
 Rak+0.073
tera+0.073
horse+0.072
 Serv+0.070
Neg logit contributions
appropriate-0.096
uba-0.094
 Bliss-0.093
achment-0.093
��-0.092
Token last
Token ID938
Feature activation+8.27e-01

Pos logit contributions
Gall+0.150
tera+0.143
 Rak+0.140
horse+0.139
 Serv+0.136
Neg logit contributions
appropriate-0.169
uba-0.165
 Bliss-0.163
achment-0.162
 Cohn-0.158
Token.
Token ID13
Feature activation+4.64e-01

Pos logit contributions
Gall+0.242
 Rak+0.233
horse+0.229
tera+0.229
 Serv+0.225
Neg logit contributions
appropriate-0.284
uba-0.275
achment-0.273
 Bliss-0.272
��-0.272
Token Though
Token ID7486
Feature activation+3.86e-02

Pos logit contributions
 Bliss+0.274
appropriate+0.273
uba+0.265
achment+0.264
 Cohn+0.261
Neg logit contributions
Gall-0.347
tera-0.339
horse-0.332
 Rak-0.330
 Serv-0.318
Token shows
Token ID2523
Feature activation-3.93e-01

Pos logit contributions
appropriate+0.027
 Bliss+0.026
uba+0.026
achment+0.025
reviewed+0.025
Neg logit contributions
Gall-0.026
tera-0.024
 Rak-0.024
horse-0.024
 Serv-0.023
Token like
Token ID588
Feature activation+5.97e-01

Pos logit contributions
Gall+0.292
 Rak+0.281
tera+0.280
horse+0.277
 Serv+0.272
Neg logit contributions
appropriate-0.240
uba-0.233
 Bliss-0.230
achment-0.229
��-0.224
Token Or
Token ID1471
Feature activation+5.18e-01

Pos logit contributions
appropriate+0.388
 Bliss+0.376
uba+0.373
achment+0.372
��+0.365
Neg logit contributions
Gall-0.421
tera-0.404
 Rak-0.402
horse-0.400
 Serv-0.388
Tokenphan
Token ID19080
Feature activation+3.19e+00

Pos logit contributions
appropriate+0.300
uba+0.290
achment+0.286
 Bliss+0.285
��+0.282
Neg logit contributions
Gall-0.402
 Rak-0.388
horse-0.383
tera-0.382
 Serv-0.376
Token Black
Token ID2619
Feature activation-1.62e+00

Pos logit contributions
appropriate+2.367
 Bliss+2.353
uba+2.351
achment+2.304
��+2.265
Neg logit contributions
Gall-1.898
tera-1.802
 Rak-1.793
horse-1.771
 Serv-1.698
Token and
Token ID290
Feature activation-3.13e-02

Pos logit contributions
Gall+1.041
 Rak+1.002
horse+0.988
tera+0.984
 Serv+0.967
Neg logit contributions
appropriate-1.145
uba-1.110
achment-1.101
 Bliss-1.099
��-1.096
Token The
Token ID383
Feature activation+3.64e-01

Pos logit contributions
Gall+0.022
 Rak+0.021
tera+0.021
horse+0.021
 Serv+0.020
Neg logit contributions
appropriate-0.021
uba-0.020
 Bliss-0.020
achment-0.020
��-0.019
Token Vampire
Token ID21463
Feature activation+1.40e+00

Pos logit contributions
appropriate+0.248
uba+0.240
 Bliss+0.239
achment+0.238
��+0.235
Neg logit contributions
Gall-0.245
 Rak-0.234
tera-0.233
horse-0.233
 Serv-0.226
Token Di
Token ID6031
Feature activation+1.01e+00

Pos logit contributions
appropriate+1.004
uba+0.981
 Bliss+0.977
achment+0.973
��+0.961
Neg logit contributions
Gall-0.888
tera-0.848
 Rak-0.848
horse-0.838
 Serv-0.815
Token 2011
Token ID2813
Feature activation+1.29e+00

Pos logit contributions
appropriate+0.895
 Bliss+0.880
uba+0.858
achment+0.851
 Klaus+0.845
Neg logit contributions
Gall-0.851
tera-0.818
 Rak-0.799
horse-0.798
ascript-0.764
Token United
Token ID1578
Feature activation+1.58e+00

Pos logit contributions
appropriate+0.860
 Bliss+0.840
uba+0.835
achment+0.823
abouts+0.807
Neg logit contributions
Gall-0.877
tera-0.847
 Rak-0.837
horse-0.837
 Serv-0.801
Token States
Token ID1829
Feature activation+9.26e-01

Pos logit contributions
appropriate+1.000
uba+0.960
achment+0.952
��+0.951
 Bliss+0.940
Neg logit contributions
Gall-1.155
 Rak-1.122
horse-1.102
 Serv-1.092
tera-1.090
Token 6
Token ID718
Feature activation+7.11e-01

Pos logit contributions
appropriate+0.580
 Bliss+0.571
uba+0.559
achment+0.556
abouts+0.553
Neg logit contributions
Gall-0.660
tera-0.641
 Rak-0.629
horse-0.629
 Serv-0.604
Token12
Token ID1065
Feature activation-1.00e+00

Pos logit contributions
appropriate+0.399
uba+0.385
 Bliss+0.383
achment+0.381
reviewed+0.371
Neg logit contributions
Gall-0.559
 Rak-0.542
tera-0.538
horse-0.536
 Serv-0.525
Token Posts
Token ID12043
Feature activation+3.16e+00

Pos logit contributions
Gall+1.063
 Rak+1.040
tera+1.040
horse+1.035
 Serv+1.011
Neg logit contributions
appropriate-0.280
 Bliss-0.266
uba-0.258
achment-0.256
reviewed-0.245
Token #
Token ID1303
Feature activation+9.21e-01

Pos logit contributions
appropriate+1.884
uba+1.815
achment+1.796
 Bliss+1.793
��+1.788
Neg logit contributions
Gall-2.387
 Rak-2.312
horse-2.284
tera-2.276
 Serv-2.244
Token17
Token ID1558
Feature activation-1.10e+00

Pos logit contributions
appropriate+0.632
uba+0.607
reviewed+0.601
achment+0.590
 Bliss+0.584
Neg logit contributions
Gall-0.617
 Rak-0.610
tera-0.594
 Serv-0.584
horse-0.575
Token Good
Token ID4599
Feature activation+1.19e+00

Pos logit contributions
Gall+0.793
 Rak+0.764
horse+0.757
tera+0.756
 Serv+0.740
Neg logit contributions
appropriate-0.688
uba-0.665
 Bliss-0.661
achment-0.659
��-0.655
Token to
Token ID284
Feature activation-3.91e-01

Pos logit contributions
appropriate+0.753
 Bliss+0.735
uba+0.728
achment+0.722
��+0.716
Neg logit contributions
Gall-0.839
tera-0.803
 Rak-0.800
horse-0.792
 Serv-0.777
Token hear
Token ID3285
Feature activation-3.46e-01

Pos logit contributions
Gall+0.318
 Rak+0.307
tera+0.306
horse+0.306
 Serv+0.299
Neg logit contributions
appropriate-0.210
uba-0.203
 Bliss-0.201
achment-0.199
��-0.198
Token exit
Token ID8420
Feature activation+9.11e-01

Pos logit contributions
appropriate+0.096
uba+0.092
 Bliss+0.091
achment+0.091
��+0.089
Neg logit contributions
Gall-0.098
 Rak-0.094
horse-0.093
tera-0.092
 Serv-0.091
Token his
Token ID465
Feature activation+3.42e-01

Pos logit contributions
appropriate+0.591
 Bliss+0.575
uba+0.574
achment+0.570
��+0.561
Neg logit contributions
Gall-0.636
 Rak-0.609
tera-0.607
horse-0.599
 Serv-0.585
Token Edmonton
Token ID21165
Feature activation+8.84e-01

Pos logit contributions
appropriate+0.235
 Bliss+0.230
achment+0.230
uba+0.228
 Cohn+0.225
Neg logit contributions
Gall-0.226
tera-0.215
 Rak-0.211
horse-0.210
 Serv-0.204
Token-
Token ID12
Feature activation-2.24e-01

Pos logit contributions
appropriate+0.624
achment+0.617
 Bliss+0.610
uba+0.606
 Cohn+0.597
Neg logit contributions
Gall-0.560
tera-0.531
 Rak-0.525
horse-0.523
 Serv-0.509
TokenStr
Token ID13290
Feature activation-8.59e-01

Pos logit contributions
Gall+0.148
 Rak+0.147
tera+0.143
 Serv+0.142
horse+0.141
Neg logit contributions
appropriate-0.152
uba-0.146
achment-0.146
 Bliss-0.143
reviewed-0.143
Tokenath
Token ID776
Feature activation+3.15e+00

Pos logit contributions
Gall+0.488
 Rak+0.471
tera+0.458
horse+0.456
 Serv+0.448
Neg logit contributions
appropriate-0.677
uba-0.662
achment-0.657
 Bliss-0.646
��-0.637
Tokencon
Token ID1102
Feature activation-1.62e+00

Pos logit contributions
achment+1.311
appropriate+1.310
uba+1.262
 Bliss+1.205
��+1.202
Neg logit contributions
Gall-2.917
 Rak-2.848
 Serv-2.767
tera-2.765
horse-2.730
Tokena
Token ID64
Feature activation-3.14e-03

Pos logit contributions
Gall+0.873
 Rak+0.835
horse+0.819
tera+0.815
 Serv+0.799
Neg logit contributions
appropriate-1.314
uba-1.279
achment-1.269
 Bliss-1.267
��-1.262
Token seat
Token ID5852
Feature activation+3.07e-01

Pos logit contributions
Gall+0.002
 Rak+0.002
tera+0.002
horse+0.002
 Serv+0.002
Neg logit contributions
appropriate-0.002
achment-0.002
 Bliss-0.002
uba-0.002
��-0.002
Token.
Token ID13
Feature activation+7.29e-02

Pos logit contributions
appropriate+0.212
achment+0.207
uba+0.205
 Bliss+0.205
��+0.202
Neg logit contributions
Gall-0.203
 Rak-0.194
tera-0.191
horse-0.190
 Serv-0.187
Token By
Token ID2750
Feature activation-5.94e-01

Pos logit contributions
appropriate+0.045
 Bliss+0.044
achment+0.044
uba+0.044
 Cohn+0.043
Neg logit contributions
Gall-0.052
tera-0.051
 Rak-0.051
horse-0.050
 Serv-0.049
Token\n
Token ID198
Feature activation+4.68e-01

Pos logit contributions
appropriate+0.490
 Bliss+0.478
achment+0.474
uba+0.473
��+0.462
Neg logit contributions
Gall-0.535
tera-0.516
horse-0.513
 Rak-0.512
 Serv-0.495
Token\n
Token ID198
Feature activation+4.36e-01

Pos logit contributions
appropriate+0.281
achment+0.269
 Bliss+0.268
reviewed+0.267
uba+0.266
Neg logit contributions
Gall-0.340
tera-0.339
 Rak-0.337
horse-0.331
 Serv-0.325
TokenView
Token ID7680
Feature activation+3.25e-01

Pos logit contributions
appropriate+0.269
achment+0.258
uba+0.256
reviewed+0.253
 Bliss+0.250
Neg logit contributions
Gall-0.313
 Rak-0.312
tera-0.305
horse-0.304
 Serv-0.302
Token more
Token ID517
Feature activation+1.76e-01

Pos logit contributions
appropriate+0.209
achment+0.205
 Bliss+0.204
��+0.198
uba+0.197
Neg logit contributions
Gall-0.226
horse-0.216
 Rak-0.215
tera-0.215
 Serv-0.209
Token<|endoftext|>
Token ID50256
Feature activation+8.90e-02

Pos logit contributions
appropriate+0.133
uba+0.128
 Bliss+0.128
achment+0.128
��+0.126
Neg logit contributions
Gall-0.106
 Rak-0.102
horse-0.100
tera-0.100
 Serv-0.098
TokenAbout
Token ID8585
Feature activation+3.15e+00

Pos logit contributions
appropriate+0.061
uba+0.059
achment+0.058
 Bliss+0.058
��+0.057
Neg logit contributions
Gall-0.059
 Rak-0.058
tera-0.056
horse-0.056
 Serv-0.056
Token\n
Token ID198
Feature activation+1.01e+00

Pos logit contributions
appropriate+2.119
 Bliss+2.088
uba+2.054
achment+2.030
��+2.019
Neg logit contributions
Gall-2.097
tera-1.994
horse-1.992
 Rak-1.968
 Serv-1.908
Token\n
Token ID198
Feature activation+1.04e+00

Pos logit contributions
appropriate+0.590
 Bliss+0.561
uba+0.554
reviewed+0.553
achment+0.552
Neg logit contributions
Gall-0.761
tera-0.745
 Rak-0.741
horse-0.727
 Serv-0.716
TokenR
Token ID49
Feature activation+4.44e-02

Pos logit contributions
appropriate+0.718
uba+0.686
achment+0.679
 Bliss+0.678
reviewed+0.672
Neg logit contributions
Gall-0.672
 Rak-0.666
tera-0.651
horse-0.646
 Serv-0.644
Tokenacing
Token ID4092
Feature activation-1.19e+00

Pos logit contributions
appropriate+0.030
uba+0.029
achment+0.029
 Bliss+0.029
��+0.028
Neg logit contributions
Gall-0.030
 Rak-0.029
horse-0.029
tera-0.028
 Serv-0.028
Token Apex
Token ID49440
Feature activation+4.52e-01

Pos logit contributions
Gall+0.815
 Rak+0.786
tera+0.775
horse+0.773
 Serv+0.759
Neg logit contributions
appropriate-0.798
uba-0.772
 Bliss-0.766
achment-0.763
��-0.760
Token don
Token ID836
Feature activation+6.73e-01

Pos logit contributions
appropriate+0.126
uba+0.123
 Bliss+0.121
achment+0.120
abouts+0.117
Neg logit contributions
Gall-0.177
 Rak-0.169
tera-0.169
horse-0.167
 Serv-0.163
Token't
Token ID470
Feature activation-5.58e-01

Pos logit contributions
appropriate+0.410
uba+0.385
achment+0.384
 Bliss+0.382
reviewed+0.378
Neg logit contributions
Gall-0.503
 Rak-0.488
tera-0.484
horse-0.477
 Serv-0.471
Token want
Token ID765
Feature activation-3.42e-02

Pos logit contributions
Gall+0.441
 Rak+0.423
tera+0.420
horse+0.418
 Serv+0.409
Neg logit contributions
appropriate-0.321
uba-0.307
 Bliss-0.304
achment-0.303
��-0.297
Token you
Token ID345
Feature activation-5.04e-01

Pos logit contributions
Gall+0.021
 Rak+0.020
horse+0.020
tera+0.020
 Serv+0.020
Neg logit contributions
appropriate-0.025
uba-0.024
achment-0.024
 Bliss-0.024
��-0.024
Token t
Token ID256
Feature activation-2.95e-02

Pos logit contributions
Gall+0.335
 Rak+0.322
horse+0.318
tera+0.317
 Serv+0.311
Neg logit contributions
appropriate-0.349
uba-0.338
achment-0.335
 Bliss-0.334
��-0.332
Tokeniring
Token ID3428
Feature activation-3.53e+00

Pos logit contributions
Gall+0.021
 Rak+0.021
horse+0.020
tera+0.020
 Serv+0.020
Neg logit contributions
appropriate-0.019
uba-0.018
achment-0.018
 Bliss-0.018
��-0.018
Token yourself
Token ID3511
Feature activation-7.87e-01

Pos logit contributions
Gall+2.222
 Rak+2.158
horse+2.134
tera+2.110
 Serv+2.091
Neg logit contributions
appropriate-2.519
��-2.444
uba-2.435
achment-2.429
 Bliss-2.413
Token babe
Token ID42666
Feature activation-6.68e-01

Pos logit contributions
Gall+0.458
 Rak+0.437
tera+0.430
horse+0.430
 Serv+0.419
Neg logit contributions
appropriate-0.608
uba-0.591
 Bliss-0.587
achment-0.587
��-0.580
Token,
Token ID11
Feature activation-5.71e-01

Pos logit contributions
Gall+0.441
 Rak+0.427
horse+0.423
tera+0.419
 Serv+0.414
Neg logit contributions
appropriate-0.461
uba-0.446
achment-0.442
��-0.442
 Bliss-0.440
Token now
Token ID783
Feature activation-2.15e-01

Pos logit contributions
Gall+0.355
 Rak+0.341
horse+0.336
tera+0.335
 Serv+0.328
Neg logit contributions
appropriate-0.418
uba-0.405
achment-0.402
 Bliss-0.401
��-0.400
Token be
Token ID307
Feature activation+2.98e-01

Pos logit contributions
Gall+0.145
 Rak+0.139
tera+0.138
horse+0.137
 Serv+0.134
Neg logit contributions
appropriate-0.146
uba-0.142
 Bliss-0.140
achment-0.140
��-0.138
Token to
Token ID284
Feature activation-2.32e-01

Pos logit contributions
appropriate+0.558
uba+0.550
 Bliss+0.548
achment+0.539
 Cheap+0.530
Neg logit contributions
Gall-0.636
tera-0.600
horse-0.597
 Rak-0.595
 Serv-0.578
Token Go
Token ID1514
Feature activation-7.06e-02

Pos logit contributions
Gall+0.168
 Rak+0.160
horse+0.160
tera+0.159
 Serv+0.154
Neg logit contributions
appropriate-0.146
 Bliss-0.143
uba-0.142
achment-0.140
��-0.137
Token Back
Token ID5157
Feature activation-3.67e-01

Pos logit contributions
Gall+0.050
tera+0.047
 Rak+0.047
horse+0.046
 Serv+0.045
Neg logit contributions
appropriate-0.046
uba-0.045
 Bliss-0.045
achment-0.044
 Cheap-0.043
Token There
Token ID1318
Feature activation-5.82e-01

Pos logit contributions
Gall+0.245
 Rak+0.232
tera+0.230
horse+0.228
 Serv+0.225
Neg logit contributions
appropriate-0.253
uba-0.248
achment-0.245
 Bliss-0.244
abouts-0.241
Token Som
Token ID9995
Feature activation-8.18e-01

Pos logit contributions
Gall+0.381
 Rak+0.362
tera+0.359
horse+0.358
 Serv+0.351
Neg logit contributions
appropriate-0.407
uba-0.397
 Bliss-0.395
achment-0.393
abouts-0.388
Tokeneday
Token ID23712
Feature activation-3.39e+00

Pos logit contributions
Gall+0.679
 Rak+0.660
horse+0.652
tera+0.651
 Serv+0.642
Neg logit contributions
appropriate-0.427
uba-0.410
achment-0.405
 Bliss-0.404
��-0.402
Token."
Token ID526
Feature activation+3.48e-01

Pos logit contributions
Gall+2.039
 Rak+1.989
horse+1.952
tera+1.931
 Serv+1.916
Neg logit contributions
appropriate-2.523
uba-2.445
��-2.439
achment-2.418
 Bliss-2.402
Token\n
Token ID198
Feature activation+7.47e-01

Pos logit contributions
appropriate+0.224
 Bliss+0.219
uba+0.217
achment+0.215
reviewed+0.211
Neg logit contributions
Gall-0.242
tera-0.234
horse-0.232
 Rak-0.232
 Serv-0.225
Token\n
Token ID198
Feature activation+8.27e-01

Pos logit contributions
appropriate+0.443
 Bliss+0.420
achment+0.419
uba+0.418
reviewed+0.417
Neg logit contributions
Gall-0.560
tera-0.547
 Rak-0.544
horse-0.535
 Serv-0.526
TokenIn
Token ID818
Feature activation-3.21e-01

Pos logit contributions
appropriate+0.589
uba+0.559
achment+0.558
reviewed+0.556
 Bliss+0.550
Neg logit contributions
 Rak-0.521
Gall-0.512
tera-0.509
 Serv-0.503
horse-0.499
Token 1984
Token ID12844
Feature activation-3.44e-01

Pos logit contributions
Gall+0.215
 Rak+0.207
horse+0.205
tera+0.204
 Serv+0.200
Neg logit contributions
appropriate-0.221
uba-0.213
 Bliss-0.211
achment-0.211
��-0.209
Token v
Token ID410
Feature activation-8.81e-01

Pos logit contributions
Gall+0.315
tera+0.302
 Rak+0.302
horse+0.299
 Serv+0.291
Neg logit contributions
appropriate-0.264
uba-0.254
 Bliss-0.253
achment-0.252
��-0.248
Tokenene
Token ID1734
Feature activation+2.01e-01

Pos logit contributions
Gall+0.517
 Rak+0.496
horse+0.488
tera+0.487
 Serv+0.477
Neg logit contributions
appropriate-0.677
uba-0.658
achment-0.653
 Bliss-0.648
��-0.642
Tokener
Token ID263
Feature activation-2.38e-01

Pos logit contributions
appropriate+0.180
achment+0.173
 Bliss+0.173
uba+0.172
��+0.171
Neg logit contributions
Gall-0.092
 Rak-0.088
horse-0.085
tera-0.084
 Serv-0.083
Token of
Token ID286
Feature activation-3.85e-01

Pos logit contributions
Gall+0.139
 Rak+0.134
horse+0.131
tera+0.131
 Serv+0.128
Neg logit contributions
appropriate-0.182
uba-0.177
achment-0.176
 Bliss-0.176
��-0.175
Token moral
Token ID6573
Feature activation+7.77e-01

Pos logit contributions
Gall+0.264
 Rak+0.253
tera+0.252
horse+0.250
 Serv+0.243
Neg logit contributions
appropriate-0.260
uba-0.250
achment-0.248
 Bliss-0.248
��-0.246
Token rect
Token ID13621
Feature activation-3.33e+00

Pos logit contributions
appropriate+0.459
uba+0.442
 Bliss+0.436
achment+0.433
��+0.432
Neg logit contributions
Gall-0.598
tera-0.575
 Rak-0.574
horse-0.564
 Serv-0.552
Tokenitude
Token ID3984
Feature activation-1.07e+00

Pos logit contributions
Gall+1.803
 Rak+1.724
horse+1.696
tera+1.684
 Serv+1.660
Neg logit contributions
appropriate-2.697
uba-2.624
��-2.601
achment-2.598
 Bliss-2.597
Token for
Token ID329
Feature activation-4.01e-01

Pos logit contributions
Gall+0.682
 Rak+0.656
tera+0.646
horse+0.645
 Serv+0.631
Neg logit contributions
appropriate-0.764
uba-0.740
achment-0.734
 Bliss-0.732
��-0.727
Token wars
Token ID9976
Feature activation-1.03e-01

Pos logit contributions
Gall+0.266
tera+0.255
 Rak+0.255
horse+0.253
 Serv+0.246
Neg logit contributions
appropriate-0.276
uba-0.267
 Bliss-0.265
achment-0.265
��-0.262
Token of
Token ID286
Feature activation-4.21e-02

Pos logit contributions
Gall+0.069
tera+0.066
 Rak+0.066
horse+0.064
 Serv+0.063
Neg logit contributions
appropriate-0.071
uba-0.069
achment-0.068
 Bliss-0.068
��-0.067
Token plunder
Token ID42996
Feature activation+3.26e-02

Pos logit contributions
Gall+0.028
 Rak+0.027
tera+0.027
horse+0.027
 Serv+0.026
Neg logit contributions
appropriate-0.029
uba-0.028
 Bliss-0.028
achment-0.028
��-0.028
Token.
Token ID13
Feature activation+3.09e-01

Pos logit contributions
appropriate+0.756
achment+0.724
uba+0.724
reviewed+0.711
 Bliss+0.710
Neg logit contributions
Gall-0.943
 Rak-0.915
tera-0.911
horse-0.895
 Serv-0.886
Tokenparse
Token ID29572
Feature activation+3.73e-01

Pos logit contributions
appropriate+0.178
uba+0.166
achment+0.165
reviewed+0.162
abouts+0.159
Neg logit contributions
Gall-0.247
 Rak-0.242
 Serv-0.237
horse-0.232
tera-0.231
Token_
Token ID62
Feature activation+1.05e+00

Pos logit contributions
appropriate+0.230
achment+0.221
uba+0.219
 Bliss+0.218
reviewed+0.214
Neg logit contributions
Gall-0.271
 Rak-0.265
horse-0.262
tera-0.262
 Serv-0.258
Tokenargs
Token ID22046
Feature activation-6.59e-01

Pos logit contributions
appropriate+0.908
uba+0.872
achment+0.871
reviewed+0.859
abouts+0.846
Neg logit contributions
Gall-0.509
 Rak-0.496
tera-0.489
 Serv-0.483
horse-0.480
Token())
Token ID28955
Feature activation-1.13e+00

Pos logit contributions
Gall+0.286
 Rak+0.272
horse+0.267
tera+0.265
 Serv+0.259
Neg logit contributions
appropriate-0.604
uba-0.590
achment-0.586
 Bliss-0.583
��-0.577
Token #
Token ID1303
Feature activation-3.33e+00

Pos logit contributions
Gall+0.805
 Rak+0.789
 Serv+0.771
horse+0.771
tera+0.766
Neg logit contributions
appropriate-0.719
 Bliss-0.690
uba-0.690
achment-0.686
��-0.680
Token initialize
Token ID41216
Feature activation-2.95e-01

Pos logit contributions
Gall+2.424
 Rak+2.359
tera+2.316
horse+2.311
 Serv+2.284
Neg logit contributions
appropriate-2.047
uba-2.005
achment-1.976
��-1.967
 Bliss-1.957
Token a
Token ID257
Feature activation-3.93e-01

Pos logit contributions
Gall+0.187
 Rak+0.177
tera+0.177
horse+0.176
 Serv+0.169
Neg logit contributions
appropriate-0.215
achment-0.206
 Bliss-0.206
uba-0.205
 MFT-0.200
Token rectangular
Token ID36954
Feature activation-1.07e+00

Pos logit contributions
Gall+0.311
horse+0.301
 Rak+0.300
tera+0.297
 Serv+0.293
Neg logit contributions
appropriate-0.220
 Bliss-0.208
uba-0.207
achment-0.207
 MFT-0.202
Token and
Token ID290
Feature activation-1.11e+00

Pos logit contributions
Gall+0.746
 Rak+0.711
tera+0.711
horse+0.706
 Serv+0.685
Neg logit contributions
appropriate-0.714
 Bliss-0.684
achment-0.684
uba-0.681
 MFT-0.661
Token square
Token ID6616
Feature activation-1.08e+00

Pos logit contributions
Gall+0.629
 Rak+0.591
tera+0.589
horse+0.580
 Serv+0.569
Neg logit contributions
appropriate-0.893
uba-0.861
achment-0.852
 Bliss-0.849
abouts-0.835
Token--
Token ID438
Feature activation-5.01e-01

Pos logit contributions
appropriate+0.593
achment+0.571
uba+0.570
 Bliss+0.568
��+0.566
Neg logit contributions
Gall-0.446
 Rak-0.426
tera-0.420
horse-0.420
 Serv-0.409
Token///
Token ID20379
Feature activation-1.21e+00

Pos logit contributions
Gall+0.320
 Rak+0.307
horse+0.303
tera+0.303
 Serv+0.296
Neg logit contributions
appropriate-0.359
uba-0.347
achment-0.345
 Bliss-0.344
��-0.342
Token L
Token ID406
Feature activation-3.53e-01

Pos logit contributions
Gall+0.809
 Rak+0.780
horse+0.771
tera+0.770
 Serv+0.754
Neg logit contributions
appropriate-0.832
uba-0.804
achment-0.798
 Bliss-0.798
��-0.792
Tokenett
Token ID3087
Feature activation-7.78e-01

Pos logit contributions
Gall+0.240
 Rak+0.231
horse+0.229
tera+0.227
 Serv+0.223
Neg logit contributions
appropriate-0.237
uba-0.232
achment-0.230
 Bliss-0.226
��-0.225
Tokenre
Token ID260
Feature activation-9.30e-01

Pos logit contributions
Gall+0.542
 Rak+0.524
horse+0.517
tera+0.515
 Serv+0.506
Neg logit contributions
appropriate-0.510
uba-0.494
achment-0.490
 Bliss-0.488
��-0.483
Token du
Token ID7043
Feature activation-3.31e+00

Pos logit contributions
Gall+0.610
 Rak+0.587
horse+0.581
tera+0.576
 Serv+0.566
Neg logit contributions
appropriate-0.646
uba-0.627
achment-0.623
 Bliss-0.621
��-0.615
Token bureau
Token ID10583
Feature activation-2.26e+00

Pos logit contributions
Gall+2.237
 Rak+2.165
horse+2.125
tera+2.120
 Serv+2.092
Neg logit contributions
appropriate-2.247
uba-2.173
achment-2.152
��-2.147
 Bliss-2.147
Token de
Token ID390
Feature activation-9.82e-01

Pos logit contributions
Gall+1.613
 Rak+1.559
horse+1.539
tera+1.533
 Serv+1.507
Neg logit contributions
appropriate-1.450
uba-1.401
achment-1.390
 Bliss-1.387
��-1.382
Token l
Token ID300
Feature activation-4.22e-01

Pos logit contributions
Gall+0.683
 Rak+0.658
horse+0.652
tera+0.649
 Serv+0.635
Neg logit contributions
appropriate-0.646
uba-0.624
 Bliss-0.619
achment-0.618
��-0.615
Token'
Token ID6
Feature activation-1.59e+00

Pos logit contributions
Gall+0.215
 Rak+0.206
horse+0.203
tera+0.200
 Serv+0.194
Neg logit contributions
appropriate-0.357
uba-0.350
achment-0.345
 Bliss-0.342
��-0.340
TokenAmb
Token ID35649
Feature activation+8.26e-01

Pos logit contributions
Gall+1.057
 Rak+1.033
horse+1.011
tera+1.007
 Serv+0.993
Neg logit contributions
appropriate-1.101
uba-1.061
achment-1.051
��-1.040
 Bliss-1.040
Tokenled
Token ID992
Feature activation-1.18e-01

Pos logit contributions
appropriate+0.127
uba+0.120
achment+0.119
 Bliss+0.119
��+0.116
Neg logit contributions
Gall-0.285
 Rak-0.276
tera-0.274
horse-0.274
 Serv-0.270
Token immortal
Token ID26156
Feature activation-5.16e-01

Pos logit contributions
Gall+0.077
tera+0.074
 Rak+0.073
horse+0.073
 Serv+0.071
Neg logit contributions
appropriate-0.082
uba-0.080
 Bliss-0.080
achment-0.079
��-0.078
Token eld
Token ID18441
Feature activation-1.03e-01

Pos logit contributions
Gall+0.340
 Rak+0.323
tera+0.323
horse+0.322
 Serv+0.312
Neg logit contributions
appropriate-0.357
uba-0.348
 Bliss-0.346
achment-0.343
��-0.338
Tokenritch
Token ID46510
Feature activation-3.61e-01

Pos logit contributions
Gall+0.073
 Rak+0.070
tera+0.069
horse+0.069
 Serv+0.067
Neg logit contributions
appropriate-0.067
uba-0.066
achment-0.065
 Bliss-0.064
abouts-0.063
Token mon
Token ID937
Feature activation-8.23e-01

Pos logit contributions
Gall+0.249
 Rak+0.237
tera+0.236
horse+0.236
 Serv+0.229
Neg logit contributions
appropriate-0.238
uba-0.232
 Bliss-0.232
achment-0.229
��-0.225
Tokenstros
Token ID48288
Feature activation-3.25e+00

Pos logit contributions
Gall+0.547
 Rak+0.527
horse+0.520
tera+0.518
 Serv+0.509
Neg logit contributions
appropriate-0.566
uba-0.548
achment-0.544
 Bliss-0.543
��-0.541
Tokenities
Token ID871
Feature activation-1.04e+00

Pos logit contributions
Gall+1.162
 Rak+1.090
horse+1.054
tera+1.049
 Serv+1.018
Neg logit contributions
appropriate-3.228
uba-3.149
��-3.142
achment-3.134
 Bliss-3.131
Token year
Token ID614
Feature activation+1.85e-02

Pos logit contributions
Gall+0.633
 Rak+0.603
tera+0.599
horse+0.596
 Serv+0.582
Neg logit contributions
appropriate-0.776
uba-0.756
 Bliss-0.748
achment-0.748
��-0.737
Token after
Token ID706
Feature activation-1.26e-01

Pos logit contributions
appropriate+0.013
uba+0.013
achment+0.013
 Bliss+0.013
��+0.012
Neg logit contributions
Gall-0.012
 Rak-0.011
horse-0.011
tera-0.011
 Serv-0.011
Token year
Token ID614
Feature activation+3.72e-01

Pos logit contributions
Gall+0.067
tera+0.064
 Rak+0.064
horse+0.063
 Serv+0.061
Neg logit contributions
appropriate-0.104
 Bliss-0.100
uba-0.100
achment-0.100
��-0.098
Token and
Token ID290
Feature activation+2.23e-01

Pos logit contributions
appropriate+0.262
uba+0.254
achment+0.253
 Bliss+0.253
abouts+0.248
Neg logit contributions
Gall-0.241
 Rak-0.230
tera-0.229
horse-0.227
 Serv-0.221
Token hold
Token ID1745
Feature activation+9.29e-01

Pos logit contributions
appropriate+0.318
uba+0.306
 Bliss+0.302
achment+0.299
abouts+0.295
Neg logit contributions
Gall-0.328
horse-0.310
 Rak-0.309
tera-0.303
 Serv-0.299
Token out
Token ID503
Feature activation+3.86e-02

Pos logit contributions
appropriate+0.708
uba+0.691
 Bliss+0.687
achment+0.681
��+0.676
Neg logit contributions
Gall-0.539
horse-0.515
 Rak-0.514
tera-0.512
 Serv-0.494
Token for
Token ID329
Feature activation-8.79e-02

Pos logit contributions
appropriate+0.028
uba+0.027
 Bliss+0.027
achment+0.027
��+0.026
Neg logit contributions
Gall-0.025
 Rak-0.024
horse-0.023
tera-0.023
 Serv-0.023
Token an
Token ID281
Feature activation-5.79e-02

Pos logit contributions
Gall+0.058
horse+0.056
tera+0.056
 Rak+0.055
 Serv+0.054
Neg logit contributions
appropriate-0.060
uba-0.058
 Bliss-0.058
achment-0.058
��-0.057
Token obscene
Token ID33390
Feature activation-6.39e-01

Pos logit contributions
Gall+0.046
horse+0.045
tera+0.044
 Rak+0.044
 Serv+0.043
Neg logit contributions
appropriate-0.032
uba-0.031
achment-0.031
��-0.030
 Bliss-0.030
Token sum
Token ID2160
Feature activation-3.23e+00

Pos logit contributions
Gall+0.453
tera+0.429
horse+0.428
 Rak+0.427
 Serv+0.420
Neg logit contributions
appropriate-0.415
uba-0.401
achment-0.396
 Bliss-0.394
��-0.390
Token of
Token ID286
Feature activation+6.98e-01

Pos logit contributions
Gall+1.952
 Rak+1.882
horse+1.856
tera+1.845
 Serv+1.812
Neg logit contributions
appropriate-2.409
uba-2.339
achment-2.320
��-2.320
 Bliss-2.317
Token money
Token ID1637
Feature activation-2.38e-02

Pos logit contributions
appropriate+0.355
uba+0.337
 Bliss+0.337
achment+0.333
��+0.329
Neg logit contributions
Gall-0.586
horse-0.567
tera-0.566
 Rak-0.564
 Serv-0.552
Token,
Token ID11
Feature activation+1.78e-01

Pos logit contributions
Gall+0.016
 Rak+0.015
tera+0.015
horse+0.015
 Serv+0.015
Neg logit contributions
appropriate-0.017
uba-0.016
achment-0.016
 Bliss-0.016
��-0.016
Token as
Token ID355
Feature activation+1.97e-02

Pos logit contributions
appropriate+0.125
 Bliss+0.122
uba+0.121
achment+0.120
��+0.120
Neg logit contributions
Gall-0.114
 Rak-0.109
horse-0.109
tera-0.109
 Serv-0.106
Token they
Token ID484
Feature activation-4.31e-01

Pos logit contributions
appropriate+0.013
uba+0.013
 Bliss+0.013
achment+0.013
��+0.013
Neg logit contributions
Gall-0.013
horse-0.013
 Rak-0.013
tera-0.013
 Serv-0.012
Token Sab
Token ID9910
Feature activation-2.82e-01

Pos logit contributions
Gall+0.187
 Rak+0.180
horse+0.179
tera+0.179
 Serv+0.174
Neg logit contributions
appropriate-0.182
uba-0.177
 Bliss-0.176
achment-0.174
��-0.173
Tokenine
Token ID500
Feature activation-1.60e-01

Pos logit contributions
Gall+0.198
 Rak+0.192
horse+0.189
tera+0.189
 Serv+0.186
Neg logit contributions
appropriate-0.183
uba-0.177
achment-0.175
 Bliss-0.175
��-0.175
Token,
Token ID11
Feature activation-3.29e-01

Pos logit contributions
Gall+0.106
 Rak+0.102
horse+0.101
tera+0.101
 Serv+0.099
Neg logit contributions
appropriate-0.110
uba-0.106
 Bliss-0.106
achment-0.105
��-0.105
Token Chef
Token ID26383
Feature activation-5.52e-02

Pos logit contributions
Gall+0.227
 Rak+0.218
horse+0.216
tera+0.216
 Serv+0.211
Neg logit contributions
appropriate-0.217
uba-0.210
 Bliss-0.209
achment-0.208
��-0.207
Token G
Token ID402
Feature activation+5.75e-01

Pos logit contributions
Gall+0.038
 Rak+0.036
tera+0.036
horse+0.036
 Serv+0.035
Neg logit contributions
appropriate-0.037
uba-0.036
 Bliss-0.035
achment-0.035
��-0.035
Token.'
Token ID2637
Feature activation-3.23e+00

Pos logit contributions
appropriate+0.373
uba+0.367
 Bliss+0.359
achment+0.357
��+0.351
Neg logit contributions
Gall-0.404
 Rak-0.389
horse-0.384
tera-0.383
 Serv-0.375
Tokens
Token ID82
Feature activation+2.92e-01

Pos logit contributions
Gall+1.912
 Rak+1.827
horse+1.804
tera+1.793
 Serv+1.763
Neg logit contributions
appropriate-2.462
uba-2.390
achment-2.378
��-2.373
 Bliss-2.362
Token,
Token ID11
Feature activation+9.68e-02

Pos logit contributions
appropriate+0.191
uba+0.185
 Bliss+0.185
achment+0.182
��+0.181
Neg logit contributions
Gall-0.203
 Rak-0.195
horse-0.194
tera-0.193
 Serv-0.188
Token Ep
Token ID4551
Feature activation+3.09e-01

Pos logit contributions
appropriate+0.064
uba+0.062
 Bliss+0.061
achment+0.061
��+0.060
Neg logit contributions
Gall-0.067
 Rak-0.064
horse-0.064
tera-0.064
 Serv-0.062
Tokenoch
Token ID5374
Feature activation+1.84e+00

Pos logit contributions
appropriate+0.193
uba+0.186
achment+0.184
 Bliss+0.183
��+0.182
Neg logit contributions
Gall-0.225
 Rak-0.218
horse-0.215
tera-0.214
 Serv-0.211
Token Fusion
Token ID21278
Feature activation-5.49e-01

Pos logit contributions
appropriate+1.183
uba+1.142
 Bliss+1.141
achment+1.117
��+1.112
Neg logit contributions
Gall-1.301
 Rak-1.251
horse-1.233
tera-1.226
 Serv-1.194
Token,
Token ID11
Feature activation-1.05e-01

Pos logit contributions
Gall+0.404
 Rak+0.387
horse+0.383
tera+0.382
 Serv+0.376
Neg logit contributions
appropriate-0.396
uba-0.381
achment-0.379
 Bliss-0.378
��-0.376
Token and
Token ID290
Feature activation+6.16e-02

Pos logit contributions
Gall+0.070
 Rak+0.067
horse+0.066
tera+0.066
 Serv+0.065
Neg logit contributions
appropriate-0.072
uba-0.070
 Bliss-0.069
achment-0.069
��-0.069
Token D
Token ID360
Feature activation-3.48e-01

Pos logit contributions
appropriate+0.042
uba+0.040
 Bliss+0.040
achment+0.040
��+0.040
Neg logit contributions
Gall-0.041
 Rak-0.040
tera-0.040
horse-0.039
 Serv-0.038
Tokenendi
Token ID43109
Feature activation-6.12e-01

Pos logit contributions
Gall+0.335
 Rak+0.332
 Serv+0.324
horse+0.324
tera+0.324
Neg logit contributions
appropriate-0.135
uba-0.129
achment-0.128
 Bliss-0.121
��-0.119
Token st
Token ID336
Feature activation+4.78e-01

Pos logit contributions
Gall+0.421
 Rak+0.404
horse+0.401
tera+0.401
 Serv+0.391
Neg logit contributions
appropriate-0.405
uba-0.394
 Bliss-0.392
achment-0.390
��-0.384
Tokenrolled
Token ID8375
Feature activation-3.19e+00

Pos logit contributions
appropriate+0.387
uba+0.382
achment+0.374
 Bliss+0.370
reviewed+0.366
Neg logit contributions
Gall-0.262
 Rak-0.249
horse-0.244
tera-0.242
 Serv-0.240
Token off
Token ID572
Feature activation-6.97e-01

Pos logit contributions
Gall+1.868
 Rak+1.794
horse+1.766
tera+1.756
 Serv+1.726
Neg logit contributions
appropriate-2.441
uba-2.370
achment-2.352
 Bliss-2.346
��-2.345
Token to
Token ID284
Feature activation-4.70e-01

Pos logit contributions
Gall+0.463
 Rak+0.444
tera+0.438
horse+0.436
 Serv+0.427
Neg logit contributions
appropriate-0.482
uba-0.467
 Bliss-0.464
achment-0.463
��-0.459
Token entertain
Token ID8204
Feature activation-5.37e-01

Pos logit contributions
Gall+0.330
 Rak+0.318
horse+0.315
tera+0.315
 Serv+0.308
Neg logit contributions
appropriate-0.306
uba-0.296
 Bliss-0.294
achment-0.291
��-0.290
Token (
Token ID357
Feature activation-4.38e-01

Pos logit contributions
Gall+0.391
 Rak+0.377
horse+0.373
tera+0.372
 Serv+0.366
Neg logit contributions
appropriate-0.336
uba-0.325
 Bliss-0.321
achment-0.320
��-0.319
Tokenor
Token ID273
Feature activation+3.07e-01

Pos logit contributions
Gall+0.300
 Rak+0.292
tera+0.287
horse+0.285
 Serv+0.283
Neg logit contributions
appropriate-0.293
uba-0.281
achment-0.278
 Bliss-0.277
��-0.277
Token handed
Token ID10158
Feature activation-1.39e+00

Pos logit contributions
Gall+0.206
tera+0.200
horse+0.197
 Rak+0.196
 Serv+0.191
Neg logit contributions
appropriate-0.183
uba-0.179
 Bliss-0.177
achment-0.175
��-0.173
Token Libya
Token ID13465
Feature activation-1.91e-01

Pos logit contributions
Gall+0.854
 Rak+0.816
horse+0.808
tera+0.808
 Serv+0.787
Neg logit contributions
appropriate-1.027
uba-0.998
achment-0.989
 Bliss-0.989
��-0.985
Token over
Token ID625
Feature activation-2.31e-01

Pos logit contributions
Gall+0.124
 Rak+0.118
horse+0.117
tera+0.117
 Serv+0.114
Neg logit contributions
appropriate-0.135
uba-0.131
achment-0.130
 Bliss-0.130
��-0.128
Token to
Token ID284
Feature activation-1.09e+00

Pos logit contributions
Gall+0.144
 Rak+0.140
horse+0.138
tera+0.137
 Serv+0.135
Neg logit contributions
appropriate-0.168
uba-0.161
 Bliss-0.161
achment-0.161
��-0.160
Token a
Token ID257
Feature activation-6.29e-01

Pos logit contributions
Gall+0.724
 Rak+0.694
tera+0.690
horse+0.688
 Serv+0.670
Neg logit contributions
appropriate-0.747
uba-0.724
achment-0.717
 Bliss-0.716
��-0.712
Token patch
Token ID8529
Feature activation-3.18e+00

Pos logit contributions
Gall+0.399
tera+0.380
horse+0.379
 Rak+0.379
 Serv+0.364
Neg logit contributions
appropriate-0.451
uba-0.440
 Bliss-0.436
achment-0.435
��-0.430
Tokenwork
Token ID1818
Feature activation-5.33e-01

Pos logit contributions
Gall+1.297
 Rak+1.226
horse+1.205
tera+1.187
 Serv+1.161
Neg logit contributions
appropriate-3.000
uba-2.934
achment-2.916
��-2.915
 Bliss-2.913
Token of
Token ID286
Feature activation+3.86e-02

Pos logit contributions
Gall+0.316
 Rak+0.302
tera+0.298
horse+0.297
 Serv+0.289
Neg logit contributions
appropriate-0.406
uba-0.395
achment-0.392
 Bliss-0.392
��-0.390
Token tribal
Token ID17054
Feature activation-4.42e-01

Pos logit contributions
appropriate+0.027
uba+0.027
 Bliss+0.026
achment+0.026
��+0.026
Neg logit contributions
Gall-0.025
tera-0.024
 Rak-0.024
horse-0.024
 Serv-0.023
Token and
Token ID290
Feature activation-5.77e-01

Pos logit contributions
Gall+0.313
tera+0.299
 Rak+0.298
horse+0.295
 Serv+0.288
Neg logit contributions
appropriate-0.286
uba-0.277
 Bliss-0.275
achment-0.274
��-0.271
Token Al
Token ID978
Feature activation-6.58e-01

Pos logit contributions
Gall+0.361
tera+0.346
 Rak+0.343
horse+0.340
 Serv+0.329
Neg logit contributions
appropriate-0.419
uba-0.408
achment-0.404
 Bliss-0.404
��-0.401
Token {
Token ID1391
Feature activation+2.92e-01

Pos logit contributions
Gall+0.404
 Rak+0.380
tera+0.378
horse+0.377
 Serv+0.365
Neg logit contributions
appropriate-0.668
uba-0.652
 Bliss-0.647
achment-0.646
��-0.645
Token(
Token ID7
Feature activation-7.81e-01

Pos logit contributions
appropriate+0.253
reviewed+0.243
uba+0.242
achment+0.241
 Bliss+0.240
Neg logit contributions
Gall-0.139
 Rak-0.138
tera-0.136
horse-0.130
 Serv-0.130
Token270
Token ID20233
Feature activation+3.62e-01

Pos logit contributions
Gall+0.577
 Rak+0.558
tera+0.551
horse+0.551
 Serv+0.542
Neg logit contributions
appropriate-0.478
uba-0.461
achment-0.456
 Bliss-0.456
��-0.450
Tokens
Token ID82
Feature activation+1.64e-01

Pos logit contributions
appropriate+0.279
uba+0.269
 Bliss+0.265
achment+0.264
abouts+0.262
Neg logit contributions
Gall-0.214
 Rak-0.200
tera-0.200
horse-0.199
 Serv-0.194
Token)}
Token ID38165
Feature activation-5.03e-01

Pos logit contributions
appropriate+0.131
uba+0.127
achment+0.126
 Bliss+0.125
reviewed+0.125
Neg logit contributions
Gall-0.090
 Rak-0.087
tera-0.085
horse-0.084
 Serv-0.083
Token Rac
Token ID24746
Feature activation-3.18e+00

Pos logit contributions
Gall+0.325
tera+0.311
horse+0.310
 Rak+0.302
 Serv+0.296
Neg logit contributions
appropriate-0.352
 Bliss-0.347
uba-0.344
achment-0.338
 Klaus-0.336
Token8
Token ID23
Feature activation-8.83e-01

Pos logit contributions
Gall+2.104
 Rak+2.027
horse+1.998
tera+1.991
 Serv+1.958
Neg logit contributions
appropriate-2.195
uba-2.126
achment-2.107
 Bliss-2.102
��-2.095
Token {
Token ID1391
Feature activation+3.80e-01

Pos logit contributions
Gall+0.464
tera+0.436
 Rak+0.436
horse+0.432
 Serv+0.414
Neg logit contributions
appropriate-0.730
uba-0.709
 Bliss-0.708
achment-0.701
abouts-0.692
Token(
Token ID7
Feature activation-4.54e-01

Pos logit contributions
appropriate+0.327
reviewed+0.315
uba+0.313
achment+0.312
 Bliss+0.311
Neg logit contributions
Gall-0.181
 Rak-0.180
tera-0.178
horse-0.171
 Serv-0.170
Token193
Token ID24943
Feature activation+9.03e-01

Pos logit contributions
Gall+0.339
 Rak+0.329
tera+0.325
horse+0.325
 Serv+0.319
Neg logit contributions
appropriate-0.274
uba-0.264
achment-0.261
 Bliss-0.261
��-0.258
Tokens
Token ID82
Feature activation-7.75e-02

Pos logit contributions
appropriate+0.697
uba+0.673
 Bliss+0.666
achment+0.659
abouts+0.655
Neg logit contributions
Gall-0.532
tera-0.498
 Rak-0.494
horse-0.492
 Serv-0.481
Tokeninv
Token ID16340
Feature activation-7.41e-02

Pos logit contributions
appropriate+0.345
uba+0.328
achment+0.321
��+0.319
 Bliss+0.318
Neg logit contributions
Gall-0.375
 Rak-0.364
tera-0.355
 Serv-0.352
horse-0.352
Tokenade
Token ID671
Feature activation+9.96e-01

Pos logit contributions
Gall+0.052
 Rak+0.050
horse+0.049
tera+0.048
 Serv+0.048
Neg logit contributions
appropriate-0.050
uba-0.048
achment-0.047
 Bliss-0.046
��-0.046
Token our
Token ID674
Feature activation-4.57e-01

Pos logit contributions
appropriate+0.721
uba+0.695
 Bliss+0.688
achment+0.678
��+0.674
Neg logit contributions
Gall-0.636
tera-0.603
 Rak-0.602
horse-0.599
 Serv-0.578
Token communities
Token ID5348
Feature activation-6.59e-01

Pos logit contributions
Gall+0.302
 Rak+0.288
tera+0.287
horse+0.287
 Serv+0.278
Neg logit contributions
appropriate-0.318
uba-0.308
 Bliss-0.306
achment-0.302
��-0.299
Token pre
Token ID662
Feature activation+9.60e-02

Pos logit contributions
Gall+0.435
 Rak+0.416
tera+0.415
horse+0.406
 Serv+0.399
Neg logit contributions
appropriate-0.463
uba-0.448
 Bliss-0.442
achment-0.440
abouts-0.436
Tokenying
Token ID1112
Feature activation-3.16e+00

Pos logit contributions
appropriate+0.072
uba+0.070
achment+0.070
 Bliss+0.070
��+0.069
Neg logit contributions
Gall-0.057
 Rak-0.055
horse-0.054
tera-0.054
 Serv-0.053
Token on
Token ID319
Feature activation-3.30e-01

Pos logit contributions
Gall+1.677
 Rak+1.617
horse+1.597
tera+1.569
 Serv+1.552
Neg logit contributions
appropriate-2.576
uba-2.505
��-2.495
 Bliss-2.485
achment-2.480
Token unsuspecting
Token ID41975
Feature activation-8.30e-01

Pos logit contributions
Gall+0.219
 Rak+0.211
horse+0.209
tera+0.209
 Serv+0.203
Neg logit contributions
appropriate-0.227
uba-0.220
 Bliss-0.218
achment-0.217
��-0.216
Token homeowners
Token ID24692
Feature activation-5.89e-02

Pos logit contributions
Gall+0.591
 Rak+0.567
horse+0.565
tera+0.565
 Serv+0.548
Neg logit contributions
appropriate-0.530
uba-0.515
 Bliss-0.510
achment-0.506
��-0.502
Token,
Token ID11
Feature activation-5.63e-01

Pos logit contributions
Gall+0.040
 Rak+0.039
tera+0.038
horse+0.038
 Serv+0.037
Neg logit contributions
appropriate-0.040
uba-0.039
achment-0.038
 Bliss-0.038
��-0.037
Token often
Token ID1690
Feature activation-3.31e-01

Pos logit contributions
Gall+0.387
tera+0.370
 Rak+0.370
horse+0.366
 Serv+0.358
Neg logit contributions
appropriate-0.377
uba-0.363
 Bliss-0.360
achment-0.359
��-0.356
Tokenargs
Token ID22046
Feature activation-2.65e-01

Pos logit contributions
Gall+0.658
 Rak+0.630
tera+0.615
horse+0.614
 Serv+0.602
Neg logit contributions
appropriate-1.317
uba-1.279
achment-1.276
 Bliss-1.263
��-1.259
Token["
Token ID14692
Feature activation+3.81e-01

Pos logit contributions
Gall+0.162
 Rak+0.155
horse+0.153
tera+0.152
 Serv+0.149
Neg logit contributions
appropriate-0.197
uba-0.191
achment-0.190
 Bliss-0.189
��-0.188
Tokenimages
Token ID17566
Feature activation-1.21e+00

Pos logit contributions
appropriate+0.085
uba+0.074
achment+0.072
reviewed+0.067
abouts+0.065
Neg logit contributions
Gall-0.433
 Rak-0.428
tera-0.421
 Serv-0.420
horse-0.420
Token"]
Token ID8973
Feature activation-4.99e-01

Pos logit contributions
Gall+1.028
 Rak+0.997
horse+0.987
tera+0.984
 Serv+0.971
Neg logit contributions
appropriate-0.608
uba-0.582
��-0.576
 Bliss-0.575
achment-0.575
Token):
Token ID2599
Feature activation-1.80e+00

Pos logit contributions
Gall+0.332
 Rak+0.319
tera+0.315
horse+0.312
 Serv+0.307
Neg logit contributions
appropriate-0.344
uba-0.333
achment-0.331
 Bliss-0.326
reviewed-0.324
Token #
Token ID1303
Feature activation-3.15e+00

Pos logit contributions
Gall+1.408
 Rak+1.359
horse+1.347
tera+1.346
 Serv+1.315
Neg logit contributions
appropriate-1.033
uba-0.995
 Bliss-0.981
achment-0.981
��-0.972
Token load
Token ID3440
Feature activation-8.64e-02

Pos logit contributions
Gall+2.230
 Rak+2.156
horse+2.128
tera+2.120
 Serv+2.089
Neg logit contributions
appropriate-2.024
uba-1.956
achment-1.937
 Bliss-1.933
��-1.930
Token the
Token ID262
Feature activation-2.67e-01

Pos logit contributions
Gall+0.064
tera+0.062
 Rak+0.062
horse+0.061
 Serv+0.060
Neg logit contributions
appropriate-0.053
 Bliss-0.050
uba-0.050
achment-0.050
 MFT-0.048
Token image
Token ID2939
Feature activation-1.21e+00

Pos logit contributions
Gall+0.219
tera+0.210
 Rak+0.210
horse+0.209
 Serv+0.205
Neg logit contributions
appropriate-0.144
achment-0.137
uba-0.136
 Bliss-0.135
 MFT-0.131
Token,
Token ID11
Feature activation-1.28e+00

Pos logit contributions
Gall+0.761
 Rak+0.723
tera+0.721
horse+0.713
 Serv+0.698
Neg logit contributions
appropriate-0.877
 Bliss-0.846
uba-0.845
achment-0.844
reviewed-0.822
Token resize
Token ID47558
Feature activation-1.02e+00

Pos logit contributions
Gall+1.055
 Rak+1.024
horse+1.013
tera+1.010
 Serv+0.996
Neg logit contributions
appropriate-0.679
uba-0.651
achment-0.643
 Bliss-0.642
��-0.639
Tokenutils
Token ID26791
Feature activation+6.36e-01

Pos logit contributions
Gall+1.078
 Rak+1.048
horse+1.029
tera+1.026
 Serv+1.022
Neg logit contributions
appropriate-0.544
uba-0.515
achment-0.505
 Bliss-0.496
��-0.492
Token import
Token ID1330
Feature activation-1.35e+00

Pos logit contributions
appropriate+0.379
uba+0.364
 Bliss+0.355
achment+0.355
 MFT+0.347
Neg logit contributions
Gall-0.490
horse-0.464
 Rak-0.461
tera-0.458
 Serv-0.443
Token c
Token ID269
Feature activation-2.63e-01

Pos logit contributions
Gall+0.814
 Rak+0.795
horse+0.769
tera+0.766
 Serv+0.758
Neg logit contributions
appropriate-1.015
uba-0.984
��-0.976
achment-0.976
 Bliss-0.976
Tokenv
Token ID85
Feature activation-8.07e-02

Pos logit contributions
Gall+0.122
 Rak+0.115
 Serv+0.111
horse+0.110
tera+0.109
Neg logit contributions
appropriate-0.238
uba-0.233
achment-0.228
reviewed-0.224
abouts-0.224
Token2
Token ID17
Feature activation-7.35e-01

Pos logit contributions
Gall+0.057
 Rak+0.056
horse+0.055
tera+0.054
 Serv+0.054
Neg logit contributions
appropriate-0.052
uba-0.050
achment-0.049
 Bliss-0.049
reviewed-0.048
Token #
Token ID1303
Feature activation-3.14e+00

Pos logit contributions
Gall+0.520
 Rak+0.513
horse+0.502
tera+0.500
 Serv+0.499
Neg logit contributions
appropriate-0.469
uba-0.452
 Bliss-0.451
achment-0.451
��-0.441
Token construct
Token ID5678
Feature activation+1.03e+00

Pos logit contributions
Gall+1.962
 Rak+1.895
tera+1.862
horse+1.857
 Serv+1.823
Neg logit contributions
appropriate-2.270
uba-2.226
achment-2.198
 Bliss-2.189
��-2.182
Token the
Token ID262
Feature activation-7.31e-01

Pos logit contributions
appropriate+0.601
achment+0.572
 Bliss+0.569
uba+0.568
 MFT+0.558
Neg logit contributions
Gall-0.785
tera-0.760
horse-0.755
 Rak-0.750
 Serv-0.731
Token argument
Token ID4578
Feature activation-6.41e-01

Pos logit contributions
Gall+0.551
horse+0.531
 Rak+0.531
tera+0.531
 Serv+0.519
Neg logit contributions
appropriate-0.437
achment-0.417
uba-0.414
 Bliss-0.410
 MFT-0.404
Token parse
Token ID21136
Feature activation-5.36e-01

Pos logit contributions
Gall+0.357
 Rak+0.333
horse+0.332
tera+0.327
 Serv+0.320
Neg logit contributions
appropriate-0.518
uba-0.500
achment-0.499
 Bliss-0.496
��-0.494
Token and
Token ID290
Feature activation-1.43e+00

Pos logit contributions
Gall+0.375
 Rak+0.357
horse+0.356
tera+0.356
 Serv+0.345
Neg logit contributions
appropriate-0.351
achment-0.337
 Bliss-0.337
uba-0.336
 Cohn-0.326
Token in
Token ID287
Feature activation-2.12e-01

Pos logit contributions
Gall+0.231
 Rak+0.222
tera+0.219
horse+0.218
 Serv+0.213
Neg logit contributions
appropriate-0.240
uba-0.231
achment-0.229
 Bliss-0.229
��-0.226
Token a
Token ID257
Feature activation-4.66e-01

Pos logit contributions
Gall+0.144
tera+0.137
 Rak+0.137
horse+0.136
 Serv+0.132
Neg logit contributions
appropriate-0.147
 Bliss-0.138
achment-0.137
uba-0.137
reviewed-0.134
Token more
Token ID517
Feature activation-6.93e-01

Pos logit contributions
Gall+0.343
 Rak+0.329
tera+0.329
horse+0.327
 Serv+0.318
Neg logit contributions
appropriate-0.290
 Bliss-0.276
achment-0.275
uba-0.274
��-0.271
Token inclusive
Token ID19889
Feature activation-6.95e-01

Pos logit contributions
Gall+0.514
tera+0.494
 Rak+0.494
horse+0.491
 Serv+0.477
Neg logit contributions
appropriate-0.431
 Bliss-0.406
uba-0.405
achment-0.405
��-0.398
Token character
Token ID2095
Feature activation-1.26e+00

Pos logit contributions
Gall+0.490
 Rak+0.469
tera+0.468
horse+0.466
 Serv+0.452
Neg logit contributions
appropriate-0.456
uba-0.433
achment-0.433
 Bliss-0.433
��-0.425
Tokenisation
Token ID5612
Feature activation-3.14e+00

Pos logit contributions
Gall+0.737
 Rak+0.706
horse+0.695
tera+0.693
 Serv+0.678
Neg logit contributions
appropriate-0.978
uba-0.947
achment-0.941
 Bliss-0.937
��-0.934
Token of
Token ID286
Feature activation+1.15e-01

Pos logit contributions
Gall+1.796
 Rak+1.729
horse+1.710
tera+1.677
 Serv+1.672
Neg logit contributions
appropriate-2.414
uba-2.368
��-2.357
achment-2.344
 Bliss-2.338
Token reason
Token ID1738
Feature activation-5.87e-01

Pos logit contributions
appropriate+0.074
 Bliss+0.070
uba+0.070
achment+0.070
��+0.069
Neg logit contributions
Gall-0.083
 Rak-0.080
tera-0.079
horse-0.079
 Serv-0.076
Token.
Token ID13
Feature activation-1.03e-01

Pos logit contributions
Gall+0.392
 Rak+0.379
tera+0.373
horse+0.372
 Serv+0.365
Neg logit contributions
appropriate-0.404
uba-0.390
achment-0.387
 Bliss-0.386
��-0.382
Token He
Token ID679
Feature activation+5.45e-01

Pos logit contributions
Gall+0.074
tera+0.071
 Rak+0.071
horse+0.070
 Serv+0.068
Neg logit contributions
appropriate-0.065
 Bliss-0.064
uba-0.062
achment-0.062
 Cohn-0.061
Tokenikes
Token ID7938
Feature activation-5.18e-03

Pos logit contributions
appropriate+0.367
uba+0.355
 Bliss+0.350
achment+0.350
��+0.344
Neg logit contributions
Gall-0.375
 Rak-0.359
horse-0.352
tera-0.351
 Serv-0.346
Token Piece
Token ID27053
Feature activation+9.92e-01

Pos logit contributions
appropriate+1.885
achment+1.823
 Bliss+1.818
uba+1.813
��+1.796
Neg logit contributions
Gall-1.802
 Rak-1.720
horse-1.719
tera-1.697
 Serv-1.636
Token >
Token ID1875
Feature activation+8.23e-01

Pos logit contributions
appropriate+0.684
uba+0.660
achment+0.658
��+0.653
 Bliss+0.653
Neg logit contributions
Gall-0.655
 Rak-0.633
tera-0.624
horse-0.622
 Serv-0.606
Token ;
Token ID2162
Feature activation-3.12e-01

Pos logit contributions
appropriate+0.519
uba+0.501
��+0.497
achment+0.496
 Bliss+0.495
Neg logit contributions
Gall-0.594
 Rak-0.572
tera-0.571
horse-0.566
 Serv-0.551
Token 64
Token ID5598
Feature activation-7.44e-01

Pos logit contributions
Gall+0.207
 Rak+0.199
horse+0.196
tera+0.196
 Serv+0.191
Neg logit contributions
appropriate-0.215
uba-0.208
 Bliss-0.207
achment-0.207
��-0.205
Token ],
Token ID16589
Feature activation-1.87e+00

Pos logit contributions
Gall+0.501
 Rak+0.483
horse+0.477
tera+0.475
 Serv+0.467
Neg logit contributions
appropriate-0.505
uba-0.489
achment-0.485
 Bliss-0.484
��-0.483
Token //
Token ID3373
Feature activation-3.12e+00

Pos logit contributions
Gall+1.168
 Rak+1.127
horse+1.109
tera+1.103
 Serv+1.088
Neg logit contributions
appropriate-1.360
uba-1.320
achment-1.308
��-1.305
 Bliss-1.304
Token Where
Token ID6350
Feature activation-3.77e-02

Pos logit contributions
Gall+2.279
 Rak+2.220
horse+2.182
tera+2.174
 Serv+2.158
Neg logit contributions
appropriate-1.923
uba-1.866
achment-1.845
��-1.842
 Bliss-1.828
Token are
Token ID389
Feature activation-4.15e-01

Pos logit contributions
Gall+0.025
 Rak+0.024
horse+0.024
tera+0.024
 Serv+0.023
Neg logit contributions
appropriate-0.026
uba-0.025
 Bliss-0.025
achment-0.025
abouts-0.024
Token the
Token ID262
Feature activation-4.81e-01

Pos logit contributions
Gall+0.292
 Rak+0.281
horse+0.278
tera+0.277
 Serv+0.272
Neg logit contributions
appropriate-0.270
uba-0.261
achment-0.258
 Bliss-0.258
��-0.257
Token pieces
Token ID5207
Feature activation-3.40e-01

Pos logit contributions
Gall+0.403
 Rak+0.390
horse+0.387
tera+0.386
 Serv+0.379
Neg logit contributions
appropriate-0.249
uba-0.237
 Bliss-0.235
achment-0.235
��-0.233
Token?
Token ID30
Feature activation-4.40e-01

Pos logit contributions
Gall+0.208
 Rak+0.199
horse+0.196
tera+0.195
 Serv+0.190
Neg logit contributions
appropriate-0.253
uba-0.245
 Bliss-0.243
achment-0.243
��-0.240
Token.
Token ID13
Feature activation+1.10e-01

Pos logit contributions
Gall+0.353
 Rak+0.339
horse+0.334
tera+0.334
 Serv+0.327
Neg logit contributions
appropriate-0.391
uba-0.381
 Bliss-0.378
achment-0.377
��-0.375
Token\n
Token ID198
Feature activation-5.14e-02

Pos logit contributions
appropriate+0.067
 Bliss+0.067
uba+0.066
achment+0.065
 Cohn+0.064
Neg logit contributions
Gall-0.080
tera-0.078
horse-0.077
 Rak-0.077
 Serv-0.074
Token\n
Token ID198
Feature activation+2.03e-01

Pos logit contributions
Gall+0.038
tera+0.037
 Rak+0.037
horse+0.037
 Serv+0.036
Neg logit contributions
appropriate-0.031
achment-0.029
 Bliss-0.029
uba-0.029
reviewed-0.029
TokenIsrael
Token ID14040
Feature activation+1.98e-01

Pos logit contributions
appropriate+0.137
uba+0.131
achment+0.130
reviewed+0.129
 Bliss+0.129
Neg logit contributions
 Rak-0.135
Gall-0.134
tera-0.131
horse-0.131
 Serv-0.130
Token se
Token ID384
Feature activation-4.34e-01

Pos logit contributions
appropriate+0.148
uba+0.145
 Bliss+0.143
achment+0.143
��+0.141
Neg logit contributions
Gall-0.119
 Rak-0.114
horse-0.113
tera-0.112
 Serv-0.109
Tokenizes
Token ID4340
Feature activation-3.10e+00

Pos logit contributions
Gall+0.245
 Rak+0.234
horse+0.230
tera+0.229
 Serv+0.224
Neg logit contributions
appropriate-0.342
uba-0.334
achment-0.332
 Bliss-0.329
��-0.329
Token every
Token ID790
Feature activation-3.56e-02

Pos logit contributions
Gall+1.968
 Rak+1.905
horse+1.868
tera+1.859
 Serv+1.839
Neg logit contributions
appropriate-2.214
uba-2.144
achment-2.139
��-2.124
 Bliss-2.113
Token opportunity
Token ID3663
Feature activation-9.22e-01

Pos logit contributions
Gall+0.026
tera+0.025
 Rak+0.024
horse+0.024
 Serv+0.024
Neg logit contributions
appropriate-0.023
 Bliss-0.022
uba-0.022
achment-0.021
��-0.021
Token to
Token ID284
Feature activation-1.10e+00

Pos logit contributions
Gall+0.603
 Rak+0.580
horse+0.573
tera+0.571
 Serv+0.559
Neg logit contributions
appropriate-0.645
uba-0.626
 Bliss-0.619
achment-0.619
��-0.617
Token reiterate
Token ID49852
Feature activation-1.15e+00

Pos logit contributions
Gall+0.792
 Rak+0.760
horse+0.755
tera+0.749
 Serv+0.737
Neg logit contributions
appropriate-0.707
uba-0.682
 Bliss-0.675
achment-0.670
��-0.665
Token its
Token ID663
Feature activation-3.93e-01

Pos logit contributions
Gall+0.783
 Rak+0.749
horse+0.744
tera+0.742
 Serv+0.725
Neg logit contributions
appropriate-0.776
uba-0.751
 Bliss-0.747
achment-0.741
��-0.735
Token Times
Token ID3782
Feature activation-5.27e-01

Pos logit contributions
Gall+0.368
 Rak+0.352
horse+0.349
tera+0.349
 Serv+0.339
Neg logit contributions
appropriate-0.420
uba-0.408
 Bliss-0.404
achment-0.403
��-0.400
Token of
Token ID286
Feature activation-3.68e-01

Pos logit contributions
Gall+0.315
 Rak+0.302
horse+0.298
tera+0.297
 Serv+0.290
Neg logit contributions
appropriate-0.398
uba-0.388
achment-0.383
 Bliss-0.383
��-0.381
Token Israel
Token ID2692
Feature activation+1.86e-01

Pos logit contributions
Gall+0.248
horse+0.239
tera+0.238
 Rak+0.237
 Serv+0.228
Neg logit contributions
appropriate-0.246
uba-0.244
 Bliss-0.240
achment-0.237
 Cohn-0.232
Token<|endoftext|>
Token ID50256
Feature activation-1.66e-01

Pos logit contributions
appropriate+0.135
uba+0.132
achment+0.130
 Bliss+0.130
��+0.129
Neg logit contributions
Gall-0.117
 Rak-0.112
horse-0.111
tera-0.110
 Serv-0.107
TokenSt
Token ID1273
Feature activation+1.05e+00

Pos logit contributions
Gall+0.109
 Rak+0.107
tera+0.105
horse+0.104
 Serv+0.103
Neg logit contributions
appropriate-0.114
uba-0.110
achment-0.109
 Bliss-0.108
��-0.108
Tokenress
Token ID601
Feature activation-3.08e+00

Pos logit contributions
appropriate+0.730
uba+0.715
achment+0.706
 Bliss+0.704
��+0.681
Neg logit contributions
Gall-0.689
 Rak-0.663
tera-0.656
horse-0.655
 Serv-0.639
Token.
Token ID13
Feature activation+1.36e-01

Pos logit contributions
Gall+2.085
 Rak+2.005
horse+1.976
tera+1.967
 Serv+1.960
Neg logit contributions
appropriate-2.084
��-2.012
uba-2.000
achment-1.979
 Bliss-1.977
Token We
Token ID775
Feature activation+4.83e-01

Pos logit contributions
appropriate+0.083
 Bliss+0.081
uba+0.081
achment+0.080
��+0.079
Neg logit contributions
Gall-0.099
 Rak-0.096
tera-0.095
horse-0.095
 Serv-0.093
Token all
Token ID477
Feature activation-2.09e-01

Pos logit contributions
appropriate+0.304
uba+0.297
 Bliss+0.292
achment+0.289
��+0.283
Neg logit contributions
Gall-0.350
 Rak-0.338
horse-0.334
tera-0.333
 Serv-0.326
Token know
Token ID760
Feature activation+3.79e-01

Pos logit contributions
Gall+0.158
 Rak+0.151
horse+0.149
tera+0.149
 Serv+0.145
Neg logit contributions
appropriate-0.126
uba-0.123
 Bliss-0.121
achment-0.120
 Cohn-0.118
Token it
Token ID340
Feature activation-2.99e-01

Pos logit contributions
appropriate+0.256
uba+0.248
 Bliss+0.247
achment+0.246
��+0.243
Neg logit contributions
Gall-0.256
 Rak-0.246
tera-0.244
horse-0.243
 Serv-0.238
Token court
Token ID2184
Feature activation-4.55e-01

Pos logit contributions
Gall+0.031
 Rak+0.030
horse+0.030
tera+0.030
 Serv+0.029
Neg logit contributions
appropriate-0.028
uba-0.027
achment-0.027
 Bliss-0.027
��-0.027
Token documents
Token ID4963
Feature activation-1.37e+00

Pos logit contributions
Gall+0.288
 Rak+0.274
tera+0.273
horse+0.271
 Serv+0.265
Neg logit contributions
appropriate-0.330
uba-0.319
 Bliss-0.317
achment-0.317
��-0.312
Token as
Token ID355
Feature activation-1.62e-01

Pos logit contributions
Gall+0.882
 Rak+0.845
tera+0.837
horse+0.835
 Serv+0.816
Neg logit contributions
appropriate-0.980
uba-0.949
 Bliss-0.944
achment-0.941
��-0.928
Token "
Token ID366
Feature activation-4.82e-01

Pos logit contributions
Gall+0.118
 Rak+0.113
horse+0.112
tera+0.112
 Serv+0.109
Neg logit contributions
appropriate-0.102
uba-0.098
 Bliss-0.097
achment-0.097
��-0.096
TokenJane
Token ID41083
Feature activation+1.41e+00

Pos logit contributions
Gall+0.317
 Rak+0.307
tera+0.302
horse+0.301
 Serv+0.297
Neg logit contributions
appropriate-0.336
uba-0.323
achment-0.319
 Bliss-0.319
��-0.317
Token Roe
Token ID34904
Feature activation-3.08e+00

Pos logit contributions
appropriate+0.801
 Bliss+0.771
uba+0.769
achment+0.746
 Cohn+0.746
Neg logit contributions
Gall-1.109
tera-1.066
 Rak-1.064
horse-1.053
 Serv-1.027
Token"
Token ID1
Feature activation+8.35e-01

Pos logit contributions
Gall+1.982
 Rak+1.917
horse+1.887
tera+1.874
 Serv+1.855
Neg logit contributions
appropriate-2.178
uba-2.111
achment-2.092
��-2.089
 Bliss-2.074
Token through
Token ID832
Feature activation+1.09e+00

Pos logit contributions
appropriate+0.577
 Bliss+0.562
uba+0.555
achment+0.549
 Cohn+0.547
Neg logit contributions
Gall-0.552
tera-0.529
 Rak-0.522
horse-0.521
 Serv-0.505
Token a
Token ID257
Feature activation-2.21e-01

Pos logit contributions
appropriate+0.750
 Bliss+0.721
uba+0.716
achment+0.709
 Cohn+0.703
Neg logit contributions
Gall-0.728
tera-0.702
horse-0.696
 Rak-0.694
 Serv-0.673
Token dating
Token ID10691
Feature activation+8.49e-02

Pos logit contributions
Gall+0.166
tera+0.160
horse+0.160
 Rak+0.159
 Serv+0.154
Neg logit contributions
appropriate-0.134
 Bliss-0.128
uba-0.127
achment-0.126
��-0.124
Token app
Token ID598
Feature activation+1.88e+00

Pos logit contributions
appropriate+0.056
uba+0.054
 Bliss+0.054
achment+0.054
��+0.053
Neg logit contributions
Gall-0.059
 Rak-0.056
tera-0.056
horse-0.056
 Serv-0.054
Token in
Token ID287
Feature activation+6.08e-01

Pos logit contributions
Gall+1.440
 Rak+1.382
horse+1.364
tera+1.360
 Serv+1.331
Neg logit contributions
appropriate-1.667
uba-1.621
achment-1.607
 Bliss-1.605
��-1.599
Token 54
Token ID7175
Feature activation+3.44e-01

Pos logit contributions
appropriate+0.406
uba+0.394
 Bliss+0.393
achment+0.389
��+0.382
Neg logit contributions
Gall-0.411
tera-0.394
horse-0.391
 Rak-0.389
 Serv-0.375
Token 1
Token ID352
Feature activation+8.07e-01

Pos logit contributions
appropriate+0.240
uba+0.233
 Bliss+0.231
achment+0.230
��+0.229
Neg logit contributions
Gall-0.223
 Rak-0.214
tera-0.213
horse-0.213
 Serv-0.205
Token/
Token ID14
Feature activation+3.62e-01

Pos logit contributions
appropriate+0.638
uba+0.624
achment+0.621
 Bliss+0.618
��+0.617
Neg logit contributions
Gall-0.442
tera-0.421
 Rak-0.421
horse-0.417
 Serv-0.406
Token3
Token ID18
Feature activation-2.00e-01

Pos logit contributions
appropriate+0.263
uba+0.254
 Bliss+0.252
achment+0.251
��+0.248
Neg logit contributions
Gall-0.219
 Rak-0.215
tera-0.213
horse-0.209
 Serv-0.206
Token IP
Token ID6101
Feature activation-3.04e+00

Pos logit contributions
Gall+0.102
 Rak+0.097
tera+0.095
horse+0.095
 Serv+0.092
Neg logit contributions
appropriate-0.168
uba-0.164
 Bliss-0.163
achment-0.163
��-0.162
Token in
Token ID287
Feature activation+6.54e-01

Pos logit contributions
Gall+1.943
 Rak+1.873
horse+1.842
tera+1.834
 Serv+1.807
Neg logit contributions
appropriate-2.182
uba-2.110
achment-2.091
 Bliss-2.087
��-2.086
Token 2014
Token ID1946
Feature activation-3.72e-01

Pos logit contributions
appropriate+0.452
uba+0.438
 Bliss+0.437
achment+0.431
��+0.424
Neg logit contributions
Gall-0.429
tera-0.412
horse-0.408
 Rak-0.405
 Serv-0.391
Token\n
Token ID198
Feature activation-1.97e-01

Pos logit contributions
Gall+0.241
 Rak+0.232
horse+0.229
tera+0.228
 Serv+0.224
Neg logit contributions
appropriate-0.262
uba-0.255
achment-0.252
 Bliss-0.252
��-0.251
Token\n
Token ID198
Feature activation-1.40e-01

Pos logit contributions
Gall+0.145
 Rak+0.140
tera+0.140
horse+0.138
 Serv+0.136
Neg logit contributions
appropriate-0.120
uba-0.114
achment-0.114
 Bliss-0.114
reviewed-0.112
TokenD
Token ID35
Feature activation-4.62e-01

Pos logit contributions
Gall+0.094
 Rak+0.092
horse+0.090
tera+0.090
 Serv+0.089
Neg logit contributions
appropriate-0.095
uba-0.092
achment-0.090
 Bliss-0.090
��-0.090
Token NHL
Token ID9481
Feature activation-6.48e-01

Pos logit contributions
appropriate+0.007
uba+0.007
 Bliss+0.007
achment+0.007
��+0.007
Neg logit contributions
Gall-0.007
 Rak-0.007
horse-0.006
tera-0.006
 Serv-0.006
Token sources
Token ID4237
Feature activation-2.60e-02

Pos logit contributions
Gall+0.451
 Rak+0.434
horse+0.429
tera+0.427
 Serv+0.419
Neg logit contributions
appropriate-0.426
uba-0.412
 Bliss-0.409
achment-0.407
��-0.405
Token –
Token ID784
Feature activation-3.37e-01

Pos logit contributions
Gall+0.016
 Rak+0.015
horse+0.015
tera+0.015
 Serv+0.014
Neg logit contributions
appropriate-0.019
uba-0.019
 Bliss-0.019
achment-0.019
abouts-0.019
Token one
Token ID530
Feature activation+3.34e-02

Pos logit contributions
Gall+0.210
 Rak+0.201
tera+0.199
horse+0.199
 Serv+0.194
Neg logit contributions
appropriate-0.246
uba-0.238
 Bliss-0.236
achment-0.234
��-0.234
Token very
Token ID845
Feature activation-9.08e-02

Pos logit contributions
appropriate+0.025
 Bliss+0.024
uba+0.024
achment+0.024
 Cohn+0.023
Neg logit contributions
Gall-0.020
tera-0.019
 Rak-0.019
horse-0.019
 Serv-0.019
Token close
Token ID1969
Feature activation+2.56e+00

Pos logit contributions
Gall+0.064
tera+0.061
 Rak+0.061
horse+0.061
 Serv+0.059
Neg logit contributions
appropriate-0.059
 Bliss-0.057
uba-0.057
achment-0.056
 Cohn-0.056
Token to
Token ID284
Feature activation+3.08e-01

Pos logit contributions
appropriate+1.763
uba+1.706
achment+1.691
 Bliss+1.681
��+1.676
Neg logit contributions
Gall-1.699
 Rak-1.634
horse-1.613
tera-1.608
 Serv-1.581
Token the
Token ID262
Feature activation+4.16e-01

Pos logit contributions
appropriate+0.213
uba+0.206
 Bliss+0.205
achment+0.204
��+0.203
Neg logit contributions
Gall-0.203
 Rak-0.195
horse-0.193
tera-0.193
 Serv-0.189
Token situation
Token ID3074
Feature activation+6.20e-01

Pos logit contributions
appropriate+0.262
 Bliss+0.254
uba+0.252
achment+0.250
 Cohn+0.246
Neg logit contributions
Gall-0.299
 Rak-0.285
horse-0.284
tera-0.284
 Serv-0.277
Token –
Token ID784
Feature activation+5.02e-02

Pos logit contributions
appropriate+0.484
uba+0.468
 Bliss+0.467
achment+0.466
abouts+0.460
Neg logit contributions
Gall-0.357
 Rak-0.339
tera-0.335
horse-0.332
 Serv-0.327
Token have
Token ID423
Feature activation-4.62e-01

Pos logit contributions
appropriate+0.037
uba+0.036
 Bliss+0.035
achment+0.035
abouts+0.035
Neg logit contributions
Gall-0.031
 Rak-0.030
tera-0.029
horse-0.029
 Serv-0.029
Token 2010
Token ID3050
Feature activation+1.10e+00

Pos logit contributions
appropriate+0.733
 Bliss+0.729
uba+0.710
achment+0.698
 Klaus+0.697
Neg logit contributions
Gall-0.709
tera-0.683
 Rak-0.668
horse-0.663
ascript-0.637
Token United
Token ID1578
Feature activation+1.42e+00

Pos logit contributions
appropriate+0.752
 Bliss+0.732
uba+0.730
achment+0.722
��+0.711
Neg logit contributions
Gall-0.729
tera-0.698
 Rak-0.698
horse-0.695
 Serv-0.670
Token States
Token ID1829
Feature activation+8.04e-01

Pos logit contributions
appropriate+0.899
uba+0.862
achment+0.854
��+0.853
 Bliss+0.842
Neg logit contributions
Gall-1.029
 Rak-0.997
horse-0.981
 Serv-0.972
tera-0.969
Token 21
Token ID2310
Feature activation+3.39e-01

Pos logit contributions
appropriate+0.507
 Bliss+0.499
uba+0.490
achment+0.486
abouts+0.483
Neg logit contributions
Gall-0.570
tera-0.553
 Rak-0.544
horse-0.543
 Serv-0.523
Token50
Token ID1120
Feature activation+2.27e-01

Pos logit contributions
appropriate+0.200
uba+0.193
 Bliss+0.193
achment+0.191
��+0.188
Neg logit contributions
Gall-0.257
tera-0.249
 Rak-0.248
horse-0.246
 Serv-0.240
Token Posts
Token ID12043
Feature activation+2.87e+00

Pos logit contributions
appropriate+0.056
 Bliss+0.053
uba+0.052
achment+0.051
abouts+0.047
Neg logit contributions
Gall-0.250
tera-0.244
 Rak-0.243
horse-0.243
 Serv-0.236
Token #
Token ID1303
Feature activation+7.25e-01

Pos logit contributions
appropriate+1.736
uba+1.671
achment+1.654
 Bliss+1.650
��+1.646
Neg logit contributions
Gall-2.154
 Rak-2.087
horse-2.062
tera-2.052
 Serv-2.027
Token16
Token ID1433
Feature activation-1.07e+00

Pos logit contributions
appropriate+0.466
uba+0.444
achment+0.436
reviewed+0.433
 Bliss+0.431
Neg logit contributions
Gall-0.521
 Rak-0.511
tera-0.502
 Serv-0.491
horse-0.488
Token Glad
Token ID28442
Feature activation-5.73e-02

Pos logit contributions
Gall+0.747
 Rak+0.719
horse+0.712
tera+0.711
 Serv+0.697
Neg logit contributions
appropriate-0.699
uba-0.676
 Bliss-0.671
achment-0.670
��-0.666
Token to
Token ID284
Feature activation-2.62e-01

Pos logit contributions
Gall+0.037
 Rak+0.035
horse+0.035
tera+0.035
 Serv+0.034
Neg logit contributions
appropriate-0.041
uba-0.040
 Bliss-0.039
achment-0.039
��-0.039
Token hear
Token ID3285
Feature activation-4.93e-01

Pos logit contributions
Gall+0.212
 Rak+0.204
horse+0.203
tera+0.203
 Serv+0.198
Neg logit contributions
appropriate-0.143
uba-0.138
 Bliss-0.136
achment-0.135
��-0.134
Token litter
Token ID25359
Feature activation+1.43e+00

Pos logit contributions
Gall+0.110
 Rak+0.105
tera+0.104
horse+0.103
 Serv+0.101
Neg logit contributions
appropriate-0.124
uba-0.120
achment-0.119
 Bliss-0.119
��-0.118
Tokening
Token ID278
Feature activation-3.94e-01

Pos logit contributions
appropriate+0.964
uba+0.937
achment+0.922
 Bliss+0.922
abouts+0.901
Neg logit contributions
Gall-0.982
 Rak-0.945
tera-0.941
horse-0.925
 Serv-0.910
Token the
Token ID262
Feature activation-3.77e-01

Pos logit contributions
Gall+0.269
 Rak+0.258
tera+0.256
horse+0.255
 Serv+0.250
Neg logit contributions
appropriate-0.265
uba-0.257
 Bliss-0.254
achment-0.253
��-0.251
Token battle
Token ID3344
Feature activation+6.08e-01

Pos logit contributions
Gall+0.266
 Rak+0.255
tera+0.254
horse+0.253
 Serv+0.247
Neg logit contributions
appropriate-0.244
uba-0.235
 Bliss-0.234
achment-0.233
��-0.230
Token-
Token ID12
Feature activation+2.55e-01

Pos logit contributions
appropriate+0.352
uba+0.338
achment+0.335
 Bliss+0.334
��+0.328
Neg logit contributions
Gall-0.473
 Rak-0.457
tera-0.452
horse-0.448
 Serv-0.443
Tokenfront
Token ID8534
Feature activation+2.13e+00

Pos logit contributions
appropriate+0.145
uba+0.139
achment+0.137
 Bliss+0.136
��+0.135
Neg logit contributions
Gall-0.201
 Rak-0.195
tera-0.193
horse-0.190
 Serv-0.190
Tokens
Token ID82
Feature activation-1.38e-01

Pos logit contributions
appropriate+1.538
achment+1.485
uba+1.481
 Bliss+1.480
abouts+1.474
Neg logit contributions
Gall-1.342
tera-1.285
 Rak-1.271
horse-1.237
 Serv-1.216
Token"
Token ID1
Feature activation+3.18e-01

Pos logit contributions
Gall+0.087
 Rak+0.084
tera+0.083
horse+0.082
 Serv+0.080
Neg logit contributions
appropriate-0.099
uba-0.096
achment-0.096
 Bliss-0.096
��-0.095
Token\n
Token ID198
Feature activation+2.63e-01

Pos logit contributions
appropriate+0.210
uba+0.202
achment+0.201
 Bliss+0.200
��+0.197
Neg logit contributions
Gall-0.222
 Rak-0.212
tera-0.212
horse-0.210
 Serv-0.205
Token\n
Token ID198
Feature activation+6.23e-01

Pos logit contributions
appropriate+0.159
 Bliss+0.153
uba+0.152
achment+0.151
reviewed+0.148
Neg logit contributions
Gall-0.194
tera-0.190
 Rak-0.188
horse-0.186
 Serv-0.182
TokenEnt
Token ID14539
Feature activation-8.33e-01

Pos logit contributions
appropriate+0.430
uba+0.414
achment+0.410
 Bliss+0.406
��+0.402
Neg logit contributions
Gall-0.406
 Rak-0.402
tera-0.394
horse-0.390
 Serv-0.386
Token release
Token ID2650
Feature activation+7.58e-01

Pos logit contributions
appropriate+0.255
 Bliss+0.246
uba+0.246
achment+0.243
��+0.241
Neg logit contributions
Gall-0.284
horse-0.269
 Rak-0.269
tera-0.268
 Serv-0.258
Token -
Token ID532
Feature activation+2.06e-01

Pos logit contributions
appropriate+0.505
 Bliss+0.492
uba+0.492
achment+0.487
abouts+0.483
Neg logit contributions
Gall-0.516
 Rak-0.488
tera-0.488
horse-0.485
 Serv-0.469
Token that
Token ID326
Feature activation+1.81e-01

Pos logit contributions
appropriate+0.142
uba+0.137
 Bliss+0.137
achment+0.136
��+0.134
Neg logit contributions
Gall-0.137
tera-0.130
 Rak-0.130
horse-0.129
 Serv-0.125
Token's
Token ID338
Feature activation-3.13e-02

Pos logit contributions
appropriate+0.122
 Bliss+0.118
uba+0.118
achment+0.117
abouts+0.115
Neg logit contributions
Gall-0.122
 Rak-0.115
tera-0.115
horse-0.115
 Serv-0.111
Token something
Token ID1223
Feature activation-5.85e-01

Pos logit contributions
Gall+0.022
tera+0.021
 Rak+0.021
horse+0.021
 Serv+0.020
Neg logit contributions
appropriate-0.021
 Bliss-0.020
uba-0.020
achment-0.020
��-0.019
Token slightly
Token ID4622
Feature activation+2.38e+00

Pos logit contributions
Gall+0.374
 Rak+0.357
tera+0.354
horse+0.354
 Serv+0.345
Neg logit contributions
appropriate-0.419
uba-0.405
 Bliss-0.402
achment-0.401
��-0.398
Token different
Token ID1180
Feature activation+1.08e+00

Pos logit contributions
appropriate+1.414
 Bliss+1.347
uba+1.340
achment+1.330
 Cohn+1.326
Neg logit contributions
Gall-1.797
tera-1.742
horse-1.714
 Rak-1.702
 Serv-1.670
Token.
Token ID13
Feature activation+3.61e-01

Pos logit contributions
appropriate+0.788
uba+0.765
 Bliss+0.757
achment+0.752
abouts+0.748
Neg logit contributions
Gall-0.678
 Rak-0.640
tera-0.640
horse-0.631
 Serv-0.616
Token\n
Token ID198
Feature activation+2.55e-01

Pos logit contributions
appropriate+0.220
 Bliss+0.215
uba+0.212
achment+0.212
��+0.206
Neg logit contributions
Gall-0.266
tera-0.256
 Rak-0.254
horse-0.254
 Serv-0.246
Token\n
Token ID198
Feature activation+5.11e-01

Pos logit contributions
appropriate+0.152
achment+0.144
uba+0.144
 Bliss+0.144
reviewed+0.142
Neg logit contributions
Gall-0.191
tera-0.186
 Rak-0.185
horse-0.182
 Serv-0.180
TokenThe
Token ID464
Feature activation+5.80e-01

Pos logit contributions
appropriate+0.326
achment+0.313
uba+0.310
reviewed+0.309
 Bliss+0.307
Neg logit contributions
 Rak-0.353
Gall-0.350
tera-0.350
 Serv-0.343
horse-0.341
Token wrong
Token ID2642
Feature activation+6.33e-01

Pos logit contributions
appropriate+0.073
uba+0.071
 Bliss+0.070
achment+0.070
��+0.069
Neg logit contributions
Gall-0.071
tera-0.068
 Rak-0.068
horse-0.067
 Serv-0.066
Token.
Token ID13
Feature activation+2.02e-01

Pos logit contributions
appropriate+0.420
uba+0.405
achment+0.402
 Bliss+0.401
abouts+0.395
Neg logit contributions
Gall-0.437
 Rak-0.420
horse-0.414
tera-0.413
 Serv-0.407
Token Like
Token ID4525
Feature activation-5.73e-01

Pos logit contributions
appropriate+0.129
 Bliss+0.125
uba+0.125
achment+0.123
��+0.122
Neg logit contributions
Gall-0.144
horse-0.137
tera-0.137
 Rak-0.137
 Serv-0.133
Token a
Token ID257
Feature activation-2.18e-02

Pos logit contributions
Gall+0.363
tera+0.344
horse+0.344
 Rak+0.342
 Serv+0.332
Neg logit contributions
appropriate-0.410
 Bliss-0.399
uba-0.399
��-0.394
achment-0.393
Token new
Token ID649
Feature activation-2.06e-01

Pos logit contributions
Gall+0.018
 Rak+0.017
tera+0.017
horse+0.017
 Serv+0.017
Neg logit contributions
appropriate-0.012
uba-0.011
 Bliss-0.011
achment-0.011
��-0.011
Tokenbie
Token ID12590
Feature activation+2.03e+00

Pos logit contributions
Gall+0.144
 Rak+0.138
horse+0.136
tera+0.136
 Serv+0.133
Neg logit contributions
appropriate-0.135
uba-0.131
 Bliss-0.130
achment-0.130
��-0.129
Token O
Token ID440
Feature activation-9.31e-02

Pos logit contributions
appropriate+1.347
 Bliss+1.322
uba+1.299
 Cohn+1.285
 Klaus+1.277
Neg logit contributions
Gall-1.378
tera-1.328
horse-1.306
 Rak-1.289
 Serv-1.255
TokenO
Token ID46
Feature activation+3.16e-01

Pos logit contributions
Gall+0.061
 Rak+0.059
horse+0.058
tera+0.058
 Serv+0.057
Neg logit contributions
appropriate-0.065
uba-0.064
��-0.062
 Bliss-0.062
achment-0.062
Token programmer
Token ID24292
Feature activation+8.25e-01

Pos logit contributions
appropriate+0.200
uba+0.195
 Bliss+0.193
achment+0.192
��+0.191
Neg logit contributions
Gall-0.226
 Rak-0.217
tera-0.215
horse-0.215
 Serv-0.211
Token violating
Token ID18134
Feature activation-3.19e-01

Pos logit contributions
appropriate+0.578
uba+0.565
 Bliss+0.565
achment+0.557
��+0.553
Neg logit contributions
Gall-0.533
tera-0.510
 Rak-0.504
horse-0.503
 Serv-0.488
Token the
Token ID262
Feature activation+6.46e-01

Pos logit contributions
Gall+0.220
 Rak+0.209
horse+0.209
tera+0.209
 Serv+0.202
Neg logit contributions
appropriate-0.212
uba-0.206
 Bliss-0.204
��-0.203
achment-0.202
Token\n
Token ID198
Feature activation-3.26e-02

Pos logit contributions
appropriate+0.052
 Bliss+0.051
uba+0.050
achment+0.050
��+0.049
Neg logit contributions
Gall-0.063
tera-0.060
 Rak-0.060
horse-0.060
 Serv-0.058
Token\n
Token ID198
Feature activation+2.35e-01

Pos logit contributions
Gall+0.024
 Rak+0.023
tera+0.023
horse+0.023
 Serv+0.023
Neg logit contributions
appropriate-0.020
achment-0.019
uba-0.019
 Bliss-0.019
reviewed-0.019
TokenEven
Token ID6104
Feature activation+1.99e-01

Pos logit contributions
appropriate+0.149
achment+0.141
uba+0.140
reviewed+0.140
 Bliss+0.139
Neg logit contributions
 Rak-0.165
Gall-0.164
tera-0.162
 Serv-0.160
horse-0.159
Token more
Token ID517
Feature activation-1.40e-01

Pos logit contributions
appropriate+0.139
 Bliss+0.135
uba+0.134
achment+0.132
 Cohn+0.131
Neg logit contributions
Gall-0.129
tera-0.123
 Rak-0.122
horse-0.121
 Serv-0.118
Token interesting
Token ID3499
Feature activation+1.68e-01

Pos logit contributions
Gall+0.085
 Rak+0.081
tera+0.080
horse+0.079
 Serv+0.077
Neg logit contributions
appropriate-0.105
 Bliss-0.101
uba-0.101
achment-0.100
��-0.099
Token is
Token ID318
Feature activation+3.26e-01

Pos logit contributions
appropriate+0.128
uba+0.124
achment+0.123
 Bliss+0.123
��+0.122
Neg logit contributions
Gall-0.100
 Rak-0.096
tera-0.094
horse-0.094
 Serv-0.092
Token that
Token ID326
Feature activation+4.47e-01

Pos logit contributions
appropriate+0.222
uba+0.215
achment+0.213
 Bliss+0.213
��+0.212
Neg logit contributions
Gall-0.220
 Rak-0.212
horse-0.209
tera-0.208
 Serv-0.205
Token when
Token ID618
Feature activation+4.72e-01

Pos logit contributions
appropriate+0.304
 Bliss+0.294
uba+0.293
achment+0.290
��+0.288
Neg logit contributions
Gall-0.300
 Rak-0.286
tera-0.285
horse-0.283
 Serv-0.276
Token chickens
Token ID25972
Feature activation+2.69e-01

Pos logit contributions
appropriate+0.331
 Bliss+0.317
uba+0.316
achment+0.312
��+0.309
Neg logit contributions
Gall-0.309
tera-0.297
 Rak-0.294
horse-0.292
 Serv-0.284
Token sleep
Token ID3993
Feature activation+6.80e-01

Pos logit contributions
appropriate+0.175
uba+0.170
 Bliss+0.167
achment+0.164
abouts+0.163
Neg logit contributions
Gall-0.191
 Rak-0.183
tera-0.182
horse-0.181
 Serv-0.178
Token,
Token ID11
Feature activation+9.22e-01

Pos logit contributions
appropriate+0.492
uba+0.471
 Bliss+0.469
achment+0.467
abouts+0.463
Neg logit contributions
Gall-0.436
 Rak-0.413
tera-0.410
horse-0.406
 Serv-0.400
Token in
Token ID287
Feature activation-1.40e-01

Pos logit contributions
appropriate+0.224
uba+0.218
 Bliss+0.217
achment+0.214
��+0.213
Neg logit contributions
Gall-0.203
tera-0.194
 Rak-0.193
horse-0.192
 Serv-0.187
Token clothes
Token ID8242
Feature activation+5.11e-02

Pos logit contributions
Gall+0.098
 Rak+0.094
horse+0.093
tera+0.093
 Serv+0.091
Neg logit contributions
appropriate-0.092
uba-0.089
 Bliss-0.088
achment-0.088
��-0.088
Token from
Token ID422
Feature activation-3.06e-01

Pos logit contributions
appropriate+0.037
uba+0.036
 Bliss+0.036
achment+0.035
��+0.035
Neg logit contributions
Gall-0.032
tera-0.031
 Rak-0.031
horse-0.030
 Serv-0.030
Token three
Token ID1115
Feature activation+2.15e-01

Pos logit contributions
Gall+0.211
 Rak+0.204
horse+0.201
tera+0.201
 Serv+0.197
Neg logit contributions
appropriate-0.203
uba-0.196
achment-0.194
 Bliss-0.194
��-0.193
TokenAS
Token ID1921
Feature activation+6.66e-01

Pos logit contributions
appropriate+0.133
uba+0.128
 Bliss+0.127
achment+0.126
��+0.125
Neg logit contributions
Gall-0.159
 Rak-0.152
tera-0.151
horse-0.151
 Serv-0.148
TokenF
Token ID37
Feature activation+3.30e-01

Pos logit contributions
appropriate+0.442
uba+0.428
 Bliss+0.423
achment+0.422
��+0.421
Neg logit contributions
Gall-0.458
 Rak-0.443
horse-0.437
tera-0.435
 Serv-0.428
TokenOUR
Token ID11698
Feature activation+5.70e-01

Pos logit contributions
appropriate+0.214
uba+0.207
achment+0.206
 Bliss+0.205
��+0.205
Neg logit contributions
Gall-0.232
 Rak-0.224
horse-0.221
tera-0.220
 Serv-0.217
Token and
Token ID290
Feature activation+5.23e-01

Pos logit contributions
appropriate+0.403
uba+0.390
achment+0.387
 Bliss+0.386
��+0.385
Neg logit contributions
Gall-0.369
 Rak-0.355
horse-0.350
tera-0.349
 Serv-0.343
Token Ch
Token ID609
Feature activation+3.07e-01

Pos logit contributions
appropriate+0.359
 Bliss+0.347
uba+0.347
achment+0.343
��+0.341
Neg logit contributions
Gall-0.348
 Rak-0.332
tera-0.331
horse-0.331
 Serv-0.322
Tokenado
Token ID4533
Feature activation-9.11e-02

Pos logit contributions
appropriate+0.197
uba+0.191
achment+0.189
 Bliss+0.188
��+0.187
Neg logit contributions
Gall-0.218
 Rak-0.211
horse-0.208
tera-0.207
 Serv-0.204
Token Ralph
Token ID20993
Feature activation-1.31e+00

Pos logit contributions
Gall+0.061
 Rak+0.059
horse+0.058
tera+0.058
 Serv+0.057
Neg logit contributions
appropriate-0.062
uba-0.060
 Bliss-0.060
achment-0.059
��-0.059
Token\n
Token ID198
Feature activation+1.14e-01

Pos logit contributions
Gall+0.044
 Rak+0.042
tera+0.042
horse+0.042
 Serv+0.041
Neg logit contributions
appropriate-0.036
achment-0.034
 Bliss-0.034
uba-0.034
reviewed-0.033
TokenStudents
Token ID28239
Feature activation-9.99e-01

Pos logit contributions
appropriate+0.080
uba+0.076
achment+0.076
 Bliss+0.074
reviewed+0.074
Neg logit contributions
Gall-0.074
 Rak-0.074
tera-0.071
 Serv-0.071
horse-0.071
Token were
Token ID547
Feature activation-4.83e-01

Pos logit contributions
Gall+0.651
 Rak+0.629
horse+0.622
tera+0.620
 Serv+0.605
Neg logit contributions
appropriate-0.699
uba-0.679
 Bliss-0.673
achment-0.667
��-0.663
Token told
Token ID1297
Feature activation-4.42e-02

Pos logit contributions
Gall+0.318
tera+0.305
 Rak+0.304
horse+0.302
 Serv+0.293
Neg logit contributions
appropriate-0.335
uba-0.326
 Bliss-0.325
achment-0.319
��-0.317
Token to
Token ID284
Feature activation+5.24e-01

Pos logit contributions
Gall+0.029
 Rak+0.028
horse+0.028
tera+0.028
 Serv+0.027
Neg logit contributions
appropriate-0.031
uba-0.030
achment-0.029
 Bliss-0.029
��-0.029
Token take
Token ID1011
Feature activation+1.75e-01

Pos logit contributions
appropriate+0.331
uba+0.315
 Bliss+0.314
achment+0.309
��+0.305
Neg logit contributions
Gall-0.384
 Rak-0.366
horse-0.365
tera-0.360
 Serv-0.356
Token shelter
Token ID11772
Feature activation-2.31e-01

Pos logit contributions
appropriate+0.116
uba+0.112
 Bliss+0.111
achment+0.110
��+0.109
Neg logit contributions
Gall-0.122
 Rak-0.117
horse-0.116
tera-0.116
 Serv-0.113
Token as
Token ID355
Feature activation+2.26e-01

Pos logit contributions
Gall+0.147
 Rak+0.140
horse+0.138
tera+0.138
 Serv+0.135
Neg logit contributions
appropriate-0.169
uba-0.162
 Bliss-0.160
achment-0.160
abouts-0.159
Token the
Token ID262
Feature activation-1.90e-01

Pos logit contributions
appropriate+0.152
uba+0.147
 Bliss+0.145
achment+0.145
��+0.144
Neg logit contributions
Gall-0.155
 Rak-0.149
horse-0.147
tera-0.146
 Serv-0.144
Token incident
Token ID4519
Feature activation-4.79e-01

Pos logit contributions
Gall+0.133
 Rak+0.127
horse+0.126
tera+0.126
 Serv+0.123
Neg logit contributions
appropriate-0.125
uba-0.121
 Bliss-0.120
achment-0.119
��-0.118
Token broke
Token ID6265
Feature activation-1.14e+00

Pos logit contributions
Gall+0.380
 Rak+0.365
tera+0.361
horse+0.359
 Serv+0.353
Neg logit contributions
appropriate-0.274
uba-0.261
achment-0.258
 Bliss-0.256
��-0.251
Token comments
Token ID3651
Feature activation-5.70e-01

Pos logit contributions
Gall+0.014
 Rak+0.013
tera+0.013
horse+0.013
 Serv+0.013
Neg logit contributions
appropriate-0.013
uba-0.013
achment-0.012
 Bliss-0.012
��-0.012
Token on
Token ID319
Feature activation-3.68e-01

Pos logit contributions
Gall+0.370
 Rak+0.355
tera+0.350
horse+0.350
 Serv+0.342
Neg logit contributions
appropriate-0.402
uba-0.390
achment-0.386
 Bliss-0.385
��-0.382
Token not
Token ID407
Feature activation+4.23e-01

Pos logit contributions
Gall+0.237
 Rak+0.227
horse+0.224
tera+0.224
 Serv+0.218
Neg logit contributions
appropriate-0.261
uba-0.253
 Bliss-0.251
achment-0.250
��-0.249
Token being
Token ID852
Feature activation+5.29e-02

Pos logit contributions
appropriate+0.302
uba+0.293
achment+0.291
 Bliss+0.287
��+0.284
Neg logit contributions
Gall-0.272
 Rak-0.260
horse-0.256
tera-0.256
 Serv-0.250
Token drug
Token ID2563
Feature activation+4.33e-01

Pos logit contributions
appropriate+0.035
uba+0.034
achment+0.034
 Bliss+0.033
��+0.033
Neg logit contributions
Gall-0.037
tera-0.035
 Rak-0.035
horse-0.035
 Serv-0.034
Token into
Token ID656
Feature activation+4.59e-01

Pos logit contributions
appropriate+0.278
uba+0.268
achment+0.264
 Bliss+0.263
��+0.261
Neg logit contributions
Gall-0.308
 Rak-0.299
tera-0.295
horse-0.295
 Serv-0.288
Token another
Token ID1194
Feature activation+3.71e-01

Pos logit contributions
appropriate+0.312
uba+0.303
 Bliss+0.298
achment+0.298
��+0.293
Neg logit contributions
Gall-0.310
 Rak-0.297
tera-0.295
horse-0.295
 Serv-0.285
Token long
Token ID890
Feature activation+1.19e+00

Pos logit contributions
appropriate+0.277
uba+0.270
achment+0.266
 Bliss+0.266
 MFT+0.261
Neg logit contributions
Gall-0.225
 Rak-0.213
horse-0.212
tera-0.211
 Serv-0.204
Token and
Token ID290
Feature activation+5.97e-03

Pos logit contributions
appropriate+0.870
uba+0.847
achment+0.839
 Bliss+0.828
��+0.816
Neg logit contributions
Gall-0.748
 Rak-0.707
tera-0.703
horse-0.695
 Serv-0.679
Token costly
Token ID16378
Feature activation-1.93e-01

Pos logit contributions
appropriate+0.004
uba+0.004
 Bliss+0.004
achment+0.004
reviewed+0.004
Neg logit contributions
Gall-0.004
tera-0.004
 Rak-0.004
horse-0.003
 Serv-0.003
Token war
Token ID1175
Feature activation+2.52e-01

Pos logit contributions
Gall+0.106
 Rak+0.100
tera+0.100
horse+0.099
 Serv+0.096
Neg logit contributions
appropriate-0.155
uba-0.151
 Bliss-0.150
achment-0.150
��-0.147
Token that
Token ID326
Feature activation-4.31e-02

Pos logit contributions
appropriate+0.803
uba+0.777
 Bliss+0.763
achment+0.756
abouts+0.746
Neg logit contributions
Gall-0.698
tera-0.672
horse-0.666
 Rak-0.665
 Serv-0.633
Token reached
Token ID4251
Feature activation-4.65e-01

Pos logit contributions
Gall+0.030
 Rak+0.028
horse+0.028
tera+0.028
 Serv+0.027
Neg logit contributions
appropriate-0.029
uba-0.028
 Bliss-0.027
achment-0.027
��-0.027
Token 185
Token ID22855
Feature activation+5.87e-01

Pos logit contributions
Gall+0.326
 Rak+0.314
horse+0.312
tera+0.311
 Serv+0.303
Neg logit contributions
appropriate-0.303
uba-0.293
 Bliss-0.289
achment-0.288
��-0.283
Token degrees
Token ID7370
Feature activation+7.83e-02

Pos logit contributions
appropriate+0.253
uba+0.241
achment+0.237
 Bliss+0.237
��+0.234
Neg logit contributions
Gall-0.538
 Rak-0.525
tera-0.519
horse-0.519
 Serv-0.510
Token F
Token ID376
Feature activation+5.08e-01

Pos logit contributions
appropriate+0.054
uba+0.052
 Bliss+0.052
achment+0.051
��+0.051
Neg logit contributions
Gall-0.052
 Rak-0.050
horse-0.050
tera-0.050
 Serv-0.049
Token and
Token ID290
Feature activation+4.83e-01

Pos logit contributions
appropriate+0.367
uba+0.357
 Bliss+0.353
achment+0.353
��+0.349
Neg logit contributions
Gall-0.320
 Rak-0.308
tera-0.303
horse-0.303
 Serv-0.295
Token 85
Token ID7600
Feature activation+3.56e-01

Pos logit contributions
appropriate+0.323
uba+0.314
 Bliss+0.314
achment+0.311
 Cohn+0.305
Neg logit contributions
Gall-0.325
tera-0.315
 Rak-0.313
horse-0.311
 Serv-0.302
Token percent
Token ID1411
Feature activation+3.81e-01

Pos logit contributions
appropriate+0.165
uba+0.156
achment+0.155
��+0.154
 Bliss+0.154
Neg logit contributions
Gall-0.319
 Rak-0.310
horse-0.307
tera-0.305
 Serv-0.303
Token relative
Token ID3585
Feature activation-9.81e-02

Pos logit contributions
appropriate+0.172
uba+0.165
 Bliss+0.163
achment+0.162
��+0.158
Neg logit contributions
Gall-0.342
 Rak-0.333
tera-0.331
horse-0.329
 Serv-0.323
Token humidity
Token ID27716
Feature activation+8.90e-01

Pos logit contributions
Gall+0.096
 Rak+0.094
horse+0.093
tera+0.093
 Serv+0.092
Neg logit contributions
appropriate-0.036
uba-0.034
achment-0.034
 Bliss-0.034
��-0.033
Token for
Token ID329
Feature activation-5.62e-01

Pos logit contributions
appropriate+0.612
uba+0.591
 Bliss+0.586
achment+0.584
��+0.575
Neg logit contributions
Gall-0.591
 Rak-0.569
tera-0.565
horse-0.562
 Serv-0.547
Token able
Token ID1498
Feature activation+4.58e-01

Pos logit contributions
appropriate+0.166
uba+0.160
 Bliss+0.159
achment+0.158
 MFT+0.157
Neg logit contributions
Gall-0.189
tera-0.184
 Rak-0.181
horse-0.180
 Serv-0.174
Token to
Token ID284
Feature activation+2.10e-01

Pos logit contributions
appropriate+0.301
uba+0.289
achment+0.288
 Bliss+0.284
abouts+0.279
Neg logit contributions
Gall-0.323
 Rak-0.308
horse-0.304
tera-0.301
 Serv-0.297
Token honor
Token ID7522
Feature activation+1.08e+00

Pos logit contributions
appropriate+0.135
uba+0.130
 Bliss+0.128
achment+0.127
 MFT+0.125
Neg logit contributions
Gall-0.151
 Rak-0.143
horse-0.143
tera-0.141
 Serv-0.138
Token our
Token ID674
Feature activation-2.04e-01

Pos logit contributions
appropriate+0.752
 Bliss+0.728
uba+0.728
achment+0.716
��+0.712
Neg logit contributions
Gall-0.712
tera-0.675
horse-0.672
 Rak-0.669
 Serv-0.643
Token ancestors
Token ID18668
Feature activation-4.43e-01

Pos logit contributions
Gall+0.147
tera+0.141
horse+0.139
 Rak+0.138
 Serv+0.133
Neg logit contributions
appropriate-0.129
 Bliss-0.127
uba-0.125
achment-0.125
 Cohn-0.122
Token by
Token ID416
Feature activation-4.71e-01

Pos logit contributions
Gall+0.282
 Rak+0.270
tera+0.266
horse+0.266
 Serv+0.259
Neg logit contributions
appropriate-0.317
uba-0.309
 Bliss-0.308
achment-0.306
��-0.302
Token making
Token ID1642
Feature activation+3.70e-01

Pos logit contributions
Gall+0.321
tera+0.308
 Rak+0.307
horse+0.305
 Serv+0.295
Neg logit contributions
appropriate-0.318
uba-0.305
 Bliss-0.304
achment-0.302
��-0.297
Token this
Token ID428
Feature activation-5.20e-01

Pos logit contributions
appropriate+0.275
 Bliss+0.264
uba+0.261
achment+0.260
 Cohn+0.258
Neg logit contributions
Gall-0.226
horse-0.216
 Rak-0.215
tera-0.215
 Serv-0.206
Token gift
Token ID6979
Feature activation-1.11e+00

Pos logit contributions
Gall+0.410
 Rak+0.395
tera+0.393
horse+0.391
 Serv+0.383
Neg logit contributions
appropriate-0.296
 Bliss-0.282
uba-0.280
achment-0.278
��-0.273
Token to
Token ID284
Feature activation-1.13e+00

Pos logit contributions
Gall+0.728
 Rak+0.700
tera+0.691
horse+0.688
 Serv+0.673
Neg logit contributions
appropriate-0.770
uba-0.740
achment-0.739
 Bliss-0.739
��-0.731
Token Northwestern
Token ID30197
Feature activation-2.98e-01

Pos logit contributions
Gall+0.760
horse+0.725
 Rak+0.724
tera+0.721
 Serv+0.696
Neg logit contributions
appropriate-0.774
uba-0.746
 Bliss-0.744
achment-0.739
��-0.733
Token be
Token ID307
Feature activation-8.28e-02

Pos logit contributions
Gall+0.254
 Rak+0.242
tera+0.240
horse+0.240
 Serv+0.235
Neg logit contributions
appropriate-0.239
uba-0.232
achment-0.227
 Bliss-0.227
abouts-0.223
Token a
Token ID257
Feature activation-3.23e-01

Pos logit contributions
Gall+0.058
tera+0.056
 Rak+0.056
horse+0.055
 Serv+0.054
Neg logit contributions
appropriate-0.054
uba-0.052
 Bliss-0.052
achment-0.051
��-0.051
Token res
Token ID581
Feature activation-1.28e+00

Pos logit contributions
Gall+0.234
tera+0.227
 Rak+0.226
horse+0.224
 Serv+0.219
Neg logit contributions
appropriate-0.204
 Bliss-0.195
uba-0.194
achment-0.192
��-0.189
Tokenounding
Token ID9969
Feature activation-1.45e+00

Pos logit contributions
Gall+0.724
 Rak+0.696
horse+0.688
tera+0.681
 Serv+0.667
Neg logit contributions
appropriate-1.014
uba-0.992
 Bliss-0.977
achment-0.976
��-0.962
Token endorsement
Token ID17819
Feature activation-8.96e-01

Pos logit contributions
Gall+0.978
 Rak+0.940
tera+0.931
horse+0.927
 Serv+0.911
Neg logit contributions
appropriate-0.986
uba-0.952
 Bliss-0.947
achment-0.945
��-0.937
Token,
Token ID11
Feature activation-5.48e-01

Pos logit contributions
Gall+0.538
 Rak+0.516
tera+0.510
horse+0.507
 Serv+0.497
Neg logit contributions
appropriate-0.673
uba-0.654
 Bliss-0.651
achment-0.648
��-0.643
Token it
Token ID340
Feature activation-6.39e-01

Pos logit contributions
Gall+0.370
 Rak+0.357
horse+0.356
tera+0.356
 Serv+0.346
Neg logit contributions
appropriate-0.369
uba-0.356
 Bliss-0.355
achment-0.352
��-0.349
Token topped
Token ID20633
Feature activation+1.81e-02

Pos logit contributions
Gall+0.423
tera+0.407
 Rak+0.403
horse+0.400
 Serv+0.393
Neg logit contributions
appropriate-0.441
uba-0.431
 Bliss-0.421
achment-0.421
abouts-0.420
Token the
Token ID262
Feature activation+4.60e-01

Pos logit contributions
appropriate+0.012
 Bliss+0.012
uba+0.012
achment+0.012
��+0.012
Neg logit contributions
Gall-0.012
tera-0.012
 Rak-0.012
horse-0.012
 Serv-0.011
Token cool
Token ID3608
Feature activation-3.23e-01

Pos logit contributions
appropriate+0.308
 Bliss+0.298
uba+0.297
achment+0.294
��+0.291
Neg logit contributions
Gall-0.310
tera-0.301
horse-0.298
 Rak-0.298
 Serv-0.290
Tokenness
Token ID1108
Feature activation+7.37e-01

Pos logit contributions
Gall+0.208
 Rak+0.200
tera+0.197
horse+0.196
 Serv+0.193
Neg logit contributions
appropriate-0.228
uba-0.222
achment-0.220
 Bliss-0.219
��-0.218
Token or
Token ID393
Feature activation-3.22e-01

Pos logit contributions
appropriate+0.477
uba+0.456
 Bliss+0.451
achment+0.449
��+0.443
Neg logit contributions
Gall-0.726
 Rak-0.705
tera-0.698
horse-0.694
 Serv-0.684
Token food
Token ID2057
Feature activation+1.15e+00

Pos logit contributions
Gall+0.211
 Rak+0.204
tera+0.201
horse+0.199
 Serv+0.197
Neg logit contributions
appropriate-0.224
uba-0.218
achment-0.215
 Bliss-0.215
��-0.215
Token in
Token ID287
Feature activation-2.51e-01

Pos logit contributions
appropriate+0.714
 Bliss+0.691
uba+0.689
achment+0.681
��+0.674
Neg logit contributions
Gall-0.834
tera-0.805
 Rak-0.805
horse-0.797
 Serv-0.778
Token a
Token ID257
Feature activation-4.11e-01

Pos logit contributions
Gall+0.165
 Rak+0.159
tera+0.158
horse+0.157
 Serv+0.153
Neg logit contributions
appropriate-0.176
uba-0.169
 Bliss-0.168
achment-0.167
��-0.166
Token state
Token ID1181
Feature activation+4.48e-01

Pos logit contributions
Gall+0.252
 Rak+0.241
tera+0.241
horse+0.239
 Serv+0.232
Neg logit contributions
appropriate-0.304
uba-0.294
 Bliss-0.293
achment-0.291
��-0.289
Token
Token ID960
Feature activation+8.41e-02

Pos logit contributions
appropriate+0.310
uba+0.300
achment+0.297
 Bliss+0.296
��+0.293
Neg logit contributions
Gall-0.295
 Rak-0.286
tera-0.282
horse-0.279
 Serv-0.275
Tokenor
Token ID273
Feature activation+2.35e-01

Pos logit contributions
appropriate+0.056
uba+0.053
achment+0.052
 Bliss+0.052
��+0.052
Neg logit contributions
Gall-0.058
 Rak-0.057
tera-0.056
 Serv-0.055
horse-0.055
Token weight
Token ID3463
Feature activation+1.11e+00

Pos logit contributions
appropriate+0.161
uba+0.155
 Bliss+0.155
��+0.154
achment+0.154
Neg logit contributions
Gall-0.156
 Rak-0.151
tera-0.149
horse-0.148
 Serv-0.146
Tokenier
Token ID959
Feature activation+4.46e-01

Pos logit contributions
appropriate+0.712
uba+0.684
achment+0.681
 Bliss+0.678
��+0.671
Neg logit contributions
Gall-0.788
 Rak-0.768
tera-0.759
horse-0.747
 Serv-0.735
Token subjects
Token ID7481
Feature activation+3.02e-01

Pos logit contributions
appropriate+0.293
 Bliss+0.285
uba+0.283
achment+0.279
abouts+0.277
Neg logit contributions
Gall-0.307
tera-0.296
 Rak-0.295
horse-0.291
 Serv-0.285
Token like
Token ID588
Feature activation-5.22e-02

Pos logit contributions
appropriate+0.201
uba+0.194
achment+0.193
 Bliss+0.192
��+0.189
Neg logit contributions
Gall-0.208
 Rak-0.200
tera-0.200
horse-0.196
 Serv-0.194
TokenThe
Token ID464
Feature activation-6.12e-01

Pos logit contributions
Gall+0.233
 Rak+0.232
tera+0.227
horse+0.225
 Serv+0.224
Neg logit contributions
appropriate-0.245
uba-0.237
achment-0.233
reviewed-0.231
 Bliss-0.229
Token occupational
Token ID34266
Feature activation-1.48e-01

Pos logit contributions
Gall+0.357
horse+0.338
tera+0.335
 Rak+0.335
 Serv+0.321
Neg logit contributions
appropriate-0.471
uba-0.461
 Bliss-0.456
achment-0.450
��-0.449
Token radiation
Token ID11881
Feature activation+8.74e-02

Pos logit contributions
Gall+0.112
 Rak+0.108
tera+0.107
horse+0.106
 Serv+0.103
Neg logit contributions
appropriate-0.089
uba-0.087
 Bliss-0.085
achment-0.084
��-0.084
Token doses
Token ID16376
Feature activation+7.95e-02

Pos logit contributions
appropriate+0.053
uba+0.051
 Bliss+0.050
achment+0.050
abouts+0.049
Neg logit contributions
Gall-0.066
 Rak-0.063
tera-0.063
horse-0.063
 Serv-0.061
Token to
Token ID284
Feature activation-3.98e-01

Pos logit contributions
appropriate+0.057
uba+0.056
 Bliss+0.055
achment+0.054
 Cohn+0.054
Neg logit contributions
Gall-0.050
 Rak-0.048
tera-0.048
horse-0.048
 Serv-0.046
Token crew
Token ID5462
Feature activation-6.63e-01

Pos logit contributions
Gall+0.263
 Rak+0.252
tera+0.251
horse+0.251
 Serv+0.241
Neg logit contributions
appropriate-0.276
uba-0.269
 Bliss-0.265
achment-0.261
��-0.259
Token of
Token ID286
Feature activation-2.33e-01

Pos logit contributions
Gall+0.487
 Rak+0.473
tera+0.468
horse+0.461
 Serv+0.454
Neg logit contributions
appropriate-0.411
uba-0.401
achment-0.389
 Bliss-0.388
abouts-0.381
Token nuclear
Token ID4523
Feature activation-6.69e-01

Pos logit contributions
Gall+0.151
tera+0.145
horse+0.144
 Rak+0.144
 Serv+0.137
Neg logit contributions
appropriate-0.164
uba-0.160
 Bliss-0.157
achment-0.154
 Cohn-0.154
Token vessels
Token ID14891
Feature activation-6.17e-02

Pos logit contributions
Gall+0.405
 Rak+0.386
tera+0.382
horse+0.379
 Serv+0.366
Neg logit contributions
appropriate-0.503
uba-0.493
 Bliss-0.484
achment-0.481
��-0.475
Token in
Token ID287
Feature activation-3.27e-01

Pos logit contributions
Gall+0.041
 Rak+0.040
tera+0.040
horse+0.039
 Serv+0.038
Neg logit contributions
appropriate-0.042
uba-0.041
achment-0.040
 Bliss-0.040
abouts-0.040
Token very
Token ID845
Feature activation-9.58e-01

Pos logit contributions
Gall+0.225
tera+0.216
 Rak+0.216
horse+0.215
 Serv+0.208
Neg logit contributions
appropriate-0.219
uba-0.211
 Bliss-0.208
achment-0.206
��-0.204
Token
Token ID447
Feature activation+4.02e-01

Pos logit contributions
Gall+0.037
 Rak+0.035
horse+0.035
tera+0.035
 Serv+0.034
Neg logit contributions
appropriate-0.030
uba-0.029
 Bliss-0.029
achment-0.029
abouts-0.028
Token
Token ID247
Feature activation+2.00e-01

Pos logit contributions
appropriate+0.210
��+0.202
uba+0.202
achment+0.200
 Bliss+0.197
Neg logit contributions
Gall-0.332
 Rak-0.326
tera-0.321
horse-0.321
 Serv-0.314
Tokenre
Token ID260
Feature activation+6.52e-01

Pos logit contributions
appropriate+0.134
uba+0.129
achment+0.128
 Bliss+0.126
reviewed+0.125
Neg logit contributions
Gall-0.139
 Rak-0.134
horse-0.132
tera-0.132
 Serv-0.129
Token not
Token ID407
Feature activation+6.89e-01

Pos logit contributions
appropriate+0.437
uba+0.416
achment+0.415
 Bliss+0.411
��+0.409
Neg logit contributions
Gall-0.450
 Rak-0.435
tera-0.431
horse-0.430
 Serv-0.415
Token sure
Token ID1654
Feature activation+8.41e-01

Pos logit contributions
appropriate+0.473
uba+0.455
 Bliss+0.450
achment+0.449
��+0.444
Neg logit contributions
Gall-0.460
tera-0.445
 Rak-0.442
horse-0.439
 Serv-0.425
Token how
Token ID703
Feature activation-1.51e-03

Pos logit contributions
appropriate+0.559
uba+0.538
 Bliss+0.538
achment+0.534
��+0.529
Neg logit contributions
Gall-0.578
 Rak-0.557
tera-0.552
horse-0.551
 Serv-0.538
Token they
Token ID484
Feature activation-4.00e-01

Pos logit contributions
Gall+0.001
 Rak+0.001
horse+0.001
tera+0.001
 Serv+0.001
Neg logit contributions
appropriate-0.001
uba-0.001
 Bliss-0.001
achment-0.001
��-0.001
Token would
Token ID561
Feature activation+2.78e-01

Pos logit contributions
Gall+0.285
 Rak+0.275
horse+0.271
tera+0.270
 Serv+0.266
Neg logit contributions
appropriate-0.258
uba-0.248
achment-0.247
 Bliss-0.246
��-0.243
Token have
Token ID423
Feature activation-1.15e+00

Pos logit contributions
appropriate+0.175
uba+0.167
achment+0.165
 Bliss+0.165
��+0.163
Neg logit contributions
Gall-0.205
 Rak-0.196
horse-0.195
tera-0.192
 Serv-0.190
Token voted
Token ID7052
Feature activation-1.31e+00

Pos logit contributions
Gall+0.798
 Rak+0.761
tera+0.758
horse+0.757
 Serv+0.736
Neg logit contributions
appropriate-0.773
uba-0.743
 Bliss-0.741
achment-0.736
abouts-0.727
Token had
Token ID550
Feature activation-8.88e-01

Pos logit contributions
Gall+0.829
 Rak+0.798
horse+0.786
tera+0.783
 Serv+0.769
Neg logit contributions
appropriate-0.944
uba-0.915
achment-0.907
 Bliss-0.906
��-0.905
Token\n
Token ID198
Feature activation-2.86e-01

Pos logit contributions
Gall+0.485
tera+0.471
 Rak+0.470
horse+0.463
 Serv+0.456
Neg logit contributions
appropriate-0.407
 Bliss-0.387
uba-0.387
achment-0.387
reviewed-0.382
TokenFrom
Token ID4863
Feature activation-6.83e-01

Pos logit contributions
Gall+0.191
 Rak+0.190
tera+0.187
horse+0.183
 Serv+0.183
Neg logit contributions
appropriate-0.194
uba-0.184
achment-0.184
 Bliss-0.183
reviewed-0.183
Token the
Token ID262
Feature activation-4.00e-01

Pos logit contributions
Gall+0.461
 Rak+0.445
tera+0.439
horse+0.438
 Serv+0.430
Neg logit contributions
appropriate-0.463
uba-0.448
 Bliss-0.444
achment-0.444
��-0.440
Token start
Token ID923
Feature activation-1.23e+00

Pos logit contributions
Gall+0.269
 Rak+0.259
tera+0.255
horse+0.255
 Serv+0.250
Neg logit contributions
appropriate-0.274
uba-0.265
 Bliss-0.262
achment-0.262
��-0.259
Token,
Token ID11
Feature activation+1.97e-01

Pos logit contributions
Gall+0.745
 Rak+0.715
horse+0.707
tera+0.700
 Serv+0.689
Neg logit contributions
appropriate-0.923
uba-0.898
achment-0.892
��-0.890
 Bliss-0.887
Token Hitchcock
Token ID49817
Feature activation-1.77e+00

Pos logit contributions
appropriate+0.101
uba+0.096
 Bliss+0.096
achment+0.095
��+0.093
Neg logit contributions
Gall-0.165
 Rak-0.160
tera-0.159
horse-0.158
 Serv-0.156
Token seemed
Token ID3947
Feature activation-7.13e-01

Pos logit contributions
Gall+1.197
 Rak+1.153
horse+1.136
tera+1.136
 Serv+1.115
Neg logit contributions
appropriate-1.199
uba-1.161
achment-1.149
 Bliss-1.148
��-1.140
Token intent
Token ID6824
Feature activation-3.33e-01

Pos logit contributions
Gall+0.474
 Rak+0.456
horse+0.450
tera+0.449
 Serv+0.441
Neg logit contributions
appropriate-0.492
uba-0.477
achment-0.472
 Bliss-0.472
��-0.469
Token on
Token ID319
Feature activation+1.43e-01

Pos logit contributions
Gall+0.178
 Rak+0.172
horse+0.170
tera+0.169
 Serv+0.164
Neg logit contributions
appropriate-0.271
uba-0.263
��-0.262
 Bliss-0.261
achment-0.260
Token creating
Token ID4441
Feature activation-2.82e-01

Pos logit contributions
appropriate+0.102
uba+0.099
 Bliss+0.098
achment+0.098
��+0.097
Neg logit contributions
Gall-0.092
 Rak-0.088
tera-0.088
horse-0.086
 Serv-0.085
Token a
Token ID257
Feature activation+8.25e-02

Pos logit contributions
Gall+0.174
 Rak+0.168
tera+0.165
horse+0.164
 Serv+0.161
Neg logit contributions
appropriate-0.208
uba-0.201
 Bliss-0.199
achment-0.199
��-0.196
Token\n
Token ID198
Feature activation-3.33e-01

Pos logit contributions
Gall+0.599
 Rak+0.580
tera+0.579
horse+0.571
 Serv+0.561
Neg logit contributions
appropriate-0.504
uba-0.480
 Bliss-0.480
achment-0.479
reviewed-0.470
TokenCh
Token ID1925
Feature activation-5.55e-01

Pos logit contributions
Gall+0.220
 Rak+0.217
tera+0.213
horse+0.212
 Serv+0.210
Neg logit contributions
appropriate-0.226
uba-0.218
achment-0.216
 Bliss-0.215
reviewed-0.212
Tokenlor
Token ID4685
Feature activation+3.30e-01

Pos logit contributions
Gall+0.360
 Rak+0.347
horse+0.342
tera+0.340
 Serv+0.335
Neg logit contributions
appropriate-0.391
uba-0.378
achment-0.375
 Bliss-0.375
��-0.374
Tokenop
Token ID404
Feature activation-6.37e-01

Pos logit contributions
appropriate+0.288
uba+0.280
 Bliss+0.280
achment+0.279
��+0.279
Neg logit contributions
Gall-0.158
 Rak-0.150
horse-0.148
tera-0.147
 Serv-0.143
Tokenrene
Token ID25924
Feature activation-1.22e-01

Pos logit contributions
Gall+0.635
 Rak+0.622
horse+0.611
 Serv+0.605
tera+0.604
Neg logit contributions
appropriate-0.233
uba-0.227
achment-0.222
 Bliss-0.215
reviewed-0.203
Token rubber
Token ID14239
Feature activation-2.18e+00

Pos logit contributions
Gall+0.085
 Rak+0.082
horse+0.081
tera+0.081
 Serv+0.079
Neg logit contributions
appropriate-0.079
uba-0.078
 Bliss-0.077
achment-0.076
��-0.075
Token goes
Token ID2925
Feature activation-7.61e-01

Pos logit contributions
Gall+1.341
 Rak+1.289
horse+1.268
tera+1.268
 Serv+1.237
Neg logit contributions
appropriate-1.605
uba-1.566
achment-1.549
 Bliss-1.548
��-1.535
Token into
Token ID656
Feature activation-2.40e-01

Pos logit contributions
Gall+0.481
 Rak+0.459
tera+0.454
horse+0.451
 Serv+0.440
Neg logit contributions
appropriate-0.547
uba-0.537
 Bliss-0.530
achment-0.526
��-0.521
Token all
Token ID477
Feature activation-3.64e-01

Pos logit contributions
Gall+0.171
 Rak+0.164
tera+0.163
horse+0.162
 Serv+0.158
Neg logit contributions
appropriate-0.155
uba-0.150
 Bliss-0.149
achment-0.148
��-0.145
Token sorts
Token ID10524
Feature activation-6.05e-02

Pos logit contributions
Gall+0.248
 Rak+0.238
horse+0.236
tera+0.235
 Serv+0.230
Neg logit contributions
appropriate-0.245
uba-0.238
 Bliss-0.235
achment-0.233
abouts-0.229
Token of
Token ID286
Feature activation+3.86e-01

Pos logit contributions
Gall+0.039
 Rak+0.037
tera+0.037
horse+0.036
 Serv+0.035
Neg logit contributions
appropriate-0.043
uba-0.042
 Bliss-0.042
achment-0.041
abouts-0.041
Token+
Token ID10
Feature activation-2.69e-01

Pos logit contributions
appropriate+0.340
uba+0.328
achment+0.323
��+0.320
reviewed+0.317
Neg logit contributions
Gall-0.311
 Rak-0.305
horse-0.294
tera-0.294
 Serv-0.293
Token--------------------
Token ID19351
Feature activation+1.53e+00

Pos logit contributions
Gall+0.146
 Rak+0.139
horse+0.138
tera+0.137
 Serv+0.133
Neg logit contributions
appropriate-0.216
uba-0.210
��-0.208
achment-0.208
 Bliss-0.207
Token--+
Token ID44785
Feature activation-2.08e-01

Pos logit contributions
appropriate+0.925
uba+0.887
��+0.884
achment+0.877
 Bliss+0.870
Neg logit contributions
Gall-1.144
 Rak-1.095
horse-1.091
tera-1.090
 Serv-1.060
Token |
Token ID930
Feature activation-1.13e+00

Pos logit contributions
Gall+0.118
 Rak+0.112
horse+0.111
tera+0.111
 Serv+0.107
Neg logit contributions
appropriate-0.163
uba-0.159
achment-0.157
 Bliss-0.157
��-0.157
Token...
Token ID2644
Feature activation+1.34e-01

Pos logit contributions
Gall+0.761
 Rak+0.732
horse+0.725
tera+0.722
 Serv+0.707
Neg logit contributions
appropriate-0.766
uba-0.740
 Bliss-0.734
achment-0.734
��-0.730
Token |
Token ID930
Feature activation-2.30e+00

Pos logit contributions
appropriate+0.111
uba+0.108
achment+0.108
��+0.107
 Bliss+0.107
Neg logit contributions
Gall-0.071
 Rak-0.066
horse-0.066
tera-0.066
 Serv-0.064
Token -
Token ID532
Feature activation-7.52e-01

Pos logit contributions
Gall+1.493
 Rak+1.438
horse+1.418
tera+1.413
 Serv+1.388
Neg logit contributions
appropriate-1.624
uba-1.574
achment-1.560
 Bliss-1.557
��-1.554
Token doesn
Token ID1595
Feature activation-4.58e-01

Pos logit contributions
Gall+0.431
 Rak+0.411
horse+0.407
tera+0.404
 Serv+0.396
Neg logit contributions
appropriate-0.587
uba-0.569
achment-0.565
 Bliss-0.563
��-0.562
Token't
Token ID470
Feature activation-1.17e+00

Pos logit contributions
Gall+0.331
 Rak+0.323
tera+0.315
horse+0.314
 Serv+0.312
Neg logit contributions
appropriate-0.291
uba-0.279
achment-0.275
 Bliss-0.271
reviewed-0.268
Token matter
Token ID2300
Feature activation-7.81e-01

Pos logit contributions
Gall+0.783
 Rak+0.748
horse+0.741
tera+0.739
 Serv+0.722
Neg logit contributions
appropriate-0.812
uba-0.787
 Bliss-0.776
achment-0.776
��-0.771
Token,
Token ID11
Feature activation-1.16e+00

Pos logit contributions
Gall+0.459
 Rak+0.439
horse+0.433
tera+0.432
 Serv+0.422
Neg logit contributions
appropriate-0.599
uba-0.582
achment-0.577
 Bliss-0.576
��-0.575
Token be
Token ID307
Feature activation-4.49e-01

Pos logit contributions
Gall+0.452
 Rak+0.435
horse+0.429
tera+0.427
 Serv+0.421
Neg logit contributions
appropriate-0.433
uba-0.418
achment-0.414
 Bliss-0.412
��-0.411
Token part
Token ID636
Feature activation-3.93e-01

Pos logit contributions
Gall+0.284
 Rak+0.272
tera+0.269
horse+0.268
 Serv+0.262
Neg logit contributions
appropriate-0.326
uba-0.315
 Bliss-0.314
achment-0.313
��-0.310
Token of
Token ID286
Feature activation-2.78e-01

Pos logit contributions
Gall+0.249
 Rak+0.238
tera+0.236
horse+0.233
 Serv+0.227
Neg logit contributions
appropriate-0.283
achment-0.275
uba-0.273
 Bliss-0.272
 Cohn-0.266
Token a
Token ID257
Feature activation-2.58e-01

Pos logit contributions
Gall+0.188
 Rak+0.180
horse+0.178
tera+0.177
 Serv+0.173
Neg logit contributions
appropriate-0.190
uba-0.182
 Bliss-0.181
��-0.181
achment-0.181
Token DACA
Token ID38061
Feature activation+2.16e-01

Pos logit contributions
Gall+0.169
 Rak+0.161
horse+0.160
tera+0.159
 Serv+0.154
Neg logit contributions
appropriate-0.180
 Bliss-0.175
uba-0.173
achment-0.173
��-0.172
Token deal
Token ID1730
Feature activation-2.13e+00

Pos logit contributions
appropriate+0.151
 Bliss+0.145
uba+0.145
achment+0.145
��+0.144
Neg logit contributions
Gall-0.142
 Rak-0.135
horse-0.134
tera-0.134
 Serv-0.130
Token,
Token ID11
Feature activation-1.94e-01

Pos logit contributions
Gall+1.360
 Rak+1.306
horse+1.288
tera+1.285
 Serv+1.260
Neg logit contributions
appropriate-1.527
uba-1.478
achment-1.469
 Bliss-1.466
��-1.461
Token
Token ID447
Feature activation-8.28e-01

Pos logit contributions
Gall+0.119
 Rak+0.114
horse+0.113
tera+0.113
 Serv+0.110
Neg logit contributions
appropriate-0.143
 Bliss-0.138
achment-0.138
uba-0.138
��-0.137
Token
Token ID247
Feature activation-4.91e-01

Pos logit contributions
Gall+0.682
 Rak+0.662
horse+0.655
tera+0.653
 Serv+0.644
Neg logit contributions
appropriate-0.439
uba-0.421
achment-0.416
 Bliss-0.415
��-0.413
Token Trump
Token ID1301
Feature activation+3.76e-01

Pos logit contributions
Gall+0.302
 Rak+0.288
horse+0.287
tera+0.286
 Serv+0.279
Neg logit contributions
appropriate-0.362
uba-0.351
 Bliss-0.349
achment-0.349
��-0.347
Token said
Token ID531
Feature activation-1.52e-02

Pos logit contributions
appropriate+0.298
uba+0.292
 Bliss+0.291
achment+0.289
��+0.287
Neg logit contributions
Gall-0.207
 Rak-0.197
horse-0.196
tera-0.194
 Serv-0.190
Token0
Token ID15
Feature activation-9.87e-01

Pos logit contributions
appropriate+0.346
achment+0.332
uba+0.331
 Bliss+0.330
��+0.323
Neg logit contributions
Gall-0.416
 Rak-0.404
tera-0.397
horse-0.397
 Serv-0.389
Token)
Token ID8
Feature activation+5.81e-01

Pos logit contributions
Gall+0.539
 Rak+0.518
horse+0.508
tera+0.507
 Serv+0.493
Neg logit contributions
appropriate-0.794
uba-0.774
 Bliss-0.769
achment-0.768
��-0.764
Token /
Token ID1220
Feature activation-5.88e-01

Pos logit contributions
appropriate+0.459
uba+0.446
 Bliss+0.443
achment+0.443
��+0.439
Neg logit contributions
Gall-0.324
 Rak-0.312
tera-0.309
horse-0.308
 Serv-0.297
Token 2
Token ID362
Feature activation-3.08e-01

Pos logit contributions
Gall+0.407
 Rak+0.393
horse+0.391
tera+0.385
 Serv+0.378
Neg logit contributions
appropriate-0.387
uba-0.376
 Bliss-0.375
achment-0.371
��-0.368
Token.
Token ID13
Feature activation-5.96e-01

Pos logit contributions
Gall+0.199
 Rak+0.191
horse+0.188
tera+0.188
 Serv+0.185
Neg logit contributions
appropriate-0.218
uba-0.211
achment-0.209
 Bliss-0.209
��-0.208
Token0
Token ID15
Feature activation-1.78e+00

Pos logit contributions
Gall+0.441
 Rak+0.427
horse+0.421
tera+0.420
 Serv+0.413
Neg logit contributions
appropriate-0.366
uba-0.352
achment-0.349
 Bliss-0.349
��-0.347
Token;
Token ID26
Feature activation-7.18e-01

Pos logit contributions
Gall+0.919
 Rak+0.875
horse+0.858
tera+0.856
 Serv+0.841
Neg logit contributions
appropriate-1.489
uba-1.450
achment-1.439
 Bliss-1.434
��-1.432
Token //
Token ID3373
Feature activation-2.32e+00

Pos logit contributions
Gall+0.424
 Rak+0.405
horse+0.401
tera+0.399
 Serv+0.387
Neg logit contributions
appropriate-0.546
uba-0.532
 Bliss-0.529
achment-0.525
��-0.524
Token [
Token ID685
Feature activation+6.67e-01

Pos logit contributions
Gall+1.682
 Rak+1.628
horse+1.603
tera+1.600
 Serv+1.579
Neg logit contributions
appropriate-1.463
uba-1.411
achment-1.401
��-1.394
 Bliss-1.391
Token0
Token ID15
Feature activation-9.48e-01

Pos logit contributions
appropriate+0.410
uba+0.393
 Bliss+0.388
achment+0.387
��+0.385
Neg logit contributions
Gall-0.489
 Rak-0.479
horse-0.471
tera-0.467
 Serv-0.464
Token,
Token ID11
Feature activation-6.79e-01

Pos logit contributions
Gall+0.560
 Rak+0.538
horse+0.530
tera+0.528
 Serv+0.520
Neg logit contributions
appropriate-0.723
uba-0.701
achment-0.694
 Bliss-0.693
��-0.692