TOP ACTIVATIONS
MAX = 11.647

Common Bounty gate Lit igation Q & As .
build ings . The signaling mast was bare in the wind
consume alcoholic beverages . The decl ination to file criminal charges
breast - blood spew ing Vul v atron .
se us tied to the mast , sailing away from the
with you , bear the decl ination with becoming grace ;
for def raud ing its Lif eline program , which subsid
Vault . There are some fram erate flaws present that don
all . The presumptive GOP presidential nominee said Friday
Hollywood to see the From Dusk Till Dawn maze coming this
. <|endoftext|> Ill ust rious Illum ination : Incredible TR ON
were negotiating over fees for collection , preservation , and transport
. <|endoftext|> WASHINGTON A Cy gn us cargo mission to
: blended roles that require compet ency in both .
somewhere : REF LECT _ CLASS _ ST ART
Kil me ade and Me gyn Kelly . Tr
occurring organic nitrogen - containing bases ) properties , something synt
Batman and Robin was unfor g ivable well , I
Sh and ling 1997 John Lith gow 1997 Michael
Sh and ling 1999 John Lith gow 1999 Michael

MIN ACTIVATIONS
MIN = -12.924

: https :// t . co / c Q 3 x
. https :// t . co / ni WB qv fi
Report https :// t . co / l g S wh
. https :// t . co / P 9 gc N
https :// t . co / X h 7 J
W add ell (@ s hel by y 512
https :// t . co / c Q 3 x mb
com / watch ? v = 2 g 7 p k
(@ m f lynn JR )
(@ m f lynn JR )
https :// t . co / ni WB qv fi W
Donald J . Trump (@ realDonaldTrump ) June 3 ,
o M Winnipeg Jets (@ N HL J ets )
Donald J . Trump (@ realDonaldTrump ) November 3 ,
Ren aud Lav o ie (@ ren lav o iet va
https :// t . co / l g S wh X
Donald J . Trump (@ realDonaldTrump ) June 3 ,
never called . pic . twitter . com / CS 3
to follow the hashtag # femin in ew omen for
pic . twitter . com / Eh U g I x

INTERVAL 5.504 - 11.647
CONTAINS 0.274%

with an army general ; Mr . Ram í rez also
, repeat ability , and surviv ability ) become embedded within
conclusion . While all three titles t ipt oe into min
norm ally protective lact ob ac illi
, procedural maps were already bog ging us in two factors

INTERVAL -0.638 - 5.504
CONTAINS 65.886%

ong , New Orleans is a big hub for Vietnamese families
toys , there is something for every kid . Some toys
gets in the way , as it 's possible for the
one sign of the government s nervous ness about
in controlling the crowd of fren z ied bargain - seekers

INTERVAL -6.781 - -0.638
CONTAINS 33.778%

around the eyes , nose and feet are the most common
be killed . Sun an Abu Daw ud 14 : 26
, and to show there is a global resistance to him
into their home to investigate the activity overnight . We really
/ 3 IP Nick Vincent : 3 . 60

INTERVAL -12.924 - -6.781
CONTAINS 0.062%

MD , Pruitt sent an email to supporters of one federal
pic . twitter . com / Eh U g I x
http :// www . ch ucky e ager . org
(@ m f lynn JR )
we 're continuing with maps # 85 - 61 , and
Token
Token ID198
Feature activation-9.44e-01

Pos logit contributions
ellen+0.200
aeda+0.189
ochet+0.183
uden+0.181
 therap+0.178
Neg logit contributions
selves-0.112
 Lords-0.110
 cabinets-0.104
 coy-0.104
 Desk-0.103
Token
Token ID198
Feature activation+1.99e-01

Pos logit contributions
ellen+0.817
aeda+0.749
ochet+0.746
uden+0.720
Detailed+0.700
Neg logit contributions
selves-0.546
 Lords-0.538
 coy-0.526
 cabinets-0.522
 Desk-0.518
TokenCommon
Token ID17227
Feature activation+1.45e+00

Pos logit contributions
selves+0.139
 Lords+0.137
 cabinets+0.131
 coy+0.131
 Desk+0.131
Neg logit contributions
ellen-0.151
aeda-0.142
ochet-0.136
uden-0.134
 therap-0.133
Token Bounty
Token ID38166
Feature activation+2.79e+00

Pos logit contributions
selves+0.741
 Lords+0.733
 Desk+0.687
 cabinets+0.687
 Islands+0.686
Neg logit contributions
ellen-1.383
aeda-1.304
ochet-1.264
uden-1.251
 therap-1.231
Tokengate
Token ID10494
Feature activation+8.48e-01

Pos logit contributions
selves+1.458
 Lords+1.448
 Desk+1.363
 Islands+1.361
 coy+1.351
Neg logit contributions
ellen-2.615
aeda-2.482
ochet-2.387
uden-2.359
 therap-2.344
Token Lit
Token ID25659
Feature activation+1.16e+01

Pos logit contributions
selves+0.447
 Lords+0.440
 cabinets+0.416
 coy+0.415
 Desk+0.413
Neg logit contributions
ellen-0.795
aeda-0.751
ochet-0.726
uden-0.716
 therap-0.707
Tokenigation
Token ID7065
Feature activation+3.91e+00

Pos logit contributions
selves+1.740
 Lords+1.528
 Banks+1.426
 Desk+1.335
 suits+1.330
Neg logit contributions
ellen-14.708
aeda-14.016
 therap-13.725
ochet-13.705
uden-13.305
Token Q
Token ID1195
Feature activation+2.30e+00

Pos logit contributions
selves+2.004
 Lords+2.000
 Desk+1.886
 Islands+1.878
 cabinets+1.868
Neg logit contributions
ellen-3.715
aeda-3.517
ochet-3.401
uden-3.345
 therap-3.320
Token&
Token ID5
Feature activation+3.12e+00

Pos logit contributions
selves+1.143
 Lords+1.126
 cabinets+1.056
 coy+1.055
 Islands+1.054
Neg logit contributions
ellen-2.223
aeda-2.107
ochet-2.042
uden-2.013
 therap-1.994
TokenAs
Token ID1722
Feature activation+1.47e-01

Pos logit contributions
selves+2.106
 Lords+2.080
 cabinets+1.983
 coy+1.982
 Desk+1.981
Neg logit contributions
ellen-2.469
aeda-2.311
ochet-2.214
uden-2.176
 therap-2.150
Token.
Token ID13
Feature activation+1.83e+00

Pos logit contributions
selves+0.081
 Lords+0.080
 cabinets+0.076
 coy+0.076
 Islands+0.075
Neg logit contributions
ellen-0.134
aeda-0.126
ochet-0.122
uden-0.121
 therap-0.119
Tokenbuild
Token ID11249
Feature activation+1.14e+00

Pos logit contributions
selves+0.047
 Lords+0.047
 cabinets+0.045
 coy+0.044
 Desk+0.044
Neg logit contributions
ellen-0.067
aeda-0.062
ochet-0.060
uden-0.059
 therap-0.059
Tokenings
Token ID654
Feature activation+7.31e-01

Pos logit contributions
selves+0.579
 Lords+0.578
 Desk+0.542
 Islands+0.539
 coy+0.537
Neg logit contributions
ellen-1.076
aeda-1.008
ochet-0.984
uden-0.968
 therap-0.956
Token.
Token ID13
Feature activation+1.45e+00

Pos logit contributions
selves+0.399
 Lords+0.392
 cabinets+0.370
 Desk+0.370
 coy+0.370
Neg logit contributions
ellen-0.671
aeda-0.632
ochet-0.612
uden-0.603
 therap-0.596
Token The
Token ID383
Feature activation+1.64e+00

Pos logit contributions
selves+0.781
 Lords+0.762
 coy+0.731
 cabinets+0.729
 Desk+0.719
Neg logit contributions
ellen-1.332
aeda-1.255
ochet-1.216
uden-1.195
 therap-1.170
Token signaling
Token ID22049
Feature activation-7.44e-01

Pos logit contributions
selves+0.939
 Lords+0.922
 cabinets+0.875
 Desk+0.872
 coy+0.871
Neg logit contributions
ellen-1.464
aeda-1.379
ochet-1.332
uden-1.310
 therap-1.295
Token mast
Token ID17288
Feature activation+1.14e+01

Pos logit contributions
ellen+0.652
aeda+0.613
ochet+0.590
uden+0.578
 therap+0.577
Neg logit contributions
selves-0.431
 Lords-0.416
 Shiva-0.392
 Desk-0.392
 Islands-0.391
Token was
Token ID373
Feature activation-2.08e+00

Pos logit contributions
 Lords+5.732
selves+5.522
 cabinets+5.412
 coy+5.370
 Desk+5.231
Neg logit contributions
ellen-10.172
aeda-9.790
ochet-9.258
 therap-9.231
uden-9.217
Token bare
Token ID6247
Feature activation+3.70e+00

Pos logit contributions
ellen+1.904
aeda+1.781
ochet+1.749
uden+1.703
 therap+1.693
Neg logit contributions
selves-1.134
 Lords-1.097
 Desk-1.040
 cabinets-1.036
 Islands-1.024
Token in
Token ID287
Feature activation+6.24e-01

Pos logit contributions
selves+1.880
 Lords+1.856
 cabinets+1.769
 coy+1.760
 Islands+1.744
Neg logit contributions
ellen-3.530
aeda-3.331
ochet-3.200
uden-3.176
 therap-3.150
Token the
Token ID262
Feature activation-8.63e-01

Pos logit contributions
selves+0.355
 Lords+0.349
 cabinets+0.332
 coy+0.331
 Desk+0.330
Neg logit contributions
ellen-0.560
aeda-0.527
ochet-0.509
uden-0.501
 therap-0.495
Token wind
Token ID2344
Feature activation+3.39e+00

Pos logit contributions
ellen+0.759
aeda+0.715
ochet+0.690
uden+0.680
 therap+0.675
Neg logit contributions
selves-0.502
 Lords-0.483
 Desk-0.458
 cabinets-0.454
 coy-0.453
Token consume
Token ID15000
Feature activation+3.17e-01

Pos logit contributions
ellen+1.847
aeda+1.723
ochet+1.673
uden+1.646
 therap+1.595
Neg logit contributions
selves-1.462
 Lords-1.445
 Desk-1.391
 Islands-1.384
 Shiva-1.371
Token alcoholic
Token ID26016
Feature activation+1.66e+00

Pos logit contributions
selves+0.163
 Lords+0.159
 cabinets+0.150
 Desk+0.149
 coy+0.149
Neg logit contributions
ellen-0.300
aeda-0.283
ochet-0.274
uden-0.271
 therap-0.268
Token beverages
Token ID24173
Feature activation+1.43e+00

Pos logit contributions
selves+0.517
 Lords+0.504
 cabinets+0.457
 coy+0.455
 Desk+0.452
Neg logit contributions
ellen-1.918
aeda-1.831
ochet-1.782
uden-1.762
 therap-1.746
Token.
Token ID13
Feature activation-5.74e-01

Pos logit contributions
selves+0.803
 Lords+0.791
 cabinets+0.750
 coy+0.749
 Desk+0.745
Neg logit contributions
ellen-1.293
aeda-1.219
ochet-1.177
uden-1.159
 therap-1.146
Token The
Token ID383
Feature activation+5.14e-01

Pos logit contributions
ellen+0.520
aeda+0.490
ochet+0.474
uden+0.465
 therap+0.455
Neg logit contributions
selves-0.317
 Lords-0.310
 cabinets-0.299
 coy-0.298
 Desk-0.292
Token decl
Token ID2377
Feature activation+1.11e+01

Pos logit contributions
selves+0.292
 Lords+0.286
 cabinets+0.272
 coy+0.272
 Desk+0.270
Neg logit contributions
ellen-0.459
aeda-0.432
ochet-0.418
uden-0.412
 therap-0.407
Tokenination
Token ID1883
Feature activation+3.61e+00

Pos logit contributions
selves+5.447
 Lords+5.355
ials+5.200
 Islands+5.194
 coy+5.128
Neg logit contributions
ellen-9.488
aeda-9.231
 therap-9.162
[undecodable bytes]-8.538
Detailed-8.493
Token to
Token ID284
Feature activation+1.37e+00

Pos logit contributions
selves+1.937
 Lords+1.922
 cabinets+1.824
 coy+1.818
 Desk+1.815
Neg logit contributions
ellen-3.349
aeda-3.166
ochet-3.056
uden-3.011
 therap-2.971
Token file
Token ID2393
Feature activation+2.71e+00

Pos logit contributions
selves+0.829
 Lords+0.817
 cabinets+0.778
 coy+0.776
 Desk+0.774
Neg logit contributions
ellen-1.182
aeda-1.109
ochet-1.070
uden-1.053
 therap-1.040
Token criminal
Token ID4301
Feature activation+1.18e+00

Pos logit contributions
selves+1.089
 Lords+1.072
 cabinets+1.002
 coy+0.996
 Desk+0.985
Neg logit contributions
ellen-2.871
aeda-2.736
ochet-2.652
uden-2.621
 therap-2.593
Token charges
Token ID4530
Feature activation+1.21e+00

Pos logit contributions
selves+0.361
 Lords+0.352
 cabinets+0.317
 coy+0.316
 Desk+0.316
Neg logit contributions
ellen-1.360
aeda-1.297
ochet-1.264
uden-1.250
 therap-1.238
Token breast
Token ID9296
Feature activation+2.06e+00

Pos logit contributions
selves+0.640
 Lords+0.633
 coy+0.598
 cabinets+0.597
 Desk+0.594
Neg logit contributions
ellen-1.055
aeda-0.995
ochet-0.961
uden-0.949
 therap-0.935
Token-
Token ID12
Feature activation-9.23e-01

Pos logit contributions
selves+1.216
 Lords+1.204
 coy+1.151
 cabinets+1.148
 Islands+1.137
Neg logit contributions
ellen-1.795
aeda-1.693
ochet-1.627
uden-1.610
 therap-1.584
Tokenblood
Token ID18041
Feature activation+1.22e+00

Pos logit contributions
ellen+0.721
aeda+0.672
ochet+0.646
uden+0.634
 therap+0.625
Neg logit contributions
selves-0.630
 Lords-0.623
 cabinets-0.597
 coy-0.595
 Desk-0.595
Token spew
Token ID38645
Feature activation+2.04e-01

Pos logit contributions
selves+0.687
 Lords+0.677
 cabinets+0.642
 coy+0.641
 Desk+0.638
Neg logit contributions
ellen-1.104
aeda-1.041
ochet-1.005
uden-0.991
 therap-0.978
Tokening
Token ID278
Feature activation-1.34e+00

Pos logit contributions
selves+0.130
 Lords+0.129
 cabinets+0.123
 coy+0.123
 Desk+0.122
Neg logit contributions
ellen-0.168
aeda-0.157
ochet-0.151
uden-0.149
 therap-0.147
Token Vul
Token ID25442
Feature activation+1.00e+01

Pos logit contributions
ellen+1.189
aeda+1.122
ochet+1.088
uden+1.068
 therap+1.064
Neg logit contributions
selves-0.760
 Lords-0.743
 Desk-0.713
 cabinets-0.713
 Islands-0.709
Tokenv
Token ID85
Feature activation+5.63e+00

Pos logit contributions
 Lords+6.646
selves+6.544
 Islands+6.242
 cabinets+6.238
 Unic+6.207
Neg logit contributions
ellen-7.386
aeda-6.921
 therap-6.715
ochet-6.683
uden-6.486
Tokenatron
Token ID23484
Feature activation+1.14e+00

Pos logit contributions
selves+4.127
 Lords+4.100
 Islands+3.949
 coy+3.919
 Shiva+3.907
Neg logit contributions
ellen-3.895
aeda-3.654
ochet-3.465
 therap-3.455
uden-3.406
Token.
Token ID13
Feature activation+6.38e-01

Pos logit contributions
selves+0.648
 Lords+0.640
 cabinets+0.607
 coy+0.606
 Desk+0.603
Neg logit contributions
ellen-1.018
aeda-0.959
ochet-0.925
uden-0.912
 therap-0.901
Token
Token ID198
Feature activation+2.03e+00

Pos logit contributions
selves+0.340
 Lords+0.332
 cabinets+0.320
 coy+0.318
 Desk+0.315
Neg logit contributions
ellen-0.593
aeda-0.558
ochet-0.541
uden-0.532
 therap-0.523
Token
Token ID198
Feature activation+2.13e+00

Pos logit contributions
selves+1.069
 Lords+1.051
 coy+1.003
 cabinets+1.003
 Desk+0.996
Neg logit contributions
ellen-1.903
aeda-1.783
ochet-1.742
uden-1.704
 therap-1.633
Tokense
Token ID325
Feature activation-1.02e+00

Pos logit contributions
selves+2.751
 Lords+2.736
 coy+2.629
 Islands+2.627
 cabinets+2.625
Neg logit contributions
ellen-2.116
aeda-1.938
ochet-1.840
uden-1.795
 therap-1.783
Tokenus
Token ID385
Feature activation-1.11e+00

Pos logit contributions
ellen+0.772
aeda+0.719
ochet+0.688
uden+0.673
 therap+0.668
Neg logit contributions
selves-0.721
 Lords-0.716
 cabinets-0.685
 coy-0.683
 Islands-0.683
Token tied
Token ID8165
Feature activation-1.95e+00

Pos logit contributions
ellen+1.027
aeda+0.970
ochet+0.937
uden+0.928
 therap+0.916
Neg logit contributions
selves-0.599
 Lords-0.586
 cabinets-0.557
 Desk-0.553
 coy-0.551
Token to
Token ID284
Feature activation-1.24e+00

Pos logit contributions
ellen+1.695
aeda+1.625
ochet+1.569
 therap+1.542
uden+1.538
Neg logit contributions
selves-1.117
 Lords-1.086
 Desk-1.061
 cabinets-1.048
 Islands-1.040
Token the
Token ID262
Feature activation-9.39e-01

Pos logit contributions
ellen+1.138
aeda+1.084
ochet+1.049
 therap+1.031
uden+1.029
Neg logit contributions
selves-0.672
 Lords-0.651
 cabinets-0.620
 Desk-0.619
 Islands-0.613
Token mast
Token ID17288
Feature activation+9.98e+00

Pos logit contributions
ellen+0.831
aeda+0.787
ochet+0.760
uden+0.747
 therap+0.746
Neg logit contributions
selves-0.534
 Lords-0.519
 cabinets-0.497
 Desk-0.495
 coy-0.492
Token,
Token ID11
Feature activation-1.19e+00

Pos logit contributions
 Lords+6.135
selves+5.973
 Desk+5.919
 cabinets+5.902
 coy+5.870
Neg logit contributions
ellen-7.778
aeda-7.202
 therap-6.958
uden-6.863
ochet-6.835
Token sailing
Token ID28499
Feature activation+5.87e-01

Pos logit contributions
ellen+1.096
aeda+1.039
ochet+1.005
uden+0.990
 therap+0.984
Neg logit contributions
selves-0.644
 Lords-0.625
 cabinets-0.598
 Desk-0.594
 Islands-0.591
Token away
Token ID1497
Feature activation-8.51e-01

Pos logit contributions
selves+0.347
 Lords+0.340
 Desk+0.325
 cabinets+0.324
 coy+0.322
Neg logit contributions
ellen-0.510
aeda-0.481
ochet-0.464
uden-0.456
 therap-0.453
Token from
Token ID422
Feature activation+2.51e+00

Pos logit contributions
ellen+0.781
aeda+0.740
ochet+0.719
uden+0.706
 therap+0.698
Neg logit contributions
selves-0.456
 Lords-0.448
 Desk-0.427
 cabinets-0.427
 coy-0.422
Token the
Token ID262
Feature activation+5.26e-01

Pos logit contributions
 Lords+1.322
selves+1.322
 coy+1.248
 cabinets+1.242
 Islands+1.241
Neg logit contributions
ellen-2.358
aeda-2.213
ochet-2.142
uden-2.122
 therap-2.080
Token with
Token ID351
Feature activation-8.69e-01

Pos logit contributions
selves+1.391
 Lords+1.370
 cabinets+1.300
 coy+1.297
 Desk+1.292
Neg logit contributions
ellen-2.257
aeda-2.127
ochet-2.054
uden-2.025
 therap-2.000
Token you
Token ID345
Feature activation-9.45e-01

Pos logit contributions
ellen+0.775
aeda+0.728
ochet+0.706
uden+0.691
 therap+0.683
Neg logit contributions
selves-0.497
 Lords-0.481
 cabinets-0.460
 Desk-0.459
 Islands-0.458
Token,
Token ID11
Feature activation+9.88e-01

Pos logit contributions
ellen+0.888
aeda+0.831
ochet+0.811
uden+0.801
 therap+0.783
Neg logit contributions
selves-0.488
 Lords-0.479
 Desk-0.456
 cabinets-0.455
 Islands-0.452
Token bear
Token ID6842
Feature activation+7.41e-01

Pos logit contributions
selves+0.526
 Lords+0.515
 cabinets+0.488
 Desk+0.486
 coy+0.485
Neg logit contributions
ellen-0.921
aeda-0.869
ochet-0.841
uden-0.830
 therap-0.820
Token the
Token ID262
Feature activation-2.56e-01

Pos logit contributions
selves+0.375
 Lords+0.369
 cabinets+0.348
 Desk+0.346
 coy+0.346
Neg logit contributions
ellen-0.709
aeda-0.671
ochet-0.650
uden-0.641
 therap-0.633
Token decl
Token ID2377
Feature activation+9.58e+00

Pos logit contributions
ellen+0.225
aeda+0.211
ochet+0.205
uden+0.200
 therap+0.199
Neg logit contributions
selves-0.148
 Lords-0.143
 Desk-0.138
 cabinets-0.137
 Islands-0.137
Tokenination
Token ID1883
Feature activation+2.16e+00

Pos logit contributions
selves+5.720
 Lords+5.431
 coy+5.236
ials+5.231
 Islands+5.231
Neg logit contributions
ellen-7.570
 therap-7.288
aeda-7.226
uden-6.701
Detailed-6.661
Token with
Token ID351
Feature activation-5.32e-01

Pos logit contributions
selves+1.161
 Lords+1.149
 coy+1.088
 cabinets+1.085
 Desk+1.078
Neg logit contributions
ellen-2.000
aeda-1.890
ochet-1.823
uden-1.800
 therap-1.779
Token becoming
Token ID5033
Feature activation-3.04e-02

Pos logit contributions
ellen+0.489
aeda+0.460
ochet+0.446
uden+0.439
 therap+0.435
Neg logit contributions
selves-0.289
 Lords-0.282
 cabinets-0.268
 Desk-0.267
 Islands-0.267
Token grace
Token ID11542
Feature activation-6.41e-01

Pos logit contributions
ellen+0.027
aeda+0.026
ochet+0.025
uden+0.024
 therap+0.024
Neg logit contributions
selves-0.017
 Lords-0.017
 cabinets-0.016
 Desk-0.016
 Islands-0.016
Token;
Token ID26
Feature activation+2.62e+00

Pos logit contributions
ellen+0.567
aeda+0.530
ochet+0.515
uden+0.507
 therap+0.498
Neg logit contributions
selves-0.372
 Lords-0.365
 cabinets-0.347
 Desk-0.347
 Islands-0.345
Token for
Token ID329
Feature activation-1.14e+00

Pos logit contributions
selves+0.993
 Lords+0.979
 coy+0.925
 cabinets+0.924
 Desk+0.918
Neg logit contributions
ellen-1.827
aeda-1.727
ochet-1.669
uden-1.647
 therap-1.631
Token def
Token ID825
Feature activation+4.11e+00

Pos logit contributions
ellen+1.012
aeda+0.957
ochet+0.920
uden+0.911
 therap+0.894
Neg logit contributions
selves-0.654
 Lords-0.641
 Desk-0.608
 cabinets-0.604
 coy-0.602
Tokenraud
Token ID22863
Feature activation+2.38e+00

Pos logit contributions
 Lords+2.218
selves+2.213
 coy+2.097
 cabinets+2.095
 Islands+2.092
Neg logit contributions
ellen-3.739
aeda-3.535
 therap-3.410
ochet-3.369
uden-3.331
Tokening
Token ID278
Feature activation+5.16e-01

Pos logit contributions
selves+1.463
 Lords+1.444
 coy+1.376
 cabinets+1.376
 Desk+1.369
Neg logit contributions
ellen-2.023
aeda-1.901
ochet-1.826
uden-1.799
 therap-1.782
Token its
Token ID663
Feature activation+1.17e+00

Pos logit contributions
selves+0.286
 Lords+0.282
 cabinets+0.267
 coy+0.267
 Desk+0.266
Neg logit contributions
ellen-0.469
aeda-0.441
ochet-0.427
uden-0.420
 therap-0.415
Token Lif
Token ID21073
Feature activation+9.57e+00

Pos logit contributions
selves+0.587
 Lords+0.577
 cabinets+0.544
 coy+0.543
 Desk+0.541
Neg logit contributions
ellen-1.123
aeda-1.062
ochet-1.028
uden-1.014
 therap-1.003
Tokeneline
Token ID4470
Feature activation+1.31e+00

Pos logit contributions
selves+3.530
 Lords+3.476
 Desk+3.435
ials+3.269
 Unic+3.244
Neg logit contributions
ellen-9.750
aeda-9.481
 therap-9.040
ochet-8.982
uden-8.811
Token program
Token ID1430
Feature activation-9.24e-01

Pos logit contributions
selves+0.654
 Lords+0.641
 cabinets+0.607
 Desk+0.604
 Islands+0.604
Neg logit contributions
ellen-1.262
aeda-1.197
ochet-1.156
uden-1.142
 therap-1.126
Token,
Token ID11
Feature activation+1.16e+00

Pos logit contributions
ellen+0.833
aeda+0.785
ochet+0.757
uden+0.746
 therap+0.736
Neg logit contributions
selves-0.518
 Lords-0.511
 cabinets-0.483
 coy-0.481
 Desk-0.481
Token which
Token ID543
Feature activation-8.72e-01

Pos logit contributions
selves+0.607
 Lords+0.597
 cabinets+0.564
 coy+0.563
 Desk+0.561
Neg logit contributions
ellen-1.090
aeda-1.030
ochet-0.996
uden-0.982
 therap-0.971
Token subsid
Token ID7905
Feature activation+1.67e+00

Pos logit contributions
ellen+0.778
aeda+0.725
ochet+0.702
uden+0.692
 therap+0.687
Neg logit contributions
selves-0.499
 Lords-0.489
 cabinets-0.459
 Desk-0.457
 coy-0.456
Token Vault
Token ID23450
Feature activation+5.39e-01

Pos logit contributions
ellen+1.789
aeda+1.696
ochet+1.641
uden+1.610
 therap+1.597
Neg logit contributions
selves-0.996
 Lords-0.956
 coy-0.920
 Desk-0.912
 cabinets-0.907
Token.
Token ID13
Feature activation+6.89e-01

Pos logit contributions
selves+0.312
 Lords+0.309
 cabinets+0.294
 coy+0.293
 Desk+0.291
Neg logit contributions
ellen-0.477
aeda-0.448
ochet-0.432
uden-0.426
 therap-0.421
Token There
Token ID1318
Feature activation+2.98e-01

Pos logit contributions
selves+0.370
 Lords+0.357
 cabinets+0.348
 coy+0.347
 Desk+0.339
Neg logit contributions
ellen-0.634
aeda-0.599
ochet-0.581
uden-0.569
 therap-0.554
Token are
Token ID389
Feature activation+2.69e+00

Pos logit contributions
selves+0.169
 Lords+0.167
 cabinets+0.159
 Desk+0.158
 coy+0.158
Neg logit contributions
ellen-0.267
aeda-0.251
ochet-0.243
uden-0.239
 therap-0.236
Token some
Token ID617
Feature activation+2.88e+00

Pos logit contributions
selves+1.397
 Lords+1.382
 cabinets+1.305
 coy+1.303
 Islands+1.294
Neg logit contributions
ellen-2.542
aeda-2.402
ochet-2.322
uden-2.291
 therap-2.263
Token fram
Token ID5346
Feature activation+9.43e+00

Pos logit contributions
selves+1.753
 Lords+1.749
 coy+1.666
 cabinets+1.663
 Islands+1.655
Neg logit contributions
ellen-2.462
aeda-2.311
ochet-2.227
uden-2.189
 therap-2.156
Tokenerate
Token ID21620
Feature activation-6.58e-01

Pos logit contributions
selves+5.556
 Lords+5.464
ials+5.277
 Islands+5.267
 coy+5.257
Neg logit contributions
ellen-7.492
aeda-7.146
 therap-7.063
uden-6.887
ochet-6.691
Token flaws
Token ID17978
Feature activation+2.77e+00

Pos logit contributions
ellen+0.590
aeda+0.553
ochet+0.537
uden+0.529
 therap+0.527
Neg logit contributions
selves-0.372
 Lords-0.358
 Desk-0.344
 cabinets-0.340
 Islands-0.339
Token present
Token ID1944
Feature activation+3.94e+00

Pos logit contributions
selves+1.363
 Lords+1.353
 coy+1.272
 cabinets+1.263
 Islands+1.259
Neg logit contributions
ellen-2.688
aeda-2.551
ochet-2.448
uden-2.430
 therap-2.410
Token that
Token ID326
Feature activation+2.12e+00

Pos logit contributions
selves+1.850
 Lords+1.822
 coy+1.723
 cabinets+1.705
 Islands+1.690
Neg logit contributions
ellen-3.904
aeda-3.696
ochet-3.543
uden-3.542
 therap-3.516
Token don
Token ID836
Feature activation+1.17e+00

Pos logit contributions
selves+1.225
 Lords+1.213
 coy+1.150
 cabinets+1.150
 Desk+1.144
Neg logit contributions
ellen-1.875
aeda-1.768
ochet-1.701
uden-1.678
 therap-1.658
Token all
Token ID477
Feature activation-1.45e+00

Pos logit contributions
ellen+0.848
aeda+0.796
ochet+0.772
uden+0.763
 therap+0.752
Neg logit contributions
selves-0.504
 Lords-0.488
 cabinets-0.465
 Desk-0.462
 coy-0.460
Token.
Token ID13
Feature activation-2.67e-01

Pos logit contributions
ellen+1.277
aeda+1.197
ochet+1.159
uden+1.139
 therap+1.119
Neg logit contributions
selves-0.844
 Lords-0.825
 cabinets-0.789
 coy-0.782
 Desk-0.782
Token
Token ID198
Feature activation-1.27e-01

Pos logit contributions
ellen+0.246
aeda+0.231
ochet+0.224
uden+0.220
 therap+0.213
Neg logit contributions
selves-0.143
 Lords-0.140
 cabinets-0.136
 coy-0.135
 Islands-0.132
Token
Token ID198
Feature activation+5.74e-01

Pos logit contributions
ellen+0.113
aeda+0.105
ochet+0.104
uden+0.100
Detailed+0.097
Neg logit contributions
selves-0.071
 Lords-0.070
 coy-0.068
 cabinets-0.068
 Desk-0.067
TokenThe
Token ID464
Feature activation+4.72e-01

Pos logit contributions
selves+0.416
 Lords+0.410
 cabinets+0.393
 coy+0.393
 Desk+0.392
Neg logit contributions
ellen-0.424
aeda-0.397
ochet-0.378
uden-0.372
 therap-0.370
Token presumptive
Token ID39715
Feature activation+9.21e+00

Pos logit contributions
selves+0.260
 Lords+0.256
 cabinets+0.243
 coy+0.241
 Desk+0.241
Neg logit contributions
ellen-0.429
aeda-0.404
ochet-0.390
uden-0.385
 therap-0.381
Token GOP
Token ID6796
Feature activation+3.85e+00

Pos logit contributions
 Lords+4.433
 cabinets+4.385
selves+4.366
 coy+4.340
 Desk+4.277
Neg logit contributions
ellen-8.688
aeda-8.369
uden-8.020
ochet-7.999
 therap-7.756
Token presidential
Token ID4787
Feature activation+4.67e+00

Pos logit contributions
selves+2.172
 Lords+2.143
 coy+2.063
 cabinets+2.060
 Desk+2.056
Neg logit contributions
ellen-3.451
aeda-3.265
ochet-3.131
uden-3.110
 therap-3.063
Token nominee
Token ID10429
Feature activation+4.72e+00

Pos logit contributions
selves+2.295
 Lords+2.295
 coy+2.225
 cabinets+2.215
 Desk+2.175
Neg logit contributions
ellen-4.480
aeda-4.233
ochet-4.091
uden-4.072
 therap-4.000
Token said
Token ID531
Feature activation+1.58e+00

Pos logit contributions
selves+2.422
 Lords+2.399
 coy+2.327
 cabinets+2.281
 Islands+2.253
Neg logit contributions
ellen-4.439
aeda-4.252
ochet-4.101
uden-4.024
 therap-3.980
Token Friday
Token ID3217
Feature activation+1.76e+00

Pos logit contributions
selves+0.778
 Lords+0.764
 cabinets+0.720
 coy+0.716
 Desk+0.715
Neg logit contributions
ellen-1.542
aeda-1.459
ochet-1.413
uden-1.395
 therap-1.378
Token Hollywood
Token ID8502
Feature activation+2.20e+00

Pos logit contributions
selves+0.396
 Lords+0.387
 cabinets+0.366
 coy+0.365
 Desk+0.365
Neg logit contributions
ellen-0.672
aeda-0.635
ochet-0.613
uden-0.605
 therap-0.598
Token to
Token ID284
Feature activation-7.01e-01

Pos logit contributions
selves+1.098
 Lords+1.074
 cabinets+1.011
 coy+1.008
 Desk+1.005
Neg logit contributions
ellen-2.128
aeda-2.010
ochet-1.949
uden-1.923
 therap-1.901
Token see
Token ID766
Feature activation-4.43e-02

Pos logit contributions
ellen+0.588
aeda+0.550
ochet+0.537
uden+0.529
 therap+0.518
Neg logit contributions
selves-0.437
 Lords-0.424
 cabinets-0.405
 Desk-0.403
 Islands-0.400
Token the
Token ID262
Feature activation-5.92e-01

Pos logit contributions
ellen+0.042
aeda+0.039
ochet+0.038
uden+0.038
 therap+0.037
Neg logit contributions
selves-0.023
 Lords-0.022
 cabinets-0.021
 coy-0.021
 Desk-0.021
Token From
Token ID3574
Feature activation-1.51e+00

Pos logit contributions
ellen+0.489
aeda+0.460
ochet+0.444
uden+0.438
 therap+0.432
Neg logit contributions
selves-0.373
 Lords-0.361
 cabinets-0.347
 coy-0.347
 Desk-0.346
Token Dusk
Token ID42553
Feature activation+9.18e+00

Pos logit contributions
ellen+1.276
aeda+1.204
ochet+1.179
uden+1.165
 therap+1.128
Neg logit contributions
selves-0.914
 Lords-0.853
 cabinets-0.842
 Desk-0.827
 coy-0.825
Token Till
Token ID17888
Feature activation-1.28e+00

Pos logit contributions
 Lords+5.333
 coy+5.198
selves+5.018
 Standing+4.991
 Islands+4.969
Neg logit contributions
ellen-7.766
aeda-7.627
ochet-7.148
 therap-7.147
uden-7.016
Token Dawn
Token ID12258
Feature activation+4.21e+00

Pos logit contributions
ellen+1.185
aeda+1.116
uden+1.104
ochet+1.091
 therap+1.045
Neg logit contributions
selves-0.644
 cabinets-0.628
 Lords-0.612
 coy-0.605
 suits-0.573
Token maze
Token ID31237
Feature activation-1.83e+00

Pos logit contributions
 Lords+2.525
selves+2.493
 cabinets+2.391
 coy+2.386
 Islands+2.383
Neg logit contributions
ellen-3.669
aeda-3.432
ochet-3.308
uden-3.244
 therap-3.194
Token coming
Token ID2406
Feature activation-6.17e-01

Pos logit contributions
ellen+1.674
aeda+1.572
uden+1.526
ochet+1.526
 therap+1.482
Neg logit contributions
selves-0.992
 Lords-0.940
 Desk-0.904
 Islands-0.899
 coy-0.894
Token this
Token ID428
Feature activation+4.24e-01

Pos logit contributions
ellen+0.580
aeda+0.549
ochet+0.531
uden+0.525
 therap+0.516
Neg logit contributions
selves-0.322
 Lords-0.315
 Desk-0.299
 cabinets-0.298
 coy-0.296
Token.
Token ID13
Feature activation+5.10e-02

Pos logit contributions
selves+0.377
 Lords+0.372
 cabinets+0.353
 coy+0.352
 Desk+0.351
Neg logit contributions
ellen-0.573
aeda-0.539
ochet-0.520
uden-0.512
 therap-0.506
Token<|endoftext|>
Token ID50256
Feature activation-3.65e+00

Pos logit contributions
selves+0.027
 Lords+0.026
 cabinets+0.025
 coy+0.025
 Islands+0.025
Neg logit contributions
ellen-0.047
aeda-0.044
ochet-0.043
uden-0.042
 therap-0.041
TokenIll
Token ID21478
Feature activation-8.93e-01

Pos logit contributions
ellen+2.621
aeda+2.384
ochet+2.300
uden+2.232
 therap+2.154
Neg logit contributions
selves-2.729
 Lords-2.710
 cabinets-2.619
 coy-2.611
 Desk-2.597
Tokenust
Token ID436
Feature activation+1.70e+00

Pos logit contributions
ellen+0.732
aeda+0.685
ochet+0.660
uden+0.649
 therap+0.638
Neg logit contributions
selves-0.579
 Lords-0.568
 coy-0.544
 cabinets-0.544
 Desk-0.540
Tokenrious
Token ID32527
Feature activation+3.48e+00

Pos logit contributions
selves+0.982
 Lords+0.976
 Desk+0.924
 cabinets+0.924
 coy+0.923
Neg logit contributions
ellen-1.492
aeda-1.407
ochet-1.360
uden-1.337
 therap-1.329
Token Illum
Token ID39256
Feature activation+9.17e+00

Pos logit contributions
selves+2.039
 Lords+2.038
 coy+1.933
 Islands+1.925
 Desk+1.922
Neg logit contributions
ellen-3.026
aeda-2.871
ochet-2.750
uden-2.728
 therap-2.670
Tokenination
Token ID1883
Feature activation+3.98e+00

Pos logit contributions
 Lords+3.890
selves+3.883
 Desk+3.646
 cabinets+3.620
ials+3.550
Neg logit contributions
ellen-9.044
aeda-8.475
 therap-8.368
ochet-8.176
uden-8.093
Token:
Token ID25
Feature activation+5.67e+00

Pos logit contributions
 Lords+2.110
selves+2.103
 Desk+1.987
 Islands+1.977
 cabinets+1.974
Neg logit contributions
ellen-3.695
aeda-3.503
ochet-3.379
uden-3.332
 therap-3.291
Token Incredible
Token ID33832
Feature activation-1.08e+00

Pos logit contributions
 Lords+3.034
selves+2.991
 Shiva+2.834
 Desk+2.831
 Islands+2.831
Neg logit contributions
ellen-5.216
aeda-4.947
ochet-4.765
uden-4.707
 therap-4.684
Token TR
Token ID7579
Feature activation-2.62e+00

Pos logit contributions
ellen+1.003
aeda+0.944
ochet+0.916
uden+0.897
 therap+0.892
Neg logit contributions
selves-0.564
 Lords-0.550
 coy-0.525
 cabinets-0.525
 Desk-0.517
TokenON
Token ID1340
Feature activation-1.92e+00

Pos logit contributions
ellen+1.971
aeda+1.830
ochet+1.754
uden+1.728
 therap+1.697
Neg logit contributions
selves-1.862
 Lords-1.820
 cabinets-1.758
 coy-1.758
 Desk-1.739
Token were
Token ID547
Feature activation-2.95e-01

Pos logit contributions
ellen+0.527
aeda+0.498
ochet+0.481
uden+0.471
 therap+0.466
Neg logit contributions
selves-0.348
 Lords-0.347
 Desk-0.328
 cabinets-0.328
 Islands-0.326
Token negotiating
Token ID19025
Feature activation-1.21e+00

Pos logit contributions
ellen+0.263
aeda+0.249
ochet+0.240
uden+0.236
 therap+0.235
Neg logit contributions
selves-0.166
 Lords-0.164
 Desk-0.156
 cabinets-0.156
 Islands-0.155
Token over
Token ID625
Feature activation+1.85e+00

Pos logit contributions
ellen+1.049
aeda+0.993
ochet+0.955
 therap+0.938
uden+0.931
Neg logit contributions
selves-0.708
 Lords-0.694
 Desk-0.662
 Islands-0.660
 cabinets-0.654
Token fees
Token ID6642
Feature activation+1.94e+00

Pos logit contributions
selves+0.970
 Lords+0.959
 cabinets+0.908
 coy+0.908
 Desk+0.898
Neg logit contributions
ellen-1.743
aeda-1.645
ochet-1.590
uden-1.572
 therap-1.550
Token for
Token ID329
Feature activation+8.42e-01

Pos logit contributions
selves+1.015
 Lords+1.000
 cabinets+0.945
 coy+0.944
 Desk+0.937
Neg logit contributions
ellen-1.820
aeda-1.721
ochet-1.662
uden-1.641
 therap-1.623
Token collection
Token ID4947
Feature activation+9.16e+00

Pos logit contributions
selves+0.479
 Lords+0.472
 cabinets+0.447
 coy+0.447
 Desk+0.446
Neg logit contributions
ellen-0.754
aeda-0.710
ochet-0.686
uden-0.675
 therap-0.668
Token,
Token ID11
Feature activation+1.40e+00

Pos logit contributions
 cabinets+4.145
selves+4.128
 Lords+4.031
 suits+3.984
 coy+3.887
Neg logit contributions
ellen-9.026
aeda-8.522
ochet-8.110
uden-8.060
 therap-7.902
Token preservation
Token ID22459
Feature activation+4.63e+00

Pos logit contributions
selves+0.752
 Lords+0.740
 cabinets+0.700
 coy+0.699
 Desk+0.696
Neg logit contributions
ellen-1.292
aeda-1.220
ochet-1.178
uden-1.161
 therap-1.148
Token,
Token ID11
Feature activation+1.27e+00

Pos logit contributions
selves+2.073
 Lords+2.057
 cabinets+1.993
 coy+1.946
 Islands+1.919
Neg logit contributions
ellen-4.675
aeda-4.423
ochet-4.280
uden-4.262
 therap-4.153
Token and
Token ID290
Feature activation+1.62e+00

Pos logit contributions
selves+0.680
 Lords+0.670
 cabinets+0.635
 coy+0.633
 Desk+0.630
Neg logit contributions
ellen-1.183
aeda-1.117
ochet-1.080
uden-1.064
 therap-1.052
Token transport
Token ID4839
Feature activation+3.35e+00

Pos logit contributions
selves+0.977
 Lords+0.965
 cabinets+0.921
 coy+0.919
 Desk+0.914
Neg logit contributions
ellen-1.397
aeda-1.312
ochet-1.265
uden-1.245
 therap-1.228
Token.
Token ID13
Feature activation-5.08e-01

Pos logit contributions
selves+0.765
 Lords+0.755
 coy+0.714
 cabinets+0.711
 Islands+0.709
Neg logit contributions
ellen-1.359
aeda-1.288
ochet-1.240
uden-1.223
 therap-1.215
Token<|endoftext|>
Token ID50256
Feature activation-3.86e+00

Pos logit contributions
ellen+0.479
aeda+0.453
ochet+0.438
uden+0.432
 therap+0.427
Neg logit contributions
selves-0.266
 Lords-0.261
 cabinets-0.247
 coy-0.246
 Desk-0.246
TokenWASHINGTON
Token ID21793
Feature activation+2.24e-02

Pos logit contributions
ellen+2.775
aeda+2.522
ochet+2.435
uden+2.363
 therap+2.276
Neg logit contributions
selves-2.884
 Lords-2.865
 cabinets-2.770
 coy-2.763
 Desk-2.747
Token
Token ID851
Feature activation+3.06e+00

Pos logit contributions
selves+0.012
 Lords+0.012
 cabinets+0.012
 coy+0.011
 Islands+0.011
Neg logit contributions
ellen-0.020
aeda-0.019
ochet-0.019
uden-0.018
 therap-0.018
Token A
Token ID317
Feature activation+1.19e-01

Pos logit contributions
selves+1.583
 Lords+1.573
 Desk+1.482
 Islands+1.471
 coy+1.464
Neg logit contributions
ellen-2.891
aeda-2.736
ochet-2.644
uden-2.610
 therap-2.607
Token Cy
Token ID5934
Feature activation+9.00e+00

Pos logit contributions
selves+0.073
 Lords+0.072
 cabinets+0.069
 coy+0.068
 Desk+0.068
Neg logit contributions
ellen-0.101
aeda-0.094
ochet-0.091
uden-0.090
 therap-0.088
Tokengn
Token ID4593
Feature activation+1.96e-01

Pos logit contributions
 Lords+4.540
selves+4.429
 Standing+4.312
 Islands+4.290
 Desk+4.280
Neg logit contributions
ellen-8.123
aeda-7.613
ochet-7.519
uden-7.224
 therap-7.202
Tokenus
Token ID385
Feature activation-5.86e-01

Pos logit contributions
selves+0.141
 Lords+0.140
 cabinets+0.134
 coy+0.134
 Desk+0.134
Neg logit contributions
ellen-0.146
aeda-0.136
ochet-0.130
uden-0.128
 therap-0.126
Token cargo
Token ID15892
Feature activation+1.79e+00

Pos logit contributions
ellen+0.510
aeda+0.478
ochet+0.461
uden+0.454
 therap+0.451
Neg logit contributions
selves-0.350
 Lords-0.343
 cabinets-0.326
 coy-0.323
 Desk-0.323
Token mission
Token ID4365
Feature activation+1.26e+00

Pos logit contributions
selves+0.960
 Lords+0.953
 coy+0.908
 cabinets+0.902
 Islands+0.895
Neg logit contributions
ellen-1.662
aeda-1.568
ochet-1.514
uden-1.491
 therap-1.471
Token to
Token ID284
Feature activation+1.40e+00

Pos logit contributions
selves+0.660
 Lords+0.649
 cabinets+0.613
 coy+0.611
 Desk+0.610
Neg logit contributions
ellen-1.184
aeda-1.118
ochet-1.082
uden-1.066
 therap-1.054
Token:
Token ID25
Feature activation+2.12e+00

Pos logit contributions
ellen+2.672
aeda+2.514
ochet+2.420
uden+2.377
 therap+2.362
Neg logit contributions
selves-1.508
 Lords-1.488
 Desk-1.423
 cabinets-1.411
 coy-1.397
Token blended
Token ID32063
Feature activation-2.16e+00

Pos logit contributions
selves+1.221
 Lords+1.205
 cabinets+1.142
 Desk+1.140
 coy+1.138
Neg logit contributions
ellen-1.878
aeda-1.766
ochet-1.710
uden-1.683
 therap-1.655
Token roles
Token ID9176
Feature activation+3.72e+00

Pos logit contributions
ellen+1.841
aeda+1.744
ochet+1.671
 therap+1.660
uden+1.638
Neg logit contributions
selves-1.306
 Lords-1.248
 Islands-1.201
 coy-1.197
 Desk-1.191
Token that
Token ID326
Feature activation+2.23e+00

Pos logit contributions
selves+1.966
 Lords+1.959
 coy+1.857
 cabinets+1.853
 Desk+1.825
Neg logit contributions
ellen-3.424
aeda-3.247
ochet-3.141
uden-3.101
 therap-3.052
Token require
Token ID2421
Feature activation-1.84e+00

Pos logit contributions
selves+1.382
 Lords+1.367
 cabinets+1.303
 coy+1.302
 Desk+1.294
Neg logit contributions
ellen-1.880
aeda-1.767
ochet-1.701
uden-1.675
 therap-1.653
Token compet
Token ID2307
Feature activation+8.86e+00

Pos logit contributions
ellen+1.593
aeda+1.500
ochet+1.433
uden+1.424
 therap+1.418
Neg logit contributions
selves-1.078
 Lords-1.046
 Islands-1.006
 Desk-0.999
 cabinets-0.997
Tokenency
Token ID1387
Feature activation+2.75e+00

Pos logit contributions
 Lords+3.047
selves+3.003
 cabinets+2.870
 Desk+2.800
 Islands+2.794
Neg logit contributions
ellen-9.219
aeda-8.819
 therap-8.740
ochet-8.660
uden-8.605
Token in
Token ID287
Feature activation+1.61e+00

Pos logit contributions
selves+1.340
 Lords+1.327
 cabinets+1.252
 coy+1.242
 Islands+1.236
Neg logit contributions
ellen-2.677
aeda-2.537
ochet-2.464
uden-2.427
 therap-2.393
Token both
Token ID1111
Feature activation+1.49e-01

Pos logit contributions
selves+0.815
 Lords+0.801
 cabinets+0.755
 coy+0.753
 Desk+0.750
Neg logit contributions
ellen-1.541
aeda-1.458
ochet-1.410
uden-1.391
 therap-1.376
Token.
Token ID13
Feature activation+1.67e-01

Pos logit contributions
selves+0.079
 Lords+0.077
 cabinets+0.073
 coy+0.073
 Islands+0.073
Neg logit contributions
ellen-0.139
aeda-0.131
ochet-0.127
uden-0.125
 therap-0.124
Token
Token ID447
Feature activation-1.16e+00

Pos logit contributions
selves+0.114
 Lords+0.113
 cabinets+0.108
 coy+0.108
 Desk+0.107
Neg logit contributions
ellen-0.130
aeda-0.122
ochet-0.117
uden-0.115
 therap-0.113
Token somewhere
Token ID7382
Feature activation+3.76e+00

Pos logit contributions
ellen+2.005
aeda+1.880
ochet+1.845
uden+1.803
 therap+1.786
Neg logit contributions
selves-1.272
 Lords-1.256
 Desk-1.193
 cabinets-1.191
 Islands-1.186
Token:
Token ID25
Feature activation+2.29e+00

Pos logit contributions
selves+1.891
 Lords+1.851
 coy+1.779
 cabinets+1.757
 Islands+1.742
Neg logit contributions
ellen-3.603
aeda-3.415
ochet-3.269
uden-3.245
 therap-3.197
Token
Token ID198
Feature activation+1.55e+00

Pos logit contributions
selves+1.057
 Lords+1.046
 coy+0.989
 cabinets+0.978
 Desk+0.976
Neg logit contributions
ellen-2.294
aeda-2.185
ochet-2.098
uden-2.070
 therap-2.041
Token
Token ID198
Feature activation+1.87e+00

Pos logit contributions
selves+0.830
 Lords+0.819
 coy+0.782
 cabinets+0.780
 Desk+0.779
Neg logit contributions
ellen-1.437
aeda-1.348
ochet-1.316
uden-1.288
Detailed-1.227
TokenREF
Token ID31688
Feature activation-9.27e-01

Pos logit contributions
selves+1.258
 Lords+1.225
 coy+1.176
 cabinets+1.174
 Desk+1.169
Neg logit contributions
ellen-1.474
aeda-1.392
ochet-1.320
uden-1.302
 therap-1.300
TokenLECT
Token ID16779
Feature activation+8.83e+00

Pos logit contributions
ellen+0.796
aeda+0.748
ochet+0.722
uden+0.711
 therap+0.702
Neg logit contributions
selves-0.561
 Lords-0.552
 cabinets-0.526
 coy-0.524
 Desk-0.522
Token_
Token ID62
Feature activation+5.51e+00

Pos logit contributions
 Lords+6.626
selves+6.545
 coy+6.458
 Tropical+6.424
 Shiva+6.407
Neg logit contributions
ellen-5.706
aeda-5.516
uden-5.010
ochet-4.932
 therap-4.841
TokenCLASS
Token ID31631
Feature activation-1.82e+00

Pos logit contributions
selves+3.886
 Lords+3.802
 coy+3.649
 cabinets+3.648
 Desk+3.639
Neg logit contributions
ellen-4.050
aeda-3.887
ochet-3.645
 therap-3.600
uden-3.587
Token_
Token ID62
Feature activation+4.34e+00

Pos logit contributions
ellen+1.491
aeda+1.389
ochet+1.351
uden+1.317
 therap+1.309
Neg logit contributions
selves-1.168
 Lords-1.146
 cabinets-1.096
 Desk-1.091
 coy-1.090
TokenST
Token ID2257
Feature activation-1.17e+00

Pos logit contributions
selves+2.971
 Lords+2.938
 coy+2.799
 cabinets+2.797
 Islands+2.792
Neg logit contributions
ellen-3.333
aeda-3.148
ochet-2.969
uden-2.938
 therap-2.937
TokenART
Token ID7227
Feature activation-2.41e-01

Pos logit contributions
ellen+0.814
aeda+0.744
ochet+0.713
uden+0.698
 therap+0.680
Neg logit contributions
selves-0.902
 Lords-0.889
 coy-0.860
 cabinets-0.860
 Desk-0.853
Token Kil
Token ID19902
Feature activation+3.88e+00

Pos logit contributions
selves+0.243
 Lords+0.240
 coy+0.220
 Desk+0.220
 Islands+0.220
Neg logit contributions
ellen-0.650
aeda-0.623
ochet-0.601
uden-0.595
 therap-0.591
Tokenme
Token ID1326
Feature activation-1.32e+00

Pos logit contributions
selves+2.574
 Lords+2.543
 cabinets+2.433
 coy+2.429
 Desk+2.425
Neg logit contributions
ellen-3.102
aeda-2.901
ochet-2.788
uden-2.740
 therap-2.708
Tokenade
Token ID671
Feature activation-2.67e-01

Pos logit contributions
ellen+0.868
aeda+0.797
ochet+0.756
uden+0.744
 therap+0.724
Neg logit contributions
selves-1.069
 Lords-1.057
 cabinets-1.023
 coy-1.018
 Desk-1.018
Token and
Token ID290
Feature activation-3.66e-01

Pos logit contributions
ellen+0.261
aeda+0.248
ochet+0.240
uden+0.236
 therap+0.234
Neg logit contributions
selves-0.129
 Lords-0.128
 coy-0.120
 cabinets-0.120
 Desk-0.120
Token Me
Token ID2185
Feature activation-9.23e-01

Pos logit contributions
ellen+0.339
aeda+0.320
ochet+0.309
uden+0.305
 therap+0.301
Neg logit contributions
selves-0.197
 Lords-0.193
 cabinets-0.183
 coy-0.182
 Desk-0.181
Tokengyn
Token ID40183
Feature activation+8.81e+00

Pos logit contributions
ellen+0.817
aeda+0.763
ochet+0.735
uden+0.729
 therap+0.713
Neg logit contributions
selves-0.531
 Lords-0.529
 cabinets-0.506
 coy-0.505
 Desk-0.500
Token Kelly
Token ID9077
Feature activation+4.98e+00

Pos logit contributions
 Lords+4.179
selves+4.141
 Islands+3.959
 Banks+3.870
 Standing+3.842
Neg logit contributions
ellen-8.363
aeda-8.234
 therap-8.018
uden-7.733
ochet-7.697
Token.
Token ID13
Feature activation+1.19e-01

Pos logit contributions
selves+2.766
 Lords+2.742
 coy+2.617
 cabinets+2.593
 Desk+2.579
Neg logit contributions
ellen-4.415
aeda-4.226
ochet-4.049
 therap-3.984
uden-3.934
Token
Token ID198
Feature activation-1.76e-01

Pos logit contributions
selves+0.063
 Lords+0.061
 cabinets+0.058
 coy+0.058
 Desk+0.058
Neg logit contributions
ellen-0.112
aeda-0.106
ochet-0.102
uden-0.101
 therap-0.099
Token
Token ID198
Feature activation+2.57e-01

Pos logit contributions
ellen+0.156
aeda+0.144
ochet+0.142
uden+0.138
Detailed+0.134
Neg logit contributions
selves-0.100
 Lords-0.098
 coy-0.096
 cabinets-0.095
 Desk-0.094
TokenTr
Token ID2898
Feature activation-1.05e+00

Pos logit contributions
selves+0.185
 Lords+0.183
 Desk+0.175
 cabinets+0.175
 coy+0.175
Neg logit contributions
ellen-0.191
aeda-0.178
ochet-0.170
uden-0.168
 therap-0.167
Token occurring
Token ID14963
Feature activation-2.47e+00

Pos logit contributions
ellen+1.334
aeda+1.243
ochet+1.185
uden+1.171
 therap+1.160
Neg logit contributions
selves-1.238
 Lords-1.220
 cabinets-1.176
 Desk-1.173
 Islands-1.165
Token organic
Token ID10469
Feature activation+3.16e-01

Pos logit contributions
ellen+2.140
aeda+2.021
 therap+1.931
ochet+1.923
uden+1.914
Neg logit contributions
selves-1.456
 Lords-1.404
 Desk-1.346
 cabinets-1.337
 coy-1.326
Token nitrogen
Token ID23417
Feature activation+3.65e+00

Pos logit contributions
selves+0.183
 Lords+0.180
 cabinets+0.171
 Desk+0.171
 coy+0.170
Neg logit contributions
ellen-0.279
aeda-0.263
ochet-0.253
uden-0.250
 therap-0.247
Token-
Token ID12
Feature activation+8.47e-01

Pos logit contributions
selves+2.072
 Lords+2.067
 cabinets+1.966
 coy+1.960
 Desk+1.939
Neg logit contributions
ellen-3.245
aeda-3.061
ochet-2.966
uden-2.895
 therap-2.860
Tokencontaining
Token ID38301
Feature activation+6.32e-01

Pos logit contributions
selves+0.532
 Lords+0.525
 cabinets+0.501
 coy+0.500
 Desk+0.499
Neg logit contributions
ellen-0.708
aeda-0.664
ochet-0.639
uden-0.629
 therap-0.621
Token bases
Token ID12536
Feature activation+8.72e+00

Pos logit contributions
selves+0.388
 Lords+0.381
 cabinets+0.363
 Desk+0.362
 coy+0.361
Neg logit contributions
ellen-0.537
aeda-0.504
ochet-0.484
uden-0.480
 therap-0.475
Token)
Token ID8
Feature activation+1.14e+00

Pos logit contributions
selves+4.003
 Lords+3.979
 coy+3.832
 cabinets+3.751
 Shiva+3.726
Neg logit contributions
ellen-8.291
aeda-7.875
ochet-7.587
uden-7.534
 therap-7.454
Token properties
Token ID6608
Feature activation-2.67e+00

Pos logit contributions
selves+0.605
 Lords+0.596
 cabinets+0.563
 coy+0.562
 Desk+0.560
Neg logit contributions
ellen-1.069
aeda-1.010
ochet-0.976
uden-0.962
 therap-0.951
Token,
Token ID11
Feature activation+1.11e+00

Pos logit contributions
ellen+2.369
aeda+2.225
ochet+2.160
uden+2.122
 therap+2.100
Neg logit contributions
selves-1.517
 Lords-1.486
 Desk-1.427
 cabinets-1.413
 Islands-1.410
Token something
Token ID1223
Feature activation-6.83e-01

Pos logit contributions
selves+0.605
 Lords+0.595
 cabinets+0.564
 coy+0.563
 Desk+0.561
Neg logit contributions
ellen-1.021
aeda-0.963
ochet-0.930
uden-0.917
 therap-0.907
Token synt
Token ID7419
Feature activation+3.57e+00

Pos logit contributions
ellen+0.629
aeda+0.590
ochet+0.572
uden+0.565
 therap+0.562
Neg logit contributions
selves-0.373
 Lords-0.361
 cabinets-0.344
 coy-0.342
 Islands-0.341
Token Batman
Token ID9827
Feature activation+5.53e-01

Pos logit contributions
 Lords+2.367
selves+2.288
 coy+2.229
 Ghostbusters+2.189
 cabinets+2.180
Neg logit contributions
ellen-4.689
aeda-4.464
ochet-4.288
uden-4.189
 therap-4.148
Token and
Token ID290
Feature activation-1.94e+00

Pos logit contributions
selves+0.316
 Lords+0.311
 cabinets+0.296
 coy+0.295
 Desk+0.294
Neg logit contributions
ellen-0.494
aeda-0.465
ochet-0.449
uden-0.443
 therap-0.437
Token Robin
Token ID12325
Feature activation+9.64e-01

Pos logit contributions
ellen+1.875
aeda+1.786
ochet+1.735
uden+1.706
 therap+1.683
Neg logit contributions
selves-0.955
 Lords-0.898
 cabinets-0.872
 Desk-0.860
 Islands-0.858
Token was
Token ID373
Feature activation-2.50e-01

Pos logit contributions
selves+0.516
 Lords+0.507
 cabinets+0.481
 coy+0.479
 Desk+0.478
Neg logit contributions
ellen-0.895
aeda-0.845
ochet-0.817
uden-0.806
 therap-0.796
Token unfor
Token ID28371
Feature activation+4.04e-01

Pos logit contributions
ellen+0.227
aeda+0.214
ochet+0.207
uden+0.205
 therap+0.203
Neg logit contributions
selves-0.139
 Lords-0.135
 Desk-0.129
 cabinets-0.129
 Islands-0.128
Tokeng
Token ID70
Feature activation+8.68e+00

Pos logit contributions
selves+0.279
 Lords+0.276
 cabinets+0.264
 Desk+0.264
 coy+0.264
Neg logit contributions
ellen-0.313
aeda-0.291
ochet-0.279
uden-0.275
 therap-0.270
Tokenivable
Token ID21911
Feature activation+3.25e+00

Pos logit contributions
 Lords+3.124
selves+3.063
 cabinets+2.926
 Islands+2.907
 coy+2.873
Neg logit contributions
ellen-9.416
aeda-8.877
 therap-8.648
ochet-8.549
uden-8.514
Token
Token ID1399
Feature activation+3.46e+00

Pos logit contributions
selves+1.641
 Lords+1.626
 cabinets+1.537
 coy+1.536
 Islands+1.517
Neg logit contributions
ellen-3.119
aeda-2.954
ochet-2.850
uden-2.807
 therap-2.772
Token well
Token ID880
Feature activation-2.40e+00

Pos logit contributions
selves+1.915
 Lords+1.905
 coy+1.806
 cabinets+1.787
 Islands+1.777
Neg logit contributions
ellen-3.133
aeda-2.978
ochet-2.858
uden-2.807
 therap-2.779
Token,
Token ID11
Feature activation-1.39e+00

Pos logit contributions
ellen+2.259
aeda+2.118
ochet+2.062
uden+2.035
 therap+2.007
Neg logit contributions
selves-1.254
 Lords-1.226
 cabinets-1.169
 Desk-1.160
 Islands-1.154
Token I
Token ID314
Feature activation-4.29e-01

Pos logit contributions
ellen+1.251
aeda+1.166
ochet+1.138
uden+1.125
 therap+1.108
Neg logit contributions
selves-0.778
 Lords-0.754
 cabinets-0.724
 Desk-0.720
 Islands-0.717
Token Sh
Token ID911
Feature activation-2.74e+00

Pos logit contributions
selves+0.688
 Lords+0.670
 coy+0.649
 cabinets+0.645
 Desk+0.627
Neg logit contributions
ellen-1.545
aeda-1.446
ochet-1.417
uden-1.391
 therap-1.330
Tokenand
Token ID392
Feature activation-2.36e+00

Pos logit contributions
ellen+1.819
aeda+1.618
ochet+1.597
uden+1.533
uffer+1.395
Neg logit contributions
selves-2.215
 Lords-2.202
 coy-2.181
 Desk-2.177
 cabinets-2.173
Tokenling
Token ID1359
Feature activation-4.62e-01

Pos logit contributions
ellen+2.311
aeda+2.151
ochet+2.105
uden+2.079
piring+1.975
Neg logit contributions
 Lords-1.131
selves-1.131
 Desk-1.102
 coy-1.096
 cabinets-1.088
Token 1997
Token ID8309
Feature activation+1.14e+00

Pos logit contributions
ellen+0.433
aeda+0.409
ochet+0.395
uden+0.389
 therap+0.386
Neg logit contributions
selves-0.244
 Lords-0.241
 cabinets-0.227
 coy-0.227
 Desk-0.226
Token John
Token ID1757
Feature activation+4.04e-01

Pos logit contributions
selves+0.678
 Lords+0.674
 cabinets+0.638
 coy+0.638
 Desk+0.636
Neg logit contributions
ellen-0.994
aeda-0.935
ochet-0.899
uden-0.888
 therap-0.880
Token Lith
Token ID19730
Feature activation+8.66e+00

Pos logit contributions
selves+0.238
 Lords+0.235
 cabinets+0.224
 coy+0.223
 Desk+0.222
Neg logit contributions
ellen-0.354
aeda-0.333
ochet-0.321
uden-0.316
 therap-0.312
Tokengow
Token ID21175
Feature activation+3.54e+00

Pos logit contributions
selves+6.032
 Lords+5.811
 cabinets+5.574
 Islands+5.560
 Streets+5.534
Neg logit contributions
ellen-6.346
 therap-6.118
aeda-5.969
ochet-5.596
uden-5.493
Token
Token ID198
Feature activation+1.72e-01

Pos logit contributions
selves+1.749
 Lords+1.724
 cabinets+1.620
 coy+1.615
 Islands+1.610
Neg logit contributions
ellen-3.429
aeda-3.250
ochet-3.142
uden-3.100
 therap-3.083
Token
Token ID198
Feature activation+3.11e-01

Pos logit contributions
selves+0.097
 Lords+0.095
 coy+0.093
 cabinets+0.092
 Desk+0.092
Neg logit contributions
ellen-0.153
aeda-0.141
ochet-0.139
uden-0.135
Detailed-0.130
Token1997
Token ID21498
Feature activation+1.58e+00

Pos logit contributions
selves+0.241
 Lords+0.237
 Islands+0.228
 cabinets+0.227
 Desk+0.227
Neg logit contributions
ellen-0.214
aeda-0.198
ochet-0.191
 therap-0.187
uden-0.186
Token Michael
Token ID3899
Feature activation+1.13e+00

Pos logit contributions
selves+1.004
 Lords+1.001
 coy+0.947
 Desk+0.947
 cabinets+0.946
Neg logit contributions
ellen-1.300
aeda-1.220
ochet-1.170
uden-1.158
 therap-1.153
Token Sh
Token ID911
Feature activation-2.80e+00

Pos logit contributions
selves+0.792
 Lords+0.774
 coy+0.745
 cabinets+0.741
 Desk+0.723
Neg logit contributions
ellen-1.803
aeda-1.696
ochet-1.659
uden-1.630
 therap-1.567
Tokenand
Token ID392
Feature activation-2.60e+00

Pos logit contributions
ellen+1.859
aeda+1.649
ochet+1.633
uden+1.564
Detailed+1.433
Neg logit contributions
selves-2.248
 Lords-2.240
 coy-2.228
 Desk-2.224
 cabinets-2.218
Tokenling
Token ID1359
Feature activation-3.46e-01

Pos logit contributions
ellen+2.531
aeda+2.355
ochet+2.308
uden+2.273
piring+2.159
Neg logit contributions
 Lords-1.249
selves-1.247
 Desk-1.221
 coy-1.214
 cabinets-1.208
Token 1999
Token ID7358
Feature activation+7.47e-01

Pos logit contributions
ellen+0.344
aeda+0.326
ochet+0.316
uden+0.311
 therap+0.308
Neg logit contributions
selves-0.163
 Lords-0.160
 cabinets-0.151
 coy-0.150
 Desk-0.150
Token John
Token ID1757
Feature activation+6.99e-01

Pos logit contributions
selves+0.473
 Lords+0.470
 cabinets+0.446
 coy+0.446
 Desk+0.445
Neg logit contributions
ellen-0.621
aeda-0.583
ochet-0.559
uden-0.552
 therap-0.547
Token Lith
Token ID19730
Feature activation+8.59e+00

Pos logit contributions
selves+0.430
 Lords+0.425
 cabinets+0.412
 coy+0.410
 Desk+0.404
Neg logit contributions
ellen-0.590
aeda-0.547
ochet-0.531
uden-0.519
 therap-0.502
Tokengow
Token ID21175
Feature activation+3.52e+00

Pos logit contributions
selves+6.040
 Lords+5.811
 cabinets+5.564
 Islands+5.548
 Streets+5.519
Neg logit contributions
ellen-6.295
 therap-6.040
aeda-5.908
ochet-5.510
uden-5.406
Token
Token ID198
Feature activation+1.42e-01

Pos logit contributions
selves+1.749
 Lords+1.721
 cabinets+1.620
 coy+1.615
 Desk+1.609
Neg logit contributions
ellen-3.408
aeda-3.228
ochet-3.122
uden-3.081
 therap-3.057
Token
Token ID198
Feature activation+2.47e-01

Pos logit contributions
selves+0.080
 Lords+0.079
 coy+0.077
 cabinets+0.076
 Desk+0.076
Neg logit contributions
ellen-0.127
aeda-0.117
ochet-0.115
uden-0.112
Detailed-0.108
Token1999
Token ID18946
Feature activation+1.73e+00

Pos logit contributions
selves+0.187
 Lords+0.184
 cabinets+0.177
 Islands+0.177
 coy+0.177
Neg logit contributions
ellen-0.175
aeda-0.162
ochet-0.155
uden-0.152
 therap-0.150
Token Michael
Token ID3899
Feature activation+1.60e+00

Pos logit contributions
 Lords+1.101
selves+1.101
 Islands+1.041
 coy+1.040
 Desk+1.039
Neg logit contributions
ellen-1.426
aeda-1.341
ochet-1.283
uden-1.273
 therap-1.267
Token:
Token ID25
Feature activation+2.34e+00

Pos logit contributions
ellen+1.821
aeda+1.715
ochet+1.649
uden+1.626
 therap+1.605
Neg logit contributions
selves-1.293
 Lords-1.260
 Desk-1.204
 coy-1.201
 cabinets-1.198
Token https
Token ID3740
Feature activation-1.97e+00

Pos logit contributions
selves+1.152
 Lords+1.146
 coy+1.076
 Desk+1.070
 cabinets+1.067
Neg logit contributions
ellen-2.260
aeda-2.151
ochet-2.071
uden-2.053
 therap-2.024
Token://
Token ID1378
Feature activation-3.14e+00

Pos logit contributions
ellen+1.131
aeda+0.983
ochet+0.976
uden+0.910
Detailed+0.850
Neg logit contributions
 Lords-1.730
selves-1.730
 coy-1.693
 cabinets-1.690
 Desk-1.686
Tokent
Token ID83
Feature activation-3.47e+00

Pos logit contributions
ellen+2.222
aeda+2.015
ochet+1.953
uden+1.892
piring+1.778
Neg logit contributions
 Lords-2.383
selves-2.363
 Desk-2.321
 coy-2.285
 Shiva-2.283
Token.
Token ID13
Feature activation-8.21e+00

Pos logit contributions
ellen+2.560
aeda+2.329
ochet+2.321
uden+2.213
Detailed+2.060
Neg logit contributions
selves-2.461
 Lords-2.439
 coy-2.391
 cabinets-2.385
 Desk-2.368
Tokenco
Token ID1073
Feature activation-1.29e+01

Pos logit contributions
ellen+4.072
aeda+3.466
ochet+3.432
uden+3.327
Ca+2.947
Neg logit contributions
selves-7.553
 Lords-7.505
 cabinets-7.394
 Desk-7.387
 Shiva-7.371
Token/
Token ID14
Feature activation-1.07e+01

Pos logit contributions
ellen+7.733
ochet+7.159
Ca+6.544
ryan+6.467
aeda+6.460
Neg logit contributions
selves-9.253
 coy-9.124
 Lords-9.084
 cabinets-9.039
 stairs-9.018
Tokenc
Token ID66
Feature activation-2.98e+00

Pos logit contributions
ellen+7.596
aeda+6.989
ochet+6.699
uden+6.591
Ca+6.422
Neg logit contributions
selves-7.478
 Lords-7.275
 cabinets-7.168
 Desk-7.038
 Shiva-7.013
TokenQ
Token ID48
Feature activation-9.82e-01

Pos logit contributions
ellen+2.185
aeda+2.022
ochet+1.944
uden+1.910
 therap+1.831
Neg logit contributions
selves-2.181
 Lords-2.143
 cabinets-2.077
 coy-2.068
 Desk-2.064
Token3
Token ID18
Feature activation-6.87e-01

Pos logit contributions
ellen+0.739
aeda+0.687
ochet+0.658
uden+0.646
 therap+0.631
Neg logit contributions
selves-0.703
 Lords-0.694
 cabinets-0.668
 coy-0.666
 Desk-0.664
Tokenx
Token ID87
Feature activation+4.34e+00

Pos logit contributions
ellen+0.524
aeda+0.487
ochet+0.467
uden+0.459
 therap+0.449
Neg logit contributions
selves-0.485
 Lords-0.479
 cabinets-0.460
 coy-0.459
 Desk-0.458
Token.
Token ID13
Feature activation-3.55e-01

Pos logit contributions
ellen+0.948
aeda+0.889
ochet+0.870
uden+0.854
 therap+0.846
Neg logit contributions
selves-0.590
 Lords-0.581
 cabinets-0.553
 Desk-0.548
 Islands-0.546
Token https
Token ID3740
Feature activation-3.93e+00

Pos logit contributions
ellen+0.328
aeda+0.308
ochet+0.299
uden+0.294
 therap+0.290
Neg logit contributions
selves-0.192
 Lords-0.188
 cabinets-0.180
 coy-0.178
 Islands-0.177
Token://
Token ID1378
Feature activation-5.14e+00

Pos logit contributions
ellen+2.083
ochet+1.788
aeda+1.765
uden+1.614
Detailed+1.579
Neg logit contributions
 Lords-3.594
selves-3.556
 coy-3.507
 cabinets-3.494
 Desk-3.489
Tokent
Token ID83
Feature activation-3.26e+00

Pos logit contributions
ellen+3.325
aeda+2.922
ochet+2.899
uden+2.763
piring+2.574
Neg logit contributions
 Lords-4.189
 Desk-4.103
selves-4.040
 cabinets-4.010
 Shiva-4.007
Token.
Token ID13
Feature activation-7.89e+00

Pos logit contributions
ellen+2.396
aeda+2.187
ochet+2.170
uden+2.077
Detailed+1.927
Neg logit contributions
selves-2.314
 Lords-2.297
 coy-2.250
 cabinets-2.245
 Desk-2.225
Tokenco
Token ID1073
Feature activation-1.28e+01

Pos logit contributions
ellen+3.899
aeda+3.376
ochet+3.330
uden+3.274
piring+2.868
Neg logit contributions
selves-7.261
 Lords-7.230
 Desk-7.098
 Shiva-7.088
 cabinets-7.082
Token/
Token ID14
Feature activation-1.01e+01

Pos logit contributions
ellen+7.720
ochet+7.077
aeda+6.522
Ca+6.478
ryan+6.474
Neg logit contributions
selves-9.096
 Lords-8.996
 coy-8.940
 cabinets-8.937
 stairs-8.800
Tokenni
Token ID8461
Feature activation-2.76e+00

Pos logit contributions
ellen+7.220
aeda+6.650
ochet+6.378
uden+6.277
Ca+6.087
Neg logit contributions
selves-7.068
 Lords-6.876
 cabinets-6.779
 Desk-6.657
 Shiva-6.627
TokenWB
Token ID45607
Feature activation-3.32e-01

Pos logit contributions
ellen+2.020
aeda+1.871
ochet+1.793
uden+1.762
 therap+1.688
Neg logit contributions
selves-2.030
 Lords-1.991
 cabinets-1.930
 coy-1.920
 Desk-1.912
Tokenqv
Token ID44179
Feature activation+4.86e+00

Pos logit contributions
ellen+0.246
aeda+0.229
ochet+0.219
uden+0.215
 therap+0.211
Neg logit contributions
selves-0.240
 Lords-0.237
 cabinets-0.228
 coy-0.227
 Desk-0.227
Tokenfi
Token ID12463
Feature activation-5.30e-01

Pos logit contributions
selves+3.398
 Lords+3.379
 coy+3.232
 Islands+3.223
 Desk+3.205
Neg logit contributions
ellen-3.557
aeda-3.328
 therap-3.194
ochet-3.180
uden-3.115
Token Report
Token ID6358
Feature activation-1.19e+00

Pos logit contributions
ellen+0.851
aeda+0.797
ochet+0.769
uden+0.756
 therap+0.745
Neg logit contributions
selves-0.623
 Lords-0.612
 cabinets-0.583
 coy-0.582
 Desk-0.581
Token https
Token ID3740
Feature activation-2.29e+00

Pos logit contributions
ellen+1.058
aeda+0.996
ochet+0.962
uden+0.947
 therap+0.933
Neg logit contributions
selves-0.689
 Lords-0.675
 cabinets-0.644
 coy-0.639
 Islands-0.635
Token://
Token ID1378
Feature activation-3.46e+00

Pos logit contributions
ellen+1.274
ochet+1.106
aeda+1.105
uden+1.025
Detailed+0.975
Neg logit contributions
selves-2.027
 Lords-2.026
 coy-1.983
 Desk-1.983
 cabinets-1.981
Tokent
Token ID83
Feature activation-3.20e+00

Pos logit contributions
ellen+2.373
aeda+2.128
ochet+2.096
uden+2.020
piring+1.878
Neg logit contributions
 Lords-2.662
selves-2.643
 Desk-2.618
 cabinets-2.583
 coy-2.569
Token.
Token ID13
Feature activation-8.22e+00

Pos logit contributions
ellen+2.323
aeda+2.114
ochet+2.110
uden+1.997
Detailed+1.887
Neg logit contributions
selves-2.295
 Lords-2.272
 coy-2.242
 cabinets-2.241
 Desk-2.226
Tokenco
Token ID1073
Feature activation-1.27e+01

Pos logit contributions
ellen+4.147
aeda+3.561
ochet+3.513
uden+3.381
Ca+3.073
Neg logit contributions
selves-7.484
 Lords-7.418
 cabinets-7.379
 Desk-7.359
 Shiva-7.349
Token/
Token ID14
Feature activation-9.87e+00

Pos logit contributions
ellen+7.567
ochet+6.988
Ca+6.473
aeda+6.450
ryan+6.350
Neg logit contributions
selves-9.118
 coy-8.892
 cabinets-8.852
 Lords-8.839
 stairs-8.765
Tokenl
Token ID75
Feature activation+4.92e-01

Pos logit contributions
ellen+7.054
aeda+6.491
ochet+6.228
uden+6.120
Ca+5.931
Neg logit contributions
selves-6.937
 Lords-6.740
 cabinets-6.651
 Desk-6.534
 Shiva-6.500
Tokeng
Token ID70
Feature activation+2.51e+00

Pos logit contributions
selves+0.355
 Lords+0.351
 cabinets+0.337
 coy+0.336
 Desk+0.335
Neg logit contributions
ellen-0.366
aeda-0.340
ochet-0.326
uden-0.320
 therap-0.315
TokenS
Token ID50
Feature activation-3.92e+00

Pos logit contributions
selves+1.799
 Lords+1.788
 coy+1.707
 cabinets+1.706
 Islands+1.702
Neg logit contributions
ellen-1.855
aeda-1.730
ochet-1.647
 therap-1.624
uden-1.615
Tokenwh
Token ID1929
Feature activation+1.79e+00

Pos logit contributions
ellen+2.832
aeda+2.636
ochet+2.519
uden+2.473
 therap+2.363
Neg logit contributions
selves-2.883
 Lords-2.826
 cabinets-2.759
 Desk-2.744
 coy-2.739
Token.
Token ID13
Feature activation-9.54e-01

Pos logit contributions
ellen+2.070
aeda+1.934
ochet+1.863
uden+1.833
 therap+1.805
Neg logit contributions
selves-1.629
 Lords-1.611
 cabinets-1.544
 coy-1.541
 Desk-1.537
Token https
Token ID3740
Feature activation-3.18e+00

Pos logit contributions
ellen+0.874
aeda+0.826
ochet+0.800
uden+0.786
 therap+0.770
Neg logit contributions
selves-0.515
 Lords-0.502
 cabinets-0.485
 coy-0.484
 Desk-0.474
Token://
Token ID1378
Feature activation-4.10e+00

Pos logit contributions
ellen+1.713
ochet+1.476
aeda+1.455
uden+1.339
Detailed+1.304
Neg logit contributions
 Lords-2.859
selves-2.844
 coy-2.810
 cabinets-2.792
 Desk-2.790
Tokent
Token ID83
Feature activation-3.30e+00

Pos logit contributions
ellen+2.928
aeda+2.665
ochet+2.582
uden+2.503
piring+2.345
Neg logit contributions
 Lords-3.072
selves-3.061
 Desk-3.002
 coy-2.967
 cabinets-2.953
Token.
Token ID13
Feature activation-7.63e+00

Pos logit contributions
ellen+2.461
aeda+2.262
ochet+2.237
uden+2.147
piring+1.978
Neg logit contributions
selves-2.323
 Lords-2.292
 coy-2.255
 cabinets-2.246
 Desk-2.231
Tokenco
Token ID1073
Feature activation-1.22e+01

Pos logit contributions
ellen+3.918
aeda+3.390
ochet+3.317
uden+3.252
piring+2.866
Neg logit contributions
selves-6.950
 Lords-6.878
 Desk-6.820
 cabinets-6.816
 Shiva-6.793
Token/
Token ID14
Feature activation-9.45e+00

Pos logit contributions
ellen+7.525
ochet+6.955
aeda+6.376
Ca+6.375
ryan+6.322
Neg logit contributions
selves-8.627
 coy-8.512
 Lords-8.473
 cabinets-8.427
 stairs-8.340
TokenP
Token ID47
Feature activation-5.91e+00

Pos logit contributions
ellen+6.802
aeda+6.263
ochet+6.006
uden+5.905
Ca+5.701
Neg logit contributions
selves-6.625
 Lords-6.465
 cabinets-6.357
 Desk-6.254
 Shiva-6.226
Token9
Token ID24
Feature activation-1.83e+00

Pos logit contributions
ellen+4.272
aeda+3.980
ochet+3.806
uden+3.718
 therap+3.517
Neg logit contributions
selves-4.335
 Lords-4.234
 cabinets-4.125
 Desk-4.094
 coy-4.078
Tokengc
Token ID36484
Feature activation+2.64e+00

Pos logit contributions
ellen+1.396
aeda+1.296
ochet+1.246
uden+1.224
 therap+1.186
Neg logit contributions
selves-1.292
 Lords-1.275
 cabinets-1.229
 coy-1.222
 Desk-1.222
TokenN
Token ID45
Feature activation-1.63e+00

Pos logit contributions
selves+1.889
 Lords+1.874
 coy+1.790
 cabinets+1.788
 Islands+1.785
Neg logit contributions
ellen-1.947
aeda-1.815
ochet-1.733
 therap-1.711
uden-1.700
Token
Token ID222
Feature activation-1.91e+00

Pos logit contributions
ellen+1.522
aeda+1.418
ochet+1.361
uden+1.334
 therap+1.314
Neg logit contributions
selves-1.427
 Lords-1.407
 cabinets-1.351
 coy-1.347
 Desk-1.346
Token https
Token ID3740
Feature activation-4.06e+00

Pos logit contributions
ellen+1.554
aeda+1.454
ochet+1.395
uden+1.374
 therap+1.355
Neg logit contributions
selves-1.241
 Lords-1.224
 cabinets-1.176
 coy-1.169
 Desk-1.163
Token://
Token ID1378
Feature activation-5.07e+00

Pos logit contributions
ellen+2.096
ochet+1.849
aeda+1.800
uden+1.642
Detailed+1.609
Neg logit contributions
 Lords-3.680
selves-3.668
 Desk-3.611
 coy-3.610
 cabinets-3.601
Tokent
Token ID83
Feature activation-3.71e+00

Pos logit contributions
ellen+3.435
aeda+3.075
ochet+2.992
uden+2.904
piring+2.699
Neg logit contributions
 Lords-3.973
selves-3.894
 Desk-3.882
 cabinets-3.807
 coy-3.798
Token.
Token ID13
Feature activation-7.42e+00

Pos logit contributions
ellen+2.732
aeda+2.508
ochet+2.485
uden+2.378
Detailed+2.197
Neg logit contributions
selves-2.629
 Lords-2.584
 cabinets-2.534
 coy-2.532
 Desk-2.524
Tokenco
Token ID1073
Feature activation-1.22e+01

Pos logit contributions
ellen+3.831
aeda+3.385
ochet+3.296
uden+3.207
Ca+2.844
Neg logit contributions
selves-6.720
 Lords-6.632
 Desk-6.607
 cabinets-6.575
 Shiva-6.503
Token/
Token ID14
Feature activation-9.32e+00

Pos logit contributions
ellen+7.552
ochet+6.920
aeda+6.532
Ca+6.331
ryan+6.329
Neg logit contributions
selves-8.654
 Lords-8.458
 cabinets-8.392
 coy-8.366
 stairs-8.244
TokenX
Token ID55
Feature activation-3.85e+00

Pos logit contributions
ellen+6.711
aeda+6.183
ochet+5.924
uden+5.827
Ca+5.611
Neg logit contributions
selves-6.558
 Lords-6.395
 cabinets-6.287
 Desk-6.193
 Shiva-6.157
Tokenh
Token ID71
Feature activation-2.70e+00

Pos logit contributions
ellen+2.876
aeda+2.668
ochet+2.552
uden+2.507
 therap+2.389
Neg logit contributions
selves-2.765
 Lords-2.724
 cabinets-2.636
 Desk-2.615
 coy-2.615
Token7
Token ID22
Feature activation-2.74e+00

Pos logit contributions
ellen+2.001
aeda+1.856
ochet+1.784
uden+1.754
 therap+1.690
Neg logit contributions
selves-1.957
 Lords-1.925
 cabinets-1.866
 Desk-1.857
 coy-1.852
TokenJ
Token ID41
Feature activation+4.28e-01

Pos logit contributions
ellen+2.077
aeda+1.927
ochet+1.852
uden+1.819
 therap+1.753
Neg logit contributions
selves-1.939
 Lords-1.912
 cabinets-1.846
 Desk-1.834
 coy-1.833
Token W
Token ID370
Feature activation-1.45e+00

Pos logit contributions
ellen+4.006
aeda+3.744
ochet+3.629
uden+3.559
 therap+3.500
Neg logit contributions
selves-2.602
 Lords-2.512
 cabinets-2.433
 coy-2.408
 Desk-2.389
Tokenadd
Token ID2860
Feature activation-4.16e+00

Pos logit contributions
ellen+1.143
aeda+1.067
ochet+1.021
uden+1.009
 therap+0.980
Neg logit contributions
selves-0.982
 Lords-0.967
 cabinets-0.935
 coy-0.926
 Desk-0.924
Tokenell
Token ID695
Feature activation-1.58e+00

Pos logit contributions
ellen+3.107
aeda+2.829
ochet+2.720
uden+2.662
 therap+2.558
Neg logit contributions
selves-3.007
 Lords-2.962
 cabinets-2.876
 coy-2.845
 Desk-2.837
Token
Token ID14519
Feature activation-7.90e-01

Pos logit contributions
ellen+1.092
aeda+1.006
ochet+0.962
uden+0.944
 therap+0.924
Neg logit contributions
selves-1.216
 Lords-1.195
 cabinets-1.157
 coy-1.153
 Desk-1.150
Token
Token ID234
Feature activation-2.46e+00

Pos logit contributions
ellen+0.530
aeda+0.488
ochet+0.464
uden+0.457
 therap+0.447
Neg logit contributions
selves-0.626
 Lords-0.620
 cabinets-0.598
 coy-0.597
 Desk-0.595
Token (@
Token ID4275
Feature activation-1.11e+01

Pos logit contributions
ellen+1.624
aeda+1.483
ochet+1.420
uden+1.384
 therap+1.357
Neg logit contributions
selves-1.966
 Lords-1.944
 cabinets-1.884
 coy-1.873
 Desk-1.863
Tokens
Token ID82
Feature activation-4.44e+00

Pos logit contributions
ellen+8.170
aeda+7.214
ochet+7.172
Detailed+6.846
uden+6.822
Neg logit contributions
 Lords-7.370
 cabinets-7.311
selves-7.309
 Desk-7.168
 Islands-7.133
Tokenhel
Token ID2978
Feature activation-6.30e-01

Pos logit contributions
ellen+3.112
aeda+2.842
ochet+2.742
uden+2.659
 therap+2.533
Neg logit contributions
selves-3.375
 Lords-3.366
 Desk-3.264
 cabinets-3.263
 Shiva-3.261
Tokenby
Token ID1525
Feature activation-9.79e-01

Pos logit contributions
ellen+0.449
aeda+0.417
ochet+0.398
uden+0.390
 therap+0.383
Neg logit contributions
selves-0.473
 Lords-0.469
 cabinets-0.451
 coy-0.450
 Desk-0.450
Tokeny
Token ID88
Feature activation+2.50e+00

Pos logit contributions
ellen+0.707
aeda+0.656
ochet+0.627
uden+0.615
 therap+0.605
Neg logit contributions
selves-0.727
 Lords-0.720
 cabinets-0.692
 coy-0.691
 Desk-0.689
Token512
Token ID25836
Feature activation+3.63e-03

Pos logit contributions
selves+1.795
 Lords+1.762
 coy+1.690
 cabinets+1.688
 Islands+1.679
Neg logit contributions
ellen-1.811
aeda-1.713
ochet-1.636
 therap-1.621
uden-1.591
Token https
Token ID3740
Feature activation-1.97e+00

Pos logit contributions
selves+1.152
 Lords+1.146
 coy+1.076
 Desk+1.070
 cabinets+1.067
Neg logit contributions
ellen-2.260
aeda-2.151
ochet-2.071
uden-2.053
 therap-2.024
Token://
Token ID1378
Feature activation-3.14e+00

Pos logit contributions
ellen+1.131
aeda+0.983
ochet+0.976
uden+0.910
Detailed+0.850
Neg logit contributions
 Lords-1.730
selves-1.730
 coy-1.693
 cabinets-1.690
 Desk-1.686
Tokent
Token ID83
Feature activation-3.47e+00

Pos logit contributions
ellen+2.222
aeda+2.015
ochet+1.953
uden+1.892
piring+1.778
Neg logit contributions
 Lords-2.383
selves-2.363
 Desk-2.321
 coy-2.285
 Shiva-2.283
Token.
Token ID13
Feature activation-8.21e+00

Pos logit contributions
ellen+2.560
aeda+2.329
ochet+2.321
uden+2.213
Detailed+2.060
Neg logit contributions
selves-2.461
 Lords-2.439
 coy-2.391
 cabinets-2.385
 Desk-2.368
Tokenco
Token ID1073
Feature activation-1.29e+01

Pos logit contributions
ellen+4.072
aeda+3.466
ochet+3.432
uden+3.327
Ca+2.947
Neg logit contributions
selves-7.553
 Lords-7.505
 cabinets-7.394
 Desk-7.387
 Shiva-7.371
Token/
Token ID14
Feature activation-1.07e+01

Pos logit contributions
ellen+7.733
ochet+7.159
Ca+6.544
ryan+6.467
aeda+6.460
Neg logit contributions
selves-9.253
 coy-9.124
 Lords-9.084
 cabinets-9.039
 stairs-9.018
Tokenc
Token ID66
Feature activation-2.98e+00

Pos logit contributions
ellen+7.596
aeda+6.989
ochet+6.699
uden+6.591
Ca+6.422
Neg logit contributions
selves-7.478
 Lords-7.275
 cabinets-7.168
 Desk-7.038
 Shiva-7.013
TokenQ
Token ID48
Feature activation-9.82e-01

Pos logit contributions
ellen+2.185
aeda+2.022
ochet+1.944
uden+1.910
 therap+1.831
Neg logit contributions
selves-2.181
 Lords-2.143
 cabinets-2.077
 coy-2.068
 Desk-2.064
Token3
Token ID18
Feature activation-6.87e-01

Pos logit contributions
ellen+0.739
aeda+0.687
ochet+0.658
uden+0.646
 therap+0.631
Neg logit contributions
selves-0.703
 Lords-0.694
 cabinets-0.668
 coy-0.666
 Desk-0.664
Tokenx
Token ID87
Feature activation+4.34e+00

Pos logit contributions
ellen+0.524
aeda+0.487
ochet+0.467
uden+0.459
 therap+0.449
Neg logit contributions
selves-0.485
 Lords-0.479
 cabinets-0.460
 coy-0.459
 Desk-0.458
Tokenmb
Token ID2022
Feature activation+4.67e-01

Pos logit contributions
selves+3.052
 Lords+3.031
 coy+2.898
 Islands+2.891
 Desk+2.882
Neg logit contributions
ellen-3.179
aeda-2.975
ochet-2.843
 therap-2.841
uden-2.783
Tokencom
Token ID785
Feature activation-3.93e+00

Pos logit contributions
ellen+2.080
aeda+1.898
ochet+1.858
uden+1.800
Detailed+1.664
Neg logit contributions
selves-2.248
 Lords-2.237
 cabinets-2.181
 Desk-2.174
 coy-2.159
Token/
Token ID14
Feature activation-6.06e+00

Pos logit contributions
ellen+3.274
aeda+3.042
ochet+2.976
uden+2.910
Detailed+2.751
Neg logit contributions
selves-2.389
 Lords-2.311
 cabinets-2.271
 coy-2.233
 Desk-2.219
Tokenwatch
Token ID8340
Feature activation-5.43e+00

Pos logit contributions
ellen+3.555
aeda+3.248
ochet+3.080
uden+2.968
 therap+2.818
Neg logit contributions
selves-5.214
 Lords-5.187
 cabinets-5.100
 Desk-5.059
 coy-4.994
Token?
Token ID30
Feature activation-6.85e-01

Pos logit contributions
ellen+3.659
aeda+3.196
ochet+3.156
uden+3.007
Detailed+2.987
Neg logit contributions
selves-4.071
 Lords-4.041
 cabinets-4.008
 Desk-3.981
 coy-3.897
Tokenv
Token ID85
Feature activation-1.13e+00

Pos logit contributions
ellen+0.537
aeda+0.501
ochet+0.481
uden+0.473
 therap+0.463
Neg logit contributions
selves-0.466
 Lords-0.462
 cabinets-0.443
 Desk-0.442
 coy-0.440
Token=
Token ID28
Feature activation-1.04e+01

Pos logit contributions
ellen+1.057
aeda+1.002
ochet+0.974
uden+0.957
 therap+0.922
Neg logit contributions
selves-0.593
 Lords-0.583
 Desk-0.556
 cabinets-0.553
 Shiva-0.553
Token2
Token ID17
Feature activation-4.08e+00

Pos logit contributions
ellen+7.441
aeda+6.758
ochet+6.480
uden+6.407
Ca+6.221
Neg logit contributions
selves-7.214
 Lords-7.072
 cabinets-6.989
 Desk-6.881
 Shiva-6.824
Tokeng
Token ID70
Feature activation+3.07e+00

Pos logit contributions
ellen+3.109
aeda+2.867
ochet+2.763
uden+2.708
 therap+2.586
Neg logit contributions
selves-2.861
 Lords-2.822
 cabinets-2.735
 Desk-2.715
 coy-2.705
Token7
Token ID22
Feature activation-2.24e+00

Pos logit contributions
selves+2.187
 Lords+2.171
 coy+2.073
 cabinets+2.070
 Islands+2.067
Neg logit contributions
ellen-2.271
aeda-2.122
ochet-2.023
 therap-1.997
uden-1.985
Tokenp
Token ID79
Feature activation-1.36e-01

Pos logit contributions
ellen+1.714
aeda+1.591
ochet+1.527
uden+1.500
 therap+1.452
Neg logit contributions
selves-1.579
 Lords-1.558
 cabinets-1.502
 Desk-1.493
 coy-1.493
Tokenk
Token ID74
Feature activation+2.09e+00

Pos logit contributions
ellen+0.101
aeda+0.094
ochet+0.090
uden+0.088
 therap+0.086
Neg logit contributions
selves-0.098
 Lords-0.097
 cabinets-0.093
 coy-0.093
 Desk-0.093
Token
Token ID229
Feature activation-1.66e+00

Pos logit contributions
ellen+2.752
aeda+2.509
ochet+2.372
uden+2.268
 therap+2.265
Neg logit contributions
selves-4.847
 Lords-4.743
 cabinets-4.674
 Desk-4.605
 coy-4.582
Token
Token ID118
Feature activation+5.17e-01

Pos logit contributions
ellen+1.145
aeda+1.055
ochet+1.014
uden+0.983
 therap+0.969
Neg logit contributions
selves-1.287
 Lords-1.267
 cabinets-1.226
 coy-1.222
 Desk-1.221
Token
Token ID8582
Feature activation-5.76e+00

Pos logit contributions
selves+0.414
 Lords+0.411
 coy+0.396
 Desk+0.396
 Islands+0.395
Neg logit contributions
ellen-0.342
aeda-0.316
ochet-0.300
uden-0.294
 therap-0.288
Token
Token ID229
Feature activation-2.55e+00

Pos logit contributions
ellen+2.790
aeda+2.486
ochet+2.375
 therap+2.228
uden+2.201
Neg logit contributions
selves-5.491
 Lords-5.382
 cabinets-5.320
 Desk-5.237
 coy-5.233
Token
Token ID116
Feature activation-1.08e-01

Pos logit contributions
ellen+1.672
aeda+1.543
ochet+1.478
uden+1.422
 therap+1.400
Neg logit contributions
selves-2.045
 Lords-2.008
 cabinets-1.954
 coy-1.948
 Desk-1.945
Token (@
Token ID4275
Feature activation-1.03e+01

Pos logit contributions
ellen+0.070
aeda+0.065
ochet+0.062
uden+0.060
 therap+0.059
Neg logit contributions
selves-0.088
 Lords-0.087
 coy-0.084
 cabinets-0.084
 Desk-0.084
Tokenm
Token ID76
Feature activation-1.42e+00

Pos logit contributions
ellen+7.840
aeda+6.972
ochet+6.868
uden+6.740
Detailed+6.597
Neg logit contributions
selves-6.677
 Lords-6.628
 cabinets-6.537
 Desk-6.485
 Shiva-6.398
Tokenf
Token ID69
Feature activation+2.64e-01

Pos logit contributions
ellen+1.089
aeda+1.013
ochet+0.971
uden+0.954
 therap+0.935
Neg logit contributions
selves-0.991
 Lords-0.983
 cabinets-0.943
 coy-0.942
 Desk-0.940
Tokenlynn
Token ID12935
Feature activation+4.34e-01

Pos logit contributions
selves+0.200
 Lords+0.198
 cabinets+0.191
 coy+0.190
 Desk+0.190
Neg logit contributions
ellen-0.187
aeda-0.173
ochet-0.165
uden-0.162
 therap-0.159
TokenJR
Token ID44817
Feature activation+9.39e-01

Pos logit contributions
selves+0.284
 Lords+0.281
 coy+0.268
 cabinets+0.268
 Desk+0.267
Neg logit contributions
ellen-0.350
aeda-0.328
ochet-0.315
uden-0.310
 therap-0.307
Token)
Token ID8
Feature activation-1.78e+00

Pos logit contributions
selves+0.466
 Lords+0.457
 coy+0.432
 Desk+0.431
 cabinets+0.430
Neg logit contributions
ellen-0.904
aeda-0.856
ochet-0.827
uden-0.817
 therap-0.812
Token
Token ID229
Feature activation-3.35e+00

Pos logit contributions
ellen+3.287
aeda+2.959
ochet+2.797
uden+2.727
 therap+2.701
Neg logit contributions
selves-5.339
 Lords-5.230
 cabinets-5.148
 Shiva-5.061
 coy-5.038
Token
Token ID118
Feature activation+3.57e-01

Pos logit contributions
ellen+2.152
aeda+1.970
ochet+1.900
uden+1.817
 therap+1.712
Neg logit contributions
selves-2.703
 Lords-2.642
 cabinets-2.578
 coy-2.574
 Desk-2.562
Token
Token ID8582
Feature activation-5.49e+00

Pos logit contributions
selves+0.289
 Lords+0.286
 cabinets+0.276
 coy+0.276
 Desk+0.275
Neg logit contributions
ellen-0.233
aeda-0.215
ochet-0.204
uden-0.200
 therap-0.197
Token
Token ID229
Feature activation-2.95e+00

Pos logit contributions
ellen+2.508
aeda+2.175
ochet+2.120
uden+1.915
Detailed+1.849
Neg logit contributions
selves-5.390
 Lords-5.283
 cabinets-5.230
 coy-5.167
 Desk-5.153
Token
Token ID116
Feature activation-5.69e-01

Pos logit contributions
ellen+1.821
aeda+1.647
ochet+1.597
uden+1.489
piring+1.411
Neg logit contributions
selves-2.468
 Lords-2.452
 cabinets-2.395
 coy-2.388
 Desk-2.355
Token (@
Token ID4275
Feature activation-1.01e+01

Pos logit contributions
ellen+0.406
aeda+0.377
ochet+0.360
uden+0.353
 therap+0.347
Neg logit contributions
selves-0.428
 Lords-0.423
 cabinets-0.407
 coy-0.407
 Desk-0.406
Tokenm
Token ID76
Feature activation-2.20e+00

Pos logit contributions
ellen+7.497
aeda+6.615
uden+6.407
ochet+6.375
Detailed+6.155
Neg logit contributions
selves-7.131
 Lords-7.021
 Shiva-6.861
 cabinets-6.824
 Islands-6.749
Tokenf
Token ID69
Feature activation-5.24e-01

Pos logit contributions
ellen+1.767
aeda+1.645
ochet+1.573
uden+1.539
piring+1.438
Neg logit contributions
 Lords-1.475
selves-1.462
 Desk-1.417
 coy-1.415
 Shiva-1.408
Tokenlynn
Token ID12935
Feature activation-1.57e-01

Pos logit contributions
ellen+0.356
aeda+0.326
ochet+0.308
uden+0.305
 therap+0.280
Neg logit contributions
selves-0.419
 Lords-0.416
 Desk-0.402
 cabinets-0.401
 coy-0.400
TokenJR
Token ID44817
Feature activation+6.99e-01

Pos logit contributions
ellen+0.105
aeda+0.096
ochet+0.092
uden+0.090
 therap+0.088
Neg logit contributions
selves-0.125
 Lords-0.124
 cabinets-0.120
 coy-0.120
 Desk-0.119
Token)
Token ID8
Feature activation-1.41e+00

Pos logit contributions
selves+0.352
 Lords+0.344
 coy+0.326
 cabinets+0.325
 Desk+0.322
Neg logit contributions
ellen-0.674
aeda-0.635
ochet-0.619
uden-0.606
 therap-0.582
Token https
Token ID3740
Feature activation-3.93e+00

Pos logit contributions
ellen+0.328
aeda+0.308
ochet+0.299
uden+0.294
 therap+0.290
Neg logit contributions
selves-0.192
 Lords-0.188
 cabinets-0.180
 coy-0.178
 Islands-0.177
Token://
Token ID1378
Feature activation-5.14e+00

Pos logit contributions
ellen+2.083
ochet+1.788
aeda+1.765
uden+1.614
Detailed+1.579
Neg logit contributions
 Lords-3.594
selves-3.556
 coy-3.507
 cabinets-3.494
 Desk-3.489
Tokent
Token ID83
Feature activation-3.26e+00

Pos logit contributions
ellen+3.325
aeda+2.922
ochet+2.899
uden+2.763
piring+2.574
Neg logit contributions
 Lords-4.189
 Desk-4.103
selves-4.040
 cabinets-4.010
 Shiva-4.007
Token.
Token ID13
Feature activation-7.89e+00

Pos logit contributions
ellen+2.396
aeda+2.187
ochet+2.170
uden+2.077
Detailed+1.927
Neg logit contributions
selves-2.314
 Lords-2.297
 coy-2.250
 cabinets-2.245
 Desk-2.225
Tokenco
Token ID1073
Feature activation-1.28e+01

Pos logit contributions
ellen+3.899
aeda+3.376
ochet+3.330
uden+3.274
piring+2.868
Neg logit contributions
selves-7.261
 Lords-7.230
 Desk-7.098
 Shiva-7.088
 cabinets-7.082
Token/
Token ID14
Feature activation-1.01e+01

Pos logit contributions
ellen+7.720
ochet+7.077
aeda+6.522
Ca+6.478
ryan+6.474
Neg logit contributions
selves-9.096
 Lords-8.996
 coy-8.940
 cabinets-8.937
 stairs-8.800
Tokenni
Token ID8461
Feature activation-2.76e+00

Pos logit contributions
ellen+7.220
aeda+6.650
ochet+6.378
uden+6.277
Ca+6.087
Neg logit contributions
selves-7.068
 Lords-6.876
 cabinets-6.779
 Desk-6.657
 Shiva-6.627
TokenWB
Token ID45607
Feature activation-3.32e-01

Pos logit contributions
ellen+2.020
aeda+1.871
ochet+1.793
uden+1.762
 therap+1.688
Neg logit contributions
selves-2.030
 Lords-1.991
 cabinets-1.930
 coy-1.920
 Desk-1.912
Tokenqv
Token ID44179
Feature activation+4.86e+00

Pos logit contributions
ellen+0.246
aeda+0.229
ochet+0.219
uden+0.215
 therap+0.211
Neg logit contributions
selves-0.240
 Lords-0.237
 cabinets-0.228
 coy-0.227
 Desk-0.227
Tokenfi
Token ID12463
Feature activation-5.30e-01

Pos logit contributions
selves+3.398
 Lords+3.379
 coy+3.232
 Islands+3.223
 Desk+3.205
Neg logit contributions
ellen-3.557
aeda-3.328
 therap-3.194
ochet-3.180
uden-3.115
TokenW
Token ID54
Feature activation+1.79e+00

Pos logit contributions
ellen+0.387
aeda+0.359
ochet+0.344
uden+0.338
 therap+0.330
Neg logit contributions
selves-0.390
 Lords-0.385
 cabinets-0.371
 coy-0.370
 Desk-0.369
Token
Token ID851
Feature activation-2.59e+00

Pos logit contributions
ellen+0.554
aeda+0.520
ochet+0.503
uden+0.495
 therap+0.489
Neg logit contributions
selves-0.352
 Lords-0.346
 cabinets-0.330
 coy-0.329
 Desk-0.326
Token Donald
Token ID3759
Feature activation-1.09e+00

Pos logit contributions
ellen+2.239
aeda+2.095
ochet+2.024
uden+1.995
 therap+1.969
Neg logit contributions
selves-1.556
 Lords-1.535
 cabinets-1.456
 coy-1.453
 Islands-1.445
Token J
Token ID449
Feature activation+1.91e+00

Pos logit contributions
ellen+1.015
aeda+0.938
ochet+0.938
uden+0.907
Detailed+0.889
Neg logit contributions
selves-0.565
 Lords-0.529
 cabinets-0.514
 coy-0.514
 Islands-0.495
Token.
Token ID13
Feature activation+9.15e-01

Pos logit contributions
selves+1.256
 Lords+1.236
 coy+1.189
 cabinets+1.185
 Desk+1.180
Neg logit contributions
ellen-1.546
aeda-1.445
ochet-1.394
uden-1.368
 therap-1.314
Token Trump
Token ID1301
Feature activation-1.43e-03

Pos logit contributions
selves+0.397
 Lords+0.388
 coy+0.373
 cabinets+0.368
 Desk+0.355
Neg logit contributions
ellen-0.935
aeda-0.870
ochet-0.862
uden-0.845
Detailed-0.811
Token (@
Token ID4275
Feature activation-1.01e+01

Pos logit contributions
ellen+0.001
aeda+0.001
ochet+0.001
uden+0.001
 therap+0.001
Neg logit contributions
selves-0.001
 Lords-0.001
 cabinets-0.001
 coy-0.001
 Shiva-0.001
TokenrealDonaldTrump
Token ID28024
Feature activation+2.65e-02

Pos logit contributions
ellen+2.880
aeda+1.996
ochet+1.952
uden+1.898
Detailed+1.640
Neg logit contributions
 Lords-11.649
selves-11.532
 Desk-11.259
 Shiva-11.147
 PACs-11.116
Token)
Token ID8
Feature activation-2.88e+00

Pos logit contributions
selves+0.014
 Lords+0.014
 coy+0.013
 cabinets+0.013
 Desk+0.013
Neg logit contributions
ellen-0.025
aeda-0.023
ochet-0.023
uden-0.022
Detailed-0.021
Token June
Token ID2795
Feature activation+9.41e-01

Pos logit contributions
ellen+2.403
aeda+2.252
ochet+2.196
uden+2.114
piring+2.075
Neg logit contributions
selves-1.721
 Lords-1.674
 cabinets-1.629
 coy-1.600
 Desk-1.561
Token 3
Token ID513
Feature activation+1.56e+00

Pos logit contributions
selves+0.522
 Lords+0.511
 cabinets+0.490
 coy+0.489
 Desk+0.483
Neg logit contributions
ellen-0.852
aeda-0.802
ochet-0.778
uden-0.761
 therap-0.738
Token,
Token ID11
Feature activation-6.39e-02

Pos logit contributions
selves+0.812
 Lords+0.789
 cabinets+0.755
 coy+0.754
 Desk+0.745
Neg logit contributions
ellen-1.483
aeda-1.387
ochet-1.357
uden-1.324
Detailed-1.265
Tokeno
Token ID78
Feature activation+3.34e+00

Pos logit contributions
ellen+1.990
aeda+1.845
ochet+1.773
uden+1.740
 therap+1.679
Neg logit contributions
selves-1.845
 Lords-1.820
 cabinets-1.758
 coy-1.746
 Desk-1.745
TokenM
Token ID44
Feature activation-6.39e-01

Pos logit contributions
selves+2.379
 Lords+2.365
 coy+2.259
 Islands+2.255
 cabinets+2.249
Neg logit contributions
ellen-2.451
aeda-2.293
ochet-2.190
 therap-2.160
uden-2.150
Token
Token ID851
Feature activation-3.12e+00

Pos logit contributions
ellen+0.543
aeda+0.510
ochet+0.491
uden+0.482
 therap+0.473
Neg logit contributions
selves-0.393
 Lords-0.386
 cabinets-0.370
 coy-0.368
 Desk-0.367
Token Winnipeg
Token ID23434
Feature activation+2.58e+00

Pos logit contributions
ellen+2.793
aeda+2.605
ochet+2.516
uden+2.496
 therap+2.438
Neg logit contributions
selves-1.772
 Lords-1.719
 cabinets-1.654
 coy-1.631
 Islands-1.623
Token Jets
Token ID14728
Feature activation-2.97e-01

Pos logit contributions
selves+0.852
 Lords+0.837
 Islands+0.766
 Desk+0.766
 coy+0.764
Neg logit contributions
ellen-2.912
aeda-2.790
ochet-2.703
uden-2.665
 therap-2.661
Token (@
Token ID4275
Feature activation-9.97e+00

Pos logit contributions
ellen+0.202
aeda+0.187
ochet+0.178
uden+0.174
 therap+0.172
Neg logit contributions
selves-0.232
 Lords-0.230
 coy-0.222
 cabinets-0.222
 Desk-0.221
TokenN
Token ID45
Feature activation-3.54e+00

Pos logit contributions
ellen+6.782
aeda+5.857
ochet+5.798
uden+5.708
Detailed+5.703
Neg logit contributions
 Lords-7.515
selves-7.513
 Shiva-7.373
 Desk-7.305
 coy-7.225
TokenHL
Token ID6581
Feature activation+8.31e-01

Pos logit contributions
ellen+1.770
aeda+1.589
ochet+1.471
uden+1.440
 therap+1.357
Neg logit contributions
selves-3.444
 Lords-3.385
 Desk-3.291
 cabinets-3.290
 coy-3.288
TokenJ
Token ID41
Feature activation+1.99e+00

Pos logit contributions
selves+0.529
 Lords+0.520
 cabinets+0.500
 coy+0.497
 Desk+0.495
Neg logit contributions
ellen-0.685
aeda-0.645
ochet-0.620
uden-0.609
 therap-0.606
Tokenets
Token ID1039
Feature activation-1.10e+00

Pos logit contributions
selves+1.002
 Lords+0.992
 cabinets+0.935
 Islands+0.931
 coy+0.928
Neg logit contributions
ellen-1.894
aeda-1.794
ochet-1.741
 therap-1.715
uden-1.707
Token)
Token ID8
Feature activation-2.69e+00

Pos logit contributions
ellen+1.087
aeda+1.030
ochet+0.998
uden+0.985
 therap+0.975
Neg logit contributions
selves-0.517
 Lords-0.508
 cabinets-0.477
 coy-0.476
 Desk-0.474
Token
Token ID851
Feature activation-2.92e+00

Pos logit contributions
ellen+0.260
aeda+0.244
ochet+0.236
uden+0.233
 therap+0.229
Neg logit contributions
selves-0.161
 Lords-0.158
 cabinets-0.151
 coy-0.150
 Islands-0.149
Token Donald
Token ID3759
Feature activation-7.40e-01

Pos logit contributions
ellen+2.466
aeda+2.297
ochet+2.220
uden+2.194
 therap+2.156
Neg logit contributions
selves-1.805
 Lords-1.778
 cabinets-1.693
 coy-1.678
 Islands-1.673
Token J
Token ID449
Feature activation+1.92e+00

Pos logit contributions
ellen+0.703
aeda+0.658
ochet+0.649
uden+0.632
Detailed+0.620
Neg logit contributions
selves-0.372
 Lords-0.346
 coy-0.341
 cabinets-0.339
 Islands-0.323
Token.
Token ID13
Feature activation+4.51e-01

Pos logit contributions
selves+1.257
 Lords+1.237
 coy+1.192
 cabinets+1.189
 Desk+1.182
Neg logit contributions
ellen-1.561
aeda-1.463
ochet-1.409
uden-1.385
 therap-1.331
Token Trump
Token ID1301
Feature activation-4.36e-02

Pos logit contributions
selves+0.197
 Lords+0.191
 coy+0.185
 cabinets+0.185
 Islands+0.176
Neg logit contributions
ellen-0.459
aeda-0.427
ochet-0.424
uden-0.417
Detailed-0.399
Token (@
Token ID4275
Feature activation-9.92e+00

Pos logit contributions
ellen+0.025
aeda+0.022
ochet+0.021
uden+0.021
 therap+0.019
Neg logit contributions
selves-0.039
 Lords-0.038
 coy-0.037
 cabinets-0.037
 Shiva-0.037
TokenrealDonaldTrump
Token ID28024
Feature activation-1.30e-01

Pos logit contributions
ellen+2.944
aeda+2.133
ochet+1.979
uden+1.914
Detailed+1.741
Neg logit contributions
 Lords-11.453
selves-11.308
 Desk-11.069
 Shiva-10.922
 cabinets-10.891
Token)
Token ID8
Feature activation-2.85e+00

Pos logit contributions
ellen+0.120
aeda+0.114
ochet+0.111
uden+0.108
Detailed+0.104
Neg logit contributions
selves-0.069
 Lords-0.068
 coy-0.065
 cabinets-0.065
 Desk-0.064
Token November
Token ID3389
Feature activation-3.66e-01

Pos logit contributions
ellen+2.330
aeda+2.178
ochet+2.119
uden+2.051
piring+2.011
Neg logit contributions
selves-1.745
 Lords-1.692
 cabinets-1.664
 coy-1.613
 Shiva-1.586
Token 3
Token ID513
Feature activation+1.34e+00

Pos logit contributions
ellen+0.321
aeda+0.301
ochet+0.293
uden+0.284
Detailed+0.272
Neg logit contributions
selves-0.210
 Lords-0.205
 cabinets-0.199
 coy-0.199
 Desk-0.194
Token,
Token ID11
Feature activation-4.34e-01

Pos logit contributions
selves+0.693
 Lords+0.674
 cabinets+0.644
 coy+0.641
 Desk+0.636
Neg logit contributions
ellen-1.281
aeda-1.202
ochet-1.175
uden-1.148
 therap-1.098
Token Ren
Token ID7152
Feature activation-2.05e+00

Pos logit contributions
ellen+1.261
aeda+1.161
ochet+1.121
uden+1.101
 therap+1.080
Neg logit contributions
selves-1.125
 Lords-1.100
 cabinets-1.063
 coy-1.056
 Islands-1.050
Tokenaud
Token ID3885
Feature activation-9.88e-01

Pos logit contributions
ellen+1.335
aeda+1.223
ochet+1.168
uden+1.141
 therap+1.112
Neg logit contributions
selves-1.669
 Lords-1.645
 cabinets-1.593
 coy-1.588
 Desk-1.584
Token Lav
Token ID21438
Feature activation+1.78e+00

Pos logit contributions
ellen+0.872
aeda+0.817
ochet+0.791
uden+0.776
 therap+0.762
Neg logit contributions
selves-0.574
 Lords-0.559
 cabinets-0.540
 coy-0.536
 Islands-0.529
Tokeno
Token ID78
Feature activation+3.96e-02

Pos logit contributions
selves+1.272
 Lords+1.261
 cabinets+1.214
 coy+1.209
 Desk+1.208
Neg logit contributions
ellen-1.306
aeda-1.227
ochet-1.167
uden-1.153
 therap-1.151
Tokenie
Token ID494
Feature activation-1.96e+00

Pos logit contributions
 Lords+0.023
selves+0.023
 Desk+0.022
 coy+0.022
 Islands+0.022
Neg logit contributions
ellen-0.034
aeda-0.033
ochet-0.032
uden-0.031
 therap-0.031
Token (@
Token ID4275
Feature activation-9.90e+00

Pos logit contributions
ellen+1.274
aeda+1.164
ochet+1.114
uden+1.091
 therap+1.065
Neg logit contributions
selves-1.600
 Lords-1.571
 cabinets-1.524
 coy-1.517
 Desk-1.509
Tokenren
Token ID918
Feature activation-2.98e+00

Pos logit contributions
ellen+6.295
aeda+5.380
ochet+5.316
uden+5.262
Detailed+5.025
Neg logit contributions
selves-7.826
 cabinets-7.682
 Lords-7.663
 Tropical-7.584
 Desk-7.575
Tokenlav
Token ID18809
Feature activation+3.12e+00

Pos logit contributions
ellen+2.132
aeda+1.969
ochet+1.890
uden+1.845
 therap+1.768
Neg logit contributions
selves-2.246
 Lords-2.238
 Desk-2.153
 cabinets-2.152
 coy-2.141
Tokeno
Token ID78
Feature activation+2.67e+00

Pos logit contributions
selves+2.076
 Lords+2.025
 cabinets+1.950
 coy+1.947
 Islands+1.946
Neg logit contributions
ellen-2.424
aeda-2.279
 therap-2.171
ochet-2.150
uden-2.144
Tokeniet
Token ID1155
Feature activation-2.43e+00

Pos logit contributions
selves+1.535
 Lords+1.516
 coy+1.438
 Islands+1.438
 cabinets+1.435
Neg logit contributions
ellen-2.309
aeda-2.194
ochet-2.121
 therap-2.101
uden-2.079
Tokenva
Token ID6862
Feature activation+5.05e-01

Pos logit contributions
ellen+1.887
aeda+1.755
ochet+1.687
uden+1.664
 therap+1.604
Neg logit contributions
selves-1.678
 Lords-1.660
 cabinets-1.598
 coy-1.592
 Desk-1.588
Token https
Token ID3740
Feature activation-2.29e+00

Pos logit contributions
ellen+1.058
aeda+0.996
ochet+0.962
uden+0.947
 therap+0.933
Neg logit contributions
selves-0.689
 Lords-0.675
 cabinets-0.644
 coy-0.639
 Islands-0.635
Token://
Token ID1378
Feature activation-3.46e+00

Pos logit contributions
ellen+1.274
ochet+1.106
aeda+1.105
uden+1.025
Detailed+0.975
Neg logit contributions
selves-2.027
 Lords-2.026
 coy-1.983
 Desk-1.983
 cabinets-1.981
Tokent
Token ID83
Feature activation-3.20e+00

Pos logit contributions
ellen+2.373
aeda+2.128
ochet+2.096
uden+2.020
piring+1.878
Neg logit contributions
 Lords-2.662
selves-2.643
 Desk-2.618
 cabinets-2.583
 coy-2.569
Token.
Token ID13
Feature activation-8.22e+00

Pos logit contributions
ellen+2.323
aeda+2.114
ochet+2.110
uden+1.997
Detailed+1.887
Neg logit contributions
selves-2.295
 Lords-2.272
 coy-2.242
 cabinets-2.241
 Desk-2.226
Tokenco
Token ID1073
Feature activation-1.27e+01

Pos logit contributions
ellen+4.147
aeda+3.561
ochet+3.513
uden+3.381
Ca+3.073
Neg logit contributions
selves-7.484
 Lords-7.418
 cabinets-7.379
 Desk-7.359
 Shiva-7.349
Token/
Token ID14
Feature activation-9.87e+00

Pos logit contributions
ellen+7.567
ochet+6.988
Ca+6.473
aeda+6.450
ryan+6.350
Neg logit contributions
selves-9.118
 coy-8.892
 cabinets-8.852
 Lords-8.839
 stairs-8.765
Tokenl
Token ID75
Feature activation+4.92e-01

Pos logit contributions
ellen+7.054
aeda+6.491
ochet+6.228
uden+6.120
Ca+5.931
Neg logit contributions
selves-6.937
 Lords-6.740
 cabinets-6.651
 Desk-6.534
 Shiva-6.500
Tokeng
Token ID70
Feature activation+2.51e+00

Pos logit contributions
selves+0.355
 Lords+0.351
 cabinets+0.337
 coy+0.336
 Desk+0.335
Neg logit contributions
ellen-0.366
aeda-0.340
ochet-0.326
uden-0.320
 therap-0.315
TokenS
Token ID50
Feature activation-3.92e+00

Pos logit contributions
selves+1.799
 Lords+1.788
 coy+1.707
 cabinets+1.706
 Islands+1.702
Neg logit contributions
ellen-1.855
aeda-1.730
ochet-1.647
 therap-1.624
uden-1.615
Tokenwh
Token ID1929
Feature activation+1.79e+00

Pos logit contributions
ellen+2.832
aeda+2.636
ochet+2.519
uden+2.473
 therap+2.363
Neg logit contributions
selves-2.883
 Lords-2.826
 cabinets-2.759
 Desk-2.744
 coy-2.739
TokenX
Token ID55
Feature activation+5.02e-01

Pos logit contributions
selves+1.274
 Lords+1.261
 coy+1.208
 cabinets+1.205
 Islands+1.203
Neg logit contributions
ellen-1.338
aeda-1.248
ochet-1.194
uden-1.172
 therap-1.171
Token
Token ID851
Feature activation-2.72e+00

Pos logit contributions
ellen+0.089
aeda+0.083
ochet+0.081
uden+0.079
 therap+0.078
Neg logit contributions
selves-0.055
 Lords-0.055
 cabinets-0.052
 coy-0.052
 Desk-0.051
Token Donald
Token ID3759
Feature activation-6.11e-01

Pos logit contributions
ellen+2.313
aeda+2.167
ochet+2.092
uden+2.061
 therap+2.036
Neg logit contributions
selves-1.664
 Lords-1.634
 cabinets-1.559
 coy-1.551
 Islands-1.542
Token J
Token ID449
Feature activation+2.08e+00

Pos logit contributions
ellen+0.580
aeda+0.539
ochet+0.537
uden+0.518
Detailed+0.504
Neg logit contributions
selves-0.307
 Lords-0.292
 cabinets-0.284
 coy-0.283
 Desk-0.272
Token.
Token ID13
Feature activation+5.29e-01

Pos logit contributions
selves+1.357
 Lords+1.338
 coy+1.288
 cabinets+1.285
 Desk+1.279
Neg logit contributions
ellen-1.697
aeda-1.584
ochet-1.531
uden-1.499
 therap-1.441
Token Trump
Token ID1301
Feature activation-1.45e-01

Pos logit contributions
selves+0.231
 Lords+0.225
 coy+0.216
 cabinets+0.215
 Desk+0.209
Neg logit contributions
ellen-0.537
aeda-0.502
ochet-0.498
uden-0.486
Detailed-0.466
Token (@
Token ID4275
Feature activation-9.87e+00

Pos logit contributions
ellen+0.084
aeda+0.074
ochet+0.072
uden+0.069
Detailed+0.064
Neg logit contributions
selves-0.128
 Lords-0.126
 coy-0.124
 cabinets-0.123
 Desk-0.122
TokenrealDonaldTrump
Token ID28024
Feature activation-2.02e-01

Pos logit contributions
ellen+2.610
aeda+1.868
ochet+1.805
uden+1.585
Detailed+1.529
Neg logit contributions
 Lords-11.448
selves-11.236
 Desk-11.129
 Shiva-11.118
 cabinets-11.028
Token)
Token ID8
Feature activation-2.81e+00

Pos logit contributions
ellen+0.185
aeda+0.174
ochet+0.170
uden+0.165
Detailed+0.159
Neg logit contributions
selves-0.109
 Lords-0.107
 coy-0.104
 cabinets-0.103
 Shiva-0.101
Token June
Token ID2795
Feature activation+1.15e+00

Pos logit contributions
ellen+2.338
aeda+2.173
ochet+2.108
uden+2.063
 therap+2.009
Neg logit contributions
selves-1.722
 Lords-1.689
 cabinets-1.637
 Shiva-1.586
 coy-1.582
Token 3
Token ID513
Feature activation+1.21e+00

Pos logit contributions
selves+0.685
 Lords+0.676
 cabinets+0.643
 coy+0.642
 Desk+0.640
Neg logit contributions
ellen-1.001
aeda-0.941
ochet-0.907
uden-0.893
 therap-0.883
Token,
Token ID11
Feature activation-5.71e-01

Pos logit contributions
selves+0.639
 Lords+0.626
 cabinets+0.600
 coy+0.598
 Desk+0.595
Neg logit contributions
ellen-1.139
aeda-1.059
ochet-1.041
uden-1.011
Detailed-0.973
Token never
Token ID1239
Feature activation-1.26e+00

Pos logit contributions
ellen+3.039
aeda+2.852
uden+2.748
ochet+2.740
 therap+2.705
Neg logit contributions
selves-2.061
 Lords-1.988
 Desk-1.909
 Islands-1.895
 cabinets-1.883
Token called
Token ID1444
Feature activation-3.37e-01

Pos logit contributions
ellen+1.123
aeda+1.061
uden+1.019
ochet+1.019
 therap+0.993
Neg logit contributions
selves-0.710
 Lords-0.693
 Desk-0.665
 cabinets-0.651
 Islands-0.645
Token.
Token ID13
Feature activation+5.29e-01

Pos logit contributions
ellen+0.293
aeda+0.274
ochet+0.265
uden+0.262
 therap+0.259
Neg logit contributions
selves-0.200
 Lords-0.196
 cabinets-0.187
 Desk-0.186
 coy-0.186
Token pic
Token ID8301
Feature activation-6.16e+00

Pos logit contributions
selves+0.277
 Lords+0.272
 cabinets+0.259
 coy+0.258
 Desk+0.255
Neg logit contributions
ellen-0.497
aeda-0.469
ochet-0.454
uden-0.447
 therap-0.441
Token.
Token ID13
Feature activation-7.12e+00

Pos logit contributions
ellen+3.971
ochet+3.654
aeda+3.482
uden+3.358
Ca+3.221
Neg logit contributions
selves-4.681
 Lords-4.591
 coy-4.542
 cabinets-4.516
 Desk-4.493
Tokentwitter
Token ID6956
Feature activation-9.65e+00

Pos logit contributions
ellen+1.355
aeda+0.852
ochet+0.819
uden+0.733
ryan+0.422
Neg logit contributions
selves-8.807
 Lords-8.769
 Desk-8.686
 cabinets-8.645
 coy-8.611
Token.
Token ID13
Feature activation-6.20e+00

Pos logit contributions
ellen+5.420
ochet+4.793
aeda+4.453
Ca+4.314
ryan+4.292
Neg logit contributions
 coy-7.689
 Lords-7.663
 cabinets-7.609
selves-7.605
 Desk-7.541
Tokencom
Token ID785
Feature activation-8.58e+00

Pos logit contributions
ellen+3.886
aeda+3.483
ochet+3.386
uden+3.220
Detailed+3.109
Neg logit contributions
 Lords-4.972
selves-4.935
 cabinets-4.883
 coy-4.864
 Desk-4.797
Token/
Token ID14
Feature activation-9.12e+00

Pos logit contributions
ellen+5.729
ochet+5.290
aeda+5.053
Ca+4.943
Detailed+4.924
Neg logit contributions
selves-5.885
 coy-5.743
 Lords-5.737
 cabinets-5.707
 stairs-5.661
TokenCS
Token ID7902
Feature activation-5.21e+00

Pos logit contributions
ellen+6.551
aeda+6.033
ochet+5.794
uden+5.696
Ca+5.488
Neg logit contributions
selves-6.451
 Lords-6.292
 cabinets-6.173
 Desk-6.089
 Shiva-6.055
Token3
Token ID18
Feature activation-1.74e+00

Pos logit contributions
ellen+3.828
aeda+3.554
ochet+3.392
uden+3.344
 therap+3.146
Neg logit contributions
selves-3.768
 Lords-3.693
 cabinets-3.602
 Desk-3.567
 coy-3.555
Token
Token ID251
Feature activation+1.99e+00

Pos logit contributions
selves+1.939
 Lords+1.915
 cabinets+1.826
 coy+1.818
 Islands+1.816
Neg logit contributions
ellen-2.555
aeda-2.370
ochet-2.307
uden-2.261
 therap-2.236
Token to
Token ID284
Feature activation+4.68e-01

Pos logit contributions
selves+1.050
 Lords+1.031
 coy+0.973
 cabinets+0.972
 Desk+0.967
Neg logit contributions
ellen-1.866
aeda-1.765
ochet-1.705
uden-1.681
 therap-1.662
Token follow
Token ID1061
Feature activation-3.09e-01

Pos logit contributions
selves+0.277
 Lords+0.272
 cabinets+0.259
 coy+0.258
 Desk+0.258
Neg logit contributions
ellen-0.407
aeda-0.382
ochet-0.370
uden-0.364
 therap-0.361
Token the
Token ID262
Feature activation-4.73e-02

Pos logit contributions
ellen+0.277
aeda+0.261
ochet+0.252
uden+0.248
 therap+0.245
Neg logit contributions
selves-0.175
 Lords-0.173
 cabinets-0.164
 coy-0.163
 Desk-0.163
Token hashtag
Token ID23950
Feature activation-3.08e+00

Pos logit contributions
ellen+0.040
aeda+0.037
ochet+0.036
uden+0.035
 therap+0.035
Neg logit contributions
selves-0.029
 Lords-0.029
 cabinets-0.027
 Islands-0.027
 coy-0.027
Token #
Token ID1303
Feature activation-9.59e+00

Pos logit contributions
ellen+2.587
aeda+2.416
ochet+2.339
 therap+2.291
uden+2.287
Neg logit contributions
selves-1.883
 Lords-1.848
 cabinets-1.783
 Desk-1.759
 Islands-1.746
Tokenfemin
Token ID33594
Feature activation+1.93e-01

Pos logit contributions
ellen+7.783
aeda+7.045
ochet+7.011
uden+6.779
 therap+6.718
Neg logit contributions
 Lords-5.624
 cabinets-5.533
selves-5.501
 Islands-5.402
 Tropical-5.383
Tokenin
Token ID259
Feature activation-6.71e-01

Pos logit contributions
selves+0.142
 Lords+0.140
 cabinets+0.135
 coy+0.134
 Desk+0.134
Neg logit contributions
ellen-0.140
aeda-0.130
ochet-0.125
uden-0.123
 therap-0.121
Tokenew
Token ID413
Feature activation-8.96e-01

Pos logit contributions
ellen+0.583
aeda+0.549
ochet+0.529
uden+0.518
 therap+0.509
Neg logit contributions
selves-0.398
 Lords-0.396
 cabinets-0.378
 Desk-0.376
 coy-0.376
Tokenomen
Token ID3674
Feature activation+1.13e-01

Pos logit contributions
ellen+0.697
aeda+0.650
ochet+0.622
uden+0.612
 therap+0.600
Neg logit contributions
selves-0.616
 Lords-0.611
 cabinets-0.586
 coy-0.583
 Desk-0.583
Tokenfor
Token ID1640
Feature activation-1.12e+00

Pos logit contributions
selves+0.075
 Lords+0.075
 cabinets+0.071
 coy+0.071
 Desk+0.071
Neg logit contributions
ellen-0.090
aeda-0.084
ochet-0.081
uden-0.079
 therap-0.078
Token pic
Token ID8301
Feature activation-5.86e+00

Pos logit contributions
selves+0.033
 Lords+0.032
 cabinets+0.030
 coy+0.030
 Desk+0.030
Neg logit contributions
ellen-0.059
aeda-0.055
ochet-0.053
uden-0.053
 therap-0.052
Token.
Token ID13
Feature activation-7.03e+00

Pos logit contributions
ellen+3.848
ochet+3.471
aeda+3.324
uden+3.153
Ca+3.104
Neg logit contributions
selves-4.453
 Lords-4.414
 coy-4.397
 cabinets-4.361
 Desk-4.302
Tokentwitter
Token ID6956
Feature activation-9.24e+00

Pos logit contributions
ellen+1.211
ochet+0.675
aeda+0.630
uden+0.457
ryan+0.324
Neg logit contributions
 Lords-8.742
selves-8.724
 cabinets-8.683
 Desk-8.661
 coy-8.597
Token.
Token ID13
Feature activation-6.28e+00

Pos logit contributions
ellen+5.357
ochet+4.709
aeda+4.396
Ca+4.261
ryan+4.247
Neg logit contributions
 coy-7.341
 Lords-7.313
 cabinets-7.311
selves-7.252
 stairs-7.186
Tokencom
Token ID785
Feature activation-8.46e+00

Pos logit contributions
ellen+3.905
aeda+3.481
ochet+3.411
uden+3.198
Detailed+3.112
Neg logit contributions
 Lords-5.041
selves-5.016
 cabinets-4.986
 coy-4.941
 Desk-4.882
Token/
Token ID14
Feature activation-9.46e+00

Pos logit contributions
ellen+5.748
ochet+5.267
aeda+5.050
Ca+4.938
Detailed+4.906
Neg logit contributions
selves-5.796
 coy-5.703
 cabinets-5.676
 Lords-5.670
 stairs-5.602
TokenEh
Token ID43894
Feature activation-4.21e+00

Pos logit contributions
ellen+6.773
aeda+6.233
ochet+5.987
uden+5.882
Ca+5.683
Neg logit contributions
selves-6.668
 Lords-6.497
 cabinets-6.388
 Desk-6.288
 Shiva-6.265
TokenU
Token ID52
Feature activation-3.07e+00

Pos logit contributions
ellen+3.104
aeda+2.875
ochet+2.757
uden+2.709
 therap+2.571
Neg logit contributions
selves-3.038
 Lords-2.999
 cabinets-2.915
 Desk-2.890
 coy-2.884
Tokeng
Token ID70
Feature activation+4.69e+00

Pos logit contributions
ellen+2.240
aeda+2.083
ochet+1.991
uden+1.955
 therap+1.882
Neg logit contributions
selves-2.271
 Lords-2.241
 cabinets-2.164
 coy-2.153
 Desk-2.146
TokenI
Token ID40
Feature activation-1.56e+00

Pos logit contributions
selves+3.285
 Lords+3.277
 coy+3.120
 Islands+3.109
 cabinets+3.106
Neg logit contributions
ellen-3.429
aeda-3.211
 therap-3.066
ochet-3.050
uden-2.991
Tokenx
Token ID87
Feature activation+5.19e+00

Pos logit contributions
ellen+1.137
aeda+1.057
ochet+1.011
uden+0.992
 therap+0.965
Neg logit contributions
selves-1.156
 Lords-1.139
 cabinets-1.099
 coy-1.095
 Desk-1.092
Token with
Token ID351
Feature activation-8.05e-01

Pos logit contributions
ellen+0.906
aeda+0.843
ochet+0.816
uden+0.803
 therap+0.790
Neg logit contributions
selves-0.729
 Lords-0.714
 Desk-0.682
 Islands-0.682
 cabinets-0.682
Token an
Token ID281
Feature activation-1.30e+00

Pos logit contributions
ellen+0.743
aeda+0.701
ochet+0.678
uden+0.667
 therap+0.659
Neg logit contributions
selves-0.435
 Lords-0.428
 cabinets-0.405
 coy-0.404
 Desk-0.403
Token army
Token ID5428
Feature activation-4.33e+00

Pos logit contributions
ellen+1.064
aeda+0.992
ochet+0.954
uden+0.944
 therap+0.926
Neg logit contributions
selves-0.852
 Lords-0.830
 cabinets-0.789
 Desk-0.789
 coy-0.786
Token general
Token ID2276
Feature activation-2.83e+00

Pos logit contributions
ellen+3.437
aeda+3.195
ochet+3.081
uden+3.044
 therap+3.008
Neg logit contributions
selves-2.877
 Lords-2.774
 Islands-2.651
 cabinets-2.642
 Desk-2.633
Token;
Token ID26
Feature activation+4.99e-01

Pos logit contributions
ellen+2.390
aeda+2.237
ochet+2.171
uden+2.127
 therap+2.083
Neg logit contributions
selves-1.751
 Lords-1.710
 Islands-1.629
 cabinets-1.628
 Desk-1.627
Token Mr
Token ID1770
Feature activation+5.80e+00

Pos logit contributions
selves+0.266
 Lords+0.261
 cabinets+0.246
 coy+0.246
 Desk+0.245
Neg logit contributions
ellen-0.465
aeda-0.439
ochet-0.425
uden-0.419
 therap-0.413
Token.
Token ID13
Feature activation+2.00e+00

Pos logit contributions
selves+3.383
 Lords+3.340
 Islands+3.165
 cabinets+3.151
 Desk+3.141
Neg logit contributions
ellen-5.034
aeda-4.795
 therap-4.742
ochet-4.558
uden-4.538
Token Ram
Token ID7431
Feature activation+1.83e+00

Pos logit contributions
selves+1.204
 Lords+1.198
 Desk+1.137
 cabinets+1.136
 coy+1.130
Neg logit contributions
ellen-1.724
aeda-1.624
ochet-1.555
uden-1.540
 therap-1.538
Tokení
Token ID8836
Feature activation+4.93e-01

Pos logit contributions
selves+0.699
 Lords+0.685
 cabinets+0.634
 coy+0.631
 Desk+0.628
Neg logit contributions
ellen-1.980
aeda-1.886
ochet-1.832
uden-1.810
 therap-1.784
Tokenrez
Token ID21107
Feature activation+1.60e+00

Pos logit contributions
selves+0.236
 Lords+0.231
 cabinets+0.219
 Desk+0.219
 coy+0.217
Neg logit contributions
ellen-0.487
aeda-0.461
ochet-0.448
uden-0.442
 therap-0.426
Token also
Token ID635
Feature activation+1.05e+00

Pos logit contributions
selves+0.865
 Lords+0.852
 cabinets+0.806
 coy+0.805
 Desk+0.801
Neg logit contributions
ellen-1.478
aeda-1.395
ochet-1.348
uden-1.329
 therap-1.313
Token,
Token ID11
Feature activation+7.23e-01

Pos logit contributions
selves+1.004
 Lords+0.994
 coy+0.935
 cabinets+0.934
 Desk+0.926
Neg logit contributions
ellen-2.235
aeda-2.122
ochet-2.061
uden-2.040
 therap-2.018
Token repeat
Token ID9585
Feature activation+2.26e+00

Pos logit contributions
selves+0.513
 Lords+0.508
 cabinets+0.487
 coy+0.487
 Desk+0.485
Neg logit contributions
ellen-0.545
aeda-0.507
ochet-0.486
uden-0.478
 therap-0.470
Tokenability
Token ID1799
Feature activation+1.62e+00

Pos logit contributions
selves+1.321
 Lords+1.303
 coy+1.238
 cabinets+1.237
 Desk+1.235
Neg logit contributions
ellen-2.001
aeda-1.887
ochet-1.817
uden-1.788
 therap-1.746
Token,
Token ID11
Feature activation+1.10e+00

Pos logit contributions
selves+0.738
 Lords+0.728
 cabinets+0.684
 coy+0.683
 Desk+0.675
Neg logit contributions
ellen-1.625
aeda-1.544
ochet-1.496
uden-1.479
 therap-1.463
Token and
Token ID290
Feature activation+1.21e+00

Pos logit contributions
selves+0.656
 Lords+0.648
 coy+0.618
 cabinets+0.617
 Desk+0.612
Neg logit contributions
ellen-0.962
aeda-0.902
ochet-0.871
uden-0.858
 therap-0.845
Token surviv
Token ID10132
Feature activation+6.12e+00

Pos logit contributions
selves+0.929
 Lords+0.920
 coy+0.885
 cabinets+0.885
 Desk+0.880
Neg logit contributions
ellen-0.843
aeda-0.778
ochet-0.743
uden-0.730
 therap-0.715
Tokenability
Token ID1799
Feature activation+3.81e+00

Pos logit contributions
selves+3.072
 Lords+2.996
 cabinets+2.810
 Islands+2.808
 coy+2.802
Neg logit contributions
ellen-5.765
aeda-5.434
 therap-5.407
ochet-5.276
uden-5.242
Token)
Token ID8
Feature activation+2.03e-01

Pos logit contributions
selves+1.693
 Lords+1.669
 cabinets+1.572
 coy+1.565
 Islands+1.535
Neg logit contributions
ellen-3.842
aeda-3.660
ochet-3.533
uden-3.505
 therap-3.462
Token become
Token ID1716
Feature activation-1.55e+00

Pos logit contributions
selves+0.122
 Lords+0.121
 Desk+0.115
 cabinets+0.115
 coy+0.114
Neg logit contributions
ellen-0.174
aeda-0.164
ochet-0.158
uden-0.155
 therap-0.153
Token embedded
Token ID14553
Feature activation-2.63e-02

Pos logit contributions
ellen+1.363
aeda+1.285
ochet+1.239
uden+1.218
 therap+1.212
Neg logit contributions
selves-0.891
 Lords-0.876
 Desk-0.837
 Islands-0.831
 cabinets-0.829
Token within
Token ID1626
Feature activation-1.26e+00

Pos logit contributions
ellen+0.026
aeda+0.024
ochet+0.024
uden+0.023
 therap+0.023
Neg logit contributions
selves-0.013
 Lords-0.013
 Desk-0.012
 cabinets-0.012
 coy-0.012
Token conclusion
Token ID7664
Feature activation+5.26e-01

Pos logit contributions
ellen+4.829
aeda+4.585
ochet+4.519
uden+4.342
 therap+4.308
Neg logit contributions
selves-3.429
 Lords-3.274
 Desk-3.184
 Islands-3.143
 coy-3.097
Token.
Token ID13
Feature activation+1.22e-01

Pos logit contributions
selves+0.308
 Lords+0.304
 cabinets+0.289
 coy+0.289
 Desk+0.287
Neg logit contributions
ellen-0.463
aeda-0.435
ochet-0.420
uden-0.414
 therap-0.408
Token While
Token ID2893
Feature activation+2.58e+00

Pos logit contributions
selves+0.067
 Lords+0.064
 cabinets+0.063
 coy+0.063
 Desk+0.061
Neg logit contributions
ellen-0.111
aeda-0.105
ochet-0.102
uden-0.099
 therap-0.096
Token all
Token ID477
Feature activation-3.47e-01

Pos logit contributions
selves+1.377
 Lords+1.364
 cabinets+1.285
 coy+1.285
 Desk+1.280
Neg logit contributions
ellen-2.401
aeda-2.266
ochet-2.188
uden-2.158
 therap-2.135
Token three
Token ID1115
Feature activation+1.93e+00

Pos logit contributions
ellen+0.308
aeda+0.292
ochet+0.283
uden+0.278
 therap+0.274
Neg logit contributions
selves-0.199
 Lords-0.191
 cabinets-0.183
 Desk-0.183
 coy-0.183
Token titles
Token ID8714
Feature activation+6.37e+00

Pos logit contributions
selves+1.068
 Lords+1.053
 cabinets+0.998
 coy+0.996
 Desk+0.992
Neg logit contributions
ellen-1.758
aeda-1.658
ochet-1.601
uden-1.578
 therap-1.559
Token t
Token ID256
Feature activation+4.99e-01

Pos logit contributions
selves+3.698
 Lords+3.675
 coy+3.549
 cabinets+3.496
 Islands+3.454
Neg logit contributions
ellen-5.550
aeda-5.246
 therap-4.953
ochet-4.934
uden-4.931
Tokenipt
Token ID10257
Feature activation-2.74e+00

Pos logit contributions
selves+0.354
 Lords+0.350
 cabinets+0.336
 coy+0.336
 Desk+0.334
Neg logit contributions
ellen-0.376
aeda-0.350
ochet-0.335
uden-0.329
 therap-0.325
Tokenoe
Token ID2577
Feature activation-1.72e+00

Pos logit contributions
ellen+1.851
ochet+1.639
aeda+1.638
uden+1.587
 therap+1.497
Neg logit contributions
selves-2.177
 Lords-2.154
 cabinets-2.090
 Islands-2.066
 Desk-2.065
Token into
Token ID656
Feature activation+6.96e-01

Pos logit contributions
ellen+1.407
aeda+1.311
ochet+1.274
uden+1.248
 therap+1.231
Neg logit contributions
selves-1.095
 Lords-1.077
 Desk-1.034
 cabinets-1.034
 Islands-1.023
Token min
Token ID949
Feature activation-1.38e+00

Pos logit contributions
selves+0.388
 Lords+0.380
 cabinets+0.361
 coy+0.360
 Desk+0.359
Neg logit contributions
ellen-0.632
aeda-0.596
ochet-0.575
uden-0.567
 therap-0.561
Token
Token ID564
Feature activation-5.95e-01

Pos logit contributions
selves+0.730
 Lords+0.719
 cabinets+0.684
 coy+0.682
 Desk+0.680
Neg logit contributions
ellen-1.106
aeda-1.042
ochet-1.003
uden-0.989
 therap-0.978
Token
Token ID250
Feature activation-8.41e-01

Pos logit contributions
ellen+0.471
aeda+0.441
ochet+0.422
uden+0.416
 therap+0.410
Neg logit contributions
selves-0.399
 Lords-0.395
 cabinets-0.379
 coy-0.378
 Desk-0.377
Tokennorm
Token ID27237
Feature activation-3.17e+00

Pos logit contributions
ellen+0.653
aeda+0.609
ochet+0.582
uden+0.572
 therap+0.563
Neg logit contributions
selves-0.578
 Lords-0.574
 cabinets-0.551
 coy-0.549
 Desk-0.548
Tokenally
Token ID453
Feature activation-2.53e+00

Pos logit contributions
ellen+2.669
aeda+2.496
ochet+2.366
uden+2.343
 therap+2.307
Neg logit contributions
 Lords-1.940
selves-1.935
 coy-1.856
 cabinets-1.850
 Islands-1.842
Token protective
Token ID14153
Feature activation+1.07e+00

Pos logit contributions
ellen+2.200
aeda+2.078
ochet+1.976
uden+1.946
 therap+1.944
Neg logit contributions
selves-1.475
 Lords-1.454
 cabinets-1.398
 Desk-1.379
 Islands-1.372
Token lact
Token ID26222
Feature activation+6.34e+00

Pos logit contributions
selves+0.613
 Lords+0.603
 cabinets+0.573
 coy+0.571
 Desk+0.570
Neg logit contributions
ellen-0.946
aeda-0.892
ochet-0.860
uden-0.847
 therap-0.839
Tokenob
Token ID672
Feature activation+5.03e-01

Pos logit contributions
selves+3.478
 Lords+3.413
 coy+3.238
 Desk+3.233
 Islands+3.218
Neg logit contributions
ellen-5.528
aeda-5.232
 therap-5.132
ochet-5.089
uden-5.001
Tokenac
Token ID330
Feature activation+3.39e+00

Pos logit contributions
selves+0.402
 Lords+0.399
 cabinets+0.387
 Desk+0.384
 coy+0.384
Neg logit contributions
ellen-0.336
aeda-0.312
ochet-0.293
uden-0.290
 therap-0.280
Tokenilli
Token ID50173
Feature activation-2.41e+00

Pos logit contributions
selves+2.582
 Lords+2.542
 coy+2.451
 cabinets+2.448
 Islands+2.441
Neg logit contributions
ellen-2.338
aeda-2.172
ochet-2.076
uden-2.039
 therap-2.030
Token
Token ID447
Feature activation+1.45e+00

Pos logit contributions
ellen+2.220
aeda+2.107
ochet+2.020
uden+1.991
 therap+1.980
Neg logit contributions
selves-1.276
 Lords-1.246
 coy-1.184
 cabinets-1.184
 Desk-1.177
Token
Token ID251
Feature activation+2.52e+00

Pos logit contributions
selves+0.966
 Lords+0.953
 cabinets+0.912
 coy+0.910
 Desk+0.906
Neg logit contributions
ellen-1.159
aeda-1.079
ochet-1.040
uden-1.022
 therap-1.009
Token,
Token ID11
Feature activation-1.03e+00

Pos logit contributions
ellen+0.957
aeda+0.905
ochet+0.877
uden+0.865
 therap+0.849
Neg logit contributions
selves-0.549
 Lords-0.525
 Desk-0.500
 Islands-0.500
 coy-0.499
Token procedural
Token ID27931
Feature activation-2.03e-01

Pos logit contributions
ellen+0.925
aeda+0.874
ochet+0.848
uden+0.835
 therap+0.817
Neg logit contributions
selves-0.576
 Lords-0.551
 Desk-0.526
 coy-0.525
 Islands-0.524
Token maps
Token ID8739
Feature activation+2.67e+00

Pos logit contributions
ellen+0.174
aeda+0.164
ochet+0.159
uden+0.156
 therap+0.154
Neg logit contributions
selves-0.122
 Lords-0.118
 Desk-0.113
 coy-0.113
 cabinets-0.113
Token were
Token ID547
Feature activation-1.89e+00

Pos logit contributions
selves+1.464
 Lords+1.442
 cabinets+1.368
 coy+1.364
 Desk+1.354
Neg logit contributions
ellen-2.440
aeda-2.302
ochet-2.222
uden-2.190
 therap-2.169
Token already
Token ID1541
Feature activation-1.31e+00

Pos logit contributions
ellen+1.659
aeda+1.562
ochet+1.510
uden+1.485
 therap+1.457
Neg logit contributions
selves-1.082
 Lords-1.063
 Desk-1.021
 cabinets-1.007
 Islands-1.002
Token bog
Token ID22922
Feature activation+6.60e+00

Pos logit contributions
ellen+1.166
aeda+1.103
ochet+1.060
uden+1.047
 therap+1.030
Neg logit contributions
selves-0.745
 Lords-0.728
 Desk-0.696
 cabinets-0.692
 coy-0.685
Tokenging
Token ID2667
Feature activation-3.07e+00

Pos logit contributions
selves+2.859
 Lords+2.741
 coy+2.632
 cabinets+2.591
 Unic+2.522
Neg logit contributions
ellen-6.689
aeda-6.394
ochet-6.104
 therap-6.057
uden-5.942
Token us
Token ID514
Feature activation-4.26e+00

Pos logit contributions
ellen+2.530
aeda+2.402
ochet+2.334
uden+2.281
 therap+2.245
Neg logit contributions
selves-1.867
 Lords-1.831
 Desk-1.764
 cabinets-1.750
 Islands-1.750
Token in
Token ID287
Feature activation+2.95e-01

Pos logit contributions
ellen+3.486
aeda+3.266
ochet+3.170
uden+3.106
 therap+3.063
Neg logit contributions
 Lords-2.601
selves-2.582
 cabinets-2.530
 Desk-2.525
 Islands-2.502
Token two
Token ID734
Feature activation-1.27e+00

Pos logit contributions
selves+0.165
 Lords+0.162
 cabinets+0.154
 coy+0.154
 Desk+0.154
Neg logit contributions
ellen-0.266
aeda-0.251
ochet-0.242
uden-0.239
 therap-0.236
Token factors
Token ID5087
Feature activation-2.08e-01

Pos logit contributions
ellen+1.108
aeda+1.047
ochet+1.004
uden+0.994
 therap+0.979
Neg logit contributions
selves-0.736
 Lords-0.720
 Desk-0.691
 coy-0.684
 cabinets-0.682
Tokenong
Token ID506
Feature activation-5.17e-01

Pos logit contributions
selves+1.293
 Lords+1.276
 cabinets+1.215
 coy+1.213
 Desk+1.210
Neg logit contributions
ellen-1.831
aeda-1.719
ochet-1.657
uden-1.632
 therap-1.606
Token,
Token ID11
Feature activation-2.11e+00

Pos logit contributions
ellen+0.496
aeda+0.468
ochet+0.453
uden+0.447
 therap+0.442
Neg logit contributions
selves-0.261
 Lords-0.257
 cabinets-0.242
 Desk-0.240
 coy-0.240
Token New
Token ID968
Feature activation+1.60e+00

Pos logit contributions
ellen+1.894
aeda+1.791
ochet+1.751
uden+1.709
 therap+1.703
Neg logit contributions
selves-1.170
 Lords-1.121
 Desk-1.073
 cabinets-1.060
 Islands-1.054
Token Orleans
Token ID12255
Feature activation-2.95e-01

Pos logit contributions
selves+0.647
 Lords+0.641
 cabinets+0.592
 coy+0.588
 Islands+0.585
Neg logit contributions
ellen-1.690
aeda-1.613
ochet-1.565
uden-1.546
 therap-1.530
Token is
Token ID318
Feature activation-8.90e-01

Pos logit contributions
ellen+0.263
aeda+0.248
ochet+0.239
uden+0.236
 therap+0.233
Neg logit contributions
selves-0.169
 Lords-0.165
 cabinets-0.157
 Desk-0.157
 coy-0.157
Token a
Token ID257
Feature activation+5.02e-01

Pos logit contributions
ellen+0.816
aeda+0.771
ochet+0.748
uden+0.734
 therap+0.729
Neg logit contributions
selves-0.482
 Lords-0.472
 Desk-0.451
 cabinets-0.449
 Islands-0.443
Token big
Token ID1263
Feature activation+1.85e+00

Pos logit contributions
selves+0.287
 Lords+0.281
 cabinets+0.268
 Desk+0.267
 coy+0.266
Neg logit contributions
ellen-0.448
aeda-0.422
ochet-0.408
uden-0.400
 therap-0.397
Token hub
Token ID12575
Feature activation-9.80e-01

Pos logit contributions
selves+0.997
 Lords+0.983
 cabinets+0.930
 coy+0.930
 Desk+0.924
Neg logit contributions
ellen-1.710
aeda-1.614
ochet-1.559
uden-1.538
 therap-1.519
Token for
Token ID329
Feature activation-1.07e+00

Pos logit contributions
ellen+0.923
aeda+0.873
ochet+0.844
uden+0.830
 therap+0.820
Neg logit contributions
selves-0.512
 Lords-0.505
 cabinets-0.477
 Desk-0.476
 coy-0.475
Token Vietnamese
Token ID23618
Feature activation-4.03e+00

Pos logit contributions
ellen+0.923
aeda+0.876
ochet+0.847
 therap+0.828
uden+0.825
Neg logit contributions
selves-0.640
 Lords-0.618
 Desk-0.592
 cabinets-0.586
 coy-0.583
Token families
Token ID4172
Feature activation-2.96e+00

Pos logit contributions
ellen+3.346
aeda+3.190
ochet+3.093
 therap+3.012
uden+2.988
Neg logit contributions
selves-2.461
 Lords-2.342
 Desk-2.251
 coy-2.225
 Standing-2.215
Token toys
Token ID14958
Feature activation+2.20e+00

Pos logit contributions
ellen+2.351
aeda+2.217
ochet+2.166
 therap+2.143
uden+2.125
Neg logit contributions
selves-1.047
 Lords-1.014
 Islands-0.943
 Desk-0.941
 coy-0.935
Token,
Token ID11
Feature activation-7.65e-01

Pos logit contributions
selves+1.086
 Lords+1.072
 cabinets+1.012
 coy+1.008
 Desk+1.001
Neg logit contributions
ellen-2.139
aeda-2.030
ochet-1.956
uden-1.935
 therap-1.913
Token there
Token ID612
Feature activation-1.92e+00

Pos logit contributions
ellen+0.729
aeda+0.686
ochet+0.672
uden+0.657
 therap+0.652
Neg logit contributions
selves-0.387
 Lords-0.377
 coy-0.354
 Desk-0.354
 Islands-0.354
Token is
Token ID318
Feature activation+8.88e-01

Pos logit contributions
ellen+1.686
aeda+1.575
ochet+1.522
uden+1.494
 therap+1.473
Neg logit contributions
selves-1.123
 Lords-1.112
 Desk-1.056
 cabinets-1.050
 coy-1.047
Token something
Token ID1223
Feature activation+1.74e-01

Pos logit contributions
selves+0.444
 Lords+0.435
 cabinets+0.409
 Desk+0.408
 coy+0.408
Neg logit contributions
ellen-0.857
aeda-0.810
ochet-0.784
uden-0.774
 therap-0.764
Token for
Token ID329
Feature activation-3.79e-01

Pos logit contributions
selves+0.091
 Lords+0.090
 Desk+0.085
 cabinets+0.084
 Islands+0.084
Neg logit contributions
ellen-0.164
aeda-0.154
ochet-0.150
uden-0.147
 therap-0.146
Token every
Token ID790
Feature activation-1.87e+00

Pos logit contributions
ellen+0.317
aeda+0.298
ochet+0.289
uden+0.282
 therap+0.281
Neg logit contributions
selves-0.236
 Lords-0.232
 Desk-0.221
 cabinets-0.220
 Islands-0.220
Token kid
Token ID5141
Feature activation-1.90e+00

Pos logit contributions
ellen+1.588
aeda+1.492
ochet+1.441
uden+1.414
 therap+1.401
Neg logit contributions
selves-1.137
 Lords-1.121
 cabinets-1.062
 coy-1.060
 Islands-1.059
Token.
Token ID13
Feature activation-1.54e+00

Pos logit contributions
ellen+1.648
aeda+1.541
ochet+1.498
uden+1.468
 therap+1.450
Neg logit contributions
selves-1.116
 Lords-1.107
 cabinets-1.052
 coy-1.052
 Desk-1.048
TokenSome
Token ID4366
Feature activation-1.31e+00

Pos logit contributions
ellen+1.368
aeda+1.280
ochet+1.248
uden+1.210
 therap+1.180
Neg logit contributions
selves-0.863
 Lords-0.851
 coy-0.828
 cabinets-0.821
 Desk-0.799
Token toys
Token ID14958
Feature activation+2.78e+00

Pos logit contributions
ellen+1.245
aeda+1.174
ochet+1.140
uden+1.123
 therap+1.112
Neg logit contributions
selves-0.664
 Lords-0.652
 coy-0.615
 cabinets-0.613
 Desk-0.613
Token gets
Token ID3011
Feature activation-8.32e-01

Pos logit contributions
ellen+0.605
aeda+0.570
ochet+0.555
uden+0.545
 therap+0.538
Neg logit contributions
selves-0.332
 Lords-0.327
 cabinets-0.307
 Desk-0.306
 coy-0.306
Token in
Token ID287
Feature activation+1.43e-01

Pos logit contributions
ellen+0.750
aeda+0.705
ochet+0.688
uden+0.672
 therap+0.671
Neg logit contributions
selves-0.463
 Lords-0.453
 cabinets-0.428
 Desk-0.423
 coy-0.423
Token the
Token ID262
Feature activation-1.23e+00

Pos logit contributions
selves+0.074
 Lords+0.073
 cabinets+0.069
 coy+0.069
 Desk+0.069
Neg logit contributions
ellen-0.136
aeda-0.128
ochet-0.124
uden-0.122
 therap-0.121
Token way
Token ID835
Feature activation+5.58e+00

Pos logit contributions
ellen+0.953
aeda+0.888
ochet+0.853
uden+0.836
 therap+0.827
Neg logit contributions
selves-0.849
 Lords-0.835
 cabinets-0.801
 coy-0.801
 Desk-0.796
Token,
Token ID11
Feature activation+7.82e-01

Pos logit contributions
selves+2.750
 Lords+2.731
 coy+2.621
 cabinets+2.590
 Desk+2.562
Neg logit contributions
ellen-5.325
aeda-5.089
ochet-4.824
uden-4.783
 therap-4.773
Token as
Token ID355
Feature activation+1.34e+00

Pos logit contributions
selves+0.419
 Lords+0.410
 cabinets+0.386
 coy+0.386
 Desk+0.385
Neg logit contributions
ellen-0.724
aeda-0.681
ochet-0.661
uden-0.652
 therap-0.644
Token it
Token ID340
Feature activation+8.80e-01

Pos logit contributions
selves+0.757
 Lords+0.749
 cabinets+0.711
 Desk+0.709
 coy+0.709
Neg logit contributions
ellen-1.208
aeda-1.139
ochet-1.098
uden-1.084
 therap-1.070
Token's
Token ID338
Feature activation+1.00e+00

Pos logit contributions
selves+0.507
 Lords+0.499
 cabinets+0.472
 Desk+0.471
 coy+0.470
Neg logit contributions
ellen-0.782
aeda-0.734
ochet-0.711
uden-0.699
 therap-0.689
Token possible
Token ID1744
Feature activation+2.46e+00

Pos logit contributions
selves+0.570
 Lords+0.562
 cabinets+0.533
 coy+0.532
 Desk+0.530
Neg logit contributions
ellen-0.901
aeda-0.849
ochet-0.820
uden-0.807
 therap-0.798
Token for
Token ID329
Feature activation+1.28e+00

Pos logit contributions
selves+1.138
 Lords+1.123
 cabinets+1.066
 Desk+1.066
 coy+1.060
Neg logit contributions
ellen-2.446
aeda-2.343
ochet-2.253
uden-2.237
 therap-2.206
Token the
Token ID262
Feature activation+2.96e-01

Pos logit contributions
selves+0.673
 Lords+0.665
 cabinets+0.629
 coy+0.627
 Desk+0.626
Neg logit contributions
ellen-1.202
aeda-1.137
ochet-1.099
uden-1.085
 therap-1.071
Token one
Token ID530
Feature activation-4.69e+00

Pos logit contributions
ellen+1.997
aeda+1.883
ochet+1.824
uden+1.780
 therap+1.755
Neg logit contributions
selves-1.292
 Lords-1.248
 cabinets-1.192
 coy-1.190
 Islands-1.182
Token sign
Token ID1051
Feature activation-3.90e+00

Pos logit contributions
ellen+3.811
aeda+3.589
ochet+3.474
uden+3.407
 therap+3.347
Neg logit contributions
selves-2.932
 Lords-2.842
 cabinets-2.760
 Desk-2.723
 Shiva-2.695
Token of
Token ID286
Feature activation-3.37e+00

Pos logit contributions
ellen+3.551
aeda+3.342
ochet+3.239
uden+3.186
 therap+3.144
Neg logit contributions
selves-2.150
 Lords-2.102
 cabinets-2.008
 Islands-1.992
 Desk-1.991
Token the
Token ID262
Feature activation-1.85e+00

Pos logit contributions
ellen+2.884
aeda+2.730
ochet+2.622
uden+2.601
 therap+2.549
Neg logit contributions
selves-2.004
 Lords-1.946
 Desk-1.865
 cabinets-1.861
 Islands-1.853
Token government
Token ID1230
Feature activation-1.74e-01

Pos logit contributions
ellen+1.476
aeda+1.382
ochet+1.321
uden+1.312
 therap+1.290
Neg logit contributions
selves-1.225
 Lords-1.200
 Desk-1.151
 cabinets-1.147
 Islands-1.145
Token
Token ID447
Feature activation+9.88e-01

Pos logit contributions
ellen+0.153
aeda+0.144
ochet+0.139
uden+0.137
 therap+0.135
Neg logit contributions
selves-0.102
 Lords-0.100
 cabinets-0.095
 coy-0.095
 Desk-0.095
Token
Token ID247
Feature activation+4.51e-01

Pos logit contributions
selves+0.669
 Lords+0.661
 cabinets+0.634
 coy+0.634
 Desk+0.633
Neg logit contributions
ellen-0.775
aeda-0.727
ochet-0.696
uden-0.685
 therap-0.671
Tokens
Token ID82
Feature activation-2.94e-01

Pos logit contributions
selves+0.292
 Lords+0.287
 coy+0.275
 cabinets+0.275
 Desk+0.273
Neg logit contributions
ellen-0.370
aeda-0.346
ochet-0.333
uden-0.327
 therap-0.324
Token nervous
Token ID10927
Feature activation-5.54e-01

Pos logit contributions
ellen+0.238
aeda+0.223
ochet+0.214
uden+0.211
 therap+0.208
Neg logit contributions
selves-0.192
 Lords-0.190
 cabinets-0.182
 coy-0.181
 Desk-0.181
Tokenness
Token ID1108
Feature activation+1.94e+00

Pos logit contributions
ellen+0.420
aeda+0.388
ochet+0.375
uden+0.367
 therap+0.363
Neg logit contributions
selves-0.391
 Lords-0.387
 cabinets-0.370
 coy-0.369
 Desk-0.369
Token about
Token ID546
Feature activation+1.42e+00

Pos logit contributions
selves+0.926
 Lords+0.910
 cabinets+0.856
 coy+0.856
 Desk+0.848
Neg logit contributions
ellen-1.919
aeda-1.822
ochet-1.762
uden-1.740
 therap-1.720
Token in
Token ID287
Feature activation+5.19e-01

Pos logit contributions
selves+0.826
 Lords+0.814
 coy+0.776
 cabinets+0.775
 Desk+0.771
Neg logit contributions
ellen-1.271
aeda-1.198
ochet-1.155
uden-1.135
 therap-1.121
Token controlling
Token ID12755
Feature activation-3.72e-01

Pos logit contributions
selves+0.303
 Lords+0.299
 cabinets+0.285
 coy+0.284
 Desk+0.283
Neg logit contributions
ellen-0.457
aeda-0.431
ochet-0.415
uden-0.409
 therap-0.404
Token the
Token ID262
Feature activation+1.69e-01

Pos logit contributions
ellen+0.339
aeda+0.320
ochet+0.310
uden+0.305
 therap+0.302
Neg logit contributions
selves-0.203
 Lords-0.201
 cabinets-0.190
 Desk-0.189
 coy-0.188
Token crowd
Token ID4315
Feature activation+3.07e+00

Pos logit contributions
selves+0.096
 Lords+0.095
 cabinets+0.090
 Desk+0.090
 coy+0.089
Neg logit contributions
ellen-0.151
aeda-0.142
ochet-0.137
uden-0.135
 therap-0.134
Token of
Token ID286
Feature activation-7.65e-01

Pos logit contributions
selves+1.664
 Lords+1.635
 coy+1.553
 cabinets+1.545
 Islands+1.525
Neg logit contributions
ellen-2.833
aeda-2.680
ochet-2.577
uden-2.558
 therap-2.521
Token fren
Token ID22832
Feature activation+2.62e+00

Pos logit contributions
ellen+0.660
aeda+0.620
ochet+0.603
uden+0.590
 therap+0.587
Neg logit contributions
selves-0.450
 Lords-0.447
 cabinets-0.423
 Desk-0.423
 coy-0.419
Tokenz
Token ID89
Feature activation+3.27e+00

Pos logit contributions
selves+1.767
 Lords+1.741
 coy+1.667
 cabinets+1.660
 Islands+1.653
Neg logit contributions
ellen-2.051
aeda-1.936
ochet-1.835
uden-1.827
 therap-1.806
Tokenied
Token ID798
Feature activation-1.34e+00

Pos logit contributions
selves+0.946
 Lords+0.923
 cabinets+0.832
 coy+0.823
 Desk+0.815
Neg logit contributions
ellen-3.834
aeda-3.669
ochet-3.572
uden-3.533
 therap-3.506
Token bargain
Token ID22803
Feature activation-6.25e-01

Pos logit contributions
ellen+1.130
aeda+1.061
ochet+1.028
uden+1.007
 therap+1.002
Neg logit contributions
selves-0.808
 Lords-0.801
 Desk-0.763
 cabinets-0.759
 Islands-0.759
Token-
Token ID12
Feature activation+1.43e+00

Pos logit contributions
ellen+0.471
aeda+0.436
ochet+0.420
uden+0.413
 therap+0.408
Neg logit contributions
selves-0.440
 Lords-0.435
 Desk-0.417
 coy-0.415
 cabinets-0.415
Tokenseekers
Token ID47971
Feature activation+1.09e-01

Pos logit contributions
selves+0.887
 Lords+0.874
 cabinets+0.833
 coy+0.832
 Desk+0.827
Neg logit contributions
ellen-1.208
aeda-1.138
ochet-1.094
uden-1.076
 therap-1.066
Token around
Token ID1088
Feature activation+2.63e+00

Pos logit contributions
selves+0.864
 Lords+0.851
 coy+0.806
 cabinets+0.806
 Desk+0.800
Neg logit contributions
ellen-1.464
aeda-1.382
ochet-1.334
uden-1.314
 therap-1.301
Token the
Token ID262
Feature activation+5.58e-01

Pos logit contributions
selves+1.458
 Lords+1.437
 cabinets+1.364
 coy+1.362
 Desk+1.354
Neg logit contributions
ellen-2.390
aeda-2.252
ochet-2.175
uden-2.145
 therap-2.115
Token eyes
Token ID2951
Feature activation-1.53e+00

Pos logit contributions
selves+0.347
 Lords+0.341
 Desk+0.325
 cabinets+0.325
 coy+0.323
Neg logit contributions
ellen-0.468
aeda-0.440
ochet-0.422
uden-0.416
 therap-0.414
Token,
Token ID11
Feature activation-8.02e-01

Pos logit contributions
ellen+1.391
aeda+1.301
ochet+1.259
uden+1.246
 therap+1.213
Neg logit contributions
selves-0.840
 Lords-0.829
 Desk-0.786
 cabinets-0.783
 Islands-0.781
Token nose
Token ID9686
Feature activation-1.99e+00

Pos logit contributions
ellen+0.715
aeda+0.672
ochet+0.649
uden+0.643
 therap+0.635
Neg logit contributions
selves-0.450
 Lords-0.439
 Desk-0.419
 Islands-0.417
 cabinets-0.414
Token and
Token ID290
Feature activation-2.18e+00

Pos logit contributions
ellen+1.888
aeda+1.783
ochet+1.715
uden+1.697
 therap+1.673
Neg logit contributions
selves-1.022
 Lords-1.000
 coy-0.941
 Desk-0.938
 Islands-0.935
Token feet
Token ID3625
Feature activation+5.83e-01

Pos logit contributions
ellen+2.007
aeda+1.892
ochet+1.833
uden+1.813
 therap+1.802
Neg logit contributions
selves-1.155
 Lords-1.120
 Desk-1.069
 Islands-1.068
 coy-1.053
Token are
Token ID389
Feature activation-1.16e+00

Pos logit contributions
selves+0.329
 Lords+0.325
 cabinets+0.308
 Desk+0.307
 coy+0.307
Neg logit contributions
ellen-0.524
aeda-0.493
ochet-0.476
uden-0.469
 therap-0.462
Token the
Token ID262
Feature activation+3.22e-01

Pos logit contributions
ellen+1.056
aeda+0.994
ochet+0.959
uden+0.947
 therap+0.939
Neg logit contributions
selves-0.639
 Lords-0.628
 Desk-0.595
 cabinets-0.594
 Islands-0.590
Token most
Token ID749
Feature activation-1.17e+00

Pos logit contributions
selves+0.184
 Lords+0.180
 Desk+0.172
 cabinets+0.171
 coy+0.170
Neg logit contributions
ellen-0.286
aeda-0.269
ochet-0.259
uden-0.256
 therap-0.253
Token common
Token ID2219
Feature activation-3.41e-01

Pos logit contributions
ellen+0.938
aeda+0.870
ochet+0.838
uden+0.826
 therap+0.818
Neg logit contributions
selves-0.772
 Lords-0.756
 Desk-0.728
 cabinets-0.722
 Islands-0.718
Token be
Token ID307
Feature activation+7.65e-01

Pos logit contributions
selves+0.218
 Lords+0.216
 cabinets+0.206
 coy+0.206
 Desk+0.206
Neg logit contributions
ellen-0.271
aeda-0.253
ochet-0.245
uden-0.240
 therap-0.237
Token killed
Token ID2923
Feature activation+1.82e+00

Pos logit contributions
selves+0.470
 Lords+0.464
 cabinets+0.443
 coy+0.441
 Desk+0.440
Neg logit contributions
ellen-0.650
aeda-0.610
ochet-0.588
uden-0.578
 therap-0.571
Token.
Token ID13
Feature activation-2.00e-01

Pos logit contributions
selves+1.074
 Lords+1.060
 coy+1.007
 cabinets+1.005
 Islands+0.999
Neg logit contributions
ellen-1.598
aeda-1.504
ochet-1.448
uden-1.428
 therap-1.411
Token Sun
Token ID3825
Feature activation-3.13e-01

Pos logit contributions
ellen+0.188
aeda+0.178
ochet+0.172
uden+0.169
 therap+0.166
Neg logit contributions
selves-0.104
 Lords-0.102
 cabinets-0.097
 coy-0.097
 Desk-0.095
Tokenan
Token ID272
Feature activation-3.67e+00

Pos logit contributions
ellen+0.238
aeda+0.221
ochet+0.212
uden+0.208
 therap+0.205
Neg logit contributions
selves-0.222
 Lords-0.219
 cabinets-0.210
 coy-0.210
 Desk-0.209
Token Abu
Token ID13098
Feature activation-3.89e+00

Pos logit contributions
ellen+3.232
aeda+3.024
uden+2.912
ochet+2.899
 therap+2.829
Neg logit contributions
selves-2.138
 Lords-2.038
 cabinets-1.965
 coy-1.935
 Desk-1.929
Token Daw
Token ID17552
Feature activation-3.86e+00

Pos logit contributions
ellen+3.322
aeda+3.153
ochet+2.999
uden+2.976
 therap+2.883
Neg logit contributions
selves-2.339
 Lords-2.271
 cabinets-2.210
 coy-2.167
 Desk-2.138
Tokenud
Token ID463
Feature activation-1.08e+00

Pos logit contributions
ellen+2.486
aeda+2.341
ochet+2.249
uden+2.185
uffer+1.985
Neg logit contributions
selves-3.168
 Lords-3.051
 cabinets-3.021
 Ghostbusters-2.964
 Unic-2.961
Token 14
Token ID1478
Feature activation-3.29e-01

Pos logit contributions
ellen+1.025
aeda+0.970
ochet+0.936
uden+0.927
 therap+0.912
Neg logit contributions
selves-0.557
 Lords-0.538
 cabinets-0.514
 Desk-0.511
 coy-0.508
Token:
Token ID25
Feature activation+1.47e+00

Pos logit contributions
ellen+0.286
aeda+0.269
ochet+0.259
uden+0.256
 therap+0.252
Neg logit contributions
selves-0.196
 Lords-0.193
 cabinets-0.184
 coy-0.183
 Desk-0.183
Token26
Token ID2075
Feature activation+2.39e+00

Pos logit contributions
selves+1.013
 Lords+1.003
 cabinets+0.959
 coy+0.957
 Desk+0.956
Neg logit contributions
ellen-1.136
aeda-1.059
ochet-1.017
uden-1.000
 therap-0.989
Token,
Token ID11
Feature activation-9.46e-01

Pos logit contributions
selves+0.429
 Lords+0.423
 cabinets+0.402
 coy+0.401
 Desk+0.399
Neg logit contributions
ellen-0.630
aeda-0.592
ochet-0.571
uden-0.562
 therap-0.555
Token and
Token ID290
Feature activation+1.03e+00

Pos logit contributions
ellen+0.731
aeda+0.680
ochet+0.654
uden+0.640
 therap+0.633
Neg logit contributions
selves-0.654
 Lords-0.645
 cabinets-0.619
 coy-0.615
 Desk-0.615
Token to
Token ID284
Feature activation-1.29e+00

Pos logit contributions
selves+0.561
 Lords+0.552
 cabinets+0.523
 coy+0.522
 Desk+0.520
Neg logit contributions
ellen-0.944
aeda-0.891
ochet-0.861
uden-0.848
 therap-0.839
Token show
Token ID905
Feature activation+1.54e+00

Pos logit contributions
ellen+1.049
aeda+0.984
ochet+0.954
uden+0.933
 therap+0.931
Neg logit contributions
selves-0.829
 Lords-0.815
 cabinets-0.775
 Desk-0.772
 Islands-0.771
Token there
Token ID612
Feature activation-1.87e+00

Pos logit contributions
selves+0.858
 Lords+0.847
 coy+0.803
 cabinets+0.801
 Islands+0.795
Neg logit contributions
ellen-1.401
aeda-1.320
ochet-1.267
uden-1.262
 therap-1.236
Token is
Token ID318
Feature activation-7.50e-01

Pos logit contributions
ellen+1.434
aeda+1.337
ochet+1.283
uden+1.255
 therap+1.243
Neg logit contributions
selves-1.285
 Lords-1.278
 cabinets-1.226
 Desk-1.226
 coy-1.221
Token a
Token ID257
Feature activation+4.73e-01

Pos logit contributions
ellen+0.702
aeda+0.664
ochet+0.642
uden+0.632
 therap+0.626
Neg logit contributions
selves-0.395
 Lords-0.389
 cabinets-0.367
 Desk-0.366
 coy-0.366
Token global
Token ID3298
Feature activation+2.34e+00

Pos logit contributions
selves+0.301
 Lords+0.297
 cabinets+0.284
 coy+0.284
 Desk+0.283
Neg logit contributions
ellen-0.391
aeda-0.367
ochet-0.353
uden-0.347
 therap-0.342
Token resistance
Token ID6625
Feature activation-2.61e+00

Pos logit contributions
selves+1.537
 Lords+1.524
 coy+1.457
 cabinets+1.449
 Shiva+1.444
Neg logit contributions
ellen-1.899
aeda-1.770
ochet-1.699
uden-1.695
 therap-1.640
Token to
Token ID284
Feature activation+5.87e-01

Pos logit contributions
ellen+2.383
aeda+2.250
ochet+2.176
uden+2.125
 therap+2.119
Neg logit contributions
selves-1.427
 Lords-1.406
 cabinets-1.338
 coy-1.329
 Desk-1.326
Token him
Token ID683
Feature activation-1.42e+00

Pos logit contributions
selves+0.328
 Lords+0.324
 cabinets+0.308
 coy+0.308
 Desk+0.306
Neg logit contributions
ellen-0.532
aeda-0.501
ochet-0.484
uden-0.477
 therap-0.470
Token into
Token ID656
Feature activation+2.81e+00

Pos logit contributions
ellen+0.537
aeda+0.509
ochet+0.492
uden+0.485
 therap+0.480
Neg logit contributions
selves-0.284
 Lords-0.281
 cabinets-0.265
 coy-0.264
 Desk-0.264
Token their
Token ID511
Feature activation+8.50e-01

Pos logit contributions
selves+1.408
 Lords+1.397
 cabinets+1.320
 coy+1.310
 Islands+1.306
Neg logit contributions
ellen-2.690
aeda-2.546
ochet-2.465
uden-2.434
 therap-2.399
Token home
Token ID1363
Feature activation-8.04e-01

Pos logit contributions
selves+0.488
 Lords+0.480
 coy+0.456
 cabinets+0.456
 Desk+0.455
Neg logit contributions
ellen-0.756
aeda-0.712
ochet-0.687
uden-0.677
 therap-0.670
Token to
Token ID284
Feature activation-7.50e-02

Pos logit contributions
ellen+0.717
aeda+0.675
ochet+0.652
uden+0.642
 therap+0.636
Neg logit contributions
selves-0.457
 Lords-0.450
 Desk-0.427
 coy-0.426
 cabinets-0.424
Token investigate
Token ID9161
Feature activation-1.53e+00

Pos logit contributions
ellen+0.063
aeda+0.059
ochet+0.057
uden+0.056
 therap+0.056
Neg logit contributions
selves-0.046
 Lords-0.045
 cabinets-0.043
 Desk-0.043
 coy-0.043
Token the
Token ID262
Feature activation-6.52e-01

Pos logit contributions
ellen+1.404
aeda+1.326
ochet+1.284
uden+1.264
 therap+1.260
Neg logit contributions
selves-0.827
 Lords-0.805
 Desk-0.769
 cabinets-0.761
 coy-0.754
Token activity
Token ID3842
Feature activation+7.79e-01

Pos logit contributions
ellen+0.581
aeda+0.545
ochet+0.527
uden+0.519
 therap+0.517
Neg logit contributions
selves-0.373
 Lords-0.362
 Desk-0.345
 cabinets-0.344
 coy-0.343
Token overnight
Token ID13417
Feature activation+7.36e-01

Pos logit contributions
selves+0.431
 Lords+0.424
 cabinets+0.403
 coy+0.401
 Desk+0.401
Neg logit contributions
ellen-0.709
aeda-0.669
ochet-0.645
uden-0.636
 therap-0.629
Token.
Token ID13
Feature activation+1.09e-01

Pos logit contributions
selves+0.433
 Lords+0.427
 cabinets+0.406
 coy+0.405
 Desk+0.405
Neg logit contributions
ellen-0.645
aeda-0.607
ochet-0.585
uden-0.576
 therap-0.569
Token We
Token ID775
Feature activation+1.11e+00

Pos logit contributions
selves+0.058
 Lords+0.057
 cabinets+0.055
 coy+0.054
 Desk+0.054
Neg logit contributions
ellen-0.101
aeda-0.095
ochet-0.092
uden-0.091
 therap-0.089
Token really
Token ID1107
Feature activation+2.33e+00

Pos logit contributions
selves+0.626
 Lords+0.617
 cabinets+0.587
 Desk+0.584
 coy+0.583
Neg logit contributions
ellen-0.997
aeda-0.938
ochet-0.908
uden-0.893
 therap-0.883
Token/
Token ID14
Feature activation-1.94e-02

Pos logit contributions
ellen+1.316
aeda+1.216
ochet+1.193
uden+1.170
Detailed+1.129
Neg logit contributions
selves-0.912
 Lords-0.881
 coy-0.867
 cabinets-0.855
 Desk-0.850
Token3
Token ID18
Feature activation+7.75e-01

Pos logit contributions
ellen+0.014
aeda+0.013
ochet+0.012
uden+0.012
 therap+0.012
Neg logit contributions
selves-0.014
 Lords-0.014
 cabinets-0.014
 Islands-0.014
 coy-0.014
Token IP
Token ID6101
Feature activation+1.50e+00

Pos logit contributions
selves+0.339
 Lords+0.329
 coy+0.309
 cabinets+0.309
 Desk+0.306
Neg logit contributions
ellen-0.796
aeda-0.756
ochet-0.733
uden-0.723
 therap-0.711
Token\n
Token ID198
Feature activation-1.16e+00

Pos logit contributions
selves+0.688
 Lords+0.682
 coy+0.638
 cabinets+0.636
 Islands+0.631
Neg logit contributions
ellen-1.510
aeda-1.426
ochet-1.385
uden-1.364
 therap-1.352
Token\n
Token ID198
Feature activation-3.95e-01

Pos logit contributions
ellen+1.001
aeda+0.919
ochet+0.915
uden+0.880
Detailed+0.856
Neg logit contributions
selves-0.681
 Lords-0.672
 coy-0.657
 cabinets-0.653
 Desk-0.649
TokenNick
Token ID23609
Feature activation-3.39e+00

Pos logit contributions
ellen+0.282
aeda+0.262
ochet+0.250
uden+0.246
 therap+0.243
Neg logit contributions
selves-0.295
 Lords-0.292
 cabinets-0.281
 coy-0.280
 Desk-0.280
Token Vincent
Token ID18653
Feature activation-1.08e+00

Pos logit contributions
ellen+3.068
aeda+2.867
ochet+2.794
uden+2.727
 therap+2.682
Neg logit contributions
selves-1.901
 Lords-1.831
 coy-1.763
 cabinets-1.761
 Desk-1.736
Token:
Token ID25
Feature activation+9.24e-01

Pos logit contributions
ellen+0.958
aeda+0.903
ochet+0.870
uden+0.858
 therap+0.848
Neg logit contributions
selves-0.624
 Lords-0.618
 coy-0.586
 cabinets-0.586
 Islands-0.583
Token 3
Token ID513
Feature activation+6.26e-01

Pos logit contributions
selves+0.554
 Lords+0.550
 coy+0.522
 cabinets+0.521
 Desk+0.520
Neg logit contributions
ellen-0.797
aeda-0.748
ochet-0.722
uden-0.710
 therap-0.704
Token.
Token ID13
Feature activation-2.66e+00

Pos logit contributions
selves+0.421
 Lords+0.411
 Desk+0.396
 coy+0.396
 cabinets+0.396
Neg logit contributions
ellen-0.498
aeda-0.460
ochet-0.449
uden-0.437
 therap-0.414
Token60
Token ID1899
Feature activation-1.39e+00

Pos logit contributions
ellen+1.869
aeda+1.713
ochet+1.651
uden+1.574
Detailed+1.555
Neg logit contributions
selves-1.981
 Lords-1.943
 cabinets-1.919
 coy-1.918
 Desk-1.873
TokenMD
Token ID12740
Feature activation+3.15e-01

Pos logit contributions
selves+0.318
 Lords+0.313
 coy+0.303
 cabinets+0.300
 Islands+0.300
Neg logit contributions
ellen-0.343
aeda-0.321
ochet-0.306
uden-0.302
 therap-0.289
Token,
Token ID11
Feature activation-1.00e+00

Pos logit contributions
selves+0.155
 Lords+0.152
 cabinets+0.143
 coy+0.143
 Desk+0.142
Neg logit contributions
ellen-0.307
aeda-0.290
ochet-0.281
uden-0.277
 therap-0.274
Token Pruitt
Token ID39637
Feature activation-2.86e+00

Pos logit contributions
ellen+0.921
aeda+0.871
ochet+0.839
uden+0.831
 therap+0.824
Neg logit contributions
selves-0.544
 Lords-0.522
 Desk-0.495
 cabinets-0.494
 coy-0.492
Token sent
Token ID1908
Feature activation-4.35e+00

Pos logit contributions
ellen+2.629
aeda+2.474
ochet+2.361
uden+2.353
 therap+2.337
Neg logit contributions
selves-1.540
 Lords-1.478
 Desk-1.401
 cabinets-1.398
 Islands-1.386
Token an
Token ID281
Feature activation-1.05e+00

Pos logit contributions
ellen+3.894
aeda+3.656
ochet+3.502
uden+3.461
 therap+3.417
Neg logit contributions
selves-2.465
 Lords-2.330
 Desk-2.204
 Islands-2.176
 coy-2.169
Token email
Token ID3053
Feature activation-7.17e+00

Pos logit contributions
ellen+0.718
aeda+0.663
ochet+0.628
uden+0.625
 therap+0.603
Neg logit contributions
selves-0.830
 Lords-0.801
 cabinets-0.775
 Desk-0.771
 coy-0.769
Token to
Token ID284
Feature activation-2.59e+00

Pos logit contributions
ellen+6.004
aeda+5.690
ochet+5.485
uden+5.397
 therap+5.370
Neg logit contributions
selves-4.103
 Lords-4.011
 Desk-3.830
 cabinets-3.782
 Islands-3.748
Token supporters
Token ID5941
Feature activation-2.28e+00

Pos logit contributions
ellen+2.327
aeda+2.188
ochet+2.107
uden+2.074
 therap+2.060
Neg logit contributions
selves-1.448
 Lords-1.391
 coy-1.316
 cabinets-1.309
 Islands-1.308
Token of
Token ID286
Feature activation-1.47e+00

Pos logit contributions
ellen+2.047
aeda+1.917
ochet+1.857
uden+1.840
 therap+1.813
Neg logit contributions
selves-1.268
 Lords-1.240
 cabinets-1.179
 Desk-1.179
 Islands-1.168
Token one
Token ID530
Feature activation-2.73e+00

Pos logit contributions
ellen+1.383
aeda+1.305
ochet+1.255
uden+1.239
 therap+1.224
Neg logit contributions
selves-0.764
 Lords-0.741
 cabinets-0.702
 coy-0.700
 Desk-0.698
Token federal
Token ID2717
Feature activation+6.03e-01

Pos logit contributions
ellen+2.453
aeda+2.303
ochet+2.217
uden+2.192
 therap+2.166
Neg logit contributions
selves-1.542
 Lords-1.507
 cabinets-1.435
 Desk-1.428
 coy-1.418
Token pic
Token ID8301
Feature activation-5.86e+00

Pos logit contributions
selves+0.033
 Lords+0.032
 cabinets+0.030
 coy+0.030
 Desk+0.030
Neg logit contributions
ellen-0.059
aeda-0.055
ochet-0.053
uden-0.053
 therap-0.052
Token.
Token ID13
Feature activation-7.03e+00

Pos logit contributions
ellen+3.848
ochet+3.471
aeda+3.324
uden+3.153
Ca+3.104
Neg logit contributions
selves-4.453
 Lords-4.414
 coy-4.397
 cabinets-4.361
 Desk-4.302
Tokentwitter
Token ID6956
Feature activation-9.24e+00

Pos logit contributions
ellen+1.211
ochet+0.675
aeda+0.630
uden+0.457
ryan+0.324
Neg logit contributions
 Lords-8.742
selves-8.724
 cabinets-8.683
 Desk-8.661
 coy-8.597
Token.
Token ID13
Feature activation-6.28e+00

Pos logit contributions
ellen+5.357
ochet+4.709
aeda+4.396
Ca+4.261
ryan+4.247
Neg logit contributions
 coy-7.341
 Lords-7.313
 cabinets-7.311
selves-7.252
 stairs-7.186
Tokencom
Token ID785
Feature activation-8.46e+00

Pos logit contributions
ellen+3.905
aeda+3.481
ochet+3.411
uden+3.198
Detailed+3.112
Neg logit contributions
 Lords-5.041
selves-5.016
 cabinets-4.986
 coy-4.941
 Desk-4.882
Token/
Token ID14
Feature activation-9.46e+00

Pos logit contributions
ellen+5.748
ochet+5.267
aeda+5.050
Ca+4.938
Detailed+4.906
Neg logit contributions
selves-5.796
 coy-5.703
 cabinets-5.676
 Lords-5.670
 stairs-5.602
TokenEh
Token ID43894
Feature activation-4.21e+00

Pos logit contributions
ellen+6.773
aeda+6.233
ochet+5.987
uden+5.882
Ca+5.683
Neg logit contributions
selves-6.668
 Lords-6.497
 cabinets-6.388
 Desk-6.288
 Shiva-6.265
TokenU
Token ID52
Feature activation-3.07e+00

Pos logit contributions
ellen+3.104
aeda+2.875
ochet+2.757
uden+2.709
 therap+2.571
Neg logit contributions
selves-3.038
 Lords-2.999
 cabinets-2.915
 Desk-2.890
 coy-2.884
Tokeng
Token ID70
Feature activation+4.69e+00

Pos logit contributions
ellen+2.240
aeda+2.083
ochet+1.991
uden+1.955
 therap+1.882
Neg logit contributions
selves-2.271
 Lords-2.241
 cabinets-2.164
 coy-2.153
 Desk-2.146
TokenI
Token ID40
Feature activation-1.56e+00

Pos logit contributions
selves+3.285
 Lords+3.277
 coy+3.120
 Islands+3.109
 cabinets+3.106
Neg logit contributions
ellen-3.429
aeda-3.211
 therap-3.066
ochet-3.050
uden-2.991
Tokenx
Token ID87
Feature activation+5.19e+00

Pos logit contributions
ellen+1.137
aeda+1.057
ochet+1.011
uden+0.992
 therap+0.965
Neg logit contributions
selves-1.156
 Lords-1.139
 cabinets-1.099
 coy-1.095
 Desk-1.092
Token http
Token ID2638
Feature activation-3.97e+00

Pos logit contributions
ellen+0.697
aeda+0.656
ochet+0.634
uden+0.625
 therap+0.618
Neg logit contributions
selves-0.427
 Lords-0.421
 cabinets-0.400
 coy-0.398
 Desk-0.396
Token://
Token ID1378
Feature activation-2.66e+00

Pos logit contributions
ellen+2.103
ochet+1.779
aeda+1.768
uden+1.636
Detailed+1.607
Neg logit contributions
 Lords-3.589
selves-3.575
 coy-3.551
 Desk-3.505
 cabinets-3.497
Tokenwww
Token ID2503
Feature activation-1.66e+00

Pos logit contributions
ellen+1.506
aeda+1.352
ochet+1.279
uden+1.242
 therap+1.205
Neg logit contributions
selves-2.385
 Lords-2.382
 cabinets-2.306
 coy-2.301
 Desk-2.300
Token.
Token ID13
Feature activation-1.48e+00

Pos logit contributions
ellen+1.250
aeda+1.136
ochet+1.115
uden+1.069
Detailed+1.027
Neg logit contributions
selves-1.167
 Lords-1.161
 coy-1.132
 cabinets-1.125
 Desk-1.119
Tokench
Token ID354
Feature activation-4.28e+00

Pos logit contributions
ellen+1.093
aeda+1.013
ochet+0.969
uden+0.951
 therap+0.932
Neg logit contributions
selves-1.078
 Lords-1.072
 cabinets-1.030
 coy-1.027
 Desk-1.025
Tokenucky
Token ID5309
Feature activation-6.89e+00

Pos logit contributions
ellen+2.704
aeda+2.461
ochet+2.350
uden+2.307
 therap+2.107
Neg logit contributions
 Lords-3.577
selves-3.565
 cabinets-3.465
 coy-3.447
 Desk-3.446
Tokene
Token ID68
Feature activation+4.38e-01

Pos logit contributions
ellen+5.460
aeda+5.077
ochet+4.896
uden+4.727
 therap+4.495
Neg logit contributions
 Lords-4.580
 Desk-4.448
selves-4.420
 cabinets-4.373
 coy-4.336
Tokenager
Token ID3536
Feature activation+1.70e+00

Pos logit contributions
selves+0.241
 Lords+0.236
 cabinets+0.224
 coy+0.223
 Islands+0.223
Neg logit contributions
ellen-0.400
aeda-0.377
ochet-0.365
uden-0.359
 therap-0.357
Token.
Token ID13
Feature activation-1.49e+00

Pos logit contributions
selves+1.037
 Lords+1.004
 Islands+0.962
 cabinets+0.959
 coy+0.957
Neg logit contributions
ellen-1.448
aeda-1.358
ochet-1.312
uden-1.299
 therap-1.296
Tokenorg
Token ID2398
Feature activation-2.52e+00

Pos logit contributions
ellen+1.137
aeda+1.058
ochet+1.015
uden+0.997
 therap+0.980
Neg logit contributions
selves-1.049
 Lords-1.038
 cabinets-0.996
 coy-0.994
 Desk-0.992
Token\n
Token ID198
Feature activation-7.79e-01

Pos logit contributions
ellen+2.310
aeda+2.171
ochet+2.109
uden+2.069
 therap+2.046
Neg logit contributions
selves-1.368
 Lords-1.346
 cabinets-1.273
 coy-1.272
 Desk-1.265
Token
Token ID229
Feature activation-1.66e+00

Pos logit contributions
ellen+2.752
aeda+2.509
ochet+2.372
uden+2.268
 therap+2.265
Neg logit contributions
selves-4.847
 Lords-4.743
 cabinets-4.674
 Desk-4.605
 coy-4.582
Token
Token ID118
Feature activation+5.17e-01

Pos logit contributions
ellen+1.145
aeda+1.055
ochet+1.014
uden+0.983
 therap+0.969
Neg logit contributions
selves-1.287
 Lords-1.267
 cabinets-1.226
 coy-1.222
 Desk-1.221
Token
Token ID8582
Feature activation-5.76e+00

Pos logit contributions
selves+0.414
 Lords+0.411
 coy+0.396
 Desk+0.396
 Islands+0.395
Neg logit contributions
ellen-0.342
aeda-0.316
ochet-0.300
uden-0.294
 therap-0.288
Token
Token ID229
Feature activation-2.55e+00

Pos logit contributions
ellen+2.790
aeda+2.486
ochet+2.375
 therap+2.228
uden+2.201
Neg logit contributions
selves-5.491
 Lords-5.382
 cabinets-5.320
 Desk-5.237
 coy-5.233
Token
Token ID116
Feature activation-1.08e-01

Pos logit contributions
ellen+1.672
aeda+1.543
ochet+1.478
uden+1.422
 therap+1.400
Neg logit contributions
selves-2.045
 Lords-2.008
 cabinets-1.954
 coy-1.948
 Desk-1.945
Token (@
Token ID4275
Feature activation-1.03e+01

Pos logit contributions
ellen+0.070
aeda+0.065
ochet+0.062
uden+0.060
 therap+0.059
Neg logit contributions
selves-0.088
 Lords-0.087
 coy-0.084
 cabinets-0.084
 Desk-0.084
Tokenm
Token ID76
Feature activation-1.42e+00

Pos logit contributions
ellen+7.840
aeda+6.972
ochet+6.868
uden+6.740
Detailed+6.597
Neg logit contributions
selves-6.677
 Lords-6.628
 cabinets-6.537
 Desk-6.485
 Shiva-6.398
Tokenf
Token ID69
Feature activation+2.64e-01

Pos logit contributions
ellen+1.089
aeda+1.013
ochet+0.971
uden+0.954
 therap+0.935
Neg logit contributions
selves-0.991
 Lords-0.983
 cabinets-0.943
 coy-0.942
 Desk-0.940
Tokenlynn
Token ID12935
Feature activation+4.34e-01

Pos logit contributions
selves+0.200
 Lords+0.198
 cabinets+0.191
 coy+0.190
 Desk+0.190
Neg logit contributions
ellen-0.187
aeda-0.173
ochet-0.165
uden-0.162
 therap-0.159
TokenJR
Token ID44817
Feature activation+9.39e-01

Pos logit contributions
selves+0.284
 Lords+0.281
 coy+0.268
 cabinets+0.268
 Desk+0.267
Neg logit contributions
ellen-0.350
aeda-0.328
ochet-0.315
uden-0.310
 therap-0.307
Token)
Token ID8
Feature activation-1.78e+00

Pos logit contributions
selves+0.466
 Lords+0.457
 coy+0.432
 Desk+0.431
 cabinets+0.430
Neg logit contributions
ellen-0.904
aeda-0.856
ochet-0.827
uden-0.817
 therap-0.812
Token we
Token ID356
Feature activation-1.15e+00

Pos logit contributions
ellen+0.587
aeda+0.552
ochet+0.536
uden+0.527
 therap+0.517
Neg logit contributions
selves-0.367
 Lords-0.354
 cabinets-0.341
 Desk-0.338
 coy-0.338
Token're
Token ID821
Feature activation+5.86e-01

Pos logit contributions
ellen+0.931
aeda+0.866
ochet+0.860
uden+0.835
 therap+0.811
Neg logit contributions
selves-0.742
 Lords-0.727
 cabinets-0.697
 Shiva-0.693
 Unic-0.692
Token continuing
Token ID8282
Feature activation-3.43e+00

Pos logit contributions
selves+0.324
 Lords+0.316
 cabinets+0.302
 Desk+0.301
 Islands+0.299
Neg logit contributions
ellen-0.531
aeda-0.499
ochet-0.485
uden-0.478
 therap-0.472
Token with
Token ID351
Feature activation-3.07e+00

Pos logit contributions
ellen+2.928
aeda+2.769
ochet+2.677
uden+2.637
 therap+2.586
Neg logit contributions
selves-2.050
 Lords-1.967
 cabinets-1.900
 Desk-1.864
 Islands-1.860
Token maps
Token ID8739
Feature activation+1.68e+00

Pos logit contributions
ellen+2.725
aeda+2.583
ochet+2.506
uden+2.464
 therap+2.408
Neg logit contributions
selves-1.712
 Lords-1.613
 cabinets-1.568
 coy-1.561
 Desk-1.544
Token #
Token ID1303
Feature activation-6.83e+00

Pos logit contributions
selves+0.880
 Lords+0.868
 cabinets+0.819
 coy+0.818
 Desk+0.814
Neg logit contributions
ellen-1.580
aeda-1.494
ochet-1.443
uden-1.423
 therap-1.408
Token85
Token ID5332
Feature activation-1.63e+00

Pos logit contributions
ellen+4.429
aeda+4.036
ochet+3.852
uden+3.708
Detailed+3.582
Neg logit contributions
selves-5.303
 Lords-5.219
 cabinets-5.142
 coy-5.139
 Desk-5.077
Token-
Token ID12
Feature activation-1.12e+00

Pos logit contributions
ellen+1.462
aeda+1.379
ochet+1.330
uden+1.303
 therap+1.283
Neg logit contributions
selves-0.927
 Lords-0.909
 cabinets-0.866
 Desk-0.861
 coy-0.860
Token61
Token ID5333
Feature activation+1.76e+00

Pos logit contributions
ellen+0.856
aeda+0.791
ochet+0.757
uden+0.740
 therap+0.714
Neg logit contributions
selves-0.788
 Lords-0.784
 cabinets-0.756
 coy-0.753
 Desk-0.751
Token,
Token ID11
Feature activation-1.12e-01

Pos logit contributions
selves+0.916
 Lords+0.903
 coy+0.852
 cabinets+0.851
 Desk+0.846
Neg logit contributions
ellen-1.654
aeda-1.563
ochet-1.510
uden-1.491
 therap-1.477
Token and
Token ID290
Feature activation+5.25e-01

Pos logit contributions
ellen+0.102
aeda+0.096
ochet+0.093
uden+0.092
 therap+0.090
Neg logit contributions
selves-0.062
 Lords-0.060
 cabinets-0.057
 Desk-0.057
 coy-0.057