TOP ACTIVATIONS
MAX = 1.674

Marvel 's latest interstellar sci - fi sequel truly is .
and 16 - 35 f / 2 . 8 II .
can be reached at j on . chest o @ gl
30 , or 100 mg / kg ) also showed an
his fel ­ low Demo ­ cr ats . At the
reddit Facebook St umble Upon What
and 16 - 35 f / 2 . 8 II ,
reddit Facebook St umble Upon What
we ll send you three meaningful actions you can
weird , slightly silly sci - fi mil ieu , along
we will take action against those intent on attempting to incite
z us on Twitter @ dan han z us and check
( 3 or 10 mg / kg ), but not flu
offers for programs that support The Nation s journalism
or 16 - 35 f / 2 . 8 II lens
. Carlton s Mick M alth ouse , silver
the 16 - 35 f / 4 IS lens is sharper
and memory ( Intel VT - x , AMD - V
, motor ized wheelchair ( Figure 2 A , [ Action
doses up to 100 mg / kg . Moreover , E

MIN ACTIVATIONS
MIN = -1.404

Arab lands without putting an end to the policy of
Le a Suzuki , The Chronicle 2 of 15 Photo :
uncertainty , but if you pin me down and say ,
tell me they have placed big w agers on India to
with traces of the a Cyr ill ic found in their
iff Tra x Reviewer : V the Happy L
Le a Suzuki , The Chronicle 15 of 15 Photo :
puts all other smooth ies to shame . It
2010 boys beware Reviewer : j wt 0001 -
put ting the finishing touches on a short - story
When she would put them to sleep each night as children
x . x . x . 169 11 66 589 x
, I think Reviewer : B iz r Pix
argument offer , partly , too , because frequent repetition brought
doubts , as the all too familiar and annoying
were play - testing . One was a Thought st eal
bad idea . Reviewer : Mike M anners -
accompanied a report in the magazine that Cameron had told Merkel
in Arkansas . You would think that at some point it
M urd ers on Al cat raz .

INTERVAL 0.905 - 1.674
CONTAINS 0.169%

AS ), Victoria ( V IC ) and Western Australia (
pages New challenge notifications , now showing up from
[ youtube https :// www . youtube . com / watch
- generation Twin Fro z r IV Thermal Design . The
eye - inspired panel survived those harsh conditions while still generating

INTERVAL 0.135 - 0.905
CONTAINS 30.218%

- hour na ï vet é about natural phenomena
all women was the vaginal microbiome . Sur pri
a song for a book , because you re
NC Q A to demand strict control of blood sugar in
others was limited . But the stories of marginalized and oppressed

INTERVAL -0.635 - 0.135
CONTAINS 68.183%

response , I still wasn t fully sold .
, these are the experts speaking out on this ,' M
. But sick leave isn
arguments throughout this case . The smart money would still say
raises a question , yet again , about the supposed socialism

INTERVAL -1.404 - -0.635
CONTAINS 1.431%

go for it ; one of my biggest coaches .
so because it bet rays an ignorance in most things having
drug - related charges . Today , it is closer to
to counting votes , and of course I couldn 't count
Steve Do ocy , Brian Kil me ade and Me gyn
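
NOTE ON THE INTERVALS

The four INTERVAL ranges above look like equal-width bins between MIN (-1.404) and MAX (1.674): each spans roughly 0.77. The sketch below reproduces those boundaries under that assumption. The helper names `equal_width_edges` and `bin_index` are hypothetical and do not come from the tool that produced this listing, and treating the top edge as inclusive is also an assumption.

```python
def equal_width_edges(lo, hi, n_bins):
    """Return n_bins + 1 edges that split [lo, hi] into equal-width bins."""
    width = (hi - lo) / n_bins
    # Pin the last edge to hi exactly to avoid floating-point drift.
    return [lo + i * width for i in range(n_bins)] + [hi]


def bin_index(value, edges):
    """Return the 0-based bin containing value (top edge treated as inclusive)."""
    n_bins = len(edges) - 1
    if not edges[0] <= value <= edges[-1]:
        raise ValueError("value lies outside [MIN, MAX]")
    width = (edges[-1] - edges[0]) / n_bins
    # Clamp so that value == MAX falls into the last bin.
    return min(int((value - edges[0]) / width), n_bins - 1)


# MIN and MAX are taken from the listing; 4 bins is inferred from the
# four INTERVAL blocks (an assumption, not a documented setting).
edges = equal_width_edges(-1.404, 1.674, 4)
for low, high in zip(edges, edges[1:]):
    print(f"INTERVAL {low:.3f} - {high:.3f}")
```

Bin 0 corresponds to the lowest interval and bin 3 to the highest, so an activation of 1.48 (one of the values shown for a "-" token later in the listing) would land in bin 3 under this scheme.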
Token Marvel
Token ID9923
Feature activation-2.08e-02

Pos logit contributions
 Ital+2.098
phabet+2.077
 Fall+1.900
 Alphabet+1.857
istine+1.732
Neg logit contributions
ients-2.226
housing-1.982
 isol-1.977
Module-1.972
 disrupted-1.958
Token's
Token ID338
Feature activation-7.06e-02

Pos logit contributions
 Ital+0.105
phabet+0.105
 Fall+0.092
 Alphabet+0.092
istine+0.086
Neg logit contributions
ients-0.123
 isol-0.111
housing-0.110
 disrupted-0.110
Module-0.108
Token latest
Token ID3452
Feature activation-2.88e-01

Pos logit contributions
phabet+0.407
 Ital+0.407
 Fall+0.365
 Alphabet+0.362
istine+0.345
Neg logit contributions
ients-0.365
 isol-0.325
housing-0.321
 disrupted-0.319
Module-0.314
Token interstellar
Token ID48851
Feature activation+4.18e-01

Pos logit contributions
 Ital+1.673
phabet+1.656
 Fall+1.521
 Alphabet+1.496
istine+1.412
Neg logit contributions
ients-1.474
 isol-1.298
housing-1.297
Module-1.276
 disrupted-1.276
Token sci
Token ID20681
Feature activation-5.97e-02

Pos logit contributions
ients+2.031
 isol+1.856
 disrupted+1.827
 extrad+1.770
 theorist+1.767
Neg logit contributions
phabet-2.541
 Ital-2.489
 Fall-2.196
 Alphabet-2.194
istine-2.121
Token-
Token ID12
Feature activation+1.67e+00

Pos logit contributions
 Ital+0.379
phabet+0.379
 Fall+0.344
 Alphabet+0.341
istine+0.325
Neg logit contributions
ients-0.275
 isol-0.240
housing-0.237
 disrupted-0.235
Module-0.234
Tokenfi
Token ID12463
Feature activation+4.11e-01

Pos logit contributions
ients+4.796
housing+4.263
busters+4.130
operative+4.069
 extrad+4.021
Neg logit contributions
 Ital-12.049
phabet-11.999
 Alphabet-10.891
 Fall-10.856
istine-10.505
Token sequel
Token ID16304
Feature activation+8.33e-02

Pos logit contributions
ients+1.999
 isol+1.828
 disrupted+1.798
housing+1.755
 terr+1.737
Neg logit contributions
phabet-2.477
 Ital-2.442
 Fall-2.159
 Alphabet-2.158
istine-2.082
Token truly
Token ID4988
Feature activation+2.43e-01

Pos logit contributions
ients+0.422
 isol+0.377
housing+0.370
 disrupted+0.369
Module+0.363
Neg logit contributions
phabet-0.488
 Ital-0.488
 Fall-0.439
 Alphabet-0.436
istine-0.414
Token is
Token ID318
Feature activation+4.14e-01

Pos logit contributions
ients+1.228
 isol+1.096
housing+1.077
 disrupted+1.074
Module+1.053
Neg logit contributions
 Ital-1.431
phabet-1.427
 Fall-1.284
 Alphabet-1.276
istine-1.215
Token.
Token ID13
Feature activation-1.09e-03

Pos logit contributions
ients+1.932
 isol+1.727
housing+1.679
 disrupted+1.676
 appealed+1.617
Neg logit contributions
phabet-2.579
 Ital-2.552
 Alphabet-2.298
 Fall-2.295
atron-2.191
Token and
Token ID290
Feature activation-3.70e-02

Pos logit contributions
ients+0.281
 isol+0.257
housing+0.255
 disrupted+0.253
Module+0.251
Neg logit contributions
 Ital-0.178
phabet-0.177
 Fall-0.153
 Alphabet-0.151
istine-0.140
Token 16
Token ID1467
Feature activation+1.55e-01

Pos logit contributions
phabet+0.215
 Ital+0.213
 Fall+0.191
 Alphabet+0.190
istine+0.182
Neg logit contributions
ients-0.189
 isol-0.172
housing-0.167
 disrupted-0.166
Module-0.164
Token-
Token ID12
Feature activation-3.63e-03

Pos logit contributions
ients+0.724
 isol+0.635
housing+0.628
 disrupted+0.620
Module+0.616
Neg logit contributions
 Ital-0.968
phabet-0.967
 Fall-0.880
 Alphabet-0.871
istine-0.830
Token35
Token ID2327
Feature activation-1.82e-01

Pos logit contributions
 Ital+0.018
phabet+0.018
 Fall+0.016
 Alphabet+0.016
istine+0.015
Neg logit contributions
ients-0.022
 isol-0.019
 disrupted-0.019
housing-0.019
Module-0.019
Token f
Token ID277
Feature activation+1.47e-01

Pos logit contributions
 Ital+0.763
phabet+0.753
 Fall+0.663
 Alphabet+0.644
istine+0.597
Neg logit contributions
ients-1.236
housing-1.126
 isol-1.121
 disrupted-1.115
Module-1.109
Token/
Token ID14
Feature activation+1.67e+00

Pos logit contributions
ients+0.783
housing+0.697
 isol+0.696
Module+0.694
 disrupted+0.674
Neg logit contributions
phabet-0.832
 Ital-0.820
 Alphabet-0.718
 Fall-0.717
istine-0.687
Token2
Token ID17
Feature activation+1.11e+00

Pos logit contributions
ients+6.524
housing+5.751
Module+5.724
 isol+5.462
 disrupted+5.260
Neg logit contributions
phabet-10.684
 Ital-10.676
 Alphabet-9.811
 Fall-9.478
istine-9.227
Token.
Token ID13
Feature activation+7.62e-02

Pos logit contributions
ients+4.839
 isol+4.374
housing+4.136
 disrupted+4.112
 appealed+4.014
Neg logit contributions
phabet-6.986
 Ital-6.892
 Alphabet-6.347
 Fall-6.261
istine-6.088
Token8
Token ID23
Feature activation+2.20e-01

Pos logit contributions
ients+0.439
 isol+0.394
housing+0.392
 disrupted+0.389
Module+0.385
Neg logit contributions
 Ital-0.396
phabet-0.392
 Fall-0.354
 Alphabet-0.347
atron-0.326
Token II
Token ID2873
Feature activation+8.78e-02

Pos logit contributions
ients+1.239
 isol+1.144
housing+1.096
 disrupted+1.090
Module+1.079
Neg logit contributions
phabet-1.156
 Ital-1.137
 Fall-1.012
 Alphabet-1.012
istine-0.970
Token.
Token ID13
Feature activation-5.98e-02

Pos logit contributions
ients+0.465
 isol+0.430
housing+0.412
 disrupted+0.408
Module+0.401
Neg logit contributions
phabet-0.490
 Ital-0.485
 Alphabet-0.433
 Fall-0.433
istine-0.413
Token can
Token ID460
Feature activation+6.09e-01

Pos logit contributions
ients+2.244
 isol+1.995
housing+1.975
 disrupted+1.956
Module+1.921
Neg logit contributions
phabet-2.597
 Ital-2.590
 Alphabet-2.325
 Fall-2.318
atron-2.217
Token be
Token ID307
Feature activation+7.59e-01

Pos logit contributions
ients+2.956
 isol+2.640
 disrupted+2.590
housing+2.581
 extrad+2.529
Neg logit contributions
 Ital-3.697
phabet-3.671
 Fall-3.270
 Alphabet-3.252
istine-3.140
Token reached
Token ID4251
Feature activation+4.44e-01

Pos logit contributions
ients+3.557
 isol+3.293
 disrupted+3.290
 extrad+3.201
 appealed+3.136
Neg logit contributions
phabet-4.702
 Ital-4.602
 Alphabet-4.041
 Fall-3.984
istine-3.957
Token at
Token ID379
Feature activation+5.57e-01

Pos logit contributions
ients+1.783
 isol+1.603
housing+1.598
 disrupted+1.519
Module+1.478
Neg logit contributions
 Ital-3.025
phabet-3.016
 Fall-2.687
 Alphabet-2.681
azon-2.589
Token j
Token ID474
Feature activation+7.18e-01

Pos logit contributions
ients+3.136
 isol+2.909
 disrupted+2.829
housing+2.812
 extrad+2.773
Neg logit contributions
phabet-2.884
 Ital-2.859
 Fall-2.549
 Alphabet-2.527
atron-2.381
Tokenon
Token ID261
Feature activation+1.58e+00

Pos logit contributions
ients+4.452
housing+4.091
 isol+3.865
busters+3.848
Module+3.828
Neg logit contributions
 Ital-3.462
phabet-3.334
 Fall-3.028
 Alphabet-3.009
 Juda-2.715
Token.
Token ID13
Feature activation+8.83e-01

Pos logit contributions
ients+6.230
housing+5.366
 isol+5.069
ector+5.064
busters+5.048
Neg logit contributions
 Ital-10.401
phabet-10.148
 Fall-9.539
 Alphabet-9.519
 Juda-9.119
Tokenchest
Token ID46713
Feature activation-3.95e-02

Pos logit contributions
ients+3.240
housing+2.798
busters+2.585
 isol+2.532
schild+2.505
Neg logit contributions
 Ital-6.431
phabet-6.300
 Fall-6.033
 Alphabet-5.919
istine-5.582
Tokeno
Token ID78
Feature activation+3.46e-01

Pos logit contributions
 Ital+0.218
phabet+0.216
 Fall+0.195
 Alphabet+0.193
istine+0.180
Neg logit contributions
ients-0.216
 isol-0.192
housing-0.192
 disrupted-0.188
Module-0.187
Token@
Token ID31
Feature activation+6.68e-01

Pos logit contributions
ients+1.829
 isol+1.639
housing+1.630
 disrupted+1.608
Module+1.588
Neg logit contributions
 Ital-1.962
phabet-1.945
 Fall-1.749
 Alphabet-1.737
istine-1.632
Tokengl
Token ID4743
Feature activation+9.61e-02

Pos logit contributions
ients+3.739
 isol+3.438
housing+3.408
 disrupted+3.357
Module+3.268
Neg logit contributions
 Ital-3.454
phabet-3.431
 Alphabet-3.086
 Fall-3.015
atron-2.824
Token 30
Token ID1542
Feature activation+1.80e-01

Pos logit contributions
ients+1.763
 isol+1.608
housing+1.559
 disrupted+1.557
Module+1.529
Neg logit contributions
phabet-2.100
 Ital-2.040
 Alphabet-1.848
 Fall-1.839
istine-1.742
Token,
Token ID11
Feature activation+3.34e-01

Pos logit contributions
ients+0.999
 isol+0.897
housing+0.888
 disrupted+0.879
Module+0.874
Neg logit contributions
 Ital-0.967
phabet-0.964
 Fall-0.864
 Alphabet-0.854
istine-0.808
Token or
Token ID393
Feature activation+3.96e-01

Pos logit contributions
ients+1.643
 isol+1.480
 disrupted+1.452
housing+1.430
Module+1.406
Neg logit contributions
phabet-2.010
 Ital-1.960
 Alphabet-1.786
 Fall-1.782
istine-1.692
Token 100
Token ID1802
Feature activation+4.54e-01

Pos logit contributions
ients+2.002
 isol+1.807
 disrupted+1.778
housing+1.745
Module+1.716
Neg logit contributions
phabet-2.354
 Ital-2.300
 Alphabet-2.070
 Fall-2.065
istine-1.975
Tokenmg
Token ID11296
Feature activation+7.47e-01

Pos logit contributions
ients+2.383
 isol+2.138
housing+2.078
 disrupted+2.076
Module+2.055
Neg logit contributions
phabet-2.583
 Ital-2.579
 Fall-2.297
 Alphabet-2.279
atron-2.191
Token/
Token ID14
Feature activation+1.57e+00

Pos logit contributions
ients+3.819
 isol+3.467
 disrupted+3.377
housing+3.298
Module+3.281
Neg logit contributions
 Ital-4.242
phabet-4.234
 Alphabet-3.764
 Fall-3.763
istine-3.621
Tokenkg
Token ID10025
Feature activation+7.20e-01

Pos logit contributions
ients+5.993
housing+5.344
Module+5.240
 isol+5.103
 disrupted+5.070
Neg logit contributions
 Ital-10.422
phabet-10.304
 Alphabet-9.452
 Fall-9.153
atron-8.809
Token)
Token ID8
Feature activation+3.36e-02

Pos logit contributions
ients+2.544
 isol+2.235
 disrupted+2.173
housing+2.107
Module+2.036
Neg logit contributions
phabet-5.213
 Ital-5.177
 Alphabet-4.764
 Fall-4.730
atron-4.585
Token also
Token ID635
Feature activation-1.62e-01

Pos logit contributions
ients+0.143
 isol+0.125
 disrupted+0.123
housing+0.123
Module+0.119
Neg logit contributions
phabet-0.225
 Ital-0.225
 Fall-0.205
 Alphabet-0.204
istine-0.195
Token showed
Token ID3751
Feature activation+2.59e-01

Pos logit contributions
 Ital+1.138
phabet+1.134
 Fall+1.046
 Alphabet+1.035
istine+0.993
Neg logit contributions
ients-0.633
 isol-0.538
housing-0.532
 disrupted-0.523
Module-0.520
Token an
Token ID281
Feature activation-3.53e-01

Pos logit contributions
ients+1.197
 isol+1.075
 disrupted+1.070
housing+1.039
Module+1.017
Neg logit contributions
phabet-1.638
 Ital-1.615
 Alphabet-1.467
 Fall-1.462
istine-1.408
Token his
Token ID465
Feature activation+6.46e-02

Pos logit contributions
ients+0.169
 isol+0.152
 disrupted+0.149
housing+0.149
Module+0.146
Neg logit contributions
phabet-0.175
 Ital-0.172
 Fall-0.155
 Alphabet-0.154
istine-0.147
Token fel
Token ID10756
Feature activation+4.17e-01

Pos logit contributions
ients+0.339
 isol+0.305
 disrupted+0.299
housing+0.296
Module+0.290
Neg logit contributions
phabet-0.371
 Ital-0.361
 Fall-0.326
 Alphabet-0.324
istine-0.312
Token­
Token ID3907
Feature activation+1.03e-01

Pos logit contributions
ients+1.684
 isol+1.398
housing+1.364
 disrupted+1.350
 appealed+1.309
Neg logit contributions
phabet-2.914
 Ital-2.829
 Alphabet-2.639
 Fall-2.597
istine-2.480
Tokenlow
Token ID9319
Feature activation-5.49e-03

Pos logit contributions
ients+0.552
housing+0.484
 isol+0.481
Module+0.468
 disrupted+0.464
Neg logit contributions
phabet-0.585
 Ital-0.581
 Fall-0.526
 Alphabet-0.525
atron-0.485
Token Demo
Token ID34588
Feature activation+2.86e-01

Pos logit contributions
phabet+0.033
 Ital+0.032
 Fall+0.029
 Alphabet+0.029
istine+0.028
Neg logit contributions
ients-0.028
 isol-0.024
housing-0.024
 disrupted-0.024
Module-0.023
Token­
Token ID3907
Feature activation+1.56e+00

Pos logit contributions
ients+1.135
housing+0.937
 isol+0.933
 disrupted+0.923
Module+0.917
Neg logit contributions
phabet-2.026
 Ital-1.979
 Alphabet-1.804
 Fall-1.779
istine-1.757
Tokencr
Token ID6098
Feature activation+5.66e-01

Pos logit contributions
ients+4.832
housing+4.078
ector+3.815
Module+3.769
 isol+3.509
Neg logit contributions
phabet-11.964
 Ital-11.786
 Alphabet-11.188
 Fall-10.497
atron-10.368
Tokenats
Token ID1381
Feature activation+1.04e-01

Pos logit contributions
ients+2.895
housing+2.533
 isol+2.520
Module+2.509
ector+2.484
Neg logit contributions
phabet-3.239
 Ital-3.211
 Alphabet-2.909
 Fall-2.861
handed-2.607
Token.
Token ID13
Feature activation-2.59e-01

Pos logit contributions
ients+0.537
 isol+0.479
housing+0.472
 disrupted+0.470
Module+0.457
Neg logit contributions
phabet-0.595
 Ital-0.590
 Alphabet-0.532
 Fall-0.531
istine-0.502
Token At
Token ID1629
Feature activation+3.43e-01

Pos logit contributions
phabet+1.393
 Ital+1.382
 Fall+1.236
 Alphabet+1.225
istine+1.164
Neg logit contributions
ients-1.442
 isol-1.294
housing-1.283
 disrupted-1.271
Module-1.265
Token the
Token ID262
Feature activation-5.93e-01

Pos logit contributions
ients+1.757
 isol+1.564
housing+1.552
 disrupted+1.549
Module+1.511
Neg logit contributions
phabet-1.986
 Ital-1.926
 Alphabet-1.733
 Fall-1.720
istine-1.667
Token
Token ID198
Feature activation+3.80e-01

Pos logit contributions
ients+2.686
Module+2.431
housing+2.353
"),+2.312
 isol+2.302
Neg logit contributions
phabet-4.295
 Ital-4.157
 Fall-3.651
 Alphabet-3.632
istine-3.607
Tokenreddit
Token ID10748
Feature activation+1.44e-01

Pos logit contributions
ients+1.965
 isol+1.744
housing+1.729
Module+1.713
 disrupted+1.710
Neg logit contributions
 Ital-2.210
phabet-2.168
 Fall-1.977
 Alphabet-1.939
istine-1.872
Token
Token ID198
Feature activation+8.28e-01

Pos logit contributions
ients+0.716
 isol+0.634
housing+0.629
 disrupted+0.620
Module+0.619
Neg logit contributions
phabet-0.859
 Ital-0.858
 Fall-0.772
 Alphabet-0.760
istine-0.728
Token
Token ID198
Feature activation+5.93e-01

Pos logit contributions
ients+3.250
Module+2.925
housing+2.913
"),+2.848
 isol+2.812
Neg logit contributions
phabet-5.734
 Ital-5.485
 Alphabet-4.807
 Fall-4.807
istine-4.776
TokenFacebook
Token ID12025
Feature activation-7.14e-02

Pos logit contributions
ients+3.521
housing+3.271
 disrupted+3.236
 isol+3.215
Module+3.172
Neg logit contributions
 Ital-2.894
phabet-2.787
 Fall-2.570
 Alphabet-2.437
istine-2.361
Token
Token ID198
Feature activation+1.56e+00

Pos logit contributions
 Ital+0.406
phabet+0.405
 Fall+0.365
 Alphabet+0.361
istine+0.342
Neg logit contributions
ients-0.375
 isol-0.334
housing-0.331
 disrupted-0.328
Module-0.325
Token
Token ID198
Feature activation+9.93e-01

Pos logit contributions
ients+4.628
"),+4.476
Module+4.371
'),+4.163
")+4.100
Neg logit contributions
phabet-11.590
 Ital-10.811
 oun-10.404
��-10.171
 occas-9.919
TokenSt
Token ID1273
Feature activation+5.32e-01

Pos logit contributions
ients+3.895
Module+3.649
housing+3.532
 disrupted+3.457
 isol+3.456
Neg logit contributions
 Ital-6.757
phabet-6.373
 Fall-6.004
 Alphabet-5.860
istine-5.696
Tokenumble
Token ID10344
Feature activation-2.11e-01

Pos logit contributions
ients+2.945
 isol+2.667
 disrupted+2.650
 appealed+2.617
"),+2.576
Neg logit contributions
 Ital-2.711
phabet-2.627
 Fall-2.421
atron-2.303
 Alphabet-2.297
TokenUpon
Token ID23792
Feature activation+7.60e-01

Pos logit contributions
 Ital+1.245
phabet+1.243
 Fall+1.126
 Alphabet+1.112
istine+1.058
Neg logit contributions
ients-1.064
 isol-0.945
housing-0.936
 disrupted-0.927
Module-0.920
Token What
Token ID1867
Feature activation+3.67e-01

Pos logit contributions
ients+3.093
 isol+2.715
 disrupted+2.625
housing+2.612
Module+2.532
Neg logit contributions
 Ital-5.186
phabet-5.183
 Fall-4.740
 Alphabet-4.669
istine-4.522
Token and
Token ID290
Feature activation-2.04e-01

Pos logit contributions
 Ital+0.620
phabet+0.618
 Fall+0.543
 Alphabet+0.535
istine+0.500
Neg logit contributions
ients-0.853
 isol-0.776
housing-0.770
 disrupted-0.764
Module-0.759
Token 16
Token ID1467
Feature activation+4.06e-01

Pos logit contributions
 Ital+1.208
phabet+1.201
 Fall+1.094
 Alphabet+1.079
istine+1.023
Neg logit contributions
ients-1.023
 isol-0.901
housing-0.898
 disrupted-0.887
Module-0.881
Token-
Token ID12
Feature activation+7.29e-02

Pos logit contributions
ients+1.665
 isol+1.453
housing+1.437
Module+1.415
 disrupted+1.386
Neg logit contributions
phabet-2.780
 Ital-2.717
 Alphabet-2.456
 Fall-2.449
istine-2.371
Token35
Token ID2327
Feature activation-2.01e-01

Pos logit contributions
ients+0.404
housing+0.365
 isol+0.364
Module+0.359
 disrupted+0.354
Neg logit contributions
phabet-0.395
 Ital-0.393
 Fall-0.348
 Alphabet-0.345
istine-0.327
Token f
Token ID277
Feature activation+2.31e-01

Pos logit contributions
 Ital+0.952
phabet+0.948
 Fall+0.837
 Alphabet+0.825
istine+0.772
Neg logit contributions
ients-1.252
 isol-1.136
housing-1.127
 disrupted-1.118
Module-1.111
Token/
Token ID14
Feature activation+1.54e+00

Pos logit contributions
ients+1.235
 isol+1.093
housing+1.084
Module+1.079
 disrupted+1.048
Neg logit contributions
phabet-1.319
 Ital-1.308
 Alphabet-1.149
 Fall-1.141
istine-1.093
Token2
Token ID17
Feature activation+9.95e-01

Pos logit contributions
ients+6.404
Module+5.600
housing+5.567
 isol+5.480
 disrupted+5.298
Neg logit contributions
 Ital-9.658
phabet-9.642
 Alphabet-8.835
 Fall-8.605
istine-8.383
Token.
Token ID13
Feature activation+1.73e-01

Pos logit contributions
ients+4.425
 isol+3.922
housing+3.758
 disrupted+3.749
Module+3.674
Neg logit contributions
phabet-6.291
 Ital-6.249
 Alphabet-5.724
 Fall-5.674
istine-5.452
Token8
Token ID23
Feature activation+1.98e-01

Pos logit contributions
ients+0.884
 isol+0.787
housing+0.779
Module+0.779
 disrupted+0.765
Neg logit contributions
phabet-1.024
 Ital-0.990
 Alphabet-0.886
 Fall-0.882
istine-0.854
Token II
Token ID2873
Feature activation+2.54e-01

Pos logit contributions
ients+0.980
 isol+0.878
housing+0.849
 disrupted+0.844
Module+0.835
Neg logit contributions
phabet-1.183
 Ital-1.177
 Fall-1.066
 Alphabet-1.062
istine-1.012
Token,
Token ID11
Feature activation+3.17e-02

Pos logit contributions
ients+1.321
 isol+1.230
housing+1.160
 disrupted+1.148
Module+1.125
Neg logit contributions
phabet-1.436
 Ital-1.423
 Alphabet-1.282
 Fall-1.280
atron-1.215
Token
Token ID198
Feature activation+3.27e-01

Pos logit contributions
ients+2.748
Module+2.490
housing+2.411
"),+2.376
 isol+2.365
Neg logit contributions
phabet-4.375
 Ital-4.212
 Fall-3.698
 Alphabet-3.685
istine-3.663
Tokenreddit
Token ID10748
Feature activation+1.85e-01

Pos logit contributions
ients+1.748
 isol+1.558
housing+1.546
 disrupted+1.531
Module+1.524
Neg logit contributions
 Ital-1.831
phabet-1.817
 Fall-1.641
 Alphabet-1.618
istine-1.540
Token
Token ID198
Feature activation+8.21e-01

Pos logit contributions
ients+0.921
 isol+0.813
housing+0.809
 disrupted+0.797
Module+0.796
Neg logit contributions
phabet-1.105
 Ital-1.104
 Fall-0.990
 Alphabet-0.977
istine-0.935
Token
Token ID198
Feature activation+5.66e-01

Pos logit contributions
ients+3.251
Module+2.934
housing+2.909
"),+2.858
 isol+2.813
Neg logit contributions
phabet-5.678
 Ital-5.427
 Alphabet-4.757
 Fall-4.747
istine-4.727
TokenFacebook
Token ID12025
Feature activation+3.53e-02

Pos logit contributions
ients+3.440
housing+3.188
 disrupted+3.153
 isol+3.131
Module+3.101
Neg logit contributions
 Ital-2.688
phabet-2.610
 Fall-2.380
 Alphabet-2.266
istine-2.195
Token
Token ID198
Feature activation+1.53e+00

Pos logit contributions
ients+0.182
 isol+0.162
housing+0.161
 disrupted+0.159
Module+0.158
Neg logit contributions
 Ital-0.204
phabet-0.203
 Fall-0.183
 Alphabet-0.180
istine-0.172
Token
Token ID198
Feature activation+9.67e-01

Pos logit contributions
ients+4.610
"),+4.436
Module+4.345
'),+4.150
")+4.065
Neg logit contributions
phabet-11.410
 Ital-10.607
 oun-10.170
��-9.960
 occas-9.784
TokenSt
Token ID1273
Feature activation+6.02e-01

Pos logit contributions
ients+3.965
Module+3.653
housing+3.538
 disrupted+3.486
 isol+3.462
Neg logit contributions
 Ital-6.460
phabet-6.158
 Fall-5.767
 Alphabet-5.625
istine-5.494
Tokenumble
Token ID10344
Feature activation-2.71e-01

Pos logit contributions
ients+3.281
 disrupted+2.962
 isol+2.944
 appealed+2.924
"),+2.869
Neg logit contributions
 Ital-3.100
phabet-3.019
 Fall-2.762
atron-2.671
llah-2.647
TokenUpon
Token ID23792
Feature activation+8.06e-01

Pos logit contributions
 Ital+1.584
phabet+1.579
 Fall+1.430
 Alphabet+1.413
istine+1.342
Neg logit contributions
ients-1.379
 isol-1.221
housing-1.210
 disrupted-1.198
Module-1.189
Token What
Token ID1867
Feature activation+3.87e-01

Pos logit contributions
ients+3.301
 isol+2.881
 disrupted+2.789
housing+2.784
Module+2.705
Neg logit contributions
 Ital-5.495
phabet-5.484
 Fall-5.023
 Alphabet-4.948
istine-4.788
Token we
Token ID356
Feature activation+1.02e+00

Pos logit contributions
ients+0.778
 isol+0.676
 disrupted+0.658
housing+0.658
Module+0.646
Neg logit contributions
phabet-1.301
 Ital-1.293
 Alphabet-1.180
 Fall-1.172
istine-1.125
Token
Token ID447
Feature activation+6.59e-01

Pos logit contributions
ients+3.184
housing+2.760
 disrupted+2.734
 isol+2.726
 extrad+2.711
Neg logit contributions
phabet-7.566
 Ital-7.483
 Alphabet-6.860
 Fall-6.609
atron-6.491
Token
Token ID247
Feature activation+1.06e+00

Pos logit contributions
ients+4.032
Module+3.711
housing+3.689
 isol+3.687
 extrad+3.606
Neg logit contributions
 Ital-2.981
phabet-2.942
 Fall-2.518
 Alphabet-2.517
istine-2.510
Tokenll
Token ID297
Feature activation+6.24e-01

Pos logit contributions
ients+5.923
 isol+5.270
housing+5.256
ector+5.222
 disrupted+5.149
Neg logit contributions
phabet-5.462
 Ital-5.416
 Alphabet-4.751
 Fall-4.619
atron-4.454
Token send
Token ID3758
Feature activation+9.19e-01

Pos logit contributions
ients+2.843
 extrad+2.532
 isol+2.491
housing+2.408
 disrupted+2.363
Neg logit contributions
phabet-4.074
 Ital-3.987
 Alphabet-3.569
 Fall-3.520
istine-3.425
Token you
Token ID345
Feature activation+1.50e+00

Pos logit contributions
ients+3.941
 isol+3.310
housing+3.270
 extrad+3.253
 disrupted+3.216
Neg logit contributions
phabet-6.172
 Ital-6.009
 Alphabet-5.278
istine-5.195
 Fall-5.183
Token three
Token ID1115
Feature activation+4.58e-01

Pos logit contributions
ients+2.672
 disrupted+2.035
 isol+1.978
 extrad+1.894
 terr+1.827
Neg logit contributions
phabet-13.075
 Ital-12.859
itas-11.762
 Alphabet-11.673
llah-11.670
Token meaningful
Token ID11570
Feature activation+3.20e-01

Pos logit contributions
ients+1.825
 isol+1.608
 disrupted+1.543
housing+1.538
Module+1.515
Neg logit contributions
 Ital-3.186
phabet-3.139
 Fall-2.886
 Alphabet-2.830
istine-2.778
Token actions
Token ID4028
Feature activation+4.56e-01

Pos logit contributions
ients+1.826
 isol+1.657
housing+1.603
 disrupted+1.587
Module+1.571
Neg logit contributions
phabet-1.705
 Ital-1.673
 Alphabet-1.471
 Fall-1.471
atron-1.415
Token you
Token ID345
Feature activation+2.75e-01

Pos logit contributions
ients+2.162
 isol+1.912
housing+1.861
Module+1.852
 disrupted+1.840
Neg logit contributions
phabet-2.882
 Ital-2.821
 Alphabet-2.494
 Fall-2.491
istine-2.417
Token can
Token ID460
Feature activation+4.15e-02

Pos logit contributions
ients+1.164
 isol+1.017
housing+0.986
 disrupted+0.984
Module+0.973
Neg logit contributions
phabet-1.862
 Ital-1.851
 Fall-1.682
 Alphabet-1.670
istine-1.595
Token weird
Token ID7650
Feature activation-5.64e-02

Pos logit contributions
ients+0.143
 isol+0.130
 disrupted+0.127
housing+0.127
Module+0.124
Neg logit contributions
phabet-0.157
 Ital-0.156
 Alphabet-0.138
 Fall-0.138
istine-0.133
Token,
Token ID11
Feature activation+3.32e-01

Pos logit contributions
phabet+0.323
 Ital+0.323
 Fall+0.289
 Alphabet+0.287
istine+0.274
Neg logit contributions
ients-0.290
 isol-0.262
housing-0.260
 disrupted-0.255
Module-0.251
Token slightly
Token ID4622
Feature activation+2.77e-01

Pos logit contributions
ients+1.653
 isol+1.540
 disrupted+1.490
housing+1.458
 extrad+1.447
Neg logit contributions
phabet-1.936
 Ital-1.910
 Fall-1.704
 Alphabet-1.700
istine-1.645
Token silly
Token ID14397
Feature activation-3.01e-01

Pos logit contributions
ients+1.365
 isol+1.265
 disrupted+1.233
housing+1.204
 extrad+1.187
Neg logit contributions
phabet-1.643
 Ital-1.628
 Fall-1.454
 Alphabet-1.450
istine-1.395
Token sci
Token ID20681
Feature activation-2.81e-01

Pos logit contributions
 Ital+1.665
phabet+1.665
 Fall+1.486
 Alphabet+1.472
istine+1.398
Neg logit contributions
ients-1.624
 isol-1.464
housing-1.449
 disrupted-1.434
Module-1.415
Token-
Token ID12
Feature activation+1.48e+00

Pos logit contributions
 Ital+1.682
phabet+1.671
 Fall+1.529
 Alphabet+1.504
istine+1.433
Neg logit contributions
ients-1.383
 isol-1.215
housing-1.210
 disrupted-1.194
Module-1.176
Tokenfi
Token ID12463
Feature activation+2.87e-01

Pos logit contributions
ients+4.552
housing+4.070
 isol+3.956
operative+3.851
busters+3.706
Neg logit contributions
 Ital-10.384
phabet-10.347
 Alphabet-9.610
 Fall-9.439
istine-9.148
Token mil
Token ID1465
Feature activation+2.52e-01

Pos logit contributions
ients+1.442
 isol+1.343
housing+1.305
 disrupted+1.300
 theorist+1.285
Neg logit contributions
phabet-1.657
 Ital-1.632
 Alphabet-1.439
 Fall-1.434
istine-1.389
Tokenieu
Token ID22304
Feature activation+3.92e-02

Pos logit contributions
ients+1.471
housing+1.333
 isol+1.320
 disrupted+1.304
Module+1.291
Neg logit contributions
phabet-1.273
 Ital-1.261
 Alphabet-1.110
 Fall-1.109
istine-1.028
Token,
Token ID11
Feature activation-1.94e-01

Pos logit contributions
ients+0.210
 isol+0.189
housing+0.186
 disrupted+0.185
Module+0.182
Neg logit contributions
phabet-0.219
 Ital-0.218
 Fall-0.196
 Alphabet-0.193
istine-0.183
Token along
Token ID1863
Feature activation-4.53e-01

Pos logit contributions
 Ital+1.138
phabet+1.134
 Fall+1.027
 Alphabet+1.015
istine+0.964
Neg logit contributions
ients-0.990
 isol-0.878
housing-0.870
 disrupted-0.861
Module-0.854
Token we
Token ID356
Feature activation+1.49e-01

Pos logit contributions
phabet+0.461
 Ital+0.460
 Fall+0.415
 Alphabet+0.411
istine+0.392
Neg logit contributions
ients-0.382
 isol-0.340
housing-0.335
 disrupted-0.333
Module-0.329
Token will
Token ID481
Feature activation+3.28e-01

Pos logit contributions
ients+0.715
 isol+0.639
 disrupted+0.632
housing+0.620
Module+0.612
Neg logit contributions
phabet-0.917
 Ital-0.909
 Alphabet-0.817
 Fall-0.813
istine-0.778
Token take
Token ID1011
Feature activation+5.16e-01

Pos logit contributions
ients+1.672
 isol+1.520
 disrupted+1.490
housing+1.469
 extrad+1.469
Neg logit contributions
phabet-1.950
 Ital-1.901
 Alphabet-1.712
 Fall-1.689
istine-1.626
Token action
Token ID2223
Feature activation+4.35e-01

Pos logit contributions
ients+2.385
 isol+2.244
 disrupted+2.184
 extrad+2.119
 prosecuting+2.112
Neg logit contributions
phabet-3.215
 Ital-3.150
 Alphabet-2.829
 Fall-2.800
istine-2.748
Token against
Token ID1028
Feature activation+7.41e-01

Pos logit contributions
ients+2.247
 isol+2.060
 disrupted+2.023
housing+1.995
 extrad+1.965
Neg logit contributions
phabet-2.510
 Ital-2.461
 Alphabet-2.208
 Fall-2.205
istine-2.094
Token those
Token ID883
Feature activation+1.46e+00

Pos logit contributions
ients+3.375
 isol+3.191
 disrupted+3.100
 terr+2.975
 extrad+2.975
Neg logit contributions
phabet-4.637
 Ital-4.402
 Alphabet-3.985
istine-3.973
 Fall-3.951
Token intent
Token ID6824
Feature activation-3.45e-01

Pos logit contributions
ients+5.469
 isol+5.270
 disrupted+5.219
 appealed+4.917
 terr+4.905
Neg logit contributions
phabet-9.963
 Ital-9.242
istine-8.530
vantage-8.357
 Alphabet-8.340
Token on
Token ID319
Feature activation+2.60e-01

Pos logit contributions
 Ital+1.515
phabet+1.514
 Fall+1.327
 Alphabet+1.290
istine+1.208
Neg logit contributions
ients-2.237
 isol-2.029
housing-2.012
Module-2.008
 disrupted-1.983
Token attempting
Token ID9361
Feature activation+6.02e-02

Pos logit contributions
ients+0.983
 isol+0.898
 disrupted+0.872
housing+0.840
 terr+0.817
Neg logit contributions
phabet-1.867
 Ital-1.828
 Fall-1.673
 Alphabet-1.668
istine-1.626
Token to
Token ID284
Feature activation+9.58e-02

Pos logit contributions
ients+0.361
 isol+0.327
housing+0.324
 disrupted+0.322
Module+0.319
Neg logit contributions
 Ital-0.297
phabet-0.296
 Fall-0.263
 Alphabet-0.259
istine-0.243
Token incite
Token ID49225
Feature activation-2.73e-01

Pos logit contributions
ients+0.393
 isol+0.349
 disrupted+0.344
housing+0.335
Module+0.329
Neg logit contributions
phabet-0.658
 Ital-0.650
 Fall-0.591
 Alphabet-0.590
istine-0.570
Tokenz
Token ID89
Feature activation-1.21e-01

Pos logit contributions
 Ital+0.887
phabet+0.887
 Fall+0.795
 Alphabet+0.787
istine+0.741
Neg logit contributions
ients-0.913
 isol-0.817
housing-0.812
 disrupted-0.803
Module-0.798
Tokenus
Token ID385
Feature activation+2.85e-01

Pos logit contributions
 Ital+0.547
phabet+0.545
 Fall+0.477
 Alphabet+0.472
istine+0.439
Neg logit contributions
ients-0.776
 isol-0.707
housing-0.699
 disrupted-0.694
Module-0.691
Token on
Token ID319
Feature activation-4.55e-02

Pos logit contributions
ients+1.630
 isol+1.483
housing+1.464
 disrupted+1.445
Module+1.444
Neg logit contributions
 Ital-1.461
phabet-1.447
 Fall-1.274
 Alphabet-1.268
istine-1.206
Token Twitter
Token ID3009
Feature activation+3.29e-01

Pos logit contributions
 Ital+0.232
phabet+0.231
 Fall+0.204
 Alphabet+0.200
istine+0.192
Neg logit contributions
ients-0.265
 isol-0.240
housing-0.237
 disrupted-0.236
Module-0.234
Token @
Token ID2488
Feature activation+6.36e-01

Pos logit contributions
ients+1.765
 isol+1.589
housing+1.585
 disrupted+1.555
Module+1.555
Neg logit contributions
 Ital-1.802
phabet-1.794
 Fall-1.582
 Alphabet-1.577
istine-1.524
Tokendan
Token ID25604
Feature activation+1.44e+00

Pos logit contributions
ients+3.094
housing+2.795
Module+2.710
 isol+2.642
 disrupted+2.638
Neg logit contributions
 Ital-3.829
phabet-3.706
 Fall-3.400
 Alphabet-3.379
istine-3.238
Tokenhan
Token ID7637
Feature activation+3.34e-01

Pos logit contributions
ients+6.906
housing+6.362
busters+5.675
 isol+5.653
schild+5.532
Neg logit contributions
 Ital-8.697
phabet-8.057
 Fall-7.728
 Alphabet-7.639
istine-7.119
Tokenz
Token ID89
Feature activation-1.15e-01

Pos logit contributions
ients+1.762
housing+1.574
 isol+1.533
 disrupted+1.518
Module+1.499
Neg logit contributions
 Ital-1.913
phabet-1.856
 Fall-1.704
 Alphabet-1.680
istine-1.588
Tokenus
Token ID385
Feature activation+3.06e-01

Pos logit contributions
 Ital+0.533
phabet+0.525
 Fall+0.463
 Alphabet+0.456
istine+0.426
Neg logit contributions
ients-0.736
 isol-0.666
housing-0.661
 disrupted-0.652
Module-0.649
Token and
Token ID290
Feature activation-1.17e-01

Pos logit contributions
ients+1.503
 isol+1.340
housing+1.336
 disrupted+1.305
Module+1.298
Neg logit contributions
 Ital-1.818
phabet-1.805
 Fall-1.617
 Alphabet-1.610
istine-1.536
Token check
Token ID2198
Feature activation+2.07e-01

Pos logit contributions
 Ital+0.602
phabet+0.597
 Fall+0.529
 Alphabet+0.521
istine+0.499
Neg logit contributions
ients-0.671
 isol-0.609
housing-0.598
 disrupted-0.594
Module-0.591
Token (
Token ID357
Feature activation+3.10e-01

Pos logit contributions
phabet+0.664
 Ital+0.663
 Fall+0.601
 Alphabet+0.596
istine+0.567
Neg logit contributions
ients-0.527
 isol-0.467
 disrupted-0.461
housing-0.460
Module-0.451
Token3
Token ID18
Feature activation+4.61e-01

Pos logit contributions
ients+1.642
housing+1.488
 isol+1.473
Module+1.457
 disrupted+1.441
Neg logit contributions
 Ital-1.750
phabet-1.734
 Alphabet-1.561
 Fall-1.558
istine-1.455
Token or
Token ID393
Feature activation+7.66e-01

Pos logit contributions
ients+2.075
 isol+1.835
housing+1.780
 disrupted+1.770
Module+1.748
Neg logit contributions
 Ital-2.958
phabet-2.952
 Fall-2.663
 Alphabet-2.654
istine-2.558
Token 10
Token ID838
Feature activation+5.11e-01

Pos logit contributions
ients+3.157
 isol+2.802
 disrupted+2.680
housing+2.657
Module+2.571
Neg logit contributions
 Ital-5.114
phabet-5.078
 Alphabet-4.577
 Fall-4.558
istine-4.507
Tokenmg
Token ID11296
Feature activation+5.52e-01

Pos logit contributions
ients+2.183
 isol+1.961
 disrupted+1.880
housing+1.850
Module+1.819
Neg logit contributions
phabet-3.393
 Ital-3.355
 Fall-3.035
 Alphabet-3.025
istine-2.925
Token/
Token ID14
Feature activation+1.43e+00

Pos logit contributions
ients+2.681
 isol+2.434
 disrupted+2.357
housing+2.304
Module+2.290
Neg logit contributions
phabet-3.303
 Ital-3.284
 Alphabet-2.945
 Fall-2.934
istine-2.838
Tokenkg
Token ID10025
Feature activation+3.69e-01

Pos logit contributions
ients+5.768
housing+5.035
Module+5.004
 disrupted+4.867
 isol+4.862
Neg logit contributions
phabet-9.448
 Ital-9.360
 Alphabet-8.418
 Fall-8.202
atron-7.969
Token),
Token ID828
Feature activation-1.10e-01

Pos logit contributions
ients+1.805
 isol+1.639
 disrupted+1.606
housing+1.582
Module+1.559
Neg logit contributions
phabet-2.220
 Ital-2.195
 Alphabet-1.985
 Fall-1.968
istine-1.885
Token but
Token ID475
Feature activation+1.67e-01

Pos logit contributions
phabet+0.650
 Ital+0.647
 Fall+0.581
 Alphabet+0.579
istine+0.549
Neg logit contributions
ients-0.556
 isol-0.501
 disrupted-0.494
housing-0.489
Module-0.477
Token not
Token ID407
Feature activation+2.57e-01

Pos logit contributions
ients+0.841
 isol+0.743
housing+0.736
 disrupted+0.728
Module+0.717
Neg logit contributions
 Ital-1.002
phabet-0.997
 Fall-0.902
 Alphabet-0.900
istine-0.849
Token flu
Token ID6562
Feature activation+8.31e-03

Pos logit contributions
ients+1.321
 isol+1.204
 disrupted+1.183
housing+1.166
Module+1.132
Neg logit contributions
phabet-1.487
 Ital-1.475
 Alphabet-1.318
 Fall-1.313
istine-1.261
Token offers
Token ID4394
Feature activation+4.29e-01

Pos logit contributions
ients+0.351
 isol+0.287
housing+0.279
 disrupted+0.273
Module+0.268
Neg logit contributions
phabet-0.908
 Ital-0.902
 Fall-0.830
 Alphabet-0.829
istine-0.799
Token for
Token ID329
Feature activation+8.43e-01

Pos logit contributions
ients+2.166
 isol+1.924
housing+1.895
Module+1.885
 disrupted+1.879
Neg logit contributions
phabet-2.597
 Ital-2.538
 Alphabet-2.218
 Fall-2.201
istine-2.145
Token programs
Token ID4056
Feature activation+5.66e-01

Pos logit contributions
ients+2.879
 isol+2.407
housing+2.359
 disrupted+2.337
Module+2.207
Neg logit contributions
phabet-6.487
 Ital-6.368
 Alphabet-5.760
 Fall-5.657
istine-5.620
Token that
Token ID326
Feature activation+1.98e-01

Pos logit contributions
ients+2.709
 isol+2.440
 disrupted+2.375
housing+2.349
Module+2.334
Neg logit contributions
phabet-3.558
 Ital-3.439
 Alphabet-3.045
 Fall-3.005
istine-2.929
Token support
Token ID1104
Feature activation+1.08e+00

Pos logit contributions
ients+0.573
 isol+0.472
housing+0.454
 disrupted+0.453
Module+0.439
Neg logit contributions
phabet-1.612
 Ital-1.591
 Alphabet-1.475
 Fall-1.466
istine-1.409
Token The
Token ID383
Feature activation+1.42e+00

Pos logit contributions
ients+4.503
housing+4.061
Module+4.006
busters+3.975
 isol+3.969
Neg logit contributions
phabet-6.986
 Ital-6.623
 Alphabet-5.961
 Fall-5.931
azon-5.862
Token Nation
Token ID8741
Feature activation+6.15e-01

Pos logit contributions
ients+7.624
Module+7.266
busters+6.875
"),+6.863
 disrupted+6.830
Neg logit contributions
phabet-7.015
 Ital-6.276
atron-5.521
[undecodable bytes]-5.512
istine-5.419
Token
Token ID447
Feature activation+1.07e-01

Pos logit contributions
ients+2.956
 isol+2.662
 disrupted+2.587
housing+2.582
 extrad+2.496
Neg logit contributions
phabet-3.763
 Ital-3.683
 Alphabet-3.284
 Fall-3.272
istine-3.113
Token
Token ID247
Feature activation+6.75e-02

Pos logit contributions
ients+0.728
housing+0.664
 isol+0.664
Module+0.659
 disrupted+0.643
Neg logit contributions
 Ital-0.443
phabet-0.437
 Fall-0.376
 Alphabet-0.369
istine-0.351
Tokens
Token ID82
Feature activation+7.69e-01

Pos logit contributions
ients+0.394
housing+0.353
 isol+0.349
Module+0.344
 disrupted+0.340
Neg logit contributions
phabet-0.351
 Ital-0.349
 Fall-0.305
 Alphabet-0.304
istine-0.288
Token journalism
Token ID14033
Feature activation+3.67e-01

Pos logit contributions
ients+1.564
housing+1.110
 isol+1.092
 disrupted+1.068
Module+1.027
Neg logit contributions
phabet-6.795
 Ital-6.793
 Fall-6.287
 Alphabet-6.236
istine-6.095
Token or
Token ID393
Feature activation+1.47e-01

Pos logit contributions
 Ital+0.307
phabet+0.307
 Fall+0.269
 Alphabet+0.266
istine+0.248
Neg logit contributions
ients-0.418
 isol-0.381
housing-0.377
 disrupted-0.374
Module-0.372
Token 16
Token ID1467
Feature activation+3.23e-01

Pos logit contributions
ients+0.731
 isol+0.645
housing+0.640
 disrupted+0.633
Module+0.628
Neg logit contributions
 Ital-0.882
phabet-0.879
 Fall-0.798
 Alphabet-0.789
istine-0.750
Token-
Token ID12
Feature activation-1.07e-01

Pos logit contributions
ients+1.357
 isol+1.192
housing+1.164
Module+1.145
 disrupted+1.143
Neg logit contributions
phabet-2.190
 Ital-2.140
 Alphabet-1.937
 Fall-1.933
istine-1.864
Token35
Token ID2327
Feature activation-2.07e-01

Pos logit contributions
 Ital+0.542
phabet+0.540
 Fall+0.482
 Alphabet+0.476
istine+0.447
Neg logit contributions
ients-0.623
 isol-0.561
housing-0.556
 disrupted-0.552
Module-0.547
Token f
Token ID277
Feature activation+2.06e-01

Pos logit contributions
 Ital+0.938
phabet+0.934
 Fall+0.820
 Alphabet+0.808
istine+0.753
Neg logit contributions
ients-1.323
 isol-1.203
housing-1.195
 disrupted-1.185
Module-1.178
Token/
Token ID14
Feature activation+1.41e+00

Pos logit contributions
ients+1.093
 isol+0.969
Module+0.966
housing+0.965
 disrupted+0.934
Neg logit contributions
phabet-1.177
 Ital-1.160
 Alphabet-1.018
 Fall-1.011
istine-0.971
Token2
Token ID17
Feature activation+9.40e-01

Pos logit contributions
ients+4.802
Module+4.616
housing+4.399
"),+4.177
'),+4.102
Neg logit contributions
phabet-10.065
 Ital-9.596
 Alphabet-8.675
istine-8.466
[undecodable bytes]-8.424
Token.
Token ID13
Feature activation+1.84e-01

Pos logit contributions
ients+3.527
 isol+3.162
Module+3.033
housing+3.000
'),+2.990
Neg logit contributions
phabet-6.616
 Ital-6.255
 Alphabet-5.728
istine-5.615
 Fall-5.582
Token8
Token ID23
Feature activation+2.35e-01

Pos logit contributions
ients+0.943
 isol+0.839
housing+0.834
Module+0.832
 disrupted+0.819
Neg logit contributions
phabet-1.086
 Ital-1.050
 Alphabet-0.940
 Fall-0.935
istine-0.906
Token II
Token ID2873
Feature activation+2.54e-01

Pos logit contributions
ients+1.106
 isol+0.996
housing+0.951
 disrupted+0.949
Module+0.937
Neg logit contributions
phabet-1.457
 Ital-1.443
 Alphabet-1.309
 Fall-1.307
istine-1.246
Token lens
Token ID10317
Feature activation-2.22e-01

Pos logit contributions
ients+1.265
 isol+1.173
housing+1.121
 disrupted+1.111
Module+1.082
Neg logit contributions
phabet-1.502
 Ital-1.481
 Alphabet-1.327
 Fall-1.323
istine-1.256
Token.
Token ID13
Feature activation+2.05e-01

Pos logit contributions
ients+0.403
 isol+0.361
 disrupted+0.356
housing+0.355
Module+0.345
Neg logit contributions
phabet-0.478
 Ital-0.476
 Fall-0.426
 Alphabet-0.424
istine-0.406
Token Carlton
Token ID44348
Feature activation+9.28e-02

Pos logit contributions
ients+1.077
 isol+0.965
housing+0.960
Module+0.954
 disrupted+0.945
Neg logit contributions
phabet-1.166
 Ital-1.122
 Alphabet-0.992
 Fall-0.991
istine-0.970
Token
Token ID447
Feature activation+1.89e-01

Pos logit contributions
ients+0.471
 isol+0.419
housing+0.412
 disrupted+0.411
Module+0.404
Neg logit contributions
phabet-0.545
 Ital-0.542
 Fall-0.487
 Alphabet-0.485
istine-0.463
Token
Token ID247
Feature activation+7.80e-02

Pos logit contributions
ients+1.257
 isol+1.153
housing+1.146
Module+1.134
 disrupted+1.126
Neg logit contributions
 Ital-0.807
phabet-0.798
 Fall-0.691
 Alphabet-0.682
istine-0.646
Tokens
Token ID82
Feature activation+2.48e-01

Pos logit contributions
ients+0.465
 isol+0.419
housing+0.417
 disrupted+0.409
Module+0.407
Neg logit contributions
 Ital-0.391
phabet-0.389
 Fall-0.344
 Alphabet-0.344
istine-0.320
Token Mick
Token ID32722
Feature activation+1.41e+00

Pos logit contributions
ients+1.270
 isol+1.144
 disrupted+1.123
housing+1.114
Module+1.092
Neg logit contributions
phabet-1.445
 Ital-1.431
 Fall-1.273
 Alphabet-1.273
istine-1.217
Token M
Token ID337
Feature activation+9.16e-01

Pos logit contributions
ients+5.593
 disrupted+4.889
 isol+4.858
 terr+4.687
 appealed+4.677
Neg logit contributions
phabet-8.923
 Ital-8.705
 Alphabet-7.899
istine-7.751
azon-7.540
Tokenalth
Token ID1094
Feature activation+9.45e-01

Pos logit contributions
ients+3.233
housing+2.562
 isol+2.480
ector+2.470
Module+2.388
Neg logit contributions
 Ital-6.708
phabet-6.570
 Alphabet-6.128
 Fall-6.032
 Juda-5.748
Tokenouse
Token ID1076
Feature activation+6.86e-01

Pos logit contributions
ients+3.019
housing+2.470
 isol+2.406
"),+2.272
ector+2.253
Neg logit contributions
 Ital-7.186
phabet-7.125
 Fall-6.548
 Alphabet-6.530
istine-6.197
Token,
Token ID11
Feature activation+2.66e-01

Pos logit contributions
ients+3.383
 isol+3.103
 disrupted+3.003
housing+2.975
 terr+2.932
Neg logit contributions
phabet-4.061
 Ital-3.950
 Alphabet-3.549
 Fall-3.533
istine-3.375
Token silver
Token ID8465
Feature activation+4.77e-01

Pos logit contributions
ients+1.410
 isol+1.280
 disrupted+1.257
housing+1.241
Module+1.214
Neg logit contributions
phabet-1.499
 Ital-1.472
 Fall-1.311
 Alphabet-1.308
istine-1.257
Token the
Token ID262
Feature activation+1.10e-01

Pos logit contributions
phabet+0.099
 Ital+0.098
 Fall+0.087
 Alphabet+0.087
istine+0.083
Neg logit contributions
ients-0.090
 isol-0.082
housing-0.079
 disrupted-0.079
Module-0.078
Token 16
Token ID1467
Feature activation+1.11e-01

Pos logit contributions
ients+0.572
 isol+0.523
 disrupted+0.506
housing+0.506
Module+0.495
Neg logit contributions
phabet-0.624
 Ital-0.621
 Fall-0.553
 Alphabet-0.550
istine-0.526
Token-
Token ID12
Feature activation+9.74e-02

Pos logit contributions
ients+0.544
 isol+0.475
housing+0.474
 disrupted+0.471
Module+0.465
Neg logit contributions
 Ital-0.670
phabet-0.667
 Fall-0.610
 Alphabet-0.600
istine-0.570
Token35
Token ID2327
Feature activation-1.91e-01

Pos logit contributions
ients+0.581
 isol+0.522
 disrupted+0.518
housing+0.517
Module+0.509
Neg logit contributions
 Ital-0.485
phabet-0.484
 Fall-0.431
 Alphabet-0.424
istine-0.399
Token f
Token ID277
Feature activation+3.97e-01

Pos logit contributions
 Ital+0.851
phabet+0.846
 Fall+0.746
 Alphabet+0.729
istine+0.680
Neg logit contributions
ients-1.242
 isol-1.125
housing-1.125
 disrupted-1.115
Module-1.109
Token/
Token ID14
Feature activation+1.41e+00

Pos logit contributions
ients+2.024
 isol+1.798
Module+1.793
housing+1.790
'),+1.737
Neg logit contributions
phabet-2.338
 Ital-2.313
 Alphabet-2.023
 Fall-1.996
istine-1.938
Token4
Token ID19
Feature activation+4.54e-01

Pos logit contributions
ients+5.664
Module+5.060
housing+5.003
 isol+4.873
 disrupted+4.695
Neg logit contributions
 Ital-9.083
phabet-9.055
 Alphabet-8.265
 Fall-8.105
istine-7.905
Token IS
Token ID3180
Feature activation-1.21e-02

Pos logit contributions
ients+2.483
 isol+2.281
 disrupted+2.200
housing+2.183
Module+2.159
Neg logit contributions
phabet-2.441
 Ital-2.413
 Alphabet-2.148
 Fall-2.135
istine-2.038
Token lens
Token ID10317
Feature activation-8.56e-02

Pos logit contributions
phabet+0.069
 Ital+0.069
 Fall+0.061
 Alphabet+0.061
istine+0.058
Neg logit contributions
ients-0.063
 isol-0.057
housing-0.056
 disrupted-0.056
Module-0.055
Token is
Token ID318
Feature activation-1.87e-01

Pos logit contributions
phabet+0.512
 Ital+0.511
 Fall+0.460
 Alphabet+0.458
istine+0.434
Neg logit contributions
ients-0.424
 isol-0.382
housing-0.375
 disrupted-0.370
Module-0.365
Token sharper
Token ID41415
Feature activation+2.03e-01

Pos logit contributions
phabet+1.109
 Ital+1.103
 Fall+0.990
 Alphabet+0.980
istine+0.935
Neg logit contributions
ients-0.943
 isol-0.853
housing-0.835
 disrupted-0.831
Module-0.818
Token and
Token ID290
Feature activation+3.62e-01

Pos logit contributions
phabet+0.719
 Ital+0.717
 Fall+0.642
 Alphabet+0.640
istine+0.610
Neg logit contributions
ients-0.633
 isol-0.568
housing-0.558
 disrupted-0.556
Module-0.547
Token memory
Token ID4088
Feature activation+3.05e-01

Pos logit contributions
ients+1.602
 isol+1.457
 disrupted+1.386
housing+1.376
Module+1.364
Neg logit contributions
phabet-2.377
 Ital-2.329
 Alphabet-2.102
 Fall-2.073
istine-2.055
Token (
Token ID357
Feature activation+4.86e-01

Pos logit contributions
ients+1.476
 isol+1.351
 disrupted+1.306
housing+1.304
Module+1.272
Neg logit contributions
phabet-1.860
 Ital-1.847
 Alphabet-1.658
 Fall-1.641
istine-1.605
TokenIntel
Token ID24123
Feature activation+1.24e-01

Pos logit contributions
ients+2.536
housing+2.369
Module+2.330
 isol+2.324
 disrupted+2.245
Neg logit contributions
 Ital-2.705
phabet-2.642
 Alphabet-2.405
 Fall-2.393
istine-2.277
Token VT
Token ID32751
Feature activation+6.47e-01

Pos logit contributions
ients+0.675
 isol+0.612
housing+0.603
 disrupted+0.598
Module+0.595
Neg logit contributions
phabet-0.684
 Ital-0.676
 Alphabet-0.604
 Fall-0.597
istine-0.580
Token-
Token ID12
Feature activation+1.40e+00

Pos logit contributions
ients+2.662
 isol+2.460
 disrupted+2.329
housing+2.310
Module+2.310
Neg logit contributions
 Ital-4.338
phabet-4.287
istine-3.941
 Alphabet-3.853
 Fall-3.843
Tokenx
Token ID87
Feature activation+7.03e-01

Pos logit contributions
ients+5.537
housing+4.723
Module+4.720
ector+4.691
"),+4.677
Neg logit contributions
 Ital-9.345
phabet-8.954
istine-8.642
 Alphabet-8.466
 Fall-8.053
Token,
Token ID11
Feature activation+3.97e-01

Pos logit contributions
ients+2.363
 isol+2.055
 disrupted+1.916
housing+1.915
Module+1.895
Neg logit contributions
phabet-5.268
 Ital-5.261
 Alphabet-4.841
istine-4.749
 Fall-4.743
Token AMD
Token ID10324
Feature activation+5.72e-01

Pos logit contributions
ients+2.079
 isol+1.900
 disrupted+1.856
Module+1.836
housing+1.823
Neg logit contributions
phabet-2.191
 Ital-2.185
 Fall-1.925
istine-1.919
 Alphabet-1.918
Token-
Token ID12
Feature activation+2.80e-01

Pos logit contributions
ients+3.021
 isol+2.765
Module+2.709
housing+2.694
 disrupted+2.690
Neg logit contributions
phabet-3.128
 Ital-3.032
 Alphabet-2.738
istine-2.730
 Fall-2.647
TokenV
Token ID53
Feature activation+4.45e-01

Pos logit contributions
ients+1.505
housing+1.368
Module+1.349
 isol+1.332
 disrupted+1.303
Neg logit contributions
 Ital-1.579
phabet-1.564
 Alphabet-1.402
 Fall-1.382
istine-1.341
Token,
Token ID11
Feature activation+1.26e-01

Pos logit contributions
phabet+0.950
 Ital+0.949
 Fall+0.857
 Alphabet+0.847
istine+0.807
Neg logit contributions
ients-0.799
 isol-0.710
housing-0.700
 disrupted-0.695
Module-0.686
Token motor
Token ID5584
Feature activation-2.76e-01

Pos logit contributions
ients+0.647
 isol+0.589
 disrupted+0.574
housing+0.572
Module+0.557
Neg logit contributions
phabet-0.736
 Ital-0.721
 Fall-0.647
 Alphabet-0.640
istine-0.619
Tokenized
Token ID1143
Feature activation+1.58e-01

Pos logit contributions
 Ital+1.781
phabet+1.775
 Fall+1.623
 Alphabet+1.606
istine+1.533
Neg logit contributions
ients-1.242
 isol-1.082
housing-1.071
 disrupted-1.058
Module-1.049
Token wheelchair
Token ID27181
Feature activation-3.74e-01

Pos logit contributions
ients+0.703
 isol+0.628
housing+0.609
 disrupted+0.609
Module+0.594
Neg logit contributions
phabet-1.026
 Ital-1.011
 Fall-0.916
 Alphabet-0.906
istine-0.882
Token (
Token ID357
Feature activation+4.20e-01

Pos logit contributions
 Ital+2.124
phabet+2.111
 Fall+1.913
 Alphabet+1.891
istine+1.786
Neg logit contributions
ients-1.967
 isol-1.744
housing-1.733
 disrupted-1.714
Module-1.706
TokenFigure
Token ID11337
Feature activation+1.39e+00

Pos logit contributions
ients+2.151
housing+1.963
Module+1.928
 isol+1.913
 disrupted+1.870
Neg logit contributions
 Ital-2.400
phabet-2.394
 Fall-2.164
 Alphabet-2.130
istine-2.053
Token 2
Token ID362
Feature activation+8.09e-01

Pos logit contributions
ients+5.227
 isol+4.773
Module+4.503
"),+4.421
 disrupted+4.386
Neg logit contributions
phabet-9.009
 Ital-8.897
istine-8.197
 Fall-8.113
 Alphabet-8.087
TokenA
Token ID32
Feature activation+8.46e-01

Pos logit contributions
ients+3.624
 isol+3.266
housing+3.238
Module+3.167
 disrupted+3.143
Neg logit contributions
phabet-4.939
 Ital-4.853
 Fall-4.440
 Alphabet-4.407
istine-4.358
Token,
Token ID11
Feature activation+5.81e-01

Pos logit contributions
ients+2.923
 isol+2.514
housing+2.452
 disrupted+2.393
Module+2.356
Neg logit contributions
phabet-6.082
 Ital-5.990
 Fall-5.575
 Alphabet-5.569
istine-5.447
Token [
Token ID685
Feature activation-2.32e-02

Pos logit contributions
ients+2.969
 isol+2.703
housing+2.652
 disrupted+2.634
Module+2.619
Neg logit contributions
phabet-3.254
 Ital-3.183
 Fall-2.912
 Alphabet-2.866
istine-2.825
TokenAction
Token ID12502
Feature activation+4.02e-01

Pos logit contributions
phabet+0.132
 Ital+0.132
 Fall+0.119
 Alphabet+0.118
istine+0.112
Neg logit contributions
ients-0.121
housing-0.108
 isol-0.108
Module-0.106
 disrupted-0.105
Token doses
Token ID16376
Feature activation+1.05e+00

Pos logit contributions
phabet+0.031
 Ital+0.031
 Fall+0.027
 Alphabet+0.027
istine+0.026
Neg logit contributions
ients-0.026
 isol-0.023
 disrupted-0.023
housing-0.023
Module-0.022
Token up
Token ID510
Feature activation+4.01e-01

Pos logit contributions
ients+4.507
 disrupted+4.035
 isol+4.008
housing+3.752
 extrad+3.747
Neg logit contributions
phabet-6.884
 Ital-6.552
 Alphabet-6.045
istine-5.942
 Fall-5.849
Token to
Token ID284
Feature activation+3.73e-01

Pos logit contributions
ients+2.187
 isol+1.984
 disrupted+1.948
housing+1.936
Module+1.897
Neg logit contributions
phabet-2.217
 Ital-2.163
 Alphabet-1.915
 Fall-1.907
istine-1.834
Token 100
Token ID1802
Feature activation+3.46e-01

Pos logit contributions
ients+1.859
 isol+1.686
 disrupted+1.647
housing+1.621
Module+1.596
Neg logit contributions
phabet-2.228
 Ital-2.187
 Alphabet-1.948
 Fall-1.942
istine-1.881
Tokenmg
Token ID11296
Feature activation+3.85e-01

Pos logit contributions
ients+1.788
 isol+1.592
housing+1.555
 disrupted+1.551
Module+1.535
Neg logit contributions
 Ital-2.012
phabet-2.006
 Fall-1.802
 Alphabet-1.792
istine-1.704
Token/
Token ID14
Feature activation+1.39e+00

Pos logit contributions
ients+2.094
 isol+1.887
 disrupted+1.853
housing+1.831
Module+1.809
Neg logit contributions
 Ital-2.117
phabet-2.113
 Fall-1.884
 Alphabet-1.866
istine-1.783
Tokenkg
Token ID10025
Feature activation+1.51e-01

Pos logit contributions
ients+5.547
housing+5.003
Module+4.858
 isol+4.765
 disrupted+4.728
Neg logit contributions
 Ital-9.189
phabet-9.038
 Alphabet-8.285
 Fall-8.131
atron-7.708
Token.
Token ID13
Feature activation+1.49e-01

Pos logit contributions
ients+0.746
 isol+0.665
 disrupted+0.652
housing+0.651
Module+0.634
Neg logit contributions
phabet-0.907
 Ital-0.903
 Fall-0.815
 Alphabet-0.812
istine-0.772
Token Moreover
Token ID10968
Feature activation+1.03e-01

Pos logit contributions
ients+0.786
 isol+0.712
housing+0.705
 disrupted+0.701
Module+0.693
Neg logit contributions
phabet-0.836
 Ital-0.807
 Fall-0.724
 Alphabet-0.721
istine-0.700
Token,
Token ID11
Feature activation+4.24e-01

Pos logit contributions
ients+0.571
 isol+0.517
housing+0.508
 disrupted+0.505
Module+0.499
Neg logit contributions
phabet-0.567
 Ital-0.556
 Fall-0.494
 Alphabet-0.491
atron-0.471
Token E
Token ID412
Feature activation+2.03e-01

Pos logit contributions
ients+2.302
 isol+2.132
 disrupted+2.090
housing+2.044
 extrad+1.981
Neg logit contributions
phabet-2.321
 Ital-2.264
 Alphabet-2.033
 Fall-2.032
atron-1.978
Token Arab
Token ID4498
Feature activation+1.34e-01

Pos logit contributions
phabet+0.774
 Ital+0.768
 Fall+0.687
 Alphabet+0.683
istine+0.647
Neg logit contributions
ients-0.763
 isol-0.687
housing-0.676
 disrupted-0.672
Module-0.663
Token lands
Token ID8604
Feature activation+2.42e-01

Pos logit contributions
ients+0.622
 isol+0.556
housing+0.543
 disrupted+0.537
 terr+0.521
Neg logit contributions
phabet-0.856
 Ital-0.829
 Alphabet-0.759
 Fall-0.755
atron-0.721
Token
Token ID1399
Feature activation+3.67e-01

Pos logit contributions
ients+1.246
 isol+1.151
 disrupted+1.117
housing+1.106
 extrad+1.080
Neg logit contributions
phabet-1.414
 Ital-1.364
 Alphabet-1.228
 Fall-1.222
istine-1.174
Tokenwithout
Token ID19419
Feature activation-6.80e-01

Pos logit contributions
ients+1.821
housing+1.684
 isol+1.674
Module+1.616
 disrupted+1.605
Neg logit contributions
phabet-2.106
 Ital-2.093
 Fall-1.883
 Alphabet-1.874
istine-1.806
Token putting
Token ID5137
Feature activation-7.47e-01

Pos logit contributions
 Ital+3.987
phabet+3.943
 Fall+3.620
 Alphabet+3.552
istine+3.364
Neg logit contributions
ients-3.443
housing-3.011
 isol-2.995
Module-2.980
 disrupted-2.958
Token an
Token ID281
Feature activation-1.40e+00

Pos logit contributions
 Ital+4.212
phabet+4.165
 Fall+3.795
 Alphabet+3.737
istine+3.512
Neg logit contributions
ients-3.938
housing-3.481
 isol-3.465
Module-3.446
 disrupted-3.422
Token end
Token ID886
Feature activation-1.63e-01

Pos logit contributions
 Ital+4.416
phabet+4.045
 Fall+3.659
 Alphabet+3.539
atron+2.974
Neg logit contributions
ients-10.645
housing-9.924
Module-9.857
 disrupted-9.612
busters-9.524
Token to
Token ID284
Feature activation-7.23e-02

Pos logit contributions
 Ital+0.777
phabet+0.771
 Fall+0.679
 Alphabet+0.672
istine+0.629
Neg logit contributions
ients-1.012
 isol-0.913
housing-0.909
 disrupted-0.903
Module-0.899
Token the
Token ID262
Feature activation-8.64e-02

Pos logit contributions
phabet+0.405
 Ital+0.400
 Fall+0.360
 Alphabet+0.357
istine+0.340
Neg logit contributions
ients-0.385
 isol-0.352
housing-0.343
 disrupted-0.342
Module-0.333
Token policy
Token ID2450
Feature activation-2.90e-01

Pos logit contributions
phabet+0.477
 Ital+0.469
 Fall+0.418
 Alphabet+0.417
istine+0.398
Neg logit contributions
ients-0.472
 isol-0.430
housing-0.420
 disrupted-0.419
Module-0.408
Token of
Token ID286
Feature activation+1.19e-01

Pos logit contributions
 Ital+1.737
phabet+1.736
 Fall+1.572
 Alphabet+1.555
istine+1.477
Neg logit contributions
ients-1.440
 isol-1.277
housing-1.261
 disrupted-1.250
Module-1.234
Token Le
Token ID1004
Feature activation-2.94e-01

Pos logit contributions
 Ital+3.535
phabet+3.443
 Fall+3.134
 Alphabet+3.098
istine+2.841
Neg logit contributions
ients-4.553
housing-4.105
 isol-4.093
 disrupted-4.056
Module-4.038
Tokena
Token ID64
Feature activation-4.02e-01

Pos logit contributions
phabet+1.418
 Ital+1.406
 Fall+1.237
 Alphabet+1.228
atron+1.145
Neg logit contributions
ients-1.806
 isol-1.634
housing-1.626
 disrupted-1.604
Module-1.598
Token Suzuki
Token ID35807
Feature activation-8.22e-01

Pos logit contributions
 Ital+1.671
phabet+1.657
 Fall+1.456
 Alphabet+1.424
istine+1.318
Neg logit contributions
ients-2.716
 isol-2.480
housing-2.467
 disrupted-2.444
Module-2.441
Token,
Token ID11
Feature activation-4.62e-01

Pos logit contributions
 Ital+3.884
phabet+3.708
 Fall+3.437
 Alphabet+3.404
istine+3.089
Neg logit contributions
ients-5.059
 isol-4.519
housing-4.516
Module-4.510
 disrupted-4.496
Token The
Token ID383
Feature activation-1.77e-01

Pos logit contributions
 Ital+2.254
phabet+2.244
 Fall+1.991
 Alphabet+1.963
istine+1.840
Neg logit contributions
ients-2.796
 isol-2.528
housing-2.511
 disrupted-2.489
Module-2.474
Token Chronicle
Token ID14160
Feature activation-1.38e+00

Pos logit contributions
phabet+0.704
 Ital+0.654
 Fall+0.562
 Alphabet+0.553
atron+0.532
Neg logit contributions
ients-1.268
 isol-1.167
 disrupted-1.161
housing-1.155
Module-1.146
Token 2
Token ID362
Feature activation-7.11e-01

Pos logit contributions
 Ital+7.342
phabet+6.908
 Alphabet+6.648
 Fall+6.615
istine+5.957
Neg logit contributions
ients-7.383
housing-6.542
Module-6.436
'),-6.419
 theorist-6.403
Token of
Token ID286
Feature activation-2.55e-01

Pos logit contributions
phabet+3.690
 Ital+3.677
 Fall+3.356
 Alphabet+3.302
atron+3.111
Neg logit contributions
ients-4.033
 isol-3.548
housing-3.501
 disrupted-3.478
Module-3.460
Token 15
Token ID1315
Feature activation-4.43e-01

Pos logit contributions
 Ital+1.553
phabet+1.549
 Fall+1.408
 Alphabet+1.392
istine+1.325
Neg logit contributions
ients-1.236
 isol-1.089
housing-1.078
 disrupted-1.066
Module-1.058
Token Photo
Token ID5555
Feature activation-1.37e-01

Pos logit contributions
 Ital+2.210
phabet+2.190
 Fall+1.955
 Alphabet+1.938
istine+1.807
Neg logit contributions
ients-2.640
 isol-2.377
housing-2.369
 disrupted-2.343
Module-2.338
Token:
Token ID25
Feature activation-7.27e-01

Pos logit contributions
 Ital+0.779
phabet+0.778
 Fall+0.701
 Alphabet+0.693
istine+0.657
Neg logit contributions
ients-0.719
 isol-0.639
housing-0.634
 disrupted-0.628
Module-0.623
Token uncertainty
Token ID13479
Feature activation-2.69e-01

Pos logit contributions
phabet+0.283
 Ital+0.282
 Fall+0.256
 Alphabet+0.254
istine+0.242
Neg logit contributions
ients-0.207
 isol-0.183
housing-0.179
 disrupted-0.178
Module-0.175
Token,
Token ID11
Feature activation+8.84e-02

Pos logit contributions
 Ital+1.541
phabet+1.537
 Fall+1.386
 Alphabet+1.370
istine+1.300
Neg logit contributions
ients-1.401
 isol-1.247
housing-1.235
 disrupted-1.223
Module-1.210
Token but
Token ID475
Feature activation+6.58e-02

Pos logit contributions
ients+0.450
 isol+0.404
 disrupted+0.395
housing+0.395
Module+0.384
Neg logit contributions
phabet-0.520
 Ital-0.518
 Fall-0.464
 Alphabet-0.462
atron-0.438
Token if
Token ID611
Feature activation-3.02e-03

Pos logit contributions
ients+0.329
 isol+0.296
 disrupted+0.289
housing+0.285
Module+0.281
Neg logit contributions
phabet-0.395
 Ital-0.391
 Fall-0.350
 Alphabet-0.347
istine-0.333
Token you
Token ID345
Feature activation-1.24e-01

Pos logit contributions
phabet+0.018
 Ital+0.018
 Fall+0.016
 Alphabet+0.016
istine+0.015
Neg logit contributions
ients-0.015
 isol-0.013
 disrupted-0.013
housing-0.013
Module-0.013
Token pin
Token ID6757
Feature activation-1.37e+00

Pos logit contributions
 Ital+0.727
phabet+0.727
 Fall+0.654
 Alphabet+0.648
istine+0.616
Neg logit contributions
ients-0.631
 isol-0.562
housing-0.553
 disrupted-0.551
Module-0.544
Token me
Token ID502
Feature activation-4.54e-01

Pos logit contributions
 Ital+6.718
phabet+6.608
 Fall+6.098
 Alphabet+5.964
istine+5.637
Neg logit contributions
ients-7.609
housing-6.814
Module-6.799
 disrupted-6.796
 isol-6.790
Token down
Token ID866
Feature activation-5.74e-01

Pos logit contributions
phabet+2.413
 Ital+2.410
 Fall+2.143
 Alphabet+2.118
istine+1.999
Neg logit contributions
ients-2.571
 isol-2.311
housing-2.289
 disrupted-2.268
Module-2.245
Token and
Token ID290
Feature activation-1.82e-01

Pos logit contributions
 Ital+3.126
phabet+3.104
 Fall+2.806
 Alphabet+2.773
istine+2.613
Neg logit contributions
ients-3.136
 isol-2.790
housing-2.786
 disrupted-2.751
Module-2.743
Token say
Token ID910
Feature activation-1.46e-01

Pos logit contributions
 Ital+1.047
phabet+1.043
 Fall+0.943
 Alphabet+0.932
istine+0.884
Neg logit contributions
ients-0.945
 isol-0.839
housing-0.832
 disrupted-0.824
Module-0.817
Token,
Token ID11
Feature activation+1.16e-01

Pos logit contributions
phabet+0.840
 Ital+0.836
 Fall+0.748
 Alphabet+0.742
istine+0.705
Neg logit contributions
ients-0.757
 isol-0.679
housing-0.667
 disrupted-0.667
Module-0.655
Token tell
Token ID1560
Feature activation+1.38e-01

Pos logit contributions
ients+0.057
 isol+0.050
 disrupted+0.050
housing+0.050
Module+0.049
Neg logit contributions
phabet-0.068
 Ital-0.068
 Fall-0.061
 Alphabet-0.061
istine-0.058
Token me
Token ID502
Feature activation+2.06e-01

Pos logit contributions
ients+0.682
 isol+0.610
 disrupted+0.598
housing+0.595
Module+0.587
Neg logit contributions
phabet-0.827
 Ital-0.818
 Fall-0.735
 Alphabet-0.734
istine-0.700
Token they
Token ID484
Feature activation-1.86e-01

Pos logit contributions
ients+1.062
 isol+0.968
 disrupted+0.948
housing+0.936
Module+0.925
Neg logit contributions
phabet-1.195
 Ital-1.164
 Fall-1.035
 Alphabet-1.035
istine-0.995
Token have
Token ID423
Feature activation-1.86e-01

Pos logit contributions
 Ital+1.105
phabet+1.102
 Fall+0.999
 Alphabet+0.988
istine+0.939
Neg logit contributions
ients-0.934
 isol-0.826
housing-0.819
 disrupted-0.810
Module-0.804
Token placed
Token ID4624
Feature activation-5.41e-01

Pos logit contributions
phabet+1.134
 Ital+1.134
 Fall+1.027
 Alphabet+1.017
istine+0.968
Neg logit contributions
ients-0.903
 isol-0.797
housing-0.787
 disrupted-0.782
Module-0.772
Token big
Token ID1263
Feature activation-1.37e+00

Pos logit contributions
 Ital+2.957
phabet+2.925
 Fall+2.655
 Alphabet+2.617
istine+2.462
Neg logit contributions
ients-2.961
 isol-2.638
housing-2.632
 disrupted-2.591
Module-2.587
Token w
Token ID266
Feature activation-6.65e-01

Pos logit contributions
 Ital+6.294
phabet+6.085
 Fall+5.583
 Alphabet+5.427
istine+4.996
Neg logit contributions
ients-8.259
housing-7.473
Module-7.385
 disrupted-7.384
 isol-7.369
Tokenagers
Token ID10321
Feature activation-6.60e-01

Pos logit contributions
 Ital+3.715
phabet+3.696
 Fall+3.351
 Alphabet+3.270
istine+3.145
Neg logit contributions
ients-3.478
 isol-3.134
housing-3.104
 disrupted-3.096
Module-3.060
Token on
Token ID319
Feature activation-2.64e-01

Pos logit contributions
 Ital+3.285
phabet+3.212
 Fall+2.907
 Alphabet+2.863
istine+2.683
Neg logit contributions
ients-3.906
 isol-3.509
housing-3.509
Module-3.487
 disrupted-3.458
Token India
Token ID3794
Feature activation+7.03e-03

Pos logit contributions
phabet+1.491
 Ital+1.490
 Fall+1.338
 Alphabet+1.324
istine+1.255
Neg logit contributions
ients-1.399
 isol-1.252
housing-1.237
 disrupted-1.228
Module-1.216
Token to
Token ID284
Feature activation-9.53e-02

Pos logit contributions
ients+0.036
 isol+0.033
housing+0.032
 disrupted+0.032
Module+0.032
Neg logit contributions
phabet-0.040
 Ital-0.040
 Fall-0.036
 Alphabet-0.036
istine-0.034
Token with
Token ID351
Feature activation-2.82e-01

Pos logit contributions
phabet+0.057
 Ital+0.056
 Fall+0.050
 Alphabet+0.050
istine+0.047
Neg logit contributions
ients-0.056
 isol-0.050
housing-0.049
 disrupted-0.049
Module-0.048
Token traces
Token ID20675
Feature activation-3.84e-01

Pos logit contributions
 Ital+1.587
phabet+1.575
 Fall+1.430
 Alphabet+1.414
istine+1.328
Neg logit contributions
ients-1.493
 isol-1.327
housing-1.323
 disrupted-1.303
Module-1.300
Token of
Token ID286
Feature activation-2.83e-01

Pos logit contributions
 Ital+2.226
phabet+2.192
 Fall+2.019
 Alphabet+1.988
istine+1.862
Neg logit contributions
ients-1.984
housing-1.741
Module-1.727
 isol-1.724
 disrupted-1.682
Token the
Token ID262
Feature activation-2.19e-01

Pos logit contributions
phabet+1.478
 Ital+1.476
 Fall+1.313
 Alphabet+1.296
istine+1.226
Neg logit contributions
ients-1.620
 isol-1.462
housing-1.444
 disrupted-1.438
Module-1.422
Token a
Token ID257
Feature activation-3.72e-01

Pos logit contributions
phabet+1.227
 Ital+1.218
 Fall+1.091
 Alphabet+1.077
istine+1.029
Neg logit contributions
ients-1.176
 isol-1.059
 disrupted-1.040
housing-1.036
Module-1.020
Token Cyr
Token ID40399
Feature activation-1.35e+00

Pos logit contributions
 Ital+1.991
phabet+1.982
 Fall+1.778
 Alphabet+1.756
istine+1.657
Neg logit contributions
ients-2.081
 isol-1.866
housing-1.850
 disrupted-1.836
Module-1.820
Tokenill
Token ID359
Feature activation-6.24e-01

Pos logit contributions
 Ital+7.499
phabet+7.324
 Fall+6.617
 Alphabet+6.608
istine+6.342
Neg logit contributions
ients-6.980
housing-6.374
 disrupted-6.193
Module-6.120
 extrad-6.080
Tokenic
Token ID291
Feature activation-7.89e-01

Pos logit contributions
 Ital+3.117
phabet+3.070
 Fall+2.749
 Alphabet+2.705
istine+2.571
Neg logit contributions
ients-3.707
housing-3.346
 isol-3.313
Module-3.304
 disrupted-3.303
Token found
Token ID1043
Feature activation-7.80e-02

Pos logit contributions
 Ital+4.267
phabet+4.192
 Alphabet+3.829
 Fall+3.798
istine+3.496
Neg logit contributions
ients-4.365
housing-3.921
Module-3.836
 isol-3.834
 disrupted-3.764
Token in
Token ID287
Feature activation-1.92e-01

Pos logit contributions
phabet+0.409
 Ital+0.407
 Fall+0.364
 Alphabet+0.357
istine+0.339
Neg logit contributions
ients-0.445
 isol-0.402
housing-0.396
 disrupted-0.395
Module-0.387
Token their
Token ID511
Feature activation-1.15e-01

Pos logit contributions
phabet+1.055
 Ital+1.054
 Fall+0.943
 Alphabet+0.930
istine+0.884
Neg logit contributions
ients-1.043
 isol-0.935
housing-0.921
 disrupted-0.918
Module-0.906
Tokeniff
Token ID733
Feature activation+6.76e-02

Pos logit contributions
 Ital+1.798
phabet+1.794
 Fall+1.624
 Alphabet+1.605
istine+1.526
Neg logit contributions
ients-1.548
 isol-1.374
housing-1.360
 disrupted-1.348
Module-1.336
TokenTra
Token ID15721
Feature activation-4.22e-01

Pos logit contributions
ients+0.353
housing+0.319
Module+0.318
 isol+0.316
 disrupted+0.310
Neg logit contributions
 Ital-0.383
phabet-0.382
 Fall-0.341
 Alphabet-0.336
istine-0.323
Tokenx
Token ID87
Feature activation-4.60e-01

Pos logit contributions
 Ital+2.299
phabet+2.288
 Fall+2.080
 Alphabet+2.048
atron+1.927
Neg logit contributions
ients-2.287
 isol-2.058
 disrupted-2.026
housing-2.016
Module-1.983
Token
Token ID198
Feature activation+1.52e-01

Pos logit contributions
 Ital+2.498
phabet+2.436
 Fall+2.236
 Alphabet+2.202
istine+2.074
Neg logit contributions
ients-2.537
 isol-2.253
housing-2.240
 disrupted-2.223
Module-2.205
Token
Token ID198
Feature activation-2.43e-01

Pos logit contributions
ients+0.749
 isol+0.659
housing+0.659
Module+0.651
 disrupted+0.642
Neg logit contributions
phabet-0.926
 Ital-0.913
 Fall-0.813
 Alphabet-0.811
istine-0.776
TokenReviewer
Token ID35407
Feature activation-1.35e+00

Pos logit contributions
 Ital+1.368
phabet+1.367
 Fall+1.231
 Alphabet+1.214
istine+1.154
Neg logit contributions
ients-1.282
 isol-1.145
housing-1.128
 disrupted-1.125
Module-1.110
Token:
Token ID25
Feature activation-3.46e-01

Pos logit contributions
 Ital+5.712
 Fall+5.398
phabet+5.370
 Alphabet+5.161
atron+4.644
Neg logit contributions
ients-8.358
 disrupted-7.683
 isol-7.657
 theorist-7.526
 prosecuting-7.407
Token V
Token ID569
Feature activation-5.81e-03

Pos logit contributions
 Ital+1.841
phabet+1.819
 Fall+1.648
 Alphabet+1.621
istine+1.522
Neg logit contributions
ients-1.948
housing-1.739
 isol-1.738
 disrupted-1.713
Module-1.708
Tokenthe
Token ID1169
Feature activation+3.34e-02

Pos logit contributions
 Ital+0.031
phabet+0.031
 Fall+0.028
 Alphabet+0.028
istine+0.026
Neg logit contributions
ients-0.033
 isol-0.029
housing-0.029
Module-0.028
 disrupted-0.028
TokenHappy
Token ID25082
Feature activation-2.44e-01

Pos logit contributions
ients+0.189
housing+0.169
 isol+0.169
Module+0.167
 disrupted+0.165
Neg logit contributions
 Ital-0.177
phabet-0.176
 Fall-0.157
 Alphabet-0.156
istine-0.146
TokenL
Token ID43
Feature activation-2.16e-01

Pos logit contributions
 Ital+1.359
phabet+1.355
 Fall+1.220
 Alphabet+1.205
istine+1.141
Neg logit contributions
ients-1.307
 isol-1.166
housing-1.156
 disrupted-1.145
Module-1.137
Token Le
Token ID1004
Feature activation-1.12e-01

Pos logit contributions
 Ital+3.826
phabet+3.577
 Fall+3.354
 Alphabet+3.292
istine+3.054
Neg logit contributions
ients-5.621
housing-5.175
Module-5.162
 isol-5.156
 disrupted-5.133
Tokena
Token ID64
Feature activation-1.48e-01

Pos logit contributions
phabet+0.533
 Ital+0.528
 Fall+0.457
 Alphabet+0.453
istine+0.425
Neg logit contributions
ients-0.703
housing-0.635
 isol-0.631
Module-0.627
 disrupted-0.614
Token Suzuki
Token ID35807
Feature activation-7.30e-01

Pos logit contributions
phabet+0.653
 Ital+0.647
 Fall+0.562
 Alphabet+0.552
istine+0.520
Neg logit contributions
ients-0.968
 isol-0.885
housing-0.879
Module-0.870
 disrupted-0.869
Token,
Token ID11
Feature activation-4.62e-01

Pos logit contributions
 Ital+3.302
phabet+3.127
 Fall+2.967
 Alphabet+2.897
atron+2.630
Neg logit contributions
ients-4.670
 isol-4.236
 disrupted-4.205
housing-4.167
 theorist-4.095
Token The
Token ID383
Feature activation-2.06e-01

Pos logit contributions
 Ital+2.440
phabet+2.361
 Fall+2.206
 Alphabet+2.165
istine+1.992
Neg logit contributions
ients-2.654
 isol-2.384
housing-2.354
 disrupted-2.351
Module-2.287
Token Chronicle
Token ID14160
Feature activation-1.34e+00

Pos logit contributions
 Ital+0.685
phabet+0.679
 Fall+0.569
 Alphabet+0.558
istine+0.499
Neg logit contributions
ients-1.566
 isol-1.447
housing-1.437
 disrupted-1.428
Module-1.419
Token 15
Token ID1315
Feature activation-7.03e-01

Pos logit contributions
 Ital+6.593
 Fall+6.133
phabet+5.962
 Alphabet+5.912
atron+5.215
Neg logit contributions
ients-7.697
 isol-6.900
 disrupted-6.881
 theorist-6.848
housing-6.824
Token of
Token ID286
Feature activation-4.23e-01

Pos logit contributions
 Ital+3.748
phabet+3.625
 Fall+3.461
 Alphabet+3.390
istine+3.126
Neg logit contributions
ients-3.856
 isol-3.430
 disrupted-3.394
housing-3.387
 theorist-3.318
Token 15
Token ID1315
Feature activation-2.73e-01

Pos logit contributions
 Ital+2.522
phabet+2.489
 Fall+2.282
 Alphabet+2.254
istine+2.135
Neg logit contributions
ients-2.107
 isol-1.859
housing-1.841
 disrupted-1.827
Module-1.810
Token Photo
Token ID5555
Feature activation+1.94e-01

Pos logit contributions
 Ital+1.282
phabet+1.263
 Fall+1.136
 Alphabet+1.130
istine+1.021
Neg logit contributions
ients-1.696
 isol-1.537
housing-1.536
 disrupted-1.515
Module-1.508
Token:
Token ID25
Feature activation-7.99e-01

Pos logit contributions
ients+0.945
housing+0.835
 isol+0.832
Module+0.825
 disrupted+0.814
Neg logit contributions
phabet-1.193
 Ital-1.165
 Fall-1.040
 Alphabet-1.034
istine-1.001
Token puts
Token ID7584
Feature activation-9.77e-02

Pos logit contributions
 Ital+1.717
phabet+1.717
 Fall+1.547
 Alphabet+1.531
istine+1.457
Neg logit contributions
ients-1.496
 isol-1.332
housing-1.315
 disrupted-1.306
Module-1.291
Token all
Token ID477
Feature activation+7.32e-02

Pos logit contributions
 Ital+0.567
phabet+0.565
 Fall+0.504
 Alphabet+0.503
istine+0.480
Neg logit contributions
ients-0.503
 isol-0.450
housing-0.441
 disrupted-0.439
Module-0.430
Token other
Token ID584
Feature activation-7.82e-02

Pos logit contributions
ients+0.371
 isol+0.331
 disrupted+0.323
housing+0.323
Module+0.315
Neg logit contributions
 Ital-0.430
phabet-0.429
 Fall-0.383
 Alphabet-0.382
atron-0.367
Token smooth
Token ID7209
Feature activation-1.95e-01

Pos logit contributions
phabet+0.413
 Ital+0.409
 Fall+0.360
 Alphabet+0.357
istine+0.345
Neg logit contributions
ients-0.447
 isol-0.406
 disrupted-0.396
housing-0.393
Module-0.386
Tokenies
Token ID444
Feature activation-6.12e-01

Pos logit contributions
 Ital+1.154
phabet+1.151
 Fall+1.047
 Alphabet+1.029
istine+0.977
Neg logit contributions
ients-0.970
 isol-0.860
housing-0.855
 disrupted-0.850
Module-0.846
Token to
Token ID284
Feature activation-1.34e+00

Pos logit contributions
 Ital+3.131
phabet+3.080
 Fall+2.795
 Alphabet+2.746
istine+2.576
Neg logit contributions
ients-3.541
housing-3.169
 isol-3.153
Module-3.130
 disrupted-3.121
Token shame
Token ID10195
Feature activation+2.93e-01

Pos logit contributions
 Ital+6.601
phabet+6.505
 Fall+5.964
 Alphabet+5.851
istine+5.405
Neg logit contributions
ients-7.445
housing-6.763
ector-6.656
Module-6.618
 disrupted-6.567
Token.
Token ID13
Feature activation-2.51e-01

Pos logit contributions
ients+1.366
 isol+1.212
housing+1.192
 disrupted+1.174
Module+1.148
Neg logit contributions
phabet-1.831
 Ital-1.816
 Fall-1.644
 Alphabet-1.632
istine-1.564
Token It
Token ID632
Feature activation-2.28e-01

Pos logit contributions
phabet+1.442
 Ital+1.413
 Fall+1.264
 Alphabet+1.255
istine+1.212
Neg logit contributions
ients-1.304
 isol-1.166
housing-1.157
Module-1.144
 disrupted-1.140
Token
Token ID447
Feature activation-1.07e-01

Pos logit contributions
phabet+1.364
 Ital+1.363
 Fall+1.233
 Alphabet+1.223
istine+1.160
Neg logit contributions
ients-1.134
 isol-1.008
housing-0.993
 disrupted-0.985
Module-0.973
Token
Token ID247
Feature activation+6.25e-02

Pos logit contributions
 Ital+0.425
phabet+0.425
 Fall+0.361
 Alphabet+0.357
istine+0.333
Neg logit contributions
ients-0.753
 isol-0.691
housing-0.685
Module-0.677
 disrupted-0.675
Token 2010
Token ID3050
Feature activation-5.31e-02

Pos logit contributions
 Ital+3.043
phabet+2.906
 Fall+2.764
 Alphabet+2.665
istine+2.363
Neg logit contributions
ients-5.281
 isol-4.881
housing-4.779
 disrupted-4.775
Module-4.715
Tokenboys
Token ID13202
Feature activation-1.37e-01

Pos logit contributions
 Ital+0.460
phabet+0.457
 Fall+0.429
 Alphabet+0.426
istine+0.411
Neg logit contributions
ients-0.124
 isol-0.092
housing-0.091
 disrupted-0.087
Module-0.086
Token beware
Token ID40600
Feature activation-3.34e-01

Pos logit contributions
 Ital+0.561
phabet+0.556
 Fall+0.487
 Alphabet+0.475
istine+0.441
Neg logit contributions
ients-0.933
 isol-0.851
housing-0.847
 disrupted-0.839
Module-0.837
Token
Token ID198
Feature activation-1.39e-01

Pos logit contributions
 Ital+1.806
phabet+1.767
 Fall+1.624
 Alphabet+1.587
istine+1.497
Neg logit contributions
ients-1.860
 isol-1.656
 disrupted-1.634
housing-1.632
Module-1.627
Token
Token ID198
Feature activation-4.75e-01

Pos logit contributions
 Ital+0.785
phabet+0.781
 Fall+0.708
 Alphabet+0.698
istine+0.661
Neg logit contributions
ients-0.736
 isol-0.656
housing-0.649
 disrupted-0.644
Module-0.638
TokenReviewer
Token ID35407
Feature activation-1.33e+00

Pos logit contributions
phabet+2.549
 Ital+2.527
 Fall+2.272
 Alphabet+2.232
istine+2.131
Neg logit contributions
ients-2.612
 isol-2.364
 disrupted-2.337
housing-2.292
Module-2.265
Token:
Token ID25
Feature activation-3.23e-01

Pos logit contributions
 Ital+5.644
 Fall+5.350
phabet+5.314
 Alphabet+5.108
atron+4.590
Neg logit contributions
ients-8.273
 disrupted-7.588
 isol-7.557
 theorist-7.450
 prosecuting-7.325
Token j
Token ID474
Feature activation+3.54e-02

Pos logit contributions
 Ital+1.722
phabet+1.703
 Fall+1.542
 Alphabet+1.517
istine+1.426
Neg logit contributions
ients-1.814
 isol-1.618
housing-1.617
 disrupted-1.594
Module-1.589
Tokenwt
Token ID46569
Feature activation-1.76e-01

Pos logit contributions
ients+0.201
housing+0.179
 isol+0.179
 disrupted+0.174
Module+0.174
Neg logit contributions
 Ital-0.190
phabet-0.187
 Fall-0.168
 Alphabet-0.167
istine-0.155
Token0001
Token ID18005
Feature activation-1.91e-01

Pos logit contributions
 Ital+0.968
phabet+0.960
 Fall+0.863
 Alphabet+0.855
istine+0.804
Neg logit contributions
ients-0.959
 isol-0.856
housing-0.852
 disrupted-0.839
Module-0.833
Token -
Token ID532
Feature activation-5.36e-01

Pos logit contributions
 Ital+1.175
phabet+1.171
 Fall+1.072
 Alphabet+1.058
istine+1.006
Neg logit contributions
ients-0.917
 isol-0.803
housing-0.796
 disrupted-0.788
Module-0.782
Token
Token ID250
Feature activation+5.91e-02

Pos logit contributions
 Ital+1.111
phabet+1.110
 Fall+0.982
 Alphabet+0.976
istine+0.919
Neg logit contributions
ients-1.274
 isol-1.144
housing-1.131
 disrupted-1.122
Module-1.114
Tokenput
Token ID1996
Feature activation-5.07e-01

Pos logit contributions
ients+0.299
housing+0.267
 isol+0.266
 disrupted+0.258
Module+0.257
Neg logit contributions
 Ital-0.351
phabet-0.348
 Fall-0.314
 Alphabet-0.313
istine-0.297
Tokenting
Token ID889
Feature activation-5.77e-01

Pos logit contributions
 Ital+3.245
phabet+3.237
 Fall+2.959
 Alphabet+2.929
istine+2.794
Neg logit contributions
ients-2.289
 isol-1.998
housing-1.975
 disrupted-1.955
Module-1.941
Token the
Token ID262
Feature activation-6.40e-01

Pos logit contributions
 Ital+3.136
phabet+3.121
 Fall+2.815
 Alphabet+2.778
istine+2.616
Neg logit contributions
ients-3.161
 isol-2.825
housing-2.807
 disrupted-2.778
Module-2.767
Token finishing
Token ID12848
Feature activation-6.09e-01

Pos logit contributions
 Ital+3.836
phabet+3.771
 Fall+3.506
 Alphabet+3.446
istine+3.230
Neg logit contributions
ients-3.132
housing-2.757
 isol-2.737
Module-2.725
 disrupted-2.703
Token touches
Token ID18105
Feature activation-1.30e+00

Pos logit contributions
 Ital+3.663
phabet+3.521
 Fall+3.285
 Alphabet+3.242
istine+3.054
Neg logit contributions
ients-2.982
housing-2.619
 isol-2.590
Module-2.587
 disrupted-2.539
Token on
Token ID319
Feature activation+4.73e-02

Pos logit contributions
 Ital+6.071
phabet+5.840
 Fall+5.428
 Alphabet+5.252
istine+4.871
Neg logit contributions
ients-7.755
housing-6.972
 isol-6.956
Module-6.950
 disrupted-6.840
Token a
Token ID257
Feature activation+3.87e-01

Pos logit contributions
ients+0.248
 isol+0.221
housing+0.219
 disrupted+0.217
Module+0.214
Neg logit contributions
phabet-0.270
 Ital-0.268
 Fall-0.240
 Alphabet-0.238
istine-0.228
Token short
Token ID1790
Feature activation+4.68e-01

Pos logit contributions
ients+1.726
 isol+1.530
 disrupted+1.517
housing+1.513
 terr+1.467
Neg logit contributions
phabet-2.492
 Ital-2.453
 Alphabet-2.203
 Fall-2.180
istine-2.129
Token-
Token ID12
Feature activation-2.81e-01

Pos logit contributions
ients+1.930
housing+1.704
 isol+1.703
 disrupted+1.674
Module+1.623
Neg logit contributions
phabet-3.158
 Ital-3.102
 Alphabet-2.802
 Fall-2.781
istine-2.721
Tokenstory
Token ID13571
Feature activation+2.50e-01

Pos logit contributions
 Ital+1.498
phabet+1.488
 Fall+1.345
 Alphabet+1.322
istine+1.242
Neg logit contributions
ients-1.572
 isol-1.408
housing-1.388
 disrupted-1.385
Module-1.374
Token When
Token ID1649
Feature activation+2.95e-01

Pos logit contributions
phabet+0.969
 Ital+0.956
 Fall+0.857
 Alphabet+0.849
istine+0.817
Neg logit contributions
ients-0.888
 isol-0.793
housing-0.788
 disrupted-0.777
Module-0.773
Token she
Token ID673
Feature activation-1.17e-01

Pos logit contributions
ients+1.402
 isol+1.251
 disrupted+1.222
housing+1.211
Module+1.188
Neg logit contributions
phabet-1.816
 Ital-1.796
 Fall-1.620
 Alphabet-1.605
istine-1.563
Token would
Token ID561
Feature activation-9.18e-02

Pos logit contributions
phabet+0.711
 Ital+0.711
 Fall+0.643
 Alphabet+0.636
istine+0.606
Neg logit contributions
ients-0.573
 isol-0.507
housing-0.498
 disrupted-0.498
Module-0.488
Token put
Token ID1234
Feature activation-5.07e-01

Pos logit contributions
phabet+0.540
 Ital+0.539
 Fall+0.484
 Alphabet+0.480
istine+0.458
Neg logit contributions
ients-0.466
 isol-0.414
housing-0.408
 disrupted-0.407
Module-0.401
Token them
Token ID606
Feature activation-6.46e-01

Pos logit contributions
 Ital+2.823
phabet+2.797
 Fall+2.535
 Alphabet+2.507
istine+2.354
Neg logit contributions
ients-2.719
housing-2.423
 isol-2.417
Module-2.383
 disrupted-2.381
Token to
Token ID284
Feature activation-1.29e+00

Pos logit contributions
 Ital+3.402
phabet+3.344
 Fall+3.027
 Alphabet+3.004
istine+2.793
Neg logit contributions
ients-3.633
housing-3.259
 isol-3.243
Module-3.234
 disrupted-3.193
Token sleep
Token ID3993
Feature activation-4.48e-04

Pos logit contributions
 Ital+7.351
phabet+7.225
 Alphabet+6.674
 Fall+6.626
atron+6.023
Neg logit contributions
ients-6.353
housing-5.736
Module-5.616
 disrupted-5.558
 isol-5.543
Token each
Token ID1123
Feature activation+3.89e-01

Pos logit contributions
phabet+0.002
 Ital+0.002
 Fall+0.002
 Alphabet+0.002
istine+0.002
Neg logit contributions
ients-0.002
 isol-0.002
 disrupted-0.002
housing-0.002
Module-0.002
Token night
Token ID1755
Feature activation+3.97e-02

Pos logit contributions
ients+1.676
 isol+1.537
 disrupted+1.498
housing+1.466
 theorist+1.430
Neg logit contributions
phabet-2.565
 Ital-2.494
 Alphabet-2.225
 Fall-2.196
atron-2.188
Token as
Token ID355
Feature activation-4.89e-02

Pos logit contributions
ients+0.222
 isol+0.201
housing+0.198
 disrupted+0.196
Module+0.193
Neg logit contributions
phabet-0.212
 Ital-0.211
 Fall-0.188
 Alphabet-0.186
istine-0.177
Token children
Token ID1751
Feature activation-1.57e-01

Pos logit contributions
phabet+0.288
 Ital+0.287
 Fall+0.257
 Alphabet+0.254
istine+0.246
Neg logit contributions
ients-0.248
 isol-0.222
 disrupted-0.217
housing-0.215
Module-0.211
Token x
Token ID2124
Feature activation+1.20e-03

Pos logit contributions
 Ital+1.364
phabet+1.354
 Fall+1.198
 Alphabet+1.180
istine+1.099
Neg logit contributions
ients-1.844
 isol-1.672
housing-1.662
 disrupted-1.647
Module-1.636
Token.
Token ID13
Feature activation-5.18e-01

Pos logit contributions
ients+0.006
 isol+0.005
housing+0.005
Module+0.005
 disrupted+0.005
Neg logit contributions
phabet-0.007
 Ital-0.007
 Fall-0.007
 Alphabet-0.007
istine-0.006
Tokenx
Token ID87
Feature activation-4.14e-01

Pos logit contributions
 Ital+2.786
phabet+2.755
 Fall+2.528
 Alphabet+2.484
atron+2.344
Neg logit contributions
ients-2.836
 isol-2.563
 disrupted-2.528
housing-2.492
 theorist-2.464
Token.
Token ID13
Feature activation-9.57e-01

Pos logit contributions
 Ital+2.341
phabet+2.286
 Fall+2.146
 Alphabet+2.100
istine+1.977
Neg logit contributions
ients-2.167
 isol-1.937
 disrupted-1.917
housing-1.899
 theorist-1.864
Tokenx
Token ID87
Feature activation-7.21e-01

Pos logit contributions
 Ital+4.582
phabet+4.495
 Fall+4.239
 Alphabet+4.103
atron+3.905
Neg logit contributions
ients-5.545
 isol-5.119
 disrupted-5.081
 theorist-5.030
 prosecuting-4.853
Token.
Token ID13
Feature activation-1.29e+00

Pos logit contributions
 Ital+3.853
phabet+3.760
 Fall+3.582
 Alphabet+3.473
atron+3.279
Neg logit contributions
ients-3.909
 disrupted-3.488
 isol-3.485
 theorist-3.417
housing-3.401
Token169
Token ID22172
Feature activation-2.03e-01

Pos logit contributions
phabet+4.734
 Ital+4.649
 Fall+4.222
 Alphabet+4.091
atron+3.846
Neg logit contributions
ients-8.787
 disrupted-7.927
 isol-7.785
 theorist-7.773
housing-7.741
Token 11
Token ID1367
Feature activation-5.52e-01

Pos logit contributions
phabet+1.120
 Ital+1.118
 Fall+1.001
 Alphabet+0.990
istine+0.940
Neg logit contributions
ients-1.101
 isol-0.984
housing-0.977
 disrupted-0.967
Module-0.962
Token66
Token ID2791
Feature activation-2.84e-01

Pos logit contributions
 Ital+2.969
phabet+2.947
 Fall+2.663
 Alphabet+2.625
istine+2.475
Neg logit contributions
ients-3.067
 isol-2.746
housing-2.720
 disrupted-2.701
Module-2.671
Token589
Token ID44169
Feature activation+8.11e-02

Pos logit contributions
 Ital+1.623
phabet+1.615
 Fall+1.464
 Alphabet+1.446
istine+1.369
Neg logit contributions
ients-1.482
 isol-1.317
housing-1.306
 disrupted-1.294
Module-1.280
Token x
Token ID2124
Feature activation+2.90e-02

Pos logit contributions
ients+0.477
 isol+0.436
Module+0.429
 disrupted+0.427
housing+0.427
Neg logit contributions
phabet-0.414
 Ital-0.403
 Fall-0.354
 Alphabet-0.350
istine-0.337
Token,
Token ID11
Feature activation+2.98e-01

Pos logit contributions
ients+0.350
housing+0.314
Module+0.311
 isol+0.310
 disrupted+0.303
Neg logit contributions
phabet-0.373
 Ital-0.364
 Fall-0.321
 Alphabet-0.319
istine-0.309
Token I
Token ID314
Feature activation+3.39e-01

Pos logit contributions
ients+1.579
 isol+1.404
housing+1.396
 disrupted+1.380
Module+1.374
Neg logit contributions
 Ital-1.684
phabet-1.674
 Fall-1.515
 Alphabet-1.494
istine-1.414
Token think
Token ID892
Feature activation-8.04e-01

Pos logit contributions
ients+1.495
 isol+1.334
 disrupted+1.303
housing+1.297
Module+1.277
Neg logit contributions
 Ital-2.187
phabet-2.177
 Alphabet-1.944
 Fall-1.915
istine-1.854
Token
Token ID198
Feature activation+1.49e-01

Pos logit contributions
 Ital+4.149
phabet+3.992
 Fall+3.711
 Alphabet+3.629
istine+3.362
Neg logit contributions
ients-4.607
 isol-4.129
 disrupted-4.097
housing-4.093
Module-4.085
Token
Token ID198
Feature activation-1.84e-01

Pos logit contributions
ients+0.736
 isol+0.649
housing+0.647
Module+0.639
 disrupted+0.632
Neg logit contributions
phabet-0.901
 Ital-0.889
 Fall-0.793
 Alphabet-0.789
istine-0.756
TokenReviewer
Token ID35407
Feature activation-1.27e+00

Pos logit contributions
phabet+1.039
 Ital+1.038
 Fall+0.934
 Alphabet+0.923
istine+0.875
Neg logit contributions
ients-0.974
 isol-0.870
housing-0.857
 disrupted-0.856
Module-0.843
Token:
Token ID25
Feature activation-2.82e-01

Pos logit contributions
 Ital+5.549
 Fall+5.227
phabet+5.218
 Alphabet+5.026
atron+4.520
Neg logit contributions
ients-7.868
 disrupted-7.195
 isol-7.191
 theorist-7.037
 prosecuting-6.930
Token B
Token ID347
Feature activation-1.26e-01

Pos logit contributions
 Ital+1.487
phabet+1.473
 Fall+1.328
 Alphabet+1.308
istine+1.229
Neg logit contributions
ients-1.599
 isol-1.430
housing-1.426
 disrupted-1.408
Module-1.402
Tokeniz
Token ID528
Feature activation-3.76e-01

Pos logit contributions
 Ital+0.685
phabet+0.681
 Fall+0.610
 Alphabet+0.606
istine+0.569
Neg logit contributions
ients-0.700
 isol-0.624
housing-0.621
 disrupted-0.613
Module-0.612
Tokenr
Token ID81
Feature activation+2.32e-01

Pos logit contributions
 Ital+2.026
phabet+2.021
 Fall+1.807
 Alphabet+1.790
istine+1.687
Neg logit contributions
ients-2.089
 isol-1.869
housing-1.860
 disrupted-1.836
Module-1.831
Token Pix
Token ID21642
Feature activation-5.18e-01

Pos logit contributions
ients+1.282
 isol+1.141
housing+1.139
Module+1.125
 disrupted+1.112
Neg logit contributions
 Ital-1.249
phabet-1.247
 Fall-1.112
 Alphabet-1.110
istine-1.032
Token argument
Token ID4578
Feature activation-2.82e-02

Pos logit contributions
ients+0.389
 isol+0.350
 disrupted+0.343
housing+0.338
Module+0.335
Neg logit contributions
phabet-0.489
 Ital-0.485
 Fall-0.436
 Alphabet-0.432
istine-0.421
Token offer
Token ID2897
Feature activation+5.74e-01

Pos logit contributions
phabet+0.164
 Ital+0.163
 Fall+0.147
 Alphabet+0.145
istine+0.138
Neg logit contributions
ients-0.147
 isol-0.131
 disrupted-0.129
housing-0.127
Module-0.125
Token,
Token ID11
Feature activation+7.31e-02

Pos logit contributions
ients+2.988
 disrupted+2.588
 isol+2.585
housing+2.545
 appealed+2.514
Neg logit contributions
phabet-3.312
 Ital-3.256
 Alphabet-2.899
 Fall-2.898
istine-2.772
Token partly
Token ID11476
Feature activation-3.97e-01

Pos logit contributions
ients+0.388
 isol+0.349
 disrupted+0.342
housing+0.340
Module+0.334
Neg logit contributions
phabet-0.412
 Ital-0.409
 Fall-0.367
 Alphabet-0.364
istine-0.348
Token,
Token ID11
Feature activation-5.41e-01

Pos logit contributions
 Ital+1.973
phabet+1.964
 Fall+1.746
 Alphabet+1.722
istine+1.617
Neg logit contributions
ients-2.371
 isol-2.141
housing-2.126
 disrupted-2.107
Module-2.094
Token too
Token ID1165
Feature activation-1.27e+00

Pos logit contributions
 Ital+2.879
phabet+2.856
 Fall+2.578
 Alphabet+2.538
istine+2.386
Neg logit contributions
ients-3.036
 isol-2.710
housing-2.708
Module-2.668
 disrupted-2.666
Token,
Token ID11
Feature activation-3.35e-01

Pos logit contributions
 Ital+5.382
phabet+5.086
 Fall+4.677
 Alphabet+4.453
istine+4.072
Neg logit contributions
ients-8.086
Module-7.335
housing-7.307
schild-7.205
 disrupted-7.203
Token because
Token ID780
Feature activation+2.32e-02

Pos logit contributions
 Ital+1.711
phabet+1.709
 Fall+1.519
 Alphabet+1.499
istine+1.414
Neg logit contributions
ients-1.955
 isol-1.766
housing-1.744
 disrupted-1.735
Module-1.716
Token frequent
Token ID10792
Feature activation+1.94e-01

Pos logit contributions
ients+0.121
 isol+0.109
 disrupted+0.106
housing+0.106
Module+0.105
Neg logit contributions
phabet-0.132
 Ital-0.131
 Fall-0.117
 Alphabet-0.116
istine-0.112
Token repetition
Token ID29693
Feature activation-1.43e-02

Pos logit contributions
ients+1.015
 isol+0.916
 disrupted+0.888
housing+0.888
Module+0.869
Neg logit contributions
phabet-1.099
 Ital-1.093
 Fall-0.982
 Alphabet-0.975
istine-0.947
Token brought
Token ID3181
Feature activation-6.03e-03

Pos logit contributions
phabet+0.086
 Ital+0.086
 Fall+0.077
 Alphabet+0.077
istine+0.073
Neg logit contributions
ients-0.071
 isol-0.064
 disrupted-0.062
housing-0.062
Module-0.060
Token doubts
Token ID17188
Feature activation-9.42e-02

Pos logit contributions
phabet+1.662
 Ital+1.658
 Fall+1.497
 Alphabet+1.484
istine+1.417
Neg logit contributions
ients-1.344
 isol-1.196
housing-1.175
 disrupted-1.174
Module-1.153
Token,
Token ID11
Feature activation-4.73e-02

Pos logit contributions
phabet+0.540
 Ital+0.540
 Fall+0.483
 Alphabet+0.479
istine+0.456
Neg logit contributions
ients-0.489
 isol-0.439
housing-0.433
 disrupted-0.428
Module-0.422
Token as
Token ID355
Feature activation-2.20e-01

Pos logit contributions
phabet+0.282
 Ital+0.282
 Fall+0.254
 Alphabet+0.253
istine+0.241
Neg logit contributions
ients-0.235
 isol-0.210
housing-0.206
 disrupted-0.205
Module-0.201
Token the
Token ID262
Feature activation-1.48e-02

Pos logit contributions
 Ital+1.274
phabet+1.272
 Fall+1.147
 Alphabet+1.135
istine+1.079
Neg logit contributions
ients-1.129
 isol-1.004
housing-0.991
 disrupted-0.984
Module-0.974
Token all
Token ID477
Feature activation+1.06e-01

Pos logit contributions
phabet+0.085
 Ital+0.085
 Fall+0.076
 Alphabet+0.075
istine+0.072
Neg logit contributions
ients-0.077
 isol-0.069
housing-0.068
 disrupted-0.068
Module-0.067
Token too
Token ID1165
Feature activation-1.27e+00

Pos logit contributions
ients+0.507
 isol+0.451
 disrupted+0.437
housing+0.436
Module+0.430
Neg logit contributions
 Ital-0.658
phabet-0.655
 Fall-0.593
 Alphabet-0.590
istine-0.561
Token familiar
Token ID5385
Feature activation-2.30e-01

Pos logit contributions
 Ital+7.080
phabet+6.880
 Fall+6.470
 Alphabet+6.285
istine+5.903
Neg logit contributions
ients-6.511
housing-5.803
Module-5.662
ector-5.559
 disrupted-5.463
Token and
Token ID290
Feature activation-1.07e-01

Pos logit contributions
phabet+1.379
 Ital+1.378
 Fall+1.245
 Alphabet+1.233
istine+1.174
Neg logit contributions
ients-1.132
 isol-1.004
housing-0.991
 disrupted-0.983
Module-0.971
Token annoying
Token ID15774
Feature activation-3.60e-01

Pos logit contributions
phabet+0.632
 Ital+0.629
 Fall+0.565
 Alphabet+0.561
istine+0.536
Neg logit contributions
ients-0.544
 isol-0.493
 disrupted-0.481
housing-0.477
Module-0.470
Token
Token ID564
Feature activation+1.01e-01

Pos logit contributions
 Ital+2.111
phabet+2.096
 Fall+1.909
 Alphabet+1.884
istine+1.785
Neg logit contributions
ients-1.832
 isol-1.615
housing-1.608
 disrupted-1.586
Module-1.580
Token
Token ID246
Feature activation-2.03e-02

Pos logit contributions
ients+0.673
 isol+0.615
housing+0.611
 disrupted+0.606
Module+0.603
Neg logit contributions
 Ital-0.435
phabet-0.433
 Fall-0.377
 Alphabet-0.370
istine-0.343
Token were
Token ID547
Feature activation+1.34e-01

Pos logit contributions
 Ital+0.241
phabet+0.240
 Fall+0.218
 Alphabet+0.216
istine+0.205
Neg logit contributions
ients-0.197
 isol-0.174
housing-0.172
 disrupted-0.170
Module-0.169
Token play
Token ID711
Feature activation+1.57e-01

Pos logit contributions
ients+0.471
 isol+0.397
housing+0.387
 disrupted+0.383
Module+0.377
Neg logit contributions
 Ital-0.991
phabet-0.990
 Fall-0.912
 Alphabet-0.906
istine-0.872
Token-
Token ID12
Feature activation-3.42e-01

Pos logit contributions
ients+0.702
 isol+0.613
housing+0.609
 disrupted+0.594
Module+0.588
Neg logit contributions
phabet-1.022
 Ital-1.019
 Fall-0.920
 Alphabet-0.915
istine-0.875
Tokentesting
Token ID33407
Feature activation-2.97e-01

Pos logit contributions
 Ital+2.065
phabet+2.060
 Fall+1.890
 Alphabet+1.864
atron+1.783
Neg logit contributions
ients-1.634
 isol-1.432
 disrupted-1.422
housing-1.392
Module-1.390
Token.
Token ID13
Feature activation-1.39e-01

Pos logit contributions
 Ital+1.723
phabet+1.713
 Fall+1.559
 Alphabet+1.540
istine+1.452
Neg logit contributions
ients-1.526
 isol-1.346
housing-1.338
 disrupted-1.322
Module-1.320
Token One
Token ID1881
Feature activation-1.27e+00

Pos logit contributions
phabet+0.709
 Ital+0.708
 Fall+0.627
 Alphabet+0.620
istine+0.586
Neg logit contributions
ients-0.813
 isol-0.733
housing-0.728
 disrupted-0.721
Module-0.718
Token was
Token ID373
Feature activation-8.52e-02

Pos logit contributions
 Ital+6.170
phabet+5.766
 Fall+5.614
 Alphabet+5.450
atron+4.923
Neg logit contributions
ients-7.291
housing-6.456
schild-6.363
 isol-6.282
Module-6.221
Token a
Token ID257
Feature activation-7.39e-02

Pos logit contributions
 Ital+0.463
phabet+0.461
 Fall+0.415
 Alphabet+0.409
istine+0.386
Neg logit contributions
ients-0.470
 isol-0.420
housing-0.417
 disrupted-0.412
Module-0.410
Token Thought
Token ID27522
Feature activation+1.71e-01

Pos logit contributions
phabet+0.371
 Ital+0.370
 Fall+0.326
 Alphabet+0.323
istine+0.305
Neg logit contributions
ients-0.437
 isol-0.395
housing-0.392
 disrupted-0.391
Module-0.386
Tokenst
Token ID301
Feature activation-2.13e-01

Pos logit contributions
ients+1.015
 isol+0.931
housing+0.928
 disrupted+0.925
Module+0.917
Neg logit contributions
phabet-0.833
 Ital-0.827
 Alphabet-0.725
 Fall-0.709
istine-0.680
Tokeneal
Token ID2287
Feature activation+5.62e-02

Pos logit contributions
 Ital+0.881
phabet+0.881
 Fall+0.752
 Alphabet+0.748
istine+0.682
Neg logit contributions
ients-1.454
 isol-1.329
housing-1.323
 disrupted-1.305
Module-1.301
Token bad
Token ID2089
Feature activation-6.77e-01

Pos logit contributions
 Ital+1.056
phabet+1.052
 Fall+0.949
 Alphabet+0.936
istine+0.884
Neg logit contributions
ients-1.043
 isol-0.932
housing-0.925
 disrupted-0.915
Module-0.909
Token idea
Token ID2126
Feature activation-4.81e-01

Pos logit contributions
 Ital+4.009
phabet+3.862
 Fall+3.707
 Alphabet+3.614
atron+3.385
Neg logit contributions
ients-3.403
housing-2.945
 disrupted-2.896
 isol-2.889
Module-2.882
Token.
Token ID13
Feature activation-3.66e-01

Pos logit contributions
 Ital+2.761
phabet+2.722
 Fall+2.520
 Alphabet+2.474
atron+2.313
Neg logit contributions
ients-2.495
 isol-2.212
housing-2.183
 disrupted-2.182
Module-2.138
Token
Token ID198
Feature activation-3.25e-01

Pos logit contributions
 Ital+2.061
phabet+1.973
 Fall+1.885
 Alphabet+1.834
atron+1.686
Neg logit contributions
ients-1.957
 isol-1.755
 disrupted-1.733
housing-1.727
Module-1.680
Token
Token ID198
Feature activation-5.07e-01

Pos logit contributions
 Ital+1.756
phabet+1.723
 Fall+1.594
 Alphabet+1.567
istine+1.467
Neg logit contributions
ients-1.792
 isol-1.604
 disrupted-1.585
housing-1.583
Module-1.548
TokenReviewer
Token ID35407
Feature activation-1.27e+00

Pos logit contributions
phabet+2.791
 Ital+2.765
 Fall+2.497
 Alphabet+2.461
atron+2.342
Neg logit contributions
ients-2.736
 isol-2.452
 disrupted-2.425
housing-2.400
Module-2.358
Token:
Token ID25
Feature activation-1.96e-01

Pos logit contributions
 Ital+5.540
 Fall+5.222
phabet+5.188
 Alphabet+5.001
atron+4.527
Neg logit contributions
ients-7.776
 disrupted-7.084
 isol-7.071
 theorist-6.929
housing-6.845
Token Mike
Token ID4995
Feature activation+7.54e-02

Pos logit contributions
 Ital+1.030
phabet+1.025
 Fall+0.918
 Alphabet+0.906
istine+0.854
Neg logit contributions
ients-1.113
 isol-0.999
housing-0.992
 disrupted-0.982
Module-0.976
TokenM
Token ID44
Feature activation+2.40e-01

Pos logit contributions
ients+0.418
 isol+0.375
housing+0.372
Module+0.369
 disrupted+0.367
Neg logit contributions
phabet-0.405
 Ital-0.405
 Fall-0.359
 Alphabet-0.357
istine-0.338
Tokenanners
Token ID15672
Feature activation-3.00e-01

Pos logit contributions
ients+1.446
 isol+1.297
housing+1.289
 disrupted+1.274
Module+1.272
Neg logit contributions
 Ital-1.187
phabet-1.168
 Fall-1.044
 Alphabet-1.027
istine-0.966
Token -
Token ID532
Feature activation-7.52e-01

Pos logit contributions
 Ital+1.756
phabet+1.750
 Fall+1.603
 Alphabet+1.576
istine+1.502
Neg logit contributions
ients-1.516
 isol-1.339
 disrupted-1.319
housing-1.315
Module-1.294
Token accompanied
Token ID11791
Feature activation-1.19e-01

Pos logit contributions
ients+0.016
 isol+0.014
 disrupted+0.014
housing+0.014
Module+0.013
Neg logit contributions
phabet-0.019
 Ital-0.019
 Fall-0.017
 Alphabet-0.017
istine-0.016
Token a
Token ID257
Feature activation-6.54e-02

Pos logit contributions
 Ital+0.666
phabet+0.666
 Fall+0.598
 Alphabet+0.590
istine+0.562
Neg logit contributions
ients-0.632
 isol-0.566
housing-0.558
 disrupted-0.555
Module-0.549
Token report
Token ID989
Feature activation+8.76e-02

Pos logit contributions
phabet+0.378
 Ital+0.376
 Fall+0.338
 Alphabet+0.335
istine+0.321
Neg logit contributions
ients-0.335
 isol-0.301
 disrupted-0.297
housing-0.297
Module-0.292
Token in
Token ID287
Feature activation-3.67e-02

Pos logit contributions
ients+0.478
 isol+0.436
 disrupted+0.425
housing+0.420
 extrad+0.415
Neg logit contributions
phabet-0.483
 Ital-0.476
 Fall-0.423
 Alphabet-0.416
istine-0.401
Token the
Token ID262
Feature activation-3.55e-01

Pos logit contributions
phabet+0.199
 Ital+0.195
 Fall+0.174
 Alphabet+0.173
istine+0.166
Neg logit contributions
ients-0.202
 isol-0.183
 disrupted-0.179
housing-0.179
Module-0.177
Token magazine
Token ID7093
Feature activation-1.27e+00

Pos logit contributions
phabet+1.704
 Ital+1.691
 Fall+1.488
 Alphabet+1.469
istine+1.385
Neg logit contributions
ients-2.174
 isol-1.977
housing-1.954
 disrupted-1.951
Module-1.932
Token that
Token ID326
Feature activation+4.09e-02

Pos logit contributions
 Ital+6.808
phabet+6.521
 Fall+6.203
 Alphabet+6.106
istine+5.526
Neg logit contributions
ients-6.846
housing-6.059
 isol-5.883
Module-5.880
 disrupted-5.827
Token Cameron
Token ID10821
Feature activation+2.07e-01

Pos logit contributions
ients+0.181
 isol+0.161
 disrupted+0.157
housing+0.154
Module+0.152
Neg logit contributions
phabet-0.266
 Ital-0.264
 Fall-0.240
 Alphabet-0.237
istine-0.230
Token had
Token ID550
Feature activation+2.10e-01

Pos logit contributions
ients+1.039
 isol+0.945
 disrupted+0.933
housing+0.908
 appealed+0.907
Neg logit contributions
phabet-1.220
 Ital-1.200
 Fall-1.082
 Alphabet-1.063
istine-1.023
Token told
Token ID1297
Feature activation+1.09e-01

Pos logit contributions
ients+0.948
 isol+0.856
 disrupted+0.853
 appealed+0.821
 extrad+0.814
Neg logit contributions
phabet-1.367
 Ital-1.323
 Fall-1.211
 Alphabet-1.190
istine-1.164
Token Merkel
Token ID18263
Feature activation-3.43e-02

Pos logit contributions
ients+0.576
 isol+0.521
 disrupted+0.511
housing+0.507
Module+0.498
Neg logit contributions
phabet-0.613
 Ital-0.600
 Fall-0.540
 Alphabet-0.534
istine-0.516
Token in
Token ID287
Feature activation-3.14e-01

Pos logit contributions
ients+1.100
 isol+0.981
 disrupted+0.966
housing+0.963
Module+0.944
Neg logit contributions
 Ital-1.225
phabet-1.219
 Fall-1.091
 Alphabet-1.085
atron-1.038
Token Arkansas
Token ID14538
Feature activation+4.25e-02

Pos logit contributions
phabet+1.375
 Ital+1.373
 Fall+1.188
 Alphabet+1.175
istine+1.093
Neg logit contributions
ients-2.064
 isol-1.888
housing-1.870
 disrupted-1.861
Module-1.843
Token.
Token ID13
Feature activation-2.30e-01

Pos logit contributions
ients+0.262
 isol+0.239
housing+0.237
 disrupted+0.235
Module+0.232
Neg logit contributions
 Ital-0.202
phabet-0.201
 Fall-0.175
 Alphabet-0.174
istine-0.163
Token You
Token ID921
Feature activation+1.56e-01

Pos logit contributions
phabet+1.308
 Ital+1.294
 Fall+1.157
 Alphabet+1.150
istine+1.098
Neg logit contributions
ients-1.204
 isol-1.075
housing-1.069
 disrupted-1.057
Module-1.056
Token would
Token ID561
Feature activation-2.30e-01

Pos logit contributions
ients+0.720
 isol+0.645
 disrupted+0.637
housing+0.621
Module+0.611
Neg logit contributions
phabet-0.988
 Ital-0.979
 Alphabet-0.879
 Fall-0.870
istine-0.844
Token think
Token ID892
Feature activation-1.26e+00

Pos logit contributions
 Ital+1.339
phabet+1.337
 Fall+1.204
 Alphabet+1.193
istine+1.134
Neg logit contributions
ients-1.175
 isol-1.045
housing-1.034
 disrupted-1.025
Module-1.015
Token that
Token ID326
Feature activation-7.94e-01

Pos logit contributions
 Ital+6.448
phabet+6.084
 Fall+5.929
 Alphabet+5.722
istine+5.203
Neg logit contributions
ients-7.002
housing-6.272
Module-6.224
 isol-6.142
 disrupted-6.039
Token at
Token ID379
Feature activation+8.64e-02

Pos logit contributions
 Ital+4.334
phabet+4.199
 Fall+3.951
 Alphabet+3.848
istine+3.588
Neg logit contributions
ients-4.290
housing-3.815
 isol-3.762
Module-3.761
 disrupted-3.701
Token some
Token ID617
Feature activation-3.84e-01

Pos logit contributions
ients+0.454
 isol+0.401
housing+0.399
 disrupted+0.393
Module+0.390
Neg logit contributions
phabet-0.495
 Ital-0.493
 Fall-0.436
 Alphabet-0.435
atron-0.416
Token point
Token ID966
Feature activation-3.64e-01

Pos logit contributions
phabet+2.170
 Ital+2.159
 Fall+1.927
 Alphabet+1.913
istine+1.811
Neg logit contributions
ients-2.045
 isol-1.827
housing-1.803
 disrupted-1.796
Module-1.770
Token it
Token ID340
Feature activation-2.84e-01

Pos logit contributions
 Ital+2.034
phabet+2.032
 Fall+1.819
 Alphabet+1.801
istine+1.710
Neg logit contributions
ients-1.957
 isol-1.746
housing-1.727
 disrupted-1.717
Module-1.699
TokenM
Token ID44
Feature activation+2.22e-01

Pos logit contributions
ients+2.420
housing+2.220
 isol+2.185
Module+2.179
 disrupted+2.113
Neg logit contributions
phabet-2.541
 Ital-2.527
 Fall-2.242
 Alphabet-2.217
istine-2.176
Tokenurd
Token ID2799
Feature activation+1.69e-01

Pos logit contributions
ients+1.284
 isol+1.149
housing+1.130
 disrupted+1.114
Module+1.113
Neg logit contributions
 Ital-1.155
phabet-1.126
 Fall-1.003
 Alphabet-1.001
istine-0.929
Tokeners
Token ID364
Feature activation+1.81e-01

Pos logit contributions
ients+0.832
 isol+0.720
housing+0.716
 disrupted+0.710
Module+0.704
Neg logit contributions
phabet-1.026
 Ital-1.022
 Fall-0.925
 Alphabet-0.915
istine-0.878
Token on
Token ID319
Feature activation-9.75e-02

Pos logit contributions
ients+0.915
 isol+0.810
housing+0.803
 disrupted+0.796
Module+0.784
Neg logit contributions
phabet-1.054
 Ital-1.046
 Fall-0.929
 Alphabet-0.927
istine-0.900
Token Al
Token ID978
Feature activation+2.21e-01

Pos logit contributions
phabet+0.521
 Ital+0.519
 Fall+0.463
 Alphabet+0.457
istine+0.437
Neg logit contributions
ients-0.544
 isol-0.487
housing-0.483
 disrupted-0.480
Module-0.476
Tokencat
Token ID9246
Feature activation-1.26e+00

Pos logit contributions
ients+1.413
 isol+1.287
housing+1.267
 disrupted+1.251
Module+1.249
Neg logit contributions
 Ital-0.993
phabet-0.982
 Fall-0.864
 Alphabet-0.855
istine-0.807
Tokenraz
Token ID3247
Feature activation+1.97e-01

Pos logit contributions
 Ital+6.189
phabet+5.971
 Fall+5.680
 Alphabet+5.441
istine+5.166
Neg logit contributions
ients-6.955
housing-6.536
 isol-6.498
Module-6.378
 theorist-6.262
Token.
Token ID13
Feature activation-1.16e-01

Pos logit contributions
ients+0.977
 isol+0.870
housing+0.859
 disrupted+0.844
Module+0.835
Neg logit contributions
phabet-1.165
 Ital-1.157
 Fall-1.038
 Alphabet-1.030
istine-0.997
Token
Token ID447
Feature activation+2.40e-01

Pos logit contributions
phabet+0.669
 Ital+0.665
 Fall+0.599
 Alphabet+0.592
istine+0.568
Neg logit contributions
ients-0.594
 isol-0.528
housing-0.525
 disrupted-0.517
Module-0.515
Token
Token ID251
Feature activation+2.93e-01

Pos logit contributions
ients+1.318
 isol+1.179
housing+1.177
 disrupted+1.151
Module+1.148
Neg logit contributions
phabet-1.286
 Ital-1.283
 Fall-1.150
 Alphabet-1.138
istine-1.099
Token
Token ID198
Feature activation+9.01e-03

Pos logit contributions
ients+1.445
 isol+1.278
housing+1.273
 disrupted+1.241
Module+1.240
Neg logit contributions
phabet-1.744
 Ital-1.705
 Fall-1.541
 Alphabet-1.526
istine-1.498
TokenAS
Token ID1921
Feature activation+6.77e-01

Pos logit contributions
ients+1.679
 isol+1.483
housing+1.470
 disrupted+1.461
"),+1.455
Neg logit contributions
 Ital-1.447
phabet-1.424
 Alphabet-1.285
 Fall-1.261
istine-1.182
Token),
Token ID828
Feature activation-2.26e-01

Pos logit contributions
ients+1.995
 isol+1.627
housing+1.608
 disrupted+1.589
Module+1.573
Neg logit contributions
 Ital-5.338
phabet-5.321
 Alphabet-4.930
 Fall-4.922
istine-4.769
Token Victoria
Token ID12313
Feature activation-1.02e-01

Pos logit contributions
 Ital+0.948
phabet+0.945
 Fall+0.818
 Alphabet+0.805
istine+0.747
Neg logit contributions
ients-1.523
 isol-1.393
housing-1.382
 disrupted-1.373
Module-1.364
Token (
Token ID357
Feature activation-2.35e-01

Pos logit contributions
 Ital+0.526
phabet+0.521
 Fall+0.472
 Alphabet+0.459
istine+0.431
Neg logit contributions
ients-0.590
 isol-0.532
housing-0.530
Module-0.524
 disrupted-0.522
TokenV
Token ID53
Feature activation+2.64e-01

Pos logit contributions
 Ital+1.211
phabet+1.206
 Fall+1.076
 Alphabet+1.062
istine+0.999
Neg logit contributions
ients-1.366
 isol-1.230
housing-1.222
 disrupted-1.209
Module-1.203
TokenIC
Token ID2149
Feature activation+1.17e+00

Pos logit contributions
ients+1.356
 isol+1.185
housing+1.175
Module+1.167
 disrupted+1.162
Neg logit contributions
phabet-1.554
 Ital-1.542
 Alphabet-1.392
 Fall-1.383
istine-1.305
Token)
Token ID8
Feature activation+4.03e-01

Pos logit contributions
ients+2.920
 isol+2.416
'),+2.351
"),+2.328
housing+2.285
Neg logit contributions
 Ital-9.365
phabet-9.338
 Alphabet-8.711
 Fall-8.666
istine-8.494
Token and
Token ID290
Feature activation-2.00e-01

Pos logit contributions
ients+2.138
 isol+1.904
housing+1.888
 disrupted+1.870
Module+1.857
Neg logit contributions
 Ital-2.274
phabet-2.266
 Fall-2.045
 Alphabet-2.020
istine-1.912
Token Western
Token ID4885
Feature activation-5.53e-01

Pos logit contributions
phabet+0.858
 Ital+0.857
 Fall+0.740
 Alphabet+0.731
istine+0.685
Neg logit contributions
ients-1.327
 isol-1.213
housing-1.200
 disrupted-1.196
Module-1.184
Token Australia
Token ID4505
Feature activation+3.95e-01

Pos logit contributions
 Ital+2.402
phabet+2.323
 Fall+2.113
 Alphabet+2.043
atron+1.874
Neg logit contributions
ients-3.612
housing-3.294
 isol-3.281
Module-3.252
 disrupted-3.228
Token (
Token ID357
Feature activation+1.74e-01

Pos logit contributions
ients+2.169
 isol+1.938
housing+1.924
 disrupted+1.904
Module+1.895
Neg logit contributions
 Ital-2.157
phabet-2.148
 Fall-1.932
 Alphabet-1.907
istine-1.800
Token pages
Token ID5468
Feature activation+4.24e-01

Pos logit contributions
ients+3.328
 isol+3.010
 disrupted+2.869
housing+2.860
 terr+2.823
Neg logit contributions
phabet-4.765
 Ital-4.668
 Fall-4.256
 Alphabet-4.180
istine-4.110
Token
Token ID198
Feature activation+2.32e-01

Pos logit contributions
ients+2.164
 isol+1.957
housing+1.909
 disrupted+1.901
Module+1.876
Neg logit contributions
phabet-2.440
 Ital-2.397
 Fall-2.165
 Alphabet-2.134
istine-2.083
Token
Token ID198
Feature activation+6.97e-02

Pos logit contributions
ients+1.110
housing+0.979
 isol+0.977
Module+0.972
 disrupted+0.948
Neg logit contributions
phabet-1.448
 Ital-1.415
 Fall-1.255
 Alphabet-1.252
istine-1.212
TokenNew
Token ID3791
Feature activation-6.10e-02

Pos logit contributions
ients+0.367
housing+0.329
Module+0.329
 isol+0.326
 disrupted+0.320
Neg logit contributions
phabet-0.394
 Ital-0.393
 Fall-0.354
 Alphabet-0.349
istine-0.333
Token challenge
Token ID4427
Feature activation+2.27e-01

Pos logit contributions
phabet+0.371
 Ital+0.367
 Fall+0.333
 Alphabet+0.330
istine+0.318
Neg logit contributions
ients-0.297
 isol-0.264
housing-0.259
 disrupted-0.257
Module-0.253
Token notifications
Token ID19605
Feature activation+9.23e-01

Pos logit contributions
ients+1.107
 isol+0.989
housing+0.968
 disrupted+0.959
Module+0.945
Neg logit contributions
phabet-1.366
 Ital-1.356
 Fall-1.223
 Alphabet-1.207
istine-1.171
Token,
Token ID11
Feature activation+2.31e-01

Pos logit contributions
ients+4.420
 isol+3.924
housing+3.854
 disrupted+3.846
Module+3.762
Neg logit contributions
phabet-5.543
 Ital-5.292
 Fall-4.787
 Alphabet-4.773
istine-4.718
Token now
Token ID783
Feature activation-6.91e-02

Pos logit contributions
ients+1.195
 isol+1.088
 disrupted+1.069
housing+1.053
Module+1.033
Neg logit contributions
phabet-1.329
 Ital-1.297
 Fall-1.165
 Alphabet-1.157
istine-1.122
Token showing
Token ID4478
Feature activation+1.83e-01

Pos logit contributions
phabet+0.413
 Ital+0.409
 Fall+0.368
 Alphabet+0.365
istine+0.350
Neg logit contributions
ients-0.347
 isol-0.309
 disrupted-0.303
housing-0.303
Module-0.296
Token up
Token ID510
Feature activation+9.43e-02

Pos logit contributions
ients+0.933
 isol+0.844
 disrupted+0.829
housing+0.820
Module+0.806
Neg logit contributions
phabet-1.075
 Ital-1.048
 Fall-0.941
 Alphabet-0.933
istine-0.900
Token from
Token ID422
Feature activation+2.10e-01

Pos logit contributions
ients+0.515
 isol+0.459
 disrupted+0.452
housing+0.449
Module+0.441
Neg logit contributions
phabet-0.527
 Ital-0.513
 Fall-0.460
 Alphabet-0.456
istine-0.437
Token[
Token ID58
Feature activation-3.70e-01

Pos logit contributions
phabet+0.586
 Ital+0.586
 Fall+0.525
 Alphabet+0.521
istine+0.500
Neg logit contributions
ients-0.488
housing-0.434
Module-0.434
 isol-0.427
 disrupted-0.418
Tokenyoutube
Token ID11604
Feature activation+1.66e-01

Pos logit contributions
 Ital+1.782
phabet+1.780
 Fall+1.578
 Alphabet+1.550
istine+1.451
Neg logit contributions
ients-2.254
 isol-2.044
housing-2.022
 disrupted-2.013
Module-1.990
Token https
Token ID3740
Feature activation+5.05e-01

Pos logit contributions
ients+0.929
 isol+0.837
housing+0.827
 disrupted+0.823
Module+0.823
Neg logit contributions
phabet-0.888
 Ital-0.884
 Fall-0.782
 Alphabet-0.779
istine-0.742
Token://
Token ID1378
Feature activation+6.16e-01

Pos logit contributions
ients+1.527
Module+1.255
housing+1.253
 isol+1.232
 disrupted+1.179
Neg logit contributions
phabet-3.997
 Ital-3.954
 Alphabet-3.606
 Fall-3.575
istine-3.490
Tokenwww
Token ID2503
Feature activation+8.52e-01

Pos logit contributions
ients+3.816
housing+3.440
 isol+3.409
Module+3.373
 disrupted+3.322
Neg logit contributions
 Ital-2.945
phabet-2.874
 Alphabet-2.529
 Fall-2.527
istine-2.345
Token.
Token ID13
Feature activation+9.22e-01

Pos logit contributions
ients+3.262
housing+2.833
Module+2.828
"),+2.781
'),+2.723
Neg logit contributions
phabet-6.055
 Ital-5.840
 Alphabet-5.210
 Fall-5.148
istine-5.016
Tokenyoutube
Token ID11604
Feature activation+3.89e-01

Pos logit contributions
ients+3.296
housing+2.739
 isol+2.569
Module+2.561
 disrupted+2.491
Neg logit contributions
 Ital-6.809
phabet-6.629
 Fall-6.159
 Alphabet-6.091
istine-5.783
Token.
Token ID13
Feature activation+3.69e-01

Pos logit contributions
ients+1.743
housing+1.513
 isol+1.503
Module+1.499
 disrupted+1.462
Neg logit contributions
phabet-2.531
 Ital-2.501
 Fall-2.240
 Alphabet-2.235
istine-2.145
Tokencom
Token ID785
Feature activation+4.33e-01

Pos logit contributions
ients+2.332
housing+2.148
Module+2.110
"),+2.077
 isol+2.077
Neg logit contributions
phabet-1.685
 Ital-1.681
 Fall-1.413
 Alphabet-1.394
istine-1.335
Token/
Token ID14
Feature activation+4.42e-01

Pos logit contributions
ients+2.220
 isol+1.987
Module+1.981
housing+1.967
 disrupted+1.939
Neg logit contributions
phabet-2.511
 Ital-2.471
 Fall-2.146
 Alphabet-2.124
istine-2.071
Tokenwatch
Token ID8340
Feature activation+5.95e-01

Pos logit contributions
ients+1.281
housing+1.024
 isol+1.002
Module+0.994
 disrupted+0.958
Neg logit contributions
 Ital-3.572
phabet-3.537
 Fall-3.280
 Alphabet-3.262
istine-3.162
Token-
Token ID12
Feature activation+4.74e-01

Pos logit contributions
 Ital+1.354
phabet+1.351
 Fall+1.230
 Alphabet+1.218
istine+1.162
Neg logit contributions
ients-0.988
 isol-0.865
housing-0.855
 disrupted-0.847
Module-0.839
Tokengeneration
Token ID20158
Feature activation+3.12e-01

Pos logit contributions
ients+1.702
housing+1.442
 isol+1.419
Module+1.401
 disrupted+1.372
Neg logit contributions
 Ital-3.489
phabet-3.439
 Fall-3.175
 Alphabet-3.162
istine-3.060
Token Twin
Token ID14968
Feature activation-1.69e-01

Pos logit contributions
ients+1.687
 isol+1.554
 disrupted+1.515
housing+1.508
Module+1.498
Neg logit contributions
phabet-1.735
 Ital-1.674
 Fall-1.485
 Alphabet-1.484
istine-1.437
Token Fro
Token ID9734
Feature activation+6.26e-01

Pos logit contributions
 Ital+0.657
phabet+0.651
 Fall+0.564
 Alphabet+0.550
istine+0.502
Neg logit contributions
ients-1.191
 isol-1.091
housing-1.086
 disrupted-1.077
Module-1.069
Tokenz
Token ID89
Feature activation+1.64e-01

Pos logit contributions
ients+2.964
housing+2.685
Module+2.630
 isol+2.592
 disrupted+2.544
Neg logit contributions
 Ital-3.805
phabet-3.771
 Alphabet-3.362
 Fall-3.308
istine-3.232
Tokenr
Token ID81
Feature activation+9.37e-01

Pos logit contributions
ients+1.023
housing+0.927
Module+0.915
 isol+0.915
 disrupted+0.898
Neg logit contributions
phabet-0.786
 Ital-0.786
 Fall-0.685
 Alphabet-0.677
istine-0.639
Token IV
Token ID8363
Feature activation+4.75e-01

Pos logit contributions
ients+3.676
 isol+3.375
Module+3.234
 disrupted+3.176
housing+3.143
Neg logit contributions
phabet-6.328
 Ital-6.126
 Alphabet-5.629
 Fall-5.563
istine-5.473
Token Thermal
Token ID41590
Feature activation+4.11e-01

Pos logit contributions
ients+2.115
 isol+1.927
Module+1.856
housing+1.854
 disrupted+1.829
Neg logit contributions
phabet-3.061
 Ital-2.958
 Alphabet-2.697
 Fall-2.683
istine-2.640
Token Design
Token ID8495
Feature activation+4.12e-01

Pos logit contributions
ients+1.711
 isol+1.506
housing+1.475
Module+1.457
 disrupted+1.450
Neg logit contributions
phabet-2.764
 Ital-2.751
 Fall-2.500
 Alphabet-2.489
istine-2.411
Token.
Token ID13
Feature activation+4.79e-02

Pos logit contributions
ients+2.109
 isol+1.940
housing+1.898
Module+1.886
 disrupted+1.863
Neg logit contributions
phabet-2.383
 Ital-2.319
 Fall-2.086
 Alphabet-2.083
istine-2.019
Token The
Token ID383
Feature activation+1.63e-01

Pos logit contributions
ients+0.252
 isol+0.229
Module+0.227
housing+0.226
 disrupted+0.223
Neg logit contributions
phabet-0.273
 Ital-0.263
 Fall-0.234
 Alphabet-0.234
istine-0.225
Token eye
Token ID4151
Feature activation+2.89e-01

Pos logit contributions
 Ital+1.380
phabet+1.377
 Fall+1.241
 Alphabet+1.228
istine+1.163
Neg logit contributions
ients-1.266
 isol-1.129
housing-1.114
 disrupted-1.107
Module-1.096
Token-
Token ID12
Feature activation+1.93e-01

Pos logit contributions
ients+1.376
 isol+1.247
 disrupted+1.222
housing+1.203
Module+1.178
Neg logit contributions
phabet-1.798
 Ital-1.775
 Fall-1.600
 Alphabet-1.592
istine-1.519
Tokeninspired
Token ID24194
Feature activation+3.01e-01

Pos logit contributions
ients+1.069
housing+0.964
 isol+0.958
Module+0.938
 disrupted+0.938
Neg logit contributions
 Ital-1.053
phabet-1.043
 Fall-0.931
 Alphabet-0.922
istine-0.870
Token panel
Token ID6103
Feature activation+6.45e-01

Pos logit contributions
ients+1.412
 isol+1.268
 disrupted+1.231
housing+1.225
Module+1.201
Neg logit contributions
phabet-1.903
 Ital-1.864
 Fall-1.677
 Alphabet-1.668
istine-1.619
Token survived
Token ID11803
Feature activation+2.34e-02

Pos logit contributions
ients+2.968
 isol+2.728
 disrupted+2.674
housing+2.606
 appealed+2.510
Neg logit contributions
phabet-4.120
 Ital-4.022
 Fall-3.648
 Alphabet-3.629
istine-3.435
Token those
Token ID883
Feature activation+9.12e-01

Pos logit contributions
ients+0.117
 isol+0.105
 disrupted+0.103
housing+0.102
Module+0.100
Neg logit contributions
phabet-0.141
 Ital-0.139
 Fall-0.125
 Alphabet-0.125
istine-0.119
Token harsh
Token ID11859
Feature activation-2.48e-01

Pos logit contributions
ients+3.722
 isol+3.485
 disrupted+3.299
 terr+3.166
housing+3.080
Neg logit contributions
phabet-6.285
 Ital-5.943
 Alphabet-5.410
istine-5.334
 Fall-5.332
Token conditions
Token ID3403
Feature activation+8.86e-02

Pos logit contributions
phabet+1.568
 Ital+1.561
 Fall+1.414
 Alphabet+1.407
istine+1.345
Neg logit contributions
ients-1.160
 isol-1.023
housing-1.006
 disrupted-0.996
Module-0.978
Token while
Token ID981
Feature activation+2.36e-01

Pos logit contributions
ients+0.465
 isol+0.420
 disrupted+0.411
housing+0.411
Module+0.397
Neg logit contributions
phabet-0.512
 Ital-0.504
 Fall-0.450
 Alphabet-0.447
istine-0.428
Token still
Token ID991
Feature activation+2.58e-01

Pos logit contributions
ients+1.057
 isol+0.961
 disrupted+0.926
housing+0.908
Module+0.891
Neg logit contributions
phabet-1.548
 Ital-1.517
 Fall-1.371
 Alphabet-1.364
istine-1.314
Token generating
Token ID15453
Feature activation+4.05e-01

Pos logit contributions
ients+1.103
 isol+1.017
 disrupted+0.971
housing+0.947
 extrad+0.923
Neg logit contributions
phabet-1.755
 Ital-1.714
 Fall-1.546
 Alphabet-1.544
istine-1.489
Token
Token ID447
Feature activation+1.19e-01

Pos logit contributions
 Ital+0.115
phabet+0.114
 Fall+0.103
 Alphabet+0.102
istine+0.096
Neg logit contributions
ients-0.103
 isol-0.092
housing-0.091
 disrupted-0.091
Module-0.089
Token
Token ID247
Feature activation-2.15e-02

Pos logit contributions
ients+0.732
 isol+0.664
housing+0.663
 disrupted+0.653
Module+0.653
Neg logit contributions
 Ital-0.567
phabet-0.562
 Fall-0.496
 Alphabet-0.492
istine-0.464
Token-
Token ID12
Feature activation-2.28e-01

Pos logit contributions
 Ital+0.112
phabet+0.111
 Fall+0.099
 Alphabet+0.099
istine+0.093
Neg logit contributions
ients-0.122
 isol-0.110
housing-0.110
 disrupted-0.109
Module-0.107
Tokenhour
Token ID9769
Feature activation-5.41e-01

Pos logit contributions
 Ital+1.248
phabet+1.241
 Fall+1.115
 Alphabet+1.103
istine+1.041
Neg logit contributions
ients-1.251
 isol-1.118
housing-1.112
 disrupted-1.098
Module-1.092
Token na
Token ID12385
Feature activation+4.00e-01

Pos logit contributions
 Ital+3.138
phabet+3.126
 Fall+2.846
 Alphabet+2.794
istine+2.654
Neg logit contributions
ients-2.763
 isol-2.442
housing-2.423
Module-2.392
 disrupted-2.390
Tokenï
Token ID26884
Feature activation+2.18e-01

Pos logit contributions
ients+2.674
 isol+2.450
housing+2.414
 disrupted+2.402
Module+2.395
Neg logit contributions
 Ital-1.682
phabet-1.657
 Alphabet-1.414
 Fall-1.412
atron-1.283
Tokenvet
Token ID16809
Feature activation-3.70e-02

Pos logit contributions
ients+1.508
 isol+1.378
housing+1.368
 disrupted+1.348
Module+1.340
Neg logit contributions
phabet-0.871
 Ital-0.862
 Alphabet-0.740
 Fall-0.732
istine-0.666
Tokené
Token ID2634
Feature activation+4.33e-02

Pos logit contributions
phabet+0.182
 Ital+0.178
 Fall+0.157
 Alphabet+0.156
istine+0.147
Neg logit contributions
ients-0.226
 isol-0.205
housing-0.202
 disrupted-0.201
Module-0.199
Token about
Token ID546
Feature activation+1.74e-03

Pos logit contributions
ients+0.218
 isol+0.195
housing+0.192
 disrupted+0.191
Module+0.187
Neg logit contributions
phabet-0.255
 Ital-0.254
 Fall-0.228
 Alphabet-0.227
istine-0.215
Token natural
Token ID3288
Feature activation+2.28e-02

Pos logit contributions
ients+0.009
 isol+0.008
housing+0.008
 disrupted+0.008
Module+0.008
Neg logit contributions
phabet-0.010
 Ital-0.010
 Fall-0.009
 Alphabet-0.009
istine-0.008
Token phenomena
Token ID19428
Feature activation-4.43e-01

Pos logit contributions
ients+0.117
 isol+0.104
 disrupted+0.102
housing+0.102
Module+0.098
Neg logit contributions
phabet-0.134
 Ital-0.133
 Alphabet-0.119
 Fall-0.118
istine-0.113
Token all
Token ID477
Feature activation+1.66e-01

Pos logit contributions
 Ital+0.783
phabet+0.783
 Fall+0.706
 Alphabet+0.698
istine+0.663
Neg logit contributions
ients-0.692
 isol-0.615
housing-0.607
 disrupted-0.602
Module-0.596
Token women
Token ID1466
Feature activation+1.89e-01

Pos logit contributions
ients+0.840
 isol+0.749
 disrupted+0.728
housing+0.728
Module+0.708
Neg logit contributions
phabet-0.982
 Ital-0.975
 Fall-0.878
 Alphabet-0.873
atron-0.832
Token was
Token ID373
Feature activation+1.32e-01

Pos logit contributions
ients+0.942
 isol+0.846
housing+0.825
 disrupted+0.822
Module+0.793
Neg logit contributions
phabet-1.137
 Ital-1.123
 Alphabet-1.010
 Fall-1.007
atron-0.957
Token the
Token ID262
Feature activation+2.72e-01

Pos logit contributions
ients+0.675
 isol+0.620
 disrupted+0.610
housing+0.595
Module+0.581
Neg logit contributions
phabet-0.779
 Ital-0.753
 Alphabet-0.677
 Fall-0.674
atron-0.650
Token vaginal
Token ID30131
Feature activation+4.29e-01

Pos logit contributions
ients+1.420
 isol+1.316
 disrupted+1.285
housing+1.247
 extrad+1.238
Neg logit contributions
phabet-1.554
 Ital-1.498
 Alphabet-1.342
 Fall-1.318
atron-1.296
Token microbiome
Token ID48465
Feature activation+6.53e-01

Pos logit contributions
ients+2.178
 isol+1.990
 disrupted+1.900
housing+1.858
Module+1.840
Neg logit contributions
phabet-2.520
 Ital-2.470
 Alphabet-2.221
 Fall-2.205
atron-2.128
Token.
Token ID13
Feature activation+1.11e-01

Pos logit contributions
ients+3.074
 isol+2.865
 disrupted+2.766
housing+2.728
Module+2.646
Neg logit contributions
phabet-3.953
 Ital-3.892
 Alphabet-3.479
 Fall-3.477
atron-3.351
Token
Token ID198
Feature activation+6.17e-02

Pos logit contributions
ients+0.562
 isol+0.510
housing+0.501
Module+0.500
 disrupted+0.496
Neg logit contributions
phabet-0.653
 Ital-0.620
 Alphabet-0.552
 Fall-0.552
atron-0.540
Token
Token ID198
Feature activation+1.12e-01

Pos logit contributions
ients+0.295
Module+0.263
housing+0.262
 isol+0.260
 disrupted+0.253
Neg logit contributions
phabet-0.383
 Ital-0.376
 Fall-0.334
 Alphabet-0.334
istine-0.321
TokenSur
Token ID14214
Feature activation-1.20e-01

Pos logit contributions
ients+0.555
housing+0.495
Module+0.495
 isol+0.488
 disrupted+0.476
Neg logit contributions
 Ital-0.675
phabet-0.674
 Fall-0.606
 Alphabet-0.603
istine-0.571
Tokenpri
Token ID3448
Feature activation+3.89e-02

Pos logit contributions
 Ital+0.682
phabet+0.679
 Fall+0.613
 Alphabet+0.607
istine+0.574
Neg logit contributions
ients-0.630
 isol-0.558
housing-0.553
 disrupted-0.546
Module-0.544
Token a
Token ID257
Feature activation+1.59e-01

Pos logit contributions
phabet+1.274
 Ital+1.274
 Fall+1.146
 Alphabet+1.133
istine+1.080
Neg logit contributions
ients-1.153
 isol-1.029
housing-1.013
 disrupted-1.007
Module-0.994
Token song
Token ID3496
Feature activation-3.33e-02

Pos logit contributions
ients+0.705
 isol+0.628
 disrupted+0.617
housing+0.608
 theorist+0.590
Neg logit contributions
phabet-1.027
 Ital-1.023
 Fall-0.924
 Alphabet-0.920
istine-0.895
Token for
Token ID329
Feature activation-5.10e-02

Pos logit contributions
 Ital+0.190
phabet+0.190
 Fall+0.171
 Alphabet+0.169
istine+0.162
Neg logit contributions
ients-0.174
 isol-0.156
housing-0.153
 disrupted-0.152
Module-0.149
Token a
Token ID257
Feature activation+1.67e-01

Pos logit contributions
phabet+0.275
 Ital+0.273
 Fall+0.243
 Alphabet+0.241
istine+0.230
Neg logit contributions
ients-0.284
 isol-0.256
housing-0.252
 disrupted-0.250
Module-0.246
Token book
Token ID1492
Feature activation-1.66e-02

Pos logit contributions
ients+0.760
 isol+0.669
 disrupted+0.663
housing+0.653
 theorist+0.634
Neg logit contributions
phabet-1.066
 Ital-1.059
 Fall-0.952
 Alphabet-0.951
istine-0.915
Token,
Token ID11
Feature activation+4.06e-01

Pos logit contributions
phabet+0.097
 Ital+0.096
 Fall+0.086
 Alphabet+0.086
istine+0.082
Neg logit contributions
ients-0.085
 isol-0.076
housing-0.075
 disrupted-0.074
Module-0.073
Token because
Token ID780
Feature activation-4.16e-02

Pos logit contributions
ients+2.065
 isol+1.877
 disrupted+1.819
housing+1.809
 terr+1.776
Neg logit contributions
phabet-2.334
 Ital-2.295
 Fall-2.055
 Alphabet-2.041
atron-1.969
Token you
Token ID345
Feature activation-4.72e-01

Pos logit contributions
phabet+0.237
 Ital+0.236
 Fall+0.210
 Alphabet+0.209
istine+0.199
Neg logit contributions
ients-0.219
 isol-0.197
housing-0.192
 disrupted-0.192
Module-0.188
Token
Token ID447
Feature activation+2.75e-02

Pos logit contributions
 Ital+2.906
phabet+2.895
 Fall+2.647
 Alphabet+2.610
istine+2.488
Neg logit contributions
ients-2.248
 isol-1.967
housing-1.958
 disrupted-1.926
Module-1.922
Token
Token ID247
Feature activation-1.55e-01

Pos logit contributions
ients+0.187
 isol+0.171
housing+0.170
Module+0.168
 disrupted+0.166
Neg logit contributions
 Ital-0.115
phabet-0.114
 Fall-0.098
 Alphabet-0.097
istine-0.091
Tokenre
Token ID260
Feature activation-5.58e-01

Pos logit contributions
 Ital+0.757
phabet+0.755
 Fall+0.665
 Alphabet+0.661
istine+0.617
Neg logit contributions
ients-0.941
 isol-0.851
housing-0.845
 disrupted-0.837
Module-0.831
Token NC
Token ID8823
Feature activation-6.17e-02

Pos logit contributions
 Ital+2.554
phabet+2.537
 Fall+2.296
 Alphabet+2.270
istine+2.135
Neg logit contributions
ients-2.477
 isol-2.205
housing-2.190
 disrupted-2.166
Module-2.153
TokenQ
Token ID48
Feature activation-2.06e-01

Pos logit contributions
phabet+0.360
 Ital+0.360
 Fall+0.323
 Alphabet+0.321
istine+0.308
Neg logit contributions
ients-0.315
 isol-0.277
housing-0.274
 disrupted-0.272
Module-0.272
TokenA
Token ID32
Feature activation-1.13e-01

Pos logit contributions
 Ital+1.109
phabet+1.109
 Fall+0.985
 Alphabet+0.982
istine+0.930
Neg logit contributions
ients-1.149
 isol-1.033
housing-1.018
 disrupted-1.014
Module-1.003
Token to
Token ID284
Feature activation-1.34e-01

Pos logit contributions
phabet+0.586
 Ital+0.585
 Fall+0.518
 Alphabet+0.515
istine+0.487
Neg logit contributions
ients-0.648
 isol-0.584
 disrupted-0.575
housing-0.575
Module-0.568
Token demand
Token ID3512
Feature activation+1.17e-01

Pos logit contributions
phabet+0.782
 Ital+0.782
 Fall+0.703
 Alphabet+0.697
istine+0.663
Neg logit contributions
ients-0.688
 isol-0.612
housing-0.604
 disrupted-0.600
Module-0.594
Token strict
Token ID7646
Feature activation+3.01e-01

Pos logit contributions
ients+0.599
 isol+0.539
 disrupted+0.527
housing+0.526
Module+0.516
Neg logit contributions
phabet-0.683
 Ital-0.676
 Fall-0.604
 Alphabet-0.600
istine-0.575
Token control
Token ID1630
Feature activation-1.88e-02

Pos logit contributions
ients+1.438
 isol+1.295
housing+1.255
 disrupted+1.242
 terr+1.211
Neg logit contributions
 Ital-1.843
phabet-1.842
 Fall-1.638
 Alphabet-1.636
istine-1.582
Token of
Token ID286
Feature activation+1.59e-01

Pos logit contributions
 Ital+0.118
phabet+0.118
 Fall+0.107
 Alphabet+0.106
istine+0.101
Neg logit contributions
ients-0.089
 isol-0.078
housing-0.077
 disrupted-0.076
Module-0.075
Token blood
Token ID2910
Feature activation-5.20e-02

Pos logit contributions
ients+0.803
 isol+0.725
 disrupted+0.708
housing+0.702
Module+0.685
Neg logit contributions
phabet-0.950
 Ital-0.935
 Fall-0.834
 Alphabet-0.831
istine-0.795
Token sugar
Token ID7543
Feature activation-4.00e-01

Pos logit contributions
phabet+0.308
 Ital+0.307
 Alphabet+0.275
 Fall+0.273
istine+0.260
Neg logit contributions
ients-0.265
 isol-0.237
housing-0.231
 disrupted-0.229
Module-0.225
Token in
Token ID287
Feature activation+2.23e-01

Pos logit contributions
 Ital+2.478
phabet+2.467
 Fall+2.256
 Alphabet+2.228
istine+2.118
Neg logit contributions
ients-1.896
 isol-1.660
housing-1.652
 disrupted-1.628
Module-1.618
Token others
Token ID1854
Feature activation-2.56e-02

Pos logit contributions
phabet+0.072
 Ital+0.071
 Fall+0.064
 Alphabet+0.063
istine+0.061
Neg logit contributions
ients-0.065
 isol-0.059
 disrupted-0.058
housing-0.058
Module-0.056
Token was
Token ID373
Feature activation+2.39e-01

Pos logit contributions
phabet+0.152
 Ital+0.152
 Fall+0.136
 Alphabet+0.136
istine+0.129
Neg logit contributions
ients-0.129
 isol-0.115
 disrupted-0.114
housing-0.113
Module-0.110
Token limited
Token ID3614
Feature activation+2.02e-01

Pos logit contributions
ients+1.147
 isol+1.079
 disrupted+1.071
housing+1.010
 appealed+1.002
Neg logit contributions
phabet-1.482
 Ital-1.440
 Fall-1.285
 Alphabet-1.278
istine-1.247
Token.
Token ID13
Feature activation+1.45e-01

Pos logit contributions
ients+1.072
 isol+0.974
housing+0.951
 disrupted+0.950
 extrad+0.921
Neg logit contributions
phabet-1.134
 Ital-1.123
 Alphabet-0.997
 Fall-0.996
istine-0.946
Token But
Token ID887
Feature activation+1.09e-01

Pos logit contributions
ients+0.714
 isol+0.648
housing+0.645
Module+0.636
 disrupted+0.630
Neg logit contributions
phabet-0.861
 Ital-0.820
 Fall-0.730
 Alphabet-0.729
istine-0.722
Token the
Token ID262
Feature activation+2.68e-01

Pos logit contributions
ients+0.554
 isol+0.503
 disrupted+0.490
housing+0.489
Module+0.476
Neg logit contributions
phabet-0.633
 Ital-0.627
 Fall-0.560
 Alphabet-0.557
istine-0.538
Token stories
Token ID3923
Feature activation+2.91e-01

Pos logit contributions
ients+1.375
 isol+1.281
 disrupted+1.237
housing+1.221
 theorist+1.194
Neg logit contributions
phabet-1.539
 Ital-1.519
 Fall-1.332
 Alphabet-1.326
istine-1.311
Token of
Token ID286
Feature activation+1.25e-01

Pos logit contributions
ients+1.398
 isol+1.254
 disrupted+1.238
housing+1.217
 appealed+1.191
Neg logit contributions
phabet-1.788
 Ital-1.766
 Fall-1.590
 Alphabet-1.582
istine-1.530
Token marginalized
Token ID31670
Feature activation+2.67e-01

Pos logit contributions
ients+0.586
 isol+0.528
 disrupted+0.515
housing+0.510
Module+0.495
Neg logit contributions
phabet-0.779
 Ital-0.765
 Fall-0.692
 Alphabet-0.688
istine-0.670
Token and
Token ID290
Feature activation+2.54e-01

Pos logit contributions
ients+1.286
 isol+1.149
 disrupted+1.111
housing+1.103
 theorist+1.080
Neg logit contributions
phabet-1.654
 Ital-1.608
 Fall-1.457
 Alphabet-1.452
istine-1.414
Token oppressed
Token ID25765
Feature activation+3.97e-01

Pos logit contributions
ients+1.105
 isol+1.006
 disrupted+0.993
housing+0.953
 theorist+0.928
Neg logit contributions
phabet-1.687
 Ital-1.643
 Fall-1.497
 Alphabet-1.495
istine-1.451
Token response
Token ID2882
Feature activation+2.27e-01

Pos logit contributions
phabet+0.121
 Ital+0.120
 Fall+0.107
 Alphabet+0.106
istine+0.102
Neg logit contributions
ients-0.102
 isol-0.092
 disrupted-0.090
housing-0.090
Module-0.088
Token,
Token ID11
Feature activation+2.00e-01

Pos logit contributions
ients+1.253
 isol+1.136
 disrupted+1.116
housing+1.104
Module+1.084
Neg logit contributions
phabet-1.234
 Ital-1.215
 Fall-1.076
 Alphabet-1.075
istine-1.024
Token I
Token ID314
Feature activation+9.26e-03

Pos logit contributions
ients+1.029
 isol+0.923
 disrupted+0.896
housing+0.895
Module+0.878
Neg logit contributions
phabet-1.162
 Ital-1.154
 Fall-1.028
 Alphabet-1.024
atron-0.987
Token still
Token ID991
Feature activation+6.83e-02

Pos logit contributions
ients+0.042
 isol+0.037
 disrupted+0.037
housing+0.036
Module+0.036
Neg logit contributions
phabet-0.059
 Ital-0.059
 Alphabet-0.053
 Fall-0.053
istine-0.051
Token wasn
Token ID2492
Feature activation+1.98e-01

Pos logit contributions
ients+0.313
 isol+0.277
 disrupted+0.273
housing+0.269
Module+0.264
Neg logit contributions
phabet-0.435
 Ital-0.434
 Alphabet-0.390
 Fall-0.389
istine-0.373
Token
Token ID447
Feature activation-2.38e-01

Pos logit contributions
ients+0.840
housing+0.729
Module+0.725
 isol+0.714
 disrupted+0.697
Neg logit contributions
phabet-1.343
 Ital-1.313
 Alphabet-1.183
 Fall-1.181
istine-1.133
Token
Token ID247
Feature activation-2.08e-01

Pos logit contributions
 Ital+1.189
phabet+1.182
 Fall+1.049
 Alphabet+1.037
istine+0.975
Neg logit contributions
ients-1.420
 isol-1.278
housing-1.270
 disrupted-1.257
Module-1.254
Tokent
Token ID83
Feature activation+8.93e-02

Pos logit contributions
 Ital+1.042
phabet+1.042
 Fall+0.918
 Alphabet+0.912
istine+0.852
Neg logit contributions
ients-1.250
 isol-1.126
housing-1.120
 disrupted-1.107
Module-1.091
Token fully
Token ID3938
Feature activation+2.24e-01

Pos logit contributions
ients+0.434
 isol+0.386
 disrupted+0.381
housing+0.376
 extrad+0.367
Neg logit contributions
phabet-0.550
 Ital-0.544
 Alphabet-0.488
 Fall-0.487
istine-0.466
Token sold
Token ID2702
Feature activation+5.62e-02

Pos logit contributions
ients+0.748
 isol+0.652
 disrupted+0.634
housing+0.614
 extrad+0.602
Neg logit contributions
phabet-1.696
 Ital-1.690
 Fall-1.541
 Alphabet-1.533
istine-1.495
Token.
Token ID13
Feature activation+2.79e-01

Pos logit contributions
ients+0.323
 isol+0.290
housing+0.288
 disrupted+0.284
Module+0.281
Neg logit contributions
 Ital-0.293
phabet-0.292
 Fall-0.260
 Alphabet-0.256
istine-0.241
Token,
Token ID11
Feature activation+1.73e-02

Pos logit contributions
 Ital+1.630
phabet+1.620
 Fall+1.463
 Alphabet+1.442
istine+1.356
Neg logit contributions
ients-1.654
 isol-1.474
housing-1.464
 disrupted-1.448
Module-1.443
Token these
Token ID777
Feature activation+2.37e-01

Pos logit contributions
ients+0.097
 isol+0.087
 disrupted+0.086
housing+0.086
Module+0.084
Neg logit contributions
phabet-0.092
 Ital-0.092
 Fall-0.082
 Alphabet-0.080
istine-0.077
Token are
Token ID389
Feature activation+2.16e-01

Pos logit contributions
ients+1.112
 isol+0.979
 disrupted+0.963
housing+0.946
 extrad+0.932
Neg logit contributions
phabet-1.482
 Ital-1.471
 Fall-1.324
 Alphabet-1.316
atron-1.268
Token the
Token ID262
Feature activation+6.21e-02

Pos logit contributions
ients+1.096
 isol+0.994
 disrupted+0.968
housing+0.948
 extrad+0.939
Neg logit contributions
phabet-1.280
 Ital-1.259
 Fall-1.113
 Alphabet-1.108
atron-1.077
Token experts
Token ID6154
Feature activation+1.83e-01

Pos logit contributions
ients+0.291
 isol+0.257
 disrupted+0.251
housing+0.249
Module+0.244
Neg logit contributions
phabet-0.391
 Ital-0.389
 Fall-0.350
 Alphabet-0.347
atron-0.335
Token speaking
Token ID5486
Feature activation-4.10e-01

Pos logit contributions
ients+0.946
 isol+0.846
housing+0.826
 disrupted+0.826
Module+0.805
Neg logit contributions
phabet-1.058
 Ital-1.050
 Fall-0.940
 Alphabet-0.934
atron-0.893
Token out
Token ID503
Feature activation+8.72e-02

Pos logit contributions
 Ital+2.185
phabet+2.150
 Fall+1.958
 Alphabet+1.927
istine+1.812
Neg logit contributions
ients-2.291
housing-2.050
 isol-2.047
Module-2.022
 disrupted-2.016
Token on
Token ID319
Feature activation-2.32e-01

Pos logit contributions
ients+0.459
 isol+0.414
housing+0.406
 disrupted+0.405
Module+0.393
Neg logit contributions
phabet-0.499
 Ital-0.494
 Fall-0.443
 Alphabet-0.438
istine-0.416
Token this
Token ID428
Feature activation+6.85e-02

Pos logit contributions
 Ital+1.245
phabet+1.239
 Fall+1.113
 Alphabet+1.099
istine+1.037
Neg logit contributions
ients-1.295
 isol-1.159
housing-1.151
 disrupted-1.140
Module-1.133
Token,'
Token ID4032
Feature activation-1.50e-01

Pos logit contributions
ients+0.341
 isol+0.306
 disrupted+0.302
housing+0.301
Module+0.295
Neg logit contributions
phabet-0.409
 Ital-0.405
 Fall-0.363
 Alphabet-0.361
istine-0.345
Token M
Token ID337
Feature activation+4.31e-01

Pos logit contributions
 Ital+0.850
phabet+0.849
 Fall+0.763
 Alphabet+0.754
istine+0.718
Neg logit contributions
ients-0.784
 isol-0.700
housing-0.691
 disrupted-0.688
Module-0.680
Token.
Token ID13
Feature activation+5.78e-02

Pos logit contributions
ients+0.652
 isol+0.586
housing+0.575
 disrupted+0.569
Module+0.557
Neg logit contributions
phabet-0.782
 Ital-0.780
 Fall-0.703
 Alphabet-0.697
istine-0.665
Token
Token ID447
Feature activation+5.16e-02

Pos logit contributions
ients+0.245
 isol+0.213
housing+0.212
 disrupted+0.207
Module+0.207
Neg logit contributions
phabet-0.385
 Ital-0.381
 Fall-0.347
 Alphabet-0.345
istine-0.333
Token
Token ID251
Feature activation+1.42e-01

Pos logit contributions
ients+0.304
 isol+0.274
housing+0.272
 disrupted+0.270
Module+0.268
Neg logit contributions
 Ital-0.261
phabet-0.260
 Fall-0.232
 Alphabet-0.228
istine-0.214
Token
Token ID198
Feature activation-6.38e-02

Pos logit contributions
ients+0.733
 isol+0.652
housing+0.647
 disrupted+0.640
Module+0.634
Neg logit contributions
 Ital-0.815
phabet-0.814
 Fall-0.733
 Alphabet-0.726
istine-0.690
Token
Token ID198
Feature activation-7.77e-02

Pos logit contributions
phabet+0.371
 Ital+0.370
 Fall+0.332
 Alphabet+0.329
istine+0.313
Neg logit contributions
ients-0.328
 isol-0.291
housing-0.289
Module-0.285
 disrupted-0.285
TokenBut
Token ID1537
Feature activation+1.34e-01

Pos logit contributions
 Ital+0.467
phabet+0.465
 Fall+0.423
 Alphabet+0.419
istine+0.397
Neg logit contributions
ients-0.384
 isol-0.337
housing-0.337
Module-0.332
 disrupted-0.331
Token sick
Token ID6639
Feature activation+2.99e-01

Pos logit contributions
ients+0.721
 isol+0.642
housing+0.633
 disrupted+0.630
Module+0.621
Neg logit contributions
phabet-0.753
 Ital-0.748
 Fall-0.670
 Alphabet-0.665
istine-0.636
Token leave
Token ID2666
Feature activation+5.90e-01

Pos logit contributions
ients+1.485
 isol+1.317
housing+1.299
 disrupted+1.281
Module+1.250
Neg logit contributions
phabet-1.787
 Ital-1.786
 Fall-1.608
 Alphabet-1.599
istine-1.533
Token isn
Token ID2125
Feature activation+5.06e-01

Pos logit contributions
ients+2.774
 isol+2.498
 disrupted+2.478
housing+2.413
 appealed+2.360
Neg logit contributions
phabet-3.624
 Ital-3.623
 Alphabet-3.223
 Fall-3.219
istine-3.127
Token
Token ID447
Feature activation+1.56e-01

Pos logit contributions
ients+1.913
housing+1.673
Module+1.622
 isol+1.592
'),+1.546
Neg logit contributions
phabet-3.646
 Ital-3.544
 Alphabet-3.197
 Fall-3.185
istine-3.072
Token
Token ID247
Feature activation+2.91e-01

Pos logit contributions
ients+0.965
 isol+0.871
housing+0.867
Module+0.859
 disrupted+0.854
Neg logit contributions
 Ital-0.737
phabet-0.735
 Fall-0.644
 Alphabet-0.639
istine-0.609
Token arguments
Token ID7159
Feature activation+2.39e-01

Pos logit contributions
ients+0.990
 isol+0.853
 disrupted+0.823
housing+0.822
 theorist+0.812
Neg logit contributions
phabet-1.675
 Ital-1.654
 Alphabet-1.509
 Fall-1.497
atron-1.460
Token throughout
Token ID3690
Feature activation-3.31e-02

Pos logit contributions
ients+1.172
 isol+1.056
 disrupted+1.029
housing+1.017
 extrad+0.997
Neg logit contributions
phabet-1.447
 Ital-1.420
 Alphabet-1.284
 Fall-1.276
atron-1.220
Token this
Token ID428
Feature activation-1.30e-01

Pos logit contributions
 Ital+0.187
phabet+0.187
 Fall+0.168
 Alphabet+0.166
istine+0.158
Neg logit contributions
ients-0.175
 isol-0.156
housing-0.155
 disrupted-0.153
Module-0.152
Token case
Token ID1339
Feature activation-5.75e-02

Pos logit contributions
phabet+0.822
 Ital+0.819
 Fall+0.740
 Alphabet+0.737
istine+0.708
Neg logit contributions
ients-0.603
 isol-0.533
housing-0.524
 disrupted-0.521
Module-0.515
Token.
Token ID13
Feature activation+1.24e-01

Pos logit contributions
 Ital+0.345
phabet+0.344
 Fall+0.312
 Alphabet+0.309
istine+0.293
Neg logit contributions
ients-0.285
 isol-0.251
housing-0.249
 disrupted-0.246
Module-0.244
Token The
Token ID383
Feature activation+3.06e-02

Pos logit contributions
ients+0.601
Module+0.538
housing+0.533
 isol+0.531
 disrupted+0.519
Neg logit contributions
phabet-0.754
 Ital-0.714
 Alphabet-0.639
 Fall-0.637
istine-0.627
Token smart
Token ID4451
Feature activation+8.60e-03

Pos logit contributions
ients+0.151
 isol+0.134
 disrupted+0.131
housing+0.131
Module+0.128
Neg logit contributions
phabet-0.186
 Ital-0.182
 Alphabet-0.163
 Fall-0.163
istine-0.157
Token money
Token ID1637
Feature activation-2.07e-01

Pos logit contributions
ients+0.047
 isol+0.042
housing+0.042
 disrupted+0.041
Module+0.041
Neg logit contributions
phabet-0.048
 Ital-0.047
 Fall-0.042
 Alphabet-0.042
istine-0.040
Token would
Token ID561
Feature activation-4.30e-01

Pos logit contributions
 Ital+1.134
phabet+1.133
 Fall+1.013
 Alphabet+1.003
istine+0.947
Neg logit contributions
ients-1.136
 isol-1.018
housing-1.007
 disrupted-0.998
Module-0.988
Token still
Token ID991
Feature activation-4.50e-01

Pos logit contributions
 Ital+2.583
phabet+2.571
 Fall+2.350
 Alphabet+2.316
istine+2.203
Neg logit contributions
ients-2.109
 isol-1.855
housing-1.842
 disrupted-1.820
Module-1.809
Token say
Token ID910
Feature activation-2.27e-01

Pos logit contributions
 Ital+2.798
phabet+2.789
 Fall+2.549
 Alphabet+2.517
istine+2.400
Neg logit contributions
ients-2.124
 isol-1.860
housing-1.846
 disrupted-1.823
Module-1.811
Token raises
Token ID12073
Feature activation+2.09e-01

Pos logit contributions
phabet+0.668
 Ital+0.664
 Fall+0.593
 Alphabet+0.590
istine+0.568
Neg logit contributions
ients-0.555
 isol-0.507
 disrupted-0.497
housing-0.493
Module-0.481
Token a
Token ID257
Feature activation+1.34e-01

Pos logit contributions
ients+1.079
 isol+0.975
housing+0.952
 disrupted+0.951
Module+0.933
Neg logit contributions
phabet-1.193
 Ital-1.189
 Fall-1.061
 Alphabet-1.052
istine-1.010
Token question
Token ID1808
Feature activation+1.88e-01

Pos logit contributions
ients+0.692
 isol+0.621
 disrupted+0.614
housing+0.612
Module+0.599
Neg logit contributions
 Ital-0.773
phabet-0.772
 Fall-0.688
 Alphabet-0.685
istine-0.655
Token,
Token ID11
Feature activation-1.19e-01

Pos logit contributions
ients+0.952
 isol+0.849
housing+0.840
 disrupted+0.830
Module+0.815
Neg logit contributions
phabet-1.102
 Ital-1.101
 Fall-0.990
 Alphabet-0.983
istine-0.934
Token yet
Token ID1865
Feature activation+1.13e-01

Pos logit contributions
 Ital+0.664
phabet+0.662
 Fall+0.596
 Alphabet+0.589
istine+0.558
Neg logit contributions
ients-0.640
 isol-0.571
housing-0.566
 disrupted-0.561
Module-0.556
Token again
Token ID757
Feature activation-8.67e-02

Pos logit contributions
ients+0.568
 isol+0.509
housing+0.500
 disrupted+0.497
Module+0.490
Neg logit contributions
phabet-0.667
 Ital-0.663
 Fall-0.595
 Alphabet-0.591
istine-0.564
Token,
Token ID11
Feature activation-1.89e-01

Pos logit contributions
 Ital+0.418
phabet+0.414
 Fall+0.369
 Alphabet+0.361
istine+0.338
Neg logit contributions
ients-0.532
housing-0.478
 isol-0.477
Module-0.474
 disrupted-0.472
Token about
Token ID546
Feature activation-5.06e-01

Pos logit contributions
 Ital+1.065
phabet+1.061
 Fall+0.959
 Alphabet+0.946
istine+0.896
Neg logit contributions
ients-1.007
 isol-0.896
housing-0.889
 disrupted-0.880
Module-0.876
Token the
Token ID262
Feature activation-4.86e-01

Pos logit contributions
 Ital+2.754
phabet+2.725
 Fall+2.487
 Alphabet+2.447
istine+2.296
Neg logit contributions
ients-2.780
housing-2.454
 isol-2.453
Module-2.422
 disrupted-2.420
Token supposed
Token ID4385
Feature activation-6.71e-01

Pos logit contributions
 Ital+2.714
phabet+2.702
 Fall+2.442
 Alphabet+2.411
istine+2.280
Neg logit contributions
ients-2.597
 isol-2.311
housing-2.295
 disrupted-2.270
Module-2.257
Token socialism
Token ID19803
Feature activation-6.51e-01

Pos logit contributions
 Ital+3.857
phabet+3.818
 Fall+3.505
 Alphabet+3.443
istine+3.257
Neg logit contributions
ients-3.466
housing-3.051
 isol-3.041
Module-3.002
 disrupted-2.992
Token go
Token ID467
Feature activation-9.59e-02

Pos logit contributions
ients+0.166
 isol+0.148
 disrupted+0.146
housing+0.144
Module+0.141
Neg logit contributions
phabet-0.198
 Ital-0.197
 Fall-0.176
 Alphabet-0.176
istine-0.169
Token for
Token ID329
Feature activation+1.44e-01

Pos logit contributions
 Ital+0.522
phabet+0.521
 Fall+0.466
 Alphabet+0.462
istine+0.438
Neg logit contributions
ients-0.528
 isol-0.474
housing-0.468
 disrupted-0.464
Module-0.458
Token it
Token ID340
Feature activation-9.49e-02

Pos logit contributions
ients+0.739
 isol+0.662
housing+0.646
 disrupted+0.643
Module+0.631
Neg logit contributions
 Ital-0.834
phabet-0.831
 Alphabet-0.743
 Fall-0.743
istine-0.713
Token;
Token ID26
Feature activation+4.45e-01

Pos logit contributions
phabet+0.552
 Ital+0.551
 Fall+0.496
 Alphabet+0.492
istine+0.467
Neg logit contributions
ients-0.487
 isol-0.433
housing-0.428
 disrupted-0.423
Module-0.418
Token one
Token ID530
Feature activation-7.49e-01

Pos logit contributions
ients+2.178
 isol+1.957
 disrupted+1.914
housing+1.889
 appealed+1.832
Neg logit contributions
phabet-2.673
 Ital-2.634
 Alphabet-2.375
 Fall-2.359
istine-2.309
Token of
Token ID286
Feature activation-6.44e-01

Pos logit contributions
 Ital+4.214
phabet+4.128
 Fall+3.848
 Alphabet+3.750
istine+3.522
Neg logit contributions
ients-3.910
housing-3.496
Module-3.439
 isol-3.433
 disrupted-3.389
Token my
Token ID616
Feature activation-2.86e-01

Pos logit contributions
 Ital+3.283
phabet+3.250
 Fall+2.979
 Alphabet+2.915
atron+2.655
Neg logit contributions
ients-3.719
housing-3.324
 isol-3.292
Module-3.279
 disrupted-3.277
Token biggest
Token ID4094
Feature activation-3.55e-01

Pos logit contributions
 Ital+1.599
phabet+1.596
 Fall+1.433
 Alphabet+1.417
istine+1.346
Neg logit contributions
ients-1.532
 isol-1.369
housing-1.354
 disrupted-1.342
Module-1.331
Token coaches
Token ID11070
Feature activation+5.07e-01

Pos logit contributions
 Ital+2.161
phabet+2.142
 Fall+1.967
 Alphabet+1.936
istine+1.834
Neg logit contributions
ients-1.716
housing-1.500
 isol-1.499
 disrupted-1.478
Module-1.475
Token.
Token ID13
Feature activation+2.26e-01

Pos logit contributions
ients+2.430
 isol+2.203
 disrupted+2.133
housing+2.115
 appealed+2.091
Neg logit contributions
phabet-3.098
 Ital-3.030
 Alphabet-2.724
 Fall-2.715
istine-2.636
Token
Token ID447
Feature activation-2.63e-01

Pos logit contributions
ients+1.102
 isol+0.986
housing+0.984
Module+0.964
 disrupted+0.958
Neg logit contributions
phabet-1.370
 Ital-1.313
 Alphabet-1.176
 Fall-1.173
istine-1.158
Token so
Token ID523
Feature activation-5.93e-03

Pos logit contributions
 Ital+0.758
phabet+0.757
 Fall+0.682
 Alphabet+0.675
istine+0.640
Neg logit contributions
ients-0.681
 isol-0.608
housing-0.599
 disrupted-0.595
Module-0.589
Token because
Token ID780
Feature activation+2.12e-01

Pos logit contributions
phabet+0.032
 Ital+0.032
 Fall+0.028
 Alphabet+0.028
istine+0.026
Neg logit contributions
ients-0.033
 isol-0.030
housing-0.030
 disrupted-0.029
Module-0.029
Token it
Token ID340
Feature activation-9.18e-03

Pos logit contributions
ients+1.044
 isol+0.947
 disrupted+0.922
housing+0.915
Module+0.897
Neg logit contributions
phabet-1.260
 Ital-1.249
 Fall-1.120
 Alphabet-1.111
istine-1.073
Token bet
Token ID731
Feature activation-5.02e-01

Pos logit contributions
phabet+0.057
 Ital+0.056
 Fall+0.051
 Alphabet+0.051
istine+0.048
Neg logit contributions
ients-0.044
 isol-0.039
 disrupted-0.038
housing-0.038
Module-0.038
Tokenrays
Token ID20477
Feature activation-1.64e-01

Pos logit contributions
 Ital+3.508
phabet+3.492
 Fall+3.235
 Alphabet+3.204
istine+3.070
Neg logit contributions
ients-1.954
housing-1.652
 isol-1.648
 disrupted-1.628
Module-1.615
Token an
Token ID281
Feature activation-7.52e-01

Pos logit contributions
phabet+0.966
 Ital+0.964
 Fall+0.865
 Alphabet+0.858
istine+0.819
Neg logit contributions
ients-0.823
 isol-0.737
housing-0.722
 disrupted-0.720
Module-0.710
Token ignorance
Token ID17010
Feature activation-3.39e-01

Pos logit contributions
 Ital+4.185
phabet+4.120
 Fall+3.769
 Alphabet+3.717
istine+3.508
Neg logit contributions
ients-4.024
housing-3.570
 isol-3.543
Module-3.504
 disrupted-3.500
Token in
Token ID287
Feature activation-2.06e-01

Pos logit contributions
 Ital+1.957
phabet+1.954
 Fall+1.755
 Alphabet+1.740
istine+1.648
Neg logit contributions
ients-1.756
 isol-1.567
housing-1.547
 disrupted-1.536
Module-1.518
Token most
Token ID749
Feature activation-2.48e-01

Pos logit contributions
phabet+1.169
 Ital+1.165
 Fall+1.043
 Alphabet+1.033
istine+0.985
Neg logit contributions
ients-1.083
 isol-0.981
housing-0.959
 disrupted-0.956
Module-0.941
Token things
Token ID1243
Feature activation-3.39e-01

Pos logit contributions
 Ital+1.519
phabet+1.518
 Fall+1.375
 Alphabet+1.363
istine+1.297
Neg logit contributions
ients-1.196
 isol-1.055
housing-1.042
 disrupted-1.033
Module-1.022
Token having
Token ID1719
Feature activation-4.05e-01

Pos logit contributions
 Ital+1.971
phabet+1.966
 Fall+1.777
 Alphabet+1.757
istine+1.668
Neg logit contributions
ients-1.733
 isol-1.539
housing-1.524
 disrupted-1.509
Module-1.497
Token drug
Token ID2563
Feature activation+5.07e-01

Pos logit contributions
ients+0.451
 isol+0.404
housing+0.398
 disrupted+0.397
Module+0.386
Neg logit contributions
phabet-0.556
 Ital-0.553
 Fall-0.494
 Alphabet-0.492
istine-0.473
Token-
Token ID12
Feature activation-3.64e-02

Pos logit contributions
ients+2.437
 isol+2.202
 disrupted+2.153
housing+2.123
 extrad+2.098
Neg logit contributions
 Ital-3.092
phabet-3.087
 Alphabet-2.737
 Fall-2.698
istine-2.656
Tokenrelated
Token ID5363
Feature activation-3.35e-01

Pos logit contributions
 Ital+0.157
phabet+0.156
 Fall+0.136
 Alphabet+0.134
istine+0.124
Neg logit contributions
ients-0.242
 isol-0.221
housing-0.220
 disrupted-0.218
Module-0.217
Token charges
Token ID4530
Feature activation+3.32e-01

Pos logit contributions
 Ital+1.770
phabet+1.752
 Fall+1.592
 Alphabet+1.563
istine+1.464
Neg logit contributions
ients-1.888
 isol-1.682
housing-1.680
Module-1.661
 disrupted-1.656
Token.
Token ID13
Feature activation-5.21e-01

Pos logit contributions
ients+1.657
 isol+1.489
 disrupted+1.461
housing+1.459
 extrad+1.421
Neg logit contributions
 Ital-1.963
phabet-1.961
 Fall-1.747
 Alphabet-1.735
istine-1.670
Token Today
Token ID6288
Feature activation-7.95e-01

Pos logit contributions
 Ital+2.606
phabet+2.568
 Fall+2.325
 Alphabet+2.283
istine+2.120
Neg logit contributions
ients-3.104
 isol-2.799
housing-2.774
 disrupted-2.758
Module-2.725
Token,
Token ID11
Feature activation-2.13e-01

Pos logit contributions
 Ital+3.721
phabet+3.708
 Fall+3.354
 Alphabet+3.292
atron+3.020
Neg logit contributions
ients-4.844
housing-4.339
Module-4.336
 isol-4.317
 disrupted-4.292
Token it
Token ID340
Feature activation+2.18e-01

Pos logit contributions
 Ital+1.201
phabet+1.197
 Fall+1.081
 Alphabet+1.067
istine+1.010
Neg logit contributions
ients-1.128
 isol-1.004
housing-0.996
 disrupted-0.986
Module-0.980
Token is
Token ID318
Feature activation+3.62e-02

Pos logit contributions
ients+1.204
 isol+1.102
 disrupted+1.074
housing+1.070
 extrad+1.043
Neg logit contributions
phabet-1.185
 Ital-1.175
 Alphabet-1.034
 Fall-1.031
istine-0.973
Token closer
Token ID5699
Feature activation+1.73e-01

Pos logit contributions
ients+0.183
 isol+0.164
 disrupted+0.161
housing+0.161
Module+0.157
Neg logit contributions
phabet-0.214
 Ital-0.214
 Fall-0.190
 Alphabet-0.189
istine-0.182
Token to
Token ID284
Feature activation+4.73e-01

Pos logit contributions
ients+1.069
 isol+0.965
housing+0.960
 disrupted+0.951
Module+0.949
Neg logit contributions
 Ital-0.825
phabet-0.823
 Fall-0.731
 Alphabet-0.719
istine-0.672
Token to
Token ID284
Feature activation-3.70e-01

Pos logit contributions
ients+1.004
 isol+0.888
 disrupted+0.882
housing+0.873
Module+0.854
Neg logit contributions
phabet-1.130
 Ital-1.121
 Fall-0.998
 Alphabet-0.995
istine-0.945
Token counting
Token ID14143
Feature activation-6.78e-01

Pos logit contributions
 Ital+2.189
phabet+2.179
 Fall+1.979
 Alphabet+1.956
istine+1.856
Neg logit contributions
ients-1.858
 isol-1.640
housing-1.628
 disrupted-1.611
Module-1.599
Token votes
Token ID5690
Feature activation-4.92e-01

Pos logit contributions
 Ital+3.745
phabet+3.704
 Fall+3.364
 Alphabet+3.320
istine+3.122
Neg logit contributions
ients-3.653
 isol-3.257
housing-3.249
 disrupted-3.204
Module-3.200
Token,
Token ID11
Feature activation-4.44e-01

Pos logit contributions
 Ital+2.717
phabet+2.707
 Fall+2.437
 Alphabet+2.407
istine+2.276
Neg logit contributions
ients-2.663
 isol-2.378
housing-2.359
 disrupted-2.336
Module-2.320
Token and
Token ID290
Feature activation-4.86e-01

Pos logit contributions
 Ital+2.494
phabet+2.481
 Fall+2.240
 Alphabet+2.213
istine+2.091
Neg logit contributions
ients-2.364
 isol-2.105
housing-2.091
 disrupted-2.068
Module-2.057
Token of
Token ID286
Feature activation-7.62e-01

Pos logit contributions
 Ital+2.763
phabet+2.748
 Fall+2.486
 Alphabet+2.457
istine+2.323
Neg logit contributions
ients-2.558
 isol-2.274
housing-2.259
 disrupted-2.232
Module-2.219
Token course
Token ID1781
Feature activation-4.78e-01

Pos logit contributions
 Ital+3.373
phabet+3.304
 Fall+2.980
 Alphabet+2.906
atron+2.643
Neg logit contributions
ients-4.867
housing-4.402
 isol-4.390
 disrupted-4.375
Module-4.331
Token I
Token ID314
Feature activation-2.55e-01

Pos logit contributions
 Ital+2.682
phabet+2.675
 Fall+2.410
 Alphabet+2.380
istine+2.256
Neg logit contributions
ients-2.544
 isol-2.268
housing-2.247
 disrupted-2.227
Module-2.209
Token couldn
Token ID3521
Feature activation-1.48e-01

Pos logit contributions
 Ital+1.568
phabet+1.565
 Fall+1.421
 Alphabet+1.407
istine+1.341
Neg logit contributions
ients-1.222
 isol-1.075
housing-1.063
 disrupted-1.055
Module-1.043
Token't
Token ID470
Feature activation-2.55e-01

Pos logit contributions
 Ital+0.923
phabet+0.920
 Fall+0.839
 Alphabet+0.830
istine+0.790
Neg logit contributions
ients-0.699
 isol-0.613
housing-0.608
 disrupted-0.601
Module-0.596
Token count
Token ID954
Feature activation-7.51e-01

Pos logit contributions
phabet+1.544
 Ital+1.537
 Fall+1.389
 Alphabet+1.377
istine+1.315
Neg logit contributions
ients-1.249
 isol-1.108
housing-1.091
 disrupted-1.083
Module-1.071
Token Steve
Token ID6542
Feature activation+2.60e-04

Pos logit contributions
ients+0.675
 isol+0.611
 disrupted+0.601
housing+0.600
Module+0.589
Neg logit contributions
phabet-0.606
 Ital-0.597
 Fall-0.526
 Alphabet-0.524
istine-0.502
Token Do
Token ID2141
Feature activation+4.82e-01

Pos logit contributions
ients+0.001
 isol+0.001
 disrupted+0.001
housing+0.001
Module+0.001
Neg logit contributions
phabet-0.002
 Ital-0.002
 Alphabet-0.002
 Fall-0.002
istine-0.002
Tokenocy
Token ID13733
Feature activation+2.81e-03

Pos logit contributions
ients+2.513
 isol+2.236
housing+2.198
 disrupted+2.188
Module+2.176
Neg logit contributions
phabet-2.753
 Ital-2.748
 Alphabet-2.440
 Fall-2.405
atron-2.307
Token,
Token ID11
Feature activation+2.90e-02

Pos logit contributions
ients+0.014
 isol+0.013
housing+0.012
 disrupted+0.012
Module+0.012
Neg logit contributions
 Ital-0.017
phabet-0.017
 Fall-0.015
 Alphabet-0.015
istine-0.014
Token Brian
Token ID8403
Feature activation+3.35e-01

Pos logit contributions
ients+0.156
 isol+0.140
 disrupted+0.138
housing+0.138
Module+0.135
Neg logit contributions
phabet-0.160
 Ital-0.160
 Fall-0.142
 Alphabet-0.141
istine-0.135
Token Kil
Token ID19902
Feature activation-8.61e-01

Pos logit contributions
ients+1.533
 isol+1.339
 disrupted+1.326
housing+1.294
Module+1.294
Neg logit contributions
phabet-2.183
 Ital-2.094
 Alphabet-1.911
 Fall-1.842
istine-1.803
Tokenme
Token ID1326
Feature activation-1.46e-01

Pos logit contributions
phabet+4.783
 Ital+4.708
 Fall+4.496
istine+4.237
 Alphabet+4.142
Neg logit contributions
ients-4.271
housing-3.923
 isol-3.778
 disrupted-3.761
Module-3.691
Tokenade
Token ID671
Feature activation+1.82e-01

Pos logit contributions
 Ital+0.604
phabet+0.600
 Fall+0.515
 Alphabet+0.510
istine+0.470
Neg logit contributions
ients-0.994
 isol-0.907
housing-0.902
 disrupted-0.892
Module-0.889
Token and
Token ID290
Feature activation-3.53e-01

Pos logit contributions
ients+1.042
 isol+0.934
housing+0.927
 disrupted+0.917
Module+0.915
Neg logit contributions
 Ital-0.959
phabet-0.953
 Fall-0.857
 Alphabet-0.841
istine-0.794
Token Me
Token ID2185
Feature activation-2.20e-01

Pos logit contributions
 Ital+1.716
phabet+1.687
 Fall+1.523
 Alphabet+1.491
istine+1.384
Neg logit contributions
ients-2.149
housing-1.936
 isol-1.934
Module-1.908
 disrupted-1.904
Tokengyn
Token ID40183
Feature activation+7.41e-03

Pos logit contributions
 Ital+1.432
phabet+1.432
 Fall+1.304
 Alphabet+1.297
istine+1.234
Neg logit contributions
ients-0.975
 isol-0.843
housing-0.838
 disrupted-0.825
Module-0.818