TOP ACTIVATIONS
MAX = 1.573

FC RA regulates the collection , dissemination , and use of
ay Nation this would never have been filmed . It is
graph = Fl atten [ Table [ Table [ graph List
Suzuki , The Chronicle 3 of 15 Photo : Le a
height = 600 ) gray = c v 2 . c
.. *) graph List = Table [ { Keys [ search
Hampshire . Hide Caption 6 of 15 Photos : Carly Fiorina
, March 24 , 2016 . REUTERS / Nick Oxford /
Iowa . Hide Caption 5 of 15 Photos : Carly Fiorina
. Height ) $ image = [ System . Draw ing
, 0 ) black hat = c v 2 . morphology
24 , 2016 . REUTERS / Nick Oxford / File Photo
self . depth , check : self . check , prev
v 2 . c vt Color ( image , c v
irect ed edges *) graph = Fl atten [ Table [
for the technical colleges . Thanks to Abrams maneuver
send you three meaningful actions you can take each week .
other heavy metals . REUTERS / B obby Y ip
plunged more than 10 percent after the FDA 's announcement .
2016 primary Sanders : ' Damn right ' I 'll make

MIN ACTIVATIONS
MIN = -1.484

extra level of safety ! Donald J . Trump (@
all episodes of 19 Kids and Count ing from the air
facts . This is as far as guess work can and
persona is so vastly different from his or her offline persona
Not - Associated T x - Power = 20 dB m
their possible paths ? As far as I can tell ,
the receiver and antenna as far removed , physically and electr
leadership was a disaster ! Donald J . Trump (@
y , with a cool m oust ache and a bit
Joe Buck . But as far as people viewing him unfavorable
_ P ulse Amount ( " P ulse amount " ,
did it attract you ? - January 15 , 2010 Que
Google Scholar Ol we us , D
stop awesome grappling hook aw es omen ess ! Having done
Foundation , told the Ou ach ita Citizen that the think
all about which player out - per forms the other .
people with bad manners . - January 4 , 2010 User
the same sex that is treated as a marriage under the
the reputation of medicine so greatly that women like Ok ami
while 12 months is probably too early to declare the curse

INTERVAL 0.809 - 1.573
CONTAINS 0.159%

sn acking ). The easiest and best way to accomplish this
" to enc irc le and destroy the enemy grouping invading
( path , 4 ) = '. cl j ' OR
aren t able to boast ; another popular and
by saying that Lee shows no love , and attention towards

INTERVAL 0.045 - 0.809
CONTAINS 42.219%

but at that time discussions were still said to be
of travel from Down Under ? Not something I would bet
bike turned up Thursday when Andre Sc ard ina , 19
large pr imate brain , or whether a honey bee might
published at The Wall Street Examiner . Follow the money !

INTERVAL -0.720 - 0.045
CONTAINS 57.328%

ll assume a relatively low rate , for a billionaire ,
to eight years made no distinction between youth and adult punishment
the territories ended up emphasizing them . In other
a CV the details tell you lots the exact list
power - play time per game last season , while Alex

INTERVAL -1.484 - -0.720
CONTAINS 0.293%

her buttocks at a rhythm with no apparent connection to the
Had ros aur # 2 Had ros aur
percentage of afford ably priced units whenever the zoning designation is
share the vision or agree with the underlying principles of our
in front of buildings like Sur rog ate 's Court .
Token FC
Token ID10029
Feature activation+1.44e-01

Pos logit contributions
+1.710
INST+1.577
aternal+1.492
agy+1.477
 Rabbit+1.454
Neg logit contributions
 comprehension-2.159
tesy-2.046
 stagn-2.010
abeth-1.970
ector-1.932
TokenRA
Token ID3861
Feature activation-4.48e-01

Pos logit contributions
 comprehension+0.737
tesy+0.675
 stagn+0.659
abeth+0.640
ector+0.628
Neg logit contributions
-1.205
INST-1.114
aternal-1.076
agy-1.069
iotics-1.058
Token regulates
Token ID39474
Feature activation-1.60e-01

Pos logit contributions
+2.920
INST+2.726
aternal+2.593
agy+2.566
 Rabbit+2.541
Neg logit contributions
 comprehension-3.047
tesy-2.893
 stagn-2.803
abeth-2.768
ector-2.701
Token the
Token ID262
Feature activation+3.98e-01

Pos logit contributions
+1.117
INST+1.043
aternal+0.996
agy+0.989
 Rabbit+0.973
Neg logit contributions
 comprehension-1.029
tesy-0.964
 stagn-0.946
abeth-0.922
ector-0.901
Token collection
Token ID4947
Feature activation+5.82e-01

Pos logit contributions
 comprehension+2.213
tesy+1.942
 stagn+1.927
 unde+1.858
abeth+1.802
Neg logit contributions
-3.228
INST-2.968
aternal-2.909
agy-2.881
iotics-2.832
Token,
Token ID11
Feature activation+1.57e+00

Pos logit contributions
 comprehension+3.828
tesy+3.539
 stagn+3.466
abeth+3.390
 unde+3.290
Neg logit contributions
-3.933
INST-3.660
aternal-3.518
agy-3.459
 Rabbit-3.401
Token dissemination
Token ID44832
Feature activation+4.38e-01

Pos logit contributions
 comprehension+7.250
 unde+5.840
tesy+5.755
 stagn+5.651
 retaining+5.633
Neg logit contributions
-13.115
aternal-12.052
iotics-11.948
INST-11.777
 Rabbit-11.284
Token,
Token ID11
Feature activation+9.78e-01

Pos logit contributions
 comprehension+3.034
tesy+2.832
 stagn+2.786
abeth+2.709
ector+2.663
Neg logit contributions
-2.838
INST-2.636
aternal-2.511
agy-2.475
 Rabbit-2.443
Token and
Token ID290
Feature activation+9.83e-01

Pos logit contributions
 comprehension+6.209
 stagn+5.394
tesy+5.324
 unde+5.244
 manipulative+5.036
Neg logit contributions
-6.987
aternal-6.280
INST-6.233
iotics-6.174
agy-6.032
Token use
Token ID779
Feature activation+7.34e-01

Pos logit contributions
 comprehension+4.958
 stagn+4.083
tesy+4.077
 unde+3.975
 manipulative+3.805
Neg logit contributions
-8.362
aternal-7.589
iotics-7.570
INST-7.563
agy-7.481
Token of
Token ID286
Feature activation-1.87e-01

Pos logit contributions
 comprehension+3.989
tesy+3.695
 stagn+3.556
abeth+3.429
ector+3.387
Neg logit contributions
-5.713
INST-5.397
aternal-5.222
agy-5.129
 Rabbit-5.102
Tokenay
Token ID323
Feature activation-6.49e-02

Pos logit contributions
 comprehension+0.137
tesy+0.130
 stagn+0.128
abeth+0.125
ector+0.123
Neg logit contributions
-0.105
INST-0.097
aternal-0.091
agy-0.090
 Rabbit-0.089
Token Nation
Token ID8741
Feature activation-1.04e-01

Pos logit contributions
+0.396
INST+0.366
aternal+0.347
agy+0.344
 Rabbit+0.338
Neg logit contributions
 comprehension-0.475
tesy-0.449
 stagn-0.441
abeth-0.432
ector-0.423
Token this
Token ID428
Feature activation+6.29e-01

Pos logit contributions
+0.679
INST+0.631
aternal+0.600
agy+0.595
 Rabbit+0.586
Neg logit contributions
 comprehension-0.716
tesy-0.674
 stagn-0.662
abeth-0.647
ector-0.634
Token would
Token ID561
Feature activation+6.38e-01

Pos logit contributions
 comprehension+3.809
 stagn+3.540
tesy+3.411
 unde+3.282
abeth+3.223
Neg logit contributions
-4.593
INST-4.291
agy-4.161
aternal-4.152
iotics-4.007
Token never
Token ID1239
Feature activation+1.12e+00

Pos logit contributions
 comprehension+3.894
 stagn+3.722
tesy+3.666
abeth+3.443
ector+3.423
Neg logit contributions
-4.456
INST-4.238
agy-4.035
aternal-3.992
 Cosponsors-3.972
Token have
Token ID423
Feature activation+1.44e+00

Pos logit contributions
 comprehension+6.186
 stagn+5.905
tesy+5.698
 unde+5.498
abeth+5.386
Neg logit contributions
-7.935
INST-7.734
 Cosponsors-7.415
agy-7.222
 Rabbit-7.200
Token been
Token ID587
Feature activation+1.21e+00

Pos logit contributions
 comprehension+6.717
 stagn+6.503
tesy+6.060
abeth+5.833
 unde+5.718
Neg logit contributions
-10.823
INST-10.612
 Cosponsors-10.377
aternal-10.111
agy-9.910
Token filmed
Token ID18976
Feature activation+6.22e-01

Pos logit contributions
 comprehension+4.940
 stagn+4.479
tesy+4.260
 unde+4.151
abeth+3.943
Neg logit contributions
-10.361
INST-9.804
 Cosponsors-9.796
aternal-9.725
agy-9.647
Token.
Token ID13
Feature activation-3.44e-02

Pos logit contributions
 comprehension+3.954
tesy+3.691
 stagn+3.648
abeth+3.501
 unde+3.469
Neg logit contributions
-4.165
INST-3.932
aternal-3.817
agy-3.790
 Cosponsors-3.709
Token It
Token ID632
Feature activation+4.21e-03

Pos logit contributions
+0.205
INST+0.189
aternal+0.181
agy+0.179
iotics+0.174
Neg logit contributions
 comprehension-0.255
tesy-0.241
 stagn-0.236
abeth-0.232
ector-0.228
Token is
Token ID318
Feature activation+1.66e-02

Pos logit contributions
 comprehension+0.029
tesy+0.028
 stagn+0.027
abeth+0.026
ector+0.026
Neg logit contributions
-0.027
INST-0.025
aternal-0.024
agy-0.024
 Rabbit-0.023
Token graph
Token ID4823
Feature activation-1.36e-01

Pos logit contributions
+2.156
INST+2.015
aternal+1.910
 Rabbit+1.894
agy+1.885
Neg logit contributions
 comprehension-1.897
tesy-1.818
 stagn-1.752
abeth-1.732
ector-1.659
Token =
Token ID796
Feature activation+1.26e+00

Pos logit contributions
+1.094
INST+1.031
aternal+0.991
agy+0.984
 Rabbit+0.971
Neg logit contributions
 comprehension-0.729
tesy-0.674
 stagn-0.659
abeth-0.639
ector-0.620
Token Fl
Token ID1610
Feature activation+5.12e-01

Pos logit contributions
 comprehension+8.418
 stagn+7.541
 unde+7.462
 Stru+7.356
tesy+7.356
Neg logit contributions
-7.673
INST-7.213
agy-7.039
iotics-7.032
aternal-7.030
Tokenatten
Token ID41769
Feature activation+9.30e-01

Pos logit contributions
 comprehension+1.049
tesy+0.915
 stagn+0.803
ector+0.802
abeth+0.772
Neg logit contributions
-5.779
INST-5.404
 Rabbit-5.331
iotics-5.235
aternal-5.189
Token[
Token ID58
Feature activation+9.57e-01

Pos logit contributions
 comprehension+4.055
tesy+3.546
ector+3.435
 stagn+3.433
abeth+3.330
Neg logit contributions
-7.924
INST-7.550
iotics-7.424
aternal-7.389
agy-7.303
Token Table
Token ID8655
Feature activation+1.43e+00

Pos logit contributions
 comprehension+6.268
tesy+5.676
 stagn+5.647
ector+5.590
 unde+5.461
Neg logit contributions
-6.034
INST-5.612
agy-5.462
aternal-5.457
iotics-5.388
Token[
Token ID58
Feature activation+1.17e+00

Pos logit contributions
 comprehension+5.336
ector+4.527
tesy+4.515
 stagn+4.240
abeth+4.061
Neg logit contributions
-12.608
INST-11.747
aternal-11.721
iotics-11.681
agy-11.497
TokenTable
Token ID10962
Feature activation+1.03e+00

Pos logit contributions
 comprehension+6.802
 stagn+5.980
tesy+5.965
ector+5.887
 unde+5.737
Neg logit contributions
-8.108
INST-7.518
aternal-7.407
agy-7.305
iotics-7.283
Token[
Token ID58
Feature activation+5.65e-01

Pos logit contributions
 comprehension+4.325
tesy+3.802
ector+3.760
 stagn+3.559
abeth+3.505
Neg logit contributions
-8.885
INST-8.265
aternal-8.159
iotics-8.154
agy-7.998
Token graph
Token ID4823
Feature activation+1.73e-01

Pos logit contributions
 comprehension+3.844
tesy+3.540
 stagn+3.486
ector+3.445
abeth+3.389
Neg logit contributions
-3.622
INST-3.323
aternal-3.229
agy-3.198
iotics-3.168
TokenList
Token ID8053
Feature activation+7.74e-01

Pos logit contributions
 comprehension+1.097
tesy+1.013
 stagn+0.990
abeth+0.969
ector+0.966
Neg logit contributions
-1.218
INST-1.132
aternal-1.091
agy-1.088
iotics-1.071
Token Suzuki
Token ID35807
Feature activation+5.54e-01

Pos logit contributions
 comprehension+2.341
tesy+2.145
abeth+2.023
 Stru+1.958
 stagn+1.944
Neg logit contributions
-5.603
INST-5.197
aternal-5.164
agy-5.119
iotics-5.099
Token,
Token ID11
Feature activation+1.73e-01

Pos logit contributions
 comprehension+3.288
tesy+3.081
 stagn+2.937
abeth+2.906
CTV+2.905
Neg logit contributions
-3.965
INST-3.682
aternal-3.617
iotics-3.588
agy-3.565
Token The
Token ID383
Feature activation+8.31e-01

Pos logit contributions
 comprehension+1.342
tesy+1.288
 stagn+1.240
abeth+1.232
 unde+1.204
Neg logit contributions
-0.962
INST-0.866
aternal-0.842
agy-0.833
iotics-0.829
Token Chronicle
Token ID14160
Feature activation+7.71e-01

Pos logit contributions
 comprehension+4.384
tesy+4.083
 stagn+3.897
abeth+3.829
CTV+3.806
Neg logit contributions
-6.350
INST-6.001
aternal-5.893
iotics-5.870
agy-5.866
Token 3
Token ID513
Feature activation+2.45e-01

Pos logit contributions
 comprehension+5.019
tesy+4.838
 stagn+4.607
ector+4.573
abeth+4.501
Neg logit contributions
-5.036
INST-4.615
agy-4.463
aternal-4.396
iotics-4.302
Token of
Token ID286
Feature activation+1.41e+00

Pos logit contributions
 comprehension+1.408
tesy+1.309
 stagn+1.279
abeth+1.231
ector+1.211
Neg logit contributions
-1.878
INST-1.761
aternal-1.688
agy-1.673
iotics-1.657
Token 15
Token ID1315
Feature activation+3.37e-01

Pos logit contributions
 comprehension+6.538
tesy+6.366
 stagn+6.130
 Stru+5.837
essa+5.828
Neg logit contributions
-10.560
aternal-9.970
INST-9.909
iotics-9.709
agy-9.691
Token Photo
Token ID5555
Feature activation+7.47e-01

Pos logit contributions
 comprehension+2.915
tesy+2.792
 stagn+2.715
abeth+2.663
ector+2.649
Neg logit contributions
-1.600
INST-1.390
aternal-1.330
agy-1.325
iotics-1.302
Token:
Token ID25
Feature activation+5.57e-01

Pos logit contributions
 comprehension+3.973
tesy+3.664
 stagn+3.531
abeth+3.385
ector+3.329
Neg logit contributions
-5.967
aternal-5.391
INST-5.387
iotics-5.316
agy-5.244
Token Le
Token ID1004
Feature activation+7.21e-01

Pos logit contributions
 comprehension+3.823
tesy+3.581
 stagn+3.481
abeth+3.477
 unde+3.374
Neg logit contributions
-3.534
INST-3.221
aternal-3.117
iotics-3.064
agy-3.061
Tokena
Token ID64
Feature activation+7.10e-01

Pos logit contributions
 comprehension+3.509
tesy+3.337
abeth+3.254
 stagn+3.119
essa+3.071
Neg logit contributions
-5.938
INST-5.472
 Cosponsors-5.265
iotics-5.222
aternal-5.168
Token height
Token ID6001
Feature activation+4.33e-01

Pos logit contributions
+0.966
INST+0.898
aternal+0.858
agy+0.853
 Rabbit+0.835
Neg logit contributions
 comprehension-0.890
tesy-0.827
 stagn-0.820
abeth-0.792
ector-0.778
Token =
Token ID796
Feature activation+8.51e-01

Pos logit contributions
 comprehension+2.173
tesy+1.948
 stagn+1.941
abeth+1.884
 unde+1.843
Neg logit contributions
-3.608
INST-3.337
aternal-3.281
iotics-3.223
agy-3.210
Token 600
Token ID10053
Feature activation-2.38e-01

Pos logit contributions
 comprehension+4.015
 stagn+3.550
tesy+3.529
 unde+3.432
ector+3.281
Neg logit contributions
-7.272
INST-6.825
aternal-6.617
 Cosponsors-6.534
-6.494
Token )
Token ID1267
Feature activation+1.84e-01

Pos logit contributions
+1.422
INST+1.315
aternal+1.244
agy+1.234
 Rabbit+1.212
Neg logit contributions
 comprehension-1.774
tesy-1.677
 stagn-1.650
abeth-1.615
ector-1.583
Token gray
Token ID12768
Feature activation+2.77e-01

Pos logit contributions
 comprehension+0.794
tesy+0.706
 stagn+0.694
abeth+0.658
 unde+0.641
Neg logit contributions
-1.695
INST-1.601
aternal-1.543
agy-1.532
 Rabbit-1.514
Token =
Token ID796
Feature activation+1.41e+00

Pos logit contributions
 comprehension+1.445
tesy+1.307
 stagn+1.294
abeth+1.245
ector+1.232
Neg logit contributions
-2.281
INST-2.110
aternal-2.041
agy-2.033
iotics-1.998
Token c
Token ID269
Feature activation+3.05e-01

Pos logit contributions
 comprehension+5.902
 stagn+5.426
 unde+4.966
 subt+4.674
ector+4.352
Neg logit contributions
-12.763
INST-11.831
aternal-11.448
agy-11.315
iotics-11.277
Tokenv
Token ID85
Feature activation+7.95e-01

Pos logit contributions
 comprehension+2.248
tesy+2.120
 stagn+2.109
abeth+2.052
ector+2.048
Neg logit contributions
-1.901
INST-1.663
aternal-1.574
 Rabbit-1.571
 Cosponsors-1.539
Token2
Token ID17
Feature activation+5.38e-01

Pos logit contributions
 comprehension+4.947
tesy+4.620
 stagn+4.491
CTV+4.439
ector+4.432
Neg logit contributions
-5.547
INST-4.852
iotics-4.749
agy-4.735
aternal-4.717
Token.
Token ID764
Feature activation+7.31e-01

Pos logit contributions
 comprehension+3.132
 stagn+2.828
tesy+2.807
ector+2.727
 unde+2.717
Neg logit contributions
-4.172
INST-3.759
aternal-3.620
agy-3.604
iotics-3.585
Token c
Token ID269
Feature activation-4.61e-02

Pos logit contributions
 comprehension+4.143
 stagn+3.694
 unde+3.639
tesy+3.570
ector+3.461
Neg logit contributions
-5.884
INST-5.366
aternal-5.162
agy-5.160
iotics-5.078
Token..
Token ID492
Feature activation+2.88e-01

Pos logit contributions
 comprehension+3.598
tesy+3.360
 stagn+3.293
ector+3.242
abeth+3.189
Neg logit contributions
-2.609
INST-2.362
aternal-2.282
agy-2.225
iotics-2.208
Token *)
Token ID31936
Feature activation+5.54e-02

Pos logit contributions
 comprehension+1.971
tesy+1.825
 stagn+1.783
abeth+1.743
ector+1.738
Neg logit contributions
-1.859
INST-1.729
aternal-1.676
agy-1.635
 Rabbit-1.618
Token graph
Token ID4823
Feature activation+4.13e-02

Pos logit contributions
 comprehension+0.377
tesy+0.351
 stagn+0.347
abeth+0.337
ector+0.332
Neg logit contributions
-0.368
INST-0.342
aternal-0.327
agy-0.324
 Rabbit-0.317
TokenList
Token ID8053
Feature activation+3.38e-01

Pos logit contributions
 comprehension+0.258
tesy+0.238
 stagn+0.235
abeth+0.228
ector+0.226
Neg logit contributions
-0.297
INST-0.277
aternal-0.266
agy-0.264
 Rabbit-0.260
Token =
Token ID796
Feature activation+1.05e+00

Pos logit contributions
 comprehension+1.968
tesy+1.770
 stagn+1.744
ector+1.709
abeth+1.679
Neg logit contributions
-2.567
INST-2.401
aternal-2.335
iotics-2.292
agy-2.287
Token Table
Token ID8655
Feature activation+1.31e+00

Pos logit contributions
 comprehension+6.797
 stagn+6.040
 unde+5.777
ector+5.736
tesy+5.715
Neg logit contributions
-7.003
INST-6.479
aternal-6.392
iotics-6.284
agy-6.238
Token[
Token ID58
Feature activation+9.91e-01

Pos logit contributions
 comprehension+5.344
tesy+4.500
ector+4.409
 stagn+4.256
abeth+4.060
Neg logit contributions
-11.143
iotics-10.411
INST-10.368
aternal-10.343
 Ü-10.103
Token {
Token ID1391
Feature activation-5.49e-02

Pos logit contributions
 comprehension+6.552
 stagn+5.894
tesy+5.838
ector+5.773
abeth+5.581
Neg logit contributions
-6.256
INST-5.744
aternal-5.658
iotics-5.526
agy-5.525
Token Keys
Token ID26363
Feature activation+5.23e-01

Pos logit contributions
+0.363
INST+0.338
aternal+0.322
agy+0.319
 Rabbit+0.314
Neg logit contributions
 comprehension-0.373
tesy-0.351
 stagn-0.344
abeth-0.336
ector-0.329
Token[
Token ID58
Feature activation+5.42e-02

Pos logit contributions
 comprehension+2.475
tesy+2.203
 stagn+2.129
ector+2.082
abeth+2.081
Neg logit contributions
-4.476
INST-4.237
aternal-4.117
agy-4.067
iotics-4.036
Tokensearch
Token ID12947
Feature activation-2.51e-01

Pos logit contributions
 comprehension+0.519
tesy+0.496
 stagn+0.490
abeth+0.481
ector+0.476
Neg logit contributions
-0.209
INST-0.183
aternal-0.168
agy-0.165
 Rabbit-0.160
Token Hampshire
Token ID13910
Feature activation+4.54e-02

Pos logit contributions
 comprehension+0.656
tesy+0.601
 stagn+0.584
abeth+0.566
ector+0.545
Neg logit contributions
-1.180
INST-1.114
aternal-1.073
agy-1.068
 Rabbit-1.050
Token.
Token ID13
Feature activation+3.53e-01

Pos logit contributions
 comprehension+0.322
tesy+0.304
 stagn+0.299
abeth+0.291
ector+0.286
Neg logit contributions
-0.286
INST-0.266
aternal-0.252
agy-0.250
 Rabbit-0.246
Token Hide
Token ID10415
Feature activation+2.63e-02

Pos logit contributions
 comprehension+2.363
tesy+2.240
abeth+2.180
 stagn+2.168
 unde+2.110
Neg logit contributions
-2.304
INST-2.130
aternal-2.048
agy-2.045
iotics-2.002
Token Caption
Token ID11260
Feature activation+5.11e-01

Pos logit contributions
 comprehension+0.178
tesy+0.168
 stagn+0.164
abeth+0.161
ector+0.158
Neg logit contributions
-0.174
INST-0.162
aternal-0.154
agy-0.153
iotics-0.150
Token 6
Token ID718
Feature activation+1.32e-01

Pos logit contributions
 comprehension+2.529
tesy+2.285
 stagn+2.216
abeth+2.183
ector+2.114
Neg logit contributions
-4.265
INST-4.059
aternal-3.942
agy-3.864
iotics-3.828
Token of
Token ID286
Feature activation+1.30e+00

Pos logit contributions
 comprehension+0.767
tesy+0.712
 stagn+0.698
abeth+0.676
ector+0.664
Neg logit contributions
-1.001
INST-0.934
aternal-0.900
agy-0.889
iotics-0.879
Token 15
Token ID1315
Feature activation+5.89e-01

Pos logit contributions
 comprehension+6.216
 stagn+5.828
tesy+5.732
 Stru+5.512
essa+5.439
Neg logit contributions
-10.097
aternal-9.316
iotics-9.291
INST-9.197
agy-9.078
Token Photos
Token ID9434
Feature activation+6.94e-01

Pos logit contributions
 comprehension+2.629
tesy+2.452
 stagn+2.334
abeth+2.299
 Stru+2.271
Neg logit contributions
-5.009
INST-4.720
aternal-4.671
iotics-4.579
agy-4.531
Token:
Token ID25
Feature activation+8.89e-02

Pos logit contributions
 comprehension+3.448
tesy+3.166
 stagn+3.100
abeth+2.986
ector+2.943
Neg logit contributions
-5.706
INST-5.176
aternal-5.143
iotics-5.119
agy-4.953
Token Carly
Token ID35982
Feature activation+2.84e-02

Pos logit contributions
 comprehension+0.540
tesy+0.503
 stagn+0.493
abeth+0.487
ector+0.470
Neg logit contributions
-0.648
INST-0.607
aternal-0.584
agy-0.580
iotics-0.570
Token Fiorina
Token ID49028
Feature activation-2.18e-01

Pos logit contributions
 comprehension+0.105
tesy+0.093
 stagn+0.090
abeth+0.087
ector+0.083
Neg logit contributions
-0.276
INST-0.262
aternal-0.254
agy-0.252
iotics-0.249
Token,
Token ID11
Feature activation+2.97e-01

Pos logit contributions
 comprehension+2.261
tesy+2.157
 stagn+2.110
abeth+2.038
ector+2.006
Neg logit contributions
-2.367
INST-2.196
aternal-2.099
agy-2.091
iotics-2.057
Token March
Token ID2805
Feature activation+5.09e-01

Pos logit contributions
 comprehension+1.912
 stagn+1.777
tesy+1.776
abeth+1.695
 unde+1.693
Neg logit contributions
-2.075
INST-1.897
aternal-1.837
agy-1.819
iotics-1.790
Token 24
Token ID1987
Feature activation+5.17e-01

Pos logit contributions
 comprehension+3.166
 stagn+2.895
tesy+2.872
abeth+2.836
 unde+2.769
Neg logit contributions
-3.600
INST-3.270
aternal-3.230
agy-3.209
iotics-3.186
Token,
Token ID11
Feature activation+2.20e-01

Pos logit contributions
 comprehension+3.106
 stagn+2.890
tesy+2.857
abeth+2.798
 unde+2.740
Neg logit contributions
-3.726
INST-3.403
aternal-3.258
agy-3.257
iotics-3.231
Token 2016
Token ID1584
Feature activation+7.26e-01

Pos logit contributions
 comprehension+1.648
tesy+1.541
 stagn+1.531
abeth+1.494
 unde+1.471
Neg logit contributions
-1.316
INST-1.186
aternal-1.141
agy-1.123
iotics-1.093
Token.
Token ID13
Feature activation+1.29e+00

Pos logit contributions
 comprehension+4.266
tesy+4.014
 stagn+3.896
abeth+3.754
ector+3.708
Neg logit contributions
-5.373
INST-4.784
agy-4.732
aternal-4.616
iotics-4.550
Token REUTERS
Token ID15862
Feature activation+2.41e-01

Pos logit contributions
 comprehension+4.595
tesy+4.259
CTV+4.134
 Stru+4.070
 Citation+4.022
Neg logit contributions
-11.166
agy-10.566
aternal-10.378
iotics-10.277
INST-10.042
Token/
Token ID14
Feature activation+1.27e+00

Pos logit contributions
 comprehension+1.100
tesy+1.000
 stagn+0.962
abeth+0.937
ector+0.928
Neg logit contributions
-2.137
INST-1.992
aternal-1.949
agy-1.934
iotics-1.919
TokenNick
Token ID23609
Feature activation+4.06e-01

Pos logit contributions
CTV+7.329
 comprehension+7.174
abeth+6.959
tesy+6.931
ector+6.870
Neg logit contributions
-8.355
aternal-7.628
iotics-7.564
 Rabbit-7.199
agy-7.063
Token Oxford
Token ID13643
Feature activation+5.27e-01

Pos logit contributions
 comprehension+2.707
tesy+2.564
abeth+2.510
ector+2.473
 stagn+2.466
Neg logit contributions
-2.636
INST-2.508
aternal-2.333
agy-2.302
iotics-2.240
Token/
Token ID14
Feature activation+9.36e-01

Pos logit contributions
 comprehension+3.445
tesy+3.245
 stagn+3.157
abeth+3.072
ector+3.058
Neg logit contributions
-3.592
INST-3.332
aternal-3.159
agy-3.123
 Rabbit-3.067
Token Iowa
Token ID9406
Feature activation+2.58e-01

Pos logit contributions
 comprehension+0.695
tesy+0.650
 stagn+0.643
abeth+0.624
 unde+0.612
Neg logit contributions
-0.682
INST-0.630
aternal-0.604
agy-0.599
iotics-0.584
Token.
Token ID13
Feature activation+5.10e-01

Pos logit contributions
 comprehension+1.708
tesy+1.612
 stagn+1.575
abeth+1.551
ector+1.515
Neg logit contributions
-1.742
INST-1.618
aternal-1.537
agy-1.529
iotics-1.494
Token Hide
Token ID10415
Feature activation+4.69e-02

Pos logit contributions
 comprehension+3.279
tesy+3.106
abeth+3.023
 stagn+2.989
 Stru+2.980
Neg logit contributions
-3.401
INST-3.150
agy-3.057
aternal-3.047
iotics-2.991
Token Caption
Token ID11260
Feature activation+3.00e-01

Pos logit contributions
 comprehension+0.317
tesy+0.298
 stagn+0.291
abeth+0.286
ector+0.281
Neg logit contributions
-0.312
INST-0.291
aternal-0.277
agy-0.274
iotics-0.270
Token 5
Token ID642
Feature activation+2.26e-01

Pos logit contributions
 comprehension+1.702
tesy+1.565
 stagn+1.518
ector+1.479
abeth+1.466
Neg logit contributions
-2.346
INST-2.160
aternal-2.120
agy-2.063
iotics-2.049
Token of
Token ID286
Feature activation+1.29e+00

Pos logit contributions
 comprehension+1.272
tesy+1.175
 stagn+1.151
abeth+1.110
ector+1.098
Neg logit contributions
-1.766
INST-1.641
aternal-1.589
agy-1.563
iotics-1.560
Token 15
Token ID1315
Feature activation+5.93e-01

Pos logit contributions
 comprehension+6.040
 stagn+5.662
tesy+5.621
 Stru+5.499
essa+5.320
Neg logit contributions
-10.078
INST-9.303
aternal-9.274
iotics-9.226
agy-9.018
Token Photos
Token ID9434
Feature activation+6.53e-01

Pos logit contributions
 comprehension+2.573
tesy+2.410
 stagn+2.286
abeth+2.257
 Stru+2.253
Neg logit contributions
-5.104
INST-4.813
aternal-4.761
iotics-4.670
agy-4.624
Token:
Token ID25
Feature activation+6.21e-02

Pos logit contributions
 comprehension+3.303
tesy+3.049
 stagn+2.977
abeth+2.869
ector+2.852
Neg logit contributions
-5.316
INST-4.828
aternal-4.785
iotics-4.760
agy-4.617
Token Carly
Token ID35982
Feature activation-9.74e-02

Pos logit contributions
 comprehension+0.373
tesy+0.347
 stagn+0.340
abeth+0.336
 unde+0.324
Neg logit contributions
-0.456
INST-0.427
aternal-0.413
agy-0.409
iotics-0.402
Token Fiorina
Token ID49028
Feature activation-7.98e-02

Pos logit contributions
+0.931
INST+0.885
aternal+0.856
agy+0.851
 Rabbit+0.841
Neg logit contributions
 comprehension-0.377
tesy-0.337
 stagn-0.327
abeth-0.313
ector-0.299
Token.
Token ID13
Feature activation+2.28e-02

Pos logit contributions
 comprehension+0.842
tesy+0.791
 stagn+0.771
abeth+0.759
ector+0.751
Neg logit contributions
-0.899
INST-0.821
aternal-0.797
agy-0.785
iotics-0.781
TokenHeight
Token ID23106
Feature activation-4.34e-01

Pos logit contributions
 comprehension+0.163
tesy+0.154
 stagn+0.151
abeth+0.150
ector+0.147
Neg logit contributions
-0.142
INST-0.129
aternal-0.124
agy-0.124
iotics-0.121
Token)
Token ID8
Feature activation+2.72e-01

Pos logit contributions
+2.225
INST+2.031
agy+1.889
aternal+1.888
 Rabbit+1.839
Neg logit contributions
 comprehension-3.575
tesy-3.387
 stagn-3.365
abeth-3.275
ector-3.212
Token $
Token ID720
Feature activation-3.18e-01

Pos logit contributions
 comprehension+1.638
tesy+1.507
 stagn+1.493
abeth+1.436
 unde+1.426
Neg logit contributions
-2.004
INST-1.877
aternal-1.811
agy-1.786
iotics-1.759
Tokenimage
Token ID9060
Feature activation+1.16e-01

Pos logit contributions
+2.100
INST+1.967
aternal+1.856
agy+1.845
 Rabbit+1.808
Neg logit contributions
 comprehension-2.166
tesy-2.032
 stagn-2.004
abeth-1.948
ector-1.901
Token =
Token ID796
Feature activation+1.28e+00

Pos logit contributions
 comprehension+0.698
tesy+0.650
 stagn+0.635
abeth+0.618
ector+0.608
Neg logit contributions
-0.855
INST-0.797
aternal-0.768
agy-0.761
 Rabbit-0.748
Token [
Token ID685
Feature activation+2.75e-01

Pos logit contributions
 comprehension+6.826
 stagn+6.121
 unde+6.032
tesy+5.948
ector+5.733
Neg logit contributions
-9.411
INST-8.706
aternal-8.660
iotics-8.596
agy-8.537
TokenSystem
Token ID11964
Feature activation+2.85e-01

Pos logit contributions
 comprehension+1.744
tesy+1.649
 stagn+1.592
abeth+1.566
ector+1.539
Neg logit contributions
-1.922
INST-1.766
aternal-1.722
agy-1.707
 Rabbit-1.675
Token.
Token ID13
Feature activation+7.14e-02

Pos logit contributions
 comprehension+1.782
tesy+1.675
 stagn+1.619
abeth+1.597
ector+1.580
Neg logit contributions
-2.044
INST-1.848
aternal-1.814
agy-1.780
iotics-1.778
TokenDraw
Token ID25302
Feature activation-1.22e-03

Pos logit contributions
 comprehension+0.653
tesy+0.624
 stagn+0.614
abeth+0.604
ector+0.596
Neg logit contributions
-0.305
INST-0.271
aternal-0.252
agy-0.249
 Rabbit-0.242
Tokening
Token ID278
Feature activation+2.51e-01

Pos logit contributions
+0.009
INST+0.008
aternal+0.008
agy+0.008
 Rabbit+0.008
Neg logit contributions
 comprehension-0.007
tesy-0.007
 stagn-0.007
abeth-0.007
ector-0.007
Token,
Token ID837
Feature activation+9.55e-02

Pos logit contributions
 comprehension+0.064
tesy+0.061
 stagn+0.059
abeth+0.058
 unde+0.057
Neg logit contributions
-0.059
INST-0.053
aternal-0.051
agy-0.050
iotics-0.050
Token 0
Token ID657
Feature activation+1.62e-01

Pos logit contributions
 comprehension+0.529
 stagn+0.483
tesy+0.477
abeth+0.454
ector+0.450
Neg logit contributions
-0.764
INST-0.712
aternal-0.683
agy-0.680
iotics-0.671
Token )
Token ID1267
Feature activation+2.62e-01

Pos logit contributions
 comprehension+1.099
tesy+1.029
 stagn+1.008
abeth+0.995
ector+0.970
Neg logit contributions
-1.089
INST-0.988
aternal-0.945
agy-0.937
iotics-0.923
Token black
Token ID2042
Feature activation+1.80e-01

Pos logit contributions
 comprehension+1.476
 stagn+1.333
tesy+1.316
 unde+1.278
abeth+1.250
Neg logit contributions
-2.119
INST-1.952
aternal-1.867
agy-1.845
iotics-1.827
Tokenhat
Token ID5183
Feature activation+1.62e-01

Pos logit contributions
 comprehension+1.468
tesy+1.392
 stagn+1.370
abeth+1.350
ector+1.330
Neg logit contributions
-0.967
INST-0.859
agy-0.805
aternal-0.805
 Rabbit-0.797
Token =
Token ID796
Feature activation+1.27e+00

Pos logit contributions
 comprehension+0.884
tesy+0.813
 stagn+0.802
abeth+0.777
ector+0.766
Neg logit contributions
-1.286
INST-1.197
aternal-1.159
agy-1.149
iotics-1.134
Token c
Token ID269
Feature activation+1.81e-01

Pos logit contributions
 comprehension+5.706
 stagn+5.206
 unde+4.784
 subt+4.445
ector+4.359
Neg logit contributions
-11.257
INST-10.477
aternal-10.160
iotics-10.017
agy-9.992
Tokenv
Token ID85
Feature activation+6.55e-01

Pos logit contributions
 comprehension+1.383
tesy+1.310
 stagn+1.296
abeth+1.265
ector+1.257
Neg logit contributions
-1.064
INST-0.944
aternal-0.891
 Rabbit-0.881
agy-0.871
Token2
Token ID17
Feature activation+4.58e-01

Pos logit contributions
 comprehension+4.327
tesy+4.079
 stagn+3.963
ector+3.904
abeth+3.890
Neg logit contributions
-4.389
INST-3.863
iotics-3.715
aternal-3.705
agy-3.695
Token.
Token ID764
Feature activation+3.72e-01

Pos logit contributions
 comprehension+2.718
tesy+2.483
 stagn+2.459
abeth+2.387
ector+2.380
Neg logit contributions
-3.465
INST-3.138
aternal-3.020
iotics-2.998
agy-2.993
Token morphology
Token ID46320
Feature activation+9.82e-01

Pos logit contributions
 comprehension+2.736
tesy+2.512
 stagn+2.503
 unde+2.442
abeth+2.412
Neg logit contributions
-2.300
INST-2.110
aternal-1.997
agy-1.984
 Rabbit-1.945
Token 24
Token ID1987
Feature activation+5.17e-01

Pos logit contributions
 comprehension+3.166
 stagn+2.895
tesy+2.872
abeth+2.836
 unde+2.769
Neg logit contributions
-3.600
INST-3.270
aternal-3.230
agy-3.209
iotics-3.186
Token,
Token ID11
Feature activation+2.20e-01

Pos logit contributions
 comprehension+3.106
 stagn+2.890
tesy+2.857
abeth+2.798
 unde+2.740
Neg logit contributions
-3.726
INST-3.403
aternal-3.258
agy-3.257
iotics-3.231
Token 2016
Token ID1584
Feature activation+7.26e-01

Pos logit contributions
 comprehension+1.648
tesy+1.541
 stagn+1.531
abeth+1.494
 unde+1.471
Neg logit contributions
-1.316
INST-1.186
aternal-1.141
agy-1.123
iotics-1.093
Token.
Token ID13
Feature activation+1.29e+00

Pos logit contributions
 comprehension+4.266
tesy+4.014
 stagn+3.896
abeth+3.754
ector+3.708
Neg logit contributions
-5.373
INST-4.784
agy-4.732
aternal-4.616
iotics-4.550
Token REUTERS
Token ID15862
Feature activation+2.41e-01

Pos logit contributions
 comprehension+4.595
tesy+4.259
CTV+4.134
 Stru+4.070
 Citation+4.022
Neg logit contributions
-11.166
agy-10.566
aternal-10.378
iotics-10.277
INST-10.042
Token/
Token ID14
Feature activation+1.27e+00

Pos logit contributions
 comprehension+1.100
tesy+1.000
 stagn+0.962
abeth+0.937
ector+0.928
Neg logit contributions
-2.137
INST-1.992
aternal-1.949
agy-1.934
iotics-1.919
TokenNick
Token ID23609
Feature activation+4.06e-01

Pos logit contributions
CTV+7.329
 comprehension+7.174
abeth+6.959
tesy+6.931
ector+6.870
Neg logit contributions
-8.355
aternal-7.628
iotics-7.564
 Rabbit-7.199
agy-7.063
Token Oxford
Token ID13643
Feature activation+5.27e-01

Pos logit contributions
 comprehension+2.707
tesy+2.564
abeth+2.510
ector+2.473
 stagn+2.466
Neg logit contributions
-2.636
INST-2.508
aternal-2.333
agy-2.302
iotics-2.240
Token/
Token ID14
Feature activation+9.36e-01

Pos logit contributions
 comprehension+3.445
tesy+3.245
 stagn+3.157
abeth+3.072
ector+3.058
Neg logit contributions
-3.592
INST-3.332
aternal-3.159
agy-3.123
 Rabbit-3.067
TokenFile
Token ID8979
Feature activation+1.43e-01

Pos logit contributions
 comprehension+4.307
tesy+4.012
 stagn+3.809
CTV+3.727
ector+3.674
Neg logit contributions
-7.797
iotics-7.000
aternal-6.987
agy-6.928
INST-6.921
Token Photo
Token ID5555
Feature activation+4.67e-01

Pos logit contributions
 comprehension+1.293
tesy+1.244
 stagn+1.214
abeth+1.192
ector+1.181
Neg logit contributions
-0.629
INST-0.563
agy-0.521
aternal-0.519
iotics-0.491
Token self
Token ID2116
Feature activation+6.05e-01

Pos logit contributions
 comprehension+2.240
 stagn+1.870
tesy+1.763
 unde+1.758
ector+1.736
Neg logit contributions
-6.095
INST-5.707
aternal-5.552
agy-5.503
iotics-5.422
Token.
Token ID764
Feature activation+1.68e-01

Pos logit contributions
 comprehension+3.584
 stagn+3.196
tesy+3.195
abeth+3.119
 unde+3.073
Neg logit contributions
-4.557
INST-4.148
aternal-3.984
agy-3.969
iotics-3.955
Tokendepth
Token ID18053
Feature activation-5.04e-01

Pos logit contributions
 comprehension+0.741
tesy+0.664
 stagn+0.649
abeth+0.618
ector+0.609
Neg logit contributions
-1.516
INST-1.419
agy-1.382
aternal-1.378
 Rabbit-1.372
Token,
Token ID837
Feature activation+3.48e-01

Pos logit contributions
+3.165
INST+2.932
aternal+2.817
agy+2.783
iotics+2.704
Neg logit contributions
 comprehension-3.512
tesy-3.345
 stagn-3.278
abeth-3.202
ector-3.133
Token check
Token ID2198
Feature activation+1.20e-01

Pos logit contributions
 comprehension+2.143
 stagn+1.923
tesy+1.895
 unde+1.839
ector+1.799
Neg logit contributions
-2.605
INST-2.408
aternal-2.309
agy-2.297
 Rabbit-2.263
Token :
Token ID1058
Feature activation+1.27e+00

Pos logit contributions
 comprehension+0.797
tesy+0.737
 stagn+0.729
abeth+0.716
ector+0.708
Neg logit contributions
-0.820
INST-0.750
aternal-0.722
agy-0.721
 Rabbit-0.703
Token self
Token ID2116
Feature activation+6.27e-01

Pos logit contributions
 comprehension+7.015
 stagn+6.057
 unde+6.056
ector+6.012
tesy+5.656
Neg logit contributions
-9.536
INST-8.681
agy-8.459
aternal-8.423
iotics-8.169
Token.
Token ID764
Feature activation+6.89e-01

Pos logit contributions
 comprehension+4.042
tesy+3.744
 stagn+3.629
abeth+3.614
ector+3.540
Neg logit contributions
-4.384
INST-4.022
agy-3.825
aternal-3.816
 Rabbit-3.759
Tokencheck
Token ID9122
Feature activation+9.10e-02

Pos logit contributions
 comprehension+4.824
tesy+4.519
 stagn+4.367
ector+4.333
abeth+4.303
Neg logit contributions
-4.266
INST-3.739
agy-3.714
aternal-3.668
 Cosponsors-3.664
Token,
Token ID837
Feature activation+2.82e-01

Pos logit contributions
 comprehension+0.644
tesy+0.603
 stagn+0.593
abeth+0.582
ector+0.571
Neg logit contributions
-0.582
INST-0.538
aternal-0.509
agy-0.507
 Rabbit-0.498
Token prev
Token ID8654
Feature activation-6.87e-01

Pos logit contributions
 comprehension+1.636
 stagn+1.457
tesy+1.440
 unde+1.390
ector+1.354
Neg logit contributions
-2.203
INST-2.050
aternal-1.971
agy-1.960
 Rabbit-1.928
Tokenv
Token ID85
Feature activation+7.95e-01

Pos logit contributions
 comprehension+2.248
tesy+2.120
 stagn+2.109
abeth+2.052
ector+2.048
Neg logit contributions
-1.901
INST-1.663
aternal-1.574
 Rabbit-1.571
 Cosponsors-1.539
Token2
Token ID17
Feature activation+5.38e-01

Pos logit contributions
 comprehension+4.947
tesy+4.620
 stagn+4.491
CTV+4.439
ector+4.432
Neg logit contributions
-5.547
INST-4.852
iotics-4.749
agy-4.735
aternal-4.717
Token.
Token ID764
Feature activation+7.31e-01

Pos logit contributions
 comprehension+3.132
 stagn+2.828
tesy+2.807
ector+2.727
 unde+2.717
Neg logit contributions
-4.172
INST-3.759
aternal-3.620
agy-3.604
iotics-3.585
Token c
Token ID269
Feature activation-4.61e-02

Pos logit contributions
 comprehension+4.143
 stagn+3.694
 unde+3.639
tesy+3.570
ector+3.461
Neg logit contributions
-5.884
INST-5.366
aternal-5.162
agy-5.160
iotics-5.078
Tokenvt
Token ID36540
Feature activation+7.97e-01

Pos logit contributions
+0.341
INST+0.319
aternal+0.305
agy+0.302
 Rabbit+0.300
Neg logit contributions
 comprehension-0.278
tesy-0.260
 stagn-0.254
abeth-0.247
ector-0.242
TokenColor
Token ID10258
Feature activation+1.27e+00

Pos logit contributions
 comprehension+4.881
tesy+4.421
ector+4.371
abeth+4.352
 stagn+4.327
Neg logit contributions
-5.717
INST-5.295
agy-4.960
aternal-4.958
iotics-4.932
Token (
Token ID357
Feature activation+5.76e-01

Pos logit contributions
 comprehension+5.280
ector+4.436
abeth+4.425
tesy+4.411
 stagn+4.385
Neg logit contributions
-10.773
INST-9.919
iotics-9.757
aternal-9.738
agy-9.550
Token image
Token ID2939
Feature activation+2.41e-01

Pos logit contributions
 comprehension+2.400
 stagn+2.074
 unde+1.974
tesy+1.974
ector+1.865
Neg logit contributions
-5.518
INST-5.111
aternal-4.952
agy-4.906
 Cosponsors-4.859
Token,
Token ID837
Feature activation+8.98e-01

Pos logit contributions
 comprehension+1.681
tesy+1.565
 stagn+1.545
abeth+1.494
ector+1.486
Neg logit contributions
-1.555
INST-1.423
aternal-1.383
agy-1.358
iotics-1.342
Token c
Token ID269
Feature activation-1.27e-01

Pos logit contributions
 comprehension+4.547
 stagn+4.107
 unde+3.843
tesy+3.729
ector+3.678
Neg logit contributions
-7.643
INST-7.083
aternal-6.855
agy-6.842
iotics-6.744
Tokenv
Token ID85
Feature activation+4.98e-01

Pos logit contributions
+0.670
INST+0.611
aternal+0.574
agy+0.568
 Rabbit+0.555
Neg logit contributions
 comprehension-1.037
tesy-0.985
 stagn-0.971
abeth-0.952
ector-0.935
Tokenirect
Token ID1060
Feature activation+5.39e-02

Pos logit contributions
+0.532
INST+0.495
aternal+0.468
agy+0.467
 Rabbit+0.454
Neg logit contributions
 comprehension-0.550
tesy-0.511
 stagn-0.508
abeth-0.498
ector-0.489
Tokened
Token ID276
Feature activation+3.00e-01

Pos logit contributions
 comprehension+0.355
tesy+0.331
 stagn+0.325
abeth+0.317
ector+0.312
Neg logit contributions
-0.369
INST-0.344
aternal-0.327
agy-0.325
iotics-0.321
Token edges
Token ID13015
Feature activation+1.93e-01

Pos logit contributions
 comprehension+1.961
tesy+1.813
 stagn+1.771
ector+1.713
abeth+1.708
Neg logit contributions
-2.048
INST-1.912
aternal-1.832
agy-1.832
iotics-1.797
Token *)
Token ID31936
Feature activation-3.05e-01

Pos logit contributions
 comprehension+1.292
tesy+1.204
 stagn+1.184
abeth+1.150
ector+1.145
Neg logit contributions
-1.293
INST-1.211
aternal-1.155
agy-1.148
iotics-1.123
Token graph
Token ID4823
Feature activation-1.36e-01

Pos logit contributions
+2.156
INST+2.015
aternal+1.910
 Rabbit+1.894
agy+1.885
Neg logit contributions
 comprehension-1.897
tesy-1.818
 stagn-1.752
abeth-1.732
ector-1.659
Token =
Token ID796
Feature activation+1.26e+00

Pos logit contributions
+1.094
INST+1.031
aternal+0.991
agy+0.984
 Rabbit+0.971
Neg logit contributions
 comprehension-0.729
tesy-0.674
 stagn-0.659
abeth-0.639
ector-0.620
Token Fl
Token ID1610
Feature activation+5.12e-01

Pos logit contributions
 comprehension+8.418
 stagn+7.541
 unde+7.462
 Stru+7.356
tesy+7.356
Neg logit contributions
-7.673
INST-7.213
agy-7.039
iotics-7.032
aternal-7.030
Tokenatten
Token ID41769
Feature activation+9.30e-01

Pos logit contributions
 comprehension+1.049
tesy+0.915
 stagn+0.803
ector+0.802
abeth+0.772
Neg logit contributions
-5.779
INST-5.404
 Rabbit-5.331
iotics-5.235
aternal-5.189
Token[
Token ID58
Feature activation+9.57e-01

Pos logit contributions
 comprehension+4.055
tesy+3.546
ector+3.435
 stagn+3.433
abeth+3.330
Neg logit contributions
-7.924
INST-7.550
iotics-7.424
aternal-7.389
agy-7.303
Token Table
Token ID8655
Feature activation+1.43e+00

Pos logit contributions
 comprehension+6.268
tesy+5.676
 stagn+5.647
ector+5.590
 unde+5.461
Neg logit contributions
-6.034
INST-5.612
agy-5.462
aternal-5.457
iotics-5.388
Token[
Token ID58
Feature activation+1.17e+00

Pos logit contributions
 comprehension+5.336
ector+4.527
tesy+4.515
 stagn+4.240
abeth+4.061
Neg logit contributions
-12.608
INST-11.747
aternal-11.721
iotics-11.681
agy-11.497
Token for
Token ID329
Feature activation+5.94e-01

Pos logit contributions
 comprehension+1.041
 stagn+0.939
tesy+0.929
abeth+0.888
 unde+0.883
Neg logit contributions
-1.421
INST-1.336
agy-1.282
aternal-1.279
iotics-1.251
Token the
Token ID262
Feature activation+3.23e-01

Pos logit contributions
 comprehension+3.456
 stagn+3.079
tesy+2.945
 unde+2.932
abeth+2.875
Neg logit contributions
-4.590
INST-4.248
agy-4.174
aternal-4.024
-3.968
Token technical
Token ID6276
Feature activation+3.73e-01

Pos logit contributions
 comprehension+2.070
 stagn+1.891
tesy+1.861
abeth+1.783
 unde+1.770
Neg logit contributions
-2.308
INST-2.145
agy-2.075
aternal-2.053
iotics-1.986
Token colleges
Token ID15273
Feature activation+5.64e-02

Pos logit contributions
 comprehension+2.162
 stagn+1.956
tesy+1.891
abeth+1.837
 unde+1.819
Neg logit contributions
-2.901
INST-2.697
agy-2.610
aternal-2.588
 Rabbit-2.534
Token.
Token ID13
Feature activation+2.54e-02

Pos logit contributions
 comprehension+0.377
tesy+0.352
 stagn+0.348
abeth+0.338
ector+0.332
Neg logit contributions
-0.381
INST-0.355
aternal-0.339
agy-0.337
 Rabbit-0.330
Token Thanks
Token ID6930
Feature activation+1.26e+00

Pos logit contributions
 comprehension+0.198
tesy+0.187
 stagn+0.183
abeth+0.181
ector+0.177
Neg logit contributions
-0.141
INST-0.129
aternal-0.124
agy-0.123
iotics-0.119
Token to
Token ID284
Feature activation+1.62e-01

Pos logit contributions
 comprehension+5.491
tesy+5.116
 stagn+4.973
abeth+4.789
 unde+4.580
Neg logit contributions
-10.205
INST-9.734
agy-9.079
 Rabbit-9.070
aternal-8.922
Token Abrams
Token ID32414
Feature activation+1.51e-01

Pos logit contributions
 comprehension+1.037
 stagn+0.950
tesy+0.949
abeth+0.916
ector+0.897
Neg logit contributions
-1.152
INST-1.068
aternal-1.023
agy-1.020
 Rabbit-0.991
Token
Token ID447
Feature activation+1.45e-01

Pos logit contributions
 comprehension+1.032
tesy+0.965
 stagn+0.952
abeth+0.930
ector+0.909
Neg logit contributions
-0.989
INST-0.923
aternal-0.879
agy-0.874
 Rabbit-0.859
Token
Token ID247
Feature activation+1.80e-01

Pos logit contributions
 comprehension+1.064
tesy+1.005
 stagn+0.988
abeth+0.965
ector+0.945
Neg logit contributions
-0.885
INST-0.818
aternal-0.772
agy-0.771
 Rabbit-0.754
Token maneuver
Token ID20186
Feature activation+2.85e-01

Pos logit contributions
 comprehension+1.314
tesy+1.235
 stagn+1.218
abeth+1.185
ector+1.161
Neg logit contributions
-1.092
INST-1.014
aternal-0.956
agy-0.955
 Rabbit-0.942
Token send
Token ID3758
Feature activation+5.60e-01

Pos logit contributions
 comprehension+2.410
tesy+2.207
 stagn+2.200
 unde+2.038
abeth+2.027
Neg logit contributions
-4.498
INST-4.281
aternal-4.100
agy-4.066
iotics-4.054
Token you
Token ID345
Feature activation+5.86e-01

Pos logit contributions
 comprehension+2.512
 stagn+2.259
tesy+2.218
 unde+2.104
abeth+2.099
Neg logit contributions
-4.981
INST-4.689
aternal-4.523
iotics-4.420
agy-4.405
Token three
Token ID1115
Feature activation+3.55e-01

Pos logit contributions
 comprehension+2.829
 stagn+2.611
tesy+2.573
 unde+2.435
CTV+2.392
Neg logit contributions
-5.072
INST-4.710
aternal-4.530
iotics-4.484
agy-4.436
Token meaningful
Token ID11570
Feature activation+6.30e-01

Pos logit contributions
 comprehension+2.458
tesy+2.231
 stagn+2.221
 unde+2.146
abeth+2.114
Neg logit contributions
-2.375
INST-2.213
agy-2.113
aternal-2.105
 Rabbit-2.069
Token actions
Token ID4028
Feature activation+3.01e-01

Pos logit contributions
 comprehension+2.136
 stagn+1.887
tesy+1.782
 unde+1.774
abeth+1.645
Neg logit contributions
-6.365
INST-5.985
aternal-5.698
agy-5.673
 Rabbit-5.604
Token you
Token ID345
Feature activation+1.25e+00

Pos logit contributions
 comprehension+1.469
 stagn+1.317
tesy+1.304
abeth+1.243
 unde+1.230
Neg logit contributions
-2.585
INST-2.434
aternal-2.358
agy-2.307
iotics-2.299
Token can
Token ID460
Feature activation+2.25e-01

Pos logit contributions
 comprehension+4.663
 stagn+4.136
 unde+3.665
 subt+3.556
CTV+3.459
Neg logit contributions
-11.397
INST-10.822
aternal-10.462
agy-10.287
iotics-10.236
Token take
Token ID1011
Feature activation+2.57e-02

Pos logit contributions
 comprehension+1.295
 stagn+1.189
tesy+1.187
abeth+1.128
 unde+1.118
Neg logit contributions
-1.734
INST-1.615
aternal-1.551
agy-1.534
iotics-1.515
Token each
Token ID1123
Feature activation-1.83e-02

Pos logit contributions
 comprehension+0.183
tesy+0.172
 stagn+0.169
abeth+0.165
ector+0.162
Neg logit contributions
-0.162
INST-0.150
aternal-0.143
agy-0.141
 Rabbit-0.139
Token week
Token ID1285
Feature activation+1.22e-01

Pos logit contributions
+0.115
INST+0.106
aternal+0.101
agy+0.100
 Rabbit+0.098
Neg logit contributions
 comprehension-0.132
tesy-0.124
 stagn-0.123
abeth-0.119
ector-0.117
Token.
Token ID13
Feature activation+6.59e-01

Pos logit contributions
 comprehension+0.773
tesy+0.724
 stagn+0.709
abeth+0.695
ector+0.683
Neg logit contributions
-0.862
INST-0.786
aternal-0.762
iotics-0.745
agy-0.745
Token other
Token ID584
Feature activation-1.36e-01

Pos logit contributions
+0.978
INST+0.914
aternal+0.874
agy+0.868
 Rabbit+0.854
Neg logit contributions
 comprehension-0.866
tesy-0.810
 stagn-0.794
abeth-0.774
ector-0.756
Token heavy
Token ID4334
Feature activation+1.07e-01

Pos logit contributions
+1.049
INST+0.986
aternal+0.946
agy+0.939
 Rabbit+0.927
Neg logit contributions
 comprehension-0.781
tesy-0.726
 stagn-0.709
abeth-0.691
ector-0.673
Token metals
Token ID21782
Feature activation+1.69e-02

Pos logit contributions
 comprehension+0.470
tesy+0.424
 stagn+0.419
abeth+0.392
 unde+0.391
Neg logit contributions
-0.966
INST-0.917
aternal-0.883
agy-0.882
iotics-0.865
Token.
Token ID13
Feature activation+5.80e-01

Pos logit contributions
 comprehension+0.113
tesy+0.106
 stagn+0.105
abeth+0.102
 unde+0.100
Neg logit contributions
-0.114
INST-0.106
aternal-0.101
agy-0.099
 Rabbit-0.098
Token REUTERS
Token ID15862
Feature activation+1.26e-01

Pos logit contributions
 comprehension+3.927
tesy+3.702
 stagn+3.633
abeth+3.531
 Stru+3.524
Neg logit contributions
-3.651
INST-3.284
aternal-3.264
agy-3.200
iotics-3.177
Token/
Token ID14
Feature activation+1.23e+00

Pos logit contributions
 comprehension+0.645
tesy+0.592
 stagn+0.576
abeth+0.559
ector+0.545
Neg logit contributions
-1.047
INST-0.985
aternal-0.952
agy-0.945
 Rabbit-0.930
TokenB
Token ID33
Feature activation+4.61e-01

Pos logit contributions
 comprehension+7.994
CTV+7.768
tesy+7.675
abeth+7.568
ector+7.556
Neg logit contributions
-6.950
aternal-6.277
iotics-6.125
 Rabbit-5.888
INST-5.807
Tokenobby
Token ID11369
Feature activation+4.99e-01

Pos logit contributions
 comprehension+2.871
tesy+2.682
abeth+2.627
ector+2.603
 stagn+2.597
Neg logit contributions
-3.262
INST-2.952
 Rabbit-2.843
aternal-2.822
iotics-2.746
Token Y
Token ID575
Feature activation+4.42e-01

Pos logit contributions
 comprehension+4.228
tesy+4.058
 stagn+3.983
abeth+3.963
 Stru+3.864
Neg logit contributions
-2.404
INST-2.159
aternal-2.056
agy-1.963
iotics-1.953
Tokenip
Token ID541
Feature activation+7.01e-01

Pos logit contributions
 comprehension+2.465
tesy+2.321
 stagn+2.212
abeth+2.178
ector+2.167
Neg logit contributions
-3.478
INST-3.251
aternal-3.081
iotics-3.025
 Rabbit-3.023
Token
Token ID198
Feature activation+5.44e-01

Pos logit contributions
 comprehension+4.867
tesy+4.601
 stagn+4.533
abeth+4.428
ector+4.396
Neg logit contributions
-4.284
INST-3.855
aternal-3.714
agy-3.699
iotics-3.665
Token plunged
Token ID31460
Feature activation-3.75e-01

Pos logit contributions
+0.501
INST+0.470
aternal+0.451
agy+0.447
 Rabbit+0.441
Neg logit contributions
 comprehension-0.393
tesy-0.365
 stagn-0.358
abeth-0.348
ector-0.339
Token more
Token ID517
Feature activation-4.61e-01

Pos logit contributions
+2.562
INST+2.395
aternal+2.292
agy+2.269
 Rabbit+2.260
Neg logit contributions
 comprehension-2.413
tesy-2.301
abeth-2.190
 stagn-2.190
ector-2.132
Token than
Token ID621
Feature activation+3.82e-01

Pos logit contributions
+2.865
INST+2.667
aternal+2.529
 Rabbit+2.486
agy+2.464
Neg logit contributions
 comprehension-3.266
tesy-3.120
 stagn-2.990
abeth-2.972
ector-2.908
Token 10
Token ID838
Feature activation-1.07e-01

Pos logit contributions
 comprehension+2.330
 stagn+2.166
tesy+2.071
abeth+2.036
 unde+2.028
Neg logit contributions
-2.842
INST-2.633
agy-2.540
aternal-2.515
iotics-2.480
Token percent
Token ID1411
Feature activation-5.98e-02

Pos logit contributions
+0.691
INST+0.642
aternal+0.610
agy+0.609
 Rabbit+0.594
Neg logit contributions
 comprehension-0.747
tesy-0.697
 stagn-0.692
abeth-0.674
ector-0.662
Token after
Token ID706
Feature activation+1.23e+00

Pos logit contributions
+0.411
INST+0.383
aternal+0.365
agy+0.363
 Rabbit+0.356
Neg logit contributions
 comprehension-0.392
tesy-0.367
 stagn-0.362
abeth-0.351
ector-0.344
Token the
Token ID262
Feature activation+8.39e-01

Pos logit contributions
 comprehension+6.685
 stagn+5.975
abeth+5.798
 unde+5.666
ector+5.630
Neg logit contributions
-9.327
INST-8.693
agy-8.200
aternal-8.032
iotics-8.003
Token FDA
Token ID14672
Feature activation+2.37e-01

Pos logit contributions
 comprehension+4.634
 stagn+4.157
 unde+4.021
abeth+3.915
tesy+3.799
Neg logit contributions
-6.535
INST-6.116
agy-5.834
aternal-5.798
iotics-5.695
Token's
Token ID338
Feature activation+4.10e-01

Pos logit contributions
 comprehension+1.568
 stagn+1.448
tesy+1.427
abeth+1.387
 unde+1.377
Neg logit contributions
-1.630
INST-1.500
aternal-1.438
agy-1.420
iotics-1.402
Token announcement
Token ID8009
Feature activation+4.90e-01

Pos logit contributions
 comprehension+2.315
 stagn+2.091
tesy+2.019
abeth+1.997
 unde+1.991
Neg logit contributions
-3.216
INST-3.012
agy-2.863
aternal-2.862
iotics-2.799
Token.
Token ID13
Feature activation+2.59e-01

Pos logit contributions
 comprehension+3.086
 stagn+2.876
tesy+2.779
abeth+2.741
 unde+2.703
Neg logit contributions
-3.495
INST-3.244
aternal-3.056
agy-3.037
iotics-2.995
Token 2016
Token ID1584
Feature activation+6.55e-02

Pos logit contributions
 comprehension+2.276
 stagn+2.055
tesy+2.050
 unde+1.970
abeth+1.955
Neg logit contributions
-2.874
INST-2.663
aternal-2.576
agy-2.544
 Rabbit-2.465
Token primary
Token ID4165
Feature activation+3.74e-01

Pos logit contributions
 comprehension+0.441
tesy+0.414
 stagn+0.405
abeth+0.398
ector+0.389
Neg logit contributions
-0.437
INST-0.407
aternal-0.387
agy-0.383
 Rabbit-0.379
Token Sanders
Token ID5831
Feature activation+6.02e-01

Pos logit contributions
 comprehension+2.291
 stagn+2.136
tesy+2.135
abeth+2.028
ector+2.020
Neg logit contributions
-2.708
INST-2.537
aternal-2.403
agy-2.372
 Rabbit-2.349
Token:
Token ID25
Feature activation+6.91e-01

Pos logit contributions
 comprehension+3.451
 stagn+3.220
tesy+3.167
abeth+3.126
 unde+3.046
Neg logit contributions
-4.612
INST-4.223
aternal-4.074
agy-4.037
 Rabbit-4.008
Token '
Token ID705
Feature activation+4.51e-01

Pos logit contributions
 comprehension+3.432
 stagn+3.278
tesy+3.261
abeth+3.146
 unde+3.111
Neg logit contributions
-5.734
INST-5.235
agy-4.956
-4.771
bol-4.720
TokenDamn
Token ID43343
Feature activation+1.22e+00

Pos logit contributions
 comprehension+2.324
ector+2.257
tesy+2.209
 stagn+2.202
abeth+2.139
Neg logit contributions
-3.571
INST-3.135
agy-3.129
 Rabbit-3.113
aternal-3.109
Token right
Token ID826
Feature activation+4.01e-01

Pos logit contributions
 comprehension+6.983
 stagn+6.220
 manipulative+5.959
tesy+5.941
ector+5.906
Neg logit contributions
-8.863
INST-8.645
aternal-8.385
agy-8.234
 Cosponsors-8.161
Token'
Token ID6
Feature activation+4.03e-02

Pos logit contributions
 comprehension+2.373
tesy+2.171
 stagn+2.161
ector+2.091
abeth+2.062
Neg logit contributions
-2.991
INST-2.826
aternal-2.718
agy-2.688
 Rabbit-2.611
Token I
Token ID314
Feature activation+6.07e-01

Pos logit contributions
 comprehension+0.289
tesy+0.275
 stagn+0.268
abeth+0.264
ector+0.258
Neg logit contributions
-0.250
INST-0.233
aternal-0.220
agy-0.218
 Rabbit-0.215
Token'll
Token ID1183
Feature activation+2.67e-01

Pos logit contributions
 comprehension+3.764
tesy+3.534
 stagn+3.506
abeth+3.349
ector+3.318
Neg logit contributions
-4.330
INST-4.061
aternal-3.857
 Rabbit-3.818
iotics-3.809
Token make
Token ID787
Feature activation+1.02e-01

Pos logit contributions
 comprehension+1.456
tesy+1.325
 stagn+1.318
abeth+1.273
ector+1.243
Neg logit contributions
-2.140
INST-1.995
aternal-1.928
agy-1.911
 Rabbit-1.893
Token extra
Token ID3131
Feature activation+1.74e-01

Pos logit contributions
+0.998
INST+0.931
aternal+0.891
agy+0.884
 Rabbit+0.870
Neg logit contributions
 comprehension-0.887
tesy-0.830
 stagn-0.814
abeth-0.792
ector-0.774
Token level
Token ID1241
Feature activation-4.71e-01

Pos logit contributions
 comprehension+0.894
 stagn+0.799
tesy+0.792
 unde+0.740
abeth+0.734
Neg logit contributions
-1.487
INST-1.373
agy-1.325
aternal-1.322
iotics-1.307
Token of
Token ID286
Feature activation+2.27e-01

Pos logit contributions
+3.102
INST+2.917
agy+2.781
aternal+2.758
 Rabbit+2.750
Neg logit contributions
 comprehension-3.091
tesy-2.947
abeth-2.860
 stagn-2.856
ector-2.751
Token safety
Token ID3747
Feature activation-1.06e+00

Pos logit contributions
 comprehension+1.235
 stagn+1.117
tesy+1.101
ector+1.028
 unde+1.025
Neg logit contributions
-1.837
INST-1.733
aternal-1.661
agy-1.636
iotics-1.630
Token!
Token ID0
Feature activation-8.03e-01

Pos logit contributions
+5.622
INST+5.254
agy+4.820
 Rabbit+4.819
aternal+4.775
Neg logit contributions
 comprehension-8.034
 stagn-7.626
tesy-7.549
abeth-7.457
ector-7.205
Token
Token ID851
Feature activation-1.48e+00

Pos logit contributions
+4.378
INST+4.077
 Rabbit+3.815
agy+3.774
aternal+3.755
Neg logit contributions
 comprehension-6.107
tesy-5.827
 stagn-5.784
abeth-5.649
ector-5.516
Token Donald
Token ID3759
Feature activation-7.82e-01

Pos logit contributions
+5.882
INST+5.637
+5.161
 Rabbit+5.035
aternal+4.984
Neg logit contributions
 comprehension-12.266
tesy-11.782
 stagn-11.718
abeth-11.351
CTV-11.197
Token J
Token ID449
Feature activation-6.95e-02

Pos logit contributions
+3.051
INST+2.740
 Rabbit+2.592
agy+2.405
+2.368
Neg logit contributions
 comprehension-7.301
tesy-6.951
 stagn-6.933
abeth-6.810
ector-6.692
Token.
Token ID13
Feature activation-4.55e-01

Pos logit contributions
+0.461
INST+0.426
aternal+0.406
agy+0.402
iotics+0.395
Neg logit contributions
 comprehension-0.473
tesy-0.444
 stagn-0.435
abeth-0.426
ector-0.420
Token Trump
Token ID1301
Feature activation-7.25e-01

Pos logit contributions
+2.298
INST+2.116
 Rabbit+1.953
agy+1.952
aternal+1.950
Neg logit contributions
 comprehension-3.792
tesy-3.614
 stagn-3.564
abeth-3.486
ector-3.407
Token (@
Token ID4275
Feature activation-7.13e-01

Pos logit contributions
+3.667
INST+3.383
agy+3.114
aternal+3.098
 Rabbit+3.083
Neg logit contributions
 comprehension-5.936
tesy-5.639
 stagn-5.592
abeth-5.537
ector-5.372
Token all
Token ID477
Feature activation+3.26e-01

Pos logit contributions
 comprehension+1.553
tesy+1.426
 stagn+1.399
abeth+1.376
 unde+1.345
Neg logit contributions
-1.770
INST-1.634
agy-1.548
aternal-1.544
iotics-1.537
Token episodes
Token ID8640
Feature activation+3.05e-01

Pos logit contributions
 comprehension+1.910
tesy+1.738
 stagn+1.700
abeth+1.678
 unde+1.649
Neg logit contributions
-2.489
INST-2.334
agy-2.219
aternal-2.203
iotics-2.200
Token of
Token ID286
Feature activation+1.52e-01

Pos logit contributions
 comprehension+1.725
 stagn+1.583
tesy+1.572
abeth+1.530
 unde+1.489
Neg logit contributions
-2.340
INST-2.202
aternal-2.130
agy-2.111
iotics-2.073
Token 19
Token ID678
Feature activation-1.55e-02

Pos logit contributions
 comprehension+1.065
tesy+0.985
 stagn+0.974
abeth+0.965
ector+0.939
Neg logit contributions
-0.988
INST-0.887
agy-0.860
aternal-0.844
iotics-0.833
Token Kids
Token ID17476
Feature activation-5.40e-01

Pos logit contributions
+0.099
INST+0.091
agy+0.087
aternal+0.087
iotics+0.084
Neg logit contributions
 comprehension-0.110
tesy-0.103
 stagn-0.101
abeth-0.099
ector-0.097
Token and
Token ID290
Feature activation-1.35e+00

Pos logit contributions
+2.948
INST+2.746
agy+2.548
 Rabbit+2.527
aternal+2.526
Neg logit contributions
 comprehension-4.205
tesy-3.978
 stagn-3.962
abeth-3.892
ector-3.784
Token Count
Token ID2764
Feature activation-9.81e-01

Pos logit contributions
+5.640
 Ü+5.476
 Byrne+5.391
INST+5.278
+5.225
Neg logit contributions
 comprehension-11.309
tesy-10.923
 stagn-10.581
abeth-10.194
 challeng-9.696
Tokening
Token ID278
Feature activation-6.63e-01

Pos logit contributions
+5.574
INST+5.290
 Rabbit+5.080
agy+4.898
aternal+4.861
Neg logit contributions
 comprehension-7.106
tesy-6.759
 stagn-6.706
abeth-6.420
ector-6.350
Token from
Token ID422
Feature activation+4.99e-01

Pos logit contributions
+4.205
INST+3.959
aternal+3.712
 Rabbit+3.696
agy+3.667
Neg logit contributions
 comprehension-4.516
tesy-4.312
 stagn-4.199
abeth-4.121
ector-4.002
Token the
Token ID262
Feature activation+3.95e-01

Pos logit contributions
 comprehension+3.164
tesy+2.882
 stagn+2.834
 unde+2.738
abeth+2.726
Neg logit contributions
-3.537
INST-3.199
agy-3.172
aternal-3.109
iotics-3.092
Token air
Token ID1633
Feature activation-7.09e-02

Pos logit contributions
 comprehension+2.516
 stagn+2.315
tesy+2.276
abeth+2.192
 unde+2.184
Neg logit contributions
-2.802
INST-2.575
agy-2.519
aternal-2.489
iotics-2.437
Token facts
Token ID6419
Feature activation-5.41e-02

Pos logit contributions
+4.158
INST+3.911
aternal+3.703
 Rabbit+3.673
+3.656
Neg logit contributions
 comprehension-4.284
tesy-4.250
abeth-4.104
 stagn-4.070
ector-3.959
Token.
Token ID13
Feature activation-8.05e-02

Pos logit contributions
+0.363
INST+0.338
aternal+0.322
agy+0.320
 Rabbit+0.315
Neg logit contributions
 comprehension-0.365
tesy-0.341
 stagn-0.336
abeth-0.327
ector-0.320
Token This
Token ID770
Feature activation-2.47e-01

Pos logit contributions
+0.466
INST+0.430
aternal+0.411
agy+0.404
iotics+0.397
Neg logit contributions
 comprehension-0.612
tesy-0.577
 stagn-0.565
abeth-0.557
ector-0.547
Token is
Token ID318
Feature activation-2.19e-01

Pos logit contributions
+1.631
INST+1.514
aternal+1.443
agy+1.431
 Rabbit+1.411
Neg logit contributions
 comprehension-1.676
tesy-1.586
 stagn-1.549
abeth-1.522
ector-1.488
Token as
Token ID355
Feature activation-3.00e-01

Pos logit contributions
+1.536
INST+1.432
aternal+1.368
agy+1.359
 Rabbit+1.343
Neg logit contributions
 comprehension-1.385
tesy-1.307
 stagn-1.272
abeth-1.252
ector-1.221
Token far
Token ID1290
Feature activation-1.27e+00

Pos logit contributions
+2.248
INST+2.103
aternal+2.023
agy+2.003
 Rabbit+1.983
Neg logit contributions
 comprehension-1.749
tesy-1.654
 stagn-1.594
abeth-1.566
ector-1.532
Token as
Token ID355
Feature activation-1.92e-01

Pos logit contributions
+6.782
INST+6.028
agy+5.932
aternal+5.780
+5.669
Neg logit contributions
 comprehension-8.966
tesy-8.896
abeth-8.651
 stagn-8.532
rha-8.426
Token guess
Token ID4724
Feature activation-2.65e-01

Pos logit contributions
+1.342
INST+1.253
aternal+1.198
agy+1.189
 Rabbit+1.175
Neg logit contributions
 comprehension-1.218
tesy-1.148
 stagn-1.122
abeth-1.099
ector-1.073
Tokenwork
Token ID1818
Feature activation+1.40e-01

Pos logit contributions
+1.732
INST+1.609
aternal+1.532
agy+1.521
 Rabbit+1.495
Neg logit contributions
 comprehension-1.813
tesy-1.707
 stagn-1.677
abeth-1.641
ector-1.607
Token can
Token ID460
Feature activation+3.65e-01

Pos logit contributions
 comprehension+1.006
 stagn+0.930
tesy+0.922
abeth+0.890
 unde+0.876
Neg logit contributions
-0.891
INST-0.828
aternal-0.781
agy-0.772
 Rabbit-0.755
Token and
Token ID290
Feature activation-1.41e-01

Pos logit contributions
 comprehension+2.130
 stagn+1.971
tesy+1.961
 unde+1.854
abeth+1.831
Neg logit contributions
-2.799
INST-2.637
aternal-2.482
 Cosponsors-2.480
agy-2.457
Token persona
Token ID27822
Feature activation-4.07e-02

Pos logit contributions
+0.415
INST+0.389
aternal+0.374
agy+0.370
 Rabbit+0.364
Neg logit contributions
 comprehension-0.319
tesy-0.294
 stagn-0.290
abeth-0.279
ector-0.273
Token is
Token ID318
Feature activation-4.58e-01

Pos logit contributions
+0.268
INST+0.249
aternal+0.238
agy+0.234
 Rabbit+0.229
Neg logit contributions
 comprehension-0.281
 stagn-0.263
tesy-0.262
abeth-0.252
 unde-0.248
Token so
Token ID523
Feature activation-2.83e-01

Pos logit contributions
+3.355
INST+3.150
aternal+3.005
agy+2.992
 Rabbit+2.971
Neg logit contributions
 comprehension-2.712
tesy-2.573
abeth-2.461
 stagn-2.453
ector-2.387
Token vastly
Token ID22056
Feature activation-3.17e-01

Pos logit contributions
+2.141
INST+2.012
aternal+1.927
agy+1.913
 Rabbit+1.888
Neg logit contributions
 comprehension-1.647
tesy-1.536
 stagn-1.499
abeth-1.462
ector-1.425
Token different
Token ID1180
Feature activation-6.15e-01

Pos logit contributions
+2.017
INST+1.870
aternal+1.776
agy+1.761
 Rabbit+1.732
Neg logit contributions
 comprehension-2.232
tesy-2.109
 stagn-2.066
abeth-2.029
ector-1.983
Token from
Token ID422
Feature activation-1.27e+00

Pos logit contributions
+3.818
INST+3.550
aternal+3.366
agy+3.354
 Rabbit+3.324
Neg logit contributions
 comprehension-4.213
tesy-4.076
 stagn-3.915
abeth-3.912
ector-3.811
Token his
Token ID465
Feature activation-7.28e-01

Pos logit contributions
+6.886
INST+6.564
 Rabbit+6.282
+6.164
aternal+6.155
Neg logit contributions
tesy-8.778
 comprehension-8.553
abeth-8.173
 stagn-8.172
ector-8.044
Token or
Token ID393
Feature activation-3.30e-01

Pos logit contributions
+4.491
INST+4.219
 Rabbit+4.009
aternal+3.972
agy+3.892
Neg logit contributions
 comprehension-4.890
tesy-4.813
abeth-4.653
 stagn-4.586
ector-4.546
Token her
Token ID607
Feature activation-6.98e-01

Pos logit contributions
+2.079
INST+1.924
aternal+1.836
agy+1.815
 Rabbit+1.792
Neg logit contributions
 comprehension-2.331
tesy-2.205
 stagn-2.159
abeth-2.107
ector-2.072
Token offline
Token ID18043
Feature activation-7.04e-02

Pos logit contributions
+4.473
INST+4.232
 Rabbit+4.037
aternal+4.027
agy+3.939
Neg logit contributions
 comprehension-4.505
tesy-4.428
abeth-4.246
 stagn-4.213
ector-4.194
Token persona
Token ID27822
Feature activation-3.69e-01

Pos logit contributions
+0.599
INST+0.566
aternal+0.546
agy+0.542
 Rabbit+0.533
Neg logit contributions
 comprehension-0.350
tesy-0.317
 stagn-0.311
abeth-0.298
ector-0.290
Token Not
Token ID1892
Feature activation+2.28e-02

Pos logit contributions
+1.155
INST+1.051
aternal+0.985
agy+0.969
 Rabbit+0.964
Neg logit contributions
 comprehension-1.710
tesy-1.621
 stagn-1.599
abeth-1.566
ector-1.533
Token-
Token ID12
Feature activation+1.48e-01

Pos logit contributions
 comprehension+0.149
tesy+0.140
 stagn+0.136
abeth+0.133
ector+0.131
Neg logit contributions
-0.157
INST-0.146
aternal-0.140
agy-0.139
iotics-0.137
TokenAssociated
Token ID29014
Feature activation-7.60e-01

Pos logit contributions
 comprehension+1.031
tesy+0.989
 stagn+0.952
abeth+0.949
ector+0.938
Neg logit contributions
-0.921
INST-0.829
aternal-0.815
iotics-0.814
agy-0.814
Token T
Token ID309
Feature activation-7.30e-01

Pos logit contributions
+3.651
INST+3.243
 Rabbit+3.087
 Ü+2.941
aternal+2.921
Neg logit contributions
 comprehension-6.347
 stagn-5.980
tesy-5.940
abeth-5.865
ector-5.686
Tokenx
Token ID87
Feature activation-8.91e-01

Pos logit contributions
+3.501
INST+3.278
agy+3.052
aternal+3.022
+2.951
Neg logit contributions
 comprehension-6.036
tesy-5.770
 stagn-5.673
abeth-5.488
 unde-5.449
Token-
Token ID12
Feature activation-1.23e+00

Pos logit contributions
+4.801
INST+4.486
 Rabbit+4.159
+4.084
agy+4.071
Neg logit contributions
 comprehension-6.851
tesy-6.520
 stagn-6.506
 challeng-6.365
abeth-6.295
TokenPower
Token ID13434
Feature activation-5.64e-01

Pos logit contributions
+4.794
INST+4.586
+3.837
 Rabbit+3.640
agy+3.634
Neg logit contributions
 comprehension-11.239
 stagn-10.769
 challeng-10.527
tesy-10.428
abeth-10.398
Token=
Token ID28
Feature activation-9.35e-01

Pos logit contributions
+4.515
INST+4.306
agy+4.076
 Rabbit+4.032
aternal+4.030
Neg logit contributions
 comprehension-2.992
tesy-2.798
 stagn-2.721
abeth-2.619
ector-2.559
Token20
Token ID1238
Feature activation-9.47e-01

Pos logit contributions
+4.915
INST+4.652
aternal+4.256
agy+4.158
+4.121
Neg logit contributions
 comprehension-7.270
tesy-6.845
 stagn-6.812
abeth-6.658
 unde-6.500
Token dB
Token ID30221
Feature activation-3.33e-02

Pos logit contributions
+2.295
INST+1.777
 Rabbit+1.560
 Ü+1.453
aternal+1.432
Neg logit contributions
 comprehension-10.076
tesy-9.868
abeth-9.597
 stagn-9.578
ector-9.353
Tokenm
Token ID76
Feature activation-4.95e-01

Pos logit contributions
+0.214
INST+0.197
aternal+0.188
agy+0.186
 Rabbit+0.184
Neg logit contributions
 comprehension-0.234
tesy-0.220
 stagn-0.217
abeth-0.211
ector-0.208
Token their
Token ID511
Feature activation-2.87e-01

Pos logit contributions
+0.746
INST+0.695
aternal+0.663
agy+0.658
 Rabbit+0.648
Neg logit contributions
 comprehension-0.721
tesy-0.677
 stagn-0.665
abeth-0.648
ector-0.634
Token possible
Token ID1744
Feature activation-5.70e-02

Pos logit contributions
+2.030
INST+1.897
aternal+1.812
agy+1.801
 Rabbit+1.782
Neg logit contributions
 comprehension-1.798
tesy-1.694
 stagn-1.648
abeth-1.616
ector-1.585
Token paths
Token ID13532
Feature activation+3.15e-01

Pos logit contributions
+0.418
INST+0.392
aternal+0.375
agy+0.371
 Rabbit+0.363
Neg logit contributions
 comprehension-0.350
tesy-0.323
 stagn-0.321
abeth-0.309
 unde-0.301
Token?
Token ID30
Feature activation+1.54e-01

Pos logit contributions
 comprehension+1.582
 stagn+1.444
tesy+1.437
abeth+1.358
 unde+1.331
Neg logit contributions
-2.596
INST-2.477
aternal-2.394
agy-2.360
 Rabbit-2.315
Token As
Token ID1081
Feature activation-4.11e-02

Pos logit contributions
 comprehension+1.131
tesy+1.067
 stagn+1.049
abeth+1.030
ector+1.009
Neg logit contributions
-0.914
INST-0.844
aternal-0.823
agy-0.807
iotics-0.787
Token far
Token ID1290
Feature activation-1.23e+00

Pos logit contributions
+0.295
INST+0.276
aternal+0.264
agy+0.262
 Rabbit+0.258
Neg logit contributions
 comprehension-0.257
tesy-0.240
 stagn-0.235
abeth-0.229
ector-0.224
Token as
Token ID355
Feature activation-4.86e-01

Pos logit contributions
+6.376
agy+5.584
INST+5.531
+5.333
 Rabbit+5.320
Neg logit contributions
 comprehension-8.861
tesy-8.526
abeth-8.473
 stagn-8.456
ector-8.177
Token I
Token ID314
Feature activation+1.90e-01

Pos logit contributions
+3.273
INST+3.057
 Rabbit+2.958
agy+2.927
+2.897
Neg logit contributions
 comprehension-3.072
tesy-2.956
abeth-2.821
 stagn-2.799
ector-2.769
Token can
Token ID460
Feature activation+2.51e-01

Pos logit contributions
 comprehension+1.066
tesy+0.985
 stagn+0.966
abeth+0.931
ector+0.904
Neg logit contributions
-1.498
INST-1.409
aternal-1.350
agy-1.334
 Rabbit-1.320
Token tell
Token ID1560
Feature activation+1.02e-01

Pos logit contributions
 comprehension+1.499
tesy+1.391
 stagn+1.367
abeth+1.311
 unde+1.291
Neg logit contributions
-1.898
INST-1.777
aternal-1.696
agy-1.674
 Rabbit-1.655
Token,
Token ID11
Feature activation+8.48e-02

Pos logit contributions
 comprehension+0.718
tesy+0.674
 stagn+0.665
abeth+0.647
ector+0.634
Neg logit contributions
-0.652
INST-0.609
aternal-0.580
agy-0.572
 Rabbit-0.557
Token the
Token ID262
Feature activation-1.08e-01

Pos logit contributions
 comprehension+0.917
tesy+0.844
 stagn+0.841
abeth+0.807
 unde+0.796
Neg logit contributions
-1.001
INST-0.933
aternal-0.899
agy-0.891
iotics-0.867
Token receiver
Token ID9733
Feature activation-3.67e-02

Pos logit contributions
+0.738
INST+0.688
aternal+0.658
agy+0.653
 Rabbit+0.639
Neg logit contributions
 comprehension-0.713
tesy-0.667
 stagn-0.657
abeth-0.638
ector-0.626
Token and
Token ID290
Feature activation+2.14e-01

Pos logit contributions
+0.262
INST+0.245
aternal+0.235
agy+0.233
 Rabbit+0.228
Neg logit contributions
 comprehension-0.231
tesy-0.214
 stagn-0.212
abeth-0.205
ector-0.201
Token antenna
Token ID20509
Feature activation-1.02e-01

Pos logit contributions
 comprehension+1.369
 stagn+1.264
tesy+1.263
abeth+1.203
ector+1.197
Neg logit contributions
-1.520
INST-1.418
aternal-1.365
agy-1.358
iotics-1.325
Token as
Token ID355
Feature activation-2.20e-01

Pos logit contributions
+0.745
INST+0.698
aternal+0.668
agy+0.663
 Rabbit+0.654
Neg logit contributions
 comprehension-0.619
tesy-0.578
 stagn-0.566
abeth-0.552
ector-0.538
Token far
Token ID1290
Feature activation-1.22e+00

Pos logit contributions
+1.605
INST+1.503
aternal+1.437
agy+1.423
 Rabbit+1.415
Neg logit contributions
 comprehension-1.338
tesy-1.252
 stagn-1.222
abeth-1.199
ector-1.161
Token removed
Token ID4615
Feature activation-3.70e-01

Pos logit contributions
+7.379
INST+6.742
agy+6.430
aternal+6.323
 Ü+6.257
Neg logit contributions
 comprehension-8.000
tesy-7.732
abeth-7.453
 stagn-7.357
ruction-6.976
Token,
Token ID11
Feature activation+1.08e-02

Pos logit contributions
+2.435
INST+2.243
aternal+2.148
agy+2.124
 Rabbit+2.103
Neg logit contributions
 comprehension-2.491
tesy-2.357
 stagn-2.300
abeth-2.275
ector-2.192
Token physically
Token ID10170
Feature activation-1.48e-01

Pos logit contributions
 comprehension+0.069
tesy+0.064
 stagn+0.063
abeth+0.061
ector+0.060
Neg logit contributions
-0.076
INST-0.071
aternal-0.068
agy-0.068
 Rabbit-0.067
Token and
Token ID290
Feature activation-7.97e-02

Pos logit contributions
+1.066
INST+0.997
aternal+0.952
agy+0.946
 Rabbit+0.932
Neg logit contributions
 comprehension-0.910
tesy-0.853
 stagn-0.832
abeth-0.813
ector-0.795
Token electr
Token ID30880
Feature activation+1.34e-02

Pos logit contributions
+0.610
INST+0.574
aternal+0.551
agy+0.547
 Rabbit+0.539
Neg logit contributions
 comprehension-0.462
tesy-0.426
 stagn-0.419
abeth-0.405
ector-0.397
Token leadership
Token ID5531
Feature activation+3.44e-01

Pos logit contributions
 comprehension+1.287
tesy+1.181
 stagn+1.168
abeth+1.116
 unde+1.110
Neg logit contributions
-1.616
INST-1.516
aternal-1.452
agy-1.439
iotics-1.408
Token was
Token ID373
Feature activation+4.47e-02

Pos logit contributions
 comprehension+2.317
 stagn+2.139
tesy+2.138
 unde+2.023
abeth+2.013
Neg logit contributions
-2.289
INST-2.149
aternal-2.055
agy-2.039
 Rabbit-1.984
Token a
Token ID257
Feature activation-2.22e-01

Pos logit contributions
 comprehension+0.264
tesy+0.244
 stagn+0.242
abeth+0.231
 unde+0.227
Neg logit contributions
-0.338
INST-0.315
aternal-0.303
agy-0.302
 Rabbit-0.296
Token disaster
Token ID9336
Feature activation-2.78e-01

Pos logit contributions
+1.673
INST+1.571
aternal+1.506
agy+1.495
 Rabbit+1.475
Neg logit contributions
 comprehension-1.306
tesy-1.215
 stagn-1.191
abeth-1.159
ector-1.129
Token!
Token ID0
Feature activation+1.35e-01

Pos logit contributions
+1.815
INST+1.688
aternal+1.603
agy+1.587
 Rabbit+1.563
Neg logit contributions
 comprehension-1.905
tesy-1.797
 stagn-1.762
abeth-1.725
ector-1.684
Token
Token ID851
Feature activation-1.22e+00

Pos logit contributions
 comprehension+0.957
tesy+0.894
 stagn+0.879
abeth+0.860
ector+0.846
Neg logit contributions
-0.841
INST-0.781
aternal-0.750
agy-0.744
iotics-0.723
Token Donald
Token ID3759
Feature activation-6.98e-01

Pos logit contributions
+5.342
INST+4.865
+4.573
 Rabbit+4.510
agy+4.487
Neg logit contributions
 comprehension-10.221
tesy-9.905
 stagn-9.560
abeth-9.332
ector-9.216
Token J
Token ID449
Feature activation+2.75e-02

Pos logit contributions
+2.889
INST+2.637
 Rabbit+2.427
agy+2.334
aternal+2.270
Neg logit contributions
 comprehension-6.391
tesy-6.125
 stagn-6.069
abeth-5.968
ector-5.836
Token.
Token ID13
Feature activation-3.81e-01

Pos logit contributions
 comprehension+0.182
tesy+0.170
 stagn+0.166
abeth+0.163
ector+0.162
Neg logit contributions
-0.187
INST-0.172
aternal-0.165
agy-0.163
iotics-0.161
Token Trump
Token ID1301
Feature activation-6.60e-01

Pos logit contributions
+1.976
INST+1.815
aternal+1.684
agy+1.679
 Rabbit+1.668
Neg logit contributions
 comprehension-3.138
tesy-2.988
 stagn-2.946
abeth-2.881
ector-2.818
Token (@
Token ID4275
Feature activation-7.38e-01

Pos logit contributions
+3.371
INST+3.078
agy+2.807
 Rabbit+2.781
aternal+2.771
Neg logit contributions
 comprehension-5.460
tesy-5.174
 stagn-5.128
abeth-5.049
ector-4.913
Tokeny
Token ID88
Feature activation-4.53e-01

Pos logit contributions
+0.994
INST+0.904
aternal+0.848
agy+0.840
 Rabbit+0.820
Neg logit contributions
 comprehension-1.631
tesy-1.551
 stagn-1.529
abeth-1.500
ector-1.474
Token,
Token ID11
Feature activation-2.26e-01

Pos logit contributions
+2.657
INST+2.400
aternal+2.257
 Rabbit+2.230
agy+2.200
Neg logit contributions
 comprehension-3.386
tesy-3.232
 stagn-3.161
abeth-3.117
ector-3.046
Token with
Token ID351
Feature activation+1.41e-01

Pos logit contributions
+1.399
INST+1.292
aternal+1.227
agy+1.213
 Rabbit+1.201
Neg logit contributions
 comprehension-1.629
tesy-1.544
 stagn-1.510
abeth-1.483
ector-1.452
Token a
Token ID257
Feature activation+7.84e-02

Pos logit contributions
 comprehension+0.821
 stagn+0.745
tesy+0.741
abeth+0.713
 unde+0.702
Neg logit contributions
-1.087
INST-1.020
aternal-0.977
agy-0.968
iotics-0.954
Token cool
Token ID3608
Feature activation-4.11e-02

Pos logit contributions
 comprehension+0.490
tesy+0.458
 stagn+0.449
abeth+0.437
ector+0.426
Neg logit contributions
-0.561
INST-0.527
aternal-0.503
agy-0.500
 Rabbit-0.492
Token m
Token ID285
Feature activation-1.21e+00

Pos logit contributions
+0.290
INST+0.271
aternal+0.259
agy+0.257
 Rabbit+0.252
Neg logit contributions
 comprehension-0.263
tesy-0.244
 stagn-0.241
abeth-0.234
ector-0.229
Tokenoust
Token ID23968
Feature activation+2.83e-01

Pos logit contributions
+2.532
INST+2.091
aternal+1.967
agy+1.836
+1.794
Neg logit contributions
 comprehension-12.691
 stagn-12.103
abeth-12.001
tesy-11.958
 unde-11.783
Tokenache
Token ID4891
Feature activation-4.05e-03

Pos logit contributions
 comprehension+1.962
tesy+1.857
abeth+1.811
 stagn+1.799
ector+1.771
Neg logit contributions
-1.850
INST-1.700
aternal-1.603
 Rabbit-1.600
agy-1.585
Token and
Token ID290
Feature activation+7.50e-02

Pos logit contributions
+0.024
INST+0.022
aternal+0.021
agy+0.021
 Rabbit+0.020
Neg logit contributions
 comprehension-0.030
tesy-0.029
 stagn-0.028
abeth-0.028
ector-0.027
Token a
Token ID257
Feature activation-3.97e-01

Pos logit contributions
 comprehension+0.444
tesy+0.407
 stagn+0.402
abeth+0.389
ector+0.379
Neg logit contributions
-0.568
INST-0.534
aternal-0.510
agy-0.507
 Rabbit-0.497
Token bit
Token ID1643
Feature activation-2.07e-01

Pos logit contributions
+2.265
INST+2.070
aternal+1.956
agy+1.933
 Rabbit+1.913
Neg logit contributions
 comprehension-3.013
tesy-2.883
 stagn-2.819
abeth-2.796
ector-2.741
Token Joe
Token ID5689
Feature activation-2.35e-01

Pos logit contributions
+1.295
INST+1.202
aternal+1.142
agy+1.132
 Rabbit+1.114
Neg logit contributions
 comprehension-1.441
tesy-1.359
 stagn-1.335
abeth-1.305
ector-1.279
Token Buck
Token ID13452
Feature activation-2.83e-03

Pos logit contributions
+1.487
INST+1.382
aternal+1.309
agy+1.299
 Rabbit+1.283
Neg logit contributions
 comprehension-1.658
tesy-1.569
 stagn-1.535
abeth-1.505
ector-1.474
Token.
Token ID13
Feature activation-1.33e-01

Pos logit contributions
+0.019
INST+0.018
aternal+0.017
agy+0.017
 Rabbit+0.016
Neg logit contributions
 comprehension-0.019
tesy-0.018
 stagn-0.017
abeth-0.017
ector-0.017
Token But
Token ID887
Feature activation-5.90e-02

Pos logit contributions
+0.766
INST+0.704
aternal+0.669
agy+0.662
iotics+0.642
Neg logit contributions
 comprehension-1.021
tesy-0.966
 stagn-0.948
abeth-0.932
ector-0.913
Token as
Token ID355
Feature activation-3.15e-01

Pos logit contributions
+0.427
INST+0.400
aternal+0.382
agy+0.379
 Rabbit+0.371
Neg logit contributions
 comprehension-0.367
tesy-0.341
 stagn-0.336
abeth-0.326
ector-0.318
Token far
Token ID1290
Feature activation-1.20e+00

Pos logit contributions
+2.227
INST+2.079
aternal+1.996
agy+1.979
 Rabbit+1.963
Neg logit contributions
 comprehension-1.978
tesy-1.867
 stagn-1.814
abeth-1.781
ector-1.739
Token as
Token ID355
Feature activation-5.67e-01

Pos logit contributions
+6.393
agy+5.650
INST+5.649
aternal+5.531
 Rabbit+5.529
Neg logit contributions
 comprehension-8.594
tesy-8.519
abeth-8.264
ector-8.140
 stagn-8.121
Token people
Token ID661
Feature activation-2.61e-02

Pos logit contributions
+3.808
INST+3.561
 Rabbit+3.456
aternal+3.388
agy+3.377
Neg logit contributions
 comprehension-3.582
tesy-3.489
abeth-3.327
 stagn-3.309
ector-3.267
Token viewing
Token ID11681
Feature activation-1.43e-01

Pos logit contributions
+0.188
INST+0.175
aternal+0.168
agy+0.165
 Rabbit+0.163
Neg logit contributions
 comprehension-0.166
tesy-0.153
 stagn-0.152
abeth-0.146
 unde-0.142
Token him
Token ID683
Feature activation-3.52e-01

Pos logit contributions
+0.985
INST+0.917
aternal+0.876
agy+0.869
 Rabbit+0.848
Neg logit contributions
 comprehension-0.948
tesy-0.883
 stagn-0.871
abeth-0.846
ector-0.826
Token unfavorable
Token ID38206
Feature activation-9.16e-02

Pos logit contributions
+2.274
INST+2.108
aternal+2.004
agy+1.993
 Rabbit+1.963
Neg logit contributions
 comprehension-2.426
tesy-2.289
 stagn-2.242
abeth-2.200
ector-2.161
Token _
Token ID4808
Feature activation-2.12e-01

Pos logit contributions
+1.907
INST+1.770
aternal+1.655
 Rabbit+1.633
agy+1.628
Neg logit contributions
 comprehension-2.343
tesy-2.209
abeth-2.176
 stagn-2.171
ector-2.073
TokenP
Token ID47
Feature activation+2.28e-01

Pos logit contributions
+0.880
INST+0.781
aternal+0.717
agy+0.701
 Rabbit+0.688
Neg logit contributions
 comprehension-1.967
tesy-1.880
 stagn-1.858
abeth-1.826
ector-1.797
Tokenulse
Token ID9615
Feature activation+9.00e-02

Pos logit contributions
 comprehension+1.552
tesy+1.451
 stagn+1.447
abeth+1.422
ector+1.412
Neg logit contributions
-1.503
INST-1.372
 Rabbit-1.299
 Cosponsors-1.297
agy-1.286
TokenAmount
Token ID31264
Feature activation+2.46e-01

Pos logit contributions
 comprehension+0.673
tesy+0.636
 stagn+0.622
abeth+0.612
ector+0.603
Neg logit contributions
-0.529
INST-0.487
agy-0.467
aternal-0.465
 Rabbit-0.452
Token (
Token ID357
Feature activation+1.42e-01

Pos logit contributions
 comprehension+1.568
tesy+1.466
 stagn+1.446
abeth+1.397
ector+1.376
Neg logit contributions
-1.727
INST-1.611
aternal-1.542
agy-1.539
 Rabbit-1.506
Token "
Token ID366
Feature activation-1.20e+00

Pos logit contributions
 comprehension+0.898
 stagn+0.828
tesy+0.823
ector+0.780
 unde+0.778
Neg logit contributions
-1.012
INST-0.948
agy-0.913
aternal-0.913
iotics-0.882
TokenP
Token ID47
Feature activation-1.25e-01

Pos logit contributions
+3.396
INST+2.988
 Rabbit+2.451
aternal+2.335
+2.286
Neg logit contributions
 comprehension-12.268
 stagn-11.788
tesy-11.777
abeth-11.640
 challeng-11.608
Tokenulse
Token ID9615
Feature activation+3.57e-01

Pos logit contributions
+0.770
INST+0.710
aternal+0.666
agy+0.664
 Rabbit+0.656
Neg logit contributions
 comprehension-0.914
tesy-0.860
 stagn-0.850
abeth-0.832
ector-0.819
Token amount
Token ID2033
Feature activation-4.59e-01

Pos logit contributions
 comprehension+2.045
 stagn+1.812
tesy+1.746
 unde+1.706
abeth+1.638
Neg logit contributions
-2.850
INST-2.690
agy-2.559
aternal-2.519
 Cosponsors-2.508
Token"
Token ID1
Feature activation-3.36e-01

Pos logit contributions
+3.107
INST+2.887
aternal+2.740
agy+2.728
 Rabbit+2.703
Neg logit contributions
 comprehension-3.023
tesy-2.843
 stagn-2.789
abeth-2.757
ector-2.668
Token,
Token ID837
Feature activation+2.69e-01

Pos logit contributions
+1.985
INST+1.831
aternal+1.729
agy+1.712
 Rabbit+1.676
Neg logit contributions
 comprehension-2.534
tesy-2.394
 stagn-2.359
abeth-2.303
ector-2.267
Token did
Token ID750
Feature activation+1.24e-01

Pos logit contributions
+3.060
INST+2.877
aternal+2.729
agy+2.720
 Rabbit+2.710
Neg logit contributions
 comprehension-2.702
tesy-2.566
 stagn-2.469
abeth-2.437
ector-2.393
Token it
Token ID340
Feature activation-8.49e-02

Pos logit contributions
 comprehension+0.753
tesy+0.690
 stagn+0.689
abeth+0.667
ector+0.653
Neg logit contributions
-0.916
INST-0.851
aternal-0.820
agy-0.811
iotics-0.792
Token attract
Token ID4729
Feature activation-1.22e-01

Pos logit contributions
+0.608
INST+0.569
aternal+0.544
agy+0.540
 Rabbit+0.533
Neg logit contributions
 comprehension-0.530
tesy-0.496
 stagn-0.485
abeth-0.474
ector-0.463
Token you
Token ID345
Feature activation+4.63e-02

Pos logit contributions
+0.796
INST+0.739
aternal+0.704
agy+0.698
 Rabbit+0.686
Neg logit contributions
 comprehension-0.835
tesy-0.785
 stagn-0.772
abeth-0.754
ector-0.738
Token?
Token ID30
Feature activation+1.09e-01

Pos logit contributions
 comprehension+0.293
tesy+0.273
 stagn+0.270
abeth+0.262
 unde+0.254
Neg logit contributions
-0.327
INST-0.306
aternal-0.293
agy-0.290
 Rabbit-0.284
Token -
Token ID532
Feature activation-1.17e+00

Pos logit contributions
 comprehension+0.782
tesy+0.731
 stagn+0.719
abeth+0.708
ector+0.690
Neg logit contributions
-0.679
INST-0.627
aternal-0.607
agy-0.599
iotics-0.583
Token January
Token ID3269
Feature activation+1.67e-01

Pos logit contributions
+4.367
INST+3.845
 Rabbit+3.779
 Ü+3.645
agy+3.597
Neg logit contributions
 comprehension-10.399
 stagn-9.833
tesy-9.770
abeth-9.703
ector-9.701
Token 15
Token ID1315
Feature activation+4.72e-02

Pos logit contributions
 comprehension+1.145
tesy+1.073
 stagn+1.062
abeth+1.030
 unde+1.010
Neg logit contributions
-1.098
INST-1.007
aternal-0.970
agy-0.952
iotics-0.937
Token,
Token ID11
Feature activation+1.17e-01

Pos logit contributions
 comprehension+0.327
tesy+0.308
 stagn+0.302
abeth+0.297
ector+0.290
Neg logit contributions
-0.305
INST-0.281
aternal-0.270
agy-0.265
iotics-0.262
Token 2010
Token ID3050
Feature activation+7.54e-02

Pos logit contributions
 comprehension+0.710
tesy+0.656
 stagn+0.645
abeth+0.624
 unde+0.612
Neg logit contributions
-0.861
INST-0.811
aternal-0.780
agy-0.772
iotics-0.757
TokenQue
Token ID15681
Feature activation-4.26e-01

Pos logit contributions
 comprehension+0.646
tesy+0.620
 stagn+0.609
abeth+0.603
ector+0.593
Neg logit contributions
-0.359
INST-0.317
aternal-0.305
agy-0.299
iotics-0.289
Token
Token ID198
Feature activation+5.85e-02

Pos logit contributions
+1.978
INST+1.799
aternal+1.671
agy+1.641
 Rabbit+1.628
Neg logit contributions
 comprehension-3.544
tesy-3.346
 stagn-3.335
abeth-3.252
ector-3.203
Token
Token ID198
Feature activation-1.54e-01

Pos logit contributions
 comprehension+0.452
tesy+0.427
 stagn+0.422
abeth+0.412
ector+0.407
Neg logit contributions
-0.335
INST-0.300
aternal-0.287
agy-0.282
iotics-0.280
TokenGoogle
Token ID11708
Feature activation+1.21e-01

Pos logit contributions
+1.379
INST+1.298
aternal+1.259
agy+1.252
 Rabbit+1.239
Neg logit contributions
 comprehension-0.689
tesy-0.629
 stagn-0.604
abeth-0.589
ector-0.568
Token Scholar
Token ID11713
Feature activation+5.22e-01

Pos logit contributions
 comprehension+0.511
tesy+0.461
 stagn+0.447
abeth+0.434
ector+0.415
Neg logit contributions
-1.110
INST-1.046
aternal-1.019
agy-1.006
iotics-1.002
Token
Token ID198
Feature activation-4.39e-01

Pos logit contributions
 comprehension+3.698
tesy+3.494
 stagn+3.437
abeth+3.356
ector+3.320
Neg logit contributions
-3.174
INST-2.925
agy-2.813
aternal-2.795
iotics-2.718
Token
Token ID198
Feature activation-1.17e+00

Pos logit contributions
+2.072
INST+1.904
aternal+1.755
agy+1.737
 Rabbit+1.734
Neg logit contributions
 comprehension-3.771
tesy-3.609
 stagn-3.545
abeth-3.498
ector-3.423
TokenOl
Token ID30098
Feature activation-1.80e-01

Pos logit contributions
+5.586
INST+5.267
+4.704
aternal+4.638
 Rabbit+4.540
Neg logit contributions
 comprehension-9.689
 stagn-9.186
tesy-9.073
 unde-8.748
abeth-8.692
Tokenwe
Token ID732
Feature activation-1.84e-01

Pos logit contributions
+1.043
INST+0.959
aternal+0.905
agy+0.896
 Rabbit+0.880
Neg logit contributions
 comprehension-1.379
tesy-1.307
 stagn-1.285
abeth-1.259
ector-1.236
Tokenus
Token ID385
Feature activation+2.66e-01

Pos logit contributions
+0.882
INST+0.797
aternal+0.742
agy+0.733
 Rabbit+0.716
Neg logit contributions
 comprehension-1.590
tesy-1.515
 stagn-1.494
abeth-1.467
ector-1.443
Token,
Token ID11
Feature activation+2.19e-01

Pos logit contributions
 comprehension+1.749
tesy+1.647
 stagn+1.617
abeth+1.588
ector+1.557
Neg logit contributions
-1.810
INST-1.668
aternal-1.580
agy-1.558
iotics-1.552
Token D
Token ID360
Feature activation+9.12e-02

Pos logit contributions
 comprehension+1.649
tesy+1.551
 stagn+1.529
abeth+1.499
 unde+1.483
Neg logit contributions
-1.303
INST-1.173
aternal-1.113
agy-1.096
iotics-1.086
Tokenstop
Token ID11338
Feature activation-1.89e-01

Pos logit contributions
+2.047
INST+1.915
aternal+1.808
agy+1.791
 Rabbit+1.785
Neg logit contributions
 comprehension-2.273
tesy-2.181
 stagn-2.112
abeth-2.102
ector-2.056
Token awesome
Token ID7427
Feature activation-2.57e-01

Pos logit contributions
+1.357
INST+1.270
aternal+1.214
agy+1.205
 Rabbit+1.187
Neg logit contributions
 comprehension-1.178
tesy-1.102
 stagn-1.080
abeth-1.052
ector-1.028
Token grappling
Token ID36762
Feature activation-9.89e-02

Pos logit contributions
+1.699
INST+1.580
aternal+1.503
agy+1.491
 Rabbit+1.468
Neg logit contributions
 comprehension-1.743
tesy-1.644
 stagn-1.612
abeth-1.579
ector-1.541
Token hook
Token ID8011
Feature activation-3.13e-02

Pos logit contributions
+0.777
INST+0.730
aternal+0.702
agy+0.697
 Rabbit+0.686
Neg logit contributions
 comprehension-0.554
tesy-0.509
 stagn-0.500
abeth-0.481
ector-0.472
Token aw
Token ID3253
Feature activation-3.13e-01

Pos logit contributions
+0.221
INST+0.206
aternal+0.198
agy+0.195
 Rabbit+0.192
Neg logit contributions
 comprehension-0.201
tesy-0.186
 stagn-0.184
abeth-0.177
ector-0.175
Tokenes
Token ID274
Feature activation-1.16e+00

Pos logit contributions
+2.249
INST+2.094
agy+2.013
aternal+2.005
 Rabbit+1.966
Neg logit contributions
 comprehension-1.950
tesy-1.810
 stagn-1.791
abeth-1.749
ector-1.696
Tokenomen
Token ID3674
Feature activation-2.74e-02

Pos logit contributions
+4.676
INST+4.193
aternal+3.915
agy+3.876
iotics+3.528
Neg logit contributions
 comprehension-10.381
tesy-9.934
 stagn-9.865
abeth-9.575
 unde-9.432
Tokeness
Token ID408
Feature activation-3.02e-01

Pos logit contributions
+0.234
INST+0.219
aternal+0.211
agy+0.210
 Rabbit+0.208
Neg logit contributions
 comprehension-0.135
tesy-0.123
 stagn-0.120
abeth-0.118
ector-0.115
Token!
Token ID0
Feature activation-2.82e-01

Pos logit contributions
+1.955
INST+1.818
aternal+1.722
agy+1.711
 Rabbit+1.683
Neg logit contributions
 comprehension-2.094
tesy-1.975
 stagn-1.936
abeth-1.898
ector-1.854
Token Having
Token ID11136
Feature activation-2.93e-01

Pos logit contributions
+1.835
INST+1.707
aternal+1.616
agy+1.604
 Rabbit+1.584
Neg logit contributions
 comprehension-1.948
tesy-1.833
 stagn-1.803
abeth-1.760
ector-1.721
Token done
Token ID1760
Feature activation-1.20e-01

Pos logit contributions
+2.091
INST+1.951
aternal+1.870
agy+1.861
 Rabbit+1.844
Neg logit contributions
 comprehension-1.816
tesy-1.721
 stagn-1.664
abeth-1.639
ector-1.598
Token Foundation
Token ID5693
Feature activation+8.53e-03

Pos logit contributions
 comprehension+2.317
tesy+2.148
 stagn+2.081
abeth+2.071
ector+2.029
Neg logit contributions
-2.762
INST-2.577
aternal-2.468
agy-2.455
iotics-2.423
Token,
Token ID11
Feature activation-1.06e-01

Pos logit contributions
 comprehension+0.060
tesy+0.057
 stagn+0.056
abeth+0.055
ector+0.054
Neg logit contributions
-0.054
INST-0.050
aternal-0.047
agy-0.047
 Rabbit-0.046
Token told
Token ID1297
Feature activation+2.95e-02

Pos logit contributions
+0.722
INST+0.673
aternal+0.642
agy+0.637
 Rabbit+0.626
Neg logit contributions
 comprehension-0.695
tesy-0.651
 stagn-0.640
abeth-0.624
ector-0.610
Token the
Token ID262
Feature activation+1.14e-01

Pos logit contributions
 comprehension+0.210
tesy+0.196
 stagn+0.194
abeth+0.189
 unde+0.185
Neg logit contributions
-0.187
INST-0.173
aternal-0.165
agy-0.163
iotics-0.159
Token Ou
Token ID42201
Feature activation-2.42e-01

Pos logit contributions
 comprehension+0.782
tesy+0.730
 stagn+0.723
abeth+0.704
ector+0.690
Neg logit contributions
-0.752
INST-0.702
aternal-0.667
agy-0.664
iotics-0.647
Tokenach
Token ID620
Feature activation-1.16e+00

Pos logit contributions
+1.457
INST+1.342
aternal+1.304
agy+1.291
 Rabbit+1.252
Neg logit contributions
 comprehension-1.769
tesy-1.681
 stagn-1.646
abeth-1.598
ector-1.570
Tokenita
Token ID5350
Feature activation-4.99e-01

Pos logit contributions
+5.056
aternal+4.931
ū+4.796
 Rabbit+4.588
INST+4.575
Neg logit contributions
 comprehension-9.379
tesy-9.275
 stagn-8.898
CTV-8.509
ector-8.473
Token Citizen
Token ID22307
Feature activation-3.15e-02

Pos logit contributions
+3.231
INST+2.964
aternal+2.840
 Rabbit+2.802
agy+2.789
Neg logit contributions
 comprehension-3.438
tesy-3.305
 stagn-3.202
abeth-3.105
ector-3.041
Token that
Token ID326
Feature activation-9.58e-02

Pos logit contributions
+0.210
INST+0.195
aternal+0.186
agy+0.185
 Rabbit+0.181
Neg logit contributions
 comprehension-0.213
tesy-0.199
 stagn-0.196
abeth-0.192
ector-0.188
Token the
Token ID262
Feature activation-1.60e-01

Pos logit contributions
+0.616
INST+0.572
aternal+0.543
agy+0.538
 Rabbit+0.530
Neg logit contributions
 comprehension-0.672
tesy-0.631
 stagn-0.620
abeth-0.607
ector-0.594
Token think
Token ID892
Feature activation+6.56e-02

Pos logit contributions
+1.164
INST+1.091
aternal+1.043
agy+1.037
 Rabbit+1.020
Neg logit contributions
 comprehension-0.982
tesy-0.915
 stagn-0.899
abeth-0.874
ector-0.853
Token all
Token ID477
Feature activation+3.69e-03

Pos logit contributions
+0.393
INST+0.368
aternal+0.351
agy+0.348
iotics+0.341
Neg logit contributions
 comprehension-0.362
tesy-0.336
 stagn-0.332
abeth-0.321
 unde-0.314
Token about
Token ID546
Feature activation-7.55e-02

Pos logit contributions
 comprehension+0.022
tesy+0.020
 stagn+0.020
abeth+0.019
 unde+0.019
Neg logit contributions
-0.028
INST-0.026
aternal-0.025
agy-0.025
iotics-0.025
Token which
Token ID543
Feature activation-3.47e-01

Pos logit contributions
+0.525
INST+0.490
aternal+0.468
agy+0.464
 Rabbit+0.454
Neg logit contributions
 comprehension-0.492
tesy-0.458
 stagn-0.452
abeth-0.437
ector-0.428
Token player
Token ID2137
Feature activation-3.71e-01

Pos logit contributions
+2.519
INST+2.358
aternal+2.252
agy+2.244
 Rabbit+2.234
Neg logit contributions
 comprehension-2.092
tesy-1.993
 stagn-1.918
abeth-1.895
ector-1.831
Token out
Token ID503
Feature activation-1.10e+00

Pos logit contributions
+2.334
INST+2.169
 Rabbit+2.053
agy+2.052
aternal+2.044
Neg logit contributions
 comprehension-2.566
tesy-2.475
abeth-2.365
 stagn-2.342
ector-2.292
Token-
Token ID12
Feature activation-1.16e+00

Pos logit contributions
+6.792
+6.349
INST+6.237
agy+6.080
bol+5.957
Neg logit contributions
 comprehension-7.181
tesy-6.676
 stagn-6.554
abeth-6.394
CTV-6.298
Tokenper
Token ID525
Feature activation-8.27e-02

Pos logit contributions
+6.796
INST+6.633
+6.443
agy+6.288
bol+6.218
Neg logit contributions
 comprehension-7.772
tesy-7.211
 stagn-7.011
abeth-6.789
 unde-6.652
Tokenforms
Token ID23914
Feature activation-7.86e-01

Pos logit contributions
+0.766
INST+0.727
aternal+0.702
agy+0.697
 Rabbit+0.689
Neg logit contributions
 comprehension-0.347
tesy-0.313
 stagn-0.304
abeth-0.288
ector-0.280
Token the
Token ID262
Feature activation-3.10e-01

Pos logit contributions
+4.752
INST+4.398
 Rabbit+4.291
+4.183
aternal+4.174
Neg logit contributions
 comprehension-5.417
tesy-5.274
abeth-5.061
 stagn-5.040
ector-4.923
Token other
Token ID584
Feature activation-6.01e-01

Pos logit contributions
+2.054
INST+1.910
aternal+1.821
agy+1.806
 Rabbit+1.786
Neg logit contributions
 comprehension-2.098
tesy-1.980
 stagn-1.934
abeth-1.897
ector-1.855
Token.
Token ID13
Feature activation-1.68e-02

Pos logit contributions
+3.899
INST+3.633
 Rabbit+3.468
aternal+3.451
agy+3.434
Neg logit contributions
 comprehension-4.025
tesy-3.868
abeth-3.731
 stagn-3.692
ector-3.597
Token people
Token ID661
Feature activation-8.13e-02

Pos logit contributions
+2.514
INST+2.369
aternal+2.252
 Rabbit+2.228
agy+2.224
Neg logit contributions
 comprehension-2.412
tesy-2.298
 stagn-2.230
abeth-2.194
ector-2.141
Token with
Token ID351
Feature activation-9.83e-02

Pos logit contributions
+0.514
INST+0.475
aternal+0.453
agy+0.448
 Rabbit+0.437
Neg logit contributions
 comprehension-0.581
tesy-0.544
 stagn-0.539
abeth-0.523
ector-0.513
Token bad
Token ID2089
Feature activation+2.06e-01

Pos logit contributions
+0.716
INST+0.669
aternal+0.639
agy+0.636
 Rabbit+0.624
Neg logit contributions
 comprehension-0.608
tesy-0.563
 stagn-0.556
abeth-0.537
ector-0.526
Token manners
Token ID33558
Feature activation-3.27e-02

Pos logit contributions
 comprehension+1.275
tesy+1.145
 stagn+1.120
abeth+1.086
 unde+1.078
Neg logit contributions
-1.532
INST-1.426
agy-1.376
aternal-1.369
 Rabbit-1.330
Token.
Token ID13
Feature activation-3.92e-02

Pos logit contributions
+0.217
INST+0.201
aternal+0.193
agy+0.191
 Rabbit+0.187
Neg logit contributions
 comprehension-0.223
tesy-0.209
 stagn-0.206
abeth-0.200
ector-0.196
Token -
Token ID532
Feature activation-1.15e+00

Pos logit contributions
+0.243
INST+0.224
aternal+0.215
agy+0.212
iotics+0.206
Neg logit contributions
 comprehension-0.283
tesy-0.266
 stagn-0.262
abeth-0.256
ector-0.251
Token January
Token ID3269
Feature activation-1.96e-01

Pos logit contributions
+4.391
INST+3.973
 Rabbit+3.863
+3.683
 Ü+3.682
Neg logit contributions
 comprehension-10.150
 stagn-9.578
tesy-9.532
abeth-9.470
 challeng-9.423
Token 4
Token ID604
Feature activation+1.32e-01

Pos logit contributions
+1.393
INST+1.301
aternal+1.245
agy+1.232
 Rabbit+1.213
Neg logit contributions
 comprehension-1.235
tesy-1.156
 stagn-1.133
abeth-1.104
ector-1.079
Token,
Token ID11
Feature activation+2.68e-01

Pos logit contributions
 comprehension+0.890
tesy+0.835
 stagn+0.820
abeth+0.802
ector+0.786
Neg logit contributions
-0.875
INST-0.805
aternal-0.775
agy-0.762
iotics-0.756
Token 2010
Token ID3050
Feature activation-8.49e-02

Pos logit contributions
 comprehension+1.545
tesy+1.409
 stagn+1.385
abeth+1.335
 unde+1.325
Neg logit contributions
-2.042
INST-1.932
aternal-1.872
agy-1.852
iotics-1.824
TokenUser
Token ID12982
Feature activation+1.91e-01

Pos logit contributions
+0.559
INST+0.509
aternal+0.499
agy+0.490
iotics+0.482
Neg logit contributions
 comprehension-0.578
tesy-0.546
 stagn-0.531
abeth-0.523
ector-0.515
Token the
Token ID262
Feature activation-6.62e-01

Pos logit contributions
+1.967
INST+1.838
aternal+1.759
agy+1.721
 Rabbit+1.721
Neg logit contributions
 comprehension-1.939
tesy-1.848
 stagn-1.794
abeth-1.762
ector-1.721
Token same
Token ID976
Feature activation-1.42e-01

Pos logit contributions
+3.250
INST+3.032
aternal+2.833
 Rabbit+2.784
+2.683
Neg logit contributions
 comprehension-5.370
tesy-5.252
 stagn-5.092
abeth-5.069
ector-4.940
Token sex
Token ID1714
Feature activation-5.79e-01

Pos logit contributions
+0.708
INST+0.641
aternal+0.599
agy+0.595
 Rabbit+0.578
Neg logit contributions
 comprehension-1.196
tesy-1.134
 stagn-1.121
abeth-1.098
ector-1.080
Token that
Token ID326
Feature activation-4.91e-01

Pos logit contributions
+3.609
INST+3.360
aternal+3.186
agy+3.118
 Rabbit+3.103
Neg logit contributions
 comprehension-4.070
tesy-3.855
 stagn-3.730
abeth-3.684
ector-3.634
Token is
Token ID318
Feature activation-5.27e-01

Pos logit contributions
+2.960
INST+2.783
aternal+2.655
 Rabbit+2.584
agy+2.575
Neg logit contributions
 comprehension-3.478
tesy-3.338
abeth-3.212
 stagn-3.197
ector-3.150
Token treated
Token ID5716
Feature activation-1.14e+00

Pos logit contributions
+3.591
INST+3.411
aternal+3.284
 Rabbit+3.189
agy+3.184
Neg logit contributions
 comprehension-3.320
tesy-3.182
abeth-3.045
 stagn-3.041
ector-2.968
Token as
Token ID355
Feature activation-6.08e-01

Pos logit contributions
+6.484
INST+6.092
aternal+5.852
+5.610
 Rabbit+5.607
Neg logit contributions
 comprehension-7.993
tesy-7.584
abeth-7.437
ector-7.420
 stagn-7.348
Token a
Token ID257
Feature activation-5.98e-01

Pos logit contributions
+4.033
INST+3.852
aternal+3.762
 Rabbit+3.651
+3.569
Neg logit contributions
 comprehension-3.837
tesy-3.682
abeth-3.532
 stagn-3.520
ector-3.474
Token marriage
Token ID4845
Feature activation-1.92e-01

Pos logit contributions
+4.078
INST+3.890
aternal+3.797
 Rabbit+3.688
agy+3.610
Neg logit contributions
 comprehension-3.682
tesy-3.452
abeth-3.394
 stagn-3.376
ector-3.316
Token under
Token ID739
Feature activation-3.00e-01

Pos logit contributions
+1.282
INST+1.203
aternal+1.150
agy+1.129
 Rabbit+1.122
Neg logit contributions
 comprehension-1.268
tesy-1.194
 stagn-1.171
abeth-1.149
ector-1.127
Token the
Token ID262
Feature activation-6.97e-01

Pos logit contributions
+2.223
INST+2.109
aternal+2.036
 Rabbit+2.000
agy+1.992
Neg logit contributions
 comprehension-1.743
tesy-1.637
 stagn-1.591
abeth-1.560
ector-1.520
Token the
Token ID262
Feature activation-2.91e-01

Pos logit contributions
 comprehension+0.257
tesy+0.237
 stagn+0.235
abeth+0.227
ector+0.221
Neg logit contributions
-0.300
INST-0.281
aternal-0.268
agy-0.267
 Rabbit-0.262
Token reputation
Token ID8507
Feature activation-3.10e-01

Pos logit contributions
+2.083
INST+1.956
aternal+1.879
agy+1.854
 Rabbit+1.839
Neg logit contributions
 comprehension-1.776
tesy-1.695
 stagn-1.630
abeth-1.605
ector-1.573
Token of
Token ID286
Feature activation-3.66e-01

Pos logit contributions
+2.160
INST+2.033
aternal+1.941
agy+1.908
 Rabbit+1.899
Neg logit contributions
 comprehension-1.949
tesy-1.849
 stagn-1.793
abeth-1.771
ector-1.721
Token medicine
Token ID9007
Feature activation-2.30e-01

Pos logit contributions
+2.539
INST+2.379
aternal+2.280
agy+2.236
 Rabbit+2.223
Neg logit contributions
 comprehension-2.326
tesy-2.207
 stagn-2.142
abeth-2.097
ector-2.061
Token so
Token ID523
Feature activation+1.37e-02

Pos logit contributions
+1.499
INST+1.397
aternal+1.330
agy+1.314
 Rabbit+1.297
Neg logit contributions
 comprehension-1.565
tesy-1.478
 stagn-1.442
abeth-1.420
ector-1.395
Token greatly
Token ID9257
Feature activation-1.14e+00

Pos logit contributions
 comprehension+0.083
tesy+0.076
 stagn+0.076
abeth+0.073
 unde+0.071
Neg logit contributions
-0.102
INST-0.095
aternal-0.092
agy-0.091
 Rabbit-0.089
Token that
Token ID326
Feature activation-3.59e-01

Pos logit contributions
+5.922
INST+5.435
aternal+5.077
+4.926
 Rabbit+4.893
Neg logit contributions
 comprehension-8.554
tesy-8.276
abeth-7.950
 stagn-7.839
ector-7.763
Token women
Token ID1466
Feature activation-3.30e-01

Pos logit contributions
+2.506
INST+2.342
aternal+2.236
agy+2.207
 Rabbit+2.193
Neg logit contributions
 comprehension-2.275
tesy-2.162
 stagn-2.088
abeth-2.052
ector-2.014
Token like
Token ID588
Feature activation+1.98e-01

Pos logit contributions
+2.341
INST+2.189
aternal+2.085
agy+2.063
 Rabbit+2.041
Neg logit contributions
 comprehension-2.069
tesy-1.949
 stagn-1.875
abeth-1.861
ector-1.825
Token Ok
Token ID6762
Feature activation+1.06e-01

Pos logit contributions
 comprehension+1.431
 stagn+1.330
tesy+1.324
abeth+1.303
 unde+1.272
Neg logit contributions
-1.198
INST-1.120
aternal-1.058
agy-1.048
iotics-1.026
Tokenami
Token ID6277
Feature activation-2.31e-01

Pos logit contributions
 comprehension+0.593
tesy+0.554
 stagn+0.535
abeth+0.524
ector+0.512
Neg logit contributions
-0.826
INST-0.777
aternal-0.735
 Rabbit-0.730
agy-0.724
Token while
Token ID981
Feature activation-4.31e-02

Pos logit contributions
+1.745
INST+1.623
aternal+1.549
agy+1.546
 Rabbit+1.531
Neg logit contributions
 comprehension-1.562
tesy-1.472
 stagn-1.438
abeth-1.410
ector-1.373
Token 12
Token ID1105
Feature activation+3.95e-02

Pos logit contributions
+0.306
INST+0.287
aternal+0.274
agy+0.271
 Rabbit+0.265
Neg logit contributions
 comprehension-0.274
tesy-0.255
 stagn-0.251
abeth-0.243
ector-0.239
Token months
Token ID1933
Feature activation-1.76e-01

Pos logit contributions
 comprehension+0.267
tesy+0.249
 stagn+0.246
abeth+0.239
 unde+0.235
Neg logit contributions
-0.263
INST-0.245
aternal-0.234
agy-0.232
iotics-0.225
Token is
Token ID318
Feature activation-3.66e-01

Pos logit contributions
+1.168
INST+1.087
aternal+1.035
agy+1.027
 Rabbit+1.011
Neg logit contributions
 comprehension-1.191
tesy-1.121
 stagn-1.098
abeth-1.077
ector-1.052
Token probably
Token ID2192
Feature activation-8.28e-01

Pos logit contributions
+2.574
INST+2.407
agy+2.308
aternal+2.286
 Rabbit+2.283
Neg logit contributions
 comprehension-2.269
tesy-2.158
 stagn-2.072
abeth-2.070
ector-2.019
Token too
Token ID1165
Feature activation-1.14e+00

Pos logit contributions
+5.192
INST+4.866
agy+4.721
 Rabbit+4.710
aternal+4.599
Neg logit contributions
 comprehension-5.292
tesy-5.210
abeth-5.040
ector-4.871
 stagn-4.823
Token early
Token ID1903
Feature activation-1.82e-01

Pos logit contributions
+7.049
INST+6.631
agy+6.393
 Rabbit+6.344
+6.289
Neg logit contributions
 comprehension-7.202
tesy-7.108
ector-6.800
abeth-6.749
 stagn-6.570
Token to
Token ID284
Feature activation-1.51e-01

Pos logit contributions
+1.355
INST+1.266
aternal+1.214
agy+1.211
 Rabbit+1.189
Neg logit contributions
 comprehension-1.073
tesy-1.008
 stagn-0.981
abeth-0.968
ector-0.940
Token declare
Token ID13627
Feature activation+7.35e-02

Pos logit contributions
+1.071
INST+1.003
aternal+0.959
agy+0.953
 Rabbit+0.939
Neg logit contributions
 comprehension-0.947
tesy-0.888
 stagn-0.868
abeth-0.849
ector-0.829
Token the
Token ID262
Feature activation-1.89e-01

Pos logit contributions
 comprehension+0.447
tesy+0.407
 stagn+0.406
abeth+0.391
ector+0.383
Neg logit contributions
-0.544
INST-0.510
aternal-0.491
agy-0.484
iotics-0.475
Token curse
Token ID17328
Feature activation-7.13e-02

Pos logit contributions
+1.294
INST+1.209
aternal+1.155
agy+1.143
 Rabbit+1.119
Neg logit contributions
 comprehension-1.247
tesy-1.161
 stagn-1.148
abeth-1.115
ector-1.090
Token sn
Token ID3013
Feature activation+2.08e-01

Pos logit contributions
 comprehension+1.790
 stagn+1.647
tesy+1.616
 unde+1.561
abeth+1.552
Neg logit contributions
-2.188
INST-2.065
aternal-1.954
agy-1.938
 Rabbit-1.891
Tokenacking
Token ID5430
Feature activation-2.31e-02

Pos logit contributions
 comprehension+1.037
tesy+0.974
 stagn+0.926
ector+0.907
abeth+0.898
Neg logit contributions
-1.781
INST-1.657
aternal-1.572
 Rabbit-1.560
agy-1.539
Token).
Token ID737
Feature activation+3.34e-01

Pos logit contributions
+0.147
INST+0.137
aternal+0.131
agy+0.129
 Rabbit+0.126
Neg logit contributions
 comprehension-0.164
tesy-0.153
 stagn-0.151
abeth-0.147
 unde-0.144
Token The
Token ID383
Feature activation+8.75e-02

Pos logit contributions
 comprehension+2.340
tesy+2.179
 stagn+2.146
abeth+2.123
 Stru+2.096
Neg logit contributions
-2.048
INST-1.888
aternal-1.875
agy-1.829
iotics-1.808
Token easiest
Token ID16638
Feature activation+7.76e-02

Pos logit contributions
 comprehension+0.568
 stagn+0.519
tesy+0.518
abeth+0.499
 unde+0.492
Neg logit contributions
-0.609
INST-0.571
aternal-0.546
agy-0.540
 Rabbit-0.528
Token and
Token ID290
Feature activation+1.03e+00

Pos logit contributions
 comprehension+0.432
tesy+0.390
 stagn+0.389
abeth+0.371
 unde+0.364
Neg logit contributions
-0.614
INST-0.578
aternal-0.559
agy-0.550
 Rabbit-0.543
Token best
Token ID1266
Feature activation+1.25e-01

Pos logit contributions
 comprehension+5.863
 stagn+5.297
 unde+5.101
tesy+5.065
abeth+4.758
Neg logit contributions
-7.386
INST-7.250
aternal-6.872
 Cosponsors-6.740
iotics-6.683
Token way
Token ID835
Feature activation+1.86e-01

Pos logit contributions
 comprehension+0.672
tesy+0.604
 stagn+0.600
abeth+0.569
 unde+0.561
Neg logit contributions
-1.012
INST-0.953
aternal-0.922
agy-0.910
 Rabbit-0.896
Token to
Token ID284
Feature activation-2.36e-02

Pos logit contributions
 comprehension+1.017
tesy+0.933
 stagn+0.920
abeth+0.887
ector+0.863
Neg logit contributions
-1.481
INST-1.394
aternal-1.344
agy-1.329
 Rabbit-1.314
Token accomplish
Token ID9989
Feature activation+2.28e-01

Pos logit contributions
+0.183
INST+0.172
aternal+0.165
agy+0.164
 Rabbit+0.162
Neg logit contributions
 comprehension-0.133
tesy-0.124
 stagn-0.121
abeth-0.118
ector-0.115
Token this
Token ID428
Feature activation+6.77e-02

Pos logit contributions
 comprehension+1.103
 stagn+0.981
tesy+0.969
abeth+0.915
 unde+0.906
Neg logit contributions
-1.975
INST-1.850
aternal-1.794
agy-1.777
 Rabbit-1.756
Token "
Token ID366
Feature activation+1.05e-01

Pos logit contributions
 comprehension+1.353
tesy+1.246
 stagn+1.235
abeth+1.192
 unde+1.179
Neg logit contributions
-1.628
INST-1.523
aternal-1.464
agy-1.442
iotics-1.429
Tokento
Token ID1462
Feature activation+1.30e-01

Pos logit contributions
 comprehension+0.607
tesy+0.568
 stagn+0.552
abeth+0.540
ector+0.526
Neg logit contributions
-0.795
INST-0.741
aternal-0.715
agy-0.710
 Rabbit-0.704
Token enc
Token ID2207
Feature activation+1.51e-01

Pos logit contributions
 comprehension+0.738
tesy+0.676
 stagn+0.675
abeth+0.643
 unde+0.635
Neg logit contributions
-1.020
INST-0.956
aternal-0.920
agy-0.916
 Rabbit-0.897
Tokenirc
Token ID1980
Feature activation+7.58e-02

Pos logit contributions
 comprehension+1.039
tesy+1.002
abeth+0.975
 stagn+0.973
ector+0.953
Neg logit contributions
-1.001
INST-0.915
aternal-0.841
 Rabbit-0.837
agy-0.827
Tokenle
Token ID293
Feature activation+8.79e-02

Pos logit contributions
 comprehension+0.471
tesy+0.441
 stagn+0.433
abeth+0.421
ector+0.410
Neg logit contributions
-0.550
INST-0.514
aternal-0.487
agy-0.484
 Rabbit-0.479
Token and
Token ID290
Feature activation+9.12e-01

Pos logit contributions
 comprehension+0.603
tesy+0.562
 stagn+0.557
abeth+0.541
 unde+0.530
Neg logit contributions
-0.578
INST-0.540
aternal-0.512
agy-0.510
 Rabbit-0.498
Token destroy
Token ID4117
Feature activation+2.89e-01

Pos logit contributions
 comprehension+3.565
 stagn+3.209
 unde+2.947
tesy+2.940
abeth+2.748
Neg logit contributions
-8.551
INST-8.078
 Cosponsors-7.879
aternal-7.801
agy-7.782
Token the
Token ID262
Feature activation+1.15e-01

Pos logit contributions
 comprehension+1.833
 stagn+1.679
tesy+1.674
abeth+1.616
 unde+1.603
Neg logit contributions
-2.052
INST-1.930
aternal-1.837
agy-1.830
iotics-1.790
Token enemy
Token ID4472
Feature activation+1.60e-01

Pos logit contributions
 comprehension+0.757
 stagn+0.695
tesy+0.692
abeth+0.669
 unde+0.662
Neg logit contributions
-0.792
INST-0.739
aternal-0.701
agy-0.699
iotics-0.684
Token grouping
Token ID36115
Feature activation+1.34e-01

Pos logit contributions
 comprehension+0.929
tesy+0.849
 stagn+0.842
abeth+0.807
 unde+0.789
Neg logit contributions
-1.229
INST-1.153
aternal-1.103
agy-1.102
iotics-1.079
Token invading
Token ID33827
Feature activation+4.83e-01

Pos logit contributions
 comprehension+0.803
tesy+0.742
 stagn+0.736
abeth+0.710
 unde+0.696
Neg logit contributions
-0.994
INST-0.931
aternal-0.896
agy-0.891
iotics-0.872
Token (
Token ID357
Feature activation+6.71e-01

Pos logit contributions
 comprehension+2.431
tesy+2.253
 stagn+2.181
abeth+2.168
ector+2.146
Neg logit contributions
-2.556
aternal-2.406
agy-2.371
INST-2.356
iotics-2.343
Token path
Token ID3108
Feature activation+4.93e-02

Pos logit contributions
 comprehension+4.299
tesy+3.959
 stagn+3.892
abeth+3.766
 unde+3.739
Neg logit contributions
-4.528
INST-4.102
aternal-4.094
agy-4.078
iotics-3.983
Token,
Token ID837
Feature activation+4.01e-01

Pos logit contributions
 comprehension+0.328
tesy+0.307
 stagn+0.299
abeth+0.292
ector+0.289
Neg logit contributions
-0.332
INST-0.309
aternal-0.298
agy-0.296
 Rabbit-0.291
Token 4
Token ID604
Feature activation+4.37e-02

Pos logit contributions
 comprehension+2.582
tesy+2.360
 stagn+2.351
abeth+2.248
 unde+2.242
Neg logit contributions
-2.801
INST-2.548
aternal-2.517
agy-2.510
iotics-2.439
Token )
Token ID1267
Feature activation+7.81e-01

Pos logit contributions
 comprehension+0.312
tesy+0.294
 stagn+0.287
abeth+0.281
ector+0.278
Neg logit contributions
-0.274
INST-0.251
agy-0.241
aternal-0.240
 Rabbit-0.234
Token =
Token ID796
Feature activation+9.01e-01

Pos logit contributions
 comprehension+4.600
tesy+4.166
 stagn+4.043
 unde+3.994
ector+3.979
Neg logit contributions
-5.411
agy-5.206
aternal-5.049
iotics-4.950
INST-4.941
Token '.
Token ID45302
Feature activation-9.15e-02

Pos logit contributions
 comprehension+5.504
 stagn+4.887
tesy+4.873
 unde+4.778
ector+4.636
Neg logit contributions
-6.205
agy-5.759
aternal-5.740
INST-5.667
iotics-5.646
Tokencl
Token ID565
Feature activation-7.75e-02

Pos logit contributions
+0.581
INST+0.538
aternal+0.513
agy+0.508
 Rabbit+0.499
Neg logit contributions
 comprehension-0.646
tesy-0.609
 stagn-0.598
abeth-0.585
ector-0.573
Tokenj
Token ID73
Feature activation+1.49e-01

Pos logit contributions
+0.443
INST+0.407
aternal+0.385
agy+0.380
 Rabbit+0.374
Neg logit contributions
 comprehension-0.597
tesy-0.566
 stagn-0.557
abeth-0.545
ector-0.536
Token'
Token ID6
Feature activation+6.07e-01

Pos logit contributions
 comprehension+0.875
tesy+0.812
 stagn+0.796
abeth+0.770
ector+0.759
Neg logit contributions
-1.117
INST-1.044
aternal-1.010
agy-1.008
 Rabbit-0.983
Token OR
Token ID6375
Feature activation+4.66e-01

Pos logit contributions
 comprehension+4.027
 stagn+3.639
tesy+3.626
 unde+3.534
ector+3.499
Neg logit contributions
-4.034
agy-3.666
INST-3.665
aternal-3.636
iotics-3.528
Token aren
Token ID3588
Feature activation+5.43e-01

Pos logit contributions
 comprehension+3.568
 stagn+3.327
tesy+3.235
abeth+3.106
 unde+3.095
Neg logit contributions
-4.274
INST-4.039
aternal-3.901
agy-3.826
iotics-3.755
Token
Token ID447
Feature activation+1.93e-02

Pos logit contributions
 comprehension+2.666
tesy+2.562
 stagn+2.394
abeth+2.342
CTV+2.315
Neg logit contributions
-4.391
INST-4.077
aternal-4.030
iotics-4.024
 Rabbit-3.969
Token
Token ID247
Feature activation+1.98e-01

Pos logit contributions
 comprehension+0.129
tesy+0.121
 stagn+0.119
abeth+0.116
ector+0.113
Neg logit contributions
-0.130
INST-0.121
aternal-0.115
agy-0.114
 Rabbit-0.113
Tokent
Token ID83
Feature activation+4.34e-01

Pos logit contributions
 comprehension+1.212
tesy+1.140
 stagn+1.111
abeth+1.081
ector+1.056
Neg logit contributions
-1.432
INST-1.344
aternal-1.288
agy-1.272
 Rabbit-1.267
Token able
Token ID1498
Feature activation+2.94e-01

Pos logit contributions
 comprehension+2.446
tesy+2.253
 stagn+2.232
abeth+2.148
 unde+2.106
Neg logit contributions
-3.327
INST-3.148
aternal-3.020
agy-2.986
 Rabbit-2.957
Token to
Token ID284
Feature activation+8.21e-01

Pos logit contributions
 comprehension+1.458
tesy+1.329
 stagn+1.313
abeth+1.250
 unde+1.216
Neg logit contributions
-2.472
INST-2.347
aternal-2.254
agy-2.230
 Rabbit-2.220
Token boast
Token ID28020
Feature activation+7.56e-01

Pos logit contributions
 comprehension+4.533
 stagn+4.173
tesy+4.138
 unde+3.973
abeth+3.812
Neg logit contributions
-6.193
INST-5.946
aternal-5.681
agy-5.604
 Cosponsors-5.589
Token;
Token ID26
Feature activation+1.01e-01

Pos logit contributions
 comprehension+4.559
tesy+4.290
 stagn+4.159
 unde+4.017
abeth+3.971
Neg logit contributions
-5.141
INST-4.984
aternal-4.782
agy-4.772
 Rabbit-4.766
Token another
Token ID1194
Feature activation-3.09e-03

Pos logit contributions
 comprehension+0.638
tesy+0.591
 stagn+0.584
abeth+0.564
 unde+0.552
Neg logit contributions
-0.712
INST-0.668
aternal-0.641
agy-0.634
 Rabbit-0.623
Token popular
Token ID2968
Feature activation+6.40e-01

Pos logit contributions
+0.022
INST+0.021
aternal+0.020
agy+0.020
 Rabbit+0.019
Neg logit contributions
 comprehension-0.019
tesy-0.018
 stagn-0.018
abeth-0.017
ector-0.017
Token and
Token ID290
Feature activation+4.83e-01

Pos logit contributions
 comprehension+4.031
 stagn+3.618
tesy+3.601
 unde+3.477
abeth+3.370
Neg logit contributions
-4.440
INST-4.222
aternal-4.070
agy-3.991
iotics-3.944
Token by
Token ID416
Feature activation+1.08e-01

Pos logit contributions
+2.278
INST+2.112
aternal+2.007
agy+1.989
 Rabbit+1.969
Neg logit contributions
 comprehension-2.223
tesy-2.092
 stagn-2.049
abeth-2.009
ector-1.968
Token saying
Token ID2282
Feature activation-3.94e-02

Pos logit contributions
 comprehension+0.648
 stagn+0.598
tesy+0.596
abeth+0.567
 unde+0.560
Neg logit contributions
-0.794
INST-0.745
aternal-0.721
agy-0.711
iotics-0.698
Token that
Token ID326
Feature activation-1.26e-01

Pos logit contributions
+0.250
INST+0.232
aternal+0.221
agy+0.219
 Rabbit+0.214
Neg logit contributions
 comprehension-0.279
tesy-0.262
 stagn-0.259
abeth-0.253
ector-0.247
Token Lee
Token ID5741
Feature activation+9.42e-02

Pos logit contributions
+0.840
INST+0.782
aternal+0.744
agy+0.738
 Rabbit+0.730
Neg logit contributions
 comprehension-0.845
tesy-0.796
 stagn-0.778
abeth-0.762
ector-0.746
Token shows
Token ID2523
Feature activation+5.72e-01

Pos logit contributions
 comprehension+0.605
tesy+0.565
 stagn+0.561
abeth+0.541
 unde+0.528
Neg logit contributions
-0.655
INST-0.615
aternal-0.588
agy-0.582
 Rabbit-0.570
Token no
Token ID645
Feature activation+8.60e-01

Pos logit contributions
 comprehension+3.468
 stagn+3.184
tesy+3.097
 unde+3.033
abeth+2.992
Neg logit contributions
-4.167
INST-3.951
aternal-3.777
agy-3.747
iotics-3.677
Token love
Token ID1842
Feature activation+3.76e-01

Pos logit contributions
 comprehension+5.125
 stagn+4.397
 unde+4.222
tesy+4.213
abeth+3.976
Neg logit contributions
-6.430
INST-6.235
agy-5.890
aternal-5.881
iotics-5.791
Token,
Token ID11
Feature activation-9.19e-02

Pos logit contributions
 comprehension+2.129
tesy+1.986
 stagn+1.922
abeth+1.861
 unde+1.806
Neg logit contributions
-2.864
INST-2.732
aternal-2.608
agy-2.599
 Rabbit-2.563
Token and
Token ID290
Feature activation-2.22e-01

Pos logit contributions
+0.642
INST+0.600
aternal+0.572
agy+0.568
 Rabbit+0.559
Neg logit contributions
 comprehension-0.592
tesy-0.554
 stagn-0.544
abeth-0.530
ector-0.517
Token attention
Token ID3241
Feature activation+3.04e-01

Pos logit contributions
+1.504
INST+1.400
aternal+1.334
agy+1.325
 Rabbit+1.312
Neg logit contributions
 comprehension-1.466
tesy-1.379
 stagn-1.347
abeth-1.319
ector-1.293
Token towards
Token ID3371
Feature activation-8.27e-02

Pos logit contributions
 comprehension+1.766
 stagn+1.625
tesy+1.623
abeth+1.569
 unde+1.504
Neg logit contributions
-2.301
INST-2.171
aternal-2.057
agy-2.048
 Rabbit-2.021
Token but
Token ID475
Feature activation-1.23e-01

Pos logit contributions
+1.179
INST+1.106
aternal+1.057
agy+1.051
 Rabbit+1.039
Neg logit contributions
 comprehension-0.962
tesy-0.901
 stagn-0.877
abeth-0.861
ector-0.838
Token at
Token ID379
Feature activation+6.18e-02

Pos logit contributions
+0.878
INST+0.822
aternal+0.787
agy+0.781
 Rabbit+0.774
Neg logit contributions
 comprehension-0.764
tesy-0.716
 stagn-0.697
abeth-0.685
ector-0.666
Token that
Token ID326
Feature activation-4.77e-01

Pos logit contributions
 comprehension+0.391
tesy+0.363
 stagn+0.361
abeth+0.350
ector+0.342
Neg logit contributions
-0.440
INST-0.411
aternal-0.394
agy-0.392
iotics-0.382
Token time
Token ID640
Feature activation-3.20e-01

Pos logit contributions
+2.374
INST+2.121
aternal+1.986
 Rabbit+1.953
agy+1.943
Neg logit contributions
 comprehension-3.974
tesy-3.810
 stagn-3.726
abeth-3.683
ector-3.628
Token discussions
Token ID9984
Feature activation+3.66e-01

Pos logit contributions
+2.209
INST+2.046
aternal+1.964
agy+1.954
 Rabbit+1.947
Neg logit contributions
 comprehension-2.044
tesy-1.932
 stagn-1.872
abeth-1.846
ector-1.798
Token were
Token ID547
Feature activation+1.59e-01

Pos logit contributions
 comprehension+2.122
 stagn+1.988
tesy+1.954
abeth+1.853
 unde+1.827
Neg logit contributions
-2.829
INST-2.649
aternal-2.532
agy-2.503
 Rabbit-2.443
Token still
Token ID991
Feature activation+1.06e-01

Pos logit contributions
 comprehension+0.872
 stagn+0.806
tesy+0.797
abeth+0.754
 unde+0.750
Neg logit contributions
-1.275
INST-1.197
aternal-1.147
agy-1.139
iotics-1.121
Token said
Token ID531
Feature activation+5.54e-03

Pos logit contributions
 comprehension+0.542
 stagn+0.500
tesy+0.491
 unde+0.467
abeth+0.463
Neg logit contributions
-0.892
INST-0.841
aternal-0.807
agy-0.801
iotics-0.790
Token to
Token ID284
Feature activation+3.04e-01

Pos logit contributions
 comprehension+0.030
tesy+0.028
 stagn+0.027
abeth+0.026
ector+0.026
Neg logit contributions
-0.044
INST-0.042
aternal-0.040
agy-0.040
 Rabbit-0.040
Token be
Token ID307
Feature activation+1.38e-01

Pos logit contributions
 comprehension+1.836
 stagn+1.738
tesy+1.703
 unde+1.614
abeth+1.614
Neg logit contributions
-2.258
INST-2.115
aternal-2.005
agy-2.005
 Cosponsors-1.966
Token
Token ID564
Feature activation+3.39e-03

Pos logit contributions
 comprehension+0.740
 stagn+0.684
tesy+0.670
 unde+0.638
abeth+0.637
Neg logit contributions
-1.127
INST-1.061
aternal-1.018
agy-1.008
iotics-0.994
Token of
Token ID286
Feature activation-3.77e-01

Pos logit contributions
+2.070
INST+1.932
aternal+1.841
agy+1.831
 Rabbit+1.816
Neg logit contributions
 comprehension-1.942
tesy-1.829
 stagn-1.773
abeth-1.747
ector-1.699
Token travel
Token ID3067
Feature activation+3.19e-01

Pos logit contributions
+2.602
INST+2.434
aternal+2.318
agy+2.303
 Rabbit+2.293
Neg logit contributions
 comprehension-2.413
tesy-2.286
 stagn-2.203
abeth-2.179
ector-2.128
Token from
Token ID422
Feature activation-5.37e-02

Pos logit contributions
 comprehension+2.028
 stagn+1.874
tesy+1.860
abeth+1.775
 unde+1.743
Neg logit contributions
-2.246
INST-2.109
aternal-2.019
agy-1.981
 Cosponsors-1.961
Token Down
Token ID5588
Feature activation-7.22e-02

Pos logit contributions
+0.342
INST+0.317
aternal+0.302
agy+0.299
 Rabbit+0.293
Neg logit contributions
 comprehension-0.379
tesy-0.356
 stagn-0.351
abeth-0.342
ector-0.336
Token Under
Token ID4698
Feature activation+1.47e-02

Pos logit contributions
+0.418
INST+0.385
aternal+0.364
agy+0.360
 Rabbit+0.353
Neg logit contributions
 comprehension-0.550
tesy-0.521
 stagn-0.513
abeth-0.502
ector-0.493
Token?
Token ID30
Feature activation+2.15e-01

Pos logit contributions
 comprehension+0.097
tesy+0.091
 stagn+0.090
abeth+0.087
ector+0.085
Neg logit contributions
-0.100
INST-0.093
aternal-0.089
agy-0.088
 Rabbit-0.086
Token Not
Token ID1892
Feature activation+2.79e-01

Pos logit contributions
 comprehension+1.562
tesy+1.460
 stagn+1.450
abeth+1.401
ector+1.385
Neg logit contributions
-1.296
INST-1.205
aternal-1.168
agy-1.145
iotics-1.120
Token something
Token ID1223
Feature activation-1.12e-01

Pos logit contributions
 comprehension+1.666
 stagn+1.525
tesy+1.522
abeth+1.439
 unde+1.423
Neg logit contributions
-2.074
INST-1.948
aternal-1.869
agy-1.846
 Cosponsors-1.804
Token I
Token ID314
Feature activation+1.64e-01

Pos logit contributions
+0.788
INST+0.737
aternal+0.705
agy+0.700
 Rabbit+0.685
Neg logit contributions
 comprehension-0.713
tesy-0.664
 stagn-0.655
abeth-0.635
ector-0.621
Token would
Token ID561
Feature activation+1.68e-01

Pos logit contributions
 comprehension+0.963
tesy+0.887
 stagn+0.882
abeth+0.849
 unde+0.823
Neg logit contributions
-1.243
INST-1.161
aternal-1.118
agy-1.098
 Rabbit-1.085
Token bet
Token ID731
Feature activation+3.77e-03

Pos logit contributions
 comprehension+1.008
 stagn+0.934
tesy+0.924
abeth+0.885
 unde+0.877
Neg logit contributions
-1.266
INST-1.174
aternal-1.129
agy-1.113
 Rabbit-1.097
Token bike
Token ID7161
Feature activation-5.28e-02

Pos logit contributions
 comprehension+0.411
tesy+0.380
 stagn+0.378
abeth+0.364
 unde+0.360
Neg logit contributions
-0.439
INST-0.408
aternal-0.392
agy-0.387
iotics-0.378
Token turned
Token ID2900
Feature activation-2.64e-01

Pos logit contributions
+0.356
INST+0.331
aternal+0.316
agy+0.312
 Rabbit+0.307
Neg logit contributions
 comprehension-0.355
tesy-0.332
 stagn-0.327
abeth-0.318
ector-0.312
Token up
Token ID510
Feature activation-4.80e-01

Pos logit contributions
+1.984
INST+1.860
aternal+1.783
agy+1.772
 Rabbit+1.751
Neg logit contributions
 comprehension-1.552
tesy-1.451
 stagn-1.414
abeth-1.383
ector-1.346
Token Thursday
Token ID3635
Feature activation+1.35e-01

Pos logit contributions
+3.312
INST+3.115
agy+2.963
aternal+2.951
 Rabbit+2.914
Neg logit contributions
 comprehension-3.043
tesy-2.871
abeth-2.766
 stagn-2.757
ector-2.705
Token when
Token ID618
Feature activation+1.91e-01

Pos logit contributions
 comprehension+0.805
tesy+0.749
 stagn+0.737
abeth+0.706
 unde+0.703
Neg logit contributions
-1.015
INST-0.949
aternal-0.916
agy-0.906
iotics-0.891
Token Andre
Token ID10948
Feature activation+3.88e-01

Pos logit contributions
 comprehension+1.166
tesy+1.059
 stagn+1.050
abeth+1.013
ector+0.999
Neg logit contributions
-1.429
INST-1.327
aternal-1.277
agy-1.256
iotics-1.238
Token Sc
Token ID1446
Feature activation-2.55e-01

Pos logit contributions
 comprehension+2.781
tesy+2.625
 stagn+2.572
abeth+2.569
ector+2.517
Neg logit contributions
-2.353
INST-2.210
aternal-2.063
agy-2.015
iotics-1.992
Tokenard
Token ID446
Feature activation+4.25e-01

Pos logit contributions
+1.765
INST+1.671
aternal+1.613
agy+1.606
 Rabbit+1.559
Neg logit contributions
 comprehension-1.624
tesy-1.512
 stagn-1.489
abeth-1.430
 unde-1.404
Tokenina
Token ID1437
Feature activation+1.02e-01

Pos logit contributions
 comprehension+3.004
tesy+2.832
 stagn+2.768
abeth+2.764
ector+2.707
Neg logit contributions
-2.682
INST-2.464
aternal-2.276
 Rabbit-2.220
 Cosponsors-2.215
Token,
Token ID11
Feature activation-1.06e-01

Pos logit contributions
 comprehension+0.685
tesy+0.644
 stagn+0.632
abeth+0.618
ector+0.608
Neg logit contributions
-0.682
INST-0.636
aternal-0.606
agy-0.595
 Rabbit-0.588
Token 19
Token ID678
Feature activation-5.20e-02

Pos logit contributions
+0.718
INST+0.668
aternal+0.637
agy+0.631
 Rabbit+0.620
Neg logit contributions
 comprehension-0.706
tesy-0.660
 stagn-0.649
abeth-0.633
ector-0.621
Token large
Token ID1588
Feature activation-2.92e-01

Pos logit contributions
+0.883
INST+0.828
aternal+0.790
agy+0.785
 Rabbit+0.769
Neg logit contributions
 comprehension-0.785
tesy-0.726
 stagn-0.716
abeth-0.692
ector-0.678
Token pr
Token ID778
Feature activation+2.61e-02

Pos logit contributions
+1.956
INST+1.819
aternal+1.735
agy+1.721
 Rabbit+1.698
Neg logit contributions
 comprehension-1.942
tesy-1.837
 stagn-1.800
abeth-1.764
ector-1.723
Tokenimate
Token ID1920
Feature activation+8.64e-03

Pos logit contributions
 comprehension+0.184
tesy+0.173
 stagn+0.169
abeth+0.166
ector+0.163
Neg logit contributions
-0.167
INST-0.155
agy-0.145
aternal-0.145
 Rabbit-0.143
Token brain
Token ID3632
Feature activation+3.65e-01

Pos logit contributions
 comprehension+0.056
tesy+0.051
 stagn+0.050
abeth+0.048
 unde+0.048
Neg logit contributions
-0.061
INST-0.058
aternal-0.055
agy-0.054
iotics-0.054
Token,
Token ID11
Feature activation+2.36e-01

Pos logit contributions
 comprehension+2.299
tesy+2.068
 stagn+2.067
 unde+1.984
ector+1.966
Neg logit contributions
-2.593
INST-2.454
aternal-2.351
agy-2.318
 Cosponsors-2.277
Token or
Token ID393
Feature activation+3.15e-01

Pos logit contributions
 comprehension+1.516
tesy+1.395
 stagn+1.386
abeth+1.321
 unde+1.317
Neg logit contributions
-1.623
INST-1.545
aternal-1.465
agy-1.448
iotics-1.419
Token whether
Token ID1771
Feature activation+1.31e-01

Pos logit contributions
 comprehension+1.809
 stagn+1.611
tesy+1.607
 unde+1.530
abeth+1.526
Neg logit contributions
-2.419
INST-2.310
aternal-2.203
agy-2.189
iotics-2.144
Token a
Token ID257
Feature activation-4.99e-02

Pos logit contributions
 comprehension+0.789
 stagn+0.708
tesy+0.704
abeth+0.676
 unde+0.670
Neg logit contributions
-0.989
INST-0.935
aternal-0.894
agy-0.885
iotics-0.868
Token honey
Token ID12498
Feature activation+3.49e-01

Pos logit contributions
+0.354
INST+0.332
aternal+0.316
agy+0.314
iotics+0.307
Neg logit contributions
 comprehension-0.319
tesy-0.294
 stagn-0.291
abeth-0.281
ector-0.275
Tokenbee
Token ID20963
Feature activation+5.32e-02

Pos logit contributions
 comprehension+2.617
tesy+2.461
 stagn+2.419
abeth+2.368
ector+2.321
Neg logit contributions
-2.069
INST-1.902
aternal-1.782
agy-1.765
 Rabbit-1.729
Token might
Token ID1244
Feature activation-2.67e-01

Pos logit contributions
 comprehension+0.330
tesy+0.300
 stagn+0.299
abeth+0.286
ector+0.282
Neg logit contributions
-0.389
INST-0.365
aternal-0.349
agy-0.346
iotics-0.341
Token published
Token ID3199
Feature activation+5.75e-02

Pos logit contributions
 comprehension+0.092
tesy+0.087
 stagn+0.086
abeth+0.084
ector+0.082
Neg logit contributions
-0.076
INST-0.070
aternal-0.067
agy-0.066
 Rabbit-0.065
Token at
Token ID379
Feature activation+2.49e-01

Pos logit contributions
 comprehension+0.378
tesy+0.356
 stagn+0.349
abeth+0.340
ector+0.333
Neg logit contributions
-0.392
INST-0.365
aternal-0.349
agy-0.346
 Rabbit-0.339
Token The
Token ID383
Feature activation+1.09e-01

Pos logit contributions
 comprehension+1.700
tesy+1.594
 stagn+1.574
abeth+1.530
ector+1.506
Neg logit contributions
-1.640
INST-1.507
aternal-1.454
agy-1.445
iotics-1.404
Token Wall
Token ID5007
Feature activation-2.31e-01

Pos logit contributions
 comprehension+0.624
tesy+0.578
 stagn+0.564
abeth+0.556
ector+0.543
Neg logit contributions
-0.835
INST-0.780
aternal-0.749
agy-0.748
iotics-0.730
Token Street
Token ID3530
Feature activation+4.25e-02

Pos logit contributions
+1.139
INST+1.035
aternal+0.965
agy+0.954
 Rabbit+0.941
Neg logit contributions
 comprehension-1.952
tesy-1.859
 stagn-1.831
abeth-1.798
ector-1.766
Token Examiner
Token ID35855
Feature activation+2.84e-01

Pos logit contributions
 comprehension+0.140
tesy+0.122
 stagn+0.117
abeth+0.111
ector+0.104
Neg logit contributions
-0.431
INST-0.412
aternal-0.399
agy-0.396
 Rabbit-0.394
Token.
Token ID13
Feature activation+8.41e-02

Pos logit contributions
 comprehension+1.890
tesy+1.786
 stagn+1.740
abeth+1.691
ector+1.677
Neg logit contributions
-1.890
INST-1.747
aternal-1.693
agy-1.686
iotics-1.633
Token Follow
Token ID7281
Feature activation-2.38e-01

Pos logit contributions
 comprehension+0.627
tesy+0.600
 stagn+0.581
abeth+0.575
ector+0.565
Neg logit contributions
-0.491
INST-0.450
aternal-0.434
agy-0.430
iotics-0.420
Token the
Token ID262
Feature activation-1.42e-01

Pos logit contributions
+1.543
INST+1.439
aternal+1.363
agy+1.352
 Rabbit+1.336
Neg logit contributions
 comprehension-1.637
tesy-1.545
 stagn-1.517
abeth-1.479
ector-1.449
Token money
Token ID1637
Feature activation-1.65e-01

Pos logit contributions
+0.734
INST+0.670
aternal+0.629
agy+0.621
 Rabbit+0.609
Neg logit contributions
 comprehension-1.169
tesy-1.112
 stagn-1.095
abeth-1.074
ector-1.056
Token!
Token ID0
Feature activation+1.13e-01

Pos logit contributions
+1.161
INST+1.099
aternal+1.048
agy+1.043
 Rabbit+1.022
Neg logit contributions
 comprehension-1.037
tesy-0.968
 stagn-0.947
abeth-0.933
ector-0.901
Tokenll
Token ID297
Feature activation-7.27e-02

Pos logit contributions
+0.526
INST+0.494
aternal+0.475
agy+0.472
 Rabbit+0.465
Neg logit contributions
 comprehension-0.377
tesy-0.349
 stagn-0.342
abeth-0.332
ector-0.323
Token assume
Token ID7048
Feature activation-1.01e-01

Pos logit contributions
+0.526
INST+0.492
aternal+0.471
agy+0.468
 Rabbit+0.461
Neg logit contributions
 comprehension-0.449
tesy-0.419
 stagn-0.410
abeth-0.400
ector-0.391
Token a
Token ID257
Feature activation+8.29e-02

Pos logit contributions
+0.658
INST+0.611
aternal+0.582
agy+0.577
 Rabbit+0.567
Neg logit contributions
 comprehension-0.698
tesy-0.657
 stagn-0.646
abeth-0.630
ector-0.617
Token relatively
Token ID5365
Feature activation-1.08e-03

Pos logit contributions
 comprehension+0.501
tesy+0.463
 stagn+0.458
abeth+0.440
ector+0.431
Neg logit contributions
-0.617
INST-0.578
aternal-0.554
agy-0.550
 Rabbit-0.539
Token low
Token ID1877
Feature activation-3.58e-01

Pos logit contributions
+0.009
INST+0.008
aternal+0.008
agy+0.008
iotics+0.008
Neg logit contributions
 comprehension-0.006
tesy-0.005
 stagn-0.005
abeth-0.005
 unde-0.005
Token rate
Token ID2494
Feature activation-1.44e-01

Pos logit contributions
+2.880
INST+2.721
aternal+2.602
agy+2.590
 Rabbit+2.581
Neg logit contributions
 comprehension-1.854
tesy-1.736
 stagn-1.684
abeth-1.654
ector-1.608
Token,
Token ID11
Feature activation-2.23e-02

Pos logit contributions
+0.990
INST+0.923
aternal+0.881
agy+0.874
 Rabbit+0.861
Neg logit contributions
 comprehension-0.937
tesy-0.879
 stagn-0.862
abeth-0.842
ector-0.823
Token for
Token ID329
Feature activation+3.26e-01

Pos logit contributions
+0.158
INST+0.147
aternal+0.141
agy+0.140
 Rabbit+0.137
Neg logit contributions
 comprehension-0.141
tesy-0.132
 stagn-0.130
abeth-0.126
ector-0.123
Token a
Token ID257
Feature activation+2.01e-01

Pos logit contributions
 comprehension+1.975
tesy+1.814
 stagn+1.778
abeth+1.708
 unde+1.706
Neg logit contributions
-2.424
INST-2.260
agy-2.165
aternal-2.160
iotics-2.120
Token billionaire
Token ID18828
Feature activation+2.72e-01

Pos logit contributions
 comprehension+1.241
tesy+1.148
 stagn+1.138
abeth+1.080
 unde+1.074
Neg logit contributions
-1.478
INST-1.381
aternal-1.325
agy-1.321
iotics-1.293
Token,
Token ID11
Feature activation+1.87e-01

Pos logit contributions
 comprehension+1.807
 stagn+1.680
tesy+1.675
 unde+1.600
abeth+1.598
Neg logit contributions
-1.846
INST-1.718
aternal-1.648
agy-1.630
iotics-1.629
Token to
Token ID284
Feature activation-4.08e-02

Pos logit contributions
 comprehension+0.616
 stagn+0.568
tesy+0.567
abeth+0.545
 unde+0.534
Neg logit contributions
-0.759
INST-0.711
aternal-0.680
agy-0.676
iotics-0.663
Token eight
Token ID3624
Feature activation-2.06e-01

Pos logit contributions
+0.265
INST+0.245
aternal+0.233
agy+0.231
 Rabbit+0.227
Neg logit contributions
 comprehension-0.285
tesy-0.266
 stagn-0.264
abeth-0.256
ector-0.251
Token years
Token ID812
Feature activation+6.71e-02

Pos logit contributions
+1.187
INST+1.091
aternal+1.033
agy+1.021
 Rabbit+1.003
Neg logit contributions
 comprehension-1.570
tesy-1.491
 stagn-1.464
abeth-1.437
ector-1.410
Token made
Token ID925
Feature activation+2.47e-01

Pos logit contributions
 comprehension+0.442
tesy+0.412
 stagn+0.408
abeth+0.395
ector+0.388
Neg logit contributions
-0.458
INST-0.428
aternal-0.408
agy-0.405
 Rabbit-0.398
Token no
Token ID645
Feature activation-1.23e-01

Pos logit contributions
 comprehension+1.530
tesy+1.407
 stagn+1.399
abeth+1.338
 unde+1.324
Neg logit contributions
-1.809
INST-1.684
aternal-1.605
agy-1.593
iotics-1.561
Token distinction
Token ID12941
Feature activation-2.26e-01

Pos logit contributions
+0.911
INST+0.855
aternal+0.819
agy+0.813
 Rabbit+0.803
Neg logit contributions
 comprehension-0.738
tesy-0.690
 stagn-0.675
abeth-0.659
ector-0.642
Token between
Token ID1022
Feature activation+2.27e-01

Pos logit contributions
+1.690
INST+1.586
aternal+1.520
agy+1.509
 Rabbit+1.487
Neg logit contributions
 comprehension-1.337
tesy-1.245
 stagn-1.220
abeth-1.186
ector-1.156
Token youth
Token ID6205
Feature activation+6.62e-02

Pos logit contributions
 comprehension+1.474
 stagn+1.347
tesy+1.336
 unde+1.287
abeth+1.277
Neg logit contributions
-1.606
INST-1.486
agy-1.421
aternal-1.414
iotics-1.384
Token and
Token ID290
Feature activation-5.03e-01

Pos logit contributions
 comprehension+0.466
tesy+0.434
 stagn+0.429
abeth+0.414
ector+0.410
Neg logit contributions
-0.426
INST-0.396
aternal-0.376
agy-0.374
 Rabbit-0.368
Token adult
Token ID4044
Feature activation-1.99e-02

Pos logit contributions
+3.570
INST+3.376
aternal+3.253
+3.205
agy+3.190
Neg logit contributions
 comprehension-2.954
tesy-2.833
abeth-2.724
 stagn-2.718
ector-2.621
Token punishment
Token ID9837
Feature activation-1.15e-01

Pos logit contributions
+0.143
INST+0.134
aternal+0.128
agy+0.126
 Rabbit+0.124
Neg logit contributions
 comprehension-0.126
tesy-0.116
 stagn-0.115
abeth-0.110
 unde-0.109
Token the
Token ID262
Feature activation+2.94e-01

Pos logit contributions
 comprehension+1.241
tesy+1.151
 stagn+1.143
abeth+1.109
 unde+1.099
Neg logit contributions
-1.365
INST-1.288
aternal-1.214
agy-1.210
iotics-1.191
Token territories
Token ID16771
Feature activation-4.71e-02

Pos logit contributions
 comprehension+2.112
 stagn+1.967
tesy+1.947
 unde+1.907
abeth+1.882
Neg logit contributions
-1.845
INST-1.726
agy-1.615
aternal-1.613
iotics-1.584
Token ended
Token ID4444
Feature activation+1.45e-01

Pos logit contributions
+0.302
INST+0.281
aternal+0.267
agy+0.264
 Rabbit+0.259
Neg logit contributions
 comprehension-0.330
tesy-0.310
 stagn-0.307
abeth-0.298
ector-0.292
Token up
Token ID510
Feature activation+2.86e-02

Pos logit contributions
 comprehension+0.876
tesy+0.811
 stagn+0.805
abeth+0.776
 unde+0.759
Neg logit contributions
-1.069
INST-1.004
aternal-0.961
agy-0.959
 Rabbit-0.944
Token emphasizing
Token ID36360
Feature activation-1.08e-02

Pos logit contributions
 comprehension+0.164
tesy+0.151
 stagn+0.150
abeth+0.144
ector+0.140
Neg logit contributions
-0.220
INST-0.207
aternal-0.198
agy-0.197
 Rabbit-0.195
Token them
Token ID606
Feature activation-9.12e-02

Pos logit contributions
+0.075
INST+0.070
aternal+0.067
agy+0.067
 Rabbit+0.066
Neg logit contributions
 comprehension-0.070
tesy-0.065
 stagn-0.064
abeth-0.062
ector-0.061
Token.
Token ID13
Feature activation+1.05e-01

Pos logit contributions
+0.630
INST+0.590
aternal+0.562
agy+0.558
 Rabbit+0.550
Neg logit contributions
 comprehension-0.592
tesy-0.554
 stagn-0.546
abeth-0.531
ector-0.519
Token
Token ID447
Feature activation+1.44e-01

Pos logit contributions
 comprehension+0.764
tesy+0.720
 stagn+0.705
abeth+0.696
ector+0.680
Neg logit contributions
-0.636
INST-0.589
aternal-0.565
agy-0.562
iotics-0.547
Token
Token ID251
Feature activation+5.72e-02

Pos logit contributions
 comprehension+0.956
tesy+0.898
 stagn+0.879
abeth+0.864
ector+0.836
Neg logit contributions
-0.966
INST-0.904
aternal-0.866
agy-0.861
 Rabbit-0.839
Token In
Token ID554
Feature activation+2.22e-01

Pos logit contributions
 comprehension+0.430
tesy+0.406
 stagn+0.399
abeth+0.392
ector+0.384
Neg logit contributions
-0.332
INST-0.308
aternal-0.294
agy-0.291
iotics-0.283
Token other
Token ID584
Feature activation-1.32e-01

Pos logit contributions
 comprehension+1.355
tesy+1.243
 stagn+1.235
abeth+1.199
 unde+1.182
Neg logit contributions
-1.621
INST-1.521
aternal-1.450
agy-1.443
 Rabbit-1.419
Token a
Token ID257
Feature activation-1.02e-02

Pos logit contributions
+2.860
INST+2.667
aternal+2.534
agy+2.517
 Rabbit+2.512
Neg logit contributions
 comprehension-2.557
tesy-2.442
 stagn-2.366
abeth-2.336
ector-2.266
Token CV
Token ID26196
Feature activation+2.73e-01

Pos logit contributions
+0.072
INST+0.068
aternal+0.065
agy+0.064
 Rabbit+0.063
Neg logit contributions
 comprehension-0.064
tesy-0.060
 stagn-0.059
abeth-0.057
ector-0.056
Token the
Token ID262
Feature activation+1.36e-02

Pos logit contributions
 comprehension+1.813
tesy+1.661
 stagn+1.645
abeth+1.581
ector+1.572
Neg logit contributions
-1.860
INST-1.738
aternal-1.685
agy-1.662
 Rabbit-1.618
Token details
Token ID3307
Feature activation-2.83e-01

Pos logit contributions
 comprehension+0.084
tesy+0.078
 stagn+0.077
abeth+0.074
ector+0.073
Neg logit contributions
-0.099
INST-0.093
aternal-0.089
agy-0.088
 Rabbit-0.086
Token tell
Token ID1560
Feature activation-5.68e-02

Pos logit contributions
+1.954
INST+1.816
aternal+1.736
agy+1.729
 Rabbit+1.701
Neg logit contributions
 comprehension-1.825
tesy-1.724
 stagn-1.676
abeth-1.644
ector-1.608
Token you
Token ID345
Feature activation-2.52e-01

Pos logit contributions
+0.432
INST+0.406
aternal+0.390
agy+0.387
 Rabbit+0.381
Neg logit contributions
 comprehension-0.329
tesy-0.306
 stagn-0.300
abeth-0.291
ector-0.284
Token lots
Token ID6041
Feature activation-3.42e-02

Pos logit contributions
+1.760
INST+1.644
aternal+1.563
agy+1.555
 Rabbit+1.536
Neg logit contributions
 comprehension-1.599
tesy-1.514
 stagn-1.473
abeth-1.447
ector-1.410
Token
Token ID784
Feature activation+2.32e-02

Pos logit contributions
+0.248
INST+0.233
aternal+0.223
agy+0.221
 Rabbit+0.218
Neg logit contributions
 comprehension-0.211
tesy-0.196
 stagn-0.192
abeth-0.187
ector-0.183
Token the
Token ID262
Feature activation+2.24e-02

Pos logit contributions
 comprehension+0.148
tesy+0.138
 stagn+0.136
abeth+0.132
ector+0.129
Neg logit contributions
-0.164
INST-0.153
aternal-0.147
agy-0.146
 Rabbit-0.143
Token exact
Token ID2748
Feature activation-5.43e-01

Pos logit contributions
 comprehension+0.138
tesy+0.127
 stagn+0.126
abeth+0.122
ector+0.119
Neg logit contributions
-0.164
INST-0.154
aternal-0.147
agy-0.146
 Rabbit-0.143
Token list
Token ID1351
Feature activation-3.42e-01

Pos logit contributions
+3.977
INST+3.736
aternal+3.555
agy+3.552
 Rabbit+3.514
Neg logit contributions
 comprehension-3.153
tesy-3.039
 stagn-2.928
abeth-2.882
ector-2.785
Token power
Token ID1176
Feature activation-8.51e-02

Pos logit contributions
 comprehension+1.119
tesy+1.031
 stagn+1.030
 unde+0.982
abeth+0.978
Neg logit contributions
-1.047
INST-0.949
agy-0.914
aternal-0.904
iotics-0.882
Token-
Token ID12
Feature activation-2.96e-01

Pos logit contributions
+0.532
INST+0.492
aternal+0.467
agy+0.463
 Rabbit+0.455
Neg logit contributions
 comprehension-0.610
tesy-0.576
 stagn-0.566
abeth-0.553
ector-0.542
Tokenplay
Token ID1759
Feature activation-5.59e-01

Pos logit contributions
+1.451
INST+1.319
aternal+1.233
agy+1.218
 Rabbit+1.188
Neg logit contributions
 comprehension-2.518
tesy-2.395
 stagn-2.363
abeth-2.321
ector-2.281
Token time
Token ID640
Feature activation-2.73e-01

Pos logit contributions
+2.721
INST+2.506
aternal+2.349
agy+2.297
 Rabbit+2.287
Neg logit contributions
 comprehension-4.617
tesy-4.460
abeth-4.389
 stagn-4.357
ector-4.257
Token per
Token ID583
Feature activation+2.19e-01

Pos logit contributions
+1.825
INST+1.700
aternal+1.618
agy+1.602
 Rabbit+1.575
Neg logit contributions
 comprehension-1.837
tesy-1.728
 stagn-1.689
abeth-1.661
ector-1.622
Token game
Token ID983
Feature activation-1.12e-01

Pos logit contributions
 comprehension+1.243
tesy+1.119
 stagn+1.114
abeth+1.061
 unde+1.044
Neg logit contributions
-1.716
INST-1.590
agy-1.556
aternal-1.539
 Rabbit-1.494
Token last
Token ID938
Feature activation-1.42e-01

Pos logit contributions
+0.770
INST+0.718
aternal+0.686
agy+0.681
 Rabbit+0.670
Neg logit contributions
 comprehension-0.735
tesy-0.690
 stagn-0.679
abeth-0.659
ector-0.645
Token season
Token ID1622
Feature activation-1.98e-01

Pos logit contributions
+0.958
INST+0.892
aternal+0.851
agy+0.844
 Rabbit+0.830
Neg logit contributions
 comprehension-0.945
tesy-0.887
 stagn-0.871
abeth-0.850
ector-0.831
Token,
Token ID11
Feature activation+6.44e-02

Pos logit contributions
+1.231
INST+1.139
aternal+1.082
agy+1.072
 Rabbit+1.053
Neg logit contributions
 comprehension-1.424
tesy-1.344
 stagn-1.322
abeth-1.292
ector-1.266
Token while
Token ID981
Feature activation+1.40e-01

Pos logit contributions
 comprehension+0.421
tesy+0.394
 stagn+0.389
abeth+0.376
 unde+0.369
Neg logit contributions
-0.443
INST-0.415
aternal-0.395
agy-0.391
iotics-0.384
Token Alex
Token ID4422
Feature activation-1.86e-04

Pos logit contributions
 comprehension+1.032
tesy+0.970
 stagn+0.957
abeth+0.936
ector+0.918
Neg logit contributions
-0.854
INST-0.792
aternal-0.747
agy-0.743
iotics-0.728
Token her
Token ID607
Feature activation+3.04e-01

Pos logit contributions
 comprehension+0.391
tesy+0.360
 stagn+0.358
abeth+0.348
 unde+0.340
Neg logit contributions
-0.409
INST-0.380
aternal-0.362
agy-0.362
iotics-0.352
Token buttocks
Token ID50193
Feature activation+4.08e-01

Pos logit contributions
 comprehension+2.022
 stagn+1.856
tesy+1.843
abeth+1.771
 unde+1.764
Neg logit contributions
-2.095
INST-1.931
aternal-1.852
agy-1.838
iotics-1.804
Token at
Token ID379
Feature activation-9.51e-02

Pos logit contributions
 comprehension+2.633
 stagn+2.430
tesy+2.426
abeth+2.327
 unde+2.313
Neg logit contributions
-2.799
INST-2.625
aternal-2.533
agy-2.488
 Rabbit-2.462
Token a
Token ID257
Feature activation-1.40e-01

Pos logit contributions
+0.675
INST+0.631
aternal+0.603
agy+0.599
 Rabbit+0.589
Neg logit contributions
 comprehension-0.601
tesy-0.562
 stagn-0.552
abeth-0.537
ector-0.525
Token rhythm
Token ID18662
Feature activation-1.92e-01

Pos logit contributions
+1.002
INST+0.937
aternal+0.896
agy+0.889
 Rabbit+0.875
Neg logit contributions
 comprehension-0.885
tesy-0.825
 stagn-0.812
abeth-0.788
ector-0.771
Token with
Token ID351
Feature activation-7.26e-01

Pos logit contributions
+1.274
INST+1.186
aternal+1.129
agy+1.120
 Rabbit+1.102
Neg logit contributions
 comprehension-1.301
tesy-1.223
 stagn-1.201
abeth-1.172
ector-1.148
Token no
Token ID645
Feature activation-2.04e-01

Pos logit contributions
+4.830
INST+4.525
aternal+4.340
 Rabbit+4.288
agy+4.258
Neg logit contributions
 comprehension-4.586
tesy-4.510
abeth-4.300
 stagn-4.272
ector-4.174
Token apparent
Token ID4156
Feature activation+5.80e-02

Pos logit contributions
+1.550
INST+1.457
aternal+1.396
agy+1.385
 Rabbit+1.365
Neg logit contributions
 comprehension-1.199
tesy-1.109
 stagn-1.090
abeth-1.055
ector-1.029
Token connection
Token ID4637
Feature activation+1.92e-01

Pos logit contributions
 comprehension+0.366
 stagn+0.332
tesy+0.330
abeth+0.313
 unde+0.312
Neg logit contributions
-0.423
INST-0.398
aternal-0.380
agy-0.375
iotics-0.369
Token to
Token ID284
Feature activation-2.41e-01

Pos logit contributions
 comprehension+1.081
tesy+0.981
 stagn+0.973
abeth+0.930
 unde+0.916
Neg logit contributions
-1.497
INST-1.416
aternal-1.351
agy-1.340
 Rabbit-1.323
Token the
Token ID262
Feature activation-3.38e-01

Pos logit contributions
+1.654
INST+1.543
aternal+1.474
agy+1.462
 Rabbit+1.441
Neg logit contributions
 comprehension-1.577
tesy-1.486
 stagn-1.454
abeth-1.421
ector-1.389
Token
Token ID198
Feature activation+1.50e-01

Pos logit contributions
 comprehension+0.552
tesy+0.520
 stagn+0.514
abeth+0.502
ector+0.495
Neg logit contributions
-0.400
INST-0.359
aternal-0.344
agy-0.338
iotics-0.335
TokenHad
Token ID25383
Feature activation-1.37e-01

Pos logit contributions
 comprehension+0.972
tesy+0.918
 stagn+0.892
abeth+0.879
ector+0.861
Neg logit contributions
-1.027
INST-0.943
aternal-0.917
agy-0.911
iotics-0.898
Tokenros
Token ID4951
Feature activation+1.34e-01

Pos logit contributions
+0.838
INST+0.775
aternal+0.733
agy+0.725
 Rabbit+0.714
Neg logit contributions
 comprehension-1.004
tesy-0.948
 stagn-0.932
abeth-0.913
ector-0.896
Tokenaur
Token ID2899
Feature activation+3.60e-01

Pos logit contributions
 comprehension+0.906
tesy+0.850
 stagn+0.838
abeth+0.820
ector+0.808
Neg logit contributions
-0.899
INST-0.829
aternal-0.784
agy-0.775
 Rabbit-0.769
Token #
Token ID1303
Feature activation+2.45e-01

Pos logit contributions
 comprehension+2.130
tesy+1.947
 stagn+1.945
abeth+1.864
ector+1.851
Neg logit contributions
-2.707
INST-2.513
aternal-2.406
agy-2.376
iotics-2.360
Token2
Token ID17
Feature activation-7.51e-01

Pos logit contributions
 comprehension+1.907
tesy+1.814
 stagn+1.778
abeth+1.748
ector+1.728
Neg logit contributions
-1.345
INST-1.224
aternal-1.160
agy-1.149
iotics-1.126
Token
Token ID198
Feature activation+1.23e-02

Pos logit contributions
+3.815
INST+3.497
aternal+3.278
 Rabbit+3.269
agy+3.235
Neg logit contributions
 comprehension-6.073
tesy-5.780
 stagn-5.679
abeth-5.586
 unde-5.408
Token
Token ID198
Feature activation+8.67e-02

Pos logit contributions
 comprehension+0.097
tesy+0.092
 stagn+0.091
abeth+0.089
ector+0.087
Neg logit contributions
-0.068
INST-0.061
aternal-0.059
agy-0.058
iotics-0.057
TokenHad
Token ID25383
Feature activation-1.53e-01

Pos logit contributions
 comprehension+0.548
tesy+0.516
 stagn+0.502
abeth+0.493
ector+0.483
Neg logit contributions
-0.610
INST-0.563
aternal-0.546
agy-0.542
iotics-0.533
Tokenros
Token ID4951
Feature activation+9.14e-02

Pos logit contributions
+0.926
INST+0.855
aternal+0.810
agy+0.803
 Rabbit+0.788
Neg logit contributions
 comprehension-1.124
tesy-1.062
 stagn-1.045
abeth-1.022
ector-1.002
Tokenaur
Token ID2899
Feature activation+3.51e-01

Pos logit contributions
 comprehension+0.624
tesy+0.586
 stagn+0.578
abeth+0.565
ector+0.556
Neg logit contributions
-0.605
INST-0.560
aternal-0.530
agy-0.524
 Rabbit-0.519
Token percentage
Token ID5873
Feature activation-5.89e-01

Pos logit contributions
+2.159
INST+2.021
aternal+1.934
agy+1.903
 Rabbit+1.897
Neg logit contributions
 comprehension-2.008
tesy-1.905
 stagn-1.848
abeth-1.823
ector-1.781
Token of
Token ID286
Feature activation-5.71e-01

Pos logit contributions
+3.894
INST+3.661
aternal+3.449
+3.411
agy+3.396
Neg logit contributions
 comprehension-3.810
tesy-3.648
abeth-3.570
 stagn-3.531
ector-3.440
Token afford
Token ID5368
Feature activation-3.35e-01

Pos logit contributions
+3.798
INST+3.549
aternal+3.396
 Rabbit+3.334
agy+3.331
Neg logit contributions
 comprehension-3.762
tesy-3.572
 stagn-3.437
abeth-3.424
ector-3.359
Tokenably
Token ID1346
Feature activation-3.26e-01

Pos logit contributions
+1.872
INST+1.715
aternal+1.633
agy+1.616
 Rabbit+1.567
Neg logit contributions
 comprehension-2.601
tesy-2.465
 stagn-2.423
abeth-2.387
ector-2.341
Token priced
Token ID19744
Feature activation-6.82e-01

Pos logit contributions
+2.518
INST+2.366
aternal+2.272
agy+2.257
 Rabbit+2.225
Neg logit contributions
 comprehension-1.859
tesy-1.726
 stagn-1.692
abeth-1.640
ector-1.597
Token units
Token ID4991
Feature activation-7.93e-01

Pos logit contributions
+4.531
INST+4.193
aternal+4.032
 Rabbit+3.979
agy+3.927
Neg logit contributions
 comprehension-4.479
tesy-4.273
abeth-4.129
 stagn-4.076
ector-4.014
Token whenever
Token ID8797
Feature activation-3.12e-01

Pos logit contributions
+4.735
INST+4.413
aternal+4.237
agy+4.185
+4.126
Neg logit contributions
 comprehension-5.624
tesy-5.336
abeth-5.109
 stagn-5.057
ector-5.001
Token the
Token ID262
Feature activation-5.00e-01

Pos logit contributions
+2.200
INST+2.057
aternal+1.965
agy+1.947
 Rabbit+1.922
Neg logit contributions
 comprehension-1.978
tesy-1.857
 stagn-1.813
abeth-1.775
ector-1.736
Token zoning
Token ID31784
Feature activation-5.29e-01

Pos logit contributions
+3.376
INST+3.153
aternal+2.998
agy+2.970
 Rabbit+2.960
Neg logit contributions
 comprehension-3.270
tesy-3.110
 stagn-3.002
abeth-2.967
ector-2.912
Token designation
Token ID22566
Feature activation-1.43e-01

Pos logit contributions
+3.146
INST+2.928
agy+2.773
aternal+2.764
 Rabbit+2.722
Neg logit contributions
 comprehension-3.779
tesy-3.682
abeth-3.538
 stagn-3.506
ector-3.449
Token is
Token ID318
Feature activation-4.64e-01

Pos logit contributions
+0.926
INST+0.860
aternal+0.816
agy+0.809
 Rabbit+0.793
Neg logit contributions
 comprehension-1.000
tesy-0.935
 stagn-0.926
abeth-0.900
ector-0.882
Token share
Token ID2648
Feature activation-4.93e-01

Pos logit contributions
 comprehension+1.806
 stagn+1.647
tesy+1.626
 unde+1.556
ector+1.537
Neg logit contributions
-2.374
INST-2.208
agy-2.117
aternal-2.106
 Rabbit-2.067
Token the
Token ID262
Feature activation-4.26e-01

Pos logit contributions
+3.281
INST+3.078
aternal+2.931
agy+2.902
 Rabbit+2.891
Neg logit contributions
 comprehension-3.222
tesy-3.093
 stagn-3.001
abeth-2.960
ector-2.864
Token vision
Token ID5761
Feature activation-3.68e-01

Pos logit contributions
+3.198
INST+3.033
aternal+2.908
agy+2.878
 Rabbit+2.864
Neg logit contributions
 comprehension-2.407
tesy-2.308
 stagn-2.215
abeth-2.182
ector-2.123
Token or
Token ID393
Feature activation+1.41e-01

Pos logit contributions
+2.458
INST+2.281
aternal+2.164
agy+2.156
 Rabbit+2.120
Neg logit contributions
 comprehension-2.462
tesy-2.334
 stagn-2.278
abeth-2.230
ector-2.178
Token agree
Token ID4236
Feature activation-1.81e-01

Pos logit contributions
 comprehension+0.840
tesy+0.759
 stagn+0.752
abeth+0.723
 unde+0.711
Neg logit contributions
-1.077
INST-1.007
aternal-0.967
agy-0.961
 Rabbit-0.944
Token with
Token ID351
Feature activation-7.37e-01

Pos logit contributions
+1.175
INST+1.091
aternal+1.038
agy+1.030
 Rabbit+1.012
Neg logit contributions
 comprehension-1.261
tesy-1.186
 stagn-1.165
abeth-1.138
ector-1.115
Token the
Token ID262
Feature activation-1.63e-01

Pos logit contributions
+4.696
INST+4.460
aternal+4.199
agy+4.138
 Rabbit+4.136
Neg logit contributions
 comprehension-4.829
tesy-4.739
 stagn-4.509
abeth-4.504
ector-4.333
Token underlying
Token ID10238
Feature activation-9.77e-02

Pos logit contributions
+1.192
INST+1.117
aternal+1.069
agy+1.061
 Rabbit+1.046
Neg logit contributions
 comprehension-0.998
tesy-0.933
 stagn-0.913
abeth-0.890
ector-0.868
Token principles
Token ID7811
Feature activation-3.43e-01

Pos logit contributions
+0.738
INST+0.691
aternal+0.662
agy+0.659
 Rabbit+0.648
Neg logit contributions
 comprehension-0.579
tesy-0.534
 stagn-0.526
abeth-0.509
ector-0.498
Token of
Token ID286
Feature activation-2.55e-01

Pos logit contributions
+2.389
INST+2.238
aternal+2.131
agy+2.115
 Rabbit+2.085
Neg logit contributions
 comprehension-2.177
tesy-2.055
 stagn-2.003
abeth-1.962
ector-1.913
Token our
Token ID674
Feature activation-2.83e-01

Pos logit contributions
+1.797
INST+1.680
aternal+1.605
agy+1.592
 Rabbit+1.569
Neg logit contributions
 comprehension-1.616
tesy-1.514
 stagn-1.484
abeth-1.447
ector-1.414
Token in
Token ID287
Feature activation-1.56e-01

Pos logit contributions
 comprehension+1.726
 stagn+1.594
tesy+1.564
 unde+1.528
abeth+1.506
Neg logit contributions
-2.041
INST-1.852
aternal-1.785
agy-1.784
iotics-1.763
Token front
Token ID2166
Feature activation+4.61e-02

Pos logit contributions
+1.314
INST+1.242
aternal+1.196
agy+1.189
 Rabbit+1.174
Neg logit contributions
 comprehension-0.780
tesy-0.717
 stagn-0.699
abeth-0.676
ector-0.656
Token of
Token ID286
Feature activation+3.82e-01

Pos logit contributions
 comprehension+0.273
tesy+0.253
 stagn+0.249
abeth+0.241
ector+0.236
Neg logit contributions
-0.347
INST-0.325
aternal-0.312
agy-0.309
iotics-0.303
Token buildings
Token ID6832
Feature activation+4.23e-01

Pos logit contributions
 comprehension+2.745
 stagn+2.586
tesy+2.505
 unde+2.488
abeth+2.462
Neg logit contributions
-2.426
INST-2.179
aternal-2.090
agy-2.078
iotics-2.009
Token like
Token ID588
Feature activation-2.00e-02

Pos logit contributions
 comprehension+2.515
 stagn+2.288
tesy+2.264
abeth+2.173
 unde+2.156
Neg logit contributions
-3.230
INST-2.979
aternal-2.892
agy-2.835
iotics-2.783
Token Sur
Token ID4198
Feature activation-7.20e-01

Pos logit contributions
+0.147
INST+0.137
aternal+0.132
agy+0.131
 Rabbit+0.129
Neg logit contributions
 comprehension-0.121
tesy-0.114
 stagn-0.111
abeth-0.109
ector-0.106
Tokenrog
Token ID3828
Feature activation+9.39e-01

Pos logit contributions
+2.698
INST+2.486
agy+2.410
+2.306
aternal+2.292
Neg logit contributions
 comprehension-6.714
 stagn-6.282
tesy-6.202
abeth-6.124
 unde-6.028
Tokenate
Token ID378
Feature activation+1.39e-02

Pos logit contributions
 comprehension+3.629
tesy+3.303
abeth+3.287
ector+3.254
 stagn+3.135
Neg logit contributions
-8.941
INST-8.048
 Cosponsors-7.761
 Rabbit-7.564
aternal-7.514
Token's
Token ID338
Feature activation+7.49e-01

Pos logit contributions
 comprehension+0.094
tesy+0.088
 stagn+0.087
abeth+0.085
ector+0.083
Neg logit contributions
-0.093
INST-0.086
aternal-0.082
agy-0.081
iotics-0.080
Token Court
Token ID3078
Feature activation+5.46e-02

Pos logit contributions
 comprehension+5.170
tesy+4.643
 stagn+4.617
abeth+4.609
 unde+4.586
Neg logit contributions
-4.803
INST-4.222
agy-4.185
iotics-4.160
aternal-4.093
Token.
Token ID13
Feature activation-1.27e-01

Pos logit contributions
 comprehension+0.363
tesy+0.339
 stagn+0.333
abeth+0.326
ector+0.319
Neg logit contributions
-0.371
INST-0.342
aternal-0.329
agy-0.324
iotics-0.319