TOP ACTIVATIONS
MAX = 19.159

. Number 1 I think for quite a while will always
's financial counselling service or call K ild on an for
It simply does not make any sense . We must now
seriously awesome ! For anyone who has ever taken
synthes ized words . <|endoftext|> For rapper Off set , of
think some pieces of some rubber can fly and build up
innings that gave me the most satisfaction were scoring those two
his whereabouts is urged to call NYPD Crime St oppers at
the sunk en fireplace . After that , he turned his
Good updates ? No more bad news pl s
Americans FX Out standing Achievement in Comedy
seems to be on the front page . The Republicans should
and creates corresponding templates . This templates will contain the instructions
actions . " For all the pomp and ceremony
more than ever on the front lines of major global and
Ryan said : No , look , it
/ 2016 / 05 / 5 - ways - law -
Policy here . Sign up for Take Action Now and we
and the do .. end syntax e . g .:
Real ist Reader [ Email him ] In looking at the

MIN ACTIVATIONS
MIN = -20.839

stripping back the shield a few inches and making a couple
, let s take a moment to discuss some
itized limited evidence in an unt ime ly fashion and call
t . Now move back one metre , and repeat the
a couple of turns about an 1 - 2 inches in
by stripping back the shield a few inches and making a
Brian Albert have taken on the task of ranking every Halo
like utility will be referenced as " snap " ( with
gateway was shoved to an unw el coming corner , which
ugg s drives him back a couple yards , but Bo
ish and G hat fan def ected , was admitted into
s drives him back a couple yards , but Bo z
5 { ( 4 s )} 30 . K f 2
turns about an 1 - 2 inches in diameter . Sold
duration of the game chasing one another up and down the
the natural flora and an unt rained eye will never be
( commit ) Levi Jones LB 6 ' 3 / 215
proposes will encourage universities to inf late prices and offer sub
FAULT 5 second delay before reb ooting 3 .
B iz arrow Green Arrow was the perfect safety

INTERVAL 9.159 - 19.159
CONTAINS 1.011%

Perhaps with some apolog etic intent , the late English scholar
. In this sense , for Nietzsche , the good of
ark owski : I would say if you look across this
( May 2013 ), S out ron Global ( March 2013
least one thing right . By chance , a Babylon ian

INTERVAL -0.840 - 9.159
CONTAINS 58.416%

but two different concepts from the arg uable king of curse
T oward the end of the clip ,
out app data _ full v ) { // Time component
it 's probably just a bad cold , but a doctor
compromise with the reigning Republicans , while Evans and others have

INTERVAL -10.840 - -0.840
CONTAINS 40.296%

the next generation of Ron D enn ises and Frank Williams
ars have since denied that they purposefully chose Hut chens .
BT lifestyle for children as young as seven years old ,"
But he s a sod om ite
you prefer to sail yourself , the cruising grounds of the

INTERVAL -20.839 - -10.840
CONTAINS 0.277%

court has ruled if the seizure can go ahead . Some
The Bh ag av ad G ita - and
ONG K ONG ( Reuters ) - One needs to look
g asse , which was several floors up , so he
Asked about sexual harassment in increasingly gender - integ rated military
Token.
Token ID13
Feature activation-1.49e+00

Pos logit contributions
 unders+1.691
��+1.664
COMPLE+1.623
aja+1.614
 Obj+1.601
Neg logit contributions
agate-1.564
ince-1.543
theless-1.486
cius-1.478
 Difference-1.470
TokenNumber
Token ID15057
Feature activation+3.32e+00

Pos logit contributions
 unders+1.053
��+1.044
COMPLE+1.019
aja+1.006
 Obj+1.001
Neg logit contributions
agate-0.966
ince-0.956
theless-0.924
cius-0.916
 Difference-0.910
Token 1
Token ID352
Feature activation-2.79e+00

Pos logit contributions
ince+1.877
agate+1.871
 Difference+1.801
theless+1.788
cius+1.776
Neg logit contributions
��-2.575
 unders-2.570
COMPLE-2.522
aja-2.472
 Obj-2.440
Token I
Token ID314
Feature activation+1.15e+00

Pos logit contributions
 unders+1.975
��+1.937
COMPLE+1.888
aja+1.887
 Obj+1.870
Neg logit contributions
agate-1.807
ince-1.780
theless-1.711
cius-1.699
 Difference-1.678
Token think
Token ID892
Feature activation-2.71e+00

Pos logit contributions
agate+0.808
ince+0.804
theless+0.774
cius+0.770
 Difference+0.770
Neg logit contributions
��-0.755
 unders-0.749
COMPLE-0.729
aja-0.717
 Obj-0.709
Token for
Token ID329
Feature activation+1.92e+01

Pos logit contributions
 unders+1.863
��+1.823
COMPLE+1.777
aja+1.765
 Obj+1.764
Neg logit contributions
agate-1.812
ince-1.797
theless-1.739
cius-1.726
 Difference-1.708
Token quite
Token ID2407
Feature activation-3.81e+00

Pos logit contributions
theless+6.992
 Difference+6.534
ments+6.409
agate+6.249
ince+6.235
Neg logit contributions
��-15.501
��-15.083
COMPLE-15.054
 unders-14.890
 Obj-14.844
Token a
Token ID257
Feature activation-1.06e-01

Pos logit contributions
 unders+2.403
COMPLE+2.313
aja+2.305
��+2.289
 Obj+2.228
Neg logit contributions
agate-2.726
ince-2.706
cius-2.677
theless-2.677
 Difference-2.610
Token while
Token ID981
Feature activation-4.07e-01

Pos logit contributions
 unders+0.069
��+0.069
COMPLE+0.066
aja+0.066
 Obj+0.065
Neg logit contributions
agate-0.074
ince-0.074
theless-0.071
 Difference-0.071
cius-0.070
Token will
Token ID481
Feature activation+2.97e+00

Pos logit contributions
 unders+0.280
��+0.277
COMPLE+0.270
aja+0.268
 Obj+0.264
Neg logit contributions
agate-0.272
ince-0.269
theless-0.259
 Difference-0.258
cius-0.257
Token always
Token ID1464
Feature activation-3.66e+00

Pos logit contributions
ince+1.520
agate+1.515
theless+1.431
cius+1.422
 Difference+1.412
Neg logit contributions
��-2.502
 unders-2.458
COMPLE-2.412
aja-2.383
��-2.361
Token's
Token ID338
Feature activation+4.44e+00

Pos logit contributions
ince+6.053
 Difference+5.888
ments+5.858
agate+5.757
cius+5.580
Neg logit contributions
��-8.299
COMPLE-7.988
aja-7.606
 unders-7.588
 Obj-7.572
Token financial
Token ID3176
Feature activation+9.15e+00

Pos logit contributions
ince+2.892
agate+2.860
 Difference+2.814
ments+2.760
theless+2.754
Neg logit contributions
��-3.096
COMPLE-2.978
 unders-2.941
aja-2.933
 Obj-2.883
Token counselling
Token ID47586
Feature activation+1.04e+00

Pos logit contributions
ince+5.040
ments+4.971
agate+4.922
 Difference+4.851
theless+4.677
Neg logit contributions
��-7.126
COMPLE-6.772
aja-6.745
 Obj-6.653
 unders-6.566
Token service
Token ID2139
Feature activation+1.00e+01

Pos logit contributions
agate+0.660
ince+0.657
theless+0.631
 Difference+0.628
cius+0.626
Neg logit contributions
��-0.739
 unders-0.736
COMPLE-0.715
aja-0.708
 Obj-0.700
Token or
Token ID393
Feature activation+1.16e+01

Pos logit contributions
ince+5.452
agate+5.241
ments+5.194
 Difference+5.182
cius+5.165
Neg logit contributions
��-7.813
 unders-7.415
COMPLE-7.409
aja-7.236
��-7.086
Token call
Token ID869
Feature activation+1.90e+01

Pos logit contributions
agate+6.531
 Difference+6.502
ince+6.446
ments+6.376
cius+6.291
Neg logit contributions
��-8.402
COMPLE-8.002
��-7.780
aja-7.668
 unders-7.536
Token K
Token ID509
Feature activation+6.45e+00

Pos logit contributions
 Difference+8.424
ments+8.028
ince+7.907
agate+7.668
cius+7.587
Neg logit contributions
��-15.198
��-14.187
COMPLE-13.824
geoning-13.252
aja-13.194
Tokenild
Token ID688
Feature activation-2.98e+00

Pos logit contributions
ince+6.265
agate+6.216
cius+5.989
 Difference+5.917
ments+5.912
Neg logit contributions
��-2.513
 unders-2.441
 Obj-2.229
COMPLE-2.210
aja-2.098
Tokenon
Token ID261
Feature activation+7.02e-01

Pos logit contributions
 unders+2.300
aja+2.242
COMPLE+2.231
��+2.227
 Obj+2.170
Neg logit contributions
theless-1.671
agate-1.667
ince-1.650
 Difference-1.613
cius-1.598
Tokenan
Token ID272
Feature activation+8.01e+00

Pos logit contributions
agate+0.530
ince+0.526
theless+0.505
 Difference+0.504
cius+0.503
Neg logit contributions
��-0.422
 unders-0.422
COMPLE-0.405
aja-0.399
 Obj-0.396
Token for
Token ID329
Feature activation+1.07e+01

Pos logit contributions
 Difference+4.996
ince+4.932
agate+4.917
cius+4.794
ments+4.768
Neg logit contributions
��-5.685
COMPLE-5.347
aja-5.310
 unders-5.308
 Obj-5.196
Token It
Token ID632
Feature activation+3.89e+00

Pos logit contributions
agate+2.580
ince+2.566
 Difference+2.538
theless+2.448
ments+2.433
Neg logit contributions
��-3.457
 unders-3.429
COMPLE-3.290
aja-3.271
��-3.247
Token simply
Token ID2391
Feature activation+2.59e+00

Pos logit contributions
agate+2.309
ince+2.291
theless+2.227
ments+2.173
 Difference+2.160
Neg logit contributions
��-2.942
 unders-2.869
COMPLE-2.825
 Obj-2.791
��-2.765
Token does
Token ID857
Feature activation+6.14e+00

Pos logit contributions
agate+1.553
ince+1.541
theless+1.493
 Difference+1.464
ments+1.461
Neg logit contributions
��-1.954
 unders-1.914
COMPLE-1.877
aja-1.863
 Obj-1.848
Token not
Token ID407
Feature activation+8.34e+00

Pos logit contributions
ince+3.516
agate+3.512
 Difference+3.289
ments+3.280
theless+3.278
Neg logit contributions
��-4.719
 unders-4.582
aja-4.474
COMPLE-4.473
��-4.443
Token make
Token ID787
Feature activation+1.33e+01

Pos logit contributions
ince+4.854
agate+4.842
ments+4.631
 Difference+4.620
theless+4.595
Neg logit contributions
��-6.359
��-6.030
COMPLE-5.857
aja-5.804
 unders-5.785
Token any
Token ID597
Feature activation+1.88e+01

Pos logit contributions
ince+4.881
 Difference+4.867
agate+4.675
ments+4.662
theless+3.854
Neg logit contributions
��-12.392
��-11.641
aja-11.499
 unders-11.452
geoning-11.090
Token sense
Token ID2565
Feature activation+6.92e+00

Pos logit contributions
 Difference+7.685
ince+6.818
 difference+6.706
ments+6.310
agate+5.751
Neg logit contributions
��-15.786
��-15.168
aja-14.921
 unders-14.270
geoning-14.167
Token.
Token ID13
Feature activation+3.25e+00

Pos logit contributions
ince+3.798
agate+3.771
ments+3.652
 Difference+3.611
theless+3.605
Neg logit contributions
��-5.374
 unders-5.224
COMPLE-5.093
aja-5.070
��-4.975
Token We
Token ID775
Feature activation+1.32e+00

Pos logit contributions
agate+1.914
ince+1.904
 Difference+1.869
theless+1.819
cius+1.799
Neg logit contributions
 unders-2.465
��-2.463
COMPLE-2.354
aja-2.346
��-2.305
Token must
Token ID1276
Feature activation+6.54e+00

Pos logit contributions
agate+0.841
ince+0.835
theless+0.806
cius+0.797
 Difference+0.796
Neg logit contributions
��-0.937
 unders-0.933
COMPLE-0.907
aja-0.895
 Obj-0.887
Token now
Token ID783
Feature activation+5.47e+00

Pos logit contributions
agate+4.152
ince+4.077
theless+3.947
ments+3.902
cius+3.836
Neg logit contributions
��-4.684
 unders-4.432
COMPLE-4.422
��-4.394
aja-4.309
Token seriously
Token ID6411
Feature activation+1.19e+00

Pos logit contributions
agate+0.685
ince+0.681
theless+0.654
 Difference+0.653
cius+0.646
Neg logit contributions
��-0.749
 unders-0.746
COMPLE-0.724
aja-0.719
 Obj-0.708
Token awesome
Token ID7427
Feature activation-1.35e+00

Pos logit contributions
agate+0.785
ince+0.781
theless+0.749
 Difference+0.747
cius+0.743
Neg logit contributions
��-0.821
 unders-0.818
COMPLE-0.793
aja-0.788
 Obj-0.778
Token!
Token ID0
Feature activation-1.69e+00

Pos logit contributions
 unders+0.964
��+0.952
COMPLE+0.929
aja+0.924
 Obj+0.915
Neg logit contributions
agate-0.859
ince-0.847
theless-0.821
 Difference-0.811
cius-0.809
Token
Token ID198
Feature activation-4.88e-01

Pos logit contributions
 unders+1.217
��+1.199
COMPLE+1.179
aja+1.167
 Obj+1.158
Neg logit contributions
agate-1.074
ince-1.060
theless-1.023
cius-1.015
 Difference-1.001
Token
Token ID198
Feature activation+3.68e-01

Pos logit contributions
 unders+0.348
��+0.345
COMPLE+0.337
aja+0.333
 Obj+0.329
Neg logit contributions
agate-0.312
ince-0.309
theless-0.299
cius-0.296
 Difference-0.296
TokenFor
Token ID1890
Feature activation+1.87e+01

Pos logit contributions
agate+0.236
ince+0.234
theless+0.224
 Difference+0.223
cius+0.223
Neg logit contributions
 unders-0.263
��-0.261
COMPLE-0.252
aja-0.251
 Obj-0.249
Token anyone
Token ID2687
Feature activation-2.40e-01

Pos logit contributions
ince+9.739
agate+9.252
ments+9.214
 Difference+9.213
got+9.015
Neg logit contributions
��-12.369
COMPLE-11.604
 unders-11.595
 Obj-11.413
aja-11.356
Token who
Token ID508
Feature activation-2.42e-01

Pos logit contributions
 unders+0.159
��+0.157
COMPLE+0.153
aja+0.151
 Obj+0.150
Neg logit contributions
agate-0.166
ince-0.164
theless-0.159
 Difference-0.158
cius-0.158
Token has
Token ID468
Feature activation+5.02e+00

Pos logit contributions
 unders+0.168
��+0.166
COMPLE+0.161
aja+0.160
 Obj+0.159
Neg logit contributions
agate-0.160
ince-0.159
theless-0.153
 Difference-0.152
cius-0.152
Token ever
Token ID1683
Feature activation+7.09e+00

Pos logit contributions
ince+3.330
agate+3.328
ments+3.199
theless+3.179
 Difference+3.156
Neg logit contributions
��-3.436
COMPLE-3.277
 unders-3.258
aja-3.218
 Obj-3.144
Token taken
Token ID2077
Feature activation-4.36e+00

Pos logit contributions
ince+4.289
agate+4.223
theless+4.136
ments+4.076
 Difference+3.956
Neg logit contributions
��-5.264
COMPLE-4.999
 unders-4.864
aja-4.859
��-4.847
Token synthes
Token ID24983
Feature activation-2.30e+00

Pos logit contributions
 unders+4.413
 Obj+4.194
��+4.148
COMPLE+4.140
aja+4.060
Neg logit contributions
agate-4.252
ince-4.109
theless-3.988
cius-3.975
 Difference-3.882
Tokenized
Token ID1143
Feature activation-6.17e+00

Pos logit contributions
 unders+2.212
��+2.172
COMPLE+2.139
aja+2.138
 Obj+2.116
Neg logit contributions
agate-0.896
ince-0.861
theless-0.831
 Difference-0.820
cius-0.816
Token words
Token ID2456
Feature activation+3.13e+00

Pos logit contributions
 unders+4.207
��+3.988
 Obj+3.986
COMPLE+3.914
aja+3.853
Neg logit contributions
agate-4.247
ince-4.057
cius-4.002
theless-3.977
 Difference-3.838
Token.
Token ID13
Feature activation-3.47e+00

Pos logit contributions
ince+1.855
agate+1.840
 Difference+1.741
theless+1.737
ments+1.735
Neg logit contributions
��-2.377
 unders-2.329
COMPLE-2.262
aja-2.254
 Obj-2.212
Token<|endoftext|>
Token ID50256
Feature activation+6.65e-02

Pos logit contributions
 unders+2.316
��+2.280
COMPLE+2.279
 Obj+2.256
aja+2.245
Neg logit contributions
agate-2.347
ince-2.306
theless-2.250
cius-2.230
ments-2.155
TokenFor
Token ID1890
Feature activation+1.84e+01

Pos logit contributions
agate+0.043
ince+0.043
theless+0.041
 Difference+0.041
cius+0.041
Neg logit contributions
 unders-0.047
��-0.047
COMPLE-0.045
aja-0.045
 Obj-0.045
Token rapper
Token ID25670
Feature activation-3.67e+00

Pos logit contributions
ince+10.573
agate+10.200
 Difference+10.153
ments+10.091
cius+9.714
Neg logit contributions
��-11.144
COMPLE-10.541
 unders-10.521
 Obj-10.335
aja-10.301
Token Off
Token ID3242
Feature activation+1.05e+00

Pos logit contributions
 unders+2.437
��+2.370
COMPLE+2.335
aja+2.315
 Obj+2.305
Neg logit contributions
agate-2.550
ince-2.482
theless-2.434
cius-2.420
 Difference-2.360
Tokenset
Token ID2617
Feature activation+4.97e+00

Pos logit contributions
agate+0.764
ince+0.756
 Difference+0.729
cius+0.724
theless+0.723
Neg logit contributions
��-0.656
 unders-0.653
COMPLE-0.630
aja-0.619
 Obj-0.615
Token,
Token ID11
Feature activation-6.35e+00

Pos logit contributions
agate+3.099
ince+3.097
 Difference+2.970
cius+2.959
ments+2.919
Neg logit contributions
��-3.575
 unders-3.523
COMPLE-3.450
 Obj-3.351
aja-3.349
Token of
Token ID286
Feature activation-6.41e+00

Pos logit contributions
 unders+4.248
��+4.033
 Obj+3.967
aja+3.959
COMPLE+3.943
Neg logit contributions
agate-4.349
ince-4.230
theless-4.170
cius-4.136
 Difference-3.976
Token think
Token ID892
Feature activation-4.59e-01

Pos logit contributions
ince+3.092
agate+3.081
theless+2.954
ments+2.926
cius+2.925
Neg logit contributions
��-3.577
COMPLE-3.427
 unders-3.381
��-3.346
aja-3.273
Token some
Token ID617
Feature activation+3.65e+00

Pos logit contributions
 unders+0.318
��+0.314
COMPLE+0.305
aja+0.303
 Obj+0.299
Neg logit contributions
agate-0.304
ince-0.302
theless-0.290
cius-0.289
 Difference-0.288
Token pieces
Token ID5207
Feature activation+2.75e+00

Pos logit contributions
ince+2.384
agate+2.380
theless+2.314
 Difference+2.303
cius+2.262
Neg logit contributions
��-2.528
COMPLE-2.443
 unders-2.436
aja-2.360
 Obj-2.354
Token of
Token ID286
Feature activation+4.50e+00

Pos logit contributions
agate+1.729
ince+1.724
theless+1.661
 Difference+1.639
cius+1.634
Neg logit contributions
��-2.003
 unders-1.955
COMPLE-1.938
aja-1.888
 Obj-1.877
Token some
Token ID617
Feature activation+5.73e+00

Pos logit contributions
agate+3.053
ince+3.040
theless+2.943
 Difference+2.933
ments+2.881
Neg logit contributions
��-2.998
COMPLE-2.903
 unders-2.858
aja-2.801
��-2.778
Token rubber
Token ID14239
Feature activation+1.83e+01

Pos logit contributions
ince+3.519
agate+3.509
theless+3.432
 Difference+3.405
ments+3.347
Neg logit contributions
��-4.148
COMPLE-3.996
 unders-3.913
��-3.875
aja-3.839
Token can
Token ID460
Feature activation+2.40e+00

Pos logit contributions
theless+9.501
ince+9.492
ments+9.340
agate+9.301
 Difference+8.991
Neg logit contributions
��-13.339
COMPLE-12.238
��-12.222
ゴン-11.623
 Obj-11.228
Token fly
Token ID6129
Feature activation+4.09e+00

Pos logit contributions
agate+1.506
ince+1.493
theless+1.425
 Difference+1.421
ments+1.404
Neg logit contributions
��-1.749
 unders-1.710
COMPLE-1.693
aja-1.650
 Obj-1.645
Token and
Token ID290
Feature activation+4.00e-01

Pos logit contributions
agate+2.654
ince+2.607
theless+2.533
 Difference+2.518
ments+2.510
Neg logit contributions
��-2.895
COMPLE-2.787
 unders-2.766
aja-2.715
 Obj-2.698
Token build
Token ID1382
Feature activation+2.75e+00

Pos logit contributions
agate+0.264
ince+0.262
theless+0.252
 Difference+0.251
cius+0.250
Neg logit contributions
 unders-0.277
��-0.277
COMPLE-0.268
aja-0.266
 Obj-0.263
Token up
Token ID510
Feature activation+2.43e+00

Pos logit contributions
agate+1.689
ince+1.683
theless+1.624
 Difference+1.618
ments+1.599
Neg logit contributions
��-2.026
 unders-1.973
COMPLE-1.950
 Obj-1.909
aja-1.909
Token innings
Token ID16267
Feature activation+3.05e+00

Pos logit contributions
ince+4.185
 Difference+4.098
theless+4.086
ments+4.075
agate+4.053
Neg logit contributions
��-5.138
COMPLE-4.815
 Obj-4.779
aja-4.771
 unders-4.765
Token that
Token ID326
Feature activation+4.97e+00

Pos logit contributions
agate+1.849
ince+1.838
 Difference+1.782
theless+1.757
cius+1.751
Neg logit contributions
��-2.290
 unders-2.232
COMPLE-2.202
aja-2.159
 Obj-2.140
Token gave
Token ID2921
Feature activation+5.70e+00

Pos logit contributions
agate+2.668
ince+2.651
 Difference+2.584
theless+2.561
ments+2.518
Neg logit contributions
��-4.046
 unders-3.877
COMPLE-3.870
aja-3.780
��-3.760
Token me
Token ID502
Feature activation+1.09e+01

Pos logit contributions
ince+3.046
agate+3.022
 Difference+2.971
theless+2.914
ments+2.888
Neg logit contributions
��-4.660
 unders-4.453
COMPLE-4.413
aja-4.367
��-4.352
Token the
Token ID262
Feature activation+8.23e+00

Pos logit contributions
 Difference+5.910
ince+5.825
ments+5.621
theless+5.580
agate+5.573
Neg logit contributions
��-8.719
��-8.015
aja-7.884
COMPLE-7.821
 Obj-7.788
Token most
Token ID749
Feature activation+1.82e+01

Pos logit contributions
 Difference+5.041
ince+4.917
agate+4.759
ments+4.724
theless+4.708
Neg logit contributions
��-6.045
��-5.581
aja-5.545
COMPLE-5.505
 Obj-5.472
Token satisfaction
Token ID14676
Feature activation+1.82e+00

Pos logit contributions
 Difference+10.121
ince+9.681
ments+9.584
theless+9.294
het+9.183
Neg logit contributions
��-12.756
��-12.021
aja-11.199
 Obj-11.116
geoning-10.925
Token were
Token ID547
Feature activation+6.18e+00

Pos logit contributions
agate+1.091
ince+1.087
theless+1.043
 Difference+1.037
cius+1.031
Neg logit contributions
��-1.380
 unders-1.368
COMPLE-1.327
aja-1.317
 Obj-1.303
Token scoring
Token ID9689
Feature activation+8.58e+00

Pos logit contributions
ince+3.636
agate+3.632
 Difference+3.565
theless+3.537
ments+3.504
Neg logit contributions
��-4.674
 unders-4.440
COMPLE-4.414
aja-4.348
��-4.345
Token those
Token ID883
Feature activation+7.60e+00

Pos logit contributions
ince+4.834
 Difference+4.779
agate+4.776
ments+4.717
theless+4.706
Neg logit contributions
��-6.725
COMPLE-6.229
��-6.199
aja-6.148
 unders-6.111
Token two
Token ID734
Feature activation+9.82e+00

Pos logit contributions
ince+4.189
ments+4.160
theless+4.155
 Difference+4.135
agate+4.121
Neg logit contributions
��-5.989
COMPLE-5.559
��-5.526
aja-5.519
 Obj-5.479
Token his
Token ID465
Feature activation+1.57e+00

Pos logit contributions
 unders+2.575
��+2.460
COMPLE+2.393
aja+2.377
 Obj+2.347
Neg logit contributions
agate-3.424
ince-3.349
cius-3.285
theless-3.281
 Difference-3.220
Token whereabouts
Token ID33035
Feature activation-3.46e+00

Pos logit contributions
agate+0.783
ince+0.782
theless+0.738
 Difference+0.734
cius+0.725
Neg logit contributions
��-1.341
 unders-1.334
COMPLE-1.305
aja-1.296
 Obj-1.283
Token is
Token ID318
Feature activation+2.93e+00

Pos logit contributions
 unders+2.259
��+2.132
aja+2.117
COMPLE+2.112
 Obj+2.091
Neg logit contributions
agate-2.444
ince-2.400
theless-2.324
cius-2.319
 Difference-2.317
Token urged
Token ID11643
Feature activation+4.47e+00

Pos logit contributions
ince+1.915
agate+1.857
cius+1.832
ments+1.806
theless+1.801
Neg logit contributions
��-2.064
 unders-1.963
COMPLE-1.949
 Obj-1.934
aja-1.921
Token to
Token ID284
Feature activation+8.76e+00

Pos logit contributions
agate+2.703
ince+2.697
ments+2.543
cius+2.535
theless+2.500
Neg logit contributions
��-3.343
 unders-3.245
COMPLE-3.173
aja-3.163
 Obj-3.122
Token call
Token ID869
Feature activation+1.81e+01

Pos logit contributions
agate+4.864
ince+4.855
cius+4.518
ments+4.490
 Difference+4.478
Neg logit contributions
��-6.960
��-6.452
COMPLE-6.441
 unders-6.383
aja-6.312
Token NYPD
Token ID27615
Feature activation+9.09e+00

Pos logit contributions
ince+9.883
agate+9.312
 Difference+9.261
cius+9.110
theless+8.786
Neg logit contributions
��-12.242
��-11.500
COMPLE-11.141
aja-10.598
 unders-10.587
Token Crime
Token ID10003
Feature activation+2.36e+00

Pos logit contributions
agate+5.577
ince+5.507
 Difference+5.278
cius+5.226
theless+5.088
Neg logit contributions
��-6.552
COMPLE-6.255
 unders-6.114
��-6.016
aja-5.912
Token St
Token ID520
Feature activation-1.97e+00

Pos logit contributions
agate+1.453
ince+1.453
 Difference+1.393
cius+1.375
theless+1.362
Neg logit contributions
��-1.741
 unders-1.723
COMPLE-1.684
aja-1.648
��-1.600
Tokenoppers
Token ID37186
Feature activation+1.47e+01

Pos logit contributions
 unders+1.148
��+1.130
aja+1.094
COMPLE+1.093
 Obj+1.072
Neg logit contributions
agate-1.509
ince-1.499
theless-1.463
 Difference-1.449
cius-1.443
Token at
Token ID379
Feature activation+1.04e+01

Pos logit contributions
ince+7.103
 Difference+6.801
cius+6.754
agate+6.692
ments+6.334
Neg logit contributions
��-11.986
��-10.753
 unders-10.699
COMPLE-10.689
aja-10.417
Token the
Token ID262
Feature activation+9.43e-01

Pos logit contributions
 unders+1.807
��+1.756
aja+1.710
COMPLE+1.704
 Obj+1.694
Neg logit contributions
agate-1.759
ince-1.744
cius-1.686
theless-1.672
 Difference-1.663
Token sunk
Token ID24790
Feature activation-6.49e+00

Pos logit contributions
agate+0.643
ince+0.639
theless+0.617
 Difference+0.612
cius+0.611
Neg logit contributions
��-0.631
 unders-0.627
COMPLE-0.610
aja-0.604
 Obj-0.597
Tokenen
Token ID268
Feature activation-2.69e+00

Pos logit contributions
 unders+3.757
aja+3.546
��+3.473
 Obj+3.397
COMPLE+3.387
Neg logit contributions
agate-4.899
ince-4.824
theless-4.809
cius-4.788
 Difference-4.729
Token fireplace
Token ID44250
Feature activation+3.41e+00

Pos logit contributions
 unders+1.539
��+1.465
COMPLE+1.414
aja+1.410
 Obj+1.389
Neg logit contributions
agate-2.124
ince-2.101
theless-2.030
 Difference-2.028
cius-2.019
Token.
Token ID13
Feature activation+6.90e+00

Pos logit contributions
agate+2.027
ince+2.018
theless+1.931
 Difference+1.916
ments+1.914
Neg logit contributions
��-2.584
 unders-2.512
COMPLE-2.497
aja-2.458
 Obj-2.439
Token After
Token ID2293
Feature activation+1.80e+01

Pos logit contributions
 Difference+3.834
agate+3.724
ince+3.703
theless+3.608
ments+3.561
Neg logit contributions
��-5.459
 unders-5.313
��-5.197
COMPLE-5.125
aja-5.051
Token that
Token ID326
Feature activation+8.78e+00

Pos logit contributions
theless+9.089
ince+8.915
ments+8.795
agate+8.730
 Difference+8.541
Neg logit contributions
��-12.395
��-11.732
 Obj-11.244
COMPLE-11.023
aja-10.895
Token,
Token ID11
Feature activation+6.62e+00

Pos logit contributions
ince+4.516
agate+4.509
theless+4.359
 Difference+4.310
ments+4.303
Neg logit contributions
��-6.943
COMPLE-6.680
 unders-6.612
��-6.478
aja-6.442
Token he
Token ID339
Feature activation+9.42e+00

Pos logit contributions
agate+3.276
ince+3.235
theless+3.168
 Difference+3.130
ments+3.100
Neg logit contributions
��-5.508
COMPLE-5.365
 unders-5.302
��-5.260
aja-5.182
Token turned
Token ID2900
Feature activation+3.33e+00

Pos logit contributions
theless+5.311
agate+5.251
ince+5.249
ments+5.184
cius+4.902
Neg logit contributions
��-7.182
��-6.833
COMPLE-6.779
 Obj-6.547
aja-6.472
Token his
Token ID465
Feature activation+7.98e+00

Pos logit contributions
agate+2.110
ince+2.085
theless+2.005
 Difference+1.996
ments+1.988
Neg logit contributions
��-2.388
 unders-2.325
COMPLE-2.313
aja-2.271
 Obj-2.246
Token
Token ID198
Feature activation+5.63e+00

Pos logit contributions
ince+6.426
 Difference+6.403
agate+6.189
theless+6.096
cius+6.069
Neg logit contributions
��-7.349
COMPLE-6.864
 unders-6.855
��-6.848
aja-6.677
Token
Token ID198
Feature activation+1.94e+00

Pos logit contributions
ince+2.998
agate+2.965
ments+2.839
 Difference+2.817
cius+2.686
Neg logit contributions
��-4.756
��-4.520
 unders-4.503
aja-4.314
 Obj-4.290
TokenGood
Token ID10248
Feature activation+3.07e+00

Pos logit contributions
agate+1.313
ince+1.310
cius+1.251
theless+1.248
 Difference+1.234
Neg logit contributions
 unders-1.313
��-1.304
COMPLE-1.251
 Obj-1.243
aja-1.242
Token updates
Token ID5992
Feature activation+2.55e+00

Pos logit contributions
ince+2.046
agate+2.038
cius+1.959
 Difference+1.957
theless+1.950
Neg logit contributions
��-2.082
 unders-2.081
COMPLE-2.023
aja-1.988
 Obj-1.963
Token?
Token ID30
Feature activation+8.08e+00

Pos logit contributions
ince+1.615
agate+1.608
 Difference+1.544
cius+1.543
theless+1.541
Neg logit contributions
��-1.833
 unders-1.814
COMPLE-1.760
aja-1.743
 Obj-1.713
Token No
Token ID1400
Feature activation+1.79e+01

Pos logit contributions
ince+5.039
 Difference+4.990
agate+4.939
cius+4.873
ments+4.781
Neg logit contributions
��-5.646
 unders-5.490
COMPLE-5.318
aja-5.293
��-5.253
Token more
Token ID517
Feature activation+3.93e+00

Pos logit contributions
 Difference+9.613
ince+9.492
agate+8.967
ments+8.956
het+8.566
Neg logit contributions
��-11.973
COMPLE-11.178
aja-11.089
 unders-10.997
��-10.934
Token bad
Token ID2089
Feature activation+7.61e+00

Pos logit contributions
ince+2.508
agate+2.469
 Difference+2.402
theless+2.397
cius+2.365
Neg logit contributions
��-2.805
 unders-2.724
COMPLE-2.678
aja-2.658
 Obj-2.597
Token news
Token ID1705
Feature activation+5.24e+00

Pos logit contributions
ince+3.805
agate+3.691
ments+3.683
 Difference+3.595
cius+3.567
Neg logit contributions
��-6.352
 unders-6.107
COMPLE-6.004
aja-5.951
 Obj-5.938
Token pl
Token ID458
Feature activation+1.17e+00

Pos logit contributions
ince+3.197
agate+3.142
 Difference+3.099
ments+3.057
theless+3.042
Neg logit contributions
��-3.837
 unders-3.755
COMPLE-3.633
aja-3.613
��-3.553
Tokens
Token ID82
Feature activation+5.69e+00

Pos logit contributions
agate+0.633
ince+0.629
theless+0.599
 Difference+0.595
cius+0.593
Neg logit contributions
��-0.943
 unders-0.943
COMPLE-0.917
aja-0.907
 Obj-0.900
Token Americans
Token ID3399
Feature activation+5.21e+00

Pos logit contributions
agate+2.196
ince+2.163
 Difference+2.126
cius+2.082
theless+2.079
Neg logit contributions
��-2.101
 unders-2.072
COMPLE-2.013
aja-1.979
 Obj-1.945
Token FX
Token ID19534
Feature activation+7.42e+00

Pos logit contributions
ince+3.371
 Difference+3.346
agate+3.327
ments+3.198
theless+3.164
Neg logit contributions
��-3.616
 unders-3.575
aja-3.420
��-3.415
COMPLE-3.382
Token
Token ID198
Feature activation+6.34e+00

Pos logit contributions
ince+4.623
 Difference+4.615
agate+4.476
ments+4.322
theless+4.265
Neg logit contributions
��-5.243
 unders-5.208
��-4.985
aja-4.895
COMPLE-4.868
Token
Token ID198
Feature activation+1.10e+01

Pos logit contributions
ince+3.274
agate+3.233
ments+3.108
 Difference+3.067
cius+2.874
Neg logit contributions
��-5.475
��-5.242
 unders-5.153
aja-4.947
 Obj-4.913
TokenOut
Token ID7975
Feature activation+9.41e+00

Pos logit contributions
ince+6.725
agate+6.406
ments+6.210
 Difference+6.173
cius+5.944
Neg logit contributions
 unders-7.800
��-7.673
��-7.389
 Obj-7.182
 Neh-7.158
Tokenstanding
Token ID5646
Feature activation+1.78e+01

Pos logit contributions
ince+2.664
 Difference+2.555
agate+2.545
ments+2.467
cius+2.268
Neg logit contributions
��-9.693
 unders-9.637
aja-9.344
COMPLE-9.253
��-9.160
Token Achievement
Token ID45511
Feature activation+1.01e+01

Pos logit contributions
 Difference+11.022
ince+10.067
ments+9.815
agate+9.509
 Duo+9.291
Neg logit contributions
��-11.595
��-11.128
 unders-10.589
aja-10.504
geoning-10.371
Token in
Token ID287
Feature activation+8.43e+00

Pos logit contributions
ince+5.541
agate+5.422
 Difference+5.352
ments+5.167
cius+5.084
Neg logit contributions
 unders-7.666
��-7.577
aja-7.416
COMPLE-7.372
��-7.115
Token Comedy
Token ID22329
Feature activation+3.01e+00

Pos logit contributions
 Difference+5.137
ince+4.981
agate+4.891
ments+4.691
theless+4.607
Neg logit contributions
��-6.144
 unders-5.984
��-5.839
aja-5.839
COMPLE-5.696
Token
Token ID198
Feature activation+8.91e+00

Pos logit contributions
agate+1.826
ince+1.807
 Difference+1.744
theless+1.724
cius+1.701
Neg logit contributions
 unders-2.239
��-2.225
COMPLE-2.166
aja-2.150
 Obj-2.100
Token
Token ID198
Feature activation+1.05e+01

Pos logit contributions
ince+4.116
agate+4.034
ments+3.952
 Difference+3.839
het+3.502
Neg logit contributions
��-8.167
��-7.891
 unders-7.537
aja-7.209
 Obj-7.164
Token seems
Token ID2331
Feature activation+7.03e-01

Pos logit contributions
ince+3.885
agate+3.820
theless+3.770
ments+3.695
 Difference+3.663
Neg logit contributions
��-5.273
COMPLE-5.018
��-4.989
 Obj-4.896
 unders-4.891
Token to
Token ID284
Feature activation+2.98e+00

Pos logit contributions
agate+0.465
ince+0.461
theless+0.444
 Difference+0.440
cius+0.439
Neg logit contributions
��-0.485
 unders-0.484
COMPLE-0.469
aja-0.465
 Obj-0.461
Token be
Token ID307
Feature activation+4.75e+00

Pos logit contributions
agate+2.024
ince+2.021
theless+1.921
 Difference+1.914
cius+1.908
Neg logit contributions
��-2.017
 unders-1.938
COMPLE-1.935
aja-1.890
 Obj-1.872
Token on
Token ID319
Feature activation+1.28e+01

Pos logit contributions
ince+2.892
agate+2.892
theless+2.747
cius+2.732
 Difference+2.724
Neg logit contributions
��-3.492
COMPLE-3.363
 unders-3.351
aja-3.311
 Obj-3.276
Token the
Token ID262
Feature activation+1.01e+01

Pos logit contributions
ince+6.861
agate+6.502
 Difference+6.386
ments+6.248
cius+6.198
Neg logit contributions
��-9.424
COMPLE-9.133
aja-9.016
 unders-9.004
��-8.857
Token front
Token ID2166
Feature activation+1.78e+01

Pos logit contributions
ince+6.077
theless+5.762
 Difference+5.704
agate+5.690
cius+5.476
Neg logit contributions
��-7.351
COMPLE-6.992
aja-6.950
��-6.925
 unders-6.768
Token page
Token ID2443
Feature activation+5.90e+00

Pos logit contributions
ince+6.394
ments+5.790
theless+5.470
agate+5.294
 Difference+5.231
Neg logit contributions
��-15.075
COMPLE-14.654
��-14.570
aja-14.508
 Adin-14.392
Token.
Token ID13
Feature activation+3.68e+00

Pos logit contributions
ince+3.466
agate+3.392
theless+3.293
 Difference+3.263
cius+3.249
Neg logit contributions
��-4.500
 unders-4.316
COMPLE-4.264
aja-4.189
��-4.174
Token The
Token ID383
Feature activation+5.72e+00

Pos logit contributions
agate+2.228
ince+2.224
 Difference+2.190
theless+2.105
ments+2.100
Neg logit contributions
��-2.748
 unders-2.709
aja-2.594
COMPLE-2.584
��-2.547
Token Republicans
Token ID4734
Feature activation+2.46e+00

Pos logit contributions
ince+3.803
agate+3.771
 Difference+3.730
theless+3.649
ments+3.606
Neg logit contributions
��-3.811
 unders-3.659
COMPLE-3.651
aja-3.625
 Obj-3.564
Token should
Token ID815
Feature activation-4.01e-01

Pos logit contributions
agate+1.574
ince+1.565
cius+1.488
 Difference+1.485
theless+1.483
Neg logit contributions
��-1.765
 unders-1.734
COMPLE-1.698
aja-1.670
 Obj-1.656
Token and
Token ID290
Feature activation+9.97e+00

Pos logit contributions
agate+0.235
ince+0.232
theless+0.223
 Difference+0.222
cius+0.221
Neg logit contributions
 unders-0.268
��-0.267
COMPLE-0.259
aja-0.256
 Obj-0.254
Token creates
Token ID8075
Feature activation+1.06e+01

Pos logit contributions
agate+4.656
ince+4.615
ments+4.438
 Difference+4.414
theless+4.172
Neg logit contributions
��-8.474
 unders-8.083
COMPLE-8.042
aja-7.886
 Obj-7.828
Token corresponding
Token ID11188
Feature activation+1.36e+01

Pos logit contributions
agate+4.260
 Difference+3.997
ince+3.974
ments+3.947
theless+3.578
Neg logit contributions
��-10.104
COMPLE-9.237
 Obj-9.219
 unders-9.171
 Neh-9.040
Token templates
Token ID24019
Feature activation+4.36e-01

Pos logit contributions
ince+6.578
 Difference+6.396
agate+6.364
ments+6.239
cius+5.671
Neg logit contributions
��-10.926
COMPLE-10.143
 unders-9.911
aja-9.793
 Obj-9.670
Token.
Token ID13
Feature activation+3.07e+00

Pos logit contributions
agate+0.272
ince+0.269
theless+0.261
cius+0.258
 Difference+0.257
Neg logit contributions
 unders-0.318
��-0.313
COMPLE-0.307
aja-0.304
 Obj-0.301
Token This
Token ID770
Feature activation+1.76e+01

Pos logit contributions
agate+1.675
ince+1.664
 Difference+1.629
ments+1.568
theless+1.544
Neg logit contributions
��-2.504
 unders-2.466
COMPLE-2.366
aja-2.349
��-2.346
Token templates
Token ID24019
Feature activation+6.50e-01

Pos logit contributions
ments+4.854
agate+4.396
ince+4.266
 Difference+3.814
theless+3.707
Neg logit contributions
��-17.538
 Obj-16.587
ゴン-15.759
��-15.606
 Neh-15.456
Token will
Token ID481
Feature activation+6.13e+00

Pos logit contributions
agate+0.431
ince+0.427
theless+0.412
 Difference+0.409
cius+0.409
Neg logit contributions
 unders-0.449
��-0.445
COMPLE-0.432
aja-0.429
 Obj-0.424
Token contain
Token ID3994
Feature activation+6.75e+00

Pos logit contributions
agate+3.869
ince+3.805
ments+3.519
 Difference+3.481
cius+3.456
Neg logit contributions
��-4.654
 unders-4.370
aja-4.290
��-4.282
COMPLE-4.271
Token the
Token ID262
Feature activation+9.99e+00

Pos logit contributions
ince+3.557
agate+3.552
ments+3.365
 Difference+3.357
cius+3.228
Neg logit contributions
��-5.762
 unders-5.281
��-5.219
COMPLE-5.208
aja-5.130
Token instructions
Token ID7729
Feature activation+5.19e+00

Pos logit contributions
agate+4.000
ments+3.991
ince+3.959
 Difference+3.928
cius+3.546
Neg logit contributions
��-9.648
��-8.486
COMPLE-8.468
 unders-8.337
 Obj-8.331
Token actions
Token ID4028
Feature activation+3.05e+00

Pos logit contributions
ince+1.905
agate+1.883
ments+1.832
theless+1.792
 Difference+1.778
Neg logit contributions
��-2.985
 unders-2.921
COMPLE-2.893
aja-2.874
 Obj-2.813
Token.
Token ID13
Feature activation+3.71e+00

Pos logit contributions
ince+1.865
agate+1.855
theless+1.774
cius+1.764
ments+1.761
Neg logit contributions
��-2.258
 unders-2.216
COMPLE-2.168
aja-2.154
 Obj-2.120
Token
Token ID198
Feature activation-1.44e+00

Pos logit contributions
ince+2.194
agate+2.193
 Difference+2.143
theless+2.055
ments+2.054
Neg logit contributions
 unders-2.811
��-2.801
aja-2.676
COMPLE-2.668
��-2.619
Token
Token ID198
Feature activation+3.44e+00

Pos logit contributions
 unders+0.973
COMPLE+0.951
��+0.938
aja+0.932
 Obj+0.919
Neg logit contributions
agate-0.972
ince-0.956
theless-0.954
cius-0.933
 Difference-0.920
Token"
Token ID1
Feature activation+3.24e+00

Pos logit contributions
ince+2.078
agate+2.068
ments+1.938
 Difference+1.927
cius+1.926
Neg logit contributions
 unders-2.604
��-2.571
aja-2.458
 Obj-2.456
��-2.416
TokenFor
Token ID1890
Feature activation+1.74e+01

Pos logit contributions
ince+1.905
agate+1.898
 Difference+1.800
ments+1.770
theless+1.766
Neg logit contributions
 unders-2.480
��-2.477
aja-2.367
 Obj-2.334
COMPLE-2.328
Token all
Token ID477
Feature activation+1.20e+01

Pos logit contributions
ince+9.228
ments+8.814
agate+8.772
 Difference+8.671
theless+8.444
Neg logit contributions
��-11.642
 Obj-11.170
 unders-11.117
aja-10.980
COMPLE-10.962
Token the
Token ID262
Feature activation+5.98e+00

Pos logit contributions
ince+6.287
agate+5.990
ments+5.895
theless+5.882
 Difference+5.686
Neg logit contributions
��-9.322
aja-8.703
COMPLE-8.597
 unders-8.558
��-8.406
Token pomp
Token ID49435
Feature activation+8.39e-01

Pos logit contributions
ince+3.842
agate+3.759
 Difference+3.721
theless+3.664
ments+3.652
Neg logit contributions
��-4.187
 unders-3.982
aja-3.953
COMPLE-3.929
 Obj-3.854
Token and
Token ID290
Feature activation+9.07e+00

Pos logit contributions
agate+0.518
ince+0.514
theless+0.490
cius+0.489
 Difference+0.488
Neg logit contributions
 unders-0.618
��-0.614
COMPLE-0.598
aja-0.592
 Obj-0.586
Token ceremony
Token ID10542
Feature activation+5.56e+00

Pos logit contributions
agate+5.878
ince+5.802
 Difference+5.541
cius+5.536
ments+5.496
Neg logit contributions
��-5.877
aja-5.729
 unders-5.701
COMPLE-5.681
 Obj-5.657
Token more
Token ID517
Feature activation+2.19e+00

Pos logit contributions
ince+2.440
agate+2.423
theless+2.316
cius+2.314
ments+2.294
Neg logit contributions
��-2.639
COMPLE-2.529
 unders-2.516
aja-2.506
 Obj-2.480
Token than
Token ID621
Feature activation-2.12e+00

Pos logit contributions
ince+1.330
agate+1.319
cius+1.262
theless+1.260
 Difference+1.246
Neg logit contributions
��-1.627
 unders-1.592
COMPLE-1.569
aja-1.562
 Obj-1.543
Token ever
Token ID1683
Feature activation-1.34e-01

Pos logit contributions
 unders+1.491
��+1.456
COMPLE+1.409
aja+1.402
 Obj+1.380
Neg logit contributions
agate-1.406
ince-1.372
 Difference-1.338
theless-1.336
cius-1.325
Token on
Token ID319
Feature activation+3.19e+00

Pos logit contributions
 unders+0.094
��+0.093
COMPLE+0.090
aja+0.089
 Obj+0.088
Neg logit contributions
agate-0.089
ince-0.088
theless-0.085
 Difference-0.084
cius-0.084
Token the
Token ID262
Feature activation+5.22e+00

Pos logit contributions
ince+1.913
agate+1.912
theless+1.826
cius+1.818
 Difference+1.814
Neg logit contributions
��-2.359
 unders-2.313
COMPLE-2.290
aja-2.275
 Obj-2.235
Token front
Token ID2166
Feature activation+1.74e+01

Pos logit contributions
ince+3.420
agate+3.379
theless+3.249
 Difference+3.222
cius+3.199
Neg logit contributions
��-3.622
COMPLE-3.491
 unders-3.461
aja-3.453
 Obj-3.395
Token lines
Token ID3951
Feature activation+5.80e+00

Pos logit contributions
ince+6.131
ments+6.050
theless+5.284
agate+5.116
het+5.035
Neg logit contributions
��-14.718
aja-14.418
COMPLE-14.317
 Adin-14.108
 Obj-14.027
Token of
Token ID286
Feature activation+2.33e+00

Pos logit contributions
ince+3.394
agate+3.283
cius+3.171
theless+3.148
ments+3.142
Neg logit contributions
��-4.487
COMPLE-4.228
aja-4.179
 unders-4.179
��-4.146
Token major
Token ID1688
Feature activation+4.42e+00

Pos logit contributions
agate+1.414
ince+1.409
 Difference+1.352
theless+1.346
cius+1.338
Neg logit contributions
��-1.724
 unders-1.670
COMPLE-1.663
aja-1.653
 Obj-1.633
Token global
Token ID3298
Feature activation+7.50e-01

Pos logit contributions
ince+2.659
agate+2.658
ments+2.541
 Difference+2.536
theless+2.521
Neg logit contributions
��-3.256
aja-3.134
COMPLE-3.134
 unders-3.118
 Obj-3.082
Token and
Token ID290
Feature activation-5.27e+00

Pos logit contributions
agate+0.488
ince+0.485
theless+0.465
 Difference+0.464
cius+0.462
Neg logit contributions
��-0.524
 unders-0.520
COMPLE-0.508
aja-0.504
 Obj-0.498
Token Ryan
Token ID6047
Feature activation+2.77e+00

Pos logit contributions
ince+5.469
 Difference+5.076
agate+5.036
theless+4.959
cius+4.815
Neg logit contributions
��-9.111
 unders-8.552
aja-8.525
COMPLE-8.413
��-8.273
Token said
Token ID531
Feature activation+6.73e+00

Pos logit contributions
ince+1.948
agate+1.947
theless+1.883
 Difference+1.863
cius+1.851
Neg logit contributions
��-1.789
 unders-1.737
COMPLE-1.709
aja-1.680
 Obj-1.666
Token:
Token ID25
Feature activation+2.49e+00

Pos logit contributions
ince+3.785
agate+3.653
 Difference+3.512
theless+3.486
ments+3.408
Neg logit contributions
��-5.290
 unders-5.121
COMPLE-5.021
aja-4.993
 Obj-4.855
Token
Token ID564
Feature activation-4.28e+00

Pos logit contributions
ince+1.341
agate+1.340
 Difference+1.274
theless+1.262
cius+1.251
Neg logit contributions
��-2.016
 unders-2.003
COMPLE-1.950
aja-1.932
 Obj-1.913
Token
Token ID250
Feature activation+9.69e-01

Pos logit contributions
��+2.154
 unders+2.134
COMPLE+2.059
aja+2.013
 Obj+1.980
Neg logit contributions
agate-3.615
ince-3.571
theless-3.565
cius-3.476
 Difference-3.466
TokenNo
Token ID2949
Feature activation+1.73e+01

Pos logit contributions
agate+0.589
ince+0.586
theless+0.559
 Difference+0.556
cius+0.555
Neg logit contributions
 unders-0.724
��-0.722
COMPLE-0.697
aja-0.693
 Obj-0.687
Token,
Token ID11
Feature activation+6.54e+00

Pos logit contributions
ince+8.217
 Difference+7.806
agate+7.576
ments+7.417
het+7.296
Neg logit contributions
��-13.173
COMPLE-12.250
 unders-11.988
aja-11.893
 Obj-11.553
Token look
Token ID804
Feature activation+4.85e+00

Pos logit contributions
ince+3.451
agate+3.369
 Difference+3.213
ments+3.175
theless+3.173
Neg logit contributions
��-5.340
 unders-5.159
COMPLE-5.044
aja-4.984
��-4.962
Token,
Token ID11
Feature activation+5.42e+00

Pos logit contributions
agate+2.787
ince+2.776
 Difference+2.632
ments+2.605
cius+2.555
Neg logit contributions
��-3.740
COMPLE-3.645
 unders-3.643
 Obj-3.513
aja-3.500
Token it
Token ID340
Feature activation-2.18e+00

Pos logit contributions
ince+3.045
agate+3.023
 Difference+2.873
cius+2.806
theless+2.803
Neg logit contributions
��-4.279
 unders-4.174
COMPLE-4.087
aja-4.011
��-3.967
Token
Token ID447
Feature activation+1.13e+00

Pos logit contributions
 unders+1.622
��+1.584
COMPLE+1.558
aja+1.558
 Obj+1.531
Neg logit contributions
agate-1.326
ince-1.311
theless-1.269
cius-1.260
 Difference-1.254
Token/
Token ID14
Feature activation+1.68e+00

Pos logit contributions
ince+1.804
agate+1.797
ments+1.669
 Difference+1.645
cius+1.636
Neg logit contributions
��-3.462
 unders-3.393
 Obj-3.270
��-3.265
COMPLE-3.251
Token2016
Token ID5304
Feature activation+5.21e+00

Pos logit contributions
agate+1.024
ince+1.018
theless+0.965
cius+0.963
 Difference+0.963
Neg logit contributions
 unders-1.245
��-1.240
COMPLE-1.196
aja-1.188
 Obj-1.187
Token/
Token ID14
Feature activation+2.39e+00

Pos logit contributions
ince+2.268
agate+2.239
ments+2.076
 Difference+2.040
cius+1.969
Neg logit contributions
��-4.927
 unders-4.724
��-4.673
 Obj-4.571
aja-4.526
Token05
Token ID2713
Feature activation-7.40e-01

Pos logit contributions
agate+1.313
ince+1.309
theless+1.227
cius+1.225
 Difference+1.219
Neg logit contributions
��-1.925
 unders-1.916
COMPLE-1.838
 Obj-1.835
aja-1.827
Token/
Token ID14
Feature activation+6.87e+00

Pos logit contributions
 unders+0.602
��+0.593
COMPLE+0.587
aja+0.582
 Obj+0.573
Neg logit contributions
agate-0.398
ince-0.393
theless-0.383
cius-0.375
 Difference-0.374
Token5
Token ID20
Feature activation+1.73e+01

Pos logit contributions
ince+4.394
agate+4.377
cius+4.172
ments+4.137
theless+4.091
Neg logit contributions
��-4.810
 unders-4.689
 Obj-4.638
COMPLE-4.509
��-4.488
Token-
Token ID12
Feature activation+4.77e+00

Pos logit contributions
ince+8.866
agate+8.673
ments+8.297
cius+8.076
 Difference+7.981
Neg logit contributions
��-12.146
 Obj-11.581
COMPLE-11.480
 unders-11.363
 Neh-11.217
Tokenways
Token ID1322
Feature activation+5.56e+00

Pos logit contributions
ince+3.214
agate+3.180
cius+3.072
ments+3.032
theless+3.029
Neg logit contributions
��-3.227
 unders-3.182
 Obj-3.106
COMPLE-3.047
��-2.996
Token-
Token ID12
Feature activation+5.84e+00

Pos logit contributions
ince+3.054
agate+3.049
cius+2.865
ments+2.837
theless+2.799
Neg logit contributions
��-4.456
 unders-4.362
COMPLE-4.304
 Obj-4.299
aja-4.196
Tokenlaw
Token ID6270
Feature activation+7.31e+00

Pos logit contributions
ince+3.694
agate+3.625
cius+3.458
ments+3.444
theless+3.406
Neg logit contributions
��-4.206
 unders-4.098
 Obj-4.063
COMPLE-3.959
��-3.938
Token-
Token ID12
Feature activation+4.65e+00

Pos logit contributions
ince+4.643
agate+4.561
ments+4.488
cius+4.378
 Difference+4.363
Neg logit contributions
��-5.128
 unders-4.912
COMPLE-4.879
 Obj-4.831
aja-4.803
Token Policy
Token ID7820
Feature activation+2.56e-01

Pos logit contributions
agate+0.140
ince+0.139
theless+0.135
cius+0.134
 Difference+0.133
Neg logit contributions
 unders-0.103
��-0.101
COMPLE-0.098
aja-0.097
 Obj-0.096
Token here
Token ID994
Feature activation-7.19e+00

Pos logit contributions
agate+0.187
ince+0.186
theless+0.180
 Difference+0.179
cius+0.178
Neg logit contributions
��-0.158
 unders-0.158
COMPLE-0.153
aja-0.151
 Obj-0.149
Token.
Token ID13
Feature activation+7.21e-01

Pos logit contributions
 unders+4.275
aja+4.264
 Obj+4.205
COMPLE+4.156
 Neh+3.929
Neg logit contributions
theless-5.298
agate-5.125
ince-4.993
cius-4.918
 Difference-4.801
Token Sign
Token ID5865
Feature activation-3.80e-01

Pos logit contributions
agate+0.483
ince+0.481
theless+0.461
 Difference+0.461
cius+0.457
Neg logit contributions
��-0.492
 unders-0.491
COMPLE-0.473
aja-0.469
 Obj-0.462
Token up
Token ID510
Feature activation+3.49e-02

Pos logit contributions
 unders+0.276
��+0.271
COMPLE+0.265
aja+0.263
 Obj+0.260
Neg logit contributions
agate-0.239
ince-0.237
theless-0.230
 Difference-0.227
cius-0.227
Token for
Token ID329
Feature activation+1.73e+01

Pos logit contributions
agate+0.023
ince+0.022
theless+0.022
cius+0.021
 Difference+0.021
Neg logit contributions
 unders-0.025
��-0.024
COMPLE-0.024
aja-0.024
 Obj-0.023
Token Take
Token ID7214
Feature activation-4.99e+00

Pos logit contributions
ince+5.639
 Difference+5.311
agate+5.250
ments+5.017
het+4.308
Neg logit contributions
��-16.502
��-15.975
COMPLE-15.090
geoning-14.964
 unders-14.779
Token Action
Token ID7561
Feature activation+3.93e-01

Pos logit contributions
 unders+2.462
aja+2.338
COMPLE+2.321
 Obj+2.320
 Neh+2.188
Neg logit contributions
theless-4.306
ince-4.248
agate-4.229
cius-4.159
uyomi-3.931
Token Now
Token ID2735
Feature activation-1.13e+00

Pos logit contributions
agate+0.259
ince+0.257
 Difference+0.246
theless+0.246
cius+0.244
Neg logit contributions
 unders-0.273
��-0.272
COMPLE-0.263
aja-0.261
 Obj-0.257
Token and
Token ID290
Feature activation+2.00e+00

Pos logit contributions
 unders+0.769
��+0.740
COMPLE+0.734
aja+0.730
 Obj+0.716
Neg logit contributions
agate-0.766
ince-0.756
theless-0.741
 Difference-0.730
cius-0.728
Token we
Token ID356
Feature activation+5.65e+00

Pos logit contributions
agate+1.102
ince+1.092
 Difference+1.029
theless+1.028
cius+1.026
Neg logit contributions
��-1.608
 unders-1.590
COMPLE-1.554
aja-1.541
 Obj-1.524
Token
Token ID198
Feature activation+5.97e+00

Pos logit contributions
agate+1.141
ince+1.134
 Difference+1.080
theless+1.072
cius+1.067
Neg logit contributions
��-1.399
 unders-1.386
COMPLE-1.334
aja-1.328
 Obj-1.316
Tokenand
Token ID392
Feature activation+3.47e+00

Pos logit contributions
ince+3.631
agate+3.602
theless+3.329
 Difference+3.319
ments+3.317
Neg logit contributions
��-4.488
 unders-4.435
 Obj-4.210
COMPLE-4.164
aja-4.155
Token the
Token ID262
Feature activation+1.98e-01

Pos logit contributions
agate+2.107
ince+2.098
 Difference+2.026
theless+2.015
cius+1.986
Neg logit contributions
��-2.574
 unders-2.526
COMPLE-2.487
aja-2.443
 Obj-2.382
Token do
Token ID466
Feature activation+3.59e+00

Pos logit contributions
agate+0.140
ince+0.139
theless+0.134
 Difference+0.134
cius+0.133
Neg logit contributions
 unders-0.127
��-0.127
COMPLE-0.123
aja-0.122
 Obj-0.120
Token..
Token ID492
Feature activation+2.10e+00

Pos logit contributions
ince+2.363
agate+2.355
 Difference+2.264
cius+2.247
theless+2.221
Neg logit contributions
��-2.485
 unders-2.427
COMPLE-2.378
aja-2.318
 Obj-2.290
Tokenend
Token ID437
Feature activation+1.71e+01

Pos logit contributions
agate+1.334
ince+1.329
 Difference+1.275
theless+1.274
cius+1.260
Neg logit contributions
��-1.506
 unders-1.499
COMPLE-1.451
aja-1.428
 Obj-1.421
Token syntax
Token ID15582
Feature activation+4.48e+00

Pos logit contributions
ince+9.558
 Difference+9.530
ments+9.235
agate+9.225
cius+8.995
Neg logit contributions
��-11.129
COMPLE-10.454
 unders-10.180
��-10.043
 Obj-9.855
Token e
Token ID304
Feature activation+5.21e+00

Pos logit contributions
ince+2.553
agate+2.540
 Difference+2.446
theless+2.410
cius+2.386
Neg logit contributions
��-3.509
 unders-3.420
COMPLE-3.372
aja-3.285
 Obj-3.241
Token.
Token ID13
Feature activation-6.04e+00

Pos logit contributions
ince+2.868
agate+2.841
ments+2.654
cius+2.630
 Difference+2.629
Neg logit contributions
��-4.306
 unders-4.174
COMPLE-4.014
��-3.996
 Obj-3.971
Tokeng
Token ID70
Feature activation+4.38e+00

Pos logit contributions
 unders+1.654
aja+1.529
COMPLE+1.432
��+1.383
 Obj+1.364
Neg logit contributions
theless-6.335
agate-6.293
ince-6.147
 Difference-6.145
cius-6.142
Token.:
Token ID11207
Feature activation+2.07e+00

Pos logit contributions
agate+2.733
ince+2.699
 Difference+2.569
cius+2.553
ments+2.522
Neg logit contributions
��-3.202
 unders-3.166
COMPLE-3.091
aja-3.027
 Obj-2.968
Token Real
Token ID6416
Feature activation-1.32e+00

Pos logit contributions
agate+1.312
ince+1.309
 Difference+1.275
cius+1.256
theless+1.255
Neg logit contributions
��-1.025
 unders-1.015
COMPLE-0.985
aja-0.968
 Obj-0.947
Tokenist
Token ID396
Feature activation-5.37e-02

Pos logit contributions
 unders+0.853
��+0.845
COMPLE+0.819
aja+0.814
 Obj+0.805
Neg logit contributions
agate-0.926
ince-0.917
theless-0.892
cius-0.881
 Difference-0.879
Token Reader
Token ID25342
Feature activation+4.18e-01

Pos logit contributions
 unders+0.036
��+0.036
COMPLE+0.035
aja+0.035
 Obj+0.034
Neg logit contributions
agate-0.036
ince-0.036
theless-0.035
 Difference-0.035
cius-0.034
Token [
Token ID685
Feature activation+7.48e+00

Pos logit contributions
agate+0.277
ince+0.275
theless+0.264
 Difference+0.263
cius+0.263
Neg logit contributions
 unders-0.289
��-0.288
COMPLE-0.279
aja-0.276
 Obj-0.273
TokenEmail
Token ID15333
Feature activation+1.06e+01

Pos logit contributions
ince+4.958
agate+4.906
 Difference+4.717
cius+4.686
ments+4.656
Neg logit contributions
��-5.024
 unders-4.888
 Obj-4.619
COMPLE-4.616
��-4.596
Token him
Token ID683
Feature activation+1.69e+01

Pos logit contributions
agate+5.558
ince+5.485
 Difference+5.321
ments+5.151
cius+5.020
Neg logit contributions
��-8.512
COMPLE-8.176
 unders-8.011
aja-7.834
geoning-7.581
Token]
Token ID60
Feature activation+6.58e+00

Pos logit contributions
ince+8.344
agate+7.875
ments+7.767
 Difference+7.397
theless+7.384
Neg logit contributions
��-12.869
COMPLE-11.907
aja-11.631
��-11.629
 unders-11.541
Token In
Token ID554
Feature activation+1.94e+00

Pos logit contributions
agate+4.259
ince+4.241
 Difference+4.169
cius+4.083
ments+4.034
Neg logit contributions
��-4.569
 unders-4.335
COMPLE-4.324
aja-4.168
��-4.097
Token looking
Token ID2045
Feature activation+4.38e+00

Pos logit contributions
agate+1.229
ince+1.225
 Difference+1.169
cius+1.159
theless+1.159
Neg logit contributions
��-1.394
 unders-1.383
COMPLE-1.344
aja-1.329
 Obj-1.316
Token at
Token ID379
Feature activation-1.73e+00

Pos logit contributions
agate+2.643
ince+2.627
ments+2.473
 Difference+2.459
theless+2.450
Neg logit contributions
��-3.281
 unders-3.173
COMPLE-3.148
aja-3.114
 Obj-3.067
Token the
Token ID262
Feature activation-1.26e+00

Pos logit contributions
 unders+1.212
��+1.192
COMPLE+1.160
aja+1.153
 Obj+1.142
Neg logit contributions
agate-1.141
ince-1.126
theless-1.095
cius-1.081
 Difference-1.076
Token stripping
Token ID37727
Feature activation-9.27e+00

Pos logit contributions
 unders+4.414
COMPLE+4.027
aja+4.024
��+3.994
 Obj+3.972
Neg logit contributions
agate-4.537
ince-4.525
theless-4.366
 Difference-4.280
cius-4.274
Token back
Token ID736
Feature activation-8.18e+00

Pos logit contributions
 unders+5.860
COMPLE+5.243
aja+5.198
 Obj+5.115
 Neh+5.064
Neg logit contributions
agate-6.505
ince-6.469
theless-6.272
uyomi-6.268
cius-6.265
Token the
Token ID262
Feature activation-1.08e+01

Pos logit contributions
 unders+5.331
COMPLE+4.991
��+4.936
aja+4.896
 Obj+4.802
Neg logit contributions
agate-5.537
ince-5.430
 Difference-5.361
theless-5.273
cius-5.248
Token shield
Token ID7614
Feature activation-5.70e+00

Pos logit contributions
 unders+7.257
COMPLE+6.462
 Neh+6.331
 Obj+6.276
��+6.241
Neg logit contributions
agate-7.124
ince-7.080
theless-6.795
uyomi-6.765
cius-6.733
Token a
Token ID257
Feature activation-1.82e+01

Pos logit contributions
 unders+3.791
COMPLE+3.530
��+3.509
aja+3.499
 Obj+3.439
Neg logit contributions
agate-3.925
ince-3.870
theless-3.816
 Difference-3.722
cius-3.685
Token few
Token ID1178
Feature activation-2.08e+01

Pos logit contributions
 unders+9.961
COMPLE+9.368
aja+9.316
��+8.725
pload+8.700
Neg logit contributions
agate-12.026
ince-11.365
 Difference-11.278
uyomi-11.219
theless-11.133
Token inches
Token ID8331
Feature activation-1.32e+01

Pos logit contributions
 unders+11.789
COMPLE+10.193
 straw+10.037
aja+9.984
 Neh+9.955
Neg logit contributions
agate-13.075
uyomi-12.370
 Difference-12.270
ince-12.225
cius-12.083
Token and
Token ID290
Feature activation-8.01e+00

Pos logit contributions
 unders+7.632
COMPLE+6.527
 Obj+6.452
aja+6.450
 prox+6.430
Neg logit contributions
agate-9.494
ince-9.293
theless-9.170
cius-9.012
 Difference-8.918
Token making
Token ID1642
Feature activation-6.02e+00

Pos logit contributions
 unders+5.245
COMPLE+4.690
aja+4.617
 Obj+4.568
��+4.554
Neg logit contributions
ince-5.524
agate-5.473
theless-5.368
cius-5.292
 Difference-5.262
Token a
Token ID257
Feature activation-9.68e+00

Pos logit contributions
 unders+3.987
COMPLE+3.675
aja+3.628
��+3.595
 Obj+3.554
Neg logit contributions
ince-4.166
agate-4.161
cius-4.091
theless-4.026
 Difference-3.984
Token couple
Token ID3155
Feature activation-1.00e+01

Pos logit contributions
 unders+6.528
COMPLE+6.039
aja+5.826
 Neh+5.757
��+5.754
Neg logit contributions
agate-6.347
ince-6.311
theless-6.153
cius-6.080
uyomi-5.960
Token,
Token ID11
Feature activation-1.08e+01

Pos logit contributions
 unders+3.652
��+3.529
aja+3.481
COMPLE+3.454
 Obj+3.353
Neg logit contributions
agate-4.003
ince-3.905
theless-3.852
cius-3.820
 Difference-3.807
Token let
Token ID1309
Feature activation-5.68e+00

Pos logit contributions
 unders+6.614
aja+6.121
COMPLE+6.112
 Obj+6.065
��+6.015
Neg logit contributions
agate-7.524
ince-7.393
cius-7.315
uyomi-7.234
theless-7.152
Token
Token ID447
Feature activation+1.32e+00

Pos logit contributions
 unders+3.882
aja+3.763
��+3.760
COMPLE+3.743
 Obj+3.660
Neg logit contributions
agate-3.683
ince-3.614
theless-3.593
cius-3.539
 Difference-3.499
Token
Token ID247
Feature activation-1.71e+00

Pos logit contributions
agate+0.705
ince+0.698
 Difference+0.657
cius+0.656
theless+0.650
Neg logit contributions
 unders-1.077
��-1.070
COMPLE-1.048
aja-1.038
 Obj-1.032
Tokens
Token ID82
Feature activation-6.59e+00

Pos logit contributions
 unders+1.437
��+1.419
COMPLE+1.391
aja+1.385
 Obj+1.368
Neg logit contributions
agate-0.875
ince-0.861
theless-0.824
 Difference-0.820
cius-0.819
Token take
Token ID1011
Feature activation-2.05e+01

Pos logit contributions
 unders+4.314
aja+3.997
 Obj+3.914
COMPLE+3.910
��+3.872
Neg logit contributions
agate-4.632
theless-4.561
ince-4.538
cius-4.477
 Difference-4.412
Token a
Token ID257
Feature activation-1.22e+01

Pos logit contributions
 unders+8.997
 Obj+8.128
aja+7.926
 Neh+7.762
COMPLE+7.387
Neg logit contributions
cius-15.579
theless-15.281
agate-14.727
ince-14.696
uyomi-14.479
Token moment
Token ID2589
Feature activation-8.19e+00

Pos logit contributions
 unders+7.031
aja+6.584
 Obj+6.515
COMPLE+6.514
��+6.442
Neg logit contributions
agate-8.519
cius-8.421
ince-8.352
theless-8.303
uyomi-8.075
Token to
Token ID284
Feature activation-9.72e+00

Pos logit contributions
 unders+4.846
aja+4.583
��+4.531
COMPLE+4.527
 Obj+4.453
Neg logit contributions
agate-5.934
theless-5.925
ince-5.844
 Difference-5.814
cius-5.737
Token discuss
Token ID2112
Feature activation-7.84e+00

Pos logit contributions
 unders+6.390
aja+5.921
 Obj+5.795
COMPLE+5.682
��+5.508
Neg logit contributions
agate-6.645
theless-6.625
ince-6.509
cius-6.226
 Difference-6.220
Token some
Token ID617
Feature activation-4.85e+00

Pos logit contributions
 unders+5.025
aja+4.745
COMPLE+4.721
 Obj+4.710
��+4.679
Neg logit contributions
agate-5.416
ince-5.320
cius-5.284
theless-5.094
uyomi-5.020
Tokenitized
Token ID36951
Feature activation-2.56e+00

Pos logit contributions
 unders+2.637
��+2.555
aja+2.528
COMPLE+2.514
 Obj+2.488
Neg logit contributions
agate-3.299
ince-3.243
theless-3.222
 Difference-3.193
cius-3.167
Token limited
Token ID3614
Feature activation+7.84e+00

Pos logit contributions
 unders+1.784
��+1.729
COMPLE+1.700
aja+1.674
 Obj+1.668
Neg logit contributions
agate-1.691
ince-1.666
theless-1.618
cius-1.610
 Difference-1.603
Token evidence
Token ID2370
Feature activation-1.19e+00

Pos logit contributions
ince+4.916
agate+4.879
ments+4.776
theless+4.639
 Difference+4.625
Neg logit contributions
��-5.552
COMPLE-5.214
aja-5.204
��-5.137
 unders-5.114
Token in
Token ID287
Feature activation-4.52e-02

Pos logit contributions
 unders+0.815
��+0.800
COMPLE+0.781
aja+0.774
 Obj+0.767
Neg logit contributions
agate-0.803
ince-0.795
theless-0.767
 Difference-0.763
cius-0.762
Token an
Token ID281
Feature activation+2.96e+00

Pos logit contributions
 unders+0.029
��+0.028
COMPLE+0.027
aja+0.027
 Obj+0.027
Neg logit contributions
agate-0.033
ince-0.032
theless-0.031
 Difference-0.031
cius-0.031
Token unt
Token ID1418
Feature activation-2.00e+01

Pos logit contributions
agate+2.048
ince+2.042
theless+1.937
cius+1.923
 Difference+1.921
Neg logit contributions
��-1.964
 unders-1.896
COMPLE-1.875
aja-1.864
 Obj-1.823
Tokenime
Token ID524
Feature activation+2.01e+00

Pos logit contributions
aja+10.320
 unders+10.225
COMPLE+9.752
oward+9.369
icro+9.343
Neg logit contributions
theless-12.997
 Difference-12.947
cius-12.934
uyomi-12.479
agate-12.152
Tokenly
Token ID306
Feature activation+1.80e+00

Pos logit contributions
agate+0.821
ince+0.806
theless+0.764
 Difference+0.753
cius+0.743
Neg logit contributions
 unders-1.898
��-1.892
COMPLE-1.857
aja-1.846
 Obj-1.830
Token fashion
Token ID6977
Feature activation+3.12e+00

Pos logit contributions
agate+1.014
ince+1.008
theless+0.967
 Difference+0.956
cius+0.944
Neg logit contributions
��-1.412
 unders-1.400
COMPLE-1.365
aja-1.361
 Obj-1.342
Token and
Token ID290
Feature activation+2.14e+00

Pos logit contributions
agate+1.934
ince+1.921
theless+1.835
ments+1.820
 Difference+1.818
Neg logit contributions
��-2.281
 unders-2.242
COMPLE-2.184
aja-2.170
 Obj-2.143
Token call
Token ID869
Feature activation+2.82e+00

Pos logit contributions
agate+1.391
ince+1.378
theless+1.327
 Difference+1.319
cius+1.308
Neg logit contributions
��-1.503
 unders-1.482
COMPLE-1.449
aja-1.434
 Obj-1.414
Tokent
Token ID83
Feature activation+9.27e-01

Pos logit contributions
 unders+3.650
aja+3.492
 Obj+3.416
��+3.416
COMPLE+3.401
Neg logit contributions
theless-3.340
agate-3.301
ince-3.254
cius-3.213
 Difference-3.137
Token.
Token ID13
Feature activation+6.88e-02

Pos logit contributions
agate+0.605
ince+0.601
theless+0.576
 Difference+0.575
cius+0.571
Neg logit contributions
��-0.650
 unders-0.648
COMPLE-0.627
aja-0.621
 Obj-0.613
Token Now
Token ID2735
Feature activation-3.39e+00

Pos logit contributions
agate+0.044
ince+0.044
theless+0.042
 Difference+0.042
cius+0.042
Neg logit contributions
 unders-0.049
��-0.049
COMPLE-0.047
aja-0.047
 Obj-0.046
Token move
Token ID1445
Feature activation-1.30e+00

Pos logit contributions
 unders+2.274
��+2.190
aja+2.150
COMPLE+2.143
 Obj+2.138
Neg logit contributions
agate-2.307
ince-2.287
theless-2.224
cius-2.204
 Difference-2.176
Token back
Token ID736
Feature activation-4.22e+00

Pos logit contributions
 unders+0.892
��+0.865
COMPLE+0.846
aja+0.841
 Obj+0.837
Neg logit contributions
agate-0.872
ince-0.865
theless-0.834
cius-0.834
 Difference-0.828
Token one
Token ID530
Feature activation-1.99e+01

Pos logit contributions
 unders+2.815
COMPLE+2.651
��+2.646
 Obj+2.631
aja+2.629
Neg logit contributions
agate-2.883
ince-2.842
cius-2.791
theless-2.771
 Difference-2.760
Token metre
Token ID42760
Feature activation-1.13e+01

Pos logit contributions
 unders+10.999
 Obj+9.548
aja+9.261
COMPLE+9.027
 prox+8.915
Neg logit contributions
agate-13.316
ince-12.606
cius-12.597
theless-12.580
uyomi-12.305
Token,
Token ID11
Feature activation-4.69e+00

Pos logit contributions
 unders+6.819
 Obj+5.964
aja+5.961
COMPLE+5.797
 prox+5.730
Neg logit contributions
agate-8.165
ince-7.967
cius-7.943
theless-7.666
 Difference-7.534
Token and
Token ID290
Feature activation-3.60e+00

Pos logit contributions
 unders+3.064
COMPLE+2.847
��+2.840
aja+2.836
 Obj+2.820
Neg logit contributions
agate-3.294
ince-3.250
theless-3.187
cius-3.141
 Difference-3.108
Token repeat
Token ID9585
Feature activation-3.33e+00

Pos logit contributions
 unders+2.408
��+2.274
aja+2.245
COMPLE+2.244
 Obj+2.238
Neg logit contributions
agate-2.493
ince-2.473
theless-2.399
cius-2.380
 Difference-2.350
Token the
Token ID262
Feature activation-2.98e+00

Pos logit contributions
 unders+2.283
��+2.192
COMPLE+2.165
 Obj+2.152
aja+2.151
Neg logit contributions
agate-2.239
ince-2.213
theless-2.149
cius-2.132
 Difference-2.097
Token a
Token ID257
Feature activation-9.68e+00

Pos logit contributions
 unders+3.987
COMPLE+3.675
aja+3.628
��+3.595
 Obj+3.554
Neg logit contributions
ince-4.166
agate-4.161
cius-4.091
theless-4.026
 Difference-3.984
Token couple
Token ID3155
Feature activation-1.00e+01

Pos logit contributions
 unders+6.528
COMPLE+6.039
aja+5.826
 Neh+5.757
��+5.754
Neg logit contributions
agate-6.347
ince-6.311
theless-6.153
cius-6.080
uyomi-5.960
Token of
Token ID286
Feature activation-1.03e+01

Pos logit contributions
 unders+6.762
COMPLE+6.122
��+6.047
aja+6.004
 Obj+5.827
Neg logit contributions
agate-6.679
ince-6.459
cius-6.352
theless-6.285
 Difference-6.198
Token turns
Token ID4962
Feature activation-6.92e+00

Pos logit contributions
 unders+7.616
COMPLE+6.925
aja+6.789
��+6.682
 Obj+6.661
Neg logit contributions
agate-6.189
ince-6.071
cius-6.015
theless-5.878
uyomi-5.780
Token about
Token ID546
Feature activation-1.20e+01

Pos logit contributions
 unders+4.440
COMPLE+4.064
aja+4.021
 Obj+3.954
��+3.938
Neg logit contributions
ince-4.870
agate-4.859
theless-4.715
cius-4.638
 Difference-4.587
Token an
Token ID281
Feature activation-1.93e+01

Pos logit contributions
 unders+7.476
 Obj+6.744
COMPLE+6.700
aja+6.677
 Neh+6.498
Neg logit contributions
agate-8.196
ince-8.105
uyomi-7.929
theless-7.779
cius-7.765
Token 1
Token ID352
Feature activation-1.23e+01

Pos logit contributions
 unders+9.280
 Obj+7.499
aja+7.168
COMPLE+7.138
 Neh+6.982
Neg logit contributions
theless-14.769
 Difference-14.276
agate-14.265
ments-14.232
uyomi-14.145
Token-
Token ID12
Feature activation-1.22e+01

Pos logit contributions
 unders+8.165
COMPLE+7.637
aja+7.613
��+7.451
 Obj+7.449
Neg logit contributions
theless-7.735
agate-7.732
uyomi-7.404
 Difference-7.286
ments-7.252
Token2
Token ID17
Feature activation-1.69e+01

Pos logit contributions
 unders+6.872
COMPLE+6.674
aja+6.552
��+6.309
 Obj+6.204
Neg logit contributions
theless-8.368
agate-8.337
uyomi-8.200
 Difference-8.175
cius-8.086
Token inches
Token ID8331
Feature activation-1.15e+01

Pos logit contributions
 unders+9.825
COMPLE+8.912
aja+8.779
��+8.592
 Obj+8.464
Neg logit contributions
agate-10.843
uyomi-10.739
theless-10.714
cius-10.407
 Difference-10.323
Token in
Token ID287
Feature activation-8.81e+00

Pos logit contributions
 unders+6.935
aja+5.989
 Obj+5.956
COMPLE+5.901
 prox+5.815
Neg logit contributions
agate-8.188
ince-8.065
theless-8.058
cius-7.819
uyomi-7.671
Token by
Token ID416
Feature activation-6.63e+00

Pos logit contributions
 unders+0.468
��+0.462
COMPLE+0.448
aja+0.445
 Obj+0.439
Neg logit contributions
agate-0.507
ince-0.502
theless-0.487
 Difference-0.483
cius-0.481
Token stripping
Token ID37727
Feature activation-9.27e+00

Pos logit contributions
 unders+4.414
COMPLE+4.027
aja+4.024
��+3.994
 Obj+3.972
Neg logit contributions
agate-4.537
ince-4.525
theless-4.366
 Difference-4.280
cius-4.274
Token back
Token ID736
Feature activation-8.18e+00

Pos logit contributions
 unders+5.860
COMPLE+5.243
aja+5.198
 Obj+5.115
 Neh+5.064
Neg logit contributions
agate-6.505
ince-6.469
theless-6.272
uyomi-6.268
cius-6.265
Token the
Token ID262
Feature activation-1.08e+01

Pos logit contributions
 unders+5.331
COMPLE+4.991
��+4.936
aja+4.896
 Obj+4.802
Neg logit contributions
agate-5.537
ince-5.430
 Difference-5.361
theless-5.273
cius-5.248
Token shield
Token ID7614
Feature activation-5.70e+00

Pos logit contributions
 unders+7.257
COMPLE+6.462
 Neh+6.331
 Obj+6.276
��+6.241
Neg logit contributions
agate-7.124
ince-7.080
theless-6.795
uyomi-6.765
cius-6.733
Token a
Token ID257
Feature activation-1.82e+01

Pos logit contributions
 unders+3.791
COMPLE+3.530
��+3.509
aja+3.499
 Obj+3.439
Neg logit contributions
agate-3.925
ince-3.870
theless-3.816
 Difference-3.722
cius-3.685
Token few
Token ID1178
Feature activation-2.08e+01

Pos logit contributions
 unders+9.961
COMPLE+9.368
aja+9.316
��+8.725
pload+8.700
Neg logit contributions
agate-12.026
ince-11.365
 Difference-11.278
uyomi-11.219
theless-11.133
Token inches
Token ID8331
Feature activation-1.32e+01

Pos logit contributions
 unders+11.789
COMPLE+10.193
 straw+10.037
aja+9.984
 Neh+9.955
Neg logit contributions
agate-13.075
uyomi-12.370
 Difference-12.270
ince-12.225
cius-12.083
Token and
Token ID290
Feature activation-8.01e+00

Pos logit contributions
 unders+7.632
COMPLE+6.527
 Obj+6.452
aja+6.450
 prox+6.430
Neg logit contributions
agate-9.494
ince-9.293
theless-9.170
cius-9.012
 Difference-8.918
Token making
Token ID1642
Feature activation-6.02e+00

Pos logit contributions
 unders+5.245
COMPLE+4.690
aja+4.617
 Obj+4.568
��+4.554
Neg logit contributions
ince-5.524
agate-5.473
theless-5.368
cius-5.292
 Difference-5.262
Token a
Token ID257
Feature activation-9.68e+00

Pos logit contributions
 unders+3.987
COMPLE+3.675
aja+3.628
��+3.595
 Obj+3.554
Neg logit contributions
ince-4.166
agate-4.161
cius-4.091
theless-4.026
 Difference-3.984
Token Brian
Token ID8403
Feature activation+4.86e+00

Pos logit contributions
agate+1.003
ince+1.001
theless+0.959
 Difference+0.959
cius+0.953
Neg logit contributions
��-1.069
 unders-1.066
COMPLE-1.032
aja-1.021
 Obj-1.009
Token Albert
Token ID9966
Feature activation-5.91e+00

Pos logit contributions
agate+3.144
ince+3.124
 Difference+3.056
theless+3.012
cius+2.992
Neg logit contributions
��-3.434
 unders-3.318
COMPLE-3.232
aja-3.203
��-3.135
Token have
Token ID423
Feature activation-1.01e+01

Pos logit contributions
 unders+3.833
aja+3.740
��+3.656
COMPLE+3.620
 Obj+3.585
Neg logit contributions
agate-4.041
ince-3.922
 Difference-3.875
theless-3.868
cius-3.805
Token taken
Token ID2077
Feature activation-1.62e+01

Pos logit contributions
 unders+6.266
aja+5.711
COMPLE+5.687
 Obj+5.654
��+5.442
Neg logit contributions
agate-7.201
ince-7.112
cius-7.052
uyomi-6.699
 Difference-6.691
Token on
Token ID319
Feature activation-1.10e+01

Pos logit contributions
 unders+8.936
 Obj+8.192
aja+7.856
 Neh+7.623
COMPLE+7.336
Neg logit contributions
cius-11.603
theless-11.411
agate-11.376
ince-11.217
uyomi-10.659
Token the
Token ID262
Feature activation-1.79e+01

Pos logit contributions
 unders+6.727
 Obj+6.321
aja+6.245
COMPLE+6.182
��+6.044
Neg logit contributions
agate-7.673
theless-7.641
cius-7.464
ince-7.463
 Difference-7.280
Token task
Token ID4876
Feature activation-7.13e+00

Pos logit contributions
 unders+11.013
 Obj+9.864
COMPLE+9.517
 Neh+9.505
��+9.311
Neg logit contributions
agate-11.134
theless-11.032
ince-10.995
cius-10.797
uyomi-10.377
Token of
Token ID286
Feature activation-5.95e+00

Pos logit contributions
 unders+4.381
��+4.110
aja+4.080
 Obj+4.067
COMPLE+4.058
Neg logit contributions
agate-5.282
theless-5.063
ince-5.019
 Difference-4.978
cius-4.943
Token ranking
Token ID12759
Feature activation-5.08e+00

Pos logit contributions
 unders+3.742
COMPLE+3.467
��+3.427
aja+3.413
 Obj+3.380
Neg logit contributions
agate-4.386
ince-4.331
theless-4.184
cius-4.168
 Difference-4.090
Token every
Token ID790
Feature activation-5.49e+00

Pos logit contributions
 unders+3.441
 Obj+3.240
��+3.230
COMPLE+3.209
aja+3.195
Neg logit contributions
agate-3.466
ince-3.415
theless-3.338
cius-3.317
 Difference-3.201
Token Halo
Token ID18597
Feature activation+6.08e-01

Pos logit contributions
 unders+3.536
��+3.344
 Obj+3.337
COMPLE+3.310
aja+3.248
Neg logit contributions
agate-3.991
ince-3.837
cius-3.774
theless-3.726
ments-3.640
Token like
Token ID588
Feature activation-1.47e+01

Pos logit contributions
 unders+1.809
��+1.736
COMPLE+1.715
aja+1.703
 Obj+1.702
Neg logit contributions
agate-2.029
ince-1.982
theless-1.933
cius-1.918
 Difference-1.908
Token utility
Token ID10361
Feature activation-8.71e+00

Pos logit contributions
 unders+8.822
 Obj+8.684
COMPLE+8.204
aja+8.127
 Neh+8.017
Neg logit contributions
agate-9.592
ince-9.110
cius-8.831
uyomi-8.800
theless-8.744
Token will
Token ID481
Feature activation-3.19e+00

Pos logit contributions
 unders+5.290
 Obj+5.091
COMPLE+5.028
aja+4.957
��+4.902
Neg logit contributions
agate-6.265
ince-5.978
theless-5.885
cius-5.854
 Difference-5.787
Token be
Token ID307
Feature activation-7.02e+00

Pos logit contributions
 unders+1.982
��+1.897
COMPLE+1.860
 Obj+1.856
aja+1.851
Neg logit contributions
agate-2.358
ince-2.316
theless-2.262
cius-2.246
 Difference-2.240
Token referenced
Token ID20717
Feature activation-7.76e+00

Pos logit contributions
 unders+4.494
 Obj+4.206
aja+4.147
COMPLE+4.122
��+4.115
Neg logit contributions
agate-5.064
ince-4.853
cius-4.766
theless-4.737
 Difference-4.617
Token as
Token ID355
Feature activation-1.78e+01

Pos logit contributions
 unders+4.912
 Obj+4.672
COMPLE+4.498
aja+4.454
��+4.450
Neg logit contributions
agate-5.451
theless-5.350
ince-5.346
cius-5.170
 Difference-5.144
Token "
Token ID366
Feature activation-1.51e+01

Pos logit contributions
 unders+10.157
 Obj+9.965
COMPLE+9.290
 prox+9.027
aja+8.889
Neg logit contributions
agate-11.937
cius-11.109
theless-11.026
ince-10.885
uyomi-10.711
Tokensnap
Token ID45380
Feature activation-4.27e+00

Pos logit contributions
COMPLE+8.794
 unders+8.232
 Obj+8.167
aja+8.055
��+7.779
Neg logit contributions
agate-9.822
theless-9.674
cius-9.404
uyomi-9.287
 Difference-9.160
Token"
Token ID1
Feature activation-6.15e+00

Pos logit contributions
 unders+2.975
COMPLE+2.907
��+2.898
aja+2.872
 Obj+2.852
Neg logit contributions
agate-2.711
ince-2.662
 Difference-2.595
theless-2.591
cius-2.557
Token (
Token ID357
Feature activation-6.31e+00

Pos logit contributions
 unders+3.861
COMPLE+3.700
��+3.658
 Obj+3.657
aja+3.654
Neg logit contributions
agate-4.366
ince-4.257
theless-4.207
cius-4.124
 Difference-4.103
Tokenwith
Token ID4480
Feature activation-6.06e+00

Pos logit contributions
 unders+4.527
COMPLE+4.468
��+4.399
aja+4.381
 Obj+4.356
Neg logit contributions
agate-3.818
ince-3.688
theless-3.674
cius-3.659
 Difference-3.599
Token gateway
Token ID24308
Feature activation-1.87e+00

Pos logit contributions
 unders+3.308
��+3.142
COMPLE+3.076
aja+3.051
 Obj+3.043
Neg logit contributions
agate-3.291
ince-3.205
cius-3.109
theless-3.082
 Difference-3.016
Token was
Token ID373
Feature activation-2.00e+00

Pos logit contributions
 unders+1.285
��+1.253
COMPLE+1.227
aja+1.212
 Obj+1.199
Neg logit contributions
agate-1.262
ince-1.248
theless-1.205
cius-1.198
 Difference-1.188
Token shoved
Token ID35137
Feature activation-9.90e-01

Pos logit contributions
 unders+1.388
��+1.341
COMPLE+1.316
aja+1.301
 Obj+1.283
Neg logit contributions
agate-1.349
ince-1.330
theless-1.284
 Difference-1.282
cius-1.277
Token to
Token ID284
Feature activation+4.04e+00

Pos logit contributions
 unders+0.657
��+0.641
COMPLE+0.626
aja+0.619
 Obj+0.613
Neg logit contributions
agate-0.689
ince-0.682
theless-0.659
cius-0.659
 Difference-0.657
Token an
Token ID281
Feature activation-4.96e+00

Pos logit contributions
agate+2.412
ince+2.406
theless+2.302
 Difference+2.269
ments+2.257
Neg logit contributions
��-3.019
 unders-2.936
COMPLE-2.909
aja-2.868
 Obj-2.862
Token unw
Token ID7379
Feature activation-1.77e+01

Pos logit contributions
 unders+3.271
COMPLE+2.993
 Obj+2.968
aja+2.949
��+2.932
Neg logit contributions
agate-3.528
ince-3.491
theless-3.411
cius-3.408
 Difference-3.396
Tokenel
Token ID417
Feature activation+1.81e+00

Pos logit contributions
 unders+6.559
COMPLE+6.241
aja+6.184
itary+6.114
asty+5.782
Neg logit contributions
cius-14.997
agate-14.624
uyomi-14.208
 Difference-14.095
theless-14.029
Tokencoming
Token ID4976
Feature activation-3.30e+00

Pos logit contributions
agate+0.696
ince+0.690
theless+0.648
 Difference+0.633
cius+0.633
Neg logit contributions
 unders-1.749
��-1.749
COMPLE-1.705
aja-1.692
 Obj-1.682
Token corner
Token ID5228
Feature activation-2.18e+00

Pos logit contributions
 unders+2.211
��+2.104
COMPLE+2.092
aja+2.053
 Obj+2.047
Neg logit contributions
agate-2.309
ince-2.267
cius-2.216
theless-2.175
 Difference-2.174
Token,
Token ID11
Feature activation+1.29e+00

Pos logit contributions
 unders+1.487
��+1.440
COMPLE+1.415
aja+1.401
 Obj+1.380
Neg logit contributions
agate-1.495
ince-1.469
cius-1.423
theless-1.420
 Difference-1.410
Token which
Token ID543
Feature activation-2.74e+00

Pos logit contributions
agate+0.842
ince+0.837
theless+0.806
 Difference+0.799
cius+0.795
Neg logit contributions
��-0.897
 unders-0.889
COMPLE-0.865
aja-0.861
 Obj-0.851
Tokenugg
Token ID6837
Feature activation+1.13e+00

Pos logit contributions
agate+0.013
ince+0.013
theless+0.012
 Difference+0.012
cius+0.012
Neg logit contributions
 unders-0.008
��-0.008
COMPLE-0.008
aja-0.007
 Obj-0.007
Tokens
Token ID82
Feature activation+1.43e+00

Pos logit contributions
agate+0.541
ince+0.533
theless+0.504
 Difference+0.502
cius+0.499
Neg logit contributions
��-0.989
 unders-0.983
COMPLE-0.961
aja-0.951
 Obj-0.945
Token drives
Token ID10182
Feature activation-5.98e+00

Pos logit contributions
agate+0.914
ince+0.907
theless+0.874
 Difference+0.868
cius+0.863
Neg logit contributions
��-1.019
 unders-1.004
COMPLE-0.980
aja-0.970
 Obj-0.961
Token him
Token ID683
Feature activation-5.87e+00

Pos logit contributions
 unders+4.117
 Obj+3.777
COMPLE+3.768
��+3.758
aja+3.723
Neg logit contributions
agate-4.008
ince-3.978
cius-3.914
theless-3.897
 Difference-3.792
Token back
Token ID736
Feature activation-6.47e+00

Pos logit contributions
 unders+3.802
COMPLE+3.466
aja+3.439
 Obj+3.412
��+3.407
Neg logit contributions
agate-4.167
ince-4.164
cius-4.112
 Difference-4.047
theless-3.995
Token a
Token ID257
Feature activation-1.77e+01

Pos logit contributions
 unders+4.175
COMPLE+3.872
aja+3.811
 Obj+3.795
��+3.762
Neg logit contributions
agate-4.555
ince-4.506
cius-4.477
 Difference-4.463
theless-4.385
Token couple
Token ID3155
Feature activation-1.71e+01

Pos logit contributions
 unders+8.203
COMPLE+7.549
 prox+6.885
pload+6.848
 Neh+6.815
Neg logit contributions
agate-13.900
cius-13.342
ince-13.253
 Difference-13.044
uyomi-12.915
Token yards
Token ID5695
Feature activation-9.47e+00

Pos logit contributions
 unders+8.754
COMPLE+7.194
 Neh+7.083
 Obj+7.013
 straw+6.906
Neg logit contributions
agate-13.362
cius-12.797
ince-12.594
theless-12.489
 Difference-12.374
Token,
Token ID11
Feature activation+3.89e-02

Pos logit contributions
 unders+5.854
COMPLE+5.264
aja+5.185
 Obj+5.154
 Neh+4.948
Neg logit contributions
agate-6.838
cius-6.769
ince-6.673
 Difference-6.668
theless-6.553
Token but
Token ID475
Feature activation-9.22e-01

Pos logit contributions
agate+0.025
ince+0.025
theless+0.024
 Difference+0.023
cius+0.023
Neg logit contributions
 unders-0.028
��-0.028
COMPLE-0.027
aja-0.027
 Obj-0.026
Token Bo
Token ID3248
Feature activation-6.59e+00

Pos logit contributions
 unders+0.690
��+0.681
COMPLE+0.664
aja+0.659
 Obj+0.654
Neg logit contributions
agate-0.562
ince-0.556
theless-0.534
cius-0.530
 Difference-0.528
Tokenish
Token ID680
Feature activation-8.51e-02

Pos logit contributions
ince+2.591
agate+2.577
theless+2.478
 Difference+2.455
cius+2.435
Neg logit contributions
��-2.517
 unders-2.457
COMPLE-2.448
 Obj-2.344
��-2.271
Token and
Token ID290
Feature activation+2.88e+00

Pos logit contributions
 unders+0.060
��+0.060
COMPLE+0.058
aja+0.057
 Obj+0.057
Neg logit contributions
agate-0.055
ince-0.055
theless-0.053
 Difference-0.052
cius-0.052
Token G
Token ID402
Feature activation+1.41e+00

Pos logit contributions
agate+1.881
ince+1.879
 Difference+1.823
theless+1.802
cius+1.790
Neg logit contributions
��-1.989
 unders-1.955
COMPLE-1.920
aja-1.871
 Obj-1.862
Tokenhat
Token ID5183
Feature activation+4.55e-01

Pos logit contributions
ince+0.914
agate+0.914
cius+0.865
 Difference+0.864
theless+0.862
Neg logit contributions
 unders-0.989
��-0.989
COMPLE-0.961
 Obj-0.941
aja-0.932
Tokenfan
Token ID24408
Feature activation+1.59e+00

Pos logit contributions
agate+0.304
ince+0.302
theless+0.291
cius+0.289
 Difference+0.289
Neg logit contributions
 unders-0.311
��-0.310
COMPLE-0.301
aja-0.296
 Obj-0.294
Token def
Token ID825
Feature activation-1.75e+01

Pos logit contributions
agate+1.024
ince+1.017
theless+0.976
cius+0.968
 Difference+0.968
Neg logit contributions
��-1.118
 unders-1.114
COMPLE-1.083
 Obj-1.061
aja-1.054
Tokenected
Token ID11197
Feature activation-1.87e+00

Pos logit contributions
aja+7.141
 unders+6.556
itary+6.292
icro+5.983
COMPLE+5.945
Neg logit contributions
theless-14.692
 Difference-13.405
agate-13.301
cius-13.200
uyomi-13.165
Token,
Token ID11
Feature activation+2.13e-01

Pos logit contributions
 unders+1.256
��+1.222
COMPLE+1.200
aja+1.192
 Obj+1.174
Neg logit contributions
agate-1.282
ince-1.268
theless-1.230
 Difference-1.221
cius-1.217
Token was
Token ID373
Feature activation-2.75e-01

Pos logit contributions
agate+0.140
ince+0.138
theless+0.133
 Difference+0.133
cius+0.132
Neg logit contributions
 unders-0.148
��-0.148
COMPLE-0.143
aja-0.142
 Obj-0.140
Token admitted
Token ID6848
Feature activation-2.06e+00

Pos logit contributions
 unders+0.184
��+0.183
COMPLE+0.178
aja+0.176
 Obj+0.174
Neg logit contributions
agate-0.188
ince-0.187
theless-0.180
 Difference-0.179
cius-0.179
Token into
Token ID656
Feature activation-3.81e+00

Pos logit contributions
 unders+1.410
��+1.379
aja+1.344
COMPLE+1.344
 Obj+1.321
Neg logit contributions
agate-1.389
ince-1.376
theless-1.334
 Difference-1.322
cius-1.319
Tokens
Token ID82
Feature activation+1.43e+00

Pos logit contributions
agate+0.541
ince+0.533
theless+0.504
 Difference+0.502
cius+0.499
Neg logit contributions
��-0.989
 unders-0.983
COMPLE-0.961
aja-0.951
 Obj-0.945
Token drives
Token ID10182
Feature activation-5.98e+00

Pos logit contributions
agate+0.914
ince+0.907
theless+0.874
 Difference+0.868
cius+0.863
Neg logit contributions
��-1.019
 unders-1.004
COMPLE-0.980
aja-0.970
 Obj-0.961
Token him
Token ID683
Feature activation-5.87e+00

Pos logit contributions
 unders+4.117
 Obj+3.777
COMPLE+3.768
��+3.758
aja+3.723
Neg logit contributions
agate-4.008
ince-3.978
cius-3.914
theless-3.897
 Difference-3.792
Token back
Token ID736
Feature activation-6.47e+00

Pos logit contributions
 unders+3.802
COMPLE+3.466
aja+3.439
 Obj+3.412
��+3.407
Neg logit contributions
agate-4.167
ince-4.164
cius-4.112
 Difference-4.047
theless-3.995
Token a
Token ID257
Feature activation-1.77e+01

Pos logit contributions
 unders+4.175
COMPLE+3.872
aja+3.811
 Obj+3.795
��+3.762
Neg logit contributions
agate-4.555
ince-4.506
cius-4.477
 Difference-4.463
theless-4.385
Token couple
Token ID3155
Feature activation-1.71e+01

Pos logit contributions
 unders+8.203
COMPLE+7.549
 prox+6.885
pload+6.848
 Neh+6.815
Neg logit contributions
agate-13.900
cius-13.342
ince-13.253
 Difference-13.044
uyomi-12.915
Token yards
Token ID5695
Feature activation-9.47e+00

Pos logit contributions
 unders+8.754
COMPLE+7.194
 Neh+7.083
 Obj+7.013
 straw+6.906
Neg logit contributions
agate-13.362
cius-12.797
ince-12.594
theless-12.489
 Difference-12.374
Token,
Token ID11
Feature activation+3.89e-02

Pos logit contributions
 unders+5.854
COMPLE+5.264
aja+5.185
 Obj+5.154
 Neh+4.948
Neg logit contributions
agate-6.838
cius-6.769
ince-6.673
 Difference-6.668
theless-6.553
Token but
Token ID475
Feature activation-9.22e-01

Pos logit contributions
agate+0.025
ince+0.025
theless+0.024
 Difference+0.023
cius+0.023
Neg logit contributions
 unders-0.028
��-0.028
COMPLE-0.027
aja-0.027
 Obj-0.026
Token Bo
Token ID3248
Feature activation-6.59e+00

Pos logit contributions
 unders+0.690
��+0.681
COMPLE+0.664
aja+0.659
 Obj+0.654
Neg logit contributions
agate-0.562
ince-0.556
theless-0.534
cius-0.530
 Difference-0.528
Tokenz
Token ID89
Feature activation+6.55e-01

Pos logit contributions
 unders+4.167
aja+4.049
COMPLE+3.886
 Obj+3.771
��+3.760
Neg logit contributions
theless-4.725
agate-4.511
ince-4.452
 Difference-4.423
uyomi-4.375
Token5
Token ID20
Feature activation-6.67e+00

Pos logit contributions
 unders+2.059
��+2.012
COMPLE+1.999
aja+1.986
 Obj+1.919
Neg logit contributions
agate-2.707
ince-2.663
theless-2.653
 Difference-2.622
cius-2.594
Token {
Token ID1391
Feature activation-6.34e+00

Pos logit contributions
 unders+3.784
aja+3.730
 Obj+3.723
COMPLE+3.639
��+3.594
Neg logit contributions
agate-5.046
theless-5.002
ince-4.946
cius-4.842
ments-4.668
Token(
Token ID7
Feature activation-2.62e+00

Pos logit contributions
COMPLE+3.780
 unders+3.562
aja+3.546
 Obj+3.418
��+3.292
Neg logit contributions
theless-4.797
agate-4.650
cius-4.608
ince-4.600
uyomi-4.577
Token4
Token ID19
Feature activation+3.07e+00

Pos logit contributions
 unders+1.918
��+1.898
COMPLE+1.885
aja+1.858
 Obj+1.834
Neg logit contributions
agate-1.613
ince-1.589
theless-1.538
cius-1.527
 Difference-1.512
Tokens
Token ID82
Feature activation-5.06e+00

Pos logit contributions
agate+1.483
ince+1.478
theless+1.395
 Difference+1.392
ments+1.380
Neg logit contributions
 unders-2.643
��-2.636
COMPLE-2.571
aja-2.541
 Obj-2.513
Token)}
Token ID38165
Feature activation-1.70e+01

Pos logit contributions
 unders+2.717
COMPLE+2.634
aja+2.605
��+2.574
 Obj+2.562
Neg logit contributions
agate-4.004
ince-3.984
theless-3.959
cius-3.840
 Difference-3.836
Token 30
Token ID1542
Feature activation-2.01e+00

Pos logit contributions
 Obj+6.668
 unders+6.281
COMPLE+5.821
 Neh+5.724
aja+5.541
Neg logit contributions
agate-14.397
theless-13.624
ince-13.581
ments-13.100
cius-13.100
Token.
Token ID13
Feature activation-8.68e-01

Pos logit contributions
 unders+1.384
COMPLE+1.347
aja+1.335
��+1.328
 Obj+1.327
Neg logit contributions
agate-1.332
ince-1.306
theless-1.306
cius-1.271
 Difference-1.248
Token K
Token ID509
Feature activation+1.20e+00

Pos logit contributions
 unders+0.549
��+0.541
COMPLE+0.525
aja+0.522
 Obj+0.518
Neg logit contributions
agate-0.628
ince-0.622
theless-0.602
cius-0.598
 Difference-0.596
Tokenf
Token ID69
Feature activation+2.30e-01

Pos logit contributions
agate+0.770
ince+0.764
cius+0.728
theless+0.726
 Difference+0.724
Neg logit contributions
��-0.853
 unders-0.852
COMPLE-0.822
 Obj-0.807
aja-0.806
Token2
Token ID17
Feature activation-5.61e+00

Pos logit contributions
agate+0.149
ince+0.148
theless+0.142
 Difference+0.141
cius+0.141
Neg logit contributions
 unders-0.163
��-0.162
COMPLE-0.157
aja-0.156
 Obj-0.154
Token turns
Token ID4962
Feature activation-6.92e+00

Pos logit contributions
 unders+7.616
COMPLE+6.925
aja+6.789
��+6.682
 Obj+6.661
Neg logit contributions
agate-6.189
ince-6.071
cius-6.015
theless-5.878
uyomi-5.780
Token about
Token ID546
Feature activation-1.20e+01

Pos logit contributions
 unders+4.440
COMPLE+4.064
aja+4.021
 Obj+3.954
��+3.938
Neg logit contributions
ince-4.870
agate-4.859
theless-4.715
cius-4.638
 Difference-4.587
Token an
Token ID281
Feature activation-1.93e+01

Pos logit contributions
 unders+7.476
 Obj+6.744
COMPLE+6.700
aja+6.677
 Neh+6.498
Neg logit contributions
agate-8.196
ince-8.105
uyomi-7.929
theless-7.779
cius-7.765
Token 1
Token ID352
Feature activation-1.23e+01

Pos logit contributions
 unders+9.280
 Obj+7.499
aja+7.168
COMPLE+7.138
 Neh+6.982
Neg logit contributions
theless-14.769
 Difference-14.276
agate-14.265
ments-14.232
uyomi-14.145
Token-
Token ID12
Feature activation-1.22e+01

Pos logit contributions
 unders+8.165
COMPLE+7.637
aja+7.613
��+7.451
 Obj+7.449
Neg logit contributions
theless-7.735
agate-7.732
uyomi-7.404
 Difference-7.286
ments-7.252
Token2
Token ID17
Feature activation-1.69e+01

Pos logit contributions
 unders+6.872
COMPLE+6.674
aja+6.552
��+6.309
 Obj+6.204
Neg logit contributions
theless-8.368
agate-8.337
uyomi-8.200
 Difference-8.175
cius-8.086
Token inches
Token ID8331
Feature activation-1.15e+01

Pos logit contributions
 unders+9.825
COMPLE+8.912
aja+8.779
��+8.592
 Obj+8.464
Neg logit contributions
agate-10.843
uyomi-10.739
theless-10.714
cius-10.407
 Difference-10.323
Token in
Token ID287
Feature activation-8.81e+00

Pos logit contributions
 unders+6.935
aja+5.989
 Obj+5.956
COMPLE+5.901
 prox+5.815
Neg logit contributions
agate-8.188
ince-8.065
theless-8.058
cius-7.819
uyomi-7.671
Token diameter
Token ID14753
Feature activation-7.05e+00

Pos logit contributions
 unders+5.435
COMPLE+5.078
aja+4.991
 Obj+4.869
��+4.860
Neg logit contributions
agate-6.228
theless-6.222
ince-6.156
 Difference-5.883
cius-5.871
Token.
Token ID13
Feature activation-5.75e+00

Pos logit contributions
 unders+4.594
COMPLE+4.251
aja+4.202
 Obj+4.146
��+4.126
Neg logit contributions
agate-4.836
ince-4.821
theless-4.712
cius-4.578
 Difference-4.565
Token Sold
Token ID12324
Feature activation-4.04e+00

Pos logit contributions
 unders+3.796
COMPLE+3.641
 Obj+3.613
aja+3.579
��+3.550
Neg logit contributions
agate-3.871
ince-3.858
theless-3.783
cius-3.733
ments-3.584
Token duration
Token ID9478
Feature activation-1.64e+00

Pos logit contributions
 unders+2.652
��+2.493
COMPLE+2.461
 Obj+2.414
aja+2.411
Neg logit contributions
agate-3.370
ince-3.310
cius-3.187
theless-3.180
 Difference-3.097
Token of
Token ID286
Feature activation-2.92e+00

Pos logit contributions
 unders+1.122
��+1.093
COMPLE+1.069
aja+1.059
 Obj+1.046
Neg logit contributions
agate-1.116
ince-1.098
theless-1.062
cius-1.054
 Difference-1.053
Token the
Token ID262
Feature activation+4.72e-01

Pos logit contributions
 unders+1.988
��+1.931
COMPLE+1.890
aja+1.869
 Obj+1.858
Neg logit contributions
agate-1.993
ince-1.959
theless-1.910
cius-1.902
 Difference-1.873
Token game
Token ID983
Feature activation-3.74e+00

Pos logit contributions
agate+0.356
ince+0.354
theless+0.342
 Difference+0.341
cius+0.340
Neg logit contributions
 unders-0.281
��-0.281
COMPLE-0.270
aja-0.268
 Obj-0.264
Token chasing
Token ID20023
Feature activation-1.02e+01

Pos logit contributions
 unders+2.602
��+2.451
COMPLE+2.435
aja+2.404
 Obj+2.373
Neg logit contributions
agate-2.532
ince-2.492
theless-2.406
cius-2.398
 Difference-2.390
Token one
Token ID530
Feature activation-1.68e+01

Pos logit contributions
 unders+6.420
 Neh+5.670
aja+5.668
 Obj+5.606
��+5.566
Neg logit contributions
agate-7.201
cius-7.169
ince-7.103
theless-7.042
 Difference-6.764
Token another
Token ID1194
Feature activation-8.49e+00

Pos logit contributions
 unders+7.166
 Neh+5.946
aja+5.915
 Obj+5.780
itary+5.780
Neg logit contributions
cius-14.131
agate-13.868
theless-13.676
ince-13.496
 Difference-13.166
Token up
Token ID510
Feature activation-4.85e+00

Pos logit contributions
 unders+5.230
aja+4.619
COMPLE+4.564
 Obj+4.537
��+4.508
Neg logit contributions
agate-6.209
ince-6.114
cius-6.067
theless-6.066
 Difference-5.987
Token and
Token ID290
Feature activation-5.13e+00

Pos logit contributions
 unders+3.275
��+3.129
aja+3.064
COMPLE+3.038
 Obj+2.995
Neg logit contributions
agate-3.287
theless-3.233
ince-3.223
cius-3.177
 Difference-3.160
Token down
Token ID866
Feature activation-4.12e+00

Pos logit contributions
 unders+2.660
��+2.419
COMPLE+2.382
aja+2.369
 Obj+2.313
Neg logit contributions
agate-4.286
ince-4.285
theless-4.187
 Difference-4.177
cius-4.176
Token the
Token ID262
Feature activation-6.19e-01

Pos logit contributions
 unders+2.787
��+2.635
aja+2.590
COMPLE+2.585
 Obj+2.544
Neg logit contributions
agate-2.836
ince-2.778
cius-2.730
theless-2.711
 Difference-2.690
Token the
Token ID262
Feature activation-2.49e+00

Pos logit contributions
 unders+0.013
��+0.013
COMPLE+0.012
aja+0.012
 Obj+0.012
Neg logit contributions
agate-0.011
ince-0.011
theless-0.010
 Difference-0.010
cius-0.010
Token natural
Token ID3288
Feature activation+1.28e+00

Pos logit contributions
 unders+1.608
��+1.541
COMPLE+1.508
aja+1.478
 Obj+1.471
Neg logit contributions
agate-1.802
ince-1.784
cius-1.725
theless-1.720
 Difference-1.694
Token flora
Token ID48037
Feature activation-2.83e+00

Pos logit contributions
agate+0.876
ince+0.868
theless+0.840
 Difference+0.838
cius+0.828
Neg logit contributions
��-0.848
 unders-0.840
COMPLE-0.818
aja-0.814
 Obj-0.803
Token and
Token ID290
Feature activation+6.45e+00

Pos logit contributions
 unders+1.918
��+1.862
COMPLE+1.826
aja+1.820
 Obj+1.788
Neg logit contributions
agate-1.947
ince-1.913
theless-1.865
cius-1.849
 Difference-1.838
Token an
Token ID281
Feature activation+7.51e-01

Pos logit contributions
agate+3.671
ince+3.641
theless+3.499
cius+3.488
 Difference+3.471
Neg logit contributions
��-4.885
 unders-4.732
aja-4.695
COMPLE-4.683
 Obj-4.665
Token unt
Token ID1418
Feature activation-1.67e+01

Pos logit contributions
agate+0.509
ince+0.505
theless+0.486
cius+0.483
 Difference+0.483
Neg logit contributions
��-0.506
 unders-0.505
COMPLE-0.490
aja-0.484
 Obj-0.480
Tokenrained
Token ID13363
Feature activation+2.85e+00

Pos logit contributions
 unders+8.691
aja+8.538
COMPLE+8.222
icro+7.986
asty+7.714
Neg logit contributions
theless-11.563
cius-11.133
 Difference-11.058
agate-11.010
uyomi-10.747
Token eye
Token ID4151
Feature activation-3.31e-01

Pos logit contributions
ince+1.937
agate+1.934
theless+1.859
 Difference+1.853
cius+1.840
Neg logit contributions
��-1.907
 unders-1.854
COMPLE-1.824
aja-1.804
 Obj-1.774
Token will
Token ID481
Feature activation+2.49e+00

Pos logit contributions
 unders+0.228
��+0.225
COMPLE+0.219
aja+0.218
 Obj+0.215
Neg logit contributions
agate-0.221
ince-0.219
theless-0.211
 Difference-0.210
cius-0.210
Token never
Token ID1239
Feature activation+1.42e+00

Pos logit contributions
agate+1.692
ince+1.690
theless+1.620
 Difference+1.610
cius+1.606
Neg logit contributions
��-1.675
 unders-1.635
COMPLE-1.611
aja-1.587
 Obj-1.561
Token be
Token ID307
Feature activation+1.55e+00

Pos logit contributions
agate+0.973
ince+0.967
theless+0.935
 Difference+0.924
cius+0.922
Neg logit contributions
��-0.945
 unders-0.937
COMPLE-0.913
aja-0.900
 Obj-0.889
Token (
Token ID357
Feature activation-4.67e+00

Pos logit contributions
 unders+1.891
��+1.846
COMPLE+1.808
aja+1.800
 Obj+1.762
Neg logit contributions
agate-1.967
ince-1.918
theless-1.889
cius-1.864
 Difference-1.824
Tokencommit
Token ID41509
Feature activation+3.01e+00

Pos logit contributions
 unders+3.221
COMPLE+3.162
��+3.123
aja+3.075
 Obj+2.995
Neg logit contributions
agate-3.068
theless-2.983
ince-2.975
 Difference-2.887
cius-2.879
Token)
Token ID8
Feature activation-3.45e+00

Pos logit contributions
agate+1.936
ince+1.931
ments+1.878
 Difference+1.840
theless+1.835
Neg logit contributions
��-2.115
 unders-2.085
COMPLE-2.043
aja-2.012
 Obj-1.991
Token Levi
Token ID20196
Feature activation-4.26e-01

Pos logit contributions
 unders+2.372
��+2.296
COMPLE+2.281
aja+2.259
 Obj+2.245
Neg logit contributions
agate-2.318
ince-2.276
theless-2.209
cius-2.180
 Difference-2.136
Token Jones
Token ID5437
Feature activation-6.90e+00

Pos logit contributions
 unders+0.291
��+0.290
COMPLE+0.281
aja+0.278
 Obj+0.275
Neg logit contributions
agate-0.285
ince-0.283
theless-0.272
 Difference-0.271
cius-0.270
Token LB
Token ID22199
Feature activation-1.66e+01

Pos logit contributions
 unders+4.424
COMPLE+4.164
aja+4.142
 Obj+4.111
��+4.093
Neg logit contributions
agate-4.991
ince-4.777
theless-4.636
cius-4.586
uyomi-4.497
Token 6
Token ID718
Feature activation-1.64e+00

Pos logit contributions
 unders+8.379
COMPLE+7.507
aja+7.391
 Obj+7.312
 Neh+6.936
Neg logit contributions
agate-12.669
ince-12.345
theless-12.157
cius-12.088
uyomi-11.743
Token'
Token ID6
Feature activation-5.95e+00

Pos logit contributions
 unders+1.240
COMPLE+1.207
��+1.203
aja+1.195
 Obj+1.175
Neg logit contributions
agate-0.984
ince-0.969
theless-0.958
cius-0.938
 Difference-0.926
Token3
Token ID18
Feature activation-6.22e+00

Pos logit contributions
COMPLE+3.764
 unders+3.758
aja+3.653
��+3.583
 Obj+3.551
Neg logit contributions
agate-4.156
theless-4.039
ince-4.037
cius-3.983
 Difference-3.902
Token/
Token ID14
Feature activation-1.11e+01

Pos logit contributions
 unders+4.427
COMPLE+4.382
aja+4.252
 Obj+4.168
 Neh+4.030
Neg logit contributions
theless-3.966
agate-3.867
cius-3.757
ince-3.756
uyomi-3.663
Token215
Token ID23349
Feature activation-1.25e+00

Pos logit contributions
COMPLE+6.949
aja+6.672
 unders+6.554
 Obj+6.269
��+6.133
Neg logit contributions
agate-7.416
uyomi-7.375
cius-7.212
theless-7.206
ince-7.061
Token proposes
Token ID26017
Feature activation-2.01e+00

Pos logit contributions
 unders+1.045
��+1.022
COMPLE+0.995
aja+0.992
 Obj+0.981
Neg logit contributions
agate-1.003
ince-0.990
theless-0.958
 Difference-0.954
cius-0.949
Token will
Token ID481
Feature activation-7.67e-01

Pos logit contributions
 unders+1.377
��+1.333
COMPLE+1.311
aja+1.303
 Obj+1.281
Neg logit contributions
agate-1.360
ince-1.342
theless-1.308
cius-1.290
 Difference-1.288
Token encourage
Token ID7898
Feature activation-1.62e-02

Pos logit contributions
 unders+0.516
��+0.505
COMPLE+0.493
aja+0.489
 Obj+0.482
Neg logit contributions
agate-0.528
ince-0.522
theless-0.506
cius-0.502
 Difference-0.501
Token universities
Token ID11155
Feature activation-2.90e+00

Pos logit contributions
 unders+0.011
��+0.011
COMPLE+0.010
aja+0.010
 Obj+0.010
Neg logit contributions
agate-0.011
ince-0.011
theless-0.011
 Difference-0.011
cius-0.011
Token to
Token ID284
Feature activation-8.43e-01

Pos logit contributions
 unders+1.916
��+1.843
COMPLE+1.803
aja+1.793
 Obj+1.754
Neg logit contributions
agate-2.027
ince-2.000
theless-1.971
 Difference-1.932
cius-1.916
Token inf
Token ID1167
Feature activation-1.65e+01

Pos logit contributions
 unders+0.573
��+0.560
COMPLE+0.545
aja+0.542
 Obj+0.535
Neg logit contributions
agate-0.575
ince-0.570
theless-0.552
 Difference-0.547
cius-0.545
Tokenlate
Token ID17660
Feature activation+3.96e-01

Pos logit contributions
aja+7.105
 unders+6.876
COMPLE+6.073
 Obj+5.972
itary+5.822
Neg logit contributions
theless-13.994
uyomi-13.216
 Difference-13.039
agate-12.122
cius-12.020
Token prices
Token ID4536
Feature activation+8.64e-01

Pos logit contributions
agate+0.267
ince+0.265
theless+0.255
 Difference+0.254
cius+0.253
Neg logit contributions
 unders-0.268
��-0.267
COMPLE-0.259
aja-0.257
 Obj-0.254
Token and
Token ID290
Feature activation-1.26e+00

Pos logit contributions
agate+0.555
ince+0.552
theless+0.528
 Difference+0.526
cius+0.525
Neg logit contributions
��-0.613
 unders-0.609
COMPLE-0.592
aja-0.586
 Obj-0.580
Token offer
Token ID2897
Feature activation-4.18e-01

Pos logit contributions
 unders+0.864
��+0.843
COMPLE+0.822
aja+0.815
 Obj+0.805
Neg logit contributions
agate-0.856
ince-0.850
theless-0.821
 Difference-0.814
cius-0.811
Token sub
Token ID850
Feature activation-6.57e+00

Pos logit contributions
 unders+0.281
��+0.275
COMPLE+0.268
aja+0.266
 Obj+0.262
Neg logit contributions
agate-0.289
ince-0.286
theless-0.276
 Difference-0.274
cius-0.273
TokenFAULT
Token ID38865
Feature activation-5.84e+00

Pos logit contributions
COMPLE+5.219
 unders+5.176
��+4.930
aja+4.906
 Obj+4.781
Neg logit contributions
agate-5.942
theless-5.752
ince-5.696
cius-5.675
 Difference-5.474
Token 5
Token ID642
Feature activation-6.83e+00

Pos logit contributions
 unders+3.728
COMPLE+3.628
��+3.540
aja+3.485
 Obj+3.430
Neg logit contributions
agate-4.251
ince-4.070
theless-3.961
cius-3.919
ments-3.872
Token second
Token ID1218
Feature activation-1.25e+00

Pos logit contributions
 unders+4.233
COMPLE+4.164
��+4.062
aja+4.013
 Obj+3.874
Neg logit contributions
agate-4.984
ince-4.705
theless-4.659
 Difference-4.588
cius-4.561
Token delay
Token ID5711
Feature activation-3.06e+00

Pos logit contributions
 unders+0.824
��+0.809
COMPLE+0.792
aja+0.783
 Obj+0.772
Neg logit contributions
agate-0.874
ince-0.861
theless-0.834
cius-0.824
 Difference-0.822
Token before
Token ID878
Feature activation-5.33e-01

Pos logit contributions
 unders+2.081
��+2.025
COMPLE+2.007
aja+1.979
 Obj+1.961
Neg logit contributions
agate-2.063
ince-2.029
theless-1.974
cius-1.932
 Difference-1.922
Token reb
Token ID3405
Feature activation-1.64e+01

Pos logit contributions
 unders+0.346
��+0.343
COMPLE+0.333
aja+0.330
 Obj+0.326
Neg logit contributions
agate-0.376
ince-0.373
theless-0.360
 Difference-0.358
cius-0.358
Tokenooting
Token ID12494
Feature activation-5.33e+00

Pos logit contributions
aja+3.071
COMPLE+2.715
 unders+2.381
pload+2.070
geoning+2.000
Neg logit contributions
theless-17.254
agate-17.027
 Difference-16.742
ince-16.363
cius-16.304
Token
Token ID198
Feature activation-9.56e-01

Pos logit contributions
 unders+3.579
COMPLE+3.422
��+3.413
aja+3.358
 Obj+3.344
Neg logit contributions
agate-3.577
ince-3.520
theless-3.419
cius-3.413
 Difference-3.348
Token
Token ID198
Feature activation-2.51e+00

Pos logit contributions
 unders+0.652
COMPLE+0.636
��+0.629
aja+0.624
 Obj+0.615
Neg logit contributions
agate-0.640
ince-0.631
theless-0.625
cius-0.614
 Difference-0.606
Token3
Token ID18
Feature activation+3.66e-01

Pos logit contributions
 unders+1.716
��+1.702
COMPLE+1.682
aja+1.645
 Obj+1.620
Neg logit contributions
agate-1.651
ince-1.629
theless-1.595
cius-1.579
 Difference-1.576
Token.
Token ID13
Feature activation-3.96e+00

Pos logit contributions
agate+0.226
ince+0.224
theless+0.216
 Difference+0.214
cius+0.214
Neg logit contributions
 unders-0.269
��-0.267
COMPLE-0.260
aja-0.258
 Obj-0.255
Token
Token ID198
Feature activation+1.11e+00

Pos logit contributions
agate+0.262
ince+0.260
theless+0.250
 Difference+0.249
cius+0.248
Neg logit contributions
 unders-0.299
��-0.298
COMPLE-0.289
aja-0.287
 Obj-0.283
Token
Token ID198
Feature activation-1.69e+00

Pos logit contributions
agate+0.695
ince+0.689
theless+0.659
 Difference+0.658
cius+0.654
Neg logit contributions
 unders-0.809
��-0.809
COMPLE-0.780
aja-0.775
 Obj-0.766
TokenB
Token ID33
Feature activation-1.34e+00

Pos logit contributions
 unders+1.160
��+1.156
COMPLE+1.143
aja+1.122
 Obj+1.099
Neg logit contributions
agate-1.112
ince-1.097
theless-1.071
cius-1.057
 Difference-1.053
Tokeniz
Token ID528
Feature activation-1.71e+00

Pos logit contributions
 unders+0.889
��+0.882
aja+0.861
COMPLE+0.859
 Obj+0.838
Neg logit contributions
agate-0.924
ince-0.915
theless-0.890
 Difference-0.881
cius-0.879
Tokenarrow
Token ID6018
Feature activation-5.25e+00

Pos logit contributions
 unders+1.161
��+1.146
COMPLE+1.119
aja+1.113
 Obj+1.097
Neg logit contributions
agate-1.145
ince-1.129
theless-1.097
 Difference-1.086
cius-1.083
Token Green
Token ID3469
Feature activation-1.63e+01

Pos logit contributions
 unders+3.641
��+3.526
aja+3.515
COMPLE+3.468
 Obj+3.406
Neg logit contributions
agate-3.435
ince-3.357
theless-3.232
cius-3.231
 Difference-3.148
Token Arrow
Token ID19408
Feature activation-8.09e+00

Pos logit contributions
 unders+10.553
aja+10.283
��+9.933
COMPLE+9.729
 Obj+9.549
Neg logit contributions
agate-9.922
ince-9.319
theless-8.947
cius-8.855
amines-8.508
Token was
Token ID373
Feature activation-2.53e+00

Pos logit contributions
 unders+5.410
aja+5.095
COMPLE+5.069
��+5.035
 Obj+4.955
Neg logit contributions
agate-5.506
ince-5.336
cius-5.172
theless-5.113
ments-4.866
Token the
Token ID262
Feature activation-6.67e+00

Pos logit contributions
 unders+1.774
��+1.723
COMPLE+1.690
aja+1.687
 Obj+1.662
Neg logit contributions
agate-1.661
ince-1.642
theless-1.586
cius-1.579
 Difference-1.566
Token perfect
Token ID2818
Feature activation-8.74e+00

Pos logit contributions
 unders+4.515
��+4.194
COMPLE+4.185
 Obj+4.147
aja+4.145
Neg logit contributions
agate-4.680
ince-4.511
cius-4.378
theless-4.373
ments-4.169
Token safety
Token ID3747
Feature activation-6.38e+00

Pos logit contributions
 unders+6.110
��+5.639
COMPLE+5.597
 Obj+5.561
aja+5.516
Neg logit contributions
agate-5.867
ince-5.592
cius-5.548
theless-5.472
amines-5.112
TokenPerhaps
Token ID13710
Feature activation+5.46e+00

Pos logit contributions
ince+3.528
agate+3.466
 Difference+3.274
ments+3.269
cius+3.251
Neg logit contributions
��-4.332
 unders-4.274
 Obj-4.053
��-4.023
COMPLE-4.016
Token with
Token ID351
Feature activation+7.54e+00

Pos logit contributions
ince+3.298
agate+3.254
theless+3.195
 Difference+3.194
ments+3.147
Neg logit contributions
��-4.030
COMPLE-3.862
 unders-3.850
aja-3.785
��-3.757
Token some
Token ID617
Feature activation+8.36e+00

Pos logit contributions
ince+4.562
agate+4.391
 Difference+4.363
ments+4.223
theless+4.168
Neg logit contributions
��-5.569
 unders-5.339
COMPLE-5.299
aja-5.160
��-5.121
Token apolog
Token ID8453
Feature activation-8.49e-01

Pos logit contributions
ince+5.269
 Difference+5.096
agate+5.010
theless+4.864
ments+4.843
Neg logit contributions
��-5.903
aja-5.514
COMPLE-5.484
��-5.441
 unders-5.437
Tokenetic
Token ID5139
Feature activation+3.92e+00

Pos logit contributions
 unders+0.519
��+0.513
COMPLE+0.497
aja+0.495
 Obj+0.487
Neg logit contributions
agate-0.629
ince-0.623
theless-0.608
 Difference-0.601
cius-0.601
Token intent
Token ID6824
Feature activation+1.27e+01

Pos logit contributions
ince+2.532
agate+2.502
 Difference+2.442
theless+2.405
ments+2.395
Neg logit contributions
��-2.785
 unders-2.662
COMPLE-2.654
aja-2.611
 Obj-2.546
Token,
Token ID11
Feature activation+6.49e+00

Pos logit contributions
ince+6.887
ments+6.692
 Difference+6.633
cius+6.424
agate+6.406
Neg logit contributions
��-9.449
COMPLE-9.141
 unders-8.780
��-8.761
geoning-8.613
Token the
Token ID262
Feature activation+3.29e+00

Pos logit contributions
ince+3.981
agate+3.887
 Difference+3.848
theless+3.783
ments+3.739
Neg logit contributions
��-4.670
COMPLE-4.507
 unders-4.491
aja-4.370
��-4.331
Token late
Token ID2739
Feature activation+1.41e+00

Pos logit contributions
ince+2.077
agate+2.076
 Difference+2.026
theless+1.987
cius+1.970
Neg logit contributions
��-2.341
COMPLE-2.255
 unders-2.248
aja-2.210
 Obj-2.167
Token English
Token ID3594
Feature activation+4.02e+00

Pos logit contributions
agate+0.928
ince+0.927
theless+0.895
 Difference+0.894
cius+0.883
Neg logit contributions
��-0.967
 unders-0.957
COMPLE-0.934
aja-0.919
 Obj-0.907
Token scholar
Token ID15606
Feature activation+2.39e+00

Pos logit contributions
ince+2.580
agate+2.553
 Difference+2.487
theless+2.455
ments+2.445
Neg logit contributions
��-2.814
 unders-2.750
COMPLE-2.731
aja-2.654
 Obj-2.630
Token.
Token ID13
Feature activation+1.22e+00

Pos logit contributions
agate+0.252
ince+0.250
theless+0.240
 Difference+0.239
cius+0.238
Neg logit contributions
 unders-0.291
��-0.290
COMPLE-0.282
aja-0.279
 Obj-0.276
Token In
Token ID554
Feature activation+4.69e+00

Pos logit contributions
agate+0.772
ince+0.767
 Difference+0.745
theless+0.734
cius+0.728
Neg logit contributions
 unders-0.876
��-0.873
COMPLE-0.841
aja-0.835
 Obj-0.819
Token this
Token ID428
Feature activation+9.53e+00

Pos logit contributions
ince+2.999
agate+2.974
 Difference+2.927
cius+2.848
theless+2.831
Neg logit contributions
��-3.286
 unders-3.247
COMPLE-3.147
aja-3.132
��-3.081
Token sense
Token ID2565
Feature activation+4.11e+00

Pos logit contributions
agate+4.940
ince+4.936
 Difference+4.888
cius+4.660
ments+4.614
Neg logit contributions
��-7.596
aja-7.154
COMPLE-7.146
 unders-7.122
��-7.013
Token,
Token ID11
Feature activation+4.94e+00

Pos logit contributions
ince+2.477
agate+2.461
 Difference+2.393
cius+2.345
theless+2.340
Neg logit contributions
��-3.030
 unders-3.025
COMPLE-2.949
aja-2.906
 Obj-2.846
Token for
Token ID329
Feature activation+1.16e+01

Pos logit contributions
ince+3.044
agate+3.027
 Difference+2.964
theless+2.872
cius+2.867
Neg logit contributions
��-3.582
 unders-3.514
COMPLE-3.476
aja-3.410
��-3.359
Token Nietzsche
Token ID40258
Feature activation+7.63e+00

Pos logit contributions
ince+6.082
 Difference+6.035
agate+5.766
ments+5.757
theless+5.707
Neg logit contributions
��-8.774
 unders-8.552
COMPLE-8.470
��-8.328
aja-8.288
Token,
Token ID11
Feature activation+4.49e+00

Pos logit contributions
ince+4.264
agate+4.240
 Difference+4.116
theless+4.043
ments+4.025
Neg logit contributions
��-5.762
 unders-5.653
COMPLE-5.604
aja-5.464
 Obj-5.375
Token the
Token ID262
Feature activation+4.94e+00

Pos logit contributions
agate+2.742
ince+2.734
 Difference+2.648
theless+2.572
cius+2.571
Neg logit contributions
��-3.282
 unders-3.236
COMPLE-3.173
aja-3.133
��-3.060
Token good
Token ID922
Feature activation+3.42e+00

Pos logit contributions
ince+3.260
agate+3.249
 Difference+3.187
cius+3.102
theless+3.084
Neg logit contributions
��-3.374
 unders-3.274
COMPLE-3.220
aja-3.188
 Obj-3.107
Token of
Token ID286
Feature activation+2.39e+00

Pos logit contributions
ince+2.109
agate+2.103
 Difference+2.014
theless+1.999
cius+1.996
Neg logit contributions
��-2.486
 unders-2.469
COMPLE-2.417
aja-2.382
 Obj-2.361
Tokenark
Token ID668
Feature activation-4.38e+00

Pos logit contributions
 unders+2.795
aja+2.767
COMPLE+2.590
��+2.586
 Obj+2.521
Neg logit contributions
theless-4.675
agate-4.574
 Difference-4.537
ince-4.514
cius-4.420
Tokenowski
Token ID12079
Feature activation-3.36e+00

Pos logit contributions
 unders+1.947
aja+1.920
COMPLE+1.819
��+1.779
 Obj+1.738
Neg logit contributions
agate-3.917
theless-3.906
ince-3.855
 Difference-3.799
cius-3.744
Token:
Token ID25
Feature activation+2.45e+00

Pos logit contributions
 unders+2.219
COMPLE+2.153
aja+2.140
 Obj+2.099
��+2.069
Neg logit contributions
agate-2.310
theless-2.288
ince-2.255
cius-2.214
 Difference-2.160
Token I
Token ID314
Feature activation+2.68e+00

Pos logit contributions
ince+1.521
agate+1.516
 Difference+1.487
theless+1.448
ments+1.436
Neg logit contributions
��-1.795
 unders-1.772
COMPLE-1.709
aja-1.690
 Obj-1.660
Token would
Token ID561
Feature activation-4.79e+00

Pos logit contributions
ince+1.902
agate+1.897
theless+1.810
cius+1.807
 Difference+1.803
Neg logit contributions
��-1.735
 unders-1.681
COMPLE-1.655
aja-1.608
 Obj-1.604
Token say
Token ID910
Feature activation+9.63e+00

Pos logit contributions
 unders+3.386
aja+3.182
��+3.181
COMPLE+3.169
 Obj+3.159
Neg logit contributions
agate-3.134
ince-3.055
theless-3.050
cius-3.007
 Difference-2.959
Token if
Token ID611
Feature activation-1.76e+00

Pos logit contributions
ince+5.276
 Difference+5.036
agate+5.018
ments+4.941
theless+4.881
Neg logit contributions
��-7.575
COMPLE-7.046
��-6.934
aja-6.816
 unders-6.771
Token you
Token ID345
Feature activation-6.70e+00

Pos logit contributions
 unders+1.230
��+1.205
COMPLE+1.176
aja+1.170
 Obj+1.158
Neg logit contributions
agate-1.162
ince-1.148
theless-1.111
 Difference-1.098
cius-1.098
Token look
Token ID804
Feature activation+1.52e+00

Pos logit contributions
 unders+4.368
aja+4.139
��+4.031
 Obj+3.969
COMPLE+3.956
Neg logit contributions
agate-4.691
theless-4.566
ince-4.555
 Difference-4.467
cius-4.450
Token across
Token ID1973
Feature activation+2.04e-01

Pos logit contributions
agate+0.968
ince+0.964
 Difference+0.917
theless+0.913
cius+0.911
Neg logit contributions
��-1.078
 unders-1.066
COMPLE-1.044
aja-1.030
 Obj-1.020
Token this
Token ID428
Feature activation-1.42e+00

Pos logit contributions
agate+0.131
ince+0.130
theless+0.125
 Difference+0.124
cius+0.124
Neg logit contributions
 unders-0.145
��-0.145
COMPLE-0.140
aja-0.139
 Obj-0.137
Token (
Token ID357
Feature activation-7.82e-01

Pos logit contributions
 unders+2.594
��+2.510
COMPLE+2.475
aja+2.460
 Obj+2.436
Neg logit contributions
agate-2.917
ince-2.892
theless-2.841
cius-2.796
 Difference-2.769
TokenMay
Token ID6747
Feature activation-4.93e+00

Pos logit contributions
 unders+0.593
��+0.589
COMPLE+0.578
aja+0.572
 Obj+0.564
Neg logit contributions
agate-0.464
ince-0.459
theless-0.442
 Difference-0.438
cius-0.438
Token 2013
Token ID2211
Feature activation-3.44e+00

Pos logit contributions
 unders+2.915
aja+2.759
COMPLE+2.748
��+2.734
 Obj+2.697
Neg logit contributions
agate-3.734
ince-3.692
theless-3.591
cius-3.580
 Difference-3.530
Token),
Token ID828
Feature activation-9.98e-01

Pos logit contributions
 unders+2.149
��+2.095
COMPLE+2.055
aja+2.049
 Obj+2.013
Neg logit contributions
agate-2.496
ince-2.461
theless-2.403
cius-2.388
 Difference-2.374
Token S
Token ID311
Feature activation-1.05e+01

Pos logit contributions
 unders+0.683
��+0.675
COMPLE+0.656
aja+0.651
 Obj+0.645
Neg logit contributions
agate-0.668
ince-0.663
theless-0.640
cius-0.634
 Difference-0.632
Tokenout
Token ID448
Feature activation+9.50e+00

Pos logit contributions
aja+7.135
 unders+7.027
��+6.895
COMPLE+6.748
 Obj+6.549
Neg logit contributions
theless-6.313
agate-6.208
ince-6.145
cius-5.946
 Difference-5.911
Tokenron
Token ID1313
Feature activation+2.21e+00

Pos logit contributions
agate+5.173
ince+5.080
ments+4.783
het+4.738
 Difference+4.705
Neg logit contributions
��-7.595
 unders-7.420
COMPLE-7.206
 Obj-7.068
��-7.030
Token Global
Token ID8060
Feature activation-4.11e+00

Pos logit contributions
agate+1.448
ince+1.436
 Difference+1.376
cius+1.371
theless+1.369
Neg logit contributions
 unders-1.550
��-1.548
COMPLE-1.500
aja-1.475
 Obj-1.468
Token (
Token ID357
Feature activation-2.70e+00

Pos logit contributions
 unders+2.834
��+2.765
aja+2.712
COMPLE+2.699
 Obj+2.673
Neg logit contributions
agate-2.697
ince-2.662
theless-2.602
cius-2.543
 Difference-2.491
TokenMarch
Token ID16192
Feature activation-5.73e+00

Pos logit contributions
 unders+2.023
��+2.004
COMPLE+1.999
aja+1.972
 Obj+1.921
Neg logit contributions
agate-1.595
ince-1.562
theless-1.539
 Difference-1.524
cius-1.521
Token 2013
Token ID2211
Feature activation-2.85e+00

Pos logit contributions
 unders+3.276
aja+3.106
��+3.055
COMPLE+3.047
 Obj+3.024
Neg logit contributions
agate-4.460
ince-4.382
cius-4.270
theless-4.253
 Difference-4.161
Token least
Token ID1551
Feature activation+2.38e+00

Pos logit contributions
agate+1.140
ince+1.128
 Difference+1.084
theless+1.076
cius+1.074
Neg logit contributions
��-1.214
 unders-1.205
COMPLE-1.171
aja-1.157
 Obj-1.140
Token one
Token ID530
Feature activation+8.24e-01

Pos logit contributions
agate+1.460
ince+1.457
 Difference+1.391
theless+1.389
cius+1.377
Neg logit contributions
��-1.749
 unders-1.728
COMPLE-1.687
aja-1.668
 Obj-1.646
Token thing
Token ID1517
Feature activation+5.75e+00

Pos logit contributions
agate+0.552
ince+0.549
theless+0.528
 Difference+0.527
cius+0.524
Neg logit contributions
��-0.560
 unders-0.559
COMPLE-0.541
aja-0.536
 Obj-0.530
Token right
Token ID826
Feature activation+4.55e+00

Pos logit contributions
ince+3.490
agate+3.413
ments+3.305
cius+3.270
 Difference+3.269
Neg logit contributions
��-4.278
 unders-4.082
aja-4.007
COMPLE-3.993
 Obj-3.963
Token.
Token ID13
Feature activation-4.40e-01

Pos logit contributions
agate+2.703
ince+2.702
cius+2.574
theless+2.547
 Difference+2.546
Neg logit contributions
��-3.441
 unders-3.369
COMPLE-3.294
aja-3.214
 Obj-3.187
Token By
Token ID2750
Feature activation+1.26e+01

Pos logit contributions
 unders+0.307
��+0.304
COMPLE+0.297
aja+0.294
 Obj+0.291
Neg logit contributions
agate-0.288
ince-0.286
theless-0.276
cius-0.273
 Difference-0.272
Token chance
Token ID2863
Feature activation+8.89e-01

Pos logit contributions
ince+7.381
 Difference+7.230
agate+7.177
cius+7.142
theless+7.071
Neg logit contributions
��-8.774
 unders-8.075
COMPLE-7.947
aja-7.932
��-7.929
Token,
Token ID11
Feature activation+2.44e+00

Pos logit contributions
agate+0.561
ince+0.556
theless+0.536
 Difference+0.532
cius+0.530
Neg logit contributions
��-0.640
 unders-0.638
COMPLE-0.619
aja-0.613
 Obj-0.607
Token a
Token ID257
Feature activation+2.41e+00

Pos logit contributions
agate+1.519
ince+1.508
theless+1.450
 Difference+1.448
cius+1.436
Neg logit contributions
��-1.784
 unders-1.753
COMPLE-1.712
aja-1.694
 Obj-1.676
Token Babylon
Token ID28028
Feature activation+1.29e+00

Pos logit contributions
agate+1.637
ince+1.633
 Difference+1.583
theless+1.579
cius+1.565
Neg logit contributions
��-1.599
 unders-1.579
COMPLE-1.535
aja-1.516
 Obj-1.502
Tokenian
Token ID666
Feature activation-1.09e+00

Pos logit contributions
agate+0.520
ince+0.513
 Difference+0.477
theless+0.476
cius+0.473
Neg logit contributions
��-1.224
 unders-1.221
COMPLE-1.195
aja-1.176
 Obj-1.175
Token but
Token ID475
Feature activation+5.26e+00

Pos logit contributions
ince+1.435
agate+1.425
theless+1.371
 Difference+1.354
cius+1.343
Neg logit contributions
��-1.995
 unders-1.968
COMPLE-1.923
aja-1.899
 Obj-1.871
Token two
Token ID734
Feature activation+2.61e+00

Pos logit contributions
ince+2.580
agate+2.580
theless+2.433
 Difference+2.433
ments+2.411
Neg logit contributions
��-4.456
 unders-4.331
COMPLE-4.261
 Obj-4.198
aja-4.188
Token different
Token ID1180
Feature activation+1.01e+00

Pos logit contributions
agate+1.695
ince+1.693
theless+1.634
 Difference+1.627
cius+1.617
Neg logit contributions
��-1.825
 unders-1.781
COMPLE-1.748
aja-1.719
 Obj-1.701
Token concepts
Token ID10838
Feature activation+4.93e+00

Pos logit contributions
agate+0.678
ince+0.673
theless+0.649
 Difference+0.644
cius+0.643
Neg logit contributions
 unders-0.684
��-0.682
COMPLE-0.661
aja-0.654
 Obj-0.647
Token from
Token ID422
Feature activation+1.30e+00

Pos logit contributions
ince+3.024
agate+2.988
cius+2.847
theless+2.825
 Difference+2.809
Neg logit contributions
��-3.625
 unders-3.572
COMPLE-3.526
 Obj-3.410
aja-3.410
Token the
Token ID262
Feature activation+5.04e+00

Pos logit contributions
agate+0.846
ince+0.840
theless+0.808
 Difference+0.805
cius+0.800
Neg logit contributions
 unders-0.905
��-0.905
COMPLE-0.877
aja-0.867
 Obj-0.856
Token arg
Token ID1822
Feature activation+4.27e+00

Pos logit contributions
ince+3.335
agate+3.311
 Difference+3.237
theless+3.225
cius+3.153
Neg logit contributions
��-3.422
 unders-3.343
COMPLE-3.277
aja-3.263
��-3.209
Tokenuable
Token ID7153
Feature activation-2.30e+00

Pos logit contributions
ince+2.973
agate+2.921
cius+2.793
 Difference+2.793
ments+2.778
Neg logit contributions
 unders-2.804
��-2.799
COMPLE-2.682
aja-2.619
 Obj-2.611
Token king
Token ID5822
Feature activation+8.26e-01

Pos logit contributions
 unders+1.560
��+1.525
COMPLE+1.485
aja+1.478
 Obj+1.466
Neg logit contributions
agate-1.574
ince-1.546
cius-1.491
theless-1.490
 Difference-1.470
Token of
Token ID286
Feature activation+2.92e+00

Pos logit contributions
agate+0.530
ince+0.527
theless+0.506
 Difference+0.503
cius+0.503
Neg logit contributions
 unders-0.587
��-0.584
COMPLE-0.568
aja-0.560
 Obj-0.556
Token curse
Token ID17328
Feature activation+1.93e+00

Pos logit contributions
agate+1.935
ince+1.923
theless+1.853
 Difference+1.852
cius+1.826
Neg logit contributions
��-1.995
 unders-1.966
COMPLE-1.937
aja-1.911
 Obj-1.880
Token
Token ID251
Feature activation+2.56e-01

Pos logit contributions
agate+1.005
ince+0.997
 Difference+0.970
cius+0.967
theless+0.966
Neg logit contributions
 unders-0.491
��-0.477
COMPLE-0.465
aja-0.459
 Obj-0.450
Token
Token ID198
Feature activation+9.95e+00

Pos logit contributions
agate+0.161
ince+0.160
theless+0.154
 Difference+0.153
cius+0.152
Neg logit contributions
 unders-0.185
��-0.184
COMPLE-0.178
aja-0.177
 Obj-0.175
Token
Token ID198
Feature activation+6.83e+00

Pos logit contributions
ince+4.333
ments+4.162
agate+4.154
 Difference+3.975
het+3.638
Neg logit contributions
��-9.429
��-9.190
 unders-8.576
-8.361
 Adin-8.310
TokenT
Token ID51
Feature activation+1.78e+00

Pos logit contributions
ince+3.271
agate+3.236
ments+3.014
 Difference+2.999
cius+2.957
Neg logit contributions
 unders-5.839
��-5.802
 Obj-5.608
aja-5.510
��-5.462
Tokenoward
Token ID46138
Feature activation+6.57e-01

Pos logit contributions
agate+1.305
ince+1.299
 Difference+1.241
theless+1.240
cius+1.237
Neg logit contributions
��-1.104
 unders-1.101
COMPLE-1.060
 Obj-1.043
aja-1.039
Token the
Token ID262
Feature activation-7.67e-01

Pos logit contributions
agate+0.416
ince+0.413
theless+0.397
 Difference+0.395
cius+0.394
Neg logit contributions
 unders-0.471
��-0.469
COMPLE-0.456
aja-0.452
 Obj-0.446
Token end
Token ID886
Feature activation+6.14e+00

Pos logit contributions
 unders+0.500
��+0.494
COMPLE+0.480
aja+0.475
 Obj+0.470
Neg logit contributions
agate-0.541
ince-0.535
theless-0.518
cius-0.514
 Difference-0.513
Token of
Token ID286
Feature activation+4.85e+00

Pos logit contributions
ince+3.597
agate+3.476
cius+3.413
 Difference+3.340
theless+3.314
Neg logit contributions
 unders-4.565
��-4.537
COMPLE-4.486
aja-4.392
 Obj-4.365
Token the
Token ID262
Feature activation+8.33e+00

Pos logit contributions
ince+2.850
agate+2.795
 Difference+2.685
cius+2.669
ments+2.648
Neg logit contributions
��-3.641
 unders-3.627
COMPLE-3.553
aja-3.539
 Obj-3.486
Token clip
Token ID10651
Feature activation+3.64e+00

Pos logit contributions
ince+5.537
agate+5.345
 Difference+5.282
theless+5.229
cius+5.215
Neg logit contributions
��-5.468
aja-5.213
COMPLE-5.207
 unders-5.173
 Obj-5.103
Token,
Token ID11
Feature activation-7.22e-01

Pos logit contributions
agate+2.181
ince+2.176
cius+2.080
 Difference+2.070
theless+2.068
Neg logit contributions
 unders-2.671
��-2.670
COMPLE-2.619
aja-2.593
 Obj-2.573
Tokenout
Token ID448
Feature activation+9.55e-01

Pos logit contributions
ince+2.463
agate+2.317
ments+1.973
cius+1.916
theless+1.913
Neg logit contributions
��-7.670
��-7.317
COMPLE-7.230
 unders-7.222
 Obj-7.106
Token app
Token ID598
Feature activation+4.88e-01

Pos logit contributions
agate+0.633
ince+0.630
 Difference+0.603
theless+0.602
cius+0.598
Neg logit contributions
��-0.658
 unders-0.653
COMPLE-0.632
aja-0.629
 Obj-0.617
Tokendata
Token ID7890
Feature activation+1.82e+00

Pos logit contributions
agate+0.389
ince+0.386
theless+0.375
 Difference+0.373
cius+0.372
Neg logit contributions
 unders-0.271
��-0.268
COMPLE-0.259
aja-0.256
 Obj-0.252
Token _
Token ID4808
Feature activation-1.98e-01

Pos logit contributions
agate+1.171
ince+1.163
 Difference+1.116
ments+1.101
theless+1.099
Neg logit contributions
��-1.308
 unders-1.277
COMPLE-1.235
aja-1.226
 Obj-1.201
Token full
Token ID1336
Feature activation+2.02e-01

Pos logit contributions
 unders+0.093
��+0.092
COMPLE+0.088
aja+0.087
 Obj+0.086
Neg logit contributions
agate-0.175
ince-0.174
theless-0.169
 Difference-0.169
cius-0.168
Token v
Token ID410
Feature activation+3.63e+00

Pos logit contributions
agate+0.149
ince+0.148
theless+0.143
 Difference+0.143
cius+0.142
Neg logit contributions
��-0.124
 unders-0.123
COMPLE-0.119
aja-0.118
 Obj-0.116
Token )
Token ID1267
Feature activation-1.50e+00

Pos logit contributions
agate+1.872
ince+1.871
 Difference+1.758
ments+1.750
cius+1.710
Neg logit contributions
��-3.127
 unders-2.976
COMPLE-2.918
��-2.884
 Obj-2.863
Token {
Token ID1391
Feature activation-1.44e+00

Pos logit contributions
 unders+0.954
��+0.934
COMPLE+0.923
aja+0.917
 Obj+0.907
Neg logit contributions
agate-1.068
ince-1.055
theless-1.033
cius-1.018
 Difference-1.010
Token //
Token ID3373
Feature activation-4.23e+00

Pos logit contributions
 unders+0.660
��+0.647
COMPLE+0.621
aja+0.614
 Obj+0.607
Neg logit contributions
agate-1.292
ince-1.281
theless-1.245
cius-1.242
 Difference-1.230
Token Time
Token ID3862
Feature activation+4.43e+00

Pos logit contributions
 unders+2.378
��+2.305
 Obj+2.258
COMPLE+2.236
aja+2.230
Neg logit contributions
agate-3.306
ince-3.286
theless-3.230
cius-3.151
ments-3.064
Token component
Token ID7515
Feature activation+4.73e+00

Pos logit contributions
ince+2.735
agate+2.672
 Difference+2.590
ments+2.566
cius+2.546
Neg logit contributions
��-3.337
 unders-3.174
aja-3.090
COMPLE-3.070
��-3.067
Token it
Token ID340
Feature activation+5.74e+00

Pos logit contributions
agate+3.035
ince+3.017
 Difference+2.883
theless+2.880
ments+2.866
Neg logit contributions
��-4.219
 unders-4.060
COMPLE-4.039
��-3.968
aja-3.956
Token's
Token ID338
Feature activation+9.51e+00

Pos logit contributions
agate+3.126
ince+3.118
ments+2.971
theless+2.965
 Difference+2.897
Neg logit contributions
��-4.610
 unders-4.410
��-4.402
COMPLE-4.392
 Obj-4.305
Token probably
Token ID2192
Feature activation+1.00e+01

Pos logit contributions
ince+5.676
agate+5.515
ments+5.381
 Difference+5.347
theless+5.339
Neg logit contributions
��-6.893
��-6.569
 Obj-6.352
aja-6.328
COMPLE-6.300
Token just
Token ID655
Feature activation+8.75e+00

Pos logit contributions
ince+5.969
agate+5.713
ments+5.557
cius+5.520
theless+5.509
Neg logit contributions
��-7.357
��-6.943
aja-6.670
 Obj-6.663
COMPLE-6.605
Token a
Token ID257
Feature activation+5.37e+00

Pos logit contributions
ince+5.388
agate+5.194
ments+5.032
theless+4.975
cius+4.930
Neg logit contributions
��-6.320
��-6.007
COMPLE-5.930
 unders-5.881
aja-5.867
Token bad
Token ID2089
Feature activation+2.89e-01

Pos logit contributions
ince+3.754
agate+3.699
 Difference+3.568
ments+3.558
cius+3.541
Neg logit contributions
��-3.428
 unders-3.379
COMPLE-3.302
aja-3.269
 Obj-3.216
Token cold
Token ID4692
Feature activation-2.56e+00

Pos logit contributions
agate+0.204
ince+0.202
theless+0.195
 Difference+0.194
cius+0.194
Neg logit contributions
 unders-0.187
��-0.186
COMPLE-0.180
aja-0.178
 Obj-0.176
Token,
Token ID11
Feature activation+3.18e+00

Pos logit contributions
 unders+1.765
��+1.722
aja+1.696
COMPLE+1.688
 Obj+1.673
Neg logit contributions
agate-1.702
ince-1.679
theless-1.644
 Difference-1.610
cius-1.603
Token but
Token ID475
Feature activation+6.11e+00

Pos logit contributions
ince+1.833
agate+1.830
theless+1.751
 Difference+1.751
ments+1.729
Neg logit contributions
��-2.457
 unders-2.396
COMPLE-2.357
aja-2.333
 Obj-2.298
Token a
Token ID257
Feature activation+4.79e+00

Pos logit contributions
agate+3.494
ince+3.465
theless+3.397
 Difference+3.351
ments+3.328
Neg logit contributions
��-4.649
COMPLE-4.513
 unders-4.479
��-4.417
aja-4.375
Token doctor
Token ID6253
Feature activation+1.93e+00

Pos logit contributions
ince+3.360
agate+3.287
 Difference+3.221
theless+3.206
cius+3.195
Neg logit contributions
��-3.035
 unders-3.008
COMPLE-2.951
 Obj-2.893
aja-2.862
Token compromise
Token ID13110
Feature activation-3.22e+00

Pos logit contributions
 unders+0.685
��+0.674
COMPLE+0.657
aja+0.650
 Obj+0.643
Neg logit contributions
agate-0.665
ince-0.656
theless-0.635
cius-0.628
 Difference-0.627
Token with
Token ID351
Feature activation-7.89e-01

Pos logit contributions
 unders+2.124
��+2.056
COMPLE+2.016
aja+1.983
 Obj+1.964
Neg logit contributions
agate-2.264
ince-2.234
theless-2.164
cius-2.141
 Difference-2.130
Token the
Token ID262
Feature activation-1.22e+00

Pos logit contributions
 unders+0.561
��+0.553
COMPLE+0.539
aja+0.534
 Obj+0.528
Neg logit contributions
agate-0.511
ince-0.505
theless-0.487
cius-0.482
 Difference-0.482
Token reigning
Token ID40098
Feature activation-1.32e-02

Pos logit contributions
 unders+0.802
��+0.791
COMPLE+0.769
aja+0.759
 Obj+0.752
Neg logit contributions
agate-0.848
ince-0.839
theless-0.811
cius-0.804
 Difference-0.803
Token Republicans
Token ID4734
Feature activation-1.45e+00

Pos logit contributions
 unders+0.009
��+0.009
COMPLE+0.008
aja+0.008
 Obj+0.008
Neg logit contributions
agate-0.009
ince-0.009
theless-0.009
 Difference-0.009
cius-0.009
Token,
Token ID11
Feature activation-2.23e-01

Pos logit contributions
 unders+0.986
��+0.970
COMPLE+0.944
aja+0.939
 Obj+0.926
Neg logit contributions
agate-0.980
ince-0.964
theless-0.945
 Difference-0.931
cius-0.926
Token while
Token ID981
Feature activation-2.48e+00

Pos logit contributions
 unders+0.155
��+0.153
COMPLE+0.149
aja+0.148
 Obj+0.146
Neg logit contributions
agate-0.148
ince-0.147
theless-0.142
 Difference-0.140
cius-0.140
Token Evans
Token ID13922
Feature activation+3.40e+00

Pos logit contributions
 unders+1.662
��+1.608
COMPLE+1.585
aja+1.568
 Obj+1.547
Neg logit contributions
agate-1.713
ince-1.679
cius-1.623
theless-1.619
 Difference-1.603
Token and
Token ID290
Feature activation-7.04e-01

Pos logit contributions
ince+2.125
agate+2.120
theless+2.031
 Difference+2.006
cius+2.005
Neg logit contributions
��-2.462
 unders-2.409
COMPLE-2.360
aja-2.344
 Obj-2.320
Token others
Token ID1854
Feature activation+6.43e-01

Pos logit contributions
 unders+0.491
��+0.485
COMPLE+0.473
aja+0.468
 Obj+0.463
Neg logit contributions
agate-0.464
ince-0.459
theless-0.443
cius-0.440
 Difference-0.438
Token have
Token ID423
Feature activation+2.62e+00

Pos logit contributions
agate+0.420
ince+0.417
theless+0.401
 Difference+0.398
cius+0.398
Neg logit contributions
 unders-0.450
��-0.448
COMPLE-0.435
aja-0.431
 Obj-0.426
Token the
Token ID262
Feature activation-1.05e+00

Pos logit contributions
 unders+0.283
��+0.281
COMPLE+0.273
aja+0.271
 Obj+0.267
Neg logit contributions
agate-0.265
ince-0.263
theless-0.253
 Difference-0.251
cius-0.251
Token next
Token ID1306
Feature activation-3.49e+00

Pos logit contributions
 unders+0.712
��+0.703
COMPLE+0.683
aja+0.678
 Obj+0.671
Neg logit contributions
agate-0.709
ince-0.702
theless-0.679
cius-0.672
 Difference-0.668
Token generation
Token ID5270
Feature activation+1.87e+00

Pos logit contributions
 unders+2.201
��+2.115
aja+2.075
COMPLE+2.074
 Obj+2.049
Neg logit contributions
agate-2.560
ince-2.499
theless-2.482
cius-2.447
 Difference-2.371
Token of
Token ID286
Feature activation-1.31e-03

Pos logit contributions
ince+1.190
agate+1.188
 Difference+1.140
theless+1.138
cius+1.127
Neg logit contributions
��-1.330
 unders-1.317
COMPLE-1.290
aja-1.267
 Obj-1.256
Token Ron
Token ID6575
Feature activation+6.96e-02

Pos logit contributions
 unders+0.001
��+0.001
COMPLE+0.001
aja+0.001
 Obj+0.001
Neg logit contributions
agate-0.001
ince-0.001
theless-0.001
cius-0.001
 Difference-0.001
Token D
Token ID360
Feature activation-2.84e+00

Pos logit contributions
agate+0.044
ince+0.043
theless+0.042
 Difference+0.042
cius+0.041
Neg logit contributions
 unders-0.050
��-0.050
COMPLE-0.048
aja-0.048
 Obj-0.048
Tokenenn
Token ID1697
Feature activation+2.39e+00

Pos logit contributions
 unders+1.837
aja+1.804
��+1.783
 Obj+1.713
COMPLE+1.702
Neg logit contributions
agate-1.975
theless-1.971
ince-1.922
cius-1.892
 Difference-1.891
Tokenises
Token ID2696
Feature activation+4.72e+00

Pos logit contributions
agate+1.560
ince+1.538
theless+1.465
cius+1.464
 Difference+1.457
Neg logit contributions
��-1.700
 unders-1.681
COMPLE-1.645
 Obj-1.610
aja-1.603
Token and
Token ID290
Feature activation+2.98e+00

Pos logit contributions
ince+2.962
agate+2.933
cius+2.808
 Difference+2.779
theless+2.764
Neg logit contributions
��-3.425
 unders-3.359
COMPLE-3.315
 Obj-3.221
aja-3.195
Token Frank
Token ID5278
Feature activation+4.83e+00

Pos logit contributions
agate+1.958
ince+1.954
 Difference+1.885
theless+1.879
cius+1.863
Neg logit contributions
��-2.043
 unders-2.025
COMPLE-1.978
aja-1.940
 Obj-1.920
Token Williams
Token ID6484
Feature activation+6.46e+00

Pos logit contributions
agate+3.238
ince+3.234
 Difference+3.105
theless+3.095
ments+3.086
Neg logit contributions
��-3.270
 unders-3.216
COMPLE-3.149
aja-3.103
 Obj-3.058
Tokenars
Token ID945
Feature activation+8.35e-01

Pos logit contributions
agate+2.412
ince+2.371
ments+2.308
cius+2.301
 Difference+2.280
Neg logit contributions
��-1.031
 unders-0.996
COMPLE-0.960
 Obj-0.944
��-0.911
Token have
Token ID423
Feature activation+9.54e-01

Pos logit contributions
agate+0.542
ince+0.537
theless+0.518
cius+0.513
 Difference+0.513
Neg logit contributions
��-0.587
 unders-0.586
COMPLE-0.569
aja-0.562
 Obj-0.557
Token since
Token ID1201
Feature activation-3.33e+00

Pos logit contributions
agate+0.605
ince+0.599
theless+0.578
 Difference+0.572
cius+0.571
Neg logit contributions
��-0.684
 unders-0.683
COMPLE-0.663
aja-0.657
 Obj-0.651
Token denied
Token ID6699
Feature activation+4.43e+00

Pos logit contributions
 unders+2.313
��+2.188
aja+2.165
COMPLE+2.164
 Obj+2.117
Neg logit contributions
agate-2.227
ince-2.204
 Difference-2.128
cius-2.127
theless-2.102
Token that
Token ID326
Feature activation+3.15e-01

Pos logit contributions
ince+2.797
agate+2.777
theless+2.636
 Difference+2.606
cius+2.604
Neg logit contributions
��-3.226
 unders-3.124
COMPLE-3.049
aja-3.022
 Obj-3.019
Token they
Token ID484
Feature activation-4.63e+00

Pos logit contributions
agate+0.203
ince+0.201
theless+0.193
 Difference+0.192
cius+0.192
Neg logit contributions
 unders-0.223
��-0.222
COMPLE-0.216
aja-0.214
 Obj-0.211
Token purposefully
Token ID45136
Feature activation-6.02e-01

Pos logit contributions
 unders+3.013
aja+2.854
��+2.801
 Obj+2.788
COMPLE+2.777
Neg logit contributions
agate-3.191
ince-3.157
 Difference-3.121
theless-3.071
cius-3.063
Token chose
Token ID7690
Feature activation+4.62e+00

Pos logit contributions
 unders+0.412
��+0.406
COMPLE+0.395
aja+0.393
 Obj+0.387
Neg logit contributions
agate-0.404
ince-0.400
theless-0.386
 Difference-0.384
cius-0.383
Token Hut
Token ID32767
Feature activation-3.96e+00

Pos logit contributions
ince+2.812
agate+2.791
theless+2.648
 Difference+2.621
ments+2.614
Neg logit contributions
��-3.466
 unders-3.350
COMPLE-3.297
 Obj-3.263
aja-3.244
Tokenchens
Token ID29937
Feature activation-1.34e+00

Pos logit contributions
 unders+2.573
COMPLE+2.433
aja+2.426
��+2.384
 Obj+2.297
Neg logit contributions
ince-2.707
theless-2.704
 Difference-2.700
agate-2.688
uyomi-2.590
Token.
Token ID13
Feature activation+6.49e-01

Pos logit contributions
 unders+0.911
��+0.886
COMPLE+0.865
aja+0.865
 Obj+0.852
Neg logit contributions
agate-0.898
ince-0.890
theless-0.864
 Difference-0.859
cius-0.855
TokenBT
Token ID19313
Feature activation-2.14e-01

Pos logit contributions
 unders+1.045
��+1.034
COMPLE+1.010
aja+1.000
 Obj+0.985
Neg logit contributions
agate-0.954
ince-0.943
theless-0.912
cius-0.905
 Difference-0.899
Token lifestyle
Token ID12263
Feature activation-4.03e+00

Pos logit contributions
 unders+0.156
��+0.155
COMPLE+0.151
aja+0.149
 Obj+0.148
Neg logit contributions
agate-0.134
ince-0.133
theless-0.128
 Difference-0.127
cius-0.127
Token for
Token ID329
Feature activation+3.98e+00

Pos logit contributions
 unders+2.700
��+2.550
COMPLE+2.514
aja+2.483
 Obj+2.470
Neg logit contributions
agate-2.765
ince-2.718
theless-2.674
 Difference-2.643
cius-2.629
Token children
Token ID1751
Feature activation+2.15e+00

Pos logit contributions
ince+2.404
agate+2.398
 Difference+2.312
ments+2.299
theless+2.298
Neg logit contributions
��-2.915
 unders-2.815
COMPLE-2.808
aja-2.786
 Obj-2.772
Token as
Token ID355
Feature activation-3.09e+00

Pos logit contributions
agate+1.361
ince+1.353
theless+1.301
cius+1.291
 Difference+1.283
Neg logit contributions
��-1.547
 unders-1.520
COMPLE-1.494
aja-1.477
 Obj-1.464
Token young
Token ID1862
Feature activation-1.47e+00

Pos logit contributions
 unders+2.077
��+2.017
COMPLE+1.949
aja+1.924
 Obj+1.910
Neg logit contributions
agate-2.136
ince-2.109
theless-2.035
 Difference-2.030
cius-2.030
Token as
Token ID355
Feature activation-2.52e+00

Pos logit contributions
 unders+1.039
��+1.016
COMPLE+0.994
aja+0.985
 Obj+0.971
Neg logit contributions
agate-0.961
ince-0.951
theless-0.924
 Difference-0.914
cius-0.910
Token seven
Token ID3598
Feature activation+2.11e+00

Pos logit contributions
 unders+1.769
��+1.693
COMPLE+1.671
aja+1.647
 Obj+1.636
Neg logit contributions
agate-1.678
ince-1.657
theless-1.615
cius-1.601
 Difference-1.594
Token years
Token ID812
Feature activation+3.71e+00

Pos logit contributions
agate+1.222
ince+1.215
theless+1.153
 Difference+1.148
cius+1.145
Neg logit contributions
��-1.631
 unders-1.617
COMPLE-1.579
aja-1.566
 Obj-1.554
Token old
Token ID1468
Feature activation-1.82e+00

Pos logit contributions
ince+2.189
agate+2.183
cius+2.059
ments+2.057
 Difference+2.053
Neg logit contributions
��-2.774
 unders-2.741
COMPLE-2.728
aja-2.677
 Obj-2.670
Token,"
Token ID553
Feature activation-7.12e-01

Pos logit contributions
 unders+1.268
��+1.237
COMPLE+1.212
aja+1.201
 Obj+1.185
Neg logit contributions
agate-1.201
ince-1.187
theless-1.151
 Difference-1.144
cius-1.140
Token
Token ID564
Feature activation-2.44e+00

Pos logit contributions
agate+1.045
ince+1.045
 Difference+0.996
theless+0.986
cius+0.980
Neg logit contributions
��-1.577
 unders-1.574
COMPLE-1.528
aja-1.518
 Obj-1.502
Token
Token ID250
Feature activation+2.13e+00

Pos logit contributions
��+1.249
 unders+1.246
COMPLE+1.203
aja+1.177
 Obj+1.156
Neg logit contributions
agate-2.045
ince-2.027
theless-2.002
cius-1.964
 Difference-1.961
TokenBut
Token ID1537
Feature activation+3.84e+00

Pos logit contributions
agate+1.287
ince+1.287
 Difference+1.215
theless+1.213
cius+1.212
Neg logit contributions
 unders-1.592
��-1.591
COMPLE-1.521
aja-1.516
 Obj-1.514
Token he
Token ID339
Feature activation-2.06e+00

Pos logit contributions
ince+2.304
agate+2.295
theless+2.213
 Difference+2.209
cius+2.189
Neg logit contributions
��-2.853
 unders-2.796
COMPLE-2.775
aja-2.714
 Obj-2.698
Token
Token ID447
Feature activation-1.34e+00

Pos logit contributions
 unders+1.489
��+1.449
aja+1.418
COMPLE+1.417
 Obj+1.393
Neg logit contributions
agate-1.304
ince-1.290
theless-1.242
 Difference-1.237
cius-1.235
Token
Token ID247
Feature activation-2.72e+00

Pos logit contributions
 unders+0.955
��+0.950
COMPLE+0.919
aja+0.913
 Obj+0.900
Neg logit contributions
agate-0.864
ince-0.856
theless-0.836
cius-0.819
 Difference-0.817
Tokens
Token ID82
Feature activation-2.15e+00

Pos logit contributions
 unders+1.969
��+1.911
aja+1.887
COMPLE+1.882
 Obj+1.840
Neg logit contributions
agate-1.703
ince-1.679
 Difference-1.627
theless-1.620
cius-1.605
Token a
Token ID257
Feature activation+4.51e+00

Pos logit contributions
 unders+1.467
��+1.426
COMPLE+1.391
aja+1.384
 Obj+1.366
Neg logit contributions
agate-1.460
ince-1.436
theless-1.397
cius-1.387
 Difference-1.383
Token sod
Token ID32581
Feature activation+2.48e+00

Pos logit contributions
ince+2.843
agate+2.841
 Difference+2.765
theless+2.712
ments+2.698
Neg logit contributions
��-3.221
 unders-3.130
COMPLE-3.080
aja-3.049
 Obj-2.999
Tokenom
Token ID296
Feature activation+1.26e+00

Pos logit contributions
agate+1.749
ince+1.728
cius+1.647
 Difference+1.644
theless+1.637
Neg logit contributions
 unders-1.623
��-1.617
COMPLE-1.565
 Obj-1.529
aja-1.527
Tokenite
Token ID578
Feature activation+9.80e-01

Pos logit contributions
agate+0.748
ince+0.741
theless+0.704
 Difference+0.702
cius+0.700
Neg logit contributions
 unders-0.963
��-0.961
COMPLE-0.935
aja-0.923
 Obj-0.916
Token you
Token ID345
Feature activation+1.69e+00

Pos logit contributions
 unders+1.308
��+1.285
COMPLE+1.248
aja+1.245
 Obj+1.226
Neg logit contributions
agate-1.278
ince-1.262
theless-1.224
 Difference-1.210
cius-1.210
Token prefer
Token ID4702
Feature activation-3.32e+00

Pos logit contributions
agate+1.066
ince+1.060
theless+1.009
 Difference+1.005
cius+1.005
Neg logit contributions
��-1.224
 unders-1.207
COMPLE-1.187
aja-1.163
 Obj-1.156
Token to
Token ID284
Feature activation+4.30e-01

Pos logit contributions
 unders+2.249
��+2.182
COMPLE+2.125
aja+2.125
 Obj+2.091
Neg logit contributions
agate-2.277
ince-2.248
theless-2.174
cius-2.155
 Difference-2.142
Token sail
Token ID14936
Feature activation-2.13e+00

Pos logit contributions
agate+0.291
ince+0.289
theless+0.278
 Difference+0.277
cius+0.276
Neg logit contributions
 unders-0.289
��-0.289
COMPLE-0.280
aja-0.277
 Obj-0.274
Token yourself
Token ID3511
Feature activation-1.03e+00

Pos logit contributions
 unders+1.455
��+1.416
COMPLE+1.381
aja+1.375
 Obj+1.356
Neg logit contributions
agate-1.441
ince-1.425
theless-1.383
cius-1.369
 Difference-1.366
Token,
Token ID11
Feature activation-1.60e+00

Pos logit contributions
 unders+0.701
��+0.689
COMPLE+0.672
aja+0.669
 Obj+0.659
Neg logit contributions
agate-0.699
ince-0.692
theless-0.669
cius-0.664
 Difference-0.664
Token the
Token ID262
Feature activation-4.92e+00

Pos logit contributions
 unders+1.094
��+1.080
COMPLE+1.048
aja+1.045
 Obj+1.030
Neg logit contributions
agate-1.079
ince-1.067
theless-1.030
cius-1.019
 Difference-1.017
Token cruising
Token ID44339
Feature activation-7.19e+00

Pos logit contributions
 unders+3.319
��+3.203
aja+3.153
COMPLE+3.133
 Obj+3.074
Neg logit contributions
agate-3.372
ince-3.303
theless-3.201
cius-3.151
ments-3.074
Token grounds
Token ID9384
Feature activation-4.36e+00

Pos logit contributions
 unders+4.650
aja+4.405
COMPLE+4.355
��+4.312
 Obj+4.222
Neg logit contributions
agate-5.125
ince-4.863
theless-4.845
cius-4.808
uyomi-4.630
Token of
Token ID286
Feature activation-6.50e+00

Pos logit contributions
 unders+2.833
��+2.777
aja+2.718
COMPLE+2.677
 Obj+2.615
Neg logit contributions
agate-3.071
ince-2.958
theless-2.934
 Difference-2.871
cius-2.850
Token the
Token ID262
Feature activation-5.94e+00

Pos logit contributions
 unders+4.418
��+4.257
aja+4.208
COMPLE+4.121
 Obj+4.105
Neg logit contributions
agate-4.371
ince-4.275
theless-4.132
cius-4.095
ments-3.994
Token court
Token ID2184
Feature activation+2.18e+00

Pos logit contributions
 unders+2.725
��+2.597
aja+2.540
COMPLE+2.532
 Obj+2.500
Neg logit contributions
agate-2.894
ince-2.832
theless-2.769
cius-2.749
 Difference-2.687
Token has
Token ID468
Feature activation+2.07e+00

Pos logit contributions
agate+1.383
ince+1.380
theless+1.327
 Difference+1.317
cius+1.307
Neg logit contributions
��-1.558
 unders-1.535
COMPLE-1.504
aja-1.485
 Obj-1.470
Token ruled
Token ID8879
Feature activation+4.85e+00

Pos logit contributions
agate+1.316
ince+1.310
theless+1.269
 Difference+1.255
cius+1.247
Neg logit contributions
��-1.472
 unders-1.448
COMPLE-1.417
aja-1.400
 Obj-1.383
Token if
Token ID611
Feature activation-4.36e+00

Pos logit contributions
ince+2.972
agate+2.956
 Difference+2.799
ments+2.782
cius+2.766
Neg logit contributions
��-3.596
 unders-3.461
COMPLE-3.418
aja-3.375
 Obj-3.313
Token the
Token ID262
Feature activation-7.26e+00

Pos logit contributions
 unders+2.956
��+2.830
aja+2.781
COMPLE+2.764
 Obj+2.749
Neg logit contributions
agate-2.969
ince-2.919
theless-2.870
cius-2.832
 Difference-2.799
Token seizure
Token ID22338
Feature activation-1.17e+01

Pos logit contributions
 unders+4.735
��+4.335
aja+4.316
 Obj+4.311
COMPLE+4.217
Neg logit contributions
agate-5.158
ince-5.058
theless-4.987
cius-4.943
 Difference-4.757
Token can
Token ID460
Feature activation-5.81e+00

Pos logit contributions
 unders+7.109
aja+6.404
 Obj+6.315
COMPLE+6.199
 straw+6.090
Neg logit contributions
cius-8.146
agate-8.116
ince-7.969
theless-7.968
 Difference-7.799
Token go
Token ID467
Feature activation+6.81e-01

Pos logit contributions
 unders+3.364
��+3.142
aja+3.129
COMPLE+3.105
 Obj+3.043
Neg logit contributions
agate-4.515
ince-4.448
theless-4.418
cius-4.404
 Difference-4.354
Token ahead
Token ID4058
Feature activation+3.51e+00

Pos logit contributions
agate+0.581
ince+0.577
theless+0.558
 Difference+0.558
cius+0.555
Neg logit contributions
��-0.340
 unders-0.338
COMPLE-0.323
aja-0.320
 Obj-0.313
Token.
Token ID13
Feature activation-6.63e+00

Pos logit contributions
agate+2.037
ince+2.030
theless+1.910
ments+1.909
 Difference+1.902
Neg logit contributions
��-2.728
 unders-2.653
COMPLE-2.600
aja-2.570
 Obj-2.529
Token Some
Token ID2773
Feature activation-3.58e+00

Pos logit contributions
 unders+4.141
 Obj+4.053
aja+4.024
COMPLE+3.997
��+3.918
Neg logit contributions
agate-4.697
ince-4.548
cius-4.530
theless-4.491
ments-4.299
Token The
Token ID383
Feature activation+1.04e+00

Pos logit contributions
agate+2.094
ince+2.084
 Difference+2.034
theless+1.998
cius+1.985
Neg logit contributions
 unders-2.051
��-2.043
COMPLE-1.987
aja-1.952
 Obj-1.916
Token Bh
Token ID16581
Feature activation-2.77e+00

Pos logit contributions
agate+0.740
ince+0.734
 Difference+0.712
cius+0.705
theless+0.704
Neg logit contributions
 unders-0.663
��-0.659
COMPLE-0.640
aja-0.630
 Obj-0.621
Tokenag
Token ID363
Feature activation-3.54e+00

Pos logit contributions
aja+2.203
 unders+2.200
��+2.193
COMPLE+2.144
 Obj+2.107
Neg logit contributions
theless-1.463
agate-1.461
ince-1.439
 Difference-1.414
cius-1.389
Tokenav
Token ID615
Feature activation-6.47e+00

Pos logit contributions
 unders+2.433
��+2.432
aja+2.404
COMPLE+2.329
 Obj+2.308
Neg logit contributions
agate-2.301
ince-2.283
theless-2.262
 Difference-2.179
cius-2.165
Tokenad
Token ID324
Feature activation-9.93e+00

Pos logit contributions
aja+4.376
 unders+4.139
��+4.091
COMPLE+4.043
 Neh+4.001
Neg logit contributions
theless-4.353
agate-4.132
ince-4.044
 Difference-4.007
cius-3.970
Token G
Token ID402
Feature activation-1.22e+01

Pos logit contributions
 unders+4.285
aja+4.265
��+4.243
 Neh+3.952
COMPLE+3.912
Neg logit contributions
theless-8.764
ince-8.509
agate-8.385
 Difference-8.170
ments-8.160
Tokenita
Token ID5350
Feature activation+2.80e+00

Pos logit contributions
aja+1.939
 unders+1.478
��+1.228
iga+0.821
olls+0.803
Neg logit contributions
theless-14.825
cius-14.017
ments-13.922
ince-13.670
agate-13.654
Token
Token ID198
Feature activation+7.72e-01

Pos logit contributions
agate+1.679
ince+1.664
 Difference+1.597
cius+1.588
theless+1.578
Neg logit contributions
 unders-2.091
��-2.090
COMPLE-2.031
aja-1.996
 Obj-1.981
Token
Token ID198
Feature activation+5.47e+00

Pos logit contributions
agate+0.488
ince+0.484
theless+0.465
 Difference+0.462
cius+0.461
Neg logit contributions
 unders-0.557
��-0.554
COMPLE-0.538
aja-0.533
 Obj-0.527
Token-
Token ID12
Feature activation+3.43e+00

Pos logit contributions
ince+3.159
agate+3.150
cius+2.949
 Difference+2.949
ments+2.913
Neg logit contributions
 unders-4.223
��-4.150
 Obj-3.997
aja-3.975
COMPLE-3.918
Token and
Token ID290
Feature activation+5.67e+00

Pos logit contributions
ince+2.138
agate+2.134
 Difference+2.041
theless+2.040
cius+2.021
Neg logit contributions
��-2.474
 unders-2.451
COMPLE-2.391
aja-2.359
 Obj-2.340
TokenONG
Token ID18494
Feature activation-3.10e+00

Pos logit contributions
 unders+2.688
COMPLE+2.633
aja+2.627
��+2.624
 Obj+2.447
Neg logit contributions
agate-4.781
theless-4.706
ince-4.684
 Difference-4.624
cius-4.584
Token K
Token ID509
Feature activation+4.12e+00

Pos logit contributions
 unders+2.036
COMPLE+1.983
 Obj+1.972
aja+1.962
��+1.951
Neg logit contributions
agate-2.209
ince-2.152
theless-2.068
cius-2.060
 Difference-2.027
TokenONG
Token ID18494
Feature activation-3.35e+00

Pos logit contributions
agate+3.388
ince+3.379
ments+3.315
 Difference+3.296
cius+3.194
Neg logit contributions
��-2.189
 unders-2.123
 Obj-2.037
��-1.941
 Neh-1.918
Token (
Token ID357
Feature activation-3.81e-01

Pos logit contributions
 unders+2.216
��+2.156
COMPLE+2.151
aja+2.126
 Obj+2.093
Neg logit contributions
agate-2.317
ince-2.270
theless-2.240
cius-2.196
 Difference-2.175
TokenReuters
Token ID12637
Feature activation+2.39e+00

Pos logit contributions
 unders+0.144
��+0.142
COMPLE+0.135
aja+0.133
 Obj+0.130
Neg logit contributions
agate-0.372
ince-0.369
theless-0.361
 Difference-0.359
cius-0.358
Token)
Token ID8
Feature activation-1.21e+01

Pos logit contributions
agate+1.606
ince+1.602
 Difference+1.554
ments+1.502
cius+1.500
Neg logit contributions
��-1.628
 unders-1.596
aja-1.530
COMPLE-1.519
��-1.495
Token -
Token ID532
Feature activation+2.79e+00

Pos logit contributions
COMPLE+5.760
 Obj+5.430
 Neh+5.199
aja+5.185
 unders+5.059
Neg logit contributions
theless-10.482
agate-10.319
uyomi-10.197
cius-10.122
ince-9.703
Token One
Token ID1881
Feature activation-5.21e+00

Pos logit contributions
agate+1.784
ince+1.770
 Difference+1.724
theless+1.686
cius+1.680
Neg logit contributions
��-2.000
 unders-1.972
COMPLE-1.906
aja-1.895
 Obj-1.852
Token needs
Token ID2476
Feature activation+3.11e-01

Pos logit contributions
 unders+3.382
��+3.215
COMPLE+3.158
aja+3.150
 Obj+3.130
Neg logit contributions
agate-3.715
ince-3.636
theless-3.578
 Difference-3.498
cius-3.487
Token to
Token ID284
Feature activation-2.64e+00

Pos logit contributions
agate+0.206
ince+0.205
theless+0.197
 Difference+0.196
cius+0.195
Neg logit contributions
 unders-0.214
��-0.214
COMPLE-0.207
aja-0.205
 Obj-0.203
Token look
Token ID804
Feature activation+2.52e+00

Pos logit contributions
 unders+1.698
��+1.634
aja+1.612
COMPLE+1.600
 Obj+1.584
Neg logit contributions
agate-1.862
ince-1.856
theless-1.830
 Difference-1.808
cius-1.790
Tokeng
Token ID70
Feature activation+3.60e+00

Pos logit contributions
 unders+0.279
��+0.278
COMPLE+0.271
aja+0.269
 Obj+0.266
Neg logit contributions
agate-0.204
ince-0.202
theless-0.194
 Difference-0.192
cius-0.192
Tokenasse
Token ID21612
Feature activation-3.47e-01

Pos logit contributions
agate+2.363
ince+2.325
cius+2.245
 Difference+2.194
theless+2.178
Neg logit contributions
��-2.538
 unders-2.508
COMPLE-2.441
 Obj-2.378
aja-2.345
Token,
Token ID11
Feature activation+1.82e+00

Pos logit contributions
 unders+0.236
��+0.234
COMPLE+0.227
aja+0.225
 Obj+0.222
Neg logit contributions
agate-0.234
ince-0.232
theless-0.224
 Difference-0.222
cius-0.222
Token which
Token ID543
Feature activation-4.55e+00

Pos logit contributions
agate+1.173
ince+1.165
theless+1.123
 Difference+1.120
cius+1.114
Neg logit contributions
��-1.289
 unders-1.274
COMPLE-1.242
aja-1.232
 Obj-1.209
Token was
Token ID373
Feature activation-4.45e+00

Pos logit contributions
 unders+3.143
��+3.060
aja+2.961
COMPLE+2.945
 Obj+2.938
Neg logit contributions
agate-3.046
ince-2.993
theless-2.844
cius-2.831
 Difference-2.826
Token several
Token ID1811
Feature activation-1.19e+01

Pos logit contributions
 unders+3.015
��+2.884
COMPLE+2.828
aja+2.810
 Obj+2.798
Neg logit contributions
agate-3.069
ince-3.005
 Difference-2.910
theless-2.884
cius-2.876
Token floors
Token ID18570
Feature activation-1.02e+01

Pos logit contributions
 unders+6.749
aja+6.027
��+5.996
COMPLE+5.927
 Obj+5.908
Neg logit contributions
agate-8.793
ince-8.542
 Difference-8.438
cius-8.345
theless-8.316
Token up
Token ID510
Feature activation-5.30e-01

Pos logit contributions
 unders+5.688
��+4.941
aja+4.927
 Neh+4.726
COMPLE+4.626
Neg logit contributions
agate-8.070
ince-7.880
cius-7.840
theless-7.766
uyomi-7.599
Token,
Token ID11
Feature activation-3.71e+00

Pos logit contributions
 unders+0.362
��+0.359
COMPLE+0.349
aja+0.346
 Obj+0.342
Neg logit contributions
agate-0.356
ince-0.353
theless-0.341
 Difference-0.338
cius-0.338
Token so
Token ID523
Feature activation-9.30e-01

Pos logit contributions
 unders+2.533
��+2.409
COMPLE+2.386
aja+2.352
 Obj+2.335
Neg logit contributions
agate-2.528
ince-2.493
theless-2.401
cius-2.396
 Difference-2.390
Token he
Token ID339
Feature activation-5.49e-02

Pos logit contributions
 unders+0.661
��+0.653
COMPLE+0.633
aja+0.630
 Obj+0.623
Neg logit contributions
agate-0.601
ince-0.596
theless-0.572
cius-0.569
 Difference-0.567
Token Asked
Token ID26220
Feature activation+6.65e+00

Pos logit contributions
 unders+2.131
��+2.102
COMPLE+2.067
 Obj+2.046
aja+2.043
Neg logit contributions
agate-2.000
ince-1.964
theless-1.933
cius-1.906
 Difference-1.852
Token about
Token ID546
Feature activation-3.97e+00

Pos logit contributions
ince+3.808
agate+3.737
theless+3.541
ments+3.539
 Difference+3.539
Neg logit contributions
��-5.091
 unders-4.910
COMPLE-4.858
aja-4.751
 Obj-4.701
Token sexual
Token ID3206
Feature activation+7.94e-01

Pos logit contributions
 unders+2.704
��+2.631
COMPLE+2.549
 Obj+2.536
aja+2.534
Neg logit contributions
agate-2.644
ince-2.607
theless-2.586
cius-2.548
 Difference-2.517
Token harassment
Token ID10556
Feature activation-7.99e-01

Pos logit contributions
agate+0.523
ince+0.517
theless+0.496
 Difference+0.495
cius+0.494
Neg logit contributions
 unders-0.551
��-0.550
COMPLE-0.535
aja-0.529
 Obj-0.523
Token in
Token ID287
Feature activation-4.18e+00

Pos logit contributions
 unders+0.511
��+0.505
COMPLE+0.489
aja+0.486
 Obj+0.479
Neg logit contributions
agate-0.571
ince-0.567
theless-0.550
 Difference-0.545
cius-0.545
Token increasingly
Token ID6481
Feature activation-1.11e+01

Pos logit contributions
 unders+2.818
��+2.751
COMPLE+2.660
aja+2.647
 Obj+2.610
Neg logit contributions
agate-2.801
theless-2.778
ince-2.752
cius-2.695
 Difference-2.667
Token gender
Token ID5279
Feature activation+1.58e+00

Pos logit contributions
 unders+7.001
��+6.327
COMPLE+6.262
 Neh+6.067
 Obj+5.807
Neg logit contributions
agate-7.887
cius-7.676
 Difference-7.665
theless-7.624
ince-7.588
Token-
Token ID12
Feature activation+8.64e+00

Pos logit contributions
agate+0.945
ince+0.941
theless+0.898
 Difference+0.892
cius+0.889
Neg logit contributions
��-1.190
 unders-1.181
COMPLE-1.160
aja-1.149
 Obj-1.144
Tokeninteg
Token ID18908
Feature activation+5.91e+00

Pos logit contributions
ince+5.351
agate+5.146
theless+4.808
het+4.802
cius+4.789
Neg logit contributions
��-6.298
 Obj-6.151
��-6.148
 unders-5.915
COMPLE-5.911
Tokenrated
Token ID4111
Feature activation-1.47e+00

Pos logit contributions
ince+0.667
agate+0.635
ments+0.487
cius+0.456
theless+0.456
Neg logit contributions
��-7.407
 unders-7.244
 Obj-7.134
COMPLE-7.121
��-7.082
Token military
Token ID2422
Feature activation-1.59e+00

Pos logit contributions
 unders+1.017
��+0.995
COMPLE+0.968
aja+0.957
 Obj+0.952
Neg logit contributions
agate-0.980
ince-0.968
theless-0.940
cius-0.932
 Difference-0.929