TOP ACTIVATIONS
MAX = 2.728

vert ices is called normal extr usion , and is discussed
control the value of our extr usion , so that it
Waves The normal extr usion done in the previous
and other accol ades represent extr ins ic . Research has
' a , T > Clone for Shared < ' a
say that we want the extr usion value to go from
added Sunderland 's manager before ext oll ing the Tex an
effect , we want this extr usion to change over time
Code of the Duel ist Sneak Pe ek on July 29
and athletic Western Australian quickly cement ed his spot in the
the F ED state ) accompl ishes all of these ,
a skin condition , an autos omal dominant palm opl ant
to be a joy ous Easter Sunday celebration , but the
prev _ state . un wrap ()) } } pub fn
th U . S . Circuit Court of Appeal 's 3
prev _ state . un wrap (); self . turn =
, 1 ] // Final extr usion v . ver tex
- white . The supporters wrap up their giant signs and
as important as understanding super conduct ivity itself ," said Mel
a different flight ( because Exp edia informed her that her

MIN ACTIVATIONS
MIN = -2.129

its annual renewables portfolio standard report showing that large investor -
Cook man University celebrated its hundred th anniversary from its founding
said . " I love Hels ing borg and Sweden ,
, he planned a " sur prise " trip to Mexico
slack . The Center for Respons ive Politics has the full
args ()) # initialize a rectangular and square struct uring kernel
The company released an independent report showing that East German prisoners
performance . There are six preset modes for the truck depending
. " I miss you Elsa , my precious daughter ."
and his attorney , Josh Engel , said he currently is
, they are pursued by Or cs and by an ancient
Clinton Hillary Diane Rodham Clinton Sanders : ' I fully expect
the projectile tonight , seven hundred thousand people would be saved
Germany and few wanted to participate in the war . The
, Liam , Stella , Hatt ie and Finn , and
ever ka . Sur ve ying the current scene
se held no allegiance to Muss olini and even suggested that
the United States . She maj ored in philosophy at the
considered . In his annual report submitted October 1 , 1863
Hillary Clinton Hillary Diane Rodham Clinton Sanders : ' I fully

INTERVAL 1.514 - 2.728
CONTAINS 0.095%

( P AL ) 100 % on ESA 2012 , any
Maps The old Everett High School building was renamed
can � � t be easy be ­ ing Jimmy Carter
( 1977 ) Grade : A �
to the current and future economic outlook , � � according

INTERVAL 0.299 - 1.514
CONTAINS 19.436%

game still has 65 million players worldwide , though its popularity
$ 15 , 000 to build ( the steep le was
the chest with that long jab and not only will they
at the time . Earlier in the day a
level can be tailored via remote control or rear - mounted

INTERVAL -0.915 - 0.299
CONTAINS 78.696%

prices for consumers . " That is up to
rop ost ach , is a recipient of $ 155 .
to feel comfortable utilizing starter Trevor Knight as a runner ;
luxury for a small minority in India . This is illustrated
- painted wood stripes ( from $ 399 a night for

INTERVAL -2.129 - -0.915
CONTAINS 1.773%

plan . However , Zh uk ov did not clarify whether
who 's given Super Metroid or Symphony of the Night a
in the comments ! <|endoftext|> LG has cancelled a series of
that gave birth to this field of research .[ 17 ]
, so you can easily use this flour on its own
Token vert
Token ID9421
Feature activation+6.92e-01

Pos logit contributions
 Defin+0.457
Ct+0.446
 Prosecut+0.417
 Proceed+0.379
Score+0.376
Neg logit contributions
wig-0.465
 seiz-0.462
ris-0.410
unin-0.403
models-0.397
Tokenices
Token ID1063
Feature activation+5.05e-01

Pos logit contributions
 Defin+1.092
Ct+1.053
 Prosecut+0.945
 Proceed+0.828
Score+0.828
Neg logit contributions
 seiz-1.918
wig-1.917
ris-1.731
unin-1.722
ppo-1.693
Token is
Token ID318
Feature activation+3.89e-01

Pos logit contributions
 Defin+1.142
Ct+1.117
 Prosecut+1.048
 Proceed+0.956
Score+0.950
Neg logit contributions
wig-1.051
 seiz-1.047
ris-0.921
unin-0.906
models-0.890
Token called
Token ID1444
Feature activation+9.74e-02

Pos logit contributions
 Defin+0.892
Ct+0.873
 Prosecut+0.813
 Proceed+0.746
 Divide+0.745
Neg logit contributions
wig-0.798
 seiz-0.792
ris-0.696
unin-0.690
models-0.672
Token normal
Token ID3487
Feature activation-1.27e+00

Pos logit contributions
 Defin+0.223
Ct+0.218
 Prosecut+0.205
 Proceed+0.187
Score+0.186
Neg logit contributions
wig-0.201
 seiz-0.200
ris-0.175
unin-0.172
models-0.170
Token extr
Token ID22820
Feature activation+2.73e+00

Pos logit contributions
wig+2.363
 seiz+2.358
ris+2.045
unin+1.996
models+1.993
Neg logit contributions
 Defin-3.110
Ct-3.040
 Prosecut-2.897
 Proceed-2.647
Score-2.609
Tokenusion
Token ID4241
Feature activation+3.15e-01

Pos logit contributions
 Defin+4.448
Ct+4.391
 Prosecut+3.960
Score+3.546
 Proceed+3.524
Neg logit contributions
wig-6.980
 seiz-6.946
ris-6.180
ppo-6.048
models-6.034
Token,
Token ID11
Feature activation+2.54e-01

Pos logit contributions
 Defin+0.798
Ct+0.783
 Prosecut+0.738
 Proceed+0.684
Score+0.679
Neg logit contributions
wig-0.569
 seiz-0.566
ris-0.485
unin-0.479
models-0.466
Token and
Token ID290
Feature activation+3.55e-01

Pos logit contributions
 Defin+0.622
Ct+0.610
 Prosecut+0.576
 Proceed+0.529
Score+0.526
Neg logit contributions
wig-0.482
 seiz-0.479
ris-0.416
unin-0.408
models-0.402
Token is
Token ID318
Feature activation+3.79e-01

Pos logit contributions
 Defin+0.798
Ct+0.782
 Prosecut+0.734
 Proceed+0.666
Score+0.663
Neg logit contributions
wig-0.746
 seiz-0.744
ris-0.655
unin-0.641
models-0.635
Token discussed
Token ID6693
Feature activation+1.52e+00

Pos logit contributions
 Defin+0.886
Ct+0.867
 Prosecut+0.816
 Proceed+0.746
Score+0.742
Neg logit contributions
wig-0.763
 seiz-0.759
ris-0.665
unin-0.653
models-0.643
Token control
Token ID1630
Feature activation-7.07e-01

Pos logit contributions
wig+0.134
 seiz+0.133
ris+0.116
unin+0.113
models+0.113
Neg logit contributions
 Defin-0.162
Ct-0.157
 Prosecut-0.150
 Proceed-0.136
Score-0.135
Token the
Token ID262
Feature activation+2.89e-01

Pos logit contributions
 seiz+1.452
wig+1.446
ris+1.265
models+1.233
unin+1.231
Neg logit contributions
 Defin-1.615
Ct-1.575
 Prosecut-1.492
 Proceed-1.358
Score-1.346
Token value
Token ID1988
Feature activation+5.05e-01

Pos logit contributions
 Defin+0.675
Ct+0.660
 Prosecut+0.623
 Proceed+0.568
Score+0.565
Neg logit contributions
wig-0.580
 seiz-0.578
ris-0.505
unin-0.493
models-0.489
Token of
Token ID286
Feature activation+4.53e-01

Pos logit contributions
 Defin+1.069
Ct+1.047
 Prosecut+0.972
 Proceed+0.887
Score+0.879
Neg logit contributions
wig-1.125
 seiz-1.118
ris-0.991
unin-0.982
models-0.962
Token our
Token ID674
Feature activation+6.15e-01

Pos logit contributions
 Defin+1.022
Ct+0.999
 Prosecut+0.937
 Proceed+0.855
Score+0.849
Neg logit contributions
wig-0.948
 seiz-0.943
ris-0.831
unin-0.818
models-0.804
Token extr
Token ID22820
Feature activation+2.67e+00

Pos logit contributions
 Defin+1.437
Ct+1.404
 Prosecut+1.323
 Proceed+1.209
Score+1.200
Neg logit contributions
wig-1.240
 seiz-1.237
ris-1.080
unin-1.057
models-1.047
Tokenusion
Token ID4241
Feature activation+3.93e-01

Pos logit contributions
 Defin+4.617
Ct+4.540
 Prosecut+4.104
Score+3.722
 Proceed+3.660
Neg logit contributions
 seiz-6.799
wig-6.681
ris-5.963
unin-5.820
models-5.806
Token,
Token ID11
Feature activation+3.71e-01

Pos logit contributions
 Defin+0.963
Ct+0.943
 Prosecut+0.891
 Proceed+0.818
Score+0.813
Neg logit contributions
wig-0.744
 seiz-0.740
ris-0.642
unin-0.629
models-0.620
Token so
Token ID523
Feature activation+4.56e-01

Pos logit contributions
 Defin+0.908
Ct+0.891
 Prosecut+0.845
 Proceed+0.770
Score+0.768
Neg logit contributions
wig-0.705
 seiz-0.700
ris-0.607
models-0.589
unin-0.588
Token that
Token ID326
Feature activation+6.94e-01

Pos logit contributions
 Defin+0.972
Ct+0.949
 Prosecut+0.889
 Proceed+0.804
Score+0.798
Neg logit contributions
wig-1.010
 seiz-1.005
ris-0.891
unin-0.877
models-0.866
Token it
Token ID340
Feature activation+9.62e-02

Pos logit contributions
 Defin+1.543
Ct+1.507
 Prosecut+1.413
 Proceed+1.288
Score+1.280
Neg logit contributions
wig-1.471
 seiz-1.466
ris-1.292
unin-1.272
models-1.251
Token Waves
Token ID42733
Feature activation+5.57e-01

Pos logit contributions
 Defin+1.091
Ct+1.063
 Prosecut+0.993
 Proceed+0.899
Score+0.892
Neg logit contributions
wig-1.169
 seiz-1.166
ris-1.035
unin-1.019
models-1.004
Token
Token ID198
Feature activation+2.37e-01

Pos logit contributions
 Defin+1.209
Ct+1.175
 Prosecut+1.104
 Proceed+1.007
 Divide+1.000
Neg logit contributions
wig-1.211
 seiz-1.210
ris-1.068
unin-1.058
models-1.026
Token
Token ID198
Feature activation+1.60e-01

Pos logit contributions
 Defin+0.608
Ct+0.568
 Prosecut+0.557
 Proceed+0.500
 Divide+0.497
Neg logit contributions
wig-0.441
ris-0.386
 seiz-0.383
models-0.370
lot-0.368
TokenThe
Token ID464
Feature activation-2.90e-02

Pos logit contributions
 Defin+0.281
Ct+0.271
 Prosecut+0.252
 Proceed+0.221
 Crafting+0.218
Neg logit contributions
wig-0.418
 seiz-0.414
ris-0.376
unin-0.371
models-0.368
Token normal
Token ID3487
Feature activation-8.82e-01

Pos logit contributions
wig+0.059
 seiz+0.059
ris+0.051
unin+0.050
models+0.050
Neg logit contributions
 Defin-0.067
Ct-0.066
 Prosecut-0.062
 Proceed-0.056
Score-0.056
Token extr
Token ID22820
Feature activation+2.57e+00

Pos logit contributions
wig+1.796
 seiz+1.783
ris+1.567
unin+1.528
models+1.526
Neg logit contributions
 Defin-2.024
Ct-1.990
 Prosecut-1.877
 Proceed-1.706
Score-1.699
Tokenusion
Token ID4241
Feature activation+1.05e-01

Pos logit contributions
 Defin+4.701
Ct+4.606
 Prosecut+4.230
 Proceed+3.781
Score+3.760
Neg logit contributions
 seiz-6.407
wig-6.300
ris-5.608
unin-5.514
models-5.471
Token done
Token ID1760
Feature activation+5.31e-01

Pos logit contributions
 Defin+0.252
Ct+0.247
 Prosecut+0.233
 Proceed+0.213
Score+0.212
Neg logit contributions
wig-0.207
 seiz-0.205
ris-0.179
unin-0.175
models-0.173
Token in
Token ID287
Feature activation+1.93e-02

Pos logit contributions
 Defin+1.256
Ct+1.229
 Prosecut+1.158
 Proceed+1.061
Score+1.055
Neg logit contributions
wig-1.052
 seiz-1.048
ris-0.915
unin-0.899
models-0.884
Token the
Token ID262
Feature activation+1.65e-01

Pos logit contributions
 Defin+0.040
Ct+0.039
 Prosecut+0.036
 Proceed+0.032
Score+0.032
Neg logit contributions
wig-0.044
 seiz-0.044
ris-0.039
unin-0.038
models-0.038
Token previous
Token ID2180
Feature activation+1.24e-01

Pos logit contributions
 Defin+0.379
Ct+0.372
 Prosecut+0.349
 Proceed+0.318
Score+0.317
Neg logit contributions
wig-0.337
 seiz-0.337
ris-0.295
unin-0.289
models-0.286
Token and
Token ID290
Feature activation+3.20e-01

Pos logit contributions
 Defin+1.141
Ct+1.116
 Prosecut+1.051
 Proceed+0.964
Score+0.959
Neg logit contributions
wig-0.947
 seiz-0.941
ris-0.821
unin-0.808
models-0.794
Token other
Token ID584
Feature activation+2.50e-01

Pos logit contributions
 Defin+0.744
Ct+0.727
 Prosecut+0.686
 Proceed+0.626
Score+0.621
Neg logit contributions
wig-0.646
 seiz-0.644
ris-0.564
unin-0.553
models-0.545
Token accol
Token ID45667
Feature activation+4.10e-01

Pos logit contributions
 Defin+0.604
Ct+0.591
 Prosecut+0.558
 Proceed+0.511
Score+0.507
Neg logit contributions
wig-0.484
 seiz-0.481
ris-0.419
unin-0.411
models-0.405
Tokenades
Token ID2367
Feature activation+7.21e-02

Pos logit contributions
 Defin+1.105
Ct+1.076
 Prosecut+1.028
 Proceed+0.949
 Divide+0.943
Neg logit contributions
wig-0.683
 seiz-0.666
ris-0.589
unin-0.567
models-0.560
Token represent
Token ID2380
Feature activation-9.94e-02

Pos logit contributions
 Defin+0.171
Ct+0.168
 Prosecut+0.158
 Proceed+0.144
Score+0.143
Neg logit contributions
wig-0.142
 seiz-0.142
ris-0.124
unin-0.120
models-0.120
Token extr
Token ID22820
Feature activation+2.48e+00

Pos logit contributions
wig+0.202
 seiz+0.201
ris+0.176
unin+0.173
models+0.171
Neg logit contributions
 Defin-0.230
Ct-0.225
 Prosecut-0.212
 Proceed-0.193
Score-0.192
Tokenins
Token ID1040
Feature activation+5.71e-01

Pos logit contributions
 Defin+4.703
Ct+4.679
 Prosecut+4.203
Score+3.907
 Proceed+3.742
Neg logit contributions
wig-5.875
 seiz-5.858
ris-5.094
ppo-5.052
models-5.030
Tokenic
Token ID291
Feature activation+1.34e-01

Pos logit contributions
 Defin+0.987
Ct+0.954
 Prosecut+0.883
 Proceed+0.778
Score+0.769
Neg logit contributions
wig-1.497
 seiz-1.487
ris-1.350
unin-1.329
models-1.314
Token.
Token ID13
Feature activation-1.08e-01

Pos logit contributions
 Defin+0.294
Ct+0.287
 Prosecut+0.269
 Proceed+0.244
Score+0.243
Neg logit contributions
wig-0.289
 seiz-0.288
ris-0.254
unin-0.250
models-0.246
Token Research
Token ID4992
Feature activation-1.26e-01

Pos logit contributions
wig+0.229
 seiz+0.225
ris+0.204
models+0.199
lot+0.196
Neg logit contributions
 Defin-0.230
Ct-0.229
 Prosecut-0.212
Score-0.190
 Proceed-0.190
Token has
Token ID468
Feature activation+4.58e-02

Pos logit contributions
wig+0.261
 seiz+0.260
ris+0.229
unin+0.225
models+0.222
Neg logit contributions
 Defin-0.288
Ct-0.282
 Prosecut-0.265
 Proceed-0.241
Score-0.240
Token '
Token ID705
Feature activation-2.07e-01

Pos logit contributions
wig+0.765
 seiz+0.756
ris+0.668
unin+0.665
models+0.653
Neg logit contributions
 Defin-0.649
Ct-0.635
 Prosecut-0.589
 Proceed-0.533
Score-0.529
Tokena
Token ID64
Feature activation-1.09e-01

Pos logit contributions
wig+0.295
 seiz+0.257
ris+0.248
unin+0.239
models+0.235
Neg logit contributions
 Defin-0.618
Ct-0.588
 Prosecut-0.577
 Proceed-0.537
 Crafting-0.531
Token,
Token ID837
Feature activation-6.32e-01

Pos logit contributions
wig+0.231
 seiz+0.216
ris+0.206
models+0.198
unin+0.196
Neg logit contributions
 Defin-0.242
Ct-0.233
 Prosecut-0.223
 Proceed-0.201
 Divide-0.197
Token T
Token ID309
Feature activation+6.36e-01

Pos logit contributions
wig+1.568
 seiz+1.558
ris+1.396
unin+1.385
models+1.364
Neg logit contributions
 Defin-1.192
Ct-1.161
 Prosecut-1.072
 Proceed-0.960
Score-0.947
Token >
Token ID1875
Feature activation+2.94e-02

Pos logit contributions
 Defin+1.145
Ct+1.136
 Prosecut+1.008
Score+0.929
 Proceed+0.911
Neg logit contributions
wig-1.606
 seiz-1.589
unin-1.441
ris-1.393
models-1.379
Token Clone
Token ID30698
Feature activation+2.43e+00

Pos logit contributions
 Defin+0.054
Ct+0.053
 Prosecut+0.049
Score+0.043
 Proceed+0.043
Neg logit contributions
wig-0.073
 seiz-0.073
ris-0.066
unin-0.065
models-0.064
Token for
Token ID329
Feature activation+3.83e-01

Pos logit contributions
Ct+4.654
 Defin+4.526
 Prosecut+4.112
Score+3.817
 Proceed+3.791
Neg logit contributions
 seiz-5.778
wig-5.662
unin-5.071
ris-4.916
models-4.901
Token Shared
Token ID39403
Feature activation+5.42e-01

Pos logit contributions
 Defin+0.718
Ct+0.706
 Prosecut+0.648
Score+0.572
 Proceed+0.572
Neg logit contributions
wig-0.930
 seiz-0.907
ris-0.843
models-0.813
unin-0.812
Token <
Token ID1279
Feature activation-3.82e-01

Pos logit contributions
 Defin+1.120
Ct+1.050
 Prosecut+1.023
 Proceed+0.904
 Divide+0.895
Neg logit contributions
wig-1.252
 seiz-1.168
ris-1.125
models-1.104
lean-1.084
Token '
Token ID705
Feature activation-1.80e-01

Pos logit contributions
wig+0.732
 seiz+0.704
ris+0.667
models+0.634
unin+0.628
Neg logit contributions
 Defin-0.902
Ct-0.868
 Prosecut-0.827
 Proceed-0.746
Score-0.744
Tokena
Token ID64
Feature activation-1.17e-01

Pos logit contributions
wig+0.252
 seiz+0.213
ris+0.212
unin+0.204
lot+0.200
Neg logit contributions
 Defin-0.544
Ct-0.515
 Prosecut-0.507
 Proceed-0.472
 Crafting-0.466
Token say
Token ID910
Feature activation-7.35e-02

Pos logit contributions
 Defin+0.977
Ct+0.955
 Prosecut+0.904
 Proceed+0.823
Score+0.820
Neg logit contributions
wig-0.809
 seiz-0.802
ris-0.701
unin-0.686
models-0.680
Token that
Token ID326
Feature activation+4.06e-01

Pos logit contributions
wig+0.162
 seiz+0.162
ris+0.143
unin+0.141
models+0.139
Neg logit contributions
 Defin-0.157
Ct-0.153
 Prosecut-0.143
 Proceed-0.130
Score-0.129
Token we
Token ID356
Feature activation+2.07e-01

Pos logit contributions
 Defin+0.865
Ct+0.844
 Prosecut+0.790
 Proceed+0.715
Score+0.710
Neg logit contributions
wig-0.899
 seiz-0.895
ris-0.794
unin-0.781
models-0.771
Token want
Token ID765
Feature activation+8.28e-01

Pos logit contributions
 Defin+0.523
Ct+0.512
 Prosecut+0.485
 Proceed+0.445
Score+0.443
Neg logit contributions
wig-0.377
 seiz-0.374
ris-0.324
unin-0.313
models-0.311
Token the
Token ID262
Feature activation+2.39e-01

Pos logit contributions
 Defin+1.839
Ct+1.804
 Prosecut+1.681
 Proceed+1.537
Score+1.534
Neg logit contributions
wig-1.748
 seiz-1.739
ris-1.532
unin-1.516
models-1.481
Token extr
Token ID22820
Feature activation+2.43e+00

Pos logit contributions
 Defin+0.559
Ct+0.546
 Prosecut+0.516
 Proceed+0.471
Score+0.467
Neg logit contributions
wig-0.479
 seiz-0.479
ris-0.418
unin-0.408
models-0.405
Tokenusion
Token ID4241
Feature activation+3.11e-01

Pos logit contributions
 Defin+4.645
Ct+4.545
 Prosecut+4.199
 Proceed+3.761
Score+3.752
Neg logit contributions
 seiz-5.872
wig-5.843
ris-5.207
unin-5.123
models-5.076
Token value
Token ID1988
Feature activation+5.56e-01

Pos logit contributions
 Defin+0.736
Ct+0.720
 Prosecut+0.679
 Proceed+0.621
Score+0.617
Neg logit contributions
wig-0.614
 seiz-0.611
ris-0.534
unin-0.523
models-0.516
Token to
Token ID284
Feature activation+4.10e-01

Pos logit contributions
 Defin+1.296
Ct+1.268
 Prosecut+1.193
 Proceed+1.093
Score+1.085
Neg logit contributions
wig-1.119
 seiz-1.114
ris-0.974
unin-0.958
models-0.943
Token go
Token ID467
Feature activation+2.50e-01

Pos logit contributions
 Defin+0.977
Ct+0.956
 Prosecut+0.901
 Proceed+0.825
Score+0.821
Neg logit contributions
wig-0.804
 seiz-0.801
ris-0.698
unin-0.685
models-0.675
Token from
Token ID422
Feature activation+4.61e-01

Pos logit contributions
 Defin+0.609
Ct+0.593
 Prosecut+0.561
 Proceed+0.514
Score+0.510
Neg logit contributions
wig-0.483
 seiz-0.479
ris-0.418
unin-0.407
models-0.400
Token added
Token ID2087
Feature activation-2.01e-02

Pos logit contributions
wig+0.801
 seiz+0.796
ris+0.701
unin+0.684
models+0.672
Neg logit contributions
 Defin-0.901
Ct-0.881
 Prosecut-0.824
 Proceed-0.752
Score-0.751
Token Sunderland
Token ID35706
Feature activation-5.03e-02

Pos logit contributions
wig+0.043
 seiz+0.043
ris+0.038
unin+0.037
models+0.036
Neg logit contributions
 Defin-0.044
Ct-0.043
 Prosecut-0.041
 Proceed-0.037
Score-0.037
Token's
Token ID338
Feature activation-3.22e-01

Pos logit contributions
wig+0.094
 seiz+0.093
ris+0.081
unin+0.079
models+0.078
Neg logit contributions
 Defin-0.125
Ct-0.123
 Prosecut-0.116
 Proceed-0.106
Score-0.106
Token manager
Token ID4706
Feature activation-3.56e-01

Pos logit contributions
wig+0.625
 seiz+0.621
ris+0.543
unin+0.525
models+0.513
Neg logit contributions
 Defin-0.780
Ct-0.762
 Prosecut-0.712
 Proceed-0.656
Score-0.652
Token before
Token ID878
Feature activation-4.24e-01

Pos logit contributions
wig+0.632
 seiz+0.627
ris+0.540
unin+0.528
models+0.516
Neg logit contributions
 Defin-0.918
Ct-0.898
 Prosecut-0.849
 Proceed-0.785
Score-0.781
Token ext
Token ID1070
Feature activation+2.41e+00

Pos logit contributions
wig+0.904
 seiz+0.902
ris+0.795
unin+0.780
models+0.763
Neg logit contributions
 Defin-0.941
Ct-0.918
 Prosecut-0.856
Score-0.778
 Proceed-0.777
Tokenoll
Token ID692
Feature activation-3.98e-01

Pos logit contributions
 Defin+5.507
Ct+5.490
 Prosecut+5.188
Score+4.729
 Divide+4.688
Neg logit contributions
 seiz-4.716
wig-4.572
unin-3.910
models-3.900
ris-3.895
Tokening
Token ID278
Feature activation-4.60e-02

Pos logit contributions
wig+0.807
 seiz+0.790
ris+0.707
unin+0.701
models+0.685
Neg logit contributions
 Defin-0.915
Ct-0.894
 Prosecut-0.831
 Proceed-0.764
Score-0.759
Token the
Token ID262
Feature activation+5.39e-02

Pos logit contributions
wig+0.097
 seiz+0.097
ris+0.085
unin+0.084
models+0.082
Neg logit contributions
 Defin-0.103
Ct-0.101
 Prosecut-0.094
 Proceed-0.086
Score-0.085
Token Tex
Token ID3567
Feature activation+3.67e-01

Pos logit contributions
 Defin+0.125
Ct+0.122
 Prosecut+0.115
 Proceed+0.104
Score+0.103
Neg logit contributions
wig-0.110
 seiz-0.109
ris-0.096
unin-0.094
models-0.092
Tokenan
Token ID272
Feature activation+9.43e-01

Pos logit contributions
 Defin+0.880
Ct+0.860
 Prosecut+0.811
 Proceed+0.742
Score+0.741
Neg logit contributions
wig-0.717
 seiz-0.713
ris-0.619
unin-0.609
models-0.600
Token effect
Token ID1245
Feature activation-5.90e-01

Pos logit contributions
 Defin+0.436
Ct+0.427
 Prosecut+0.402
 Proceed+0.368
Score+0.366
Neg logit contributions
wig-0.360
 seiz-0.360
ris-0.313
unin-0.306
models-0.303
Token,
Token ID11
Feature activation+2.32e-01

Pos logit contributions
wig+1.091
 seiz+1.084
ris+0.939
unin+0.917
models+0.908
Neg logit contributions
 Defin-1.474
Ct-1.446
 Prosecut-1.367
 Proceed-1.256
Score-1.249
Token we
Token ID356
Feature activation-2.07e-02

Pos logit contributions
 Defin+0.476
Ct+0.467
 Prosecut+0.437
 Proceed+0.392
Score+0.390
Neg logit contributions
wig-0.531
 seiz-0.529
ris-0.470
unin-0.461
models-0.459
Token want
Token ID765
Feature activation+8.61e-01

Pos logit contributions
wig+0.040
 seiz+0.040
ris+0.035
unin+0.034
models+0.034
Neg logit contributions
 Defin-0.050
Ct-0.049
 Prosecut-0.046
 Proceed-0.042
Score-0.042
Token this
Token ID428
Feature activation-1.50e-01

Pos logit contributions
 Defin+1.921
Ct+1.880
 Prosecut+1.747
 Divide+1.602
 Proceed+1.599
Neg logit contributions
wig-1.808
 seiz-1.795
ris-1.588
unin-1.565
ppo-1.530
Token extr
Token ID22820
Feature activation+2.28e+00

Pos logit contributions
wig+0.294
 seiz+0.293
ris+0.256
unin+0.251
models+0.247
Neg logit contributions
 Defin-0.357
Ct-0.349
 Prosecut-0.329
 Proceed-0.302
Score-0.300
Tokenusion
Token ID4241
Feature activation+7.81e-02

Pos logit contributions
 Defin+4.344
Ct+4.251
 Prosecut+3.928
 Proceed+3.512
Score+3.504
Neg logit contributions
 seiz-5.548
wig-5.504
ris-4.910
unin-4.819
models-4.790
Token to
Token ID284
Feature activation+2.29e-01

Pos logit contributions
 Defin+0.185
Ct+0.181
 Prosecut+0.170
 Proceed+0.156
Score+0.155
Neg logit contributions
wig-0.155
 seiz-0.155
ris-0.135
unin-0.133
models-0.130
Token change
Token ID1487
Feature activation-8.34e-01

Pos logit contributions
 Defin+0.546
Ct+0.534
 Prosecut+0.504
 Proceed+0.460
Score+0.458
Neg logit contributions
wig-0.450
 seiz-0.447
ris-0.391
unin-0.383
models-0.377
Token over
Token ID625
Feature activation+6.47e-02

Pos logit contributions
wig+1.691
 seiz+1.675
ris+1.465
unin+1.437
models+1.433
Neg logit contributions
 Defin-1.925
Ct-1.883
 Prosecut-1.784
 Proceed-1.616
Score-1.607
Token time
Token ID640
Feature activation-1.80e-01

Pos logit contributions
 Defin+0.111
Ct+0.108
 Prosecut+0.099
 Proceed+0.087
Score+0.087
Neg logit contributions
wig-0.170
 seiz-0.169
ris-0.153
unin-0.151
models-0.150
Token Code
Token ID6127
Feature activation+1.05e+00

Pos logit contributions
 Defin+0.130
Ct+0.127
 Prosecut+0.119
 Proceed+0.107
Score+0.107
Neg logit contributions
wig-0.134
 seiz-0.134
ris-0.119
unin-0.116
models-0.115
Token of
Token ID286
Feature activation-8.74e-02

Pos logit contributions
 Defin+2.217
Ct+2.183
 Prosecut+2.011
 Crafting+1.835
 Proceed+1.833
Neg logit contributions
 seiz-2.349
wig-2.338
ris-2.055
unin-2.046
models-2.024
Token the
Token ID262
Feature activation-1.48e-01

Pos logit contributions
wig+0.229
 seiz+0.229
ris+0.207
unin+0.204
models+0.202
Neg logit contributions
 Defin-0.151
Ct-0.146
 Prosecut-0.134
 Proceed-0.118
Score-0.117
Token Duel
Token ID23958
Feature activation+3.13e-01

Pos logit contributions
wig+0.335
 seiz+0.334
ris+0.297
unin+0.292
models+0.288
Neg logit contributions
 Defin-0.307
Ct-0.299
 Prosecut-0.279
 Proceed-0.252
Score-0.250
Tokenist
Token ID396
Feature activation+7.77e-02

Pos logit contributions
 Defin+0.742
Ct+0.726
 Prosecut+0.684
 Proceed+0.627
Score+0.623
Neg logit contributions
wig-0.619
 seiz-0.616
ris-0.538
unin-0.528
models-0.520
Token Sneak
Token ID49475
Feature activation+2.28e+00

Pos logit contributions
 Defin+0.177
Ct+0.173
 Prosecut+0.163
 Proceed+0.149
Score+0.148
Neg logit contributions
wig-0.160
 seiz-0.159
ris-0.140
unin-0.137
models-0.136
Token Pe
Token ID2631
Feature activation+1.17e-01

Pos logit contributions
 Defin+4.569
Ct+4.492
 Prosecut+4.121
Score+3.796
 Crafting+3.790
Neg logit contributions
 seiz-5.265
wig-5.250
unin-4.622
ris-4.566
models-4.478
Tokenek
Token ID988
Feature activation+1.13e+00

Pos logit contributions
 Defin+0.275
Ct+0.268
 Prosecut+0.254
 Proceed+0.231
Score+0.230
Neg logit contributions
wig-0.233
 seiz-0.231
ris-0.203
unin-0.200
models-0.196
Token on
Token ID319
Feature activation+9.05e-02

Pos logit contributions
 Defin+2.699
Ct+2.643
 Prosecut+2.486
 Proceed+2.294
Score+2.270
Neg logit contributions
 seiz-2.205
wig-2.199
ris-1.909
unin-1.881
models-1.846
Token July
Token ID2901
Feature activation+8.42e-01

Pos logit contributions
 Defin+0.199
Ct+0.194
 Prosecut+0.183
 Proceed+0.165
Score+0.165
Neg logit contributions
 seiz-0.193
wig-0.193
ris-0.171
unin-0.167
models-0.164
Token 29
Token ID2808
Feature activation+1.76e-01

Pos logit contributions
 Defin+1.638
Ct+1.578
 Prosecut+1.482
 Proceed+1.313
Score+1.306
Neg logit contributions
wig-2.010
 seiz-1.961
ris-1.822
ppo-1.758
unin-1.753
Token and
Token ID290
Feature activation-1.59e-02

Pos logit contributions
wig+0.551
 seiz+0.549
ris+0.487
unin+0.478
models+0.469
Neg logit contributions
 Defin-0.562
Ct-0.549
 Prosecut-0.513
 Proceed-0.466
Score-0.462
Token athletic
Token ID15177
Feature activation-1.26e-02

Pos logit contributions
wig+0.029
 seiz+0.029
ris+0.025
unin+0.025
models+0.024
Neg logit contributions
 Defin-0.039
Ct-0.039
 Prosecut-0.037
 Proceed-0.034
Score-0.033
Token Western
Token ID4885
Feature activation-8.37e-02

Pos logit contributions
wig+0.028
 seiz+0.028
ris+0.025
unin+0.024
models+0.024
Neg logit contributions
 Defin-0.027
Ct-0.026
 Prosecut-0.024
 Proceed-0.022
Score-0.022
Token Australian
Token ID6638
Feature activation-2.65e-01

Pos logit contributions
wig+0.141
 seiz+0.140
ris+0.120
unin+0.117
models+0.115
Neg logit contributions
 Defin-0.223
Ct-0.218
 Prosecut-0.207
 Proceed-0.192
Score-0.191
Token quickly
Token ID2952
Feature activation+3.11e-01

Pos logit contributions
wig+0.534
 seiz+0.531
ris+0.467
unin+0.457
models+0.449
Neg logit contributions
 Defin-0.620
Ct-0.606
 Prosecut-0.571
 Proceed-0.521
Score-0.518
Token cement
Token ID20534
Feature activation+2.19e+00

Pos logit contributions
 Defin+0.746
Ct+0.730
 Prosecut+0.688
 Proceed+0.631
Score+0.626
Neg logit contributions
wig-0.609
 seiz-0.606
ris-0.529
unin-0.518
models-0.509
Tokened
Token ID276
Feature activation+1.13e-01

Pos logit contributions
 Defin+4.701
Ct+4.602
 Prosecut+4.295
Score+3.922
 Crafting+3.876
Neg logit contributions
 seiz-4.767
wig-4.722
ris-4.142
unin-4.099
models-4.034
Token his
Token ID465
Feature activation-6.35e-02

Pos logit contributions
 Defin+0.245
Ct+0.240
 Prosecut+0.224
 Proceed+0.204
Score+0.202
Neg logit contributions
wig-0.246
 seiz-0.245
ris-0.217
unin-0.213
models-0.210
Token spot
Token ID4136
Feature activation-5.44e-01

Pos logit contributions
wig+0.127
 seiz+0.125
ris+0.111
unin+0.108
models+0.106
Neg logit contributions
 Defin-0.150
Ct-0.146
 Prosecut-0.138
 Proceed-0.126
Score-0.124
Token in
Token ID287
Feature activation-1.86e-01

Pos logit contributions
wig+1.063
 seiz+1.055
ris+0.922
unin+0.905
models+0.889
Neg logit contributions
 Defin-1.305
Ct-1.275
 Prosecut-1.204
 Proceed-1.104
Score-1.094
Token the
Token ID262
Feature activation+1.01e-01

Pos logit contributions
wig+0.390
 seiz+0.388
ris+0.343
unin+0.337
models+0.332
Neg logit contributions
 Defin-0.418
Ct-0.409
 Prosecut-0.383
 Proceed-0.349
Score-0.345
Token the
Token ID262
Feature activation+2.85e-02

Pos logit contributions
 Defin+0.480
Ct+0.469
 Prosecut+0.440
 Proceed+0.400
Score+0.398
Neg logit contributions
wig-0.458
 seiz-0.456
ris-0.402
unin-0.395
models-0.390
Token F
Token ID376
Feature activation-3.69e-02

Pos logit contributions
 Defin+0.062
Ct+0.061
 Prosecut+0.057
 Proceed+0.052
Score+0.051
Neg logit contributions
wig-0.062
 seiz-0.062
ris-0.055
unin-0.054
models-0.053
TokenED
Token ID1961
Feature activation+4.54e-01

Pos logit contributions
 seiz+0.074
wig+0.074
ris+0.064
unin+0.063
models+0.062
Neg logit contributions
 Defin-0.086
Ct-0.085
 Prosecut-0.079
 Proceed-0.072
Score-0.072
Token state
Token ID1181
Feature activation-5.71e-01

Pos logit contributions
 Defin+0.951
Ct+0.933
 Prosecut+0.864
 Proceed+0.785
Score+0.781
Neg logit contributions
wig-1.015
 seiz-1.015
ris-0.898
unin-0.888
models-0.878
Token)
Token ID8
Feature activation-1.53e-01

Pos logit contributions
wig+0.988
 seiz+0.982
ris+0.841
unin+0.822
models+0.808
Neg logit contributions
 Defin-1.493
Ct-1.462
 Prosecut-1.388
 Proceed-1.282
Score-1.275
Token accompl
Token ID6424
Feature activation+2.17e+00

Pos logit contributions
wig+0.315
 seiz+0.314
ris+0.279
unin+0.270
models+0.270
Neg logit contributions
 Defin-0.348
Ct-0.340
 Prosecut-0.321
 Proceed-0.291
Score-0.289
Tokenishes
Token ID5614
Feature activation+7.90e-01

Pos logit contributions
 Defin+4.575
Ct+4.476
 Prosecut+4.123
Score+3.770
 Proceed+3.753
Neg logit contributions
 seiz-4.830
wig-4.797
ris-4.188
unin-4.136
models-4.110
Token all
Token ID477
Feature activation+5.92e-01

Pos logit contributions
 Defin+1.854
Ct+1.811
 Prosecut+1.705
 Proceed+1.563
Score+1.551
Neg logit contributions
wig-1.576
 seiz-1.572
ris-1.372
unin-1.348
models-1.329
Token of
Token ID286
Feature activation+5.18e-01

Pos logit contributions
 Defin+1.302
Ct+1.269
 Prosecut+1.186
 Proceed+1.082
Score+1.074
Neg logit contributions
wig-1.271
 seiz-1.271
ris-1.120
unin-1.106
models-1.085
Token these
Token ID777
Feature activation+1.33e-01

Pos logit contributions
 Defin+1.109
Ct+1.083
 Prosecut+1.011
 Proceed+0.919
Score+0.912
Neg logit contributions
wig-1.142
 seiz-1.142
ris-1.007
unin-0.993
models-0.978
Token,
Token ID11
Feature activation+7.90e-02

Pos logit contributions
 Defin+0.306
Ct+0.299
 Prosecut+0.281
 Proceed+0.257
Score+0.255
Neg logit contributions
wig-0.271
 seiz-0.270
ris-0.237
unin-0.232
models-0.229
Token a
Token ID257
Feature activation-8.52e-02

Pos logit contributions
wig+1.011
 seiz+1.009
ris+0.882
unin+0.864
models+0.849
Neg logit contributions
 Defin-1.144
Ct-1.121
 Prosecut-1.048
 Proceed-0.953
Score-0.951
Token skin
Token ID4168
Feature activation-4.90e-01

Pos logit contributions
wig+0.174
 seiz+0.173
ris+0.151
unin+0.149
models+0.147
Neg logit contributions
 Defin-0.197
Ct-0.193
 Prosecut-0.181
 Proceed-0.165
Score-0.164
Token condition
Token ID4006
Feature activation-3.39e-02

Pos logit contributions
wig+0.954
 seiz+0.948
ris+0.827
unin+0.810
models+0.797
Neg logit contributions
 Defin-1.179
Ct-1.153
 Prosecut-1.088
 Proceed-0.998
Score-0.991
Token,
Token ID11
Feature activation+9.53e-02

Pos logit contributions
wig+0.067
 seiz+0.066
ris+0.058
unin+0.057
models+0.056
Neg logit contributions
 Defin-0.081
Ct-0.079
 Prosecut-0.075
 Proceed-0.068
Score-0.068
Token an
Token ID281
Feature activation+2.34e-01

Pos logit contributions
 Defin+0.224
Ct+0.219
 Prosecut+0.206
 Proceed+0.189
Score+0.187
Neg logit contributions
wig-0.190
 seiz-0.190
ris-0.166
unin-0.163
models-0.160
Token autos
Token ID44619
Feature activation+2.14e+00

Pos logit contributions
 Defin+0.549
Ct+0.539
 Prosecut+0.507
 Proceed+0.464
Score+0.461
Neg logit contributions
wig-0.468
 seiz-0.466
ris-0.407
unin-0.399
models-0.394
Tokenomal
Token ID18048
Feature activation+8.61e-01

Pos logit contributions
Ct+3.201
 Defin+3.181
 Prosecut+2.761
Score+2.469
 Proceed+2.451
Neg logit contributions
wig-5.830
 seiz-5.803
unin-5.221
ris-5.212
ppo-5.177
Token dominant
Token ID11410
Feature activation+4.26e-01

Pos logit contributions
 Defin+1.844
Ct+1.813
 Prosecut+1.696
 Proceed+1.532
Score+1.517
Neg logit contributions
wig-1.898
 seiz-1.880
ris-1.665
unin-1.646
models-1.629
Token palm
Token ID18057
Feature activation+4.66e-01

Pos logit contributions
 Defin+0.997
Ct+0.978
 Prosecut+0.922
 Proceed+0.845
Score+0.839
Neg logit contributions
wig-0.853
 seiz-0.848
ris-0.740
unin-0.730
models-0.720
Tokenopl
Token ID20106
Feature activation+6.64e-01

Pos logit contributions
 Defin+1.110
Ct+1.089
 Prosecut+1.027
 Proceed+0.942
Score+0.936
Neg logit contributions
wig-0.907
 seiz-0.906
ris-0.788
unin-0.774
models-0.765
Tokenant
Token ID415
Feature activation-7.62e-02

Pos logit contributions
 Defin+1.512
Ct+1.481
 Prosecut+1.401
 Proceed+1.274
Score+1.273
Neg logit contributions
 seiz-1.362
wig-1.349
ris-1.177
models-1.155
unin-1.151
Token to
Token ID284
Feature activation+1.40e-01

Pos logit contributions
 Defin+0.638
Ct+0.612
 Prosecut+0.588
 Proceed+0.539
Score+0.533
Neg logit contributions
wig-0.426
 seiz-0.393
ris-0.371
unin-0.358
models-0.349
Token be
Token ID307
Feature activation+4.25e-01

Pos logit contributions
 Defin+0.323
Ct+0.318
 Prosecut+0.296
 Proceed+0.273
Score+0.271
Neg logit contributions
 seiz-0.286
wig-0.285
ris-0.249
unin-0.245
models-0.244
Token a
Token ID257
Feature activation-3.84e-02

Pos logit contributions
 Defin+0.994
Ct+0.973
 Prosecut+0.916
 Proceed+0.838
Score+0.832
Neg logit contributions
wig-0.853
 seiz-0.850
ris-0.743
unin-0.730
models-0.720
Token joy
Token ID8716
Feature activation-2.32e-01

Pos logit contributions
wig+0.078
 seiz+0.078
ris+0.068
unin+0.067
models+0.066
Neg logit contributions
 Defin-0.089
Ct-0.087
 Prosecut-0.082
 Proceed-0.075
Score-0.074
Tokenous
Token ID516
Feature activation-6.92e-01

Pos logit contributions
wig+0.538
 seiz+0.536
ris+0.480
unin+0.470
models+0.464
Neg logit contributions
 Defin-0.473
Ct-0.459
 Prosecut-0.429
 Proceed-0.386
Score-0.381
Token Easter
Token ID20111
Feature activation+2.13e+00

Pos logit contributions
wig+1.415
 seiz+1.413
ris+1.240
unin+1.207
ppo+1.183
Neg logit contributions
 Defin-1.599
Ct-1.553
 Prosecut-1.456
 Proceed-1.326
Score-1.319
Token Sunday
Token ID3502
Feature activation-1.11e-01

Pos logit contributions
 Defin+4.558
Ct+4.509
 Prosecut+4.193
 Proceed+3.881
Score+3.850
Neg logit contributions
 seiz-4.515
wig-4.509
ris-3.985
unin-3.954
models-3.935
Token celebration
Token ID14395
Feature activation+9.14e-01

Pos logit contributions
wig+0.208
 seiz+0.207
ris+0.179
unin+0.175
models+0.171
Neg logit contributions
 Defin-0.276
Ct-0.270
 Prosecut-0.255
 Proceed-0.234
Score-0.233
Token,
Token ID11
Feature activation+1.20e-01

Pos logit contributions
 Defin+2.196
Ct+2.160
 Prosecut+2.040
 Proceed+1.879
Score+1.863
Neg logit contributions
wig-1.748
 seiz-1.746
ris-1.512
unin-1.491
models-1.472
Token but
Token ID475
Feature activation+3.41e-02

Pos logit contributions
 Defin+0.306
Ct+0.299
 Prosecut+0.283
 Proceed+0.261
Score+0.260
Neg logit contributions
wig-0.218
 seiz-0.217
ris-0.187
unin-0.182
models-0.179
Token the
Token ID262
Feature activation+9.15e-02

Pos logit contributions
 Defin+0.080
Ct+0.078
 Prosecut+0.074
 Proceed+0.067
Score+0.067
Neg logit contributions
wig-0.068
 seiz-0.068
ris-0.060
unin-0.058
models-0.057
Tokenprev
Token ID47050
Feature activation+3.42e-01

Pos logit contributions
 Defin+0.296
Ct+0.291
 Prosecut+0.273
 Proceed+0.252
Score+0.252
Neg logit contributions
wig-0.233
 seiz-0.233
ris-0.200
unin-0.197
models-0.193
Token_
Token ID62
Feature activation+1.33e-02

Pos logit contributions
Ct+0.632
 Defin+0.625
 Prosecut+0.560
Score+0.523
 Proceed+0.516
Neg logit contributions
 seiz-0.852
wig-0.847
ris-0.748
unin-0.739
ppo-0.735
Tokenstate
Token ID5219
Feature activation+1.19e-01

Pos logit contributions
Ct+0.026
 Defin+0.026
 Prosecut+0.023
Score+0.021
 Proceed+0.021
Neg logit contributions
 seiz-0.032
wig-0.031
ris-0.027
unin-0.027
ppo-0.027
Token.
Token ID764
Feature activation+4.45e-03

Pos logit contributions
 Defin+0.263
Ct+0.258
 Prosecut+0.240
 Proceed+0.220
Score+0.219
Neg logit contributions
 seiz-0.253
wig-0.253
ris-0.222
unin-0.220
models-0.215
Tokenun
Token ID403
Feature activation+1.59e+00

Pos logit contributions
 Defin+0.011
Ct+0.011
 Prosecut+0.011
 Proceed+0.010
Score+0.010
Neg logit contributions
wig-0.008
 seiz-0.008
ris-0.007
unin-0.007
models-0.006
Tokenwrap
Token ID37150
Feature activation+2.12e+00

Pos logit contributions
 Defin+3.566
Ct+3.561
 Prosecut+3.306
Score+3.038
 Proceed+3.028
Neg logit contributions
 seiz-3.267
wig-3.217
ris-2.760
unin-2.737
models-2.727
Token ())
Token ID32865
Feature activation+6.34e-01

Pos logit contributions
Ct+3.647
 Defin+3.630
 Prosecut+3.257
Score+2.986
 Proceed+2.939
Neg logit contributions
 seiz-5.439
wig-5.359
unin-4.782
ris-4.780
ppo-4.686
Token }
Token ID1782
Feature activation+5.80e-01

Pos logit contributions
 Defin+1.356
Ct+1.330
 Prosecut+1.241
 Proceed+1.131
Score+1.125
Neg logit contributions
wig-1.399
 seiz-1.395
ris-1.224
unin-1.214
models-1.190
Token }
Token ID1782
Feature activation+3.52e-01

Pos logit contributions
 Defin+1.235
Ct+1.208
 Prosecut+1.126
 Proceed+1.027
Score+1.020
Neg logit contributions
wig-1.288
 seiz-1.278
ris-1.137
unin-1.121
models-1.103
Token pub
Token ID2240
Feature activation+8.15e-01

Pos logit contributions
 Defin+0.761
Ct+0.744
 Prosecut+0.695
 Proceed+0.632
Score+0.627
Neg logit contributions
wig-0.773
 seiz-0.767
ris-0.682
unin-0.671
models-0.657
Token fn
Token ID24714
Feature activation+4.53e-01

Pos logit contributions
 Defin+1.911
Ct+1.895
 Prosecut+1.754
 Proceed+1.622
Score+1.607
Neg logit contributions
wig-1.626
 seiz-1.596
unin-1.379
ris-1.363
models-1.338
Tokenth
Token ID400
Feature activation+2.94e-01

Pos logit contributions
 seiz+0.092
wig+0.091
ris+0.076
unin+0.076
models+0.074
Neg logit contributions
 Defin-0.131
Ct-0.130
 Prosecut-0.122
 Proceed-0.113
 Crafting-0.112
Token U
Token ID471
Feature activation+1.07e-01

Pos logit contributions
 Defin+0.542
Ct+0.528
 Prosecut+0.488
 Proceed+0.434
Score+0.430
Neg logit contributions
wig-0.736
 seiz-0.733
ris-0.659
unin-0.650
models-0.643
Token.
Token ID13
Feature activation+5.50e-01

Pos logit contributions
 Defin+0.270
Ct+0.266
 Prosecut+0.250
 Proceed+0.231
Score+0.230
Neg logit contributions
 seiz-0.194
wig-0.194
ris-0.166
unin-0.163
models-0.160
TokenS
Token ID50
Feature activation+6.65e-01

Pos logit contributions
 Defin+1.204
Ct+1.115
 Prosecut+1.101
 Proceed+0.988
 Crafting+0.966
Neg logit contributions
wig-1.210
 seiz-1.119
ris-1.075
unin-1.040
models-1.034
Token.
Token ID13
Feature activation+7.48e-01

Pos logit contributions
 Defin+1.825
Ct+1.753
 Prosecut+1.679
Score+1.561
 Proceed+1.556
Neg logit contributions
wig-1.077
 seiz-1.010
ris-0.917
unin-0.887
models-0.863
Token Circuit
Token ID13588
Feature activation+2.08e+00

Pos logit contributions
 Defin+0.889
Ct+0.853
 Prosecut+0.753
 Proceed+0.615
Score+0.605
Neg logit contributions
wig-2.360
 seiz-2.352
ris-2.165
unin-2.141
models-2.123
Token Court
Token ID3078
Feature activation+1.22e+00

Pos logit contributions
Ct+3.418
 Defin+3.310
 Prosecut+3.109
 Proceed+2.705
 Crafting+2.637
Neg logit contributions
wig-5.598
 seiz-5.515
ppo-4.926
ris-4.924
models-4.896
Token of
Token ID286
Feature activation-2.41e-01

Pos logit contributions
Ct+2.307
 Defin+2.271
 Prosecut+2.111
 Proceed+1.937
 Crafting+1.857
Neg logit contributions
wig-2.952
 seiz-2.905
ris-2.599
unin-2.590
ppo-2.567
Token Appeal
Token ID31876
Feature activation+2.04e-01

Pos logit contributions
wig+0.695
 seiz+0.677
ris+0.643
unin+0.628
models+0.626
Neg logit contributions
 Defin-0.344
Ct-0.335
 Prosecut-0.289
Score-0.247
 Crafting-0.241
Token's
Token ID338
Feature activation-2.88e-01

Pos logit contributions
 Defin+0.478
Ct+0.468
 Prosecut+0.441
 Proceed+0.404
Score+0.400
Neg logit contributions
wig-0.408
 seiz-0.405
ris-0.355
unin-0.348
models-0.344
Token 3
Token ID513
Feature activation-8.01e-02

Pos logit contributions
wig+0.624
 seiz+0.624
ris+0.552
unin+0.541
models+0.531
Neg logit contributions
 Defin-0.629
Ct-0.611
 Prosecut-0.571
 Proceed-0.517
Score-0.517
Tokenprev
Token ID47050
Feature activation+5.08e-01

Pos logit contributions
wig+0.052
 seiz+0.052
ris+0.045
unin+0.044
models+0.043
Neg logit contributions
 Defin-0.067
Ct-0.066
 Prosecut-0.062
 Proceed-0.057
Score-0.057
Token_
Token ID62
Feature activation+9.55e-02

Pos logit contributions
Ct+0.928
 Defin+0.924
 Prosecut+0.831
Score+0.772
 Proceed+0.767
Neg logit contributions
wig-1.266
 seiz-1.250
ris-1.110
unin-1.109
ppo-1.095
Tokenstate
Token ID5219
Feature activation+3.83e-01

Pos logit contributions
 Defin+0.219
Ct+0.209
 Prosecut+0.200
 Proceed+0.181
 Crafting+0.179
Neg logit contributions
wig-0.202
 seiz-0.192
ris-0.179
unin-0.175
models-0.173
Token.
Token ID764
Feature activation-3.77e-02

Pos logit contributions
 Defin+0.683
Ct+0.671
 Prosecut+0.614
 Proceed+0.551
Score+0.546
Neg logit contributions
wig-0.977
 seiz-0.972
ris-0.874
unin-0.868
models-0.850
Tokenun
Token ID403
Feature activation+1.65e+00

Pos logit contributions
wig+0.070
 seiz+0.070
ris+0.060
unin+0.059
models+0.058
Neg logit contributions
 Defin-0.094
Ct-0.092
 Prosecut-0.087
 Proceed-0.080
Score-0.080
Tokenwrap
Token ID37150
Feature activation+2.08e+00

Pos logit contributions
 Defin+1.491
Ct+1.392
 Prosecut+1.179
 Proceed+0.870
Score+0.844
Neg logit contributions
wig-5.730
 seiz-5.685
ris-5.306
unin-5.252
models-5.204
Token ();
Token ID13979
Feature activation+1.40e-01

Pos logit contributions
Ct+3.538
 Defin+3.491
 Prosecut+3.162
Score+2.898
 Proceed+2.875
Neg logit contributions
 seiz-5.354
wig-5.336
ris-4.774
unin-4.758
ppo-4.632
Token self
Token ID2116
Feature activation+2.29e-01

Pos logit contributions
 Defin+0.319
Ct+0.311
 Prosecut+0.293
 Proceed+0.266
Score+0.265
Neg logit contributions
wig-0.287
 seiz-0.287
ris-0.251
unin-0.247
models-0.243
Token.
Token ID764
Feature activation+7.44e-02

Pos logit contributions
 Defin+0.399
Ct+0.391
 Prosecut+0.352
 Proceed+0.322
Score+0.316
Neg logit contributions
wig-0.597
 seiz-0.591
ris-0.532
unin-0.528
ppo-0.523
Tokenturn
Token ID15344
Feature activation+3.59e-01

Pos logit contributions
 Defin+0.173
Ct+0.169
 Prosecut+0.159
 Proceed+0.146
Score+0.145
Neg logit contributions
 seiz-0.150
wig-0.149
ris-0.130
unin-0.128
models-0.125
Token =
Token ID796
Feature activation+2.12e-01

Pos logit contributions
 Defin+0.714
Ct+0.702
 Prosecut+0.646
 Proceed+0.590
Score+0.589
Neg logit contributions
wig-0.841
 seiz-0.841
ris-0.744
unin-0.740
models-0.725
Token,
Token ID11
Feature activation-3.16e-01

Pos logit contributions
 Defin+1.368
Ct+1.359
 Prosecut+1.277
Score+1.185
 Proceed+1.183
Neg logit contributions
wig-0.949
 seiz-0.947
ris-0.796
unin-0.790
models-0.774
Token1
Token ID16
Feature activation+4.39e-01

Pos logit contributions
wig+0.705
 seiz+0.704
ris+0.627
unin+0.613
models+0.609
Neg logit contributions
 Defin-0.665
Ct-0.643
 Prosecut-0.605
 Proceed-0.545
Score-0.539
Token]
Token ID60
Feature activation+3.62e-01

Pos logit contributions
 Defin+1.077
Ct+1.065
 Prosecut+0.999
Score+0.920
 Proceed+0.919
Neg logit contributions
wig-0.831
 seiz-0.825
ris-0.710
unin-0.699
models-0.685
Token //
Token ID3373
Feature activation-2.18e-02

Pos logit contributions
 Defin+0.787
Ct+0.766
 Prosecut+0.719
 Proceed+0.650
Score+0.645
Neg logit contributions
 seiz-0.785
wig-0.784
ris-0.693
unin-0.679
models-0.674
Token Final
Token ID8125
Feature activation+1.56e+00

Pos logit contributions
 seiz+0.050
wig+0.050
ris+0.045
unin+0.044
models+0.044
Neg logit contributions
 Defin-0.044
Ct-0.043
 Prosecut-0.040
 Proceed-0.036
Score-0.036
Token extr
Token ID22820
Feature activation+2.07e+00

Pos logit contributions
 Defin+3.506
Ct+3.440
 Prosecut+3.218
 Proceed+2.949
Score+2.936
Neg logit contributions
wig-3.252
 seiz-3.238
ris-2.859
unin-2.830
models-2.762
Tokenusion
Token ID4241
Feature activation-4.88e-02

Pos logit contributions
 Defin+3.703
Ct+3.636
 Prosecut+3.315
 Proceed+2.994
Score+2.985
Neg logit contributions
wig-5.111
 seiz-5.108
ris-4.556
unin-4.478
models-4.439
Token v
Token ID410
Feature activation+7.15e-01

Pos logit contributions
 seiz+0.105
wig+0.105
ris+0.093
unin+0.091
models+0.090
Neg logit contributions
 Defin-0.107
Ct-0.104
 Prosecut-0.098
 Proceed-0.088
Score-0.088
Token.
Token ID13
Feature activation+3.25e-01

Pos logit contributions
 Defin+1.668
Ct+1.647
 Prosecut+1.532
Score+1.417
 Proceed+1.414
Neg logit contributions
wig-1.427
 seiz-1.426
ris-1.233
unin-1.225
ppo-1.195
Tokenver
Token ID332
Feature activation+2.45e-01

Pos logit contributions
 Defin+0.750
Ct+0.733
 Prosecut+0.689
 Proceed+0.630
Score+0.626
Neg logit contributions
wig-0.661
 seiz-0.658
ris-0.577
unin-0.567
models-0.559
Tokentex
Token ID16886
Feature activation+1.82e-01

Pos logit contributions
 Defin+0.584
Ct+0.552
 Prosecut+0.533
 Proceed+0.488
 Crafting+0.488
Neg logit contributions
wig-0.504
 seiz-0.466
ris-0.442
unin-0.430
models-0.421
Token-
Token ID12
Feature activation-3.15e-02

Pos logit contributions
wig+0.017
 seiz+0.016
ris+0.015
unin+0.014
lean+0.014
Neg logit contributions
 Defin-0.029
Ct-0.028
 Prosecut-0.027
 Proceed-0.025
 Crafting-0.025
Tokenwhite
Token ID11186
Feature activation-5.82e-01

Pos logit contributions
 seiz+0.071
wig+0.071
ris+0.062
unin+0.062
models+0.061
Neg logit contributions
 Defin-0.066
Ct-0.065
 Prosecut-0.060
 Proceed-0.055
Score-0.054
Token.
Token ID13
Feature activation+6.69e-02

Pos logit contributions
wig+1.081
 seiz+1.078
ris+0.932
unin+0.911
models+0.898
Neg logit contributions
 Defin-1.447
Ct-1.418
 Prosecut-1.340
 Proceed-1.233
Score-1.226
Token The
Token ID383
Feature activation+2.96e-02

Pos logit contributions
 Defin+0.137
Ct+0.134
 Prosecut+0.125
 Proceed+0.112
Score+0.111
Neg logit contributions
wig-0.152
 seiz-0.150
ris-0.136
unin-0.132
models-0.132
Token supporters
Token ID5941
Feature activation+9.86e-01

Pos logit contributions
 Defin+0.069
Ct+0.068
 Prosecut+0.063
 Proceed+0.058
 Crafting+0.057
Neg logit contributions
wig-0.060
 seiz-0.060
ris-0.052
unin-0.051
models-0.050
Token wrap
Token ID14441
Feature activation+2.06e+00

Pos logit contributions
 Defin+2.299
Ct+2.247
 Prosecut+2.114
 Proceed+1.954
Score+1.935
Neg logit contributions
wig-1.963
 seiz-1.947
ris-1.702
unin-1.701
models-1.661
Token up
Token ID510
Feature activation+9.66e-01

Pos logit contributions
 Defin+4.518
Ct+4.475
 Prosecut+4.224
 Divide+3.839
 Proceed+3.833
Neg logit contributions
wig-4.137
 seiz-4.132
ris-3.629
unin-3.593
models-3.569
Token their
Token ID511
Feature activation+3.03e-01

Pos logit contributions
 Defin+2.305
Ct+2.266
 Prosecut+2.136
 Proceed+1.962
Score+1.951
Neg logit contributions
wig-1.871
 seiz-1.858
ris-1.619
unin-1.596
models-1.575
Token giant
Token ID6175
Feature activation-6.48e-01

Pos logit contributions
 Defin+0.723
Ct+0.707
 Prosecut+0.666
 Proceed+0.610
Score+0.606
Neg logit contributions
wig-0.594
 seiz-0.591
ris-0.516
unin-0.506
models-0.498
Token signs
Token ID5895
Feature activation-3.63e-01

Pos logit contributions
wig+1.256
 seiz+1.249
ris+1.089
unin+1.063
models+1.046
Neg logit contributions
 Defin-1.568
Ct-1.534
 Prosecut-1.443
 Proceed-1.320
Score-1.311
Token and
Token ID290
Feature activation-1.47e-01

Pos logit contributions
wig+0.656
 seiz+0.651
ris+0.562
unin+0.547
models+0.541
Neg logit contributions
 Defin-0.924
Ct-0.903
 Prosecut-0.855
 Proceed-0.788
Score-0.783
Token as
Token ID355
Feature activation+8.94e-02

Pos logit contributions
 Defin+0.292
Ct+0.286
 Prosecut+0.269
 Proceed+0.246
Score+0.245
Neg logit contributions
wig-0.254
 seiz-0.253
ris-0.222
unin-0.218
models-0.214
Token important
Token ID1593
Feature activation-6.02e-02

Pos logit contributions
 Defin+0.207
Ct+0.202
 Prosecut+0.190
 Proceed+0.174
Score+0.173
Neg logit contributions
wig-0.181
 seiz-0.181
ris-0.158
unin-0.155
models-0.153
Token as
Token ID355
Feature activation-1.32e-01

Pos logit contributions
 seiz+0.134
wig+0.133
ris+0.117
unin+0.116
ppo+0.114
Neg logit contributions
 Defin-0.128
Ct-0.125
 Prosecut-0.117
 Proceed-0.107
 Crafting-0.106
Token understanding
Token ID4547
Feature activation-1.41e-01

Pos logit contributions
wig+0.330
 seiz+0.328
ris+0.296
unin+0.291
models+0.289
Neg logit contributions
 Defin-0.244
Ct-0.237
 Prosecut-0.219
 Proceed-0.195
Score-0.195
Token super
Token ID2208
Feature activation+1.05e+00

Pos logit contributions
wig+0.292
 seiz+0.291
ris+0.255
unin+0.250
models+0.247
Neg logit contributions
 Defin-0.322
Ct-0.315
 Prosecut-0.296
 Proceed-0.270
Score-0.270
Tokenconduct
Token ID36495
Feature activation+2.06e+00

Pos logit contributions
 Defin+1.969
Ct+1.929
 Prosecut+1.766
 Proceed+1.577
 Divide+1.565
Neg logit contributions
 seiz-2.576
wig-2.513
ris-2.248
ppo-2.232
unin-2.227
Tokenivity
Token ID3458
Feature activation+5.56e-01

Pos logit contributions
 Defin+3.385
Ct+3.379
 Prosecut+3.032
 Proceed+2.705
 Crafting+2.681
Neg logit contributions
 seiz-5.384
wig-5.236
unin-4.696
ris-4.665
models-4.658
Token itself
Token ID2346
Feature activation-4.78e-01

Pos logit contributions
 Defin+1.325
Ct+1.297
 Prosecut+1.218
 Proceed+1.123
 Divide+1.111
Neg logit contributions
wig-1.087
 seiz-1.085
ris-0.940
unin-0.927
models-0.910
Token,"
Token ID553
Feature activation-2.83e-01

Pos logit contributions
wig+0.870
 seiz+0.867
ris+0.746
unin+0.731
models+0.719
Neg logit contributions
 Defin-1.208
Ct-1.183
 Prosecut-1.119
 Proceed-1.032
Score-1.025
Token said
Token ID531
Feature activation+4.98e-01

Pos logit contributions
wig+0.551
 seiz+0.548
ris+0.478
unin+0.468
models+0.461
Neg logit contributions
 Defin-0.678
Ct-0.664
 Prosecut-0.626
 Proceed-0.574
Score-0.571
Token Mel
Token ID5616
Feature activation-8.41e-02

Pos logit contributions
 Defin+1.052
Ct+1.026
 Prosecut+0.963
 Proceed+0.873
Score+0.863
Neg logit contributions
wig-1.109
 seiz-1.104
ris-0.978
unin-0.969
models-0.954
Token a
Token ID257
Feature activation-9.43e-02

Pos logit contributions
wig+0.232
 seiz+0.230
ris+0.202
unin+0.197
models+0.194
Neg logit contributions
 Defin-0.284
Ct-0.278
 Prosecut-0.262
 Proceed-0.240
Score-0.239
Token different
Token ID1180
Feature activation-9.97e-01

Pos logit contributions
wig+0.182
 seiz+0.182
ris+0.159
unin+0.155
models+0.150
Neg logit contributions
 Defin-0.228
Ct-0.223
 Prosecut-0.209
 Proceed-0.192
Score-0.191
Token flight
Token ID5474
Feature activation+2.37e-01

Pos logit contributions
wig+1.975
 seiz+1.969
ris+1.741
unin+1.691
lot+1.654
Neg logit contributions
 Defin-2.353
Ct-2.276
 Prosecut-2.122
 Proceed-1.967
Score-1.952
Token (
Token ID357
Feature activation-4.13e-02

Pos logit contributions
 Defin+0.602
Ct+0.591
 Prosecut+0.559
 Proceed+0.516
Score+0.512
Neg logit contributions
wig-0.428
 seiz-0.426
ris-0.367
unin-0.360
models-0.353
Tokenbecause
Token ID13893
Feature activation-2.63e-02

Pos logit contributions
wig+0.085
 seiz+0.084
ris+0.074
unin+0.073
models+0.072
Neg logit contributions
 Defin-0.095
Ct-0.093
 Prosecut-0.087
 Proceed-0.080
Score-0.079
Token Exp
Token ID5518
Feature activation+2.06e+00

Pos logit contributions
wig+0.053
 seiz+0.053
ris+0.046
unin+0.045
models+0.044
Neg logit contributions
 Defin-0.061
Ct-0.060
 Prosecut-0.057
 Proceed-0.052
Score-0.051
Tokenedia
Token ID5507
Feature activation-2.10e-01

Pos logit contributions
Ct+4.532
 Defin+4.527
 Prosecut+4.186
Score+3.813
 Divide+3.766
Neg logit contributions
 seiz-4.279
wig-4.277
unin-3.661
ris-3.646
models-3.615
Token informed
Token ID7981
Feature activation-7.03e-01

Pos logit contributions
wig+0.404
 seiz+0.402
ris+0.351
unin+0.343
models+0.338
Neg logit contributions
 Defin-0.508
Ct-0.498
 Prosecut-0.469
 Proceed-0.431
Score-0.428
Token her
Token ID607
Feature activation-4.42e-02

Pos logit contributions
wig+1.371
 seiz+1.362
ris+1.202
unin+1.164
models+1.148
Neg logit contributions
 Defin-1.674
Ct-1.635
 Prosecut-1.533
 Proceed-1.413
Score-1.406
Token that
Token ID326
Feature activation-1.03e-02

Pos logit contributions
wig+0.093
 seiz+0.093
ris+0.082
unin+0.080
models+0.079
Neg logit contributions
 Defin-0.099
Ct-0.097
 Prosecut-0.091
 Proceed-0.083
Score-0.082
Token her
Token ID607
Feature activation-2.28e-01

Pos logit contributions
wig+0.020
 seiz+0.020
ris+0.018
unin+0.017
models+0.017
Neg logit contributions
 Defin-0.024
Ct-0.024
 Prosecut-0.022
 Proceed-0.021
Score-0.020
Token its
Token ID663
Feature activation-7.29e-01

Pos logit contributions
wig+2.331
 seiz+2.310
ris+2.061
unin+1.984
models+1.982
Neg logit contributions
 Defin-2.609
Ct-2.572
 Prosecut-2.408
 Proceed-2.178
Score-2.162
Token annual
Token ID5079
Feature activation-4.50e-01

Pos logit contributions
wig+1.579
 seiz+1.562
ris+1.401
models+1.350
unin+1.345
Neg logit contributions
 Defin-1.566
Ct-1.543
 Prosecut-1.432
 Proceed-1.284
Score-1.272
Token renewables
Token ID36351
Feature activation+1.68e-01

Pos logit contributions
wig+0.965
 seiz+0.954
ris+0.854
models+0.825
unin+0.825
Neg logit contributions
 Defin-0.978
Ct-0.962
 Prosecut-0.896
 Proceed-0.808
Score-0.802
Token portfolio
Token ID15320
Feature activation-4.79e-01

Pos logit contributions
 Defin+0.377
Ct+0.368
 Prosecut+0.345
 Proceed+0.315
Score+0.312
Neg logit contributions
wig-0.355
 seiz-0.353
ris-0.311
unin-0.305
models-0.302
Token standard
Token ID3210
Feature activation-1.00e+00

Pos logit contributions
wig+0.979
 seiz+0.972
ris+0.857
unin+0.835
models+0.829
Neg logit contributions
 Defin-1.101
Ct-1.076
 Prosecut-1.012
 Proceed-0.924
Score-0.914
Token report
Token ID989
Feature activation-2.13e+00

Pos logit contributions
wig+1.818
 seiz+1.792
ris+1.568
unin+1.514
models+1.506
Neg logit contributions
 Defin-2.533
Ct-2.484
 Prosecut-2.355
 Proceed-2.162
Score-2.140
Token showing
Token ID4478
Feature activation-1.05e+00

Pos logit contributions
wig+3.662
 seiz+3.536
ris+3.137
models+3.035
unin+2.972
Neg logit contributions
 Defin-5.493
Ct-5.358
 Prosecut-5.081
 Proceed-4.674
Score-4.604
Token that
Token ID326
Feature activation-9.30e-02

Pos logit contributions
wig+2.234
 seiz+2.218
ris+1.978
unin+1.923
models+1.900
Neg logit contributions
 Defin-2.332
Ct-2.278
 Prosecut-2.139
 Proceed-1.949
Score-1.916
Token large
Token ID1588
Feature activation-8.02e-01

Pos logit contributions
wig+0.197
 seiz+0.196
ris+0.173
unin+0.170
models+0.168
Neg logit contributions
 Defin-0.207
Ct-0.202
 Prosecut-0.190
 Proceed-0.173
Score-0.171
Token investor
Token ID15811
Feature activation+2.05e-01

Pos logit contributions
wig+1.491
 seiz+1.468
ris+1.277
unin+1.245
models+1.227
Neg logit contributions
 Defin-2.001
Ct-1.962
 Prosecut-1.854
 Proceed-1.707
Score-1.694
Token-
Token ID12
Feature activation-1.49e-01

Pos logit contributions
 Defin+0.522
Ct+0.512
 Prosecut+0.485
 Proceed+0.447
Score+0.444
Neg logit contributions
wig-0.369
 seiz-0.367
ris-0.316
unin-0.309
models-0.304
TokenCook
Token ID28937
Feature activation+1.03e+00

Pos logit contributions
wig+0.236
 seiz+0.235
ris+0.209
unin+0.206
models+0.204
Neg logit contributions
 Defin-0.208
Ct-0.202
 Prosecut-0.189
 Proceed-0.170
Score-0.169
Tokenman
Token ID805
Feature activation+2.05e-02

Pos logit contributions
 Defin+2.271
Ct+2.219
 Prosecut+2.084
 Proceed+1.896
Score+1.885
Neg logit contributions
 seiz-2.189
wig-2.181
ris-1.921
unin-1.887
models-1.866
Token University
Token ID2059
Feature activation-5.57e-01

Pos logit contributions
 Defin+0.045
Ct+0.044
 Prosecut+0.041
 Proceed+0.037
Score+0.037
Neg logit contributions
wig-0.044
 seiz-0.044
ris-0.039
unin-0.038
models-0.038
Token celebrated
Token ID13943
Feature activation+5.94e-01

Pos logit contributions
wig+1.122
 seiz+1.112
ris+0.980
unin+0.956
models+0.943
Neg logit contributions
 Defin-1.293
Ct-1.267
 Prosecut-1.184
 Proceed-1.085
Score-1.083
Token its
Token ID663
Feature activation-1.93e-01

Pos logit contributions
 Defin+1.321
Ct+1.291
 Prosecut+1.213
 Proceed+1.105
Score+1.095
Neg logit contributions
wig-1.254
 seiz-1.253
ris-1.102
unin-1.088
models-1.073
Token hundred
Token ID3470
Feature activation-2.13e+00

Pos logit contributions
wig+0.440
 seiz+0.437
ris+0.388
unin+0.379
models+0.374
Neg logit contributions
 Defin-0.402
Ct-0.389
 Prosecut-0.365
 Proceed-0.328
Score-0.326
Tokenth
Token ID400
Feature activation-3.69e-02

Pos logit contributions
wig+3.120
 seiz+3.045
ris+2.588
unin+2.442
models+2.406
Neg logit contributions
 Defin-6.029
Ct-5.906
 Prosecut-5.697
 Proceed-5.279
Score-5.232
Token anniversary
Token ID11162
Feature activation+3.52e-01

Pos logit contributions
wig+0.082
 seiz+0.082
ris+0.072
unin+0.071
models+0.070
Neg logit contributions
 Defin-0.079
Ct-0.077
 Prosecut-0.072
 Proceed-0.065
Score-0.064
Token from
Token ID422
Feature activation-3.00e-01

Pos logit contributions
 Defin+0.843
Ct+0.826
 Prosecut+0.779
 Proceed+0.715
Score+0.710
Neg logit contributions
wig-0.686
 seiz-0.683
ris-0.595
unin-0.586
models-0.576
Token its
Token ID663
Feature activation+4.91e-02

Pos logit contributions
wig+0.621
 seiz+0.619
ris+0.544
unin+0.530
models+0.520
Neg logit contributions
 Defin-0.686
Ct-0.666
 Prosecut-0.626
Score-0.570
 Proceed-0.570
Token founding
Token ID16636
Feature activation-6.51e-01

Pos logit contributions
 Defin+0.109
Ct+0.106
 Prosecut+0.100
 Proceed+0.091
Score+0.090
Neg logit contributions
wig-0.105
 seiz-0.104
ris-0.092
unin-0.090
models-0.089
Token said
Token ID531
Feature activation+2.63e-01

Pos logit contributions
 Defin+0.097
Ct+0.095
 Prosecut+0.090
 Proceed+0.082
Score+0.082
Neg logit contributions
wig-0.080
 seiz-0.080
ris-0.070
unin-0.069
models-0.068
Token.
Token ID13
Feature activation+4.67e-02

Pos logit contributions
 Defin+0.647
Ct+0.635
 Prosecut+0.601
 Proceed+0.552
 Divide+0.547
Neg logit contributions
wig-0.490
 seiz-0.489
ris-0.421
unin-0.415
models-0.409
Token "
Token ID366
Feature activation+3.05e-01

Pos logit contributions
 Defin+0.105
Ct+0.102
 Prosecut+0.096
 Proceed+0.088
Score+0.087
Neg logit contributions
 seiz-0.099
wig-0.099
ris-0.086
unin-0.086
models-0.084
TokenI
Token ID40
Feature activation-3.12e-01

Pos logit contributions
 Defin+0.571
Ct+0.550
 Prosecut+0.517
 Proceed+0.457
 Divide+0.451
Neg logit contributions
wig-0.756
 seiz-0.739
ris-0.680
unin-0.667
models-0.662
Token love
Token ID1842
Feature activation+1.34e-01

Pos logit contributions
wig+0.576
 seiz+0.562
ris+0.505
unin+0.478
ppo+0.476
Neg logit contributions
 Defin-0.777
Ct-0.768
 Prosecut-0.725
Score-0.664
 Proceed-0.663
Token Hels
Token ID33889
Feature activation-2.10e+00

Pos logit contributions
 Defin+0.308
Ct+0.301
 Prosecut+0.283
 Proceed+0.258
Score+0.257
Neg logit contributions
wig-0.277
 seiz-0.275
ris-0.242
unin-0.238
models-0.234
Tokening
Token ID278
Feature activation-3.38e-01

Pos logit contributions
wig+4.385
 seiz+4.268
ris+3.865
unin+3.771
models+3.730
Neg logit contributions
 Defin-4.773
Ct-4.596
 Prosecut-4.372
 Proceed-3.935
 Crafting-3.884
Tokenborg
Token ID23297
Feature activation-6.94e-01

Pos logit contributions
wig+0.618
 seiz+0.616
ris+0.530
unin+0.520
models+0.511
Neg logit contributions
 Defin-0.852
Ct-0.835
 Prosecut-0.789
 Proceed-0.727
Score-0.723
Token and
Token ID290
Feature activation+6.66e-02

Pos logit contributions
wig+1.249
 seiz+1.234
ris+1.070
unin+1.044
models+1.024
Neg logit contributions
 Defin-1.767
Ct-1.734
 Prosecut-1.641
 Proceed-1.512
Score-1.504
Token Sweden
Token ID10710
Feature activation-1.31e+00

Pos logit contributions
 Defin+0.155
Ct+0.152
 Prosecut+0.143
 Proceed+0.130
Score+0.129
Neg logit contributions
wig-0.135
 seiz-0.134
ris-0.118
unin-0.115
models-0.113
Token,
Token ID11
Feature activation+1.02e-01

Pos logit contributions
wig+2.308
 seiz+2.269
ris+2.004
unin+1.931
models+1.910
Neg logit contributions
 Defin-3.351
Ct-3.299
 Prosecut-3.124
 Proceed-2.877
Score-2.845
Token,
Token ID11
Feature activation+1.25e-02

Pos logit contributions
 Defin+0.259
Ct+0.254
 Prosecut+0.239
 Proceed+0.218
Score+0.217
Neg logit contributions
wig-0.219
 seiz-0.218
ris-0.191
unin-0.187
models-0.184
Token he
Token ID339
Feature activation-8.78e-02

Pos logit contributions
 Defin+0.031
Ct+0.030
 Prosecut+0.028
 Proceed+0.026
Score+0.026
Neg logit contributions
wig-0.024
 seiz-0.024
ris-0.021
unin-0.020
models-0.020
Token planned
Token ID6027
Feature activation+2.78e-01

Pos logit contributions
wig+0.167
 seiz+0.166
ris+0.144
unin+0.140
models+0.139
Neg logit contributions
 Defin-0.216
Ct-0.211
 Prosecut-0.199
 Proceed-0.183
Score-0.182
Token a
Token ID257
Feature activation-6.14e-02

Pos logit contributions
 Defin+0.643
Ct+0.628
 Prosecut+0.592
 Proceed+0.540
Score+0.537
Neg logit contributions
wig-0.564
 seiz-0.560
ris-0.490
unin-0.484
models-0.476
Token "
Token ID366
Feature activation+8.40e-02

Pos logit contributions
wig+0.123
 seiz+0.122
ris+0.107
unin+0.105
models+0.103
Neg logit contributions
 Defin-0.144
Ct-0.141
 Prosecut-0.133
 Proceed-0.121
Score-0.121
Tokensur
Token ID11793
Feature activation-2.09e+00

Pos logit contributions
 Defin+0.181
Ct+0.177
 Prosecut+0.166
 Proceed+0.150
Score+0.150
Neg logit contributions
 seiz-0.183
wig-0.183
ris-0.161
unin-0.159
models-0.157
Tokenprise
Token ID7919
Feature activation-7.29e-01

Pos logit contributions
wig+4.800
 seiz+4.689
ris+4.319
unin+4.201
ppo+4.136
Neg logit contributions
 Defin-4.250
Ct-4.109
 Prosecut-3.905
 Proceed-3.463
 Divide-3.414
Token"
Token ID1
Feature activation-3.99e-01

Pos logit contributions
wig+1.473
 seiz+1.463
ris+1.279
unin+1.254
models+1.230
Neg logit contributions
 Defin-1.696
Ct-1.657
 Prosecut-1.551
 Proceed-1.415
Score-1.403
Token trip
Token ID5296
Feature activation+3.55e-01

Pos logit contributions
wig+0.812
 seiz+0.804
ris+0.705
unin+0.690
models+0.679
Neg logit contributions
 Defin-0.929
Ct-0.909
 Prosecut-0.852
 Proceed-0.776
Score-0.772
Token to
Token ID284
Feature activation+4.09e-01

Pos logit contributions
 Defin+0.840
Ct+0.822
 Prosecut+0.775
 Proceed+0.710
Score+0.706
Neg logit contributions
wig-0.700
 seiz-0.698
ris-0.609
unin-0.597
models-0.588
Token Mexico
Token ID5828
Feature activation+6.75e-01

Pos logit contributions
 Defin+0.943
Ct+0.922
 Prosecut+0.867
 Proceed+0.793
Score+0.788
Neg logit contributions
wig-0.836
 seiz-0.833
ris-0.730
unin-0.718
models-0.707
Token slack
Token ID30740
Feature activation+1.08e+00

Pos logit contributions
 Defin+0.466
Ct+0.456
 Prosecut+0.431
 Proceed+0.396
Score+0.393
Neg logit contributions
wig-0.365
 seiz-0.364
ris-0.316
unin-0.310
models-0.305
Token.
Token ID13
Feature activation-8.62e-02

Pos logit contributions
 Defin+2.563
Ct+2.524
 Prosecut+2.385
 Proceed+2.188
Score+2.168
Neg logit contributions
 seiz-2.079
wig-2.055
unin-1.763
ris-1.763
models-1.730
Token The
Token ID383
Feature activation-2.69e-01

Pos logit contributions
wig+0.189
 seiz+0.186
ris+0.167
unin+0.162
models+0.161
Neg logit contributions
 Defin-0.182
Ct-0.178
 Prosecut-0.165
Score-0.150
 Proceed-0.148
Token Center
Token ID3337
Feature activation+2.77e-01

Pos logit contributions
wig+0.554
 seiz+0.550
ris+0.484
unin+0.472
models+0.464
Neg logit contributions
 Defin-0.614
Ct-0.602
 Prosecut-0.564
 Proceed-0.514
Score-0.509
Token for
Token ID329
Feature activation-3.00e-01

Pos logit contributions
 Defin+0.613
Ct+0.600
 Prosecut+0.563
 Proceed+0.512
Score+0.509
Neg logit contributions
wig-0.588
 seiz-0.586
ris-0.516
unin-0.509
models-0.502
Token Respons
Token ID20549
Feature activation-2.07e+00

Pos logit contributions
wig+0.665
 seiz+0.661
ris+0.589
unin+0.575
models+0.568
Neg logit contributions
 Defin-0.634
Ct-0.620
 Prosecut-0.571
Score-0.522
 Proceed-0.519
Tokenive
Token ID425
Feature activation-8.09e-01

Pos logit contributions
wig+3.562
 seiz+3.273
ris+3.045
lean+2.881
unin+2.881
Neg logit contributions
 Defin-5.496
Ct-5.318
 Prosecut-5.022
 Divide-4.698
 Proceed-4.680
Token Politics
Token ID17554
Feature activation+2.51e-01

Pos logit contributions
wig+2.401
 seiz+2.302
ris+2.168
lean+2.138
unin+2.121
Neg logit contributions
 Defin-1.117
Ct-1.055
 Prosecut-0.919
Score-0.824
 Proceed-0.774
Token has
Token ID468
Feature activation-9.42e-02

Pos logit contributions
 Defin+0.562
Ct+0.549
 Prosecut+0.515
 Proceed+0.469
Score+0.465
Neg logit contributions
wig-0.528
 seiz-0.526
ris-0.463
unin-0.455
models-0.449
Token the
Token ID262
Feature activation-5.73e-02

Pos logit contributions
wig+0.191
 seiz+0.191
ris+0.167
unin+0.163
models+0.161
Neg logit contributions
 Defin-0.218
Ct-0.212
 Prosecut-0.200
 Proceed-0.183
Score-0.181
Token full
Token ID1336
Feature activation-1.93e-01

Pos logit contributions
wig+0.114
 seiz+0.114
ris+0.099
unin+0.097
models+0.096
Neg logit contributions
 Defin-0.135
Ct-0.132
 Prosecut-0.124
 Proceed-0.114
Score-0.112
Tokenargs
Token ID22046
Feature activation+4.30e-01

Pos logit contributions
wig+0.055
 seiz+0.050
ris+0.049
models+0.048
unin+0.048
Neg logit contributions
 Defin-0.065
Ct-0.060
 Prosecut-0.059
 Proceed-0.053
 Crafting-0.053
Token())
Token ID28955
Feature activation+9.21e-02

Pos logit contributions
 Defin+0.956
Ct+0.901
 Prosecut+0.870
 Crafting+0.778
 Proceed+0.778
Neg logit contributions
wig-0.925
 seiz-0.888
ris-0.822
models-0.818
unin-0.809
Token #
Token ID1303
Feature activation+3.83e-01

Pos logit contributions
 Defin+0.170
Ct+0.167
 Prosecut+0.154
 Proceed+0.139
Score+0.137
Neg logit contributions
wig-0.232
 seiz-0.228
ris-0.205
unin-0.204
lean-0.199
Token initialize
Token ID41216
Feature activation-3.29e-02

Pos logit contributions
 Defin+0.735
Ct+0.717
 Prosecut+0.663
 Proceed+0.598
 Divide+0.589
Neg logit contributions
wig-0.933
 seiz-0.929
ris-0.828
unin-0.814
models-0.804
Token a
Token ID257
Feature activation-3.60e-01

Pos logit contributions
wig+0.055
 seiz+0.051
ris+0.048
models+0.046
unin+0.046
Neg logit contributions
 Defin-0.088
Ct-0.085
 Prosecut-0.082
 Proceed-0.075
 Crafting-0.074
Token rectangular
Token ID36954
Feature activation-2.06e+00

Pos logit contributions
wig+0.410
 seiz+0.388
ris+0.317
unin+0.305
models+0.284
Neg logit contributions
 Defin-1.163
Ct-1.135
 Prosecut-1.091
 Proceed-1.028
Score-1.026
Token and
Token ID290
Feature activation-1.77e-01

Pos logit contributions
wig+2.534
ris+2.131
 seiz+2.042
models+1.967
lean+1.964
Neg logit contributions
 Defin-6.376
Ct-6.035
 Prosecut-5.877
 Proceed-5.442
doms-5.309
Token square
Token ID6616
Feature activation-6.38e-01

Pos logit contributions
wig+0.356
 seiz+0.335
ris+0.318
unin+0.306
lean+0.302
Neg logit contributions
 Defin-0.423
Ct-0.403
 Prosecut-0.386
 Proceed-0.349
 Crafting-0.344
Token struct
Token ID2878
Feature activation-7.41e-01

Pos logit contributions
wig+0.996
 seiz+0.936
ris+0.816
unin+0.792
models+0.788
Neg logit contributions
 Defin-1.783
Ct-1.724
 Prosecut-1.658
 Proceed-1.548
Score-1.542
Tokenuring
Token ID870
Feature activation-5.73e-01

Pos logit contributions
wig+1.132
 seiz+1.000
ris+0.957
unin+0.914
models+0.901
Neg logit contributions
 Defin-2.094
Ct-2.051
 Prosecut-1.965
Score-1.836
 Proceed-1.823
Token kernel
Token ID9720
Feature activation-6.06e-01

Pos logit contributions
wig+0.720
 seiz+0.669
ris+0.579
unin+0.570
lot+0.544
Neg logit contributions
 Defin-1.787
Ct-1.727
 Prosecut-1.653
 Proceed-1.544
 Divide-1.530
TokenThe
Token ID464
Feature activation-4.51e-01

Pos logit contributions
wig+0.969
 seiz+0.966
ris+0.856
unin+0.843
models+0.831
Neg logit contributions
 Defin-0.932
Ct-0.910
 Prosecut-0.851
 Proceed-0.771
Score-0.766
Token company
Token ID1664
Feature activation+2.37e-01

Pos logit contributions
wig+0.889
 seiz+0.882
ris+0.770
unin+0.747
models+0.740
Neg logit contributions
 Defin-1.078
Ct-1.055
 Prosecut-0.991
 Proceed-0.903
Score-0.900
Token released
Token ID2716
Feature activation-1.15e+00

Pos logit contributions
 Defin+0.567
Ct+0.556
 Prosecut+0.525
 Proceed+0.481
Score+0.478
Neg logit contributions
wig-0.462
 seiz-0.459
ris-0.400
unin-0.394
models-0.387
Token an
Token ID281
Feature activation-7.03e-01

Pos logit contributions
wig+2.234
 seiz+2.221
ris+1.945
unin+1.886
models+1.869
Neg logit contributions
 Defin-2.778
Ct-2.711
 Prosecut-2.555
 Proceed-2.340
Score-2.326
Token independent
Token ID4795
Feature activation+1.27e-01

Pos logit contributions
wig+1.637
 seiz+1.628
ris+1.449
unin+1.424
models+1.414
Neg logit contributions
 Defin-1.426
Ct-1.384
 Prosecut-1.289
 Proceed-1.155
 Divide-1.145
Token report
Token ID989
Feature activation-2.03e+00

Pos logit contributions
 Defin+0.262
Ct+0.255
 Prosecut+0.239
 Proceed+0.215
Score+0.214
Neg logit contributions
wig-0.289
 seiz-0.288
ris-0.256
unin-0.253
models-0.249
Token showing
Token ID4478
Feature activation-1.30e+00

Pos logit contributions
wig+4.088
 seiz+4.036
ris+3.547
models+3.430
lean+3.421
Neg logit contributions
 Defin-4.693
Ct-4.479
 Prosecut-4.202
 Proceed-3.895
Score-3.847
Token that
Token ID326
Feature activation-1.80e-01

Pos logit contributions
 seiz+2.747
wig+2.740
ris+2.424
unin+2.337
models+2.330
Neg logit contributions
 Defin-2.928
Ct-2.835
 Prosecut-2.653
 Proceed-2.429
Score-2.419
Token East
Token ID3687
Feature activation+3.88e-01

Pos logit contributions
wig+0.358
 seiz+0.356
ris+0.312
unin+0.305
models+0.301
Neg logit contributions
 Defin-0.424
Ct-0.415
 Prosecut-0.390
 Proceed-0.357
Score-0.356
Token German
Token ID2679
Feature activation-5.83e-01

Pos logit contributions
 Defin+1.071
Ct+1.054
 Prosecut+1.002
 Proceed+0.934
 Divide+0.922
Neg logit contributions
 seiz-0.611
wig-0.588
unin-0.505
ris-0.495
models-0.481
Token prisoners
Token ID10577
Feature activation+4.82e-01

Pos logit contributions
wig+0.997
 seiz+0.991
ris+0.850
unin+0.822
models+0.812
Neg logit contributions
 Defin-1.538
Ct-1.505
 Prosecut-1.424
 Proceed-1.321
Score-1.318
Token performance
Token ID2854
Feature activation-1.88e-01

Pos logit contributions
 Defin+0.479
Ct+0.468
 Prosecut+0.442
 Proceed+0.406
Score+0.403
Neg logit contributions
wig-0.385
 seiz-0.384
ris-0.334
unin-0.327
models-0.322
Token.
Token ID13
Feature activation-2.54e-01

Pos logit contributions
wig+0.341
 seiz+0.339
ris+0.293
unin+0.286
models+0.282
Neg logit contributions
 Defin-0.474
Ct-0.465
 Prosecut-0.440
 Proceed-0.405
Score-0.403
Token There
Token ID1318
Feature activation+1.85e-01

Pos logit contributions
wig+0.562
 seiz+0.560
ris+0.495
unin+0.488
models+0.480
Neg logit contributions
 Defin-0.545
Ct-0.532
 Prosecut-0.498
 Proceed-0.452
Score-0.448
Token are
Token ID389
Feature activation-1.07e+00

Pos logit contributions
 Defin+0.422
Ct+0.415
 Prosecut+0.389
 Proceed+0.358
Score+0.355
Neg logit contributions
 seiz-0.382
wig-0.378
ris-0.325
unin-0.322
models-0.320
Token six
Token ID2237
Feature activation-1.13e+00

Pos logit contributions
wig+2.029
 seiz+2.029
ris+1.775
unin+1.731
ppo+1.701
Neg logit contributions
 Defin-2.568
Ct-2.502
 Prosecut-2.382
 Proceed-2.171
Score-2.160
Token preset
Token ID38266
Feature activation-2.02e+00

Pos logit contributions
 seiz+2.160
wig+2.152
ris+1.853
unin+1.828
models+1.795
Neg logit contributions
 Defin-2.722
Ct-2.665
 Prosecut-2.524
 Proceed-2.310
Score-2.288
Token modes
Token ID12881
Feature activation-7.69e-01

Pos logit contributions
 seiz+3.935
wig+3.904
ris+3.382
lean+3.317
unin+3.308
Neg logit contributions
 Defin-4.674
Ct-4.587
 Prosecut-4.342
 Proceed-3.970
Score-3.920
Token for
Token ID329
Feature activation-2.17e-01

Pos logit contributions
wig+1.469
 seiz+1.456
ris+1.273
unin+1.245
models+1.227
Neg logit contributions
 Defin-1.872
Ct-1.832
 Prosecut-1.729
 Proceed-1.589
Score-1.577
Token the
Token ID262
Feature activation-2.77e-01

Pos logit contributions
wig+0.426
 seiz+0.425
ris+0.370
unin+0.364
models+0.357
Neg logit contributions
 Defin-0.515
Ct-0.503
 Prosecut-0.475
 Proceed-0.436
Score-0.432
Token truck
Token ID7779
Feature activation-6.51e-01

Pos logit contributions
 seiz+0.530
wig+0.529
ris+0.459
unin+0.451
models+0.443
Neg logit contributions
 Defin-0.672
Ct-0.658
 Prosecut-0.622
 Proceed-0.570
Score-0.566
Token depending
Token ID6906
Feature activation+3.23e-02

Pos logit contributions
wig+1.218
 seiz+1.211
ris+1.049
unin+1.028
models+1.012
Neg logit contributions
 Defin-1.614
Ct-1.581
 Prosecut-1.493
 Proceed-1.374
Score-1.364
Token.
Token ID13
Feature activation+9.40e-02

Pos logit contributions
 Defin+0.371
Ct+0.364
 Prosecut+0.344
 Proceed+0.317
Score+0.315
Neg logit contributions
wig-0.268
 seiz-0.266
ris-0.229
unin-0.226
models-0.222
Token "
Token ID366
Feature activation-1.04e-02

Pos logit contributions
 Defin+0.217
Ct+0.213
 Prosecut+0.200
 Proceed+0.182
Score+0.182
Neg logit contributions
wig-0.191
 seiz-0.190
ris-0.167
unin-0.163
models-0.161
TokenI
Token ID40
Feature activation-5.20e-01

Pos logit contributions
wig+0.026
 seiz+0.025
ris+0.023
unin+0.022
models+0.022
Neg logit contributions
 Defin-0.020
Ct-0.019
 Prosecut-0.018
 Proceed-0.016
 Crafting-0.016
Token miss
Token ID2051
Feature activation-3.42e-02

Pos logit contributions
wig+1.001
 seiz+0.979
ris+0.882
ppo+0.834
unin+0.828
Neg logit contributions
 Defin-1.264
Ct-1.242
 Prosecut-1.169
 Proceed-1.074
Score-1.074
Token you
Token ID345
Feature activation-5.58e-01

Pos logit contributions
wig+0.068
 seiz+0.067
ris+0.059
unin+0.058
models+0.057
Neg logit contributions
 Defin-0.081
Ct-0.079
 Prosecut-0.075
 Proceed-0.068
Score-0.068
Token Elsa
Token ID19226
Feature activation-2.02e+00

Pos logit contributions
wig+1.030
 seiz+1.026
ris+0.887
unin+0.868
models+0.852
Neg logit contributions
 Defin-1.393
Ct-1.365
 Prosecut-1.290
 Proceed-1.188
Score-1.181
Token,
Token ID11
Feature activation-4.40e-01

Pos logit contributions
wig+3.519
 seiz+3.472
ris+3.017
unin+2.934
models+2.870
Neg logit contributions
 Defin-5.182
Ct-5.071
 Prosecut-4.814
 Proceed-4.443
Score-4.417
Token my
Token ID616
Feature activation-4.52e-01

Pos logit contributions
wig+0.835
 seiz+0.832
ris+0.724
unin+0.706
models+0.696
Neg logit contributions
 Defin-1.073
Ct-1.052
 Prosecut-0.993
 Proceed-0.910
Score-0.906
Token precious
Token ID14186
Feature activation-4.85e-01

Pos logit contributions
wig+0.844
 seiz+0.839
ris+0.728
unin+0.712
models+0.700
Neg logit contributions
 Defin-1.122
Ct-1.099
 Prosecut-1.039
 Proceed-0.956
Score-0.949
Token daughter
Token ID4957
Feature activation-4.91e-01

Pos logit contributions
wig+0.902
 seiz+0.897
ris+0.777
unin+0.760
models+0.749
Neg logit contributions
 Defin-1.208
Ct-1.181
 Prosecut-1.117
 Proceed-1.027
Score-1.020
Token."
Token ID526
Feature activation-3.69e-01

Pos logit contributions
wig+0.918
 seiz+0.914
ris+0.789
unin+0.775
models+0.762
Neg logit contributions
 Defin-1.214
Ct-1.190
 Prosecut-1.124
 Proceed-1.035
Score-1.029
Token and
Token ID290
Feature activation+1.37e-01

Pos logit contributions
 seiz+0.063
wig+0.063
ris+0.055
unin+0.054
models+0.052
Neg logit contributions
 Defin-0.081
Ct-0.079
 Prosecut-0.074
Score-0.068
 Proceed-0.068
Token his
Token ID465
Feature activation+6.28e-02

Pos logit contributions
 Defin+0.318
Ct+0.310
 Prosecut+0.292
 Proceed+0.267
Score+0.265
Neg logit contributions
wig-0.280
 seiz-0.279
ris-0.245
unin-0.240
models-0.236
Token attorney
Token ID6136
Feature activation+4.22e-01

Pos logit contributions
 Defin+0.150
Ct+0.145
 Prosecut+0.137
 Proceed+0.126
Score+0.125
Neg logit contributions
wig-0.124
 seiz-0.123
ris-0.108
unin-0.105
models-0.104
Token,
Token ID11
Feature activation-1.52e-01

Pos logit contributions
 Defin+1.036
Ct+1.017
 Prosecut+0.965
 Proceed+0.884
Score+0.876
Neg logit contributions
wig-0.794
 seiz-0.790
ris-0.683
unin-0.675
models-0.662
Token Josh
Token ID8518
Feature activation+1.62e-01

Pos logit contributions
wig+0.343
 seiz+0.342
ris+0.305
unin+0.298
models+0.295
Neg logit contributions
 Defin-0.316
Ct-0.308
 Prosecut-0.286
 Proceed-0.259
Score-0.259
Token Engel
Token ID46073
Feature activation-2.01e+00

Pos logit contributions
 Defin+0.331
Ct+0.323
 Prosecut+0.303
 Proceed+0.273
Score+0.270
Neg logit contributions
 seiz-0.372
wig-0.371
ris-0.328
unin-0.325
models-0.320
Token,
Token ID11
Feature activation-1.42e-01

Pos logit contributions
wig+3.942
 seiz+3.838
ris+3.409
unin+3.317
models+3.250
Neg logit contributions
 Defin-4.855
Ct-4.716
 Prosecut-4.447
 Proceed-4.085
Score-4.065
Token said
Token ID531
Feature activation+4.85e-02

Pos logit contributions
wig+0.281
 seiz+0.279
ris+0.244
unin+0.238
models+0.235
Neg logit contributions
 Defin-0.337
Ct-0.328
 Prosecut-0.308
 Proceed-0.283
Score-0.282
Token he
Token ID339
Feature activation-1.99e-01

Pos logit contributions
 Defin+0.111
Ct+0.108
 Prosecut+0.101
 Proceed+0.093
Score+0.092
Neg logit contributions
wig-0.100
 seiz-0.100
ris-0.088
unin-0.086
models-0.085
Token currently
Token ID3058
Feature activation-1.64e-01

Pos logit contributions
wig+0.389
 seiz+0.388
ris+0.340
unin+0.328
models+0.326
Neg logit contributions
 Defin-0.478
Ct-0.466
 Prosecut-0.437
Score-0.403
 Proceed-0.402
Token is
Token ID318
Feature activation-1.70e-01

Pos logit contributions
wig+0.336
 seiz+0.335
ris+0.296
unin+0.287
models+0.285
Neg logit contributions
 Defin-0.377
Ct-0.367
 Prosecut-0.343
 Proceed-0.315
Score-0.314
Token,
Token ID11
Feature activation+5.89e-02

Pos logit contributions
 Defin+0.892
Ct+0.883
 Prosecut+0.826
 Proceed+0.760
 Crafting+0.754
Neg logit contributions
wig-0.751
 seiz-0.747
ris-0.647
unin-0.641
models-0.635
Token they
Token ID484
Feature activation+2.82e-01

Pos logit contributions
 Defin+0.136
Ct+0.134
 Prosecut+0.125
 Proceed+0.115
Score+0.114
Neg logit contributions
wig-0.120
 seiz-0.119
ris-0.104
unin-0.102
models-0.101
Token are
Token ID389
Feature activation-3.00e-01

Pos logit contributions
 Defin+0.657
Ct+0.643
 Prosecut+0.605
 Proceed+0.554
Score+0.550
Neg logit contributions
wig-0.568
 seiz-0.565
ris-0.495
unin-0.486
models-0.479
Token pursued
Token ID19189
Feature activation+1.53e-01

Pos logit contributions
wig+0.582
 seiz+0.581
ris+0.506
unin+0.494
models+0.484
Neg logit contributions
 Defin-0.727
Ct-0.705
 Prosecut-0.665
 Proceed-0.612
Score-0.611
Token by
Token ID416
Feature activation+3.91e-03

Pos logit contributions
 Defin+0.369
Ct+0.361
 Prosecut+0.341
 Proceed+0.314
Score+0.311
Neg logit contributions
wig-0.295
 seiz-0.294
ris-0.255
unin-0.251
models-0.247
Token Or
Token ID1471
Feature activation-2.00e+00

Pos logit contributions
 Defin+0.009
Ct+0.009
 Prosecut+0.008
 Proceed+0.007
Score+0.007
Neg logit contributions
wig-0.008
 seiz-0.008
ris-0.007
unin-0.007
models-0.007
Tokencs
Token ID6359
Feature activation+8.60e-02

Pos logit contributions
wig+3.466
 seiz+3.248
ris+2.930
unin+2.782
lean+2.761
Neg logit contributions
 Defin-5.311
Ct-5.190
 Prosecut-4.966
Score-4.576
 Proceed-4.532
Token and
Token ID290
Feature activation+7.96e-02

Pos logit contributions
 Defin+0.216
Ct+0.212
 Prosecut+0.201
 Proceed+0.185
Score+0.184
Neg logit contributions
wig-0.157
 seiz-0.157
ris-0.135
unin-0.132
models-0.130
Token by
Token ID416
Feature activation+1.33e-01

Pos logit contributions
 Defin+0.175
Ct+0.171
 Prosecut+0.160
 Proceed+0.145
Score+0.145
Neg logit contributions
wig-0.171
 seiz-0.171
ris-0.151
unin-0.148
models-0.146
Token an
Token ID281
Feature activation-1.54e-01

Pos logit contributions
 Defin+0.297
Ct+0.291
 Prosecut+0.273
 Proceed+0.250
Score+0.247
Neg logit contributions
wig-0.279
 seiz-0.278
ris-0.244
unin-0.240
models-0.237
Token ancient
Token ID6156
Feature activation-2.52e-01

Pos logit contributions
wig+0.279
 seiz+0.277
ris+0.239
unin+0.235
models+0.230
Neg logit contributions
 Defin-0.389
Ct-0.381
 Prosecut-0.360
 Proceed-0.331
Score-0.330
Token Clinton
Token ID2605
Feature activation-1.42e+00

Pos logit contributions
wig+2.418
 seiz+2.281
ris+2.055
ppo+1.931
models+1.930
Neg logit contributions
 Defin-4.001
Ct-3.897
 Prosecut-3.699
Score-3.407
 Proceed-3.390
Token Hillary
Token ID5073
Feature activation-1.62e+00

Pos logit contributions
wig+2.812
 seiz+2.806
ris+2.423
unin+2.360
lean+2.335
Neg logit contributions
 Defin-3.337
Ct-3.276
 Prosecut-3.047
Score-2.798
 Proceed-2.783
Token Diane
Token ID26542
Feature activation-1.71e+00

Pos logit contributions
wig+2.376
 seiz+2.092
ris+1.969
adin+1.897
models+1.889
Neg logit contributions
 Defin-4.640
Ct-4.435
 Prosecut-4.330
 Proceed-3.988
Score-3.949
Token Rodham
Token ID41848
Feature activation-1.60e+00

Pos logit contributions
wig+1.664
 seiz+1.411
ris+1.264
models+1.108
ppo+1.074
Neg logit contributions
 Defin-5.894
Ct-5.685
 Prosecut-5.462
 Proceed-5.203
 Dialogue-5.120
Token Clinton
Token ID2605
Feature activation-1.93e+00

Pos logit contributions
wig+2.577
 seiz+2.385
ris+2.141
ppo+2.077
lean+2.040
Neg logit contributions
 Defin-4.439
Ct-4.278
 Prosecut-4.073
 Proceed-3.748
Score-3.720
TokenSanders
Token ID26747
Feature activation-2.00e+00

Pos logit contributions
wig+4.789
 seiz+4.629
models+4.243
ris+4.239
unin+4.214
Neg logit contributions
 Defin-3.541
Ct-3.285
 Prosecut-3.131
 Proceed-2.811
 Divide-2.762
Token:
Token ID25
Feature activation-1.25e+00

Pos logit contributions
 seiz+4.344
wig+4.194
models+3.732
ris+3.693
lean+3.681
Neg logit contributions
 Defin-4.240
Ct-4.085
 Prosecut-3.869
 Proceed-3.454
 Crafting-3.441
Token '
Token ID705
Feature activation-7.56e-01

Pos logit contributions
wig+2.248
 seiz+2.156
ris+1.957
ppo+1.885
unin+1.872
Neg logit contributions
 Defin-3.100
Ct-3.019
 Prosecut-2.797
Score-2.613
 Transcript-2.553
TokenI
Token ID40
Feature activation-1.04e+00

Pos logit contributions
wig+1.644
 seiz+1.477
models+1.445
ris+1.427
unin+1.407
Neg logit contributions
 Defin-1.715
Ct-1.542
 Prosecut-1.501
 Proceed-1.361
 Divide-1.351
Token fully
Token ID3938
Feature activation-6.77e-01

Pos logit contributions
wig+2.158
 seiz+2.129
ris+2.097
ppo+1.898
lean+1.877
Neg logit contributions
 Defin-2.289
Ct-2.207
 Prosecut-2.098
Score-1.989
 Crafting-1.908
Token expect
Token ID1607
Feature activation-1.17e+00

Pos logit contributions
wig+1.208
 seiz+1.205
ris+1.051
unin+1.009
models+1.002
Neg logit contributions
 Defin-1.728
Ct-1.679
 Prosecut-1.597
 Proceed-1.471
 Crafting-1.466
Token the
Token ID262
Feature activation-3.52e-02

Pos logit contributions
wig+0.750
 seiz+0.748
ris+0.656
unin+0.644
models+0.634
Neg logit contributions
 Defin-0.851
Ct-0.832
 Prosecut-0.782
 Proceed-0.715
Score-0.711
Token projectile
Token ID26489
Feature activation-3.64e-01

Pos logit contributions
wig+0.075
 seiz+0.075
ris+0.067
unin+0.065
models+0.064
Neg logit contributions
 Defin-0.077
Ct-0.076
 Prosecut-0.071
 Proceed-0.064
Score-0.064
Token tonight
Token ID9975
Feature activation-6.31e-01

Pos logit contributions
wig+0.662
 seiz+0.658
ris+0.567
unin+0.556
models+0.546
Neg logit contributions
 Defin-0.921
Ct-0.902
 Prosecut-0.853
 Proceed-0.786
Score-0.782
Token,
Token ID11
Feature activation-9.41e-02

Pos logit contributions
wig+1.084
 seiz+1.078
ris+0.920
unin+0.900
models+0.885
Neg logit contributions
 Defin-1.658
Ct-1.627
 Prosecut-1.542
 Proceed-1.426
Score-1.418
Token seven
Token ID3598
Feature activation-1.09e+00

Pos logit contributions
wig+0.191
 seiz+0.191
ris+0.168
unin+0.164
models+0.162
Neg logit contributions
 Defin-0.218
Ct-0.212
 Prosecut-0.200
 Proceed-0.183
Score-0.182
Token hundred
Token ID3470
Feature activation-1.99e+00

Pos logit contributions
 seiz+2.032
wig+2.032
ris+1.761
unin+1.727
models+1.692
Neg logit contributions
 Defin-2.702
Ct-2.643
 Prosecut-2.492
 Proceed-2.291
Score-2.283
Token thousand
Token ID7319
Feature activation-1.20e+00

Pos logit contributions
 seiz+3.513
wig+3.447
ris+3.018
unin+2.930
models+2.855
Neg logit contributions
 Defin-5.077
Ct-4.970
 Prosecut-4.665
 Proceed-4.315
Score-4.301
Token people
Token ID661
Feature activation+3.72e-01

Pos logit contributions
 seiz+2.136
wig+2.123
ris+1.835
unin+1.790
models+1.747
Neg logit contributions
 Defin-3.089
Ct-3.015
 Prosecut-2.845
Score-2.630
 Proceed-2.629
Token would
Token ID561
Feature activation+9.50e-04

Pos logit contributions
 Defin+0.906
Ct+0.889
 Prosecut+0.839
 Proceed+0.772
Score+0.764
Neg logit contributions
wig-0.705
 seiz-0.701
ris-0.609
unin-0.602
models-0.592
Token be
Token ID307
Feature activation+4.32e-01

Pos logit contributions
 Defin+0.002
Ct+0.002
 Prosecut+0.002
 Proceed+0.002
Score+0.002
Neg logit contributions
wig-0.002
 seiz-0.002
ris-0.002
unin-0.002
models-0.002
Token saved
Token ID7448
Feature activation+2.02e-01

Pos logit contributions
 Defin+1.029
Ct+1.011
 Prosecut+0.953
 Proceed+0.872
Score+0.867
Neg logit contributions
wig-0.842
 seiz-0.836
ris-0.730
unin-0.717
models-0.708
Token Germany
Token ID4486
Feature activation-5.88e-01

Pos logit contributions
wig+0.387
 seiz+0.386
ris+0.337
unin+0.330
models+0.325
Neg logit contributions
 Defin-0.470
Ct-0.460
 Prosecut-0.434
 Proceed-0.397
Score-0.395
Token and
Token ID290
Feature activation-2.11e-01

Pos logit contributions
wig+1.059
 seiz+1.054
ris+0.908
unin+0.889
models+0.873
Neg logit contributions
 Defin-1.496
Ct-1.465
 Prosecut-1.386
 Proceed-1.278
Score-1.271
Token few
Token ID1178
Feature activation-3.46e-01

Pos logit contributions
wig+0.399
 seiz+0.397
ris+0.345
unin+0.338
models+0.333
Neg logit contributions
 Defin-0.516
Ct-0.505
 Prosecut-0.477
 Proceed-0.439
Score-0.436
Token wanted
Token ID2227
Feature activation-2.22e-01

Pos logit contributions
wig+0.649
 seiz+0.645
ris+0.558
unin+0.549
models+0.539
Neg logit contributions
 Defin-0.856
Ct-0.838
 Prosecut-0.793
 Proceed-0.730
Score-0.724
Token to
Token ID284
Feature activation-2.00e-01

Pos logit contributions
wig+0.454
 seiz+0.448
ris+0.388
unin+0.385
models+0.380
Neg logit contributions
 Defin-0.512
Ct-0.502
 Prosecut-0.474
 Proceed-0.433
Score-0.426
Token participate
Token ID8277
Feature activation-1.98e+00

Pos logit contributions
wig+0.382
 seiz+0.381
ris+0.331
unin+0.324
models+0.319
Neg logit contributions
 Defin-0.487
Ct-0.477
 Prosecut-0.449
 Proceed-0.412
Score-0.411
Token in
Token ID287
Feature activation-3.84e-01

Pos logit contributions
 seiz+3.570
wig+3.565
ris+3.086
unin+3.031
models+2.966
Neg logit contributions
 Defin-4.988
Ct-4.884
 Prosecut-4.588
 Proceed-4.240
Score-4.232
Token the
Token ID262
Feature activation-2.40e-01

Pos logit contributions
wig+0.740
 seiz+0.736
ris+0.640
unin+0.628
models+0.618
Neg logit contributions
 Defin-0.927
Ct-0.908
 Prosecut-0.857
 Proceed-0.787
Score-0.781
Token war
Token ID1175
Feature activation-5.12e-01

Pos logit contributions
wig+0.442
 seiz+0.441
ris+0.383
unin+0.373
models+0.366
Neg logit contributions
 Defin-0.604
Ct-0.592
 Prosecut-0.557
Score-0.512
 Proceed-0.511
Token.
Token ID13
Feature activation-4.77e-01

Pos logit contributions
wig+0.963
 seiz+0.959
ris+0.832
unin+0.815
models+0.802
Neg logit contributions
 Defin-1.263
Ct-1.236
 Prosecut-1.167
 Proceed-1.073
Score-1.068
Token The
Token ID383
Feature activation-7.06e-01

Pos logit contributions
wig+1.072
 seiz+1.057
ris+0.957
models+0.930
unin+0.924
Neg logit contributions
 Defin-0.978
Ct-0.970
 Prosecut-0.887
Score-0.812
 Proceed-0.793
Token,
Token ID11
Feature activation-2.41e-02

Pos logit contributions
wig+0.779
 seiz+0.777
ris+0.667
unin+0.659
models+0.645
Neg logit contributions
 Defin-1.065
Ct-1.045
 Prosecut-0.989
 Proceed-0.912
Score-0.905
Token Liam
Token ID31684
Feature activation-1.77e-01

Pos logit contributions
wig+0.047
 seiz+0.047
ris+0.040
unin+0.040
models+0.039
Neg logit contributions
 Defin-0.058
Ct-0.057
 Prosecut-0.053
 Proceed-0.049
Score-0.049
Token,
Token ID11
Feature activation-1.31e-01

Pos logit contributions
 seiz+0.334
wig+0.334
ris+0.286
unin+0.284
models+0.279
Neg logit contributions
 Defin-0.435
Ct-0.428
 Prosecut-0.404
 Proceed-0.372
Score-0.369
Token Stella
Token ID45856
Feature activation-1.08e+00

Pos logit contributions
wig+0.291
 seiz+0.289
ris+0.258
unin+0.252
models+0.249
Neg logit contributions
 Defin-0.278
Ct-0.271
 Prosecut-0.253
 Proceed-0.230
Score-0.228
Token,
Token ID11
Feature activation-1.42e-01

Pos logit contributions
wig+1.916
 seiz+1.913
ris+1.635
unin+1.609
models+1.577
Neg logit contributions
 Defin-2.768
Ct-2.719
 Prosecut-2.572
 Proceed-2.377
Score-2.360
Token Hatt
Token ID48509
Feature activation-1.97e+00

Pos logit contributions
wig+0.284
 seiz+0.283
ris+0.247
unin+0.243
models+0.239
Neg logit contributions
 Defin-0.334
Ct-0.327
 Prosecut-0.308
 Proceed-0.282
Score-0.280
Tokenie
Token ID494
Feature activation-1.33e-01

Pos logit contributions
wig+3.630
 seiz+3.525
ris+3.143
unin+3.013
models+2.966
Neg logit contributions
 Defin-4.973
Ct-4.860
 Prosecut-4.609
 Proceed-4.219
Score-4.198
Token and
Token ID290
Feature activation-1.34e-01

Pos logit contributions
 seiz+0.247
wig+0.243
unin+0.210
ris+0.208
models+0.204
Neg logit contributions
 Defin-0.331
Ct-0.328
 Prosecut-0.309
 Proceed-0.286
Score-0.283
Token Finn
Token ID15368
Feature activation-5.82e-02

Pos logit contributions
wig+0.293
 seiz+0.291
ris+0.258
unin+0.254
models+0.250
Neg logit contributions
 Defin-0.288
Ct-0.282
 Prosecut-0.263
 Proceed-0.239
Score-0.237
Token,
Token ID11
Feature activation-7.86e-02

Pos logit contributions
 seiz+0.106
wig+0.105
ris+0.090
unin+0.090
models+0.087
Neg logit contributions
 Defin-0.145
Ct-0.144
 Prosecut-0.135
 Proceed-0.126
Score-0.124
Token and
Token ID290
Feature activation-6.05e-02

Pos logit contributions
wig+0.144
 seiz+0.143
ris+0.124
unin+0.121
models+0.119
Neg logit contributions
 Defin-0.198
Ct-0.193
 Prosecut-0.183
 Proceed-0.169
Score-0.168
Tokenever
Token ID964
Feature activation-2.05e-01

Pos logit contributions
wig+0.283
 seiz+0.262
ris+0.255
unin+0.249
lean+0.245
Neg logit contributions
 Defin-0.230
Ct-0.215
 Prosecut-0.208
 Proceed-0.185
 Divide-0.184
Tokenka
Token ID4914
Feature activation-1.67e-01

Pos logit contributions
wig+0.265
 seiz+0.222
ris+0.214
unin+0.210
ppo+0.206
Neg logit contributions
 Defin-0.648
Ct-0.616
 Prosecut-0.599
 Proceed-0.561
 Divide-0.560
Token.
Token ID13
Feature activation-3.97e-02

Pos logit contributions
 seiz+0.311
wig+0.308
ris+0.266
models+0.261
unin+0.261
Neg logit contributions
 Defin-0.411
Ct-0.408
 Prosecut-0.387
 Proceed-0.355
Score-0.352
Token
Token ID198
Feature activation-5.82e-01

Pos logit contributions
wig+0.084
 seiz+0.083
ris+0.073
unin+0.072
models+0.071
Neg logit contributions
 Defin-0.089
Ct-0.087
 Prosecut-0.081
 Proceed-0.074
Score-0.074
Token
Token ID198
Feature activation-4.37e-01

Pos logit contributions
wig+1.009
ris+0.886
models+0.853
lot+0.847
lean+0.845
Neg logit contributions
 Defin-1.535
Ct-1.425
 Prosecut-1.411
 Proceed-1.266
 Divide-1.262
TokenSur
Token ID14214
Feature activation-1.97e+00

Pos logit contributions
wig+1.080
 seiz+1.046
ris+0.975
unin+0.951
models+0.946
Neg logit contributions
 Defin-0.832
Ct-0.783
 Prosecut-0.750
 Proceed-0.661
 Crafting-0.655
Tokenve
Token ID303
Feature activation-2.68e-01

Pos logit contributions
wig+3.647
 seiz+3.554
ris+3.197
unin+3.109
models+3.027
Neg logit contributions
 Defin-4.867
Ct-4.702
 Prosecut-4.516
 Proceed-4.110
Score-4.068
Tokenying
Token ID1112
Feature activation-6.72e-01

Pos logit contributions
wig+0.560
 seiz+0.559
ris+0.490
unin+0.482
models+0.476
Neg logit contributions
 Defin-0.603
Ct-0.590
 Prosecut-0.553
 Proceed-0.504
Score-0.502
Token the
Token ID262
Feature activation-1.53e-02

Pos logit contributions
wig+1.398
 seiz+1.396
ris+1.235
unin+1.204
models+1.194
Neg logit contributions
 Defin-1.508
Ct-1.482
 Prosecut-1.382
Score-1.257
 Proceed-1.251
Token current
Token ID1459
Feature activation-1.38e-01

Pos logit contributions
wig+0.031
 seiz+0.031
ris+0.027
unin+0.027
models+0.027
Neg logit contributions
 Defin-0.035
Ct-0.034
 Prosecut-0.032
 Proceed-0.029
Score-0.029
Token scene
Token ID3715
Feature activation-6.38e-01

Pos logit contributions
wig+0.271
 seiz+0.269
ris+0.235
unin+0.231
models+0.227
Neg logit contributions
 Defin-0.329
Ct-0.322
 Prosecut-0.304
 Proceed-0.278
Score-0.277
Tokense
Token ID325
Feature activation-5.92e-01

Pos logit contributions
wig+0.647
 seiz+0.561
ris+0.539
unin+0.501
models+0.484
Neg logit contributions
 Defin-1.729
Ct-1.668
 Prosecut-1.625
 Proceed-1.530
 Divide-1.505
Token held
Token ID2714
Feature activation-2.39e-01

Pos logit contributions
wig+1.117
 seiz+1.111
ris+0.965
unin+0.944
models+0.923
Neg logit contributions
 Defin-1.461
Ct-1.427
 Prosecut-1.341
 Proceed-1.239
Score-1.236
Token no
Token ID645
Feature activation+8.15e-01

Pos logit contributions
wig+0.481
 seiz+0.479
ris+0.419
unin+0.412
models+0.405
Neg logit contributions
 Defin-0.560
Ct-0.547
 Prosecut-0.515
 Proceed-0.471
Score-0.469
Token allegiance
Token ID25696
Feature activation-7.38e-01

Pos logit contributions
 Defin+2.040
Ct+2.000
 Prosecut+1.902
 Proceed+1.745
Score+1.728
Neg logit contributions
wig-1.486
 seiz-1.478
ris-1.271
unin-1.254
models-1.231
Token to
Token ID284
Feature activation+1.75e-03

Pos logit contributions
wig+1.510
 seiz+1.507
ris+1.304
unin+1.292
models+1.267
Neg logit contributions
 Defin-1.699
Ct-1.669
 Prosecut-1.563
 Proceed-1.436
Score-1.425
Token Muss
Token ID43309
Feature activation-1.96e+00

Pos logit contributions
 Defin+0.004
Ct+0.004
 Prosecut+0.004
 Proceed+0.003
Score+0.003
Neg logit contributions
wig-0.004
 seiz-0.004
ris-0.003
unin-0.003
models-0.003
Tokenolini
Token ID43232
Feature activation-6.88e-01

Pos logit contributions
wig+2.445
ris+2.118
 seiz+2.025
unin+1.964
ppo+1.946
Neg logit contributions
 Defin-6.193
Ct-5.830
 Prosecut-5.754
 Crafting-5.348
 Transcript-5.304
Token and
Token ID290
Feature activation-5.13e-02

Pos logit contributions
wig+1.203
 seiz+1.196
ris+1.028
unin+1.005
models+0.986
Neg logit contributions
 Defin-1.786
Ct-1.749
 Prosecut-1.657
 Proceed-1.529
Score-1.522
Token even
Token ID772
Feature activation+1.09e-01

Pos logit contributions
wig+0.100
 seiz+0.100
ris+0.087
unin+0.085
models+0.084
Neg logit contributions
 Defin-0.123
Ct-0.120
 Prosecut-0.113
 Proceed-0.104
Score-0.104
Token suggested
Token ID5220
Feature activation+2.81e-02

Pos logit contributions
 Defin+0.263
Ct+0.258
 Prosecut+0.244
 Proceed+0.224
Score+0.222
Neg logit contributions
wig-0.208
 seiz-0.207
ris-0.180
unin-0.177
models-0.174
Token that
Token ID326
Feature activation-1.43e-01

Pos logit contributions
 Defin+0.063
Ct+0.062
 Prosecut+0.058
 Proceed+0.053
Score+0.052
Neg logit contributions
wig-0.059
 seiz-0.059
ris-0.052
unin-0.051
models-0.050
Token the
Token ID262
Feature activation-2.46e-01

Pos logit contributions
 Defin+0.521
Ct+0.510
 Prosecut+0.479
 Proceed+0.437
Score+0.434
Neg logit contributions
wig-0.482
 seiz-0.480
ris-0.422
unin-0.415
models-0.409
Token United
Token ID1578
Feature activation+3.38e-01

Pos logit contributions
wig+0.538
 seiz+0.535
ris+0.475
unin+0.465
models+0.458
Neg logit contributions
 Defin-0.532
Ct-0.520
 Prosecut-0.483
 Proceed-0.438
Score-0.435
Token States
Token ID1829
Feature activation-2.04e-01

Pos logit contributions
 Defin+0.669
Ct+0.645
 Prosecut+0.597
Score+0.533
 Proceed+0.533
Neg logit contributions
wig-0.789
 seiz-0.748
ris-0.720
unin-0.687
models-0.686
Token.
Token ID13
Feature activation-2.27e-01

Pos logit contributions
wig+0.374
 seiz+0.373
ris+0.322
unin+0.315
models+0.310
Neg logit contributions
 Defin-0.515
Ct-0.504
 Prosecut-0.476
 Proceed-0.439
Score-0.437
Token She
Token ID1375
Feature activation-5.24e-01

Pos logit contributions
wig+0.511
 seiz+0.491
ris+0.451
models+0.438
ppo+0.433
Neg logit contributions
 Defin-0.471
Ct-0.460
 Prosecut-0.424
Score-0.386
 Proceed-0.377
Token maj
Token ID16486
Feature activation-1.94e+00

Pos logit contributions
wig+1.038
 seiz+1.030
ris+0.901
unin+0.878
models+0.867
Neg logit contributions
 Defin-1.253
Ct-1.220
 Prosecut-1.147
 Proceed-1.053
Score-1.050
Tokenored
Token ID1850
Feature activation-5.04e-01

Pos logit contributions
wig+3.141
 seiz+2.681
ris+2.666
unin+2.471
lean+2.395
Neg logit contributions
 Defin-5.313
Ct-5.212
 Prosecut-5.014
 Proceed-4.712
 Crafting-4.613
Token in
Token ID287
Feature activation-1.54e-01

Pos logit contributions
 seiz+0.969
wig+0.969
ris+0.841
unin+0.824
models+0.811
Neg logit contributions
 Defin-1.218
Ct-1.194
 Prosecut-1.125
 Proceed-1.035
Score-1.028
Token philosophy
Token ID8876
Feature activation-1.31e-01

Pos logit contributions
wig+0.304
 seiz+0.303
ris+0.265
unin+0.260
models+0.256
Neg logit contributions
 Defin-0.365
Ct-0.357
 Prosecut-0.337
 Proceed-0.309
Score-0.307
Token at
Token ID379
Feature activation+3.87e-01

Pos logit contributions
 seiz+0.243
wig+0.242
ris+0.208
unin+0.205
models+0.202
Neg logit contributions
 Defin-0.322
Ct-0.318
 Prosecut-0.298
 Proceed-0.276
 Divide-0.273
Token the
Token ID262
Feature activation+2.90e-02

Pos logit contributions
 Defin+0.849
Ct+0.829
 Prosecut+0.779
 Proceed+0.709
Score+0.703
Neg logit contributions
 seiz-0.833
wig-0.832
ris-0.730
unin-0.721
models-0.711
Token considered
Token ID3177
Feature activation+3.11e-01

Pos logit contributions
 Defin+0.209
Ct+0.205
 Prosecut+0.194
 Proceed+0.179
Score+0.178
Neg logit contributions
wig-0.151
 seiz-0.151
ris-0.130
unin-0.127
models-0.125
Token.
Token ID13
Feature activation-3.01e-01

Pos logit contributions
 Defin+0.734
Ct+0.720
 Prosecut+0.679
 Proceed+0.622
Score+0.616
Neg logit contributions
wig-0.615
 seiz-0.612
ris-0.532
unin-0.525
models-0.517
Token In
Token ID554
Feature activation-1.29e-01

Pos logit contributions
wig+0.673
 seiz+0.662
ris+0.599
models+0.581
unin+0.577
Neg logit contributions
 Defin-0.625
Ct-0.614
 Prosecut-0.566
Score-0.515
 Proceed-0.505
Token his
Token ID465
Feature activation-1.51e-01

Pos logit contributions
wig+0.270
 seiz+0.269
ris+0.238
unin+0.234
models+0.230
Neg logit contributions
 Defin-0.289
Ct-0.282
 Prosecut-0.265
 Proceed-0.240
Score-0.240
Token annual
Token ID5079
Feature activation-3.02e-01

Pos logit contributions
wig+0.303
 seiz+0.302
ris+0.265
unin+0.258
models+0.254
Neg logit contributions
 Defin-0.354
Ct-0.347
 Prosecut-0.324
Score-0.295
 Proceed-0.295
Token report
Token ID989
Feature activation-1.94e+00

Pos logit contributions
wig+0.652
 seiz+0.648
ris+0.574
unin+0.560
models+0.554
Neg logit contributions
 Defin-0.659
Ct-0.647
 Prosecut-0.601
Score-0.542
 Proceed-0.541
Token submitted
Token ID8948
Feature activation-8.03e-01

Pos logit contributions
wig+3.502
 seiz+3.463
ris+3.034
unin+2.926
models+2.894
Neg logit contributions
 Defin-4.797
Ct-4.690
 Prosecut-4.377
Score-4.054
 Proceed-4.010
Token October
Token ID3267
Feature activation+4.39e-01

Pos logit contributions
wig+1.552
 seiz+1.541
ris+1.344
unin+1.317
models+1.296
Neg logit contributions
 Defin-1.941
Ct-1.897
 Prosecut-1.788
 Proceed-1.641
Score-1.635
Token 1
Token ID352
Feature activation-6.26e-01

Pos logit contributions
 Defin+0.883
Ct+0.844
 Prosecut+0.791
Score+0.706
 Proceed+0.702
Neg logit contributions
wig-1.033
 seiz-1.006
ris-0.931
unin-0.903
lot-0.888
Token,
Token ID11
Feature activation-5.41e-01

Pos logit contributions
wig+1.059
 seiz+1.055
ris+0.889
unin+0.873
models+0.860
Neg logit contributions
 Defin-1.664
Ct-1.640
 Prosecut-1.552
 Proceed-1.443
Score-1.432
Token 1863
Token ID47072
Feature activation-8.69e-01

Pos logit contributions
wig+1.017
 seiz+1.015
ris+0.880
unin+0.862
models+0.847
Neg logit contributions
 Defin-1.334
Ct-1.303
 Prosecut-1.230
 Proceed-1.129
Score-1.126
Token Hillary
Token ID5073
Feature activation-1.49e+00

Pos logit contributions
 seiz+1.105
wig+1.093
ris+0.913
unin+0.888
models+0.870
Neg logit contributions
 Defin-2.054
Ct-2.022
 Prosecut-1.911
Score-1.786
 Proceed-1.782
Token Clinton
Token ID2605
Feature activation-1.42e+00

Pos logit contributions
wig+2.418
 seiz+2.281
ris+2.055
ppo+1.931
models+1.930
Neg logit contributions
 Defin-4.001
Ct-3.897
 Prosecut-3.699
Score-3.407
 Proceed-3.390
Token Hillary
Token ID5073
Feature activation-1.62e+00

Pos logit contributions
wig+2.812
 seiz+2.806
ris+2.423
unin+2.360
lean+2.335
Neg logit contributions
 Defin-3.337
Ct-3.276
 Prosecut-3.047
Score-2.798
 Proceed-2.783
Token Diane
Token ID26542
Feature activation-1.71e+00

Pos logit contributions
wig+2.376
 seiz+2.092
ris+1.969
adin+1.897
models+1.889
Neg logit contributions
 Defin-4.640
Ct-4.435
 Prosecut-4.330
 Proceed-3.988
Score-3.949
Token Rodham
Token ID41848
Feature activation-1.60e+00

Pos logit contributions
wig+1.664
 seiz+1.411
ris+1.264
models+1.108
ppo+1.074
Neg logit contributions
 Defin-5.894
Ct-5.685
 Prosecut-5.462
 Proceed-5.203
 Dialogue-5.120
Token Clinton
Token ID2605
Feature activation-1.93e+00

Pos logit contributions
wig+2.577
 seiz+2.385
ris+2.141
ppo+2.077
lean+2.040
Neg logit contributions
 Defin-4.439
Ct-4.278
 Prosecut-4.073
 Proceed-3.748
Score-3.720
TokenSanders
Token ID26747
Feature activation-2.00e+00

Pos logit contributions
wig+4.789
 seiz+4.629
models+4.243
ris+4.239
unin+4.214
Neg logit contributions
 Defin-3.541
Ct-3.285
 Prosecut-3.131
 Proceed-2.811
 Divide-2.762
Token:
Token ID25
Feature activation-1.25e+00

Pos logit contributions
 seiz+4.344
wig+4.194
models+3.732
ris+3.693
lean+3.681
Neg logit contributions
 Defin-4.240
Ct-4.085
 Prosecut-3.869
 Proceed-3.454
 Crafting-3.441
Token '
Token ID705
Feature activation-7.56e-01

Pos logit contributions
wig+2.248
 seiz+2.156
ris+1.957
ppo+1.885
unin+1.872
Neg logit contributions
 Defin-3.100
Ct-3.019
 Prosecut-2.797
Score-2.613
 Transcript-2.553
TokenI
Token ID40
Feature activation-1.04e+00

Pos logit contributions
wig+1.644
 seiz+1.477
models+1.445
ris+1.427
unin+1.407
Neg logit contributions
 Defin-1.715
Ct-1.542
 Prosecut-1.501
 Proceed-1.361
 Divide-1.351
Token fully
Token ID3938
Feature activation-6.77e-01

Pos logit contributions
wig+2.158
 seiz+2.129
ris+2.097
ppo+1.898
lean+1.877
Neg logit contributions
 Defin-2.289
Ct-2.207
 Prosecut-2.098
Score-1.989
 Crafting-1.908
Token (
Token ID357
Feature activation-4.21e-01

Pos logit contributions
wig+0.249
 seiz+0.248
ris+0.219
unin+0.215
models+0.212
Neg logit contributions
 Defin-0.257
Ct-0.251
 Prosecut-0.235
 Proceed-0.214
Score-0.213
TokenP
Token ID47
Feature activation+1.72e-01

Pos logit contributions
wig+0.977
 seiz+0.972
ris+0.870
unin+0.856
models+0.846
Neg logit contributions
 Defin-0.850
Ct-0.828
 Prosecut-0.774
 Proceed-0.696
Score-0.688
TokenAL
Token ID1847
Feature activation-3.30e-04

Pos logit contributions
 Defin+0.543
Ct+0.525
 Prosecut+0.510
 Proceed+0.475
 Divide+0.472
Neg logit contributions
wig-0.208
 seiz-0.196
ris-0.170
unin-0.162
models-0.156
Token)
Token ID8
Feature activation-6.68e-02

Pos logit contributions
 seiz+0.001
wig+0.001
ris+0.001
unin+0.001
models+0.000
Neg logit contributions
 Defin-0.001
Ct-0.001
 Prosecut-0.001
 Proceed-0.001
Score-0.001
Token 100
Token ID1802
Feature activation-2.07e-01

Pos logit contributions
wig+0.154
 seiz+0.153
ris+0.137
unin+0.134
models+0.133
Neg logit contributions
 Defin-0.134
Ct-0.133
 Prosecut-0.123
 Proceed-0.110
Score-0.109
Token%
Token ID4
Feature activation+1.62e+00

Pos logit contributions
wig+0.362
 seiz+0.360
ris+0.304
unin+0.300
models+0.294
Neg logit contributions
 Defin-0.539
Ct-0.529
 Prosecut-0.500
 Proceed-0.465
Score-0.465
Token on
Token ID319
Feature activation+6.62e-01

Pos logit contributions
 Defin+3.535
Ct+3.417
 Prosecut+3.195
 Proceed+2.947
Score+2.938
Neg logit contributions
wig-3.479
 seiz-3.466
ris-3.032
unin-3.010
models-2.957
Token ESA
Token ID42212
Feature activation+3.85e-01

Pos logit contributions
 Defin+1.486
Ct+1.450
 Prosecut+1.362
 Proceed+1.243
Score+1.236
Neg logit contributions
wig-1.391
 seiz-1.381
ris-1.215
unin-1.196
models-1.181
Token2012
Token ID6999
Feature activation+6.28e-01

Pos logit contributions
 Defin+0.763
Ct+0.744
 Prosecut+0.690
 Proceed+0.624
Score+0.622
Neg logit contributions
wig-0.911
 seiz-0.907
ris-0.805
unin-0.796
models-0.783
Token,
Token ID11
Feature activation+9.81e-02

Pos logit contributions
 Defin+1.526
Ct+1.493
 Prosecut+1.408
 Proceed+1.295
Score+1.289
Neg logit contributions
wig-1.206
 seiz-1.201
ris-1.041
unin-1.023
models-1.007
Token any
Token ID597
Feature activation+2.59e-01

Pos logit contributions
 Defin+0.232
Ct+0.228
 Prosecut+0.215
 Proceed+0.196
Score+0.195
Neg logit contributions
wig-0.193
 seiz-0.192
ris-0.169
unin-0.164
models-0.162
Token Maps
Token ID20347
Feature activation-6.13e-02

Pos logit contributions
wig+0.452
 seiz+0.449
ris+0.396
unin+0.394
ppo+0.384
Neg logit contributions
 Defin-0.445
Ct-0.432
 Prosecut-0.403
 Proceed-0.370
Score-0.369
Token
Token ID198
Feature activation-3.59e-01

Pos logit contributions
wig+0.131
 seiz+0.130
ris+0.115
unin+0.113
models+0.111
Neg logit contributions
 Defin-0.136
Ct-0.133
 Prosecut-0.125
 Proceed-0.113
Score-0.113
Token
Token ID198
Feature activation-2.88e-01

Pos logit contributions
wig+0.638
ris+0.559
models+0.538
lot+0.537
lean+0.530
Neg logit contributions
 Defin-0.949
Ct-0.878
 Prosecut-0.871
 Proceed-0.781
 Divide-0.778
TokenThe
Token ID464
Feature activation-4.12e-01

Pos logit contributions
wig+0.711
 seiz+0.703
ris+0.637
unin+0.627
models+0.622
Neg logit contributions
 Defin-0.543
Ct-0.524
 Prosecut-0.490
 Proceed-0.435
Score-0.428
Token old
Token ID1468
Feature activation-1.34e-01

Pos logit contributions
wig+0.851
 seiz+0.845
ris+0.744
unin+0.729
models+0.720
Neg logit contributions
 Defin-0.940
Ct-0.920
 Prosecut-0.863
 Proceed-0.786
Score-0.781
Token Everett
Token ID36815
Feature activation+1.60e+00

Pos logit contributions
wig+0.278
 seiz+0.278
ris+0.244
unin+0.240
models+0.236
Neg logit contributions
 Defin-0.301
Ct-0.294
 Prosecut-0.277
 Proceed-0.252
Score-0.251
Token High
Token ID3334
Feature activation-6.41e-02

Pos logit contributions
 Defin+3.313
Ct+3.187
 Prosecut+3.012
 Proceed+2.720
 Crafting+2.707
Neg logit contributions
 seiz-3.632
wig-3.623
ris-3.204
unin-3.194
models-3.114
Token School
Token ID3961
Feature activation-2.28e-01

Pos logit contributions
wig+0.145
 seiz+0.145
ris+0.128
unin+0.127
models+0.125
Neg logit contributions
 Defin-0.134
Ct-0.130
 Prosecut-0.121
 Proceed-0.110
Score-0.109
Token building
Token ID2615
Feature activation-1.19e-01

Pos logit contributions
wig+0.445
 seiz+0.443
ris+0.386
unin+0.379
models+0.373
Neg logit contributions
 Defin-0.545
Ct-0.533
 Prosecut-0.502
 Proceed-0.461
Score-0.458
Token was
Token ID373
Feature activation+3.73e-02

Pos logit contributions
wig+0.234
 seiz+0.233
ris+0.203
unin+0.199
models+0.196
Neg logit contributions
 Defin-0.285
Ct-0.279
 Prosecut-0.263
 Proceed-0.241
Score-0.240
Token renamed
Token ID25121
Feature activation-7.38e-01

Pos logit contributions
 Defin+0.090
Ct+0.088
 Prosecut+0.083
 Proceed+0.076
Score+0.075
Neg logit contributions
wig-0.072
 seiz-0.072
ris-0.063
unin-0.062
models-0.061
Token can
Token ID460
Feature activation+1.90e-02

Pos logit contributions
wig+0.925
 seiz+0.920
ris+0.800
unin+0.785
models+0.773
Neg logit contributions
 Defin-1.162
Ct-1.138
 Prosecut-1.073
 Proceed-0.985
Score-0.979
Token�
Token ID447
Feature activation-2.32e-01

Pos logit contributions
 Defin+0.045
Ct+0.044
 Prosecut+0.042
 Proceed+0.038
Score+0.038
Neg logit contributions
 seiz-0.037
wig-0.037
ris-0.032
unin-0.032
models-0.031
Token�
Token ID247
Feature activation+1.90e-01

Pos logit contributions
wig+0.532
 seiz+0.527
ris+0.476
unin+0.466
models+0.462
Neg logit contributions
 Defin-0.473
Ct-0.462
 Prosecut-0.433
 Proceed-0.390
 Crafting-0.386
Tokent
Token ID83
Feature activation-4.04e-01

Pos logit contributions
 Defin+0.411
Ct+0.402
 Prosecut+0.375
 Proceed+0.341
Score+0.339
Neg logit contributions
wig-0.412
 seiz-0.412
ris-0.363
unin-0.358
models-0.353
Token be
Token ID307
Feature activation+1.65e-01

Pos logit contributions
wig+0.792
 seiz+0.788
ris+0.690
unin+0.675
models+0.663
Neg logit contributions
 Defin-0.962
Ct-0.938
 Prosecut-0.886
 Proceed-0.810
Score-0.804
Token easy
Token ID2562
Feature activation+1.52e+00

Pos logit contributions
 Defin+0.380
Ct+0.371
 Prosecut+0.348
 Proceed+0.319
Score+0.316
Neg logit contributions
wig-0.338
 seiz-0.336
ris-0.295
unin-0.290
models-0.286
Token be
Token ID307
Feature activation+2.54e-01

Pos logit contributions
 Defin+3.264
Ct+3.227
 Prosecut+3.000
Score+2.776
 Crafting+2.739
Neg logit contributions
wig-3.226
 seiz-3.217
ris-2.850
unin-2.844
lean-2.745
Token­
Token ID3907
Feature activation+9.10e-01

Pos logit contributions
 Defin+0.586
Ct+0.574
 Prosecut+0.540
 Proceed+0.493
Score+0.491
Neg logit contributions
wig-0.515
 seiz-0.514
ris-0.450
unin-0.444
models-0.436
Tokening
Token ID278
Feature activation+1.53e-02

Pos logit contributions
 Defin+1.928
Ct+1.916
 Prosecut+1.765
Score+1.630
 Proceed+1.615
Neg logit contributions
 seiz-1.973
wig-1.973
ris-1.706
unin-1.699
models-1.678
Token Jimmy
Token ID12963
Feature activation-2.65e-01

Pos logit contributions
 Defin+0.036
Ct+0.035
 Prosecut+0.033
 Proceed+0.030
Score+0.030
Neg logit contributions
wig-0.030
 seiz-0.030
ris-0.027
unin-0.026
models-0.026
Token Carter
Token ID10831
Feature activation-3.47e-01

Pos logit contributions
wig+0.586
 seiz+0.584
ris+0.516
unin+0.508
models+0.501
Neg logit contributions
 Defin-0.566
Ct-0.554
 Prosecut-0.518
 Proceed-0.469
Score-0.466
Token (
Token ID357
Feature activation-1.26e-01

Pos logit contributions
 Defin+0.978
Ct+0.957
 Prosecut+0.901
 Proceed+0.826
Score+0.821
Neg logit contributions
wig-0.822
 seiz-0.820
ris-0.715
unin-0.706
models-0.691
Token1977
Token ID37781
Feature activation-4.81e-01

Pos logit contributions
wig+0.297
 seiz+0.295
ris+0.264
unin+0.260
models+0.257
Neg logit contributions
 Defin-0.253
Ct-0.246
 Prosecut-0.229
 Proceed-0.206
Score-0.204
Token)
Token ID8
Feature activation+1.91e-01

Pos logit contributions
wig+0.879
 seiz+0.877
ris+0.755
unin+0.742
models+0.727
Neg logit contributions
 Defin-1.212
Ct-1.189
 Prosecut-1.123
 Proceed-1.035
Score-1.030
Token
Token ID198
Feature activation+2.47e-01

Pos logit contributions
 Defin+0.413
Ct+0.403
 Prosecut+0.377
 Proceed+0.342
Score+0.340
Neg logit contributions
wig-0.417
 seiz-0.416
ris-0.368
unin-0.363
models-0.356
Token
Token ID198
Feature activation+2.29e-01

Pos logit contributions
 Defin+0.632
Ct+0.590
 Prosecut+0.580
 Proceed+0.524
 Divide+0.518
Neg logit contributions
wig-0.462
ris-0.404
 seiz-0.402
models-0.388
unin-0.386
TokenGrade
Token ID42233
Feature activation+1.54e+00

Pos logit contributions
 Defin+0.414
Ct+0.406
 Prosecut+0.370
Score+0.331
 Proceed+0.330
Neg logit contributions
 seiz-0.581
wig-0.579
ris-0.520
unin-0.514
models-0.506
Token:
Token ID25
Feature activation-1.16e-01

Pos logit contributions
 Defin+3.766
Ct+3.627
 Prosecut+3.396
Score+3.206
 Divide+3.175
Neg logit contributions
wig-2.914
 seiz-2.904
unin-2.504
ris-2.477
models-2.426
Token A
Token ID317
Feature activation-2.92e-01

Pos logit contributions
wig+0.250
 seiz+0.246
ris+0.220
unin+0.216
models+0.213
Neg logit contributions
 Defin-0.249
Ct-0.246
 Prosecut-0.232
 Proceed-0.207
Score-0.205
Token
Token ID198
Feature activation+1.10e-01

Pos logit contributions
wig+0.597
 seiz+0.595
ris+0.519
unin+0.512
models+0.505
Neg logit contributions
 Defin-0.676
Ct-0.659
 Prosecut-0.617
 Proceed-0.567
Score-0.566
Token
Token ID198
Feature activation+1.11e-01

Pos logit contributions
 Defin+0.283
Ct+0.263
 Prosecut+0.260
 Proceed+0.234
 Divide+0.232
Neg logit contributions
wig-0.202
ris-0.177
 seiz-0.174
models-0.170
lot-0.169
Token�
Token ID447
Feature activation+1.87e-01

Pos logit contributions
 Defin+0.199
Ct+0.195
 Prosecut+0.178
 Proceed+0.159
Score+0.159
Neg logit contributions
wig-0.281
 seiz-0.281
ris-0.252
unin-0.249
models-0.246
Token to
Token ID284
Feature activation+1.20e-01

Pos logit contributions
 seiz+0.611
wig+0.608
unin+0.525
ris+0.523
lean+0.519
Neg logit contributions
 Defin-0.638
Ct-0.635
 Prosecut-0.582
 Proceed-0.540
Score-0.535
Token the
Token ID262
Feature activation+1.21e-01

Pos logit contributions
 Defin+0.277
Ct+0.271
 Prosecut+0.255
 Proceed+0.234
Score+0.232
Neg logit contributions
wig-0.244
 seiz-0.243
ris-0.212
unin-0.209
models-0.206
Token current
Token ID1459
Feature activation+2.77e-01

Pos logit contributions
 Defin+0.278
Ct+0.272
 Prosecut+0.256
 Proceed+0.233
Score+0.232
Neg logit contributions
wig-0.247
 seiz-0.246
ris-0.216
unin-0.212
models-0.209
Token and
Token ID290
Feature activation+9.18e-02

Pos logit contributions
 Defin+0.652
Ct+0.638
 Prosecut+0.601
 Proceed+0.552
Score+0.549
Neg logit contributions
wig-0.549
 seiz-0.548
ris-0.477
unin-0.470
models-0.463
Token future
Token ID2003
Feature activation+4.01e-01

Pos logit contributions
 Defin+0.209
Ct+0.204
 Prosecut+0.191
 Proceed+0.175
Score+0.174
Neg logit contributions
wig-0.191
 seiz-0.190
ris-0.166
unin-0.164
models-0.161
Token economic
Token ID3034
Feature activation+1.54e+00

Pos logit contributions
 Defin+0.937
Ct+0.919
 Prosecut+0.864
 Proceed+0.793
Score+0.789
Neg logit contributions
wig-0.797
 seiz-0.797
ris-0.693
unin-0.686
models-0.673
Token outlook
Token ID19360
Feature activation-7.24e-01

Pos logit contributions
 Defin+3.451
Ct+3.390
 Prosecut+3.180
 Divide+2.948
Score+2.913
Neg logit contributions
wig-3.173
 seiz-3.157
unin-2.758
ris-2.724
ppo-2.663
Token,
Token ID11
Feature activation-5.12e-03

Pos logit contributions
wig+1.371
 seiz+1.367
ris+1.182
unin+1.162
models+1.141
Neg logit contributions
 Defin-1.773
Ct-1.739
 Prosecut-1.639
 Proceed-1.508
Score-1.501
Token�
Token ID447
Feature activation-7.16e-02

Pos logit contributions
wig+0.012
 seiz+0.012
ris+0.011
unin+0.010
models+0.010
Neg logit contributions
 Defin-0.010
Ct-0.010
 Prosecut-0.009
 Proceed-0.008
Score-0.008
Token�
Token ID251
Feature activation-8.68e-02

Pos logit contributions
wig+0.139
 seiz+0.136
ris+0.121
unin+0.118
models+0.117
Neg logit contributions
 Defin-0.172
Ct-0.168
 Prosecut-0.159
 Proceed-0.146
 Crafting-0.144
Token according
Token ID1864
Feature activation-4.27e-01

Pos logit contributions
wig+0.195
 seiz+0.193
ris+0.173
unin+0.169
models+0.167
Neg logit contributions
 Defin-0.183
Ct-0.178
 Prosecut-0.167
 Proceed-0.150
Score-0.150
Token game
Token ID983
Feature activation-3.72e-01

Pos logit contributions
 Defin+0.028
Ct+0.027
 Prosecut+0.026
 Proceed+0.023
Score+0.023
Neg logit contributions
wig-0.025
 seiz-0.025
ris-0.022
unin-0.021
models-0.021
Token still
Token ID991
Feature activation-3.55e-02

Pos logit contributions
wig+0.747
 seiz+0.743
ris+0.655
unin+0.639
models+0.631
Neg logit contributions
 Defin-0.870
Ct-0.850
 Prosecut-0.800
 Proceed-0.731
Score-0.726
Token has
Token ID468
Feature activation-8.49e-02

Pos logit contributions
wig+0.071
 seiz+0.070
ris+0.062
unin+0.060
models+0.060
Neg logit contributions
 Defin-0.084
Ct-0.082
 Prosecut-0.077
 Proceed-0.070
Score-0.070
Token 65
Token ID6135
Feature activation-4.16e-01

Pos logit contributions
wig+0.169
 seiz+0.168
ris+0.147
unin+0.144
models+0.142
Neg logit contributions
 Defin-0.200
Ct-0.196
 Prosecut-0.184
 Proceed-0.169
Score-0.168
Token million
Token ID1510
Feature activation-6.94e-01

Pos logit contributions
 seiz+0.713
wig+0.712
ris+0.603
unin+0.589
models+0.579
Neg logit contributions
 Defin-1.096
Ct-1.076
 Prosecut-1.021
 Proceed-0.944
Score-0.941
Token players
Token ID1938
Feature activation+3.47e-01

Pos logit contributions
wig+1.384
 seiz+1.373
ris+1.205
unin+1.184
models+1.165
Neg logit contributions
 Defin-1.630
Ct-1.592
 Prosecut-1.502
 Proceed-1.371
Score-1.358
Token worldwide
Token ID8688
Feature activation+3.91e-01

Pos logit contributions
 Defin+0.872
Ct+0.857
 Prosecut+0.808
 Proceed+0.749
Score+0.746
Neg logit contributions
wig-0.631
 seiz-0.631
ris-0.536
unin-0.531
models-0.517
Token,
Token ID11
Feature activation+2.33e-01

Pos logit contributions
 Defin+0.963
Ct+0.949
 Prosecut+0.896
 Proceed+0.833
Score+0.827
Neg logit contributions
wig-0.720
 seiz-0.720
unin-0.613
ris-0.611
models-0.593
Token though
Token ID996
Feature activation+9.41e-02

Pos logit contributions
 Defin+0.574
Ct+0.562
 Prosecut+0.530
 Proceed+0.487
Score+0.485
Neg logit contributions
wig-0.439
 seiz-0.437
ris-0.379
unin-0.371
models-0.365
Token its
Token ID663
Feature activation+1.27e-01

Pos logit contributions
 Defin+0.219
Ct+0.214
 Prosecut+0.202
 Proceed+0.184
Score+0.183
Neg logit contributions
wig-0.190
 seiz-0.189
ris-0.166
unin-0.163
models-0.160
Token popularity
Token ID11533
Feature activation+1.41e-01

Pos logit contributions
 Defin+0.301
Ct+0.294
 Prosecut+0.277
 Proceed+0.254
Score+0.252
Neg logit contributions
wig-0.253
 seiz-0.252
ris-0.220
unin-0.216
models-0.213
Token $
Token ID720
Feature activation+7.14e-01

Pos logit contributions
wig+0.705
 seiz+0.700
ris+0.607
unin+0.590
models+0.581
Neg logit contributions
 Defin-1.010
Ct-0.987
 Prosecut-0.935
 Proceed-0.860
Score-0.856
Token15
Token ID1314
Feature activation+1.05e-01

Pos logit contributions
 Defin+1.530
Ct+1.485
 Prosecut+1.397
 Proceed+1.264
Score+1.250
Neg logit contributions
wig-1.577
 seiz-1.561
ris-1.395
unin-1.367
models-1.356
Token,
Token ID11
Feature activation-2.36e-01

Pos logit contributions
 Defin+0.277
Ct+0.274
 Prosecut+0.258
 Proceed+0.241
Score+0.239
Neg logit contributions
wig-0.179
 seiz-0.178
ris-0.149
unin-0.147
models-0.145
Token000
Token ID830
Feature activation-4.23e-01

Pos logit contributions
wig+0.395
 seiz+0.393
ris+0.333
unin+0.327
models+0.320
Neg logit contributions
 Defin-0.631
Ct-0.620
 Prosecut-0.588
 Proceed-0.545
Score-0.542
Token to
Token ID284
Feature activation+7.69e-02

Pos logit contributions
wig+0.791
 seiz+0.787
ris+0.681
unin+0.669
models+0.657
Neg logit contributions
 Defin-1.046
Ct-1.025
 Prosecut-0.968
 Proceed-0.891
Score-0.886
Token build
Token ID1382
Feature activation+4.11e-01

Pos logit contributions
 Defin+0.175
Ct+0.172
 Prosecut+0.161
 Proceed+0.147
Score+0.146
Neg logit contributions
wig-0.158
 seiz-0.158
ris-0.138
unin-0.136
models-0.134
Token (
Token ID357
Feature activation-1.43e-01

Pos logit contributions
 Defin+1.007
Ct+0.996
 Prosecut+0.930
Score+0.864
 Proceed+0.861
Neg logit contributions
 seiz-0.773
wig-0.769
ris-0.659
unin-0.653
ppo-0.638
Tokenthe
Token ID1169
Feature activation-1.63e-01

Pos logit contributions
wig+0.291
 seiz+0.289
ris+0.255
unin+0.250
models+0.247
Neg logit contributions
 Defin-0.329
Ct-0.321
 Prosecut-0.303
 Proceed-0.276
Score-0.274
Token steep
Token ID14559
Feature activation+1.12e-01

Pos logit contributions
wig+0.331
 seiz+0.329
ris+0.289
unin+0.284
models+0.280
Neg logit contributions
 Defin-0.378
Ct-0.369
 Prosecut-0.347
 Proceed-0.317
Score-0.314
Tokenle
Token ID293
Feature activation+1.90e-01

Pos logit contributions
 Defin+0.269
Ct+0.263
 Prosecut+0.248
 Proceed+0.228
Score+0.226
Neg logit contributions
wig-0.219
 seiz-0.218
ris-0.189
unin-0.187
models-0.183
Token was
Token ID373
Feature activation+7.79e-02

Pos logit contributions
 Defin+0.442
Ct+0.432
 Prosecut+0.407
 Proceed+0.372
Score+0.371
Neg logit contributions
wig-0.384
 seiz-0.383
ris-0.334
unin-0.329
models-0.323
Token the
Token ID262
Feature activation+3.02e-01

Pos logit contributions
 Defin+1.236
Ct+1.222
 Prosecut+1.125
Score+1.026
 Divide+1.024
Neg logit contributions
wig-1.304
 seiz-1.267
ris-1.123
unin-1.120
models-1.120
Token chest
Token ID7721
Feature activation-6.44e-01

Pos logit contributions
 Defin+0.777
Ct+0.763
 Prosecut+0.722
 Proceed+0.667
Score+0.664
Neg logit contributions
wig-0.533
 seiz-0.529
ris-0.455
unin-0.446
models-0.438
Token with
Token ID351
Feature activation+1.97e-01

Pos logit contributions
wig+1.197
 seiz+1.191
ris+1.037
unin+1.007
models+0.993
Neg logit contributions
 Defin-1.603
Ct-1.565
 Prosecut-1.487
 Proceed-1.367
Score-1.353
Token that
Token ID326
Feature activation+2.24e-02

Pos logit contributions
 Defin+0.440
Ct+0.430
 Prosecut+0.403
 Proceed+0.368
Score+0.365
Neg logit contributions
wig-0.415
 seiz-0.413
ris-0.364
unin-0.358
models-0.353
Token long
Token ID890
Feature activation+4.95e-01

Pos logit contributions
 Defin+0.050
Ct+0.049
 Prosecut+0.046
 Proceed+0.042
Score+0.042
Neg logit contributions
wig-0.047
 seiz-0.047
ris-0.041
unin-0.041
models-0.040
Token jab
Token ID33896
Feature activation+3.84e-01

Pos logit contributions
 Defin+1.049
Ct+1.028
 Prosecut+0.956
 Proceed+0.868
Score+0.863
Neg logit contributions
wig-1.096
 seiz-1.089
ris-0.965
unin-0.956
models-0.947
Token and
Token ID290
Feature activation-1.06e-04

Pos logit contributions
 Defin+0.964
Ct+0.944
 Prosecut+0.891
 Proceed+0.822
Score+0.818
Neg logit contributions
wig-0.706
 seiz-0.703
ris-0.605
unin-0.594
models-0.584
Token not
Token ID407
Feature activation+2.21e-01

Pos logit contributions
 seiz+0.000
wig+0.000
ris+0.000
unin+0.000
models+0.000
Neg logit contributions
 Defin-0.000
Ct-0.000
 Prosecut-0.000
 Proceed-0.000
 Crafting-0.000
Token only
Token ID691
Feature activation+5.01e-01

Pos logit contributions
 Defin+0.520
Ct+0.509
 Prosecut+0.479
 Proceed+0.439
Score+0.435
Neg logit contributions
wig-0.440
 seiz-0.438
ris-0.384
unin-0.375
models-0.370
Token will
Token ID481
Feature activation-3.00e-01

Pos logit contributions
 Defin+1.158
Ct+1.133
 Prosecut+1.066
 Proceed+0.973
Score+0.964
Neg logit contributions
wig-1.021
 seiz-1.020
ris-0.897
unin-0.872
models-0.864
Token they
Token ID484
Feature activation+2.65e-01

Pos logit contributions
wig+0.590
 seiz+0.590
ris+0.515
unin+0.502
models+0.497
Neg logit contributions
 Defin-0.711
Ct-0.695
 Prosecut-0.657
 Proceed-0.601
Score-0.596
Token at
Token ID379
Feature activation+7.50e-01

Pos logit contributions
wig+0.953
 seiz+0.950
ris+0.810
unin+0.798
models+0.783
Neg logit contributions
 Defin-1.378
Ct-1.354
 Prosecut-1.280
 Proceed-1.185
Score-1.175
Token the
Token ID262
Feature activation+3.28e-01

Pos logit contributions
 Defin+1.448
Ct+1.408
 Prosecut+1.304
 Proceed+1.181
Score+1.160
Neg logit contributions
wig-1.809
 seiz-1.800
ris-1.608
unin-1.593
models-1.579
Token time
Token ID640
Feature activation+1.35e-01

Pos logit contributions
 Defin+0.575
Ct+0.558
 Prosecut+0.514
 Proceed+0.454
Score+0.450
Neg logit contributions
wig-0.852
 seiz-0.849
ris-0.767
unin-0.757
models-0.749
Token.
Token ID13
Feature activation+4.22e-01

Pos logit contributions
 Defin+0.298
Ct+0.295
 Prosecut+0.275
 Proceed+0.252
 Crafting+0.248
Neg logit contributions
wig-0.287
 seiz-0.285
unin-0.248
ris-0.248
models-0.244
Token
Token ID198
Feature activation+5.41e-01

Pos logit contributions
 Defin+0.902
Ct+0.879
 Prosecut+0.824
 Proceed+0.751
Score+0.739
Neg logit contributions
 seiz-0.938
wig-0.937
ris-0.825
unin-0.816
models-0.803
Token
Token ID198
Feature activation+4.83e-01

Pos logit contributions
 Defin+1.353
Ct+1.275
 Prosecut+1.245
 Proceed+1.116
 Divide+1.113
Neg logit contributions
wig-1.039
 seiz-0.924
ris-0.913
models-0.873
unin-0.871
TokenEarlier
Token ID13689
Feature activation+1.42e-01

Pos logit contributions
 Defin+0.877
Ct+0.868
 Prosecut+0.785
Score+0.710
 Proceed+0.706
Neg logit contributions
 seiz-1.225
wig-1.212
ris-1.081
unin-1.072
models-1.057
Token in
Token ID287
Feature activation+4.32e-02

Pos logit contributions
 Defin+0.320
Ct+0.313
 Prosecut+0.294
 Proceed+0.269
Score+0.266
Neg logit contributions
wig-0.296
 seiz-0.295
ris-0.259
unin-0.255
models-0.251
Token the
Token ID262
Feature activation+1.76e-01

Pos logit contributions
 Defin+0.092
Ct+0.091
 Prosecut+0.084
 Proceed+0.077
 Crafting+0.076
Neg logit contributions
wig-0.095
 seiz-0.095
ris-0.084
unin-0.083
models-0.081
Token day
Token ID1110
Feature activation-6.93e-02

Pos logit contributions
 Defin+0.465
Ct+0.457
 Prosecut+0.434
 Proceed+0.403
Score+0.399
Neg logit contributions
wig-0.297
 seiz-0.296
ris-0.251
unin-0.247
models-0.243
Token a
Token ID257
Feature activation-2.69e-02

Pos logit contributions
wig+0.122
 seiz+0.122
unin+0.102
ris+0.102
models+0.100
Neg logit contributions
 Defin-0.178
Ct-0.176
 Prosecut-0.166
 Proceed-0.155
Score-0.152
Token level
Token ID1241
Feature activation+3.96e-01

Pos logit contributions
wig+0.481
 seiz+0.479
ris+0.419
unin+0.412
models+0.406
Neg logit contributions
 Defin-0.556
Ct-0.544
 Prosecut-0.512
 Proceed-0.468
Score-0.465
Token can
Token ID460
Feature activation+8.31e-02

Pos logit contributions
 Defin+0.884
Ct+0.863
 Prosecut+0.807
 Proceed+0.739
Score+0.737
Neg logit contributions
wig-0.835
 seiz-0.830
ris-0.729
unin-0.722
models-0.709
Token be
Token ID307
Feature activation+1.07e-01

Pos logit contributions
 Defin+0.197
Ct+0.193
 Prosecut+0.181
 Proceed+0.166
Score+0.165
Neg logit contributions
wig-0.164
 seiz-0.163
ris-0.143
unin-0.140
models-0.138
Token tailored
Token ID27571
Feature activation-9.26e-02

Pos logit contributions
 Defin+0.252
Ct+0.245
 Prosecut+0.232
 Proceed+0.213
Score+0.211
Neg logit contributions
wig-0.214
 seiz-0.213
ris-0.187
unin-0.182
models-0.179
Token via
Token ID2884
Feature activation-4.62e-01

Pos logit contributions
wig+0.184
 seiz+0.183
ris+0.160
unin+0.157
models+0.155
Neg logit contributions
 Defin-0.218
Ct-0.214
 Prosecut-0.201
 Proceed-0.184
Score-0.183
Token remote
Token ID6569
Feature activation+6.87e-01

Pos logit contributions
wig+0.937
 seiz+0.933
ris+0.816
unin+0.801
models+0.790
Neg logit contributions
 Defin-1.066
Ct-1.044
 Prosecut-0.985
 Proceed-0.898
Score-0.892
Token control
Token ID1630
Feature activation-6.13e-01

Pos logit contributions
 Defin+1.542
Ct+1.512
 Prosecut+1.409
 Proceed+1.290
Score+1.279
Neg logit contributions
wig-1.435
 seiz-1.432
ris-1.261
unin-1.245
models-1.225
Token or
Token ID393
Feature activation-9.47e-01

Pos logit contributions
wig+1.105
 seiz+1.097
ris+0.945
unin+0.922
models+0.911
Neg logit contributions
 Defin-1.562
Ct-1.528
 Prosecut-1.451
 Proceed-1.334
Score-1.328
Token rear
Token ID8286
Feature activation-2.33e-01

Pos logit contributions
wig+1.925
 seiz+1.913
ris+1.672
unin+1.629
models+1.607
Neg logit contributions
 Defin-2.190
Ct-2.138
 Prosecut-2.029
 Proceed-1.844
Score-1.836
Token-
Token ID12
Feature activation-6.66e-02

Pos logit contributions
wig+0.452
 seiz+0.450
ris+0.392
unin+0.385
models+0.378
Neg logit contributions
 Defin-0.561
Ct-0.549
 Prosecut-0.517
 Proceed-0.475
Score-0.472
Tokenmounted
Token ID29728
Feature activation-7.33e-01

Pos logit contributions
wig+0.110
 seiz+0.110
ris+0.092
unin+0.090
models+0.088
Neg logit contributions
 Defin-0.178
Ct-0.176
 Prosecut-0.166
 Proceed-0.154
Score-0.154
Token prices
Token ID4536
Feature activation+4.93e-01

Pos logit contributions
wig+1.171
 seiz+1.170
ris+1.029
unin+0.982
models+0.980
Neg logit contributions
 Defin-1.435
Ct-1.392
 Prosecut-1.316
 Proceed-1.206
Score-1.184
Token for
Token ID329
Feature activation-8.78e-02

Pos logit contributions
 Defin+1.183
Ct+1.163
 Prosecut+1.098
 Proceed+1.007
 Crafting+1.002
Neg logit contributions
wig-0.953
 seiz-0.946
ris-0.818
unin-0.815
models-0.795
Token consumers
Token ID7008
Feature activation-1.96e-01

Pos logit contributions
wig+0.162
 seiz+0.162
ris+0.140
unin+0.137
models+0.135
Neg logit contributions
 Defin-0.220
Ct-0.215
 Prosecut-0.203
 Proceed-0.187
Score-0.186
Token.
Token ID13
Feature activation+1.29e-01

Pos logit contributions
wig+0.354
 seiz+0.352
ris+0.302
unin+0.298
models+0.292
Neg logit contributions
 Defin-0.497
Ct-0.488
 Prosecut-0.462
 Proceed-0.426
Score-0.423
Token
Token ID198
Feature activation+2.83e-01

Pos logit contributions
 Defin+0.272
Ct+0.265
 Prosecut+0.249
 Proceed+0.228
 Crafting+0.225
Neg logit contributions
 seiz-0.290
wig-0.288
ris-0.253
unin-0.252
models-0.246
Token
Token ID198
Feature activation+2.17e-01

Pos logit contributions
 Defin+0.722
Ct+0.679
 Prosecut+0.658
 Proceed+0.594
 Divide+0.589
Neg logit contributions
wig-0.531
ris-0.467
 seiz-0.465
models-0.448
unin-0.445
Token"
Token ID1
Feature activation+1.07e+00

Pos logit contributions
 Defin+0.477
Ct+0.472
 Prosecut+0.437
Score+0.401
 Proceed+0.399
Neg logit contributions
 seiz-0.468
wig-0.464
ris-0.406
unin-0.401
models-0.394
TokenThat
Token ID2504
Feature activation+1.16e-01

Pos logit contributions
 Defin+1.887
Ct+1.835
 Prosecut+1.689
 Proceed+1.493
Score+1.483
Neg logit contributions
wig-2.767
 seiz-2.761
ris-2.488
unin-2.456
models-2.427
Token is
Token ID318
Feature activation+1.19e-01

Pos logit contributions
 Defin+0.277
Ct+0.271
 Prosecut+0.255
 Proceed+0.234
Score+0.233
Neg logit contributions
wig-0.226
 seiz-0.225
ris-0.196
unin-0.192
models-0.189
Token up
Token ID510
Feature activation-3.67e-02

Pos logit contributions
 Defin+0.273
Ct+0.268
 Prosecut+0.252
 Proceed+0.230
Score+0.229
Neg logit contributions
wig-0.242
 seiz-0.241
ris-0.212
unin-0.208
models-0.205
Token to
Token ID284
Feature activation-7.06e-01

Pos logit contributions
wig+0.078
 seiz+0.078
unin+0.067
ppo+0.066
models+0.066
Neg logit contributions
 Defin-0.081
Ct-0.080
 Prosecut-0.074
 Proceed-0.068
Score-0.068
Tokenrop
Token ID1773
Feature activation+3.44e-01

Pos logit contributions
wig+1.528
 seiz+1.507
ris+1.312
unin+1.277
models+1.246
Neg logit contributions
 Defin-2.320
Ct-2.271
 Prosecut-2.156
 Proceed-1.993
Score-1.983
Tokenost
Token ID455
Feature activation+1.41e-02

Pos logit contributions
 Defin+0.818
Ct+0.805
 Prosecut+0.757
 Proceed+0.696
Score+0.690
Neg logit contributions
 seiz-0.673
wig-0.671
ris-0.579
unin-0.570
models-0.562
Tokenach
Token ID620
Feature activation+1.03e-01

Pos logit contributions
 Defin+0.034
Ct+0.034
 Prosecut+0.032
 Proceed+0.029
Score+0.029
Neg logit contributions
 seiz-0.027
wig-0.027
ris-0.023
unin-0.023
models-0.022
Token,
Token ID11
Feature activation-1.89e-01

Pos logit contributions
 Defin+0.250
Ct+0.245
 Prosecut+0.232
 Proceed+0.213
Score+0.211
Neg logit contributions
wig-0.198
 seiz-0.198
ris-0.171
unin-0.168
models-0.166
Token is
Token ID318
Feature activation-7.22e-02

Pos logit contributions
wig+0.388
 seiz+0.386
ris+0.339
unin+0.332
models+0.328
Neg logit contributions
 Defin-0.434
Ct-0.424
 Prosecut-0.398
 Proceed-0.364
Score-0.362
Token a
Token ID257
Feature activation+2.17e-02

Pos logit contributions
wig+0.147
 seiz+0.146
ris+0.128
unin+0.126
models+0.124
Neg logit contributions
 Defin-0.167
Ct-0.163
 Prosecut-0.153
 Proceed-0.140
Score-0.140
Token recipient
Token ID17800
Feature activation-8.50e-01

Pos logit contributions
 Defin+0.051
Ct+0.050
 Prosecut+0.046
 Proceed+0.042
Score+0.042
Neg logit contributions
wig-0.044
 seiz-0.044
ris-0.038
unin-0.037
models-0.037
Token of
Token ID286
Feature activation+3.48e-01

Pos logit contributions
wig+1.919
 seiz+1.917
ris+1.698
unin+1.684
models+1.662
Neg logit contributions
 Defin-1.766
Ct-1.737
 Prosecut-1.615
 Proceed-1.474
Score-1.448
Token $
Token ID720
Feature activation+8.05e-01

Pos logit contributions
 Defin+0.848
Ct+0.826
 Prosecut+0.785
 Proceed+0.729
Score+0.715
Neg logit contributions
wig-0.664
 seiz-0.658
ris-0.569
unin-0.561
models-0.554
Token155
Token ID18742
Feature activation-1.07e-01

Pos logit contributions
 Defin+1.663
Ct+1.620
 Prosecut+1.513
 Proceed+1.365
Score+1.354
Neg logit contributions
wig-1.836
 seiz-1.826
ris-1.629
unin-1.602
models-1.583
Token.
Token ID13
Feature activation-1.43e-01

Pos logit contributions
wig+0.199
 seiz+0.199
unin+0.169
ris+0.169
models+0.166
Neg logit contributions
Ct-0.263
 Defin-0.261
 Prosecut-0.244
 Proceed-0.228
Score-0.226
Token to
Token ID284
Feature activation+1.25e-01

Pos logit contributions
 Defin+1.183
Ct+1.163
 Prosecut+1.083
Score+0.993
 Proceed+0.985
Neg logit contributions
wig-1.134
 seiz-1.120
unin-0.982
ris-0.974
models-0.964
Token feel
Token ID1254
Feature activation-6.45e-01

Pos logit contributions
 Defin+0.290
Ct+0.285
 Prosecut+0.268
 Proceed+0.244
Score+0.241
Neg logit contributions
wig-0.252
 seiz-0.250
ris-0.221
unin-0.216
lean-0.212
Token comfortable
Token ID6792
Feature activation+9.64e-01

Pos logit contributions
wig+1.301
 seiz+1.289
ris+1.150
unin+1.110
lean+1.089
Neg logit contributions
 Defin-1.508
Ct-1.470
 Prosecut-1.386
 Proceed-1.272
 Crafting-1.251
Token utilizing
Token ID25137
Feature activation-7.85e-01

Pos logit contributions
 Defin+2.030
Ct+1.999
 Prosecut+1.874
 Proceed+1.687
Score+1.684
Neg logit contributions
wig-2.133
 seiz-2.112
ris-1.880
unin-1.860
models-1.838
Token starter
Token ID14217
Feature activation-5.19e-01

Pos logit contributions
wig+1.581
 seiz+1.577
ris+1.398
unin+1.349
models+1.332
Neg logit contributions
 Defin-1.827
Ct-1.789
 Prosecut-1.672
 Proceed-1.528
Score-1.515
Token Trevor
Token ID25389
Feature activation-6.30e-01

Pos logit contributions
wig+1.162
 seiz+1.160
ris+1.039
unin+1.009
models+1.003
Neg logit contributions
 Defin-1.089
Ct-1.064
 Prosecut-0.990
 Proceed-0.895
Score-0.887
Token Knight
Token ID6700
Feature activation+1.44e-01

Pos logit contributions
wig+1.394
 seiz+1.378
ris+1.232
unin+1.204
models+1.187
Neg logit contributions
 Defin-1.351
Ct-1.312
 Prosecut-1.228
 Proceed-1.111
Score-1.107
Token as
Token ID355
Feature activation-2.39e-01

Pos logit contributions
 Defin+0.353
Ct+0.345
 Prosecut+0.326
 Proceed+0.300
Score+0.298
Neg logit contributions
wig-0.274
 seiz-0.273
ris-0.236
unin-0.232
models-0.228
Token a
Token ID257
Feature activation-2.93e-01

Pos logit contributions
wig+0.478
 seiz+0.476
ris+0.416
unin+0.410
models+0.403
Neg logit contributions
 Defin-0.561
Ct-0.548
 Prosecut-0.517
 Proceed-0.473
Score-0.470
Token runner
Token ID17490
Feature activation+5.07e-01

Pos logit contributions
wig+0.592
 seiz+0.591
ris+0.518
unin+0.503
models+0.494
Neg logit contributions
 Defin-0.683
Ct-0.666
 Prosecut-0.625
 Proceed-0.570
Score-0.564
Token;
Token ID26
Feature activation-6.28e-01

Pos logit contributions
 Defin+1.238
Ct+1.213
 Prosecut+1.145
 Proceed+1.055
Score+1.053
Neg logit contributions
wig-0.957
 seiz-0.955
ris-0.826
unin-0.814
models-0.801
Token luxury
Token ID13064
Feature activation+6.86e-01

Pos logit contributions
 Defin+0.069
Ct+0.067
 Prosecut+0.063
 Proceed+0.058
Score+0.057
Neg logit contributions
wig-0.061
 seiz-0.060
ris-0.053
unin-0.052
models-0.051
Token for
Token ID329
Feature activation+1.04e-01

Pos logit contributions
 Defin+1.649
Ct+1.614
 Prosecut+1.521
 Proceed+1.396
Score+1.391
Neg logit contributions
wig-1.331
 seiz-1.320
ris-1.151
unin-1.132
models-1.113
Token a
Token ID257
Feature activation-2.18e-01

Pos logit contributions
 Defin+0.253
Ct+0.247
 Prosecut+0.233
 Proceed+0.214
Score+0.213
Neg logit contributions
wig-0.200
 seiz-0.199
ris-0.173
unin-0.170
models-0.167
Token small
Token ID1402
Feature activation-8.54e-02

Pos logit contributions
 seiz+0.425
wig+0.425
ris+0.371
unin+0.363
models+0.355
Neg logit contributions
 Defin-0.522
Ct-0.510
 Prosecut-0.483
 Proceed-0.441
Score-0.438
Token minority
Token ID9137
Feature activation+3.09e-03

Pos logit contributions
wig+0.169
 seiz+0.169
ris+0.147
unin+0.145
models+0.143
Neg logit contributions
 Defin-0.201
Ct-0.197
 Prosecut-0.185
 Proceed-0.170
Score-0.169
Token in
Token ID287
Feature activation+1.84e-01

Pos logit contributions
 Defin+0.007
Ct+0.007
 Prosecut+0.007
 Proceed+0.006
Score+0.006
Neg logit contributions
wig-0.006
 seiz-0.006
ris-0.005
unin-0.005
models-0.005
Token India
Token ID3794
Feature activation+1.13e-01

Pos logit contributions
 Defin+0.423
Ct+0.413
 Prosecut+0.389
 Proceed+0.355
Score+0.353
Neg logit contributions
wig-0.375
 seiz-0.373
ris-0.327
unin-0.322
models-0.317
Token.
Token ID13
Feature activation-1.70e-01

Pos logit contributions
 Defin+0.284
Ct+0.278
 Prosecut+0.262
 Proceed+0.242
Score+0.241
Neg logit contributions
wig-0.207
 seiz-0.206
ris-0.177
unin-0.175
models-0.171
Token This
Token ID770
Feature activation-2.27e-01

Pos logit contributions
wig+0.380
 seiz+0.376
ris+0.341
models+0.332
ppo+0.326
Neg logit contributions
 Defin-0.347
Ct-0.344
 Prosecut-0.316
Score-0.284
 Proceed-0.282
Token is
Token ID318
Feature activation+2.15e-01

Pos logit contributions
 seiz+0.480
wig+0.480
ris+0.423
unin+0.414
models+0.408
Neg logit contributions
 Defin-0.506
Ct-0.495
 Prosecut-0.466
 Proceed-0.423
Score-0.418
Token illustrated
Token ID18542
Feature activation+7.14e-01

Pos logit contributions
 Defin+0.486
Ct+0.475
 Prosecut+0.447
 Proceed+0.407
Score+0.404
Neg logit contributions
wig-0.450
 seiz-0.448
ris-0.395
unin-0.387
models-0.382
Token-
Token ID12
Feature activation+1.28e-01

Pos logit contributions
wig+1.033
 seiz+1.031
ris+0.886
unin+0.872
models+0.858
Neg logit contributions
 Defin-1.392
Ct-1.362
 Prosecut-1.286
 Proceed-1.186
Score-1.178
Tokenpainted
Token ID47351
Feature activation-7.73e-01

Pos logit contributions
 Defin+0.302
Ct+0.298
 Prosecut+0.279
Score+0.256
 Proceed+0.256
Neg logit contributions
 seiz-0.253
wig-0.249
ris-0.216
unin-0.213
models-0.210
Token wood
Token ID4898
Feature activation+4.14e-02

Pos logit contributions
wig+1.432
 seiz+1.427
ris+1.232
unin+1.207
models+1.188
Neg logit contributions
 Defin-1.927
Ct-1.886
 Prosecut-1.782
 Proceed-1.639
Score-1.631
Token stripes
Token ID28806
Feature activation-4.03e-01

Pos logit contributions
 Defin+0.101
Ct+0.100
 Prosecut+0.093
 Proceed+0.086
Score+0.086
Neg logit contributions
wig-0.079
 seiz-0.078
ris-0.067
unin-0.067
models-0.065
Token (
Token ID357
Feature activation-9.33e-02

Pos logit contributions
wig+0.729
 seiz+0.726
ris+0.627
unin+0.611
models+0.603
Neg logit contributions
 Defin-1.025
Ct-1.002
 Prosecut-0.949
 Proceed-0.873
Score-0.870
Tokenfrom
Token ID6738
Feature activation-1.92e-01

Pos logit contributions
wig+0.189
 seiz+0.189
ris+0.165
unin+0.162
models+0.160
Neg logit contributions
 Defin-0.216
Ct-0.211
 Prosecut-0.199
 Proceed-0.182
Score-0.181
Token $
Token ID720
Feature activation+1.06e+00

Pos logit contributions
wig+0.414
 seiz+0.413
ris+0.364
unin+0.359
models+0.354
Neg logit contributions
 Defin-0.421
Ct-0.411
 Prosecut-0.386
 Proceed-0.351
Score-0.348
Token399
Token ID28771
Feature activation-6.00e-02

Pos logit contributions
 Defin+2.241
Ct+2.182
 Prosecut+2.045
 Proceed+1.849
Score+1.834
Neg logit contributions
wig-2.358
 seiz-2.344
ris-2.086
unin-2.049
models-2.027
Token a
Token ID257
Feature activation+1.71e-02

Pos logit contributions
wig+0.114
 seiz+0.113
ris+0.098
unin+0.096
models+0.094
Neg logit contributions
 Defin-0.147
Ct-0.144
 Prosecut-0.136
 Proceed-0.126
Score-0.125
Token night
Token ID1755
Feature activation+4.60e-02

Pos logit contributions
 Defin+0.042
Ct+0.041
 Prosecut+0.039
 Proceed+0.036
Score+0.036
Neg logit contributions
wig-0.032
 seiz-0.032
ris-0.028
unin-0.027
models-0.027
Token for
Token ID329
Feature activation+7.77e-02

Pos logit contributions
 Defin+0.114
Ct+0.112
 Prosecut+0.105
 Proceed+0.097
Score+0.097
Neg logit contributions
wig-0.086
 seiz-0.085
ris-0.074
unin-0.073
models-0.071
Token plan
Token ID1410
Feature activation-3.79e-01

Pos logit contributions
wig+0.069
 seiz+0.069
ris+0.059
unin+0.058
models+0.057
Neg logit contributions
 Defin-0.095
Ct-0.093
 Prosecut-0.088
 Proceed-0.081
Score-0.081
Token.
Token ID13
Feature activation-1.60e-01

Pos logit contributions
wig+0.683
 seiz+0.679
ris+0.585
unin+0.573
models+0.563
Neg logit contributions
 Defin-0.965
Ct-0.946
 Prosecut-0.895
 Proceed-0.825
Score-0.821
Token However
Token ID2102
Feature activation+6.55e-02

Pos logit contributions
wig+0.355
 seiz+0.351
ris+0.315
models+0.307
unin+0.305
Neg logit contributions
 Defin-0.335
Ct-0.328
 Prosecut-0.303
Score-0.278
 Proceed-0.274
Token,
Token ID11
Feature activation+5.39e-02

Pos logit contributions
Ct+0.168
 Defin+0.167
 Prosecut+0.159
 Proceed+0.145
Score+0.144
Neg logit contributions
wig-0.113
 seiz-0.113
ris-0.095
unin-0.094
lean-0.093
Token Zh
Token ID10511
Feature activation-3.15e-01

Pos logit contributions
 Defin+0.125
Ct+0.122
 Prosecut+0.115
 Proceed+0.105
Score+0.105
Neg logit contributions
wig-0.109
 seiz-0.109
ris-0.096
unin-0.094
models-0.092
Tokenuk
Token ID2724
Feature activation-9.30e-01

Pos logit contributions
wig+0.605
 seiz+0.558
ris+0.530
unin+0.528
models+0.501
Neg logit contributions
 Defin-0.795
Ct-0.764
 Prosecut-0.727
 Proceed-0.666
 Crafting-0.666
Tokenov
Token ID709
Feature activation-2.32e-01

Pos logit contributions
wig+1.396
ris+1.186
unin+1.177
 seiz+1.135
lot+1.123
Neg logit contributions
 Defin-2.764
Ct-2.621
 Prosecut-2.554
 Crafting-2.352
 Proceed-2.341
Token did
Token ID750
Feature activation-9.32e-02

Pos logit contributions
wig+0.462
 seiz+0.460
ris+0.402
unin+0.394
models+0.387
Neg logit contributions
 Defin-0.549
Ct-0.537
 Prosecut-0.504
 Proceed-0.462
Score-0.460
Token not
Token ID407
Feature activation-5.78e-02

Pos logit contributions
wig+0.193
 seiz+0.192
ris+0.169
unin+0.166
models+0.163
Neg logit contributions
 Defin-0.212
Ct-0.207
 Prosecut-0.195
 Proceed-0.178
Score-0.177
Token clarify
Token ID18282
Feature activation-3.91e-01

Pos logit contributions
wig+0.119
 seiz+0.119
ris+0.104
unin+0.102
models+0.100
Neg logit contributions
 Defin-0.132
Ct-0.129
 Prosecut-0.121
 Proceed-0.110
Score-0.110
Token whether
Token ID1771
Feature activation-2.89e-01

Pos logit contributions
 seiz+0.792
wig+0.787
ris+0.690
unin+0.677
models+0.667
Neg logit contributions
 Defin-0.908
Ct-0.882
 Prosecut-0.828
 Proceed-0.760
Score-0.757
Token who
Token ID508
Feature activation+2.38e-01

Pos logit contributions
wig+1.453
 seiz+1.446
ris+1.260
unin+1.226
models+1.205
Neg logit contributions
 Defin-1.888
Ct-1.845
 Prosecut-1.745
 Proceed-1.606
Score-1.591
Token's
Token ID338
Feature activation-1.47e-01

Pos logit contributions
 Defin+0.562
Ct+0.550
 Prosecut+0.518
 Proceed+0.474
Score+0.471
Neg logit contributions
wig-0.473
 seiz-0.470
ris-0.412
unin-0.403
models-0.397
Token given
Token ID1813
Feature activation-3.63e-01

Pos logit contributions
wig+0.297
 seiz+0.295
ris+0.258
unin+0.252
models+0.249
Neg logit contributions
 Defin-0.346
Ct-0.336
 Prosecut-0.318
 Proceed-0.290
Score-0.288
Token Super
Token ID3115
Feature activation+1.17e-01

Pos logit contributions
wig+0.717
 seiz+0.714
ris+0.629
unin+0.612
models+0.604
Neg logit contributions
 Defin-0.861
Ct-0.843
 Prosecut-0.795
 Proceed-0.720
Score-0.716
Token Metroid
Token ID45430
Feature activation-8.35e-01

Pos logit contributions
 Defin+0.317
Ct+0.309
 Prosecut+0.297
 Proceed+0.273
Score+0.269
Neg logit contributions
wig-0.193
 seiz-0.181
ris-0.167
unin-0.160
models-0.159
Token or
Token ID393
Feature activation-1.01e+00

Pos logit contributions
 seiz+1.592
wig+1.588
ris+1.404
unin+1.345
models+1.340
Neg logit contributions
 Defin-2.015
Ct-1.985
 Prosecut-1.891
 Proceed-1.701
 Divide-1.690
Token Symphony
Token ID33195
Feature activation+1.21e-01

Pos logit contributions
wig+2.070
 seiz+2.062
ris+1.850
unin+1.758
lean+1.747
Neg logit contributions
 Defin-2.270
Ct-2.226
 Prosecut-2.090
Score-1.891
 Proceed-1.877
Token of
Token ID286
Feature activation-8.80e-02

Pos logit contributions
 Defin+0.249
Ct+0.242
 Prosecut+0.222
 Proceed+0.207
Score+0.205
Neg logit contributions
wig-0.278
 seiz-0.275
ris-0.243
unin-0.241
models-0.237
Token the
Token ID262
Feature activation-1.17e-01

Pos logit contributions
wig+0.177
 seiz+0.171
ris+0.154
unin+0.152
models+0.149
Neg logit contributions
 Defin-0.205
Ct-0.201
 Prosecut-0.189
Score-0.172
 Proceed-0.171
Token Night
Token ID5265
Feature activation-1.33e-01

Pos logit contributions
wig+0.226
 seiz+0.211
ris+0.198
unin+0.194
models+0.190
Neg logit contributions
 Defin-0.280
Ct-0.272
 Prosecut-0.259
 Proceed-0.230
Score-0.230
Token a
Token ID257
Feature activation-2.52e-01

Pos logit contributions
wig+0.262
 seiz+0.260
ris+0.230
unin+0.222
models+0.221
Neg logit contributions
 Defin-0.316
Ct-0.310
 Prosecut-0.293
 Proceed-0.266
Score-0.264
Token in
Token ID287
Feature activation+1.64e-01

Pos logit contributions
 Defin+0.061
Ct+0.059
 Prosecut+0.056
 Proceed+0.051
Score+0.050
Neg logit contributions
wig-0.060
 seiz-0.060
ris-0.053
unin-0.052
models-0.051
Token the
Token ID262
Feature activation+1.73e-01

Pos logit contributions
 Defin+0.359
Ct+0.351
 Prosecut+0.327
 Proceed+0.301
Score+0.298
Neg logit contributions
wig-0.356
 seiz-0.354
ris-0.311
unin-0.307
models-0.302
Token comments
Token ID3651
Feature activation-1.31e-01

Pos logit contributions
 Defin+0.317
Ct+0.308
 Prosecut+0.285
 Proceed+0.253
Score+0.251
Neg logit contributions
wig-0.436
 seiz-0.435
ris-0.391
unin-0.386
models-0.381
Token!
Token ID0
Feature activation+3.82e-01

Pos logit contributions
wig+0.263
 seiz+0.262
ris+0.229
unin+0.225
models+0.222
Neg logit contributions
 Defin-0.306
Ct-0.299
 Prosecut-0.282
 Proceed-0.258
Score-0.256
Token<|endoftext|>
Token ID50256
Feature activation+3.83e-01

Pos logit contributions
 Defin+0.763
Ct+0.743
 Prosecut+0.689
 Proceed+0.624
Score+0.619
Neg logit contributions
wig-0.903
 seiz-0.901
ris-0.798
unin-0.792
models-0.778
TokenLG
Token ID41257
Feature activation-1.58e+00

Pos logit contributions
 Defin+0.714
Ct+0.707
 Prosecut+0.641
Score+0.585
 Proceed+0.574
Neg logit contributions
 seiz-0.950
wig-0.940
ris-0.838
unin-0.829
models-0.816
Token has
Token ID468
Feature activation-2.78e-01

Pos logit contributions
wig+3.514
 seiz+3.486
ris+3.113
models+3.026
unin+3.004
Neg logit contributions
 Defin-3.296
Ct-3.256
 Prosecut-3.007
 Proceed-2.717
Score-2.716
Token cancelled
Token ID16769
Feature activation-2.64e-01

Pos logit contributions
wig+0.537
 seiz+0.536
ris+0.466
unin+0.455
models+0.448
Neg logit contributions
 Defin-0.670
Ct-0.657
 Prosecut-0.619
 Proceed-0.569
Score-0.565
Token a
Token ID257
Feature activation-1.35e-01

Pos logit contributions
wig+0.551
 seiz+0.549
ris+0.483
unin+0.475
models+0.468
Neg logit contributions
 Defin-0.596
Ct-0.583
 Prosecut-0.547
 Proceed-0.499
Score-0.496
Token series
Token ID2168
Feature activation+2.93e-01

Pos logit contributions
wig+0.259
 seiz+0.258
ris+0.224
unin+0.220
models+0.217
Neg logit contributions
 Defin-0.328
Ct-0.321
 Prosecut-0.303
 Proceed-0.278
Score-0.276
Token of
Token ID286
Feature activation+9.34e-02

Pos logit contributions
 Defin+0.572
Ct+0.571
 Prosecut+0.514
 Proceed+0.473
 Divide+0.470
Neg logit contributions
 seiz-0.691
wig-0.688
ris-0.611
unin-0.609
lean-0.595
Token that
Token ID326
Feature activation+5.77e-02

Pos logit contributions
 Defin+0.152
Ct+0.149
 Prosecut+0.139
 Proceed+0.127
 Divide+0.126
Neg logit contributions
wig-0.141
 seiz-0.141
ris-0.123
unin-0.122
models-0.119
Token gave
Token ID2921
Feature activation+8.92e-02

Pos logit contributions
 Defin+0.139
Ct+0.136
 Prosecut+0.128
 Proceed+0.118
Score+0.117
Neg logit contributions
wig-0.112
 seiz-0.111
ris-0.097
unin-0.095
models-0.094
Token birth
Token ID4082
Feature activation-3.15e-01

Pos logit contributions
 Defin+0.203
Ct+0.198
 Prosecut+0.186
 Proceed+0.170
Score+0.169
Neg logit contributions
wig-0.185
 seiz-0.184
ris-0.162
unin-0.159
models-0.157
Token to
Token ID284
Feature activation-3.61e-01

Pos logit contributions
wig+0.668
 seiz+0.668
ris+0.574
unin+0.572
lean+0.566
Neg logit contributions
 Defin-0.701
Ct-0.693
 Prosecut-0.634
 Proceed-0.589
Score-0.580
Token this
Token ID428
Feature activation-3.80e-01

Pos logit contributions
wig+0.761
 seiz+0.756
ris+0.667
unin+0.654
models+0.647
Neg logit contributions
 Defin-0.810
Ct-0.791
 Prosecut-0.744
 Proceed-0.676
Score-0.674
Token field
Token ID2214
Feature activation-1.72e+00

Pos logit contributions
wig+0.783
 seiz+0.780
ris+0.685
unin+0.672
models+0.663
Neg logit contributions
 Defin-0.866
Ct-0.848
 Prosecut-0.799
 Proceed-0.728
Score-0.727
Token of
Token ID286
Feature activation-7.03e-01

Pos logit contributions
wig+3.160
 seiz+3.088
ris+2.733
unin+2.658
models+2.624
Neg logit contributions
 Defin-4.294
Ct-4.182
 Prosecut-3.978
 Proceed-3.658
Score-3.646
Token research
Token ID2267
Feature activation-8.76e-01

Pos logit contributions
wig+1.357
 seiz+1.346
ris+1.178
unin+1.147
models+1.137
Neg logit contributions
 Defin-1.697
Ct-1.659
 Prosecut-1.569
Score-1.437
 Proceed-1.435
Token.[
Token ID3693
Feature activation+5.19e-01

Pos logit contributions
wig+1.555
 seiz+1.537
ris+1.331
unin+1.295
models+1.280
Neg logit contributions
 Defin-2.253
Ct-2.205
 Prosecut-2.093
 Proceed-1.926
Score-1.921
Token17
Token ID1558
Feature activation+3.79e-01

Pos logit contributions
 Defin+1.058
Ct+1.035
 Prosecut+0.962
 Proceed+0.869
Score+0.864
Neg logit contributions
wig-1.196
 seiz-1.195
ris-1.060
unin-1.047
models-1.031
Token]
Token ID60
Feature activation+2.16e-01

Pos logit contributions
 Defin+0.863
Ct+0.854
 Prosecut+0.791
 Proceed+0.734
Score+0.725
Neg logit contributions
wig-0.782
 seiz-0.779
ris-0.673
unin-0.669
models-0.652
Token,
Token ID11
Feature activation-1.22e-01

Pos logit contributions
 Defin+0.490
Ct+0.480
 Prosecut+0.452
 Proceed+0.415
Score+0.413
Neg logit contributions
wig-0.394
 seiz-0.393
ris-0.342
unin-0.336
models-0.330
Token so
Token ID523
Feature activation-1.55e-02

Pos logit contributions
wig+0.227
 seiz+0.225
ris+0.196
unin+0.191
models+0.189
Neg logit contributions
 Defin-0.304
Ct-0.298
 Prosecut-0.282
 Proceed-0.260
Score-0.258
Token you
Token ID345
Feature activation+1.00e-01

Pos logit contributions
wig+0.031
 seiz+0.030
ris+0.027
unin+0.026
models+0.026
Neg logit contributions
 Defin-0.037
Ct-0.036
 Prosecut-0.034
 Proceed-0.031
Score-0.031
Token can
Token ID460
Feature activation+3.26e-01

Pos logit contributions
 Defin+0.237
Ct+0.232
 Prosecut+0.220
 Proceed+0.200
Score+0.199
Neg logit contributions
wig-0.198
 seiz-0.195
ris-0.172
unin-0.167
models-0.166
Token easily
Token ID3538
Feature activation+6.99e-01

Pos logit contributions
 Defin+0.772
Ct+0.757
 Prosecut+0.720
 Proceed+0.649
Score+0.649
Neg logit contributions
wig-0.646
 seiz-0.635
ris-0.560
unin-0.544
lean-0.538
Token use
Token ID779
Feature activation-1.04e+00

Pos logit contributions
 Defin+1.616
Ct+1.582
 Prosecut+1.495
 Proceed+1.356
Score+1.352
Neg logit contributions
wig-1.425
 seiz-1.409
ris-1.244
unin-1.211
models-1.194
Token this
Token ID428
Feature activation-3.95e-01

Pos logit contributions
wig+2.042
 seiz+2.030
ris+1.771
unin+1.712
models+1.696
Neg logit contributions
 Defin-2.468
Ct-2.395
 Prosecut-2.271
 Proceed-2.061
Score-2.053
Token flour
Token ID10601
Feature activation-4.21e-01

Pos logit contributions
wig+0.779
 seiz+0.773
ris+0.677
unin+0.659
models+0.650
Neg logit contributions
 Defin-0.940
Ct-0.921
 Prosecut-0.867
 Proceed-0.793
Score-0.789
Token on
Token ID319
Feature activation-6.49e-02

Pos logit contributions
wig+0.818
 seiz+0.809
ris+0.709
unin+0.690
models+0.681
Neg logit contributions
 Defin-1.014
Ct-0.994
 Prosecut-0.936
 Proceed-0.860
Score-0.853
Token its
Token ID663
Feature activation+1.25e-01

Pos logit contributions
wig+0.122
 seiz+0.121
ris+0.105
unin+0.103
models+0.101
Neg logit contributions
 Defin-0.160
Ct-0.157
 Prosecut-0.148
 Proceed-0.136
Score-0.136
Token own
Token ID898
Feature activation+2.04e-01

Pos logit contributions
 Defin+0.346
Ct+0.340
 Prosecut+0.323
 Proceed+0.300
Score+0.299
Neg logit contributions
wig-0.195
 seiz-0.194
ris-0.163
unin-0.159
models-0.156