TOP ACTIVATIONS
MAX = 1.659

: struct Board { ... prev _ state :
< li ng - repeat =" phone in phones "> {{
support of my readers on Patreon . This post is 2
- I tt u on Patreon <|endoftext|> Car oline T omp
: struct Board { turn : Player , //
blowing up Advertisement N PH :
{ | i | print "# { i } " }
pub struct Guard { ... } pub fn pin
Up Continue reading the main story Please verify you 're not
: impl Board { pub fn copy _ board
Own ed < T > { ... } impl < T
Own ed < T > { pub fn new ( t
Why !? ( Na of
Up Continue reading the main story Please verify you 're not
< li ng - repeat =" phone in phones "> <
Output s { {{ 1 , 6 }, {{
Up Continue reading the main story Please verify you 're not
struct Atomic < T > { ... } impl < T
| 2 | struct Board { | ^ ^^^^ ^^^^ ^^
< body ng - controller =" Phone List Ctrl "> <

MIN ACTIVATIONS
MIN = -1.690

But t appeared in a Channel 4 documentary entitled
re pos . files ] WHERE RIGHT ( path , 4
s US Cons ulate to protest Trump
First in the Nation Republican Leadership Summit April 18 , 2015
's paired with " Del iver ance " on Aug .
. ore gon . gov / pre vention well ness /
re pos . contents ] WHERE id IN ( SELECT id
cl ojure . contents ] WHERE id IN ( SELECT id
s The Last Jedi , which opened to a
cl ojure . files ] WHERE path LIKE ' % project
ell (@ s hel by y 512 ) August
. c ensus . gov / data / 2015 / acs
resources <|endoftext|> When ideology takes up the brain space normally reserved
. The Congressional Leadership Fund , a political action
, while Gand alf dep arts to seek help from Tree
discussing the sale of aborted fetal tissue . But , in
, Congress made clear its intent to deflect political fire from
a . dot . gov / pub s / 8 11
cameras being defective ( cons ist ently front - f ocusing
v . Windsor struck down Section 3 of the Defense Of

INTERVAL 0.822 - 1.659
CONTAINS 0.476%

meeting that six Cabinet ministers belonging to the party as well
is now Gand alf the White , for he has taken
a licensing division of AT & T / Western Electric ,
in Cleveland . By the start of the 1946 season ,
celebrations in St irling . Again the emphasis is not on

INTERVAL -0.016 - 0.822
CONTAINS 51.287%

DO , NEVER GOT INTO THAT * Let 's
: That homosexuality is rep ug nant , a sin that
on the third line in Chicago 's 3 - 2 loss
for the S PA , Koreans cast their ballots again .
to ongoing friction between her and another employee , Desire e

INTERVAL -0.853 - -0.016
CONTAINS 47.924%

. 171 102 30 86 x . x . x .
knowing you were -- RF : You know ,
it is quite easy to sort those issues .
omm and | . 3 . If you start editing a
wouldn t survive the mortgage meltdown , saw Cohn

INTERVAL -1.690 - -0.853
CONTAINS 0.312%

my sound system . Rated 5 out of 5
Catalyst Control Center and Over Current Protection limits which means access
inspired by the Three Fl oy ds - Half Ac re
a should have been sent off for a heavy , knee
s commitment was plain ly stated in
Token:
Token ID25
Feature activation-8.63e-02

Pos logit contributions
luster+0.820
jriwal+0.745
anmar+0.741
clud+0.711
rete+0.690
Neg logit contributions
DEF-0.632
235-0.617
226-0.608
entin-0.600
ELF-0.599
Token
Token ID198
Feature activation+2.87e-02

Pos logit contributions
luster+0.256
jriwal+0.230
anmar+0.229
clud+0.218
rete+0.211
Neg logit contributions
DEF-0.240
235-0.235
226-0.232
entin-0.229
ELF-0.229
Token
Token ID198
Feature activation+1.16e-01

Pos logit contributions
DEF+0.076
235+0.075
226+0.074
ELF+0.072
309+0.072
Neg logit contributions
luster-0.089
jriwal-0.081
anmar-0.081
clud-0.077
rete-0.074
Tokenstruct
Token ID7249
Feature activation-5.68e-01

Pos logit contributions
DEF+0.273
235+0.266
226+0.262
entin+0.258
ELF+0.257
Neg logit contributions
luster-0.393
jriwal-0.359
anmar-0.358
clud-0.344
rete-0.334
Token Board
Token ID5926
Feature activation+1.13e-01

Pos logit contributions
luster+1.676
jriwal+1.507
anmar+1.502
clud+1.440
rete+1.394
Neg logit contributions
DEF-1.575
235-1.544
226-1.522
entin-1.507
ELF-1.502
Token {
Token ID1391
Feature activation+1.66e+00

Pos logit contributions
DEF+0.292
235+0.286
226+0.281
entin+0.277
ELF+0.277
Neg logit contributions
luster-0.357
jriwal-0.324
anmar-0.322
clud-0.309
rete-0.299
Token...
Token ID2644
Feature activation+4.05e-01

Pos logit contributions
DEF+4.216
235+4.143
226+4.069
entin+4.016
ELF+3.990
Neg logit contributions
luster-5.195
anmar-4.689
jriwal-4.670
clud-4.447
galitarian-4.362
Token prev
Token ID8654
Feature activation-2.73e-01

Pos logit contributions
DEF+1.040
235+1.018
226+1.001
entin+0.984
ELF+0.983
Neg logit contributions
luster-1.293
jriwal-1.171
anmar-1.168
clud-1.117
rete-1.080
Token_
Token ID62
Feature activation+2.55e-01

Pos logit contributions
luster+0.958
jriwal+0.881
anmar+0.879
clud+0.847
rete+0.823
Neg logit contributions
DEF-0.604
235-0.590
226-0.580
entin-0.573
ELF-0.571
Tokenstate
Token ID5219
Feature activation+2.79e-02

Pos logit contributions
DEF+0.829
235+0.815
226+0.806
entin+0.795
ELF+0.794
Neg logit contributions
luster-0.638
jriwal-0.563
anmar-0.559
clud-0.529
rete-0.505
Token :
Token ID1058
Feature activation+2.34e-01

Pos logit contributions
DEF+0.063
235+0.062
226+0.061
entin+0.060
ELF+0.060
Neg logit contributions
luster-0.097
jriwal-0.089
anmar-0.088
clud-0.085
rete-0.083
Token <
Token ID1279
Feature activation+3.64e-01

Pos logit contributions
DEF+2.070
235+2.048
226+1.995
ELF+1.931
309+1.927
Neg logit contributions
luster-3.594
jriwal-3.296
anmar-3.263
clud-3.145
rete-3.054
Tokenli
Token ID4528
Feature activation-1.32e-01

Pos logit contributions
DEF+0.721
235+0.707
226+0.690
ELF+0.673
309+0.669
Neg logit contributions
luster-1.368
jriwal-1.267
anmar-1.257
clud-1.211
rete-1.183
Token ng
Token ID23370
Feature activation-9.55e-01

Pos logit contributions
luster+0.356
jriwal+0.315
anmar+0.314
clud+0.297
rete+0.287
Neg logit contributions
DEF-0.406
235-0.399
226-0.394
entin-0.389
ELF-0.388
Token-
Token ID12
Feature activation-1.89e-01

Pos logit contributions
luster+3.022
anmar+2.745
jriwal+2.738
clud+2.637
rete+2.549
Neg logit contributions
DEF-2.445
235-2.374
226-2.339
entin-2.337
ELF-2.322
Tokenrepeat
Token ID44754
Feature activation+1.23e-01

Pos logit contributions
luster+0.577
jriwal+0.523
anmar+0.517
clud+0.495
rete+0.479
Neg logit contributions
DEF-0.511
235-0.502
226-0.496
entin-0.486
ELF-0.485
Token="
Token ID2625
Feature activation+1.60e+00

Pos logit contributions
DEF+0.338
235+0.331
226+0.327
ELF+0.321
entin+0.319
Neg logit contributions
luster-0.372
jriwal-0.335
anmar-0.331
clud-0.316
rete-0.307
Tokenphone
Token ID4862
Feature activation-2.21e-01

Pos logit contributions
DEF+3.932
235+3.903
226+3.837
309+3.735
186+3.693
Neg logit contributions
luster-5.182
jriwal-4.730
anmar-4.594
clud-4.483
galitarian-4.352
Token in
Token ID287
Feature activation+1.51e-01

Pos logit contributions
luster+0.713
jriwal+0.648
anmar+0.646
clud+0.619
rete+0.599
Neg logit contributions
DEF-0.557
235-0.545
226-0.537
entin-0.529
ELF-0.529
Token phones
Token ID9512
Feature activation+4.44e-03

Pos logit contributions
DEF+0.408
235+0.401
226+0.395
entin+0.388
ELF+0.387
Neg logit contributions
luster-0.463
jriwal-0.417
anmar-0.414
clud-0.395
rete-0.383
Token">
Token ID5320
Feature activation+7.92e-01

Pos logit contributions
DEF+0.011
235+0.010
226+0.010
entin+0.010
ELF+0.010
Neg logit contributions
luster-0.015
jriwal-0.014
anmar-0.014
clud-0.013
rete-0.013
Token {{
Token ID22935
Feature activation+9.07e-01

Pos logit contributions
DEF+1.691
235+1.668
226+1.637
309+1.587
ELF+1.573
Neg logit contributions
luster-2.863
jriwal-2.628
anmar-2.605
clud-2.516
rete-2.436
Token support
Token ID1104
Feature activation+7.70e-02

Pos logit contributions
DEF+0.660
235+0.646
226+0.638
entin+0.628
ELF+0.627
Neg logit contributions
luster-0.753
jriwal-0.680
anmar-0.675
clud-0.648
rete-0.626
Token of
Token ID286
Feature activation+6.66e-01

Pos logit contributions
DEF+0.205
235+0.201
226+0.198
entin+0.195
ELF+0.194
Neg logit contributions
luster-0.238
jriwal-0.215
anmar-0.214
clud-0.205
rete-0.198
Token my
Token ID616
Feature activation+3.10e-01

Pos logit contributions
DEF+1.790
235+1.770
226+1.745
entin+1.711
309+1.710
Neg logit contributions
luster-2.027
jriwal-1.854
anmar-1.798
clud-1.728
galitarian-1.680
Token readers
Token ID7183
Feature activation+1.20e+00

Pos logit contributions
DEF+0.770
235+0.754
226+0.743
ELF+0.731
entin+0.730
Neg logit contributions
luster-1.015
jriwal-0.925
anmar-0.913
clud-0.879
rete-0.853
Token on
Token ID319
Feature activation+2.69e-01

Pos logit contributions
DEF+2.805
235+2.755
226+2.716
entin+2.653
ELF+2.648
Neg logit contributions
luster-4.048
jriwal-3.708
anmar-3.667
clud-3.527
rete-3.440
Token Patreon
Token ID29859
Feature activation+1.57e+00

Pos logit contributions
DEF+0.643
235+0.632
226+0.621
entin+0.610
ELF+0.607
Neg logit contributions
luster-0.907
jriwal-0.828
anmar-0.818
clud-0.788
rete-0.767
Token.
Token ID13
Feature activation+3.08e-01

Pos logit contributions
DEF+3.576
235+3.542
226+3.493
186+3.375
ELF+3.363
Neg logit contributions
luster-5.400
jriwal-4.932
anmar-4.852
clud-4.697
rete-4.578
Token This
Token ID770
Feature activation+2.06e-01

Pos logit contributions
DEF+0.712
235+0.695
226+0.683
entin+0.669
ELF+0.668
Neg logit contributions
luster-1.063
jriwal-0.976
anmar-0.966
clud-0.928
rete-0.904
Token post
Token ID1281
Feature activation+5.67e-02

Pos logit contributions
DEF+0.539
235+0.527
226+0.520
entin+0.512
ELF+0.511
Neg logit contributions
luster-0.645
jriwal-0.583
anmar-0.581
clud-0.557
rete-0.538
Token is
Token ID318
Feature activation+6.90e-02

Pos logit contributions
DEF+0.149
235+0.145
226+0.143
ELF+0.142
entin+0.141
Neg logit contributions
luster-0.176
jriwal-0.160
anmar-0.159
clud-0.153
rete-0.148
Token 2
Token ID362
Feature activation+6.34e-01

Pos logit contributions
DEF+0.182
235+0.178
226+0.175
entin+0.172
ELF+0.172
Neg logit contributions
luster-0.215
jriwal-0.195
anmar-0.194
clud-0.186
rete-0.180
Token-
Token ID12
Feature activation-6.52e-03

Pos logit contributions
DEF+1.153
235+1.116
ELF+1.108
226+1.100
entin+1.090
Neg logit contributions
luster-1.366
jriwal-1.241
anmar-1.227
clud-1.186
wald-1.144
TokenI
Token ID40
Feature activation-2.02e-01

Pos logit contributions
luster+0.025
jriwal+0.023
anmar+0.023
clud+0.022
rete+0.022
Neg logit contributions
DEF-0.012
235-0.012
226-0.012
entin-0.011
ELF-0.011
Tokentt
Token ID926
Feature activation-2.26e-01

Pos logit contributions
luster+0.742
jriwal+0.681
anmar+0.680
clud+0.656
rete+0.638
Neg logit contributions
DEF-0.417
235-0.406
226-0.398
entin-0.391
ELF-0.391
Tokenu
Token ID84
Feature activation-9.24e-03

Pos logit contributions
luster+0.830
jriwal+0.762
anmar+0.760
clud+0.733
rete+0.713
Neg logit contributions
DEF-0.470
235-0.457
226-0.448
ELF-0.440
entin-0.439
Token on
Token ID319
Feature activation+1.93e-01

Pos logit contributions
luster+0.029
anmar+0.026
jriwal+0.026
clud+0.025
rete+0.024
Neg logit contributions
DEF-0.025
235-0.024
226-0.024
entin-0.023
309-0.023
Token Patreon
Token ID29859
Feature activation+1.56e+00

Pos logit contributions
DEF+0.272
235+0.255
226+0.251
ELF+0.247
entin+0.245
Neg logit contributions
luster-0.839
anmar-0.778
jriwal-0.775
clud-0.754
rete-0.738
Token<|endoftext|>
Token ID50256
Feature activation+3.10e-02

Pos logit contributions
DEF+3.114
235+3.113
226+3.063
186+2.943
309+2.941
Neg logit contributions
luster-5.794
jriwal-5.387
anmar-5.298
clud-5.224
rete-5.021
TokenCar
Token ID9914
Feature activation-9.12e-02

Pos logit contributions
DEF+0.067
235+0.065
226+0.064
entin+0.063
ELF+0.063
Neg logit contributions
luster-0.111
jriwal-0.102
anmar-0.102
clud-0.098
rete-0.095
Tokenoline
Token ID14453
Feature activation+2.23e-01

Pos logit contributions
luster+0.264
jriwal+0.236
anmar+0.236
clud+0.226
rete+0.216
Neg logit contributions
DEF-0.260
235-0.255
226-0.252
entin-0.248
ELF-0.247
Token T
Token ID309
Feature activation-5.57e-01

Pos logit contributions
DEF+0.554
235+0.542
226+0.534
entin+0.524
ELF+0.524
Neg logit contributions
luster-0.732
jriwal-0.665
anmar-0.664
clud-0.636
rete-0.615
Tokenomp
Token ID3361
Feature activation+1.85e-01

Pos logit contributions
luster+1.670
jriwal+1.504
anmar+1.500
clud+1.426
rete+1.398
Neg logit contributions
DEF-1.527
235-1.494
226-1.471
ELF-1.445
entin-1.444
Token:
Token ID25
Feature activation-1.31e-01

Pos logit contributions
luster+0.235
jriwal+0.213
anmar+0.212
clud+0.204
rete+0.197
Neg logit contributions
DEF-0.177
235-0.173
226-0.170
ELF-0.167
entin-0.167
Token
Token ID198
Feature activation-6.59e-02

Pos logit contributions
luster+0.396
jriwal+0.357
anmar+0.356
clud+0.340
rete+0.329
Neg logit contributions
DEF-0.358
235-0.351
226-0.346
entin-0.341
ELF-0.340
Token
Token ID198
Feature activation+1.07e-01

Pos logit contributions
luster+0.204
jriwal+0.184
anmar+0.183
clud+0.175
rete+0.168
Neg logit contributions
DEF-0.177
235-0.174
226-0.171
ELF-0.168
entin-0.167
Tokenstruct
Token ID7249
Feature activation-4.22e-01

Pos logit contributions
DEF+0.242
235+0.236
226+0.232
ELF+0.227
entin+0.227
Neg logit contributions
luster-0.375
jriwal-0.343
anmar-0.342
clud-0.329
rete-0.319
Token Board
Token ID5926
Feature activation+1.77e-01

Pos logit contributions
luster+1.228
jriwal+1.101
anmar+1.099
clud+1.052
rete+1.017
Neg logit contributions
DEF-1.192
235-1.170
226-1.154
entin-1.143
ELF-1.137
Token {
Token ID1391
Feature activation+1.56e+00

Pos logit contributions
DEF+0.448
235+0.437
226+0.430
ELF+0.423
entin+0.423
Neg logit contributions
luster-0.573
jriwal-0.520
anmar-0.518
clud-0.496
rete-0.480
Token turn
Token ID1210
Feature activation+5.15e-01

Pos logit contributions
DEF+4.050
235+3.976
226+3.891
ELF+3.821
entin+3.805
Neg logit contributions
luster-4.869
anmar-4.350
jriwal-4.345
clud-4.152
galitarian-4.022
Token :
Token ID1058
Feature activation+4.91e-01

Pos logit contributions
DEF+1.254
235+1.226
226+1.204
entin+1.179
ELF+1.176
Neg logit contributions
luster-1.718
jriwal-1.562
anmar-1.557
clud-1.495
rete-1.441
Token Player
Token ID7853
Feature activation-1.87e-01

Pos logit contributions
DEF+1.281
235+1.257
226+1.237
ELF+1.210
186+1.206
Neg logit contributions
luster-1.542
jriwal-1.411
anmar-1.398
clud-1.328
galitarian-1.280
Token,
Token ID837
Feature activation+3.54e-01

Pos logit contributions
luster+0.624
jriwal+0.568
anmar+0.566
clud+0.544
rete+0.528
Neg logit contributions
DEF-0.451
235-0.440
226-0.433
entin-0.427
ELF-0.426
Token //
Token ID3373
Feature activation+1.90e-01

Pos logit contributions
DEF+0.973
235+0.955
226+0.941
entin+0.925
ELF+0.923
Neg logit contributions
luster-1.069
jriwal-0.959
anmar-0.957
clud-0.911
rete-0.877
Token blowing
Token ID19280
Feature activation+3.68e-01

Pos logit contributions
luster+0.811
jriwal+0.732
anmar+0.729
clud+0.698
rete+0.675
Neg logit contributions
DEF-0.716
235-0.700
226-0.691
ELF-0.680
entin-0.680
Token up
Token ID510
Feature activation+4.30e-01

Pos logit contributions
DEF+1.046
235+1.028
226+1.013
entin+1.000
ELF+0.993
Neg logit contributions
luster-1.071
anmar-0.962
jriwal-0.961
clud-0.912
rete-0.883
Token
Token ID960
Feature activation-4.59e-01

Pos logit contributions
DEF+1.165
235+1.147
226+1.128
entin+1.113
ELF+1.110
Neg logit contributions
luster-1.308
jriwal-1.184
anmar-1.178
clud-1.124
rete-1.087
Token
Token ID198
Feature activation-5.28e-01

Pos logit contributions
luster+1.442
jriwal+1.298
anmar+1.295
clud+1.246
rete+1.205
Neg logit contributions
DEF-1.191
235-1.165
226-1.147
entin-1.138
ELF-1.133
Token
Token ID198
Feature activation-2.91e-01

Pos logit contributions
luster+1.517
jriwal+1.356
anmar+1.349
clud+1.308
rete+1.267
Neg logit contributions
DEF-1.485
235-1.446
entin-1.435
226-1.428
ELF-1.419
TokenAdvertisement
Token ID4723
Feature activation+1.53e+00

Pos logit contributions
luster+1.096
jriwal+1.007
anmar+1.003
clud+0.973
rete+0.949
Neg logit contributions
DEF-0.567
235-0.555
226-0.544
entin-0.541
ELF-0.530
Token
Token ID198
Feature activation+3.06e-01

Pos logit contributions
DEF+3.634
235+3.507
226+3.430
309+3.379
NM+3.367
Neg logit contributions
luster-5.154
jriwal-4.775
anmar-4.769
clud-4.454
rete-4.380
Token
Token ID198
Feature activation+3.61e-01

Pos logit contributions
DEF+0.800
235+0.788
226+0.776
ELF+0.757
309+0.756
Neg logit contributions
luster-0.976
jriwal-0.888
anmar-0.885
clud-0.835
rete-0.804
TokenN
Token ID45
Feature activation+1.66e-01

Pos logit contributions
DEF+0.720
235+0.694
226+0.680
ELF+0.669
309+0.659
Neg logit contributions
luster-1.371
jriwal-1.266
anmar-1.264
clud-1.213
rete-1.179
TokenPH
Token ID11909
Feature activation-8.75e-01

Pos logit contributions
DEF+0.330
235+0.320
226+0.314
entin+0.308
ELF+0.308
Neg logit contributions
luster-0.627
jriwal-0.578
anmar-0.576
clud-0.556
rete-0.541
Token:
Token ID25
Feature activation-4.23e-01

Pos logit contributions
luster+2.784
jriwal+2.536
anmar+2.529
clud+2.406
wald+2.360
Neg logit contributions
DEF-2.150
235-2.108
226-2.078
entin-2.069
ELF-2.038
Token {
Token ID1391
Feature activation+1.27e+00

Pos logit contributions
luster+3.913
jriwal+3.529
anmar+3.507
clud+3.438
rete+3.284
Neg logit contributions
DEF-3.656
entin-3.588
235-3.580
226-3.523
ELF-3.486
Token|
Token ID91
Feature activation+8.23e-01

Pos logit contributions
DEF+3.387
235+3.320
226+3.243
ELF+3.189
309+3.161
Neg logit contributions
luster-3.925
anmar-3.585
jriwal-3.577
clud-3.392
rete-3.273
Tokeni
Token ID72
Feature activation-5.79e-01

Pos logit contributions
DEF+2.112
235+2.071
226+2.031
ELF+1.994
309+1.972
Neg logit contributions
luster-2.627
jriwal-2.407
anmar-2.400
clud-2.264
rete-2.199
Token|
Token ID91
Feature activation+7.96e-01

Pos logit contributions
luster+2.261
anmar+2.090
jriwal+2.084
clud+2.018
rete+1.967
Neg logit contributions
DEF-1.053
235-1.015
226-0.997
entin-0.993
ELF-0.977
Token print
Token ID3601
Feature activation-3.03e-01

Pos logit contributions
DEF+1.982
235+1.952
226+1.910
ELF+1.859
309+1.858
Neg logit contributions
luster-2.595
jriwal-2.363
anmar-2.358
clud-2.248
rete-2.177
Token "#
Token ID25113
Feature activation+1.53e+00

Pos logit contributions
luster+0.951
jriwal+0.859
anmar+0.854
clud+0.824
rete+0.793
Neg logit contributions
DEF-0.788
235-0.771
226-0.762
entin-0.757
ELF-0.748
Token{
Token ID90
Feature activation+1.00e+00

Pos logit contributions
DEF+3.753
235+3.657
226+3.562
ELF+3.489
309+3.452
Neg logit contributions
luster-5.027
jriwal-4.598
anmar-4.588
clud-4.348
galitarian-4.215
Tokeni
Token ID72
Feature activation-2.88e-01

Pos logit contributions
DEF+2.135
235+2.091
226+2.045
ELF+1.987
309+1.974
Neg logit contributions
luster-3.634
anmar-3.345
jriwal-3.331
clud-3.187
rete-3.108
Token}
Token ID92
Feature activation+3.33e-01

Pos logit contributions
luster+1.086
jriwal+1.001
anmar+0.995
clud+0.966
rete+0.940
Neg logit contributions
DEF-0.562
235-0.544
226-0.536
entin-0.532
ELF-0.526
Token "
Token ID366
Feature activation+5.73e-01

Pos logit contributions
DEF+0.833
235+0.815
226+0.801
ELF+0.786
entin+0.781
Neg logit contributions
luster-1.083
jriwal-0.983
anmar-0.983
clud-0.938
rete-0.908
Token}
Token ID92
Feature activation-5.72e-02

Pos logit contributions
DEF+1.308
235+1.275
226+1.253
ELF+1.227
entin+1.217
Neg logit contributions
luster-1.994
anmar-1.825
jriwal-1.823
clud-1.744
rete-1.695
Token
Token ID198
Feature activation-1.60e-01

Pos logit contributions
luster+0.477
jriwal+0.430
anmar+0.427
clud+0.408
rete+0.394
Neg logit contributions
DEF-0.441
235-0.432
226-0.427
entin-0.421
ELF-0.420
Token
Token ID198
Feature activation-7.10e-02

Pos logit contributions
luster+0.485
jriwal+0.437
anmar+0.435
clud+0.416
rete+0.402
Neg logit contributions
DEF-0.437
235-0.428
226-0.422
entin-0.416
ELF-0.415
Tokenpub
Token ID12984
Feature activation-6.92e-01

Pos logit contributions
luster+0.236
jriwal+0.215
anmar+0.214
clud+0.205
rete+0.199
Neg logit contributions
DEF-0.171
235-0.168
226-0.165
entin-0.163
ELF-0.162
Token struct
Token ID2878
Feature activation-5.21e-01

Pos logit contributions
luster+2.331
jriwal+2.158
anmar+2.130
clud+2.080
rete+2.008
Neg logit contributions
DEF-1.575
235-1.539
226-1.513
ELF-1.497
entin-1.492
Token Guard
Token ID4932
Feature activation+5.56e-01

Pos logit contributions
luster+1.632
jriwal+1.480
anmar+1.473
clud+1.417
rete+1.369
Neg logit contributions
DEF-1.347
235-1.319
226-1.298
entin-1.292
ELF-1.276
Token {
Token ID1391
Feature activation+1.51e+00

Pos logit contributions
DEF+1.357
235+1.324
226+1.303
ELF+1.279
entin+1.279
Neg logit contributions
luster-1.846
jriwal-1.684
anmar-1.678
clud-1.612
rete-1.557
Token...
Token ID2644
Feature activation+3.26e-01

Pos logit contributions
DEF+3.899
235+3.840
226+3.768
ELF+3.691
entin+3.676
Neg logit contributions
luster-4.784
anmar-4.304
jriwal-4.265
clud-4.114
galitarian-3.963
Token }
Token ID1782
Feature activation+9.11e-01

Pos logit contributions
DEF+0.794
235+0.775
226+0.763
entin+0.749
ELF+0.749
Neg logit contributions
luster-1.083
jriwal-0.985
anmar-0.982
clud-0.943
rete-0.914
Token pub
Token ID2240
Feature activation-5.59e-01

Pos logit contributions
DEF+2.312
235+2.263
226+2.227
entin+2.188
ELF+2.180
Neg logit contributions
luster-2.937
anmar-2.658
jriwal-2.640
clud-2.533
rete-2.453
Token fn
Token ID24714
Feature activation-2.50e-01

Pos logit contributions
luster+1.851
jriwal+1.716
anmar+1.675
clud+1.649
rete+1.590
Neg logit contributions
DEF-1.295
235-1.276
226-1.255
entin-1.253
ELF-1.253
Token pin
Token ID6757
Feature activation-4.42e-01

Pos logit contributions
luster+0.777
jriwal+0.707
anmar+0.703
clud+0.673
rete+0.651
Neg logit contributions
DEF-0.657
235-0.642
226-0.632
entin-0.625
ELF-0.625
Token Up
Token ID3205
Feature activation-8.97e-01

Pos logit contributions
luster+0.891
jriwal+0.821
anmar+0.819
clud+0.791
rete+0.771
Neg logit contributions
DEF-0.450
235-0.436
226-0.427
entin-0.419
ELF-0.418
Token Continue
Token ID10054
Feature activation+4.73e-01

Pos logit contributions
luster+2.425
jriwal+2.197
anmar+2.169
clud+2.066
rete+2.015
Neg logit contributions
DEF-2.672
235-2.599
226-2.565
entin-2.560
ELF-2.529
Token reading
Token ID3555
Feature activation+2.73e-01

Pos logit contributions
DEF+1.124
235+1.098
226+1.083
entin+1.059
ELF+1.058
Neg logit contributions
luster-1.603
jriwal-1.455
anmar-1.454
clud-1.402
rete-1.356
Token the
Token ID262
Feature activation+6.46e-01

Pos logit contributions
DEF+0.700
235+0.686
226+0.675
ELF+0.662
entin+0.660
Neg logit contributions
luster-0.876
jriwal-0.792
anmar-0.791
clud-0.756
rete-0.730
Token main
Token ID1388
Feature activation+2.71e-01

Pos logit contributions
DEF+1.850
235+1.842
226+1.828
entin+1.784
186+1.772
Neg logit contributions
luster-1.859
jriwal-1.637
anmar-1.627
clud-1.560
galitarian-1.486
Token story
Token ID1621
Feature activation+1.51e+00

Pos logit contributions
DEF+0.804
235+0.801
226+0.792
entin+0.774
233+0.768
Neg logit contributions
luster-0.762
jriwal-0.669
anmar-0.666
clud-0.642
galitarian-0.611
Token Please
Token ID4222
Feature activation+7.28e-01

Pos logit contributions
DEF+1.334
235+1.313
226+1.253
ELF+1.164
entin+1.160
Neg logit contributions
luster-7.316
jriwal-6.847
anmar-6.816
clud-6.674
rete-6.512
Token verify
Token ID11767
Feature activation+9.11e-01

Pos logit contributions
235+1.078
DEF+1.071
226+1.066
186+1.008
309+1.006
Neg logit contributions
luster-3.144
jriwal-2.863
anmar-2.850
clud-2.775
rete-2.716
Token you
Token ID345
Feature activation+1.82e-01

Pos logit contributions
DEF+2.383
235+2.372
226+2.335
309+2.281
ELF+2.263
Neg logit contributions
luster-2.889
anmar-2.596
jriwal-2.558
clud-2.453
galitarian-2.368
Token're
Token ID821
Feature activation+4.10e-01

Pos logit contributions
DEF+0.546
235+0.538
226+0.531
entin+0.523
ELF+0.522
Neg logit contributions
luster-0.508
anmar-0.450
jriwal-0.445
clud-0.424
galitarian-0.399
Token not
Token ID407
Feature activation+3.90e-01

Pos logit contributions
DEF+1.183
235+1.168
226+1.148
entin+1.136
ELF+1.131
Neg logit contributions
luster-1.186
anmar-1.054
jriwal-1.052
clud-0.999
rete-0.954
Token:
Token ID25
Feature activation-1.33e-01

Pos logit contributions
luster+0.756
jriwal+0.688
anmar+0.686
clud+0.660
rete+0.638
Neg logit contributions
DEF-0.565
235-0.553
226-0.544
entin-0.535
ELF-0.535
Token
Token ID198
Feature activation-5.60e-02

Pos logit contributions
luster+0.404
jriwal+0.365
anmar+0.363
clud+0.347
rete+0.336
Neg logit contributions
DEF-0.359
235-0.351
226-0.347
entin-0.341
ELF-0.341
Token
Token ID198
Feature activation+3.40e-02

Pos logit contributions
luster+0.172
jriwal+0.155
anmar+0.155
clud+0.148
rete+0.142
Neg logit contributions
DEF-0.151
235-0.148
226-0.146
ELF-0.143
entin-0.143
Tokenimpl
Token ID23928
Feature activation-5.16e-01

Pos logit contributions
DEF+0.079
235+0.077
226+0.076
entin+0.075
ELF+0.075
Neg logit contributions
luster-0.116
jriwal-0.106
anmar-0.106
clud-0.101
rete-0.099
Token Board
Token ID5926
Feature activation-5.53e-03

Pos logit contributions
luster+1.627
anmar+1.479
jriwal+1.472
clud+1.417
rete+1.375
Neg logit contributions
DEF-1.318
235-1.290
226-1.275
entin-1.268
ELF-1.256
Token {
Token ID1391
Feature activation+1.50e+00

Pos logit contributions
luster+0.018
jriwal+0.016
anmar+0.016
clud+0.015
rete+0.015
Neg logit contributions
DEF-0.014
235-0.014
226-0.013
entin-0.013
ELF-0.013
Token pub
Token ID2240
Feature activation-3.42e-01

Pos logit contributions
DEF+3.778
235+3.708
226+3.630
ELF+3.569
entin+3.554
Neg logit contributions
luster-4.825
jriwal-4.314
anmar-4.310
clud-4.137
galitarian-3.972
Token fn
Token ID24714
Feature activation-5.52e-03

Pos logit contributions
luster+1.159
jriwal+1.067
anmar+1.054
clud+1.027
rete+0.993
Neg logit contributions
DEF-0.785
235-0.770
226-0.759
entin-0.748
ELF-0.746
Token copy
Token ID4866
Feature activation-2.56e-01

Pos logit contributions
luster+0.013
jriwal+0.011
anmar+0.011
clud+0.011
rete+0.010
Neg logit contributions
DEF-0.019
235-0.019
226-0.018
ELF-0.018
entin-0.018
Token_
Token ID62
Feature activation+4.41e-01

Pos logit contributions
luster+0.831
anmar+0.755
jriwal+0.754
clud+0.726
rete+0.703
Neg logit contributions
DEF-0.634
235-0.619
226-0.610
entin-0.607
ELF-0.600
Tokenboard
Token ID3526
Feature activation-6.37e-03

Pos logit contributions
DEF+1.179
235+1.159
226+1.142
entin+1.122
ELF+1.119
Neg logit contributions
luster-1.353
jriwal-1.227
anmar-1.218
clud-1.163
rete-1.118
Token Own
Token ID11744
Feature activation+7.24e-01

Pos logit contributions
luster+1.840
jriwal+1.672
anmar+1.665
clud+1.616
rete+1.566
Neg logit contributions
DEF-1.317
235-1.288
226-1.262
entin-1.254
ELF-1.237
Tokened
Token ID276
Feature activation+4.30e-01

Pos logit contributions
DEF+1.638
235+1.580
226+1.560
ELF+1.538
entin+1.531
Neg logit contributions
luster-2.531
jriwal-2.329
anmar-2.321
clud-2.224
rete-2.152
Token <
Token ID1279
Feature activation+7.04e-01

Pos logit contributions
DEF+1.005
235+0.981
226+0.965
entin+0.948
ELF+0.948
Neg logit contributions
luster-1.468
jriwal-1.340
anmar-1.335
clud-1.284
rete-1.245
Token T
Token ID309
Feature activation+1.99e-01

Pos logit contributions
DEF+1.657
235+1.614
226+1.590
ELF+1.569
entin+1.545
Neg logit contributions
luster-2.409
jriwal-2.209
anmar-2.191
clud-2.095
rete-2.036
Token >
Token ID1875
Feature activation+9.05e-01

Pos logit contributions
DEF+0.478
235+0.466
226+0.459
ELF+0.451
entin+0.450
Neg logit contributions
luster-0.670
jriwal-0.612
anmar-0.609
clud-0.584
rete-0.566
Token {
Token ID1391
Feature activation+1.48e+00

Pos logit contributions
DEF+2.214
235+2.167
226+2.133
ELF+2.088
entin+2.082
Neg logit contributions
luster-3.008
anmar-2.732
jriwal-2.710
clud-2.605
rete-2.513
Token...
Token ID2644
Feature activation+2.73e-01

Pos logit contributions
DEF+3.855
235+3.790
226+3.721
ELF+3.663
entin+3.645
Neg logit contributions
luster-4.666
anmar-4.188
jriwal-4.175
clud-4.013
rete-3.849
Token }
Token ID1782
Feature activation+9.14e-01

Pos logit contributions
DEF+0.580
235+0.564
226+0.554
entin+0.544
ELF+0.543
Neg logit contributions
luster-0.989
jriwal-0.908
anmar-0.905
clud-0.873
rete-0.848
Token impl
Token ID4114
Feature activation-1.98e-01

Pos logit contributions
DEF+2.185
235+2.139
226+2.107
entin+2.077
ELF+2.066
Neg logit contributions
luster-3.073
anmar-2.787
jriwal-2.786
clud-2.670
rete-2.582
Token <
Token ID1279
Feature activation+2.33e-01

Pos logit contributions
luster+0.655
anmar+0.597
jriwal+0.596
clud+0.573
rete+0.557
Neg logit contributions
DEF-0.474
235-0.465
226-0.457
entin-0.453
ELF-0.447
Token T
Token ID309
Feature activation+2.07e-01

Pos logit contributions
DEF+0.568
235+0.555
226+0.546
ELF+0.537
entin+0.536
Neg logit contributions
luster-0.775
jriwal-0.707
anmar-0.704
clud-0.675
rete-0.654
Token Own
Token ID11744
Feature activation+9.64e-01

Pos logit contributions
DEF+2.283
235+2.209
226+2.177
ELF+2.163
entin+2.128
Neg logit contributions
luster-3.646
anmar-3.332
jriwal-3.325
clud-3.196
rete-3.088
Tokened
Token ID276
Feature activation+3.10e-01

Pos logit contributions
DEF+2.137
235+2.046
226+2.019
ELF+2.004
entin+1.985
Neg logit contributions
luster-3.409
jriwal-3.136
anmar-3.128
clud-3.008
rete-2.897
Token <
Token ID1279
Feature activation+4.62e-01

Pos logit contributions
DEF+0.706
235+0.689
226+0.679
entin+0.668
ELF+0.665
Neg logit contributions
luster-1.071
jriwal-0.980
anmar-0.976
clud-0.939
rete-0.913
Token T
Token ID309
Feature activation+2.05e-01

Pos logit contributions
DEF+1.199
235+1.171
226+1.155
ELF+1.141
entin+1.136
Neg logit contributions
luster-1.461
jriwal-1.325
anmar-1.317
clud-1.260
rete-1.218
Token >
Token ID1875
Feature activation+9.41e-01

Pos logit contributions
DEF+0.495
235+0.484
226+0.476
ELF+0.468
entin+0.467
Neg logit contributions
luster-0.685
jriwal-0.625
anmar-0.622
clud-0.597
rete-0.579
Token {
Token ID1391
Feature activation+1.47e+00

Pos logit contributions
DEF+2.162
235+2.111
226+2.074
ELF+2.041
entin+2.023
Neg logit contributions
luster-3.265
anmar-2.975
jriwal-2.961
clud-2.854
rete-2.759
Token pub
Token ID2240
Feature activation-4.82e-01

Pos logit contributions
DEF+3.503
235+3.462
226+3.380
ELF+3.317
entin+3.296
Neg logit contributions
luster-4.995
anmar-4.484
jriwal-4.467
clud-4.303
rete-4.142
Token fn
Token ID24714
Feature activation-7.97e-02

Pos logit contributions
luster+1.618
jriwal+1.483
anmar+1.470
clud+1.433
rete+1.385
Neg logit contributions
DEF-1.119
235-1.090
226-1.075
entin-1.063
ELF-1.056
Token new
Token ID649
Feature activation-1.59e-01

Pos logit contributions
luster+0.238
jriwal+0.215
anmar+0.214
clud+0.204
rete+0.197
Neg logit contributions
DEF-0.220
235-0.216
226-0.212
entin-0.210
ELF-0.210
Token (
Token ID357
Feature activation+7.82e-01

Pos logit contributions
luster+0.495
anmar+0.450
jriwal+0.448
clud+0.428
rete+0.414
Neg logit contributions
DEF-0.410
235-0.402
226-0.397
entin-0.391
ELF-0.390
Token t
Token ID256
Feature activation-3.72e-01

Pos logit contributions
DEF+1.743
235+1.705
226+1.675
ELF+1.658
entin+1.642
Neg logit contributions
luster-2.756
jriwal-2.521
anmar-2.505
clud-2.401
rete-2.334
Token
Token ID198
Feature activation-1.11e-01

Pos logit contributions
luster+0.410
jriwal+0.371
anmar+0.370
clud+0.356
rete+0.346
Neg logit contributions
DEF-0.318
235-0.310
226-0.306
entin-0.303
ELF-0.301
Token
Token ID198
Feature activation+1.03e-01

Pos logit contributions
luster+0.326
jriwal+0.292
anmar+0.292
clud+0.281
rete+0.272
Neg logit contributions
DEF-0.308
235-0.301
entin-0.298
226-0.297
ELF-0.293
Token
Token ID447
Feature activation+1.75e-01

Pos logit contributions
DEF+0.220
235+0.214
226+0.210
entin+0.209
ELF+0.206
Neg logit contributions
luster-0.371
jriwal-0.340
anmar-0.339
clud-0.327
rete-0.319
Token
Token ID250
Feature activation+2.49e-01

Pos logit contributions
DEF+0.512
235+0.503
226+0.497
ELF+0.489
entin+0.488
Neg logit contributions
luster-0.496
jriwal-0.444
anmar-0.442
clud-0.420
rete-0.405
TokenWhy
Token ID5195
Feature activation+1.48e-02

Pos logit contributions
DEF+0.509
235+0.495
226+0.486
entin+0.476
ELF+0.476
Neg logit contributions
luster-0.924
jriwal-0.850
anmar-0.847
clud-0.817
rete-0.795
Token!?
Token ID22857
Feature activation+1.47e+00

Pos logit contributions
DEF+0.043
235+0.042
226+0.041
entin+0.041
ELF+0.041
Neg logit contributions
luster-0.042
jriwal-0.038
anmar-0.038
clud-0.036
rete-0.034
Token
Token ID447
Feature activation+4.28e-01

Pos logit contributions
DEF+3.248
235+3.169
226+3.121
ELF+3.012
309+3.004
Neg logit contributions
luster-5.224
jriwal-4.871
anmar-4.818
clud-4.546
rete-4.440
Token
Token ID251
Feature activation+1.39e-01

Pos logit contributions
DEF+1.365
235+1.343
226+1.327
entin+1.308
ELF+1.308
Neg logit contributions
luster-1.098
jriwal-0.973
anmar-0.966
clud-0.914
rete-0.877
Token (
Token ID357
Feature activation+6.52e-01

Pos logit contributions
DEF+0.363
235+0.355
226+0.350
entin+0.345
ELF+0.344
Neg logit contributions
luster-0.434
jriwal-0.392
anmar-0.391
clud-0.375
rete-0.362
TokenNa
Token ID26705
Feature activation+7.54e-01

Pos logit contributions
DEF+1.377
235+1.339
226+1.312
ELF+1.286
entin+1.273
Neg logit contributions
luster-2.377
jriwal-2.192
anmar-2.183
clud-2.101
rete-2.036
Tokenof
Token ID1659
Feature activation+2.48e-01

Pos logit contributions
DEF+2.387
235+2.349
226+2.320
ELF+2.293
309+2.285
Neg logit contributions
luster-1.947
jriwal-1.707
anmar-1.696
clud-1.595
galitarian-1.543
Token Up
Token ID3205
Feature activation-8.88e-01

Pos logit contributions
luster+1.374
jriwal+1.264
anmar+1.262
clud+1.219
rete+1.187
Neg logit contributions
DEF-0.719
235-0.692
226-0.679
entin-0.667
ELF-0.666
Token Continue
Token ID10054
Feature activation+5.26e-01

Pos logit contributions
luster+2.319
jriwal+2.111
anmar+2.068
clud+1.970
rete+1.933
Neg logit contributions
DEF-2.707
235-2.622
226-2.587
entin-2.584
ELF-2.549
Token reading
Token ID3555
Feature activation+2.81e-01

Pos logit contributions
DEF+1.284
235+1.254
226+1.235
entin+1.213
ELF+1.212
Neg logit contributions
luster-1.745
jriwal-1.587
anmar-1.584
clud-1.522
rete-1.473
Token the
Token ID262
Feature activation+5.96e-01

Pos logit contributions
DEF+0.732
235+0.716
226+0.706
entin+0.694
ELF+0.694
Neg logit contributions
luster-0.888
jriwal-0.802
anmar-0.800
clud-0.767
rete-0.741
Token main
Token ID1388
Feature activation+2.23e-01

Pos logit contributions
DEF+1.755
235+1.739
226+1.729
entin+1.697
ELF+1.686
Neg logit contributions
luster-1.670
jriwal-1.476
anmar-1.468
clud-1.408
rete-1.336
Token story
Token ID1621
Feature activation+1.47e+00

Pos logit contributions
DEF+0.673
235+0.667
226+0.660
entin+0.648
309+0.641
Neg logit contributions
luster-0.616
jriwal-0.539
anmar-0.538
clud-0.518
rete-0.492
Token Please
Token ID4222
Feature activation+6.27e-01

Pos logit contributions
DEF+1.353
235+1.312
226+1.260
entin+1.181
ELF+1.173
Neg logit contributions
luster-7.070
jriwal-6.608
anmar-6.583
clud-6.437
rete-6.284
Token verify
Token ID11767
Feature activation+7.65e-01

Pos logit contributions
DEF+0.969
235+0.966
226+0.954
309+0.905
186+0.904
Neg logit contributions
luster-2.660
jriwal-2.432
anmar-2.422
clud-2.353
rete-2.299
Token you
Token ID345
Feature activation+7.55e-02

Pos logit contributions
DEF+2.036
235+2.020
226+1.990
309+1.942
ELF+1.933
Neg logit contributions
luster-2.395
anmar-2.151
jriwal-2.124
clud-2.037
galitarian-1.956
Token're
Token ID821
Feature activation+3.40e-01

Pos logit contributions
DEF+0.230
235+0.226
226+0.224
entin+0.220
ELF+0.220
Neg logit contributions
luster-0.207
anmar-0.184
jriwal-0.182
clud-0.173
rete-0.164
Token not
Token ID407
Feature activation+3.04e-01

Pos logit contributions
DEF+0.988
235+0.974
226+0.958
entin+0.948
ELF+0.945
Neg logit contributions
luster-0.975
anmar-0.866
jriwal-0.865
clud-0.822
rete-0.785
Token <
Token ID1279
Feature activation+2.93e-01

Pos logit contributions
DEF+1.921
235+1.880
226+1.840
entin+1.802
ELF+1.792
Neg logit contributions
luster-3.236
jriwal-2.960
anmar-2.939
clud-2.834
rete-2.752
Tokenli
Token ID4528
Feature activation-1.84e-01

Pos logit contributions
DEF+0.601
235+0.587
226+0.575
entin+0.562
ELF+0.562
Neg logit contributions
luster-1.087
jriwal-0.999
anmar-0.997
clud-0.960
rete-0.935
Token ng
Token ID23370
Feature activation-7.62e-01

Pos logit contributions
luster+0.310
jriwal+0.257
anmar+0.255
clud+0.234
rete+0.216
Neg logit contributions
DEF-0.747
235-0.735
226-0.729
ELF-0.722
entin-0.721
Token-
Token ID12
Feature activation-6.87e-02

Pos logit contributions
luster+2.405
anmar+2.181
jriwal+2.177
clud+2.099
rete+2.036
Neg logit contributions
DEF-1.949
235-1.893
226-1.864
entin-1.863
ELF-1.857
Tokenrepeat
Token ID44754
Feature activation+8.32e-02

Pos logit contributions
luster+0.235
jriwal+0.215
anmar+0.213
clud+0.204
rete+0.198
Neg logit contributions
DEF-0.161
235-0.157
226-0.155
entin-0.152
ELF-0.151
Token="
Token ID2625
Feature activation+1.47e+00

Pos logit contributions
DEF+0.230
235+0.226
226+0.223
ELF+0.218
309+0.217
Neg logit contributions
luster-0.251
jriwal-0.227
anmar-0.225
clud-0.213
rete-0.207
Tokenphone
Token ID4862
Feature activation-1.82e-01

Pos logit contributions
DEF+3.472
235+3.445
226+3.393
309+3.336
entin+3.287
Neg logit contributions
luster-4.840
jriwal-4.422
anmar-4.333
clud-4.216
galitarian-4.098
Token in
Token ID287
Feature activation+1.74e-01

Pos logit contributions
luster+0.563
jriwal+0.510
anmar+0.509
clud+0.488
rete+0.471
Neg logit contributions
DEF-0.481
235-0.470
226-0.462
ELF-0.459
entin-0.458
Token phones
Token ID9512
Feature activation-9.88e-02

Pos logit contributions
DEF+0.450
235+0.442
226+0.436
entin+0.429
ELF+0.427
Neg logit contributions
luster-0.551
jriwal-0.497
anmar-0.495
clud-0.474
rete-0.459
Token">
Token ID5320
Feature activation+6.59e-01

Pos logit contributions
luster+0.349
jriwal+0.321
anmar+0.320
clud+0.309
rete+0.299
Neg logit contributions
DEF-0.218
235-0.211
226-0.208
ELF-0.206
entin-0.204
Token <
Token ID1279
Feature activation+1.65e-01

Pos logit contributions
DEF+1.256
235+1.219
226+1.196
ELF+1.165
entin+1.165
Neg logit contributions
luster-2.534
jriwal-2.341
anmar-2.332
clud-2.254
rete-2.194
TokenOutput
Token ID26410
Feature activation-4.52e-02

Pos logit contributions
DEF+0.525
235+0.511
226+0.502
ELF+0.492
entin+0.492
Neg logit contributions
luster-0.829
jriwal-0.760
anmar-0.757
clud-0.728
rete-0.707
Tokens
Token ID82
Feature activation-1.96e-01

Pos logit contributions
luster+0.151
jriwal+0.137
anmar+0.137
clud+0.131
rete+0.127
Neg logit contributions
DEF-0.109
235-0.106
226-0.105
entin-0.103
ELF-0.103
Token
Token ID198
Feature activation-2.59e-01

Pos logit contributions
luster+0.624
jriwal+0.566
anmar+0.562
clud+0.539
rete+0.523
Neg logit contributions
DEF-0.501
235-0.489
226-0.482
entin-0.477
ELF-0.474
Token
Token ID198
Feature activation-1.19e-01

Pos logit contributions
luster+0.774
jriwal+0.697
anmar+0.694
clud+0.665
rete+0.643
Neg logit contributions
DEF-0.710
235-0.694
226-0.685
entin-0.678
ELF-0.676
Token{
Token ID90
Feature activation+9.28e-01

Pos logit contributions
luster+0.428
jriwal+0.392
anmar+0.390
clud+0.377
rete+0.367
Neg logit contributions
DEF-0.256
235-0.250
226-0.245
entin-0.244
ELF-0.242
Token {{
Token ID22935
Feature activation+1.44e+00

Pos logit contributions
DEF+2.141
235+2.094
226+2.055
309+1.999
186+1.992
Neg logit contributions
luster-3.199
jriwal-2.934
anmar-2.926
clud-2.790
rete-2.713
Token1
Token ID16
Feature activation+4.34e-01

Pos logit contributions
DEF+3.613
235+3.547
226+3.484
309+3.406
186+3.399
Neg logit contributions
luster-4.641
jriwal-4.234
anmar-4.213
clud-3.963
rete-3.868
Token,
Token ID11
Feature activation+3.58e-01

Pos logit contributions
DEF+1.019
235+0.995
226+0.979
ELF+0.960
entin+0.959
Neg logit contributions
luster-1.477
jriwal-1.349
anmar-1.343
clud-1.291
rete-1.252
Token 6
Token ID718
Feature activation+2.82e-01

Pos logit contributions
DEF+0.942
235+0.925
226+0.912
ELF+0.891
309+0.891
Neg logit contributions
luster-1.121
jriwal-1.017
anmar-1.012
clud-0.966
rete-0.933
Token},
Token ID5512
Feature activation+3.53e-02

Pos logit contributions
DEF+0.522
235+0.507
226+0.496
entin+0.484
ELF+0.484
Neg logit contributions
luster-1.102
jriwal-1.018
anmar-1.016
clud-0.982
rete-0.956
Token {{
Token ID22935
Feature activation+1.01e+00

Pos logit contributions
DEF+0.091
235+0.089
226+0.088
entin+0.087
ELF+0.087
Neg logit contributions
luster-0.112
jriwal-0.101
anmar-0.101
clud-0.097
rete-0.094
Token Up
Token ID3205
Feature activation-7.40e-01

Pos logit contributions
luster+0.818
jriwal+0.754
anmar+0.752
clud+0.727
rete+0.708
Neg logit contributions
DEF-0.411
235-0.398
226-0.390
entin-0.382
ELF-0.382
Token Continue
Token ID10054
Feature activation+6.51e-01

Pos logit contributions
luster+1.979
jriwal+1.798
anmar+1.771
clud+1.681
rete+1.642
Neg logit contributions
DEF-2.220
235-2.156
entin-2.127
226-2.126
ELF-2.098
Token reading
Token ID3555
Feature activation+3.18e-01

Pos logit contributions
DEF+1.582
235+1.546
226+1.523
ELF+1.494
entin+1.493
Neg logit contributions
luster-2.167
jriwal-1.969
anmar-1.965
clud-1.889
rete-1.829
Token the
Token ID262
Feature activation+7.53e-01

Pos logit contributions
DEF+0.820
235+0.802
226+0.790
ELF+0.776
entin+0.775
Neg logit contributions
luster-1.011
jriwal-0.914
anmar-0.912
clud-0.874
rete-0.844
Token main
Token ID1388
Feature activation+2.35e-01

Pos logit contributions
DEF+2.179
235+2.168
226+2.157
entin+2.102
ELF+2.089
Neg logit contributions
luster-2.148
jriwal-1.892
anmar-1.881
clud-1.809
galitarian-1.711
Token story
Token ID1621
Feature activation+1.44e+00

Pos logit contributions
DEF+0.708
235+0.702
226+0.695
entin+0.680
309+0.674
Neg logit contributions
luster-0.650
jriwal-0.569
anmar-0.567
clud-0.547
galitarian-0.519
Token Please
Token ID4222
Feature activation+6.46e-01

Pos logit contributions
DEF+1.341
235+1.306
226+1.254
ELF+1.168
entin+1.162
Neg logit contributions
luster-6.937
jriwal-6.492
anmar-6.442
clud-6.312
rete-6.154
Token verify
Token ID11767
Feature activation+8.34e-01

Pos logit contributions
DEF+0.989
235+0.987
226+0.974
309+0.924
186+0.922
Neg logit contributions
luster-2.752
jriwal-2.517
anmar-2.496
clud-2.431
rete-2.376
Token you
Token ID345
Feature activation+1.85e-01

Pos logit contributions
DEF+2.214
235+2.200
226+2.168
309+2.115
ELF+2.104
Neg logit contributions
luster-2.621
anmar-2.343
jriwal-2.318
clud-2.226
galitarian-2.138
Token're
Token ID821
Feature activation+3.76e-01

Pos logit contributions
DEF+0.559
235+0.551
226+0.544
ELF+0.536
entin+0.535
Neg logit contributions
luster-0.512
anmar-0.452
jriwal-0.449
clud-0.427
rete-0.402
Token not
Token ID407
Feature activation+3.66e-01

Pos logit contributions
DEF+1.090
235+1.075
226+1.058
entin+1.045
ELF+1.043
Neg logit contributions
luster-1.081
anmar-0.959
jriwal-0.959
clud-0.911
rete-0.870
Token struct
Token ID2878
Feature activation-5.22e-01

Pos logit contributions
luster+2.328
jriwal+2.146
anmar+2.117
clud+2.081
rete+2.020
Neg logit contributions
DEF-1.368
235-1.349
226-1.321
ELF-1.306
entin-1.303
Token Atomic
Token ID28976
Feature activation+2.94e-01

Pos logit contributions
luster+1.806
jriwal+1.643
anmar+1.636
clud+1.587
rete+1.542
Neg logit contributions
DEF-1.178
235-1.149
226-1.129
entin-1.117
ELF-1.104
Token <
Token ID1279
Feature activation+5.88e-01

Pos logit contributions
DEF+0.678
235+0.661
226+0.651
entin+0.640
ELF+0.638
Neg logit contributions
luster-1.012
jriwal-0.925
anmar-0.921
clud-0.886
rete-0.860
Token T
Token ID309
Feature activation+1.10e-01

Pos logit contributions
DEF+1.520
235+1.487
226+1.465
ELF+1.450
entin+1.432
Neg logit contributions
luster-1.872
jriwal-1.705
anmar-1.691
clud-1.615
rete-1.564
Token >
Token ID1875
Feature activation+8.41e-01

Pos logit contributions
DEF+0.267
235+0.261
226+0.256
ELF+0.252
entin+0.252
Neg logit contributions
luster-0.368
jriwal-0.335
anmar-0.334
clud-0.321
rete-0.311
Token {
Token ID1391
Feature activation+1.44e+00

Pos logit contributions
DEF+2.096
235+2.049
226+2.014
ELF+1.983
entin+1.977
Neg logit contributions
luster-2.755
anmar-2.501
jriwal-2.486
clud-2.378
rete-2.298
Token...
Token ID2644
Feature activation+9.16e-02

Pos logit contributions
DEF+3.867
235+3.795
226+3.725
ELF+3.688
entin+3.672
Neg logit contributions
luster-4.379
anmar-3.925
jriwal-3.919
clud-3.750
rete-3.601
Token }
Token ID1782
Feature activation+6.85e-01

Pos logit contributions
DEF+0.196
235+0.190
226+0.189
entin+0.185
ELF+0.184
Neg logit contributions
luster-0.328
jriwal-0.302
anmar-0.300
clud-0.289
rete-0.281
Token impl
Token ID4114
Feature activation-3.18e-01

Pos logit contributions
DEF+1.687
235+1.649
226+1.621
entin+1.603
ELF+1.594
Neg logit contributions
luster-2.252
anmar-2.044
jriwal-2.042
clud-1.960
rete-1.896
Token <
Token ID1279
Feature activation+1.72e-01

Pos logit contributions
luster+1.051
anmar+0.958
jriwal+0.955
clud+0.918
rete+0.891
Neg logit contributions
DEF-0.761
235-0.745
226-0.737
entin-0.727
309-0.719
Token T
Token ID309
Feature activation+2.87e-02

Pos logit contributions
DEF+0.457
235+0.447
226+0.441
entin+0.435
ELF+0.433
Neg logit contributions
luster-0.529
jriwal-0.478
anmar-0.476
clud-0.456
rete-0.441
Token |
Token ID930
Feature activation+1.60e-01

Pos logit contributions
DEF+0.486
235+0.475
226+0.467
entin+0.460
ELF+0.459
Neg logit contributions
luster-0.689
jriwal-0.628
anmar-0.626
clud-0.601
rete-0.583
Token 2
Token ID362
Feature activation+9.23e-02

Pos logit contributions
DEF+0.413
235+0.404
226+0.398
ELF+0.391
entin+0.391
Neg logit contributions
luster-0.506
jriwal-0.458
anmar-0.457
clud-0.437
rete-0.424
Token |
Token ID930
Feature activation+2.32e-01

Pos logit contributions
DEF+0.226
235+0.220
226+0.217
entin+0.214
ELF+0.213
Neg logit contributions
luster-0.305
jriwal-0.278
anmar-0.277
clud-0.266
rete-0.258
Token struct
Token ID2878
Feature activation-3.68e-01

Pos logit contributions
DEF+0.612
235+0.599
226+0.590
ELF+0.580
entin+0.579
Neg logit contributions
luster-0.724
jriwal-0.653
anmar-0.653
clud-0.623
rete-0.602
Token Board
Token ID5926
Feature activation-1.49e-01

Pos logit contributions
luster+1.081
jriwal+0.972
anmar+0.967
clud+0.927
rete+0.895
Neg logit contributions
DEF-1.032
235-1.013
226-0.999
entin-0.988
ELF-0.985
Token {
Token ID1391
Feature activation+1.44e+00

Pos logit contributions
luster+0.469
jriwal+0.425
anmar+0.423
clud+0.407
rete+0.394
Neg logit contributions
DEF-0.387
235-0.379
226-0.373
entin-0.368
ELF-0.368
Token |
Token ID930
Feature activation+4.33e-01

Pos logit contributions
DEF+3.855
235+3.798
226+3.727
entin+3.678
ELF+3.668
Neg logit contributions
luster-4.338
anmar-3.883
jriwal-3.880
clud-3.687
galitarian-3.613
Token ^
Token ID10563
Feature activation+7.79e-02

Pos logit contributions
DEF+1.148
235+1.124
226+1.108
entin+1.090
ELF+1.089
Neg logit contributions
luster-1.343
jriwal-1.214
anmar-1.212
clud-1.156
rete-1.115
Token^^^^
Token ID39397
Feature activation-2.54e-01

Pos logit contributions
DEF+0.200
235+0.196
226+0.193
entin+0.190
ELF+0.190
Neg logit contributions
luster-0.248
jriwal-0.225
anmar-0.224
clud-0.214
rete-0.208
Token^^^^
Token ID39397
Feature activation-2.96e-01

Pos logit contributions
luster+0.795
jriwal+0.719
anmar+0.717
clud+0.687
rete+0.665
Neg logit contributions
DEF-0.661
235-0.648
226-0.640
entin-0.631
ELF-0.629
Token^^
Token ID18237
Feature activation-2.88e-01

Pos logit contributions
luster+0.930
jriwal+0.841
anmar+0.838
clud+0.803
rete+0.778
Neg logit contributions
DEF-0.767
235-0.752
226-0.742
entin-0.733
ELF-0.729
Token<
Token ID27
Feature activation+1.57e-01

Pos logit contributions
DEF+0.355
235+0.346
226+0.340
ELF+0.333
entin+0.332
Neg logit contributions
luster-0.585
jriwal-0.536
anmar-0.535
clud-0.515
rete-0.500
Tokenbody
Token ID2618
Feature activation-3.41e-01

Pos logit contributions
DEF+0.405
235+0.398
226+0.392
entin+0.384
ELF+0.384
Neg logit contributions
luster-0.501
jriwal-0.454
anmar-0.451
clud-0.432
rete-0.419
Token ng
Token ID23370
Feature activation-8.45e-01

Pos logit contributions
luster+1.063
jriwal+0.961
anmar+0.957
clud+0.916
rete+0.886
Neg logit contributions
DEF-0.902
235-0.883
226-0.871
entin-0.857
ELF-0.856
Token-
Token ID12
Feature activation-2.91e-01

Pos logit contributions
luster+2.651
anmar+2.412
jriwal+2.399
clud+2.320
rete+2.231
Neg logit contributions
DEF-2.179
235-2.111
entin-2.095
226-2.076
ELF-2.070
Tokencontroller
Token ID36500
Feature activation-7.22e-01

Pos logit contributions
luster+0.816
jriwal+0.732
anmar+0.719
clud+0.689
rete+0.662
Neg logit contributions
DEF-0.856
235-0.844
226-0.836
entin-0.819
ELF-0.817
Token="
Token ID2625
Feature activation+1.43e+00

Pos logit contributions
luster+2.295
anmar+2.098
jriwal+2.081
clud+2.006
rete+1.935
Neg logit contributions
DEF-1.830
235-1.773
226-1.749
entin-1.748
ELF-1.733
TokenPhone
Token ID6132
Feature activation+5.60e-02

Pos logit contributions
DEF+3.682
235+3.662
226+3.611
309+3.504
ELF+3.500
Neg logit contributions
luster-4.472
jriwal-4.079
anmar-3.970
clud-3.867
rete-3.716
TokenList
Token ID8053
Feature activation+4.56e-01

Pos logit contributions
DEF+0.125
235+0.122
226+0.120
entin+0.118
ELF+0.117
Neg logit contributions
luster-0.197
jriwal-0.180
anmar-0.180
clud-0.173
rete-0.168
TokenCtrl
Token ID40069
Feature activation+3.48e-01

Pos logit contributions
DEF+0.956
235+0.934
226+0.917
entin+0.896
ELF+0.889
Neg logit contributions
luster-1.669
jriwal-1.533
anmar-1.521
clud-1.469
rete-1.435
Token">
Token ID5320
Feature activation+6.70e-01

Pos logit contributions
DEF+0.778
235+0.761
226+0.747
ELF+0.727
309+0.726
Neg logit contributions
luster-1.230
jriwal-1.126
anmar-1.121
clud-1.079
rete-1.047
Token <
Token ID1279
Feature activation+2.59e-01

Pos logit contributions
DEF+1.453
235+1.426
226+1.399
309+1.354
ELF+1.347
Neg logit contributions
luster-2.419
jriwal-2.214
anmar-2.198
clud-2.120
rete-2.049
TokenBut
Token ID1537
Feature activation-7.92e-03

Pos logit contributions
DEF+0.510
235+0.497
226+0.488
ELF+0.478
entin+0.477
Neg logit contributions
luster-0.878
jriwal-0.807
anmar-0.804
clud-0.775
rete-0.753
Tokent
Token ID83
Feature activation-6.47e-03

Pos logit contributions
luster+0.025
jriwal+0.023
anmar+0.023
clud+0.022
rete+0.021
Neg logit contributions
DEF-0.020
235-0.020
226-0.020
entin-0.019
ELF-0.019
Token appeared
Token ID4120
Feature activation-1.78e-01

Pos logit contributions
luster+0.020
jriwal+0.019
anmar+0.019
clud+0.018
rete+0.017
Neg logit contributions
DEF-0.017
235-0.016
226-0.016
ELF-0.016
entin-0.016
Token in
Token ID287
Feature activation+2.59e-01

Pos logit contributions
luster+0.530
jriwal+0.478
anmar+0.476
clud+0.454
rete+0.439
Neg logit contributions
DEF-0.494
235-0.484
226-0.478
entin-0.471
ELF-0.471
Token a
Token ID257
Feature activation+2.10e-01

Pos logit contributions
DEF+0.760
235+0.746
226+0.736
entin+0.727
ELF+0.725
Neg logit contributions
luster-0.732
jriwal-0.655
anmar-0.651
clud-0.620
rete-0.597
Token Channel
Token ID11102
Feature activation-1.69e+00

Pos logit contributions
DEF+0.569
235+0.558
226+0.550
entin+0.541
ELF+0.539
Neg logit contributions
luster-0.643
jriwal-0.580
anmar-0.575
clud-0.549
rete-0.530
Token 4
Token ID604
Feature activation+3.13e-01

Pos logit contributions
luster+4.264
anmar+3.764
jriwal+3.741
clud+3.600
rete+3.481
Neg logit contributions
DEF-5.305
235-5.137
226-5.090
entin-5.051
ELF-5.021
Token documentary
Token ID11648
Feature activation+2.60e-01

Pos logit contributions
DEF+0.799
235+0.781
226+0.769
entin+0.755
ELF+0.753
Neg logit contributions
luster-1.010
jriwal-0.911
anmar-0.907
clud-0.870
rete-0.841
Token entitled
Token ID9080
Feature activation+1.61e-01

Pos logit contributions
DEF+0.688
235+0.673
226+0.663
entin+0.653
ELF+0.653
Neg logit contributions
luster-0.808
jriwal-0.731
anmar-0.729
clud-0.698
rete-0.674
Token
Token ID564
Feature activation+4.33e-02

Pos logit contributions
DEF+0.416
235+0.406
226+0.400
entin+0.395
ELF+0.394
Neg logit contributions
luster-0.511
jriwal-0.463
anmar-0.462
clud-0.443
rete-0.429
Token
Token ID250
Feature activation+8.51e-02

Pos logit contributions
DEF+0.129
235+0.127
226+0.125
ELF+0.123
entin+0.123
Neg logit contributions
luster-0.120
jriwal-0.107
anmar-0.106
clud-0.101
rete-0.097
Tokenre
Token ID260
Feature activation+4.98e-01

Pos logit contributions
DEF+0.622
235+0.612
226+0.603
entin+0.594
ELF+0.591
Neg logit contributions
luster-0.706
jriwal-0.640
anmar-0.637
clud-0.606
rete-0.585
Tokenpos
Token ID1930
Feature activation-3.25e-01

Pos logit contributions
DEF+1.440
235+1.417
226+1.399
entin+1.379
ELF+1.370
Neg logit contributions
luster-1.431
jriwal-1.280
anmar-1.279
clud-1.198
rete-1.168
Token.
Token ID764
Feature activation+1.48e-02

Pos logit contributions
luster+0.944
jriwal+0.849
anmar+0.846
clud+0.811
rete+0.776
Neg logit contributions
DEF-0.923
235-0.903
226-0.892
ELF-0.880
entin-0.879
Token files
Token ID3696
Feature activation-1.98e-01

Pos logit contributions
DEF+0.041
235+0.040
226+0.039
entin+0.039
ELF+0.039
Neg logit contributions
luster-0.044
jriwal-0.040
anmar-0.040
clud-0.038
rete-0.037
Token ]
Token ID2361
Feature activation+8.13e-01

Pos logit contributions
luster+0.646
jriwal+0.587
anmar+0.585
clud+0.563
rete+0.545
Neg logit contributions
DEF-0.493
235-0.481
226-0.474
entin-0.467
ELF-0.466
Token WHERE
Token ID33411
Feature activation-1.66e+00

Pos logit contributions
DEF+2.048
235+1.989
226+1.952
ELF+1.921
309+1.900
Neg logit contributions
luster-2.660
anmar-2.442
jriwal-2.425
clud-2.273
rete-2.204
Token RIGHT
Token ID33621
Feature activation+2.17e-01

Pos logit contributions
luster+4.887
jriwal+4.393
anmar+4.306
clud+4.290
rete+4.088
Neg logit contributions
DEF-4.459
235-4.352
226-4.310
entin-4.253
ELF-4.227
Token (
Token ID357
Feature activation+5.72e-01

Pos logit contributions
DEF+0.506
235+0.491
226+0.484
ELF+0.476
entin+0.470
Neg logit contributions
luster-0.746
jriwal-0.684
anmar-0.681
clud-0.651
rete-0.634
Token path
Token ID3108
Feature activation-5.71e-01

Pos logit contributions
DEF+1.512
235+1.476
226+1.454
ELF+1.437
entin+1.424
Neg logit contributions
luster-1.785
jriwal-1.626
anmar-1.620
clud-1.528
rete-1.483
Token,
Token ID837
Feature activation-2.04e-01

Pos logit contributions
luster+1.903
jriwal+1.725
anmar+1.717
clud+1.660
rete+1.606
Neg logit contributions
DEF-1.370
235-1.341
226-1.321
entin-1.304
ELF-1.297
Token 4
Token ID604
Feature activation-2.00e-01

Pos logit contributions
luster+0.624
jriwal+0.563
anmar+0.560
clud+0.538
rete+0.519
Neg logit contributions
DEF-0.547
235-0.535
226-0.528
entin-0.521
ELF-0.520
Token
Token ID447
Feature activation-2.97e-01

Pos logit contributions
luster+1.287
jriwal+1.177
anmar+1.168
clud+1.125
rete+1.086
Neg logit contributions
DEF-0.976
235-0.954
226-0.939
ELF-0.930
entin-0.924
Token
Token ID247
Feature activation-2.33e-01

Pos logit contributions
luster+0.786
jriwal+0.698
anmar+0.694
clud+0.659
rete+0.632
Neg logit contributions
DEF-0.923
235-0.907
226-0.896
entin-0.884
ELF-0.883
Tokens
Token ID82
Feature activation-3.07e-01

Pos logit contributions
luster+0.727
jriwal+0.658
anmar+0.656
clud+0.628
rete+0.607
Neg logit contributions
DEF-0.613
235-0.600
226-0.591
entin-0.582
ELF-0.582
Token US
Token ID1294
Feature activation+1.25e-01

Pos logit contributions
luster+0.953
jriwal+0.862
anmar+0.859
clud+0.822
rete+0.794
Neg logit contributions
DEF-0.814
235-0.797
226-0.785
entin-0.773
ELF-0.773
Token Cons
Token ID3515
Feature activation-4.15e-01

Pos logit contributions
DEF+0.360
235+0.353
226+0.347
entin+0.342
ELF+0.342
Neg logit contributions
luster-0.362
jriwal-0.325
anmar-0.323
clud-0.307
rete-0.297
Tokenulate
Token ID5039
Feature activation-1.65e+00

Pos logit contributions
luster+1.122
jriwal+0.998
anmar+0.994
clud+0.944
rete+0.907
Neg logit contributions
DEF-1.265
235-1.242
226-1.227
entin-1.211
ELF-1.210
Token to
Token ID284
Feature activation-2.10e-01

Pos logit contributions
luster+4.843
anmar+4.392
jriwal+4.368
clud+4.147
wald+4.016
Neg logit contributions
DEF-4.453
235-4.311
ELF-4.285
226-4.277
entin-4.259
Token protest
Token ID5402
Feature activation-2.06e-01

Pos logit contributions
luster+0.577
jriwal+0.514
anmar+0.512
clud+0.486
rete+0.468
Neg logit contributions
DEF-0.634
235-0.622
226-0.614
entin-0.606
ELF-0.604
Token Trump
Token ID1301
Feature activation+6.57e-02

Pos logit contributions
luster+0.607
jriwal+0.546
anmar+0.546
clud+0.519
rete+0.501
Neg logit contributions
DEF-0.576
235-0.563
226-0.556
ELF-0.550
entin-0.548
Token
Token ID447
Feature activation-2.56e-01

Pos logit contributions
DEF+0.151
235+0.148
226+0.145
entin+0.143
ELF+0.142
Neg logit contributions
luster-0.227
jriwal-0.207
anmar-0.206
clud-0.199
rete-0.193
Token
Token ID247
Feature activation-1.08e-01

Pos logit contributions
luster+0.688
jriwal+0.611
anmar+0.609
clud+0.578
rete+0.555
Neg logit contributions
DEF-0.786
235-0.772
226-0.762
entin-0.752
ELF-0.752
Token First
Token ID3274
Feature activation+1.02e-01

Pos logit contributions
DEF+0.726
235+0.710
226+0.700
ELF+0.686
entin+0.684
Neg logit contributions
luster-1.004
jriwal-0.917
anmar-0.912
clud-0.877
rete-0.847
Token in
Token ID287
Feature activation+1.46e-01

Pos logit contributions
DEF+0.260
235+0.254
226+0.250
ELF+0.246
entin+0.246
Neg logit contributions
luster-0.326
jriwal-0.296
anmar-0.295
clud-0.283
rete-0.273
Token the
Token ID262
Feature activation+3.43e-01

Pos logit contributions
DEF+0.337
235+0.329
226+0.323
entin+0.318
ELF+0.318
Neg logit contributions
luster-0.500
jriwal-0.457
anmar-0.456
clud-0.438
rete-0.425
Token Nation
Token ID8741
Feature activation-1.77e-01

Pos logit contributions
DEF+0.802
235+0.784
226+0.771
ELF+0.756
entin+0.756
Neg logit contributions
luster-1.169
jriwal-1.070
anmar-1.063
clud-1.021
rete-0.990
Token Republican
Token ID3415
Feature activation-6.81e-01

Pos logit contributions
luster+0.563
jriwal+0.509
anmar+0.507
clud+0.486
rete+0.471
Neg logit contributions
DEF-0.456
235-0.446
226-0.439
entin-0.434
ELF-0.433
Token Leadership
Token ID26935
Feature activation-1.58e+00

Pos logit contributions
luster+2.042
jriwal+1.841
anmar+1.832
clud+1.744
rete+1.699
Neg logit contributions
DEF-1.848
235-1.813
226-1.781
entin-1.768
ELF-1.764
Token Summit
Token ID20014
Feature activation+6.67e-02

Pos logit contributions
luster+4.980
jriwal+4.505
anmar+4.446
clud+4.309
rete+4.226
Neg logit contributions
DEF-3.893
235-3.790
ELF-3.744
entin-3.720
226-3.712
Token April
Token ID3035
Feature activation+3.63e-01

Pos logit contributions
DEF+0.164
235+0.160
226+0.158
entin+0.155
ELF+0.155
Neg logit contributions
luster-0.220
jriwal-0.200
anmar-0.199
clud-0.191
rete-0.185
Token 18
Token ID1248
Feature activation+7.26e-01

Pos logit contributions
DEF+0.878
235+0.864
226+0.851
ELF+0.828
309+0.826
Neg logit contributions
luster-1.216
jriwal-1.116
anmar-1.105
clud-1.056
rete-1.023
Token,
Token ID11
Feature activation+5.22e-01

Pos logit contributions
DEF+1.711
235+1.672
226+1.647
entin+1.621
ELF+1.607
Neg logit contributions
luster-2.466
jriwal-2.249
anmar-2.240
clud-2.151
rete-2.088
Token 2015
Token ID1853
Feature activation-4.01e-01

Pos logit contributions
DEF+0.886
235+0.860
226+0.839
entin+0.818
ELF+0.815
Neg logit contributions
luster-2.115
jriwal-1.960
anmar-1.956
clud-1.892
rete-1.846
Token's
Token ID338
Feature activation+3.36e-01

Pos logit contributions
luster+0.102
jriwal+0.093
anmar+0.093
clud+0.089
rete+0.086
Neg logit contributions
DEF-0.073
235-0.071
226-0.070
ELF-0.069
entin-0.069
Token paired
Token ID20312
Feature activation+2.05e-01

Pos logit contributions
DEF+0.854
235+0.835
226+0.822
entin+0.810
ELF+0.809
Neg logit contributions
luster-1.080
jriwal-0.980
anmar-0.976
clud-0.936
rete-0.906
Token with
Token ID351
Feature activation+2.70e-01

Pos logit contributions
DEF+0.459
235+0.448
226+0.440
entin+0.432
ELF+0.432
Neg logit contributions
luster-0.722
jriwal-0.662
anmar-0.659
clud-0.635
rete-0.616
Token "
Token ID366
Feature activation-6.12e-02

Pos logit contributions
DEF+0.701
235+0.687
226+0.676
entin+0.668
ELF+0.665
Neg logit contributions
luster-0.852
jriwal-0.772
anmar-0.768
clud-0.737
rete-0.712
TokenDel
Token ID13856
Feature activation-7.35e-01

Pos logit contributions
luster+0.223
jriwal+0.204
anmar+0.204
clud+0.196
rete+0.191
Neg logit contributions
DEF-0.128
235-0.125
226-0.122
entin-0.121
ELF-0.120
Tokeniver
Token ID1428
Feature activation-1.52e+00

Pos logit contributions
luster+2.245
jriwal+2.023
anmar+2.012
clud+1.925
rete+1.866
Neg logit contributions
DEF-1.991
235-1.942
226-1.915
ELF-1.894
entin-1.883
Tokenance
Token ID590
Feature activation-4.94e-01

Pos logit contributions
luster+4.074
anmar+3.608
jriwal+3.560
clud+3.392
wald+3.291
Neg logit contributions
DEF-4.727
235-4.521
entin-4.474
226-4.469
ELF-4.418
Token"
Token ID1
Feature activation-5.53e-01

Pos logit contributions
luster+1.521
jriwal+1.359
anmar+1.351
clud+1.297
rete+1.257
Neg logit contributions
DEF-1.329
235-1.290
226-1.273
ELF-1.265
entin-1.261
Token on
Token ID319
Feature activation-2.30e-01

Pos logit contributions
luster+1.737
anmar+1.562
jriwal+1.559
clud+1.502
rete+1.449
Neg logit contributions
DEF-1.437
235-1.397
226-1.381
ELF-1.365
entin-1.355
Token Aug
Token ID2447
Feature activation+2.23e-01

Pos logit contributions
luster+0.782
jriwal+0.713
anmar+0.712
clud+0.684
rete+0.664
Neg logit contributions
DEF-0.538
235-0.523
226-0.515
entin-0.507
ELF-0.506
Token.
Token ID13
Feature activation-2.56e-01

Pos logit contributions
DEF+0.524
235+0.516
226+0.507
entin+0.496
ELF+0.495
Neg logit contributions
luster-0.759
jriwal-0.696
anmar-0.692
clud-0.663
rete-0.640
Token.
Token ID13
Feature activation-7.59e-02

Pos logit contributions
luster+0.636
anmar+0.580
jriwal+0.580
clud+0.556
rete+0.538
Neg logit contributions
DEF-0.481
235-0.467
226-0.459
ELF-0.457
entin-0.452
Tokenore
Token ID382
Feature activation-4.26e-01

Pos logit contributions
luster+0.202
jriwal+0.179
anmar+0.179
clud+0.170
rete+0.163
Neg logit contributions
DEF-0.234
235-0.229
226-0.227
ELF-0.224
entin-0.224
Tokengon
Token ID14520
Feature activation+1.52e-01

Pos logit contributions
luster+0.978
jriwal+0.851
anmar+0.847
clud+0.795
rete+0.758
Neg logit contributions
DEF-1.472
235-1.449
226-1.433
entin-1.416
ELF-1.415
Token.
Token ID13
Feature activation-2.82e-01

Pos logit contributions
DEF+0.377
235+0.368
226+0.363
entin+0.357
ELF+0.357
Neg logit contributions
luster-0.497
jriwal-0.451
anmar-0.450
clud-0.432
rete-0.418
Tokengov
Token ID9567
Feature activation+1.25e-01

Pos logit contributions
luster+0.710
anmar+0.629
jriwal+0.617
clud+0.596
rete+0.571
Neg logit contributions
DEF-0.906
ELF-0.875
235-0.871
226-0.866
entin-0.865
Token/
Token ID14
Feature activation-1.51e+00

Pos logit contributions
DEF+0.355
235+0.347
226+0.343
entin+0.338
ELF+0.338
Neg logit contributions
luster-0.367
jriwal-0.330
anmar-0.329
clud-0.314
rete-0.302
Tokenpre
Token ID3866
Feature activation-1.14e-01

Pos logit contributions
luster+4.908
anmar+4.494
jriwal+4.444
clud+4.301
rete+4.192
Neg logit contributions
DEF-3.471
235-3.400
entin-3.353
226-3.332
ELF-3.320
Tokenvention
Token ID4018
Feature activation-3.07e-01

Pos logit contributions
luster+0.313
jriwal+0.280
anmar+0.279
clud+0.267
rete+0.256
Neg logit contributions
DEF-0.344
235-0.336
226-0.332
ELF-0.329
entin-0.328
Tokenwell
Token ID4053
Feature activation-1.05e-01

Pos logit contributions
luster+0.904
jriwal+0.813
anmar+0.812
clud+0.774
rete+0.748
Neg logit contributions
DEF-0.857
235-0.838
226-0.826
ELF-0.819
entin-0.817
Tokenness
Token ID1108
Feature activation-2.71e-01

Pos logit contributions
luster+0.250
jriwal+0.218
anmar+0.218
clud+0.205
rete+0.196
Neg logit contributions
DEF-0.354
235-0.348
226-0.345
ELF-0.341
entin-0.340
Token/
Token ID14
Feature activation-9.39e-01

Pos logit contributions
luster+0.804
jriwal+0.724
anmar+0.721
clud+0.688
rete+0.665
Neg logit contributions
DEF-0.753
235-0.736
226-0.726
ELF-0.718
entin-0.717
Tokenre
Token ID260
Feature activation+5.63e-01

Pos logit contributions
DEF+0.724
235+0.709
226+0.698
entin+0.688
ELF+0.686
Neg logit contributions
luster-0.887
jriwal-0.803
anmar-0.800
clud-0.766
rete-0.741
Tokenpos
Token ID1930
Feature activation-2.71e-02

Pos logit contributions
235+1.668
226+1.656
DEF+1.656
entin+1.629
309+1.617
Neg logit contributions
luster-1.583
anmar-1.411
jriwal-1.361
clud-1.258
galitarian-1.225
Token.
Token ID764
Feature activation+4.81e-02

Pos logit contributions
luster+0.091
jriwal+0.083
anmar+0.083
clud+0.080
rete+0.077
Neg logit contributions
DEF-0.065
235-0.063
226-0.062
entin-0.061
ELF-0.061
Token contents
Token ID10154
Feature activation-1.96e-01

Pos logit contributions
DEF+0.114
235+0.112
226+0.110
entin+0.108
ELF+0.108
Neg logit contributions
luster-0.163
jriwal-0.148
anmar-0.148
clud-0.141
rete-0.137
Token ]
Token ID2361
Feature activation+9.93e-01

Pos logit contributions
luster+0.622
jriwal+0.562
anmar+0.561
clud+0.544
rete+0.522
Neg logit contributions
DEF-0.498
235-0.488
226-0.481
entin-0.476
ELF-0.473
Token WHERE
Token ID33411
Feature activation-1.50e+00

Pos logit contributions
DEF+2.619
235+2.533
226+2.500
ELF+2.470
309+2.425
Neg logit contributions
luster-3.121
jriwal-2.844
anmar-2.835
clud-2.649
rete-2.569
Token id
Token ID4686
Feature activation-9.83e-02

Pos logit contributions
luster+5.484
anmar+4.989
jriwal+4.964
clud+4.875
rete+4.737
Neg logit contributions
DEF-2.879
235-2.807
226-2.786
entin-2.775
309-2.714
Token IN
Token ID3268
Feature activation-1.62e-01

Pos logit contributions
luster+0.340
jriwal+0.310
anmar+0.309
clud+0.298
rete+0.290
Neg logit contributions
DEF-0.224
235-0.219
226-0.215
entin-0.212
ELF-0.211
Token (
Token ID357
Feature activation+6.57e-01

Pos logit contributions
luster+0.516
jriwal+0.468
anmar+0.466
clud+0.446
rete+0.431
Neg logit contributions
DEF-0.418
235-0.408
226-0.402
ELF-0.396
entin-0.396
Token SELECT
Token ID33493
Feature activation-7.05e-01

Pos logit contributions
DEF+1.729
235+1.697
226+1.665
ELF+1.644
309+1.630
Neg logit contributions
luster-2.050
jriwal-1.870
anmar-1.862
clud-1.752
rete-1.699
Token id
Token ID4686
Feature activation-3.70e-02

Pos logit contributions
luster+2.101
jriwal+1.885
anmar+1.874
clud+1.813
rete+1.742
Neg logit contributions
DEF-1.929
235-1.894
226-1.871
entin-1.847
ELF-1.832
Token cl
Token ID537
Feature activation-5.51e-01

Pos logit contributions
DEF+0.378
235+0.370
226+0.364
ELF+0.359
entin+0.359
Neg logit contributions
luster-0.393
jriwal-0.354
anmar-0.353
clud-0.334
rete-0.323
Tokenojure
Token ID32511
Feature activation-6.92e-02

Pos logit contributions
luster+0.782
jriwal+0.620
anmar+0.610
clud+0.553
rete+0.504
Neg logit contributions
DEF-2.385
235-2.348
226-2.328
ELF-2.313
entin-2.309
Token.
Token ID764
Feature activation+5.51e-02

Pos logit contributions
luster+0.231
jriwal+0.211
anmar+0.208
clud+0.203
rete+0.196
Neg logit contributions
DEF-0.166
235-0.162
226-0.160
entin-0.158
ELF-0.158
Token contents
Token ID10154
Feature activation-3.65e-01

Pos logit contributions
DEF+0.135
235+0.132
226+0.130
entin+0.127
ELF+0.127
Neg logit contributions
luster-0.183
jriwal-0.166
anmar-0.166
clud-0.158
rete-0.154
Token ]
Token ID2361
Feature activation+6.40e-01

Pos logit contributions
luster+1.156
jriwal+1.045
anmar+1.040
clud+1.013
rete+0.972
Neg logit contributions
DEF-0.927
235-0.907
226-0.894
entin-0.885
ELF-0.879
Token WHERE
Token ID33411
Feature activation-1.45e+00

Pos logit contributions
DEF+1.675
235+1.628
226+1.605
ELF+1.588
309+1.557
Neg logit contributions
luster-2.034
anmar-1.848
jriwal-1.843
clud-1.733
rete-1.683
Token id
Token ID4686
Feature activation-9.40e-03

Pos logit contributions
luster+4.273
jriwal+3.791
anmar+3.775
clud+3.739
rete+3.536
Neg logit contributions
DEF-3.869
235-3.811
entin-3.763
226-3.749
309-3.690
Token IN
Token ID3268
Feature activation-1.61e-01

Pos logit contributions
luster+0.036
jriwal+0.033
anmar+0.033
clud+0.032
rete+0.031
Neg logit contributions
DEF-0.018
235-0.018
226-0.017
entin-0.017
ELF-0.017
Token (
Token ID357
Feature activation+7.44e-01

Pos logit contributions
luster+0.497
jriwal+0.450
anmar+0.448
clud+0.427
rete+0.413
Neg logit contributions
DEF-0.433
235-0.423
226-0.417
ELF-0.411
entin-0.410
Token SELECT
Token ID33493
Feature activation-6.07e-01

Pos logit contributions
DEF+1.662
235+1.607
226+1.578
ELF+1.567
309+1.527
Neg logit contributions
luster-2.628
jriwal-2.428
anmar-2.420
clud-2.281
rete-2.227
Token id
Token ID4686
Feature activation+2.78e-02

Pos logit contributions
luster+1.733
jriwal+1.547
anmar+1.540
clud+1.484
rete+1.420
Neg logit contributions
DEF-1.746
235-1.716
226-1.692
entin-1.675
ELF-1.664
Token
Token ID447
Feature activation+2.49e-01

Pos logit contributions
luster+2.106
jriwal+1.910
anmar+1.906
clud+1.834
rete+1.767
Neg logit contributions
DEF-1.571
235-1.535
226-1.508
ELF-1.499
entin-1.477
Token
Token ID247
Feature activation-6.81e-02

Pos logit contributions
DEF+0.766
235+0.754
226+0.745
entin+0.734
ELF+0.733
Neg logit contributions
luster-0.665
jriwal-0.592
anmar-0.589
clud-0.559
rete-0.537
Tokens
Token ID82
Feature activation+1.66e-02

Pos logit contributions
luster+0.213
jriwal+0.193
anmar+0.192
clud+0.184
rete+0.178
Neg logit contributions
DEF-0.178
235-0.175
226-0.172
ELF-0.170
entin-0.169
Token The
Token ID383
Feature activation-1.61e-01

Pos logit contributions
DEF+0.043
235+0.042
226+0.042
ELF+0.041
entin+0.041
Neg logit contributions
luster-0.052
jriwal-0.047
anmar-0.047
clud-0.045
rete-0.044
Token Last
Token ID4586
Feature activation-3.19e-01

Pos logit contributions
luster+0.533
jriwal+0.485
anmar+0.483
clud+0.464
rete+0.449
Neg logit contributions
DEF-0.396
235-0.387
226-0.381
entin-0.375
ELF-0.374
Token Jedi
Token ID16147
Feature activation-1.44e+00

Pos logit contributions
luster+0.421
jriwal+0.331
anmar+0.320
clud+0.282
rete+0.256
Neg logit contributions
DEF-1.413
235-1.397
226-1.386
entin-1.369
309-1.366
Token,
Token ID11
Feature activation+1.30e-01

Pos logit contributions
luster+4.372
jriwal+3.939
anmar+3.916
clud+3.787
rete+3.634
Neg logit contributions
DEF-3.762
235-3.634
226-3.603
ELF-3.594
309-3.516
Token which
Token ID543
Feature activation+6.04e-02

Pos logit contributions
DEF+0.324
235+0.316
226+0.312
entin+0.307
ELF+0.306
Neg logit contributions
luster-0.423
jriwal-0.384
anmar-0.383
clud-0.367
rete-0.355
Token opened
Token ID4721
Feature activation-4.47e-01

Pos logit contributions
DEF+0.155
235+0.151
226+0.149
entin+0.147
ELF+0.147
Neg logit contributions
luster-0.193
jriwal-0.175
anmar-0.174
clud-0.167
rete-0.161
Token to
Token ID284
Feature activation+5.29e-02

Pos logit contributions
luster+1.387
anmar+1.248
jriwal+1.246
clud+1.193
rete+1.150
Neg logit contributions
DEF-1.174
235-1.146
226-1.133
ELF-1.114
entin-1.113
Token a
Token ID257
Feature activation+4.63e-02

Pos logit contributions
DEF+0.148
235+0.145
226+0.143
entin+0.141
ELF+0.140
Neg logit contributions
luster-0.157
jriwal-0.141
anmar-0.140
clud-0.134
rete-0.129
Token cl
Token ID537
Feature activation-4.32e-01

Pos logit contributions
DEF+0.042
235+0.041
226+0.041
ELF+0.040
entin+0.040
Neg logit contributions
luster-0.050
anmar-0.045
jriwal-0.045
clud-0.043
rete-0.042
Tokenojure
Token ID32511
Feature activation+1.95e-02

Pos logit contributions
luster+0.638
jriwal+0.509
anmar+0.506
clud+0.451
rete+0.412
Neg logit contributions
DEF-1.846
235-1.824
226-1.808
entin-1.791
ELF-1.788
Token.
Token ID764
Feature activation+1.73e-01

Pos logit contributions
DEF+0.044
235+0.043
226+0.043
entin+0.042
ELF+0.042
Neg logit contributions
luster-0.068
jriwal-0.062
anmar-0.062
clud-0.059
rete-0.058
Token files
Token ID3696
Feature activation-4.67e-02

Pos logit contributions
DEF+0.407
235+0.397
226+0.391
ELF+0.384
entin+0.382
Neg logit contributions
luster-0.594
jriwal-0.540
anmar-0.540
clud-0.515
rete-0.501
Token ]
Token ID2361
Feature activation+4.75e-01

Pos logit contributions
luster+0.149
jriwal+0.134
anmar+0.134
clud+0.130
rete+0.125
Neg logit contributions
DEF-0.120
235-0.117
226-0.115
entin-0.114
ELF-0.113
Token WHERE
Token ID33411
Feature activation-1.43e+00

Pos logit contributions
DEF+1.288
235+1.256
226+1.240
ELF+1.228
entin+1.208
Neg logit contributions
luster-1.457
jriwal-1.319
anmar-1.317
clud-1.247
rete-1.205
Token path
Token ID3108
Feature activation-2.42e-01

Pos logit contributions
luster+3.949
jriwal+3.492
anmar+3.451
clud+3.407
rete+3.224
Neg logit contributions
DEF-4.144
235-4.043
entin-4.004
226-3.996
309-3.925
Token LIKE
Token ID34178
Feature activation-2.05e-01

Pos logit contributions
luster+0.810
jriwal+0.736
anmar+0.733
clud+0.707
rete+0.685
Neg logit contributions
DEF-0.579
235-0.566
226-0.557
entin-0.550
ELF-0.547
Token '
Token ID705
Feature activation+8.69e-01

Pos logit contributions
luster+0.652
jriwal+0.591
anmar+0.589
clud+0.564
rete+0.546
Neg logit contributions
DEF-0.525
235-0.514
226-0.506
entin-0.498
ELF-0.498
Token%
Token ID4
Feature activation-1.19e-01

Pos logit contributions
DEF+2.114
235+2.069
226+2.035
ELF+2.001
309+1.981
Neg logit contributions
luster-2.877
jriwal-2.637
anmar-2.627
clud-2.498
rete-2.422
Tokenproject
Token ID16302
Feature activation+6.49e-02

Pos logit contributions
luster+0.425
jriwal+0.390
anmar+0.388
clud+0.375
rete+0.364
Neg logit contributions
DEF-0.258
235-0.251
226-0.247
entin-0.243
ELF-0.242
Tokenell
Token ID695
Feature activation+5.16e-02

Pos logit contributions
DEF+2.185
235+2.145
226+2.117
entin+2.109
309+2.075
Neg logit contributions
luster-2.145
jriwal-1.934
anmar-1.922
clud-1.833
rete-1.745
Token
Token ID14519
Feature activation+1.42e-01

Pos logit contributions
DEF+0.141
235+0.138
226+0.136
entin+0.134
ELF+0.134
Neg logit contributions
luster-0.155
jriwal-0.140
anmar-0.139
clud-0.133
rete-0.129
Token
Token ID234
Feature activation+2.25e-01

Pos logit contributions
DEF+0.351
235+0.346
226+0.340
ELF+0.334
entin+0.334
Neg logit contributions
luster-0.463
jriwal-0.421
anmar-0.419
clud-0.402
rete-0.389
Token (@
Token ID4275
Feature activation+1.56e-01

Pos logit contributions
DEF+0.681
235+0.670
226+0.661
ELF+0.652
entin+0.650
Neg logit contributions
luster-0.612
jriwal-0.547
anmar-0.544
clud-0.516
rete-0.495
Tokens
Token ID82
Feature activation-5.16e-01

Pos logit contributions
DEF+0.348
235+0.340
226+0.334
ELF+0.327
entin+0.326
Neg logit contributions
luster-0.547
jriwal-0.502
anmar-0.500
clud-0.480
rete-0.466
Tokenhel
Token ID2978
Feature activation-1.43e+00

Pos logit contributions
luster+2.214
anmar+2.056
jriwal+2.054
clud+2.003
rete+1.967
Neg logit contributions
DEF-0.752
235-0.708
226-0.686
ELF-0.674
entin-0.670
Tokenby
Token ID1525
Feature activation-1.32e-01

Pos logit contributions
luster+3.766
jriwal+3.329
anmar+3.321
clud+3.194
rete+3.118
Neg logit contributions
DEF-4.426
235-4.267
ELF-4.260
226-4.206
entin-4.131
Tokeny
Token ID88
Feature activation-1.00e-01

Pos logit contributions
luster+0.389
jriwal+0.350
anmar+0.348
clud+0.332
rete+0.320
Neg logit contributions
DEF-0.373
235-0.366
226-0.361
entin-0.356
ELF-0.355
Token512
Token ID25836
Feature activation-2.15e-01

Pos logit contributions
luster+0.304
jriwal+0.274
anmar+0.272
clud+0.260
rete+0.251
Neg logit contributions
DEF-0.274
235-0.269
226-0.265
entin-0.261
ELF-0.261
Token)
Token ID8
Feature activation+1.35e-01

Pos logit contributions
luster+0.662
jriwal+0.598
anmar+0.597
clud+0.573
rete+0.553
Neg logit contributions
DEF-0.572
235-0.556
226-0.548
entin-0.546
ELF-0.544
Token August
Token ID2932
Feature activation+4.29e-01

Pos logit contributions
DEF+0.325
235+0.319
226+0.316
ELF+0.308
309+0.305
Neg logit contributions
luster-0.454
jriwal-0.416
anmar-0.414
clud-0.394
rete-0.382
Token.
Token ID13
Feature activation-2.40e-01

Pos logit contributions
luster+1.225
jriwal+1.113
anmar+1.110
clud+1.066
rete+1.034
Neg logit contributions
DEF-0.951
235-0.922
226-0.907
ELF-0.905
entin-0.899
Tokenc
Token ID66
Feature activation-6.60e-01

Pos logit contributions
luster+0.771
anmar+0.699
jriwal+0.699
clud+0.671
rete+0.649
Neg logit contributions
DEF-0.611
235-0.595
226-0.586
ELF-0.578
entin-0.577
Tokenensus
Token ID7314
Feature activation+2.08e-01

Pos logit contributions
luster+1.611
anmar+1.408
jriwal+1.401
clud+1.328
rete+1.284
Neg logit contributions
DEF-2.189
235-2.136
226-2.109
ELF-2.099
entin-2.084
Token.
Token ID13
Feature activation-3.09e-01

Pos logit contributions
DEF+0.516
235+0.502
226+0.495
ELF+0.490
entin+0.487
Neg logit contributions
luster-0.683
jriwal-0.621
anmar-0.619
clud-0.595
rete-0.576
Tokengov
Token ID9567
Feature activation+1.58e-01

Pos logit contributions
luster+0.789
jriwal+0.695
anmar+0.694
clud+0.660
rete+0.632
Neg logit contributions
DEF-0.986
235-0.965
226-0.954
ELF-0.946
entin-0.944
Token/
Token ID14
Feature activation-1.42e+00

Pos logit contributions
DEF+0.445
235+0.436
226+0.431
entin+0.425
ELF+0.424
Neg logit contributions
luster-0.462
jriwal-0.415
anmar-0.413
clud-0.395
rete-0.380
Tokendata
Token ID7890
Feature activation+2.37e-01

Pos logit contributions
luster+4.671
anmar+4.277
jriwal+4.266
clud+4.124
rete+4.038
Neg logit contributions
DEF-3.343
235-3.229
ELF-3.189
entin-3.177
226-3.171
Token/
Token ID14
Feature activation-7.77e-01

Pos logit contributions
DEF+0.657
235+0.644
226+0.636
entin+0.624
ELF+0.623
Neg logit contributions
luster-0.711
jriwal-0.640
anmar-0.638
clud-0.608
rete-0.587
Token2015
Token ID4626
Feature activation-3.48e-01

Pos logit contributions
luster+2.618
anmar+2.393
jriwal+2.383
clud+2.303
rete+2.240
Neg logit contributions
DEF-1.802
235-1.742
226-1.715
entin-1.711
ELF-1.703
Token/
Token ID14
Feature activation-9.22e-01

Pos logit contributions
luster+1.057
anmar+0.956
jriwal+0.955
clud+0.916
rete+0.887
Neg logit contributions
DEF-0.930
235-0.912
226-0.898
entin-0.893
ELF-0.887
Tokenacs
Token ID16436
Feature activation-5.32e-01

Pos logit contributions
luster+3.026
anmar+2.756
jriwal+2.743
clud+2.658
rete+2.585
Neg logit contributions
DEF-2.199
235-2.143
226-2.099
entin-2.097
ELF-2.094
Token resources
Token ID4133
Feature activation-3.03e-01

Pos logit contributions
luster+0.105
jriwal+0.095
anmar+0.095
clud+0.091
rete+0.088
Neg logit contributions
DEF-0.078
235-0.076
226-0.075
entin-0.073
ELF-0.073
Token<|endoftext|>
Token ID50256
Feature activation+2.89e-02

Pos logit contributions
luster+0.990
jriwal+0.898
anmar+0.895
clud+0.858
rete+0.833
Neg logit contributions
DEF-0.751
235-0.734
226-0.723
entin-0.712
ELF-0.710
TokenWhen
Token ID2215
Feature activation+5.37e-02

Pos logit contributions
DEF+0.062
235+0.061
226+0.060
entin+0.058
ELF+0.058
Neg logit contributions
luster-0.104
jriwal-0.095
anmar-0.095
clud-0.092
rete-0.089
Token ideology
Token ID12959
Feature activation-2.21e-01

Pos logit contributions
DEF+0.142
235+0.139
226+0.137
entin+0.135
ELF+0.135
Neg logit contributions
luster-0.167
jriwal-0.151
anmar-0.150
clud-0.144
rete-0.139
Token takes
Token ID2753
Feature activation-4.13e-01

Pos logit contributions
luster+0.694
jriwal+0.630
anmar+0.625
clud+0.599
rete+0.581
Neg logit contributions
DEF-0.577
235-0.563
226-0.555
entin-0.548
ELF-0.547
Token up
Token ID510
Feature activation-1.42e+00

Pos logit contributions
luster+1.233
jriwal+1.110
anmar+1.106
clud+1.056
rete+1.019
Neg logit contributions
DEF-1.140
235-1.117
226-1.102
entin-1.086
ELF-1.086
Token the
Token ID262
Feature activation-3.37e-01

Pos logit contributions
luster+4.016
anmar+3.533
jriwal+3.518
clud+3.354
galitarian+3.237
Neg logit contributions
DEF-4.051
235-3.956
226-3.889
ELF-3.861
entin-3.847
Token brain
Token ID3632
Feature activation-4.23e-01

Pos logit contributions
luster+0.983
jriwal+0.882
anmar+0.879
clud+0.838
rete+0.807
Neg logit contributions
DEF-0.954
235-0.935
226-0.923
entin-0.910
ELF-0.908
Token space
Token ID2272
Feature activation-7.75e-02

Pos logit contributions
luster+1.354
jriwal+1.222
anmar+1.220
clud+1.168
rete+1.130
Neg logit contributions
DEF-1.079
235-1.051
226-1.036
ELF-1.024
entin-1.022
Token normally
Token ID7685
Feature activation-4.82e-01

Pos logit contributions
luster+0.244
jriwal+0.221
anmar+0.220
clud+0.211
rete+0.204
Neg logit contributions
DEF-0.202
235-0.197
226-0.195
entin-0.192
ELF-0.191
Token reserved
Token ID10395
Feature activation-5.25e-02

Pos logit contributions
luster+1.664
jriwal+1.521
anmar+1.513
clud+1.460
rete+1.415
Neg logit contributions
DEF-1.109
235-1.080
226-1.060
ELF-1.043
entin-1.043
Token.
Token ID13
Feature activation+1.84e-01

Pos logit contributions
luster+1.150
jriwal+1.040
anmar+1.037
clud+0.991
rete+0.959
Neg logit contributions
DEF-0.950
235-0.927
226-0.913
entin-0.903
ELF-0.903
Token
Token ID198
Feature activation-9.10e-02

Pos logit contributions
DEF+0.486
235+0.475
226+0.468
entin+0.461
ELF+0.461
Neg logit contributions
luster-0.572
jriwal-0.518
anmar-0.515
clud-0.493
rete-0.477
Token
Token ID198
Feature activation+6.12e-02

Pos logit contributions
luster+0.274
jriwal+0.247
anmar+0.246
clud+0.236
rete+0.228
Neg logit contributions
DEF-0.248
235-0.243
226-0.240
entin-0.237
ELF-0.236
TokenThe
Token ID464
Feature activation+5.97e-02

Pos logit contributions
DEF+0.138
235+0.135
226+0.132
entin+0.130
ELF+0.130
Neg logit contributions
luster-0.214
jriwal-0.195
anmar-0.195
clud-0.188
rete-0.182
Token Congressional
Token ID15757
Feature activation+2.15e-01

Pos logit contributions
DEF+0.164
235+0.160
226+0.158
entin+0.156
ELF+0.156
Neg logit contributions
luster-0.180
jriwal-0.162
anmar-0.161
clud-0.154
rete-0.149
Token Leadership
Token ID26935
Feature activation-1.42e+00

Pos logit contributions
DEF+0.564
235+0.549
226+0.542
ELF+0.536
entin+0.535
Neg logit contributions
luster-0.673
jriwal-0.609
anmar-0.606
clud-0.579
rete-0.563
Token Fund
Token ID7557
Feature activation-4.97e-01

Pos logit contributions
luster+4.554
jriwal+4.137
anmar+4.117
clud+3.983
rete+3.834
Neg logit contributions
DEF-3.477
235-3.330
ELF-3.318
226-3.282
entin-3.279
Token,
Token ID11
Feature activation-3.56e-02

Pos logit contributions
luster+1.604
anmar+1.458
jriwal+1.454
clud+1.393
rete+1.346
Neg logit contributions
DEF-1.237
235-1.207
226-1.189
ELF-1.179
entin-1.172
Token a
Token ID257
Feature activation+2.41e-02

Pos logit contributions
luster+0.117
jriwal+0.106
anmar+0.106
clud+0.102
rete+0.099
Neg logit contributions
DEF-0.088
235-0.086
226-0.084
entin-0.083
ELF-0.083
Token political
Token ID1964
Feature activation-1.28e-01

Pos logit contributions
DEF+0.064
235+0.063
226+0.062
entin+0.061
ELF+0.061
Neg logit contributions
luster-0.074
jriwal-0.067
anmar-0.067
clud-0.064
rete-0.062
Token action
Token ID2223
Feature activation-2.98e-01

Pos logit contributions
luster+0.385
jriwal+0.347
anmar+0.346
clud+0.331
rete+0.319
Neg logit contributions
DEF-0.351
235-0.343
226-0.338
ELF-0.334
entin-0.334
Token,
Token ID11
Feature activation+1.34e-01

Pos logit contributions
DEF+0.010
235+0.010
226+0.010
entin+0.009
ELF+0.009
Neg logit contributions
luster-0.013
jriwal-0.012
anmar-0.012
clud-0.012
rete-0.011
Token while
Token ID981
Feature activation+1.69e-01

Pos logit contributions
DEF+0.317
235+0.310
226+0.305
entin+0.299
ELF+0.299
Neg logit contributions
luster-0.456
jriwal-0.416
anmar-0.415
clud-0.398
rete-0.386
Token Gand
Token ID17727
Feature activation+2.22e-01

Pos logit contributions
DEF+0.401
235+0.391
226+0.385
entin+0.378
ELF+0.378
Neg logit contributions
luster-0.571
jriwal-0.521
anmar-0.519
clud-0.499
rete-0.483
Tokenalf
Token ID1604
Feature activation+6.80e-01

Pos logit contributions
DEF+0.555
235+0.545
226+0.536
entin+0.527
ELF+0.525
Neg logit contributions
luster-0.718
jriwal-0.658
anmar-0.654
clud-0.626
rete-0.601
Token dep
Token ID1207
Feature activation+1.00e-01

Pos logit contributions
DEF+1.726
235+1.689
226+1.666
entin+1.639
309+1.628
Neg logit contributions
luster-2.177
anmar-1.974
jriwal-1.972
clud-1.890
rete-1.827
Tokenarts
Token ID5889
Feature activation-1.42e+00

Pos logit contributions
DEF+0.306
235+0.301
226+0.297
entin+0.295
309+0.292
Neg logit contributions
luster-0.268
jriwal-0.238
anmar-0.237
clud-0.225
rete-0.215
Token to
Token ID284
Feature activation-3.69e-01

Pos logit contributions
luster+4.270
anmar+3.822
jriwal+3.820
clud+3.668
rete+3.572
Neg logit contributions
DEF-3.761
235-3.645
226-3.611
entin-3.587
ELF-3.581
Token seek
Token ID5380
Feature activation-3.88e-01

Pos logit contributions
luster+1.129
jriwal+1.019
anmar+1.015
clud+0.971
rete+0.938
Neg logit contributions
DEF-0.995
235-0.974
226-0.960
entin-0.946
ELF-0.945
Token help
Token ID1037
Feature activation-9.12e-02

Pos logit contributions
luster+1.224
jriwal+1.102
anmar+1.097
clud+1.058
rete+1.023
Neg logit contributions
DEF-1.008
235-0.981
226-0.968
ELF-0.966
entin-0.956
Token from
Token ID422
Feature activation+5.38e-02

Pos logit contributions
luster+0.281
jriwal+0.253
anmar+0.251
clud+0.241
rete+0.234
Neg logit contributions
DEF-0.243
235-0.238
226-0.234
entin-0.232
ELF-0.232
Token Tree
Token ID12200
Feature activation+4.60e-01

Pos logit contributions
DEF+0.138
235+0.135
226+0.133
entin+0.131
ELF+0.131
Neg logit contributions
luster-0.172
jriwal-0.156
anmar-0.155
clud-0.149
rete-0.144
Token discussing
Token ID11142
Feature activation-3.72e-01

Pos logit contributions
luster+0.767
jriwal+0.697
anmar+0.696
clud+0.666
rete+0.645
Neg logit contributions
DEF-0.606
235-0.593
226-0.584
entin-0.576
ELF-0.575
Token the
Token ID262
Feature activation-7.05e-02

Pos logit contributions
luster+1.109
anmar+0.999
jriwal+0.996
clud+0.952
rete+0.921
Neg logit contributions
DEF-1.024
235-0.997
226-0.982
entin-0.975
ELF-0.971
Token sale
Token ID5466
Feature activation-8.27e-02

Pos logit contributions
luster+0.216
jriwal+0.195
anmar+0.193
clud+0.185
rete+0.179
Neg logit contributions
DEF-0.191
235-0.188
226-0.185
ELF-0.182
entin-0.181
Token of
Token ID286
Feature activation+6.80e-02

Pos logit contributions
luster+0.234
jriwal+0.209
anmar+0.208
clud+0.198
rete+0.191
Neg logit contributions
DEF-0.242
235-0.237
226-0.234
entin-0.231
ELF-0.231
Token aborted
Token ID46847
Feature activation-3.07e-01

Pos logit contributions
DEF+0.188
235+0.185
226+0.182
entin+0.179
ELF+0.179
Neg logit contributions
luster-0.203
jriwal-0.182
anmar-0.182
clud-0.174
rete-0.167
Token fetal
Token ID27652
Feature activation-1.41e+00

Pos logit contributions
luster+0.740
jriwal+0.647
anmar+0.646
clud+0.609
rete+0.583
Neg logit contributions
DEF-1.023
235-1.003
226-0.992
ELF-0.982
entin-0.982
Token tissue
Token ID10712
Feature activation-7.45e-01

Pos logit contributions
luster+3.376
anmar+2.968
jriwal+2.907
clud+2.808
rete+2.682
Neg logit contributions
DEF-4.565
235-4.463
entin-4.434
226-4.425
ELF-4.424
Token.
Token ID13
Feature activation-3.54e-02

Pos logit contributions
luster+2.315
anmar+2.103
jriwal+2.101
clud+2.017
rete+1.946
Neg logit contributions
DEF-1.931
235-1.872
226-1.845
entin-1.831
ELF-1.829
Token But
Token ID887
Feature activation+2.42e-01

Pos logit contributions
luster+0.110
jriwal+0.099
anmar+0.099
clud+0.094
rete+0.091
Neg logit contributions
DEF-0.094
235-0.092
226-0.091
ELF-0.090
entin-0.089
Token,
Token ID11
Feature activation-2.67e-01

Pos logit contributions
DEF+0.648
235+0.639
226+0.630
ELF+0.618
entin+0.617
Neg logit contributions
luster-0.742
jriwal-0.672
anmar-0.666
clud-0.638
rete-0.616
Token in
Token ID287
Feature activation-1.80e-01

Pos logit contributions
luster+0.809
jriwal+0.729
anmar+0.727
clud+0.695
rete+0.671
Neg logit contributions
DEF-0.724
235-0.709
226-0.699
entin-0.689
ELF-0.688
Token,
Token ID11
Feature activation-6.76e-02

Pos logit contributions
DEF+1.631
235+1.594
226+1.571
entin+1.540
ELF+1.534
Neg logit contributions
luster-2.303
anmar-2.103
jriwal-2.078
clud-2.002
rete-1.936
Token Congress
Token ID3162
Feature activation+3.29e-01

Pos logit contributions
luster+0.193
jriwal+0.173
anmar+0.172
clud+0.164
rete+0.158
Neg logit contributions
DEF-0.196
235-0.192
226-0.189
entin-0.187
ELF-0.187
Token made
Token ID925
Feature activation-1.17e-01

Pos logit contributions
DEF+0.860
235+0.842
226+0.829
entin+0.818
ELF+0.817
Neg logit contributions
luster-1.029
jriwal-0.934
anmar-0.927
clud-0.889
rete-0.859
Token clear
Token ID1598
Feature activation-5.07e-01

Pos logit contributions
luster+0.341
jriwal+0.310
anmar+0.307
clud+0.293
rete+0.281
Neg logit contributions
DEF-0.331
235-0.323
226-0.319
entin-0.315
ELF-0.315
Token its
Token ID663
Feature activation+1.97e-01

Pos logit contributions
luster+1.445
jriwal+1.307
anmar+1.292
clud+1.236
rete+1.187
Neg logit contributions
DEF-1.462
235-1.434
226-1.415
ELF-1.396
entin-1.395
Token intent
Token ID6824
Feature activation-1.41e+00

Pos logit contributions
DEF+0.527
235+0.515
226+0.508
entin+0.501
ELF+0.501
Neg logit contributions
luster-0.607
jriwal-0.547
anmar-0.545
clud-0.522
rete-0.504
Token to
Token ID284
Feature activation+1.97e-01

Pos logit contributions
luster+3.669
jriwal+3.351
anmar+3.247
clud+3.088
rete+2.972
Neg logit contributions
DEF-4.261
235-4.157
ELF-4.110
226-4.089
entin-4.089
Token deflect
Token ID31685
Feature activation+4.65e-02

Pos logit contributions
DEF+0.547
235+0.536
226+0.528
entin+0.520
ELF+0.520
Neg logit contributions
luster-0.586
jriwal-0.526
anmar-0.525
clud-0.501
rete-0.484
Token political
Token ID1964
Feature activation+3.87e-01

Pos logit contributions
DEF+0.115
235+0.113
226+0.111
entin+0.109
ELF+0.109
Neg logit contributions
luster-0.152
jriwal-0.139
anmar-0.138
clud-0.132
rete-0.128
Token fire
Token ID2046
Feature activation+3.90e-01

Pos logit contributions
DEF+1.029
235+1.007
226+0.993
entin+0.978
ELF+0.976
Neg logit contributions
luster-1.195
jriwal-1.078
anmar-1.075
clud-1.032
rete-0.996
Token from
Token ID422
Feature activation-1.11e-01

Pos logit contributions
DEF+0.977
235+0.964
226+0.950
entin+0.932
ELF+0.926
Neg logit contributions
luster-1.260
anmar-1.147
jriwal-1.137
clud-1.101
rete-1.058
Tokena
Token ID64
Feature activation-2.86e-01

Pos logit contributions
luster+2.601
jriwal+2.355
anmar+2.346
clud+2.247
rete+2.174
Neg logit contributions
DEF-2.166
235-2.118
226-2.088
entin-2.056
ELF-2.056
Token.
Token ID13
Feature activation-1.18e-01

Pos logit contributions
luster+0.961
jriwal+0.876
anmar+0.872
clud+0.837
rete+0.811
Neg logit contributions
DEF-0.686
235-0.672
226-0.661
entin-0.648
ELF-0.647
Tokendot
Token ID26518
Feature activation-3.41e-01

Pos logit contributions
luster+0.314
jriwal+0.280
anmar+0.278
clud+0.263
rete+0.252
Neg logit contributions
DEF-0.363
235-0.358
226-0.354
entin-0.348
ELF-0.347
Token.
Token ID13
Feature activation-3.78e-02

Pos logit contributions
luster+1.112
jriwal+1.009
anmar+1.007
clud+0.974
rete+0.942
Neg logit contributions
DEF-0.845
235-0.819
226-0.806
ELF-0.803
entin-0.799
Tokengov
Token ID9567
Feature activation+1.46e-01

Pos logit contributions
luster+0.103
jriwal+0.092
anmar+0.091
clud+0.086
rete+0.083
Neg logit contributions
DEF-0.114
235-0.114
226-0.112
entin-0.110
309-0.110
Token/
Token ID14
Feature activation-1.40e+00

Pos logit contributions
DEF+0.416
235+0.407
226+0.402
ELF+0.397
entin+0.396
Neg logit contributions
luster-0.424
jriwal-0.380
anmar-0.379
clud-0.361
rete-0.348
Tokenpub
Token ID12984
Feature activation-6.30e-01

Pos logit contributions
luster+4.650
anmar+4.220
jriwal+4.204
clud+4.071
rete+3.972
Neg logit contributions
DEF-3.214
235-3.170
226-3.113
entin-3.112
ELF-3.084
Tokens
Token ID82
Feature activation-3.98e-01

Pos logit contributions
luster+1.878
jriwal+1.703
anmar+1.684
clud+1.629
rete+1.571
Neg logit contributions
DEF-1.722
235-1.686
226-1.662
ELF-1.656
entin-1.642
Token/
Token ID14
Feature activation-1.02e+00

Pos logit contributions
luster+1.118
jriwal+1.006
anmar+1.003
clud+0.958
rete+0.922
Neg logit contributions
DEF-1.154
235-1.129
226-1.114
ELF-1.107
entin-1.101
Token8
Token ID23
Feature activation-3.73e-01

Pos logit contributions
luster+3.434
anmar+3.131
jriwal+3.107
clud+3.017
rete+2.934
Neg logit contributions
DEF-2.341
235-2.289
entin-2.254
226-2.242
ELF-2.230
Token11
Token ID1157
Feature activation+2.85e-01

Pos logit contributions
luster+1.367
anmar+1.258
jriwal+1.255
clud+1.212
rete+1.182
Neg logit contributions
DEF-0.765
235-0.736
226-0.723
entin-0.722
ELF-0.721
Token cameras
Token ID9073
Feature activation-1.38e-01

Pos logit contributions
luster+0.870
jriwal+0.786
anmar+0.782
clud+0.748
rete+0.722
Neg logit contributions
DEF-0.769
235-0.753
226-0.743
entin-0.732
ELF-0.732
Token being
Token ID852
Feature activation-3.96e-02

Pos logit contributions
luster+0.446
jriwal+0.404
anmar+0.404
clud+0.387
rete+0.375
Neg logit contributions
DEF-0.349
235-0.341
226-0.336
entin-0.331
ELF-0.330
Token defective
Token ID32129
Feature activation+6.58e-02

Pos logit contributions
luster+0.125
jriwal+0.113
anmar+0.112
clud+0.108
rete+0.104
Neg logit contributions
DEF-0.103
235-0.101
226-0.100
ELF-0.098
entin-0.098
Token (
Token ID357
Feature activation-3.33e-01

Pos logit contributions
DEF+0.163
235+0.160
226+0.157
entin+0.155
ELF+0.154
Neg logit contributions
luster-0.215
jriwal-0.196
anmar-0.195
clud-0.187
rete-0.181
Tokencons
Token ID5936
Feature activation-6.03e-01

Pos logit contributions
luster+1.066
anmar+0.962
jriwal+0.959
clud+0.921
rete+0.893
Neg logit contributions
DEF-0.846
235-0.826
226-0.813
entin-0.808
ELF-0.802
Tokenist
Token ID396
Feature activation-1.40e+00

Pos logit contributions
luster+1.593
jriwal+1.416
anmar+1.412
clud+1.346
rete+1.290
Neg logit contributions
DEF-1.876
235-1.832
226-1.806
ELF-1.794
entin-1.773
Tokenently
Token ID1473
Feature activation-5.14e-01

Pos logit contributions
luster+4.485
anmar+4.071
jriwal+4.044
clud+3.875
rete+3.842
Neg logit contributions
DEF-3.528
235-3.377
226-3.319
ELF-3.309
186-3.233
Token front
Token ID2166
Feature activation-1.14e-01

Pos logit contributions
luster+1.607
anmar+1.443
jriwal+1.443
clud+1.385
rete+1.340
Neg logit contributions
DEF-1.350
235-1.315
226-1.297
entin-1.282
ELF-1.275
Token-
Token ID12
Feature activation-1.48e-01

Pos logit contributions
luster+0.363
jriwal+0.329
anmar+0.328
clud+0.314
rete+0.304
Neg logit contributions
DEF-0.295
235-0.288
226-0.284
entin-0.280
ELF-0.279
Tokenf
Token ID69
Feature activation-5.78e-02

Pos logit contributions
luster+0.450
jriwal+0.406
anmar+0.405
clud+0.387
rete+0.374
Neg logit contributions
DEF-0.402
235-0.393
226-0.388
entin-0.382
ELF-0.382
Tokenocusing
Token ID45743
Feature activation-4.54e-01

Pos logit contributions
luster+0.173
jriwal+0.156
anmar+0.155
clud+0.148
rete+0.143
Neg logit contributions
DEF-0.159
235-0.156
226-0.154
entin-0.152
ELF-0.152
Token v
Token ID410
Feature activation-5.67e-01

Pos logit contributions
luster+1.430
anmar+1.289
jriwal+1.288
clud+1.246
rete+1.201
Neg logit contributions
DEF-1.149
235-1.129
226-1.114
entin-1.098
ELF-1.097
Token.
Token ID13
Feature activation-2.52e-01

Pos logit contributions
luster+1.874
jriwal+1.704
anmar+1.701
clud+1.635
rete+1.583
Neg logit contributions
DEF-1.385
235-1.352
226-1.331
entin-1.313
ELF-1.308
Token Windsor
Token ID31561
Feature activation+1.33e-01

Pos logit contributions
luster+0.856
anmar+0.778
jriwal+0.776
clud+0.751
rete+0.727
Neg logit contributions
DEF-0.589
235-0.575
226-0.566
entin-0.560
ELF-0.557
Token struck
Token ID7425
Feature activation-5.70e-01

Pos logit contributions
DEF+0.344
235+0.337
226+0.332
entin+0.327
ELF+0.326
Neg logit contributions
luster-0.420
jriwal-0.381
anmar-0.379
clud-0.363
rete-0.351
Token down
Token ID866
Feature activation-9.85e-02

Pos logit contributions
luster+1.780
jriwal+1.595
anmar+1.594
clud+1.565
rete+1.477
Neg logit contributions
DEF-1.471
235-1.438
226-1.417
ELF-1.409
entin-1.397
Token Section
Token ID7275
Feature activation-1.38e+00

Pos logit contributions
luster+0.299
anmar+0.270
jriwal+0.269
clud+0.258
rete+0.249
Neg logit contributions
DEF-0.267
235-0.261
226-0.257
entin-0.254
ELF-0.254
Token 3
Token ID513
Feature activation-9.27e-01

Pos logit contributions
luster+3.882
anmar+3.451
jriwal+3.364
clud+3.318
rete+3.191
Neg logit contributions
DEF-3.966
entin-3.814
235-3.774
ELF-3.738
226-3.703
Token of
Token ID286
Feature activation-3.99e-01

Pos logit contributions
luster+2.575
anmar+2.294
jriwal+2.260
clud+2.204
galitarian+2.101
Neg logit contributions
DEF-2.650
entin-2.633
ELF-2.583
235-2.548
226-2.510
Token the
Token ID262
Feature activation-4.53e-01

Pos logit contributions
luster+1.153
jriwal+1.030
anmar+1.029
clud+0.992
rete+0.949
Neg logit contributions
DEF-1.135
235-1.111
226-1.095
entin-1.086
ELF-1.085
Token Defense
Token ID5947
Feature activation-7.58e-01

Pos logit contributions
luster+1.541
anmar+1.409
jriwal+1.389
clud+1.353
rete+1.316
Neg logit contributions
DEF-1.052
235-1.027
226-1.008
entin-1.001
ELF-0.993
Token Of
Token ID3226
Feature activation-6.29e-01

Pos logit contributions
luster+2.067
anmar+1.831
jriwal+1.825
clud+1.739
rete+1.682
Neg logit contributions
DEF-2.278
235-2.225
226-2.195
ELF-2.184
entin-2.179
Token meeting
Token ID3249
Feature activation+1.63e-01

Pos logit contributions
luster+1.022
jriwal+0.927
anmar+0.925
clud+0.886
rete+0.859
Neg logit contributions
DEF-0.799
235-0.780
226-0.769
entin-0.759
ELF-0.757
Token that
Token ID326
Feature activation+9.59e-02

Pos logit contributions
DEF+0.385
235+0.377
226+0.371
entin+0.364
ELF+0.363
Neg logit contributions
luster-0.550
jriwal-0.502
anmar-0.500
clud-0.481
rete-0.466
Token six
Token ID2237
Feature activation-1.56e-01

Pos logit contributions
DEF+0.270
235+0.265
226+0.262
ELF+0.257
entin+0.257
Neg logit contributions
luster-0.282
jriwal-0.253
anmar-0.252
clud-0.241
rete-0.232
Token Cabinet
Token ID20384
Feature activation+5.87e-01

Pos logit contributions
luster+0.511
jriwal+0.464
anmar+0.463
clud+0.444
rete+0.430
Neg logit contributions
DEF-0.388
235-0.380
226-0.374
entin-0.368
ELF-0.367
Token ministers
Token ID14138
Feature activation+2.40e-01

Pos logit contributions
DEF+1.293
235+1.259
226+1.238
entin+1.216
ELF+1.205
Neg logit contributions
luster-2.086
jriwal-1.909
anmar-1.905
clud-1.839
rete-1.775
Token belonging
Token ID16686
Feature activation+8.70e-01

Pos logit contributions
DEF+0.622
235+0.610
226+0.601
entin+0.591
ELF+0.589
Neg logit contributions
luster-0.760
jriwal-0.689
anmar-0.686
clud-0.655
rete-0.635
Token to
Token ID284
Feature activation+2.59e-01

Pos logit contributions
DEF+2.283
235+2.267
226+2.231
309+2.179
ELF+2.168
Neg logit contributions
luster-2.738
jriwal-2.453
anmar-2.451
clud-2.343
rete-2.232
Token the
Token ID262
Feature activation+1.06e-01

Pos logit contributions
DEF+0.668
235+0.654
226+0.645
ELF+0.632
entin+0.632
Neg logit contributions
luster-0.825
jriwal-0.747
anmar-0.743
clud-0.713
rete-0.690
Token party
Token ID2151
Feature activation-4.50e-02

Pos logit contributions
DEF+0.288
235+0.282
226+0.278
ELF+0.273
entin+0.273
Neg logit contributions
luster-0.324
jriwal-0.293
anmar-0.291
clud-0.279
rete-0.269
Token as
Token ID355
Feature activation+4.15e-02

Pos logit contributions
luster+0.144
jriwal+0.130
anmar+0.130
clud+0.124
rete+0.120
Neg logit contributions
DEF-0.115
235-0.113
226-0.111
entin-0.109
ELF-0.109
Token well
Token ID880
Feature activation-3.34e-01

Pos logit contributions
DEF+0.115
235+0.113
226+0.111
ELF+0.110
entin+0.109
Neg logit contributions
luster-0.124
jriwal-0.111
anmar-0.111
clud-0.106
rete-0.102
Token is
Token ID318
Feature activation+1.86e-02

Pos logit contributions
DEF+0.395
235+0.387
226+0.381
entin+0.375
ELF+0.374
Neg logit contributions
luster-0.502
jriwal-0.456
anmar-0.454
clud-0.435
rete-0.421
Token now
Token ID783
Feature activation+4.73e-01

Pos logit contributions
DEF+0.048
235+0.047
226+0.046
entin+0.045
ELF+0.045
Neg logit contributions
luster-0.059
jriwal-0.053
anmar-0.053
clud-0.051
rete-0.049
Token Gand
Token ID17727
Feature activation+4.41e-01

Pos logit contributions
DEF+1.237
235+1.210
226+1.193
entin+1.173
ELF+1.173
Neg logit contributions
luster-1.482
jriwal-1.342
anmar-1.338
clud-1.282
rete-1.235
Tokenalf
Token ID1604
Feature activation+8.44e-01

Pos logit contributions
DEF+1.085
235+1.068
226+1.051
entin+1.037
309+1.030
Neg logit contributions
luster-1.438
jriwal-1.322
anmar-1.308
clud-1.256
galitarian-1.206
Token the
Token ID262
Feature activation+3.12e-01

Pos logit contributions
DEF+1.899
235+1.860
226+1.835
entin+1.802
309+1.783
Neg logit contributions
luster-2.936
jriwal-2.692
anmar-2.682
clud-2.580
rete-2.490
Token White
Token ID2635
Feature activation+1.05e+00

Pos logit contributions
DEF+0.814
235+0.798
226+0.790
ELF+0.774
entin+0.773
Neg logit contributions
luster-0.975
jriwal-0.889
anmar-0.883
clud-0.844
rete-0.815
Token,
Token ID11
Feature activation+3.95e-01

Pos logit contributions
235+2.336
DEF+2.324
226+2.303
309+2.237
ELF+2.232
Neg logit contributions
luster-3.561
jriwal-3.332
anmar-3.331
clud-3.122
rete-3.051
Token for
Token ID329
Feature activation+2.45e-01

Pos logit contributions
DEF+1.009
235+0.988
226+0.974
ELF+0.955
entin+0.954
Neg logit contributions
luster-1.264
jriwal-1.148
anmar-1.145
clud-1.095
rete-1.058
Token he
Token ID339
Feature activation-4.94e-02

Pos logit contributions
DEF+0.613
235+0.602
226+0.593
entin+0.582
ELF+0.581
Neg logit contributions
luster-0.794
jriwal-0.722
anmar-0.720
clud-0.689
rete-0.666
Token has
Token ID468
Feature activation+2.31e-01

Pos logit contributions
luster+0.157
jriwal+0.143
anmar+0.142
clud+0.137
rete+0.132
Neg logit contributions
DEF-0.126
235-0.123
226-0.121
ELF-0.120
entin-0.120
Token taken
Token ID2077
Feature activation+3.77e-01

Pos logit contributions
DEF+0.599
235+0.586
226+0.577
entin+0.569
ELF+0.568
Neg logit contributions
luster-0.727
jriwal-0.659
anmar-0.656
clud-0.629
rete-0.608
Token a
Token ID257
Feature activation-7.55e-02

Pos logit contributions
luster+0.190
anmar+0.172
jriwal+0.171
clud+0.164
rete+0.159
Neg logit contributions
DEF-0.155
235-0.151
226-0.149
entin-0.147
ELF-0.147
Token licensing
Token ID15665
Feature activation+5.00e-02

Pos logit contributions
luster+0.222
anmar+0.200
jriwal+0.199
clud+0.190
rete+0.184
Neg logit contributions
DEF-0.212
235-0.207
226-0.204
entin-0.202
ELF-0.202
Token division
Token ID7297
Feature activation-3.59e-01

Pos logit contributions
DEF+0.131
235+0.129
226+0.127
entin+0.125
ELF+0.125
Neg logit contributions
luster-0.156
jriwal-0.141
anmar-0.141
clud-0.135
rete-0.130
Token of
Token ID286
Feature activation-3.55e-01

Pos logit contributions
luster+1.011
anmar+0.906
jriwal+0.901
clud+0.860
rete+0.833
Neg logit contributions
DEF-1.044
235-1.020
226-1.008
ELF-1.002
entin-0.999
Token AT
Token ID5161
Feature activation-3.16e-02

Pos logit contributions
luster+1.087
anmar+0.982
jriwal+0.977
clud+0.932
rete+0.907
Neg logit contributions
DEF-0.952
235-0.932
226-0.918
ELF-0.907
entin-0.906
Token&
Token ID5
Feature activation+8.67e-01

Pos logit contributions
luster+0.126
anmar+0.117
jriwal+0.117
clud+0.113
rete+0.110
Neg logit contributions
DEF-0.055
235-0.053
226-0.052
entin-0.051
ELF-0.051
TokenT
Token ID51
Feature activation+4.38e-01

Pos logit contributions
DEF+1.495
235+1.441
226+1.406
ELF+1.363
309+1.351
Neg logit contributions
luster-3.591
jriwal-3.345
anmar-3.341
clud-3.195
rete-3.098
Token/
Token ID14
Feature activation+5.32e-03

Pos logit contributions
DEF+1.043
235+1.018
226+1.002
ELF+0.980
entin+0.978
Neg logit contributions
luster-1.479
jriwal-1.355
anmar-1.343
clud-1.294
rete-1.252
TokenWestern
Token ID24227
Feature activation+4.25e-01

Pos logit contributions
DEF+0.011
235+0.010
226+0.010
entin+0.010
ELF+0.010
Neg logit contributions
luster-0.020
jriwal-0.018
anmar-0.018
clud-0.018
rete-0.017
Token Electric
Token ID13944
Feature activation+9.46e-01

Pos logit contributions
DEF+0.914
235+0.891
226+0.874
entin+0.854
ELF+0.854
Neg logit contributions
luster-1.535
jriwal-1.409
anmar-1.400
clud-1.352
rete-1.312
Token,
Token ID11
Feature activation+2.73e-01

Pos logit contributions
DEF+2.291
235+2.249
226+2.215
309+2.162
ELF+2.153
Neg logit contributions
luster-3.140
jriwal-2.872
anmar-2.827
clud-2.742
galitarian-2.625
Token in
Token ID287
Feature activation+3.15e-01

Pos logit contributions
luster+0.944
jriwal+0.857
anmar+0.855
clud+0.820
rete+0.795
Neg logit contributions
DEF-0.708
235-0.691
226-0.681
entin-0.669
ELF-0.669
Token Cleveland
Token ID10306
Feature activation+2.94e-01

Pos logit contributions
DEF+0.825
235+0.810
226+0.798
entin+0.786
ELF+0.783
Neg logit contributions
luster-0.984
jriwal-0.894
anmar-0.890
clud-0.847
rete-0.823
Token.
Token ID13
Feature activation-3.11e-02

Pos logit contributions
DEF+0.709
235+0.694
226+0.684
entin+0.669
ELF+0.669
Neg logit contributions
luster-0.980
jriwal-0.898
anmar-0.892
clud-0.856
rete-0.830
Token By
Token ID2750
Feature activation-2.58e-01

Pos logit contributions
luster+0.099
jriwal+0.089
anmar+0.089
clud+0.085
rete+0.082
Neg logit contributions
DEF-0.080
235-0.079
226-0.078
entin-0.076
ELF-0.076
Token the
Token ID262
Feature activation+3.38e-01

Pos logit contributions
luster+0.801
jriwal+0.722
anmar+0.721
clud+0.693
rete+0.668
Neg logit contributions
DEF-0.681
235-0.666
226-0.656
entin-0.648
ELF-0.647
Token start
Token ID923
Feature activation+9.16e-01

Pos logit contributions
DEF+0.810
235+0.792
226+0.780
entin+0.766
ELF+0.765
Neg logit contributions
luster-1.134
jriwal-1.034
anmar-1.030
clud-0.989
rete-0.959
Token of
Token ID286
Feature activation+4.66e-01

Pos logit contributions
DEF+2.540
235+2.494
226+2.458
entin+2.415
309+2.409
Neg logit contributions
luster-2.685
jriwal-2.443
anmar-2.438
clud-2.315
rete-2.242
Token the
Token ID262
Feature activation+8.47e-03

Pos logit contributions
DEF+1.190
235+1.165
226+1.147
entin+1.128
ELF+1.121
Neg logit contributions
luster-1.488
jriwal-1.355
anmar-1.353
clud-1.290
rete-1.249
Token 1946
Token ID22717
Feature activation+2.59e-01

Pos logit contributions
DEF+0.020
235+0.019
226+0.019
ELF+0.019
entin+0.019
Neg logit contributions
luster-0.029
jriwal-0.026
anmar-0.026
clud-0.025
rete-0.024
Token season
Token ID1622
Feature activation+9.21e-02

Pos logit contributions
DEF+0.666
235+0.653
226+0.643
entin+0.631
ELF+0.630
Neg logit contributions
luster-0.828
jriwal-0.752
anmar-0.748
clud-0.714
rete-0.692
Token,
Token ID11
Feature activation+1.63e-02

Pos logit contributions
DEF+0.222
235+0.216
226+0.213
entin+0.209
ELF+0.209
Neg logit contributions
luster-0.308
jriwal-0.281
anmar-0.280
clud-0.269
rete-0.261
Token celebrations
Token ID22861
Feature activation-4.35e-01

Pos logit contributions
luster+0.978
jriwal+0.892
anmar+0.890
clud+0.847
rete+0.821
Neg logit contributions
DEF-0.788
235-0.770
226-0.758
ELF-0.754
entin-0.746
Token in
Token ID287
Feature activation-1.83e-01

Pos logit contributions
luster+1.401
jriwal+1.277
anmar+1.274
clud+1.215
rete+1.179
Neg logit contributions
DEF-1.094
235-1.066
226-1.049
ELF-1.040
entin-1.036
Token St
Token ID520
Feature activation-2.93e-01

Pos logit contributions
luster+0.632
anmar+0.579
jriwal+0.578
clud+0.558
rete+0.541
Neg logit contributions
DEF-0.415
235-0.404
226-0.399
ELF-0.394
entin-0.389
Tokenirling
Token ID24297
Feature activation-2.79e-01

Pos logit contributions
luster+0.986
anmar+0.894
jriwal+0.893
clud+0.869
rete+0.856
Neg logit contributions
DEF-0.693
235-0.671
226-0.663
ELF-0.656
309-0.647
Token.
Token ID13
Feature activation-3.39e-01

Pos logit contributions
luster+0.919
jriwal+0.837
anmar+0.834
clud+0.801
rete+0.778
Neg logit contributions
DEF-0.684
235-0.667
226-0.656
ELF-0.650
entin-0.646
Token Again
Token ID6521
Feature activation+1.03e+00

Pos logit contributions
luster+1.073
anmar+0.969
jriwal+0.969
clud+0.932
rete+0.902
Neg logit contributions
DEF-0.866
235-0.844
226-0.831
entin-0.825
ELF-0.823
Token the
Token ID262
Feature activation-1.24e-01

Pos logit contributions
DEF+2.441
235+2.395
226+2.355
ELF+2.312
entin+2.309
Neg logit contributions
luster-3.445
jriwal-3.160
anmar-3.140
clud-3.050
rete-2.921
Token emphasis
Token ID12476
Feature activation-2.36e-01

Pos logit contributions
luster+0.390
jriwal+0.354
anmar+0.353
clud+0.337
rete+0.327
Neg logit contributions
DEF-0.323
235-0.316
226-0.311
entin-0.305
ELF-0.305
Token is
Token ID318
Feature activation-3.17e-02

Pos logit contributions
luster+0.721
jriwal+0.651
anmar+0.648
clud+0.620
rete+0.599
Neg logit contributions
DEF-0.636
235-0.622
226-0.613
ELF-0.605
entin-0.605
Token not
Token ID407
Feature activation-9.77e-02

Pos logit contributions
luster+0.095
jriwal+0.086
anmar+0.085
clud+0.082
rete+0.079
Neg logit contributions
DEF-0.087
235-0.085
226-0.084
entin-0.083
ELF-0.083
Token on
Token ID319
Feature activation+4.71e-01

Pos logit contributions
luster+0.291
jriwal+0.262
anmar+0.261
clud+0.249
rete+0.240
Neg logit contributions
DEF-0.271
235-0.266
226-0.262
entin-0.259
ELF-0.258
Token DO
Token ID8410
Feature activation-2.25e-01

Pos logit contributions
DEF+0.144
235+0.141
entin+0.140
226+0.139
ELF+0.135
Neg logit contributions
luster-0.176
anmar-0.158
jriwal-0.158
clud-0.154
rete-0.149
Token,
Token ID11
Feature activation+3.61e-02

Pos logit contributions
luster+0.797
jriwal+0.724
anmar+0.722
clud+0.704
rete+0.687
Neg logit contributions
DEF-0.482
235-0.470
entin-0.466
226-0.463
ELF-0.447
Token NEVER
Token ID33602
Feature activation+2.60e-01

Pos logit contributions
DEF+0.078
235+0.076
226+0.075
entin+0.074
ELF+0.073
Neg logit contributions
luster-0.128
jriwal-0.117
anmar-0.117
clud-0.113
rete-0.110
Token GOT
Token ID41725
Feature activation-3.45e-01

Pos logit contributions
DEF+0.583
235+0.569
226+0.559
ELF+0.549
entin+0.548
Neg logit contributions
luster-0.910
jriwal-0.834
anmar-0.831
clud-0.799
rete-0.776
Token INTO
Token ID39319
Feature activation-5.00e-03

Pos logit contributions
luster+1.204
anmar+1.092
jriwal+1.089
clud+1.062
rete+1.032
Neg logit contributions
DEF-0.754
235-0.740
entin-0.730
226-0.726
309-0.705
Token THAT
Token ID14603
Feature activation+3.00e-01

Pos logit contributions
luster+0.018
jriwal+0.016
anmar+0.016
clud+0.015
rete+0.015
Neg logit contributions
DEF-0.011
235-0.011
226-0.011
entin-0.011
ELF-0.010
Token*
Token ID9
Feature activation-1.36e-01

Pos logit contributions
DEF+0.681
235+0.664
226+0.652
ELF+0.642
entin+0.638
Neg logit contributions
luster-1.049
jriwal-0.961
anmar-0.958
clud-0.919
rete-0.892
Token
Token ID198
Feature activation+9.34e-02

Pos logit contributions
luster+0.430
jriwal+0.389
anmar+0.387
clud+0.372
rete+0.360
Neg logit contributions
DEF-0.353
235-0.345
226-0.340
entin-0.337
ELF-0.334
Token
Token ID198
Feature activation+9.03e-02

Pos logit contributions
DEF+0.249
235+0.245
226+0.241
ELF+0.236
309+0.235
Neg logit contributions
luster-0.291
jriwal-0.264
anmar-0.263
clud-0.249
rete-0.240
TokenLet
Token ID5756
Feature activation+8.28e-02

Pos logit contributions
DEF+0.192
235+0.187
226+0.184
entin+0.181
ELF+0.180
Neg logit contributions
luster-0.327
jriwal-0.300
anmar-0.299
clud-0.288
rete-0.281
Token's
Token ID338
Feature activation+2.58e-01

Pos logit contributions
DEF+0.217
235+0.212
226+0.209
entin+0.206
ELF+0.206
Neg logit contributions
luster-0.260
jriwal-0.235
anmar-0.235
clud-0.224
rete-0.217
Token:
Token ID25
Feature activation-1.15e-01

Pos logit contributions
luster+0.879
jriwal+0.802
anmar+0.800
clud+0.767
rete+0.745
Neg logit contributions
DEF-0.644
235-0.628
226-0.617
entin-0.610
ELF-0.609
Token That
Token ID1320
Feature activation-1.44e-01

Pos logit contributions
luster+0.359
jriwal+0.325
anmar+0.324
clud+0.310
rete+0.300
Neg logit contributions
DEF-0.304
235-0.297
226-0.293
entin-0.288
ELF-0.288
Token homosexuality
Token ID19822
Feature activation-4.60e-01

Pos logit contributions
luster+0.453
jriwal+0.411
anmar+0.408
clud+0.392
rete+0.379
Neg logit contributions
DEF-0.374
235-0.367
226-0.361
entin-0.356
ELF-0.355
Token is
Token ID318
Feature activation-1.07e-01

Pos logit contributions
luster+1.368
jriwal+1.243
anmar+1.225
clud+1.178
rete+1.139
Neg logit contributions
DEF-1.254
235-1.229
226-1.211
entin-1.199
ELF-1.199
Token rep
Token ID1128
Feature activation-2.05e-01

Pos logit contributions
luster+0.329
jriwal+0.297
anmar+0.296
clud+0.284
rete+0.274
Neg logit contributions
DEF-0.285
235-0.279
226-0.275
entin-0.272
ELF-0.271
Tokenug
Token ID1018
Feature activation+3.57e-01

Pos logit contributions
luster+0.553
jriwal+0.492
anmar+0.489
clud+0.464
rete+0.444
Neg logit contributions
DEF-0.627
235-0.616
226-0.608
entin-0.601
ELF-0.599
Tokennant
Token ID22057
Feature activation-3.91e-01

Pos logit contributions
DEF+0.972
235+0.959
entin+0.938
226+0.936
309+0.919
Neg logit contributions
luster-1.067
anmar-0.989
jriwal-0.983
clud-0.917
galitarian-0.876
Token,
Token ID11
Feature activation+1.82e-03

Pos logit contributions
luster+1.188
jriwal+1.072
anmar+1.070
clud+1.021
rete+0.987
Neg logit contributions
DEF-1.056
235-1.034
226-1.019
entin-1.006
ELF-1.004
Token a
Token ID257
Feature activation-1.28e-01

Pos logit contributions
DEF+0.005
235+0.005
226+0.005
entin+0.005
ELF+0.005
Neg logit contributions
luster-0.006
jriwal-0.005
anmar-0.005
clud-0.005
rete-0.005
Token sin
Token ID7813
Feature activation-3.74e-01

Pos logit contributions
luster+0.398
jriwal+0.357
anmar+0.356
clud+0.340
rete+0.329
Neg logit contributions
DEF-0.342
235-0.335
226-0.330
ELF-0.324
entin-0.324
Token that
Token ID326
Feature activation-3.45e-01

Pos logit contributions
luster+1.223
jriwal+1.112
anmar+1.107
clud+1.065
rete+1.032
Neg logit contributions
DEF-0.927
235-0.905
226-0.890
entin-0.879
ELF-0.878
Token on
Token ID319
Feature activation+1.85e-01

Pos logit contributions
DEF+1.361
235+1.331
226+1.312
entin+1.291
ELF+1.290
Neg logit contributions
luster-1.657
jriwal-1.501
anmar-1.495
clud-1.432
rete-1.385
Token the
Token ID262
Feature activation+5.60e-02

Pos logit contributions
DEF+0.487
235+0.476
226+0.469
ELF+0.464
entin+0.462
Neg logit contributions
luster-0.576
jriwal-0.520
anmar-0.518
clud-0.496
rete-0.481
Token third
Token ID2368
Feature activation+7.27e-02

Pos logit contributions
DEF+0.144
235+0.141
226+0.139
ELF+0.138
entin+0.136
Neg logit contributions
luster-0.177
jriwal-0.160
anmar-0.160
clud-0.153
rete-0.149
Token line
Token ID1627
Feature activation-1.27e-01

Pos logit contributions
DEF+0.218
235+0.214
226+0.212
ELF+0.210
entin+0.209
Neg logit contributions
luster-0.199
jriwal-0.176
anmar-0.176
clud-0.168
rete-0.162
Token in
Token ID287
Feature activation+3.17e-01

Pos logit contributions
luster+0.409
jriwal+0.368
anmar+0.368
clud+0.353
rete+0.343
Neg logit contributions
DEF-0.323
235-0.314
226-0.310
ELF-0.308
entin-0.306
Token Chicago
Token ID4842
Feature activation+3.47e-01

Pos logit contributions
DEF+0.807
235+0.789
226+0.777
entin+0.765
ELF+0.763
Neg logit contributions
luster-1.017
jriwal-0.924
anmar-0.920
clud-0.881
rete-0.854
Token's
Token ID338
Feature activation-1.34e-01

Pos logit contributions
DEF+0.888
235+0.868
226+0.855
entin+0.842
ELF+0.840
Neg logit contributions
luster-1.109
jriwal-1.006
anmar-1.002
clud-0.961
rete-0.929
Token 3
Token ID513
Feature activation+3.65e-02

Pos logit contributions
luster+0.425
anmar+0.383
jriwal+0.383
clud+0.367
rete+0.355
Neg logit contributions
DEF-0.348
235-0.339
226-0.335
ELF-0.330
entin-0.330
Token-
Token ID12
Feature activation+4.92e-01

Pos logit contributions
DEF+0.090
235+0.088
226+0.087
entin+0.085
ELF+0.085
Neg logit contributions
luster-0.120
jriwal-0.109
anmar-0.109
clud-0.104
rete-0.101
Token2
Token ID17
Feature activation+8.15e-01

Pos logit contributions
DEF+1.092
235+1.080
226+1.058
309+1.026
entin+1.025
Neg logit contributions
luster-1.732
jriwal-1.607
anmar-1.592
clud-1.535
rete-1.480
Token loss
Token ID2994
Feature activation+6.38e-01

Pos logit contributions
DEF+2.102
235+2.052
226+2.015
entin+1.982
309+1.966
Neg logit contributions
luster-2.584
jriwal-2.370
anmar-2.349
clud-2.242
rete-2.176
Token for
Token ID329
Feature activation+5.57e-02

Pos logit contributions
DEF+1.022
235+1.001
226+0.984
entin+0.967
ELF+0.964
Neg logit contributions
luster-1.427
jriwal-1.295
anmar-1.292
clud-1.244
rete-1.206
Token the
Token ID262
Feature activation+1.49e-01

Pos logit contributions
DEF+0.153
235+0.150
226+0.148
ELF+0.146
entin+0.146
Neg logit contributions
luster-0.167
jriwal-0.151
anmar-0.150
clud-0.143
rete-0.138
Token S
Token ID311
Feature activation+9.97e-02

Pos logit contributions
DEF+0.401
235+0.393
226+0.388
ELF+0.382
entin+0.381
Neg logit contributions
luster-0.455
jriwal-0.411
anmar-0.409
clud-0.392
rete-0.378
TokenPA
Token ID4537
Feature activation+8.21e-02

Pos logit contributions
DEF+0.245
235+0.239
226+0.235
ELF+0.232
entin+0.231
Neg logit contributions
luster-0.329
jriwal-0.300
anmar-0.299
clud-0.288
rete-0.277
Token,
Token ID11
Feature activation+9.45e-03

Pos logit contributions
DEF+0.205
235+0.201
226+0.198
entin+0.194
ELF+0.194
Neg logit contributions
luster-0.267
jriwal-0.243
anmar-0.242
clud-0.232
rete-0.225
Token Koreans
Token ID25999
Feature activation+1.12e-01

Pos logit contributions
DEF+0.025
235+0.024
226+0.024
entin+0.024
ELF+0.024
Neg logit contributions
luster-0.029
jriwal-0.027
anmar-0.026
clud-0.025
rete-0.024
Token cast
Token ID3350
Feature activation-5.92e-01

Pos logit contributions
DEF+0.309
235+0.302
226+0.298
entin+0.295
ELF+0.294
Neg logit contributions
luster-0.337
jriwal-0.303
anmar-0.302
clud-0.289
rete-0.279
Token their
Token ID511
Feature activation-3.21e-03

Pos logit contributions
luster+1.732
jriwal+1.541
anmar+1.537
clud+1.464
rete+1.430
Neg logit contributions
DEF-1.685
235-1.625
226-1.613
entin-1.596
ELF-1.591
Token ballots
Token ID20103
Feature activation+1.62e-01

Pos logit contributions
luster+0.009
jriwal+0.008
anmar+0.008
clud+0.008
rete+0.008
Neg logit contributions
DEF-0.009
235-0.009
226-0.009
entin-0.009
ELF-0.009
Token again
Token ID757
Feature activation+6.01e-01

Pos logit contributions
DEF+0.375
235+0.366
226+0.360
entin+0.354
ELF+0.354
Neg logit contributions
luster-0.556
jriwal-0.508
anmar-0.506
clud-0.486
rete-0.472
Token.
Token ID13
Feature activation+5.54e-02

Pos logit contributions
DEF+1.368
235+1.342
226+1.319
entin+1.291
ELF+1.285
Neg logit contributions
luster-2.084
jriwal-1.908
anmar-1.899
clud-1.829
rete-1.774
Token to
Token ID284
Feature activation+5.91e-02

Pos logit contributions
luster+1.571
jriwal+1.398
anmar+1.389
clud+1.327
rete+1.269
Neg logit contributions
DEF-1.722
235-1.689
226-1.668
entin-1.650
ELF-1.645
Token ongoing
Token ID7044
Feature activation-1.64e-01

Pos logit contributions
DEF+0.167
235+0.163
226+0.161
entin+0.158
ELF+0.158
Neg logit contributions
luster-0.173
jriwal-0.156
anmar-0.155
clud-0.149
rete-0.143
Token friction
Token ID23822
Feature activation-4.87e-01

Pos logit contributions
luster+0.500
jriwal+0.453
anmar+0.451
clud+0.431
rete+0.416
Neg logit contributions
DEF-0.440
235-0.431
226-0.425
entin-0.418
ELF-0.418
Token between
Token ID1022
Feature activation+4.70e-01

Pos logit contributions
luster+1.643
jriwal+1.491
anmar+1.481
clud+1.427
rete+1.392
Neg logit contributions
DEF-1.165
235-1.125
226-1.106
entin-1.099
ELF-1.096
Token her
Token ID607
Feature activation+1.03e-01

Pos logit contributions
DEF+1.250
235+1.235
226+1.221
ELF+1.195
entin+1.190
Neg logit contributions
luster-1.442
jriwal-1.311
anmar-1.295
clud-1.242
rete-1.193
Token and
Token ID290
Feature activation-5.11e-03

Pos logit contributions
DEF+0.245
235+0.240
226+0.236
ELF+0.232
entin+0.232
Neg logit contributions
luster-0.346
jriwal-0.316
anmar-0.314
clud-0.301
rete-0.293
Token another
Token ID1194
Feature activation+4.03e-01

Pos logit contributions
luster+0.017
jriwal+0.016
anmar+0.016
clud+0.015
rete+0.015
Neg logit contributions
DEF-0.012
235-0.012
226-0.012
entin-0.011
ELF-0.011
Token employee
Token ID6538
Feature activation+3.51e-01

Pos logit contributions
DEF+1.138
235+1.121
226+1.114
ELF+1.088
entin+1.083
Neg logit contributions
luster-1.170
jriwal-1.062
anmar-1.053
clud-0.991
rete-0.958
Token,
Token ID11
Feature activation+1.55e-01

Pos logit contributions
DEF+0.825
235+0.810
226+0.798
entin+0.780
ELF+0.777
Neg logit contributions
luster-1.191
jriwal-1.088
anmar-1.084
clud-1.040
rete-1.009
Token Desire
Token ID41042
Feature activation+2.64e-02

Pos logit contributions
DEF+0.376
235+0.367
226+0.361
entin+0.355
ELF+0.355
Neg logit contributions
luster-0.516
jriwal-0.470
anmar-0.469
clud-0.450
rete-0.436
Tokene
Token ID68
Feature activation-9.73e-02

Pos logit contributions
DEF+0.060
235+0.059
226+0.058
entin+0.057
ELF+0.057
Neg logit contributions
luster-0.092
jriwal-0.084
anmar-0.083
clud-0.080
rete-0.078
Token.
Token ID13
Feature activation+5.69e-02

Pos logit contributions
luster+1.354
jriwal+1.229
anmar+1.226
clud+1.182
rete+1.146
Neg logit contributions
DEF-1.018
235-0.989
226-0.974
entin-0.968
ELF-0.961
Token171
Token ID27192
Feature activation-1.46e-01

Pos logit contributions
226+0.081
DEF+0.081
235+0.080
186+0.078
ELF+0.075
Neg logit contributions
luster-0.247
jriwal-0.233
anmar-0.232
clud-0.221
rete-0.217
Token 102
Token ID15143
Feature activation+1.06e-01

Pos logit contributions
luster+0.463
jriwal+0.423
anmar+0.419
clud+0.397
rete+0.386
Neg logit contributions
DEF-0.378
235-0.372
226-0.366
ELF-0.360
309-0.358
Token30
Token ID1270
Feature activation-5.00e-02

Pos logit contributions
DEF+0.217
235+0.209
entin+0.207
226+0.205
ELF+0.204
Neg logit contributions
luster-0.391
anmar-0.357
jriwal-0.357
clud-0.347
rete-0.338
Token86
Token ID4521
Feature activation+3.33e-01

Pos logit contributions
luster+0.189
jriwal+0.174
anmar+0.174
clud+0.168
rete+0.164
Neg logit contributions
DEF-0.097
235-0.094
entin-0.092
226-0.092
ELF-0.091
Token x
Token ID2124
Feature activation-5.33e-01

Pos logit contributions
235+0.717
DEF+0.716
226+0.703
309+0.682
186+0.681
Neg logit contributions
luster-1.206
jriwal-1.119
anmar-1.114
clud-1.056
galitarian-1.021
Token.
Token ID13
Feature activation-2.07e-01

Pos logit contributions
luster+1.664
jriwal+1.501
anmar+1.498
clud+1.460
rete+1.423
Neg logit contributions
DEF-1.357
235-1.308
entin-1.307
226-1.291
ELF-1.287
Tokenx
Token ID87
Feature activation-5.21e-01

Pos logit contributions
luster+0.722
jriwal+0.660
anmar+0.658
clud+0.636
rete+0.618
Neg logit contributions
DEF-0.468
235-0.455
226-0.447
entin-0.442
ELF-0.441
Token.
Token ID13
Feature activation-6.26e-02

Pos logit contributions
luster+1.645
jriwal+1.487
anmar+1.484
clud+1.442
rete+1.404
Neg logit contributions
DEF-1.319
235-1.272
entin-1.263
226-1.254
ELF-1.246
Tokenx
Token ID87
Feature activation-4.77e-01

Pos logit contributions
luster+0.223
jriwal+0.205
anmar+0.204
clud+0.196
rete+0.190
Neg logit contributions
DEF-0.138
235-0.135
226-0.132
ELF-0.129
entin-0.129
Token.
Token ID13
Feature activation+3.79e-02

Pos logit contributions
luster+1.545
jriwal+1.399
anmar+1.398
clud+1.349
rete+1.311
Neg logit contributions
DEF-1.186
235-1.150
226-1.132
entin-1.131
ELF-1.120
Token knowing
Token ID6970
Feature activation-2.33e-01

Pos logit contributions
luster+0.576
jriwal+0.518
anmar+0.515
clud+0.496
rete+0.477
Neg logit contributions
DEF-0.502
235-0.490
226-0.483
entin-0.478
ELF-0.476
Token you
Token ID345
Feature activation-3.28e-01

Pos logit contributions
luster+0.703
jriwal+0.633
anmar+0.629
clud+0.603
rete+0.582
Neg logit contributions
DEF-0.636
235-0.622
226-0.613
entin-0.604
ELF-0.604
Token were
Token ID547
Feature activation-2.43e-01

Pos logit contributions
luster+0.899
jriwal+0.802
anmar+0.800
clud+0.762
rete+0.734
Neg logit contributions
DEF-0.984
235-0.963
226-0.951
ELF-0.939
entin-0.938
Token --
Token ID1377
Feature activation+8.84e-02

Pos logit contributions
luster+0.740
jriwal+0.668
anmar+0.665
clud+0.637
rete+0.615
Neg logit contributions
DEF-0.657
235-0.643
226-0.634
entin-0.625
ELF-0.624
Token
Token ID198
Feature activation-5.76e-01

Pos logit contributions
DEF+0.238
235+0.233
226+0.230
entin+0.226
ELF+0.226
Neg logit contributions
luster-0.271
jriwal-0.245
anmar-0.244
clud-0.233
rete-0.225
Token
Token ID198
Feature activation-4.13e-01

Pos logit contributions
luster+1.652
jriwal+1.477
anmar+1.469
clud+1.424
rete+1.380
Neg logit contributions
DEF-1.625
235-1.581
entin-1.567
226-1.561
ELF-1.553
TokenRF
Token ID32754
Feature activation-1.34e-01

Pos logit contributions
luster+1.649
jriwal+1.520
anmar+1.509
clud+1.476
rete+1.448
Neg logit contributions
DEF-0.673
235-0.668
entin-0.660
226-0.649
ELF-0.627
Token:
Token ID25
Feature activation-3.43e-01

Pos logit contributions
luster+0.447
jriwal+0.407
anmar+0.406
clud+0.392
rete+0.382
Neg logit contributions
DEF-0.320
235-0.311
226-0.306
entin-0.305
ELF-0.303
Token You
Token ID921
Feature activation+1.58e-04

Pos logit contributions
luster+1.107
anmar+0.996
jriwal+0.992
clud+0.966
rete+0.935
Neg logit contributions
DEF-0.852
235-0.827
entin-0.817
226-0.814
ELF-0.803
Token know
Token ID760
Feature activation+1.13e-01

Pos logit contributions
DEF+0.000
235+0.000
226+0.000
entin+0.000
ELF+0.000
Neg logit contributions
luster-0.000
jriwal-0.000
anmar-0.000
clud-0.000
rete-0.000
Token,
Token ID11
Feature activation-8.44e-02

Pos logit contributions
DEF+0.278
235+0.271
226+0.267
entin+0.263
ELF+0.263
Neg logit contributions
luster-0.372
jriwal-0.338
anmar-0.337
clud-0.324
rete-0.314
Token it
Token ID340
Feature activation-1.77e-01

Pos logit contributions
DEF+0.069
235+0.068
226+0.067
entin+0.066
ELF+0.066
Neg logit contributions
luster-0.080
jriwal-0.072
anmar-0.072
clud-0.069
rete-0.067
Token is
Token ID318
Feature activation+1.66e-01

Pos logit contributions
luster+0.581
jriwal+0.529
anmar+0.526
clud+0.506
rete+0.490
Neg logit contributions
DEF-0.433
235-0.423
226-0.416
ELF-0.411
entin-0.410
Token quite
Token ID2407
Feature activation+3.29e-01

Pos logit contributions
DEF+0.440
235+0.431
226+0.425
entin+0.418
ELF+0.418
Neg logit contributions
luster-0.515
jriwal-0.466
anmar-0.464
clud-0.444
rete-0.429
Token easy
Token ID2562
Feature activation-2.66e-02

Pos logit contributions
DEF+0.848
235+0.830
226+0.818
entin+0.805
ELF+0.804
Neg logit contributions
luster-1.043
jriwal-0.945
anmar-0.942
clud-0.903
rete-0.873
Token to
Token ID284
Feature activation-5.49e-02

Pos logit contributions
luster+0.082
jriwal+0.074
anmar+0.074
clud+0.071
rete+0.069
Neg logit contributions
DEF-0.071
235-0.069
226-0.068
entin-0.067
ELF-0.067
Token sort
Token ID3297
Feature activation-1.25e-01

Pos logit contributions
luster+0.165
jriwal+0.148
anmar+0.148
clud+0.141
rete+0.135
Neg logit contributions
DEF-0.152
235-0.149
226-0.147
entin-0.145
ELF-0.144
Token those
Token ID883
Feature activation+4.85e-01

Pos logit contributions
luster+0.319
jriwal+0.282
anmar+0.280
clud+0.265
rete+0.254
Neg logit contributions
DEF-0.400
235-0.394
226-0.389
entin-0.384
ELF-0.383
Token issues
Token ID2428
Feature activation-1.60e-01

Pos logit contributions
DEF+1.218
235+1.196
226+1.178
entin+1.156
ELF+1.152
Neg logit contributions
luster-1.572
jriwal-1.427
anmar-1.420
clud-1.363
rete-1.314
Token.
Token ID13
Feature activation-4.30e-02

Pos logit contributions
luster+0.529
jriwal+0.484
anmar+0.481
clud+0.463
rete+0.452
Neg logit contributions
DEF-0.387
235-0.373
226-0.368
ELF-0.364
entin-0.364
Token
Token ID447
Feature activation-2.07e-01

Pos logit contributions
luster+0.132
jriwal+0.119
anmar+0.119
clud+0.114
rete+0.110
Neg logit contributions
DEF-0.114
235-0.112
226-0.110
entin-0.109
ELF-0.109
Token
Token ID251
Feature activation-1.86e-01

Pos logit contributions
luster+0.575
jriwal+0.514
anmar+0.511
clud+0.487
rete+0.468
Neg logit contributions
DEF-0.615
235-0.603
226-0.596
entin-0.587
ELF-0.587
Tokenomm
Token ID2002
Feature activation+5.03e-01

Pos logit contributions
DEF+1.943
235+1.928
226+1.905
309+1.867
entin+1.862
Neg logit contributions
luster-1.527
jriwal-1.378
anmar-1.376
clud-1.259
galitarian-1.226
Tokenand
Token ID392
Feature activation-2.69e-01

Pos logit contributions
DEF+1.337
235+1.300
226+1.280
entin+1.268
ELF+1.257
Neg logit contributions
luster-1.571
jriwal-1.419
anmar-1.404
clud-1.335
rete-1.298
Token|
Token ID91
Feature activation+8.68e-01

Pos logit contributions
luster+1.011
jriwal+0.927
anmar+0.926
clud+0.895
rete+0.872
Neg logit contributions
DEF-0.533
235-0.518
226-0.508
entin-0.499
ELF-0.497
Token.
Token ID13
Feature activation+6.48e-02

Pos logit contributions
DEF+1.945
235+1.894
226+1.865
ELF+1.833
entin+1.821
Neg logit contributions
luster-3.040
jriwal-2.790
anmar-2.789
clud-2.669
rete-2.586
Token 3
Token ID513
Feature activation-3.77e-02

Pos logit contributions
DEF+0.168
235+0.164
226+0.162
entin+0.160
ELF+0.159
Neg logit contributions
luster-0.205
jriwal-0.185
anmar-0.185
clud-0.177
rete-0.171
Token.
Token ID13
Feature activation-3.20e-01

Pos logit contributions
luster+0.126
jriwal+0.114
anmar+0.114
clud+0.110
rete+0.106
Neg logit contributions
DEF-0.091
235-0.089
226-0.088
entin-0.087
ELF-0.086
Token If
Token ID1002
Feature activation-2.02e-01

Pos logit contributions
luster+1.004
jriwal+0.909
anmar+0.908
clud+0.873
rete+0.843
Neg logit contributions
DEF-0.826
235-0.807
226-0.796
entin-0.786
ELF-0.783
Token you
Token ID345
Feature activation+9.34e-02

Pos logit contributions
luster+0.553
jriwal+0.493
anmar+0.492
clud+0.470
rete+0.452
Neg logit contributions
DEF-0.605
235-0.594
226-0.586
entin-0.579
ELF-0.577
Token start
Token ID923
Feature activation+1.05e-01

Pos logit contributions
DEF+0.251
235+0.246
226+0.243
entin+0.239
ELF+0.239
Neg logit contributions
luster-0.288
jriwal-0.259
anmar-0.258
clud-0.246
rete-0.237
Token editing
Token ID12857
Feature activation+1.27e-01

Pos logit contributions
DEF+0.265
235+0.259
226+0.255
entin+0.251
ELF+0.251
Neg logit contributions
luster-0.342
jriwal-0.310
anmar-0.309
clud-0.296
rete-0.287
Token a
Token ID257
Feature activation+1.95e-01

Pos logit contributions
DEF+0.346
235+0.338
226+0.334
entin+0.329
ELF+0.328
Neg logit contributions
luster-0.387
jriwal-0.349
anmar-0.348
clud-0.333
rete-0.322
Token wouldn
Token ID3636
Feature activation-1.11e-01

Pos logit contributions
DEF+0.347
235+0.339
226+0.334
entin+0.329
ELF+0.328
Neg logit contributions
luster-0.449
jriwal-0.408
anmar-0.406
clud-0.390
rete-0.377
Token
Token ID447
Feature activation+1.58e-01

Pos logit contributions
luster+0.387
jriwal+0.354
anmar+0.353
clud+0.339
rete+0.329
Neg logit contributions
DEF-0.252
235-0.246
226-0.242
entin-0.237
ELF-0.237
Token
Token ID247
Feature activation-1.63e-01

Pos logit contributions
DEF+0.455
235+0.455
226+0.453
309+0.437
186+0.437
Neg logit contributions
luster-0.457
jriwal-0.409
anmar-0.405
clud-0.388
rete-0.369
Tokent
Token ID83
Feature activation+7.07e-02

Pos logit contributions
luster+0.521
jriwal+0.471
anmar+0.470
clud+0.449
rete+0.432
Neg logit contributions
DEF-0.418
235-0.410
226-0.405
entin-0.397
ELF-0.396
Token survive
Token ID7866
Feature activation+9.05e-02

Pos logit contributions
DEF+0.188
235+0.185
226+0.183
entin+0.179
309+0.178
Neg logit contributions
luster-0.219
jriwal-0.198
anmar-0.198
clud-0.187
galitarian-0.181
Token the
Token ID262
Feature activation-2.11e-01

Pos logit contributions
DEF+0.235
235+0.230
226+0.227
entin+0.223
ELF+0.223
Neg logit contributions
luster-0.285
jriwal-0.259
anmar-0.258
clud-0.247
rete-0.239
Token mortgage
Token ID13682
Feature activation+4.95e-01

Pos logit contributions
luster+0.686
jriwal+0.621
anmar+0.621
clud+0.594
rete+0.575
Neg logit contributions
DEF-0.529
235-0.516
226-0.508
ELF-0.501
entin-0.501
Token meltdown
Token ID37824
Feature activation+1.71e-01

Pos logit contributions
DEF+1.069
235+1.044
226+1.026
ELF+1.002
entin+0.999
Neg logit contributions
luster-1.771
jriwal-1.629
anmar-1.623
clud-1.566
rete-1.521
Token,
Token ID11
Feature activation-6.38e-02

Pos logit contributions
DEF+0.411
235+0.404
226+0.397
entin+0.389
ELF+0.387
Neg logit contributions
luster-0.571
jriwal-0.522
anmar-0.520
clud-0.499
rete-0.484
Token saw
Token ID2497
Feature activation+2.37e-01

Pos logit contributions
luster+0.205
jriwal+0.186
anmar+0.185
clud+0.178
rete+0.172
Neg logit contributions
DEF-0.162
235-0.158
226-0.156
entin-0.153
ELF-0.153
Token Cohn
Token ID45005
Feature activation-2.05e-01

Pos logit contributions
DEF+0.598
235+0.589
226+0.579
entin+0.567
ELF+0.567
Neg logit contributions
luster-0.761
jriwal-0.693
anmar-0.690
clud-0.663
rete-0.640
Token my
Token ID616
Feature activation-3.63e-01

Pos logit contributions
luster+0.949
anmar+0.849
jriwal+0.848
clud+0.809
rete+0.787
Neg logit contributions
DEF-0.873
235-0.853
226-0.842
entin-0.831
ELF-0.830
Token sound
Token ID2128
Feature activation-2.46e-01

Pos logit contributions
luster+1.088
jriwal+0.980
anmar+0.976
clud+0.933
rete+0.900
Neg logit contributions
DEF-1.001
235-0.981
226-0.967
entin-0.953
ELF-0.953
Token system
Token ID1080
Feature activation-1.30e-03

Pos logit contributions
luster+0.725
jriwal+0.652
anmar+0.649
clud+0.619
rete+0.597
Neg logit contributions
DEF-0.692
235-0.679
226-0.670
entin-0.660
ELF-0.659
Token.
Token ID13
Feature activation-2.48e-01

Pos logit contributions
luster+0.004
jriwal+0.004
anmar+0.004
clud+0.004
rete+0.004
Neg logit contributions
DEF-0.003
235-0.003
226-0.003
ELF-0.003
entin-0.003
Token
Token ID198
Feature activation-1.14e+00

Pos logit contributions
luster+0.808
anmar+0.731
jriwal+0.731
clud+0.702
rete+0.680
Neg logit contributions
DEF-0.618
235-0.603
226-0.594
entin-0.587
ELF-0.586
Token
Token ID198
Feature activation-8.96e-01

Pos logit contributions
luster+2.922
jriwal+2.558
anmar+2.547
clud+2.538
rete+2.478
Neg logit contributions
DEF-3.394
entin-3.369
ELF-3.269
235-3.268
226-3.235
TokenRated
Token ID15322
Feature activation-6.62e-03

Pos logit contributions
luster+3.711
anmar+3.416
jriwal+3.401
clud+3.344
rete+3.289
Neg logit contributions
DEF-1.322
entin-1.307
235-1.270
ELF-1.254
226-1.252
Token 5
Token ID642
Feature activation-6.79e-01

Pos logit contributions
luster+0.019
jriwal+0.017
anmar+0.017
clud+0.017
rete+0.016
Neg logit contributions
DEF-0.019
235-0.018
226-0.018
entin-0.018
ELF-0.018
Token out
Token ID503
Feature activation-1.96e-01

Pos logit contributions
luster+2.226
jriwal+2.022
anmar+2.013
clud+1.961
rete+1.908
Neg logit contributions
DEF-1.647
235-1.585
226-1.564
entin-1.562
ELF-1.558
Token of
Token ID286
Feature activation-3.02e-01

Pos logit contributions
luster+0.557
jriwal+0.499
anmar+0.497
clud+0.473
rete+0.455
Neg logit contributions
DEF-0.573
235-0.562
226-0.555
entin-0.547
ELF-0.547
Token 5
Token ID642
Feature activation-6.77e-01

Pos logit contributions
luster+0.879
jriwal+0.789
anmar+0.786
clud+0.750
rete+0.723
Neg logit contributions
DEF-0.857
235-0.840
226-0.829
entin-0.817
ELF-0.816
Token Catalyst
Token ID48238
Feature activation-2.40e-01

Pos logit contributions
luster+2.262
jriwal+2.045
anmar+2.034
clud+1.950
rete+1.900
Neg logit contributions
DEF-1.888
235-1.848
226-1.824
entin-1.815
ELF-1.784
Token Control
Token ID6779
Feature activation-1.50e-01

Pos logit contributions
luster+0.757
jriwal+0.687
anmar+0.683
clud+0.657
rete+0.637
Neg logit contributions
DEF-0.620
235-0.605
226-0.597
entin-0.590
ELF-0.586
Token Center
Token ID3337
Feature activation+3.37e-01

Pos logit contributions
luster+0.494
jriwal+0.450
anmar+0.447
clud+0.429
rete+0.419
Neg logit contributions
DEF-0.370
235-0.360
226-0.355
entin-0.351
ELF-0.349
Token and
Token ID290
Feature activation-9.32e-03

Pos logit contributions
DEF+0.867
235+0.850
226+0.837
ELF+0.824
entin+0.822
Neg logit contributions
luster-1.073
jriwal-0.972
anmar-0.970
clud-0.929
rete-0.896
Token Over
Token ID3827
Feature activation-1.06e+00

Pos logit contributions
luster+0.029
jriwal+0.026
anmar+0.026
clud+0.025
rete+0.024
Neg logit contributions
DEF-0.025
235-0.024
226-0.024
ELF-0.023
entin-0.023
TokenCurrent
Token ID11297
Feature activation-1.25e+00

Pos logit contributions
luster+3.009
anmar+2.645
jriwal+2.613
clud+2.523
rete+2.434
Neg logit contributions
DEF-3.079
235-2.998
226-2.957
entin-2.944
ELF-2.938
Token Protection
Token ID9985
Feature activation-3.81e-02

Pos logit contributions
luster+3.926
jriwal+3.567
anmar+3.526
clud+3.408
rete+3.325
Neg logit contributions
DEF-3.198
235-3.120
226-3.075
entin-3.061
ELF-3.025
Token limits
Token ID7095
Feature activation-4.08e-01

Pos logit contributions
luster+0.123
jriwal+0.112
anmar+0.112
clud+0.107
rete+0.104
Neg logit contributions
DEF-0.096
235-0.094
226-0.092
entin-0.091
ELF-0.091
Token which
Token ID543
Feature activation-7.70e-02

Pos logit contributions
luster+1.311
jriwal+1.191
anmar+1.185
clud+1.136
rete+1.106
Neg logit contributions
DEF-1.033
235-1.003
226-0.989
entin-0.978
ELF-0.976
Token means
Token ID1724
Feature activation+6.19e-01

Pos logit contributions
luster+0.252
jriwal+0.229
anmar+0.228
clud+0.219
rete+0.212
Neg logit contributions
DEF-0.192
235-0.187
226-0.184
entin-0.181
ELF-0.181
Token access
Token ID1895
Feature activation-1.58e-01

Pos logit contributions
DEF+1.659
235+1.627
226+1.598
ELF+1.582
entin+1.567
Neg logit contributions
luster-1.900
jriwal-1.727
anmar-1.715
clud-1.654
galitarian-1.576
Token inspired
Token ID7867
Feature activation+2.39e-01

Pos logit contributions
luster+0.829
jriwal+0.748
anmar+0.747
clud+0.716
rete+0.694
Neg logit contributions
DEF-0.684
235-0.668
226-0.658
entin-0.652
ELF-0.651
Token by
Token ID416
Feature activation+2.55e-01

Pos logit contributions
DEF+0.637
235+0.627
226+0.617
entin+0.604
ELF+0.603
Neg logit contributions
luster-0.739
jriwal-0.669
anmar-0.665
clud-0.636
rete-0.614
Token the
Token ID262
Feature activation-7.44e-02

Pos logit contributions
DEF+0.668
235+0.656
226+0.647
entin+0.636
ELF+0.633
Neg logit contributions
luster-0.795
jriwal-0.722
anmar-0.716
clud-0.687
rete-0.664
Token Three
Token ID7683
Feature activation-5.50e-01

Pos logit contributions
luster+0.223
jriwal+0.201
anmar+0.200
clud+0.191
rete+0.185
Neg logit contributions
DEF-0.205
235-0.201
226-0.198
entin-0.195
ELF-0.195
Token Fl
Token ID1610
Feature activation+6.49e-02

Pos logit contributions
luster+1.744
anmar+1.574
jriwal+1.572
clud+1.523
rete+1.468
Neg logit contributions
DEF-1.422
235-1.374
226-1.354
ELF-1.340
entin-1.338
Tokenoy
Token ID726
Feature activation-9.68e-01

Pos logit contributions
DEF+0.151
235+0.147
226+0.145
ELF+0.142
entin+0.142
Neg logit contributions
luster-0.223
jriwal-0.203
anmar-0.203
clud-0.195
rete-0.190
Tokends
Token ID9310
Feature activation-1.96e-01

Pos logit contributions
luster+2.192
anmar+1.902
jriwal+1.879
clud+1.773
rete+1.738
Neg logit contributions
DEF-3.325
235-3.257
ELF-3.218
226-3.205
entin-3.197
Token-
Token ID12
Feature activation-3.07e-01

Pos logit contributions
luster+0.620
anmar+0.560
jriwal+0.559
clud+0.537
rete+0.520
Neg logit contributions
DEF-0.505
235-0.491
226-0.485
entin-0.479
ELF-0.478
TokenHalf
Token ID31305
Feature activation-1.30e-01

Pos logit contributions
luster+0.934
jriwal+0.838
anmar+0.838
clud+0.801
rete+0.774
Neg logit contributions
DEF-0.833
235-0.814
226-0.804
entin-0.795
ELF-0.792
Token Ac
Token ID4013
Feature activation+1.11e-01

Pos logit contributions
luster+0.410
jriwal+0.372
anmar+0.370
clud+0.355
rete+0.343
Neg logit contributions
DEF-0.339
235-0.332
226-0.327
entin-0.322
ELF-0.322
Tokenre
Token ID260
Feature activation+2.79e-01

Pos logit contributions
DEF+0.291
235+0.285
226+0.281
entin+0.277
ELF+0.276
Neg logit contributions
luster-0.346
jriwal-0.314
anmar-0.313
clud-0.299
rete-0.288
Tokena
Token ID64
Feature activation-8.84e-02

Pos logit contributions
luster+0.701
anmar+0.634
jriwal+0.630
clud+0.600
rete+0.582
Neg logit contributions
DEF-0.665
235-0.649
226-0.638
ELF-0.635
entin-0.630
Token should
Token ID815
Feature activation+1.80e-01

Pos logit contributions
luster+0.269
anmar+0.242
jriwal+0.242
clud+0.232
rete+0.224
Neg logit contributions
DEF-0.237
235-0.232
226-0.229
ELF-0.227
entin-0.226
Token have
Token ID423
Feature activation-1.23e-01

Pos logit contributions
DEF+0.438
235+0.429
226+0.422
entin+0.414
ELF+0.412
Neg logit contributions
luster-0.596
jriwal-0.544
anmar-0.541
clud-0.518
rete-0.502
Token been
Token ID587
Feature activation-1.42e-01

Pos logit contributions
luster+0.388
jriwal+0.350
anmar+0.350
clud+0.336
rete+0.325
Neg logit contributions
DEF-0.321
235-0.314
226-0.310
entin-0.305
ELF-0.305
Token sent
Token ID1908
Feature activation-1.53e-01

Pos logit contributions
luster+0.436
jriwal+0.393
anmar+0.392
clud+0.376
rete+0.363
Neg logit contributions
DEF-0.378
235-0.370
226-0.365
entin-0.360
ELF-0.359
Token off
Token ID572
Feature activation-9.15e-01

Pos logit contributions
luster+0.539
jriwal+0.493
anmar+0.491
clud+0.473
rete+0.460
Neg logit contributions
DEF-0.343
235-0.335
226-0.329
entin-0.323
ELF-0.323
Token for
Token ID329
Feature activation-4.75e-02

Pos logit contributions
luster+2.806
anmar+2.524
jriwal+2.511
clud+2.421
rete+2.364
Neg logit contributions
DEF-2.401
235-2.314
entin-2.291
ELF-2.289
226-2.284
Token a
Token ID257
Feature activation-6.37e-02

Pos logit contributions
luster+0.146
jriwal+0.132
anmar+0.131
clud+0.125
rete+0.121
Neg logit contributions
DEF-0.127
235-0.125
226-0.123
entin-0.121
ELF-0.121
Token heavy
Token ID4334
Feature activation+1.94e-01

Pos logit contributions
luster+0.205
jriwal+0.186
anmar+0.185
clud+0.177
rete+0.172
Neg logit contributions
DEF-0.162
235-0.158
226-0.156
ELF-0.153
entin-0.153
Token,
Token ID11
Feature activation+1.33e-01

Pos logit contributions
DEF+0.518
235+0.508
226+0.501
ELF+0.491
entin+0.490
Neg logit contributions
luster-0.600
jriwal-0.545
anmar-0.540
clud-0.517
rete-0.500
Token knee
Token ID10329
Feature activation+1.59e-01

Pos logit contributions
DEF+0.360
235+0.353
226+0.348
ELF+0.342
entin+0.342
Neg logit contributions
luster-0.403
jriwal-0.364
anmar-0.362
clud-0.346
rete-0.334
Tokens
Token ID82
Feature activation+8.80e-02

Pos logit contributions
DEF+0.189
235+0.185
226+0.183
entin+0.180
ELF+0.179
Neg logit contributions
luster-0.227
jriwal-0.205
anmar-0.204
clud-0.195
rete-0.189
Token commitment
Token ID7901
Feature activation-3.50e-01

Pos logit contributions
DEF+0.235
235+0.230
226+0.227
ELF+0.223
entin+0.223
Neg logit contributions
luster-0.272
jriwal-0.245
anmar-0.244
clud-0.234
rete-0.226
Token was
Token ID373
Feature activation+5.47e-02

Pos logit contributions
luster+1.032
jriwal+0.927
anmar+0.924
clud+0.881
rete+0.851
Neg logit contributions
DEF-0.981
235-0.962
226-0.949
entin-0.935
ELF-0.934
Token
Token ID564
Feature activation+1.17e-01

Pos logit contributions
DEF+0.149
235+0.146
226+0.144
entin+0.141
ELF+0.141
Neg logit contributions
luster-0.167
jriwal-0.150
anmar-0.150
clud-0.143
rete-0.138
Token
Token ID250
Feature activation-1.57e-01

Pos logit contributions
DEF+0.352
235+0.347
226+0.342
entin+0.337
ELF+0.336
Neg logit contributions
luster-0.322
jriwal-0.289
anmar-0.286
clud-0.272
rete-0.261
Tokenplain
Token ID25638
Feature activation-1.07e+00

Pos logit contributions
luster+0.465
jriwal+0.418
anmar+0.417
clud+0.398
rete+0.384
Neg logit contributions
DEF-0.436
235-0.428
226-0.422
entin-0.416
ELF-0.416
Tokenly
Token ID306
Feature activation-7.54e-01

Pos logit contributions
luster+3.328
jriwal+3.008
anmar+3.002
clud+2.899
rete+2.801
Neg logit contributions
DEF-2.763
235-2.711
226-2.658
ELF-2.628
entin-2.624
Token stated
Token ID5081
Feature activation+3.44e-02

Pos logit contributions
luster+2.230
jriwal+2.010
anmar+1.996
clud+1.924
rete+1.855
Neg logit contributions
DEF-2.078
235-2.043
226-2.009
entin-1.982
ELF-1.980
Token
Token ID447
Feature activation-1.81e-01

Pos logit contributions
DEF+0.082
235+0.080
226+0.079
entin+0.078
ELF+0.078
Neg logit contributions
luster-0.116
jriwal-0.105
anmar-0.105
clud-0.101
rete-0.098
Token
Token ID251
Feature activation+2.22e-02

Pos logit contributions
luster+0.475
jriwal+0.423
anmar+0.420
clud+0.398
rete+0.382
Neg logit contributions
DEF-0.565
235-0.557
226-0.550
entin-0.541
ELF-0.541
Token in
Token ID287
Feature activation-1.51e-01

Pos logit contributions
DEF+0.055
235+0.054
226+0.053
entin+0.052
ELF+0.052
Neg logit contributions
luster-0.073
jriwal-0.066
anmar-0.066
clud-0.063
rete-0.061