TOP ACTIVATIONS
MAX = 1.498

3 . In a large bowl , combine the broccoli
At the first annual Hogan Award at Vermont College
: Watch this : Top 5 best features in the new
s 12 th largest arms exp orter quite an achievement
his smooth ies in a regular blender . So those of
borrowing money to keep under the $ 16 . 7 trillion
under Israel s control . Some
to become the first freshman in SEC history to gain 300
playing : Watch this : Top 5 best features in the
, en ra ison des inc end ies sur # B
he commanded the troops " to enc irc le and destroy
change like a European carbon bank or tighter rules
Kn ob which appears rec essed into a hill
, Christianity would later on rec u perate the pagan myths
vote on at Yahoo 's annual meeting . The move would
? Email us at tips @ the - sun . co
October 17 to operate under the US debt cap . Lew
and , according to several European sources , the Netherlands and
Of corner backs in my top 10 , Peterson faced the
he eagerly listened to the exp iring gro ans of his

MIN ACTIVATIONS
MIN = -1.808

artist , graduating with a B FA in Musical Theater .
needed . Hare wood has excellent size at 6 - foot
, opinions about his off - ice benefit for the team
are going to stay on their lifes aving medications . He
More research in ul cer prevention and treatment in
are going to be on your feet for hours every day
Own ed < T > { ... } The
to the Federal Trade Commission ( F TC ) by using
L add , who had 139 goals and 166 assists for
. We all turn a blind eye to the ways that
s caught the White House off guard here , they were
Hundreds of people stood on their toes and perched children on
quite quickly but to have that extra racing skill is a
and a total retreat of Hel heim Glacier by 1 .
the building , only on the other side , so it
have the opportunity to pursue their dream un apolog etically ,
core . Punk became " hard core " and more masculine
that experts , advocates and aud itors have all suggested .
t fresh Wil Chung Bl ocked Un block
hews , Brazil N uts , Pep itas ( h ull

INTERVAL 0.671 - 1.498
CONTAINS 0.772%

10 : 47 ] < k asse mm el > lol
)} Rd 8 { ( 7 17 s )} 14 .
image , c v 2 . COLOR _ B GR 2
:// www . cd c . gov / in jury /
More entertainment stories on Yahoo ! Bob bi

INTERVAL -0.155 - 0.671
CONTAINS 71.819%

would bet on . Hopefully the Australians will continue their improvements
gender identity does not match their body 's biological sex ,
all four it was a nickel ." Re es - Jones
another probable an agram that I can t do
Wednesday . The veteran safety called the entire defense into a

INTERVAL -0.982 - -0.155
CONTAINS 27.353%

had never heard of the book or author .
sound , with competition . That
Europe in the world s worst nuclear disaster .
. They will have five days to do so after the
and status Indians are largely a federal responsibility . Access ing

INTERVAL -1.808 - -0.982
CONTAINS 0.055%

Brad en Holt by lacks the stamina to carry his team
Lake Ur mia basin - East Azerbaijan , West Azerbaijan ,
in a 1999 Le M onde interview , as
fed state . Just like anything else , this
, based in Canada , from 3 i Group Pl c
Token
Token ID198
Feature activation+1.58e-02

Pos logit contributions
ixel+1.678
eu+1.574
theless+1.562
asta+1.513
DX+1.504
Neg logit contributions
acity-2.244
 Tup-1.810
-1.772
icate-1.708
ighth-1.659
Token3
Token ID18
Feature activation-1.28e-01

Pos logit contributions
ixel+0.195
theless+0.187
eu+0.180
asta+0.178
DX+0.175
Neg logit contributions
acity-0.173
 Tup-0.132
-0.123
icate-0.122
icity-0.117
Token.
Token ID13
Feature activation-1.74e-01

Pos logit contributions
acity+1.448
 Tup+1.110
icate+1.042
+1.028
icity+1.000
Neg logit contributions
ixel-1.539
theless-1.496
eu-1.391
asta-1.386
DX-1.351
Token In
Token ID554
Feature activation+3.04e-01

Pos logit contributions
acity+2.144
 Tup+1.728
icate+1.603
+1.600
icity+1.549
Neg logit contributions
ixel-1.884
theless-1.812
eu-1.702
asta-1.680
DX-1.637
Token a
Token ID257
Feature activation+1.04e+00

Pos logit contributions
ixel+3.097
theless+2.866
eu+2.840
asta+2.797
DX+2.713
Neg logit contributions
acity-3.821
 Tup-3.079
-2.928
icate-2.828
icity-2.787
Token large
Token ID1588
Feature activation+1.50e+00

Pos logit contributions
theless+8.375
ixel+8.344
 rou+8.320
 bottleneck+8.163
DX+7.943
Neg logit contributions
acity-12.472
-9.902
 Tup-9.594
ighth-9.301
icate-9.246
Token bowl
Token ID9396
Feature activation+3.57e-01

Pos logit contributions
 rou+9.713
 accord+9.063
 bottleneck+9.013
theless+8.725
 Patel+8.525
Neg logit contributions
acity-17.618
-15.956
ighth-14.983
Effects-14.371
LAB-14.185
Token,
Token ID11
Feature activation+2.90e-01

Pos logit contributions
ixel+3.367
theless+3.126
asta+3.056
eu+3.044
DX+2.906
Neg logit contributions
acity-4.759
 Tup-3.814
-3.674
ighth-3.581
icate-3.558
Token combine
Token ID12082
Feature activation+3.62e-01

Pos logit contributions
ixel+2.301
theless+2.095
eu+2.056
asta+2.030
DX+1.955
Neg logit contributions
acity-4.333
 Tup-3.587
-3.462
ighth-3.372
icate-3.371
Token the
Token ID262
Feature activation+3.93e-02

Pos logit contributions
ixel+3.619
theless+3.442
eu+3.302
asta+3.283
 Patel+3.275
Neg logit contributions
acity-4.605
 Tup-3.601
-3.570
icate-3.436
ighth-3.371
Token broccoli
Token ID44653
Feature activation-3.01e-01

Pos logit contributions
ixel+0.421
theless+0.402
eu+0.384
asta+0.378
DX+0.371
Neg logit contributions
acity-0.492
 Tup-0.388
-0.370
icate-0.366
icity-0.353
Token
Token ID198
Feature activation+9.49e-02

Pos logit contributions
ixel+2.124
theless+2.033
eu+1.969
asta+1.911
 Phill+1.879
Neg logit contributions
acity-2.751
 Tup-2.179
-2.075
icate-2.051
icity-2.002
Token
Token ID198
Feature activation+2.95e-01

Pos logit contributions
ixel+0.971
theless+0.914
eu+0.900
asta+0.873
DX+0.862
Neg logit contributions
acity-1.218
 Tup-0.974
-0.941
icate-0.919
icity-0.885
TokenAt
Token ID2953
Feature activation+3.71e-01

Pos logit contributions
ixel+2.888
eu+2.723
theless+2.690
DX+2.654
asta+2.602
Neg logit contributions
acity-3.858
 Tup-3.104
icate-2.952
-2.937
icity-2.824
Token the
Token ID262
Feature activation+4.23e-01

Pos logit contributions
ixel+3.808
theless+3.542
eu+3.464
asta+3.420
 Phill+3.360
Neg logit contributions
acity-4.640
 Tup-3.636
-3.546
icate-3.495
ighth-3.336
Token first
Token ID717
Feature activation+6.02e-01

Pos logit contributions
ixel+4.347
theless+4.069
eu+3.959
 Patel+3.912
asta+3.902
Neg logit contributions
acity-5.251
-4.020
 Tup-4.018
icate-3.873
ighth-3.754
Token annual
Token ID5079
Feature activation+1.48e+00

Pos logit contributions
ixel+6.192
theless+5.711
 Patel+5.683
asta+5.656
 Phill+5.580
Neg logit contributions
acity-7.123
-5.514
 Tup-5.428
icate-5.167
ighth-5.073
Token Hogan
Token ID25316
Feature activation+7.68e-01

Pos logit contributions
 Patel+12.425
ixel+12.237
 Phill+12.149
 Junction+12.022
theless+11.849
Neg logit contributions
acity-15.191
-11.712
icate-11.357
ighth-11.072
igon-10.602
Token Award
Token ID11289
Feature activation+4.09e-01

Pos logit contributions
ixel+7.024
eu+6.706
 Patel+6.494
theless+6.484
asta+6.463
Neg logit contributions
acity-9.521
 Tup-7.501
-7.220
icate-7.164
ighth-6.900
Token at
Token ID379
Feature activation+7.82e-01

Pos logit contributions
ixel+4.156
theless+3.928
eu+3.809
 Patel+3.760
asta+3.756
Neg logit contributions
acity-5.131
 Tup-3.927
-3.813
icate-3.805
ighth-3.644
Token Vermont
Token ID16033
Feature activation+1.35e-01

Pos logit contributions
ixel+7.492
theless+7.115
 Phill+7.093
 Patel+7.077
eu+6.993
Neg logit contributions
acity-9.258
-6.992
 Tup-6.940
icate-6.906
ighth-6.677
Token College
Token ID5535
Feature activation+4.20e-01

Pos logit contributions
ixel+1.439
theless+1.364
eu+1.320
asta+1.287
DX+1.280
Neg logit contributions
acity-1.685
 Tup-1.334
-1.269
icate-1.258
icity-1.209
Token:
Token ID25
Feature activation+2.86e-01

Pos logit contributions
ixel+6.022
DX+5.204
asta+5.118
eu+5.081
 Graveyard+5.063
Neg logit contributions
acity-11.805
 Tup-9.798
-9.734
icate-9.487
icity-9.413
Token Watch
Token ID6305
Feature activation+5.17e-01

Pos logit contributions
ixel+3.040
theless+2.798
eu+2.715
asta+2.670
DX+2.610
Neg logit contributions
acity-3.541
 Tup-2.787
-2.733
icate-2.681
igon-2.600
Token this
Token ID428
Feature activation+8.31e-01

Pos logit contributions
ixel+4.715
eu+4.225
theless+4.167
DX+4.068
asta+4.063
Neg logit contributions
acity-6.806
icate-5.580
-5.560
 Tup-5.559
icity-5.460
Token:
Token ID25
Feature activation+5.62e-01

Pos logit contributions
ixel+5.788
asta+5.162
theless+4.818
DX+4.766
eu+4.736
Neg logit contributions
acity-11.994
-10.525
icate-10.045
icity-9.892
 Tup-9.809
Token Top
Token ID5849
Feature activation+1.30e+00

Pos logit contributions
ixel+5.455
eu+5.210
theless+5.105
 Patel+5.040
 Phill+4.996
Neg logit contributions
acity-6.999
-5.568
icate-5.255
 Tup-5.229
icity-5.180
Token 5
Token ID642
Feature activation+1.42e+00

Pos logit contributions
ixel+11.414
asta+10.310
eu+10.161
DX+10.149
 exec+10.147
Neg logit contributions
acity-14.254
-12.162
icate-10.993
ilateral-10.788
 Tup-10.764
Token best
Token ID1266
Feature activation+8.95e-01

Pos logit contributions
ixel+12.019
DX+11.406
 bottleneck+11.358
eu+11.152
 nick+11.144
Neg logit contributions
acity-14.452
-12.515
icate-11.126
roth-11.048
icity-11.042
Token features
Token ID3033
Feature activation+7.94e-01

Pos logit contributions
ixel+8.754
DX+7.883
eu+7.839
asta+7.747
theless+7.712
Neg logit contributions
acity-10.215
-8.373
 Tup-7.992
icate-7.839
ilateral-7.731
Token in
Token ID287
Feature activation+1.02e+00

Pos logit contributions
ixel+7.018
theless+6.411
eu+6.244
asta+6.203
 Phill+6.172
Neg logit contributions
acity-10.008
-8.440
 Tup-7.845
icate-7.833
icity-7.654
Token the
Token ID262
Feature activation+1.06e+00

Pos logit contributions
ixel+9.092
eu+8.161
DX+8.056
 Patel+8.045
theless+8.006
Neg logit contributions
acity-12.000
-10.433
icity-9.279
icate-9.267
 Tup-9.234
Token new
Token ID649
Feature activation+4.76e-01

Pos logit contributions
ixel+9.723
eu+9.100
 Patel+8.718
DX+8.663
 Graveyard+8.650
Neg logit contributions
acity-12.034
-10.305
icate-9.214
 Tup-9.090
icity-9.040
Tokens
Token ID82
Feature activation-3.39e-01

Pos logit contributions
ixel+0.177
theless+0.169
eu+0.162
asta+0.159
DX+0.157
Neg logit contributions
acity-0.193
 Tup-0.151
-0.143
icate-0.142
icity-0.137
Token 12
Token ID1105
Feature activation-1.55e-01

Pos logit contributions
acity+3.829
 Tup+2.982
icate+2.702
+2.688
ighth+2.647
Neg logit contributions
ixel-4.018
theless-3.850
eu-3.633
asta-3.551
DX-3.520
Tokenth
Token ID400
Feature activation-2.53e-01

Pos logit contributions
acity+1.395
 Tup+0.983
+0.885
icate+0.882
ighth+0.839
Neg logit contributions
ixel-2.226
theless-2.156
eu-2.058
asta-2.031
DX-2.007
Token largest
Token ID4387
Feature activation+6.26e-02

Pos logit contributions
acity+2.859
 Tup+2.216
+2.036
icate+2.036
ighth+1.970
Neg logit contributions
ixel-3.040
theless-2.921
eu-2.753
asta-2.680
DX-2.665
Token arms
Token ID5101
Feature activation+2.85e-01

Pos logit contributions
ixel+0.691
theless+0.659
eu+0.634
asta+0.626
DX+0.615
Neg logit contributions
acity-0.760
 Tup-0.593
-0.568
icate-0.560
icity-0.538
Token exp
Token ID1033
Feature activation+1.39e+00

Pos logit contributions
ixel+2.417
eu+2.245
theless+2.222
asta+2.219
DX+2.164
Neg logit contributions
acity-4.056
 Tup-3.320
-3.304
icate-3.136
igon-3.028
Tokenorter
Token ID4337
Feature activation+1.88e-01

Pos logit contributions
eu+0.655
ixel+0.431
asta+0.255
orp+0.211
orter+0.189
Neg logit contributions
-25.144
acity-24.703
Detailed-22.829
 Tup-22.343
ICE-22.301
Token
Token ID851
Feature activation-1.10e-01

Pos logit contributions
ixel+1.982
theless+1.874
eu+1.812
asta+1.782
DX+1.749
Neg logit contributions
acity-2.362
 Tup-1.872
-1.803
icate-1.762
icity-1.690
Token quite
Token ID2407
Feature activation+9.40e-02

Pos logit contributions
acity+1.338
 Tup+1.059
+0.990
icate+0.988
icity+0.953
Neg logit contributions
ixel-1.210
theless-1.153
eu-1.100
asta-1.079
DX-1.067
Token an
Token ID281
Feature activation+5.98e-01

Pos logit contributions
ixel+1.006
theless+0.964
eu+0.928
asta+0.901
DX+0.888
Neg logit contributions
acity-1.172
 Tup-0.922
-0.886
icate-0.873
icity-0.837
Token achievement
Token ID13293
Feature activation+3.40e-01

Pos logit contributions
ixel+5.590
eu+5.425
asta+5.241
theless+5.125
 Patel+5.007
Neg logit contributions
acity-7.403
-6.098
 Tup-6.011
icate-5.711
dylib-5.673
Token his
Token ID465
Feature activation+3.36e-01

Pos logit contributions
ixel+2.825
theless+2.711
eu+2.615
asta+2.550
DX+2.490
Neg logit contributions
acity-3.363
 Tup-2.651
-2.561
icate-2.507
ighth-2.434
Token smooth
Token ID7209
Feature activation-3.32e-01

Pos logit contributions
ixel+3.664
theless+3.565
eu+3.416
asta+3.322
DX+3.308
Neg logit contributions
acity-3.992
 Tup-3.090
-2.990
icate-2.941
ighth-2.787
Tokenies
Token ID444
Feature activation-3.76e-01

Pos logit contributions
acity+3.574
 Tup+2.642
icate+2.543
+2.524
icity+2.464
Neg logit contributions
ixel-4.084
theless-3.952
eu-3.697
asta-3.679
DX-3.646
Token in
Token ID287
Feature activation+4.51e-01

Pos logit contributions
acity+4.313
 Tup+3.363
icate+3.131
+3.059
icity+3.011
Neg logit contributions
ixel-4.298
theless-4.085
eu-3.925
asta-3.867
DX-3.848
Token a
Token ID257
Feature activation+1.06e+00

Pos logit contributions
ixel+4.549
theless+4.254
eu+4.160
DX+4.039
 Patel+4.030
Neg logit contributions
acity-5.572
 Tup-4.374
-4.317
icate-4.262
icity-4.069
Token regular
Token ID3218
Feature activation+1.38e+00

Pos logit contributions
theless+9.271
ixel+9.264
 bottleneck+9.057
 rou+8.969
eu+8.798
Neg logit contributions
acity-11.942
-9.493
icate-8.990
 Tup-8.972
Effects-8.905
Token blender
Token ID40047
Feature activation+5.33e-01

Pos logit contributions
ixel+10.708
 rou+10.619
theless+10.520
 accord+10.421
 Patel+10.411
Neg logit contributions
acity-14.977
-12.202
Detailed-11.878
icate-11.689
 Tup-11.542
Token.
Token ID13
Feature activation-5.40e-02

Pos logit contributions
ixel+5.074
theless+4.693
eu+4.566
asta+4.533
DX+4.433
Neg logit contributions
acity-6.767
 Tup-5.457
-5.295
icate-5.068
ighth-5.063
Token So
Token ID1406
Feature activation-2.17e-01

Pos logit contributions
acity+0.694
 Tup+0.555
+0.523
icate+0.521
icity+0.503
Neg logit contributions
ixel-0.563
theless-0.535
eu-0.508
asta-0.502
DX-0.491
Token those
Token ID883
Feature activation-2.32e-01

Pos logit contributions
acity+2.628
 Tup+2.070
+1.945
icate+1.943
icity+1.865
Neg logit contributions
ixel-2.384
theless-2.288
eu-2.177
asta-2.127
DX-2.113
Token of
Token ID286
Feature activation-4.31e-01

Pos logit contributions
acity+2.741
 Tup+2.151
icate+2.015
+2.004
icity+1.936
Neg logit contributions
ixel-2.630
theless-2.486
eu-2.393
asta-2.367
DX-2.337
Token borrowing
Token ID23669
Feature activation+1.84e-01

Pos logit contributions
acity+1.970
 Tup+1.546
icate+1.436
+1.418
icity+1.376
Neg logit contributions
ixel-1.936
theless-1.847
eu-1.767
asta-1.724
DX-1.724
Token money
Token ID1637
Feature activation+8.73e-03

Pos logit contributions
ixel+1.924
theless+1.848
eu+1.786
asta+1.757
DX+1.723
Neg logit contributions
acity-2.294
 Tup-1.808
-1.770
icate-1.710
icity-1.655
Token to
Token ID284
Feature activation-3.49e-01

Pos logit contributions
ixel+0.097
theless+0.093
eu+0.089
asta+0.088
DX+0.086
Neg logit contributions
acity-0.106
 Tup-0.083
-0.078
icate-0.078
icity-0.075
Token keep
Token ID1394
Feature activation+2.39e-01

Pos logit contributions
acity+4.148
 Tup+3.336
icate+3.149
+3.041
ighth+2.951
Neg logit contributions
ixel-3.804
theless-3.731
eu-3.521
asta-3.422
DX-3.386
Token under
Token ID739
Feature activation+3.96e-01

Pos logit contributions
ixel+2.409
theless+2.296
eu+2.220
asta+2.192
 Patel+2.145
Neg logit contributions
acity-3.040
 Tup-2.374
-2.340
icate-2.283
icity-2.207
Token the
Token ID262
Feature activation+1.36e+00

Pos logit contributions
ixel+3.930
theless+3.708
eu+3.647
asta+3.536
 Patel+3.489
Neg logit contributions
acity-4.996
 Tup-4.019
-3.818
icate-3.742
icity-3.624
Token $
Token ID720
Feature activation+2.27e-01

Pos logit contributions
 bottleneck+10.217
 veto+10.110
 accord+9.976
 rou+9.834
 Patel+9.649
Neg logit contributions
acity-15.713
-13.176
 Tup-12.003
icate-11.669
ighth-11.477
Token16
Token ID1433
Feature activation+5.21e-01

Pos logit contributions
ixel+2.454
theless+2.320
eu+2.317
asta+2.228
DX+2.215
Neg logit contributions
acity-2.768
 Tup-2.201
-2.093
icate-2.025
icity-1.967
Token.
Token ID13
Feature activation+3.39e-01

Pos logit contributions
ixel+4.621
eu+4.404
theless+4.351
asta+4.305
DX+4.180
Neg logit contributions
acity-6.809
 Tup-5.581
-5.514
icate-5.268
icity-5.102
Token7
Token ID22
Feature activation+5.94e-01

Pos logit contributions
ixel+3.609
theless+3.447
eu+3.380
asta+3.286
DX+3.210
Neg logit contributions
acity-4.092
 Tup-3.222
-3.081
icate-3.050
icity-2.968
Token trillion
Token ID12989
Feature activation+6.04e-01

Pos logit contributions
ixel+4.353
eu+4.254
theless+4.233
asta+4.051
DX+3.953
Neg logit contributions
acity-8.527
 Tup-6.929
-6.853
icate-6.806
icity-6.603
Token
Token ID250
Feature activation+3.24e-01

Pos logit contributions
ixel+0.861
theless+0.818
eu+0.791
asta+0.775
DX+0.765
Neg logit contributions
acity-0.839
 Tup-0.649
-0.610
icate-0.606
icity-0.584
Tokenunder
Token ID4625
Feature activation+5.03e-01

Pos logit contributions
ixel+3.779
theless+3.668
eu+3.590
asta+3.470
DX+3.461
Neg logit contributions
acity-3.629
 Tup-2.810
-2.609
icate-2.601
icity-2.429
Token Israel
Token ID2692
Feature activation+4.67e-01

Pos logit contributions
ixel+5.115
theless+4.845
eu+4.714
asta+4.615
DX+4.526
Neg logit contributions
acity-6.312
 Tup-5.001
icate-4.636
-4.571
icity-4.518
Token
Token ID447
Feature activation+2.08e-01

Pos logit contributions
ixel+4.279
theless+4.055
eu+3.929
asta+3.896
 Phill+3.748
Neg logit contributions
acity-6.237
 Tup-5.031
-4.695
icate-4.657
ighth-4.598
Token
Token ID247
Feature activation+2.90e-01

Pos logit contributions
ixel+2.253
theless+2.090
eu+2.038
asta+1.981
DX+1.961
Neg logit contributions
acity-2.594
 Tup-2.068
-1.965
icate-1.937
icity-1.895
Tokens
Token ID82
Feature activation+1.33e+00

Pos logit contributions
ixel+2.833
theless+2.746
eu+2.632
asta+2.569
DX+2.493
Neg logit contributions
acity-3.802
 Tup-3.079
-2.900
icate-2.854
ighth-2.765
Token control
Token ID1630
Feature activation+5.56e-01

Pos logit contributions
ixel+10.080
 veto+9.910
 Patel+9.540
 accord+9.463
asta+9.181
Neg logit contributions
acity-15.737
 Tup-12.234
icate-12.148
igon-12.099
ighth-11.927
Token.
Token ID13
Feature activation+2.33e-01

Pos logit contributions
ixel+5.190
theless+4.868
eu+4.790
asta+4.777
 Patel+4.510
Neg logit contributions
acity-7.159
 Tup-5.806
-5.601
icate-5.412
ighth-5.281
Token
Token ID447
Feature activation-6.98e-02

Pos logit contributions
ixel+2.260
theless+2.150
eu+2.118
 Patel+2.044
 Phill+2.021
Neg logit contributions
acity-3.082
 Tup-2.437
-2.380
icate-2.355
icity-2.260
Token
Token ID251
Feature activation+3.12e-01

Pos logit contributions
acity+0.773
 Tup+0.588
+0.553
icate+0.548
icity+0.523
Neg logit contributions
ixel-0.848
theless-0.815
eu-0.781
asta-0.771
DX-0.760
Token Some
Token ID2773
Feature activation+1.91e-01

Pos logit contributions
ixel+3.002
theless+2.871
eu+2.827
 Patel+2.698
 Phill+2.682
Neg logit contributions
acity-4.080
 Tup-3.261
-3.122
icate-3.119
igon-3.000
Token to
Token ID284
Feature activation-3.27e-01

Pos logit contributions
acity+0.642
 Tup+0.503
+0.473
icate+0.471
icity+0.454
Neg logit contributions
ixel-0.595
theless-0.567
eu-0.541
asta-0.534
DX-0.525
Token become
Token ID1716
Feature activation+1.84e-01

Pos logit contributions
acity+4.005
 Tup+3.207
icate+3.029
+2.985
icity+2.858
Neg logit contributions
ixel-3.519
theless-3.384
eu-3.137
asta-3.126
DX-3.095
Token the
Token ID262
Feature activation+2.53e-01

Pos logit contributions
ixel+1.950
theless+1.848
eu+1.769
asta+1.738
DX+1.723
Neg logit contributions
acity-2.310
 Tup-1.814
-1.734
icate-1.726
icity-1.651
Token first
Token ID717
Feature activation+5.04e-01

Pos logit contributions
ixel+2.727
theless+2.606
eu+2.491
asta+2.461
 Phill+2.434
Neg logit contributions
acity-3.087
 Tup-2.410
-2.362
icate-2.315
icity-2.219
Token freshman
Token ID18621
Feature activation+5.18e-01

Pos logit contributions
ixel+4.898
theless+4.572
 Phill+4.412
asta+4.374
eu+4.351
Neg logit contributions
acity-6.389
-5.040
 Tup-4.985
icate-4.889
igon-4.738
Token in
Token ID287
Feature activation+1.31e+00

Pos logit contributions
ixel+4.710
theless+4.487
asta+4.461
eu+4.386
 Phill+4.256
Neg logit contributions
acity-6.676
-5.283
 Tup-5.282
icate-5.205
icity-4.989
Token SEC
Token ID10729
Feature activation+6.42e-01

Pos logit contributions
ixel+10.887
 Phill+9.634
eu+9.607
 nick+9.501
 Blossom+9.418
Neg logit contributions
acity-14.493
-12.105
icity-11.722
icate-11.615
 Tup-11.535
Token history
Token ID2106
Feature activation+1.75e-01

Pos logit contributions
ixel+7.579
eu+7.192
asta+6.920
DX+6.882
theless+6.876
Neg logit contributions
acity-6.514
 Tup-4.955
-4.817
icate-4.652
icity-4.518
Token to
Token ID284
Feature activation-2.16e-01

Pos logit contributions
ixel+1.876
theless+1.779
eu+1.732
asta+1.708
DX+1.657
Neg logit contributions
acity-2.154
 Tup-1.696
-1.601
icate-1.595
icity-1.551
Token gain
Token ID4461
Feature activation-4.75e-02

Pos logit contributions
acity+2.754
 Tup+2.166
icate+2.073
+2.001
icity+1.956
Neg logit contributions
ixel-2.279
theless-2.177
eu-2.034
DX-2.003
asta-1.992
Token 300
Token ID5867
Feature activation-4.39e-01

Pos logit contributions
acity+0.567
 Tup+0.443
+0.416
icate+0.415
icity+0.399
Neg logit contributions
ixel-0.537
theless-0.513
eu-0.491
asta-0.483
DX-0.476
Token playing
Token ID2712
Feature activation+8.23e-01

Pos logit contributions
ixel+5.129
theless+4.846
eu+4.646
asta+4.564
DX+4.519
Neg logit contributions
acity-6.638
 Tup-5.320
-5.314
icate-4.965
icity-4.877
Token:
Token ID25
Feature activation+2.86e-01

Pos logit contributions
ixel+6.022
DX+5.204
asta+5.118
eu+5.081
 Graveyard+5.063
Neg logit contributions
acity-11.805
 Tup-9.798
-9.734
icate-9.487
icity-9.413
Token Watch
Token ID6305
Feature activation+5.17e-01

Pos logit contributions
ixel+3.040
theless+2.798
eu+2.715
asta+2.670
DX+2.610
Neg logit contributions
acity-3.541
 Tup-2.787
-2.733
icate-2.681
igon-2.600
Token this
Token ID428
Feature activation+8.31e-01

Pos logit contributions
ixel+4.715
eu+4.225
theless+4.167
DX+4.068
asta+4.063
Neg logit contributions
acity-6.806
icate-5.580
-5.560
 Tup-5.559
icity-5.460
Token:
Token ID25
Feature activation+5.62e-01

Pos logit contributions
ixel+5.788
asta+5.162
theless+4.818
DX+4.766
eu+4.736
Neg logit contributions
acity-11.994
-10.525
icate-10.045
icity-9.892
 Tup-9.809
Token Top
Token ID5849
Feature activation+1.30e+00

Pos logit contributions
ixel+5.455
eu+5.210
theless+5.105
 Patel+5.040
 Phill+4.996
Neg logit contributions
acity-6.999
-5.568
icate-5.255
 Tup-5.229
icity-5.180
Token 5
Token ID642
Feature activation+1.42e+00

Pos logit contributions
ixel+11.414
asta+10.310
eu+10.161
DX+10.149
 exec+10.147
Neg logit contributions
acity-14.254
-12.162
icate-10.993
ilateral-10.788
 Tup-10.764
Token best
Token ID1266
Feature activation+8.95e-01

Pos logit contributions
ixel+12.019
DX+11.406
 bottleneck+11.358
eu+11.152
 nick+11.144
Neg logit contributions
acity-14.452
-12.515
icate-11.126
roth-11.048
icity-11.042
Token features
Token ID3033
Feature activation+7.94e-01

Pos logit contributions
ixel+8.754
DX+7.883
eu+7.839
asta+7.747
theless+7.712
Neg logit contributions
acity-10.215
-8.373
 Tup-7.992
icate-7.839
ilateral-7.731
Token in
Token ID287
Feature activation+1.02e+00

Pos logit contributions
ixel+7.018
theless+6.411
eu+6.244
asta+6.203
 Phill+6.172
Neg logit contributions
acity-10.008
-8.440
 Tup-7.845
icate-7.833
icity-7.654
Token the
Token ID262
Feature activation+1.06e+00

Pos logit contributions
ixel+9.092
eu+8.161
DX+8.056
 Patel+8.045
theless+8.006
Neg logit contributions
acity-12.000
-10.433
icity-9.279
icate-9.267
 Tup-9.234
Token,
Token ID837
Feature activation-1.89e-01

Pos logit contributions
ixel+2.989
theless+2.821
eu+2.759
DX+2.684
asta+2.674
Neg logit contributions
acity-3.745
 Tup-2.948
-2.865
icate-2.848
ighth-2.718
Token en
Token ID551
Feature activation-3.37e-02

Pos logit contributions
acity+2.407
 Tup+1.918
+1.808
icate+1.803
icity+1.747
Neg logit contributions
ixel-1.984
theless-1.891
eu-1.781
asta-1.765
DX-1.725
Token ra
Token ID2179
Feature activation+1.98e-01

Pos logit contributions
acity+0.414
 Tup+0.326
+0.308
icate+0.306
icity+0.294
Neg logit contributions
ixel-0.371
theless-0.354
eu-0.338
asta-0.332
DX-0.326
Tokenison
Token ID1653
Feature activation-1.79e-01

Pos logit contributions
ixel+2.501
theless+2.372
eu+2.345
asta+2.302
DX+2.233
Neg logit contributions
acity-2.074
 Tup-1.578
-1.513
icate-1.444
icity-1.382
Token des
Token ID748
Feature activation+3.05e-01

Pos logit contributions
acity+1.899
 Tup+1.455
+1.347
icate+1.329
icity+1.263
Neg logit contributions
ixel-2.253
theless-2.162
eu-2.054
asta-2.032
DX-2.021
Token inc
Token ID753
Feature activation+1.27e+00

Pos logit contributions
ixel+3.360
eu+3.211
theless+3.156
asta+3.020
 Patel+2.990
Neg logit contributions
acity-3.607
 Tup-2.801
-2.719
icate-2.644
ighth-2.584
Tokenend
Token ID437
Feature activation+3.03e-01

Pos logit contributions
ixel+12.733
eu+12.675
asta+11.627
orp+11.386
ey+11.332
Neg logit contributions
acity-11.944
-10.292
 Tup-10.193
Detailed-9.504
Cod-8.891
Tokenies
Token ID444
Feature activation+1.10e-01

Pos logit contributions
ixel+3.430
eu+3.223
theless+3.117
asta+3.088
 Phill+2.955
Neg logit contributions
acity-3.525
 Tup-2.850
-2.760
icate-2.592
ighth-2.517
Token sur
Token ID969
Feature activation+2.88e-01

Pos logit contributions
ixel+1.186
theless+1.128
eu+1.091
asta+1.067
DX+1.048
Neg logit contributions
acity-1.375
 Tup-1.079
-1.035
icate-1.022
icity-0.986
Token #
Token ID1303
Feature activation+1.09e-01

Pos logit contributions
ixel+2.903
theless+2.750
eu+2.702
asta+2.606
DX+2.542
Neg logit contributions
acity-3.722
 Tup-2.919
-2.840
icate-2.830
icity-2.732
TokenB
Token ID33
Feature activation+7.29e-02

Pos logit contributions
ixel+1.306
theless+1.242
eu+1.219
asta+1.192
DX+1.176
Neg logit contributions
acity-1.225
 Tup-0.947
-0.881
icate-0.876
icity-0.838
Token he
Token ID339
Feature activation+6.88e-01

Pos logit contributions
ixel+4.398
theless+4.316
eu+4.130
 Phill+3.946
asta+3.934
Neg logit contributions
acity-5.593
 Tup-4.330
-4.269
icate-4.193
icity-4.116
Token commanded
Token ID22419
Feature activation+4.85e-01

Pos logit contributions
theless+6.385
ixel+6.355
eu+5.998
asta+5.619
 rou+5.537
Neg logit contributions
acity-8.506
-7.081
 Tup-6.820
icate-6.569
icity-6.363
Token the
Token ID262
Feature activation+4.43e-01

Pos logit contributions
ixel+4.734
theless+4.601
eu+4.452
 Phill+4.277
 Patel+4.248
Neg logit contributions
acity-6.052
-4.774
 Tup-4.657
icate-4.524
icity-4.436
Token troops
Token ID6553
Feature activation+4.72e-01

Pos logit contributions
ixel+4.571
theless+4.401
eu+4.269
 Phill+4.083
asta+4.064
Neg logit contributions
acity-5.351
-4.200
 Tup-4.100
icate-4.043
icity-3.922
Token "
Token ID366
Feature activation+6.79e-01

Pos logit contributions
ixel+4.519
theless+4.328
eu+4.307
asta+4.068
 Phill+4.045
Neg logit contributions
acity-5.965
-4.774
 Tup-4.687
icate-4.638
icity-4.456
Tokento
Token ID1462
Feature activation+1.26e+00

Pos logit contributions
ixel+6.417
eu+6.347
theless+6.245
DX+5.921
asta+5.858
Neg logit contributions
acity-8.100
 Tup-6.725
icate-6.354
-6.310
icity-6.092
Token enc
Token ID2207
Feature activation+1.03e+00

Pos logit contributions
ixel+10.421
eu+9.915
theless+9.784
 bottleneck+9.671
 disg+9.604
Neg logit contributions
acity-13.965
-11.105
icity-10.769
 Tup-10.756
ighth-10.540
Tokenirc
Token ID1980
Feature activation+2.92e-01

Pos logit contributions
ixel+6.151
eu+5.444
asta+5.178
 Patel+4.824
 Phill+4.811
Neg logit contributions
acity-15.404
-14.066
 Tup-13.377
Detailed-12.851
Cod-12.660
Tokenle
Token ID293
Feature activation+4.21e-02

Pos logit contributions
ixel+2.705
theless+2.465
eu+2.441
asta+2.357
 Patel+2.248
Neg logit contributions
acity-4.053
 Tup-3.327
-3.232
icate-3.069
icity-3.021
Token and
Token ID290
Feature activation+8.68e-01

Pos logit contributions
ixel+0.462
theless+0.443
eu+0.425
asta+0.416
DX+0.408
Neg logit contributions
acity-0.514
 Tup-0.403
-0.384
icate-0.380
icity-0.365
Token destroy
Token ID4117
Feature activation+2.92e-01

Pos logit contributions
ixel+7.288
eu+7.093
 bottleneck+6.812
theless+6.755
asta+6.703
Neg logit contributions
acity-10.791
-8.833
 Tup-8.712
icate-8.434
Cod-8.211
Token change
Token ID1487
Feature activation+3.34e-01

Pos logit contributions
ixel+4.543
theless+4.400
eu+4.364
asta+4.081
 accord+4.071
Neg logit contributions
acity-9.450
 Tup-7.983
-7.982
icate-7.783
igon-7.497
Token
Token ID784
Feature activation+2.10e-01

Pos logit contributions
ixel+3.314
theless+3.214
eu+3.087
asta+3.011
 Patel+2.943
Neg logit contributions
acity-4.236
 Tup-3.389
-3.371
icate-3.231
ighth-3.114
Token like
Token ID588
Feature activation+3.41e-01

Pos logit contributions
ixel+2.212
theless+2.133
eu+2.054
asta+2.009
DX+1.949
Neg logit contributions
acity-2.622
 Tup-2.059
-1.994
icate-1.955
icity-1.887
Token a
Token ID257
Feature activation+5.47e-01

Pos logit contributions
ixel+3.434
theless+3.343
eu+3.233
asta+3.153
 Phill+3.092
Neg logit contributions
acity-4.285
 Tup-3.361
-3.339
icate-3.236
igon-3.127
Token European
Token ID3427
Feature activation+8.24e-01

Pos logit contributions
ixel+5.177
theless+4.921
eu+4.912
asta+4.766
 Phill+4.665
Neg logit contributions
acity-6.936
-5.420
 Tup-5.292
icate-5.277
igon-5.122
Token carbon
Token ID6588
Feature activation+1.26e+00

Pos logit contributions
ixel+7.051
eu+6.621
theless+6.522
 Phill+6.518
 accord+6.481
Neg logit contributions
acity-10.442
-8.722
icate-8.013
 Tup-7.889
ighth-7.825
Token bank
Token ID3331
Feature activation+4.19e-01

Pos logit contributions
 accord+8.272
eu+7.933
 veto+7.749
ixel+7.638
DX+7.593
Neg logit contributions
acity-15.537
-14.166
 Tup-13.335
igon-12.902
Detailed-12.558
Token
Token ID784
Feature activation+1.01e-02

Pos logit contributions
ixel+3.974
theless+3.808
eu+3.726
asta+3.609
DX+3.526
Neg logit contributions
acity-5.431
 Tup-4.379
-4.376
icate-4.064
ighth-4.010
Token or
Token ID393
Feature activation+2.36e-01

Pos logit contributions
ixel+0.112
theless+0.107
eu+0.103
asta+0.101
DX+0.099
Neg logit contributions
acity-0.123
 Tup-0.096
-0.091
icate-0.090
icity-0.087
Token tighter
Token ID28619
Feature activation-1.69e-01

Pos logit contributions
ixel+2.503
theless+2.412
eu+2.323
asta+2.279
 Patel+2.206
Neg logit contributions
acity-2.919
 Tup-2.283
-2.229
icate-2.164
icity-2.086
Token rules
Token ID3173
Feature activation-8.91e-02

Pos logit contributions
acity+2.002
 Tup+1.572
icate+1.451
+1.448
icity+1.416
Neg logit contributions
ixel-1.939
theless-1.851
eu-1.742
asta-1.721
DX-1.707
Token Kn
Token ID6102
Feature activation+4.01e-01

Pos logit contributions
acity+1.048
 Tup+0.866
+0.826
icate+0.824
icity+0.799
Neg logit contributions
ixel-0.595
theless-0.561
eu-0.527
asta-0.515
DX-0.504
Tokenob
Token ID672
Feature activation-4.34e-01

Pos logit contributions
ixel+3.234
eu+2.716
asta+2.679
theless+2.616
DX+2.594
Neg logit contributions
acity-6.021
-5.100
 Tup-5.090
icate-4.743
icity-4.622
Token
Token ID851
Feature activation+4.95e-02

Pos logit contributions
acity+5.032
 Tup+3.936
icate+3.624
+3.578
icity+3.477
Neg logit contributions
ixel-4.970
theless-4.771
eu-4.477
asta-4.450
DX-4.383
Token which
Token ID543
Feature activation-3.09e-01

Pos logit contributions
ixel+0.538
theless+0.514
eu+0.493
asta+0.484
DX+0.475
Neg logit contributions
acity-0.612
 Tup-0.480
-0.456
icate-0.453
icity-0.436
Token appears
Token ID3568
Feature activation+7.99e-02

Pos logit contributions
acity+3.812
 Tup+3.096
icate+2.801
+2.780
icity+2.714
Neg logit contributions
ixel-3.329
theless-3.148
eu-2.988
asta-2.941
DX-2.906
Token rec
Token ID664
Feature activation+1.26e+00

Pos logit contributions
ixel+0.875
theless+0.834
eu+0.804
asta+0.789
DX+0.772
Neg logit contributions
acity-0.977
 Tup-0.766
-0.725
icate-0.722
icity-0.695
Tokenessed
Token ID6676
Feature activation-1.71e-01

Pos logit contributions
ixel+10.588
asta+9.799
eu+9.550
orp+9.063
ategor+8.687
Neg logit contributions
acity-14.970
-13.339
 Tup-12.373
Detailed-11.776
icate-11.610
Token into
Token ID656
Feature activation+8.79e-03

Pos logit contributions
acity+2.022
 Tup+1.593
icate+1.478
+1.459
icity+1.421
Neg logit contributions
ixel-1.955
theless-1.868
eu-1.783
asta-1.752
DX-1.732
Token a
Token ID257
Feature activation+2.79e-01

Pos logit contributions
ixel+0.098
theless+0.094
eu+0.090
asta+0.088
DX+0.087
Neg logit contributions
acity-0.106
 Tup-0.083
-0.079
icate-0.078
icity-0.075
Token hill
Token ID12788
Feature activation+1.20e-01

Pos logit contributions
ixel+2.901
theless+2.792
eu+2.666
asta+2.606
 Phill+2.583
Neg logit contributions
acity-3.529
 Tup-2.699
-2.694
icate-2.614
icity-2.510
Token
Token ID851
Feature activation-6.14e-03

Pos logit contributions
ixel+1.348
theless+1.279
eu+1.231
asta+1.213
DX+1.192
Neg logit contributions
acity-1.442
 Tup-1.123
-1.075
icate-1.055
icity-1.011
Token,
Token ID11
Feature activation-2.12e-01

Pos logit contributions
acity+1.397
 Tup+1.097
icate+1.023
+1.018
icity+0.987
Neg logit contributions
ixel-1.339
theless-1.274
eu-1.216
asta-1.192
DX-1.183
Token Christianity
Token ID13624
Feature activation+3.94e-01

Pos logit contributions
acity+2.459
 Tup+1.946
icate+1.797
+1.777
icity+1.730
Neg logit contributions
ixel-2.450
theless-2.323
eu-2.213
asta-2.174
DX-2.168
Token would
Token ID561
Feature activation+4.17e-01

Pos logit contributions
ixel+3.824
theless+3.690
eu+3.488
asta+3.410
 Patel+3.295
Neg logit contributions
acity-5.114
 Tup-4.050
-4.046
icate-3.878
ighth-3.736
Token later
Token ID1568
Feature activation+3.15e-01

Pos logit contributions
ixel+3.971
theless+3.717
eu+3.613
asta+3.536
 bottleneck+3.379
Neg logit contributions
acity-5.457
-4.428
 Tup-4.421
icate-4.131
Detailed-4.079
Token on
Token ID319
Feature activation+2.79e-01

Pos logit contributions
ixel+3.075
theless+2.941
eu+2.815
asta+2.749
 Patel+2.598
Neg logit contributions
acity-4.130
 Tup-3.304
-3.272
icate-3.095
ighth-3.046
Token rec
Token ID664
Feature activation+1.25e+00

Pos logit contributions
ixel+2.747
theless+2.627
eu+2.507
asta+2.450
DX+2.334
Neg logit contributions
acity-3.656
 Tup-2.925
-2.880
icate-2.723
ighth-2.689
Tokenu
Token ID84
Feature activation-7.64e-02

Pos logit contributions
ixel+10.904
asta+9.984
eu+9.376
ategor+9.107
orp+8.944
Neg logit contributions
acity-14.649
-12.903
 Tup-12.404
Detailed-11.510
ICE-11.281
Tokenperate
Token ID30052
Feature activation-1.88e-01

Pos logit contributions
acity+1.040
 Tup+0.836
+0.798
icate+0.795
icity+0.766
Neg logit contributions
ixel-0.737
theless-0.698
eu-0.667
asta-0.650
DX-0.643
Token the
Token ID262
Feature activation-2.22e-01

Pos logit contributions
acity+2.176
 Tup+1.703
icate+1.600
+1.578
icity+1.523
Neg logit contributions
ixel-2.185
theless-2.081
eu-1.990
asta-1.963
DX-1.950
Token pagan
Token ID36164
Feature activation+2.51e-01

Pos logit contributions
acity+2.739
 Tup+2.170
icate+2.038
+2.030
icity+1.960
Neg logit contributions
ixel-2.397
theless-2.285
eu-2.166
asta-2.127
DX-2.127
Token myths
Token ID22886
Feature activation-8.01e-02

Pos logit contributions
ixel+2.594
theless+2.462
eu+2.400
asta+2.374
DX+2.279
Neg logit contributions
acity-3.135
 Tup-2.490
-2.407
icate-2.342
ighth-2.269
Token vote
Token ID3015
Feature activation-1.37e-01

Pos logit contributions
acity+3.859
 Tup+3.131
icate+2.987
icity+2.801
+2.776
Neg logit contributions
ixel-3.348
theless-3.294
eu-3.080
DX-3.019
asta-2.990
Token on
Token ID319
Feature activation+3.61e-01

Pos logit contributions
acity+1.607
 Tup+1.259
icate+1.173
+1.165
icity+1.118
Neg logit contributions
ixel-1.577
theless-1.522
eu-1.438
asta-1.417
DX-1.398
Token at
Token ID379
Feature activation+5.74e-01

Pos logit contributions
ixel+3.614
theless+3.342
eu+3.342
asta+3.246
 Patel+3.190
Neg logit contributions
acity-4.622
-3.621
 Tup-3.606
icate-3.451
icity-3.315
Token Yahoo
Token ID16551
Feature activation+5.46e-01

Pos logit contributions
ixel+5.362
eu+4.993
theless+4.729
asta+4.725
 Patel+4.652
Neg logit contributions
acity-7.491
-5.921
 Tup-5.887
icate-5.594
icity-5.432
Token's
Token ID338
Feature activation+7.42e-01

Pos logit contributions
ixel+5.057
eu+4.867
theless+4.681
DX+4.547
asta+4.506
Neg logit contributions
acity-7.199
-5.729
 Tup-5.638
icate-5.323
icity-5.192
Token annual
Token ID5079
Feature activation+1.24e+00

Pos logit contributions
ixel+6.612
eu+6.299
 Patel+5.962
 shareholder+5.955
theless+5.905
Neg logit contributions
acity-9.521
-7.732
 Tup-7.338
icate-7.142
icity-7.009
Token meeting
Token ID3249
Feature activation+7.45e-02

Pos logit contributions
 shareholder+8.722
ixel+8.085
eu+7.852
 exec+7.633
 Patel+7.210
Neg logit contributions
acity-16.178
-13.840
 Tup-13.044
igon-12.671
icate-12.510
Token.
Token ID13
Feature activation+1.69e-01

Pos logit contributions
ixel+0.809
theless+0.770
eu+0.742
asta+0.726
DX+0.715
Neg logit contributions
acity-0.919
 Tup-0.722
-0.693
icate-0.681
icity-0.654
Token The
Token ID383
Feature activation+1.89e-01

Pos logit contributions
ixel+1.698
theless+1.606
eu+1.569
asta+1.506
DX+1.492
Neg logit contributions
acity-2.214
 Tup-1.744
-1.699
icate-1.667
icity-1.615
Token move
Token ID1445
Feature activation-8.75e-01

Pos logit contributions
ixel+1.910
theless+1.807
eu+1.752
asta+1.700
DX+1.672
Neg logit contributions
acity-2.460
 Tup-1.942
-1.900
icate-1.840
icity-1.784
Token would
Token ID561
Feature activation-2.67e-01

Pos logit contributions
acity+8.987
 Tup+7.135
icity+6.266
icate+6.184
igon+6.043
Neg logit contributions
ixel-9.902
theless-8.930
DX-8.832
asta-8.759
eu-8.519
Token?
Token ID30
Feature activation+3.20e-01

Pos logit contributions
ixel+3.074
theless+2.855
eu+2.831
asta+2.786
DX+2.765
Neg logit contributions
acity-3.075
 Tup-2.418
-2.306
icate-2.207
icity-2.132
Token Email
Token ID9570
Feature activation+6.90e-01

Pos logit contributions
ixel+3.436
theless+3.207
asta+3.163
eu+3.157
DX+3.147
Neg logit contributions
acity-3.826
-3.030
 Tup-2.955
icate-2.784
ilateral-2.667
Token us
Token ID514
Feature activation+5.86e-01

Pos logit contributions
ixel+5.999
DX+5.125
asta+4.826
eu+4.748
 Patel+4.671
Neg logit contributions
acity-9.347
-8.105
 Tup-7.873
ilateral-7.631
itol-7.441
Token at
Token ID379
Feature activation+1.17e+00

Pos logit contributions
ixel+4.918
theless+4.429
asta+4.359
DX+4.195
 guiActive+4.187
Neg logit contributions
acity-8.152
-6.682
icate-6.470
 Tup-6.424
Detailed-6.332
Token tips
Token ID9040
Feature activation+6.22e-01

Pos logit contributions
ixel+5.455
 nick+3.919
 rou+3.871
 exec+3.501
asta+3.499
Neg logit contributions
acity-19.632
-16.647
ilateral-16.344
icate-16.343
ffe-16.235
Token@
Token ID31
Feature activation+1.24e+00

Pos logit contributions
ixel+5.270
eu+4.434
asta+4.239
theless+4.210
DX+4.173
Neg logit contributions
acity-8.794
 Tup-7.719
-7.186
icate-7.043
igon-6.941
Tokenthe
Token ID1169
Feature activation+7.67e-01

Pos logit contributions
ixel+6.814
eu+6.170
orp+6.052
asta+5.954
ey+5.927
Neg logit contributions
acity-17.029
-16.375
 Tup-16.173
 Cah-15.148
 Haz-15.075
Token-
Token ID12
Feature activation+3.45e-01

Pos logit contributions
ixel+6.061
eu+5.708
asta+5.611
theless+5.521
orp+5.270
Neg logit contributions
acity-9.869
-9.330
 Tup-9.014
icate-8.118
 Cah-8.092
Tokensun
Token ID19155
Feature activation+1.65e-01

Pos logit contributions
ixel+1.670
theless+1.475
eu+1.472
asta+1.359
DX+1.213
Neg logit contributions
acity-6.143
-5.442
 Tup-5.394
icate-5.039
Detailed-4.974
Token.
Token ID13
Feature activation+6.86e-01

Pos logit contributions
ixel+1.771
theless+1.671
eu+1.630
asta+1.601
DX+1.547
Neg logit contributions
acity-2.041
 Tup-1.651
-1.588
icate-1.518
icity-1.457
Tokenco
Token ID1073
Feature activation+5.62e-01

Pos logit contributions
ixel+6.023
eu+5.803
asta+5.347
DX+5.238
theless+5.105
Neg logit contributions
acity-9.000
-7.980
 Tup-7.745
icate-7.084
ilateral-6.931
Token October
Token ID3267
Feature activation+7.91e-02

Pos logit contributions
ixel+1.821
theless+1.734
eu+1.676
asta+1.642
 Patel+1.601
Neg logit contributions
acity-2.100
 Tup-1.651
-1.584
icate-1.561
icity-1.507
Token 17
Token ID1596
Feature activation+2.01e-01

Pos logit contributions
ixel+0.796
theless+0.758
eu+0.722
asta+0.707
DX+0.696
Neg logit contributions
acity-1.035
 Tup-0.830
-0.797
icate-0.784
icity-0.755
Token to
Token ID284
Feature activation-2.25e-01

Pos logit contributions
ixel+2.148
theless+2.079
eu+2.004
asta+1.940
DX+1.914
Neg logit contributions
acity-2.445
 Tup-1.929
-1.859
icate-1.821
icity-1.741
Token operate
Token ID8076
Feature activation-1.01e-01

Pos logit contributions
acity+2.831
 Tup+2.263
icate+2.147
+2.093
icity+2.035
Neg logit contributions
ixel-2.342
theless-2.272
eu-2.164
asta-2.113
DX-2.087
Token under
Token ID739
Feature activation+5.10e-01

Pos logit contributions
acity+1.245
 Tup+0.984
icate+0.924
+0.923
icity+0.888
Neg logit contributions
ixel-1.104
theless-1.050
eu-1.000
asta-0.988
DX-0.970
Token the
Token ID262
Feature activation+1.23e+00

Pos logit contributions
ixel+4.819
theless+4.600
eu+4.427
DX+4.241
 Patel+4.239
Neg logit contributions
acity-6.531
 Tup-5.291
-5.111
icate-5.012
icity-4.821
Token US
Token ID1294
Feature activation+9.32e-01

Pos logit contributions
 veto+9.812
ixel+9.721
 accord+9.683
theless+9.593
 Patel+9.495
Neg logit contributions
acity-14.019
-11.217
 Tup-10.779
icate-10.740
igon-10.395
Token debt
Token ID5057
Feature activation+7.67e-02

Pos logit contributions
ixel+8.200
eu+7.748
DX+7.679
theless+7.627
asta+7.442
Neg logit contributions
acity-11.252
-8.819
 Tup-8.804
icate-8.658
icity-8.241
Token cap
Token ID1451
Feature activation+7.14e-02

Pos logit contributions
ixel+0.716
theless+0.677
eu+0.648
asta+0.634
DX+0.623
Neg logit contributions
acity-1.059
 Tup-0.859
-0.828
icate-0.814
icity-0.784
Token.
Token ID13
Feature activation+4.51e-01

Pos logit contributions
ixel+0.776
theless+0.741
eu+0.713
asta+0.697
DX+0.687
Neg logit contributions
acity-0.876
 Tup-0.695
-0.665
icate-0.651
icity-0.624
Token Lew
Token ID8260
Feature activation-1.52e-02

Pos logit contributions
ixel+4.958
theless+4.769
eu+4.719
 Phill+4.596
 Patel+4.500
Neg logit contributions
acity-5.129
 Tup-3.927
-3.776
icate-3.702
igon-3.594
Token and
Token ID290
Feature activation+6.86e-02

Pos logit contributions
ixel+1.777
theless+1.691
eu+1.628
asta+1.586
DX+1.553
Neg logit contributions
acity-2.198
 Tup-1.746
-1.672
icate-1.641
icity-1.590
Token,
Token ID11
Feature activation-1.63e-02

Pos logit contributions
ixel+0.683
theless+0.652
eu+0.623
asta+0.607
DX+0.596
Neg logit contributions
acity-0.908
 Tup-0.725
-0.702
icate-0.689
icity-0.665
Token according
Token ID1864
Feature activation+5.58e-02

Pos logit contributions
acity+0.200
 Tup+0.157
+0.149
icate+0.148
icity+0.142
Neg logit contributions
ixel-0.179
theless-0.171
eu-0.164
asta-0.161
DX-0.158
Token to
Token ID284
Feature activation+5.06e-01

Pos logit contributions
ixel+0.619
theless+0.589
eu+0.571
asta+0.558
DX+0.550
Neg logit contributions
acity-0.673
 Tup-0.528
-0.502
icate-0.496
icity-0.478
Token several
Token ID1811
Feature activation+8.50e-01

Pos logit contributions
ixel+5.091
eu+4.822
theless+4.818
 Phill+4.509
asta+4.499
Neg logit contributions
acity-6.261
 Tup-4.890
-4.842
icate-4.719
icity-4.586
Token European
Token ID3427
Feature activation+1.23e+00

Pos logit contributions
eu+7.235
ixel+7.235
theless+6.851
 Patel+6.636
 Phill+6.573
Neg logit contributions
acity-10.463
-8.744
 Tup-8.388
icate-8.268
igon-8.083
Token sources
Token ID4237
Feature activation+2.19e-01

Pos logit contributions
eu+9.647
ixel+9.265
DX+8.865
theless+8.726
 Phill+8.632
Neg logit contributions
acity-14.319
-12.404
 Tup-11.544
igon-11.159
hid-11.112
Token,
Token ID11
Feature activation+1.50e-01

Pos logit contributions
ixel+2.204
theless+2.105
eu+2.061
asta+1.998
DX+1.928
Neg logit contributions
acity-2.828
 Tup-2.233
-2.144
icate-2.104
icity-2.029
Token the
Token ID262
Feature activation+4.99e-01

Pos logit contributions
ixel+1.511
theless+1.446
eu+1.391
asta+1.348
 Phill+1.326
Neg logit contributions
acity-1.967
 Tup-1.554
-1.531
icate-1.485
icity-1.434
Token Netherlands
Token ID12671
Feature activation+2.15e-01

Pos logit contributions
ixel+4.173
theless+3.979
eu+3.906
 Phill+3.882
 Patel+3.707
Neg logit contributions
acity-6.922
-5.811
 Tup-5.535
icate-5.379
icity-5.343
Token and
Token ID290
Feature activation+2.13e-02

Pos logit contributions
ixel+2.124
theless+2.058
eu+1.970
asta+1.894
DX+1.856
Neg logit contributions
acity-2.823
 Tup-2.241
-2.207
icate-2.135
icity-2.080
TokenOf
Token ID5189
Feature activation+7.75e-02

Pos logit contributions
ixel+2.668
eu+2.527
theless+2.492
DX+2.460
asta+2.366
Neg logit contributions
acity-3.465
 Tup-2.805
icate-2.625
-2.617
icity-2.568
Token corner
Token ID5228
Feature activation-1.98e-01

Pos logit contributions
ixel+0.962
theless+0.919
eu+0.890
asta+0.873
DX+0.863
Neg logit contributions
acity-0.837
 Tup-0.634
-0.598
icate-0.585
icity-0.563
Tokenbacks
Token ID10146
Feature activation+1.23e-01

Pos logit contributions
acity+2.000
 Tup+1.505
icate+1.375
+1.361
icity+1.312
Neg logit contributions
ixel-2.600
theless-2.498
eu-2.385
asta-2.362
DX-2.334
Token in
Token ID287
Feature activation+5.08e-01

Pos logit contributions
ixel+1.293
theless+1.230
eu+1.189
asta+1.162
DX+1.136
Neg logit contributions
acity-1.546
 Tup-1.217
-1.182
icate-1.157
icity-1.117
Token my
Token ID616
Feature activation+1.00e+00

Pos logit contributions
ixel+5.068
eu+4.573
theless+4.452
asta+4.442
 Phill+4.336
Neg logit contributions
acity-6.331
 Tup-5.024
-4.958
icate-4.846
icity-4.800
Token top
Token ID1353
Feature activation+1.22e+00

Pos logit contributions
ixel+8.998
eu+8.050
 Phill+7.903
DX+7.889
asta+7.874
Neg logit contributions
acity-11.846
-10.075
 Tup-9.513
icate-9.445
icity-9.354
Token 10
Token ID838
Feature activation+4.88e-01

Pos logit contributions
ixel+8.001
asta+7.365
 bottleneck+7.256
DX+7.162
 Phill+7.096
Neg logit contributions
acity-15.940
-14.284
 Tup-12.967
icate-12.909
igon-12.800
Token,
Token ID11
Feature activation+5.43e-01

Pos logit contributions
ixel+4.602
eu+4.267
theless+4.207
asta+4.186
DX+4.093
Neg logit contributions
acity-6.273
-5.114
 Tup-5.051
icate-4.813
ighth-4.713
Token Peterson
Token ID18914
Feature activation-5.12e-01

Pos logit contributions
ixel+4.375
theless+4.085
eu+3.966
 Phill+3.898
 Patel+3.896
Neg logit contributions
acity-7.684
-6.239
 Tup-6.161
icate-5.995
icity-5.934
Token faced
Token ID7452
Feature activation-9.16e-02

Pos logit contributions
acity+6.032
 Tup+4.667
icate+4.240
icity+4.236
ighth+4.155
Neg logit contributions
ixel-5.804
theless-5.273
DX-5.081
asta-5.068
eu-5.050
Token the
Token ID262
Feature activation+3.43e-01

Pos logit contributions
acity+1.103
 Tup+0.868
icate+0.808
+0.807
icity+0.777
Neg logit contributions
ixel-1.027
theless-0.983
eu-0.940
asta-0.924
DX-0.910
Token he
Token ID339
Feature activation+4.74e-01

Pos logit contributions
ixel+3.730
theless+3.643
eu+3.447
asta+3.414
 Phill+3.362
Neg logit contributions
acity-5.081
-3.993
 Tup-3.908
icate-3.859
icity-3.729
Token eagerly
Token ID30130
Feature activation+6.56e-01

Pos logit contributions
ixel+4.513
theless+4.465
eu+4.276
asta+4.154
 rou+4.058
Neg logit contributions
acity-6.050
-4.869
 Tup-4.692
icate-4.582
icity-4.456
Token listened
Token ID16399
Feature activation-9.15e-02

Pos logit contributions
theless+6.296
ixel+6.121
eu+5.878
 rou+5.749
asta+5.648
Neg logit contributions
acity-8.139
-6.581
 Tup-6.137
icate-6.064
icity-5.883
Token to
Token ID284
Feature activation+2.88e-01

Pos logit contributions
acity+1.074
 Tup+0.836
+0.784
icate+0.781
icity+0.746
Neg logit contributions
ixel-1.058
theless-1.010
eu-0.966
asta-0.950
DX-0.942
Token the
Token ID262
Feature activation+5.97e-01

Pos logit contributions
ixel+2.990
theless+2.852
eu+2.777
asta+2.707
 Phill+2.686
Neg logit contributions
acity-3.572
 Tup-2.755
-2.748
icate-2.671
icity-2.609
Token exp
Token ID1033
Feature activation+1.22e+00

Pos logit contributions
ixel+5.967
theless+5.778
eu+5.730
asta+5.601
 Phill+5.566
Neg logit contributions
acity-6.924
-5.551
 Tup-5.253
icate-5.181
icity-5.078
Tokeniring
Token ID3428
Feature activation+1.39e-01

Pos logit contributions
ixel+11.354
eu+10.839
asta+10.220
ey+10.061
orp+9.955
Neg logit contributions
acity-12.504
-11.408
 Tup-10.794
Detailed-9.580
icate-9.550
Token gro
Token ID7128
Feature activation+1.42e-01

Pos logit contributions
ixel+1.477
theless+1.414
eu+1.361
asta+1.334
DX+1.300
Neg logit contributions
acity-1.725
 Tup-1.357
-1.324
icate-1.291
icity-1.246
Tokenans
Token ID504
Feature activation+1.27e-01

Pos logit contributions
ixel+1.549
theless+1.448
eu+1.420
asta+1.393
DX+1.342
Neg logit contributions
acity-1.739
 Tup-1.376
-1.325
icate-1.288
icity-1.238
Token of
Token ID286
Feature activation+5.36e-01

Pos logit contributions
ixel+1.377
theless+1.315
eu+1.271
asta+1.241
DX+1.213
Neg logit contributions
acity-1.562
 Tup-1.227
-1.181
icate-1.163
ighth-1.114
Token his
Token ID465
Feature activation+3.92e-01

Pos logit contributions
ixel+5.070
theless+4.937
eu+4.797
 Phill+4.761
 Patel+4.677
Neg logit contributions
acity-6.709
-5.425
 Tup-5.117
icate-5.074
icity-4.988
Token artist
Token ID6802
Feature activation-1.54e-01

Pos logit contributions
acity+2.119
 Tup+1.633
icate+1.529
+1.515
icity+1.459
Neg logit contributions
ixel-2.051
theless-1.998
eu-1.885
asta-1.856
DX-1.834
Token,
Token ID11
Feature activation+5.67e-02

Pos logit contributions
acity+1.869
 Tup+1.469
icate+1.378
+1.371
icity+1.324
Neg logit contributions
ixel-1.709
theless-1.647
eu-1.555
asta-1.532
DX-1.505
Token graduating
Token ID30024
Feature activation-3.80e-01

Pos logit contributions
ixel+0.585
theless+0.554
eu+0.530
asta+0.520
DX+0.512
Neg logit contributions
acity-0.734
 Tup-0.583
-0.558
icate-0.552
icity-0.533
Token with
Token ID351
Feature activation-2.69e-01

Pos logit contributions
acity+4.310
 Tup+3.392
icate+3.109
+3.062
ighth+2.946
Neg logit contributions
ixel-4.394
theless-4.286
eu-3.987
asta-3.936
DX-3.919
Token a
Token ID257
Feature activation-3.76e-01

Pos logit contributions
acity+3.057
 Tup+2.373
icate+2.186
+2.168
icity+2.103
Neg logit contributions
ixel-3.146
theless-3.066
eu-2.904
asta-2.872
DX-2.827
Token B
Token ID347
Feature activation-1.81e+00

Pos logit contributions
acity+4.265
 Tup+3.310
icate+3.070
icity+2.986
+2.980
Neg logit contributions
ixel-4.340
theless-4.259
eu-4.060
asta-3.986
DX-3.848
TokenFA
Token ID7708
Feature activation-3.02e-01

Pos logit contributions
acity+15.852
FA+13.767
ICE+12.935
igon+12.296
 Tup+12.038
Neg logit contributions
theless-16.237
eu-14.050
 Patel-13.874
ixel-13.827
 bottleneck-13.822
Token in
Token ID287
Feature activation-4.42e-01

Pos logit contributions
acity+3.446
 Tup+2.650
icate+2.477
+2.456
icity+2.373
Neg logit contributions
ixel-3.496
theless-3.459
eu-3.269
asta-3.176
DX-3.140
Token Musical
Token ID33375
Feature activation-1.98e-01

Pos logit contributions
acity+5.346
 Tup+4.250
icate+3.946
icity+3.829
+3.818
Neg logit contributions
ixel-4.608
theless-4.564
eu-4.332
asta-4.250
DX-4.143
Token Theater
Token ID22207
Feature activation-4.05e-01

Pos logit contributions
acity+2.324
 Tup+1.813
+1.687
icate+1.682
icity+1.641
Neg logit contributions
ixel-2.246
theless-2.185
eu-2.060
asta-2.042
DX-2.011
Token.
Token ID13
Feature activation-2.86e-02

Pos logit contributions
acity+4.686
 Tup+3.644
+3.375
icate+3.375
icity+3.297
Neg logit contributions
ixel-4.541
theless-4.507
eu-4.145
asta-4.110
DX-4.064
Token needed
Token ID2622
Feature activation+1.28e-01

Pos logit contributions
acity+1.970
 Tup+1.574
icate+1.459
+1.454
icity+1.403
Neg logit contributions
ixel-1.752
theless-1.677
eu-1.596
asta-1.571
DX-1.546
Token.
Token ID13
Feature activation-3.94e-02

Pos logit contributions
ixel+1.374
theless+1.311
eu+1.265
asta+1.233
DX+1.212
Neg logit contributions
acity-1.582
 Tup-1.241
-1.196
icate-1.179
icity-1.129
Token Hare
Token ID42285
Feature activation+3.36e-02

Pos logit contributions
acity+0.498
 Tup+0.396
+0.374
icate+0.372
icity+0.358
Neg logit contributions
ixel-0.419
theless-0.399
eu-0.380
asta-0.375
DX-0.368
Tokenwood
Token ID3822
Feature activation-5.58e-01

Pos logit contributions
ixel+0.301
theless+0.283
eu+0.269
asta+0.263
DX+0.258
Neg logit contributions
acity-0.480
 Tup-0.394
-0.377
icate-0.373
icity-0.361
Token has
Token ID468
Feature activation-4.77e-01

Pos logit contributions
acity+6.333
 Tup+4.959
icity+4.437
icate+4.427
igon+4.347
Neg logit contributions
ixel-6.379
theless-5.855
DX-5.716
asta-5.644
eu-5.567
Token excellent
Token ID6275
Feature activation-1.66e+00

Pos logit contributions
acity+5.103
 Tup+4.069
icate+3.609
icity+3.599
ighth+3.483
Neg logit contributions
ixel-5.751
theless-5.392
DX-5.179
asta-5.165
eu-5.141
Token size
Token ID2546
Feature activation-3.71e-01

Pos logit contributions
acity+13.934
 Tup+11.907
 acute+10.234
 practice+10.030
 answer+9.966
Neg logit contributions
ixel-16.579
theless-15.503
asta-15.393
ilon-14.844
DX-14.546
Token at
Token ID379
Feature activation+3.55e-01

Pos logit contributions
acity+4.133
 Tup+3.195
icate+2.904
+2.861
icity+2.831
Neg logit contributions
ixel-4.485
theless-4.267
eu-4.026
asta-4.013
DX-3.929
Token 6
Token ID718
Feature activation-1.71e-01

Pos logit contributions
ixel+3.540
theless+3.370
eu+3.230
asta+3.150
 Patel+3.143
Neg logit contributions
acity-4.552
 Tup-3.554
icate-3.515
-3.498
icity-3.358
Token-
Token ID12
Feature activation-3.17e-01

Pos logit contributions
acity+1.990
 Tup+1.537
+1.429
icate+1.428
icity+1.380
Neg logit contributions
ixel-1.998
theless-1.919
eu-1.826
asta-1.798
DX-1.767
Tokenfoot
Token ID5898
Feature activation-1.96e-01

Pos logit contributions
acity+2.584
 Tup+1.712
+1.539
icate+1.523
ighth+1.466
Neg logit contributions
ixel-4.688
theless-4.528
eu-4.404
asta-4.346
 Phill-4.282
Token,
Token ID11
Feature activation-2.25e-01

Pos logit contributions
acity+1.559
 Tup+1.229
+1.150
icate+1.146
icity+1.107
Neg logit contributions
ixel-1.422
theless-1.360
eu-1.294
asta-1.276
DX-1.255
Token opinions
Token ID9317
Feature activation+1.76e-01

Pos logit contributions
acity+2.633
 Tup+2.064
icate+1.907
+1.895
icity+1.836
Neg logit contributions
ixel-2.603
theless-2.480
eu-2.369
asta-2.339
DX-2.308
Token about
Token ID546
Feature activation-1.99e-01

Pos logit contributions
ixel+1.861
theless+1.762
eu+1.692
asta+1.666
DX+1.639
Neg logit contributions
acity-2.216
 Tup-1.744
-1.694
icate-1.662
icity-1.592
Token his
Token ID465
Feature activation-3.32e-01

Pos logit contributions
acity+2.313
 Tup+1.823
icate+1.677
+1.666
icity+1.601
Neg logit contributions
ixel-2.303
theless-2.199
eu-2.104
asta-2.077
DX-2.048
Token off
Token ID572
Feature activation-9.85e-01

Pos logit contributions
acity+4.049
 Tup+3.224
icate+2.935
+2.892
icity+2.883
Neg logit contributions
ixel-3.613
theless-3.436
eu-3.294
asta-3.235
DX-3.182
Token-
Token ID12
Feature activation-1.47e+00

Pos logit contributions
acity+8.905
 Tup+6.248
icate+5.981
ICE+5.955
icity+5.915
Neg logit contributions
ixel-12.210
theless-11.961
asta-11.164
DX-10.967
 Patel-10.788
Tokenice
Token ID501
Feature activation-3.52e-01

Pos logit contributions
acity+11.677
ice+10.497
ICE+8.787
cast+8.543
products+8.426
Neg logit contributions
ixel-16.675
asta-15.980
theless-15.826
 bottleneck-15.466
 Patel-15.232
Token benefit
Token ID4414
Feature activation+1.45e-01

Pos logit contributions
acity+4.191
 Tup+3.320
icate+3.018
+2.974
icity+2.970
Neg logit contributions
ixel-3.919
theless-3.726
eu-3.554
asta-3.484
DX-3.459
Token for
Token ID329
Feature activation-1.57e-01

Pos logit contributions
ixel+1.568
theless+1.475
eu+1.427
asta+1.404
DX+1.370
Neg logit contributions
acity-1.793
 Tup-1.421
-1.356
icate-1.334
ighth-1.287
Token the
Token ID262
Feature activation+2.93e-02

Pos logit contributions
acity+1.846
 Tup+1.451
icate+1.350
+1.338
icity+1.288
Neg logit contributions
ixel-1.788
theless-1.711
eu-1.646
asta-1.617
DX-1.594
Token team
Token ID1074
Feature activation-1.95e-01

Pos logit contributions
ixel+0.331
theless+0.316
eu+0.303
asta+0.298
DX+0.293
Neg logit contributions
acity-0.349
 Tup-0.272
-0.259
icate-0.256
icity-0.246
Token are
Token ID389
Feature activation-4.55e-02

Pos logit contributions
ixel+3.226
theless+3.016
eu+2.917
asta+2.818
 Patel+2.712
Neg logit contributions
acity-5.003
 Tup-4.018
-3.996
icate-3.841
icity-3.762
Token going
Token ID1016
Feature activation+3.05e-01

Pos logit contributions
acity+0.575
 Tup+0.457
+0.431
icate+0.428
icity+0.413
Neg logit contributions
ixel-0.483
theless-0.460
eu-0.439
asta-0.431
DX-0.424
Token to
Token ID284
Feature activation-7.90e-02

Pos logit contributions
ixel+3.148
theless+2.958
eu+2.911
asta+2.829
 Phill+2.750
Neg logit contributions
acity-3.849
 Tup-3.034
icate-2.879
icity-2.817
-2.796
Token stay
Token ID2652
Feature activation+1.49e-02

Pos logit contributions
acity+1.032
 Tup+0.827
+0.782
icate+0.781
icity+0.751
Neg logit contributions
ixel-0.804
theless-0.768
eu-0.729
asta-0.715
DX-0.705
Token on
Token ID319
Feature activation-4.12e-01

Pos logit contributions
ixel+0.161
theless+0.154
eu+0.148
asta+0.145
DX+0.143
Neg logit contributions
acity-0.184
 Tup-0.145
-0.138
icate-0.137
icity-0.131
Token their
Token ID511
Feature activation-1.47e+00

Pos logit contributions
acity+4.342
 Tup+3.404
icate+3.167
+3.025
icity+2.970
Neg logit contributions
ixel-5.172
theless-4.794
eu-4.605
asta-4.572
DX-4.530
Token lifes
Token ID25622
Feature activation-8.36e-02

Pos logit contributions
acity+12.788
 Tup+11.155
icity+9.534
icate+9.312
iqu+9.303
Neg logit contributions
ixel-15.910
theless-14.222
eu-13.744
 mathemat-13.301
ellation-13.161
Tokenaving
Token ID2703
Feature activation-7.85e-01

Pos logit contributions
acity+1.084
 Tup+0.864
icate+0.818
+0.815
icity+0.789
Neg logit contributions
ixel-0.862
theless-0.819
eu-0.779
asta-0.766
DX-0.757
Token medications
Token ID17638
Feature activation-1.91e-01

Pos logit contributions
acity+8.147
 Tup+6.656
icate+5.756
icity+5.645
+5.432
Neg logit contributions
ixel-9.441
theless-8.418
eu-8.247
asta-8.064
DX-7.806
Token.
Token ID13
Feature activation+2.95e-01

Pos logit contributions
acity+2.212
 Tup+1.721
icate+1.612
+1.610
icity+1.541
Neg logit contributions
ixel-2.226
theless-2.127
eu-2.030
asta-1.996
DX-1.978
Token He
Token ID679
Feature activation-2.01e-02

Pos logit contributions
ixel+2.812
theless+2.722
eu+2.644
 Patel+2.575
 Phill+2.538
Neg logit contributions
acity-3.897
 Tup-3.019
-2.991
icate-2.962
icity-2.863
Token
Token ID198
Feature activation-5.83e-02

Pos logit contributions
acity+1.589
 Tup+1.245
+1.178
icate+1.167
icity+1.126
Neg logit contributions
ixel-1.473
theless-1.400
eu-1.336
asta-1.317
DX-1.299
Token
Token ID198
Feature activation-1.03e-01

Pos logit contributions
acity+0.711
 Tup+0.557
icate+0.524
+0.522
icity+0.504
Neg logit contributions
ixel-0.647
theless-0.621
eu-0.586
asta-0.579
DX-0.568
TokenMore
Token ID5167
Feature activation-1.52e-01

Pos logit contributions
acity+1.263
 Tup+0.990
+0.946
icate+0.932
icity+0.901
Neg logit contributions
ixel-1.136
theless-1.090
eu-1.024
asta-1.015
DX-0.987
Token research
Token ID2267
Feature activation+1.78e-01

Pos logit contributions
acity+1.719
 Tup+1.332
+1.237
icate+1.232
icity+1.185
Neg logit contributions
ixel-1.808
theless-1.734
eu-1.651
asta-1.629
DX-1.602
Token in
Token ID287
Feature activation+2.18e-01

Pos logit contributions
ixel+1.802
theless+1.714
eu+1.640
asta+1.629
DX+1.601
Neg logit contributions
acity-2.285
 Tup-1.812
-1.751
icate-1.722
ighth-1.659
Token ul
Token ID14856
Feature activation-1.43e+00

Pos logit contributions
ixel+2.310
theless+2.203
eu+2.131
asta+2.070
 Patel+2.061
Neg logit contributions
acity-2.721
 Tup-2.108
-2.043
icate-2.032
ighth-1.940
Tokencer
Token ID2189
Feature activation+3.54e-01

Pos logit contributions
acity+12.386
cers+9.042
 Tup+8.967
icate+8.880
iqu+8.712
Neg logit contributions
ixel-15.946
theless-15.390
 guiActive-14.777
 Blossom-14.594
 Graveyard-14.517
Token prevention
Token ID14196
Feature activation+9.91e-02

Pos logit contributions
ixel+3.581
theless+3.439
eu+3.324
asta+3.262
DX+3.224
Neg logit contributions
acity-4.430
 Tup-3.518
-3.358
icate-3.297
ighth-3.131
Token and
Token ID290
Feature activation-3.52e-01

Pos logit contributions
ixel+1.030
theless+0.982
eu+0.940
asta+0.924
DX+0.906
Neg logit contributions
acity-1.266
 Tup-1.005
-0.960
icate-0.950
ighth-0.912
Token treatment
Token ID3513
Feature activation-3.17e-01

Pos logit contributions
acity+4.177
 Tup+3.358
+3.109
icate+3.059
icity+3.025
Neg logit contributions
ixel-3.923
theless-3.680
asta-3.481
eu-3.473
DX-3.426
Token in
Token ID287
Feature activation-2.29e-01

Pos logit contributions
acity+3.797
 Tup+3.004
icate+2.799
+2.759
icity+2.731
Neg logit contributions
ixel-3.532
theless-3.355
eu-3.194
asta-3.133
DX-3.099
Token are
Token ID389
Feature activation-3.26e-02

Pos logit contributions
ixel+1.400
theless+1.315
eu+1.252
asta+1.222
DX+1.192
Neg logit contributions
acity-2.095
 Tup-1.695
-1.644
icate-1.600
icity-1.561
Token going
Token ID1016
Feature activation+7.09e-01

Pos logit contributions
acity+0.425
 Tup+0.340
+0.321
icate+0.320
icity+0.309
Neg logit contributions
ixel-0.334
theless-0.317
eu-0.302
asta-0.297
DX-0.291
Token to
Token ID284
Feature activation+1.16e-01

Pos logit contributions
ixel+6.188
eu+5.685
theless+5.682
 Phill+5.334
asta+5.272
Neg logit contributions
acity-9.293
 Tup-7.420
icity-7.092
ighth-7.036
icate-7.012
Token be
Token ID307
Feature activation+1.78e-01

Pos logit contributions
ixel+1.128
theless+1.057
eu+1.022
asta+0.998
DX+0.970
Neg logit contributions
acity-1.562
 Tup-1.253
-1.213
icate-1.182
icity-1.151
Token on
Token ID319
Feature activation-3.95e-01

Pos logit contributions
ixel+1.912
theless+1.834
eu+1.774
asta+1.722
DX+1.696
Neg logit contributions
acity-2.190
 Tup-1.697
-1.661
icate-1.615
icity-1.558
Token your
Token ID534
Feature activation-1.42e+00

Pos logit contributions
acity+4.468
 Tup+3.502
icate+3.243
+3.091
igon+3.058
Neg logit contributions
ixel-4.565
theless-4.379
asta-4.222
eu-4.155
DX-4.044
Token feet
Token ID3625
Feature activation-1.42e-01

Pos logit contributions
acity+11.705
 Tup+9.239
ighth+7.708
cast+7.447
icity+7.404
Neg logit contributions
ixel-16.335
theless-15.057
asta-14.997
eu-14.650
ellation-13.982
Token for
Token ID329
Feature activation+4.28e-01

Pos logit contributions
acity+1.687
 Tup+1.319
icate+1.232
+1.219
icity+1.189
Neg logit contributions
ixel-1.608
theless-1.538
eu-1.471
asta-1.455
DX-1.425
Token hours
Token ID2250
Feature activation-2.63e-01

Pos logit contributions
ixel+4.217
theless+4.044
eu+3.890
asta+3.735
DX+3.724
Neg logit contributions
acity-5.445
-4.319
 Tup-4.294
icate-4.102
icity-3.956
Token every
Token ID790
Feature activation-5.63e-01

Pos logit contributions
acity+3.051
 Tup+2.352
icate+2.207
+2.153
icity+2.143
Neg logit contributions
ixel-3.035
theless-2.922
eu-2.778
asta-2.754
DX-2.689
Token day
Token ID1110
Feature activation+2.18e-01

Pos logit contributions
acity+5.368
 Tup+3.915
icate+3.582
+3.460
icity+3.431
Neg logit contributions
ixel-7.318
theless-6.996
eu-6.859
asta-6.854
DX-6.519
Token Own
Token ID11744
Feature activation-8.68e-02

Pos logit contributions
acity+2.435
 Tup+1.867
icate+1.714
+1.711
icity+1.623
Neg logit contributions
ixel-2.933
theless-2.841
eu-2.697
asta-2.680
DX-2.611
Tokened
Token ID276
Feature activation-2.43e-01

Pos logit contributions
acity+1.155
 Tup+0.918
icate+0.876
+0.868
icity+0.847
Neg logit contributions
ixel-0.871
theless-0.834
eu-0.774
asta-0.767
DX-0.752
Token <
Token ID1279
Feature activation-4.52e-01

Pos logit contributions
acity+2.750
 Tup+2.113
icate+1.980
+1.941
icity+1.883
Neg logit contributions
ixel-2.918
theless-2.829
eu-2.619
asta-2.614
 Phill-2.536
Token T
Token ID309
Feature activation+2.91e-01

Pos logit contributions
acity+3.822
 Tup+2.926
icate+2.470
icity+2.288
+2.282
Neg logit contributions
ixel-6.515
theless-6.329
asta-6.012
eu-5.930
 mathemat-5.853
Token >
Token ID1875
Feature activation+8.64e-02

Pos logit contributions
ixel+2.392
theless+2.176
eu+2.167
DX+2.060
asta+2.058
Neg logit contributions
acity-4.272
 Tup-3.501
-3.418
icate-3.331
ighth-3.261
Token {
Token ID1391
Feature activation-1.40e+00

Pos logit contributions
ixel+0.755
theless+0.704
eu+0.680
asta+0.661
DX+0.648
Neg logit contributions
acity-1.247
 Tup-1.019
-0.984
icate-0.969
icity-0.943
Token...
Token ID2644
Feature activation-3.59e-01

Pos logit contributions
acity+11.105
 Tup+8.758
icate+8.355
cast+7.417
+7.357
Neg logit contributions
theless-15.749
ixel-15.439
asta-14.426
eu-14.199
 Phill-14.143
Token }
Token ID1782
Feature activation-3.40e-01

Pos logit contributions
acity+3.516
 Tup+2.648
icate+2.440
+2.306
icity+2.268
Neg logit contributions
ixel-4.729
theless-4.559
asta-4.338
eu-4.323
DX-4.274
Token
Token ID198
Feature activation+2.10e-01

Pos logit contributions
acity+4.133
 Tup+3.337
icate+3.119
+3.012
icity+2.966
Neg logit contributions
ixel-3.658
theless-3.474
asta-3.312
eu-3.280
DX-3.190
Token
Token ID198
Feature activation+7.13e-02

Pos logit contributions
ixel+2.003
eu+1.894
theless+1.864
asta+1.806
DX+1.800
Neg logit contributions
acity-2.778
 Tup-2.249
-2.216
icate-2.127
ighth-2.069
TokenThe
Token ID464
Feature activation-2.21e-01

Pos logit contributions
ixel+0.728
theless+0.687
eu+0.663
asta+0.647
DX+0.642
Neg logit contributions
acity-0.929
 Tup-0.745
-0.706
icate-0.701
icity-0.677
Token to
Token ID284
Feature activation-5.55e-02

Pos logit contributions
acity+2.982
 Tup+2.297
icate+2.190
+2.143
icity+2.117
Neg logit contributions
ixel-2.702
theless-2.603
eu-2.450
asta-2.399
DX-2.371
Token the
Token ID262
Feature activation-2.48e-01

Pos logit contributions
acity+0.669
 Tup+0.526
+0.493
icate+0.492
icity+0.473
Neg logit contributions
ixel-0.623
theless-0.593
eu-0.569
asta-0.558
DX-0.551
Token Federal
Token ID5618
Feature activation-3.55e-02

Pos logit contributions
acity+2.973
 Tup+2.360
icate+2.187
+2.146
icity+2.107
Neg logit contributions
ixel-2.796
theless-2.633
eu-2.522
asta-2.474
DX-2.452
Token Trade
Token ID9601
Feature activation-2.32e-01

Pos logit contributions
acity+0.375
 Tup+0.283
+0.263
icate+0.262
icity+0.249
Neg logit contributions
ixel-0.450
theless-0.432
eu-0.417
asta-0.410
DX-0.404
Token Commission
Token ID4513
Feature activation-8.65e-01

Pos logit contributions
acity+3.492
 Tup+2.880
icate+2.715
+2.652
icity+2.648
Neg logit contributions
ixel-1.954
theless-1.809
eu-1.651
asta-1.635
DX-1.611
Token (
Token ID357
Feature activation-1.39e+00

Pos logit contributions
acity+8.580
 Tup+6.564
icate+6.049
icity+5.734
igon+5.630
Neg logit contributions
ixel-10.078
theless-9.655
eu-9.241
 Patel-8.989
asta-8.906
TokenF
Token ID37
Feature activation-5.56e-01

Pos logit contributions
acity+10.103
UST+7.922
 Tup+7.462
ighth+7.331
ICE+7.166
Neg logit contributions
ixel-17.025
 Graveyard-15.753
 Patel-15.604
 bottleneck-15.449
 Junction-15.398
TokenTC
Token ID4825
Feature activation-5.14e-01

Pos logit contributions
acity+4.679
 Tup+3.089
icate+3.014
ighth+2.876
roth+2.828
Neg logit contributions
ixel-7.867
theless-7.845
 Graveyard-7.390
 Patel-7.237
 mathemat-7.228
Token)
Token ID8
Feature activation-8.33e-01

Pos logit contributions
acity+5.074
 Tup+3.824
icate+3.576
+3.450
icity+3.330
Neg logit contributions
ixel-6.499
theless-6.260
eu-5.910
asta-5.861
 Patel-5.783
Token by
Token ID416
Feature activation+9.61e-02

Pos logit contributions
acity+8.609
 Tup+6.715
icate+6.186
icity+5.893
igon+5.796
Neg logit contributions
ixel-9.430
theless-9.106
eu-8.553
asta-8.492
DX-8.252
Token using
Token ID1262
Feature activation-6.20e-02

Pos logit contributions
ixel+0.954
theless+0.913
eu+0.870
asta+0.852
DX+0.833
Neg logit contributions
acity-1.268
 Tup-1.011
-0.977
icate-0.960
ighth-0.926
TokenL
Token ID43
Feature activation-6.46e-01

Pos logit contributions
acity+6.664
 Tup+5.101
+4.990
Detailed+4.978
ICE+4.850
Neg logit contributions
ixel-6.608
theless-6.482
asta-5.959
 ende-5.921
 bottleneck-5.901
Tokenadd
Token ID2860
Feature activation-4.74e-01

Pos logit contributions
acity+5.187
icate+3.138
icity+2.963
igon+2.918
ighth+2.911
Neg logit contributions
theless-9.588
ixel-9.178
 mathemat-9.018
 bottleneck-8.854
 Graveyard-8.828
Token,
Token ID11
Feature activation-1.47e-01

Pos logit contributions
acity+5.539
 Tup+4.320
icate+3.914
icity+3.897
ighth+3.823
Neg logit contributions
ixel-5.387
theless-4.999
DX-4.830
asta-4.808
eu-4.755
Token who
Token ID508
Feature activation-4.39e-01

Pos logit contributions
acity+1.777
 Tup+1.404
icate+1.298
+1.292
ighth+1.253
Neg logit contributions
ixel-1.656
theless-1.576
eu-1.496
asta-1.479
DX-1.461
Token had
Token ID550
Feature activation-5.19e-01

Pos logit contributions
acity+5.082
 Tup+4.035
icate+3.585
icity+3.580
ighth+3.510
Neg logit contributions
ixel-5.040
theless-4.718
DX-4.508
eu-4.486
asta-4.428
Token 139
Token ID23666
Feature activation-1.37e+00

Pos logit contributions
acity+5.482
 Tup+4.415
icity+3.839
icate+3.824
ighth+3.800
Neg logit contributions
ixel-6.283
theless-5.896
DX-5.661
eu-5.633
asta-5.565
Token goals
Token ID4661
Feature activation-2.29e-01

Pos logit contributions
acity+13.959
 Tup+9.093
icity+8.689
ice+8.677
ighth+8.526
Neg logit contributions
ixel-14.576
theless-14.571
eu-13.594
DX-13.418
asta-13.418
Token and
Token ID290
Feature activation+1.05e-01

Pos logit contributions
acity+2.692
 Tup+2.068
icate+1.928
+1.910
icity+1.887
Neg logit contributions
ixel-2.658
theless-2.528
eu-2.391
asta-2.379
DX-2.354
Token 166
Token ID26753
Feature activation-7.64e-01

Pos logit contributions
ixel+1.049
theless+1.000
eu+0.960
asta+0.939
DX+0.918
Neg logit contributions
acity-1.387
 Tup-1.106
-1.070
icate-1.054
icity-1.013
Token assists
Token ID13648
Feature activation-3.28e-01

Pos logit contributions
acity+8.914
icity+6.115
 Tup+6.112
icate+5.852
+5.832
Neg logit contributions
ixel-8.171
eu-7.861
theless-7.860
asta-7.774
DX-7.602
Token for
Token ID329
Feature activation+3.19e-01

Pos logit contributions
acity+3.662
 Tup+2.814
icate+2.579
icity+2.523
+2.515
Neg logit contributions
ixel-3.963
theless-3.759
asta-3.589
eu-3.566
DX-3.551
Token.
Token ID13
Feature activation+1.46e-01

Pos logit contributions
ixel+2.603
theless+2.457
eu+2.403
asta+2.364
DX+2.297
Neg logit contributions
acity-3.140
 Tup-2.497
-2.435
icate-2.353
ighth-2.271
Token We
Token ID775
Feature activation+1.72e-01

Pos logit contributions
ixel+1.427
theless+1.356
eu+1.330
 Patel+1.265
asta+1.265
Neg logit contributions
acity-1.921
 Tup-1.528
-1.493
icate-1.459
ighth-1.414
Token all
Token ID477
Feature activation+8.93e-02

Pos logit contributions
ixel+1.729
theless+1.629
eu+1.566
asta+1.541
DX+1.487
Neg logit contributions
acity-2.266
 Tup-1.817
-1.744
icate-1.701
icity-1.664
Token turn
Token ID1210
Feature activation+5.97e-02

Pos logit contributions
ixel+0.876
theless+0.829
eu+0.791
asta+0.777
DX+0.755
Neg logit contributions
acity-1.197
 Tup-0.962
-0.926
icate-0.906
icity-0.882
Token a
Token ID257
Feature activation+3.34e-02

Pos logit contributions
ixel+0.666
theless+0.633
eu+0.610
asta+0.600
DX+0.588
Neg logit contributions
acity-0.718
 Tup-0.562
-0.534
icate-0.527
icity-0.509
Token blind
Token ID7770
Feature activation-1.33e+00

Pos logit contributions
ixel+0.346
theless+0.329
eu+0.315
asta+0.309
DX+0.303
Neg logit contributions
acity-0.430
 Tup-0.342
-0.325
icate-0.323
icity-0.312
Token eye
Token ID4151
Feature activation-2.86e-01

Pos logit contributions
acity+11.622
icate+8.221
 Tup+7.835
ice+7.442
er+7.350
Neg logit contributions
ixel-15.835
theless-15.164
asta-14.269
DX-13.952
 mathemat-13.633
Token to
Token ID284
Feature activation-1.56e-01

Pos logit contributions
acity+3.192
 Tup+2.434
icate+2.281
+2.250
icity+2.163
Neg logit contributions
ixel-3.415
theless-3.332
eu-3.161
asta-3.099
DX-3.095
Token the
Token ID262
Feature activation-2.15e-01

Pos logit contributions
acity+1.885
 Tup+1.490
icate+1.391
+1.366
icity+1.339
Neg logit contributions
ixel-1.740
theless-1.664
eu-1.587
asta-1.555
DX-1.548
Token ways
Token ID2842
Feature activation-8.62e-02

Pos logit contributions
acity+2.571
 Tup+2.023
icate+1.878
+1.863
icity+1.832
Neg logit contributions
ixel-2.413
theless-2.319
eu-2.203
asta-2.158
DX-2.154
Token that
Token ID326
Feature activation-2.08e-01

Pos logit contributions
acity+1.035
 Tup+0.807
icate+0.757
+0.753
icity+0.731
Neg logit contributions
ixel-0.973
theless-0.931
eu-0.888
asta-0.873
DX-0.865
Tokens
Token ID82
Feature activation-4.16e-02

Pos logit contributions
ixel+1.792
theless+1.732
eu+1.661
asta+1.620
DX+1.592
Neg logit contributions
acity-2.239
 Tup-1.784
-1.723
icate-1.687
ighth-1.624
Token caught
Token ID4978
Feature activation-1.68e-01

Pos logit contributions
acity+0.520
 Tup+0.412
+0.388
icate+0.386
icity+0.373
Neg logit contributions
ixel-0.449
theless-0.427
eu-0.408
asta-0.401
DX-0.395
Token the
Token ID262
Feature activation+1.18e-01

Pos logit contributions
acity+1.963
 Tup+1.528
+1.428
icate+1.425
icity+1.364
Neg logit contributions
ixel-1.952
theless-1.860
eu-1.768
asta-1.742
DX-1.720
Token White
Token ID2635
Feature activation+1.98e-01

Pos logit contributions
ixel+1.254
theless+1.197
eu+1.156
asta+1.133
DX+1.108
Neg logit contributions
acity-1.472
 Tup-1.159
-1.107
icate-1.100
icity-1.053
Token House
Token ID2097
Feature activation-4.57e-01

Pos logit contributions
ixel+1.628
theless+1.465
eu+1.451
asta+1.398
DX+1.390
Neg logit contributions
acity-2.919
 Tup-2.418
-2.363
icate-2.326
icity-2.255
Token off
Token ID572
Feature activation-1.33e+00

Pos logit contributions
acity+5.191
 Tup+4.038
icate+3.796
+3.771
icity+3.639
Neg logit contributions
ixel-5.247
theless-5.068
eu-4.751
asta-4.671
DX-4.651
Token guard
Token ID4860
Feature activation-2.52e-01

Pos logit contributions
acity+9.792
iqu+7.194
 Tup+6.792
roth+6.369
icity+6.367
Neg logit contributions
theless-16.789
ixel-16.529
DX-15.531
eu-14.784
asta-14.737
Token here
Token ID994
Feature activation-7.45e-02

Pos logit contributions
acity+2.919
 Tup+2.279
+2.136
icate+2.131
icity+2.054
Neg logit contributions
ixel-2.892
theless-2.759
eu-2.630
asta-2.589
DX-2.560
Token,
Token ID11
Feature activation-1.63e-01

Pos logit contributions
acity+0.914
 Tup+0.719
+0.678
icate+0.675
icity+0.650
Neg logit contributions
ixel-0.817
theless-0.780
eu-0.745
asta-0.731
DX-0.721
Token they
Token ID484
Feature activation-2.64e-01

Pos logit contributions
acity+1.973
 Tup+1.559
+1.458
icate+1.453
icity+1.404
Neg logit contributions
ixel-1.808
theless-1.730
eu-1.645
asta-1.611
DX-1.600
Token were
Token ID547
Feature activation-2.67e-01

Pos logit contributions
acity+3.240
 Tup+2.558
+2.415
icate+2.415
icity+2.316
Neg logit contributions
ixel-2.823
theless-2.694
eu-2.571
DX-2.528
asta-2.527
TokenHundreds
Token ID38150
Feature activation-1.53e-01

Pos logit contributions
ixel+0.256
theless+0.243
eu+0.233
asta+0.228
DX+0.225
Neg logit contributions
acity-0.319
 Tup-0.254
-0.241
icate-0.240
icity-0.231
Token of
Token ID286
Feature activation-5.55e-01

Pos logit contributions
acity+1.846
 Tup+1.457
icate+1.359
+1.359
icity+1.314
Neg logit contributions
ixel-1.696
theless-1.611
eu-1.533
asta-1.520
DX-1.506
Token people
Token ID661
Feature activation+1.15e-01

Pos logit contributions
acity+6.333
 Tup+5.150
icate+4.572
icity+4.390
+4.368
Neg logit contributions
ixel-6.279
theless-5.944
eu-5.701
asta-5.677
DX-5.597
Token stood
Token ID6204
Feature activation+2.54e-01

Pos logit contributions
ixel+1.122
theless+1.071
eu+1.023
asta+0.997
DX+0.969
Neg logit contributions
acity-1.552
 Tup-1.239
-1.204
icate-1.181
icity-1.139
Token on
Token ID319
Feature activation-5.48e-01

Pos logit contributions
ixel+2.444
theless+2.337
eu+2.280
asta+2.233
 Patel+2.195
Neg logit contributions
acity-3.348
 Tup-2.642
-2.615
icate-2.546
ighth-2.440
Token their
Token ID511
Feature activation-1.32e+00

Pos logit contributions
acity+6.005
 Tup+5.009
icate+4.505
+4.393
ighth+4.300
Neg logit contributions
ixel-6.147
theless-6.003
eu-5.695
asta-5.615
DX-5.502
Token toes
Token ID23932
Feature activation-3.79e-03

Pos logit contributions
acity+11.945
 Tup+10.582
 Haz+8.670
ighth+8.595
 Cah+8.504
Neg logit contributions
ixel-14.049
theless-13.190
eu-12.429
DX-12.335
asta-12.087
Token and
Token ID290
Feature activation-9.19e-03

Pos logit contributions
acity+0.047
 Tup+0.037
+0.035
icate+0.035
icity+0.033
Neg logit contributions
ixel-0.041
theless-0.039
eu-0.038
asta-0.037
DX-0.036
Token perched
Token ID49264
Feature activation-3.25e-01

Pos logit contributions
acity+0.111
 Tup+0.087
+0.082
icate+0.082
icity+0.079
Neg logit contributions
ixel-0.102
theless-0.098
eu-0.094
asta-0.092
DX-0.090
Token children
Token ID1751
Feature activation-6.12e-02

Pos logit contributions
acity+3.627
 Tup+2.908
icate+2.617
+2.506
icity+2.499
Neg logit contributions
ixel-3.892
theless-3.724
eu-3.513
DX-3.456
asta-3.426
Token on
Token ID319
Feature activation-6.50e-01

Pos logit contributions
acity+0.745
 Tup+0.587
icate+0.550
+0.550
icity+0.530
Neg logit contributions
ixel-0.679
theless-0.646
eu-0.618
asta-0.607
DX-0.600
Token quite
Token ID2407
Feature activation-2.26e-02

Pos logit contributions
acity+4.486
 Tup+3.561
icate+3.224
icity+3.068
+3.060
Neg logit contributions
ixel-4.587
theless-4.345
eu-4.185
asta-4.099
DX-4.033
Token quickly
Token ID2952
Feature activation-1.45e-02

Pos logit contributions
acity+0.283
 Tup+0.224
+0.212
icate+0.211
icity+0.203
Neg logit contributions
ixel-0.242
theless-0.231
eu-0.221
asta-0.217
DX-0.213
Token but
Token ID475
Feature activation-1.86e-01

Pos logit contributions
acity+0.179
 Tup+0.141
+0.134
icate+0.133
icity+0.128
Neg logit contributions
ixel-0.157
theless-0.150
eu-0.143
asta-0.141
DX-0.139
Token to
Token ID284
Feature activation-2.92e-01

Pos logit contributions
acity+2.276
 Tup+1.811
icate+1.674
+1.654
icity+1.614
Neg logit contributions
ixel-2.030
theless-1.936
eu-1.846
asta-1.821
DX-1.792
Token have
Token ID423
Feature activation-6.29e-01

Pos logit contributions
acity+3.669
 Tup+2.924
icate+2.749
+2.691
icity+2.636
Neg logit contributions
ixel-3.056
theless-2.967
eu-2.747
asta-2.712
DX-2.696
Token that
Token ID326
Feature activation-1.31e+00

Pos logit contributions
acity+6.751
 Tup+5.261
icate+4.554
icity+4.511
ighth+4.479
Neg logit contributions
ixel-7.490
theless-6.991
DX-6.693
eu-6.656
asta-6.640
Token extra
Token ID3131
Feature activation-5.79e-01

Pos logit contributions
acity+13.422
 Tup+10.597
icity+9.650
 acute+9.229
 Cah+9.083
Neg logit contributions
ixel-13.151
theless-11.861
DX-11.409
覚醒-11.179
asta-11.111
Token racing
Token ID11717
Feature activation-6.28e-02

Pos logit contributions
acity+6.807
 Tup+5.193
icity+4.702
icate+4.697
ighth+4.571
Neg logit contributions
ixel-6.461
theless-6.255
asta-5.737
DX-5.734
eu-5.712
Token skill
Token ID5032
Feature activation-1.79e-01

Pos logit contributions
acity+0.792
 Tup+0.626
icate+0.589
+0.588
icity+0.569
Neg logit contributions
ixel-0.673
theless-0.640
eu-0.610
asta-0.600
DX-0.590
Token is
Token ID318
Feature activation-8.26e-02

Pos logit contributions
acity+2.187
 Tup+1.707
icate+1.587
+1.560
icity+1.547
Neg logit contributions
ixel-1.995
theless-1.910
eu-1.808
asta-1.789
DX-1.762
Token a
Token ID257
Feature activation+1.03e-01

Pos logit contributions
acity+1.015
 Tup+0.799
icate+0.745
+0.743
icity+0.718
Neg logit contributions
ixel-0.913
theless-0.868
eu-0.826
asta-0.814
DX-0.801
Token and
Token ID290
Feature activation-1.06e-01

Pos logit contributions
acity+3.720
 Tup+2.934
icate+2.762
+2.735
icity+2.677
Neg logit contributions
ixel-3.586
theless-3.394
eu-3.245
asta-3.182
DX-3.168
Token a
Token ID257
Feature activation-1.66e-01

Pos logit contributions
acity+1.296
 Tup+1.024
+0.959
icate+0.957
icity+0.923
Neg logit contributions
ixel-1.162
theless-1.103
eu-1.059
asta-1.041
DX-1.028
Token total
Token ID2472
Feature activation-1.69e-01

Pos logit contributions
acity+1.928
 Tup+1.510
+1.401
icate+1.399
icity+1.357
Neg logit contributions
ixel-1.932
theless-1.831
eu-1.758
asta-1.730
DX-1.709
Token retreat
Token ID13703
Feature activation+4.43e-02

Pos logit contributions
acity+1.946
 Tup+1.497
icate+1.404
+1.398
icity+1.369
Neg logit contributions
ixel-2.003
theless-1.923
eu-1.822
asta-1.789
DX-1.786
Token of
Token ID286
Feature activation-1.28e-01

Pos logit contributions
ixel+0.493
theless+0.470
eu+0.452
asta+0.443
DX+0.435
Neg logit contributions
acity-0.535
 Tup-0.420
-0.399
icate-0.394
icity-0.378
Token Hel
Token ID5053
Feature activation-1.30e+00

Pos logit contributions
acity+1.541
 Tup+1.216
icate+1.136
+1.127
icity+1.094
Neg logit contributions
ixel-1.429
theless-1.358
eu-1.298
asta-1.273
DX-1.261
Tokenheim
Token ID9096
Feature activation-1.76e-01

Pos logit contributions
acity+14.076
icate+11.029
icity+10.777
ice+10.358
 Tup+10.355
Neg logit contributions
theless-11.991
ixel-11.683
DX-11.055
 Patel-10.981
 bottleneck-10.816
Token Glacier
Token ID49120
Feature activation+8.88e-02

Pos logit contributions
acity+2.126
 Tup+1.667
icate+1.577
+1.566
icity+1.520
Neg logit contributions
ixel-1.981
theless-1.884
eu-1.781
asta-1.766
DX-1.733
Token by
Token ID416
Feature activation+6.84e-02

Pos logit contributions
ixel+0.956
theless+0.908
eu+0.875
asta+0.858
DX+0.846
Neg logit contributions
acity-1.103
 Tup-0.867
-0.830
icate-0.817
icity-0.785
Token 1
Token ID352
Feature activation-2.73e-02

Pos logit contributions
ixel+0.746
theless+0.713
eu+0.684
asta+0.672
DX+0.659
Neg logit contributions
acity-0.840
 Tup-0.657
-0.628
icate-0.619
icity-0.596
Token.
Token ID13
Feature activation+1.50e-01

Pos logit contributions
acity+0.331
 Tup+0.259
+0.245
icate+0.243
icity+0.234
Neg logit contributions
ixel-0.305
theless-0.291
eu-0.279
asta-0.274
DX-0.270
Token the
Token ID262
Feature activation-3.03e-01

Pos logit contributions
acity+2.752
 Tup+2.175
icate+1.992
+1.963
icity+1.908
Neg logit contributions
ixel-2.822
theless-2.718
eu-2.566
asta-2.542
DX-2.493
Token building
Token ID2615
Feature activation-8.32e-02

Pos logit contributions
acity+3.163
 Tup+2.416
icate+2.204
+2.123
icity+2.100
Neg logit contributions
ixel-3.839
theless-3.703
eu-3.542
asta-3.513
DX-3.441
Token,
Token ID11
Feature activation-3.43e-01

Pos logit contributions
acity+0.999
 Tup+0.782
icate+0.732
+0.726
icity+0.702
Neg logit contributions
ixel-0.940
theless-0.896
eu-0.858
asta-0.846
DX-0.830
Token only
Token ID691
Feature activation-2.59e-01

Pos logit contributions
acity+3.786
 Tup+2.948
icate+2.738
+2.651
icity+2.585
Neg logit contributions
ixel-4.119
theless-3.971
eu-3.765
asta-3.749
DX-3.638
Token on
Token ID319
Feature activation-1.00e+00

Pos logit contributions
acity+2.939
 Tup+2.302
icate+2.124
+2.091
icity+2.012
Neg logit contributions
ixel-3.087
theless-2.932
eu-2.805
asta-2.777
DX-2.706
Token the
Token ID262
Feature activation-1.29e+00

Pos logit contributions
acity+9.505
 Tup+7.487
icate+7.079
ighth+6.632
ice+6.347
Neg logit contributions
ixel-11.690
theless-11.347
asta-11.170
eu-10.897
DX-10.328
Token other
Token ID584
Feature activation-6.76e-01

Pos logit contributions
acity+11.914
 Tup+9.837
ice+8.816
ighth+8.641
 practice+8.632
Neg logit contributions
ixel-13.925
theless-12.983
asta-12.822
eu-12.584
ellation-12.204
Token side
Token ID1735
Feature activation-1.25e-01

Pos logit contributions
acity+6.185
 Tup+4.441
icity+4.039
icate+4.008
ighth+3.865
Neg logit contributions
ixel-8.863
theless-8.579
eu-8.280
asta-8.262
DX-7.992
Token,
Token ID11
Feature activation-3.93e-01

Pos logit contributions
acity+1.450
 Tup+1.123
icate+1.049
+1.044
icity+1.004
Neg logit contributions
ixel-1.458
theless-1.397
eu-1.337
asta-1.321
DX-1.298
Token so
Token ID523
Feature activation-2.07e-01

Pos logit contributions
acity+4.384
 Tup+3.462
icate+3.157
icity+3.023
+3.008
Neg logit contributions
ixel-4.695
theless-4.457
eu-4.221
asta-4.221
DX-4.105
Token it
Token ID340
Feature activation+2.37e-01

Pos logit contributions
acity+2.388
 Tup+1.864
icate+1.734
+1.695
icity+1.653
Neg logit contributions
ixel-2.423
theless-2.321
eu-2.206
asta-2.180
DX-2.137
Token have
Token ID423
Feature activation-4.51e-01

Pos logit contributions
acity+2.525
 Tup+1.997
icate+1.873
+1.834
icity+1.797
Neg logit contributions
ixel-2.257
theless-2.174
eu-2.072
asta-2.024
DX-2.024
Token the
Token ID262
Feature activation-1.02e+00

Pos logit contributions
acity+4.976
 Tup+3.943
icate+3.480
+3.469
icity+3.381
Neg logit contributions
ixel-5.351
theless-5.007
eu-4.804
asta-4.755
DX-4.746
Token opportunity
Token ID3663
Feature activation-3.60e-01

Pos logit contributions
acity+9.835
 Tup+7.403
icity+6.398
+6.351
ighth+6.125
Neg logit contributions
ixel-12.264
theless-11.618
DX-10.901
asta-10.766
eu-10.748
Token to
Token ID284
Feature activation-2.85e-01

Pos logit contributions
acity+3.892
 Tup+2.936
+2.801
icate+2.738
icity+2.599
Neg logit contributions
ixel-4.376
theless-4.247
eu-4.054
DX-3.992
asta-3.992
Token pursue
Token ID10660
Feature activation-7.25e-01

Pos logit contributions
acity+3.415
 Tup+2.706
icate+2.548
+2.452
icity+2.414
Neg logit contributions
ixel-3.095
theless-3.025
eu-2.874
DX-2.802
asta-2.798
Token their
Token ID511
Feature activation-1.28e+00

Pos logit contributions
acity+7.281
 Tup+5.622
icate+4.908
icity+4.717
igon+4.675
Neg logit contributions
ixel-8.714
theless-8.356
eu-8.116
DX-7.946
asta-7.884
Token dream
Token ID4320
Feature activation+3.27e-02

Pos logit contributions
acity+11.953
 Tup+9.092
icate+7.862
icity+7.861
 practice+7.648
Neg logit contributions
ixel-14.408
theless-13.488
eu-12.629
-12.435
DX-12.359
Token un
Token ID555
Feature activation-4.53e-01

Pos logit contributions
ixel+0.366
theless+0.350
eu+0.335
asta+0.330
DX+0.324
Neg logit contributions
acity-0.393
 Tup-0.308
-0.292
icate-0.289
icity-0.277
Tokenapolog
Token ID46407
Feature activation+1.12e-01

Pos logit contributions
acity+5.101
 Tup+3.811
icate+3.691
ighth+3.643
icity+3.564
Neg logit contributions
ixel-5.267
theless-5.077
DX-4.803
asta-4.764
 Phill-4.744
Tokenetically
Token ID16877
Feature activation-2.72e-01

Pos logit contributions
ixel+0.510
theless+0.445
eu+0.415
asta+0.385
DX+0.361
Neg logit contributions
acity-2.077
 Tup-1.800
-1.740
icate-1.719
ighth-1.683
Token,
Token ID11
Feature activation+6.59e-02

Pos logit contributions
acity+3.146
 Tup+2.430
+2.280
icate+2.273
icity+2.178
Neg logit contributions
ixel-3.147
theless-3.012
eu-2.865
asta-2.856
DX-2.804
Token core
Token ID4755
Feature activation+4.27e-02

Pos logit contributions
acity+1.446
 Tup+1.119
icate+1.043
+1.034
icity+1.003
Neg logit contributions
ixel-1.480
theless-1.417
eu-1.342
asta-1.325
DX-1.304
Token.
Token ID13
Feature activation-1.40e-01

Pos logit contributions
ixel+0.475
theless+0.453
eu+0.436
asta+0.428
DX+0.421
Neg logit contributions
acity-0.516
 Tup-0.405
-0.385
icate-0.380
icity-0.364
Token Punk
Token ID32592
Feature activation+2.55e-01

Pos logit contributions
acity+1.798
 Tup+1.465
icate+1.351
+1.343
icity+1.308
Neg logit contributions
ixel-1.449
theless-1.372
eu-1.289
asta-1.280
DX-1.248
Token became
Token ID2627
Feature activation-4.84e-02

Pos logit contributions
ixel+2.586
theless+2.476
eu+2.385
asta+2.348
DX+2.333
Neg logit contributions
acity-3.241
 Tup-2.576
-2.545
icate-2.473
ighth-2.356
Token "
Token ID366
Feature activation-1.46e-01

Pos logit contributions
acity+0.591
 Tup+0.466
+0.438
icate+0.436
icity+0.421
Neg logit contributions
ixel-0.534
theless-0.508
eu-0.487
asta-0.478
DX-0.471
Tokenhard
Token ID10424
Feature activation-1.27e+00

Pos logit contributions
acity+1.858
 Tup+1.475
+1.394
icate+1.390
icity+1.340
Neg logit contributions
ixel-1.540
theless-1.460
eu-1.375
asta-1.361
DX-1.328
Tokencore
Token ID7295
Feature activation-4.19e-01

Pos logit contributions
acity+11.665
 Tup+9.343
cast+8.554
er+8.110
icity+8.018
Neg logit contributions
ixel-13.224
theless-12.445
 Phill-12.027
 Patel-11.778
orp-11.699
Token"
Token ID1
Feature activation-3.33e-01

Pos logit contributions
acity+4.826
 Tup+3.840
icate+3.510
icity+3.422
+3.395
Neg logit contributions
ixel-4.750
theless-4.502
eu-4.232
asta-4.128
 Phill-4.127
Token and
Token ID290
Feature activation+5.33e-02

Pos logit contributions
acity+3.855
 Tup+3.105
icate+2.797
icity+2.737
+2.677
Neg logit contributions
ixel-3.822
theless-3.616
eu-3.425
asta-3.334
DX-3.288
Token more
Token ID517
Feature activation-3.63e-01

Pos logit contributions
ixel+0.569
theless+0.544
eu+0.521
asta+0.512
DX+0.502
Neg logit contributions
acity-0.669
 Tup-0.525
-0.505
icate-0.499
icity-0.479
Token masculine
Token ID27543
Feature activation-3.42e-01

Pos logit contributions
acity+4.317
 Tup+3.523
icate+3.177
icity+3.083
+3.065
Neg logit contributions
ixel-3.991
theless-3.778
eu-3.572
asta-3.501
DX-3.461
Token that
Token ID326
Feature activation+1.72e-01

Pos logit contributions
ixel+2.376
theless+2.297
eu+2.156
asta+2.125
 Patel+2.102
Neg logit contributions
acity-2.709
 Tup-2.109
-2.048
icate-2.002
icity-1.943
Token experts
Token ID6154
Feature activation+4.80e-01

Pos logit contributions
ixel+1.779
theless+1.718
eu+1.627
asta+1.587
 Patel+1.565
Neg logit contributions
acity-2.202
 Tup-1.736
-1.678
icate-1.655
icity-1.606
Token,
Token ID11
Feature activation-1.20e-01

Pos logit contributions
ixel+4.908
theless+4.570
eu+4.547
 Patel+4.335
asta+4.266
Neg logit contributions
acity-6.029
 Tup-4.610
-4.593
icate-4.402
icity-4.308
Token advocates
Token ID11009
Feature activation+1.23e-01

Pos logit contributions
acity+1.401
 Tup+1.092
+1.024
icate+1.016
icity+0.976
Neg logit contributions
ixel-1.376
theless-1.319
eu-1.261
asta-1.248
DX-1.223
Token and
Token ID290
Feature activation-3.32e-01

Pos logit contributions
ixel+1.304
theless+1.230
eu+1.184
asta+1.154
DX+1.133
Neg logit contributions
acity-1.565
 Tup-1.236
-1.174
icate-1.168
icity-1.127
Token aud
Token ID2709
Feature activation-1.26e+00

Pos logit contributions
acity+3.846
 Tup+3.033
+2.794
icate+2.750
icity+2.670
Neg logit contributions
ixel-3.799
theless-3.612
eu-3.453
asta-3.448
DX-3.354
Tokenitors
Token ID6742
Feature activation-2.23e-01

Pos logit contributions
acity+10.105
iov+5.612
acious+5.497
icate+5.488
iqu+5.259
Neg logit contributions
theless-18.113
ixel-17.550
asta-16.426
DX-16.324
 Phill-16.081
Token have
Token ID423
Feature activation+7.64e-02

Pos logit contributions
acity+2.639
 Tup+2.090
icate+1.950
+1.920
icity+1.862
Neg logit contributions
ixel-2.507
theless-2.401
eu-2.275
asta-2.275
DX-2.263
Token all
Token ID477
Feature activation+3.32e-01

Pos logit contributions
ixel+0.809
theless+0.773
eu+0.742
asta+0.724
DX+0.707
Neg logit contributions
acity-0.965
 Tup-0.763
-0.734
icate-0.721
icity-0.694
Token suggested
Token ID5220
Feature activation-9.75e-02

Pos logit contributions
ixel+3.227
theless+3.179
eu+3.050
asta+2.903
 Patel+2.830
Neg logit contributions
acity-4.325
 Tup-3.451
-3.381
icate-3.268
icity-3.148
Token.
Token ID13
Feature activation+2.09e-01

Pos logit contributions
acity+1.208
 Tup+0.956
+0.904
icate+0.897
icity+0.864
Neg logit contributions
ixel-1.056
theless-1.011
eu-0.961
asta-0.948
DX-0.933
Token
Token ID247
Feature activation+4.53e-02

Pos logit contributions
acity+0.917
 Tup+0.729
+0.691
icate+0.687
icity+0.660
Neg logit contributions
ixel-0.769
theless-0.734
eu-0.699
asta-0.688
DX-0.676
Tokent
Token ID83
Feature activation+2.23e-01

Pos logit contributions
ixel+0.535
theless+0.514
eu+0.495
asta+0.485
DX+0.477
Neg logit contributions
acity-0.517
 Tup-0.398
-0.373
icate-0.371
icity-0.356
Token fresh
Token ID4713
Feature activation-2.25e-01

Pos logit contributions
ixel+2.254
theless+2.189
eu+2.113
asta+2.049
 Patel+2.010
Neg logit contributions
acity-2.849
 Tup-2.231
-2.160
icate-2.110
icity-2.058
Token
Token ID198
Feature activation-4.72e-03

Pos logit contributions
acity+2.644
 Tup+2.049
+1.927
icate+1.920
icity+1.854
Neg logit contributions
ixel-2.597
theless-2.475
eu-2.347
asta-2.314
DX-2.283
Token
Token ID198
Feature activation+2.13e-01

Pos logit contributions
acity+0.059
 Tup+0.046
+0.044
icate+0.044
icity+0.042
Neg logit contributions
ixel-0.051
theless-0.049
eu-0.046
asta-0.046
DX-0.045
TokenWil
Token ID22327
Feature activation-1.26e+00

Pos logit contributions
ixel+2.195
theless+2.073
eu+2.037
asta+1.972
DX+1.953
Neg logit contributions
acity-2.718
 Tup-2.178
icate-2.039
-2.019
icity-1.960
Token Chung
Token ID43915
Feature activation+1.67e-01

Pos logit contributions
acity+13.795
 Tup+10.612
icate+10.497
icity+10.240
cast+10.072
Neg logit contributions
ixel-12.209
eu-10.982
DX-10.866
theless-10.737
 mathemat-10.091
Token Bl
Token ID1086
Feature activation-3.54e-01

Pos logit contributions
ixel+1.777
theless+1.693
eu+1.635
asta+1.598
DX+1.566
Neg logit contributions
acity-2.081
 Tup-1.635
-1.555
icate-1.545
icity-1.485
Tokenocked
Token ID3543
Feature activation+4.82e-02

Pos logit contributions
acity+4.438
 Tup+3.458
icate+3.334
icity+3.226
ighth+3.200
Neg logit contributions
ixel-3.657
theless-3.656
eu-3.366
DX-3.314
 Phill-3.241
Token Un
Token ID791
Feature activation-8.16e-01

Pos logit contributions
ixel+0.517
theless+0.493
eu+0.473
asta+0.464
DX+0.455
Neg logit contributions
acity-0.603
 Tup-0.475
-0.451
icate-0.447
icity-0.432
Tokenblock
Token ID9967
Feature activation-8.11e-02

Pos logit contributions
acity+9.830
icity+7.577
 Tup+7.544
icate+7.442
ilateral+7.323
Neg logit contributions
ixel-8.049
theless-7.842
DX-7.038
 Phill-6.924
 bottleneck-6.907
Tokenhews
Token ID40645
Feature activation-5.50e-01

Pos logit contributions
acity+5.041
 Tup+3.398
icate+3.273
icity+3.142
igon+3.079
Neg logit contributions
ixel-7.051
theless-6.952
DX-6.628
eu-6.490
 bottleneck-6.395
Token,
Token ID837
Feature activation-1.20e+00

Pos logit contributions
acity+5.231
 Tup+3.915
icate+3.578
+3.396
icity+3.370
Neg logit contributions
ixel-7.131
theless-7.037
eu-6.655
asta-6.551
DX-6.445
Token Brazil
Token ID7595
Feature activation-3.57e-01

Pos logit contributions
acity+12.789
 Tup+11.135
icate+9.553
 Mess+9.491
 Cah+9.446
Neg logit contributions
ixel-11.563
theless-11.523
asta-10.387
eu-10.217
DX-10.078
Token N
Token ID399
Feature activation-4.41e-01

Pos logit contributions
acity+4.019
 Tup+3.241
icate+2.933
+2.841
icity+2.787
Neg logit contributions
ixel-4.179
theless-4.027
eu-3.806
DX-3.720
asta-3.697
Tokenuts
Token ID5500
Feature activation-3.33e-01

Pos logit contributions
acity+4.591
 Tup+3.433
icate+3.243
ighth+3.197
+3.124
Neg logit contributions
theless-5.520
ixel-5.460
DX-5.021
eu-4.992
 Graveyard-4.906
Token,
Token ID837
Feature activation-1.26e+00

Pos logit contributions
acity+3.033
 Tup+2.174
icate+1.986
+1.956
icity+1.869
Neg logit contributions
ixel-4.633
theless-4.514
eu-4.318
asta-4.253
DX-4.205
Token Pep
Token ID24226
Feature activation-3.89e-01

Pos logit contributions
acity+13.238
 Tup+11.597
icate+9.962
 Cah+9.931
 Mess+9.909
Neg logit contributions
ixel-11.960
theless-11.830
asta-10.713
eu-10.561
DX-10.393
Tokenitas
Token ID21416
Feature activation-4.05e-01

Pos logit contributions
acity+4.401
 Tup+3.346
icate+3.186
+2.984
icity+2.978
Neg logit contributions
ixel-4.625
theless-4.411
eu-4.238
DX-4.126
asta-4.045
Token (
Token ID357
Feature activation-6.33e-01

Pos logit contributions
acity+4.007
 Tup+3.021
icate+2.743
+2.658
icity+2.606
Neg logit contributions
ixel-5.276
theless-5.152
eu-4.839
asta-4.788
DX-4.753
Tokenh
Token ID71
Feature activation-5.56e-01

Pos logit contributions
acity+7.132
 Tup+5.609
+5.344
icate+5.204
Cod+5.151
Neg logit contributions
ixel-6.925
theless-6.780
asta-6.192
eu-6.110
 bottleneck-6.020
Tokenull
Token ID724
Feature activation-2.66e-01

Pos logit contributions
acity+5.777
 Tup+4.327
icate+4.158
ighth+4.093
icity+4.029
Neg logit contributions
ixel-6.731
theless-6.674
eu-6.209
DX-6.062
 Patel-5.998
Token10
Token ID940
Feature activation+5.28e-01

Pos logit contributions
ixel+2.642
eu+2.542
DX+2.410
theless+2.395
asta+2.388
Neg logit contributions
acity-3.938
 Tup-3.311
-3.145
icate-3.096
ighth-2.977
Token:
Token ID25
Feature activation+3.67e-01

Pos logit contributions
ixel+3.894
eu+3.856
DX+3.586
asta+3.558
 Patel+3.425
Neg logit contributions
acity-7.547
-6.494
 Tup-6.339
icate-6.058
ighth-6.020
Token47
Token ID2857
Feature activation+2.90e-01

Pos logit contributions
ixel+2.775
eu+2.595
DX+2.534
theless+2.446
asta+2.433
Neg logit contributions
acity-5.525
 Tup-4.622
-4.524
icate-4.407
icity-4.283
Token]
Token ID60
Feature activation+2.49e-01

Pos logit contributions
ixel+2.788
eu+2.668
theless+2.554
DX+2.531
asta+2.529
Neg logit contributions
acity-3.773
 Tup-3.049
-3.032
icate-2.872
ighth-2.795
Token <
Token ID1279
Feature activation+3.53e-01

Pos logit contributions
ixel+2.521
theless+2.367
eu+2.334
asta+2.253
DX+2.224
Neg logit contributions
acity-3.233
 Tup-2.546
-2.467
icate-2.426
icity-2.347
Tokenk
Token ID74
Feature activation+8.63e-01

Pos logit contributions
ixel+4.166
theless+3.962
eu+3.918
asta+3.816
DX+3.744
Neg logit contributions
acity-3.878
 Tup-3.049
-2.838
icate-2.787
icity-2.665
Tokenasse
Token ID21612
Feature activation-1.38e-01

Pos logit contributions
ixel+8.649
eu+7.918
theless+7.697
asta+7.694
DX+7.438
Neg logit contributions
acity-9.797
 Tup-8.404
-8.040
icate-7.319
 Cah-7.265
Tokenmm
Token ID3020
Feature activation+2.30e-01

Pos logit contributions
acity+1.595
 Tup+1.219
+1.153
icate+1.152
ighth+1.109
Neg logit contributions
ixel-1.606
theless-1.537
eu-1.467
asta-1.450
DX-1.438
Tokenel
Token ID417
Feature activation+1.81e-01

Pos logit contributions
ixel+2.517
theless+2.361
eu+2.303
asta+2.272
DX+2.188
Neg logit contributions
acity-2.807
 Tup-2.267
-2.132
icate-2.085
icity-2.002
Token>
Token ID29
Feature activation-6.48e-02

Pos logit contributions
ixel+1.909
theless+1.818
eu+1.748
asta+1.714
DX+1.658
Neg logit contributions
acity-2.274
 Tup-1.839
-1.728
icate-1.684
icity-1.618
Token lol
Token ID19462
Feature activation+2.30e-01

Pos logit contributions
acity+0.729
 Tup+0.563
+0.524
icate+0.521
icity+0.498
Neg logit contributions
ixel-0.778
theless-0.744
eu-0.713
asta-0.703
DX-0.692
Token)}
Token ID38165
Feature activation-4.86e-01

Pos logit contributions
ixel+3.631
eu+3.387
theless+3.362
asta+3.298
DX+3.256
Neg logit contributions
acity-3.140
 Tup-2.426
-2.342
icate-2.229
icity-2.124
Token Rd
Token ID20769
Feature activation-2.82e-01

Pos logit contributions
acity+4.919
 Tup+3.724
icate+3.400
+3.249
icity+3.189
Neg logit contributions
ixel-6.225
theless-5.922
asta-5.622
eu-5.585
DX-5.474
Token8
Token ID23
Feature activation-8.78e-02

Pos logit contributions
acity+2.987
 Tup+2.184
icate+2.074
+2.074
icity+1.975
Neg logit contributions
ixel-3.496
theless-3.439
eu-3.212
asta-3.210
 Patel-3.149
Token {
Token ID1391
Feature activation+5.50e-02

Pos logit contributions
acity+1.218
 Tup+0.987
+0.938
icate+0.936
icity+0.905
Neg logit contributions
ixel-0.823
theless-0.784
eu-0.737
asta-0.724
DX-0.708
Token(
Token ID7
Feature activation+1.17e-01

Pos logit contributions
ixel+0.562
theless+0.531
eu+0.516
asta+0.502
DX+0.499
Neg logit contributions
acity-0.711
 Tup-0.570
-0.542
icate-0.536
icity-0.517
Token7
Token ID22
Feature activation+7.32e-01

Pos logit contributions
ixel+1.256
theless+1.197
eu+1.156
asta+1.125
DX+1.121
Neg logit contributions
acity-1.463
 Tup-1.153
-1.099
icate-1.087
icity-1.046
Token17
Token ID1558
Feature activation+3.84e-01

Pos logit contributions
ixel+6.279
eu+6.263
theless+6.083
DX+5.855
asta+5.585
Neg logit contributions
acity-9.379
-7.736
 Tup-7.588
icity-7.101
icate-7.096
Tokens
Token ID82
Feature activation+3.80e-01

Pos logit contributions
ixel+3.493
eu+3.308
theless+3.184
asta+3.130
DX+3.097
Neg logit contributions
acity-5.131
 Tup-4.263
-4.209
icate-3.995
ighth-3.927
Token)}
Token ID38165
Feature activation-5.98e-01

Pos logit contributions
ixel+4.516
eu+4.211
theless+4.132
asta+4.099
DX+4.037
Neg logit contributions
acity-4.109
 Tup-3.216
-3.133
icate-2.947
ighth-2.824
Token 14
Token ID1478
Feature activation-2.98e-01

Pos logit contributions
acity+5.842
 Tup+4.391
icate+4.049
+3.961
icity+3.835
Neg logit contributions
ixel-7.611
theless-7.392
asta-6.995
eu-6.922
DX-6.749
Token.
Token ID13
Feature activation-6.35e-01

Pos logit contributions
acity+3.209
 Tup+2.389
icate+2.217
+2.165
icity+2.121
Neg logit contributions
ixel-3.774
theless-3.713
asta-3.397
eu-3.356
 mathemat-3.341
Tokenimage
Token ID9060
Feature activation+4.48e-01

Pos logit contributions
ixel+2.850
theless+2.660
eu+2.631
asta+2.555
DX+2.544
Neg logit contributions
acity-2.722
 Tup-2.127
-2.041
icate-1.992
icity-1.890
Token,
Token ID11
Feature activation+4.19e-01

Pos logit contributions
ixel+4.017
eu+3.635
DX+3.585
theless+3.555
asta+3.531
Neg logit contributions
acity-6.109
 Tup-4.931
-4.799
icate-4.682
icity-4.549
Token c
Token ID269
Feature activation+4.63e-01

Pos logit contributions
ixel+4.418
theless+4.123
eu+4.014
DX+3.898
asta+3.890
Neg logit contributions
acity-5.063
-4.043
 Tup-3.925
icate-3.834
icity-3.666
Tokenv
Token ID85
Feature activation+3.99e-01

Pos logit contributions
ixel+6.466
eu+6.088
theless+6.076
asta+5.931
DX+5.867
Neg logit contributions
acity-3.997
 Tup-2.918
-2.816
icate-2.615
icity-2.448
Token2
Token ID17
Feature activation+4.02e-01

Pos logit contributions
ixel+4.416
eu+3.999
DX+3.941
theless+3.917
asta+3.876
Neg logit contributions
acity-4.656
 Tup-3.745
-3.676
icate-3.482
ighth-3.355
Token.
Token ID13
Feature activation+7.43e-01

Pos logit contributions
ixel+3.967
eu+3.689
DX+3.666
asta+3.469
theless+3.460
Neg logit contributions
acity-5.135
 Tup-4.184
-4.027
icate-3.895
ighth-3.765
TokenCOLOR
Token ID46786
Feature activation+3.16e-01

Pos logit contributions
ixel+7.996
eu+7.420
DX+7.319
theless+7.162
asta+6.860
Neg logit contributions
acity-8.239
 Tup-6.751
-6.422
icate-6.301
icity-5.983
Token_
Token ID62
Feature activation+3.82e-01

Pos logit contributions
ixel+3.871
theless+3.557
eu+3.489
DX+3.476
asta+3.421
Neg logit contributions
acity-3.476
 Tup-2.635
-2.543
icate-2.443
icity-2.330
TokenB
Token ID33
Feature activation+2.49e-02

Pos logit contributions
ixel+3.970
DX+3.711
eu+3.681
theless+3.635
asta+3.518
Neg logit contributions
acity-4.793
 Tup-3.774
-3.620
icate-3.541
icity-3.436
TokenGR
Token ID10761
Feature activation+2.43e-01

Pos logit contributions
ixel+0.255
theless+0.241
eu+0.231
asta+0.227
DX+0.223
Neg logit contributions
acity-0.323
 Tup-0.259
-0.246
icate-0.244
icity-0.235
Token2
Token ID17
Feature activation-4.42e-02

Pos logit contributions
ixel+2.751
theless+2.558
eu+2.465
DX+2.433
asta+2.427
Neg logit contributions
acity-2.936
 Tup-2.269
-2.209
icate-2.122
icity-2.050
Token://
Token ID1378
Feature activation+5.08e-01

Pos logit contributions
ixel+3.015
eu+2.928
DX+2.731
asta+2.583
['+2.471
Neg logit contributions
acity-7.963
 Tup-6.941
-6.799
ilateral-6.461
icate-6.433
Tokenwww
Token ID2503
Feature activation+4.87e-01

Pos logit contributions
ixel+4.628
eu+4.569
theless+4.466
asta+4.206
DX+4.167
Neg logit contributions
acity-6.546
 Tup-5.549
-5.275
icate-5.017
icity-4.863
Token.
Token ID13
Feature activation+3.70e-02

Pos logit contributions
ixel+4.265
eu+4.197
DX+3.920
asta+3.805
 Patel+3.696
Neg logit contributions
acity-6.411
-5.411
 Tup-5.407
icate-4.980
ighth-4.854
Tokencd
Token ID10210
Feature activation-2.40e-01

Pos logit contributions
ixel+0.488
theless+0.471
eu+0.455
asta+0.448
DX+0.441
Neg logit contributions
acity-0.369
 Tup-0.275
-0.256
icate-0.252
icity-0.239
Tokenc
Token ID66
Feature activation+9.23e-02

Pos logit contributions
acity+1.957
icate+1.204
 Tup+1.201
icity+1.076
ighth+1.067
Neg logit contributions
ixel-3.673
theless-3.590
DX-3.376
eu-3.361
 Patel-3.350
Token.
Token ID13
Feature activation+7.38e-01

Pos logit contributions
ixel+1.018
theless+0.956
eu+0.938
asta+0.915
DX+0.896
Neg logit contributions
acity-1.115
 Tup-0.890
-0.851
icate-0.824
icity-0.791
Tokengov
Token ID9567
Feature activation+7.86e-02

Pos logit contributions
eu+7.731
ixel+7.436
theless+6.903
asta+6.775
ey+6.647
Neg logit contributions
acity-8.220
-7.704
 Tup-6.886
Cod-6.258
icate-6.245
Token/
Token ID14
Feature activation+1.19e-01

Pos logit contributions
ixel+0.932
theless+0.881
eu+0.860
asta+0.844
DX+0.832
Neg logit contributions
acity-0.891
 Tup-0.687
-0.655
icate-0.640
icity-0.611
Tokenin
Token ID259
Feature activation-2.76e-01

Pos logit contributions
ixel+1.404
theless+1.343
eu+1.307
asta+1.273
DX+1.263
Neg logit contributions
acity-1.354
 Tup-1.057
-0.997
icate-0.980
icity-0.933
Tokenjury
Token ID21871
Feature activation-1.62e-01

Pos logit contributions
acity+4.445
 Tup+3.611
icate+3.519
ighth+3.448
+3.446
Neg logit contributions
ixel-1.949
theless-1.902
asta-1.676
eu-1.649
DX-1.627
Token/
Token ID14
Feature activation+2.81e-01

Pos logit contributions
acity+1.753
 Tup+1.309
icate+1.225
+1.221
icity+1.188
Neg logit contributions
ixel-2.021
theless-1.948
eu-1.848
asta-1.840
DX-1.812
Token
Token ID198
Feature activation+7.12e-01

Pos logit contributions
ixel+3.361
eu+3.240
asta+3.056
DX+3.052
theless+2.983
Neg logit contributions
acity-5.394
-4.540
 Tup-4.458
icate-4.220
ighth-4.135
TokenMore
Token ID5167
Feature activation-2.81e-03

Pos logit contributions
ixel+7.903
theless+7.495
DX+7.441
asta+7.434
eu+7.340
Neg logit contributions
acity-7.406
 Tup-6.203
-5.724
icate-5.406
 Cah-5.293
Token entertainment
Token ID9739
Feature activation+9.50e-02

Pos logit contributions
acity+0.037
 Tup+0.029
+0.028
icate+0.028
icity+0.027
Neg logit contributions
ixel-0.029
theless-0.027
eu-0.026
asta-0.026
DX-0.025
Token stories
Token ID3923
Feature activation+4.26e-01

Pos logit contributions
ixel+1.033
theless+0.978
eu+0.942
asta+0.926
DX+0.922
Neg logit contributions
acity-1.171
 Tup-0.919
-0.883
icate-0.875
ighth-0.834
Token on
Token ID319
Feature activation+1.31e-01

Pos logit contributions
ixel+4.056
theless+3.774
eu+3.613
asta+3.612
DX+3.533
Neg logit contributions
acity-5.601
-4.520
 Tup-4.496
icate-4.318
ighth-4.199
Token Yahoo
Token ID16551
Feature activation+6.98e-01

Pos logit contributions
ixel+1.276
theless+1.192
eu+1.145
asta+1.120
DX+1.107
Neg logit contributions
acity-1.778
 Tup-1.411
-1.372
icate-1.356
icity-1.309
Token!
Token ID0
Feature activation+6.70e-01

Pos logit contributions
ixel+5.874
theless+5.783
eu+5.583
DX+5.408
 Phill+5.342
Neg logit contributions
acity-9.312
 Tup-7.562
-7.531
icate-7.306
ilateral-7.172
Token
Token ID198
Feature activation+2.25e-01

Pos logit contributions
ixel+5.815
theless+5.520
DX+5.438
eu+5.345
 Phill+5.216
Neg logit contributions
acity-9.065
-7.404
 Tup-7.136
icate-6.903
ilateral-6.822
Token
Token ID198
Feature activation+4.74e-01

Pos logit contributions
ixel+2.145
eu+2.021
theless+1.976
asta+1.935
DX+1.924
Neg logit contributions
acity-2.976
 Tup-2.416
-2.384
icate-2.284
ighth-2.212
TokenBob
Token ID18861
Feature activation+6.38e-02

Pos logit contributions
ixel+4.830
eu+4.517
DX+4.478
theless+4.381
asta+4.345
Neg logit contributions
acity-5.873
 Tup-4.683
-4.472
icate-4.378
icity-4.235
Tokenbi
Token ID8482
Feature activation+2.01e-01

Pos logit contributions
ixel+0.693
theless+0.657
eu+0.630
asta+0.618
DX+0.608
Neg logit contributions
acity-0.792
 Tup-0.619
-0.592
icate-0.586
icity-0.566
Token would
Token ID561
Feature activation-3.50e-02

Pos logit contributions
acity+0.705
 Tup+0.559
icate+0.526
+0.525
icity+0.506
Neg logit contributions
ixel-0.603
theless-0.574
eu-0.547
asta-0.538
DX-0.531
Token bet
Token ID731
Feature activation+3.17e-01

Pos logit contributions
acity+0.444
 Tup+0.353
icate+0.333
+0.333
icity+0.320
Neg logit contributions
ixel-0.370
theless-0.352
eu-0.336
asta-0.330
DX-0.325
Token on
Token ID319
Feature activation+3.33e-01

Pos logit contributions
ixel+3.022
theless+2.881
eu+2.869
asta+2.754
DX+2.669
Neg logit contributions
acity-4.178
 Tup-3.350
-3.291
icate-3.156
icity-3.099
Token.
Token ID13
Feature activation+1.37e-01

Pos logit contributions
ixel+3.339
theless+3.200
eu+3.117
asta+3.030
 Patel+2.958
Neg logit contributions
acity-4.238
 Tup-3.340
-3.295
icate-3.185
ighth-3.098
Token Hopefully
Token ID19996
Feature activation+3.98e-01

Pos logit contributions
ixel+1.373
theless+1.312
eu+1.273
asta+1.218
DX+1.216
Neg logit contributions
acity-1.801
 Tup-1.425
-1.382
icate-1.369
icity-1.324
Token the
Token ID262
Feature activation+2.42e-01

Pos logit contributions
ixel+4.037
theless+3.840
eu+3.648
 Patel+3.571
 Phill+3.563
Neg logit contributions
acity-5.027
-3.945
 Tup-3.939
icate-3.820
ighth-3.725
Token Australians
Token ID24933
Feature activation+3.67e-01

Pos logit contributions
ixel+2.684
theless+2.566
eu+2.477
asta+2.410
 Phill+2.366
Neg logit contributions
acity-2.887
 Tup-2.242
-2.198
icate-2.137
icity-2.070
Token will
Token ID481
Feature activation+4.43e-01

Pos logit contributions
ixel+3.388
theless+3.170
eu+3.030
asta+2.927
 Patel+2.877
Neg logit contributions
acity-5.013
-4.033
 Tup-4.026
icate-3.836
icity-3.788
Token continue
Token ID2555
Feature activation+2.33e-01

Pos logit contributions
ixel+4.221
theless+3.943
eu+3.824
asta+3.742
 bottleneck+3.666
Neg logit contributions
acity-5.818
-4.729
 Tup-4.667
icity-4.393
icate-4.375
Token their
Token ID511
Feature activation-2.02e-01

Pos logit contributions
ixel+2.513
theless+2.395
eu+2.341
asta+2.256
 Phill+2.217
Neg logit contributions
acity-2.847
 Tup-2.229
-2.139
icate-2.124
icity-2.067
Token improvements
Token ID8561
Feature activation+4.37e-01

Pos logit contributions
acity+2.447
 Tup+1.935
+1.789
icate+1.788
icity+1.717
Neg logit contributions
ixel-2.224
theless-2.111
eu-2.019
asta-1.997
DX-1.969
Token gender
Token ID5279
Feature activation+5.90e-01

Pos logit contributions
ixel+1.969
theless+1.870
eu+1.811
asta+1.765
DX+1.735
Neg logit contributions
acity-2.641
 Tup-2.126
-2.037
icate-2.017
ighth-1.946
Token identity
Token ID5369
Feature activation+2.33e-01

Pos logit contributions
ixel+5.235
theless+5.068
eu+4.847
DX+4.768
asta+4.714
Neg logit contributions
acity-7.693
 Tup-6.210
-6.168
icate-6.072
ighth-5.836
Token does
Token ID857
Feature activation+4.56e-01

Pos logit contributions
ixel+2.245
theless+2.157
eu+2.049
asta+1.994
DX+1.971
Neg logit contributions
acity-3.097
 Tup-2.483
-2.444
icate-2.376
ighth-2.313
Token not
Token ID407
Feature activation+7.05e-01

Pos logit contributions
ixel+4.068
theless+3.756
eu+3.608
DX+3.427
asta+3.367
Neg logit contributions
acity-6.249
 Tup-5.150
-4.872
ighth-4.816
icate-4.730
Token match
Token ID2872
Feature activation+5.55e-01

Pos logit contributions
ixel+5.863
theless+5.745
 accord+5.509
eu+5.349
asta+5.046
Neg logit contributions
acity-9.455
-8.209
 Tup-7.814
ighth-7.335
Detailed-7.312
Token their
Token ID511
Feature activation+2.51e-01

Pos logit contributions
ixel+5.118
theless+4.988
eu+4.879
asta+4.737
DX+4.611
Neg logit contributions
acity-7.191
-5.625
 Tup-5.594
icate-5.400
ighth-5.336
Token body
Token ID1767
Feature activation+2.67e-01

Pos logit contributions
ixel+2.654
theless+2.528
eu+2.470
asta+2.400
DX+2.378
Neg logit contributions
acity-3.115
 Tup-2.441
-2.382
icate-2.344
ighth-2.252
Token's
Token ID338
Feature activation+5.51e-01

Pos logit contributions
ixel+3.015
theless+2.916
eu+2.812
asta+2.746
DX+2.723
Neg logit contributions
acity-3.095
 Tup-2.414
-2.303
icate-2.255
ighth-2.180
Token biological
Token ID10685
Feature activation+4.51e-01

Pos logit contributions
ixel+5.412
theless+5.216
eu+5.078
asta+4.933
DX+4.909
Neg logit contributions
acity-6.799
-5.386
 Tup-5.355
icate-5.223
ighth-5.088
Token sex
Token ID1714
Feature activation+4.65e-01

Pos logit contributions
ixel+4.568
theless+4.410
eu+4.297
asta+4.217
DX+4.119
Neg logit contributions
acity-5.529
 Tup-4.295
-4.293
icate-4.194
ighth-4.146
Token,
Token ID11
Feature activation+2.83e-01

Pos logit contributions
ixel+4.377
theless+4.195
eu+4.153
asta+3.934
DX+3.929
Neg logit contributions
acity-5.996
 Tup-4.860
-4.702
icate-4.556
ighth-4.440
Token all
Token ID477
Feature activation+1.29e-01

Pos logit contributions
ixel+1.035
theless+0.992
eu+0.954
asta+0.937
DX+0.913
Neg logit contributions
acity-1.180
 Tup-0.922
-0.896
icate-0.876
icity-0.843
Token four
Token ID1440
Feature activation-5.54e-01

Pos logit contributions
ixel+1.403
theless+1.353
eu+1.301
asta+1.272
DX+1.239
Neg logit contributions
acity-1.568
 Tup-1.228
-1.188
icate-1.164
icity-1.114
Token it
Token ID340
Feature activation-2.95e-01

Pos logit contributions
acity+5.654
 Tup+4.162
icate+3.736
icity+3.552
+3.470
Neg logit contributions
ixel-7.264
theless-6.855
DX-6.513
asta-6.438
eu-6.411
Token was
Token ID373
Feature activation-1.76e-01

Pos logit contributions
acity+3.486
 Tup+2.736
icate+2.517
+2.471
ighth+2.424
Neg logit contributions
ixel-3.373
theless-3.200
eu-3.011
DX-2.994
asta-2.975
Token a
Token ID257
Feature activation+1.42e-01

Pos logit contributions
acity+2.039
 Tup+1.619
icate+1.487
+1.463
icity+1.422
Neg logit contributions
ixel-2.040
theless-1.944
eu-1.853
asta-1.814
DX-1.806
Token nickel
Token ID30666
Feature activation+1.63e-01

Pos logit contributions
ixel+1.471
theless+1.407
eu+1.347
asta+1.330
DX+1.297
Neg logit contributions
acity-1.812
 Tup-1.414
-1.380
icate-1.354
icity-1.307
Token."
Token ID526
Feature activation+5.15e-02

Pos logit contributions
ixel+1.764
theless+1.678
eu+1.613
asta+1.585
DX+1.559
Neg logit contributions
acity-1.998
 Tup-1.564
-1.508
icate-1.472
icity-1.429
Token Re
Token ID797
Feature activation-1.10e-01

Pos logit contributions
ixel+0.522
theless+0.496
eu+0.476
asta+0.465
DX+0.459
Neg logit contributions
acity-0.672
 Tup-0.534
-0.514
icate-0.509
icity-0.491
Tokenes
Token ID274
Feature activation+3.82e-01

Pos logit contributions
acity+1.452
 Tup+1.144
icate+1.098
+1.082
icity+1.065
Neg logit contributions
ixel-1.109
theless-1.078
eu-0.997
asta-0.982
DX-0.975
Token-
Token ID12
Feature activation+1.93e-01

Pos logit contributions
ixel+3.682
theless+3.513
eu+3.413
asta+3.284
 Phill+3.250
Neg logit contributions
acity-4.989
-3.965
 Tup-3.935
icate-3.782
icity-3.643
TokenJones
Token ID25784
Feature activation-7.93e-02

Pos logit contributions
ixel+2.114
eu+1.961
theless+1.959
DX+1.940
asta+1.913
Neg logit contributions
acity-2.336
 Tup-1.855
-1.767
icate-1.716
icity-1.650
Token another
Token ID1194
Feature activation-3.79e-02

Pos logit contributions
acity+1.112
 Tup+0.886
+0.834
icate+0.834
icity+0.800
Neg logit contributions
ixel-0.942
theless-0.899
eu-0.856
asta-0.842
DX-0.828
Token probable
Token ID17939
Feature activation-1.71e-01

Pos logit contributions
acity+0.454
 Tup+0.355
+0.335
icate+0.333
icity+0.320
Neg logit contributions
ixel-0.427
theless-0.408
eu-0.390
asta-0.384
DX-0.378
Token an
Token ID281
Feature activation+2.80e-01

Pos logit contributions
acity+2.025
 Tup+1.600
icate+1.486
+1.484
icity+1.437
Neg logit contributions
ixel-1.946
theless-1.854
eu-1.769
asta-1.741
DX-1.721
Tokenagram
Token ID6713
Feature activation+3.19e-01

Pos logit contributions
ixel+2.230
eu+2.041
asta+1.949
theless+1.923
DX+1.830
Neg logit contributions
acity-4.196
 Tup-3.554
-3.433
icate-3.286
icity-3.222
Token that
Token ID326
Feature activation+3.65e-01

Pos logit contributions
ixel+3.289
theless+3.107
eu+3.014
asta+2.969
DX+2.882
Neg logit contributions
acity-4.009
 Tup-3.145
-3.029
icate-2.958
ighth-2.903
Token I
Token ID314
Feature activation+1.27e-01

Pos logit contributions
ixel+3.599
theless+3.423
eu+3.299
asta+3.246
DX+3.153
Neg logit contributions
acity-4.757
 Tup-3.715
-3.581
icate-3.575
ighth-3.476
Token can
Token ID460
Feature activation+1.33e-02

Pos logit contributions
ixel+1.293
theless+1.225
eu+1.176
asta+1.156
DX+1.130
Neg logit contributions
acity-1.652
 Tup-1.311
-1.256
icate-1.242
icity-1.204
Token
Token ID447
Feature activation+1.11e-01

Pos logit contributions
ixel+0.149
theless+0.142
eu+0.136
asta+0.134
DX+0.131
Neg logit contributions
acity-0.161
 Tup-0.126
-0.119
icate-0.118
icity-0.114
Token
Token ID247
Feature activation+9.14e-02

Pos logit contributions
ixel+1.217
theless+1.137
eu+1.113
asta+1.084
DX+1.070
Neg logit contributions
acity-1.364
 Tup-1.079
-1.024
icate-1.007
icity-0.983
Tokent
Token ID83
Feature activation+3.30e-02

Pos logit contributions
ixel+1.141
theless+1.095
eu+1.059
asta+1.039
DX+1.026
Neg logit contributions
acity-0.984
 Tup-0.743
-0.696
icate-0.685
icity-0.658
Token do
Token ID466
Feature activation-7.10e-02

Pos logit contributions
ixel+0.347
theless+0.329
eu+0.315
asta+0.310
DX+0.304
Neg logit contributions
acity-0.421
 Tup-0.334
-0.318
icate-0.314
icity-0.304
Token Wednesday
Token ID3583
Feature activation-2.74e-01

Pos logit contributions
ixel+1.591
theless+1.506
eu+1.459
asta+1.428
DX+1.405
Neg logit contributions
acity-1.849
 Tup-1.455
-1.390
icate-1.379
icity-1.337
Token.
Token ID13
Feature activation-2.00e-01

Pos logit contributions
acity+3.107
 Tup+2.426
+2.266
icate+2.216
icity+2.132
Neg logit contributions
ixel-3.216
theless-3.096
eu-2.948
asta-2.897
DX-2.865
Token The
Token ID383
Feature activation-2.05e-01

Pos logit contributions
acity+2.485
 Tup+2.015
+1.853
icate+1.843
icity+1.767
Neg logit contributions
ixel-2.154
theless-2.059
asta-1.942
eu-1.924
DX-1.864
Token veteran
Token ID9298
Feature activation-7.89e-03

Pos logit contributions
acity+2.657
 Tup+2.153
icate+1.989
+1.974
icity+1.912
Neg logit contributions
ixel-2.105
theless-1.993
eu-1.911
asta-1.873
DX-1.844
Token safety
Token ID3747
Feature activation-6.92e-01

Pos logit contributions
acity+0.106
 Tup+0.086
+0.082
icate+0.081
icity+0.078
Neg logit contributions
ixel-0.077
theless-0.073
eu-0.070
asta-0.068
DX-0.067
Token called
Token ID1444
Feature activation-1.10e-01

Pos logit contributions
acity+7.759
 Tup+6.396
icate+5.579
+5.566
ighth+5.460
Neg logit contributions
ixel-7.575
theless-7.134
DX-6.812
asta-6.648
eu-6.644
Token the
Token ID262
Feature activation-3.37e-01

Pos logit contributions
acity+1.296
 Tup+1.018
+0.947
icate+0.945
icity+0.906
Neg logit contributions
ixel-1.257
theless-1.208
eu-1.153
asta-1.133
DX-1.114
Token entire
Token ID2104
Feature activation-1.53e-01

Pos logit contributions
acity+4.182
 Tup+3.383
icate+3.096
+3.028
icity+2.968
Neg logit contributions
ixel-3.591
theless-3.468
eu-3.278
asta-3.221
DX-3.184
Token defense
Token ID3761
Feature activation-3.69e-01

Pos logit contributions
acity+1.855
 Tup+1.464
icate+1.358
+1.343
icity+1.303
Neg logit contributions
ixel-1.694
theless-1.631
eu-1.551
asta-1.528
DX-1.500
Token into
Token ID656
Feature activation-3.17e-01

Pos logit contributions
acity+4.159
 Tup+3.278
+3.030
icate+3.017
ighth+2.827
Neg logit contributions
ixel-4.361
theless-4.198
asta-3.954
eu-3.944
DX-3.889
Token a
Token ID257
Feature activation-2.79e-02

Pos logit contributions
acity+3.781
 Tup+3.056
+2.830
icate+2.814
ighth+2.674
Neg logit contributions
ixel-3.445
theless-3.439
eu-3.183
asta-3.153
DX-3.076
Token had
Token ID550
Feature activation-1.80e-02

Pos logit contributions
ixel+1.349
theless+1.291
eu+1.232
asta+1.210
 Patel+1.174
Neg logit contributions
acity-1.583
 Tup-1.249
-1.205
icate-1.189
icity-1.133
Token never
Token ID1239
Feature activation-6.21e-02

Pos logit contributions
acity+0.218
 Tup+0.171
+0.161
icate+0.160
icity+0.154
Neg logit contributions
ixel-0.200
theless-0.191
eu-0.183
asta-0.179
DX-0.177
Token heard
Token ID2982
Feature activation-1.98e-01

Pos logit contributions
acity+0.765
 Tup+0.605
icate+0.567
+0.566
icity+0.549
Neg logit contributions
ixel-0.677
theless-0.643
eu-0.617
asta-0.606
DX-0.599
Token of
Token ID286
Feature activation+3.63e-01

Pos logit contributions
acity+2.250
 Tup+1.751
+1.617
icate+1.602
icity+1.548
Neg logit contributions
ixel-2.341
theless-2.261
eu-2.165
asta-2.123
DX-2.104
Token the
Token ID262
Feature activation-4.69e-02

Pos logit contributions
ixel+3.564
theless+3.372
eu+3.245
 Patel+3.185
asta+3.152
Neg logit contributions
acity-4.725
 Tup-3.673
-3.625
icate-3.585
icity-3.447
Token book
Token ID1492
Feature activation-3.21e-01

Pos logit contributions
acity+0.586
 Tup+0.466
+0.439
icate+0.436
icity+0.421
Neg logit contributions
ixel-0.504
theless-0.480
eu-0.459
asta-0.451
DX-0.444
Token or
Token ID393
Feature activation+2.85e-01

Pos logit contributions
acity+3.753
 Tup+2.982
+2.737
icate+2.694
icity+2.635
Neg logit contributions
ixel-3.649
theless-3.448
eu-3.304
asta-3.286
DX-3.249
Token author
Token ID1772
Feature activation-8.66e-02

Pos logit contributions
ixel+2.955
theless+2.806
eu+2.685
asta+2.642
 Patel+2.622
Neg logit contributions
acity-3.626
 Tup-2.799
-2.761
icate-2.725
ilateral-2.573
Token.
Token ID13
Feature activation-2.25e-01

Pos logit contributions
acity+1.055
 Tup+0.833
+0.783
icate+0.776
icity+0.751
Neg logit contributions
ixel-0.959
theless-0.913
eu-0.875
asta-0.860
DX-0.848
Token
Token ID198
Feature activation-2.17e-01

Pos logit contributions
acity+2.757
 Tup+2.216
+2.023
icate+2.017
icity+1.958
Neg logit contributions
ixel-2.467
theless-2.348
asta-2.222
eu-2.192
DX-2.147
Token
Token ID198
Feature activation-2.06e-01

Pos logit contributions
acity+2.496
 Tup+1.909
icate+1.782
+1.725
icity+1.718
Neg logit contributions
ixel-2.584
theless-2.515
asta-2.302
eu-2.291
 mathemat-2.276
Token sound
Token ID2128
Feature activation-9.21e-01

Pos logit contributions
acity+1.879
 Tup+1.502
icate+1.404
+1.402
icity+1.363
Neg logit contributions
ixel-1.538
theless-1.457
eu-1.384
asta-1.363
DX-1.342
Token,
Token ID11
Feature activation-4.47e-01

Pos logit contributions
acity+9.483
 Tup+7.045
+6.745
icate+6.629
icity+6.503
Neg logit contributions
ixel-10.381
theless-9.703
 Graveyard-9.418
asta-9.389
 Patel-9.095
Token with
Token ID351
Feature activation-4.06e-01

Pos logit contributions
acity+5.210
 Tup+4.086
+3.902
icate+3.857
icity+3.688
Neg logit contributions
ixel-4.977
theless-4.781
eu-4.516
asta-4.424
DX-4.396
Token competition
Token ID5449
Feature activation+2.15e-01

Pos logit contributions
acity+4.692
 Tup+3.700
icate+3.364
+3.339
icity+3.299
Neg logit contributions
ixel-4.672
theless-4.421
eu-4.198
asta-4.165
DX-4.114
Token.
Token ID13
Feature activation-8.91e-02

Pos logit contributions
ixel+2.226
theless+2.106
eu+2.033
asta+1.996
DX+1.953
Neg logit contributions
acity-2.721
 Tup-2.165
-2.070
icate-2.042
ighth-1.972
Token
Token ID447
Feature activation-2.57e-01

Pos logit contributions
acity+1.132
 Tup+0.905
+0.853
icate+0.847
icity+0.818
Neg logit contributions
ixel-0.941
theless-0.896
eu-0.848
asta-0.839
DX-0.821
Token
Token ID251
Feature activation-2.37e-01

Pos logit contributions
acity+2.730
 Tup+2.052
+1.938
icate+1.915
ighth+1.811
Neg logit contributions
ixel-3.198
theless-3.122
eu-2.947
asta-2.937
DX-2.875
Token
Token ID198
Feature activation+8.80e-02

Pos logit contributions
acity+2.862
 Tup+2.256
icate+2.122
+2.117
icity+2.038
Neg logit contributions
ixel-2.632
theless-2.484
eu-2.360
asta-2.356
DX-2.301
Token
Token ID198
Feature activation-1.09e-01

Pos logit contributions
ixel+0.905
theless+0.854
eu+0.837
asta+0.813
DX+0.805
Neg logit contributions
acity-1.125
 Tup-0.899
-0.864
icate-0.847
icity-0.816
TokenThat
Token ID2504
Feature activation-1.08e-01

Pos logit contributions
acity+1.352
 Tup+1.062
+1.012
icate+1.002
icity+0.965
Neg logit contributions
ixel-1.186
theless-1.135
eu-1.070
asta-1.060
DX-1.031
Token
Token ID447
Feature activation-9.84e-02

Pos logit contributions
acity+1.351
 Tup+1.073
+1.007
icate+1.004
icity+0.970
Neg logit contributions
ixel-1.161
theless-1.097
eu-1.050
asta-1.035
DX-1.016
Token Europe
Token ID2031
Feature activation-1.97e-01

Pos logit contributions
acity+3.203
 Tup+2.560
icate+2.309
+2.276
icity+2.221
Neg logit contributions
ixel-3.360
theless-3.214
eu-3.059
asta-3.009
DX-2.980
Token in
Token ID287
Feature activation-3.68e-01

Pos logit contributions
acity+2.282
 Tup+1.778
icate+1.653
+1.643
icity+1.605
Neg logit contributions
ixel-2.283
theless-2.178
eu-2.065
asta-2.042
DX-2.014
Token the
Token ID262
Feature activation-1.45e-01

Pos logit contributions
acity+4.063
 Tup+3.227
icate+2.896
+2.879
icity+2.806
Neg logit contributions
ixel-4.404
theless-4.164
eu-3.954
asta-3.931
DX-3.876
Token world
Token ID995
Feature activation-1.97e-01

Pos logit contributions
acity+1.691
 Tup+1.329
+1.222
icate+1.220
icity+1.171
Neg logit contributions
ixel-1.682
theless-1.605
eu-1.522
asta-1.512
DX-1.487
Token
Token ID447
Feature activation-2.74e-01

Pos logit contributions
acity+2.385
 Tup+1.882
icate+1.749
+1.718
icity+1.697
Neg logit contributions
ixel-2.206
theless-2.100
eu-1.989
asta-1.958
DX-1.923
Token
Token ID247
Feature activation-2.08e-01

Pos logit contributions
acity+3.103
 Tup+2.418
+2.276
icate+2.266
ighth+2.155
Neg logit contributions
ixel-3.196
theless-3.096
eu-2.922
asta-2.895
DX-2.851
Tokens
Token ID82
Feature activation-2.33e-01

Pos logit contributions
acity+2.533
 Tup+2.002
+1.893
icate+1.871
icity+1.811
Neg logit contributions
ixel-2.275
theless-2.152
eu-2.049
asta-2.032
DX-1.997
Token worst
Token ID5290
Feature activation-1.01e-01

Pos logit contributions
acity+2.706
 Tup+2.100
+1.961
icate+1.944
ighth+1.874
Neg logit contributions
ixel-2.731
theless-2.592
eu-2.473
asta-2.421
DX-2.406
Token nuclear
Token ID4523
Feature activation-6.23e-02

Pos logit contributions
acity+1.130
 Tup+0.865
+0.799
icate+0.797
icity+0.771
Neg logit contributions
ixel-1.228
theless-1.175
eu-1.123
asta-1.104
DX-1.091
Token disaster
Token ID9336
Feature activation+1.79e-01

Pos logit contributions
acity+0.581
 Tup+0.417
+0.380
icate+0.379
icity+0.361
Neg logit contributions
ixel-0.870
theless-0.837
eu-0.808
asta-0.795
DX-0.788
Token.
Token ID13
Feature activation-9.86e-02

Pos logit contributions
ixel+1.852
theless+1.754
eu+1.703
asta+1.681
DX+1.633
Neg logit contributions
acity-2.265
 Tup-1.801
-1.727
icate-1.704
ighth-1.636
Token.
Token ID13
Feature activation+2.72e-01

Pos logit contributions
acity+0.286
 Tup+0.224
+0.211
icate+0.209
icity+0.202
Neg logit contributions
ixel-0.269
theless-0.257
eu-0.246
asta-0.242
DX-0.238
Token They
Token ID1119
Feature activation-2.92e-02

Pos logit contributions
ixel+2.600
theless+2.489
eu+2.453
asta+2.369
 Phill+2.340
Neg logit contributions
acity-3.576
-2.838
 Tup-2.838
icate-2.745
icity-2.644
Token will
Token ID481
Feature activation+2.97e-01

Pos logit contributions
acity+0.380
 Tup+0.304
+0.288
icate+0.287
icity+0.277
Neg logit contributions
ixel-0.299
theless-0.284
eu-0.271
asta-0.266
DX-0.262
Token have
Token ID423
Feature activation-1.14e-02

Pos logit contributions
ixel+2.757
theless+2.597
eu+2.469
asta+2.430
 Patel+2.334
Neg logit contributions
acity-4.088
 Tup-3.298
-3.283
icate-3.090
icity-3.030
Token five
Token ID1936
Feature activation-4.09e-02

Pos logit contributions
acity+0.136
 Tup+0.106
+0.100
icate+0.100
icity+0.096
Neg logit contributions
ixel-0.129
theless-0.123
eu-0.118
asta-0.116
DX-0.115
Token days
Token ID1528
Feature activation-4.46e-01

Pos logit contributions
acity+0.458
 Tup+0.350
+0.328
icate+0.326
icity+0.313
Neg logit contributions
ixel-0.493
theless-0.473
eu-0.453
asta-0.447
DX-0.440
Token to
Token ID284
Feature activation+3.26e-01

Pos logit contributions
acity+4.702
 Tup+3.550
icate+3.373
+3.301
icity+3.196
Neg logit contributions
ixel-5.522
theless-5.250
DX-5.021
asta-5.004
eu-4.996
Token do
Token ID466
Feature activation+7.46e-02

Pos logit contributions
ixel+3.138
theless+2.893
eu+2.810
asta+2.770
 Patel+2.635
Neg logit contributions
acity-4.410
-3.527
 Tup-3.514
icate-3.265
icity-3.218
Token so
Token ID523
Feature activation-2.32e-01

Pos logit contributions
ixel+0.875
theless+0.834
eu+0.804
asta+0.791
DX+0.777
Neg logit contributions
acity-0.858
 Tup-0.663
-0.630
icate-0.617
icity-0.593
Token after
Token ID706
Feature activation+5.67e-01

Pos logit contributions
acity+2.670
 Tup+2.053
icate+1.933
+1.891
icity+1.856
Neg logit contributions
ixel-2.714
theless-2.595
eu-2.465
asta-2.444
DX-2.419
Token the
Token ID262
Feature activation+4.32e-01

Pos logit contributions
ixel+5.371
theless+5.057
eu+4.942
asta+4.925
 Phill+4.878
Neg logit contributions
acity-7.231
-5.762
 Tup-5.744
icate-5.365
igon-5.160
Token and
Token ID290
Feature activation-3.44e-01

Pos logit contributions
ixel+1.929
theless+1.862
eu+1.753
asta+1.723
DX+1.700
Neg logit contributions
acity-2.710
 Tup-2.177
icate-2.099
-2.096
icity-1.990
Token status
Token ID3722
Feature activation-6.20e-02

Pos logit contributions
acity+3.942
 Tup+3.150
icate+2.830
+2.820
icity+2.761
Neg logit contributions
ixel-3.995
theless-3.770
eu-3.622
asta-3.536
DX-3.482
Token Indians
Token ID12746
Feature activation-8.54e-02

Pos logit contributions
acity+0.795
 Tup+0.633
+0.598
icate+0.595
icity+0.575
Neg logit contributions
ixel-0.647
theless-0.614
eu-0.588
asta-0.577
DX-0.566
Token are
Token ID389
Feature activation-3.86e-02

Pos logit contributions
acity+1.082
 Tup+0.864
icate+0.812
+0.812
icity+0.781
Neg logit contributions
ixel-0.899
theless-0.862
eu-0.819
asta-0.804
DX-0.791
Token largely
Token ID5688
Feature activation+1.64e-01

Pos logit contributions
acity+0.483
 Tup+0.383
+0.361
icate+0.359
icity+0.347
Neg logit contributions
ixel-0.414
theless-0.394
eu-0.376
asta-0.370
DX-0.364
Token a
Token ID257
Feature activation-1.98e-01

Pos logit contributions
ixel+1.664
theless+1.601
eu+1.528
asta+1.498
 Patel+1.475
Neg logit contributions
acity-2.124
 Tup-1.664
-1.618
icate-1.596
icity-1.528
Token federal
Token ID2717
Feature activation+3.18e-01

Pos logit contributions
acity+2.319
 Tup+1.845
+1.726
icate+1.709
icity+1.654
Neg logit contributions
ixel-2.252
theless-2.119
eu-2.054
asta-2.013
DX-1.979
Token responsibility
Token ID5798
Feature activation-7.60e-02

Pos logit contributions
ixel+3.228
theless+3.043
eu+2.902
asta+2.854
 bottleneck+2.850
Neg logit contributions
acity-4.099
 Tup-3.179
-3.124
icate-3.054
igon-2.941
Token.
Token ID13
Feature activation+3.00e-01

Pos logit contributions
acity+0.905
 Tup+0.707
+0.663
icate+0.661
icity+0.637
Neg logit contributions
ixel-0.862
theless-0.826
eu-0.790
asta-0.777
DX-0.765
Token Access
Token ID8798
Feature activation+2.61e-01

Pos logit contributions
ixel+2.883
theless+2.788
eu+2.720
 Patel+2.669
 Phill+2.609
Neg logit contributions
acity-3.946
 Tup-3.047
-3.035
icate-2.986
ighth-2.885
Tokening
Token ID278
Feature activation-4.93e-01

Pos logit contributions
ixel+2.741
theless+2.587
eu+2.502
asta+2.436
DX+2.404
Neg logit contributions
acity-3.237
 Tup-2.574
icate-2.445
-2.441
icity-2.342
Token Brad
Token ID8114
Feature activation-1.04e-01

Pos logit contributions
acity+5.110
 Tup+4.287
icate+3.729
+3.655
icity+3.616
Neg logit contributions
ixel-5.036
theless-4.723
asta-4.486
eu-4.343
 mathemat-4.312
Tokenen
Token ID268
Feature activation-1.24e-01

Pos logit contributions
acity+1.265
 Tup+0.987
icate+0.929
+0.920
icity+0.896
Neg logit contributions
ixel-1.161
theless-1.112
eu-1.057
asta-1.047
DX-1.032
Token Holt
Token ID28330
Feature activation+6.21e-02

Pos logit contributions
acity+1.249
 Tup+0.941
+0.864
icate+0.861
icity+0.820
Neg logit contributions
ixel-1.622
theless-1.559
eu-1.508
asta-1.485
DX-1.471
Tokenby
Token ID1525
Feature activation-4.59e-01

Pos logit contributions
ixel+0.779
theless+0.744
eu+0.720
asta+0.710
DX+0.696
Neg logit contributions
acity-0.666
 Tup-0.504
-0.471
icate-0.469
icity-0.443
Token lacks
Token ID16523
Feature activation-5.00e-01

Pos logit contributions
acity+5.540
 Tup+4.312
icate+4.060
+4.021
icity+3.899
Neg logit contributions
ixel-4.974
theless-4.708
eu-4.448
DX-4.435
asta-4.430
Token the
Token ID262
Feature activation-1.07e+00

Pos logit contributions
acity+5.890
 Tup+4.598
icate+4.203
icity+4.107
ighth+4.055
Neg logit contributions
ixel-5.677
theless-5.344
asta-5.063
eu-5.013
DX-5.006
Token stamina
Token ID32312
Feature activation-1.75e-01

Pos logit contributions
acity+12.228
 Tup+9.513
icity+8.643
icate+8.554
+8.246
Neg logit contributions
ixel-10.722
theless-9.806
asta-9.327
DX-9.199
eu-9.014
Token to
Token ID284
Feature activation-5.60e-02

Pos logit contributions
acity+2.054
 Tup+1.591
+1.486
icate+1.481
icity+1.430
Neg logit contributions
ixel-2.029
theless-1.946
eu-1.846
asta-1.823
DX-1.793
Token carry
Token ID3283
Feature activation-1.37e-01

Pos logit contributions
acity+0.705
 Tup+0.558
icate+0.527
+0.526
icity+0.505
Neg logit contributions
ixel-0.598
theless-0.571
eu-0.544
asta-0.534
DX-0.527
Token his
Token ID465
Feature activation-1.02e-01

Pos logit contributions
acity+1.604
 Tup+1.252
icate+1.168
+1.165
icity+1.117
Neg logit contributions
ixel-1.594
theless-1.516
eu-1.455
asta-1.434
DX-1.415
Token team
Token ID1074
Feature activation+1.05e-01

Pos logit contributions
acity+1.074
 Tup+0.807
icate+0.746
+0.743
icity+0.713
Neg logit contributions
ixel-1.300
theless-1.244
eu-1.204
asta-1.183
DX-1.167
Token Lake
Token ID6233
Feature activation-5.81e-02

Pos logit contributions
ixel+1.318
theless+1.263
eu+1.219
asta+1.207
DX+1.178
Neg logit contributions
acity-1.333
 Tup-1.015
-0.973
icate-0.962
icity-0.929
Token Ur
Token ID8799
Feature activation-6.45e-01

Pos logit contributions
acity+0.627
 Tup+0.478
+0.446
icate+0.441
icity+0.421
Neg logit contributions
ixel-0.725
theless-0.694
eu-0.667
asta-0.657
DX-0.649
Tokenmia
Token ID20730
Feature activation+2.20e-01

Pos logit contributions
acity+7.511
 Tup+5.456
icity+5.236
icate+5.200
igon+4.941
Neg logit contributions
theless-7.185
ixel-7.055
eu-6.668
 Phill-6.605
 ende-6.495
Token basin
Token ID34164
Feature activation+1.46e-01

Pos logit contributions
ixel+2.222
theless+2.130
eu+2.063
asta+2.041
DX+1.990
Neg logit contributions
acity-2.831
 Tup-2.221
-2.147
icate-2.102
ighth-2.049
Token -
Token ID532
Feature activation-2.12e-01

Pos logit contributions
ixel+1.433
theless+1.371
eu+1.312
asta+1.287
DX+1.258
Neg logit contributions
acity-1.948
 Tup-1.553
-1.505
icate-1.470
ighth-1.423
Token East
Token ID3687
Feature activation-9.84e-01

Pos logit contributions
acity+2.417
 Tup+1.922
icate+1.756
+1.748
icity+1.682
Neg logit contributions
ixel-2.498
theless-2.371
eu-2.289
asta-2.235
DX-2.203
Token Azerbaijan
Token ID33605
Feature activation-9.21e-02

Pos logit contributions
acity+9.126
 Tup+7.834
icate+6.549
 Mosul+6.500
 Haz+6.441
Neg logit contributions
ixel-11.583
theless-10.893
 bottleneck-10.143
asta-10.084
eu-10.002
Token,
Token ID11
Feature activation-4.06e-01

Pos logit contributions
acity+1.110
 Tup+0.874
+0.824
icate+0.819
icity+0.786
Neg logit contributions
ixel-1.029
theless-0.985
eu-0.940
asta-0.921
DX-0.909
Token West
Token ID2688
Feature activation-8.45e-01

Pos logit contributions
acity+4.394
 Tup+3.591
+3.295
icate+3.234
icity+3.051
Neg logit contributions
ixel-4.891
theless-4.622
eu-4.378
asta-4.352
DX-4.311
Token Azerbaijan
Token ID33605
Feature activation-7.86e-02

Pos logit contributions
acity+6.162
 Tup+4.878
+3.988
icate+3.909
 Cah+3.712
Neg logit contributions
ixel-12.005
theless-11.232
asta-10.874
eu-10.796
DX-10.736
Token,
Token ID11
Feature activation-3.09e-01

Pos logit contributions
acity+0.955
 Tup+0.752
+0.711
icate+0.706
icity+0.678
Neg logit contributions
ixel-0.872
theless-0.833
eu-0.794
asta-0.780
DX-0.769
Token
Token ID447
Feature activation-8.59e-02

Pos logit contributions
ixel+1.004
theless+0.955
eu+0.917
asta+0.903
DX+0.883
Neg logit contributions
acity-1.186
 Tup-0.939
-0.892
icate-0.883
icity-0.850
Token
Token ID251
Feature activation-2.23e-01

Pos logit contributions
acity+0.993
 Tup+0.766
+0.719
icate+0.715
icity+0.685
Neg logit contributions
ixel-1.002
theless-0.971
eu-0.923
asta-0.911
DX-0.890
Token in
Token ID287
Feature activation+1.38e-01

Pos logit contributions
acity+2.615
 Tup+2.028
+1.906
icate+1.889
icity+1.830
Neg logit contributions
ixel-2.560
theless-2.446
eu-2.326
asta-2.304
DX-2.272
Token a
Token ID257
Feature activation+7.98e-01

Pos logit contributions
ixel+1.513
theless+1.450
eu+1.399
asta+1.364
DX+1.341
Neg logit contributions
acity-1.671
 Tup-1.289
-1.246
icate-1.232
icity-1.186
Token 1999
Token ID7358
Feature activation+9.04e-01

Pos logit contributions
ixel+7.060
theless+6.927
eu+6.878
 Phill+6.845
 Patel+6.663
Neg logit contributions
acity-9.659
-7.653
icate-7.462
ighth-7.267
 Tup-7.093
Token Le
Token ID1004
Feature activation-1.03e+00

Pos logit contributions
ixel+7.351
eu+7.165
 Phill+7.076
theless+7.060
 Patel+6.840
Neg logit contributions
acity-11.311
-9.099
icate-8.852
ighth-8.606
 Tup-8.377
Token M
Token ID337
Feature activation-2.92e-01

Pos logit contributions
acity+9.552
 Tup+7.950
icate+7.148
+7.024
 Cah+6.910
Neg logit contributions
theless-11.665
ixel-10.974
DX-10.441
 Patel-10.267
 bottleneck-10.164
Tokenonde
Token ID14378
Feature activation+1.50e-01

Pos logit contributions
acity+4.090
 Tup+3.260
icate+3.168
icity+3.090
+3.049
Neg logit contributions
theless-2.640
ixel-2.611
eu-2.380
DX-2.325
asta-2.293
Token interview
Token ID2720
Feature activation-4.27e-01

Pos logit contributions
ixel+1.616
theless+1.527
eu+1.492
asta+1.457
DX+1.423
Neg logit contributions
acity-1.864
 Tup-1.454
-1.390
icate-1.379
icity-1.330
Token,
Token ID11
Feature activation-1.33e-01

Pos logit contributions
acity+4.684
 Tup+3.618
+3.341
icate+3.312
icity+3.207
Neg logit contributions
ixel-5.041
theless-4.919
eu-4.629
DX-4.551
asta-4.541
Token as
Token ID355
Feature activation+1.81e-01

Pos logit contributions
acity+1.602
 Tup+1.259
+1.181
icate+1.169
icity+1.130
Neg logit contributions
ixel-1.496
theless-1.430
eu-1.369
asta-1.343
DX-1.329
Token fed
Token ID11672
Feature activation+2.25e-01

Pos logit contributions
acity+1.380
 Tup+1.084
+1.017
icate+1.011
icity+0.978
Neg logit contributions
ixel-1.269
theless-1.210
eu-1.154
asta-1.135
DX-1.117
Token state
Token ID1181
Feature activation-8.67e-02

Pos logit contributions
ixel+2.519
theless+2.382
eu+2.313
asta+2.283
DX+2.255
Neg logit contributions
acity-2.648
 Tup-2.077
-1.977
icate-1.953
ighth-1.871
Token.
Token ID13
Feature activation-2.33e-01

Pos logit contributions
acity+1.021
 Tup+0.792
+0.742
icate+0.742
icity+0.714
Neg logit contributions
ixel-0.996
theless-0.952
eu-0.911
asta-0.898
DX-0.884
Token
Token ID198
Feature activation-3.90e-01

Pos logit contributions
acity+2.833
 Tup+2.271
+2.101
icate+2.094
icity+2.029
Neg logit contributions
ixel-2.559
theless-2.440
asta-2.300
eu-2.296
DX-2.216
Token
Token ID198
Feature activation-3.65e-01

Pos logit contributions
acity+4.152
 Tup+3.090
icate+2.875
icity+2.767
Detailed+2.716
Neg logit contributions
ixel-4.938
theless-4.912
 mathemat-4.706
 ende-4.512
asta-4.384
TokenJust
Token ID5703
Feature activation-9.96e-01

Pos logit contributions
acity+4.213
 Tup+3.198
+3.144
icate+3.048
Detailed+3.045
Neg logit contributions
ixel-4.162
theless-4.117
asta-3.750
eu-3.702
 Phill-3.682
Token like
Token ID588
Feature activation-2.45e-01

Pos logit contributions
acity+9.541
icate+7.442
 Tup+7.270
icity+6.958
ICE+6.841
Neg logit contributions
ixel-11.140
theless-11.133
asta-10.386
eu-10.157
 Phill-9.926
Token anything
Token ID1997
Feature activation-8.54e-02

Pos logit contributions
acity+2.900
 Tup+2.310
icate+2.136
+2.082
icity+2.048
Neg logit contributions
ixel-2.776
theless-2.637
eu-2.534
asta-2.498
DX-2.431
Token else
Token ID2073
Feature activation-3.48e-02

Pos logit contributions
acity+1.204
 Tup+0.985
+0.937
icate+0.934
icity+0.905
Neg logit contributions
ixel-0.780
theless-0.735
eu-0.699
asta-0.683
DX-0.667
Token,
Token ID11
Feature activation-2.34e-01

Pos logit contributions
acity+0.422
 Tup+0.331
+0.313
icate+0.311
icity+0.299
Neg logit contributions
ixel-0.388
theless-0.370
eu-0.355
asta-0.348
DX-0.342
Token this
Token ID428
Feature activation-3.67e-01

Pos logit contributions
acity+2.782
 Tup+2.191
icate+2.048
+2.038
icity+1.978
Neg logit contributions
ixel-2.621
theless-2.508
eu-2.409
asta-2.361
DX-2.317
Token,
Token ID11
Feature activation-5.28e-01

Pos logit contributions
acity+4.399
 Tup+3.542
icate+3.307
+3.180
icity+3.159
Neg logit contributions
ixel-4.312
theless-4.192
asta-3.919
eu-3.891
DX-3.750
Token based
Token ID1912
Feature activation-2.96e-01

Pos logit contributions
acity+5.805
 Tup+4.777
icate+4.415
+4.190
icity+4.177
Neg logit contributions
ixel-5.926
theless-5.713
eu-5.430
asta-5.370
DX-5.271
Token in
Token ID287
Feature activation-1.92e-01

Pos logit contributions
acity+3.318
 Tup+2.584
icate+2.413
+2.384
icity+2.310
Neg logit contributions
ixel-3.535
theless-3.394
eu-3.211
asta-3.160
DX-3.094
Token Canada
Token ID3340
Feature activation+1.63e-01

Pos logit contributions
acity+2.292
 Tup+1.846
icate+1.693
+1.682
icity+1.627
Neg logit contributions
ixel-2.135
theless-2.037
eu-1.947
asta-1.917
DX-1.883
Token,
Token ID11
Feature activation-1.40e-01

Pos logit contributions
ixel+1.745
theless+1.652
eu+1.610
asta+1.564
DX+1.547
Neg logit contributions
acity-2.032
 Tup-1.599
-1.532
icate-1.503
icity-1.440
Token from
Token ID422
Feature activation-1.03e+00

Pos logit contributions
acity+1.650
 Tup+1.309
icate+1.218
+1.212
icity+1.171
Neg logit contributions
ixel-1.590
theless-1.524
eu-1.450
asta-1.433
DX-1.406
Token 3
Token ID513
Feature activation-4.45e-01

Pos logit contributions
acity+11.417
 Tup+10.014
icate+9.000
icity+8.752
 Cah+8.686
Neg logit contributions
ixel-9.568
theless-9.115
eu-8.787
asta-8.712
DX-8.226
Tokeni
Token ID72
Feature activation-1.78e-01

Pos logit contributions
acity+5.147
 Tup+4.040
icate+3.917
+3.858
icity+3.742
Neg logit contributions
ixel-4.904
theless-4.768
eu-4.474
asta-4.402
 Graveyard-4.253
Token Group
Token ID4912
Feature activation+2.48e-01

Pos logit contributions
acity+2.268
 Tup+1.812
icate+1.708
+1.708
icity+1.654
Neg logit contributions
ixel-1.849
theless-1.776
eu-1.678
asta-1.647
DX-1.607
Token Pl
Token ID1345
Feature activation-7.20e-01

Pos logit contributions
ixel+2.542
theless+2.402
eu+2.351
asta+2.267
DX+2.261
Neg logit contributions
acity-3.176
 Tup-2.477
-2.432
icate-2.347
icity-2.262
Tokenc
Token ID66
Feature activation+8.62e-02

Pos logit contributions
acity+4.729
 Tup+2.946
icate+2.690
ighth+2.563
icity+2.381
Neg logit contributions
ixel-11.461
theless-11.028
eu-10.465
 Graveyard-10.358
asta-10.315