TOP ACTIVATIONS
MAX = 8.624

moved to the less - rest rict ive Commerce Control List
data : alcohol - im pa ired driving . Washington ,
ideas , is fairly imp en etr able . On the
researchers . As if all of this not enough
Maze design . A 3 × 3 room maze was designed
crumbling sidewalks to the hom ew on ers owning the houses
I considered letting this game rest this year but decided that
it was introduced as an Excel add - in back in
AB : [ As if counting before a song ]
bone . And , as if that wasn 't enough to
was , and what Ad e o R ess i warned
) 401 - 414 ; doi : 10 . 10 71
friendly formats What if you have digital copies of
s a start . <|endoftext|> Abstract Background Depression presents
symptoms that are in ext ric ably linked to the core
thereof . Article IV , Section 2 begins with
and stay there , as if they had taken out a
maze consists in a 3 × 3 rooms . The rooms
in America . Everyone gave themselves the rhetorical authority to tweet
they attempted to get a free handset . The

MIN ACTIVATIONS
MIN = -9.050

Something about the vowel system gets me all tri
ly but hardly exclusively to female veterans and female reporters ).
joined seconds later by a deaf ening th ud from the
th U . S . Circuit Court of Appeal 's 3
the Pentagon 's investigation is Private first class Bradley Manning ,
the raid : Male Officer : Did
that the use of flap valves limits the reliability and longevity
billion , according to a Venture Beat report that cites anonymous
worked out perfectly for Gre ipel . The one
( including the use of female pronouns like " she and
. [ color = red ][ b ] U f ou
More : Law suit : UC has ' anti
- blood spew ing Vul v atron . The
Prepar ation Goes A Long Way Con ventions usually
a contempt ible class of corporate fascists , ego maniac al
. [ color = red ][ b ] Mega Pony [/
alongside nurses Jan ice and Miss Pig gy . Though considered
dummy will go a long way . We can only hope
step further and eliminate the valves ? The prime
" Mr W izard " Cue ll ar , EV O

INTERVAL 4.205 - 8.624
CONTAINS 0.400%

, 0 ) black hat = c v 2 . morphology
hull . In an often grim world , the GL OO
I recently covered Ruby block basics in my post , More
During the 2008 presidential campaign , the NRA Political
versus cue ). https :// doi . org / 10 .

INTERVAL -0.213 - 4.205
CONTAINS 55.608%

s a group of people who make a living churn
estate boom raised rents beyond the reach of many people ,
shifted over time . During some periods it was the buffer
| ISI P el leg r ini , A
, we just took away the 1 mana ability on that

INTERVAL -4.632 - -0.213
CONTAINS 43.818%

You may need to pull up the e - bra ke
of very good chances and in international football you have to
said Jess a . In Josh
causes you to land in jail . The trouble is that
. Connor s play with Blake Wheeler

INTERVAL -9.050 - -4.632
CONTAINS 0.173%

in common between the Mil it ants and the so -
. [ color = red ][ b ] U f ou
the political spectrum to make transgender people seem ridiculous . It
can basically just count those up and figure out how long
she advised she will also hear the defense s
Token moved
Token ID3888
Feature activation+1.94e+00

Pos logit contributions
 toug+0.136
 doct+0.136
 communal+0.134
 grips+0.133
 indo+0.132
Neg logit contributions
umber-0.168
ANGE-0.167
Oracle-0.165
 Banks-0.161
 Pompe-0.160
Token to
Token ID284
Feature activation-2.80e-01

Pos logit contributions
 toug+1.405
 doct+1.404
 communal+1.382
 grips+1.376
 whis+1.364
Neg logit contributions
ANGE-1.409
umber-1.407
Oracle-1.376
 Banks-1.339
 Pompe-1.326
Token the
Token ID262
Feature activation+7.80e-01

Pos logit contributions
umber+0.200
ANGE+0.199
Oracle+0.196
 Banks+0.191
 Pompe+0.188
Neg logit contributions
 toug-0.206
 doct-0.205
 communal-0.202
 grips-0.202
 indo-0.200
Token less
Token ID1342
Feature activation-8.32e-01

Pos logit contributions
 toug+0.576
 doct+0.572
 communal+0.567
 grips+0.566
 indo+0.559
Neg logit contributions
umber-0.556
ANGE-0.552
Oracle-0.544
 Banks-0.528
 Pompe-0.523
Token-
Token ID12
Feature activation+1.89e-01

Pos logit contributions
umber+0.675
ANGE+0.673
Oracle+0.663
 Banks+0.651
 Pompe+0.645
Neg logit contributions
 toug-0.525
 doct-0.522
 livest-0.515
 communal-0.514
 grips-0.514
Tokenrest
Token ID2118
Feature activation+8.62e+00

Pos logit contributions
 toug+0.132
 doct+0.132
 communal+0.130
 grips+0.129
 indo+0.128
Neg logit contributions
umber-0.141
ANGE-0.141
Oracle-0.138
 Banks-0.136
 Pompe-0.135
Tokenrict
Token ID2012
Feature activation+5.62e+00

Pos logit contributions
 doct+5.065
 toug+5.031
 grips+4.917
 communal+4.889
 whis+4.669
Neg logit contributions
umber-7.133
Oracle-7.078
ANGE-7.037
 Pompe-6.912
 Banks-6.785
Tokenive
Token ID425
Feature activation+1.36e+00

Pos logit contributions
 toug+3.112
 doct+3.081
 communal+3.064
 grips+3.057
 recip+2.946
Neg logit contributions
umber-4.922
ANGE-4.907
Oracle-4.813
 Banks-4.744
 Pompe-4.691
Token Commerce
Token ID16127
Feature activation-9.76e-01

Pos logit contributions
 toug+0.907
 doct+0.899
 grips+0.892
 communal+0.891
 indo+0.876
Neg logit contributions
umber-1.061
ANGE-1.051
Oracle-1.041
 Banks-1.010
 Pompe-0.997
Token Control
Token ID6779
Feature activation+4.91e-01

Pos logit contributions
umber+0.686
ANGE+0.682
Oracle+0.671
 Banks+0.654
 Pompe+0.648
Neg logit contributions
 toug-0.727
 doct-0.723
 communal-0.715
 grips-0.714
 livest-0.709
Token List
Token ID7343
Feature activation-2.46e+00

Pos logit contributions
 toug+0.358
 doct+0.355
 grips+0.353
 communal+0.351
 indo+0.347
Neg logit contributions
umber-0.351
ANGE-0.351
Oracle-0.346
 Banks-0.333
 Pompe-0.332
Token data
Token ID1366
Feature activation+4.46e+00

Pos logit contributions
umber+0.512
ANGE+0.509
Oracle+0.492
 Banks+0.483
 Pompe+0.474
Neg logit contributions
 doct-0.795
 toug-0.795
 communal-0.785
 grips-0.785
 livest-0.782
Token:
Token ID25
Feature activation+2.40e+00

Pos logit contributions
 doct+3.420
 toug+3.412
 communal+3.375
 grips+3.342
 livest+3.329
Neg logit contributions
umber-3.006
ANGE-2.967
Oracle-2.947
 Pompe-2.869
 Banks-2.817
Token alcohol
Token ID5548
Feature activation+3.04e+00

Pos logit contributions
 toug+1.597
 doct+1.577
 communal+1.558
 grips+1.556
 livest+1.551
Neg logit contributions
umber-1.883
ANGE-1.875
Oracle-1.856
 Banks-1.801
 Pompe-1.792
Token-
Token ID12
Feature activation+1.74e+00

Pos logit contributions
 toug+2.157
 doct+2.142
 communal+2.115
 recip+2.108
 grips+2.106
Neg logit contributions
umber-2.228
Oracle-2.222
ANGE-2.209
 Pompe-2.148
 Banks-2.123
Tokenim
Token ID320
Feature activation+3.40e+00

Pos logit contributions
 toug+1.387
 doct+1.380
 communal+1.363
 recip+1.353
 whis+1.352
Neg logit contributions
umber-1.112
ANGE-1.111
Oracle-1.102
 Pompe-1.077
 Banks-1.068
Tokenpa
Token ID8957
Feature activation+8.34e+00

Pos logit contributions
 doct+2.354
 toug+2.329
 recip+2.320
 communal+2.286
 grips+2.269
Neg logit contributions
ANGE-2.596
Oracle-2.526
umber-2.519
 Pompe-2.464
 Banks-2.399
Tokenired
Token ID1202
Feature activation+1.80e+00

Pos logit contributions
 grips+2.834
 toug+2.741
 doct+2.642
 communal+2.561
 recip+2.550
Neg logit contributions
umber-8.929
ANGE-8.875
Oracle-8.838
 Pompe-8.624
 Banks-8.541
Token driving
Token ID5059
Feature activation+8.29e-01

Pos logit contributions
 toug+1.169
 doct+1.167
 communal+1.157
 grips+1.156
 whis+1.146
Neg logit contributions
umber-1.427
ANGE-1.420
Oracle-1.406
 Banks-1.364
 Pompe-1.359
Token.
Token ID13
Feature activation-3.92e-01

Pos logit contributions
 toug+0.613
 doct+0.610
 communal+0.602
 grips+0.600
 livest+0.597
Neg logit contributions
umber-0.587
ANGE-0.584
Oracle-0.575
 Banks-0.560
 Pompe-0.557
Token Washington
Token ID2669
Feature activation+1.59e+00

Pos logit contributions
umber+0.275
ANGE+0.274
Oracle+0.269
 Banks+0.264
 Pompe+0.261
Neg logit contributions
 toug-0.292
 doct-0.291
 communal-0.288
 grips-0.287
 livest-0.286
Token,
Token ID11
Feature activation-1.55e+00

Pos logit contributions
 toug+1.287
 doct+1.283
 communal+1.268
 grips+1.264
 livest+1.261
Neg logit contributions
umber-1.018
ANGE-1.012
Oracle-0.992
 Banks-0.970
 Pompe-0.958
Token ideas
Token ID4213
Feature activation+1.01e+00

Pos logit contributions
umber+0.793
ANGE+0.790
Oracle+0.778
 Banks+0.762
 Pompe+0.756
Neg logit contributions
 toug-0.605
 doct-0.603
 communal-0.594
 grips-0.592
 indo-0.588
Token,
Token ID11
Feature activation+5.48e-01

Pos logit contributions
 doct+0.772
 toug+0.772
 communal+0.762
 grips+0.762
 indo+0.754
Neg logit contributions
umber-0.693
ANGE-0.691
Oracle-0.682
 Banks-0.655
 Pompe-0.651
Token is
Token ID318
Feature activation-1.03e+00

Pos logit contributions
 toug+0.427
 doct+0.425
 grips+0.421
 communal+0.420
 indo+0.417
Neg logit contributions
umber-0.370
ANGE-0.366
Oracle-0.362
 Banks-0.350
 Pompe-0.346
Token fairly
Token ID6547
Feature activation+2.94e-01

Pos logit contributions
umber+0.769
ANGE+0.766
Oracle+0.753
 Banks+0.735
 Pompe+0.727
Neg logit contributions
 toug-0.728
 doct-0.725
 communal-0.716
 grips-0.711
 indo-0.706
Token imp
Token ID848
Feature activation-1.10e+00

Pos logit contributions
 doct+0.209
 toug+0.208
 communal+0.205
 grips+0.204
 indo+0.202
Neg logit contributions
umber-0.221
ANGE-0.220
Oracle-0.216
 Banks-0.210
 Pompe-0.207
Tokenen
Token ID268
Feature activation+7.92e+00

Pos logit contributions
umber+0.602
ANGE+0.598
Oracle+0.585
 Banks+0.568
 Pompe+0.561
Neg logit contributions
 toug-0.985
 doct-0.981
 communal-0.971
 grips-0.969
 livest-0.965
Tokenetr
Token ID21879
Feature activation+1.93e+00

Pos logit contributions
 grips+4.382
 toug+4.374
 doct+4.363
 cooker+4.264
 communal+4.234
Neg logit contributions
ANGE-7.017
Oracle-6.660
 Banks-6.643
 Pompe-6.446
TION-6.356
Tokenable
Token ID540
Feature activation+1.50e+00

Pos logit contributions
 toug+1.710
 doct+1.707
 communal+1.692
 grips+1.679
 whis+1.669
Neg logit contributions
ANGE-1.070
Oracle-1.048
umber-1.035
 Banks-1.010
 Pompe-0.999
Token.
Token ID13
Feature activation-8.46e-01

Pos logit contributions
 toug+1.223
 doct+1.219
 communal+1.205
 grips+1.204
 whis+1.196
Neg logit contributions
ANGE-0.950
umber-0.947
Oracle-0.925
 Banks-0.896
 Pompe-0.889
Token On
Token ID1550
Feature activation-9.14e-01

Pos logit contributions
umber+0.591
ANGE+0.584
Oracle+0.573
 Banks+0.550
 Pompe+0.548
Neg logit contributions
 doct-0.635
 toug-0.632
 grips-0.621
 communal-0.620
 whis-0.614
Token the
Token ID262
Feature activation+1.58e+00

Pos logit contributions
umber+0.634
ANGE+0.631
Oracle+0.621
 Banks+0.605
 Pompe+0.599
Neg logit contributions
 toug-0.690
 doct-0.687
 communal-0.679
 grips-0.677
 indo-0.673
Token researchers
Token ID4837
Feature activation+7.95e-01

Pos logit contributions
 toug+0.089
 doct+0.089
 communal+0.087
 grips+0.087
 livest+0.086
Neg logit contributions
umber-0.130
ANGE-0.129
Oracle-0.127
 Banks-0.125
 Pompe-0.124
Token.
Token ID13
Feature activation+3.95e-01

Pos logit contributions
 toug+0.682
 doct+0.680
 communal+0.672
 grips+0.670
 livest+0.668
Neg logit contributions
umber-0.466
ANGE-0.465
Oracle-0.455
 Banks-0.443
 Pompe-0.438
Token
Token ID198
Feature activation+4.08e-01

Pos logit contributions
 toug+0.308
 doct+0.307
 grips+0.302
 communal+0.301
 indo+0.299
Neg logit contributions
umber-0.263
ANGE-0.261
Oracle-0.256
 Banks-0.248
 Pompe-0.246
Token
Token ID198
Feature activation+3.02e-01

Pos logit contributions
 toug+0.299
 doct+0.298
 grips+0.294
 communal+0.288
 whis+0.287
Neg logit contributions
umber-0.286
ANGE-0.286
Oracle-0.278
 Pompe-0.271
 Banks-0.269
TokenAs
Token ID1722
Feature activation-7.18e-02

Pos logit contributions
 toug+0.231
 doct+0.230
 grips+0.227
 communal+0.226
 whis+0.224
Neg logit contributions
umber-0.203
ANGE-0.201
Oracle-0.194
 Banks-0.194
 Pompe-0.193
Token if
Token ID611
Feature activation+7.76e+00

Pos logit contributions
umber+0.053
ANGE+0.053
Oracle+0.052
 Banks+0.050
 Pompe+0.050
Neg logit contributions
 toug-0.051
 doct-0.051
 communal-0.050
 grips-0.050
 whis-0.050
Token all
Token ID477
Feature activation-3.46e-01

Pos logit contributions
 toug+5.096
 doct+4.921
 grips+4.871
 communal+4.809
 whis+4.718
Neg logit contributions
ANGE-6.062
umber-5.970
Oracle-5.904
 Banks-5.564
 Pompe-5.515
Token of
Token ID286
Feature activation-2.28e+00

Pos logit contributions
umber+0.259
ANGE+0.259
Oracle+0.255
 Banks+0.248
 Pompe+0.246
Neg logit contributions
 toug-0.242
 doct-0.241
 communal-0.237
 grips-0.237
 livest-0.236
Token this
Token ID428
Feature activation-2.80e-01

Pos logit contributions
umber+1.782
ANGE+1.775
Oracle+1.747
 Banks+1.713
 Pompe+1.697
Neg logit contributions
 toug-1.508
 doct-1.501
 communal-1.480
 grips-1.475
 livest-1.470
Token not
Token ID407
Feature activation+2.04e-01

Pos logit contributions
umber+0.215
ANGE+0.214
Oracle+0.211
 Banks+0.206
 Pompe+0.204
Neg logit contributions
 toug-0.190
 doct-0.189
 communal-0.187
 grips-0.186
 indo-0.185
Token enough
Token ID1576
Feature activation-2.93e+00

Pos logit contributions
 toug+0.148
 doct+0.148
 communal+0.146
 grips+0.145
 indo+0.144
Neg logit contributions
umber-0.147
ANGE-0.146
Oracle-0.144
 Banks-0.141
 Pompe-0.140
Token Maze
Token ID33412
Feature activation+1.41e+00

Pos logit contributions
umber+0.032
ANGE+0.032
Oracle+0.031
 Banks+0.030
 Pompe+0.030
Neg logit contributions
 toug-0.035
 doct-0.035
 communal-0.034
 grips-0.034
 indo-0.034
Token design
Token ID1486
Feature activation+2.68e+00

Pos logit contributions
 doct+1.065
 toug+1.064
 grips+1.050
 communal+1.049
 indo+1.039
Neg logit contributions
ANGE-0.974
umber-0.974
Oracle-0.961
 Banks-0.919
 Pompe-0.916
Token.
Token ID13
Feature activation+1.68e+00

Pos logit contributions
 doct+1.932
 toug+1.918
 communal+1.907
 grips+1.896
 indo+1.886
Neg logit contributions
umber-1.922
ANGE-1.916
Oracle-1.888
 Banks-1.814
 Pompe-1.804
Token A
Token ID317
Feature activation+2.54e+00

Pos logit contributions
 doct+1.220
 toug+1.215
 communal+1.201
 grips+1.195
 indo+1.186
Neg logit contributions
umber-1.206
ANGE-1.196
Oracle-1.174
 Banks-1.136
 Pompe-1.128
Token 3
Token ID513
Feature activation+2.67e+00

Pos logit contributions
 doct+1.658
 toug+1.657
 communal+1.645
 grips+1.617
 indo+1.603
Neg logit contributions
umber-2.018
ANGE-2.007
Oracle-1.981
 Banks-1.928
 Pompe-1.905
Token×
Token ID12906
Feature activation+7.49e+00

Pos logit contributions
 doct+2.039
 toug+2.035
 communal+2.026
 grips+1.994
 indo+1.989
Neg logit contributions
umber-1.813
ANGE-1.799
Oracle-1.786
 Banks-1.729
 Pompe-1.713
Token3
Token ID18
Feature activation+3.15e+00

Pos logit contributions
 toug+5.650
 communal+5.641
 doct+5.641
 whis+5.413
 cooker+5.394
Neg logit contributions
umber-4.877
Oracle-4.791
ANGE-4.782
 Banks-4.757
 Pompe-4.655
Token room
Token ID2119
Feature activation+5.38e+00

Pos logit contributions
 toug+2.004
 doct+1.998
 communal+1.986
 indo+1.945
 grips+1.941
Neg logit contributions
umber-2.538
ANGE-2.528
Oracle-2.502
 Banks-2.442
 Pompe-2.415
Token maze
Token ID31237
Feature activation+8.39e-02

Pos logit contributions
 toug+3.348
 communal+3.341
 doct+3.325
 grips+3.249
 indo+3.232
Neg logit contributions
umber-4.378
ANGE-4.329
Oracle-4.328
 Banks-4.219
 Pompe-4.134
Token was
Token ID373
Feature activation+1.15e+00

Pos logit contributions
 toug+0.059
 doct+0.059
 communal+0.058
 grips+0.058
 indo+0.058
Neg logit contributions
umber-0.062
ANGE-0.062
Oracle-0.061
 Banks-0.059
 Pompe-0.059
Token designed
Token ID3562
Feature activation+1.74e+00

Pos logit contributions
 toug+0.721
 doct+0.718
 communal+0.707
 grips+0.697
 indo+0.696
Neg logit contributions
ANGE-0.953
umber-0.953
Oracle-0.938
 Banks-0.914
 Pompe-0.905
Token crumbling
Token ID37550
Feature activation+3.22e+00

Pos logit contributions
 toug+0.808
 communal+0.804
 doct+0.804
 grips+0.797
 whis+0.785
Neg logit contributions
umber-0.937
ANGE-0.936
Oracle-0.923
 Pompe-0.890
 Banks-0.884
Token sidewalks
Token ID38282
Feature activation-1.96e-01

Pos logit contributions
 communal+1.850
 toug+1.826
 doct+1.824
 grips+1.824
 whis+1.793
Neg logit contributions
ANGE-2.796
umber-2.772
Oracle-2.765
 Pompe-2.625
 Banks-2.615
Token to
Token ID284
Feature activation+1.33e-01

Pos logit contributions
umber+0.118
ANGE+0.118
Oracle+0.115
 Banks+0.112
 Pompe+0.111
Neg logit contributions
 toug-0.166
 doct-0.165
 communal-0.163
 grips-0.163
 livest-0.162
Token the
Token ID262
Feature activation+1.92e-01

Pos logit contributions
 toug+0.089
 doct+0.089
 communal+0.088
 grips+0.087
 livest+0.086
Neg logit contributions
umber-0.103
ANGE-0.103
Oracle-0.101
 Banks-0.098
 Pompe-0.098
Token hom
Token ID3488
Feature activation-5.05e-01

Pos logit contributions
 toug+0.120
 doct+0.120
 communal+0.119
 grips+0.118
 indo+0.116
Neg logit contributions
umber-0.158
ANGE-0.158
Oracle-0.155
 Banks-0.151
 Pompe-0.151
Tokenew
Token ID413
Feature activation+7.48e+00

Pos logit contributions
ANGE+0.449
umber+0.445
Oracle+0.442
 Banks+0.432
 Pompe+0.429
Neg logit contributions
 toug-0.282
 doct-0.281
 communal-0.275
 grips-0.275
 livest-0.273
Tokenon
Token ID261
Feature activation+2.29e+00

Pos logit contributions
 doct+4.406
 toug+4.381
 whis+4.349
 communal+4.310
 grips+4.273
Neg logit contributions
ANGE-6.070
Oracle-6.030
 Banks-5.894
umber-5.804
 Pompe-5.786
Tokeners
Token ID364
Feature activation+2.20e+00

Pos logit contributions
 communal+1.796
 toug+1.790
 grips+1.786
 doct+1.783
 whis+1.764
Neg logit contributions
ANGE-1.489
Oracle-1.461
umber-1.447
 Pompe-1.416
 Banks-1.408
Token owning
Token ID23107
Feature activation+1.26e+00

Pos logit contributions
 toug+1.717
 doct+1.710
 communal+1.698
 grips+1.680
 whis+1.668
Neg logit contributions
ANGE-1.450
umber-1.438
Oracle-1.419
 Pompe-1.381
 Banks-1.373
Token the
Token ID262
Feature activation-1.11e+00

Pos logit contributions
 doct+0.860
 toug+0.858
 communal+0.857
 grips+0.845
 whis+0.836
Neg logit contributions
umber-0.965
ANGE-0.962
Oracle-0.946
 Banks-0.917
 Pompe-0.913
Token houses
Token ID7777
Feature activation-1.19e+00

Pos logit contributions
umber+0.991
ANGE+0.987
Oracle+0.974
 Banks+0.957
 Pompe+0.949
Neg logit contributions
 toug-0.620
 doct-0.617
 communal-0.606
 grips-0.604
 livest-0.601
TokenI
Token ID40
Feature activation+9.64e-01

Pos logit contributions
umber+0.313
ANGE+0.312
Oracle+0.306
 Banks+0.299
 Pompe+0.295
Neg logit contributions
 toug-0.381
 doct-0.380
 communal-0.375
 grips-0.374
 livest-0.373
Token considered
Token ID3177
Feature activation+4.47e-01

Pos logit contributions
 toug+0.661
 doct+0.658
 communal+0.645
 grips+0.644
 livest+0.638
Neg logit contributions
umber-0.738
ANGE-0.735
Oracle-0.729
 Banks-0.705
 Pompe-0.703
Token letting
Token ID9616
Feature activation-6.02e-01

Pos logit contributions
 toug+0.308
 doct+0.306
 communal+0.303
 grips+0.301
 livest+0.299
Neg logit contributions
umber-0.339
ANGE-0.338
Oracle-0.333
 Banks-0.324
 Pompe-0.322
Token this
Token ID428
Feature activation+1.13e+00

Pos logit contributions
umber+0.429
ANGE+0.427
Oracle+0.420
 Banks+0.410
 Pompe+0.406
Neg logit contributions
 toug-0.442
 doct-0.440
 communal-0.435
 grips-0.434
 livest-0.432
Token game
Token ID983
Feature activation+4.18e-01

Pos logit contributions
 toug+0.872
 doct+0.869
 communal+0.860
 grips+0.856
 livest+0.851
Neg logit contributions
ANGE-0.758
umber-0.754
Oracle-0.744
 Pompe-0.716
 Banks-0.715
Token rest
Token ID1334
Feature activation+7.43e+00

Pos logit contributions
 toug+0.308
 doct+0.305
 communal+0.302
 grips+0.302
 livest+0.299
Neg logit contributions
ANGE-0.299
umber-0.298
Oracle-0.294
 Banks-0.284
 Pompe-0.283
Token this
Token ID428
Feature activation+4.90e-01

Pos logit contributions
 toug+5.163
 doct+5.161
 grips+5.141
 communal+5.117
 livest+5.035
Neg logit contributions
ANGE-5.279
Oracle-5.166
umber-5.113
 Pompe-5.020
 Banks-4.948
Token year
Token ID614
Feature activation+4.98e-01

Pos logit contributions
 toug+0.348
 doct+0.347
 communal+0.344
 grips+0.342
 indo+0.340
Neg logit contributions
umber-0.362
ANGE-0.361
Oracle-0.355
 Banks-0.345
 Pompe-0.343
Token but
Token ID475
Feature activation-8.68e-01

Pos logit contributions
 toug+0.392
 doct+0.390
 communal+0.387
 grips+0.386
 whis+0.382
Neg logit contributions
umber-0.327
ANGE-0.326
Oracle-0.320
 Banks-0.311
 Pompe-0.309
Token decided
Token ID3066
Feature activation+4.66e-01

Pos logit contributions
umber+0.627
ANGE+0.623
Oracle+0.613
 Banks+0.597
 Pompe+0.592
Neg logit contributions
 toug-0.632
 doct-0.628
 communal-0.620
 grips-0.619
 livest-0.615
Token that
Token ID326
Feature activation-9.95e-01

Pos logit contributions
 toug+0.393
 doct+0.391
 communal+0.387
 grips+0.385
 livest+0.382
Neg logit contributions
umber-0.283
ANGE-0.282
Oracle-0.277
 Banks-0.267
 Pompe-0.264
Token it
Token ID340
Feature activation+1.01e+00

Pos logit contributions
 toug+0.610
 doct+0.607
 communal+0.602
 grips+0.596
 livest+0.593
Neg logit contributions
umber-0.622
ANGE-0.618
Oracle-0.604
 Banks-0.590
 Pompe-0.584
Token was
Token ID373
Feature activation-4.56e-01

Pos logit contributions
 toug+0.720
 doct+0.715
 communal+0.703
 grips+0.700
 indo+0.695
Neg logit contributions
umber-0.747
ANGE-0.744
Oracle-0.727
 Banks-0.714
 Pompe-0.708
Token introduced
Token ID5495
Feature activation+1.53e+00

Pos logit contributions
umber+0.338
ANGE+0.336
Oracle+0.329
 Banks+0.323
 Pompe+0.319
Neg logit contributions
 toug-0.325
 doct-0.323
 communal-0.319
 grips-0.316
 indo-0.314
Token as
Token ID355
Feature activation+6.54e-01

Pos logit contributions
 toug+1.184
 doct+1.179
 communal+1.170
 grips+1.160
 whis+1.147
Neg logit contributions
ANGE-1.024
umber-1.021
Oracle-0.995
 Banks-0.969
 Pompe-0.963
Token an
Token ID281
Feature activation+1.21e+00

Pos logit contributions
 toug+0.469
 doct+0.468
 communal+0.462
 grips+0.460
 livest+0.458
Neg logit contributions
umber-0.477
ANGE-0.475
Oracle-0.466
 Banks-0.456
 Pompe-0.451
Token Excel
Token ID24134
Feature activation+7.35e+00

Pos logit contributions
 doct+0.741
 toug+0.739
 communal+0.732
 indo+0.724
 grips+0.721
Neg logit contributions
umber-1.006
ANGE-1.004
Oracle-0.981
 Banks-0.967
 Pompe-0.963
Token add
Token ID751
Feature activation+7.47e-01

Pos logit contributions
 doct+4.788
 communal+4.773
 toug+4.639
 whis+4.603
 recip+4.590
Neg logit contributions
ANGE-5.532
umber-5.526
Oracle-5.450
 Pompe-5.370
 Banks-5.362
Token-
Token ID12
Feature activation+4.90e+00

Pos logit contributions
 toug+0.556
 doct+0.553
 communal+0.546
 grips+0.545
 livest+0.543
Neg logit contributions
umber-0.525
ANGE-0.522
Oracle-0.513
 Banks-0.502
 Pompe-0.497
Tokenin
Token ID259
Feature activation+3.89e+00

Pos logit contributions
 doct+4.484
 toug+4.481
 communal+4.458
 livest+4.398
 recip+4.396
Neg logit contributions
ANGE-2.546
umber-2.500
Oracle-2.428
 Banks-2.389
 Pompe-2.387
Token back
Token ID736
Feature activation+2.78e+00

Pos logit contributions
 doct+2.836
 toug+2.822
 communal+2.810
 whis+2.763
 recip+2.756
Neg logit contributions
ANGE-2.775
umber-2.733
Oracle-2.693
 Banks-2.620
 Pompe-2.586
Token in
Token ID287
Feature activation+1.02e+00

Pos logit contributions
 doct+2.194
 toug+2.187
 communal+2.169
 indo+2.131
 recip+2.126
Neg logit contributions
ANGE-1.809
umber-1.790
Oracle-1.769
 Banks-1.732
 Pompe-1.722
Token
Token ID198
Feature activation-4.32e-01

Pos logit contributions
umber+0.101
ANGE+0.100
Oracle+0.098
 Pompe+0.095
 Banks+0.094
Neg logit contributions
 doct-0.105
 toug-0.105
 grips-0.104
 communal-0.102
 whis-0.101
TokenAB
Token ID6242
Feature activation+7.38e-01

Pos logit contributions
umber+0.296
ANGE+0.294
Oracle+0.288
 Banks+0.283
 Pompe+0.280
Neg logit contributions
 toug-0.328
 doct-0.326
 communal-0.322
 grips-0.322
 livest-0.320
Token:
Token ID25
Feature activation+1.97e+00

Pos logit contributions
 doct+0.616
 toug+0.611
 grips+0.605
 communal+0.598
 whis+0.596
Neg logit contributions
umber-0.442
ANGE-0.436
Oracle-0.428
 Pompe-0.413
 Banks-0.412
Token [
Token ID685
Feature activation+1.63e+00

Pos logit contributions
 doct+1.353
 toug+1.340
 grips+1.320
 communal+1.309
 whis+1.291
Neg logit contributions
umber-1.494
ANGE-1.468
Oracle-1.454
 Banks-1.359
 Pompe-1.354
TokenAs
Token ID1722
Feature activation-6.93e-01

Pos logit contributions
 toug+1.268
 doct+1.262
 grips+1.252
 communal+1.249
 whis+1.240
Neg logit contributions
umber-1.069
ANGE-1.066
Oracle-1.032
 Pompe-1.022
 Banks-1.022
Token if
Token ID611
Feature activation+7.35e+00

Pos logit contributions
umber+0.473
ANGE+0.470
Oracle+0.464
 Banks+0.449
 Pompe+0.447
Neg logit contributions
 toug-0.527
 doct-0.527
 communal-0.520
 grips-0.519
 livest-0.514
Token counting
Token ID14143
Feature activation-1.23e+00

Pos logit contributions
 toug+5.242
 doct+5.199
 grips+5.106
 communal+5.052
 recip+4.938
Neg logit contributions
ANGE-5.176
umber-5.157
Oracle-5.115
 Banks-4.726
 Pompe-4.691
Token before
Token ID878
Feature activation-2.34e+00

Pos logit contributions
umber+0.846
ANGE+0.843
Oracle+0.830
 Banks+0.805
 Pompe+0.798
Neg logit contributions
 toug-0.928
 doct-0.926
 communal-0.915
 grips-0.912
 indo-0.906
Token a
Token ID257
Feature activation-3.00e-01

Pos logit contributions
umber+1.610
ANGE+1.602
Oracle+1.570
 Banks+1.542
 Pompe+1.523
Neg logit contributions
 toug-1.772
 doct-1.762
 communal-1.741
 grips-1.736
 livest-1.733
Token song
Token ID3496
Feature activation-3.32e+00

Pos logit contributions
umber+0.232
ANGE+0.230
Oracle+0.227
 Banks+0.220
 Pompe+0.219
Neg logit contributions
 doct-0.204
 toug-0.204
 communal-0.201
 grips-0.199
 livest-0.198
Token]
Token ID60
Feature activation-2.23e+00

Pos logit contributions
umber+2.025
ANGE+2.000
Oracle+1.969
 Banks+1.927
 Pompe+1.892
Neg logit contributions
 toug-2.755
 doct-2.742
 communal-2.714
 livest-2.707
 indo-2.702
Token bone
Token ID9970
Feature activation-1.82e+00

Pos logit contributions
umber+2.567
ANGE+2.559
Oracle+2.511
 Banks+2.496
 Pompe+2.480
Neg logit contributions
 doct-1.501
 toug-1.488
 livest-1.476
 indo-1.475
 communal-1.470
Token.
Token ID13
Feature activation-1.04e+00

Pos logit contributions
umber+1.114
ANGE+1.105
Oracle+1.079
 Banks+1.065
 Pompe+1.050
Neg logit contributions
 doct-1.504
 toug-1.501
 livest-1.492
 communal-1.489
 indo-1.486
Token And
Token ID843
Feature activation-5.71e-01

Pos logit contributions
umber+0.719
ANGE+0.715
Oracle+0.703
 Banks+0.687
 Pompe+0.680
Neg logit contributions
 toug-0.792
 doct-0.788
 communal-0.778
 grips-0.776
 livest-0.774
Token,
Token ID11
Feature activation+7.89e-01

Pos logit contributions
umber+0.418
ANGE+0.415
Oracle+0.408
 Banks+0.398
 Pompe+0.393
Neg logit contributions
 toug-0.412
 doct-0.409
 grips-0.405
 communal-0.404
 livest-0.400
Token as
Token ID355
Feature activation+1.09e+00

Pos logit contributions
 toug+0.579
 doct+0.572
 grips+0.569
 communal+0.567
 whis+0.560
Neg logit contributions
umber-0.569
ANGE-0.568
Oracle-0.557
 Banks-0.538
 Pompe-0.532
Token if
Token ID611
Feature activation+7.33e+00

Pos logit contributions
 toug+0.783
 doct+0.776
 communal+0.768
 grips+0.765
 livest+0.752
Neg logit contributions
umber-0.806
ANGE-0.801
Oracle-0.785
 Banks-0.757
 Pompe-0.749
Token that
Token ID326
Feature activation+1.79e-01

Pos logit contributions
 toug+5.133
 doct+4.975
 grips+4.899
 communal+4.810
 whis+4.731
Neg logit contributions
ANGE-5.513
umber-5.476
Oracle-5.280
 Banks-5.024
 Pompe-4.910
Token wasn
Token ID2492
Feature activation+1.51e-01

Pos logit contributions
 toug+0.117
 doct+0.116
 communal+0.114
 grips+0.114
 indo+0.113
Neg logit contributions
umber-0.143
ANGE-0.142
Oracle-0.140
 Banks-0.137
 Pompe-0.135
Token't
Token ID470
Feature activation-4.82e-01

Pos logit contributions
 toug+0.111
 doct+0.111
 grips+0.109
 communal+0.108
 whis+0.106
Neg logit contributions
ANGE-0.106
umber-0.105
Oracle-0.103
 Pompe-0.101
 Banks-0.101
Token enough
Token ID1576
Feature activation-3.07e+00

Pos logit contributions
umber+0.372
ANGE+0.370
Oracle+0.365
 Banks+0.357
 Pompe+0.353
Neg logit contributions
 toug-0.326
 doct-0.324
 communal-0.319
 grips-0.318
 indo-0.317
Token to
Token ID284
Feature activation+7.13e-01

Pos logit contributions
umber+1.763
ANGE+1.745
Oracle+1.725
 Banks+1.701
 Pompe+1.694
Neg logit contributions
 livest-2.621
 doct-2.596
 toug-2.575
kefeller-2.558
 communal-2.558
Token was
Token ID373
Feature activation-1.13e+00

Pos logit contributions
 toug+0.374
 doct+0.372
 grips+0.365
 communal+0.365
 indo+0.362
Neg logit contributions
umber-0.395
ANGE-0.394
Oracle-0.388
 Banks-0.377
 Pompe-0.372
Token,
Token ID11
Feature activation-1.12e-01

Pos logit contributions
umber+0.807
ANGE+0.803
Oracle+0.789
 Banks+0.771
 Pompe+0.763
Neg logit contributions
 toug-0.836
 doct-0.833
 communal-0.822
 grips-0.819
 livest-0.815
Token and
Token ID290
Feature activation+4.72e-01

Pos logit contributions
umber+0.078
ANGE+0.077
Oracle+0.076
 Banks+0.073
 Pompe+0.072
Neg logit contributions
 toug-0.084
 doct-0.084
 communal-0.083
 grips-0.083
 whis-0.082
Token what
Token ID644
Feature activation+1.07e-01

Pos logit contributions
 toug+0.343
 doct+0.341
 communal+0.338
 grips+0.336
 whis+0.332
Neg logit contributions
umber-0.344
ANGE-0.341
Oracle-0.335
 Banks-0.324
 Pompe-0.320
Token Ad
Token ID1215
Feature activation-3.81e-01

Pos logit contributions
 toug+0.079
 doct+0.079
 grips+0.078
 communal+0.078
 whis+0.077
Neg logit contributions
umber-0.075
ANGE-0.075
Oracle-0.074
 Banks-0.071
 Pompe-0.070
Tokene
Token ID68
Feature activation+7.30e+00

Pos logit contributions
ANGE+0.240
umber+0.238
Oracle+0.233
 Banks+0.226
 Pompe+0.225
Neg logit contributions
 toug-0.310
 doct-0.309
 grips-0.306
 communal-0.305
 indo-0.303
Tokeno
Token ID78
Feature activation+3.18e+00

Pos logit contributions
 toug+5.542
 grips+5.522
 doct+5.471
 communal+5.437
 indo+5.307
Neg logit contributions
ANGE-4.796
umber-4.764
Oracle-4.608
 Pompe-4.412
 Banks-4.377
Token R
Token ID371
Feature activation+9.15e-01

Pos logit contributions
 doct+2.437
 toug+2.420
 grips+2.399
 communal+2.398
 whis+2.367
Neg logit contributions
ANGE-2.140
umber-2.130
Oracle-2.094
 Banks-1.990
 Pompe-1.983
Tokeness
Token ID408
Feature activation-2.48e+00

Pos logit contributions
 toug+0.706
 doct+0.705
 grips+0.695
 communal+0.692
 indo+0.690
Neg logit contributions
ANGE-0.610
umber-0.607
Oracle-0.603
 Banks-0.585
 Pompe-0.580
Tokeni
Token ID72
Feature activation+8.08e-01

Pos logit contributions
umber+1.626
ANGE+1.606
Oracle+1.573
 Banks+1.541
 Pompe+1.526
Neg logit contributions
 toug-1.970
 doct-1.962
 communal-1.946
 livest-1.935
 indo-1.931
Token warned
Token ID7728
Feature activation-3.18e-01

Pos logit contributions
 toug+0.598
 doct+0.597
 grips+0.587
 communal+0.587
 indo+0.579
Neg logit contributions
ANGE-0.569
umber-0.565
Oracle-0.558
 Banks-0.540
 Pompe-0.536
Token)
Token ID8
Feature activation-7.49e-02

Pos logit contributions
umber+0.329
ANGE+0.327
Oracle+0.316
 Banks+0.301
 Pompe+0.295
Neg logit contributions
 toug-0.983
 doct-0.980
 communal-0.972
 grips-0.970
 indo-0.967
Token 401
Token ID22219
Feature activation+1.62e+00

Pos logit contributions
umber+0.051
ANGE+0.051
Oracle+0.050
 Banks+0.049
 Pompe+0.048
Neg logit contributions
 toug-0.057
 doct-0.057
 communal-0.056
 grips-0.056
 livest-0.056
Token-
Token ID12
Feature activation+1.21e+00

Pos logit contributions
 toug+1.227
 doct+1.219
 communal+1.206
 grips+1.202
 livest+1.195
Neg logit contributions
umber-1.117
ANGE-1.112
Oracle-1.095
 Banks-1.067
 Pompe-1.056
Token414
Token ID37309
Feature activation+1.60e+00

Pos logit contributions
 toug+0.870
 doct+0.867
 grips+0.849
 communal+0.846
 whis+0.838
Neg logit contributions
umber-0.867
ANGE-0.860
Oracle-0.842
 Banks-0.827
 Pompe-0.821
Token;
Token ID26
Feature activation+1.46e+00

Pos logit contributions
 toug+1.391
 doct+1.382
 communal+1.372
 grips+1.364
 indo+1.356
Neg logit contributions
umber-0.921
ANGE-0.913
Oracle-0.898
 Banks-0.867
 Pompe-0.857
Token doi
Token ID23899
Feature activation+7.25e+00

Pos logit contributions
 toug+0.522
 doct+0.517
 communal+0.503
 grips+0.500
 livest+0.497
Neg logit contributions
umber-1.593
ANGE-1.588
Oracle-1.571
 Banks-1.548
 Pompe-1.538
Token:
Token ID25
Feature activation+5.58e+00

Pos logit contributions
 doct+5.173
 toug+5.104
 grips+5.082
 communal+4.924
 cooker+4.832
Neg logit contributions
umber-4.769
ANGE-4.672
 Banks-4.585
Oracle-4.547
 Pompe-4.496
Token 10
Token ID838
Feature activation+3.90e+00

Pos logit contributions
 toug+4.138
 doct+4.085
 grips+4.059
 communal+4.057
 whis+3.999
Neg logit contributions
umber-3.836
ANGE-3.793
Oracle-3.745
 Banks-3.642
 Pompe-3.573
Token.
Token ID13
Feature activation+2.76e+00

Pos logit contributions
 doct+2.900
 toug+2.899
 grips+2.859
 whis+2.801
 communal+2.792
Neg logit contributions
ANGE-2.609
umber-2.605
Oracle-2.497
 Pompe-2.434
 Banks-2.402
Token10
Token ID940
Feature activation+2.32e+00

Pos logit contributions
 toug+2.075
 doct+2.055
 communal+2.016
 livest+2.010
 whis+2.006
Neg logit contributions
umber-1.825
ANGE-1.811
Oracle-1.775
 Pompe-1.747
 Banks-1.740
Token71
Token ID4869
Feature activation+1.53e+00

Pos logit contributions
 livest+1.705
 toug+1.699
 communal+1.694
 grips+1.673
 doct+1.672
Neg logit contributions
ANGE-1.601
umber-1.569
Oracle-1.568
 Pompe-1.567
 Banks-1.511
Tokenfriendly
Token ID13120
Feature activation+6.64e-01

Pos logit contributions
umber+0.081
ANGE+0.080
Oracle+0.079
 Banks+0.077
 Pompe+0.077
Neg logit contributions
 toug-0.094
 doct-0.094
 communal-0.093
 grips-0.092
 livest-0.092
Token formats
Token ID17519
Feature activation+1.29e-01

Pos logit contributions
 doct+0.317
 toug+0.317
 communal+0.312
 grips+0.312
 livest+0.306
Neg logit contributions
umber-0.645
ANGE-0.645
Oracle-0.635
 Banks-0.622
 Pompe-0.619
Token
Token ID198
Feature activation+3.49e-02

Pos logit contributions
 toug+0.104
 doct+0.103
 grips+0.102
 communal+0.102
 indo+0.101
Neg logit contributions
umber-0.082
ANGE-0.082
Oracle-0.081
 Banks-0.078
 Pompe-0.078
Token
Token ID198
Feature activation+2.95e-01

Pos logit contributions
 doct+0.026
 toug+0.026
 grips+0.026
 communal+0.025
 whis+0.025
Neg logit contributions
umber-0.024
ANGE-0.024
Oracle-0.024
 Pompe-0.023
 Banks-0.023
TokenWhat
Token ID2061
Feature activation+4.31e-01

Pos logit contributions
 doct+0.238
 toug+0.238
 communal+0.234
 grips+0.233
 whis+0.232
Neg logit contributions
umber-0.186
ANGE-0.184
 Banks-0.179
Oracle-0.178
 Pompe-0.177
Token if
Token ID611
Feature activation+7.19e+00

Pos logit contributions
 doct+0.331
 toug+0.329
 grips+0.326
 communal+0.325
 livest+0.322
Neg logit contributions
umber-0.294
ANGE-0.294
Oracle-0.288
 Banks-0.280
 Pompe-0.279
Token you
Token ID345
Feature activation+6.65e-01

Pos logit contributions
 doct+5.262
 toug+5.131
 communal+5.112
 whis+5.087
 recip+5.067
Neg logit contributions
umber-4.982
ANGE-4.936
Oracle-4.880
 Banks-4.710
 Pompe-4.657
Token have
Token ID423
Feature activation+7.54e-01

Pos logit contributions
 doct+0.480
 toug+0.479
 communal+0.469
 grips+0.468
 livest+0.466
Neg logit contributions
umber-0.485
ANGE-0.482
Oracle-0.478
 Banks-0.465
 Pompe-0.461
Token digital
Token ID4875
Feature activation-1.81e+00

Pos logit contributions
 doct+0.515
 toug+0.515
 communal+0.506
 grips+0.503
 livest+0.500
Neg logit contributions
umber-0.579
ANGE-0.575
Oracle-0.566
 Banks-0.554
 Pompe-0.548
Token copies
Token ID9088
Feature activation-2.44e-01

Pos logit contributions
umber+1.615
ANGE+1.608
Oracle+1.586
 Banks+1.559
 Pompe+1.549
Neg logit contributions
 toug-1.000
 doct-0.991
 communal-0.975
 grips-0.972
 indo-0.969
Token of
Token ID286
Feature activation-7.96e-01

Pos logit contributions
umber+0.166
ANGE+0.166
Oracle+0.163
 Banks+0.159
 Pompe+0.157
Neg logit contributions
 toug-0.187
 doct-0.186
 communal-0.184
 grips-0.183
 livest-0.183
Tokens
Token ID82
Feature activation+2.88e-03

Pos logit contributions
umber+0.110
ANGE+0.109
Oracle+0.107
 Banks+0.104
 Pompe+0.103
Neg logit contributions
 toug-0.166
 doct-0.165
 communal-0.163
 grips-0.163
 livest-0.162
Token a
Token ID257
Feature activation-3.55e-01

Pos logit contributions
 toug+0.002
 doct+0.002
 communal+0.002
 grips+0.002
 livest+0.002
Neg logit contributions
umber-0.002
ANGE-0.002
Oracle-0.002
 Banks-0.002
 Pompe-0.002
Token start
Token ID923
Feature activation-8.76e-01

Pos logit contributions
umber+0.276
ANGE+0.274
Oracle+0.272
 Banks+0.264
 Pompe+0.262
Neg logit contributions
 toug-0.240
 doct-0.237
 communal-0.236
 grips-0.233
 livest-0.231
Token.
Token ID13
Feature activation+8.52e-01

Pos logit contributions
umber+0.543
ANGE+0.541
Oracle+0.532
 Banks+0.518
 Pompe+0.513
Neg logit contributions
 toug-0.725
 doct-0.720
 communal-0.714
 grips-0.711
 indo-0.706
Token<|endoftext|>
Token ID50256
Feature activation+3.53e+00

Pos logit contributions
 toug+0.633
 doct+0.628
 grips+0.620
 communal+0.617
 whis+0.611
Neg logit contributions
umber-0.597
ANGE-0.589
Oracle-0.587
 Banks-0.559
 Pompe-0.556
TokenAbstract
Token ID23839
Feature activation+7.17e+00

Pos logit contributions
 toug+2.852
 doct+2.848
 communal+2.809
 grips+2.796
 whis+2.777
Neg logit contributions
umber-2.141
ANGE-2.109
 Banks-2.072
 Pompe-2.056
Oracle-2.016
Token
Token ID198
Feature activation+3.62e+00

Pos logit contributions
 doct+5.183
 toug+5.106
 grips+5.055
 communal+5.036
 indo+5.008
Neg logit contributions
umber-4.872
ANGE-4.871
Oracle-4.780
 Pompe-4.518
 trump-4.478
Token
Token ID198
Feature activation+3.33e+00

Pos logit contributions
 doct+2.385
 toug+2.369
 grips+2.339
 communal+2.256
 whis+2.255
Neg logit contributions
umber-2.709
ANGE-2.684
Oracle-2.614
 Pompe-2.588
 Banks-2.526
TokenBackground
Token ID21756
Feature activation+5.70e+00

Pos logit contributions
 toug+2.615
 doct+2.614
 communal+2.583
 grips+2.582
 recip+2.555
Neg logit contributions
umber-2.081
ANGE-2.036
 Banks-2.012
 Pompe-2.004
Oracle-1.952
Token Depression
Token ID22483
Feature activation+1.81e+00

Pos logit contributions
 doct+4.186
 toug+4.150
 grips+4.098
 communal+4.072
 whis+4.033
Neg logit contributions
ANGE-3.875
umber-3.851
Oracle-3.800
 trump-3.523
 Pompe-3.518
Token presents
Token ID10969
Feature activation+2.67e+00

Pos logit contributions
 toug+1.440
 grips+1.423
 doct+1.420
 communal+1.417
 whis+1.400
Neg logit contributions
umber-1.187
ANGE-1.182
Oracle-1.167
 Banks-1.115
 Pompe-1.113
Token symptoms
Token ID7460
Feature activation+1.72e-01

Pos logit contributions
umber+0.372
ANGE+0.369
Oracle+0.364
 Banks+0.356
 Pompe+0.353
Neg logit contributions
 toug-0.344
 doct-0.344
 communal-0.341
 grips-0.338
 livest-0.334
Token that
Token ID326
Feature activation+3.70e-01

Pos logit contributions
 toug+0.130
 doct+0.130
 communal+0.128
 grips+0.128
 indo+0.127
Neg logit contributions
umber-0.119
ANGE-0.118
Oracle-0.116
 Banks-0.113
 Pompe-0.113
Token are
Token ID389
Feature activation-9.13e-01

Pos logit contributions
 toug+0.280
 doct+0.280
 communal+0.278
 grips+0.278
 livest+0.272
Neg logit contributions
umber-0.255
ANGE-0.253
Oracle-0.249
 Pompe-0.242
 Banks-0.241
Token in
Token ID287
Feature activation-9.03e-01

Pos logit contributions
umber+0.697
ANGE+0.693
Oracle+0.681
 Banks+0.666
 Pompe+0.660
Neg logit contributions
 toug-0.630
 doct-0.626
 communal-0.619
 grips-0.617
 livest-0.611
Tokenext
Token ID2302
Feature activation+1.98e+00

Pos logit contributions
umber+0.652
ANGE+0.650
Oracle+0.641
 Banks+0.624
 Pompe+0.619
Neg logit contributions
 toug-0.653
 doct-0.650
 communal-0.644
 grips-0.642
 livest-0.638
Tokenric
Token ID1173
Feature activation+7.01e+00

Pos logit contributions
 doct+1.386
 toug+1.371
 grips+1.348
 communal+1.347
 whis+1.328
Neg logit contributions
ANGE-1.472
Oracle-1.452
umber-1.418
 Banks-1.412
 Pompe-1.380
Tokenably
Token ID1346
Feature activation+9.73e-01

Pos logit contributions
 doct+3.178
 communal+3.134
 toug+3.105
 grips+3.089
 whis+3.055
Neg logit contributions
ANGE-6.633
Oracle-6.606
 Banks-6.574
 Pompe-6.474
umber-6.403
Token linked
Token ID6692
Feature activation+2.32e+00

Pos logit contributions
 doct+0.591
 toug+0.589
 communal+0.579
 grips+0.578
 whis+0.564
Neg logit contributions
umber-0.825
ANGE-0.821
Oracle-0.808
 Pompe-0.790
 Banks-0.790
Token to
Token ID284
Feature activation+1.00e+00

Pos logit contributions
 toug+1.763
 doct+1.762
 communal+1.747
 grips+1.735
 indo+1.717
Neg logit contributions
ANGE-1.573
umber-1.568
Oracle-1.537
 Banks-1.506
 Pompe-1.497
Token the
Token ID262
Feature activation-9.69e-02

Pos logit contributions
 doct+0.705
 toug+0.703
 communal+0.696
 grips+0.688
 livest+0.675
Neg logit contributions
umber-0.754
ANGE-0.749
Oracle-0.734
 Banks-0.716
 Pompe-0.713
Token core
Token ID4755
Feature activation+1.54e+00

Pos logit contributions
umber+0.076
ANGE+0.075
Oracle+0.074
 Banks+0.072
 Pompe+0.072
Neg logit contributions
 toug-0.065
 doct-0.065
 communal-0.064
 grips-0.064
 indo-0.063
Token thereof
Token ID15370
Feature activation-6.80e-01

Pos logit contributions
umber+0.859
ANGE+0.845
Oracle+0.835
 Banks+0.816
 Pompe+0.807
Neg logit contributions
 toug-1.014
 doct-1.008
 grips-0.998
 communal-0.997
 indo-0.996
Token.
Token ID13
Feature activation+4.48e-01

Pos logit contributions
umber+0.436
ANGE+0.435
Oracle+0.425
 Banks+0.415
 Pompe+0.411
Neg logit contributions
 toug-0.549
 doct-0.546
 communal-0.540
 grips-0.538
 livest-0.536
Token
Token ID198
Feature activation-4.52e-01

Pos logit contributions
 toug+0.356
 doct+0.356
 communal+0.351
 grips+0.350
 livest+0.348
Neg logit contributions
umber-0.290
ANGE-0.289
Oracle-0.283
 Banks-0.275
 Pompe-0.272
Token
Token ID198
Feature activation-4.92e-01

Pos logit contributions
umber+0.314
ANGE+0.311
Oracle+0.302
 Pompe+0.295
 Banks+0.293
Neg logit contributions
 doct-0.337
 toug-0.336
 grips-0.332
 communal-0.325
 whis-0.324
TokenArticle
Token ID14906
Feature activation+3.41e+00

Pos logit contributions
umber+0.320
ANGE+0.316
Oracle+0.305
 Banks+0.304
 Pompe+0.302
Neg logit contributions
 doct-0.388
 toug-0.387
 grips-0.379
 communal-0.379
 whis-0.374
Token IV
Token ID8363
Feature activation+6.98e+00

Pos logit contributions
 doct+2.507
 toug+2.489
 communal+2.459
 grips+2.454
 livest+2.431
Neg logit contributions
ANGE-2.406
umber-2.395
Oracle-2.349
 Banks-2.282
 Pompe-2.247
Token,
Token ID11
Feature activation+2.38e+00

Pos logit contributions
 toug+5.018
 doct+5.000
 grips+4.913
 livest+4.888
 communal+4.861
Neg logit contributions
ANGE-4.818
umber-4.747
Oracle-4.732
 Banks-4.503
 Pompe-4.449
Token Section
Token ID7275
Feature activation+3.49e+00

Pos logit contributions
 toug+1.633
 doct+1.632
 communal+1.605
 grips+1.599
 livest+1.596
Neg logit contributions
umber-1.801
ANGE-1.799
Oracle-1.777
 Banks-1.717
 Pompe-1.708
Token 2
Token ID362
Feature activation+2.76e+00

Pos logit contributions
 toug+2.561
 doct+2.558
 grips+2.526
 communal+2.514
 whis+2.507
Neg logit contributions
ANGE-2.446
umber-2.432
Oracle-2.400
 Banks-2.319
 Pompe-2.282
Token begins
Token ID6140
Feature activation-1.60e+00

Pos logit contributions
 toug+2.056
 doct+2.045
 grips+2.019
 indo+2.010
 whis+1.996
Neg logit contributions
ANGE-1.934
umber-1.917
Oracle-1.876
 Banks-1.829
 Pompe-1.807
Token with
Token ID351
Feature activation-2.26e+00

Pos logit contributions
umber+1.006
ANGE+1.002
Oracle+0.982
 Banks+0.956
 Pompe+0.944
Neg logit contributions
 toug-1.316
 doct-1.311
 communal-1.295
 grips-1.292
 livest-1.286
Token and
Token ID290
Feature activation+6.11e-01

Pos logit contributions
ANGE+0.570
umber+0.569
Oracle+0.559
 Banks+0.543
 Pompe+0.539
Neg logit contributions
 toug-0.660
 doct-0.657
 communal-0.650
 grips-0.649
 whis-0.643
Token stay
Token ID2652
Feature activation+1.85e+00

Pos logit contributions
 toug+0.469
 doct+0.466
 communal+0.462
 grips+0.460
 indo+0.457
Neg logit contributions
ANGE-0.418
umber-0.416
Oracle-0.409
 Banks-0.395
 Pompe-0.389
Token there
Token ID612
Feature activation-8.85e-01

Pos logit contributions
 doct+1.320
 toug+1.320
 communal+1.311
 grips+1.282
 whis+1.275
Neg logit contributions
ANGE-1.378
umber-1.361
Oracle-1.338
 Banks-1.307
 Pompe-1.280
Token,
Token ID11
Feature activation-5.88e-01

Pos logit contributions
ANGE+0.637
umber+0.637
Oracle+0.622
 Banks+0.608
 Pompe+0.602
Neg logit contributions
 toug-0.647
 doct-0.644
 communal-0.636
 grips-0.632
 whis-0.628
Token as
Token ID355
Feature activation+3.67e-01

Pos logit contributions
umber+0.418
ANGE+0.416
Oracle+0.409
 Banks+0.397
 Pompe+0.392
Neg logit contributions
 toug-0.435
 doct-0.432
 communal-0.428
 grips-0.426
 livest-0.423
Token if
Token ID611
Feature activation+6.92e+00

Pos logit contributions
 doct+0.269
 toug+0.269
 communal+0.267
 grips+0.264
 whis+0.262
Neg logit contributions
umber-0.262
ANGE-0.262
Oracle-0.256
 Banks-0.247
 Pompe-0.244
Token they
Token ID484
Feature activation+2.97e-01

Pos logit contributions
 toug+4.738
 doct+4.681
 communal+4.631
 whis+4.563
 grips+4.548
Neg logit contributions
ANGE-5.186
umber-5.132
Oracle-5.081
-4.761
 Banks-4.721
Token had
Token ID550
Feature activation+1.34e+00

Pos logit contributions
 toug+0.222
 doct+0.221
 communal+0.217
 grips+0.217
 livest+0.216
Neg logit contributions
ANGE-0.209
umber-0.209
Oracle-0.206
 Banks-0.200
 Pompe-0.198
Token taken
Token ID2077
Feature activation-2.93e-01

Pos logit contributions
 toug+0.887
 doct+0.882
 communal+0.866
 grips+0.860
 whis+0.852
Neg logit contributions
umber-1.058
ANGE-1.057
Oracle-1.043
 Banks-1.006
 Pompe-0.997
Token out
Token ID503
Feature activation-1.36e+00

Pos logit contributions
umber+0.211
ANGE+0.211
Oracle+0.208
 Banks+0.200
 Pompe+0.198
Neg logit contributions
 toug-0.213
 doct-0.213
 communal-0.211
 grips-0.210
 indo-0.207
Token a
Token ID257
Feature activation-6.82e-02

Pos logit contributions
umber+0.989
ANGE+0.984
Oracle+0.968
 Banks+0.948
 Pompe+0.938
Neg logit contributions
 toug-0.979
 doct-0.974
 communal-0.961
 grips-0.959
 livest-0.955
Token maze
Token ID31237
Feature activation-2.51e-01

Pos logit contributions
 toug+0.492
 doct+0.490
 communal+0.484
 grips+0.482
 indo+0.478
Neg logit contributions
umber-0.643
ANGE-0.641
Oracle-0.631
 Banks-0.617
 Pompe-0.612
Token consists
Token ID10874
Feature activation+1.52e-01

Pos logit contributions
umber+0.184
ANGE+0.183
Oracle+0.180
 Banks+0.176
 Pompe+0.174
Neg logit contributions
 toug-0.180
 doct-0.179
 communal-0.177
 grips-0.176
 indo-0.175
Token in
Token ID287
Feature activation-1.24e+00

Pos logit contributions
 toug+0.122
 doct+0.122
 livest+0.120
 indo+0.120
 grips+0.120
Neg logit contributions
umber-0.098
ANGE-0.097
Oracle-0.095
 Banks-0.093
 Pompe-0.092
Token a
Token ID257
Feature activation-9.08e-01

Pos logit contributions
umber+0.906
ANGE+0.902
Oracle+0.886
 Banks+0.869
 Pompe+0.860
Neg logit contributions
 toug-0.886
 doct-0.882
 communal-0.869
 grips-0.867
 livest-0.867
Token 3
Token ID513
Feature activation+1.89e+00

Pos logit contributions
umber+0.717
ANGE+0.714
Oracle+0.703
 Banks+0.690
 Pompe+0.684
Neg logit contributions
 toug-0.596
 doct-0.593
 communal-0.584
 grips-0.583
 livest-0.581
Token×
Token ID12906
Feature activation+6.91e+00

Pos logit contributions
 toug+1.461
 doct+1.455
 communal+1.441
 grips+1.430
 indo+1.425
Neg logit contributions
umber-1.275
ANGE-1.269
Oracle-1.249
 Banks-1.219
 Pompe-1.206
Token3
Token ID18
Feature activation+2.95e+00

Pos logit contributions
 doct+5.082
 communal+5.066
 toug+5.058
 cooker+4.853
 whis+4.841
Neg logit contributions
umber-4.648
ANGE-4.574
 Banks-4.573
Oracle-4.538
 Pompe-4.455
Tokenrooms
Token ID9649
Feature activation+1.62e+00

Pos logit contributions
 toug+1.696
 doct+1.684
 communal+1.672
 indo+1.638
 grips+1.636
Neg logit contributions
umber-2.576
ANGE-2.562
Oracle-2.534
 Banks-2.482
 Pompe-2.458
Token.
Token ID13
Feature activation+4.49e-01

Pos logit contributions
 toug+1.207
 doct+1.202
 communal+1.193
 grips+1.178
 indo+1.175
Neg logit contributions
umber-1.137
ANGE-1.132
Oracle-1.118
 Banks-1.090
 Pompe-1.078
Token The
Token ID383
Feature activation+6.74e-02

Pos logit contributions
 toug+0.332
 doct+0.331
 communal+0.327
 grips+0.325
 indo+0.324
Neg logit contributions
umber-0.318
ANGE-0.316
Oracle-0.310
 Banks-0.303
 Pompe-0.300
Token rooms
Token ID9519
Feature activation+7.75e-01

Pos logit contributions
 toug+0.043
 doct+0.043
 communal+0.042
 grips+0.042
 indo+0.042
Neg logit contributions
umber-0.055
ANGE-0.055
Oracle-0.054
 Banks-0.053
 Pompe-0.052
Token in
Token ID287
Feature activation+1.13e-01

Pos logit contributions
 toug+0.264
 doct+0.263
 communal+0.262
 grips+0.260
 indo+0.258
Neg logit contributions
umber-0.219
ANGE-0.218
Oracle-0.215
 Pompe-0.207
 Banks-0.207
Token America
Token ID2253
Feature activation+1.63e+00

Pos logit contributions
 toug+0.087
 doct+0.087
 communal+0.086
 grips+0.086
 indo+0.085
Neg logit contributions
ANGE-0.076
umber-0.076
Oracle-0.075
 Banks-0.073
 Pompe-0.072
Token.
Token ID13
Feature activation+1.22e-01

Pos logit contributions
 toug+1.323
 doct+1.321
 communal+1.305
 grips+1.297
 livest+1.286
Neg logit contributions
umber-1.032
ANGE-1.031
Oracle-1.017
 Pompe-0.979
 Banks-0.977
Token Everyone
Token ID11075
Feature activation+8.85e-02

Pos logit contributions
 doct+0.087
 toug+0.086
 communal+0.085
 grips+0.085
 whis+0.084
Neg logit contributions
umber-0.089
ANGE-0.089
Oracle-0.087
 Banks-0.083
 Pompe-0.083
Token gave
Token ID2921
Feature activation+4.76e-01

Pos logit contributions
 doct+0.063
 toug+0.063
 grips+0.062
 communal+0.062
 indo+0.061
Neg logit contributions
umber-0.066
ANGE-0.065
Oracle-0.064
 Banks-0.062
 Pompe-0.062
Token themselves
Token ID2405
Feature activation+6.91e+00

Pos logit contributions
 doct+0.341
 toug+0.337
 communal+0.337
 grips+0.332
 indo+0.330
Neg logit contributions
umber-0.350
ANGE-0.348
Oracle-0.345
 Banks-0.330
 Pompe-0.329
Token the
Token ID262
Feature activation+2.85e+00

Pos logit contributions
 communal+4.732
 doct+4.681
 toug+4.613
 whis+4.563
 grips+4.547
Neg logit contributions
umber-5.226
Oracle-5.211
ANGE-5.185
 Banks-4.805
 Pompe-4.724
Token rhetorical
Token ID38505
Feature activation+2.94e+00

Pos logit contributions
 doct+1.843
 toug+1.836
 communal+1.830
 grips+1.810
 indo+1.793
Neg logit contributions
ANGE-2.265
umber-2.256
Oracle-2.227
 Banks-2.167
 Pompe-2.141
Token authority
Token ID4934
Feature activation+1.71e+00

Pos logit contributions
 toug+1.897
 doct+1.884
 grips+1.879
 communal+1.866
 indo+1.838
Neg logit contributions
ANGE-2.372
umber-2.356
Oracle-2.341
 Banks-2.251
 Pompe-2.211
Token to
Token ID284
Feature activation+1.09e+00

Pos logit contributions
 toug+1.293
 doct+1.288
 communal+1.272
 grips+1.269
 indo+1.260
Neg logit contributions
umber-1.174
ANGE-1.170
Oracle-1.149
 Banks-1.121
 Pompe-1.109
Token tweet
Token ID6126
Feature activation-2.62e-01

Pos logit contributions
 toug+0.754
 doct+0.750
 communal+0.738
 grips+0.734
 indo+0.727
Neg logit contributions
ANGE-0.832
umber-0.829
Oracle-0.822
 Banks-0.793
 Pompe-0.788
Token they
Token ID484
Feature activation+8.21e-01

Pos logit contributions
 toug+0.095
 doct+0.095
 communal+0.093
 grips+0.093
 livest+0.093
Neg logit contributions
umber-0.101
ANGE-0.101
Oracle-0.099
 Banks-0.097
 Pompe-0.096
Token attempted
Token ID7482
Feature activation+6.12e-01

Pos logit contributions
 toug+0.542
 doct+0.541
 communal+0.529
 grips+0.528
 livest+0.522
Neg logit contributions
ANGE-0.651
umber-0.651
Oracle-0.642
 Banks-0.623
 Pompe-0.619
Token to
Token ID284
Feature activation-8.31e-01

Pos logit contributions
 toug+0.487
 doct+0.483
 livest+0.478
 indo+0.477
 communal+0.477
Neg logit contributions
umber-0.397
ANGE-0.394
Oracle-0.387
 Banks-0.380
 Pompe-0.376
Token get
Token ID651
Feature activation+2.06e+00

Pos logit contributions
umber+0.659
ANGE+0.658
Oracle+0.647
 Banks+0.633
 Pompe+0.628
Neg logit contributions
 toug-0.547
 doct-0.544
 communal-0.535
 grips-0.534
 livest-0.529
Token a
Token ID257
Feature activation+2.25e+00

Pos logit contributions
 doct+1.451
 toug+1.447
 grips+1.443
 communal+1.425
 whis+1.395
Neg logit contributions
umber-1.539
ANGE-1.520
Oracle-1.501
 Banks-1.458
 Pompe-1.447
Token free
Token ID1479
Feature activation+6.90e+00

Pos logit contributions
 doct+1.369
 toug+1.359
 grips+1.343
 communal+1.339
 livest+1.297
Neg logit contributions
umber-1.902
ANGE-1.887
Oracle-1.865
 Banks-1.813
 Pompe-1.806
Token handset
Token ID40445
Feature activation+8.88e-01

Pos logit contributions
 toug+4.723
 doct+4.721
 grips+4.684
 communal+4.654
 cooker+4.606
Neg logit contributions
umber-5.036
ANGE-5.009
Oracle-4.979
 Banks-4.828
 Pompe-4.756
Token.
Token ID13
Feature activation-1.31e-01

Pos logit contributions
 toug+0.701
 doct+0.700
 grips+0.690
 communal+0.689
 whis+0.681
Neg logit contributions
umber-0.590
ANGE-0.586
Oracle-0.575
 Banks-0.558
 Pompe-0.552
Token
Token ID198
Feature activation-3.99e-01

Pos logit contributions
umber+0.084
ANGE+0.084
Oracle+0.082
 Banks+0.080
 Pompe+0.079
Neg logit contributions
 toug-0.105
 doct-0.105
 communal-0.104
 grips-0.103
 indo-0.103
Token
Token ID198
Feature activation-4.66e-01

Pos logit contributions
umber+0.276
ANGE+0.273
Oracle+0.265
 Pompe+0.261
 Banks+0.259
Neg logit contributions
 doct-0.298
 toug-0.297
 grips-0.293
 communal-0.288
 whis-0.286
TokenThe
Token ID464
Feature activation-1.04e+00

Pos logit contributions
umber+0.291
ANGE+0.285
Oracle+0.279
 Banks+0.276
 Pompe+0.275
Neg logit contributions
 toug-0.382
 doct-0.382
 communal-0.376
 grips-0.375
 whis-0.371
Token
Token ID564
Feature activation+7.57e-01

Pos logit contributions
umber+0.185
ANGE+0.184
Oracle+0.180
 Banks+0.175
 Pompe+0.174
Neg logit contributions
 toug-0.206
 doct-0.206
 communal-0.203
 grips-0.203
 indo-0.202
Token
Token ID250
Feature activation+1.63e+00

Pos logit contributions
 toug+0.525
 doct+0.523
 communal+0.512
 grips+0.511
 whis+0.502
Neg logit contributions
umber-0.568
ANGE-0.559
Oracle-0.546
 Banks-0.537
 Pompe-0.534
TokenSomething
Token ID22210
Feature activation+2.37e+00

Pos logit contributions
 toug+1.313
 doct+1.310
 communal+1.288
 grips+1.288
 whis+1.278
Neg logit contributions
umber-1.022
ANGE-0.994
 Pompe-0.967
 Banks-0.956
Oracle-0.950
Token about
Token ID546
Feature activation+1.85e+00

Pos logit contributions
 toug+1.768
 doct+1.751
 communal+1.740
 grips+1.740
 indo+1.721
Neg logit contributions
umber-1.663
ANGE-1.645
Oracle-1.623
 Pompe-1.583
 Banks-1.575
Token the
Token ID262
Feature activation+1.30e-01

Pos logit contributions
 toug+1.339
 doct+1.331
 communal+1.324
 grips+1.316
 indo+1.308
Neg logit contributions
umber-1.341
ANGE-1.332
Oracle-1.308
 Banks-1.260
 Pompe-1.255
Token vowel
Token ID48617
Feature activation-9.05e+00

Pos logit contributions
 toug+0.092
 doct+0.092
 communal+0.091
 grips+0.091
 indo+0.090
Neg logit contributions
umber-0.097
ANGE-0.097
Oracle-0.095
 Banks-0.092
 Pompe-0.092
Token system
Token ID1080
Feature activation-7.49e-01

Pos logit contributions
umber+6.951
ANGE+6.826
 Banks+6.731
 Pompe+6.611
Oracle+6.523
Neg logit contributions
 doct-5.736
kefeller-5.684
 livest-5.661
 toug-5.608
 indo-5.553
Token gets
Token ID3011
Feature activation+1.69e-01

Pos logit contributions
umber+0.504
ANGE+0.503
Oracle+0.494
 Banks+0.480
 Pompe+0.477
Neg logit contributions
 toug-0.582
 doct-0.578
 grips-0.573
 communal-0.571
 indo-0.569
Token me
Token ID502
Feature activation+3.26e+00

Pos logit contributions
 toug+0.130
 doct+0.129
 communal+0.127
 grips+0.127
 indo+0.126
Neg logit contributions
umber-0.117
ANGE-0.116
Oracle-0.113
 Banks-0.110
 Pompe-0.109
Token all
Token ID477
Feature activation+8.11e-01

Pos logit contributions
 toug+2.364
 doct+2.326
 grips+2.314
 communal+2.301
 whis+2.271
Neg logit contributions
umber-2.379
ANGE-2.377
Oracle-2.345
 Banks-2.274
 Pompe-2.260
Token tri
Token ID1333
Feature activation-8.35e-01

Pos logit contributions
 toug+0.519
 doct+0.516
 communal+0.508
 grips+0.507
 indo+0.503
Neg logit contributions
umber-0.657
ANGE-0.655
Oracle-0.646
 Banks-0.631
 Pompe-0.627
Tokenly
Token ID306
Feature activation-4.31e-01

Pos logit contributions
 doct+1.447
 toug+1.432
 communal+1.409
 grips+1.407
 whis+1.387
Neg logit contributions
ANGE-1.944
umber-1.943
Oracle-1.922
 Banks-1.874
 Pompe-1.831
Token but
Token ID475
Feature activation-1.08e+00

Pos logit contributions
umber+0.312
ANGE+0.310
Oracle+0.304
 Banks+0.296
 Pompe+0.291
Neg logit contributions
 toug-0.317
 doct-0.316
 communal-0.310
 grips-0.308
 livest-0.305
Token hardly
Token ID8941
Feature activation-1.12e+00

Pos logit contributions
umber+0.780
ANGE+0.779
Oracle+0.767
 Banks+0.749
 Pompe+0.738
Neg logit contributions
 toug-0.782
 doct-0.779
 communal-0.766
 grips-0.763
 livest-0.758
Token exclusively
Token ID11541
Feature activation-1.13e+00

Pos logit contributions
umber+0.846
ANGE+0.842
Oracle+0.827
 Banks+0.808
 Pompe+0.796
Neg logit contributions
 doct-0.779
 toug-0.779
 communal-0.765
 grips-0.761
 indo-0.755
Token to
Token ID284
Feature activation-1.16e+00

Pos logit contributions
umber+0.719
ANGE+0.713
Oracle+0.701
 Banks+0.677
 Pompe+0.667
Neg logit contributions
 doct-0.932
 toug-0.927
 communal-0.914
 grips-0.909
 indo-0.902
Token female
Token ID4048
Feature activation-8.07e+00

Pos logit contributions
umber+0.842
ANGE+0.837
Oracle+0.822
 Banks+0.801
 Pompe+0.790
Neg logit contributions
 toug-0.850
 doct-0.845
 communal-0.832
 grips-0.830
 indo-0.825
Token veterans
Token ID11255
Feature activation-2.67e+00

Pos logit contributions
ANGE+6.367
umber+6.353
 Pompe+6.273
Oracle+6.267
 Banks+6.208
Neg logit contributions
 livest-4.989
 toug-4.920
 doct-4.905
 communal-4.897
kefeller-4.871
Token and
Token ID290
Feature activation-8.52e-01

Pos logit contributions
umber+1.497
ANGE+1.489
Oracle+1.457
 Banks+1.415
 Pompe+1.397
Neg logit contributions
 toug-2.360
 doct-2.352
 communal-2.326
 grips-2.320
 indo-2.311
Token female
Token ID4048
Feature activation-5.29e+00

Pos logit contributions
umber+0.677
ANGE+0.672
Oracle+0.664
 Banks+0.647
 Pompe+0.635
Neg logit contributions
 toug-0.564
 doct-0.559
 communal-0.552
 grips-0.548
 indo-0.548
Token reporters
Token ID7638
Feature activation-5.67e-01

Pos logit contributions
umber+4.176
ANGE+4.170
Oracle+4.107
 Pompe+4.068
 Banks+4.058
Neg logit contributions
 toug-3.348
 doct-3.339
 livest-3.335
 communal-3.286
 grips-3.279
Token).
Token ID737
Feature activation-9.12e-01

Pos logit contributions
ANGE+0.266
umber+0.265
Oracle+0.259
 Banks+0.249
 Pompe+0.245
Neg logit contributions
 toug-0.555
 doct-0.555
 communal-0.548
 grips-0.546
 whis-0.543
Token joined
Token ID5399
Feature activation-1.46e+00

Pos logit contributions
umber+0.169
ANGE+0.168
Oracle+0.165
 Banks+0.161
 Pompe+0.159
Neg logit contributions
 toug-0.167
 doct-0.166
 communal-0.165
 grips-0.164
 whis-0.163
Token seconds
Token ID4201
Feature activation+2.07e-01

Pos logit contributions
umber+0.995
ANGE+0.990
Oracle+0.973
 Banks+0.950
 Pompe+0.940
Neg logit contributions
 toug-1.119
 doct-1.114
 communal-1.101
 grips-1.098
 livest-1.092
Token later
Token ID1568
Feature activation+5.84e-01

Pos logit contributions
 toug+0.150
 doct+0.149
 communal+0.148
 grips+0.148
 indo+0.146
Neg logit contributions
ANGE-0.150
umber-0.149
Oracle-0.147
 Banks-0.144
 Pompe-0.142
Token by
Token ID416
Feature activation+7.21e-01

Pos logit contributions
 toug+0.437
 doct+0.432
 communal+0.430
 grips+0.428
 livest+0.423
Neg logit contributions
ANGE-0.408
umber-0.406
Oracle-0.402
 Banks-0.393
 Pompe-0.386
Token a
Token ID257
Feature activation-1.48e+00

Pos logit contributions
 toug+0.512
 doct+0.510
 communal+0.507
 grips+0.502
 whis+0.499
Neg logit contributions
umber-0.535
ANGE-0.533
Oracle-0.525
 Banks-0.510
 Pompe-0.502
Token deaf
Token ID21815
Feature activation-7.32e+00

Pos logit contributions
umber+1.154
ANGE+1.148
Oracle+1.131
 Banks+1.107
 Pompe+1.097
Neg logit contributions
 toug-0.994
 doct-0.989
 communal-0.977
 grips-0.971
 livest-0.966
Tokenening
Token ID3101
Feature activation-2.41e+00

Pos logit contributions
umber+3.537
ANGE+3.517
Oracle+3.398
 Pompe+3.301
 Banks+3.281
Neg logit contributions
 toug-6.898
 doct-6.828
 livest-6.820
 indo-6.807
 recip-6.792
Token th
Token ID294
Feature activation+9.95e-01

Pos logit contributions
umber+1.944
ANGE+1.936
Oracle+1.905
 Banks+1.872
 Pompe+1.858
Neg logit contributions
 toug-1.535
 doct-1.528
 communal-1.502
 grips-1.501
 livest-1.500
Tokenud
Token ID463
Feature activation+8.89e-01

Pos logit contributions
 toug+0.792
 doct+0.783
 communal+0.779
 grips+0.777
 whis+0.772
Neg logit contributions
ANGE-0.639
Oracle-0.629
umber-0.623
 Banks-0.613
 Pompe-0.608
Token from
Token ID422
Feature activation+2.06e+00

Pos logit contributions
 toug+0.726
 doct+0.725
 communal+0.718
 grips+0.715
 whis+0.709
Neg logit contributions
umber-0.559
ANGE-0.558
Oracle-0.548
 Banks-0.535
 Pompe-0.528
Token the
Token ID262
Feature activation+1.82e-01

Pos logit contributions
 toug+1.507
 doct+1.498
 communal+1.495
 grips+1.484
 whis+1.465
Neg logit contributions
umber-1.473
ANGE-1.468
Oracle-1.440
 Banks-1.394
 Pompe-1.368
Tokenth
Token ID400
Feature activation+7.67e-01

Pos logit contributions
umber+0.222
ANGE+0.221
Oracle+0.217
 Banks+0.212
 Pompe+0.208
Neg logit contributions
 toug-0.349
 doct-0.348
 grips-0.344
 indo-0.344
 communal-0.344
Token U
Token ID471
Feature activation+1.13e+00

Pos logit contributions
 toug+0.591
 doct+0.587
 communal+0.582
 grips+0.580
 livest+0.575
Neg logit contributions
umber-0.519
ANGE-0.516
Oracle-0.507
 Banks-0.494
 Pompe-0.489
Token.
Token ID13
Feature activation-1.88e+00

Pos logit contributions
 toug+0.940
 doct+0.936
 grips+0.925
 communal+0.925
 livest+0.910
Neg logit contributions
umber-0.684
ANGE-0.676
Oracle-0.673
 Banks-0.649
 Pompe-0.644
TokenS
Token ID50
Feature activation-2.12e+00

Pos logit contributions
umber+1.278
ANGE+1.250
Oracle+1.226
 Pompe+1.191
 Banks+1.191
Neg logit contributions
 toug-1.441
 doct-1.441
 grips-1.416
 communal-1.413
 whis-1.392
Token.
Token ID13
Feature activation-1.54e+00

Pos logit contributions
umber+1.252
ANGE+1.247
Oracle+1.217
 Banks+1.172
 Pompe+1.168
Neg logit contributions
 toug-1.822
 doct-1.815
 communal-1.794
 grips-1.790
 whis-1.768
Token Circuit
Token ID13588
Feature activation-7.32e+00

Pos logit contributions
umber+0.514
ANGE+0.509
Oracle+0.490
 Banks+0.467
 Pompe+0.456
Neg logit contributions
 toug-1.707
 doct-1.702
 communal-1.687
 grips-1.684
 livest-1.681
Token Court
Token ID3078
Feature activation-3.71e+00

Pos logit contributions
 Banks+2.957
ANGE+2.923
umber+2.904
Oracle+2.844
 Pompe+2.768
Neg logit contributions
kefeller-7.391
 doct-7.375
 indo-7.374
 recip-7.344
 livest-7.318
Token of
Token ID286
Feature activation+5.31e-01

Pos logit contributions
umber+2.166
Oracle+2.121
ANGE+2.117
 Banks+2.105
 Pompe+2.050
Neg logit contributions
 doct-3.124
 communal-3.105
 indo-3.098
 livest-3.095
 toug-3.091
Token Appeal
Token ID31876
Feature activation-2.39e+00

Pos logit contributions
 toug+0.462
 doct+0.460
 grips+0.457
 communal+0.454
 whis+0.445
Neg logit contributions
umber-0.304
ANGE-0.304
Oracle-0.299
 Pompe-0.277
 Banks-0.276
Token's
Token ID338
Feature activation-8.21e-02

Pos logit contributions
umber+1.746
ANGE+1.739
Oracle+1.707
 Banks+1.668
 Pompe+1.652
Neg logit contributions
 toug-1.725
 doct-1.715
 communal-1.691
 grips-1.687
 livest-1.678
Token 3
Token ID513
Feature activation+4.73e-01

Pos logit contributions
umber+0.059
ANGE+0.058
Oracle+0.057
 Banks+0.054
 Pompe+0.054
Neg logit contributions
 toug-0.062
 doct-0.061
 communal-0.060
 grips-0.060
 recip-0.059
Token the
Token ID262
Feature activation-5.38e-01

Pos logit contributions
umber+0.666
ANGE+0.663
Oracle+0.651
 Banks+0.635
 Pompe+0.628
Neg logit contributions
 toug-0.724
 doct-0.721
 communal-0.712
 grips-0.710
 livest-0.706
Token Pentagon
Token ID12651
Feature activation-4.42e+00

Pos logit contributions
umber+0.423
ANGE+0.421
Oracle+0.415
 Banks+0.404
 Pompe+0.400
Neg logit contributions
 toug-0.358
 doct-0.356
 communal-0.351
 grips-0.350
 livest-0.347
Token's
Token ID338
Feature activation+1.12e+00

Pos logit contributions
umber+3.119
ANGE+3.102
Oracle+3.032
 Banks+3.024
 Pompe+3.000
Neg logit contributions
 toug-3.209
 doct-3.197
 livest-3.151
 communal-3.150
 indo-3.147
Token investigation
Token ID3645
Feature activation-3.41e-01

Pos logit contributions
 doct+0.734
 toug+0.733
 communal+0.722
 grips+0.718
 indo+0.709
Neg logit contributions
umber-0.898
ANGE-0.890
Oracle-0.881
 Banks-0.848
 Pompe-0.836
Token is
Token ID318
Feature activation-1.21e+00

Pos logit contributions
umber+0.228
ANGE+0.228
Oracle+0.224
 Banks+0.216
 Pompe+0.213
Neg logit contributions
 toug-0.267
 doct-0.266
 communal-0.262
 grips-0.262
 indo-0.259
Token Private
Token ID15348
Feature activation-7.28e+00

Pos logit contributions
umber+0.887
ANGE+0.885
Oracle+0.866
 Banks+0.840
 Pompe+0.829
Neg logit contributions
 toug-0.871
 doct-0.863
 communal-0.852
 grips-0.846
 livest-0.840
Token first
Token ID717
Feature activation-2.45e+00

Pos logit contributions
umber+4.165
ANGE+4.085
 Banks+4.031
Oracle+4.001
 Pompe+3.968
Neg logit contributions
 toug-6.333
 doct-6.257
 livest-6.248
 indo-6.247
 communal-6.232
Token class
Token ID1398
Feature activation+2.11e-01

Pos logit contributions
umber+1.998
ANGE+1.991
Oracle+1.961
 Banks+1.923
 Pompe+1.907
Neg logit contributions
 toug-1.549
 doct-1.542
 communal-1.519
 grips-1.514
 livest-1.507
Token Bradley
Token ID16182
Feature activation-1.01e+00

Pos logit contributions
 toug+0.175
 doct+0.175
 communal+0.173
 grips+0.172
 whis+0.171
Neg logit contributions
umber-0.130
ANGE-0.130
Oracle-0.127
 Banks-0.121
 Pompe-0.120
Token Manning
Token ID15281
Feature activation-2.37e-01

Pos logit contributions
umber+0.625
ANGE+0.623
Oracle+0.612
 Banks+0.592
 Pompe+0.586
Neg logit contributions
 toug-0.831
 doct-0.828
 communal-0.819
 grips-0.816
 livest-0.814
Token,
Token ID11
Feature activation-6.88e-01

Pos logit contributions
ANGE+0.153
umber+0.152
Oracle+0.149
 Banks+0.145
 Pompe+0.143
Neg logit contributions
 toug-0.191
 doct-0.190
 communal-0.187
 grips-0.187
 indo-0.186
Token the
Token ID262
Feature activation+7.48e-01

Pos logit contributions
 toug+0.144
 doct+0.143
 communal+0.142
 grips+0.141
 livest+0.141
Neg logit contributions
umber-0.130
ANGE-0.129
Oracle-0.127
 Banks-0.123
 Pompe-0.122
Token raid
Token ID9513
Feature activation-1.44e+00

Pos logit contributions
 toug+0.456
 doct+0.456
 communal+0.452
 grips+0.443
 indo+0.443
Neg logit contributions
umber-0.629
ANGE-0.624
Oracle-0.617
 Banks-0.600
 Pompe-0.596
Token:
Token ID25
Feature activation-1.52e+00

Pos logit contributions
umber+0.832
ANGE+0.827
Oracle+0.810
 Banks+0.788
 Pompe+0.778
Neg logit contributions
 toug-1.245
 doct-1.240
 communal-1.226
 grips-1.224
 livest-1.220
Token
Token ID198
Feature activation-4.28e+00

Pos logit contributions
umber+0.940
ANGE+0.934
Oracle+0.915
 Banks+0.897
 Pompe+0.881
Neg logit contributions
 toug-1.260
 doct-1.250
 communal-1.238
 grips-1.234
 livest-1.233
Token
Token ID198
Feature activation-3.97e+00

Pos logit contributions
umber+2.653
ANGE+2.640
Oracle+2.589
 Banks+2.525
 Pompe+2.492
Neg logit contributions
 toug-3.539
 doct-3.524
 communal-3.489
 livest-3.485
 indo-3.477
TokenMale
Token ID25486
Feature activation-7.22e+00

Pos logit contributions
ANGE+2.553
umber+2.552
Oracle+2.511
 Banks+2.422
 Pompe+2.390
Neg logit contributions
 toug-3.191
 doct-3.169
 communal-3.140
 livest-3.132
 indo-3.129
Token Officer
Token ID10391
Feature activation-1.23e+00

Pos logit contributions
umber+4.937
ANGE+4.908
Oracle+4.815
 Banks+4.748
 Pompe+4.651
Neg logit contributions
 toug-5.395
 doct-5.349
 communal-5.262
 livest-5.258
kefeller-5.237
Token:
Token ID25
Feature activation+1.44e-01

Pos logit contributions
umber+0.747
ANGE+0.744
Oracle+0.730
 Banks+0.704
 Pompe+0.698
Neg logit contributions
 toug-1.035
 doct-1.032
 grips-1.021
 communal-1.019
 indo-1.015
Token
Token ID564
Feature activation+3.91e-02

Pos logit contributions
 doct+0.104
 toug+0.104
 communal+0.103
 grips+0.102
 indo+0.102
Neg logit contributions
umber-0.105
ANGE-0.104
Oracle-0.103
 Banks-0.099
 Pompe-0.099
Token
Token ID250
Feature activation+1.48e+00

Pos logit contributions
 toug+0.033
 doct+0.033
 grips+0.033
 communal+0.033
 livest+0.033
Neg logit contributions
umber-0.023
ANGE-0.023
Oracle-0.023
 Banks-0.022
 Pompe-0.022
TokenDid
Token ID11633
Feature activation+9.11e-01

Pos logit contributions
 doct+1.215
 toug+1.214
 communal+1.193
 grips+1.188
 livest+1.186
Neg logit contributions
umber-0.914
ANGE-0.899
Oracle-0.884
 Banks-0.870
 Pompe-0.862
Token that
Token ID326
Feature activation-2.93e+00

Pos logit contributions
umber+0.653
ANGE+0.651
Oracle+0.640
 Banks+0.625
 Pompe+0.619
Neg logit contributions
 toug-0.652
 doct-0.649
 communal-0.641
 grips-0.638
 indo-0.634
Token the
Token ID262
Feature activation-1.36e+00

Pos logit contributions
umber+2.137
ANGE+2.126
Oracle+2.085
 Banks+2.051
 Pompe+2.031
Neg logit contributions
 toug-2.092
 doct-2.082
 livest-2.056
 communal-2.055
 grips-2.046
Token use
Token ID779
Feature activation-3.60e-01

Pos logit contributions
umber+1.065
ANGE+1.061
Oracle+1.044
 Banks+1.023
 Pompe+1.014
Neg logit contributions
 toug-0.897
 doct-0.893
 communal-0.880
 grips-0.877
 livest-0.872
Token of
Token ID286
Feature activation-8.65e-01

Pos logit contributions
umber+0.234
ANGE+0.232
Oracle+0.225
 Banks+0.220
 Pompe+0.218
Neg logit contributions
 toug-0.287
 doct-0.287
 livest-0.283
 communal-0.282
 indo-0.281
Token flap
Token ID37699
Feature activation-3.81e+00

Pos logit contributions
umber+0.695
ANGE+0.692
Oracle+0.682
 Banks+0.668
 Pompe+0.662
Neg logit contributions
 toug-0.557
 doct-0.555
 communal-0.547
 grips-0.545
 livest-0.542
Token valves
Token ID41843
Feature activation-7.07e+00

Pos logit contributions
umber+2.974
ANGE+2.943
Oracle+2.892
 Banks+2.863
 Pompe+2.823
Neg logit contributions
 toug-2.505
 doct-2.505
 livest-2.475
 communal-2.472
 indo-2.459
Token limits
Token ID7095
Feature activation-4.37e+00

Pos logit contributions
umber+4.423
ANGE+4.381
Oracle+4.281
 Banks+4.272
 Pompe+4.215
Neg logit contributions
 livest-5.507
 doct-5.477
 communal-5.450
 toug-5.437
kefeller-5.436
Token the
Token ID262
Feature activation-8.97e-01

Pos logit contributions
umber+3.141
ANGE+3.109
Oracle+3.055
 Banks+3.014
 Pompe+2.991
Neg logit contributions
 toug-3.100
 doct-3.100
 livest-3.082
 communal-3.053
 indo-3.049
Token reliability
Token ID17843
Feature activation-6.61e-01

Pos logit contributions
umber+0.682
ANGE+0.681
Oracle+0.670
 Banks+0.653
 Pompe+0.647
Neg logit contributions
 toug-0.619
 doct-0.616
 communal-0.608
 grips-0.605
 indo-0.601
Token and
Token ID290
Feature activation+6.94e-01

Pos logit contributions
umber+0.445
ANGE+0.443
Oracle+0.435
 Banks+0.425
 Pompe+0.420
Neg logit contributions
 toug-0.511
 doct-0.509
 communal-0.503
 grips-0.502
 livest-0.501
Token longevity
Token ID28188
Feature activation+2.58e+00

Pos logit contributions
 toug+0.491
 doct+0.484
 communal+0.479
 grips+0.477
 indo+0.473
Neg logit contributions
umber-0.524
ANGE-0.523
Oracle-0.517
 Banks-0.502
 Pompe-0.493
Token billion
Token ID2997
Feature activation+8.87e-01

Pos logit contributions
umber+0.278
ANGE+0.277
Oracle+0.270
 Banks+0.262
 Pompe+0.260
Neg logit contributions
 toug-0.402
 doct-0.400
 communal-0.397
 grips-0.396
 indo-0.394
Token,
Token ID11
Feature activation+1.89e-01

Pos logit contributions
 toug+0.717
 doct+0.708
 communal+0.705
 grips+0.702
 indo+0.695
Neg logit contributions
umber-0.567
ANGE-0.564
Oracle-0.547
 Pompe-0.533
 Banks-0.532
Token according
Token ID1864
Feature activation+1.57e+00

Pos logit contributions
 toug+0.142
 doct+0.139
 communal+0.139
 grips+0.138
 livest+0.138
Neg logit contributions
umber-0.134
ANGE-0.134
Oracle-0.130
 Banks-0.126
 Pompe-0.125
Token to
Token ID284
Feature activation+5.92e-01

Pos logit contributions
 toug+1.044
 doct+1.038
 grips+1.025
 communal+1.019
 whis+0.991
Neg logit contributions
ANGE-1.208
umber-1.198
Oracle-1.169
 Pompe-1.132
 Banks-1.125
Token a
Token ID257
Feature activation-5.82e-01

Pos logit contributions
 toug+0.426
 doct+0.425
 communal+0.423
 grips+0.420
 livest+0.418
Neg logit contributions
umber-0.429
ANGE-0.428
Oracle-0.417
 Banks-0.403
 Pompe-0.401
Token Venture
Token ID35686
Feature activation-6.99e+00

Pos logit contributions
umber+0.389
ANGE+0.384
Oracle+0.376
 Banks+0.363
 Pompe+0.360
Neg logit contributions
 toug-0.457
 doct-0.455
 communal-0.453
 grips-0.447
 livest-0.447
TokenBeat
Token ID34979
Feature activation-4.79e+00

Pos logit contributions
ANGE+2.806
umber+2.801
Oracle+2.718
 Banks+2.690
 Pompe+2.637
Neg logit contributions
 doct-7.210
 recip-7.181
 livest-7.159
 toug-7.135
 communal-7.099
Token report
Token ID989
Feature activation-7.41e-01

Pos logit contributions
umber+2.782
ANGE+2.762
Oracle+2.703
 Banks+2.642
 Pompe+2.607
Neg logit contributions
 toug-4.135
 doct-4.120
 communal-4.072
 grips-4.064
 indo-4.054
Token that
Token ID326
Feature activation-1.24e+00

Pos logit contributions
umber+0.454
ANGE+0.453
Oracle+0.442
 Banks+0.429
 Pompe+0.425
Neg logit contributions
 toug-0.620
 doct-0.618
 communal-0.610
 grips-0.609
 livest-0.605
Token cites
Token ID23474
Feature activation-3.04e+00

Pos logit contributions
ANGE+0.883
umber+0.883
Oracle+0.864
 Banks+0.839
 Pompe+0.830
Neg logit contributions
 toug-0.918
 doct-0.912
 grips-0.902
 communal-0.897
 indo-0.893
Token anonymous
Token ID11614
Feature activation-1.25e+00

Pos logit contributions
umber+2.188
ANGE+2.180
Oracle+2.143
 Banks+2.101
 Pompe+2.080
Neg logit contributions
 toug-2.201
 doct-2.188
 communal-2.159
 grips-2.154
 indo-2.150
Token worked
Token ID3111
Feature activation+1.41e+00

Pos logit contributions
umber+0.109
ANGE+0.108
Oracle+0.107
 Banks+0.104
 Pompe+0.103
Neg logit contributions
 toug-0.099
 doct-0.098
 grips-0.096
 communal-0.096
 livest-0.095
Token out
Token ID503
Feature activation-6.87e-01

Pos logit contributions
 toug+1.117
 doct+1.095
 communal+1.092
 grips+1.085
 whis+1.076
Neg logit contributions
ANGE-0.935
umber-0.924
Oracle-0.905
 Banks-0.884
 Pompe-0.872
Token perfectly
Token ID7138
Feature activation+9.30e-01

Pos logit contributions
ANGE+0.512
umber+0.511
Oracle+0.502
 Banks+0.490
 Pompe+0.485
Neg logit contributions
 toug-0.485
 doct-0.481
 communal-0.475
 grips-0.474
 whis-0.469
Token for
Token ID329
Feature activation-5.03e-01

Pos logit contributions
 toug+0.687
 doct+0.679
 grips+0.673
 communal+0.673
 whis+0.664
Neg logit contributions
ANGE-0.664
umber-0.655
Oracle-0.645
 Banks-0.627
 Pompe-0.624
Token Gre
Token ID11955
Feature activation-1.43e+00

Pos logit contributions
umber+0.350
ANGE+0.349
Oracle+0.343
 Banks+0.333
 Pompe+0.330
Neg logit contributions
 toug-0.378
 doct-0.377
 communal-0.372
 grips-0.371
 livest-0.369
Tokenipel
Token ID40634
Feature activation-6.89e+00

Pos logit contributions
umber+0.984
ANGE+0.977
Oracle+0.960
 Banks+0.939
 Pompe+0.929
Neg logit contributions
 toug-1.088
 doct-1.083
 communal-1.069
 grips-1.066
 livest-1.064
Token.
Token ID13
Feature activation-1.81e+00

Pos logit contributions
umber+4.155
Oracle+4.000
ANGE+3.987
 Banks+3.962
 Pompe+3.874
Neg logit contributions
 indo-5.644
 doct-5.629
 toug-5.613
 livest-5.602
 communal-5.583
Token
Token ID564
Feature activation-1.04e+00

Pos logit contributions
umber+1.239
ANGE+1.234
Oracle+1.212
 Banks+1.188
 Pompe+1.175
Neg logit contributions
 toug-1.378
 doct-1.371
 communal-1.356
 livest-1.351
 grips-1.350
Token
Token ID250
Feature activation+7.51e-01

Pos logit contributions
umber+0.749
ANGE+0.743
Oracle+0.727
 Banks+0.714
 Pompe+0.706
Neg logit contributions
 toug-0.753
 doct-0.751
 communal-0.740
 grips-0.736
 recip-0.728
TokenThe
Token ID464
Feature activation-2.36e-01

Pos logit contributions
 toug+0.619
 doct+0.617
 grips+0.610
 communal+0.607
 whis+0.601
Neg logit contributions
umber-0.460
ANGE-0.455
Oracle-0.442
 Banks-0.434
 Pompe-0.430
Token one
Token ID530
Feature activation-1.11e+00

Pos logit contributions
umber+0.183
ANGE+0.183
Oracle+0.180
 Banks+0.176
 Pompe+0.174
Neg logit contributions
 toug-0.158
 doct-0.157
 communal-0.155
 grips-0.155
 livest-0.154
Token (
Token ID357
Feature activation-1.72e+00

Pos logit contributions
 toug+0.942
 doct+0.940
 communal+0.931
 grips+0.924
 indo+0.920
Neg logit contributions
umber-0.814
ANGE-0.810
Oracle-0.797
 Banks-0.775
 Pompe-0.771
Tokenincluding
Token ID8201
Feature activation-2.12e+00

Pos logit contributions
umber+1.180
ANGE+1.177
Oracle+1.156
 Banks+1.126
 Pompe+1.111
Neg logit contributions
 toug-1.303
 doct-1.296
 grips-1.281
 communal-1.279
 livest-1.276
Token the
Token ID262
Feature activation-1.87e+00

Pos logit contributions
umber+1.337
ANGE+1.332
Oracle+1.291
 Banks+1.282
 Pompe+1.272
Neg logit contributions
 toug-1.695
 doct-1.690
 livest-1.678
 grips-1.670
 recip-1.667
Token use
Token ID779
Feature activation-1.60e+00

Pos logit contributions
ANGE+1.414
umber+1.412
Oracle+1.378
 Banks+1.361
 Pompe+1.349
Neg logit contributions
 toug-1.272
 doct-1.265
 livest-1.259
 indo-1.250
 grips-1.249
Token of
Token ID286
Feature activation-1.86e+00

Pos logit contributions
umber+1.140
ANGE+1.138
Oracle+1.114
 Banks+1.090
 Pompe+1.083
Neg logit contributions
 toug-1.180
 doct-1.177
 grips-1.159
 communal-1.157
 whis-1.146
Token female
Token ID4048
Feature activation-6.89e+00

Pos logit contributions
umber+1.310
ANGE+1.307
Oracle+1.279
 Banks+1.254
 Pompe+1.248
Neg logit contributions
 toug-1.372
 doct-1.363
 livest-1.349
 grips-1.348
 communal-1.342
Token pronouns
Token ID43947
Feature activation-6.08e+00

Pos logit contributions
umber+5.760
ANGE+5.707
Oracle+5.641
 Pompe+5.560
 Banks+5.555
Neg logit contributions
 toug-4.060
 livest-4.028
 doct-4.020
 indo-3.997
 recip-3.939
Token like
Token ID588
Feature activation-1.50e+00

Pos logit contributions
umber+5.110
ANGE+5.069
Oracle+4.963
 Banks+4.922
 Pompe+4.866
Neg logit contributions
 livest-3.649
 indo-3.591
 toug-3.581
 recip-3.570
 doct-3.551
Token "
Token ID366
Feature activation-4.37e+00

Pos logit contributions
ANGE+1.118
umber+1.117
Oracle+1.098
 Banks+1.067
 Pompe+1.056
Neg logit contributions
 toug-1.061
 doct-1.056
 communal-1.041
 grips-1.039
 whis-1.027
Tokenshe
Token ID7091
Feature activation-5.06e+00

Pos logit contributions
umber+2.956
ANGE+2.901
Oracle+2.848
 Banks+2.770
 Pompe+2.739
Neg logit contributions
 toug-3.366
 doct-3.350
 livest-3.338
 grips-3.330
 communal-3.327
Token and
Token ID290
Feature activation-1.50e+00

Pos logit contributions
umber+3.243
ANGE+3.221
 Banks+3.079
Oracle+3.068
 Pompe+3.019
Neg logit contributions
 toug-3.946
 livest-3.936
 doct-3.933
 communal-3.910
 recip-3.898
Token.
Token ID13
Feature activation+1.47e+00

Pos logit contributions
 toug+1.298
 doct+1.298
 grips+1.284
 communal+1.275
 whis+1.260
Neg logit contributions
ANGE-0.975
umber-0.974
Oracle-0.959
 Pompe-0.923
 Banks-0.910
Token [
Token ID685
Feature activation-3.43e-02

Pos logit contributions
 toug+1.009
 doct+1.006
 communal+0.992
 grips+0.989
 livest+0.985
Neg logit contributions
umber-1.121
ANGE-1.116
Oracle-1.100
 Banks-1.072
 Pompe-1.065
Tokencolor
Token ID8043
Feature activation+9.98e-01

Pos logit contributions
umber+0.027
ANGE+0.027
Oracle+0.027
 Banks+0.026
 Pompe+0.026
Neg logit contributions
 toug-0.023
 doct-0.022
 communal-0.022
 grips-0.022
 livest-0.022
Token=
Token ID28
Feature activation-2.77e-01

Pos logit contributions
 doct+0.686
 toug+0.684
 grips+0.669
 communal+0.664
 whis+0.656
Neg logit contributions
umber-0.735
Oracle-0.730
ANGE-0.727
 Pompe-0.714
 Banks-0.706
Tokenred
Token ID445
Feature activation+1.40e+00

Pos logit contributions
umber+0.188
ANGE+0.187
Oracle+0.183
 Banks+0.178
 Pompe+0.176
Neg logit contributions
 toug-0.214
 doct-0.213
 communal-0.211
 grips-0.210
 livest-0.210
Token][
Token ID7131
Feature activation-6.87e+00

Pos logit contributions
 toug+1.024
 doct+1.019
 communal+1.006
 grips+1.003
 livest+1.000
Neg logit contributions
umber-1.008
ANGE-1.003
Oracle-0.985
 Banks-0.965
 Pompe-0.954
Tokenb
Token ID65
Feature activation-1.25e-01

Pos logit contributions
umber+3.228
ANGE+2.996
Oracle+2.925
 Banks+2.855
tr+2.841
Neg logit contributions
 toug-6.732
 recip-6.646
 indo-6.626
 doct-6.621
 livest-6.619
Token]
Token ID60
Feature activation-3.47e+00

Pos logit contributions
umber+0.077
ANGE+0.077
Oracle+0.076
 Banks+0.074
 Pompe+0.073
Neg logit contributions
 doct-0.102
 toug-0.102
 grips-0.101
 communal-0.099
 whis-0.098
TokenU
Token ID52
Feature activation-9.94e-01

Pos logit contributions
ANGE+2.019
umber+2.018
Oracle+1.993
 Banks+1.917
 Pompe+1.877
Neg logit contributions
 toug-3.007
 doct-2.996
 communal-2.966
 indo-2.964
 livest-2.954
Tokenf
Token ID69
Feature activation-1.65e+00

Pos logit contributions
umber+0.639
ANGE+0.637
Oracle+0.626
 Banks+0.609
 Pompe+0.604
Neg logit contributions
 toug-0.797
 doct-0.793
 communal-0.784
 grips-0.783
 livest-0.779
Tokenou
Token ID280
Feature activation+1.00e+00

Pos logit contributions
ANGE+1.000
umber+0.997
Oracle+0.976
 Banks+0.953
 Pompe+0.941
Neg logit contributions
 toug-1.382
 doct-1.377
 grips-1.359
 communal-1.359
 livest-1.349
Token
Token ID198
Feature activation+9.84e-01

Pos logit contributions
 toug+1.723
 doct+1.715
 communal+1.690
 grips+1.677
 whis+1.664
Neg logit contributions
ANGE-1.787
umber-1.762
Oracle-1.749
 Banks-1.675
 Pompe-1.673
Token
Token ID198
Feature activation+5.39e-01

Pos logit contributions
 doct+0.703
 toug+0.700
 grips+0.690
 communal+0.673
 whis+0.670
Neg logit contributions
ANGE-0.708
umber-0.706
Oracle-0.686
 Pompe-0.673
 Banks-0.661
TokenMore
Token ID5167
Feature activation+1.70e-01

Pos logit contributions
 toug+0.433
 doct+0.432
 communal+0.424
 grips+0.422
 whis+0.418
Neg logit contributions
umber-0.342
ANGE-0.335
 Banks-0.329
Oracle-0.327
 Pompe-0.324
Token:
Token ID25
Feature activation+1.86e+00

Pos logit contributions
 toug+0.153
 doct+0.153
 communal+0.151
 grips+0.151
 livest+0.151
Neg logit contributions
umber-0.092
ANGE-0.092
Oracle-0.090
 Banks-0.087
 Pompe-0.086
Token Law
Token ID3854
Feature activation-1.49e+00

Pos logit contributions
 toug+1.378
 doct+1.375
 communal+1.347
 grips+1.344
 indo+1.328
Neg logit contributions
umber-1.308
ANGE-1.292
Oracle-1.274
 Banks-1.219
 Pompe-1.203
Tokensuit
Token ID6063
Feature activation-6.81e+00

Pos logit contributions
umber+0.985
ANGE+0.980
Oracle+0.961
 Banks+0.939
 Pompe+0.929
Neg logit contributions
 toug-1.164
 doct-1.161
 communal-1.147
 grips-1.143
 livest-1.140
Token:
Token ID25
Feature activation+6.20e-01

Pos logit contributions
 Banks+4.872
ANGE+4.863
umber+4.854
 Pompe+4.817
Oracle+4.808
Neg logit contributions
 livest-4.686
 doct-4.622
kefeller-4.617
 communal-4.583
 indo-4.554
Token UC
Token ID14417
Feature activation-1.03e+00

Pos logit contributions
 toug+0.457
 doct+0.456
 communal+0.448
 grips+0.447
 indo+0.443
Neg logit contributions
umber-0.442
ANGE-0.438
Oracle-0.431
 Banks-0.417
 Pompe-0.412
Token has
Token ID468
Feature activation+4.51e-01

Pos logit contributions
umber+0.802
ANGE+0.798
Oracle+0.786
 Banks+0.769
 Pompe+0.762
Neg logit contributions
 toug-0.684
 doct-0.680
 communal-0.671
 grips-0.669
 livest-0.665
Token '
Token ID705
Feature activation-1.12e+00

Pos logit contributions
 toug+0.300
 doct+0.298
 communal+0.293
 grips+0.292
 indo+0.290
Neg logit contributions
umber-0.357
ANGE-0.353
Oracle-0.348
 Banks-0.339
 Pompe-0.336
Tokenanti
Token ID17096
Feature activation+1.33e+00

Pos logit contributions
umber+0.794
ANGE+0.790
Oracle+0.777
 Banks+0.759
 Pompe+0.752
Neg logit contributions
 toug-0.824
 doct-0.821
 communal-0.810
 grips-0.808
 livest-0.806
Token-
Token ID12
Feature activation-1.89e+00

Pos logit contributions
umber+2.124
ANGE+2.093
Oracle+2.062
 Banks+2.047
 Pompe+2.015
Neg logit contributions
 doct-1.991
 toug-1.989
 communal-1.977
 livest-1.971
 indo-1.962
Tokenblood
Token ID18041
Feature activation-4.94e-01

Pos logit contributions
umber+1.217
ANGE+1.205
Oracle+1.184
 Banks+1.157
 Pompe+1.143
Neg logit contributions
 toug-1.514
 doct-1.510
 communal-1.494
 grips-1.489
 livest-1.487
Token spew
Token ID38645
Feature activation+1.45e+00

Pos logit contributions
umber+0.356
ANGE+0.355
Oracle+0.349
 Banks+0.341
 Pompe+0.338
Neg logit contributions
 toug-0.358
 doct-0.357
 communal-0.352
 grips-0.351
 livest-0.350
Tokening
Token ID278
Feature activation+1.06e+00

Pos logit contributions
 toug+1.012
 doct+1.009
 grips+0.994
 communal+0.993
 whis+0.985
Neg logit contributions
ANGE-1.088
umber-1.085
Oracle-1.067
 Banks-1.043
 Pompe-1.034
Token Vul
Token ID25442
Feature activation-5.11e+00

Pos logit contributions
 toug+0.783
 doct+0.781
 communal+0.772
 grips+0.769
 whis+0.765
Neg logit contributions
umber-0.753
ANGE-0.750
Oracle-0.734
 Banks-0.713
 Pompe-0.707
Tokenv
Token ID85
Feature activation-6.77e+00

Pos logit contributions
umber+3.112
ANGE+3.066
Oracle+2.993
 Banks+2.963
 Pompe+2.921
Neg logit contributions
 toug-4.224
 doct-4.218
 livest-4.213
 communal-4.195
 indo-4.158
Tokenatron
Token ID23484
Feature activation-1.90e+00

Pos logit contributions
umber+4.517
ANGE+4.455
Oracle+4.403
 Banks+4.302
 Pompe+4.237
Neg logit contributions
 toug-5.092
 communal-5.071
 livest-5.070
 indo-5.055
 doct-5.053
Token.
Token ID13
Feature activation-1.40e-01

Pos logit contributions
umber+1.301
ANGE+1.294
Oracle+1.271
 Banks+1.243
 Pompe+1.229
Neg logit contributions
 toug-1.442
 doct-1.436
 communal-1.418
 grips-1.414
 livest-1.410
Token
Token ID198
Feature activation-7.42e-01

Pos logit contributions
umber+0.097
ANGE+0.096
Oracle+0.094
 Banks+0.091
 Pompe+0.090
Neg logit contributions
 doct-0.105
 toug-0.105
 grips-0.103
 communal-0.102
 whis-0.102
Token
Token ID198
Feature activation-1.08e+00

Pos logit contributions
umber+0.511
ANGE+0.508
Oracle+0.494
 Pompe+0.484
 Banks+0.481
Neg logit contributions
 doct-0.557
 toug-0.556
 grips-0.550
 communal-0.541
 whis-0.536
TokenThe
Token ID464
Feature activation-1.27e+00

Pos logit contributions
umber+0.670
ANGE+0.662
Oracle+0.642
 Banks+0.640
 Pompe+0.633
Neg logit contributions
 toug-0.879
 doct-0.878
 grips-0.865
 communal-0.864
 whis-0.855
Token Prepar
Token ID38397
Feature activation+2.45e+00

Pos logit contributions
umber+0.629
ANGE+0.626
Oracle+0.615
 Banks+0.601
 Pompe+0.596
Neg logit contributions
 toug-0.651
 doct-0.648
 communal-0.640
 grips-0.638
 livest-0.635
Tokenation
Token ID341
Feature activation+1.35e+00

Pos logit contributions
 toug+1.945
 doct+1.933
 communal+1.922
 grips+1.903
 recip+1.874
Neg logit contributions
Oracle-1.560
ANGE-1.540
umber-1.526
 Banks-1.489
 Pompe-1.475
Token Goes
Token ID31914
Feature activation-1.35e+00

Pos logit contributions
 toug+1.015
 doct+1.013
 grips+1.003
 communal+0.999
 whis+0.990
Neg logit contributions
ANGE-0.933
umber-0.928
Oracle-0.920
 Banks-0.876
 Pompe-0.873
Token A
Token ID317
Feature activation+1.03e+00

Pos logit contributions
umber+1.036
ANGE+1.029
Oracle+1.012
 Banks+0.996
 Pompe+0.986
Neg logit contributions
 toug-0.926
 doct-0.920
 grips-0.908
 communal-0.907
 livest-0.906
Token Long
Token ID5882
Feature activation+1.58e+00

Pos logit contributions
 toug+0.749
 doct+0.741
 communal+0.731
 whis+0.725
 indo+0.718
Neg logit contributions
ANGE-0.721
Oracle-0.707
umber-0.698
 Pompe-0.670
 Banks-0.668
Token Way
Token ID6378
Feature activation-6.74e+00

Pos logit contributions
 doct+0.788
 toug+0.785
 communal+0.775
 grips+0.755
 whis+0.754
Neg logit contributions
ANGE-1.472
umber-1.455
Oracle-1.451
 Pompe-1.397
 Banks-1.384
Token
Token ID198
Feature activation+8.37e-01

Pos logit contributions
umber+3.985
ANGE+3.895
 Banks+3.858
Oracle+3.824
 Pompe+3.733
Neg logit contributions
 livest-5.669
 toug-5.663
 doct-5.629
 grips-5.613
 recip-5.595
Token
Token ID198
Feature activation+7.60e-01

Pos logit contributions
 doct+0.598
 toug+0.597
 grips+0.589
 communal+0.575
 whis+0.571
Neg logit contributions
umber-0.600
ANGE-0.594
Oracle-0.578
 Pompe-0.565
 Banks-0.557
TokenCon
Token ID3103
Feature activation-2.62e+00

Pos logit contributions
 toug+0.606
 doct+0.602
 communal+0.595
 grips+0.593
 whis+0.587
Neg logit contributions
umber-0.485
ANGE-0.479
 Banks-0.465
Oracle-0.461
 Pompe-0.460
Tokenventions
Token ID16593
Feature activation+6.04e-02

Pos logit contributions
umber+1.699
ANGE+1.680
Oracle+1.645
 Banks+1.614
 Pompe+1.595
Neg logit contributions
 toug-2.094
 doct-2.088
 communal-2.059
 livest-2.055
 grips-2.050
Token usually
Token ID3221
Feature activation+1.54e+00

Pos logit contributions
 toug+0.048
 doct+0.047
 communal+0.047
 grips+0.047
 indo+0.046
Neg logit contributions
umber-0.040
ANGE-0.040
Oracle-0.039
 Banks-0.038
 Pompe-0.038
Token a
Token ID257
Feature activation-1.15e+00

Pos logit contributions
 doct+0.768
 toug+0.763
 communal+0.759
 grips+0.751
 indo+0.743
Neg logit contributions
umber-0.787
ANGE-0.781
Oracle-0.768
 Pompe-0.730
 Banks-0.730
Token contempt
Token ID18035
Feature activation-1.17e-02

Pos logit contributions
umber+0.914
ANGE+0.910
Oracle+0.896
 Banks+0.875
 Pompe+0.867
Neg logit contributions
 toug-0.756
 doct-0.753
 communal-0.743
 grips-0.739
 indo-0.734
Tokenible
Token ID856
Feature activation-8.40e-01

Pos logit contributions
umber+0.008
ANGE+0.008
Oracle+0.008
 Banks+0.008
 Pompe+0.008
Neg logit contributions
 toug-0.009
 doct-0.009
 communal-0.009
 grips-0.009
 livest-0.009
Token class
Token ID1398
Feature activation-1.03e+00

Pos logit contributions
umber+0.636
ANGE+0.632
Oracle+0.621
 Banks+0.602
 Pompe+0.596
Neg logit contributions
 doct-0.585
 toug-0.584
 communal-0.578
 grips-0.574
 indo-0.571
Token of
Token ID286
Feature activation+2.05e-01

Pos logit contributions
umber+0.702
ANGE+0.699
Oracle+0.684
 Banks+0.672
 Pompe+0.666
Neg logit contributions
 toug-0.789
 doct-0.784
 livest-0.777
 communal-0.775
 grips-0.774
Token corporate
Token ID6355
Feature activation-6.68e+00

Pos logit contributions
 doct+0.144
 toug+0.144
 communal+0.142
 grips+0.141
 indo+0.140
Neg logit contributions
umber-0.154
ANGE-0.153
Oracle-0.150
 Banks-0.145
 Pompe-0.145
Token fascists
Token ID48594
Feature activation-3.77e+00

Pos logit contributions
umber+5.011
ANGE+4.973
 Banks+4.973
Oracle+4.924
 Pompe+4.858
Neg logit contributions
 toug-4.432
 livest-4.407
 doct-4.326
 grips-4.290
 recip-4.283
Token,
Token ID11
Feature activation-1.64e-01

Pos logit contributions
umber+2.442
ANGE+2.418
Oracle+2.371
 Banks+2.339
 Pompe+2.309
Neg logit contributions
 toug-2.971
 doct-2.954
 livest-2.931
 communal-2.926
 grips-2.919
Token ego
Token ID19225
Feature activation-1.85e+00

Pos logit contributions
umber+0.117
ANGE+0.116
Oracle+0.114
 Banks+0.110
 Pompe+0.110
Neg logit contributions
 doct-0.121
 toug-0.121
 communal-0.120
 grips-0.119
 indo-0.119
Token maniac
Token ID45575
Feature activation+1.90e-02

Pos logit contributions
umber+1.399
ANGE+1.393
Oracle+1.372
 Banks+1.354
 Pompe+1.338
Neg logit contributions
 toug-1.251
 doct-1.239
 livest-1.228
 indo-1.223
 communal-1.222
Tokenal
Token ID282
Feature activation+4.38e-01

Pos logit contributions
 toug+0.016
 doct+0.016
 communal+0.016
 grips+0.016
 indo+0.016
Neg logit contributions
umber-0.011
ANGE-0.011
Oracle-0.011
 Banks-0.011
 Pompe-0.010
Token.
Token ID13
Feature activation+2.05e+00

Pos logit contributions
 doct+0.733
 toug+0.732
 grips+0.724
 communal+0.716
 whis+0.708
Neg logit contributions
umber-0.583
ANGE-0.581
Oracle-0.569
 Pompe-0.556
 Banks-0.544
Token [
Token ID685
Feature activation+5.40e-01

Pos logit contributions
 toug+1.403
 doct+1.397
 communal+1.378
 grips+1.373
 livest+1.368
Neg logit contributions
umber-1.557
ANGE-1.550
Oracle-1.525
 Banks-1.493
 Pompe-1.480
Tokencolor
Token ID8043
Feature activation+1.43e+00

Pos logit contributions
 toug+0.339
 doct+0.338
 communal+0.333
 grips+0.332
 livest+0.330
Neg logit contributions
umber-0.441
ANGE-0.439
Oracle-0.433
 Banks-0.425
 Pompe-0.421
Token=
Token ID28
Feature activation+1.68e-01

Pos logit contributions
 doct+0.949
 toug+0.948
 grips+0.929
 communal+0.917
 whis+0.908
Neg logit contributions
umber-1.068
Oracle-1.057
ANGE-1.053
 Pompe-1.040
 Banks-1.028
Tokenred
Token ID445
Feature activation+1.75e+00

Pos logit contributions
 toug+0.131
 doct+0.131
 communal+0.129
 grips+0.129
 livest+0.129
Neg logit contributions
umber-0.113
ANGE-0.113
Oracle-0.110
 Banks-0.107
 Pompe-0.106
Token][
Token ID7131
Feature activation-6.66e+00

Pos logit contributions
 toug+1.273
 doct+1.268
 communal+1.250
 grips+1.246
 livest+1.244
Neg logit contributions
umber-1.257
ANGE-1.252
Oracle-1.227
 Banks-1.203
 Pompe-1.189
Tokenb
Token ID65
Feature activation-8.21e-03

Pos logit contributions
umber+3.374
ANGE+3.212
Oracle+3.173
 Banks+3.077
 Pompe+2.956
Neg logit contributions
 livest-6.309
 toug-6.292
 indo-6.292
 doct-6.229
 recip-6.225
Token]
Token ID60
Feature activation-3.30e+00

Pos logit contributions
umber+0.005
ANGE+0.005
Oracle+0.005
 Banks+0.005
 Pompe+0.005
Neg logit contributions
 doct-0.007
 toug-0.007
 grips-0.007
 communal-0.006
 whis-0.006
TokenMega
Token ID43471
Feature activation-7.06e-01

Pos logit contributions
umber+1.938
ANGE+1.937
Oracle+1.913
 Banks+1.838
 Pompe+1.809
Neg logit contributions
 toug-2.836
 doct-2.826
 communal-2.803
 indo-2.794
 livest-2.793
Token Pony
Token ID30377
Feature activation-3.82e+00

Pos logit contributions
umber+0.517
ANGE+0.516
Oracle+0.507
 Banks+0.494
 Pompe+0.492
Neg logit contributions
 toug-0.505
 doct-0.501
 communal-0.495
 grips-0.494
 indo-0.491
Token[/
Token ID13412
Feature activation+1.30e+00

Pos logit contributions
umber+2.665
ANGE+2.625
Oracle+2.563
 Banks+2.551
 Pompe+2.508
Neg logit contributions
 doct-2.839
 toug-2.837
 communal-2.802
 livest-2.799
 indo-2.788
Token alongside
Token ID7848
Feature activation-1.18e+00

Pos logit contributions
 toug+0.261
 doct+0.260
 communal+0.256
 grips+0.255
 livest+0.254
Neg logit contributions
ANGE-0.246
umber-0.245
Oracle-0.240
 Banks-0.231
 Pompe-0.229
Token nurses
Token ID20669
Feature activation-3.72e-02

Pos logit contributions
umber+0.829
ANGE+0.827
Oracle+0.814
 Banks+0.787
 Pompe+0.781
Neg logit contributions
 toug-0.877
 doct-0.875
 communal-0.863
 grips-0.859
 livest-0.854
Token Jan
Token ID2365
Feature activation+2.68e-01

Pos logit contributions
ANGE+0.025
umber+0.025
Oracle+0.025
 Banks+0.024
 Pompe+0.024
Neg logit contributions
 toug-0.029
 doct-0.029
 communal-0.028
 grips-0.028
 whis-0.028
Tokenice
Token ID501
Feature activation-2.56e+00

Pos logit contributions
 toug+0.230
 doct+0.229
 grips+0.226
 communal+0.226
 indo+0.225
Neg logit contributions
ANGE-0.157
umber-0.157
Oracle-0.154
 Banks-0.148
 Pompe-0.147
Token and
Token ID290
Feature activation-9.30e-01

Pos logit contributions
umber+1.657
ANGE+1.648
Oracle+1.615
 Banks+1.589
 Pompe+1.571
Neg logit contributions
 toug-2.035
 doct-2.029
 communal-2.007
 livest-1.998
 grips-1.996
Token Miss
Token ID4544
Feature activation-6.63e+00

Pos logit contributions
umber+0.617
ANGE+0.614
Oracle+0.604
 Banks+0.585
 Pompe+0.579
Neg logit contributions
 toug-0.730
 doct-0.728
 communal-0.719
 grips-0.716
 indo-0.712
Token Pig
Token ID23097
Feature activation-8.48e-02

Pos logit contributions
umber+4.421
 Banks+4.333
ANGE+4.318
 Pompe+4.258
Oracle+4.242
Neg logit contributions
 toug-5.053
 livest-5.052
 communal-5.016
 doct-5.003
 indo-4.959
Tokengy
Token ID1360
Feature activation+1.32e+00

Pos logit contributions
ANGE+0.062
umber+0.061
Oracle+0.061
 Pompe+0.058
 Banks+0.058
Neg logit contributions
 toug-0.061
 doct-0.060
 communal-0.060
 grips-0.059
 livest-0.059
Token.
Token ID13
Feature activation-5.02e-02

Pos logit contributions
 toug+1.069
 doct+1.062
 grips+1.050
 communal+1.049
 whis+1.043
Neg logit contributions
ANGE-0.834
umber-0.832
Oracle-0.820
 Banks-0.779
 Pompe-0.778
Token Though
Token ID7486
Feature activation+1.67e+00

Pos logit contributions
umber+0.034
ANGE+0.034
Oracle+0.033
 Banks+0.032
 Pompe+0.032
Neg logit contributions
 doct-0.038
 toug-0.038
 communal-0.038
 grips-0.038
 whis-0.037
Token considered
Token ID3177
Feature activation+1.74e+00

Pos logit contributions
 doct+1.204
 toug+1.201
 communal+1.183
 grips+1.170
 whis+1.165
Neg logit contributions
umber-1.214
ANGE-1.208
Oracle-1.195
 Banks-1.131
 Pompe-1.128
Token dummy
Token ID31548
Feature activation-4.41e-01

Pos logit contributions
umber+0.600
ANGE+0.598
Oracle+0.589
 Banks+0.576
 Pompe+0.571
Neg logit contributions
 toug-0.533
 doct-0.531
 communal-0.524
 grips-0.522
 livest-0.519
Token will
Token ID481
Feature activation-7.36e-01

Pos logit contributions
umber+0.281
ANGE+0.279
Oracle+0.274
 Banks+0.266
 Pompe+0.263
Neg logit contributions
 toug-0.360
 doct-0.358
 grips-0.354
 communal-0.354
 indo-0.352
Token go
Token ID467
Feature activation-2.30e+00

Pos logit contributions
umber+0.550
ANGE+0.548
Oracle+0.541
 Banks+0.527
 Pompe+0.523
Neg logit contributions
 toug-0.520
 doct-0.514
 communal-0.508
 grips-0.506
 whis-0.502
Token a
Token ID257
Feature activation-1.01e+00

Pos logit contributions
umber+1.729
ANGE+1.720
Oracle+1.692
 Banks+1.656
 Pompe+1.637
Neg logit contributions
 toug-1.609
 doct-1.602
 communal-1.582
 grips-1.575
 livest-1.567
Token long
Token ID890
Feature activation+1.66e+00

Pos logit contributions
umber+0.792
ANGE+0.791
Oracle+0.780
 Banks+0.761
 Pompe+0.753
Neg logit contributions
 toug-0.675
 doct-0.672
 communal-0.664
 indo-0.656
 livest-0.655
Token way
Token ID835
Feature activation-6.61e+00

Pos logit contributions
 toug+0.945
 doct+0.935
 communal+0.931
 grips+0.912
 indo+0.907
Neg logit contributions
ANGE-1.459
umber-1.456
Oracle-1.442
 Banks-1.406
 Pompe-1.390
Token.
Token ID13
Feature activation+3.01e-01

Pos logit contributions
umber+3.726
ANGE+3.694
Oracle+3.575
 Banks+3.558
 Pompe+3.478
Neg logit contributions
 livest-5.645
 doct-5.642
 toug-5.628
 grips-5.613
 recip-5.611
Token We
Token ID775
Feature activation-2.67e+00

Pos logit contributions
 doct+0.214
 toug+0.213
 grips+0.210
 communal+0.208
 whis+0.206
Neg logit contributions
umber-0.221
ANGE-0.219
Oracle-0.214
 Banks-0.205
 Pompe-0.205
Token can
Token ID460
Feature activation-3.47e-01

Pos logit contributions
umber+1.920
ANGE+1.911
Oracle+1.876
 Banks+1.840
 Pompe+1.822
Neg logit contributions
 toug-1.928
 doct-1.920
 communal-1.897
 grips-1.892
 livest-1.887
Token only
Token ID691
Feature activation-3.60e+00

Pos logit contributions
umber+0.241
ANGE+0.241
Oracle+0.237
 Banks+0.231
 Pompe+0.229
Neg logit contributions
 toug-0.262
 doct-0.261
 communal-0.257
 grips-0.257
 indo-0.254
Token hope
Token ID2911
Feature activation-3.42e+00

Pos logit contributions
umber+2.513
ANGE+2.482
Oracle+2.422
 Banks+2.407
 Pompe+2.381
Neg logit contributions
 doct-2.638
 toug-2.628
 indo-2.609
 livest-2.609
 communal-2.605
Tokenstep
Token ID9662
Feature activation-2.88e+00

Pos logit contributions
umber+1.161
ANGE+1.158
Oracle+1.134
 Banks+1.108
 Pompe+1.097
Neg logit contributions
 toug-1.518
 doct-1.513
 communal-1.494
 grips-1.488
 livest-1.483
Token further
Token ID2252
Feature activation-7.61e-01

Pos logit contributions
umber+1.944
ANGE+1.934
Oracle+1.896
 Banks+1.854
 Pompe+1.834
Neg logit contributions
 toug-2.219
 doct-2.211
 communal-2.186
 livest-2.181
 grips-2.181
Token and
Token ID290
Feature activation-9.83e-01

Pos logit contributions
umber+0.481
ANGE+0.481
Oracle+0.471
 Banks+0.460
 Pompe+0.454
Neg logit contributions
 toug-0.621
 doct-0.619
 communal-0.609
 grips-0.608
 indo-0.605
Token eliminate
Token ID11005
Feature activation+2.37e+00

Pos logit contributions
umber+0.753
ANGE+0.751
Oracle+0.742
 Banks+0.723
 Pompe+0.716
Neg logit contributions
 toug-0.677
 doct-0.672
 communal-0.661
 grips-0.659
 indo-0.654
Token the
Token ID262
Feature activation-8.01e-01

Pos logit contributions
 toug+1.677
 doct+1.675
 grips+1.653
 communal+1.649
 whis+1.622
Neg logit contributions
umber-1.762
Oracle-1.752
ANGE-1.748
 Banks-1.674
 Pompe-1.653
Token valves
Token ID41843
Feature activation-6.58e+00

Pos logit contributions
umber+0.599
ANGE+0.596
Oracle+0.589
 Banks+0.572
 Pompe+0.568
Neg logit contributions
 toug-0.564
 doct-0.562
 communal-0.555
 grips-0.553
 indo-0.547
Token?
Token ID30
Feature activation-3.96e+00

Pos logit contributions
umber+4.225
ANGE+4.166
Oracle+4.073
 Banks+4.019
 Pompe+3.978
Neg logit contributions
 livest-5.191
 toug-5.159
 doct-5.134
 indo-5.127
 communal-5.110
Token
Token ID198
Feature activation-1.80e-01

Pos logit contributions
umber+2.739
ANGE+2.727
Oracle+2.681
 Banks+2.641
 Pompe+2.608
Neg logit contributions
 toug-2.981
 doct-2.960
 communal-2.938
 livest-2.934
 grips-2.921
Token
Token ID198
Feature activation-1.97e-01

Pos logit contributions
umber+0.126
ANGE+0.125
Oracle+0.122
 Pompe+0.119
 Banks+0.118
Neg logit contributions
 doct-0.133
 toug-0.132
 grips-0.131
 communal-0.128
 whis-0.127
TokenThe
Token ID464
Feature activation-6.11e-01

Pos logit contributions
umber+0.127
ANGE+0.124
 Banks+0.121
 Pompe+0.120
Oracle+0.120
Neg logit contributions
 doct-0.155
 toug-0.155
 grips-0.152
 communal-0.152
 whis-0.151
Token prime
Token ID6994
Feature activation+1.53e-01

Pos logit contributions
umber+0.358
ANGE+0.357
Oracle+0.348
 Banks+0.336
 Pompe+0.335
Neg logit contributions
 toug-0.527
 doct-0.526
 communal-0.521
 grips-0.519
 indo-0.515
Token "
Token ID366
Feature activation-4.71e+00

Pos logit contributions
umber+2.406
ANGE+2.393
Oracle+2.344
 Banks+2.313
 Pompe+2.286
Neg logit contributions
 toug-2.494
 doct-2.488
 communal-2.456
 grips-2.445
 livest-2.440
TokenMr
Token ID5246
Feature activation-2.55e+00

Pos logit contributions
umber+2.794
ANGE+2.791
Oracle+2.780
 Banks+2.656
 Pompe+2.582
Neg logit contributions
 toug-3.963
 doct-3.951
 communal-3.939
 livest-3.934
 indo-3.933
TokenW
Token ID54
Feature activation+6.02e-01

Pos logit contributions
ANGE+1.455
umber+1.455
Oracle+1.427
 Banks+1.401
 Pompe+1.365
Neg logit contributions
 toug-2.212
 doct-2.201
 communal-2.192
 livest-2.190
 indo-2.183
Tokenizard
Token ID8669
Feature activation-1.11e+00

Pos logit contributions
 toug+0.457
 doct+0.455
 communal+0.449
 grips+0.448
 livest+0.446
Neg logit contributions
umber-0.414
ANGE-0.411
Oracle-0.404
 Banks-0.395
 Pompe-0.392
Token"
Token ID1
Feature activation-1.10e+00

Pos logit contributions
umber+0.786
ANGE+0.784
Oracle+0.770
 Banks+0.753
 Pompe+0.745
Neg logit contributions
 toug-0.818
 doct-0.815
 communal-0.805
 grips-0.803
 livest-0.801
Token Cue
Token ID40568
Feature activation-6.58e+00

Pos logit contributions
umber+0.731
ANGE+0.728
Oracle+0.714
 Banks+0.692
 Pompe+0.686
Neg logit contributions
 toug-0.853
 doct-0.850
 communal-0.839
 grips-0.838
 indo-0.831
Tokenll
Token ID297
Feature activation-2.32e-01

Pos logit contributions
umber+2.407
ANGE+2.359
Oracle+2.274
 Banks+2.221
 Pompe+2.149
Neg logit contributions
 livest-6.948
 doct-6.946
 toug-6.944
 communal-6.925
 indo-6.915
Tokenar
Token ID283
Feature activation-1.98e+00

Pos logit contributions
umber+0.137
ANGE+0.137
Oracle+0.134
 Banks+0.130
 Pompe+0.129
Neg logit contributions
 toug-0.198
 doct-0.198
 communal-0.195
 grips-0.195
 livest-0.194
Token,
Token ID11
Feature activation-2.36e+00

Pos logit contributions
umber+1.377
ANGE+1.370
Oracle+1.346
 Banks+1.317
 Pompe+1.303
Neg logit contributions
 toug-1.480
 doct-1.475
 communal-1.457
 grips-1.452
 livest-1.448
Token EV
Token ID8696
Feature activation+3.54e-01

Pos logit contributions
umber+1.580
ANGE+1.572
Oracle+1.544
 Banks+1.511
 Pompe+1.493
Neg logit contributions
 toug-1.825
 doct-1.817
 communal-1.794
 grips-1.790
 indo-1.786
TokenO
Token ID46
Feature activation+9.59e-01

Pos logit contributions
 toug+0.272
 doct+0.270
 communal+0.267
 grips+0.267
 whis+0.265
Neg logit contributions
umber-0.239
ANGE-0.236
Oracle-0.232
 Banks-0.227
 Pompe-0.225
Token,
Token ID837
Feature activation+3.04e-01

Pos logit contributions
ANGE+0.838
umber+0.835
Oracle+0.813
 Banks+0.785
 Pompe+0.781
Neg logit contributions
 toug-1.481
 doct-1.474
 grips-1.463
 communal-1.453
 whis-1.445
Token 0
Token ID657
Feature activation-1.64e+00

Pos logit contributions
 toug+0.232
 doct+0.232
 grips+0.229
 communal+0.224
 cooker+0.223
Neg logit contributions
ANGE-0.208
umber-0.207
Oracle-0.206
 Banks-0.197
 Pompe-0.196
Token )
Token ID1267
Feature activation-8.45e-01

Pos logit contributions
umber+0.915
ANGE+0.907
Oracle+0.888
 Banks+0.857
 Pompe+0.856
Neg logit contributions
 toug-1.450
 doct-1.447
 grips-1.431
 communal-1.412
 whis-1.409
Token black
Token ID2042
Feature activation+1.42e+00

Pos logit contributions
ANGE+0.730
umber+0.728
Oracle+0.726
 Banks+0.704
 Pompe+0.701
Neg logit contributions
 toug-0.495
 doct-0.490
 communal-0.488
 grips-0.482
 whis-0.479
Tokenhat
Token ID5183
Feature activation-9.58e-01

Pos logit contributions
 doct+0.868
 toug+0.859
 grips+0.851
 communal+0.843
 whis+0.840
Neg logit contributions
ANGE-1.172
Oracle-1.160
umber-1.156
 Pompe-1.150
 Banks-1.142
Token =
Token ID796
Feature activation+6.42e+00

Pos logit contributions
umber+0.708
ANGE+0.702
Oracle+0.691
 Pompe+0.676
 Banks+0.676
Neg logit contributions
 doct-0.674
 toug-0.672
 grips-0.662
 communal-0.651
 whis-0.648
Token c
Token ID269
Feature activation+1.76e+00

Pos logit contributions
 doct+2.991
 toug+2.890
 grips+2.885
 communal+2.868
 cooker+2.830
Neg logit contributions
ANGE-6.289
Oracle-6.252
umber-6.240
 Banks-5.884
 Pompe-5.867
Tokenv
Token ID85
Feature activation-1.89e+00

Pos logit contributions
 doct+1.683
 toug+1.668
 grips+1.649
 communal+1.616
 whis+1.607
Neg logit contributions
ANGE-0.858
Oracle-0.797
 Banks-0.796
umber-0.794
 Pompe-0.792
Token2
Token ID17
Feature activation+1.07e+00

Pos logit contributions
ANGE+1.314
umber+1.312
Oracle+1.286
 Banks+1.263
 Pompe+1.254
Neg logit contributions
 toug-1.407
 doct-1.404
 communal-1.378
 grips-1.378
 whis-1.363
Token.
Token ID764
Feature activation+1.65e+00

Pos logit contributions
 doct+0.695
 toug+0.692
 grips+0.687
 communal+0.664
 whis+0.660
Neg logit contributions
umber-0.842
ANGE-0.840
Oracle-0.822
 Pompe-0.802
 Banks-0.795
Token morphology
Token ID46320
Feature activation-2.48e-01

Pos logit contributions
 toug+0.872
 doct+0.867
 communal+0.851
 grips+0.848
 livest+0.843
Neg logit contributions
umber-1.512
ANGE-1.506
Oracle-1.486
 Banks-1.461
 Pompe-1.449
Token hull
Token ID23644
Feature activation+9.55e-01

Pos logit contributions
 toug+1.253
 doct+1.243
 communal+1.237
 grips+1.221
 indo+1.210
Neg logit contributions
umber-1.741
ANGE-1.740
Oracle-1.706
 Banks-1.668
 Pompe-1.653
Token.
Token ID13
Feature activation+4.72e-01

Pos logit contributions
 toug+0.790
 doct+0.784
 communal+0.777
 grips+0.775
 indo+0.767
Neg logit contributions
ANGE-0.593
umber-0.590
Oracle-0.582
 Banks-0.564
 Pompe-0.562
Token In
Token ID554
Feature activation+6.66e-02

Pos logit contributions
 doct+0.345
 toug+0.345
 grips+0.339
 communal+0.338
 whis+0.333
Neg logit contributions
umber-0.338
ANGE-0.334
Oracle-0.329
 Banks-0.316
 Pompe-0.314
Token an
Token ID281
Feature activation+1.17e+00

Pos logit contributions
 toug+0.050
 doct+0.050
 communal+0.049
 grips+0.049
 indo+0.049
Neg logit contributions
umber-0.046
ANGE-0.046
Oracle-0.046
 Banks-0.044
 Pompe-0.044
Token often
Token ID1690
Feature activation+6.72e-01

Pos logit contributions
 toug+0.795
 doct+0.794
 communal+0.787
 indo+0.776
 grips+0.775
Neg logit contributions
ANGE-0.891
umber-0.889
Oracle-0.876
 Banks-0.854
 Pompe-0.849
Token grim
Token ID18288
Feature activation+4.41e+00

Pos logit contributions
 doct+0.474
 toug+0.474
 communal+0.468
 grips+0.462
 indo+0.461
Neg logit contributions
ANGE-0.498
umber-0.498
Oracle-0.488
 Banks-0.477
 Pompe-0.474
Token world
Token ID995
Feature activation+2.79e-01

Pos logit contributions
 communal+3.073
 doct+3.073
 toug+3.056
 grips+3.023
 indo+2.949
Neg logit contributions
ANGE-3.217
umber-3.214
Oracle-3.167
 Banks-3.085
 Pompe-3.046
Token,
Token ID11
Feature activation+1.42e+00

Pos logit contributions
 toug+0.214
 doct+0.213
 communal+0.211
 grips+0.210
 livest+0.209
Neg logit contributions
umber-0.190
ANGE-0.189
Oracle-0.186
 Banks-0.181
 Pompe-0.179
Token the
Token ID262
Feature activation+5.83e-01

Pos logit contributions
 toug+0.981
 doct+0.978
 communal+0.967
 grips+0.962
 livest+0.953
Neg logit contributions
umber-1.081
ANGE-1.073
Oracle-1.057
 Banks-1.025
 Pompe-1.014
Token GL
Token ID10188
Feature activation-3.21e-01

Pos logit contributions
 toug+0.381
 doct+0.380
 communal+0.375
 grips+0.373
 livest+0.371
Neg logit contributions
umber-0.464
ANGE-0.461
Oracle-0.455
 Banks-0.444
 Pompe-0.440
TokenOO
Token ID6684
Feature activation+1.02e-01

Pos logit contributions
umber+0.314
ANGE+0.310
Oracle+0.308
 Banks+0.305
 Pompe+0.303
Neg logit contributions
 toug-0.149
 doct-0.149
 grips-0.145
 communal-0.144
 whis-0.142
TokenI
Token ID40
Feature activation+5.53e-01

Pos logit contributions
 toug+2.823
 doct+2.820
 communal+2.781
 grips+2.766
 whis+2.746
Neg logit contributions
umber-2.122
ANGE-2.090
 Banks-2.052
 Pompe-2.037
Oracle-1.996
Token recently
Token ID2904
Feature activation-1.93e+00

Pos logit contributions
 toug+0.379
 doct+0.377
 communal+0.370
 grips+0.368
 livest+0.365
Neg logit contributions
umber-0.423
ANGE-0.420
Oracle-0.417
 Banks-0.405
 Pompe-0.403
Token covered
Token ID5017
Feature activation+1.09e+00

Pos logit contributions
umber+1.425
ANGE+1.415
Oracle+1.392
 Banks+1.371
 Pompe+1.356
Neg logit contributions
 toug-1.348
 doct-1.341
 communal-1.330
 grips-1.327
 livest-1.326
Token Ruby
Token ID10888
Feature activation+9.61e-01

Pos logit contributions
 toug+0.800
 doct+0.797
 communal+0.788
 grips+0.784
 livest+0.781
Neg logit contributions
umber-0.779
ANGE-0.777
Oracle-0.762
 Banks-0.742
 Pompe-0.736
Token block
Token ID2512
Feature activation-9.31e-02

Pos logit contributions
 doct+0.749
 toug+0.746
 grips+0.738
 communal+0.733
 whis+0.732
Neg logit contributions
umber-0.641
ANGE-0.638
Oracle-0.622
 Pompe-0.605
 Banks-0.605
Token basics
Token ID19165
Feature activation+4.21e+00

Pos logit contributions
ANGE+0.072
umber+0.072
Oracle+0.071
 Banks+0.070
 Pompe+0.069
Neg logit contributions
 toug-0.062
 doct-0.062
 grips-0.061
 communal-0.061
 livest-0.061
Token in
Token ID287
Feature activation+5.32e-01

Pos logit contributions
 doct+3.158
 toug+3.137
 grips+3.116
 communal+3.079
 whis+3.069
Neg logit contributions
ANGE-2.868
umber-2.860
Oracle-2.807
 Pompe-2.695
 Banks-2.688
Token my
Token ID616
Feature activation+8.05e-01

Pos logit contributions
 toug+0.376
 doct+0.375
 grips+0.370
 communal+0.369
 livest+0.366
Neg logit contributions
umber-0.393
ANGE-0.393
Oracle-0.386
 Banks-0.376
 Pompe-0.372
Token post
Token ID1281
Feature activation-1.28e+00

Pos logit contributions
 toug+0.565
 doct+0.563
 communal+0.558
 grips+0.556
 livest+0.552
Neg logit contributions
umber-0.602
ANGE-0.600
Oracle-0.587
 Banks-0.569
 Pompe-0.565
Token,
Token ID11
Feature activation+1.86e+00

Pos logit contributions
umber+0.885
ANGE+0.880
Oracle+0.865
 Banks+0.847
 Pompe+0.837
Neg logit contributions
 toug-0.960
 doct-0.956
 communal-0.944
 grips-0.941
 livest-0.938
Token More
Token ID3125
Feature activation+4.47e-01

Pos logit contributions
 toug+1.372
 doct+1.366
 communal+1.345
 grips+1.344
 livest+1.336
Neg logit contributions
umber-1.314
ANGE-1.311
Oracle-1.284
 Banks-1.238
 Pompe-1.237
Token
Token ID198
Feature activation+2.13e+00

Pos logit contributions
 toug+1.708
 doct+1.701
 communal+1.674
 grips+1.673
 whis+1.658
Neg logit contributions
umber-1.462
ANGE-1.458
Oracle-1.427
 Banks-1.361
 Pompe-1.357
Token
Token ID198
Feature activation+1.78e+00

Pos logit contributions
 doct+1.469
 toug+1.454
 grips+1.438
 communal+1.394
 whis+1.389
Neg logit contributions
umber-1.565
ANGE-1.554
Oracle-1.508
 Pompe-1.471
 Banks-1.442
TokenDuring
Token ID7191
Feature activation-8.00e-01

Pos logit contributions
 toug+1.364
 doct+1.356
 communal+1.334
 grips+1.332
 whis+1.312
Neg logit contributions
umber-1.186
ANGE-1.162
 Banks-1.131
 Pompe-1.128
Oracle-1.122
Token the
Token ID262
Feature activation+5.45e-01

Pos logit contributions
umber+0.530
ANGE+0.526
Oracle+0.515
 Banks+0.508
 Pompe+0.504
Neg logit contributions
 toug-0.623
 doct-0.623
 indo-0.615
 livest-0.615
 communal-0.614
Token 2008
Token ID3648
Feature activation+4.35e-01

Pos logit contributions
 toug+0.352
 doct+0.348
 communal+0.345
 grips+0.344
 recip+0.337
Neg logit contributions
umber-0.441
ANGE-0.438
Oracle-0.431
 Banks-0.420
 Pompe-0.414
Token presidential
Token ID4787
Feature activation+4.62e+00

Pos logit contributions
 toug+0.244
 doct+0.241
 communal+0.239
 grips+0.237
 indo+0.235
Neg logit contributions
umber-0.387
ANGE-0.385
Oracle-0.379
 Banks-0.372
 Pompe-0.368
Token campaign
Token ID1923
Feature activation+1.77e+00

Pos logit contributions
 toug+2.488
 doct+2.438
 communal+2.425
 grips+2.419
 recip+2.381
Neg logit contributions
umber-4.155
ANGE-4.135
Oracle-4.061
 Banks-3.990
 Pompe-3.917
Token,
Token ID11
Feature activation+8.79e-01

Pos logit contributions
 toug+1.465
 doct+1.464
 livest+1.447
 grips+1.446
 communal+1.444
Neg logit contributions
umber-1.097
ANGE-1.088
Oracle-1.066
 Banks-1.048
 Pompe-1.037
Token the
Token ID262
Feature activation+1.45e-01

Pos logit contributions
 toug+0.641
 doct+0.638
 communal+0.630
 grips+0.627
 livest+0.622
Neg logit contributions
umber-0.632
ANGE-0.630
Oracle-0.619
 Banks-0.603
 Pompe-0.596
Token NRA
Token ID23962
Feature activation-2.48e+00

Pos logit contributions
 toug+0.104
 doct+0.104
 communal+0.102
 grips+0.102
 indo+0.101
Neg logit contributions
umber-0.107
ANGE-0.106
Oracle-0.104
 Banks-0.102
 Pompe-0.101
Token Political
Token ID14611
Feature activation+1.96e+00

Pos logit contributions
umber+1.958
ANGE+1.942
Oracle+1.910
 Banks+1.883
 Pompe+1.862
Neg logit contributions
 toug-1.601
 doct-1.598
 livest-1.584
 communal-1.577
 grips-1.576
Token versus
Token ID9051
Feature activation-1.61e+00

Pos logit contributions
 toug+0.281
 doct+0.280
 communal+0.278
 grips+0.277
 indo+0.275
Neg logit contributions
umber-0.128
ANGE-0.128
Oracle-0.125
 Banks-0.121
 Pompe-0.119
Token cue
Token ID28381
Feature activation-1.75e+00

Pos logit contributions
umber+1.330
ANGE+1.325
Oracle+1.305
 Banks+1.280
 Pompe+1.270
Neg logit contributions
 toug-0.998
 doct-0.993
 communal-0.978
 grips-0.974
 livest-0.971
Token).
Token ID737
Feature activation-3.80e-01

Pos logit contributions
umber+0.822
ANGE+0.816
Oracle+0.794
 Banks+0.768
 Pompe+0.756
Neg logit contributions
 toug-1.716
 doct-1.710
 communal-1.694
 grips-1.690
 livest-1.686
Token https
Token ID3740
Feature activation+2.66e+00

Pos logit contributions
umber+0.263
ANGE+0.262
Oracle+0.257
 Banks+0.251
 Pompe+0.248
Neg logit contributions
 toug-0.286
 doct-0.285
 communal-0.282
 grips-0.281
 indo-0.279
Token://
Token ID1378
Feature activation+3.57e+00

Pos logit contributions
 doct+1.607
 grips+1.596
 toug+1.584
 communal+1.558
 cooker+1.509
Neg logit contributions
umber-2.106
ANGE-2.086
Oracle-2.049
 Banks-2.048
 Pompe-2.046
Tokendoi
Token ID34023
Feature activation+6.18e+00

Pos logit contributions
 grips+1.112
 toug+1.072
 doct+1.058
 communal+0.999
 whis+0.963
Neg logit contributions
ANGE-4.038
umber-4.034
Oracle-3.976
 Scarlett-3.940
 Banks-3.914
Token.
Token ID13
Feature activation+4.59e+00

Pos logit contributions
 doct+4.153
 grips+4.067
 toug+4.054
 communal+3.915
 whis+3.827
Neg logit contributions
ANGE-4.479
umber-4.362
 Pompe-4.256
Oracle-4.189
 Scarlett-4.136
Tokenorg
Token ID2398
Feature activation+1.56e+00

Pos logit contributions
 toug+2.491
 grips+2.458
 doct+2.451
 whis+2.378
 communal+2.373
Neg logit contributions
ANGE-4.051
umber-4.003
Oracle-3.987
 Banks-3.882
 Scarlett-3.860
Token/
Token ID14
Feature activation+6.60e+00

Pos logit contributions
 doct+1.420
 toug+1.397
 grips+1.389
 communal+1.380
 whis+1.364
Neg logit contributions
umber-0.823
ANGE-0.810
Oracle-0.770
 Pompe-0.765
 Banks-0.751
Token10
Token ID940
Feature activation+3.63e+00

Pos logit contributions
 toug+4.730
 doct+4.725
 grips+4.674
 communal+4.591
 whis+4.569
Neg logit contributions
ANGE-4.493
umber-4.328
 Pompe-4.317
Oracle-4.307
 Banks-4.280
Token.
Token ID13
Feature activation+3.43e+00

Pos logit contributions
 doct+2.693
 toug+2.673
 grips+2.638
 communal+2.575
 whis+2.559
Neg logit contributions
umber-2.481
ANGE-2.466
 Pompe-2.356
Oracle-2.349
 Banks-2.304
Token
Token ID247
Feature activation+1.27e+00

Pos logit contributions
 toug+1.739
 doct+1.732
 communal+1.731
 grips+1.703
 recip+1.699
Neg logit contributions
umber-1.298
Oracle-1.294
ANGE-1.285
 Banks-1.236
 Pompe-1.226
Tokens
Token ID82
Feature activation+2.27e+00

Pos logit contributions
 toug+1.104
 doct+1.101
 communal+1.090
 grips+1.087
 livest+1.082
Neg logit contributions
umber-0.730
ANGE-0.726
Oracle-0.714
 Banks-0.692
 Pompe-0.684
Token a
Token ID257
Feature activation+1.50e+00

Pos logit contributions
 toug+1.559
 doct+1.550
 communal+1.540
 grips+1.531
 whis+1.515
Neg logit contributions
ANGE-1.714
umber-1.710
Oracle-1.692
 Banks-1.622
 Pompe-1.619
Token group
Token ID1448
Feature activation-8.19e-01

Pos logit contributions
 toug+1.001
 doct+1.001
 communal+0.997
 grips+0.984
 recip+0.977
Neg logit contributions
umber-1.170
ANGE-1.167
Oracle-1.147
 Banks-1.105
 Pompe-1.100
Token of
Token ID286
Feature activation+1.27e+00

Pos logit contributions
umber+0.556
ANGE+0.552
Oracle+0.540
 Banks+0.531
 Pompe+0.525
Neg logit contributions
 toug-0.628
 doct-0.625
 communal-0.617
 grips-0.616
 livest-0.615
Token people
Token ID661
Feature activation+1.20e+00

Pos logit contributions
 toug+0.846
 doct+0.844
 communal+0.835
 grips+0.831
 whis+0.825
Neg logit contributions
ANGE-0.996
umber-0.995
Oracle-0.976
 Banks-0.942
 Pompe-0.939
Token who
Token ID508
Feature activation-7.04e-01

Pos logit contributions
 toug+0.907
 doct+0.903
 communal+0.892
 grips+0.890
 whis+0.885
Neg logit contributions
ANGE-0.839
umber-0.834
Oracle-0.826
 Banks-0.795
 Pompe-0.792
Token make
Token ID787
Feature activation+1.51e-01

Pos logit contributions
umber+0.500
ANGE+0.498
Oracle+0.490
 Banks+0.478
 Pompe+0.474
Neg logit contributions
 toug-0.520
 doct-0.517
 communal-0.511
 grips-0.509
 livest-0.507
Token a
Token ID257
Feature activation-6.73e-01

Pos logit contributions
 toug+0.106
 doct+0.106
 communal+0.105
 grips+0.104
 whis+0.103
Neg logit contributions
umber-0.113
ANGE-0.113
Oracle-0.111
 Banks-0.108
 Pompe-0.107
Token living
Token ID2877
Feature activation-8.65e-02

Pos logit contributions
umber+0.570
ANGE+0.568
Oracle+0.560
 Banks+0.547
 Pompe+0.544
Neg logit contributions
 toug-0.405
 doct-0.403
 communal-0.399
 grips-0.396
 livest-0.394
Token churn
Token ID28488
Feature activation+1.61e-01

Pos logit contributions
ANGE+0.063
umber+0.063
Oracle+0.061
 Banks+0.060
 Pompe+0.060
Neg logit contributions
 toug-0.063
 doct-0.063
 communal-0.062
 grips-0.062
 indo-0.061
Tokenestate
Token ID44146
Feature activation-1.45e+00

Pos logit contributions
ANGE+0.506
umber+0.506
Oracle+0.497
 Banks+0.487
 Pompe+0.485
Neg logit contributions
 toug-0.512
 doct-0.510
 communal-0.505
 grips-0.504
 livest-0.500
Token boom
Token ID14166
Feature activation-3.71e+00

Pos logit contributions
umber+1.037
ANGE+1.032
Oracle+1.011
 Banks+0.987
 Pompe+0.982
Neg logit contributions
 toug-1.068
 doct-1.066
 communal-1.053
 grips-1.052
 whis-1.042
Token raised
Token ID4376
Feature activation+1.13e+00

Pos logit contributions
umber+2.513
ANGE+2.493
Oracle+2.449
 Banks+2.415
 Pompe+2.355
Neg logit contributions
 toug-2.810
 doct-2.806
 livest-2.791
 communal-2.772
 indo-2.763
Token rents
Token ID28393
Feature activation-1.42e+00

Pos logit contributions
 toug+0.767
 doct+0.760
 communal+0.759
 grips+0.751
 whis+0.741
Neg logit contributions
ANGE-0.876
umber-0.873
Oracle-0.865
 Pompe-0.833
 Banks-0.831
Token beyond
Token ID3675
Feature activation-2.14e+00

Pos logit contributions
umber+0.919
ANGE+0.914
Oracle+0.897
 Banks+0.876
 Pompe+0.865
Neg logit contributions
 toug-1.139
 doct-1.135
 communal-1.121
 grips-1.118
 livest-1.117
Token the
Token ID262
Feature activation-2.86e-02

Pos logit contributions
umber+1.552
ANGE+1.545
Oracle+1.519
 Banks+1.485
 Pompe+1.471
Neg logit contributions
 toug-1.540
 doct-1.533
 communal-1.514
 grips-1.509
 livest-1.501
Token reach
Token ID3151
Feature activation+1.86e+00

Pos logit contributions
umber+0.022
ANGE+0.022
Oracle+0.022
 Pompe+0.021
 Banks+0.021
Neg logit contributions
 toug-0.019
 doct-0.019
 communal-0.019
 grips-0.019
 recip-0.019
Token of
Token ID286
Feature activation+2.50e-01

Pos logit contributions
 toug+1.310
 doct+1.309
 communal+1.302
 grips+1.299
 whis+1.278
Neg logit contributions
ANGE-1.377
umber-1.349
Oracle-1.331
 Pompe-1.303
 Banks-1.292
Token many
Token ID867
Feature activation+8.57e-02

Pos logit contributions
 toug+0.175
 doct+0.174
 communal+0.173
 grips+0.171
 whis+0.169
Neg logit contributions
umber-0.188
ANGE-0.188
Oracle-0.185
 Pompe-0.178
 Banks-0.178
Token people
Token ID661
Feature activation+2.30e-01

Pos logit contributions
 toug+0.059
 doct+0.059
 communal+0.058
 grips+0.058
 indo+0.057
Neg logit contributions
ANGE-0.065
umber-0.065
Oracle-0.064
 Banks-0.062
 Pompe-0.062
Token,
Token ID11
Feature activation-6.10e-01

Pos logit contributions
 toug+0.180
 doct+0.179
 communal+0.178
 grips+0.177
 whis+0.175
Neg logit contributions
ANGE-0.153
umber-0.152
Oracle-0.150
 Banks-0.145
 Pompe-0.145
Token shifted
Token ID14869
Feature activation+1.84e+00

Pos logit contributions
 toug+0.475
 doct+0.472
 grips+0.467
 communal+0.467
 indo+0.463
Neg logit contributions
umber-0.410
ANGE-0.409
Oracle-0.401
 Banks-0.392
 Pompe-0.388
Token over
Token ID625
Feature activation+1.12e+00

Pos logit contributions
 toug+1.377
 communal+1.368
 doct+1.362
 grips+1.360
 indo+1.342
Neg logit contributions
ANGE-1.301
umber-1.298
Oracle-1.263
 Banks-1.238
 Pompe-1.214
Token time
Token ID640
Feature activation+2.44e-01

Pos logit contributions
 toug+0.798
 doct+0.793
 communal+0.793
 grips+0.782
 indo+0.775
Neg logit contributions
ANGE-0.814
umber-0.813
Oracle-0.802
 Banks-0.780
 Pompe-0.766
Token.
Token ID13
Feature activation-4.57e-01

Pos logit contributions
 toug+0.198
 doct+0.197
 communal+0.196
 grips+0.195
 indo+0.192
Neg logit contributions
ANGE-0.155
umber-0.154
Oracle-0.151
 Banks-0.148
 Pompe-0.145
Token During
Token ID5856
Feature activation-2.11e+00

Pos logit contributions
umber+0.316
ANGE+0.313
Oracle+0.306
 Banks+0.295
 Pompe+0.291
Neg logit contributions
 toug-0.346
 doct-0.345
 communal-0.340
 grips-0.340
 indo-0.337
Token some
Token ID617
Feature activation+1.48e+00

Pos logit contributions
umber+1.448
ANGE+1.442
Oracle+1.415
 Banks+1.382
 Pompe+1.365
Neg logit contributions
 toug-1.613
 doct-1.605
 communal-1.587
 grips-1.582
 livest-1.575
Token periods
Token ID9574
Feature activation-1.27e+00

Pos logit contributions
 toug+1.116
 communal+1.115
 doct+1.111
 grips+1.098
 indo+1.090
Neg logit contributions
ANGE-1.039
umber-1.032
Oracle-1.009
 Banks-0.979
 Pompe-0.951
Token it
Token ID340
Feature activation-1.12e+00

Pos logit contributions
umber+0.840
ANGE+0.835
Oracle+0.820
 Banks+0.803
 Pompe+0.798
Neg logit contributions
 toug-0.994
 doct-0.991
 livest-0.976
 communal-0.975
 grips-0.975
Token was
Token ID373
Feature activation-1.75e+00

Pos logit contributions
umber+0.845
ANGE+0.845
Oracle+0.829
 Banks+0.809
 Pompe+0.799
Neg logit contributions
 toug-0.787
 doct-0.780
 communal-0.768
 grips-0.765
 indo-0.759
Token the
Token ID262
Feature activation-2.14e+00

Pos logit contributions
umber+1.323
ANGE+1.317
Oracle+1.295
 Banks+1.268
 Pompe+1.256
Neg logit contributions
 toug-1.211
 doct-1.205
 communal-1.190
 grips-1.185
 livest-1.179
Token buffer
Token ID11876
Feature activation+1.52e-01

Pos logit contributions
umber+1.729
ANGE+1.722
Oracle+1.694
 Banks+1.660
 Pompe+1.646
Neg logit contributions
 toug-1.370
 doct-1.363
 communal-1.345
 grips-1.340
 indo-1.333
Token |
Token ID930
Feature activation+2.51e+00

Pos logit contributions
 toug+1.049
 doct+1.048
 livest+1.033
 grips+1.031
 communal+1.030
Neg logit contributions
umber-0.764
ANGE-0.757
Oracle-0.740
 Banks-0.723
 Pompe-0.713
Token ISI
Token ID22048
Feature activation+6.07e-02

Pos logit contributions
 doct+1.996
 toug+1.969
 grips+1.959
 communal+1.948
 cooker+1.889
Neg logit contributions
umber-1.623
ANGE-1.533
Oracle-1.486
 Banks-1.427
 trump-1.423
Token
Token ID198
Feature activation+6.75e-01

Pos logit contributions
 toug+0.045
 doct+0.045
 grips+0.045
 communal+0.044
 whis+0.044
Neg logit contributions
umber-0.042
ANGE-0.041
Oracle-0.040
 Pompe-0.039
 Banks-0.039
Token
Token ID198
Feature activation+5.78e-01

Pos logit contributions
 toug+0.487
 doct+0.487
 grips+0.481
 communal+0.469
 whis+0.467
Neg logit contributions
umber-0.482
ANGE-0.480
Oracle-0.464
 Pompe-0.449
 Banks-0.447
TokenP
Token ID47
Feature activation-1.59e+00

Pos logit contributions
 toug+0.509
 doct+0.508
 grips+0.501
 communal+0.501
 whis+0.495
Neg logit contributions
umber-0.321
ANGE-0.318
Oracle-0.302
 Banks-0.301
 Pompe-0.296
Tokenel
Token ID417
Feature activation+6.09e-01

Pos logit contributions
umber+1.112
ANGE+1.106
Oracle+1.086
 Banks+1.062
 Pompe+1.050
Neg logit contributions
 toug-1.190
 doct-1.185
 communal-1.170
 grips-1.166
 livest-1.164
Tokenleg
Token ID1455
Feature activation-1.19e+00

Pos logit contributions
 toug+0.336
 doct+0.335
 communal+0.327
 grips+0.327
 livest+0.325
Neg logit contributions
ANGE-0.543
umber-0.540
Oracle-0.534
 Banks-0.526
 Pompe-0.520
Tokenr
Token ID81
Feature activation-2.40e+00

Pos logit contributions
ANGE+0.591
Oracle+0.570
umber+0.567
 Banks+0.545
 Pompe+0.539
Neg logit contributions
 toug-1.131
 doct-1.127
 communal-1.114
 grips-1.112
 whis-1.104
Tokenini
Token ID5362
Feature activation+5.44e-01

Pos logit contributions
ANGE+1.875
umber+1.863
Oracle+1.838
 Banks+1.802
 Pompe+1.790
Neg logit contributions
 toug-1.588
 doct-1.583
 grips-1.553
 communal-1.545
 recip-1.525
Token,
Token ID11
Feature activation-1.03e+00

Pos logit contributions
 doct+0.407
 toug+0.406
 grips+0.400
 communal+0.396
 whis+0.390
Neg logit contributions
umber-0.376
ANGE-0.375
Oracle-0.365
 Banks-0.351
 Pompe-0.351
Token A
Token ID317
Feature activation+1.45e+00

Pos logit contributions
umber+0.821
ANGE+0.817
Oracle+0.804
 Banks+0.781
 Pompe+0.776
Neg logit contributions
 toug-0.663
 doct-0.661
 communal-0.649
 grips-0.649
 whis-0.639
Token,
Token ID11
Feature activation-8.40e-02

Pos logit contributions
 toug+0.965
 doct+0.960
 communal+0.952
 grips+0.946
 whis+0.940
Neg logit contributions
umber-0.850
ANGE-0.848
Oracle-0.830
 Banks-0.807
 Pompe-0.798
Token we
Token ID356
Feature activation-9.72e-01

Pos logit contributions
umber+0.058
ANGE+0.058
Oracle+0.057
 Banks+0.055
 Pompe+0.054
Neg logit contributions
 toug-0.064
 doct-0.063
 communal-0.063
 grips-0.063
 whis-0.062
Token just
Token ID655
Feature activation+9.49e-01

Pos logit contributions
umber+0.765
ANGE+0.763
Oracle+0.751
 Banks+0.736
 Pompe+0.729
Neg logit contributions
 toug-0.642
 doct-0.639
 communal-0.629
 grips-0.627
 livest-0.624
Token took
Token ID1718
Feature activation-3.59e-01

Pos logit contributions
 toug+0.608
 doct+0.604
 communal+0.593
 grips+0.590
 indo+0.584
Neg logit contributions
umber-0.770
ANGE-0.769
Oracle-0.757
 Banks-0.740
 Pompe-0.733
Token away
Token ID1497
Feature activation+1.66e+00

Pos logit contributions
umber+0.260
ANGE+0.259
Oracle+0.254
 Banks+0.249
 Pompe+0.246
Neg logit contributions
 toug-0.260
 doct-0.259
 communal-0.255
 grips-0.255
 livest-0.254
Token the
Token ID262
Feature activation+3.91e-01

Pos logit contributions
 toug+1.169
 doct+1.165
 communal+1.152
 grips+1.147
 livest+1.140
Neg logit contributions
umber-1.231
ANGE-1.229
Oracle-1.208
 Banks-1.176
 Pompe-1.167
Token 1
Token ID352
Feature activation-2.61e-01

Pos logit contributions
 toug+0.203
 doct+0.202
 communal+0.198
 grips+0.198
 livest+0.197
Neg logit contributions
umber-0.361
ANGE-0.360
Oracle-0.355
 Banks-0.350
 Pompe-0.347
Token mana
Token ID13149
Feature activation+7.15e-01

Pos logit contributions
umber+0.205
ANGE+0.204
Oracle+0.201
 Banks+0.197
 Pompe+0.196
Neg logit contributions
 toug-0.172
 doct-0.171
 communal-0.169
 grips-0.168
 livest-0.168
Token ability
Token ID2694
Feature activation-4.00e+00

Pos logit contributions
 toug+0.525
 doct+0.522
 communal+0.517
 grips+0.514
 livest+0.511
Neg logit contributions
umber-0.510
ANGE-0.508
Oracle-0.500
 Banks-0.487
 Pompe-0.483
Token on
Token ID319
Feature activation-5.68e-01

Pos logit contributions
umber+2.468
ANGE+2.405
Oracle+2.373
 Banks+2.332
 Pompe+2.294
Neg logit contributions
 doct-3.283
 toug-3.281
 indo-3.280
 livest-3.273
 communal-3.237
Token that
Token ID326
Feature activation-1.11e+00

Pos logit contributions
umber+0.498
ANGE+0.495
Oracle+0.489
 Banks+0.480
 Pompe+0.476
Neg logit contributions
 toug-0.323
 doct-0.321
 communal-0.316
 grips-0.315
 livest-0.314
Token You
Token ID921
Feature activation-1.65e+00

Pos logit contributions
umber+0.481
ANGE+0.475
Oracle+0.468
 Banks+0.455
 Pompe+0.453
Neg logit contributions
 toug-0.497
 doct-0.495
 grips-0.490
 communal-0.484
 whis-0.481
Token may
Token ID743
Feature activation-8.67e-01

Pos logit contributions
umber+0.981
ANGE+0.976
Oracle+0.959
 Banks+0.925
 Pompe+0.918
Neg logit contributions
 toug-1.412
 doct-1.404
 grips-1.385
 communal-1.385
 livest-1.376
Token need
Token ID761
Feature activation-1.14e+00

Pos logit contributions
ANGE+0.604
umber+0.603
Oracle+0.594
 Banks+0.578
 Pompe+0.574
Neg logit contributions
 toug-0.654
 doct-0.649
 grips-0.640
 communal-0.640
 livest-0.634
Token to
Token ID284
Feature activation+1.71e-01

Pos logit contributions
umber+0.766
ANGE+0.763
Oracle+0.746
 Banks+0.731
 Pompe+0.723
Neg logit contributions
 toug-0.882
 doct-0.880
 communal-0.870
 livest-0.870
 indo-0.868
Token pull
Token ID2834
Feature activation+8.15e-01

Pos logit contributions
 toug+0.116
 doct+0.114
 grips+0.113
 communal+0.111
 whis+0.110
Neg logit contributions
umber-0.135
ANGE-0.134
Oracle-0.133
 Banks-0.129
 Pompe-0.128
Token up
Token ID510
Feature activation-4.27e+00

Pos logit contributions
 toug+0.591
 doct+0.586
 grips+0.584
 communal+0.579
 recip+0.572
Neg logit contributions
umber-0.591
ANGE-0.587
Oracle-0.582
 Banks-0.561
 Pompe-0.557
Token the
Token ID262
Feature activation-8.12e-01

Pos logit contributions
umber+2.867
ANGE+2.838
Oracle+2.769
 Banks+2.736
 Pompe+2.704
Neg logit contributions
 doct-3.262
 toug-3.261
 indo-3.226
 livest-3.215
 communal-3.212
Token e
Token ID304
Feature activation+2.70e-01

Pos logit contributions
umber+0.662
ANGE+0.660
Oracle+0.650
 Banks+0.636
 Pompe+0.631
Neg logit contributions
 toug-0.514
 doct-0.511
 communal-0.504
 grips-0.503
 livest-0.499
Token-
Token ID12
Feature activation-2.47e+00

Pos logit contributions
 toug+0.190
 doct+0.188
 grips+0.188
 communal+0.185
 whis+0.183
Neg logit contributions
ANGE-0.201
umber-0.200
Oracle-0.198
 Pompe-0.193
 Banks-0.193
Tokenbra
Token ID16057
Feature activation-1.03e+00

Pos logit contributions
umber+1.753
ANGE+1.742
Oracle+1.704
 Banks+1.671
 Pompe+1.643
Neg logit contributions
 toug-1.821
 doct-1.820
 communal-1.794
 indo-1.784
 livest-1.783
Tokenke
Token ID365
Feature activation-2.12e+00

Pos logit contributions
ANGE+0.584
umber+0.576
Oracle+0.573
 Banks+0.551
 Pompe+0.551
Neg logit contributions
 toug-0.901
 grips-0.895
 doct-0.895
 communal-0.885
 whis-0.880
Token of
Token ID286
Feature activation+8.60e-01

Pos logit contributions
umber+0.821
ANGE+0.817
Oracle+0.803
 Banks+0.784
 Pompe+0.775
Neg logit contributions
 toug-0.873
 doct-0.869
 communal-0.858
 grips-0.856
 livest-0.851
Token very
Token ID845
Feature activation-5.34e-01

Pos logit contributions
 toug+0.591
 doct+0.587
 grips+0.580
 communal+0.577
 whis+0.571
Neg logit contributions
umber-0.668
ANGE-0.666
Oracle-0.663
 Banks-0.630
 Pompe-0.620
Token good
Token ID922
Feature activation-1.26e+00

Pos logit contributions
umber+0.400
ANGE+0.397
Oracle+0.393
 Banks+0.380
 Pompe+0.376
Neg logit contributions
 toug-0.380
 doct-0.378
 communal-0.372
 grips-0.371
 whis-0.366
Token chances
Token ID8395
Feature activation-1.08e+00

Pos logit contributions
umber+1.028
ANGE+1.023
Oracle+1.011
 Banks+0.982
 Pompe+0.973
Neg logit contributions
 toug-0.799
 doct-0.795
 communal-0.785
 grips-0.784
 indo-0.773
Token and
Token ID290
Feature activation-7.07e-01

Pos logit contributions
ANGE+0.730
umber+0.729
Oracle+0.716
 Banks+0.694
 Pompe+0.685
Neg logit contributions
 toug-0.834
 doct-0.827
 communal-0.817
 grips-0.816
 whis-0.809
Token in
Token ID287
Feature activation-1.84e+00

Pos logit contributions
umber+0.515
ANGE+0.512
Oracle+0.507
 Banks+0.486
 Pompe+0.480
Neg logit contributions
 toug-0.516
 doct-0.511
 grips-0.506
 communal-0.505
 whis-0.498
Token international
Token ID3230
Feature activation-6.69e-01

Pos logit contributions
umber+1.290
ANGE+1.285
Oracle+1.263
 Banks+1.232
 Pompe+1.219
Neg logit contributions
 toug-1.376
 doct-1.370
 communal-1.354
 grips-1.349
 livest-1.342
Token football
Token ID4346
Feature activation+3.53e+00

Pos logit contributions
umber+0.507
ANGE+0.506
Oracle+0.505
 Banks+0.486
 Pompe+0.477
Neg logit contributions
 toug-0.464
 doct-0.459
 communal-0.456
 grips-0.452
 indo-0.446
Token you
Token ID345
Feature activation-1.56e+00

Pos logit contributions
 toug+2.530
 grips+2.484
 communal+2.477
 doct+2.475
 whis+2.425
Neg logit contributions
Oracle-2.622
umber-2.577
ANGE-2.575
 Banks-2.443
 Pompe-2.398
Token have
Token ID423
Feature activation-1.10e+00

Pos logit contributions
umber+1.106
ANGE+1.102
Oracle+1.085
 Banks+1.057
 Pompe+1.046
Neg logit contributions
 toug-1.156
 doct-1.148
 communal-1.134
 grips-1.131
 livest-1.125
Token to
Token ID284
Feature activation-1.29e+00

Pos logit contributions
umber+0.773
ANGE+0.769
Oracle+0.757
 Banks+0.734
 Pompe+0.726
Neg logit contributions
 toug-0.820
 doct-0.816
 communal-0.807
 grips-0.803
 indo-0.797
Token
Token ID447
Feature activation+3.38e-01

Pos logit contributions
umber+0.371
ANGE+0.368
Oracle+0.362
 Banks+0.352
 Pompe+0.349
Neg logit contributions
 toug-0.439
 doct-0.437
 communal-0.431
 grips-0.430
 indo-0.427
Token
Token ID251
Feature activation+3.99e-02

Pos logit contributions
 toug+0.274
 doct+0.274
 communal+0.270
 grips+0.269
 recip+0.267
Neg logit contributions
ANGE-0.213
umber-0.213
Oracle-0.209
 Banks-0.202
 Pompe-0.201
Token said
Token ID531
Feature activation+1.95e+00

Pos logit contributions
 toug+0.031
 doct+0.031
 communal+0.030
 grips+0.030
 livest+0.030
Neg logit contributions
umber-0.027
ANGE-0.027
Oracle-0.026
 Banks-0.025
 Pompe-0.025
Token Jess
Token ID12707
Feature activation-1.55e+00

Pos logit contributions
 doct+1.420
 toug+1.411
 communal+1.393
 grips+1.380
 recip+1.372
Neg logit contributions
umber-1.395
ANGE-1.390
Oracle-1.375
 Banks-1.310
 Pompe-1.310
Tokena
Token ID64
Feature activation+5.31e-02

Pos logit contributions
ANGE+1.167
umber+1.167
Oracle+1.150
 Banks+1.117
 Pompe+1.114
Neg logit contributions
 toug-1.066
 doct-1.063
 grips-1.045
 communal-1.044
 whis-1.033
Token.
Token ID13
Feature activation-5.52e-01

Pos logit contributions
 toug+0.043
 doct+0.043
 communal+0.043
 grips+0.043
 livest+0.042
Neg logit contributions
umber-0.033
ANGE-0.033
Oracle-0.033
 Banks-0.032
 Pompe-0.031
Token
Token ID564
Feature activation+7.53e-01

Pos logit contributions
umber+0.381
ANGE+0.378
Oracle+0.371
 Banks+0.362
 Pompe+0.359
Neg logit contributions
 toug-0.417
 doct-0.417
 communal-0.410
 grips-0.409
 indo-0.407
Token
Token ID250
Feature activation+1.39e+00

Pos logit contributions
 toug+0.526
 doct+0.526
 communal+0.513
 grips+0.512
 indo+0.505
Neg logit contributions
umber-0.561
ANGE-0.553
Oracle-0.540
 Banks-0.532
 Pompe-0.529
TokenIn
Token ID818
Feature activation+1.84e-01

Pos logit contributions
 doct+1.190
 toug+1.189
 communal+1.163
 grips+1.158
 whis+1.151
Neg logit contributions
umber-0.819
ANGE-0.795
Oracle-0.774
 Pompe-0.769
 Banks-0.766
Token Josh
Token ID8518
Feature activation+2.03e+00

Pos logit contributions
 toug+0.141
 doct+0.141
 communal+0.139
 grips+0.138
 indo+0.136
Neg logit contributions
umber-0.125
ANGE-0.125
Oracle-0.124
 Banks-0.119
 Pompe-0.118
Token
Token ID447
Feature activation+2.12e+00

Pos logit contributions
 toug+1.636
 doct+1.629
 communal+1.610
 grips+1.605
 livest+1.601
Neg logit contributions
umber-1.301
ANGE-1.294
Oracle-1.268
 Banks-1.240
 Pompe-1.224
Token causes
Token ID5640
Feature activation-1.04e+00

Pos logit contributions
umber+1.202
ANGE+1.196
Oracle+1.177
 Banks+1.146
 Pompe+1.136
Neg logit contributions
 toug-1.128
 doct-1.122
 communal-1.108
 grips-1.107
 indo-1.097
Token you
Token ID345
Feature activation-9.00e-01

Pos logit contributions
umber+0.781
ANGE+0.776
Oracle+0.766
 Banks+0.745
 Pompe+0.738
Neg logit contributions
 toug-0.728
 doct-0.724
 communal-0.717
 grips-0.714
 livest-0.706
Token to
Token ID284
Feature activation-1.06e+00

Pos logit contributions
umber+0.609
ANGE+0.605
Oracle+0.585
 Banks+0.580
 Pompe+0.571
Neg logit contributions
 doct-0.690
 toug-0.689
 livest-0.677
 indo-0.675
 grips-0.675
Token land
Token ID1956
Feature activation-9.60e-01

Pos logit contributions
umber+0.826
ANGE+0.823
Oracle+0.810
 Banks+0.794
 Pompe+0.786
Neg logit contributions
 toug-0.704
 doct-0.701
 communal-0.691
 grips-0.689
 livest-0.686
Token in
Token ID287
Feature activation-1.98e+00

Pos logit contributions
umber+0.613
ANGE+0.610
Oracle+0.599
 Banks+0.583
 Pompe+0.577
Neg logit contributions
 toug-0.776
 doct-0.773
 communal-0.764
 grips-0.762
 livest-0.758
Token jail
Token ID7356
Feature activation-5.12e-01

Pos logit contributions
umber+1.606
ANGE+1.594
Oracle+1.563
 Banks+1.552
 Pompe+1.538
Neg logit contributions
 toug-1.255
 doct-1.246
 livest-1.232
 indo-1.225
 recip-1.219
Token.
Token ID13
Feature activation-1.43e+00

Pos logit contributions
umber+0.315
ANGE+0.315
Oracle+0.309
 Banks+0.301
 Pompe+0.298
Neg logit contributions
 toug-0.425
 doct-0.423
 communal-0.419
 grips-0.418
 indo-0.415
Token The
Token ID383
Feature activation-8.99e-01

Pos logit contributions
umber+1.020
ANGE+1.013
Oracle+0.996
 Banks+0.963
 Pompe+0.958
Neg logit contributions
 toug-1.053
 doct-1.051
 grips-1.033
 communal-1.033
 indo-1.022
Token trouble
Token ID5876
Feature activation-4.76e-01

Pos logit contributions
umber+0.695
ANGE+0.693
Oracle+0.682
 Banks+0.665
 Pompe+0.661
Neg logit contributions
 toug-0.608
 doct-0.604
 communal-0.597
 grips-0.596
 livest-0.590
Token is
Token ID318
Feature activation+5.76e-01

Pos logit contributions
umber+0.281
ANGE+0.279
Oracle+0.274
 Banks+0.266
 Pompe+0.263
Neg logit contributions
 toug-0.407
 doct-0.405
 communal-0.401
 grips-0.400
 livest-0.399
Token that
Token ID326
Feature activation-2.06e+00

Pos logit contributions
 toug+0.443
 doct+0.441
 communal+0.437
 grips+0.436
 indo+0.431
Neg logit contributions
umber-0.391
ANGE-0.389
Oracle-0.384
 Banks-0.371
 Pompe-0.369
Token.
Token ID13
Feature activation-3.65e-01

Pos logit contributions
umber+1.965
ANGE+1.947
Oracle+1.911
 Banks+1.864
 Pompe+1.843
Neg logit contributions
 toug-2.786
 doct-2.780
 communal-2.745
 livest-2.742
 grips-2.739
Token
Token ID198
Feature activation-3.99e-01

Pos logit contributions
umber+0.250
ANGE+0.247
Oracle+0.244
 Banks+0.234
 Pompe+0.232
Neg logit contributions
 doct-0.279
 toug-0.279
 grips-0.273
 communal-0.273
 whis-0.271
Token
Token ID198
Feature activation-3.91e-01

Pos logit contributions
umber+0.274
ANGE+0.273
Oracle+0.266
 Pompe+0.260
 Banks+0.258
Neg logit contributions
 toug-0.300
 doct-0.300
 grips-0.296
 communal-0.291
 whis-0.289
TokenConnor
Token ID27136
Feature activation-1.17e+00

Pos logit contributions
umber+0.247
ANGE+0.243
Oracle+0.237
 Banks+0.235
 Pompe+0.233
Neg logit contributions
 toug-0.316
 doct-0.316
 communal-0.311
 grips-0.310
 whis-0.308
Token
Token ID447
Feature activation+2.38e-01

Pos logit contributions
umber+0.761
ANGE+0.757
Oracle+0.743
 Banks+0.725
 Pompe+0.717
Neg logit contributions
 toug-0.925
 doct-0.922
 communal-0.911
 grips-0.908
 livest-0.905
Token
Token ID247
Feature activation-3.86e-01

Pos logit contributions
 toug+0.193
 doct+0.192
 communal+0.192
 grips+0.190
 whis+0.189
Neg logit contributions
ANGE-0.148
umber-0.148
Oracle-0.147
 Banks-0.142
 Pompe-0.141
Tokens
Token ID82
Feature activation-3.76e-02

Pos logit contributions
umber+0.233
ANGE+0.233
Oracle+0.229
 Banks+0.222
 Pompe+0.220
Neg logit contributions
 toug-0.325
 doct-0.324
 communal-0.320
 grips-0.320
 livest-0.318
Token play
Token ID711
Feature activation+7.09e-02

Pos logit contributions
ANGE+0.028
umber+0.028
Oracle+0.027
 Banks+0.027
 Pompe+0.026
Neg logit contributions
 toug-0.027
 doct-0.027
 communal-0.026
 livest-0.026
 grips-0.026
Token with
Token ID351
Feature activation-3.72e+00

Pos logit contributions
 toug+0.055
 doct+0.054
 grips+0.054
 communal+0.054
 indo+0.053
Neg logit contributions
ANGE-0.048
umber-0.048
Oracle-0.047
 Banks-0.046
 Pompe-0.045
Token Blake
Token ID13307
Feature activation-2.04e+00

Pos logit contributions
umber+2.754
ANGE+2.754
Oracle+2.682
 Banks+2.650
 Pompe+2.630
Neg logit contributions
 toug-2.583
 doct-2.581
 livest-2.550
 indo-2.549
 communal-2.543
Token Wheeler
Token ID27512
Feature activation+1.82e-02

Pos logit contributions
umber+1.376
ANGE+1.368
Oracle+1.347
 Banks+1.308
 Pompe+1.297
Neg logit contributions
 toug-1.582
 doct-1.575
 communal-1.555
 grips-1.553
 livest-1.543
Token in
Token ID287
Feature activation-8.73e-01

Pos logit contributions
umber+0.961
ANGE+0.957
Oracle+0.937
 Banks+0.919
 Pompe+0.910
Neg logit contributions
 toug-1.050
 doct-1.044
 grips-1.031
 communal-1.030
 livest-1.029
Token common
Token ID2219
Feature activation-4.26e-01

Pos logit contributions
umber+0.715
ANGE+0.711
Oracle+0.699
 Banks+0.690
 Pompe+0.683
Neg logit contributions
 toug-0.547
 doct-0.543
 livest-0.535
 grips-0.533
 indo-0.533
Token between
Token ID1022
Feature activation+2.45e+00

Pos logit contributions
umber+0.254
ANGE+0.249
Oracle+0.245
 Banks+0.239
 Pompe+0.236
Neg logit contributions
 toug-0.364
 doct-0.361
 livest-0.359
 indo-0.357
 communal-0.356
Token the
Token ID262
Feature activation-1.59e+00

Pos logit contributions
 doct+1.774
 toug+1.764
 communal+1.752
 grips+1.731
 indo+1.712
Neg logit contributions
umber-1.774
ANGE-1.773
Oracle-1.767
 Banks-1.676
 Pompe-1.675
Token Mil
Token ID4460
Feature activation+2.56e-01

Pos logit contributions
umber+1.218
ANGE+1.209
Oracle+1.183
 Banks+1.179
 Pompe+1.165
Neg logit contributions
 toug-1.073
 doct-1.060
 livest-1.056
 communal-1.046
 grips-1.046
Tokenit
Token ID270
Feature activation-4.85e+00

Pos logit contributions
 doct+0.173
 toug+0.171
 grips+0.169
 communal+0.168
 whis+0.164
Neg logit contributions
ANGE-0.201
Oracle-0.197
umber-0.195
 Pompe-0.190
 Banks-0.189
Tokenants
Token ID1187
Feature activation-2.57e+00

Pos logit contributions
umber+2.995
ANGE+2.919
 Banks+2.864
Oracle+2.864
 Pompe+2.787
Neg logit contributions
 toug-4.026
 livest-4.010
 doct-3.973
 indo-3.951
 recip-3.931
Token and
Token ID290
Feature activation-1.19e-01

Pos logit contributions
umber+1.591
ANGE+1.572
 Banks+1.519
Oracle+1.510
 Pompe+1.485
Neg logit contributions
 toug-2.103
 livest-2.086
 doct-2.080
 grips-2.070
 whis-2.057
Token the
Token ID262
Feature activation-1.53e+00

Pos logit contributions
umber+0.081
ANGE+0.080
Oracle+0.079
 Banks+0.077
 Pompe+0.076
Neg logit contributions
 toug-0.091
 doct-0.090
 communal-0.089
 grips-0.089
 livest-0.089
Token so
Token ID523
Feature activation-3.12e+00

Pos logit contributions
umber+1.178
ANGE+1.170
Oracle+1.147
 Banks+1.140
 Pompe+1.127
Neg logit contributions
 toug-1.029
 doct-1.015
 livest-1.010
 communal-1.007
 indo-1.003
Token-
Token ID12
Feature activation+7.56e-01

Pos logit contributions
umber+2.202
ANGE+2.192
Oracle+2.154
 Banks+2.107
 Pompe+2.085
Neg logit contributions
 toug-2.309
 doct-2.299
 communal-2.270
 grips-2.263
 livest-2.255
Token.
Token ID13
Feature activation+1.47e+00

Pos logit contributions
 toug+1.298
 doct+1.298
 grips+1.284
 communal+1.275
 whis+1.260
Neg logit contributions
ANGE-0.975
umber-0.974
Oracle-0.959
 Pompe-0.923
 Banks-0.910
Token [
Token ID685
Feature activation-3.43e-02

Pos logit contributions
 toug+1.009
 doct+1.006
 communal+0.992
 grips+0.989
 livest+0.985
Neg logit contributions
umber-1.121
ANGE-1.116
Oracle-1.100
 Banks-1.072
 Pompe-1.065
Tokencolor
Token ID8043
Feature activation+9.98e-01

Pos logit contributions
umber+0.027
ANGE+0.027
Oracle+0.027
 Banks+0.026
 Pompe+0.026
Neg logit contributions
 toug-0.023
 doct-0.022
 communal-0.022
 grips-0.022
 livest-0.022
Token=
Token ID28
Feature activation-2.77e-01

Pos logit contributions
 doct+0.686
 toug+0.684
 grips+0.669
 communal+0.664
 whis+0.656
Neg logit contributions
umber-0.735
Oracle-0.730
ANGE-0.727
 Pompe-0.714
 Banks-0.706
Tokenred
Token ID445
Feature activation+1.40e+00

Pos logit contributions
umber+0.188
ANGE+0.187
Oracle+0.183
 Banks+0.178
 Pompe+0.176
Neg logit contributions
 toug-0.214
 doct-0.213
 communal-0.211
 grips-0.210
 livest-0.210
Token][
Token ID7131
Feature activation-6.87e+00

Pos logit contributions
 toug+1.024
 doct+1.019
 communal+1.006
 grips+1.003
 livest+1.000
Neg logit contributions
umber-1.008
ANGE-1.003
Oracle-0.985
 Banks-0.965
 Pompe-0.954
Tokenb
Token ID65
Feature activation-1.25e-01

Pos logit contributions
umber+3.228
ANGE+2.996
Oracle+2.925
 Banks+2.855
tr+2.841
Neg logit contributions
 toug-6.732
 recip-6.646
 indo-6.626
 doct-6.621
 livest-6.619
Token]
Token ID60
Feature activation-3.47e+00

Pos logit contributions
umber+0.077
ANGE+0.077
Oracle+0.076
 Banks+0.074
 Pompe+0.073
Neg logit contributions
 doct-0.102
 toug-0.102
 grips-0.101
 communal-0.099
 whis-0.098
TokenU
Token ID52
Feature activation-9.94e-01

Pos logit contributions
ANGE+2.019
umber+2.018
Oracle+1.993
 Banks+1.917
 Pompe+1.877
Neg logit contributions
 toug-3.007
 doct-2.996
 communal-2.966
 indo-2.964
 livest-2.954
Tokenf
Token ID69
Feature activation-1.65e+00

Pos logit contributions
umber+0.639
ANGE+0.637
Oracle+0.626
 Banks+0.609
 Pompe+0.604
Neg logit contributions
 toug-0.797
 doct-0.793
 communal-0.784
 grips-0.783
 livest-0.779
Tokenou
Token ID280
Feature activation+1.00e+00

Pos logit contributions
ANGE+1.000
umber+0.997
Oracle+0.976
 Banks+0.953
 Pompe+0.941
Neg logit contributions
 toug-1.382
 doct-1.377
 grips-1.359
 communal-1.359
 livest-1.349
Token the
Token ID262
Feature activation+9.55e-01

Pos logit contributions
umber+0.029
ANGE+0.028
Oracle+0.028
 Banks+0.027
 Pompe+0.027
Neg logit contributions
 toug-0.026
 doct-0.026
 communal-0.026
 grips-0.025
 livest-0.025
Token political
Token ID1964
Feature activation+2.08e+00

Pos logit contributions
 doct+0.606
 toug+0.602
 communal+0.598
 grips+0.591
 indo+0.584
Neg logit contributions
umber-0.780
ANGE-0.778
Oracle-0.768
 Banks-0.745
 Pompe-0.739
Token spectrum
Token ID10958
Feature activation+9.24e-01

Pos logit contributions
 doct+1.188
 communal+1.180
 toug+1.175
 grips+1.160
 indo+1.147
Neg logit contributions
umber-1.842
ANGE-1.829
Oracle-1.808
 Banks-1.753
 Pompe-1.735
Token to
Token ID284
Feature activation-1.19e+00

Pos logit contributions
 toug+0.748
 doct+0.744
 communal+0.739
 grips+0.733
 indo+0.725
Neg logit contributions
ANGE-0.592
umber-0.587
Oracle-0.583
 Banks-0.560
 Pompe-0.556
Token make
Token ID787
Feature activation-3.66e-01

Pos logit contributions
umber+0.913
ANGE+0.909
Oracle+0.895
 Banks+0.875
 Pompe+0.867
Neg logit contributions
 toug-0.815
 doct-0.810
 communal-0.800
 grips-0.797
 indo-0.793
Token transgender
Token ID10637
Feature activation-4.87e+00

Pos logit contributions
umber+0.268
ANGE+0.266
Oracle+0.263
 Banks+0.256
 Pompe+0.252
Neg logit contributions
 doct-0.264
 toug-0.263
 communal-0.260
 grips-0.258
 indo-0.256
Token people
Token ID661
Feature activation+3.62e-01

Pos logit contributions
ANGE+3.889
umber+3.878
Oracle+3.769
 Banks+3.729
 Pompe+3.728
Neg logit contributions
 toug-3.105
 doct-3.061
 livest-3.045
 indo-3.019
 communal-3.006
Token seem
Token ID1283
Feature activation+1.96e+00

Pos logit contributions
 toug+0.256
 doct+0.255
 communal+0.252
 grips+0.251
 indo+0.249
Neg logit contributions
umber-0.271
ANGE-0.268
Oracle-0.266
 Banks-0.259
 Pompe-0.256
Token ridiculous
Token ID11441
Feature activation+5.38e-01

Pos logit contributions
 doct+1.289
 toug+1.283
 communal+1.262
 indo+1.240
 grips+1.236
Neg logit contributions
umber-1.575
ANGE-1.559
Oracle-1.541
 Banks-1.505
 Pompe-1.484
Token.
Token ID13
Feature activation-4.69e-01

Pos logit contributions
 toug+0.430
 doct+0.429
 communal+0.424
 grips+0.422
 whis+0.420
Neg logit contributions
ANGE-0.348
umber-0.348
Oracle-0.342
 Banks-0.334
 Pompe-0.329
Token It
Token ID632
Feature activation-8.20e-01

Pos logit contributions
umber+0.321
ANGE+0.317
Oracle+0.311
 Banks+0.299
 Pompe+0.297
Neg logit contributions
 doct-0.358
 toug-0.357
 communal-0.350
 grips-0.349
 indo-0.346
Token can
Token ID460
Feature activation+2.87e-01

Pos logit contributions
 toug+0.014
 doct+0.014
 communal+0.014
 grips+0.014
 livest+0.014
Neg logit contributions
ANGE-0.011
umber-0.011
Oracle-0.011
 Banks-0.011
 Pompe-0.010
Token basically
Token ID6209
Feature activation+8.45e-01

Pos logit contributions
 toug+0.215
 doct+0.213
 grips+0.210
 communal+0.210
 livest+0.208
Neg logit contributions
ANGE-0.202
umber-0.201
Oracle-0.199
 Banks-0.193
 Pompe-0.191
Token just
Token ID655
Feature activation+2.09e-01

Pos logit contributions
 toug+0.602
 doct+0.595
 communal+0.585
 grips+0.584
 whis+0.578
Neg logit contributions
ANGE-0.631
umber-0.627
Oracle-0.624
 Banks-0.601
 Pompe-0.597
Token count
Token ID954
Feature activation-2.17e+00

Pos logit contributions
 toug+0.145
 doct+0.144
 communal+0.141
 grips+0.141
 livest+0.140
Neg logit contributions
umber-0.159
ANGE-0.159
Oracle-0.157
 Banks-0.153
 Pompe-0.151
Token those
Token ID883
Feature activation+1.38e+00

Pos logit contributions
umber+1.447
ANGE+1.432
Oracle+1.402
 Banks+1.378
 Pompe+1.370
Neg logit contributions
 toug-1.684
 doct-1.676
 livest-1.657
 indo-1.656
 communal-1.653
Token up
Token ID510
Feature activation-4.67e+00

Pos logit contributions
 toug+0.910
 doct+0.902
 communal+0.894
 grips+0.885
 whis+0.877
Neg logit contributions
umber-1.102
ANGE-1.102
Oracle-1.085
 Banks-1.052
 Pompe-1.042
Token and
Token ID290
Feature activation-8.27e-01

Pos logit contributions
umber+2.963
ANGE+2.924
Oracle+2.852
 Banks+2.803
 Pompe+2.761
Neg logit contributions
 toug-3.763
 doct-3.748
 livest-3.722
 indo-3.706
 grips-3.699
Token figure
Token ID3785
Feature activation-9.21e-01

Pos logit contributions
umber+0.594
ANGE+0.592
Oracle+0.584
 Banks+0.568
 Pompe+0.563
Neg logit contributions
 toug-0.605
 doct-0.602
 communal-0.594
 grips-0.593
 livest-0.589
Token out
Token ID503
Feature activation-1.91e+00

Pos logit contributions
umber+0.549
ANGE+0.546
Oracle+0.534
 Banks+0.520
 Pompe+0.514
Neg logit contributions
 toug-0.783
 doct-0.780
 communal-0.772
 grips-0.770
 livest-0.768
Token how
Token ID703
Feature activation-1.51e+00

Pos logit contributions
umber+1.353
ANGE+1.345
Oracle+1.320
 Banks+1.295
 Pompe+1.283
Neg logit contributions
 toug-1.403
 doct-1.398
 communal-1.380
 livest-1.376
 grips-1.375
Token long
Token ID890
Feature activation+2.84e+00

Pos logit contributions
umber+1.100
ANGE+1.095
Oracle+1.076
 Banks+1.052
 Pompe+1.042
Neg logit contributions
 toug-1.087
 doct-1.082
 communal-1.068
 grips-1.064
 livest-1.059
Token she
Token ID673
Feature activation-2.41e+00

Pos logit contributions
 toug+0.344
 doct+0.342
 communal+0.339
 grips+0.336
 livest+0.334
Neg logit contributions
umber-0.341
ANGE-0.339
Oracle-0.335
 Banks-0.324
 Pompe-0.322
Token advised
Token ID13030
Feature activation+1.30e-04

Pos logit contributions
umber+1.589
ANGE+1.582
Oracle+1.551
 Banks+1.512
 Pompe+1.495
Neg logit contributions
 toug-1.909
 doct-1.898
 communal-1.873
 grips-1.872
 livest-1.864
Token she
Token ID673
Feature activation-1.28e+00

Pos logit contributions
 toug+0.000
 doct+0.000
 communal+0.000
 grips+0.000
 livest+0.000
Neg logit contributions
umber-0.000
ANGE-0.000
Oracle-0.000
 Banks-0.000
 Pompe-0.000
Token will
Token ID481
Feature activation-1.12e-01

Pos logit contributions
umber+0.881
ANGE+0.878
Oracle+0.862
 Banks+0.840
 Pompe+0.833
Neg logit contributions
 toug-0.974
 doct-0.968
 grips-0.953
 communal-0.953
 livest-0.950
Token also
Token ID635
Feature activation-3.04e+00

Pos logit contributions
umber+0.086
ANGE+0.086
Oracle+0.085
 Banks+0.083
 Pompe+0.082
Neg logit contributions
 toug-0.078
 doct-0.076
 communal-0.075
 grips-0.075
 livest-0.075
Token hear
Token ID3285
Feature activation-4.84e+00

Pos logit contributions
umber+2.297
ANGE+2.283
Oracle+2.239
 Banks+2.206
 Pompe+2.187
Neg logit contributions
 doct-2.079
 toug-2.070
 communal-2.050
 grips-2.043
 indo-2.041
Token the
Token ID262
Feature activation-1.39e+00

Pos logit contributions
umber+3.350
ANGE+3.346
Oracle+3.260
 Banks+3.248
 Pompe+3.203
Neg logit contributions
 toug-3.573
 doct-3.555
 indo-3.500
 grips-3.498
 communal-3.495
Token defense
Token ID3761
Feature activation-8.81e-01

Pos logit contributions
umber+1.085
ANGE+1.080
Oracle+1.067
 Banks+1.034
 Pompe+1.030
Neg logit contributions
 toug-0.927
 doct-0.922
 communal-0.912
 grips-0.907
 livest-0.900
Token
Token ID447
Feature activation-4.57e-01

Pos logit contributions
umber+0.651
ANGE+0.649
Oracle+0.640
 Banks+0.623
 Pompe+0.618
Neg logit contributions
 toug-0.624
 doct-0.622
 communal-0.614
 grips-0.612
 indo-0.609
Token
Token ID247
Feature activation+3.02e-02

Pos logit contributions
ANGE+0.275
umber+0.275
Oracle+0.271
 Banks+0.263
 Pompe+0.261
Neg logit contributions
 toug-0.382
 doct-0.380
 communal-0.379
 grips-0.376
 recip-0.374
Tokens
Token ID82
Feature activation+9.29e-01

Pos logit contributions
 toug+0.026
 doct+0.026
 communal+0.026
 grips+0.026
 livest+0.026
Neg logit contributions
umber-0.017
ANGE-0.017
Oracle-0.017
 Banks-0.017
 Pompe-0.016