TOP ACTIVATIONS
MAX = 18.678

S pen cer said that ads pretending
in . S par ks , a Cohn prot
expelled . S por adic gunfire rocked the new
from forc ings ( S pen cer & Bras well ,
episode ? Share in the comments below ! Can
Let us know in the comments below …. <|endoftext|> D ish
up . For more from The Nation , check out our
this ? Share in the comments ! <|endoftext|> LG has cancelled
the public service , but who outside , or even inside
Bomb ing Bit & S per mb owl 96 - D
permission . <|endoftext|> The S ver ig es Ri ks bank
Email This Story Send email to this address Enter Your Name
you 'd recommend in the comments . <|endoftext|> Red Transit L
Let us know in the comments below . <|endoftext|> From John
v ous , le S é nat Imp é rial ,
, shalt thou take unto thy self . De uter onomy
female crowd . Who do you think should be
te legraph . co . uk / business / analysis /
or traffic controlling devices . Who is more important , bicycl
00 pm GMT ( GMT + 00 : 00 ) (

MIN ACTIVATIONS
MIN = -11.855

option ), Dwight Powell , Pet ter i Kop onen (
. Ren aud Lav o ie (@ ren lav o
, if Y onge - E gl inton is to become
ER , THIS W ON ' T WORK * The watch
She pulls away , hits my shoulder softly and responds ,
email below . : To prove just how easy it is
Ki pling subway station in Et obic oke .
f 4 b be 66 power off Notice how
lady , he is a disillusion ed Irish revolutionary who has
versus Kh ab ib Nur mag omed ov which was
have been reported in the Chand pai region 's channels .
ISI P el leg r ini , A .
example , the Hotel Vol can on Lan zar ote costs
of what follows . King te ases , however , that
Simon [ M ign o let ] came back in
bitters weet one . A bitters weet frenzy ? Sure ,
bench Brian Al ain u ' uese returns from suspension and
try and collect the Cur ric ulum Vit ae of everyone
than current Rapt or 's 411 - horse power , 4
): _ P ulse Amount (" P ulse amount

INTERVAL 11.045 - 18.678
CONTAINS 0.004%

Email This Story Send email to this address Enter Your Name
Let us know in the comments below . <|endoftext|> From John
in . S par ks , a Cohn prot
te legraph . co . uk / business / analysis /
Let us know in the comments below …. <|endoftext|> D ish

INTERVAL 3.412 - 11.045
CONTAINS 6.374%

I hope he does . I 'm just not optimistic given
* lot * more populism - and not just during election
solve the Pittsburgh Penguins expansion draft problem ? If you believe
past week referring to the perpetrators as sav ages . Which
. org / index . php ? topic = 28 18

INTERVAL -4.222 - 3.412
CONTAINS 90.348%

Em anski - esque back - to - back - to
a very detailed way to diagnose what is actually keeping your
The production company has stayed tight l ipped about both titles
country and Puerto Rico getting the testing that s
a $ 200 , 000 sever ance after retiring from a

INTERVAL -11.855 - -4.222
CONTAINS 3.274%

1 // metres _ P ulse Per iod ( " P
the U . S . government determine the social costs of
of this contributes to rising seas because of course ,
How about just in the Ruby community ?
and 48 injured , President Trump responded to the tragedy by
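
NOTE ON THE INTERVALS
The four INTERVAL ranges above look like equal-width bins spanning MIN to MAX, each roughly 7.633 wide, and their CONTAINS percentages sum to about 100%. The sketch below checks both observations against the printed values. It assumes equal-width binning; the dashboard's own binning code is not shown in this dump, and `equal_width_edges` is a hypothetical helper introduced only for illustration.

```python
# Sketch only: assumes the dashboard splits [MIN, MAX] into equal-width bins.
# The helper name and the 4-bin count are inferred from the dump, not confirmed.

def equal_width_edges(lo, hi, n_bins):
    """Return the n_bins + 1 edges of equal-width bins covering [lo, hi]."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]

# Values copied from the dump (already rounded to 3 decimals there).
MIN_ACT = -11.855
MAX_ACT = 18.678
printed_edges = [-11.855, -4.222, 3.412, 11.045, 18.678]
printed_percent = [3.274, 90.348, 6.374, 0.004]  # lowest interval first

edges = equal_width_edges(MIN_ACT, MAX_ACT, 4)

# Largest gap between a computed edge and the printed one; small gaps are
# expected because MIN and MAX are themselves rounded in the dump.
max_gap = max(abs(a - b) for a, b in zip(edges, printed_edges))
total_percent = sum(printed_percent)

print("computed edges:", edges)
print("largest gap vs printed edges:", max_gap)
print("sum of CONTAINS percentages:", total_percent)
```

If the largest gap stays within rounding error, the equal-width reading is consistent with the dump. It does not prove that this is how the bins were produced.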
Token
Token ID447
Feature activation-1.09e+00

Pos logit contributions
��+0.095
�士+0.086
+0.078
NT+0.078
 welf+0.075
Neg logit contributions
 IMAGES-0.100
iph-0.100
kefeller-0.089
bley-0.087
pees-0.085
Token
Token ID251
Feature activation-3.80e+00

Pos logit contributions
��+0.620
�士+0.542
+0.474
NT+0.456
��+0.452
Neg logit contributions
 IMAGES-1.091
iph-1.082
kefeller-0.988
bley-0.967
 STATES-0.955
Token
Token ID198
Feature activation-5.73e-02

Pos logit contributions
��+2.714
�士+2.418
NT+2.216
+2.203
 Ll+2.135
Neg logit contributions
iph-3.136
 IMAGES-3.119
kefeller-2.890
bley-2.768
pees-2.734
Token
Token ID198
Feature activation+1.25e+00

Pos logit contributions
��+0.040
�士+0.035
NT+0.033
+0.033
Associated+0.032
Neg logit contributions
 IMAGES-0.049
iph-0.048
kefeller-0.044
bley-0.043
pees-0.042
TokenS
Token ID50
Feature activation+7.64e+00

Pos logit contributions
 IMAGES+0.938
iph+0.932
kefeller+0.817
bley+0.800
 STATES+0.784
Neg logit contributions
��-1.018
�士-0.929
-0.842
NT-0.836
 welf-0.824
Tokenpen
Token ID3617
Feature activation+1.87e+01

Pos logit contributions
iph+4.965
 IMAGES+4.678
kefeller+4.062
pees+4.020
bley+4.012
Neg logit contributions
��-7.050
�士-6.600
-6.108
��-5.966
 welf-5.964
Tokencer
Token ID2189
Feature activation+4.42e+00

Pos logit contributions
iph+8.535
 IMAGES+7.785
agus+7.062
kes+6.829
bley+6.532
Neg logit contributions
�士-19.535
��-18.900
��-17.736
 welf-17.704
-17.048
Token said
Token ID531
Feature activation+5.03e-01

Pos logit contributions
 IMAGES+3.160
iph+3.150
bley+2.710
kefeller+2.696
 STATES+2.670
Neg logit contributions
��-3.694
�士-3.420
-3.073
NT-3.067
 welf-3.056
Token that
Token ID326
Feature activation+7.09e-01

Pos logit contributions
 IMAGES+0.404
iph+0.402
kefeller+0.357
bley+0.349
 STATES+0.342
Neg logit contributions
��-0.382
�士-0.346
-0.311
NT-0.310
 welf-0.303
Token ads
Token ID9011
Feature activation-2.22e-01

Pos logit contributions
 IMAGES+0.566
iph+0.562
kefeller+0.500
bley+0.488
 STATES+0.479
Neg logit contributions
��-0.542
�士-0.491
-0.443
NT-0.441
 welf-0.432
Token pretending
Token ID23662
Feature activation-9.06e-01

Pos logit contributions
��+0.170
�士+0.154
+0.140
NT+0.139
 welf+0.136
Neg logit contributions
 IMAGES-0.176
iph-0.175
kefeller-0.156
bley-0.152
 STATES-0.150
Tokenin
Token ID259
Feature activation+2.26e+00

Pos logit contributions
��+0.122
�士+0.109
NT+0.101
+0.101
 Ll+0.097
Neg logit contributions
 IMAGES-0.125
iph-0.121
kefeller-0.109
bley-0.105
 STATES-0.104
Token.
Token ID13
Feature activation-8.31e-02

Pos logit contributions
iph+1.619
 IMAGES+1.618
kefeller+1.408
bley+1.383
 STATES+1.347
Neg logit contributions
��-1.894
�士-1.732
-1.573
NT-1.568
 welf-1.551
Token
Token ID198
Feature activation+1.18e+00

Pos logit contributions
��+0.062
�士+0.056
+0.050
NT+0.050
 Ll+0.049
Neg logit contributions
iph-0.068
 IMAGES-0.068
kefeller-0.061
bley-0.059
pees-0.058
Token
Token ID198
Feature activation+1.90e+00

Pos logit contributions
 IMAGES+0.973
iph+0.966
kefeller+0.859
bley+0.843
 STATES+0.827
Neg logit contributions
��-0.876
�士-0.790
-0.710
NT-0.707
 welf-0.691
TokenS
Token ID50
Feature activation+5.42e+00

Pos logit contributions
 IMAGES+1.472
iph+1.462
kefeller+1.286
bley+1.262
 STATES+1.238
Neg logit contributions
��-1.494
�士-1.360
-1.227
NT-1.217
 welf-1.201
Tokenpar
Token ID1845
Feature activation+1.50e+01

Pos logit contributions
iph+3.116
 IMAGES+3.040
kefeller+2.548
bley+2.496
agus+2.442
Neg logit contributions
��-5.390
�士-5.032
-4.602
��-4.599
catentry-4.553
Tokenks
Token ID591
Feature activation-6.78e-01

Pos logit contributions
 IMAGES+9.105
iph+9.064
agus+8.194
bley+8.029
kes+7.891
Neg logit contributions
��-13.744
�士-13.724
��-12.692
 welf-12.029
-11.714
Token,
Token ID11
Feature activation+1.72e+00

Pos logit contributions
��+0.518
�士+0.469
+0.426
NT+0.420
 Ll+0.413
Neg logit contributions
 IMAGES-0.542
iph-0.534
kefeller-0.479
bley-0.463
pees-0.454
Token a
Token ID257
Feature activation+2.55e+00

Pos logit contributions
 IMAGES+1.388
iph+1.378
kefeller+1.222
bley+1.198
 STATES+1.175
Neg logit contributions
��-1.298
�士-1.175
-1.057
NT-1.049
 welf-1.031
Token Cohn
Token ID45005
Feature activation-7.97e-01

Pos logit contributions
 IMAGES+1.983
iph+1.970
kefeller+1.737
bley+1.701
 STATES+1.669
Neg logit contributions
��-2.011
�士-1.830
-1.651
NT-1.642
��-1.612
Token prot
Token ID1237
Feature activation+1.11e-02

Pos logit contributions
��+0.633
�士+0.580
+0.524
NT+0.523
 welf+0.515
Neg logit contributions
 IMAGES-0.616
iph-0.601
kefeller-0.529
bley-0.522
pees-0.515
Token expelled
Token ID27307
Feature activation-2.36e+00

Pos logit contributions
��+1.384
�士+1.238
+1.132
NT+1.126
 Ll+1.087
Neg logit contributions
 IMAGES-1.423
iph-1.388
kefeller-1.250
bley-1.200
 STATES-1.183
Token.
Token ID13
Feature activation+2.01e+00

Pos logit contributions
��+1.743
�士+1.535
+1.422
NT+1.401
 Ll+1.363
Neg logit contributions
 IMAGES-1.938
iph-1.894
kefeller-1.729
bley-1.658
 STATES-1.638
Token
Token ID198
Feature activation-2.44e+00

Pos logit contributions
 IMAGES+1.622
iph+1.609
kefeller+1.416
bley+1.395
 STATES+1.373
Neg logit contributions
��-1.517
�士-1.377
-1.236
NT-1.222
 welf-1.213
Token
Token ID198
Feature activation-2.10e-01

Pos logit contributions
��+1.564
NT+1.311
�士+1.295
Associated+1.259
+1.248
Neg logit contributions
 IMAGES-2.170
iph-2.149
kefeller-1.997
bley-1.938
pees-1.900
TokenS
Token ID50
Feature activation+2.75e+00

Pos logit contributions
��+0.162
�士+0.145
NT+0.133
+0.132
Associated+0.129
Neg logit contributions
 IMAGES-0.168
iph-0.165
kefeller-0.149
bley-0.144
 STATES-0.142
Tokenpor
Token ID1819
Feature activation+1.45e+01

Pos logit contributions
 IMAGES+2.548
iph+2.547
kefeller+2.285
bley+2.251
pees+2.211
Neg logit contributions
��-1.750
�士-1.563
-1.363
NT-1.335
��-1.329
Tokenadic
Token ID23876
Feature activation+9.28e-01

Pos logit contributions
iph+7.837
 IMAGES+7.549
agus+6.283
bley+6.197
pees+6.072
Neg logit contributions
��-14.063
�士-13.561
��-12.572
 welf-12.212
-12.106
Token gunfire
Token ID29276
Feature activation+1.84e+00

Pos logit contributions
 IMAGES+0.732
iph+0.726
kefeller+0.641
bley+0.629
 STATES+0.617
Neg logit contributions
��-0.721
�士-0.655
-0.590
NT-0.586
 welf-0.576
Token rocked
Token ID36872
Feature activation-1.06e+00

Pos logit contributions
 IMAGES+1.387
iph+1.379
kefeller+1.193
bley+1.185
 STATES+1.161
Neg logit contributions
��-1.488
�士-1.364
NT-1.222
-1.221
��-1.215
Token the
Token ID262
Feature activation-3.95e-01

Pos logit contributions
��+0.791
�士+0.717
+0.644
NT+0.643
 welf+0.631
Neg logit contributions
 IMAGES-0.853
iph-0.847
kefeller-0.756
bley-0.735
 STATES-0.726
Token new
Token ID649
Feature activation-1.11e+00

Pos logit contributions
��+0.303
�士+0.276
+0.249
NT+0.248
 welf+0.244
Neg logit contributions
 IMAGES-0.312
iph-0.310
kefeller-0.275
bley-0.267
pees-0.263
Token from
Token ID422
Feature activation-1.08e+00

Pos logit contributions
 IMAGES+0.054
iph+0.053
kefeller+0.047
bley+0.046
pees+0.045
Neg logit contributions
��-0.050
�士-0.045
-0.041
NT-0.041
 welf-0.040
Token forc
Token ID24686
Feature activation+6.87e-01

Pos logit contributions
��+0.820
�士+0.734
+0.674
NT+0.671
 welf+0.665
Neg logit contributions
 IMAGES-0.866
iph-0.857
kefeller-0.764
bley-0.750
pees-0.737
Tokenings
Token ID654
Feature activation+3.92e+00

Pos logit contributions
 IMAGES+0.448
iph+0.437
kefeller+0.378
bley+0.369
 STATES+0.360
Neg logit contributions
��-0.628
�士-0.572
NT-0.532
-0.531
 welf-0.520
Token (
Token ID357
Feature activation+3.01e+00

Pos logit contributions
iph+2.971
 IMAGES+2.967
kefeller+2.582
bley+2.547
 STATES+2.520
Neg logit contributions
��-3.135
�士-2.887
-2.568
��-2.543
NT-2.542
TokenS
Token ID50
Feature activation+2.18e+00

Pos logit contributions
iph+2.250
 IMAGES+2.246
kefeller+1.948
bley+1.924
 STATES+1.881
Neg logit contributions
��-2.452
�士-2.247
-2.023
��-1.994
NT-1.987
Tokenpen
Token ID3617
Feature activation+1.36e+01

Pos logit contributions
 IMAGES+1.710
iph+1.705
kefeller+1.500
bley+1.470
 STATES+1.443
Neg logit contributions
��-1.696
�士-1.541
-1.388
NT-1.374
��-1.357
Tokencer
Token ID2189
Feature activation+4.58e+00

Pos logit contributions
iph+7.752
 IMAGES+7.395
agus+6.086
kefeller+6.083
kes+5.959
Neg logit contributions
��-12.629
�士-12.126
��-10.904
catentry-10.836
 welf-10.824
Token &
Token ID1222
Feature activation+2.67e+00

Pos logit contributions
iph+3.304
 IMAGES+3.301
kefeller+2.853
bley+2.803
 STATES+2.753
Neg logit contributions
��-3.837
�士-3.521
-3.164
NT-3.131
��-3.119
Token Bras
Token ID39452
Feature activation+2.09e+00

Pos logit contributions
 IMAGES+2.106
iph+2.090
kefeller+1.841
bley+1.806
 STATES+1.778
Neg logit contributions
��-2.071
�士-1.879
-1.691
NT-1.673
��-1.663
Tokenwell
Token ID4053
Feature activation+2.68e+00

Pos logit contributions
 IMAGES+1.576
iph+1.562
kefeller+1.371
bley+1.343
 STATES+1.317
Neg logit contributions
��-1.702
�士-1.551
-1.407
NT-1.399
 welf-1.376
Token,
Token ID11
Feature activation+6.97e-01

Pos logit contributions
 IMAGES+1.991
iph+1.976
kefeller+1.727
bley+1.693
 STATES+1.660
Neg logit contributions
��-2.206
�士-2.016
-1.827
NT-1.814
 welf-1.788
Token episode
Token ID4471
Feature activation+2.00e+00

Pos logit contributions
iph+1.923
 IMAGES+1.917
kefeller+1.562
bley+1.537
 STATES+1.508
Neg logit contributions
��-3.375
�士-3.150
-2.897
NT-2.881
��-2.853
Token?
Token ID30
Feature activation-2.20e+00

Pos logit contributions
iph+1.446
 IMAGES+1.444
kefeller+1.238
bley+1.223
 STATES+1.199
Neg logit contributions
��-1.687
�士-1.546
-1.398
NT-1.387
 welf-1.379
Token Share
Token ID8734
Feature activation+4.23e+00

Pos logit contributions
��+1.472
�士+1.318
+1.189
NT+1.182
 Ll+1.149
Neg logit contributions
iph-1.925
 IMAGES-1.920
kefeller-1.770
bley-1.719
pees-1.665
Token in
Token ID287
Feature activation+1.05e+00

Pos logit contributions
 IMAGES+3.272
iph+3.246
kefeller+2.793
bley+2.766
 STATES+2.731
Neg logit contributions
��-3.347
�士-3.062
-2.744
��-2.696
NT-2.675
Token the
Token ID262
Feature activation+3.24e+00

Pos logit contributions
 IMAGES+0.815
iph+0.809
kefeller+0.709
bley+0.697
 STATES+0.684
Neg logit contributions
��-0.826
�士-0.751
-0.678
NT-0.673
��-0.663
Token comments
Token ID3651
Feature activation+1.36e+01

Pos logit contributions
 IMAGES+2.078
iph+2.067
kefeller+1.713
bley+1.700
 STATES+1.680
Neg logit contributions
��-3.010
�士-2.768
-2.536
NT-2.511
��-2.502
Token below
Token ID2174
Feature activation+6.25e+00

Pos logit contributions
iph+7.314
 IMAGES+6.972
 STATES+5.609
bley+5.557
agus+5.517
Neg logit contributions
��-12.825
�士-12.068
-10.917
��-10.774
catentry-10.699
Token!
Token ID0
Feature activation+3.39e+00

Pos logit contributions
iph+4.464
 IMAGES+4.445
bley+3.730
 STATES+3.686
kefeller+3.661
Neg logit contributions
��-5.258
�士-4.884
-4.339
NT-4.266
��-4.263
Token
Token ID198
Feature activation+2.32e+00

Pos logit contributions
 IMAGES+2.506
iph+2.482
kefeller+2.113
bley+2.097
 STATES+2.073
Neg logit contributions
��-2.821
�士-2.585
��-2.308
-2.304
 welf-2.301
Token
Token ID198
Feature activation+3.89e+00

Pos logit contributions
 IMAGES+1.871
iph+1.856
kefeller+1.629
bley+1.604
 STATES+1.584
Neg logit contributions
��-1.778
�士-1.630
-1.445
��-1.427
 welf-1.427
TokenCan
Token ID6090
Feature activation+6.45e+00

Pos logit contributions
iph+3.068
 IMAGES+3.026
kefeller+2.620
bley+2.613
pees+2.555
Neg logit contributions
��-3.026
�士-2.796
 welf-2.461
-2.455
��-2.453
Token Let
Token ID3914
Feature activation+4.77e+00

Pos logit contributions
 IMAGES+2.301
iph+2.274
kefeller+1.973
bley+1.949
 STATES+1.930
Neg logit contributions
��-2.231
�士-2.030
-1.805
��-1.796
 welf-1.789
Token us
Token ID514
Feature activation+6.65e+00

Pos logit contributions
 IMAGES+3.189
iph+3.181
bley+2.634
kefeller+2.634
 STATES+2.593
Neg logit contributions
��-4.206
�士-3.878
 welf-3.548
NT-3.498
-3.493
Token know
Token ID760
Feature activation+8.59e+00

Pos logit contributions
 IMAGES+4.676
iph+4.672
bley+4.013
 STATES+3.980
kefeller+3.950
Neg logit contributions
��-5.662
�士-5.187
-4.668
��-4.617
 welf-4.540
Token in
Token ID287
Feature activation+7.49e+00

Pos logit contributions
 IMAGES+5.788
iph+5.730
 STATES+4.724
bley+4.666
agus+4.603
Neg logit contributions
��-7.429
�士-6.984
-6.271
��-6.187
catentry-6.181
Token the
Token ID262
Feature activation+5.55e+00

Pos logit contributions
 IMAGES+5.213
iph+5.127
 STATES+4.203
bley+4.193
kefeller+4.168
Neg logit contributions
��-6.450
�士-5.977
��-5.345
NT-5.332
-5.329
Token comments
Token ID3651
Feature activation+1.28e+01

Pos logit contributions
 IMAGES+3.460
iph+3.415
 STATES+2.796
kefeller+2.783
bley+2.773
Neg logit contributions
��-5.228
�士-4.835
-4.417
��-4.406
NT-4.365
Token below
Token ID2174
Feature activation+7.61e+00

Pos logit contributions
iph+6.782
 IMAGES+6.567
 STATES+5.238
bley+5.125
agus+5.111
Neg logit contributions
��-12.148
�士-11.440
-10.501
 welf-10.212
��-10.184
Token….
Token ID11580
Feature activation+1.27e+00

Pos logit contributions
iph+5.140
 IMAGES+5.095
bley+4.194
 STATES+4.191
agus+4.139
Neg logit contributions
��-6.578
�士-6.157
-5.483
 welf-5.438
��-5.418
Token<|endoftext|>
Token ID50256
Feature activation+4.58e-01

Pos logit contributions
 IMAGES+0.926
iph+0.923
kefeller+0.797
bley+0.784
 STATES+0.767
Neg logit contributions
��-1.061
�士-0.973
-0.878
NT-0.871
 welf-0.865
TokenD
Token ID35
Feature activation-1.78e+00

Pos logit contributions
 IMAGES+0.360
iph+0.356
kefeller+0.315
bley+0.309
 STATES+0.303
Neg logit contributions
��-0.357
�士-0.324
-0.293
NT-0.292
 welf-0.285
Tokenish
Token ID680
Feature activation+6.83e-01

Pos logit contributions
��+1.386
�士+1.250
NT+1.156
+1.137
 welf+1.105
Neg logit contributions
 IMAGES-1.395
iph-1.362
kefeller-1.221
bley-1.184
 STATES-1.178
Token up
Token ID510
Feature activation-2.56e+00

Pos logit contributions
��+0.958
�士+0.821
NT+0.788
+0.776
────+0.763
Neg logit contributions
 IMAGES-1.147
iph-1.127
kefeller-1.026
bley-0.995
pees-0.975
Token.
Token ID13
Feature activation-2.00e+00

Pos logit contributions
��+1.798
NT+1.552
�士+1.508
Associated+1.496
────+1.487
Neg logit contributions
 IMAGES-2.131
iph-2.062
kefeller-1.925
bley-1.866
pees-1.818
Token For
Token ID1114
Feature activation+5.40e-01

Pos logit contributions
��+1.210
�士+1.017
NT+0.954
+0.945
 Ll+0.918
Neg logit contributions
iph-1.889
 IMAGES-1.876
kefeller-1.762
bley-1.712
pees-1.666
Token more
Token ID517
Feature activation-1.12e+00

Pos logit contributions
 IMAGES+0.469
iph+0.466
kefeller+0.417
bley+0.410
 STATES+0.403
Neg logit contributions
��-0.375
�士-0.333
-0.300
NT-0.299
 welf-0.290
Token from
Token ID422
Feature activation+2.62e+00

Pos logit contributions
��+0.794
�士+0.683
+0.649
NT+0.646
Associated+0.626
Neg logit contributions
 IMAGES-0.952
iph-0.942
kefeller-0.848
bley-0.831
pees-0.809
Token The
Token ID383
Feature activation+1.22e+01

Pos logit contributions
 IMAGES+2.098
iph+2.078
kefeller+1.830
bley+1.792
 STATES+1.778
Neg logit contributions
��-2.002
�士-1.833
-1.629
 welf-1.610
��-1.608
Token Nation
Token ID8741
Feature activation+4.67e+00

Pos logit contributions
 IMAGES+5.483
iph+4.949
 STATES+4.082
kefeller+3.930
ydia+3.804
Neg logit contributions
�士-12.927
��-12.856
 welf-11.736
��-11.555
 gobl-11.130
Token,
Token ID11
Feature activation-2.30e+00

Pos logit contributions
 IMAGES+3.388
iph+3.383
kefeller+2.882
bley+2.875
 STATES+2.834
Neg logit contributions
��-3.935
�士-3.698
 welf-3.277
��-3.237
-3.226
Token check
Token ID2198
Feature activation-2.15e+00

Pos logit contributions
��+1.071
�士+0.861
+0.814
NT+0.814
────+0.791
Neg logit contributions
 IMAGES-2.463
iph-2.433
kefeller-2.228
bley-2.193
 STATES-2.159
Token out
Token ID503
Feature activation-1.21e+00

Pos logit contributions
��+1.542
�士+1.305
+1.261
NT+1.251
 Ll+1.205
Neg logit contributions
 IMAGES-1.793
iph-1.761
kefeller-1.609
bley-1.595
pees-1.543
Token our
Token ID674
Feature activation+6.57e-01

Pos logit contributions
��+0.823
�士+0.705
NT+0.662
+0.655
 Ll+0.636
Neg logit contributions
 IMAGES-1.057
iph-1.050
kefeller-0.943
bley-0.937
pees-0.914
Token this
Token ID428
Feature activation+2.64e-01

Pos logit contributions
 IMAGES+1.553
iph+1.542
kefeller+1.369
bley+1.345
 STATES+1.322
Neg logit contributions
��-1.379
�士-1.247
-1.114
NT-1.106
��-1.087
Token?
Token ID30
Feature activation+1.33e+00

Pos logit contributions
 IMAGES+0.208
iph+0.206
kefeller+0.183
bley+0.178
 STATES+0.176
Neg logit contributions
��-0.204
�士-0.184
NT-0.167
-0.167
 welf-0.164
Token Share
Token ID8734
Feature activation+4.41e+00

Pos logit contributions
 IMAGES+1.098
iph+1.089
kefeller+0.962
bley+0.947
 STATES+0.932
Neg logit contributions
��-0.990
�士-0.896
-0.800
NT-0.794
 welf-0.784
Token in
Token ID287
Feature activation+1.27e+00

Pos logit contributions
 IMAGES+3.687
iph+3.655
kefeller+3.171
bley+3.143
 STATES+3.130
Neg logit contributions
��-3.225
�士-2.943
-2.600
��-2.554
NT-2.527
Token the
Token ID262
Feature activation+1.25e+00

Pos logit contributions
 IMAGES+0.986
iph+0.978
kefeller+0.856
bley+0.843
 STATES+0.828
Neg logit contributions
��-1.002
�士-0.912
-0.822
NT-0.816
��-0.805
Token comments
Token ID3651
Feature activation+1.21e+01

Pos logit contributions
 IMAGES+0.842
iph+0.835
kefeller+0.719
bley+0.702
 STATES+0.687
Neg logit contributions
��-1.122
�士-1.032
-0.945
NT-0.940
 welf-0.927
Token!
Token ID0
Feature activation+2.36e+00

Pos logit contributions
iph+7.072
 IMAGES+6.846
 STATES+5.626
agus+5.567
bley+5.554
Neg logit contributions
��-10.951
�士-10.226
-9.246
��-9.103
 welf-9.069
Token<|endoftext|>
Token ID50256
Feature activation+5.17e-01

Pos logit contributions
 IMAGES+1.723
iph+1.707
kefeller+1.466
bley+1.446
 STATES+1.422
Neg logit contributions
��-1.984
�士-1.824
-1.638
��-1.628
NT-1.628
TokenLG
Token ID41257
Feature activation+1.16e+00

Pos logit contributions
 IMAGES+0.406
iph+0.403
kefeller+0.356
bley+0.349
 STATES+0.342
Neg logit contributions
��-0.403
�士-0.366
-0.331
NT-0.329
 welf-0.322
Token has
Token ID468
Feature activation+1.15e+00

Pos logit contributions
 IMAGES+0.934
iph+0.927
kefeller+0.825
bley+0.807
 STATES+0.790
Neg logit contributions
��-0.881
�士-0.798
-0.718
NT-0.715
 welf-0.700
Token cancelled
Token ID16769
Feature activation-1.59e-01

Pos logit contributions
 IMAGES+0.889
iph+0.882
kefeller+0.778
bley+0.762
 STATES+0.746
Neg logit contributions
��-0.906
�士-0.824
-0.744
NT-0.741
 welf-0.728
Token the
Token ID262
Feature activation+1.23e+00

Pos logit contributions
 IMAGES+2.684
iph+2.661
kefeller+2.327
bley+2.275
 STATES+2.250
Neg logit contributions
��-2.869
�士-2.672
-2.365
��-2.340
NT-2.332
Token public
Token ID1171
Feature activation+6.26e-01

Pos logit contributions
 IMAGES+0.903
iph+0.896
kefeller+0.784
bley+0.767
 STATES+0.751
Neg logit contributions
��-1.029
�士-0.939
-0.855
NT-0.851
 welf-0.836
Token service
Token ID2139
Feature activation+3.71e+00

Pos logit contributions
 IMAGES+0.594
iph+0.590
kefeller+0.535
bley+0.525
 STATES+0.517
Neg logit contributions
��-0.383
�士-0.335
-0.297
NT-0.295
 welf-0.287
Token,
Token ID11
Feature activation+4.42e+00

Pos logit contributions
 IMAGES+2.670
iph+2.653
kefeller+2.269
bley+2.252
 STATES+2.207
Neg logit contributions
��-3.105
�士-2.876
-2.588
NT-2.555
��-2.528
Token but
Token ID475
Feature activation+1.84e+00

Pos logit contributions
iph+2.944
 IMAGES+2.939
kefeller+2.502
bley+2.468
pees+2.403
Neg logit contributions
��-3.889
�士-3.600
-3.284
NT-3.266
��-3.233
Token who
Token ID508
Feature activation+1.20e+01

Pos logit contributions
 IMAGES+1.443
iph+1.433
kefeller+1.263
bley+1.239
 STATES+1.216
Neg logit contributions
��-1.441
�士-1.312
-1.182
NT-1.173
��-1.155
Token outside
Token ID2354
Feature activation-1.01e+00

Pos logit contributions
iph+8.208
 IMAGES+7.850
pees+6.736
bley+6.696
kes+6.552
Neg logit contributions
��-9.821
�士-9.004
��-8.104
catentry-8.019
 welf-7.873
Token,
Token ID11
Feature activation+3.82e+00

Pos logit contributions
��+0.740
�士+0.663
+0.602
NT+0.599
 welf+0.588
Neg logit contributions
 IMAGES-0.830
iph-0.828
kefeller-0.736
bley-0.727
 STATES-0.705
Token or
Token ID393
Feature activation+9.41e-01

Pos logit contributions
iph+2.995
 IMAGES+2.990
kefeller+2.636
bley+2.588
 STATES+2.544
Neg logit contributions
��-2.937
�士-2.676
-2.398
NT-2.376
��-2.359
Token even
Token ID772
Feature activation+1.72e+00

Pos logit contributions
 IMAGES+0.785
iph+0.780
kefeller+0.695
bley+0.681
 STATES+0.667
Neg logit contributions
��-0.687
�士-0.618
-0.555
NT-0.553
 welf-0.544
Token inside
Token ID2641
Feature activation+2.01e+00

Pos logit contributions
 IMAGES+1.391
iph+1.382
kefeller+1.225
bley+1.201
 STATES+1.179
Neg logit contributions
��-1.295
�士-1.172
-1.053
NT-1.047
 welf-1.029
Token Bomb
Token ID14476
Feature activation+7.12e-01

Pos logit contributions
 IMAGES+0.495
iph+0.490
kefeller+0.432
bley+0.423
 STATES+0.416
Neg logit contributions
��-0.523
�士-0.474
-0.431
NT-0.429
 welf-0.420
Tokening
Token ID278
Feature activation-1.04e+00

Pos logit contributions
 IMAGES+0.539
iph+0.535
kefeller+0.468
bley+0.459
 STATES+0.450
Neg logit contributions
��-0.576
�士-0.527
-0.476
NT-0.472
 welf-0.466
Token Bit
Token ID4722
Feature activation-3.85e+00

Pos logit contributions
��+0.827
�士+0.743
+0.680
NT+0.680
 Ll+0.662
Neg logit contributions
 IMAGES-0.800
iph-0.799
kefeller-0.712
bley-0.691
 STATES-0.676
Token &
Token ID1222
Feature activation+8.61e-01

Pos logit contributions
��+3.426
�士+3.095
+2.937
NT+2.916
 Ll+2.856
Neg logit contributions
 IMAGES-2.534
iph-2.460
kefeller-2.190
 STATES-2.081
bley-2.079
Token S
Token ID311
Feature activation+1.20e+00

Pos logit contributions
 IMAGES+0.691
iph+0.686
kefeller+0.607
bley+0.595
 STATES+0.584
Neg logit contributions
��-0.657
�士-0.595
-0.536
NT-0.532
 welf-0.523
Tokenper
Token ID525
Feature activation+1.18e+01

Pos logit contributions
 IMAGES+0.840
iph+0.839
kefeller+0.724
bley+0.708
 STATES+0.692
Neg logit contributions
��-1.038
�士-0.955
-0.869
NT-0.861
 welf-0.854
Tokenmb
Token ID2022
Feature activation-1.27e+00

Pos logit contributions
iph+8.791
 IMAGES+8.572
agus+7.658
bley+7.542
pees+7.427
Neg logit contributions
��-9.000
�士-8.430
catentry-7.467
 welf-7.408
��-7.317
Tokenowl
Token ID4883
Feature activation+3.11e+00

Pos logit contributions
��+1.021
�士+0.922
NT+0.852
+0.847
��+0.819
Neg logit contributions
 IMAGES-0.961
iph-0.938
kefeller-0.841
bley-0.807
 STATES-0.804
Token 96
Token ID9907
Feature activation+2.42e+00

Pos logit contributions
 IMAGES+2.303
iph+2.282
kefeller+1.973
bley+1.948
 STATES+1.900
Neg logit contributions
��-2.547
�士-2.355
-2.112
��-2.083
NT-2.082
Token-
Token ID12
Feature activation+3.20e+00

Pos logit contributions
 IMAGES+1.791
iph+1.763
kefeller+1.528
bley+1.505
 STATES+1.482
Neg logit contributions
��-1.997
�士-1.843
-1.660
NT-1.636
 welf-1.634
TokenD
Token ID35
Feature activation+1.83e+00

Pos logit contributions
 IMAGES+2.625
iph+2.617
kefeller+2.287
bley+2.252
 STATES+2.224
Neg logit contributions
��-2.367
�士-2.163
-1.918
 welf-1.905
��-1.892
Token permission
Token ID7170
Feature activation+1.94e+00

Pos logit contributions
 IMAGES+1.428
iph+1.416
kefeller+1.256
bley+1.232
 STATES+1.207
Neg logit contributions
��-1.345
�士-1.217
-1.096
NT-1.090
 welf-1.069
Token.
Token ID13
Feature activation-1.56e-01

Pos logit contributions
 IMAGES+1.389
iph+1.378
kefeller+1.194
bley+1.174
 STATES+1.151
Neg logit contributions
��-1.642
�士-1.508
-1.365
NT-1.358
 welf-1.349
Token<|endoftext|>
Token ID50256
Feature activation+5.92e-01

Pos logit contributions
��+0.133
�士+0.122
NT+0.112
+0.112
────+0.109
Neg logit contributions
 IMAGES-0.110
iph-0.110
kefeller-0.097
bley-0.094
pees-0.091
TokenThe
Token ID464
Feature activation+5.25e-01

Pos logit contributions
 IMAGES+0.464
iph+0.460
kefeller+0.407
bley+0.398
 STATES+0.391
Neg logit contributions
��-0.462
�士-0.419
-0.379
NT-0.377
 welf-0.370
Token S
Token ID311
Feature activation+1.59e+00

Pos logit contributions
 IMAGES+0.414
iph+0.411
kefeller+0.363
bley+0.356
 STATES+0.349
Neg logit contributions
��-0.407
�士-0.370
-0.334
NT-0.332
 welf-0.326
Tokenver
Token ID332
Feature activation+1.17e+01

Pos logit contributions
iph+1.273
 IMAGES+1.272
kefeller+1.120
bley+1.100
 STATES+1.077
Neg logit contributions
��-1.210
�士-1.095
-0.984
NT-0.970
 welf-0.963
Tokenig
Token ID328
Feature activation-1.43e+00

Pos logit contributions
iph+9.366
 IMAGES+9.042
bley+8.032
agus+7.802
kefeller+7.773
Neg logit contributions
��-8.862
�士-8.367
��-7.412
catentry-7.116
 welf-7.068
Tokenes
Token ID274
Feature activation+3.61e+00

Pos logit contributions
��+1.021
�士+0.908
NT+0.831
+0.817
 Ll+0.793
Neg logit contributions
 IMAGES-1.222
iph-1.191
kefeller-1.084
bley-1.057
 STATES-1.040
Token Ri
Token ID30385
Feature activation+7.11e+00

Pos logit contributions
 IMAGES+2.824
iph+2.822
kefeller+2.457
bley+2.430
 STATES+2.389
Neg logit contributions
��-2.808
�士-2.559
-2.279
NT-2.258
��-2.244
Tokenks
Token ID591
Feature activation-2.56e+00

Pos logit contributions
iph+5.497
 IMAGES+5.285
bley+4.729
pees+4.703
kefeller+4.675
Neg logit contributions
��-5.498
�士-5.147
-4.466
��-4.420
 welf-4.375
Tokenbank
Token ID17796
Feature activation+4.33e+00

Pos logit contributions
��+2.131
�士+1.935
+1.795
NT+1.791
 welf+1.737
Neg logit contributions
 IMAGES-1.868
iph-1.810
kefeller-1.596
 STATES-1.544
bley-1.543
TokenEmail
Token ID15333
Feature activation-3.28e+00

Pos logit contributions
 IMAGES+2.358
iph+2.342
kefeller+2.107
bley+2.076
 STATES+2.046
Neg logit contributions
��-1.487
�士-1.330
-1.140
��-1.116
NT-1.116
Token This
Token ID770
Feature activation+1.65e+00

Pos logit contributions
��+2.177
�士+1.863
NT+1.737
+1.706
 welf+1.631
Neg logit contributions
iph-2.914
 IMAGES-2.908
kefeller-2.678
bley-2.612
agus-2.518
Token Story
Token ID8362
Feature activation+4.32e+00

Pos logit contributions
 IMAGES+1.103
iph+1.084
kefeller+0.924
bley+0.900
 STATES+0.883
Neg logit contributions
��-1.484
�士-1.377
-1.257
NT-1.241
��-1.240
Token Send
Token ID16290
Feature activation+1.48e+00

Pos logit contributions
 IMAGES+3.273
iph+3.213
kefeller+2.793
bley+2.775
 STATES+2.740
Neg logit contributions
��-3.502
�士-3.220
-2.913
��-2.893
 welf-2.857
Token email
Token ID3053
Feature activation+3.33e+00

Pos logit contributions
 IMAGES+1.285
iph+1.274
kefeller+1.138
bley+1.118
 STATES+1.103
Neg logit contributions
��-1.033
�士-0.932
-0.824
NT-0.818
 welf-0.805
Token to
Token ID284
Feature activation+1.16e+01

Pos logit contributions
 IMAGES+2.375
iph+2.312
kefeller+2.022
 STATES+1.982
bley+1.968
Neg logit contributions
��-2.839
�士-2.667
-2.387
NT-2.371
��-2.355
Token this
Token ID428
Feature activation+7.56e+00

Pos logit contributions
 IMAGES+6.902
iph+6.532
agus+5.754
 STATES+5.747
pees+5.543
Neg logit contributions
��-10.227
�士-10.092
��-9.822
-8.921
ADRA-8.829
Token address
Token ID2209
Feature activation+6.99e+00

Pos logit contributions
 IMAGES+3.578
iph+3.347
 STATES+2.709
bley+2.548
 rotation+2.531
Neg logit contributions
��-8.237
�士-7.904
��-7.301
-7.030
 welf-7.015
Token Enter
Token ID6062
Feature activation-1.16e+00

Pos logit contributions
 IMAGES+4.634
iph+4.592
 STATES+3.909
bley+3.738
pees+3.732
Neg logit contributions
��-6.237
�士-6.004
��-5.364
-5.182
 welf-5.165
Token Your
Token ID3406
Feature activation+2.69e+00

Pos logit contributions
��+0.667
�士+0.545
NT+0.523
+0.509
Associated+0.499
Neg logit contributions
 IMAGES-1.145
iph-1.138
kefeller-1.052
bley-1.018
agus-1.009
Token Name
Token ID6530
Feature activation+2.19e+00

Pos logit contributions
 IMAGES+2.509
iph+2.494
kefeller+2.246
bley+2.211
 STATES+2.178
Neg logit contributions
��-1.698
�士-1.508
-1.319
NT-1.308
 welf-1.281
Token you
Token ID345
Feature activation+2.56e+00

Pos logit contributions
 IMAGES+4.200
iph+4.134
 STATES+3.491
bley+3.463
kefeller+3.430
Neg logit contributions
��-4.550
�士-4.161
-3.709
��-3.695
NT-3.630
Token'd
Token ID1549
Feature activation-9.09e-01

Pos logit contributions
 IMAGES+1.756
iph+1.742
kefeller+1.506
bley+1.472
 STATES+1.440
Neg logit contributions
��-2.247
�士-2.064
-1.885
NT-1.873
 welf-1.847
Token recommend
Token ID4313
Feature activation+3.66e+00

Pos logit contributions
��+0.687
�士+0.615
NT+0.574
+0.568
 welf+0.552
Neg logit contributions
 IMAGES-0.716
iph-0.711
kefeller-0.638
bley-0.619
 STATES-0.609
Token in
Token ID287
Feature activation+4.08e+00

Pos logit contributions
 IMAGES+2.711
iph+2.700
kefeller+2.300
bley+2.290
 STATES+2.250
Neg logit contributions
��-3.031
�士-2.780
-2.507
NT-2.477
��-2.458
Token the
Token ID262
Feature activation+5.71e+00

Pos logit contributions
 IMAGES+3.074
iph+3.027
kefeller+2.555
bley+2.536
 STATES+2.536
Neg logit contributions
��-3.364
�士-3.081
-2.751
NT-2.731
��-2.724
Token comments
Token ID3651
Feature activation+1.16e+01

Pos logit contributions
 IMAGES+3.518
iph+3.458
 STATES+2.805
kefeller+2.784
bley+2.777
Neg logit contributions
��-5.466
�士-5.066
-4.609
��-4.596
NT-4.587
Token.
Token ID13
Feature activation+1.62e+00

Pos logit contributions
iph+6.908
 IMAGES+6.828
 STATES+5.513
agus+5.455
bley+5.451
Neg logit contributions
��-10.639
�士-9.919
-8.911
��-8.782
catentry-8.767
Token<|endoftext|>
Token ID50256
Feature activation+4.65e-01

Pos logit contributions
 IMAGES+1.195
iph+1.182
kefeller+1.024
bley+1.006
 STATES+0.989
Neg logit contributions
��-1.344
�士-1.231
-1.111
NT-1.104
��-1.095
TokenRed
Token ID7738
Feature activation-2.68e-01

Pos logit contributions
 IMAGES+0.365
iph+0.362
kefeller+0.320
bley+0.314
 STATES+0.308
Neg logit contributions
��-0.363
�士-0.329
-0.297
NT-0.296
 welf-0.290
Token Transit
Token ID22325
Feature activation-2.54e+00

Pos logit contributions
��+0.204
�士+0.184
+0.166
NT+0.165
��+0.162
Neg logit contributions
 IMAGES-0.215
iph-0.213
kefeller-0.190
bley-0.186
 STATES-0.182
Token L
Token ID406
Feature activation-4.98e+00

Pos logit contributions
��+1.966
�士+1.764
NT+1.619
+1.615
 Ll+1.572
Neg logit contributions
 IMAGES-1.971
iph-1.961
kefeller-1.745
bley-1.726
pees-1.672
Token Let
Token ID3914
Feature activation+3.34e+00

Pos logit contributions
 IMAGES+4.919
iph+4.689
 STATES+4.060
bley+3.948
kefeller+3.942
Neg logit contributions
��-4.887
�士-4.480
��-4.057
 welf-3.980
-3.925
Token us
Token ID514
Feature activation+6.93e+00

Pos logit contributions
 IMAGES+2.304
iph+2.274
kefeller+1.915
bley+1.911
 STATES+1.878
Neg logit contributions
��-2.922
�士-2.683
 welf-2.445
-2.420
NT-2.417
Token know
Token ID760
Feature activation+7.84e+00

Pos logit contributions
 IMAGES+4.887
iph+4.858
bley+4.191
 STATES+4.185
kefeller+4.096
Neg logit contributions
��-5.934
�士-5.424
��-4.874
-4.867
 welf-4.767
Token in
Token ID287
Feature activation+7.52e+00

Pos logit contributions
 IMAGES+5.378
iph+5.299
 STATES+4.430
bley+4.378
agus+4.324
Neg logit contributions
��-6.784
�士-6.308
-5.684
��-5.639
catentry-5.619
Token the
Token ID262
Feature activation+4.27e+00

Pos logit contributions
 IMAGES+5.208
iph+5.071
 STATES+4.160
bley+4.133
kefeller+4.101
Neg logit contributions
��-6.550
�士-6.050
��-5.437
catentry-5.408
NT-5.407
Token comments
Token ID3651
Feature activation+1.15e+01

Pos logit contributions
 IMAGES+2.697
iph+2.642
kefeller+2.184
 STATES+2.178
bley+2.174
Neg logit contributions
��-4.009
�士-3.700
-3.381
��-3.365
NT-3.340
Token below
Token ID2174
Feature activation+4.86e+00

Pos logit contributions
iph+6.543
 IMAGES+6.367
 STATES+5.233
agus+5.117
bley+5.098
Neg logit contributions
��-10.733
�士-10.011
-9.144
 welf-8.971
��-8.920
Token.
Token ID13
Feature activation+3.09e+00

Pos logit contributions
iph+3.574
 IMAGES+3.565
bley+3.014
kefeller+2.995
 STATES+2.992
Neg logit contributions
��-4.011
�士-3.689
-3.301
 welf-3.256
NT-3.255
Token<|endoftext|>
Token ID50256
Feature activation+7.22e-01

Pos logit contributions
 IMAGES+2.349
iph+2.295
kefeller+1.975
bley+1.943
 STATES+1.936
Neg logit contributions
��-2.529
�士-2.322
��-2.081
-2.069
 welf-2.063
TokenFrom
Token ID4863
Feature activation-5.68e+00

Pos logit contributions
 IMAGES+0.566
iph+0.562
kefeller+0.496
bley+0.486
 STATES+0.477
Neg logit contributions
��-0.563
�士-0.511
-0.461
NT-0.459
 welf-0.450
Token John
Token ID1757
Feature activation-4.41e+00

Pos logit contributions
��+4.370
�士+3.893
NT+3.613
+3.582
 Ll+3.556
Neg logit contributions
 IMAGES-4.308
iph-4.272
kefeller-3.866
bley-3.792
pees-3.745
Token v
Token ID410
Feature activation+2.02e+00

Pos logit contributions
 IMAGES+2.917
iph+2.906
kefeller+2.546
bley+2.510
 STATES+2.482
Neg logit contributions
��-2.625
�士-2.376
-2.112
��-2.088
NT-2.086
Tokenous
Token ID516
Feature activation+4.47e+00

Pos logit contributions
iph+1.488
 IMAGES+1.485
kefeller+1.289
bley+1.266
 STATES+1.237
Neg logit contributions
��-1.684
�士-1.540
-1.393
NT-1.375
��-1.372
Token,
Token ID11
Feature activation+4.97e+00

Pos logit contributions
 IMAGES+3.492
iph+3.481
kefeller+3.005
bley+2.972
 STATES+2.949
Neg logit contributions
��-3.460
�士-3.154
-2.818
NT-2.793
��-2.791
Token le
Token ID443
Feature activation+6.46e+00

Pos logit contributions
iph+4.085
 IMAGES+4.082
kefeller+3.521
bley+3.492
 STATES+3.471
Neg logit contributions
��-3.654
�士-3.311
-2.929
��-2.915
NT-2.885
Token S
Token ID311
Feature activation+3.02e-01

Pos logit contributions
iph+5.154
 IMAGES+5.111
bley+4.371
kefeller+4.368
 STATES+4.319
Neg logit contributions
��-4.892
�士-4.437
NT-3.905
��-3.896
-3.848
Tokené
Token ID2634
Feature activation+1.15e+01

Pos logit contributions
 IMAGES+0.230
iph+0.227
kefeller+0.201
bley+0.196
 STATES+0.192
Neg logit contributions
��-0.242
�士-0.221
-0.200
NT-0.200
 welf-0.196
Tokennat
Token ID32353
Feature activation+6.95e+00

Pos logit contributions
iph+7.954
 IMAGES+7.626
bley+6.851
pees+6.671
agus+6.657
Neg logit contributions
��-9.417
�士-8.639
NT-7.616
-7.544
 welf-7.512
Token Imp
Token ID9855
Feature activation+1.87e+00

Pos logit contributions
iph+5.380
 IMAGES+5.256
kefeller+4.474
agus+4.454
 STATES+4.452
Neg logit contributions
��-5.406
�士-4.956
-4.365
��-4.361
NT-4.331
Tokené
Token ID2634
Feature activation+3.63e-01

Pos logit contributions
 IMAGES+1.278
iph+1.276
kefeller+1.100
bley+1.076
 STATES+1.048
Neg logit contributions
��-1.635
�士-1.505
-1.374
NT-1.362
 welf-1.349
Tokenrial
Token ID4454
Feature activation+4.62e+00

Pos logit contributions
 IMAGES+0.279
iph+0.277
kefeller+0.244
bley+0.239
 STATES+0.234
Neg logit contributions
��-0.289
�士-0.263
-0.237
NT-0.236
 welf-0.232
Token,
Token ID11
Feature activation+2.79e+00

Pos logit contributions
 IMAGES+3.564
iph+3.561
kefeller+3.056
 STATES+3.014
bley+3.010
Neg logit contributions
��-3.639
�士-3.307
-2.955
��-2.924
NT-2.913
Token,
Token ID11
Feature activation+7.90e-01

Pos logit contributions
 IMAGES+2.309
iph+2.308
kefeller+1.999
bley+1.989
 STATES+1.948
Neg logit contributions
��-2.302
�士-2.103
-1.872
NT-1.867
��-1.852
Token shalt
Token ID36258
Feature activation+6.90e+00

Pos logit contributions
 IMAGES+0.644
iph+0.639
kefeller+0.570
bley+0.557
 STATES+0.547
Neg logit contributions
��-0.591
�士-0.534
-0.481
NT-0.478
 welf-0.470
Token thou
Token ID14210
Feature activation+7.05e+00

Pos logit contributions
iph+4.727
 IMAGES+4.680
bley+3.914
kefeller+3.832
pees+3.811
Neg logit contributions
��-5.945
�士-5.623
��-5.059
catentry-4.930
NT-4.905
Token take
Token ID1011
Feature activation+4.23e+00

Pos logit contributions
iph+5.117
 IMAGES+5.082
bley+4.303
kefeller+4.225
 STATES+4.194
Neg logit contributions
��-5.839
�士-5.490
��-4.879
catentry-4.787
-4.778
Token unto
Token ID12722
Feature activation+8.41e+00

Pos logit contributions
 IMAGES+3.077
iph+3.072
kefeller+2.599
bley+2.590
 STATES+2.540
Neg logit contributions
��-3.525
�士-3.258
-2.911
��-2.898
NT-2.892
Token thy
Token ID11906
Feature activation+1.14e+01

Pos logit contributions
iph+5.049
 IMAGES+4.895
bley+4.026
pees+3.973
kefeller+3.915
Neg logit contributions
��-7.762
�士-7.382
NT-6.727
��-6.704
catentry-6.669
Tokenself
Token ID944
Feature activation+5.40e+00

Pos logit contributions
 IMAGES+7.931
iph+7.827
bley+6.721
pees+6.484
agus+6.402
Neg logit contributions
��-9.338
�士-8.660
��-7.764
NT-7.745
catentry-7.597
Token.
Token ID13
Feature activation+3.95e+00

Pos logit contributions
 IMAGES+4.020
iph+3.986
bley+3.357
kefeller+3.353
agus+3.323
Neg logit contributions
��-4.411
�士-4.060
NT-3.607
-3.598
catentry-3.591
Token De
Token ID1024
Feature activation+2.89e+00

Pos logit contributions
 IMAGES+2.972
iph+2.924
kefeller+2.512
bley+2.506
 STATES+2.486
Neg logit contributions
��-3.179
�士-2.919
-2.612
��-2.611
 welf-2.587
Tokenuter
Token ID11894
Feature activation-6.11e-01

Pos logit contributions
 IMAGES+2.172
iph+2.171
kefeller+1.883
bley+1.859
 STATES+1.819
Neg logit contributions
��-2.343
�士-2.138
-1.932
NT-1.923
��-1.898
Tokenonomy
Token ID30565
Feature activation-6.80e-01

Pos logit contributions
��+0.411
�士+0.366
+0.327
NT+0.323
��+0.314
Neg logit contributions
 IMAGES-0.545
iph-0.537
kefeller-0.487
bley-0.481
 STATES-0.470
Token female
Token ID4048
Feature activation-1.12e+00

Pos logit contributions
 IMAGES+2.279
iph+2.268
kefeller+1.979
bley+1.944
 STATES+1.918
Neg logit contributions
��-2.360
�士-2.149
-1.939
NT-1.919
��-1.904
Token crowd
Token ID4315
Feature activation+2.59e+00

Pos logit contributions
��+0.810
�士+0.723
NT+0.657
+0.653
 welf+0.639
Neg logit contributions
 IMAGES-0.935
iph-0.928
kefeller-0.834
bley-0.822
 STATES-0.801
Token.
Token ID13
Feature activation+4.59e+00

Pos logit contributions
 IMAGES+1.975
iph+1.965
kefeller+1.717
bley+1.686
 STATES+1.657
Neg logit contributions
��-2.072
�士-1.894
-1.706
NT-1.692
 welf-1.671
Token
Token ID198
Feature activation+1.44e+00

Pos logit contributions
 IMAGES+3.572
iph+3.524
kefeller+3.029
 STATES+3.006
bley+2.995
Neg logit contributions
��-3.595
�士-3.293
 welf-2.943
-2.935
��-2.931
Token
Token ID198
Feature activation+5.42e+00

Pos logit contributions
 IMAGES+1.170
iph+1.162
kefeller+1.027
bley+1.009
 STATES+0.993
Neg logit contributions
��-1.081
�士-0.982
-0.878
NT-0.868
 welf-0.860
TokenWho
Token ID8241
Feature activation+1.14e+01

Pos logit contributions
iph+4.160
 IMAGES+4.090
kefeller+3.524
bley+3.511
agus+3.448
Neg logit contributions
��-4.258
�士-3.962
 welf-3.540
��-3.538
-3.487
Token do
Token ID466
Feature activation+8.19e+00

Pos logit contributions
iph+7.694
 IMAGES+7.533
bley+6.117
 STATES+6.084
agus+6.071
Neg logit contributions
��-9.521
�士-8.933
 welf-7.925
��-7.917
catentry-7.752
Token you
Token ID345
Feature activation+3.33e+00

Pos logit contributions
iph+5.764
 IMAGES+5.729
bley+4.808
 STATES+4.800
kefeller+4.776
Neg logit contributions
��-6.754
�士-6.309
��-5.674
catentry-5.671
 welf-5.625
Token think
Token ID892
Feature activation+2.24e+00

Pos logit contributions
 IMAGES+2.137
iph+2.119
kefeller+1.811
bley+1.768
 STATES+1.726
Neg logit contributions
��-3.079
�士-2.841
-2.608
NT-2.590
 welf-2.559
Token should
Token ID815
Feature activation+8.12e-02

Pos logit contributions
 IMAGES+1.661
iph+1.648
kefeller+1.444
bley+1.413
 STATES+1.383
Neg logit contributions
��-1.849
�士-1.688
-1.533
NT-1.525
 welf-1.500
Token be
Token ID307
Feature activation+1.30e+00

Pos logit contributions
 IMAGES+0.062
iph+0.062
kefeller+0.055
bley+0.054
 STATES+0.052
Neg logit contributions
��-0.064
�士-0.058
NT-0.053
-0.053
 welf-0.052
Tokente
Token ID660
Feature activation+2.07e+00

Pos logit contributions
iph+2.421
 IMAGES+2.383
bley+1.903
 STATES+1.878
kefeller+1.869
Neg logit contributions
��-4.277
�士-4.028
 welf-3.707
-3.678
catentry-3.642
Tokenlegraph
Token ID16606
Feature activation+5.42e+00

Pos logit contributions
 IMAGES+1.822
iph+1.819
bley+1.600
kefeller+1.583
pees+1.574
Neg logit contributions
��-1.421
�士-1.294
-1.122
NT-1.117
 welf-1.116
Token.
Token ID13
Feature activation+8.41e-01

Pos logit contributions
iph+3.580
 IMAGES+3.574
kefeller+2.979
 STATES+2.931
bley+2.929
Neg logit contributions
��-4.957
�士-4.755
 welf-4.256
��-4.192
-4.184
Tokenco
Token ID1073
Feature activation-3.18e+00

Pos logit contributions
 IMAGES+0.726
iph+0.715
kefeller+0.645
bley+0.633
 STATES+0.622
Neg logit contributions
��-0.587
�士-0.522
-0.470
NT-0.469
catentry-0.452
Token.
Token ID13
Feature activation+6.30e-01

Pos logit contributions
��+2.219
NT+1.910
�士+1.865
Associated+1.830
 Ll+1.821
Neg logit contributions
 IMAGES-2.643
iph-2.575
kefeller-2.356
bley-2.288
pees-2.235
Tokenuk
Token ID2724
Feature activation+1.12e+01

Pos logit contributions
 IMAGES+0.398
iph+0.394
kefeller+0.339
bley+0.328
 STATES+0.320
Neg logit contributions
��-0.585
�士-0.537
NT-0.498
-0.497
 welf-0.486
Token/
Token ID14
Feature activation+3.57e+00

Pos logit contributions
 IMAGES+5.446
iph+5.210
 STATES+4.222
bley+3.816
pees+3.779
Neg logit contributions
��-11.768
�士-11.602
 welf-10.656
��-10.462
-10.197
Tokenbusiness
Token ID22680
Feature activation+3.74e+00

Pos logit contributions
iph+2.629
 IMAGES+2.617
kefeller+2.262
bley+2.210
 STATES+2.179
Neg logit contributions
��-2.941
�士-2.720
-2.452
��-2.436
 welf-2.413
Token/
Token ID14
Feature activation+1.84e+00

Pos logit contributions
 IMAGES+2.350
iph+2.328
kefeller+1.963
bley+1.932
 STATES+1.880
Neg logit contributions
��-3.501
�士-3.257
-2.973
NT-2.946
 welf-2.945
Tokenanalysis
Token ID20930
Feature activation+3.55e+00

Pos logit contributions
iph+1.504
 IMAGES+1.500
kefeller+1.323
bley+1.302
 STATES+1.277
Neg logit contributions
��-1.372
�士-1.247
-1.112
NT-1.099
��-1.095
Token/
Token ID14
Feature activation+1.08e+00

Pos logit contributions
 IMAGES+2.234
iph+2.219
kefeller+1.860
bley+1.805
 STATES+1.788
Neg logit contributions
��-3.297
�士-3.055
-2.820
catentry-2.792
NT-2.778
Token or
Token ID393
Feature activation-1.25e+00

Pos logit contributions
��+1.082
�士+0.975
+0.891
NT+0.886
 welf+0.861
Neg logit contributions
 IMAGES-1.108
iph-1.082
kefeller-0.968
bley-0.942
 STATES-0.929
Token traffic
Token ID4979
Feature activation-3.32e+00

Pos logit contributions
��+0.937
�士+0.839
+0.767
NT+0.760
 welf+0.755
Neg logit contributions
 IMAGES-1.013
iph-1.009
kefeller-0.900
bley-0.881
pees-0.861
Token controlling
Token ID12755
Feature activation+7.91e-01

Pos logit contributions
��+2.993
�士+2.722
+2.560
NT+2.526
 welf+2.498
Neg logit contributions
 IMAGES-2.168
iph-2.133
kefeller-1.862
bley-1.810
 STATES-1.761
Token devices
Token ID4410
Feature activation+3.15e+00

Pos logit contributions
 IMAGES+0.634
iph+0.629
kefeller+0.558
bley+0.547
 STATES+0.536
Neg logit contributions
��-0.602
�士-0.545
-0.492
NT-0.489
 welf-0.481
Token.
Token ID13
Feature activation+1.55e+00

Pos logit contributions
iph+2.281
 IMAGES+2.274
kefeller+1.961
bley+1.947
 STATES+1.901
Neg logit contributions
��-2.620
�士-2.416
-2.168
NT-2.156
��-2.146
Token Who
Token ID5338
Feature activation+1.12e+01

Pos logit contributions
 IMAGES+1.242
iph+1.235
kefeller+1.095
bley+1.073
 STATES+1.050
Neg logit contributions
��-1.180
�士-1.068
-0.963
NT-0.958
 welf-0.938
Token is
Token ID318
Feature activation+3.84e+00

Pos logit contributions
iph+7.516
 IMAGES+7.319
bley+6.266
pees+6.258
kefeller+6.080
Neg logit contributions
��-9.341
�士-8.689
catentry-7.783
��-7.721
NT-7.560
Token more
Token ID517
Feature activation+4.43e+00

Pos logit contributions
 IMAGES+2.903
iph+2.901
kefeller+2.523
bley+2.509
 STATES+2.453
Neg logit contributions
��-3.050
�士-2.800
-2.515
NT-2.483
��-2.479
Token important
Token ID1593
Feature activation+2.42e+00

Pos logit contributions
 IMAGES+3.469
iph+3.461
kefeller+3.031
bley+2.972
 STATES+2.948
Neg logit contributions
��-3.392
�士-3.117
-2.773
NT-2.752
��-2.739
Token,
Token ID11
Feature activation+5.30e+00

Pos logit contributions
 IMAGES+1.709
iph+1.702
kefeller+1.471
bley+1.442
 STATES+1.413
Neg logit contributions
��-2.064
�士-1.903
-1.722
NT-1.709
��-1.692
Token bicycl
Token ID48094
Feature activation+1.81e+00

Pos logit contributions
 IMAGES+3.992
iph+3.975
kefeller+3.479
bley+3.408
 STATES+3.366
Neg logit contributions
��-4.236
�士-3.909
-3.470
��-3.447
NT-3.426
Token00
Token ID405
Feature activation+2.31e+00

Pos logit contributions
��+0.485
�士+0.428
NT+0.387
+0.381
────+0.365
Neg logit contributions
 IMAGES-0.752
iph-0.751
kefeller-0.685
bley-0.671
pees-0.657
Tokenpm
Token ID4426
Feature activation+5.55e+00

Pos logit contributions
 IMAGES+1.826
iph+1.813
kefeller+1.590
bley+1.559
 STATES+1.541
Neg logit contributions
��-1.790
�士-1.626
-1.454
NT-1.436
 welf-1.435
Token GMT
Token ID16987
Feature activation+6.35e+00

Pos logit contributions
 IMAGES+4.248
iph+4.152
kefeller+3.575
bley+3.555
 STATES+3.518
Neg logit contributions
��-4.488
�士-4.110
-3.648
��-3.618
 welf-3.603
Token (
Token ID357
Feature activation+3.30e+00

Pos logit contributions
 IMAGES+4.598
iph+4.539
kefeller+3.809
bley+3.766
 STATES+3.765
Neg logit contributions
��-5.336
�士-4.944
-4.406
��-4.374
 welf-4.373
TokenGMT
Token ID49424
Feature activation+6.83e+00

Pos logit contributions
 IMAGES+2.778
iph+2.774
kefeller+2.451
bley+2.418
 STATES+2.368
Neg logit contributions
��-2.379
�士-2.144
-1.904
��-1.872
 welf-1.859
Token+
Token ID10
Feature activation+1.11e+01

Pos logit contributions
iph+3.688
 IMAGES+3.602
 STATES+2.895
bley+2.864
kefeller+2.851
Neg logit contributions
��-6.885
�士-6.491
 welf-5.924
��-5.913
-5.891
Token00
Token ID405
Feature activation+2.02e+00

Pos logit contributions
 IMAGES+8.197
iph+8.167
 STATES+6.987
bley+6.580
agus+6.553
Neg logit contributions
��-8.885
�士-8.624
��-7.741
 welf-7.617
-7.157
Token:
Token ID25
Feature activation-1.32e+00

Pos logit contributions
 IMAGES+1.576
iph+1.564
kefeller+1.375
bley+1.350
 STATES+1.328
Neg logit contributions
��-1.591
�士-1.450
-1.306
NT-1.295
 welf-1.279
Token00
Token ID405
Feature activation+6.01e+00

Pos logit contributions
��+0.740
�士+0.635
NT+0.592
+0.563
Associated+0.549
Neg logit contributions
 IMAGES-1.297
iph-1.296
kefeller-1.202
bley-1.163
pees-1.146
Token)
Token ID8
Feature activation+4.16e+00

Pos logit contributions
 IMAGES+4.117
iph+4.103
bley+3.398
kefeller+3.385
 STATES+3.375
Neg logit contributions
��-5.196
�士-4.832
-4.411
��-4.354
────-4.295
Token (
Token ID357
Feature activation+5.62e+00

Pos logit contributions
 IMAGES+3.256
iph+3.202
kefeller+2.767
bley+2.737
 STATES+2.708
Neg logit contributions
��-3.293
�士-3.039
-2.712
��-2.677
 welf-2.650
Token option
Token ID3038
Feature activation+7.36e-01

Pos logit contributions
 IMAGES+0.591
iph+0.588
kefeller+0.526
bley+0.517
 STATES+0.509
Neg logit contributions
��-0.448
�士-0.401
-0.354
NT-0.351
��-0.344
Token),
Token ID828
Feature activation-1.93e+00

Pos logit contributions
 IMAGES+0.513
iph+0.507
kefeller+0.442
bley+0.431
 STATES+0.421
Neg logit contributions
��-0.640
�士-0.585
-0.536
NT-0.534
 welf-0.525
Token Dwight
Token ID29902
Feature activation-1.00e-01

Pos logit contributions
��+1.450
�士+1.304
NT+1.186
+1.182
 welf+1.156
Neg logit contributions
 IMAGES-1.558
iph-1.539
kefeller-1.381
bley-1.350
 STATES-1.328
Token Powell
Token ID19675
Feature activation+2.82e+00

Pos logit contributions
��+0.061
�士+0.054
+0.047
NT+0.046
 welf+0.045
Neg logit contributions
 IMAGES-0.096
iph-0.095
kefeller-0.086
bley-0.084
 STATES-0.083
Token,
Token ID11
Feature activation-2.77e+00

Pos logit contributions
 IMAGES+2.061
iph+2.060
kefeller+1.778
bley+1.756
agus+1.718
Neg logit contributions
��-2.325
�士-2.124
-1.921
 welf-1.903
NT-1.894
Token Pet
Token ID4767
Feature activation-1.19e+01

Pos logit contributions
��+2.059
�士+1.847
NT+1.688
+1.677
 welf+1.641
Neg logit contributions
 IMAGES-2.242
iph-2.215
kefeller-2.000
bley-1.952
 STATES-1.920
Tokenter
Token ID353
Feature activation-1.68e+00

Pos logit contributions
��+8.661
 Ll+7.976
+7.898
�士+7.881
NT+7.645
Neg logit contributions
 IMAGES-9.229
iph-7.897
kefeller-7.806
 STATES-7.521
pees-7.040
Tokeni
Token ID72
Feature activation-1.34e+00

Pos logit contributions
��+1.329
�士+1.211
NT+1.101
+1.100
 Ll+1.076
Neg logit contributions
 IMAGES-1.290
iph-1.264
kefeller-1.131
bley-1.101
 STATES-1.082
Token Kop
Token ID40500
Feature activation-1.04e-02

Pos logit contributions
��+1.058
�士+0.963
+0.869
NT+0.865
 welf+0.848
Neg logit contributions
 IMAGES-1.037
iph-1.029
kefeller-0.911
bley-0.892
 STATES-0.872
Tokenonen
Token ID34481
Feature activation-1.11e+00

Pos logit contributions
��+0.007
�士+0.006
+0.006
NT+0.005
��+0.005
Neg logit contributions
 IMAGES-0.009
iph-0.009
kefeller-0.008
bley-0.008
 STATES-0.008
Token (
Token ID357
Feature activation+3.13e+00

Pos logit contributions
��+0.876
�士+0.796
+0.721
NT+0.716
 welf+0.703
Neg logit contributions
 IMAGES-0.860
iph-0.852
kefeller-0.752
bley-0.736
 STATES-0.723
Token.
Token ID13
Feature activation-2.95e+00

Pos logit contributions
��+0.021
�士+0.019
+0.017
NT+0.017
 welf+0.016
Neg logit contributions
 IMAGES-0.021
iph-0.020
kefeller-0.018
bley-0.018
 STATES-0.017
Token
Token ID851
Feature activation-8.70e-01

Pos logit contributions
��+2.439
�士+2.223
NT+2.058
+2.044
Associated+1.974
Neg logit contributions
 IMAGES-2.131
iph-2.118
kefeller-1.876
bley-1.817
 STATES-1.778
Token Ren
Token ID7152
Feature activation-4.71e+00

Pos logit contributions
��+0.690
�士+0.624
+0.569
NT+0.566
 welf+0.553
Neg logit contributions
 IMAGES-0.666
iph-0.665
kefeller-0.590
bley-0.574
pees-0.561
Tokenaud
Token ID3885
Feature activation-2.74e+00

Pos logit contributions
��+3.880
�士+3.524
NT+3.297
+3.263
 Ll+3.194
Neg logit contributions
 IMAGES-3.418
iph-3.322
kefeller-2.982
 STATES-2.857
bley-2.817
Token Lav
Token ID21438
Feature activation-4.52e+00

Pos logit contributions
��+2.101
�士+1.899
NT+1.716
+1.715
 Ll+1.691
Neg logit contributions
 IMAGES-2.177
iph-2.143
kefeller-1.943
bley-1.852
pees-1.851
Tokeno
Token ID78
Feature activation-1.17e+01

Pos logit contributions
��+3.158
�士+2.875
NT+2.706
+2.616
 Ll+2.575
Neg logit contributions
 IMAGES-3.772
iph-3.538
kefeller-3.234
 STATES-3.148
pees-3.068
Tokenie
Token ID494
Feature activation-4.38e+00

Pos logit contributions
��+8.452
�士+7.887
NT+7.620
 Ll+7.305
cephal+7.206
Neg logit contributions
 IMAGES-8.548
iph-7.671
kefeller-7.391
 STATES-7.262
bley-7.076
Token (@
Token ID4275
Feature activation+2.55e+00

Pos logit contributions
��+2.762
�士+2.463
NT+2.205
+2.192
 Ll+2.114
Neg logit contributions
 IMAGES-3.965
iph-3.958
kefeller-3.644
bley-3.508
pees-3.456
Tokenren
Token ID918
Feature activation-4.61e+00

Pos logit contributions
iph+2.137
 IMAGES+2.129
kefeller+1.877
bley+1.872
pees+1.821
Neg logit contributions
��-1.841
�士-1.681
-1.483
 welf-1.458
��-1.457
Tokenlav
Token ID18809
Feature activation-1.74e+00

Pos logit contributions
��+3.400
�士+3.027
+2.801
NT+2.798
 welf+2.723
Neg logit contributions
 IMAGES-3.760
iph-3.575
kefeller-3.229
 STATES-3.170
bley-3.092
Tokeno
Token ID78
Feature activation-5.69e+00

Pos logit contributions
��+1.350
�士+1.219
+1.107
NT+1.106
 welf+1.081
Neg logit contributions
 IMAGES-1.365
iph-1.327
kefeller-1.177
bley-1.150
 STATES-1.147
Token,
Token ID11
Feature activation+2.29e+00

Pos logit contributions
 IMAGES+2.984
iph+2.949
kefeller+2.577
bley+2.540
 STATES+2.488
Neg logit contributions
��-3.188
�士-2.916
-2.631
catentry-2.592
��-2.589
Token if
Token ID611
Feature activation+1.66e+00

Pos logit contributions
 IMAGES+1.789
iph+1.775
kefeller+1.563
bley+1.529
 STATES+1.501
Neg logit contributions
��-1.795
�士-1.638
-1.473
NT-1.461
��-1.441
Token Y
Token ID575
Feature activation-1.11e-01

Pos logit contributions
 IMAGES+1.306
iph+1.297
kefeller+1.145
bley+1.122
 STATES+1.099
Neg logit contributions
��-1.289
�士-1.174
-1.055
NT-1.047
 welf-1.032
Tokenonge
Token ID14220
Feature activation-1.19e+00

Pos logit contributions
��+0.081
�士+0.073
NT+0.065
+0.065
��+0.063
Neg logit contributions
 IMAGES-0.092
iph-0.091
kefeller-0.082
bley-0.080
 STATES-0.079
Token-
Token ID12
Feature activation-6.68e+00

Pos logit contributions
��+1.020
�士+0.933
+0.856
NT+0.848
��+0.836
Neg logit contributions
 IMAGES-0.839
iph-0.833
kefeller-0.724
bley-0.704
 STATES-0.694
TokenE
Token ID36
Feature activation-1.16e+01

Pos logit contributions
��+5.758
�士+5.300
+4.956
NT+4.947
Associated+4.928
Neg logit contributions
 IMAGES-4.407
iph-4.214
kefeller-3.754
 STATES-3.689
bley-3.604
Tokengl
Token ID4743
Feature activation+6.74e-01

Pos logit contributions
��+4.790
�士+4.127
+3.587
��+3.470
HL+3.430
Neg logit contributions
 IMAGES-12.070
iph-11.646
kefeller-11.059
 STATES-10.903
bley-10.669
Tokeninton
Token ID2371
Feature activation+3.37e+00

Pos logit contributions
 IMAGES+0.398
iph+0.394
kefeller+0.333
bley+0.323
 STATES+0.315
Neg logit contributions
��-0.656
�士-0.606
-0.562
NT-0.559
 welf-0.551
Token is
Token ID318
Feature activation-9.01e-01

Pos logit contributions
 IMAGES+2.480
iph+2.454
kefeller+2.142
bley+2.103
 STATES+2.058
Neg logit contributions
��-2.780
�士-2.545
-2.292
NT-2.280
catentry-2.273
Token to
Token ID284
Feature activation-1.84e+00

Pos logit contributions
��+0.694
�士+0.629
+0.569
NT+0.567
 welf+0.557
Neg logit contributions
 IMAGES-0.716
iph-0.708
kefeller-0.626
bley-0.615
 STATES-0.602
Token become
Token ID1716
Feature activation+1.60e+00

Pos logit contributions
��+1.325
�士+1.192
NT+1.099
+1.072
��+1.048
Neg logit contributions
 IMAGES-1.510
iph-1.505
kefeller-1.346
bley-1.304
pees-1.286
TokenER
Token ID1137
Feature activation+5.94e-01

Pos logit contributions
��+2.393
�士+2.159
NT+2.028
+1.986
────+1.951
Neg logit contributions
iph-2.273
 IMAGES-2.237
kefeller-2.093
bley-2.024
pees-1.972
Token,
Token ID11
Feature activation+1.71e+00

Pos logit contributions
 IMAGES+0.463
iph+0.462
kefeller+0.411
bley+0.401
pees+0.392
Neg logit contributions
��-0.463
�士-0.420
-0.381
NT-0.380
 welf-0.371
Token THIS
Token ID12680
Feature activation+8.12e-01

Pos logit contributions
 IMAGES+1.322
iph+1.313
kefeller+1.157
bley+1.133
 STATES+1.111
Neg logit contributions
��-1.348
�士-1.226
-1.108
NT-1.102
 welf-1.083
Token W
Token ID370
Feature activation+4.01e+00

Pos logit contributions
iph+0.569
 IMAGES+0.567
kefeller+0.498
bley+0.484
pees+0.471
Neg logit contributions
��-0.695
�士-0.637
NT-0.584
-0.583
 welf-0.570
TokenON
Token ID1340
Feature activation+1.37e+00

Pos logit contributions
 IMAGES+3.531
iph+3.477
kefeller+3.074
bley+3.055
 STATES+3.026
Neg logit contributions
��-2.782
�士-2.507
-2.207
��-2.179
 welf-2.168
Token'
Token ID6
Feature activation-1.14e+01

Pos logit contributions
 IMAGES+1.040
iph+1.029
kefeller+0.901
bley+0.885
 STATES+0.874
Neg logit contributions
��-1.099
�士-1.002
-0.905
NT-0.895
��-0.889
TokenT
Token ID51
Feature activation+3.09e+00

Pos logit contributions
NT+5.520
��+5.280
────+4.640
RL+4.569
HL+4.370
Neg logit contributions
iph-11.481
kefeller-11.230
 IMAGES-10.827
bley-10.728
pees-10.390
Token WORK
Token ID30936
Feature activation+6.50e-01

Pos logit contributions
 IMAGES+2.136
iph+2.089
kefeller+1.797
bley+1.765
 STATES+1.756
Neg logit contributions
��-2.729
�士-2.511
-2.288
��-2.256
NT-2.253
Token*
Token ID9
Feature activation-4.45e+00

Pos logit contributions
 IMAGES+0.505
iph+0.504
kefeller+0.447
bley+0.435
pees+0.426
Neg logit contributions
��-0.509
�士-0.461
-0.419
NT-0.419
 welf-0.408
Token The
Token ID383
Feature activation-1.51e-01

Pos logit contributions
��+3.019
�士+2.672
NT+2.532
+2.454
 Ll+2.435
Neg logit contributions
iph-3.717
 IMAGES-3.701
kefeller-3.446
bley-3.286
pees-3.251
Token watch
Token ID2342
Feature activation+3.10e+00

Pos logit contributions
��+0.117
�士+0.106
+0.096
NT+0.096
 welf+0.095
Neg logit contributions
 IMAGES-0.119
iph-0.118
kefeller-0.105
bley-0.102
 STATES-0.100
TokenShe
Token ID3347
Feature activation-2.89e+00

Pos logit contributions
��+1.549
�士+1.383
NT+1.280
+1.267
────+1.240
Neg logit contributions
 IMAGES-1.590
iph-1.551
kefeller-1.418
bley-1.364
 STATES-1.342
Token pulls
Token ID16194
Feature activation-5.90e+00

Pos logit contributions
��+2.249
�士+2.030
NT+1.864
+1.860
 Ll+1.822
Neg logit contributions
 IMAGES-2.178
iph-2.172
kefeller-1.977
bley-1.882
agus-1.833
Token away
Token ID1497
Feature activation-2.62e+00

Pos logit contributions
��+4.427
�士+3.953
 Ll+3.680
NT+3.673
 welf+3.582
Neg logit contributions
iph-4.532
 IMAGES-4.517
kefeller-4.066
bley-3.875
 STATES-3.870
Token,
Token ID11
Feature activation-2.29e+00

Pos logit contributions
��+1.886
�士+1.669
NT+1.530
+1.527
────+1.508
Neg logit contributions
 IMAGES-2.179
iph-2.166
kefeller-1.954
bley-1.877
pees-1.853
Token hits
Token ID7127
Feature activation-4.44e+00

Pos logit contributions
��+1.768
�士+1.590
+1.443
NT+1.437
 welf+1.415
Neg logit contributions
 IMAGES-1.792
iph-1.782
kefeller-1.604
bley-1.556
pees-1.510
Token my
Token ID616
Feature activation-1.13e+01

Pos logit contributions
��+3.396
�士+3.082
NT+2.791
+2.783
 Ll+2.742
Neg logit contributions
 IMAGES-3.501
iph-3.377
kefeller-3.078
 STATES-2.938
bley-2.937
Token shoulder
Token ID8163
Feature activation-3.81e+00

Pos logit contributions
��+7.517
�士+6.389
 Ll+6.116
NT+6.033
+6.022
Neg logit contributions
 IMAGES-8.942
iph-8.853
kefeller-8.534
bley-7.878
pees-7.806
Token softly
Token ID26625
Feature activation-1.07e+00

Pos logit contributions
��+2.780
�士+2.480
+2.294
NT+2.277
────+2.242
Neg logit contributions
 IMAGES-3.083
iph-3.070
kefeller-2.769
bley-2.645
pees-2.628
Token and
Token ID290
Feature activation-9.80e-01

Pos logit contributions
��+0.835
�士+0.754
+0.685
NT+0.681
 welf+0.667
Neg logit contributions
 IMAGES-0.840
iph-0.834
kefeller-0.739
bley-0.720
 STATES-0.707
Token responds
Token ID20067
Feature activation+3.77e+00

Pos logit contributions
��+0.781
�士+0.709
+0.644
NT+0.642
 welf+0.633
Neg logit contributions
 IMAGES-0.745
iph-0.741
kefeller-0.660
bley-0.641
agus-0.621
Token,
Token ID11
Feature activation-5.08e-01

Pos logit contributions
 IMAGES+2.808
iph+2.786
kefeller+2.408
bley+2.390
 STATES+2.364
Neg logit contributions
��-3.061
�士-2.845
-2.532
NT-2.510
��-2.493
Token email
Token ID3053
Feature activation+4.47e+00

Pos logit contributions
 IMAGES+2.451
iph+2.446
kefeller+2.143
bley+2.109
 STATES+2.077
Neg logit contributions
��-2.371
�士-2.165
-1.936
NT-1.920
��-1.900
Token below
Token ID2174
Feature activation+5.69e+00

Pos logit contributions
 IMAGES+2.911
iph+2.907
kefeller+2.433
bley+2.411
 STATES+2.376
Neg logit contributions
��-4.052
�士-3.762
-3.424
NT-3.400
 welf-3.362
Token.
Token ID13
Feature activation+2.91e+00

Pos logit contributions
iph+4.003
 IMAGES+3.992
kefeller+3.355
bley+3.322
 STATES+3.306
Neg logit contributions
��-4.819
�士-4.495
-4.013
NT-3.984
 welf-3.954
Token :
Token ID1058
Feature activation-1.48e+00

Pos logit contributions
 IMAGES+2.283
iph+2.265
kefeller+1.971
bley+1.946
 STATES+1.922
Neg logit contributions
��-2.275
�士-2.076
-1.853
NT-1.842
��-1.830
Token To
Token ID1675
Feature activation-5.24e+00

Pos logit contributions
��+1.099
�士+0.984
+0.898
NT+0.894
 welf+0.868
Neg logit contributions
 IMAGES-1.204
iph-1.182
kefeller-1.070
bley-1.043
 STATES-1.022
Token prove
Token ID5879
Feature activation-1.13e+01

Pos logit contributions
��+2.516
�士+2.088
NT+1.907
+1.859
────+1.836
Neg logit contributions
 IMAGES-5.501
iph-5.437
kefeller-5.041
bley-4.857
pees-4.812
Token just
Token ID655
Feature activation-4.15e+00

Pos logit contributions
��+5.790
NT+5.087
────+5.051
+4.740
Krist+4.545
Neg logit contributions
 IMAGES-10.403
iph-9.395
kefeller-9.334
bley-9.125
 STATES-8.970
Token how
Token ID703
Feature activation-5.12e+00

Pos logit contributions
��+2.805
�士+2.334
NT+2.323
+2.264
────+2.231
Neg logit contributions
 IMAGES-3.589
iph-3.390
kefeller-3.224
bley-3.105
agus-3.104
Token easy
Token ID2562
Feature activation-4.83e+00

Pos logit contributions
��+2.936
�士+2.474
+2.313
NT+2.309
────+2.286
Neg logit contributions
 IMAGES-4.954
iph-4.774
kefeller-4.501
bley-4.303
 STATES-4.273
Token it
Token ID340
Feature activation-1.80e+00

Pos logit contributions
��+2.969
────+2.475
NT+2.436
�士+2.421
 Ll+2.420
Neg logit contributions
 IMAGES-4.353
iph-4.256
kefeller-3.926
bley-3.783
 STATES-3.736
Token is
Token ID318
Feature activation-4.49e+00

Pos logit contributions
��+1.286
�士+1.105
NT+1.078
+1.050
────+1.043
Neg logit contributions
 IMAGES-1.491
iph-1.466
kefeller-1.322
bley-1.287
 STATES-1.248
Token Ki
Token ID21927
Feature activation-1.77e+00

Pos logit contributions
��+3.845
�士+3.532
+3.213
 Ll+3.179
NT+3.141
Neg logit contributions
 IMAGES-3.929
iph-3.866
kefeller-3.451
bley-3.370
 STATES-3.330
Tokenpling
Token ID11347
Feature activation+1.63e+00

Pos logit contributions
��+1.622
�士+1.485
+1.373
NT+1.369
 welf+1.344
Neg logit contributions
 IMAGES-1.154
iph-1.124
kefeller-0.984
bley-0.948
 STATES-0.930
Token subway
Token ID19612
Feature activation+4.02e-02

Pos logit contributions
 IMAGES+0.996
iph+0.991
kefeller+0.835
bley+0.816
 STATES+0.793
Neg logit contributions
��-1.549
�士-1.437
-1.319
NT-1.313
��-1.299
Token station
Token ID4429
Feature activation-2.41e+00

Pos logit contributions
 IMAGES+0.031
iph+0.031
kefeller+0.027
bley+0.027
 STATES+0.026
Neg logit contributions
��-0.032
�士-0.029
-0.026
NT-0.026
 welf-0.026
Token in
Token ID287
Feature activation-1.69e+00

Pos logit contributions
��+1.883
�士+1.690
+1.546
NT+1.536
 Ll+1.495
Neg logit contributions
 IMAGES-1.881
iph-1.858
kefeller-1.655
bley-1.608
 STATES-1.583
Token Et
Token ID17906
Feature activation-1.13e+01

Pos logit contributions
��+1.246
�士+1.128
+1.018
NT+1.009
 welf+0.990
Neg logit contributions
 IMAGES-1.379
iph-1.367
kefeller-1.218
bley-1.190
 STATES-1.170
Tokenobic
Token ID20803
Feature activation-2.01e+00

Pos logit contributions
��+3.475
�士+2.522
ulum+2.309
NT+2.291
+2.266
Neg logit contributions
 IMAGES-13.171
iph-12.231
kefeller-12.145
 STATES-11.897
bley-11.880
Tokenoke
Token ID2088
Feature activation-3.51e-01

Pos logit contributions
��+1.748
�士+1.603
+1.471
NT+1.462
��+1.435
Neg logit contributions
 IMAGES-1.395
iph-1.383
kefeller-1.202
bley-1.175
 STATES-1.148
Token.
Token ID13
Feature activation-3.02e-01

Pos logit contributions
��+0.286
�士+0.261
+0.237
NT+0.235
 welf+0.231
Neg logit contributions
 IMAGES-0.263
iph-0.261
kefeller-0.229
bley-0.224
 STATES-0.220
Token
Token ID198
Feature activation-3.37e+00

Pos logit contributions
��+0.225
�士+0.204
+0.183
NT+0.182
 welf+0.178
Neg logit contributions
 IMAGES-0.246
iph-0.245
kefeller-0.218
bley-0.213
 STATES-0.209
Token
Token ID198
Feature activation-2.49e+00

Pos logit contributions
��+2.079
NT+1.760
Associated+1.688
�士+1.682
+1.663
Neg logit contributions
 IMAGES-3.041
iph-2.993
kefeller-2.814
bley-2.708
pees-2.658
Tokenf
Token ID69
Feature activation-3.43e+00

Pos logit contributions
��+3.360
NT+2.971
�士+2.966
+2.909
Associated+2.908
Neg logit contributions
 IMAGES-1.755
iph-1.635
kefeller-1.463
bley-1.349
 STATES-1.337
Token4
Token ID19
Feature activation-4.34e+00

Pos logit contributions
��+2.831
�士+2.596
NT+2.383
+2.379
��+2.306
Neg logit contributions
 IMAGES-2.517
iph-2.433
kefeller-2.144
bley-2.095
 STATES-2.085
Tokenb
Token ID65
Feature activation-7.42e+00

Pos logit contributions
��+2.688
�士+2.211
NT+2.194
+2.128
Associated+2.091
Neg logit contributions
 IMAGES-3.980
iph-3.862
kefeller-3.558
 STATES-3.424
bley-3.400
Tokenbe
Token ID1350
Feature activation-8.53e+00

Pos logit contributions
��+3.280
NT+2.516
cephal+2.500
+2.482
Associated+2.427
Neg logit contributions
 IMAGES-7.685
iph-7.378
kefeller-7.014
 STATES-6.813
ydia-6.542
Token66
Token ID2791
Feature activation-5.19e+00

Pos logit contributions
��+3.713
+3.000
NT+2.932
Associated+2.739
────+2.639
Neg logit contributions
 IMAGES-8.927
iph-8.518
kefeller-8.213
bley-8.049
pees-7.859
Token power
Token ID1176
Feature activation-1.11e+01

Pos logit contributions
��+3.775
�士+3.337
NT+3.124
+3.076
 Ll+3.000
Neg logit contributions
 IMAGES-4.186
iph-4.135
kefeller-3.804
bley-3.707
pees-3.601
Token off
Token ID572
Feature activation-4.12e+00

Pos logit contributions
��+7.751
�士+6.681
NT+6.509
+6.453
 Ll+6.369
Neg logit contributions
 IMAGES-8.447
iph-8.186
kefeller-7.715
bley-7.464
pees-7.371
Token
Token ID198
Feature activation-9.60e-01

Pos logit contributions
��+2.890
�士+2.546
NT+2.355
+2.342
 welf+2.288
Neg logit contributions
 IMAGES-3.441
iph-3.407
kefeller-3.114
bley-3.040
pees-2.996
Token
Token ID198
Feature activation-3.85e+00

Pos logit contributions
��+0.652
�士+0.560
NT+0.538
+0.524
Associated+0.515
Neg logit contributions
 IMAGES-0.833
iph-0.826
kefeller-0.757
bley-0.735
pees-0.721
TokenNotice
Token ID26396
Feature activation-7.59e-01

Pos logit contributions
��+2.921
�士+2.565
NT+2.468
+2.386
Associated+2.353
Neg logit contributions
 IMAGES-3.125
iph-2.948
kefeller-2.723
 STATES-2.628
bley-2.611
Token how
Token ID703
Feature activation+4.17e-01

Pos logit contributions
��+0.563
�士+0.506
+0.460
NT+0.456
 welf+0.448
Neg logit contributions
 IMAGES-0.623
iph-0.614
kefeller-0.551
bley-0.534
pees-0.528
Token lady
Token ID10846
Feature activation-5.97e-01

Pos logit contributions
��+0.473
�士+0.426
+0.389
NT+0.388
 welf+0.384
Neg logit contributions
 IMAGES-0.464
iph-0.456
kefeller-0.406
bley-0.399
pees-0.389
Token,
Token ID11
Feature activation-6.07e-01

Pos logit contributions
��+0.455
�士+0.411
+0.374
NT+0.371
 welf+0.364
Neg logit contributions
 IMAGES-0.478
iph-0.472
kefeller-0.422
bley-0.411
 STATES-0.404
Token he
Token ID339
Feature activation-2.28e+00

Pos logit contributions
��+0.439
�士+0.395
+0.354
NT+0.352
 welf+0.345
Neg logit contributions
 IMAGES-0.510
iph-0.505
kefeller-0.451
bley-0.444
 STATES-0.434
Token is
Token ID318
Feature activation-8.03e-01

Pos logit contributions
��+1.741
�士+1.583
+1.440
NT+1.437
 welf+1.418
Neg logit contributions
 IMAGES-1.777
iph-1.770
kefeller-1.576
bley-1.534
pees-1.488
Token a
Token ID257
Feature activation+5.43e-01

Pos logit contributions
��+0.615
�士+0.557
+0.503
NT+0.500
 welf+0.492
Neg logit contributions
 IMAGES-0.641
iph-0.636
kefeller-0.563
bley-0.552
 STATES-0.539
Token disillusion
Token ID39977
Feature activation-1.10e+01

Pos logit contributions
 IMAGES+0.432
iph+0.429
kefeller+0.379
bley+0.372
 STATES+0.365
Neg logit contributions
��-0.417
�士-0.378
-0.341
NT-0.339
 welf-0.334
Tokened
Token ID276
Feature activation-9.33e-01

Pos logit contributions
��+8.879
�士+7.932
+7.488
NT+7.429
Associated+7.147
Neg logit contributions
 IMAGES-7.627
iph-7.198
kefeller-6.601
bley-6.545
pees-6.342
Token Irish
Token ID8685
Feature activation+2.47e+00

Pos logit contributions
��+0.732
�士+0.668
+0.605
NT+0.603
 welf+0.602
Neg logit contributions
 IMAGES-0.719
iph-0.713
kefeller-0.630
bley-0.621
pees-0.609
Token revolutionary
Token ID12253
Feature activation+3.20e+00

Pos logit contributions
 IMAGES+1.819
iph+1.818
kefeller+1.584
bley+1.547
 STATES+1.518
Neg logit contributions
��-2.035
�士-1.865
-1.685
NT-1.671
��-1.646
Token who
Token ID508
Feature activation-8.82e-01

Pos logit contributions
 IMAGES+2.471
iph+2.456
kefeller+2.154
bley+2.105
 STATES+2.083
Neg logit contributions
��-2.533
�士-2.323
-2.076
NT-2.048
��-2.040
Token has
Token ID468
Feature activation-1.62e+00

Pos logit contributions
��+0.689
�士+0.626
+0.569
NT+0.566
 welf+0.558
Neg logit contributions
 IMAGES-0.683
iph-0.679
kefeller-0.602
bley-0.587
 STATES-0.572
Token versus
Token ID9051
Feature activation-8.75e-01

Pos logit contributions
 IMAGES+0.690
iph+0.686
kefeller+0.602
bley+0.590
 STATES+0.579
Neg logit contributions
��-0.730
�士-0.666
-0.602
NT-0.598
 welf-0.589
Token Kh
Token ID5311
Feature activation-2.69e-01

Pos logit contributions
��+0.628
�士+0.557
+0.503
NT+0.502
 welf+0.488
Neg logit contributions
 IMAGES-0.744
iph-0.737
kefeller-0.665
bley-0.648
 STATES-0.634
Tokenab
Token ID397
Feature activation+2.43e-02

Pos logit contributions
��+0.197
�士+0.177
+0.160
NT+0.159
 welf+0.155
Neg logit contributions
 IMAGES-0.225
iph-0.222
kefeller-0.199
bley-0.194
 STATES-0.191
Tokenib
Token ID571
Feature activation+1.49e+00

Pos logit contributions
 IMAGES+0.018
iph+0.017
kefeller+0.016
 STATES+0.015
bley+0.015
Neg logit contributions
��-0.021
�士-0.019
-0.017
NT-0.017
 welf-0.017
Token Nur
Token ID49281
Feature activation-2.84e-01

Pos logit contributions
 IMAGES+1.202
iph+1.196
kefeller+1.050
bley+1.042
 STATES+1.017
Neg logit contributions
��-1.133
�士-1.026
-0.923
NT-0.914
��-0.904
Tokenmag
Token ID19726
Feature activation-1.10e+01

Pos logit contributions
��+0.256
�士+0.231
+0.220
NT+0.215
 Ll+0.210
Neg logit contributions
 IMAGES-0.192
iph-0.182
kefeller-0.162
bley-0.161
 STATES-0.152
Tokenomed
Token ID12657
Feature activation-1.47e+00

Pos logit contributions
��+2.874
NT+2.278
�士+2.189
+2.115
HL+1.897
Neg logit contributions
 IMAGES-13.465
iph-12.410
kefeller-12.266
bley-11.891
 STATES-11.887
Tokenov
Token ID709
Feature activation+2.32e+00

Pos logit contributions
��+0.888
�士+0.770
+0.692
NT+0.680
��+0.652
Neg logit contributions
 IMAGES-1.410
iph-1.380
kefeller-1.280
bley-1.240
 STATES-1.225
Token
Token ID784
Feature activation+6.19e-02

Pos logit contributions
 IMAGES+1.725
iph+1.720
kefeller+1.491
bley+1.473
 STATES+1.438
Neg logit contributions
��-1.891
�士-1.737
-1.560
NT-1.556
 welf-1.544
Token which
Token ID543
Feature activation-1.95e+00

Pos logit contributions
 IMAGES+0.048
iph+0.048
kefeller+0.042
bley+0.041
 STATES+0.040
Neg logit contributions
��-0.049
�士-0.044
-0.040
NT-0.040
 welf-0.039
Token was
Token ID373
Feature activation-2.77e+00

Pos logit contributions
��+1.491
�士+1.344
+1.230
NT+1.226
 Ll+1.192
Neg logit contributions
iph-1.537
 IMAGES-1.530
kefeller-1.385
bley-1.325
pees-1.303
Token have
Token ID423
Feature activation-2.79e+00

Pos logit contributions
��+4.701
�士+4.103
+3.936
 Ll+3.820
Associated+3.808
Neg logit contributions
 IMAGES-5.690
iph-5.672
kefeller-5.291
bley-5.020
 STATES-4.955
Token been
Token ID587
Feature activation-3.79e+00

Pos logit contributions
��+2.066
�士+1.830
NT+1.685
+1.681
 Ll+1.632
Neg logit contributions
iph-2.269
 IMAGES-2.263
kefeller-2.025
bley-1.948
pees-1.912
Token reported
Token ID2098
Feature activation-5.50e-01

Pos logit contributions
��+2.809
�士+2.515
NT+2.263
+2.257
 welf+2.226
Neg logit contributions
 IMAGES-3.092
iph-3.074
kefeller-2.779
bley-2.677
 STATES-2.617
Token in
Token ID287
Feature activation-1.27e+00

Pos logit contributions
��+0.431
�士+0.388
+0.355
NT+0.352
 welf+0.345
Neg logit contributions
 IMAGES-0.428
iph-0.426
kefeller-0.377
bley-0.367
 STATES-0.361
Token the
Token ID262
Feature activation-1.78e+00

Pos logit contributions
��+0.962
�士+0.870
NT+0.793
+0.792
 welf+0.783
Neg logit contributions
iph-1.009
 IMAGES-1.008
kefeller-0.895
bley-0.873
 STATES-0.854
Token Chand
Token ID20631
Feature activation-1.10e+01

Pos logit contributions
��+1.427
�士+1.296
+1.198
NT+1.195
 welf+1.182
Neg logit contributions
iph-1.319
 IMAGES-1.304
kefeller-1.156
bley-1.128
pees-1.102
Tokenpai
Token ID49712
Feature activation-3.92e+00

Pos logit contributions
��+5.570
NT+4.597
+4.553
cephal+4.496
 Ll+4.443
Neg logit contributions
 IMAGES-9.877
iph-9.329
kefeller-9.095
 STATES-8.744
agus-8.508
Token region
Token ID3814
Feature activation-2.18e-01

Pos logit contributions
��+3.209
�士+2.917
+2.721
NT+2.718
 welf+2.674
Neg logit contributions
 IMAGES-2.776
iph-2.764
kefeller-2.475
bley-2.422
 STATES-2.349
Token's
Token ID338
Feature activation-4.25e+00

Pos logit contributions
��+0.174
�士+0.157
+0.143
NT+0.142
 welf+0.140
Neg logit contributions
 IMAGES-0.167
iph-0.166
kefeller-0.146
bley-0.143
 STATES-0.140
Token channels
Token ID9619
Feature activation-6.03e-01

Pos logit contributions
��+3.226
�士+2.911
NT+2.695
+2.693
 Ll+2.670
Neg logit contributions
iph-3.273
 IMAGES-3.257
kefeller-2.921
bley-2.835
pees-2.781
Token.
Token ID13
Feature activation+1.09e+00

Pos logit contributions
��+0.480
�士+0.435
+0.396
NT+0.394
 welf+0.387
Neg logit contributions
 IMAGES-0.463
iph-0.458
kefeller-0.405
bley-0.396
 STATES-0.388
Token ISI
Token ID22048
Feature activation+6.37e+00

Pos logit contributions
iph+0.458
 IMAGES+0.457
kefeller+0.399
bley+0.390
pees+0.379
Neg logit contributions
��-0.551
�士-0.503
NT-0.460
-0.460
 welf-0.449
Token
Token ID198
Feature activation+5.61e+00

Pos logit contributions
 IMAGES+4.680
iph+4.526
 STATES+3.925
kefeller+3.817
bley+3.774
Neg logit contributions
��-5.348
�士-5.141
 welf-4.556
��-4.531
-4.383
Token
Token ID198
Feature activation+7.57e+00

Pos logit contributions
 IMAGES+4.089
iph+4.021
 STATES+3.429
kefeller+3.378
bley+3.370
Neg logit contributions
��-4.761
�士-4.600
��-4.050
 welf-3.986
-3.923
TokenP
Token ID47
Feature activation+9.65e-02

Pos logit contributions
iph+5.587
 IMAGES+5.479
kefeller+4.808
bley+4.761
pees+4.721
Neg logit contributions
��-5.949
�士-5.613
��-4.981
 welf-4.980
-4.948
Tokenel
Token ID417
Feature activation-3.98e+00

Pos logit contributions
 IMAGES+0.073
iph+0.072
kefeller+0.063
bley+0.062
 STATES+0.061
Neg logit contributions
��-0.078
�士-0.071
NT-0.064
-0.064
 welf-0.063
Tokenleg
Token ID1455
Feature activation-1.08e+01

Pos logit contributions
��+1.942
�士+1.622
+1.441
NT+1.416
 Ll+1.356
Neg logit contributions
 IMAGES-4.251
iph-4.077
kefeller-3.749
 STATES-3.729
bley-3.666
Tokenr
Token ID81
Feature activation-4.78e+00

Pos logit contributions
��+5.067
NT+4.282
 Ll+4.226
RL+4.192
quila+4.093
Neg logit contributions
 IMAGES-10.435
iph-9.741
kefeller-9.387
 STATES-9.082
bley-8.875
Tokenini
Token ID5362
Feature activation+2.35e-01

Pos logit contributions
��+3.244
NT+2.849
�士+2.782
 Ll+2.725
+2.706
Neg logit contributions
 IMAGES-3.901
iph-3.773
kefeller-3.445
 STATES-3.346
bley-3.333
Token,
Token ID11
Feature activation-4.78e-01

Pos logit contributions
 IMAGES+0.190
iph+0.188
kefeller+0.169
bley+0.164
pees+0.161
Neg logit contributions
��-0.176
�士-0.157
NT-0.145
-0.144
 Ll-0.140
Token A
Token ID317
Feature activation-4.69e+00

Pos logit contributions
��+0.360
�士+0.323
NT+0.297
+0.296
 Ll+0.289
Neg logit contributions
 IMAGES-0.384
iph-0.383
kefeller-0.346
bley-0.335
pees-0.328
Token.
Token ID13
Feature activation-2.57e+00

Pos logit contributions
��+3.715
�士+3.387
+3.061
NT+3.056
��+3.003
Neg logit contributions
 IMAGES-3.593
iph-3.511
kefeller-3.114
bley-3.025
 STATES-3.008
Token example
Token ID1672
Feature activation+9.91e-01

Pos logit contributions
��+2.075
�士+1.861
+1.671
NT+1.666
 welf+1.664
Neg logit contributions
 IMAGES-2.437
iph-2.419
kefeller-2.172
bley-2.128
pees-2.081
Token,
Token ID11
Feature activation+1.21e+00

Pos logit contributions
 IMAGES+0.763
iph+0.758
kefeller+0.666
bley+0.653
 STATES+0.641
Neg logit contributions
��-0.787
�士-0.715
-0.648
NT-0.643
��-0.632
Token the
Token ID262
Feature activation+1.47e+00

Pos logit contributions
 IMAGES+0.823
iph+0.815
kefeller+0.706
bley+0.689
 STATES+0.673
Neg logit contributions
��-1.065
�士-0.979
-0.895
NT-0.890
 welf-0.879
Token Hotel
Token ID12696
Feature activation+7.43e-01

Pos logit contributions
 IMAGES+1.090
iph+1.082
kefeller+0.947
bley+0.927
 STATES+0.908
Neg logit contributions
��-1.215
�士-1.110
-1.008
NT-1.001
 welf-0.986
Token Vol
Token ID4709
Feature activation-5.63e+00

Pos logit contributions
 IMAGES+0.591
iph+0.586
kefeller+0.519
bley+0.509
 STATES+0.499
Neg logit contributions
��-0.570
�士-0.518
-0.466
NT-0.464
 welf-0.456
Tokencan
Token ID5171
Feature activation-1.07e+01

Pos logit contributions
��+4.503
�士+4.055
+3.817
NT+3.737
 Ll+3.661
Neg logit contributions
 IMAGES-4.239
iph-3.879
 STATES-3.515
kefeller-3.509
bley-3.442
Token on
Token ID319
Feature activation-2.02e+00

Pos logit contributions
��+8.424
�士+7.616
 Ll+7.033
+6.974
NT+6.946
Neg logit contributions
 IMAGES-7.501
iph-6.795
bley-6.408
kefeller-6.399
 STATES-6.260
Token Lan
Token ID14730
Feature activation-2.72e-01

Pos logit contributions
��+1.554
�士+1.407
+1.279
 Ll+1.266
NT+1.264
Neg logit contributions
 IMAGES-1.590
iph-1.559
kefeller-1.390
bley-1.359
pees-1.336
Tokenzar
Token ID41046
Feature activation-1.75e+00

Pos logit contributions
��+0.214
�士+0.194
+0.176
NT+0.176
 welf+0.171
Neg logit contributions
 IMAGES-0.211
iph-0.207
kefeller-0.184
bley-0.180
 STATES-0.178
Tokenote
Token ID1258
Feature activation+1.48e+00

Pos logit contributions
��+1.179
�士+1.055
+0.942
NT+0.935
 welf+0.910
Neg logit contributions
 IMAGES-1.546
iph-1.531
kefeller-1.377
bley-1.351
 STATES-1.336
Token costs
Token ID3484
Feature activation-1.13e+00

Pos logit contributions
 IMAGES+1.121
iph+1.113
kefeller+0.976
bley+0.957
 STATES+0.939
Neg logit contributions
��-1.193
�士-1.088
-0.985
NT-0.978
 welf-0.963
Token of
Token ID286
Feature activation-1.30e+00

Pos logit contributions
 IMAGES+0.588
iph+0.584
kefeller+0.514
bley+0.503
 STATES+0.494
Neg logit contributions
��-0.611
�士-0.557
-0.504
NT-0.500
 welf-0.492
Token what
Token ID644
Feature activation+5.19e-02

Pos logit contributions
��+0.970
�士+0.873
+0.783
NT+0.776
 welf+0.759
Neg logit contributions
 IMAGES-1.065
iph-1.064
kefeller-0.946
bley-0.934
 STATES-0.911
Token follows
Token ID5679
Feature activation-1.11e+00

Pos logit contributions
 IMAGES+0.040
iph+0.040
kefeller+0.036
bley+0.035
 STATES+0.034
Neg logit contributions
��-0.041
�士-0.037
-0.033
NT-0.033
 welf-0.033
Token.
Token ID13
Feature activation+1.43e+00

Pos logit contributions
��+0.859
�士+0.769
+0.709
NT+0.701
────+0.691
Neg logit contributions
 IMAGES-0.865
iph-0.862
kefeller-0.770
bley-0.751
 STATES-0.733
Token King
Token ID2677
Feature activation+3.66e-01

Pos logit contributions
 IMAGES+1.166
iph+1.159
kefeller+1.029
bley+1.009
 STATES+0.989
Neg logit contributions
��-1.074
�士-0.972
-0.873
NT-0.867
 welf-0.850
Token te
Token ID573
Feature activation-1.06e+01

Pos logit contributions
 IMAGES+0.273
iph+0.272
kefeller+0.240
bley+0.234
 STATES+0.228
Neg logit contributions
��-0.297
�士-0.271
-0.247
NT-0.245
 welf-0.240
Tokenases
Token ID1386
Feature activation-1.19e+00

Pos logit contributions
��+7.439
�士+6.626
quila+6.327
+6.308
NT+6.177
Neg logit contributions
 IMAGES-7.992
iph-7.903
kefeller-7.284
 STATES-6.836
bley-6.823
Token,
Token ID11
Feature activation+1.72e+00

Pos logit contributions
��+0.859
�士+0.769
+0.690
NT+0.685
��+0.670
Neg logit contributions
iph-0.998
 IMAGES-0.990
kefeller-0.894
bley-0.873
pees-0.857
Token however
Token ID2158
Feature activation+2.09e+00

Pos logit contributions
 IMAGES+1.404
iph+1.391
kefeller+1.232
bley+1.208
 STATES+1.190
Neg logit contributions
��-1.281
�士-1.162
-1.040
NT-1.034
 welf-1.018
Token,
Token ID11
Feature activation-1.12e+00

Pos logit contributions
 IMAGES+1.541
iph+1.525
kefeller+1.316
bley+1.308
 STATES+1.285
Neg logit contributions
��-1.712
�士-1.578
-1.418
NT-1.406
catentry-1.401
Token that
Token ID326
Feature activation-9.58e-01

Pos logit contributions
��+0.800
�士+0.716
+0.644
NT+0.639
��+0.622
Neg logit contributions
iph-0.942
 IMAGES-0.941
kefeller-0.846
bley-0.828
pees-0.813
Token
Token ID250
Feature activation-6.94e-01

Pos logit contributions
 IMAGES+0.243
iph+0.242
kefeller+0.219
bley+0.215
 STATES+0.212
Neg logit contributions
��-0.151
�士-0.133
-0.115
NT-0.114
 welf-0.112
TokenSimon
Token ID35475
Feature activation-1.30e+00

Pos logit contributions
��+0.568
�士+0.515
NT+0.473
+0.472
────+0.457
Neg logit contributions
 IMAGES-0.518
iph-0.510
kefeller-0.453
bley-0.438
 STATES-0.433
Token [
Token ID685
Feature activation+1.60e+00

Pos logit contributions
��+1.050
�士+0.947
+0.867
NT+0.855
 Ll+0.850
Neg logit contributions
 IMAGES-0.988
iph-0.972
kefeller-0.857
bley-0.836
 STATES-0.831
TokenM
Token ID44
Feature activation-1.19e+00

Pos logit contributions
iph+1.275
 IMAGES+1.275
kefeller+1.122
bley+1.103
 STATES+1.076
Neg logit contributions
��-1.227
�士-1.117
-0.998
NT-0.984
��-0.982
Tokenign
Token ID570
Feature activation-1.85e+00

Pos logit contributions
��+0.935
�士+0.849
NT+0.772
+0.771
��+0.748
Neg logit contributions
 IMAGES-0.925
iph-0.912
kefeller-0.804
bley-0.786
 STATES-0.779
Tokeno
Token ID78
Feature activation-1.06e+01

Pos logit contributions
��+1.355
�士+1.215
NT+1.134
+1.096
 welf+1.089
Neg logit contributions
 IMAGES-1.540
iph-1.468
kefeller-1.330
bley-1.301
 STATES-1.293
Tokenlet
Token ID1616
Feature activation-2.93e-01

Pos logit contributions
��+4.696
�士+3.768
 Ll+3.683
NT+3.589
HL+3.221
Neg logit contributions
 IMAGES-11.081
iph-10.489
 STATES-10.068
kefeller-10.051
bley-9.953
Token]
Token ID60
Feature activation-3.40e+00

Pos logit contributions
��+0.231
�士+0.210
+0.190
NT+0.188
 welf+0.185
Neg logit contributions
 IMAGES-0.228
iph-0.226
kefeller-0.199
bley-0.195
 STATES-0.192
Token came
Token ID1625
Feature activation+5.70e+00

Pos logit contributions
��+2.593
�士+2.335
+2.144
NT+2.136
 Ll+2.109
Neg logit contributions
 IMAGES-2.680
iph-2.634
kefeller-2.353
 STATES-2.270
bley-2.264
Token back
Token ID736
Feature activation+3.46e+00

Pos logit contributions
 IMAGES+3.917
iph+3.898
bley+3.368
kefeller+3.328
 STATES+3.209
Neg logit contributions
��-4.908
�士-4.582
-4.063
��-4.062
 welf-4.053
Token in
Token ID287
Feature activation+2.74e+00

Pos logit contributions
 IMAGES+2.548
iph+2.542
kefeller+2.193
bley+2.192
 STATES+2.118
Neg logit contributions
��-2.852
�士-2.634
-2.350
NT-2.344
��-2.317
Token bitters
Token ID48666
Feature activation-8.34e+00

Pos logit contributions
��+3.130
�士+2.840
NT+2.594
+2.588
────+2.522
Neg logit contributions
 IMAGES-3.133
iph-3.094
kefeller-2.794
bley-2.666
pees-2.657
Tokenweet
Token ID7277
Feature activation-1.23e+00

Pos logit contributions
��+5.071
NT+4.129
�士+4.072
+4.020
Associated+3.971
Neg logit contributions
 IMAGES-7.460
iph-6.957
kefeller-6.726
 STATES-6.451
bley-6.295
Token one
Token ID530
Feature activation-8.38e-01

Pos logit contributions
��+0.957
�士+0.866
+0.790
NT+0.786
 welf+0.773
Neg logit contributions
 IMAGES-0.959
iph-0.952
kefeller-0.847
bley-0.820
 STATES-0.809
Token.
Token ID13
Feature activation+4.18e-01

Pos logit contributions
��+0.651
�士+0.583
+0.535
NT+0.531
 welf+0.520
Neg logit contributions
 IMAGES-0.659
iph-0.651
kefeller-0.582
bley-0.565
 STATES-0.557
Token A
Token ID317
Feature activation-6.04e-01

Pos logit contributions
 IMAGES+0.342
iph+0.340
kefeller+0.304
bley+0.297
 STATES+0.291
Neg logit contributions
��-0.312
�士-0.282
-0.253
NT-0.252
 welf-0.245
Token bitters
Token ID48666
Feature activation-1.05e+01

Pos logit contributions
��+0.474
�士+0.431
+0.390
NT+0.389
 welf+0.381
Neg logit contributions
 IMAGES-0.469
iph-0.464
kefeller-0.413
bley-0.401
pees-0.395
Tokenweet
Token ID7277
Feature activation-2.62e+00

Pos logit contributions
��+5.531
NT+4.781
Associated+4.533
catentry+4.453
+4.432
Neg logit contributions
 IMAGES-9.699
iph-8.849
kefeller-8.712
 STATES-8.488
 Sachs-8.176
Token frenzy
Token ID31934
Feature activation-3.60e+00

Pos logit contributions
��+2.039
�士+1.834
+1.690
NT+1.683
 welf+1.642
Neg logit contributions
 IMAGES-2.040
iph-2.014
kefeller-1.812
bley-1.747
pees-1.731
Token?
Token ID30
Feature activation-1.60e+00

Pos logit contributions
��+2.655
�士+2.370
+2.171
NT+2.171
────+2.131
Neg logit contributions
 IMAGES-2.899
iph-2.877
kefeller-2.616
bley-2.496
 STATES-2.467
Token Sure
Token ID10889
Feature activation+3.22e+00

Pos logit contributions
��+1.174
�士+1.061
NT+0.968
+0.953
 Ll+0.935
Neg logit contributions
iph-1.310
 IMAGES-1.300
kefeller-1.192
bley-1.152
pees-1.125
Token,
Token ID11
Feature activation+2.70e+00

Pos logit contributions
 IMAGES+2.334
iph+2.316
bley+1.994
kefeller+1.983
agus+1.927
Neg logit contributions
��-2.681
�士-2.470
-2.215
NT-2.200
��-2.185
Token bench
Token ID7624
Feature activation-2.37e+00

Pos logit contributions
��+1.053
�士+0.943
+0.848
NT+0.848
 welf+0.827
Neg logit contributions
 IMAGES-1.212
iph-1.204
kefeller-1.085
bley-1.049
pees-1.038
Token Brian
Token ID8403
Feature activation-1.39e+00

Pos logit contributions
��+1.774
�士+1.597
+1.444
NT+1.442
 Ll+1.415
Neg logit contributions
 IMAGES-1.901
iph-1.891
kefeller-1.715
bley-1.648
pees-1.631
Token Al
Token ID978
Feature activation-8.32e-01

Pos logit contributions
��+1.085
�士+0.981
+0.891
NT+0.888
 Ll+0.869
Neg logit contributions
 IMAGES-1.086
iph-1.074
kefeller-0.955
bley-0.929
 STATES-0.915
Tokenain
Token ID391
Feature activation+2.22e-01

Pos logit contributions
��+0.649
�士+0.586
+0.538
NT+0.534
 welf+0.521
Neg logit contributions
 IMAGES-0.650
iph-0.628
kefeller-0.560
 STATES-0.546
bley-0.545
Tokenu
Token ID84
Feature activation-7.68e-02

Pos logit contributions
 IMAGES+0.171
iph+0.169
kefeller+0.151
bley+0.147
 STATES+0.144
Neg logit contributions
��-0.175
�士-0.160
-0.145
NT-0.144
 welf-0.141
Token'
Token ID6
Feature activation-1.05e+01

Pos logit contributions
��+0.061
�士+0.055
+0.050
NT+0.050
 welf+0.049
Neg logit contributions
 IMAGES-0.059
iph-0.059
kefeller-0.052
bley-0.051
 STATES-0.050
Tokenuese
Token ID20506
Feature activation-3.37e+00

Pos logit contributions
��+7.195
�士+6.626
+6.120
 Ll+6.048
NT+6.044
Neg logit contributions
 IMAGES-8.164
iph-7.768
kefeller-7.273
 STATES-6.881
bley-6.833
Token returns
Token ID5860
Feature activation-1.99e+00

Pos logit contributions
��+2.528
�士+2.310
+2.102
NT+2.087
 Ll+2.035
Neg logit contributions
 IMAGES-2.661
iph-2.624
kefeller-2.394
bley-2.288
 STATES-2.253
Token from
Token ID422
Feature activation-2.74e+00

Pos logit contributions
��+1.510
�士+1.365
+1.237
NT+1.223
 welf+1.205
Neg logit contributions
 IMAGES-1.592
iph-1.582
kefeller-1.421
bley-1.376
pees-1.353
Token suspension
Token ID11461
Feature activation-3.48e-01

Pos logit contributions
��+2.085
�士+1.900
NT+1.716
+1.705
 Ll+1.681
Neg logit contributions
 IMAGES-2.162
iph-2.120
kefeller-1.939
bley-1.859
pees-1.856
Token and
Token ID290
Feature activation-3.30e+00

Pos logit contributions
��+0.272
�士+0.247
+0.223
NT+0.222
 welf+0.219
Neg logit contributions
 IMAGES-0.271
iph-0.269
kefeller-0.239
bley-0.233
 STATES-0.229
Token try
Token ID1949
Feature activation-1.77e+00

Pos logit contributions
 IMAGES+1.785
iph+1.773
kefeller+1.561
bley+1.535
 STATES+1.508
Neg logit contributions
��-1.759
�士-1.603
-1.441
NT-1.425
��-1.407
Token and
Token ID290
Feature activation+1.75e+00

Pos logit contributions
��+1.375
�士+1.248
+1.126
NT+1.124
 welf+1.100
Neg logit contributions
 IMAGES-1.391
iph-1.372
kefeller-1.227
bley-1.195
 STATES-1.166
Token collect
Token ID2824
Feature activation+1.16e+00

Pos logit contributions
 IMAGES+1.362
iph+1.352
kefeller+1.192
bley+1.168
 STATES+1.147
Neg logit contributions
��-1.375
�士-1.251
-1.129
NT-1.121
 welf-1.103
Token the
Token ID262
Feature activation+7.11e-01

Pos logit contributions
 IMAGES+0.927
iph+0.920
kefeller+0.814
bley+0.798
 STATES+0.784
Neg logit contributions
��-0.888
�士-0.806
-0.725
NT-0.719
��-0.707
Token Cur
Token ID4424
Feature activation+1.62e+00

Pos logit contributions
 IMAGES+0.568
iph+0.564
kefeller+0.499
bley+0.489
 STATES+0.480
Neg logit contributions
��-0.545
�士-0.494
-0.445
NT-0.442
 welf-0.435
Tokenric
Token ID1173
Feature activation-1.03e+01

Pos logit contributions
 IMAGES+1.207
iph+1.200
kefeller+1.050
bley+1.028
 STATES+1.006
Neg logit contributions
��-1.330
�士-1.215
-1.101
NT-1.093
 welf-1.079
Tokenulum
Token ID14452
Feature activation-4.14e+00

Pos logit contributions
��+4.054
ulum+3.221
�士+3.172
+2.813
NT+2.809
Neg logit contributions
 IMAGES-11.573
iph-11.023
bley-10.504
kefeller-10.439
 STATES-10.214
Token Vit
Token ID18271
Feature activation-8.30e-01

Pos logit contributions
��+3.157
�士+2.811
NT+2.606
+2.596
 Ll+2.514
Neg logit contributions
 IMAGES-3.268
iph-3.226
kefeller-2.899
bley-2.847
 STATES-2.788
Tokenae
Token ID3609
Feature activation-1.05e+00

Pos logit contributions
��+0.783
�士+0.719
+0.670
NT+0.667
 welf+0.653
Neg logit contributions
 IMAGES-0.511
iph-0.502
kefeller-0.439
bley-0.426
 STATES-0.412
Token of
Token ID286
Feature activation+1.16e+00

Pos logit contributions
��+0.820
�士+0.736
+0.673
NT+0.671
 welf+0.657
Neg logit contributions
 IMAGES-0.827
iph-0.820
kefeller-0.733
bley-0.713
 STATES-0.697
Token everyone
Token ID2506
Feature activation+4.60e+00

Pos logit contributions
 IMAGES+0.920
iph+0.913
kefeller+0.808
bley+0.792
 STATES+0.777
Neg logit contributions
��-0.891
�士-0.809
-0.729
NT-0.724
 welf-0.711
Token than
Token ID621
Feature activation-1.39e+00

Pos logit contributions
 IMAGES+0.635
iph+0.631
kefeller+0.550
bley+0.537
 STATES+0.528
Neg logit contributions
��-0.760
�士-0.696
-0.632
NT-0.630
��-0.620
Token current
Token ID1459
Feature activation-2.45e+00

Pos logit contributions
��+1.008
�士+0.917
+0.825
NT+0.818
 welf+0.795
Neg logit contributions
 IMAGES-1.162
iph-1.147
kefeller-1.031
bley-1.003
pees-0.983
Token Rapt
Token ID21301
Feature activation+1.47e+00

Pos logit contributions
��+1.734
�士+1.571
+1.423
NT+1.419
 welf+1.382
Neg logit contributions
 IMAGES-2.053
iph-2.027
kefeller-1.838
bley-1.776
pees-1.752
Tokenor
Token ID273
Feature activation-3.60e+00

Pos logit contributions
 IMAGES+1.121
iph+1.119
kefeller+0.983
bley+0.962
 STATES+0.943
Neg logit contributions
��-1.167
�士-1.063
-0.957
NT-0.952
 welf-0.942
Token's
Token ID338
Feature activation-6.45e+00

Pos logit contributions
��+2.491
�士+2.228
+2.045
NT+2.038
 Ll+1.961
Neg logit contributions
 IMAGES-3.056
iph-3.038
kefeller-2.758
bley-2.665
agus-2.652
Token 411
Token ID43184
Feature activation-1.03e+01

Pos logit contributions
��+4.480
�士+4.058
+3.764
NT+3.715
────+3.582
Neg logit contributions
 IMAGES-5.316
iph-5.240
kefeller-4.769
agus-4.583
bley-4.525
Token-
Token ID12
Feature activation-7.76e+00

Pos logit contributions
��+6.529
�士+5.951
NT+5.625
+5.615
��+5.319
Neg logit contributions
 IMAGES-8.751
iph-8.583
kefeller-8.076
agus-7.715
ydia-7.623
Tokenhorse
Token ID30527
Feature activation-3.68e+00

Pos logit contributions
��+5.080
�士+4.728
NT+4.351
+4.323
Associated+4.046
Neg logit contributions
 IMAGES-6.640
iph-6.343
kefeller-6.095
ydia-5.696
agus-5.691
Tokenpower
Token ID6477
Feature activation-4.16e+00

Pos logit contributions
��+2.203
�士+1.856
+1.778
NT+1.685
catentry+1.613
Neg logit contributions
 IMAGES-3.514
iph-3.469
kefeller-3.210
bley-3.071
 STATES-3.051
Token,
Token ID11
Feature activation-8.18e-01

Pos logit contributions
��+3.008
�士+2.739
+2.504
NT+2.463
 Ll+2.386
Neg logit contributions
 IMAGES-3.415
iph-3.376
kefeller-3.079
pees-2.924
bley-2.919
Token 4
Token ID604
Feature activation-1.65e+00

Pos logit contributions
��+0.631
�士+0.578
+0.521
NT+0.518
 welf+0.506
Neg logit contributions
 IMAGES-0.646
iph-0.638
kefeller-0.566
bley-0.552
pees-0.542
Token):
Token ID2599
Feature activation-3.94e-01

Pos logit contributions
 IMAGES+2.353
iph+2.327
kefeller+1.982
bley+1.950
 STATES+1.935
Neg logit contributions
��-2.986
�士-2.754
-2.503
NT-2.472
��-2.465
Token
Token ID198
Feature activation+1.01e+00

Pos logit contributions
��+0.292
�士+0.264
+0.237
NT+0.236
 welf+0.231
Neg logit contributions
 IMAGES-0.323
iph-0.321
kefeller-0.286
bley-0.280
 STATES-0.274
Token
Token ID198
Feature activation-3.29e+00

Pos logit contributions
 IMAGES+0.837
iph+0.831
kefeller+0.741
bley+0.726
 STATES+0.711
Neg logit contributions
��-0.747
�士-0.672
-0.606
NT-0.604
 welf-0.588
Token_
Token ID62
Feature activation-2.15e+00

Pos logit contributions
��+2.438
�士+2.186
NT+2.019
+1.992
────+1.922
Neg logit contributions
 IMAGES-2.680
iph-2.613
kefeller-2.413
bley-2.304
 STATES-2.265
TokenP
Token ID47
Feature activation-7.09e+00

Pos logit contributions
��+1.547
�士+1.389
NT+1.256
+1.256
catentry+1.212
Neg logit contributions
 IMAGES-1.806
iph-1.787
kefeller-1.614
bley-1.571
 STATES-1.537
Tokenulse
Token ID9615
Feature activation-1.03e+01

Pos logit contributions
��+5.353
NT+4.853
HL+4.541
catentry+4.495
ulum+4.484
Neg logit contributions
 IMAGES-5.291
iph-4.937
kefeller-4.820
bley-4.472
 STATES-4.289
TokenAmount
Token ID31264
Feature activation-5.93e+00

Pos logit contributions
��+5.671
�士+4.780
────+4.583
NT+4.540
+4.462
Neg logit contributions
 IMAGES-9.731
iph-9.358
kefeller-8.830
bley-8.460
 STATES-8.287
Token ("
Token ID5855
Feature activation-4.07e+00

Pos logit contributions
��+4.374
�士+3.876
NT+3.637
+3.589
────+3.510
Neg logit contributions
iph-4.583
 IMAGES-4.538
kefeller-4.280
bley-4.132
pees-3.987
TokenP
Token ID47
Feature activation-4.06e+00

Pos logit contributions
��+3.122
�士+2.820
NT+2.626
+2.596
catentry+2.531
Neg logit contributions
 IMAGES-3.172
iph-3.054
kefeller-2.814
bley-2.690
 STATES-2.636
Tokenulse
Token ID9615
Feature activation-2.75e+00

Pos logit contributions
��+3.472
�士+3.133
NT+3.015
+2.931
Associated+2.842
Neg logit contributions
 IMAGES-2.794
iph-2.676
kefeller-2.432
bley-2.317
 STATES-2.265
Token amount
Token ID2033
Feature activation-5.69e-01

Pos logit contributions
��+1.998
�士+1.782
+1.631
NT+1.618
────+1.577
Neg logit contributions
 IMAGES-2.297
iph-2.256
kefeller-2.045
bley-1.973
 STATES-1.932
TokenEmail
Token ID15333
Feature activation-3.28e+00

Pos logit contributions
 IMAGES+2.358
iph+2.342
kefeller+2.107
bley+2.076
 STATES+2.046
Neg logit contributions
��-1.487
�士-1.330
-1.140
��-1.116
NT-1.116
Token This
Token ID770
Feature activation+1.65e+00

Pos logit contributions
��+2.177
�士+1.863
NT+1.737
+1.706
 welf+1.631
Neg logit contributions
iph-2.914
 IMAGES-2.908
kefeller-2.678
bley-2.612
agus-2.518
Token Story
Token ID8362
Feature activation+4.32e+00

Pos logit contributions
 IMAGES+1.103
iph+1.084
kefeller+0.924
bley+0.900
 STATES+0.883
Neg logit contributions
��-1.484
�士-1.377
-1.257
NT-1.241
��-1.240
Token Send
Token ID16290
Feature activation+1.48e+00

Pos logit contributions
 IMAGES+3.273
iph+3.213
kefeller+2.793
bley+2.775
 STATES+2.740
Neg logit contributions
��-3.502
�士-3.220
-2.913
��-2.893
 welf-2.857
Token email
Token ID3053
Feature activation+3.33e+00

Pos logit contributions
 IMAGES+1.285
iph+1.274
kefeller+1.138
bley+1.118
 STATES+1.103
Neg logit contributions
��-1.033
�士-0.932
-0.824
NT-0.818
 welf-0.805
Token to
Token ID284
Feature activation+1.16e+01

Pos logit contributions
 IMAGES+2.375
iph+2.312
kefeller+2.022
 STATES+1.982
bley+1.968
Neg logit contributions
��-2.839
�士-2.667
-2.387
NT-2.371
��-2.355
Token this
Token ID428
Feature activation+7.56e+00

Pos logit contributions
 IMAGES+6.902
iph+6.532
agus+5.754
 STATES+5.747
pees+5.543
Neg logit contributions
��-10.227
�士-10.092
��-9.822
-8.921
ADRA-8.829
Token address
Token ID2209
Feature activation+6.99e+00

Pos logit contributions
 IMAGES+3.578
iph+3.347
 STATES+2.709
bley+2.548
 rotation+2.531
Neg logit contributions
��-8.237
�士-7.904
��-7.301
-7.030
 welf-7.015
Token Enter
Token ID6062
Feature activation-1.16e+00

Pos logit contributions
 IMAGES+4.634
iph+4.592
 STATES+3.909
bley+3.738
pees+3.732
Neg logit contributions
��-6.237
�士-6.004
��-5.364
-5.182
 welf-5.165
Token Your
Token ID3406
Feature activation+2.69e+00

Pos logit contributions
��+0.667
�士+0.545
NT+0.523
+0.509
Associated+0.499
Neg logit contributions
 IMAGES-1.145
iph-1.138
kefeller-1.052
bley-1.018
agus-1.009
Token Name
Token ID6530
Feature activation+2.19e+00

Pos logit contributions
 IMAGES+2.509
iph+2.494
kefeller+2.246
bley+2.211
 STATES+2.178
Neg logit contributions
��-1.698
�士-1.508
-1.319
NT-1.308
 welf-1.281
Token Let
Token ID3914
Feature activation+3.34e+00

Pos logit contributions
 IMAGES+4.919
iph+4.689
 STATES+4.060
bley+3.948
kefeller+3.942
Neg logit contributions
��-4.887
�士-4.480
��-4.057
 welf-3.980
-3.925
Token us
Token ID514
Feature activation+6.93e+00

Pos logit contributions
 IMAGES+2.304
iph+2.274
kefeller+1.915
bley+1.911
 STATES+1.878
Neg logit contributions
��-2.922
�士-2.683
 welf-2.445
-2.420
NT-2.417
Token know
Token ID760
Feature activation+7.84e+00

Pos logit contributions
 IMAGES+4.887
iph+4.858
bley+4.191
 STATES+4.185
kefeller+4.096
Neg logit contributions
��-5.934
�士-5.424
��-4.874
-4.867
 welf-4.767
Token in
Token ID287
Feature activation+7.52e+00

Pos logit contributions
 IMAGES+5.378
iph+5.299
 STATES+4.430
bley+4.378
agus+4.324
Neg logit contributions
��-6.784
�士-6.308
-5.684
��-5.639
catentry-5.619
Token the
Token ID262
Feature activation+4.27e+00

Pos logit contributions
 IMAGES+5.208
iph+5.071
 STATES+4.160
bley+4.133
kefeller+4.101
Neg logit contributions
��-6.550
�士-6.050
��-5.437
catentry-5.408
NT-5.407
Token comments
Token ID3651
Feature activation+1.15e+01

Pos logit contributions
 IMAGES+2.697
iph+2.642
kefeller+2.184
 STATES+2.178
bley+2.174
Neg logit contributions
��-4.009
�士-3.700
-3.381
��-3.365
NT-3.340
Token below
Token ID2174
Feature activation+4.86e+00

Pos logit contributions
iph+6.543
 IMAGES+6.367
 STATES+5.233
agus+5.117
bley+5.098
Neg logit contributions
��-10.733
�士-10.011
-9.144
 welf-8.971
��-8.920
Token.
Token ID13
Feature activation+3.09e+00

Pos logit contributions
iph+3.574
 IMAGES+3.565
bley+3.014
kefeller+2.995
 STATES+2.992
Neg logit contributions
��-4.011
�士-3.689
-3.301
 welf-3.256
NT-3.255
Token<|endoftext|>
Token ID50256
Feature activation+7.22e-01

Pos logit contributions
 IMAGES+2.349
iph+2.295
kefeller+1.975
bley+1.943
 STATES+1.936
Neg logit contributions
��-2.529
�士-2.322
��-2.081
-2.069
 welf-2.063
TokenFrom
Token ID4863
Feature activation-5.68e+00

Pos logit contributions
 IMAGES+0.566
iph+0.562
kefeller+0.496
bley+0.486
 STATES+0.477
Neg logit contributions
��-0.563
�士-0.511
-0.461
NT-0.459
 welf-0.450
Token John
Token ID1757
Feature activation-4.41e+00

Pos logit contributions
��+4.370
�士+3.893
NT+3.613
+3.582
 Ll+3.556
Neg logit contributions
 IMAGES-4.308
iph-4.272
kefeller-3.866
bley-3.792
pees-3.745
Tokenin
Token ID259
Feature activation+2.26e+00

Pos logit contributions
��+0.122
�士+0.109
NT+0.101
+0.101
 Ll+0.097
Neg logit contributions
 IMAGES-0.125
iph-0.121
kefeller-0.109
bley-0.105
 STATES-0.104
Token.
Token ID13
Feature activation-8.31e-02

Pos logit contributions
iph+1.619
 IMAGES+1.618
kefeller+1.408
bley+1.383
 STATES+1.347
Neg logit contributions
��-1.894
�士-1.732
-1.573
NT-1.568
 welf-1.551
Token
Token ID198
Feature activation+1.18e+00

Pos logit contributions
��+0.062
�士+0.056
+0.050
NT+0.050
 Ll+0.049
Neg logit contributions
iph-0.068
 IMAGES-0.068
kefeller-0.061
bley-0.059
pees-0.058
Token
Token ID198
Feature activation+1.90e+00

Pos logit contributions
 IMAGES+0.973
iph+0.966
kefeller+0.859
bley+0.843
 STATES+0.827
Neg logit contributions
��-0.876
�士-0.790
-0.710
NT-0.707
 welf-0.691
TokenS
Token ID50
Feature activation+5.42e+00

Pos logit contributions
 IMAGES+1.472
iph+1.462
kefeller+1.286
bley+1.262
 STATES+1.238
Neg logit contributions
��-1.494
�士-1.360
-1.227
NT-1.217
 welf-1.201
Tokenpar
Token ID1845
Feature activation+1.50e+01

Pos logit contributions
iph+3.116
 IMAGES+3.040
kefeller+2.548
bley+2.496
agus+2.442
Neg logit contributions
��-5.390
�士-5.032
-4.602
��-4.599
catentry-4.553
Tokenks
Token ID591
Feature activation-6.78e-01

Pos logit contributions
 IMAGES+9.105
iph+9.064
agus+8.194
bley+8.029
kes+7.891
Neg logit contributions
��-13.744
�士-13.724
��-12.692
 welf-12.029
-11.714
Token,
Token ID11
Feature activation+1.72e+00

Pos logit contributions
��+0.518
�士+0.469
+0.426
NT+0.420
 Ll+0.413
Neg logit contributions
 IMAGES-0.542
iph-0.534
kefeller-0.479
bley-0.463
pees-0.454
Token a
Token ID257
Feature activation+2.55e+00

Pos logit contributions
 IMAGES+1.388
iph+1.378
kefeller+1.222
bley+1.198
 STATES+1.175
Neg logit contributions
��-1.298
�士-1.175
-1.057
NT-1.049
 welf-1.031
Token Cohn
Token ID45005
Feature activation-7.97e-01

Pos logit contributions
 IMAGES+1.983
iph+1.970
kefeller+1.737
bley+1.701
 STATES+1.669
Neg logit contributions
��-2.011
�士-1.830
-1.651
NT-1.642
��-1.612
Token prot
Token ID1237
Feature activation+1.11e-02

Pos logit contributions
��+0.633
�士+0.580
+0.524
NT+0.523
 welf+0.515
Neg logit contributions
 IMAGES-0.616
iph-0.601
kefeller-0.529
bley-0.522
pees-0.515
Tokente
Token ID660
Feature activation+2.07e+00

Pos logit contributions
iph+2.421
 IMAGES+2.383
bley+1.903
 STATES+1.878
kefeller+1.869
Neg logit contributions
��-4.277
�士-4.028
 welf-3.707
-3.678
catentry-3.642
Tokenlegraph
Token ID16606
Feature activation+5.42e+00

Pos logit contributions
 IMAGES+1.822
iph+1.819
bley+1.600
kefeller+1.583
pees+1.574
Neg logit contributions
��-1.421
�士-1.294
-1.122
NT-1.117
 welf-1.116
Token.
Token ID13
Feature activation+8.41e-01

Pos logit contributions
iph+3.580
 IMAGES+3.574
kefeller+2.979
 STATES+2.931
bley+2.929
Neg logit contributions
��-4.957
�士-4.755
 welf-4.256
��-4.192
-4.184
Tokenco
Token ID1073
Feature activation-3.18e+00

Pos logit contributions
 IMAGES+0.726
iph+0.715
kefeller+0.645
bley+0.633
 STATES+0.622
Neg logit contributions
��-0.587
�士-0.522
-0.470
NT-0.469
catentry-0.452
Token.
Token ID13
Feature activation+6.30e-01

Pos logit contributions
��+2.219
NT+1.910
�士+1.865
Associated+1.830
 Ll+1.821
Neg logit contributions
 IMAGES-2.643
iph-2.575
kefeller-2.356
bley-2.288
pees-2.235
Tokenuk
Token ID2724
Feature activation+1.12e+01

Pos logit contributions
 IMAGES+0.398
iph+0.394
kefeller+0.339
bley+0.328
 STATES+0.320
Neg logit contributions
��-0.585
�士-0.537
NT-0.498
-0.497
 welf-0.486
Token/
Token ID14
Feature activation+3.57e+00

Pos logit contributions
 IMAGES+5.446
iph+5.210
 STATES+4.222
bley+3.816
pees+3.779
Neg logit contributions
��-11.768
�士-11.602
 welf-10.656
��-10.462
-10.197
Tokenbusiness
Token ID22680
Feature activation+3.74e+00

Pos logit contributions
iph+2.629
 IMAGES+2.617
kefeller+2.262
bley+2.210
 STATES+2.179
Neg logit contributions
��-2.941
�士-2.720
-2.452
��-2.436
 welf-2.413
Token/
Token ID14
Feature activation+1.84e+00

Pos logit contributions
 IMAGES+2.350
iph+2.328
kefeller+1.963
bley+1.932
 STATES+1.880
Neg logit contributions
��-3.501
�士-3.257
-2.973
NT-2.946
 welf-2.945
Tokenanalysis
Token ID20930
Feature activation+3.55e+00

Pos logit contributions
iph+1.504
 IMAGES+1.500
kefeller+1.323
bley+1.302
 STATES+1.277
Neg logit contributions
��-1.372
�士-1.247
-1.112
NT-1.099
��-1.095
Token/
Token ID14
Feature activation+1.08e+00

Pos logit contributions
 IMAGES+2.234
iph+2.219
kefeller+1.860
bley+1.805
 STATES+1.788
Neg logit contributions
��-3.297
�士-3.055
-2.820
catentry-2.792
NT-2.778
Token Let
Token ID3914
Feature activation+4.77e+00

Pos logit contributions
 IMAGES+2.301
iph+2.274
kefeller+1.973
bley+1.949
 STATES+1.930
Neg logit contributions
��-2.231
�士-2.030
-1.805
��-1.796
 welf-1.789
Token us
Token ID514
Feature activation+6.65e+00

Pos logit contributions
 IMAGES+3.189
iph+3.181
bley+2.634
kefeller+2.634
 STATES+2.593
Neg logit contributions
��-4.206
�士-3.878
 welf-3.548
NT-3.498
-3.493
Token know
Token ID760
Feature activation+8.59e+00

Pos logit contributions
 IMAGES+4.676
iph+4.672
bley+4.013
 STATES+3.980
kefeller+3.950
Neg logit contributions
��-5.662
�士-5.187
-4.668
��-4.617
 welf-4.540
Token in
Token ID287
Feature activation+7.49e+00

Pos logit contributions
 IMAGES+5.788
iph+5.730
 STATES+4.724
bley+4.666
agus+4.603
Neg logit contributions
��-7.429
�士-6.984
-6.271
��-6.187
catentry-6.181
Token the
Token ID262
Feature activation+5.55e+00

Pos logit contributions
 IMAGES+5.213
iph+5.127
 STATES+4.203
bley+4.193
kefeller+4.168
Neg logit contributions
��-6.450
�士-5.977
��-5.345
NT-5.332
-5.329
Token comments
Token ID3651
Feature activation+1.28e+01

Pos logit contributions
 IMAGES+3.460
iph+3.415
 STATES+2.796
kefeller+2.783
bley+2.773
Neg logit contributions
��-5.228
�士-4.835
-4.417
��-4.406
NT-4.365
Token below
Token ID2174
Feature activation+7.61e+00

Pos logit contributions
iph+6.782
 IMAGES+6.567
 STATES+5.238
bley+5.125
agus+5.111
Neg logit contributions
��-12.148
�士-11.440
-10.501
 welf-10.212
��-10.184
Token….
Token ID11580
Feature activation+1.27e+00

Pos logit contributions
iph+5.140
 IMAGES+5.095
bley+4.194
 STATES+4.191
agus+4.139
Neg logit contributions
��-6.578
�士-6.157
-5.483
 welf-5.438
��-5.418
Token<|endoftext|>
Token ID50256
Feature activation+4.58e-01

Pos logit contributions
 IMAGES+0.926
iph+0.923
kefeller+0.797
bley+0.784
 STATES+0.767
Neg logit contributions
��-1.061
�士-0.973
-0.878
NT-0.871
 welf-0.865
TokenD
Token ID35
Feature activation-1.78e+00

Pos logit contributions
 IMAGES+0.360
iph+0.356
kefeller+0.315
bley+0.309
 STATES+0.303
Neg logit contributions
��-0.357
�士-0.324
-0.293
NT-0.292
 welf-0.285
Tokenish
Token ID680
Feature activation+6.83e-01

Pos logit contributions
��+1.386
�士+1.250
NT+1.156
+1.137
 welf+1.105
Neg logit contributions
 IMAGES-1.395
iph-1.362
kefeller-1.221
bley-1.184
 STATES-1.178
Token I
Token ID314
Feature activation+3.28e+00

Pos logit contributions
��+0.067
�士+0.061
NT+0.055
+0.055
 Ll+0.054
Neg logit contributions
 IMAGES-0.072
iph-0.072
kefeller-0.064
bley-0.062
pees-0.062
Token hope
Token ID2911
Feature activation+8.23e-01

Pos logit contributions
 IMAGES+2.600
iph+2.585
kefeller+2.276
bley+2.243
 STATES+2.200
Neg logit contributions
��-2.538
�士-2.306
-2.072
NT-2.042
��-2.030
Token he
Token ID339
Feature activation-1.84e+00

Pos logit contributions
 IMAGES+0.659
iph+0.656
kefeller+0.583
bley+0.568
pees+0.558
Neg logit contributions
��-0.627
�士-0.569
NT-0.512
-0.511
 welf-0.500
Token does
Token ID857
Feature activation-3.00e+00

Pos logit contributions
��+1.416
�士+1.284
NT+1.182
+1.171
 Ll+1.159
Neg logit contributions
 IMAGES-1.423
iph-1.422
kefeller-1.272
bley-1.218
agus-1.197
Token.
Token ID13
Feature activation-9.52e-01

Pos logit contributions
��+2.194
�士+1.947
NT+1.839
+1.787
 Ll+1.768
Neg logit contributions
 IMAGES-2.434
iph-2.424
kefeller-2.212
bley-2.102
agus-2.074
Token I
Token ID314
Feature activation+4.62e+00

Pos logit contributions
��+0.680
�士+0.616
NT+0.557
+0.550
 Ll+0.545
Neg logit contributions
iph-0.797
 IMAGES-0.788
kefeller-0.728
bley-0.705
pees-0.691
Token'm
Token ID1101
Feature activation+1.70e+00

Pos logit contributions
 IMAGES+3.467
iph+3.454
kefeller+3.006
bley+2.991
 STATES+2.914
Neg logit contributions
��-3.754
�士-3.441
-3.095
��-3.053
 welf-3.046
Token just
Token ID655
Feature activation+1.29e+00

Pos logit contributions
 IMAGES+1.304
iph+1.294
kefeller+1.139
bley+1.116
 STATES+1.094
Neg logit contributions
��-1.353
�士-1.231
-1.114
NT-1.107
 welf-1.089
Token not
Token ID407
Feature activation+6.74e-01

Pos logit contributions
 IMAGES+0.981
iph+0.973
kefeller+0.857
bley+0.838
 STATES+0.822
Neg logit contributions
��-1.030
�士-0.938
-0.849
NT-0.844
 welf-0.830
Token optimistic
Token ID16915
Feature activation+1.47e+00

Pos logit contributions
 IMAGES+0.492
iph+0.487
kefeller+0.429
bley+0.416
 STATES+0.407
Neg logit contributions
��-0.562
�士-0.513
-0.468
NT-0.466
 welf-0.458
Token given
Token ID1813
Feature activation+9.05e-01

Pos logit contributions
 IMAGES+1.210
iph+1.206
kefeller+1.075
bley+1.049
 STATES+1.028
Neg logit contributions
��-1.089
�士-0.983
-0.881
NT-0.878
 welf-0.860
Token *
Token ID1635
Feature activation+2.80e+00

Pos logit contributions
 IMAGES+0.794
iph+0.788
kefeller+0.697
bley+0.683
 STATES+0.670
Neg logit contributions
��-0.775
�士-0.703
-0.634
NT-0.630
 welf-0.620
Tokenlot
Token ID26487
Feature activation-2.64e+00

Pos logit contributions
 IMAGES+2.111
iph+2.109
kefeller+1.840
bley+1.809
 STATES+1.768
Neg logit contributions
��-2.253
�士-2.059
-1.853
NT-1.830
��-1.821
Token*
Token ID9
Feature activation+3.72e-01

Pos logit contributions
��+2.226
�士+2.030
+1.865
NT+1.856
 welf+1.819
Neg logit contributions
 IMAGES-1.897
iph-1.867
kefeller-1.646
bley-1.592
 STATES-1.568
Token more
Token ID517
Feature activation+2.57e+00

Pos logit contributions
 IMAGES+0.305
iph+0.303
kefeller+0.269
bley+0.264
 STATES+0.259
Neg logit contributions
��-0.277
�士-0.249
-0.224
NT-0.223
 welf-0.219
Token populism
Token ID47584
Feature activation-3.68e+00

Pos logit contributions
 IMAGES+1.942
iph+1.926
kefeller+1.691
bley+1.654
 STATES+1.628
Neg logit contributions
��-2.074
�士-1.897
-1.712
NT-1.699
��-1.676
Token -
Token ID532
Feature activation+5.26e+00

Pos logit contributions
��+2.778
�士+2.470
+2.292
NT+2.287
 welf+2.226
Neg logit contributions
 IMAGES-2.942
iph-2.886
kefeller-2.598
bley-2.527
 STATES-2.454
Token and
Token ID290
Feature activation+4.13e+00

Pos logit contributions
 IMAGES+3.963
iph+3.957
kefeller+3.446
bley+3.385
 STATES+3.337
Neg logit contributions
��-4.145
�士-3.817
-3.396
��-3.370
NT-3.360
Token not
Token ID407
Feature activation-6.15e-01

Pos logit contributions
 IMAGES+3.180
iph+3.139
kefeller+2.760
bley+2.713
 STATES+2.688
Neg logit contributions
��-3.237
�士-2.977
-2.663
��-2.624
NT-2.614
Token just
Token ID655
Feature activation-4.13e-01

Pos logit contributions
��+0.476
�士+0.428
+0.389
NT+0.389
 welf+0.386
Neg logit contributions
 IMAGES-0.483
iph-0.483
kefeller-0.427
bley-0.418
 STATES-0.408
Token during
Token ID1141
Feature activation+2.60e+00

Pos logit contributions
��+0.315
�士+0.282
NT+0.257
+0.256
 welf+0.254
Neg logit contributions
 IMAGES-0.330
iph-0.329
kefeller-0.292
bley-0.286
agus-0.279
Token election
Token ID3071
Feature activation-2.77e-01

Pos logit contributions
 IMAGES+1.963
iph+1.955
kefeller+1.726
bley+1.683
 STATES+1.660
Neg logit contributions
��-2.087
�士-1.906
-1.714
NT-1.706
��-1.686
Token solve
Token ID8494
Feature activation-1.54e-01

Pos logit contributions
��+2.257
�士+2.037
NT+1.911
+1.876
 welf+1.844
Neg logit contributions
iph-2.325
 IMAGES-2.307
kefeller-2.082
bley-2.009
pees-1.959
Token the
Token ID262
Feature activation-2.29e+00

Pos logit contributions
��+0.116
�士+0.105
+0.095
NT+0.094
 welf+0.093
Neg logit contributions
 IMAGES-0.125
iph-0.124
kefeller-0.110
bley-0.108
 STATES-0.106
Token Pittsburgh
Token ID12767
Feature activation-1.65e+00

Pos logit contributions
��+1.729
�士+1.556
+1.413
NT+1.412
 welf+1.399
Neg logit contributions
 IMAGES-1.827
iph-1.812
kefeller-1.625
bley-1.591
pees-1.570
Token Penguins
Token ID29755
Feature activation-6.22e-01

Pos logit contributions
��+0.890
�士+0.767
NT+0.663
+0.661
 welf+0.636
Neg logit contributions
 IMAGES-1.689
iph-1.678
kefeller-1.536
bley-1.510
pees-1.495
Token expansion
Token ID7118
Feature activation-1.56e-01

Pos logit contributions
��+0.484
�士+0.437
NT+0.397
+0.397
 welf+0.388
Neg logit contributions
 IMAGES-0.489
iph-0.484
kefeller-0.430
bley-0.421
pees-0.413
Token draft
Token ID4538
Feature activation+7.15e+00

Pos logit contributions
��+0.117
�士+0.106
+0.095
NT+0.095
 welf+0.093
Neg logit contributions
 IMAGES-0.127
iph-0.126
kefeller-0.112
bley-0.109
 STATES-0.107
Token problem
Token ID1917
Feature activation+1.79e+00

Pos logit contributions
 IMAGES+5.134
iph+5.107
kefeller+4.446
bley+4.339
 STATES+4.306
Neg logit contributions
��-5.868
�士-5.521
��-4.840
-4.802
catentry-4.709
Token?
Token ID30
Feature activation-3.52e+00

Pos logit contributions
 IMAGES+1.360
iph+1.350
kefeller+1.184
bley+1.163
 STATES+1.140
Neg logit contributions
��-1.436
�士-1.312
-1.183
NT-1.175
 welf-1.158
Token If
Token ID1002
Feature activation-1.04e+00

Pos logit contributions
��+2.438
�士+2.193
NT+2.026
+1.960
 Ll+1.932
Neg logit contributions
iph-2.984
 IMAGES-2.907
kefeller-2.736
bley-2.657
pees-2.607
Token you
Token ID345
Feature activation-1.56e+00

Pos logit contributions
��+0.744
�士+0.670
NT+0.601
+0.599
 welf+0.585
Neg logit contributions
 IMAGES-0.875
iph-0.875
kefeller-0.783
bley-0.763
pees-0.753
Token believe
Token ID1975
Feature activation-4.04e-01

Pos logit contributions
��+1.171
�士+1.070
NT+0.984
+0.965
 welf+0.941
Neg logit contributions
 IMAGES-1.237
iph-1.236
kefeller-1.103
bley-1.058
pees-1.052
Token past
Token ID1613
Feature activation-2.62e+00

Pos logit contributions
��+0.253
�士+0.232
+0.213
NT+0.211
 welf+0.209
Neg logit contributions
 IMAGES-0.196
iph-0.193
kefeller-0.168
bley-0.164
pees-0.160
Token week
Token ID1285
Feature activation-1.62e+00

Pos logit contributions
��+2.217
�士+2.018
+1.867
NT+1.845
 welf+1.821
Neg logit contributions
 IMAGES-1.859
iph-1.834
kefeller-1.627
bley-1.568
pees-1.554
Token referring
Token ID9759
Feature activation+1.49e+00

Pos logit contributions
��+1.235
�士+1.114
+1.015
NT+0.998
��+0.979
Neg logit contributions
 IMAGES-1.297
iph-1.272
kefeller-1.147
bley-1.114
pees-1.088
Token to
Token ID284
Feature activation+2.93e+00

Pos logit contributions
 IMAGES+1.090
iph+1.085
kefeller+0.950
bley+0.931
 STATES+0.911
Neg logit contributions
��-1.218
�士-1.112
-1.009
NT-1.006
 welf-0.996
Token the
Token ID262
Feature activation-3.89e-01

Pos logit contributions
 IMAGES+2.306
iph+2.295
kefeller+1.999
bley+1.975
 STATES+1.940
Neg logit contributions
��-2.260
�士-2.070
NT-1.844
-1.839
��-1.829
Token perpetrators
Token ID25185
Feature activation+4.36e+00

Pos logit contributions
��+0.286
�士+0.258
+0.233
NT+0.230
 welf+0.227
Neg logit contributions
 IMAGES-0.322
iph-0.319
kefeller-0.286
bley-0.279
 STATES-0.274
Token as
Token ID355
Feature activation-6.69e-01

Pos logit contributions
iph+3.115
 IMAGES+3.098
kefeller+2.632
bley+2.624
 STATES+2.557
Neg logit contributions
��-3.650
�士-3.390
-3.028
 welf-3.012
NT-3.005
Token sav
Token ID6799
Feature activation+2.67e+00

Pos logit contributions
��+0.500
�士+0.451
+0.408
NT+0.401
 welf+0.398
Neg logit contributions
 IMAGES-0.546
iph-0.542
kefeller-0.486
bley-0.474
pees-0.466
Tokenages
Token ID1095
Feature activation+9.92e-01

Pos logit contributions
 IMAGES+1.515
iph+1.510
kefeller+1.250
bley+1.222
 STATES+1.184
Neg logit contributions
��-2.663
�士-2.476
-2.281
NT-2.273
 welf-2.253
Token.
Token ID13
Feature activation-1.13e+00

Pos logit contributions
 IMAGES+0.752
iph+0.746
kefeller+0.656
bley+0.642
 STATES+0.629
Neg logit contributions
��-0.800
�士-0.729
-0.661
NT-0.656
 welf-0.646
Token Which
Token ID9022
Feature activation+1.40e-01

Pos logit contributions
��+0.831
�士+0.746
+0.672
NT+0.668
 welf+0.645
Neg logit contributions
iph-0.935
 IMAGES-0.935
kefeller-0.844
bley-0.824
pees-0.805
Token.
Token ID13
Feature activation+2.17e+00

Pos logit contributions
 IMAGES+1.256
iph+1.250
kefeller+1.090
bley+1.068
 STATES+1.047
Neg logit contributions
��-1.435
�士-1.316
-1.192
NT-1.182
 welf-1.171
Tokenorg
Token ID2398
Feature activation+3.02e+00

Pos logit contributions
iph+1.842
 IMAGES+1.839
kefeller+1.621
bley+1.606
 STATES+1.578
Neg logit contributions
��-1.546
�士-1.414
-1.234
��-1.225
 welf-1.221
Token/
Token ID14
Feature activation+2.99e+00

Pos logit contributions
 IMAGES+1.894
iph+1.884
kefeller+1.593
bley+1.560
 STATES+1.531
Neg logit contributions
��-2.829
�士-2.626
-2.399
NT-2.381
 welf-2.365
Tokenindex
Token ID9630
Feature activation+3.80e+00

Pos logit contributions
iph+2.024
 IMAGES+1.993
bley+1.709
kefeller+1.656
pees+1.655
Neg logit contributions
��-2.624
�士-2.455
��-2.209
 welf-2.198
-2.192
Token.
Token ID13
Feature activation-3.97e-01

Pos logit contributions
iph+2.677
 IMAGES+2.647
kefeller+2.284
bley+2.259
 STATES+2.220
Neg logit contributions
��-3.272
�士-3.067
 welf-2.728
-2.726
��-2.696
Tokenphp
Token ID10121
Feature activation+5.17e+00

Pos logit contributions
��+0.301
�士+0.272
+0.246
NT+0.244
 welf+0.239
Neg logit contributions
 IMAGES-0.320
iph-0.316
kefeller-0.282
bley-0.276
 STATES-0.271
Token?
Token ID30
Feature activation+6.09e+00

Pos logit contributions
iph+3.588
 IMAGES+3.540
bley+3.003
kefeller+2.946
 STATES+2.941
Neg logit contributions
��-4.504
�士-4.148
 welf-3.735
-3.732
��-3.657
Tokentopic
Token ID26652
Feature activation+1.52e+00

Pos logit contributions
iph+3.769
 IMAGES+3.604
bley+2.995
pees+2.945
 STATES+2.903
Neg logit contributions
��-5.715
�士-5.433
 welf-4.857
-4.845
��-4.754
Token=
Token ID28
Feature activation+5.88e-01

Pos logit contributions
 IMAGES+1.363
iph+1.355
kefeller+1.215
bley+1.195
 STATES+1.176
Neg logit contributions
��-1.011
�士-0.904
-0.798
NT-0.790
 welf-0.776
Token28
Token ID2078
Feature activation-5.21e+00

Pos logit contributions
 IMAGES+0.467
iph+0.462
kefeller+0.411
bley+0.403
 STATES+0.394
Neg logit contributions
��-0.451
�士-0.410
-0.370
NT-0.370
 welf-0.358
Token18
Token ID1507
Feature activation-5.00e+00

Pos logit contributions
��+3.508
�士+3.141
NT+2.931
+2.862
────+2.793
Neg logit contributions
 IMAGES-4.380
iph-4.325
kefeller-3.992
bley-3.891
pees-3.799
Token Em
Token ID2295
Feature activation+1.95e+00

Pos logit contributions
 IMAGES+1.226
iph+1.213
kefeller+1.041
bley+1.013
 STATES+0.990
Neg logit contributions
��-1.726
�士-1.592
-1.463
NT-1.456
 welf-1.433
Tokenanski
Token ID44978
Feature activation+1.50e+00

Pos logit contributions
 IMAGES+1.371
iph+1.359
kefeller+1.181
bley+1.154
 STATES+1.130
Neg logit contributions
��-1.679
�士-1.540
-1.405
NT-1.397
 welf-1.376
Token-
Token ID12
Feature activation-1.47e+00

Pos logit contributions
 IMAGES+1.159
iph+1.155
kefeller+1.014
bley+0.999
 STATES+0.979
Neg logit contributions
��-1.178
�士-1.074
-0.965
NT-0.960
 welf-0.946
Tokenesque
Token ID28939
Feature activation+1.11e+00

Pos logit contributions
��+1.162
�士+1.056
+0.959
NT+0.958
 welf+0.936
Neg logit contributions
 IMAGES-1.142
iph-1.120
kefeller-0.992
bley-0.967
 STATES-0.957
Token back
Token ID736
Feature activation+1.10e+00

Pos logit contributions
 IMAGES+0.867
iph+0.860
kefeller+0.760
bley+0.743
 STATES+0.729
Neg logit contributions
��-0.870
�士-0.791
-0.715
NT-0.710
 welf-0.699
Token-
Token ID12
Feature activation+4.55e-01

Pos logit contributions
 IMAGES+0.846
iph+0.835
kefeller+0.737
bley+0.720
 STATES+0.707
Neg logit contributions
��-0.876
�士-0.797
-0.723
NT-0.717
 welf-0.705
Tokento
Token ID1462
Feature activation+3.14e+00

Pos logit contributions
 IMAGES+0.335
iph+0.329
kefeller+0.289
bley+0.282
 STATES+0.278
Neg logit contributions
��-0.377
�士-0.345
-0.315
NT-0.313
 welf-0.307
Token-
Token ID12
Feature activation+3.82e+00

Pos logit contributions
 IMAGES+2.291
iph+2.288
kefeller+1.991
bley+1.955
 STATES+1.912
Neg logit contributions
��-2.621
�士-2.424
-2.168
NT-2.143
��-2.138
Tokenback
Token ID1891
Feature activation+8.43e-01

Pos logit contributions
iph+1.777
 IMAGES+1.694
kefeller+1.412
bley+1.401
pees+1.312
Neg logit contributions
��-4.213
�士-3.923
��-3.627
-3.624
 welf-3.589
Token-
Token ID12
Feature activation-8.97e-01

Pos logit contributions
 IMAGES+0.657
iph+0.651
kefeller+0.575
bley+0.563
 STATES+0.552
Neg logit contributions
��-0.663
�士-0.602
-0.545
NT-0.542
 welf-0.532
Tokento
Token ID1462
Feature activation+3.57e+00

Pos logit contributions
��+0.742
�士+0.679
+0.619
NT+0.617
 welf+0.603
Neg logit contributions
 IMAGES-0.663
iph-0.650
kefeller-0.572
bley-0.557
 STATES-0.550
Token a
Token ID257
Feature activation-1.52e+00

Pos logit contributions
 IMAGES+0.448
iph+0.445
kefeller+0.396
bley+0.387
 STATES+0.379
Neg logit contributions
��-0.418
�士-0.377
-0.340
NT-0.339
 welf-0.332
Token very
Token ID845
Feature activation+1.67e-01

Pos logit contributions
��+1.100
�士+0.973
+0.894
NT+0.892
 welf+0.880
Neg logit contributions
 IMAGES-1.254
iph-1.251
kefeller-1.121
bley-1.084
pees-1.071
Token detailed
Token ID6496
Feature activation-1.64e+00

Pos logit contributions
 IMAGES+0.137
iph+0.136
kefeller+0.122
bley+0.119
 STATES+0.116
Neg logit contributions
��-0.124
�士-0.111
-0.101
NT-0.100
 welf-0.099
Token way
Token ID835
Feature activation+2.38e+00

Pos logit contributions
��+1.227
�士+1.089
NT+0.996
+0.993
 welf+0.986
Neg logit contributions
 IMAGES-1.333
iph-1.328
kefeller-1.197
bley-1.154
pees-1.139
Token to
Token ID284
Feature activation+3.28e+00

Pos logit contributions
 IMAGES+1.771
iph+1.766
kefeller+1.538
bley+1.510
 STATES+1.479
Neg logit contributions
��-1.938
�士-1.781
-1.603
NT-1.589
��-1.575
Token diagnose
Token ID37489
Feature activation+2.79e+00

Pos logit contributions
 IMAGES+2.338
iph+2.318
kefeller+2.013
bley+1.972
 STATES+1.937
Neg logit contributions
��-2.795
�士-2.571
-2.333
NT-2.307
��-2.288
Token what
Token ID644
Feature activation+3.78e+00

Pos logit contributions
 IMAGES+2.216
iph+2.206
kefeller+1.938
bley+1.901
 STATES+1.877
Neg logit contributions
��-2.150
�士-1.958
-1.757
NT-1.733
��-1.727
Token is
Token ID318
Feature activation+4.14e+00

Pos logit contributions
 IMAGES+2.884
iph+2.858
kefeller+2.499
bley+2.450
 STATES+2.437
Neg logit contributions
��-3.029
�士-2.758
-2.495
NT-2.459
��-2.456
Token actually
Token ID1682
Feature activation+3.90e+00

Pos logit contributions
iph+3.056
 IMAGES+3.055
kefeller+2.650
bley+2.591
 STATES+2.565
Neg logit contributions
��-3.385
�士-3.103
-2.816
��-2.762
NT-2.754
Token keeping
Token ID5291
Feature activation-6.96e-01

Pos logit contributions
 IMAGES+2.883
iph+2.875
kefeller+2.501
bley+2.447
 STATES+2.423
Neg logit contributions
��-3.194
�士-2.933
-2.649
��-2.612
NT-2.606
Token your
Token ID534
Feature activation+1.08e+00

Pos logit contributions
��+0.519
�士+0.468
+0.422
NT+0.421
 welf+0.412
Neg logit contributions
 IMAGES-0.567
iph-0.563
kefeller-0.502
bley-0.492
 STATES-0.481
Token The
Token ID383
Feature activation+1.06e+00

Pos logit contributions
��+0.252
�士+0.228
NT+0.205
+0.204
 welf+0.197
Neg logit contributions
iph-0.274
 IMAGES-0.274
kefeller-0.245
bley-0.240
pees-0.234
Token production
Token ID3227
Feature activation+3.01e+00

Pos logit contributions
 IMAGES+0.817
iph+0.812
kefeller+0.717
bley+0.700
 STATES+0.686
Neg logit contributions
��-0.848
�士-0.771
-0.698
NT-0.695
 welf-0.682
Token company
Token ID1664
Feature activation+1.72e+00

Pos logit contributions
 IMAGES+2.208
iph+2.182
kefeller+1.889
bley+1.859
 STATES+1.830
Neg logit contributions
��-2.503
�士-2.293
-2.086
NT-2.062
��-2.041
Token has
Token ID468
Feature activation-1.64e-01

Pos logit contributions
 IMAGES+1.299
iph+1.290
kefeller+1.134
bley+1.110
 STATES+1.087
Neg logit contributions
��-1.386
�士-1.263
-1.144
NT-1.137
 welf-1.119
Token stayed
Token ID9658
Feature activation+2.07e+00

Pos logit contributions
��+0.127
�士+0.115
NT+0.104
+0.104
 welf+0.102
Neg logit contributions
 IMAGES-0.129
iph-0.128
kefeller-0.114
bley-0.110
pees-0.108
Token tight
Token ID5381
Feature activation+1.07e+00

Pos logit contributions
 IMAGES+1.745
iph+1.737
kefeller+1.542
bley+1.517
 STATES+1.493
Neg logit contributions
��-1.485
�士-1.343
-1.195
NT-1.187
��-1.166
Token l
Token ID300
Feature activation-1.25e+00

Pos logit contributions
 IMAGES+0.806
iph+0.802
kefeller+0.704
bley+0.689
 STATES+0.675
Neg logit contributions
��-0.867
�士-0.791
-0.716
NT-0.712
��-0.700
Tokenipped
Token ID3949
Feature activation+6.76e-01

Pos logit contributions
��+0.951
�士+0.856
+0.789
NT+0.783
 welf+0.763
Neg logit contributions
 IMAGES-0.973
iph-0.957
kefeller-0.853
bley-0.835
 STATES-0.823
Token about
Token ID546
Feature activation-6.65e-01

Pos logit contributions
 IMAGES+0.547
iph+0.544
kefeller+0.484
bley+0.472
 STATES+0.464
Neg logit contributions
��-0.509
�士-0.458
-0.414
NT-0.411
 welf-0.404
Token both
Token ID1111
Feature activation-1.58e+00

Pos logit contributions
��+0.490
�士+0.439
NT+0.396
+0.392
 welf+0.384
Neg logit contributions
 IMAGES-0.548
iph-0.546
kefeller-0.491
bley-0.478
pees-0.468
Token titles
Token ID8714
Feature activation+1.30e+00

Pos logit contributions
��+1.174
�士+1.054
NT+0.956
+0.948
 welf+0.931
Neg logit contributions
 IMAGES-1.273
iph-1.271
kefeller-1.150
bley-1.118
pees-1.090
Token country
Token ID1499
Feature activation+2.15e+00

Pos logit contributions
iph+2.548
 IMAGES+2.532
kefeller+2.276
bley+2.216
 STATES+2.191
Neg logit contributions
��-2.201
�士-1.972
-1.756
NT-1.746
catentry-1.736
Token and
Token ID290
Feature activation+1.17e+00

Pos logit contributions
 IMAGES+1.660
iph+1.651
kefeller+1.450
bley+1.425
 STATES+1.399
Neg logit contributions
��-1.697
�士-1.547
-1.392
NT-1.382
��-1.362
Token Puerto
Token ID15295
Feature activation-3.50e+00

Pos logit contributions
 IMAGES+0.924
iph+0.920
kefeller+0.813
bley+0.796
 STATES+0.778
Neg logit contributions
��-0.909
�士-0.824
-0.746
NT-0.742
 welf-0.731
Token Rico
Token ID16707
Feature activation+5.85e-01

Pos logit contributions
��+2.778
�士+2.492
+2.326
NT+2.316
 Ll+2.295
Neg logit contributions
 IMAGES-2.610
iph-2.582
kefeller-2.296
bley-2.225
agus-2.161
Token getting
Token ID1972
Feature activation+4.25e+00

Pos logit contributions
 IMAGES+0.457
iph+0.453
kefeller+0.401
bley+0.392
 STATES+0.384
Neg logit contributions
��-0.457
�士-0.414
-0.376
NT-0.374
 welf-0.366
Token the
Token ID262
Feature activation+3.05e-01

Pos logit contributions
 IMAGES+3.101
iph+3.062
kefeller+2.664
bley+2.606
 STATES+2.568
Neg logit contributions
��-3.552
�士-3.263
-2.942
��-2.906
NT-2.906
Token testing
Token ID4856
Feature activation-2.71e+00

Pos logit contributions
 IMAGES+0.247
iph+0.246
kefeller+0.218
bley+0.213
 STATES+0.210
Neg logit contributions
��-0.229
�士-0.207
-0.187
NT-0.186
 welf-0.183
Token that
Token ID326
Feature activation-7.30e-01

Pos logit contributions
��+2.016
�士+1.790
NT+1.667
+1.631
 Ll+1.625
Neg logit contributions
 IMAGES-2.175
iph-2.175
kefeller-1.937
bley-1.885
 STATES-1.858
Token
Token ID447
Feature activation+6.70e-01

Pos logit contributions
��+0.570
�士+0.518
NT+0.471
+0.469
 welf+0.462
Neg logit contributions
iph-0.564
 IMAGES-0.563
kefeller-0.499
bley-0.487
 STATES-0.475
Token
Token ID247
Feature activation-9.84e-01

Pos logit contributions
 IMAGES+0.677
iph+0.672
kefeller+0.613
bley+0.601
 STATES+0.594
Neg logit contributions
��-0.374
�士-0.325
-0.283
NT-0.276
��-0.270
Tokens
Token ID82
Feature activation+1.74e+00

Pos logit contributions
��+0.763
�士+0.691
+0.628
NT+0.626
��+0.610
Neg logit contributions
 IMAGES-0.773
iph-0.765
kefeller-0.680
bley-0.663
 STATES-0.653
Token a
Token ID257
Feature activation+6.34e-01

Pos logit contributions
 IMAGES+0.843
iph+0.835
kefeller+0.740
bley+0.726
 STATES+0.711
Neg logit contributions
��-0.796
�士-0.718
-0.649
NT-0.645
 welf-0.636
Token $
Token ID720
Feature activation+1.12e+00

Pos logit contributions
 IMAGES+0.499
iph+0.494
kefeller+0.437
bley+0.427
 STATES+0.419
Neg logit contributions
��-0.494
�士-0.446
-0.405
NT-0.402
 welf-0.398
Token200
Token ID2167
Feature activation+3.88e-01

Pos logit contributions
 IMAGES+0.866
iph+0.860
kefeller+0.758
bley+0.742
 STATES+0.727
Neg logit contributions
��-0.891
�士-0.810
-0.733
NT-0.729
 welf-0.715
Token,
Token ID11
Feature activation+3.76e+00

Pos logit contributions
 IMAGES+0.304
iph+0.301
kefeller+0.266
bley+0.260
 STATES+0.255
Neg logit contributions
��-0.304
�士-0.275
-0.249
NT-0.247
 welf-0.243
Token000
Token ID830
Feature activation+2.45e+00

Pos logit contributions
 IMAGES+2.142
iph+2.122
kefeller+1.780
bley+1.730
 STATES+1.692
Neg logit contributions
��-3.723
�士-3.486
-3.195
 welf-3.171
NT-3.152
Token sever
Token ID1750
Feature activation+6.07e-01

Pos logit contributions
 IMAGES+1.933
iph+1.922
kefeller+1.696
bley+1.662
 STATES+1.633
Neg logit contributions
��-1.896
�士-1.725
-1.552
NT-1.542
��-1.515
Tokenance
Token ID590
Feature activation+9.89e-01

Pos logit contributions
 IMAGES+0.450
iph+0.447
kefeller+0.392
bley+0.383
 STATES+0.376
Neg logit contributions
��-0.498
�士-0.456
-0.413
NT-0.410
 welf-0.404
Token after
Token ID706
Feature activation+1.31e+00

Pos logit contributions
 IMAGES+0.781
iph+0.773
kefeller+0.685
bley+0.672
 STATES+0.658
Neg logit contributions
��-0.767
�士-0.694
-0.628
NT-0.624
 welf-0.614
Token retiring
Token ID29040
Feature activation+2.27e+00

Pos logit contributions
 IMAGES+1.018
iph+1.009
kefeller+0.890
bley+0.872
 STATES+0.854
Neg logit contributions
��-1.032
�士-0.937
-0.847
NT-0.844
 welf-0.830
Token from
Token ID422
Feature activation+2.17e+00

Pos logit contributions
 IMAGES+1.748
iph+1.737
kefeller+1.527
bley+1.496
 STATES+1.470
Neg logit contributions
��-1.795
�士-1.642
-1.478
NT-1.468
��-1.444
Token a
Token ID257
Feature activation+1.30e+00

Pos logit contributions
 IMAGES+1.616
iph+1.604
kefeller+1.406
bley+1.376
 STATES+1.349
Neg logit contributions
��-1.775
�士-1.621
-1.470
NT-1.461
 welf-1.437
Token1
Token ID16
Feature activation+7.31e-01

Pos logit contributions
��+0.358
�士+0.316
+0.292
NT+0.292
────+0.281
Neg logit contributions
 IMAGES-0.419
iph-0.418
kefeller-0.380
bley-0.369
pees-0.358
Token //
Token ID3373
Feature activation+9.00e-01

Pos logit contributions
iph+0.422
 IMAGES+0.421
kefeller+0.359
bley+0.348
pees+0.334
Neg logit contributions
��-0.716
�士-0.660
-0.616
NT-0.615
 welf-0.599
Token metres
Token ID18985
Feature activation+2.72e+00

Pos logit contributions
 IMAGES+0.723
iph+0.721
kefeller+0.639
bley+0.625
 STATES+0.613
Neg logit contributions
��-0.683
�士-0.616
-0.556
NT-0.553
 welf-0.543
Token _
Token ID4808
Feature activation+3.58e+00

Pos logit contributions
 IMAGES+1.860
iph+1.816
kefeller+1.539
bley+1.516
 STATES+1.513
Neg logit contributions
��-2.400
�士-2.223
-2.013
NT-2.007
 welf-2.002
TokenP
Token ID47
Feature activation-2.09e+00

Pos logit contributions
 IMAGES+2.784
iph+2.754
 STATES+2.361
kefeller+2.352
bley+2.340
Neg logit contributions
��-2.850
�士-2.671
 welf-2.376
��-2.329
-2.309
Tokenulse
Token ID9615
Feature activation-5.22e+00

Pos logit contributions
��+1.771
NT+1.550
�士+1.534
+1.499
catentry+1.483
Neg logit contributions
 IMAGES-1.474
iph-1.394
kefeller-1.274
bley-1.220
 STATES-1.190
TokenPer
Token ID5990
Feature activation-6.81e+00

Pos logit contributions
��+2.897
NT+2.436
────+2.419
Associated+2.385
�士+2.272
Neg logit contributions
 IMAGES-5.048
iph-5.032
kefeller-4.781
bley-4.586
agus-4.421
Tokeniod
Token ID2101
Feature activation-2.52e-01

Pos logit contributions
��+3.922
NT+3.275
+3.186
────+3.118
Associated+3.041
Neg logit contributions
 IMAGES-6.534
iph-6.005
kefeller-5.820
bley-5.535
 STATES-5.452
Token (
Token ID357
Feature activation+2.79e+00

Pos logit contributions
��+0.209
�士+0.188
NT+0.173
+0.173
catentry+0.170
Neg logit contributions
 IMAGES-0.185
iph-0.184
kefeller-0.163
bley-0.158
 STATES-0.153
Token "
Token ID366
Feature activation-3.79e+00

Pos logit contributions
 IMAGES+2.257
iph+2.221
kefeller+1.941
bley+1.919
 STATES+1.903
Neg logit contributions
��-2.124
�士-1.940
-1.733
NT-1.716
 welf-1.702
TokenP
Token ID47
Feature activation-2.50e+00

Pos logit contributions
��+2.390
NT+2.013
�士+1.958
Associated+1.939
────+1.926
Neg logit contributions
 IMAGES-3.438
iph-3.393
kefeller-3.226
bley-3.085
agus-2.975
Token the
Token ID262
Feature activation-4.32e+00

Pos logit contributions
��+3.017
�士+2.694
NT+2.518
+2.486
 welf+2.485
Neg logit contributions
 IMAGES-3.056
iph-3.054
kefeller-2.713
bley-2.663
agus-2.567
Token U
Token ID471
Feature activation-1.22e+00

Pos logit contributions
��+3.037
�士+2.714
NT+2.489
+2.456
 welf+2.455
Neg logit contributions
 IMAGES-3.639
iph-3.621
kefeller-3.211
bley-3.169
pees-3.065
Token.
Token ID13
Feature activation-1.09e+00

Pos logit contributions
��+0.923
�士+0.806
NT+0.790
+0.757
Associated+0.749
Neg logit contributions
 IMAGES-0.977
iph-0.959
kefeller-0.866
bley-0.841
pees-0.811
TokenS
Token ID50
Feature activation-2.70e+00

Pos logit contributions
��+1.007
�士+0.912
NT+0.874
+0.858
Associated+0.839
Neg logit contributions
iph-0.697
 IMAGES-0.692
kefeller-0.606
bley-0.591
pees-0.563
Token.
Token ID13
Feature activation-4.98e+00

Pos logit contributions
��+2.189
�士+1.995
+1.811
NT+1.809
 welf+1.773
Neg logit contributions
 IMAGES-2.043
iph-2.019
kefeller-1.764
bley-1.738
 STATES-1.696
Token government
Token ID1230
Feature activation-5.32e+00

Pos logit contributions
��+3.638
�士+3.277
NT+3.045
+3.033
 welf+2.978
Neg logit contributions
 IMAGES-3.994
iph-3.969
kefeller-3.501
bley-3.501
agus-3.322
Token determine
Token ID5004
Feature activation-1.85e+00

Pos logit contributions
��+3.757
�士+3.290
NT+3.184
 welf+3.149
 Ll+3.102
Neg logit contributions
 IMAGES-4.238
iph-4.234
kefeller-3.807
bley-3.727
agus-3.603
Token the
Token ID262
Feature activation-2.16e+00

Pos logit contributions
��+1.348
�士+1.217
NT+1.105
+1.102
 welf+1.099
Neg logit contributions
 IMAGES-1.520
iph-1.505
kefeller-1.332
bley-1.325
pees-1.280
Token social
Token ID1919
Feature activation-6.12e+00

Pos logit contributions
��+1.512
�士+1.351
+1.227
NT+1.225
 welf+1.224
Neg logit contributions
 IMAGES-1.843
iph-1.822
kefeller-1.623
bley-1.599
 STATES-1.558
Token costs
Token ID3484
Feature activation-2.67e+00

Pos logit contributions
��+3.354
�士+2.925
 welf+2.760
NT+2.689
+2.687
Neg logit contributions
 IMAGES-5.918
iph-5.886
kefeller-5.373
bley-5.290
 STATES-5.159
Token of
Token ID286
Feature activation-1.51e+00

Pos logit contributions
��+2.102
�士+1.900
+1.730
NT+1.724
 welf+1.698
Neg logit contributions
 IMAGES-2.075
iph-2.054
kefeller-1.807
bley-1.787
 STATES-1.741
Token of
Token ID286
Feature activation-2.33e+00

Pos logit contributions
��+1.516
�士+1.364
+1.244
NT+1.234
 welf+1.212
Neg logit contributions
 IMAGES-1.601
iph-1.592
kefeller-1.415
bley-1.395
 STATES-1.361
Token this
Token ID428
Feature activation+1.81e+00

Pos logit contributions
��+1.620
�士+1.443
NT+1.299
+1.297
 welf+1.259
Neg logit contributions
 IMAGES-2.022
iph-1.999
kefeller-1.794
bley-1.771
pees-1.729
Token contributes
Token ID22625
Feature activation+3.62e-01

Pos logit contributions
 IMAGES+1.398
iph+1.388
kefeller+1.223
bley+1.198
 STATES+1.175
Neg logit contributions
��-1.435
�士-1.306
-1.180
NT-1.172
 welf-1.153
Token to
Token ID284
Feature activation-1.29e+00

Pos logit contributions
 IMAGES+0.275
iph+0.273
kefeller+0.239
bley+0.235
 STATES+0.230
Neg logit contributions
��-0.293
�士-0.267
-0.241
NT-0.240
 welf-0.236
Token rising
Token ID7396
Feature activation-2.10e+00

Pos logit contributions
��+0.938
�士+0.844
+0.764
NT+0.762
 welf+0.740
Neg logit contributions
 IMAGES-1.073
iph-1.061
kefeller-0.949
bley-0.929
pees-0.910
Token seas
Token ID21547
Feature activation-4.58e+00

Pos logit contributions
��+1.505
�士+1.342
NT+1.236
+1.232
 welf+1.202
Neg logit contributions
 IMAGES-1.743
iph-1.729
kefeller-1.547
bley-1.521
pees-1.507
Token
Token ID851
Feature activation+3.92e-01

Pos logit contributions
��+3.336
�士+2.986
+2.792
NT+2.769
────+2.712
Neg logit contributions
 IMAGES-3.709
iph-3.632
kefeller-3.290
bley-3.191
pees-3.141
Token because
Token ID780
Feature activation-2.47e+00

Pos logit contributions
 IMAGES+0.305
iph+0.303
kefeller+0.267
bley+0.262
 STATES+0.257
Neg logit contributions
��-0.309
�士-0.280
-0.254
NT-0.252
 welf-0.248
Token of
Token ID286
Feature activation-1.21e+00

Pos logit contributions
��+1.782
�士+1.609
NT+1.476
+1.460
 Ll+1.435
Neg logit contributions
 IMAGES-2.029
iph-2.008
kefeller-1.816
bley-1.765
pees-1.743
Token course
Token ID1781
Feature activation-1.06e+00

Pos logit contributions
��+0.888
�士+0.801
+0.724
NT+0.724
 welf+0.705
Neg logit contributions
 IMAGES-0.999
iph-0.984
kefeller-0.880
bley-0.863
 STATES-0.846
Token,
Token ID11
Feature activation-1.25e+00

Pos logit contributions
��+0.790
�士+0.715
NT+0.648
+0.647
 welf+0.631
Neg logit contributions
 IMAGES-0.855
iph-0.846
kefeller-0.758
bley-0.738
pees-0.723
Token
Token ID198
Feature activation-8.17e-01

Pos logit contributions
��+0.098
�士+0.088
+0.079
NT+0.079
 welf+0.077
Neg logit contributions
 IMAGES-0.107
iph-0.107
kefeller-0.095
bley-0.093
 STATES-0.091
TokenHow
Token ID2437
Feature activation+6.48e-01

Pos logit contributions
��+0.643
�士+0.579
NT+0.533
+0.530
catentry+0.513
Neg logit contributions
 IMAGES-0.638
iph-0.626
kefeller-0.562
bley-0.546
 STATES-0.536
Token about
Token ID546
Feature activation+5.36e-01

Pos logit contributions
 IMAGES+0.501
iph+0.498
kefeller+0.439
bley+0.430
 STATES+0.420
Neg logit contributions
��-0.512
�士-0.465
-0.422
NT-0.419
 welf-0.413
Token just
Token ID655
Feature activation+2.41e+00

Pos logit contributions
 IMAGES+0.431
iph+0.428
kefeller+0.379
bley+0.371
 STATES+0.364
Neg logit contributions
��-0.408
�士-0.369
-0.333
NT-0.331
 welf-0.325
Token in
Token ID287
Feature activation-3.94e+00

Pos logit contributions
 IMAGES+1.863
iph+1.857
kefeller+1.625
bley+1.598
 STATES+1.575
Neg logit contributions
��-1.907
�士-1.740
-1.563
NT-1.551
��-1.534
Token the
Token ID262
Feature activation-4.83e+00

Pos logit contributions
��+2.696
�士+2.405
+2.206
NT+2.200
 welf+2.151
Neg logit contributions
 IMAGES-3.386
iph-3.329
kefeller-3.051
bley-2.944
pees-2.874
Token Ruby
Token ID10888
Feature activation-2.62e+00

Pos logit contributions
��+3.519
�士+3.140
+2.912
NT+2.909
 welf+2.870
Neg logit contributions
 IMAGES-3.869
iph-3.833
kefeller-3.505
bley-3.387
pees-3.320
Token community
Token ID2055
Feature activation+2.00e+00

Pos logit contributions
��+1.940
�士+1.730
NT+1.587
+1.586
 welf+1.546
Neg logit contributions
 IMAGES-2.134
iph-2.113
kefeller-1.918
bley-1.841
pees-1.812
Token?
Token ID30
Feature activation+5.15e-01

Pos logit contributions
 IMAGES+1.502
iph+1.492
kefeller+1.301
bley+1.282
 STATES+1.259
Neg logit contributions
��-1.623
�士-1.484
-1.333
NT-1.326
��-1.307
Token
Token ID198
Feature activation+5.10e-01

Pos logit contributions
 IMAGES+0.414
iph+0.411
kefeller+0.366
bley+0.358
 STATES+0.350
Neg logit contributions
��-0.392
�士-0.354
-0.320
NT-0.318
 welf-0.311
Token
Token ID198
Feature activation+5.63e-01

Pos logit contributions
 IMAGES+0.428
iph+0.425
kefeller+0.382
bley+0.373
pees+0.365
Neg logit contributions
��-0.368
�士-0.326
NT-0.299
-0.297
────-0.287
Token and
Token ID290
Feature activation+2.22e+00

Pos logit contributions
 IMAGES+0.512
iph+0.508
kefeller+0.444
bley+0.438
 STATES+0.428
Neg logit contributions
��-0.542
�士-0.496
-0.447
NT-0.444
 welf-0.439
Token 48
Token ID4764
Feature activation+2.92e+00

Pos logit contributions
 IMAGES+1.851
iph+1.840
kefeller+1.630
bley+1.608
 STATES+1.575
Neg logit contributions
��-1.628
�士-1.471
-1.315
NT-1.299
��-1.291
Token injured
Token ID6686
Feature activation+2.08e+00

Pos logit contributions
 IMAGES+2.300
iph+2.293
kefeller+1.998
bley+1.989
 STATES+1.943
Neg logit contributions
��-2.256
�士-2.039
-1.839
 welf-1.821
NT-1.820
Token,
Token ID11
Feature activation-7.94e-01

Pos logit contributions
 IMAGES+1.565
iph+1.556
kefeller+1.346
bley+1.339
pees+1.304
Neg logit contributions
��-1.681
�士-1.542
-1.386
NT-1.376
��-1.370
Token President
Token ID1992
Feature activation+1.05e+00

Pos logit contributions
��+0.601
�士+0.543
+0.489
NT+0.488
 welf+0.481
Neg logit contributions
 IMAGES-0.638
iph-0.636
kefeller-0.565
bley-0.551
 STATES-0.542
Token Trump
Token ID1301
Feature activation-5.23e+00

Pos logit contributions
 IMAGES+0.720
iph+0.715
kefeller+0.619
bley+0.605
 STATES+0.592
Neg logit contributions
��-0.915
�士-0.841
-0.768
NT-0.764
 welf-0.752
Token responded
Token ID7082
Feature activation+2.77e+00

Pos logit contributions
��+3.732
�士+3.410
NT+3.047
 Ll+3.045
+3.045
Neg logit contributions
 IMAGES-4.244
iph-4.185
kefeller-3.797
bley-3.613
 STATES-3.588
Token to
Token ID284
Feature activation-2.46e+00

Pos logit contributions
 IMAGES+2.072
iph+2.061
kefeller+1.801
bley+1.771
 STATES+1.745
Neg logit contributions
��-2.252
�士-2.070
-1.857
NT-1.851
��-1.829
Token the
Token ID262
Feature activation-2.66e+00

Pos logit contributions
��+1.826
�士+1.645
+1.478
NT+1.466
 welf+1.446
Neg logit contributions
 IMAGES-2.001
iph-1.989
kefeller-1.772
bley-1.736
pees-1.709
Token tragedy
Token ID13574
Feature activation+2.78e+00

Pos logit contributions
��+1.997
�士+1.793
+1.637
NT+1.624
 welf+1.610
Neg logit contributions
 IMAGES-2.132
iph-2.124
kefeller-1.901
bley-1.840
pees-1.833
Token by
Token ID416
Feature activation-9.60e-01

Pos logit contributions
 IMAGES+2.082
iph+2.064
kefeller+1.791
bley+1.773
 STATES+1.745
Neg logit contributions
��-2.269
�士-2.101
-1.874
NT-1.865
��-1.850