TOP ACTIVATIONS
MAX = 1.598

ve had as pres ­ id ­ ent since at
thrust of his 2012 pres ­ id ­ en ­ t
di ­ um and al ­ l owed only a brief
prime - time video mes ­ s age . This led
ing Jimmy Carter . Al ­ though he left the White
name is still a polit ­ ic ­ al in ­
) AS count FROM ( SELECT SPL IT ( content ,
> & density Grid , const Vector < Part icle >
-- they have little in common at first except their love
icle :: Get Mass ( float ambient D ensity ) const
to get the most out of the limited number of pixels
­ on to this pres ­ id ­ ent ,
, and push and j ost le in that ritual istic
Game in 2016 ( Photo : Wayne McG a hee III
] WHERE id IN ( SELECT id FROM cl ojure .
) { // Time component const float TA U = 6
To get the most out of it will require commitment and
for cases . ( Photo by Kathleen Flynn , N OL
] WHERE id IN ( SELECT id FROM [ cl ojure
rights group were on the same page ? Claire Gast an

MIN ACTIVATIONS
MIN = -1.529

youtube . com / channel / UC _ K k BY
ek . com / thread / 26 219 / user -
partition (/ dev / block / mm c bl k 0
non random ized , controlled study . The first cohort of
MD D . Methods This is a non random
con ceptions I 've been asked about and the answers as
. Materials and Methods Experimental subjects Three female r
shows that they planned to excise from their lives . Again
19 th partition (/ dev / block / mm c bl
great . Competitive Trib es Ascend player . Sym tex in
Pride he sought to st oke through Pad m av ati
, for example , You Gov discovered that 55 percent of
Not - Associated T x - Power = 20 dB m
ek . com / thread / 54 33 95 / lost
Barack Hussein Obama With low birth rate , America needs future
coordination with the Ministry of Foreign Affairs . The country
parties . Big Fan Profile Blog Joined December 2010
down the mountain , and decided to wait out the rain
cor vette forum . com / forums ... with - p
cl unky acting and weak plot gets rave reviews in the

INTERVAL 0.816 - 1.598
CONTAINS 0.254%

to this pres ­ id ­ ent , John
are a lot of p oth oles . (
the crisis to a new level it would be impossible
about Anna but these caught me off guard . Ol af
basically reconnect ing with each other s bodies outside

INTERVAL 0.035 - 0.816
CONTAINS 44.730%

early to say what the worst outcome could be
| C aj un Kitchen C aj un Kitchen
So far , only California has adopted the bills
aring such different molecules as minerals and organ ics from the
with e thereum ) organization aimed at taking on Uber first

INTERVAL -0.747 - 0.035
CONTAINS 54.629%

like transaction cut - through and bid irection al payment channels
anxiety because will you be able to transport it ? There
. From the Kodi home screen , select Add
after a movie 's DVD release to show up , and
And then we re in . And it

INTERVAL -1.529 - -0.747
CONTAINS 0.387%

the so - called part 1 of the public service ,
But after Benjamin Netanyahu ruled out a Palestinian state as he
T Pick Stephen East on PA 1 hour ago I think
! He stepped off the platform and walked up
lan came under fire earlier this month after using prof anity
Token
Token ID247
Feature activation-1.19e-01

Pos logit contributions
roma+0.264
+0.221
 centimeters+0.217
opez+0.213
 Aden+0.212
Neg logit contributions
buster-0.285
 tort-0.259
adr-0.255
 dere-0.254
 treadmill-0.250
Tokenve
Token ID303
Feature activation+2.73e-02

Pos logit contributions
roma+0.613
+0.525
 centimeters+0.514
opez+0.506
 Aden+0.505
Neg logit contributions
buster-0.498
 tort-0.447
adr-0.436
 dere-0.435
 treadmill-0.435
Token had
Token ID550
Feature activation-1.05e-01

Pos logit contributions
buster+0.139
 tort+0.128
 dere+0.125
adr+0.125
 treadmill+0.124
Neg logit contributions
roma-0.117
-0.096
 centimeters-0.094
opez-0.092
 Aden-0.092
Token as
Token ID355
Feature activation-5.00e-01

Pos logit contributions
roma+0.497
+0.418
 centimeters+0.409
opez+0.402
 Aden+0.399
Neg logit contributions
buster-0.488
 tort-0.443
 dere-0.434
adr-0.432
 treadmill-0.429
Token pres
Token ID906
Feature activation-1.77e-01

Pos logit contributions
roma+2.433
+2.065
 centimeters+2.028
 Aden+1.998
opez+1.991
Neg logit contributions
buster-2.232
 tort-2.004
adr-1.973
 treadmill-1.965
ETF-1.954
Token­
Token ID3907
Feature activation+1.60e+00

Pos logit contributions
roma+0.976
+0.846
 centimeters+0.830
opez+0.814
 Braun+0.812
Neg logit contributions
buster-0.693
 tort-0.613
adr-0.602
 dere-0.599
 treadmill-0.589
Tokenid
Token ID312
Feature activation+8.32e-01

Pos logit contributions
buster+5.925
adr+5.060
 dere+4.692
 tort+4.666
ETF+4.617
Neg logit contributions
roma-8.622
-7.780
 centimeters-7.699
-7.629
 Butter-7.558
Token­
Token ID3907
Feature activation+1.17e+00

Pos logit contributions
buster+3.029
 tort+2.662
adr+2.648
 dere+2.611
ETF+2.551
Neg logit contributions
roma-4.731
-4.119
 centimeters-4.077
opez-3.998
 Braun-3.966
Tokenent
Token ID298
Feature activation+4.91e-01

Pos logit contributions
buster+3.860
adr+3.220
 tort+3.111
 dere+3.080
ETF+2.886
Neg logit contributions
roma-6.965
-6.396
 centimeters-6.253
-6.215
 Butter-6.062
Token since
Token ID1201
Feature activation+1.14e-01

Pos logit contributions
buster+2.167
 tort+1.968
 dere+1.937
adr+1.904
 treadmill+1.865
Neg logit contributions
roma-2.458
-2.063
 centimeters-2.008
opez-1.983
 Aden-1.952
Token at
Token ID379
Feature activation-2.81e-01

Pos logit contributions
buster+0.535
 tort+0.491
 dere+0.481
adr+0.477
 treadmill+0.473
Neg logit contributions
roma-0.532
-0.447
 centimeters-0.436
opez-0.429
 Aden-0.422
Token thrust
Token ID14613
Feature activation-2.29e-02

Pos logit contributions
buster+0.907
 tort+0.815
 dere+0.797
adr+0.786
 treadmill+0.784
Neg logit contributions
roma-1.150
-0.983
 centimeters-0.967
opez-0.950
 Aden-0.948
Token of
Token ID286
Feature activation-2.48e-01

Pos logit contributions
roma+0.107
+0.090
 centimeters+0.088
opez+0.087
 Aden+0.086
Neg logit contributions
buster-0.108
 tort-0.098
 dere-0.096
adr-0.096
 treadmill-0.095
Token his
Token ID465
Feature activation-2.68e-01

Pos logit contributions
roma+1.201
+1.019
 centimeters+1.000
opez+0.982
 Aden+0.979
Neg logit contributions
buster-1.123
 tort-1.010
adr-0.991
 dere-0.986
 treadmill-0.984
Token 2012
Token ID2321
Feature activation-1.10e-01

Pos logit contributions
roma+1.420
+1.226
 centimeters+1.204
opez+1.184
 Aden+1.183
Neg logit contributions
buster-1.087
 tort-0.964
adr-0.945
 dere-0.936
 treadmill-0.935
Token pres
Token ID906
Feature activation-5.93e-01

Pos logit contributions
roma+0.596
+0.515
 centimeters+0.506
opez+0.498
 Aden+0.496
Neg logit contributions
buster-0.433
 tort-0.385
adr-0.375
 dere-0.374
 treadmill-0.372
Token­
Token ID3907
Feature activation+1.51e+00

Pos logit contributions
roma+3.077
+2.629
 centimeters+2.583
opez+2.564
 Aden+2.558
Neg logit contributions
buster-2.449
 tort-2.205
 treadmill-2.152
 dere-2.139
adr-2.125
Tokenid
Token ID312
Feature activation+5.79e-01

Pos logit contributions
buster+5.697
adr+4.930
 dere+4.599
 tort+4.589
ETF+4.554
Neg logit contributions
roma-7.991
 centimeters-7.205
-7.180
-7.004
 Butter-7.001
Token­
Token ID3907
Feature activation+1.15e+00

Pos logit contributions
buster+2.180
 tort+1.931
adr+1.907
 dere+1.884
ETF+1.850
Neg logit contributions
roma-3.239
-2.816
 centimeters-2.784
opez-2.731
 Braun-2.710
Tokenen
Token ID268
Feature activation+4.11e-01

Pos logit contributions
buster+3.799
adr+3.187
 tort+3.067
 dere+3.046
ETF+2.880
Neg logit contributions
roma-6.811
-6.234
 centimeters-6.084
-6.045
龍契士-5.945
Token­
Token ID3907
Feature activation+1.23e+00

Pos logit contributions
buster+1.535
 tort+1.355
adr+1.338
 dere+1.330
 treadmill+1.291
Neg logit contributions
roma-2.326
-2.029
 centimeters-1.984
opez-1.948
 Aden-1.939
Tokent
Token ID83
Feature activation+7.10e-01

Pos logit contributions
buster+4.777
 tort+4.085
adr+4.079
 dere+3.841
ETF+3.836
Neg logit contributions
roma-6.532
 centimeters-5.830
-5.803
龍契士-5.690
opez-5.644
Tokendi
Token ID10989
Feature activation+3.53e-01

Pos logit contributions
buster+2.970
adr+2.642
 tort+2.639
 dere+2.600
 treadmill+2.554
Neg logit contributions
roma-2.835
-2.455
 centimeters-2.395
 Braun-2.356
 Aden-2.353
Token­
Token ID3907
Feature activation+5.51e-01

Pos logit contributions
buster+1.305
 tort+1.166
adr+1.146
 dere+1.138
 treadmill+1.121
Neg logit contributions
roma-1.986
-1.741
 centimeters-1.703
opez-1.675
 Aden-1.675
Tokenum
Token ID388
Feature activation+6.77e-02

Pos logit contributions
buster+2.682
adr+2.351
 tort+2.339
 dere+2.294
 treadmill+2.287
Neg logit contributions
roma-2.493
-2.136
 centimeters-2.097
 Braun-2.046
 Aden-2.042
Token and
Token ID290
Feature activation+6.02e-02

Pos logit contributions
buster+0.295
 tort+0.267
 dere+0.262
adr+0.261
 treadmill+0.258
Neg logit contributions
roma-0.340
-0.290
 centimeters-0.285
opez-0.279
 Braun-0.277
Token al
Token ID435
Feature activation-2.06e-01

Pos logit contributions
buster+0.272
 tort+0.246
 dere+0.241
adr+0.241
 treadmill+0.239
Neg logit contributions
roma-0.293
-0.248
 centimeters-0.243
opez-0.239
 Aden-0.237
Token­
Token ID3907
Feature activation+1.49e+00

Pos logit contributions
roma+1.110
+0.957
 centimeters+0.940
opez+0.925
 Aden+0.922
Neg logit contributions
buster-0.821
 tort-0.732
adr-0.714
 dere-0.713
 treadmill-0.707
Tokenl
Token ID75
Feature activation+9.42e-01

Pos logit contributions
buster+7.193
adr+6.359
laws+6.247
 tort+6.138
 dere+6.032
Neg logit contributions
roma-6.220
-5.549
 centimeters-5.386
 Butter-5.225
-5.202
Tokenowed
Token ID6972
Feature activation-2.13e-01

Pos logit contributions
buster+3.834
adr+3.545
 tort+3.440
 dere+3.391
ETF+3.274
Neg logit contributions
roma-4.679
-4.185
 centimeters-4.097
 Aden-3.979
 Butter-3.948
Token only
Token ID691
Feature activation+3.61e-02

Pos logit contributions
roma+0.956
+0.800
 centimeters+0.782
opez+0.768
 Aden+0.765
Neg logit contributions
buster-1.036
 tort-0.942
adr-0.925
 dere-0.921
 treadmill-0.918
Token a
Token ID257
Feature activation+1.44e-01

Pos logit contributions
buster+0.179
 tort+0.164
 dere+0.160
adr+0.160
 treadmill+0.159
Neg logit contributions
roma-0.161
-0.133
 centimeters-0.130
opez-0.128
 Braun-0.127
Token brief
Token ID4506
Feature activation-1.16e-01

Pos logit contributions
buster+0.610
 tort+0.553
 dere+0.540
adr+0.537
 treadmill+0.534
Neg logit contributions
roma-0.737
-0.629
 centimeters-0.617
opez-0.609
 Aden-0.603
Tokenprime
Token ID35505
Feature activation-2.46e-01

Pos logit contributions
buster+0.217
 tort+0.197
adr+0.193
 dere+0.192
 treadmill+0.192
Neg logit contributions
roma-0.221
-0.187
 centimeters-0.183
opez-0.179
 Aden-0.179
Token-
Token ID12
Feature activation+4.88e-02

Pos logit contributions
roma+1.212
+1.037
 centimeters+1.012
 Aden+0.998
opez+0.995
Neg logit contributions
buster-1.093
 tort-0.982
adr-0.964
 dere-0.955
 treadmill-0.953
Tokentime
Token ID2435
Feature activation-1.66e-01

Pos logit contributions
buster+0.210
 tort+0.191
 dere+0.186
adr+0.186
 treadmill+0.185
Neg logit contributions
roma-0.245
-0.210
 centimeters-0.205
opez-0.201
 Aden-0.201
Token video
Token ID2008
Feature activation-3.88e-01

Pos logit contributions
roma+0.898
+0.781
 centimeters+0.766
 Aden+0.756
 Braun+0.753
Neg logit contributions
buster-0.653
 tort-0.575
adr-0.565
 treadmill-0.558
 dere-0.557
Token mes
Token ID18842
Feature activation+1.89e-01

Pos logit contributions
roma+1.973
+1.708
 centimeters+1.666
 Aden+1.644
 Braun+1.642
Neg logit contributions
buster-1.645
 tort-1.454
adr-1.439
 treadmill-1.417
 dere-1.416
Token­
Token ID3907
Feature activation+1.45e+00

Pos logit contributions
buster+0.706
 tort+0.628
adr+0.612
 dere+0.602
 treadmill+0.591
Neg logit contributions
roma-1.075
-0.931
 centimeters-0.916
opez-0.899
 Aden-0.898
Tokens
Token ID82
Feature activation+8.04e-01

Pos logit contributions
buster+6.424
 tort+5.582
adr+5.559
ETF+5.526
DAY+5.229
Neg logit contributions
roma-6.747
 centimeters-6.000
-5.876
龍契士-5.850
 Braun-5.754
Tokenage
Token ID496
Feature activation+5.06e-02

Pos logit contributions
buster+3.748
 tort+3.388
adr+3.384
ETF+3.245
 dere+3.223
Neg logit contributions
roma-3.697
-3.222
 centimeters-3.161
 Braun-3.064
-3.012
Token.
Token ID13
Feature activation+1.88e-01

Pos logit contributions
buster+0.239
 tort+0.217
 dere+0.212
adr+0.212
 treadmill+0.211
Neg logit contributions
roma-0.237
-0.199
 centimeters-0.195
opez-0.191
 Aden-0.190
Token This
Token ID770
Feature activation+2.08e-01

Pos logit contributions
buster+0.861
 tort+0.777
adr+0.765
 dere+0.763
ETF+0.752
Neg logit contributions
roma-0.905
-0.754
 centimeters-0.747
opez-0.735
 Braun-0.718
Token led
Token ID2957
Feature activation+5.62e-01

Pos logit contributions
buster+0.895
 tort+0.811
 dere+0.797
adr+0.789
 treadmill+0.782
Neg logit contributions
roma-1.058
-0.898
 centimeters-0.882
opez-0.870
 Aden-0.861
Tokening
Token ID278
Feature activation-1.45e-01

Pos logit contributions
buster+3.425
 tort+3.078
 dere+3.057
adr+3.043
 treadmill+2.934
Neg logit contributions
roma-2.974
-2.503
 centimeters-2.496
 Butter-2.384
opez-2.375
Token Jimmy
Token ID12963
Feature activation+6.28e-02

Pos logit contributions
roma+0.722
+0.614
 centimeters+0.601
opez+0.593
 Aden+0.588
Neg logit contributions
buster-0.643
 tort-0.581
 dere-0.569
adr-0.568
 treadmill-0.564
Token Carter
Token ID10831
Feature activation-1.35e-01

Pos logit contributions
buster+0.321
 tort+0.297
 dere+0.290
adr+0.287
ETF+0.286
Neg logit contributions
roma-0.270
-0.220
 centimeters-0.215
opez-0.212
 Aden-0.208
Token.
Token ID13
Feature activation+7.27e-02

Pos logit contributions
roma+0.645
+0.544
 centimeters+0.532
opez+0.525
 Aden+0.520
Neg logit contributions
buster-0.624
 tort-0.566
 dere-0.555
adr-0.553
 treadmill-0.547
Token Al
Token ID978
Feature activation+1.04e-01

Pos logit contributions
buster+0.334
 tort+0.300
adr+0.295
 dere+0.294
ETF+0.290
Neg logit contributions
roma-0.353
-0.294
 centimeters-0.291
opez-0.288
 Braun-0.279
Token­
Token ID3907
Feature activation+1.44e+00

Pos logit contributions
buster+0.474
 tort+0.435
adr+0.421
 dere+0.420
 treadmill+0.416
Neg logit contributions
roma-0.496
-0.420
 centimeters-0.411
opez-0.399
 Braun-0.399
Tokenthough
Token ID2016
Feature activation-2.00e-01

Pos logit contributions
buster+7.046
adr+6.204
 tort+6.034
laws+6.014
 dere+5.926
Neg logit contributions
roma-5.944
-5.224
 centimeters-5.179
龍契士-5.006
-4.939
Token he
Token ID339
Feature activation-1.83e-01

Pos logit contributions
roma+0.901
+0.751
 centimeters+0.735
opez+0.721
 Braun+0.715
Neg logit contributions
buster-0.977
 tort-0.892
 dere-0.875
adr-0.873
 treadmill-0.867
Token left
Token ID1364
Feature activation-6.30e-01

Pos logit contributions
roma+0.848
+0.712
 centimeters+0.698
opez+0.684
 Braun+0.682
Neg logit contributions
buster-0.869
 tort-0.786
adr-0.772
 dere-0.769
 treadmill-0.768
Token the
Token ID262
Feature activation+3.98e-02

Pos logit contributions
roma+2.927
+2.469
 centimeters+2.421
 Aden+2.378
opez+2.371
Neg logit contributions
buster-2.971
 tort-2.684
adr-2.629
 dere-2.619
 treadmill-2.613
Token White
Token ID2635
Feature activation-2.13e-01

Pos logit contributions
buster+0.196
 tort+0.180
 dere+0.177
adr+0.176
 treadmill+0.175
Neg logit contributions
roma-0.178
-0.148
 centimeters-0.144
opez-0.142
 Braun-0.140
Token name
Token ID1438
Feature activation-5.34e-01

Pos logit contributions
roma+0.320
+0.274
 centimeters+0.269
opez+0.265
 Braun+0.263
Neg logit contributions
buster-0.256
 tort-0.230
 dere-0.225
adr-0.224
 treadmill-0.222
Token is
Token ID318
Feature activation-4.73e-01

Pos logit contributions
roma+2.600
+2.229
 centimeters+2.196
 Braun+2.156
 Aden+2.145
Neg logit contributions
buster-2.360
 tort-2.110
adr-2.076
 treadmill-2.070
 dere-2.066
Token still
Token ID991
Feature activation-1.93e-01

Pos logit contributions
roma+2.163
+1.832
 centimeters+1.793
 Aden+1.756
 Braun+1.754
Neg logit contributions
buster-2.256
 tort-2.031
adr-1.998
 treadmill-1.988
 dere-1.984
Token a
Token ID257
Feature activation-1.94e-01

Pos logit contributions
roma+0.930
+0.787
 centimeters+0.772
opez+0.758
 Aden+0.755
Neg logit contributions
buster-0.877
 tort-0.793
adr-0.776
 dere-0.775
 treadmill-0.770
Token polit
Token ID1408
Feature activation-4.09e-01

Pos logit contributions
roma+0.978
+0.830
 centimeters+0.817
opez+0.806
 Aden+0.799
Neg logit contributions
buster-0.848
 tort-0.768
 dere-0.752
adr-0.746
 treadmill-0.743
Token­
Token ID3907
Feature activation+1.44e+00

Pos logit contributions
roma+2.165
+1.863
 centimeters+1.826
opez+1.801
 Aden+1.792
Neg logit contributions
buster-1.667
 tort-1.493
 dere-1.452
adr-1.451
 treadmill-1.446
Tokenic
Token ID291
Feature activation+5.75e-01

Pos logit contributions
buster+5.105
adr+4.152
 tort+4.143
 treadmill+4.004
 dere+3.948
Neg logit contributions
roma-7.876
龍契士-6.994
 centimeters-6.981
-6.948
-6.914
Token­
Token ID3907
Feature activation+6.33e-01

Pos logit contributions
buster+2.143
 tort+1.922
 dere+1.883
adr+1.862
 treadmill+1.859
Neg logit contributions
roma-3.217
-2.806
 centimeters-2.742
 Aden-2.725
opez-2.719
Tokenal
Token ID282
Feature activation+6.78e-02

Pos logit contributions
buster+3.170
 tort+2.790
adr+2.783
 treadmill+2.724
 dere+2.682
Neg logit contributions
roma-2.755
-2.296
 centimeters-2.287
 Butter-2.243
 Aden-2.228
Token in
Token ID287
Feature activation-1.07e-01

Pos logit contributions
buster+0.282
 tort+0.255
 dere+0.250
 treadmill+0.246
adr+0.245
Neg logit contributions
roma-0.356
-0.302
 centimeters-0.298
opez-0.295
 Aden-0.292
Token­
Token ID3907
Feature activation+1.19e+00

Pos logit contributions
roma+0.575
+0.490
 centimeters+0.485
opez+0.477
 Braun+0.475
Neg logit contributions
buster-0.428
 tort-0.386
 dere-0.378
adr-0.376
 treadmill-0.371
Token )
Token ID1267
Feature activation+4.26e-01

Pos logit contributions
buster+2.353
 tort+2.130
adr+2.085
ETF+2.060
 dere+2.048
Neg logit contributions
roma-3.225
-2.815
 Aden-2.737
opez-2.711
 centimeters-2.705
Token AS
Token ID7054
Feature activation-7.87e-02

Pos logit contributions
buster+1.675
 tort+1.512
adr+1.479
 dere+1.468
ETF+1.464
Neg logit contributions
roma-2.296
-1.993
opez-1.924
 centimeters-1.916
 Aden-1.910
Token count
Token ID954
Feature activation+1.29e-01

Pos logit contributions
roma+0.396
+0.338
 centimeters+0.330
opez+0.326
 Aden+0.324
Neg logit contributions
buster-0.344
 tort-0.310
adr-0.302
 dere-0.302
 treadmill-0.300
Token FROM
Token ID16034
Feature activation+1.90e-01

Pos logit contributions
buster+0.622
 tort+0.564
adr+0.556
 dere+0.550
 treadmill+0.548
Neg logit contributions
roma-0.591
-0.497
 centimeters-0.482
opez-0.478
 Aden-0.475
Token (
Token ID357
Feature activation-1.91e-01

Pos logit contributions
buster+0.840
 tort+0.764
 dere+0.745
 treadmill+0.743
adr+0.741
Neg logit contributions
roma-0.939
-0.803
 Aden-0.775
opez-0.774
 centimeters-0.773
Token SELECT
Token ID33493
Feature activation+1.43e+00

Pos logit contributions
roma+0.942
+0.799
 centimeters+0.784
opez+0.770
 Braun+0.767
Neg logit contributions
buster-0.853
 tort-0.770
adr-0.753
 dere-0.752
 treadmill-0.747
Token SPL
Token ID46341
Feature activation+9.40e-01

Pos logit contributions
buster+5.920
 tort+5.384
 dere+5.164
adr+5.145
 treadmill+5.126
Neg logit contributions
roma-6.896
-5.960
 Aden-5.874
 Braun-5.738
opez-5.728
TokenIT
Token ID2043
Feature activation+1.84e-01

Pos logit contributions
buster+2.666
ETF+2.335
 tort+2.325
adr+2.320
 treadmill+2.151
Neg logit contributions
roma-6.026
 centimeters-5.181
opez-5.159
-5.134
 Braun-5.122
Token (
Token ID357
Feature activation-1.80e-01

Pos logit contributions
buster+0.810
 tort+0.725
adr+0.712
ETF+0.707
 dere+0.702
Neg logit contributions
roma-0.918
-0.780
 centimeters-0.765
opez-0.758
 Aden-0.756
Token content
Token ID2695
Feature activation+4.77e-01

Pos logit contributions
roma+0.896
+0.762
 centimeters+0.748
opez+0.735
 Aden+0.732
Neg logit contributions
buster-0.793
 tort-0.714
adr-0.698
 dere-0.697
 treadmill-0.693
Token,
Token ID837
Feature activation-2.52e-02

Pos logit contributions
buster+2.332
 tort+2.108
adr+2.102
ETF+2.063
 dere+2.039
Neg logit contributions
roma-2.126
-1.777
 Aden-1.728
 centimeters-1.721
opez-1.719
Token >
Token ID1875
Feature activation+6.07e-01

Pos logit contributions
buster+5.127
 tort+4.606
adr+4.500
 dere+4.470
ETF+4.340
Neg logit contributions
roma-6.775
-5.747
opez-5.616
 Aden-5.611
 centimeters-5.514
Token &
Token ID1222
Feature activation+7.45e-01

Pos logit contributions
buster+2.493
 tort+2.259
 dere+2.202
adr+2.171
 treadmill+2.171
Neg logit contributions
roma-3.173
-2.714
opez-2.618
 Aden-2.613
 centimeters-2.604
Token density
Token ID12109
Feature activation+4.91e-01

Pos logit contributions
buster+3.165
 tort+2.861
adr+2.835
 dere+2.789
 treadmill+2.752
Neg logit contributions
roma-3.757
-3.226
 centimeters-3.093
opez-3.086
 Aden-3.086
TokenGrid
Token ID41339
Feature activation-1.18e-01

Pos logit contributions
buster+1.992
 tort+1.770
adr+1.753
ETF+1.726
 dere+1.724
Neg logit contributions
roma-2.603
-2.224
 centimeters-2.174
opez-2.167
 Aden-2.150
Token,
Token ID837
Feature activation+6.99e-01

Pos logit contributions
roma+0.609
+0.521
 centimeters+0.512
opez+0.504
 Aden+0.501
Neg logit contributions
buster-0.498
 tort-0.446
adr-0.436
 dere-0.435
 treadmill-0.432
Token const
Token ID1500
Feature activation+1.43e+00

Pos logit contributions
buster+2.702
 tort+2.450
 dere+2.383
 treadmill+2.371
adr+2.352
Neg logit contributions
roma-3.781
-3.315
opez-3.193
 centimeters-3.158
 Aden-3.121
Token Vector
Token ID20650
Feature activation+3.65e-01

Pos logit contributions
buster+6.037
 tort+5.576
 treadmill+5.431
 dere+5.377
 WARN+5.334
Neg logit contributions
roma-6.824
-5.919
opez-5.676
 centimeters-5.525
rition-5.504
Token<
Token ID27
Feature activation+2.86e-01

Pos logit contributions
buster+1.805
 tort+1.647
adr+1.614
 dere+1.612
 treadmill+1.599
Neg logit contributions
roma-1.615
-1.338
 centimeters-1.307
opez-1.292
 Aden-1.277
Token Part
Token ID2142
Feature activation+1.23e+00

Pos logit contributions
buster+0.945
 tort+0.828
adr+0.802
 dere+0.799
 treadmill+0.790
Neg logit contributions
roma-1.731
-1.521
 centimeters-1.483
opez-1.477
 Aden-1.469
Tokenicle
Token ID1548
Feature activation+5.63e-01

Pos logit contributions
buster+5.048
 tort+4.384
ETF+4.286
adr+4.241
 dere+4.220
Neg logit contributions
roma-6.103
-5.357
opez-5.241
 Aden-5.239
 centimeters-5.201
Token >
Token ID1875
Feature activation+8.27e-01

Pos logit contributions
buster+2.307
 tort+2.051
adr+2.002
 dere+1.998
ETF+1.985
Neg logit contributions
roma-2.962
-2.533
 centimeters-2.481
opez-2.478
 Aden-2.455
Token --
Token ID1377
Feature activation-2.53e-01

Pos logit contributions
buster+1.249
 tort+1.143
 dere+1.104
adr+1.098
 treadmill+1.096
Neg logit contributions
roma-1.424
-1.203
 centimeters-1.178
opez-1.164
 Braun-1.154
Token they
Token ID484
Feature activation+3.16e-01

Pos logit contributions
roma+1.200
+1.013
 centimeters+0.994
opez+0.975
 Aden+0.973
Neg logit contributions
buster-1.174
 tort-1.059
adr-1.041
 dere-1.038
 treadmill-1.032
Token have
Token ID423
Feature activation+5.13e-01

Pos logit contributions
buster+1.400
 tort+1.276
 dere+1.238
 treadmill+1.227
adr+1.223
Neg logit contributions
roma-1.566
-1.342
 centimeters-1.302
opez-1.291
 Aden-1.277
Token little
Token ID1310
Feature activation-9.97e-02

Pos logit contributions
buster+2.322
 tort+2.205
 dere+2.124
 treadmill+2.093
adr+2.059
Neg logit contributions
roma-2.463
-2.086
opez-1.997
 centimeters-1.973
 Braun-1.934
Token in
Token ID287
Feature activation+4.89e-01

Pos logit contributions
roma+0.480
+0.405
 centimeters+0.394
opez+0.389
 Braun+0.386
Neg logit contributions
buster-0.458
 tort-0.417
 dere-0.407
adr-0.405
 treadmill-0.404
Token common
Token ID2219
Feature activation+1.42e+00

Pos logit contributions
buster+2.313
 tort+2.210
 dere+2.129
adr+2.119
 treadmill+2.096
Neg logit contributions
roma-2.220
-1.825
opez-1.774
 centimeters-1.773
 Braun-1.757
Token at
Token ID379
Feature activation+1.05e-01

Pos logit contributions
buster+5.936
 tort+5.732
 dere+5.404
 treadmill+5.295
ETF+5.192
Neg logit contributions
roma-7.038
-5.773
opez-5.489
 centimeters-5.480
rition-5.445
Token first
Token ID717
Feature activation+5.56e-01

Pos logit contributions
buster+0.525
 tort+0.488
 dere+0.477
 treadmill+0.471
adr+0.468
Neg logit contributions
roma-0.454
-0.379
 centimeters-0.364
opez-0.360
 Braun-0.355
Token except
Token ID2845
Feature activation+7.46e-01

Pos logit contributions
buster+2.426
 tort+2.256
 dere+2.165
 treadmill+2.154
ETF+2.111
Neg logit contributions
roma-2.773
-2.353
opez-2.268
 centimeters-2.238
 Aden-2.209
Token their
Token ID511
Feature activation+5.48e-01

Pos logit contributions
buster+3.388
 tort+3.199
 dere+3.102
 treadmill+3.055
adr+2.968
Neg logit contributions
roma-3.564
-2.976
opez-2.844
 Braun-2.817
 centimeters-2.815
Token love
Token ID1842
Feature activation+2.75e-01

Pos logit contributions
buster+2.474
 tort+2.347
 dere+2.281
 treadmill+2.235
adr+2.177
Neg logit contributions
roma-2.647
-2.228
opez-2.127
 Braun-2.115
 centimeters-2.114
Tokenicle
Token ID1548
Feature activation-1.71e-02

Pos logit contributions
buster+2.158
 tort+1.919
adr+1.877
 dere+1.867
 treadmill+1.855
Neg logit contributions
roma-2.294
-1.984
 centimeters-1.923
opez-1.921
 Aden-1.913
Token::
Token ID3712
Feature activation+3.19e-01

Pos logit contributions
roma+0.081
+0.069
 centimeters+0.067
opez+0.066
 Aden+0.066
Neg logit contributions
buster-0.080
 tort-0.072
adr-0.070
 dere-0.070
 treadmill-0.070
TokenGet
Token ID3855
Feature activation-4.89e-01

Pos logit contributions
buster+1.305
 tort+1.145
adr+1.143
ETF+1.120
 dere+1.120
Neg logit contributions
roma-1.685
-1.444
 centimeters-1.424
opez-1.416
 Braun-1.405
TokenMass
Token ID20273
Feature activation+7.34e-03

Pos logit contributions
roma+1.935
+1.571
 centimeters+1.535
opez+1.495
 Aden+1.487
Neg logit contributions
buster-2.652
 tort-2.441
 dere-2.396
adr-2.394
 treadmill-2.386
Token(
Token ID7
Feature activation+6.82e-01

Pos logit contributions
buster+0.032
 tort+0.029
adr+0.028
 dere+0.028
 treadmill+0.028
Neg logit contributions
roma-0.037
-0.031
 centimeters-0.031
opez-0.030
 Aden-0.030
Token float
Token ID12178
Feature activation+1.40e+00

Pos logit contributions
buster+2.362
 tort+2.063
adr+2.049
 dere+1.959
 treadmill+1.956
Neg logit contributions
roma-3.963
-3.491
 centimeters-3.415
opez-3.410
 Aden-3.386
Token ambient
Token ID25237
Feature activation+3.27e-03

Pos logit contributions
buster+5.831
 tort+5.411
 dere+5.237
adr+5.116
 treadmill+5.085
Neg logit contributions
roma-6.714
-5.688
opez-5.504
 Aden-5.466
-5.451
TokenD
Token ID35
Feature activation+1.35e-01

Pos logit contributions
buster+0.015
 tort+0.013
adr+0.013
 dere+0.013
 treadmill+0.013
Neg logit contributions
roma-0.016
-0.014
 centimeters-0.013
opez-0.013
 Aden-0.013
Tokenensity
Token ID6377
Feature activation+4.83e-01

Pos logit contributions
buster+0.737
 tort+0.683
adr+0.674
 dere+0.667
 treadmill+0.666
Neg logit contributions
roma-0.522
-0.428
 centimeters-0.414
opez-0.405
 Aden-0.404
Token )
Token ID1267
Feature activation+8.15e-01

Pos logit contributions
buster+1.958
 tort+1.762
adr+1.723
 dere+1.713
 treadmill+1.698
Neg logit contributions
roma-2.552
-2.195
 centimeters-2.141
opez-2.136
 Aden-2.119
Token const
Token ID1500
Feature activation+1.06e+00

Pos logit contributions
buster+3.362
 tort+3.102
adr+2.987
 dere+2.967
 treadmill+2.960
Neg logit contributions
roma-4.137
-3.560
opez-3.472
 centimeters-3.381
 Braun-3.365
Token to
Token ID284
Feature activation+1.97e-01

Pos logit contributions
roma+0.061
+0.050
 centimeters+0.048
opez+0.048
 Aden+0.047
Neg logit contributions
buster-0.076
 tort-0.069
 dere-0.068
adr-0.068
 treadmill-0.068
Token get
Token ID651
Feature activation+4.85e-01

Pos logit contributions
buster+0.863
 tort+0.780
 dere+0.760
adr+0.758
 treadmill+0.755
Neg logit contributions
roma-0.990
-0.843
 centimeters-0.822
opez-0.815
 Aden-0.809
Token the
Token ID262
Feature activation+3.94e-01

Pos logit contributions
buster+2.252
 tort+2.077
 dere+2.025
 treadmill+2.004
adr+2.003
Neg logit contributions
roma-2.285
-1.919
opez-1.844
 Aden-1.831
 centimeters-1.827
Token most
Token ID749
Feature activation+5.24e-01

Pos logit contributions
buster+1.760
 tort+1.609
 dere+1.563
adr+1.551
 treadmill+1.551
Neg logit contributions
roma-1.926
-1.635
 centimeters-1.583
opez-1.579
 Aden-1.571
Token out
Token ID503
Feature activation+9.88e-01

Pos logit contributions
buster+2.262
 tort+2.085
 dere+2.013
 treadmill+1.974
adr+1.963
Neg logit contributions
roma-2.614
-2.231
opez-2.161
 Aden-2.158
 centimeters-2.143
Token of
Token ID286
Feature activation+1.38e+00

Pos logit contributions
buster+3.977
 tort+3.663
 dere+3.520
ETF+3.461
adr+3.421
Neg logit contributions
roma-5.019
-4.304
 centimeters-4.218
opez-4.200
 Aden-4.152
Token the
Token ID262
Feature activation+9.00e-01

Pos logit contributions
buster+5.939
 tort+5.623
 dere+5.409
ETF+5.185
 treadmill+5.176
Neg logit contributions
roma-6.520
-5.324
opez-5.260
 Aden-5.182
rition-5.100
Token limited
Token ID3614
Feature activation+7.50e-02

Pos logit contributions
buster+4.103
 tort+3.838
 dere+3.733
 treadmill+3.653
adr+3.612
Neg logit contributions
roma-4.176
-3.527
opez-3.428
 Aden-3.375
 centimeters-3.363
Token number
Token ID1271
Feature activation+5.83e-02

Pos logit contributions
buster+0.319
 tort+0.287
 dere+0.280
adr+0.280
 treadmill+0.278
Neg logit contributions
roma-0.385
-0.329
 centimeters-0.323
opez-0.318
 Aden-0.317
Token of
Token ID286
Feature activation+4.19e-01

Pos logit contributions
buster+0.277
 tort+0.252
adr+0.246
 dere+0.246
 treadmill+0.244
Neg logit contributions
roma-0.271
-0.227
 centimeters-0.222
opez-0.218
 Aden-0.218
Token pixels
Token ID17848
Feature activation+2.51e-01

Pos logit contributions
buster+1.874
 tort+1.718
 dere+1.667
adr+1.658
 treadmill+1.646
Neg logit contributions
roma-2.041
-1.735
opez-1.664
 centimeters-1.656
 Aden-1.655
Token­
Token ID3907
Feature activation+8.51e-01

Pos logit contributions
buster+1.160
 tort+1.027
adr+1.014
 dere+1.002
 treadmill+0.978
Neg logit contributions
roma-1.782
-1.560
 centimeters-1.521
opez-1.503
 Aden-1.496
Tokenon
Token ID261
Feature activation+4.24e-01

Pos logit contributions
buster+4.006
adr+3.478
 tort+3.425
 treadmill+3.301
 dere+3.290
Neg logit contributions
roma-3.995
-3.446
 centimeters-3.377
 Butter-3.327
 Aden-3.315
Token to
Token ID284
Feature activation-4.53e-02

Pos logit contributions
buster+1.946
 tort+1.785
 dere+1.764
adr+1.726
 treadmill+1.710
Neg logit contributions
roma-2.016
-1.689
 centimeters-1.638
opez-1.637
 Aden-1.615
Token this
Token ID428
Feature activation+4.35e-01

Pos logit contributions
roma+0.211
+0.176
 centimeters+0.173
opez+0.170
 Braun+0.168
Neg logit contributions
buster-0.214
 tort-0.196
 dere-0.192
adr-0.191
 treadmill-0.190
Token pres
Token ID906
Feature activation-4.73e-01

Pos logit contributions
buster+1.831
 tort+1.688
 dere+1.664
adr+1.616
 treadmill+1.614
Neg logit contributions
roma-2.252
-1.903
 centimeters-1.870
opez-1.840
 Braun-1.808
Token­
Token ID3907
Feature activation+1.37e+00

Pos logit contributions
roma+2.529
+2.178
 centimeters+2.139
opez+2.107
 Aden+2.098
Neg logit contributions
buster-1.907
 tort-1.702
adr-1.659
 dere-1.658
 treadmill-1.646
Tokenid
Token ID312
Feature activation+6.70e-01

Pos logit contributions
buster+5.402
adr+4.577
 tort+4.419
ETF+4.389
 dere+4.345
Neg logit contributions
roma-7.005
-6.321
 centimeters-6.288
-6.177
 Butter-6.128
Token­
Token ID3907
Feature activation+1.13e+00

Pos logit contributions
buster+2.521
 tort+2.236
adr+2.209
 dere+2.178
ETF+2.122
Neg logit contributions
roma-3.734
-3.249
 centimeters-3.228
opez-3.151
 Aden-3.116
Tokenent
Token ID298
Feature activation+3.00e-01

Pos logit contributions
buster+3.740
adr+3.105
 tort+3.016
 dere+2.957
 treadmill+2.780
Neg logit contributions
roma-6.697
-6.111
 centimeters-6.105
-6.010
 Butter-5.917
Token,
Token ID11
Feature activation+2.00e-01

Pos logit contributions
buster+1.390
 tort+1.275
 dere+1.254
adr+1.232
 treadmill+1.222
Neg logit contributions
roma-1.435
-1.197
 centimeters-1.178
opez-1.158
 Aden-1.134
Token
Token ID447
Feature activation+3.37e-01

Pos logit contributions
buster+0.917
 tort+0.842
 dere+0.830
adr+0.812
 treadmill+0.807
Neg logit contributions
roma-0.966
-0.809
 centimeters-0.794
opez-0.787
 Braun-0.767
Token,
Token ID11
Feature activation+2.01e-01

Pos logit contributions
buster+3.919
 tort+3.676
 dere+3.520
adr+3.449
 treadmill+3.445
Neg logit contributions
roma-4.712
-3.961
opez-3.856
 Aden-3.757
 centimeters-3.720
Token and
Token ID290
Feature activation-1.31e-01

Pos logit contributions
buster+0.899
 tort+0.828
 dere+0.802
 treadmill+0.796
adr+0.792
Neg logit contributions
roma-0.989
-0.838
opez-0.814
 centimeters-0.808
 Aden-0.796
Token push
Token ID4574
Feature activation+2.01e-01

Pos logit contributions
roma+0.619
+0.521
 centimeters+0.510
opez+0.502
 Aden+0.498
Neg logit contributions
buster-0.612
 tort-0.556
 dere-0.543
adr-0.543
 treadmill-0.540
Token and
Token ID290
Feature activation+2.75e-01

Pos logit contributions
buster+0.934
 tort+0.854
 dere+0.833
adr+0.831
 treadmill+0.823
Neg logit contributions
roma-0.963
-0.802
opez-0.778
 centimeters-0.778
 Aden-0.764
Token j
Token ID474
Feature activation+4.30e-02

Pos logit contributions
buster+1.151
 tort+1.048
 dere+1.008
 treadmill+1.006
adr+0.998
Neg logit contributions
roma-1.440
-1.226
 centimeters-1.188
opez-1.186
 Aden-1.176
Tokenost
Token ID455
Feature activation+1.36e+00

Pos logit contributions
buster+0.181
 tort+0.160
adr+0.159
 dere+0.156
 treadmill+0.155
Neg logit contributions
roma-0.224
-0.192
 centimeters-0.190
 Braun-0.186
 Aden-0.185
Tokenle
Token ID293
Feature activation+5.72e-01

Pos logit contributions
buster+4.621
 tort+4.046
 dere+4.046
ETF+3.858
adr+3.812
Neg logit contributions
roma-7.754
-6.674
 centimeters-6.641
opez-6.579
 Aden-6.561
Token in
Token ID287
Feature activation+1.30e-01

Pos logit contributions
buster+2.573
 tort+2.379
 dere+2.307
 treadmill+2.271
adr+2.269
Neg logit contributions
roma-2.831
-2.351
opez-2.289
 centimeters-2.248
 Aden-2.231
Token that
Token ID326
Feature activation+1.39e-01

Pos logit contributions
buster+0.614
 tort+0.567
 dere+0.554
adr+0.550
 treadmill+0.549
Neg logit contributions
roma-0.604
-0.504
opez-0.488
 centimeters-0.487
 Braun-0.480
Token ritual
Token ID12146
Feature activation-1.20e-01

Pos logit contributions
buster+0.619
 tort+0.565
 dere+0.552
 treadmill+0.548
adr+0.548
Neg logit contributions
roma-0.688
-0.585
 centimeters-0.569
opez-0.566
 Braun-0.558
Tokenistic
Token ID2569
Feature activation-2.73e-01

Pos logit contributions
roma+0.589
+0.500
 centimeters+0.489
opez+0.483
 Aden+0.479
Neg logit contributions
buster-0.538
 tort-0.487
 dere-0.476
adr-0.474
 treadmill-0.473
Token Game
Token ID3776
Feature activation+3.04e-01

Pos logit contributions
buster+0.229
 tort+0.206
 dere+0.199
 treadmill+0.199
adr+0.198
Neg logit contributions
roma-0.292
-0.251
 centimeters-0.246
opez-0.243
 Aden-0.241
Token in
Token ID287
Feature activation+3.58e-01

Pos logit contributions
buster+1.370
 tort+1.213
 dere+1.186
adr+1.178
 treadmill+1.174
Neg logit contributions
roma-1.495
-1.276
 centimeters-1.254
opez-1.223
 Braun-1.220
Token 2016
Token ID1584
Feature activation+4.02e-01

Pos logit contributions
buster+1.718
 tort+1.587
 dere+1.553
 treadmill+1.551
adr+1.528
Neg logit contributions
roma-1.644
-1.373
 centimeters-1.328
opez-1.315
-1.270
Token (
Token ID357
Feature activation+4.16e-01

Pos logit contributions
buster+1.858
 tort+1.682
adr+1.631
 dere+1.629
 treadmill+1.622
Neg logit contributions
roma-1.925
-1.597
 centimeters-1.592
opez-1.565
 Aden-1.535
TokenPhoto
Token ID6191
Feature activation+4.37e-01

Pos logit contributions
buster+1.724
 tort+1.517
adr+1.487
 dere+1.463
ETF+1.461
Neg logit contributions
roma-2.181
 centimeters-1.860
-1.841
 Aden-1.819
 Butter-1.814
Token:
Token ID25
Feature activation+1.36e+00

Pos logit contributions
buster+1.993
 tort+1.798
 dere+1.747
adr+1.742
 treadmill+1.723
Neg logit contributions
roma-2.121
-1.796
 centimeters-1.745
opez-1.740
 Braun-1.680
Token Wayne
Token ID13329
Feature activation+6.47e-02

Pos logit contributions
buster+5.793
 tort+5.342
 WARN+5.165
 dere+5.160
 treadmill+5.110
Neg logit contributions
roma-6.429
opez-5.310
 centimeters-5.273
-5.264
-5.010
Token McG
Token ID11130
Feature activation+5.45e-02

Pos logit contributions
buster+0.315
 tort+0.286
 dere+0.279
adr+0.279
 treadmill+0.276
Neg logit contributions
roma-0.296
-0.245
 centimeters-0.242
opez-0.236
 Aden-0.230
Tokena
Token ID64
Feature activation+3.19e-02

Pos logit contributions
buster+0.226
adr+0.200
 tort+0.198
 dere+0.193
 treadmill+0.192
Neg logit contributions
roma-0.284
-0.247
 centimeters-0.245
 Aden-0.239
 Braun-0.238
Tokenhee
Token ID21067
Feature activation+2.17e-01

Pos logit contributions
buster+0.173
adr+0.157
 tort+0.157
 treadmill+0.156
 dere+0.155
Neg logit contributions
roma-0.124
-0.102
 centimeters-0.101
 Braun-0.098
 Aden-0.097
Token III
Token ID6711
Feature activation-8.19e-02

Pos logit contributions
buster+1.060
 tort+0.963
 dere+0.948
 treadmill+0.943
adr+0.941
Neg logit contributions
roma-0.992
-0.815
 centimeters-0.807
opez-0.788
 Aden-0.779
Token ]
Token ID2361
Feature activation+4.99e-01

Pos logit contributions
buster+1.202
 tort+1.066
adr+1.044
 dere+1.035
 treadmill+1.033
Neg logit contributions
roma-1.634
-1.415
 centimeters-1.377
 Aden-1.366
opez-1.365
Token WHERE
Token ID33411
Feature activation+6.09e-01

Pos logit contributions
buster+1.724
 tort+1.519
 treadmill+1.469
 dere+1.466
adr+1.455
Neg logit contributions
roma-2.940
-2.575
 centimeters-2.501
opez-2.501
 Aden-2.487
Token id
Token ID4686
Feature activation-1.26e-01

Pos logit contributions
buster+2.253
 tort+2.015
 dere+1.974
 treadmill+1.954
adr+1.946
Neg logit contributions
roma-3.434
-2.987
opez-2.891
 centimeters-2.876
 Aden-2.854
Token IN
Token ID3268
Feature activation-1.31e-01

Pos logit contributions
roma+0.608
+0.515
 centimeters+0.504
opez+0.496
 Aden+0.494
Neg logit contributions
buster-0.571
 tort-0.516
adr-0.505
 dere-0.504
 treadmill-0.501
Token (
Token ID357
Feature activation-1.49e-01

Pos logit contributions
roma+0.687
+0.590
 centimeters+0.580
opez+0.570
 Braun+0.567
Neg logit contributions
buster-0.543
 tort-0.485
 dere-0.473
adr-0.473
 treadmill-0.469
Token SELECT
Token ID33493
Feature activation+1.35e+00

Pos logit contributions
roma+0.744
+0.632
 centimeters+0.621
opez+0.610
 Braun+0.607
Neg logit contributions
buster-0.653
 tort-0.588
adr-0.575
 dere-0.574
 treadmill-0.571
Token id
Token ID4686
Feature activation+4.57e-02

Pos logit contributions
buster+5.489
 tort+5.012
 dere+4.881
 treadmill+4.840
adr+4.827
Neg logit contributions
roma-6.670
-5.714
 Aden-5.668
opez-5.563
 Braun-5.516
Token FROM
Token ID16034
Feature activation-1.77e-01

Pos logit contributions
buster+0.217
 tort+0.197
adr+0.193
 dere+0.192
 treadmill+0.191
Neg logit contributions
roma-0.212
-0.178
 centimeters-0.174
opez-0.172
 Aden-0.172
Token cl
Token ID537
Feature activation-8.50e-02

Pos logit contributions
roma+0.844
+0.713
 centimeters+0.696
opez+0.686
 Aden+0.684
Neg logit contributions
buster-0.821
 tort-0.745
 dere-0.729
adr-0.728
 treadmill-0.723
Tokenojure
Token ID32511
Feature activation-1.08e-01

Pos logit contributions
roma+0.434
+0.375
 centimeters+0.368
 Aden+0.363
 Braun+0.362
Neg logit contributions
buster-0.363
 tort-0.320
adr-0.320
 dere-0.314
 treadmill-0.312
Token.
Token ID764
Feature activation+2.60e-01

Pos logit contributions
roma+0.473
+0.395
 centimeters+0.384
 Aden+0.379
opez+0.378
Neg logit contributions
buster-0.541
 tort-0.493
adr-0.484
 dere-0.483
 treadmill-0.479
Token )
Token ID1267
Feature activation+5.58e-01

Pos logit contributions
buster+0.626
 tort+0.552
adr+0.539
 dere+0.537
 treadmill+0.532
Neg logit contributions
roma-0.928
-0.806
 centimeters-0.791
opez-0.780
 Aden-0.777
Token {
Token ID1391
Feature activation+6.76e-01

Pos logit contributions
buster+2.304
 tort+2.022
adr+2.009
 dere+1.954
ETF+1.941
Neg logit contributions
roma-2.939
-2.540
 centimeters-2.454
opez-2.453
 Aden-2.410
Token //
Token ID3373
Feature activation+3.92e-02

Pos logit contributions
buster+2.643
 tort+2.395
 treadmill+2.316
 dere+2.305
adr+2.293
Neg logit contributions
roma-3.663
-3.229
opez-3.076
 centimeters-3.061
 Aden-3.054
Token Time
Token ID3862
Feature activation+6.93e-01

Pos logit contributions
buster+0.204
 tort+0.188
adr+0.183
 dere+0.183
 treadmill+0.183
Neg logit contributions
roma-0.164
-0.136
 centimeters-0.131
opez-0.130
 Aden-0.128
Token component
Token ID7515
Feature activation+7.16e-01

Pos logit contributions
buster+2.985
 tort+2.775
 dere+2.720
adr+2.677
 treadmill+2.649
Neg logit contributions
roma-3.400
-2.935
 Aden-2.807
 centimeters-2.801
 Braun-2.795
Token const
Token ID1500
Feature activation+1.34e+00

Pos logit contributions
buster+2.497
 tort+2.348
 dere+2.231
adr+2.184
 treadmill+2.182
Neg logit contributions
roma-4.187
-3.632
opez-3.501
 centimeters-3.426
 Aden-3.417
Token float
Token ID12178
Feature activation+1.13e+00

Pos logit contributions
buster+3.341
 tort+2.984
 treadmill+2.804
 dere+2.692
 WARN+2.608
Neg logit contributions
roma-9.030
-8.103
 Aden-7.759
opez-7.719
rition-7.658
Token TA
Token ID21664
Feature activation+6.51e-01

Pos logit contributions
buster+3.793
 tort+3.441
 treadmill+3.297
 dere+3.260
 WARN+3.256
Neg logit contributions
roma-6.586
-5.845
opez-5.682
 centimeters-5.549
rition-5.483
TokenU
Token ID52
Feature activation+7.29e-01

Pos logit contributions
buster+2.648
ETF+2.349
 tort+2.340
adr+2.338
DAY+2.259
Neg logit contributions
roma-3.371
-2.907
 centimeters-2.861
opez-2.831
 Aden-2.805
Token =
Token ID796
Feature activation+1.03e+00

Pos logit contributions
buster+2.949
 tort+2.560
adr+2.544
ETF+2.531
 WARN+2.475
Neg logit contributions
roma-3.886
-3.312
opez-3.260
 centimeters-3.228
 Aden-3.192
Token 6
Token ID718
Feature activation+1.20e+00

Pos logit contributions
buster+3.803
 tort+3.545
 treadmill+3.377
 dere+3.337
adr+3.323
Neg logit contributions
roma-5.644
-4.842
 Aden-4.673
-4.629
opez-4.621
TokenTo
Token ID2514
Feature activation-4.72e-01

Pos logit contributions
buster+0.074
 tort+0.066
adr+0.065
 dere+0.064
ETF+0.064
Neg logit contributions
roma-0.091
-0.078
 centimeters-0.077
opez-0.075
 Braun-0.075
Token get
Token ID651
Feature activation+5.04e-02

Pos logit contributions
roma+2.378
+2.027
 centimeters+1.992
 Aden+1.969
 Braun+1.963
Neg logit contributions
buster-2.035
 tort-1.816
adr-1.786
 dere-1.761
 treadmill-1.756
Token the
Token ID262
Feature activation+2.58e-01

Pos logit contributions
buster+0.236
 tort+0.215
 dere+0.212
adr+0.209
 treadmill+0.209
Neg logit contributions
roma-0.238
-0.200
 centimeters-0.194
opez-0.192
 Aden-0.189
Token most
Token ID749
Feature activation+5.49e-01

Pos logit contributions
buster+1.132
 tort+1.029
 dere+1.006
adr+1.004
 treadmill+0.998
Neg logit contributions
roma-1.284
-1.093
 centimeters-1.063
opez-1.052
 Aden-1.038
Token out
Token ID503
Feature activation+9.96e-01

Pos logit contributions
buster+2.328
 tort+2.132
 dere+2.084
 treadmill+2.035
adr+2.000
Neg logit contributions
roma-2.807
-2.409
opez-2.323
 centimeters-2.311
 Braun-2.274
Token of
Token ID286
Feature activation+1.33e+00

Pos logit contributions
buster+4.136
 tort+3.760
 dere+3.697
ETF+3.589
adr+3.581
Neg logit contributions
roma-5.086
-4.365
 centimeters-4.246
opez-4.215
rition-4.072
Token it
Token ID340
Feature activation+4.64e-01

Pos logit contributions
buster+5.687
 dere+5.340
 tort+5.328
ETF+5.153
 treadmill+5.066
Neg logit contributions
roma-6.395
-5.214
 centimeters-5.150
opez-5.132
rition-4.889
Token will
Token ID481
Feature activation+2.96e-01

Pos logit contributions
buster+1.966
 tort+1.788
 dere+1.757
 treadmill+1.717
adr+1.715
Neg logit contributions
roma-2.399
-2.031
 centimeters-1.994
opez-1.964
 Aden-1.910
Token require
Token ID2421
Feature activation+8.33e-02

Pos logit contributions
buster+1.290
 tort+1.157
 dere+1.134
 treadmill+1.116
adr+1.105
Neg logit contributions
roma-1.500
-1.290
 centimeters-1.249
opez-1.245
 Aden-1.230
Token commitment
Token ID7901
Feature activation+6.13e-02

Pos logit contributions
buster+0.372
 tort+0.339
 dere+0.332
adr+0.329
 treadmill+0.328
Neg logit contributions
roma-0.410
-0.348
 centimeters-0.339
opez-0.334
 Aden-0.331
Token and
Token ID290
Feature activation+8.01e-02

Pos logit contributions
buster+0.271
 tort+0.246
 dere+0.241
 treadmill+0.238
adr+0.237
Neg logit contributions
roma-0.307
-0.260
 centimeters-0.254
opez-0.250
 Aden-0.248
Token for
Token ID329
Feature activation+7.37e-01

Pos logit contributions
buster+2.851
 tort+2.705
 dere+2.619
 treadmill+2.529
adr+2.507
Neg logit contributions
roma-3.166
-2.593
 centimeters-2.515
opez-2.514
 Braun-2.437
Token cases
Token ID2663
Feature activation+2.34e-01

Pos logit contributions
buster+3.257
 tort+3.106
 dere+2.994
 treadmill+2.909
adr+2.877
Neg logit contributions
roma-3.645
-3.030
 centimeters-2.919
opez-2.909
 Butter-2.807
Token.
Token ID13
Feature activation+3.17e-01

Pos logit contributions
buster+1.084
 tort+1.000
 dere+0.965
adr+0.957
 treadmill+0.949
Neg logit contributions
roma-1.117
-0.933
 centimeters-0.912
opez-0.904
 Braun-0.888
Token (
Token ID357
Feature activation+3.71e-01

Pos logit contributions
buster+1.421
 tort+1.290
 dere+1.261
adr+1.256
 treadmill+1.236
Neg logit contributions
roma-1.554
-1.306
 centimeters-1.287
opez-1.262
 Aden-1.230
TokenPhoto
Token ID6191
Feature activation+4.53e-01

Pos logit contributions
buster+1.541
 tort+1.374
adr+1.340
ETF+1.330
 dere+1.329
Neg logit contributions
roma-1.934
-1.639
 centimeters-1.637
opez-1.605
 Aden-1.588
Token by
Token ID416
Feature activation+1.31e+00

Pos logit contributions
buster+2.019
 tort+1.834
 treadmill+1.786
 dere+1.774
adr+1.761
Neg logit contributions
roma-2.202
-1.874
 centimeters-1.848
opez-1.825
 Aden-1.777
Token Kathleen
Token ID31275
Feature activation+2.10e-01

Pos logit contributions
buster+5.650
 tort+5.165
 dere+5.071
 treadmill+4.908
adr+4.901
Neg logit contributions
roma-6.216
-5.163
opez-5.135
 centimeters-5.114
-4.942
Token Flynn
Token ID14654
Feature activation-2.26e-01

Pos logit contributions
buster+1.016
 tort+0.922
 dere+0.909
adr+0.892
 treadmill+0.891
Neg logit contributions
roma-0.956
-0.790
 centimeters-0.779
opez-0.758
-0.733
Token,
Token ID11
Feature activation+5.20e-02

Pos logit contributions
roma+1.069
+0.899
 centimeters+0.881
opez+0.864
 Aden+0.860
Neg logit contributions
buster-1.059
 tort-0.959
 dere-0.939
adr-0.938
 treadmill-0.932
Token N
Token ID399
Feature activation-2.99e-02

Pos logit contributions
buster+0.246
 tort+0.226
 dere+0.220
adr+0.219
 treadmill+0.218
Neg logit contributions
roma-0.243
-0.201
 centimeters-0.198
opez-0.194
 Braun-0.191
TokenOL
Token ID3535
Feature activation+7.30e-01

Pos logit contributions
roma+0.156
+0.133
 centimeters+0.133
 Aden+0.130
 Braun+0.129
Neg logit contributions
buster-0.122
 tort-0.111
adr-0.111
 dere-0.108
ETF-0.108
Token ]
Token ID2361
Feature activation+4.40e-01

Pos logit contributions
buster+0.495
 tort+0.445
adr+0.437
 dere+0.435
 treadmill+0.433
Neg logit contributions
roma-0.552
-0.472
 centimeters-0.459
opez-0.455
 Aden-0.454
Token WHERE
Token ID33411
Feature activation+7.12e-01

Pos logit contributions
buster+1.748
 tort+1.565
 dere+1.523
adr+1.517
 treadmill+1.511
Neg logit contributions
roma-2.365
-2.050
opez-1.983
 centimeters-1.981
 Aden-1.974
Token id
Token ID4686
Feature activation-2.53e-01

Pos logit contributions
buster+2.983
 tort+2.726
 dere+2.647
adr+2.618
 treadmill+2.611
Neg logit contributions
roma-3.636
-3.121
opez-3.005
 centimeters-2.989
 Aden-2.989
Token IN
Token ID3268
Feature activation-5.78e-02

Pos logit contributions
roma+1.212
+1.025
 centimeters+1.005
opez+0.986
 Braun+0.982
Neg logit contributions
buster-1.163
 tort-1.052
 dere-1.029
adr-1.029
 treadmill-1.023
Token (
Token ID357
Feature activation-5.39e-03

Pos logit contributions
roma+0.292
+0.249
 centimeters+0.244
opez+0.240
 Aden+0.239
Neg logit contributions
buster-0.251
 tort-0.226
adr-0.221
 dere-0.220
 treadmill-0.219
Token SELECT
Token ID33493
Feature activation+1.31e+00

Pos logit contributions
roma+0.028
+0.024
 centimeters+0.023
opez+0.023
 Aden+0.023
Neg logit contributions
buster-0.023
 tort-0.020
adr-0.020
 dere-0.020
 treadmill-0.020
Token id
Token ID4686
Feature activation+1.48e-01

Pos logit contributions
buster+3.909
 tort+3.471
 dere+3.305
 treadmill+3.249
adr+3.247
Neg logit contributions
roma-7.931
 Aden-6.912
-6.897
opez-6.776
 centimeters-6.763
Token FROM
Token ID16034
Feature activation-7.76e-02

Pos logit contributions
buster+0.600
 tort+0.535
adr+0.524
 dere+0.521
 treadmill+0.517
Neg logit contributions
roma-0.786
-0.676
 centimeters-0.664
opez-0.654
 Aden-0.654
Token [
Token ID685
Feature activation-6.32e-02

Pos logit contributions
roma+0.375
+0.318
 centimeters+0.309
 Aden+0.307
opez+0.306
Neg logit contributions
buster-0.354
 tort-0.320
 dere-0.313
adr-0.313
 treadmill-0.311
Token cl
Token ID537
Feature activation+2.08e-01

Pos logit contributions
roma+0.309
+0.263
 centimeters+0.256
 Aden+0.254
opez+0.254
Neg logit contributions
buster-0.285
 tort-0.257
adr-0.251
 dere-0.251
 treadmill-0.249
Tokenojure
Token ID32511
Feature activation-1.74e-01

Pos logit contributions
buster+0.917
adr+0.823
 tort+0.807
 dere+0.793
 treadmill+0.783
Neg logit contributions
roma-1.020
-0.889
 centimeters-0.877
 Butter-0.856
 Braun-0.850
Token rights
Token ID2489
Feature activation-5.72e-02

Pos logit contributions
roma+0.150
+0.133
 centimeters+0.131
opez+0.129
 Aden+0.129
Neg logit contributions
buster-0.061
 tort-0.052
 dere-0.050
adr-0.050
 treadmill-0.049
Token group
Token ID1448
Feature activation+4.36e-02

Pos logit contributions
roma+0.287
+0.245
 centimeters+0.239
opez+0.235
 Braun+0.234
Neg logit contributions
buster-0.252
 tort-0.227
adr-0.221
 dere-0.221
 treadmill-0.221
Token were
Token ID547
Feature activation+7.50e-01

Pos logit contributions
buster+0.215
 tort+0.196
 dere+0.191
adr+0.191
 treadmill+0.190
Neg logit contributions
roma-0.195
-0.162
 centimeters-0.159
opez-0.157
 Aden-0.155
Token on
Token ID319
Feature activation+2.30e-01

Pos logit contributions
buster+3.303
 tort+3.103
 dere+2.985
 treadmill+2.915
adr+2.889
Neg logit contributions
roma-3.660
-3.140
opez-3.066
 Aden-2.954
 centimeters-2.953
Token the
Token ID262
Feature activation+9.05e-01

Pos logit contributions
buster+1.089
 tort+1.005
 dere+0.974
 treadmill+0.970
adr+0.965
Neg logit contributions
roma-1.066
-0.895
 centimeters-0.871
opez-0.866
 Braun-0.846
Token same
Token ID976
Feature activation+1.31e+00

Pos logit contributions
buster+4.038
 tort+3.846
 dere+3.746
 treadmill+3.724
adr+3.642
Neg logit contributions
roma-4.293
-3.597
opez-3.509
 centimeters-3.481
 Braun-3.423
Token page
Token ID2443
Feature activation+5.69e-01

Pos logit contributions
buster+4.558
 tort+4.486
 treadmill+4.262
 dere+4.234
adr+3.988
Neg logit contributions
roma-7.271
-6.270
opez-6.183
 Braun-5.996
 centimeters-5.992
Token?
Token ID30
Feature activation-3.01e-02

Pos logit contributions
buster+2.647
 tort+2.427
 dere+2.330
adr+2.325
 treadmill+2.315
Neg logit contributions
roma-2.706
-2.246
opez-2.181
 centimeters-2.179
 Aden-2.145
Token Claire
Token ID20758
Feature activation+8.63e-02

Pos logit contributions
roma+0.146
+0.122
 centimeters+0.120
opez+0.118
 Aden+0.117
Neg logit contributions
buster-0.138
 tort-0.124
adr-0.122
 dere-0.122
ETF-0.120
Token Gast
Token ID31486
Feature activation-1.37e-01

Pos logit contributions
buster+0.429
 tort+0.391
 dere+0.382
adr+0.381
 treadmill+0.380
Neg logit contributions
roma-0.383
-0.315
 centimeters-0.309
opez-0.302
 Aden-0.295
Tokenan
Token ID272
Feature activation+5.58e-02

Pos logit contributions
roma+0.568
+0.470
 centimeters+0.461
 Aden+0.449
opez+0.449
Neg logit contributions
buster-0.713
 tort-0.652
adr-0.642
 dere-0.637
 treadmill-0.634
Tokenyoutube
Token ID11604
Feature activation-8.54e-02

Pos logit contributions
roma+0.547
+0.464
 centimeters+0.458
 Aden+0.447
opez+0.447
Neg logit contributions
buster-0.537
 tort-0.483
adr-0.475
 dere-0.473
 treadmill-0.468
Token.
Token ID13
Feature activation-8.48e-01

Pos logit contributions
roma+0.379
+0.316
 centimeters+0.309
opez+0.303
 Aden+0.302
Neg logit contributions
buster-0.422
 tort-0.385
adr-0.377
 dere-0.377
 treadmill-0.374
Tokencom
Token ID785
Feature activation-1.79e-01

Pos logit contributions
roma+3.612
+3.010
 centimeters+2.900
 Braun+2.892
 Aden+2.861
Neg logit contributions
buster-4.178
 treadmill-3.936
 tort-3.916
 dere-3.819
adr-3.745
Token/
Token ID14
Feature activation-1.01e+00

Pos logit contributions
roma+0.839
+0.705
 centimeters+0.691
opez+0.680
 Aden+0.675
Neg logit contributions
buster-0.844
 tort-0.765
 dere-0.749
adr-0.749
 treadmill-0.744
Tokenchannel
Token ID17620
Feature activation-4.79e-01

Pos logit contributions
roma+5.115
+4.394
 centimeters+4.214
opez+4.194
 Braun+4.161
Neg logit contributions
buster-4.231
 tort-3.831
 dere-3.746
 treadmill-3.714
adr-3.681
Token/
Token ID14
Feature activation-1.53e+00

Pos logit contributions
roma+2.079
+1.740
 centimeters+1.696
 Butter+1.663
 Braun+1.661
Neg logit contributions
buster-2.344
 tort-2.186
 treadmill-2.156
 dere-2.139
adr-2.129
TokenUC
Token ID9598
Feature activation-2.21e-01

Pos logit contributions
roma+5.817
+4.651
 centimeters+4.347
 Butter+4.339
opez+4.275
Neg logit contributions
buster-8.086
 tort-7.649
 dere-7.521
 treadmill-7.517
 indemn-7.262
Token_
Token ID62
Feature activation-6.03e-01

Pos logit contributions
roma+1.021
+0.857
 centimeters+0.837
opez+0.824
 Aden+0.818
Neg logit contributions
buster-1.049
 tort-0.955
 dere-0.934
adr-0.932
 treadmill-0.929
TokenK
Token ID42
Feature activation-2.97e-01

Pos logit contributions
roma+2.852
+2.404
 centimeters+2.321
opez+2.316
 Aden+2.277
Neg logit contributions
buster-2.769
 tort-2.547
 treadmill-2.487
 dere-2.483
adr-2.439
Tokenk
Token ID74
Feature activation-5.26e-01

Pos logit contributions
roma+1.390
+1.167
 centimeters+1.137
opez+1.124
 Aden+1.113
Neg logit contributions
buster-1.395
 tort-1.272
 dere-1.244
 treadmill-1.238
adr-1.237
TokenBY
Token ID17513
Feature activation+8.31e-03

Pos logit contributions
roma+2.470
+2.071
 centimeters+2.010
opez+2.001
 Aden+1.969
Neg logit contributions
buster-2.448
 tort-2.248
 dere-2.193
 treadmill-2.191
adr-2.160
Tokenek
Token ID988
Feature activation+8.15e-02

Pos logit contributions
roma+0.092
+0.076
 centimeters+0.074
 Braun+0.072
opez+0.072
Neg logit contributions
buster-0.116
 tort-0.106
adr-0.105
 dere-0.104
 treadmill-0.104
Token.
Token ID13
Feature activation-4.10e-01

Pos logit contributions
buster+0.396
 tort+0.357
adr+0.353
 dere+0.350
 treadmill+0.347
Neg logit contributions
roma-0.369
-0.310
 centimeters-0.304
opez-0.298
 Aden-0.298
Tokencom
Token ID785
Feature activation-3.04e-02

Pos logit contributions
roma+1.944
+1.636
 centimeters+1.600
opez+1.573
 Braun+1.566
Neg logit contributions
buster-1.897
 tort-1.722
 dere-1.682
adr-1.681
 treadmill-1.677
Token/
Token ID14
Feature activation-1.14e+00

Pos logit contributions
roma+0.143
+0.120
 centimeters+0.118
opez+0.116
 Aden+0.115
Neg logit contributions
buster-0.143
 tort-0.129
adr-0.127
 dere-0.126
 treadmill-0.126
Tokenthread
Token ID16663
Feature activation-4.74e-01

Pos logit contributions
roma+5.455
+4.664
 centimeters+4.418
opez+4.360
 Braun+4.335
Neg logit contributions
buster-5.117
 tort-4.653
 treadmill-4.546
 dere-4.526
adr-4.488
Token/
Token ID14
Feature activation-1.47e+00

Pos logit contributions
roma+2.089
+1.736
 centimeters+1.683
opez+1.660
 Braun+1.650
Neg logit contributions
buster-2.338
 tort-2.156
 dere-2.110
adr-2.104
 treadmill-2.096
Token26
Token ID2075
Feature activation-3.68e-01

Pos logit contributions
roma+5.992
+4.931
 centimeters+4.659
opez+4.637
 Braun+4.533
Neg logit contributions
buster-7.323
 tort-6.835
 treadmill-6.791
 dere-6.736
 WARN-6.608
Token219
Token ID28896
Feature activation-4.97e-01

Pos logit contributions
roma+1.450
+1.176
 centimeters+1.147
opez+1.121
 Aden+1.114
Neg logit contributions
buster-2.009
 tort-1.847
adr-1.816
 dere-1.813
 treadmill-1.802
Token/
Token ID14
Feature activation-5.57e-01

Pos logit contributions
roma+2.114
+1.751
 centimeters+1.710
opez+1.676
 Braun+1.666
Neg logit contributions
buster-2.531
 tort-2.320
 dere-2.277
adr-2.271
 treadmill-2.266
Tokenuser
Token ID7220
Feature activation-2.46e-01

Pos logit contributions
roma+2.487
+2.083
 centimeters+2.010
opez+1.973
 Braun+1.957
Neg logit contributions
buster-2.712
 tort-2.490
 dere-2.442
 treadmill-2.433
adr-2.422
Token-
Token ID12
Feature activation-1.91e-01

Pos logit contributions
roma+1.150
+0.971
 centimeters+0.950
opez+0.932
 Aden+0.931
Neg logit contributions
buster-1.167
 tort-1.056
adr-1.035
 dere-1.034
 treadmill-1.026
Token partition
Token ID18398
Feature activation+5.21e-01

Pos logit contributions
buster+2.181
 tort+2.000
 dere+1.940
 treadmill+1.937
adr+1.929
Neg logit contributions
roma-1.952
-1.637
opez-1.589
 centimeters-1.575
 Aden-1.569
Token (/
Token ID50247
Feature activation-5.05e-01

Pos logit contributions
buster+2.128
 tort+1.896
 dere+1.830
 treadmill+1.828
adr+1.813
Neg logit contributions
roma-2.764
-2.400
opez-2.329
 centimeters-2.316
 Aden-2.283
Tokendev
Token ID7959
Feature activation-2.47e-02

Pos logit contributions
roma+2.521
+2.135
 centimeters+2.098
 Braun+2.057
opez+2.057
Neg logit contributions
buster-2.208
 tort-1.992
 dere-1.951
adr-1.945
 treadmill-1.936
Token/
Token ID14
Feature activation-1.28e+00

Pos logit contributions
roma+0.116
+0.097
 centimeters+0.095
opez+0.094
 Aden+0.093
Neg logit contributions
buster-0.117
 tort-0.105
adr-0.103
 dere-0.103
 treadmill-0.102
Tokenblock
Token ID9967
Feature activation+7.05e-02

Pos logit contributions
roma+4.764
+3.777
 centimeters+3.635
 Braun+3.518
opez+3.502
Neg logit contributions
buster-7.026
 tort-6.524
 dere-6.411
 treadmill-6.400
ETF-6.344
Token/
Token ID14
Feature activation-1.45e+00

Pos logit contributions
buster+0.326
 tort+0.284
 dere+0.280
adr+0.280
 treadmill+0.279
Neg logit contributions
roma-0.344
-0.294
 centimeters-0.285
opez-0.284
 Aden-0.284
Tokenmm
Token ID3020
Feature activation-5.71e-01

Pos logit contributions
roma+6.215
+5.083
 centimeters+4.963
 Braun+4.895
 Butter+4.808
Neg logit contributions
buster-6.917
 tort-6.552
 dere-6.376
 treadmill-6.334
ETF-6.257
Tokenc
Token ID66
Feature activation-6.86e-01

Pos logit contributions
roma+2.084
+1.665
 centimeters+1.612
opez+1.569
 Braun+1.556
Neg logit contributions
buster-3.263
 tort-3.018
 dere-2.960
adr-2.957
 treadmill-2.953
Tokenbl
Token ID2436
Feature activation-5.40e-01

Pos logit contributions
roma+2.918
+2.398
 centimeters+2.347
opez+2.301
 Braun+2.287
Neg logit contributions
buster-3.518
 tort-3.220
 dere-3.159
 treadmill-3.147
adr-3.141
Tokenk
Token ID74
Feature activation-7.45e-01

Pos logit contributions
roma+2.158
+1.755
 centimeters+1.712
opez+1.678
 Braun+1.664
Neg logit contributions
buster-2.898
 tort-2.669
 dere-2.621
adr-2.616
 treadmill-2.608
Token0
Token ID15
Feature activation-9.60e-01

Pos logit contributions
roma+3.213
+2.648
 centimeters+2.589
opez+2.534
 Braun+2.532
Neg logit contributions
buster-3.738
 tort-3.442
 dere-3.370
 treadmill-3.361
adr-3.350
Token non
Token ID1729
Feature activation-3.44e-01

Pos logit contributions
roma+2.171
+1.839
 centimeters+1.798
opez+1.769
 Aden+1.753
Neg logit contributions
buster-2.044
 tort-1.865
 dere-1.819
adr-1.816
 treadmill-1.815
Tokenrandom
Token ID25120
Feature activation-4.52e-01

Pos logit contributions
roma+1.573
+1.318
 centimeters+1.289
opez+1.266
 Aden+1.258
Neg logit contributions
buster-1.651
 tort-1.508
adr-1.476
 dere-1.473
 treadmill-1.466
Tokenized
Token ID1143
Feature activation-5.91e-01

Pos logit contributions
roma+2.268
+1.934
 centimeters+1.903
opez+1.868
 Aden+1.864
Neg logit contributions
buster-1.967
 tort-1.758
adr-1.731
 dere-1.721
 treadmill-1.702
Token,
Token ID11
Feature activation-4.76e-01

Pos logit contributions
roma+2.829
+2.398
 centimeters+2.359
 Braun+2.318
 Aden+2.316
Neg logit contributions
buster-2.705
 tort-2.416
adr-2.392
 dere-2.372
ETF-2.347
Token controlled
Token ID6856
Feature activation-9.68e-01

Pos logit contributions
roma+2.167
+1.815
 centimeters+1.782
opez+1.746
 Braun+1.741
Neg logit contributions
buster-2.297
 tort-2.079
adr-2.044
 dere-2.039
 treadmill-2.020
Token study
Token ID2050
Feature activation-1.37e+00

Pos logit contributions
roma+4.956
+4.282
 centimeters+4.222
 Braun+4.192
 Butter+4.174
Neg logit contributions
buster-4.004
adr-3.497
 tort-3.455
 dere-3.418
ETF-3.393
Token.
Token ID13
Feature activation-5.79e-01

Pos logit contributions
roma+5.684
+4.756
 centimeters+4.659
 Braun+4.633
 Aden+4.622
Neg logit contributions
buster-6.889
adr-6.138
 tort-6.059
 dere-6.057
 WARN-5.985
Token The
Token ID383
Feature activation-3.62e-01

Pos logit contributions
roma+2.557
+2.137
 centimeters+2.089
 Aden+2.056
 Braun+2.047
Neg logit contributions
buster-2.863
 tort-2.611
 dere-2.560
adr-2.559
 treadmill-2.541
Token first
Token ID717
Feature activation-7.22e-01

Pos logit contributions
roma+1.667
+1.398
 centimeters+1.367
opez+1.342
 Aden+1.334
Neg logit contributions
buster-1.726
 tort-1.573
adr-1.538
 dere-1.538
 treadmill-1.531
Token cohort
Token ID20812
Feature activation-7.35e-01

Pos logit contributions
roma+3.599
+3.063
 centimeters+3.037
 Aden+2.975
opez+2.975
Neg logit contributions
buster-3.145
 tort-2.794
adr-2.763
 dere-2.741
ETF-2.718
Token of
Token ID286
Feature activation-2.41e-01

Pos logit contributions
roma+3.255
+2.712
 centimeters+2.657
opez+2.604
 Aden+2.597
Neg logit contributions
buster-3.634
 tort-3.302
adr-3.249
 dere-3.240
 treadmill-3.213
Token MD
Token ID10670
Feature activation-1.21e-01

Pos logit contributions
buster+0.457
 tort+0.416
 dere+0.407
 treadmill+0.403
adr+0.396
Neg logit contributions
roma-0.582
-0.498
 centimeters-0.483
opez-0.478
 Braun-0.475
TokenD
Token ID35
Feature activation-3.48e-01

Pos logit contributions
roma+0.574
+0.481
 centimeters+0.475
opez+0.466
 Braun+0.461
Neg logit contributions
buster-0.563
 tort-0.516
 dere-0.504
 treadmill-0.502
adr-0.501
Token.
Token ID13
Feature activation-2.81e-01

Pos logit contributions
roma+1.588
+1.330
 centimeters+1.304
opez+1.279
 Aden+1.274
Neg logit contributions
buster-1.679
 tort-1.522
adr-1.498
 dere-1.492
 treadmill-1.481
Token
Token ID198
Feature activation-5.77e-01

Pos logit contributions
roma+1.307
+1.091
 centimeters+1.070
opez+1.057
 Braun+1.035
Neg logit contributions
buster-1.333
 tort-1.211
adr-1.190
 dere-1.186
 treadmill-1.176
Token
Token ID198
Feature activation-7.13e-01

Pos logit contributions
roma+2.479
+2.087
 centimeters+2.024
 Aden+2.005
 Braun+2.004
Neg logit contributions
buster-2.864
 tort-2.648
 treadmill-2.629
 dere-2.581
adr-2.573
TokenMethods
Token ID46202
Feature activation-1.36e+00

Pos logit contributions
roma+2.750
+2.264
 centimeters+2.130
opez+2.112
 Braun+2.080
Neg logit contributions
buster-3.927
 tort-3.625
 treadmill-3.567
 dere-3.557
 WARN-3.483
Token This
Token ID770
Feature activation-1.16e-01

Pos logit contributions
roma+5.802
 Aden+4.887
+4.854
 Butter+4.799
 centimeters+4.789
Neg logit contributions
buster-6.656
 tort-6.031
adr-5.996
ETF-5.944
 dere-5.905
Token is
Token ID318
Feature activation-5.56e-01

Pos logit contributions
roma+0.530
+0.441
 centimeters+0.430
opez+0.423
 Aden+0.417
Neg logit contributions
buster-0.554
 tort-0.516
 treadmill-0.504
 dere-0.502
adr-0.499
Token a
Token ID257
Feature activation-4.49e-01

Pos logit contributions
roma+2.678
+2.268
 centimeters+2.223
 Aden+2.179
opez+2.179
Neg logit contributions
buster-2.539
 tort-2.289
 dere-2.243
adr-2.240
 treadmill-2.223
Token non
Token ID1729
Feature activation-3.44e-01

Pos logit contributions
roma+2.171
+1.839
 centimeters+1.798
opez+1.769
 Aden+1.753
Neg logit contributions
buster-2.044
 tort-1.865
 dere-1.819
adr-1.816
 treadmill-1.815
Tokenrandom
Token ID25120
Feature activation-4.52e-01

Pos logit contributions
roma+1.573
+1.318
 centimeters+1.289
opez+1.266
 Aden+1.258
Neg logit contributions
buster-1.651
 tort-1.508
adr-1.476
 dere-1.473
 treadmill-1.466
Tokencon
Token ID1102
Feature activation+6.65e-02

Pos logit contributions
buster+0.647
 tort+0.590
adr+0.578
 dere+0.567
ETF+0.562
Neg logit contributions
roma-0.734
-0.619
 centimeters-0.611
 Aden-0.601
 Braun-0.599
Tokenceptions
Token ID11755
Feature activation-7.24e-01

Pos logit contributions
buster+0.270
 tort+0.243
 dere+0.235
adr+0.233
ETF+0.229
Neg logit contributions
roma-0.352
-0.304
 centimeters-0.299
opez-0.297
 Braun-0.293
Token I
Token ID314
Feature activation-2.08e-01

Pos logit contributions
roma+3.250
+2.722
 centimeters+2.683
 Butter+2.626
 Braun+2.626
Neg logit contributions
buster-3.453
adr-3.114
 tort-3.111
 dere-3.068
 treadmill-3.056
Token've
Token ID1053
Feature activation-2.10e-01

Pos logit contributions
roma+1.059
+0.906
 centimeters+0.891
opez+0.874
 Braun+0.874
Neg logit contributions
buster-0.887
 tort-0.795
adr-0.782
 dere-0.777
 treadmill-0.773
Token been
Token ID587
Feature activation-2.29e-01

Pos logit contributions
roma+0.866
+0.711
 centimeters+0.695
opez+0.679
 Braun+0.676
Neg logit contributions
buster-1.106
 tort-1.013
adr-0.997
 dere-0.994
 treadmill-0.989
Token asked
Token ID1965
Feature activation-1.32e+00

Pos logit contributions
roma+1.219
+1.050
 centimeters+1.044
 Braun+1.019
 Butter+1.018
Neg logit contributions
buster-0.914
 tort-0.806
adr-0.800
 treadmill-0.793
ETF-0.787
Token about
Token ID546
Feature activation-5.65e-01

Pos logit contributions
roma+5.486
 centimeters+4.605
+4.535
 Braun+4.514
 Butter+4.513
Neg logit contributions
buster-6.383
adr-5.894
ETF-5.766
 tort-5.715
 treadmill-5.713
Token and
Token ID290
Feature activation-5.06e-01

Pos logit contributions
roma+2.445
+2.044
 centimeters+2.009
 Braun+2.001
 Butter+1.979
Neg logit contributions
buster-2.773
adr-2.501
 tort-2.477
 treadmill-2.463
ETF-2.452
Token the
Token ID262
Feature activation-2.91e-01

Pos logit contributions
roma+2.233
+1.883
 centimeters+1.856
 Braun+1.818
 Aden+1.805
Neg logit contributions
buster-2.470
 tort-2.215
adr-2.213
 treadmill-2.180
 dere-2.170
Token answers
Token ID7429
Feature activation-3.63e-01

Pos logit contributions
roma+1.384
+1.169
 centimeters+1.149
opez+1.125
 Aden+1.125
Neg logit contributions
buster-1.337
 tort-1.199
adr-1.183
 dere-1.176
 treadmill-1.173
Token as
Token ID355
Feature activation-3.26e-01

Pos logit contributions
roma+1.554
+1.285
 centimeters+1.264
opez+1.234
 Aden+1.233
Neg logit contributions
buster-1.840
 tort-1.677
adr-1.658
 dere-1.646
 treadmill-1.639
Token.
Token ID13
Feature activation-1.84e-01

Pos logit contributions
buster+1.003
 tort+0.938
 treadmill+0.914
 dere+0.895
adr+0.884
Neg logit contributions
roma-1.195
-1.019
 centimeters-0.977
 Aden-0.974
opez-0.973
Token
Token ID198
Feature activation-3.38e-01

Pos logit contributions
roma+0.876
+0.740
 centimeters+0.725
opez+0.712
 Aden+0.708
Neg logit contributions
buster-0.853
 tort-0.773
adr-0.756
 dere-0.756
 treadmill-0.751
Token
Token ID198
Feature activation-3.95e-01

Pos logit contributions
roma+1.513
+1.274
 centimeters+1.239
 Aden+1.219
 Braun+1.217
Neg logit contributions
buster-1.642
 tort-1.505
 treadmill-1.481
 dere-1.473
adr-1.468
TokenMaterials
Token ID41657
Feature activation-5.23e-01

Pos logit contributions
roma+1.893
+1.615
 centimeters+1.561
opez+1.537
 Aden+1.526
Neg logit contributions
buster-1.798
 tort-1.638
 dere-1.609
 treadmill-1.602
adr-1.583
Token and
Token ID290
Feature activation-7.04e-01

Pos logit contributions
roma+2.435
+2.036
 centimeters+2.003
 Aden+1.969
 Braun+1.968
Neg logit contributions
buster-2.464
 tort-2.220
adr-2.204
 dere-2.184
 treadmill-2.160
Token Methods
Token ID25458
Feature activation-1.30e+00

Pos logit contributions
roma+2.525
+2.019
 centimeters+1.980
 Butter+1.950
 Braun+1.924
Neg logit contributions
buster-4.057
adr-3.672
 tort-3.644
 dere-3.641
ETF-3.622
Token Experimental
Token ID32286
Feature activation-6.91e-01

Pos logit contributions
roma+5.717
 Butter+4.798
 centimeters+4.788
 Aden+4.736
+4.726
Neg logit contributions
buster-6.183
adr-5.516
 tort-5.451
ETF-5.439
 dere-5.426
Token subjects
Token ID7481
Feature activation-2.95e-01

Pos logit contributions
roma+3.112
+2.591
 centimeters+2.569
 Butter+2.516
 Braun+2.505
Neg logit contributions
buster-3.346
adr-3.002
 tort-2.991
 dere-2.963
ETF-2.950
Token Three
Token ID7683
Feature activation+2.78e-01

Pos logit contributions
roma+1.348
+1.129
 centimeters+1.109
 Aden+1.085
opez+1.085
Neg logit contributions
buster-1.416
 tort-1.282
adr-1.264
 dere-1.256
 treadmill-1.246
Token female
Token ID4048
Feature activation-4.51e-01

Pos logit contributions
buster+1.236
 tort+1.156
 dere+1.123
 treadmill+1.122
adr+1.104
Neg logit contributions
roma-1.348
-1.152
 centimeters-1.098
opez-1.091
 Braun-1.089
Token r
Token ID374
Feature activation-1.99e-01

Pos logit contributions
roma+2.098
+1.774
 centimeters+1.767
 Butter+1.735
 Aden+1.728
Neg logit contributions
buster-2.111
adr-1.870
 tort-1.858
ETF-1.840
 dere-1.832
Token shows
Token ID2523
Feature activation-4.87e-01

Pos logit contributions
roma+2.136
+1.793
 centimeters+1.760
opez+1.723
 Braun+1.717
Neg logit contributions
buster-2.190
 tort-1.984
adr-1.953
 dere-1.941
 treadmill-1.930
Token that
Token ID326
Feature activation-3.81e-01

Pos logit contributions
roma+2.194
+1.827
 centimeters+1.794
opez+1.757
 Braun+1.754
Neg logit contributions
buster-2.373
 tort-2.159
adr-2.126
 dere-2.115
 treadmill-2.104
Token they
Token ID484
Feature activation-5.74e-01

Pos logit contributions
roma+1.794
+1.512
 centimeters+1.483
 Braun+1.454
opez+1.453
Neg logit contributions
buster-1.774
 tort-1.602
adr-1.579
 dere-1.569
 treadmill-1.559
Token planned
Token ID6027
Feature activation-4.34e-01

Pos logit contributions
roma+2.587
+2.134
 centimeters+2.109
 Braun+2.063
opez+2.058
Neg logit contributions
buster-2.782
 tort-2.517
adr-2.498
 dere-2.482
 treadmill-2.468
Token to
Token ID284
Feature activation-2.07e-01

Pos logit contributions
roma+1.806
+1.483
 centimeters+1.448
opez+1.418
 Aden+1.409
Neg logit contributions
buster-2.271
 tort-2.081
adr-2.044
 dere-2.042
 treadmill-2.030
Token excise
Token ID47547
Feature activation-1.30e+00

Pos logit contributions
roma+0.983
+0.828
 centimeters+0.813
opez+0.798
 Braun+0.794
Neg logit contributions
buster-0.956
 tort-0.865
adr-0.848
 dere-0.846
 treadmill-0.840
Token from
Token ID422
Feature activation-4.87e-01

Pos logit contributions
roma+5.614
 centimeters+4.697
+4.667
 Braun+4.590
 Butter+4.543
Neg logit contributions
buster-6.240
ETF-5.643
adr-5.633
 tort-5.606
 treadmill-5.561
Token their
Token ID511
Feature activation-4.87e-01

Pos logit contributions
roma+2.080
+1.719
 centimeters+1.680
opez+1.645
 Aden+1.636
Neg logit contributions
buster-2.493
 tort-2.280
adr-2.238
 dere-2.235
 treadmill-2.222
Token lives
Token ID3160
Feature activation-4.51e-01

Pos logit contributions
roma+2.316
+1.961
 centimeters+1.929
 Braun+1.885
opez+1.883
Neg logit contributions
buster-2.244
 tort-2.016
adr-1.992
 dere-1.971
ETF-1.964
Token.
Token ID13
Feature activation-2.71e-01

Pos logit contributions
roma+2.061
+1.725
 centimeters+1.687
opez+1.657
 Aden+1.649
Neg logit contributions
buster-2.178
 tort-1.982
 dere-1.940
adr-1.940
 treadmill-1.927
Token Again
Token ID6521
Feature activation+2.37e-01

Pos logit contributions
roma+1.288
+1.084
 centimeters+1.063
opez+1.047
 Aden+1.036
Neg logit contributions
buster-1.254
 tort-1.134
adr-1.113
 dere-1.109
 treadmill-1.097
Token 19
Token ID678
Feature activation+6.33e-02

Pos logit contributions
buster+0.636
 tort+0.575
 dere+0.562
adr+0.559
 treadmill+0.558
Neg logit contributions
roma-0.744
-0.638
 centimeters-0.620
opez-0.618
 Aden-0.609
Tokenth
Token ID400
Feature activation+4.43e-01

Pos logit contributions
buster+0.373
 tort+0.348
adr+0.341
 dere+0.339
ETF+0.337
Neg logit contributions
roma-0.220
-0.174
opez-0.165
 centimeters-0.165
 Aden-0.164
Token partition
Token ID18398
Feature activation+5.21e-01

Pos logit contributions
buster+2.181
 tort+2.000
 dere+1.940
 treadmill+1.937
adr+1.929
Neg logit contributions
roma-1.952
-1.637
opez-1.589
 centimeters-1.575
 Aden-1.569
Token (/
Token ID50247
Feature activation-5.05e-01

Pos logit contributions
buster+2.128
 tort+1.896
 dere+1.830
 treadmill+1.828
adr+1.813
Neg logit contributions
roma-2.764
-2.400
opez-2.329
 centimeters-2.316
 Aden-2.283
Tokendev
Token ID7959
Feature activation-2.47e-02

Pos logit contributions
roma+2.521
+2.135
 centimeters+2.098
 Braun+2.057
opez+2.057
Neg logit contributions
buster-2.208
 tort-1.992
 dere-1.951
adr-1.945
 treadmill-1.936
Token/
Token ID14
Feature activation-1.28e+00

Pos logit contributions
roma+0.116
+0.097
 centimeters+0.095
opez+0.094
 Aden+0.093
Neg logit contributions
buster-0.117
 tort-0.105
adr-0.103
 dere-0.103
 treadmill-0.102
Tokenblock
Token ID9967
Feature activation+7.05e-02

Pos logit contributions
roma+4.764
+3.777
 centimeters+3.635
 Braun+3.518
opez+3.502
Neg logit contributions
buster-7.026
 tort-6.524
 dere-6.411
 treadmill-6.400
ETF-6.344
Token/
Token ID14
Feature activation-1.45e+00

Pos logit contributions
buster+0.326
 tort+0.284
 dere+0.280
adr+0.280
 treadmill+0.279
Neg logit contributions
roma-0.344
-0.294
 centimeters-0.285
opez-0.284
 Aden-0.284
Tokenmm
Token ID3020
Feature activation-5.71e-01

Pos logit contributions
roma+6.215
+5.083
 centimeters+4.963
 Braun+4.895
 Butter+4.808
Neg logit contributions
buster-6.917
 tort-6.552
 dere-6.376
 treadmill-6.334
ETF-6.257
Tokenc
Token ID66
Feature activation-6.86e-01

Pos logit contributions
roma+2.084
+1.665
 centimeters+1.612
opez+1.569
 Braun+1.556
Neg logit contributions
buster-3.263
 tort-3.018
 dere-2.960
adr-2.957
 treadmill-2.953
Tokenbl
Token ID2436
Feature activation-5.40e-01

Pos logit contributions
roma+2.918
+2.398
 centimeters+2.347
opez+2.301
 Braun+2.287
Neg logit contributions
buster-3.518
 tort-3.220
 dere-3.159
 treadmill-3.147
adr-3.141
Token great
Token ID1049
Feature activation-1.40e-01

Pos logit contributions
roma+1.723
+1.445
 centimeters+1.422
 Braun+1.394
opez+1.392
Neg logit contributions
buster-1.779
 tort-1.606
adr-1.590
 dere-1.580
 treadmill-1.572
Token.
Token ID13
Feature activation-2.62e-01

Pos logit contributions
roma+0.659
+0.555
 centimeters+0.542
opez+0.533
 Aden+0.530
Neg logit contributions
buster-0.656
 tort-0.597
 dere-0.582
adr-0.580
 treadmill-0.579
Token Competitive
Token ID47665
Feature activation-2.05e-01

Pos logit contributions
roma+1.208
+1.014
 centimeters+0.992
opez+0.975
 Aden+0.968
Neg logit contributions
buster-1.252
 tort-1.138
adr-1.114
 dere-1.114
 treadmill-1.107
Token Trib
Token ID10841
Feature activation-9.38e-01

Pos logit contributions
roma+0.991
+0.838
 centimeters+0.821
opez+0.808
 Aden+0.803
Neg logit contributions
buster-0.930
 tort-0.843
 dere-0.823
adr-0.822
 treadmill-0.819
Tokenes
Token ID274
Feature activation-3.71e-01

Pos logit contributions
roma+4.626
+3.916
 centimeters+3.843
 Braun+3.777
opez+3.751
Neg logit contributions
buster-4.105
 tort-3.719
 dere-3.642
adr-3.640
 treadmill-3.621
Token Ascend
Token ID45387
Feature activation-1.28e+00

Pos logit contributions
roma+1.791
+1.517
 centimeters+1.488
opez+1.459
 Braun+1.455
Neg logit contributions
buster-1.688
 tort-1.523
adr-1.494
 dere-1.492
 treadmill-1.481
Token player
Token ID2137
Feature activation-6.22e-01

Pos logit contributions
roma+5.926
+4.994
 centimeters+4.985
 Butter+4.911
 Braun+4.909
Neg logit contributions
buster-5.771
adr-5.092
 tort-5.023
 dere-4.987
ETF-4.957
Token.
Token ID13
Feature activation-3.34e-01

Pos logit contributions
roma+2.968
+2.509
 centimeters+2.467
 Braun+2.424
opez+2.419
Neg logit contributions
buster-2.839
 tort-2.558
adr-2.526
 dere-2.508
 treadmill-2.490
Token Sym
Token ID15845
Feature activation-1.88e-02

Pos logit contributions
roma+1.562
+1.313
 centimeters+1.287
opez+1.265
 Aden+1.255
Neg logit contributions
buster-1.571
 tort-1.427
adr-1.396
 dere-1.395
 treadmill-1.386
Tokentex
Token ID16886
Feature activation-2.64e-01

Pos logit contributions
roma+0.086
+0.072
 centimeters+0.071
 Aden+0.069
opez+0.069
Neg logit contributions
buster-0.091
 tort-0.082
adr-0.081
 dere-0.081
 treadmill-0.080
Token in
Token ID287
Feature activation-4.70e-01

Pos logit contributions
roma+1.263
+1.067
 centimeters+1.046
opez+1.027
 Aden+1.021
Neg logit contributions
buster-1.217
 tort-1.107
adr-1.082
 dere-1.077
 treadmill-1.070
Token Pride
Token ID21735
Feature activation-2.21e-01

Pos logit contributions
roma+0.582
+0.488
 centimeters+0.480
opez+0.473
 Braun+0.468
Neg logit contributions
buster-0.587
 tort-0.532
 dere-0.519
adr-0.518
 treadmill-0.514
Token he
Token ID339
Feature activation-4.19e-01

Pos logit contributions
roma+1.079
+0.914
 centimeters+0.896
opez+0.881
 Aden+0.876
Neg logit contributions
buster-0.998
 tort-0.902
adr-0.882
 dere-0.881
 treadmill-0.875
Token sought
Token ID7194
Feature activation-7.16e-01

Pos logit contributions
roma+1.875
+1.565
 centimeters+1.531
opez+1.502
 Aden+1.493
Neg logit contributions
buster-2.053
 tort-1.871
adr-1.835
 dere-1.832
 treadmill-1.820
Token to
Token ID284
Feature activation-4.77e-01

Pos logit contributions
roma+2.986
+2.464
 centimeters+2.404
 Braun+2.365
 Aden+2.354
Neg logit contributions
buster-3.682
 tort-3.340
adr-3.310
 treadmill-3.294
 dere-3.291
Token st
Token ID336
Feature activation-1.00e+00

Pos logit contributions
roma+2.227
+1.863
 centimeters+1.837
 Aden+1.803
opez+1.798
Neg logit contributions
buster-2.226
 tort-1.988
adr-1.980
ETF-1.972
 dere-1.959
Tokenoke
Token ID2088
Feature activation-1.28e+00

Pos logit contributions
roma+4.443
+3.585
opez+3.549
 centimeters+3.459
 Aden+3.440
Neg logit contributions
buster-4.915
 tort-4.563
 dere-4.453
 treadmill-4.446
ETF-4.396
Token through
Token ID832
Feature activation-4.46e-01

Pos logit contributions
roma+5.388
+4.452
 centimeters+4.394
 Braun+4.345
opez+4.314
Neg logit contributions
buster-6.242
adr-5.610
 treadmill-5.562
 tort-5.526
 dere-5.522
Token Pad
Token ID15744
Feature activation+2.58e-01

Pos logit contributions
roma+1.953
+1.619
 centimeters+1.583
 Braun+1.552
opez+1.551
Neg logit contributions
buster-2.224
 tort-2.016
adr-1.993
 treadmill-1.979
 dere-1.979
Tokenm
Token ID76
Feature activation+3.96e-01

Pos logit contributions
buster+1.063
 tort+0.936
adr+0.924
 dere+0.921
 treadmill+0.905
Neg logit contributions
roma-1.354
-1.181
 centimeters-1.161
opez-1.137
 Braun-1.131
Tokenav
Token ID615
Feature activation+3.98e-02

Pos logit contributions
buster+1.516
 tort+1.339
adr+1.333
 dere+1.303
 treadmill+1.280
Neg logit contributions
roma-2.196
-1.916
 centimeters-1.885
 Aden-1.854
 Braun-1.845
Tokenati
Token ID7246
Feature activation-5.94e-02

Pos logit contributions
buster+0.207
 tort+0.190
adr+0.188
 dere+0.186
 treadmill+0.183
Neg logit contributions
roma-0.166
-0.137
 centimeters-0.134
opez-0.131
 Braun-0.131
Token,
Token ID11
Feature activation+2.39e-01

Pos logit contributions
buster+1.249
 tort+1.124
 dere+1.082
 treadmill+1.079
adr+1.078
Neg logit contributions
roma-1.573
-1.335
 centimeters-1.330
opez-1.316
 Aden-1.289
Token for
Token ID329
Feature activation-9.86e-03

Pos logit contributions
buster+1.140
 tort+1.049
 dere+1.019
 treadmill+1.018
adr+1.009
Neg logit contributions
roma-1.096
-0.913
 centimeters-0.895
opez-0.884
 Aden-0.862
Token example
Token ID1672
Feature activation+6.71e-01

Pos logit contributions
roma+0.055
+0.047
 centimeters+0.046
opez+0.046
 Aden+0.045
Neg logit contributions
buster-0.038
 tort-0.034
 dere-0.033
 treadmill-0.033
adr-0.033
Token,
Token ID11
Feature activation+3.49e-02

Pos logit contributions
buster+2.597
 tort+2.260
adr+2.232
ETF+2.186
 dere+2.184
Neg logit contributions
roma-3.699
-3.171
 centimeters-3.109
opez-3.099
-2.996
Token You
Token ID921
Feature activation+1.15e-01

Pos logit contributions
buster+0.170
 tort+0.156
 dere+0.152
 treadmill+0.152
adr+0.151
Neg logit contributions
roma-0.158
-0.132
 centimeters-0.129
opez-0.127
 Aden-0.124
TokenGov
Token ID23774
Feature activation-1.28e+00

Pos logit contributions
buster+0.537
 tort+0.486
 dere+0.476
adr+0.475
 treadmill+0.473
Neg logit contributions
roma-0.543
-0.456
 centimeters-0.449
opez-0.441
 Aden-0.438
Token discovered
Token ID5071
Feature activation+1.23e-01

Pos logit contributions
roma+4.932
+4.026
 centimeters+3.945
 Braun+3.852
 Butter+3.824
Neg logit contributions
buster-6.759
adr-6.149
 tort-6.098
 dere-6.077
ETF-6.071
Token that
Token ID326
Feature activation+9.06e-02

Pos logit contributions
buster+0.589
 tort+0.540
 dere+0.525
 treadmill+0.523
adr+0.517
Neg logit contributions
roma-0.569
-0.477
 centimeters-0.466
opez-0.456
 Aden-0.452
Token 55
Token ID5996
Feature activation-2.32e-01

Pos logit contributions
buster+0.426
 tort+0.390
 dere+0.379
 treadmill+0.379
adr+0.376
Neg logit contributions
roma-0.423
-0.356
 centimeters-0.348
opez-0.342
 Aden-0.338
Token percent
Token ID1411
Feature activation-3.64e-01

Pos logit contributions
roma+0.675
+0.499
 centimeters+0.476
opez+0.465
 Aden+0.464
Neg logit contributions
buster-1.506
 tort-1.410
 dere-1.383
adr-1.381
 treadmill-1.381
Token of
Token ID286
Feature activation-3.54e-01

Pos logit contributions
roma+1.638
+1.369
 centimeters+1.338
opez+1.311
 Aden+1.308
Neg logit contributions
buster-1.769
 tort-1.610
adr-1.587
 dere-1.582
 treadmill-1.569
Token Not
Token ID1892
Feature activation+5.65e-02

Pos logit contributions
roma+1.055
+0.916
 centimeters+0.900
opez+0.887
 Aden+0.884
Neg logit contributions
buster-0.708
 tort-0.626
adr-0.610
 dere-0.609
 treadmill-0.604
Token-
Token ID12
Feature activation-2.69e-01

Pos logit contributions
buster+0.262
 tort+0.234
adr+0.231
 dere+0.231
ETF+0.229
Neg logit contributions
roma-0.271
-0.229
 centimeters-0.224
opez-0.222
 Braun-0.216
TokenAssociated
Token ID29014
Feature activation+1.58e-01

Pos logit contributions
roma+1.449
+1.256
 centimeters+1.228
 Aden+1.209
opez+1.206
Neg logit contributions
buster-1.069
 tort-0.961
 treadmill-0.937
 dere-0.933
adr-0.927
Token T
Token ID309
Feature activation-1.93e-01

Pos logit contributions
buster+0.558
 tort+0.488
 dere+0.473
adr+0.472
 treadmill+0.470
Neg logit contributions
roma-0.930
-0.812
 centimeters-0.797
opez-0.794
 Aden-0.782
Tokenx
Token ID87
Feature activation-1.62e-01

Pos logit contributions
roma+0.690
+0.545
 centimeters+0.529
opez+0.516
 Aden+0.513
Neg logit contributions
buster-1.124
 tort-1.042
 dere-1.024
adr-1.022
 treadmill-1.020
Token-
Token ID12
Feature activation-1.28e+00

Pos logit contributions
roma+0.737
+0.617
 centimeters+0.604
opez+0.592
 Aden+0.590
Neg logit contributions
buster-0.785
 tort-0.715
adr-0.700
 dere-0.700
 treadmill-0.696
TokenPower
Token ID13434
Feature activation+1.99e-01

Pos logit contributions
roma+4.360
+3.530
 Braun+3.240
 centimeters+3.239
 Butter+3.178
Neg logit contributions
buster-7.203
 treadmill-6.946
 tort-6.861
 dere-6.785
 indemn-6.579
Token=
Token ID28
Feature activation-3.15e-01

Pos logit contributions
buster+0.973
 tort+0.880
adr+0.865
 dere+0.861
ETF+0.858
Neg logit contributions
roma-0.896
-0.743
 centimeters-0.724
opez-0.722
 Aden-0.710
Token20
Token ID1238
Feature activation+1.05e-01

Pos logit contributions
roma+1.232
+0.998
 centimeters+0.972
opez+0.950
 Aden+0.944
Neg logit contributions
buster-1.728
 tort-1.590
adr-1.563
 dere-1.561
 treadmill-1.553
Token dB
Token ID30221
Feature activation-1.53e-01

Pos logit contributions
buster+0.498
 tort+0.450
 dere+0.443
adr+0.442
 treadmill+0.439
Neg logit contributions
roma-0.491
-0.411
 centimeters-0.397
opez-0.397
 Aden-0.392
Tokenm
Token ID76
Feature activation+6.15e-02

Pos logit contributions
roma+0.686
+0.573
 centimeters+0.559
opez+0.550
 Aden+0.547
Neg logit contributions
buster-0.750
 tort-0.682
adr-0.670
 dere-0.668
 treadmill-0.664
Tokenek
Token ID988
Feature activation+1.09e-02

Pos logit contributions
buster+0.099
 tort+0.090
adr+0.089
 dere+0.088
ETF+0.088
Neg logit contributions
roma-0.079
-0.065
 centimeters-0.064
opez-0.062
 Braun-0.062
Token.
Token ID13
Feature activation-4.83e-01

Pos logit contributions
buster+0.053
 tort+0.048
adr+0.047
 dere+0.047
ETF+0.047
Neg logit contributions
roma-0.049
-0.041
 centimeters-0.040
opez-0.040
 Aden-0.039
Tokencom
Token ID785
Feature activation+1.52e-02

Pos logit contributions
roma+2.190
+1.825
 centimeters+1.781
 Braun+1.767
 Aden+1.756
Neg logit contributions
buster-2.306
 tort-2.128
 treadmill-2.113
 dere-2.076
adr-2.050
Token/
Token ID14
Feature activation-7.79e-01

Pos logit contributions
buster+0.071
 tort+0.064
adr+0.063
 dere+0.063
 treadmill+0.062
Neg logit contributions
roma-0.071
-0.060
 centimeters-0.059
opez-0.058
 Aden-0.057
Tokenthread
Token ID16663
Feature activation-2.78e-01

Pos logit contributions
roma+3.149
+2.551
 centimeters+2.423
opez+2.409
 Braun+2.385
Neg logit contributions
buster-4.133
 tort-3.804
 dere-3.762
 treadmill-3.740
adr-3.696
Token/
Token ID14
Feature activation-1.27e+00

Pos logit contributions
roma+1.243
+1.037
 centimeters+1.008
 Aden+0.991
 Braun+0.990
Neg logit contributions
buster-1.349
 tort-1.247
 treadmill-1.225
 dere-1.221
adr-1.212
Token54
Token ID4051
Feature activation-1.06e-01

Pos logit contributions
roma+4.817
+3.950
 centimeters+3.709
 Braun+3.685
opez+3.674
Neg logit contributions
buster-6.737
 tort-6.425
 treadmill-6.370
 dere-6.258
ETF-6.091
Token33
Token ID2091
Feature activation-4.84e-01

Pos logit contributions
roma+0.396
+0.317
 centimeters+0.309
opez+0.301
 Aden+0.300
Neg logit contributions
buster-0.597
 tort-0.549
adr-0.543
 dere-0.538
ETF-0.535
Token95
Token ID3865
Feature activation-3.36e-01

Pos logit contributions
roma+2.121
+1.767
 centimeters+1.724
opez+1.695
 Braun+1.681
Neg logit contributions
buster-2.408
 tort-2.195
 dere-2.159
adr-2.155
 treadmill-2.149
Token/
Token ID14
Feature activation-5.02e-01

Pos logit contributions
roma+1.501
+1.262
 centimeters+1.229
 Aden+1.206
opez+1.205
Neg logit contributions
buster-1.633
 tort-1.495
 treadmill-1.468
 dere-1.467
adr-1.464
Tokenlost
Token ID33224
Feature activation+4.69e-01

Pos logit contributions
roma+2.329
+1.956
 centimeters+1.896
opez+1.869
 Braun+1.857
Neg logit contributions
buster-2.366
 tort-2.154
 dere-2.114
 treadmill-2.103
adr-2.101
Token Barack
Token ID8732
Feature activation+1.45e-02

Pos logit contributions
buster+1.021
 tort+0.924
adr+0.900
 dere+0.898
 treadmill+0.886
Neg logit contributions
roma-1.174
-0.997
 centimeters-0.981
opez-0.966
 Aden-0.956
Token Hussein
Token ID22983
Feature activation-7.91e-02

Pos logit contributions
buster+0.049
 tort+0.043
adr+0.042
 dere+0.042
 treadmill+0.041
Neg logit contributions
roma-0.087
-0.076
 centimeters-0.075
 Aden-0.074
opez-0.074
Token Obama
Token ID2486
Feature activation-9.15e-02

Pos logit contributions
roma+0.351
+0.290
 centimeters+0.289
opez+0.286
 Aden+0.285
Neg logit contributions
buster-0.396
 tort-0.353
 treadmill-0.346
 dere-0.344
ETF-0.341
TokenWith
Token ID3152
Feature activation-2.41e-01

Pos logit contributions
roma+0.509
+0.440
 centimeters+0.432
 Aden+0.425
opez+0.424
Neg logit contributions
buster-0.351
 tort-0.311
 dere-0.303
adr-0.302
 treadmill-0.301
Token low
Token ID1877
Feature activation-1.06e+00

Pos logit contributions
roma+1.156
+0.971
 centimeters+0.960
 Aden+0.937
 Braun+0.935
Neg logit contributions
buster-1.104
 tort-0.987
adr-0.987
 treadmill-0.967
 dere-0.964
Token birth
Token ID4082
Feature activation-1.27e+00

Pos logit contributions
roma+4.626
 centimeters+3.835
 Butter+3.794
+3.783
opez+3.737
Neg logit contributions
buster-5.114
adr-4.904
 dere-4.682
 tort-4.672
 WARN-4.621
Token rate
Token ID2494
Feature activation-3.16e-01

Pos logit contributions
roma+5.946
 centimeters+4.928
+4.850
 Braun+4.847
opez+4.795
Neg logit contributions
buster-5.795
adr-5.371
ETF-5.245
 dere-5.244
 treadmill-5.149
Token,
Token ID11
Feature activation-7.10e-01

Pos logit contributions
roma+1.475
+1.252
 centimeters+1.228
 Braun+1.214
 Aden+1.211
Neg logit contributions
buster-1.473
adr-1.338
 tort-1.327
 dere-1.323
 treadmill-1.305
Token America
Token ID2253
Feature activation-5.82e-02

Pos logit contributions
roma+3.339
+2.759
 centimeters+2.748
 Aden+2.698
 Butter+2.691
Neg logit contributions
buster-3.260
 tort-2.943
 dere-2.942
adr-2.913
 treadmill-2.897
Token needs
Token ID2476
Feature activation-5.32e-01

Pos logit contributions
roma+0.296
+0.247
 centimeters+0.243
opez+0.241
 Aden+0.240
Neg logit contributions
buster-0.255
 tort-0.234
 dere-0.229
 treadmill-0.222
adr-0.218
Token future
Token ID2003
Feature activation-8.55e-01

Pos logit contributions
roma+2.171
+1.877
 centimeters+1.829
opez+1.726
 Braun+1.714
Neg logit contributions
buster-2.838
adr-2.516
ETF-2.460
 dere-2.451
 tort-2.447
Token coordination
Token ID19877
Feature activation-1.52e-01

Pos logit contributions
roma+1.727
+1.461
 centimeters+1.435
opez+1.407
 Aden+1.403
Neg logit contributions
buster-1.645
 tort-1.483
adr-1.455
 dere-1.450
 treadmill-1.442
Token with
Token ID351
Feature activation-1.50e-01

Pos logit contributions
roma+0.645
+0.533
 centimeters+0.521
opez+0.510
 Aden+0.507
Neg logit contributions
buster-0.782
 tort-0.715
adr-0.703
 dere-0.702
 treadmill-0.698
Token the
Token ID262
Feature activation-2.15e-01

Pos logit contributions
roma+0.647
+0.530
 centimeters+0.517
opez+0.512
 Braun+0.501
Neg logit contributions
buster-0.767
 tort-0.708
 dere-0.692
adr-0.686
 treadmill-0.685
Token Ministry
Token ID9475
Feature activation-4.49e-01

Pos logit contributions
roma+0.921
+0.759
 centimeters+0.739
opez+0.727
 Braun+0.720
Neg logit contributions
buster-1.098
 tort-1.009
 dere-0.988
adr-0.984
 treadmill-0.981
Token of
Token ID286
Feature activation-7.52e-01

Pos logit contributions
roma+2.037
+1.708
 centimeters+1.670
opez+1.637
 Aden+1.631
Neg logit contributions
buster-2.165
 tort-1.968
adr-1.937
 dere-1.930
 treadmill-1.918
Token Foreign
Token ID8708
Feature activation-1.26e+00

Pos logit contributions
roma+2.957
+2.396
 centimeters+2.339
 Aden+2.298
 Butter+2.284
Neg logit contributions
buster-4.078
 tort-3.746
adr-3.696
 dere-3.678
 treadmill-3.671
Token Affairs
Token ID10665
Feature activation-4.06e-01

Pos logit contributions
roma+6.683
+5.750
 Butter+5.616
 centimeters+5.586
 Aden+5.567
Neg logit contributions
buster-4.980
 tort-4.416
 treadmill-4.369
adr-4.320
 dere-4.306
Token.
Token ID13
Feature activation+1.66e-01

Pos logit contributions
roma+1.893
+1.590
 centimeters+1.558
opez+1.530
 Aden+1.522
Neg logit contributions
buster-1.919
 tort-1.742
adr-1.706
 dere-1.705
 treadmill-1.693
Token The
Token ID383
Feature activation+9.83e-02

Pos logit contributions
buster+0.755
 tort+0.678
adr+0.667
 dere+0.664
ETF+0.661
Neg logit contributions
roma-0.809
-0.670
 centimeters-0.661
opez-0.659
 Braun-0.634
Token country
Token ID1499
Feature activation-2.51e-01

Pos logit contributions
buster+0.463
 tort+0.431
 dere+0.420
 treadmill+0.415
adr+0.412
Neg logit contributions
roma-0.461
-0.382
 centimeters-0.371
opez-0.369
 Braun-0.365
Token
Token ID447
Feature activation-3.27e-01

Pos logit contributions
roma+1.231
+1.042
 centimeters+1.022
opez+1.007
 Aden+1.000
Neg logit contributions
buster-1.126
 tort-1.020
 dere-0.995
adr-0.993
 treadmill-0.986
Token parties
Token ID4671
Feature activation+1.62e-01

Pos logit contributions
roma+2.803
+2.368
 centimeters+2.325
 Braun+2.277
opez+2.275
Neg logit contributions
buster-2.679
 tort-2.412
adr-2.373
 dere-2.361
 treadmill-2.350
Token.
Token ID13
Feature activation-3.75e-01

Pos logit contributions
buster+0.747
 tort+0.686
 dere+0.664
adr+0.657
 treadmill+0.656
Neg logit contributions
roma-0.778
-0.658
 centimeters-0.640
opez-0.632
 Aden-0.625
Token
Token ID198
Feature activation-2.30e-01

Pos logit contributions
roma+1.720
+1.442
 centimeters+1.412
opez+1.384
 Aden+1.381
Neg logit contributions
buster-1.798
 tort-1.632
adr-1.603
 dere-1.599
 treadmill-1.590
Token
Token ID198
Feature activation-2.12e-01

Pos logit contributions
roma+1.030
+0.862
 centimeters+0.845
 Braun+0.829
 Aden+0.829
Neg logit contributions
buster-1.115
 tort-1.021
 treadmill-1.005
 dere-0.999
adr-0.999
TokenBig
Token ID12804
Feature activation-5.31e-01

Pos logit contributions
roma+1.045
+0.887
 centimeters+0.870
opez+0.855
 Aden+0.851
Neg logit contributions
buster-0.948
 tort-0.855
adr-0.837
 dere-0.836
 treadmill-0.830
TokenFan
Token ID22480
Feature activation-1.25e+00

Pos logit contributions
roma+2.526
+2.140
 centimeters+2.099
opez+2.057
 Braun+2.055
Neg logit contributions
buster-2.433
 tort-2.198
adr-2.168
 dere-2.154
 treadmill-2.143
Token Profile
Token ID13118
Feature activation-8.64e-01

Pos logit contributions
roma+5.274
+4.357
 centimeters+4.190
 Braun+4.151
 Butter+4.134
Neg logit contributions
buster-6.179
 tort-5.686
 dere-5.642
 treadmill-5.611
adr-5.579
Token Blog
Token ID14001
Feature activation-1.04e+00

Pos logit contributions
roma+3.869
+3.241
 centimeters+3.150
 Braun+3.119
 Butter+3.112
Neg logit contributions
buster-4.178
 tort-3.773
adr-3.723
 dere-3.720
 treadmill-3.685
Token Joined
Token ID16709
Feature activation-8.65e-01

Pos logit contributions
roma+4.137
 Braun+3.396
+3.380
 centimeters+3.339
 Aden+3.319
Neg logit contributions
buster-5.422
 treadmill-4.959
 tort-4.923
ETF-4.879
 dere-4.870
Token December
Token ID3426
Feature activation-1.02e+00

Pos logit contributions
roma+3.880
+3.259
 centimeters+3.253
 Braun+3.234
 Aden+3.230
Neg logit contributions
buster-4.176
adr-3.712
 tort-3.704
 treadmill-3.663
 dere-3.632
Token 2010
Token ID3050
Feature activation-4.63e-01

Pos logit contributions
roma+3.668
 Braun+3.009
 Aden+2.986
+2.975
 centimeters+2.973
Neg logit contributions
buster-5.786
 tort-5.268
 treadmill-5.262
adr-5.220
ETF-5.145
Token down
Token ID866
Feature activation-3.82e-01

Pos logit contributions
roma+1.510
+1.275
 centimeters+1.248
opez+1.228
 Braun+1.219
Neg logit contributions
buster-1.450
 tort-1.314
 dere-1.286
adr-1.284
 treadmill-1.276
Token the
Token ID262
Feature activation-2.52e-01

Pos logit contributions
roma+1.667
+1.380
 centimeters+1.346
opez+1.325
 Braun+1.307
Neg logit contributions
buster-1.927
 tort-1.766
 dere-1.733
adr-1.723
 treadmill-1.716
Token mountain
Token ID8598
Feature activation+1.02e-01

Pos logit contributions
roma+1.301
+1.114
 centimeters+1.094
opez+1.076
 Aden+1.071
Neg logit contributions
buster-1.064
 tort-0.953
adr-0.931
 dere-0.930
 treadmill-0.923
Token,
Token ID11
Feature activation-6.72e-01

Pos logit contributions
buster+0.470
 tort+0.431
 dere+0.420
 treadmill+0.418
adr+0.412
Neg logit contributions
roma-0.492
-0.413
 centimeters-0.401
opez-0.400
 Aden-0.394
Token and
Token ID290
Feature activation-7.67e-01

Pos logit contributions
roma+3.081
+2.618
 centimeters+2.584
 Aden+2.525
 Braun+2.515
Neg logit contributions
buster-3.166
 tort-2.839
adr-2.835
 dere-2.778
ETF-2.776
Token decided
Token ID3066
Feature activation-1.23e+00

Pos logit contributions
roma+3.046
+2.546
 centimeters+2.506
 Aden+2.462
 Braun+2.462
Neg logit contributions
buster-4.043
adr-3.664
 tort-3.640
ETF-3.596
 dere-3.577
Token to
Token ID284
Feature activation-4.57e-01

Pos logit contributions
roma+4.525
+3.672
 centimeters+3.653
 Aden+3.560
 Braun+3.541
Neg logit contributions
buster-6.680
adr-6.191
 tort-6.103
ETF-6.024
 treadmill-6.008
Token wait
Token ID4043
Feature activation-4.55e-01

Pos logit contributions
roma+2.178
+1.845
 centimeters+1.824
 Braun+1.783
 Aden+1.783
Neg logit contributions
buster-2.085
 tort-1.864
adr-1.863
ETF-1.826
 dere-1.820
Token out
Token ID503
Feature activation+3.47e-02

Pos logit contributions
roma+2.158
+1.830
 centimeters+1.813
 Braun+1.762
 Aden+1.762
Neg logit contributions
buster-2.092
 tort-1.880
adr-1.865
 treadmill-1.836
 dere-1.835
Token the
Token ID262
Feature activation+8.30e-02

Pos logit contributions
buster+0.169
 tort+0.155
 dere+0.153
 treadmill+0.150
adr+0.149
Neg logit contributions
roma-0.158
-0.130
 centimeters-0.127
opez-0.126
 Braun-0.123
Token rain
Token ID6290
Feature activation+3.09e-01

Pos logit contributions
buster+0.346
 tort+0.316
 dere+0.309
 treadmill+0.305
adr+0.300
Neg logit contributions
roma-0.433
-0.370
 centimeters-0.360
opez-0.359
 Braun-0.355
Tokencor
Token ID10215
Feature activation+2.56e-01

Pos logit contributions
roma+0.440
+0.369
 centimeters+0.364
opez+0.355
 Aden+0.354
Neg logit contributions
buster-0.483
 tort-0.438
adr-0.432
 dere-0.429
 treadmill-0.424
Tokenvette
Token ID33573
Feature activation+1.93e-04

Pos logit contributions
buster+1.212
adr+1.076
 tort+1.073
 dere+1.050
 treadmill+1.041
Neg logit contributions
roma-1.179
-1.017
 centimeters-1.000
 Butter-0.971
 Aden-0.970
Tokenforum
Token ID27302
Feature activation-2.11e-01

Pos logit contributions
buster+0.001
 tort+0.001
adr+0.001
 dere+0.001
 treadmill+0.001
Neg logit contributions
roma-0.001
-0.001
 centimeters-0.001
 Aden-0.001
 Butter-0.001
Token.
Token ID13
Feature activation-5.68e-01

Pos logit contributions
roma+0.966
+0.821
 centimeters+0.804
 Aden+0.787
 Braun+0.784
Neg logit contributions
buster-1.020
 tort-0.919
adr-0.909
 dere-0.898
 treadmill-0.886
Tokencom
Token ID785
Feature activation-2.25e-01

Pos logit contributions
roma+2.724
+2.292
 centimeters+2.246
opez+2.213
 Braun+2.201
Neg logit contributions
buster-2.596
 tort-2.353
 dere-2.300
adr-2.294
 treadmill-2.294
Token/
Token ID14
Feature activation-1.22e+00

Pos logit contributions
roma+1.027
+0.860
 centimeters+0.840
opez+0.825
 Braun+0.821
Neg logit contributions
buster-1.080
 tort-0.981
adr-0.961
 dere-0.960
 treadmill-0.954
Tokenforums
Token ID37141
Feature activation-4.86e-01

Pos logit contributions
roma+5.360
+4.449
 centimeters+4.282
opez+4.217
 Braun+4.201
Neg logit contributions
buster-5.917
 tort-5.375
 treadmill-5.323
 dere-5.295
adr-5.232
Token...
Token ID986
Feature activation+1.22e-01

Pos logit contributions
roma+2.152
+1.790
 centimeters+1.747
 Braun+1.717
opez+1.707
Neg logit contributions
buster-2.374
 tort-2.195
 treadmill-2.161
 dere-2.148
adr-2.126
Tokenwith
Token ID4480
Feature activation-1.02e-01

Pos logit contributions
buster+0.594
 tort+0.536
adr+0.531
 dere+0.524
ETF+0.520
Neg logit contributions
roma-0.552
-0.463
 centimeters-0.456
 Aden-0.445
opez-0.444
Token-
Token ID12
Feature activation-4.55e-01

Pos logit contributions
roma+0.471
+0.397
 centimeters+0.389
 Aden+0.382
opez+0.382
Neg logit contributions
buster-0.483
 tort-0.438
adr-0.431
 dere-0.428
 treadmill-0.423
Tokenp
Token ID79
Feature activation-1.37e-01

Pos logit contributions
roma+2.037
+1.699
 centimeters+1.658
opez+1.627
 Braun+1.615
Neg logit contributions
buster-2.224
 tort-2.029
 dere-1.990
adr-1.987
 treadmill-1.982
Token cl
Token ID537
Feature activation-8.48e-02

Pos logit contributions
roma+1.563
+1.338
 centimeters+1.316
opez+1.292
 Aden+1.288
Neg logit contributions
buster-1.274
 tort-1.133
adr-1.115
 dere-1.106
 treadmill-1.099
Tokenunky
Token ID28898
Feature activation-2.36e-01

Pos logit contributions
roma+0.382
+0.323
 centimeters+0.316
 Braun+0.308
 Aden+0.307
Neg logit contributions
buster-0.413
 tort-0.374
adr-0.369
 dere-0.367
 treadmill-0.365
Token acting
Token ID7205
Feature activation-2.94e-01

Pos logit contributions
roma+1.164
+0.988
 centimeters+0.969
opez+0.953
 Aden+0.948
Neg logit contributions
buster-1.054
 tort-0.951
adr-0.930
 dere-0.929
 treadmill-0.923
Token and
Token ID290
Feature activation-2.40e-01

Pos logit contributions
roma+1.412
+1.195
 centimeters+1.170
opez+1.151
 Aden+1.145
Neg logit contributions
buster-1.344
 tort-1.216
 dere-1.189
adr-1.189
 treadmill-1.181
Token weak
Token ID4939
Feature activation-4.63e-01

Pos logit contributions
roma+1.214
+1.037
 centimeters+1.014
opez+1.000
 Aden+0.995
Neg logit contributions
buster-1.043
 tort-0.945
 dere-0.922
adr-0.916
 treadmill-0.914
Token plot
Token ID7110
Feature activation-1.22e+00

Pos logit contributions
roma+2.324
+1.979
 centimeters+1.943
opez+1.910
 Braun+1.901
Neg logit contributions
buster-2.022
 tort-1.819
adr-1.781
 dere-1.777
 treadmill-1.765
Token gets
Token ID3011
Feature activation-2.20e-01

Pos logit contributions
roma+5.212
+4.366
 centimeters+4.352
 Braun+4.180
 Butter+4.132
Neg logit contributions
buster-5.846
adr-5.341
 dere-5.300
ETF-5.298
 tort-5.281
Token rave
Token ID46242
Feature activation-1.52e-01

Pos logit contributions
roma+1.041
+0.878
 centimeters+0.860
opez+0.845
 Aden+0.840
Neg logit contributions
buster-1.025
 tort-0.929
adr-0.910
 dere-0.909
 treadmill-0.903
Token reviews
Token ID8088
Feature activation-4.81e-01

Pos logit contributions
roma+0.462
+0.348
 centimeters+0.336
opez+0.330
 Aden+0.323
Neg logit contributions
buster-0.971
 tort-0.904
 dere-0.888
adr-0.886
 treadmill-0.884
Token in
Token ID287
Feature activation-1.88e-01

Pos logit contributions
roma+2.199
+1.842
 centimeters+1.814
 Braun+1.773
 Aden+1.769
Neg logit contributions
buster-2.299
 tort-2.080
adr-2.055
 dere-2.046
 treadmill-2.037
Token the
Token ID262
Feature activation+1.37e-01

Pos logit contributions
roma+0.829
+0.691
 centimeters+0.671
opez+0.666
 Braun+0.654
Neg logit contributions
buster-0.932
 tort-0.861
 dere-0.839
 treadmill-0.833
adr-0.832
Token to
Token ID284
Feature activation-4.53e-02

Pos logit contributions
buster+1.946
 tort+1.785
 dere+1.764
adr+1.726
 treadmill+1.710
Neg logit contributions
roma-2.016
-1.689
 centimeters-1.638
opez-1.637
 Aden-1.615
Token this
Token ID428
Feature activation+4.35e-01

Pos logit contributions
roma+0.211
+0.176
 centimeters+0.173
opez+0.170
 Braun+0.168
Neg logit contributions
buster-0.214
 tort-0.196
 dere-0.192
adr-0.191
 treadmill-0.190
Token pres
Token ID906
Feature activation-4.73e-01

Pos logit contributions
buster+1.831
 tort+1.688
 dere+1.664
adr+1.616
 treadmill+1.614
Neg logit contributions
roma-2.252
-1.903
 centimeters-1.870
opez-1.840
 Braun-1.808
Token­
Token ID3907
Feature activation+1.37e+00

Pos logit contributions
roma+2.529
+2.178
 centimeters+2.139
opez+2.107
 Aden+2.098
Neg logit contributions
buster-1.907
 tort-1.702
adr-1.659
 dere-1.658
 treadmill-1.646
Tokenid
Token ID312
Feature activation+6.70e-01

Pos logit contributions
buster+5.402
adr+4.577
 tort+4.419
ETF+4.389
 dere+4.345
Neg logit contributions
roma-7.005
-6.321
 centimeters-6.288
-6.177
 Butter-6.128
Token­
Token ID3907
Feature activation+1.13e+00

Pos logit contributions
buster+2.521
 tort+2.236
adr+2.209
 dere+2.178
ETF+2.122
Neg logit contributions
roma-3.734
-3.249
 centimeters-3.228
opez-3.151
 Aden-3.116
Tokenent
Token ID298
Feature activation+3.00e-01

Pos logit contributions
buster+3.740
adr+3.105
 tort+3.016
 dere+2.957
 treadmill+2.780
Neg logit contributions
roma-6.697
-6.111
 centimeters-6.105
-6.010
 Butter-5.917
Token,
Token ID11
Feature activation+2.00e-01

Pos logit contributions
buster+1.390
 tort+1.275
 dere+1.254
adr+1.232
 treadmill+1.222
Neg logit contributions
roma-1.435
-1.197
 centimeters-1.178
opez-1.158
 Aden-1.134
Token
Token ID447
Feature activation+3.37e-01

Pos logit contributions
buster+0.917
 tort+0.842
 dere+0.830
adr+0.812
 treadmill+0.807
Neg logit contributions
roma-0.966
-0.809
 centimeters-0.794
opez-0.787
 Braun-0.767
Token
Token ID251
Feature activation+1.33e-01

Pos logit contributions
buster+1.451
 tort+1.298
adr+1.287
 dere+1.272
ETF+1.248
Neg logit contributions
roma-1.702
-1.440
 centimeters-1.436
opez-1.406
 Aden-1.387
Token John
Token ID1757
Feature activation+8.05e-02

Pos logit contributions
buster+0.608
 tort+0.553
 dere+0.541
adr+0.539
 treadmill+0.532
Neg logit contributions
roma-0.645
-0.542
 centimeters-0.534
opez-0.523
 Braun-0.518
Token are
Token ID389
Feature activation-1.16e-01

Pos logit contributions
roma+0.122
+0.104
 centimeters+0.101
opez+0.100
 Aden+0.100
Neg logit contributions
buster-0.101
 tort-0.092
 dere-0.089
adr-0.089
 treadmill-0.088
Token a
Token ID257
Feature activation-8.22e-02

Pos logit contributions
roma+0.587
+0.500
 centimeters+0.487
opez+0.482
 Braun+0.479
Neg logit contributions
buster-0.504
 tort-0.457
 dere-0.445
adr-0.443
 treadmill-0.442
Token lot
Token ID1256
Feature activation+1.77e-01

Pos logit contributions
roma+0.339
+0.278
 centimeters+0.271
opez+0.267
 Aden+0.266
Neg logit contributions
buster-0.431
 tort-0.398
adr-0.389
 dere-0.389
 treadmill-0.387
Token of
Token ID286
Feature activation-1.90e-01

Pos logit contributions
buster+0.785
 tort+0.727
 dere+0.708
 treadmill+0.695
adr+0.691
Neg logit contributions
roma-0.871
-0.732
opez-0.705
 centimeters-0.704
 Aden-0.702
Token p
Token ID279
Feature activation-3.18e-01

Pos logit contributions
roma+0.963
+0.819
 centimeters+0.798
opez+0.789
 Aden+0.784
Neg logit contributions
buster-0.820
 tort-0.745
 dere-0.728
 treadmill-0.721
adr-0.721
Tokenoth
Token ID849
Feature activation+8.40e-01

Pos logit contributions
roma+1.256
+1.019
 centimeters+0.995
opez+0.978
 Braun+0.967
Neg logit contributions
buster-1.726
 tort-1.588
 dere-1.559
adr-1.558
 treadmill-1.552
Tokenoles
Token ID4316
Feature activation+1.19e-01

Pos logit contributions
buster+3.376
 tort+3.046
adr+2.995
 dere+2.896
laws+2.859
Neg logit contributions
roma-4.304
-3.778
 Braun-3.675
 centimeters-3.619
 Aden-3.610
Token.
Token ID13
Feature activation+1.84e-01

Pos logit contributions
buster+0.544
 tort+0.498
 dere+0.485
adr+0.478
 treadmill+0.476
Neg logit contributions
roma-0.577
-0.486
 centimeters-0.469
opez-0.469
 Aden-0.465
Token
Token ID447
Feature activation+1.08e-01

Pos logit contributions
buster+0.843
 tort+0.765
 dere+0.746
adr+0.745
ETF+0.736
Neg logit contributions
roma-0.885
-0.743
 centimeters-0.724
opez-0.722
 Braun-0.706
Token
Token ID251
Feature activation+1.00e-01

Pos logit contributions
buster+0.468
 tort+0.421
adr+0.413
 dere+0.410
 treadmill+0.405
Neg logit contributions
roma-0.539
-0.458
 centimeters-0.450
opez-0.445
 Aden-0.442
Token (
Token ID357
Feature activation+1.56e-01

Pos logit contributions
buster+0.462
 tort+0.418
adr+0.408
 dere+0.408
ETF+0.403
Neg logit contributions
roma-0.480
-0.402
 centimeters-0.393
opez-0.389
 Braun-0.384
Token the
Token ID262
Feature activation+6.59e-02

Pos logit contributions
roma+1.583
+1.333
 centimeters+1.300
opez+1.280
 Braun+1.268
Neg logit contributions
buster-1.589
 tort-1.449
 dere-1.419
adr-1.407
 treadmill-1.403
Token crisis
Token ID4902
Feature activation+1.60e-01

Pos logit contributions
buster+0.297
 tort+0.274
 dere+0.267
 treadmill+0.264
adr+0.263
Neg logit contributions
roma-0.322
-0.273
 centimeters-0.264
opez-0.262
 Braun-0.258
Token to
Token ID284
Feature activation+3.46e-01

Pos logit contributions
buster+0.788
 tort+0.728
 dere+0.710
 treadmill+0.701
adr+0.691
Neg logit contributions
roma-0.716
-0.599
opez-0.574
 centimeters-0.573
 Aden-0.563
Token a
Token ID257
Feature activation+7.38e-01

Pos logit contributions
buster+1.426
 tort+1.333
 dere+1.272
 treadmill+1.250
adr+1.249
Neg logit contributions
roma-1.804
-1.530
opez-1.494
 centimeters-1.478
 Butter-1.454
Token new
Token ID649
Feature activation+2.79e-01

Pos logit contributions
buster+3.147
 tort+3.023
 dere+2.883
 treadmill+2.832
adr+2.709
Neg logit contributions
roma-3.681
opez-3.128
-3.122
 centimeters-2.972
 Butter-2.950
Token level
Token ID1241
Feature activation+9.81e-01

Pos logit contributions
buster+1.084
 tort+0.985
 dere+0.954
 treadmill+0.940
adr+0.935
Neg logit contributions
roma-1.534
-1.315
opez-1.289
 centimeters-1.288
 Aden-1.267
Token
Token ID851
Feature activation-2.54e-01

Pos logit contributions
buster+4.172
 tort+3.818
 dere+3.657
adr+3.617
 treadmill+3.560
Neg logit contributions
roma-4.909
-4.105
opez-4.052
 centimeters-4.002
 Aden-3.927
Token it
Token ID340
Feature activation+1.73e-01

Pos logit contributions
roma+1.196
+1.008
 centimeters+0.988
opez+0.970
 Aden+0.965
Neg logit contributions
buster-1.184
 tort-1.073
adr-1.051
 dere-1.050
 treadmill-1.044
Token would
Token ID561
Feature activation+7.21e-02

Pos logit contributions
buster+0.866
 tort+0.797
 dere+0.778
adr+0.773
 treadmill+0.764
Neg logit contributions
roma-0.764
-0.634
 centimeters-0.617
opez-0.611
 Aden-0.604
Token be
Token ID307
Feature activation+9.84e-03

Pos logit contributions
buster+0.330
 tort+0.299
 dere+0.293
adr+0.290
 treadmill+0.289
Neg logit contributions
roma-0.349
-0.296
 centimeters-0.287
opez-0.285
 Aden-0.283
Token impossible
Token ID5340
Feature activation-1.14e-01

Pos logit contributions
buster+0.046
 tort+0.042
 dere+0.041
adr+0.040
 treadmill+0.040
Neg logit contributions
roma-0.047
-0.040
 centimeters-0.038
opez-0.038
 Aden-0.038
Token about
Token ID546
Feature activation+1.46e-01

Pos logit contributions
buster+0.251
 tort+0.230
 dere+0.223
 treadmill+0.220
adr+0.220
Neg logit contributions
roma-0.266
-0.225
 centimeters-0.218
opez-0.217
 Aden-0.213
Token Anna
Token ID11735
Feature activation-1.99e-01

Pos logit contributions
buster+0.720
 tort+0.664
 dere+0.647
adr+0.642
 treadmill+0.642
Neg logit contributions
roma-0.655
-0.542
 centimeters-0.527
opez-0.521
 Aden-0.514
Token but
Token ID475
Feature activation-1.98e-01

Pos logit contributions
roma+0.936
+0.788
 centimeters+0.769
opez+0.757
 Aden+0.751
Neg logit contributions
buster-0.932
 tort-0.850
 dere-0.828
adr-0.825
 treadmill-0.821
Token these
Token ID777
Feature activation+6.01e-02

Pos logit contributions
roma+0.865
+0.718
 centimeters+0.702
opez+0.688
 Aden+0.684
Neg logit contributions
buster-0.989
 tort-0.903
adr-0.885
 dere-0.885
 treadmill-0.879
Token caught
Token ID4978
Feature activation+4.34e-01

Pos logit contributions
buster+0.269
 tort+0.245
 dere+0.239
adr+0.238
 treadmill+0.236
Neg logit contributions
roma-0.296
-0.250
 centimeters-0.243
opez-0.241
 Aden-0.239
Token me
Token ID502
Feature activation+8.30e-01

Pos logit contributions
buster+2.145
 tort+2.016
 dere+1.958
 treadmill+1.905
adr+1.903
Neg logit contributions
roma-1.918
-1.575
opez-1.515
 centimeters-1.506
 Aden-1.502
Token off
Token ID572
Feature activation-1.66e-01

Pos logit contributions
buster+2.959
 tort+2.730
 dere+2.619
 treadmill+2.494
adr+2.486
Neg logit contributions
roma-4.781
-4.106
opez-4.017
 Aden-3.991
 centimeters-3.940
Token guard
Token ID4860
Feature activation+2.99e-01

Pos logit contributions
roma+0.694
+0.566
 centimeters+0.552
opez+0.544
 Aden+0.539
Neg logit contributions
buster-0.871
 tort-0.801
 dere-0.787
adr-0.781
 treadmill-0.781
Token.
Token ID13
Feature activation-3.75e-02

Pos logit contributions
buster+1.393
 tort+1.282
 dere+1.237
 treadmill+1.226
adr+1.221
Neg logit contributions
roma-1.422
-1.185
opez-1.149
 centimeters-1.144
 Aden-1.132
Token Ol
Token ID6544
Feature activation-3.70e-02

Pos logit contributions
roma+0.173
+0.144
 centimeters+0.141
opez+0.139
 Braun+0.136
Neg logit contributions
buster-0.180
 tort-0.164
adr-0.160
 dere-0.159
ETF-0.157
Tokenaf
Token ID1878
Feature activation-6.56e-01

Pos logit contributions
roma+0.217
+0.190
 centimeters+0.187
opez+0.184
 Braun+0.183
Neg logit contributions
buster-0.131
 tort-0.115
adr-0.112
 dere-0.111
 treadmill-0.110
Token basically
Token ID6209
Feature activation+9.46e-02

Pos logit contributions
roma+0.804
+0.677
 centimeters+0.664
opez+0.651
 Aden+0.648
Neg logit contributions
buster-0.805
 tort-0.730
adr-0.715
 dere-0.714
 treadmill-0.709
Token reconnect
Token ID37671
Feature activation+2.88e-01

Pos logit contributions
buster+0.427
 tort+0.389
 dere+0.379
adr+0.377
 treadmill+0.376
Neg logit contributions
roma-0.462
-0.390
 centimeters-0.381
opez-0.376
 Aden-0.373
Tokening
Token ID278
Feature activation+6.91e-01

Pos logit contributions
buster+1.260
 tort+1.151
 dere+1.115
adr+1.109
 treadmill+1.105
Neg logit contributions
roma-1.444
-1.212
 centimeters-1.187
opez-1.181
 Aden-1.173
Token with
Token ID351
Feature activation+1.58e-01

Pos logit contributions
buster+3.114
 tort+2.877
 dere+2.810
 treadmill+2.750
adr+2.748
Neg logit contributions
roma-3.368
-2.780
opez-2.693
 centimeters-2.644
 Aden-2.614
Token each
Token ID1123
Feature activation+5.27e-01

Pos logit contributions
buster+0.700
 tort+0.638
 dere+0.624
adr+0.619
 treadmill+0.616
Neg logit contributions
roma-0.787
-0.663
 centimeters-0.646
opez-0.641
 Aden-0.630
Token other
Token ID584
Feature activation+1.00e+00

Pos logit contributions
buster+2.299
 tort+2.099
 dere+2.046
 treadmill+2.027
adr+2.012
Neg logit contributions
roma-2.639
-2.215
 centimeters-2.173
opez-2.164
 Aden-2.142
Token
Token ID447
Feature activation+3.98e-02

Pos logit contributions
buster+4.237
 tort+3.982
 dere+3.789
adr+3.703
 treadmill+3.681
Neg logit contributions
roma-4.977
-4.187
opez-4.049
 centimeters-3.968
 Aden-3.953
Token
Token ID247
Feature activation-2.44e-02

Pos logit contributions
buster+0.174
 tort+0.157
adr+0.154
 dere+0.153
 treadmill+0.150
Neg logit contributions
roma-0.198
-0.168
 centimeters-0.165
opez-0.163
 Aden-0.162
Tokens
Token ID82
Feature activation+5.91e-02

Pos logit contributions
roma+0.113
+0.094
 centimeters+0.092
opez+0.091
 Aden+0.090
Neg logit contributions
buster-0.116
 tort-0.106
adr-0.103
 dere-0.103
 treadmill-0.102
Token bodies
Token ID5920
Feature activation+1.39e-01

Pos logit contributions
buster+0.275
 tort+0.251
 dere+0.245
adr+0.245
 treadmill+0.242
Neg logit contributions
roma-0.280
-0.235
 centimeters-0.229
opez-0.227
 Aden-0.226
Token outside
Token ID2354
Feature activation+3.53e-02

Pos logit contributions
buster+0.647
 tort+0.588
adr+0.573
 dere+0.573
 treadmill+0.567
Neg logit contributions
roma-0.665
-0.559
 centimeters-0.542
opez-0.538
 Aden-0.534
Token early
Token ID1903
Feature activation+7.39e-01

Pos logit contributions
roma+0.553
+0.473
 centimeters+0.465
opez+0.456
 Aden+0.455
Neg logit contributions
buster-0.455
 tort-0.406
adr-0.399
 dere-0.395
 treadmill-0.394
Token to
Token ID284
Feature activation+6.96e-01

Pos logit contributions
buster+3.547
 tort+3.239
 dere+3.154
 treadmill+3.135
adr+3.134
Neg logit contributions
roma-3.382
-2.733
opez-2.695
 Aden-2.673
 Braun-2.670
Token say
Token ID910
Feature activation+6.82e-01

Pos logit contributions
buster+3.124
 tort+2.868
 dere+2.796
 treadmill+2.746
adr+2.697
Neg logit contributions
roma-3.402
-2.907
opez-2.784
 Braun-2.758
 Aden-2.739
Token
Token ID447
Feature activation+9.84e-02

Pos logit contributions
buster+2.978
 tort+2.746
 dere+2.688
 treadmill+2.607
adr+2.597
Neg logit contributions
roma-3.411
-2.862
opez-2.779
 centimeters-2.747
 Braun-2.712
Token
Token ID251
Feature activation+1.34e-01

Pos logit contributions
buster+0.467
 tort+0.423
adr+0.417
 dere+0.414
ETF+0.407
Neg logit contributions
roma-0.454
-0.380
 centimeters-0.374
opez-0.370
 Aden-0.365
Token what
Token ID644
Feature activation+1.31e-01

Pos logit contributions
buster+0.632
 tort+0.574
 dere+0.561
adr+0.559
 treadmill+0.553
Neg logit contributions
roma-0.628
-0.528
 centimeters-0.518
opez-0.508
 Braun-0.505
Token the
Token ID262
Feature activation+2.08e-01

Pos logit contributions
buster+0.629
 tort+0.578
 dere+0.563
adr+0.559
 treadmill+0.554
Neg logit contributions
roma-0.606
-0.504
 centimeters-0.491
opez-0.488
 Aden-0.479
Token worst
Token ID5290
Feature activation+4.67e-01

Pos logit contributions
buster+0.954
 tort+0.881
 dere+0.858
 treadmill+0.845
adr+0.841
Neg logit contributions
roma-0.996
-0.838
opez-0.813
 centimeters-0.813
 Aden-0.797
Token outcome
Token ID8055
Feature activation+3.38e-01

Pos logit contributions
buster+2.161
 tort+1.992
 dere+1.937
 treadmill+1.873
adr+1.859
Neg logit contributions
roma-2.229
-1.864
opez-1.825
 centimeters-1.805
 Aden-1.775
Token could
Token ID714
Feature activation+1.17e-01

Pos logit contributions
buster+1.617
 tort+1.483
 dere+1.438
adr+1.420
ETF+1.404
Neg logit contributions
roma-1.552
-1.295
 centimeters-1.267
opez-1.258
 Braun-1.230
Token be
Token ID307
Feature activation-8.73e-02

Pos logit contributions
buster+0.494
 tort+0.444
 dere+0.432
adr+0.427
 treadmill+0.425
Neg logit contributions
roma-0.605
-0.521
 centimeters-0.511
opez-0.505
 Aden-0.502
Token |
Token ID930
Feature activation+9.05e-02

Pos logit contributions
roma+0.238
+0.201
 centimeters+0.198
opez+0.195
 Braun+0.193
Neg logit contributions
buster-0.218
 tort-0.197
adr-0.191
 dere-0.191
 treadmill-0.188
Token C
Token ID327
Feature activation+5.84e-01

Pos logit contributions
buster+0.442
 tort+0.407
 dere+0.397
adr+0.393
 treadmill+0.392
Neg logit contributions
roma-0.407
-0.338
 centimeters-0.331
opez-0.327
 Aden-0.320
Tokenaj
Token ID1228
Feature activation-8.50e-02

Pos logit contributions
buster+2.080
 tort+1.867
adr+1.831
 dere+1.795
ETF+1.781
Neg logit contributions
roma-3.267
-2.899
 centimeters-2.894
 Aden-2.825
 Braun-2.811
Tokenun
Token ID403
Feature activation-9.61e-02

Pos logit contributions
roma+0.339
+0.279
 centimeters+0.275
 Aden+0.265
 Braun+0.264
Neg logit contributions
buster-0.459
 tort-0.420
adr-0.414
 dere-0.411
 treadmill-0.407
Token Kitchen
Token ID22541
Feature activation-2.43e-01

Pos logit contributions
roma+0.422
+0.351
 centimeters+0.344
opez+0.339
 Aden+0.334
Neg logit contributions
buster-0.481
 tort-0.442
 dere-0.431
adr-0.427
 treadmill-0.426
Token
Token ID198
Feature activation+4.20e-01

Pos logit contributions
roma+1.191
+1.010
 centimeters+0.991
opez+0.974
 Aden+0.969
Neg logit contributions
buster-1.094
 tort-0.988
adr-0.966
 dere-0.965
 treadmill-0.958
Token
Token ID198
Feature activation+1.49e-01

Pos logit contributions
buster+1.842
 tort+1.628
adr+1.601
ETF+1.590
 dere+1.588
Neg logit contributions
roma-2.119
-1.790
 centimeters-1.753
opez-1.749
 Aden-1.702
TokenC
Token ID34
Feature activation+4.67e-02

Pos logit contributions
buster+0.657
 tort+0.588
adr+0.580
 dere+0.574
ETF+0.573
Neg logit contributions
roma-0.743
-0.628
 centimeters-0.625
 Aden-0.612
opez-0.611
Tokenaj
Token ID1228
Feature activation-4.50e-01

Pos logit contributions
buster+0.182
 tort+0.162
adr+0.159
 dere+0.157
 treadmill+0.156
Neg logit contributions
roma-0.255
-0.221
 centimeters-0.218
 Aden-0.214
 Braun-0.214
Tokenun
Token ID403
Feature activation-4.26e-01

Pos logit contributions
roma+1.732
+1.396
 centimeters+1.360
opez+1.330
 Aden+1.321
Neg logit contributions
buster-2.492
 tort-2.297
adr-2.256
 dere-2.256
 treadmill-2.244
Token Kitchen
Token ID22541
Feature activation-1.36e-01

Pos logit contributions
roma+1.942
+1.625
 centimeters+1.591
opez+1.561
 Aden+1.553
Neg logit contributions
buster-2.054
 tort-1.867
adr-1.832
 dere-1.829
 treadmill-1.818
Token
Token ID198
Feature activation-1.49e-01

Pos logit contributions
roma+0.659
+0.552
 centimeters+0.543
opez+0.535
 Braun+0.527
Neg logit contributions
buster-0.640
 tort-0.578
adr-0.568
 dere-0.566
ETF-0.561
Token
Token ID198
Feature activation-1.11e-01

Pos logit contributions
roma+0.678
+0.570
 centimeters+0.556
 Aden+0.545
 Braun+0.545
Neg logit contributions
buster-0.714
 tort-0.652
 treadmill-0.639
 dere-0.639
adr-0.637
TokenSo
Token ID2396
Feature activation+9.71e-03

Pos logit contributions
roma+0.562
+0.480
 centimeters+0.471
opez+0.463
 Aden+0.461
Neg logit contributions
buster-0.479
 tort-0.430
adr-0.421
 dere-0.420
 treadmill-0.417
Token far
Token ID1290
Feature activation+3.16e-01

Pos logit contributions
buster+0.043
 tort+0.039
 dere+0.038
adr+0.038
 treadmill+0.038
Neg logit contributions
roma-0.048
-0.041
 centimeters-0.040
opez-0.039
 Braun-0.039
Token,
Token ID11
Feature activation-4.76e-03

Pos logit contributions
buster+1.350
 tort+1.231
 dere+1.208
 treadmill+1.184
adr+1.179
Neg logit contributions
roma-1.611
-1.369
 centimeters-1.334
opez-1.316
 Braun-1.287
Token only
Token ID691
Feature activation+3.86e-01

Pos logit contributions
roma+0.022
+0.018
 centimeters+0.018
opez+0.017
 Braun+0.017
Neg logit contributions
buster-0.023
 tort-0.021
 dere-0.021
 treadmill-0.020
adr-0.020
Token California
Token ID3442
Feature activation-3.68e-03

Pos logit contributions
buster+1.731
 tort+1.606
 dere+1.566
 treadmill+1.551
adr+1.532
Neg logit contributions
roma-1.881
-1.592
 centimeters-1.522
opez-1.518
 Braun-1.491
Token has
Token ID468
Feature activation+5.45e-01

Pos logit contributions
roma+0.020
+0.017
 centimeters+0.017
opez+0.016
 Braun+0.016
Neg logit contributions
buster-0.015
 tort-0.013
 dere-0.013
adr-0.013
 treadmill-0.013
Token adopted
Token ID8197
Feature activation+3.48e-01

Pos logit contributions
buster+2.516
 tort+2.353
 dere+2.328
 treadmill+2.274
adr+2.223
Neg logit contributions
roma-2.565
-2.152
opez-2.083
 centimeters-2.055
-2.003
Token the
Token ID262
Feature activation+8.01e-02

Pos logit contributions
buster+1.595
 tort+1.468
 dere+1.425
 treadmill+1.419
adr+1.395
Neg logit contributions
roma-1.666
-1.407
 centimeters-1.358
opez-1.357
 Aden-1.317
Token bills
Token ID9024
Feature activation-2.32e-02

Pos logit contributions
buster+0.352
 tort+0.319
 dere+0.311
adr+0.310
 treadmill+0.309
Neg logit contributions
roma-0.401
-0.341
 centimeters-0.333
opez-0.329
 Braun-0.325
Tokenaring
Token ID1723
Feature activation+7.27e-03

Pos logit contributions
buster+5.253
adr+4.746
 tort+4.655
 treadmill+4.562
 dere+4.475
Neg logit contributions
roma-5.546
-4.901
 Butter-4.758
 centimeters-4.726
opez-4.712
Token such
Token ID884
Feature activation-1.45e-02

Pos logit contributions
buster+0.035
 tort+0.032
 dere+0.032
 treadmill+0.032
adr+0.031
Neg logit contributions
roma-0.033
-0.028
opez-0.027
 centimeters-0.027
 Braun-0.026
Token different
Token ID1180
Feature activation+3.99e-01

Pos logit contributions
roma+0.068
+0.058
 centimeters+0.056
opez+0.055
 Braun+0.055
Neg logit contributions
buster-0.068
 tort-0.062
 dere-0.060
adr-0.060
 treadmill-0.060
Token molecules
Token ID17745
Feature activation+3.45e-01

Pos logit contributions
buster+1.890
 tort+1.739
 treadmill+1.682
 dere+1.667
adr+1.656
Neg logit contributions
roma-1.887
-1.570
opez-1.522
 centimeters-1.491
 Aden-1.485
Token as
Token ID355
Feature activation+3.03e-01

Pos logit contributions
buster+1.573
 tort+1.435
 dere+1.385
 treadmill+1.383
adr+1.377
Neg logit contributions
roma-1.673
-1.417
opez-1.367
 centimeters-1.362
 Braun-1.340
Token minerals
Token ID21622
Feature activation+5.12e-02

Pos logit contributions
buster+1.380
 tort+1.264
 treadmill+1.228
 dere+1.226
adr+1.222
Neg logit contributions
roma-1.465
-1.239
opez-1.191
 centimeters-1.182
 Braun-1.169
Token and
Token ID290
Feature activation-1.31e-02

Pos logit contributions
buster+0.233
 tort+0.211
 dere+0.206
adr+0.205
 treadmill+0.205
Neg logit contributions
roma-0.249
-0.210
 centimeters-0.204
opez-0.202
 Braun-0.201
Token organ
Token ID1618
Feature activation-4.35e-01

Pos logit contributions
roma+0.063
+0.053
 centimeters+0.051
opez+0.051
 Braun+0.050
Neg logit contributions
buster-0.060
 tort-0.055
 dere-0.054
adr-0.053
 treadmill-0.053
Tokenics
Token ID873
Feature activation-3.29e-01

Pos logit contributions
roma+1.788
+1.470
 centimeters+1.431
opez+1.406
 Braun+1.395
Neg logit contributions
buster-2.290
 tort-2.099
 dere-2.062
adr-2.062
 treadmill-2.049
Token from
Token ID422
Feature activation-1.71e-01

Pos logit contributions
roma+1.548
+1.307
 centimeters+1.292
 Aden+1.261
 Braun+1.257
Neg logit contributions
buster-1.530
 tort-1.380
adr-1.361
 dere-1.356
 treadmill-1.345
Token the
Token ID262
Feature activation-2.15e-01

Pos logit contributions
roma+0.796
+0.669
 centimeters+0.658
opez+0.644
 Aden+0.642
Neg logit contributions
buster-0.807
 tort-0.730
adr-0.717
 dere-0.715
 treadmill-0.710
Tokenwith
Token ID4480
Feature activation+1.56e-01

Pos logit contributions
roma+0.719
+0.601
 centimeters+0.589
opez+0.578
 Aden+0.575
Neg logit contributions
buster-0.767
 tort-0.698
adr-0.684
 dere-0.683
 treadmill-0.679
Token e
Token ID304
Feature activation-2.63e-01

Pos logit contributions
buster+0.693
 tort+0.634
 dere+0.617
adr+0.616
 treadmill+0.613
Neg logit contributions
roma-0.764
-0.651
 centimeters-0.633
opez-0.627
 Aden-0.620
Tokenthereum
Token ID17733
Feature activation-2.47e-01

Pos logit contributions
roma+1.127
+0.931
 centimeters+0.911
opez+0.893
 Aden+0.888
Neg logit contributions
buster-1.338
 tort-1.225
adr-1.203
 dere-1.200
 treadmill-1.192
Token)
Token ID8
Feature activation-1.65e-01

Pos logit contributions
roma+1.212
+1.027
 centimeters+1.007
opez+0.991
 Aden+0.987
Neg logit contributions
buster-1.108
 tort-1.002
adr-0.980
 dere-0.978
 treadmill-0.973
Token organization
Token ID4009
Feature activation-2.40e-01

Pos logit contributions
roma+0.849
+0.726
 centimeters+0.712
opez+0.701
 Aden+0.697
Neg logit contributions
buster-0.704
 tort-0.634
 dere-0.619
adr-0.617
 treadmill-0.614
Token aimed
Token ID8998
Feature activation+3.13e-01

Pos logit contributions
roma+1.108
+0.931
 centimeters+0.911
opez+0.894
 Aden+0.890
Neg logit contributions
buster-1.145
 tort-1.039
adr-1.019
 dere-1.017
 treadmill-1.012
Token at
Token ID379
Feature activation-5.48e-02

Pos logit contributions
buster+1.274
 tort+1.152
 dere+1.120
adr+1.110
 treadmill+1.102
Neg logit contributions
roma-1.653
-1.407
 centimeters-1.382
 Braun-1.371
opez-1.370
Token taking
Token ID2263
Feature activation-3.53e-01

Pos logit contributions
roma+0.281
+0.240
 centimeters+0.235
opez+0.232
 Braun+0.230
Neg logit contributions
buster-0.233
 tort-0.211
 dere-0.206
adr-0.205
 treadmill-0.204
Token on
Token ID319
Feature activation+9.03e-02

Pos logit contributions
roma+1.701
+1.439
 centimeters+1.412
opez+1.386
 Aden+1.382
Neg logit contributions
buster-1.614
 tort-1.457
adr-1.429
 dere-1.424
 treadmill-1.416
Token Uber
Token ID12024
Feature activation-7.14e-02

Pos logit contributions
buster+0.417
 tort+0.383
 dere+0.376
 treadmill+0.371
adr+0.370
Neg logit contributions
roma-0.431
-0.361
 centimeters-0.355
opez-0.349
 Aden-0.343
Token first
Token ID717
Feature activation+4.06e-03

Pos logit contributions
roma+0.348
+0.293
 centimeters+0.289
opez+0.284
 Aden+0.283
Neg logit contributions
buster-0.323
 tort-0.292
 dere-0.285
adr-0.285
 treadmill-0.283
Token like
Token ID588
Feature activation+1.40e-01

Pos logit contributions
buster+0.839
 tort+0.765
 dere+0.747
adr+0.738
 treadmill+0.737
Neg logit contributions
roma-0.859
-0.721
 centimeters-0.699
opez-0.694
 Aden-0.686
Token transaction
Token ID8611
Feature activation-3.57e-01

Pos logit contributions
buster+0.681
 tort+0.625
 dere+0.610
adr+0.608
 treadmill+0.606
Neg logit contributions
roma-0.635
-0.527
opez-0.511
 centimeters-0.510
 Aden-0.499
Token cut
Token ID2005
Feature activation-2.48e-02

Pos logit contributions
roma+1.761
+1.496
 centimeters+1.465
opez+1.443
 Aden+1.434
Neg logit contributions
buster-1.596
 tort-1.440
adr-1.408
 dere-1.406
 treadmill-1.397
Token-
Token ID12
Feature activation+3.70e-02

Pos logit contributions
roma+0.121
+0.103
 centimeters+0.100
opez+0.099
 Braun+0.099
Neg logit contributions
buster-0.112
 tort-0.101
adr-0.099
 dere-0.098
 treadmill-0.098
Tokenthrough
Token ID9579
Feature activation-4.50e-01

Pos logit contributions
buster+0.162
 tort+0.145
adr+0.142
 dere+0.141
 treadmill+0.140
Neg logit contributions
roma-0.187
-0.159
 centimeters-0.156
 Braun-0.154
opez-0.154
Token and
Token ID290
Feature activation+2.62e-02

Pos logit contributions
roma+2.122
+1.788
 centimeters+1.754
opez+1.719
 Braun+1.712
Neg logit contributions
buster-2.103
 tort-1.905
adr-1.866
 dere-1.865
 treadmill-1.852
Token bid
Token ID8406
Feature activation+2.08e-01

Pos logit contributions
buster+0.117
 tort+0.107
 dere+0.104
adr+0.104
 treadmill+0.103
Neg logit contributions
roma-0.129
-0.109
 centimeters-0.106
opez-0.106
 Aden-0.104
Tokenirection
Token ID4154
Feature activation-4.33e-02

Pos logit contributions
buster+1.176
 tort+1.085
adr+1.068
 dere+1.062
 treadmill+1.062
Neg logit contributions
roma-0.778
-0.620
 centimeters-0.605
opez-0.604
 Braun-0.597
Tokenal
Token ID282
Feature activation-4.11e-02

Pos logit contributions
roma+0.174
+0.142
 centimeters+0.138
opez+0.135
 Braun+0.135
Neg logit contributions
buster-0.232
 tort-0.214
 dere-0.210
adr-0.210
 treadmill-0.210
Token payment
Token ID6074
Feature activation-2.23e-01

Pos logit contributions
roma+0.202
+0.171
 centimeters+0.165
opez+0.165
 Aden+0.163
Neg logit contributions
buster-0.185
 tort-0.169
adr-0.165
 dere-0.164
 treadmill-0.163
Token channels
Token ID9619
Feature activation+8.45e-02

Pos logit contributions
roma+1.064
+0.897
 centimeters+0.879
opez+0.866
 Aden+0.860
Neg logit contributions
buster-1.031
 tort-0.935
adr-0.915
 dere-0.913
 treadmill-0.909
Token anxiety
Token ID9751
Feature activation+2.24e-01

Pos logit contributions
buster+1.609
 tort+1.483
 dere+1.455
 treadmill+1.425
adr+1.413
Neg logit contributions
roma-2.085
-1.763
 centimeters-1.733
opez-1.704
 Braun-1.704
Token because
Token ID780
Feature activation+1.23e-01

Pos logit contributions
buster+0.994
 tort+0.909
 dere+0.882
adr+0.873
 treadmill+0.873
Neg logit contributions
roma-1.114
-0.932
 centimeters-0.914
opez-0.900
 Aden-0.898
Token will
Token ID481
Feature activation+1.56e-01

Pos logit contributions
buster+0.587
 tort+0.537
 dere+0.526
adr+0.522
 treadmill+0.521
Neg logit contributions
roma-0.571
-0.477
 centimeters-0.468
opez-0.459
 Braun-0.456
Token you
Token ID345
Feature activation-3.35e-01

Pos logit contributions
buster+0.735
 tort+0.671
 dere+0.658
adr+0.651
 treadmill+0.650
Neg logit contributions
roma-0.733
-0.609
 centimeters-0.599
opez-0.587
 Aden-0.585
Token be
Token ID307
Feature activation+1.06e-02

Pos logit contributions
roma+1.627
+1.382
 centimeters+1.356
 Braun+1.328
opez+1.328
Neg logit contributions
buster-1.506
 tort-1.354
adr-1.338
 dere-1.319
 treadmill-1.317
Token able
Token ID1498
Feature activation-2.30e-01

Pos logit contributions
buster+0.039
 tort+0.034
adr+0.033
 dere+0.033
 treadmill+0.033
Neg logit contributions
roma-0.061
-0.053
 centimeters-0.052
opez-0.051
 Aden-0.051
Token to
Token ID284
Feature activation-2.98e-01

Pos logit contributions
roma+0.930
+0.762
 centimeters+0.742
opez+0.726
 Braun+0.722
Neg logit contributions
buster-1.229
 tort-1.127
adr-1.108
 dere-1.106
 treadmill-1.100
Token transport
Token ID4839
Feature activation+7.50e-02

Pos logit contributions
roma+1.453
+1.236
 centimeters+1.216
opez+1.193
 Braun+1.189
Neg logit contributions
buster-1.333
 tort-1.193
adr-1.176
 dere-1.166
 treadmill-1.157
Token it
Token ID340
Feature activation-1.96e-01

Pos logit contributions
buster+0.351
 tort+0.320
 dere+0.313
adr+0.312
 treadmill+0.311
Neg logit contributions
roma-0.354
-0.296
 centimeters-0.290
opez-0.285
 Aden-0.284
Token?
Token ID30
Feature activation-8.18e-02

Pos logit contributions
roma+0.873
+0.728
 centimeters+0.712
opez+0.698
 Aden+0.694
Neg logit contributions
buster-0.967
 tort-0.882
adr-0.864
 dere-0.864
 treadmill-0.858
Token There
Token ID1318
Feature activation-3.71e-02

Pos logit contributions
roma+0.412
+0.349
 centimeters+0.344
opez+0.339
 Aden+0.334
Neg logit contributions
buster-0.358
 tort-0.322
adr-0.315
 dere-0.314
 treadmill-0.311
Token
Token ID447
Feature activation-3.29e-01

Pos logit contributions
roma+2.599
+2.187
 centimeters+2.148
 Aden+2.101
 Braun+2.100
Neg logit contributions
buster-2.561
 tort-2.324
adr-2.278
 dere-2.277
 treadmill-2.267
Token
Token ID247
Feature activation-2.14e-01

Pos logit contributions
roma+1.595
+1.348
 centimeters+1.324
opez+1.303
 Aden+1.296
Neg logit contributions
buster-1.497
 tort-1.352
adr-1.324
 dere-1.320
 treadmill-1.310
Token.
Token ID13
Feature activation-2.48e-01

Pos logit contributions
roma+1.002
+0.842
 centimeters+0.824
opez+0.814
 Aden+0.806
Neg logit contributions
buster-1.016
 tort-0.916
adr-0.896
 dere-0.895
 treadmill-0.888
Token From
Token ID3574
Feature activation-3.29e-01

Pos logit contributions
roma+1.195
+1.010
 centimeters+0.990
opez+0.974
 Aden+0.968
Neg logit contributions
buster-1.133
 tort-1.024
adr-1.002
 dere-1.001
 treadmill-0.994
Token the
Token ID262
Feature activation-2.26e-01

Pos logit contributions
roma+1.548
+1.305
 centimeters+1.279
opez+1.254
 Braun+1.249
Neg logit contributions
buster-1.542
 tort-1.398
adr-1.371
 dere-1.368
 treadmill-1.360
Token Kodi
Token ID44129
Feature activation-5.19e-01

Pos logit contributions
roma+1.096
+0.931
 centimeters+0.910
opez+0.899
 Aden+0.891
Neg logit contributions
buster-1.025
 tort-0.924
 dere-0.906
adr-0.902
 treadmill-0.898
Token home
Token ID1363
Feature activation+8.30e-02

Pos logit contributions
roma+2.609
+2.227
 centimeters+2.190
opez+2.148
 Aden+2.141
Neg logit contributions
buster-2.243
 tort-2.025
adr-1.985
 dere-1.972
 treadmill-1.960
Token screen
Token ID3159
Feature activation+2.96e-01

Pos logit contributions
buster+0.318
 tort+0.281
 dere+0.273
adr+0.271
 treadmill+0.270
Neg logit contributions
roma-0.462
-0.402
 centimeters-0.393
 Aden-0.390
opez-0.390
Token,
Token ID11
Feature activation-7.64e-02

Pos logit contributions
buster+1.242
 tort+1.103
 dere+1.078
 treadmill+1.069
adr+1.053
Neg logit contributions
roma-1.532
-1.321
opez-1.285
 centimeters-1.281
 Aden-1.276
Token select
Token ID2922
Feature activation+2.87e-02

Pos logit contributions
roma+0.365
+0.309
 centimeters+0.301
opez+0.298
 Aden+0.295
Neg logit contributions
buster-0.353
 tort-0.320
 dere-0.313
 treadmill-0.311
adr-0.310
Token Add
Token ID3060
Feature activation-7.89e-03

Pos logit contributions
buster+0.139
 tort+0.126
adr+0.124
 dere+0.123
 treadmill+0.122
Neg logit contributions
roma-0.131
-0.109
 centimeters-0.107
opez-0.106
 Braun-0.104
Token after
Token ID706
Feature activation-1.14e-03

Pos logit contributions
roma+1.335
+1.112
 centimeters+1.090
opez+1.066
 Aden+1.063
Neg logit contributions
buster-1.480
 tort-1.348
adr-1.324
 dere-1.323
 treadmill-1.314
Token a
Token ID257
Feature activation+9.01e-02

Pos logit contributions
roma+0.006
+0.005
 centimeters+0.005
opez+0.005
 Aden+0.005
Neg logit contributions
buster-0.005
 tort-0.005
adr-0.005
 dere-0.005
 treadmill-0.004
Token movie
Token ID3807
Feature activation+1.60e-01

Pos logit contributions
buster+0.394
 tort+0.355
 dere+0.348
 treadmill+0.344
adr+0.344
Neg logit contributions
roma-0.452
-0.387
 centimeters-0.379
opez-0.374
 Aden-0.370
Token's
Token ID338
Feature activation+1.17e-01

Pos logit contributions
buster+0.738
 tort+0.666
 dere+0.652
adr+0.647
 treadmill+0.646
Neg logit contributions
roma-0.765
-0.650
 centimeters-0.634
opez-0.626
 Braun-0.619
Token DVD
Token ID12490
Feature activation+2.45e-01

Pos logit contributions
buster+0.461
 tort+0.409
 dere+0.400
 treadmill+0.395
adr+0.394
Neg logit contributions
roma-0.636
-0.553
 centimeters-0.540
opez-0.535
 Braun-0.530
Token release
Token ID2650
Feature activation+5.37e-03

Pos logit contributions
buster+0.971
 tort+0.848
 dere+0.829
 treadmill+0.819
adr+0.813
Neg logit contributions
roma-1.328
-1.163
 centimeters-1.142
opez-1.132
 Braun-1.117
Token to
Token ID284
Feature activation+1.65e-01

Pos logit contributions
buster+0.027
 tort+0.024
 dere+0.024
adr+0.024
 treadmill+0.024
Neg logit contributions
roma-0.024
-0.020
 centimeters-0.019
opez-0.019
 Braun-0.019
Token show
Token ID905
Feature activation-7.51e-02

Pos logit contributions
buster+0.685
 tort+0.612
 dere+0.597
adr+0.594
 treadmill+0.593
Neg logit contributions
roma-0.865
-0.745
 centimeters-0.728
opez-0.719
 Aden-0.713
Token up
Token ID510
Feature activation+4.01e-01

Pos logit contributions
roma+0.363
+0.307
 centimeters+0.301
opez+0.296
 Aden+0.294
Neg logit contributions
buster-0.342
 tort-0.309
 dere-0.303
adr-0.303
 treadmill-0.301
Token,
Token ID11
Feature activation+1.46e-01

Pos logit contributions
buster+1.860
 tort+1.686
 dere+1.667
 treadmill+1.624
adr+1.621
Neg logit contributions
roma-1.904
-1.625
 centimeters-1.550
opez-1.547
 Braun-1.519
Token and
Token ID290
Feature activation+1.06e-01

Pos logit contributions
buster+0.725
 tort+0.664
 dere+0.650
 treadmill+0.644
adr+0.643
Neg logit contributions
roma-0.649
-0.542
 centimeters-0.526
opez-0.520
 Aden-0.513
TokenAnd
Token ID1870
Feature activation-2.94e-01

Pos logit contributions
buster+0.650
 tort+0.578
adr+0.569
 dere+0.563
ETF+0.560
Neg logit contributions
roma-0.832
-0.711
 centimeters-0.704
opez-0.692
 Aden-0.689
Token then
Token ID788
Feature activation-6.28e-01

Pos logit contributions
roma+1.375
+1.157
 centimeters+1.136
opez+1.113
 Braun+1.110
Neg logit contributions
buster-1.382
 tort-1.250
adr-1.228
 dere-1.224
 treadmill-1.217
Token
Token ID3926
Feature activation-1.60e-01

Pos logit contributions
roma+2.974
+2.525
 centimeters+2.488
 Braun+2.453
 Butter+2.449
Neg logit contributions
buster-2.874
 tort-2.559
adr-2.550
ETF-2.527
 dere-2.512
Token we
Token ID356
Feature activation+1.10e-01

Pos logit contributions
roma+0.757
+0.638
 centimeters+0.623
opez+0.614
 Aden+0.610
Neg logit contributions
buster-0.743
 tort-0.675
 dere-0.660
adr-0.659
 treadmill-0.655
Token
Token ID447
Feature activation+7.41e-02

Pos logit contributions
buster+0.469
 tort+0.423
 dere+0.413
adr+0.411
 treadmill+0.409
Neg logit contributions
roma-0.565
-0.483
 centimeters-0.473
opez-0.468
 Aden-0.465
Token
Token ID247
Feature activation-9.92e-02

Pos logit contributions
buster+0.324
 tort+0.293
adr+0.287
 dere+0.285
ETF+0.281
Neg logit contributions
roma-0.367
-0.314
 centimeters-0.307
opez-0.304
 Aden-0.302
Tokenre
Token ID260
Feature activation-1.76e-01

Pos logit contributions
roma+0.486
+0.412
 centimeters+0.404
opez+0.398
 Aden+0.396
Neg logit contributions
buster-0.445
 tort-0.402
adr-0.393
 dere-0.393
 treadmill-0.390
Token in
Token ID287
Feature activation+2.38e-01

Pos logit contributions
roma+0.874
+0.742
 centimeters+0.732
opez+0.716
 Braun+0.715
Neg logit contributions
buster-0.775
 tort-0.695
adr-0.682
 dere-0.678
 treadmill-0.676
Token.
Token ID13
Feature activation+2.14e-01

Pos logit contributions
buster+1.130
 tort+1.042
 dere+1.022
adr+1.012
 treadmill+1.002
Neg logit contributions
roma-1.105
-0.923
 centimeters-0.896
opez-0.891
 Aden-0.875
Token And
Token ID843
Feature activation-1.37e-02

Pos logit contributions
buster+0.974
 tort+0.878
adr+0.863
 dere+0.860
ETF+0.846
Neg logit contributions
roma-1.036
-0.869
 centimeters-0.852
opez-0.849
 Aden-0.825
Token it
Token ID340
Feature activation-2.78e-01

Pos logit contributions
roma+0.065
+0.055
 centimeters+0.053
opez+0.052
 Aden+0.052
Neg logit contributions
buster-0.064
 tort-0.058
 dere-0.057
adr-0.057
 treadmill-0.057
Token the
Token ID262
Feature activation-1.62e-01

Pos logit contributions
buster+0.359
 tort+0.332
 dere+0.325
adr+0.322
 treadmill+0.321
Neg logit contributions
roma-0.337
-0.278
 centimeters-0.271
opez-0.266
 Braun-0.265
Token so
Token ID523
Feature activation-4.69e-01

Pos logit contributions
roma+0.808
+0.686
 centimeters+0.673
opez+0.661
 Braun+0.658
Neg logit contributions
buster-0.713
 tort-0.643
 dere-0.629
adr-0.628
 treadmill-0.623
Token-
Token ID12
Feature activation+4.12e-01

Pos logit contributions
roma+2.001
+1.709
 centimeters+1.654
opez+1.615
 Aden+1.611
Neg logit contributions
buster-2.358
 tort-2.135
adr-2.093
 treadmill-2.092
 dere-2.083
Tokencalled
Token ID7174
Feature activation-2.78e-02

Pos logit contributions
buster+1.943
 tort+1.755
adr+1.729
 dere+1.718
ETF+1.701
Neg logit contributions
roma-1.922
-1.616
 centimeters-1.589
opez-1.564
 Braun-1.554
Token part
Token ID636
Feature activation-1.95e-01

Pos logit contributions
roma+0.137
+0.116
 centimeters+0.114
opez+0.112
 Braun+0.111
Neg logit contributions
buster-0.124
 tort-0.113
 dere-0.110
adr-0.110
 treadmill-0.109
Token 1
Token ID352
Feature activation-8.16e-01

Pos logit contributions
roma+0.927
+0.780
 centimeters+0.764
opez+0.750
 Aden+0.746
Neg logit contributions
buster-0.905
 tort-0.818
adr-0.802
 dere-0.801
 treadmill-0.796
Token of
Token ID286
Feature activation-1.54e-01

Pos logit contributions
roma+4.101
+3.528
 centimeters+3.462
 Aden+3.396
opez+3.395
Neg logit contributions
buster-3.470
 tort-3.082
adr-3.034
 dere-3.013
 treadmill-3.010
Token the
Token ID262
Feature activation-1.22e-01

Pos logit contributions
roma+0.665
+0.544
 centimeters+0.532
opez+0.522
 Braun+0.518
Neg logit contributions
buster-0.784
 tort-0.722
 dere-0.707
adr-0.705
 treadmill-0.700
Token public
Token ID1171
Feature activation+3.51e-02

Pos logit contributions
roma+0.597
+0.503
 centimeters+0.493
opez+0.485
 Braun+0.482
Neg logit contributions
buster-0.554
 tort-0.502
 dere-0.491
adr-0.491
 treadmill-0.486
Token service
Token ID2139
Feature activation-2.88e-02

Pos logit contributions
buster+0.134
 tort+0.120
 dere+0.117
adr+0.116
 treadmill+0.115
Neg logit contributions
roma-0.196
-0.169
 centimeters-0.166
opez-0.164
 Aden-0.163
Token,
Token ID11
Feature activation+2.00e-01

Pos logit contributions
roma+0.143
+0.120
 centimeters+0.118
opez+0.116
 Aden+0.115
Neg logit contributions
buster-0.129
 tort-0.117
 dere-0.114
adr-0.114
 treadmill-0.113
Token But
Token ID887
Feature activation-1.71e-02

Pos logit contributions
buster+0.491
 tort+0.440
adr+0.437
ETF+0.433
 dere+0.432
Neg logit contributions
roma-0.503
-0.417
 centimeters-0.410
opez-0.408
 Aden-0.392
Token after
Token ID706
Feature activation+1.28e-01

Pos logit contributions
roma+0.078
+0.065
 centimeters+0.063
opez+0.063
 Aden+0.062
Neg logit contributions
buster-0.082
 tort-0.075
 dere-0.074
 treadmill-0.073
adr-0.073
Token Benjamin
Token ID14533
Feature activation-8.52e-02

Pos logit contributions
buster+0.594
 tort+0.549
 dere+0.535
 treadmill+0.527
adr+0.526
Neg logit contributions
roma-0.607
-0.513
 centimeters-0.494
opez-0.492
 Aden-0.481
Token Netanyahu
Token ID17104
Feature activation+1.11e-01

Pos logit contributions
roma+0.220
+0.158
 centimeters+0.151
opez+0.147
 Aden+0.138
Neg logit contributions
buster-0.577
 tort-0.544
adr-0.534
 dere-0.534
 treadmill-0.531
Token ruled
Token ID8879
Feature activation+6.67e-02

Pos logit contributions
buster+0.531
 tort+0.489
 dere+0.478
adr+0.474
 treadmill+0.469
Neg logit contributions
roma-0.515
-0.428
 centimeters-0.419
opez-0.416
 Aden-0.407
Token out
Token ID503
Feature activation-9.31e-01

Pos logit contributions
buster+0.318
 tort+0.293
 dere+0.289
adr+0.282
 treadmill+0.282
Neg logit contributions
roma-0.313
-0.255
 centimeters-0.250
opez-0.249
 Braun-0.240
Token a
Token ID257
Feature activation-2.60e-01

Pos logit contributions
roma+4.339
+3.709
 centimeters+3.672
 Aden+3.662
 Braun+3.647
Neg logit contributions
buster-4.224
adr-3.770
 tort-3.722
 treadmill-3.697
ETF-3.682
Token Palestinian
Token ID8501
Feature activation-5.87e-02

Pos logit contributions
roma+1.232
+1.037
 centimeters+1.014
opez+1.002
 Aden+0.986
Neg logit contributions
buster-1.213
 tort-1.103
 dere-1.080
adr-1.073
 treadmill-1.069
Token state
Token ID1181
Feature activation-1.04e-01

Pos logit contributions
roma+0.338
+0.294
 centimeters+0.289
opez+0.286
 Aden+0.283
Neg logit contributions
buster-0.214
 tort-0.189
adr-0.183
 dere-0.183
 treadmill-0.182
Token as
Token ID355
Feature activation-3.72e-01

Pos logit contributions
roma+0.502
+0.421
 centimeters+0.414
opez+0.408
 Aden+0.403
Neg logit contributions
buster-0.474
 tort-0.429
 dere-0.419
adr-0.419
 treadmill-0.415
Token he
Token ID339
Feature activation-2.85e-01

Pos logit contributions
roma+1.766
+1.489
 centimeters+1.457
opez+1.434
 Braun+1.422
Neg logit contributions
buster-1.728
 tort-1.566
 dere-1.533
adr-1.532
 treadmill-1.521
TokenT
Token ID51
Feature activation-4.12e-01

Pos logit contributions
roma+5.117
+4.610
 Aden+4.390
 centimeters+4.383
 Butter+4.379
Neg logit contributions
buster-3.566
 treadmill-3.238
 tort-3.192
 dere-3.185
adr-3.123
Token Pick
Token ID12346
Feature activation-2.46e-01

Pos logit contributions
roma+1.496
+1.196
 centimeters+1.168
 Butter+1.159
 Braun+1.154
Neg logit contributions
buster-2.331
 tort-2.154
adr-2.136
 treadmill-2.121
 dere-2.108
Token Stephen
Token ID7970
Feature activation-1.56e-01

Pos logit contributions
roma+1.225
+1.042
 centimeters+1.022
opez+1.005
 Aden+1.000
Neg logit contributions
buster-1.087
 tort-0.980
adr-0.958
 dere-0.957
 treadmill-0.950
Token East
Token ID3687
Feature activation-2.72e-01

Pos logit contributions
roma+0.724
+0.606
 centimeters+0.593
opez+0.582
 Aden+0.576
Neg logit contributions
buster-0.747
 tort-0.678
adr-0.663
 dere-0.663
 treadmill-0.657
Tokenon
Token ID261
Feature activation-2.90e-01

Pos logit contributions
roma+1.134
+0.930
 centimeters+0.910
opez+0.889
 Braun+0.882
Neg logit contributions
buster-1.420
 tort-1.300
adr-1.277
 dere-1.276
 treadmill-1.266
Token PA
Token ID8147
Feature activation-7.77e-01

Pos logit contributions
roma+1.296
+1.080
 centimeters+1.057
opez+1.036
 Braun+1.030
Neg logit contributions
buster-1.424
 tort-1.297
adr-1.271
 dere-1.270
 treadmill-1.262
Token 1
Token ID352
Feature activation-3.50e-01

Pos logit contributions
roma+3.970
+3.432
 centimeters+3.356
 Butter+3.298
opez+3.297
Neg logit contributions
buster-3.265
 tort-2.900
adr-2.858
 dere-2.852
 treadmill-2.826
Token hour
Token ID1711
Feature activation-4.50e-01

Pos logit contributions
roma+2.050
+1.805
 centimeters+1.777
opez+1.743
 Butter+1.735
Neg logit contributions
buster-1.221
 tort-1.060
adr-1.038
 dere-1.032
 treadmill-1.022
Token ago
Token ID2084
Feature activation-2.63e-01

Pos logit contributions
roma+2.331
+1.996
 centimeters+1.968
opez+1.943
 Aden+1.940
Neg logit contributions
buster-1.855
 tort-1.671
 treadmill-1.637
adr-1.635
 dere-1.634
Token I
Token ID314
Feature activation+2.91e-02

Pos logit contributions
roma+1.226
+1.030
 centimeters+1.009
opez+0.991
 Aden+0.986
Neg logit contributions
buster-1.240
 tort-1.125
adr-1.102
 dere-1.101
 treadmill-1.094
Token think
Token ID892
Feature activation-1.30e-01

Pos logit contributions
buster+0.124
 tort+0.112
 dere+0.109
adr+0.109
 treadmill+0.108
Neg logit contributions
roma-0.149
-0.128
 centimeters-0.125
opez-0.123
 Aden-0.122
Token!
Token ID0
Feature activation-5.30e-01

Pos logit contributions
roma+2.674
+2.245
 centimeters+2.234
 Braun+2.164
 Aden+2.155
Neg logit contributions
buster-3.029
adr-2.728
 tort-2.710
 treadmill-2.690
ETF-2.680
Token
Token ID447
Feature activation-5.22e-01

Pos logit contributions
roma+2.556
+2.185
 centimeters+2.138
 Aden+2.100
 Braun+2.098
Neg logit contributions
buster-2.389
 tort-2.159
adr-2.119
 treadmill-2.114
 dere-2.112
Token
Token ID251
Feature activation-4.25e-01

Pos logit contributions
roma+2.181
+1.804
 centimeters+1.754
 Braun+1.715
opez+1.714
Neg logit contributions
buster-2.704
 tort-2.479
adr-2.433
 dere-2.432
 treadmill-2.423
Token He
Token ID679
Feature activation-4.75e-02

Pos logit contributions
roma+1.982
+1.668
 centimeters+1.634
opez+1.604
 Aden+1.596
Neg logit contributions
buster-2.004
 tort-1.819
adr-1.782
 dere-1.780
 treadmill-1.769
Token stepped
Token ID10764
Feature activation-1.32e-01

Pos logit contributions
roma+0.193
+0.157
 centimeters+0.153
opez+0.151
 Aden+0.149
Neg logit contributions
buster-0.253
 tort-0.235
 dere-0.229
adr-0.229
 treadmill-0.226
Token off
Token ID572
Feature activation-8.81e-01

Pos logit contributions
roma+0.624
+0.526
 centimeters+0.515
opez+0.506
 Aden+0.503
Neg logit contributions
buster-0.619
 tort-0.562
adr-0.550
 dere-0.550
 treadmill-0.546
Token the
Token ID262
Feature activation-1.18e-01

Pos logit contributions
roma+3.395
+2.772
 centimeters+2.718
 Aden+2.661
 Braun+2.658
Neg logit contributions
buster-4.776
 tort-4.379
adr-4.325
 dere-4.277
ETF-4.273
Token platform
Token ID3859
Feature activation+2.21e-01

Pos logit contributions
roma+0.672
+0.585
 centimeters+0.576
opez+0.567
 Aden+0.566
Neg logit contributions
buster-0.434
 tort-0.381
adr-0.373
 dere-0.370
 treadmill-0.367
Token and
Token ID290
Feature activation-6.30e-01

Pos logit contributions
buster+0.987
 tort+0.898
 dere+0.878
 treadmill+0.868
adr+0.868
Neg logit contributions
roma-1.090
-0.919
opez-0.894
 centimeters-0.890
 Aden-0.881
Token walked
Token ID6807
Feature activation-4.11e-01

Pos logit contributions
roma+2.572
+2.129
 centimeters+2.094
 Braun+2.047
 Aden+2.041
Neg logit contributions
buster-3.303
 tort-3.001
adr-2.974
 dere-2.950
 treadmill-2.939
Token up
Token ID510
Feature activation-6.40e-01

Pos logit contributions
roma+1.802
+1.504
 centimeters+1.489
 Aden+1.451
 Braun+1.449
Neg logit contributions
buster-2.037
 tort-1.839
adr-1.825
 dere-1.802
ETF-1.799
Tokenlan
Token ID9620
Feature activation+1.26e-01

Pos logit contributions
roma+0.963
+0.697
 centimeters+0.667
opez+0.639
 Braun+0.635
Neg logit contributions
buster-2.434
 tort-2.280
adr-2.250
 dere-2.242
 treadmill-2.227
Token came
Token ID1625
Feature activation-1.40e-01

Pos logit contributions
buster+0.574
 tort+0.525
 dere+0.510
adr+0.509
 treadmill+0.502
Neg logit contributions
roma-0.608
-0.511
 centimeters-0.503
opez-0.495
 Aden-0.489
Token under
Token ID739
Feature activation-3.94e-01

Pos logit contributions
roma+0.741
+0.637
 centimeters+0.623
opez+0.616
 Aden+0.612
Neg logit contributions
buster-0.574
 tort-0.512
 dere-0.501
adr-0.499
 treadmill-0.494
Token fire
Token ID2046
Feature activation-1.72e-01

Pos logit contributions
roma+1.958
+1.668
 centimeters+1.639
opez+1.609
 Aden+1.605
Neg logit contributions
buster-1.731
 tort-1.549
adr-1.527
 dere-1.519
 treadmill-1.509
Token earlier
Token ID2961
Feature activation+1.27e-02

Pos logit contributions
roma+0.826
+0.699
 centimeters+0.685
opez+0.673
 Aden+0.670
Neg logit contributions
buster-0.785
 tort-0.710
adr-0.695
 dere-0.694
 treadmill-0.690
Token this
Token ID428
Feature activation-9.32e-01

Pos logit contributions
buster+0.062
 tort+0.057
 dere+0.055
 treadmill+0.055
adr+0.055
Neg logit contributions
roma-0.058
-0.048
 centimeters-0.047
opez-0.046
 Aden-0.046
Token month
Token ID1227
Feature activation-2.14e-01

Pos logit contributions
roma+4.709
+4.076
 centimeters+3.987
 Braun+3.955
 Aden+3.907
Neg logit contributions
buster-3.915
 tort-3.421
adr-3.404
 dere-3.368
ETF-3.357
Token after
Token ID706
Feature activation+3.99e-01

Pos logit contributions
roma+1.026
+0.867
 centimeters+0.850
opez+0.835
 Aden+0.831
Neg logit contributions
buster-0.981
 tort-0.887
adr-0.868
 dere-0.868
 treadmill-0.862
Token using
Token ID1262
Feature activation-2.76e-01

Pos logit contributions
buster+1.747
 tort+1.618
 dere+1.573
 treadmill+1.545
adr+1.541
Neg logit contributions
roma-1.971
-1.668
 centimeters-1.627
opez-1.617
 Aden-1.583
Token prof
Token ID1534
Feature activation-2.08e-01

Pos logit contributions
roma+1.282
+1.076
 centimeters+1.054
opez+1.035
 Aden+1.029
Neg logit contributions
buster-1.313
 tort-1.193
adr-1.168
 dere-1.167
 treadmill-1.160
Tokenanity
Token ID19689
Feature activation+1.27e-01

Pos logit contributions
roma+0.768
+0.609
 centimeters+0.596
opez+0.578
 Braun+0.576
Neg logit contributions
buster-1.188
 tort-1.099
adr-1.077
 dere-1.076
 treadmill-1.067