TOP ACTIVATIONS
MAX = 0.514

App Brain ] Pixel pipe Advertisement
Richard Painter , the chief White House
after a model of Pont iac GT O sold from 1969
press office of Romero J uca , an influential senator .
from coaches and eagle - eye views of the games ,
box of our old Pont iac Bon ne ville . Sally
sanctions regime , Painter said . That
Senate violated perjury laws . Painter also said Ross must rec
complete abolition of all arm aments , and of nuclear weapons
United States to supply arm aments and troop deployments to NATO
Y oru ichi - san said that
. She left with Y oru ichi - san .
my here IM M IG RANT food from a good Christian
it ! Ever since Y oru ichi - sama showed up
is pretty much secondary stream fodder if the second stream does
the B DP + Bott leneck Buffer Size point ( green
of a person is Y oru ichi - san ? People
Christian , hired this sod om ite ?
s a sod om ite !
thanks @ c per c iva ). 3 .

MIN ACTIVATIONS
MIN = -0.465

( ve get arian pl atter ) consisting of s arson
op games of rock - paper - sc issors in a
Great Again # In aug uration 2017
today 's crash will inaug urate a bear market like the
. But Bur rows s sterile style
the Morris Museum in Mor rist own , N . J
have not yet discussed Bur rows s art work
Brazil on Monday will inaug urate a $ 900 million port
Cub ans ( bl acks and mul att os ),
for J oon a " nat u " Le pp ä
ous , le S é nat Imp é rial , le
à cher cher le conf lit a vec nos an ci
they have a delicate sm attering of light brown " f
if ($ Use Current Wall paper As Source ) { $
drilling plans . <|endoftext|> Good bye , Ghosts ! is an
adopting a neo - A rist ot el ian approach which
Cry om ancer vs Chaos Charm Sc ourge of
that aren 't comm ens urate with their past performances or
have sn agged Tay Z onday of Youtube fame to narr
year 's election . Ronnie Dug ger wrote the definitive warning

INTERVAL 0.269 - 0.514
CONTAINS 0.124%

The first lady 's outgoing public hug went to School
South Australia , an Emer itus Professor of Economics at Fl
. Dr D yer said that rule learning was
On Facebook , Flynn linked to the CNN article
Bolton only semi - cle ared Anton Ferdinand 's deep cross

INTERVAL 0.024 - 0.269
CONTAINS 28.221%

- aligned countries must give priority to the abolition of nuclear
ans , the only H OPE scholar in the legislature at
UCK G ). The church was started in 1977 by Ed
say they received notices from Tinder saying the were unable to
young participants of protests . One # We ve

INTERVAL -0.220 - 0.024
CONTAINS 71.322%

match for the Gods themselves . O lymp us
anxiety and , of course , a drinking problem . That
who wishes to transition their gender socially and / or physically
They re tourists spending money . I
rary to the South s reputation as not being

INTERVAL -0.465 - -0.220
CONTAINS 0.332%

nickel defense . Off ensive ly , the Soon
CO 2 content as the air on the surface at the
sticks . And finally , as for admins ,
- im perial ist sc ound rel Vac lav Ha vel
your driver = driver - type .
TokenApp
Token ID4677
Feature activation-2.03e-01

Pos logit contributions
iblical+0.035
uca+0.034
leneck+0.033
 Painter+0.032
esan+0.030
Neg logit contributions
EMS-0.056
 Mehran-0.051
 accuracy-0.049
papers-0.048
":{"-0.047
TokenBrain
Token ID44687
Feature activation+2.37e-01

Pos logit contributions
EMS+0.201
 Mehran+0.183
papers+0.177
 accuracy+0.176
":{"+0.172
Neg logit contributions
iblical-0.092
uca-0.092
leneck-0.087
 Painter-0.085
esan-0.076
Token]
Token ID60
Feature activation+1.70e-02

Pos logit contributions
iblical+0.171
uca+0.171
leneck+0.165
 Painter+0.163
esan+0.153
Neg logit contributions
EMS-0.170
 Mehran-0.150
papers-0.141
 accuracy-0.140
":{"-0.136
Token
Token ID198
Feature activation+2.00e-02

Pos logit contributions
iblical+0.012
uca+0.011
leneck+0.011
 Painter+0.011
esan+0.010
Neg logit contributions
EMS-0.013
 Mehran-0.012
papers-0.011
 accuracy-0.011
":{"-0.011
Token
Token ID198
Feature activation+4.16e-03

Pos logit contributions
iblical+0.014
uca+0.013
leneck+0.013
 Painter+0.013
esan+0.012
Neg logit contributions
EMS-0.015
 Mehran-0.014
papers-0.013
 accuracy-0.013
":{"-0.013
TokenPixel
Token ID40809
Feature activation+5.14e-01

Pos logit contributions
iblical+0.003
uca+0.003
leneck+0.003
 Painter+0.003
esan+0.003
Neg logit contributions
EMS-0.003
 Mehran-0.003
 accuracy-0.002
papers-0.002
 Stability-0.002
Tokenpipe
Token ID34360
Feature activation-2.66e-02

Pos logit contributions
iblical+0.373
uca+0.372
leneck+0.359
 Painter+0.358
esan+0.333
Neg logit contributions
EMS-0.366
 Mehran-0.324
papers-0.304
 accuracy-0.302
":{"-0.292
Token
Token ID198
Feature activation-1.89e-02

Pos logit contributions
EMS+0.021
 Mehran+0.018
papers+0.017
 accuracy+0.017
":{"+0.017
Neg logit contributions
iblical-0.018
uca-0.018
leneck-0.017
 Painter-0.017
esan-0.016
Token
Token ID198
Feature activation-1.89e-02

Pos logit contributions
EMS+0.014
 Mehran+0.013
papers+0.012
 accuracy+0.012
":{"+0.012
Neg logit contributions
iblical-0.013
uca-0.013
leneck-0.012
 Painter-0.012
esan-0.011
TokenAdvertisement
Token ID4723
Feature activation-2.98e-02

Pos logit contributions
EMS+0.013
 Mehran+0.011
 accuracy+0.011
papers+0.011
 Stability+0.010
Neg logit contributions
iblical-0.014
uca-0.014
leneck-0.014
 Painter-0.013
esan-0.013
Token
Token ID198
Feature activation+3.63e-02

Pos logit contributions
EMS+0.023
 Mehran+0.020
papers+0.019
 accuracy+0.019
":{"+0.019
Neg logit contributions
iblical-0.020
uca-0.020
leneck-0.020
 Painter-0.019
esan-0.018
Token
Token ID447
Feature activation+1.80e-03

Pos logit contributions
iblical+0.004
uca+0.004
leneck+0.003
 Painter+0.003
esan+0.003
Neg logit contributions
EMS-0.004
 Mehran-0.003
papers-0.003
 accuracy-0.003
":{"-0.003
Token
Token ID251
Feature activation-1.50e-02

Pos logit contributions
iblical+0.001
uca+0.001
leneck+0.001
 Painter+0.001
esan+0.001
Neg logit contributions
EMS-0.001
 Mehran-0.001
papers-0.001
 accuracy-0.001
":{"-0.001
Token
Token ID198
Feature activation+4.07e-02

Pos logit contributions
EMS+0.012
 Mehran+0.010
papers+0.010
 accuracy+0.010
":{"+0.009
Neg logit contributions
iblical-0.010
uca-0.010
leneck-0.010
 Painter-0.010
esan-0.009
Token
Token ID198
Feature activation+3.10e-02

Pos logit contributions
iblical+0.028
uca+0.027
leneck+0.026
 Painter+0.026
esan+0.024
Neg logit contributions
EMS-0.031
 Mehran-0.028
papers-0.026
 accuracy-0.026
":{"-0.025
TokenRichard
Token ID22245
Feature activation-4.87e-02

Pos logit contributions
iblical+0.021
uca+0.021
leneck+0.021
 Painter+0.020
esan+0.019
Neg logit contributions
EMS-0.023
 Mehran-0.021
papers-0.019
 accuracy-0.019
 Stability-0.019
Token Painter
Token ID49102
Feature activation+4.99e-01

Pos logit contributions
EMS+0.037
 Mehran+0.033
papers+0.031
 accuracy+0.031
":{"+0.030
Neg logit contributions
iblical-0.033
uca-0.033
leneck-0.032
 Painter-0.031
esan-0.029
Token,
Token ID11
Feature activation+1.60e-02

Pos logit contributions
iblical+0.355
uca+0.355
 Painter+0.342
leneck+0.341
esan+0.318
Neg logit contributions
EMS-0.363
 Mehran-0.319
papers-0.303
 accuracy-0.300
":{"-0.290
Token the
Token ID262
Feature activation+2.10e-02

Pos logit contributions
iblical+0.012
uca+0.012
 Painter+0.012
leneck+0.012
esan+0.011
Neg logit contributions
EMS-0.011
 Mehran-0.009
papers-0.009
 accuracy-0.009
":{"-0.009
Token chief
Token ID4039
Feature activation+2.70e-02

Pos logit contributions
iblical+0.015
uca+0.015
leneck+0.015
 Painter+0.015
esan+0.014
Neg logit contributions
EMS-0.015
 Mehran-0.013
papers-0.013
 accuracy-0.012
":{"-0.012
Token White
Token ID2635
Feature activation-1.45e-02

Pos logit contributions
iblical+0.023
uca+0.023
leneck+0.022
 Painter+0.022
esan+0.021
Neg logit contributions
EMS-0.016
 Mehran-0.014
papers-0.013
 accuracy-0.013
":{"-0.012
Token House
Token ID2097
Feature activation+9.62e-02

Pos logit contributions
EMS+0.012
 Mehran+0.011
papers+0.010
 accuracy+0.010
":{"+0.010
Neg logit contributions
iblical-0.009
uca-0.009
leneck-0.009
 Painter-0.008
esan-0.008
Token after
Token ID706
Feature activation+5.86e-03

Pos logit contributions
iblical+0.021
uca+0.021
leneck+0.020
 Painter+0.020
esan+0.019
Neg logit contributions
EMS-0.021
 Mehran-0.019
papers-0.018
 accuracy-0.018
":{"-0.017
Token a
Token ID257
Feature activation+1.36e-02

Pos logit contributions
iblical+0.005
uca+0.005
leneck+0.004
 Painter+0.004
esan+0.004
Neg logit contributions
EMS-0.004
 Mehran-0.003
papers-0.003
 accuracy-0.003
":{"-0.003
Token model
Token ID2746
Feature activation+4.76e-02

Pos logit contributions
iblical+0.010
uca+0.010
leneck+0.010
 Painter+0.010
esan+0.009
Neg logit contributions
EMS-0.009
 Mehran-0.008
papers-0.008
 accuracy-0.008
":{"-0.008
Token of
Token ID286
Feature activation-2.22e-02

Pos logit contributions
iblical+0.036
uca+0.036
leneck+0.034
 Painter+0.034
esan+0.032
Neg logit contributions
EMS-0.033
 Mehran-0.029
papers-0.027
 accuracy-0.027
":{"-0.026
Token Pont
Token ID26345
Feature activation+1.55e-01

Pos logit contributions
EMS+0.015
 Mehran+0.013
papers+0.012
 accuracy+0.012
":{"+0.012
Neg logit contributions
iblical-0.017
uca-0.017
leneck-0.016
 Painter-0.016
esan-0.015
Tokeniac
Token ID9607
Feature activation+4.97e-01

Pos logit contributions
iblical+0.064
uca+0.064
leneck+0.060
 Painter+0.059
esan+0.052
Neg logit contributions
EMS-0.158
 Mehran-0.145
papers-0.140
 accuracy-0.139
":{"-0.136
Token GT
Token ID7963
Feature activation-1.20e-01

Pos logit contributions
iblical+0.368
uca+0.368
leneck+0.355
 Painter+0.353
esan+0.331
Neg logit contributions
EMS-0.347
 Mehran-0.304
papers-0.289
 accuracy-0.284
":{"-0.275
TokenO
Token ID46
Feature activation+1.70e-03

Pos logit contributions
EMS+0.084
 Mehran+0.074
papers+0.070
 accuracy+0.069
":{"+0.067
Neg logit contributions
iblical-0.089
uca-0.089
leneck-0.086
 Painter-0.085
esan-0.080
Token sold
Token ID2702
Feature activation+3.42e-02

Pos logit contributions
iblical+0.001
uca+0.001
leneck+0.001
 Painter+0.001
esan+0.001
Neg logit contributions
EMS-0.001
 Mehran-0.001
papers-0.001
 accuracy-0.001
":{"-0.001
Token from
Token ID422
Feature activation-6.59e-03

Pos logit contributions
iblical+0.025
uca+0.025
leneck+0.024
 Painter+0.024
esan+0.023
Neg logit contributions
EMS-0.024
 Mehran-0.021
papers-0.020
 accuracy-0.020
":{"-0.019
Token 1969
Token ID16450
Feature activation+2.73e-02

Pos logit contributions
EMS+0.004
 Mehran+0.004
papers+0.004
 accuracy+0.004
":{"+0.003
Neg logit contributions
iblical-0.005
uca-0.005
leneck-0.005
 Painter-0.005
esan-0.005
Token press
Token ID1803
Feature activation-5.73e-02

Pos logit contributions
iblical+0.006
uca+0.006
leneck+0.006
 Painter+0.006
esan+0.006
Neg logit contributions
EMS-0.006
 Mehran-0.005
papers-0.005
 accuracy-0.005
":{"-0.004
Token office
Token ID2607
Feature activation+6.71e-02

Pos logit contributions
EMS+0.036
 Mehran+0.032
papers+0.030
 accuracy+0.029
 Stability+0.028
Neg logit contributions
iblical-0.046
uca-0.046
leneck-0.044
 Painter-0.044
esan-0.042
Token of
Token ID286
Feature activation-8.20e-03

Pos logit contributions
iblical+0.050
uca+0.050
leneck+0.048
 Painter+0.048
esan+0.045
Neg logit contributions
EMS-0.047
 Mehran-0.041
papers-0.039
 accuracy-0.038
":{"-0.037
Token Romero
Token ID38623
Feature activation+6.07e-02

Pos logit contributions
EMS+0.005
 Mehran+0.005
papers+0.004
 accuracy+0.004
":{"+0.004
Neg logit contributions
uca-0.006
iblical-0.006
leneck-0.006
 Painter-0.006
esan-0.006
Token J
Token ID449
Feature activation-1.63e-03

Pos logit contributions
iblical+0.048
uca+0.048
leneck+0.046
 Painter+0.046
esan+0.043
Neg logit contributions
EMS-0.040
 Mehran-0.034
papers-0.032
 accuracy-0.032
":{"-0.031
Tokenuca
Token ID43120
Feature activation+4.77e-01

Pos logit contributions
EMS+0.001
 Mehran+0.001
papers+0.001
 accuracy+0.001
":{"+0.001
Neg logit contributions
iblical-0.001
uca-0.001
leneck-0.001
 Painter-0.001
esan-0.001
Token,
Token ID11
Feature activation+1.83e-02

Pos logit contributions
uca+0.364
iblical+0.361
leneck+0.347
 Painter+0.347
esan+0.328
Neg logit contributions
EMS-0.326
 Mehran-0.283
papers-0.269
 accuracy-0.265
 Stability-0.254
Token an
Token ID281
Feature activation-1.70e-02

Pos logit contributions
iblical+0.014
uca+0.014
leneck+0.013
 Painter+0.013
esan+0.012
Neg logit contributions
EMS-0.013
 Mehran-0.011
papers-0.010
 accuracy-0.010
":{"-0.010
Token influential
Token ID14212
Feature activation-1.66e-03

Pos logit contributions
EMS+0.011
 Mehran+0.010
papers+0.009
 accuracy+0.009
":{"+0.009
Neg logit contributions
iblical-0.013
uca-0.013
leneck-0.013
 Painter-0.012
esan-0.012
Token senator
Token ID12329
Feature activation-8.61e-02

Pos logit contributions
EMS+0.001
 Mehran+0.001
papers+0.001
 accuracy+0.001
":{"+0.001
Neg logit contributions
iblical-0.001
uca-0.001
leneck-0.001
 Painter-0.001
esan-0.001
Token.
Token ID13
Feature activation-3.07e-03

Pos logit contributions
EMS+0.058
 Mehran+0.051
papers+0.048
 accuracy+0.047
":{"+0.046
Neg logit contributions
iblical-0.066
uca-0.066
leneck-0.064
 Painter-0.063
esan-0.059
Token from
Token ID422
Feature activation-5.89e-02

Pos logit contributions
iblical+0.122
uca+0.121
leneck+0.117
 Painter+0.116
esan+0.109
Neg logit contributions
EMS-0.118
 Mehran-0.104
papers-0.098
 accuracy-0.097
":{"-0.094
Token coaches
Token ID11070
Feature activation+4.60e-02

Pos logit contributions
EMS+0.041
 Mehran+0.036
papers+0.034
 accuracy+0.033
":{"+0.032
Neg logit contributions
iblical-0.044
uca-0.044
leneck-0.042
 Painter-0.042
esan-0.039
Token and
Token ID290
Feature activation+3.45e-03

Pos logit contributions
uca+0.033
iblical+0.033
leneck+0.032
 Painter+0.032
esan+0.030
Neg logit contributions
EMS-0.033
 Mehran-0.029
papers-0.028
 accuracy-0.027
":{"-0.026
Token eagle
Token ID31176
Feature activation-1.06e-01

Pos logit contributions
iblical+0.002
uca+0.002
leneck+0.002
 Painter+0.002
esan+0.002
Neg logit contributions
EMS-0.002
 Mehran-0.002
papers-0.002
 accuracy-0.002
":{"-0.002
Token-
Token ID12
Feature activation+4.49e-03

Pos logit contributions
EMS+0.076
 Mehran+0.067
papers+0.064
 accuracy+0.063
":{"+0.061
Neg logit contributions
iblical-0.077
uca-0.077
leneck-0.074
 Painter-0.073
esan-0.069
Tokeneye
Token ID25379
Feature activation+4.64e-01

Pos logit contributions
iblical+0.003
uca+0.003
leneck+0.003
 Painter+0.003
esan+0.002
Neg logit contributions
EMS-0.004
 Mehran-0.003
 accuracy-0.003
papers-0.003
 Stability-0.003
Token views
Token ID5009
Feature activation-4.58e-02

Pos logit contributions
iblical+0.343
uca+0.341
leneck+0.328
 Painter+0.326
esan+0.306
Neg logit contributions
EMS-0.325
 Mehran-0.286
papers-0.269
 accuracy-0.263
 Stability-0.257
Token of
Token ID286
Feature activation-2.41e-02

Pos logit contributions
EMS+0.032
 Mehran+0.028
papers+0.026
 accuracy+0.026
 Stability+0.025
Neg logit contributions
iblical-0.034
uca-0.034
leneck-0.033
 Painter-0.033
esan-0.031
Token the
Token ID262
Feature activation-1.30e-02

Pos logit contributions
EMS+0.017
 Mehran+0.015
papers+0.014
 accuracy+0.014
":{"+0.013
Neg logit contributions
iblical-0.018
uca-0.018
leneck-0.017
 Painter-0.017
esan-0.016
Token games
Token ID1830
Feature activation+5.81e-02

Pos logit contributions
EMS+0.009
 Mehran+0.008
papers+0.008
 accuracy+0.008
":{"+0.008
Neg logit contributions
iblical-0.009
uca-0.009
leneck-0.009
 Painter-0.009
esan-0.008
Token,
Token ID11
Feature activation-7.19e-04

Pos logit contributions
uca+0.043
iblical+0.043
leneck+0.042
 Painter+0.041
esan+0.039
Neg logit contributions
EMS-0.040
 Mehran-0.035
papers-0.033
 accuracy-0.033
":{"-0.032
Tokenbox
Token ID3524
Feature activation-1.73e-01

Pos logit contributions
EMS+0.047
 Mehran+0.042
papers+0.040
 accuracy+0.040
":{"+0.039
Neg logit contributions
iblical-0.036
uca-0.036
leneck-0.035
 Painter-0.034
esan-0.032
Token of
Token ID286
Feature activation-1.92e-02

Pos logit contributions
EMS+0.120
 Mehran+0.105
papers+0.099
 accuracy+0.098
":{"+0.095
Neg logit contributions
iblical-0.129
uca-0.129
leneck-0.125
 Painter-0.123
esan-0.116
Token our
Token ID674
Feature activation-8.59e-03

Pos logit contributions
EMS+0.013
 Mehran+0.011
papers+0.010
 accuracy+0.010
":{"+0.010
Neg logit contributions
iblical-0.015
uca-0.015
leneck-0.014
 Painter-0.014
esan-0.013
Token old
Token ID1468
Feature activation+5.46e-02

Pos logit contributions
EMS+0.006
 Mehran+0.005
papers+0.005
 accuracy+0.005
":{"+0.005
Neg logit contributions
iblical-0.006
uca-0.006
leneck-0.006
 Painter-0.006
esan-0.006
Token Pont
Token ID26345
Feature activation+1.07e-01

Pos logit contributions
iblical+0.041
uca+0.040
leneck+0.039
 Painter+0.039
esan+0.036
Neg logit contributions
EMS-0.038
 Mehran-0.033
papers-0.032
 accuracy-0.031
":{"-0.030
Tokeniac
Token ID9607
Feature activation+4.59e-01

Pos logit contributions
iblical+0.032
uca+0.032
leneck+0.029
 Painter+0.028
esan+0.024
Neg logit contributions
EMS-0.121
 Mehran-0.112
papers-0.109
 accuracy-0.108
":{"-0.106
Token Bon
Token ID7979
Feature activation+1.48e-02

Pos logit contributions
uca+0.354
iblical+0.353
leneck+0.342
 Painter+0.340
esan+0.320
Neg logit contributions
EMS-0.306
 Mehran-0.266
papers-0.254
 accuracy-0.248
":{"-0.239
Tokenne
Token ID710
Feature activation+3.28e-02

Pos logit contributions
iblical+0.009
uca+0.009
leneck+0.009
 Painter+0.009
esan+0.008
Neg logit contributions
EMS-0.012
 Mehran-0.011
papers-0.010
 accuracy-0.010
":{"-0.010
Tokenville
Token ID4244
Feature activation+7.95e-03

Pos logit contributions
iblical+0.022
uca+0.022
leneck+0.022
 Painter+0.021
esan+0.020
Neg logit contributions
EMS-0.025
 Mehran-0.022
papers-0.021
 accuracy-0.021
":{"-0.020
Token.
Token ID13
Feature activation-3.88e-02

Pos logit contributions
iblical+0.006
uca+0.006
leneck+0.006
 Painter+0.006
esan+0.005
Neg logit contributions
EMS-0.005
 Mehran-0.005
papers-0.004
 accuracy-0.004
":{"-0.004
Token Sally
Token ID25737
Feature activation-1.83e-01

Pos logit contributions
EMS+0.028
 Mehran+0.025
papers+0.024
 accuracy+0.023
":{"+0.023
Neg logit contributions
iblical-0.028
uca-0.028
leneck-0.027
 Painter-0.026
esan-0.025
Token sanctions
Token ID9388
Feature activation+1.70e-02

Pos logit contributions
EMS+0.059
 Mehran+0.051
papers+0.049
 accuracy+0.048
":{"+0.046
Neg logit contributions
iblical-0.060
uca-0.060
 Painter-0.058
leneck-0.058
esan-0.054
Token regime
Token ID7142
Feature activation-3.94e-02

Pos logit contributions
iblical+0.013
uca+0.013
leneck+0.013
 Painter+0.013
esan+0.012
Neg logit contributions
EMS-0.011
 Mehran-0.010
papers-0.009
 accuracy-0.009
":{"-0.009
Token,
Token ID11
Feature activation+3.43e-02

Pos logit contributions
EMS+0.025
 Mehran+0.022
papers+0.020
 accuracy+0.020
":{"+0.019
Neg logit contributions
iblical-0.032
uca-0.032
leneck-0.031
 Painter-0.030
esan-0.029
Token
Token ID447
Feature activation+2.56e-03

Pos logit contributions
iblical+0.025
uca+0.025
leneck+0.024
 Painter+0.024
esan+0.022
Neg logit contributions
EMS-0.025
 Mehran-0.022
papers-0.021
 accuracy-0.020
":{"-0.020
Token
Token ID251
Feature activation-2.76e-02

Pos logit contributions
iblical+0.002
uca+0.002
leneck+0.002
 Painter+0.002
esan+0.002
Neg logit contributions
EMS-0.002
 Mehran-0.002
papers-0.002
 accuracy-0.002
":{"-0.001
Token Painter
Token ID49102
Feature activation+4.57e-01

Pos logit contributions
EMS+0.022
 Mehran+0.019
papers+0.018
 accuracy+0.018
":{"+0.018
Neg logit contributions
iblical-0.018
uca-0.018
 Painter-0.017
leneck-0.017
esan-0.016
Token said
Token ID531
Feature activation+2.56e-02

Pos logit contributions
iblical+0.272
uca+0.271
leneck+0.260
 Painter+0.259
esan+0.238
Neg logit contributions
EMS-0.385
 Mehran-0.347
papers-0.331
 accuracy-0.328
":{"-0.319
Token.
Token ID13
Feature activation-3.53e-02

Pos logit contributions
iblical+0.021
uca+0.021
leneck+0.020
 Painter+0.020
esan+0.019
Neg logit contributions
EMS-0.016
 Mehran-0.014
papers-0.013
 accuracy-0.013
":{"-0.012
Token
Token ID564
Feature activation-1.45e-02

Pos logit contributions
EMS+0.027
 Mehran+0.024
papers+0.023
 accuracy+0.023
":{"+0.022
Neg logit contributions
iblical-0.024
uca-0.023
leneck-0.023
 Painter-0.022
esan-0.021
Token
Token ID250
Feature activation-1.49e-03

Pos logit contributions
EMS+0.010
 Mehran+0.009
papers+0.009
 accuracy+0.009
":{"+0.008
Neg logit contributions
iblical-0.010
uca-0.010
leneck-0.010
 Painter-0.010
esan-0.009
TokenThat
Token ID2504
Feature activation-6.37e-03

Pos logit contributions
EMS+0.001
 Mehran+0.001
papers+0.001
 accuracy+0.001
":{"+0.001
Neg logit contributions
iblical-0.001
uca-0.001
leneck-0.001
 Painter-0.001
esan-0.001
Token Senate
Token ID3845
Feature activation+6.73e-02

Pos logit contributions
EMS+0.000
 Mehran+0.000
papers+0.000
 accuracy+0.000
":{"+0.000
Neg logit contributions
iblical-0.000
uca-0.000
 Painter-0.000
leneck-0.000
esan-0.000
Token violated
Token ID13998
Feature activation-7.93e-02

Pos logit contributions
iblical+0.049
uca+0.049
 Painter+0.048
leneck+0.047
esan+0.044
Neg logit contributions
EMS-0.048
 Mehran-0.042
papers-0.039
 accuracy-0.039
":{"-0.038
Token perjury
Token ID47816
Feature activation-3.98e-02

Pos logit contributions
EMS+0.054
 Mehran+0.047
papers+0.045
 accuracy+0.044
":{"+0.043
Neg logit contributions
iblical-0.060
uca-0.060
 Painter-0.058
leneck-0.057
esan-0.053
Token laws
Token ID3657
Feature activation-5.54e-02

Pos logit contributions
EMS+0.027
 Mehran+0.024
papers+0.022
 accuracy+0.022
":{"+0.021
Neg logit contributions
iblical-0.030
uca-0.030
leneck-0.029
 Painter-0.029
esan-0.027
Token.
Token ID13
Feature activation+1.23e-03

Pos logit contributions
EMS+0.036
 Mehran+0.031
papers+0.029
 accuracy+0.029
":{"+0.028
Neg logit contributions
iblical-0.044
uca-0.044
leneck-0.042
 Painter-0.042
esan-0.040
Token Painter
Token ID49102
Feature activation+4.54e-01

Pos logit contributions
iblical+0.001
uca+0.001
leneck+0.001
 Painter+0.001
esan+0.001
Neg logit contributions
EMS-0.001
 Mehran-0.001
papers-0.001
 accuracy-0.001
":{"-0.001
Token also
Token ID635
Feature activation+2.28e-02

Pos logit contributions
iblical+0.293
uca+0.293
leneck+0.281
 Painter+0.279
esan+0.259
Neg logit contributions
EMS-0.361
 Mehran-0.322
papers-0.307
 accuracy-0.304
":{"-0.295
Token said
Token ID531
Feature activation+1.73e-02

Pos logit contributions
iblical+0.014
uca+0.014
leneck+0.013
 Painter+0.013
esan+0.012
Neg logit contributions
EMS-0.019
 Mehran-0.017
papers-0.016
 accuracy-0.016
":{"-0.016
Token Ross
Token ID9847
Feature activation+2.67e-02

Pos logit contributions
iblical+0.013
uca+0.013
 Painter+0.013
leneck+0.013
esan+0.012
Neg logit contributions
EMS-0.012
 Mehran-0.010
papers-0.010
 accuracy-0.010
":{"-0.009
Token must
Token ID1276
Feature activation-5.43e-02

Pos logit contributions
iblical+0.019
uca+0.019
leneck+0.019
 Painter+0.018
esan+0.017
Neg logit contributions
EMS-0.019
 Mehran-0.017
papers-0.016
 accuracy-0.016
":{"-0.015
Token rec
Token ID664
Feature activation+1.24e-01

Pos logit contributions
EMS+0.038
 Mehran+0.033
papers+0.031
 accuracy+0.031
":{"+0.030
Neg logit contributions
iblical-0.041
uca-0.040
leneck-0.039
 Painter-0.039
esan-0.036
Token complete
Token ID1844
Feature activation-1.32e-01

Pos logit contributions
EMS+0.002
 Mehran+0.002
papers+0.002
 accuracy+0.002
":{"+0.002
Neg logit contributions
iblical-0.003
uca-0.003
leneck-0.003
 Painter-0.002
esan-0.002
Token abolition
Token ID30480
Feature activation-4.79e-03

Pos logit contributions
EMS+0.084
 Mehran+0.072
papers+0.068
 accuracy+0.066
":{"+0.064
Neg logit contributions
iblical-0.106
uca-0.106
leneck-0.103
 Painter-0.101
esan-0.096
Token of
Token ID286
Feature activation+4.38e-02

Pos logit contributions
EMS+0.003
 Mehran+0.003
papers+0.003
 accuracy+0.003
":{"+0.003
Neg logit contributions
iblical-0.004
uca-0.004
leneck-0.003
 Painter-0.003
esan-0.003
Token all
Token ID477
Feature activation+1.20e-02

Pos logit contributions
iblical+0.033
uca+0.033
leneck+0.032
 Painter+0.032
esan+0.030
Neg logit contributions
EMS-0.030
 Mehran-0.026
papers-0.025
 accuracy-0.024
":{"-0.024
Token arm
Token ID3211
Feature activation-2.53e-02

Pos logit contributions
iblical+0.009
uca+0.009
leneck+0.009
 Painter+0.009
esan+0.008
Neg logit contributions
EMS-0.008
 Mehran-0.007
papers-0.007
 accuracy-0.007
":{"-0.007
Tokenaments
Token ID12604
Feature activation+4.53e-01

Pos logit contributions
EMS+0.029
 Mehran+0.027
papers+0.026
 accuracy+0.026
":{"+0.026
Neg logit contributions
iblical-0.007
uca-0.007
leneck-0.006
 Painter-0.006
esan-0.005
Token,
Token ID11
Feature activation-9.11e-03

Pos logit contributions
iblical+0.339
uca+0.337
leneck+0.326
 Painter+0.323
esan+0.304
Neg logit contributions
EMS-0.314
 Mehran-0.276
papers-0.261
 accuracy-0.256
":{"-0.248
Token and
Token ID290
Feature activation+2.34e-03

Pos logit contributions
EMS+0.006
 Mehran+0.006
papers+0.005
 accuracy+0.005
":{"+0.005
Neg logit contributions
iblical-0.007
uca-0.007
leneck-0.006
 Painter-0.006
esan-0.006
Token of
Token ID286
Feature activation-1.05e-02

Pos logit contributions
iblical+0.002
uca+0.002
leneck+0.002
 Painter+0.002
esan+0.002
Neg logit contributions
EMS-0.002
 Mehran-0.001
papers-0.001
 accuracy-0.001
":{"-0.001
Token nuclear
Token ID4523
Feature activation-4.59e-02

Pos logit contributions
EMS+0.007
 Mehran+0.006
papers+0.006
 accuracy+0.006
":{"+0.006
Neg logit contributions
iblical-0.008
uca-0.008
leneck-0.008
 Painter-0.007
esan-0.007
Token weapons
Token ID3777
Feature activation+1.00e-01

Pos logit contributions
EMS+0.032
 Mehran+0.028
papers+0.027
 accuracy+0.026
":{"+0.026
Neg logit contributions
iblical-0.034
uca-0.034
leneck-0.033
 Painter-0.032
esan-0.030
Token United
Token ID1578
Feature activation-1.91e-02

Pos logit contributions
EMS+0.010
 Mehran+0.009
papers+0.009
 accuracy+0.009
":{"+0.008
Neg logit contributions
iblical-0.011
uca-0.011
leneck-0.011
 Painter-0.011
esan-0.010
Token States
Token ID1829
Feature activation-1.45e-01

Pos logit contributions
EMS+0.012
 Mehran+0.010
papers+0.010
 accuracy+0.010
":{"+0.009
Neg logit contributions
iblical-0.015
uca-0.015
leneck-0.015
 Painter-0.015
esan-0.014
Token to
Token ID284
Feature activation+1.53e-02

Pos logit contributions
EMS+0.099
 Mehran+0.087
papers+0.082
 accuracy+0.081
":{"+0.078
Neg logit contributions
iblical-0.110
uca-0.110
leneck-0.106
 Painter-0.105
esan-0.099
Token supply
Token ID5127
Feature activation-3.69e-04

Pos logit contributions
iblical+0.012
uca+0.011
leneck+0.011
 Painter+0.011
esan+0.010
Neg logit contributions
EMS-0.011
 Mehran-0.009
papers-0.009
 accuracy-0.009
":{"-0.008
Token arm
Token ID3211
Feature activation-7.78e-03

Pos logit contributions
EMS+0.000
 Mehran+0.000
papers+0.000
 accuracy+0.000
":{"+0.000
Neg logit contributions
iblical-0.000
uca-0.000
leneck-0.000
 Painter-0.000
esan-0.000
Tokenaments
Token ID12604
Feature activation+4.52e-01

Pos logit contributions
EMS+0.009
 Mehran+0.008
papers+0.008
 accuracy+0.008
":{"+0.008
Neg logit contributions
iblical-0.002
uca-0.002
leneck-0.002
 Painter-0.002
esan-0.002
Token and
Token ID290
Feature activation-1.68e-02

Pos logit contributions
iblical+0.333
uca+0.332
leneck+0.322
 Painter+0.318
esan+0.298
Neg logit contributions
EMS-0.317
 Mehran-0.279
papers-0.264
 accuracy-0.260
":{"-0.252
Token troop
Token ID31874
Feature activation-5.20e-02

Pos logit contributions
EMS+0.011
 Mehran+0.010
papers+0.009
 accuracy+0.009
":{"+0.009
Neg logit contributions
iblical-0.013
uca-0.013
leneck-0.012
 Painter-0.012
esan-0.012
Token deployments
Token ID40854
Feature activation-3.73e-02

Pos logit contributions
EMS+0.035
 Mehran+0.031
papers+0.029
 accuracy+0.028
":{"+0.028
Neg logit contributions
iblical-0.040
uca-0.040
leneck-0.038
 Painter-0.038
esan-0.036
Token to
Token ID284
Feature activation+7.28e-03

Pos logit contributions
EMS+0.026
 Mehran+0.023
papers+0.022
 accuracy+0.021
":{"+0.021
Neg logit contributions
iblical-0.028
uca-0.028
leneck-0.027
 Painter-0.026
esan-0.025
Token NATO
Token ID11332
Feature activation-3.84e-02

Pos logit contributions
iblical+0.006
uca+0.006
leneck+0.006
 Painter+0.005
esan+0.005
Neg logit contributions
EMS-0.005
 Mehran-0.004
papers-0.004
 accuracy-0.004
":{"-0.004
Token
Token ID447
Feature activation+5.31e-03

Pos logit contributions
iblical+0.032
uca+0.032
leneck+0.031
 Painter+0.031
esan+0.029
Neg logit contributions
EMS-0.029
 Mehran-0.025
papers-0.024
 accuracy-0.023
":{"-0.022
Token
Token ID251
Feature activation-4.88e-02

Pos logit contributions
iblical+0.004
uca+0.004
leneck+0.004
 Painter+0.004
esan+0.004
Neg logit contributions
EMS-0.003
 Mehran-0.003
papers-0.003
 accuracy-0.003
":{"-0.003
Token
Token ID564
Feature activation+1.03e-02

Pos logit contributions
EMS+0.034
 Mehran+0.030
papers+0.028
 accuracy+0.028
":{"+0.027
Neg logit contributions
uca-0.036
iblical-0.036
leneck-0.035
 Painter-0.035
esan-0.033
Token
Token ID250
Feature activation+1.69e-02

Pos logit contributions
iblical+0.007
uca+0.007
leneck+0.007
 Painter+0.007
esan+0.007
Neg logit contributions
EMS-0.007
 Mehran-0.007
papers-0.006
 accuracy-0.006
":{"-0.006
TokenY
Token ID56
Feature activation-3.45e-02

Pos logit contributions
iblical+0.010
uca+0.010
leneck+0.010
 Painter+0.010
esan+0.009
Neg logit contributions
EMS-0.014
 Mehran-0.012
papers-0.012
 accuracy-0.012
 Stability-0.011
Tokenoru
Token ID27786
Feature activation+4.47e-01

Pos logit contributions
EMS+0.027
 Mehran+0.024
papers+0.023
 accuracy+0.023
 Stability+0.022
Neg logit contributions
uca-0.022
iblical-0.022
leneck-0.021
 Painter-0.021
esan-0.020
Tokenichi
Token ID16590
Feature activation+1.64e-01

Pos logit contributions
iblical+0.256
uca+0.255
leneck+0.243
 Painter+0.240
esan+0.222
Neg logit contributions
EMS-0.387
 Mehran-0.350
papers-0.335
 accuracy-0.332
 Stability-0.323
Token-
Token ID12
Feature activation+1.73e-02

Pos logit contributions
uca+0.115
iblical+0.115
leneck+0.111
 Painter+0.110
esan+0.104
Neg logit contributions
EMS-0.121
 Mehran-0.108
papers-0.102
 accuracy-0.101
 Stability-0.097
Tokensan
Token ID12807
Feature activation-1.59e-01

Pos logit contributions
uca+0.015
iblical+0.015
leneck+0.014
 Painter+0.014
esan+0.013
Neg logit contributions
EMS-0.010
 Mehran-0.009
papers-0.008
 accuracy-0.008
 Stability-0.008
Token said
Token ID531
Feature activation+4.97e-02

Pos logit contributions
EMS+0.120
 Mehran+0.106
papers+0.101
 accuracy+0.100
 Stability+0.097
Neg logit contributions
iblical-0.109
uca-0.109
leneck-0.105
 Painter-0.103
esan-0.097
Token that
Token ID326
Feature activation+2.42e-03

Pos logit contributions
uca+0.037
iblical+0.036
leneck+0.035
 Painter+0.035
esan+0.033
Neg logit contributions
EMS-0.035
 Mehran-0.031
papers-0.029
 accuracy-0.029
 Stability-0.028
Token.
Token ID13
Feature activation-3.54e-02

Pos logit contributions
uca+0.145
iblical+0.145
leneck+0.139
 Painter+0.139
esan+0.130
Neg logit contributions
EMS-0.143
 Mehran-0.126
papers-0.119
 accuracy-0.118
 Stability-0.114
Token She
Token ID1375
Feature activation-1.16e-02

Pos logit contributions
EMS+0.026
 Mehran+0.023
papers+0.021
 accuracy+0.021
":{"+0.020
Neg logit contributions
iblical-0.025
uca-0.025
leneck-0.024
 Painter-0.024
esan-0.023
Token left
Token ID1364
Feature activation+1.95e-03

Pos logit contributions
EMS+0.008
 Mehran+0.007
papers+0.007
 accuracy+0.007
":{"+0.007
Neg logit contributions
iblical-0.008
uca-0.008
leneck-0.008
 Painter-0.008
esan-0.007
Token with
Token ID351
Feature activation-4.75e-02

Pos logit contributions
uca+0.001
iblical+0.001
leneck+0.001
 Painter+0.001
esan+0.001
Neg logit contributions
EMS-0.001
 Mehran-0.001
papers-0.001
 accuracy-0.001
":{"-0.001
Token Y
Token ID575
Feature activation-4.61e-02

Pos logit contributions
EMS+0.033
 Mehran+0.028
papers+0.027
 accuracy+0.027
 Stability+0.026
Neg logit contributions
uca-0.036
iblical-0.035
leneck-0.034
 Painter-0.034
esan-0.032
Tokenoru
Token ID27786
Feature activation+4.45e-01

Pos logit contributions
EMS+0.038
 Mehran+0.034
papers+0.032
 accuracy+0.032
 Stability+0.031
Neg logit contributions
uca-0.029
iblical-0.029
leneck-0.027
 Painter-0.027
esan-0.025
Tokenichi
Token ID16590
Feature activation+1.62e-01

Pos logit contributions
iblical+0.260
uca+0.259
leneck+0.248
 Painter+0.245
esan+0.226
Neg logit contributions
EMS-0.380
 Mehran-0.343
papers-0.328
 accuracy-0.326
":{"-0.317
Token-
Token ID12
Feature activation+1.24e-03

Pos logit contributions
uca+0.120
iblical+0.120
leneck+0.116
 Painter+0.115
esan+0.109
Neg logit contributions
EMS-0.112
 Mehran-0.099
papers-0.094
 accuracy-0.093
":{"-0.089
Tokensan
Token ID12807
Feature activation-1.63e-01

Pos logit contributions
iblical+0.001
uca+0.001
leneck+0.001
 Painter+0.001
esan+0.001
Neg logit contributions
EMS-0.001
 Mehran-0.001
papers-0.001
 accuracy-0.001
 Stability-0.001
Token.
Token ID13
Feature activation-7.84e-03

Pos logit contributions
EMS+0.112
 Mehran+0.099
papers+0.093
 accuracy+0.092
 Stability+0.089
Neg logit contributions
iblical-0.122
uca-0.122
leneck-0.117
 Painter-0.116
esan-0.110
Token
Token ID447
Feature activation+5.23e-02

Pos logit contributions
EMS+0.006
 Mehran+0.005
papers+0.005
 accuracy+0.005
":{"+0.005
Neg logit contributions
iblical-0.006
uca-0.006
leneck-0.005
 Painter-0.005
esan-0.005
Token my
Token ID616
Feature activation-6.11e-02

Pos logit contributions
EMS+0.064
 Mehran+0.056
papers+0.053
 accuracy+0.052
":{"+0.051
Neg logit contributions
iblical-0.069
uca-0.069
leneck-0.067
 Painter-0.066
esan-0.062
Token here
Token ID994
Feature activation+1.37e-02

Pos logit contributions
EMS+0.042
 Mehran+0.037
papers+0.035
 accuracy+0.034
 Stability+0.033
Neg logit contributions
iblical-0.046
uca-0.046
leneck-0.044
 Painter-0.044
esan-0.041
Token IM
Token ID8959
Feature activation-8.80e-02

Pos logit contributions
iblical+0.010
uca+0.010
leneck+0.010
 Painter+0.010
esan+0.009
Neg logit contributions
EMS-0.010
 Mehran-0.008
papers-0.008
 accuracy-0.008
":{"-0.008
TokenM
Token ID44
Feature activation-2.99e-02

Pos logit contributions
EMS+0.056
 Mehran+0.050
papers+0.047
 accuracy+0.046
 Stability+0.044
Neg logit contributions
iblical-0.070
uca-0.070
leneck-0.067
 Painter-0.067
esan-0.063
TokenIG
Token ID3528
Feature activation+7.85e-02

Pos logit contributions
EMS+0.021
 Mehran+0.018
papers+0.017
 accuracy+0.017
":{"+0.017
Neg logit contributions
iblical-0.022
uca-0.022
leneck-0.021
 Painter-0.021
esan-0.020
TokenRANT
Token ID32506
Feature activation+4.42e-01

Pos logit contributions
iblical+0.053
uca+0.052
leneck+0.051
 Painter+0.050
esan+0.047
Neg logit contributions
EMS-0.060
 Mehran-0.054
papers-0.051
 accuracy-0.051
":{"-0.049
Token food
Token ID2057
Feature activation+1.40e-01

Pos logit contributions
iblical+0.349
uca+0.348
leneck+0.336
 Painter+0.333
esan+0.316
Neg logit contributions
EMS-0.286
 Mehran-0.250
papers-0.234
 accuracy-0.231
":{"-0.224
Token from
Token ID422
Feature activation+2.11e-02

Pos logit contributions
iblical+0.108
uca+0.107
leneck+0.104
 Painter+0.102
esan+0.097
Neg logit contributions
EMS-0.093
 Mehran-0.082
papers-0.077
 accuracy-0.076
 Stability-0.073
Token a
Token ID257
Feature activation+1.57e-02

Pos logit contributions
iblical+0.016
uca+0.016
leneck+0.015
 Painter+0.015
esan+0.014
Neg logit contributions
EMS-0.015
 Mehran-0.013
papers-0.012
 accuracy-0.012
":{"-0.012
Token good
Token ID922
Feature activation-2.76e-02

Pos logit contributions
iblical+0.012
uca+0.012
leneck+0.011
 Painter+0.011
esan+0.011
Neg logit contributions
EMS-0.011
 Mehran-0.010
papers-0.009
 accuracy-0.009
 Stability-0.009
Token Christian
Token ID4302
Feature activation-2.70e-02

Pos logit contributions
EMS+0.019
 Mehran+0.017
papers+0.016
 accuracy+0.015
":{"+0.015
Neg logit contributions
iblical-0.021
uca-0.021
leneck-0.020
 Painter-0.020
esan-0.019
Tokenit
Token ID270
Feature activation+5.33e-02

Pos logit contributions
iblical+0.027
uca+0.027
leneck+0.026
 Painter+0.025
esan+0.024
Neg logit contributions
EMS-0.027
 Mehran-0.024
papers-0.023
 accuracy-0.022
 Stability-0.022
Token!
Token ID0
Feature activation+2.00e-02

Pos logit contributions
iblical+0.038
uca+0.038
leneck+0.037
 Painter+0.036
esan+0.034
Neg logit contributions
EMS-0.039
 Mehran-0.034
papers-0.032
 accuracy-0.032
 Stability-0.031
Token Ever
Token ID10776
Feature activation-7.10e-02

Pos logit contributions
iblical+0.015
uca+0.015
leneck+0.014
 Painter+0.014
esan+0.013
Neg logit contributions
EMS-0.014
 Mehran-0.013
papers-0.012
 accuracy-0.012
":{"-0.011
Token since
Token ID1201
Feature activation-3.06e-03

Pos logit contributions
EMS+0.054
 Mehran+0.048
papers+0.046
 accuracy+0.045
 Stability+0.044
Neg logit contributions
iblical-0.048
uca-0.048
leneck-0.046
 Painter-0.045
esan-0.043
Token Y
Token ID575
Feature activation-3.92e-02

Pos logit contributions
EMS+0.002
 Mehran+0.002
papers+0.002
 accuracy+0.002
 Stability+0.002
Neg logit contributions
uca-0.002
iblical-0.002
 Painter-0.002
leneck-0.002
esan-0.002
Tokenoru
Token ID27786
Feature activation+4.38e-01

Pos logit contributions
EMS+0.051
 Mehran+0.048
papers+0.047
 accuracy+0.047
 Stability+0.046
Neg logit contributions
uca-0.005
iblical-0.005
leneck-0.004
 Painter-0.004
esan-0.002
Tokenichi
Token ID16590
Feature activation+1.35e-01

Pos logit contributions
iblical+0.254
uca+0.252
leneck+0.242
 Painter+0.238
esan+0.220
Neg logit contributions
EMS-0.377
 Mehran-0.341
papers-0.326
 accuracy-0.323
":{"-0.314
Token-
Token ID12
Feature activation-5.79e-03

Pos logit contributions
iblical+0.095
uca+0.094
leneck+0.091
 Painter+0.090
esan+0.085
Neg logit contributions
EMS-0.099
 Mehran-0.088
papers-0.084
 accuracy-0.083
 Stability-0.080
Tokensama
Token ID33843
Feature activation+1.44e-01

Pos logit contributions
EMS+0.003
 Mehran+0.002
papers+0.002
 accuracy+0.002
 Stability+0.002
Neg logit contributions
iblical-0.005
uca-0.005
leneck-0.005
 Painter-0.005
esan-0.005
Token showed
Token ID3751
Feature activation-1.30e-01

Pos logit contributions
iblical+0.099
uca+0.099
leneck+0.095
 Painter+0.094
esan+0.089
Neg logit contributions
EMS-0.108
 Mehran-0.095
papers-0.091
 accuracy-0.090
 Stability-0.087
Token up
Token ID510
Feature activation-8.20e-02

Pos logit contributions
EMS+0.086
 Mehran+0.075
papers+0.071
 accuracy+0.070
 Stability+0.067
Neg logit contributions
uca-0.101
iblical-0.101
leneck-0.098
 Painter-0.097
esan-0.092
Token is
Token ID318
Feature activation-4.07e-02

Pos logit contributions
iblical+0.070
uca+0.070
leneck+0.067
 Painter+0.067
esan+0.063
Neg logit contributions
EMS-0.065
 Mehran-0.057
papers-0.054
 accuracy-0.053
":{"-0.051
Token pretty
Token ID2495
Feature activation+7.06e-03

Pos logit contributions
EMS+0.029
 Mehran+0.026
papers+0.024
 accuracy+0.024
":{"+0.023
Neg logit contributions
iblical-0.030
uca-0.029
leneck-0.029
 Painter-0.028
esan-0.026
Token much
Token ID881
Feature activation-3.22e-02

Pos logit contributions
iblical+0.005
uca+0.005
leneck+0.005
 Painter+0.005
esan+0.005
Neg logit contributions
EMS-0.005
 Mehran-0.004
papers-0.004
 accuracy-0.004
":{"-0.004
Token secondary
Token ID9233
Feature activation-1.89e-02

Pos logit contributions
EMS+0.023
 Mehran+0.020
papers+0.019
 accuracy+0.019
":{"+0.018
Neg logit contributions
iblical-0.024
uca-0.023
leneck-0.023
 Painter-0.022
esan-0.021
Token stream
Token ID4269
Feature activation+8.71e-02

Pos logit contributions
EMS+0.013
 Mehran+0.011
papers+0.011
 accuracy+0.011
":{"+0.010
Neg logit contributions
iblical-0.014
uca-0.014
leneck-0.014
 Painter-0.014
esan-0.013
Token fodder
Token ID41579
Feature activation+4.36e-01

Pos logit contributions
iblical+0.065
uca+0.065
leneck+0.063
 Painter+0.062
esan+0.058
Neg logit contributions
EMS-0.060
 Mehran-0.053
papers-0.050
 accuracy-0.050
":{"-0.048
Token if
Token ID611
Feature activation+5.68e-02

Pos logit contributions
iblical+0.331
uca+0.330
leneck+0.319
 Painter+0.315
esan+0.297
Neg logit contributions
EMS-0.296
 Mehran-0.261
papers-0.245
 accuracy-0.242
":{"-0.234
Token the
Token ID262
Feature activation-3.15e-03

Pos logit contributions
uca+0.042
iblical+0.042
leneck+0.040
 Painter+0.040
esan+0.037
Neg logit contributions
EMS-0.040
 Mehran-0.035
papers-0.034
 accuracy-0.033
":{"-0.032
Token second
Token ID1218
Feature activation+5.74e-03

Pos logit contributions
EMS+0.002
 Mehran+0.002
papers+0.002
 accuracy+0.002
":{"+0.002
Neg logit contributions
iblical-0.002
uca-0.002
leneck-0.002
 Painter-0.002
esan-0.002
Token stream
Token ID4269
Feature activation+4.80e-02

Pos logit contributions
uca+0.004
iblical+0.004
leneck+0.004
 Painter+0.004
esan+0.004
Neg logit contributions
EMS-0.004
 Mehran-0.004
papers-0.003
 accuracy-0.003
":{"-0.003
Token does
Token ID857
Feature activation+1.70e-02

Pos logit contributions
iblical+0.036
uca+0.036
leneck+0.034
 Painter+0.034
esan+0.032
Neg logit contributions
EMS-0.033
 Mehran-0.029
papers-0.028
 accuracy-0.027
":{"-0.027
Token the
Token ID262
Feature activation+3.14e-02

Pos logit contributions
iblical+0.020
uca+0.020
leneck+0.019
 Painter+0.019
esan+0.018
Neg logit contributions
EMS-0.017
 Mehran-0.015
papers-0.014
 accuracy-0.013
":{"-0.013
Token B
Token ID347
Feature activation+5.59e-02

Pos logit contributions
iblical+0.024
uca+0.024
leneck+0.023
 Painter+0.023
esan+0.021
Neg logit contributions
EMS-0.022
 Mehran-0.019
papers-0.018
 accuracy-0.017
":{"-0.017
TokenDP
Token ID6322
Feature activation-1.23e-01

Pos logit contributions
iblical+0.053
uca+0.053
leneck+0.051
 Painter+0.051
esan+0.049
Neg logit contributions
EMS-0.027
 Mehran-0.023
papers-0.021
 accuracy-0.021
":{"-0.019
Token +
Token ID1343
Feature activation+1.57e-01

Pos logit contributions
EMS+0.085
 Mehran+0.074
papers+0.070
 accuracy+0.069
":{"+0.067
Neg logit contributions
iblical-0.092
uca-0.092
leneck-0.090
 Painter-0.088
esan-0.083
Token Bott
Token ID14835
Feature activation-3.33e-02

Pos logit contributions
iblical+0.118
uca+0.118
leneck+0.115
 Painter+0.112
esan+0.106
Neg logit contributions
EMS-0.107
 Mehran-0.094
papers-0.089
 accuracy-0.087
":{"-0.085
Tokenleneck
Token ID43163
Feature activation+4.36e-01

Pos logit contributions
EMS+0.046
 Mehran+0.043
papers+0.042
 accuracy+0.042
":{"+0.041
Neg logit contributions
iblical-0.002
uca-0.002
leneck-0.001
 Painter-0.001
esan+0.001
Token Buffer
Token ID47017
Feature activation-1.74e-02

Pos logit contributions
uca+0.332
iblical+0.332
leneck+0.324
 Painter+0.319
esan+0.300
Neg logit contributions
EMS-0.294
 Mehran-0.258
papers-0.243
 accuracy-0.238
":{"-0.231
Token Size
Token ID12849
Feature activation+1.32e-01

Pos logit contributions
EMS+0.012
 Mehran+0.010
papers+0.010
 accuracy+0.009
":{"+0.009
Neg logit contributions
iblical-0.013
uca-0.013
leneck-0.013
 Painter-0.013
esan-0.012
Token point
Token ID966
Feature activation-6.31e-02

Pos logit contributions
iblical+0.105
uca+0.105
leneck+0.102
 Painter+0.101
esan+0.095
Neg logit contributions
EMS-0.085
 Mehran-0.074
papers-0.070
 accuracy-0.068
":{"-0.066
Token (
Token ID357
Feature activation+1.80e-02

Pos logit contributions
EMS+0.042
 Mehran+0.036
papers+0.034
 accuracy+0.034
":{"+0.032
Neg logit contributions
iblical-0.049
uca-0.049
leneck-0.048
 Painter-0.047
esan-0.044
Tokengreen
Token ID14809
Feature activation+5.60e-02

Pos logit contributions
iblical+0.013
uca+0.013
leneck+0.012
 Painter+0.012
esan+0.011
Neg logit contributions
EMS-0.013
 Mehran-0.012
papers-0.011
 accuracy-0.011
":{"-0.011
Token of
Token ID286
Feature activation+7.57e-03

Pos logit contributions
uca+0.053
iblical+0.053
leneck+0.051
 Painter+0.051
esan+0.048
Neg logit contributions
EMS-0.050
 Mehran-0.044
papers-0.041
 accuracy-0.041
 Stability-0.040
Token a
Token ID257
Feature activation+2.71e-04

Pos logit contributions
iblical+0.005
uca+0.005
leneck+0.005
 Painter+0.005
esan+0.005
Neg logit contributions
EMS-0.005
 Mehran-0.005
papers-0.005
 accuracy-0.005
":{"-0.004
Token person
Token ID1048
Feature activation+7.73e-02

Pos logit contributions
iblical+0.000
uca+0.000
leneck+0.000
 Painter+0.000
esan+0.000
Neg logit contributions
EMS-0.000
 Mehran-0.000
papers-0.000
 accuracy-0.000
":{"-0.000
Token is
Token ID318
Feature activation+1.66e-03

Pos logit contributions
iblical+0.057
uca+0.057
leneck+0.055
 Painter+0.054
esan+0.051
Neg logit contributions
EMS-0.055
 Mehran-0.048
papers-0.046
 accuracy-0.045
 Stability-0.043
Token Y
Token ID575
Feature activation-4.75e-02

Pos logit contributions
uca+0.001
iblical+0.001
leneck+0.001
 Painter+0.001
esan+0.001
Neg logit contributions
EMS-0.001
 Mehran-0.001
papers-0.001
 accuracy-0.001
 Stability-0.001
Tokenoru
Token ID27786
Feature activation+4.32e-01

Pos logit contributions
EMS+0.062
 Mehran+0.058
papers+0.057
 accuracy+0.056
 Stability+0.055
Neg logit contributions
uca-0.006
iblical-0.006
leneck-0.005
 Painter-0.005
esan-0.003
Tokenichi
Token ID16590
Feature activation+1.40e-01

Pos logit contributions
iblical+0.250
uca+0.249
leneck+0.238
 Painter+0.234
esan+0.217
Neg logit contributions
EMS-0.372
 Mehran-0.335
papers-0.321
 accuracy-0.318
":{"-0.310
Token-
Token ID12
Feature activation+7.71e-03

Pos logit contributions
iblical+0.101
uca+0.101
leneck+0.098
 Painter+0.097
esan+0.091
Neg logit contributions
EMS-0.100
 Mehran-0.088
papers-0.084
 accuracy-0.083
":{"-0.080
Tokensan
Token ID12807
Feature activation-1.73e-01

Pos logit contributions
iblical+0.007
uca+0.007
leneck+0.007
 Painter+0.007
esan+0.007
Neg logit contributions
EMS-0.004
 Mehran-0.003
papers-0.003
 accuracy-0.003
":{"-0.003
Token?
Token ID30
Feature activation+5.30e-02

Pos logit contributions
EMS+0.122
 Mehran+0.107
papers+0.102
 accuracy+0.101
 Stability+0.097
Neg logit contributions
uca-0.126
iblical-0.126
leneck-0.121
 Painter-0.120
esan-0.113
Token People
Token ID4380
Feature activation-1.55e-02

Pos logit contributions
iblical+0.040
uca+0.040
leneck+0.039
 Painter+0.038
esan+0.036
Neg logit contributions
EMS-0.036
 Mehran-0.032
papers-0.030
 accuracy-0.030
":{"-0.028
Token Christian
Token ID4302
Feature activation-2.19e-05

Pos logit contributions
EMS+0.040
 Mehran+0.035
papers+0.033
 accuracy+0.033
 Stability+0.032
Neg logit contributions
iblical-0.045
uca-0.045
leneck-0.044
 Painter-0.043
esan-0.041
Token,
Token ID11
Feature activation+6.67e-03

Pos logit contributions
EMS+0.000
 Mehran+0.000
papers+0.000
 accuracy+0.000
 Stability+0.000
Neg logit contributions
iblical-0.000
uca-0.000
leneck-0.000
 Painter-0.000
esan-0.000
Token hired
Token ID9657
Feature activation+1.83e-02

Pos logit contributions
iblical+0.005
uca+0.005
leneck+0.005
 Painter+0.005
esan+0.004
Neg logit contributions
EMS-0.005
 Mehran-0.004
papers-0.004
 accuracy-0.004
 Stability-0.004
Token this
Token ID428
Feature activation+5.56e-04

Pos logit contributions
iblical+0.014
uca+0.014
leneck+0.013
 Painter+0.013
esan+0.012
Neg logit contributions
EMS-0.013
 Mehran-0.011
papers-0.010
 accuracy-0.010
 Stability-0.010
Token sod
Token ID32581
Feature activation+1.03e-01

Pos logit contributions
iblical+0.000
uca+0.000
leneck+0.000
 Painter+0.000
esan+0.000
Neg logit contributions
EMS-0.000
 Mehran-0.000
papers-0.000
 accuracy-0.000
 Stability-0.000
Tokenom
Token ID296
Feature activation+4.31e-01

Pos logit contributions
iblical+0.037
uca+0.037
leneck+0.034
 Painter+0.033
esan+0.029
Neg logit contributions
EMS-0.112
 Mehran-0.103
papers-0.099
 accuracy-0.099
":{"-0.097
Tokenite
Token ID578
Feature activation-3.54e-02

Pos logit contributions
iblical+0.347
uca+0.343
leneck+0.332
 Painter+0.331
esan+0.315
Neg logit contributions
EMS-0.277
 Mehran-0.240
papers-0.225
 accuracy-0.222
 Stability-0.214
Token?
Token ID30
Feature activation+2.13e-02

Pos logit contributions
EMS+0.025
 Mehran+0.022
papers+0.021
 accuracy+0.020
 Stability+0.020
Neg logit contributions
iblical-0.026
uca-0.026
leneck-0.025
 Painter-0.025
esan-0.024
Token
Token ID447
Feature activation-1.03e-02

Pos logit contributions
iblical+0.016
uca+0.016
leneck+0.016
 Painter+0.016
esan+0.015
Neg logit contributions
EMS-0.014
 Mehran-0.012
papers-0.012
 accuracy-0.012
":{"-0.011
Token
Token ID251
Feature activation-3.62e-02

Pos logit contributions
EMS+0.007
 Mehran+0.006
papers+0.005
 accuracy+0.005
":{"+0.005
Neg logit contributions
iblical-0.008
uca-0.008
leneck-0.008
 Painter-0.008
esan-0.008
Token
Token ID198
Feature activation+2.06e-02

Pos logit contributions
EMS+0.028
 Mehran+0.025
papers+0.024
 accuracy+0.024
":{"+0.023
Neg logit contributions
iblical-0.024
uca-0.024
leneck-0.023
 Painter-0.023
esan-0.021
Token
Token ID447
Feature activation-1.46e-02

Pos logit contributions
EMS+0.001
 Mehran+0.001
papers+0.001
 accuracy+0.001
 Stability+0.001
Neg logit contributions
iblical-0.001
uca-0.001
leneck-0.001
 Painter-0.001
esan-0.001
Token
Token ID247
Feature activation-1.40e-02

Pos logit contributions
EMS+0.010
 Mehran+0.009
papers+0.008
 accuracy+0.008
":{"+0.008
Neg logit contributions
iblical-0.011
uca-0.011
leneck-0.010
 Painter-0.010
esan-0.010
Tokens
Token ID82
Feature activation+2.09e-03

Pos logit contributions
EMS+0.010
 Mehran+0.009
papers+0.008
 accuracy+0.008
":{"+0.008
Neg logit contributions
iblical-0.010
uca-0.010
leneck-0.010
 Painter-0.010
esan-0.009
Token a
Token ID257
Feature activation+3.83e-02

Pos logit contributions
iblical+0.002
uca+0.002
leneck+0.002
 Painter+0.002
esan+0.001
Neg logit contributions
EMS-0.001
 Mehran-0.001
papers-0.001
 accuracy-0.001
 Stability-0.001
Token sod
Token ID32581
Feature activation+8.90e-02

Pos logit contributions
iblical+0.029
uca+0.028
leneck+0.028
 Painter+0.027
esan+0.026
Neg logit contributions
EMS-0.027
 Mehran-0.023
papers-0.022
 accuracy-0.022
":{"-0.021
Tokenom
Token ID296
Feature activation+4.19e-01

Pos logit contributions
iblical+0.031
uca+0.031
leneck+0.029
 Painter+0.028
esan+0.024
Neg logit contributions
EMS-0.096
 Mehran-0.089
papers-0.086
 accuracy-0.086
":{"-0.084
Tokenite
Token ID578
Feature activation-3.45e-02

Pos logit contributions
iblical+0.342
uca+0.339
leneck+0.328
 Painter+0.326
esan+0.310
Neg logit contributions
EMS-0.264
 Mehran-0.228
papers-0.214
 accuracy-0.211
 Stability-0.203
Token!
Token ID0
Feature activation+3.59e-02

Pos logit contributions
EMS+0.024
 Mehran+0.021
papers+0.020
 accuracy+0.020
 Stability+0.019
Neg logit contributions
iblical-0.025
uca-0.025
leneck-0.024
 Painter-0.024
esan-0.023
Token
Token ID447
Feature activation-2.17e-02

Pos logit contributions
iblical+0.027
uca+0.027
leneck+0.026
 Painter+0.026
esan+0.025
Neg logit contributions
EMS-0.024
 Mehran-0.021
papers-0.020
 accuracy-0.020
":{"-0.019
Token
Token ID251
Feature activation-4.71e-02

Pos logit contributions
EMS+0.014
 Mehran+0.012
papers+0.011
 accuracy+0.011
":{"+0.010
Neg logit contributions
iblical-0.018
uca-0.018
leneck-0.017
 Painter-0.017
esan-0.016
Token
Token ID198
Feature activation+9.22e-03

Pos logit contributions
EMS+0.037
 Mehran+0.033
papers+0.031
 accuracy+0.031
":{"+0.030
Neg logit contributions
iblical-0.031
uca-0.031
leneck-0.030
 Painter-0.029
esan-0.027
Tokenthanks
Token ID27547
Feature activation+1.56e-01

Pos logit contributions
iblical+0.023
uca+0.023
leneck+0.023
 Painter+0.022
esan+0.021
Neg logit contributions
EMS-0.025
 Mehran-0.023
papers-0.021
 accuracy-0.021
":{"-0.021
Token @
Token ID2488
Feature activation+4.32e-02

Pos logit contributions
iblical+0.117
uca+0.117
leneck+0.113
 Painter+0.112
esan+0.105
Neg logit contributions
EMS-0.108
 Mehran-0.095
papers-0.090
 accuracy-0.089
":{"-0.086
Tokenc
Token ID66
Feature activation+2.54e-02

Pos logit contributions
iblical+0.031
uca+0.031
leneck+0.030
 Painter+0.029
esan+0.028
Neg logit contributions
EMS-0.031
 Mehran-0.028
papers-0.026
 accuracy-0.026
":{"-0.025
Tokenper
Token ID525
Feature activation-4.30e-02

Pos logit contributions
iblical+0.018
uca+0.018
leneck+0.018
 Painter+0.018
esan+0.016
Neg logit contributions
EMS-0.018
 Mehran-0.016
 accuracy-0.015
papers-0.015
 Stability-0.015
Tokenc
Token ID66
Feature activation+3.29e-02

Pos logit contributions
EMS+0.030
 Mehran+0.027
papers+0.025
 accuracy+0.025
":{"+0.024
Neg logit contributions
iblical-0.032
uca-0.032
leneck-0.031
 Painter-0.030
esan-0.028
Tokeniva
Token ID12151
Feature activation+4.18e-01

Pos logit contributions
iblical+0.023
uca+0.023
leneck+0.022
 Painter+0.022
esan+0.021
Neg logit contributions
EMS-0.024
 Mehran-0.022
papers-0.020
 accuracy-0.020
":{"-0.020
Token).
Token ID737
Feature activation-3.73e-02

Pos logit contributions
iblical+0.288
uca+0.287
leneck+0.276
 Painter+0.272
esan+0.256
Neg logit contributions
EMS-0.315
 Mehran-0.280
papers-0.265
 accuracy-0.264
":{"-0.254
Token
Token ID198
Feature activation+1.25e-02

Pos logit contributions
EMS+0.027
 Mehran+0.024
papers+0.023
 accuracy+0.023
":{"+0.022
Neg logit contributions
iblical-0.026
uca-0.026
leneck-0.025
 Painter-0.025
esan-0.024
Token
Token ID198
Feature activation+2.18e-03

Pos logit contributions
iblical+0.008
uca+0.008
leneck+0.008
 Painter+0.008
esan+0.007
Neg logit contributions
EMS-0.010
 Mehran-0.009
papers-0.008
 accuracy-0.008
":{"-0.008
Token3
Token ID18
Feature activation+3.90e-02

Pos logit contributions
iblical+0.001
uca+0.001
leneck+0.001
 Painter+0.001
esan+0.001
Neg logit contributions
EMS-0.002
 Mehran-0.001
 accuracy-0.001
papers-0.001
 Stability-0.001
Token.
Token ID13
Feature activation-1.79e-02

Pos logit contributions
iblical+0.030
uca+0.030
leneck+0.029
 Painter+0.029
esan+0.027
Neg logit contributions
EMS-0.026
 Mehran-0.023
papers-0.021
 accuracy-0.021
":{"-0.020
Token (
Token ID357
Feature activation-3.95e-02

Pos logit contributions
EMS+0.096
 Mehran+0.084
papers+0.079
 accuracy+0.079
 Stability+0.076
Neg logit contributions
uca-0.102
iblical-0.102
leneck-0.098
 Painter-0.097
esan-0.092
Tokenve
Token ID303
Feature activation+3.99e-03

Pos logit contributions
EMS+0.029
 Mehran+0.026
papers+0.025
 accuracy+0.024
":{"+0.024
Neg logit contributions
iblical-0.028
uca-0.028
leneck-0.027
 Painter-0.026
esan-0.025
Tokenget
Token ID1136
Feature activation+1.87e-02

Pos logit contributions
iblical+0.003
uca+0.003
leneck+0.003
 Painter+0.003
esan+0.002
Neg logit contributions
EMS-0.003
 Mehran-0.003
papers-0.003
 accuracy-0.003
":{"-0.002
Tokenarian
Token ID3699
Feature activation-7.68e-02

Pos logit contributions
iblical+0.013
uca+0.013
leneck+0.012
 Painter+0.012
esan+0.011
Neg logit contributions
EMS-0.014
 Mehran-0.013
papers-0.012
 accuracy-0.012
":{"-0.012
Token pl
Token ID458
Feature activation+8.28e-02

Pos logit contributions
EMS+0.058
 Mehran+0.052
papers+0.049
 accuracy+0.049
":{"+0.047
Neg logit contributions
iblical-0.053
uca-0.052
leneck-0.051
 Painter-0.050
esan-0.047
Tokenatter
Token ID1436
Feature activation-4.65e-01

Pos logit contributions
iblical+0.080
uca+0.080
leneck+0.078
 Painter+0.077
esan+0.073
Neg logit contributions
EMS-0.040
 Mehran-0.033
papers-0.030
 accuracy-0.029
 Stability-0.028
Token)
Token ID8
Feature activation-1.04e-02

Pos logit contributions
EMS+0.344
 Mehran+0.306
papers+0.290
 accuracy+0.288
":{"+0.279
Neg logit contributions
iblical-0.325
uca-0.323
leneck-0.313
 Painter-0.308
esan-0.286
Token consisting
Token ID17747
Feature activation-5.43e-02

Pos logit contributions
EMS+0.007
 Mehran+0.006
papers+0.006
 accuracy+0.006
":{"+0.006
Neg logit contributions
iblical-0.008
uca-0.008
leneck-0.007
 Painter-0.007
esan-0.007
Token of
Token ID286
Feature activation-3.10e-02

Pos logit contributions
EMS+0.037
 Mehran+0.033
papers+0.031
 accuracy+0.031
":{"+0.030
Neg logit contributions
iblical-0.041
uca-0.041
leneck-0.039
 Painter-0.039
esan-0.037
Token s
Token ID264
Feature activation-8.18e-03

Pos logit contributions
EMS+0.023
 Mehran+0.020
papers+0.019
 accuracy+0.019
":{"+0.018
Neg logit contributions
iblical-0.022
uca-0.022
leneck-0.021
 Painter-0.021
esan-0.020
Tokenarson
Token ID12613
Feature activation-6.81e-02

Pos logit contributions
EMS+0.006
 Mehran+0.005
papers+0.005
 accuracy+0.005
":{"+0.005
Neg logit contributions
iblical-0.006
uca-0.006
leneck-0.005
 Painter-0.005
esan-0.005
Tokenop
Token ID404
Feature activation-8.72e-02

Pos logit contributions
EMS+0.042
 Mehran+0.037
papers+0.035
 accuracy+0.034
":{"+0.033
Neg logit contributions
iblical-0.042
uca-0.042
leneck-0.040
 Painter-0.040
esan-0.037
Token games
Token ID1830
Feature activation-2.91e-02

Pos logit contributions
EMS+0.061
 Mehran+0.053
papers+0.050
 accuracy+0.050
":{"+0.048
Neg logit contributions
iblical-0.064
uca-0.064
leneck-0.062
 Painter-0.062
esan-0.058
Token of
Token ID286
Feature activation-4.19e-02

Pos logit contributions
EMS+0.021
 Mehran+0.018
papers+0.017
 accuracy+0.017
":{"+0.016
Neg logit contributions
iblical-0.021
uca-0.021
leneck-0.020
 Painter-0.020
esan-0.019
Token rock
Token ID3881
Feature activation+3.72e-02

Pos logit contributions
EMS+0.029
 Mehran+0.025
papers+0.024
 accuracy+0.023
":{"+0.023
Neg logit contributions
iblical-0.031
uca-0.031
leneck-0.030
 Painter-0.030
esan-0.028
Token-
Token ID12
Feature activation-5.80e-02

Pos logit contributions
iblical+0.027
uca+0.027
leneck+0.026
 Painter+0.025
esan+0.024
Neg logit contributions
EMS-0.027
 Mehran-0.024
papers-0.022
 accuracy-0.022
":{"-0.022
Tokenpaper
Token ID20189
Feature activation-4.62e-01

Pos logit contributions
EMS+0.018
 Mehran+0.013
 accuracy+0.011
papers+0.010
 Stability+0.009
Neg logit contributions
uca-0.066
iblical-0.066
leneck-0.064
 Painter-0.064
esan-0.061
Token-
Token ID12
Feature activation-4.11e-02

Pos logit contributions
EMS+0.330
 Mehran+0.291
papers+0.275
 accuracy+0.273
":{"+0.263
Neg logit contributions
iblical-0.336
uca-0.334
leneck-0.323
 Painter-0.319
esan-0.300
Tokensc
Token ID1416
Feature activation-1.50e-01

Pos logit contributions
EMS+0.021
 Mehran+0.017
papers+0.016
 accuracy+0.016
":{"+0.015
Neg logit contributions
iblical-0.038
uca-0.038
leneck-0.037
 Painter-0.037
esan-0.035
Tokenissors
Token ID32555
Feature activation+8.32e-02

Pos logit contributions
EMS+0.134
 Mehran+0.121
papers+0.116
 accuracy+0.115
":{"+0.112
Neg logit contributions
iblical-0.083
uca-0.082
leneck-0.079
 Painter-0.077
esan-0.071
Token in
Token ID287
Feature activation+1.52e-03

Pos logit contributions
iblical+0.062
uca+0.062
leneck+0.060
 Painter+0.060
esan+0.056
Neg logit contributions
EMS-0.057
 Mehran-0.050
papers-0.047
 accuracy-0.047
":{"-0.045
Token a
Token ID257
Feature activation+6.48e-03

Pos logit contributions
uca+0.001
iblical+0.001
leneck+0.001
 Painter+0.001
esan+0.001
Neg logit contributions
EMS-0.001
 Mehran-0.001
papers-0.001
 accuracy-0.001
":{"-0.001
Token Great
Token ID3878
Feature activation+1.50e-02

Pos logit contributions
iblical+0.062
uca+0.061
leneck+0.059
 Painter+0.059
esan+0.055
Neg logit contributions
EMS-0.063
 Mehran-0.056
papers-0.053
 accuracy-0.052
":{"-0.051
Token Again
Token ID6521
Feature activation-5.75e-02

Pos logit contributions
iblical+0.010
uca+0.010
leneck+0.010
 Painter+0.010
esan+0.009
Neg logit contributions
EMS-0.011
 Mehran-0.010
papers-0.010
 accuracy-0.009
":{"-0.009
Token #
Token ID1303
Feature activation+1.24e-01

Pos logit contributions
EMS+0.037
 Mehran+0.033
papers+0.031
 accuracy+0.030
":{"+0.029
Neg logit contributions
iblical-0.045
uca-0.045
leneck-0.044
 Painter-0.043
esan-0.041
TokenIn
Token ID818
Feature activation-3.17e-02

Pos logit contributions
iblical+0.078
uca+0.078
leneck+0.075
 Painter+0.074
esan+0.068
Neg logit contributions
EMS-0.100
 Mehran-0.090
papers-0.085
 accuracy-0.085
":{"-0.082
Tokenaug
Token ID7493
Feature activation+1.38e-01

Pos logit contributions
EMS+0.025
 Mehran+0.022
papers+0.021
 accuracy+0.021
":{"+0.020
Neg logit contributions
iblical-0.021
uca-0.021
leneck-0.020
 Painter-0.020
esan-0.018
Tokenuration
Token ID3924
Feature activation-4.61e-01

Pos logit contributions
iblical+0.161
uca+0.161
leneck+0.158
 Painter+0.156
esan+0.150
Neg logit contributions
EMS-0.038
 Mehran-0.026
papers-0.021
 accuracy-0.021
 Stability-0.018
Token2017
Token ID5539
Feature activation+1.61e-02

Pos logit contributions
EMS+0.328
 Mehran+0.290
papers+0.274
 accuracy+0.271
":{"+0.262
Neg logit contributions
iblical-0.335
uca-0.334
leneck-0.323
 Painter-0.320
esan-0.300
Token
Token ID6184
Feature activation-5.86e-02

Pos logit contributions
iblical+0.012
uca+0.012
leneck+0.011
 Painter+0.011
esan+0.010
Neg logit contributions
EMS-0.011
 Mehran-0.010
papers-0.010
 accuracy-0.009
":{"-0.009
Token
Token ID95
Feature activation+8.37e-02

Pos logit contributions
EMS+0.044
 Mehran+0.040
papers+0.037
 accuracy+0.037
":{"+0.036
Neg logit contributions
iblical-0.040
uca-0.040
leneck-0.038
 Painter-0.038
esan-0.035
Token
Token ID126
Feature activation-1.79e-02

Pos logit contributions
iblical+0.060
uca+0.060
leneck+0.058
 Painter+0.057
esan+0.053
Neg logit contributions
EMS-0.061
 Mehran-0.054
papers-0.051
 accuracy-0.050
":{"-0.048
Token
Token ID222
Feature activation+3.31e-02

Pos logit contributions
EMS+0.013
 Mehran+0.012
papers+0.011
 accuracy+0.011
":{"+0.011
Neg logit contributions
iblical-0.013
uca-0.012
leneck-0.012
 Painter-0.012
esan-0.011
Token today
Token ID1909
Feature activation-9.01e-02

Pos logit contributions
EMS+0.023
 Mehran+0.020
papers+0.019
 accuracy+0.019
":{"+0.018
Neg logit contributions
iblical-0.027
uca-0.027
leneck-0.026
 Painter-0.026
esan-0.024
Token's
Token ID338
Feature activation-3.82e-02

Pos logit contributions
EMS+0.072
 Mehran+0.065
papers+0.062
 accuracy+0.061
":{"+0.059
Neg logit contributions
iblical-0.058
uca-0.057
leneck-0.055
 Painter-0.054
esan-0.051
Token crash
Token ID7014
Feature activation+1.62e-02

Pos logit contributions
EMS+0.026
 Mehran+0.023
papers+0.021
 accuracy+0.021
":{"+0.020
Neg logit contributions
iblical-0.029
uca-0.029
leneck-0.028
 Painter-0.028
esan-0.026
Token will
Token ID481
Feature activation-6.67e-02

Pos logit contributions
iblical+0.012
uca+0.012
leneck+0.011
 Painter+0.011
esan+0.010
Neg logit contributions
EMS-0.012
 Mehran-0.010
papers-0.010
 accuracy-0.010
":{"-0.009
Token inaug
Token ID14243
Feature activation+1.24e-01

Pos logit contributions
EMS+0.046
 Mehran+0.041
papers+0.038
 accuracy+0.038
":{"+0.037
Neg logit contributions
iblical-0.050
uca-0.050
leneck-0.048
 Painter-0.047
esan-0.045
Tokenurate
Token ID15537
Feature activation-4.39e-01

Pos logit contributions
uca+0.140
iblical+0.139
leneck+0.137
 Painter+0.135
esan+0.130
Neg logit contributions
EMS-0.040
 Mehran-0.029
papers-0.024
 accuracy-0.024
 Stability-0.021
Token a
Token ID257
Feature activation-1.51e-02

Pos logit contributions
EMS+0.290
 Mehran+0.253
papers+0.238
 accuracy+0.235
":{"+0.227
Neg logit contributions
iblical-0.343
uca-0.341
leneck-0.331
 Painter-0.327
esan-0.307
Token bear
Token ID6842
Feature activation+9.71e-02

Pos logit contributions
EMS+0.010
 Mehran+0.009
papers+0.008
 accuracy+0.008
":{"+0.008
Neg logit contributions
iblical-0.012
uca-0.012
leneck-0.011
 Painter-0.011
esan-0.011
Token market
Token ID1910
Feature activation-1.19e-01

Pos logit contributions
iblical+0.081
uca+0.080
leneck+0.078
 Painter+0.077
esan+0.073
Neg logit contributions
EMS-0.059
 Mehran-0.051
papers-0.048
 accuracy-0.047
":{"-0.045
Token like
Token ID588
Feature activation-4.65e-02

Pos logit contributions
EMS+0.081
 Mehran+0.071
papers+0.067
 accuracy+0.066
":{"+0.064
Neg logit contributions
iblical-0.091
uca-0.090
leneck-0.087
 Painter-0.086
esan-0.081
Token the
Token ID262
Feature activation-2.60e-02

Pos logit contributions
EMS+0.030
 Mehran+0.027
papers+0.025
 accuracy+0.025
":{"+0.024
Neg logit contributions
iblical-0.036
uca-0.036
leneck-0.035
 Painter-0.035
esan-0.033
Token.
Token ID13
Feature activation-1.50e-02

Pos logit contributions
EMS+0.027
 Mehran+0.023
papers+0.022
 accuracy+0.022
":{"+0.021
Neg logit contributions
iblical-0.036
uca-0.036
leneck-0.035
 Painter-0.034
esan-0.032
Token
Token ID198
Feature activation+8.51e-03

Pos logit contributions
EMS+0.011
 Mehran+0.010
papers+0.009
 accuracy+0.009
":{"+0.009
Neg logit contributions
iblical-0.011
uca-0.011
leneck-0.010
 Painter-0.010
esan-0.009
Token
Token ID198
Feature activation+1.93e-03

Pos logit contributions
iblical+0.006
uca+0.006
leneck+0.006
 Painter+0.005
esan+0.005
Neg logit contributions
EMS-0.007
 Mehran-0.006
papers-0.006
 accuracy-0.005
":{"-0.005
TokenBut
Token ID1537
Feature activation-4.01e-02

Pos logit contributions
iblical+0.001
uca+0.001
leneck+0.001
 Painter+0.001
esan+0.001
Neg logit contributions
EMS-0.001
 Mehran-0.001
papers-0.001
 accuracy-0.001
 Stability-0.001
Token Bur
Token ID5481
Feature activation+9.37e-02

Pos logit contributions
EMS+0.028
 Mehran+0.024
papers+0.023
 accuracy+0.023
":{"+0.022
Neg logit contributions
iblical-0.030
uca-0.030
leneck-0.029
 Painter-0.029
esan-0.027
Tokenrows
Token ID8516
Feature activation-4.22e-01

Pos logit contributions
iblical+0.113
uca+0.113
leneck+0.111
 Painter+0.110
esan+0.106
Neg logit contributions
EMS-0.022
 Mehran-0.014
papers-0.011
 accuracy-0.010
":{"-0.008
Token\xe2\x80
Token ID447
Feature activation+5.31e-03

Pos logit contributions
EMS+0.288
 Mehran+0.252
papers+0.238
 accuracy+0.236
":{"+0.227
Neg logit contributions
iblical-0.320
uca-0.319
leneck-0.309
 Painter-0.305
esan-0.287
Token\x99
Token ID247
Feature activation-1.79e-04

Pos logit contributions
uca+0.004
iblical+0.004
leneck+0.004
 Painter+0.004
esan+0.004
Neg logit contributions
EMS-0.004
 Mehran-0.003
papers-0.003
 accuracy-0.003
 Stability-0.003
Tokens
Token ID82
Feature activation+5.94e-02

Pos logit contributions
EMS+0.000
 Mehran+0.000
papers+0.000
 accuracy+0.000
":{"+0.000
Neg logit contributions
iblical-0.000
uca-0.000
leneck-0.000
 Painter-0.000
esan-0.000
Token sterile
Token ID38697
Feature activation-5.89e-02

Pos logit contributions
iblical+0.042
uca+0.041
leneck+0.040
 Painter+0.040
esan+0.037
Neg logit contributions
EMS-0.044
 Mehran-0.039
papers-0.037
 accuracy-0.036
":{"-0.035
Token style
Token ID3918
Feature activation+1.48e-01

Pos logit contributions
EMS+0.041
 Mehran+0.036
papers+0.034
 accuracy+0.033
":{"+0.033
Neg logit contributions
iblical-0.043
uca-0.043
leneck-0.042
 Painter-0.042
esan-0.039
Token the
Token ID262
Feature activation-1.70e-02

Pos logit contributions
EMS+0.023
 Mehran+0.020
papers+0.019
 accuracy+0.019
":{"+0.018
Neg logit contributions
iblical-0.027
uca-0.027
leneck-0.026
 Painter-0.026
esan-0.024
Token Morris
Token ID14433
Feature activation-1.09e-01

Pos logit contributions
EMS+0.012
 Mehran+0.010
papers+0.010
 accuracy+0.009
":{"+0.009
Neg logit contributions
iblical-0.013
uca-0.013
leneck-0.012
 Painter-0.012
esan-0.012
Token Museum
Token ID9594
Feature activation-7.97e-02

Pos logit contributions
EMS+0.073
 Mehran+0.063
papers+0.060
 accuracy+0.059
":{"+0.057
Neg logit contributions
iblical-0.085
uca-0.085
leneck-0.082
 Painter-0.081
esan-0.077
Token in
Token ID287
Feature activation-7.27e-03

Pos logit contributions
EMS+0.056
 Mehran+0.049
papers+0.046
 accuracy+0.046
":{"+0.045
Neg logit contributions
iblical-0.059
uca-0.059
leneck-0.057
 Painter-0.056
esan-0.053
Token Mor
Token ID3461
Feature activation+4.40e-02

Pos logit contributions
EMS+0.006
 Mehran+0.005
papers+0.005
 accuracy+0.005
":{"+0.004
Neg logit contributions
iblical-0.005
uca-0.005
leneck-0.005
 Painter-0.005
esan-0.004
Tokenrist
Token ID1585
Feature activation-4.21e-01

Pos logit contributions
uca+0.041
iblical+0.041
leneck+0.040
 Painter+0.039
esan+0.037
Neg logit contributions
EMS-0.023
 Mehran-0.019
 accuracy-0.018
papers-0.017
 Stability-0.016
Tokenown
Token ID593
Feature activation+8.17e-02

Pos logit contributions
EMS+0.333
 Mehran+0.297
papers+0.284
 accuracy+0.281
":{"+0.273
Neg logit contributions
iblical-0.274
uca-0.272
leneck-0.262
 Painter-0.258
esan-0.240
Token,
Token ID11
Feature activation-1.14e-02

Pos logit contributions
uca+0.059
iblical+0.058
leneck+0.057
 Painter+0.056
esan+0.053
Neg logit contributions
EMS-0.059
 Mehran-0.052
papers-0.049
 accuracy-0.049
":{"-0.047
Token N
Token ID399
Feature activation-4.98e-02

Pos logit contributions
EMS+0.008
 Mehran+0.007
papers+0.006
 accuracy+0.006
":{"+0.006
Neg logit contributions
iblical-0.009
uca-0.009
leneck-0.009
 Painter-0.009
esan-0.008
Token.
Token ID13
Feature activation-5.38e-04

Pos logit contributions
EMS+0.029
 Mehran+0.025
papers+0.024
 accuracy+0.023
 Stability+0.022
Neg logit contributions
iblical-0.042
uca-0.042
leneck-0.041
 Painter-0.041
esan-0.038
TokenJ
Token ID41
Feature activation-3.81e-02

Pos logit contributions
EMS+0.000
 Mehran+0.000
papers+0.000
 accuracy+0.000
 Stability+0.000
Neg logit contributions
uca-0.000
iblical-0.000
 Painter-0.000
leneck-0.000
esan-0.000
Token have
Token ID423
Feature activation+2.64e-02

Pos logit contributions
iblical+0.039
uca+0.038
leneck+0.037
 Painter+0.037
esan+0.034
Neg logit contributions
EMS-0.041
 Mehran-0.036
papers-0.035
 accuracy-0.034
":{"-0.033
Token not
Token ID407
Feature activation-5.64e-03

Pos logit contributions
iblical+0.019
uca+0.019
leneck+0.018
 Painter+0.018
esan+0.017
Neg logit contributions
EMS-0.019
 Mehran-0.017
papers-0.016
 accuracy-0.016
":{"-0.015
Token yet
Token ID1865
Feature activation+2.39e-03

Pos logit contributions
EMS+0.004
 Mehran+0.004
papers+0.004
 accuracy+0.004
":{"+0.003
Neg logit contributions
iblical-0.004
uca-0.004
leneck-0.004
 Painter-0.004
esan-0.003
Token discussed
Token ID6693
Feature activation+1.01e-02

Pos logit contributions
iblical+0.002
uca+0.002
leneck+0.002
 Painter+0.002
esan+0.001
Neg logit contributions
EMS-0.002
 Mehran-0.002
papers-0.002
 accuracy-0.002
":{"-0.001
Token Bur
Token ID5481
Feature activation+1.04e-01

Pos logit contributions
iblical+0.007
uca+0.007
leneck+0.007
 Painter+0.007
esan+0.007
Neg logit contributions
EMS-0.007
 Mehran-0.006
papers-0.006
 accuracy-0.006
":{"-0.006
Tokenrows
Token ID8516
Feature activation-4.18e-01

Pos logit contributions
uca+0.089
iblical+0.089
leneck+0.087
 Painter+0.084
esan+0.081
Neg logit contributions
EMS-0.062
 Mehran-0.054
 accuracy-0.049
papers-0.049
 Stability-0.047
Token\xe2\x80
Token ID447
Feature activation+1.91e-02

Pos logit contributions
EMS+0.287
 Mehran+0.253
papers+0.238
 accuracy+0.235
":{"+0.227
Neg logit contributions
iblical-0.314
uca-0.313
leneck-0.304
 Painter-0.300
esan-0.283
Token\x99
Token ID247
Feature activation-7.38e-04

Pos logit contributions
uca+0.014
iblical+0.014
leneck+0.014
 Painter+0.013
esan+0.013
Neg logit contributions
EMS-0.013
 Mehran-0.012
papers-0.011
 accuracy-0.011
 Stability-0.011
Tokens
Token ID82
Feature activation+7.03e-02

Pos logit contributions
EMS+0.000
 Mehran+0.000
papers+0.000
 accuracy+0.000
":{"+0.000
Neg logit contributions
iblical-0.001
uca-0.001
leneck-0.001
 Painter-0.001
esan-0.001
Token art
Token ID1242
Feature activation+1.31e-01

Pos logit contributions
iblical+0.052
uca+0.052
leneck+0.050
 Painter+0.049
esan+0.046
Neg logit contributions
EMS-0.050
 Mehran-0.044
papers-0.041
 accuracy-0.040
":{"-0.039
Token work
Token ID670
Feature activation+1.61e-01

Pos logit contributions
iblical+0.095
uca+0.094
leneck+0.091
 Painter+0.090
esan+0.085
Neg logit contributions
EMS-0.094
 Mehran-0.083
papers-0.079
 accuracy-0.078
":{"-0.075
Token Brazil
Token ID7595
Feature activation+3.98e-02

Pos logit contributions
iblical+0.027
uca+0.027
leneck+0.026
 Painter+0.026
esan+0.024
Neg logit contributions
EMS-0.023
 Mehran-0.020
papers-0.019
 accuracy-0.019
":{"-0.018
Token on
Token ID319
Feature activation+3.25e-02

Pos logit contributions
iblical+0.030
uca+0.030
leneck+0.029
 Painter+0.029
esan+0.027
Neg logit contributions
EMS-0.027
 Mehran-0.024
papers-0.023
 accuracy-0.022
":{"-0.022
Token Monday
Token ID3321
Feature activation+5.50e-02

Pos logit contributions
iblical+0.021
uca+0.020
leneck+0.020
 Painter+0.019
esan+0.018
Neg logit contributions
EMS-0.026
 Mehran-0.023
papers-0.022
 accuracy-0.022
":{"-0.022
Token will
Token ID481
Feature activation+1.38e-02

Pos logit contributions
iblical+0.040
uca+0.040
leneck+0.038
 Painter+0.038
esan+0.036
Neg logit contributions
EMS-0.039
 Mehran-0.035
papers-0.033
 accuracy-0.032
":{"-0.031
Token inaug
Token ID14243
Feature activation+1.49e-01

Pos logit contributions
iblical+0.011
uca+0.011
leneck+0.011
 Painter+0.010
esan+0.010
Neg logit contributions
EMS-0.009
 Mehran-0.008
papers-0.007
 accuracy-0.007
":{"-0.007
Tokenurate
Token ID15537
Feature activation-4.12e-01

Pos logit contributions
uca+0.164
iblical+0.164
leneck+0.161
 Painter+0.159
esan+0.152
Neg logit contributions
EMS-0.052
 Mehran-0.038
papers-0.032
 accuracy-0.032
 Stability-0.029
Token a
Token ID257
Feature activation+4.97e-02

Pos logit contributions
EMS+0.272
 Mehran+0.237
papers+0.223
 accuracy+0.221
":{"+0.212
Neg logit contributions
iblical-0.321
uca-0.320
leneck-0.310
 Painter-0.307
esan-0.289
Token $
Token ID720
Feature activation-3.68e-02

Pos logit contributions
iblical+0.038
uca+0.038
leneck+0.036
 Painter+0.036
esan+0.034
Neg logit contributions
EMS-0.034
 Mehran-0.030
papers-0.028
 accuracy-0.028
":{"-0.027
Token900
Token ID12865
Feature activation+8.03e-02

Pos logit contributions
EMS+0.027
 Mehran+0.024
papers+0.022
 accuracy+0.022
":{"+0.021
Neg logit contributions
iblical-0.026
uca-0.026
leneck-0.025
 Painter-0.025
esan-0.023
Token million
Token ID1510
Feature activation+5.64e-02

Pos logit contributions
iblical+0.052
uca+0.052
leneck+0.050
 Painter+0.049
esan+0.046
Neg logit contributions
EMS-0.064
 Mehran-0.057
papers-0.054
 accuracy-0.054
":{"-0.052
Token port
Token ID2493
Feature activation+4.44e-02

Pos logit contributions
uca+0.042
iblical+0.042
leneck+0.041
 Painter+0.040
esan+0.038
Neg logit contributions
EMS-0.039
 Mehran-0.034
papers-0.032
 accuracy-0.031
":{"-0.031
Token
Token ID20977
Feature activation-2.77e-01

Pos logit contributions
iblical+0.097
uca+0.096
leneck+0.094
 Painter+0.092
esan+0.088
Neg logit contributions
EMS-0.072
 Mehran-0.062
papers-0.058
 accuracy-0.057
":{"-0.055
TokenCub
Token ID43632
Feature activation-9.71e-02

Pos logit contributions
EMS+0.190
 Mehran+0.166
papers+0.157
 accuracy+0.155
":{"+0.150
Neg logit contributions
iblical-0.210
uca-0.209
leneck-0.203
 Painter-0.200
esan-0.188
Tokenans
Token ID504
Feature activation+3.93e-02

Pos logit contributions
EMS+0.072
 Mehran+0.064
papers+0.061
 accuracy+0.060
 Stability+0.058
Neg logit contributions
uca-0.068
iblical-0.067
leneck-0.065
 Painter-0.064
esan-0.060
Token (
Token ID357
Feature activation+6.41e-03

Pos logit contributions
uca+0.030
iblical+0.030
leneck+0.029
 Painter+0.029
esan+0.027
Neg logit contributions
EMS-0.026
 Mehran-0.023
papers-0.022
 accuracy-0.021
":{"-0.021
Tokenbl
Token ID2436
Feature activation-1.16e-01

Pos logit contributions
iblical+0.004
uca+0.004
leneck+0.004
 Painter+0.004
esan+0.004
Neg logit contributions
EMS-0.005
 Mehran-0.005
papers-0.004
 accuracy-0.004
 Stability-0.004
Tokenacks
Token ID4595
Feature activation-4.11e-01

Pos logit contributions
EMS+0.054
 Mehran+0.044
papers+0.040
 accuracy+0.040
 Stability+0.038
Neg logit contributions
iblical-0.114
uca-0.113
leneck-0.110
 Painter-0.109
esan-0.104
Token and
Token ID290
Feature activation-1.99e-02

Pos logit contributions
EMS+0.292
 Mehran+0.257
papers+0.243
 accuracy+0.241
":{"+0.233
Neg logit contributions
iblical-0.300
uca-0.299
leneck-0.289
 Painter-0.285
esan-0.268
Token mul
Token ID35971
Feature activation-1.08e-01

Pos logit contributions
EMS+0.013
 Mehran+0.011
papers+0.011
 accuracy+0.011
":{"+0.010
Neg logit contributions
iblical-0.015
uca-0.015
leneck-0.015
 Painter-0.015
esan-0.014
Tokenatt
Token ID1078
Feature activation-8.90e-02

Pos logit contributions
EMS+0.061
 Mehran+0.052
papers+0.048
 accuracy+0.048
":{"+0.046
Neg logit contributions
iblical-0.094
uca-0.094
leneck-0.091
 Painter-0.090
esan-0.086
Tokenos
Token ID418
Feature activation+2.37e-02

Pos logit contributions
EMS+0.065
 Mehran+0.058
papers+0.055
 accuracy+0.055
 Stability+0.053
Neg logit contributions
iblical-0.063
uca-0.063
leneck-0.061
 Painter-0.059
esan-0.056
Token),
Token ID828
Feature activation+6.75e-02

Pos logit contributions
iblical+0.017
uca+0.017
leneck+0.017
 Painter+0.017
esan+0.016
Neg logit contributions
EMS-0.017
 Mehran-0.015
papers-0.014
 accuracy-0.014
":{"-0.013
Token for
Token ID329
Feature activation+1.19e-02

Pos logit contributions
EMS+0.025
 Mehran+0.022
papers+0.021
 accuracy+0.021
":{"+0.020
Neg logit contributions
iblical-0.025
uca-0.025
leneck-0.024
 Painter-0.024
esan-0.023
Token J
Token ID449
Feature activation-8.08e-03

Pos logit contributions
iblical+0.009
uca+0.009
leneck+0.009
 Painter+0.009
esan+0.008
Neg logit contributions
EMS-0.008
 Mehran-0.007
papers-0.007
 accuracy-0.007
":{"-0.006
Tokenoon
Token ID2049
Feature activation-1.45e-02

Pos logit contributions
EMS+0.006
 Mehran+0.005
papers+0.005
 accuracy+0.005
":{"+0.005
Neg logit contributions
iblical-0.006
uca-0.006
leneck-0.006
 Painter-0.005
esan-0.005
Tokena
Token ID64
Feature activation-1.30e-02

Pos logit contributions
EMS+0.012
 Mehran+0.011
 accuracy+0.010
papers+0.010
 Stability+0.010
Neg logit contributions
iblical-0.009
uca-0.009
leneck-0.009
 Painter-0.009
esan-0.008
Token "
Token ID366
Feature activation-1.59e-02

Pos logit contributions
EMS+0.009
 Mehran+0.008
papers+0.007
 accuracy+0.007
":{"+0.007
Neg logit contributions
iblical-0.010
uca-0.010
leneck-0.009
 Painter-0.009
esan-0.009
Tokennat
Token ID32353
Feature activation-4.09e-01

Pos logit contributions
EMS+0.012
 Mehran+0.010
papers+0.010
 accuracy+0.010
 Stability+0.009
Neg logit contributions
iblical-0.011
uca-0.011
leneck-0.011
 Painter-0.011
esan-0.010
Tokenu
Token ID84
Feature activation+1.94e-01

Pos logit contributions
EMS+0.297
 Mehran+0.262
papers+0.249
 accuracy+0.246
":{"+0.239
Neg logit contributions
iblical-0.293
uca-0.291
leneck-0.282
 Painter-0.278
esan-0.261
Token"
Token ID1
Feature activation-8.80e-03

Pos logit contributions
uca+0.147
iblical+0.147
leneck+0.141
 Painter+0.140
esan+0.132
Neg logit contributions
EMS-0.132
 Mehran-0.117
papers-0.111
 accuracy-0.109
 Stability-0.105
Token Le
Token ID1004
Feature activation-9.79e-02

Pos logit contributions
EMS+0.006
 Mehran+0.005
papers+0.005
 accuracy+0.005
":{"+0.004
Neg logit contributions
uca-0.007
iblical-0.007
leneck-0.007
 Painter-0.007
esan-0.006
Tokenpp
Token ID381
Feature activation+1.33e-01

Pos logit contributions
EMS+0.070
 Mehran+0.062
papers+0.059
 accuracy+0.058
":{"+0.056
Neg logit contributions
uca-0.071
iblical-0.071
leneck-0.068
 Painter-0.067
esan-0.063
Tokenä
Token ID11033
Feature activation+4.58e-02

Pos logit contributions
iblical+0.091
uca+0.091
leneck+0.088
 Painter+0.087
esan+0.081
Neg logit contributions
EMS-0.099
 Mehran-0.088
papers-0.084
 accuracy-0.083
":{"-0.081
Tokenous
Token ID516
Feature activation-5.64e-02

Pos logit contributions
EMS+0.056
 Mehran+0.050
papers+0.047
 accuracy+0.047
":{"+0.045
Neg logit contributions
iblical-0.059
uca-0.059
leneck-0.057
 Painter-0.057
esan-0.053
Token,
Token ID11
Feature activation-1.37e-02

Pos logit contributions
EMS+0.036
 Mehran+0.031
papers+0.030
 accuracy+0.029
":{"+0.028
Neg logit contributions
iblical-0.045
uca-0.045
leneck-0.043
 Painter-0.043
esan-0.041
Token le
Token ID443
Feature activation-5.40e-02

Pos logit contributions
EMS+0.009
 Mehran+0.008
papers+0.007
 accuracy+0.007
 Stability+0.007
Neg logit contributions
iblical-0.011
uca-0.011
leneck-0.011
 Painter-0.010
esan-0.010
Token S
Token ID311
Feature activation-1.04e-01

Pos logit contributions
EMS+0.036
 Mehran+0.032
papers+0.030
 accuracy+0.030
":{"+0.029
Neg logit contributions
iblical-0.041
uca-0.041
leneck-0.040
 Painter-0.039
esan-0.037
Tokené
Token ID2634
Feature activation-1.46e-01

Pos logit contributions
EMS+0.071
 Mehran+0.062
papers+0.059
 accuracy+0.058
":{"+0.056
Neg logit contributions
iblical-0.078
uca-0.078
leneck-0.076
 Painter-0.075
esan-0.070
Tokennat
Token ID32353
Feature activation-4.07e-01

Pos logit contributions
EMS+0.119
 Mehran+0.107
papers+0.102
 accuracy+0.101
":{"+0.098
Neg logit contributions
iblical-0.091
uca-0.091
leneck-0.087
 Painter-0.086
esan-0.080
Token Imp
Token ID9855
Feature activation-7.84e-02

Pos logit contributions
EMS+0.253
 Mehran+0.220
papers+0.206
 accuracy+0.203
":{"+0.195
Neg logit contributions
iblical-0.332
uca-0.331
leneck-0.321
 Painter-0.318
esan-0.300
Tokené
Token ID2634
Feature activation-1.74e-01

Pos logit contributions
EMS+0.044
 Mehran+0.038
papers+0.036
 accuracy+0.035
 Stability+0.033
Neg logit contributions
uca-0.068
iblical-0.068
leneck-0.066
 Painter-0.066
esan-0.063
Tokenrial
Token ID4454
Feature activation-7.22e-02

Pos logit contributions
EMS+0.086
 Mehran+0.071
papers+0.065
 accuracy+0.064
 Stability+0.060
Neg logit contributions
iblical-0.166
uca-0.165
leneck-0.162
 Painter-0.159
esan-0.152
Token,
Token ID11
Feature activation+1.75e-02

Pos logit contributions
EMS+0.047
 Mehran+0.041
papers+0.038
 accuracy+0.038
":{"+0.036
Neg logit contributions
iblical-0.057
uca-0.057
leneck-0.055
 Painter-0.055
esan-0.052
Token le
Token ID443
Feature activation-3.45e-02

Pos logit contributions
iblical+0.014
uca+0.014
leneck+0.014
 Painter+0.014
esan+0.013
Neg logit contributions
EMS-0.011
 Mehran-0.010
papers-0.009
 accuracy-0.009
":{"-0.008
Token à
Token ID28141
Feature activation-3.57e-02

Pos logit contributions
EMS+0.012
 Mehran+0.011
papers+0.010
 accuracy+0.010
 Stability+0.010
Neg logit contributions
iblical-0.013
uca-0.013
leneck-0.013
 Painter-0.013
esan-0.012
Token cher
Token ID21382
Feature activation+7.29e-02

Pos logit contributions
EMS+0.023
 Mehran+0.020
papers+0.019
 accuracy+0.019
":{"+0.018
Neg logit contributions
iblical-0.028
uca-0.028
leneck-0.027
 Painter-0.027
esan-0.025
Tokencher
Token ID2044
Feature activation+2.19e-02

Pos logit contributions
iblical+0.039
uca+0.039
leneck+0.037
 Painter+0.036
esan+0.034
Neg logit contributions
EMS-0.066
 Mehran-0.060
papers-0.057
 accuracy-0.057
 Stability-0.055
Token le
Token ID443
Feature activation-5.29e-02

Pos logit contributions
iblical+0.017
uca+0.017
leneck+0.016
 Painter+0.016
esan+0.015
Neg logit contributions
EMS-0.015
 Mehran-0.013
papers-0.012
 accuracy-0.012
 Stability-0.012
Token conf
Token ID1013
Feature activation-4.19e-02

Pos logit contributions
EMS+0.035
 Mehran+0.031
papers+0.029
 accuracy+0.029
":{"+0.028
Neg logit contributions
iblical-0.041
uca-0.041
leneck-0.039
 Painter-0.039
esan-0.037
Tokenlit
Token ID18250
Feature activation-4.01e-01

Pos logit contributions
EMS+0.030
 Mehran+0.026
papers+0.025
 accuracy+0.025
":{"+0.024
Neg logit contributions
iblical-0.031
uca-0.031
leneck-0.030
 Painter-0.029
esan-0.027
Token a
Token ID257
Feature activation+1.36e-02

Pos logit contributions
EMS+0.259
 Mehran+0.227
papers+0.213
 accuracy+0.210
":{"+0.201
Neg logit contributions
iblical-0.318
uca-0.317
leneck-0.307
 Painter-0.304
esan-0.287
Tokenvec
Token ID35138
Feature activation+1.11e-01

Pos logit contributions
iblical+0.009
uca+0.009
leneck+0.009
 Painter+0.009
esan+0.008
Neg logit contributions
EMS-0.011
 Mehran-0.009
papers-0.009
 accuracy-0.009
":{"-0.009
Token nos
Token ID43630
Feature activation+3.22e-02

Pos logit contributions
iblical+0.088
uca+0.088
leneck+0.085
 Painter+0.085
esan+0.079
Neg logit contributions
EMS-0.072
 Mehran-0.062
papers-0.060
 accuracy-0.057
":{"-0.056
Token an
Token ID281
Feature activation-1.94e-02

Pos logit contributions
iblical+0.025
uca+0.025
leneck+0.024
 Painter+0.024
esan+0.023
Neg logit contributions
EMS-0.021
 Mehran-0.018
papers-0.017
 accuracy-0.017
":{"-0.016
Tokenci
Token ID979
Feature activation-2.32e-01

Pos logit contributions
EMS+0.012
 Mehran+0.011
papers+0.010
 accuracy+0.010
":{"+0.009
Neg logit contributions
iblical-0.016
uca-0.016
leneck-0.015
 Painter-0.015
esan-0.014
Token they
Token ID484
Feature activation+2.51e-02

Pos logit contributions
iblical+0.006
uca+0.006
leneck+0.005
 Painter+0.005
esan+0.005
Neg logit contributions
EMS-0.006
 Mehran-0.005
papers-0.005
 accuracy-0.005
":{"-0.005
Token have
Token ID423
Feature activation+2.42e-02

Pos logit contributions
iblical+0.015
uca+0.015
leneck+0.015
 Painter+0.015
esan+0.014
Neg logit contributions
EMS-0.021
 Mehran-0.019
papers-0.018
 accuracy-0.018
 Stability-0.017
Token a
Token ID257
Feature activation-1.77e-02

Pos logit contributions
iblical+0.017
uca+0.017
leneck+0.016
 Painter+0.016
esan+0.015
Neg logit contributions
EMS-0.018
 Mehran-0.016
papers-0.015
 accuracy-0.015
":{"-0.014
Token delicate
Token ID19217
Feature activation-2.13e-02

Pos logit contributions
EMS+0.013
 Mehran+0.011
papers+0.011
 accuracy+0.010
":{"+0.010
Neg logit contributions
iblical-0.013
uca-0.013
leneck-0.012
 Painter-0.012
esan-0.011
Token sm
Token ID895
Feature activation+1.04e-01

Pos logit contributions
EMS+0.015
 Mehran+0.014
papers+0.013
 accuracy+0.013
":{"+0.012
Neg logit contributions
iblical-0.015
uca-0.015
leneck-0.015
 Painter-0.015
esan-0.014
Tokenattering
Token ID16475
Feature activation-3.95e-01

Pos logit contributions
iblical+0.066
uca+0.066
leneck+0.063
 Painter+0.062
esan+0.058
Neg logit contributions
EMS-0.084
 Mehran-0.075
papers-0.071
 accuracy-0.071
":{"-0.069
Token of
Token ID286
Feature activation+2.64e-04

Pos logit contributions
EMS+0.273
 Mehran+0.240
papers+0.226
 accuracy+0.224
":{"+0.216
Neg logit contributions
iblical-0.296
uca-0.295
leneck-0.285
 Painter-0.282
esan-0.265
Token light
Token ID1657
Feature activation-5.41e-02

Pos logit contributions
iblical+0.000
uca+0.000
leneck+0.000
 Painter+0.000
esan+0.000
Neg logit contributions
EMS-0.000
 Mehran-0.000
papers-0.000
 accuracy-0.000
":{"-0.000
Token brown
Token ID7586
Feature activation-5.21e-02

Pos logit contributions
EMS+0.039
 Mehran+0.034
papers+0.032
 accuracy+0.032
":{"+0.031
Neg logit contributions
iblical-0.039
uca-0.039
leneck-0.038
 Painter-0.037
esan-0.035
Token "
Token ID366
Feature activation+2.09e-02

Pos logit contributions
EMS+0.040
 Mehran+0.036
papers+0.034
 accuracy+0.033
":{"+0.032
Neg logit contributions
iblical-0.035
uca-0.035
leneck-0.034
 Painter-0.033
esan-0.031
Tokenf
Token ID69
Feature activation+9.07e-02

Pos logit contributions
iblical+0.014
uca+0.014
leneck+0.014
 Painter+0.014
esan+0.013
Neg logit contributions
EMS-0.016
 Mehran-0.014
papers-0.013
 accuracy-0.013
":{"-0.013
Token if
Token ID611
Feature activation+1.25e-01

Pos logit contributions
iblical+0.010
uca+0.010
leneck+0.010
 Painter+0.010
esan+0.009
Neg logit contributions
EMS-0.011
 Mehran-0.010
papers-0.009
 accuracy-0.009
":{"-0.009
Token ($
Token ID7198
Feature activation-2.21e-02

Pos logit contributions
iblical+0.104
uca+0.104
leneck+0.101
 Painter+0.100
esan+0.095
Neg logit contributions
EMS-0.076
 Mehran-0.066
papers-0.061
 accuracy-0.060
":{"-0.058
TokenUse
Token ID11041
Feature activation+9.01e-02

Pos logit contributions
EMS+0.016
 Mehran+0.014
papers+0.013
 accuracy+0.013
":{"+0.013
Neg logit contributions
iblical-0.016
uca-0.016
leneck-0.015
 Painter-0.015
esan-0.014
TokenCurrent
Token ID11297
Feature activation+1.12e-01

Pos logit contributions
iblical+0.062
uca+0.062
leneck+0.060
 Painter+0.059
esan+0.055
Neg logit contributions
EMS-0.068
 Mehran-0.060
papers-0.057
 accuracy-0.057
":{"-0.055
TokenWall
Token ID22401
Feature activation+5.42e-02

Pos logit contributions
iblical+0.077
uca+0.077
leneck+0.074
 Painter+0.074
esan+0.069
Neg logit contributions
EMS-0.083
 Mehran-0.074
papers-0.070
 accuracy-0.070
":{"-0.067
Tokenpaper
Token ID20189
Feature activation-3.95e-01

Pos logit contributions
iblical+0.058
uca+0.058
leneck+0.057
 Painter+0.056
esan+0.054
Neg logit contributions
EMS-0.020
 Mehran-0.016
 accuracy-0.013
papers-0.013
":{"-0.012
TokenAs
Token ID1722
Feature activation+8.33e-02

Pos logit contributions
EMS+0.301
 Mehran+0.268
papers+0.254
 accuracy+0.253
":{"+0.244
Neg logit contributions
iblical-0.267
uca-0.265
leneck-0.256
 Painter-0.253
esan-0.236
TokenSource
Token ID7416
Feature activation-2.66e-02

Pos logit contributions
iblical+0.056
uca+0.055
leneck+0.054
 Painter+0.053
esan+0.050
Neg logit contributions
EMS-0.064
 Mehran-0.057
papers-0.054
 accuracy-0.054
 Stability-0.052
Token)
Token ID8
Feature activation+1.15e-01

Pos logit contributions
EMS+0.021
 Mehran+0.018
papers+0.017
 accuracy+0.017
":{"+0.017
Neg logit contributions
iblical-0.018
uca-0.018
leneck-0.017
 Painter-0.017
esan-0.016
Token {
Token ID1391
Feature activation-3.78e-02

Pos logit contributions
iblical+0.089
uca+0.089
leneck+0.086
 Painter+0.085
esan+0.080
Neg logit contributions
EMS-0.076
 Mehran-0.066
papers-0.062
 accuracy-0.062
":{"-0.059
Token $
Token ID720
Feature activation-6.20e-02

Pos logit contributions
EMS+0.026
 Mehran+0.023
papers+0.022
 accuracy+0.021
":{"+0.021
Neg logit contributions
iblical-0.028
uca-0.028
leneck-0.027
 Painter-0.027
esan-0.025
Token drilling
Token ID19747
Feature activation-7.63e-02

Pos logit contributions
iblical+0.013
uca+0.013
leneck+0.013
 Painter+0.013
esan+0.012
Neg logit contributions
EMS-0.014
 Mehran-0.012
papers-0.012
 accuracy-0.011
":{"-0.011
Token plans
Token ID3352
Feature activation-4.59e-02

Pos logit contributions
EMS+0.053
 Mehran+0.047
papers+0.044
 accuracy+0.044
":{"+0.042
Neg logit contributions
iblical-0.057
uca-0.056
leneck-0.055
 Painter-0.054
esan-0.051
Token.
Token ID13
Feature activation-5.78e-02

Pos logit contributions
EMS+0.032
 Mehran+0.028
papers+0.027
 accuracy+0.026
":{"+0.025
Neg logit contributions
iblical-0.034
uca-0.034
leneck-0.033
 Painter-0.032
esan-0.030
Token<|endoftext|>
Token ID50256
Feature activation+8.42e-03

Pos logit contributions
EMS+0.045
 Mehran+0.040
papers+0.038
 accuracy+0.038
":{"+0.037
Neg logit contributions
iblical-0.038
uca-0.038
leneck-0.037
 Painter-0.036
esan-0.034
TokenGood
Token ID10248
Feature activation-2.10e-02

Pos logit contributions
iblical+0.006
uca+0.006
leneck+0.006
 Painter+0.005
esan+0.005
Neg logit contributions
EMS-0.006
 Mehran-0.006
papers-0.005
 accuracy-0.005
 Stability-0.005
Tokenbye
Token ID16390
Feature activation-3.94e-01

Pos logit contributions
EMS+0.015
 Mehran+0.013
papers+0.012
 accuracy+0.012
":{"+0.012
Neg logit contributions
iblical-0.016
uca-0.016
leneck-0.015
 Painter-0.015
esan-0.014
Token,
Token ID11
Feature activation-2.71e-02

Pos logit contributions
EMS+0.279
 Mehran+0.246
papers+0.232
 accuracy+0.230
":{"+0.222
Neg logit contributions
iblical-0.288
uca-0.287
leneck-0.277
 Painter-0.274
esan-0.257
Token Ghosts
Token ID38389
Feature activation+6.97e-03

Pos logit contributions
EMS+0.019
 Mehran+0.017
papers+0.016
 accuracy+0.016
":{"+0.015
Neg logit contributions
iblical-0.020
uca-0.020
leneck-0.019
 Painter-0.019
esan-0.018
Token!
Token ID0
Feature activation+3.44e-03

Pos logit contributions
iblical+0.005
uca+0.005
leneck+0.005
 Painter+0.005
esan+0.005
Neg logit contributions
EMS-0.005
 Mehran-0.004
papers-0.004
 accuracy-0.004
":{"-0.004
Token is
Token ID318
Feature activation-5.31e-02

Pos logit contributions
iblical+0.002
uca+0.002
leneck+0.002
 Painter+0.002
esan+0.002
Neg logit contributions
EMS-0.003
 Mehran-0.002
papers-0.002
 accuracy-0.002
":{"-0.002
Token an
Token ID281
Feature activation-2.71e-02

Pos logit contributions
EMS+0.036
 Mehran+0.031
papers+0.029
 accuracy+0.029
":{"+0.028
Neg logit contributions
iblical-0.041
uca-0.041
leneck-0.040
 Painter-0.039
esan-0.037
Token adopting
Token ID22868
Feature activation-5.26e-02

Pos logit contributions
EMS+0.011
 Mehran+0.009
papers+0.009
 accuracy+0.009
":{"+0.008
Neg logit contributions
iblical-0.010
uca-0.010
leneck-0.010
 Painter-0.010
esan-0.009
Token a
Token ID257
Feature activation-2.17e-02

Pos logit contributions
EMS+0.036
 Mehran+0.031
papers+0.029
 accuracy+0.029
":{"+0.028
Neg logit contributions
iblical-0.040
uca-0.040
leneck-0.039
 Painter-0.039
esan-0.036
Token neo
Token ID19102
Feature activation-9.47e-03

Pos logit contributions
EMS+0.015
 Mehran+0.013
papers+0.013
 accuracy+0.012
":{"+0.012
Neg logit contributions
iblical-0.016
uca-0.016
leneck-0.015
 Painter-0.015
esan-0.014
Token-
Token ID12
Feature activation-2.23e-02

Pos logit contributions
EMS+0.007
 Mehran+0.006
papers+0.006
 accuracy+0.006
":{"+0.006
Neg logit contributions
iblical-0.007
uca-0.007
leneck-0.006
 Painter-0.006
esan-0.006
TokenA
Token ID32
Feature activation-4.59e-02

Pos logit contributions
EMS+0.016
 Mehran+0.014
papers+0.013
 accuracy+0.013
 Stability+0.013
Neg logit contributions
iblical-0.016
uca-0.016
leneck-0.016
 Painter-0.015
esan-0.014
Tokenrist
Token ID1585
Feature activation-3.89e-01

Pos logit contributions
EMS+0.015
 Mehran+0.012
papers+0.010
 accuracy+0.010
 Stability+0.008
Neg logit contributions
iblical-0.051
uca-0.051
leneck-0.050
 Painter-0.049
esan-0.048
Tokenot
Token ID313
Feature activation-3.91e-02

Pos logit contributions
EMS+0.282
 Mehran+0.250
papers+0.236
 accuracy+0.234
":{"+0.226
Neg logit contributions
iblical-0.279
uca-0.278
leneck-0.269
 Painter-0.265
esan-0.249
Tokenel
Token ID417
Feature activation-1.15e-01

Pos logit contributions
EMS+0.021
 Mehran+0.018
papers+0.016
 accuracy+0.016
":{"+0.015
Neg logit contributions
iblical-0.035
uca-0.035
leneck-0.034
 Painter-0.034
esan-0.032
Tokenian
Token ID666
Feature activation-1.68e-01

Pos logit contributions
EMS+0.051
 Mehran+0.041
papers+0.038
 accuracy+0.037
":{"+0.035
Neg logit contributions
iblical-0.113
uca-0.113
leneck-0.111
 Painter-0.109
esan-0.104
Token approach
Token ID3164
Feature activation-6.78e-02

Pos logit contributions
EMS+0.116
 Mehran+0.102
papers+0.096
 accuracy+0.095
":{"+0.092
Neg logit contributions
iblical-0.126
uca-0.125
leneck-0.121
 Painter-0.120
esan-0.113
Token which
Token ID543
Feature activation+3.86e-02

Pos logit contributions
EMS+0.046
 Mehran+0.041
papers+0.038
 accuracy+0.038
":{"+0.037
Neg logit contributions
iblical-0.051
uca-0.051
leneck-0.050
 Painter-0.049
esan-0.046
Token Cry
Token ID8152
Feature activation-1.07e-01

Pos logit contributions
iblical+0.028
uca+0.028
leneck+0.027
 Painter+0.027
esan+0.025
Neg logit contributions
EMS-0.029
 Mehran-0.025
papers-0.024
 accuracy-0.024
":{"-0.023
Tokenom
Token ID296
Feature activation+2.94e-01

Pos logit contributions
EMS+0.050
 Mehran+0.041
papers+0.037
 accuracy+0.036
":{"+0.034
Neg logit contributions
uca-0.104
iblical-0.104
leneck-0.101
 Painter-0.101
esan-0.096
Tokenancer
Token ID8250
Feature activation+8.47e-02

Pos logit contributions
iblical+0.155
uca+0.154
leneck+0.146
 Painter+0.145
esan+0.134
Neg logit contributions
EMS-0.267
 Mehran-0.245
papers-0.234
 accuracy-0.232
 Stability-0.225
Token vs
Token ID3691
Feature activation-1.87e-01

Pos logit contributions
iblical+0.070
uca+0.070
leneck+0.068
 Painter+0.068
esan+0.064
Neg logit contributions
EMS-0.051
 Mehran-0.044
papers-0.041
 accuracy-0.041
":{"-0.039
Token Chaos
Token ID13903
Feature activation+8.19e-02

Pos logit contributions
EMS+0.134
 Mehran+0.118
papers+0.112
 accuracy+0.111
":{"+0.107
Neg logit contributions
iblical-0.135
uca-0.135
leneck-0.130
 Painter-0.129
esan-0.121
Token Charm
Token ID30225
Feature activation-3.89e-01

Pos logit contributions
iblical+0.060
uca+0.059
leneck+0.057
 Painter+0.057
esan+0.053
Neg logit contributions
EMS-0.058
 Mehran-0.051
papers-0.049
 accuracy-0.048
":{"-0.046
Token\n
Token ID198
Feature activation-1.07e-01

Pos logit contributions
EMS+0.295
 Mehran+0.263
papers+0.250
 accuracy+0.249
":{"+0.241
Neg logit contributions
iblical-0.265
uca-0.263
leneck-0.254
 Painter-0.249
esan-0.233
Token\n
Token ID198
Feature activation-9.07e-02

Pos logit contributions
EMS+0.081
 Mehran+0.072
papers+0.069
 accuracy+0.068
":{"+0.067
Neg logit contributions
iblical-0.072
uca-0.072
leneck-0.069
 Painter-0.068
esan-0.064
TokenSc
Token ID3351
Feature activation-1.38e-01

Pos logit contributions
EMS+0.066
 Mehran+0.058
papers+0.055
 accuracy+0.055
":{"+0.053
Neg logit contributions
iblical-0.065
uca-0.064
leneck-0.062
 Painter-0.061
esan-0.058
Tokenourge
Token ID14501
Feature activation-1.18e-01

Pos logit contributions
EMS+0.102
 Mehran+0.091
papers+0.086
 accuracy+0.086
 Stability+0.082
Neg logit contributions
iblical-0.096
uca-0.095
leneck-0.092
 Painter-0.091
esan-0.085
Token of
Token ID286
Feature activation-5.33e-02

Pos logit contributions
EMS+0.083
 Mehran+0.073
papers+0.069
 accuracy+0.068
":{"+0.066
Neg logit contributions
iblical-0.088
uca-0.088
leneck-0.085
 Painter-0.084
esan-0.079
Token that
Token ID326
Feature activation-3.23e-02

Pos logit contributions
iblical+0.019
uca+0.019
 Painter+0.019
leneck+0.019
esan+0.017
Neg logit contributions
EMS-0.019
 Mehran-0.017
papers-0.016
 accuracy-0.015
":{"-0.015
Token aren
Token ID3588
Feature activation-9.30e-02

Pos logit contributions
EMS+0.023
 Mehran+0.020
papers+0.019
 accuracy+0.018
":{"+0.018
Neg logit contributions
iblical-0.024
uca-0.024
leneck-0.023
 Painter-0.023
esan-0.021
Token't
Token ID470
Feature activation+2.24e-02

Pos logit contributions
EMS+0.068
 Mehran+0.060
papers+0.057
 accuracy+0.056
":{"+0.055
Neg logit contributions
iblical-0.066
uca-0.066
leneck-0.064
 Painter-0.063
esan-0.059
Token comm
Token ID725
Feature activation+2.04e-02

Pos logit contributions
iblical+0.016
uca+0.016
leneck+0.016
 Painter+0.016
esan+0.015
Neg logit contributions
EMS-0.016
 Mehran-0.014
papers-0.013
 accuracy-0.013
":{"-0.013
Tokenens
Token ID641
Feature activation-3.91e-02

Pos logit contributions
iblical+0.016
uca+0.016
leneck+0.016
 Painter+0.015
esan+0.015
Neg logit contributions
EMS-0.013
 Mehran-0.012
papers-0.011
 accuracy-0.011
":{"-0.010
Tokenurate
Token ID15537
Feature activation-3.85e-01

Pos logit contributions
EMS+0.013
 Mehran+0.010
papers+0.008
 accuracy+0.008
 Stability+0.007
Neg logit contributions
iblical-0.044
uca-0.043
leneck-0.043
 Painter-0.042
esan-0.041
Token with
Token ID351
Feature activation-1.29e-03

Pos logit contributions
EMS+0.269
 Mehran+0.236
papers+0.223
 accuracy+0.221
":{"+0.213
Neg logit contributions
iblical-0.286
uca-0.285
leneck-0.276
 Painter-0.272
esan-0.256
Token their
Token ID511
Feature activation+2.90e-02

Pos logit contributions
EMS+0.001
 Mehran+0.001
papers+0.001
 accuracy+0.001
":{"+0.001
Neg logit contributions
iblical-0.001
uca-0.001
leneck-0.001
 Painter-0.001
esan-0.001
Token past
Token ID1613
Feature activation-1.12e-01

Pos logit contributions
iblical+0.022
uca+0.022
leneck+0.021
 Painter+0.021
esan+0.020
Neg logit contributions
EMS-0.020
 Mehran-0.018
papers-0.017
 accuracy-0.016
":{"-0.016
Token performances
Token ID13289
Feature activation+1.04e-02

Pos logit contributions
EMS+0.061
 Mehran+0.052
papers+0.048
 accuracy+0.047
":{"+0.045
Neg logit contributions
iblical-0.099
uca-0.099
leneck-0.096
 Painter-0.095
esan-0.091
Token or
Token ID393
Feature activation-3.63e-03

Pos logit contributions
iblical+0.008
uca+0.008
leneck+0.008
 Painter+0.008
esan+0.008
Neg logit contributions
EMS-0.007
 Mehran-0.006
papers-0.005
 accuracy-0.005
 Stability-0.005
Token have
Token ID423
Feature activation-1.80e-02

Pos logit contributions
EMS+0.041
 Mehran+0.037
papers+0.035
 accuracy+0.035
":{"+0.034
Neg logit contributions
iblical-0.032
uca-0.032
leneck-0.030
 Painter-0.030
esan-0.028
Token sn
Token ID3013
Feature activation-1.09e-01

Pos logit contributions
EMS+0.013
 Mehran+0.012
papers+0.011
 accuracy+0.011
":{"+0.010
Neg logit contributions
iblical-0.013
uca-0.013
leneck-0.012
 Painter-0.012
esan-0.011
Tokenagged
Token ID14655
Feature activation-6.74e-02

Pos logit contributions
EMS+0.086
 Mehran+0.077
papers+0.073
 accuracy+0.073
 Stability+0.071
Neg logit contributions
iblical-0.070
uca-0.070
leneck-0.067
 Painter-0.066
esan-0.062
Token Tay
Token ID25569
Feature activation-1.24e-01

Pos logit contributions
EMS+0.046
 Mehran+0.040
papers+0.038
 accuracy+0.037
":{"+0.036
Neg logit contributions
iblical-0.051
uca-0.051
leneck-0.049
 Painter-0.049
esan-0.046
Token Z
Token ID1168
Feature activation-1.15e-02

Pos logit contributions
EMS+0.088
 Mehran+0.078
papers+0.074
 accuracy+0.073
":{"+0.070
Neg logit contributions
uca-0.090
iblical-0.090
leneck-0.087
 Painter-0.086
esan-0.081
Tokenonday
Token ID3204
Feature activation-3.85e-01

Pos logit contributions
EMS+0.008
 Mehran+0.008
papers+0.007
 accuracy+0.007
 Stability+0.007
Neg logit contributions
uca-0.008
iblical-0.008
leneck-0.008
 Painter-0.008
esan-0.007
Token of
Token ID286
Feature activation-4.07e-02

Pos logit contributions
EMS+0.272
 Mehran+0.239
papers+0.226
 accuracy+0.224
":{"+0.217
Neg logit contributions
iblical-0.282
uca-0.281
leneck-0.272
 Painter-0.268
esan-0.252
Token Youtube
Token ID27431
Feature activation-1.77e-01

Pos logit contributions
EMS+0.029
 Mehran+0.025
papers+0.024
 accuracy+0.024
":{"+0.023
Neg logit contributions
iblical-0.030
uca-0.030
leneck-0.029
 Painter-0.029
esan-0.027
Token fame
Token ID16117
Feature activation+2.19e-02

Pos logit contributions
EMS+0.122
 Mehran+0.108
papers+0.102
 accuracy+0.101
":{"+0.097
Neg logit contributions
iblical-0.132
uca-0.132
leneck-0.128
 Painter-0.126
esan-0.119
Token to
Token ID284
Feature activation-2.19e-02

Pos logit contributions
iblical+0.016
uca+0.016
leneck+0.016
 Painter+0.015
esan+0.015
Neg logit contributions
EMS-0.015
 Mehran-0.013
papers-0.013
 accuracy-0.013
":{"-0.012
Token narr
Token ID6664
Feature activation+2.61e-02

Pos logit contributions
EMS+0.015
 Mehran+0.013
papers+0.013
 accuracy+0.012
":{"+0.012
Neg logit contributions
iblical-0.016
uca-0.016
leneck-0.016
 Painter-0.016
esan-0.015
Token year
Token ID614
Feature activation-6.06e-02

Pos logit contributions
EMS+0.002
 Mehran+0.002
papers+0.002
 accuracy+0.002
":{"+0.002
Neg logit contributions
iblical-0.003
uca-0.003
leneck-0.003
 Painter-0.003
esan-0.003
Token's
Token ID338
Feature activation-1.87e-02

Pos logit contributions
EMS+0.049
 Mehran+0.044
papers+0.042
 accuracy+0.042
":{"+0.040
Neg logit contributions
iblical-0.038
uca-0.038
leneck-0.037
 Painter-0.036
esan-0.034
Token election
Token ID3071
Feature activation-1.31e-01

Pos logit contributions
EMS+0.013
 Mehran+0.011
papers+0.011
 accuracy+0.010
":{"+0.010
Neg logit contributions
iblical-0.014
uca-0.014
leneck-0.013
 Painter-0.013
esan-0.013
Token.
Token ID13
Feature activation-6.10e-02

Pos logit contributions
EMS+0.084
 Mehran+0.073
papers+0.068
 accuracy+0.067
":{"+0.065
Neg logit contributions
iblical-0.105
uca-0.105
leneck-0.102
 Painter-0.101
esan-0.096
Token Ronnie
Token ID40892
Feature activation-1.72e-01

Pos logit contributions
EMS+0.047
 Mehran+0.042
papers+0.040
 accuracy+0.039
":{"+0.038
Neg logit contributions
iblical-0.041
uca-0.041
leneck-0.039
 Painter-0.039
esan-0.036
Token Dug
Token ID27436
Feature activation-3.84e-01

Pos logit contributions
EMS+0.113
 Mehran+0.099
papers+0.093
 accuracy+0.092
":{"+0.089
Neg logit contributions
iblical-0.133
uca-0.133
leneck-0.129
 Painter-0.128
esan-0.120
Tokenger
Token ID1362
Feature activation-6.48e-02

Pos logit contributions
EMS+0.300
 Mehran+0.267
papers+0.254
 accuracy+0.252
 Stability+0.244
Neg logit contributions
iblical-0.254
uca-0.254
leneck-0.244
 Painter-0.240
esan-0.225
Token wrote
Token ID2630
Feature activation+7.32e-02

Pos logit contributions
EMS+0.047
 Mehran+0.041
papers+0.039
 accuracy+0.039
":{"+0.037
Neg logit contributions
iblical-0.046
uca-0.046
leneck-0.045
 Painter-0.044
esan-0.041
Token the
Token ID262
Feature activation-5.20e-02

Pos logit contributions
iblical+0.052
uca+0.051
leneck+0.049
 Painter+0.049
esan+0.046
Neg logit contributions
EMS-0.054
 Mehran-0.048
papers-0.045
 accuracy-0.044
":{"-0.043
Token definitive
Token ID17347
Feature activation-1.01e-01

Pos logit contributions
EMS+0.037
 Mehran+0.033
papers+0.031
 accuracy+0.030
":{"+0.030
Neg logit contributions
iblical-0.038
uca-0.038
leneck-0.036
 Painter-0.036
esan-0.034
Token warning
Token ID6509
Feature activation-1.05e-01

Pos logit contributions
EMS+0.067
 Mehran+0.058
papers+0.055
 accuracy+0.054
":{"+0.052
Neg logit contributions
iblical-0.079
uca-0.079
 Painter-0.076
leneck-0.076
esan-0.071
Token\n
Token ID198
Feature activation+2.70e-02

Pos logit contributions
iblical+0.024
uca+0.024
leneck+0.023
 Painter+0.023
esan+0.021
Neg logit contributions
EMS-0.027
 Mehran-0.024
papers-0.023
 accuracy-0.023
":{"-0.022
TokenThe
Token ID464
Feature activation+2.09e-02

Pos logit contributions
iblical+0.019
uca+0.019
leneck+0.019
 Painter+0.018
esan+0.017
Neg logit contributions
EMS-0.020
 Mehran-0.017
papers-0.016
 accuracy-0.016
":{"-0.016
Token first
Token ID717
Feature activation+4.12e-02

Pos logit contributions
iblical+0.015
uca+0.015
leneck+0.015
 Painter+0.014
esan+0.014
Neg logit contributions
EMS-0.015
 Mehran-0.013
papers-0.012
 accuracy-0.012
":{"-0.012
Token lady
Token ID10846
Feature activation-9.22e-02

Pos logit contributions
iblical+0.036
uca+0.036
leneck+0.035
 Painter+0.035
esan+0.033
Neg logit contributions
EMS-0.023
 Mehran-0.020
papers-0.018
 accuracy-0.018
":{"-0.017
Token's
Token ID338
Feature activation+8.61e-02

Pos logit contributions
EMS+0.072
 Mehran+0.064
papers+0.061
 accuracy+0.061
":{"+0.059
Neg logit contributions
iblical-0.060
uca-0.060
leneck-0.058
 Painter-0.058
esan-0.054
Token outgoing
Token ID28181
Feature activation+2.86e-01

Pos logit contributions
iblical+0.062
uca+0.061
leneck+0.059
 Painter+0.059
esan+0.055
Neg logit contributions
EMS-0.062
 Mehran-0.055
papers-0.052
 accuracy-0.051
":{"-0.050
Token public
Token ID1171
Feature activation+3.84e-03

Pos logit contributions
iblical+0.204
uca+0.203
leneck+0.196
 Painter+0.195
esan+0.182
Neg logit contributions
EMS-0.208
 Mehran-0.184
papers-0.174
 accuracy-0.171
":{"-0.167
Token hug
Token ID16225
Feature activation+6.49e-02

Pos logit contributions
iblical+0.003
uca+0.003
leneck+0.003
 Painter+0.003
esan+0.003
Neg logit contributions
EMS-0.003
 Mehran-0.002
papers-0.002
 accuracy-0.002
":{"-0.002
Token went
Token ID1816
Feature activation+1.76e-02

Pos logit contributions
iblical+0.046
uca+0.046
leneck+0.045
 Painter+0.044
esan+0.041
Neg logit contributions
EMS-0.047
 Mehran-0.041
papers-0.039
 accuracy-0.039
":{"-0.038
Token to
Token ID284
Feature activation-7.64e-04

Pos logit contributions
iblical+0.013
uca+0.013
leneck+0.013
 Painter+0.013
esan+0.012
Neg logit contributions
EMS-0.012
 Mehran-0.010
papers-0.010
 accuracy-0.010
":{"-0.009
Token School
Token ID3961
Feature activation-4.03e-02

Pos logit contributions
EMS+0.001
 Mehran+0.000
papers+0.000
 accuracy+0.000
":{"+0.000
Neg logit contributions
iblical-0.001
uca-0.001
leneck-0.001
 Painter-0.001
esan-0.001
Token South
Token ID2520
Feature activation-6.09e-02

Pos logit contributions
EMS+0.035
 Mehran+0.031
papers+0.030
 accuracy+0.030
":{"+0.029
Neg logit contributions
iblical-0.027
uca-0.027
leneck-0.026
 Painter-0.026
esan-0.024
Token Australia
Token ID4505
Feature activation+8.66e-02

Pos logit contributions
EMS+0.046
 Mehran+0.041
papers+0.039
 accuracy+0.038
":{"+0.037
Neg logit contributions
iblical-0.042
uca-0.042
leneck-0.040
 Painter-0.040
esan-0.037
Token,
Token ID11
Feature activation+2.68e-02

Pos logit contributions
iblical+0.068
uca+0.068
leneck+0.066
 Painter+0.065
esan+0.061
Neg logit contributions
EMS-0.057
 Mehran-0.049
papers-0.046
 accuracy-0.046
":{"-0.044
Token an
Token ID281
Feature activation+5.83e-03

Pos logit contributions
iblical+0.019
uca+0.019
leneck+0.019
 Painter+0.018
esan+0.017
Neg logit contributions
EMS-0.019
 Mehran-0.017
papers-0.016
 accuracy-0.016
":{"-0.015
Token Emer
Token ID10320
Feature activation+1.88e-01

Pos logit contributions
iblical+0.004
uca+0.004
leneck+0.004
 Painter+0.004
esan+0.004
Neg logit contributions
EMS-0.004
 Mehran-0.003
papers-0.003
 accuracy-0.003
":{"-0.003
Tokenitus
Token ID17506
Feature activation+3.14e-01

Pos logit contributions
iblical+0.051
uca+0.051
leneck+0.046
 Painter+0.045
esan+0.036
Neg logit contributions
EMS-0.221
 Mehran-0.204
papers-0.197
 accuracy-0.197
":{"-0.193
Token Professor
Token ID8129
Feature activation+7.12e-02

Pos logit contributions
iblical+0.222
uca+0.221
leneck+0.214
 Painter+0.212
esan+0.198
Neg logit contributions
EMS-0.230
 Mehran-0.204
papers-0.193
 accuracy-0.192
":{"-0.185
Token of
Token ID286
Feature activation-4.13e-02

Pos logit contributions
iblical+0.054
uca+0.054
leneck+0.052
 Painter+0.051
esan+0.048
Neg logit contributions
EMS-0.049
 Mehran-0.043
papers-0.040
 accuracy-0.040
":{"-0.038
Token Economics
Token ID18963
Feature activation+6.45e-02

Pos logit contributions
EMS+0.034
 Mehran+0.031
papers+0.029
 accuracy+0.029
":{"+0.028
Neg logit contributions
iblical-0.025
uca-0.025
leneck-0.024
 Painter-0.024
esan-0.022
Token at
Token ID379
Feature activation-7.96e-03

Pos logit contributions
iblical+0.050
uca+0.050
leneck+0.048
 Painter+0.048
esan+0.045
Neg logit contributions
EMS-0.043
 Mehran-0.038
papers-0.035
 accuracy-0.035
":{"-0.034
Token Fl
Token ID1610
Feature activation+1.41e-02

Pos logit contributions
EMS+0.006
 Mehran+0.005
papers+0.005
 accuracy+0.005
":{"+0.004
Neg logit contributions
iblical-0.006
uca-0.006
leneck-0.006
 Painter-0.006
esan-0.005
Token.
Token ID13
Feature activation+2.15e-03

Pos logit contributions
EMS+0.021
 Mehran+0.019
papers+0.018
 accuracy+0.017
 Stability+0.017
Neg logit contributions
iblical-0.025
uca-0.025
leneck-0.024
 Painter-0.024
esan-0.022
Token\n
Token ID198
Feature activation+1.98e-02

Pos logit contributions
iblical+0.001
uca+0.001
leneck+0.001
 Painter+0.001
esan+0.001
Neg logit contributions
EMS-0.002
 Mehran-0.001
papers-0.001
 accuracy-0.001
":{"-0.001
Token\n
Token ID198
Feature activation+1.14e-02

Pos logit contributions
iblical+0.013
uca+0.013
leneck+0.013
 Painter+0.013
esan+0.012
Neg logit contributions
EMS-0.015
 Mehran-0.014
papers-0.013
 accuracy-0.013
":{"-0.012
TokenDr
Token ID6187
Feature activation-1.06e-02

Pos logit contributions
iblical+0.008
uca+0.008
leneck+0.008
 Painter+0.008
esan+0.007
Neg logit contributions
EMS-0.008
 Mehran-0.007
papers-0.007
 accuracy-0.007
 Stability-0.007
Token D
Token ID360
Feature activation-1.54e-02

Pos logit contributions
EMS+0.008
 Mehran+0.007
papers+0.006
 accuracy+0.006
 Stability+0.006
Neg logit contributions
iblical-0.008
uca-0.008
leneck-0.008
 Painter-0.007
esan-0.007
Tokenyer
Token ID9860
Feature activation+3.33e-01

Pos logit contributions
EMS+0.017
 Mehran+0.015
papers+0.015
 accuracy+0.015
":{"+0.015
Neg logit contributions
iblical-0.005
uca-0.005
leneck-0.005
 Painter-0.005
esan-0.004
Token said
Token ID531
Feature activation+2.68e-02

Pos logit contributions
uca+0.218
iblical+0.218
leneck+0.210
 Painter+0.208
esan+0.193
Neg logit contributions
EMS-0.261
 Mehran-0.232
papers-0.221
 accuracy-0.219
 Stability-0.212
Token that
Token ID326
Feature activation-1.14e-02

Pos logit contributions
iblical+0.020
uca+0.020
leneck+0.019
 Painter+0.019
esan+0.018
Neg logit contributions
EMS-0.018
 Mehran-0.016
papers-0.015
 accuracy-0.015
":{"-0.014
Token rule
Token ID3896
Feature activation-1.09e-01

Pos logit contributions
EMS+0.008
 Mehran+0.007
papers+0.007
 accuracy+0.006
":{"+0.006
Neg logit contributions
iblical-0.009
uca-0.009
leneck-0.008
 Painter-0.008
esan-0.008
Token learning
Token ID4673
Feature activation-9.47e-02

Pos logit contributions
EMS+0.074
 Mehran+0.065
papers+0.062
 accuracy+0.061
":{"+0.059
Neg logit contributions
iblical-0.082
uca-0.082
leneck-0.079
 Painter-0.078
esan-0.074
Token was
Token ID373
Feature activation-1.67e-03

Pos logit contributions
EMS+0.068
 Mehran+0.060
papers+0.056
 accuracy+0.055
":{"+0.054
Neg logit contributions
iblical-0.069
uca-0.069
leneck-0.066
 Painter-0.066
esan-0.062
Token
Token ID447
Feature activation+1.59e-02

Pos logit contributions
EMS+0.005
 Mehran+0.005
papers+0.005
 accuracy+0.004
":{"+0.004
Neg logit contributions
iblical-0.005
uca-0.005
leneck-0.004
 Painter-0.004
esan-0.004
Token
Token ID251
Feature activation-1.12e-02

Pos logit contributions
iblical+0.012
uca+0.012
leneck+0.011
 Painter+0.011
esan+0.011
Neg logit contributions
EMS-0.011
 Mehran-0.010
papers-0.009
 accuracy-0.009
":{"-0.009
Token On
Token ID1550
Feature activation-5.05e-02

Pos logit contributions
EMS+0.009
 Mehran+0.008
papers+0.007
 accuracy+0.007
":{"+0.007
Neg logit contributions
iblical-0.007
uca-0.007
leneck-0.007
 Painter-0.007
esan-0.007
Token Facebook
Token ID3203
Feature activation+5.22e-02

Pos logit contributions
EMS+0.037
 Mehran+0.033
papers+0.031
 accuracy+0.031
":{"+0.030
Neg logit contributions
iblical-0.035
uca-0.035
leneck-0.034
 Painter-0.034
esan-0.031
Token,
Token ID11
Feature activation+2.34e-02

Pos logit contributions
iblical+0.039
uca+0.038
leneck+0.037
 Painter+0.037
esan+0.034
Neg logit contributions
EMS-0.037
 Mehran-0.032
papers-0.031
 accuracy-0.030
":{"-0.029
Token Flynn
Token ID14654
Feature activation+2.76e-01

Pos logit contributions
iblical+0.015
uca+0.015
leneck+0.014
 Painter+0.014
esan+0.013
Neg logit contributions
EMS-0.019
 Mehran-0.017
papers-0.016
 accuracy-0.016
":{"-0.015
Token linked
Token ID6692
Feature activation+6.87e-02

Pos logit contributions
iblical+0.185
uca+0.185
leneck+0.178
 Painter+0.176
esan+0.164
Neg logit contributions
EMS-0.213
 Mehran-0.189
papers-0.180
 accuracy-0.178
":{"-0.172
Token to
Token ID284
Feature activation-1.06e-02

Pos logit contributions
iblical+0.052
uca+0.052
leneck+0.050
 Painter+0.050
esan+0.047
Neg logit contributions
EMS-0.047
 Mehran-0.041
papers-0.039
 accuracy-0.038
":{"-0.037
Token the
Token ID262
Feature activation-1.73e-02

Pos logit contributions
EMS+0.007
 Mehran+0.006
papers+0.006
 accuracy+0.006
":{"+0.006
Neg logit contributions
iblical-0.008
uca-0.008
leneck-0.008
 Painter-0.008
esan-0.007
Token CNN
Token ID8100
Feature activation-8.27e-03

Pos logit contributions
EMS+0.012
 Mehran+0.011
papers+0.010
 accuracy+0.010
":{"+0.010
Neg logit contributions
iblical-0.013
uca-0.013
leneck-0.012
 Painter-0.012
esan-0.011
Token article
Token ID2708
Feature activation-5.22e-02

Pos logit contributions
EMS+0.005
 Mehran+0.005
papers+0.005
 accuracy+0.004
":{"+0.004
Neg logit contributions
iblical-0.006
uca-0.006
leneck-0.006
 Painter-0.006
esan-0.006
Token Bolton
Token ID35293
Feature activation+6.96e-02

Pos logit contributions
EMS+0.049
 Mehran+0.043
papers+0.041
 accuracy+0.039
":{"+0.038
Neg logit contributions
uca-0.053
iblical-0.053
 Painter-0.051
leneck-0.051
esan-0.047
Token only
Token ID691
Feature activation+7.11e-02

Pos logit contributions
uca+0.047
iblical+0.047
leneck+0.045
 Painter+0.045
esan+0.042
Neg logit contributions
EMS-0.053
 Mehran-0.047
papers-0.045
 accuracy-0.044
":{"-0.043
Token semi
Token ID10663
Feature activation+1.76e-01

Pos logit contributions
uca+0.048
iblical+0.047
leneck+0.046
 Painter+0.045
esan+0.042
Neg logit contributions
EMS-0.055
 Mehran-0.049
papers-0.046
 accuracy-0.045
 Stability-0.044
Token-
Token ID12
Feature activation+6.23e-02

Pos logit contributions
iblical+0.128
uca+0.128
leneck+0.123
 Painter+0.122
esan+0.114
Neg logit contributions
EMS-0.126
 Mehran-0.111
papers-0.105
 accuracy-0.104
":{"-0.101
Tokencle
Token ID2375
Feature activation-6.21e-03

Pos logit contributions
iblical+0.046
uca+0.046
leneck+0.044
 Painter+0.043
esan+0.041
Neg logit contributions
EMS-0.044
 Mehran-0.039
papers-0.036
 accuracy-0.036
 Stability-0.035
Tokenared
Token ID1144
Feature activation+2.96e-01

Pos logit contributions
EMS+0.006
 Mehran+0.005
papers+0.005
 accuracy+0.005
":{"+0.005
Neg logit contributions
iblical-0.003
uca-0.003
leneck-0.003
 Painter-0.003
esan-0.003
Token Anton
Token ID9261
Feature activation+6.12e-02

Pos logit contributions
iblical+0.221
uca+0.220
 Painter+0.215
leneck+0.212
esan+0.199
Neg logit contributions
EMS-0.203
 Mehran-0.178
papers-0.168
 accuracy-0.164
":{"-0.160
Token Ferdinand
Token ID44312
Feature activation+6.62e-02

Pos logit contributions
iblical+0.047
uca+0.047
leneck+0.046
 Painter+0.045
esan+0.043
Neg logit contributions
EMS-0.041
 Mehran-0.036
papers-0.034
 accuracy-0.033
":{"-0.032
Token's
Token ID338
Feature activation+6.80e-02

Pos logit contributions
iblical+0.044
uca+0.044
leneck+0.042
 Painter+0.042
esan+0.039
Neg logit contributions
EMS-0.051
 Mehran-0.046
papers-0.043
 accuracy-0.043
":{"-0.042
Token deep
Token ID2769
Feature activation-6.16e-02

Pos logit contributions
uca+0.050
iblical+0.050
leneck+0.048
 Painter+0.048
esan+0.045
Neg logit contributions
EMS-0.048
 Mehran-0.042
papers-0.040
 accuracy-0.038
":{"-0.038
Token cross
Token ID3272
Feature activation+9.67e-02

Pos logit contributions
EMS+0.043
 Mehran+0.038
papers+0.036
 accuracy+0.034
":{"+0.034
Neg logit contributions
iblical-0.046
uca-0.045
leneck-0.044
 Painter-0.044
esan-0.041
Token-
Token ID12
Feature activation-9.83e-03

Pos logit contributions
iblical+0.025
uca+0.025
leneck+0.024
 Painter+0.024
esan+0.023
Neg logit contributions
EMS-0.026
 Mehran-0.023
papers-0.021
 accuracy-0.021
 Stability-0.021
Tokenaligned
Token ID41634
Feature activation-8.75e-02

Pos logit contributions
EMS+0.008
 Mehran+0.007
papers+0.007
 accuracy+0.007
 Stability+0.006
Neg logit contributions
iblical-0.006
uca-0.006
leneck-0.006
 Painter-0.006
esan-0.006
Token countries
Token ID2678
Feature activation-1.09e-01

Pos logit contributions
EMS+0.042
 Mehran+0.035
papers+0.032
 accuracy+0.031
":{"+0.029
Neg logit contributions
iblical-0.084
uca-0.084
leneck-0.081
 Painter-0.081
esan-0.077
Token must
Token ID1276
Feature activation-2.91e-02

Pos logit contributions
EMS+0.077
 Mehran+0.068
papers+0.064
 accuracy+0.064
":{"+0.061
Neg logit contributions
iblical-0.080
uca-0.080
leneck-0.077
 Painter-0.077
esan-0.072
Token give
Token ID1577
Feature activation-2.69e-02

Pos logit contributions
EMS+0.020
 Mehran+0.018
papers+0.017
 accuracy+0.016
":{"+0.016
Neg logit contributions
iblical-0.022
uca-0.022
leneck-0.021
 Painter-0.021
esan-0.020
Token priority
Token ID8475
Feature activation+4.33e-02

Pos logit contributions
EMS+0.018
 Mehran+0.015
papers+0.014
 accuracy+0.014
":{"+0.014
Neg logit contributions
iblical-0.021
uca-0.021
leneck-0.020
 Painter-0.020
esan-0.019
Token to
Token ID284
Feature activation+1.37e-02

Pos logit contributions
iblical+0.033
uca+0.033
leneck+0.032
 Painter+0.031
esan+0.029
Neg logit contributions
EMS-0.030
 Mehran-0.026
papers-0.025
 accuracy-0.024
":{"-0.023
Token the
Token ID262
Feature activation-2.64e-02

Pos logit contributions
iblical+0.011
uca+0.011
leneck+0.010
 Painter+0.010
esan+0.010
Neg logit contributions
EMS-0.009
 Mehran-0.008
papers-0.007
 accuracy-0.007
":{"-0.007
Token abolition
Token ID30480
Feature activation-2.34e-02

Pos logit contributions
EMS+0.017
 Mehran+0.015
papers+0.014
 accuracy+0.014
":{"+0.013
Neg logit contributions
iblical-0.021
uca-0.021
leneck-0.020
 Painter-0.020
esan-0.019
Token of
Token ID286
Feature activation+4.91e-03

Pos logit contributions
EMS+0.016
 Mehran+0.014
papers+0.013
 accuracy+0.013
":{"+0.013
Neg logit contributions
iblical-0.018
uca-0.017
leneck-0.017
 Painter-0.017
esan-0.016
Token nuclear
Token ID4523
Feature activation-6.10e-02

Pos logit contributions
iblical+0.004
uca+0.004
leneck+0.004
 Painter+0.004
esan+0.004
Neg logit contributions
EMS-0.003
 Mehran-0.003
papers-0.003
 accuracy-0.003
":{"-0.002
Tokenans
Token ID504
Feature activation+6.07e-02

Pos logit contributions
EMS+0.209
 Mehran+0.184
papers+0.175
 accuracy+0.172
":{"+0.168
Neg logit contributions
iblical-0.218
uca-0.217
leneck-0.211
 Painter-0.208
esan-0.194
Token,
Token ID11
Feature activation+2.51e-02

Pos logit contributions
iblical+0.043
uca+0.043
leneck+0.041
 Painter+0.041
esan+0.038
Neg logit contributions
EMS-0.044
 Mehran-0.040
papers-0.037
 accuracy-0.037
 Stability-0.036
Token the
Token ID262
Feature activation+2.08e-02

Pos logit contributions
iblical+0.019
uca+0.019
leneck+0.019
 Painter+0.019
esan+0.017
Neg logit contributions
EMS-0.017
 Mehran-0.015
papers-0.014
 accuracy-0.014
":{"-0.013
Token only
Token ID691
Feature activation+3.96e-02

Pos logit contributions
iblical+0.015
uca+0.015
leneck+0.014
 Painter+0.014
esan+0.013
Neg logit contributions
EMS-0.015
 Mehran-0.013
papers-0.013
 accuracy-0.013
":{"-0.012
Token H
Token ID367
Feature activation-8.71e-02

Pos logit contributions
iblical+0.027
uca+0.027
leneck+0.026
 Painter+0.026
esan+0.024
Neg logit contributions
EMS-0.030
 Mehran-0.026
papers-0.025
 accuracy-0.025
":{"-0.024
TokenOPE
Token ID32135
Feature activation+1.21e-01

Pos logit contributions
EMS+0.070
 Mehran+0.063
papers+0.060
 accuracy+0.060
":{"+0.058
Neg logit contributions
iblical-0.055
uca-0.055
leneck-0.053
 Painter-0.052
esan-0.048
Token scholar
Token ID15606
Feature activation+6.46e-02

Pos logit contributions
iblical+0.082
uca+0.082
leneck+0.079
 Painter+0.078
esan+0.073
Neg logit contributions
EMS-0.092
 Mehran-0.082
papers-0.078
 accuracy-0.077
":{"-0.074
Token in
Token ID287
Feature activation+3.08e-02

Pos logit contributions
iblical+0.047
uca+0.047
leneck+0.045
 Painter+0.045
esan+0.042
Neg logit contributions
EMS-0.046
 Mehran-0.040
papers-0.038
 accuracy-0.038
 Stability-0.036
Token the
Token ID262
Feature activation+1.48e-02

Pos logit contributions
iblical+0.024
uca+0.023
leneck+0.023
 Painter+0.023
esan+0.021
Neg logit contributions
EMS-0.021
 Mehran-0.018
papers-0.017
 accuracy-0.017
":{"-0.016
Token legislature
Token ID15928
Feature activation+2.14e-02

Pos logit contributions
iblical+0.010
uca+0.010
leneck+0.009
 Painter+0.009
esan+0.009
Neg logit contributions
EMS-0.012
 Mehran-0.010
papers-0.010
 accuracy-0.010
":{"-0.009
Token at
Token ID379
Feature activation+1.52e-03

Pos logit contributions
iblical+0.016
uca+0.016
leneck+0.015
 Painter+0.015
esan+0.014
Neg logit contributions
EMS-0.015
 Mehran-0.013
papers-0.013
 accuracy-0.012
":{"-0.012
TokenUCK
Token ID16696
Feature activation-7.13e-02

Pos logit contributions
iblical+0.013
uca+0.013
leneck+0.012
 Painter+0.012
esan+0.011
Neg logit contributions
EMS-0.014
 Mehran-0.013
papers-0.012
 accuracy-0.012
":{"-0.012
TokenG
Token ID38
Feature activation-1.32e-02

Pos logit contributions
EMS+0.049
 Mehran+0.043
papers+0.041
 accuracy+0.041
":{"+0.039
Neg logit contributions
iblical-0.053
uca-0.053
leneck-0.051
 Painter-0.051
esan-0.048
Token).
Token ID737
Feature activation-3.30e-02

Pos logit contributions
EMS+0.009
 Mehran+0.008
papers+0.008
 accuracy+0.008
":{"+0.008
Neg logit contributions
iblical-0.010
uca-0.010
leneck-0.009
 Painter-0.009
esan-0.009
Token The
Token ID383
Feature activation-4.99e-03

Pos logit contributions
EMS+0.024
 Mehran+0.021
papers+0.020
 accuracy+0.020
":{"+0.019
Neg logit contributions
iblical-0.023
uca-0.023
leneck-0.023
 Painter-0.022
esan-0.021
Token church
Token ID4928
Feature activation+5.43e-02

Pos logit contributions
EMS+0.003
 Mehran+0.003
papers+0.003
 accuracy+0.003
":{"+0.003
Neg logit contributions
iblical-0.004
uca-0.004
leneck-0.004
 Painter-0.004
esan-0.003
Token was
Token ID373
Feature activation+2.97e-02

Pos logit contributions
iblical+0.039
uca+0.039
leneck+0.037
 Painter+0.037
esan+0.035
Neg logit contributions
EMS-0.039
 Mehran-0.035
papers-0.033
 accuracy-0.032
 Stability-0.031
Token started
Token ID2067
Feature activation+6.28e-02

Pos logit contributions
iblical+0.021
uca+0.021
leneck+0.020
 Painter+0.020
esan+0.018
Neg logit contributions
EMS-0.022
 Mehran-0.020
papers-0.019
 accuracy-0.018
":{"-0.018
Token in
Token ID287
Feature activation+3.91e-03

Pos logit contributions
uca+0.045
iblical+0.045
leneck+0.043
 Painter+0.043
esan+0.040
Neg logit contributions
EMS-0.046
 Mehran-0.040
papers-0.038
 accuracy-0.038
 Stability-0.037
Token 1977
Token ID15589
Feature activation+4.28e-02

Pos logit contributions
iblical+0.003
uca+0.003
leneck+0.003
 Painter+0.003
esan+0.003
Neg logit contributions
EMS-0.003
 Mehran-0.002
papers-0.002
 accuracy-0.002
":{"-0.002
Token by
Token ID416
Feature activation+5.67e-03

Pos logit contributions
iblical+0.031
uca+0.031
leneck+0.030
 Painter+0.029
esan+0.028
Neg logit contributions
EMS-0.031
 Mehran-0.027
papers-0.026
 accuracy-0.025
 Stability-0.024
Token Ed
Token ID1717
Feature activation+1.10e-01

Pos logit contributions
iblical+0.004
uca+0.004
leneck+0.004
 Painter+0.004
esan+0.004
Neg logit contributions
EMS-0.004
 Mehran-0.003
papers-0.003
 accuracy-0.003
":{"-0.003
Token say
Token ID910
Feature activation-1.23e-03

Pos logit contributions
uca+0.050
iblical+0.050
leneck+0.048
 Painter+0.048
esan+0.045
Neg logit contributions
EMS-0.058
 Mehran-0.052
papers-0.049
 accuracy-0.049
 Stability-0.047
Token they
Token ID484
Feature activation+3.14e-02

Pos logit contributions
EMS+0.001
 Mehran+0.001
papers+0.001
 accuracy+0.001
":{"+0.001
Neg logit contributions
iblical-0.001
uca-0.001
leneck-0.001
 Painter-0.001
esan-0.001
Token received
Token ID2722
Feature activation-1.12e-01

Pos logit contributions
iblical+0.021
uca+0.021
leneck+0.021
 Painter+0.020
esan+0.019
Neg logit contributions
EMS-0.024
 Mehran-0.021
papers-0.020
 accuracy-0.020
":{"-0.019
Token notices
Token ID19748
Feature activation+3.89e-02

Pos logit contributions
EMS+0.082
 Mehran+0.073
papers+0.069
 accuracy+0.068
":{"+0.066
Neg logit contributions
iblical-0.080
uca-0.079
leneck-0.077
 Painter-0.076
esan-0.071
Token from
Token ID422
Feature activation-4.38e-02

Pos logit contributions
iblical+0.028
uca+0.028
leneck+0.027
 Painter+0.027
esan+0.025
Neg logit contributions
EMS-0.028
 Mehran-0.024
papers-0.023
 accuracy-0.023
":{"-0.022
Token Tinder
Token ID44043
Feature activation+8.64e-02

Pos logit contributions
EMS+0.032
 Mehran+0.029
papers+0.027
 accuracy+0.027
":{"+0.026
Neg logit contributions
iblical-0.031
uca-0.031
leneck-0.029
 Painter-0.029
esan-0.027
Token saying
Token ID2282
Feature activation+8.23e-03

Pos logit contributions
iblical+0.062
uca+0.062
leneck+0.060
 Painter+0.060
esan+0.056
Neg logit contributions
EMS-0.062
 Mehran-0.055
papers-0.052
 accuracy-0.051
":{"-0.049
Token the
Token ID262
Feature activation-5.22e-02

Pos logit contributions
uca+0.006
iblical+0.006
leneck+0.006
 Painter+0.006
esan+0.005
Neg logit contributions
EMS-0.006
 Mehran-0.005
papers-0.005
 accuracy-0.005
":{"-0.005
Token were
Token ID547
Feature activation+7.62e-03

Pos logit contributions
EMS+0.036
 Mehran+0.032
papers+0.030
 accuracy+0.029
":{"+0.029
Neg logit contributions
uca-0.039
iblical-0.039
leneck-0.038
 Painter-0.037
esan-0.035
Token unable
Token ID5906
Feature activation+2.02e-02

Pos logit contributions
uca+0.005
iblical+0.005
leneck+0.005
 Painter+0.005
esan+0.005
Neg logit contributions
EMS-0.006
 Mehran-0.005
papers-0.005
 accuracy-0.005
":{"-0.005
Token to
Token ID284
Feature activation+1.30e-02

Pos logit contributions
iblical+0.015
uca+0.015
leneck+0.014
 Painter+0.014
esan+0.013
Neg logit contributions
EMS-0.014
 Mehran-0.012
papers-0.012
 accuracy-0.012
":{"-0.011
Token young
Token ID1862
Feature activation+8.40e-03

Pos logit contributions
iblical+0.088
uca+0.088
leneck+0.085
 Painter+0.084
esan+0.079
Neg logit contributions
EMS-0.079
 Mehran-0.069
papers-0.065
 accuracy-0.064
 Stability-0.061
Token participants
Token ID6809
Feature activation+1.70e-01

Pos logit contributions
iblical+0.007
uca+0.007
leneck+0.006
 Painter+0.006
esan+0.006
Neg logit contributions
EMS-0.005
 Mehran-0.005
papers-0.005
 accuracy-0.004
":{"-0.004
Token of
Token ID286
Feature activation+4.37e-02

Pos logit contributions
iblical+0.125
uca+0.125
leneck+0.121
 Painter+0.120
esan+0.112
Neg logit contributions
EMS-0.119
 Mehran-0.105
papers-0.099
 accuracy-0.098
 Stability-0.095
Token protests
Token ID8536
Feature activation+2.10e-02

Pos logit contributions
iblical+0.035
uca+0.034
leneck+0.033
 Painter+0.033
esan+0.031
Neg logit contributions
EMS-0.028
 Mehran-0.025
papers-0.023
 accuracy-0.023
":{"-0.022
Token.
Token ID13
Feature activation+9.87e-03

Pos logit contributions
uca+0.016
iblical+0.016
leneck+0.015
 Painter+0.015
esan+0.014
Neg logit contributions
EMS-0.015
 Mehran-0.013
papers-0.012
 accuracy-0.012
 Stability-0.012
Token One
Token ID1881
Feature activation+1.01e-01

Pos logit contributions
iblical+0.007
uca+0.007
leneck+0.007
 Painter+0.007
esan+0.006
Neg logit contributions
EMS-0.007
 Mehran-0.006
papers-0.006
 accuracy-0.006
":{"-0.006
Token #
Token ID1303
Feature activation+1.74e-01

Pos logit contributions
iblical+0.075
uca+0.075
leneck+0.072
 Painter+0.071
esan+0.067
Neg logit contributions
EMS-0.070
 Mehran-0.062
papers-0.059
 accuracy-0.058
":{"-0.056
TokenWe
Token ID1135
Feature activation+1.46e-02

Pos logit contributions
iblical+0.120
uca+0.120
leneck+0.116
 Painter+0.114
esan+0.107
Neg logit contributions
EMS-0.130
 Mehran-0.116
papers-0.110
 accuracy-0.109
":{"-0.105
Token
Token ID447
Feature activation+4.05e-03

Pos logit contributions
iblical+0.011
uca+0.011
leneck+0.011
 Painter+0.011
esan+0.010
Neg logit contributions
EMS-0.010
 Mehran-0.009
papers-0.008
 accuracy-0.008
":{"-0.008
Token
Token ID247
Feature activation+8.46e-03

Pos logit contributions
uca+0.003
iblical+0.003
leneck+0.003
 Painter+0.003
esan+0.003
Neg logit contributions
EMS-0.003
 Mehran-0.002
papers-0.002
 accuracy-0.002
 Stability-0.002
Tokenve
Token ID303
Feature activation+6.47e-02

Pos logit contributions
iblical+0.006
uca+0.006
leneck+0.005
 Painter+0.005
esan+0.005
Neg logit contributions
EMS-0.007
 Mehran-0.006
papers-0.006
 accuracy-0.005
":{"-0.005
Token match
Token ID2872
Feature activation+1.73e-01

Pos logit contributions
iblical+0.018
uca+0.018
leneck+0.017
 Painter+0.017
esan+0.016
Neg logit contributions
EMS-0.018
 Mehran-0.016
papers-0.015
 accuracy-0.015
":{"-0.015
Token for
Token ID329
Feature activation+5.45e-02

Pos logit contributions
iblical+0.110
uca+0.110
leneck+0.106
 Painter+0.104
esan+0.097
Neg logit contributions
EMS-0.138
 Mehran-0.124
papers-0.118
 accuracy-0.117
":{"-0.114
Token the
Token ID262
Feature activation+2.86e-02

Pos logit contributions
iblical+0.042
uca+0.041
leneck+0.040
 Painter+0.040
esan+0.037
Neg logit contributions
EMS-0.037
 Mehran-0.032
papers-0.031
 accuracy-0.030
":{"-0.029
Token Gods
Token ID15391
Feature activation+5.23e-02

Pos logit contributions
iblical+0.020
uca+0.020
leneck+0.020
 Painter+0.019
esan+0.018
Neg logit contributions
EMS-0.021
 Mehran-0.018
papers-0.017
 accuracy-0.017
":{"-0.017
Token themselves
Token ID2405
Feature activation+7.85e-02

Pos logit contributions
iblical+0.041
uca+0.041
leneck+0.040
 Painter+0.040
esan+0.038
Neg logit contributions
EMS-0.034
 Mehran-0.030
papers-0.028
 accuracy-0.027
 Stability-0.026
Token.
Token ID13
Feature activation+2.00e-02

Pos logit contributions
iblical+0.064
uca+0.064
leneck+0.062
 Painter+0.061
esan+0.058
Neg logit contributions
EMS-0.049
 Mehran-0.042
papers-0.040
 accuracy-0.039
 Stability-0.038
Token
Token ID564
Feature activation+1.59e-02

Pos logit contributions
iblical+0.014
uca+0.014
leneck+0.013
 Painter+0.013
esan+0.012
Neg logit contributions
EMS-0.015
 Mehran-0.013
papers-0.013
 accuracy-0.012
":{"-0.012
Token
Token ID250
Feature activation+2.15e-02

Pos logit contributions
iblical+0.011
uca+0.011
leneck+0.011
 Painter+0.010
esan+0.010
Neg logit contributions
EMS-0.012
 Mehran-0.011
papers-0.010
 accuracy-0.010
":{"-0.010
TokenO
Token ID46
Feature activation-6.81e-03

Pos logit contributions
iblical+0.014
uca+0.014
leneck+0.014
 Painter+0.013
esan+0.013
Neg logit contributions
EMS-0.017
 Mehran-0.015
papers-0.014
 accuracy-0.014
":{"-0.014
Tokenlymp
Token ID6760
Feature activation+1.17e-01

Pos logit contributions
EMS+0.005
 Mehran+0.005
 accuracy+0.004
papers+0.004
 Stability+0.004
Neg logit contributions
iblical-0.004
uca-0.004
leneck-0.004
 Painter-0.004
esan-0.004
Tokenus
Token ID385
Feature activation+1.16e-01

Pos logit contributions
iblical+0.073
uca+0.073
leneck+0.070
 Painter+0.069
esan+0.064
Neg logit contributions
EMS-0.096
 Mehran-0.086
papers-0.082
 accuracy-0.081
":{"-0.079
Token anxiety
Token ID9751
Feature activation+8.61e-02

Pos logit contributions
EMS+0.037
 Mehran+0.033
papers+0.031
 accuracy+0.031
":{"+0.030
Neg logit contributions
iblical-0.033
uca-0.033
leneck-0.032
 Painter-0.032
esan-0.030
Token and
Token ID290
Feature activation-2.12e-02

Pos logit contributions
iblical+0.066
uca+0.066
leneck+0.064
 Painter+0.063
esan+0.060
Neg logit contributions
EMS-0.058
 Mehran-0.050
papers-0.048
 accuracy-0.048
":{"-0.046
Token,
Token ID11
Feature activation-3.34e-03

Pos logit contributions
EMS+0.015
 Mehran+0.014
papers+0.013
 accuracy+0.013
":{"+0.012
Neg logit contributions
iblical-0.015
uca-0.015
leneck-0.015
 Painter-0.014
esan-0.013
Token of
Token ID286
Feature activation-1.12e-02

Pos logit contributions
EMS+0.002
 Mehran+0.002
papers+0.002
 accuracy+0.002
":{"+0.002
Neg logit contributions
iblical-0.002
uca-0.002
leneck-0.002
 Painter-0.002
esan-0.002
Token course
Token ID1781
Feature activation-5.57e-02

Pos logit contributions
EMS+0.007
 Mehran+0.006
papers+0.006
 accuracy+0.006
":{"+0.005
Neg logit contributions
iblical-0.009
uca-0.009
leneck-0.009
 Painter-0.009
esan-0.008
Token,
Token ID11
Feature activation+8.47e-04

Pos logit contributions
EMS+0.039
 Mehran+0.035
papers+0.033
 accuracy+0.032
":{"+0.031
Neg logit contributions
iblical-0.041
uca-0.041
leneck-0.040
 Painter-0.039
esan-0.037
Token a
Token ID257
Feature activation+1.22e-02

Pos logit contributions
iblical+0.001
uca+0.001
leneck+0.001
 Painter+0.001
esan+0.001
Neg logit contributions
EMS-0.001
 Mehran-0.001
papers-0.001
 accuracy-0.001
":{"-0.000
Token drinking
Token ID7722
Feature activation-4.99e-02

Pos logit contributions
iblical+0.009
uca+0.009
leneck+0.008
 Painter+0.008
esan+0.008
Neg logit contributions
EMS-0.009
 Mehran-0.008
papers-0.008
 accuracy-0.007
":{"-0.007
Token problem
Token ID1917
Feature activation-3.22e-03

Pos logit contributions
EMS+0.033
 Mehran+0.028
papers+0.027
 accuracy+0.026
":{"+0.025
Neg logit contributions
iblical-0.039
uca-0.039
leneck-0.038
 Painter-0.037
esan-0.035
Token.
Token ID13
Feature activation-1.08e-02

Pos logit contributions
EMS+0.002
 Mehran+0.002
papers+0.002
 accuracy+0.002
":{"+0.002
Neg logit contributions
iblical-0.003
uca-0.003
leneck-0.003
 Painter-0.003
esan-0.002
Token That
Token ID1320
Feature activation+1.45e-02

Pos logit contributions
EMS+0.008
 Mehran+0.007
papers+0.007
 accuracy+0.007
":{"+0.006
Neg logit contributions
iblical-0.008
uca-0.008
leneck-0.007
 Painter-0.007
esan-0.007
Token who
Token ID508
Feature activation-5.12e-02

Pos logit contributions
iblical+0.008
uca+0.008
leneck+0.007
 Painter+0.007
esan+0.007
Neg logit contributions
EMS-0.009
 Mehran-0.008
papers-0.007
 accuracy-0.007
 Stability-0.007
Token wishes
Token ID12802
Feature activation+6.17e-02

Pos logit contributions
EMS+0.037
 Mehran+0.032
papers+0.030
 accuracy+0.030
 Stability+0.029
Neg logit contributions
iblical-0.037
uca-0.037
leneck-0.036
 Painter-0.035
esan-0.033
Token to
Token ID284
Feature activation-2.26e-02

Pos logit contributions
uca+0.045
iblical+0.045
leneck+0.043
 Painter+0.043
esan+0.040
Neg logit contributions
EMS-0.044
 Mehran-0.038
papers-0.036
 accuracy-0.036
 Stability-0.035
Token transition
Token ID6801
Feature activation-1.47e-01

Pos logit contributions
EMS+0.015
 Mehran+0.013
papers+0.012
 accuracy+0.012
 Stability+0.012
Neg logit contributions
iblical-0.017
uca-0.017
leneck-0.017
 Painter-0.017
esan-0.016
Token their
Token ID511
Feature activation+1.02e-02

Pos logit contributions
EMS+0.101
 Mehran+0.089
papers+0.083
 accuracy+0.082
 Stability+0.079
Neg logit contributions
iblical-0.112
uca-0.111
leneck-0.107
 Painter-0.106
esan-0.100
Token gender
Token ID5279
Feature activation-5.73e-02

Pos logit contributions
iblical+0.008
uca+0.008
leneck+0.008
 Painter+0.008
esan+0.008
Neg logit contributions
EMS-0.006
 Mehran-0.005
papers-0.005
 accuracy-0.005
":{"-0.005
Token socially
Token ID18118
Feature activation-2.91e-02

Pos logit contributions
EMS+0.035
 Mehran+0.030
papers+0.028
 accuracy+0.027
":{"+0.026
Neg logit contributions
iblical-0.048
uca-0.047
leneck-0.046
 Painter-0.046
esan-0.043
Token and
Token ID290
Feature activation-3.81e-02

Pos logit contributions
EMS+0.019
 Mehran+0.017
papers+0.016
 accuracy+0.016
 Stability+0.015
Neg logit contributions
iblical-0.023
uca-0.023
leneck-0.022
 Painter-0.022
esan-0.020
Token/
Token ID14
Feature activation-4.57e-02

Pos logit contributions
EMS+0.028
 Mehran+0.024
papers+0.023
 accuracy+0.023
":{"+0.022
Neg logit contributions
iblical-0.027
uca-0.027
leneck-0.026
 Painter-0.026
esan-0.024
Tokenor
Token ID273
Feature activation-1.00e-02

Pos logit contributions
EMS+0.036
 Mehran+0.032
papers+0.031
 accuracy+0.031
":{"+0.030
Neg logit contributions
iblical-0.030
uca-0.029
leneck-0.028
 Painter-0.028
esan-0.026
Token physically
Token ID10170
Feature activation+4.80e-02

Pos logit contributions
EMS+0.007
 Mehran+0.006
papers+0.006
 accuracy+0.006
 Stability+0.006
Neg logit contributions
iblical-0.007
uca-0.007
leneck-0.007
 Painter-0.007
esan-0.006
Token
Token ID564
Feature activation-1.45e-02

Pos logit contributions
EMS+0.014
 Mehran+0.013
papers+0.012
 accuracy+0.012
":{"+0.012
Neg logit contributions
uca-0.013
iblical-0.013
leneck-0.012
 Painter-0.012
esan-0.011
Token
Token ID250
Feature activation-1.95e-03

Pos logit contributions
EMS+0.010
 Mehran+0.009
papers+0.009
 accuracy+0.009
":{"+0.008
Neg logit contributions
iblical-0.010
uca-0.010
leneck-0.010
 Painter-0.010
esan-0.009
TokenThey
Token ID2990
Feature activation+4.39e-02

Pos logit contributions
EMS+0.002
 Mehran+0.001
papers+0.001
 accuracy+0.001
 Stability+0.001
Neg logit contributions
iblical-0.001
uca-0.001
leneck-0.001
 Painter-0.001
esan-0.001
Token
Token ID447
Feature activation-2.24e-02

Pos logit contributions
iblical+0.032
uca+0.032
leneck+0.031
 Painter+0.030
esan+0.029
Neg logit contributions
EMS-0.031
 Mehran-0.028
papers-0.026
 accuracy-0.026
 Stability-0.025
Token
Token ID247
Feature activation-1.92e-02

Pos logit contributions
EMS+0.016
 Mehran+0.014
papers+0.013
 accuracy+0.013
 Stability+0.013
Neg logit contributions
uca-0.016
iblical-0.016
leneck-0.016
 Painter-0.016
esan-0.015
Tokenre
Token ID260
Feature activation-3.66e-02

Pos logit contributions
EMS+0.014
 Mehran+0.012
papers+0.012
 accuracy+0.012
":{"+0.011
Neg logit contributions
iblical-0.014
uca-0.014
leneck-0.013
 Painter-0.013
esan-0.012
Token tourists
Token ID15930
Feature activation-3.20e-02

Pos logit contributions
EMS+0.027
 Mehran+0.024
papers+0.023
 accuracy+0.023
":{"+0.022
Neg logit contributions
iblical-0.026
uca-0.026
leneck-0.025
 Painter-0.024
esan-0.023
Token spending
Token ID4581
Feature activation+2.60e-02

Pos logit contributions
EMS+0.022
 Mehran+0.019
papers+0.018
 accuracy+0.018
 Stability+0.017
Neg logit contributions
iblical-0.024
uca-0.024
leneck-0.023
 Painter-0.023
esan-0.022
Token money
Token ID1637
Feature activation-8.77e-02

Pos logit contributions
iblical+0.019
uca+0.019
leneck+0.018
 Painter+0.018
esan+0.017
Neg logit contributions
EMS-0.018
 Mehran-0.016
papers-0.015
 accuracy-0.015
":{"-0.014
Token.
Token ID13
Feature activation-7.34e-03

Pos logit contributions
EMS+0.059
 Mehran+0.052
papers+0.049
 accuracy+0.049
":{"+0.047
Neg logit contributions
iblical-0.067
uca-0.067
leneck-0.064
 Painter-0.064
esan-0.060
Token I
Token ID314
Feature activation-9.60e-03

Pos logit contributions
EMS+0.005
 Mehran+0.004
papers+0.004
 accuracy+0.004
":{"+0.004
Neg logit contributions
iblical-0.005
uca-0.005
leneck-0.005
 Painter-0.005
esan-0.005
Tokenrary
Token ID11619
Feature activation-6.82e-02

Pos logit contributions
iblical+0.060
uca+0.059
leneck+0.057
 Painter+0.056
esan+0.052
Neg logit contributions
EMS-0.079
 Mehran-0.072
papers-0.068
 accuracy-0.067
 Stability-0.065
Token to
Token ID284
Feature activation-9.99e-03

Pos logit contributions
EMS+0.046
 Mehran+0.041
papers+0.038
 accuracy+0.038
":{"+0.037
Neg logit contributions
iblical-0.052
uca-0.052
leneck-0.050
 Painter-0.049
esan-0.047
Token the
Token ID262
Feature activation-1.53e-02

Pos logit contributions
EMS+0.007
 Mehran+0.006
papers+0.005
 accuracy+0.005
":{"+0.005
Neg logit contributions
iblical-0.008
uca-0.008
leneck-0.008
 Painter-0.007
esan-0.007
Token South
Token ID2520
Feature activation-6.15e-02

Pos logit contributions
EMS+0.011
 Mehran+0.010
papers+0.009
 accuracy+0.009
":{"+0.009
Neg logit contributions
iblical-0.011
uca-0.011
leneck-0.011
 Painter-0.011
esan-0.010
Token
Token ID447
Feature activation+3.26e-04

Pos logit contributions
EMS+0.043
 Mehran+0.038
papers+0.036
 accuracy+0.035
":{"+0.034
Neg logit contributions
iblical-0.046
uca-0.045
leneck-0.044
 Painter-0.043
esan-0.041
Token
Token ID247
Feature activation-3.32e-03

Pos logit contributions
iblical+0.000
uca+0.000
leneck+0.000
 Painter+0.000
esan+0.000
Neg logit contributions
EMS-0.000
 Mehran-0.000
papers-0.000
 accuracy-0.000
":{"-0.000
Tokens
Token ID82
Feature activation-4.83e-03

Pos logit contributions
EMS+0.002
 Mehran+0.002
papers+0.002
 accuracy+0.002
":{"+0.002
Neg logit contributions
iblical-0.003
uca-0.003
leneck-0.002
 Painter-0.002
esan-0.002
Token reputation
Token ID8507
Feature activation-5.65e-02

Pos logit contributions
EMS+0.003
 Mehran+0.003
papers+0.003
 accuracy+0.003
":{"+0.003
Neg logit contributions
iblical-0.004
uca-0.004
leneck-0.003
 Painter-0.003
esan-0.003
Token as
Token ID355
Feature activation+2.18e-03

Pos logit contributions
EMS+0.040
 Mehran+0.035
papers+0.033
 accuracy+0.033
":{"+0.032
Neg logit contributions
iblical-0.041
uca-0.041
leneck-0.040
 Painter-0.039
esan-0.037
Token not
Token ID407
Feature activation-1.32e-02

Pos logit contributions
iblical+0.002
uca+0.002
leneck+0.002
 Painter+0.002
esan+0.002
Neg logit contributions
EMS-0.001
 Mehran-0.001
papers-0.001
 accuracy-0.001
":{"-0.001
Token being
Token ID852
Feature activation+7.18e-03

Pos logit contributions
EMS+0.010
 Mehran+0.009
papers+0.008
 accuracy+0.008
":{"+0.008
Neg logit contributions
iblical-0.009
uca-0.009
leneck-0.009
 Painter-0.009
esan-0.008
Token nickel
Token ID30666
Feature activation+7.13e-03

Pos logit contributions
iblical+0.002
uca+0.002
leneck+0.002
 Painter+0.002
esan+0.002
Neg logit contributions
EMS-0.002
 Mehran-0.002
papers-0.002
 accuracy-0.002
":{"-0.002
Token defense
Token ID3761
Feature activation-5.38e-02

Pos logit contributions
iblical+0.006
uca+0.006
leneck+0.005
 Painter+0.005
esan+0.005
Neg logit contributions
EMS-0.005
 Mehran-0.004
papers-0.004
 accuracy-0.004
":{"-0.004
Token.
Token ID13
Feature activation-2.15e-02

Pos logit contributions
EMS+0.037
 Mehran+0.033
papers+0.031
 accuracy+0.031
":{"+0.030
Neg logit contributions
iblical-0.040
uca-0.040
leneck-0.039
 Painter-0.038
esan-0.036
Token
Token ID198
Feature activation-1.48e-02

Pos logit contributions
EMS+0.017
 Mehran+0.015
papers+0.014
 accuracy+0.014
":{"+0.014
Neg logit contributions
iblical-0.014
uca-0.014
leneck-0.013
 Painter-0.013
esan-0.012
Token
Token ID198
Feature activation-1.92e-02

Pos logit contributions
EMS+0.011
 Mehran+0.010
papers+0.010
 accuracy+0.009
":{"+0.009
Neg logit contributions
iblical-0.010
uca-0.010
leneck-0.010
 Painter-0.010
esan-0.009
TokenOff
Token ID9362
Feature activation-2.24e-01

Pos logit contributions
EMS+0.014
 Mehran+0.013
papers+0.012
 accuracy+0.012
 Stability+0.012
Neg logit contributions
iblical-0.013
uca-0.013
leneck-0.013
 Painter-0.012
esan-0.012
Tokenensive
Token ID2021
Feature activation+1.63e-01

Pos logit contributions
EMS+0.215
 Mehran+0.198
papers+0.189
 accuracy+0.187
":{"+0.182
Neg logit contributions
iblical-0.108
uca-0.107
leneck-0.101
 Painter-0.101
esan-0.090
Tokenly
Token ID306
Feature activation-1.45e-01

Pos logit contributions
iblical+0.101
uca+0.101
leneck+0.097
 Painter+0.096
esan+0.089
Neg logit contributions
EMS-0.134
 Mehran-0.120
papers-0.115
 accuracy-0.113
":{"-0.110
Token,
Token ID11
Feature activation-1.16e-02

Pos logit contributions
EMS+0.103
 Mehran+0.091
papers+0.086
 accuracy+0.086
":{"+0.083
Neg logit contributions
iblical-0.105
uca-0.104
leneck-0.101
 Painter-0.100
esan-0.094
Token the
Token ID262
Feature activation-8.79e-03

Pos logit contributions
EMS+0.008
 Mehran+0.007
papers+0.006
 accuracy+0.006
":{"+0.006
Neg logit contributions
iblical-0.009
uca-0.009
leneck-0.009
 Painter-0.008
esan-0.008
Token Soon
Token ID15894
Feature activation-1.92e-01

Pos logit contributions
EMS+0.006
 Mehran+0.005
papers+0.005
 accuracy+0.005
":{"+0.004
Neg logit contributions
iblical-0.007
uca-0.007
leneck-0.007
 Painter-0.007
esan-0.006
Token CO
Token ID7375
Feature activation-5.23e-02

Pos logit contributions
iblical+0.006
uca+0.006
leneck+0.006
 Painter+0.006
esan+0.006
Neg logit contributions
EMS-0.006
 Mehran-0.005
papers-0.005
 accuracy-0.004
":{"-0.004
Token 2
Token ID362
Feature activation-1.25e-02

Pos logit contributions
EMS+0.037
 Mehran+0.033
papers+0.031
 accuracy+0.030
":{"+0.029
Neg logit contributions
iblical-0.039
uca-0.038
leneck-0.037
 Painter-0.037
esan-0.034
Token content
Token ID2695
Feature activation-6.44e-02

Pos logit contributions
EMS+0.008
 Mehran+0.007
papers+0.007
 accuracy+0.007
":{"+0.006
Neg logit contributions
iblical-0.010
uca-0.010
leneck-0.009
 Painter-0.009
esan-0.009
Token as
Token ID355
Feature activation+6.08e-04

Pos logit contributions
EMS+0.045
 Mehran+0.040
papers+0.038
 accuracy+0.037
":{"+0.036
Neg logit contributions
iblical-0.047
uca-0.047
leneck-0.046
 Painter-0.045
esan-0.042
Token the
Token ID262
Feature activation+2.17e-02

Pos logit contributions
iblical+0.000
uca+0.000
leneck+0.000
 Painter+0.000
esan+0.000
Neg logit contributions
EMS-0.000
 Mehran-0.000
papers-0.000
 accuracy-0.000
":{"-0.000
Token air
Token ID1633
Feature activation-2.22e-01

Pos logit contributions
iblical+0.016
uca+0.016
leneck+0.016
 Painter+0.016
esan+0.015
Neg logit contributions
EMS-0.015
 Mehran-0.013
papers-0.012
 accuracy-0.012
":{"-0.012
Token on
Token ID319
Feature activation+6.47e-03

Pos logit contributions
EMS+0.156
 Mehran+0.138
papers+0.129
 accuracy+0.128
":{"+0.124
Neg logit contributions
iblical-0.164
uca-0.163
leneck-0.158
 Painter-0.156
esan-0.147
Token the
Token ID262
Feature activation+1.40e-02

Pos logit contributions
iblical+0.005
uca+0.005
leneck+0.004
 Painter+0.004
esan+0.004
Neg logit contributions
EMS-0.005
 Mehran-0.004
papers-0.004
 accuracy-0.004
":{"-0.004
Token surface
Token ID4417
Feature activation+1.34e-02

Pos logit contributions
iblical+0.011
uca+0.011
leneck+0.010
 Painter+0.010
esan+0.010
Neg logit contributions
EMS-0.009
 Mehran-0.008
papers-0.008
 accuracy-0.008
":{"-0.007
Token at
Token ID379
Feature activation+9.08e-03

Pos logit contributions
iblical+0.010
uca+0.010
leneck+0.010
 Painter+0.010
esan+0.009
Neg logit contributions
EMS-0.009
 Mehran-0.008
papers-0.008
 accuracy-0.008
":{"-0.007
Token the
Token ID262
Feature activation+2.36e-02

Pos logit contributions
iblical+0.007
uca+0.007
leneck+0.007
 Painter+0.007
esan+0.006
Neg logit contributions
EMS-0.006
 Mehran-0.005
papers-0.005
 accuracy-0.005
":{"-0.005
Token sticks
Token ID16461
Feature activation-3.11e-02

Pos logit contributions
iblical+0.048
uca+0.048
leneck+0.046
 Painter+0.046
esan+0.043
Neg logit contributions
EMS-0.044
 Mehran-0.039
papers-0.036
 accuracy-0.036
":{"-0.035
Token.
Token ID13
Feature activation-2.10e-02

Pos logit contributions
EMS+0.022
 Mehran+0.019
papers+0.018
 accuracy+0.018
":{"+0.017
Neg logit contributions
iblical-0.023
uca-0.023
leneck-0.022
 Painter-0.022
esan-0.021
Token
Token ID198
Feature activation+1.13e-02

Pos logit contributions
EMS+0.016
 Mehran+0.014
papers+0.013
 accuracy+0.013
":{"+0.013
Neg logit contributions
iblical-0.014
uca-0.014
leneck-0.014
 Painter-0.014
esan-0.013
Token
Token ID198
Feature activation+1.48e-02

Pos logit contributions
iblical+0.008
uca+0.008
leneck+0.007
 Painter+0.007
esan+0.007
Neg logit contributions
EMS-0.009
 Mehran-0.008
papers-0.007
 accuracy-0.007
":{"-0.007
TokenAnd
Token ID1870
Feature activation+6.86e-02

Pos logit contributions
iblical+0.010
uca+0.010
leneck+0.010
 Painter+0.009
esan+0.009
Neg logit contributions
EMS-0.011
 Mehran-0.010
papers-0.010
 accuracy-0.010
 Stability-0.009
Token finally
Token ID3443
Feature activation-2.29e-01

Pos logit contributions
iblical+0.050
uca+0.050
leneck+0.049
 Painter+0.048
esan+0.045
Neg logit contributions
EMS-0.048
 Mehran-0.043
papers-0.040
 accuracy-0.040
":{"-0.038
Token,
Token ID11
Feature activation-2.83e-02

Pos logit contributions
EMS+0.163
 Mehran+0.144
papers+0.136
 accuracy+0.134
":{"+0.130
Neg logit contributions
iblical-0.167
uca-0.166
leneck-0.161
 Painter-0.158
esan-0.149
Token as
Token ID355
Feature activation-1.61e-02

Pos logit contributions
EMS+0.020
 Mehran+0.017
papers+0.017
 accuracy+0.016
":{"+0.016
Neg logit contributions
iblical-0.021
uca-0.021
leneck-0.020
 Painter-0.020
esan-0.019
Token for
Token ID329
Feature activation-3.09e-02

Pos logit contributions
EMS+0.011
 Mehran+0.010
papers+0.009
 accuracy+0.009
":{"+0.009
Neg logit contributions
uca-0.012
iblical-0.012
leneck-0.012
 Painter-0.012
esan-0.011
Token admins
Token ID44563
Feature activation+2.85e-01

Pos logit contributions
EMS+0.021
 Mehran+0.018
papers+0.017
 accuracy+0.017
":{"+0.016
Neg logit contributions
iblical-0.024
uca-0.024
leneck-0.023
 Painter-0.023
esan-0.021
Token,
Token ID11
Feature activation-2.40e-02

Pos logit contributions
uca+0.205
iblical+0.204
leneck+0.197
 Painter+0.195
esan+0.184
Neg logit contributions
EMS-0.206
 Mehran-0.183
papers-0.172
 accuracy-0.168
 Stability-0.164
Token-
Token ID12
Feature activation+2.48e-02

Pos logit contributions
EMS+0.050
 Mehran+0.044
papers+0.042
 accuracy+0.042
":{"+0.040
Neg logit contributions
iblical-0.052
uca-0.052
leneck-0.050
 Painter-0.050
esan-0.047
Tokenim
Token ID320
Feature activation-6.28e-02

Pos logit contributions
iblical+0.017
uca+0.017
leneck+0.016
 Painter+0.016
esan+0.015
Neg logit contributions
EMS-0.019
 Mehran-0.017
papers-0.016
 accuracy-0.016
":{"-0.015
Tokenperial
Token ID7629
Feature activation+1.96e-01

Pos logit contributions
EMS+0.074
 Mehran+0.068
papers+0.066
 accuracy+0.066
 Stability+0.064
Neg logit contributions
uca-0.017
iblical-0.017
leneck-0.015
 Painter-0.015
esan-0.012
Tokenist
Token ID396
Feature activation-1.08e-01

Pos logit contributions
iblical+0.134
uca+0.133
leneck+0.129
 Painter+0.127
esan+0.119
Neg logit contributions
EMS-0.149
 Mehran-0.132
papers-0.125
 accuracy-0.124
":{"-0.120
Token sc
Token ID629
Feature activation+9.07e-03

Pos logit contributions
EMS+0.072
 Mehran+0.063
papers+0.059
 accuracy+0.059
":{"+0.057
Neg logit contributions
iblical-0.084
uca-0.084
leneck-0.081
 Painter-0.080
esan-0.075
Tokenound
Token ID633
Feature activation-2.24e-01

Pos logit contributions
iblical+0.008
uca+0.008
 Painter+0.008
leneck+0.008
esan+0.007
Neg logit contributions
EMS-0.005
 Mehran-0.004
 accuracy-0.004
papers-0.004
 Stability-0.004
Tokenrel
Token ID2411
Feature activation-1.91e-01

Pos logit contributions
EMS+0.112
 Mehran+0.093
papers+0.085
 accuracy+0.085
":{"+0.081
Neg logit contributions
uca-0.210
iblical-0.210
leneck-0.204
 Painter-0.202
esan-0.192
Token Vac
Token ID25709
Feature activation-2.10e-01

Pos logit contributions
EMS+0.126
 Mehran+0.110
papers+0.104
 accuracy+0.103
":{"+0.099
Neg logit contributions
iblical-0.148
uca-0.148
leneck-0.143
 Painter-0.142
esan-0.133
Tokenlav
Token ID18809
Feature activation-7.88e-02

Pos logit contributions
EMS+0.141
 Mehran+0.124
papers+0.117
 accuracy+0.115
":{"+0.111
Neg logit contributions
iblical-0.162
uca-0.161
leneck-0.156
 Painter-0.154
esan-0.145
Token Ha
Token ID9398
Feature activation-4.71e-02

Pos logit contributions
EMS+0.053
 Mehran+0.046
papers+0.043
 accuracy+0.043
 Stability+0.042
Neg logit contributions
iblical-0.061
uca-0.060
leneck-0.059
 Painter-0.057
esan-0.054
Tokenvel
Token ID626
Feature activation+7.07e-02

Pos logit contributions
EMS+0.036
 Mehran+0.033
papers+0.031
 accuracy+0.031
":{"+0.029
Neg logit contributions
iblical-0.031
uca-0.031
leneck-0.030
 Painter-0.030
esan-0.028
Token your
Token ID534
Feature activation-6.86e-03

Pos logit contributions
iblical+0.016
uca+0.016
leneck+0.016
 Painter+0.016
esan+0.015
Neg logit contributions
EMS-0.014
 Mehran-0.012
papers-0.012
 accuracy-0.012
":{"-0.011
Token
Token ID564
Feature activation+5.19e-04

Pos logit contributions
EMS+0.005
 Mehran+0.004
papers+0.004
 accuracy+0.004
":{"+0.004
Neg logit contributions
iblical-0.005
uca-0.005
leneck-0.005
 Painter-0.005
esan-0.005
Token
Token ID250
Feature activation+2.09e-02

Pos logit contributions
iblical+0.000
uca+0.000
leneck+0.000
 Painter+0.000
esan+0.000
Neg logit contributions
EMS-0.000
 Mehran-0.000
papers-0.000
 accuracy-0.000
":{"-0.000
Tokendriver
Token ID26230
Feature activation-2.03e-01

Pos logit contributions
iblical+0.015
uca+0.015
leneck+0.014
 Painter+0.014
esan+0.013
Neg logit contributions
EMS-0.015
 Mehran-0.014
papers-0.013
 accuracy-0.013
":{"-0.012
Token=
Token ID28
Feature activation+1.29e-01

Pos logit contributions
EMS+0.138
 Mehran+0.121
papers+0.114
 accuracy+0.113
":{"+0.109
Neg logit contributions
iblical-0.155
uca-0.154
leneck-0.149
 Painter-0.147
esan-0.139
Tokendriver
Token ID26230
Feature activation-2.43e-01

Pos logit contributions
uca+0.091
iblical+0.091
leneck+0.088
 Painter+0.086
esan+0.081
Neg logit contributions
EMS-0.095
 Mehran-0.084
papers-0.080
 accuracy-0.079
":{"-0.077
Token-
Token ID12
Feature activation-2.21e-02

Pos logit contributions
EMS+0.173
 Mehran+0.152
papers+0.144
 accuracy+0.142
":{"+0.137
Neg logit contributions
iblical-0.177
uca-0.177
leneck-0.171
 Painter-0.169
esan-0.159
Tokentype
Token ID4906
Feature activation+9.48e-02

Pos logit contributions
EMS+0.016
 Mehran+0.014
 accuracy+0.013
papers+0.013
 Stability+0.013
Neg logit contributions
uca-0.016
iblical-0.016
leneck-0.015
 Painter-0.015
esan-0.014
Token.
Token ID13
Feature activation-1.97e-04

Pos logit contributions
iblical+0.069
uca+0.069
leneck+0.067
 Painter+0.066
esan+0.062
Neg logit contributions
EMS-0.067
 Mehran-0.059
papers-0.056
 accuracy-0.055
":{"-0.054
Token
Token ID447
Feature activation+4.53e-03

Pos logit contributions
EMS+0.000
 Mehran+0.000
papers+0.000
 accuracy+0.000
":{"+0.000
Neg logit contributions
iblical-0.000
uca-0.000
leneck-0.000
 Painter-0.000
esan-0.000
Token
Token ID251
Feature activation-4.98e-03

Pos logit contributions
iblical+0.003
uca+0.003
leneck+0.003
 Painter+0.003
esan+0.003
Neg logit contributions
EMS-0.003
 Mehran-0.003
papers-0.003
 accuracy-0.003
":{"-0.003