TOP ACTIVATIONS
MAX = 0.567

D arth Camel though didn
a cos m ologist at Carnegie Mellon University in Pittsburgh ,
wrote that sign ificantly scaling back the Clinton Foundation
October 2016 addressed the de arth of movies and television shows
hes story " <|endoftext|> ( Phys . org ) -- An
, is inaccurate . Stat istically speaking , the ' average
Libya might sign ificantly curtail its oil
and a solid Neptune - mass planet . Ap J ,
s near - mon opoly on energy supplies to Europe
with 25 60 CU DA cores and 20 SM units .
function , ping becomes the subscriber for this topic . So
have mixed feelings about James Gunn 's GU ARD I ANS
fl uttering proudly in the breeze . Shia La Be ouf
King was a non - factor in his first year ,
they barely ripple in the breeze . In Shang ba ,
lone no vote was L ared o Sen . Judith Z
, a senior associate at Carnegie End owment of International Peace
is . Yesterday , James Gunn announced via his personal Facebook
into the very continental land mass itself . But that ,
and Michelle went on Me gyn Kelly s Fox

MIN ACTIVATIONS
MIN = -0.556

the word C ubs had come up
hashtag # Ask The D ubs , and be on the
cons ensual sex . Un cover ing the deception and rape
s been Nikol aj Eh lers and Patri k L aine
é , CM DR Kay mel Kan , pour le Command
with # Ask The D ubs , then tune in to
Time in Passing : Ment ions of Time in Fiction
is also the first grapp ler in the Street Fighter series
explained away as an aber ration of the primitive mind .
equivalent of an Olympic gymn astics routine for the speaker
3 . Daniels Prom inent Family
Danny Daniels and his wife , Jan
Drive Inn , and other Daniels theaters and properties in South
Hur tt has also tut ored outside linebacker Willie Young ,
to his name to become Daniels .)
This happened because many Nort her ners feared that if the
wife , Jan is Turk Daniels announce the birth of their
Jan is Turk Daniels , who also goes by
Stamford Bridge , the nort her ners were remarkably absent (
TER + Advent urous , Cult ured , Quick

INTERVAL 0.286 - 0.567
CONTAINS 0.268%

can do that except Jack Kirby !" According to King ,
taking a mis direction counter step behind Bell before cutting behind
and West All is - West Milwaukee . The expansion would
. Co inc identally , the Doors were formed
source ( com puters and monitors in our case ) but

INTERVAL 0.006 - 0.286
CONTAINS 46.095%

I only dropped it because of budgetary needs . This book
/ api instead . So make sure to double check this
May 21 , and out on Monday , Memorial Day .
:// api . nice hash . com / api instead .
Tang down , and set his vengeance into motion . While

INTERVAL -0.275 - 0.006
CONTAINS 53.281%

ge ois intellig ents ia resisted en masse letting itself be
loyal when it comes to white people with money and coat
reject and protest ideas we don 't agree with . That
get her pencil s and help color the maps I took
in 2006 . This Finnish album contains no original lyrics by

INTERVAL -0.556 - -0.275
CONTAINS 0.355%

) , as usual , with berries , but
started to unravel for the Caps , as St Louis scored
itself , and even then alleg ory was a weapon in
s in the F Z 1000 / RX 100 III ,
potential running mates , V Ps See Gallery Donald Trump passes
Token
Token ID447
Feature activation+1.50e-02

Pos logit contributions
arth+0.034
factor+0.034
byss+0.032
asu+0.032
Factor+0.032
Neg logit contributions
ubs-0.038
mel-0.035
cover-0.032
opus-0.031
-0.029
Token
Token ID251
Feature activation+4.93e-02

Pos logit contributions
arth+0.009
factor+0.009
byss+0.009
asu+0.009
Factor+0.009
Neg logit contributions
ubs-0.011
mel-0.010
cover-0.009
opus-0.009
-0.009
Token
Token ID198
Feature activation+1.35e-02

Pos logit contributions
arth+0.031
factor+0.030
asu+0.029
byss+0.029
Factor+0.029
Neg logit contributions
ubs-0.036
mel-0.034
cover-0.030
opus-0.029
-0.028
Token
Token ID198
Feature activation+9.84e-03

Pos logit contributions
arth+0.008
factor+0.008
asu+0.008
Factor+0.007
byss+0.007
Neg logit contributions
ubs-0.010
mel-0.010
cover-0.009
opus-0.008
-0.008
TokenD
Token ID35
Feature activation-9.12e-02

Pos logit contributions
arth+0.006
factor+0.006
asu+0.006
byss+0.006
Factor+0.006
Neg logit contributions
ubs-0.007
mel-0.007
cover-0.006
opus-0.006
-0.005
Tokenarth
Token ID11999
Feature activation+5.67e-01

Pos logit contributions
ubs+0.114
mel+0.110
cover+0.104
opus+0.103
+0.099
Neg logit contributions
factor-0.008
arth-0.008
byss-0.007
asu-0.005
Factor-0.005
Token Camel
Token ID43281
Feature activation-3.08e-02

Pos logit contributions
arth+0.372
factor+0.366
byss+0.355
asu+0.355
Factor+0.348
Neg logit contributions
ubs-0.388
mel-0.367
cover-0.329
opus-0.318
-0.301
Token though
Token ID996
Feature activation+1.17e-01

Pos logit contributions
ubs+0.021
mel+0.020
cover+0.018
opus+0.018
+0.017
Neg logit contributions
arth-0.020
factor-0.020
asu-0.019
byss-0.019
Factor-0.019
Token didn
Token ID1422
Feature activation-4.21e-02

Pos logit contributions
arth+0.076
factor+0.074
byss+0.072
asu+0.072
Factor+0.071
Neg logit contributions
ubs-0.082
mel-0.077
cover-0.070
opus-0.067
-0.064
Token
Token ID447
Feature activation+2.12e-02

Pos logit contributions
ubs+0.030
mel+0.029
cover+0.026
opus+0.025
+0.024
Neg logit contributions
arth-0.026
factor-0.026
asu-0.025
byss-0.025
Factor-0.024
Token
Token ID247
Feature activation+3.63e-02

Pos logit contributions
arth+0.012
factor+0.012
byss+0.012
asu+0.012
Factor+0.012
Neg logit contributions
ubs-0.016
mel-0.015
cover-0.014
opus-0.013
-0.013
Token a
Token ID257
Feature activation+2.07e-02

Pos logit contributions
arth+0.015
factor+0.015
byss+0.015
asu+0.015
Factor+0.014
Neg logit contributions
ubs-0.018
mel-0.017
cover-0.015
opus-0.015
-0.014
Token cos
Token ID8615
Feature activation+1.15e-01

Pos logit contributions
arth+0.012
factor+0.012
byss+0.011
asu+0.011
Factor+0.011
Neg logit contributions
ubs-0.016
mel-0.015
cover-0.014
opus-0.013
-0.013
Tokenm
Token ID76
Feature activation-7.71e-02

Pos logit contributions
arth+0.079
factor+0.077
byss+0.076
asu+0.076
Factor+0.074
Neg logit contributions
ubs-0.075
mel-0.071
cover-0.063
opus-0.061
-0.057
Tokenologist
Token ID7451
Feature activation-5.61e-02

Pos logit contributions
ubs+0.050
mel+0.047
cover+0.042
opus+0.040
+0.038
Neg logit contributions
arth-0.054
factor-0.053
byss-0.051
asu-0.051
Factor-0.050
Token at
Token ID379
Feature activation+2.78e-02

Pos logit contributions
ubs+0.045
mel+0.043
cover+0.039
opus+0.038
+0.036
Neg logit contributions
arth-0.031
factor-0.030
byss-0.029
asu-0.029
Factor-0.028
Token Carnegie
Token ID33976
Feature activation+5.28e-01

Pos logit contributions
arth+0.016
factor+0.016
byss+0.015
asu+0.015
Factor+0.015
Neg logit contributions
ubs-0.021
mel-0.020
cover-0.018
opus-0.018
-0.017
Token Mellon
Token ID49808
Feature activation-1.32e-01

Pos logit contributions
arth+0.379
factor+0.372
asu+0.363
byss+0.362
 Carnegie+0.357
Neg logit contributions
ubs-0.329
mel-0.308
cover-0.275
opus-0.266
-0.250
Token University
Token ID2059
Feature activation-6.99e-03

Pos logit contributions
ubs+0.094
mel+0.089
cover+0.080
opus+0.078
+0.074
Neg logit contributions
arth-0.083
factor-0.081
byss-0.079
asu-0.079
Factor-0.077
Token in
Token ID287
Feature activation-3.53e-02

Pos logit contributions
ubs+0.005
mel+0.005
cover+0.004
opus+0.004
+0.004
Neg logit contributions
arth-0.004
factor-0.004
byss-0.004
asu-0.004
Factor-0.004
Token Pittsburgh
Token ID12767
Feature activation-1.22e-01

Pos logit contributions
ubs+0.022
mel+0.020
cover+0.018
opus+0.017
+0.016
Neg logit contributions
arth-0.026
factor-0.025
byss-0.025
asu-0.024
Factor-0.024
Token,
Token ID11
Feature activation-1.29e-02

Pos logit contributions
ubs+0.085
mel+0.080
cover+0.072
opus+0.070
+0.066
Neg logit contributions
arth-0.078
factor-0.076
byss-0.075
asu-0.074
Factor-0.073
Token wrote
Token ID2630
Feature activation+1.55e-01

Pos logit contributions
arth+0.027
factor+0.027
byss+0.026
asu+0.026
Factor+0.025
Neg logit contributions
ubs-0.029
mel-0.028
cover-0.025
opus-0.024
-0.023
Token that
Token ID326
Feature activation+8.86e-02

Pos logit contributions
arth+0.089
factor+0.087
byss+0.085
asu+0.084
Factor+0.083
Neg logit contributions
ubs-0.118
mel-0.112
cover-0.102
opus-0.099
-0.094
Token
Token ID564
Feature activation-1.70e-03

Pos logit contributions
arth+0.054
factor+0.052
byss+0.051
asu+0.051
Factor+0.050
Neg logit contributions
ubs-0.065
mel-0.062
cover-0.056
opus-0.055
-0.051
Token
Token ID250
Feature activation-1.80e-02

Pos logit contributions
ubs+0.001
mel+0.001
cover+0.001
opus+0.001
+0.001
Neg logit contributions
arth-0.001
factor-0.001
byss-0.001
asu-0.001
Factor-0.001
Tokensign
Token ID12683
Feature activation+3.42e-01

Pos logit contributions
ubs+0.013
mel+0.012
cover+0.011
opus+0.010
+0.010
Neg logit contributions
arth-0.012
factor-0.011
asu-0.011
byss-0.011
Factor-0.011
Tokenificantly
Token ID42491
Feature activation+5.27e-01

Pos logit contributions
arth+0.085
factor+0.081
byss+0.075
asu+0.074
Factor+0.070
Neg logit contributions
ubs-0.373
mel-0.360
cover-0.337
opus-0.332
-0.320
Token scaling
Token ID20796
Feature activation+1.54e-01

Pos logit contributions
arth+0.329
factor+0.322
asu+0.312
byss+0.311
Factor+0.306
Neg logit contributions
ubs-0.377
mel-0.358
cover-0.322
opus-0.316
-0.295
Token back
Token ID736
Feature activation-4.85e-02

Pos logit contributions
arth+0.095
factor+0.093
byss+0.091
asu+0.090
Factor+0.089
Neg logit contributions
ubs-0.110
mel-0.105
cover-0.094
opus-0.092
-0.087
Token the
Token ID262
Feature activation-6.14e-03

Pos logit contributions
ubs+0.036
mel+0.034
cover+0.031
opus+0.030
+0.028
Neg logit contributions
arth-0.029
factor-0.029
byss-0.028
asu-0.028
Factor-0.027
Token Clinton
Token ID2605
Feature activation+7.44e-02

Pos logit contributions
ubs+0.004
mel+0.004
cover+0.004
opus+0.004
+0.004
Neg logit contributions
arth-0.004
factor-0.004
byss-0.004
asu-0.004
Factor-0.004
Token Foundation
Token ID5693
Feature activation-1.95e-01

Pos logit contributions
arth+0.056
factor+0.055
byss+0.054
asu+0.054
Factor+0.053
Neg logit contributions
ubs-0.043
mel-0.041
cover-0.036
opus-0.034
-0.032
Token October
Token ID3267
Feature activation-5.58e-02

Pos logit contributions
ubs+0.041
mel+0.039
cover+0.035
opus+0.034
+0.032
Neg logit contributions
arth-0.035
factor-0.034
byss-0.033
asu-0.033
Factor-0.032
Token 2016
Token ID1584
Feature activation+6.62e-03

Pos logit contributions
ubs+0.040
mel+0.038
cover+0.035
opus+0.034
+0.032
Neg logit contributions
arth-0.034
factor-0.034
byss-0.033
asu-0.033
Factor-0.032
Token addressed
Token ID9469
Feature activation+4.49e-03

Pos logit contributions
arth+0.004
factor+0.004
byss+0.004
asu+0.004
Factor+0.004
Neg logit contributions
ubs-0.005
mel-0.004
cover-0.004
opus-0.004
-0.004
Token the
Token ID262
Feature activation-4.98e-02

Pos logit contributions
arth+0.003
factor+0.003
byss+0.003
asu+0.003
Factor+0.003
Neg logit contributions
ubs-0.003
mel-0.003
cover-0.003
opus-0.003
-0.003
Token de
Token ID390
Feature activation+7.61e-02

Pos logit contributions
ubs+0.036
mel+0.034
cover+0.030
opus+0.030
+0.028
Neg logit contributions
arth-0.031
factor-0.031
byss-0.030
asu-0.030
Factor-0.029
Tokenarth
Token ID11999
Feature activation+5.26e-01

Pos logit contributions
arth+0.031
factor+0.030
byss+0.029
asu+0.029
Factor+0.028
Neg logit contributions
ubs-0.071
mel-0.068
cover-0.063
opus-0.062
-0.059
Token of
Token ID286
Feature activation-3.66e-02

Pos logit contributions
arth+0.296
factor+0.290
byss+0.282
asu+0.281
Factor+0.275
Neg logit contributions
ubs-0.409
mel-0.389
cover-0.354
opus-0.344
-0.326
Token movies
Token ID6918
Feature activation+5.21e-02

Pos logit contributions
ubs+0.026
mel+0.025
cover+0.023
opus+0.022
+0.021
Neg logit contributions
arth-0.023
factor-0.022
byss-0.022
asu-0.022
Factor-0.021
Token and
Token ID290
Feature activation-2.52e-02

Pos logit contributions
arth+0.031
factor+0.031
byss+0.030
asu+0.030
Factor+0.029
Neg logit contributions
ubs-0.039
mel-0.037
cover-0.033
opus-0.032
-0.030
Token television
Token ID5581
Feature activation-6.64e-02

Pos logit contributions
ubs+0.018
mel+0.018
cover+0.016
opus+0.015
+0.015
Neg logit contributions
arth-0.015
factor-0.015
byss-0.015
asu-0.014
Factor-0.014
Token shows
Token ID2523
Feature activation+3.08e-02

Pos logit contributions
ubs+0.050
mel+0.047
cover+0.043
opus+0.041
+0.039
Neg logit contributions
arth-0.039
factor-0.039
byss-0.038
asu-0.037
Factor-0.037
Tokenhes
Token ID956
Feature activation-1.09e-01

Pos logit contributions
ubs+0.103
mel+0.097
cover+0.085
opus+0.083
+0.078
Neg logit contributions
arth-0.134
factor-0.131
asu-0.128
byss-0.127
Factor-0.125
Token story
Token ID1621
Feature activation-2.29e-02

Pos logit contributions
ubs+0.077
mel+0.072
cover+0.065
opus+0.063
+0.060
Neg logit contributions
arth-0.070
factor-0.068
byss-0.067
asu-0.066
Factor-0.065
Token"
Token ID1
Feature activation-9.46e-03

Pos logit contributions
ubs+0.015
mel+0.014
cover+0.012
opus+0.012
+0.011
Neg logit contributions
arth-0.016
factor-0.016
byss-0.015
asu-0.015
Factor-0.015
Token<|endoftext|>
Token ID50256
Feature activation+7.44e-02

Pos logit contributions
ubs+0.007
mel+0.007
cover+0.006
opus+0.006
+0.005
Neg logit contributions
arth-0.006
factor-0.006
byss-0.005
asu-0.005
Factor-0.005
Token(
Token ID7
Feature activation+2.66e-02

Pos logit contributions
arth+0.046
factor+0.045
byss+0.044
asu+0.044
Factor+0.043
Neg logit contributions
ubs-0.054
mel-0.051
cover-0.046
opus-0.044
-0.042
TokenPhys
Token ID43215
Feature activation+5.21e-01

Pos logit contributions
arth+0.015
factor+0.015
byss+0.015
asu+0.015
Factor+0.014
Neg logit contributions
ubs-0.020
mel-0.019
cover-0.018
opus-0.017
-0.016
Token.
Token ID13
Feature activation+1.59e-02

Pos logit contributions
arth+0.324
factor+0.315
asu+0.307
byss+0.305
Factor+0.301
Neg logit contributions
ubs-0.373
mel-0.353
cover-0.318
opus-0.308
-0.296
Tokenorg
Token ID2398
Feature activation+7.34e-02

Pos logit contributions
arth+0.008
factor+0.008
asu+0.008
byss+0.007
 Carnegie+0.007
Neg logit contributions
ubs-0.013
mel-0.013
cover-0.011
opus-0.011
ominated-0.011
Token)
Token ID8
Feature activation-2.97e-02

Pos logit contributions
arth+0.059
factor+0.058
asu+0.057
byss+0.056
Factor+0.056
Neg logit contributions
ubs-0.040
mel-0.037
cover-0.031
opus-0.030
-0.028
Token --
Token ID1377
Feature activation+3.64e-02

Pos logit contributions
ubs+0.018
mel+0.017
cover+0.015
opus+0.015
+0.014
Neg logit contributions
arth-0.021
factor-0.021
byss-0.020
asu-0.020
Factor-0.020
Token An
Token ID1052
Feature activation-5.99e-02

Pos logit contributions
arth+0.022
factor+0.022
byss+0.021
asu+0.021
Factor+0.021
Neg logit contributions
ubs-0.026
mel-0.025
cover-0.023
opus-0.022
-0.021
Token,
Token ID11
Feature activation-1.18e-02

Pos logit contributions
arth+0.010
factor+0.010
byss+0.010
asu+0.010
Factor+0.010
Neg logit contributions
ubs-0.011
mel-0.010
cover-0.009
opus-0.009
-0.008
Token is
Token ID318
Feature activation-3.92e-02

Pos logit contributions
ubs+0.008
mel+0.008
cover+0.007
opus+0.007
+0.007
Neg logit contributions
arth-0.007
factor-0.007
asu-0.007
byss-0.007
Factor-0.007
Token inaccurate
Token ID21873
Feature activation+7.63e-02

Pos logit contributions
ubs+0.029
mel+0.027
cover+0.024
opus+0.024
+0.022
Neg logit contributions
arth-0.024
factor-0.023
byss-0.023
asu-0.023
Factor-0.022
Token.
Token ID13
Feature activation-2.71e-02

Pos logit contributions
arth+0.048
factor+0.047
byss+0.046
asu+0.046
Factor+0.045
Neg logit contributions
ubs-0.054
mel-0.051
cover-0.046
opus-0.045
-0.042
Token Stat
Token ID5133
Feature activation+5.39e-02

Pos logit contributions
ubs+0.020
mel+0.019
cover+0.017
opus+0.017
+0.016
Neg logit contributions
arth-0.016
factor-0.016
byss-0.016
asu-0.015
Factor-0.015
Tokenistically
Token ID16772
Feature activation+4.98e-01

Pos logit contributions
arth+0.007
factor+0.007
byss+0.006
asu+0.005
Factor+0.005
Neg logit contributions
ubs-0.065
mel-0.063
cover-0.059
opus-0.058
-0.057
Token speaking
Token ID5486
Feature activation+2.15e-01

Pos logit contributions
arth+0.259
factor+0.253
byss+0.245
asu+0.244
Factor+0.238
Neg logit contributions
ubs-0.408
mel-0.389
cover-0.356
opus-0.347
-0.331
Token,
Token ID11
Feature activation+2.41e-03

Pos logit contributions
arth+0.137
factor+0.135
byss+0.131
asu+0.131
Factor+0.128
Neg logit contributions
ubs-0.150
mel-0.142
cover-0.128
opus-0.124
-0.117
Token the
Token ID262
Feature activation-2.22e-02

Pos logit contributions
arth+0.001
factor+0.001
byss+0.001
asu+0.001
Factor+0.001
Neg logit contributions
ubs-0.002
mel-0.002
cover-0.002
opus-0.001
-0.001
Token '
Token ID705
Feature activation+9.26e-02

Pos logit contributions
ubs+0.016
mel+0.015
cover+0.014
opus+0.014
+0.013
Neg logit contributions
arth-0.013
factor-0.013
byss-0.013
asu-0.013
Factor-0.013
Tokenaverage
Token ID23913
Feature activation-1.41e-01

Pos logit contributions
arth+0.060
factor+0.059
byss+0.057
asu+0.057
Factor+0.056
Neg logit contributions
ubs-0.065
mel-0.061
cover-0.055
opus-0.053
-0.050
Token Libya
Token ID13465
Feature activation-3.88e-02

Pos logit contributions
arth+0.009
factor+0.008
byss+0.008
asu+0.008
Factor+0.008
Neg logit contributions
ubs-0.010
mel-0.010
cover-0.009
opus-0.009
-0.008
Token might
Token ID1244
Feature activation-2.90e-02

Pos logit contributions
ubs+0.027
mel+0.025
cover+0.023
opus+0.022
+0.021
Neg logit contributions
arth-0.025
factor-0.025
byss-0.024
asu-0.024
Factor-0.023
Token
Token ID564
Feature activation-1.10e-02

Pos logit contributions
ubs+0.022
mel+0.021
cover+0.019
opus+0.018
+0.017
Neg logit contributions
arth-0.017
factor-0.017
asu-0.016
byss-0.016
Factor-0.016
Token
Token ID246
Feature activation+2.09e-02

Pos logit contributions
ubs+0.008
mel+0.008
cover+0.007
opus+0.007
+0.006
Neg logit contributions
arth-0.007
factor-0.006
byss-0.006
asu-0.006
Factor-0.006
Tokensign
Token ID12683
Feature activation+2.90e-01

Pos logit contributions
arth+0.013
factor+0.013
asu+0.012
byss+0.012
Factor+0.012
Neg logit contributions
ubs-0.015
mel-0.014
cover-0.013
opus-0.013
-0.012
Tokenificantly
Token ID42491
Feature activation+4.96e-01

Pos logit contributions
arth+0.129
factor+0.126
byss+0.122
asu+0.120
Factor+0.117
Neg logit contributions
ubs-0.260
mel-0.249
cover-0.230
opus-0.224
-0.215
Token curtail
Token ID45962
Feature activation+2.64e-01

Pos logit contributions
arth+0.236
factor+0.231
asu+0.219
byss+0.217
Factor+0.215
Neg logit contributions
ubs-0.429
mel-0.410
cover-0.376
opus-0.369
-0.352
Token
Token ID447
Feature activation+7.06e-02

Pos logit contributions
arth+0.162
factor+0.158
byss+0.154
asu+0.153
Factor+0.151
Neg logit contributions
ubs-0.192
mel-0.182
cover-0.164
opus-0.160
-0.151
Token
Token ID247
Feature activation+4.72e-02

Pos logit contributions
arth+0.040
factor+0.039
byss+0.038
asu+0.038
Factor+0.037
Neg logit contributions
ubs-0.054
mel-0.052
cover-0.047
opus-0.046
-0.043
Token its
Token ID663
Feature activation-2.06e-02

Pos logit contributions
arth+0.027
factor+0.027
byss+0.026
asu+0.026
Factor+0.025
Neg logit contributions
ubs-0.036
mel-0.034
cover-0.031
opus-0.030
-0.029
Token oil
Token ID3056
Feature activation-6.79e-02

Pos logit contributions
ubs+0.015
mel+0.014
cover+0.013
opus+0.013
+0.012
Neg logit contributions
arth-0.012
factor-0.012
asu-0.012
byss-0.012
Factor-0.011
Token and
Token ID290
Feature activation+2.46e-02

Pos logit contributions
ubs+0.034
mel+0.032
cover+0.029
opus+0.028
+0.027
Neg logit contributions
arth-0.030
factor-0.029
byss-0.028
asu-0.028
Factor-0.028
Token a
Token ID257
Feature activation+4.71e-03

Pos logit contributions
arth+0.015
factor+0.015
byss+0.014
asu+0.014
Factor+0.014
Neg logit contributions
ubs-0.018
mel-0.017
cover-0.015
opus-0.015
-0.014
Token solid
Token ID4735
Feature activation-9.33e-02

Pos logit contributions
arth+0.003
factor+0.003
byss+0.003
asu+0.003
Factor+0.003
Neg logit contributions
ubs-0.003
mel-0.003
cover-0.003
opus-0.003
-0.003
Token Neptune
Token ID26461
Feature activation-6.15e-02

Pos logit contributions
ubs+0.067
mel+0.064
cover+0.058
opus+0.056
+0.053
Neg logit contributions
arth-0.058
factor-0.056
byss-0.055
asu-0.055
Factor-0.054
Token-
Token ID12
Feature activation-1.72e-02

Pos logit contributions
ubs+0.043
mel+0.041
cover+0.037
opus+0.036
+0.034
Neg logit contributions
arth-0.039
factor-0.039
byss-0.038
asu-0.037
Factor-0.037
Tokenmass
Token ID22208
Feature activation+4.79e-01

Pos logit contributions
ubs+0.012
mel+0.012
cover+0.010
opus+0.010
+0.009
Neg logit contributions
arth-0.011
factor-0.011
byss-0.010
asu-0.010
Factor-0.010
Token planet
Token ID5440
Feature activation+2.20e-01

Pos logit contributions
arth+0.270
factor+0.264
byss+0.258
asu+0.254
Factor+0.250
Neg logit contributions
ubs-0.372
mel-0.354
cover-0.323
opus-0.315
-0.297
Token.
Token ID13
Feature activation-3.84e-03

Pos logit contributions
arth+0.139
factor+0.137
byss+0.133
asu+0.132
Factor+0.130
Neg logit contributions
ubs-0.156
mel-0.148
cover-0.133
opus-0.129
-0.122
Token Ap
Token ID5949
Feature activation-4.80e-02

Pos logit contributions
ubs+0.003
mel+0.003
cover+0.002
opus+0.002
+0.002
Neg logit contributions
arth-0.002
factor-0.002
byss-0.002
asu-0.002
Factor-0.002
TokenJ
Token ID41
Feature activation+6.64e-02

Pos logit contributions
ubs+0.035
mel+0.033
cover+0.030
opus+0.029
+0.028
Neg logit contributions
arth-0.029
factor-0.029
asu-0.028
byss-0.028
Factor-0.027
Token,
Token ID11
Feature activation+1.10e-02

Pos logit contributions
arth+0.041
factor+0.040
byss+0.039
asu+0.039
Factor+0.038
Neg logit contributions
ubs-0.048
mel-0.046
cover-0.042
opus-0.040
-0.038
Token
Token ID247
Feature activation-2.28e-03

Pos logit contributions
arth+0.002
factor+0.002
byss+0.001
asu+0.001
Factor+0.001
Neg logit contributions
ubs-0.003
mel-0.002
cover-0.002
opus-0.002
-0.002
Tokens
Token ID82
Feature activation+1.60e-02

Pos logit contributions
ubs+0.002
mel+0.002
cover+0.002
opus+0.002
+0.001
Neg logit contributions
arth-0.001
factor-0.001
byss-0.001
asu-0.001
Factor-0.001
Token near
Token ID1474
Feature activation+1.63e-01

Pos logit contributions
arth+0.010
factor+0.010
asu+0.010
byss+0.009
Factor+0.009
Neg logit contributions
ubs-0.011
mel-0.011
cover-0.010
opus-0.009
-0.009
Token-
Token ID12
Feature activation-1.43e-02

Pos logit contributions
arth+0.104
factor+0.102
byss+0.099
asu+0.099
Factor+0.097
Neg logit contributions
ubs-0.115
mel-0.108
cover-0.098
opus-0.095
-0.089
Tokenmon
Token ID2144
Feature activation+2.33e-01

Pos logit contributions
ubs+0.010
mel+0.010
cover+0.009
opus+0.008
+0.008
Neg logit contributions
arth-0.009
factor-0.009
byss-0.009
asu-0.009
Factor-0.009
Tokenopoly
Token ID35894
Feature activation+4.78e-01

Pos logit contributions
arth+0.140
factor+0.136
asu+0.132
byss+0.132
Factor+0.128
Neg logit contributions
ubs-0.173
mel-0.164
cover-0.149
opus-0.144
-0.138
Token on
Token ID319
Feature activation+2.87e-03

Pos logit contributions
arth+0.287
factor+0.281
byss+0.273
asu+0.272
Factor+0.266
Neg logit contributions
ubs-0.354
mel-0.336
cover-0.304
opus-0.295
-0.280
Token energy
Token ID2568
Feature activation+7.22e-02

Pos logit contributions
arth+0.002
factor+0.002
byss+0.002
asu+0.002
Factor+0.002
Neg logit contributions
ubs-0.002
mel-0.002
cover-0.002
opus-0.002
-0.002
Token supplies
Token ID9416
Feature activation-1.38e-01

Pos logit contributions
arth+0.040
factor+0.039
byss+0.038
asu+0.038
Factor+0.037
Neg logit contributions
ubs-0.056
mel-0.053
cover-0.049
opus-0.048
-0.045
Token to
Token ID284
Feature activation-1.56e-02

Pos logit contributions
ubs+0.097
mel+0.092
cover+0.083
opus+0.081
+0.076
Neg logit contributions
arth-0.087
factor-0.086
byss-0.084
asu-0.083
Factor-0.082
Token Europe
Token ID2031
Feature activation+3.00e-02

Pos logit contributions
ubs+0.012
mel+0.011
cover+0.010
opus+0.010
+0.009
Neg logit contributions
arth-0.009
factor-0.009
byss-0.009
asu-0.009
Factor-0.009
Token with
Token ID351
Feature activation+1.06e-01

Pos logit contributions
arth+0.086
factor+0.084
byss+0.082
asu+0.082
Factor+0.080
Neg logit contributions
ubs-0.103
mel-0.098
cover-0.089
opus-0.086
-0.081
Token 25
Token ID1679
Feature activation+1.57e-01

Pos logit contributions
arth+0.064
factor+0.063
byss+0.061
asu+0.060
Factor+0.059
Neg logit contributions
ubs-0.079
mel-0.075
cover-0.068
opus-0.066
-0.062
Token60
Token ID1899
Feature activation+1.22e-01

Pos logit contributions
arth+0.096
factor+0.094
byss+0.091
asu+0.091
Factor+0.089
Neg logit contributions
ubs-0.115
mel-0.109
cover-0.099
opus-0.096
-0.091
Token CU
Token ID29369
Feature activation+3.60e-01

Pos logit contributions
arth+0.057
factor+0.056
byss+0.054
asu+0.053
Factor+0.052
Neg logit contributions
ubs-0.106
mel-0.101
cover-0.094
opus-0.092
-0.088
TokenDA
Token ID5631
Feature activation+9.76e-02

Pos logit contributions
arth+0.196
factor+0.189
Factor+0.183
asu+0.180
byss+0.174
Neg logit contributions
ubs-0.281
mel-0.269
cover-0.248
opus-0.247
-0.235
Token cores
Token ID21758
Feature activation+4.74e-01

Pos logit contributions
arth+0.046
factor+0.044
byss+0.044
asu+0.043
Factor+0.041
Neg logit contributions
ubs-0.085
mel-0.082
cover-0.075
opus-0.073
-0.070
Token and
Token ID290
Feature activation+1.39e-01

Pos logit contributions
arth+0.293
factor+0.288
byss+0.279
asu+0.278
Factor+0.273
Neg logit contributions
ubs-0.342
mel-0.324
cover-0.292
opus-0.284
-0.268
Token 20
Token ID1160
Feature activation+1.74e-01

Pos logit contributions
arth+0.082
factor+0.081
byss+0.078
asu+0.078
Factor+0.076
Neg logit contributions
ubs-0.103
mel-0.098
cover-0.089
opus-0.086
-0.082
Token SM
Token ID9447
Feature activation+7.38e-02

Pos logit contributions
arth+0.106
factor+0.105
byss+0.101
asu+0.101
Factor+0.099
Neg logit contributions
ubs-0.127
mel-0.119
cover-0.108
opus-0.105
-0.100
Token units
Token ID4991
Feature activation-2.56e-02

Pos logit contributions
arth+0.046
factor+0.045
byss+0.043
asu+0.043
Factor+0.043
Neg logit contributions
ubs-0.053
mel-0.050
cover-0.045
opus-0.044
-0.042
Token.
Token ID13
Feature activation+1.04e-01

Pos logit contributions
ubs+0.018
mel+0.017
cover+0.016
opus+0.015
+0.014
Neg logit contributions
arth-0.016
factor-0.016
byss-0.015
asu-0.015
Factor-0.015
Token function
Token ID2163
Feature activation+3.83e-03

Pos logit contributions
arth+0.021
factor+0.020
byss+0.020
asu+0.020
Factor+0.019
Neg logit contributions
ubs-0.030
mel-0.029
cover-0.026
opus-0.025
-0.024
Token,
Token ID11
Feature activation+3.40e-02

Pos logit contributions
arth+0.002
factor+0.002
byss+0.002
asu+0.002
Factor+0.002
Neg logit contributions
ubs-0.003
mel-0.003
cover-0.002
opus-0.002
-0.002
Token ping
Token ID29400
Feature activation+8.42e-02

Pos logit contributions
arth+0.021
factor+0.020
byss+0.020
asu+0.020
Factor+0.019
Neg logit contributions
ubs-0.025
mel-0.023
cover-0.021
opus-0.021
-0.019
Token becomes
Token ID4329
Feature activation+7.28e-02

Pos logit contributions
arth+0.056
factor+0.055
byss+0.053
asu+0.053
Factor+0.052
Neg logit contributions
ubs-0.057
mel-0.054
cover-0.049
opus-0.047
-0.044
Token the
Token ID262
Feature activation+3.53e-02

Pos logit contributions
arth+0.043
factor+0.042
byss+0.041
asu+0.041
Factor+0.040
Neg logit contributions
ubs-0.055
mel-0.052
cover-0.047
opus-0.046
-0.043
Token subscriber
Token ID32944
Feature activation+4.73e-01

Pos logit contributions
arth+0.022
factor+0.021
byss+0.021
asu+0.021
Factor+0.020
Neg logit contributions
ubs-0.026
mel-0.024
cover-0.022
opus-0.021
-0.020
Token for
Token ID329
Feature activation+5.21e-02

Pos logit contributions
arth+0.296
factor+0.290
byss+0.283
asu+0.281
Factor+0.276
Neg logit contributions
ubs-0.338
mel-0.320
cover-0.289
opus-0.280
-0.265
Token this
Token ID428
Feature activation+1.60e-02

Pos logit contributions
arth+0.031
factor+0.031
byss+0.030
asu+0.030
Factor+0.029
Neg logit contributions
ubs-0.039
mel-0.037
cover-0.033
opus-0.032
-0.031
Token topic
Token ID7243
Feature activation+4.37e-02

Pos logit contributions
arth+0.010
factor+0.010
byss+0.009
asu+0.009
Factor+0.009
Neg logit contributions
ubs-0.012
mel-0.011
cover-0.010
opus-0.010
-0.009
Token.
Token ID13
Feature activation+1.87e-02

Pos logit contributions
arth+0.027
factor+0.027
byss+0.026
asu+0.026
Factor+0.025
Neg logit contributions
ubs-0.031
mel-0.030
cover-0.027
opus-0.026
-0.025
Token So
Token ID1406
Feature activation-3.65e-02

Pos logit contributions
arth+0.011
factor+0.011
byss+0.011
asu+0.011
Factor+0.011
Neg logit contributions
ubs-0.014
mel-0.013
cover-0.012
opus-0.011
-0.011
Token have
Token ID423
Feature activation+3.31e-02

Pos logit contributions
arth+0.129
factor+0.127
asu+0.123
Factor+0.120
byss+0.120
Neg logit contributions
ubs-0.129
mel-0.122
cover-0.109
opus-0.105
-0.099
Token mixed
Token ID7668
Feature activation+4.13e-02

Pos logit contributions
arth+0.020
factor+0.020
byss+0.019
asu+0.019
Factor+0.019
Neg logit contributions
ubs-0.024
mel-0.023
cover-0.021
opus-0.020
-0.019
Token feelings
Token ID7666
Feature activation+4.82e-02

Pos logit contributions
arth+0.022
factor+0.021
asu+0.020
byss+0.020
Factor+0.020
Neg logit contributions
ubs-0.034
mel-0.032
cover-0.029
opus-0.029
-0.028
Token about
Token ID546
Feature activation+1.02e-01

Pos logit contributions
arth+0.026
factor+0.025
byss+0.024
asu+0.024
Factor+0.024
Neg logit contributions
ubs-0.039
mel-0.037
cover-0.034
opus-0.033
-0.031
Token James
Token ID3700
Feature activation+1.28e-01

Pos logit contributions
arth+0.063
factor+0.061
byss+0.060
asu+0.059
Factor+0.058
Neg logit contributions
ubs-0.075
mel-0.071
cover-0.064
opus-0.062
-0.059
Token Gunn
Token ID40837
Feature activation+4.69e-01

Pos logit contributions
arth+0.078
factor+0.076
byss+0.074
asu+0.074
Factor+0.072
Neg logit contributions
ubs-0.094
mel-0.089
cover-0.080
opus-0.078
-0.074
Token's
Token ID338
Feature activation+8.52e-03

Pos logit contributions
arth+0.317
factor+0.311
asu+0.302
byss+0.302
Factor+0.296
Neg logit contributions
ubs-0.313
mel-0.296
cover-0.264
opus-0.256
-0.241
Token GU
Token ID19348
Feature activation+1.24e-01

Pos logit contributions
arth+0.005
factor+0.005
byss+0.005
asu+0.005
Factor+0.004
Neg logit contributions
ubs-0.007
mel-0.006
cover-0.006
opus-0.006
-0.005
TokenARD
Token ID9795
Feature activation+3.08e-01

Pos logit contributions
arth+0.066
factor+0.065
byss+0.063
asu+0.062
Factor+0.061
Neg logit contributions
ubs-0.101
mel-0.096
cover-0.088
opus-0.085
-0.081
TokenI
Token ID40
Feature activation+2.83e-02

Pos logit contributions
arth+0.197
factor+0.193
asu+0.186
byss+0.184
Factor+0.184
Neg logit contributions
ubs-0.216
mel-0.206
cover-0.184
opus-0.179
-0.170
TokenANS
Token ID15037
Feature activation+8.35e-02

Pos logit contributions
arth+0.015
factor+0.014
byss+0.014
asu+0.014
Factor+0.013
Neg logit contributions
ubs-0.023
mel-0.022
cover-0.020
opus-0.020
-0.019
Token fl
Token ID781
Feature activation+1.34e-01

Pos logit contributions
arth+0.003
factor+0.003
asu+0.003
byss+0.003
Factor+0.003
Neg logit contributions
ubs-0.003
mel-0.003
cover-0.003
opus-0.003
-0.003
Tokenuttering
Token ID33598
Feature activation+1.63e-01

Pos logit contributions
arth+0.084
factor+0.082
asu+0.080
byss+0.079
Factor+0.077
Neg logit contributions
ubs-0.096
mel-0.091
cover-0.082
opus-0.080
-0.077
Token proudly
Token ID23017
Feature activation+2.64e-01

Pos logit contributions
arth+0.100
factor+0.098
asu+0.095
byss+0.094
Factor+0.093
Neg logit contributions
ubs-0.119
mel-0.113
cover-0.102
opus-0.099
-0.094
Token in
Token ID287
Feature activation+4.21e-02

Pos logit contributions
arth+0.161
factor+0.158
asu+0.153
byss+0.153
Factor+0.150
Neg logit contributions
ubs-0.192
mel-0.182
cover-0.165
opus-0.160
-0.152
Token the
Token ID262
Feature activation+1.09e-02

Pos logit contributions
arth+0.024
factor+0.024
byss+0.023
asu+0.023
Factor+0.023
Neg logit contributions
ubs-0.032
mel-0.030
cover-0.028
opus-0.027
-0.026
Token breeze
Token ID28633
Feature activation+4.68e-01

Pos logit contributions
arth+0.006
factor+0.006
byss+0.006
asu+0.006
Factor+0.005
Neg logit contributions
ubs-0.009
mel-0.008
cover-0.008
opus-0.007
-0.007
Token.
Token ID13
Feature activation+3.09e-02

Pos logit contributions
arth+0.291
factor+0.285
byss+0.277
asu+0.277
Factor+0.271
Neg logit contributions
ubs-0.337
mel-0.318
cover-0.287
opus-0.279
-0.264
Token Shia
Token ID27424
Feature activation-9.72e-02

Pos logit contributions
arth+0.019
factor+0.018
byss+0.018
asu+0.018
Factor+0.017
Neg logit contributions
ubs-0.023
mel-0.021
cover-0.019
opus-0.019
-0.018
Token La
Token ID4689
Feature activation-9.40e-02

Pos logit contributions
ubs+0.063
mel+0.059
cover+0.053
opus+0.051
+0.048
Neg logit contributions
arth-0.068
factor-0.066
asu-0.065
byss-0.065
Factor-0.063
TokenBe
Token ID3856
Feature activation-5.24e-02

Pos logit contributions
ubs+0.066
mel+0.061
cover+0.056
opus+0.054
+0.051
Neg logit contributions
arth-0.059
factor-0.057
asu-0.057
byss-0.057
Factor-0.056
Tokenouf
Token ID37116
Feature activation-1.33e-01

Pos logit contributions
ubs+0.029
mel+0.027
cover+0.024
opus+0.023
+0.021
Neg logit contributions
arth-0.041
factor-0.040
asu-0.039
byss-0.039
Factor-0.039
Token King
Token ID2677
Feature activation-8.80e-02

Pos logit contributions
ubs+0.020
mel+0.019
cover+0.017
opus+0.016
+0.015
Neg logit contributions
arth-0.017
factor-0.016
asu-0.016
byss-0.016
Factor-0.015
Token was
Token ID373
Feature activation+1.35e-02

Pos logit contributions
ubs+0.061
mel+0.057
cover+0.051
opus+0.050
+0.047
Neg logit contributions
arth-0.058
factor-0.056
asu-0.055
byss-0.055
Factor-0.054
Token a
Token ID257
Feature activation+2.06e-02

Pos logit contributions
arth+0.008
factor+0.008
asu+0.008
byss+0.008
Factor+0.008
Neg logit contributions
ubs-0.010
mel-0.009
cover-0.008
opus-0.008
-0.008
Token non
Token ID1729
Feature activation-2.20e-02

Pos logit contributions
arth+0.012
factor+0.012
byss+0.012
asu+0.012
Factor+0.011
Neg logit contributions
ubs-0.015
mel-0.014
cover-0.013
opus-0.013
-0.012
Token-
Token ID12
Feature activation-1.34e-02

Pos logit contributions
ubs+0.015
mel+0.015
cover+0.013
opus+0.013
+0.012
Neg logit contributions
arth-0.014
factor-0.014
byss-0.013
asu-0.013
Factor-0.013
Tokenfactor
Token ID31412
Feature activation+4.67e-01

Pos logit contributions
ubs+0.015
mel+0.014
cover+0.013
opus+0.013
+0.012
Neg logit contributions
arth-0.003
factor-0.003
byss-0.003
asu-0.003
Factor-0.003
Token in
Token ID287
Feature activation+1.88e-03

Pos logit contributions
arth+0.285
factor+0.279
byss+0.270
asu+0.270
Factor+0.264
Neg logit contributions
ubs-0.342
mel-0.325
cover-0.293
opus-0.286
-0.270
Token his
Token ID465
Feature activation+1.08e-01

Pos logit contributions
arth+0.001
factor+0.001
byss+0.001
asu+0.001
Factor+0.001
Neg logit contributions
ubs-0.001
mel-0.001
cover-0.001
opus-0.001
-0.001
Token first
Token ID717
Feature activation-3.16e-02

Pos logit contributions
arth+0.061
factor+0.059
asu+0.057
byss+0.057
Factor+0.056
Neg logit contributions
ubs-0.084
mel-0.080
cover-0.073
opus-0.071
-0.067
Token year
Token ID614
Feature activation-8.97e-02

Pos logit contributions
ubs+0.020
mel+0.019
cover+0.017
opus+0.017
+0.016
Neg logit contributions
arth-0.022
factor-0.022
asu-0.021
byss-0.021
Factor-0.021
Token,
Token ID11
Feature activation-1.91e-02

Pos logit contributions
ubs+0.067
mel+0.064
cover+0.058
opus+0.056
+0.053
Neg logit contributions
arth-0.053
factor-0.052
byss-0.051
asu-0.050
Factor-0.049
Token they
Token ID484
Feature activation-2.71e-02

Pos logit contributions
ubs+0.012
mel+0.012
cover+0.010
opus+0.010
+0.010
Neg logit contributions
arth-0.010
factor-0.010
asu-0.010
byss-0.010
Factor-0.010
Token barely
Token ID8523
Feature activation+6.01e-02

Pos logit contributions
ubs+0.020
mel+0.019
cover+0.017
opus+0.016
+0.015
Neg logit contributions
arth-0.017
factor-0.016
byss-0.016
asu-0.016
Factor-0.016
Token ripple
Token ID42462
Feature activation-1.05e-01

Pos logit contributions
arth+0.039
factor+0.038
byss+0.037
asu+0.037
Factor+0.036
Neg logit contributions
ubs-0.042
mel-0.040
cover-0.036
opus-0.034
-0.033
Token in
Token ID287
Feature activation-1.23e-02

Pos logit contributions
ubs+0.076
mel+0.072
cover+0.065
opus+0.063
+0.059
Neg logit contributions
arth-0.066
factor-0.064
byss-0.063
asu-0.062
Factor-0.061
Token the
Token ID262
Feature activation-4.22e-02

Pos logit contributions
ubs+0.009
mel+0.008
cover+0.008
opus+0.007
+0.007
Neg logit contributions
arth-0.008
factor-0.007
byss-0.007
asu-0.007
Factor-0.007
Token breeze
Token ID28633
Feature activation+4.58e-01

Pos logit contributions
ubs+0.032
mel+0.030
cover+0.027
opus+0.026
+0.025
Neg logit contributions
arth-0.025
factor-0.025
byss-0.024
asu-0.024
Factor-0.023
Token.
Token ID13
Feature activation-2.25e-02

Pos logit contributions
arth+0.290
factor+0.284
byss+0.276
asu+0.275
Factor+0.270
Neg logit contributions
ubs-0.324
mel-0.306
cover-0.276
opus-0.268
-0.253
Token In
Token ID554
Feature activation-3.18e-02

Pos logit contributions
ubs+0.016
mel+0.016
cover+0.014
opus+0.014
+0.013
Neg logit contributions
arth-0.014
factor-0.013
byss-0.013
asu-0.013
Factor-0.013
Token Shang
Token ID45797
Feature activation-7.36e-02

Pos logit contributions
ubs+0.022
mel+0.021
cover+0.019
opus+0.018
+0.017
Neg logit contributions
arth-0.020
factor-0.020
asu-0.019
byss-0.019
Factor-0.019
Tokenba
Token ID7012
Feature activation-6.43e-02

Pos logit contributions
ubs+0.049
mel+0.046
cover+0.041
opus+0.040
+0.038
Neg logit contributions
arth-0.049
factor-0.048
asu-0.047
byss-0.046
Factor-0.046
Token,
Token ID11
Feature activation-4.51e-02

Pos logit contributions
ubs+0.045
mel+0.042
cover+0.038
opus+0.037
+0.035
Neg logit contributions
arth-0.041
factor-0.041
byss-0.040
asu-0.039
Factor-0.039
Token lone
Token ID13497
Feature activation+1.23e-01

Pos logit contributions
arth+0.052
factor+0.050
asu+0.049
byss+0.049
Factor+0.048
Neg logit contributions
ubs-0.057
mel-0.054
cover-0.049
opus-0.048
-0.045
Token no
Token ID645
Feature activation+8.34e-02

Pos logit contributions
arth+0.078
factor+0.077
byss+0.074
asu+0.074
Factor+0.073
Neg logit contributions
ubs-0.087
mel-0.083
cover-0.074
opus-0.072
-0.068
Token vote
Token ID3015
Feature activation+2.02e-01

Pos logit contributions
arth+0.054
factor+0.053
byss+0.052
asu+0.052
Factor+0.051
Neg logit contributions
ubs-0.058
mel-0.054
cover-0.049
opus-0.047
-0.045
Token was
Token ID373
Feature activation+2.17e-02

Pos logit contributions
arth+0.131
factor+0.129
byss+0.126
asu+0.125
Factor+0.123
Neg logit contributions
ubs-0.139
mel-0.132
cover-0.118
opus-0.114
-0.108
Token L
Token ID406
Feature activation-5.22e-02

Pos logit contributions
arth+0.014
factor+0.013
byss+0.013
asu+0.013
Factor+0.013
Neg logit contributions
ubs-0.016
mel-0.015
cover-0.013
opus-0.013
-0.012
Tokenared
Token ID1144
Feature activation+4.56e-01

Pos logit contributions
ubs+0.038
mel+0.036
cover+0.033
opus+0.032
+0.030
Neg logit contributions
arth-0.032
factor-0.031
byss-0.030
asu-0.030
Factor-0.030
Tokeno
Token ID78
Feature activation+4.42e-02

Pos logit contributions
arth+0.265
factor+0.258
asu+0.252
byss+0.250
Factor+0.244
Neg logit contributions
ubs-0.347
mel-0.329
cover-0.299
opus-0.292
-0.282
Token Sen
Token ID2311
Feature activation-8.50e-02

Pos logit contributions
arth+0.028
factor+0.028
byss+0.027
asu+0.027
Factor+0.026
Neg logit contributions
ubs-0.031
mel-0.029
cover-0.026
opus-0.026
-0.024
Token.
Token ID13
Feature activation+9.78e-03

Pos logit contributions
ubs+0.060
mel+0.057
cover+0.052
opus+0.050
+0.047
Neg logit contributions
arth-0.054
factor-0.052
byss-0.051
asu-0.051
Factor-0.050
Token Judith
Token ID39987
Feature activation+3.83e-03

Pos logit contributions
arth+0.006
factor+0.006
byss+0.005
asu+0.005
Factor+0.005
Neg logit contributions
ubs-0.007
mel-0.007
cover-0.006
opus-0.006
-0.006
Token Z
Token ID1168
Feature activation+1.98e-02

Pos logit contributions
arth+0.002
factor+0.002
byss+0.002
asu+0.002
Factor+0.002
Neg logit contributions
ubs-0.003
mel-0.003
cover-0.002
opus-0.002
-0.002
Token,
Token ID11
Feature activation-2.00e-02

Pos logit contributions
ubs+0.068
mel+0.064
cover+0.058
opus+0.056
+0.053
Neg logit contributions
arth-0.062
factor-0.060
byss-0.059
asu-0.059
Factor-0.057
Token a
Token ID257
Feature activation-1.61e-02

Pos logit contributions
ubs+0.015
mel+0.014
cover+0.013
opus+0.012
+0.012
Neg logit contributions
arth-0.012
factor-0.012
byss-0.011
asu-0.011
Factor-0.011
Token senior
Token ID4664
Feature activation+1.08e-01

Pos logit contributions
ubs+0.012
mel+0.011
cover+0.010
opus+0.010
+0.009
Neg logit contributions
arth-0.010
factor-0.009
asu-0.009
byss-0.009
Factor-0.009
Token associate
Token ID11602
Feature activation+5.12e-02

Pos logit contributions
arth+0.058
factor+0.058
asu+0.055
byss+0.055
Factor+0.054
Neg logit contributions
ubs-0.087
mel-0.083
cover-0.075
opus-0.073
-0.070
Token at
Token ID379
Feature activation+2.19e-02

Pos logit contributions
arth+0.031
factor+0.030
asu+0.029
byss+0.029
Factor+0.028
Neg logit contributions
ubs-0.038
mel-0.036
cover-0.032
opus-0.032
-0.030
Token Carnegie
Token ID33976
Feature activation+4.54e-01

Pos logit contributions
arth+0.013
factor+0.013
byss+0.012
asu+0.012
Factor+0.012
Neg logit contributions
ubs-0.017
mel-0.016
cover-0.014
opus-0.014
-0.013
Token End
Token ID5268
Feature activation-1.37e-02

Pos logit contributions
arth+0.331
factor+0.327
asu+0.318
byss+0.317
 Carnegie+0.312
Neg logit contributions
ubs-0.276
mel-0.259
cover-0.229
opus-0.221
-0.208
Tokenowment
Token ID36569
Feature activation+6.45e-02

Pos logit contributions
ubs+0.012
mel+0.011
cover+0.010
opus+0.010
+0.010
Neg logit contributions
arth-0.007
factor-0.006
byss-0.006
asu-0.006
Factor-0.006
Token of
Token ID286
Feature activation-2.66e-02

Pos logit contributions
arth+0.039
factor+0.038
byss+0.037
asu+0.037
Factor+0.036
Neg logit contributions
ubs-0.048
mel-0.045
cover-0.041
opus-0.040
-0.038
Token International
Token ID4037
Feature activation-1.30e-01

Pos logit contributions
ubs+0.014
mel+0.013
cover+0.012
opus+0.011
+0.010
Neg logit contributions
arth-0.021
factor-0.021
byss-0.021
asu-0.021
Factor-0.020
Token Peace
Token ID12689
Feature activation+1.55e-02

Pos logit contributions
ubs+0.094
mel+0.089
cover+0.081
opus+0.078
+0.074
Neg logit contributions
arth-0.081
factor-0.079
byss-0.077
asu-0.077
Factor-0.075
Token is
Token ID318
Feature activation+3.99e-03

Pos logit contributions
arth+0.045
factor+0.044
byss+0.043
asu+0.043
Factor+0.042
Neg logit contributions
ubs-0.049
mel-0.046
cover-0.042
opus-0.041
-0.038
Token.
Token ID13
Feature activation+1.76e-02

Pos logit contributions
arth+0.003
factor+0.002
byss+0.002
asu+0.002
Factor+0.002
Neg logit contributions
ubs-0.003
mel-0.003
cover-0.002
opus-0.002
-0.002
Token Yesterday
Token ID34668
Feature activation-1.86e-01

Pos logit contributions
arth+0.011
factor+0.010
byss+0.010
asu+0.010
Factor+0.010
Neg logit contributions
ubs-0.013
mel-0.012
cover-0.011
opus-0.011
-0.010
Token,
Token ID11
Feature activation+2.17e-02

Pos logit contributions
ubs+0.130
mel+0.123
cover+0.111
opus+0.107
+0.101
Neg logit contributions
arth-0.120
factor-0.118
byss-0.115
asu-0.114
Factor-0.112
Token James
Token ID3700
Feature activation+7.19e-02

Pos logit contributions
arth+0.013
factor+0.013
byss+0.013
asu+0.013
Factor+0.012
Neg logit contributions
ubs-0.016
mel-0.015
cover-0.014
opus-0.013
-0.013
Token Gunn
Token ID40837
Feature activation+4.50e-01

Pos logit contributions
arth+0.024
factor+0.023
byss+0.022
asu+0.022
Factor+0.021
Neg logit contributions
ubs-0.072
mel-0.070
cover-0.065
opus-0.063
-0.061
Token announced
Token ID3414
Feature activation+6.32e-02

Pos logit contributions
arth+0.288
factor+0.282
asu+0.273
byss+0.272
Factor+0.268
Neg logit contributions
ubs-0.316
mel-0.299
cover-0.269
opus-0.261
-0.246
Token via
Token ID2884
Feature activation+2.12e-02

Pos logit contributions
arth+0.038
factor+0.037
byss+0.036
asu+0.036
Factor+0.035
Neg logit contributions
ubs-0.047
mel-0.044
cover-0.040
opus-0.039
-0.037
Token his
Token ID465
Feature activation+1.08e-01

Pos logit contributions
arth+0.014
factor+0.014
byss+0.014
asu+0.014
Factor+0.014
Neg logit contributions
ubs-0.014
mel-0.013
cover-0.012
opus-0.011
-0.011
Token personal
Token ID2614
Feature activation+1.47e-01

Pos logit contributions
arth+0.078
factor+0.076
byss+0.074
asu+0.074
Factor+0.073
Neg logit contributions
ubs-0.067
mel-0.063
cover-0.056
opus-0.054
-0.050
Token Facebook
Token ID3203
Feature activation+7.87e-02

Pos logit contributions
arth+0.106
factor+0.104
asu+0.101
byss+0.101
Factor+0.099
Neg logit contributions
ubs-0.091
mel-0.085
cover-0.075
opus-0.073
-0.068
Token into
Token ID656
Feature activation-6.53e-02

Pos logit contributions
ubs+0.040
mel+0.038
cover+0.035
opus+0.034
+0.032
Neg logit contributions
arth-0.035
factor-0.034
byss-0.034
asu-0.033
Factor-0.033
Token the
Token ID262
Feature activation-3.20e-04

Pos logit contributions
ubs+0.047
mel+0.044
cover+0.040
opus+0.039
+0.037
Neg logit contributions
arth-0.041
factor-0.040
byss-0.039
asu-0.039
Factor-0.038
Token very
Token ID845
Feature activation+5.22e-02

Pos logit contributions
ubs+0.000
mel+0.000
cover+0.000
opus+0.000
+0.000
Neg logit contributions
arth-0.000
factor-0.000
byss-0.000
asu-0.000
Factor-0.000
Token continental
Token ID29843
Feature activation-9.19e-02

Pos logit contributions
arth+0.031
factor+0.031
byss+0.030
asu+0.030
Factor+0.029
Neg logit contributions
ubs-0.039
mel-0.037
cover-0.033
opus-0.032
-0.031
Token land
Token ID1956
Feature activation+5.32e-02

Pos logit contributions
ubs+0.064
mel+0.061
cover+0.055
opus+0.053
+0.050
Neg logit contributions
arth-0.059
factor-0.058
byss-0.056
asu-0.056
Factor-0.055
Tokenmass
Token ID22208
Feature activation+4.46e-01

Pos logit contributions
arth+0.015
factor+0.015
byss+0.014
asu+0.014
Factor+0.013
Neg logit contributions
ubs-0.056
mel-0.054
cover-0.050
opus-0.050
-0.048
Token itself
Token ID2346
Feature activation-8.67e-02

Pos logit contributions
arth+0.270
factor+0.264
byss+0.257
asu+0.256
Factor+0.251
Neg logit contributions
ubs-0.327
mel-0.310
cover-0.281
opus-0.273
-0.258
Token.
Token ID13
Feature activation+1.36e-02

Pos logit contributions
ubs+0.061
mel+0.057
cover+0.052
opus+0.050
+0.047
Neg logit contributions
arth-0.056
factor-0.055
byss-0.053
asu-0.053
Factor-0.052
Token But
Token ID887
Feature activation+2.27e-02

Pos logit contributions
arth+0.008
factor+0.008
byss+0.008
asu+0.008
Factor+0.008
Neg logit contributions
ubs-0.010
mel-0.009
cover-0.008
opus-0.008
-0.008
Token that
Token ID326
Feature activation+3.37e-02

Pos logit contributions
arth+0.014
factor+0.013
asu+0.013
byss+0.013
Factor+0.013
Neg logit contributions
ubs-0.017
mel-0.016
cover-0.014
opus-0.014
-0.013
Token,
Token ID11
Feature activation+2.51e-02

Pos logit contributions
arth+0.022
factor+0.021
asu+0.021
byss+0.021
Factor+0.020
Neg logit contributions
ubs-0.024
mel-0.022
cover-0.020
opus-0.019
-0.018
Token and
Token ID290
Feature activation+1.81e-02

Pos logit contributions
arth+0.025
factor+0.025
asu+0.024
byss+0.024
Factor+0.023
Neg logit contributions
ubs-0.027
mel-0.026
cover-0.023
opus-0.023
-0.021
Token Michelle
Token ID16738
Feature activation+4.01e-02

Pos logit contributions
arth+0.010
factor+0.010
byss+0.010
asu+0.010
Factor+0.009
Neg logit contributions
ubs-0.014
mel-0.013
cover-0.012
opus-0.012
-0.011
Token went
Token ID1816
Feature activation-3.50e-02

Pos logit contributions
arth+0.026
factor+0.025
byss+0.025
asu+0.025
Factor+0.024
Neg logit contributions
ubs-0.028
mel-0.026
cover-0.024
opus-0.023
-0.022
Token on
Token ID319
Feature activation+1.25e-02

Pos logit contributions
ubs+0.025
mel+0.024
cover+0.022
opus+0.021
+0.020
Neg logit contributions
arth-0.021
factor-0.021
asu-0.021
byss-0.020
Factor-0.020
Token Me
Token ID2185
Feature activation-5.78e-02

Pos logit contributions
arth+0.008
factor+0.007
byss+0.007
asu+0.007
Factor+0.007
Neg logit contributions
ubs-0.009
mel-0.009
cover-0.008
opus-0.008
-0.007
Tokengyn
Token ID40183
Feature activation+4.41e-01

Pos logit contributions
ubs+0.067
mel+0.065
cover+0.061
opus+0.060
+0.058
Neg logit contributions
arth-0.010
factor-0.010
byss-0.009
asu-0.008
Factor-0.008
Token Kelly
Token ID9077
Feature activation+2.33e-02

Pos logit contributions
arth+0.244
factor+0.239
asu+0.229
byss+0.227
Factor+0.225
Neg logit contributions
ubs-0.350
mel-0.332
cover-0.303
opus-0.295
-0.281
Token
Token ID447
Feature activation+5.29e-03

Pos logit contributions
arth+0.016
factor+0.016
byss+0.015
asu+0.015
Factor+0.015
Neg logit contributions
ubs-0.015
mel-0.014
cover-0.013
opus-0.012
-0.012
Token
Token ID247
Feature activation+1.50e-02

Pos logit contributions
arth+0.003
factor+0.003
byss+0.003
asu+0.003
Factor+0.003
Neg logit contributions
ubs-0.004
mel-0.004
cover-0.003
opus-0.003
-0.003
Tokens
Token ID82
Feature activation+6.73e-03

Pos logit contributions
arth+0.009
factor+0.009
byss+0.008
asu+0.008
Factor+0.008
Neg logit contributions
ubs-0.011
mel-0.011
cover-0.010
opus-0.009
-0.009
Token Fox
Token ID5426
Feature activation+1.28e-01

Pos logit contributions
arth+0.004
factor+0.004
asu+0.004
byss+0.004
Factor+0.004
Neg logit contributions
ubs-0.005
mel-0.004
cover-0.004
opus-0.004
-0.004
Token the
Token ID262
Feature activation-2.25e-02

Pos logit contributions
ubs+0.002
mel+0.002
cover+0.001
opus+0.001
+0.001
Neg logit contributions
arth-0.001
factor-0.001
asu-0.001
byss-0.001
Factor-0.001
Token word
Token ID1573
Feature activation+1.76e-01

Pos logit contributions
ubs+0.015
mel+0.014
cover+0.013
opus+0.013
+0.012
Neg logit contributions
arth-0.015
factor-0.015
asu-0.014
byss-0.014
Factor-0.014
Token
Token ID564
Feature activation-4.06e-02

Pos logit contributions
arth+0.101
factor+0.099
byss+0.096
asu+0.095
Factor+0.093
Neg logit contributions
ubs-0.134
mel-0.128
cover-0.116
opus-0.113
-0.107
Token
Token ID250
Feature activation-2.47e-02

Pos logit contributions
ubs+0.029
mel+0.027
cover+0.024
opus+0.024
+0.022
Neg logit contributions
arth-0.026
factor-0.025
byss-0.025
asu-0.025
Factor-0.024
TokenC
Token ID34
Feature activation-6.94e-02

Pos logit contributions
ubs+0.018
mel+0.017
cover+0.015
opus+0.015
+0.014
Neg logit contributions
arth-0.015
factor-0.015
byss-0.015
asu-0.014
Factor-0.014
Tokenubs
Token ID23161
Feature activation-5.56e-01

Pos logit contributions
ubs+0.006
mel+0.005
cover+0.001
opus-0.001
-0.003
Neg logit contributions
arth-0.086
factor-0.085
asu-0.083
byss-0.083
Factor-0.082
Token
Token ID447
Feature activation-5.65e-04

Pos logit contributions
ubs+0.388
mel+0.367
cover+0.330
opus+0.320
+0.302
Neg logit contributions
arth-0.357
factor-0.351
byss-0.342
asu-0.340
Factor-0.334
Token
Token ID251
Feature activation+5.14e-02

Pos logit contributions
ubs+0.000
mel+0.000
cover+0.000
opus+0.000
+0.000
Neg logit contributions
arth-0.000
factor-0.000
byss-0.000
asu-0.000
Factor-0.000
Token had
Token ID550
Feature activation+1.29e-02

Pos logit contributions
arth+0.030
factor+0.029
byss+0.028
asu+0.028
Factor+0.028
Neg logit contributions
ubs-0.039
mel-0.037
cover-0.034
opus-0.033
-0.031
Token come
Token ID1282
Feature activation+2.99e-02

Pos logit contributions
arth+0.008
factor+0.008
asu+0.008
byss+0.008
Factor+0.008
Neg logit contributions
ubs-0.009
mel-0.009
cover-0.008
opus-0.008
-0.007
Token up
Token ID510
Feature activation-9.17e-02

Pos logit contributions
arth+0.019
factor+0.019
byss+0.018
asu+0.018
Factor+0.018
Neg logit contributions
ubs-0.021
mel-0.020
cover-0.018
opus-0.017
-0.016
Token hashtag
Token ID23950
Feature activation-1.50e-01

Pos logit contributions
ubs+0.001
mel+0.001
cover+0.001
opus+0.001
+0.001
Neg logit contributions
arth-0.003
factor-0.003
asu-0.003
byss-0.003
Factor-0.003
Token #
Token ID1303
Feature activation-9.39e-02

Pos logit contributions
ubs+0.079
mel+0.073
cover+0.063
opus+0.060
+0.055
Neg logit contributions
arth-0.123
factor-0.121
byss-0.119
asu-0.118
Factor-0.116
TokenAsk
Token ID25214
Feature activation+2.14e-01

Pos logit contributions
ubs+0.064
mel+0.060
cover+0.054
opus+0.053
+0.050
Neg logit contributions
arth-0.062
factor-0.061
byss-0.059
asu-0.059
Factor-0.058
TokenThe
Token ID464
Feature activation+4.15e-02

Pos logit contributions
arth+0.132
factor+0.129
asu+0.125
byss+0.125
Factor+0.123
Neg logit contributions
ubs-0.155
mel-0.147
cover-0.133
opus-0.129
-0.123
TokenD
Token ID35
Feature activation-7.61e-02

Pos logit contributions
arth+0.025
factor+0.025
asu+0.024
byss+0.024
Factor+0.024
Neg logit contributions
ubs-0.030
mel-0.029
cover-0.026
opus-0.025
-0.024
Tokenubs
Token ID23161
Feature activation-5.55e-01

Pos logit contributions
ubs+0.054
mel+0.051
cover+0.046
opus+0.044
+0.042
Neg logit contributions
arth-0.049
factor-0.048
byss-0.047
asu-0.046
Factor-0.045
Token,
Token ID11
Feature activation+1.88e-02

Pos logit contributions
ubs+0.396
mel+0.375
cover+0.338
opus+0.328
+0.310
Neg logit contributions
arth-0.349
factor-0.342
byss-0.333
asu-0.331
Factor-0.325
Token and
Token ID290
Feature activation+1.03e-02

Pos logit contributions
arth+0.011
factor+0.011
byss+0.010
asu+0.010
Factor+0.010
Neg logit contributions
ubs-0.014
mel-0.014
cover-0.013
opus-0.012
-0.012
Token be
Token ID307
Feature activation+4.83e-02

Pos logit contributions
arth+0.006
factor+0.006
asu+0.006
byss+0.006
Factor+0.006
Neg logit contributions
ubs-0.007
mel-0.007
cover-0.006
opus-0.006
-0.006
Token on
Token ID319
Feature activation+2.46e-02

Pos logit contributions
arth+0.023
factor+0.022
byss+0.022
asu+0.022
Factor+0.021
Neg logit contributions
ubs-0.042
mel-0.040
cover-0.037
opus-0.036
-0.034
Token the
Token ID262
Feature activation+1.51e-02

Pos logit contributions
arth+0.015
factor+0.015
byss+0.014
asu+0.014
Factor+0.014
Neg logit contributions
ubs-0.018
mel-0.017
cover-0.015
opus-0.015
-0.014
Tokencons
Token ID5936
Feature activation-9.38e-02

Pos logit contributions
ubs+0.031
mel+0.029
cover+0.026
opus+0.025
+0.024
Neg logit contributions
arth-0.034
factor-0.034
asu-0.033
byss-0.033
Factor-0.032
Tokenensual
Token ID31406
Feature activation-8.58e-02

Pos logit contributions
ubs+0.057
mel+0.053
cover+0.047
opus+0.045
+0.043
Neg logit contributions
arth-0.069
factor-0.068
asu-0.066
byss-0.066
Factor-0.065
Token sex
Token ID1714
Feature activation+3.69e-02

Pos logit contributions
ubs+0.059
mel+0.056
cover+0.050
opus+0.049
+0.046
Neg logit contributions
arth-0.056
factor-0.055
byss-0.053
asu-0.053
Factor-0.052
Token.
Token ID13
Feature activation-2.61e-02

Pos logit contributions
arth+0.024
factor+0.023
byss+0.023
asu+0.023
Factor+0.022
Neg logit contributions
ubs-0.026
mel-0.024
cover-0.022
opus-0.021
-0.020
Token Un
Token ID791
Feature activation-3.82e-02

Pos logit contributions
ubs+0.019
mel+0.018
cover+0.016
opus+0.016
+0.015
Neg logit contributions
arth-0.016
factor-0.016
asu-0.015
byss-0.015
Factor-0.015
Tokencover
Token ID9631
Feature activation-5.50e-01

Pos logit contributions
ubs+0.025
mel+0.023
cover+0.020
opus+0.020
+0.019
Neg logit contributions
arth-0.026
factor-0.026
asu-0.025
byss-0.025
Factor-0.025
Tokening
Token ID278
Feature activation+4.14e-02

Pos logit contributions
ubs+0.436
mel+0.416
cover+0.379
opus+0.368
+0.351
Neg logit contributions
arth-0.302
factor-0.295
byss-0.287
asu-0.284
Factor-0.278
Token the
Token ID262
Feature activation-2.06e-02

Pos logit contributions
arth+0.026
factor+0.025
byss+0.024
asu+0.024
Factor+0.024
Neg logit contributions
ubs-0.030
mel-0.028
cover-0.026
opus-0.025
-0.023
Token deception
Token ID26337
Feature activation-3.53e-02

Pos logit contributions
ubs+0.015
mel+0.014
cover+0.013
opus+0.013
+0.012
Neg logit contributions
arth-0.013
factor-0.012
byss-0.012
asu-0.012
Factor-0.012
Token and
Token ID290
Feature activation-2.02e-02

Pos logit contributions
ubs+0.025
mel+0.024
cover+0.022
opus+0.021
+0.020
Neg logit contributions
arth-0.022
factor-0.021
byss-0.021
asu-0.021
Factor-0.020
Token rape
Token ID7262
Feature activation-7.69e-02

Pos logit contributions
ubs+0.014
mel+0.014
cover+0.012
opus+0.012
+0.011
Neg logit contributions
arth-0.013
factor-0.013
byss-0.012
asu-0.012
Factor-0.012
Tokens
Token ID82
Feature activation+1.21e-02

Pos logit contributions
ubs+0.025
mel+0.024
cover+0.022
opus+0.022
+0.021
Neg logit contributions
arth-0.017
factor-0.016
asu-0.016
byss-0.015
Factor-0.015
Token been
Token ID587
Feature activation-5.56e-02

Pos logit contributions
arth+0.007
factor+0.007
byss+0.006
asu+0.006
Factor+0.006
Neg logit contributions
ubs-0.009
mel-0.009
cover-0.008
opus-0.008
-0.008
Token Nikol
Token ID48127
Feature activation-1.56e-02

Pos logit contributions
ubs+0.041
mel+0.039
cover+0.036
opus+0.035
+0.033
Neg logit contributions
arth-0.033
factor-0.032
byss-0.031
asu-0.031
Factor-0.031
Tokenaj
Token ID1228
Feature activation-8.63e-02

Pos logit contributions
ubs+0.009
mel+0.009
cover+0.008
opus+0.007
+0.007
Neg logit contributions
arth-0.012
factor-0.011
asu-0.011
byss-0.011
Factor-0.011
Token Eh
Token ID31480
Feature activation+2.39e-02

Pos logit contributions
ubs+0.069
mel+0.065
cover+0.060
opus+0.058
+0.056
Neg logit contributions
arth-0.047
factor-0.046
asu-0.044
byss-0.044
Factor-0.043
Tokenlers
Token ID8116
Feature activation-5.43e-01

Pos logit contributions
arth+0.027
factor+0.026
asu+0.026
byss+0.026
Factor+0.026
Neg logit contributions
ubs-0.005
mel-0.004
cover-0.003
opus-0.003
-0.002
Token and
Token ID290
Feature activation-2.46e-02

Pos logit contributions
ubs+0.386
mel+0.365
cover+0.329
opus+0.319
+0.301
Neg logit contributions
arth-0.343
factor-0.336
byss-0.327
asu-0.325
Factor-0.319
Token Patri
Token ID9606
Feature activation+7.30e-02

Pos logit contributions
ubs+0.018
mel+0.017
cover+0.016
opus+0.015
+0.014
Neg logit contributions
arth-0.015
factor-0.015
byss-0.014
asu-0.014
Factor-0.014
Tokenk
Token ID74
Feature activation+1.08e-02

Pos logit contributions
arth+0.036
factor+0.035
byss+0.034
asu+0.034
Factor+0.033
Neg logit contributions
ubs-0.062
mel-0.059
cover-0.054
opus-0.053
-0.051
Token L
Token ID406
Feature activation-1.40e-01

Pos logit contributions
arth+0.007
factor+0.007
asu+0.007
Factor+0.007
byss+0.007
Neg logit contributions
ubs-0.007
mel-0.007
cover-0.006
opus-0.006
-0.006
Tokenaine
Token ID5718
Feature activation-2.16e-01

Pos logit contributions
ubs+0.081
mel+0.075
cover+0.067
+0.064
opus+0.063
Neg logit contributions
arth-0.104
factor-0.102
asu-0.102
Factor-0.098
byss-0.097
Tokené
Token ID2634
Feature activation-5.19e-02

Pos logit contributions
arth+0.160
factor+0.156
asu+0.153
byss+0.152
Factor+0.148
Neg logit contributions
ubs-0.150
mel-0.141
cover-0.127
opus-0.121
-0.116
Token,
Token ID11
Feature activation-7.70e-03

Pos logit contributions
ubs+0.037
mel+0.035
cover+0.031
opus+0.030
+0.029
Neg logit contributions
arth-0.033
factor-0.032
byss-0.031
asu-0.031
Factor-0.031
TokenCM
Token ID24187
Feature activation-6.13e-02

Pos logit contributions
ubs+0.006
mel+0.005
cover+0.005
opus+0.005
+0.004
Neg logit contributions
arth-0.005
factor-0.005
byss-0.005
asu-0.005
Factor-0.004
TokenDR
Token ID7707
Feature activation-9.78e-02

Pos logit contributions
ubs+0.034
mel+0.031
cover+0.027
opus+0.026
+0.024
Neg logit contributions
arth-0.048
factor-0.048
byss-0.046
asu-0.046
Factor-0.046
Token Kay
Token ID17356
Feature activation+5.17e-02

Pos logit contributions
ubs+0.075
mel+0.071
cover+0.065
opus+0.063
+0.060
Neg logit contributions
arth-0.056
factor-0.055
byss-0.053
asu-0.053
Factor-0.052
Tokenmel
Token ID17694
Feature activation-5.42e-01

Pos logit contributions
arth+0.066
factor+0.066
byss+0.065
asu+0.065
Factor+0.064
Neg logit contributions
ubs-0.003
mel-0.001
cover+0.003
opus+0.003
+0.005
Token Kan
Token ID14248
Feature activation+1.13e-01

Pos logit contributions
ubs+0.441
mel+0.420
cover+0.384
opus+0.374
+0.357
Neg logit contributions
arth-0.285
factor-0.279
byss-0.270
asu-0.268
Factor-0.262
Token,
Token ID11
Feature activation+2.22e-02

Pos logit contributions
arth+0.072
factor+0.070
byss+0.069
asu+0.068
Factor+0.067
Neg logit contributions
ubs-0.080
mel-0.076
cover-0.068
opus-0.066
-0.063
Token pour
Token ID12797
Feature activation-1.02e-01

Pos logit contributions
arth+0.014
factor+0.014
byss+0.013
asu+0.013
Factor+0.013
Neg logit contributions
ubs-0.016
mel-0.015
cover-0.014
opus-0.013
-0.012
Token le
Token ID443
Feature activation-1.46e-01

Pos logit contributions
ubs+0.073
mel+0.068
cover+0.062
opus+0.060
+0.056
Neg logit contributions
arth-0.065
factor-0.063
byss-0.062
asu-0.062
Factor-0.060
Token Command
Token ID9455
Feature activation-9.52e-02

Pos logit contributions
ubs+0.109
mel+0.103
cover+0.094
opus+0.091
+0.086
Neg logit contributions
arth-0.087
factor-0.085
byss-0.083
asu-0.082
Factor-0.081
Token with
Token ID351
Feature activation+1.36e-02

Pos logit contributions
ubs+0.008
mel+0.008
cover+0.007
opus+0.007
+0.007
Neg logit contributions
arth-0.007
factor-0.007
byss-0.007
asu-0.006
Factor-0.006
Token #
Token ID1303
Feature activation-1.35e-01

Pos logit contributions
arth+0.008
factor+0.008
asu+0.008
byss+0.008
Factor+0.008
Neg logit contributions
ubs-0.010
mel-0.010
cover-0.009
opus-0.008
-0.008
TokenAsk
Token ID25214
Feature activation+2.18e-01

Pos logit contributions
ubs+0.091
mel+0.085
cover+0.077
opus+0.074
+0.070
Neg logit contributions
arth-0.090
factor-0.088
byss-0.086
asu-0.086
Factor-0.084
TokenThe
Token ID464
Feature activation+4.83e-02

Pos logit contributions
arth+0.139
factor+0.136
byss+0.132
asu+0.132
Factor+0.130
Neg logit contributions
ubs-0.152
mel-0.145
cover-0.130
opus-0.126
-0.120
TokenD
Token ID35
Feature activation-7.55e-02

Pos logit contributions
arth+0.036
factor+0.035
byss+0.034
asu+0.034
Factor+0.034
Neg logit contributions
ubs-0.029
mel-0.027
cover-0.024
opus-0.023
-0.022
Tokenubs
Token ID23161
Feature activation-5.40e-01

Pos logit contributions
ubs+0.000
mel-0.002
cover-0.007
opus-0.008
-0.010
Neg logit contributions
arth-0.100
factor-0.099
asu-0.098
byss-0.097
Factor-0.097
Token,
Token ID11
Feature activation+2.72e-02

Pos logit contributions
ubs+0.387
mel+0.366
cover+0.331
opus+0.321
+0.304
Neg logit contributions
arth-0.337
factor-0.331
byss-0.321
asu-0.320
Factor-0.314
Token then
Token ID788
Feature activation+2.36e-01

Pos logit contributions
arth+0.017
factor+0.016
asu+0.016
byss+0.016
Factor+0.015
Neg logit contributions
ubs-0.020
mel-0.019
cover-0.017
opus-0.017
-0.016
Token tune
Token ID14009
Feature activation+2.74e-01

Pos logit contributions
arth+0.142
factor+0.139
asu+0.135
byss+0.132
Factor+0.132
Neg logit contributions
ubs-0.174
mel-0.165
cover-0.149
opus-0.146
-0.138
Token in
Token ID287
Feature activation+3.15e-02

Pos logit contributions
arth+0.172
factor+0.169
byss+0.164
asu+0.164
Factor+0.160
Neg logit contributions
ubs-0.195
mel-0.184
cover-0.166
opus-0.161
-0.153
Token to
Token ID284
Feature activation+2.73e-02

Pos logit contributions
arth+0.019
factor+0.019
byss+0.018
asu+0.018
Factor+0.018
Neg logit contributions
ubs-0.023
mel-0.022
cover-0.020
opus-0.019
-0.018
TokenTime
Token ID7575
Feature activation+7.07e-02

Pos logit contributions
ubs+0.048
mel+0.045
cover+0.041
opus+0.039
+0.037
Neg logit contributions
arth-0.042
factor-0.041
byss-0.040
asu-0.039
Factor-0.039
Token in
Token ID287
Feature activation-3.55e-02

Pos logit contributions
arth+0.043
factor+0.043
byss+0.041
asu+0.041
Factor+0.040
Neg logit contributions
ubs-0.051
mel-0.049
cover-0.044
opus-0.043
-0.040
Token Passing
Token ID46389
Feature activation-1.04e-01

Pos logit contributions
ubs+0.026
mel+0.025
cover+0.022
opus+0.022
+0.021
Neg logit contributions
arth-0.022
factor-0.021
asu-0.021
byss-0.021
Factor-0.020
Token:
Token ID25
Feature activation-3.92e-02

Pos logit contributions
ubs+0.081
mel+0.077
cover+0.070
opus+0.068
+0.065
Neg logit contributions
arth-0.058
factor-0.057
byss-0.055
asu-0.055
Factor-0.054
Token Ment
Token ID31879
Feature activation-1.17e-01

Pos logit contributions
ubs+0.028
mel+0.027
cover+0.024
opus+0.023
+0.022
Neg logit contributions
arth-0.024
factor-0.024
byss-0.023
asu-0.023
Factor-0.023
Tokenions
Token ID507
Feature activation-5.37e-01

Pos logit contributions
ubs+0.079
mel+0.075
cover+0.067
opus+0.065
+0.062
Neg logit contributions
arth-0.078
factor-0.076
byss-0.074
asu-0.074
Factor-0.073
Token of
Token ID286
Feature activation-2.84e-02

Pos logit contributions
ubs+0.408
mel+0.387
cover+0.352
opus+0.342
+0.324
Neg logit contributions
arth-0.312
factor-0.306
byss-0.297
asu-0.295
Factor-0.289
Token Time
Token ID3862
Feature activation+4.14e-02

Pos logit contributions
ubs+0.021
mel+0.020
cover+0.018
opus+0.017
+0.016
Neg logit contributions
arth-0.017
factor-0.017
byss-0.017
asu-0.017
Factor-0.016
Token in
Token ID287
Feature activation-4.45e-02

Pos logit contributions
arth+0.026
factor+0.025
byss+0.025
asu+0.024
Factor+0.024
Neg logit contributions
ubs-0.030
mel-0.028
cover-0.026
opus-0.025
-0.023
Token Fiction
Token ID26756
Feature activation-1.11e-01

Pos logit contributions
ubs+0.032
mel+0.031
cover+0.028
opus+0.027
+0.025
Neg logit contributions
arth-0.027
factor-0.027
byss-0.026
asu-0.026
Factor-0.025
Token
Token ID198
Feature activation-6.68e-02

Pos logit contributions
ubs+0.080
mel+0.076
cover+0.068
opus+0.066
+0.063
Neg logit contributions
arth-0.069
factor-0.068
byss-0.066
asu-0.066
Factor-0.064
Token is
Token ID318
Feature activation-4.30e-02

Pos logit contributions
ubs+0.074
mel+0.070
cover+0.063
opus+0.061
+0.057
Neg logit contributions
arth-0.070
factor-0.069
byss-0.067
asu-0.067
Factor-0.066
Token also
Token ID635
Feature activation-4.17e-03

Pos logit contributions
ubs+0.032
mel+0.030
cover+0.028
opus+0.027
+0.025
Neg logit contributions
arth-0.026
factor-0.025
asu-0.024
byss-0.024
Factor-0.024
Token the
Token ID262
Feature activation-8.52e-02

Pos logit contributions
ubs+0.003
mel+0.003
cover+0.003
opus+0.003
+0.003
Neg logit contributions
arth-0.002
factor-0.002
asu-0.002
byss-0.002
Factor-0.002
Token first
Token ID717
Feature activation-6.84e-02

Pos logit contributions
ubs+0.061
mel+0.057
cover+0.052
opus+0.050
+0.047
Neg logit contributions
arth-0.054
factor-0.053
byss-0.051
asu-0.051
Factor-0.050
Token grapp
Token ID21338
Feature activation-1.10e-01

Pos logit contributions
ubs+0.052
mel+0.050
cover+0.045
opus+0.044
+0.042
Neg logit contributions
arth-0.040
factor-0.039
byss-0.038
asu-0.037
Factor-0.037
Tokenler
Token ID1754
Feature activation-5.34e-01

Pos logit contributions
ubs+0.033
mel+0.028
cover+0.021
opus+0.019
+0.017
Neg logit contributions
arth-0.115
factor-0.113
asu-0.111
byss-0.110
Factor-0.109
Token in
Token ID287
Feature activation-8.01e-02

Pos logit contributions
ubs+0.399
mel+0.378
cover+0.343
opus+0.333
+0.316
Neg logit contributions
arth-0.318
factor-0.311
byss-0.303
asu-0.301
Factor-0.295
Token the
Token ID262
Feature activation-1.05e-01

Pos logit contributions
ubs+0.058
mel+0.055
cover+0.050
opus+0.049
+0.046
Neg logit contributions
arth-0.049
factor-0.048
byss-0.047
asu-0.046
Factor-0.046
Token Street
Token ID3530
Feature activation-2.21e-03

Pos logit contributions
ubs+0.077
mel+0.073
cover+0.066
opus+0.064
+0.061
Neg logit contributions
arth-0.063
factor-0.062
byss-0.060
asu-0.060
Factor-0.059
Token Fighter
Token ID15411
Feature activation+2.47e-01

Pos logit contributions
ubs+0.002
mel+0.002
cover+0.002
opus+0.002
+0.002
Neg logit contributions
arth-0.001
factor-0.001
byss-0.001
asu-0.001
Factor-0.001
Token series
Token ID2168
Feature activation+5.70e-02

Pos logit contributions
arth+0.135
factor+0.132
byss+0.128
asu+0.128
Factor+0.125
Neg logit contributions
ubs-0.195
mel-0.186
cover-0.170
opus-0.165
-0.157
Token explained
Token ID4893
Feature activation-1.26e-02

Pos logit contributions
arth+0.001
factor+0.001
byss+0.001
asu+0.001
Factor+0.001
Neg logit contributions
ubs-0.001
mel-0.001
cover-0.001
opus-0.001
-0.001
Token away
Token ID1497
Feature activation-7.48e-04

Pos logit contributions
ubs+0.009
mel+0.009
cover+0.008
opus+0.008
+0.007
Neg logit contributions
arth-0.007
factor-0.007
byss-0.007
asu-0.007
Factor-0.007
Token as
Token ID355
Feature activation+1.50e-02

Pos logit contributions
ubs+0.001
mel+0.001
cover+0.000
opus+0.000
+0.000
Neg logit contributions
arth-0.000
factor-0.000
byss-0.000
asu-0.000
Factor-0.000
Token an
Token ID281
Feature activation-3.91e-02

Pos logit contributions
arth+0.009
factor+0.009
byss+0.009
asu+0.009
Factor+0.009
Neg logit contributions
ubs-0.011
mel-0.010
cover-0.009
opus-0.009
-0.009
Token aber
Token ID42499
Feature activation+2.98e-01

Pos logit contributions
ubs+0.029
mel+0.028
cover+0.025
opus+0.025
+0.023
Neg logit contributions
arth-0.023
factor-0.022
byss-0.022
asu-0.022
Factor-0.021
Tokenration
Token ID1358
Feature activation-5.34e-01

Pos logit contributions
arth+0.299
factor+0.296
asu+0.290
byss+0.289
Factor+0.286
Neg logit contributions
ubs-0.101
mel-0.089
cover-0.069
opus-0.064
-0.056
Token of
Token ID286
Feature activation-1.59e-02

Pos logit contributions
ubs+0.387
mel+0.367
cover+0.332
opus+0.321
+0.303
Neg logit contributions
arth-0.329
factor-0.322
byss-0.316
asu-0.312
Factor-0.306
Token the
Token ID262
Feature activation-1.06e-02

Pos logit contributions
ubs+0.012
mel+0.011
cover+0.010
opus+0.010
+0.009
Neg logit contributions
arth-0.010
factor-0.009
byss-0.009
asu-0.009
Factor-0.009
Token primitive
Token ID20049
Feature activation+1.18e-01

Pos logit contributions
ubs+0.008
mel+0.007
cover+0.006
opus+0.006
+0.006
Neg logit contributions
arth-0.007
factor-0.006
byss-0.006
asu-0.006
Factor-0.006
Token mind
Token ID2000
Feature activation+1.17e-01

Pos logit contributions
arth+0.073
factor+0.071
byss+0.069
asu+0.069
Factor+0.068
Neg logit contributions
ubs-0.086
mel-0.082
cover-0.074
opus-0.072
-0.068
Token.
Token ID13
Feature activation-2.59e-02

Pos logit contributions
arth+0.075
factor+0.073
byss+0.071
asu+0.071
Factor+0.070
Neg logit contributions
ubs-0.082
mel-0.077
cover-0.070
opus-0.067
-0.064
Token equivalent
Token ID7548
Feature activation-1.66e-01

Pos logit contributions
arth+0.004
factor+0.004
byss+0.004
asu+0.004
Factor+0.004
Neg logit contributions
ubs-0.005
mel-0.004
cover-0.004
opus-0.004
-0.004
Token of
Token ID286
Feature activation-1.50e-02

Pos logit contributions
ubs+0.128
mel+0.122
cover+0.111
opus+0.108
+0.102
Neg logit contributions
arth-0.094
factor-0.092
byss-0.089
asu-0.088
Factor-0.087
Token an
Token ID281
Feature activation-6.52e-02

Pos logit contributions
ubs+0.011
mel+0.011
cover+0.010
opus+0.009
+0.009
Neg logit contributions
arth-0.009
factor-0.009
byss-0.008
asu-0.008
Factor-0.008
Token Olympic
Token ID11514
Feature activation-1.66e-01

Pos logit contributions
ubs+0.048
mel+0.046
cover+0.042
opus+0.041
+0.038
Neg logit contributions
arth-0.039
factor-0.038
byss-0.037
asu-0.037
Factor-0.036
Token gymn
Token ID38581
Feature activation-1.07e-01

Pos logit contributions
ubs+0.122
mel+0.115
cover+0.104
opus+0.101
+0.096
Neg logit contributions
arth-0.100
factor-0.099
asu-0.095
byss-0.095
Factor-0.094
Tokenastics
Token ID24232
Feature activation-5.34e-01

Pos logit contributions
ubs+0.073
mel+0.068
cover+0.062
opus+0.059
+0.057
Neg logit contributions
arth-0.071
factor-0.070
asu-0.068
byss-0.067
Factor-0.066
Token routine
Token ID8027
Feature activation-2.24e-01

Pos logit contributions
ubs+0.389
mel+0.368
cover+0.332
opus+0.323
+0.306
Neg logit contributions
arth-0.327
factor-0.321
asu-0.311
byss-0.309
Factor-0.305
Token for
Token ID329
Feature activation-7.48e-02

Pos logit contributions
ubs+0.160
mel+0.151
cover+0.136
opus+0.132
+0.125
Neg logit contributions
arth-0.140
factor-0.138
byss-0.134
asu-0.133
Factor-0.131
Token the
Token ID262
Feature activation-7.12e-02

Pos logit contributions
ubs+0.055
mel+0.053
cover+0.048
opus+0.046
+0.044
Neg logit contributions
arth-0.045
factor-0.044
byss-0.043
asu-0.043
Factor-0.042
Token speaker
Token ID10834
Feature activation+7.64e-02

Pos logit contributions
ubs+0.051
mel+0.048
cover+0.044
opus+0.042
+0.040
Neg logit contributions
arth-0.044
factor-0.044
byss-0.042
asu-0.042
Factor-0.041
Token
Token ID447
Feature activation-5.39e-02

Pos logit contributions
arth+0.047
factor+0.046
byss+0.045
asu+0.045
Factor+0.044
Neg logit contributions
ubs-0.055
mel-0.052
cover-0.047
opus-0.046
-0.043
Token
Token ID251
Feature activation+1.81e-02

Pos logit contributions
ubs+0.014
mel+0.013
cover+0.012
opus+0.012
+0.011
Neg logit contributions
arth-0.013
factor-0.013
byss-0.012
asu-0.012
Factor-0.012
Token
Token ID198
Feature activation+1.49e-02

Pos logit contributions
arth+0.011
factor+0.011
byss+0.011
asu+0.011
Factor+0.010
Neg logit contributions
ubs-0.013
mel-0.012
cover-0.011
opus-0.011
-0.010
Token
Token ID198
Feature activation+6.83e-03

Pos logit contributions
arth+0.009
factor+0.009
asu+0.008
byss+0.008
Factor+0.008
Neg logit contributions
ubs-0.011
mel-0.011
cover-0.010
opus-0.009
-0.009
Token3
Token ID18
Feature activation-8.48e-03

Pos logit contributions
arth+0.004
factor+0.004
byss+0.004
asu+0.004
Factor+0.004
Neg logit contributions
ubs-0.005
mel-0.005
cover-0.004
opus-0.004
-0.004
Token.
Token ID13
Feature activation+4.65e-04

Pos logit contributions
ubs+0.006
mel+0.006
cover+0.005
opus+0.005
+0.005
Neg logit contributions
arth-0.005
factor-0.005
byss-0.005
asu-0.005
Factor-0.005
Token Daniels
Token ID28162
Feature activation-5.27e-01

Pos logit contributions
arth+0.000
factor+0.000
byss+0.000
asu+0.000
Factor+0.000
Neg logit contributions
ubs-0.000
mel-0.000
cover-0.000
opus-0.000
-0.000
Token
Token ID447
Feature activation+4.18e-03

Pos logit contributions
ubs+0.365
mel+0.345
cover+0.310
opus+0.300
+0.283
Neg logit contributions
arth-0.342
factor-0.336
byss-0.327
asu-0.326
Factor-0.320
Token
Token ID247
Feature activation-2.66e-02

Pos logit contributions
arth+0.002
factor+0.002
byss+0.002
asu+0.002
Factor+0.002
Neg logit contributions
ubs-0.003
mel-0.003
cover-0.003
opus-0.003
-0.003
Token Prom
Token ID10335
Feature activation-1.53e-01

Pos logit contributions
ubs+0.019
mel+0.018
cover+0.016
opus+0.015
+0.015
Neg logit contributions
arth-0.017
factor-0.017
asu-0.016
byss-0.016
Factor-0.016
Tokeninent
Token ID7233
Feature activation-3.16e-02

Pos logit contributions
ubs+0.119
mel+0.113
cover+0.103
opus+0.100
+0.095
Neg logit contributions
arth-0.087
factor-0.085
byss-0.083
asu-0.082
Factor-0.080
Token Family
Token ID7884
Feature activation-5.21e-02

Pos logit contributions
ubs+0.024
mel+0.022
cover+0.020
opus+0.020
+0.019
Neg logit contributions
arth-0.019
factor-0.018
byss-0.018
asu-0.018
Factor-0.017
Token
Token ID564
Feature activation-4.06e-02

Pos logit contributions
arth+0.019
factor+0.018
byss+0.018
asu+0.018
Factor+0.018
Neg logit contributions
ubs-0.013
mel-0.012
cover-0.011
opus-0.010
-0.010
Token
Token ID246
Feature activation-4.04e-03

Pos logit contributions
ubs+0.029
mel+0.028
cover+0.025
opus+0.024
+0.023
Neg logit contributions
arth-0.025
factor-0.025
byss-0.024
asu-0.024
Factor-0.023
TokenDanny
Token ID45478
Feature activation-6.50e-02

Pos logit contributions
ubs+0.003
mel+0.003
cover+0.002
opus+0.002
+0.002
Neg logit contributions
arth-0.003
factor-0.003
asu-0.002
byss-0.002
Factor-0.002
Token
Token ID447
Feature activation+3.07e-02

Pos logit contributions
ubs+0.028
mel+0.025
cover+0.021
opus+0.020
+0.018
Neg logit contributions
arth-0.059
factor-0.059
byss-0.057
asu-0.057
Factor-0.057
Token
Token ID247
Feature activation+1.54e-02

Pos logit contributions
arth+0.018
factor+0.017
byss+0.017
asu+0.017
Factor+0.016
Neg logit contributions
ubs-0.023
mel-0.022
cover-0.020
opus-0.020
-0.019
Token Daniels
Token ID28162
Feature activation-5.15e-01

Pos logit contributions
arth+0.010
factor+0.010
byss+0.010
asu+0.010
Factor+0.009
Neg logit contributions
ubs-0.011
mel-0.010
cover-0.009
opus-0.009
-0.008
Token and
Token ID290
Feature activation+1.55e-03

Pos logit contributions
ubs+0.354
mel+0.335
cover+0.301
opus+0.291
+0.274
Neg logit contributions
arth-0.337
factor-0.330
byss-0.322
asu-0.320
Factor-0.314
Token his
Token ID465
Feature activation+9.21e-02

Pos logit contributions
arth+0.001
factor+0.001
byss+0.001
asu+0.001
Factor+0.001
Neg logit contributions
ubs-0.001
mel-0.001
cover-0.001
opus-0.001
-0.001
Token wife
Token ID3656
Feature activation+6.11e-02

Pos logit contributions
arth+0.057
factor+0.056
byss+0.055
asu+0.055
Factor+0.054
Neg logit contributions
ubs-0.066
mel-0.062
cover-0.056
opus-0.055
-0.052
Token,
Token ID11
Feature activation-1.93e-03

Pos logit contributions
arth+0.039
factor+0.038
byss+0.037
asu+0.037
Factor+0.036
Neg logit contributions
ubs-0.043
mel-0.041
cover-0.037
opus-0.036
-0.034
Token Jan
Token ID2365
Feature activation+7.44e-03

Pos logit contributions
ubs+0.001
mel+0.001
cover+0.001
opus+0.001
+0.001
Neg logit contributions
arth-0.001
factor-0.001
byss-0.001
asu-0.001
Factor-0.001
TokenDrive
Token ID24825
Feature activation+5.62e-02

Pos logit contributions
arth+0.002
factor+0.002
byss+0.002
asu+0.002
Factor+0.002
Neg logit contributions
ubs-0.003
mel-0.003
cover-0.002
opus-0.002
-0.002
Token Inn
Token ID10849
Feature activation-1.17e-01

Pos logit contributions
arth+0.041
factor+0.041
byss+0.040
asu+0.040
Factor+0.039
Neg logit contributions
ubs-0.034
mel-0.032
cover-0.028
opus-0.027
-0.025
Token,
Token ID11
Feature activation+2.68e-02

Pos logit contributions
ubs+0.083
mel+0.078
cover+0.071
opus+0.068
+0.064
Neg logit contributions
arth-0.075
factor-0.073
byss-0.072
asu-0.071
Factor-0.070
Token and
Token ID290
Feature activation+1.35e-02

Pos logit contributions
arth+0.016
factor+0.016
byss+0.016
asu+0.016
Factor+0.015
Neg logit contributions
ubs-0.019
mel-0.018
cover-0.017
opus-0.016
-0.015
Token other
Token ID584
Feature activation-6.65e-02

Pos logit contributions
arth+0.008
factor+0.008
byss+0.008
asu+0.008
Factor+0.008
Neg logit contributions
ubs-0.010
mel-0.009
cover-0.008
opus-0.008
-0.008
Token Daniels
Token ID28162
Feature activation-5.15e-01

Pos logit contributions
ubs+0.047
mel+0.045
cover+0.040
opus+0.039
+0.037
Neg logit contributions
arth-0.042
factor-0.041
byss-0.040
asu-0.040
Factor-0.039
Token theaters
Token ID20550
Feature activation+8.09e-02

Pos logit contributions
ubs+0.386
mel+0.366
cover+0.332
opus+0.322
+0.305
Neg logit contributions
arth-0.304
factor-0.298
byss-0.291
asu-0.288
Factor-0.283
Token and
Token ID290
Feature activation-7.50e-03

Pos logit contributions
arth+0.051
factor+0.050
byss+0.049
asu+0.048
Factor+0.048
Neg logit contributions
ubs-0.057
mel-0.054
cover-0.049
opus-0.047
-0.045
Token properties
Token ID6608
Feature activation+8.82e-02

Pos logit contributions
ubs+0.006
mel+0.005
cover+0.005
opus+0.005
+0.004
Neg logit contributions
arth-0.005
factor-0.004
asu-0.004
byss-0.004
Factor-0.004
Token in
Token ID287
Feature activation-1.16e-02

Pos logit contributions
arth+0.055
factor+0.054
byss+0.053
asu+0.053
Factor+0.052
Neg logit contributions
ubs-0.063
mel-0.060
cover-0.054
opus-0.052
-0.049
Token South
Token ID2520
Feature activation-1.19e-01

Pos logit contributions
ubs+0.008
mel+0.008
cover+0.007
opus+0.007
+0.006
Neg logit contributions
arth-0.007
factor-0.007
byss-0.007
asu-0.007
Factor-0.007
TokenHur
Token ID42633
Feature activation-1.15e-01

Pos logit contributions
ubs+0.013
mel+0.012
cover+0.011
opus+0.010
+0.010
Neg logit contributions
arth-0.012
factor-0.012
asu-0.011
byss-0.011
Factor-0.011
Tokentt
Token ID926
Feature activation+1.29e-01

Pos logit contributions
ubs+0.103
mel+0.099
cover+0.091
opus+0.089
+0.086
Neg logit contributions
arth-0.051
factor-0.049
byss-0.047
asu-0.047
Factor-0.046
Token has
Token ID468
Feature activation-1.47e-03

Pos logit contributions
arth+0.084
factor+0.082
asu+0.080
byss+0.079
Factor+0.078
Neg logit contributions
ubs-0.089
mel-0.084
cover-0.076
opus-0.073
-0.069
Token also
Token ID635
Feature activation+1.92e-02

Pos logit contributions
ubs+0.001
mel+0.001
cover+0.001
opus+0.001
+0.001
Neg logit contributions
arth-0.001
factor-0.001
asu-0.001
byss-0.001
Factor-0.001
Token tut
Token ID9732
Feature activation-3.00e-02

Pos logit contributions
arth+0.012
factor+0.012
asu+0.012
byss+0.012
Factor+0.012
Neg logit contributions
ubs-0.013
mel-0.013
cover-0.011
opus-0.011
-0.010
Tokenored
Token ID1850
Feature activation-5.14e-01

Pos logit contributions
ubs+0.008
mel+0.006
cover+0.004
opus+0.004
+0.003
Neg logit contributions
arth-0.033
factor-0.032
asu-0.032
byss-0.031
Factor-0.031
Token outside
Token ID2354
Feature activation+4.23e-02

Pos logit contributions
ubs+0.381
mel+0.361
cover+0.327
opus+0.315
+0.300
Neg logit contributions
arth-0.309
factor-0.303
byss-0.297
asu-0.293
Factor-0.287
Token linebacker
Token ID18961
Feature activation-6.79e-02

Pos logit contributions
arth+0.023
factor+0.023
byss+0.022
asu+0.022
Factor+0.021
Neg logit contributions
ubs-0.034
mel-0.032
cover-0.029
opus-0.028
-0.027
Token Willie
Token ID28623
Feature activation+6.94e-02

Pos logit contributions
ubs+0.050
mel+0.048
cover+0.043
opus+0.042
+0.040
Neg logit contributions
arth-0.041
factor-0.040
byss-0.039
asu-0.039
Factor-0.038
Token Young
Token ID6960
Feature activation-1.37e-01

Pos logit contributions
arth+0.048
factor+0.047
asu+0.045
byss+0.045
Factor+0.045
Neg logit contributions
ubs-0.045
mel-0.042
cover-0.038
opus-0.037
-0.035
Token,
Token ID11
Feature activation-7.39e-03

Pos logit contributions
ubs+0.097
mel+0.092
cover+0.083
opus+0.081
+0.076
Neg logit contributions
arth-0.087
factor-0.085
byss-0.083
asu-0.082
Factor-0.081
Token to
Token ID284
Feature activation-1.48e-02

Pos logit contributions
arth+0.003
factor+0.003
byss+0.003
asu+0.003
Factor+0.003
Neg logit contributions
ubs-0.004
mel-0.003
cover-0.003
opus-0.003
-0.003
Token his
Token ID465
Feature activation+7.91e-02

Pos logit contributions
ubs+0.012
mel+0.011
cover+0.010
opus+0.010
+0.009
Neg logit contributions
arth-0.008
factor-0.008
byss-0.008
asu-0.008
Factor-0.007
Token name
Token ID1438
Feature activation-1.82e-02

Pos logit contributions
arth+0.054
factor+0.053
asu+0.052
byss+0.052
Factor+0.051
Neg logit contributions
ubs-0.052
mel-0.049
cover-0.043
opus-0.042
-0.039
Token to
Token ID284
Feature activation-3.43e-02

Pos logit contributions
ubs+0.013
mel+0.012
cover+0.011
opus+0.011
+0.010
Neg logit contributions
arth-0.011
factor-0.011
byss-0.011
asu-0.011
Factor-0.011
Token become
Token ID1716
Feature activation+5.50e-02

Pos logit contributions
ubs+0.025
mel+0.024
cover+0.021
opus+0.021
+0.020
Neg logit contributions
arth-0.021
factor-0.021
asu-0.020
byss-0.020
Factor-0.020
Token Daniels
Token ID28162
Feature activation-5.09e-01

Pos logit contributions
arth+0.034
factor+0.034
byss+0.033
asu+0.033
Factor+0.032
Neg logit contributions
ubs-0.039
mel-0.037
cover-0.034
opus-0.033
-0.031
Token.)
Token ID2014
Feature activation-1.36e-02

Pos logit contributions
ubs+0.345
mel+0.325
cover+0.292
opus+0.282
+0.267
Neg logit contributions
arth-0.337
factor-0.331
byss-0.324
asu-0.321
Factor-0.315
Token
Token ID447
Feature activation-1.27e-02

Pos logit contributions
ubs+0.009
mel+0.009
cover+0.008
opus+0.008
+0.007
Neg logit contributions
arth-0.009
factor-0.009
byss-0.009
asu-0.009
Factor-0.008
Token
Token ID251
Feature activation+1.40e-02

Pos logit contributions
ubs+0.011
mel+0.010
cover+0.009
opus+0.009
+0.009
Neg logit contributions
arth-0.006
factor-0.006
byss-0.006
asu-0.006
Factor-0.006
Token
Token ID198
Feature activation-2.17e-02

Pos logit contributions
arth+0.009
factor+0.009
byss+0.009
asu+0.009
Factor+0.009
Neg logit contributions
ubs-0.010
mel-0.009
cover-0.008
opus-0.008
-0.007
Token
Token ID198
Feature activation-2.95e-02

Pos logit contributions
ubs+0.016
mel+0.015
cover+0.014
opus+0.013
+0.013
Neg logit contributions
arth-0.013
factor-0.013
asu-0.012
byss-0.012
Factor-0.012
Token This
Token ID770
Feature activation-1.05e-02

Pos logit contributions
ubs+0.002
mel+0.002
cover+0.002
opus+0.002
+0.002
Neg logit contributions
arth-0.002
factor-0.002
byss-0.002
asu-0.002
Factor-0.002
Token happened
Token ID3022
Feature activation-1.91e-01

Pos logit contributions
ubs+0.008
mel+0.007
cover+0.007
opus+0.006
+0.006
Neg logit contributions
arth-0.006
factor-0.006
asu-0.006
byss-0.006
Factor-0.006
Token because
Token ID780
Feature activation-1.18e-01

Pos logit contributions
ubs+0.137
mel+0.130
cover+0.117
opus+0.114
+0.107
Neg logit contributions
arth-0.120
factor-0.117
byss-0.114
asu-0.114
Factor-0.112
Token many
Token ID867
Feature activation-5.56e-02

Pos logit contributions
ubs+0.087
mel+0.082
cover+0.075
opus+0.073
+0.069
Neg logit contributions
arth-0.071
factor-0.070
byss-0.068
asu-0.068
Factor-0.066
Token Nort
Token ID15209
Feature activation+9.96e-03

Pos logit contributions
ubs+0.041
mel+0.039
cover+0.035
opus+0.034
+0.032
Neg logit contributions
arth-0.034
factor-0.033
byss-0.032
asu-0.032
Factor-0.031
Tokenher
Token ID372
Feature activation-5.09e-01

Pos logit contributions
arth+0.009
factor+0.009
byss+0.009
asu+0.009
Factor+0.009
Neg logit contributions
ubs-0.004
mel-0.004
cover-0.003
opus-0.003
-0.003
Tokenners
Token ID2741
Feature activation-1.62e-01

Pos logit contributions
ubs+0.380
mel+0.360
cover+0.326
opus+0.319
+0.303
Neg logit contributions
arth-0.302
factor-0.295
byss-0.287
asu-0.286
Factor-0.279
Token feared
Token ID15240
Feature activation-5.02e-02

Pos logit contributions
ubs+0.115
mel+0.109
cover+0.099
opus+0.096
+0.090
Neg logit contributions
arth-0.102
factor-0.100
byss-0.097
asu-0.096
Factor-0.095
Token that
Token ID326
Feature activation-2.01e-02

Pos logit contributions
ubs+0.037
mel+0.035
cover+0.032
opus+0.031
+0.029
Neg logit contributions
arth-0.031
factor-0.030
byss-0.029
asu-0.029
Factor-0.028
Token if
Token ID611
Feature activation-1.77e-02

Pos logit contributions
ubs+0.015
mel+0.014
cover+0.012
opus+0.012
+0.011
Neg logit contributions
arth-0.012
factor-0.012
byss-0.012
asu-0.012
Factor-0.012
Token the
Token ID262
Feature activation-4.17e-02

Pos logit contributions
ubs+0.013
mel+0.012
cover+0.011
opus+0.010
+0.010
Neg logit contributions
arth-0.011
factor-0.011
byss-0.011
asu-0.011
Factor-0.010
Token wife
Token ID3656
Feature activation+6.11e-02

Pos logit contributions
arth+0.057
factor+0.056
byss+0.055
asu+0.055
Factor+0.054
Neg logit contributions
ubs-0.066
mel-0.062
cover-0.056
opus-0.055
-0.052
Token,
Token ID11
Feature activation-1.93e-03

Pos logit contributions
arth+0.039
factor+0.038
byss+0.037
asu+0.037
Factor+0.036
Neg logit contributions
ubs-0.043
mel-0.041
cover-0.037
opus-0.036
-0.034
Token Jan
Token ID2365
Feature activation+7.44e-03

Pos logit contributions
ubs+0.001
mel+0.001
cover+0.001
opus+0.001
+0.001
Neg logit contributions
arth-0.001
factor-0.001
byss-0.001
asu-0.001
Factor-0.001
Tokenis
Token ID271
Feature activation-3.88e-02

Pos logit contributions
arth+0.004
factor+0.004
byss+0.004
asu+0.004
Factor+0.004
Neg logit contributions
ubs-0.006
mel-0.005
cover-0.005
opus-0.005
-0.004
Token Turk
Token ID8484
Feature activation+6.80e-02

Pos logit contributions
ubs+0.026
mel+0.024
cover+0.022
opus+0.021
+0.020
Neg logit contributions
arth-0.026
factor-0.026
byss-0.025
asu-0.025
Factor-0.024
Token Daniels
Token ID28162
Feature activation-5.06e-01

Pos logit contributions
arth+0.044
factor+0.044
byss+0.042
asu+0.042
Factor+0.041
Neg logit contributions
ubs-0.047
mel-0.044
cover-0.040
opus-0.038
-0.036
Token announce
Token ID5453
Feature activation+1.17e-01

Pos logit contributions
ubs+0.349
mel+0.330
cover+0.296
opus+0.287
+0.270
Neg logit contributions
arth-0.330
factor-0.324
byss-0.316
asu-0.314
Factor-0.308
Token the
Token ID262
Feature activation-2.16e-02

Pos logit contributions
arth+0.070
factor+0.069
byss+0.067
asu+0.067
Factor+0.065
Neg logit contributions
ubs-0.086
mel-0.082
cover-0.074
opus-0.072
-0.068
Token birth
Token ID4082
Feature activation-2.37e-02

Pos logit contributions
ubs+0.015
mel+0.015
cover+0.013
opus+0.013
+0.012
Neg logit contributions
arth-0.014
factor-0.013
byss-0.013
asu-0.013
Factor-0.013
Token of
Token ID286
Feature activation-1.20e-02

Pos logit contributions
ubs+0.018
mel+0.018
cover+0.016
opus+0.016
+0.015
Neg logit contributions
arth-0.013
factor-0.013
byss-0.013
asu-0.013
Factor-0.012
Token their
Token ID511
Feature activation+5.88e-03

Pos logit contributions
ubs+0.009
mel+0.009
cover+0.008
opus+0.008
+0.007
Neg logit contributions
arth-0.007
factor-0.007
byss-0.007
asu-0.007
Factor-0.006
Token
Token ID198
Feature activation-2.06e-03

Pos logit contributions
ubs+0.004
mel+0.004
cover+0.004
opus+0.003
+0.003
Neg logit contributions
arth-0.004
factor-0.004
byss-0.003
asu-0.003
Factor-0.003
Token
Token ID198
Feature activation-8.96e-03

Pos logit contributions
ubs+0.002
mel+0.001
cover+0.001
opus+0.001
+0.001
Neg logit contributions
arth-0.001
factor-0.001
asu-0.001
byss-0.001
Factor-0.001
TokenJan
Token ID12128
Feature activation-2.05e-02

Pos logit contributions
ubs+0.006
mel+0.006
cover+0.005
opus+0.005
+0.005
Neg logit contributions
arth-0.006
factor-0.006
asu-0.006
byss-0.006
Factor-0.005
Tokenis
Token ID271
Feature activation-1.90e-02

Pos logit contributions
ubs+0.014
mel+0.013
cover+0.012
opus+0.012
+0.011
Neg logit contributions
arth-0.013
factor-0.013
byss-0.013
asu-0.013
Factor-0.013
Token Turk
Token ID8484
Feature activation+7.37e-02

Pos logit contributions
ubs+0.013
mel+0.012
cover+0.011
opus+0.011
+0.010
Neg logit contributions
arth-0.012
factor-0.012
byss-0.012
asu-0.012
Factor-0.012
Token Daniels
Token ID28162
Feature activation-5.06e-01

Pos logit contributions
arth+0.075
factor+0.075
byss+0.073
asu+0.073
Factor+0.072
Neg logit contributions
ubs-0.023
mel-0.021
cover-0.016
opus-0.014
-0.012
Token,
Token ID11
Feature activation+4.61e-03

Pos logit contributions
ubs+0.350
mel+0.331
cover+0.297
opus+0.288
+0.271
Neg logit contributions
arth-0.328
factor-0.322
byss-0.314
asu-0.312
Factor-0.307
Token who
Token ID508
Feature activation-2.70e-02

Pos logit contributions
arth+0.003
factor+0.003
byss+0.003
asu+0.003
Factor+0.003
Neg logit contributions
ubs-0.003
mel-0.003
cover-0.003
opus-0.003
-0.003
Token also
Token ID635
Feature activation+4.60e-03

Pos logit contributions
ubs+0.019
mel+0.018
cover+0.016
opus+0.016
+0.015
Neg logit contributions
arth-0.017
factor-0.017
asu-0.016
byss-0.016
Factor-0.016
Token goes
Token ID2925
Feature activation+5.96e-02

Pos logit contributions
arth+0.003
factor+0.003
asu+0.003
byss+0.003
Factor+0.003
Neg logit contributions
ubs-0.003
mel-0.003
cover-0.003
opus-0.003
-0.003
Token by
Token ID416
Feature activation-3.08e-02

Pos logit contributions
arth+0.038
factor+0.037
byss+0.037
asu+0.036
Factor+0.036
Neg logit contributions
ubs-0.042
mel-0.040
cover-0.035
opus-0.034
-0.032
Token Stamford
Token ID47604
Feature activation+9.68e-02

Pos logit contributions
ubs+0.004
mel+0.004
cover+0.003
opus+0.003
+0.003
Neg logit contributions
arth-0.003
factor-0.003
asu-0.003
byss-0.003
Factor-0.003
Token Bridge
Token ID10290
Feature activation-2.27e-01

Pos logit contributions
arth+0.080
factor+0.078
byss+0.077
asu+0.077
Factor+0.076
Neg logit contributions
ubs-0.050
mel-0.046
cover-0.040
opus-0.038
-0.035
Token,
Token ID11
Feature activation-2.07e-02

Pos logit contributions
ubs+0.157
mel+0.148
cover+0.133
opus+0.129
+0.122
Neg logit contributions
arth-0.148
factor-0.145
byss-0.142
asu-0.141
Factor-0.138
Token the
Token ID262
Feature activation-2.37e-02

Pos logit contributions
ubs+0.015
mel+0.015
cover+0.013
opus+0.013
+0.012
Neg logit contributions
arth-0.012
factor-0.012
byss-0.012
asu-0.012
Factor-0.011
Token nort
Token ID17605
Feature activation+6.66e-02

Pos logit contributions
ubs+0.017
mel+0.017
cover+0.015
opus+0.015
+0.014
Neg logit contributions
arth-0.014
factor-0.014
byss-0.014
asu-0.014
Factor-0.013
Tokenher
Token ID372
Feature activation-5.03e-01

Pos logit contributions
arth+0.064
factor+0.063
byss+0.062
asu+0.061
Factor+0.060
Neg logit contributions
ubs-0.026
mel-0.023
cover-0.019
opus-0.018
-0.016
Tokenners
Token ID2741
Feature activation-1.49e-01

Pos logit contributions
ubs+0.389
mel+0.369
cover+0.335
opus+0.327
+0.311
Neg logit contributions
arth-0.287
factor-0.280
byss-0.272
asu-0.270
Factor-0.264
Token were
Token ID547
Feature activation-5.76e-03

Pos logit contributions
ubs+0.104
mel+0.098
cover+0.089
opus+0.086
+0.081
Neg logit contributions
arth-0.096
factor-0.094
byss-0.091
asu-0.091
Factor-0.089
Token remarkably
Token ID21196
Feature activation-5.69e-02

Pos logit contributions
ubs+0.004
mel+0.004
cover+0.004
opus+0.004
+0.003
Neg logit contributions
arth-0.003
factor-0.003
byss-0.003
asu-0.003
Factor-0.003
Token absent
Token ID13717
Feature activation+1.27e-01

Pos logit contributions
ubs+0.041
mel+0.039
cover+0.035
opus+0.034
+0.032
Neg logit contributions
arth-0.036
factor-0.035
asu-0.034
byss-0.034
Factor-0.033
Token (
Token ID357
Feature activation-4.24e-02

Pos logit contributions
arth+0.080
factor+0.078
byss+0.076
asu+0.076
Factor+0.074
Neg logit contributions
ubs-0.090
mel-0.086
cover-0.077
opus-0.075
-0.071
TokenTER
Token ID5781
Feature activation-2.08e-01

Pos logit contributions
ubs+0.005
mel+0.005
cover+0.005
opus+0.004
+0.004
Neg logit contributions
arth-0.005
factor-0.005
asu-0.004
byss-0.004
Factor-0.004
Token
Token ID198
Feature activation-2.64e-03

Pos logit contributions
ubs+0.152
mel+0.144
cover+0.130
opus+0.126
+0.120
Neg logit contributions
arth-0.127
factor-0.124
byss-0.121
asu-0.120
Factor-0.118
Token
Token ID198
Feature activation-2.22e-03

Pos logit contributions
ubs+0.002
mel+0.002
cover+0.002
opus+0.002
+0.002
Neg logit contributions
arth-0.002
factor-0.002
asu-0.001
byss-0.001
Factor-0.001
Token+
Token ID10
Feature activation+4.67e-02

Pos logit contributions
ubs+0.002
mel+0.001
cover+0.001
opus+0.001
+0.001
Neg logit contributions
arth-0.001
factor-0.001
byss-0.001
asu-0.001
Factor-0.001
Token Advent
Token ID33732
Feature activation-2.45e-01

Pos logit contributions
arth+0.029
factor+0.029
byss+0.028
asu+0.028
Factor+0.027
Neg logit contributions
ubs-0.033
mel-0.032
cover-0.029
opus-0.028
-0.026
Tokenurous
Token ID29277
Feature activation-5.02e-01

Pos logit contributions
ubs+0.157
mel+0.148
cover+0.132
opus+0.127
+0.119
Neg logit contributions
arth-0.171
factor-0.168
byss-0.164
asu-0.163
Factor-0.160
Token,
Token ID11
Feature activation-2.43e-02

Pos logit contributions
ubs+0.366
mel+0.347
cover+0.315
opus+0.304
+0.288
Neg logit contributions
arth-0.307
factor-0.301
byss-0.294
asu-0.292
Factor-0.285
Token Cult
Token ID22449
Feature activation-1.90e-01

Pos logit contributions
ubs+0.017
mel+0.016
cover+0.015
opus+0.014
+0.014
Neg logit contributions
arth-0.015
factor-0.015
byss-0.014
asu-0.014
Factor-0.014
Tokenured
Token ID1522
Feature activation-1.28e-01

Pos logit contributions
ubs+0.131
mel+0.124
cover+0.111
opus+0.108
+0.102
Neg logit contributions
arth-0.123
factor-0.121
byss-0.118
asu-0.117
Factor-0.115
Token,
Token ID11
Feature activation-3.12e-02

Pos logit contributions
ubs+0.092
mel+0.087
cover+0.078
opus+0.076
+0.072
Neg logit contributions
arth-0.080
factor-0.078
byss-0.076
asu-0.076
Factor-0.074
Token Quick
Token ID12029
Feature activation-1.27e-01

Pos logit contributions
ubs+0.022
mel+0.021
cover+0.019
opus+0.019
+0.018
Neg logit contributions
arth-0.019
factor-0.019
byss-0.019
asu-0.019
Factor-0.018
Token can
Token ID460
Feature activation+6.96e-03

Pos logit contributions
arth+0.022
factor+0.022
asu+0.021
byss+0.021
Factor+0.021
Neg logit contributions
ubs-0.022
mel-0.021
cover-0.019
opus-0.018
-0.017
Token do
Token ID466
Feature activation-1.76e-02

Pos logit contributions
arth+0.004
factor+0.004
asu+0.004
byss+0.004
Factor+0.004
Neg logit contributions
ubs-0.005
mel-0.005
cover-0.004
opus-0.004
-0.004
Token that
Token ID326
Feature activation-3.24e-04

Pos logit contributions
ubs+0.013
mel+0.013
cover+0.011
opus+0.011
+0.011
Neg logit contributions
arth-0.010
factor-0.010
byss-0.010
asu-0.010
Factor-0.010
Token except
Token ID2845
Feature activation-3.20e-02

Pos logit contributions
ubs+0.000
mel+0.000
cover+0.000
opus+0.000
+0.000
Neg logit contributions
arth-0.000
factor-0.000
asu-0.000
byss-0.000
Factor-0.000
Token Jack
Token ID3619
Feature activation-9.63e-02

Pos logit contributions
ubs+0.024
mel+0.023
cover+0.021
opus+0.020
+0.019
Neg logit contributions
arth-0.019
factor-0.018
byss-0.018
asu-0.017
Factor-0.017
Token Kirby
Token ID23965
Feature activation+3.47e-01

Pos logit contributions
ubs+0.082
mel+0.078
cover+0.072
opus+0.070
+0.067
Neg logit contributions
arth-0.047
factor-0.046
byss-0.045
asu-0.044
Factor-0.043
Token!"
Token ID2474
Feature activation+2.18e-03

Pos logit contributions
arth+0.230
factor+0.226
byss+0.221
asu+0.219
Factor+0.215
Neg logit contributions
ubs-0.235
mel-0.222
cover-0.199
opus-0.192
-0.181
Token According
Token ID4784
Feature activation-7.43e-02

Pos logit contributions
arth+0.001
factor+0.001
byss+0.001
asu+0.001
Factor+0.001
Neg logit contributions
ubs-0.002
mel-0.002
cover-0.001
opus-0.001
-0.001
Token to
Token ID284
Feature activation+3.26e-02

Pos logit contributions
ubs+0.058
mel+0.054
cover+0.049
opus+0.048
+0.046
Neg logit contributions
arth-0.042
factor-0.041
asu-0.040
byss-0.040
Factor-0.039
Token King
Token ID2677
Feature activation-1.61e-02

Pos logit contributions
arth+0.023
factor+0.022
asu+0.021
byss+0.021
Factor+0.021
Neg logit contributions
ubs-0.021
mel-0.020
cover-0.018
opus-0.017
-0.016
Token,
Token ID11
Feature activation-1.63e-02

Pos logit contributions
ubs+0.011
mel+0.010
cover+0.009
opus+0.009
+0.009
Neg logit contributions
arth-0.011
factor-0.010
byss-0.010
asu-0.010
Factor-0.010
Token taking
Token ID2263
Feature activation+3.74e-02

Pos logit contributions
arth+0.031
factor+0.030
byss+0.030
asu+0.029
Factor+0.029
Neg logit contributions
ubs-0.037
mel-0.035
cover-0.032
opus-0.031
-0.029
Token a
Token ID257
Feature activation+3.38e-02

Pos logit contributions
arth+0.024
factor+0.023
asu+0.023
byss+0.023
Factor+0.022
Neg logit contributions
ubs-0.026
mel-0.025
cover-0.022
opus-0.022
-0.020
Token mis
Token ID2984
Feature activation+5.13e-02

Pos logit contributions
arth+0.019
factor+0.018
asu+0.018
byss+0.018
Factor+0.017
Neg logit contributions
ubs-0.027
mel-0.025
cover-0.023
opus-0.023
-0.021
Tokendirection
Token ID37295
Feature activation-8.46e-02

Pos logit contributions
arth+0.031
factor+0.030
byss+0.029
asu+0.029
Factor+0.028
Neg logit contributions
ubs-0.038
mel-0.036
cover-0.033
opus-0.032
-0.030
Token counter
Token ID3753
Feature activation-6.62e-02

Pos logit contributions
ubs+0.063
mel+0.060
cover+0.054
opus+0.052
+0.050
Neg logit contributions
arth-0.051
factor-0.050
byss-0.048
asu-0.048
Factor-0.047
Tokenstep
Token ID9662
Feature activation+3.14e-01

Pos logit contributions
ubs+0.048
mel+0.046
cover+0.041
opus+0.040
+0.038
Neg logit contributions
arth-0.041
factor-0.040
byss-0.039
asu-0.039
Factor-0.038
Token behind
Token ID2157
Feature activation+1.44e-02

Pos logit contributions
arth+0.192
factor+0.188
byss+0.183
asu+0.182
Factor+0.178
Neg logit contributions
ubs-0.229
mel-0.217
cover-0.196
opus-0.190
-0.180
Token Bell
Token ID7459
Feature activation-2.84e-02

Pos logit contributions
arth+0.009
factor+0.009
byss+0.008
asu+0.008
Factor+0.008
Neg logit contributions
ubs-0.011
mel-0.010
cover-0.009
opus-0.009
-0.008
Token before
Token ID878
Feature activation-2.32e-02

Pos logit contributions
ubs+0.020
mel+0.019
cover+0.017
opus+0.016
+0.016
Neg logit contributions
arth-0.018
factor-0.018
byss-0.017
asu-0.017
Factor-0.017
Token cutting
Token ID7720
Feature activation-3.18e-02

Pos logit contributions
ubs+0.017
mel+0.016
cover+0.015
opus+0.014
+0.013
Neg logit contributions
arth-0.014
factor-0.014
byss-0.013
asu-0.013
Factor-0.013
Token behind
Token ID2157
Feature activation+1.28e-02

Pos logit contributions
ubs+0.024
mel+0.022
cover+0.020
opus+0.020
+0.019
Neg logit contributions
arth-0.019
factor-0.019
byss-0.018
asu-0.018
Factor-0.018
Token and
Token ID290
Feature activation+6.78e-02

Pos logit contributions
ubs+0.001
mel+0.001
cover+0.001
opus+0.001
+0.001
Neg logit contributions
arth-0.001
factor-0.001
byss-0.001
asu-0.001
Factor-0.001
Token West
Token ID2688
Feature activation+2.38e-01

Pos logit contributions
arth+0.042
factor+0.041
byss+0.040
asu+0.039
Factor+0.039
Neg logit contributions
ubs-0.049
mel-0.047
cover-0.042
opus-0.041
-0.039
Token All
Token ID1439
Feature activation+1.43e-02

Pos logit contributions
arth+0.156
factor+0.153
asu+0.149
byss+0.149
Factor+0.145
Neg logit contributions
ubs-0.163
mel-0.153
cover-0.137
opus-0.134
-0.127
Tokenis
Token ID271
Feature activation+2.89e-02

Pos logit contributions
arth+0.009
factor+0.009
asu+0.009
byss+0.009
Factor+0.008
Neg logit contributions
ubs-0.010
mel-0.010
cover-0.009
opus-0.008
-0.008
Token-
Token ID12
Feature activation+5.53e-02

Pos logit contributions
arth+0.019
factor+0.018
asu+0.018
byss+0.018
Factor+0.017
Neg logit contributions
ubs-0.020
mel-0.019
cover-0.017
opus-0.017
-0.016
TokenWest
Token ID15045
Feature activation+3.29e-01

Pos logit contributions
arth+0.034
factor+0.033
byss+0.032
asu+0.032
Factor+0.031
Neg logit contributions
ubs-0.040
mel-0.038
cover-0.035
opus-0.034
-0.032
Token Milwaukee
Token ID16629
Feature activation+2.30e-02

Pos logit contributions
arth+0.212
factor+0.207
byss+0.202
asu+0.201
Factor+0.197
Neg logit contributions
ubs-0.230
mel-0.217
cover-0.195
opus-0.190
-0.180
Token.
Token ID13
Feature activation+4.95e-02

Pos logit contributions
arth+0.015
factor+0.014
byss+0.014
asu+0.014
Factor+0.014
Neg logit contributions
ubs-0.016
mel-0.015
cover-0.014
opus-0.013
-0.013
Token The
Token ID383
Feature activation+8.20e-02

Pos logit contributions
arth+0.030
factor+0.030
byss+0.029
asu+0.029
Factor+0.028
Neg logit contributions
ubs-0.036
mel-0.034
cover-0.031
opus-0.030
-0.028
Token expansion
Token ID7118
Feature activation+7.78e-02

Pos logit contributions
arth+0.050
factor+0.049
asu+0.048
byss+0.048
Factor+0.047
Neg logit contributions
ubs-0.059
mel-0.056
cover-0.051
opus-0.050
-0.047
Token would
Token ID561
Feature activation+2.54e-02

Pos logit contributions
arth+0.048
factor+0.048
byss+0.046
asu+0.046
Factor+0.045
Neg logit contributions
ubs-0.056
mel-0.053
cover-0.048
opus-0.046
-0.044
Token.
Token ID13
Feature activation+2.30e-02

Pos logit contributions
arth+0.021
factor+0.020
byss+0.020
asu+0.020
Factor+0.019
Neg logit contributions
ubs-0.024
mel-0.022
cover-0.020
opus-0.020
-0.019
Token
Token ID198
Feature activation+4.61e-02

Pos logit contributions
arth+0.014
factor+0.014
byss+0.013
asu+0.013
Factor+0.013
Neg logit contributions
ubs-0.017
mel-0.016
cover-0.014
opus-0.014
-0.013
Token
Token ID198
Feature activation+5.27e-02

Pos logit contributions
arth+0.028
factor+0.027
asu+0.026
byss+0.026
Factor+0.026
Neg logit contributions
ubs-0.034
mel-0.032
cover-0.029
opus-0.029
-0.027
TokenCo
Token ID7222
Feature activation-2.92e-02

Pos logit contributions
arth+0.033
factor+0.032
byss+0.031
asu+0.031
Factor+0.031
Neg logit contributions
ubs-0.038
mel-0.036
cover-0.032
opus-0.031
-0.029
Tokeninc
Token ID1939
Feature activation-4.07e-02

Pos logit contributions
ubs+0.020
mel+0.019
cover+0.017
opus+0.016
+0.016
Neg logit contributions
arth-0.019
factor-0.019
byss-0.018
asu-0.018
Factor-0.018
Tokenidentally
Token ID23961
Feature activation+3.13e-01

Pos logit contributions
ubs+0.039
mel+0.038
cover+0.035
opus+0.034
+0.033
Neg logit contributions
arth-0.015
factor-0.015
byss-0.014
asu-0.014
Factor-0.014
Token,
Token ID11
Feature activation-4.01e-02

Pos logit contributions
arth+0.207
factor+0.203
byss+0.198
asu+0.197
Factor+0.194
Neg logit contributions
ubs-0.213
mel-0.201
cover-0.180
opus-0.174
-0.164
Token the
Token ID262
Feature activation-4.57e-02

Pos logit contributions
ubs+0.030
mel+0.029
cover+0.026
opus+0.025
+0.024
Neg logit contributions
arth-0.024
factor-0.023
byss-0.022
asu-0.022
Factor-0.022
Token Doors
Token ID33775
Feature activation-2.82e-02

Pos logit contributions
ubs+0.033
mel+0.032
cover+0.029
opus+0.028
+0.026
Neg logit contributions
arth-0.028
factor-0.027
byss-0.026
asu-0.026
Factor-0.026
Token were
Token ID547
Feature activation+3.14e-02

Pos logit contributions
ubs+0.019
mel+0.018
cover+0.017
opus+0.016
+0.015
Neg logit contributions
arth-0.018
factor-0.018
asu-0.017
byss-0.017
Factor-0.017
Token formed
Token ID7042
Feature activation-9.45e-02

Pos logit contributions
arth+0.019
factor+0.019
byss+0.018
asu+0.018
Factor+0.018
Neg logit contributions
ubs-0.023
mel-0.022
cover-0.019
opus-0.019
-0.018
Token source
Token ID2723
Feature activation+1.46e-02

Pos logit contributions
ubs+0.044
mel+0.041
cover+0.038
opus+0.037
+0.035
Neg logit contributions
arth-0.036
factor-0.035
byss-0.034
asu-0.034
Factor-0.033
Token (
Token ID357
Feature activation-5.09e-02

Pos logit contributions
arth+0.009
factor+0.009
byss+0.008
asu+0.008
Factor+0.008
Neg logit contributions
ubs-0.011
mel-0.010
cover-0.009
opus-0.009
-0.009
Tokencom
Token ID785
Feature activation-5.75e-02

Pos logit contributions
ubs+0.037
mel+0.035
cover+0.031
opus+0.030
+0.029
Neg logit contributions
arth-0.032
factor-0.031
byss-0.030
asu-0.030
Factor-0.030
Tokenputers
Token ID41510
Feature activation+1.28e-01

Pos logit contributions
ubs+0.042
mel+0.040
cover+0.036
opus+0.035
+0.033
Neg logit contributions
arth-0.035
factor-0.034
byss-0.033
asu-0.033
Factor-0.033
Token and
Token ID290
Feature activation+8.95e-03

Pos logit contributions
arth+0.079
factor+0.078
byss+0.076
asu+0.075
Factor+0.074
Neg logit contributions
ubs-0.092
mel-0.087
cover-0.079
opus-0.076
-0.072
Token monitors
Token ID19374
Feature activation+2.96e-01

Pos logit contributions
arth+0.005
factor+0.005
byss+0.005
asu+0.005
Factor+0.005
Neg logit contributions
ubs-0.007
mel-0.006
cover-0.006
opus-0.005
-0.005
Token in
Token ID287
Feature activation+2.33e-02

Pos logit contributions
arth+0.185
factor+0.182
byss+0.177
asu+0.176
Factor+0.173
Neg logit contributions
ubs-0.212
mel-0.200
cover-0.181
opus-0.175
-0.166
Token our
Token ID674
Feature activation+1.05e-01

Pos logit contributions
arth+0.015
factor+0.015
byss+0.014
asu+0.014
Factor+0.014
Neg logit contributions
ubs-0.016
mel-0.016
cover-0.014
opus-0.014
-0.013
Token case
Token ID1339
Feature activation-2.39e-01

Pos logit contributions
arth+0.070
factor+0.069
byss+0.066
asu+0.066
Factor+0.065
Neg logit contributions
ubs-0.071
mel-0.067
cover-0.060
opus-0.058
-0.055
Token)
Token ID8
Feature activation-1.11e-03

Pos logit contributions
ubs+0.168
mel+0.159
cover+0.143
opus+0.139
+0.131
Neg logit contributions
arth-0.153
factor-0.150
byss-0.146
asu-0.145
Factor-0.143
Token but
Token ID475
Feature activation+1.16e-03

Pos logit contributions
ubs+0.001
mel+0.001
cover+0.001
opus+0.001
+0.001
Neg logit contributions
arth-0.001
factor-0.001
byss-0.001
asu-0.001
Factor-0.001
Token I
Token ID314
Feature activation+7.63e-02

Pos logit contributions
arth+0.021
factor+0.020
byss+0.020
asu+0.020
Factor+0.019
Neg logit contributions
ubs-0.025
mel-0.023
cover-0.021
opus-0.021
-0.019
Token only
Token ID691
Feature activation+1.07e-03

Pos logit contributions
arth+0.049
factor+0.048
asu+0.047
byss+0.046
Factor+0.046
Neg logit contributions
ubs-0.053
mel-0.050
cover-0.045
opus-0.044
-0.042
Token dropped
Token ID5710
Feature activation+3.51e-02

Pos logit contributions
arth+0.001
factor+0.001
asu+0.001
byss+0.001
Factor+0.001
Neg logit contributions
ubs-0.001
mel-0.001
cover-0.001
opus-0.001
-0.001
Token it
Token ID340
Feature activation+3.93e-02

Pos logit contributions
arth+0.020
factor+0.020
byss+0.019
asu+0.019
Factor+0.019
Neg logit contributions
ubs-0.027
mel-0.025
cover-0.023
opus-0.023
-0.021
Token because
Token ID780
Feature activation-1.05e-01

Pos logit contributions
arth+0.023
factor+0.023
byss+0.022
asu+0.022
Factor+0.021
Neg logit contributions
ubs-0.030
mel-0.028
cover-0.026
opus-0.025
-0.024
Token of
Token ID286
Feature activation+7.34e-02

Pos logit contributions
ubs+0.079
mel+0.075
cover+0.068
opus+0.066
+0.063
Neg logit contributions
arth-0.061
factor-0.060
byss-0.058
asu-0.058
Factor-0.057
Token budgetary
Token ID47560
Feature activation+1.21e-02

Pos logit contributions
arth+0.043
factor+0.042
byss+0.041
asu+0.040
Factor+0.040
Neg logit contributions
ubs-0.056
mel-0.053
cover-0.048
opus-0.047
-0.044
Token needs
Token ID2476
Feature activation-1.22e-01

Pos logit contributions
arth+0.006
factor+0.006
byss+0.006
asu+0.006
Factor+0.006
Neg logit contributions
ubs-0.010
mel-0.009
cover-0.009
opus-0.008
-0.008
Token.
Token ID13
Feature activation+1.77e-02

Pos logit contributions
ubs+0.085
mel+0.081
cover+0.073
opus+0.070
+0.066
Neg logit contributions
arth-0.078
factor-0.076
byss-0.075
asu-0.074
Factor-0.073
Token This
Token ID770
Feature activation-1.54e-04

Pos logit contributions
arth+0.011
factor+0.011
byss+0.010
asu+0.010
Factor+0.010
Neg logit contributions
ubs-0.013
mel-0.012
cover-0.011
opus-0.011
-0.010
Token book
Token ID1492
Feature activation+9.60e-03

Pos logit contributions
ubs+0.000
mel+0.000
cover+0.000
opus+0.000
+0.000
Neg logit contributions
arth-0.000
factor-0.000
asu-0.000
byss-0.000
Factor-0.000
Token/
Token ID14
Feature activation-3.24e-02

Pos logit contributions
ubs+0.046
mel+0.043
cover+0.039
opus+0.038
+0.035
Neg logit contributions
arth-0.049
factor-0.048
byss-0.047
asu-0.046
Factor-0.046
Tokenapi
Token ID15042
Feature activation-5.69e-02

Pos logit contributions
ubs+0.022
mel+0.021
cover+0.019
opus+0.018
+0.017
Neg logit contributions
arth-0.021
factor-0.021
byss-0.020
asu-0.020
Factor-0.020
Token instead
Token ID2427
Feature activation-8.52e-02

Pos logit contributions
ubs+0.037
mel+0.035
cover+0.031
opus+0.030
+0.028
Neg logit contributions
arth-0.039
factor-0.039
byss-0.038
asu-0.038
Factor-0.037
Token.
Token ID13
Feature activation+2.03e-02

Pos logit contributions
ubs+0.061
mel+0.058
cover+0.052
opus+0.050
+0.048
Neg logit contributions
arth-0.053
factor-0.052
byss-0.051
asu-0.051
Factor-0.050
Token So
Token ID1406
Feature activation-2.52e-02

Pos logit contributions
arth+0.012
factor+0.012
asu+0.012
byss+0.012
Factor+0.011
Neg logit contributions
ubs-0.015
mel-0.014
cover-0.013
opus-0.012
-0.012
Token make
Token ID787
Feature activation+1.74e-02

Pos logit contributions
ubs+0.016
mel+0.015
cover+0.014
opus+0.013
+0.012
Neg logit contributions
arth-0.018
factor-0.017
asu-0.017
byss-0.017
Factor-0.016
Token sure
Token ID1654
Feature activation+1.78e-01

Pos logit contributions
arth+0.007
factor+0.007
byss+0.007
asu+0.007
Factor+0.006
Neg logit contributions
ubs-0.016
mel-0.016
cover-0.014
opus-0.014
-0.014
Token to
Token ID284
Feature activation+2.38e-02

Pos logit contributions
arth+0.109
factor+0.107
byss+0.104
asu+0.103
Factor+0.101
Neg logit contributions
ubs-0.129
mel-0.123
cover-0.111
opus-0.108
-0.102
Token double
Token ID4274
Feature activation+1.06e-01

Pos logit contributions
arth+0.014
factor+0.014
asu+0.014
byss+0.014
Factor+0.013
Neg logit contributions
ubs-0.017
mel-0.017
cover-0.015
opus-0.014
-0.014
Token check
Token ID2198
Feature activation+8.19e-02

Pos logit contributions
arth+0.066
factor+0.064
byss+0.062
asu+0.062
Factor+0.061
Neg logit contributions
ubs-0.077
mel-0.073
cover-0.066
opus-0.064
-0.060
Token this
Token ID428
Feature activation-2.42e-02

Pos logit contributions
arth+0.049
factor+0.048
byss+0.047
asu+0.047
Factor+0.046
Neg logit contributions
ubs-0.060
mel-0.057
cover-0.052
opus-0.050
-0.048
Token May
Token ID1737
Feature activation-5.66e-02

Pos logit contributions
arth+0.011
factor+0.010
asu+0.010
byss+0.010
Factor+0.010
Neg logit contributions
ubs-0.013
mel-0.012
cover-0.011
opus-0.010
-0.010
Token 21
Token ID2310
Feature activation-7.40e-02

Pos logit contributions
ubs+0.040
mel+0.038
cover+0.035
opus+0.034
+0.032
Neg logit contributions
arth-0.035
factor-0.035
asu-0.034
byss-0.034
Factor-0.033
Token,
Token ID11
Feature activation+2.75e-02

Pos logit contributions
ubs+0.055
mel+0.052
cover+0.048
opus+0.046
+0.044
Neg logit contributions
arth-0.044
factor-0.043
byss-0.042
asu-0.042
Factor-0.041
Token and
Token ID290
Feature activation+1.56e-02

Pos logit contributions
arth+0.017
factor+0.016
byss+0.016
asu+0.016
Factor+0.015
Neg logit contributions
ubs-0.020
mel-0.019
cover-0.017
opus-0.017
-0.016
Token out
Token ID503
Feature activation+4.73e-02

Pos logit contributions
arth+0.010
factor+0.009
asu+0.009
byss+0.009
Factor+0.009
Neg logit contributions
ubs-0.011
mel-0.011
cover-0.010
opus-0.009
-0.009
Token on
Token ID319
Feature activation+2.06e-02

Pos logit contributions
arth+0.029
factor+0.028
byss+0.028
asu+0.027
Factor+0.027
Neg logit contributions
ubs-0.035
mel-0.033
cover-0.030
opus-0.029
-0.027
Token Monday
Token ID3321
Feature activation+5.67e-02

Pos logit contributions
arth+0.014
factor+0.014
byss+0.014
asu+0.014
Factor+0.013
Neg logit contributions
ubs-0.013
mel-0.013
cover-0.011
opus-0.011
-0.010
Token,
Token ID11
Feature activation+3.11e-02

Pos logit contributions
arth+0.038
factor+0.037
byss+0.036
asu+0.036
Factor+0.035
Neg logit contributions
ubs-0.038
mel-0.036
cover-0.032
opus-0.031
-0.029
Token Memorial
Token ID14861
Feature activation-2.02e-01

Pos logit contributions
arth+0.024
factor+0.024
byss+0.024
asu+0.023
Factor+0.023
Neg logit contributions
ubs-0.017
mel-0.016
cover-0.014
opus-0.013
-0.012
Token Day
Token ID3596
Feature activation+7.85e-02

Pos logit contributions
ubs+0.133
mel+0.126
cover+0.112
opus+0.108
+0.102
Neg logit contributions
arth-0.138
factor-0.136
byss-0.132
asu-0.132
Factor-0.129
Token.
Token ID13
Feature activation+3.26e-02

Pos logit contributions
arth+0.055
factor+0.054
byss+0.053
asu+0.053
Factor+0.052
Neg logit contributions
ubs-0.050
mel-0.047
cover-0.042
opus-0.041
-0.038
Token://
Token ID1378
Feature activation+1.02e-01

Pos logit contributions
ubs+0.101
mel+0.097
cover+0.089
opus+0.087
+0.083
Neg logit contributions
arth-0.055
factor-0.053
byss-0.052
asu-0.051
Factor-0.050
Token api
Token ID40391
Feature activation-2.64e-04

Pos logit contributions
arth+0.065
factor+0.063
byss+0.062
asu+0.061
Factor+0.060
Neg logit contributions
ubs-0.071
mel-0.068
cover-0.061
opus-0.059
-0.056
Token.
Token ID764
Feature activation-4.52e-03

Pos logit contributions
ubs+0.000
mel+0.000
cover+0.000
opus+0.000
+0.000
Neg logit contributions
arth-0.000
factor-0.000
byss-0.000
asu-0.000
Factor-0.000
Tokennice
Token ID44460
Feature activation+1.01e-01

Pos logit contributions
ubs+0.003
mel+0.003
cover+0.003
opus+0.003
+0.003
Neg logit contributions
arth-0.003
factor-0.003
asu-0.003
byss-0.003
Factor-0.003
Tokenhash
Token ID17831
Feature activation-6.41e-02

Pos logit contributions
arth+0.070
factor+0.069
asu+0.067
byss+0.066
Factor+0.066
Neg logit contributions
ubs-0.065
mel-0.061
cover-0.054
opus-0.053
-0.050
Token.
Token ID13
Feature activation+1.22e-02

Pos logit contributions
ubs+0.046
mel+0.044
cover+0.039
opus+0.038
+0.036
Neg logit contributions
arth-0.040
factor-0.039
asu-0.038
byss-0.037
Factor-0.037
Tokencom
Token ID785
Feature activation-7.08e-02

Pos logit contributions
arth+0.009
factor+0.009
byss+0.008
asu+0.008
Factor+0.008
Neg logit contributions
ubs-0.008
mel-0.007
cover-0.006
opus-0.006
-0.006
Token/
Token ID14
Feature activation-3.24e-02

Pos logit contributions
ubs+0.046
mel+0.043
cover+0.039
opus+0.038
+0.035
Neg logit contributions
arth-0.049
factor-0.048
byss-0.047
asu-0.046
Factor-0.046
Tokenapi
Token ID15042
Feature activation-5.69e-02

Pos logit contributions
ubs+0.022
mel+0.021
cover+0.019
opus+0.018
+0.017
Neg logit contributions
arth-0.021
factor-0.021
byss-0.020
asu-0.020
Factor-0.020
Token instead
Token ID2427
Feature activation-8.52e-02

Pos logit contributions
ubs+0.037
mel+0.035
cover+0.031
opus+0.030
+0.028
Neg logit contributions
arth-0.039
factor-0.039
byss-0.038
asu-0.038
Factor-0.037
Token.
Token ID13
Feature activation+2.03e-02

Pos logit contributions
ubs+0.061
mel+0.058
cover+0.052
opus+0.050
+0.048
Neg logit contributions
arth-0.053
factor-0.052
byss-0.051
asu-0.051
Factor-0.050
Token Tang
Token ID18816
Feature activation-7.39e-02

Pos logit contributions
ubs+0.020
mel+0.019
cover+0.017
opus+0.017
+0.016
Neg logit contributions
arth-0.016
factor-0.016
asu-0.015
byss-0.015
Factor-0.015
Token down
Token ID866
Feature activation+7.31e-02

Pos logit contributions
ubs+0.054
mel+0.051
cover+0.046
opus+0.045
+0.042
Neg logit contributions
arth-0.045
factor-0.044
byss-0.043
asu-0.043
Factor-0.042
Token,
Token ID11
Feature activation-7.75e-05

Pos logit contributions
arth+0.045
factor+0.044
byss+0.042
asu+0.042
Factor+0.041
Neg logit contributions
ubs-0.054
mel-0.051
cover-0.046
opus-0.045
-0.042
Token and
Token ID290
Feature activation-2.75e-02

Pos logit contributions
ubs+0.000
mel+0.000
cover+0.000
opus+0.000
+0.000
Neg logit contributions
arth-0.000
factor-0.000
byss-0.000
asu-0.000
Factor-0.000
Token set
Token ID900
Feature activation+1.22e-01

Pos logit contributions
ubs+0.020
mel+0.019
cover+0.017
opus+0.016
+0.016
Neg logit contributions
arth-0.017
factor-0.017
asu-0.016
byss-0.016
Factor-0.016
Token his
Token ID465
Feature activation+8.12e-02

Pos logit contributions
arth+0.071
factor+0.070
asu+0.067
byss+0.067
Factor+0.066
Neg logit contributions
ubs-0.092
mel-0.087
cover-0.079
opus-0.077
-0.073
Token vengeance
Token ID30657
Feature activation-2.60e-02

Pos logit contributions
arth+0.046
factor+0.045
byss+0.044
asu+0.044
Factor+0.043
Neg logit contributions
ubs-0.063
mel-0.060
cover-0.054
opus-0.053
-0.050
Token into
Token ID656
Feature activation-5.28e-02

Pos logit contributions
ubs+0.019
mel+0.018
cover+0.016
opus+0.016
+0.015
Neg logit contributions
arth-0.016
factor-0.015
byss-0.015
asu-0.015
Factor-0.015
Token motion
Token ID6268
Feature activation+1.92e-01

Pos logit contributions
ubs+0.044
mel+0.042
cover+0.039
opus+0.038
+0.036
Neg logit contributions
arth-0.027
factor-0.026
byss-0.025
asu-0.025
Factor-0.024
Token.
Token ID13
Feature activation-3.72e-03

Pos logit contributions
arth+0.120
factor+0.117
byss+0.114
asu+0.114
Factor+0.112
Neg logit contributions
ubs-0.138
mel-0.131
cover-0.118
opus-0.115
-0.108
Token While
Token ID2893
Feature activation-1.64e-02

Pos logit contributions
ubs+0.003
mel+0.003
cover+0.002
opus+0.002
+0.002
Neg logit contributions
arth-0.002
factor-0.002
asu-0.002
byss-0.002
Factor-0.002
Tokenge
Token ID469
Feature activation-7.60e-02

Pos logit contributions
arth+0.021
factor+0.020
byss+0.020
asu+0.020
Factor+0.019
Neg logit contributions
ubs-0.023
mel-0.022
cover-0.020
opus-0.019
-0.018
Tokenois
Token ID10924
Feature activation+1.00e-01

Pos logit contributions
ubs+0.070
mel+0.067
cover+0.062
opus+0.060
+0.058
Neg logit contributions
arth-0.032
factor-0.032
byss-0.030
asu-0.030
Factor-0.029
Token intellig
Token ID10878
Feature activation+1.09e-01

Pos logit contributions
arth+0.053
factor+0.051
byss+0.050
asu+0.049
Factor+0.048
Neg logit contributions
ubs-0.082
mel-0.078
cover-0.071
opus-0.070
-0.066
Tokenents
Token ID658
Feature activation-1.44e-01

Pos logit contributions
arth+0.070
factor+0.069
byss+0.067
asu+0.067
Factor+0.065
Neg logit contributions
ubs-0.076
mel-0.072
cover-0.065
opus-0.063
-0.059
Tokenia
Token ID544
Feature activation-1.03e-01

Pos logit contributions
ubs+0.095
mel+0.090
cover+0.080
opus+0.077
+0.074
Neg logit contributions
arth-0.097
factor-0.096
asu-0.093
byss-0.092
Factor-0.091
Token resisted
Token ID26643
Feature activation+4.16e-03

Pos logit contributions
ubs+0.072
mel+0.068
cover+0.061
opus+0.059
+0.056
Neg logit contributions
arth-0.066
factor-0.065
byss-0.063
asu-0.063
Factor-0.062
Token en
Token ID551
Feature activation+1.17e-02

Pos logit contributions
arth+0.003
factor+0.002
byss+0.002
asu+0.002
Factor+0.002
Neg logit contributions
ubs-0.003
mel-0.003
cover-0.003
opus-0.003
-0.002
Token masse
Token ID41869
Feature activation+1.84e-01

Pos logit contributions
arth+0.007
factor+0.007
byss+0.007
asu+0.007
Factor+0.007
Neg logit contributions
ubs-0.009
mel-0.008
cover-0.007
opus-0.007
-0.007
Token letting
Token ID9616
Feature activation-6.51e-04

Pos logit contributions
arth+0.113
factor+0.111
byss+0.108
asu+0.108
Factor+0.105
Neg logit contributions
ubs-0.133
mel-0.126
cover-0.114
opus-0.111
-0.105
Token itself
Token ID2346
Feature activation-1.24e-01

Pos logit contributions
ubs+0.000
mel+0.000
cover+0.000
opus+0.000
+0.000
Neg logit contributions
arth-0.000
factor-0.000
byss-0.000
asu-0.000
Factor-0.000
Token be
Token ID307
Feature activation+3.75e-02

Pos logit contributions
ubs+0.095
mel+0.090
cover+0.082
opus+0.080
+0.076
Neg logit contributions
arth-0.072
factor-0.070
byss-0.068
asu-0.068
Factor-0.066
Token loyal
Token ID9112
Feature activation-2.42e-01

Pos logit contributions
ubs+0.002
mel+0.002
cover+0.002
opus+0.002
+0.001
Neg logit contributions
arth-0.002
factor-0.002
byss-0.002
asu-0.002
Factor-0.002
Token when
Token ID618
Feature activation+3.41e-02

Pos logit contributions
ubs+0.173
mel+0.164
cover+0.149
opus+0.143
+0.135
Neg logit contributions
arth-0.150
factor-0.148
byss-0.145
asu-0.143
Factor-0.141
Token it
Token ID340
Feature activation+4.74e-02

Pos logit contributions
arth+0.020
factor+0.020
asu+0.019
byss+0.019
Factor+0.019
Neg logit contributions
ubs-0.026
mel-0.024
cover-0.022
opus-0.021
-0.020
Token comes
Token ID2058
Feature activation+6.66e-02

Pos logit contributions
arth+0.025
factor+0.024
asu+0.023
byss+0.023
Factor+0.023
Neg logit contributions
ubs-0.039
mel-0.037
cover-0.034
opus-0.033
-0.031
Token to
Token ID284
Feature activation-2.88e-02

Pos logit contributions
arth+0.038
factor+0.038
byss+0.036
asu+0.036
Factor+0.036
Neg logit contributions
ubs-0.051
mel-0.048
cover-0.044
opus-0.043
-0.041
Token white
Token ID2330
Feature activation-1.45e-01

Pos logit contributions
ubs+0.020
mel+0.019
cover+0.017
opus+0.017
+0.016
Neg logit contributions
arth-0.018
factor-0.018
byss-0.018
asu-0.018
Factor-0.017
Token people
Token ID661
Feature activation+2.90e-04

Pos logit contributions
ubs+0.106
mel+0.100
cover+0.091
opus+0.088
+0.083
Neg logit contributions
arth-0.089
factor-0.087
asu-0.084
byss-0.084
Factor-0.083
Token with
Token ID351
Feature activation-2.96e-02

Pos logit contributions
arth+0.000
factor+0.000
byss+0.000
asu+0.000
Factor+0.000
Neg logit contributions
ubs-0.000
mel-0.000
cover-0.000
opus-0.000
-0.000
Token money
Token ID1637
Feature activation+8.08e-03

Pos logit contributions
ubs+0.021
mel+0.020
cover+0.018
opus+0.017
+0.016
Neg logit contributions
arth-0.019
factor-0.018
byss-0.018
asu-0.018
Factor-0.018
Token and
Token ID290
Feature activation-1.47e-03

Pos logit contributions
arth+0.005
factor+0.005
byss+0.005
asu+0.005
Factor+0.005
Neg logit contributions
ubs-0.005
mel-0.005
cover-0.005
opus-0.004
-0.004
Token coat
Token ID13209
Feature activation-1.88e-01

Pos logit contributions
ubs+0.001
mel+0.001
cover+0.001
opus+0.001
+0.001
Neg logit contributions
arth-0.001
factor-0.001
asu-0.001
byss-0.001
Factor-0.001
Token reject
Token ID4968
Feature activation+4.71e-02

Pos logit contributions
arth+0.020
factor+0.019
asu+0.019
byss+0.019
Factor+0.019
Neg logit contributions
ubs-0.021
mel-0.020
cover-0.018
opus-0.018
-0.017
Token and
Token ID290
Feature activation+4.89e-02

Pos logit contributions
arth+0.028
factor+0.028
byss+0.027
asu+0.027
Factor+0.026
Neg logit contributions
ubs-0.035
mel-0.033
cover-0.030
opus-0.029
-0.028
Token protest
Token ID5402
Feature activation-6.57e-02

Pos logit contributions
arth+0.031
factor+0.030
asu+0.029
byss+0.029
Factor+0.029
Neg logit contributions
ubs-0.035
mel-0.033
cover-0.030
opus-0.029
-0.027
Token ideas
Token ID4213
Feature activation+1.36e-01

Pos logit contributions
ubs+0.049
mel+0.046
cover+0.042
opus+0.041
+0.039
Neg logit contributions
arth-0.039
factor-0.038
byss-0.037
asu-0.037
Factor-0.036
Token we
Token ID356
Feature activation+1.82e-02

Pos logit contributions
arth+0.083
factor+0.081
asu+0.079
byss+0.078
Factor+0.077
Neg logit contributions
ubs-0.099
mel-0.093
cover-0.085
opus-0.082
-0.078
Token don
Token ID836
Feature activation-6.79e-02

Pos logit contributions
arth+0.012
factor+0.012
asu+0.011
byss+0.011
Factor+0.011
Neg logit contributions
ubs-0.012
mel-0.012
cover-0.011
opus-0.010
-0.010
Token't
Token ID470
Feature activation+3.75e-02

Pos logit contributions
ubs+0.049
mel+0.046
cover+0.042
opus+0.041
+0.038
Neg logit contributions
arth-0.042
factor-0.041
asu-0.040
byss-0.040
Factor-0.039
Token agree
Token ID4236
Feature activation+8.97e-02

Pos logit contributions
arth+0.022
factor+0.022
asu+0.021
Factor+0.021
byss+0.021
Neg logit contributions
ubs-0.028
mel-0.026
cover-0.024
opus-0.023
-0.022
Token with
Token ID351
Feature activation+5.65e-02

Pos logit contributions
arth+0.055
factor+0.054
byss+0.052
asu+0.052
Factor+0.051
Neg logit contributions
ubs-0.066
mel-0.062
cover-0.056
opus-0.055
-0.052
Token.
Token ID13
Feature activation+8.35e-02

Pos logit contributions
arth+0.036
factor+0.035
byss+0.034
asu+0.034
Factor+0.033
Neg logit contributions
ubs-0.040
mel-0.038
cover-0.034
opus-0.033
-0.031
Token That
Token ID1320
Feature activation+5.81e-02

Pos logit contributions
arth+0.056
factor+0.055
asu+0.053
byss+0.053
Factor+0.053
Neg logit contributions
ubs-0.056
mel-0.053
cover-0.047
opus-0.046
-0.043
Token get
Token ID651
Feature activation-4.86e-02

Pos logit contributions
ubs+0.035
mel+0.034
cover+0.030
opus+0.030
+0.028
Neg logit contributions
arth-0.029
factor-0.029
asu-0.028
byss-0.028
Factor-0.027
Token her
Token ID607
Feature activation-8.36e-02

Pos logit contributions
ubs+0.035
mel+0.034
cover+0.030
opus+0.029
+0.028
Neg logit contributions
arth-0.030
factor-0.029
asu-0.028
byss-0.028
Factor-0.028
Token pencil
Token ID21613
Feature activation-8.79e-03

Pos logit contributions
ubs+0.060
mel+0.057
cover+0.051
opus+0.050
+0.047
Neg logit contributions
arth-0.052
factor-0.051
asu-0.050
byss-0.049
Factor-0.049
Tokens
Token ID82
Feature activation+1.00e-01

Pos logit contributions
ubs+0.007
mel+0.006
cover+0.006
opus+0.006
+0.005
Neg logit contributions
arth-0.005
factor-0.005
asu-0.005
byss-0.005
Factor-0.005
Token and
Token ID290
Feature activation-2.26e-02

Pos logit contributions
arth+0.060
factor+0.058
asu+0.057
byss+0.055
Factor+0.055
Neg logit contributions
ubs-0.074
mel-0.070
cover-0.064
opus-0.063
-0.060
Token help
Token ID1037
Feature activation-7.53e-02

Pos logit contributions
ubs+0.016
mel+0.015
cover+0.014
opus+0.014
+0.013
Neg logit contributions
arth-0.014
factor-0.014
asu-0.013
byss-0.013
Factor-0.013
Token color
Token ID3124
Feature activation-3.11e-02

Pos logit contributions
ubs+0.055
mel+0.052
cover+0.047
opus+0.046
+0.044
Neg logit contributions
arth-0.046
factor-0.045
byss-0.043
asu-0.043
Factor-0.042
Token the
Token ID262
Feature activation-1.93e-02

Pos logit contributions
ubs+0.023
mel+0.022
cover+0.020
opus+0.019
+0.018
Neg logit contributions
arth-0.019
factor-0.018
byss-0.018
asu-0.018
Factor-0.017
Token maps
Token ID8739
Feature activation+1.83e-01

Pos logit contributions
ubs+0.013
mel+0.013
cover+0.011
opus+0.011
+0.011
Neg logit contributions
arth-0.012
factor-0.012
asu-0.012
byss-0.012
Factor-0.012
Token I
Token ID314
Feature activation+1.95e-02

Pos logit contributions
arth+0.115
factor+0.112
asu+0.109
byss+0.109
Factor+0.107
Neg logit contributions
ubs-0.131
mel-0.124
cover-0.111
opus-0.109
-0.103
Token took
Token ID1718
Feature activation-1.64e-02

Pos logit contributions
arth+0.012
factor+0.012
asu+0.011
byss+0.011
Factor+0.011
Neg logit contributions
ubs-0.014
mel-0.013
cover-0.012
opus-0.012
-0.011
Token in
Token ID287
Feature activation-3.27e-02

Pos logit contributions
ubs+0.171
mel+0.162
cover+0.146
opus+0.142
+0.134
Neg logit contributions
arth-0.151
factor-0.148
byss-0.144
asu-0.143
Factor-0.141
Token 2006
Token ID4793
Feature activation+1.04e-02

Pos logit contributions
ubs+0.025
mel+0.023
cover+0.021
opus+0.021
+0.020
Neg logit contributions
arth-0.019
factor-0.019
byss-0.018
asu-0.018
Factor-0.018
Token.
Token ID13
Feature activation-3.82e-02

Pos logit contributions
arth+0.006
factor+0.006
byss+0.006
asu+0.006
Factor+0.006
Neg logit contributions
ubs-0.007
mel-0.007
cover-0.006
opus-0.006
-0.006
Token This
Token ID770
Feature activation-6.17e-02

Pos logit contributions
ubs+0.028
mel+0.027
cover+0.024
opus+0.023
+0.022
Neg logit contributions
arth-0.023
factor-0.023
byss-0.022
asu-0.022
Factor-0.022
Token Finnish
Token ID26838
Feature activation+9.49e-02

Pos logit contributions
ubs+0.044
mel+0.041
cover+0.037
opus+0.036
+0.034
Neg logit contributions
arth-0.039
factor-0.038
asu-0.037
byss-0.037
Factor-0.036
Token album
Token ID5062
Feature activation-1.47e-01

Pos logit contributions
arth+0.060
factor+0.059
asu+0.057
byss+0.057
Factor+0.056
Neg logit contributions
ubs-0.067
mel-0.063
cover-0.057
opus-0.055
-0.052
Token contains
Token ID4909
Feature activation-1.13e-02

Pos logit contributions
ubs+0.104
mel+0.098
cover+0.088
opus+0.086
+0.081
Neg logit contributions
arth-0.094
factor-0.092
byss-0.089
asu-0.089
Factor-0.087
Token no
Token ID645
Feature activation-3.12e-02

Pos logit contributions
ubs+0.008
mel+0.008
cover+0.007
opus+0.007
+0.006
Neg logit contributions
arth-0.007
factor-0.007
byss-0.007
asu-0.007
Factor-0.006
Token original
Token ID2656
Feature activation-3.93e-02

Pos logit contributions
ubs+0.022
mel+0.021
cover+0.019
opus+0.018
+0.017
Neg logit contributions
arth-0.020
factor-0.019
byss-0.019
asu-0.019
Factor-0.018
Token lyrics
Token ID15844
Feature activation+2.08e-02

Pos logit contributions
ubs+0.027
mel+0.026
cover+0.023
opus+0.022
+0.021
Neg logit contributions
arth-0.026
factor-0.025
byss-0.024
asu-0.024
Factor-0.024
Token by
Token ID416
Feature activation-3.58e-02

Pos logit contributions
arth+0.013
factor+0.012
byss+0.012
asu+0.012
Factor+0.012
Neg logit contributions
ubs-0.015
mel-0.014
cover-0.013
opus-0.013
-0.012
Token)
Token ID8
Feature activation-2.75e-02

Pos logit contributions
ubs+0.019
mel+0.018
cover+0.016
opus+0.016
+0.015
Neg logit contributions
arth-0.017
factor-0.016
asu-0.016
byss-0.016
Factor-0.016
Token
Token ID198
Feature activation+2.45e-02

Pos logit contributions
ubs+0.020
mel+0.019
cover+0.017
opus+0.017
+0.016
Neg logit contributions
arth-0.017
factor-0.016
byss-0.016
asu-0.016
Factor-0.016
Token
Token ID198
Feature activation+1.56e-02

Pos logit contributions
arth+0.014
factor+0.014
asu+0.014
Factor+0.013
byss+0.013
Neg logit contributions
ubs-0.018
mel-0.017
cover-0.016
opus-0.015
-0.015
Token,
Token ID11
Feature activation+1.36e-02

Pos logit contributions
arth+0.010
factor+0.009
asu+0.009
byss+0.009
Factor+0.009
Neg logit contributions
ubs-0.011
mel-0.011
cover-0.010
opus-0.009
-0.009
Token as
Token ID355
Feature activation+5.35e-03

Pos logit contributions
arth+0.008
factor+0.007
asu+0.007
byss+0.007
Factor+0.007
Neg logit contributions
ubs-0.011
mel-0.010
cover-0.009
opus-0.009
-0.009
Token usual
Token ID6678
Feature activation-3.48e-01

Pos logit contributions
arth+0.005
factor+0.005
asu+0.005
Factor+0.005
 Carnegie+0.005
Neg logit contributions
ubs-0.002
mel-0.002
cover-0.001
opus-0.001
-0.001
Token,
Token ID11
Feature activation-7.36e-03

Pos logit contributions
ubs+0.242
mel+0.228
cover+0.205
opus+0.199
+0.188
Neg logit contributions
arth-0.225
factor-0.221
asu-0.214
byss-0.214
Factor-0.210
Token with
Token ID351
Feature activation-8.66e-03

Pos logit contributions
ubs+0.005
mel+0.005
cover+0.005
opus+0.005
+0.004
Neg logit contributions
arth-0.004
factor-0.004
asu-0.004
byss-0.004
Factor-0.004
Token berries
Token ID36322
Feature activation+5.59e-02

Pos logit contributions
ubs+0.007
mel+0.006
cover+0.006
opus+0.006
+0.005
Neg logit contributions
arth-0.005
factor-0.005
asu-0.005
byss-0.005
 oats-0.005
Token,
Token ID11
Feature activation-9.63e-03

Pos logit contributions
arth+0.036
factor+0.035
asu+0.034
byss+0.034
Factor+0.034
Neg logit contributions
ubs-0.039
mel-0.037
cover-0.033
opus-0.032
-0.030
Token but
Token ID475
Feature activation-2.36e-02

Pos logit contributions
ubs+0.007
mel+0.007
cover+0.006
opus+0.006
+0.005
Neg logit contributions
arth-0.006
factor-0.006
asu-0.006
byss-0.006
Factor-0.006
Token started
Token ID2067
Feature activation-1.37e-01

Pos logit contributions
arth+0.019
factor+0.019
asu+0.018
Factor+0.018
byss+0.018
Neg logit contributions
ubs-0.017
mel-0.016
cover-0.014
opus-0.014
-0.013
Token to
Token ID284
Feature activation-6.77e-03

Pos logit contributions
ubs+0.100
mel+0.095
cover+0.086
opus+0.083
+0.079
Neg logit contributions
arth-0.084
factor-0.082
byss-0.080
asu-0.079
Factor-0.078
Token unravel
Token ID30282
Feature activation-2.18e-02

Pos logit contributions
ubs+0.005
mel+0.005
cover+0.004
opus+0.004
+0.004
Neg logit contributions
arth-0.004
factor-0.004
asu-0.004
Factor-0.004
byss-0.004
Token for
Token ID329
Feature activation-2.44e-02

Pos logit contributions
ubs+0.016
mel+0.015
cover+0.014
opus+0.013
+0.013
Neg logit contributions
arth-0.013
factor-0.013
byss-0.013
asu-0.013
Factor-0.012
Token the
Token ID262
Feature activation-3.39e-02

Pos logit contributions
ubs+0.018
mel+0.017
cover+0.015
opus+0.015
+0.014
Neg logit contributions
arth-0.015
factor-0.015
byss-0.014
asu-0.014
Factor-0.014
Token Caps
Token ID23534
Feature activation-2.95e-01

Pos logit contributions
ubs+0.020
mel+0.019
cover+0.017
opus+0.016
+0.015
Neg logit contributions
arth-0.025
factor-0.025
byss-0.024
asu-0.024
Factor-0.024
Token,
Token ID11
Feature activation-7.45e-03

Pos logit contributions
ubs+0.209
mel+0.198
cover+0.179
opus+0.173
+0.164
Neg logit contributions
arth-0.186
factor-0.182
byss-0.178
asu-0.176
Factor-0.173
Token as
Token ID355
Feature activation-1.43e-02

Pos logit contributions
ubs+0.006
mel+0.005
cover+0.005
opus+0.005
+0.005
Neg logit contributions
arth-0.004
factor-0.004
byss-0.004
asu-0.004
Factor-0.004
Token St
Token ID520
Feature activation-1.92e-01

Pos logit contributions
ubs+0.010
mel+0.010
cover+0.009
opus+0.009
+0.008
Neg logit contributions
arth-0.009
factor-0.009
byss-0.008
asu-0.008
Factor-0.008
Token Louis
Token ID5593
Feature activation+3.75e-02

Pos logit contributions
ubs+0.133
mel+0.126
cover+0.113
opus+0.109
+0.104
Neg logit contributions
arth-0.124
factor-0.122
asu-0.118
byss-0.118
Factor-0.116
Token scored
Token ID7781
Feature activation-1.18e-01

Pos logit contributions
arth+0.024
factor+0.024
byss+0.023
asu+0.023
Factor+0.023
Neg logit contributions
ubs-0.026
mel-0.025
cover-0.022
opus-0.021
-0.020
Token itself
Token ID2346
Feature activation-1.48e-01

Pos logit contributions
ubs+0.064
mel+0.061
cover+0.055
opus+0.053
+0.050
Neg logit contributions
arth-0.057
factor-0.056
byss-0.055
asu-0.054
Factor-0.053
Token,
Token ID11
Feature activation-2.69e-02

Pos logit contributions
ubs+0.104
mel+0.098
cover+0.089
opus+0.086
+0.081
Neg logit contributions
arth-0.095
factor-0.093
byss-0.091
asu-0.090
Factor-0.088
Token and
Token ID290
Feature activation-1.78e-02

Pos logit contributions
ubs+0.020
mel+0.019
cover+0.017
opus+0.017
+0.016
Neg logit contributions
arth-0.016
factor-0.016
byss-0.015
asu-0.015
Factor-0.015
Token even
Token ID772
Feature activation-4.02e-02

Pos logit contributions
ubs+0.013
mel+0.012
cover+0.011
opus+0.011
+0.010
Neg logit contributions
arth-0.011
factor-0.011
byss-0.010
asu-0.010
Factor-0.010
Token then
Token ID788
Feature activation+1.53e-01

Pos logit contributions
ubs+0.029
mel+0.028
cover+0.025
opus+0.024
+0.023
Neg logit contributions
arth-0.025
factor-0.024
byss-0.023
asu-0.023
Factor-0.023
Token alleg
Token ID2868
Feature activation-3.71e-01

Pos logit contributions
arth+0.094
factor+0.092
byss+0.090
asu+0.089
Factor+0.088
Neg logit contributions
ubs-0.111
mel-0.105
cover-0.095
opus-0.092
-0.087
Tokenory
Token ID652
Feature activation-4.64e-02

Pos logit contributions
ubs+0.179
mel+0.165
cover+0.141
opus+0.134
+0.122
Neg logit contributions
arth-0.319
factor-0.314
byss-0.307
asu-0.307
Factor-0.302
Token was
Token ID373
Feature activation-2.30e-02

Pos logit contributions
ubs+0.032
mel+0.030
cover+0.027
opus+0.026
+0.024
Neg logit contributions
arth-0.030
factor-0.030
byss-0.029
asu-0.029
Factor-0.029
Token a
Token ID257
Feature activation-2.03e-02

Pos logit contributions
ubs+0.017
mel+0.016
cover+0.014
opus+0.014
+0.013
Neg logit contributions
arth-0.014
factor-0.014
byss-0.013
asu-0.013
Factor-0.013
Token weapon
Token ID4282
Feature activation-4.19e-02

Pos logit contributions
ubs+0.015
mel+0.014
cover+0.012
opus+0.012
+0.011
Neg logit contributions
arth-0.013
factor-0.012
byss-0.012
asu-0.012
Factor-0.012
Token in
Token ID287
Feature activation-5.05e-02

Pos logit contributions
ubs+0.031
mel+0.029
cover+0.027
opus+0.026
+0.024
Neg logit contributions
arth-0.025
factor-0.025
byss-0.024
asu-0.024
Factor-0.023
Tokens
Token ID82
Feature activation+2.74e-02

Pos logit contributions
arth+0.010
factor+0.009
byss+0.009
asu+0.009
Factor+0.009
Neg logit contributions
ubs-0.013
mel-0.013
cover-0.011
opus-0.011
-0.011
Token in
Token ID287
Feature activation-4.45e-03

Pos logit contributions
arth+0.017
factor+0.017
byss+0.016
asu+0.016
Factor+0.016
Neg logit contributions
ubs-0.020
mel-0.019
cover-0.017
opus-0.016
-0.015
Token the
Token ID262
Feature activation-2.82e-02

Pos logit contributions
ubs+0.003
mel+0.003
cover+0.003
opus+0.003
+0.003
Neg logit contributions
arth-0.003
factor-0.003
byss-0.003
asu-0.003
Factor-0.003
Token F
Token ID376
Feature activation+1.03e-01

Pos logit contributions
ubs+0.020
mel+0.019
cover+0.017
opus+0.017
+0.016
Neg logit contributions
arth-0.017
factor-0.017
byss-0.017
asu-0.016
Factor-0.016
TokenZ
Token ID57
Feature activation-2.18e-02

Pos logit contributions
arth+0.066
factor+0.065
asu+0.063
byss+0.063
Factor+0.062
Neg logit contributions
ubs-0.072
mel-0.068
cover-0.061
opus-0.060
-0.056
Token1000
Token ID12825
Feature activation-3.70e-01

Pos logit contributions
ubs+0.015
mel+0.015
cover+0.013
opus+0.013
+0.012
Neg logit contributions
arth-0.014
factor-0.014
byss-0.013
asu-0.013
Factor-0.013
Token/
Token ID14
Feature activation-1.95e-02

Pos logit contributions
ubs+0.267
mel+0.253
cover+0.228
opus+0.221
+0.209
Neg logit contributions
arth-0.230
factor-0.225
byss-0.219
asu-0.218
Factor-0.214
Token RX
Token ID24202
Feature activation-7.16e-02

Pos logit contributions
ubs+0.013
mel+0.013
cover+0.011
opus+0.011
+0.010
Neg logit contributions
arth-0.013
factor-0.012
byss-0.012
asu-0.012
Factor-0.012
Token100
Token ID3064
Feature activation-3.84e-02

Pos logit contributions
ubs+0.043
mel+0.041
cover+0.036
opus+0.035
+0.032
Neg logit contributions
arth-0.053
factor-0.052
asu-0.050
byss-0.050
Factor-0.050
TokenIII
Token ID10855
Feature activation+2.32e-01

Pos logit contributions
ubs+0.030
mel+0.028
cover+0.025
opus+0.025
+0.024
Neg logit contributions
arth-0.022
factor-0.022
byss-0.021
asu-0.021
Factor-0.020
Token,
Token ID11
Feature activation+8.41e-03

Pos logit contributions
arth+0.139
factor+0.136
byss+0.132
asu+0.132
Factor+0.129
Neg logit contributions
ubs-0.173
mel-0.164
cover-0.148
opus-0.144
-0.136
Token potential
Token ID2785
Feature activation+1.65e-01

Pos logit contributions
ubs+0.008
mel+0.008
cover+0.007
opus+0.007
+0.007
Neg logit contributions
arth-0.007
factor-0.006
asu-0.006
byss-0.006
Factor-0.006
Token running
Token ID2491
Feature activation+1.69e-01

Pos logit contributions
arth+0.115
factor+0.113
asu+0.110
byss+0.109
Factor+0.107
Neg logit contributions
ubs-0.106
mel-0.100
cover-0.089
opus-0.086
-0.081
Token mates
Token ID27973
Feature activation+1.93e-02

Pos logit contributions
arth+0.126
factor+0.124
asu+0.121
byss+0.119
Factor+0.119
Neg logit contributions
ubs-0.100
mel-0.094
cover-0.082
opus-0.080
-0.075
Token,
Token ID11
Feature activation-8.86e-03

Pos logit contributions
arth+0.010
factor+0.009
asu+0.009
byss+0.009
Factor+0.009
Neg logit contributions
ubs-0.016
mel-0.016
cover-0.014
opus-0.014
-0.013
Token V
Token ID569
Feature activation+2.12e-02

Pos logit contributions
ubs+0.006
mel+0.006
cover+0.005
opus+0.005
+0.005
Neg logit contributions
arth-0.006
factor-0.005
byss-0.005
asu-0.005
Factor-0.005
TokenPs
Token ID12016
Feature activation-3.12e-01

Pos logit contributions
arth+0.013
factor+0.013
byss+0.013
asu+0.012
Factor+0.012
Neg logit contributions
ubs-0.015
mel-0.014
cover-0.013
opus-0.013
-0.012
Token See
Token ID4091
Feature activation+3.16e-01

Pos logit contributions
ubs+0.228
mel+0.216
cover+0.196
opus+0.190
+0.180
Neg logit contributions
arth-0.190
factor-0.186
byss-0.181
asu-0.180
Factor-0.177
Token Gallery
Token ID12917
Feature activation-2.27e-01

Pos logit contributions
arth+0.263
factor+0.257
asu+0.253
byss+0.251
Factor+0.248
Neg logit contributions
ubs-0.162
mel-0.150
cover-0.128
opus-0.123
-0.113
Token Donald
Token ID3759
Feature activation-1.30e-02

Pos logit contributions
ubs+0.138
mel+0.129
cover+0.114
opus+0.110
+0.102
Neg logit contributions
arth-0.166
factor-0.164
byss-0.161
asu-0.159
Factor-0.157
Token Trump
Token ID1301
Feature activation-1.72e-01

Pos logit contributions
ubs+0.006
mel+0.005
cover+0.004
opus+0.004
+0.004
Neg logit contributions
arth-0.012
factor-0.011
asu-0.011
byss-0.011
Factor-0.011
Token passes
Token ID8318
Feature activation+5.29e-02

Pos logit contributions
ubs+0.109
mel+0.103
cover+0.091
opus+0.088
+0.083
Neg logit contributions
arth-0.121
factor-0.119
byss-0.117
asu-0.116
Factor-0.114