TOP ACTIVATIONS
MAX = 0.540

is to edit the vert ices of the model . Its
common technique to manipulate vert ices is called normal extr usion
by doing ABS OL UT ELY NOT H ING ? The
th Tata Steel Chess Chall engers "] [ Site " W
Man agers Ind ices .
current highly partisan atmosphere subs ides . We have
W asting Food - Night mares - E mber assing Shit
fake Republican and Democratic polit icans working for banks and oil
now know existed on Hispan iola relatively unchanged for over a
from experience about the injust ices they are forced to contend
I consulted Greg Nic ot ero on these bodies specifically and
of two sets of invent ories , two product catalog s
the coalition prov ides responsible economic leadership in tough
J oshi , Ok ami and Islam b oul i
L AND OF EXT REM ELY EXP ENS IVE BE ER
v . vertex . x yz += v . normal *
( April 19 ), Num ero Group will release the limited
it is possible if not probable that credit would not be
. Add to that some probable guidance from regulators to the
think bother , another probable an agram that I can

MIN ACTIVATIONS
MIN = -0.506

. co / ni WB qv fi W u Dan
ree fs , water - poll ution , environmental - impact
on the Effects of Atomic Radiation ( UN S CE AR
the beginning of the catastrophic phase , Sebastian P
On the heels of green lighting its first min iser ies
German - based Society for Radiation Protection , said of Japan
Twin Fro z r IV Thermal Design with Dust Removal Technology
Lake Ur mia a bi osphere reserve to encourage sustainable development
ancing , although Night c rawler s animation wasn
unn ings has decided to phase them out in Australia .
said that it will then phase out support for the browser
in , when Night c rawler notices that no one in
was riding her bike north bound on Pennsylvania Street , said
Chron o Trigger and Earth bound , tell us there
streaming five videos of high definition video a day .
prime minister , following Margaret Thatcher , told the BBC .
The proposed first phase would do two main things
, Pe pt ost re pt ococ cus , Gard n
caus ality and the defin itiveness of intentions in jazz improvis
bass driver , which is augmented by two passive radi ators

INTERVAL 0.278 - 0.540
CONTAINS 0.218%

the game a ' Pain ful ' 2 . 4 out
Arduino has enabled no v ices to build projects . However
ished with Bas ij milit iam en , and freed a
editorial last month , F FR F Staff Attorney Patrick Elliott
. That was Rob H oe gee 's episode and I

INTERVAL 0.017 - 0.278
CONTAINS 36.869%

, par och ial my country right or wrong
including this weekend s The Last Jedi , which
our faith outside the four walls . For a
but a lot happens : the characters go to a lot
(@ r math is _ the 1 st ) on Dec

INTERVAL -0.245 - 0.017
CONTAINS 62.452%

may have fallen , they were unable to find it .
asserted the right to pick and choose what laws to follow
. The top eight airports commanded about half of all passenger
They had played their characters for six years and I was
and they lack the lead - in character that indicates that

INTERVAL -0.506 - -0.245
CONTAINS 0.461%

hardware . But not all workload s : some are network
forced to rely on stored body fat for fuel . You
marketing campaign . Instead of Techn icol or , Hitchcock shot
official of the Sund arb ans in Bangladesh , told reporters
remake of my X - wing Target ing Computer app here
Token is
Token ID318
Feature activation-9.16e-03

Pos logit contributions
qv+0.010
nexus+0.010
 Defender+0.009
owered+0.009
finder+0.009
Neg logit contributions
ELY-0.009
yz-0.009
llah-0.009
ami-0.008
icans-0.008
Token to
Token ID284
Feature activation+7.44e-03

Pos logit contributions
qv+0.006
nexus+0.006
 Defender+0.005
owered+0.005
finder+0.005
Neg logit contributions
ELY-0.006
yz-0.006
llah-0.006
ami-0.006
icans-0.006
Token edit
Token ID4370
Feature activation+5.46e-03

Pos logit contributions
ELY+0.004
yz+0.004
llah+0.004
ami+0.004
icans+0.004
Neg logit contributions
qv-0.005
nexus-0.005
 Defender-0.004
owered-0.004
finder-0.004
Token the
Token ID262
Feature activation+1.32e-02

Pos logit contributions
ELY+0.004
yz+0.003
llah+0.003
ami+0.003
icans+0.003
Neg logit contributions
qv-0.004
nexus-0.004
 Defender-0.003
owered-0.003
finder-0.003
Token vert
Token ID9421
Feature activation+1.93e-01

Pos logit contributions
ELY+0.008
yz+0.008
llah+0.008
ami+0.008
icans+0.007
Neg logit contributions
qv-0.009
nexus-0.009
 Defender-0.008
owered-0.007
finder-0.007
Tokenices
Token ID1063
Feature activation+5.40e-01

Pos logit contributions
ELY+0.104
yz+0.100
llah+0.097
ami+0.093
icans+0.091
Neg logit contributions
qv-0.147
nexus-0.145
 Defender-0.129
owered-0.128
finder-0.127
Token of
Token ID286
Feature activation+7.80e-04

Pos logit contributions
ELY+0.342
yz+0.335
llah+0.322
ami+0.314
icans+0.308
Neg logit contributions
qv-0.359
nexus-0.354
 Defender-0.308
owered-0.304
finder-0.300
Token the
Token ID262
Feature activation+8.49e-03

Pos logit contributions
ELY+0.001
yz+0.000
llah+0.000
ami+0.000
icans+0.000
Neg logit contributions
qv-0.001
nexus-0.001
 Defender-0.000
owered-0.000
finder-0.000
Token model
Token ID2746
Feature activation+2.95e-02

Pos logit contributions
ELY+0.005
yz+0.005
llah+0.005
ami+0.005
icans+0.005
Neg logit contributions
qv-0.006
nexus-0.006
 Defender-0.005
owered-0.005
finder-0.005
Token.
Token ID13
Feature activation+1.46e-02

Pos logit contributions
ELY+0.019
yz+0.018
llah+0.018
ami+0.017
icans+0.017
Neg logit contributions
qv-0.020
nexus-0.019
 Defender-0.017
owered-0.017
finder-0.017
Token Its
Token ID6363
Feature activation+7.02e-02

Pos logit contributions
ELY+0.009
yz+0.009
llah+0.009
ami+0.008
icans+0.008
Neg logit contributions
qv-0.010
nexus-0.010
 Defender-0.008
owered-0.008
finder-0.008
Token common
Token ID2219
Feature activation-7.49e-03

Pos logit contributions
qv+0.009
nexus+0.009
 Defender+0.008
owered+0.008
finder+0.008
Neg logit contributions
ELY-0.009
yz-0.008
llah-0.008
ami-0.008
icans-0.008
Token technique
Token ID8173
Feature activation-1.59e-01

Pos logit contributions
qv+0.005
nexus+0.005
 Defender+0.004
owered+0.004
finder+0.004
Neg logit contributions
ELY-0.005
yz-0.005
llah-0.005
ami-0.005
icans-0.005
Token to
Token ID284
Feature activation+1.90e-03

Pos logit contributions
qv+0.104
nexus+0.102
 Defender+0.089
owered+0.087
finder+0.086
Neg logit contributions
ELY-0.103
yz-0.101
llah-0.097
ami-0.094
icans-0.093
Token manipulate
Token ID18510
Feature activation-4.37e-02

Pos logit contributions
ELY+0.001
yz+0.001
llah+0.001
ami+0.001
icans+0.001
Neg logit contributions
qv-0.001
nexus-0.001
 Defender-0.001
owered-0.001
finder-0.001
Token vert
Token ID9421
Feature activation+2.12e-01

Pos logit contributions
qv+0.030
nexus+0.030
 Defender+0.026
owered+0.026
finder+0.026
Neg logit contributions
ELY-0.027
yz-0.026
llah-0.025
ami-0.024
icans-0.024
Tokenices
Token ID1063
Feature activation+5.36e-01

Pos logit contributions
ELY+0.037
yz+0.032
llah+0.029
ami+0.024
icans+0.022
Neg logit contributions
qv-0.240
nexus-0.237
 Defender-0.220
owered-0.218
finder-0.217
Token is
Token ID318
Feature activation-2.21e-02

Pos logit contributions
ELY+0.325
yz+0.318
llah+0.305
ami+0.298
icans+0.291
Neg logit contributions
qv-0.370
nexus-0.366
 Defender-0.320
owered-0.316
finder-0.312
Token called
Token ID1444
Feature activation-1.38e-01

Pos logit contributions
qv+0.014
nexus+0.014
 Defender+0.012
owered+0.012
finder+0.012
Neg logit contributions
ELY-0.015
yz-0.014
llah-0.014
ami-0.014
icans-0.013
Token normal
Token ID3487
Feature activation-1.54e-02

Pos logit contributions
qv+0.091
nexus+0.090
 Defender+0.078
owered+0.077
finder+0.076
Neg logit contributions
ELY-0.088
yz-0.086
llah-0.083
ami-0.080
icans-0.079
Token extr
Token ID22820
Feature activation+9.57e-03

Pos logit contributions
qv+0.011
nexus+0.010
 Defender+0.009
owered+0.009
finder+0.009
Neg logit contributions
ELY-0.009
yz-0.009
llah-0.009
ami-0.009
icans-0.008
Tokenusion
Token ID4241
Feature activation+9.17e-02

Pos logit contributions
ELY+0.004
yz+0.004
llah+0.004
ami+0.004
icans+0.004
Neg logit contributions
qv-0.008
nexus-0.008
 Defender-0.007
owered-0.007
finder-0.007
Token by
Token ID416
Feature activation-6.98e-03

Pos logit contributions
ELY+0.050
yz+0.049
llah+0.047
ami+0.046
icans+0.045
Neg logit contributions
qv-0.054
nexus-0.053
 Defender-0.046
owered-0.046
finder-0.045
Token doing
Token ID1804
Feature activation-5.38e-02

Pos logit contributions
qv+0.005
nexus+0.005
 Defender+0.004
owered+0.004
finder+0.004
Neg logit contributions
ELY-0.004
yz-0.004
llah-0.004
ami-0.004
icans-0.004
Token ABS
Token ID29950
Feature activation+1.73e-01

Pos logit contributions
qv+0.036
nexus+0.035
 Defender+0.031
owered+0.030
finder+0.030
Neg logit contributions
ELY-0.034
yz-0.033
llah-0.032
ami-0.031
icans-0.030
TokenOL
Token ID3535
Feature activation-8.52e-02

Pos logit contributions
ELY+0.123
yz+0.119
llah+0.115
ami+0.112
icans+0.110
Neg logit contributions
qv-0.104
nexus-0.103
 Defender-0.088
owered-0.086
finder-0.086
TokenUT
Token ID3843
Feature activation+3.47e-02

Pos logit contributions
qv+0.057
nexus+0.056
 Defender+0.049
owered+0.048
finder+0.048
Neg logit contributions
ELY-0.053
yz-0.053
llah-0.051
ami-0.049
icans-0.049
TokenELY
Token ID30943
Feature activation+5.32e-01

Pos logit contributions
yz+0.001
llah+0.000
ELY+0.000
ami-0.001
icans-0.001
Neg logit contributions
qv-0.044
nexus-0.043
 Defender-0.041
owered-0.040
finder-0.040
Token NOT
Token ID5626
Feature activation+2.36e-02

Pos logit contributions
ELY+0.318
yz+0.309
llah+0.298
ami+0.288
icans+0.283
Neg logit contributions
qv-0.374
nexus-0.368
 Defender-0.324
owered-0.319
finder-0.317
TokenH
Token ID39
Feature activation-1.26e-01

Pos logit contributions
ELY+0.015
yz+0.014
llah+0.014
ami+0.013
icans+0.013
Neg logit contributions
qv-0.016
nexus-0.016
 Defender-0.014
owered-0.014
finder-0.014
TokenING
Token ID2751
Feature activation+4.50e-02

Pos logit contributions
qv+0.100
nexus+0.098
 Defender+0.089
owered+0.087
finder+0.087
Neg logit contributions
yz-0.062
ELY-0.062
llah-0.060
ami-0.056
icans-0.055
Token?
Token ID30
Feature activation+9.96e-03

Pos logit contributions
ELY+0.027
yz+0.027
llah+0.026
ami+0.025
icans+0.025
Neg logit contributions
qv-0.031
nexus-0.030
 Defender-0.027
owered-0.026
finder-0.026
Token The
Token ID383
Feature activation-2.14e-02

Pos logit contributions
ELY+0.006
yz+0.006
llah+0.006
ami+0.006
icans+0.006
Neg logit contributions
qv-0.007
nexus-0.006
 Defender-0.006
owered-0.006
finder-0.006
Tokenth
Token ID400
Feature activation-9.63e-02

Pos logit contributions
qv+0.004
nexus+0.004
 Defender+0.003
owered+0.003
finder+0.003
Neg logit contributions
ELY-0.004
yz-0.004
llah-0.004
ami-0.004
icans-0.004
Token Tata
Token ID49963
Feature activation-2.43e-01

Pos logit contributions
qv+0.065
nexus+0.064
 Defender+0.056
owered+0.055
finder+0.055
Neg logit contributions
ELY-0.060
yz-0.058
llah-0.056
ami-0.055
icans-0.054
Token Steel
Token ID7851
Feature activation-2.82e-01

Pos logit contributions
qv+0.157
nexus+0.155
 Defender+0.134
owered+0.133
finder+0.132
Neg logit contributions
ELY-0.158
yz-0.155
llah-0.149
ami-0.145
icans-0.142
Token Chess
Token ID25774
Feature activation+2.58e-01

Pos logit contributions
qv+0.194
nexus+0.191
 Defender+0.167
owered+0.166
finder+0.164
Neg logit contributions
ELY-0.173
yz-0.169
llah-0.163
ami-0.158
icans-0.155
Token Chall
Token ID10772
Feature activation+1.51e-02

Pos logit contributions
ELY+0.161
yz+0.158
llah+0.152
ami+0.148
icans+0.145
Neg logit contributions
qv-0.173
nexus-0.171
 Defender-0.149
owered-0.148
finder-0.146
Tokenengers
Token ID9302
Feature activation+5.31e-01

Pos logit contributions
ELY+0.002
yz+0.002
llah+0.002
ami+0.001
icans+0.001
Neg logit contributions
qv-0.017
nexus-0.017
 Defender-0.016
owered-0.016
finder-0.016
Token"]
Token ID8973
Feature activation+1.97e-02

Pos logit contributions
ELY+0.346
yz+0.338
llah+0.326
ami+0.320
icans+0.313
Neg logit contributions
qv-0.343
nexus-0.339
 Defender-0.290
owered-0.289
finder-0.286
Token [
Token ID685
Feature activation+1.28e-01

Pos logit contributions
ELY+0.012
yz+0.012
llah+0.011
ami+0.011
icans+0.011
Neg logit contributions
qv-0.014
nexus-0.013
owered-0.012
 Defender-0.012
finder-0.011
TokenSite
Token ID29123
Feature activation+1.90e-02

Pos logit contributions
ELY+0.082
yz+0.079
ami+0.075
llah+0.075
icans+0.072
Neg logit contributions
qv-0.086
nexus-0.085
owered-0.073
 Defender-0.072
finder-0.071
Token "
Token ID366
Feature activation+1.41e-01

Pos logit contributions
ELY+0.010
yz+0.010
ami+0.010
llah+0.009
icans+0.009
Neg logit contributions
qv-0.014
nexus-0.014
 Defender-0.012
owered-0.012
finder-0.012
TokenW
Token ID54
Feature activation+3.34e-02

Pos logit contributions
ELY+0.090
yz+0.088
llah+0.084
ami+0.082
icans+0.080
Neg logit contributions
qv-0.094
nexus-0.093
 Defender-0.081
owered-0.080
finder-0.079
Token Man
Token ID1869
Feature activation-3.49e-02

Pos logit contributions
qv+0.027
nexus+0.027
 Defender+0.023
owered+0.023
finder+0.022
Neg logit contributions
ELY-0.032
yz-0.031
llah-0.031
ami-0.030
icans-0.029
Tokenagers
Token ID10321
Feature activation+1.87e-01

Pos logit contributions
qv+0.027
nexus+0.026
 Defender+0.023
owered+0.023
finder+0.023
Neg logit contributions
ELY-0.019
yz-0.018
llah-0.018
ami-0.017
icans-0.017
Token
Token ID447
Feature activation+3.23e-02

Pos logit contributions
ELY+0.132
yz+0.129
llah+0.125
ami+0.122
icans+0.120
Neg logit contributions
qv-0.111
nexus-0.109
 Defender-0.093
owered-0.092
finder-0.091
Token
Token ID247
Feature activation+4.22e-02

Pos logit contributions
ELY+0.021
yz+0.020
llah+0.019
ami+0.019
icans+0.019
Neg logit contributions
qv-0.021
nexus-0.021
 Defender-0.018
owered-0.018
finder-0.018
Token Ind
Token ID1423
Feature activation+6.38e-02

Pos logit contributions
ELY+0.028
yz+0.027
llah+0.026
ami+0.025
icans+0.025
Neg logit contributions
qv-0.027
nexus-0.027
 Defender-0.023
owered-0.023
finder-0.022
Tokenices
Token ID1063
Feature activation+5.09e-01

Pos logit contributions
ELY+0.029
yz+0.028
llah+0.027
ami+0.026
icans+0.026
Neg logit contributions
qv-0.053
nexus-0.053
 Defender-0.048
owered-0.047
finder-0.047
Token.
Token ID13
Feature activation+2.79e-02

Pos logit contributions
ELY+0.318
yz+0.310
llah+0.299
ami+0.292
icans+0.287
Neg logit contributions
qv-0.341
nexus-0.336
 Defender-0.294
owered-0.289
finder-0.285
Token
Token ID198
Feature activation+4.90e-02

Pos logit contributions
ELY+0.019
yz+0.018
llah+0.018
ami+0.017
icans+0.017
Neg logit contributions
qv-0.018
nexus-0.017
 Defender-0.015
owered-0.015
finder-0.015
Token
Token ID198
Feature activation+4.50e-02

Pos logit contributions
ELY+0.030
yz+0.029
llah+0.028
ami+0.027
icans+0.027
Neg logit contributions
qv-0.034
nexus-0.033
 Defender-0.029
owered-0.029
finder-0.028
Token
Token ID447
Feature activation+1.90e-02

Pos logit contributions
ELY+0.029
yz+0.028
llah+0.027
ami+0.026
icans+0.026
Neg logit contributions
qv-0.030
nexus-0.030
 Defender-0.026
owered-0.025
finder-0.025
Token
Token ID250
Feature activation+2.18e-02

Pos logit contributions
ELY+0.011
yz+0.011
llah+0.011
ami+0.010
icans+0.010
Neg logit contributions
qv-0.013
nexus-0.013
 Defender-0.012
owered-0.011
finder-0.011
Token current
Token ID1459
Feature activation-6.19e-02

Pos logit contributions
ELY+0.031
yz+0.030
llah+0.029
ami+0.029
icans+0.028
Neg logit contributions
qv-0.032
nexus-0.032
 Defender-0.027
owered-0.027
finder-0.027
Token highly
Token ID4047
Feature activation-6.36e-03

Pos logit contributions
qv+0.040
nexus+0.040
 Defender+0.034
owered+0.034
finder+0.034
Neg logit contributions
ELY-0.040
yz-0.039
llah-0.038
ami-0.037
icans-0.036
Token partisan
Token ID19666
Feature activation+6.91e-02

Pos logit contributions
qv+0.004
nexus+0.004
 Defender+0.004
owered+0.004
finder+0.004
Neg logit contributions
ELY-0.004
yz-0.004
llah-0.004
ami-0.004
icans-0.003
Token atmosphere
Token ID8137
Feature activation-8.25e-02

Pos logit contributions
ELY+0.043
yz+0.042
llah+0.040
ami+0.039
icans+0.038
Neg logit contributions
qv-0.047
nexus-0.046
 Defender-0.041
owered-0.040
finder-0.040
Token subs
Token ID6352
Feature activation+1.02e-01

Pos logit contributions
qv+0.058
nexus+0.057
 Defender+0.050
owered+0.050
finder+0.049
Neg logit contributions
ELY-0.049
yz-0.048
llah-0.046
ami-0.045
icans-0.044
Tokenides
Token ID1460
Feature activation+4.77e-01

Pos logit contributions
ELY+0.010
yz+0.009
llah+0.007
ami+0.005
icans+0.004
Neg logit contributions
qv-0.122
nexus-0.121
 Defender-0.113
owered-0.112
finder-0.111
Token.
Token ID13
Feature activation-4.95e-03

Pos logit contributions
ELY+0.300
yz+0.292
llah+0.282
ami+0.274
icans+0.269
Neg logit contributions
qv-0.321
nexus-0.316
 Defender-0.276
owered-0.273
finder-0.270
Token
Token ID564
Feature activation-1.02e-03

Pos logit contributions
qv+0.003
nexus+0.003
 Defender+0.003
owered+0.003
finder+0.003
Neg logit contributions
ELY-0.003
yz-0.003
llah-0.003
ami-0.003
icans-0.003
Token
Token ID250
Feature activation+2.15e-02

Pos logit contributions
qv+0.001
nexus+0.001
 Defender+0.001
owered+0.001
finder+0.001
Neg logit contributions
ELY-0.001
yz-0.001
llah-0.001
ami-0.001
icans-0.001
TokenWe
Token ID1135
Feature activation+5.14e-02

Pos logit contributions
ELY+0.013
yz+0.012
llah+0.012
ami+0.012
icans+0.011
Neg logit contributions
qv-0.015
nexus-0.015
 Defender-0.013
owered-0.013
finder-0.013
Token have
Token ID423
Feature activation+2.44e-02

Pos logit contributions
ELY+0.031
yz+0.030
llah+0.029
ami+0.028
icans+0.028
Neg logit contributions
qv-0.036
nexus-0.035
 Defender-0.031
owered-0.031
finder-0.030
TokenW
Token ID54
Feature activation-1.80e-03

Pos logit contributions
ELY+0.085
yz+0.082
llah+0.079
ami+0.077
icans+0.076
Neg logit contributions
qv-0.085
nexus-0.083
 Defender-0.073
owered-0.072
finder-0.071
Tokenasting
Token ID9222
Feature activation-3.59e-01

Pos logit contributions
qv+0.001
nexus+0.001
 Defender+0.001
owered+0.001
finder+0.001
Neg logit contributions
ELY-0.001
yz-0.001
llah-0.001
ami-0.001
icans-0.001
Token Food
Token ID7318
Feature activation+6.28e-02

Pos logit contributions
qv+0.241
nexus+0.237
 Defender+0.207
owered+0.204
finder+0.202
Neg logit contributions
ELY-0.226
yz-0.220
llah-0.213
ami-0.206
icans-0.203
Token-
Token ID12
Feature activation+1.19e-01

Pos logit contributions
ELY+0.042
yz+0.041
llah+0.039
ami+0.038
icans+0.038
Neg logit contributions
qv-0.040
nexus-0.039
 Defender-0.034
owered-0.034
finder-0.033
TokenNight
Token ID24732
Feature activation-7.10e-02

Pos logit contributions
ELY+0.079
yz+0.077
llah+0.074
ami+0.072
icans+0.071
Neg logit contributions
qv-0.076
nexus-0.075
 Defender-0.065
owered-0.064
finder-0.064
Tokenmares
Token ID23745
Feature activation+4.76e-01

Pos logit contributions
qv+0.060
nexus+0.059
 Defender+0.053
owered+0.052
finder+0.052
Neg logit contributions
ELY-0.033
yz-0.032
llah-0.030
ami-0.029
icans-0.028
Token-
Token ID12
Feature activation+1.48e-01

Pos logit contributions
ELY+0.320
yz+0.313
llah+0.303
ami+0.295
icans+0.290
Neg logit contributions
qv-0.298
nexus-0.293
 Defender-0.253
owered-0.250
finder-0.247
TokenE
Token ID36
Feature activation+2.93e-02

Pos logit contributions
ELY+0.098
yz+0.095
llah+0.092
ami+0.089
icans+0.088
Neg logit contributions
qv-0.095
nexus-0.094
 Defender-0.081
owered-0.080
finder-0.079
Tokenmber
Token ID1916
Feature activation+1.68e-02

Pos logit contributions
ELY+0.020
yz+0.019
llah+0.019
ami+0.018
icans+0.018
Neg logit contributions
qv-0.018
nexus-0.018
 Defender-0.016
owered-0.015
finder-0.015
Tokenassing
Token ID19696
Feature activation-3.78e-03

Pos logit contributions
ELY+0.011
yz+0.010
llah+0.010
ami+0.010
icans+0.010
Neg logit contributions
qv-0.011
nexus-0.011
 Defender-0.010
owered-0.009
finder-0.009
Token Shit
Token ID41356
Feature activation+1.13e-01

Pos logit contributions
qv+0.002
nexus+0.002
 Defender+0.002
owered+0.002
finder+0.002
Neg logit contributions
ELY-0.002
yz-0.002
llah-0.002
ami-0.002
icans-0.002
Token fake
Token ID8390
Feature activation+5.88e-02

Pos logit contributions
qv+0.007
nexus+0.007
 Defender+0.006
owered+0.006
finder+0.006
Neg logit contributions
ELY-0.005
yz-0.005
llah-0.005
ami-0.005
icans-0.005
Token Republican
Token ID3415
Feature activation-9.40e-02

Pos logit contributions
ELY+0.040
yz+0.039
llah+0.038
ami+0.037
icans+0.036
Neg logit contributions
qv-0.037
nexus-0.036
 Defender-0.031
owered-0.031
finder-0.030
Token and
Token ID290
Feature activation+2.89e-02

Pos logit contributions
qv+0.065
nexus+0.064
 Defender+0.056
owered+0.056
finder+0.055
Neg logit contributions
ELY-0.057
yz-0.056
llah-0.054
ami-0.052
icans-0.051
Token Democratic
Token ID4390
Feature activation-8.35e-02

Pos logit contributions
ELY+0.019
yz+0.018
llah+0.018
ami+0.017
icans+0.017
Neg logit contributions
qv-0.019
nexus-0.018
 Defender-0.016
owered-0.016
finder-0.016
Token polit
Token ID1408
Feature activation-2.15e-01

Pos logit contributions
qv+0.059
nexus+0.058
 Defender+0.051
owered+0.050
finder+0.050
Neg logit contributions
ELY-0.050
yz-0.049
llah-0.047
ami-0.045
icans-0.045
Tokenicans
Token ID22398
Feature activation+4.74e-01

Pos logit contributions
qv+0.194
nexus+0.192
 Defender+0.174
owered+0.173
finder+0.171
Neg logit contributions
ELY-0.085
yz-0.082
llah-0.077
ami-0.073
icans-0.071
Token working
Token ID1762
Feature activation+1.75e-01

Pos logit contributions
ELY+0.297
yz+0.291
llah+0.280
ami+0.273
icans+0.268
Neg logit contributions
qv-0.318
nexus-0.314
 Defender-0.273
owered-0.270
finder-0.267
Token for
Token ID329
Feature activation+4.86e-02

Pos logit contributions
ELY+0.111
yz+0.108
llah+0.104
ami+0.101
icans+0.100
Neg logit contributions
qv-0.117
nexus-0.115
 Defender-0.100
owered-0.099
finder-0.097
Token banks
Token ID6341
Feature activation-4.30e-02

Pos logit contributions
ELY+0.031
yz+0.030
llah+0.029
ami+0.028
icans+0.028
Neg logit contributions
qv-0.032
nexus-0.032
 Defender-0.028
owered-0.027
finder-0.027
Token and
Token ID290
Feature activation+3.41e-02

Pos logit contributions
qv+0.028
nexus+0.028
 Defender+0.024
owered+0.024
finder+0.023
Neg logit contributions
ELY-0.028
yz-0.027
llah-0.026
ami-0.026
icans-0.025
Token oil
Token ID3056
Feature activation+1.15e-02

Pos logit contributions
ELY+0.021
yz+0.020
llah+0.019
ami+0.019
icans+0.019
Neg logit contributions
qv-0.024
nexus-0.023
 Defender-0.020
owered-0.020
finder-0.020
Token now
Token ID783
Feature activation-1.87e-02

Pos logit contributions
qv+0.086
nexus+0.085
 Defender+0.074
owered+0.073
finder+0.073
Neg logit contributions
ELY-0.077
yz-0.075
llah-0.072
ami-0.070
icans-0.069
Token know
Token ID760
Feature activation-1.15e-01

Pos logit contributions
qv+0.012
nexus+0.012
 Defender+0.011
owered+0.010
finder+0.010
Neg logit contributions
ELY-0.012
yz-0.012
llah-0.011
ami-0.011
icans-0.011
Token existed
Token ID11196
Feature activation-2.70e-02

Pos logit contributions
qv+0.076
nexus+0.075
 Defender+0.066
owered+0.065
finder+0.064
Neg logit contributions
ELY-0.074
yz-0.072
llah-0.069
ami-0.067
icans-0.066
Token on
Token ID319
Feature activation+2.95e-02

Pos logit contributions
qv+0.018
nexus+0.018
 Defender+0.015
owered+0.015
finder+0.015
Neg logit contributions
ELY-0.017
yz-0.017
llah-0.016
ami-0.016
icans-0.016
Token Hispan
Token ID13600
Feature activation+2.08e-02

Pos logit contributions
ELY+0.019
yz+0.019
llah+0.018
ami+0.017
icans+0.017
Neg logit contributions
qv-0.019
nexus-0.019
 Defender-0.016
owered-0.016
finder-0.016
Tokeniola
Token ID30292
Feature activation+4.73e-01

Pos logit contributions
ELY+0.005
yz+0.005
llah+0.004
ami+0.004
icans+0.004
Neg logit contributions
qv-0.022
nexus-0.022
 Defender-0.020
owered-0.020
finder-0.020
Token relatively
Token ID5365
Feature activation-1.85e-02

Pos logit contributions
ELY+0.300
yz+0.294
llah+0.284
ami+0.276
icans+0.272
Neg logit contributions
qv-0.314
nexus-0.308
 Defender-0.268
owered-0.266
finder-0.263
Token unchanged
Token ID21588
Feature activation-2.02e-02

Pos logit contributions
qv+0.013
nexus+0.013
 Defender+0.011
owered+0.011
finder+0.011
Neg logit contributions
ELY-0.011
yz-0.011
llah-0.010
ami-0.010
icans-0.010
Token for
Token ID329
Feature activation+5.13e-02

Pos logit contributions
qv+0.013
nexus+0.013
 Defender+0.011
owered+0.011
finder+0.011
Neg logit contributions
ELY-0.013
yz-0.013
llah-0.012
ami-0.012
icans-0.012
Token over
Token ID625
Feature activation+1.12e-01

Pos logit contributions
ELY+0.033
yz+0.033
llah+0.031
ami+0.031
icans+0.030
Neg logit contributions
qv-0.033
nexus-0.033
 Defender-0.028
owered-0.028
finder-0.028
Token a
Token ID257
Feature activation+5.83e-02

Pos logit contributions
ELY+0.077
yz+0.076
llah+0.073
ami+0.071
icans+0.070
Neg logit contributions
qv-0.068
nexus-0.067
 Defender-0.057
owered-0.057
finder-0.056
Token from
Token ID422
Feature activation-5.49e-02

Pos logit contributions
qv+0.019
nexus+0.019
 Defender+0.016
owered+0.016
finder+0.016
Neg logit contributions
ELY-0.020
yz-0.019
llah-0.019
ami-0.018
icans-0.018
Token experience
Token ID1998
Feature activation-5.06e-02

Pos logit contributions
qv+0.036
nexus+0.035
 Defender+0.031
owered+0.030
finder+0.030
Neg logit contributions
ELY-0.035
yz-0.035
llah-0.034
ami-0.032
icans-0.032
Token about
Token ID546
Feature activation-5.80e-02

Pos logit contributions
qv+0.033
nexus+0.032
 Defender+0.028
owered+0.027
finder+0.027
Neg logit contributions
ELY-0.033
yz-0.032
llah-0.031
ami-0.030
icans-0.030
Token the
Token ID262
Feature activation+9.74e-03

Pos logit contributions
qv+0.038
nexus+0.037
 Defender+0.032
owered+0.032
finder+0.032
Neg logit contributions
ELY-0.037
yz-0.037
llah-0.035
ami-0.034
icans-0.034
Token injust
Token ID19019
Feature activation+3.02e-02

Pos logit contributions
ELY+0.006
yz+0.006
llah+0.006
ami+0.006
icans+0.006
Neg logit contributions
qv-0.006
nexus-0.006
 Defender-0.005
owered-0.005
finder-0.005
Tokenices
Token ID1063
Feature activation+4.68e-01

Pos logit contributions
ELY+0.005
yz+0.004
llah+0.003
ami+0.003
icans+0.003
Neg logit contributions
qv-0.035
nexus-0.034
 Defender-0.032
owered-0.032
finder-0.031
Token they
Token ID484
Feature activation-5.49e-02

Pos logit contributions
ELY+0.290
yz+0.284
llah+0.274
ami+0.266
icans+0.261
Neg logit contributions
qv-0.318
nexus-0.313
 Defender-0.274
owered-0.270
finder-0.268
Token are
Token ID389
Feature activation-8.78e-03

Pos logit contributions
qv+0.034
nexus+0.034
 Defender+0.029
owered+0.029
finder+0.028
Neg logit contributions
ELY-0.037
yz-0.036
llah-0.035
ami-0.034
icans-0.034
Token forced
Token ID4137
Feature activation+8.40e-02

Pos logit contributions
qv+0.006
nexus+0.006
 Defender+0.005
owered+0.005
finder+0.005
Neg logit contributions
ELY-0.006
yz-0.005
llah-0.005
ami-0.005
icans-0.005
Token to
Token ID284
Feature activation+3.00e-02

Pos logit contributions
ELY+0.058
yz+0.057
llah+0.055
ami+0.053
icans+0.053
Neg logit contributions
qv-0.051
nexus-0.050
 Defender-0.043
owered-0.043
finder-0.042
Token contend
Token ID24351
Feature activation-5.25e-02

Pos logit contributions
ELY+0.020
yz+0.020
llah+0.019
ami+0.019
icans+0.018
Neg logit contributions
qv-0.019
nexus-0.018
 Defender-0.016
owered-0.016
finder-0.015
TokenI
Token ID40
Feature activation-1.10e-04

Pos logit contributions
ELY+0.022
yz+0.022
llah+0.021
ami+0.020
icans+0.020
Neg logit contributions
qv-0.025
nexus-0.025
 Defender-0.022
owered-0.022
finder-0.021
Token consulted
Token ID26889
Feature activation+2.93e-03

Pos logit contributions
qv+0.000
nexus+0.000
 Defender+0.000
owered+0.000
finder+0.000
Neg logit contributions
ELY-0.000
yz-0.000
llah-0.000
ami-0.000
icans-0.000
Token Greg
Token ID8547
Feature activation+5.99e-02

Pos logit contributions
ELY+0.002
yz+0.002
llah+0.002
ami+0.002
icans+0.002
Neg logit contributions
qv-0.002
nexus-0.002
 Defender-0.002
owered-0.002
finder-0.002
Token Nic
Token ID8377
Feature activation-5.46e-02

Pos logit contributions
ELY+0.040
yz+0.039
llah+0.038
ami+0.037
icans+0.036
Neg logit contributions
qv-0.038
nexus-0.037
 Defender-0.032
owered-0.032
finder-0.031
Tokenot
Token ID313
Feature activation-9.47e-02

Pos logit contributions
qv+0.026
nexus+0.026
 Defender+0.021
owered+0.021
finder+0.020
Neg logit contributions
ELY-0.044
yz-0.044
llah-0.043
ami-0.042
icans-0.041
Tokenero
Token ID3529
Feature activation+4.67e-01

Pos logit contributions
qv+0.105
nexus+0.105
 Defender+0.097
owered+0.096
finder+0.095
Neg logit contributions
ELY-0.018
yz-0.016
llah-0.014
ami-0.013
icans-0.012
Token on
Token ID319
Feature activation+5.26e-03

Pos logit contributions
ELY+0.295
yz+0.287
llah+0.278
ami+0.272
icans+0.265
Neg logit contributions
qv-0.310
nexus-0.307
 Defender-0.268
owered-0.266
finder-0.261
Token these
Token ID777
Feature activation+7.62e-02

Pos logit contributions
ELY+0.003
yz+0.003
llah+0.003
ami+0.003
icans+0.003
Neg logit contributions
qv-0.004
nexus-0.003
 Defender-0.003
owered-0.003
finder-0.003
Token bodies
Token ID5920
Feature activation-2.67e-01

Pos logit contributions
ELY+0.048
yz+0.046
llah+0.045
ami+0.044
icans+0.043
Neg logit contributions
qv-0.051
nexus-0.050
 Defender-0.044
owered-0.044
finder-0.043
Token specifically
Token ID5734
Feature activation+1.10e-01

Pos logit contributions
qv+0.178
nexus+0.176
 Defender+0.154
owered+0.152
finder+0.149
Neg logit contributions
ELY-0.167
yz-0.163
llah-0.159
ami-0.155
icans-0.151
Token and
Token ID290
Feature activation+2.51e-03

Pos logit contributions
ELY+0.070
yz+0.069
llah+0.067
ami+0.065
icans+0.064
Neg logit contributions
qv-0.072
nexus-0.071
 Defender-0.062
owered-0.061
finder-0.060
Token of
Token ID286
Feature activation-1.96e-03

Pos logit contributions
ELY+0.074
yz+0.072
llah+0.070
ami+0.067
icans+0.066
Neg logit contributions
qv-0.076
nexus-0.075
 Defender-0.066
owered-0.065
finder-0.064
Token two
Token ID734
Feature activation-7.94e-02

Pos logit contributions
qv+0.001
nexus+0.001
 Defender+0.001
owered+0.001
finder+0.001
Neg logit contributions
ELY-0.001
yz-0.001
llah-0.001
ami-0.001
icans-0.001
Token sets
Token ID5621
Feature activation-1.25e-01

Pos logit contributions
qv+0.055
nexus+0.054
 Defender+0.047
owered+0.047
finder+0.046
Neg logit contributions
ELY-0.049
yz-0.047
llah-0.046
ami-0.044
icans-0.044
Token of
Token ID286
Feature activation+1.43e-02

Pos logit contributions
qv+0.083
nexus+0.081
 Defender+0.071
owered+0.070
finder+0.070
Neg logit contributions
ELY-0.080
yz-0.078
llah-0.076
ami-0.073
icans-0.072
Token invent
Token ID8067
Feature activation-3.00e-02

Pos logit contributions
ELY+0.009
yz+0.009
llah+0.008
ami+0.008
icans+0.008
Neg logit contributions
qv-0.010
nexus-0.010
 Defender-0.008
owered-0.008
finder-0.008
Tokenories
Token ID1749
Feature activation+4.66e-01

Pos logit contributions
qv+0.033
nexus+0.033
 Defender+0.030
owered+0.030
finder+0.030
Neg logit contributions
ELY-0.006
yz-0.005
llah-0.005
ami-0.004
icans-0.004
Token,
Token ID11
Feature activation+6.33e-03

Pos logit contributions
ELY+0.301
yz+0.294
llah+0.284
ami+0.276
icans+0.271
Neg logit contributions
qv-0.304
nexus-0.300
 Defender-0.261
owered-0.258
finder-0.254
Token two
Token ID734
Feature activation-5.29e-02

Pos logit contributions
ELY+0.004
yz+0.004
llah+0.004
ami+0.004
icans+0.004
Neg logit contributions
qv-0.004
nexus-0.004
 Defender-0.004
owered-0.004
finder-0.004
Token product
Token ID1720
Feature activation-2.32e-02

Pos logit contributions
qv+0.034
nexus+0.034
 Defender+0.029
owered+0.029
finder+0.028
Neg logit contributions
ELY-0.035
yz-0.034
llah-0.033
ami-0.032
icans-0.031
Token catalog
Token ID18388
Feature activation-2.34e-02

Pos logit contributions
qv+0.015
nexus+0.015
 Defender+0.013
owered+0.013
finder+0.013
Neg logit contributions
ELY-0.015
yz-0.015
llah-0.014
ami-0.014
icans-0.014
Tokens
Token ID82
Feature activation+2.61e-02

Pos logit contributions
qv+0.016
nexus+0.015
 Defender+0.014
owered+0.013
finder+0.013
Neg logit contributions
ELY-0.015
yz-0.014
llah-0.014
ami-0.013
icans-0.013
Token the
Token ID262
Feature activation-7.87e-03

Pos logit contributions
qv+0.009
nexus+0.009
 Defender+0.008
owered+0.008
finder+0.008
Neg logit contributions
ELY-0.009
yz-0.009
llah-0.009
ami-0.008
icans-0.008
Token coalition
Token ID9906
Feature activation+5.90e-02

Pos logit contributions
qv+0.005
nexus+0.005
 Defender+0.005
owered+0.005
finder+0.004
Neg logit contributions
ELY-0.005
yz-0.005
llah-0.005
ami-0.004
icans-0.004
Token
Token ID564
Feature activation-4.36e-02

Pos logit contributions
ELY+0.036
yz+0.035
llah+0.034
ami+0.032
icans+0.032
Neg logit contributions
qv-0.041
nexus-0.040
 Defender-0.036
owered-0.035
finder-0.035
Token
Token ID250
Feature activation-2.21e-02

Pos logit contributions
qv+0.030
nexus+0.030
 Defender+0.026
owered+0.026
finder+0.026
Neg logit contributions
ELY-0.026
yz-0.026
llah-0.025
ami-0.024
icans-0.024
Tokenprov
Token ID15234
Feature activation+5.58e-02

Pos logit contributions
qv+0.014
nexus+0.014
 Defender+0.012
owered+0.012
finder+0.011
Neg logit contributions
ELY-0.015
yz-0.014
llah-0.014
ami-0.014
icans-0.013
Tokenides
Token ID1460
Feature activation+4.60e-01

Pos logit contributions
ELY+0.007
yz+0.006
llah+0.005
ami+0.004
icans+0.004
Neg logit contributions
qv-0.065
nexus-0.065
 Defender-0.060
owered-0.060
finder-0.059
Token responsible
Token ID4497
Feature activation-1.73e-02

Pos logit contributions
ELY+0.294
yz+0.286
llah+0.277
ami+0.268
icans+0.263
Neg logit contributions
qv-0.305
nexus-0.300
 Defender-0.262
owered-0.259
finder-0.256
Token economic
Token ID3034
Feature activation+3.47e-02

Pos logit contributions
qv+0.012
nexus+0.011
 Defender+0.010
owered+0.010
finder+0.010
Neg logit contributions
ELY-0.011
yz-0.011
llah-0.010
ami-0.010
icans-0.010
Token leadership
Token ID5531
Feature activation-1.79e-01

Pos logit contributions
ELY+0.021
yz+0.021
llah+0.020
ami+0.020
icans+0.019
Neg logit contributions
qv-0.024
nexus-0.023
 Defender-0.020
owered-0.020
finder-0.020
Token in
Token ID287
Feature activation-8.13e-03

Pos logit contributions
qv+0.119
nexus+0.117
 Defender+0.102
owered+0.101
finder+0.100
Neg logit contributions
ELY-0.113
yz-0.111
llah-0.107
ami-0.104
icans-0.102
Token tough
Token ID5802
Feature activation+1.24e-02

Pos logit contributions
qv+0.005
nexus+0.005
 Defender+0.005
owered+0.005
finder+0.005
Neg logit contributions
ELY-0.005
yz-0.005
llah-0.005
ami-0.005
icans-0.005
Token
Token ID198
Feature activation+8.88e-03

Pos logit contributions
ELY+0.007
yz+0.007
llah+0.007
ami+0.007
icans+0.006
Neg logit contributions
qv-0.008
nexus-0.008
 Defender-0.007
owered-0.007
finder-0.007
TokenJ
Token ID41
Feature activation+6.22e-02

Pos logit contributions
ELY+0.006
yz+0.005
llah+0.005
ami+0.005
icans+0.005
Neg logit contributions
qv-0.006
nexus-0.006
 Defender-0.005
owered-0.005
finder-0.005
Tokenoshi
Token ID13704
Feature activation+2.27e-01

Pos logit contributions
ELY+0.020
yz+0.018
llah+0.017
ami+0.016
icans+0.015
Neg logit contributions
qv-0.062
nexus-0.061
 Defender-0.055
owered-0.055
finder-0.055
Token,
Token ID11
Feature activation-1.51e-02

Pos logit contributions
ELY+0.137
yz+0.134
llah+0.129
ami+0.125
icans+0.123
Neg logit contributions
qv-0.157
nexus-0.155
 Defender-0.136
owered-0.134
finder-0.133
Token Ok
Token ID6762
Feature activation-7.80e-02

Pos logit contributions
qv+0.010
nexus+0.010
 Defender+0.008
owered+0.008
finder+0.008
Neg logit contributions
ELY-0.010
yz-0.010
llah-0.009
ami-0.009
icans-0.009
Tokenami
Token ID6277
Feature activation+4.59e-01

Pos logit contributions
qv+0.055
nexus+0.054
 Defender+0.047
owered+0.047
finder+0.046
Neg logit contributions
ELY-0.047
yz-0.046
llah-0.044
ami-0.043
icans-0.042
Token and
Token ID290
Feature activation-1.66e-02

Pos logit contributions
ELY+0.294
yz+0.288
llah+0.278
ami+0.271
icans+0.265
Neg logit contributions
qv-0.302
nexus-0.298
 Defender-0.260
owered-0.256
finder-0.253
Token Islam
Token ID3449
Feature activation+3.45e-02

Pos logit contributions
qv+0.011
nexus+0.011
 Defender+0.009
owered+0.009
finder+0.009
Neg logit contributions
ELY-0.011
yz-0.010
llah-0.010
ami-0.010
icans-0.009
Tokenb
Token ID65
Feature activation+1.71e-02

Pos logit contributions
ELY+0.020
yz+0.020
llah+0.019
ami+0.018
icans+0.018
Neg logit contributions
qv-0.025
nexus-0.024
 Defender-0.021
owered-0.021
finder-0.021
Tokenoul
Token ID2852
Feature activation+1.54e-01

Pos logit contributions
ELY+0.010
yz+0.010
llah+0.009
ami+0.009
icans+0.009
Neg logit contributions
qv-0.012
nexus-0.012
 Defender-0.011
owered-0.011
finder-0.011
Tokeni
Token ID72
Feature activation+1.57e-01

Pos logit contributions
ELY+0.082
yz+0.079
llah+0.075
ami+0.072
icans+0.071
Neg logit contributions
qv-0.119
nexus-0.117
 Defender-0.104
owered-0.104
finder-0.103
Token L
Token ID406
Feature activation-4.36e-02

Pos logit contributions
qv+0.003
nexus+0.003
 Defender+0.003
owered+0.003
finder+0.003
Neg logit contributions
ELY-0.002
yz-0.002
llah-0.002
ami-0.002
icans-0.002
TokenAND
Token ID6981
Feature activation+1.84e-01

Pos logit contributions
qv+0.033
nexus+0.032
 Defender+0.029
owered+0.028
finder+0.028
Neg logit contributions
ELY-0.024
yz-0.023
llah-0.022
ami-0.022
icans-0.021
Token OF
Token ID3963
Feature activation-6.71e-02

Pos logit contributions
ELY+0.109
yz+0.106
llah+0.103
ami+0.099
icans+0.097
Neg logit contributions
qv-0.130
nexus-0.128
 Defender-0.113
owered-0.112
finder-0.111
Token EXT
Token ID27489
Feature activation+3.44e-02

Pos logit contributions
qv+0.048
nexus+0.047
 Defender+0.042
owered+0.041
finder+0.041
Neg logit contributions
ELY-0.039
yz-0.038
llah-0.037
ami-0.036
icans-0.035
TokenREM
Token ID40726
Feature activation-1.63e-02

Pos logit contributions
ELY+0.018
yz+0.017
llah+0.016
ami+0.016
icans+0.015
Neg logit contributions
qv-0.027
nexus-0.027
 Defender-0.024
owered-0.024
finder-0.023
TokenELY
Token ID30943
Feature activation+4.59e-01

Pos logit contributions
qv+0.020
nexus+0.020
 Defender+0.018
owered+0.018
finder+0.018
Neg logit contributions
ELY-0.001
yz-0.001
llah-0.001
ami-0.001
icans-0.000
Token EXP
Token ID25703
Feature activation+6.36e-02

Pos logit contributions
ELY+0.271
yz+0.264
llah+0.254
ami+0.246
icans+0.241
Neg logit contributions
qv-0.326
nexus-0.321
 Defender-0.283
owered-0.280
finder-0.277
TokenENS
Token ID16938
Feature activation+2.32e-01

Pos logit contributions
ELY+0.024
yz+0.023
llah+0.021
ami+0.020
icans+0.020
Neg logit contributions
qv-0.059
nexus-0.058
 Defender-0.053
owered-0.053
finder-0.052
TokenIVE
Token ID9306
Feature activation+2.35e-01

Pos logit contributions
ELY+0.086
yz+0.083
llah+0.079
ami+0.074
icans+0.073
Neg logit contributions
qv-0.214
nexus-0.211
 Defender-0.192
owered-0.190
finder-0.189
Token BE
Token ID9348
Feature activation+1.29e-01

Pos logit contributions
ELY+0.137
yz+0.133
llah+0.128
ami+0.124
icans+0.122
Neg logit contributions
qv-0.168
nexus-0.166
 Defender-0.146
owered-0.144
finder-0.143
TokenER
Token ID1137
Feature activation-5.29e-02

Pos logit contributions
ELY+0.076
yz+0.074
llah+0.071
ami+0.068
icans+0.067
Neg logit contributions
qv-0.092
nexus-0.091
 Defender-0.080
owered-0.079
finder-0.078
Token v
Token ID410
Feature activation+1.28e-01

Pos logit contributions
nexus+0.092
qv+0.092
owered+0.079
 Defender+0.079
finder+0.078
Neg logit contributions
ELY-0.080
yz-0.079
llah-0.075
ami-0.074
icans-0.072
Token.
Token ID764
Feature activation-6.25e-03

Pos logit contributions
ELY+0.092
yz+0.091
llah+0.088
ami+0.086
icans+0.084
Neg logit contributions
qv-0.074
nexus-0.073
 Defender-0.062
owered-0.061
finder-0.060
Token vertex
Token ID37423
Feature activation-9.48e-02

Pos logit contributions
qv+0.004
nexus+0.004
 Defender+0.004
owered+0.004
finder+0.004
Neg logit contributions
ELY-0.004
yz-0.004
llah-0.004
ami-0.003
icans-0.003
Token.
Token ID764
Feature activation-7.26e-03

Pos logit contributions
qv+0.054
nexus+0.053
 Defender+0.045
owered+0.045
finder+0.044
Neg logit contributions
ELY-0.069
yz-0.068
llah-0.066
ami-0.064
icans-0.063
Token x
Token ID2124
Feature activation-9.12e-02

Pos logit contributions
qv+0.004
nexus+0.004
 Defender+0.004
owered+0.004
finder+0.004
Neg logit contributions
yz-0.005
ELY-0.005
llah-0.005
ami-0.005
icans-0.005
Tokenyz
Token ID45579
Feature activation+4.57e-01

Pos logit contributions
qv+0.071
nexus+0.070
 Defender+0.063
owered+0.062
finder+0.061
Neg logit contributions
ELY-0.047
yz-0.046
llah-0.044
ami-0.042
icans-0.041
Token +=
Token ID15853
Feature activation+6.53e-02

Pos logit contributions
ELY+0.265
yz+0.263
llah+0.248
ami+0.243
icans+0.235
Neg logit contributions
qv-0.327
nexus-0.323
 Defender-0.283
owered-0.280
finder-0.276
Token v
Token ID410
Feature activation+1.60e-01

Pos logit contributions
ELY+0.038
yz+0.038
llah+0.036
ami+0.035
icans+0.034
Neg logit contributions
nexus-0.046
qv-0.046
owered-0.040
 Defender-0.039
finder-0.039
Token.
Token ID764
Feature activation+1.36e-03

Pos logit contributions
ELY+0.116
yz+0.114
llah+0.110
ami+0.107
icans+0.105
Neg logit contributions
qv-0.092
nexus-0.090
 Defender-0.077
owered-0.076
finder-0.075
Token normal
Token ID3487
Feature activation+1.47e-02

Pos logit contributions
ELY+0.001
yz+0.001
llah+0.001
ami+0.001
icans+0.001
Neg logit contributions
qv-0.001
nexus-0.001
 Defender-0.001
owered-0.001
finder-0.001
Token *
Token ID1635
Feature activation+6.54e-02

Pos logit contributions
ELY+0.009
yz+0.009
llah+0.008
ami+0.008
icans+0.008
Neg logit contributions
qv-0.010
nexus-0.010
 Defender-0.009
owered-0.009
finder-0.008
Token (
Token ID357
Feature activation-6.03e-03

Pos logit contributions
qv+0.042
nexus+0.042
 Defender+0.036
owered+0.036
finder+0.035
Neg logit contributions
ELY-0.042
yz-0.041
llah-0.040
ami-0.039
icans-0.038
TokenApril
Token ID16784
Feature activation-1.51e-01

Pos logit contributions
qv+0.004
nexus+0.004
 Defender+0.003
owered+0.003
finder+0.003
Neg logit contributions
ELY-0.004
yz-0.004
llah-0.004
ami-0.004
icans-0.004
Token 19
Token ID678
Feature activation+1.32e-03

Pos logit contributions
qv+0.097
nexus+0.095
 Defender+0.082
owered+0.082
finder+0.081
Neg logit contributions
ELY-0.099
yz-0.097
llah-0.093
ami-0.091
icans-0.089
Token),
Token ID828
Feature activation-6.49e-03

Pos logit contributions
ELY+0.001
yz+0.001
llah+0.001
ami+0.001
icans+0.001
Neg logit contributions
qv-0.001
nexus-0.001
 Defender-0.001
owered-0.001
finder-0.001
Token Num
Token ID31835
Feature activation-8.64e-03

Pos logit contributions
qv+0.005
nexus+0.004
 Defender+0.004
owered+0.004
finder+0.004
Neg logit contributions
ELY-0.004
yz-0.004
llah-0.004
ami-0.004
icans-0.004
Tokenero
Token ID3529
Feature activation+4.53e-01

Pos logit contributions
qv+0.007
nexus+0.007
 Defender+0.007
owered+0.007
finder+0.006
Neg logit contributions
ELY-0.004
yz-0.004
llah-0.004
ami-0.003
icans-0.003
Token Group
Token ID4912
Feature activation-2.89e-02

Pos logit contributions
ELY+0.303
yz+0.296
llah+0.284
ami+0.278
icans+0.274
Neg logit contributions
qv-0.286
nexus-0.281
 Defender-0.242
owered-0.240
finder-0.237
Token will
Token ID481
Feature activation+4.01e-02

Pos logit contributions
qv+0.019
nexus+0.019
 Defender+0.016
owered+0.016
finder+0.016
Neg logit contributions
ELY-0.019
yz-0.018
llah-0.017
ami-0.017
icans-0.017
Token release
Token ID2650
Feature activation-1.66e-02

Pos logit contributions
ELY+0.026
yz+0.026
llah+0.025
ami+0.024
icans+0.024
Neg logit contributions
qv-0.026
nexus-0.025
 Defender-0.022
owered-0.022
finder-0.022
Token the
Token ID262
Feature activation-3.00e-03

Pos logit contributions
qv+0.011
nexus+0.011
 Defender+0.010
owered+0.009
finder+0.009
Neg logit contributions
ELY-0.010
yz-0.010
llah-0.010
ami-0.010
icans-0.009
Token limited
Token ID3614
Feature activation-8.31e-02

Pos logit contributions
qv+0.002
nexus+0.002
 Defender+0.002
owered+0.002
finder+0.002
Neg logit contributions
ELY-0.002
yz-0.002
llah-0.002
ami-0.002
icans-0.002
Token it
Token ID340
Feature activation+5.26e-03

Pos logit contributions
ELY+0.001
yz+0.001
llah+0.001
ami+0.001
icans+0.001
Neg logit contributions
qv-0.001
nexus-0.001
 Defender-0.001
owered-0.001
finder-0.001
Token is
Token ID318
Feature activation+1.84e-02

Pos logit contributions
ELY+0.003
yz+0.003
llah+0.003
ami+0.003
icans+0.003
Neg logit contributions
qv-0.004
nexus-0.004
 Defender-0.003
owered-0.003
finder-0.003
Token possible
Token ID1744
Feature activation+6.59e-02

Pos logit contributions
ELY+0.011
yz+0.010
llah+0.010
ami+0.010
icans+0.010
Neg logit contributions
qv-0.013
nexus-0.013
 Defender-0.011
owered-0.011
finder-0.011
Token if
Token ID611
Feature activation+1.07e-01

Pos logit contributions
ELY+0.041
yz+0.040
llah+0.039
ami+0.037
icans+0.037
Neg logit contributions
qv-0.045
nexus-0.044
 Defender-0.039
owered-0.038
finder-0.037
Token not
Token ID407
Feature activation-1.90e-02

Pos logit contributions
ELY+0.065
yz+0.064
llah+0.062
ami+0.060
icans+0.059
Neg logit contributions
qv-0.073
nexus-0.072
 Defender-0.063
owered-0.062
finder-0.061
Token probable
Token ID17939
Feature activation+4.50e-01

Pos logit contributions
qv+0.015
nexus+0.015
 Defender+0.013
owered+0.013
finder+0.013
Neg logit contributions
ELY-0.010
yz-0.009
llah-0.009
ami-0.009
icans-0.008
Token that
Token ID326
Feature activation-8.46e-03

Pos logit contributions
ELY+0.281
yz+0.274
llah+0.265
ami+0.257
icans+0.252
Neg logit contributions
qv-0.304
nexus-0.300
 Defender-0.263
owered-0.259
finder-0.256
Token credit
Token ID3884
Feature activation-1.83e-01

Pos logit contributions
qv+0.006
nexus+0.006
 Defender+0.005
owered+0.005
finder+0.005
Neg logit contributions
ELY-0.005
yz-0.005
llah-0.005
ami-0.005
icans-0.005
Token would
Token ID561
Feature activation-5.77e-02

Pos logit contributions
qv+0.127
nexus+0.125
 Defender+0.110
owered+0.108
finder+0.108
Neg logit contributions
ELY-0.111
yz-0.108
llah-0.104
ami-0.101
icans-0.099
Token not
Token ID407
Feature activation-5.71e-02

Pos logit contributions
qv+0.040
nexus+0.039
 Defender+0.034
owered+0.034
finder+0.033
Neg logit contributions
ELY-0.035
yz-0.034
llah-0.033
ami-0.032
icans-0.032
Token be
Token ID307
Feature activation-7.61e-02

Pos logit contributions
qv+0.039
nexus+0.038
 Defender+0.034
owered+0.033
finder+0.033
Neg logit contributions
ELY-0.035
yz-0.034
llah-0.033
ami-0.032
icans-0.032
Token.
Token ID13
Feature activation+1.37e-02

Pos logit contributions
qv+0.040
nexus+0.039
 Defender+0.034
owered+0.033
finder+0.033
Neg logit contributions
ELY-0.039
yz-0.038
llah-0.037
ami-0.036
icans-0.035
Token Add
Token ID3060
Feature activation-1.35e-01

Pos logit contributions
ELY+0.009
yz+0.008
llah+0.008
ami+0.008
icans+0.008
Neg logit contributions
qv-0.009
nexus-0.009
 Defender-0.008
owered-0.008
finder-0.008
Token to
Token ID284
Feature activation+1.66e-02

Pos logit contributions
qv+0.088
nexus+0.087
 Defender+0.075
owered+0.074
finder+0.074
Neg logit contributions
ELY-0.087
yz-0.085
llah-0.082
ami-0.080
icans-0.079
Token that
Token ID326
Feature activation+3.58e-02

Pos logit contributions
ELY+0.010
yz+0.010
llah+0.009
ami+0.009
icans+0.009
Neg logit contributions
qv-0.012
nexus-0.012
 Defender-0.010
owered-0.010
finder-0.010
Token some
Token ID617
Feature activation-1.47e-01

Pos logit contributions
ELY+0.023
yz+0.022
llah+0.022
ami+0.021
icans+0.021
Neg logit contributions
qv-0.023
nexus-0.023
 Defender-0.020
owered-0.020
finder-0.020
Token probable
Token ID17939
Feature activation+4.49e-01

Pos logit contributions
qv+0.098
nexus+0.097
 Defender+0.084
owered+0.083
finder+0.082
Neg logit contributions
ELY-0.092
yz-0.090
llah-0.087
ami-0.084
icans-0.083
Token guidance
Token ID11154
Feature activation+2.36e-02

Pos logit contributions
ELY+0.272
yz+0.266
llah+0.256
ami+0.248
icans+0.243
Neg logit contributions
qv-0.311
nexus-0.307
 Defender-0.269
owered-0.266
finder-0.262
Token from
Token ID422
Feature activation-6.48e-02

Pos logit contributions
ELY+0.016
yz+0.015
llah+0.015
ami+0.014
icans+0.014
Neg logit contributions
qv-0.015
nexus-0.015
 Defender-0.013
owered-0.013
finder-0.012
Token regulators
Token ID17199
Feature activation+1.70e-01

Pos logit contributions
qv+0.043
nexus+0.042
 Defender+0.037
owered+0.036
finder+0.036
Neg logit contributions
ELY-0.041
yz-0.040
llah-0.039
ami-0.038
icans-0.037
Token to
Token ID284
Feature activation-2.08e-02

Pos logit contributions
ELY+0.108
yz+0.105
llah+0.102
ami+0.099
icans+0.097
Neg logit contributions
qv-0.113
nexus-0.112
 Defender-0.097
owered-0.096
finder-0.094
Token the
Token ID262
Feature activation+5.43e-03

Pos logit contributions
qv+0.014
nexus+0.014
 Defender+0.012
owered+0.012
finder+0.012
Neg logit contributions
ELY-0.013
yz-0.013
llah-0.012
ami-0.012
icans-0.012
Token think
Token ID892
Feature activation+1.20e-01

Pos logit contributions
qv+0.005
nexus+0.005
 Defender+0.004
owered+0.004
finder+0.004
Neg logit contributions
ELY-0.004
yz-0.004
llah-0.004
ami-0.004
icans-0.004
Token
Token ID784
Feature activation+3.77e-02

Pos logit contributions
ELY+0.079
yz+0.077
llah+0.074
ami+0.072
icans+0.071
Neg logit contributions
qv-0.077
nexus-0.075
 Defender-0.066
owered-0.065
finder-0.064
Token bother
Token ID11393
Feature activation+1.93e-02

Pos logit contributions
ELY+0.023
yz+0.023
llah+0.022
ami+0.021
icans+0.021
Neg logit contributions
qv-0.026
nexus-0.025
 Defender-0.022
owered-0.022
finder-0.022
Token,
Token ID11
Feature activation+1.00e-02

Pos logit contributions
ELY+0.013
yz+0.012
llah+0.012
ami+0.012
icans+0.011
Neg logit contributions
qv-0.012
nexus-0.012
 Defender-0.011
owered-0.011
finder-0.010
Token another
Token ID1194
Feature activation-4.06e-02

Pos logit contributions
ELY+0.006
yz+0.006
llah+0.006
ami+0.006
icans+0.006
Neg logit contributions
qv-0.007
nexus-0.007
 Defender-0.006
owered-0.006
finder-0.006
Token probable
Token ID17939
Feature activation+4.46e-01

Pos logit contributions
qv+0.027
nexus+0.027
 Defender+0.023
owered+0.023
finder+0.023
Neg logit contributions
ELY-0.026
yz-0.025
llah-0.024
ami-0.024
icans-0.023
Token an
Token ID281
Feature activation-2.77e-02

Pos logit contributions
ELY+0.271
yz+0.264
llah+0.254
ami+0.246
icans+0.241
Neg logit contributions
qv-0.309
nexus-0.305
 Defender-0.267
owered-0.264
finder-0.261
Tokenagram
Token ID6713
Feature activation+1.47e-01

Pos logit contributions
qv+0.027
nexus+0.027
 Defender+0.025
owered+0.025
finder+0.024
Neg logit contributions
ELY-0.009
yz-0.008
llah-0.008
ami-0.007
icans-0.007
Token that
Token ID326
Feature activation+7.33e-03

Pos logit contributions
ELY+0.093
yz+0.090
llah+0.087
ami+0.085
icans+0.083
Neg logit contributions
qv-0.099
nexus-0.097
 Defender-0.085
owered-0.084
finder-0.083
Token I
Token ID314
Feature activation-8.80e-02

Pos logit contributions
ELY+0.004
yz+0.004
llah+0.004
ami+0.004
icans+0.004
Neg logit contributions
qv-0.005
nexus-0.005
 Defender-0.004
owered-0.004
finder-0.004
Token can
Token ID460
Feature activation+1.52e-02

Pos logit contributions
qv+0.062
nexus+0.061
 Defender+0.054
owered+0.053
finder+0.053
Neg logit contributions
ELY-0.052
yz-0.051
llah-0.049
ami-0.047
icans-0.046
Token.
Token ID13
Feature activation+1.86e-02

Pos logit contributions
ELY+0.057
yz+0.056
llah+0.054
ami+0.053
icans+0.052
Neg logit contributions
qv-0.064
nexus-0.063
 Defender-0.055
owered-0.054
finder-0.054
Tokenco
Token ID1073
Feature activation-4.24e-02

Pos logit contributions
ELY+0.012
yz+0.012
llah+0.011
ami+0.011
icans+0.011
Neg logit contributions
qv-0.012
nexus-0.012
 Defender-0.011
owered-0.010
finder-0.010
Token/
Token ID14
Feature activation-4.83e-02

Pos logit contributions
qv+0.022
nexus+0.021
 Defender+0.018
owered+0.018
finder+0.017
Neg logit contributions
ELY-0.034
yz-0.033
llah-0.032
ami-0.031
icans-0.030
Tokenni
Token ID8461
Feature activation-7.92e-02

Pos logit contributions
qv+0.032
nexus+0.031
 Defender+0.028
owered+0.027
finder+0.027
Neg logit contributions
ELY-0.031
yz-0.030
llah-0.029
ami-0.028
icans-0.028
TokenWB
Token ID45607
Feature activation+1.37e-01

Pos logit contributions
qv+0.053
nexus+0.052
 Defender+0.046
owered+0.045
finder+0.044
Neg logit contributions
ELY-0.050
yz-0.049
llah-0.047
ami-0.046
icans-0.045
Tokenqv
Token ID44179
Feature activation-5.06e-01

Pos logit contributions
ELY+0.089
yz+0.087
llah+0.084
ami+0.082
icans+0.080
Neg logit contributions
qv-0.089
nexus-0.088
 Defender-0.077
owered-0.076
finder-0.075
Tokenfi
Token ID12463
Feature activation-1.36e-01

Pos logit contributions
qv+0.337
nexus+0.331
 Defender+0.288
owered+0.284
finder+0.282
Neg logit contributions
ELY-0.322
yz-0.313
llah-0.303
ami-0.294
icans-0.290
TokenW
Token ID54
Feature activation-5.74e-02

Pos logit contributions
qv+0.090
nexus+0.089
 Defender+0.078
owered+0.077
finder+0.076
Neg logit contributions
ELY-0.086
yz-0.084
llah-0.081
ami-0.078
icans-0.077
Tokenu
Token ID84
Feature activation-1.08e-01

Pos logit contributions
qv+0.037
nexus+0.037
 Defender+0.032
owered+0.032
finder+0.031
Neg logit contributions
ELY-0.037
yz-0.036
llah-0.035
ami-0.034
icans-0.034
Token
Token ID851
Feature activation-5.98e-02

Pos logit contributions
qv+0.067
nexus+0.066
 Defender+0.057
owered+0.056
finder+0.056
Neg logit contributions
ELY-0.074
yz-0.072
llah-0.070
ami-0.068
icans-0.067
Token Dan
Token ID6035
Feature activation+7.45e-02

Pos logit contributions
qv+0.041
nexus+0.041
 Defender+0.036
owered+0.035
finder+0.035
Neg logit contributions
ELY-0.037
yz-0.036
llah-0.034
ami-0.033
icans-0.033
Tokenree
Token ID631
Feature activation+1.37e-01

Pos logit contributions
qv+0.007
nexus+0.007
owered+0.007
 Defender+0.007
finder+0.007
Neg logit contributions
ELY-0.002
yz-0.002
llah-0.002
ami-0.002
icans-0.002
Tokenfs
Token ID9501
Feature activation+1.96e-01

Pos logit contributions
ELY+0.071
yz+0.063
llah+0.061
 Ancients+0.059
ami+0.058
Neg logit contributions
nexus-0.110
qv-0.110
finder-0.098
owered-0.096
 Defender-0.096
Token,
Token ID11
Feature activation-5.37e-02

Pos logit contributions
ELY+0.123
yz+0.121
llah+0.117
ami+0.114
icans+0.112
Neg logit contributions
qv-0.130
nexus-0.128
 Defender-0.112
owered-0.110
finder-0.109
Token water
Token ID1660
Feature activation+8.19e-02

Pos logit contributions
qv+0.037
nexus+0.036
 Defender+0.032
owered+0.031
finder+0.031
Neg logit contributions
ELY-0.033
yz-0.032
llah-0.031
ami-0.030
icans-0.030
Token-
Token ID12
Feature activation-1.63e-02

Pos logit contributions
ELY+0.052
yz+0.052
llah+0.050
ami+0.049
icans+0.048
Neg logit contributions
qv-0.053
nexus-0.052
 Defender-0.046
owered-0.045
finder-0.044
Tokenpoll
Token ID30393
Feature activation-4.67e-01

Pos logit contributions
qv+0.009
nexus+0.009
 Defender+0.008
owered+0.008
finder+0.008
Neg logit contributions
ELY-0.012
yz-0.012
llah-0.011
ami-0.011
icans-0.011
Tokenution
Token ID1009
Feature activation-2.83e-01

Pos logit contributions
qv+0.198
nexus+0.193
 Defender+0.154
owered+0.149
finder+0.147
Neg logit contributions
ELY-0.408
yz-0.404
llah-0.393
ami-0.385
icans-0.381
Token,
Token ID11
Feature activation-4.94e-02

Pos logit contributions
qv+0.182
nexus+0.179
 Defender+0.155
owered+0.153
finder+0.152
Neg logit contributions
ELY-0.187
yz-0.182
llah-0.176
ami-0.171
icans-0.168
Token environmental
Token ID6142
Feature activation-8.61e-02

Pos logit contributions
qv+0.034
nexus+0.034
 Defender+0.030
owered+0.029
finder+0.029
Neg logit contributions
ELY-0.030
yz-0.029
llah-0.028
ami-0.027
icans-0.027
Token-
Token ID12
Feature activation-9.98e-03

Pos logit contributions
qv+0.052
nexus+0.051
 Defender+0.044
owered+0.044
finder+0.043
Neg logit contributions
ELY-0.060
yz-0.058
llah-0.057
ami-0.055
icans-0.054
Tokenimpact
Token ID48240
Feature activation-3.47e-02

Pos logit contributions
qv+0.007
nexus+0.007
 Defender+0.006
owered+0.006
finder+0.006
Neg logit contributions
ELY-0.006
yz-0.006
llah-0.006
ami-0.005
icans-0.005
Token on
Token ID319
Feature activation-2.20e-02

Pos logit contributions
qv+0.014
nexus+0.014
 Defender+0.012
owered+0.012
finder+0.012
Neg logit contributions
ELY-0.013
yz-0.013
llah-0.013
ami-0.012
icans-0.012
Token the
Token ID262
Feature activation-3.99e-02

Pos logit contributions
qv+0.013
nexus+0.013
 Defender+0.011
owered+0.011
finder+0.011
Neg logit contributions
ELY-0.016
yz-0.015
llah-0.015
ami-0.014
icans-0.014
Token Effects
Token ID17417
Feature activation+6.43e-02

Pos logit contributions
qv+0.027
nexus+0.026
 Defender+0.023
owered+0.023
finder+0.022
Neg logit contributions
ELY-0.025
yz-0.024
llah-0.024
ami-0.023
icans-0.023
Token of
Token ID286
Feature activation-5.04e-04

Pos logit contributions
ELY+0.041
yz+0.040
llah+0.039
ami+0.038
icans+0.037
Neg logit contributions
qv-0.043
nexus-0.042
 Defender-0.037
owered-0.036
finder-0.036
Token Atomic
Token ID28976
Feature activation-2.17e-01

Pos logit contributions
qv+0.000
nexus+0.000
 Defender+0.000
owered+0.000
finder+0.000
Neg logit contributions
ELY-0.000
yz-0.000
llah-0.000
ami-0.000
icans-0.000
Token Radiation
Token ID47532
Feature activation-4.67e-01

Pos logit contributions
qv+0.083
nexus+0.080
 Defender+0.063
owered+0.063
finder+0.060
Neg logit contributions
ELY-0.198
yz-0.195
llah-0.191
ami-0.185
icans-0.185
Token (
Token ID357
Feature activation-1.38e-02

Pos logit contributions
qv+0.304
nexus+0.297
 Defender+0.261
owered+0.256
finder+0.254
Neg logit contributions
ELY-0.304
yz-0.296
llah-0.287
ami-0.277
icans-0.274
TokenUN
Token ID4944
Feature activation-6.79e-02

Pos logit contributions
qv+0.010
nexus+0.010
 Defender+0.008
owered+0.008
finder+0.008
Neg logit contributions
ELY-0.008
yz-0.008
llah-0.008
ami-0.008
icans-0.007
TokenS
Token ID50
Feature activation+3.09e-02

Pos logit contributions
qv+0.053
nexus+0.052
 Defender+0.046
owered+0.046
finder+0.045
Neg logit contributions
ELY-0.036
yz-0.035
llah-0.033
ami-0.032
icans-0.031
TokenCE
Token ID5222
Feature activation+1.34e-01

Pos logit contributions
ELY+0.016
yz+0.015
llah+0.015
ami+0.014
icans+0.014
Neg logit contributions
qv-0.024
nexus-0.024
 Defender-0.021
owered-0.021
finder-0.021
TokenAR
Token ID1503
Feature activation-2.85e-02

Pos logit contributions
ELY+0.082
yz+0.079
llah+0.076
ami+0.074
icans+0.072
Neg logit contributions
qv-0.093
nexus-0.092
 Defender-0.081
owered-0.080
finder-0.079
Token the
Token ID262
Feature activation-2.78e-02

Pos logit contributions
ELY+0.007
yz+0.007
llah+0.006
ami+0.006
icans+0.006
Neg logit contributions
qv-0.007
nexus-0.007
 Defender-0.006
owered-0.006
finder-0.006
Token beginning
Token ID3726
Feature activation-7.54e-02

Pos logit contributions
qv+0.018
nexus+0.018
 Defender+0.015
owered+0.015
finder+0.015
Neg logit contributions
ELY-0.018
yz-0.018
llah-0.017
ami-0.017
icans-0.016
Token of
Token ID286
Feature activation-1.07e-02

Pos logit contributions
qv+0.047
nexus+0.046
 Defender+0.040
owered+0.040
finder+0.039
Neg logit contributions
ELY-0.051
yz-0.050
llah-0.048
ami-0.047
icans-0.046
Token the
Token ID262
Feature activation-5.55e-03

Pos logit contributions
qv+0.007
nexus+0.007
 Defender+0.006
owered+0.006
finder+0.006
Neg logit contributions
ELY-0.007
yz-0.007
llah-0.007
ami-0.006
icans-0.006
Token catastrophic
Token ID22694
Feature activation+8.73e-02

Pos logit contributions
qv+0.004
nexus+0.004
 Defender+0.003
owered+0.003
finder+0.003
Neg logit contributions
ELY-0.004
yz-0.003
llah-0.003
ami-0.003
icans-0.003
Token phase
Token ID7108
Feature activation-4.60e-01

Pos logit contributions
ELY+0.058
yz+0.057
llah+0.055
ami+0.053
icans+0.052
Neg logit contributions
qv-0.056
nexus-0.055
 Defender-0.047
owered-0.047
finder-0.046
Token,
Token ID11
Feature activation-7.80e-03

Pos logit contributions
qv+0.306
nexus+0.302
 Defender+0.264
owered+0.260
finder+0.257
Neg logit contributions
ELY-0.291
yz-0.284
llah-0.274
ami-0.266
icans-0.262
Token
Token ID447
Feature activation-1.90e-02

Pos logit contributions
qv+0.005
nexus+0.005
 Defender+0.005
owered+0.005
finder+0.005
Neg logit contributions
ELY-0.005
yz-0.005
llah-0.004
ami-0.004
icans-0.004
Token
Token ID251
Feature activation-3.44e-02

Pos logit contributions
qv+0.013
nexus+0.012
 Defender+0.011
owered+0.011
finder+0.011
Neg logit contributions
ELY-0.012
yz-0.012
llah-0.011
ami-0.011
icans-0.011
Token Sebastian
Token ID26190
Feature activation-3.49e-02

Pos logit contributions
qv+0.024
nexus+0.023
 Defender+0.021
owered+0.020
finder+0.020
Neg logit contributions
ELY-0.021
yz-0.020
llah-0.020
ami-0.019
icans-0.019
Token P
Token ID350
Feature activation-9.59e-03

Pos logit contributions
qv+0.024
nexus+0.023
 Defender+0.020
owered+0.020
finder+0.020
Neg logit contributions
ELY-0.022
yz-0.021
llah-0.021
ami-0.020
icans-0.020
TokenOn
Token ID2202
Feature activation-5.40e-02

Pos logit contributions
qv+0.038
nexus+0.037
 Defender+0.033
owered+0.032
finder+0.032
Neg logit contributions
ELY-0.036
yz-0.035
llah-0.034
ami-0.033
icans-0.032
Token the
Token ID262
Feature activation-4.29e-02

Pos logit contributions
qv+0.035
nexus+0.035
 Defender+0.030
owered+0.030
finder+0.029
Neg logit contributions
ELY-0.035
yz-0.034
llah-0.033
ami-0.032
icans-0.032
Token heels
Token ID18728
Feature activation+6.82e-02

Pos logit contributions
qv+0.027
nexus+0.027
 Defender+0.023
owered+0.023
finder+0.023
Neg logit contributions
ELY-0.029
yz-0.028
llah-0.027
ami-0.026
icans-0.026
Token of
Token ID286
Feature activation-1.67e-02

Pos logit contributions
ELY+0.042
yz+0.041
llah+0.039
ami+0.038
icans+0.037
Neg logit contributions
qv-0.047
nexus-0.046
 Defender-0.040
owered-0.040
finder-0.039
Token green
Token ID4077
Feature activation+6.33e-02

Pos logit contributions
qv+0.011
nexus+0.011
 Defender+0.010
owered+0.009
finder+0.009
Neg logit contributions
ELY-0.010
yz-0.010
llah-0.010
ami-0.010
icans-0.009
Tokenlighting
Token ID43351
Feature activation-4.55e-01

Pos logit contributions
ELY+0.043
yz+0.042
llah+0.041
ami+0.040
icans+0.039
Neg logit contributions
qv-0.039
nexus-0.039
 Defender-0.033
owered-0.033
finder-0.032
Token its
Token ID663
Feature activation+9.35e-02

Pos logit contributions
qv+0.299
nexus+0.294
 Defender+0.256
owered+0.252
finder+0.250
Neg logit contributions
ELY-0.293
yz-0.286
llah-0.276
ami-0.268
icans-0.264
Token first
Token ID717
Feature activation-9.63e-02

Pos logit contributions
ELY+0.061
yz+0.060
llah+0.058
ami+0.056
icans+0.055
Neg logit contributions
qv-0.060
nexus-0.059
 Defender-0.051
owered-0.051
finder-0.050
Token min
Token ID949
Feature activation+2.25e-02

Pos logit contributions
qv+0.064
nexus+0.063
 Defender+0.055
owered+0.054
finder+0.054
Neg logit contributions
ELY-0.061
yz-0.060
llah-0.057
ami-0.056
icans-0.055
Tokeniser
Token ID5847
Feature activation-1.23e-01

Pos logit contributions
ELY+0.018
yz+0.018
llah+0.017
ami+0.017
icans+0.017
Neg logit contributions
qv-0.011
nexus-0.011
 Defender-0.009
owered-0.009
finder-0.009
Tokenies
Token ID444
Feature activation-2.72e-02

Pos logit contributions
qv+0.076
nexus+0.075
 Defender+0.065
owered+0.064
finder+0.063
Neg logit contributions
ELY-0.083
yz-0.081
llah-0.079
ami-0.076
icans-0.075
Token German
Token ID2679
Feature activation-1.20e-01

Pos logit contributions
qv+0.017
nexus+0.016
 Defender+0.014
owered+0.014
finder+0.014
Neg logit contributions
ELY-0.015
yz-0.014
llah-0.014
ami-0.014
icans-0.013
Token-
Token ID12
Feature activation-3.09e-02

Pos logit contributions
qv+0.077
nexus+0.076
 Defender+0.066
owered+0.066
finder+0.065
Neg logit contributions
ELY-0.078
yz-0.077
llah-0.073
ami-0.072
icans-0.070
Tokenbased
Token ID3106
Feature activation-1.91e-01

Pos logit contributions
qv+0.017
nexus+0.017
 Defender+0.014
owered+0.014
finder+0.014
Neg logit contributions
ELY-0.023
yz-0.022
llah-0.022
ami-0.021
icans-0.021
Token Society
Token ID7023
Feature activation+9.81e-02

Pos logit contributions
qv+0.129
nexus+0.127
 Defender+0.111
owered+0.109
finder+0.108
Neg logit contributions
ELY-0.119
yz-0.116
llah-0.112
ami-0.109
icans-0.106
Token for
Token ID329
Feature activation-1.76e-02

Pos logit contributions
ELY+0.064
yz+0.063
llah+0.061
ami+0.059
icans+0.058
Neg logit contributions
qv-0.063
nexus-0.062
 Defender-0.054
owered-0.054
finder-0.053
Token Radiation
Token ID47532
Feature activation-4.51e-01

Pos logit contributions
qv+0.012
nexus+0.011
 Defender+0.010
owered+0.010
finder+0.010
Neg logit contributions
ELY-0.011
yz-0.011
llah-0.011
ami-0.010
icans-0.010
Token Protection
Token ID9985
Feature activation-2.50e-01

Pos logit contributions
qv+0.173
nexus+0.168
 Defender+0.133
owered+0.127
finder+0.126
Neg logit contributions
ELY-0.414
yz-0.406
llah-0.397
ami-0.389
icans-0.385
Token,
Token ID11
Feature activation+7.00e-03

Pos logit contributions
qv+0.164
nexus+0.161
 Defender+0.141
owered+0.139
finder+0.137
Neg logit contributions
ELY-0.161
yz-0.157
llah-0.152
ami-0.147
icans-0.145
Token said
Token ID531
Feature activation+5.96e-03

Pos logit contributions
ELY+0.004
yz+0.004
llah+0.004
ami+0.004
icans+0.004
Neg logit contributions
qv-0.005
nexus-0.005
 Defender-0.004
owered-0.004
finder-0.004
Token of
Token ID286
Feature activation-2.89e-02

Pos logit contributions
ELY+0.004
yz+0.004
llah+0.004
ami+0.003
icans+0.003
Neg logit contributions
qv-0.004
nexus-0.004
 Defender-0.003
owered-0.003
finder-0.003
Token Japan
Token ID2869
Feature activation-8.54e-02

Pos logit contributions
qv+0.020
nexus+0.019
 Defender+0.017
owered+0.017
finder+0.017
Neg logit contributions
ELY-0.018
yz-0.017
llah-0.017
ami-0.016
icans-0.016
Token Twin
Token ID14968
Feature activation-9.42e-02

Pos logit contributions
qv+0.097
nexus+0.095
 Defender+0.082
owered+0.081
finder+0.081
Neg logit contributions
ELY-0.098
yz-0.096
llah-0.092
ami-0.090
icans-0.088
Token Fro
Token ID9734
Feature activation-1.74e-01

Pos logit contributions
qv+0.056
nexus+0.055
 Defender+0.048
owered+0.047
finder+0.046
Neg logit contributions
ELY-0.066
yz-0.065
llah-0.063
ami-0.061
icans-0.060
Tokenz
Token ID89
Feature activation-7.12e-02

Pos logit contributions
qv+0.110
nexus+0.108
 Defender+0.094
owered+0.093
finder+0.091
Neg logit contributions
ELY-0.116
yz-0.113
llah-0.109
ami-0.106
icans-0.105
Tokenr
Token ID81
Feature activation-1.03e-01

Pos logit contributions
qv+0.044
nexus+0.043
 Defender+0.037
owered+0.036
finder+0.036
Neg logit contributions
ELY-0.049
yz-0.048
llah-0.046
ami-0.045
icans-0.044
Token IV
Token ID8363
Feature activation-7.27e-02

Pos logit contributions
qv+0.065
nexus+0.064
 Defender+0.055
owered+0.054
finder+0.054
Neg logit contributions
ELY-0.069
yz-0.068
llah-0.065
ami-0.064
icans-0.063
Token Thermal
Token ID41590
Feature activation-4.40e-01

Pos logit contributions
qv+0.046
nexus+0.045
 Defender+0.039
owered+0.038
finder+0.038
Neg logit contributions
ELY-0.048
yz-0.047
llah-0.046
ami-0.045
icans-0.044
Token Design
Token ID8495
Feature activation-3.00e-02

Pos logit contributions
qv+0.306
nexus+0.301
 Defender+0.265
owered+0.261
finder+0.259
Neg logit contributions
ELY-0.266
yz-0.259
llah-0.250
ami-0.241
icans-0.238
Token with
Token ID351
Feature activation-7.19e-02

Pos logit contributions
qv+0.020
nexus+0.019
 Defender+0.017
owered+0.016
finder+0.016
Neg logit contributions
ELY-0.019
yz-0.019
llah-0.018
ami-0.018
icans-0.017
Token Dust
Token ID16240
Feature activation-1.51e-01

Pos logit contributions
qv+0.046
nexus+0.045
 Defender+0.039
owered+0.038
finder+0.038
Neg logit contributions
ELY-0.047
yz-0.046
llah-0.045
ami-0.044
icans-0.043
Token Removal
Token ID46209
Feature activation-1.55e-01

Pos logit contributions
qv+0.076
nexus+0.074
 Defender+0.061
owered+0.060
finder+0.059
Neg logit contributions
ELY-0.120
yz-0.118
llah-0.114
ami-0.112
icans-0.110
Token Technology
Token ID8987
Feature activation-1.48e-01

Pos logit contributions
qv+0.100
nexus+0.098
 Defender+0.085
owered+0.084
finder+0.083
Neg logit contributions
ELY-0.101
yz-0.099
llah-0.095
ami-0.093
icans-0.091
Token Lake
Token ID6233
Feature activation-1.04e-03

Pos logit contributions
qv+0.052
nexus+0.052
 Defender+0.045
owered+0.044
finder+0.044
Neg logit contributions
ELY-0.050
yz-0.049
llah-0.047
ami-0.046
icans-0.045
Token Ur
Token ID8799
Feature activation+7.49e-02

Pos logit contributions
qv+0.001
nexus+0.001
 Defender+0.001
owered+0.001
finder+0.001
Neg logit contributions
ELY-0.001
yz-0.001
llah-0.001
ami-0.000
icans-0.000
Tokenmia
Token ID20730
Feature activation-2.87e-02

Pos logit contributions
ELY+0.054
yz+0.053
llah+0.051
ami+0.050
icans+0.049
Neg logit contributions
qv-0.044
nexus-0.043
 Defender-0.036
owered-0.036
finder-0.036
Token a
Token ID257
Feature activation-2.09e-03

Pos logit contributions
qv+0.018
nexus+0.018
 Defender+0.015
owered+0.015
finder+0.015
Neg logit contributions
ELY-0.020
yz-0.019
llah-0.018
ami-0.018
icans-0.018
Token bi
Token ID3182
Feature activation+1.56e-02

Pos logit contributions
qv+0.002
nexus+0.001
 Defender+0.001
owered+0.001
finder+0.001
Neg logit contributions
ELY-0.001
yz-0.001
llah-0.001
ami-0.001
icans-0.001
Tokenosphere
Token ID22829
Feature activation-4.38e-01

Pos logit contributions
ELY+0.016
yz+0.016
llah+0.015
ami+0.015
icans+0.015
Neg logit contributions
qv-0.005
nexus-0.004
 Defender-0.003
owered-0.003
finder-0.003
Token reserve
Token ID11515
Feature activation+1.46e-01

Pos logit contributions
qv+0.334
nexus+0.328
 Defender+0.295
finder+0.290
owered+0.289
Neg logit contributions
ELY-0.236
yz-0.229
llah-0.221
ami-0.210
icans-0.209
Token to
Token ID284
Feature activation-3.86e-02

Pos logit contributions
ELY+0.094
yz+0.092
llah+0.089
ami+0.086
icans+0.085
Neg logit contributions
qv-0.095
nexus-0.094
 Defender-0.082
owered-0.081
finder-0.080
Token encourage
Token ID7898
Feature activation-6.85e-02

Pos logit contributions
qv+0.026
nexus+0.026
 Defender+0.023
owered+0.022
finder+0.022
Neg logit contributions
ELY-0.024
yz-0.023
llah-0.023
ami-0.022
icans-0.022
Token sustainable
Token ID13347
Feature activation+1.40e-01

Pos logit contributions
qv+0.049
nexus+0.048
 Defender+0.043
owered+0.042
finder+0.042
Neg logit contributions
ELY-0.040
yz-0.039
llah-0.037
ami-0.036
icans-0.036
Token development
Token ID2478
Feature activation-1.88e-01

Pos logit contributions
ELY+0.094
yz+0.091
llah+0.088
ami+0.086
icans+0.085
Neg logit contributions
qv-0.089
nexus-0.087
 Defender-0.075
owered-0.074
finder-0.074
Tokenancing
Token ID5077
Feature activation+2.16e-01

Pos logit contributions
qv+0.049
nexus+0.048
 Defender+0.044
owered+0.043
finder+0.043
Neg logit contributions
ELY-0.019
yz-0.018
llah-0.017
ami-0.015
icans-0.015
Token,
Token ID11
Feature activation-7.01e-03

Pos logit contributions
ELY+0.137
yz+0.133
llah+0.129
ami+0.126
icans+0.123
Neg logit contributions
qv-0.143
nexus-0.141
 Defender-0.123
owered-0.122
finder-0.121
Token although
Token ID3584
Feature activation+2.39e-02

Pos logit contributions
qv+0.005
nexus+0.005
 Defender+0.004
owered+0.004
finder+0.004
Neg logit contributions
ELY-0.004
yz-0.004
llah-0.004
ami-0.004
icans-0.004
Token Night
Token ID5265
Feature activation-7.03e-02

Pos logit contributions
ELY+0.015
yz+0.014
llah+0.014
ami+0.013
icans+0.013
Neg logit contributions
qv-0.016
nexus-0.016
 Defender-0.014
owered-0.014
finder-0.014
Tokenc
Token ID66
Feature activation-1.03e-01

Pos logit contributions
qv+0.041
nexus+0.040
 Defender+0.034
owered+0.034
finder+0.033
Neg logit contributions
ELY-0.050
yz-0.049
llah-0.047
ami-0.047
icans-0.046
Tokenrawler
Token ID39464
Feature activation-4.28e-01

Pos logit contributions
qv+0.024
nexus+0.024
 Defender+0.015
owered+0.014
finder+0.013
Neg logit contributions
ELY-0.109
yz-0.108
llah-0.105
ami-0.104
icans-0.103
Token
Token ID447
Feature activation+1.61e-02

Pos logit contributions
qv+0.302
nexus+0.297
 Defender+0.262
owered+0.258
finder+0.256
Neg logit contributions
ELY-0.255
yz-0.248
llah-0.240
ami-0.231
icans-0.228
Token
Token ID247
Feature activation-1.22e-02

Pos logit contributions
ELY+0.010
yz+0.010
llah+0.010
ami+0.009
icans+0.009
Neg logit contributions
qv-0.011
nexus-0.010
 Defender-0.009
owered-0.009
finder-0.009
Tokens
Token ID82
Feature activation-2.36e-02

Pos logit contributions
qv+0.008
nexus+0.008
 Defender+0.007
owered+0.007
finder+0.006
Neg logit contributions
ELY-0.008
yz-0.008
llah-0.008
ami-0.007
icans-0.007
Token animation
Token ID11034
Feature activation+3.43e-03

Pos logit contributions
qv+0.016
nexus+0.015
 Defender+0.013
owered+0.013
finder+0.013
Neg logit contributions
ELY-0.015
yz-0.015
llah-0.014
ami-0.014
icans-0.014
Token wasn
Token ID2492
Feature activation+3.03e-02

Pos logit contributions
ELY+0.002
yz+0.002
llah+0.002
ami+0.002
icans+0.002
Neg logit contributions
qv-0.002
nexus-0.002
 Defender-0.002
owered-0.002
finder-0.002
Tokenunn
Token ID20935
Feature activation-1.51e-01

Pos logit contributions
ELY+0.075
yz+0.073
llah+0.070
ami+0.068
icans+0.067
Neg logit contributions
qv-0.101
nexus-0.099
 Defender-0.088
owered-0.087
finder-0.086
Tokenings
Token ID654
Feature activation-5.57e-02

Pos logit contributions
qv+0.104
nexus+0.102
 Defender+0.090
owered+0.089
finder+0.088
Neg logit contributions
ELY-0.092
yz-0.090
llah-0.087
ami-0.084
icans-0.083
Token has
Token ID468
Feature activation-4.12e-02

Pos logit contributions
qv+0.039
nexus+0.038
 Defender+0.033
owered+0.033
finder+0.033
Neg logit contributions
ELY-0.034
yz-0.033
llah-0.032
ami-0.031
icans-0.030
Token decided
Token ID3066
Feature activation-1.13e-02

Pos logit contributions
qv+0.028
nexus+0.028
 Defender+0.024
owered+0.024
finder+0.024
Neg logit contributions
ELY-0.026
yz-0.025
llah-0.024
ami-0.023
icans-0.023
Token to
Token ID284
Feature activation+3.17e-02

Pos logit contributions
qv+0.007
nexus+0.007
 Defender+0.006
owered+0.006
finder+0.006
Neg logit contributions
ELY-0.008
yz-0.008
llah-0.007
ami-0.007
icans-0.007
Token phase
Token ID7108
Feature activation-4.21e-01

Pos logit contributions
ELY+0.020
yz+0.019
llah+0.018
ami+0.018
icans+0.017
Neg logit contributions
qv-0.022
nexus-0.021
 Defender-0.019
owered-0.019
finder-0.018
Token them
Token ID606
Feature activation+3.95e-02

Pos logit contributions
qv+0.248
nexus+0.244
 Defender+0.209
owered+0.205
finder+0.204
Neg logit contributions
ELY-0.298
yz-0.292
llah-0.284
ami-0.276
icans-0.272
Token out
Token ID503
Feature activation-7.20e-02

Pos logit contributions
ELY+0.028
yz+0.027
llah+0.026
ami+0.026
icans+0.025
Neg logit contributions
qv-0.024
nexus-0.023
 Defender-0.020
owered-0.020
finder-0.019
Token in
Token ID287
Feature activation+1.74e-02

Pos logit contributions
qv+0.048
nexus+0.047
 Defender+0.041
owered+0.041
finder+0.040
Neg logit contributions
ELY-0.045
yz-0.044
llah-0.043
ami-0.041
icans-0.041
Token Australia
Token ID4505
Feature activation+4.41e-02

Pos logit contributions
ELY+0.011
yz+0.011
llah+0.010
ami+0.010
icans+0.010
Neg logit contributions
qv-0.012
nexus-0.012
 Defender-0.010
owered-0.010
finder-0.010
Token.
Token ID13
Feature activation+5.46e-02

Pos logit contributions
ELY+0.028
yz+0.027
llah+0.026
ami+0.025
icans+0.025
Neg logit contributions
qv-0.029
nexus-0.029
 Defender-0.025
owered-0.025
finder-0.025
Token said
Token ID531
Feature activation-2.64e-03

Pos logit contributions
qv+0.028
nexus+0.027
 Defender+0.024
owered+0.024
finder+0.024
Neg logit contributions
ELY-0.023
yz-0.022
llah-0.021
ami-0.021
icans-0.020
Token that
Token ID326
Feature activation-3.07e-02

Pos logit contributions
qv+0.002
nexus+0.002
 Defender+0.002
owered+0.002
finder+0.002
Neg logit contributions
ELY-0.002
yz-0.002
llah-0.002
ami-0.001
icans-0.001
Token it
Token ID340
Feature activation-2.58e-02

Pos logit contributions
qv+0.021
nexus+0.021
 Defender+0.018
owered+0.018
finder+0.018
Neg logit contributions
ELY-0.019
yz-0.018
llah-0.018
ami-0.017
icans-0.017
Token will
Token ID481
Feature activation+1.81e-02

Pos logit contributions
qv+0.019
nexus+0.019
 Defender+0.017
owered+0.017
finder+0.016
Neg logit contributions
ELY-0.014
yz-0.014
llah-0.013
ami-0.013
icans-0.013
Token then
Token ID788
Feature activation+3.35e-02

Pos logit contributions
ELY+0.011
yz+0.011
llah+0.010
ami+0.010
icans+0.010
Neg logit contributions
qv-0.013
nexus-0.012
 Defender-0.011
owered-0.011
finder-0.011
Token phase
Token ID7108
Feature activation-4.20e-01

Pos logit contributions
ELY+0.021
yz+0.020
llah+0.020
ami+0.019
icans+0.019
Neg logit contributions
qv-0.023
nexus-0.022
 Defender-0.020
owered-0.019
finder-0.019
Token out
Token ID503
Feature activation-1.05e-01

Pos logit contributions
qv+0.254
nexus+0.250
 Defender+0.215
owered+0.211
finder+0.209
Neg logit contributions
ELY-0.292
yz-0.285
llah-0.277
ami-0.269
icans-0.265
Token support
Token ID1104
Feature activation+4.77e-02

Pos logit contributions
qv+0.073
nexus+0.071
 Defender+0.063
owered+0.062
finder+0.061
Neg logit contributions
ELY-0.064
yz-0.062
llah-0.060
ami-0.058
icans-0.057
Token for
Token ID329
Feature activation-5.21e-02

Pos logit contributions
ELY+0.032
yz+0.031
llah+0.030
ami+0.029
icans+0.029
Neg logit contributions
qv-0.030
nexus-0.029
 Defender-0.025
owered-0.025
finder-0.025
Token the
Token ID262
Feature activation-2.51e-02

Pos logit contributions
qv+0.034
nexus+0.033
 Defender+0.029
owered+0.028
finder+0.028
Neg logit contributions
ELY-0.034
yz-0.033
llah-0.032
ami-0.031
icans-0.031
Token browser
Token ID6444
Feature activation-6.17e-02

Pos logit contributions
qv+0.016
nexus+0.016
 Defender+0.014
owered+0.014
finder+0.014
Neg logit contributions
ELY-0.016
yz-0.016
llah-0.015
ami-0.015
icans-0.015
Token in
Token ID287
Feature activation+2.60e-03

Pos logit contributions
ELY+0.081
yz+0.079
llah+0.076
ami+0.074
icans+0.072
Neg logit contributions
qv-0.092
nexus-0.091
 Defender-0.080
owered-0.079
finder-0.078
Token,
Token ID11
Feature activation+2.15e-02

Pos logit contributions
ELY+0.002
yz+0.002
llah+0.002
ami+0.002
icans+0.001
Neg logit contributions
qv-0.002
nexus-0.002
 Defender-0.001
owered-0.001
finder-0.001
Token when
Token ID618
Feature activation+6.66e-02

Pos logit contributions
ELY+0.013
yz+0.012
llah+0.012
ami+0.012
icans+0.011
Neg logit contributions
qv-0.015
nexus-0.015
 Defender-0.013
owered-0.013
finder-0.013
Token Night
Token ID5265
Feature activation-4.75e-02

Pos logit contributions
ELY+0.051
yz+0.050
llah+0.048
ami+0.048
icans+0.047
Neg logit contributions
qv-0.036
nexus-0.035
 Defender-0.029
owered-0.029
finder-0.029
Tokenc
Token ID66
Feature activation-9.88e-02

Pos logit contributions
qv+0.027
nexus+0.027
 Defender+0.023
owered+0.023
finder+0.022
Neg logit contributions
ELY-0.034
yz-0.033
llah-0.032
ami-0.031
icans-0.031
Tokenrawler
Token ID39464
Feature activation-4.20e-01

Pos logit contributions
qv+0.023
nexus+0.022
 Defender+0.014
owered+0.013
finder+0.012
Neg logit contributions
ELY-0.105
yz-0.104
llah-0.101
ami-0.100
icans-0.099
Token notices
Token ID19748
Feature activation-2.03e-02

Pos logit contributions
qv+0.288
nexus+0.284
 Defender+0.249
owered+0.246
finder+0.243
Neg logit contributions
ELY-0.257
yz-0.251
llah-0.242
ami-0.235
icans-0.231
Token that
Token ID326
Feature activation+6.75e-03

Pos logit contributions
qv+0.012
nexus+0.012
 Defender+0.010
owered+0.010
finder+0.010
Neg logit contributions
ELY-0.014
yz-0.014
llah-0.013
ami-0.013
icans-0.013
Token no
Token ID645
Feature activation+1.14e-02

Pos logit contributions
ELY+0.005
yz+0.005
llah+0.005
ami+0.005
icans+0.005
Neg logit contributions
qv-0.003
nexus-0.003
 Defender-0.003
owered-0.003
finder-0.003
Token one
Token ID530
Feature activation+3.89e-02

Pos logit contributions
ELY+0.007
yz+0.007
llah+0.006
ami+0.006
icans+0.006
Neg logit contributions
qv-0.008
nexus-0.008
 Defender-0.007
finder-0.007
owered-0.007
Token in
Token ID287
Feature activation+2.46e-03

Pos logit contributions
ELY+0.023
yz+0.023
llah+0.022
ami+0.021
icans+0.021
Neg logit contributions
qv-0.027
nexus-0.027
 Defender-0.024
owered-0.023
finder-0.023
Token was
Token ID373
Feature activation+4.65e-04

Pos logit contributions
qv+0.026
nexus+0.025
 Defender+0.022
owered+0.022
finder+0.022
Neg logit contributions
ELY-0.024
yz-0.023
llah-0.022
ami-0.022
icans-0.021
Token riding
Token ID10311
Feature activation+1.39e-01

Pos logit contributions
ELY+0.000
yz+0.000
llah+0.000
ami+0.000
icans+0.000
Neg logit contributions
qv-0.000
nexus-0.000
 Defender-0.000
owered-0.000
finder-0.000
Token her
Token ID607
Feature activation+5.96e-02

Pos logit contributions
ELY+0.080
yz+0.077
llah+0.075
ami+0.072
icans+0.071
Neg logit contributions
qv-0.101
nexus-0.099
 Defender-0.088
owered-0.087
finder-0.086
Token bike
Token ID7161
Feature activation+1.52e-01

Pos logit contributions
ELY+0.030
yz+0.029
llah+0.027
ami+0.026
icans+0.026
Neg logit contributions
qv-0.048
nexus-0.047
 Defender-0.042
owered-0.042
finder-0.041
Token north
Token ID5093
Feature activation-5.62e-02

Pos logit contributions
ELY+0.095
yz+0.093
llah+0.090
ami+0.087
icans+0.086
Neg logit contributions
qv-0.102
nexus-0.101
 Defender-0.089
owered-0.087
finder-0.086
Tokenbound
Token ID7784
Feature activation-4.19e-01

Pos logit contributions
qv+0.023
nexus+0.023
 Defender+0.018
owered+0.017
finder+0.017
Neg logit contributions
ELY-0.050
yz-0.049
llah-0.048
ami-0.047
icans-0.046
Token on
Token ID319
Feature activation-1.79e-02

Pos logit contributions
qv+0.276
nexus+0.271
 Defender+0.238
owered+0.234
finder+0.231
Neg logit contributions
ELY-0.268
yz-0.261
llah-0.254
ami-0.245
icans-0.241
Token Pennsylvania
Token ID9589
Feature activation+1.65e-02

Pos logit contributions
qv+0.012
nexus+0.012
 Defender+0.011
owered+0.011
finder+0.010
Neg logit contributions
ELY-0.011
yz-0.011
llah-0.010
ami-0.010
icans-0.010
Token Street
Token ID3530
Feature activation+3.05e-03

Pos logit contributions
ELY+0.012
yz+0.012
llah+0.012
ami+0.012
icans+0.011
Neg logit contributions
qv-0.009
nexus-0.009
 Defender-0.007
owered-0.007
finder-0.007
Token,
Token ID11
Feature activation-4.77e-02

Pos logit contributions
ELY+0.002
yz+0.002
llah+0.002
ami+0.002
icans+0.002
Neg logit contributions
qv-0.002
nexus-0.002
 Defender-0.002
owered-0.002
finder-0.002
Token said
Token ID531
Feature activation-3.77e-03

Pos logit contributions
qv+0.033
nexus+0.033
 Defender+0.029
owered+0.028
finder+0.028
Neg logit contributions
ELY-0.029
yz-0.028
llah-0.027
ami-0.026
icans-0.026
Token Chron
Token ID8750
Feature activation-2.29e-01

Pos logit contributions
qv+0.035
nexus+0.035
 Defender+0.030
owered+0.030
finder+0.030
Neg logit contributions
ELY-0.033
yz-0.032
llah-0.031
ami-0.031
icans-0.030
Tokeno
Token ID78
Feature activation+2.28e-02

Pos logit contributions
qv+0.173
nexus+0.171
 Defender+0.152
owered+0.150
finder+0.148
Neg logit contributions
ELY-0.124
yz-0.121
llah-0.115
ami-0.113
icans-0.111
Token Trigger
Token ID24593
Feature activation-1.82e-01

Pos logit contributions
ELY+0.019
yz+0.019
llah+0.018
ami+0.018
icans+0.018
Neg logit contributions
qv-0.010
nexus-0.010
 Defender-0.008
owered-0.008
finder-0.008
Token and
Token ID290
Feature activation+1.51e-02

Pos logit contributions
qv+0.117
nexus+0.115
 Defender+0.100
owered+0.099
finder+0.098
Neg logit contributions
ELY-0.119
yz-0.117
llah-0.113
ami-0.110
icans-0.108
Token Earth
Token ID3668
Feature activation-7.14e-02

Pos logit contributions
ELY+0.010
yz+0.009
llah+0.009
ami+0.009
icans+0.009
Neg logit contributions
qv-0.010
nexus-0.010
 Defender-0.008
owered-0.008
finder-0.008
Tokenbound
Token ID7784
Feature activation-4.18e-01

Pos logit contributions
qv+0.021
nexus+0.020
 Defender+0.014
owered+0.014
finder+0.013
Neg logit contributions
ELY-0.072
yz-0.071
llah-0.069
ami-0.068
icans-0.067
Token,
Token ID11
Feature activation-2.72e-02

Pos logit contributions
qv+0.261
nexus+0.256
 Defender+0.221
owered+0.218
finder+0.216
Neg logit contributions
ELY-0.283
yz-0.277
llah-0.268
ami-0.260
icans-0.256
Token tell
Token ID1560
Feature activation-3.46e-05

Pos logit contributions
qv+0.019
nexus+0.019
 Defender+0.016
owered+0.016
finder+0.016
Neg logit contributions
ELY-0.016
yz-0.016
llah-0.015
ami-0.015
icans-0.015
Token us
Token ID514
Feature activation-2.68e-02

Pos logit contributions
qv+0.000
nexus+0.000
 Defender+0.000
owered+0.000
finder+0.000
Neg logit contributions
ELY-0.000
yz-0.000
llah-0.000
ami-0.000
icans-0.000
Token there
Token ID612
Feature activation-6.30e-02

Pos logit contributions
qv+0.018
nexus+0.017
 Defender+0.015
owered+0.015
finder+0.015
Neg logit contributions
ELY-0.017
yz-0.017
llah-0.016
ami-0.016
icans-0.015
Token
Token ID447
Feature activation+4.12e-03

Pos logit contributions
qv+0.043
nexus+0.043
 Defender+0.038
owered+0.037
finder+0.037
Neg logit contributions
ELY-0.038
yz-0.037
llah-0.036
ami-0.035
icans-0.034
Token streaming
Token ID11305
Feature activation-1.92e-02

Pos logit contributions
ELY+0.010
yz+0.010
llah+0.010
ami+0.009
icans+0.009
Neg logit contributions
qv-0.010
nexus-0.009
 Defender-0.008
owered-0.008
finder-0.008
Token five
Token ID1936
Feature activation-9.88e-02

Pos logit contributions
qv+0.012
nexus+0.012
 Defender+0.010
owered+0.010
finder+0.010
Neg logit contributions
ELY-0.013
yz-0.012
llah-0.012
ami-0.012
icans-0.011
Token videos
Token ID5861
Feature activation+5.94e-02

Pos logit contributions
qv+0.069
nexus+0.068
 Defender+0.060
owered+0.059
finder+0.059
Neg logit contributions
ELY-0.059
yz-0.058
llah-0.056
ami-0.054
icans-0.053
Token of
Token ID286
Feature activation+2.55e-03

Pos logit contributions
ELY+0.038
yz+0.037
llah+0.036
ami+0.035
icans+0.034
Neg logit contributions
qv-0.039
nexus-0.038
 Defender-0.033
owered-0.033
finder-0.033
Token high
Token ID1029
Feature activation-9.18e-03

Pos logit contributions
ELY+0.002
yz+0.002
llah+0.002
ami+0.002
icans+0.001
Neg logit contributions
qv-0.002
nexus-0.002
 Defender-0.001
owered-0.001
finder-0.001
Token definition
Token ID6770
Feature activation-4.18e-01

Pos logit contributions
qv+0.006
nexus+0.006
 Defender+0.005
owered+0.005
finder+0.005
Neg logit contributions
ELY-0.006
yz-0.006
llah-0.006
ami-0.006
icans-0.006
Token video
Token ID2008
Feature activation+5.84e-03

Pos logit contributions
qv+0.279
nexus+0.275
 Defender+0.241
owered+0.237
finder+0.235
Neg logit contributions
ELY-0.263
yz-0.257
llah-0.249
ami-0.240
icans-0.237
Token a
Token ID257
Feature activation-3.58e-03

Pos logit contributions
ELY+0.004
yz+0.004
llah+0.003
ami+0.003
icans+0.003
Neg logit contributions
qv-0.004
nexus-0.004
 Defender-0.003
owered-0.003
finder-0.003
Token day
Token ID1110
Feature activation-6.51e-02

Pos logit contributions
qv+0.002
nexus+0.002
 Defender+0.002
owered+0.002
finder+0.002
Neg logit contributions
ELY-0.003
yz-0.003
llah-0.003
ami-0.003
icans-0.003
Token.
Token ID13
Feature activation+1.20e-02

Pos logit contributions
qv+0.043
nexus+0.042
 Defender+0.037
owered+0.036
finder+0.036
Neg logit contributions
ELY-0.042
yz-0.041
llah-0.039
ami-0.038
icans-0.038
Token
Token ID198
Feature activation-4.73e-03

Pos logit contributions
ELY+0.008
yz+0.008
llah+0.007
ami+0.007
icans+0.007
Neg logit contributions
qv-0.008
nexus-0.008
 Defender-0.007
owered-0.007
finder-0.007
Token prime
Token ID6994
Feature activation-1.33e-02

Pos logit contributions
ELY+0.022
yz+0.022
llah+0.021
ami+0.021
icans+0.020
Neg logit contributions
qv-0.021
nexus-0.021
 Defender-0.018
owered-0.018
finder-0.018
Token minister
Token ID5342
Feature activation-2.43e-01

Pos logit contributions
qv+0.009
nexus+0.009
 Defender+0.008
owered+0.007
finder+0.007
Neg logit contributions
ELY-0.009
yz-0.008
llah-0.008
ami-0.008
icans-0.008
Token,
Token ID11
Feature activation-7.08e-02

Pos logit contributions
qv+0.162
nexus+0.160
 Defender+0.139
owered+0.137
finder+0.136
Neg logit contributions
ELY-0.154
yz-0.150
llah-0.145
ami-0.141
icans-0.139
Token following
Token ID1708
Feature activation-1.15e-01

Pos logit contributions
qv+0.048
nexus+0.047
 Defender+0.041
owered+0.040
finder+0.040
Neg logit contributions
ELY-0.044
yz-0.043
llah-0.042
ami-0.041
icans-0.040
Token Margaret
Token ID19579
Feature activation-9.96e-02

Pos logit contributions
qv+0.076
nexus+0.075
 Defender+0.065
owered+0.064
finder+0.064
Neg logit contributions
ELY-0.074
yz-0.072
llah-0.069
ami-0.067
icans-0.066
Token Thatcher
Token ID31706
Feature activation-4.17e-01

Pos logit contributions
nexus+0.025
qv+0.025
 Defender+0.015
owered+0.015
finder+0.015
Neg logit contributions
ELY-0.104
yz-0.102
llah-0.101
ami-0.099
icans-0.098
Token,
Token ID11
Feature activation-6.97e-03

Pos logit contributions
qv+0.277
nexus+0.272
 Defender+0.238
owered+0.234
finder+0.232
Neg logit contributions
ELY-0.265
yz-0.259
llah-0.250
ami-0.242
icans-0.239
Token told
Token ID1297
Feature activation+1.05e-02

Pos logit contributions
qv+0.005
nexus+0.005
 Defender+0.004
owered+0.004
finder+0.004
Neg logit contributions
ELY-0.004
yz-0.004
llah-0.004
ami-0.004
icans-0.004
Token the
Token ID262
Feature activation-7.47e-02

Pos logit contributions
ELY+0.007
yz+0.007
llah+0.006
ami+0.006
icans+0.006
Neg logit contributions
qv-0.007
nexus-0.007
 Defender-0.006
owered-0.006
finder-0.006
Token BBC
Token ID7823
Feature activation-5.29e-02

Pos logit contributions
qv+0.050
nexus+0.049
 Defender+0.043
owered+0.042
finder+0.042
Neg logit contributions
ELY-0.047
yz-0.046
llah-0.044
ami-0.043
icans-0.042
Token.
Token ID13
Feature activation-5.61e-02

Pos logit contributions
qv+0.037
nexus+0.036
 Defender+0.032
owered+0.031
finder+0.031
Neg logit contributions
ELY-0.032
yz-0.031
llah-0.030
ami-0.029
icans-0.029
Token
Token ID198
Feature activation+5.97e-04

Pos logit contributions
ELY+0.001
yz+0.001
llah+0.001
ami+0.001
icans+0.001
Neg logit contributions
qv-0.001
nexus-0.001
 Defender-0.001
owered-0.001
finder-0.001
Token
Token ID198
Feature activation-8.95e-03

Pos logit contributions
ELY+0.000
yz+0.000
llah+0.000
ami+0.000
icans+0.000
Neg logit contributions
qv-0.000
nexus-0.000
 Defender-0.000
owered-0.000
finder-0.000
TokenThe
Token ID464
Feature activation-2.60e-02

Pos logit contributions
qv+0.006
nexus+0.006
 Defender+0.005
owered+0.005
finder+0.005
Neg logit contributions
ELY-0.006
yz-0.005
llah-0.005
ami-0.005
icans-0.005
Token proposed
Token ID5150
Feature activation+8.50e-02

Pos logit contributions
qv+0.017
nexus+0.017
 Defender+0.015
owered+0.014
finder+0.014
Neg logit contributions
ELY-0.017
yz-0.016
llah-0.016
ami-0.015
icans-0.015
Token first
Token ID717
Feature activation-7.92e-02

Pos logit contributions
ELY+0.052
yz+0.051
llah+0.049
ami+0.047
icans+0.047
Neg logit contributions
qv-0.058
nexus-0.058
 Defender-0.050
owered-0.050
finder-0.049
Token phase
Token ID7108
Feature activation-4.16e-01

Pos logit contributions
qv+0.034
nexus+0.033
 Defender+0.026
owered+0.026
finder+0.025
Neg logit contributions
ELY-0.069
yz-0.068
llah-0.066
ami-0.065
icans-0.064
Token would
Token ID561
Feature activation-4.45e-02

Pos logit contributions
qv+0.289
nexus+0.284
 Defender+0.250
owered+0.246
finder+0.244
Neg logit contributions
ELY-0.252
yz-0.246
llah-0.237
ami-0.230
icans-0.226
Token do
Token ID466
Feature activation-1.39e-01

Pos logit contributions
qv+0.030
nexus+0.030
 Defender+0.026
owered+0.025
finder+0.025
Neg logit contributions
ELY-0.028
yz-0.027
llah-0.026
ami-0.025
icans-0.025
Token two
Token ID734
Feature activation-6.69e-02

Pos logit contributions
qv+0.103
nexus+0.101
 Defender+0.090
owered+0.089
finder+0.088
Neg logit contributions
ELY-0.078
yz-0.076
llah-0.073
ami-0.070
icans-0.069
Token main
Token ID1388
Feature activation+8.06e-02

Pos logit contributions
qv+0.045
nexus+0.044
 Defender+0.039
owered+0.038
finder+0.038
Neg logit contributions
ELY-0.042
yz-0.041
llah-0.040
ami-0.039
icans-0.038
Token things
Token ID1243
Feature activation-2.07e-02

Pos logit contributions
ELY+0.051
yz+0.049
llah+0.048
ami+0.046
icans+0.045
Neg logit contributions
qv-0.054
nexus-0.053
 Defender-0.047
owered-0.046
finder-0.045
Token,
Token ID11
Feature activation-3.28e-02

Pos logit contributions
ELY+0.026
yz+0.026
llah+0.025
ami+0.024
icans+0.024
Neg logit contributions
qv-0.028
nexus-0.027
 Defender-0.024
owered-0.024
finder-0.023
Token Pe
Token ID2631
Feature activation-1.07e-02

Pos logit contributions
qv+0.023
nexus+0.022
 Defender+0.020
owered+0.019
finder+0.019
Neg logit contributions
ELY-0.020
yz-0.019
llah-0.019
ami-0.018
icans-0.018
Tokenpt
Token ID457
Feature activation-3.58e-01

Pos logit contributions
qv+0.005
nexus+0.005
 Defender+0.004
owered+0.004
finder+0.004
Neg logit contributions
ELY-0.009
yz-0.008
llah-0.008
ami-0.008
icans-0.008
Tokenost
Token ID455
Feature activation-2.18e-01

Pos logit contributions
qv+0.307
nexus+0.303
 Defender+0.274
owered+0.270
finder+0.268
Neg logit contributions
ELY-0.158
yz-0.154
llah-0.146
ami-0.140
icans-0.137
Tokenre
Token ID260
Feature activation+2.21e-02

Pos logit contributions
qv+0.155
nexus+0.153
 Defender+0.135
owered+0.133
finder+0.132
Neg logit contributions
ELY-0.129
yz-0.124
llah-0.121
ami-0.116
icans-0.114
Tokenpt
Token ID457
Feature activation-4.16e-01

Pos logit contributions
ELY+0.015
yz+0.015
llah+0.014
ami+0.014
icans+0.014
Neg logit contributions
qv-0.013
nexus-0.013
 Defender-0.011
owered-0.011
finder-0.011
Tokenococ
Token ID34403
Feature activation-7.45e-02

Pos logit contributions
qv+0.309
nexus+0.304
 Defender+0.269
owered+0.267
finder+0.264
Neg logit contributions
ELY-0.234
yz-0.224
llah-0.218
ami-0.206
icans-0.202
Tokencus
Token ID9042
Feature activation-2.07e-01

Pos logit contributions
qv+0.041
nexus+0.040
 Defender+0.034
owered+0.033
finder+0.033
Neg logit contributions
ELY-0.056
yz-0.055
llah-0.053
ami-0.052
icans-0.051
Token,
Token ID11
Feature activation-5.56e-02

Pos logit contributions
qv+0.135
nexus+0.133
 Defender+0.116
owered+0.115
finder+0.113
Neg logit contributions
ELY-0.133
yz-0.130
llah-0.126
ami-0.122
icans-0.120
Token Gard
Token ID12790
Feature activation-2.72e-02

Pos logit contributions
qv+0.037
nexus+0.037
 Defender+0.032
owered+0.032
finder+0.031
Neg logit contributions
ELY-0.035
yz-0.034
llah-0.033
ami-0.032
icans-0.031
Tokenn
Token ID77
Feature activation-1.53e-01

Pos logit contributions
qv+0.018
nexus+0.018
 Defender+0.016
owered+0.016
finder+0.015
Neg logit contributions
ELY-0.017
yz-0.017
llah-0.016
ami-0.016
icans-0.015
Token caus
Token ID26846
Feature activation+1.34e-01

Pos logit contributions
qv+0.041
nexus+0.040
 Defender+0.035
owered+0.035
finder+0.035
Neg logit contributions
ELY-0.038
yz-0.037
llah-0.036
ami-0.035
icans-0.034
Tokenality
Token ID1483
Feature activation-1.76e-01

Pos logit contributions
ELY+0.109
yz+0.107
llah+0.104
ami+0.102
icans+0.101
Neg logit contributions
qv-0.065
nexus-0.063
 Defender-0.053
owered-0.052
finder-0.050
Token and
Token ID290
Feature activation-1.14e-02

Pos logit contributions
qv+0.116
nexus+0.114
 Defender+0.100
owered+0.098
finder+0.097
Neg logit contributions
ELY-0.113
yz-0.110
llah-0.107
ami-0.103
icans-0.102
Token the
Token ID262
Feature activation+1.51e-02

Pos logit contributions
qv+0.008
nexus+0.008
 Defender+0.007
owered+0.007
finder+0.007
Neg logit contributions
ELY-0.007
yz-0.007
llah-0.006
ami-0.006
icans-0.006
Token defin
Token ID2730
Feature activation+6.53e-03

Pos logit contributions
ELY+0.009
yz+0.009
llah+0.009
ami+0.008
icans+0.008
Neg logit contributions
qv-0.010
nexus-0.010
 Defender-0.009
owered-0.009
finder-0.009
Tokenitiveness
Token ID31366
Feature activation-4.16e-01

Pos logit contributions
ELY+0.006
yz+0.006
llah+0.006
ami+0.005
icans+0.005
Neg logit contributions
qv-0.003
nexus-0.003
 Defender-0.002
owered-0.002
finder-0.002
Token of
Token ID286
Feature activation+1.50e-02

Pos logit contributions
qv+0.272
nexus+0.268
 Defender+0.233
owered+0.229
finder+0.228
Neg logit contributions
ELY-0.269
yz-0.263
llah-0.253
ami-0.245
icans-0.243
Token intentions
Token ID14953
Feature activation+1.28e-01

Pos logit contributions
ELY+0.009
yz+0.009
llah+0.009
ami+0.008
icans+0.008
Neg logit contributions
qv-0.010
nexus-0.010
 Defender-0.009
owered-0.009
finder-0.009
Token in
Token ID287
Feature activation+1.44e-02

Pos logit contributions
ELY+0.081
yz+0.079
llah+0.076
ami+0.074
icans+0.073
Neg logit contributions
qv-0.086
nexus-0.085
 Defender-0.074
owered-0.073
finder-0.072
Token jazz
Token ID21274
Feature activation+1.07e-01

Pos logit contributions
ELY+0.008
yz+0.007
llah+0.007
ami+0.007
icans+0.007
Neg logit contributions
qv-0.011
nexus-0.011
 Defender-0.010
owered-0.010
finder-0.010
Token improvis
Token ID47833
Feature activation-2.92e-02

Pos logit contributions
ELY+0.061
yz+0.059
llah+0.057
ami+0.055
icans+0.054
Neg logit contributions
qv-0.078
nexus-0.077
 Defender-0.069
owered-0.068
finder-0.067
Token bass
Token ID12702
Feature activation-5.01e-03

Pos logit contributions
ELY+0.003
yz+0.003
llah+0.003
ami+0.003
icans+0.003
Neg logit contributions
qv-0.004
nexus-0.004
 Defender-0.003
owered-0.003
finder-0.003
Token driver
Token ID4639
Feature activation+5.63e-02

Pos logit contributions
qv+0.003
nexus+0.003
 Defender+0.003
owered+0.003
finder+0.003
Neg logit contributions
ELY-0.003
yz-0.003
llah-0.003
ami-0.003
icans-0.003
Token,
Token ID11
Feature activation-1.01e-02

Pos logit contributions
ELY+0.036
yz+0.035
llah+0.034
ami+0.033
icans+0.032
Neg logit contributions
qv-0.037
nexus-0.037
 Defender-0.032
owered-0.032
finder-0.031
Token which
Token ID543
Feature activation+5.22e-02

Pos logit contributions
qv+0.007
nexus+0.007
 Defender+0.006
owered+0.006
finder+0.006
Neg logit contributions
ELY-0.006
yz-0.006
llah-0.006
ami-0.006
icans-0.006
Token is
Token ID318
Feature activation-4.30e-03

Pos logit contributions
ELY+0.031
yz+0.030
llah+0.029
ami+0.028
icans+0.027
Neg logit contributions
qv-0.037
nexus-0.037
 Defender-0.032
owered-0.032
finder-0.031
Token augmented
Token ID30259
Feature activation-4.15e-01

Pos logit contributions
qv+0.003
nexus+0.003
 Defender+0.002
owered+0.002
finder+0.002
Neg logit contributions
ELY-0.003
yz-0.003
llah-0.003
ami-0.002
icans-0.002
Token by
Token ID416
Feature activation+1.34e-02

Pos logit contributions
qv+0.281
nexus+0.276
 Defender+0.242
owered+0.239
finder+0.236
Neg logit contributions
ELY-0.259
yz-0.252
llah-0.244
ami-0.236
icans-0.233
Token two
Token ID734
Feature activation-8.47e-02

Pos logit contributions
ELY+0.009
yz+0.008
llah+0.008
ami+0.008
icans+0.008
Neg logit contributions
qv-0.009
nexus-0.009
 Defender-0.008
owered-0.007
finder-0.007
Token passive
Token ID14513
Feature activation-7.41e-02

Pos logit contributions
qv+0.057
nexus+0.056
 Defender+0.049
owered+0.048
finder+0.047
Neg logit contributions
ELY-0.054
yz-0.052
llah-0.050
ami-0.049
icans-0.048
Token radi
Token ID19772
Feature activation-1.15e-01

Pos logit contributions
qv+0.050
nexus+0.049
 Defender+0.043
owered+0.043
finder+0.042
Neg logit contributions
ELY-0.046
yz-0.045
llah-0.044
ami-0.042
icans-0.041
Tokenators
Token ID2024
Feature activation-1.35e-01

Pos logit contributions
qv+0.069
nexus+0.068
 Defender+0.059
owered+0.057
finder+0.057
Neg logit contributions
ELY-0.080
yz-0.079
llah-0.076
ami-0.075
icans-0.074
Token the
Token ID262
Feature activation-5.17e-03

Pos logit contributions
qv+0.064
nexus+0.063
 Defender+0.055
owered+0.055
finder+0.054
Neg logit contributions
ELY-0.062
yz-0.060
llah-0.058
ami-0.056
icans-0.055
Token game
Token ID983
Feature activation+4.09e-02

Pos logit contributions
qv+0.004
nexus+0.004
 Defender+0.003
owered+0.003
finder+0.003
Neg logit contributions
ELY-0.003
yz-0.003
llah-0.003
ami-0.003
icans-0.003
Token a
Token ID257
Feature activation+2.63e-02

Pos logit contributions
ELY+0.027
yz+0.026
llah+0.025
ami+0.024
icans+0.024
Neg logit contributions
qv-0.027
nexus-0.026
 Defender-0.023
owered-0.022
finder-0.022
Token '
Token ID705
Feature activation-9.23e-02

Pos logit contributions
ELY+0.019
yz+0.018
llah+0.018
ami+0.017
icans+0.017
Neg logit contributions
qv-0.016
nexus-0.015
 Defender-0.013
owered-0.013
finder-0.013
TokenPain
Token ID38490
Feature activation+7.14e-02

Pos logit contributions
qv+0.063
nexus+0.062
 Defender+0.055
owered+0.054
finder+0.053
Neg logit contributions
ELY-0.057
yz-0.055
llah-0.053
ami-0.052
icans-0.051
Tokenful
Token ID913
Feature activation+3.56e-01

Pos logit contributions
ELY+0.023
yz+0.022
llah+0.021
ami+0.019
icans+0.019
Neg logit contributions
qv-0.070
nexus-0.069
 Defender-0.063
owered-0.062
finder-0.062
Token'
Token ID6
Feature activation-5.99e-03

Pos logit contributions
ELY+0.236
yz+0.231
llah+0.223
ami+0.217
icans+0.213
Neg logit contributions
qv-0.227
nexus-0.224
 Defender-0.193
owered-0.191
finder-0.189
Token 2
Token ID362
Feature activation+5.38e-03

Pos logit contributions
qv+0.004
nexus+0.004
 Defender+0.004
owered+0.004
finder+0.003
Neg logit contributions
ELY-0.004
yz-0.004
llah-0.003
ami-0.003
icans-0.003
Token.
Token ID13
Feature activation-1.32e-04

Pos logit contributions
ELY+0.003
yz+0.003
llah+0.003
ami+0.003
icans+0.003
Neg logit contributions
qv-0.004
nexus-0.004
 Defender-0.003
owered-0.003
finder-0.003
Token4
Token ID19
Feature activation-5.67e-02

Pos logit contributions
qv+0.000
nexus+0.000
 Defender+0.000
owered+0.000
finder+0.000
Neg logit contributions
ELY-0.000
yz-0.000
llah-0.000
ami-0.000
icans-0.000
Token out
Token ID503
Feature activation-7.90e-02

Pos logit contributions
qv+0.034
nexus+0.034
 Defender+0.029
owered+0.028
finder+0.028
Neg logit contributions
ELY-0.040
yz-0.039
llah-0.038
ami-0.037
icans-0.036
Token Arduino
Token ID27634
Feature activation-1.12e-01

Pos logit contributions
qv+0.022
nexus+0.021
 Defender+0.019
owered+0.018
finder+0.018
Neg logit contributions
ELY-0.022
yz-0.021
llah-0.020
ami-0.020
icans-0.019
Token has
Token ID468
Feature activation-3.76e-02

Pos logit contributions
qv+0.082
nexus+0.080
 Defender+0.071
owered+0.070
finder+0.070
Neg logit contributions
ELY-0.064
yz-0.062
llah-0.060
ami-0.057
icans-0.056
Token enabled
Token ID9343
Feature activation-1.16e-02

Pos logit contributions
qv+0.026
nexus+0.026
 Defender+0.022
owered+0.022
finder+0.022
Neg logit contributions
ELY-0.023
yz-0.022
llah-0.022
ami-0.021
icans-0.021
Token no
Token ID645
Feature activation+2.50e-02

Pos logit contributions
qv+0.008
nexus+0.008
 Defender+0.007
owered+0.006
finder+0.006
Neg logit contributions
ELY-0.007
yz-0.007
llah-0.007
ami-0.007
icans-0.007
Tokenv
Token ID85
Feature activation-1.77e-02

Pos logit contributions
ELY+0.016
yz+0.016
llah+0.015
ami+0.015
icans+0.014
Neg logit contributions
qv-0.017
nexus-0.016
 Defender-0.014
owered-0.014
finder-0.014
Tokenices
Token ID1063
Feature activation+4.04e-01

Pos logit contributions
qv+0.020
nexus+0.019
 Defender+0.018
owered+0.018
finder+0.018
Neg logit contributions
ELY-0.003
yz-0.003
llah-0.003
ami-0.002
icans-0.002
Token to
Token ID284
Feature activation-2.64e-02

Pos logit contributions
ELY+0.269
yz+0.263
llah+0.255
ami+0.248
icans+0.243
Neg logit contributions
qv-0.255
nexus-0.251
 Defender-0.218
owered-0.214
finder-0.212
Token build
Token ID1382
Feature activation+3.04e-02

Pos logit contributions
qv+0.018
nexus+0.018
 Defender+0.016
owered+0.016
finder+0.015
Neg logit contributions
ELY-0.016
yz-0.016
llah-0.015
ami-0.015
icans-0.014
Token projects
Token ID4493
Feature activation+1.21e-02

Pos logit contributions
ELY+0.019
yz+0.018
llah+0.017
ami+0.017
icans+0.017
Neg logit contributions
qv-0.021
nexus-0.021
 Defender-0.018
owered-0.018
finder-0.018
Token.
Token ID13
Feature activation+1.39e-02

Pos logit contributions
ELY+0.008
yz+0.008
llah+0.007
ami+0.007
icans+0.007
Neg logit contributions
qv-0.008
nexus-0.008
 Defender-0.007
owered-0.007
finder-0.007
Token However
Token ID2102
Feature activation+5.85e-03

Pos logit contributions
ELY+0.009
yz+0.009
llah+0.008
ami+0.008
icans+0.008
Neg logit contributions
qv-0.009
nexus-0.009
 Defender-0.008
owered-0.008
finder-0.008
Tokenished
Token ID1348
Feature activation+9.77e-02

Pos logit contributions
qv+0.025
nexus+0.025
 Defender+0.022
owered+0.022
finder+0.022
Neg logit contributions
ELY-0.020
yz-0.019
llah-0.018
ami-0.018
icans-0.017
Token with
Token ID351
Feature activation-2.00e-02

Pos logit contributions
ELY+0.063
yz+0.061
llah+0.059
ami+0.058
icans+0.056
Neg logit contributions
qv-0.064
nexus-0.063
 Defender-0.055
owered-0.054
finder-0.054
Token Bas
Token ID6455
Feature activation+5.52e-02

Pos logit contributions
qv+0.011
nexus+0.011
 Defender+0.009
owered+0.009
finder+0.009
Neg logit contributions
ELY-0.015
yz-0.014
llah-0.014
ami-0.013
icans-0.013
Tokenij
Token ID2926
Feature activation+1.35e-01

Pos logit contributions
ELY+0.023
yz+0.022
llah+0.021
ami+0.021
icans+0.020
Neg logit contributions
qv-0.048
nexus-0.048
 Defender-0.044
owered-0.043
finder-0.043
Token milit
Token ID2036
Feature activation-2.08e-01

Pos logit contributions
ELY+0.080
yz+0.078
llah+0.076
ami+0.073
icans+0.072
Neg logit contributions
qv-0.095
nexus-0.094
 Defender-0.082
owered-0.081
finder-0.080
Tokeniam
Token ID1789
Feature activation+2.85e-01

Pos logit contributions
qv+0.209
nexus+0.206
 Defender+0.189
owered+0.188
finder+0.187
Neg logit contributions
ELY-0.061
yz-0.058
llah-0.054
ami-0.049
icans-0.047
Tokenen
Token ID268
Feature activation-5.98e-02

Pos logit contributions
ELY+0.201
yz+0.197
llah+0.191
ami+0.186
icans+0.183
Neg logit contributions
qv-0.170
nexus-0.167
 Defender-0.144
owered-0.141
finder-0.139
Token,
Token ID11
Feature activation-4.68e-03

Pos logit contributions
qv+0.039
nexus+0.039
 Defender+0.034
owered+0.033
finder+0.033
Neg logit contributions
ELY-0.038
yz-0.038
llah-0.036
ami-0.035
icans-0.035
Token and
Token ID290
Feature activation-2.46e-02

Pos logit contributions
qv+0.003
nexus+0.003
 Defender+0.003
owered+0.003
finder+0.003
Neg logit contributions
ELY-0.003
yz-0.003
llah-0.003
ami-0.003
icans-0.003
Token freed
Token ID13459
Feature activation-2.54e-01

Pos logit contributions
qv+0.017
nexus+0.016
 Defender+0.014
owered+0.014
finder+0.014
Neg logit contributions
ELY-0.015
yz-0.015
llah-0.015
ami-0.014
icans-0.014
Token a
Token ID257
Feature activation-4.07e-03

Pos logit contributions
qv+0.165
nexus+0.163
 Defender+0.140
owered+0.138
finder+0.137
Neg logit contributions
ELY-0.165
yz-0.162
llah-0.157
ami-0.153
icans-0.150
Token editorial
Token ID13684
Feature activation+2.32e-02

Pos logit contributions
qv+0.020
nexus+0.020
 Defender+0.017
owered+0.017
finder+0.017
Neg logit contributions
ELY-0.019
yz-0.019
llah-0.018
ami-0.017
icans-0.017
Token last
Token ID938
Feature activation+4.15e-02

Pos logit contributions
ELY+0.015
yz+0.015
llah+0.014
ami+0.014
icans+0.014
Neg logit contributions
qv-0.015
nexus-0.015
 Defender-0.013
owered-0.013
finder-0.012
Token month
Token ID1227
Feature activation-6.24e-02

Pos logit contributions
ELY+0.029
yz+0.029
llah+0.028
ami+0.027
icans+0.027
Neg logit contributions
qv-0.025
nexus-0.024
 Defender-0.021
owered-0.020
finder-0.020
Token,
Token ID11
Feature activation-1.75e-02

Pos logit contributions
qv+0.041
nexus+0.040
 Defender+0.035
owered+0.034
finder+0.034
Neg logit contributions
ELY-0.040
yz-0.039
llah-0.038
ami-0.037
icans-0.036
Token F
Token ID376
Feature activation+9.65e-02

Pos logit contributions
qv+0.012
nexus+0.011
 Defender+0.010
owered+0.010
finder+0.010
Neg logit contributions
ELY-0.011
yz-0.011
llah-0.010
ami-0.010
icans-0.010
TokenFR
Token ID10913
Feature activation+3.10e-01

Pos logit contributions
ELY+0.027
yz+0.025
llah+0.023
ami+0.022
icans+0.021
Neg logit contributions
qv-0.099
nexus-0.098
 Defender-0.090
owered-0.089
finder-0.088
TokenF
Token ID37
Feature activation+6.29e-02

Pos logit contributions
ELY+0.197
yz+0.189
llah+0.180
ami+0.178
icans+0.173
Neg logit contributions
qv-0.209
nexus-0.206
owered-0.179
 Defender-0.178
finder-0.174
Token Staff
Token ID9983
Feature activation+7.50e-02

Pos logit contributions
ELY+0.038
yz+0.037
llah+0.036
ami+0.035
icans+0.034
Neg logit contributions
qv-0.043
nexus-0.043
 Defender-0.037
owered-0.037
finder-0.037
Token Attorney
Token ID8123
Feature activation+7.94e-02

Pos logit contributions
ELY+0.047
yz+0.046
llah+0.044
ami+0.043
icans+0.042
Neg logit contributions
qv-0.050
nexus-0.050
 Defender-0.043
owered-0.043
finder-0.042
Token Patrick
Token ID9925
Feature activation+3.29e-03

Pos logit contributions
ELY+0.049
yz+0.048
llah+0.046
ami+0.044
icans+0.044
Neg logit contributions
qv-0.054
nexus-0.054
 Defender-0.047
owered-0.046
finder-0.046
Token Elliott
Token ID22386
Feature activation+1.02e-01

Pos logit contributions
ELY+0.002
yz+0.002
llah+0.002
ami+0.002
icans+0.002
Neg logit contributions
qv-0.002
nexus-0.002
 Defender-0.002
owered-0.002
finder-0.002
Token.
Token ID13
Feature activation+2.29e-03

Pos logit contributions
ELY+0.018
yz+0.017
llah+0.017
ami+0.016
icans+0.016
Neg logit contributions
qv-0.019
nexus-0.019
 Defender-0.016
owered-0.016
finder-0.016
Token That
Token ID1320
Feature activation+1.12e-02

Pos logit contributions
ELY+0.001
yz+0.001
llah+0.001
ami+0.001
icans+0.001
Neg logit contributions
qv-0.002
nexus-0.002
owered-0.001
 Defender-0.001
finder-0.001
Token was
Token ID373
Feature activation+4.72e-02

Pos logit contributions
ELY+0.007
yz+0.007
llah+0.006
ami+0.006
icans+0.006
Neg logit contributions
qv-0.008
nexus-0.008
 Defender-0.007
owered-0.007
finder-0.007
Token Rob
Token ID3851
Feature activation-7.88e-03

Pos logit contributions
ELY+0.029
yz+0.028
llah+0.027
ami+0.026
icans+0.026
Neg logit contributions
qv-0.032
nexus-0.032
 Defender-0.028
owered-0.028
finder-0.027
Token H
Token ID367
Feature activation-8.65e-02

Pos logit contributions
qv+0.005
nexus+0.005
 Defender+0.004
owered+0.004
finder+0.004
Neg logit contributions
ELY-0.005
yz-0.005
llah-0.005
ami-0.005
icans-0.005
Tokenoe
Token ID2577
Feature activation+3.73e-01

Pos logit contributions
qv+0.063
nexus+0.062
 Defender+0.054
owered+0.054
finder+0.053
Neg logit contributions
ELY-0.050
yz-0.049
llah-0.047
ami-0.045
icans-0.044
Tokengee
Token ID29622
Feature activation-1.25e-01

Pos logit contributions
ELY+0.211
yz+0.207
llah+0.198
ami+0.191
icans+0.187
Neg logit contributions
nexus-0.271
qv-0.271
 Defender-0.238
owered-0.236
finder-0.234
Token's
Token ID338
Feature activation+2.19e-02

Pos logit contributions
qv+0.084
nexus+0.082
 Defender+0.072
owered+0.071
finder+0.070
Neg logit contributions
ELY-0.079
yz-0.077
llah-0.074
ami-0.072
icans-0.071
Token episode
Token ID4471
Feature activation-4.85e-02

Pos logit contributions
ELY+0.014
yz+0.014
llah+0.013
ami+0.013
icans+0.013
Neg logit contributions
qv-0.014
nexus-0.014
 Defender-0.012
owered-0.012
finder-0.012
Token and
Token ID290
Feature activation+3.93e-02

Pos logit contributions
qv+0.033
nexus+0.032
 Defender+0.028
owered+0.028
finder+0.027
Neg logit contributions
ELY-0.031
yz-0.030
llah-0.029
ami-0.028
icans-0.027
Token I
Token ID314
Feature activation-3.42e-02

Pos logit contributions
ELY+0.024
yz+0.023
llah+0.022
ami+0.022
icans+0.021
Neg logit contributions
qv-0.027
nexus-0.027
 Defender-0.023
owered-0.023
finder-0.023
Token,
Token ID11
Feature activation+6.18e-03

Pos logit contributions
ELY+0.099
yz+0.097
llah+0.093
ami+0.091
icans+0.089
Neg logit contributions
qv-0.099
nexus-0.098
 Defender-0.085
owered-0.084
finder-0.083
Token par
Token ID1582
Feature activation-1.28e-02

Pos logit contributions
ELY+0.004
yz+0.004
llah+0.003
ami+0.003
icans+0.003
Neg logit contributions
qv-0.004
nexus-0.004
 Defender-0.004
owered-0.004
finder-0.004
Tokenoch
Token ID5374
Feature activation+1.79e-01

Pos logit contributions
qv+0.012
nexus+0.012
 Defender+0.010
owered+0.010
finder+0.010
Neg logit contributions
ELY-0.005
yz-0.005
llah-0.005
ami-0.004
icans-0.004
Tokenial
Token ID498
Feature activation-1.05e-01

Pos logit contributions
ELY+0.132
yz+0.129
llah+0.125
ami+0.122
icans+0.121
Neg logit contributions
qv-0.100
nexus-0.098
 Defender-0.084
owered-0.082
finder-0.081
Token
Token ID564
Feature activation-4.57e-02

Pos logit contributions
qv+0.069
nexus+0.068
 Defender+0.060
owered+0.059
finder+0.058
Neg logit contributions
ELY-0.067
yz-0.065
llah-0.063
ami-0.061
icans-0.060
Token
Token ID246
Feature activation+1.25e-01

Pos logit contributions
qv+0.039
nexus+0.038
 Defender+0.035
owered+0.034
finder+0.034
Neg logit contributions
ELY-0.021
yz-0.020
llah-0.019
ami-0.018
icans-0.018
Tokenmy
Token ID1820
Feature activation+3.56e-02

Pos logit contributions
ELY+0.080
yz+0.078
llah+0.076
ami+0.073
icans+0.072
Neg logit contributions
qv-0.083
nexus-0.081
 Defender-0.071
owered-0.070
finder-0.069
Token country
Token ID1499
Feature activation+4.87e-02

Pos logit contributions
ELY+0.024
yz+0.024
llah+0.023
ami+0.022
icans+0.022
Neg logit contributions
qv-0.022
nexus-0.022
 Defender-0.019
owered-0.018
finder-0.018
Token right
Token ID826
Feature activation-3.60e-02

Pos logit contributions
ELY+0.030
yz+0.030
llah+0.029
ami+0.028
icans+0.027
Neg logit contributions
qv-0.033
nexus-0.033
 Defender-0.028
owered-0.028
finder-0.028
Token or
Token ID393
Feature activation+4.28e-02

Pos logit contributions
qv+0.024
nexus+0.024
 Defender+0.021
owered+0.021
finder+0.020
Neg logit contributions
ELY-0.023
yz-0.022
llah-0.021
ami-0.021
icans-0.020
Token wrong
Token ID2642
Feature activation+3.53e-02

Pos logit contributions
ELY+0.028
yz+0.027
llah+0.026
ami+0.026
icans+0.025
Neg logit contributions
qv-0.028
nexus-0.027
 Defender-0.024
owered-0.023
finder-0.023
Tokenincluding
Token ID8201
Feature activation-3.27e-02

Pos logit contributions
qv+0.018
nexus+0.018
 Defender+0.016
owered+0.016
finder+0.016
Neg logit contributions
ELY-0.015
yz-0.015
llah-0.014
ami-0.014
icans-0.013
Token this
Token ID428
Feature activation+7.98e-02

Pos logit contributions
qv+0.021
nexus+0.021
 Defender+0.018
owered+0.018
finder+0.018
Neg logit contributions
ELY-0.021
yz-0.021
llah-0.020
ami-0.019
icans-0.019
Token weekend
Token ID5041
Feature activation+2.24e-01

Pos logit contributions
ELY+0.051
yz+0.050
llah+0.048
ami+0.046
icans+0.046
Neg logit contributions
qv-0.053
nexus-0.052
 Defender-0.045
owered-0.045
finder-0.044
Token
Token ID447
Feature activation+2.13e-02

Pos logit contributions
ELY+0.141
yz+0.138
llah+0.133
ami+0.129
icans+0.127
Neg logit contributions
qv-0.149
nexus-0.147
 Defender-0.129
owered-0.127
finder-0.125
Token
Token ID247
Feature activation+2.31e-02

Pos logit contributions
ELY+0.013
yz+0.013
llah+0.012
ami+0.012
icans+0.012
Neg logit contributions
qv-0.015
nexus-0.014
 Defender-0.013
owered-0.012
finder-0.012
Tokens
Token ID82
Feature activation+1.99e-02

Pos logit contributions
ELY+0.016
yz+0.015
llah+0.015
ami+0.014
icans+0.014
Neg logit contributions
qv-0.015
nexus-0.014
 Defender-0.012
owered-0.012
finder-0.012
Token The
Token ID383
Feature activation+2.40e-02

Pos logit contributions
ELY+0.013
yz+0.012
llah+0.012
ami+0.012
icans+0.011
Neg logit contributions
qv-0.013
nexus-0.013
 Defender-0.011
owered-0.011
finder-0.011
Token Last
Token ID4586
Feature activation+5.88e-02

Pos logit contributions
ELY+0.016
yz+0.015
llah+0.015
ami+0.014
icans+0.014
Neg logit contributions
qv-0.016
nexus-0.015
 Defender-0.013
owered-0.013
finder-0.013
Token Jedi
Token ID16147
Feature activation+1.26e-02

Pos logit contributions
ELY+0.039
yz+0.038
llah+0.036
ami+0.035
icans+0.035
Neg logit contributions
qv-0.038
nexus-0.037
 Defender-0.033
finder-0.032
owered-0.032
Token,
Token ID11
Feature activation+3.53e-02

Pos logit contributions
ELY+0.008
yz+0.008
llah+0.008
ami+0.008
icans+0.008
Neg logit contributions
qv-0.008
nexus-0.008
 Defender-0.007
owered-0.007
finder-0.007
Token which
Token ID543
Feature activation+7.69e-02

Pos logit contributions
ELY+0.022
yz+0.021
llah+0.021
ami+0.020
icans+0.020
Neg logit contributions
qv-0.024
nexus-0.024
 Defender-0.021
owered-0.021
finder-0.020
Token our
Token ID674
Feature activation+1.77e-02

Pos logit contributions
qv+0.032
nexus+0.031
 Defender+0.027
owered+0.027
finder+0.027
Neg logit contributions
ELY-0.028
yz-0.027
llah-0.026
ami-0.026
icans-0.025
Token faith
Token ID4562
Feature activation+1.59e-01

Pos logit contributions
ELY+0.010
yz+0.010
llah+0.010
ami+0.009
icans+0.009
Neg logit contributions
qv-0.013
nexus-0.012
 Defender-0.011
owered-0.011
finder-0.011
Token outside
Token ID2354
Feature activation+8.05e-03

Pos logit contributions
ELY+0.095
yz+0.093
llah+0.090
ami+0.087
icans+0.085
Neg logit contributions
qv-0.111
nexus-0.110
 Defender-0.096
owered-0.096
finder-0.093
Token the
Token ID262
Feature activation+1.24e-02

Pos logit contributions
ELY+0.005
yz+0.005
llah+0.005
ami+0.005
icans+0.005
Neg logit contributions
qv-0.005
nexus-0.005
 Defender-0.005
owered-0.004
finder-0.004
Token four
Token ID1440
Feature activation-7.57e-02

Pos logit contributions
ELY+0.008
yz+0.008
llah+0.007
ami+0.007
icans+0.007
Neg logit contributions
qv-0.008
nexus-0.008
 Defender-0.007
owered-0.007
finder-0.007
Token walls
Token ID7714
Feature activation+4.42e-02

Pos logit contributions
qv+0.056
nexus+0.056
 Defender+0.049
owered+0.049
finder+0.048
Neg logit contributions
ELY-0.042
yz-0.041
llah-0.039
ami-0.038
icans-0.037
Token.
Token ID13
Feature activation+3.28e-03

Pos logit contributions
ELY+0.029
yz+0.028
llah+0.027
ami+0.026
icans+0.026
Neg logit contributions
qv-0.029
nexus-0.028
 Defender-0.025
owered-0.024
finder-0.024
Token
Token ID447
Feature activation+2.70e-02

Pos logit contributions
ELY+0.002
yz+0.002
llah+0.002
ami+0.002
icans+0.002
Neg logit contributions
qv-0.002
nexus-0.002
 Defender-0.002
owered-0.002
finder-0.002
Token
Token ID247
Feature activation+2.73e-02

Pos logit contributions
ELY+0.018
yz+0.017
llah+0.017
ami+0.016
icans+0.016
Neg logit contributions
qv-0.018
nexus-0.017
 Defender-0.015
owered-0.015
finder-0.015
Token For
Token ID1114
Feature activation-2.85e-02

Pos logit contributions
ELY+0.017
yz+0.017
llah+0.016
ami+0.016
icans+0.016
Neg logit contributions
qv-0.018
nexus-0.018
 Defender-0.016
owered-0.015
finder-0.015
Token a
Token ID257
Feature activation+3.24e-02

Pos logit contributions
qv+0.019
nexus+0.019
 Defender+0.016
owered+0.016
finder+0.016
Neg logit contributions
ELY-0.018
yz-0.017
llah-0.017
ami-0.016
icans-0.016
Token but
Token ID475
Feature activation+4.69e-02

Pos logit contributions
ELY+0.007
yz+0.007
llah+0.006
ami+0.006
icans+0.006
Neg logit contributions
qv-0.008
nexus-0.008
 Defender-0.007
owered-0.007
finder-0.006
Token a
Token ID257
Feature activation+2.11e-02

Pos logit contributions
ELY+0.030
yz+0.029
llah+0.028
ami+0.027
icans+0.027
Neg logit contributions
qv-0.031
nexus-0.031
 Defender-0.027
owered-0.027
finder-0.026
Token lot
Token ID1256
Feature activation+2.36e-01

Pos logit contributions
ELY+0.011
yz+0.010
llah+0.010
ami+0.009
icans+0.009
Neg logit contributions
qv-0.017
nexus-0.017
 Defender-0.015
owered-0.015
finder-0.015
Token happens
Token ID4325
Feature activation-1.06e-01

Pos logit contributions
ELY+0.159
yz+0.156
llah+0.151
ami+0.146
icans+0.144
Neg logit contributions
qv-0.148
nexus-0.145
 Defender-0.126
owered-0.124
finder-0.123
Token:
Token ID25
Feature activation-4.33e-02

Pos logit contributions
qv+0.071
nexus+0.070
 Defender+0.061
owered+0.060
finder+0.059
Neg logit contributions
ELY-0.067
yz-0.065
llah-0.063
ami-0.061
icans-0.060
Token the
Token ID262
Feature activation+2.19e-02

Pos logit contributions
qv+0.028
nexus+0.028
 Defender+0.024
owered+0.024
finder+0.024
Neg logit contributions
ELY-0.028
yz-0.027
llah-0.026
ami-0.025
icans-0.025
Token characters
Token ID3435
Feature activation+3.35e-02

Pos logit contributions
ELY+0.012
yz+0.012
llah+0.012
ami+0.011
icans+0.011
Neg logit contributions
qv-0.016
nexus-0.016
 Defender-0.014
owered-0.014
finder-0.014
Token go
Token ID467
Feature activation+2.97e-02

Pos logit contributions
ELY+0.020
yz+0.020
llah+0.019
ami+0.018
icans+0.018
Neg logit contributions
qv-0.023
nexus-0.023
 Defender-0.020
owered-0.020
finder-0.020
Token to
Token ID284
Feature activation+3.04e-02

Pos logit contributions
ELY+0.018
yz+0.018
llah+0.017
ami+0.017
icans+0.016
Neg logit contributions
qv-0.020
nexus-0.020
 Defender-0.018
owered-0.017
finder-0.017
Token a
Token ID257
Feature activation+2.97e-02

Pos logit contributions
ELY+0.020
yz+0.019
llah+0.018
ami+0.018
icans+0.018
Neg logit contributions
qv-0.020
nexus-0.020
 Defender-0.017
owered-0.017
finder-0.017
Token lot
Token ID1256
Feature activation+2.37e-01

Pos logit contributions
ELY+0.019
yz+0.019
llah+0.018
ami+0.018
icans+0.017
Neg logit contributions
qv-0.019
nexus-0.019
 Defender-0.017
owered-0.016
finder-0.016
Token (@
Token ID4275
Feature activation+9.61e-02

Pos logit contributions
qv+0.069
nexus+0.068
 Defender+0.060
owered+0.059
finder+0.058
Neg logit contributions
ELY-0.058
yz-0.056
llah-0.054
ami-0.052
icans-0.051
Tokenr
Token ID81
Feature activation-2.84e-02

Pos logit contributions
ELY+0.062
yz+0.060
llah+0.059
ami+0.057
icans+0.056
Neg logit contributions
qv-0.063
nexus-0.062
 Defender-0.054
owered-0.053
finder-0.053
Tokenmath
Token ID11018
Feature activation+1.07e-01

Pos logit contributions
qv+0.020
nexus+0.020
 Defender+0.017
owered+0.017
finder+0.017
Neg logit contributions
ELY-0.017
yz-0.016
llah-0.016
ami-0.015
icans-0.015
Tokenis
Token ID271
Feature activation-9.35e-02

Pos logit contributions
ELY+0.076
yz+0.074
llah+0.072
ami+0.070
icans+0.069
Neg logit contributions
qv-0.063
nexus-0.062
 Defender-0.053
owered-0.052
finder-0.051
Token_
Token ID62
Feature activation-1.38e-01

Pos logit contributions
qv+0.064
nexus+0.063
 Defender+0.055
owered+0.054
finder+0.054
Neg logit contributions
ELY-0.058
yz-0.056
llah-0.054
ami-0.053
icans-0.052
Tokenthe
Token ID1169
Feature activation+4.87e-02

Pos logit contributions
qv+0.093
nexus+0.091
 Defender+0.080
owered+0.078
finder+0.078
Neg logit contributions
ELY-0.087
yz-0.084
llah-0.082
ami-0.079
icans-0.078
Token1
Token ID16
Feature activation+3.17e-02

Pos logit contributions
ELY+0.036
yz+0.035
llah+0.034
ami+0.033
icans+0.033
Neg logit contributions
qv-0.027
nexus-0.027
 Defender-0.023
owered-0.022
finder-0.022
Tokenst
Token ID301
Feature activation-5.19e-03

Pos logit contributions
ELY+0.020
yz+0.019
llah+0.019
ami+0.018
icans+0.018
Neg logit contributions
qv-0.022
nexus-0.021
 Defender-0.019
owered-0.018
finder-0.018
Token)
Token ID8
Feature activation-1.69e-02

Pos logit contributions
qv+0.004
nexus+0.004
 Defender+0.003
owered+0.003
finder+0.003
Neg logit contributions
ELY-0.003
yz-0.003
llah-0.003
ami-0.003
icans-0.003
Token on
Token ID319
Feature activation+1.18e-02

Pos logit contributions
qv+0.012
nexus+0.012
 Defender+0.010
owered+0.010
finder+0.010
Neg logit contributions
ELY-0.010
yz-0.010
llah-0.009
ami-0.009
icans-0.009
Token Dec
Token ID4280
Feature activation+1.45e-01

Pos logit contributions
ELY+0.007
yz+0.007
llah+0.007
ami+0.007
icans+0.007
Neg logit contributions
qv-0.008
nexus-0.008
 Defender-0.007
owered-0.007
finder-0.007
Token may
Token ID743
Feature activation+7.55e-02

Pos logit contributions
ELY+0.031
yz+0.030
llah+0.029
ami+0.029
icans+0.028
Neg logit contributions
qv-0.035
nexus-0.034
 Defender-0.030
owered-0.030
finder-0.029
Token have
Token ID423
Feature activation+2.11e-02

Pos logit contributions
ELY+0.045
yz+0.044
llah+0.043
ami+0.041
icans+0.040
Neg logit contributions
qv-0.053
nexus-0.052
 Defender-0.046
owered-0.045
finder-0.045
Token fallen
Token ID9292
Feature activation+1.38e-01

Pos logit contributions
ELY+0.011
yz+0.011
llah+0.010
ami+0.010
icans+0.010
Neg logit contributions
qv-0.016
nexus-0.016
 Defender-0.014
owered-0.014
finder-0.014
Token,
Token ID11
Feature activation+3.91e-03

Pos logit contributions
ELY+0.093
yz+0.091
llah+0.088
ami+0.086
icans+0.084
Neg logit contributions
qv-0.086
nexus-0.085
 Defender-0.073
owered-0.072
finder-0.071
Token they
Token ID484
Feature activation-2.06e-02

Pos logit contributions
ELY+0.002
yz+0.002
llah+0.002
ami+0.002
icans+0.002
Neg logit contributions
qv-0.003
nexus-0.003
 Defender-0.002
owered-0.002
finder-0.002
Token were
Token ID547
Feature activation-5.38e-02

Pos logit contributions
qv+0.013
nexus+0.013
 Defender+0.011
owered+0.011
finder+0.011
Neg logit contributions
ELY-0.014
yz-0.013
llah-0.013
ami-0.013
icans-0.012
Token unable
Token ID5906
Feature activation+8.87e-02

Pos logit contributions
qv+0.039
nexus+0.039
 Defender+0.034
owered+0.034
finder+0.034
Neg logit contributions
ELY-0.031
yz-0.030
llah-0.029
ami-0.028
icans-0.027
Token to
Token ID284
Feature activation-3.07e-02

Pos logit contributions
ELY+0.058
yz+0.058
llah+0.056
ami+0.054
icans+0.053
Neg logit contributions
qv-0.056
nexus-0.056
 Defender-0.048
owered-0.048
finder-0.047
Token find
Token ID1064
Feature activation-5.37e-02

Pos logit contributions
qv+0.017
nexus+0.017
 Defender+0.014
owered+0.014
finder+0.013
Neg logit contributions
ELY-0.023
yz-0.023
llah-0.022
ami-0.021
icans-0.021
Token it
Token ID340
Feature activation+6.18e-03

Pos logit contributions
qv+0.033
nexus+0.033
 Defender+0.028
owered+0.028
finder+0.027
Neg logit contributions
ELY-0.037
yz-0.036
llah-0.035
ami-0.034
icans-0.033
Token.
Token ID13
Feature activation-7.38e-03

Pos logit contributions
ELY+0.004
yz+0.004
llah+0.004
ami+0.003
icans+0.003
Neg logit contributions
qv-0.004
nexus-0.004
 Defender-0.004
owered-0.004
finder-0.004
Token asserted
Token ID21635
Feature activation-5.61e-02

Pos logit contributions
qv+0.024
nexus+0.024
 Defender+0.021
owered+0.021
finder+0.020
Neg logit contributions
ELY-0.022
yz-0.022
llah-0.021
ami-0.021
icans-0.020
Token the
Token ID262
Feature activation-2.06e-02

Pos logit contributions
qv+0.037
nexus+0.037
 Defender+0.032
owered+0.032
finder+0.031
Neg logit contributions
ELY-0.036
yz-0.035
llah-0.034
ami-0.033
icans-0.032
Token right
Token ID826
Feature activation-9.62e-02

Pos logit contributions
qv+0.011
nexus+0.011
 Defender+0.009
owered+0.009
finder+0.009
Neg logit contributions
ELY-0.016
yz-0.015
llah-0.015
ami-0.014
icans-0.014
Token to
Token ID284
Feature activation+1.35e-05

Pos logit contributions
qv+0.059
nexus+0.058
 Defender+0.050
owered+0.049
finder+0.049
Neg logit contributions
ELY-0.066
yz-0.065
llah-0.063
ami-0.061
icans-0.060
Token pick
Token ID2298
Feature activation-3.90e-02

Pos logit contributions
ELY+0.000
yz+0.000
llah+0.000
ami+0.000
icans+0.000
Neg logit contributions
qv-0.000
nexus-0.000
 Defender-0.000
owered-0.000
finder-0.000
Token and
Token ID290
Feature activation+7.08e-03

Pos logit contributions
qv+0.025
nexus+0.025
 Defender+0.022
owered+0.021
finder+0.021
Neg logit contributions
ELY-0.025
yz-0.025
llah-0.024
ami-0.023
icans-0.023
Token choose
Token ID3853
Feature activation-4.04e-03

Pos logit contributions
ELY+0.005
yz+0.005
llah+0.005
ami+0.004
icans+0.004
Neg logit contributions
qv-0.004
nexus-0.004
 Defender-0.004
owered-0.004
finder-0.004
Token what
Token ID644
Feature activation-1.83e-02

Pos logit contributions
qv+0.003
nexus+0.003
 Defender+0.002
owered+0.002
finder+0.002
Neg logit contributions
ELY-0.003
yz-0.003
llah-0.002
ami-0.002
icans-0.002
Token laws
Token ID3657
Feature activation+7.76e-03

Pos logit contributions
qv+0.013
nexus+0.013
 Defender+0.011
owered+0.011
finder+0.011
Neg logit contributions
ELY-0.011
yz-0.010
llah-0.010
ami-0.010
icans-0.009
Token to
Token ID284
Feature activation+1.58e-02

Pos logit contributions
ELY+0.005
yz+0.005
llah+0.004
ami+0.004
icans+0.004
Neg logit contributions
qv-0.005
nexus-0.005
 Defender-0.005
finder-0.005
owered-0.005
Token follow
Token ID1061
Feature activation+2.75e-02

Pos logit contributions
ELY+0.010
yz+0.010
llah+0.009
ami+0.009
icans+0.009
Neg logit contributions
qv-0.011
nexus-0.010
 Defender-0.009
owered-0.009
finder-0.009
Token.
Token ID13
Feature activation+5.06e-02

Pos logit contributions
ELY+0.067
yz+0.066
llah+0.063
ami+0.061
icans+0.060
Neg logit contributions
qv-0.068
nexus-0.067
 Defender-0.058
owered-0.057
finder-0.057
Token The
Token ID383
Feature activation+4.63e-02

Pos logit contributions
ELY+0.033
yz+0.032
llah+0.031
ami+0.030
icans+0.029
Neg logit contributions
qv-0.033
nexus-0.033
 Defender-0.028
owered-0.028
finder-0.028
Token top
Token ID1353
Feature activation+1.23e-01

Pos logit contributions
ELY+0.029
yz+0.028
llah+0.027
ami+0.026
icans+0.026
Neg logit contributions
qv-0.031
nexus-0.031
 Defender-0.027
owered-0.027
finder-0.026
Token eight
Token ID3624
Feature activation-8.72e-02

Pos logit contributions
ELY+0.083
yz+0.082
llah+0.079
ami+0.077
icans+0.076
Neg logit contributions
qv-0.077
nexus-0.075
 Defender-0.065
owered-0.064
finder-0.063
Token airports
Token ID22600
Feature activation+3.24e-03

Pos logit contributions
qv+0.062
nexus+0.061
 Defender+0.054
owered+0.053
finder+0.052
Neg logit contributions
ELY-0.052
yz-0.050
llah-0.049
ami-0.047
icans-0.046
Token commanded
Token ID22419
Feature activation-6.43e-02

Pos logit contributions
ELY+0.002
yz+0.002
llah+0.002
ami+0.002
icans+0.002
Neg logit contributions
qv-0.002
nexus-0.002
 Defender-0.002
owered-0.002
finder-0.002
Token about
Token ID546
Feature activation-4.33e-02

Pos logit contributions
qv+0.041
nexus+0.041
 Defender+0.036
owered+0.035
finder+0.035
Neg logit contributions
ELY-0.042
yz-0.041
llah-0.040
ami-0.039
icans-0.038
Token half
Token ID2063
Feature activation-1.02e-01

Pos logit contributions
qv+0.027
nexus+0.026
 Defender+0.023
owered+0.022
finder+0.022
Neg logit contributions
ELY-0.030
yz-0.029
llah-0.028
ami-0.027
icans-0.027
Token of
Token ID286
Feature activation+3.43e-03

Pos logit contributions
qv+0.066
nexus+0.065
 Defender+0.057
owered+0.056
finder+0.056
Neg logit contributions
ELY-0.067
yz-0.065
llah-0.063
ami-0.061
icans-0.060
Token all
Token ID477
Feature activation-8.90e-03

Pos logit contributions
ELY+0.002
yz+0.002
llah+0.002
ami+0.002
icans+0.002
Neg logit contributions
qv-0.002
nexus-0.002
 Defender-0.002
owered-0.002
finder-0.002
Token passenger
Token ID11849
Feature activation+6.79e-02

Pos logit contributions
qv+0.006
nexus+0.006
 Defender+0.005
owered+0.005
finder+0.005
Neg logit contributions
ELY-0.006
yz-0.005
llah-0.005
ami-0.005
icans-0.005
Token They
Token ID1119
Feature activation-5.05e-02

Pos logit contributions
qv+0.023
nexus+0.023
 Defender+0.020
owered+0.020
finder+0.020
Neg logit contributions
ELY-0.022
yz-0.021
llah-0.021
ami-0.020
icans-0.020
Token had
Token ID550
Feature activation-3.55e-02

Pos logit contributions
qv+0.034
nexus+0.033
 Defender+0.029
owered+0.029
finder+0.028
Neg logit contributions
ELY-0.032
yz-0.031
llah-0.030
ami-0.029
icans-0.029
Token played
Token ID2826
Feature activation-2.21e-01

Pos logit contributions
qv+0.023
nexus+0.023
 Defender+0.020
owered+0.020
finder+0.020
Neg logit contributions
ELY-0.023
yz-0.022
llah-0.021
ami-0.021
icans-0.020
Token their
Token ID511
Feature activation-1.07e-01

Pos logit contributions
qv+0.145
nexus+0.143
 Defender+0.124
owered+0.123
finder+0.121
Neg logit contributions
ELY-0.142
yz-0.138
llah-0.134
ami-0.130
icans-0.128
Token characters
Token ID3435
Feature activation-2.68e-02

Pos logit contributions
qv+0.070
nexus+0.069
 Defender+0.060
owered+0.059
finder+0.058
Neg logit contributions
ELY-0.069
yz-0.068
llah-0.065
ami-0.064
icans-0.063
Token for
Token ID329
Feature activation-2.11e-02

Pos logit contributions
qv+0.017
nexus+0.017
 Defender+0.015
owered+0.015
finder+0.014
Neg logit contributions
ELY-0.018
yz-0.017
llah-0.017
ami-0.016
icans-0.016
Token six
Token ID2237
Feature activation-1.36e-01

Pos logit contributions
qv+0.013
nexus+0.013
 Defender+0.011
owered+0.011
finder+0.011
Neg logit contributions
ELY-0.014
yz-0.014
llah-0.014
ami-0.013
icans-0.013
Token years
Token ID812
Feature activation-3.55e-02

Pos logit contributions
qv+0.078
nexus+0.077
 Defender+0.065
owered+0.064
finder+0.063
Neg logit contributions
ELY-0.099
yz-0.097
llah-0.094
ami-0.092
icans-0.090
Token and
Token ID290
Feature activation-1.47e-02

Pos logit contributions
qv+0.023
nexus+0.023
 Defender+0.020
owered+0.019
finder+0.019
Neg logit contributions
ELY-0.023
yz-0.023
llah-0.022
ami-0.021
icans-0.021
Token I
Token ID314
Feature activation-7.43e-02

Pos logit contributions
qv+0.010
nexus+0.010
 Defender+0.009
owered+0.009
finder+0.009
Neg logit contributions
ELY-0.009
yz-0.009
llah-0.008
ami-0.008
icans-0.008
Token was
Token ID373
Feature activation+9.90e-03

Pos logit contributions
qv+0.051
nexus+0.050
 Defender+0.044
owered+0.044
finder+0.043
Neg logit contributions
ELY-0.045
yz-0.044
llah-0.043
ami-0.041
icans-0.040
Token and
Token ID290
Feature activation-2.48e-02

Pos logit contributions
qv+0.009
nexus+0.009
 Defender+0.008
owered+0.008
finder+0.008
Neg logit contributions
ELY-0.008
yz-0.008
llah-0.008
ami-0.007
icans-0.007
Token they
Token ID484
Feature activation-2.42e-02

Pos logit contributions
qv+0.017
nexus+0.016
 Defender+0.014
owered+0.014
finder+0.014
Neg logit contributions
ELY-0.015
yz-0.015
llah-0.015
ami-0.014
icans-0.014
Token lack
Token ID3092
Feature activation+1.63e-02

Pos logit contributions
qv+0.017
nexus+0.017
 Defender+0.015
owered+0.015
finder+0.015
Neg logit contributions
ELY-0.014
yz-0.014
llah-0.013
ami-0.013
icans-0.013
Token the
Token ID262
Feature activation-3.74e-02

Pos logit contributions
ELY+0.011
yz+0.010
llah+0.010
ami+0.010
icans+0.009
Neg logit contributions
qv-0.011
nexus-0.011
 Defender-0.009
owered-0.009
finder-0.009
Token lead
Token ID1085
Feature activation+5.79e-02

Pos logit contributions
qv+0.025
nexus+0.025
 Defender+0.021
owered+0.021
finder+0.021
Neg logit contributions
ELY-0.024
yz-0.023
llah-0.022
ami-0.022
icans-0.021
Token-
Token ID12
Feature activation-4.62e-02

Pos logit contributions
ELY+0.036
yz+0.035
llah+0.034
ami+0.033
icans+0.032
Neg logit contributions
qv-0.039
nexus-0.038
 Defender-0.034
owered-0.033
finder-0.033
Tokenin
Token ID259
Feature activation-1.69e-01

Pos logit contributions
qv+0.028
nexus+0.028
 Defender+0.024
owered+0.024
finder+0.023
Neg logit contributions
ELY-0.032
yz-0.031
llah-0.030
ami-0.029
icans-0.029
Token character
Token ID2095
Feature activation-3.58e-03

Pos logit contributions
qv+0.115
nexus+0.113
 Defender+0.099
owered+0.098
finder+0.097
Neg logit contributions
ELY-0.105
yz-0.103
llah-0.099
ami-0.096
icans-0.094
Token that
Token ID326
Feature activation-1.31e-02

Pos logit contributions
qv+0.002
nexus+0.002
 Defender+0.002
owered+0.002
finder+0.002
Neg logit contributions
ELY-0.002
yz-0.002
llah-0.002
ami-0.002
icans-0.002
Token indicates
Token ID9217
Feature activation-4.45e-02

Pos logit contributions
qv+0.009
nexus+0.009
 Defender+0.008
owered+0.008
finder+0.008
Neg logit contributions
ELY-0.008
yz-0.008
llah-0.008
ami-0.007
icans-0.007
Token that
Token ID326
Feature activation-7.33e-02

Pos logit contributions
qv+0.029
nexus+0.029
 Defender+0.025
owered+0.025
finder+0.024
Neg logit contributions
ELY-0.029
yz-0.028
llah-0.027
ami-0.026
icans-0.026
Token hardware
Token ID6890
Feature activation-7.69e-03

Pos logit contributions
qv+0.013
nexus+0.013
 Defender+0.011
owered+0.011
finder+0.011
Neg logit contributions
ELY-0.013
yz-0.013
llah-0.012
ami-0.012
icans-0.012
Token.
Token ID13
Feature activation+4.97e-03

Pos logit contributions
qv+0.005
nexus+0.005
 Defender+0.004
owered+0.004
finder+0.004
Neg logit contributions
ELY-0.005
yz-0.005
llah-0.005
ami-0.005
icans-0.005
Token But
Token ID887
Feature activation+1.11e-02

Pos logit contributions
ELY+0.003
yz+0.003
llah+0.003
ami+0.003
icans+0.003
Neg logit contributions
qv-0.003
nexus-0.003
 Defender-0.003
owered-0.003
finder-0.003
Token not
Token ID407
Feature activation-4.82e-02

Pos logit contributions
ELY+0.007
yz+0.007
llah+0.007
ami+0.006
icans+0.006
Neg logit contributions
qv-0.007
nexus-0.007
 Defender-0.006
owered-0.006
finder-0.006
Token all
Token ID477
Feature activation-7.04e-02

Pos logit contributions
qv+0.032
nexus+0.031
 Defender+0.027
owered+0.027
finder+0.027
Neg logit contributions
ELY-0.031
yz-0.030
llah-0.029
ami-0.028
icans-0.028
Token workload
Token ID26211
Feature activation-3.02e-01

Pos logit contributions
qv+0.044
nexus+0.044
 Defender+0.038
owered+0.037
finder+0.037
Neg logit contributions
ELY-0.047
yz-0.046
llah-0.044
ami-0.043
icans-0.043
Tokens
Token ID82
Feature activation+5.31e-02

Pos logit contributions
qv+0.194
nexus+0.192
 Defender+0.167
owered+0.164
finder+0.162
Neg logit contributions
ELY-0.198
yz-0.193
llah-0.186
ami-0.181
icans-0.179
Token:
Token ID25
Feature activation-3.57e-02

Pos logit contributions
ELY+0.032
yz+0.032
llah+0.031
ami+0.030
icans+0.029
Neg logit contributions
qv-0.036
nexus-0.036
 Defender-0.032
owered-0.031
finder-0.031
Token some
Token ID617
Feature activation-1.50e-01

Pos logit contributions
qv+0.023
nexus+0.023
 Defender+0.019
owered+0.019
finder+0.019
Neg logit contributions
ELY-0.023
yz-0.023
llah-0.022
ami-0.022
icans-0.021
Token are
Token ID389
Feature activation-1.88e-03

Pos logit contributions
qv+0.088
nexus+0.087
 Defender+0.074
owered+0.073
finder+0.072
Neg logit contributions
ELY-0.107
yz-0.104
llah-0.101
ami-0.098
icans-0.097
Token network
Token ID3127
Feature activation+2.68e-02

Pos logit contributions
qv+0.001
nexus+0.001
 Defender+0.001
owered+0.001
finder+0.001
Neg logit contributions
ELY-0.001
yz-0.001
llah-0.001
ami-0.001
icans-0.001
Token forced
Token ID4137
Feature activation+5.77e-02

Pos logit contributions
ELY+0.009
yz+0.008
llah+0.008
ami+0.008
icans+0.008
Neg logit contributions
qv-0.008
nexus-0.008
 Defender-0.007
owered-0.007
finder-0.007
Token to
Token ID284
Feature activation-1.61e-02

Pos logit contributions
ELY+0.039
yz+0.038
llah+0.037
ami+0.036
icans+0.036
Neg logit contributions
qv-0.036
nexus-0.035
 Defender-0.030
owered-0.030
finder-0.030
Token rely
Token ID8814
Feature activation+2.33e-02

Pos logit contributions
qv+0.010
nexus+0.010
 Defender+0.009
owered+0.009
finder+0.008
Neg logit contributions
ELY-0.011
yz-0.010
llah-0.010
ami-0.010
icans-0.010
Token on
Token ID319
Feature activation-2.73e-02

Pos logit contributions
ELY+0.015
yz+0.014
llah+0.014
ami+0.014
icans+0.013
Neg logit contributions
qv-0.015
nexus-0.015
 Defender-0.013
owered-0.013
finder-0.013
Token stored
Token ID8574
Feature activation-8.69e-02

Pos logit contributions
qv+0.020
nexus+0.019
 Defender+0.017
owered+0.017
finder+0.017
Neg logit contributions
ELY-0.016
yz-0.015
llah-0.015
ami-0.014
icans-0.014
Token body
Token ID1767
Feature activation-2.80e-01

Pos logit contributions
qv+0.058
nexus+0.057
 Defender+0.049
owered+0.049
finder+0.048
Neg logit contributions
ELY-0.055
yz-0.054
llah-0.052
ami-0.051
icans-0.050
Token fat
Token ID3735
Feature activation+1.36e-02

Pos logit contributions
qv+0.200
nexus+0.197
 Defender+0.174
owered+0.171
finder+0.169
Neg logit contributions
ELY-0.164
yz-0.159
llah-0.154
ami-0.149
icans-0.146
Token for
Token ID329
Feature activation-1.44e-02

Pos logit contributions
ELY+0.009
yz+0.009
llah+0.008
ami+0.008
icans+0.008
Neg logit contributions
qv-0.009
nexus-0.009
 Defender-0.008
owered-0.007
finder-0.007
Token fuel
Token ID5252
Feature activation+1.29e-01

Pos logit contributions
qv+0.012
nexus+0.012
 Defender+0.011
owered+0.011
finder+0.011
Neg logit contributions
ELY-0.007
yz-0.006
llah-0.006
ami-0.006
icans-0.006
Token.
Token ID13
Feature activation-4.37e-02

Pos logit contributions
ELY+0.080
yz+0.077
llah+0.075
ami+0.072
icans+0.071
Neg logit contributions
qv-0.088
nexus-0.087
 Defender-0.076
owered-0.075
finder-0.074
Token You
Token ID921
Feature activation-4.19e-02

Pos logit contributions
qv+0.029
nexus+0.028
 Defender+0.025
owered+0.024
finder+0.024
Neg logit contributions
ELY-0.028
yz-0.027
llah-0.026
ami-0.026
icans-0.025
Token marketing
Token ID7124
Feature activation-6.90e-02

Pos logit contributions
ELY+0.070
yz+0.068
llah+0.066
ami+0.064
icans+0.063
Neg logit contributions
qv-0.077
nexus-0.076
 Defender-0.066
owered-0.066
finder-0.065
Token campaign
Token ID1923
Feature activation-1.50e-01

Pos logit contributions
qv+0.044
nexus+0.043
 Defender+0.038
owered+0.037
finder+0.037
Neg logit contributions
ELY-0.046
yz-0.045
llah-0.043
ami-0.042
icans-0.041
Token.
Token ID13
Feature activation+3.15e-03

Pos logit contributions
qv+0.100
nexus+0.099
 Defender+0.086
owered+0.085
finder+0.084
Neg logit contributions
ELY-0.095
yz-0.093
llah-0.089
ami-0.087
icans-0.085
Token Instead
Token ID5455
Feature activation-2.37e-02

Pos logit contributions
ELY+0.002
yz+0.002
llah+0.002
ami+0.002
icans+0.002
Neg logit contributions
qv-0.002
nexus-0.002
owered-0.002
 Defender-0.002
finder-0.002
Token of
Token ID286
Feature activation-2.03e-02

Pos logit contributions
qv+0.016
nexus+0.016
 Defender+0.014
owered+0.013
finder+0.013
Neg logit contributions
ELY-0.015
yz-0.015
llah-0.014
ami-0.014
icans-0.014
TokenTechn
Token ID25574
Feature activation-3.09e-01

Pos logit contributions
qv+0.014
nexus+0.014
 Defender+0.012
owered+0.012
finder+0.012
Neg logit contributions
ELY-0.012
yz-0.012
llah-0.012
ami-0.011
icans-0.011
Tokenicol
Token ID27045
Feature activation-3.76e-03

Pos logit contributions
qv+0.207
nexus+0.204
 Defender+0.178
owered+0.176
finder+0.174
Neg logit contributions
ELY-0.194
yz-0.189
llah-0.183
ami-0.177
icans-0.175
Tokenor
Token ID273
Feature activation+1.89e-01

Pos logit contributions
qv+0.003
nexus+0.003
 Defender+0.003
owered+0.003
finder+0.003
Neg logit contributions
ELY-0.002
yz-0.002
llah-0.002
ami-0.002
icans-0.002
Token,
Token ID11
Feature activation+8.08e-03

Pos logit contributions
ELY+0.121
yz+0.118
llah+0.114
ami+0.111
icans+0.109
Neg logit contributions
qv-0.125
nexus-0.123
 Defender-0.107
owered-0.106
finder-0.105
Token Hitchcock
Token ID49817
Feature activation-1.30e-01

Pos logit contributions
ELY+0.005
yz+0.005
llah+0.005
ami+0.005
icans+0.005
Neg logit contributions
qv-0.005
nexus-0.005
 Defender-0.005
owered-0.005
finder-0.005
Token shot
Token ID2823
Feature activation-2.28e-02

Pos logit contributions
qv+0.087
nexus+0.085
 Defender+0.074
owered+0.074
finder+0.073
Neg logit contributions
ELY-0.082
yz-0.080
llah-0.077
ami-0.075
icans-0.073
Token official
Token ID1743
Feature activation-3.97e-02

Pos logit contributions
ELY+0.125
yz+0.122
llah+0.118
ami+0.114
icans+0.112
Neg logit contributions
qv-0.129
nexus-0.127
 Defender-0.111
owered-0.110
finder-0.108
Token of
Token ID286
Feature activation-2.11e-02

Pos logit contributions
qv+0.026
nexus+0.026
 Defender+0.022
owered+0.022
finder+0.022
Neg logit contributions
ELY-0.026
yz-0.025
llah-0.024
ami-0.023
icans-0.023
Token the
Token ID262
Feature activation-1.56e-02

Pos logit contributions
qv+0.015
nexus+0.015
 Defender+0.013
owered+0.013
finder+0.013
Neg logit contributions
ELY-0.012
yz-0.012
llah-0.011
ami-0.011
icans-0.011
Token Sund
Token ID3309
Feature activation+2.39e-01

Pos logit contributions
qv+0.012
nexus+0.012
 Defender+0.010
owered+0.010
finder+0.010
Neg logit contributions
ELY-0.008
yz-0.008
llah-0.008
ami-0.007
icans-0.007
Tokenarb
Token ID38039
Feature activation-2.12e-02

Pos logit contributions
ELY+0.137
yz+0.134
llah+0.129
ami+0.126
icans+0.122
Neg logit contributions
qv-0.172
nexus-0.170
 Defender-0.151
owered-0.148
finder-0.147
Tokenans
Token ID504
Feature activation-2.51e-01

Pos logit contributions
qv+0.009
nexus+0.009
 Defender+0.007
owered+0.007
finder+0.007
Neg logit contributions
ELY-0.019
yz-0.018
llah-0.018
ami-0.018
icans-0.017
Token in
Token ID287
Feature activation+6.17e-03

Pos logit contributions
qv+0.167
nexus+0.164
 Defender+0.143
owered+0.141
finder+0.140
Neg logit contributions
ELY-0.159
yz-0.155
llah-0.150
ami-0.145
icans-0.143
Token Bangladesh
Token ID19483
Feature activation+2.16e-01

Pos logit contributions
ELY+0.003
yz+0.003
llah+0.003
ami+0.003
icans+0.003
Neg logit contributions
qv-0.005
nexus-0.004
 Defender-0.004
owered-0.004
finder-0.004
Token,
Token ID11
Feature activation-3.71e-03

Pos logit contributions
ELY+0.137
yz+0.134
llah+0.130
ami+0.126
icans+0.124
Neg logit contributions
qv-0.143
nexus-0.141
 Defender-0.123
owered-0.121
finder-0.120
Token told
Token ID1297
Feature activation+5.96e-02

Pos logit contributions
qv+0.003
nexus+0.003
 Defender+0.002
owered+0.002
finder+0.002
Neg logit contributions
ELY-0.002
yz-0.002
llah-0.002
ami-0.002
icans-0.002
Token reporters
Token ID7638
Feature activation-6.12e-02

Pos logit contributions
ELY+0.035
yz+0.034
llah+0.033
ami+0.032
icans+0.031
Neg logit contributions
qv-0.042
nexus-0.042
 Defender-0.037
owered-0.036
finder-0.036
Token remake
Token ID28763
Feature activation+1.11e-01

Pos logit contributions
ELY+0.041
yz+0.040
llah+0.039
ami+0.038
icans+0.037
Neg logit contributions
qv-0.038
nexus-0.038
 Defender-0.032
owered-0.032
finder-0.032
Token of
Token ID286
Feature activation-4.31e-03

Pos logit contributions
ELY+0.065
yz+0.063
llah+0.060
ami+0.060
icans+0.058
Neg logit contributions
qv-0.078
nexus-0.077
 Defender-0.067
owered-0.067
finder-0.066
Token my
Token ID616
Feature activation-6.37e-03

Pos logit contributions
qv+0.003
nexus+0.003
 Defender+0.002
owered+0.002
finder+0.002
Neg logit contributions
ELY-0.003
yz-0.003
llah-0.003
ami-0.003
icans-0.002
Token X
Token ID1395
Feature activation-2.05e-01

Pos logit contributions
qv+0.004
nexus+0.004
 Defender+0.004
owered+0.004
finder+0.004
Neg logit contributions
ELY-0.004
yz-0.004
llah-0.004
ami-0.004
icans-0.004
Token-
Token ID12
Feature activation+8.57e-03

Pos logit contributions
qv+0.142
nexus+0.140
 Defender+0.124
owered+0.121
finder+0.120
Neg logit contributions
ELY-0.124
yz-0.119
llah-0.117
icans-0.112
ami-0.112
Tokenwing
Token ID5469
Feature activation-3.48e-01

Pos logit contributions
ELY+0.005
yz+0.005
llah+0.005
ami+0.005
icans+0.005
Neg logit contributions
qv-0.006
nexus-0.006
 Defender-0.005
owered-0.005
finder-0.005
Token Target
Token ID12744
Feature activation-3.94e-02

Pos logit contributions
qv+0.228
nexus+0.224
 Defender+0.196
owered+0.193
finder+0.191
Neg logit contributions
ELY-0.224
yz-0.219
llah-0.212
ami-0.205
icans-0.202
Tokening
Token ID278
Feature activation+9.44e-02

Pos logit contributions
qv+0.026
nexus+0.026
 Defender+0.022
owered+0.022
finder+0.022
Neg logit contributions
ELY-0.025
yz-0.024
llah-0.023
ami-0.023
icans-0.022
Token Computer
Token ID13851
Feature activation-1.55e-01

Pos logit contributions
ELY+0.062
yz+0.061
llah+0.059
ami+0.058
icans+0.056
Neg logit contributions
qv-0.060
nexus-0.059
owered-0.051
 Defender-0.050
finder-0.050
Token app
Token ID598
Feature activation+3.02e-02

Pos logit contributions
qv+0.108
nexus+0.107
owered+0.093
 Defender+0.092
finder+0.091
Neg logit contributions
ELY-0.093
yz-0.091
llah-0.086
ami-0.085
icans-0.082
Token here
Token ID994
Feature activation+1.21e-01

Pos logit contributions
ELY+0.016
yz+0.016
llah+0.015
ami+0.015
icans+0.014
Neg logit contributions
qv-0.023
nexus-0.023
 Defender-0.020
owered-0.020
finder-0.020