TOP ACTIVATIONS
MAX = 0.738

elled as tre ason by young members
elle Evans and David E ason are officially engaged , she
minerals , especially since ep oxy resin is " res il
fog machines or WWE page antry . S igh .
itle ft > lol j ason [ 10 : 47 ]
uc ort > haha j ason [ 10 : 47 ]
allowed the Canadiens to exc ite fans around the world ;
hex agon - shaped ep oxy resin scaff old that measures
he first appeared on The Forbes 400 in 2006 , he
first RPG scores to use samples rather than just MIDI ,
the player is sufficiently contr ite , does some sort of
people dead . The resh uffle appears to have strengthened Mr
a bit of a resp ite , and 2012 was 25
copyright AFP Image caption The Kurds ' push has alarmed Turkey
other composition information about powder samples from rocks and soil .
ne s omm es pas encl in à cher cher le
days later , bed r agg led Bart was found me
)" high resolution no j agg ies but screen door effect
play it with the j agg ies since even though they
you can see " j agg ies " along diagonal lines

MIN ACTIVATIONS
MIN = -0.702

dolphins and the highly endangered Bengal tiger . "
two reasons . Instead of gloss ing over these details ,
. youtube . com / watch ? v = 2 g
about him have been hed ged , of all the Americans
he bore no gr ud ges . He reiterated that he
D ana Hod ges , state director of Concern
whose work is occasionally dod gy and occasionally brilliant , hits
autonomy under the Norfolk Island Act 1979 through its own legislative
MP ID R ) in Rost ock , Germany , has
Fe eling Hel pless But are the
, but since I use macros I also have the names
players . <|endoftext|> A D DoS Gap occurs whenever D DoS
a peer - to - peer app , founded in Austin
Booker is not just in offensive , but corrupt . <|endoftext|>
- year - old Hed ges is sure to be a
: " Defense Of Marriage Act "). But the whole argument
queue delay , and conver ges with high probability to Klein
the petty - bour ge ois intellig ents ia forged in
world . ( Wa ub ges hig Rice / CBC )
Post reading " ISIS Ur ges Jihad ists To Attack Canadians

INTERVAL 0.378 - 0.738
CONTAINS 0.155%

.: New York , NY $ 5 , 450 Membership grant
F ocal ac ral hyper ker at osis ( also known
ole i emo ko Kal an
, and no personnel have exceeded 20 m S v in
extracted his nickel . Wh arton had spent $ 200 ,

INTERVAL 0.018 - 0.378
CONTAINS 39.429%

all that stuff has an impact on our long - term
s has some miscar riages in the past ,
I don t try to communicate with kids .
implemented this in the states include : Uber , Ben &
I found nothing in their writings to warrant anything approaching the

INTERVAL -0.342 - 0.018
CONTAINS 60.104%

mod er ates voting for T org ersen
I think we can all agree that if we
the fr inges of the right . How does someone who
saw already dangling , waiting for opening night .
enforced , natural gas - fired electricity generation produces more lif

INTERVAL -0.702 - -0.342
CONTAINS 0.313%

- 35 f / 4 IS lens at 16 mm .
to g ank immedi at ly after 6 . With EQ
] < mom per > lol m 2 k [ 10
women , 37 % were manual workers , over 10 %
Pick M . G ess berg witz West chester 1 hour

PER-TOKEN DETAILS

Tokenelled
Token ID11978
Feature activation+2.25e-01

Pos logit contributions
NRS+0.145
 UP+0.144
 Act+0.144
 gloss+0.143
 Rost+0.143
Neg logit contributions
ason-0.118
arton-0.112
asons-0.100
Write-0.099
odox-0.099
Token as
Token ID355
Feature activation-2.68e-02

Pos logit contributions
ason+0.177
arton+0.170
asons+0.156
odox+0.155
Write+0.155
Neg logit contributions
NRS-0.124
 UP-0.123
 Act-0.122
 gloss-0.122
 Rost-0.121
Token
Token ID564
Feature activation-5.33e-02

Pos logit contributions
NRS+0.015
 UP+0.015
 Act+0.015
 gloss+0.015
 Rost+0.015
Neg logit contributions
ason-0.021
arton-0.020
asons-0.018
odox-0.018
Write-0.018
Token
Token ID246
Feature activation-1.87e-02

Pos logit contributions
NRS+0.038
 UP+0.037
 Act+0.037
 gloss+0.037
 Rost+0.037
Neg logit contributions
ason-0.034
arton-0.032
asons-0.029
odox-0.028
Write-0.028
Tokentre
Token ID33945
Feature activation+5.52e-02

Pos logit contributions
NRS+0.011
 UP+0.011
 Act+0.011
 gloss+0.011
 Rost+0.011
Neg logit contributions
ason-0.014
arton-0.013
asons-0.012
odox-0.012
Write-0.012
Tokenason
Token ID888
Feature activation+7.38e-01

Pos logit contributions
ason+0.038
arton+0.037
Write+0.033
asons+0.033
antry+0.033
Neg logit contributions
NRS-0.035
 UP-0.035
 Act-0.035
 gloss-0.035
 Rost-0.035
Token
Token ID447
Feature activation-1.36e-02

Pos logit contributions
ason+0.567
arton+0.542
asons+0.496
odox+0.494
Write+0.494
Neg logit contributions
NRS-0.424
 UP-0.419
 Act-0.417
 gloss-0.417
 Rost-0.413
Token
Token ID247
Feature activation+5.29e-02

Pos logit contributions
NRS+0.010
 Act+0.009
 UP+0.009
 gloss+0.009
 Rost+0.009
Neg logit contributions
ason-0.009
arton-0.008
odox-0.007
asons-0.007
Write-0.007
Token by
Token ID416
Feature activation-5.10e-02

Pos logit contributions
ason+0.035
arton+0.034
asons+0.030
odox+0.030
Write+0.030
Neg logit contributions
NRS-0.036
 UP-0.035
 Act-0.035
 gloss-0.035
 Rost-0.035
Token young
Token ID1862
Feature activation-7.75e-02

Pos logit contributions
NRS+0.029
 UP+0.029
 Act+0.029
 gloss+0.029
 Rost+0.028
Neg logit contributions
ason-0.039
arton-0.038
asons-0.034
odox-0.034
Write-0.034
Token members
Token ID1866
Feature activation-1.10e-01

Pos logit contributions
NRS+0.050
 UP+0.049
 Act+0.049
 gloss+0.049
 Rost+0.048
Neg logit contributions
ason-0.054
arton-0.052
asons-0.047
odox-0.047
Write-0.047
Tokenelle
Token ID13485
Feature activation-3.19e-02

Pos logit contributions
ason+0.205
arton+0.195
asons+0.178
odox+0.177
Write+0.177
Neg logit contributions
NRS-0.177
 UP-0.175
 Act-0.175
 gloss-0.174
 Rost-0.173
Token Evans
Token ID13922
Feature activation-2.23e-01

Pos logit contributions
NRS+0.014
 UP+0.013
 gloss+0.013
 Act+0.013
 Rost+0.013
Neg logit contributions
ason-0.029
arton-0.028
asons-0.026
Write-0.026
odox-0.026
Token and
Token ID290
Feature activation+4.84e-02

Pos logit contributions
NRS+0.126
 UP+0.123
 gloss+0.122
 Act+0.122
 Rost+0.121
Neg logit contributions
ason-0.174
arton-0.166
asons-0.152
odox-0.151
Write-0.151
Token David
Token ID3271
Feature activation+1.13e-01

Pos logit contributions
ason+0.037
arton+0.035
asons+0.032
Write+0.032
odox+0.032
Neg logit contributions
NRS-0.028
 UP-0.027
 gloss-0.027
 Act-0.027
 Rost-0.027
Token E
Token ID412
Feature activation-7.76e-02

Pos logit contributions
ason+0.089
arton+0.085
asons+0.078
odox+0.078
Write+0.077
Neg logit contributions
NRS-0.063
 UP-0.062
 gloss-0.061
 Act-0.061
 Rost-0.061
Tokenason
Token ID888
Feature activation+6.95e-01

Pos logit contributions
NRS+0.054
 UP+0.053
 Act+0.053
 gloss+0.053
 Rost+0.053
Neg logit contributions
ason-0.050
arton-0.048
asons-0.043
odox-0.043
Write-0.043
Token are
Token ID389
Feature activation+5.15e-02

Pos logit contributions
ason+0.486
arton+0.460
asons+0.418
antry+0.414
odox+0.413
Neg logit contributions
NRS-0.451
 UP-0.441
 gloss-0.438
 Act-0.437
 Rost-0.435
Token officially
Token ID8720
Feature activation-2.49e-01

Pos logit contributions
ason+0.038
arton+0.036
asons+0.033
Write+0.033
antry+0.033
Neg logit contributions
NRS-0.031
 UP-0.030
 Act-0.030
 gloss-0.030
 Rost-0.030
Token engaged
Token ID7953
Feature activation-1.69e-01

Pos logit contributions
NRS+0.147
 UP+0.142
 gloss+0.142
 Act+0.141
 Rost+0.141
Neg logit contributions
ason-0.188
arton-0.178
asons-0.164
Write-0.163
antry-0.163
Token,
Token ID11
Feature activation+5.16e-02

Pos logit contributions
NRS+0.106
 UP+0.104
 gloss+0.104
 Act+0.104
 Rost+0.103
Neg logit contributions
ason-0.121
arton-0.115
asons-0.105
odox-0.104
Write-0.104
Token she
Token ID673
Feature activation-3.26e-02

Pos logit contributions
ason+0.039
arton+0.037
asons+0.034
antry+0.034
Write+0.034
Neg logit contributions
NRS-0.030
 UP-0.029
 gloss-0.029
 Act-0.029
 Rost-0.029
Token minerals
Token ID21622
Feature activation-1.40e-01

Pos logit contributions
ason+0.002
arton+0.002
asons+0.002
Write+0.002
odox+0.002
Neg logit contributions
NRS-0.002
 UP-0.002
 Act-0.002
 gloss-0.002
 Rost-0.002
Token,
Token ID11
Feature activation+3.60e-02

Pos logit contributions
NRS+0.087
 UP+0.087
 Act+0.086
 gloss+0.086
 Rost+0.086
Neg logit contributions
ason-0.101
arton-0.096
asons-0.088
odox-0.087
Write-0.087
Token especially
Token ID2592
Feature activation+1.43e-02

Pos logit contributions
ason+0.027
arton+0.026
asons+0.023
Write+0.023
odox+0.023
Neg logit contributions
NRS-0.021
 UP-0.021
 Act-0.021
 gloss-0.021
 Rost-0.021
Token since
Token ID1201
Feature activation+8.89e-03

Pos logit contributions
ason+0.010
arton+0.010
asons+0.009
odox+0.009
Write+0.009
Neg logit contributions
NRS-0.009
 UP-0.009
 Act-0.009
 gloss-0.009
 Rost-0.009
Token ep
Token ID2462
Feature activation-2.17e-01

Pos logit contributions
ason+0.007
arton+0.007
asons+0.006
Write+0.006
odox+0.006
Neg logit contributions
NRS-0.005
 UP-0.005
 Act-0.005
 gloss-0.005
 Rost-0.005
Tokenoxy
Token ID23536
Feature activation+6.43e-01

Pos logit contributions
NRS+0.190
 gloss+0.189
 UP+0.189
 Act+0.188
 Rost+0.187
Neg logit contributions
ason-0.099
arton-0.092
Write-0.079
asons-0.078
antry-0.077
Token resin
Token ID36536
Feature activation+7.83e-02

Pos logit contributions
ason+0.475
arton+0.453
asons+0.413
Write+0.410
odox+0.410
Neg logit contributions
NRS-0.390
 UP-0.384
 Act-0.382
 gloss-0.379
 Rost-0.379
Token is
Token ID318
Feature activation-1.92e-02

Pos logit contributions
ason+0.061
arton+0.058
asons+0.053
Write+0.053
odox+0.053
Neg logit contributions
NRS-0.044
 UP-0.044
 Act-0.044
 gloss-0.043
 Rost-0.043
Token "
Token ID366
Feature activation-1.05e-01

Pos logit contributions
NRS+0.012
 UP+0.012
 Act+0.012
 Rost+0.012
 gloss+0.012
Neg logit contributions
ason-0.014
arton-0.013
asons-0.012
Write-0.012
odox-0.012
Tokenres
Token ID411
Feature activation+1.66e-02

Pos logit contributions
NRS+0.074
 UP+0.073
 Act+0.073
 gloss+0.073
 Rost+0.072
Neg logit contributions
ason-0.066
arton-0.063
asons-0.056
odox-0.056
Write-0.056
Tokenil
Token ID346
Feature activation+2.29e-01

Pos logit contributions
ason+0.010
arton+0.009
asons+0.008
odox+0.008
antry+0.008
Neg logit contributions
NRS-0.013
 UP-0.013
 Act-0.012
 gloss-0.012
 Rost-0.012
Token fog
Token ID19558
Feature activation-3.92e-01

Pos logit contributions
NRS+0.052
 UP+0.051
 Act+0.050
 gloss+0.050
 Rost+0.050
Neg logit contributions
ason-0.070
arton-0.067
asons-0.061
Write-0.061
antry-0.061
Token machines
Token ID8217
Feature activation-2.69e-02

Pos logit contributions
NRS+0.224
 UP+0.222
 Act+0.221
 gloss+0.220
 Rost+0.218
Neg logit contributions
ason-0.303
arton-0.290
asons-0.265
odox-0.264
Write-0.264
Token or
Token ID393
Feature activation-1.66e-01

Pos logit contributions
NRS+0.016
 UP+0.016
 Act+0.016
 gloss+0.016
 Rost+0.015
Neg logit contributions
ason-0.020
arton-0.019
asons-0.018
odox-0.018
Write-0.018
Token WWE
Token ID15247
Feature activation-2.36e-01

Pos logit contributions
NRS+0.093
 UP+0.092
 Act+0.091
 gloss+0.090
 Rost+0.090
Neg logit contributions
ason-0.129
arton-0.124
asons-0.113
antry-0.113
Write-0.113
Token page
Token ID2443
Feature activation-1.20e-01

Pos logit contributions
NRS+0.122
 UP+0.120
 Act+0.119
 gloss+0.119
 Rost+0.118
Neg logit contributions
ason-0.195
arton-0.187
asons-0.172
Write-0.172
antry-0.172
Tokenantry
Token ID21238
Feature activation+6.27e-01

Pos logit contributions
NRS+0.129
 UP+0.128
 Act+0.128
 gloss+0.128
 Rost+0.127
Neg logit contributions
ason-0.032
arton-0.028
antry-0.020
asons-0.020
Write-0.020
Token.
Token ID13
Feature activation-4.92e-02

Pos logit contributions
ason+0.465
arton+0.443
asons+0.404
antry+0.403
Write+0.402
Neg logit contributions
NRS-0.377
 UP-0.372
 Act-0.370
 gloss-0.369
 Rost-0.367
Token S
Token ID311
Feature activation-6.49e-02

Pos logit contributions
NRS+0.030
 UP+0.030
 gloss+0.030
 Act+0.029
 Rost+0.029
Neg logit contributions
ason-0.036
arton-0.034
asons-0.031
Write-0.031
odox-0.031
Tokenigh
Token ID394
Feature activation+6.57e-02

Pos logit contributions
NRS+0.046
 UP+0.046
 Act+0.046
 gloss+0.046
 Rost+0.045
Neg logit contributions
ason-0.041
arton-0.038
asons-0.034
odox-0.034
Write-0.034
Token.
Token ID13
Feature activation-5.89e-02

Pos logit contributions
ason+0.047
arton+0.045
asons+0.041
odox+0.041
Write+0.041
Neg logit contributions
NRS-0.041
 UP-0.041
 Act-0.040
 gloss-0.040
 Rost-0.040
Token
Token ID198
Feature activation-8.28e-02

Pos logit contributions
NRS+0.035
 UP+0.034
 gloss+0.034
 Act+0.034
 Rost+0.034
Neg logit contributions
ason-0.044
arton-0.042
asons-0.039
Write-0.039
odox-0.039
Tokenitle
Token ID2578
Feature activation-5.66e-03

Pos logit contributions
NRS+0.115
 UP+0.114
 Act+0.113
 gloss+0.113
 Rost+0.112
Neg logit contributions
ason-0.122
arton-0.116
asons-0.105
odox-0.105
Write-0.105
Tokenft
Token ID701
Feature activation-2.05e-01

Pos logit contributions
NRS+0.004
 UP+0.004
 Act+0.004
 gloss+0.004
 Rost+0.004
Neg logit contributions
ason-0.004
arton-0.004
asons-0.003
odox-0.003
Write-0.003
Token>
Token ID29
Feature activation+6.68e-04

Pos logit contributions
NRS+0.123
 UP+0.121
 Act+0.121
 gloss+0.121
 Rost+0.120
Neg logit contributions
ason-0.152
arton-0.145
asons-0.132
odox-0.132
Write-0.132
Token lol
Token ID19462
Feature activation-3.08e-01

Pos logit contributions
ason+0.001
arton+0.000
asons+0.000
Write+0.000
odox+0.000
Neg logit contributions
NRS-0.000
 UP-0.000
 Act-0.000
 gloss-0.000
 Rost-0.000
Token j
Token ID474
Feature activation+1.33e-01

Pos logit contributions
NRS+0.196
 UP+0.194
 Act+0.193
 gloss+0.193
 Rost+0.192
Neg logit contributions
ason-0.217
arton-0.207
asons-0.187
odox-0.187
Write-0.187
Tokenason
Token ID888
Feature activation+5.92e-01

Pos logit contributions
ason+0.030
arton+0.026
Write+0.017
odox+0.017
asons+0.017
Neg logit contributions
NRS-0.148
 UP-0.147
 Act-0.147
 gloss-0.147
 Rost-0.146
Token [
Token ID685
Feature activation-8.81e-02

Pos logit contributions
ason+0.380
arton+0.358
asons+0.322
odox+0.320
Write+0.320
Neg logit contributions
NRS-0.416
 UP-0.412
 Act-0.410
 gloss-0.409
 Rost-0.407
Token10
Token ID940
Feature activation+4.36e-02

Pos logit contributions
NRS+0.058
 UP+0.057
 Act+0.057
 gloss+0.057
 Rost+0.056
Neg logit contributions
ason-0.061
arton-0.057
asons-0.052
Write-0.052
antry-0.051
Token:
Token ID25
Feature activation-1.96e-01

Pos logit contributions
ason+0.032
arton+0.031
asons+0.028
Write+0.028
antry+0.028
Neg logit contributions
NRS-0.026
 UP-0.026
 gloss-0.026
 Act-0.026
 Rost-0.026
Token47
Token ID2857
Feature activation+1.96e-01

Pos logit contributions
NRS+0.151
 UP+0.150
 Act+0.149
 gloss+0.149
 Rost+0.148
Neg logit contributions
ason-0.112
arton-0.106
asons-0.093
odox-0.093
antry-0.093
Token]
Token ID60
Feature activation-9.52e-02

Pos logit contributions
ason+0.136
arton+0.128
asons+0.117
Write+0.116
antry+0.115
Neg logit contributions
NRS-0.128
 UP-0.126
 gloss-0.126
 Act-0.125
 Rost-0.125
Tokenuc
Token ID1229
Feature activation+2.26e-01

Pos logit contributions
ason+0.028
arton+0.027
asons+0.024
odox+0.024
Write+0.024
Neg logit contributions
NRS-0.025
 UP-0.025
 Act-0.025
 gloss-0.025
 Rost-0.025
Tokenort
Token ID419
Feature activation-5.00e-02

Pos logit contributions
ason+0.149
arton+0.142
Write+0.128
asons+0.128
odox+0.127
Neg logit contributions
NRS-0.152
 UP-0.151
 Act-0.151
 gloss-0.150
 Rost-0.149
Token>
Token ID29
Feature activation-2.17e-02

Pos logit contributions
NRS+0.033
 UP+0.033
 Act+0.032
 gloss+0.032
 Rost+0.032
Neg logit contributions
ason-0.034
arton-0.032
asons-0.029
Write-0.029
odox-0.029
Token haha
Token ID42254
Feature activation-4.10e-02

Pos logit contributions
NRS+0.013
 UP+0.012
 Act+0.012
 gloss+0.012
 Rost+0.012
Neg logit contributions
ason-0.017
arton-0.016
asons-0.014
Write-0.014
odox-0.014
Token j
Token ID474
Feature activation+1.92e-01

Pos logit contributions
NRS+0.027
 UP+0.026
 Act+0.026
 gloss+0.026
 Rost+0.026
Neg logit contributions
ason-0.028
arton-0.027
asons-0.024
odox-0.024
Write-0.024
Tokenason
Token ID888
Feature activation+5.87e-01

Pos logit contributions
ason+0.127
arton+0.120
asons+0.108
odox+0.108
Write+0.108
Neg logit contributions
NRS-0.131
 UP-0.130
 Act-0.129
 gloss-0.129
 Rost-0.128
Token [
Token ID685
Feature activation-5.00e-02

Pos logit contributions
ason+0.397
arton+0.376
asons+0.340
odox+0.338
Write+0.338
Neg logit contributions
NRS-0.392
 UP-0.388
 Act-0.387
 gloss-0.386
 Rost-0.383
Token10
Token ID940
Feature activation+5.28e-02

Pos logit contributions
NRS+0.033
 UP+0.033
 Act+0.032
 gloss+0.032
 Rost+0.032
Neg logit contributions
ason-0.034
arton-0.032
asons-0.030
Write-0.029
antry-0.029
Token:
Token ID25
Feature activation-1.70e-01

Pos logit contributions
ason+0.039
arton+0.037
asons+0.034
Write+0.034
antry+0.034
Neg logit contributions
NRS-0.032
 UP-0.031
 gloss-0.031
 Act-0.031
 Rost-0.031
Token47
Token ID2857
Feature activation+2.07e-01

Pos logit contributions
NRS+0.126
 UP+0.125
 Act+0.125
 gloss+0.124
 Rost+0.123
Neg logit contributions
ason-0.101
arton-0.096
asons-0.086
antry-0.085
odox-0.085
Token]
Token ID60
Feature activation-6.34e-02

Pos logit contributions
ason+0.143
arton+0.135
asons+0.123
Write+0.123
odox+0.121
Neg logit contributions
NRS-0.135
 UP-0.133
 gloss-0.133
 Act-0.132
 Rost-0.131
Token allowed
Token ID3142
Feature activation+6.21e-02

Pos logit contributions
ason+0.055
arton+0.052
asons+0.047
odox+0.047
Write+0.047
Neg logit contributions
NRS-0.056
 UP-0.056
 Act-0.056
 gloss-0.056
 Rost-0.056
Token the
Token ID262
Feature activation+3.44e-02

Pos logit contributions
ason+0.053
arton+0.051
asons+0.047
odox+0.047
Write+0.047
Neg logit contributions
NRS-0.031
 UP-0.030
 Act-0.030
 gloss-0.030
 Rost-0.029
Token Canadiens
Token ID33664
Feature activation+1.54e-01

Pos logit contributions
ason+0.025
arton+0.024
asons+0.022
odox+0.022
Write+0.022
Neg logit contributions
NRS-0.021
 UP-0.021
 Act-0.021
 gloss-0.021
 Rost-0.021
Token to
Token ID284
Feature activation+8.69e-02

Pos logit contributions
ason+0.120
arton+0.115
odox+0.105
asons+0.105
Write+0.105
Neg logit contributions
NRS-0.086
 UP-0.086
 Act-0.086
 gloss-0.085
 Rost-0.085
Token exc
Token ID2859
Feature activation+3.09e-01

Pos logit contributions
ason+0.061
arton+0.058
asons+0.053
odox+0.053
Write+0.053
Neg logit contributions
NRS-0.055
 UP-0.055
 Act-0.055
 gloss-0.054
 Rost-0.054
Tokenite
Token ID578
Feature activation+5.81e-01

Pos logit contributions
ason+0.054
arton+0.044
odox+0.025
antry+0.025
asons+0.024
Neg logit contributions
NRS-0.358
 UP-0.357
 Act-0.357
 gloss-0.356
 Rost-0.354
Token fans
Token ID3296
Feature activation+1.32e-01

Pos logit contributions
ason+0.465
arton+0.446
asons+0.408
Write+0.407
odox+0.406
Neg logit contributions
NRS-0.315
 UP-0.308
 gloss-0.307
 Act-0.306
 Rost-0.303
Token around
Token ID1088
Feature activation+2.76e-01

Pos logit contributions
ason+0.098
arton+0.094
odox+0.086
asons+0.086
Write+0.085
Neg logit contributions
NRS-0.079
 UP-0.078
 Act-0.078
 gloss-0.078
 Rost-0.077
Token the
Token ID262
Feature activation+7.36e-02

Pos logit contributions
ason+0.236
arton+0.227
asons+0.210
odox+0.210
Write+0.209
Neg logit contributions
NRS-0.133
 UP-0.132
 Act-0.132
 gloss-0.131
 Rost-0.130
Token world
Token ID995
Feature activation+2.37e-01

Pos logit contributions
ason+0.053
arton+0.050
asons+0.046
odox+0.045
Write+0.045
Neg logit contributions
NRS-0.046
 UP-0.046
 Act-0.046
 gloss-0.046
 Rost-0.045
Token;
Token ID26
Feature activation+1.71e-01

Pos logit contributions
ason+0.175
arton+0.168
asons+0.153
odox+0.153
Write+0.152
Neg logit contributions
NRS-0.143
 UP-0.142
 Act-0.141
 gloss-0.141
 Rost-0.140
Token hex
Token ID17910
Feature activation-2.62e-02

Pos logit contributions
NRS+0.006
 UP+0.006
 Act+0.006
 gloss+0.006
 Rost+0.005
Neg logit contributions
ason-0.007
arton-0.007
asons-0.006
Write-0.006
odox-0.006
Tokenagon
Token ID1840
Feature activation+4.25e-02

Pos logit contributions
NRS+0.020
 UP+0.020
 Act+0.020
 gloss+0.020
 Rost+0.020
Neg logit contributions
ason-0.015
arton-0.014
asons-0.012
Write-0.012
odox-0.012
Token-
Token ID12
Feature activation-3.87e-02

Pos logit contributions
ason+0.031
arton+0.029
asons+0.027
odox+0.026
Write+0.026
Neg logit contributions
NRS-0.026
 UP-0.026
 Act-0.026
 gloss-0.026
 Rost-0.026
Tokenshaped
Token ID16760
Feature activation+1.49e-02

Pos logit contributions
NRS+0.025
 UP+0.025
 Act+0.025
 gloss+0.025
 Rost+0.025
Neg logit contributions
ason-0.027
arton-0.025
asons-0.023
odox-0.023
Write-0.023
Token ep
Token ID2462
Feature activation-2.94e-01

Pos logit contributions
ason+0.011
arton+0.010
asons+0.009
Write+0.009
antry+0.009
Neg logit contributions
NRS-0.009
 UP-0.009
 Act-0.009
 gloss-0.009
 Rost-0.009
Tokenoxy
Token ID23536
Feature activation+5.78e-01

Pos logit contributions
 UP+0.259
NRS+0.258
 gloss+0.258
 Act+0.258
 Rost+0.256
Neg logit contributions
ason-0.131
arton-0.121
asons-0.106
Write-0.105
odox-0.103
Token resin
Token ID36536
Feature activation+3.70e-02

Pos logit contributions
ason+0.413
arton+0.394
asons+0.357
Write+0.356
antry+0.355
Neg logit contributions
NRS-0.363
 UP-0.358
 Act-0.356
 gloss-0.354
 Rost-0.353
Token scaff
Token ID41498
Feature activation+2.29e-01

Pos logit contributions
ason+0.027
arton+0.025
asons+0.023
Write+0.023
odox+0.023
Neg logit contributions
NRS-0.023
 UP-0.023
 Act-0.023
 gloss-0.023
 Rost-0.023
Tokenold
Token ID727
Feature activation+1.16e-01

Pos logit contributions
ason+0.143
arton+0.136
asons+0.123
Write+0.123
odox+0.122
Neg logit contributions
NRS-0.162
 Act-0.162
 UP-0.161
 gloss-0.160
 Rost-0.159
Token that
Token ID326
Feature activation+3.50e-02

Pos logit contributions
ason+0.084
arton+0.080
asons+0.073
odox+0.073
Write+0.073
Neg logit contributions
NRS-0.072
 UP-0.071
 Act-0.071
 gloss-0.071
 Rost-0.070
Token measures
Token ID5260
Feature activation+9.10e-02

Pos logit contributions
ason+0.026
arton+0.025
asons+0.023
odox+0.023
Write+0.023
Neg logit contributions
NRS-0.021
 UP-0.020
 Act-0.020
 gloss-0.020
 Rost-0.020
Token he
Token ID339
Feature activation+1.74e-01

Pos logit contributions
ason+0.013
arton+0.013
asons+0.012
Write+0.011
odox+0.011
Neg logit contributions
NRS-0.009
 UP-0.009
 gloss-0.009
 Act-0.009
 Rost-0.009
Token first
Token ID717
Feature activation+7.00e-02

Pos logit contributions
ason+0.124
arton+0.118
asons+0.107
odox+0.107
Write+0.107
Neg logit contributions
NRS-0.110
 UP-0.108
 Act-0.107
 gloss-0.106
 Rost-0.106
Token appeared
Token ID4120
Feature activation+9.85e-04

Pos logit contributions
ason+0.050
arton+0.048
asons+0.043
Write+0.043
odox+0.043
Neg logit contributions
NRS-0.044
 UP-0.043
 Act-0.043
 gloss-0.043
 Rost-0.043
Token on
Token ID319
Feature activation+1.58e-01

Pos logit contributions
ason+0.001
arton+0.001
asons+0.001
odox+0.001
Write+0.001
Neg logit contributions
NRS-0.001
 UP-0.001
 Act-0.001
 gloss-0.001
 Rost-0.001
Token The
Token ID383
Feature activation+6.04e-02

Pos logit contributions
ason+0.121
arton+0.116
Write+0.106
odox+0.106
asons+0.106
Neg logit contributions
NRS-0.091
 UP-0.090
 gloss-0.089
 Act-0.088
 Rost-0.088
Token Forbes
Token ID22720
Feature activation+5.67e-01

Pos logit contributions
ason+0.046
arton+0.044
Write+0.041
asons+0.041
odox+0.041
Neg logit contributions
NRS-0.035
 UP-0.034
 gloss-0.034
 Act-0.034
 Rost-0.034
Token 400
Token ID7337
Feature activation+3.24e-01

Pos logit contributions
ason+0.403
arton+0.384
asons+0.350
odox+0.348
antry+0.346
Neg logit contributions
NRS-0.358
 Act-0.357
 UP-0.357
 gloss-0.356
 Rost-0.351
Token in
Token ID287
Feature activation+6.72e-02

Pos logit contributions
ason+0.229
arton+0.218
asons+0.197
Write+0.197
odox+0.197
Neg logit contributions
NRS-0.206
 UP-0.204
 Act-0.203
 gloss-0.203
 Rost-0.201
Token 2006
Token ID4793
Feature activation-4.69e-02

Pos logit contributions
ason+0.060
arton+0.058
asons+0.053
Write+0.053
antry+0.053
Neg logit contributions
NRS-0.031
 UP-0.029
 gloss-0.029
 Act-0.029
 Rost-0.029
Token,
Token ID11
Feature activation+8.42e-02

Pos logit contributions
NRS+0.029
 UP+0.029
 Act+0.029
 gloss+0.028
 Rost+0.028
Neg logit contributions
ason-0.034
arton-0.033
asons-0.030
odox-0.030
Write-0.029
Token he
Token ID339
Feature activation+2.30e-01

Pos logit contributions
ason+0.056
arton+0.054
Write+0.048
asons+0.048
odox+0.048
Neg logit contributions
NRS-0.057
 UP-0.056
 gloss-0.055
 Act-0.055
 Rost-0.055
Token first
Token ID717
Feature activation+1.01e-01

Pos logit contributions
ason+0.043
arton+0.041
asons+0.037
Write+0.037
odox+0.037
Neg logit contributions
NRS-0.043
 UP-0.043
 Act-0.042
 gloss-0.042
 Rost-0.042
Token RPG
Token ID12909
Feature activation-1.36e-02

Pos logit contributions
ason+0.081
arton+0.078
asons+0.072
Write+0.071
antry+0.071
Neg logit contributions
NRS-0.055
 UP-0.054
 gloss-0.054
 Act-0.053
 Rost-0.053
Token scores
Token ID8198
Feature activation-9.14e-02

Pos logit contributions
NRS+0.008
 UP+0.008
 Act+0.008
 gloss+0.008
 Rost+0.008
Neg logit contributions
ason-0.010
arton-0.010
asons-0.009
Write-0.009
antry-0.009
Token to
Token ID284
Feature activation+8.31e-02

Pos logit contributions
NRS+0.057
 UP+0.056
 Act+0.056
 gloss+0.056
 Rost+0.055
Neg logit contributions
ason-0.066
arton-0.063
asons-0.057
odox-0.057
Write-0.057
Token use
Token ID779
Feature activation+4.91e-02

Pos logit contributions
ason+0.059
arton+0.056
asons+0.051
odox+0.051
Write+0.051
Neg logit contributions
NRS-0.053
 UP-0.052
 Act-0.052
 gloss-0.052
 Rost-0.052
Token samples
Token ID8405
Feature activation+5.67e-01

Pos logit contributions
ason+0.040
arton+0.038
asons+0.035
Write+0.035
odox+0.035
Neg logit contributions
NRS-0.026
 UP-0.026
 Act-0.026
 gloss-0.026
 Rost-0.026
Token rather
Token ID2138
Feature activation+1.07e-01

Pos logit contributions
ason+0.403
arton+0.384
asons+0.349
odox+0.347
Write+0.347
Neg logit contributions
NRS-0.358
 UP-0.354
 Act-0.353
 gloss-0.352
 Rost-0.349
Token than
Token ID621
Feature activation-5.39e-02

Pos logit contributions
ason+0.082
arton+0.078
Write+0.072
asons+0.071
antry+0.071
Neg logit contributions
NRS-0.062
 UP-0.061
 Act-0.061
 Rost-0.061
 gloss-0.060
Token just
Token ID655
Feature activation+8.20e-02

Pos logit contributions
NRS+0.029
 UP+0.028
 Act+0.028
 Rost+0.028
 gloss+0.028
Neg logit contributions
ason-0.043
arton-0.042
Write-0.038
asons-0.038
antry-0.038
Token MIDI
Token ID33439
Feature activation+2.96e-03

Pos logit contributions
ason+0.067
arton+0.064
asons+0.059
Write+0.059
odox+0.059
Neg logit contributions
NRS-0.043
 UP-0.042
 Act-0.042
 Rost-0.042
 gloss-0.042
Token,
Token ID11
Feature activation+4.84e-02

Pos logit contributions
ason+0.002
arton+0.002
asons+0.002
odox+0.002
Write+0.002
Neg logit contributions
NRS-0.002
 UP-0.002
 Act-0.002
 gloss-0.002
 Rost-0.002
Token the
Token ID262
Feature activation-2.48e-02

Pos logit contributions
ason+0.153
arton+0.146
asons+0.133
odox+0.132
Write+0.132
Neg logit contributions
NRS-0.137
 UP-0.136
 Act-0.136
 gloss-0.135
 Rost-0.134
Token player
Token ID2137
Feature activation+3.35e-02

Pos logit contributions
NRS+0.015
 UP+0.015
 Act+0.015
 gloss+0.015
 Rost+0.015
Neg logit contributions
ason-0.018
arton-0.017
asons-0.016
Write-0.016
odox-0.016
Token is
Token ID318
Feature activation-7.20e-02

Pos logit contributions
ason+0.026
arton+0.024
asons+0.022
Write+0.022
odox+0.022
Neg logit contributions
NRS-0.019
 UP-0.019
 Act-0.019
 Rost-0.019
 gloss-0.019
Token sufficiently
Token ID17338
Feature activation-7.84e-02

Pos logit contributions
NRS+0.043
 UP+0.043
 Act+0.043
 gloss+0.042
 Rost+0.042
Neg logit contributions
ason-0.053
arton-0.051
asons-0.046
Write-0.046
odox-0.046
Token contr
Token ID3445
Feature activation+3.10e-01

Pos logit contributions
NRS+0.046
 UP+0.046
 Act+0.046
 gloss+0.045
 Rost+0.045
Neg logit contributions
ason-0.059
arton-0.056
asons-0.051
Write-0.051
odox-0.051
Tokenite
Token ID578
Feature activation+5.66e-01

Pos logit contributions
ason+0.055
arton+0.046
Write+0.027
asons+0.026
odox+0.026
Neg logit contributions
NRS-0.359
 UP-0.357
 Act-0.356
 gloss-0.356
 Rost-0.355
Token,
Token ID11
Feature activation-2.28e-02

Pos logit contributions
ason+0.416
arton+0.395
asons+0.362
Write+0.359
odox+0.359
Neg logit contributions
NRS-0.345
 UP-0.339
 Act-0.337
 gloss-0.337
 Rost-0.335
Token does
Token ID857
Feature activation-1.42e-01

Pos logit contributions
NRS+0.014
 UP+0.013
 Act+0.013
 gloss+0.013
 Rost+0.013
Neg logit contributions
ason-0.017
arton-0.016
asons-0.015
Write-0.015
odox-0.015
Token some
Token ID617
Feature activation+2.35e-01

Pos logit contributions
NRS+0.088
 UP+0.087
 Act+0.087
 gloss+0.086
 Rost+0.086
Neg logit contributions
ason-0.103
arton-0.099
asons-0.090
odox-0.089
Write-0.089
Token sort
Token ID3297
Feature activation+7.16e-02

Pos logit contributions
ason+0.167
arton+0.159
asons+0.144
odox+0.144
Write+0.144
Neg logit contributions
NRS-0.149
 UP-0.147
 Act-0.147
 gloss-0.147
 Rost-0.145
Token of
Token ID286
Feature activation-7.18e-02

Pos logit contributions
ason+0.052
arton+0.049
asons+0.045
odox+0.045
Write+0.045
Neg logit contributions
NRS-0.044
 UP-0.044
 Act-0.044
 gloss-0.044
 Rost-0.043
Token people
Token ID661
Feature activation+1.37e-02

Pos logit contributions
ason+0.058
arton+0.055
asons+0.050
odox+0.050
Write+0.050
Neg logit contributions
NRS-0.050
 UP-0.050
 gloss-0.050
 Act-0.050
 Rost-0.049
Token dead
Token ID2636
Feature activation+2.51e-02

Pos logit contributions
ason+0.009
arton+0.008
asons+0.008
Write+0.008
odox+0.008
Neg logit contributions
NRS-0.009
 UP-0.009
 Act-0.009
 gloss-0.009
 Rost-0.009
Token.
Token ID13
Feature activation+1.61e-02

Pos logit contributions
ason+0.019
arton+0.018
asons+0.016
odox+0.016
antry+0.016
Neg logit contributions
NRS-0.015
 UP-0.015
 Act-0.015
 gloss-0.015
 Rost-0.014
Token The
Token ID383
Feature activation+4.04e-02

Pos logit contributions
ason+0.012
arton+0.011
asons+0.010
odox+0.010
Write+0.010
Neg logit contributions
NRS-0.010
 UP-0.010
 Act-0.010
 gloss-0.010
 Rost-0.010
Token resh
Token ID27179
Feature activation-3.14e-02

Pos logit contributions
ason+0.030
arton+0.029
asons+0.026
odox+0.026
Write+0.026
Neg logit contributions
NRS-0.024
 UP-0.023
 gloss-0.023
 Act-0.023
 Rost-0.023
Tokenuffle
Token ID18137
Feature activation+5.66e-01

Pos logit contributions
NRS+0.030
 UP+0.030
 Act+0.030
 gloss+0.030
 Rost+0.030
Neg logit contributions
ason-0.011
arton-0.011
Write-0.009
asons-0.008
odox-0.008
Token appears
Token ID3568
Feature activation-1.65e-01

Pos logit contributions
ason+0.395
arton+0.374
asons+0.341
Write+0.339
antry+0.339
Neg logit contributions
NRS-0.365
 UP-0.358
 Act-0.356
 gloss-0.354
 Rost-0.354
Token to
Token ID284
Feature activation+1.26e-02

Pos logit contributions
NRS+0.093
 UP+0.093
 gloss+0.092
 Act+0.092
 Rost+0.091
Neg logit contributions
ason-0.128
arton-0.123
asons-0.112
odox-0.112
antry-0.112
Token have
Token ID423
Feature activation-7.12e-02

Pos logit contributions
ason+0.009
arton+0.009
asons+0.008
odox+0.008
Write+0.008
Neg logit contributions
NRS-0.008
 UP-0.008
 gloss-0.008
 Act-0.008
 Rost-0.008
Token strengthened
Token ID27324
Feature activation-1.38e-01

Pos logit contributions
NRS+0.045
 UP+0.045
 Act+0.045
 gloss+0.045
 Rost+0.044
Neg logit contributions
ason-0.051
arton-0.048
asons-0.044
odox-0.044
Write-0.043
Token Mr
Token ID1770
Feature activation+1.24e-01

Pos logit contributions
NRS+0.074
 UP+0.074
 Act+0.073
 gloss+0.073
 Rost+0.072
Neg logit contributions
ason-0.111
arton-0.107
asons-0.098
odox-0.098
Write-0.098
Token a
Token ID257
Feature activation+1.72e-02

Pos logit contributions
ason+0.054
arton+0.052
asons+0.048
odox+0.047
Write+0.047
Neg logit contributions
NRS-0.041
 UP-0.041
 Act-0.041
 gloss-0.040
 Rost-0.040
Token bit
Token ID1643
Feature activation-9.78e-02

Pos logit contributions
ason+0.012
arton+0.011
asons+0.010
odox+0.010
antry+0.010
Neg logit contributions
NRS-0.011
 UP-0.011
 Act-0.011
 gloss-0.011
 Rost-0.011
Token of
Token ID286
Feature activation-2.19e-02

Pos logit contributions
NRS+0.061
 UP+0.060
 Act+0.060
 gloss+0.060
 Rost+0.059
Neg logit contributions
ason-0.071
arton-0.067
asons-0.061
odox-0.061
Write-0.061
Token a
Token ID257
Feature activation-7.91e-03

Pos logit contributions
NRS+0.012
 UP+0.012
 Act+0.012
 gloss+0.012
 Rost+0.012
Neg logit contributions
ason-0.017
arton-0.016
asons-0.015
odox-0.015
Write-0.015
Token resp
Token ID1217
Feature activation+2.58e-01

Pos logit contributions
NRS+0.005
 UP+0.005
 Act+0.005
 gloss+0.005
 Rost+0.005
Neg logit contributions
ason-0.005
arton-0.005
asons-0.005
odox-0.005
antry-0.005
Tokenite
Token ID578
Feature activation+5.63e-01

Pos logit contributions
ason+0.043
arton+0.035
asons+0.019
antry+0.019
odox+0.019
Neg logit contributions
NRS-0.301
 UP-0.300
 Act-0.299
 gloss-0.299
 Rost-0.297
Token,
Token ID11
Feature activation-2.84e-02

Pos logit contributions
ason+0.409
arton+0.388
asons+0.354
Write+0.352
antry+0.351
Neg logit contributions
NRS-0.350
 UP-0.344
 Act-0.342
 gloss-0.342
 Rost-0.340
Token and
Token ID290
Feature activation+1.25e-03

Pos logit contributions
NRS+0.016
 UP+0.016
 Act+0.016
 gloss+0.016
 Rost+0.016
Neg logit contributions
ason-0.022
arton-0.021
asons-0.019
odox-0.019
Write-0.019
Token 2012
Token ID2321
Feature activation+3.73e-02

Pos logit contributions
ason+0.001
arton+0.001
asons+0.001
odox+0.001
Write+0.001
Neg logit contributions
NRS-0.001
 UP-0.001
 Act-0.001
 gloss-0.001
 Rost-0.001
Token was
Token ID373
Feature activation+5.94e-02

Pos logit contributions
ason+0.027
arton+0.025
asons+0.023
odox+0.023
Write+0.023
Neg logit contributions
NRS-0.023
 UP-0.023
 Act-0.023
 gloss-0.023
 Rost-0.023
Token 25
Token ID1679
Feature activation+1.45e-01

Pos logit contributions
ason+0.043
arton+0.041
asons+0.038
odox+0.037
Write+0.037
Neg logit contributions
NRS-0.036
 UP-0.036
 Act-0.036
 gloss-0.036
 Rost-0.035
Token copyright
Token ID6634
Feature activation-2.17e-01

Pos logit contributions
ason+0.061
arton+0.059
asons+0.055
odox+0.054
antry+0.054
Neg logit contributions
 UP-0.027
NRS-0.027
 Act-0.027
 gloss-0.026
 Rost-0.026
Token AFP
Token ID8916
Feature activation+2.19e-01

Pos logit contributions
 UP+0.143
NRS+0.142
 Act+0.142
 Rost+0.141
 gloss+0.141
Neg logit contributions
ason-0.149
arton-0.142
asons-0.129
odox-0.128
antry-0.128
Token Image
Token ID7412
Feature activation+7.47e-02

Pos logit contributions
ason+0.160
arton+0.152
asons+0.139
odox+0.138
Write+0.138
Neg logit contributions
NRS-0.135
 UP-0.134
 Act-0.133
 gloss-0.133
 Rost-0.132
Token caption
Token ID8305
Feature activation-1.69e-01

Pos logit contributions
ason+0.072
arton+0.070
odox+0.065
asons+0.065
Write+0.065
Neg logit contributions
NRS-0.028
 UP-0.028
 Act-0.028
 gloss-0.027
 Rost-0.027
Token The
Token ID383
Feature activation+2.30e-02

Pos logit contributions
NRS+0.108
 UP+0.108
 Act+0.108
 Rost+0.107
 gloss+0.107
Neg logit contributions
ason-0.118
arton-0.112
asons-0.102
odox-0.102
Write-0.101
Token Kurds
Token ID23880
Feature activation+5.60e-01

Pos logit contributions
ason+0.016
arton+0.015
asons+0.013
odox+0.013
Write+0.013
Neg logit contributions
NRS-0.015
 UP-0.015
 Act-0.015
 gloss-0.015
 Rost-0.015
Token'
Token ID6
Feature activation+1.20e-01

Pos logit contributions
ason+0.409
arton+0.391
asons+0.355
odox+0.355
Write+0.354
Neg logit contributions
NRS-0.342
 UP-0.339
 Act-0.338
 gloss-0.337
 Rost-0.335
Token push
Token ID4574
Feature activation+1.84e-01

Pos logit contributions
ason+0.093
arton+0.089
asons+0.081
odox+0.081
Write+0.081
Neg logit contributions
NRS-0.069
 UP-0.068
 Act-0.068
 gloss-0.068
 Rost-0.067
Token has
Token ID468
Feature activation-1.13e-01

Pos logit contributions
ason+0.139
arton+0.133
asons+0.122
odox+0.121
Write+0.121
Neg logit contributions
NRS-0.107
 UP-0.106
 Act-0.106
 gloss-0.105
 Rost-0.105
Token alarmed
Token ID32064
Feature activation+1.77e-01

Pos logit contributions
 UP+0.072
 gloss+0.072
NRS+0.072
 Act+0.072
 Rost+0.071
Neg logit contributions
ason-0.079
arton-0.076
odox-0.069
asons-0.069
Write-0.068
Token Turkey
Token ID7137
Feature activation+3.00e-01

Pos logit contributions
ason+0.120
arton+0.114
asons+0.103
odox+0.103
Write+0.103
Neg logit contributions
NRS-0.117
 UP-0.115
 Act-0.115
 gloss-0.115
 Rost-0.114
Token other
Token ID584
Feature activation+7.26e-02

Pos logit contributions
ason+0.040
arton+0.038
asons+0.035
Write+0.035
odox+0.035
Neg logit contributions
NRS-0.030
 UP-0.029
 Act-0.029
 gloss-0.029
 Rost-0.029
Token composition
Token ID11742
Feature activation-1.78e-02

Pos logit contributions
ason+0.057
arton+0.054
asons+0.050
Write+0.049
odox+0.049
Neg logit contributions
NRS-0.041
 UP-0.040
 Act-0.040
 gloss-0.040
 Rost-0.040
Token information
Token ID1321
Feature activation-5.80e-02

Pos logit contributions
NRS+0.011
 UP+0.011
 Act+0.010
 gloss+0.010
 Rost+0.010
Neg logit contributions
ason-0.013
arton-0.013
asons-0.012
Write-0.011
odox-0.011
Token about
Token ID546
Feature activation+5.21e-02

Pos logit contributions
NRS+0.035
 UP+0.035
 Act+0.035
 gloss+0.035
 Rost+0.034
Neg logit contributions
ason-0.043
arton-0.041
asons-0.037
Write-0.037
odox-0.037
Token powder
Token ID11913
Feature activation+1.94e-01

Pos logit contributions
ason+0.041
arton+0.039
asons+0.036
odox+0.036
Write+0.036
Neg logit contributions
NRS-0.029
 UP-0.029
 Act-0.028
 gloss-0.028
 Rost-0.028
Token samples
Token ID8405
Feature activation+5.54e-01

Pos logit contributions
ason+0.137
arton+0.131
asons+0.119
odox+0.119
Write+0.118
Neg logit contributions
NRS-0.123
 UP-0.122
 Act-0.121
 gloss-0.121
 Rost-0.120
Token from
Token ID422
Feature activation+7.94e-02

Pos logit contributions
ason+0.408
arton+0.390
asons+0.355
odox+0.354
Write+0.353
Neg logit contributions
NRS-0.335
 UP-0.332
 Act-0.330
 gloss-0.329
 Rost-0.327
Token rocks
Token ID12586
Feature activation+2.10e-01

Pos logit contributions
ason+0.060
arton+0.057
asons+0.052
Write+0.052
odox+0.052
Neg logit contributions
NRS-0.047
 UP-0.046
 Act-0.046
 gloss-0.046
 Rost-0.046
Token and
Token ID290
Feature activation+3.33e-02

Pos logit contributions
ason+0.150
arton+0.143
asons+0.130
Write+0.129
odox+0.129
Neg logit contributions
NRS-0.131
 UP-0.130
 Act-0.129
 gloss-0.129
 Rost-0.128
Token soil
Token ID9260
Feature activation+1.88e-01

Pos logit contributions
ason+0.024
arton+0.023
asons+0.021
Write+0.020
odox+0.020
Neg logit contributions
NRS-0.021
 UP-0.021
 Act-0.021
 gloss-0.021
 Rost-0.021
Token.
Token ID13
Feature activation+3.44e-02

Pos logit contributions
ason+0.110
arton+0.104
asons+0.092
odox+0.092
Write+0.091
Neg logit contributions
NRS-0.142
 UP-0.141
 Act-0.141
 gloss-0.141
 Rost-0.139
Token ne
Token ID497
Feature activation-7.40e-04

Pos logit contributions
ason+0.113
arton+0.108
asons+0.098
odox+0.098
Write+0.098
Neg logit contributions
NRS-0.093
 UP-0.092
 Act-0.092
 gloss-0.092
 Rost-0.091
Token s
Token ID264
Feature activation+2.42e-01

Pos logit contributions
NRS+0.000
 UP+0.000
 Act+0.000
 gloss+0.000
 Rost+0.000
Neg logit contributions
ason-0.001
arton-0.001
odox-0.000
Write-0.000
asons-0.000
Tokenomm
Token ID2002
Feature activation-1.66e-02

Pos logit contributions
ason+0.148
arton+0.140
Write+0.125
asons+0.125
odox+0.125
Neg logit contributions
NRS-0.176
 UP-0.175
 Act-0.175
 gloss-0.174
 Rost-0.173
Tokenes
Token ID274
Feature activation+8.99e-02

Pos logit contributions
NRS+0.011
 Act+0.011
 UP+0.011
 gloss+0.011
 Rost+0.011
Neg logit contributions
ason-0.011
arton-0.010
Write-0.009
odox-0.009
antry-0.009
Token pas
Token ID38836
Feature activation+3.82e-02

Pos logit contributions
ason+0.066
arton+0.064
asons+0.058
odox+0.058
Write+0.058
Neg logit contributions
NRS-0.054
 UP-0.054
 Act-0.054
 gloss-0.053
 Rost-0.053
Token encl
Token ID13507
Feature activation+5.51e-01

Pos logit contributions
ason+0.028
arton+0.027
asons+0.024
Write+0.024
odox+0.024
Neg logit contributions
NRS-0.023
 Act-0.023
 UP-0.023
 gloss-0.023
 Rost-0.023
Tokenin
Token ID259
Feature activation+9.49e-02

Pos logit contributions
ason+0.343
arton+0.325
asons+0.290
Write+0.290
odox+0.288
Neg logit contributions
NRS-0.394
 UP-0.391
 Act-0.390
 gloss-0.388
 Rost-0.387
Token à
Token ID28141
Feature activation-1.65e-01

Pos logit contributions
ason+0.055
arton+0.052
odox+0.046
Write+0.046
asons+0.046
Neg logit contributions
NRS-0.071
 UP-0.071
 Act-0.071
 gloss-0.071
 Rost-0.070
Token cher
Token ID21382
Feature activation+1.43e-01

Pos logit contributions
NRS+0.103
 Act+0.102
 UP+0.102
 gloss+0.102
 Rost+0.101
Neg logit contributions
ason-0.118
arton-0.113
asons-0.103
odox-0.102
Write-0.102
Tokencher
Token ID2044
Feature activation+2.24e-01

Pos logit contributions
ason+0.082
arton+0.077
asons+0.068
odox+0.068
Write+0.068
Neg logit contributions
NRS-0.110
 UP-0.109
 Act-0.109
 gloss-0.109
 Rost-0.108
Token le
Token ID443
Feature activation-1.41e-01

Pos logit contributions
ason+0.159
arton+0.151
asons+0.137
Write+0.137
odox+0.137
Neg logit contributions
NRS-0.142
 UP-0.140
 Act-0.140
 gloss-0.139
 Rost-0.138
Token days
Token ID1528
Feature activation-7.18e-02

Pos logit contributions
ason+0.036
arton+0.034
asons+0.032
odox+0.031
Write+0.031
Neg logit contributions
NRS-0.022
 UP-0.022
 Act-0.022
 gloss-0.022
 Rost-0.022
Token later
Token ID1568
Feature activation-8.47e-02

Pos logit contributions
NRS+0.039
 UP+0.039
 Act+0.039
 gloss+0.039
 Rost+0.039
Neg logit contributions
ason-0.057
arton-0.054
odox-0.050
asons-0.050
Write-0.050
Token,
Token ID11
Feature activation-3.85e-02

Pos logit contributions
NRS+0.052
 UP+0.052
 Act+0.052
 gloss+0.052
 Rost+0.052
Neg logit contributions
ason-0.061
arton-0.058
asons-0.053
odox-0.053
Write-0.053
Token bed
Token ID3996
Feature activation-7.53e-03

Pos logit contributions
NRS+0.025
 UP+0.025
 Act+0.025
 gloss+0.024
 Rost+0.024
Neg logit contributions
ason-0.027
arton-0.026
asons-0.023
odox-0.023
Write-0.023
Tokenr
Token ID81
Feature activation+1.20e-01

Pos logit contributions
NRS+0.004
 UP+0.004
 Act+0.004
 gloss+0.004
 Rost+0.004
Neg logit contributions
ason-0.006
arton-0.006
asons-0.006
odox-0.006
Write-0.006
Tokenagg
Token ID9460
Feature activation+5.51e-01

Pos logit contributions
ason+0.025
arton+0.022
asons+0.014
odox+0.014
Write+0.013
Neg logit contributions
NRS-0.135
 UP-0.135
 Act-0.134
 Rost-0.134
 gloss-0.133
Tokenled
Token ID992
Feature activation+1.90e-01

Pos logit contributions
ason+0.346
arton+0.327
asons+0.294
Write+0.293
odox+0.292
Neg logit contributions
NRS-0.392
 UP-0.389
 Act-0.388
 gloss-0.387
 Rost-0.385
Token Bart
Token ID13167
Feature activation+7.67e-02

Pos logit contributions
ason+0.140
arton+0.134
asons+0.122
Write+0.121
odox+0.121
Neg logit contributions
NRS-0.116
 UP-0.114
 Act-0.113
 gloss-0.113
 Rost-0.112
Token was
Token ID373
Feature activation+2.02e-02

Pos logit contributions
ason+0.052
arton+0.050
asons+0.045
odox+0.045
Write+0.045
Neg logit contributions
NRS-0.051
 UP-0.050
 Act-0.050
 gloss-0.050
 Rost-0.049
Token found
Token ID1043
Feature activation-7.97e-02

Pos logit contributions
ason+0.015
arton+0.014
asons+0.013
Write+0.013
antry+0.013
Neg logit contributions
NRS-0.012
 UP-0.012
 Act-0.012
 gloss-0.012
 Rost-0.012
Token me
Token ID502
Feature activation+1.59e-01

Pos logit contributions
NRS+0.050
 UP+0.049
 Act+0.049
 gloss+0.049
 Rost+0.048
Neg logit contributions
ason-0.057
arton-0.055
asons-0.050
odox-0.049
Write-0.049
Token)"
Token ID16725
Feature activation+4.86e-02

Pos logit contributions
NRS+0.076
 UP+0.075
 Act+0.075
 gloss+0.075
 Rost+0.074
Neg logit contributions
ason-0.089
arton-0.085
asons-0.077
odox-0.077
Write-0.077
Tokenhigh
Token ID8929
Feature activation+1.36e-01

Pos logit contributions
ason+0.035
arton+0.033
asons+0.030
odox+0.030
Write+0.030
Neg logit contributions
NRS-0.030
 UP-0.030
 Act-0.030
 gloss-0.030
 Rost-0.030
Token resolution
Token ID6323
Feature activation-3.12e-01

Pos logit contributions
ason+0.138
arton+0.134
asons+0.125
odox+0.125
Write+0.125
Neg logit contributions
NRS-0.045
 UP-0.044
 Act-0.043
 gloss-0.043
 Rost-0.043
Token no
Token ID645
Feature activation-9.32e-03

Pos logit contributions
NRS+0.192
 UP+0.190
 Act+0.189
 gloss+0.189
 Rost+0.187
Neg logit contributions
ason-0.227
arton-0.216
asons-0.197
odox-0.196
Write-0.196
Token j
Token ID474
Feature activation+1.84e-01

Pos logit contributions
NRS+0.003
 UP+0.003
 Act+0.003
 gloss+0.003
 Rost+0.003
Neg logit contributions
ason-0.009
arton-0.009
asons-0.008
Write-0.008
odox-0.008
Tokenagg
Token ID9460
Feature activation+5.48e-01

Pos logit contributions
ason+0.046
arton+0.040
asons+0.028
odox+0.028
Write+0.028
Neg logit contributions
NRS-0.201
 UP-0.200
 Act-0.199
 gloss-0.199
 Rost-0.198
Tokenies
Token ID444
Feature activation+2.89e-02

Pos logit contributions
ason+0.414
arton+0.396
asons+0.362
Write+0.360
odox+0.360
Neg logit contributions
NRS-0.322
 UP-0.318
 Act-0.317
 gloss-0.316
 Rost-0.314
Token but
Token ID475
Feature activation+1.21e-01

Pos logit contributions
ason+0.020
arton+0.020
asons+0.018
odox+0.018
Write+0.018
Neg logit contributions
NRS-0.018
 UP-0.018
 Act-0.018
 gloss-0.018
 Rost-0.018
Token screen
Token ID3159
Feature activation-3.03e-01

Pos logit contributions
ason+0.092
arton+0.088
asons+0.080
antry+0.080
Write+0.080
Neg logit contributions
NRS-0.071
 UP-0.070
 Act-0.069
 Rost-0.069
 gloss-0.069
Token door
Token ID3420
Feature activation+4.80e-02

Pos logit contributions
NRS+0.176
 UP+0.176
 Act+0.175
 gloss+0.175
 Rost+0.172
Neg logit contributions
ason-0.229
arton-0.220
odox-0.201
asons-0.201
Write-0.200
Token effect
Token ID1245
Feature activation-1.40e-02

Pos logit contributions
ason+0.041
arton+0.039
asons+0.036
odox+0.036
Write+0.036
Neg logit contributions
NRS-0.024
 UP-0.024
 Act-0.024
 gloss-0.024
 Rost-0.023
Token play
Token ID711
Feature activation+3.81e-02

Pos logit contributions
ason+0.099
arton+0.094
asons+0.085
Write+0.085
antry+0.085
Neg logit contributions
NRS-0.085
 UP-0.083
 Act-0.083
 Rost-0.083
 gloss-0.082
Token it
Token ID340
Feature activation-5.52e-02

Pos logit contributions
ason+0.028
arton+0.026
asons+0.024
odox+0.024
Write+0.024
Neg logit contributions
NRS-0.023
 UP-0.023
 Act-0.023
 gloss-0.023
 Rost-0.023
Token with
Token ID351
Feature activation-4.59e-03

Pos logit contributions
NRS+0.035
 UP+0.034
 Act+0.034
 gloss+0.034
 Rost+0.034
Neg logit contributions
ason-0.039
arton-0.037
asons-0.034
odox-0.034
Write-0.034
Token the
Token ID262
Feature activation-1.72e-02

Pos logit contributions
NRS+0.003
 UP+0.003
 Act+0.003
 gloss+0.003
 Rost+0.003
Neg logit contributions
ason-0.003
arton-0.003
asons-0.003
Write-0.003
odox-0.003
Token j
Token ID474
Feature activation+2.03e-01

Pos logit contributions
NRS+0.010
 UP+0.010
 Act+0.010
 gloss+0.010
 Rost+0.009
Neg logit contributions
ason-0.013
arton-0.013
asons-0.012
odox-0.012
Write-0.012
Tokenagg
Token ID9460
Feature activation+5.48e-01

Pos logit contributions
ason+0.128
arton+0.121
asons+0.108
odox+0.108
Write+0.108
Neg logit contributions
NRS-0.145
 UP-0.144
 Act-0.143
 gloss-0.143
 Rost-0.142
Tokenies
Token ID444
Feature activation+1.39e-02

Pos logit contributions
ason+0.417
arton+0.399
asons+0.365
odox+0.364
Write+0.363
Neg logit contributions
NRS-0.317
 UP-0.315
 Act-0.313
 gloss-0.312
 Rost-0.310
Token since
Token ID1201
Feature activation-8.01e-04

Pos logit contributions
ason+0.010
arton+0.010
asons+0.009
odox+0.009
Write+0.009
Neg logit contributions
NRS-0.009
 UP-0.009
 Act-0.009
 gloss-0.009
 Rost-0.009
Token even
Token ID772
Feature activation-2.78e-02

Pos logit contributions
NRS+0.000
 UP+0.000
 Act+0.000
 gloss+0.000
 Rost+0.000
Neg logit contributions
ason-0.001
arton-0.001
asons-0.001
Write-0.001
antry-0.001
Token though
Token ID996
Feature activation+3.21e-02

Pos logit contributions
NRS+0.018
 UP+0.017
 Act+0.017
 gloss+0.017
 Rost+0.017
Neg logit contributions
ason-0.020
arton-0.019
asons-0.017
Write-0.017
odox-0.017
Token they
Token ID484
Feature activation+4.29e-02

Pos logit contributions
ason+0.024
arton+0.023
asons+0.021
Write+0.020
odox+0.020
Neg logit contributions
NRS-0.019
 UP-0.019
 Act-0.019
 gloss-0.019
 Rost-0.019
Token you
Token ID345
Feature activation-4.58e-02

Pos logit contributions
NRS+0.017
 UP+0.017
 Act+0.017
 gloss+0.017
 Rost+0.017
Neg logit contributions
ason-0.021
arton-0.020
asons-0.018
odox-0.018
Write-0.018
Token can
Token ID460
Feature activation-1.28e-01

Pos logit contributions
NRS+0.026
 UP+0.025
 Act+0.025
 gloss+0.025
 Rost+0.025
Neg logit contributions
ason-0.036
arton-0.034
asons-0.032
Write-0.031
odox-0.031
Token see
Token ID766
Feature activation+2.36e-02

Pos logit contributions
NRS+0.080
 UP+0.079
 Act+0.078
 Rost+0.078
 gloss+0.078
Neg logit contributions
ason-0.091
arton-0.087
asons-0.079
Write-0.079
antry-0.079
Token "
Token ID366
Feature activation-1.49e-01

Pos logit contributions
ason+0.019
arton+0.018
asons+0.017
Write+0.017
odox+0.017
Neg logit contributions
NRS-0.013
 UP-0.012
 Act-0.012
 gloss-0.012
 Rost-0.012
Tokenj
Token ID73
Feature activation+4.49e-02

Pos logit contributions
NRS+0.096
 UP+0.095
 Act+0.095
 gloss+0.094
 Rost+0.094
Neg logit contributions
ason-0.104
arton-0.099
asons-0.090
Write-0.089
odox-0.089
Tokenagg
Token ID9460
Feature activation+5.47e-01

Pos logit contributions
ason+0.012
arton+0.011
odox+0.008
Write+0.008
asons+0.008
Neg logit contributions
NRS-0.048
 UP-0.048
 Act-0.047
 gloss-0.047
 Rost-0.047
Tokenies
Token ID444
Feature activation-1.20e-02

Pos logit contributions
ason+0.427
arton+0.408
odox+0.374
asons+0.373
Write+0.373
Neg logit contributions
NRS-0.305
 UP-0.304
 Act-0.302
 gloss-0.301
 Rost-0.300
Token"
Token ID1
Feature activation+1.36e-01

Pos logit contributions
NRS+0.008
 UP+0.008
 Act+0.008
 gloss+0.008
 Rost+0.008
Neg logit contributions
ason-0.008
arton-0.007
asons-0.006
odox-0.006
Write-0.006
Token along
Token ID1863
Feature activation+1.40e-01

Pos logit contributions
ason+0.097
arton+0.093
asons+0.084
Write+0.084
odox+0.084
Neg logit contributions
NRS-0.085
 UP-0.084
 Act-0.084
 gloss-0.084
 Rost-0.083
Token diagonal
Token ID40039
Feature activation-8.94e-02

Pos logit contributions
ason+0.112
arton+0.107
asons+0.098
odox+0.098
Write+0.098
Neg logit contributions
NRS-0.077
 UP-0.076
 Act-0.076
 gloss-0.075
 Rost-0.075
Token lines
Token ID3951
Feature activation+5.84e-03

Pos logit contributions
NRS+0.058
 UP+0.058
 Act+0.057
 gloss+0.057
 Rost+0.057
Neg logit contributions
ason-0.062
arton-0.059
asons-0.053
odox-0.053
Write-0.053
Token dolphins
Token ID36798
Feature activation+9.25e-02

Pos logit contributions
NRS+0.243
 Act+0.242
 UP+0.242
 gloss+0.241
 Rost+0.239
Neg logit contributions
ason-0.270
arton-0.257
asons-0.233
Write-0.233
odox-0.232
Token and
Token ID290
Feature activation+9.51e-04

Pos logit contributions
ason+0.068
arton+0.065
asons+0.059
odox+0.059
Write+0.059
Neg logit contributions
NRS-0.056
 UP-0.056
 Act-0.056
 gloss-0.055
 Rost-0.055
Token the
Token ID262
Feature activation+1.52e-02

Pos logit contributions
ason+0.001
arton+0.001
asons+0.001
odox+0.001
Write+0.001
Neg logit contributions
NRS-0.001
 UP-0.001
 Act-0.001
 gloss-0.001
 Rost-0.001
Token highly
Token ID4047
Feature activation+5.12e-03

Pos logit contributions
ason+0.011
arton+0.011
asons+0.010
odox+0.010
Write+0.010
Neg logit contributions
NRS-0.009
 UP-0.009
 Act-0.009
 gloss-0.009
 Rost-0.009
Token endangered
Token ID22700
Feature activation-1.86e-01

Pos logit contributions
ason+0.004
arton+0.004
asons+0.003
odox+0.003
Write+0.003
Neg logit contributions
NRS-0.003
 UP-0.003
 Act-0.003
 Rost-0.003
 gloss-0.003
Token Bengal
Token ID28630
Feature activation-7.02e-01

Pos logit contributions
NRS+0.112
 UP+0.110
 gloss+0.109
 Act+0.109
 Rost+0.108
Neg logit contributions
ason-0.138
arton-0.132
asons-0.120
odox-0.120
Write-0.119
Token tiger
Token ID26241
Feature activation-1.60e-01

Pos logit contributions
NRS+0.405
 Act+0.403
 UP+0.402
 gloss+0.401
 Rost+0.397
Neg logit contributions
ason-0.536
arton-0.513
asons-0.469
Write-0.468
odox-0.467
Token.
Token ID13
Feature activation+1.24e-02

Pos logit contributions
NRS+0.100
 UP+0.099
 Act+0.098
 gloss+0.098
 Rost+0.097
Neg logit contributions
ason-0.115
arton-0.110
asons-0.100
odox-0.099
Write-0.099
Token\n
Token ID198
Feature activation+1.71e-02

Pos logit contributions
ason+0.010
arton+0.010
asons+0.009
odox+0.009
antry+0.009
Neg logit contributions
NRS-0.007
 UP-0.007
 Act-0.007
 gloss-0.007
 Rost-0.006
Token\n
Token ID198
Feature activation+1.01e-02

Pos logit contributions
ason+0.014
arton+0.013
Write+0.012
asons+0.012
antry+0.012
Neg logit contributions
NRS-0.009
 gloss-0.009
 UP-0.009
 Act-0.009
 Rost-0.009
Token"
Token ID1
Feature activation+9.38e-02

Pos logit contributions
ason+0.007
arton+0.007
asons+0.006
odox+0.006
Write+0.006
Neg logit contributions
NRS-0.007
 UP-0.006
 Act-0.006
 gloss-0.006
 Rost-0.006
Token two
Token ID734
Feature activation-9.66e-02

Pos logit contributions
NRS+0.053
 UP+0.052
 Act+0.052
 gloss+0.052
 Rost+0.051
Neg logit contributions
ason-0.052
arton-0.050
asons-0.045
odox-0.045
Write-0.044
Token reasons
Token ID3840
Feature activation+2.14e-01

Pos logit contributions
NRS+0.079
 UP+0.079
 Act+0.078
 gloss+0.078
 Rost+0.078
Neg logit contributions
ason-0.050
arton-0.047
asons-0.041
odox-0.041
Write-0.041
Token.
Token ID13
Feature activation-9.97e-02

Pos logit contributions
ason+0.160
arton+0.153
odox+0.140
asons+0.139
Write+0.139
Neg logit contributions
NRS-0.127
 UP-0.126
 Act-0.126
 gloss-0.125
 Rost-0.124
Token Instead
Token ID5455
Feature activation+4.97e-02

Pos logit contributions
NRS+0.055
 UP+0.054
 Act+0.054
 gloss+0.054
 Rost+0.053
Neg logit contributions
ason-0.079
arton-0.076
asons-0.070
odox-0.069
Write-0.069
Token of
Token ID286
Feature activation-9.22e-02

Pos logit contributions
ason+0.037
arton+0.035
asons+0.032
odox+0.032
Write+0.032
Neg logit contributions
 UP-0.030
NRS-0.030
 Act-0.029
 gloss-0.029
 Rost-0.029
Token gloss
Token ID21194
Feature activation-6.21e-01

Pos logit contributions
NRS+0.051
 UP+0.050
 Act+0.050
 gloss+0.050
 Rost+0.050
Neg logit contributions
ason-0.073
arton-0.070
asons-0.064
Write-0.064
odox-0.064
Tokening
Token ID278
Feature activation-9.81e-02

Pos logit contributions
NRS+0.415
 UP+0.411
 Act+0.409
 gloss+0.407
 Rost+0.406
Neg logit contributions
ason-0.419
arton-0.397
asons-0.358
odox-0.357
Write-0.357
Token over
Token ID625
Feature activation-1.11e-01

Pos logit contributions
NRS+0.058
 UP+0.058
 Act+0.057
 gloss+0.057
 Rost+0.057
Neg logit contributions
ason-0.073
arton-0.070
asons-0.064
Write-0.064
odox-0.064
Token these
Token ID777
Feature activation-9.58e-02

Pos logit contributions
NRS+0.061
 UP+0.060
 Act+0.060
 gloss+0.060
 Rost+0.059
Neg logit contributions
ason-0.087
arton-0.084
asons-0.077
Write-0.076
odox-0.076
Token details
Token ID3307
Feature activation-3.01e-02

Pos logit contributions
NRS+0.059
 UP+0.058
 Act+0.058
 gloss+0.058
 Rost+0.057
Neg logit contributions
ason-0.070
arton-0.066
asons-0.060
Write-0.060
odox-0.060
Token,
Token ID11
Feature activation+4.65e-03

Pos logit contributions
NRS+0.019
 UP+0.018
 Act+0.018
 gloss+0.018
 Rost+0.018
Neg logit contributions
ason-0.022
arton-0.021
asons-0.019
Write-0.019
odox-0.019
Token.
Token ID13
Feature activation+4.34e-02

Pos logit contributions
NRS+0.027
 gloss+0.027
 UP+0.027
 Act+0.026
 Rost+0.026
Neg logit contributions
ason-0.030
arton-0.029
asons-0.026
Write-0.026
antry-0.026
Tokenyoutube
Token ID11604
Feature activation-8.44e-02

Pos logit contributions
ason+0.042
arton+0.041
asons+0.038
odox+0.038
Write+0.038
Neg logit contributions
NRS-0.016
 UP-0.016
 Act-0.016
 gloss-0.016
 Rost-0.015
Token.
Token ID13
Feature activation+4.29e-02

Pos logit contributions
NRS+0.053
 UP+0.052
 gloss+0.052
 Act+0.052
 Rost+0.052
Neg logit contributions
ason-0.060
arton-0.057
asons-0.052
Write-0.052
antry-0.051
Tokencom
Token ID785
Feature activation-1.20e-01

Pos logit contributions
ason+0.038
arton+0.037
asons+0.034
Write+0.034
antry+0.034
Neg logit contributions
NRS-0.020
 UP-0.019
 gloss-0.019
 Act-0.019
 Rost-0.019
Token/
Token ID14
Feature activation+2.87e-02

Pos logit contributions
NRS+0.079
 UP+0.078
 gloss+0.078
 Act+0.078
 Rost+0.077
Neg logit contributions
ason-0.083
arton-0.079
asons-0.072
Write-0.071
antry-0.071
Tokenwatch
Token ID8340
Feature activation-5.95e-01

Pos logit contributions
ason+0.032
arton+0.032
odox+0.030
asons+0.030
Write+0.030
Neg logit contributions
NRS-0.006
 UP-0.006
 Act-0.006
 gloss-0.006
 Rost-0.006
Token?
Token ID30
Feature activation-2.89e-01

Pos logit contributions
NRS+0.290
 UP+0.285
 gloss+0.284
 Act+0.283
 Rost+0.280
Neg logit contributions
ason-0.511
arton-0.489
asons-0.453
Write-0.452
antry-0.449
Tokenv
Token ID85
Feature activation+5.51e-02

Pos logit contributions
NRS+0.211
 UP+0.209
 Act+0.208
 gloss+0.208
 Rost+0.206
Neg logit contributions
ason-0.177
arton-0.167
asons-0.150
Write-0.149
odox-0.149
Token=
Token ID28
Feature activation-1.94e-01

Pos logit contributions
ason+0.039
arton+0.037
asons+0.033
Write+0.033
odox+0.033
Neg logit contributions
NRS-0.035
 UP-0.035
 Act-0.035
 gloss-0.035
 Rost-0.035
Token2
Token ID17
Feature activation+6.84e-04

Pos logit contributions
NRS+0.113
 UP+0.112
 Act+0.112
 gloss+0.111
 Rost+0.111
Neg logit contributions
ason-0.147
arton-0.141
asons-0.128
Write-0.128
odox-0.128
Tokeng
Token ID70
Feature activation-9.10e-02

Pos logit contributions
ason+0.001
arton+0.000
asons+0.000
Write+0.000
odox+0.000
Neg logit contributions
NRS-0.000
 UP-0.000
 Act-0.000
 gloss-0.000
 Rost-0.000
Token about
Token ID546
Feature activation-7.33e-03

Pos logit contributions
ason+0.040
arton+0.038
asons+0.034
odox+0.034
Write+0.034
Neg logit contributions
NRS-0.039
 UP-0.039
 Act-0.039
 gloss-0.039
 Rost-0.038
Token him
Token ID683
Feature activation+3.69e-02

Pos logit contributions
NRS+0.004
 UP+0.004
 Act+0.004
 gloss+0.004
 Rost+0.004
Neg logit contributions
ason-0.006
arton-0.005
Write-0.005
asons-0.005
odox-0.005
Token have
Token ID423
Feature activation-6.75e-02

Pos logit contributions
ason+0.026
arton+0.024
asons+0.022
odox+0.022
Write+0.022
Neg logit contributions
NRS-0.024
 UP-0.024
 Act-0.024
 gloss-0.024
 Rost-0.023
Token been
Token ID587
Feature activation-2.14e-02

Pos logit contributions
NRS+0.043
 UP+0.043
 gloss+0.043
 Act+0.043
 Rost+0.042
Neg logit contributions
ason-0.047
arton-0.045
asons-0.041
odox-0.041
Write-0.040
Token hed
Token ID16418
Feature activation-1.17e-01

Pos logit contributions
NRS+0.013
 UP+0.013
 Act+0.012
 gloss+0.012
 Rost+0.012
Neg logit contributions
ason-0.016
arton-0.015
asons-0.014
Write-0.014
odox-0.014
Tokenged
Token ID2004
Feature activation-5.85e-01

Pos logit contributions
NRS+0.023
 UP+0.023
 Act+0.022
 gloss+0.022
 Rost+0.022
Neg logit contributions
ason-0.133
arton-0.130
asons-0.122
odox-0.122
Write-0.122
Token,
Token ID11
Feature activation-3.51e-02

Pos logit contributions
NRS+0.357
 UP+0.354
 Act+0.353
 gloss+0.352
 Rost+0.349
Neg logit contributions
ason-0.427
arton-0.408
asons-0.372
odox-0.370
Write-0.370
Token of
Token ID286
Feature activation-2.03e-02

Pos logit contributions
NRS+0.022
 UP+0.021
 Act+0.021
 gloss+0.021
 Rost+0.021
Neg logit contributions
ason-0.025
arton-0.024
asons-0.022
Write-0.022
antry-0.022
Token all
Token ID477
Feature activation+8.10e-03

Pos logit contributions
NRS+0.011
 UP+0.011
 Act+0.011
 gloss+0.011
 Rost+0.011
Neg logit contributions
ason-0.016
arton-0.016
asons-0.014
Write-0.014
odox-0.014
Token the
Token ID262
Feature activation-3.09e-02

Pos logit contributions
ason+0.007
arton+0.006
asons+0.006
Write+0.006
antry+0.006
Neg logit contributions
NRS-0.004
 UP-0.004
 Act-0.004
 gloss-0.004
 Rost-0.004
Token Americans
Token ID3399
Feature activation+1.42e-01

Pos logit contributions
NRS+0.016
 UP+0.016
 Act+0.015
 gloss+0.015
 Rost+0.015
Neg logit contributions
ason-0.026
arton-0.025
asons-0.023
Write-0.023
odox-0.023
Token he
Token ID339
Feature activation+2.53e-02

Pos logit contributions
NRS+0.060
 UP+0.060
 Act+0.059
 gloss+0.059
 Rost+0.059
Neg logit contributions
ason-0.053
arton-0.050
asons-0.045
odox-0.045
Write-0.044
Token bore
Token ID18631
Feature activation-1.23e-01

Pos logit contributions
ason+0.018
arton+0.017
asons+0.015
odox+0.015
Write+0.015
Neg logit contributions
NRS-0.016
 UP-0.016
 Act-0.016
 gloss-0.016
 Rost-0.016
Token no
Token ID645
Feature activation-7.28e-02

Pos logit contributions
NRS+0.069
 UP+0.068
 Act+0.067
 gloss+0.067
 Rost+0.067
Neg logit contributions
ason-0.097
arton-0.093
asons-0.085
Write-0.085
odox-0.085
Token gr
Token ID1036
Feature activation+5.67e-02

Pos logit contributions
NRS+0.042
 UP+0.041
 Act+0.041
 gloss+0.041
 Rost+0.041
Neg logit contributions
ason-0.056
arton-0.053
asons-0.049
Write-0.048
antry-0.048
Tokenud
Token ID463
Feature activation+1.11e-01

Pos logit contributions
ason+0.033
arton+0.032
Write+0.029
odox+0.028
asons+0.028
Neg logit contributions
NRS-0.042
 UP-0.042
 Act-0.042
 gloss-0.041
 Rost-0.041
Tokenges
Token ID3212
Feature activation-5.81e-01

Pos logit contributions
ason+0.140
arton+0.137
asons+0.130
odox+0.130
Write+0.130
Neg logit contributions
NRS-0.008
 UP-0.008
 Act-0.008
 gloss-0.007
 Rost-0.007
Token.
Token ID13
Feature activation-8.18e-02

Pos logit contributions
NRS+0.315
 UP+0.313
 Act+0.312
 gloss+0.311
 Rost+0.308
Neg logit contributions
ason-0.464
arton-0.446
asons-0.409
odox-0.408
Write-0.407
Token He
Token ID679
Feature activation-1.09e-01

Pos logit contributions
NRS+0.050
 UP+0.049
 Act+0.049
 gloss+0.049
 Rost+0.049
Neg logit contributions
ason-0.060
arton-0.058
asons-0.052
odox-0.052
antry-0.052
Token reiterated
Token ID27116
Feature activation-3.60e-02

Pos logit contributions
NRS+0.072
 UP+0.071
 Act+0.071
 gloss+0.071
 Rost+0.070
Neg logit contributions
ason-0.074
arton-0.071
asons-0.064
odox-0.064
Write-0.064
Token that
Token ID326
Feature activation-9.18e-02

Pos logit contributions
NRS+0.023
 UP+0.022
 Act+0.022
 gloss+0.022
 Rost+0.022
Neg logit contributions
ason-0.026
arton-0.025
asons-0.022
odox-0.022
Write-0.022
Token he
Token ID339
Feature activation-3.03e-02

Pos logit contributions
NRS+0.067
 UP+0.067
 Act+0.066
 gloss+0.066
 Rost+0.066
Neg logit contributions
ason-0.056
arton-0.053
asons-0.047
odox-0.047
Write-0.047
Token\n
Token ID198
Feature activation-3.14e-02

Pos logit contributions
NRS+0.002
 UP+0.002
 gloss+0.002
 Act+0.002
 Rost+0.002
Neg logit contributions
ason-0.003
arton-0.003
asons-0.003
Write-0.003
odox-0.003
Token\n
Token ID198
Feature activation-3.76e-02

Pos logit contributions
NRS+0.017
 UP+0.017
 gloss+0.017
 Act+0.016
 Rost+0.016
Neg logit contributions
ason-0.025
arton-0.024
Write-0.022
asons-0.022
antry-0.022
TokenD
Token ID35
Feature activation-1.14e-01

Pos logit contributions
NRS+0.025
 UP+0.024
 Act+0.024
 gloss+0.024
 Rost+0.024
Neg logit contributions
ason-0.026
arton-0.025
asons-0.022
Write-0.022
odox-0.022
Tokenana
Token ID2271
Feature activation-8.54e-02

Pos logit contributions
NRS+0.077
 UP+0.077
 Act+0.076
 gloss+0.076
 Rost+0.076
Neg logit contributions
ason-0.075
arton-0.071
asons-0.065
odox-0.064
Write-0.064
Token Hod
Token ID22816
Feature activation+1.25e-01

Pos logit contributions
NRS+0.049
 UP+0.049
 Act+0.048
 gloss+0.048
 Rost+0.048
Neg logit contributions
ason-0.065
arton-0.063
asons-0.057
odox-0.057
Write-0.057
Tokenges
Token ID3212
Feature activation-5.59e-01

Pos logit contributions
ason+0.142
arton+0.138
asons+0.130
odox+0.129
Write+0.129
Neg logit contributions
NRS-0.026
 UP-0.025
 gloss-0.025
 Act-0.025
 Rost-0.024
Token,
Token ID11
Feature activation-1.79e-02

Pos logit contributions
NRS+0.337
 UP+0.336
 Act+0.336
 gloss+0.333
 Rost+0.332
Neg logit contributions
ason-0.411
arton-0.393
asons-0.360
odox-0.358
Write-0.357
Token state
Token ID1181
Feature activation+3.55e-03

Pos logit contributions
NRS+0.008
 gloss+0.008
 UP+0.008
 Act+0.008
 Rost+0.008
Neg logit contributions
ason-0.016
arton-0.015
Write-0.014
antry-0.014
odox-0.014
Token director
Token ID3437
Feature activation-3.11e-01

Pos logit contributions
ason+0.003
arton+0.003
asons+0.003
odox+0.003
Write+0.003
Neg logit contributions
NRS-0.002
 UP-0.002
 Act-0.002
 gloss-0.002
 Rost-0.002
Token of
Token ID286
Feature activation-1.25e-02

Pos logit contributions
 UP+0.183
NRS+0.182
 Act+0.182
 gloss+0.181
 Rost+0.180
Neg logit contributions
ason-0.233
arton-0.223
asons-0.204
odox-0.203
Write-0.203
Token Concern
Token ID32265
Feature activation+2.23e-01

Pos logit contributions
NRS+0.007
 gloss+0.006
 UP+0.006
 Act+0.006
 Rost+0.006
Neg logit contributions
ason-0.010
arton-0.010
odox-0.009
asons-0.009
Write-0.009
Token whose
Token ID3025
Feature activation-2.20e-01

Pos logit contributions
NRS+0.036
 gloss+0.035
 UP+0.035
 Act+0.035
 Rost+0.034
Neg logit contributions
ason-0.049
arton-0.047
Write-0.043
asons-0.043
antry-0.043
Token work
Token ID670
Feature activation+4.47e-02

Pos logit contributions
NRS+0.115
 UP+0.112
 Act+0.111
 gloss+0.111
 Rost+0.110
Neg logit contributions
ason-0.181
arton-0.173
asons-0.159
Write-0.159
antry-0.158
Token is
Token ID318
Feature activation-1.20e-01

Pos logit contributions
ason+0.034
arton+0.032
asons+0.029
odox+0.029
Write+0.029
Neg logit contributions
NRS-0.026
 UP-0.026
 Act-0.026
 gloss-0.026
 Rost-0.026
Token occasionally
Token ID10491
Feature activation+2.89e-01

Pos logit contributions
NRS+0.071
 UP+0.070
 Act+0.070
 gloss+0.070
 Rost+0.069
Neg logit contributions
ason-0.090
arton-0.086
asons-0.079
odox-0.078
Write-0.078
Token dod
Token ID20764
Feature activation-2.05e-01

Pos logit contributions
ason+0.235
arton+0.225
asons+0.207
odox+0.206
Write+0.206
Neg logit contributions
NRS-0.154
 UP-0.152
 Act-0.151
 gloss-0.151
 Rost-0.150
Tokengy
Token ID1360
Feature activation-5.59e-01

Pos logit contributions
NRS+0.039
 UP+0.037
 Act+0.037
 gloss+0.037
 Rost+0.036
Neg logit contributions
ason-0.236
arton-0.229
odox-0.216
Write-0.216
asons-0.216
Token and
Token ID290
Feature activation-6.23e-02

Pos logit contributions
NRS+0.341
 UP+0.339
 Act+0.337
 gloss+0.337
 Rost+0.334
Neg logit contributions
ason-0.408
arton-0.390
asons-0.355
odox-0.354
Write-0.353
Token occasionally
Token ID10491
Feature activation+2.75e-01

Pos logit contributions
NRS+0.040
 UP+0.039
 Act+0.039
 gloss+0.039
 Rost+0.039
Neg logit contributions
ason-0.044
arton-0.042
asons-0.038
Write-0.038
odox-0.038
Token brilliant
Token ID10457
Feature activation-1.96e-01

Pos logit contributions
ason+0.209
arton+0.199
asons+0.182
Write+0.181
odox+0.181
Neg logit contributions
NRS-0.161
 UP-0.158
 Act-0.157
 Rost-0.156
 gloss-0.156
Token,
Token ID11
Feature activation-5.44e-02

Pos logit contributions
NRS+0.120
 UP+0.119
 Act+0.119
 gloss+0.119
 Rost+0.117
Neg logit contributions
ason-0.143
arton-0.137
asons-0.125
odox-0.124
antry-0.124
Token hits
Token ID7127
Feature activation-6.55e-03

Pos logit contributions
NRS+0.033
 UP+0.032
 Act+0.032
 gloss+0.032
 Rost+0.032
Neg logit contributions
ason-0.041
arton-0.039
asons-0.035
Write-0.035
odox-0.035
Token autonomy
Token ID21851
Feature activation-4.57e-02

Pos logit contributions
ason+0.200
arton+0.193
Write+0.177
asons+0.177
odox+0.177
Neg logit contributions
NRS-0.119
 UP-0.116
 gloss-0.115
 Act-0.115
 Rost-0.115
Token under
Token ID739
Feature activation-1.18e-02

Pos logit contributions
NRS+0.028
 UP+0.028
 Act+0.028
 gloss+0.028
 Rost+0.028
Neg logit contributions
ason-0.033
arton-0.031
asons-0.029
odox-0.029
Write-0.029
Token the
Token ID262
Feature activation-6.66e-04

Pos logit contributions
NRS+0.006
 UP+0.006
 gloss+0.006
 Act+0.006
 Rost+0.006
Neg logit contributions
ason-0.010
arton-0.009
asons-0.009
odox-0.009
Write-0.009
Token Norfolk
Token ID32134
Feature activation+3.12e-02

Pos logit contributions
NRS+0.000
 UP+0.000
 gloss+0.000
 Rost+0.000
 Act+0.000
Neg logit contributions
ason-0.001
arton-0.000
asons-0.000
Write-0.000
odox-0.000
Token Island
Token ID5451
Feature activation-4.71e-02

Pos logit contributions
ason+0.030
arton+0.029
asons+0.027
odox+0.027
Write+0.027
Neg logit contributions
NRS-0.012
 UP-0.012
 gloss-0.011
 Act-0.011
 Rost-0.011
Token Act
Token ID2191
Feature activation-5.54e-01

Pos logit contributions
NRS+0.014
 UP+0.014
 Act+0.014
 gloss+0.014
 Rost+0.013
Neg logit contributions
ason-0.049
arton-0.048
asons-0.045
odox-0.045
Write-0.045
Token 1979
Token ID13521
Feature activation-4.50e-02

Pos logit contributions
NRS+0.313
 Act+0.311
 UP+0.310
 gloss+0.308
 Rost+0.305
Neg logit contributions
ason-0.430
arton-0.412
asons-0.377
Write-0.376
odox-0.376
Token through
Token ID832
Feature activation-1.21e-01

Pos logit contributions
NRS+0.027
 UP+0.027
 Act+0.027
 gloss+0.027
 Rost+0.027
Neg logit contributions
ason-0.033
arton-0.031
asons-0.029
odox-0.029
Write-0.029
Token its
Token ID663
Feature activation-8.77e-02

Pos logit contributions
NRS+0.061
 UP+0.059
 gloss+0.059
 Rost+0.059
 Act+0.058
Neg logit contributions
ason-0.102
arton-0.098
asons-0.090
Write-0.090
odox-0.089
Token own
Token ID898
Feature activation+4.37e-02

Pos logit contributions
NRS+0.047
 UP+0.047
 gloss+0.046
 Act+0.046
 Rost+0.046
Neg logit contributions
ason-0.070
arton-0.067
asons-0.062
Write-0.062
odox-0.061
Token legislative
Token ID10828
Feature activation-3.70e-01

Pos logit contributions
ason+0.036
arton+0.034
asons+0.031
Write+0.031
odox+0.031
Neg logit contributions
NRS-0.023
 UP-0.022
 gloss-0.022
 Rost-0.022
 Act-0.022
TokenMP
Token ID7378
Feature activation-1.18e-01

Pos logit contributions
ason+0.233
arton+0.223
asons+0.206
Write+0.206
odox+0.205
Neg logit contributions
 gloss-0.143
NRS-0.143
 Act-0.142
 UP-0.142
 Rost-0.141
TokenID
Token ID2389
Feature activation-1.13e-01

Pos logit contributions
NRS+0.060
 gloss+0.059
 Act+0.059
 UP+0.059
 Rost+0.058
Neg logit contributions
ason-0.098
arton-0.094
asons-0.087
Write-0.086
odox-0.086
TokenR
Token ID49
Feature activation-5.20e-02

Pos logit contributions
NRS+0.084
 UP+0.083
 Act+0.083
 gloss+0.083
 Rost+0.082
Neg logit contributions
ason-0.069
arton-0.065
asons-0.058
odox-0.057
Write-0.057
Token)
Token ID8
Feature activation+1.33e-01

Pos logit contributions
NRS+0.040
 UP+0.039
 Act+0.039
 gloss+0.039
 Rost+0.039
Neg logit contributions
ason-0.030
arton-0.028
asons-0.025
odox-0.025
Write-0.025
Token in
Token ID287
Feature activation+2.19e-02

Pos logit contributions
ason+0.098
arton+0.094
asons+0.086
odox+0.085
Write+0.085
Neg logit contributions
NRS-0.081
 UP-0.079
 Act-0.079
 gloss-0.079
 Rost-0.078
Token Rost
Token ID48306
Feature activation-5.53e-01

Pos logit contributions
ason+0.017
arton+0.016
asons+0.015
Write+0.015
antry+0.015
Neg logit contributions
NRS-0.013
 gloss-0.012
 UP-0.012
 Act-0.012
 Rost-0.012
Tokenock
Token ID735
Feature activation+1.32e-01

Pos logit contributions
NRS+0.545
 gloss+0.541
 UP+0.538
 Act+0.537
 Rost+0.531
Neg logit contributions
ason-0.203
arton-0.181
Write-0.144
asons-0.143
odox-0.143
Token,
Token ID11
Feature activation-1.08e-02

Pos logit contributions
ason+0.096
arton+0.091
asons+0.083
Write+0.083
odox+0.083
Neg logit contributions
NRS-0.081
 UP-0.080
 Act-0.079
 gloss-0.079
 Rost-0.078
Token Germany
Token ID4486
Feature activation-9.49e-02

Pos logit contributions
NRS+0.006
 gloss+0.006
 UP+0.006
 Act+0.006
 Rost+0.005
Neg logit contributions
ason-0.009
arton-0.008
asons-0.008
Write-0.008
odox-0.008
Token,
Token ID11
Feature activation-2.52e-02

Pos logit contributions
NRS+0.056
 UP+0.056
 Act+0.056
 gloss+0.055
 Rost+0.055
Neg logit contributions
ason-0.071
arton-0.068
asons-0.062
odox-0.062
Write-0.062
Token has
Token ID468
Feature activation-1.36e-01

Pos logit contributions
NRS+0.015
 UP+0.015
 Act+0.014
 gloss+0.014
 Rost+0.014
Neg logit contributions
ason-0.019
arton-0.018
asons-0.017
Write-0.017
odox-0.017
Token
Token ID198
Feature activation-3.32e-02

Pos logit contributions
NRS+0.045
 UP+0.044
 gloss+0.044
 Act+0.044
 Rost+0.043
Neg logit contributions
ason-0.061
arton-0.058
asons-0.053
Write-0.053
odox-0.053
Token
Token ID198
Feature activation-2.51e-02

Pos logit contributions
NRS+0.018
 gloss+0.018
 UP+0.018
 Act+0.017
 Rost+0.017
Neg logit contributions
ason-0.027
arton-0.026
Write-0.024
asons-0.024
antry-0.023
TokenFe
Token ID14304
Feature activation-3.21e-01

Pos logit contributions
NRS+0.016
 UP+0.016
 gloss+0.016
 Act+0.016
 Rost+0.016
Neg logit contributions
ason-0.018
arton-0.017
asons-0.015
Write-0.015
odox-0.015
Tokeneling
Token ID10809
Feature activation+1.19e-02

Pos logit contributions
NRS+0.237
 UP+0.235
 Act+0.234
 gloss+0.234
 Rost+0.232
Neg logit contributions
ason-0.193
arton-0.183
asons-0.162
odox-0.162
Write-0.162
Token Hel
Token ID5053
Feature activation-8.60e-02

Pos logit contributions
ason+0.009
arton+0.008
asons+0.008
Write+0.008
odox+0.008
Neg logit contributions
NRS-0.007
 UP-0.007
 gloss-0.007
 Act-0.007
 Rost-0.007
Tokenpless
Token ID14570
Feature activation-5.49e-01

Pos logit contributions
NRS+0.008
 UP+0.008
 Act+0.008
 gloss+0.008
 Rost+0.007
Neg logit contributions
ason-0.107
arton-0.104
asons-0.099
Write-0.098
odox-0.098
Token
Token ID198
Feature activation-7.00e-02

Pos logit contributions
NRS+0.303
 UP+0.298
 gloss+0.297
 Act+0.295
 Rost+0.293
Neg logit contributions
ason-0.435
arton-0.416
asons-0.382
Write-0.381
antry-0.379
Token
Token ID198
Feature activation-5.84e-02

Pos logit contributions
NRS+0.038
 gloss+0.038
 UP+0.037
 Act+0.037
 Rost+0.037
Neg logit contributions
ason-0.056
arton-0.053
Write-0.049
asons-0.049
antry-0.048
TokenBut
Token ID1537
Feature activation+8.02e-02

Pos logit contributions
NRS+0.038
 UP+0.038
 gloss+0.038
 Act+0.038
 Rost+0.038
Neg logit contributions
ason-0.040
arton-0.038
Write-0.035
asons-0.035
odox-0.034
Token are
Token ID389
Feature activation-5.25e-02

Pos logit contributions
ason+0.059
arton+0.056
asons+0.051
Write+0.051
odox+0.051
Neg logit contributions
NRS-0.049
 UP-0.048
 gloss-0.048
 Act-0.047
 Rost-0.047
Token the
Token ID262
Feature activation-6.09e-02

Pos logit contributions
NRS+0.029
 UP+0.028
 gloss+0.028
 Rost+0.028
 Act+0.028
Neg logit contributions
ason-0.041
arton-0.040
asons-0.036
Write-0.036
odox-0.036
Token,
Token ID11
Feature activation-2.59e-02

Pos logit contributions
NRS+0.078
 UP+0.078
 Act+0.077
 gloss+0.077
 Rost+0.076
Neg logit contributions
ason-0.095
arton-0.091
asons-0.083
odox-0.083
antry-0.082
Token but
Token ID475
Feature activation+9.88e-04

Pos logit contributions
NRS+0.016
 UP+0.016
 Act+0.016
 gloss+0.016
 Rost+0.016
Neg logit contributions
ason-0.018
arton-0.018
asons-0.016
odox-0.016
Write-0.016
Token since
Token ID1201
Feature activation-7.28e-02

Pos logit contributions
ason+0.001
arton+0.001
asons+0.001
odox+0.001
Write+0.001
Neg logit contributions
NRS-0.001
 UP-0.001
 Act-0.001
 gloss-0.001
 Rost-0.001
Token I
Token ID314
Feature activation-2.72e-02

Pos logit contributions
NRS+0.043
 UP+0.043
 Act+0.043
 gloss+0.043
 Rost+0.042
Neg logit contributions
ason-0.054
arton-0.052
odox-0.047
asons-0.047
Write-0.047
Token use
Token ID779
Feature activation-9.03e-02

Pos logit contributions
NRS+0.016
 UP+0.016
 Act+0.016
 gloss+0.016
 Rost+0.016
Neg logit contributions
ason-0.020
arton-0.019
asons-0.018
Write-0.018
odox-0.017
Token macros
Token ID34749
Feature activation-5.45e-01

Pos logit contributions
NRS+0.052
 UP+0.052
 Act+0.051
 gloss+0.051
 Rost+0.051
Neg logit contributions
ason-0.069
arton-0.066
asons-0.060
odox-0.060
antry-0.060
Token I
Token ID314
Feature activation-5.08e-02

Pos logit contributions
NRS+0.339
 UP+0.338
 Act+0.336
 gloss+0.336
 Rost+0.332
Neg logit contributions
ason-0.391
arton-0.375
asons-0.340
odox-0.339
antry-0.338
Token also
Token ID635
Feature activation+1.92e-01

Pos logit contributions
NRS+0.031
 UP+0.031
 Act+0.031
 gloss+0.031
 Rost+0.030
Neg logit contributions
ason-0.037
arton-0.035
asons-0.032
Write-0.032
odox-0.032
Token have
Token ID423
Feature activation-1.17e-01

Pos logit contributions
ason+0.142
arton+0.135
asons+0.123
Write+0.123
antry+0.122
Neg logit contributions
NRS-0.117
 UP-0.115
 Act-0.115
 gloss-0.114
 Rost-0.114
Token the
Token ID262
Feature activation-1.04e-01

Pos logit contributions
NRS+0.065
 UP+0.064
 Act+0.064
 gloss+0.064
 Rost+0.063
Neg logit contributions
ason-0.092
arton-0.088
asons-0.081
odox-0.081
antry-0.081
Token names
Token ID3891
Feature activation-2.25e-01

Pos logit contributions
NRS+0.056
 UP+0.056
 Act+0.055
 gloss+0.055
 Rost+0.055
Neg logit contributions
ason-0.083
arton-0.080
asons-0.073
odox-0.073
Write-0.073
Token players
Token ID1938
Feature activation-1.05e-01

Pos logit contributions
ason+0.023
arton+0.022
asons+0.020
Write+0.020
odox+0.020
Neg logit contributions
NRS-0.014
 UP-0.014
 gloss-0.014
 Rost-0.014
 Act-0.014
Token.
Token ID13
Feature activation+1.62e-02

Pos logit contributions
NRS+0.064
 UP+0.063
 Act+0.063
 gloss+0.063
 Rost+0.062
Neg logit contributions
ason-0.077
arton-0.073
asons-0.067
Write-0.067
odox-0.067
Token<|endoftext|>
Token ID50256
Feature activation+1.17e-01

Pos logit contributions
ason+0.013
arton+0.012
asons+0.011
Write+0.011
odox+0.011
Neg logit contributions
NRS-0.009
 gloss-0.009
 UP-0.009
 Act-0.009
 Rost-0.009
TokenA
Token ID32
Feature activation+6.04e-02

Pos logit contributions
ason+0.084
arton+0.080
asons+0.073
odox+0.073
Write+0.073
Neg logit contributions
NRS-0.073
 UP-0.072
 Act-0.072
 gloss-0.072
 Rost-0.071
Token D
Token ID360
Feature activation-9.73e-02

Pos logit contributions
ason+0.047
arton+0.045
asons+0.041
odox+0.041
Write+0.041
Neg logit contributions
NRS-0.034
 UP-0.034
 Act-0.034
 gloss-0.034
 Rost-0.033
TokenDoS
Token ID46498
Feature activation-5.42e-01

Pos logit contributions
NRS+0.063
 UP+0.062
 Act+0.062
 gloss+0.062
 Rost+0.061
Neg logit contributions
ason-0.068
arton-0.064
asons-0.058
odox-0.058
Write-0.058
Token Gap
Token ID33899
Feature activation-3.34e-01

Pos logit contributions
NRS+0.247
 UP+0.241
 gloss+0.239
 Rost+0.237
 Act+0.237
Neg logit contributions
ason-0.483
arton-0.463
asons-0.429
Write-0.429
odox-0.427
Token occurs
Token ID8833
Feature activation-5.08e-02

Pos logit contributions
NRS+0.190
 UP+0.186
 gloss+0.186
 Act+0.184
 Rost+0.184
Neg logit contributions
ason-0.260
arton-0.248
asons-0.227
Write-0.226
antry-0.226
Token whenever
Token ID8797
Feature activation-4.88e-02

Pos logit contributions
NRS+0.030
 UP+0.029
 Act+0.029
 gloss+0.029
 Rost+0.029
Neg logit contributions
ason-0.039
arton-0.037
asons-0.034
Write-0.034
odox-0.034
Token D
Token ID360
Feature activation-1.25e-01

Pos logit contributions
NRS+0.027
 gloss+0.027
 UP+0.027
 Act+0.026
 Rost+0.026
Neg logit contributions
ason-0.038
arton-0.036
Write-0.033
asons-0.033
odox-0.033
TokenDoS
Token ID46498
Feature activation-5.21e-01

Pos logit contributions
NRS+0.027
 UP+0.026
 gloss+0.026
 Act+0.026
 Rost+0.025
Neg logit contributions
ason-0.140
arton-0.136
asons-0.128
Write-0.128
odox-0.128
Token a
Token ID257
Feature activation-3.16e-02

Pos logit contributions
NRS+0.070
 UP+0.069
 Act+0.069
 gloss+0.069
 Rost+0.068
Neg logit contributions
ason-0.086
arton-0.082
asons-0.075
Write-0.075
antry-0.075
Token peer
Token ID12720
Feature activation-2.93e-01

Pos logit contributions
NRS+0.018
 UP+0.017
 Act+0.017
 gloss+0.017
 Rost+0.017
Neg logit contributions
ason-0.025
arton-0.024
asons-0.022
Write-0.022
odox-0.022
Token-
Token ID12
Feature activation-2.37e-02

Pos logit contributions
NRS+0.178
 UP+0.177
 Act+0.176
 gloss+0.176
 Rost+0.174
Neg logit contributions
ason-0.215
arton-0.206
asons-0.187
odox-0.187
Write-0.186
Tokento
Token ID1462
Feature activation-1.11e-01

Pos logit contributions
NRS+0.014
 UP+0.014
 Act+0.014
 gloss+0.014
 Rost+0.014
Neg logit contributions
ason-0.017
arton-0.017
asons-0.015
odox-0.015
Write-0.015
Token-
Token ID12
Feature activation-2.52e-02

Pos logit contributions
NRS+0.071
 UP+0.070
 Act+0.070
 gloss+0.070
 Rost+0.069
Neg logit contributions
ason-0.078
arton-0.074
asons-0.067
Write-0.067
antry-0.066
Tokenpeer
Token ID33350
Feature activation-5.40e-01

Pos logit contributions
NRS+0.002
 Act+0.002
 gloss+0.002
 Rost+0.002
 UP+0.002
Neg logit contributions
ason-0.032
arton-0.031
asons-0.029
Write-0.029
antry-0.029
Token app
Token ID598
Feature activation-6.22e-02

Pos logit contributions
NRS+0.300
 UP+0.296
 Act+0.294
 gloss+0.294
 Rost+0.291
Neg logit contributions
ason-0.425
arton-0.407
asons-0.373
Write-0.372
odox-0.372
Token,
Token ID11
Feature activation+8.11e-04

Pos logit contributions
NRS+0.037
 UP+0.036
 Act+0.036
 gloss+0.036
 Rost+0.036
Neg logit contributions
ason-0.047
arton-0.045
asons-0.041
Write-0.041
antry-0.041
Token founded
Token ID9393
Feature activation-1.20e-01

Pos logit contributions
ason+0.001
arton+0.001
Write+0.001
asons+0.001
antry+0.001
Neg logit contributions
NRS-0.000
 UP-0.000
 Act-0.000
 gloss-0.000
 Rost-0.000
Token in
Token ID287
Feature activation-1.47e-02

Pos logit contributions
NRS+0.068
 UP+0.068
 Act+0.068
 gloss+0.068
 Rost+0.067
Neg logit contributions
ason-0.092
arton-0.088
asons-0.081
odox-0.081
Write-0.080
Token Austin
Token ID9533
Feature activation-2.39e-01

Pos logit contributions
NRS+0.007
 gloss+0.007
 UP+0.007
 Act+0.007
 Rost+0.007
Neg logit contributions
ason-0.013
arton-0.012
asons-0.011
Write-0.011
odox-0.011
Token Booker
Token ID31004
Feature activation-7.47e-02

Pos logit contributions
NRS+0.036
 UP+0.036
 Act+0.036
 gloss+0.035
 Rost+0.035
Neg logit contributions
ason-0.048
arton-0.046
asons-0.042
Write-0.042
odox-0.042
Token is
Token ID318
Feature activation-8.24e-02

Pos logit contributions
NRS+0.044
 UP+0.043
 Act+0.043
 gloss+0.043
 Rost+0.043
Neg logit contributions
ason-0.057
arton-0.054
asons-0.050
odox-0.049
Write-0.049
Token not
Token ID407
Feature activation-1.95e-02

Pos logit contributions
NRS+0.050
 UP+0.049
 Act+0.049
 gloss+0.048
 Rost+0.048
Neg logit contributions
ason-0.061
arton-0.058
asons-0.053
Write-0.053
odox-0.053
Token just
Token ID655
Feature activation+9.55e-02

Pos logit contributions
NRS+0.012
 UP+0.012
 Act+0.012
 gloss+0.012
 Rost+0.012
Neg logit contributions
ason-0.014
arton-0.014
Write-0.012
asons-0.012
odox-0.012
Token in
Token ID287
Feature activation-1.07e-02

Pos logit contributions
ason+0.072
arton+0.068
asons+0.063
odox+0.063
Write+0.062
Neg logit contributions
NRS-0.057
 UP-0.055
 Act-0.055
 gloss-0.055
 Rost-0.055
Tokenoffensive
Token ID45055
Feature activation-5.40e-01

Pos logit contributions
NRS+0.006
 UP+0.006
 gloss+0.006
 Act+0.006
 Rost+0.006
Neg logit contributions
ason-0.009
arton-0.008
asons-0.008
Write-0.007
odox-0.007
Token,
Token ID11
Feature activation-4.44e-02

Pos logit contributions
NRS+0.329
 UP+0.325
 Act+0.324
 gloss+0.323
 Rost+0.321
Neg logit contributions
ason-0.396
arton-0.378
asons-0.344
Write-0.342
odox-0.342
Token but
Token ID475
Feature activation+4.43e-02

Pos logit contributions
NRS+0.029
 UP+0.029
 Act+0.029
 gloss+0.029
 Rost+0.029
Neg logit contributions
ason-0.030
arton-0.029
asons-0.026
odox-0.026
Write-0.026
Token corrupt
Token ID10622
Feature activation+2.02e-01

Pos logit contributions
ason+0.032
arton+0.030
asons+0.028
Write+0.028
odox+0.028
Neg logit contributions
NRS-0.028
 UP-0.027
 Act-0.027
 gloss-0.027
 Rost-0.027
Token.
Token ID13
Feature activation-3.54e-02

Pos logit contributions
ason+0.151
arton+0.144
asons+0.131
odox+0.130
Write+0.130
Neg logit contributions
NRS-0.121
 UP-0.120
 Act-0.119
 gloss-0.119
 Rost-0.118
Token<|endoftext|>
Token ID50256
Feature activation+9.30e-02

Pos logit contributions
NRS+0.022
 gloss+0.021
 UP+0.021
 Act+0.021
 Rost+0.021
Neg logit contributions
ason-0.026
arton-0.025
asons-0.022
Write-0.022
odox-0.022
Token-
Token ID12
Feature activation+3.16e-02

Pos logit contributions
NRS+0.064
 UP+0.063
 gloss+0.063
 Rost+0.062
 Act+0.062
Neg logit contributions
ason-0.075
arton-0.071
asons-0.065
Write-0.064
antry-0.064
Tokenyear
Token ID1941
Feature activation+5.83e-02

Pos logit contributions
ason+0.022
arton+0.021
asons+0.019
Write+0.019
antry+0.018
Neg logit contributions
NRS-0.021
 UP-0.020
 gloss-0.020
 Act-0.020
 Rost-0.020
Token-
Token ID12
Feature activation+5.06e-02

Pos logit contributions
ason+0.043
arton+0.041
asons+0.037
odox+0.037
Write+0.037
Neg logit contributions
NRS-0.035
 UP-0.035
 Act-0.035
 gloss-0.035
 Rost-0.035
Tokenold
Token ID727
Feature activation+2.81e-02

Pos logit contributions
ason+0.029
arton+0.028
asons+0.025
Write+0.024
antry+0.024
Neg logit contributions
NRS-0.039
 UP-0.038
 gloss-0.038
 Act-0.038
 Rost-0.038
Token Hed
Token ID30920
Feature activation-5.33e-02

Pos logit contributions
ason+0.022
arton+0.021
asons+0.019
Write+0.019
antry+0.019
Neg logit contributions
NRS-0.016
 UP-0.015
 gloss-0.015
 Act-0.015
 Rost-0.015
Tokenges
Token ID3212
Feature activation-5.38e-01

Pos logit contributions
NRS+0.014
 UP+0.014
 Act+0.014
 gloss+0.013
 Rost+0.013
Neg logit contributions
ason-0.058
arton-0.056
asons-0.052
odox-0.052
Write-0.052
Token is
Token ID318
Feature activation-5.25e-02

Pos logit contributions
NRS+0.338
 UP+0.334
 Act+0.333
 gloss+0.332
 Rost+0.330
Neg logit contributions
ason-0.385
arton-0.366
asons-0.333
Write-0.331
odox-0.331
Token sure
Token ID1654
Feature activation+1.50e-01

Pos logit contributions
NRS+0.031
 UP+0.031
 Act+0.031
 gloss+0.031
 Rost+0.030
Neg logit contributions
ason-0.039
arton-0.037
asons-0.034
Write-0.034
odox-0.034
Token to
Token ID284
Feature activation+3.75e-03

Pos logit contributions
ason+0.113
arton+0.107
asons+0.099
Write+0.098
antry+0.098
Neg logit contributions
NRS-0.089
 UP-0.087
 gloss-0.087
 Act-0.087
 Rost-0.086
Token be
Token ID307
Feature activation+1.02e-01

Pos logit contributions
ason+0.003
arton+0.002
asons+0.002
odox+0.002
Write+0.002
Neg logit contributions
NRS-0.002
 UP-0.002
 Act-0.002
 gloss-0.002
 Rost-0.002
Token a
Token ID257
Feature activation-3.40e-03

Pos logit contributions
ason+0.077
arton+0.073
asons+0.067
Write+0.067
odox+0.067
Neg logit contributions
NRS-0.060
 UP-0.059
 Act-0.059
 gloss-0.059
 Rost-0.058
Token:
Token ID25
Feature activation-3.52e-02

Pos logit contributions
ason+0.064
arton+0.061
asons+0.056
odox+0.055
Write+0.055
Neg logit contributions
NRS-0.055
 UP-0.055
 Act-0.054
 gloss-0.054
 Rost-0.054
Token "
Token ID366
Feature activation-1.49e-01

Pos logit contributions
NRS+0.018
 UP+0.018
 gloss+0.018
 Act+0.018
 Rost+0.018
Neg logit contributions
ason-0.029
arton-0.028
asons-0.026
Write-0.025
odox-0.025
TokenDefense
Token ID27300
Feature activation+4.33e-03

Pos logit contributions
NRS+0.091
 UP+0.090
 gloss+0.090
 Act+0.090
 Rost+0.089
Neg logit contributions
ason-0.108
arton-0.103
asons-0.094
Write-0.094
odox-0.094
Token Of
Token ID3226
Feature activation-2.91e-02

Pos logit contributions
ason+0.003
arton+0.003
asons+0.003
odox+0.003
Write+0.003
Neg logit contributions
NRS-0.003
 UP-0.003
 Act-0.003
 gloss-0.003
 Rost-0.003
Token Marriage
Token ID24509
Feature activation+2.89e-02

Pos logit contributions
NRS+0.016
 UP+0.016
 Act+0.015
 gloss+0.015
 Rost+0.015
Neg logit contributions
ason-0.023
arton-0.022
asons-0.021
odox-0.021
Write-0.021
Token Act
Token ID2191
Feature activation-5.36e-01

Pos logit contributions
ason+0.027
arton+0.026
asons+0.024
odox+0.024
Write+0.024
Neg logit contributions
NRS-0.012
 UP-0.012
 gloss-0.012
 Act-0.012
 Rost-0.012
Token").
Token ID11074
Feature activation+2.63e-02

Pos logit contributions
NRS+0.385
 Act+0.383
 UP+0.383
 gloss+0.380
 Rost+0.378
Neg logit contributions
ason-0.334
arton-0.317
asons-0.284
odox-0.282
Write-0.282
Token But
Token ID887
Feature activation+9.23e-02

Pos logit contributions
ason+0.020
arton+0.019
asons+0.018
Write+0.018
odox+0.017
Neg logit contributions
NRS-0.015
 UP-0.015
 gloss-0.015
 Act-0.015
 Rost-0.015
Token the
Token ID262
Feature activation-3.80e-02

Pos logit contributions
ason+0.068
arton+0.065
asons+0.059
Write+0.059
odox+0.059
Neg logit contributions
NRS-0.056
 UP-0.055
 Act-0.054
 gloss-0.054
 Rost-0.054
Token whole
Token ID2187
Feature activation-1.48e-02

Pos logit contributions
NRS+0.021
 UP+0.021
 gloss+0.021
 Act+0.021
 Rost+0.021
Neg logit contributions
ason-0.030
arton-0.028
asons-0.026
odox-0.026
Write-0.026
Token argument
Token ID4578
Feature activation+1.93e-01

Pos logit contributions
NRS+0.009
 UP+0.009
 gloss+0.008
 Act+0.008
 Rost+0.008
Neg logit contributions
ason-0.011
arton-0.011
asons-0.010
odox-0.010
Write-0.010
Token queue
Token ID16834
Feature activation-1.03e-01

Pos logit contributions
NRS+0.004
 UP+0.004
 Act+0.004
 gloss+0.004
 Rost+0.004
Neg logit contributions
ason-0.005
arton-0.005
asons-0.005
Write-0.005
odox-0.005
Token delay
Token ID5711
Feature activation-3.42e-02

Pos logit contributions
NRS+0.064
 UP+0.063
 Act+0.063
 gloss+0.063
 Rost+0.062
Neg logit contributions
ason-0.075
arton-0.071
asons-0.065
Write-0.064
odox-0.064
Token,
Token ID11
Feature activation-2.58e-02

Pos logit contributions
NRS+0.021
 UP+0.021
 Act+0.021
 gloss+0.021
 Rost+0.021
Neg logit contributions
ason-0.025
arton-0.024
asons-0.021
odox-0.021
Write-0.021
Token and
Token ID290
Feature activation-3.79e-02

Pos logit contributions
NRS+0.015
 UP+0.015
 Act+0.015
 gloss+0.015
 Rost+0.015
Neg logit contributions
ason-0.020
arton-0.019
asons-0.017
Write-0.017
antry-0.017
Token conver
Token ID6718
Feature activation+1.86e-01

Pos logit contributions
NRS+0.022
 UP+0.022
 Act+0.022
 gloss+0.022
 Rost+0.022
Neg logit contributions
ason-0.029
arton-0.027
Write-0.025
asons-0.025
antry-0.025
Tokenges
Token ID3212
Feature activation-5.31e-01

Pos logit contributions
ason+0.232
arton+0.226
asons+0.214
odox+0.214
Write+0.214
Neg logit contributions
NRS-0.018
 UP-0.017
 Act-0.016
 gloss-0.016
 Rost-0.015
Token with
Token ID351
Feature activation-5.71e-02

Pos logit contributions
NRS+0.318
 UP+0.312
 Act+0.312
 gloss+0.311
 Rost+0.309
Neg logit contributions
ason-0.396
arton-0.377
asons-0.344
Write-0.343
antry-0.341
Token high
Token ID1029
Feature activation-1.92e-02

Pos logit contributions
NRS+0.032
 UP+0.031
 gloss+0.031
 Act+0.030
panel+0.030
Neg logit contributions
ason-0.045
arton-0.043
Write-0.040
asons-0.039
odox-0.039
Token probability
Token ID12867
Feature activation-2.77e-02

Pos logit contributions
NRS+0.010
 UP+0.010
 Act+0.010
 gloss+0.010
 Rost+0.010
Neg logit contributions
ason-0.016
arton-0.015
Write-0.014
asons-0.014
antry-0.013
Token to
Token ID284
Feature activation-5.66e-02

Pos logit contributions
NRS+0.017
 UP+0.016
 Act+0.016
 gloss+0.016
 Rost+0.016
Neg logit contributions
ason-0.021
arton-0.020
asons-0.018
Write-0.018
odox-0.018
Token Klein
Token ID22864
Feature activation-5.67e-02

Pos logit contributions
NRS+0.035
 UP+0.035
 Act+0.034
 gloss+0.034
 Rost+0.034
Neg logit contributions
ason-0.041
arton-0.039
asons-0.035
Write-0.035
antry-0.035
Tokenthe
Token ID1169
Feature activation-1.75e-01

Pos logit contributions
NRS+0.032
 UP+0.032
 Act+0.032
 gloss+0.032
 Rost+0.031
Neg logit contributions
ason-0.040
arton-0.038
asons-0.035
odox-0.035
Write-0.035
Token petty
Token ID25229
Feature activation+4.89e-04

Pos logit contributions
NRS+0.099
 Act+0.097
 UP+0.097
 gloss+0.096
 Rost+0.096
Neg logit contributions
ason-0.136
arton-0.130
asons-0.119
odox-0.119
antry-0.118
Token-
Token ID12
Feature activation-8.12e-02

Pos logit contributions
ason+0.000
arton+0.000
asons+0.000
Write+0.000
odox+0.000
Neg logit contributions
NRS-0.000
 UP-0.000
 Act-0.000
 gloss-0.000
 Rost-0.000
Tokenbour
Token ID6084
Feature activation+1.64e-01

Pos logit contributions
NRS+0.074
 UP+0.074
 Act+0.073
 gloss+0.073
 Rost+0.073
Neg logit contributions
ason-0.035
arton-0.032
asons-0.027
Write-0.027
odox-0.027
Tokenge
Token ID469
Feature activation-2.63e-01

Pos logit contributions
ason+0.133
arton+0.127
asons+0.117
odox+0.117
Write+0.117
Neg logit contributions
NRS-0.088
 UP-0.087
 Act-0.086
 gloss-0.086
 Rost-0.085
Tokenois
Token ID10924
Feature activation-5.30e-01

Pos logit contributions
NRS+0.057
 UP+0.056
 Act+0.055
 gloss+0.055
 Rost+0.053
Neg logit contributions
ason-0.296
arton-0.287
asons-0.271
odox-0.270
Write-0.270
Token intellig
Token ID10878
Feature activation-4.27e-02

Pos logit contributions
NRS+0.267
 UP+0.264
 Act+0.263
 gloss+0.262
 Rost+0.260
Neg logit contributions
ason-0.444
arton-0.426
asons-0.393
odox-0.392
Write-0.392
Tokenents
Token ID658
Feature activation+1.56e-01

Pos logit contributions
NRS+0.038
 UP+0.038
 Act+0.038
 gloss+0.038
 Rost+0.038
Neg logit contributions
ason-0.019
arton-0.018
asons-0.015
odox-0.015
Write-0.015
Tokenia
Token ID544
Feature activation-2.49e-01

Pos logit contributions
ason+0.144
arton+0.139
asons+0.129
odox+0.129
Write+0.129
Neg logit contributions
NRS-0.065
 UP-0.064
 Act-0.064
 gloss-0.064
 Rost-0.063
Token forged
Token ID24918
Feature activation-1.76e-01

Pos logit contributions
NRS+0.151
 UP+0.149
 Act+0.149
 gloss+0.149
 Rost+0.147
Neg logit contributions
ason-0.183
arton-0.175
asons-0.159
odox-0.158
Write-0.158
Token in
Token ID287
Feature activation+2.77e-02

Pos logit contributions
NRS+0.098
 UP+0.096
 Act+0.096
 gloss+0.096
 Rost+0.095
Neg logit contributions
ason-0.138
arton-0.132
asons-0.121
odox-0.121
Write-0.121
Token world
Token ID995
Feature activation+1.35e-01

Pos logit contributions
NRS+0.002
 UP+0.002
 Act+0.002
 gloss+0.002
 Rost+0.002
Neg logit contributions
ason-0.002
arton-0.002
asons-0.001
odox-0.001
Write-0.001
Token.
Token ID13
Feature activation+4.21e-02

Pos logit contributions
ason+0.099
arton+0.095
asons+0.086
odox+0.086
Write+0.086
Neg logit contributions
NRS-0.082
 UP-0.082
 Act-0.081
 gloss-0.081
 Rost-0.080
Token (
Token ID357
Feature activation+2.79e-01

Pos logit contributions
ason+0.022
arton+0.021
asons+0.018
Write+0.018
odox+0.018
Neg logit contributions
NRS-0.034
 UP-0.034
 gloss-0.034
 Act-0.034
 Rost-0.034
TokenWa
Token ID33484
Feature activation-9.51e-02

Pos logit contributions
ason+0.217
arton+0.207
asons+0.190
odox+0.189
Write+0.189
Neg logit contributions
NRS-0.158
 UP-0.156
 Act-0.155
 gloss-0.155
 Rost-0.154
Tokenub
Token ID549
Feature activation+2.87e-02

Pos logit contributions
NRS+0.075
 UP+0.074
 Act+0.074
 gloss+0.074
 Rost+0.074
Neg logit contributions
ason-0.053
arton-0.050
asons-0.044
odox-0.043
Write-0.043
Tokenges
Token ID3212
Feature activation-5.26e-01

Pos logit contributions
ason+0.036
arton+0.035
asons+0.033
Write+0.033
odox+0.033
Neg logit contributions
NRS-0.003
 UP-0.003
 Act-0.003
 gloss-0.002
 Rost-0.002
Tokenhig
Token ID25196
Feature activation-2.74e-02

Pos logit contributions
NRS+0.300
 UP+0.297
 Act+0.296
 gloss+0.296
 Rost+0.294
Neg logit contributions
ason-0.407
arton-0.388
asons-0.356
Write-0.353
odox-0.353
Token Rice
Token ID13823
Feature activation+4.85e-02

Pos logit contributions
NRS+0.019
 UP+0.019
 gloss+0.019
 Act+0.019
 Rost+0.019
Neg logit contributions
ason-0.018
arton-0.017
asons-0.015
Write-0.015
antry-0.015
Token/
Token ID14
Feature activation+8.58e-03

Pos logit contributions
ason+0.033
arton+0.032
asons+0.029
Write+0.029
antry+0.028
Neg logit contributions
NRS-0.032
 UP-0.031
 gloss-0.031
 Act-0.031
 Rost-0.031
TokenCBC
Token ID29208
Feature activation-6.19e-02

Pos logit contributions
ason+0.008
arton+0.007
asons+0.007
Write+0.007
odox+0.007
Neg logit contributions
NRS-0.004
 UP-0.004
 gloss-0.004
 Act-0.004
 Rost-0.004
Token)
Token ID8
Feature activation+8.35e-02

Pos logit contributions
NRS+0.046
 UP+0.046
 Act+0.046
 gloss+0.046
 Rost+0.045
Neg logit contributions
ason-0.037
arton-0.035
asons-0.031
antry-0.031
odox-0.031
Token Post
Token ID2947
Feature activation-1.91e-01

Pos logit contributions
NRS+0.019
 UP+0.019
 gloss+0.019
 Act+0.018
 Rost+0.018
Neg logit contributions
ason-0.035
arton-0.034
asons-0.031
odox-0.031
Write-0.031
Token reading
Token ID3555
Feature activation+1.27e-01

Pos logit contributions
NRS+0.121
 UP+0.120
 Act+0.119
 gloss+0.119
 Rost+0.118
Neg logit contributions
ason-0.136
arton-0.130
asons-0.118
odox-0.117
Write-0.117
Token "
Token ID366
Feature activation-1.24e-01

Pos logit contributions
ason+0.102
arton+0.098
asons+0.090
odox+0.090
Write+0.090
Neg logit contributions
NRS-0.068
 UP-0.067
 Act-0.067
 gloss-0.067
 Rost-0.066
TokenISIS
Token ID29322
Feature activation+9.85e-02

Pos logit contributions
NRS+0.076
 UP+0.075
 Act+0.075
 gloss+0.075
 Rost+0.074
Neg logit contributions
ason-0.090
arton-0.086
asons-0.078
odox-0.078
Write-0.078
Token Ur
Token ID8799
Feature activation+7.60e-02

Pos logit contributions
ason+0.075
arton+0.072
asons+0.066
odox+0.066
Write+0.066
Neg logit contributions
NRS-0.057
 UP-0.056
 Act-0.056
 gloss-0.056
 Rost-0.055
Tokenges
Token ID3212
Feature activation-5.25e-01

Pos logit contributions
ason+0.088
arton+0.086
asons+0.081
Write+0.081
odox+0.081
Neg logit contributions
NRS-0.014
 UP-0.013
 gloss-0.013
 Act-0.013
 Rost-0.013
Token Jihad
Token ID35243
Feature activation+2.59e-01

Pos logit contributions
NRS+0.318
 UP+0.318
 Act+0.316
 gloss+0.314
 Rost+0.312
Neg logit contributions
ason-0.385
arton-0.369
asons-0.336
odox-0.335
antry-0.334
Tokenists
Token ID1023
Feature activation-2.38e-02

Pos logit contributions
ason+0.153
arton+0.145
asons+0.129
odox+0.128
Write+0.128
Neg logit contributions
NRS-0.193
 Act-0.193
 UP-0.192
 gloss-0.191
 Rost-0.190
Token To
Token ID1675
Feature activation+2.45e-05

Pos logit contributions
 Act+0.013
 UP+0.013
NRS+0.013
 gloss+0.013
 Rost+0.013
Neg logit contributions
ason-0.019
arton-0.018
odox-0.016
asons-0.016
antry-0.016
Token Attack
Token ID8307
Feature activation-1.82e-01

Pos logit contributions
ason+0.000
arton+0.000
asons+0.000
Write+0.000
odox+0.000
Neg logit contributions
NRS-0.000
 UP-0.000
 Act-0.000
 gloss-0.000
 Rost-0.000
Token Canadians
Token ID14008
Feature activation+6.58e-02

Pos logit contributions
 UP+0.109
NRS+0.109
 Act+0.109
 gloss+0.107
 Rost+0.107
Neg logit contributions
ason-0.135
arton-0.129
asons-0.118
odox-0.118
Write-0.117
Token.:
Token ID11207
Feature activation-2.55e-01

Pos logit contributions
NRS+0.027
 UP+0.027
 Act+0.027
 gloss+0.027
 Rost+0.026
Neg logit contributions
ason-0.053
arton-0.051
odox-0.047
asons-0.047
Write-0.047
Token New
Token ID968
Feature activation-8.09e-02

Pos logit contributions
NRS+0.152
 UP+0.151
 Act+0.151
 gloss+0.150
 Rost+0.149
Neg logit contributions
ason-0.190
arton-0.181
asons-0.166
odox-0.165
antry-0.165
Token York
Token ID1971
Feature activation-1.46e-01

Pos logit contributions
NRS+0.032
 UP+0.032
 Act+0.032
 gloss+0.032
 Rost+0.032
Neg logit contributions
ason-0.076
arton-0.073
asons-0.068
odox-0.068
Write-0.068
Token,
Token ID11
Feature activation+5.24e-02

Pos logit contributions
 UP+0.091
 Act+0.090
NRS+0.090
 Rost+0.088
 gloss+0.088
Neg logit contributions
ason-0.107
arton-0.101
odox-0.093
asons-0.093
antry-0.093
Token NY
Token ID6645
Feature activation-6.19e-02

Pos logit contributions
ason+0.050
arton+0.048
asons+0.045
odox+0.044
Write+0.044
Neg logit contributions
NRS-0.021
 UP-0.021
 Act-0.020
 gloss-0.020
 Rost-0.020
Token $
Token ID720
Feature activation+3.98e-01

Pos logit contributions
 UP+0.054
NRS+0.054
 Act+0.054
 gloss+0.053
 Rost+0.053
Neg logit contributions
ason-0.029
arton-0.027
odox-0.024
asons-0.024
antry-0.023
Token5
Token ID20
Feature activation-1.36e-01

Pos logit contributions
ason+0.304
arton+0.290
asons+0.266
Write+0.265
odox+0.264
Neg logit contributions
NRS-0.230
 UP-0.228
 Act-0.227
 gloss-0.226
 Rost-0.225
Token,
Token ID11
Feature activation+3.21e-02

Pos logit contributions
NRS+0.083
 UP+0.083
 Act+0.082
 gloss+0.082
 Rost+0.081
Neg logit contributions
ason-0.100
arton-0.095
asons-0.087
odox-0.087
Write-0.086
Token450
Token ID17885
Feature activation+5.29e-02

Pos logit contributions
ason+0.021
arton+0.020
asons+0.018
odox+0.017
Write+0.017
Neg logit contributions
NRS-0.022
 UP-0.022
 Act-0.022
 gloss-0.022
 Rost-0.022
Token Membership
Token ID37939
Feature activation+1.63e-01

Pos logit contributions
ason+0.037
arton+0.036
asons+0.032
odox+0.032
Write+0.032
Neg logit contributions
NRS-0.034
 UP-0.033
 Act-0.033
 gloss-0.033
 Rost-0.033
Token grant
Token ID7264
Feature activation-1.35e-01

Pos logit contributions
ason+0.156
arton+0.151
asons+0.141
odox+0.141
Write+0.140
Neg logit contributions
NRS-0.062
 UP-0.061
 Act-0.061
 gloss-0.060
 Rost-0.059
Token F
Token ID376
Feature activation+3.25e-02

Pos logit contributions
NRS+0.002
 UP+0.002
 Act+0.002
 gloss+0.002
 Rost+0.002
Neg logit contributions
ason-0.003
arton-0.003
asons-0.003
odox-0.003
Write-0.003
Tokenocal
Token ID4374
Feature activation-3.08e-02

Pos logit contributions
ason+0.023
arton+0.022
asons+0.020
odox+0.019
Write+0.019
Neg logit contributions
NRS-0.021
 UP-0.021
 Act-0.021
 gloss-0.021
 Rost-0.020
Token ac
Token ID936
Feature activation+1.52e-01

Pos logit contributions
NRS+0.019
 UP+0.018
 gloss+0.018
 Act+0.018
 Rost+0.018
Neg logit contributions
ason-0.023
arton-0.022
asons-0.020
Write-0.020
odox-0.020
Tokenral
Token ID1373
Feature activation+1.00e-01

Pos logit contributions
ason+0.106
arton+0.101
asons+0.091
Write+0.091
odox+0.091
Neg logit contributions
NRS-0.099
 UP-0.098
 Act-0.097
 gloss-0.097
 Rost-0.096
Token hyper
Token ID8718
Feature activation+1.04e-01

Pos logit contributions
ason+0.073
arton+0.070
Write+0.064
asons+0.064
odox+0.064
Neg logit contributions
 gloss-0.061
 Act-0.060
NRS-0.060
 UP-0.060
 Rost-0.059
Tokenker
Token ID6122
Feature activation+4.45e-01

Pos logit contributions
ason+0.073
arton+0.070
Write+0.064
asons+0.064
odox+0.063
Neg logit contributions
NRS-0.065
 gloss-0.065
 Act-0.065
 UP-0.065
 Rost-0.064
Tokenat
Token ID265
Feature activation+2.48e-02

Pos logit contributions
ason+0.292
arton+0.280
Write+0.256
asons+0.252
odox+0.251
Neg logit contributions
 Act-0.296
NRS-0.295
 gloss-0.295
 UP-0.294
 Rost-0.293
Tokenosis
Token ID5958
Feature activation+3.23e-01

Pos logit contributions
ason+0.010
arton+0.009
Write+0.008
asons+0.008
odox+0.008
Neg logit contributions
NRS-0.023
 Act-0.023
 gloss-0.023
 UP-0.023
 Rost-0.023
Token (
Token ID357
Feature activation+2.89e-01

Pos logit contributions
ason+0.200
arton+0.189
asons+0.169
odox+0.168
Write+0.168
Neg logit contributions
NRS-0.233
 UP-0.232
 Act-0.231
 gloss-0.231
 Rost-0.229
Tokenalso
Token ID14508
Feature activation+1.69e-01

Pos logit contributions
ason+0.187
arton+0.178
asons+0.160
Write+0.159
odox+0.158
Neg logit contributions
NRS-0.200
 UP-0.199
 Act-0.198
 gloss-0.198
 Rost-0.197
Token known
Token ID1900
Feature activation-5.97e-02

Pos logit contributions
ason+0.157
arton+0.151
asons+0.141
odox+0.140
Write+0.140
Neg logit contributions
NRS-0.070
 UP-0.070
 Act-0.070
 gloss-0.069
 Rost-0.068
Token
Token ID134
Feature activation+3.56e-01

Pos logit contributions
ason+0.046
arton+0.044
asons+0.040
odox+0.040
Write+0.040
Neg logit contributions
NRS-0.041
 UP-0.041
 Act-0.041
 gloss-0.041
 Rost-0.040
Token
Token ID119
Feature activation+1.70e-01

Pos logit contributions
ason+0.196
arton+0.184
asons+0.162
odox+0.161
Write+0.161
Neg logit contributions
NRS-0.282
 UP-0.280
 Act-0.280
 gloss-0.279
 Rost-0.277
Tokenole
Token ID2305
Feature activation+1.64e-01

Pos logit contributions
ason+0.115
arton+0.109
asons+0.099
odox+0.098
Write+0.098
Neg logit contributions
NRS-0.113
 UP-0.112
 Act-0.112
 gloss-0.112
 Rost-0.111
Token i
Token ID1312
Feature activation+2.34e-01

Pos logit contributions
ason+0.134
arton+0.128
asons+0.118
odox+0.118
Write+0.118
Neg logit contributions
NRS-0.087
 UP-0.086
 Act-0.085
 gloss-0.085
 Rost-0.084
Token
Token ID220
Feature activation+1.42e-01

Pos logit contributions
ason+0.179
arton+0.171
asons+0.156
odox+0.156
Write+0.155
Neg logit contributions
NRS-0.136
 UP-0.135
 Act-0.134
 gloss-0.134
 Rost-0.133
Token
Token ID134
Feature activation+4.19e-01

Pos logit contributions
ason+0.068
arton+0.064
asons+0.055
antry+0.054
odox+0.054
Neg logit contributions
NRS-0.122
 UP-0.122
 Act-0.122
 gloss-0.121
 Rost-0.120
Token
Token ID119
Feature activation+1.93e-01

Pos logit contributions
ason+0.226
arton+0.211
asons+0.185
odox+0.184
Write+0.184
Neg logit contributions
NRS-0.338
 UP-0.335
 Act-0.334
 gloss-0.333
 Rost-0.332
Tokenemo
Token ID41903
Feature activation+5.04e-02

Pos logit contributions
ason+0.130
arton+0.124
asons+0.112
odox+0.111
Write+0.111
Neg logit contributions
NRS-0.128
 UP-0.127
 Act-0.127
 gloss-0.126
 Rost-0.126
Token ko
Token ID41727
Feature activation-2.96e-01

Pos logit contributions
ason+0.034
arton+0.032
asons+0.029
odox+0.029
Write+0.029
Neg logit contributions
NRS-0.034
 UP-0.034
 Act-0.034
 gloss-0.033
 Rost-0.033
Token Kal
Token ID12612
Feature activation-1.34e-03

Pos logit contributions
NRS+0.240
 UP+0.239
 Act+0.239
 gloss+0.239
 Rost+0.237
Neg logit contributions
ason-0.156
arton-0.147
asons-0.128
odox-0.128
Write-0.127
Tokenan
Token ID272
Feature activation+1.54e-01

Pos logit contributions
NRS+0.001
 UP+0.001
 Act+0.001
 gloss+0.001
 Rost+0.001
Neg logit contributions
ason-0.001
arton-0.001
asons-0.001
Write-0.001
odox-0.001
Token,
Token ID11
Feature activation+1.50e-02

Pos logit contributions
ason+0.019
arton+0.018
asons+0.017
odox+0.017
Write+0.016
Neg logit contributions
NRS-0.017
 UP-0.017
 Act-0.017
 gloss-0.017
 Rost-0.017
Token and
Token ID290
Feature activation+2.39e-02

Pos logit contributions
ason+0.011
arton+0.011
asons+0.010
Write+0.010
odox+0.010
Neg logit contributions
NRS-0.009
 UP-0.009
 Act-0.009
 gloss-0.009
 Rost-0.009
Token no
Token ID645
Feature activation+4.86e-03

Pos logit contributions
ason+0.018
arton+0.017
asons+0.016
Write+0.015
antry+0.015
Neg logit contributions
NRS-0.014
 UP-0.014
 gloss-0.014
 Act-0.014
 Rost-0.014
Token personnel
Token ID8213
Feature activation-1.94e-02

Pos logit contributions
ason+0.004
arton+0.004
asons+0.003
Write+0.003
antry+0.003
Neg logit contributions
NRS-0.003
 UP-0.003
 gloss-0.003
 Act-0.003
 Rost-0.003
Token have
Token ID423
Feature activation-1.06e-02

Pos logit contributions
NRS+0.012
 UP+0.012
 gloss+0.012
 Act+0.012
 Rost+0.011
Neg logit contributions
ason-0.014
arton-0.014
asons-0.012
Write-0.012
antry-0.012
Token exceeded
Token ID20672
Feature activation+3.83e-01

Pos logit contributions
NRS+0.007
 UP+0.007
 Act+0.007
 gloss+0.007
 Rost+0.007
Neg logit contributions
ason-0.008
arton-0.007
asons-0.006
Write-0.006
antry-0.006
Token 20
Token ID1160
Feature activation+1.54e-01

Pos logit contributions
ason+0.276
arton+0.262
Write+0.239
asons+0.237
antry+0.236
Neg logit contributions
NRS-0.238
 gloss-0.232
 UP-0.231
 Act-0.230
 Rost-0.229
Token m
Token ID285
Feature activation+8.08e-02

Pos logit contributions
ason+0.112
arton+0.106
asons+0.097
Write+0.096
odox+0.096
Neg logit contributions
NRS-0.095
 UP-0.094
 gloss-0.094
 Act-0.094
 Rost-0.093
TokenS
Token ID50
Feature activation-7.07e-02

Pos logit contributions
ason+0.069
arton+0.067
asons+0.062
odox+0.062
Write+0.062
Neg logit contributions
NRS-0.039
 UP-0.038
 Act-0.038
 gloss-0.038
 Rost-0.038
Tokenv
Token ID85
Feature activation+1.22e-01

Pos logit contributions
NRS+0.052
 UP+0.052
 Act+0.051
 gloss+0.051
 Rost+0.051
Neg logit contributions
ason-0.043
arton-0.041
asons-0.036
Write-0.036
odox-0.036
Token in
Token ID287
Feature activation+3.30e-02

Pos logit contributions
ason+0.091
arton+0.086
asons+0.079
Write+0.079
antry+0.078
Neg logit contributions
NRS-0.073
 UP-0.072
 Act-0.072
 gloss-0.072
 Rost-0.071
Token extracted
Token ID21242
Feature activation+1.72e-02

Pos logit contributions
ason+0.358
arton+0.342
odox+0.311
asons+0.310
Write+0.309
Neg logit contributions
NRS-0.336
 Act-0.336
 UP-0.336
 gloss-0.335
 Rost-0.332
Token his
Token ID465
Feature activation+3.34e-02

Pos logit contributions
ason+0.012
arton+0.011
asons+0.010
Write+0.010
odox+0.010
Neg logit contributions
NRS-0.011
 UP-0.011
 Act-0.011
 gloss-0.011
 Rost-0.011
Token nickel
Token ID30666
Feature activation+1.54e-01

Pos logit contributions
ason+0.019
arton+0.018
asons+0.016
odox+0.016
Write+0.016
Neg logit contributions
NRS-0.026
 UP-0.025
 Act-0.025
 gloss-0.025
 Rost-0.025
Token.
Token ID13
Feature activation-5.09e-03

Pos logit contributions
ason+0.111
arton+0.106
asons+0.097
odox+0.096
Write+0.096
Neg logit contributions
NRS-0.096
 UP-0.095
 Act-0.095
 gloss-0.094
 Rost-0.094
Token Wh
Token ID854
Feature activation-6.90e-02

Pos logit contributions
NRS+0.003
 UP+0.003
 Act+0.003
 gloss+0.003
 Rost+0.003
Neg logit contributions
ason-0.004
arton-0.004
asons-0.003
odox-0.003
Write-0.003
Tokenarton
Token ID41328
Feature activation+5.45e-01

Pos logit contributions
NRS+0.089
 UP+0.089
 Act+0.089
 gloss+0.088
 Rost+0.088
Neg logit contributions
ason-0.003
arton-0.000
Write+0.003
asons+0.003
odox+0.003
Token had
Token ID550
Feature activation-9.94e-02

Pos logit contributions
ason+0.383
arton+0.364
asons+0.330
odox+0.328
Write+0.328
Neg logit contributions
NRS-0.350
 UP-0.345
 Act-0.342
 gloss-0.342
 Rost-0.340
Token spent
Token ID3377
Feature activation-1.45e-01

Pos logit contributions
NRS+0.060
 UP+0.059
 Act+0.059
 gloss+0.059
 Rost+0.059
Neg logit contributions
ason-0.073
arton-0.070
asons-0.064
Write-0.064
odox-0.064
Token $
Token ID720
Feature activation+3.35e-01

Pos logit contributions
NRS+0.089
 UP+0.088
 Act+0.088
 gloss+0.088
 Rost+0.087
Neg logit contributions
ason-0.106
arton-0.102
asons-0.092
odox-0.092
Write-0.092
Token200
Token ID2167
Feature activation+3.47e-02

Pos logit contributions
ason+0.248
arton+0.236
asons+0.216
Write+0.215
odox+0.214
Neg logit contributions
NRS-0.201
 UP-0.199
 Act-0.199
 gloss-0.198
 Rost-0.197
Token,
Token ID11
Feature activation-3.68e-02

Pos logit contributions
ason+0.025
arton+0.024
odox+0.022
asons+0.022
Write+0.022
Neg logit contributions
NRS-0.021
 UP-0.021
 Act-0.021
 gloss-0.021
 Rost-0.021
Tokenall
Token ID439
Feature activation+1.07e-01

Pos logit contributions
ason+0.069
arton+0.066
asons+0.060
Write+0.059
odox+0.059
Neg logit contributions
NRS-0.065
 UP-0.064
 Act-0.064
 gloss-0.064
 Rost-0.063
Token that
Token ID326
Feature activation+4.33e-02

Pos logit contributions
ason+0.081
arton+0.077
asons+0.071
odox+0.070
Write+0.070
Neg logit contributions
NRS-0.063
 UP-0.062
 Act-0.062
 gloss-0.062
 Rost-0.061
Token stuff
Token ID3404
Feature activation+1.58e-01

Pos logit contributions
ason+0.031
arton+0.029
asons+0.027
odox+0.026
Write+0.026
Neg logit contributions
NRS-0.027
 UP-0.027
 Act-0.027
 gloss-0.027
 Rost-0.027
Token has
Token ID468
Feature activation-7.48e-02

Pos logit contributions
ason+0.118
arton+0.113
asons+0.103
odox+0.102
Write+0.102
Neg logit contributions
NRS-0.094
 UP-0.094
 Act-0.093
 gloss-0.093
 Rost-0.092
Token an
Token ID281
Feature activation-5.03e-03

Pos logit contributions
NRS+0.048
 UP+0.048
 gloss+0.048
 Act+0.048
 Rost+0.047
Neg logit contributions
ason-0.052
arton-0.050
odox-0.045
asons-0.045
Write-0.045
Token impact
Token ID2928
Feature activation+5.77e-02

Pos logit contributions
NRS+0.003
 UP+0.003
 Act+0.003
 gloss+0.003
 Rost+0.003
Neg logit contributions
ason-0.004
arton-0.004
asons-0.003
odox-0.003
Write-0.003
Token on
Token ID319
Feature activation+6.52e-02

Pos logit contributions
ason+0.038
arton+0.036
asons+0.032
odox+0.032
Write+0.032
Neg logit contributions
NRS-0.040
 UP-0.039
 Act-0.039
 gloss-0.039
 Rost-0.039
Token our
Token ID674
Feature activation-5.68e-02

Pos logit contributions
ason+0.053
arton+0.050
Write+0.046
asons+0.046
antry+0.046
Neg logit contributions
NRS-0.035
 UP-0.034
 Act-0.034
 gloss-0.034
 Rost-0.034
Token long
Token ID890
Feature activation-6.40e-02

Pos logit contributions
NRS+0.032
 UP+0.032
 Act+0.031
 gloss+0.031
 Rost+0.031
Neg logit contributions
ason-0.044
arton-0.042
asons-0.039
odox-0.039
Write-0.039
Token-
Token ID12
Feature activation+5.52e-03

Pos logit contributions
NRS+0.035
 UP+0.034
 Act+0.034
 gloss+0.034
 Rost+0.034
Neg logit contributions
ason-0.051
arton-0.049
asons-0.045
odox-0.045
antry-0.045
Tokenterm
Token ID4354
Feature activation-2.53e-01

Pos logit contributions
ason+0.005
arton+0.005
odox+0.005
asons+0.005
antry+0.005
Neg logit contributions
NRS-0.002
 UP-0.002
 Act-0.002
 gloss-0.002
 Rost-0.002
Token
Token ID447
Feature activation-5.66e-02

Pos logit contributions
NRS+0.052
 UP+0.052
 Act+0.052
 gloss+0.051
 Rost+0.051
Neg logit contributions
ason-0.050
arton-0.048
odox-0.043
asons-0.043
Write-0.043
Token
Token ID247
Feature activation+1.72e-02

Pos logit contributions
NRS+0.041
 UP+0.041
 Act+0.041
 gloss+0.041
 Rost+0.041
Neg logit contributions
ason-0.035
arton-0.033
asons-0.029
Write-0.029
antry-0.029
Tokens
Token ID82
Feature activation+8.04e-04

Pos logit contributions
ason+0.012
arton+0.011
asons+0.010
odox+0.010
Write+0.010
Neg logit contributions
NRS-0.011
 UP-0.011
 Act-0.011
 gloss-0.011
 Rost-0.011
Token has
Token ID468
Feature activation-1.15e-01

Pos logit contributions
ason+0.001
arton+0.001
asons+0.000
Write+0.000
odox+0.000
Neg logit contributions
NRS-0.001
 UP-0.001
 Act-0.001
 gloss-0.000
 Rost-0.000
Token some
Token ID617
Feature activation+2.05e-01

Pos logit contributions
NRS+0.070
 UP+0.069
 Act+0.069
 gloss+0.069
 Rost+0.068
Neg logit contributions
ason-0.085
arton-0.081
asons-0.074
Write-0.073
antry-0.073
Token miscar
Token ID35976
Feature activation+8.04e-02

Pos logit contributions
ason+0.131
arton+0.124
asons+0.111
Write+0.111
odox+0.110
Neg logit contributions
NRS-0.144
 UP-0.143
 Act-0.142
 gloss-0.142
 Rost-0.141
Tokenriages
Token ID16451
Feature activation+2.17e-03

Pos logit contributions
ason+0.064
arton+0.062
Write+0.057
odox+0.057
asons+0.057
Neg logit contributions
 UP-0.043
 Act-0.042
NRS-0.042
 gloss-0.042
 Rost-0.042
Token in
Token ID287
Feature activation-7.77e-03

Pos logit contributions
ason+0.002
arton+0.002
asons+0.001
Write+0.001
antry+0.001
Neg logit contributions
NRS-0.001
 UP-0.001
 Act-0.001
 gloss-0.001
 Rost-0.001
Token the
Token ID262
Feature activation-6.90e-02

Pos logit contributions
NRS+0.004
 UP+0.004
 gloss+0.004
 Rost+0.004
 Act+0.004
Neg logit contributions
ason-0.006
arton-0.006
asons-0.006
Write-0.006
antry-0.006
Token past
Token ID1613
Feature activation+6.06e-03

Pos logit contributions
NRS+0.038
 UP+0.037
 gloss+0.036
 Act+0.036
 Rost+0.036
Neg logit contributions
ason-0.055
arton-0.053
asons-0.048
Write-0.048
antry-0.048
Token,
Token ID11
Feature activation-1.29e-02

Pos logit contributions
ason+0.004
arton+0.004
asons+0.004
Write+0.004
antry+0.004
Neg logit contributions
NRS-0.004
 UP-0.004
 gloss-0.004
 Act-0.004
 Rost-0.004
Token I
Token ID314
Feature activation+9.47e-04

Pos logit contributions
ason+0.031
arton+0.030
asons+0.027
Write+0.027
odox+0.027
Neg logit contributions
NRS-0.031
 UP-0.030
 Act-0.030
 gloss-0.030
 Rost-0.030
Token don
Token ID836
Feature activation-6.05e-02

Pos logit contributions
ason+0.001
arton+0.001
asons+0.001
Write+0.001
antry+0.001
Neg logit contributions
NRS-0.001
 UP-0.001
 Act-0.001
 gloss-0.001
 Rost-0.001
Token
Token ID447
Feature activation-8.09e-02

Pos logit contributions
NRS+0.036
 UP+0.035
 gloss+0.035
 Act+0.035
 Rost+0.035
Neg logit contributions
ason-0.046
arton-0.043
asons-0.040
Write-0.040
antry-0.039
Token
Token ID247
Feature activation-3.45e-02

Pos logit contributions
NRS+0.061
 UP+0.061
 Act+0.060
 gloss+0.060
 Rost+0.060
Neg logit contributions
ason-0.048
arton-0.045
asons-0.040
Write-0.040
antry-0.039
Tokent
Token ID83
Feature activation+3.00e-02

Pos logit contributions
NRS+0.023
 UP+0.023
 Act+0.023
 gloss+0.023
 Rost+0.023
Neg logit contributions
ason-0.023
arton-0.022
asons-0.019
odox-0.019
Write-0.019
Token try
Token ID1949
Feature activation+3.43e-02

Pos logit contributions
ason+0.021
arton+0.020
Write+0.019
asons+0.019
antry+0.019
Neg logit contributions
NRS-0.019
 UP-0.018
 Act-0.018
 Rost-0.018
 gloss-0.018
Token to
Token ID284
Feature activation-7.25e-02

Pos logit contributions
ason+0.026
arton+0.025
asons+0.023
odox+0.023
Write+0.023
Neg logit contributions
NRS-0.019
 UP-0.019
 Act-0.019
 gloss-0.019
 Rost-0.019
Token communicate
Token ID10996
Feature activation-1.24e-01

Pos logit contributions
NRS+0.045
 UP+0.044
 Act+0.044
 Rost+0.044
 gloss+0.044
Neg logit contributions
ason-0.052
arton-0.050
asons-0.045
Write-0.045
antry-0.045
Token with
Token ID351
Feature activation-1.41e-01

Pos logit contributions
NRS+0.076
 UP+0.075
 Act+0.074
 gloss+0.074
 Rost+0.074
Neg logit contributions
ason-0.090
arton-0.086
Write-0.078
asons-0.078
antry-0.078
Token kids
Token ID3988
Feature activation-1.01e-01

Pos logit contributions
NRS+0.073
 UP+0.071
 gloss+0.070
 Act+0.070
 Rost+0.070
Neg logit contributions
ason-0.116
arton-0.111
Write-0.103
asons-0.102
antry-0.102
Token.
Token ID13
Feature activation-4.54e-02

Pos logit contributions
NRS+0.062
 UP+0.061
 Act+0.061
 gloss+0.061
 Rost+0.061
Neg logit contributions
ason-0.074
arton-0.071
asons-0.064
Write-0.064
odox-0.064
Token implemented
Token ID9177
Feature activation-1.04e-01

Pos logit contributions
ason+0.097
arton+0.092
asons+0.085
Write+0.084
antry+0.084
Neg logit contributions
NRS-0.070
 UP-0.068
 Act-0.068
 Rost-0.067
 gloss-0.067
Token this
Token ID428
Feature activation+8.65e-02

Pos logit contributions
NRS+0.059
 UP+0.057
 gloss+0.057
 Act+0.057
 Rost+0.057
Neg logit contributions
ason-0.081
arton-0.078
asons-0.071
Write-0.071
odox-0.071
Token in
Token ID287
Feature activation-2.68e-02

Pos logit contributions
ason+0.073
arton+0.070
asons+0.064
Write+0.064
antry+0.064
Neg logit contributions
NRS-0.043
 UP-0.042
 gloss-0.042
 Rost-0.042
 Act-0.042
Token the
Token ID262
Feature activation-8.44e-02

Pos logit contributions
NRS+0.014
 UP+0.013
 gloss+0.013
 Act+0.013
 Rost+0.013
Neg logit contributions
ason-0.022
arton-0.021
asons-0.020
Write-0.020
odox-0.019
Token states
Token ID2585
Feature activation-1.62e-02

Pos logit contributions
NRS+0.046
 UP+0.045
 gloss+0.045
 Act+0.045
 Rost+0.045
Neg logit contributions
ason-0.067
arton-0.064
asons-0.059
Write-0.059
odox-0.059
Token include
Token ID2291
Feature activation+8.96e-02

Pos logit contributions
NRS+0.010
 UP+0.010
 Act+0.010
 gloss+0.010
 Rost+0.010
Neg logit contributions
ason-0.012
arton-0.011
asons-0.010
Write-0.010
odox-0.010
Token:
Token ID25
Feature activation-3.08e-02

Pos logit contributions
ason+0.066
arton+0.063
asons+0.057
Write+0.057
odox+0.057
Neg logit contributions
NRS-0.054
 UP-0.053
 gloss-0.053
 Act-0.053
 Rost-0.053
Token Uber
Token ID12024
Feature activation+1.13e-01

Pos logit contributions
NRS+0.017
 UP+0.017
 gloss+0.017
 Act+0.017
 Rost+0.017
Neg logit contributions
ason-0.024
arton-0.023
asons-0.021
Write-0.021
odox-0.021
Token,
Token ID11
Feature activation+1.17e-02

Pos logit contributions
ason+0.081
arton+0.077
asons+0.070
Write+0.070
odox+0.070
Neg logit contributions
NRS-0.071
 UP-0.070
 Act-0.069
 gloss-0.069
 Rost-0.069
Token Ben
Token ID3932
Feature activation-9.05e-02

Pos logit contributions
ason+0.008
arton+0.008
asons+0.007
Write+0.007
antry+0.007
Neg logit contributions
NRS-0.007
 gloss-0.007
 UP-0.007
 Act-0.007
 Rost-0.007
Token &
Token ID1222
Feature activation-1.61e-01

Pos logit contributions
NRS+0.046
 UP+0.046
 gloss+0.045
 Act+0.045
 Rost+0.045
Neg logit contributions
ason-0.076
arton-0.073
asons-0.067
Write-0.067
odox-0.066
Token I
Token ID314
Feature activation-8.45e-02

Pos logit contributions
NRS+0.095
 UP+0.094
 Act+0.094
 gloss+0.094
 Rost+0.093
Neg logit contributions
ason-0.132
arton-0.126
asons-0.116
odox-0.115
Write-0.115
Token found
Token ID1043
Feature activation-1.25e-01

Pos logit contributions
NRS+0.053
 UP+0.053
 Act+0.052
 gloss+0.052
 Rost+0.052
Neg logit contributions
ason-0.060
arton-0.058
asons-0.052
Write-0.052
odox-0.052
Token nothing
Token ID2147
Feature activation+4.78e-02

Pos logit contributions
NRS+0.074
 UP+0.074
 Act+0.074
 gloss+0.073
 Rost+0.073
Neg logit contributions
ason-0.093
arton-0.089
odox-0.081
asons-0.081
antry-0.081
Token in
Token ID287
Feature activation-8.40e-03

Pos logit contributions
ason+0.034
arton+0.033
asons+0.030
Write+0.030
odox+0.030
Neg logit contributions
NRS-0.030
 UP-0.029
 Act-0.029
 gloss-0.029
 Rost-0.029
Token their
Token ID511
Feature activation-1.06e-01

Pos logit contributions
NRS+0.005
 UP+0.005
 gloss+0.005
 Act+0.005
 Rost+0.005
Neg logit contributions
ason-0.007
arton-0.006
asons-0.006
Write-0.006
odox-0.006
Token writings
Token ID19336
Feature activation+1.95e-01

Pos logit contributions
NRS+0.055
 UP+0.054
 Act+0.054
 gloss+0.053
 Rost+0.053
Neg logit contributions
ason-0.087
arton-0.084
asons-0.077
odox-0.077
Write-0.077
Token to
Token ID284
Feature activation+5.87e-03

Pos logit contributions
ason+0.144
arton+0.137
asons+0.125
Write+0.124
odox+0.124
Neg logit contributions
NRS-0.118
 UP-0.116
 Act-0.116
 gloss-0.115
 Rost-0.115
Token warrant
Token ID8084
Feature activation+2.87e-02

Pos logit contributions
ason+0.004
arton+0.004
asons+0.003
odox+0.003
Write+0.003
Neg logit contributions
NRS-0.004
 UP-0.004
 Act-0.004
 gloss-0.004
 Rost-0.004
Token anything
Token ID1997
Feature activation+5.02e-03

Pos logit contributions
ason+0.023
arton+0.022
asons+0.020
Write+0.020
odox+0.020
Neg logit contributions
NRS-0.016
 UP-0.016
 Act-0.016
 gloss-0.015
 Rost-0.015
Token approaching
Token ID13885
Feature activation+2.80e-03

Pos logit contributions
ason+0.004
arton+0.004
asons+0.003
odox+0.003
Write+0.003
Neg logit contributions
NRS-0.003
 UP-0.003
 Act-0.003
 gloss-0.003
 Rost-0.003
Token the
Token ID262
Feature activation+9.58e-02

Pos logit contributions
ason+0.002
arton+0.002
asons+0.002
odox+0.002
Write+0.002
Neg logit contributions
NRS-0.002
 UP-0.002
 Act-0.002
 gloss-0.002
 Rost-0.002
Token
Token ID250
Feature activation+7.98e-02

Pos logit contributions
NRS+0.019
 UP+0.019
 Act+0.019
 gloss+0.019
 Rost+0.019
Neg logit contributions
ason-0.017
arton-0.016
asons-0.014
odox-0.014
Write-0.014
Tokenmod
Token ID4666
Feature activation+8.81e-03

Pos logit contributions
ason+0.058
arton+0.055
asons+0.050
odox+0.050
Write+0.050
Neg logit contributions
NRS-0.050
 UP-0.049
 Act-0.049
 gloss-0.049
 Rost-0.049
Tokener
Token ID263
Feature activation-3.05e-02

Pos logit contributions
ason+0.006
arton+0.006
asons+0.005
odox+0.005
Write+0.005
Neg logit contributions
NRS-0.006
 UP-0.006
 Act-0.006
 gloss-0.006
 Rost-0.006
Tokenates
Token ID689
Feature activation+5.73e-02

Pos logit contributions
NRS+0.023
 UP+0.022
 Act+0.022
 gloss+0.022
 Rost+0.022
Neg logit contributions
ason-0.018
arton-0.017
asons-0.015
Write-0.015
odox-0.015
Token
Token ID447
Feature activation-2.66e-02

Pos logit contributions
ason+0.041
arton+0.039
asons+0.036
odox+0.036
Write+0.036
Neg logit contributions
NRS-0.036
 UP-0.035
 Act-0.035
 gloss-0.035
 Rost-0.035
Token
Token ID251
Feature activation-2.02e-02

Pos logit contributions
NRS+0.020
 UP+0.020
 Act+0.020
 gloss+0.020
 Rost+0.020
Neg logit contributions
ason-0.015
arton-0.014
asons-0.013
Write-0.013
odox-0.013
Token voting
Token ID6709
Feature activation+2.69e-01

Pos logit contributions
NRS+0.012
 UP+0.012
 Act+0.012
 gloss+0.012
 Rost+0.012
Neg logit contributions
ason-0.015
arton-0.014
asons-0.013
odox-0.013
Write-0.013
Token for
Token ID329
Feature activation+4.68e-02

Pos logit contributions
ason+0.196
arton+0.187
asons+0.170
Write+0.170
antry+0.170
Neg logit contributions
NRS-0.165
 UP-0.163
 gloss-0.162
 Act-0.161
 Rost-0.161
Token T
Token ID309
Feature activation+3.56e-02

Pos logit contributions
ason+0.035
arton+0.033
asons+0.030
Write+0.030
odox+0.030
Neg logit contributions
NRS-0.028
 UP-0.028
 gloss-0.028
 Act-0.027
 Rost-0.027
Tokenorg
Token ID2398
Feature activation-9.91e-03

Pos logit contributions
ason+0.025
arton+0.023
asons+0.021
odox+0.021
Write+0.021
Neg logit contributions
NRS-0.023
 UP-0.023
 Act-0.023
 gloss-0.023
 Rost-0.023
Tokenersen
Token ID46516
Feature activation+3.92e-01

Pos logit contributions
NRS+0.010
 UP+0.010
 Act+0.010
 gloss+0.010
 Rost+0.010
Neg logit contributions
ason-0.003
arton-0.003
asons-0.003
odox-0.002
Write-0.002
Token
Token ID447
Feature activation+3.40e-02

Pos logit contributions
ason+0.053
arton+0.050
asons+0.045
odox+0.045
Write+0.045
Neg logit contributions
NRS-0.056
 UP-0.056
 Act-0.055
 gloss-0.055
 Rost-0.055
Token
Token ID250
Feature activation+1.99e-02

Pos logit contributions
ason+0.019
arton+0.018
asons+0.016
odox+0.016
Write+0.016
Neg logit contributions
NRS-0.026
 UP-0.026
 Act-0.026
 gloss-0.026
 Rost-0.026
TokenI
Token ID40
Feature activation-4.46e-02

Pos logit contributions
ason+0.014
arton+0.013
asons+0.012
odox+0.012
antry+0.012
Neg logit contributions
NRS-0.013
 UP-0.013
 Act-0.013
 gloss-0.013
 Rost-0.013
Token think
Token ID892
Feature activation-5.42e-03

Pos logit contributions
NRS+0.027
 UP+0.027
 Act+0.027
 gloss+0.027
 Rost+0.027
Neg logit contributions
ason-0.033
arton-0.031
asons-0.028
odox-0.028
Write-0.028
Token we
Token ID356
Feature activation-4.61e-02

Pos logit contributions
 UP+0.003
NRS+0.003
 Act+0.003
 gloss+0.003
 Rost+0.003
Neg logit contributions
ason-0.004
arton-0.004
odox-0.004
asons-0.004
Write-0.004
Token can
Token ID460
Feature activation-1.22e-01

Pos logit contributions
NRS+0.033
 UP+0.033
 gloss+0.033
 Act+0.033
 Rost+0.033
Neg logit contributions
ason-0.028
arton-0.027
asons-0.024
odox-0.024
Write-0.024
Token all
Token ID477
Feature activation+5.33e-02

Pos logit contributions
NRS+0.076
 gloss+0.076
 UP+0.076
 Act+0.076
 Rost+0.075
Neg logit contributions
ason-0.087
arton-0.083
odox-0.076
asons-0.076
Write-0.075
Token agree
Token ID4236
Feature activation+1.91e-01

Pos logit contributions
ason+0.035
arton+0.034
odox+0.030
asons+0.030
Write+0.030
Neg logit contributions
NRS-0.036
 UP-0.036
 gloss-0.036
 Act-0.036
 Rost-0.035
Token that
Token ID326
Feature activation+3.10e-02

Pos logit contributions
ason+0.143
arton+0.137
odox+0.125
asons+0.125
Write+0.124
Neg logit contributions
NRS-0.113
 UP-0.112
 Act-0.112
 gloss-0.112
 Rost-0.111
Token if
Token ID611
Feature activation+2.66e-01

Pos logit contributions
ason+0.023
arton+0.022
asons+0.020
odox+0.020
Write+0.020
Neg logit contributions
NRS-0.018
 UP-0.018
 Act-0.018
 gloss-0.018
 Rost-0.018
Token we
Token ID356
Feature activation-1.36e-02

Pos logit contributions
ason+0.211
arton+0.202
asons+0.186
odox+0.186
Write+0.185
Neg logit contributions
NRS-0.145
 UP-0.145
 Act-0.144
 gloss-0.144
 Rost-0.143
Token the
Token ID262
Feature activation+6.10e-03

Pos logit contributions
NRS+0.003
 UP+0.003
 Act+0.003
 gloss+0.003
 Rost+0.003
Neg logit contributions
ason-0.004
arton-0.003
asons-0.003
odox-0.003
Write-0.003
Token fr
Token ID1216
Feature activation-8.68e-02

Pos logit contributions
ason+0.005
arton+0.005
asons+0.004
odox+0.004
Write+0.004
Neg logit contributions
NRS-0.003
 UP-0.003
 Act-0.003
 gloss-0.003
 Rost-0.003
Tokeninges
Token ID26792
Feature activation+1.77e-01

Pos logit contributions
NRS+0.064
 UP+0.064
 Act+0.063
 gloss+0.063
 Rost+0.063
Neg logit contributions
ason-0.052
arton-0.049
Write-0.044
asons-0.044
odox-0.044
Token of
Token ID286
Feature activation+4.31e-02

Pos logit contributions
ason+0.130
arton+0.124
asons+0.113
odox+0.112
Write+0.112
Neg logit contributions
NRS-0.108
 UP-0.107
 Act-0.106
 gloss-0.106
 Rost-0.105
Token the
Token ID262
Feature activation+1.56e-02

Pos logit contributions
ason+0.035
arton+0.033
asons+0.031
odox+0.031
Write+0.031
Neg logit contributions
NRS-0.023
 UP-0.023
 gloss-0.022
 Act-0.022
 Rost-0.022
Token right
Token ID826
Feature activation-1.80e-01

Pos logit contributions
ason+0.012
arton+0.011
asons+0.010
odox+0.010
Write+0.010
Neg logit contributions
NRS-0.009
 UP-0.009
 Act-0.009
 gloss-0.009
 Rost-0.009
Token.
Token ID13
Feature activation+4.07e-02

Pos logit contributions
NRS+0.107
 UP+0.106
 Act+0.105
 gloss+0.105
 Rost+0.104
Neg logit contributions
ason-0.135
arton-0.129
asons-0.118
Write-0.117
odox-0.117
Token How
Token ID1374
Feature activation-1.09e-01

Pos logit contributions
ason+0.029
arton+0.028
Write+0.026
asons+0.026
odox+0.025
Neg logit contributions
NRS-0.025
 gloss-0.025
 UP-0.024
 Act-0.024
 Rost-0.024
Token does
Token ID857
Feature activation-1.34e-01

Pos logit contributions
NRS+0.065
 UP+0.064
 Act+0.063
 gloss+0.063
 Rost+0.063
Neg logit contributions
ason-0.082
arton-0.078
asons-0.071
Write-0.071
odox-0.071
Token someone
Token ID2130
Feature activation-5.05e-02

Pos logit contributions
NRS+0.079
 UP+0.077
 gloss+0.076
 Act+0.076
 Rost+0.076
Neg logit contributions
ason-0.101
arton-0.097
Write-0.089
asons-0.089
antry-0.088
Token who
Token ID508
Feature activation-4.15e-02

Pos logit contributions
NRS+0.027
 UP+0.027
 Act+0.027
 gloss+0.027
 Rost+0.027
Neg logit contributions
ason-0.040
arton-0.039
asons-0.036
Write-0.035
odox-0.035
Token saw
Token ID2497
Feature activation+1.37e-01

Pos logit contributions
NRS+0.018
 UP+0.017
 Act+0.017
 gloss+0.017
 Rost+0.017
Neg logit contributions
ason-0.018
arton-0.017
asons-0.015
Write-0.015
antry-0.015
Token already
Token ID1541
Feature activation-8.64e-03

Pos logit contributions
ason+0.098
arton+0.093
asons+0.085
odox+0.084
Write+0.084
Neg logit contributions
NRS-0.086
 UP-0.085
 Act-0.085
 gloss-0.085
 Rost-0.084
Token dangling
Token ID47270
Feature activation-4.12e-02

Pos logit contributions
NRS+0.005
 UP+0.005
 Act+0.005
 gloss+0.005
 Rost+0.005
Neg logit contributions
ason-0.006
arton-0.006
asons-0.005
odox-0.005
Write-0.005
Token,
Token ID11
Feature activation-3.60e-02

Pos logit contributions
NRS+0.027
 UP+0.026
 Act+0.026
 gloss+0.026
 Rost+0.026
Neg logit contributions
ason-0.029
arton-0.027
asons-0.025
odox-0.025
Write-0.024
Token waiting
Token ID4953
Feature activation-1.24e-01

Pos logit contributions
NRS+0.022
 UP+0.022
 Act+0.021
 gloss+0.021
 Rost+0.021
Neg logit contributions
ason-0.026
arton-0.025
asons-0.023
Write-0.023
antry-0.023
Token for
Token ID329
Feature activation-7.97e-02

Pos logit contributions
NRS+0.073
 UP+0.072
 Act+0.072
 gloss+0.071
 Rost+0.071
Neg logit contributions
ason-0.094
arton-0.090
asons-0.082
odox-0.082
Write-0.082
Token opening
Token ID4756
Feature activation-3.58e-02

Pos logit contributions
NRS+0.043
 UP+0.042
 gloss+0.042
 Rost+0.041
 Act+0.041
Neg logit contributions
ason-0.064
arton-0.061
asons-0.057
antry-0.056
Write-0.056
Token night
Token ID1755
Feature activation+1.57e-01

Pos logit contributions
NRS+0.021
 UP+0.021
 gloss+0.021
 Act+0.021
 Rost+0.021
Neg logit contributions
ason-0.027
arton-0.025
asons-0.023
Write-0.023
odox-0.023
Token.
Token ID13
Feature activation+1.12e-02

Pos logit contributions
ason+0.114
arton+0.109
asons+0.099
odox+0.099
Write+0.098
Neg logit contributions
NRS-0.097
 UP-0.096
 Act-0.096
 gloss-0.095
 Rost-0.095
Token
Token ID198
Feature activation+1.94e-03

Pos logit contributions
ason+0.009
arton+0.009
asons+0.008
Write+0.008
odox+0.008
Neg logit contributions
NRS-0.006
 UP-0.006
 gloss-0.006
 Act-0.006
 Rost-0.006
Token
Token ID198
Feature activation-1.32e-02

Pos logit contributions
ason+0.002
arton+0.001
Write+0.001
asons+0.001
antry+0.001
Neg logit contributions
NRS-0.001
 UP-0.001
 gloss-0.001
 Act-0.001
 Rost-0.001
Token enforced
Token ID20326
Feature activation-1.88e-01

Pos logit contributions
ason+0.010
arton+0.010
Write+0.009
asons+0.009
odox+0.009
Neg logit contributions
NRS-0.007
 UP-0.007
 Act-0.007
 gloss-0.007
 Rost-0.007
Token,
Token ID11
Feature activation+3.01e-02

Pos logit contributions
NRS+0.117
 UP+0.115
 Act+0.115
 gloss+0.115
 Rost+0.114
Neg logit contributions
ason-0.135
arton-0.129
asons-0.117
Write-0.116
odox-0.116
Token natural
Token ID3288
Feature activation-6.07e-02

Pos logit contributions
ason+0.024
arton+0.022
asons+0.021
Write+0.021
antry+0.020
Neg logit contributions
NRS-0.017
 UP-0.016
 Act-0.016
 gloss-0.016
 Rost-0.016
Token gas
Token ID3623
Feature activation-1.26e-01

Pos logit contributions
NRS+0.037
 UP+0.037
 Act+0.037
 gloss+0.037
 Rost+0.036
Neg logit contributions
ason-0.044
arton-0.042
asons-0.038
odox-0.038
Write-0.038
Token-
Token ID12
Feature activation-4.37e-02

Pos logit contributions
NRS+0.084
 UP+0.083
 Act+0.083
 gloss+0.082
 Rost+0.082
Neg logit contributions
ason-0.086
arton-0.082
asons-0.074
odox-0.074
Write-0.073
Tokenfired
Token ID26803
Feature activation-1.30e-01

Pos logit contributions
NRS+0.022
 UP+0.022
 Act+0.022
 gloss+0.022
 Rost+0.022
Neg logit contributions
ason-0.036
arton-0.035
asons-0.032
Write-0.032
antry-0.032
Token electricity
Token ID8744
Feature activation-6.85e-03

Pos logit contributions
NRS+0.060
 UP+0.059
 Act+0.058
 gloss+0.058
 Rost+0.058
Neg logit contributions
ason-0.114
arton-0.110
asons-0.102
Write-0.101
antry-0.101
Token generation
Token ID5270
Feature activation-2.79e-02

Pos logit contributions
NRS+0.004
 UP+0.004
 Act+0.004
 gloss+0.004
 Rost+0.004
Neg logit contributions
ason-0.005
arton-0.005
asons-0.005
Write-0.005
odox-0.005
Token produces
Token ID11073
Feature activation+2.19e-01

Pos logit contributions
NRS+0.019
 UP+0.019
 gloss+0.019
 Act+0.019
 Rost+0.018
Neg logit contributions
ason-0.019
arton-0.018
asons-0.016
Write-0.016
antry-0.016
Token more
Token ID517
Feature activation-2.63e-03

Pos logit contributions
ason+0.154
arton+0.146
asons+0.133
Write+0.133
odox+0.132
Neg logit contributions
NRS-0.139
 UP-0.137
 Act-0.136
 gloss-0.136
 Rost-0.135
Token lif
Token ID3868
Feature activation+1.62e-01

Pos logit contributions
NRS+0.002
 UP+0.002
 Act+0.002
 Rost+0.001
 gloss+0.001
Neg logit contributions
ason-0.002
arton-0.002
asons-0.002
Write-0.002
odox-0.002
Token-
Token ID12
Feature activation-2.14e-02

Pos logit contributions
NRS+0.050
 UP+0.050
 Act+0.049
 gloss+0.049
 Rost+0.049
Neg logit contributions
ason-0.057
arton-0.054
asons-0.049
odox-0.049
Write-0.049
Token35
Token ID2327
Feature activation+7.54e-03

Pos logit contributions
NRS+0.014
 UP+0.014
 Act+0.013
 gloss+0.013
 Rost+0.013
Neg logit contributions
ason-0.015
arton-0.014
asons-0.013
odox-0.013
Write-0.013
Token f
Token ID277
Feature activation+4.95e-02

Pos logit contributions
ason+0.005
arton+0.005
asons+0.004
odox+0.004
Write+0.004
Neg logit contributions
NRS-0.005
 UP-0.005
 Act-0.005
 gloss-0.005
 Rost-0.005
Token/
Token ID14
Feature activation-5.60e-03

Pos logit contributions
ason+0.034
arton+0.032
asons+0.029
Write+0.029
antry+0.029
Neg logit contributions
NRS-0.033
 UP-0.032
 gloss-0.032
 Act-0.032
 Rost-0.032
Token4
Token ID19
Feature activation-1.01e-01

Pos logit contributions
NRS+0.003
 UP+0.003
 Act+0.003
 gloss+0.003
 Rost+0.003
Neg logit contributions
ason-0.005
arton-0.005
asons-0.004
odox-0.004
Write-0.004
Token IS
Token ID3180
Feature activation-3.73e-01

Pos logit contributions
NRS+0.045
 UP+0.044
 Act+0.044
 gloss+0.044
 Rost+0.043
Neg logit contributions
ason-0.091
arton-0.088
asons-0.081
odox-0.081
Write-0.081
Token lens
Token ID10317
Feature activation-7.59e-02

Pos logit contributions
NRS+0.226
 UP+0.223
 Act+0.222
 gloss+0.222
 Rost+0.220
Neg logit contributions
ason-0.275
arton-0.262
asons-0.238
odox-0.238
Write-0.238
Token at
Token ID379
Feature activation+2.76e-02

Pos logit contributions
NRS+0.048
 UP+0.048
 Act+0.047
 gloss+0.047
 Rost+0.047
Neg logit contributions
ason-0.054
arton-0.052
asons-0.047
odox-0.047
Write-0.047
Token 16
Token ID1467
Feature activation-6.28e-02

Pos logit contributions
ason+0.021
arton+0.021
asons+0.019
Write+0.019
odox+0.019
Neg logit contributions
NRS-0.016
 UP-0.015
 Act-0.015
 gloss-0.015
 Rost-0.015
Tokenmm
Token ID3020
Feature activation-1.96e-01

Pos logit contributions
NRS+0.029
 UP+0.028
 Act+0.028
 gloss+0.028
 Rost+0.028
Neg logit contributions
ason-0.056
arton-0.054
asons-0.050
Write-0.050
odox-0.049
Token.
Token ID13
Feature activation-6.86e-02

Pos logit contributions
NRS+0.116
 UP+0.116
 Act+0.115
 gloss+0.115
 Rost+0.114
Neg logit contributions
ason-0.146
arton-0.140
asons-0.128
odox-0.128
Write-0.127
Token to
Token ID284
Feature activation+3.11e-02

Pos logit contributions
 UP+0.095
NRS+0.094
 Act+0.094
 gloss+0.094
 Rost+0.093
Neg logit contributions
ason-0.110
arton-0.105
odox-0.096
asons-0.096
antry-0.095
Token g
Token ID308
Feature activation+4.99e-02

Pos logit contributions
ason+0.023
arton+0.022
asons+0.020
Write+0.020
odox+0.020
Neg logit contributions
NRS-0.019
 UP-0.019
 Act-0.018
 gloss-0.018
 Rost-0.018
Tokenank
Token ID962
Feature activation-1.34e-01

Pos logit contributions
ason+0.030
arton+0.029
asons+0.026
Write+0.026
odox+0.025
Neg logit contributions
NRS-0.036
 Act-0.036
 UP-0.036
 gloss-0.036
 Rost-0.036
Token immedi
Token ID2729
Feature activation-1.70e-01

Pos logit contributions
NRS+0.077
 UP+0.076
 Act+0.075
 gloss+0.075
 Rost+0.075
Neg logit contributions
ason-0.103
arton-0.098
asons-0.090
odox-0.089
Write-0.089
Tokenat
Token ID265
Feature activation-7.16e-02

Pos logit contributions
NRS+0.121
 UP+0.120
 Act+0.119
 gloss+0.119
 Rost+0.118
Neg logit contributions
ason-0.107
arton-0.101
asons-0.091
odox-0.090
Write-0.090
Tokenly
Token ID306
Feature activation-3.92e-01

Pos logit contributions
 UP+0.029
 Act+0.029
NRS+0.029
 gloss+0.028
 Rost+0.028
Neg logit contributions
ason-0.066
arton-0.065
odox-0.060
Write-0.060
asons-0.060
Token after
Token ID706
Feature activation-6.15e-05

Pos logit contributions
NRS+0.241
 UP+0.239
 Act+0.238
 gloss+0.238
 Rost+0.236
Neg logit contributions
ason-0.284
arton-0.271
asons-0.247
odox-0.246
Write-0.246
Token 6
Token ID718
Feature activation-9.52e-02

Pos logit contributions
NRS+0.000
 UP+0.000
 gloss+0.000
 Act+0.000
 Rost+0.000
Neg logit contributions
ason-0.000
arton-0.000
asons-0.000
Write-0.000
antry-0.000
Token.
Token ID13
Feature activation+1.06e-03

Pos logit contributions
NRS+0.060
 UP+0.059
 Act+0.059
 gloss+0.059
 Rost+0.058
Neg logit contributions
ason-0.068
arton-0.065
asons-0.059
odox-0.059
Write-0.058
Token With
Token ID2080
Feature activation-8.20e-02

Pos logit contributions
ason+0.001
arton+0.001
asons+0.001
Write+0.001
odox+0.001
Neg logit contributions
NRS-0.001
 gloss-0.001
 UP-0.001
 Act-0.001
 Rost-0.001
Token EQ
Token ID36529
Feature activation-2.99e-01

Pos logit contributions
NRS+0.048
 UP+0.047
 Act+0.047
 gloss+0.047
 Rost+0.046
Neg logit contributions
ason-0.062
arton-0.060
asons-0.054
Write-0.054
odox-0.054
Token]
Token ID60
Feature activation-8.28e-02

Pos logit contributions
ason+0.134
arton+0.126
asons+0.115
Write+0.115
odox+0.113
Neg logit contributions
NRS-0.126
 UP-0.124
 gloss-0.124
 Act-0.123
 Rost-0.122
Token <
Token ID1279
Feature activation-1.04e-01

Pos logit contributions
NRS+0.045
 UP+0.045
 Act+0.045
 gloss+0.044
 Rost+0.044
Neg logit contributions
ason-0.066
arton-0.063
asons-0.058
Write-0.058
antry-0.058
Tokenmom
Token ID32542
Feature activation-2.52e-01

Pos logit contributions
NRS+0.065
 UP+0.065
 Act+0.064
 gloss+0.064
 Rost+0.064
Neg logit contributions
ason-0.075
arton-0.072
asons-0.065
odox-0.065
Write-0.065
Tokenper
Token ID525
Feature activation+1.41e-02

Pos logit contributions
NRS+0.171
 UP+0.169
 Act+0.169
 gloss+0.168
 Rost+0.168
Neg logit contributions
ason-0.169
arton-0.160
asons-0.145
Write-0.143
antry-0.143
Token>
Token ID29
Feature activation+6.20e-03

Pos logit contributions
ason+0.010
arton+0.009
asons+0.008
Write+0.008
antry+0.008
Neg logit contributions
NRS-0.009
 UP-0.009
 Act-0.009
 gloss-0.009
 Rost-0.009
Token lol
Token ID19462
Feature activation-3.47e-01

Pos logit contributions
ason+0.005
arton+0.005
asons+0.004
Write+0.004
odox+0.004
Neg logit contributions
NRS-0.003
 UP-0.003
 Act-0.003
 gloss-0.003
 Rost-0.003
Token m
Token ID285
Feature activation-3.63e-02

Pos logit contributions
NRS+0.217
 UP+0.216
 Act+0.216
 gloss+0.215
 Rost+0.213
Neg logit contributions
ason-0.248
arton-0.237
asons-0.215
odox-0.214
Write-0.214
Token2
Token ID17
Feature activation-1.25e-01

Pos logit contributions
NRS+0.018
 UP+0.018
 Act+0.018
 gloss+0.018
 Rost+0.017
Neg logit contributions
ason-0.031
arton-0.030
asons-0.027
Write-0.027
odox-0.027
Tokenk
Token ID74
Feature activation-1.53e-02

Pos logit contributions
NRS+0.077
 UP+0.076
 Act+0.076
 gloss+0.075
 Rost+0.075
Neg logit contributions
ason-0.091
arton-0.087
asons-0.080
odox-0.079
Write-0.079
Token [
Token ID685
Feature activation-1.14e-01

Pos logit contributions
NRS+0.011
 UP+0.011
 Act+0.011
 gloss+0.011
 Rost+0.011
Neg logit contributions
ason-0.009
arton-0.009
asons-0.008
Write-0.008
odox-0.008
Token10
Token ID940
Feature activation+2.30e-02

Pos logit contributions
NRS+0.074
 UP+0.074
 Act+0.073
 gloss+0.073
 Rost+0.073
Neg logit contributions
ason-0.079
arton-0.075
asons-0.068
Write-0.068
odox-0.067
Token women
Token ID1466
Feature activation+9.67e-02

Pos logit contributions
ason+0.088
arton+0.083
asons+0.077
Write+0.076
odox+0.076
Neg logit contributions
NRS-0.064
 gloss-0.062
 UP-0.062
 Act-0.062
 Rost-0.061
Token,
Token ID11
Feature activation-2.90e-02

Pos logit contributions
ason+0.071
arton+0.067
asons+0.061
odox+0.061
Write+0.061
Neg logit contributions
NRS-0.059
 UP-0.059
 Act-0.058
 gloss-0.058
 Rost-0.058
Token 37
Token ID5214
Feature activation+1.18e-01

Pos logit contributions
NRS+0.018
 UP+0.018
 Act+0.018
 gloss+0.018
 Rost+0.017
Neg logit contributions
ason-0.021
arton-0.020
asons-0.018
Write-0.018
odox-0.018
Token%
Token ID4
Feature activation-1.37e-01

Pos logit contributions
ason+0.106
arton+0.102
asons+0.095
odox+0.094
Write+0.094
Neg logit contributions
NRS-0.053
 UP-0.052
 Act-0.052
 gloss-0.052
 Rost-0.051
Token were
Token ID547
Feature activation+1.04e-01

Pos logit contributions
NRS+0.099
 UP+0.098
 Act+0.098
 gloss+0.098
 Rost+0.097
Neg logit contributions
ason-0.084
arton-0.079
asons-0.071
odox-0.071
Write-0.070
Token manual
Token ID10107
Feature activation-3.56e-01

Pos logit contributions
ason+0.082
arton+0.078
asons+0.072
Write+0.071
odox+0.071
Neg logit contributions
NRS-0.059
 UP-0.057
 gloss-0.057
 Act-0.057
 Rost-0.056
Token workers
Token ID3259
Feature activation+3.53e-02

Pos logit contributions
NRS+0.228
 UP+0.227
 Act+0.226
 gloss+0.225
 Rost+0.224
Neg logit contributions
ason-0.249
arton-0.238
asons-0.215
odox-0.215
Write-0.214
Token,
Token ID11
Feature activation-3.70e-02

Pos logit contributions
ason+0.026
arton+0.025
odox+0.023
Write+0.023
asons+0.023
Neg logit contributions
 UP-0.021
NRS-0.021
 Act-0.021
 gloss-0.021
 Rost-0.021
Token over
Token ID625
Feature activation-2.33e-02

Pos logit contributions
NRS+0.022
 UP+0.022
 Act+0.022
 gloss+0.022
 Rost+0.022
Neg logit contributions
ason-0.027
arton-0.026
asons-0.024
Write-0.023
odox-0.023
Token 10
Token ID838
Feature activation+1.12e-01

Pos logit contributions
NRS+0.015
 UP+0.015
 Act+0.015
 gloss+0.015
 Rost+0.015
Neg logit contributions
ason-0.016
arton-0.015
asons-0.014
Write-0.013
odox-0.013
Token%
Token ID4
Feature activation-1.45e-01

Pos logit contributions
ason+0.103
arton+0.099
asons+0.092
odox+0.092
Write+0.092
Neg logit contributions
NRS-0.047
 UP-0.046
 Act-0.046
 gloss-0.046
 Rost-0.045
Token Pick
Token ID12346
Feature activation-7.12e-02

Pos logit contributions
NRS+0.087
 UP+0.085
 gloss+0.085
 Act+0.085
 Rost+0.084
Neg logit contributions
ason-0.111
arton-0.106
asons-0.097
Write-0.097
antry-0.096
Token M
Token ID337
Feature activation-2.96e-02

Pos logit contributions
NRS+0.042
 UP+0.042
 Act+0.042
 gloss+0.041
 Rost+0.041
Neg logit contributions
ason-0.053
arton-0.051
asons-0.047
odox-0.046
Write-0.046
Token.
Token ID13
Feature activation-4.43e-02

Pos logit contributions
NRS+0.019
 UP+0.019
 Act+0.019
 gloss+0.019
 Rost+0.019
Neg logit contributions
ason-0.020
arton-0.019
asons-0.017
odox-0.017
Write-0.017
Token G
Token ID402
Feature activation-7.01e-02

Pos logit contributions
NRS+0.025
 UP+0.025
 Act+0.025
 gloss+0.024
 Rost+0.024
Neg logit contributions
ason-0.034
arton-0.033
asons-0.030
odox-0.030
Write-0.030
Tokeness
Token ID408
Feature activation+2.99e-02

Pos logit contributions
NRS+0.044
 UP+0.044
 Act+0.043
 gloss+0.043
 Rost+0.043
Neg logit contributions
ason-0.050
arton-0.048
asons-0.043
odox-0.043
Write-0.043
Tokenberg
Token ID3900
Feature activation-3.48e-01

Pos logit contributions
ason+0.025
arton+0.024
asons+0.022
odox+0.022
Write+0.022
Neg logit contributions
NRS-0.015
 UP-0.015
 Act-0.015
 gloss-0.015
 Rost-0.015
Tokenwitz
Token ID28155
Feature activation-2.89e-01

Pos logit contributions
NRS+0.175
 UP+0.173
 Act+0.173
 gloss+0.172
 Rost+0.171
Neg logit contributions
ason-0.292
arton-0.281
asons-0.259
odox-0.259
Write-0.258
Token West
Token ID2688
Feature activation+1.59e-01

Pos logit contributions
NRS+0.142
 UP+0.141
 Act+0.140
 gloss+0.139
 Rost+0.139
Neg logit contributions
ason-0.246
arton-0.236
asons-0.218
odox-0.218
Write-0.217
Tokenchester
Token ID35983
Feature activation-1.41e-01

Pos logit contributions
ason+0.138
arton+0.133
asons+0.123
Write+0.122
odox+0.122
Neg logit contributions
NRS-0.075
 UP-0.073
 Act-0.073
 gloss-0.073
 Rost-0.072
Token 1
Token ID352
Feature activation-5.47e-02

Pos logit contributions
NRS+0.086
 UP+0.085
 Act+0.085
 gloss+0.084
 Rost+0.084
Neg logit contributions
ason-0.104
arton-0.099
asons-0.090
odox-0.090
antry-0.089
Token hour
Token ID1711
Feature activation-2.61e-02

Pos logit contributions
NRS+0.031
 UP+0.031
 Act+0.031
 gloss+0.031
 Rost+0.031
Neg logit contributions
ason-0.042
arton-0.040
asons-0.037
odox-0.037
Write-0.037