TOP ACTIVATIONS
MAX = 1.107

$ s Format ) } else { $ font 2 =
the feds when police over step the bounds in peaceful protests
was released from a North Korean prison last week died Monday
elements included drums and other familiar Survivor musical accents playing in
it comes to issues of personal liberty , but it can
Round 4 match of the IGN Pro League Team Arena Challenge
small amounts of marijuana for personal use . "
stories that have aluminum composite panels . The core of many
for watching TV : North Korea executes 80 people in public
is very unlikely the North Korean authorities would decide to keep
same with the do .. end syntax e . g .:
ext = ($ text - split "\ r ")
the arresting officers have a personal relationship of some kind and
// This is the base case if ( parts . Any
, according to the company register . Besides Sh
that matter . Fight Back ! Sign up for
cs will go here + -------------------- --+ + -------------------- --+ <
all seems an ab ject failure of that primary goal
bring cash . Sometimes , credit cards fail , just because
dis charges , or even fix a breach in Tal os

MIN ACTIVATIONS
MIN = -1.230

, including headset , cr anks and brakes . The wheels
path , 4 ) = '. cl j ' OR RIGHT
rules mean judges often pres ide over cases involving major campaign
credit : Cop Richard , royalty free <|endoftext|> J ür gen
Gutenberg ( www . g utenberg . org ). Any file
path , 5 ) = '. cl jc ' OR RIGHT
R aim ondo <|endoftext|> lic hess . org is getting better
damn cle ats . Conf ession time : We don 't
, at whatever cost , whichever Democrat is running …. True
ing , _ : & ' a Guard ) -> Option
, features a full Shim ano 600 tri - color group
scept icism , the C IP D warned that 2008 would
- We ber ( Un ivers ité de T oul ouse
= ($ text - split "\ r ") $
path , 5 ) = '. cl js ' OR RIGHT
a law that will ann ul all legal claims against Israeli
. In the midst of this theoretical debate ,
first to hear about Nation Travels destinations , and explore the
F 1 receptor antagonist . Oral administration of a single dose
camera has an exposure z ebra with adjustable levels ) but

INTERVAL 0.523 - 1.107
CONTAINS 0.390%

at town meetings in New Hampshire and Iowa are sending a
5 kg ), lighter and grey er , with much less
wife thinks that the purple colour is really great and I
and beyond . <|endoftext|> G RAY SON has been building and
responsibility to liberate ourselves and others from all forms of oppression

INTERVAL -0.061 - 0.523
CONTAINS 66.097%

sheriff 's office said . We ird News Photos
couldn t afford the rent when their subsidy was
the other lost , broken toys cast off from various NHL
his mistake . The one thing that Bob told
implications for our common security . Despite this ,

INTERVAL -0.645 - -0.061
CONTAINS 33.401%

1 f 4 b be 66 power off Notice
worked with Pam as long as you did , was that
s rents come to an additional $ 235 million in
an imitation Ver me er hung on the wall .
/ kg ) significantly reduced anxiety - like behavior in the

INTERVAL -1.230 - -0.645
CONTAINS 0.112%

also been actively making a fool of Gohan by not only
one element of a tri pty ch called Echo that Indigenous
2 : A Little Prepar ation Goes A Long Way
respond . De anna B arch , a professor of psychology
can detach the plastic panel running alongside the passenger center console
Token $
Token ID720
Feature activation-2.67e-01

Pos logit contributions
theless+0.309
wcs+0.301
@#&+0.293
etheless+0.275
racuse+0.266
Neg logit contributions
 oppos-0.290
 domain-0.280
amus-0.269
 dur-0.261
 negoti-0.260
Tokens
Token ID82
Feature activation-2.56e-02

Pos logit contributions
 oppos+0.542
 domain+0.523
amus+0.500
 dur+0.481
 negoti+0.478
Neg logit contributions
theless-0.668
wcs-0.646
@#&-0.633
etheless-0.599
racuse-0.582
TokenFormat
Token ID26227
Feature activation+1.18e-01

Pos logit contributions
 oppos+0.058
 domain+0.057
amus+0.055
 dur+0.053
 negoti+0.052
Neg logit contributions
theless-0.057
wcs-0.055
@#&-0.054
etheless-0.051
racuse-0.049
Token)
Token ID8
Feature activation+1.76e-01

Pos logit contributions
theless+0.290
wcs+0.285
@#&+0.279
etheless+0.259
racuse+0.254
Neg logit contributions
 oppos-0.239
 domain-0.230
amus-0.221
 negoti-0.214
 dur-0.213
Token }
Token ID1782
Feature activation+7.40e-02

Pos logit contributions
theless+0.433
wcs+0.426
@#&+0.417
etheless+0.387
 RN+0.379
Neg logit contributions
 oppos-0.356
 domain-0.342
amus-0.331
 negoti-0.318
 dur-0.316
Token else
Token ID2073
Feature activation+1.08e+00

Pos logit contributions
theless+0.183
wcs+0.179
@#&+0.175
etheless+0.163
 RN+0.160
Neg logit contributions
 oppos-0.150
 domain-0.144
amus-0.140
 negoti-0.134
 dur-0.133
Token {
Token ID1391
Feature activation+3.91e-01

Pos logit contributions
theless+2.419
wcs+2.388
@#&+2.307
etheless+2.135
 RN+2.102
Neg logit contributions
 oppos-2.387
 domain-2.279
amus-2.210
 negoti-2.155
 dur-2.129
Token $
Token ID720
Feature activation-1.05e-01

Pos logit contributions
theless+0.973
wcs+0.959
@#&+0.935
etheless+0.867
 RN+0.857
Neg logit contributions
 oppos-0.781
 domain-0.747
amus-0.726
 negoti-0.699
 dur-0.691
Tokenfont
Token ID10331
Feature activation+9.85e-02

Pos logit contributions
 oppos+0.229
 domain+0.221
amus+0.213
 dur+0.206
 negoti+0.205
Neg logit contributions
theless-0.244
wcs-0.237
@#&-0.231
etheless-0.217
racuse-0.210
Token2
Token ID17
Feature activation+4.67e-03

Pos logit contributions
theless+0.238
wcs+0.230
@#&+0.225
etheless+0.213
racuse+0.205
Neg logit contributions
 oppos-0.209
 domain-0.201
amus-0.193
 dur-0.187
 negoti-0.185
Token =
Token ID796
Feature activation+2.29e-01

Pos logit contributions
theless+0.009
wcs+0.009
@#&+0.008
etheless+0.008
racuse+0.007
Neg logit contributions
 oppos-0.012
 domain-0.012
amus-0.011
 negoti-0.011
 dur-0.011
Token the
Token ID262
Feature activation-7.51e-02

Pos logit contributions
 oppos+1.002
 domain+0.965
amus+0.925
 dur+0.896
 negoti+0.892
Neg logit contributions
theless-1.140
wcs-1.110
@#&-1.083
etheless-1.018
racuse-0.987
Token feds
Token ID43646
Feature activation-4.72e-02

Pos logit contributions
 oppos+0.149
 domain+0.143
amus+0.139
 negoti+0.134
 dur+0.133
Neg logit contributions
theless-0.188
wcs-0.183
@#&-0.179
etheless-0.168
 RN-0.164
Token when
Token ID618
Feature activation-8.35e-03

Pos logit contributions
 oppos+0.097
 domain+0.093
amus+0.089
 negoti+0.086
 dur+0.086
Neg logit contributions
theless-0.116
wcs-0.113
@#&-0.110
etheless-0.104
racuse-0.100
Token police
Token ID1644
Feature activation-1.13e-01

Pos logit contributions
 oppos+0.020
 domain+0.019
amus+0.018
 dur+0.018
 negoti+0.018
Neg logit contributions
theless-0.018
wcs-0.017
@#&-0.017
etheless-0.016
racuse-0.015
Token over
Token ID625
Feature activation-5.69e-03

Pos logit contributions
 oppos+0.244
 domain+0.234
amus+0.228
 dur+0.219
 negoti+0.218
Neg logit contributions
theless-0.265
wcs-0.258
@#&-0.250
etheless-0.234
 RN-0.230
Tokenstep
Token ID9662
Feature activation+1.03e+00

Pos logit contributions
 oppos+0.011
 domain+0.011
amus+0.010
 dur+0.010
 negoti+0.010
Neg logit contributions
theless-0.014
wcs-0.014
@#&-0.014
etheless-0.013
racuse-0.013
Token the
Token ID262
Feature activation+1.62e-01

Pos logit contributions
theless+2.232
wcs+2.216
@#&+2.129
 RN+1.965
etheless+1.948
Neg logit contributions
 oppos-2.317
 domain-2.188
amus-2.143
 propagate-2.106
 dur-2.102
Token bounds
Token ID22303
Feature activation-1.33e-01

Pos logit contributions
theless+0.375
wcs+0.365
@#&+0.358
etheless+0.332
 RN+0.324
Neg logit contributions
 oppos-0.356
 domain-0.338
amus-0.329
 dur-0.320
 negoti-0.320
Token in
Token ID287
Feature activation-2.57e-02

Pos logit contributions
 oppos+0.246
 domain+0.237
amus+0.224
 dur+0.217
 negoti+0.214
Neg logit contributions
theless-0.358
wcs-0.347
@#&-0.340
etheless-0.323
racuse-0.314
Token peaceful
Token ID12309
Feature activation+8.19e-02

Pos logit contributions
 oppos+0.052
 domain+0.050
amus+0.049
 dur+0.047
 negoti+0.046
Neg logit contributions
theless-0.063
wcs-0.062
@#&-0.060
etheless-0.056
 RN-0.055
Token protests
Token ID8536
Feature activation+1.93e-01

Pos logit contributions
theless+0.241
wcs+0.233
@#&+0.229
etheless+0.215
 RN+0.214
Neg logit contributions
 oppos-0.126
 domain-0.120
amus-0.117
 propagate-0.110
 dur-0.109
Token was
Token ID373
Feature activation-6.25e-02

Pos logit contributions
theless+0.080
wcs+0.078
@#&+0.076
etheless+0.070
racuse+0.069
Neg logit contributions
 oppos-0.077
 domain-0.074
amus-0.072
 dur-0.069
 negoti-0.069
Token released
Token ID2716
Feature activation+6.63e-02

Pos logit contributions
 oppos+0.135
 domain+0.131
amus+0.126
 negoti+0.122
 dur+0.122
Neg logit contributions
theless-0.145
wcs-0.142
@#&-0.138
etheless-0.129
 RN-0.125
Token from
Token ID422
Feature activation-7.06e-02

Pos logit contributions
theless+0.159
wcs+0.155
@#&+0.151
etheless+0.141
racuse+0.138
Neg logit contributions
 oppos-0.139
 domain-0.134
amus-0.130
 negoti-0.124
 propagate-0.124
Token a
Token ID257
Feature activation-7.35e-02

Pos logit contributions
 oppos+0.137
 domain+0.132
amus+0.127
 negoti+0.122
 dur+0.121
Neg logit contributions
theless-0.181
wcs-0.176
@#&-0.172
etheless-0.162
racuse-0.159
Token North
Token ID2258
Feature activation-1.85e-01

Pos logit contributions
 oppos+0.156
 domain+0.150
amus+0.145
 negoti+0.140
 dur+0.139
Neg logit contributions
theless-0.175
wcs-0.171
@#&-0.167
etheless-0.156
 RN-0.152
Token Korean
Token ID6983
Feature activation+1.03e+00

Pos logit contributions
 oppos+0.396
 domain+0.381
amus+0.366
 dur+0.354
 negoti+0.353
Neg logit contributions
theless-0.442
wcs-0.430
@#&-0.420
etheless-0.394
racuse-0.382
Token prison
Token ID3770
Feature activation-4.34e-02

Pos logit contributions
theless+2.387
wcs+2.357
@#&+2.315
 RN+2.153
etheless+2.153
Neg logit contributions
 oppos-2.092
 domain-2.006
amus-1.997
 negoti-1.906
 propagate-1.894
Token last
Token ID938
Feature activation-3.22e-01

Pos logit contributions
 oppos+0.085
 domain+0.081
amus+0.079
 propagate+0.075
 negoti+0.075
Neg logit contributions
theless-0.110
wcs-0.107
@#&-0.104
etheless-0.098
 RN-0.096
Token week
Token ID1285
Feature activation+1.05e-01

Pos logit contributions
 oppos+0.808
 domain+0.782
amus+0.763
 negoti+0.737
 dur+0.736
Neg logit contributions
theless-0.643
wcs-0.623
@#&-0.602
etheless-0.559
 RN-0.536
Token died
Token ID3724
Feature activation+2.89e-01

Pos logit contributions
theless+0.255
wcs+0.248
@#&+0.241
etheless+0.226
 RN+0.220
Neg logit contributions
 oppos-0.217
 domain-0.209
amus-0.201
 propagate-0.193
 dur-0.192
Token Monday
Token ID3321
Feature activation-1.96e-01

Pos logit contributions
theless+0.715
wcs+0.696
@#&+0.675
etheless+0.637
 RN+0.622
Neg logit contributions
 oppos-0.584
 domain-0.561
amus-0.549
 negoti-0.523
 propagate-0.518
Token elements
Token ID4847
Feature activation+1.81e-01

Pos logit contributions
theless+0.411
wcs+0.400
@#&+0.390
etheless+0.361
 RN+0.351
Neg logit contributions
 oppos-0.432
 domain-0.417
amus-0.403
 negoti-0.391
 dur-0.389
Token included
Token ID3017
Feature activation-1.65e-01

Pos logit contributions
theless+0.384
wcs+0.370
@#&+0.361
etheless+0.333
 RN+0.323
Neg logit contributions
 oppos-0.431
 domain-0.416
amus-0.405
 negoti-0.391
 dur-0.390
Token drums
Token ID24730
Feature activation+1.59e-01

Pos logit contributions
 oppos+0.324
 domain+0.312
amus+0.300
 dur+0.287
 negoti+0.287
Neg logit contributions
theless-0.417
wcs-0.407
@#&-0.397
etheless-0.374
racuse-0.364
Token and
Token ID290
Feature activation+1.79e-02

Pos logit contributions
theless+0.364
wcs+0.355
@#&+0.347
etheless+0.323
racuse+0.313
Neg logit contributions
 oppos-0.354
 domain-0.342
amus-0.328
 negoti-0.318
 dur-0.317
Token other
Token ID584
Feature activation+2.27e-01

Pos logit contributions
theless+0.045
wcs+0.043
@#&+0.042
etheless+0.039
 RN+0.039
Neg logit contributions
 oppos-0.036
 domain-0.034
amus-0.033
 negoti-0.032
 propagate-0.032
Token familiar
Token ID5385
Feature activation+9.97e-01

Pos logit contributions
theless+0.565
wcs+0.546
@#&+0.534
 RN+0.499
etheless+0.498
Neg logit contributions
 oppos-0.442
 domain-0.430
amus-0.423
 negoti-0.399
 propagate-0.398
Token Survivor
Token ID23740
Feature activation+2.11e-02

Pos logit contributions
theless+2.355
wcs+2.284
@#&+2.230
 RN+2.087
etheless+2.075
Neg logit contributions
 oppos-2.000
amus-1.942
 domain-1.941
 propagate-1.828
 negoti-1.824
Token musical
Token ID10530
Feature activation+1.34e-01

Pos logit contributions
theless+0.049
wcs+0.048
@#&+0.047
etheless+0.044
 RN+0.043
Neg logit contributions
 oppos-0.045
 domain-0.044
amus-0.043
 negoti-0.041
 dur-0.041
Token accents
Token ID39271
Feature activation+8.43e-02

Pos logit contributions
theless+0.317
wcs+0.308
@#&+0.302
etheless+0.281
 RN+0.273
Neg logit contributions
 oppos-0.284
 domain-0.274
amus-0.266
 dur-0.254
 negoti-0.254
Token playing
Token ID2712
Feature activation-2.41e-01

Pos logit contributions
theless+0.212
wcs+0.207
@#&+0.202
etheless+0.190
racuse+0.184
Neg logit contributions
 oppos-0.168
 domain-0.162
amus-0.155
 negoti-0.149
 dur-0.149
Token in
Token ID287
Feature activation+1.47e-02

Pos logit contributions
 oppos+0.446
 domain+0.430
amus+0.421
 negoti+0.396
 propagate+0.392
Neg logit contributions
theless-0.631
wcs-0.613
@#&-0.602
etheless-0.563
 RN-0.559
Token it
Token ID340
Feature activation+5.82e-02

Pos logit contributions
theless+0.153
wcs+0.149
@#&+0.144
etheless+0.135
racuse+0.131
Neg logit contributions
 oppos-0.158
 domain-0.153
amus-0.149
 dur-0.143
 negoti-0.143
Token comes
Token ID2058
Feature activation-1.14e-01

Pos logit contributions
theless+0.126
wcs+0.121
@#&+0.116
etheless+0.107
racuse+0.105
Neg logit contributions
 oppos-0.136
 domain-0.132
amus-0.128
 dur-0.123
 propagate-0.123
Token to
Token ID284
Feature activation+9.87e-02

Pos logit contributions
 oppos+0.231
 domain+0.222
amus+0.215
 negoti+0.206
 dur+0.205
Neg logit contributions
theless-0.278
wcs-0.274
@#&-0.265
etheless-0.248
racuse-0.243
Token issues
Token ID2428
Feature activation+1.03e-01

Pos logit contributions
theless+0.259
wcs+0.253
@#&+0.245
etheless+0.232
 RN+0.227
Neg logit contributions
 oppos-0.181
 domain-0.175
amus-0.172
 negoti-0.163
 dur-0.161
Token of
Token ID286
Feature activation+3.97e-02

Pos logit contributions
theless+0.254
wcs+0.249
@#&+0.242
etheless+0.228
racuse+0.220
Neg logit contributions
 oppos-0.210
 domain-0.201
amus-0.196
 negoti-0.187
 dur-0.186
Token personal
Token ID2614
Feature activation+9.76e-01

Pos logit contributions
theless+0.107
wcs+0.105
@#&+0.102
etheless+0.096
 RN+0.094
Neg logit contributions
 oppos-0.070
 domain-0.067
amus-0.066
 negoti-0.063
 dur-0.062
Token liberty
Token ID12354
Feature activation+1.25e-01

Pos logit contributions
theless+2.827
wcs+2.778
@#&+2.701
 RN+2.554
etheless+2.548
Neg logit contributions
 oppos-1.447
amus-1.408
 domain-1.342
 negoti-1.254
 propagate-1.252
Token,
Token ID11
Feature activation+1.16e-02

Pos logit contributions
theless+0.301
wcs+0.297
@#&+0.287
etheless+0.269
racuse+0.262
Neg logit contributions
 oppos-0.262
 domain-0.252
amus-0.244
 negoti-0.234
 dur-0.233
Token but
Token ID475
Feature activation+9.47e-02

Pos logit contributions
theless+0.028
wcs+0.027
@#&+0.027
etheless+0.025
 RN+0.024
Neg logit contributions
 oppos-0.024
 domain-0.023
amus-0.023
 negoti-0.022
 propagate-0.022
Token it
Token ID340
Feature activation+1.74e-02

Pos logit contributions
theless+0.221
wcs+0.212
@#&+0.208
etheless+0.195
 RN+0.188
Neg logit contributions
 oppos-0.206
 domain-0.198
amus-0.194
 negoti-0.185
 propagate-0.185
Token can
Token ID460
Feature activation+3.88e-02

Pos logit contributions
theless+0.043
wcs+0.042
@#&+0.041
etheless+0.039
racuse+0.037
Neg logit contributions
 oppos-0.035
 domain-0.034
amus-0.032
 dur-0.031
 propagate-0.031
Token Round
Token ID10485
Feature activation-1.69e-02

Pos logit contributions
 oppos+0.205
 domain+0.197
amus+0.190
 negoti+0.184
 dur+0.184
Neg logit contributions
theless-0.228
wcs-0.223
@#&-0.217
etheless-0.200
 RN-0.198
Token 4
Token ID604
Feature activation+1.15e-01

Pos logit contributions
 oppos+0.035
 domain+0.034
amus+0.033
 dur+0.032
 negoti+0.032
Neg logit contributions
theless-0.041
wcs-0.040
@#&-0.039
etheless-0.036
 RN-0.035
Token match
Token ID2872
Feature activation+2.18e-02

Pos logit contributions
theless+0.273
wcs+0.269
@#&+0.261
etheless+0.243
 RN+0.238
Neg logit contributions
 oppos-0.241
 domain-0.232
amus-0.223
 dur-0.216
 negoti-0.216
Token of
Token ID286
Feature activation+1.77e-01

Pos logit contributions
theless+0.055
wcs+0.054
@#&+0.053
etheless+0.050
racuse+0.048
Neg logit contributions
 oppos-0.043
 domain-0.041
amus-0.040
 negoti-0.038
 dur-0.038
Token the
Token ID262
Feature activation-3.07e-02

Pos logit contributions
theless+0.429
wcs+0.420
@#&+0.411
etheless+0.384
 RN+0.376
Neg logit contributions
 oppos-0.362
 domain-0.349
amus-0.339
 negoti-0.326
 propagate-0.324
Token IGN
Token ID28730
Feature activation+9.55e-01

Pos logit contributions
 oppos+0.066
 domain+0.064
amus+0.062
 negoti+0.060
 dur+0.059
Neg logit contributions
theless-0.072
wcs-0.070
@#&-0.068
etheless-0.064
 RN-0.062
Token Pro
Token ID1041
Feature activation-9.31e-02

Pos logit contributions
theless+2.328
wcs+2.300
@#&+2.257
etheless+2.080
 RN+2.072
Neg logit contributions
 oppos-1.913
 domain-1.831
amus-1.749
 negoti-1.714
 propagate-1.688
Token League
Token ID4041
Feature activation+1.01e-01

Pos logit contributions
 oppos+0.205
 domain+0.197
amus+0.190
 dur+0.184
 negoti+0.183
Neg logit contributions
theless-0.216
wcs-0.210
@#&-0.205
etheless-0.192
racuse-0.186
Token Team
Token ID4816
Feature activation-1.81e-02

Pos logit contributions
theless+0.241
wcs+0.237
@#&+0.231
etheless+0.215
 RN+0.209
Neg logit contributions
 oppos-0.215
 domain-0.207
amus-0.199
 negoti-0.192
 dur-0.192
Token Arena
Token ID10937
Feature activation+1.30e-01

Pos logit contributions
 oppos+0.039
 domain+0.037
amus+0.036
 dur+0.035
 negoti+0.035
Neg logit contributions
theless-0.043
wcs-0.042
@#&-0.041
etheless-0.038
racuse-0.037
Token Challenge
Token ID13879
Feature activation+1.36e-02

Pos logit contributions
theless+0.299
wcs+0.295
@#&+0.286
etheless+0.266
 RN+0.260
Neg logit contributions
 oppos-0.285
 domain-0.274
amus-0.266
 negoti-0.256
 dur-0.255
Token small
Token ID1402
Feature activation+1.85e-01

Pos logit contributions
 oppos+0.157
 domain+0.151
amus+0.147
 negoti+0.141
 dur+0.140
Neg logit contributions
theless-0.181
wcs-0.177
@#&-0.171
etheless-0.160
 RN-0.157
Token amounts
Token ID6867
Feature activation+1.98e-01

Pos logit contributions
theless+0.373
wcs+0.363
@#&+0.351
etheless+0.324
 RN+0.320
Neg logit contributions
 oppos-0.451
 domain-0.433
amus-0.431
 propagate-0.412
 negoti-0.411
Token of
Token ID286
Feature activation+3.48e-02

Pos logit contributions
theless+0.519
wcs+0.507
@#&+0.496
etheless+0.468
racuse+0.455
Neg logit contributions
 oppos-0.377
 domain-0.362
amus-0.345
 dur-0.333
 negoti-0.331
Token marijuana
Token ID5727
Feature activation+5.11e-01

Pos logit contributions
theless+0.085
wcs+0.082
@#&+0.080
etheless+0.075
 RN+0.074
Neg logit contributions
 oppos-0.071
 domain-0.068
amus-0.067
 negoti-0.064
 propagate-0.064
Token for
Token ID329
Feature activation+1.77e-02

Pos logit contributions
theless+1.305
wcs+1.273
@#&+1.235
etheless+1.167
 RN+1.137
Neg logit contributions
 oppos-0.991
 domain-0.952
amus-0.914
 negoti-0.879
 dur-0.874
Token personal
Token ID2614
Feature activation+9.45e-01

Pos logit contributions
theless+0.046
wcs+0.045
@#&+0.043
etheless+0.041
 RN+0.040
Neg logit contributions
 oppos-0.033
 domain-0.032
amus-0.031
 negoti-0.030
 dur-0.030
Token use
Token ID779
Feature activation+2.09e-01

Pos logit contributions
theless+1.955
wcs+1.854
@#&+1.786
 RN+1.668
racuse+1.646
Neg logit contributions
 oppos-2.210
amus-2.129
 domain-2.106
 propagate-2.032
 negoti-1.987
Token.
Token ID13
Feature activation+1.79e-01

Pos logit contributions
theless+0.549
wcs+0.536
@#&+0.521
etheless+0.492
racuse+0.479
Neg logit contributions
 oppos-0.391
 domain-0.374
amus-0.359
 negoti-0.345
 dur-0.343
Token
Token ID198
Feature activation+3.50e-01

Pos logit contributions
theless+0.373
wcs+0.362
@#&+0.352
etheless+0.327
racuse+0.316
Neg logit contributions
 oppos-0.435
 domain-0.422
amus-0.407
 dur-0.395
 negoti-0.394
Token
Token ID198
Feature activation+3.38e-01

Pos logit contributions
theless+0.626
wcs+0.610
@#&+0.594
 RN+0.523
etheless+0.520
Neg logit contributions
 oppos-0.942
 domain-0.912
amus-0.884
 negoti-0.876
 propagate-0.864
Token"
Token ID1
Feature activation+2.89e-01

Pos logit contributions
theless+0.850
wcs+0.839
@#&+0.819
etheless+0.755
racuse+0.748
Neg logit contributions
 oppos-0.667
 domain-0.644
amus-0.608
 negoti-0.603
 dur-0.596
Token stories
Token ID3923
Feature activation+1.63e-01

Pos logit contributions
theless+0.345
wcs+0.338
@#&+0.328
etheless+0.306
racuse+0.301
Neg logit contributions
 oppos-0.271
 domain-0.259
amus-0.254
 negoti-0.241
 dur-0.241
Token that
Token ID326
Feature activation-8.66e-03

Pos logit contributions
theless+0.395
wcs+0.388
@#&+0.376
etheless+0.353
racuse+0.342
Neg logit contributions
 oppos-0.338
 domain-0.326
amus-0.315
 negoti-0.302
 dur-0.300
Token have
Token ID423
Feature activation+1.24e-01

Pos logit contributions
 oppos+0.019
 domain+0.018
amus+0.018
 dur+0.017
 negoti+0.017
Neg logit contributions
theless-0.020
wcs-0.020
@#&-0.019
etheless-0.018
racuse-0.017
Token aluminum
Token ID15955
Feature activation+1.39e-01

Pos logit contributions
theless+0.293
wcs+0.285
@#&+0.275
etheless+0.259
 RN+0.252
Neg logit contributions
 oppos-0.264
 domain-0.255
amus-0.247
 negoti-0.238
 dur-0.237
Token composite
Token ID24185
Feature activation+4.63e-01

Pos logit contributions
theless+0.231
wcs+0.222
@#&+0.211
etheless+0.192
 RN+0.185
Neg logit contributions
 oppos-0.396
 domain-0.386
amus-0.378
 negoti-0.365
 dur-0.364
Token panels
Token ID13043
Feature activation+9.33e-01

Pos logit contributions
theless+0.801
wcs+0.766
@#&+0.722
etheless+0.663
 RN+0.652
Neg logit contributions
 oppos-1.278
 domain-1.239
amus-1.223
 negoti-1.173
 propagate-1.171
Token.
Token ID13
Feature activation+1.29e-01

Pos logit contributions
theless+2.267
wcs+2.227
@#&+2.130
etheless+2.016
 RN+1.962
Neg logit contributions
 oppos-1.894
 domain-1.841
amus-1.776
 negoti-1.693
 propagate-1.688
Token The
Token ID383
Feature activation+6.95e-02

Pos logit contributions
theless+0.312
wcs+0.302
@#&+0.298
etheless+0.275
 RN+0.274
Neg logit contributions
 oppos-0.268
 domain-0.257
amus-0.248
 negoti-0.240
 dur-0.237
Token core
Token ID4755
Feature activation-7.22e-02

Pos logit contributions
theless+0.158
wcs+0.153
@#&+0.149
etheless+0.139
 RN+0.135
Neg logit contributions
 oppos-0.156
 domain-0.150
amus-0.146
 negoti-0.140
 dur-0.140
Token of
Token ID286
Feature activation+2.23e-01

Pos logit contributions
 oppos+0.146
 domain+0.140
amus+0.134
 dur+0.129
 negoti+0.129
Neg logit contributions
theless-0.180
wcs-0.176
@#&-0.171
etheless-0.161
racuse-0.157
Token many
Token ID867
Feature activation-4.32e-02

Pos logit contributions
theless+0.550
wcs+0.536
@#&+0.524
etheless+0.491
racuse+0.478
Neg logit contributions
 oppos-0.455
 domain-0.438
amus-0.423
 negoti-0.406
 dur-0.404
Token for
Token ID329
Feature activation+9.42e-02

Pos logit contributions
theless+0.586
wcs+0.563
@#&+0.553
etheless+0.518
 RN+0.512
Neg logit contributions
 oppos-0.530
 domain-0.513
amus-0.499
 negoti-0.479
 dur-0.473
Token watching
Token ID4964
Feature activation+7.32e-02

Pos logit contributions
theless+0.234
wcs+0.228
@#&+0.221
etheless+0.208
 RN+0.205
Neg logit contributions
 oppos-0.187
 domain-0.181
amus-0.176
 negoti-0.168
 dur-0.167
Token TV
Token ID3195
Feature activation+2.35e-01

Pos logit contributions
theless+0.182
wcs+0.177
@#&+0.173
etheless+0.162
 RN+0.161
Neg logit contributions
 oppos-0.144
 domain-0.139
amus-0.136
 negoti-0.130
 dur-0.129
Token:
Token ID25
Feature activation-2.98e-02

Pos logit contributions
theless+0.567
wcs+0.552
@#&+0.538
etheless+0.505
 RN+0.493
Neg logit contributions
 oppos-0.487
 domain-0.468
amus-0.456
 negoti-0.437
 dur-0.432
Token North
Token ID2258
Feature activation-7.80e-02

Pos logit contributions
 oppos+0.061
 domain+0.058
amus+0.056
 negoti+0.054
 dur+0.054
Neg logit contributions
theless-0.073
wcs-0.071
@#&-0.070
etheless-0.065
 RN-0.064
Token Korea
Token ID4969
Feature activation+9.20e-01

Pos logit contributions
 oppos+0.197
 domain+0.191
amus+0.184
 dur+0.179
 negoti+0.179
Neg logit contributions
theless-0.155
wcs-0.150
@#&-0.146
etheless-0.135
racuse-0.130
Token executes
Token ID42985
Feature activation+9.85e-02

Pos logit contributions
theless+2.255
wcs+2.226
@#&+2.130
 RN+2.021
etheless+2.003
Neg logit contributions
 oppos-1.781
 domain-1.723
amus-1.676
 propagate-1.593
 dur-1.582
Token 80
Token ID4019
Feature activation-8.07e-02

Pos logit contributions
theless+0.237
wcs+0.231
@#&+0.225
etheless+0.213
 RN+0.210
Neg logit contributions
 oppos-0.201
 domain-0.195
amus-0.191
 negoti-0.182
 dur-0.182
Token people
Token ID661
Feature activation-4.91e-02

Pos logit contributions
 oppos+0.158
 domain+0.153
amus+0.147
 dur+0.141
 negoti+0.141
Neg logit contributions
theless-0.204
wcs-0.200
@#&-0.196
etheless-0.185
 RN-0.179
Token in
Token ID287
Feature activation-4.65e-02

Pos logit contributions
 oppos+0.101
 domain+0.098
amus+0.094
 negoti+0.090
 dur+0.090
Neg logit contributions
theless-0.120
wcs-0.117
@#&-0.114
etheless-0.108
racuse-0.104
Token public
Token ID1171
Feature activation+4.79e-02

Pos logit contributions
 oppos+0.092
 domain+0.088
amus+0.086
 negoti+0.082
 dur+0.082
Neg logit contributions
theless-0.117
wcs-0.114
@#&-0.111
etheless-0.104
 RN-0.103
Token is
Token ID318
Feature activation-2.76e-02

Pos logit contributions
theless+0.063
wcs+0.060
@#&+0.059
etheless+0.055
 RN+0.054
Neg logit contributions
 oppos-0.052
 domain-0.050
amus-0.048
 dur-0.046
 negoti-0.046
Token very
Token ID845
Feature activation-1.73e-01

Pos logit contributions
 oppos+0.053
 domain+0.050
amus+0.049
 negoti+0.047
 propagate+0.047
Neg logit contributions
theless-0.072
wcs-0.069
@#&-0.067
etheless-0.064
 RN-0.062
Token unlikely
Token ID7485
Feature activation+2.86e-02

Pos logit contributions
 oppos+0.291
 domain+0.279
amus+0.271
 negoti+0.255
 propagate+0.254
Neg logit contributions
theless-0.485
wcs-0.473
@#&-0.460
etheless-0.436
 RN-0.427
Token the
Token ID262
Feature activation-6.94e-02

Pos logit contributions
theless+0.069
wcs+0.067
@#&+0.065
etheless+0.061
 RN+0.059
Neg logit contributions
 oppos-0.060
 domain-0.058
amus-0.056
 negoti-0.054
 dur-0.054
Token North
Token ID2258
Feature activation-5.72e-02

Pos logit contributions
 oppos+0.144
 domain+0.139
amus+0.135
 negoti+0.129
 dur+0.129
Neg logit contributions
theless-0.168
wcs-0.163
@#&-0.159
etheless-0.149
 RN-0.145
Token Korean
Token ID6983
Feature activation+9.08e-01

Pos logit contributions
 oppos+0.154
 domain+0.149
amus+0.146
 dur+0.141
 propagate+0.140
Neg logit contributions
theless-0.105
wcs-0.100
@#&-0.096
etheless-0.089
 RN-0.085
Token authorities
Token ID4773
Feature activation+9.05e-02

Pos logit contributions
theless+2.337
wcs+2.306
@#&+2.232
etheless+2.118
 RN+2.103
Neg logit contributions
 oppos-1.628
amus-1.586
 domain-1.573
 propagate-1.476
 negoti-1.452
Token would
Token ID561
Feature activation-3.72e-02

Pos logit contributions
theless+0.207
wcs+0.196
@#&+0.189
etheless+0.179
 RN+0.174
Neg logit contributions
 oppos-0.202
 domain-0.194
amus-0.189
 dur-0.180
 negoti-0.180
Token decide
Token ID5409
Feature activation-1.57e-01

Pos logit contributions
 oppos+0.074
 domain+0.071
amus+0.069
 negoti+0.066
 dur+0.066
Neg logit contributions
theless-0.092
wcs-0.089
@#&-0.086
etheless-0.082
 RN-0.080
Token to
Token ID284
Feature activation-1.48e-01

Pos logit contributions
 oppos+0.309
 domain+0.297
amus+0.284
 dur+0.274
 negoti+0.273
Neg logit contributions
theless-0.398
wcs-0.389
@#&-0.379
etheless-0.358
racuse-0.347
Token keep
Token ID1394
Feature activation-1.25e-02

Pos logit contributions
 oppos+0.279
 domain+0.269
amus+0.262
 negoti+0.249
 dur+0.247
Neg logit contributions
theless-0.382
wcs-0.372
@#&-0.363
etheless-0.340
 RN-0.335
Token same
Token ID976
Feature activation+1.43e-01

Pos logit contributions
theless+0.436
wcs+0.422
@#&+0.413
etheless+0.383
racuse+0.368
Neg logit contributions
 oppos-0.492
 domain-0.476
amus-0.465
 negoti-0.448
 dur-0.448
Token with
Token ID351
Feature activation+3.19e-01

Pos logit contributions
theless+0.344
wcs+0.335
@#&+0.328
etheless+0.306
 RN+0.298
Neg logit contributions
 oppos-0.297
 domain-0.285
amus-0.278
 negoti-0.267
 dur-0.266
Token the
Token ID262
Feature activation+1.41e-01

Pos logit contributions
theless+0.802
wcs+0.785
@#&+0.771
etheless+0.720
 RN+0.703
Neg logit contributions
 oppos-0.627
 domain-0.598
amus-0.585
 negoti-0.561
 dur-0.556
Token do
Token ID466
Feature activation+2.95e-01

Pos logit contributions
theless+0.339
wcs+0.331
@#&+0.325
etheless+0.303
 RN+0.295
Neg logit contributions
 oppos-0.292
 domain-0.279
amus-0.273
 negoti-0.262
 dur-0.260
Token..
Token ID492
Feature activation+1.88e-01

Pos logit contributions
theless+0.720
wcs+0.709
@#&+0.694
etheless+0.648
 RN+0.627
Neg logit contributions
 oppos-0.600
 domain-0.573
amus-0.557
 negoti-0.534
 dur-0.532
Tokenend
Token ID437
Feature activation+9.04e-01

Pos logit contributions
theless+0.429
wcs+0.418
@#&+0.408
etheless+0.381
racuse+0.366
Neg logit contributions
 oppos-0.422
 domain-0.406
amus-0.392
 negoti-0.380
 dur-0.379
Token syntax
Token ID15582
Feature activation+2.80e-01

Pos logit contributions
theless+2.170
wcs+2.107
@#&+2.053
etheless+1.937
 RN+1.885
Neg logit contributions
 oppos-1.876
 domain-1.771
amus-1.757
 negoti-1.679
 dur-1.674
Token e
Token ID304
Feature activation+1.01e-01

Pos logit contributions
theless+0.615
wcs+0.598
@#&+0.584
etheless+0.543
racuse+0.524
Neg logit contributions
 oppos-0.645
 domain-0.622
amus-0.605
 negoti-0.584
 dur-0.584
Token.
Token ID13
Feature activation+5.36e-02

Pos logit contributions
theless+0.272
wcs+0.266
@#&+0.261
etheless+0.241
racuse+0.239
Neg logit contributions
 oppos-0.183
 domain-0.173
 negoti-0.164
amus-0.163
 propagate-0.160
Tokeng
Token ID70
Feature activation+8.50e-02

Pos logit contributions
theless+0.147
wcs+0.145
@#&+0.142
etheless+0.132
racuse+0.131
Neg logit contributions
 oppos-0.094
 domain-0.089
amus-0.084
 negoti-0.083
 propagate-0.081
Token.:
Token ID11207
Feature activation+9.19e-02

Pos logit contributions
theless+0.214
wcs+0.209
@#&+0.204
etheless+0.192
racuse+0.187
Neg logit contributions
 oppos-0.170
 domain-0.164
amus-0.156
 dur-0.151
 negoti-0.151
Tokenext
Token ID2302
Feature activation+2.67e-01

Pos logit contributions
theless+0.115
wcs+0.113
@#&+0.109
etheless+0.100
racuse+0.097
Neg logit contributions
 oppos-0.135
 domain-0.130
amus-0.125
 dur-0.123
 negoti-0.122
Token =
Token ID796
Feature activation+3.42e-01

Pos logit contributions
theless+0.520
wcs+0.511
@#&+0.494
etheless+0.451
racuse+0.438
Neg logit contributions
 oppos-0.680
 domain-0.659
amus-0.636
 dur-0.621
 negoti-0.620
Token ($
Token ID7198
Feature activation-1.67e-01

Pos logit contributions
theless+0.811
wcs+0.795
@#&+0.774
etheless+0.721
 RN+0.704
Neg logit contributions
 oppos-0.729
 domain-0.696
amus-0.680
 dur-0.651
 negoti-0.651
Tokentext
Token ID5239
Feature activation+1.93e-01

Pos logit contributions
 oppos+0.355
 domain+0.343
amus+0.329
 dur+0.318
 negoti+0.317
Neg logit contributions
theless-0.400
wcs-0.389
@#&-0.380
etheless-0.357
racuse-0.346
Token -
Token ID532
Feature activation+7.89e-02

Pos logit contributions
theless+0.447
wcs+0.439
@#&+0.430
etheless+0.398
racuse+0.391
Neg logit contributions
 oppos-0.416
 domain-0.400
amus-0.385
 dur-0.373
 negoti-0.373
Tokensplit
Token ID35312
Feature activation+9.02e-01

Pos logit contributions
theless+0.159
wcs+0.155
@#&+0.150
etheless+0.139
racuse+0.134
Neg logit contributions
 oppos-0.197
 domain-0.191
amus-0.184
 dur-0.179
 negoti-0.179
Token "\
Token ID37082
Feature activation-1.01e+00

Pos logit contributions
theless+2.041
wcs+2.016
@#&+1.986
etheless+1.825
 RN+1.798
Neg logit contributions
 oppos-1.970
 domain-1.879
amus-1.840
 negoti-1.775
 dur-1.772
Tokenr
Token ID81
Feature activation-1.92e-01

Pos logit contributions
 oppos+2.159
 domain+2.086
amus+1.994
 dur+1.943
 negoti+1.904
Neg logit contributions
theless-2.424
wcs-2.320
@#&-2.258
etheless-2.178
racuse-2.097
Token
Token ID198
Feature activation-1.81e-01

Pos logit contributions
 oppos+0.478
 domain+0.463
amus+0.447
 dur+0.435
 negoti+0.432
Neg logit contributions
theless-0.390
wcs-0.378
@#&-0.367
etheless-0.343
racuse-0.329
Token
Token ID198
Feature activation-1.68e-01

Pos logit contributions
 oppos+0.467
 domain+0.453
amus+0.438
 dur+0.427
 negoti+0.424
Neg logit contributions
theless-0.354
wcs-0.343
@#&-0.331
etheless-0.309
racuse-0.295
Token")
Token ID4943
Feature activation+1.98e-04

Pos logit contributions
 oppos+0.358
 domain+0.345
amus+0.330
 dur+0.320
 negoti+0.320
Neg logit contributions
theless-0.402
wcs-0.392
@#&-0.382
etheless-0.358
racuse-0.348
Token the
Token ID262
Feature activation+1.29e-01

Pos logit contributions
theless+0.434
wcs+0.424
@#&+0.413
etheless+0.389
racuse+0.382
Neg logit contributions
 oppos-0.311
 domain-0.299
amus-0.287
 negoti-0.277
 dur-0.276
Token arresting
Token ID33936
Feature activation+1.20e-02

Pos logit contributions
theless+0.311
wcs+0.303
@#&+0.294
etheless+0.276
 RN+0.270
Neg logit contributions
 oppos-0.266
 domain-0.256
amus-0.249
 propagate-0.239
 negoti-0.239
Token officers
Token ID3790
Feature activation+1.31e-01

Pos logit contributions
theless+0.028
wcs+0.028
@#&+0.027
etheless+0.025
racuse+0.024
Neg logit contributions
 oppos-0.026
 domain-0.025
amus-0.024
 dur-0.023
 negoti-0.023
Token have
Token ID423
Feature activation+1.16e-01

Pos logit contributions
theless+0.296
wcs+0.287
@#&+0.279
etheless+0.261
racuse+0.252
Neg logit contributions
 oppos-0.297
 domain-0.287
amus-0.277
 dur-0.268
 negoti-0.267
Token a
Token ID257
Feature activation+7.41e-02

Pos logit contributions
theless+0.271
wcs+0.263
@#&+0.252
etheless+0.238
 RN+0.232
Neg logit contributions
 oppos-0.248
 domain-0.240
amus-0.234
 dur-0.224
 propagate-0.224
Token personal
Token ID2614
Feature activation+9.00e-01

Pos logit contributions
theless+0.192
wcs+0.188
@#&+0.184
etheless+0.174
 RN+0.170
Neg logit contributions
 oppos-0.139
 domain-0.132
amus-0.130
 negoti-0.124
 propagate-0.123
Token relationship
Token ID2776
Feature activation+2.79e-01

Pos logit contributions
theless+2.125
wcs+2.066
@#&+1.998
etheless+1.883
 RN+1.870
Neg logit contributions
 oppos-1.863
amus-1.779
 domain-1.755
 propagate-1.688
 negoti-1.671
Token of
Token ID286
Feature activation+2.28e-01

Pos logit contributions
theless+0.687
wcs+0.670
@#&+0.653
etheless+0.615
racuse+0.596
Neg logit contributions
 oppos-0.571
 domain-0.550
amus-0.527
 dur-0.509
 negoti-0.507
Token some
Token ID617
Feature activation+2.24e-02

Pos logit contributions
theless+0.598
wcs+0.587
@#&+0.570
etheless+0.534
 RN+0.530
Neg logit contributions
 oppos-0.414
 domain-0.397
amus-0.393
 propagate-0.371
 negoti-0.369
Token kind
Token ID1611
Feature activation+4.04e-01

Pos logit contributions
theless+0.058
wcs+0.057
@#&+0.056
etheless+0.052
 RN+0.051
Neg logit contributions
 oppos-0.042
 domain-0.040
amus-0.039
 propagate-0.037
 negoti-0.037
Token and
Token ID290
Feature activation+7.17e-02

Pos logit contributions
theless+1.009
wcs+0.986
@#&+0.958
etheless+0.902
racuse+0.874
Neg logit contributions
 oppos-0.807
 domain-0.777
amus-0.748
 negoti-0.717
 dur-0.715
Token //
Token ID3373
Feature activation-1.07e-01

Pos logit contributions
theless+0.880
wcs+0.852
@#&+0.832
etheless+0.773
 RN+0.760
Neg logit contributions
 oppos-0.770
 domain-0.742
amus-0.726
 negoti-0.701
 dur-0.691
Token This
Token ID770
Feature activation+1.68e-03

Pos logit contributions
 oppos+0.246
 domain+0.237
amus+0.230
 negoti+0.223
 dur+0.222
Neg logit contributions
theless-0.236
wcs-0.229
@#&-0.224
etheless-0.208
racuse-0.200
Token is
Token ID318
Feature activation+1.76e-01

Pos logit contributions
theless+0.004
wcs+0.004
@#&+0.004
etheless+0.004
 RN+0.004
Neg logit contributions
 oppos-0.003
 domain-0.003
amus-0.003
 dur-0.003
 negoti-0.003
Token the
Token ID262
Feature activation+1.69e-01

Pos logit contributions
theless+0.425
wcs+0.416
@#&+0.404
etheless+0.378
 RN+0.375
Neg logit contributions
 oppos-0.364
 domain-0.347
amus-0.336
 negoti-0.327
 dur-0.324
Token base
Token ID2779
Feature activation-2.95e-02

Pos logit contributions
theless+0.415
wcs+0.405
@#&+0.397
etheless+0.372
 RN+0.367
Neg logit contributions
 oppos-0.338
 domain-0.318
amus-0.317
 negoti-0.302
 dur-0.298
Token case
Token ID1339
Feature activation+8.86e-01

Pos logit contributions
 oppos+0.058
 domain+0.056
amus+0.054
 negoti+0.052
 dur+0.052
Neg logit contributions
theless-0.074
wcs-0.072
@#&-0.071
etheless-0.067
 RN-0.065
Token if
Token ID611
Feature activation+5.00e-01

Pos logit contributions
theless+2.091
wcs+2.028
@#&+2.009
etheless+1.854
 RN+1.831
Neg logit contributions
 oppos-1.834
 domain-1.733
amus-1.727
 negoti-1.659
 dur-1.631
Token (
Token ID357
Feature activation+5.88e-02

Pos logit contributions
theless+1.024
wcs+0.999
@#&+0.966
etheless+0.891
 RN+0.865
Neg logit contributions
 oppos-1.216
 domain-1.171
amus-1.147
 negoti-1.109
 dur-1.103
Token parts
Token ID3354
Feature activation+1.48e-01

Pos logit contributions
theless+0.153
wcs+0.150
@#&+0.146
etheless+0.138
racuse+0.134
Neg logit contributions
 oppos-0.111
 domain-0.106
amus-0.102
 dur-0.098
 negoti-0.098
Token.
Token ID13
Feature activation-5.50e-02

Pos logit contributions
theless+0.344
wcs+0.336
@#&+0.328
etheless+0.305
racuse+0.296
Neg logit contributions
 oppos-0.324
 domain-0.313
amus-0.301
 negoti-0.292
 dur-0.291
TokenAny
Token ID7149
Feature activation+9.82e-02

Pos logit contributions
 oppos+0.111
 domain+0.107
amus+0.102
 dur+0.099
 negoti+0.099
Neg logit contributions
theless-0.137
wcs-0.134
@#&-0.131
etheless-0.123
racuse-0.120
Token,
Token ID11
Feature activation+4.63e-02

Pos logit contributions
theless+0.057
wcs+0.056
@#&+0.054
etheless+0.052
 RN+0.050
Neg logit contributions
 oppos-0.043
 domain-0.041
amus-0.039
 propagate-0.038
 dur-0.038
Token according
Token ID1864
Feature activation+2.15e-01

Pos logit contributions
theless+0.107
wcs+0.104
@#&+0.101
etheless+0.095
 RN+0.092
Neg logit contributions
 oppos-0.101
 domain-0.097
amus-0.095
 dur-0.091
 propagate-0.090
Token to
Token ID284
Feature activation+3.01e-01

Pos logit contributions
theless+0.514
wcs+0.508
@#&+0.493
 RN+0.454
etheless+0.450
Neg logit contributions
 oppos-0.444
 domain-0.424
amus-0.410
 negoti-0.403
 dur-0.397
Token the
Token ID262
Feature activation+1.30e-01

Pos logit contributions
theless+0.755
wcs+0.730
@#&+0.709
etheless+0.672
 RN+0.660
Neg logit contributions
 oppos-0.591
 domain-0.561
amus-0.559
 negoti-0.530
 dur-0.529
Token company
Token ID1664
Feature activation+4.05e-01

Pos logit contributions
theless+0.316
wcs+0.307
@#&+0.298
etheless+0.281
 RN+0.275
Neg logit contributions
 oppos-0.265
 domain-0.254
amus-0.250
 dur-0.238
 negoti-0.238
Token register
Token ID7881
Feature activation+8.84e-01

Pos logit contributions
theless+1.032
wcs+1.023
@#&+0.984
etheless+0.930
 RN+0.919
Neg logit contributions
 oppos-0.766
 domain-0.716
amus-0.709
 dur-0.678
 propagate-0.672
Token.
Token ID13
Feature activation+3.75e-02

Pos logit contributions
theless+2.288
wcs+2.242
@#&+2.174
etheless+2.066
 RN+2.029
Neg logit contributions
 oppos-1.639
 domain-1.551
amus-1.504
 negoti-1.464
 dur-1.457
Token
Token ID198
Feature activation-2.16e-01

Pos logit contributions
theless+0.078
wcs+0.075
@#&+0.073
etheless+0.068
racuse+0.065
Neg logit contributions
 oppos-0.092
 domain-0.089
amus-0.086
 dur-0.083
 negoti-0.083
Token
Token ID198
Feature activation-1.88e-01

Pos logit contributions
 oppos+0.561
 domain+0.544
amus+0.526
 dur+0.513
 negoti+0.512
Neg logit contributions
theless-0.415
wcs-0.402
@#&-0.389
etheless-0.359
racuse-0.346
TokenBesides
Token ID23937
Feature activation-3.11e-01

Pos logit contributions
 oppos+0.385
 domain+0.371
amus+0.355
 negoti+0.344
 dur+0.344
Neg logit contributions
theless-0.462
wcs-0.451
@#&-0.440
etheless-0.413
racuse-0.402
Token Sh
Token ID911
Feature activation-1.01e-01

Pos logit contributions
 oppos+0.629
 domain+0.605
amus+0.581
 dur+0.560
 negoti+0.558
Neg logit contributions
theless-0.777
wcs-0.757
@#&-0.738
etheless-0.696
racuse-0.676
Token that
Token ID326
Feature activation-6.77e-02

Pos logit contributions
 oppos+0.012
 domain+0.012
amus+0.011
 negoti+0.011
 dur+0.011
Neg logit contributions
theless-0.013
wcs-0.013
@#&-0.013
etheless-0.012
racuse-0.012
Token matter
Token ID2300
Feature activation-9.64e-02

Pos logit contributions
 oppos+0.200
 domain+0.191
amus+0.191
 negoti+0.187
 dur+0.184
Neg logit contributions
wcs-0.101
theless-0.100
@#&-0.096
etheless-0.082
 RN-0.081
Token.
Token ID13
Feature activation+1.13e-01

Pos logit contributions
 oppos+0.174
 domain+0.164
amus+0.158
 negoti+0.157
 propagate+0.153
Neg logit contributions
theless-0.254
wcs-0.253
@#&-0.250
 RN-0.231
etheless-0.227
Token
Token ID198
Feature activation+1.14e-01

Pos logit contributions
theless+0.207
wcs+0.201
@#&+0.196
etheless+0.175
 RN+0.173
Neg logit contributions
 oppos-0.302
 domain-0.291
amus-0.283
 negoti-0.278
 dur-0.277
Token
Token ID198
Feature activation+1.47e-01

Pos logit contributions
theless+0.207
wcs+0.201
@#&+0.196
etheless+0.174
 RN+0.173
Neg logit contributions
 oppos-0.305
 domain-0.295
amus-0.286
 negoti-0.282
 dur-0.279
TokenFight
Token ID27365
Feature activation+8.82e-01

Pos logit contributions
wcs+0.318
theless+0.312
@#&+0.302
 RN+0.276
racuse+0.274
Neg logit contributions
 oppos-0.346
 domain-0.330
amus-0.318
 dur-0.314
 negoti-0.314
Token Back
Token ID5157
Feature activation+5.68e-01

Pos logit contributions
theless+2.216
@#&+2.186
wcs+2.125
 RN+2.056
etheless+2.012
Neg logit contributions
 oppos-1.652
amus-1.585
 negoti-1.510
 dur-1.491
 domain-1.463
Token!
Token ID0
Feature activation+1.90e-01

Pos logit contributions
theless+1.223
wcs+1.175
@#&+1.167
 RN+1.092
etheless+1.078
Neg logit contributions
 oppos-1.317
amus-1.256
 domain-1.235
 negoti-1.200
 dur-1.199
Token Sign
Token ID5865
Feature activation+4.49e-02

Pos logit contributions
theless+0.357
@#&+0.345
wcs+0.337
 RN+0.302
etheless+0.301
Neg logit contributions
 oppos-0.501
 domain-0.479
amus-0.469
 dur-0.461
 negoti-0.460
Token up
Token ID510
Feature activation-7.77e-02

Pos logit contributions
theless+0.113
wcs+0.112
@#&+0.109
etheless+0.102
 RN+0.100
Neg logit contributions
 oppos-0.086
 domain-0.083
amus-0.080
 negoti-0.078
 dur-0.077
Token for
Token ID329
Feature activation+4.39e-02

Pos logit contributions
 oppos+0.156
 domain+0.149
amus+0.144
 negoti+0.139
 dur+0.139
Neg logit contributions
theless-0.194
wcs-0.190
@#&-0.185
etheless-0.173
 RN-0.170
Tokencs
Token ID6359
Feature activation-2.13e-01

Pos logit contributions
 oppos+0.066
 domain+0.062
amus+0.060
 dur+0.058
 negoti+0.057
Neg logit contributions
theless-0.091
wcs-0.089
@#&-0.087
etheless-0.082
racuse-0.080
Token will
Token ID481
Feature activation+3.36e-01

Pos logit contributions
 oppos+0.471
 domain+0.454
amus+0.437
 dur+0.423
 negoti+0.422
Neg logit contributions
theless-0.491
wcs-0.478
@#&-0.466
etheless-0.436
racuse-0.422
Token go
Token ID467
Feature activation+1.08e-01

Pos logit contributions
theless+0.796
wcs+0.770
@#&+0.762
etheless+0.707
 RN+0.687
Neg logit contributions
 oppos-0.714
 domain-0.679
amus-0.665
 negoti-0.638
 dur-0.634
Token here
Token ID994
Feature activation-3.54e-02

Pos logit contributions
theless+0.271
wcs+0.268
@#&+0.263
etheless+0.244
 RN+0.239
Neg logit contributions
 oppos-0.214
 domain-0.204
amus-0.197
 negoti-0.190
 dur-0.189
Token +
Token ID1343
Feature activation+2.87e-01

Pos logit contributions
 oppos+0.072
 domain+0.069
amus+0.066
 dur+0.064
 negoti+0.064
Neg logit contributions
theless-0.088
wcs-0.085
@#&-0.084
etheless-0.079
racuse-0.077
Token--------------------
Token ID19351
Feature activation+8.82e-01

Pos logit contributions
theless+0.586
wcs+0.570
@#&+0.562
etheless+0.512
racuse+0.496
Neg logit contributions
 oppos-0.709
 domain-0.680
amus-0.665
 negoti-0.642
 dur-0.642
Token--+
Token ID44785
Feature activation+2.72e-01

Pos logit contributions
theless+2.524
wcs+2.481
@#&+2.456
etheless+2.314
racuse+2.268
Neg logit contributions
 oppos-1.434
 domain-1.345
amus-1.298
 negoti-1.240
 dur-1.231
Token +
Token ID1343
Feature activation+8.95e-02

Pos logit contributions
theless+0.689
wcs+0.675
@#&+0.668
etheless+0.617
 RN+0.607
Neg logit contributions
 oppos-0.529
 domain-0.498
amus-0.487
 negoti-0.468
 dur-0.463
Token--------------------
Token ID19351
Feature activation+8.19e-01

Pos logit contributions
theless+0.181
wcs+0.177
@#&+0.173
etheless+0.158
racuse+0.153
Neg logit contributions
 oppos-0.222
 domain-0.214
amus-0.208
 negoti-0.202
 dur-0.201
Token--+
Token ID44785
Feature activation+2.24e-01

Pos logit contributions
theless+2.344
wcs+2.308
@#&+2.279
etheless+2.148
racuse+2.111
Neg logit contributions
 oppos-1.328
 domain-1.248
amus-1.199
 negoti-1.149
 dur-1.140
Token <
Token ID1279
Feature activation-5.38e-03

Pos logit contributions
theless+0.540
wcs+0.529
@#&+0.523
etheless+0.480
 RN+0.474
Neg logit contributions
 oppos-0.462
 domain-0.437
amus-0.428
 negoti-0.413
 dur-0.408
Token all
Token ID477
Feature activation+6.50e-02

Pos logit contributions
 oppos+0.011
 domain+0.011
amus+0.010
 dur+0.010
 negoti+0.010
Neg logit contributions
theless-0.011
wcs-0.011
@#&-0.011
etheless-0.010
 RN-0.010
Token
Token ID851
Feature activation-9.34e-02

Pos logit contributions
theless+0.172
wcs+0.169
@#&+0.164
etheless+0.154
racuse+0.150
Neg logit contributions
 oppos-0.121
 domain-0.116
amus-0.111
 negoti-0.106
 dur-0.106
Token seems
Token ID2331
Feature activation+2.16e-01

Pos logit contributions
 oppos+0.191
 domain+0.184
amus+0.177
 dur+0.171
 negoti+0.170
Neg logit contributions
theless-0.230
wcs-0.224
@#&-0.218
etheless-0.204
 RN-0.199
Token an
Token ID281
Feature activation-1.14e-01

Pos logit contributions
theless+0.535
wcs+0.524
@#&+0.505
etheless+0.478
 RN+0.465
Neg logit contributions
 oppos-0.425
 domain-0.413
amus-0.401
 negoti-0.385
 dur-0.384
Token ab
Token ID450
Feature activation-9.93e-02

Pos logit contributions
 oppos+0.205
 domain+0.199
amus+0.191
 dur+0.183
 negoti+0.182
Neg logit contributions
theless-0.304
wcs-0.297
@#&-0.290
etheless-0.273
 RN-0.267
Tokenject
Token ID752
Feature activation+8.76e-01

Pos logit contributions
 oppos+0.204
 domain+0.194
amus+0.185
 dur+0.180
 negoti+0.180
Neg logit contributions
theless-0.246
wcs-0.238
@#&-0.233
etheless-0.219
racuse-0.215
Token failure
Token ID5287
Feature activation-1.28e-01

Pos logit contributions
theless+2.204
wcs+2.114
@#&+2.080
 RN+1.923
etheless+1.922
Neg logit contributions
 oppos-1.650
 domain-1.599
amus-1.567
 negoti-1.508
 propagate-1.496
Token of
Token ID286
Feature activation+1.29e-01

Pos logit contributions
 oppos+0.236
 domain+0.226
amus+0.217
 negoti+0.208
 dur+0.207
Neg logit contributions
theless-0.337
wcs-0.332
@#&-0.322
etheless-0.305
 RN-0.295
Token that
Token ID326
Feature activation+5.77e-02

Pos logit contributions
theless+0.328
wcs+0.321
@#&+0.314
etheless+0.295
 RN+0.289
Neg logit contributions
 oppos-0.247
 domain-0.237
amus-0.233
 dur-0.222
 negoti-0.221
Token primary
Token ID4165
Feature activation+5.05e-01

Pos logit contributions
theless+0.150
wcs+0.149
@#&+0.144
etheless+0.135
 RN+0.132
Neg logit contributions
 oppos-0.108
 domain-0.103
amus-0.100
 dur-0.095
 negoti-0.095
Token goal
Token ID3061
Feature activation+2.17e-01

Pos logit contributions
theless+1.298
wcs+1.282
@#&+1.228
etheless+1.165
 RN+1.143
Neg logit contributions
 oppos-0.947
amus-0.895
 domain-0.893
 negoti-0.841
 dur-0.838
Token bring
Token ID2222
Feature activation-4.10e-02

Pos logit contributions
theless+0.039
wcs+0.038
@#&+0.037
etheless+0.035
 RN+0.034
Neg logit contributions
 oppos-0.037
 domain-0.036
amus-0.035
 negoti-0.033
 dur-0.033
Token cash
Token ID5003
Feature activation-9.95e-03

Pos logit contributions
 oppos+0.077
 domain+0.075
amus+0.072
 negoti+0.069
 propagate+0.068
Neg logit contributions
theless-0.106
wcs-0.104
@#&-0.101
etheless-0.095
 RN-0.094
Token.
Token ID13
Feature activation-2.55e-02

Pos logit contributions
 oppos+0.019
 domain+0.018
amus+0.017
 dur+0.017
 negoti+0.016
Neg logit contributions
theless-0.026
wcs-0.026
@#&-0.025
etheless-0.024
racuse-0.023
Token Sometimes
Token ID8975
Feature activation+9.06e-03

Pos logit contributions
 oppos+0.055
 domain+0.053
amus+0.051
 negoti+0.049
 dur+0.049
Neg logit contributions
theless-0.061
wcs-0.059
@#&-0.058
etheless-0.053
 RN-0.053
Token,
Token ID11
Feature activation+7.87e-02

Pos logit contributions
theless+0.021
wcs+0.021
@#&+0.020
etheless+0.019
racuse+0.018
Neg logit contributions
 oppos-0.020
 domain-0.019
amus-0.018
 dur-0.018
 negoti-0.018
Token credit
Token ID3884
Feature activation+8.75e-01

Pos logit contributions
theless+0.184
wcs+0.179
@#&+0.175
etheless+0.163
racuse+0.158
Neg logit contributions
 oppos-0.171
 domain-0.165
amus-0.159
 dur-0.153
 negoti-0.153
Token cards
Token ID4116
Feature activation+3.03e-01

Pos logit contributions
theless+1.406
wcs+1.373
@#&+1.339
 RN+1.168
etheless+1.148
Neg logit contributions
 oppos-2.469
 domain-2.405
amus-2.372
 dur-2.284
 negoti-2.272
Token fail
Token ID2038
Feature activation-1.37e-01

Pos logit contributions
theless+0.675
wcs+0.657
@#&+0.639
etheless+0.592
 RN+0.576
Neg logit contributions
 oppos-0.689
 domain-0.666
amus-0.643
 dur-0.621
 negoti-0.619
Token,
Token ID11
Feature activation+1.09e-01

Pos logit contributions
 oppos+0.274
 domain+0.264
amus+0.252
 dur+0.244
 negoti+0.243
Neg logit contributions
theless-0.344
wcs-0.336
@#&-0.328
etheless-0.309
racuse-0.300
Token just
Token ID655
Feature activation+2.49e-02

Pos logit contributions
theless+0.253
wcs+0.246
@#&+0.240
etheless+0.224
racuse+0.217
Neg logit contributions
 oppos-0.239
 domain-0.231
amus-0.223
 dur-0.215
 negoti-0.214
Token because
Token ID780
Feature activation+6.70e-02

Pos logit contributions
theless+0.062
wcs+0.061
@#&+0.059
etheless+0.056
 RN+0.054
Neg logit contributions
 oppos-0.050
 domain-0.048
amus-0.046
 dur-0.044
 negoti-0.044
Token dis
Token ID595
Feature activation+2.71e-01

Pos logit contributions
theless+0.495
wcs+0.485
@#&+0.471
etheless+0.442
 RN+0.435
Neg logit contributions
 oppos-0.362
 domain-0.347
amus-0.338
 dur-0.319
 negoti-0.319
Tokencharges
Token ID34948
Feature activation-8.34e-02

Pos logit contributions
theless+0.592
wcs+0.576
@#&+0.561
etheless+0.522
racuse+0.505
Neg logit contributions
 oppos-0.632
 domain-0.611
amus-0.588
 dur-0.571
 negoti-0.570
Token,
Token ID11
Feature activation+3.13e-02

Pos logit contributions
 oppos+0.177
 domain+0.171
amus+0.164
 dur+0.159
 negoti+0.158
Neg logit contributions
theless-0.199
wcs-0.195
@#&-0.190
etheless-0.178
racuse-0.172
Token or
Token ID393
Feature activation-1.06e-01

Pos logit contributions
theless+0.075
wcs+0.074
@#&+0.072
etheless+0.067
racuse+0.065
Neg logit contributions
 oppos-0.065
 domain-0.063
amus-0.061
 negoti-0.058
 dur-0.058
Token even
Token ID772
Feature activation+1.43e-01

Pos logit contributions
 oppos+0.205
 domain+0.197
amus+0.190
 negoti+0.182
 dur+0.181
Neg logit contributions
theless-0.270
wcs-0.264
@#&-0.258
etheless-0.241
 RN-0.236
Token fix
Token ID4259
Feature activation+8.67e-01

Pos logit contributions
theless+0.356
wcs+0.348
@#&+0.341
etheless+0.316
 RN+0.312
Neg logit contributions
 oppos-0.282
 domain-0.271
amus-0.265
 negoti-0.253
 dur-0.249
Token a
Token ID257
Feature activation+4.35e-03

Pos logit contributions
theless+2.150
wcs+2.094
@#&+2.058
etheless+1.916
 RN+1.890
Neg logit contributions
 oppos-1.695
 domain-1.649
amus-1.591
 negoti-1.531
 dur-1.509
Token breach
Token ID13694
Feature activation-3.20e-01

Pos logit contributions
theless+0.011
wcs+0.011
@#&+0.011
etheless+0.010
 RN+0.010
Neg logit contributions
 oppos-0.009
 domain-0.008
amus-0.008
 negoti-0.008
 dur-0.008
Token in
Token ID287
Feature activation+3.34e-03

Pos logit contributions
 oppos+0.611
 domain+0.584
amus+0.560
 negoti+0.539
 dur+0.537
Neg logit contributions
theless-0.835
wcs-0.816
@#&-0.798
etheless-0.753
racuse-0.730
Token Tal
Token ID7193
Feature activation+4.77e-03

Pos logit contributions
theless+0.009
wcs+0.009
@#&+0.008
etheless+0.008
racuse+0.008
Neg logit contributions
 oppos-0.006
 domain-0.006
amus-0.006
 negoti-0.006
 dur-0.006
Tokenos
Token ID418
Feature activation+3.20e-01

Pos logit contributions
theless+0.011
wcs+0.011
@#&+0.010
etheless+0.010
racuse+0.009
Neg logit contributions
 oppos-0.011
 domain-0.010
amus-0.010
 dur-0.009
 negoti-0.009
Token,
Token ID11
Feature activation-4.63e-02

Pos logit contributions
 oppos+0.864
 domain+0.828
amus+0.800
 dur+0.767
 negoti+0.765
Neg logit contributions
theless-1.064
wcs-1.038
@#&-1.012
etheless-0.950
 RN-0.932
Token including
Token ID1390
Feature activation+2.35e-02

Pos logit contributions
 oppos+0.091
 domain+0.088
amus+0.085
 propagate+0.081
 negoti+0.081
Neg logit contributions
theless-0.116
wcs-0.114
@#&-0.111
etheless-0.103
 RN-0.102
Token headset
Token ID23492
Feature activation+3.00e-03

Pos logit contributions
theless+0.062
wcs+0.060
@#&+0.059
etheless+0.055
 RN+0.055
Neg logit contributions
 oppos-0.044
 domain-0.042
amus-0.041
 negoti-0.039
 propagate-0.039
Token,
Token ID11
Feature activation+5.72e-02

Pos logit contributions
theless+0.007
wcs+0.007
@#&+0.007
etheless+0.006
 RN+0.006
Neg logit contributions
 oppos-0.006
 domain-0.006
amus-0.006
 negoti-0.006
 propagate-0.006
Token cr
Token ID1067
Feature activation-2.25e-01

Pos logit contributions
theless+0.156
wcs+0.153
@#&+0.149
 RN+0.140
etheless+0.140
Neg logit contributions
 oppos-0.099
 domain-0.095
amus-0.092
 negoti-0.087
 propagate-0.087
Tokenanks
Token ID2283
Feature activation-1.23e+00

Pos logit contributions
 oppos+0.421
 domain+0.401
amus+0.381
 dur+0.368
 negoti+0.368
Neg logit contributions
theless-0.596
wcs-0.581
@#&-0.568
etheless-0.537
racuse-0.524
Token and
Token ID290
Feature activation+8.93e-02

Pos logit contributions
 oppos+2.277
 domain+2.164
amus+2.080
 dur+1.994
 trap+1.971
Neg logit contributions
theless-3.315
wcs-3.243
@#&-3.172
etheless-3.013
racuse-2.923
Token brakes
Token ID23746
Feature activation-3.70e-01

Pos logit contributions
theless+0.241
wcs+0.236
@#&+0.229
 RN+0.216
etheless+0.215
Neg logit contributions
 oppos-0.156
 domain-0.150
amus-0.147
 propagate-0.138
 negoti-0.137
Token.
Token ID13
Feature activation+9.73e-02

Pos logit contributions
 oppos+0.716
 domain+0.688
amus+0.657
 dur+0.634
 negoti+0.631
Neg logit contributions
theless-0.957
wcs-0.934
@#&-0.913
etheless-0.861
racuse-0.838
Token The
Token ID383
Feature activation+5.01e-02

Pos logit contributions
theless+0.237
wcs+0.231
@#&+0.227
etheless+0.209
 RN+0.208
Neg logit contributions
 oppos-0.201
 domain-0.193
amus-0.185
 negoti-0.181
 propagate-0.179
Token wheels
Token ID13666
Feature activation-1.22e-01

Pos logit contributions
theless+0.128
wcs+0.124
@#&+0.121
etheless+0.113
 RN+0.112
Neg logit contributions
 oppos-0.097
 domain-0.092
amus-0.090
 propagate-0.085
 negoti-0.085
Token path
Token ID3108
Feature activation+1.71e-01

Pos logit contributions
 oppos+0.296
 domain+0.281
amus+0.274
 negoti+0.265
 dur+0.263
Neg logit contributions
theless-0.338
wcs-0.330
@#&-0.324
etheless-0.302
 RN-0.294
Token,
Token ID837
Feature activation-1.91e-01

Pos logit contributions
wcs+0.389
theless+0.388
@#&+0.375
etheless+0.345
 RN+0.342
Neg logit contributions
 oppos-0.373
 domain-0.355
amus-0.346
 negoti-0.335
 dur-0.333
Token 4
Token ID604
Feature activation-2.87e-01

Pos logit contributions
 oppos+0.389
 domain+0.370
amus+0.362
 negoti+0.348
 dur+0.345
Neg logit contributions
theless-0.467
wcs-0.456
@#&-0.447
etheless-0.417
 RN-0.407
Token )
Token ID1267
Feature activation-3.48e-01

Pos logit contributions
 oppos+0.529
 domain+0.507
amus+0.486
 negoti+0.466
 dur+0.466
Neg logit contributions
theless-0.764
wcs-0.751
@#&-0.733
etheless-0.689
racuse-0.672
Token =
Token ID796
Feature activation+1.46e-01

Pos logit contributions
 oppos+0.851
 domain+0.824
amus+0.795
 dur+0.773
 negoti+0.771
Neg logit contributions
theless-0.721
wcs-0.700
@#&-0.679
etheless-0.631
racuse-0.609
Token '.
Token ID45302
Feature activation-1.11e+00

Pos logit contributions
theless+0.340
wcs+0.331
@#&+0.327
etheless+0.301
 RN+0.296
Neg logit contributions
 oppos-0.314
 domain-0.300
amus-0.294
 negoti-0.283
 dur-0.281
Tokencl
Token ID565
Feature activation+3.54e-01

Pos logit contributions
 oppos+2.349
 domain+2.273
amus+2.168
 dur+2.101
 negoti+2.084
Neg logit contributions
theless-2.692
wcs-2.612
@#&-2.544
etheless-2.410
racuse-2.335
Tokenj
Token ID73
Feature activation+1.71e-01

Pos logit contributions
theless+1.005
wcs+0.985
@#&+0.964
etheless+0.914
racuse+0.893
Neg logit contributions
 oppos-0.593
 domain-0.565
amus-0.536
 dur-0.515
 negoti-0.512
Token'
Token ID6
Feature activation-1.73e-01

Pos logit contributions
theless+0.415
wcs+0.408
@#&+0.398
etheless+0.371
racuse+0.362
Neg logit contributions
 oppos-0.353
 domain-0.337
amus-0.327
 negoti-0.315
 dur-0.314
Token OR
Token ID6375
Feature activation+3.51e-01

Pos logit contributions
 oppos+0.392
 domain+0.376
amus+0.368
 negoti+0.354
 dur+0.353
Neg logit contributions
theless-0.385
wcs-0.378
@#&-0.370
etheless-0.340
 RN-0.332
Token RIGHT
Token ID33621
Feature activation+2.66e-01

Pos logit contributions
theless+0.810
wcs+0.792
@#&+0.781
etheless+0.718
 RN+0.707
Neg logit contributions
 oppos-0.758
 domain-0.723
amus-0.713
 negoti-0.688
 dur-0.677
Token rules
Token ID3173
Feature activation-9.58e-02

Pos logit contributions
 oppos+0.618
 domain+0.595
amus+0.575
 negoti+0.552
 dur+0.552
Neg logit contributions
theless-0.660
wcs-0.648
@#&-0.622
etheless-0.586
 RN-0.569
Token mean
Token ID1612
Feature activation+1.91e-01

Pos logit contributions
 oppos+0.197
 domain+0.189
amus+0.183
 negoti+0.176
 dur+0.175
Neg logit contributions
theless-0.233
wcs-0.227
@#&-0.221
etheless-0.208
 RN-0.202
Token judges
Token ID10266
Feature activation+2.34e-01

Pos logit contributions
theless+0.461
wcs+0.452
@#&+0.438
etheless+0.412
 RN+0.400
Neg logit contributions
 oppos-0.396
 domain-0.381
amus-0.371
 dur-0.356
 negoti-0.355
Token often
Token ID1690
Feature activation-8.55e-02

Pos logit contributions
theless+0.534
wcs+0.520
@#&+0.506
etheless+0.472
 RN+0.461
Neg logit contributions
 oppos-0.514
 domain-0.495
amus-0.481
 dur-0.462
 negoti-0.459
Token pres
Token ID906
Feature activation+6.45e-02

Pos logit contributions
 oppos+0.186
 domain+0.179
amus+0.173
 dur+0.167
 negoti+0.166
Neg logit contributions
theless-0.200
wcs-0.193
@#&-0.188
etheless-0.177
racuse-0.171
Tokenide
Token ID485
Feature activation-1.10e+00

Pos logit contributions
theless+0.140
wcs+0.136
@#&+0.133
etheless+0.124
racuse+0.120
Neg logit contributions
 oppos-0.151
 domain-0.146
amus-0.141
 dur-0.137
 negoti-0.136
Token over
Token ID625
Feature activation-4.06e-01

Pos logit contributions
 oppos+2.229
 domain+2.158
amus+2.039
 dur+2.005
 negoti+1.971
Neg logit contributions
theless-2.738
wcs-2.650
@#&-2.609
etheless-2.439
racuse-2.354
Token cases
Token ID2663
Feature activation+1.59e-01

Pos logit contributions
 oppos+0.761
 domain+0.729
amus+0.704
 dur+0.673
 negoti+0.672
Neg logit contributions
theless-1.063
wcs-1.042
@#&-1.015
etheless-0.956
racuse-0.931
Token involving
Token ID7411
Feature activation+6.46e-02

Pos logit contributions
theless+0.385
wcs+0.375
@#&+0.366
etheless+0.344
racuse+0.334
Neg logit contributions
 oppos-0.332
 domain-0.319
amus-0.306
 dur-0.296
 negoti-0.295
Token major
Token ID1688
Feature activation+1.41e-01

Pos logit contributions
theless+0.167
wcs+0.164
@#&+0.159
etheless+0.150
 RN+0.147
Neg logit contributions
 oppos-0.122
 domain-0.117
amus-0.114
 negoti-0.109
 dur-0.108
Token campaign
Token ID1923
Feature activation+5.09e-01

Pos logit contributions
theless+0.357
wcs+0.349
@#&+0.339
etheless+0.320
 RN+0.311
Neg logit contributions
 oppos-0.273
 domain-0.262
amus-0.258
 negoti-0.244
 propagate-0.243
Token credit
Token ID3884
Feature activation+2.98e-01

Pos logit contributions
 oppos+0.786
 domain+0.757
amus+0.728
 dur+0.706
 negoti+0.704
Neg logit contributions
theless-0.832
wcs-0.810
@#&-0.789
etheless-0.739
racuse-0.716
Token:
Token ID25
Feature activation-1.02e-01

Pos logit contributions
theless+0.705
wcs+0.687
@#&+0.671
etheless+0.628
racuse+0.609
Neg logit contributions
 oppos-0.639
 domain-0.615
amus-0.591
 negoti-0.572
 dur-0.571
Token Cop
Token ID6955
Feature activation-3.93e-02

Pos logit contributions
 oppos+0.213
 domain+0.203
amus+0.197
 negoti+0.190
 dur+0.188
Neg logit contributions
theless-0.249
wcs-0.242
@#&-0.236
etheless-0.222
 RN-0.217
Token Richard
Token ID6219
Feature activation-2.51e-02

Pos logit contributions
 oppos+0.082
 domain+0.079
amus+0.075
 dur+0.073
 negoti+0.073
Neg logit contributions
theless-0.095
wcs-0.094
@#&-0.091
etheless-0.085
racuse-0.083
Token,
Token ID11
Feature activation-1.74e-01

Pos logit contributions
 oppos+0.051
 domain+0.049
amus+0.047
 dur+0.046
 negoti+0.046
Neg logit contributions
theless-0.062
wcs-0.060
@#&-0.059
etheless-0.055
racuse-0.054
Token royalty
Token ID29359
Feature activation-1.09e+00

Pos logit contributions
 oppos+0.418
 domain+0.403
amus+0.391
 negoti+0.379
 dur+0.378
Neg logit contributions
theless-0.367
wcs-0.356
@#&-0.347
etheless-0.322
 RN-0.311
Token free
Token ID1479
Feature activation-5.51e-01

Pos logit contributions
 oppos+2.283
 domain+2.240
amus+2.096
 negoti+2.049
 dur+2.034
Neg logit contributions
theless-2.653
wcs-2.556
@#&-2.506
etheless-2.372
racuse-2.293
Token<|endoftext|>
Token ID50256
Feature activation-3.42e-02

Pos logit contributions
 oppos+1.217
 domain+1.174
amus+1.129
 dur+1.094
 negoti+1.090
Neg logit contributions
theless-1.273
wcs-1.239
@#&-1.207
etheless-1.131
racuse-1.095
TokenJ
Token ID41
Feature activation+2.81e-01

Pos logit contributions
 oppos+0.071
 domain+0.069
amus+0.066
 negoti+0.064
 dur+0.064
Neg logit contributions
theless-0.083
wcs-0.081
@#&-0.079
etheless-0.074
racuse-0.072
Tokenür
Token ID25151
Feature activation-1.89e-01

Pos logit contributions
theless+0.664
wcs+0.645
@#&+0.631
etheless+0.585
racuse+0.575
Neg logit contributions
 oppos-0.596
 domain-0.575
amus-0.545
 dur-0.540
 negoti-0.536
Tokengen
Token ID5235
Feature activation+1.61e-02

Pos logit contributions
 oppos+0.391
 domain+0.375
amus+0.357
 dur+0.349
 negoti+0.348
Neg logit contributions
theless-0.462
wcs-0.451
@#&-0.439
etheless-0.412
racuse-0.402
Token Gutenberg
Token ID20336
Feature activation-1.92e-01

Pos logit contributions
 oppos+1.011
 domain+0.979
amus+0.942
 dur+0.914
 negoti+0.910
Neg logit contributions
theless-0.963
wcs-0.935
@#&-0.909
etheless-0.850
racuse-0.822
Token (
Token ID357
Feature activation+1.28e-01

Pos logit contributions
 oppos+0.382
 domain+0.365
amus+0.350
 dur+0.339
 negoti+0.339
Neg logit contributions
theless-0.483
wcs-0.472
@#&-0.461
etheless-0.433
racuse-0.421
Tokenwww
Token ID2503
Feature activation-7.15e-02

Pos logit contributions
theless+0.274
wcs+0.268
@#&+0.259
etheless+0.240
 RN+0.230
Neg logit contributions
 oppos-0.302
 domain-0.291
amus-0.281
 negoti-0.275
 dur-0.274
Token.
Token ID13
Feature activation-3.98e-02

Pos logit contributions
 oppos+0.127
 domain+0.121
amus+0.115
 negoti+0.114
 propagate+0.111
Neg logit contributions
theless-0.192
wcs-0.190
@#&-0.187
etheless-0.172
racuse-0.172
Tokeng
Token ID70
Feature activation-9.77e-02

Pos logit contributions
 oppos+0.083
 domain+0.079
amus+0.076
 negoti+0.074
 dur+0.074
Neg logit contributions
theless-0.098
wcs-0.096
@#&-0.093
etheless-0.087
racuse-0.084
Tokenutenberg
Token ID19028
Feature activation-1.09e+00

Pos logit contributions
 oppos+0.171
 domain+0.163
amus+0.155
 dur+0.149
 negoti+0.149
Neg logit contributions
theless-0.270
wcs-0.265
@#&-0.259
etheless-0.244
racuse-0.239
Token.
Token ID13
Feature activation+5.14e-02

Pos logit contributions
 oppos+1.724
 domain+1.669
amus+1.565
 dur+1.482
 negoti+1.458
Neg logit contributions
theless-3.191
wcs-3.106
@#&-3.057
etheless-2.904
racuse-2.829
Tokenorg
Token ID2398
Feature activation+1.33e-03

Pos logit contributions
theless+0.136
wcs+0.134
@#&+0.130
etheless+0.124
racuse+0.120
Neg logit contributions
 oppos-0.095
 domain-0.091
amus-0.087
 dur-0.084
 negoti-0.084
Token).
Token ID737
Feature activation-3.61e-02

Pos logit contributions
theless+0.003
wcs+0.003
@#&+0.003
etheless+0.003
racuse+0.003
Neg logit contributions
 oppos-0.003
 domain-0.003
amus-0.003
 dur-0.003
 negoti-0.003
Token Any
Token ID4377
Feature activation+2.58e-01

Pos logit contributions
 oppos+0.077
 domain+0.074
amus+0.071
 negoti+0.070
 dur+0.069
Neg logit contributions
theless-0.085
wcs-0.083
@#&-0.082
etheless-0.076
 RN-0.074
Token file
Token ID2393
Feature activation-4.02e-01

Pos logit contributions
theless+0.628
wcs+0.608
@#&+0.598
etheless+0.562
 RN+0.543
Neg logit contributions
 oppos-0.537
 domain-0.506
amus-0.495
 negoti-0.481
 dur-0.480
Token path
Token ID3108
Feature activation+6.69e-02

Pos logit contributions
 oppos+0.100
 domain+0.095
amus+0.093
 negoti+0.089
 dur+0.089
Neg logit contributions
theless-0.116
wcs-0.114
@#&-0.112
etheless-0.104
 RN-0.101
Token,
Token ID837
Feature activation-1.77e-01

Pos logit contributions
theless+0.167
wcs+0.163
@#&+0.159
etheless+0.150
racuse+0.146
Neg logit contributions
 oppos-0.135
 domain-0.130
amus-0.124
 dur-0.120
 negoti-0.120
Token 5
Token ID642
Feature activation-3.68e-01

Pos logit contributions
 oppos+0.426
 domain+0.411
amus+0.401
 negoti+0.388
 dur+0.385
Neg logit contributions
theless-0.370
wcs-0.359
@#&-0.351
etheless-0.323
 RN-0.314
Token )
Token ID1267
Feature activation-3.96e-01

Pos logit contributions
 oppos+0.603
 domain+0.575
amus+0.542
 dur+0.521
 negoti+0.514
Neg logit contributions
theless-1.067
wcs-1.039
@#&-1.019
etheless-0.974
racuse-0.945
Token =
Token ID796
Feature activation+9.04e-02

Pos logit contributions
 oppos+0.969
 domain+0.941
amus+0.900
 dur+0.882
 negoti+0.870
Neg logit contributions
theless-0.830
wcs-0.794
@#&-0.770
etheless-0.732
racuse-0.695
Token '.
Token ID45302
Feature activation-1.06e+00

Pos logit contributions
theless+0.258
wcs+0.253
@#&+0.249
etheless+0.235
 RN+0.230
Neg logit contributions
 oppos-0.148
 domain-0.139
amus-0.134
 negoti-0.128
 dur-0.126
Tokencl
Token ID565
Feature activation+2.67e-01

Pos logit contributions
 oppos+1.341
 domain+1.272
amus+1.184
 dur+1.109
 negoti+1.093
Neg logit contributions
theless-3.466
wcs-3.383
@#&-3.328
etheless-3.192
racuse-3.122
Tokenjc
Token ID48055
Feature activation-1.19e-01

Pos logit contributions
theless+0.695
wcs+0.684
@#&+0.666
etheless+0.626
racuse+0.613
Neg logit contributions
 oppos-0.507
 domain-0.483
amus-0.458
 dur-0.447
 negoti-0.446
Token'
Token ID6
Feature activation-1.81e-01

Pos logit contributions
 oppos+0.242
 domain+0.233
amus+0.223
 dur+0.215
 negoti+0.215
Neg logit contributions
theless-0.295
wcs-0.287
@#&-0.281
etheless-0.264
racuse-0.257
Token OR
Token ID6375
Feature activation+3.07e-01

Pos logit contributions
 oppos+0.472
 domain+0.457
amus+0.445
 negoti+0.432
 dur+0.431
Neg logit contributions
theless-0.343
wcs-0.333
@#&-0.324
etheless-0.295
racuse-0.285
Token RIGHT
Token ID33621
Feature activation+1.72e-01

Pos logit contributions
theless+0.716
wcs+0.700
@#&+0.689
etheless+0.636
 RN+0.624
Neg logit contributions
 oppos-0.657
 domain-0.630
amus-0.619
 negoti-0.595
 dur-0.588
Token R
Token ID371
Feature activation-1.77e-01

Pos logit contributions
 oppos+0.377
 domain+0.358
amus+0.341
 negoti+0.329
 dur+0.328
Neg logit contributions
theless-0.614
wcs-0.600
@#&-0.588
etheless-0.553
 RN-0.545
Tokenaim
Token ID1385
Feature activation+1.74e-01

Pos logit contributions
 oppos+0.408
 domain+0.394
 negoti+0.379
 dur+0.375
 propagate+0.369
Neg logit contributions
theless-0.379
@#&-0.365
wcs-0.364
etheless-0.335
 RN-0.329
Tokenondo
Token ID22311
Feature activation+1.05e-01

Pos logit contributions
theless+0.530
wcs+0.520
@#&+0.516
etheless+0.485
racuse+0.478
Neg logit contributions
 oppos-0.255
 domain-0.241
amus-0.223
 dur-0.215
 negoti-0.215
Token<|endoftext|>
Token ID50256
Feature activation+4.73e-04

Pos logit contributions
theless+0.203
wcs+0.197
@#&+0.193
etheless+0.178
 RN+0.174
Neg logit contributions
 oppos-0.267
 domain-0.257
amus-0.249
 negoti-0.246
 dur-0.243
Tokenlic
Token ID677
Feature activation-1.63e-01

Pos logit contributions
theless+0.001
wcs+0.001
@#&+0.001
etheless+0.001
racuse+0.001
Neg logit contributions
 oppos-0.001
 domain-0.001
amus-0.001
 negoti-0.001
 dur-0.001
Tokenhess
Token ID33979
Feature activation-1.06e+00

Pos logit contributions
 oppos+0.347
 domain+0.335
amus+0.321
 dur+0.312
 negoti+0.311
Neg logit contributions
theless-0.390
wcs-0.378
@#&-0.369
etheless-0.348
racuse-0.336
Token.
Token ID13
Feature activation+1.50e-02

Pos logit contributions
 oppos+2.238
 domain+2.163
amus+2.070
 dur+2.002
 negoti+1.995
Neg logit contributions
theless-2.531
wcs-2.468
@#&-2.402
etheless-2.260
racuse-2.189
Tokenorg
Token ID2398
Feature activation+5.69e-02

Pos logit contributions
theless+0.038
wcs+0.037
@#&+0.036
etheless+0.034
racuse+0.033
Neg logit contributions
 oppos-0.030
 domain-0.029
amus-0.027
 dur-0.026
 negoti-0.026
Token is
Token ID318
Feature activation-4.51e-02

Pos logit contributions
theless+0.125
wcs+0.121
@#&+0.118
etheless+0.110
 RN+0.107
Neg logit contributions
 oppos-0.131
 domain-0.125
amus-0.123
 dur-0.118
 negoti-0.118
Token getting
Token ID1972
Feature activation+1.18e-01

Pos logit contributions
 oppos+0.086
 domain+0.081
amus+0.079
 negoti+0.077
 dur+0.076
Neg logit contributions
theless-0.117
wcs-0.113
@#&-0.111
etheless-0.105
 RN-0.102
Token better
Token ID1365
Feature activation+1.72e-01

Pos logit contributions
theless+0.297
wcs+0.288
@#&+0.281
etheless+0.264
 RN+0.260
Neg logit contributions
 oppos-0.232
 domain-0.221
amus-0.218
 negoti-0.209
 dur-0.209
Token damn
Token ID12270
Feature activation-6.86e-02

Pos logit contributions
 oppos+0.043
 domain+0.041
amus+0.041
 negoti+0.039
 dur+0.039
Neg logit contributions
theless-0.054
wcs-0.053
@#&-0.051
etheless-0.048
 RN-0.048
Token cle
Token ID1190
Feature activation+6.33e-02

Pos logit contributions
 oppos+0.137
 domain+0.131
amus+0.127
 dur+0.123
 negoti+0.122
Neg logit contributions
theless-0.171
wcs-0.167
@#&-0.163
etheless-0.153
racuse-0.149
Tokenats
Token ID1381
Feature activation-3.43e-01

Pos logit contributions
theless+0.193
wcs+0.190
@#&+0.185
etheless+0.176
racuse+0.174
Neg logit contributions
 oppos-0.092
 domain-0.087
amus-0.080
 dur-0.077
 negoti-0.077
Token.
Token ID13
Feature activation+2.45e-02

Pos logit contributions
 oppos+0.665
 domain+0.638
amus+0.610
 dur+0.588
 negoti+0.586
Neg logit contributions
theless-0.886
wcs-0.865
@#&-0.845
etheless-0.798
racuse-0.776
Token Conf
Token ID7326
Feature activation-2.51e-01

Pos logit contributions
theless+0.051
wcs+0.049
@#&+0.048
etheless+0.044
 RN+0.043
Neg logit contributions
 oppos-0.060
 domain-0.058
amus-0.056
 negoti-0.054
 dur-0.054
Tokenession
Token ID2521
Feature activation-1.05e+00

Pos logit contributions
 oppos+0.456
 domain+0.437
amus+0.416
 dur+0.401
 negoti+0.400
Neg logit contributions
theless-0.675
wcs-0.662
@#&-0.645
etheless-0.611
racuse-0.597
Token time
Token ID640
Feature activation+9.64e-02

Pos logit contributions
 oppos+2.437
 domain+2.363
amus+2.272
 dur+2.209
 trap+2.195
Neg logit contributions
theless-2.297
wcs-2.233
@#&-2.181
etheless-2.037
racuse-1.964
Token:
Token ID25
Feature activation+1.28e-01

Pos logit contributions
theless+0.248
wcs+0.243
@#&+0.237
etheless+0.223
 RN+0.217
Neg logit contributions
 oppos-0.185
 domain-0.177
amus-0.170
 negoti-0.164
 dur-0.163
Token We
Token ID775
Feature activation+5.41e-02

Pos logit contributions
theless+0.270
wcs+0.262
@#&+0.253
etheless+0.234
 RN+0.229
Neg logit contributions
 oppos-0.302
 domain-0.293
amus-0.288
 negoti-0.278
 dur-0.275
Token don
Token ID836
Feature activation+6.64e-02

Pos logit contributions
theless+0.125
wcs+0.121
@#&+0.118
etheless+0.110
 RN+0.107
Neg logit contributions
 oppos-0.118
 domain-0.114
amus-0.111
 negoti-0.107
 dur-0.107
Token't
Token ID470
Feature activation-2.65e-02

Pos logit contributions
theless+0.140
wcs+0.137
@#&+0.134
etheless+0.121
racuse+0.120
Neg logit contributions
 oppos-0.158
 domain-0.153
amus-0.147
 negoti-0.145
 dur-0.143
Token,
Token ID11
Feature activation-1.81e-02

Pos logit contributions
 oppos+0.291
 domain+0.281
amus+0.269
 negoti+0.257
 propagate+0.256
Neg logit contributions
theless-0.454
wcs-0.443
@#&-0.434
etheless-0.409
 RN-0.403
Token at
Token ID379
Feature activation-6.64e-02

Pos logit contributions
 oppos+0.037
 domain+0.035
amus+0.035
 negoti+0.033
 dur+0.033
Neg logit contributions
theless-0.044
wcs-0.043
@#&-0.042
etheless-0.039
 RN-0.039
Token whatever
Token ID4232
Feature activation-7.23e-01

Pos logit contributions
 oppos+0.143
 domain+0.138
amus+0.133
 negoti+0.129
 propagate+0.128
Neg logit contributions
theless-0.153
wcs-0.150
@#&-0.148
etheless-0.135
 RN-0.133
Token cost
Token ID1575
Feature activation-1.25e-01

Pos logit contributions
 oppos+1.555
 domain+1.497
amus+1.444
 dur+1.392
 negoti+1.388
Neg logit contributions
theless-1.713
wcs-1.668
@#&-1.626
etheless-1.523
racuse-1.478
Token,
Token ID11
Feature activation-1.30e-01

Pos logit contributions
 oppos+0.273
 domain+0.264
amus+0.253
 dur+0.245
 negoti+0.244
Neg logit contributions
theless-0.293
wcs-0.286
@#&-0.278
etheless-0.261
racuse-0.253
Token whichever
Token ID26204
Feature activation-1.04e+00

Pos logit contributions
 oppos+0.238
 domain+0.230
amus+0.223
 negoti+0.212
 dur+0.211
Neg logit contributions
theless-0.342
wcs-0.333
@#&-0.326
etheless-0.307
 RN-0.303
Token Democrat
Token ID9755
Feature activation-1.63e-01

Pos logit contributions
 oppos+1.692
 domain+1.613
amus+1.539
 dur+1.463
 negoti+1.454
Neg logit contributions
theless-3.004
wcs-2.937
@#&-2.875
etheless-2.729
racuse-2.659
Token is
Token ID318
Feature activation-1.42e-01

Pos logit contributions
 oppos+0.399
 domain+0.386
amus+0.374
 dur+0.362
 propagate+0.359
Neg logit contributions
theless-0.338
wcs-0.328
@#&-0.317
etheless-0.293
 RN-0.284
Token running
Token ID2491
Feature activation-2.26e-01

Pos logit contributions
 oppos+0.289
 domain+0.279
amus+0.269
 dur+0.258
 negoti+0.256
Neg logit contributions
theless-0.346
wcs-0.340
@#&-0.330
etheless-0.309
 RN-0.304
Token….
Token ID11580
Feature activation-2.46e-01

Pos logit contributions
 oppos+0.438
 domain+0.420
amus+0.403
 dur+0.386
 negoti+0.386
Neg logit contributions
theless-0.580
wcs-0.568
@#&-0.555
etheless-0.522
racuse-0.508
Token True
Token ID6407
Feature activation-8.58e-02

Pos logit contributions
 oppos+0.509
 domain+0.490
amus+0.470
 negoti+0.456
 dur+0.454
Neg logit contributions
theless-0.603
wcs-0.586
@#&-0.576
etheless-0.537
 RN-0.520
Tokening
Token ID278
Feature activation-1.47e-01

Pos logit contributions
 oppos+0.644
 domain+0.617
amus+0.603
 negoti+0.587
 dur+0.584
Neg logit contributions
theless-0.515
wcs-0.506
@#&-0.488
etheless-0.443
 RN-0.438
Token,
Token ID837
Feature activation-4.20e-01

Pos logit contributions
 oppos+0.297
 domain+0.283
amus+0.275
 negoti+0.269
 dur+0.265
Neg logit contributions
theless-0.357
wcs-0.354
@#&-0.343
 RN-0.319
etheless-0.318
Token _
Token ID4808
Feature activation-1.34e-01

Pos logit contributions
 oppos+0.830
 domain+0.792
amus+0.774
 negoti+0.737
 dur+0.729
Neg logit contributions
theless-1.057
wcs-1.035
@#&-1.006
etheless-0.944
racuse-0.928
Token :
Token ID1058
Feature activation-2.74e-01

Pos logit contributions
 oppos+0.275
 domain+0.264
amus+0.254
 negoti+0.245
 dur+0.244
Neg logit contributions
theless-0.328
wcs-0.323
@#&-0.313
etheless-0.293
racuse-0.287
Token &
Token ID1222
Feature activation-5.90e-01

Pos logit contributions
 oppos+0.501
 domain+0.477
amus+0.467
 negoti+0.445
 dur+0.438
Neg logit contributions
theless-0.727
wcs-0.713
@#&-0.695
etheless-0.653
 RN-0.650
Token '
Token ID705
Feature activation-1.03e+00

Pos logit contributions
 oppos+1.054
 domain+1.005
amus+0.963
 negoti+0.922
 dur+0.921
Neg logit contributions
theless-1.605
wcs-1.574
@#&-1.540
etheless-1.452
racuse-1.423
Tokena
Token ID64
Feature activation-3.51e-01

Pos logit contributions
 oppos+1.799
 domain+1.728
amus+1.642
 dur+1.575
 negoti+1.556
Neg logit contributions
theless-2.858
wcs-2.781
@#&-2.720
etheless-2.591
racuse-2.518
Token Guard
Token ID4932
Feature activation+8.90e-03

Pos logit contributions
 oppos+0.749
 domain+0.720
amus+0.697
 negoti+0.673
 dur+0.669
Neg logit contributions
theless-0.829
wcs-0.811
@#&-0.789
etheless-0.737
racuse-0.723
Token )
Token ID1267
Feature activation-4.61e-01

Pos logit contributions
theless+0.022
wcs+0.022
@#&+0.021
etheless+0.020
racuse+0.019
Neg logit contributions
 oppos-0.018
 domain-0.017
amus-0.017
 negoti-0.016
 dur-0.016
Token ->
Token ID4613
Feature activation-2.20e-01

Pos logit contributions
 oppos+1.258
 domain+1.226
amus+1.181
 dur+1.158
 negoti+1.152
Neg logit contributions
theless-0.828
wcs-0.797
@#&-0.771
etheless-0.712
racuse-0.678
Token Option
Token ID16018
Feature activation+2.51e-01

Pos logit contributions
 oppos+0.425
 domain+0.406
amus+0.390
 negoti+0.376
 dur+0.374
Neg logit contributions
theless-0.564
wcs-0.552
@#&-0.539
etheless-0.506
 RN-0.496
Token,
Token ID11
Feature activation-2.20e-02

Pos logit contributions
theless+0.334
wcs+0.327
@#&+0.315
etheless+0.294
 RN+0.288
Neg logit contributions
 oppos-0.317
 domain-0.307
amus-0.298
 dur-0.285
 negoti-0.283
Token features
Token ID3033
Feature activation-1.64e-01

Pos logit contributions
 oppos+0.046
 domain+0.044
amus+0.043
 propagate+0.041
 dur+0.041
Neg logit contributions
theless-0.053
wcs-0.051
@#&-0.050
etheless-0.046
 RN-0.046
Token a
Token ID257
Feature activation-7.58e-02

Pos logit contributions
 oppos+0.307
 domain+0.295
amus+0.287
 negoti+0.273
 propagate+0.271
Neg logit contributions
theless-0.426
wcs-0.418
@#&-0.406
etheless-0.380
 RN-0.376
Token full
Token ID1336
Feature activation-3.40e-01

Pos logit contributions
 oppos+0.150
 domain+0.143
amus+0.140
 negoti+0.133
 propagate+0.132
Neg logit contributions
theless-0.190
wcs-0.186
@#&-0.182
etheless-0.169
 RN-0.167
Token Shim
Token ID31698
Feature activation-6.90e-01

Pos logit contributions
 oppos+0.664
 domain+0.638
amus+0.613
 dur+0.587
 negoti+0.587
Neg logit contributions
theless-0.871
wcs-0.850
@#&-0.831
etheless-0.782
racuse-0.761
Tokenano
Token ID5733
Feature activation-1.02e+00

Pos logit contributions
 oppos+1.645
 domain+1.599
amus+1.533
 dur+1.494
 negoti+1.488
Neg logit contributions
theless-1.482
wcs-1.434
@#&-1.394
etheless-1.306
racuse-1.255
Token 600
Token ID10053
Feature activation-2.59e-01

Pos logit contributions
 oppos+2.068
 domain+1.995
amus+1.890
 dur+1.850
 negoti+1.833
Neg logit contributions
theless-2.559
wcs-2.492
@#&-2.431
etheless-2.301
racuse-2.232
Token tri
Token ID1333
Feature activation-1.43e-02

Pos logit contributions
 oppos+0.525
 domain+0.504
amus+0.484
 negoti+0.465
 dur+0.465
Neg logit contributions
theless-0.642
wcs-0.629
@#&-0.612
etheless-0.574
racuse-0.559
Token-
Token ID12
Feature activation-1.62e-01

Pos logit contributions
 oppos+0.028
 domain+0.027
amus+0.026
 dur+0.025
 negoti+0.025
Neg logit contributions
theless-0.036
wcs-0.036
@#&-0.035
etheless-0.033
racuse-0.032
Tokencolor
Token ID8043
Feature activation+1.58e-01

Pos logit contributions
 oppos+0.366
 domain+0.353
amus+0.339
 dur+0.330
 negoti+0.328
Neg logit contributions
theless-0.367
wcs-0.357
@#&-0.348
etheless-0.326
racuse-0.315
Token group
Token ID1448
Feature activation-4.29e-01

Pos logit contributions
theless+0.423
wcs+0.414
@#&+0.403
etheless+0.378
 RN+0.377
Neg logit contributions
 oppos-0.286
 domain-0.270
amus-0.264
 negoti-0.248
 propagate-0.247
Token scept
Token ID30186
Feature activation+1.76e-01

Pos logit contributions
 oppos+0.449
 domain+0.433
amus+0.423
 propagate+0.398
 dur+0.396
Neg logit contributions
theless-0.706
wcs-0.686
@#&-0.668
etheless-0.632
 RN-0.625
Tokenicism
Token ID11965
Feature activation+5.17e-03

Pos logit contributions
theless+0.459
wcs+0.452
@#&+0.440
etheless+0.409
racuse+0.404
Neg logit contributions
 oppos-0.336
 domain-0.322
amus-0.303
 dur-0.296
 negoti-0.295
Token,
Token ID11
Feature activation-7.94e-02

Pos logit contributions
theless+0.013
wcs+0.012
@#&+0.012
etheless+0.011
racuse+0.011
Neg logit contributions
 oppos-0.011
 domain-0.010
amus-0.010
 negoti-0.010
 dur-0.010
Token the
Token ID262
Feature activation+1.49e-02

Pos logit contributions
 oppos+0.176
 domain+0.170
amus+0.165
 dur+0.158
 negoti+0.158
Neg logit contributions
theless-0.182
wcs-0.177
@#&-0.172
etheless-0.161
 RN-0.156
Token C
Token ID327
Feature activation-1.50e-01

Pos logit contributions
theless+0.040
wcs+0.039
@#&+0.038
etheless+0.036
 RN+0.036
Neg logit contributions
 oppos-0.027
 domain-0.025
amus-0.025
 propagate-0.023
 negoti-0.023
TokenIP
Token ID4061
Feature activation-1.02e+00

Pos logit contributions
 oppos+0.324
 domain+0.311
amus+0.299
 dur+0.290
 negoti+0.289
Neg logit contributions
theless-0.349
wcs-0.340
@#&-0.332
etheless-0.309
 RN-0.303
TokenD
Token ID35
Feature activation-5.34e-01

Pos logit contributions
 oppos+2.157
 domain+2.083
amus+1.987
 dur+1.929
 negoti+1.929
Neg logit contributions
theless-2.443
wcs-2.387
@#&-2.320
etheless-2.183
racuse-2.122
Token warned
Token ID7728
Feature activation+1.19e-01

Pos logit contributions
 oppos+1.155
 domain+1.113
amus+1.070
 dur+1.036
 negoti+1.031
Neg logit contributions
theless-1.256
wcs-1.221
@#&-1.190
etheless-1.118
racuse-1.082
Token that
Token ID326
Feature activation-1.10e-01

Pos logit contributions
theless+0.290
wcs+0.283
@#&+0.274
etheless+0.260
 RN+0.253
Neg logit contributions
 oppos-0.246
 domain-0.236
amus-0.229
 dur-0.220
 negoti-0.219
Token 2008
Token ID3648
Feature activation+6.21e-02

Pos logit contributions
 oppos+0.223
 domain+0.214
amus+0.208
 dur+0.199
 negoti+0.198
Neg logit contributions
theless-0.270
wcs-0.264
@#&-0.256
etheless-0.242
 RN-0.236
Token would
Token ID561
Feature activation+1.08e-01

Pos logit contributions
theless+0.149
wcs+0.145
@#&+0.141
etheless+0.133
 RN+0.128
Neg logit contributions
 oppos-0.131
 domain-0.127
amus-0.122
 dur-0.118
 negoti-0.117
Token-
Token ID12
Feature activation+3.83e-02

Pos logit contributions
theless+1.220
wcs+1.193
@#&+1.156
etheless+1.084
 RN+1.057
Neg logit contributions
 oppos-1.057
 domain-1.014
amus-0.970
 dur-0.939
 propagate-0.936
TokenWe
Token ID1135
Feature activation-1.30e-01

Pos logit contributions
theless+0.092
wcs+0.090
@#&+0.088
etheless+0.082
racuse+0.080
Neg logit contributions
 oppos-0.081
 domain-0.078
amus-0.074
 negoti-0.072
 dur-0.072
Tokenber
Token ID527
Feature activation-2.60e-01

Pos logit contributions
 oppos+0.271
 domain+0.261
amus+0.249
 dur+0.242
 negoti+0.241
Neg logit contributions
theless-0.316
wcs-0.308
@#&-0.300
etheless-0.282
racuse-0.274
Token (
Token ID357
Feature activation-1.40e-01

Pos logit contributions
 oppos+0.554
 domain+0.534
amus+0.511
 dur+0.495
 negoti+0.494
Neg logit contributions
theless-0.622
wcs-0.605
@#&-0.590
etheless-0.554
racuse-0.537
TokenUn
Token ID3118
Feature activation-3.02e-01

Pos logit contributions
 oppos+0.305
 domain+0.294
amus+0.282
 dur+0.273
 negoti+0.273
Neg logit contributions
theless-0.328
wcs-0.320
@#&-0.311
etheless-0.292
racuse-0.283
Tokenivers
Token ID1191
Feature activation-1.01e+00

Pos logit contributions
 oppos+0.527
 domain+0.502
amus+0.479
 negoti+0.457
 dur+0.456
Neg logit contributions
theless-0.841
wcs-0.819
@#&-0.804
etheless-0.764
racuse-0.741
Tokenité
Token ID43816
Feature activation+3.88e-01

Pos logit contributions
 oppos+2.565
 domain+2.479
amus+2.406
 dur+2.339
 negoti+2.327
Neg logit contributions
theless-2.032
wcs-1.959
@#&-1.889
etheless-1.774
racuse-1.695
Token de
Token ID390
Feature activation-1.31e-01

Pos logit contributions
theless+1.071
wcs+1.043
@#&+1.027
 RN+0.976
racuse+0.949
Neg logit contributions
 oppos-0.645
 domain-0.611
amus-0.578
 propagate-0.564
 negoti-0.558
Token T
Token ID309
Feature activation+1.11e-01

Pos logit contributions
 oppos+0.292
 domain+0.282
amus+0.272
 negoti+0.263
 dur+0.262
Neg logit contributions
theless-0.300
wcs-0.290
@#&-0.284
etheless-0.264
 RN-0.257
Tokenoul
Token ID2852
Feature activation+3.97e-01

Pos logit contributions
theless+0.257
wcs+0.251
@#&+0.246
etheless+0.227
racuse+0.224
Neg logit contributions
 oppos-0.241
 domain-0.232
amus-0.221
 dur-0.217
 negoti-0.216
Tokenouse
Token ID1076
Feature activation+2.42e-02

Pos logit contributions
theless+0.443
wcs+0.432
@#&+0.396
 RN+0.336
racuse+0.333
Neg logit contributions
 oppos-1.330
 domain-1.297
amus-1.249
 negoti-1.246
 propagate-1.242
Token =
Token ID796
Feature activation+3.42e-01

Pos logit contributions
theless+0.520
wcs+0.511
@#&+0.494
etheless+0.451
racuse+0.438
Neg logit contributions
 oppos-0.680
 domain-0.659
amus-0.636
 dur-0.621
 negoti-0.620
Token ($
Token ID7198
Feature activation-1.67e-01

Pos logit contributions
theless+0.811
wcs+0.795
@#&+0.774
etheless+0.721
 RN+0.704
Neg logit contributions
 oppos-0.729
 domain-0.696
amus-0.680
 dur-0.651
 negoti-0.651
Tokentext
Token ID5239
Feature activation+1.93e-01

Pos logit contributions
 oppos+0.355
 domain+0.343
amus+0.329
 dur+0.318
 negoti+0.317
Neg logit contributions
theless-0.400
wcs-0.389
@#&-0.380
etheless-0.357
racuse-0.346
Token -
Token ID532
Feature activation+7.89e-02

Pos logit contributions
theless+0.447
wcs+0.439
@#&+0.430
etheless+0.398
racuse+0.391
Neg logit contributions
 oppos-0.416
 domain-0.400
amus-0.385
 dur-0.373
 negoti-0.373
Tokensplit
Token ID35312
Feature activation+9.02e-01

Pos logit contributions
theless+0.159
wcs+0.155
@#&+0.150
etheless+0.139
racuse+0.134
Neg logit contributions
 oppos-0.197
 domain-0.191
amus-0.184
 dur-0.179
 negoti-0.179
Token "\
Token ID37082
Feature activation-1.01e+00

Pos logit contributions
theless+2.041
wcs+2.016
@#&+1.986
etheless+1.825
 RN+1.798
Neg logit contributions
 oppos-1.970
 domain-1.879
amus-1.840
 negoti-1.775
 dur-1.772
Tokenr
Token ID81
Feature activation-1.92e-01

Pos logit contributions
 oppos+2.159
 domain+2.086
amus+1.994
 dur+1.943
 negoti+1.904
Neg logit contributions
theless-2.424
wcs-2.320
@#&-2.258
etheless-2.178
racuse-2.097
Token
Token ID198
Feature activation-1.81e-01

Pos logit contributions
 oppos+0.478
 domain+0.463
amus+0.447
 dur+0.435
 negoti+0.432
Neg logit contributions
theless-0.390
wcs-0.378
@#&-0.367
etheless-0.343
racuse-0.329
Token
Token ID198
Feature activation-1.68e-01

Pos logit contributions
 oppos+0.467
 domain+0.453
amus+0.438
 dur+0.427
 negoti+0.424
Neg logit contributions
theless-0.354
wcs-0.343
@#&-0.331
etheless-0.309
racuse-0.295
Token")
Token ID4943
Feature activation+1.98e-04

Pos logit contributions
 oppos+0.358
 domain+0.345
amus+0.330
 dur+0.320
 negoti+0.320
Neg logit contributions
theless-0.402
wcs-0.392
@#&-0.382
etheless-0.358
racuse-0.348
Token $
Token ID720
Feature activation+8.66e-02

Pos logit contributions
theless+0.000
wcs+0.000
@#&+0.000
etheless+0.000
racuse+0.000
Neg logit contributions
 oppos-0.000
 domain-0.000
amus-0.000
 negoti-0.000
 dur-0.000
Token path
Token ID3108
Feature activation+1.08e-02

Pos logit contributions
 oppos+0.104
 domain+0.099
amus+0.096
 negoti+0.092
 dur+0.092
Neg logit contributions
theless-0.123
wcs-0.121
@#&-0.118
etheless-0.110
racuse-0.107
Token,
Token ID837
Feature activation-1.62e-01

Pos logit contributions
theless+0.025
wcs+0.025
@#&+0.025
 RN+0.022
etheless+0.022
Neg logit contributions
 oppos-0.023
 domain-0.022
amus-0.021
 negoti-0.021
 dur-0.020
Token 5
Token ID642
Feature activation-4.10e-01

Pos logit contributions
 oppos+0.369
 domain+0.357
amus+0.342
 dur+0.333
 negoti+0.331
Neg logit contributions
theless-0.364
wcs-0.354
@#&-0.344
etheless-0.323
racuse-0.312
Token )
Token ID1267
Feature activation-4.19e-01

Pos logit contributions
 oppos+0.672
 domain+0.640
amus+0.603
 dur+0.580
 negoti+0.572
Neg logit contributions
theless-1.187
wcs-1.155
@#&-1.133
etheless-1.083
racuse-1.050
Token =
Token ID796
Feature activation+4.52e-02

Pos logit contributions
 oppos+1.029
 domain+1.000
amus+0.956
 dur+0.937
 negoti+0.928
Neg logit contributions
theless-0.873
wcs-0.836
@#&-0.811
etheless-0.767
racuse-0.731
Token '.
Token ID45302
Feature activation-1.01e+00

Pos logit contributions
theless+0.129
wcs+0.126
@#&+0.124
etheless+0.117
 RN+0.115
Neg logit contributions
 oppos-0.074
 domain-0.070
amus-0.068
 negoti-0.064
 dur-0.064
Tokencl
Token ID565
Feature activation+2.56e-01

Pos logit contributions
 oppos+1.797
 domain+1.733
amus+1.651
 dur+1.576
 negoti+1.562
Neg logit contributions
theless-2.740
wcs-2.659
@#&-2.613
etheless-2.483
racuse-2.420
Tokenjs
Token ID8457
Feature activation-1.10e-01

Pos logit contributions
theless+0.518
wcs+0.505
@#&+0.489
etheless+0.452
racuse+0.436
Neg logit contributions
 oppos-0.638
 domain-0.616
amus-0.595
 dur-0.581
 negoti-0.579
Token'
Token ID6
Feature activation-1.23e-01

Pos logit contributions
 oppos+0.221
 domain+0.214
amus+0.204
 dur+0.197
 negoti+0.195
Neg logit contributions
theless-0.279
wcs-0.271
@#&-0.265
etheless-0.251
racuse-0.242
Token OR
Token ID6375
Feature activation+2.87e-01

Pos logit contributions
 oppos+0.324
 domain+0.313
amus+0.306
 negoti+0.297
 dur+0.296
Neg logit contributions
theless-0.230
wcs-0.224
@#&-0.218
etheless-0.198
 RN-0.192
Token RIGHT
Token ID33621
Feature activation+1.75e-01

Pos logit contributions
theless+0.683
wcs+0.668
@#&+0.655
etheless+0.609
racuse+0.592
Neg logit contributions
 oppos-0.610
 domain-0.588
amus-0.569
 negoti-0.549
 dur-0.546
Token a
Token ID257
Feature activation+1.99e-02

Pos logit contributions
 oppos+0.078
 domain+0.075
amus+0.072
 dur+0.069
 negoti+0.069
Neg logit contributions
theless-0.113
wcs-0.111
@#&-0.108
etheless-0.103
racuse-0.100
Token law
Token ID1099
Feature activation+9.69e-02

Pos logit contributions
theless+0.046
wcs+0.045
@#&+0.044
etheless+0.041
racuse+0.040
Neg logit contributions
 oppos-0.044
 domain-0.042
amus-0.041
 negoti-0.039
 dur-0.039
Token that
Token ID326
Feature activation-6.27e-02

Pos logit contributions
theless+0.244
wcs+0.238
@#&+0.232
etheless+0.219
racuse+0.213
Neg logit contributions
 oppos-0.192
 domain-0.185
amus-0.179
 negoti-0.170
 dur-0.170
Token will
Token ID481
Feature activation+4.68e-02

Pos logit contributions
 oppos+0.128
 domain+0.123
amus+0.120
 dur+0.113
 propagate+0.113
Neg logit contributions
theless-0.154
wcs-0.149
@#&-0.145
etheless-0.136
 RN-0.133
Token ann
Token ID1529
Feature activation+3.40e-01

Pos logit contributions
theless+0.120
wcs+0.118
@#&+0.116
etheless+0.108
 RN+0.106
Neg logit contributions
 oppos-0.088
 domain-0.084
amus-0.082
 negoti-0.078
 dur-0.076
Tokenul
Token ID377
Feature activation-1.00e+00

Pos logit contributions
theless+0.553
wcs+0.541
@#&+0.524
etheless+0.477
racuse+0.460
Neg logit contributions
 oppos-0.977
 domain-0.941
amus-0.903
 negoti-0.902
 dur-0.886
Token all
Token ID477
Feature activation-7.74e-04

Pos logit contributions
 oppos+1.914
 domain+1.837
amus+1.729
 dur+1.696
 negoti+1.682
Neg logit contributions
theless-2.628
wcs-2.573
@#&-2.504
etheless-2.362
racuse-2.318
Token legal
Token ID2742
Feature activation+6.64e-02

Pos logit contributions
 oppos+0.001
 domain+0.001
amus+0.001
 propagate+0.001
 dur+0.001
Neg logit contributions
theless-0.002
wcs-0.002
@#&-0.002
etheless-0.002
 RN-0.002
Token claims
Token ID3667
Feature activation+1.04e-01

Pos logit contributions
theless+0.166
wcs+0.162
@#&+0.158
etheless+0.148
 RN+0.145
Neg logit contributions
 oppos-0.128
 domain-0.123
amus-0.122
 propagate-0.116
 negoti-0.115
Token against
Token ID1028
Feature activation-3.35e-02

Pos logit contributions
theless+0.255
wcs+0.248
@#&+0.243
etheless+0.228
racuse+0.220
Neg logit contributions
 oppos-0.215
 domain-0.207
amus-0.199
 negoti-0.191
 dur-0.191
Token Israeli
Token ID6085
Feature activation+1.09e-01

Pos logit contributions
 oppos+0.067
 domain+0.065
amus+0.062
 dur+0.060
 negoti+0.060
Neg logit contributions
theless-0.084
wcs-0.082
@#&-0.080
etheless-0.075
racuse-0.073
Token.
Token ID13
Feature activation-1.00e-01

Pos logit contributions
 oppos+0.292
 domain+0.279
amus+0.265
 dur+0.254
 negoti+0.254
Neg logit contributions
theless-0.447
wcs-0.437
@#&-0.427
etheless-0.405
racuse-0.394
Token
Token ID198
Feature activation-8.78e-02

Pos logit contributions
 oppos+0.228
 domain+0.220
amus+0.212
 negoti+0.208
 dur+0.205
Neg logit contributions
theless-0.225
wcs-0.216
@#&-0.214
etheless-0.197
 RN-0.194
Token
Token ID198
Feature activation-7.82e-02

Pos logit contributions
 oppos+0.231
 domain+0.224
amus+0.217
 negoti+0.212
 dur+0.211
Neg logit contributions
theless-0.165
wcs-0.160
@#&-0.155
etheless-0.140
racuse-0.137
TokenIn
Token ID818
Feature activation+1.73e-02

Pos logit contributions
 oppos+0.159
 domain+0.153
amus+0.146
 negoti+0.144
 dur+0.142
Neg logit contributions
theless-0.194
wcs-0.189
@#&-0.186
etheless-0.171
 RN-0.168
Token the
Token ID262
Feature activation+7.46e-02

Pos logit contributions
theless+0.042
wcs+0.041
@#&+0.040
etheless+0.037
 RN+0.036
Neg logit contributions
 oppos-0.036
 domain-0.035
amus-0.033
 negoti-0.032
 dur-0.032
Token midst
Token ID15925
Feature activation-1.00e+00

Pos logit contributions
theless+0.177
wcs+0.171
@#&+0.167
etheless+0.156
 RN+0.154
Neg logit contributions
 oppos-0.158
 domain-0.151
amus-0.148
 negoti-0.143
 propagate-0.142
Token of
Token ID286
Feature activation-1.21e-01

Pos logit contributions
 oppos+1.858
 domain+1.783
amus+1.692
 dur+1.635
 negoti+1.606
Neg logit contributions
theless-2.678
wcs-2.606
@#&-2.544
etheless-2.436
racuse-2.355
Token this
Token ID428
Feature activation-1.02e-01

Pos logit contributions
 oppos+0.246
 domain+0.239
amus+0.232
 negoti+0.222
 dur+0.221
Neg logit contributions
theless-0.298
wcs-0.292
@#&-0.283
etheless-0.266
 RN-0.260
Token theoretical
Token ID16200
Feature activation+1.47e-01

Pos logit contributions
 oppos+0.168
 domain+0.161
amus+0.155
 dur+0.147
 negoti+0.147
Neg logit contributions
theless-0.288
wcs-0.282
@#&-0.275
etheless-0.260
 RN-0.255
Token debate
Token ID4384
Feature activation+1.31e-01

Pos logit contributions
theless+0.393
wcs+0.384
@#&+0.374
etheless+0.351
 RN+0.344
Neg logit contributions
 oppos-0.263
 domain-0.253
amus-0.246
 negoti-0.235
 dur-0.234
Token,
Token ID11
Feature activation-2.18e-01

Pos logit contributions
theless+0.303
wcs+0.296
@#&+0.288
etheless+0.269
 RN+0.260
Neg logit contributions
 oppos-0.286
 domain-0.277
amus-0.267
 negoti-0.258
 dur-0.257
Token first
Token ID717
Feature activation+4.41e-01

Pos logit contributions
theless+0.287
wcs+0.274
@#&+0.270
etheless+0.251
racuse+0.244
Neg logit contributions
 oppos-0.364
 domain-0.346
amus-0.337
 negoti-0.329
 dur-0.328
Token to
Token ID284
Feature activation+1.38e-01

Pos logit contributions
theless+0.979
wcs+0.977
@#&+0.937
 RN+0.870
etheless+0.870
Neg logit contributions
 oppos-0.978
 domain-0.919
amus-0.894
 negoti-0.876
 Domain-0.859
Token hear
Token ID3285
Feature activation+4.72e-02

Pos logit contributions
theless+0.366
wcs+0.360
@#&+0.351
etheless+0.330
racuse+0.325
Neg logit contributions
 oppos-0.254
 domain-0.240
amus-0.230
 negoti-0.221
 dur-0.221
Token about
Token ID546
Feature activation+1.50e-01

Pos logit contributions
theless+0.118
wcs+0.116
@#&+0.113
 RN+0.104
etheless+0.103
Neg logit contributions
 oppos-0.094
 domain-0.090
amus-0.087
 negoti-0.083
 trap-0.083
Token Nation
Token ID8741
Feature activation+2.36e-01

Pos logit contributions
theless+0.328
wcs+0.316
@#&+0.306
etheless+0.285
 RN+0.282
Neg logit contributions
 oppos-0.348
 domain-0.336
amus-0.326
 negoti-0.318
 dur-0.315
Token Travels
Token ID48899
Feature activation-9.81e-01

Pos logit contributions
theless+0.867
wcs+0.859
@#&+0.828
etheless+0.797
 RN+0.796
Neg logit contributions
 oppos-0.199
 domain-0.174
amus-0.160
 dur-0.145
 negoti-0.144
Token destinations
Token ID23982
Feature activation+2.89e-01

Pos logit contributions
 oppos+1.188
 domain+1.100
amus+1.000
 negoti+0.953
 propagate+0.945
Neg logit contributions
theless-3.289
wcs-3.212
@#&-3.196
etheless-3.041
racuse-2.985
Token,
Token ID11
Feature activation-1.80e-01

Pos logit contributions
theless+0.628
wcs+0.612
@#&+0.591
etheless+0.549
racuse+0.537
Neg logit contributions
 oppos-0.667
 domain-0.647
amus-0.623
 negoti-0.609
 dur-0.605
Token and
Token ID290
Feature activation+4.24e-02

Pos logit contributions
 oppos+0.437
 domain+0.423
amus+0.417
 dur+0.400
 negoti+0.394
Neg logit contributions
theless-0.352
wcs-0.350
@#&-0.339
 RN-0.313
etheless-0.303
Token explore
Token ID7301
Feature activation+8.16e-02

Pos logit contributions
theless+0.124
wcs+0.122
@#&+0.121
etheless+0.113
 RN+0.112
Neg logit contributions
 oppos-0.065
 domain-0.062
amus-0.059
 dur-0.056
 negoti-0.054
Token the
Token ID262
Feature activation+2.35e-02

Pos logit contributions
theless+0.204
wcs+0.199
@#&+0.195
etheless+0.183
racuse+0.179
Neg logit contributions
 oppos-0.164
 domain-0.158
amus-0.151
 dur-0.147
 negoti-0.146
TokenF
Token ID37
Feature activation-1.71e-01

Pos logit contributions
 oppos+0.076
 domain+0.072
amus+0.070
 negoti+0.069
 dur+0.068
Neg logit contributions
theless-0.064
wcs-0.063
@#&-0.061
 RN-0.055
etheless-0.055
Token1
Token ID16
Feature activation-2.01e-01

Pos logit contributions
 oppos+0.225
 domain+0.208
amus+0.199
 negoti+0.188
 dur+0.186
Neg logit contributions
theless-0.542
wcs-0.534
@#&-0.524
etheless-0.499
 RN-0.492
Token receptor
Token ID17408
Feature activation-2.92e-01

Pos logit contributions
 oppos+0.294
 domain+0.275
amus+0.262
 negoti+0.250
 dur+0.248
Neg logit contributions
theless-0.611
wcs-0.599
@#&-0.587
etheless-0.559
 RN-0.548
Token antagonist
Token ID34667
Feature activation-3.53e-01

Pos logit contributions
 oppos+0.323
 domain+0.294
amus+0.280
 negoti+0.259
 dur+0.258
Neg logit contributions
theless-0.988
wcs-0.972
@#&-0.956
etheless-0.912
 RN-0.900
Token.
Token ID13
Feature activation-1.14e-01

Pos logit contributions
 oppos+0.739
 domain+0.711
amus+0.682
 dur+0.660
 negoti+0.658
Neg logit contributions
theless-0.854
wcs-0.833
@#&-0.812
etheless-0.763
racuse-0.739
Token Oral
Token ID42222
Feature activation-9.73e-01

Pos logit contributions
 oppos+0.235
 domain+0.226
amus+0.217
 negoti+0.211
 dur+0.209
Neg logit contributions
theless-0.282
wcs-0.274
@#&-0.269
etheless-0.250
 RN-0.246
Token administration
Token ID3662
Feature activation+8.79e-02

Pos logit contributions
 oppos+1.858
 domain+1.803
amus+1.699
 dur+1.655
 negoti+1.631
Neg logit contributions
theless-2.533
wcs-2.480
@#&-2.430
etheless-2.294
racuse-2.243
Token of
Token ID286
Feature activation-1.22e-01

Pos logit contributions
theless+0.223
wcs+0.218
@#&+0.212
etheless+0.200
racuse+0.195
Neg logit contributions
 oppos-0.174
 domain-0.167
amus-0.160
 negoti-0.154
 dur-0.154
Token a
Token ID257
Feature activation-1.54e-01

Pos logit contributions
 oppos+0.264
 domain+0.254
amus+0.245
 negoti+0.237
 dur+0.235
Neg logit contributions
theless-0.289
wcs-0.280
@#&-0.274
etheless-0.256
 RN-0.250
Token single
Token ID2060
Feature activation-3.94e-01

Pos logit contributions
 oppos+0.325
 domain+0.311
amus+0.300
 negoti+0.290
 dur+0.289
Neg logit contributions
theless-0.371
wcs-0.361
@#&-0.352
etheless-0.330
 RN-0.320
Token dose
Token ID10742
Feature activation-1.24e-01

Pos logit contributions
 oppos+0.778
 domain+0.742
amus+0.714
 negoti+0.689
 dur+0.687
Neg logit contributions
theless-1.002
wcs-0.977
@#&-0.954
etheless-0.897
 RN-0.874
Token camera
Token ID4676
Feature activation-2.88e-01

Pos logit contributions
 oppos+0.133
 domain+0.128
amus+0.124
 negoti+0.119
 dur+0.119
Neg logit contributions
theless-0.164
wcs-0.160
@#&-0.156
etheless-0.147
 RN-0.144
Token has
Token ID468
Feature activation+3.13e-02

Pos logit contributions
 oppos+0.588
 domain+0.564
amus+0.544
 dur+0.525
 negoti+0.519
Neg logit contributions
theless-0.711
wcs-0.695
@#&-0.680
etheless-0.635
 RN-0.618
Token an
Token ID281
Feature activation-4.89e-02

Pos logit contributions
theless+0.079
wcs+0.078
@#&+0.076
etheless+0.071
 RN+0.069
Neg logit contributions
 oppos-0.061
 domain-0.059
amus-0.057
 negoti-0.055
 dur-0.055
Token exposure
Token ID7111
Feature activation-5.07e-01

Pos logit contributions
 oppos+0.093
 domain+0.090
amus+0.087
 negoti+0.083
 dur+0.083
Neg logit contributions
theless-0.126
wcs-0.123
@#&-0.121
etheless-0.113
 RN-0.111
Token z
Token ID1976
Feature activation+3.54e-02

Pos logit contributions
 oppos+1.050
 domain+1.005
amus+0.978
 dur+0.936
 negoti+0.933
Neg logit contributions
theless-1.233
wcs-1.202
@#&-1.178
etheless-1.099
racuse-1.071
Tokenebra
Token ID37052
Feature activation-9.71e-01

Pos logit contributions
theless+0.083
wcs+0.081
@#&+0.080
etheless+0.074
racuse+0.072
Neg logit contributions
 oppos-0.076
 domain-0.073
amus-0.070
 dur-0.068
 negoti-0.068
Token with
Token ID351
Feature activation-7.42e-02

Pos logit contributions
 oppos+2.038
 domain+1.963
amus+1.881
 dur+1.821
 negoti+1.814
Neg logit contributions
theless-2.351
wcs-2.291
@#&-2.234
etheless-2.100
racuse-2.038
Token adjustable
Token ID28138
Feature activation+2.46e-01

Pos logit contributions
 oppos+0.145
 domain+0.139
amus+0.135
 negoti+0.130
 dur+0.129
Neg logit contributions
theless-0.188
wcs-0.184
@#&-0.180
etheless-0.169
 RN-0.165
Token levels
Token ID2974
Feature activation+3.39e-02

Pos logit contributions
theless+0.598
wcs+0.583
@#&+0.563
etheless+0.532
 RN+0.521
Neg logit contributions
 oppos-0.500
 domain-0.481
amus-0.473
 negoti-0.450
 dur-0.444
Token)
Token ID8
Feature activation-4.01e-02

Pos logit contributions
theless+0.084
wcs+0.082
@#&+0.080
etheless+0.075
racuse+0.073
Neg logit contributions
 oppos-0.069
 domain-0.067
amus-0.064
 negoti-0.061
 dur-0.061
Token but
Token ID475
Feature activation-5.20e-02

Pos logit contributions
 oppos+0.087
 domain+0.084
amus+0.082
 negoti+0.079
 dur+0.078
Neg logit contributions
theless-0.093
wcs-0.091
@#&-0.089
etheless-0.083
 RN-0.080
Token at
Token ID379
Feature activation+3.62e-02

Pos logit contributions
theless+1.059
wcs+1.036
@#&+1.002
etheless+0.944
 RN+0.923
Neg logit contributions
 oppos-0.853
 domain-0.823
amus-0.794
 negoti-0.761
 dur-0.751
Token town
Token ID3240
Feature activation+1.81e-01

Pos logit contributions
theless+0.090
wcs+0.087
@#&+0.086
etheless+0.080
 RN+0.079
Neg logit contributions
 oppos-0.073
 domain-0.070
amus-0.068
 negoti-0.066
 propagate-0.065
Token meetings
Token ID8292
Feature activation+1.96e-01

Pos logit contributions
theless+0.432
wcs+0.421
@#&+0.413
etheless+0.380
racuse+0.372
Neg logit contributions
 oppos-0.381
 domain-0.368
amus-0.358
 negoti-0.340
 dur-0.340
Token in
Token ID287
Feature activation+1.10e-01

Pos logit contributions
theless+0.473
wcs+0.462
@#&+0.448
etheless+0.421
 RN+0.408
Neg logit contributions
 oppos-0.407
 domain-0.394
amus-0.379
 negoti-0.363
 dur-0.363
Token New
Token ID968
Feature activation-1.58e-01

Pos logit contributions
theless+0.261
wcs+0.254
@#&+0.248
etheless+0.231
 RN+0.227
Neg logit contributions
 oppos-0.232
 domain-0.224
amus-0.217
 negoti-0.208
 dur-0.207
Token Hampshire
Token ID13910
Feature activation+5.37e-01

Pos logit contributions
 oppos+0.447
 domain+0.434
amus+0.421
 dur+0.411
 negoti+0.410
Neg logit contributions
theless-0.272
wcs-0.260
@#&-0.251
etheless-0.231
racuse-0.216
Token and
Token ID290
Feature activation+1.27e-01

Pos logit contributions
theless+1.270
wcs+1.236
@#&+1.195
etheless+1.114
 RN+1.100
Neg logit contributions
 oppos-1.135
 domain-1.099
amus-1.061
 negoti-1.022
 dur-1.011
Token Iowa
Token ID9406
Feature activation+3.63e-01

Pos logit contributions
theless+0.294
wcs+0.285
@#&+0.276
etheless+0.258
 RN+0.254
Neg logit contributions
 oppos-0.276
 domain-0.267
amus-0.261
 propagate-0.248
 negoti-0.248
Token are
Token ID389
Feature activation+1.02e-01

Pos logit contributions
theless+0.872
wcs+0.848
@#&+0.824
etheless+0.771
 RN+0.750
Neg logit contributions
 oppos-0.757
 domain-0.734
amus-0.708
 negoti-0.680
 dur-0.674
Token sending
Token ID7216
Feature activation-2.19e-01

Pos logit contributions
theless+0.264
wcs+0.258
@#&+0.247
etheless+0.235
 RN+0.232
Neg logit contributions
 oppos-0.190
 domain-0.186
amus-0.181
 negoti-0.172
 propagate-0.170
Token a
Token ID257
Feature activation+5.15e-02

Pos logit contributions
 oppos+0.408
 domain+0.393
amus+0.378
 negoti+0.365
 dur+0.361
Neg logit contributions
theless-0.573
wcs-0.567
@#&-0.549
etheless-0.515
 RN-0.509
Token5
Token ID20
Feature activation-1.67e-01

Pos logit contributions
theless+0.209
wcs+0.203
@#&+0.198
etheless+0.186
racuse+0.180
Neg logit contributions
 oppos-0.189
 domain-0.182
amus-0.174
 dur-0.169
 negoti-0.169
Token kg
Token ID14211
Feature activation+1.42e-01

Pos logit contributions
 oppos+0.381
 domain+0.368
amus+0.355
 negoti+0.344
 dur+0.344
Neg logit contributions
theless-0.371
wcs-0.361
@#&-0.351
etheless-0.326
racuse-0.317
Token),
Token ID828
Feature activation+1.47e-01

Pos logit contributions
theless+0.347
wcs+0.339
@#&+0.329
etheless+0.309
racuse+0.302
Neg logit contributions
 oppos-0.293
 domain-0.282
amus-0.271
 negoti-0.262
 dur-0.260
Token lighter
Token ID14871
Feature activation+1.45e-01

Pos logit contributions
theless+0.347
wcs+0.337
@#&+0.322
etheless+0.304
 RN+0.297
Neg logit contributions
 oppos-0.312
 domain-0.303
amus-0.296
 negoti-0.283
 dur-0.282
Token and
Token ID290
Feature activation-1.12e-02

Pos logit contributions
theless+0.292
wcs+0.283
@#&+0.272
etheless+0.254
racuse+0.244
Neg logit contributions
 oppos-0.360
 domain-0.348
amus-0.338
 negoti-0.328
 dur-0.327
Token grey
Token ID13791
Feature activation+5.31e-01

Pos logit contributions
 oppos+0.022
 domain+0.022
amus+0.021
 negoti+0.020
 dur+0.020
Neg logit contributions
theless-0.028
wcs-0.027
@#&-0.026
etheless-0.024
 RN-0.024
Tokener
Token ID263
Feature activation+1.57e-01

Pos logit contributions
theless+1.224
wcs+1.203
@#&+1.139
etheless+1.086
 RN+1.045
Neg logit contributions
 oppos-1.167
 domain-1.122
amus-1.087
 negoti-1.056
 dur-1.040
Token,
Token ID11
Feature activation+1.73e-01

Pos logit contributions
theless+0.366
wcs+0.358
@#&+0.343
etheless+0.327
 RN+0.314
Neg logit contributions
 oppos-0.341
 domain-0.328
amus-0.318
 negoti-0.308
 dur-0.305
Token with
Token ID351
Feature activation+7.15e-03

Pos logit contributions
theless+0.397
wcs+0.387
@#&+0.368
etheless+0.349
 RN+0.338
Neg logit contributions
 oppos-0.379
 domain-0.368
amus-0.361
 negoti-0.345
 dur-0.343
Token much
Token ID881
Feature activation+6.62e-02

Pos logit contributions
theless+0.018
wcs+0.018
@#&+0.017
etheless+0.016
 RN+0.016
Neg logit contributions
 oppos-0.013
 domain-0.013
amus-0.013
 negoti-0.012
 propagate-0.012
Token less
Token ID1342
Feature activation-5.83e-02

Pos logit contributions
theless+0.161
wcs+0.158
@#&+0.150
etheless+0.143
 RN+0.140
Neg logit contributions
 oppos-0.135
 domain-0.131
amus-0.127
 negoti-0.122
 propagate-0.121
Token wife
Token ID3656
Feature activation+3.48e-02

Pos logit contributions
theless+0.255
wcs+0.247
@#&+0.242
etheless+0.229
 RN+0.223
Neg logit contributions
 oppos-0.200
 domain-0.191
amus-0.186
 negoti-0.180
 propagate-0.178
Token thinks
Token ID6834
Feature activation+2.97e-02

Pos logit contributions
theless+0.083
wcs+0.080
@#&+0.078
etheless+0.073
 RN+0.071
Neg logit contributions
 oppos-0.074
 domain-0.072
amus-0.069
 dur-0.067
 propagate-0.066
Token that
Token ID326
Feature activation+1.22e-01

Pos logit contributions
theless+0.069
wcs+0.068
@#&+0.066
etheless+0.062
racuse+0.060
Neg logit contributions
 oppos-0.064
 domain-0.062
amus-0.060
 dur-0.058
 negoti-0.058
Token the
Token ID262
Feature activation+2.73e-02

Pos logit contributions
theless+0.290
wcs+0.283
@#&+0.275
etheless+0.258
 RN+0.251
Neg logit contributions
 oppos-0.257
 domain-0.249
amus-0.242
 dur-0.231
 negoti-0.230
Token purple
Token ID14032
Feature activation+3.31e-01

Pos logit contributions
theless+0.074
wcs+0.073
@#&+0.071
etheless+0.067
 RN+0.066
Neg logit contributions
 oppos-0.048
 domain-0.046
amus-0.045
 negoti-0.042
 propagate-0.042
Token colour
Token ID9568
Feature activation+6.76e-01

Pos logit contributions
theless+0.812
wcs+0.794
@#&+0.773
etheless+0.726
 RN+0.712
Neg logit contributions
 oppos-0.674
 domain-0.640
amus-0.626
 negoti-0.596
 propagate-0.592
Token is
Token ID318
Feature activation+1.03e-01

Pos logit contributions
theless+1.652
wcs+1.627
@#&+1.564
etheless+1.471
 RN+1.447
Neg logit contributions
 oppos-1.361
 domain-1.302
amus-1.271
 dur-1.206
 negoti-1.205
Token really
Token ID1107
Feature activation+9.51e-02

Pos logit contributions
theless+0.258
wcs+0.255
@#&+0.247
etheless+0.232
 RN+0.228
Neg logit contributions
 oppos-0.200
 domain-0.193
amus-0.187
 propagate-0.179
 negoti-0.179
Token great
Token ID1049
Feature activation+4.38e-02

Pos logit contributions
theless+0.243
wcs+0.240
@#&+0.233
etheless+0.219
 RN+0.215
Neg logit contributions
 oppos-0.180
 domain-0.175
amus-0.170
 negoti-0.162
 propagate-0.161
Token and
Token ID290
Feature activation-7.92e-03

Pos logit contributions
theless+0.109
wcs+0.108
@#&+0.105
etheless+0.099
 RN+0.096
Neg logit contributions
 oppos-0.087
 domain-0.084
amus-0.081
 negoti-0.078
 propagate-0.077
Token I
Token ID314
Feature activation-1.35e-01

Pos logit contributions
 oppos+0.017
 domain+0.016
amus+0.016
 dur+0.015
 propagate+0.015
Neg logit contributions
theless-0.019
wcs-0.018
@#&-0.018
etheless-0.017
 RN-0.016
Token and
Token ID290
Feature activation-4.68e-02

Pos logit contributions
 oppos+0.368
 domain+0.353
amus+0.336
 dur+0.325
 negoti+0.323
Neg logit contributions
theless-0.495
wcs-0.483
@#&-0.472
etheless-0.445
racuse-0.434
Token beyond
Token ID3675
Feature activation-1.24e-01

Pos logit contributions
 oppos+0.084
 domain+0.080
amus+0.077
 dur+0.073
 negoti+0.073
Neg logit contributions
theless-0.127
wcs-0.124
@#&-0.121
etheless-0.115
 RN-0.112
Token.
Token ID13
Feature activation-2.10e-01

Pos logit contributions
 oppos+0.218
 domain+0.209
amus+0.198
 dur+0.191
 negoti+0.190
Neg logit contributions
theless-0.344
wcs-0.336
@#&-0.329
etheless-0.312
racuse-0.304
Token<|endoftext|>
Token ID50256
Feature activation-5.50e-02

Pos logit contributions
 oppos+0.506
 domain+0.490
amus+0.472
 dur+0.459
 negoti+0.456
Neg logit contributions
theless-0.446
wcs-0.433
@#&-0.420
etheless-0.392
racuse-0.379
TokenG
Token ID38
Feature activation-5.24e-02

Pos logit contributions
 oppos+0.115
 domain+0.111
amus+0.106
 negoti+0.103
 dur+0.103
Neg logit contributions
theless-0.133
wcs-0.131
@#&-0.127
etheless-0.119
racuse-0.116
TokenRAY
Token ID30631
Feature activation+6.96e-01

Pos logit contributions
 oppos+0.106
 domain+0.102
amus+0.097
 negoti+0.095
 dur+0.094
Neg logit contributions
theless-0.130
wcs-0.127
@#&-0.124
etheless-0.116
racuse-0.114
TokenSON
Token ID11782
Feature activation-2.18e-02

Pos logit contributions
theless+1.601
wcs+1.531
@#&+1.511
 RN+1.389
etheless+1.378
Neg logit contributions
 oppos-1.530
 domain-1.455
amus-1.418
 negoti-1.369
 propagate-1.365
Token has
Token ID468
Feature activation-3.37e-01

Pos logit contributions
 oppos+0.053
 domain+0.051
amus+0.049
 dur+0.048
 negoti+0.048
Neg logit contributions
theless-0.046
wcs-0.044
@#&-0.043
etheless-0.040
 RN-0.039
Token been
Token ID587
Feature activation-8.04e-02

Pos logit contributions
 oppos+0.731
 domain+0.705
amus+0.678
 dur+0.658
 negoti+0.654
Neg logit contributions
theless-0.787
wcs-0.766
@#&-0.743
etheless-0.698
 RN-0.676
Token building
Token ID2615
Feature activation-1.73e-01

Pos logit contributions
 oppos+0.176
 domain+0.170
amus+0.164
 dur+0.159
 propagate+0.158
Neg logit contributions
theless-0.185
wcs-0.181
@#&-0.176
etheless-0.165
 RN-0.160
Token and
Token ID290
Feature activation-1.54e-01

Pos logit contributions
 oppos+0.325
 domain+0.310
amus+0.301
 negoti+0.288
 dur+0.286
Neg logit contributions
theless-0.452
wcs-0.442
@#&-0.430
etheless-0.404
 RN-0.401
Token responsibility
Token ID5798
Feature activation+4.37e-01

Pos logit contributions
 oppos+0.076
 domain+0.071
amus+0.069
 dur+0.067
 negoti+0.066
Neg logit contributions
theless-0.107
wcs-0.104
@#&-0.102
etheless-0.096
 RN-0.094
Token to
Token ID284
Feature activation-6.01e-02

Pos logit contributions
theless+1.096
wcs+1.073
@#&+1.043
etheless+0.983
racuse+0.952
Neg logit contributions
 oppos-0.875
 domain-0.838
amus-0.803
 dur-0.777
 negoti-0.776
Token liberate
Token ID45521
Feature activation+1.49e-01

Pos logit contributions
 oppos+0.113
 domain+0.109
amus+0.106
 dur+0.101
 negoti+0.101
Neg logit contributions
theless-0.156
wcs-0.151
@#&-0.147
etheless-0.138
 RN-0.137
Token ourselves
Token ID6731
Feature activation+2.30e-01

Pos logit contributions
theless+0.344
wcs+0.333
@#&+0.325
etheless+0.304
racuse+0.294
Neg logit contributions
 oppos-0.327
 domain-0.316
amus-0.306
 dur-0.296
 negoti-0.295
Token and
Token ID290
Feature activation-6.06e-02

Pos logit contributions
theless+0.567
wcs+0.554
@#&+0.538
etheless+0.507
racuse+0.492
Neg logit contributions
 oppos-0.471
 domain-0.454
amus-0.435
 dur-0.420
 negoti-0.420
Token others
Token ID1854
Feature activation+6.03e-01

Pos logit contributions
 oppos+0.124
 domain+0.120
amus+0.116
 dur+0.112
 negoti+0.111
Neg logit contributions
theless-0.148
wcs-0.144
@#&-0.139
etheless-0.131
 RN-0.128
Token from
Token ID422
Feature activation+1.12e-01

Pos logit contributions
theless+1.466
wcs+1.430
@#&+1.385
etheless+1.312
racuse+1.260
Neg logit contributions
 oppos-1.241
 domain-1.200
amus-1.158
 negoti-1.117
 dur-1.111
Token all
Token ID477
Feature activation-1.15e-01

Pos logit contributions
theless+0.297
wcs+0.290
@#&+0.282
etheless+0.268
 RN+0.260
Neg logit contributions
 oppos-0.205
 domain-0.197
amus-0.192
 dur-0.182
 negoti-0.182
Token forms
Token ID5107
Feature activation-6.73e-02

Pos logit contributions
 oppos+0.222
 domain+0.214
amus+0.207
 dur+0.198
 negoti+0.197
Neg logit contributions
theless-0.295
wcs-0.288
@#&-0.280
etheless-0.265
racuse-0.256
Token of
Token ID286
Feature activation-3.87e-02

Pos logit contributions
 oppos+0.133
 domain+0.128
amus+0.123
 negoti+0.120
 dur+0.118
Neg logit contributions
theless-0.169
wcs-0.165
@#&-0.161
etheless-0.151
 RN-0.147
Token oppression
Token ID18345
Feature activation+4.37e-01

Pos logit contributions
 oppos+0.048
 domain+0.045
amus+0.043
 dur+0.040
 negoti+0.040
Neg logit contributions
theless-0.126
wcs-0.123
@#&-0.120
etheless-0.115
 RN-0.112
Token sheriff
Token ID16570
Feature activation+8.93e-02

Pos logit contributions
 oppos+0.023
 domain+0.022
amus+0.021
 negoti+0.020
 dur+0.020
Neg logit contributions
theless-0.028
wcs-0.027
@#&-0.027
etheless-0.025
racuse-0.025
Token's
Token ID338
Feature activation+1.04e-01

Pos logit contributions
theless+0.240
wcs+0.235
@#&+0.229
etheless+0.216
racuse+0.213
Neg logit contributions
 oppos-0.162
 domain-0.154
amus-0.147
 dur-0.142
 negoti-0.141
Token office
Token ID2607
Feature activation-2.00e-01

Pos logit contributions
theless+0.252
wcs+0.245
@#&+0.239
etheless+0.225
racuse+0.219
Neg logit contributions
 oppos-0.217
 domain-0.208
amus-0.201
 dur-0.194
 negoti-0.193
Token said
Token ID531
Feature activation+5.08e-02

Pos logit contributions
 oppos+0.459
 domain+0.443
amus+0.427
 dur+0.415
 negoti+0.413
Neg logit contributions
theless-0.443
wcs-0.431
@#&-0.420
etheless-0.391
racuse-0.379
Token.
Token ID13
Feature activation+7.26e-02

Pos logit contributions
theless+0.139
wcs+0.136
@#&+0.134
etheless+0.127
racuse+0.123
Neg logit contributions
 oppos-0.090
 domain-0.086
amus-0.082
 negoti-0.078
 dur-0.078
Token
Token ID198
Feature activation-5.99e-02

Pos logit contributions
theless+0.157
wcs+0.152
@#&+0.150
etheless+0.138
 RN+0.134
Neg logit contributions
 oppos-0.170
 domain-0.164
amus-0.159
 negoti-0.155
 dur-0.154
Token
Token ID198
Feature activation-4.08e-02

Pos logit contributions
 oppos+0.157
 domain+0.152
amus+0.147
 negoti+0.144
 dur+0.144
Neg logit contributions
theless-0.113
wcs-0.110
@#&-0.106
etheless-0.097
racuse-0.094
TokenWe
Token ID1135
Feature activation+1.55e-01

Pos logit contributions
 oppos+0.083
 domain+0.080
amus+0.076
 negoti+0.074
 dur+0.074
Neg logit contributions
theless-0.101
wcs-0.099
@#&-0.097
etheless-0.090
racuse-0.088
Tokenird
Token ID1447
Feature activation+3.84e-01

Pos logit contributions
theless+0.371
wcs+0.359
@#&+0.354
etheless+0.327
 RN+0.320
Neg logit contributions
 oppos-0.330
 domain-0.318
amus-0.304
 dur-0.295
 negoti-0.295
Token News
Token ID3000
Feature activation+6.28e-03

Pos logit contributions
theless+0.950
wcs+0.926
@#&+0.907
etheless+0.856
 RN+0.836
Neg logit contributions
 oppos-0.769
 domain-0.729
amus-0.704
 negoti-0.685
 propagate-0.677
Token Photos
Token ID9434
Feature activation+1.63e-01

Pos logit contributions
theless+0.011
wcs+0.011
@#&+0.010
etheless+0.009
racuse+0.009
Neg logit contributions
 oppos-0.018
 domain-0.017
amus-0.017
 dur-0.016
 negoti-0.016
Token couldn
Token ID3521
Feature activation+2.22e-01

Pos logit contributions
 oppos+0.022
 domain+0.021
amus+0.020
 dur+0.020
 negoti+0.020
Neg logit contributions
theless-0.020
wcs-0.020
@#&-0.019
etheless-0.018
 RN-0.017
Token
Token ID447
Feature activation-1.93e-01

Pos logit contributions
theless+0.475
wcs+0.464
@#&+0.462
 RN+0.411
etheless+0.409
Neg logit contributions
 oppos-0.516
 domain-0.496
amus-0.477
 negoti-0.474
 propagate-0.465
Token
Token ID247
Feature activation-1.80e-01

Pos logit contributions
 oppos+0.474
 domain+0.459
amus+0.443
 negoti+0.432
 dur+0.430
Neg logit contributions
theless-0.392
wcs-0.381
@#&-0.374
etheless-0.344
racuse-0.331
Tokent
Token ID83
Feature activation-1.75e-01

Pos logit contributions
 oppos+0.358
 domain+0.345
amus+0.330
 dur+0.318
 negoti+0.317
Neg logit contributions
theless-0.457
wcs-0.446
@#&-0.435
etheless-0.411
racuse-0.399
Token afford
Token ID5368
Feature activation-3.11e-01

Pos logit contributions
 oppos+0.295
 domain+0.281
amus+0.270
 dur+0.255
 negoti+0.254
Neg logit contributions
theless-0.490
wcs-0.482
@#&-0.472
etheless-0.445
 RN-0.435
Token the
Token ID262
Feature activation+3.44e-02

Pos logit contributions
 oppos+0.645
 domain+0.621
amus+0.596
 dur+0.575
 negoti+0.574
Neg logit contributions
theless-0.759
wcs-0.740
@#&-0.722
etheless-0.678
racuse-0.658
Token rent
Token ID5602
Feature activation+1.17e-01

Pos logit contributions
theless+0.088
wcs+0.086
@#&+0.084
etheless+0.079
 RN+0.078
Neg logit contributions
 oppos-0.066
 domain-0.063
amus-0.062
 propagate-0.059
 negoti-0.058
Token when
Token ID618
Feature activation-1.34e-01

Pos logit contributions
theless+0.295
wcs+0.289
@#&+0.282
etheless+0.264
 RN+0.257
Neg logit contributions
 oppos-0.234
 domain-0.224
amus-0.216
 dur-0.207
 negoti-0.206
Token their
Token ID511
Feature activation+1.25e-02

Pos logit contributions
 oppos+0.330
 domain+0.320
amus+0.309
 dur+0.300
 negoti+0.299
Neg logit contributions
theless-0.276
wcs-0.268
@#&-0.260
etheless-0.241
racuse-0.233
Token subsidy
Token ID28646
Feature activation-7.02e-03

Pos logit contributions
theless+0.032
wcs+0.031
@#&+0.031
etheless+0.029
 RN+0.029
Neg logit contributions
 oppos-0.023
 domain-0.022
amus-0.022
 propagate-0.021
 negoti-0.021
Token was
Token ID373
Feature activation-1.23e-02

Pos logit contributions
 oppos+0.016
 domain+0.015
amus+0.015
 dur+0.014
 negoti+0.014
Neg logit contributions
theless-0.016
wcs-0.016
@#&-0.015
etheless-0.014
racuse-0.014
Token the
Token ID262
Feature activation-2.62e-04

Pos logit contributions
theless+0.100
wcs+0.097
@#&+0.095
etheless+0.089
 RN+0.087
Neg logit contributions
 oppos-0.084
 domain-0.081
amus-0.079
 negoti-0.076
 dur-0.076
Token other
Token ID584
Feature activation-8.26e-02

Pos logit contributions
 oppos+0.001
 domain+0.001
amus+0.001
 negoti+0.001
 dur+0.001
Neg logit contributions
theless-0.001
wcs-0.001
@#&-0.001
etheless-0.001
 RN-0.001
Token lost
Token ID2626
Feature activation-6.43e-02

Pos logit contributions
 oppos+0.179
 domain+0.175
amus+0.169
 negoti+0.164
 dur+0.163
Neg logit contributions
theless-0.190
wcs-0.186
@#&-0.181
etheless-0.170
 RN-0.164
Token,
Token ID11
Feature activation-7.19e-02

Pos logit contributions
 oppos+0.147
 domain+0.143
amus+0.140
 negoti+0.134
 dur+0.134
Neg logit contributions
theless-0.140
wcs-0.137
@#&-0.133
etheless-0.124
 RN-0.120
Token broken
Token ID5445
Feature activation-4.93e-02

Pos logit contributions
 oppos+0.149
 domain+0.145
amus+0.140
 dur+0.135
 negoti+0.135
Neg logit contributions
theless-0.173
wcs-0.169
@#&-0.164
etheless-0.154
 RN-0.150
Token toys
Token ID14958
Feature activation+3.03e-01

Pos logit contributions
 oppos+0.112
 domain+0.109
amus+0.105
 negoti+0.102
 dur+0.102
Neg logit contributions
theless-0.109
wcs-0.107
@#&-0.104
etheless-0.096
racuse-0.093
Token cast
Token ID3350
Feature activation-3.77e-01

Pos logit contributions
theless+0.782
wcs+0.767
@#&+0.751
etheless+0.707
racuse+0.687
Neg logit contributions
 oppos-0.578
 domain-0.558
amus-0.533
 dur-0.512
 negoti-0.511
Token off
Token ID572
Feature activation+2.19e-02

Pos logit contributions
 oppos+0.747
 domain+0.718
amus+0.687
 negoti+0.662
 dur+0.661
Neg logit contributions
theless-0.956
wcs-0.935
@#&-0.912
etheless-0.859
racuse-0.836
Token from
Token ID422
Feature activation+6.53e-02

Pos logit contributions
theless+0.053
wcs+0.052
@#&+0.050
etheless+0.047
racuse+0.046
Neg logit contributions
 oppos-0.045
 domain-0.044
amus-0.042
 negoti-0.041
 dur-0.040
Token various
Token ID2972
Feature activation-7.94e-02

Pos logit contributions
theless+0.161
wcs+0.157
@#&+0.154
etheless+0.144
racuse+0.140
Neg logit contributions
 oppos-0.132
 domain-0.128
amus-0.124
 dur-0.119
 negoti-0.118
Token NHL
Token ID9481
Feature activation-5.96e-01

Pos logit contributions
 oppos+0.176
 domain+0.170
amus+0.165
 dur+0.159
 negoti+0.159
Neg logit contributions
theless-0.181
wcs-0.176
@#&-0.172
etheless-0.160
 RN-0.155
Token his
Token ID465
Feature activation+8.06e-02

Pos logit contributions
 oppos+0.054
 domain+0.052
amus+0.050
 negoti+0.048
 propagate+0.048
Neg logit contributions
theless-0.074
wcs-0.073
@#&-0.071
etheless-0.067
 RN-0.065
Token mistake
Token ID7457
Feature activation-1.61e-01

Pos logit contributions
theless+0.196
wcs+0.192
@#&+0.188
etheless+0.175
 RN+0.171
Neg logit contributions
 oppos-0.164
 domain-0.156
amus-0.153
 dur-0.147
 negoti-0.147
Token.
Token ID13
Feature activation-4.83e-02

Pos logit contributions
 oppos+0.311
 domain+0.299
amus+0.286
 negoti+0.275
 dur+0.275
Neg logit contributions
theless-0.416
wcs-0.406
@#&-0.397
etheless-0.373
racuse-0.363
Token
Token ID447
Feature activation-1.99e-01

Pos logit contributions
 oppos+0.098
 domain+0.094
amus+0.090
 negoti+0.087
 dur+0.087
Neg logit contributions
theless-0.120
wcs-0.117
@#&-0.115
etheless-0.107
 RN-0.105
Token
Token ID250
Feature activation+1.08e-01

Pos logit contributions
 oppos+0.449
 domain+0.433
amus+0.416
 dur+0.405
 negoti+0.403
Neg logit contributions
theless-0.452
wcs-0.439
@#&-0.426
etheless-0.399
racuse-0.387
TokenThe
Token ID464
Feature activation-3.32e-02

Pos logit contributions
theless+0.266
wcs+0.259
@#&+0.255
etheless+0.237
 RN+0.230
Neg logit contributions
 oppos-0.223
 domain-0.215
amus-0.204
 negoti-0.200
 dur-0.198
Token one
Token ID530
Feature activation+4.88e-02

Pos logit contributions
 oppos+0.067
 domain+0.064
amus+0.062
 negoti+0.060
 dur+0.060
Neg logit contributions
theless-0.082
wcs-0.081
@#&-0.079
etheless-0.074
 RN-0.072
Token thing
Token ID1517
Feature activation-9.08e-02

Pos logit contributions
theless+0.118
wcs+0.116
@#&+0.113
etheless+0.106
racuse+0.103
Neg logit contributions
 oppos-0.102
 domain-0.097
amus-0.094
 dur-0.090
 propagate-0.090
Token that
Token ID326
Feature activation+4.38e-02

Pos logit contributions
 oppos+0.189
 domain+0.182
amus+0.175
 dur+0.169
 negoti+0.169
Neg logit contributions
theless-0.221
wcs-0.215
@#&-0.210
etheless-0.197
 RN-0.190
Token Bob
Token ID5811
Feature activation-8.56e-02

Pos logit contributions
theless+0.105
wcs+0.101
@#&+0.099
etheless+0.093
 RN+0.090
Neg logit contributions
 oppos-0.093
 domain-0.090
amus-0.087
 dur-0.083
 negoti-0.083
Token told
Token ID1297
Feature activation+7.37e-02

Pos logit contributions
 oppos+0.159
 domain+0.152
amus+0.144
 dur+0.139
 propagate+0.138
Neg logit contributions
theless-0.228
wcs-0.223
@#&-0.219
etheless-0.207
 RN-0.200
Token implications
Token ID10939
Feature activation-2.38e-01

Pos logit contributions
 oppos+0.124
 domain+0.119
amus+0.116
 negoti+0.111
 dur+0.111
Neg logit contributions
theless-0.160
wcs-0.156
@#&-0.152
etheless-0.143
racuse-0.140
Token for
Token ID329
Feature activation-1.42e-01

Pos logit contributions
 oppos+0.468
 domain+0.449
amus+0.433
 negoti+0.415
 dur+0.415
Neg logit contributions
theless-0.607
wcs-0.593
@#&-0.577
etheless-0.546
racuse-0.529
Token our
Token ID674
Feature activation+8.82e-03

Pos logit contributions
 oppos+0.280
 domain+0.269
amus+0.261
 negoti+0.250
 dur+0.250
Neg logit contributions
theless-0.356
wcs-0.348
@#&-0.338
etheless-0.319
 RN-0.310
Token common
Token ID2219
Feature activation-1.62e-01

Pos logit contributions
theless+0.023
wcs+0.023
@#&+0.022
etheless+0.021
 RN+0.020
Neg logit contributions
 oppos-0.016
 domain-0.015
amus-0.015
 negoti-0.015
 dur-0.014
Token security
Token ID2324
Feature activation+3.76e-02

Pos logit contributions
 oppos+0.294
 domain+0.278
amus+0.274
 negoti+0.263
 propagate+0.258
Neg logit contributions
theless-0.429
wcs-0.417
@#&-0.410
etheless-0.385
racuse-0.378
Token.
Token ID13
Feature activation-3.87e-02

Pos logit contributions
theless+0.085
wcs+0.085
@#&+0.082
etheless+0.077
 RN+0.075
Neg logit contributions
 oppos-0.082
 domain-0.078
amus-0.077
 negoti-0.074
 propagate-0.073
Token
Token ID198
Feature activation-9.93e-02

Pos logit contributions
 oppos+0.090
 domain+0.087
amus+0.084
 negoti+0.083
 dur+0.081
Neg logit contributions
theless-0.084
wcs-0.081
@#&-0.080
etheless-0.073
 RN-0.073
Token
Token ID198
Feature activation-5.54e-02

Pos logit contributions
 oppos+0.261
 domain+0.253
amus+0.244
 negoti+0.239
 dur+0.238
Neg logit contributions
theless-0.187
wcs-0.181
@#&-0.176
etheless-0.160
racuse-0.156
TokenDespite
Token ID8332
Feature activation-3.13e-01

Pos logit contributions
 oppos+0.113
 domain+0.109
amus+0.103
 negoti+0.102
 dur+0.101
Neg logit contributions
theless-0.136
wcs-0.134
@#&-0.131
etheless-0.121
racuse-0.118
Token this
Token ID428
Feature activation+7.72e-03

Pos logit contributions
 oppos+0.624
 domain+0.601
amus+0.581
 negoti+0.558
 dur+0.557
Neg logit contributions
theless-0.788
wcs-0.766
@#&-0.745
etheless-0.705
 RN-0.681
Token,
Token ID11
Feature activation-5.64e-03

Pos logit contributions
theless+0.020
wcs+0.019
@#&+0.019
etheless+0.018
 RN+0.017
Neg logit contributions
 oppos-0.015
 domain-0.014
amus-0.014
 negoti-0.013
 dur-0.013
Token1
Token ID16
Feature activation-1.36e-02

Pos logit contributions
theless+0.068
wcs+0.067
@#&+0.065
etheless+0.061
racuse+0.059
Neg logit contributions
 oppos-0.059
 domain-0.057
amus-0.055
 dur-0.053
 negoti-0.053
Tokenf
Token ID69
Feature activation-8.09e-03

Pos logit contributions
 oppos+0.033
 domain+0.032
amus+0.031
 negoti+0.030
 dur+0.030
Neg logit contributions
theless-0.028
wcs-0.027
@#&-0.026
etheless-0.024
racuse-0.024
Token4
Token ID19
Feature activation-7.77e-03

Pos logit contributions
 oppos+0.018
 domain+0.017
amus+0.016
 negoti+0.016
 dur+0.016
Neg logit contributions
theless-0.019
wcs-0.019
@#&-0.018
etheless-0.017
racuse-0.016
Tokenb
Token ID65
Feature activation+5.50e-03

Pos logit contributions
 oppos+0.018
 domain+0.017
amus+0.017
 negoti+0.017
 dur+0.016
Neg logit contributions
theless-0.017
wcs-0.017
@#&-0.016
etheless-0.015
racuse-0.014
Tokenbe
Token ID1350
Feature activation-9.18e-02

Pos logit contributions
theless+0.010
wcs+0.010
@#&+0.009
etheless+0.008
racuse+0.008
Neg logit contributions
 oppos-0.015
 domain-0.014
amus-0.014
 negoti-0.014
 dur-0.014
Token66
Token ID2791
Feature activation-9.44e-02

Pos logit contributions
 oppos+0.214
 domain+0.204
 negoti+0.197
amus+0.196
 dur+0.192
Neg logit contributions
theless-0.196
wcs-0.195
@#&-0.189
etheless-0.171
racuse-0.171
Token power
Token ID1176
Feature activation+1.33e-01

Pos logit contributions
 oppos+0.211
 domain+0.202
amus+0.196
 negoti+0.189
 dur+0.189
Neg logit contributions
theless-0.215
wcs-0.211
@#&-0.206
etheless-0.191
racuse-0.186
Token off
Token ID572
Feature activation-1.35e-01

Pos logit contributions
theless+0.345
wcs+0.338
@#&+0.331
etheless+0.310
 RN+0.303
Neg logit contributions
 oppos-0.254
 domain-0.239
amus-0.234
 negoti-0.223
 dur-0.222
Token
Token ID198
Feature activation-8.86e-02

Pos logit contributions
 oppos+0.327
 domain+0.313
amus+0.306
 negoti+0.296
 dur+0.295
Neg logit contributions
theless-0.281
wcs-0.274
@#&-0.266
etheless-0.246
 RN-0.238
Token
Token ID198
Feature activation-1.49e-01

Pos logit contributions
 oppos+0.231
 domain+0.224
amus+0.217
 negoti+0.211
 dur+0.211
Neg logit contributions
theless-0.169
wcs-0.164
@#&-0.159
etheless-0.145
racuse-0.140
TokenNotice
Token ID26396
Feature activation-4.05e-01

Pos logit contributions
 oppos+0.313
 domain+0.300
amus+0.287
 negoti+0.280
 dur+0.280
Neg logit contributions
theless-0.362
wcs-0.356
@#&-0.346
etheless-0.323
racuse-0.315
Token worked
Token ID3111
Feature activation+2.01e-01

Pos logit contributions
theless+0.295
wcs+0.289
@#&+0.279
etheless+0.258
 RN+0.258
Neg logit contributions
 oppos-0.273
 domain-0.264
amus-0.255
 dur-0.248
 propagate-0.247
Token with
Token ID351
Feature activation-1.55e-03

Pos logit contributions
theless+0.482
wcs+0.478
@#&+0.466
etheless+0.437
 RN+0.427
Neg logit contributions
 oppos-0.412
 domain-0.400
amus-0.387
 propagate-0.370
 dur-0.366
Token Pam
Token ID24497
Feature activation+1.53e-01

Pos logit contributions
 oppos+0.003
 domain+0.003
amus+0.003
 negoti+0.003
 dur+0.003
Neg logit contributions
theless-0.004
wcs-0.004
@#&-0.004
etheless-0.003
 RN-0.003
Token as
Token ID355
Feature activation-1.02e-01

Pos logit contributions
theless+0.384
wcs+0.377
@#&+0.371
etheless+0.345
racuse+0.338
Neg logit contributions
 oppos-0.300
 domain-0.291
amus-0.278
 negoti-0.266
 dur-0.265
Token long
Token ID890
Feature activation+1.44e-02

Pos logit contributions
 oppos+0.186
 domain+0.178
amus+0.172
 negoti+0.164
 dur+0.163
Neg logit contributions
theless-0.270
wcs-0.266
@#&-0.259
etheless-0.245
 RN-0.240
Token as
Token ID355
Feature activation-1.51e-01

Pos logit contributions
theless+0.039
wcs+0.039
@#&+0.038
etheless+0.036
racuse+0.035
Neg logit contributions
 oppos-0.026
 domain-0.024
amus-0.023
 dur-0.022
 negoti-0.022
Token you
Token ID345
Feature activation+2.41e-02

Pos logit contributions
 oppos+0.355
 domain+0.342
amus+0.336
 negoti+0.323
 dur+0.321
Neg logit contributions
theless-0.320
wcs-0.312
@#&-0.303
etheless-0.282
 RN-0.275
Token did
Token ID750
Feature activation-1.06e-01

Pos logit contributions
theless+0.056
wcs+0.055
@#&+0.054
etheless+0.050
 RN+0.049
Neg logit contributions
 oppos-0.052
 domain-0.050
amus-0.048
 negoti-0.047
 dur-0.046
Token,
Token ID11
Feature activation-1.46e-01

Pos logit contributions
 oppos+0.236
 domain+0.227
amus+0.221
 dur+0.212
 negoti+0.212
Neg logit contributions
theless-0.240
wcs-0.235
@#&-0.231
etheless-0.214
 RN-0.210
Token was
Token ID373
Feature activation-1.05e-01

Pos logit contributions
 oppos+0.294
 domain+0.283
amus+0.276
 negoti+0.263
 propagate+0.260
Neg logit contributions
theless-0.359
wcs-0.349
@#&-0.345
etheless-0.319
 RN-0.316
Token that
Token ID326
Feature activation-1.23e-01

Pos logit contributions
 oppos+0.233
 domain+0.226
amus+0.221
 negoti+0.212
 propagate+0.211
Neg logit contributions
theless-0.235
wcs-0.231
@#&-0.226
etheless-0.210
 RN-0.204
Token
Token ID247
Feature activation+5.84e-02

Pos logit contributions
theless+0.224
wcs+0.218
@#&+0.216
etheless+0.191
 RN+0.187
Neg logit contributions
 oppos-0.365
 domain-0.354
amus-0.345
 negoti-0.339
 propagate-0.335
Tokens
Token ID82
Feature activation-1.44e-01

Pos logit contributions
theless+0.117
wcs+0.113
@#&+0.110
etheless+0.100
racuse+0.098
Neg logit contributions
 oppos-0.146
 domain-0.140
amus-0.137
 dur-0.133
 negoti-0.133
Token rents
Token ID28393
Feature activation-2.58e-02

Pos logit contributions
 oppos+0.289
 domain+0.274
amus+0.271
 negoti+0.256
 dur+0.256
Neg logit contributions
theless-0.359
wcs-0.346
@#&-0.338
etheless-0.320
 RN-0.311
Token come
Token ID1282
Feature activation+6.69e-02

Pos logit contributions
 oppos+0.053
 domain+0.051
amus+0.049
 negoti+0.048
 dur+0.047
Neg logit contributions
theless-0.062
wcs-0.061
@#&-0.060
etheless-0.055
 RN-0.054
Token to
Token ID284
Feature activation+2.50e-01

Pos logit contributions
theless+0.169
wcs+0.168
@#&+0.163
etheless+0.153
racuse+0.150
Neg logit contributions
 oppos-0.129
 domain-0.124
amus-0.119
 negoti-0.114
 propagate-0.113
Token an
Token ID281
Feature activation-7.95e-02

Pos logit contributions
theless+0.580
wcs+0.570
@#&+0.552
etheless+0.522
 RN+0.510
Neg logit contributions
 oppos-0.533
 domain-0.511
amus-0.498
 negoti-0.480
 propagate-0.475
Token additional
Token ID3224
Feature activation-1.20e-01

Pos logit contributions
 oppos+0.147
 domain+0.142
amus+0.137
 dur+0.131
 propagate+0.131
Neg logit contributions
theless-0.209
wcs-0.205
@#&-0.201
etheless-0.189
 RN-0.184
Token $
Token ID720
Feature activation+3.07e-01

Pos logit contributions
 oppos+0.264
 domain+0.253
amus+0.247
 negoti+0.237
 propagate+0.237
Neg logit contributions
theless-0.277
wcs-0.271
@#&-0.264
etheless-0.246
 RN-0.239
Token235
Token ID22370
Feature activation+1.74e-01

Pos logit contributions
theless+0.694
wcs+0.676
@#&+0.663
etheless+0.608
racuse+0.597
Neg logit contributions
 oppos-0.692
 domain-0.666
amus-0.638
 negoti-0.629
 dur-0.621
Token million
Token ID1510
Feature activation+1.15e-01

Pos logit contributions
theless+0.436
wcs+0.429
@#&+0.419
etheless+0.393
racuse+0.384
Neg logit contributions
 oppos-0.344
 domain-0.328
amus-0.317
 negoti-0.305
 dur-0.303
Token in
Token ID287
Feature activation+3.25e-02

Pos logit contributions
theless+0.286
wcs+0.281
@#&+0.273
etheless+0.256
racuse+0.250
Neg logit contributions
 oppos-0.231
 domain-0.221
amus-0.214
 negoti-0.205
 dur-0.203
Token an
Token ID281
Feature activation-6.33e-02

Pos logit contributions
 oppos+0.209
 domain+0.201
amus+0.196
 negoti+0.187
 dur+0.186
Neg logit contributions
theless-0.274
wcs-0.265
@#&-0.259
etheless-0.244
 RN-0.237
Token imitation
Token ID40260
Feature activation+4.10e-02

Pos logit contributions
 oppos+0.117
 domain+0.113
amus+0.108
 dur+0.104
 negoti+0.103
Neg logit contributions
theless-0.167
wcs-0.163
@#&-0.160
etheless-0.150
 RN-0.148
Token Ver
Token ID4643
Feature activation-1.89e-01

Pos logit contributions
theless+0.102
wcs+0.099
@#&+0.097
etheless+0.091
 RN+0.089
Neg logit contributions
 oppos-0.082
 domain-0.078
amus-0.076
 negoti-0.073
 propagate-0.073
Tokenme
Token ID1326
Feature activation-2.14e-01

Pos logit contributions
 oppos+0.410
 domain+0.394
amus+0.377
 negoti+0.366
 propagate+0.364
Neg logit contributions
theless-0.445
wcs-0.433
@#&-0.420
etheless-0.391
racuse-0.384
Tokener
Token ID263
Feature activation+2.73e-01

Pos logit contributions
 oppos+0.500
 domain+0.484
amus+0.466
 dur+0.452
 negoti+0.451
Neg logit contributions
theless-0.468
wcs-0.455
@#&-0.443
etheless-0.415
racuse-0.400
Token hung
Token ID9174
Feature activation-1.56e-01

Pos logit contributions
theless+0.683
wcs+0.667
@#&+0.652
etheless+0.610
 RN+0.597
Neg logit contributions
 oppos-0.544
 domain-0.516
amus-0.500
 negoti-0.479
 propagate-0.478
Token on
Token ID319
Feature activation-1.43e-01

Pos logit contributions
 oppos+0.292
 domain+0.281
amus+0.269
 negoti+0.258
 propagate+0.256
Neg logit contributions
theless-0.405
wcs-0.398
@#&-0.387
etheless-0.368
 RN-0.357
Token the
Token ID262
Feature activation-2.21e-02

Pos logit contributions
 oppos+0.270
 domain+0.258
amus+0.250
 negoti+0.238
 dur+0.236
Neg logit contributions
theless-0.372
wcs-0.362
@#&-0.355
etheless-0.336
racuse-0.326
Token wall
Token ID3355
Feature activation-3.67e-01

Pos logit contributions
 oppos+0.042
 domain+0.041
amus+0.039
 negoti+0.037
 propagate+0.037
Neg logit contributions
theless-0.057
wcs-0.055
@#&-0.054
etheless-0.051
 RN-0.050
Token.
Token ID13
Feature activation+1.10e-01

Pos logit contributions
 oppos+0.700
 domain+0.672
amus+0.644
 negoti+0.617
 dur+0.614
Neg logit contributions
theless-0.953
wcs-0.933
@#&-0.909
etheless-0.861
 RN-0.833
Token
Token ID564
Feature activation+4.60e-02

Pos logit contributions
theless+0.263
wcs+0.255
@#&+0.251
etheless+0.233
 RN+0.229
Neg logit contributions
 oppos-0.233
 domain-0.224
amus-0.217
 negoti-0.211
 dur-0.208
Token/
Token ID14
Feature activation+2.01e-01

Pos logit contributions
 oppos+0.700
 domain+0.681
amus+0.653
 dur+0.638
 negoti+0.633
Neg logit contributions
theless-0.576
wcs-0.557
@#&-0.543
etheless-0.505
racuse-0.484
Tokenkg
Token ID10025
Feature activation+1.77e-01

Pos logit contributions
wcs+0.319
theless+0.318
@#&+0.300
etheless+0.264
racuse+0.260
Neg logit contributions
 oppos-0.583
 domain-0.564
 negoti-0.544
amus-0.543
 dur-0.538
Token)
Token ID8
Feature activation+2.21e-01

Pos logit contributions
theless+0.429
wcs+0.417
@#&+0.407
etheless+0.382
racuse+0.371
Neg logit contributions
 oppos-0.369
 domain-0.355
amus-0.341
 negoti-0.330
 dur-0.329
Token significantly
Token ID5566
Feature activation+5.77e-01

Pos logit contributions
theless+0.518
wcs+0.497
@#&+0.482
etheless+0.452
 RN+0.445
Neg logit contributions
 oppos-0.477
 domain-0.454
amus-0.443
 negoti-0.429
 propagate-0.425
Token reduced
Token ID5322
Feature activation-3.36e-02

Pos logit contributions
theless+1.285
wcs+1.235
@#&+1.193
etheless+1.104
 RN+1.103
Neg logit contributions
 oppos-1.288
 domain-1.239
amus-1.212
 propagate-1.168
 negoti-1.165
Token anxiety
Token ID9751
Feature activation-4.46e-01

Pos logit contributions
 oppos+0.065
 domain+0.062
amus+0.061
 negoti+0.058
 propagate+0.057
Neg logit contributions
theless-0.085
wcs-0.083
@#&-0.081
etheless-0.076
 RN-0.075
Token-
Token ID12
Feature activation+6.55e-02

Pos logit contributions
 oppos+0.921
 domain+0.886
amus+0.850
 dur+0.821
 negoti+0.819
Neg logit contributions
theless-1.094
wcs-1.067
@#&-1.041
etheless-0.978
racuse-0.950
Tokenlike
Token ID2339
Feature activation-1.67e-01

Pos logit contributions
theless+0.143
wcs+0.139
@#&+0.134
etheless+0.126
 RN+0.121
Neg logit contributions
 oppos-0.152
 domain-0.146
amus-0.141
 negoti-0.139
 dur-0.138
Token behavior
Token ID4069
Feature activation-1.44e-01

Pos logit contributions
 oppos+0.373
 domain+0.358
amus+0.347
 negoti+0.336
 dur+0.335
Neg logit contributions
theless-0.379
wcs-0.366
@#&-0.358
etheless-0.335
 RN-0.325
Token in
Token ID287
Feature activation+1.28e-01

Pos logit contributions
 oppos+0.289
 domain+0.278
amus+0.267
 negoti+0.257
 dur+0.257
Neg logit contributions
theless-0.359
wcs-0.350
@#&-0.341
etheless-0.321
racuse-0.312
Token the
Token ID262
Feature activation+2.17e-01

Pos logit contributions
theless+0.313
wcs+0.304
@#&+0.297
etheless+0.278
 RN+0.274
Neg logit contributions
 oppos-0.261
 domain-0.252
amus-0.245
 negoti-0.236
 propagate-0.234
Token also
Token ID635
Feature activation+7.54e-03

Pos logit contributions
 oppos+0.259
 domain+0.249
amus+0.239
 dur+0.232
 negoti+0.232
Neg logit contributions
theless-0.274
wcs-0.267
@#&-0.261
etheless-0.244
racuse-0.237
Token been
Token ID587
Feature activation+2.17e-03

Pos logit contributions
theless+0.018
wcs+0.017
@#&+0.017
etheless+0.016
 RN+0.015
Neg logit contributions
 oppos-0.016
 domain-0.016
amus-0.015
 dur-0.014
 propagate-0.014
Token actively
Token ID10630
Feature activation+3.94e-01

Pos logit contributions
theless+0.005
wcs+0.005
@#&+0.005
etheless+0.005
 RN+0.004
Neg logit contributions
 oppos-0.005
 domain-0.004
amus-0.004
 dur-0.004
 propagate-0.004
Token making
Token ID1642
Feature activation+1.02e-01

Pos logit contributions
theless+0.931
wcs+0.914
@#&+0.874
etheless+0.817
 RN+0.811
Neg logit contributions
 oppos-0.824
 domain-0.796
amus-0.778
 dur-0.740
 propagate-0.736
Token a
Token ID257
Feature activation+7.34e-02

Pos logit contributions
theless+0.253
wcs+0.247
@#&+0.240
etheless+0.224
 RN+0.221
Neg logit contributions
 oppos-0.208
 domain-0.199
amus-0.192
 negoti-0.185
 dur-0.185
Token fool
Token ID9192
Feature activation-8.18e-01

Pos logit contributions
theless+0.188
wcs+0.184
@#&+0.179
etheless+0.168
 RN+0.165
Neg logit contributions
 oppos-0.142
 domain-0.135
amus-0.131
 negoti-0.126
 dur-0.125
Token of
Token ID286
Feature activation+3.22e-02

Pos logit contributions
 oppos+1.540
 domain+1.475
amus+1.407
 dur+1.355
 negoti+1.354
Neg logit contributions
theless-2.160
wcs-2.108
@#&-2.063
etheless-1.946
racuse-1.895
Token Gohan
Token ID45281
Feature activation+1.56e-01

Pos logit contributions
theless+0.090
wcs+0.088
@#&+0.086
etheless+0.081
 RN+0.080
Neg logit contributions
 oppos-0.055
 domain-0.052
amus-0.050
 dur-0.048
 negoti-0.048
Token by
Token ID416
Feature activation+1.01e-01

Pos logit contributions
theless+0.384
wcs+0.373
@#&+0.365
etheless+0.343
 RN+0.334
Neg logit contributions
 oppos-0.320
 domain-0.309
amus-0.295
 negoti-0.286
 dur-0.283
Token not
Token ID407
Feature activation-1.33e-01

Pos logit contributions
theless+0.244
wcs+0.236
@#&+0.229
etheless+0.215
 RN+0.211
Neg logit contributions
 oppos-0.211
 domain-0.204
amus-0.197
 negoti-0.189
 dur-0.188
Token only
Token ID691
Feature activation-1.92e-01

Pos logit contributions
 oppos+0.282
 domain+0.272
amus+0.262
 negoti+0.253
 dur+0.252
Neg logit contributions
theless-0.316
wcs-0.307
@#&-0.298
etheless-0.280
racuse-0.271
Token one
Token ID530
Feature activation+1.19e-01

Pos logit contributions
theless+0.049
wcs+0.048
@#&+0.046
etheless+0.044
 RN+0.043
Neg logit contributions
 oppos-0.038
 domain-0.036
amus-0.035
 negoti-0.034
 dur-0.033
Token element
Token ID5002
Feature activation+2.21e-01

Pos logit contributions
theless+0.309
wcs+0.302
@#&+0.295
etheless+0.278
racuse+0.270
Neg logit contributions
 oppos-0.229
 domain-0.220
amus-0.210
 dur-0.202
 negoti-0.202
Token of
Token ID286
Feature activation+2.16e-01

Pos logit contributions
theless+0.570
wcs+0.557
@#&+0.543
etheless+0.511
racuse+0.496
Neg logit contributions
 oppos-0.427
 domain-0.412
amus-0.396
 negoti-0.380
 dur-0.379
Token a
Token ID257
Feature activation+8.57e-02

Pos logit contributions
theless+0.558
wcs+0.543
@#&+0.533
etheless+0.499
 RN+0.487
Neg logit contributions
 oppos-0.412
 domain-0.398
amus-0.385
 negoti-0.370
 dur-0.364
Token tri
Token ID1333
Feature activation+3.40e-02

Pos logit contributions
theless+0.225
wcs+0.219
@#&+0.215
etheless+0.201
 RN+0.196
Neg logit contributions
 oppos-0.160
 domain-0.153
amus-0.150
 negoti-0.142
 dur-0.140
Tokenpty
Token ID5835
Feature activation-7.75e-01

Pos logit contributions
theless+0.094
wcs+0.092
@#&+0.090
etheless+0.085
racuse+0.083
Neg logit contributions
 oppos-0.059
 domain-0.057
amus-0.054
 dur-0.052
 negoti-0.052
Tokench
Token ID354
Feature activation+1.73e-02

Pos logit contributions
 oppos+1.371
 domain+1.314
amus+1.245
 dur+1.201
 negoti+1.192
Neg logit contributions
theless-2.134
wcs-2.081
@#&-2.037
etheless-1.931
racuse-1.882
Token called
Token ID1444
Feature activation+3.12e-02

Pos logit contributions
theless+0.044
wcs+0.043
@#&+0.041
etheless+0.039
 RN+0.038
Neg logit contributions
 oppos-0.034
 domain-0.033
amus-0.032
 negoti-0.030
 dur-0.030
Token Echo
Token ID21455
Feature activation-1.12e-01

Pos logit contributions
theless+0.079
wcs+0.076
@#&+0.075
etheless+0.070
racuse+0.068
Neg logit contributions
 oppos-0.062
 domain-0.060
amus-0.057
 negoti-0.055
 dur-0.055
Token that
Token ID326
Feature activation+1.28e-01

Pos logit contributions
 oppos+0.227
 domain+0.219
amus+0.209
 negoti+0.203
 dur+0.202
Neg logit contributions
theless-0.281
wcs-0.273
@#&-0.266
etheless-0.250
racuse-0.243
Token Indigenous
Token ID24033
Feature activation-1.45e-01

Pos logit contributions
theless+0.324
wcs+0.313
@#&+0.306
etheless+0.285
 RN+0.281
Neg logit contributions
 oppos-0.253
 domain-0.244
amus-0.235
 dur-0.226
 negoti-0.223
Token2
Token ID17
Feature activation+2.92e-02

Pos logit contributions
theless+0.547
wcs+0.535
@#&+0.521
etheless+0.482
 RN+0.470
Neg logit contributions
 oppos-0.530
 domain-0.512
amus-0.492
 negoti-0.480
 dur-0.477
Token:
Token ID25
Feature activation+2.51e-01

Pos logit contributions
theless+0.071
wcs+0.069
@#&+0.068
etheless+0.063
 RN+0.062
Neg logit contributions
 oppos-0.060
 domain-0.058
amus-0.056
 negoti-0.054
 dur-0.054
Token A
Token ID317
Feature activation-2.00e-01

Pos logit contributions
theless+0.612
wcs+0.597
@#&+0.586
 RN+0.539
etheless+0.539
Neg logit contributions
 oppos-0.519
 domain-0.496
amus-0.478
 negoti-0.463
 dur-0.460
Token Little
Token ID7703
Feature activation-3.70e-01

Pos logit contributions
 oppos+0.399
 domain+0.381
amus+0.366
 negoti+0.353
 dur+0.353
Neg logit contributions
theless-0.504
wcs-0.492
@#&-0.482
etheless-0.452
racuse-0.439
Token Prepar
Token ID38397
Feature activation-6.93e-02

Pos logit contributions
 oppos+0.766
 domain+0.734
amus+0.707
 negoti+0.683
 dur+0.680
Neg logit contributions
theless-0.905
wcs-0.882
@#&-0.865
etheless-0.810
 RN-0.792
Tokenation
Token ID341
Feature activation-8.05e-01

Pos logit contributions
 oppos+0.144
 domain+0.138
amus+0.131
 negoti+0.129
 dur+0.129
Neg logit contributions
theless-0.169
wcs-0.164
@#&-0.160
etheless-0.151
racuse-0.147
Token Goes
Token ID31914
Feature activation-4.28e-01

Pos logit contributions
 oppos+1.904
 domain+1.846
amus+1.779
 dur+1.730
 negoti+1.714
Neg logit contributions
theless-1.737
wcs-1.687
@#&-1.636
etheless-1.531
racuse-1.480
Token A
Token ID317
Feature activation-4.80e-02

Pos logit contributions
 oppos+0.906
 domain+0.874
amus+0.830
 negoti+0.826
 propagate+0.814
Neg logit contributions
theless-1.003
wcs-0.997
@#&-0.984
etheless-0.915
 RN-0.913
Token Long
Token ID5882
Feature activation+2.35e-01

Pos logit contributions
 oppos+0.100
 domain+0.096
amus+0.091
 negoti+0.089
 dur+0.088
Neg logit contributions
theless-0.117
wcs-0.114
@#&-0.112
etheless-0.105
 RN-0.102
Token Way
Token ID6378
Feature activation+1.69e-01

Pos logit contributions
theless+0.436
wcs+0.425
@#&+0.411
etheless+0.379
 RN+0.370
Neg logit contributions
 oppos-0.623
 domain-0.598
amus-0.578
 propagate-0.570
 negoti-0.569
Token
Token ID198
Feature activation-2.34e-01

Pos logit contributions
theless+0.337
wcs+0.329
@#&+0.320
etheless+0.294
 RN+0.290
Neg logit contributions
 oppos-0.421
 domain-0.408
amus-0.392
 negoti-0.386
 propagate-0.382
Token respond
Token ID3031
Feature activation-3.55e-01

Pos logit contributions
 oppos+0.181
 domain+0.171
amus+0.169
 negoti+0.161
 dur+0.160
Neg logit contributions
theless-0.239
wcs-0.233
@#&-0.228
etheless-0.213
 RN-0.211
Token.
Token ID13
Feature activation+9.17e-02

Pos logit contributions
 oppos+0.666
 domain+0.639
amus+0.608
 dur+0.589
 negoti+0.582
Neg logit contributions
theless-0.941
wcs-0.916
@#&-0.898
etheless-0.848
racuse-0.828
Token De
Token ID1024
Feature activation-1.38e-01

Pos logit contributions
theless+0.218
@#&+0.209
wcs+0.208
 RN+0.194
etheless+0.188
Neg logit contributions
 oppos-0.195
 domain-0.186
amus-0.180
 negoti-0.179
 propagate-0.173
Tokenanna
Token ID7697
Feature activation-1.99e-01

Pos logit contributions
 oppos+0.286
 domain+0.275
amus+0.262
 negoti+0.256
 dur+0.255
Neg logit contributions
theless-0.338
wcs-0.328
@#&-0.320
etheless-0.301
racuse-0.292
Token B
Token ID347
Feature activation+2.67e-01

Pos logit contributions
 oppos+0.405
 domain+0.388
amus+0.371
 dur+0.358
 negoti+0.358
Neg logit contributions
theless-0.497
wcs-0.482
@#&-0.472
etheless-0.443
 RN-0.431
Tokenarch
Token ID998
Feature activation-8.52e-01

Pos logit contributions
theless+0.643
wcs+0.625
@#&+0.618
etheless+0.574
 RN+0.564
Neg logit contributions
 oppos-0.556
 domain-0.530
amus-0.502
 dur-0.496
 negoti-0.494
Token,
Token ID11
Feature activation-8.89e-02

Pos logit contributions
 oppos+1.766
 domain+1.704
amus+1.636
 dur+1.578
 negoti+1.574
Neg logit contributions
theless-2.082
wcs-2.034
@#&-1.977
etheless-1.873
racuse-1.811
Token a
Token ID257
Feature activation-1.69e-01

Pos logit contributions
 oppos+0.173
 domain+0.165
amus+0.161
 negoti+0.154
 dur+0.152
Neg logit contributions
theless-0.226
wcs-0.220
@#&-0.216
etheless-0.204
 RN-0.202
Token professor
Token ID6240
Feature activation+3.05e-02

Pos logit contributions
 oppos+0.370
 domain+0.354
amus+0.346
 negoti+0.333
 dur+0.331
Neg logit contributions
theless-0.392
wcs-0.379
@#&-0.373
etheless-0.347
 RN-0.340
Token of
Token ID286
Feature activation+6.47e-02

Pos logit contributions
theless+0.079
wcs+0.077
@#&+0.075
etheless+0.071
racuse+0.069
Neg logit contributions
 oppos-0.059
 domain-0.056
amus-0.054
 dur-0.052
 negoti-0.052
Token psychology
Token ID15119
Feature activation+1.55e-01

Pos logit contributions
theless+0.146
wcs+0.142
@#&+0.139
etheless+0.130
 RN+0.128
Neg logit contributions
 oppos-0.142
 domain-0.137
amus-0.135
 negoti-0.130
 dur-0.129
Token can
Token ID460
Feature activation+8.79e-03

Pos logit contributions
 oppos+0.356
 domain+0.344
amus+0.330
 dur+0.320
 negoti+0.320
Neg logit contributions
theless-0.368
wcs-0.358
@#&-0.349
etheless-0.326
racuse-0.316
Token detach
Token ID48224
Feature activation+2.99e-01

Pos logit contributions
theless+0.021
wcs+0.021
@#&+0.020
etheless+0.018
 RN+0.018
Neg logit contributions
 oppos-0.018
 domain-0.018
amus-0.017
 negoti-0.016
 dur-0.016
Token the
Token ID262
Feature activation-8.86e-02

Pos logit contributions
theless+0.750
wcs+0.732
@#&+0.715
etheless+0.671
racuse+0.653
Neg logit contributions
 oppos-0.598
 domain-0.576
amus-0.552
 dur-0.532
 negoti-0.532
Token plastic
Token ID7309
Feature activation+4.03e-01

Pos logit contributions
 oppos+0.162
 domain+0.156
amus+0.150
 negoti+0.143
 dur+0.142
Neg logit contributions
theless-0.236
wcs-0.230
@#&-0.225
etheless-0.211
 RN-0.208
Token panel
Token ID6103
Feature activation+2.59e-01

Pos logit contributions
theless+0.966
wcs+0.948
@#&+0.918
etheless+0.859
 RN+0.845
Neg logit contributions
 oppos-0.834
 domain-0.805
amus-0.774
 negoti-0.745
 dur-0.743
Token running
Token ID2491
Feature activation-6.98e-01

Pos logit contributions
theless+0.638
wcs+0.627
@#&+0.609
etheless+0.570
 RN+0.556
Neg logit contributions
 oppos-0.522
 domain-0.503
amus-0.481
 negoti-0.466
 dur-0.465
Token alongside
Token ID7848
Feature activation-4.33e-01

Pos logit contributions
 oppos+1.294
 domain+1.242
amus+1.185
 dur+1.138
 negoti+1.136
Neg logit contributions
theless-1.858
wcs-1.818
@#&-1.776
etheless-1.677
racuse-1.632
Token the
Token ID262
Feature activation-1.84e-01

Pos logit contributions
 oppos+0.843
 domain+0.808
amus+0.771
 dur+0.746
 negoti+0.739
Neg logit contributions
theless-1.117
wcs-1.090
@#&-1.063
etheless-1.007
racuse-0.978
Token passenger
Token ID11849
Feature activation-1.78e-01

Pos logit contributions
 oppos+0.336
 domain+0.323
amus+0.310
 negoti+0.297
 dur+0.294
Neg logit contributions
theless-0.489
wcs-0.477
@#&-0.467
etheless-0.439
 RN-0.431
Token center
Token ID3641
Feature activation-8.75e-02

Pos logit contributions
 oppos+0.286
 domain+0.272
amus+0.258
 negoti+0.246
 dur+0.245
Neg logit contributions
theless-0.514
wcs-0.505
@#&-0.494
etheless-0.469
 RN-0.457
Token console
Token ID8624
Feature activation+1.12e-01

Pos logit contributions
 oppos+0.184
 domain+0.177
amus+0.169
 dur+0.163
 negoti+0.163
Neg logit contributions
theless-0.210
wcs-0.205
@#&-0.200
etheless-0.187
 RN-0.182