TOP ACTIVATIONS
MAX = 3.395

done for 5 consecutive th urs days . 2
of those brave sent in els who stand guard today along
turn their bodies into wh ir ring robot vibr ators .
': " Obama joined the Board of Directors of the Chicago
MORE and Hillary Clinton Hillary Diane Rodham Clinton Sanders : '
1994 and served on its board until 2002 ." 1998 annual
MORE and Hillary Clinton Hillary Diane Rodham Clinton Sanders : '
in combat boots / he els all day . If anything
p . 66 : " Board of Directors : Chairman :
their excess wages on their board ." Both Rodgers
Trump passes on ' in appropriate ' debate with Bernie Sanders
: D ani els ’ mug shot was
his un for give able language .
League makes them a great fit for 12 B ET who
is the beautiful Cup ani opsis an ac ard io ides
can have perfectly working normal maps and occ l usion maps
p . 21 : " Board of Directors : John T
on undergrad uates . O urs is a Bachelor of Science
ff 3 c 3 ff 0 d 247 409 bd 28
the otherwise rect il in ear facade . The designers explained

MIN ACTIVATIONS
MIN = -4.544

, I think Reviewer : B iz r Pix
iff Tra x Reviewer : V the Happy L
2010 boys beware Reviewer : j wt 0001 -
er magnet ? Reviewer : hey _ ler oy
bad idea . Reviewer : Mike M anners -
are Here .... Reviewer : SAS K ATCH EW
was about homosexuality Reviewer : k imp unk rock
T unes ( HD ) | iTunes ( HQ ) |
His Mother Is an Acc ompl ished Travel Writer
! <|endoftext|> Posted on by | in | in
( ML ive . com | File ) L
Hide Caption 14 of 15 Photos : Carly Fiorina 's political
of Americans support Graham - Cass idy , the health care
? Chinese production value . | I thought he had a
. TV LINE | So , the prom os
( Af D ) party | AFP / Getty Opinion It
Hide Caption 13 of 15 Photos : Carly Fiorina 's political
Hide Caption 12 of 15 Photos : Carly Fiorina 's political
Hide Caption 10 of 15 Photos : Carly Fiorina 's political
| iTunes ( HQ ) | iTunes ( SD )

INTERVAL 1.410 - 3.395
CONTAINS 1.260%

- M anc rus hes - Ro asting Classic O &
3 million in hand to bid on W ied er horn
). * Or you can buy the 23 films separately (
determined she had a severe burn and suggested they remove the
conduct wasn ’ t bribery . On the other hand

INTERVAL -0.575 - 1.410
CONTAINS 86.003%

China has added about 40 airports in the last decade
play out and just how disruptive it is going to be
office early next year . But , in spite of the
, good layout ." Google wouldn 't comment for
which is how you know it ’ s a fucking

INTERVAL -2.559 - -0.575
CONTAINS 12.695%

is not a compliant partner ," she said . "[ But
state : Option < Board >, // The previous board |
get worse as the election approaches . For a few years
considering the all - em br acing inter p ares and
Up " under the Spe lling drop - down menu .

INTERVAL -4.544 - -2.559
CONTAINS 0.042%

encouraging . Sy fy ’ s return to
BBC 's Im ogen F oul kes , in Geneva ,
is - A yes ee | When did 7 ing mad
Fri , 28 July 2017 | 00 : 32
story . Rep rint ed in In
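
The four INTERVAL blocks above look consistent with splitting the range from MIN to MAX into four equal-width bins, with the CONTAINS percentages adding up to the whole sample. The sketch below checks both observations using only numbers copied from this dump. The equal-width binning is an inference from the reported edges, not something the dump states, and the helper name equal_width_edges is hypothetical.

```python
# Values copied from the dump above (MIN, MAX, INTERVAL edges, CONTAINS shares).
MIN_ACT, MAX_ACT = -4.544, 3.395
reported_edges = [-4.544, -2.559, -0.575, 1.410, 3.395]
reported_share = [0.042, 12.695, 86.003, 1.260]  # percent, lowest interval first


def equal_width_edges(lo, hi, n_bins):
    """Return n_bins + 1 evenly spaced edges from lo to hi (hypothetical helper)."""
    step = (hi - lo) / n_bins
    return [lo + i * step for i in range(n_bins + 1)]


# Assumption: the dashboard used four equal-width bins between MIN and MAX.
edges = equal_width_edges(MIN_ACT, MAX_ACT, 4)

# Compare against the reported edges, allowing for rounding to three decimals.
edges_match = all(abs(e - r) <= 1e-3 for e, r in zip(edges, reported_edges))

# The interval shares should cover the whole sample.
total_share = sum(reported_share)

print("edges match reported intervals:", edges_match)
print("total share (%):", round(total_share, 3))
```

If the binning assumption holds, the middle two intervals together hold roughly 98.7% of activations, so the TOP and MIN examples above come from thin tails of the distribution.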
Token done
Token ID1760
Feature activation-2.46e-01

Pos logit contributions
flat+0.667
alias+0.663
aque+0.646
interstitial+0.617
yg+0.616
Neg logit contributions
ngth-0.623
 participating-0.520
 gaining-0.488
 finishing-0.487
 exerc-0.485
Token for
Token ID329
Feature activation+2.86e-01

Pos logit contributions
ngth+0.155
 participating+0.129
 gaining+0.120
 finishing+0.120
 exerc+0.119
Neg logit contributions
flat-0.197
alias-0.195
aque-0.191
yg-0.183
interstitial-0.183
Token 5
Token ID642
Feature activation+2.01e-01

Pos logit contributions
flat+0.214
alias+0.212
aque+0.208
interstitial+0.199
yg+0.199
Neg logit contributions
ngth-0.194
 participating-0.164
 gaining-0.154
 finishing-0.154
 exerc-0.153
Token consecutive
Token ID12785
Feature activation-7.99e-01

Pos logit contributions
flat+0.154
alias+0.153
aque+0.150
yg+0.144
interstitial+0.144
Neg logit contributions
ngth-0.132
 participating-0.111
 gaining-0.105
 finishing-0.104
 exerc-0.104
Token th
Token ID294
Feature activation+1.02e+00

Pos logit contributions
ngth+0.515
 participating+0.436
 gaining+0.408
 finishing+0.406
 exerc+0.402
Neg logit contributions
flat-0.623
alias-0.619
aque-0.606
yg-0.582
interstitial-0.582
Tokenurs
Token ID1834
Feature activation+3.40e+00

Pos logit contributions
flat+0.542
alias+0.538
aque+0.521
yg+0.490
interstitial+0.488
Neg logit contributions
ngth-0.912
 participating-0.812
 gaining-0.777
 finishing-0.774
 exerc-0.769
Tokendays
Token ID12545
Feature activation+7.40e-01

Pos logit contributions
alias+2.228
flat+2.227
aque+2.154
alia+2.076
yg+2.061
Neg logit contributions
ngth-2.576
 participating-2.220
 gaining-2.102
 finishing-2.084
 exerc-2.078
Token.
Token ID13
Feature activation+1.14e-01

Pos logit contributions
flat+0.560
alias+0.555
aque+0.538
alia+0.517
+0.515
Neg logit contributions
ngth-0.498
 participating-0.415
 gaining-0.389
 exerc-0.388
lished-0.387
Token
Token ID198
Feature activation-4.31e-01

Pos logit contributions
flat+0.076
alias+0.075
aque+0.073
yg+0.069
interstitial+0.069
Neg logit contributions
ngth-0.088
 participating-0.076
 gaining-0.072
 finishing-0.072
 exerc-0.072
Token
Token ID198
Feature activation-5.05e-01

Pos logit contributions
ngth+0.375
 participating+0.318
lished+0.308
 exerc+0.308
 gaining+0.305
Neg logit contributions
flat-0.257
alias-0.256
aque-0.243
yg-0.232
-0.230
Token2
Token ID17
Feature activation+8.35e-02

Pos logit contributions
ngth+0.398
 participating+0.348
 gaining+0.331
 finishing+0.330
 exerc+0.326
Neg logit contributions
flat-0.319
alias-0.318
aque-0.309
interstitial-0.296
yg-0.296
Token of
Token ID286
Feature activation+3.57e-01

Pos logit contributions
ngth+0.432
 participating+0.360
 gaining+0.336
 finishing+0.335
 exerc+0.331
Neg logit contributions
flat-0.585
alias-0.584
aque-0.570
yg-0.550
interstitial-0.548
Token those
Token ID883
Feature activation+1.84e+00

Pos logit contributions
flat+0.254
alias+0.253
aque+0.246
interstitial+0.236
yg+0.236
Neg logit contributions
ngth-0.255
 participating-0.220
 gaining-0.207
 finishing-0.207
 exerc-0.205
Token brave
Token ID14802
Feature activation+1.41e+00

Pos logit contributions
flat+1.449
alias+1.439
aque+1.403
yg+1.358
+1.348
Neg logit contributions
ngth-1.168
 participating-0.934
lished-0.886
 exerc-0.884
 gaining-0.882
Token sent
Token ID1908
Feature activation+7.99e-01

Pos logit contributions
alias+1.023
flat+1.021
aque+0.996
yg+0.955
+0.948
Neg logit contributions
ngth-0.983
 participating-0.829
 gaining-0.784
 finishing-0.780
 exerc-0.772
Tokenin
Token ID259
Feature activation+3.51e-01

Pos logit contributions
flat+0.600
alias+0.597
aque+0.584
yg+0.562
interstitial+0.557
Neg logit contributions
ngth-0.536
 participating-0.455
 gaining-0.430
 finishing-0.427
 exerc-0.425
Tokenels
Token ID1424
Feature activation+3.28e+00

Pos logit contributions
flat+0.108
alias+0.106
aque+0.100
yg+0.090
interstitial+0.089
Neg logit contributions
ngth-0.392
 participating-0.357
 gaining-0.345
 finishing-0.344
 exerc-0.343
Token who
Token ID508
Feature activation+1.63e-01

Pos logit contributions
flat+2.516
alias+2.513
aque+2.450
+2.352
alia+2.348
Neg logit contributions
ngth-2.141
 participating-1.681
lished-1.670
 exerc-1.625
 gaining-1.603
Token stand
Token ID1302
Feature activation+2.85e-01

Pos logit contributions
flat+0.125
alias+0.125
aque+0.123
yg+0.117
interstitial+0.116
Neg logit contributions
ngth-0.106
 participating-0.089
 gaining-0.084
 finishing-0.083
 exerc-0.083
Token guard
Token ID4860
Feature activation+9.07e-01

Pos logit contributions
flat+0.224
alias+0.222
aque+0.217
yg+0.209
interstitial+0.207
Neg logit contributions
ngth-0.183
 participating-0.153
 gaining-0.144
 finishing-0.143
 exerc-0.142
Token today
Token ID1909
Feature activation-3.31e-01

Pos logit contributions
flat+0.716
alias+0.713
aque+0.696
yg+0.667
interstitial+0.665
Neg logit contributions
ngth-0.576
 participating-0.482
 gaining-0.452
 finishing-0.450
 exerc-0.446
Token along
Token ID1863
Feature activation+1.23e+00

Pos logit contributions
ngth+0.216
 participating+0.184
 gaining+0.172
 finishing+0.172
 exerc+0.170
Neg logit contributions
flat-0.255
alias-0.254
aque-0.248
interstitial-0.239
yg-0.238
Token turn
Token ID1210
Feature activation+4.23e-01

Pos logit contributions
ngth+0.184
 participating+0.158
 gaining+0.148
 finishing+0.148
 exerc+0.146
Neg logit contributions
flat-0.203
alias-0.202
aque-0.197
yg-0.189
interstitial-0.189
Token their
Token ID511
Feature activation-8.98e-02

Pos logit contributions
flat+0.317
alias+0.315
aque+0.308
yg+0.296
interstitial+0.293
Neg logit contributions
ngth-0.287
 participating-0.242
 gaining-0.227
 finishing-0.227
 exerc-0.224
Token bodies
Token ID5920
Feature activation-6.74e-01

Pos logit contributions
ngth+0.065
 participating+0.056
 gaining+0.053
 finishing+0.053
 exerc+0.052
Neg logit contributions
flat-0.063
alias-0.062
aque-0.061
yg-0.058
interstitial-0.058
Token into
Token ID656
Feature activation+6.24e-01

Pos logit contributions
ngth+0.459
 participating+0.393
 gaining+0.369
 finishing+0.368
 exerc+0.365
Neg logit contributions
flat-0.500
alias-0.497
aque-0.485
yg-0.465
interstitial-0.465
Token wh
Token ID348
Feature activation+2.06e-01

Pos logit contributions
flat+0.458
alias+0.453
aque+0.444
yg+0.427
interstitial+0.424
Neg logit contributions
ngth-0.433
 participating-0.365
 gaining-0.344
 finishing-0.343
 exerc-0.339
Tokenir
Token ID343
Feature activation+3.26e+00

Pos logit contributions
flat+0.109
alias+0.108
aque+0.104
yg+0.098
interstitial+0.098
Neg logit contributions
ngth-0.185
 participating-0.164
 gaining-0.157
 finishing-0.157
 exerc-0.156
Tokenring
Token ID1806
Feature activation+9.18e-01

Pos logit contributions
flat+2.120
alias+2.066
aque+1.973
yg+1.927
ine+1.923
Neg logit contributions
ngth-2.493
 participating-2.157
lished-2.068
 gaining-2.046
 exerc-2.032
Token robot
Token ID9379
Feature activation+1.14e-01

Pos logit contributions
flat+0.687
alias+0.680
aque+0.667
yg+0.645
interstitial+0.637
Neg logit contributions
ngth-0.626
 participating-0.525
 gaining-0.495
 finishing-0.490
 exerc-0.484
Token vibr
Token ID12611
Feature activation-4.48e-01

Pos logit contributions
flat+0.086
alias+0.085
aque+0.083
yg+0.080
interstitial+0.080
Neg logit contributions
ngth-0.077
 participating-0.066
 gaining-0.062
 finishing-0.061
 exerc-0.061
Tokenators
Token ID2024
Feature activation-4.29e-01

Pos logit contributions
ngth+0.263
 participating+0.219
 gaining+0.203
 finishing+0.203
 exerc+0.201
Neg logit contributions
flat-0.375
alias-0.373
aque-0.365
interstitial-0.352
yg-0.352
Token.
Token ID13
Feature activation-2.90e-01

Pos logit contributions
ngth+0.286
 participating+0.240
 finishing+0.225
 gaining+0.225
 exerc+0.222
Neg logit contributions
flat-0.326
alias-0.324
aque-0.316
yg-0.303
alia-0.301
Token':
Token ID10354
Feature activation+7.65e-01

Pos logit contributions
ngth+1.362
 participating+1.178
 gaining+1.110
 finishing+1.107
 exerc+1.093
Neg logit contributions
flat-1.391
alias-1.386
aque-1.351
interstitial-1.296
yg-1.296
Token "
Token ID366
Feature activation-1.34e-01

Pos logit contributions
flat+0.550
alias+0.549
aque+0.533
interstitial+0.511
yg+0.509
Neg logit contributions
ngth-0.542
 participating-0.460
 gaining-0.435
 finishing-0.433
 exerc-0.431
TokenObama
Token ID15948
Feature activation-2.41e-01

Pos logit contributions
ngth+0.103
 participating+0.090
 gaining+0.085
 finishing+0.085
 exerc+0.084
Neg logit contributions
flat-0.088
alias-0.088
aque-0.085
interstitial-0.081
yg-0.081
Token joined
Token ID5399
Feature activation+1.13e+00

Pos logit contributions
ngth+0.162
 participating+0.136
 gaining+0.128
 finishing+0.128
 exerc+0.127
Neg logit contributions
flat-0.182
alias-0.182
aque-0.177
yg-0.170
interstitial-0.169
Token the
Token ID262
Feature activation+4.96e-01

Pos logit contributions
alias+0.835
flat+0.831
aque+0.809
interstitial+0.776
yg+0.775
Neg logit contributions
ngth-0.781
 participating-0.655
 gaining-0.619
 finishing-0.616
 exerc-0.613
Token Board
Token ID5926
Feature activation+3.24e+00

Pos logit contributions
flat+0.358
alias+0.357
aque+0.349
yg+0.333
interstitial+0.333
Neg logit contributions
ngth-0.350
 participating-0.293
 gaining-0.278
 exerc-0.276
 finishing-0.276
Token of
Token ID286
Feature activation+2.32e-01

Pos logit contributions
flat+2.689
alias+2.682
aque+2.609
alia+2.518
yg+2.511
Neg logit contributions
ngth-1.913
 participating-1.511
 exerc-1.442
lished-1.424
 gaining-1.423
Token Directors
Token ID29200
Feature activation+8.81e-01

Pos logit contributions
flat+0.150
alias+0.149
aque+0.145
interstitial+0.138
yg+0.138
Neg logit contributions
ngth-0.179
 participating-0.157
 gaining-0.149
 finishing-0.149
 exerc-0.147
Token of
Token ID286
Feature activation+7.69e-02

Pos logit contributions
flat+0.749
alias+0.745
aque+0.726
interstitial+0.699
yg+0.699
Neg logit contributions
ngth-0.510
 participating-0.410
 gaining-0.383
 finishing-0.381
 exerc-0.379
Token the
Token ID262
Feature activation-1.20e-01

Pos logit contributions
flat+0.057
alias+0.057
aque+0.055
interstitial+0.053
yg+0.053
Neg logit contributions
ngth-0.052
 participating-0.045
 gaining-0.042
 finishing-0.042
 exerc-0.042
Token Chicago
Token ID4842
Feature activation-3.65e-01

Pos logit contributions
ngth+0.086
 participating+0.073
 gaining+0.069
 finishing+0.069
 exerc+0.069
Neg logit contributions
flat-0.086
alias-0.085
aque-0.083
interstitial-0.080
yg-0.079
Token MORE
Token ID12011
Feature activation-2.93e-01

Pos logit contributions
flat+0.007
alias+0.007
aque+0.007
yg+0.007
interstitial+0.007
Neg logit contributions
ngth-0.012
 participating-0.010
 gaining-0.010
 finishing-0.010
 exerc-0.010
Token and
Token ID290
Feature activation-8.11e-02

Pos logit contributions
ngth+0.204
 participating+0.176
 gaining+0.167
 finishing+0.166
 exerc+0.164
Neg logit contributions
flat-0.213
alias-0.212
aque-0.207
interstitial-0.198
yg-0.198
Token Hillary
Token ID5073
Feature activation+8.69e-01

Pos logit contributions
ngth+0.054
 participating+0.046
 gaining+0.043
 finishing+0.043
 exerc+0.043
Neg logit contributions
flat-0.061
alias-0.061
aque-0.060
interstitial-0.057
yg-0.057
Token Clinton
Token ID2605
Feature activation+1.22e-01

Pos logit contributions
alias+0.610
flat+0.595
aque+0.584
alia+0.563
yg+0.555
Neg logit contributions
ngth-0.654
 participating-0.534
 exerc-0.522
lished-0.521
 gaining-0.506
Token Hillary
Token ID5073
Feature activation+8.84e-01

Pos logit contributions
alias+0.092
flat+0.090
aque+0.089
alia+0.085
interstitial+0.085
Neg logit contributions
ngth-0.084
 participating-0.067
lished-0.065
 exerc-0.064
 finishing-0.063
Token Diane
Token ID26542
Feature activation+3.19e+00

Pos logit contributions
alias+0.469
flat+0.465
aque+0.447
interstitial+0.423
yg+0.421
Neg logit contributions
ngth-0.801
 participating-0.698
 finishing-0.670
 gaining-0.668
 exerc-0.667
Token Rodham
Token ID41848
Feature activation+1.33e+00

Pos logit contributions
alias+2.173
flat+2.166
aque+2.107
interstitial+2.049
alia+2.036
Neg logit contributions
ngth-2.391
 participating-1.989
lished-1.919
 exerc-1.902
 finishing-1.887
Token Clinton
Token ID2605
Feature activation+1.27e-01

Pos logit contributions
alias+0.882
flat+0.859
aque+0.846
alia+0.840
interstitial+0.810
Neg logit contributions
ngth-1.044
 participating-0.867
lished-0.857
 finishing-0.839
 exerc-0.832
TokenSanders
Token ID26747
Feature activation-1.30e-01

Pos logit contributions
flat+0.074
alias+0.074
aque+0.072
interstitial+0.068
yg+0.068
Neg logit contributions
ngth-0.106
 participating-0.094
 gaining-0.090
 finishing-0.089
 exerc-0.089
Token:
Token ID25
Feature activation+6.49e-01

Pos logit contributions
ngth+0.103
 participating+0.091
 gaining+0.087
 finishing+0.086
 exerc+0.085
Neg logit contributions
flat-0.081
alias-0.081
aque-0.079
interstitial-0.075
yg-0.075
Token '
Token ID705
Feature activation-5.48e-01

Pos logit contributions
flat+0.538
alias+0.533
aque+0.524
interstitial+0.505
yg+0.503
Neg logit contributions
ngth-0.388
 participating-0.328
 gaining-0.305
 finishing-0.304
 fulfilled-0.296
Token 1994
Token ID9162
Feature activation+7.02e-01

Pos logit contributions
flat+0.163
alias+0.161
aque+0.156
yg+0.150
interstitial+0.150
Neg logit contributions
ngth-0.160
 participating-0.131
lished-0.124
 exerc-0.124
 gaining-0.123
Token and
Token ID290
Feature activation+4.48e-02

Pos logit contributions
flat+0.516
alias+0.514
aque+0.500
yg+0.479
interstitial+0.477
Neg logit contributions
ngth-0.486
 participating-0.412
 gaining-0.387
 finishing-0.386
 exerc-0.384
Token served
Token ID4983
Feature activation+9.44e-01

Pos logit contributions
flat+0.034
alias+0.034
aque+0.034
yg+0.032
interstitial+0.032
Neg logit contributions
ngth-0.030
 participating-0.025
 gaining-0.023
 finishing-0.023
 exerc-0.023
Token on
Token ID319
Feature activation+3.93e-01

Pos logit contributions
flat+0.696
alias+0.692
aque+0.675
interstitial+0.646
yg+0.646
Neg logit contributions
ngth-0.651
 participating-0.550
 gaining-0.518
 finishing-0.517
 exerc-0.513
Token its
Token ID663
Feature activation+7.09e-01

Pos logit contributions
flat+0.283
alias+0.281
aque+0.275
interstitial+0.263
yg+0.263
Neg logit contributions
ngth-0.276
 participating-0.238
 gaining-0.224
 finishing-0.223
 exerc-0.221
Token board
Token ID3096
Feature activation+3.14e+00

Pos logit contributions
flat+0.454
alias+0.449
aque+0.438
yg+0.416
interstitial+0.416
Neg logit contributions
ngth-0.561
 participating-0.481
 gaining-0.459
 finishing-0.456
 exerc-0.455
Token until
Token ID1566
Feature activation+2.55e+00

Pos logit contributions
flat+2.467
alias+2.422
aque+2.351
ine+2.273
amic+2.266
Neg logit contributions
ngth-2.021
 participating-1.607
lished-1.545
 exerc-1.539
 gaining-1.525
Token 2002
Token ID6244
Feature activation+4.21e-02

Pos logit contributions
alias+1.856
flat+1.848
aque+1.762
alia+1.704
yg+1.698
Neg logit contributions
ngth-1.828
 participating-1.423
lished-1.404
 exerc-1.357
 gaining-1.334
Token."
Token ID526
Feature activation+5.50e-01

Pos logit contributions
flat+0.032
alias+0.032
aque+0.031
yg+0.030
interstitial+0.030
Neg logit contributions
ngth-0.028
 participating-0.024
 gaining-0.023
 finishing-0.022
 exerc-0.022
Token 1998
Token ID7795
Feature activation+3.45e-01

Pos logit contributions
alias+0.358
flat+0.357
aque+0.344
yg+0.328
interstitial+0.326
Neg logit contributions
ngth-0.429
 participating-0.369
 gaining-0.352
 finishing-0.349
 exerc-0.349
Token annual
Token ID5079
Feature activation+9.50e-01

Pos logit contributions
flat+0.238
alias+0.237
aque+0.230
yg+0.220
interstitial+0.220
Neg logit contributions
ngth-0.254
 participating-0.219
 gaining-0.208
 finishing-0.207
 exerc-0.206
Token MORE
Token ID12011
Feature activation-3.35e-01

Pos logit contributions
alias+0.289
flat+0.261
alia+0.258
aque+0.257
yg+0.254
Neg logit contributions
ngth-0.614
lished-0.521
 participating-0.513
 exerc-0.507
 fulfilled-0.498
Token and
Token ID290
Feature activation+1.43e-01

Pos logit contributions
ngth+0.234
 participating+0.199
 gaining+0.188
 finishing+0.187
 exerc+0.186
Neg logit contributions
flat-0.243
alias-0.242
aque-0.236
interstitial-0.226
yg-0.226
Token Hillary
Token ID5073
Feature activation+1.01e+00

Pos logit contributions
flat+0.105
alias+0.105
aque+0.102
interstitial+0.098
yg+0.098
Neg logit contributions
ngth-0.099
 participating-0.083
 gaining-0.078
 finishing-0.078
 exerc-0.078
Token Clinton
Token ID2605
Feature activation+1.63e-01

Pos logit contributions
flat+0.758
alias+0.758
aque+0.738
interstitial+0.707
yg+0.706
Neg logit contributions
ngth-0.687
 participating-0.582
 gaining-0.546
 finishing-0.545
 exerc-0.543
Token Hillary
Token ID5073
Feature activation+1.02e+00

Pos logit contributions
alias+0.128
flat+0.127
aque+0.124
interstitial+0.119
alia+0.119
Neg logit contributions
ngth-0.105
 participating-0.086
 finishing-0.080
 exerc-0.080
 gaining-0.080
Token Diane
Token ID26542
Feature activation+3.13e+00

Pos logit contributions
alias+0.548
flat+0.547
aque+0.526
interstitial+0.497
yg+0.496
Neg logit contributions
ngth-0.903
 participating-0.792
 gaining-0.758
 finishing-0.758
 exerc-0.754
Token Rodham
Token ID41848
Feature activation+1.19e+00

Pos logit contributions
alias+2.092
flat+2.076
aque+2.028
interstitial+1.977
alia+1.966
Neg logit contributions
ngth-2.395
 participating-1.977
lished-1.942
 exerc-1.902
 finishing-1.881
Token Clinton
Token ID2605
Feature activation+4.65e-02

Pos logit contributions
alias+0.786
flat+0.761
aque+0.754
alia+0.748
interstitial+0.720
Neg logit contributions
ngth-0.945
 participating-0.777
lished-0.776
 finishing-0.756
 exerc-0.750
TokenSanders
Token ID26747
Feature activation-1.62e-01

Pos logit contributions
flat+0.027
alias+0.027
aque+0.026
interstitial+0.024
yg+0.024
Neg logit contributions
ngth-0.040
 participating-0.035
 gaining-0.033
 finishing-0.033
 exerc-0.033
Token:
Token ID25
Feature activation+6.08e-01

Pos logit contributions
ngth+0.130
 participating+0.114
 gaining+0.109
 finishing+0.108
 exerc+0.107
Neg logit contributions
flat-0.100
alias-0.099
aque-0.097
interstitial-0.092
yg-0.092
Token '
Token ID705
Feature activation-5.89e-01

Pos logit contributions
flat+0.498
alias+0.495
aque+0.485
interstitial+0.467
yg+0.466
Neg logit contributions
ngth-0.369
 participating-0.311
 gaining-0.290
 finishing-0.289
 exerc-0.283
Token in
Token ID287
Feature activation-3.96e-01

Pos logit contributions
ngth+0.317
 participating+0.258
 gaining+0.242
 exerc+0.239
 finishing+0.239
Neg logit contributions
flat-0.395
alias-0.391
aque-0.384
yg-0.369
-0.365
Token combat
Token ID5249
Feature activation+5.85e-01

Pos logit contributions
ngth+0.268
 participating+0.229
 gaining+0.215
 finishing+0.214
 exerc+0.212
Neg logit contributions
flat-0.296
alias-0.295
aque-0.288
interstitial-0.276
yg-0.276
Token boots
Token ID14412
Feature activation+2.83e-01

Pos logit contributions
flat+0.443
alias+0.439
aque+0.429
+0.410
yg+0.409
Neg logit contributions
ngth-0.394
 participating-0.329
 gaining-0.308
 exerc-0.307
 finishing-0.306
Token/
Token ID14
Feature activation+4.42e-01

Pos logit contributions
flat+0.212
alias+0.210
aque+0.206
+0.197
yg+0.197
Neg logit contributions
ngth-0.192
 participating-0.161
 gaining-0.151
 exerc-0.150
 finishing-0.150
Tokenhe
Token ID258
Feature activation-2.99e-01

Pos logit contributions
flat+0.273
alias+0.272
aque+0.264
interstitial+0.252
yg+0.251
Neg logit contributions
ngth-0.354
 participating-0.312
 gaining-0.296
 finishing-0.295
 exerc-0.293
Tokenels
Token ID1424
Feature activation+3.09e+00

Pos logit contributions
ngth+0.308
 participating+0.279
 gaining+0.268
 finishing+0.268
 exerc+0.266
Neg logit contributions
flat-0.118
alias-0.116
aque-0.111
interstitial-0.103
yg-0.102
Token all
Token ID477
Feature activation-1.09e-01

Pos logit contributions
flat+2.215
alias+2.192
aque+2.129
+2.051
yg+2.036
Neg logit contributions
ngth-2.166
 participating-1.795
lished-1.725
 gaining-1.684
 fulfilled-1.681
Token day
Token ID1110
Feature activation-4.44e-01

Pos logit contributions
ngth+0.061
 participating+0.049
 gaining+0.046
 finishing+0.045
 exerc+0.045
Neg logit contributions
flat-0.094
alias-0.093
aque-0.092
yg-0.089
-0.088
Token.
Token ID13
Feature activation-5.20e-01

Pos logit contributions
ngth+0.299
 participating+0.254
 gaining+0.239
 finishing+0.238
 exerc+0.236
Neg logit contributions
flat-0.334
alias-0.332
aque-0.324
yg-0.311
interstitial-0.311
Token If
Token ID1002
Feature activation-8.97e-01

Pos logit contributions
ngth+0.383
 participating+0.330
 gaining+0.312
 finishing+0.311
 exerc+0.309
Neg logit contributions
flat-0.359
alias-0.357
aque-0.347
yg-0.332
interstitial-0.330
Token anything
Token ID1997
Feature activation-1.35e+00

Pos logit contributions
ngth+0.595
 participating+0.511
 gaining+0.479
 finishing+0.477
 exerc+0.472
Neg logit contributions
flat-0.681
alias-0.678
aque-0.663
interstitial-0.638
yg-0.635
Token p
Token ID279
Feature activation-3.43e-01

Pos logit contributions
flat+0.015
alias+0.015
aque+0.015
interstitial+0.014
yg+0.014
Neg logit contributions
ngth-0.013
 participating-0.011
 gaining-0.011
 finishing-0.011
 exerc-0.010
Token.
Token ID13
Feature activation+4.05e-01

Pos logit contributions
ngth+0.251
 participating+0.208
 exerc+0.200
lished+0.198
 gaining+0.198
Neg logit contributions
alias-0.248
flat-0.248
aque-0.240
yg-0.232
alia-0.229
Token 66
Token ID7930
Feature activation-2.98e-01

Pos logit contributions
flat+0.276
alias+0.275
aque+0.266
yg+0.255
interstitial+0.254
Neg logit contributions
ngth-0.304
 participating-0.260
 gaining-0.248
 finishing-0.246
 exerc-0.245
Token:
Token ID25
Feature activation+2.33e-01

Pos logit contributions
ngth+0.164
 participating+0.137
 finishing+0.126
 gaining+0.126
 exerc+0.124
Neg logit contributions
flat-0.258
alias-0.258
aque-0.252
interstitial-0.245
yg-0.243
Token "
Token ID366
Feature activation+5.27e-01

Pos logit contributions
flat+0.201
alias+0.200
aque+0.196
interstitial+0.189
yg+0.189
Neg logit contributions
ngth-0.130
 participating-0.108
 gaining-0.100
 finishing-0.100
 exerc-0.098
TokenBoard
Token ID29828
Feature activation+3.07e+00

Pos logit contributions
flat+0.327
alias+0.327
aque+0.316
interstitial+0.300
yg+0.299
Neg logit contributions
ngth-0.427
 participating-0.371
 gaining-0.354
 finishing-0.353
 exerc-0.352
Token of
Token ID286
Feature activation+1.75e-01

Pos logit contributions
flat+2.652
alias+2.637
aque+2.570
yg+2.493
amic+2.482
Neg logit contributions
ngth-1.699
 participating-1.349
 gaining-1.274
 exerc-1.257
 finishing-1.254
Token Directors
Token ID29200
Feature activation+6.24e-01

Pos logit contributions
flat+0.115
alias+0.113
aque+0.110
+0.106
interstitial+0.106
Neg logit contributions
ngth-0.134
 participating-0.119
 finishing-0.112
 gaining-0.112
 exerc-0.111
Token:
Token ID25
Feature activation-1.61e-01

Pos logit contributions
flat+0.534
alias+0.531
aque+0.521
interstitial+0.502
yg+0.502
Neg logit contributions
ngth-0.354
 participating-0.294
 gaining-0.272
 finishing-0.271
 exerc-0.268
Token Chairman
Token ID12787
Feature activation+7.53e-01

Pos logit contributions
ngth+0.118
 participating+0.102
 gaining+0.097
 finishing+0.097
 exerc+0.096
Neg logit contributions
flat-0.111
alias-0.110
aque-0.107
interstitial-0.103
yg-0.103
Token:
Token ID25
Feature activation-3.05e-01

Pos logit contributions
alias+0.522
flat+0.522
aque+0.505
yg+0.483
interstitial+0.481
Neg logit contributions
ngth-0.551
 participating-0.473
 gaining-0.450
 finishing-0.447
 exerc-0.445
Token their
Token ID511
Feature activation-4.72e-01

Pos logit contributions
ngth+0.264
 participating+0.221
 gaining+0.207
 finishing+0.206
 exerc+0.204
Neg logit contributions
flat-0.341
alias-0.339
aque-0.331
yg-0.319
interstitial-0.319
Token excess
Token ID6992
Feature activation+1.44e+00

Pos logit contributions
ngth+0.302
 participating+0.253
 gaining+0.237
 finishing+0.235
 exerc+0.234
Neg logit contributions
flat-0.372
alias-0.368
aque-0.360
yg-0.346
interstitial-0.345
Token wages
Token ID9400
Feature activation+1.79e-01

Pos logit contributions
flat+1.121
alias+1.099
aque+1.077
yg+1.047
alia+1.033
Neg logit contributions
ngth-0.924
 participating-0.788
 gaining-0.733
 exerc-0.727
 finishing-0.725
Token on
Token ID319
Feature activation-1.91e-01

Pos logit contributions
flat+0.138
alias+0.135
aque+0.132
yg+0.128
+0.126
Neg logit contributions
ngth-0.118
 participating-0.100
 gaining-0.093
 finishing-0.092
 exerc-0.092
Token their
Token ID511
Feature activation-1.16e-01

Pos logit contributions
ngth+0.131
 participating+0.113
 gaining+0.106
 finishing+0.106
 exerc+0.105
Neg logit contributions
flat-0.141
alias-0.141
aque-0.137
interstitial-0.132
yg-0.132
Token board
Token ID3096
Feature activation+3.02e+00

Pos logit contributions
ngth+0.083
 participating+0.071
 gaining+0.067
 finishing+0.067
 exerc+0.066
Neg logit contributions
flat-0.083
alias-0.082
aque-0.080
yg-0.077
interstitial-0.077
Token."
Token ID526
Feature activation-5.76e-01

Pos logit contributions
flat+2.497
alias+2.470
aque+2.390
yg+2.327
alia+2.311
Neg logit contributions
ngth-1.808
 participating-1.408
lished-1.378
 exerc-1.321
 fulfilled-1.315
Token
Token ID198
Feature activation+3.66e-01

Pos logit contributions
ngth+0.444
 participating+0.390
 finishing+0.369
 gaining+0.368
 exerc+0.362
Neg logit contributions
flat-0.375
alias-0.373
aque-0.365
interstitial-0.350
uckland-0.344
Token
Token ID198
Feature activation+2.40e-01

Pos logit contributions
flat+0.215
alias+0.214
aque+0.203
yg+0.195
ine+0.192
Neg logit contributions
ngth-0.322
 participating-0.272
lished-0.265
 exerc-0.265
 gaining-0.262
TokenBoth
Token ID10265
Feature activation+5.90e-01

Pos logit contributions
flat+0.154
alias+0.153
aque+0.149
interstitial+0.142
yg+0.142
Neg logit contributions
ngth-0.187
 participating-0.164
 gaining-0.155
 finishing-0.155
 exerc-0.153
Token Rodgers
Token ID19947
Feature activation+1.10e+00

Pos logit contributions
flat+0.402
alias+0.400
aque+0.389
yg+0.372
interstitial+0.371
Neg logit contributions
ngth-0.438
 participating-0.378
 gaining-0.359
 finishing-0.357
 exerc-0.355
Token Trump
Token ID1301
Feature activation+7.23e-01

Pos logit contributions
flat+0.305
alias+0.304
aque+0.287
yg+0.279
+0.272
Neg logit contributions
ngth-0.532
 participating-0.450
lished-0.445
 exerc-0.442
 gaining-0.434
Token passes
Token ID8318
Feature activation+1.48e+00

Pos logit contributions
alias+0.550
flat+0.550
aque+0.531
yg+0.514
interstitial+0.510
Neg logit contributions
ngth-0.487
 participating-0.397
lished-0.382
 exerc-0.378
 gaining-0.373
Token on
Token ID319
Feature activation+2.34e-01

Pos logit contributions
flat+1.193
alias+1.188
aque+1.162
interstitial+1.118
yg+1.117
Neg logit contributions
ngth-0.911
 participating-0.765
 gaining-0.714
 finishing-0.711
 exerc-0.704
Token '
Token ID705
Feature activation-5.47e-01

Pos logit contributions
flat+0.196
alias+0.194
aque+0.190
yg+0.183
interstitial+0.183
Neg logit contributions
ngth-0.136
 participating-0.116
 gaining-0.108
 finishing-0.108
 exerc-0.105
Tokenin
Token ID259
Feature activation-1.75e-01

Pos logit contributions
ngth+0.433
 participating+0.384
 gaining+0.365
 finishing+0.364
 fulfilled+0.359
Neg logit contributions
flat-0.338
alias-0.337
aque-0.329
yg-0.314
interstitial-0.312
Tokenappropriate
Token ID13335
Feature activation+3.01e+00

Pos logit contributions
ngth+0.130
 participating+0.116
 gaining+0.110
 finishing+0.109
 exerc+0.107
Neg logit contributions
alias-0.117
flat-0.116
aque-0.114
interstitial-0.108
yg-0.108
Token'
Token ID6
Feature activation+1.34e+00

Pos logit contributions
flat+1.924
alias+1.913
aque+1.861
interstitial+1.783
yg+1.776
Neg logit contributions
ngth-2.371
 participating-2.030
 gaining-1.935
 finishing-1.922
 exerc-1.917
Token debate
Token ID4384
Feature activation+1.46e+00

Pos logit contributions
flat+0.811
alias+0.807
aque+0.783
interstitial+0.744
yg+0.743
Neg logit contributions
ngth-1.094
 participating-0.960
 gaining-0.914
 finishing-0.911
 exerc-0.905
Token with
Token ID351
Feature activation+3.97e-01

Pos logit contributions
flat+1.147
alias+1.143
aque+1.114
interstitial+1.074
yg+1.072
Neg logit contributions
ngth-0.936
 participating-0.770
 gaining-0.723
 finishing-0.719
lished-0.719
Token Bernie
Token ID10477
Feature activation+3.78e-01

Pos logit contributions
flat+0.316
alias+0.315
aque+0.307
interstitial+0.296
yg+0.296
Neg logit contributions
ngth-0.250
 participating-0.210
 gaining-0.196
 finishing-0.195
 exerc-0.194
Token Sanders
Token ID5831
Feature activation+1.44e-01

Pos logit contributions
flat+0.284
alias+0.283
aque+0.275
yg+0.264
interstitial+0.264
Neg logit contributions
ngth-0.256
 participating-0.217
 gaining-0.204
 finishing-0.204
 exerc-0.203
Token:
Token ID25
Feature activation-8.79e-02

Pos logit contributions
ngth+0.517
 participating+0.432
 gaining+0.403
 finishing+0.401
 exerc+0.397
Neg logit contributions
flat-0.695
alias-0.692
aque-0.677
yg-0.651
interstitial-0.651
Token
Token ID198
Feature activation-7.40e-01

Pos logit contributions
ngth+0.068
 participating+0.059
 gaining+0.056
 finishing+0.056
 exerc+0.055
Neg logit contributions
flat-0.057
alias-0.057
aque-0.056
yg-0.053
interstitial-0.053
Token
Token ID198
Feature activation-8.25e-01

Pos logit contributions
ngth+0.637
 participating+0.540
 exerc+0.522
lished+0.520
 gaining+0.519
Neg logit contributions
flat-0.445
alias-0.443
aque-0.423
yg-0.405
-0.400
TokenD
Token ID35
Feature activation-3.10e-04

Pos logit contributions
ngth+0.625
 participating+0.548
 gaining+0.519
 finishing+0.517
 exerc+0.511
Neg logit contributions
flat-0.538
alias-0.534
aque-0.525
yg-0.499
interstitial-0.498
Tokenani
Token ID3216
Feature activation-1.45e-01

Pos logit contributions
ngth+0.000
 participating+0.000
 gaining+0.000
 finishing+0.000
 exerc+0.000
Neg logit contributions
flat-0.000
alias-0.000
aque-0.000
yg-0.000
interstitial-0.000
Tokenels
Token ID1424
Feature activation+2.97e+00

Pos logit contributions
ngth+0.164
 participating+0.149
 gaining+0.144
 finishing+0.144
 exerc+0.143
Neg logit contributions
flat-0.044
alias-0.043
aque-0.041
yg-0.036
interstitial-0.036
Token
Token ID447
Feature activation+1.42e-01

Pos logit contributions
flat+2.195
alias+2.185
aque+2.129
alia+2.044
+2.035
Neg logit contributions
ngth-2.043
 participating-1.671
lished-1.604
 gaining-1.588
 finishing-1.580
Token
Token ID247
Feature activation+2.16e-01

Pos logit contributions
flat+0.056
alias+0.055
aque+0.053
yg+0.048
interstitial+0.048
Neg logit contributions
ngth-0.146
 participating-0.133
 gaining-0.128
 finishing-0.127
 exerc-0.126
Token mug
Token ID25152
Feature activation+1.41e+00

Pos logit contributions
flat+0.159
alias+0.159
aque+0.155
interstitial+0.148
yg+0.148
Neg logit contributions
ngth-0.148
 participating-0.126
 gaining-0.119
 finishing-0.118
 exerc-0.117
Tokenshot
Token ID9442
Feature activation+6.57e-01

Pos logit contributions
flat+0.922
alias+0.916
aque+0.894
yg+0.851
interstitial+0.845
Neg logit contributions
ngth-1.094
 participating-0.945
 gaining-0.898
 finishing-0.898
 exerc-0.889
Token was
Token ID373
Feature activation+2.22e-01

Pos logit contributions
flat+0.479
alias+0.478
aque+0.466
interstitial+0.446
yg+0.445
Neg logit contributions
ngth-0.456
 participating-0.388
 gaining-0.365
 finishing-0.365
 exerc-0.363
Token his
Token ID465
Feature activation-2.55e-01

Pos logit contributions
flat+0.061
alias+0.061
aque+0.059
interstitial+0.057
yg+0.057
Neg logit contributions
ngth-0.064
 participating-0.056
 gaining-0.053
 finishing-0.053
 exerc-0.052
Token
Token ID564
Feature activation+1.77e-02

Pos logit contributions
ngth+0.184
 participating+0.159
 gaining+0.150
 finishing+0.149
 exerc+0.148
Neg logit contributions
flat-0.180
alias-0.179
aque-0.174
interstitial-0.167
yg-0.167
Token
Token ID250
Feature activation-3.86e-01

Pos logit contributions
flat+0.007
alias+0.007
aque+0.006
interstitial+0.006
yg+0.006
Neg logit contributions
ngth-0.019
 participating-0.017
 gaining-0.016
 exerc-0.016
 finishing-0.016
Tokenun
Token ID403
Feature activation+5.87e-01

Pos logit contributions
ngth+0.318
 participating+0.280
 gaining+0.266
 finishing+0.265
 exerc+0.262
Neg logit contributions
flat-0.228
alias-0.227
aque-0.222
yg-0.210
interstitial-0.210
Tokenfor
Token ID1640
Feature activation+7.71e-02

Pos logit contributions
flat+0.399
alias+0.395
aque+0.384
interstitial+0.364
amic+0.362
Neg logit contributions
ngth-0.440
 participating-0.382
 gaining-0.365
 finishing-0.363
 exerc-0.359
Tokengive
Token ID26535
Feature activation+2.92e+00

Pos logit contributions
flat+0.046
alias+0.046
aque+0.045
yg+0.042
interstitial+0.042
Neg logit contributions
ngth-0.063
 participating-0.056
 gaining-0.053
 finishing-0.053
 exerc-0.053
Tokenable
Token ID540
Feature activation-7.16e-01

Pos logit contributions
flat+1.877
alias+1.857
aque+1.819
yg+1.726
interstitial+1.718
Neg logit contributions
ngth-2.282
 participating-1.971
 gaining-1.878
 finishing-1.864
 exerc-1.848
Token language
Token ID3303
Feature activation-8.32e-01

Pos logit contributions
ngth+0.470
 participating+0.401
 gaining+0.376
 finishing+0.375
 exerc+0.370
Neg logit contributions
flat-0.550
alias-0.547
aque-0.535
interstitial-0.514
yg-0.513
Token.
Token ID13
Feature activation+4.37e-03

Pos logit contributions
ngth+0.524
 participating+0.444
 gaining+0.415
 finishing+0.413
 exerc+0.409
Neg logit contributions
flat-0.661
alias-0.657
aque-0.643
interstitial-0.619
yg-0.618
Token
Token ID447
Feature activation+9.04e-02

Pos logit contributions
flat+0.003
alias+0.003
aque+0.003
yg+0.003
interstitial+0.003
Neg logit contributions
ngth-0.003
 participating-0.003
 gaining-0.003
 finishing-0.003
 exerc-0.003
Token
Token ID251
Feature activation-2.04e-02

Pos logit contributions
flat+0.035
alias+0.034
aque+0.033
interstitial+0.030
yg+0.030
Neg logit contributions
ngth-0.094
 participating-0.086
 gaining-0.082
 finishing-0.082
 exerc-0.082
Token League
Token ID4041
Feature activation-8.71e-01

Pos logit contributions
ngth+0.042
 participating+0.033
lished+0.033
 exerc+0.032
 gaining+0.031
Neg logit contributions
flat-0.055
alias-0.055
aque-0.053
yg-0.051
amic-0.051
Token makes
Token ID1838
Feature activation+3.77e-02

Pos logit contributions
ngth+0.581
 participating+0.503
 gaining+0.473
 finishing+0.472
 fulfilled+0.464
Neg logit contributions
flat-0.654
alias-0.651
aque-0.637
interstitial-0.611
yg-0.611
Token them
Token ID606
Feature activation+2.39e-01

Pos logit contributions
flat+0.028
alias+0.028
aque+0.028
yg+0.027
alia+0.026
Neg logit contributions
ngth-0.026
 participating-0.021
 gaining-0.020
 finishing-0.020
 exerc-0.020
Token a
Token ID257
Feature activation+3.47e-01

Pos logit contributions
flat+0.178
alias+0.177
aque+0.172
yg+0.165
interstitial+0.165
Neg logit contributions
ngth-0.162
 participating-0.138
 gaining-0.130
 finishing-0.130
 exerc-0.129
Token great
Token ID1049
Feature activation-1.05e+00

Pos logit contributions
flat+0.280
alias+0.276
aque+0.271
yg+0.261
+0.260
Neg logit contributions
ngth-0.218
 participating-0.174
 exerc-0.164
lished-0.164
 gaining-0.162
Token fit
Token ID4197
Feature activation+2.89e+00

Pos logit contributions
ngth+0.670
 participating+0.570
 gaining+0.533
 finishing+0.531
 exerc+0.525
Neg logit contributions
flat-0.825
alias-0.821
aque-0.803
interstitial-0.772
yg-0.771
Token for
Token ID329
Feature activation+3.93e-01

Pos logit contributions
alias+2.128
flat+2.114
aque+2.066
yg+2.000
alia+1.986
Neg logit contributions
ngth-2.000
 participating-1.621
lished-1.545
 exerc-1.534
 gaining-1.531
Token 12
Token ID1105
Feature activation+3.39e-01

Pos logit contributions
flat+0.289
alias+0.288
aque+0.280
yg+0.269
interstitial+0.268
Neg logit contributions
ngth-0.273
 participating-0.228
 gaining-0.216
 finishing-0.215
 exerc-0.215
TokenB
Token ID33
Feature activation+1.57e-01

Pos logit contributions
flat+0.187
alias+0.185
aque+0.178
yg+0.169
+0.168
Neg logit contributions
ngth-0.301
 participating-0.262
 exerc-0.252
lished-0.251
 gaining-0.250
TokenET
Token ID2767
Feature activation-2.65e-01

Pos logit contributions
flat+0.093
alias+0.093
aque+0.089
FORE+0.086
yg+0.085
Neg logit contributions
ngth-0.135
 participating-0.114
 exerc-0.111
 gaining-0.110
lished-0.110
Token who
Token ID508
Feature activation+1.45e-01

Pos logit contributions
ngth+0.178
 participating+0.150
 gaining+0.141
 finishing+0.140
 exerc+0.140
Neg logit contributions
flat-0.199
alias-0.199
aque-0.194
yg-0.186
interstitial-0.185
Token is
Token ID318
Feature activation+1.70e-01

Pos logit contributions
flat+0.336
alias+0.332
aque+0.326
yg+0.310
interstitial+0.309
Neg logit contributions
ngth-0.318
 participating-0.271
 gaining-0.255
 finishing-0.255
 exerc-0.254
Token the
Token ID262
Feature activation+2.44e-02

Pos logit contributions
flat+0.134
alias+0.132
aque+0.130
yg+0.124
interstitial+0.124
Neg logit contributions
ngth-0.109
 participating-0.091
 gaining-0.084
 exerc-0.084
 finishing-0.084
Token beautiful
Token ID4950
Feature activation-3.30e-01

Pos logit contributions
flat+0.018
alias+0.017
aque+0.017
yg+0.016
interstitial+0.016
Neg logit contributions
ngth-0.017
 participating-0.015
 gaining-0.014
 finishing-0.014
 exerc-0.014
Token Cup
Token ID5454
Feature activation-4.63e-01

Pos logit contributions
ngth+0.239
 participating+0.206
 gaining+0.195
 finishing+0.194
 exerc+0.193
Neg logit contributions
flat-0.231
alias-0.230
aque-0.224
interstitial-0.214
yg-0.214
Tokenani
Token ID3216
Feature activation+5.93e-01

Pos logit contributions
ngth+0.402
 participating+0.356
 gaining+0.340
 finishing+0.339
 exerc+0.337
Neg logit contributions
flat-0.258
alias-0.256
aque-0.248
interstitial-0.235
yg-0.234
Tokenopsis
Token ID24608
Feature activation+2.87e+00

Pos logit contributions
flat+0.394
alias+0.391
aque+0.382
yg+0.364
interstitial+0.361
Neg logit contributions
ngth-0.452
 participating-0.393
 gaining-0.372
 finishing-0.371
 exerc-0.369
Token an
Token ID281
Feature activation+6.78e-02

Pos logit contributions
flat+2.075
alias+2.049
aque+2.011
yg+1.928
interstitial+1.918
Neg logit contributions
ngth-2.023
 participating-1.708
 gaining-1.602
lished-1.600
 finishing-1.593
Tokenac
Token ID330
Feature activation-1.24e-01

Pos logit contributions
flat+0.037
alias+0.037
aque+0.036
interstitial+0.034
yg+0.034
Neg logit contributions
ngth-0.059
 participating-0.053
 gaining-0.050
 finishing-0.050
 exerc-0.050
Tokenard
Token ID446
Feature activation+4.46e-01

Pos logit contributions
ngth+0.106
 participating+0.094
 gaining+0.089
 finishing+0.089
 exerc+0.089
Neg logit contributions
flat-0.070
alias-0.070
aque-0.068
interstitial-0.064
yg-0.064
Tokenio
Token ID952
Feature activation+4.61e-01

Pos logit contributions
flat+0.260
alias+0.259
aque+0.251
interstitial+0.238
yg+0.237
Neg logit contributions
ngth-0.375
 participating-0.331
 gaining-0.316
 finishing-0.315
 exerc-0.313
Tokenides
Token ID1460
Feature activation+1.60e+00

Pos logit contributions
flat+0.208
alias+0.206
aque+0.198
interstitial+0.184
yg+0.184
Neg logit contributions
ngth-0.449
 participating-0.404
 gaining-0.387
 finishing-0.387
 exerc-0.384
Token can
Token ID460
Feature activation-1.40e-01

Pos logit contributions
alias+0.044
flat+0.044
aque+0.043
yg+0.042
alia+0.041
Neg logit contributions
ngth-0.031
 participating-0.025
 gaining-0.023
 finishing-0.023
 exerc-0.023
Token have
Token ID423
Feature activation+2.21e-01

Pos logit contributions
ngth+0.091
 participating+0.077
 gaining+0.072
 finishing+0.072
 exerc+0.071
Neg logit contributions
flat-0.109
alias-0.109
aque-0.106
yg-0.102
interstitial-0.101
Token perfectly
Token ID7138
Feature activation+1.27e-01

Pos logit contributions
flat+0.163
alias+0.162
aque+0.157
yg+0.151
interstitial+0.150
Neg logit contributions
ngth-0.152
 participating-0.130
 gaining-0.122
 finishing-0.122
 exerc-0.121
Token working
Token ID1762
Feature activation+1.22e-01

Pos logit contributions
flat+0.095
alias+0.094
aque+0.092
yg+0.088
interstitial+0.087
Neg logit contributions
ngth-0.087
 participating-0.073
 gaining-0.069
 finishing-0.069
 exerc-0.068
Token normal
Token ID3487
Feature activation+4.72e-01

Pos logit contributions
flat+0.092
alias+0.092
aque+0.089
yg+0.086
interstitial+0.085
Neg logit contributions
ngth-0.083
 participating-0.070
 gaining-0.066
 finishing-0.065
 exerc-0.065
Token maps
Token ID8739
Feature activation+2.86e+00

Pos logit contributions
flat+0.337
alias+0.335
aque+0.326
yg+0.312
interstitial+0.309
Neg logit contributions
ngth-0.338
 participating-0.290
 gaining-0.273
 finishing-0.271
 exerc-0.269
Token and
Token ID290
Feature activation+3.37e-01

Pos logit contributions
flat+2.138
alias+2.118
aque+2.044
yg+1.978
ine+1.959
Neg logit contributions
ngth-1.968
 participating-1.582
lished-1.535
 exerc-1.497
 finishing-1.485
Token occ
Token ID1609
Feature activation+1.62e+00

Pos logit contributions
flat+0.252
alias+0.250
aque+0.243
yg+0.234
interstitial+0.231
Neg logit contributions
ngth-0.231
 participating-0.194
 gaining-0.183
 finishing-0.181
 exerc-0.181
Tokenl
Token ID75
Feature activation-3.94e-01

Pos logit contributions
flat+1.207
alias+1.205
aque+1.178
yg+1.122
amic+1.119
Neg logit contributions
ngth-1.111
 participating-0.952
 gaining-0.901
 finishing-0.895
 exerc-0.889
Tokenusion
Token ID4241
Feature activation+8.71e-01

Pos logit contributions
ngth+0.282
 participating+0.248
 finishing+0.232
 gaining+0.232
 exerc+0.229
Neg logit contributions
flat-0.276
alias-0.272
aque-0.263
interstitial-0.257
yg-0.255
Token maps
Token ID8739
Feature activation+2.45e+00

Pos logit contributions
flat+0.641
alias+0.635
aque+0.619
yg+0.595
alia+0.586
Neg logit contributions
ngth-0.610
 participating-0.508
lished-0.478
 gaining-0.478
 finishing-0.474
Token p
Token ID279
Feature activation-5.30e-01

Pos logit contributions
flat+0.005
alias+0.004
aque+0.004
yg+0.004
interstitial+0.004
Neg logit contributions
ngth-0.004
 participating-0.003
 gaining-0.003
 finishing-0.003
 exerc-0.003
Token.
Token ID13
Feature activation-5.04e-02

Pos logit contributions
ngth+0.345
 participating+0.295
 finishing+0.275
 gaining+0.275
 fulfilled+0.270
Neg logit contributions
flat-0.409
alias-0.407
aque-0.396
interstitial-0.383
-0.379
Token 21
Token ID2310
Feature activation+6.66e-02

Pos logit contributions
ngth+0.037
 participating+0.032
 gaining+0.030
 finishing+0.030
 exerc+0.030
Neg logit contributions
flat-0.035
alias-0.035
aque-0.034
yg-0.033
interstitial-0.033
Token:
Token ID25
Feature activation+5.24e-01

Pos logit contributions
flat+0.051
alias+0.051
aque+0.050
yg+0.048
interstitial+0.048
Neg logit contributions
ngth-0.044
 participating-0.037
 gaining-0.035
 finishing-0.035
 exerc-0.034
Token "
Token ID366
Feature activation+5.51e-01

Pos logit contributions
flat+0.428
alias+0.426
aque+0.417
interstitial+0.401
yg+0.401
Neg logit contributions
ngth-0.318
 participating-0.265
 gaining-0.248
 finishing-0.246
 exerc-0.244
TokenBoard
Token ID29828
Feature activation+2.86e+00

Pos logit contributions
flat+0.362
alias+0.362
aque+0.350
interstitial+0.333
yg+0.331
Neg logit contributions
ngth-0.427
 participating-0.367
 gaining-0.349
 finishing-0.348
 exerc-0.347
Token of
Token ID286
Feature activation+1.97e-01

Pos logit contributions
flat+2.423
alias+2.397
aque+2.327
yg+2.266
amic+2.256
Neg logit contributions
ngth-1.640
 participating-1.287
 gaining-1.226
 exerc-1.218
 finishing-1.215
Token Directors
Token ID29200
Feature activation+6.48e-01

Pos logit contributions
flat+0.124
alias+0.123
aque+0.120
yg+0.114
interstitial+0.114
Neg logit contributions
ngth-0.156
 participating-0.138
 gaining-0.130
 finishing-0.130
 exerc-0.129
Token:
Token ID25
Feature activation-7.48e-02

Pos logit contributions
flat+0.502
alias+0.500
aque+0.486
interstitial+0.466
yg+0.466
Neg logit contributions
ngth-0.424
 participating-0.350
 gaining-0.331
 finishing-0.329
 exerc-0.328
Token John
Token ID1757
Feature activation-2.94e-02

Pos logit contributions
ngth+0.056
 participating+0.048
 gaining+0.045
 finishing+0.045
 exerc+0.045
Neg logit contributions
flat-0.051
alias-0.051
aque-0.050
interstitial-0.047
yg-0.047
Token T
Token ID309
Feature activation-2.10e-01

Pos logit contributions
ngth+0.022
 participating+0.019
 gaining+0.018
 finishing+0.018
 exerc+0.018
Neg logit contributions
flat-0.020
alias-0.020
aque-0.020
yg-0.019
interstitial-0.019
Token on
Token ID319
Feature activation-1.01e-02

Pos logit contributions
flat+0.442
alias+0.439
aque+0.430
yg+0.413
interstitial+0.412
Neg logit contributions
ngth-0.356
 participating-0.301
 gaining-0.282
 finishing-0.280
 exerc-0.278
Token undergrad
Token ID43485
Feature activation+8.44e-01

Pos logit contributions
ngth+0.007
 participating+0.006
 exerc+0.005
 gaining+0.005
 finishing+0.005
Neg logit contributions
flat-0.008
alias-0.007
aque-0.007
interstitial-0.007
yg-0.007
Tokenuates
Token ID12632
Feature activation+7.97e-02

Pos logit contributions
flat+0.653
alias+0.652
aque+0.630
amic+0.611
alia+0.610
Neg logit contributions
ngth-0.538
 participating-0.453
 exerc-0.420
 gaining-0.419
lished-0.418
Token.
Token ID13
Feature activation+2.49e-01

Pos logit contributions
flat+0.060
alias+0.059
aque+0.058
interstitial+0.055
yg+0.055
Neg logit contributions
ngth-0.054
 participating-0.045
 exerc-0.042
lished-0.042
 gaining-0.042
Token O
Token ID440
Feature activation+2.58e-01

Pos logit contributions
alias+0.164
flat+0.164
aque+0.156
yg+0.150
+0.149
Neg logit contributions
ngth-0.193
 participating-0.166
 gaining-0.158
 finishing-0.157
 exerc-0.157
Tokenurs
Token ID1834
Feature activation+2.86e+00

Pos logit contributions
flat+0.150
alias+0.149
aque+0.144
yg+0.137
interstitial+0.136
Neg logit contributions
ngth-0.218
 participating-0.192
 gaining-0.183
 finishing-0.183
 exerc-0.182
Token is
Token ID318
Feature activation-1.83e-01

Pos logit contributions
flat+2.022
alias+2.015
aque+1.964
yg+1.871
amic+1.870
Neg logit contributions
ngth-2.026
 participating-1.687
lished-1.601
 finishing-1.600
 gaining-1.599
Token a
Token ID257
Feature activation-1.64e-01

Pos logit contributions
ngth+0.125
 participating+0.106
 gaining+0.099
 finishing+0.099
 exerc+0.098
Neg logit contributions
flat-0.136
alias-0.136
aque-0.132
interstitial-0.127
yg-0.127
Token Bachelor
Token ID33399
Feature activation+1.56e+00

Pos logit contributions
ngth+0.110
 participating+0.091
 exerc+0.086
 gaining+0.086
 finishing+0.085
Neg logit contributions
flat-0.124
alias-0.122
aque-0.120
yg-0.115
interstitial-0.115
Token of
Token ID286
Feature activation-2.65e-01

Pos logit contributions
flat+1.295
alias+1.281
aque+1.249
amic+1.208
yg+1.208
Neg logit contributions
ngth-0.912
 participating-0.751
 exerc-0.694
 gaining-0.691
 finishing-0.691
Token Science
Token ID5800
Feature activation+3.05e-01

Pos logit contributions
ngth+0.198
 participating+0.171
 gaining+0.162
 finishing+0.161
 exerc+0.160
Neg logit contributions
flat-0.178
alias-0.178
aque-0.173
interstitial-0.166
yg-0.166
Tokenff
Token ID487
Feature activation+2.13e+00

Pos logit contributions
flat+0.830
alias+0.825
aque+0.799
yg+0.757
interstitial+0.753
Neg logit contributions
ngth-1.152
 participating-1.009
 gaining-0.964
 finishing-0.961
 exerc-0.954
Token3
Token ID18
Feature activation+1.98e+00

Pos logit contributions
flat+1.254
alias+1.248
aque+1.208
yg+1.147
interstitial+1.135
Neg logit contributions
ngth-1.794
 participating-1.570
 gaining-1.503
 finishing-1.497
 exerc-1.486
Tokenc
Token ID66
Feature activation+1.83e+00

Pos logit contributions
flat+1.175
alias+1.167
aque+1.130
yg+1.069
interstitial+1.063
Neg logit contributions
ngth-1.648
 participating-1.443
 gaining-1.379
 finishing-1.374
 exerc-1.366
Token3
Token ID18
Feature activation+1.95e+00

Pos logit contributions
flat+1.089
alias+1.084
aque+1.046
yg+0.995
interstitial+0.991
Neg logit contributions
ngth-1.518
 participating-1.329
 gaining-1.269
 finishing-1.264
 exerc-1.254
Tokenff
Token ID487
Feature activation+2.56e+00

Pos logit contributions
flat+1.165
alias+1.157
aque+1.122
yg+1.061
interstitial+1.054
Neg logit contributions
ngth-1.622
 participating-1.421
 gaining-1.357
 finishing-1.352
 exerc-1.345
Token0
Token ID15
Feature activation+2.85e+00

Pos logit contributions
flat+1.483
alias+1.476
aque+1.428
yg+1.356
interstitial+1.336
Neg logit contributions
ngth-2.178
 participating-1.905
 gaining-1.824
 finishing-1.818
 exerc-1.803
Tokend
Token ID67
Feature activation+2.50e+00

Pos logit contributions
flat+1.704
alias+1.694
aque+1.637
yg+1.559
interstitial+1.539
Neg logit contributions
ngth-2.358
 participating-2.048
 finishing-1.962
 gaining-1.958
 exerc-1.947
Token247
Token ID23753
Feature activation+9.38e-01

Pos logit contributions
flat+1.493
alias+1.485
aque+1.433
yg+1.360
interstitial+1.356
Neg logit contributions
ngth-2.065
 participating-1.808
 gaining-1.727
 finishing-1.720
 exerc-1.705
Token409
Token ID29416
Feature activation+1.23e+00

Pos logit contributions
flat+0.548
alias+0.544
aque+0.528
yg+0.499
interstitial+0.498
Neg logit contributions
ngth-0.790
 participating-0.696
 gaining-0.664
 finishing-0.662
 exerc-0.658
Tokenbd
Token ID17457
Feature activation+1.95e+00

Pos logit contributions
flat+0.720
alias+0.716
aque+0.694
yg+0.656
interstitial+0.655
Neg logit contributions
ngth-1.032
 participating-0.908
 gaining-0.866
 finishing-0.863
 exerc-0.858
Token28
Token ID2078
Feature activation+2.33e-01

Pos logit contributions
flat+1.156
alias+1.151
aque+1.113
yg+1.055
interstitial+1.049
Neg logit contributions
ngth-1.622
 participating-1.422
 gaining-1.359
 finishing-1.353
 exerc-1.343
Token the
Token ID262
Feature activation+3.44e-01

Pos logit contributions
flat+0.263
alias+0.259
aque+0.254
yg+0.243
interstitial+0.242
Neg logit contributions
ngth-0.244
 participating-0.206
 gaining-0.194
 finishing-0.193
 exerc-0.192
Token otherwise
Token ID4306
Feature activation+9.88e-02

Pos logit contributions
flat+0.261
alias+0.259
aque+0.254
yg+0.243
interstitial+0.243
Neg logit contributions
ngth-0.229
 participating-0.195
 gaining-0.183
 finishing-0.182
 exerc-0.181
Token rect
Token ID13621
Feature activation-1.00e-01

Pos logit contributions
flat+0.070
alias+0.069
aque+0.067
yg+0.064
interstitial+0.064
Neg logit contributions
ngth-0.072
 participating-0.061
 gaining-0.058
 finishing-0.058
 exerc-0.057
Tokenil
Token ID346
Feature activation+1.05e+00

Pos logit contributions
ngth+0.080
 participating+0.070
 gaining+0.066
 finishing+0.066
 exerc+0.066
Neg logit contributions
flat-0.063
alias-0.063
aque-0.061
yg-0.058
interstitial-0.058
Tokenin
Token ID259
Feature activation+5.81e-01

Pos logit contributions
flat+0.574
alias+0.569
aque+0.554
yg+0.525
interstitial+0.519
Neg logit contributions
ngth-0.927
 participating-0.818
 gaining-0.781
 finishing-0.778
 exerc-0.775
Tokenear
Token ID451
Feature activation+2.84e+00

Pos logit contributions
flat+0.313
alias+0.311
aque+0.300
interstitial+0.283
yg+0.283
Neg logit contributions
ngth-0.513
 participating-0.457
 gaining-0.437
 finishing-0.436
 exerc-0.433
Token facade
Token ID43562
Feature activation-1.44e-01

Pos logit contributions
flat+2.189
alias+2.131
aque+2.094
yg+2.018
ine+1.994
Neg logit contributions
ngth-1.908
 participating-1.562
lished-1.479
 gaining-1.462
 fulfilled-1.449
Token.
Token ID13
Feature activation+3.81e-01

Pos logit contributions
ngth+0.096
 participating+0.081
 gaining+0.075
 finishing+0.075
 exerc+0.075
Neg logit contributions
flat-0.110
alias-0.109
aque-0.107
yg-0.102
interstitial-0.102
Token The
Token ID383
Feature activation+1.95e-01

Pos logit contributions
flat+0.257
alias+0.255
aque+0.247
yg+0.236
+0.234
Neg logit contributions
ngth-0.288
 participating-0.248
 gaining-0.236
 finishing-0.234
 exerc-0.234
Token designers
Token ID15441
Feature activation+1.98e-01

Pos logit contributions
flat+0.139
alias+0.138
aque+0.134
yg+0.129
interstitial+0.128
Neg logit contributions
ngth-0.139
 participating-0.118
 gaining-0.112
 finishing-0.111
 exerc-0.111
Token explained
Token ID4893
Feature activation-6.92e-01

Pos logit contributions
flat+0.150
alias+0.149
aque+0.146
yg+0.140
amic+0.139
Neg logit contributions
ngth-0.133
 participating-0.110
 gaining-0.104
 finishing-0.103
 exerc-0.103
Token,
Token ID11
Feature activation-2.37e-01

Pos logit contributions
ngth+0.700
 participating+0.603
 gaining+0.570
 finishing+0.568
 exerc+0.566
Neg logit contributions
flat-0.666
alias-0.663
aque-0.645
yg-0.617
interstitial-0.614
Token I
Token ID314
Feature activation-1.92e-02

Pos logit contributions
ngth+0.170
 participating+0.150
 gaining+0.141
 finishing+0.141
 fulfilled+0.139
Neg logit contributions
flat-0.167
alias-0.166
aque-0.162
interstitial-0.155
uckland-0.154
Token think
Token ID892
Feature activation-8.09e-01

Pos logit contributions
ngth+0.014
 participating+0.012
lished+0.011
 exerc+0.011
 gaining+0.011
Neg logit contributions
flat-0.014
alias-0.014
aque-0.013
yg-0.013
amic-0.013
Token
Token ID198
Feature activation-2.92e-01

Pos logit contributions
ngth+0.599
 participating+0.527
 finishing+0.497
 gaining+0.496
 exerc+0.488
Neg logit contributions
flat-0.548
alias-0.545
aque-0.528
interstitial-0.508
uckland-0.506
Token
Token ID198
Feature activation-4.92e-01

Pos logit contributions
ngth+0.251
 participating+0.213
 exerc+0.206
lished+0.205
 gaining+0.205
Neg logit contributions
flat-0.176
alias-0.175
aque-0.167
yg-0.160
-0.157
TokenReviewer
Token ID35407
Feature activation-4.54e+00

Pos logit contributions
ngth+0.443
 participating+0.395
 finishing+0.378
 gaining+0.376
 exerc+0.372
Neg logit contributions
alias-0.248
flat-0.244
aque-0.241
uckland-0.228
interstitial-0.227
Token:
Token ID25
Feature activation-4.86e-01

Pos logit contributions
ngth+2.410
 participating+2.029
 finishing+1.859
 gaining+1.857
 fulfilled+1.815
Neg logit contributions
flat-3.966
alias-3.953
aque-3.889
interstitial-3.797
uckland-3.764
Token B
Token ID347
Feature activation+2.61e-01

Pos logit contributions
ngth+0.343
 participating+0.298
 gaining+0.281
 finishing+0.280
 exerc+0.277
Neg logit contributions
flat-0.348
alias-0.346
aque-0.338
interstitial-0.326
yg-0.323
Tokeniz
Token ID528
Feature activation-5.40e-01

Pos logit contributions
flat+0.159
alias+0.158
aque+0.154
interstitial+0.146
yg+0.146
Neg logit contributions
ngth-0.212
 participating-0.187
 gaining-0.178
 finishing-0.177
 exerc-0.176
Tokenr
Token ID81
Feature activation-2.21e-01

Pos logit contributions
ngth+0.448
 participating+0.395
 gaining+0.376
 finishing+0.376
 exerc+0.372
Neg logit contributions
flat-0.319
alias-0.318
aque-0.308
interstitial-0.294
yg-0.290
Token Pix
Token ID21642
Feature activation-3.53e-01

Pos logit contributions
ngth+0.178
 participating+0.156
 gaining+0.148
 finishing+0.147
 exerc+0.146
Neg logit contributions
flat-0.137
alias-0.136
aque-0.132
interstitial-0.126
yg-0.126
Tokeniff
Token ID733
Feature activation+7.51e-01

Pos logit contributions
flat+0.352
alias+0.349
aque+0.336
yg+0.324
alia+0.320
Neg logit contributions
ngth-0.627
 participating-0.552
 gaining-0.533
 exerc-0.532
lished-0.530
TokenTra
Token ID15721
Feature activation-1.15e+00

Pos logit contributions
flat+0.437
alias+0.436
aque+0.412
FORE+0.397
+0.394
Neg logit contributions
ngth-0.660
 participating-0.569
lished-0.553
 exerc-0.547
 gaining-0.546
Tokenx
Token ID87
Feature activation-9.20e-01

Pos logit contributions
ngth+1.086
 participating+0.968
 gaining+0.933
 finishing+0.924
 exerc+0.923
Neg logit contributions
flat-0.561
alias-0.553
aque-0.533
yg-0.504
amic-0.491
Token
Token ID198
Feature activation-3.08e-01

Pos logit contributions
ngth+0.696
 participating+0.612
 gaining+0.578
 finishing+0.577
 exerc+0.574
Neg logit contributions
flat-0.611
alias-0.602
aque-0.587
interstitial-0.567
yg-0.564
Token
Token ID198
Feature activation-4.67e-01

Pos logit contributions
ngth+0.267
 participating+0.226
 exerc+0.219
lished+0.218
 gaining+0.218
Neg logit contributions
flat-0.184
alias-0.183
aque-0.174
yg-0.167
-0.164
TokenReviewer
Token ID35407
Feature activation-4.51e+00

Pos logit contributions
ngth+0.433
 participating+0.387
 finishing+0.370
 gaining+0.369
 exerc+0.365
Neg logit contributions
alias-0.226
flat-0.226
aque-0.219
interstitial-0.205
yg-0.204
Token:
Token ID25
Feature activation-5.03e-01

Pos logit contributions
ngth+2.417
 participating+2.028
 finishing+1.860
 gaining+1.860
 fulfilled+1.816
Neg logit contributions
flat-3.924
alias-3.911
aque-3.843
interstitial-3.746
uckland-3.712
Token V
Token ID569
Feature activation+5.18e-01

Pos logit contributions
ngth+0.354
 participating+0.307
 gaining+0.289
 finishing+0.289
 exerc+0.286
Neg logit contributions
flat-0.361
alias-0.359
aque-0.351
interstitial-0.338
yg-0.335
Tokenthe
Token ID1169
Feature activation+4.60e-01

Pos logit contributions
flat+0.307
alias+0.305
aque+0.296
interstitial+0.281
yg+0.281
Neg logit contributions
ngth-0.431
 participating-0.380
 gaining-0.362
 finishing-0.361
 exerc-0.358
TokenHappy
Token ID25082
Feature activation+4.19e-01

Pos logit contributions
flat+0.288
alias+0.287
aque+0.279
interstitial+0.265
yg+0.265
Neg logit contributions
ngth-0.367
 participating-0.322
 gaining-0.306
 finishing-0.305
 exerc-0.303
TokenL
Token ID43
Feature activation-6.03e-01

Pos logit contributions
flat+0.273
alias+0.271
aque+0.264
yg+0.251
interstitial+0.251
Neg logit contributions
ngth-0.323
 participating-0.282
 gaining-0.268
 finishing-0.267
 exerc-0.265
Token 2010
Token ID3050
Feature activation+1.61e-02

Pos logit contributions
ngth+0.455
 participating+0.358
lished+0.338
 exerc+0.337
 gaining+0.333
Neg logit contributions
alias-0.621
flat-0.618
aque-0.603
yg-0.589
-0.581
Tokenboys
Token ID13202
Feature activation+7.76e-01

Pos logit contributions
flat+0.011
alias+0.011
aque+0.011
interstitial+0.011
uckland+0.010
Neg logit contributions
ngth-0.012
 participating-0.010
 finishing-0.009
 gaining-0.009
 exerc-0.009
Token beware
Token ID40600
Feature activation+7.37e-01

Pos logit contributions
flat+0.573
alias+0.572
aque+0.556
yg+0.534
interstitial+0.530
Neg logit contributions
ngth-0.534
 participating-0.449
 gaining-0.424
 finishing-0.423
 exerc-0.420
Token
Token ID198
Feature activation-3.51e-01

Pos logit contributions
flat+0.485
alias+0.482
aque+0.468
interstitial+0.448
yg+0.446
Neg logit contributions
ngth-0.565
 participating-0.493
 gaining-0.466
 finishing-0.465
 exerc-0.462
Token
Token ID198
Feature activation-5.04e-01

Pos logit contributions
ngth+0.301
 participating+0.256
 exerc+0.247
lished+0.246
 gaining+0.246
Neg logit contributions
flat-0.212
alias-0.210
aque-0.200
yg-0.192
-0.188
TokenReviewer
Token ID35407
Feature activation-4.48e+00

Pos logit contributions
ngth+0.459
 participating+0.410
 finishing+0.391
 gaining+0.389
 exerc+0.385
Neg logit contributions
alias-0.251
flat-0.248
aque-0.243
uckland-0.228
yg-0.228
Token:
Token ID25
Feature activation-5.44e-01

Pos logit contributions
ngth+2.394
 participating+2.010
 finishing+1.844
 gaining+1.842
 fulfilled+1.799
Neg logit contributions
flat-3.901
alias-3.888
aque-3.821
interstitial-3.727
uckland-3.693
Token j
Token ID474
Feature activation+6.03e-01

Pos logit contributions
ngth+0.383
 participating+0.333
 gaining+0.313
 finishing+0.313
 exerc+0.310
Neg logit contributions
flat-0.389
alias-0.387
aque-0.378
interstitial-0.364
yg-0.361
Tokenwt
Token ID46569
Feature activation-1.17e+00

Pos logit contributions
flat+0.365
alias+0.362
aque+0.352
yg+0.334
interstitial+0.333
Neg logit contributions
ngth-0.494
 participating-0.436
 gaining-0.415
 finishing-0.413
 exerc-0.410
Token0001
Token ID18005
Feature activation+3.22e-01

Pos logit contributions
ngth+0.934
 participating+0.815
 gaining+0.776
 finishing+0.775
 exerc+0.769
Neg logit contributions
flat-0.725
alias-0.720
aque-0.702
interstitial-0.674
yg-0.665
Token -
Token ID532
Feature activation-4.80e-01

Pos logit contributions
flat+0.246
alias+0.244
aque+0.238
interstitial+0.231
yg+0.228
Neg logit contributions
ngth-0.212
 participating-0.181
 gaining-0.170
 finishing-0.170
 exerc-0.168
Tokener
Token ID263
Feature activation+1.14e+00

Pos logit contributions
ngth+0.356
 participating+0.312
 gaining+0.301
 finishing+0.298
 exerc+0.298
Neg logit contributions
flat-0.212
alias-0.209
aque-0.207
yg-0.195
alia-0.191
Token magnet
Token ID19972
Feature activation+2.96e-01

Pos logit contributions
flat+1.009
alias+1.004
aque+0.987
yg+0.957
+0.946
Neg logit contributions
ngth-0.634
 participating-0.485
lished-0.479
 finishing-0.443
 gaining-0.442
Token?
Token ID30
Feature activation+3.99e-02

Pos logit contributions
alias+0.182
flat+0.182
aque+0.177
interstitial+0.170
yg+0.166
Neg logit contributions
ngth-0.238
 participating-0.210
 gaining-0.200
 finishing-0.199
 fulfilled-0.198
Token
Token ID198
Feature activation+2.33e-02

Pos logit contributions
flat+0.027
alias+0.027
aque+0.026
interstitial+0.025
uckland+0.025
Neg logit contributions
ngth-0.030
 participating-0.026
 finishing-0.025
 gaining-0.025
 fulfilled-0.025
Token
Token ID198
Feature activation-1.19e-01

Pos logit contributions
flat+0.014
alias+0.014
aque+0.013
yg+0.012
+0.012
Neg logit contributions
ngth-0.020
 participating-0.017
lished-0.017
 exerc-0.017
 gaining-0.017
TokenReviewer
Token ID35407
Feature activation-4.33e+00

Pos logit contributions
ngth+0.110
 participating+0.099
 finishing+0.094
 gaining+0.094
 exerc+0.093
Neg logit contributions
alias-0.058
flat-0.058
aque-0.056
interstitial-0.053
yg-0.052
Token:
Token ID25
Feature activation-3.91e-01

Pos logit contributions
ngth+2.323
 participating+1.951
 finishing+1.790
 gaining+1.789
 fulfilled+1.747
Neg logit contributions
flat-3.767
alias-3.753
aque-3.690
interstitial-3.595
uckland-3.562
Token hey
Token ID17207
Feature activation-2.01e-01

Pos logit contributions
ngth+0.276
 participating+0.240
 gaining+0.226
 finishing+0.226
 exerc+0.223
Neg logit contributions
flat-0.279
alias-0.278
aque-0.271
interstitial-0.261
yg-0.259
Token_
Token ID62
Feature activation-1.35e-01

Pos logit contributions
ngth+0.160
 participating+0.139
 gaining+0.132
 finishing+0.132
 exerc+0.131
Neg logit contributions
flat-0.127
alias-0.126
aque-0.123
interstitial-0.117
yg-0.116
Tokenler
Token ID1754
Feature activation+1.28e+00

Pos logit contributions
ngth+0.108
 participating+0.095
 gaining+0.090
 finishing+0.090
 exerc+0.089
Neg logit contributions
flat-0.083
alias-0.083
aque-0.081
interstitial-0.077
yg-0.077
Tokenoy
Token ID726
Feature activation+5.09e-01

Pos logit contributions
flat+0.753
alias+0.748
aque+0.728
yg+0.691
interstitial+0.683
Neg logit contributions
ngth-1.067
 participating-0.941
 gaining-0.898
 finishing-0.895
 exerc-0.890
Token bad
Token ID2089
Feature activation-8.56e-02

Pos logit contributions
ngth+0.240
 participating+0.203
lished+0.200
 exerc+0.196
 gaining+0.194
Neg logit contributions
flat-0.177
alias-0.173
aque-0.169
yg-0.161
-0.160
Token idea
Token ID2126
Feature activation+4.47e-01

Pos logit contributions
ngth+0.056
 participating+0.044
lished+0.043
 gaining+0.041
 fulfilled+0.040
Neg logit contributions
alias-0.069
flat-0.068
aque-0.067
yg-0.064
-0.064
Token.
Token ID13
Feature activation-1.35e-01

Pos logit contributions
alias+0.321
flat+0.320
aque+0.308
yg+0.300
+0.296
Neg logit contributions
ngth-0.326
 participating-0.272
lished-0.262
 exerc-0.260
 gaining-0.257
Token
Token ID198
Feature activation-4.17e-01

Pos logit contributions
ngth+0.093
 participating+0.081
 finishing+0.076
 gaining+0.076
 fulfilled+0.076
Neg logit contributions
flat-0.097
alias-0.097
aque-0.094
interstitial-0.092
uckland-0.091
Token
Token ID198
Feature activation-5.34e-01

Pos logit contributions
ngth+0.354
 participating+0.302
 exerc+0.290
 gaining+0.290
lished+0.289
Neg logit contributions
flat-0.253
alias-0.250
aque-0.240
yg-0.230
-0.226
TokenReviewer
Token ID35407
Feature activation-4.20e+00

Pos logit contributions
ngth+0.514
 participating+0.462
 gaining+0.443
 finishing+0.442
 exerc+0.439
Neg logit contributions
flat-0.246
alias-0.244
aque-0.235
interstitial-0.219
yg-0.219
Token:
Token ID25
Feature activation-5.50e-01

Pos logit contributions
ngth+2.280
 participating+1.909
 gaining+1.755
 finishing+1.754
 fulfilled+1.713
Neg logit contributions
flat-3.643
alias-3.629
aque-3.565
interstitial-3.465
yg-3.436
Token Mike
Token ID4995
Feature activation+4.08e-01

Pos logit contributions
ngth+0.388
 participating+0.337
 gaining+0.318
 finishing+0.317
 exerc+0.314
Neg logit contributions
flat-0.393
alias-0.391
aque-0.382
interstitial-0.368
yg-0.365
TokenM
Token ID44
Feature activation+2.86e-01

Pos logit contributions
flat+0.265
alias+0.264
aque+0.257
interstitial+0.245
yg+0.244
Neg logit contributions
ngth-0.315
 participating-0.275
 gaining-0.261
 finishing-0.260
 exerc-0.258
Tokenanners
Token ID15672
Feature activation-7.84e-03

Pos logit contributions
flat+0.232
alias+0.230
aque+0.225
interstitial+0.217
yg+0.217
Neg logit contributions
ngth-0.176
 participating-0.148
 gaining-0.138
 finishing-0.138
 exerc-0.135
Token -
Token ID532
Feature activation-7.56e-01

Pos logit contributions
ngth+0.005
 participating+0.005
 gaining+0.004
 finishing+0.004
 exerc+0.004
Neg logit contributions
flat-0.006
alias-0.006
aque-0.006
interstitial-0.005
yg-0.005
Token are
Token ID389
Feature activation-4.08e-02

Pos logit contributions
alias+0.033
flat+0.033
aque+0.032
yg+0.031
interstitial+0.030
Neg logit contributions
ngth-0.030
 participating-0.025
 finishing-0.023
 gaining-0.023
 exerc-0.023
Token Here
Token ID3423
Feature activation+1.40e-01

Pos logit contributions
ngth+0.025
 participating+0.021
 gaining+0.019
 finishing+0.019
 fulfilled+0.019
Neg logit contributions
flat-0.033
alias-0.033
aque-0.032
yg-0.031
interstitial-0.031
Token....
Token ID1106
Feature activation+1.32e+00

Pos logit contributions
flat+0.082
alias+0.082
aque+0.079
interstitial+0.075
yg+0.075
Neg logit contributions
ngth-0.117
 participating-0.104
 gaining-0.099
 finishing-0.098
 exerc-0.098
Token
Token ID198
Feature activation-1.51e-01

Pos logit contributions
flat+0.855
alias+0.850
aque+0.829
interstitial+0.791
yg+0.785
Neg logit contributions
ngth-1.023
 participating-0.901
 gaining-0.851
 finishing-0.850
 exerc-0.843
Token
Token ID198
Feature activation-2.49e-01

Pos logit contributions
ngth+0.131
 participating+0.111
 exerc+0.107
lished+0.107
 gaining+0.107
Neg logit contributions
flat-0.090
alias-0.089
aque-0.085
yg-0.082
amic-0.080
TokenReviewer
Token ID35407
Feature activation-4.14e+00

Pos logit contributions
ngth+0.228
 participating+0.204
 finishing+0.195
 gaining+0.194
 exerc+0.193
Neg logit contributions
alias-0.123
flat-0.122
aque-0.119
uckland-0.112
interstitial-0.112
Token:
Token ID25
Feature activation-3.77e-01

Pos logit contributions
ngth+2.252
 participating+1.883
 gaining+1.732
 finishing+1.730
 fulfilled+1.691
Neg logit contributions
flat-3.595
alias-3.579
aque-3.518
interstitial-3.419
yg-3.390
Token SAS
Token ID35516
Feature activation+3.13e-01

Pos logit contributions
ngth+0.267
 participating+0.232
 gaining+0.219
 finishing+0.218
 exerc+0.216
Neg logit contributions
flat-0.268
alias-0.267
aque-0.260
interstitial-0.251
yg-0.249
TokenK
Token ID42
Feature activation-3.93e-01

Pos logit contributions
flat+0.193
alias+0.192
aque+0.186
interstitial+0.177
yg+0.177
Neg logit contributions
ngth-0.253
 participating-0.222
 gaining-0.211
 finishing-0.211
 exerc-0.209
TokenATCH
Token ID11417
Feature activation+3.50e-01

Pos logit contributions
ngth+0.318
 participating+0.280
 gaining+0.266
 finishing+0.266
 exerc+0.263
Neg logit contributions
flat-0.241
alias-0.239
aque-0.232
interstitial-0.221
yg-0.220
TokenEW
Token ID6217
Feature activation-4.69e-01

Pos logit contributions
flat+0.223
alias+0.221
aque+0.215
interstitial+0.205
yg+0.204
Neg logit contributions
ngth-0.275
 participating-0.242
 gaining-0.229
 finishing-0.229
 exerc-0.226
Token was
Token ID373
Feature activation+7.70e-01

Pos logit contributions
flat+0.047
alias+0.046
aque+0.045
yg+0.043
interstitial+0.043
Neg logit contributions
ngth-0.039
 participating-0.033
 gaining-0.031
 finishing-0.031
 exerc-0.030
Token about
Token ID546
Feature activation-8.04e-01

Pos logit contributions
flat+0.589
alias+0.584
aque+0.570
yg+0.548
+0.545
Neg logit contributions
ngth-0.511
 participating-0.428
 gaining-0.400
 exerc-0.400
 finishing-0.399
Token homosexuality
Token ID19822
Feature activation-1.19e-01

Pos logit contributions
ngth+0.537
 participating+0.461
 gaining+0.433
 finishing+0.431
 exerc+0.426
Neg logit contributions
flat-0.606
alias-0.604
aque-0.589
interstitial-0.565
yg-0.565
Token
Token ID198
Feature activation-4.13e-01

Pos logit contributions
ngth+0.080
 participating+0.067
 finishing+0.063
 gaining+0.063
 exerc+0.063
Neg logit contributions
flat-0.090
alias-0.089
aque-0.087
yg-0.084
interstitial-0.084
Token
Token ID198
Feature activation-4.25e-01

Pos logit contributions
ngth+0.306
 participating+0.270
 gaining+0.255
 finishing+0.254
 fulfilled+0.250
Neg logit contributions
alias-0.273
flat-0.272
aque-0.268
interstitial-0.257
uckland-0.255
TokenReviewer
Token ID35407
Feature activation-3.94e+00

Pos logit contributions
ngth+0.331
 participating+0.290
 gaining+0.275
 finishing+0.274
 exerc+0.271
Neg logit contributions
flat-0.269
alias-0.269
aque-0.264
interstitial-0.249
uckland-0.249
Token:
Token ID25
Feature activation-6.30e-01

Pos logit contributions
ngth+2.216
 participating+1.836
 gaining+1.699
 finishing+1.691
 exerc+1.668
Neg logit contributions
flat-3.384
alias-3.369
aque-3.301
interstitial-3.186
yg-3.182
Token k
Token ID479
Feature activation+1.01e-02

Pos logit contributions
ngth+0.445
 participating+0.387
 gaining+0.365
 finishing+0.364
 exerc+0.360
Neg logit contributions
flat-0.449
alias-0.447
aque-0.436
interstitial-0.420
yg-0.417
Tokenimp
Token ID11011
Feature activation+2.41e-01

Pos logit contributions
flat+0.006
alias+0.006
aque+0.006
yg+0.006
interstitial+0.006
Neg logit contributions
ngth-0.008
 participating-0.007
 gaining-0.007
 finishing-0.007
 exerc-0.007
Tokenunk
Token ID2954
Feature activation+9.15e-01

Pos logit contributions
flat+0.148
alias+0.147
aque+0.143
interstitial+0.136
yg+0.136
Neg logit contributions
ngth-0.194
 participating-0.171
 gaining-0.162
 finishing-0.162
 exerc-0.160
Tokenrock
Token ID10823
Feature activation-6.33e-01

Pos logit contributions
flat+0.631
alias+0.628
aque+0.612
yg+0.585
interstitial+0.583
Neg logit contributions
ngth-0.671
 participating-0.582
 gaining-0.550
 finishing-0.549
 exerc-0.544
TokenT
Token ID51
Feature activation+2.02e-01

Pos logit contributions
flat+0.097
alias+0.096
aque+0.094
yg+0.090
interstitial+0.090
Neg logit contributions
ngth-0.087
 participating-0.074
 gaining-0.069
 finishing-0.069
 exerc-0.069
Tokenunes
Token ID4015
Feature activation-1.25e+00

Pos logit contributions
flat+0.094
alias+0.094
aque+0.090
yg+0.084
interstitial+0.084
Neg logit contributions
ngth-0.194
 participating-0.174
 gaining-0.167
 finishing-0.166
 exerc-0.165
Token (
Token ID357
Feature activation-1.05e+00

Pos logit contributions
ngth+0.829
 participating+0.710
 gaining+0.667
 finishing+0.663
 exerc+0.656
Neg logit contributions
flat-0.950
alias-0.947
aque-0.925
interstitial-0.889
yg-0.887
TokenHD
Token ID10227
Feature activation-1.03e+00

Pos logit contributions
ngth+0.844
 participating+0.745
 gaining+0.709
 finishing+0.706
 exerc+0.699
Neg logit contributions
flat-0.635
alias-0.635
aque-0.619
interstitial-0.587
yg-0.585
Token)
Token ID8
Feature activation-9.82e-01

Pos logit contributions
ngth+0.760
 participating+0.661
 gaining+0.628
 finishing+0.625
 exerc+0.615
Neg logit contributions
alias-0.701
flat-0.697
aque-0.678
yg-0.649
interstitial-0.649
Token |
Token ID930
Feature activation-3.79e+00

Pos logit contributions
ngth+0.719
 participating+0.627
 gaining+0.593
 finishing+0.590
 exerc+0.584
Neg logit contributions
flat-0.675
alias-0.673
aque-0.657
interstitial-0.629
yg-0.626
Token iTunes
Token ID4830
Feature activation-9.97e-01

Pos logit contributions
ngth+2.828
 participating+2.546
 gaining+2.407
 finishing+2.391
 exerc+2.357
Neg logit contributions
flat-2.466
alias-2.462
aque-2.436
interstitial-2.341
uckland-2.308
Token (
Token ID357
Feature activation-8.07e-01

Pos logit contributions
ngth+0.715
 participating+0.624
 gaining+0.589
 finishing+0.586
 exerc+0.580
Neg logit contributions
alias-0.697
flat-0.697
aque-0.681
interstitial-0.653
yg-0.648
TokenHQ
Token ID41275
Feature activation-8.05e-01

Pos logit contributions
ngth+0.626
 participating+0.549
 gaining+0.520
 finishing+0.520
 exerc+0.512
Neg logit contributions
alias-0.516
flat-0.514
aque-0.502
interstitial-0.478
yg-0.477
Token)
Token ID8
Feature activation-3.91e-01

Pos logit contributions
ngth+0.577
 participating+0.505
 finishing+0.479
 gaining+0.478
 fulfilled+0.465
Neg logit contributions
alias-0.565
flat-0.556
aque-0.546
interstitial-0.526
FORE-0.525
Token |
Token ID930
Feature activation-3.32e+00

Pos logit contributions
ngth+0.260
 participating+0.225
 gaining+0.211
 finishing+0.210
 fulfilled+0.206
Neg logit contributions
alias-0.294
flat-0.293
aque-0.288
interstitial-0.278
uckland-0.274
Token His
Token ID2399
Feature activation-2.23e-01

Pos logit contributions
ngth+0.769
 participating+0.664
 gaining+0.628
 finishing+0.626
 exerc+0.620
Neg logit contributions
flat-0.746
alias-0.742
aque-0.723
interstitial-0.691
yg-0.691
Token Mother
Token ID10584
Feature activation-6.69e-01

Pos logit contributions
ngth+0.164
 participating+0.142
 gaining+0.134
 finishing+0.134
 exerc+0.133
Neg logit contributions
flat-0.153
alias-0.153
aque-0.148
yg-0.142
interstitial-0.141
Token Is
Token ID1148
Feature activation-1.11e-01

Pos logit contributions
ngth+0.476
 participating+0.410
 gaining+0.387
 finishing+0.386
 exerc+0.383
Neg logit contributions
flat-0.476
alias-0.473
aque-0.462
interstitial-0.441
yg-0.441
Token an
Token ID281
Feature activation-4.85e-01

Pos logit contributions
ngth+0.082
 participating+0.071
 gaining+0.067
 finishing+0.067
 exerc+0.066
Neg logit contributions
flat-0.076
alias-0.076
aque-0.074
yg-0.070
interstitial-0.070
Token Acc
Token ID6366
Feature activation+1.39e-01

Pos logit contributions
ngth+0.360
 participating+0.312
 gaining+0.295
 finishing+0.295
 exerc+0.292
Neg logit contributions
flat-0.330
alias-0.329
aque-0.320
interstitial-0.305
yg-0.305
Tokenompl
Token ID6316
Feature activation-3.77e+00

Pos logit contributions
flat+0.119
alias+0.119
aque+0.116
yg+0.112
interstitial+0.112
Neg logit contributions
ngth-0.079
 participating-0.065
 gaining-0.060
 finishing-0.060
 exerc-0.059
Tokenished
Token ID1348
Feature activation-6.21e-01

Pos logit contributions
ngth+3.156
 participating+2.833
 finishing+2.699
 gaining+2.696
 fulfilled+2.663
Neg logit contributions
flat-2.152
alias-2.129
aque-2.037
interstitial-1.963
-1.943
Token Travel
Token ID13524
Feature activation-4.76e-01

Pos logit contributions
ngth+0.478
 participating+0.419
 gaining+0.397
 finishing+0.396
 exerc+0.393
Neg logit contributions
flat-0.404
alias-0.401
aque-0.392
interstitial-0.373
yg-0.372
Token Writer
Token ID26606
Feature activation+2.72e-01

Pos logit contributions
ngth+0.398
 participating+0.351
 gaining+0.335
 finishing+0.334
 exerc+0.331
Neg logit contributions
flat-0.280
alias-0.278
aque-0.270
interstitial-0.256
yg-0.255
Token
Token ID198
Feature activation+5.33e-01

Pos logit contributions
alias+0.181
flat+0.181
aque+0.174
interstitial+0.166
yg+0.166
Neg logit contributions
ngth-0.207
 participating-0.178
 gaining-0.169
 finishing-0.169
 exerc-0.168
Token
Token ID198
Feature activation+3.36e-01

Pos logit contributions
flat+0.307
alias+0.307
aque+0.287
yg+0.277
ine+0.274
Neg logit contributions
ngth-0.481
 participating-0.401
lished-0.396
 exerc-0.394
 gaining-0.388
Token!
Token ID0
Feature activation+1.14e+00

Pos logit contributions
flat+0.851
alias+0.845
aque+0.822
yg+0.792
interstitial+0.786
Neg logit contributions
ngth-0.798
 participating-0.673
 gaining-0.637
 finishing-0.635
 exerc-0.630
Token<|endoftext|>
Token ID50256
Feature activation+6.96e-01

Pos logit contributions
flat+0.734
alias+0.729
aque+0.707
yg+0.676
interstitial+0.671
Neg logit contributions
ngth-0.887
 participating-0.772
 gaining-0.736
 finishing-0.731
 exerc-0.727
TokenPosted
Token ID14231
Feature activation+6.73e-01

Pos logit contributions
flat+0.449
alias+0.446
aque+0.434
interstitial+0.413
yg+0.413
Neg logit contributions
ngth-0.542
 participating-0.474
 gaining-0.450
 finishing-0.449
 exerc-0.445
Token on
Token ID319
Feature activation+1.10e-01

Pos logit contributions
flat+0.496
alias+0.494
aque+0.482
yg+0.461
interstitial+0.461
Neg logit contributions
ngth-0.464
 participating-0.395
 gaining-0.373
 finishing-0.371
 exerc-0.368
Token by
Token ID416
Feature activation-2.87e-01

Pos logit contributions
flat+0.082
alias+0.082
aque+0.080
yg+0.077
interstitial+0.076
Neg logit contributions
ngth-0.075
 participating-0.063
 gaining-0.059
 finishing-0.059
 exerc-0.059
Token |
Token ID930
Feature activation-3.65e+00

Pos logit contributions
ngth+0.206
 participating+0.179
 gaining+0.169
 finishing+0.169
 exerc+0.167
Neg logit contributions
flat-0.201
alias-0.200
aque-0.195
interstitial-0.188
yg-0.186
Token in
Token ID287
Feature activation-1.07e-01

Pos logit contributions
ngth+2.616
 participating+2.355
 finishing+2.209
 gaining+2.200
 exerc+2.166
Neg logit contributions
flat-2.461
alias-2.451
aque-2.434
interstitial-2.347
yg-2.284
Token
Token ID198
Feature activation-2.37e-01

Pos logit contributions
ngth+0.077
 participating+0.067
 gaining+0.063
 finishing+0.063
 exerc+0.063
Neg logit contributions
flat-0.075
alias-0.075
aque-0.073
interstitial-0.070
yg-0.070
Token
Token ID198
Feature activation-3.42e-01

Pos logit contributions
ngth+0.202
 participating+0.173
 exerc+0.166
 gaining+0.165
lished+0.165
Neg logit contributions
flat-0.143
alias-0.142
aque-0.136
yg-0.130
FORE-0.129
Token|
Token ID91
Feature activation-3.24e+00

Pos logit contributions
ngth+0.259
 participating+0.227
 gaining+0.215
 finishing+0.215
 exerc+0.212
Neg logit contributions
flat-0.224
alias-0.223
aque-0.218
interstitial-0.208
yg-0.207
Token in
Token ID287
Feature activation-1.99e-01

Pos logit contributions
ngth+2.274
 participating+2.020
 finishing+1.890
 gaining+1.888
 fulfilled+1.852
Neg logit contributions
flat-2.262
alias-2.253
aque-2.216
interstitial-2.126
yg-2.109
Token(
Token ID7
Feature activation-1.00e+00

Pos logit contributions
ngth+0.173
 participating+0.151
 gaining+0.143
 finishing+0.143
 exerc+0.141
Neg logit contributions
flat-0.147
alias-0.146
aque-0.142
yg-0.136
interstitial-0.136
TokenML
Token ID5805
Feature activation-1.34e+00

Pos logit contributions
ngth+0.802
 participating+0.706
 gaining+0.671
 finishing+0.669
 exerc+0.661
Neg logit contributions
flat-0.623
alias-0.617
aque-0.602
interstitial-0.573
yg-0.572
Tokenive
Token ID425
Feature activation-5.38e-01

Pos logit contributions
ngth+1.448
 participating+1.327
 gaining+1.278
 finishing+1.277
 exerc+1.267
Neg logit contributions
flat-0.447
alias-0.442
aque-0.415
interstitial-0.381
yg-0.376
Token.
Token ID13
Feature activation-3.05e-01

Pos logit contributions
ngth+0.396
 participating+0.344
 gaining+0.325
 finishing+0.324
 exerc+0.321
Neg logit contributions
flat-0.369
alias-0.367
aque-0.358
interstitial-0.342
yg-0.341
Tokencom
Token ID785
Feature activation-5.82e-01

Pos logit contributions
ngth+0.219
 participating+0.191
 gaining+0.180
 finishing+0.178
 exerc+0.177
Neg logit contributions
flat-0.213
alias-0.212
aque-0.207
interstitial-0.200
yg-0.198
Token |
Token ID930
Feature activation-3.47e+00

Pos logit contributions
ngth+0.433
 participating+0.379
 gaining+0.357
 finishing+0.356
 exerc+0.352
Neg logit contributions
flat-0.394
alias-0.392
aque-0.383
interstitial-0.369
yg-0.365
Token File
Token ID9220
Feature activation-9.96e-02

Pos logit contributions
ngth+2.558
 participating+2.282
 gaining+2.137
 finishing+2.129
 exerc+2.112
Neg logit contributions
flat-2.329
alias-2.295
aque-2.268
interstitial-2.188
yg-2.136
Token)
Token ID8
Feature activation-2.22e-01

Pos logit contributions
ngth+0.075
 participating+0.067
 gaining+0.063
 finishing+0.062
 exerc+0.061
Neg logit contributions
flat-0.066
alias-0.065
aque-0.064
interstitial-0.061
yg-0.061
Token
Token ID198
Feature activation+3.15e-02

Pos logit contributions
ngth+0.171
 participating+0.150
 gaining+0.142
 finishing+0.142
 exerc+0.141
Neg logit contributions
flat-0.144
alias-0.143
aque-0.140
yg-0.133
interstitial-0.133
Token
Token ID198
Feature activation-8.08e-02

Pos logit contributions
flat+0.019
alias+0.019
aque+0.018
yg+0.017
+0.017
Neg logit contributions
ngth-0.028
 participating-0.023
 exerc-0.023
lished-0.023
 gaining-0.022
TokenL
Token ID43
Feature activation-7.94e-01

Pos logit contributions
ngth+0.063
 participating+0.055
 gaining+0.052
 finishing+0.052
 exerc+0.052
Neg logit contributions
flat-0.051
alias-0.051
aque-0.050
yg-0.048
interstitial-0.047
Token Hide
Token ID10415
Feature activation-5.39e-01

Pos logit contributions
ngth+0.146
 participating+0.129
 gaining+0.123
 finishing+0.123
 exerc+0.121
Neg logit contributions
flat-0.103
alias-0.102
aque-0.100
interstitial-0.095
yg-0.094
Token Caption
Token ID11260
Feature activation-1.79e+00

Pos logit contributions
ngth+0.483
 participating+0.413
 exerc+0.404
lished+0.401
 gaining+0.399
Neg logit contributions
alias-0.302
flat-0.300
aque-0.287
yg-0.272
alia-0.269
Token 14
Token ID1478
Feature activation+3.38e-01

Pos logit contributions
ngth+1.339
 participating+1.140
 gaining+1.084
 finishing+1.075
 exerc+1.075
Neg logit contributions
flat-1.240
alias-1.236
aque-1.194
yg-1.150
interstitial-1.135
Token of
Token ID286
Feature activation-1.63e-01

Pos logit contributions
alias+0.291
flat+0.291
aque+0.278
yg+0.272
ine+0.271
Neg logit contributions
ngth-0.208
lished-0.157
 participating-0.156
 exerc-0.151
 gaining-0.148
Token 15
Token ID1315
Feature activation+1.98e-01

Pos logit contributions
ngth+0.127
 participating+0.102
lished+0.101
 exerc+0.099
 gaining+0.098
Neg logit contributions
flat-0.114
alias-0.113
aque-0.108
yg-0.104
ine-0.104
Token Photos
Token ID9434
Feature activation-3.46e+00

Pos logit contributions
alias+0.119
flat+0.119
aque+0.114
yg+0.109
+0.107
Neg logit contributions
ngth-0.168
 participating-0.144
 gaining-0.138
 exerc-0.138
lished-0.138
Token:
Token ID25
Feature activation-5.19e-01

Pos logit contributions
ngth+1.981
 participating+1.632
 gaining+1.514
 finishing+1.506
 exerc+1.494
Neg logit contributions
flat-2.957
alias-2.945
aque-2.881
yg-2.778
interstitial-2.772
Token Carly
Token ID35982
Feature activation-3.09e-02

Pos logit contributions
ngth+0.483
 participating+0.420
 gaining+0.405
 exerc+0.403
lished+0.402
Neg logit contributions
alias-0.269
flat-0.268
aque-0.253
yg-0.242
-0.239
Token Fiorina
Token ID49028
Feature activation+4.63e-01

Pos logit contributions
ngth+0.030
 participating+0.026
lished+0.026
 exerc+0.026
 gaining+0.025
Neg logit contributions
alias-0.014
flat-0.014
aque-0.014
yg-0.013
FORE-0.013
Token's
Token ID338
Feature activation-2.12e-01

Pos logit contributions
flat+0.350
alias+0.350
aque+0.336
yg+0.327
ine+0.321
Neg logit contributions
ngth-0.326
 participating-0.258
lished-0.255
 exerc-0.249
 gaining-0.244
Token political
Token ID1964
Feature activation-7.89e-01

Pos logit contributions
ngth+0.199
 participating+0.171
lished+0.167
 exerc+0.165
 gaining+0.164
Neg logit contributions
flat-0.106
alias-0.106
aque-0.102
yg-0.096
-0.095
Token of
Token ID286
Feature activation-4.64e-01

Pos logit contributions
ngth+0.463
 participating+0.383
 gaining+0.353
 finishing+0.351
 exerc+0.345
Neg logit contributions
flat-0.741
alias-0.738
aque-0.723
interstitial-0.699
yg-0.697
Token Americans
Token ID3399
Feature activation-5.45e-01

Pos logit contributions
ngth+0.319
 participating+0.273
 gaining+0.257
 finishing+0.256
 exerc+0.254
Neg logit contributions
flat-0.342
alias-0.341
aque-0.332
yg-0.318
interstitial-0.318
Token support
Token ID1104
Feature activation-8.26e-01

Pos logit contributions
ngth+0.367
 participating+0.309
 gaining+0.291
 finishing+0.290
 exerc+0.288
Neg logit contributions
flat-0.411
alias-0.408
aque-0.399
yg-0.383
interstitial-0.380
Token Graham
Token ID11520
Feature activation-2.58e-01

Pos logit contributions
ngth+0.548
 participating+0.460
 gaining+0.431
 finishing+0.430
 exerc+0.428
Neg logit contributions
flat-0.632
alias-0.628
aque-0.613
yg-0.589
interstitial-0.588
Token-
Token ID12
Feature activation-5.17e-01

Pos logit contributions
ngth+0.167
 participating+0.141
 gaining+0.132
 finishing+0.131
 exerc+0.131
Neg logit contributions
flat-0.200
alias-0.199
aque-0.195
interstitial-0.187
yg-0.187
TokenCass
Token ID43529
Feature activation-3.39e+00

Pos logit contributions
ngth+0.425
 participating+0.375
 gaining+0.357
 finishing+0.356
 exerc+0.352
Neg logit contributions
flat-0.311
alias-0.309
aque-0.301
interstitial-0.285
yg-0.284
Tokenidy
Token ID19325
Feature activation-1.62e+00

Pos logit contributions
ngth+2.328
 participating+2.005
 gaining+1.885
 finishing+1.880
 exerc+1.857
Neg logit contributions
flat-2.490
alias-2.486
aque-2.428
interstitial-2.328
yg-2.316
Token,
Token ID11
Feature activation+1.05e-02

Pos logit contributions
ngth+1.099
 participating+0.943
 gaining+0.888
 finishing+0.884
 exerc+0.874
Neg logit contributions
flat-1.210
alias-1.204
aque-1.176
interstitial-1.128
yg-1.126
Token the
Token ID262
Feature activation-2.14e-01

Pos logit contributions
flat+0.008
alias+0.008
aque+0.008
yg+0.007
interstitial+0.007
Neg logit contributions
ngth-0.007
 participating-0.006
 gaining-0.006
 finishing-0.006
 exerc-0.006
Token health
Token ID1535
Feature activation-1.13e+00

Pos logit contributions
ngth+0.151
 participating+0.127
 gaining+0.119
 finishing+0.119
 exerc+0.118
Neg logit contributions
flat-0.157
alias-0.155
aque-0.151
yg-0.145
-0.144
Token care
Token ID1337
Feature activation+3.05e-01

Pos logit contributions
ngth+0.716
 participating+0.600
 gaining+0.560
 finishing+0.558
 exerc+0.553
Neg logit contributions
flat-0.899
alias-0.894
aque-0.874
yg-0.842
interstitial-0.838
Token?
Token ID30
Feature activation-4.14e-01

Pos logit contributions
ngth+0.393
 participating+0.338
 gaining+0.318
 finishing+0.317
 exerc+0.314
Neg logit contributions
flat-0.404
alias-0.402
aque-0.392
interstitial-0.375
yg-0.375
Token Chinese
Token ID3999
Feature activation+7.36e-01

Pos logit contributions
ngth+0.302
 participating+0.261
 gaining+0.247
 finishing+0.246
 exerc+0.244
Neg logit contributions
flat-0.288
alias-0.287
aque-0.279
interstitial-0.267
yg-0.267
Token production
Token ID3227
Feature activation-3.60e-01

Pos logit contributions
flat+0.524
alias+0.519
aque+0.504
interstitial+0.485
yg+0.485
Neg logit contributions
ngth-0.530
 participating-0.447
 gaining-0.423
 finishing-0.421
 exerc-0.419
Token value
Token ID1988
Feature activation+3.75e-01

Pos logit contributions
ngth+0.247
 participating+0.211
 gaining+0.198
 finishing+0.198
 exerc+0.196
Neg logit contributions
flat-0.266
alias-0.265
aque-0.258
interstitial-0.248
yg-0.247
Token.
Token ID13
Feature activation-2.32e-01

Pos logit contributions
flat+0.275
alias+0.274
aque+0.265
interstitial+0.254
yg+0.254
Neg logit contributions
ngth-0.260
 participating-0.221
 finishing-0.207
 gaining-0.207
 exerc-0.206
Token |
Token ID930
Feature activation-3.39e+00

Pos logit contributions
ngth+0.172
 participating+0.149
 gaining+0.141
 finishing+0.140
 exerc+0.139
Neg logit contributions
flat-0.159
alias-0.158
aque-0.154
yg-0.147
interstitial-0.147
Token I
Token ID314
Feature activation+2.54e-01

Pos logit contributions
ngth+2.438
 participating+2.153
 finishing+2.021
 gaining+2.021
 fulfilled+1.981
Neg logit contributions
flat-2.332
alias-2.317
aque-2.283
interstitial-2.172
amic-2.162
Token thought
Token ID1807
Feature activation-5.27e-01

Pos logit contributions
flat+0.193
alias+0.192
aque+0.187
yg+0.180
interstitial+0.179
Neg logit contributions
ngth-0.169
 participating-0.144
 gaining-0.135
 finishing-0.134
 exerc-0.134
Token he
Token ID339
Feature activation-2.54e-01

Pos logit contributions
ngth+0.344
 participating+0.295
 gaining+0.277
 finishing+0.276
 exerc+0.272
Neg logit contributions
flat-0.404
alias-0.403
aque-0.394
interstitial-0.379
yg-0.378
Token had
Token ID550
Feature activation-1.97e-01

Pos logit contributions
ngth+0.165
 participating+0.139
 gaining+0.131
 finishing+0.130
 exerc+0.129
Neg logit contributions
flat-0.197
alias-0.196
aque-0.191
yg-0.183
interstitial-0.182
Token a
Token ID257
Feature activation-2.73e-01

Pos logit contributions
ngth+0.134
 participating+0.115
 gaining+0.108
 finishing+0.107
 exerc+0.107
Neg logit contributions
flat-0.146
alias-0.145
aque-0.142
yg-0.136
interstitial-0.136
Token.
Token ID13
Feature activation-2.01e-01

Pos logit contributions
ngth+0.177
 participating+0.153
 gaining+0.145
 finishing+0.144
 exerc+0.143
Neg logit contributions
flat-0.173
alias-0.172
aque-0.167
interstitial-0.161
yg-0.160
Token
Token ID198
Feature activation+7.08e-02

Pos logit contributions
ngth+0.152
 participating+0.134
 gaining+0.127
 finishing+0.127
 exerc+0.125
Neg logit contributions
flat-0.133
alias-0.132
aque-0.129
interstitial-0.124
uckland-0.122
Token
Token ID198
Feature activation+7.32e-02

Pos logit contributions
flat+0.042
alias+0.042
aque+0.039
yg+0.038
+0.037
Neg logit contributions
ngth-0.062
 participating-0.052
lished-0.051
 exerc-0.051
 gaining-0.051
TokenTV
Token ID6849
Feature activation-3.80e-01

Pos logit contributions
flat+0.048
alias+0.048
aque+0.047
interstitial+0.045
yg+0.044
Neg logit contributions
ngth-0.056
 participating-0.049
 gaining-0.046
 finishing-0.046
 exerc-0.045
TokenLINE
Token ID24027
Feature activation+6.24e-01

Pos logit contributions
ngth+0.284
 participating+0.246
 gaining+0.233
 finishing+0.232
 exerc+0.231
Neg logit contributions
flat-0.258
alias-0.256
aque-0.250
yg-0.238
interstitial-0.238
Token |
Token ID930
Feature activation-3.39e+00

Pos logit contributions
flat+0.451
alias+0.448
aque+0.435
interstitial+0.419
yg+0.418
Neg logit contributions
ngth-0.445
 participating-0.372
 exerc-0.356
 gaining-0.351
 finishing-0.351
Token So
Token ID1406
Feature activation+3.37e-01

Pos logit contributions
ngth+2.334
 participating+2.042
 finishing+1.921
 gaining+1.919
 fulfilled+1.884
Neg logit contributions
flat-2.441
alias-2.417
aque-2.385
interstitial-2.293
uckland-2.261
Token,
Token ID11
Feature activation-2.90e-01

Pos logit contributions
flat+0.260
alias+0.259
aque+0.253
yg+0.244
interstitial+0.243
Neg logit contributions
ngth-0.220
 participating-0.186
 gaining-0.174
 exerc-0.173
 finishing-0.173
Token the
Token ID262
Feature activation+2.88e-01

Pos logit contributions
ngth+0.185
 participating+0.154
 gaining+0.144
 exerc+0.143
 finishing+0.143
Neg logit contributions
flat-0.230
alias-0.229
aque-0.223
yg-0.215
interstitial-0.215
Token prom
Token ID1552
Feature activation-1.57e+00

Pos logit contributions
flat+0.213
alias+0.212
aque+0.207
yg+0.199
interstitial+0.198
Neg logit contributions
ngth-0.199
 participating-0.167
 gaining-0.157
 exerc-0.156
 finishing-0.155
Tokenos
Token ID418
Feature activation-1.34e+00

Pos logit contributions
ngth+1.233
 participating+1.083
 gaining+1.029
 finishing+1.026
 exerc+1.018
Neg logit contributions
flat-0.994
alias-0.986
aque-0.959
interstitial-0.914
yg-0.911
Token (
Token ID357
Feature activation-2.06e-01

Pos logit contributions
ngth+0.201
 participating+0.171
 finishing+0.161
 gaining+0.161
 exerc+0.160
Neg logit contributions
flat-0.205
alias-0.205
aque-0.199
yg-0.191
interstitial-0.191
TokenAf
Token ID17584
Feature activation-1.43e+00

Pos logit contributions
ngth+0.202
 participating+0.182
 gaining+0.175
 finishing+0.174
 exerc+0.173
Neg logit contributions
flat-0.091
alias-0.090
aque-0.087
yg-0.081
interstitial-0.080
TokenD
Token ID35
Feature activation-5.49e-01

Pos logit contributions
ngth+1.100
 participating+0.973
 gaining+0.925
 finishing+0.920
 fulfilled+0.907
Neg logit contributions
flat-0.914
alias-0.911
aque-0.897
interstitial-0.854
yg-0.851
Token)
Token ID8
Feature activation-9.77e-01

Pos logit contributions
ngth+0.409
 participating+0.357
 gaining+0.339
 finishing+0.337
 exerc+0.330
Neg logit contributions
flat-0.374
alias-0.368
aque-0.361
interstitial-0.346
yg-0.345
Token party
Token ID2151
Feature activation-1.13e-01

Pos logit contributions
ngth+0.677
 participating+0.569
 finishing+0.536
 gaining+0.535
 exerc+0.534
Neg logit contributions
flat-0.715
alias-0.714
aque-0.697
yg-0.668
interstitial-0.664
Token |
Token ID930
Feature activation-3.36e+00

Pos logit contributions
ngth+0.080
 participating+0.068
 finishing+0.064
 gaining+0.064
 exerc+0.064
Neg logit contributions
alias-0.080
flat-0.080
aque-0.078
yg-0.075
interstitial-0.074
Token AFP
Token ID8916
Feature activation+1.84e-01

Pos logit contributions
ngth+2.479
 participating+2.184
 gaining+2.053
 finishing+2.042
 exerc+2.010
Neg logit contributions
flat-2.289
alias-2.253
aque-2.226
interstitial-2.119
-2.093
Token/
Token ID14
Feature activation+9.29e-02

Pos logit contributions
flat+0.127
alias+0.127
aque+0.124
interstitial+0.118
yg+0.118
Neg logit contributions
ngth-0.134
 participating-0.116
 gaining-0.110
 finishing-0.109
 exerc-0.108
TokenGetty
Token ID6633
Feature activation+5.58e-01

Pos logit contributions
alias+0.068
flat+0.068
aque+0.065
yg+0.064
+0.063
Neg logit contributions
ngth-0.066
 participating-0.056
 gaining-0.053
 finishing-0.052
 exerc-0.052
Token Opinion
Token ID29525
Feature activation+6.21e-01

Pos logit contributions
flat+0.336
alias+0.334
aque+0.324
yg+0.307
interstitial+0.307
Neg logit contributions
ngth-0.458
 participating-0.403
 gaining-0.384
 finishing-0.383
 exerc-0.381
Token It
Token ID632
Feature activation-3.36e-01

Pos logit contributions
flat+0.452
alias+0.448
aque+0.437
yg+0.419
interstitial+0.418
Neg logit contributions
ngth-0.433
 participating-0.371
 finishing-0.348
 gaining-0.347
 exerc-0.347
Token Hide
Token ID10415
Feature activation-5.11e-01

Pos logit contributions
ngth+0.213
 participating+0.190
 gaining+0.180
 finishing+0.180
 exerc+0.178
Neg logit contributions
flat-0.155
alias-0.154
aque-0.150
interstitial-0.143
yg-0.142
Token Caption
Token ID11260
Feature activation-1.63e+00

Pos logit contributions
ngth+0.457
 participating+0.390
 exerc+0.381
lished+0.378
 gaining+0.377
Neg logit contributions
alias-0.286
flat-0.285
aque-0.273
yg-0.259
alia-0.256
Token 13
Token ID1511
Feature activation+1.42e-01

Pos logit contributions
ngth+1.133
 participating+0.930
 exerc+0.891
 gaining+0.881
 finishing+0.875
Neg logit contributions
flat-1.231
alias-1.225
aque-1.186
yg-1.137
alia-1.132
Token of
Token ID286
Feature activation-1.00e-01

Pos logit contributions
flat+0.123
alias+0.123
aque+0.118
yg+0.116
ine+0.115
Neg logit contributions
ngth-0.086
 participating-0.065
lished-0.064
 exerc-0.063
 gaining-0.062
Token 15
Token ID1315
Feature activation+1.22e-01

Pos logit contributions
ngth+0.077
 participating+0.063
lished+0.062
 exerc+0.060
 gaining+0.060
Neg logit contributions
flat-0.069
alias-0.069
aque-0.066
yg-0.064
ine-0.063
Token Photos
Token ID9434
Feature activation-3.36e+00

Pos logit contributions
alias+0.073
flat+0.073
aque+0.070
yg+0.067
+0.066
Neg logit contributions
ngth-0.103
 participating-0.088
 gaining-0.085
 exerc-0.085
lished-0.084
Token:
Token ID25
Feature activation-4.99e-01

Pos logit contributions
ngth+1.928
 participating+1.587
 gaining+1.473
 finishing+1.465
 exerc+1.454
Neg logit contributions
flat-2.867
alias-2.855
aque-2.792
yg-2.693
interstitial-2.685
Token Carly
Token ID35982
Feature activation+1.90e-01

Pos logit contributions
ngth+0.466
 participating+0.405
 gaining+0.390
 exerc+0.388
lished+0.388
Neg logit contributions
alias-0.257
flat-0.256
aque-0.242
yg-0.232
-0.229
Token Fiorina
Token ID49028
Feature activation+5.82e-01

Pos logit contributions
alias+0.088
flat+0.087
aque+0.082
yg+0.080
ine+0.078
Neg logit contributions
ngth-0.188
 participating-0.162
lished-0.159
 exerc-0.158
 gaining-0.156
Token's
Token ID338
Feature activation-1.57e-01

Pos logit contributions
flat+0.440
alias+0.439
aque+0.422
yg+0.411
ine+0.404
Neg logit contributions
ngth-0.411
 participating-0.325
lished-0.322
 exerc-0.314
 gaining-0.307
Token political
Token ID1964
Feature activation-5.68e-01

Pos logit contributions
ngth+0.148
 participating+0.126
lished+0.124
 exerc+0.122
 gaining+0.122
Neg logit contributions
flat-0.079
alias-0.078
aque-0.076
yg-0.071
-0.070
Token Hide
Token ID10415
Feature activation-2.60e-01

Pos logit contributions
flat+0.093
alias+0.092
aque+0.090
interstitial+0.085
yg+0.085
Neg logit contributions
ngth-0.132
 participating-0.117
 gaining-0.111
 finishing-0.111
 exerc-0.110
Token Caption
Token ID11260
Feature activation-1.51e+00

Pos logit contributions
ngth+0.237
 participating+0.201
 exerc+0.198
lished+0.197
 gaining+0.195
Neg logit contributions
alias-0.144
flat-0.143
aque-0.137
yg-0.129
alia-0.128
Token 12
Token ID1105
Feature activation+9.64e-02

Pos logit contributions
ngth+1.103
 participating+0.897
lished+0.871
 exerc+0.867
 gaining+0.865
Neg logit contributions
flat-1.116
alias-1.112
aque-1.070
yg-1.034
-1.027
Token of
Token ID286
Feature activation-1.70e-01

Pos logit contributions
flat+0.083
alias+0.082
aque+0.079
yg+0.077
ine+0.077
Neg logit contributions
ngth-0.060
lished-0.045
 participating-0.045
 exerc-0.044
 gaining-0.043
Token 15
Token ID1315
Feature activation-8.64e-02

Pos logit contributions
ngth+0.132
 participating+0.107
lished+0.106
 exerc+0.103
 gaining+0.103
Neg logit contributions
flat-0.118
alias-0.116
aque-0.111
yg-0.107
ine-0.107
Token Photos
Token ID9434
Feature activation-3.33e+00

Pos logit contributions
ngth+0.073
 participating+0.063
 gaining+0.060
 exerc+0.060
lished+0.060
Neg logit contributions
alias-0.052
flat-0.052
aque-0.050
yg-0.048
-0.047
Token:
Token ID25
Feature activation-4.31e-01

Pos logit contributions
ngth+1.923
 participating+1.580
 gaining+1.467
 finishing+1.459
 exerc+1.451
Neg logit contributions
flat-2.841
alias-2.829
aque-2.765
yg-2.667
interstitial-2.656
Token Carly
Token ID35982
Feature activation+2.96e-01

Pos logit contributions
ngth+0.404
 participating+0.351
 gaining+0.338
 exerc+0.337
lished+0.336
Neg logit contributions
alias-0.222
flat-0.221
aque-0.208
yg-0.200
-0.198
Token Fiorina
Token ID49028
Feature activation+6.51e-01

Pos logit contributions
alias+0.136
flat+0.134
aque+0.126
yg+0.123
FORE+0.121
Neg logit contributions
ngth-0.294
 participating-0.254
lished-0.249
 exerc-0.248
 gaining-0.245
Token's
Token ID338
Feature activation-1.30e-01

Pos logit contributions
flat+0.490
alias+0.489
aque+0.470
yg+0.458
ine+0.450
Neg logit contributions
ngth-0.462
 participating-0.364
lished-0.363
 exerc-0.353
 gaining-0.344
Token political
Token ID1964
Feature activation-6.65e-01

Pos logit contributions
ngth+0.122
 participating+0.105
lished+0.103
 exerc+0.101
 gaining+0.101
Neg logit contributions
flat-0.065
alias-0.064
aque-0.062
yg-0.058
-0.058
Token Hide
Token ID10415
Feature activation-3.23e-01

Pos logit contributions
flat+0.144
alias+0.144
aque+0.139
yg+0.132
interstitial+0.131
Neg logit contributions
ngth-0.199
 participating-0.174
 gaining-0.166
 finishing-0.165
 exerc-0.165
Token Caption
Token ID11260
Feature activation-1.55e+00

Pos logit contributions
ngth+0.292
 participating+0.248
 exerc+0.244
lished+0.242
 gaining+0.241
Neg logit contributions
alias-0.179
flat-0.178
aque-0.170
yg-0.161
alia-0.160
Token 10
Token ID838
Feature activation+2.50e-01

Pos logit contributions
ngth+1.145
 participating+0.928
lished+0.921
 exerc+0.913
 gaining+0.893
Neg logit contributions
alias-1.121
flat-1.120
aque-1.068
yg-1.040
FORE-1.034
Token of
Token ID286
Feature activation-7.69e-02

Pos logit contributions
flat+0.214
alias+0.214
aque+0.204
yg+0.200
ine+0.199
Neg logit contributions
ngth-0.154
lished-0.117
 participating-0.116
 exerc-0.113
 gaining-0.111
Token 15
Token ID1315
Feature activation-1.22e-01

Pos logit contributions
ngth+0.060
 participating+0.049
lished+0.048
 exerc+0.047
 gaining+0.047
Neg logit contributions
flat-0.053
alias-0.052
aque-0.050
yg-0.048
ine-0.048
Token Photos
Token ID9434
Feature activation-3.33e+00

Pos logit contributions
ngth+0.103
 participating+0.089
 gaining+0.085
 exerc+0.085
 finishing+0.084
Neg logit contributions
flat-0.074
alias-0.074
aque-0.071
yg-0.068
-0.067
Token:
Token ID25
Feature activation-4.75e-01

Pos logit contributions
ngth+1.923
 participating+1.579
 gaining+1.467
 finishing+1.459
 exerc+1.451
Neg logit contributions
flat-2.837
alias-2.826
aque-2.761
yg-2.664
interstitial-2.652
Token Carly
Token ID35982
Feature activation+2.70e-01

Pos logit contributions
ngth+0.444
 participating+0.386
 gaining+0.372
 exerc+0.371
lished+0.370
Neg logit contributions
alias-0.245
flat-0.244
aque-0.230
yg-0.221
-0.218
Token Fiorina
Token ID49028
Feature activation+4.88e-01

Pos logit contributions
alias+0.124
flat+0.123
aque+0.115
yg+0.113
ine+0.110
Neg logit contributions
ngth-0.268
 participating-0.232
lished-0.227
 exerc-0.226
 gaining-0.223
Token's
Token ID338
Feature activation-2.26e-01

Pos logit contributions
flat+0.367
alias+0.367
aque+0.352
yg+0.343
ine+0.337
Neg logit contributions
ngth-0.346
 participating-0.273
lished-0.271
 exerc-0.265
 gaining-0.259
Token political
Token ID1964
Feature activation-8.77e-01

Pos logit contributions
ngth+0.213
 participating+0.183
lished+0.179
 exerc+0.177
 gaining+0.175
Neg logit contributions
flat-0.113
alias-0.112
aque-0.108
yg-0.102
-0.101
Token |
Token ID930
Feature activation-3.79e+00

Pos logit contributions
ngth+0.719
 participating+0.627
 gaining+0.593
 finishing+0.590
 exerc+0.584
Neg logit contributions
flat-0.675
alias-0.673
aque-0.657
interstitial-0.629
yg-0.626
Token iTunes
Token ID4830
Feature activation-9.97e-01

Pos logit contributions
ngth+2.828
 participating+2.546
 gaining+2.407
 finishing+2.391
 exerc+2.357
Neg logit contributions
flat-2.466
alias-2.462
aque-2.436
interstitial-2.341
uckland-2.308
Token (
Token ID357
Feature activation-8.07e-01

Pos logit contributions
ngth+0.715
 participating+0.624
 gaining+0.589
 finishing+0.586
 exerc+0.580
Neg logit contributions
alias-0.697
flat-0.697
aque-0.681
interstitial-0.653
yg-0.648
TokenHQ
Token ID41275
Feature activation-8.05e-01

Pos logit contributions
ngth+0.626
 participating+0.549
 gaining+0.520
 finishing+0.520
 exerc+0.512
Neg logit contributions
alias-0.516
flat-0.514
aque-0.502
interstitial-0.478
yg-0.477
Token)
Token ID8
Feature activation-3.91e-01

Pos logit contributions
ngth+0.577
 participating+0.505
 finishing+0.479
 gaining+0.478
 fulfilled+0.465
Neg logit contributions
alias-0.565
flat-0.556
aque-0.546
interstitial-0.526
FORE-0.525
Token |
Token ID930
Feature activation-3.32e+00

Pos logit contributions
ngth+0.260
 participating+0.225
 gaining+0.211
 finishing+0.210
 fulfilled+0.206
Neg logit contributions
alias-0.294
flat-0.293
aque-0.288
interstitial-0.278
uckland-0.274
Token iTunes
Token ID4830
Feature activation-7.13e-01

Pos logit contributions
ngth+2.465
 participating+2.209
 gaining+2.088
 finishing+2.077
 exerc+2.048
Neg logit contributions
flat-2.184
alias-2.178
aque-2.154
interstitial-2.065
uckland-2.035
Token (
Token ID357
Feature activation-6.10e-01

Pos logit contributions
ngth+0.527
 participating+0.461
 gaining+0.436
 finishing+0.433
 exerc+0.429
Neg logit contributions
flat-0.483
alias-0.483
aque-0.472
interstitial-0.452
yg-0.449
TokenSD
Token ID10305
Feature activation-4.23e-01

Pos logit contributions
ngth+0.479
 participating+0.420
 gaining+0.398
 finishing+0.398
 exerc+0.394
Neg logit contributions
flat-0.386
alias-0.386
aque-0.375
interstitial-0.358
yg-0.357
Token)
Token ID8
Feature activation-2.95e-01

Pos logit contributions
ngth+0.300
 participating+0.263
 gaining+0.249
 finishing+0.248
 fulfilled+0.243
Neg logit contributions
alias-0.297
flat-0.296
aque-0.289
interstitial-0.280
uckland-0.275
Token
Token ID198
Feature activation+1.80e-01

Pos logit contributions
ngth+0.201
 participating+0.174
 gaining+0.164
 finishing+0.163
 exerc+0.160
Neg logit contributions
alias-0.217
flat-0.217
aque-0.212
interstitial-0.204
yg-0.202
Token-
Token ID12
Feature activation+1.28e+00

Pos logit contributions
flat+0.013
alias+0.013
aque+0.012
yg+0.012
+0.012
Neg logit contributions
ngth-0.015
 participating-0.013
 exerc-0.012
lished-0.012
 gaining-0.012
TokenM
Token ID44
Feature activation+2.14e-01

Pos logit contributions
flat+0.810
alias+0.804
aque+0.777
yg+0.742
interstitial+0.740
Neg logit contributions
ngth-1.024
 participating-0.896
 gaining-0.850
 finishing-0.849
 exerc-0.844
Tokenanc
Token ID1192
Feature activation-1.38e+00

Pos logit contributions
flat+0.129
alias+0.129
aque+0.125
yg+0.119
interstitial+0.119
Neg logit contributions
ngth-0.176
 participating-0.154
 gaining-0.147
 finishing-0.147
 exerc-0.146
Tokenrus
Token ID14932
Feature activation-2.03e-01

Pos logit contributions
ngth+1.172
 participating+1.038
 gaining+0.990
 finishing+0.987
 exerc+0.980
Neg logit contributions
flat-0.791
alias-0.785
aque-0.762
interstitial-0.722
yg-0.720
Tokenhes
Token ID956
Feature activation+3.69e-01

Pos logit contributions
ngth+0.124
 participating+0.104
 gaining+0.097
 finishing+0.096
 exerc+0.096
Neg logit contributions
flat-0.166
alias-0.165
aque-0.161
yg-0.155
interstitial-0.155
Token-
Token ID12
Feature activation+1.51e+00

Pos logit contributions
flat+0.260
alias+0.258
aque+0.252
interstitial+0.241
yg+0.241
Neg logit contributions
ngth-0.266
 participating-0.230
 gaining-0.217
 finishing-0.216
 exerc-0.214
TokenRo
Token ID15450
Feature activation+3.98e-01

Pos logit contributions
flat+0.969
alias+0.961
aque+0.928
yg+0.887
interstitial+0.886
Neg logit contributions
ngth-1.187
 participating-1.037
 gaining-0.983
 finishing-0.981
 exerc-0.976
Tokenasting
Token ID9222
Feature activation+2.73e-01

Pos logit contributions
flat+0.224
alias+0.223
aque+0.216
interstitial+0.204
yg+0.204
Neg logit contributions
ngth-0.342
 participating-0.303
 gaining-0.289
 finishing-0.288
 exerc-0.286
Token Classic
Token ID13449
Feature activation-8.61e-01

Pos logit contributions
flat+0.188
alias+0.187
aque+0.182
interstitial+0.174
yg+0.174
Neg logit contributions
ngth-0.200
 participating-0.174
 gaining-0.164
 finishing-0.163
 exerc-0.162
Token O
Token ID440
Feature activation-4.17e-02

Pos logit contributions
ngth+0.636
 participating+0.553
 gaining+0.522
 finishing+0.520
 exerc+0.516
Neg logit contributions
flat-0.586
alias-0.584
aque-0.570
interstitial-0.546
yg-0.544
Token&
Token ID5
Feature activation-6.23e-01

Pos logit contributions
ngth+0.036
 participating+0.032
 gaining+0.030
 finishing+0.030
 exerc+0.030
Neg logit contributions
flat-0.024
alias-0.023
aque-0.023
interstitial-0.022
yg-0.021
Token3
Token ID18
Feature activation+9.81e-01

Pos logit contributions
flat+0.356
alias+0.354
aque+0.341
yg+0.322
interstitial+0.321
Neg logit contributions
ngth-0.546
 participating-0.481
 gaining-0.460
 finishing-0.458
 exerc-0.458
Token million
Token ID1510
Feature activation+1.35e+00

Pos logit contributions
flat+0.697
alias+0.694
aque+0.675
interstitial+0.647
yg+0.647
Neg logit contributions
ngth-0.697
 participating-0.607
 gaining-0.572
 finishing-0.572
 exerc-0.562
Token in
Token ID287
Feature activation+5.10e-01

Pos logit contributions
flat+1.026
alias+1.019
aque+1.000
yg+0.958
interstitial+0.951
Neg logit contributions
ngth-0.896
 participating-0.742
 gaining-0.695
 exerc-0.691
lished-0.691
Token hand
Token ID1021
Feature activation+9.47e-01

Pos logit contributions
flat+0.348
alias+0.343
aque+0.336
yg+0.321
interstitial+0.317
Neg logit contributions
ngth-0.383
 participating-0.324
 gaining-0.307
 finishing-0.306
 exerc-0.304
Token to
Token ID284
Feature activation+2.26e-01

Pos logit contributions
flat+0.705
alias+0.701
aque+0.688
yg+0.655
+0.655
Neg logit contributions
ngth-0.645
 participating-0.541
 gaining-0.510
 finishing-0.505
 exerc-0.501
Token bid
Token ID8406
Feature activation+1.64e+00

Pos logit contributions
flat+0.161
alias+0.161
aque+0.156
yg+0.150
amic+0.148
Neg logit contributions
ngth-0.163
 participating-0.137
 gaining-0.129
 finishing-0.129
 exerc-0.128
Token on
Token ID319
Feature activation+3.76e-01

Pos logit contributions
flat+1.257
alias+1.233
aque+1.216
yg+1.168
+1.163
Neg logit contributions
ngth-1.106
 participating-0.899
 exerc-0.846
 gaining-0.846
lished-0.840
Token W
Token ID370
Feature activation-3.17e-02

Pos logit contributions
flat+0.274
alias+0.272
aque+0.265
yg+0.254
interstitial+0.254
Neg logit contributions
ngth-0.263
 participating-0.223
 gaining-0.211
 finishing-0.210
 exerc-0.208
Tokenied
Token ID798
Feature activation-3.00e-01

Pos logit contributions
ngth+0.021
 participating+0.017
 exerc+0.017
 gaining+0.016
 finishing+0.016
Neg logit contributions
alias-0.025
flat-0.025
aque-0.024
yg-0.024
alia-0.023
Tokener
Token ID263
Feature activation+5.41e-01

Pos logit contributions
ngth+0.271
 participating+0.236
 gaining+0.227
lished+0.227
 exerc+0.226
Neg logit contributions
flat-0.162
alias-0.161
aque-0.157
yg-0.149
alia-0.146
Tokenhorn
Token ID25311
Feature activation+5.65e-01

Pos logit contributions
alias+0.335
flat+0.334
aque+0.322
yg+0.308
amic+0.306
Neg logit contributions
ngth-0.449
 participating-0.385
 exerc-0.375
lished-0.372
 gaining-0.369
Token).
Token ID737
Feature activation-3.41e-02

Pos logit contributions
flat+0.148
alias+0.148
aque+0.144
yg+0.137
interstitial+0.137
Neg logit contributions
ngth-0.151
 participating-0.130
 gaining-0.123
 finishing-0.122
 exerc-0.122
Token *
Token ID1635
Feature activation-1.11e+00

Pos logit contributions
ngth+0.025
 participating+0.022
 gaining+0.021
 finishing+0.021
 exerc+0.021
Neg logit contributions
flat-0.023
alias-0.023
aque-0.022
yg-0.021
interstitial-0.021
TokenOr
Token ID5574
Feature activation-4.96e-01

Pos logit contributions
ngth+0.790
 participating+0.683
 gaining+0.646
 finishing+0.646
 fulfilled+0.635
Neg logit contributions
flat-0.777
alias-0.773
aque-0.757
interstitial-0.729
yg-0.726
Token you
Token ID345
Feature activation+3.77e-01

Pos logit contributions
ngth+0.318
 participating+0.272
 gaining+0.255
 finishing+0.255
 exerc+0.251
Neg logit contributions
flat-0.386
alias-0.385
aque-0.377
interstitial-0.364
yg-0.362
Token can
Token ID460
Feature activation+2.66e-01

Pos logit contributions
flat+0.338
alias+0.336
aque+0.329
yg+0.318
interstitial+0.317
Neg logit contributions
ngth-0.200
 participating-0.162
 gaining-0.149
 finishing-0.148
 exerc-0.147
Token buy
Token ID2822
Feature activation+1.46e+00

Pos logit contributions
flat+0.191
alias+0.190
aque+0.186
interstitial+0.178
yg+0.178
Neg logit contributions
ngth-0.187
 participating-0.161
 gaining-0.152
 finishing-0.152
 exerc-0.150
Token the
Token ID262
Feature activation+3.20e-01

Pos logit contributions
flat+1.109
alias+1.104
aque+1.078
interstitial+1.034
yg+1.034
Neg logit contributions
ngth-0.964
 participating-0.821
 gaining-0.771
 finishing-0.767
 exerc-0.760
Token 23
Token ID2242
Feature activation+1.29e-02

Pos logit contributions
flat+0.244
alias+0.243
aque+0.238
interstitial+0.228
yg+0.228
Neg logit contributions
ngth-0.212
 participating-0.181
 gaining-0.169
 finishing-0.169
 exerc-0.167
Token films
Token ID7328
Feature activation-7.72e-01

Pos logit contributions
flat+0.009
alias+0.009
aque+0.009
interstitial+0.008
yg+0.008
Neg logit contributions
ngth-0.010
 participating-0.008
 gaining-0.008
 finishing-0.008
 fulfilled-0.008
Token separately
Token ID13869
Feature activation+5.72e-01

Pos logit contributions
ngth+0.520
 participating+0.448
 gaining+0.420
 finishing+0.419
 exerc+0.413
Neg logit contributions
flat-0.577
alias-0.574
aque-0.561
interstitial-0.539
yg-0.538
Token (
Token ID357
Feature activation-3.44e-02

Pos logit contributions
flat+0.434
alias+0.432
aque+0.422
yg+0.403
interstitial+0.403
Neg logit contributions
ngth-0.381
 participating-0.322
 gaining-0.302
 finishing-0.302
 exerc-0.301
Token determined
Token ID5295
Feature activation-4.16e-01

Pos logit contributions
ngth+0.067
 participating+0.057
 gaining+0.054
 finishing+0.054
 exerc+0.054
Neg logit contributions
flat-0.061
alias-0.061
aque-0.059
yg-0.056
interstitial-0.056
Token she
Token ID673
Feature activation+3.84e-01

Pos logit contributions
ngth+0.244
 participating+0.203
 gaining+0.189
 finishing+0.188
 exerc+0.186
Neg logit contributions
flat-0.348
alias-0.347
aque-0.340
interstitial-0.327
yg-0.327
Token had
Token ID550
Feature activation+3.00e-01

Pos logit contributions
flat+0.286
alias+0.282
aque+0.278
yg+0.265
alia+0.264
Neg logit contributions
ngth-0.265
 participating-0.220
 gaining-0.208
 finishing-0.207
 exerc-0.206
Token a
Token ID257
Feature activation+3.08e-01

Pos logit contributions
flat+0.218
alias+0.216
aque+0.211
yg+0.202
interstitial+0.202
Neg logit contributions
ngth-0.210
 participating-0.178
 gaining-0.168
 finishing-0.167
 exerc-0.166
Token severe
Token ID6049
Feature activation+5.11e-01

Pos logit contributions
flat+0.238
alias+0.235
aque+0.231
yg+0.222
interstitial+0.221
Neg logit contributions
ngth-0.203
 participating-0.171
 gaining-0.161
 finishing-0.159
 exerc-0.159
Token burn
Token ID4245
Feature activation+1.43e+00

Pos logit contributions
flat+0.385
alias+0.382
aque+0.374
yg+0.359
interstitial+0.358
Neg logit contributions
ngth-0.345
 participating-0.291
 gaining-0.274
 finishing-0.272
 exerc-0.270
Token and
Token ID290
Feature activation+2.43e-01

Pos logit contributions
flat+1.107
alias+1.099
aque+1.082
yg+1.034
alia+1.021
Neg logit contributions
ngth-0.938
 participating-0.769
 gaining-0.718
 exerc-0.714
 finishing-0.714
Token suggested
Token ID5220
Feature activation+4.22e-01

Pos logit contributions
flat+0.188
alias+0.187
aque+0.183
yg+0.176
interstitial+0.175
Neg logit contributions
ngth-0.159
 participating-0.133
 gaining-0.124
 finishing-0.124
 exerc-0.123
Token they
Token ID484
Feature activation-9.81e-02

Pos logit contributions
flat+0.345
alias+0.342
aque+0.334
yg+0.325
interstitial+0.323
Neg logit contributions
ngth-0.257
 participating-0.208
 gaining-0.194
 fulfilled-0.194
 exerc-0.192
Token remove
Token ID4781
Feature activation+9.38e-01

Pos logit contributions
ngth+0.068
 participating+0.058
 gaining+0.055
 finishing+0.054
 exerc+0.054
Neg logit contributions
flat-0.072
alias-0.071
aque-0.070
yg-0.067
interstitial-0.066
Token the
Token ID262
Feature activation+2.61e-01

Pos logit contributions
flat+0.704
alias+0.699
aque+0.684
yg+0.658
alia+0.651
Neg logit contributions
ngth-0.633
 participating-0.534
 gaining-0.503
 finishing-0.500
 fulfilled-0.497
Token conduct
Token ID3189
Feature activation+1.59e-01

Pos logit contributions
flat+0.034
alias+0.033
aque+0.032
yg+0.031
interstitial+0.031
Neg logit contributions
ngth-0.033
 participating-0.028
 gaining-0.026
 finishing-0.026
 exerc-0.026
Token wasn
Token ID2492
Feature activation-4.30e-01

Pos logit contributions
flat+0.133
alias+0.132
aque+0.129
yg+0.124
alia+0.124
Neg logit contributions
ngth-0.094
 participating-0.076
 finishing-0.072
 gaining-0.071
 exerc-0.071
Token
Token ID447
Feature activation-2.14e-01

Pos logit contributions
ngth+0.291
 participating+0.240
 exerc+0.228
lished+0.226
 gaining+0.226
Neg logit contributions
flat-0.335
alias-0.335
aque-0.324
-0.313
yg-0.312
Token
Token ID247
Feature activation-1.16e-02

Pos logit contributions
ngth+0.239
 participating+0.215
lished+0.208
 gaining+0.208
 exerc+0.207
Neg logit contributions
alias-0.068
flat-0.068
aque-0.064
interstitial-0.059
yg-0.058
Tokent
Token ID83
Feature activation-4.36e-01

Pos logit contributions
ngth+0.008
 participating+0.007
 gaining+0.007
 finishing+0.007
 exerc+0.007
Neg logit contributions
flat-0.008
alias-0.008
aque-0.008
yg-0.007
alia-0.007
Token bribery
Token ID37388
Feature activation+1.53e+00

Pos logit contributions
ngth+0.310
 participating+0.262
 gaining+0.247
 finishing+0.247
 exerc+0.245
Neg logit contributions
flat-0.313
alias-0.311
aque-0.303
yg-0.290
-0.288
Token.
Token ID13
Feature activation+6.21e-03

Pos logit contributions
flat+1.126
alias+1.101
aque+1.082
alia+1.034
yg+1.031
Neg logit contributions
ngth-1.076
 participating-0.887
lished-0.847
 finishing-0.842
 gaining-0.833
Token On
Token ID1550
Feature activation+8.25e-02

Pos logit contributions
flat+0.004
alias+0.004
aque+0.004
yg+0.004
+0.004
Neg logit contributions
ngth-0.004
 participating-0.004
 gaining-0.004
 finishing-0.004
 exerc-0.004
Token the
Token ID262
Feature activation+1.66e-01

Pos logit contributions
alias+0.064
flat+0.064
aque+0.062
interstitial+0.060
yg+0.059
Neg logit contributions
ngth-0.053
 participating-0.046
 gaining-0.043
 finishing-0.043
 exerc-0.042
Token other
Token ID584
Feature activation-3.41e-01

Pos logit contributions
alias+0.125
flat+0.124
aque+0.121
interstitial+0.117
yg+0.116
Neg logit contributions
ngth-0.110
 participating-0.096
 gaining-0.090
 finishing-0.089
 exerc-0.089
Token hand
Token ID1021
Feature activation+1.05e+00

Pos logit contributions
ngth+0.225
 participating+0.194
 gaining+0.181
 finishing+0.181
 exerc+0.180
Neg logit contributions
flat-0.259
alias-0.259
aque-0.252
interstitial-0.246
-0.241
Token
Token ID198
Feature activation+2.63e-01

Pos logit contributions
flat+0.206
alias+0.205
aque+0.194
yg+0.186
ine+0.184
Neg logit contributions
ngth-0.313
 participating-0.263
lished-0.258
 exerc-0.257
 gaining-0.254
TokenChina
Token ID14581
Feature activation+5.41e-01

Pos logit contributions
flat+0.175
alias+0.174
aque+0.169
interstitial+0.161
yg+0.161
Neg logit contributions
ngth-0.200
 participating-0.174
 gaining-0.165
 finishing-0.165
 exerc-0.163
Token has
Token ID468
Feature activation-3.75e-01

Pos logit contributions
flat+0.408
alias+0.405
aque+0.396
yg+0.380
interstitial+0.379
Neg logit contributions
ngth-0.363
 participating-0.308
 gaining-0.289
 finishing-0.288
 exerc-0.286
Token added
Token ID2087
Feature activation-3.38e-01

Pos logit contributions
ngth+0.244
 participating+0.206
 gaining+0.193
 finishing+0.193
 exerc+0.191
Neg logit contributions
flat-0.290
alias-0.288
aque-0.282
yg-0.271
interstitial-0.270
Token about
Token ID546
Feature activation-5.33e-01

Pos logit contributions
ngth+0.234
 participating+0.201
 gaining+0.190
 finishing+0.189
 exerc+0.187
Neg logit contributions
flat-0.247
alias-0.246
aque-0.240
interstitial-0.230
yg-0.230
Token 40
Token ID2319
Feature activation-5.10e-01

Pos logit contributions
ngth+0.372
 participating+0.317
 gaining+0.299
 finishing+0.298
 exerc+0.296
Neg logit contributions
flat-0.389
alias-0.386
aque-0.377
interstitial-0.361
yg-0.360
Token airports
Token ID22600
Feature activation+7.99e-02

Pos logit contributions
ngth+0.353
 participating+0.308
 gaining+0.288
 finishing+0.288
 exerc+0.283
Neg logit contributions
flat-0.370
alias-0.369
aque-0.360
yg-0.345
interstitial-0.345
Token in
Token ID287
Feature activation-4.17e-02

Pos logit contributions
flat+0.062
alias+0.061
aque+0.060
yg+0.057
interstitial+0.057
Neg logit contributions
ngth-0.052
 participating-0.044
 gaining-0.041
 finishing-0.041
 exerc-0.041
Token the
Token ID262
Feature activation+4.31e-01

Pos logit contributions
ngth+0.029
 participating+0.024
 gaining+0.023
 finishing+0.023
 exerc+0.022
Neg logit contributions
flat-0.031
alias-0.031
aque-0.030
yg-0.029
interstitial-0.029
Token last
Token ID938
Feature activation-2.83e-01

Pos logit contributions
flat+0.372
alias+0.367
aque+0.359
yg+0.348
+0.344
Neg logit contributions
ngth-0.248
 participating-0.197
 exerc-0.184
 gaining-0.184
lished-0.182
Token decade
Token ID5707
Feature activation+9.02e-01

Pos logit contributions
ngth+0.185
 participating+0.156
 gaining+0.146
 finishing+0.145
 exerc+0.145
Neg logit contributions
flat-0.219
alias-0.217
aque-0.212
interstitial-0.204
yg-0.204
Token play
Token ID711
Feature activation+3.04e-01

Pos logit contributions
flat+0.741
alias+0.738
aque+0.719
yg+0.695
interstitial+0.686
Neg logit contributions
ngth-0.610
 participating-0.509
 gaining-0.478
 finishing-0.475
 exerc-0.473
Token out
Token ID503
Feature activation+9.55e-01

Pos logit contributions
flat+0.230
alias+0.228
aque+0.223
yg+0.214
interstitial+0.214
Neg logit contributions
ngth-0.204
 participating-0.173
 gaining-0.163
 finishing-0.162
 exerc-0.161
Token and
Token ID290
Feature activation+1.78e-01

Pos logit contributions
flat+0.708
alias+0.703
aque+0.684
yg+0.662
+0.654
Neg logit contributions
ngth-0.657
 participating-0.550
 exerc-0.516
 finishing-0.515
 gaining-0.515
Token just
Token ID655
Feature activation-2.94e-01

Pos logit contributions
flat+0.147
alias+0.146
aque+0.143
yg+0.138
uckland+0.137
Neg logit contributions
ngth-0.108
 participating-0.087
lished-0.081
 finishing-0.081
 exerc-0.081
Token how
Token ID703
Feature activation+4.06e-01

Pos logit contributions
ngth+0.149
 participating+0.115
 gaining+0.104
 exerc+0.104
lished+0.103
Neg logit contributions
flat-0.273
alias-0.271
aque-0.265
yg-0.257
-0.255
Token disruptive
Token ID28094
Feature activation+1.13e+00

Pos logit contributions
flat+0.325
alias+0.324
aque+0.316
yg+0.304
+0.302
Neg logit contributions
ngth-0.256
 participating-0.209
 finishing-0.195
 gaining-0.195
 exerc-0.193
Token it
Token ID340
Feature activation+1.17e-01

Pos logit contributions
alias+0.903
flat+0.902
aque+0.880
yg+0.850
interstitial+0.840
Neg logit contributions
ngth-0.711
 participating-0.580
 gaining-0.541
lished-0.540
 exerc-0.539
Token is
Token ID318
Feature activation+3.23e-01

Pos logit contributions
flat+0.093
alias+0.092
aque+0.090
yg+0.087
interstitial+0.086
Neg logit contributions
ngth-0.074
 participating-0.063
 gaining-0.058
 finishing-0.058
 exerc-0.058
Token going
Token ID1016
Feature activation-8.23e-02

Pos logit contributions
flat+0.250
alias+0.249
aque+0.242
yg+0.234
+0.232
Neg logit contributions
ngth-0.211
 participating-0.175
 finishing-0.163
 exerc-0.163
 gaining-0.163
Token to
Token ID284
Feature activation+9.40e-01

Pos logit contributions
ngth+0.056
 participating+0.047
lished+0.044
 exerc+0.044
 finishing+0.044
Neg logit contributions
flat-0.062
alias-0.061
aque-0.060
yg-0.058
interstitial-0.057
Token be
Token ID307
Feature activation+7.17e-01

Pos logit contributions
flat+0.535
alias+0.527
aque+0.514
interstitial+0.493
yg+0.479
Neg logit contributions
ngth-0.808
 participating-0.719
 gaining-0.689
 finishing-0.684
 fulfilled-0.676
Token office
Token ID2607
Feature activation+2.09e-01

Pos logit contributions
ngth+0.078
 participating+0.067
 gaining+0.064
 finishing+0.063
 exerc+0.063
Neg logit contributions
flat-0.074
alias-0.074
aque-0.072
yg-0.069
interstitial-0.069
Token early
Token ID1903
Feature activation-5.23e-02

Pos logit contributions
flat+0.159
alias+0.158
aque+0.154
yg+0.148
interstitial+0.146
Neg logit contributions
ngth-0.139
 participating-0.117
 gaining-0.110
 finishing-0.110
 exerc-0.108
Token next
Token ID1306
Feature activation+3.70e-01

Pos logit contributions
ngth+0.031
 participating+0.025
 gaining+0.023
 exerc+0.023
 finishing+0.023
Neg logit contributions
flat-0.044
alias-0.044
aque-0.043
yg-0.041
interstitial-0.041
Token year
Token ID614
Feature activation+5.79e-01

Pos logit contributions
flat+0.306
alias+0.304
aque+0.297
yg+0.287
interstitial+0.286
Neg logit contributions
ngth-0.222
 participating-0.184
 gaining-0.171
 finishing-0.170
 exerc-0.169
Token.
Token ID13
Feature activation-2.61e-01

Pos logit contributions
flat+0.433
alias+0.432
aque+0.421
yg+0.403
interstitial+0.401
Neg logit contributions
ngth-0.392
 participating-0.331
 gaining-0.311
 finishing-0.310
 exerc-0.308
Token But
Token ID887
Feature activation-5.18e-01

Pos logit contributions
ngth+0.199
 participating+0.170
 gaining+0.162
 exerc+0.161
 finishing+0.161
Neg logit contributions
alias-0.175
flat-0.175
aque-0.168
yg-0.162
-0.161
Token,
Token ID11
Feature activation-6.60e-01

Pos logit contributions
ngth+0.345
 participating+0.286
 gaining+0.268
 exerc+0.267
 finishing+0.267
Neg logit contributions
flat-0.396
alias-0.394
aque-0.384
yg-0.372
interstitial-0.367
Token in
Token ID287
Feature activation-3.06e-01

Pos logit contributions
ngth+0.433
 participating+0.360
 gaining+0.337
 exerc+0.335
 finishing+0.334
Neg logit contributions
flat-0.510
alias-0.508
aque-0.495
yg-0.479
interstitial-0.473
Token spite
Token ID15275
Feature activation+6.92e-01

Pos logit contributions
ngth+0.208
 participating+0.175
 gaining+0.164
 finishing+0.163
 exerc+0.163
Neg logit contributions
flat-0.230
alias-0.228
aque-0.222
yg-0.214
interstitial-0.212
Token of
Token ID286
Feature activation-5.97e-01

Pos logit contributions
flat+0.580
alias+0.579
aque+0.554
+0.543
yg+0.543
Neg logit contributions
ngth-0.420
lished-0.326
 participating-0.318
 exerc-0.307
 gaining-0.293
Token the
Token ID262
Feature activation-3.86e-01

Pos logit contributions
ngth+0.400
 participating+0.334
 gaining+0.313
 exerc+0.312
 finishing+0.311
Neg logit contributions
flat-0.453
alias-0.450
aque-0.439
yg-0.423
interstitial-0.420
Token,
Token ID11
Feature activation+1.66e-01

Pos logit contributions
flat+0.403
alias+0.400
aque+0.392
yg+0.374
+0.371
Neg logit contributions
ngth-0.335
 participating-0.274
lished-0.257
 exerc-0.256
 gaining-0.255
Token good
Token ID922
Feature activation-1.11e+00

Pos logit contributions
flat+0.130
alias+0.129
aque+0.126
yg+0.121
interstitial+0.120
Neg logit contributions
ngth-0.108
 participating-0.090
 gaining-0.084
 exerc-0.084
 finishing-0.083
Token layout
Token ID12461
Feature activation-1.09e+00

Pos logit contributions
ngth+0.742
 participating+0.638
 gaining+0.600
 finishing+0.599
 exerc+0.591
Neg logit contributions
flat-0.832
alias-0.829
aque-0.811
interstitial-0.779
yg-0.776
Token."
Token ID526
Feature activation-6.61e-01

Pos logit contributions
ngth+0.746
 participating+0.638
 gaining+0.600
 finishing+0.598
 exerc+0.593
Neg logit contributions
flat-0.808
alias-0.803
aque-0.784
yg-0.751
interstitial-0.751
Token
Token ID198
Feature activation+1.57e-01

Pos logit contributions
ngth+0.489
 participating+0.430
 gaining+0.405
 finishing+0.404
 fulfilled+0.400
Neg logit contributions
flat-0.444
alias-0.440
aque-0.430
interstitial-0.420
yg-0.413
Token
Token ID198
Feature activation-3.81e-02

Pos logit contributions
flat+0.092
alias+0.092
aque+0.087
yg+0.083
+0.082
Neg logit contributions
ngth-0.139
 participating-0.117
lished-0.114
 exerc-0.114
 gaining-0.113
TokenGoogle
Token ID11708
Feature activation-2.60e-01

Pos logit contributions
ngth+0.029
 participating+0.025
 gaining+0.024
 finishing+0.024
 exerc+0.024
Neg logit contributions
flat-0.025
alias-0.025
aque-0.024
yg-0.023
interstitial-0.023
Token wouldn
Token ID3636
Feature activation-3.68e-01

Pos logit contributions
ngth+0.174
 participating+0.149
 gaining+0.140
 finishing+0.139
 exerc+0.138
Neg logit contributions
flat-0.196
alias-0.195
aque-0.190
interstitial-0.183
yg-0.183
Token't
Token ID470
Feature activation-1.76e-01

Pos logit contributions
ngth+0.252
 participating+0.204
 exerc+0.195
lished+0.195
 gaining+0.194
Neg logit contributions
alias-0.286
flat-0.285
aque-0.275
yg-0.265
-0.263
Token comment
Token ID2912
Feature activation-1.29e+00

Pos logit contributions
ngth+0.133
 participating+0.116
 gaining+0.109
 finishing+0.109
 exerc+0.108
Neg logit contributions
flat-0.117
alias-0.117
aque-0.114
interstitial-0.108
yg-0.108
Token for
Token ID329
Feature activation-3.21e-01

Pos logit contributions
ngth+0.767
 participating+0.653
 gaining+0.608
 finishing+0.605
 exerc+0.595
Neg logit contributions
flat-1.064
alias-1.062
aque-1.040
interstitial-1.009
yg-1.000
Token which
Token ID543
Feature activation-1.11e-01

Pos logit contributions
flat+0.048
alias+0.048
aque+0.047
interstitial+0.045
yg+0.045
Neg logit contributions
ngth-0.042
 participating-0.036
 gaining-0.034
 finishing-0.034
 exerc-0.033
Token is
Token ID318
Feature activation+1.01e-01

Pos logit contributions
ngth+0.075
 participating+0.063
 gaining+0.060
 finishing+0.059
 exerc+0.059
Neg logit contributions
flat-0.084
alias-0.083
aque-0.081
yg-0.078
interstitial-0.078
Token how
Token ID703
Feature activation-7.97e-02

Pos logit contributions
flat+0.081
alias+0.081
aque+0.079
yg+0.076
+0.075
Neg logit contributions
ngth-0.064
 participating-0.052
lished-0.049
 exerc-0.048
 gaining-0.048
Token you
Token ID345
Feature activation+1.49e-01

Pos logit contributions
ngth+0.052
 participating+0.044
 gaining+0.041
 finishing+0.041
 exerc+0.041
Neg logit contributions
flat-0.062
alias-0.061
aque-0.060
yg-0.057
interstitial-0.057
Token know
Token ID760
Feature activation+9.71e-03

Pos logit contributions
flat+0.109
alias+0.109
aque+0.105
yg+0.102
+0.100
Neg logit contributions
ngth-0.103
 participating-0.087
 gaining-0.082
 exerc-0.081
lished-0.081
Token it
Token ID340
Feature activation+1.90e-01

Pos logit contributions
flat+0.007
alias+0.007
aque+0.007
yg+0.007
interstitial+0.007
Neg logit contributions
ngth-0.006
 participating-0.005
 gaining-0.005
 exerc-0.005
 finishing-0.005
Token
Token ID447
Feature activation+2.38e-01

Pos logit contributions
flat+0.149
alias+0.148
aque+0.144
yg+0.139
alia+0.138
Neg logit contributions
ngth-0.123
 participating-0.103
 exerc-0.097
 gaining-0.097
 finishing-0.096
Token
Token ID247
Feature activation+3.92e-01

Pos logit contributions
alias+0.074
flat+0.073
aque+0.069
interstitial+0.064
+0.062
Neg logit contributions
ngth-0.264
 participating-0.241
 gaining-0.232
 exerc-0.232
 finishing-0.231
Tokens
Token ID82
Feature activation+2.26e-01

Pos logit contributions
flat+0.272
alias+0.268
aque+0.261
yg+0.251
interstitial+0.246
Neg logit contributions
ngth-0.288
 participating-0.251
 gaining-0.237
 finishing-0.235
 exerc-0.235
Token a
Token ID257
Feature activation+3.33e-01

Pos logit contributions
flat+0.182
alias+0.180
aque+0.176
yg+0.169
+0.168
Neg logit contributions
ngth-0.141
 participating-0.116
 exerc-0.108
 gaining-0.108
 finishing-0.107
Token fucking
Token ID9372
Feature activation+2.58e-01

Pos logit contributions
flat+0.252
alias+0.248
aque+0.243
yg+0.233
uckland+0.232
Neg logit contributions
ngth-0.226
 participating-0.187
 exerc-0.178
 gaining-0.177
 finishing-0.175
Token is
Token ID318
Feature activation-5.73e-01

Pos logit contributions
flat+0.036
alias+0.036
aque+0.035
yg+0.034
alia+0.033
Neg logit contributions
ngth-0.035
 participating-0.029
 finishing-0.028
 gaining-0.027
 exerc-0.027
Token not
Token ID407
Feature activation-1.94e-01

Pos logit contributions
ngth+0.352
 participating+0.289
 finishing+0.269
 gaining+0.268
 exerc+0.267
Neg logit contributions
flat-0.467
alias-0.464
aque-0.454
yg-0.437
interstitial-0.435
Token a
Token ID257
Feature activation-6.14e-01

Pos logit contributions
ngth+0.123
 participating+0.100
 finishing+0.094
 gaining+0.094
 exerc+0.093
Neg logit contributions
flat-0.155
alias-0.153
aque-0.150
yg-0.144
interstitial-0.143
Token compliant
Token ID31332
Feature activation-1.04e+00

Pos logit contributions
ngth+0.395
 participating+0.316
 gaining+0.299
 finishing+0.298
lished+0.297
Neg logit contributions
flat-0.490
alias-0.482
aque-0.474
yg-0.456
uckland-0.451
Token partner
Token ID5212
Feature activation+7.19e-01

Pos logit contributions
ngth+0.669
 participating+0.551
 gaining+0.519
 finishing+0.516
 exerc+0.509
Neg logit contributions
flat-0.825
alias-0.820
aque-0.804
yg-0.772
interstitial-0.766
Token,"
Token ID553
Feature activation-1.28e+00

Pos logit contributions
flat+0.543
alias+0.540
aque+0.525
yg+0.506
alia+0.504
Neg logit contributions
ngth-0.486
 participating-0.395
lished-0.377
 finishing-0.374
 gaining-0.373
Token she
Token ID673
Feature activation+5.50e-01

Pos logit contributions
ngth+0.756
 participating+0.644
 gaining+0.598
 finishing+0.598
 exerc+0.586
Neg logit contributions
flat-1.056
alias-1.049
aque-1.028
interstitial-0.993
yg-0.987
Token said
Token ID531
Feature activation+7.73e-01

Pos logit contributions
flat+0.448
alias+0.446
aque+0.436
interstitial+0.421
yg+0.419
Neg logit contributions
ngth-0.334
 participating-0.283
 gaining-0.263
 finishing-0.263
 exerc-0.259
Token.
Token ID13
Feature activation-5.16e-02

Pos logit contributions
flat+0.590
alias+0.587
aque+0.574
interstitial+0.551
yg+0.551
Neg logit contributions
ngth-0.510
 participating-0.435
 gaining-0.408
 finishing-0.406
 exerc-0.402
Token "[
Token ID12878
Feature activation-9.13e-01

Pos logit contributions
ngth+0.029
 participating+0.024
 gaining+0.022
 finishing+0.022
 exerc+0.022
Neg logit contributions
flat-0.045
alias-0.044
aque-0.043
interstitial-0.043
yg-0.042
TokenBut
Token ID1537
Feature activation-5.43e-01

Pos logit contributions
ngth+0.652
 participating+0.562
 gaining+0.531
 finishing+0.529
 exerc+0.524
Neg logit contributions
flat-0.649
alias-0.646
aque-0.629
yg-0.602
interstitial-0.602
Tokenstate
Token ID5219
Feature activation+5.82e-01

Pos logit contributions
flat+0.128
alias+0.127
aque+0.122
interstitial+0.114
yg+0.114
Neg logit contributions
ngth-0.304
 participating-0.274
 gaining-0.263
 finishing-0.263
 exerc-0.260
Token:
Token ID25
Feature activation+2.35e-01

Pos logit contributions
flat+0.469
alias+0.467
aque+0.456
yg+0.438
interstitial+0.436
Neg logit contributions
ngth-0.363
 participating-0.303
 gaining-0.283
 finishing-0.281
 exerc-0.280
Token Option
Token ID16018
Feature activation+3.96e-01

Pos logit contributions
flat+0.220
alias+0.219
aque+0.215
yg+0.208
interstitial+0.208
Neg logit contributions
ngth-0.115
 participating-0.092
 gaining-0.084
 finishing-0.083
 exerc-0.082
Token<
Token ID27
Feature activation+9.45e-01

Pos logit contributions
flat+0.302
alias+0.300
aque+0.293
interstitial+0.283
yg+0.282
Neg logit contributions
ngth-0.259
 participating-0.224
 gaining-0.211
 finishing-0.209
 fulfilled-0.207
TokenBoard
Token ID29828
Feature activation+1.75e+00

Pos logit contributions
flat+0.492
alias+0.488
aque+0.471
interstitial+0.442
yg+0.442
Neg logit contributions
ngth-0.857
 participating-0.762
 gaining-0.729
 finishing-0.727
 exerc-0.724
Token>,
Token ID22330
Feature activation-9.92e-01

Pos logit contributions
flat+0.921
alias+0.915
aque+0.883
yg+0.831
interstitial+0.829
Neg logit contributions
ngth-1.567
 participating-1.392
 gaining-1.331
 finishing-1.328
 exerc-1.321
Token //
Token ID3373
Feature activation+6.22e-01

Pos logit contributions
ngth+0.676
 participating+0.580
 gaining+0.546
 finishing+0.544
 exerc+0.538
Neg logit contributions
flat-0.735
alias-0.731
aque-0.714
interstitial-0.685
yg-0.684
Token The
Token ID383
Feature activation+1.93e-01

Pos logit contributions
alias+0.447
flat+0.447
aque+0.433
yg+0.414
+0.410
Neg logit contributions
ngth-0.442
 participating-0.376
 gaining-0.355
 exerc-0.353
 finishing-0.352
Token previous
Token ID2180
Feature activation+5.94e-01

Pos logit contributions
flat+0.163
alias+0.163
aque+0.159
yg+0.153
interstitial+0.153
Neg logit contributions
ngth-0.112
 participating-0.091
 gaining-0.085
 exerc-0.084
 finishing-0.084
Token board
Token ID3096
Feature activation+1.87e+00

Pos logit contributions
flat+0.406
alias+0.403
aque+0.392
interstitial+0.376
yg+0.375
Neg logit contributions
ngth-0.440
 participating-0.384
 gaining-0.363
 finishing-0.362
 exerc-0.358
Token |
Token ID930
Feature activation-2.86e+00

Pos logit contributions
flat+1.310
alias+1.307
aque+1.273
yg+1.212
interstitial+1.202
Neg logit contributions
ngth-1.354
 participating-1.153
 gaining-1.090
 exerc-1.089
 finishing-1.084
Token get
Token ID651
Feature activation-1.15e+00

Pos logit contributions
ngth+0.014
 participating+0.012
 gaining+0.012
 finishing+0.011
 exerc+0.011
Neg logit contributions
flat-0.015
alias-0.015
aque-0.014
yg-0.014
interstitial-0.014
Token worse
Token ID4785
Feature activation+7.64e-02

Pos logit contributions
ngth+0.802
 participating+0.692
 gaining+0.653
 finishing+0.651
 exerc+0.643
Neg logit contributions
flat-0.836
alias-0.832
aque-0.812
interstitial-0.779
yg-0.776
Token as
Token ID355
Feature activation-3.57e-02

Pos logit contributions
flat+0.058
alias+0.058
aque+0.056
yg+0.054
+0.054
Neg logit contributions
ngth-0.051
 participating-0.043
 gaining-0.040
 finishing-0.040
 exerc-0.040
Token the
Token ID262
Feature activation-2.18e-01

Pos logit contributions
ngth+0.024
 participating+0.021
 gaining+0.020
 finishing+0.020
 exerc+0.020
Neg logit contributions
flat-0.026
alias-0.026
aque-0.026
interstitial-0.025
yg-0.024
Token election
Token ID3071
Feature activation-4.82e-01

Pos logit contributions
ngth+0.155
 participating+0.134
 gaining+0.126
 finishing+0.126
 exerc+0.125
Neg logit contributions
flat-0.155
alias-0.155
aque-0.151
interstitial-0.144
yg-0.144
Token approaches
Token ID10581
Feature activation-9.45e-01

Pos logit contributions
ngth+0.277
 participating+0.230
 gaining+0.213
 finishing+0.212
 exerc+0.210
Neg logit contributions
flat-0.409
alias-0.407
aque-0.398
interstitial-0.384
yg-0.384
Token.
Token ID13
Feature activation-4.64e-01

Pos logit contributions
ngth+0.622
 participating+0.532
 gaining+0.500
 finishing+0.499
 exerc+0.492
Neg logit contributions
flat-0.721
alias-0.717
aque-0.701
yg-0.673
interstitial-0.673
Token For
Token ID1114
Feature activation-3.16e-02

Pos logit contributions
ngth+0.351
 participating+0.305
 gaining+0.289
 finishing+0.288
 exerc+0.286
Neg logit contributions
flat-0.310
alias-0.309
aque-0.300
yg-0.286
interstitial-0.286
Token a
Token ID257
Feature activation-3.68e-01

Pos logit contributions
ngth+0.022
 participating+0.019
 gaining+0.018
 finishing+0.018
 exerc+0.017
Neg logit contributions
flat-0.023
alias-0.023
aque-0.022
interstitial-0.022
yg-0.021
Token few
Token ID1178
Feature activation-4.58e-01

Pos logit contributions
ngth+0.261
 participating+0.223
 gaining+0.210
 finishing+0.209
 exerc+0.209
Neg logit contributions
flat-0.264
alias-0.262
aque-0.256
yg-0.245
interstitial-0.244
Token years
Token ID812
Feature activation-3.50e-01

Pos logit contributions
ngth+0.304
 participating+0.259
 gaining+0.243
 finishing+0.242
 exerc+0.240
Neg logit contributions
flat-0.348
alias-0.346
aque-0.338
yg-0.324
interstitial-0.324
Token considering
Token ID6402
Feature activation-7.81e-01

Pos logit contributions
flat+0.210
alias+0.209
aque+0.204
yg+0.196
interstitial+0.195
Neg logit contributions
ngth-0.177
 participating-0.147
 finishing-0.137
 gaining-0.137
 exerc-0.136
Token the
Token ID262
Feature activation-1.82e-01

Pos logit contributions
ngth+0.488
 participating+0.412
 gaining+0.385
 finishing+0.383
 exerc+0.379
Neg logit contributions
flat-0.624
alias-0.621
aque-0.607
interstitial-0.584
yg-0.583
Token all
Token ID477
Feature activation-2.92e-02

Pos logit contributions
ngth+0.121
 participating+0.102
 gaining+0.096
 finishing+0.095
 exerc+0.095
Neg logit contributions
flat-0.138
alias-0.137
aque-0.135
yg-0.129
interstitial-0.129
Token-
Token ID12
Feature activation+3.47e-01

Pos logit contributions
ngth+0.021
 participating+0.018
 gaining+0.017
 finishing+0.017
 exerc+0.017
Neg logit contributions
flat-0.021
alias-0.021
aque-0.020
yg-0.019
interstitial-0.019
Tokenem
Token ID368
Feature activation+5.72e-01

Pos logit contributions
flat+0.245
alias+0.244
aque+0.238
yg+0.227
interstitial+0.227
Neg logit contributions
ngth-0.250
 participating-0.215
 gaining-0.204
 finishing-0.203
 exerc-0.201
Tokenbr
Token ID1671
Feature activation-1.04e+00

Pos logit contributions
flat+0.380
alias+0.379
aque+0.370
interstitial+0.352
yg+0.351
Neg logit contributions
ngth-0.434
 participating-0.376
 gaining-0.357
 finishing-0.357
 exerc-0.353
Tokenacing
Token ID4092
Feature activation+3.89e-01

Pos logit contributions
ngth+0.892
 participating+0.787
 gaining+0.752
 finishing+0.750
 exerc+0.745
Neg logit contributions
flat-0.591
alias-0.589
aque-0.573
interstitial-0.539
yg-0.538
Token inter
Token ID987
Feature activation-1.01e-01

Pos logit contributions
flat+0.284
alias+0.283
aque+0.276
yg+0.264
interstitial+0.264
Neg logit contributions
ngth-0.270
 participating-0.230
 gaining-0.217
 finishing-0.216
 exerc-0.214
Token p
Token ID279
Feature activation-1.02e+00

Pos logit contributions
ngth+0.083
 participating+0.073
 gaining+0.070
 finishing+0.070
 exerc+0.069
Neg logit contributions
flat-0.060
alias-0.060
aque-0.058
interstitial-0.055
yg-0.055
Tokenares
Token ID3565
Feature activation+1.95e+00

Pos logit contributions
ngth+0.877
 participating+0.776
 gaining+0.742
 finishing+0.739
 exerc+0.734
Neg logit contributions
flat-0.571
alias-0.568
aque-0.550
interstitial-0.520
yg-0.519
Token and
Token ID290
Feature activation-8.76e-02

Pos logit contributions
alias+1.097
flat+1.094
aque+1.063
yg+1.003
amic+0.998
Neg logit contributions
ngth-1.694
 participating-1.476
 gaining-1.408
 finishing-1.407
 fulfilled-1.396
Token Up
Token ID3205
Feature activation-6.48e-01

Pos logit contributions
flat+0.714
alias+0.711
aque+0.690
yg+0.654
interstitial+0.653
Neg logit contributions
ngth-0.957
 participating-0.838
 gaining-0.799
 finishing-0.795
 exerc-0.792
Token"
Token ID1
Feature activation+6.43e-01

Pos logit contributions
ngth+0.509
 participating+0.452
 gaining+0.428
 finishing+0.428
 fulfilled+0.421
Neg logit contributions
flat-0.409
alias-0.403
aque-0.394
interstitial-0.378
yg-0.375
Token under
Token ID739
Feature activation+4.28e-01

Pos logit contributions
flat+0.492
alias+0.490
aque+0.478
yg+0.459
interstitial+0.458
Neg logit contributions
ngth-0.425
 participating-0.360
 gaining-0.338
 finishing-0.336
 exerc-0.334
Token the
Token ID262
Feature activation+8.72e-02

Pos logit contributions
flat+0.323
alias+0.322
aque+0.314
interstitial+0.302
yg+0.301
Neg logit contributions
ngth-0.286
 participating-0.244
 gaining-0.229
 finishing-0.229
 exerc-0.226
Token Spe
Token ID2531
Feature activation-9.26e-01

Pos logit contributions
flat+0.061
alias+0.061
aque+0.059
yg+0.056
interstitial+0.056
Neg logit contributions
ngth-0.063
 participating-0.055
 gaining-0.052
 finishing-0.051
 exerc-0.051
Tokenlling
Token ID2680
Feature activation-7.32e-01

Pos logit contributions
ngth+0.608
 participating+0.514
 gaining+0.485
 exerc+0.483
 finishing+0.478
Neg logit contributions
alias-0.719
flat-0.711
aque-0.701
yg-0.670
interstitial-0.665
Token drop
Token ID4268
Feature activation-4.49e-03

Pos logit contributions
ngth+0.492
 participating+0.429
 gaining+0.403
 finishing+0.403
 exerc+0.395
Neg logit contributions
flat-0.546
alias-0.536
aque-0.526
interstitial-0.512
uckland-0.503
Token-
Token ID12
Feature activation-1.77e-01

Pos logit contributions
ngth+0.003
 participating+0.003
 gaining+0.003
 finishing+0.003
 exerc+0.003
Neg logit contributions
flat-0.003
alias-0.003
aque-0.003
interstitial-0.003
yg-0.003
Tokendown
Token ID2902
Feature activation-1.19e+00

Pos logit contributions
ngth+0.135
 participating+0.121
 gaining+0.114
 finishing+0.114
 exerc+0.112
Neg logit contributions
alias-0.111
flat-0.111
aque-0.109
interstitial-0.103
uckland-0.103
Token menu
Token ID6859
Feature activation+4.73e-01

Pos logit contributions
ngth+0.805
 participating+0.695
 gaining+0.656
 finishing+0.654
 exerc+0.645
Neg logit contributions
flat-0.888
alias-0.884
aque-0.863
interstitial-0.834
yg-0.827
Token.
Token ID13
Feature activation+4.54e-01

Pos logit contributions
flat+0.360
alias+0.358
aque+0.350
yg+0.336
interstitial+0.336
Neg logit contributions
ngth-0.314
 participating-0.267
 gaining-0.251
 finishing-0.250
 exerc-0.247
Token encouraging
Token ID12577
Feature activation-7.67e-01

Pos logit contributions
flat+0.093
alias+0.093
aque+0.091
interstitial+0.087
yg+0.087
Neg logit contributions
ngth-0.085
 participating-0.073
 gaining-0.069
 finishing-0.068
 exerc-0.068
Token.
Token ID13
Feature activation-3.05e-01

Pos logit contributions
ngth+0.539
 participating+0.465
 gaining+0.439
 finishing+0.437
 exerc+0.433
Neg logit contributions
flat-0.552
alias-0.550
aque-0.537
interstitial-0.513
yg-0.513
Token
Token ID198
Feature activation-6.57e-01

Pos logit contributions
ngth+0.229
 participating+0.199
 gaining+0.189
 finishing+0.188
 exerc+0.186
Neg logit contributions
flat-0.205
alias-0.204
aque-0.198
interstitial-0.189
yg-0.189
Token
Token ID198
Feature activation-5.59e-01

Pos logit contributions
ngth+0.560
 participating+0.477
 exerc+0.459
 gaining+0.458
lished+0.457
Neg logit contributions
flat-0.397
alias-0.396
aque-0.377
yg-0.361
-0.358
TokenSy
Token ID13940
Feature activation-7.57e-02

Pos logit contributions
ngth+0.408
 participating+0.354
 gaining+0.335
 finishing+0.333
 exerc+0.330
Neg logit contributions
flat-0.386
alias-0.383
aque-0.375
interstitial-0.358
yg-0.357
Tokenfy
Token ID24928
Feature activation-2.57e+00

Pos logit contributions
ngth+0.028
 participating+0.020
 exerc+0.018
 gaining+0.018
 finishing+0.017
Neg logit contributions
flat-0.080
alias-0.080
aque-0.079
yg-0.076
amic-0.076
Token
Token ID447
Feature activation-8.51e-02

Pos logit contributions
ngth+1.759
 participating+1.524
 gaining+1.436
 finishing+1.428
 fulfilled+1.411
Neg logit contributions
flat-1.890
alias-1.884
aque-1.845
interstitial-1.769
yg-1.753
Token
Token ID247
Feature activation-4.25e-01

Pos logit contributions
ngth+0.098
 participating+0.089
 gaining+0.086
 exerc+0.085
lished+0.085
Neg logit contributions
alias-0.024
flat-0.024
aque-0.022
interstitial-0.020
yg-0.020
Tokens
Token ID82
Feature activation-1.94e-01

Pos logit contributions
ngth+0.295
 participating+0.253
 gaining+0.238
 finishing+0.237
 exerc+0.235
Neg logit contributions
flat-0.310
alias-0.309
aque-0.302
interstitial-0.289
yg-0.289
Token return
Token ID1441
Feature activation-1.08e+00

Pos logit contributions
ngth+0.132
 participating+0.112
 gaining+0.106
 finishing+0.105
 exerc+0.105
Neg logit contributions
flat-0.145
alias-0.144
aque-0.141
yg-0.135
interstitial-0.135
Token to
Token ID284
Feature activation-2.46e-01

Pos logit contributions
ngth+0.688
 participating+0.585
 gaining+0.547
 finishing+0.545
 exerc+0.538
Neg logit contributions
flat-0.846
alias-0.843
aque-0.824
yg-0.791
interstitial-0.790
Token BBC
Token ID7823
Feature activation+1.00e+00

Pos logit contributions
ngth+0.164
 participating+0.140
 gaining+0.131
 finishing+0.131
 exerc+0.130
Neg logit contributions
flat-0.178
alias-0.177
aque-0.173
yg-0.165
interstitial-0.165
Token's
Token ID338
Feature activation-5.23e-01

Pos logit contributions
flat+0.762
alias+0.756
aque+0.740
alia+0.709
yg+0.700
Neg logit contributions
ngth-0.669
 participating-0.554
 exerc-0.524
 gaining-0.522
 finishing-0.518
Token Im
Token ID1846
Feature activation+1.61e-01

Pos logit contributions
ngth+0.381
 participating+0.330
 gaining+0.312
 finishing+0.311
 exerc+0.308
Neg logit contributions
flat-0.363
alias-0.361
aque-0.352
interstitial-0.336
yg-0.336
Tokenogen
Token ID6644
Feature activation+1.31e+00

Pos logit contributions
flat+0.102
alias+0.101
aque+0.098
yg+0.093
interstitial+0.093
Neg logit contributions
ngth-0.127
 participating-0.112
 gaining-0.106
 finishing-0.106
 exerc-0.105
Token F
Token ID376
Feature activation+3.11e-01

Pos logit contributions
flat+0.782
aque+0.743
alias+0.733
amic+0.718
+0.696
Neg logit contributions
ngth-1.112
 participating-0.960
 exerc-0.921
 gaining-0.909
lished-0.906
Tokenoul
Token ID2852
Feature activation-2.65e+00

Pos logit contributions
alias+0.190
flat+0.189
aque+0.187
yg+0.180
alia+0.174
Neg logit contributions
ngth-0.251
 participating-0.222
 gaining-0.211
 exerc-0.210
 finishing-0.210
Tokenkes
Token ID5209
Feature activation+4.22e-01

Pos logit contributions
ngth+2.116
 participating+1.828
 gaining+1.743
 exerc+1.737
 finishing+1.731
Neg logit contributions
flat-1.684
alias-1.673
aque-1.632
yg-1.548
interstitial-1.542
Token,
Token ID11
Feature activation-3.36e-01

Pos logit contributions
flat+0.329
alias+0.327
aque+0.319
interstitial+0.307
yg+0.307
Neg logit contributions
ngth-0.272
 participating-0.231
 gaining-0.217
 finishing-0.216
 exerc-0.213
Token in
Token ID287
Feature activation-3.19e-01

Pos logit contributions
ngth+0.213
 participating+0.181
 gaining+0.169
 finishing+0.168
 exerc+0.166
Neg logit contributions
flat-0.265
alias-0.263
aque-0.258
interstitial-0.248
yg-0.248
Token Geneva
Token ID20552
Feature activation-3.34e-01

Pos logit contributions
ngth+0.196
 participating+0.162
 gaining+0.152
 finishing+0.151
 exerc+0.151
Neg logit contributions
flat-0.260
alias-0.258
aque-0.253
interstitial-0.242
yg-0.242
Token,
Token ID11
Feature activation+4.00e-01

Pos logit contributions
ngth+0.230
 participating+0.202
 gaining+0.190
 finishing+0.189
 fulfilled+0.185
Neg logit contributions
flat-0.243
alias-0.241
aque-0.235
interstitial-0.227
yg-0.227
Tokenis
Token ID271
Feature activation+4.23e-02

Pos logit contributions
flat+0.262
alias+0.260
aque+0.251
interstitial+0.235
yg+0.235
Neg logit contributions
ngth-0.474
 participating-0.423
 gaining-0.405
 finishing-0.404
 exerc-0.402
Token-
Token ID12
Feature activation-5.05e-01

Pos logit contributions
flat+0.032
alias+0.032
aque+0.031
yg+0.029
interstitial+0.029
Neg logit contributions
ngth-0.029
 participating-0.024
 gaining-0.023
 finishing-0.023
 exerc-0.023
Token A
Token ID317
Feature activation-5.30e-01

Pos logit contributions
ngth+0.405
 participating+0.355
 gaining+0.338
 finishing+0.337
 exerc+0.334
Neg logit contributions
flat-0.313
alias-0.312
aque-0.304
interstitial-0.289
yg-0.288
Tokenyes
Token ID8505
Feature activation+5.07e-01

Pos logit contributions
ngth+0.387
 participating+0.334
 gaining+0.316
 finishing+0.315
 exerc+0.313
Neg logit contributions
flat-0.368
alias-0.366
aque-0.357
yg-0.341
interstitial-0.340
Tokenee
Token ID1453
Feature activation-3.30e-01

Pos logit contributions
flat+0.397
alias+0.395
aque+0.387
yg+0.372
interstitial+0.371
Neg logit contributions
ngth-0.327
 participating-0.273
 gaining-0.257
 finishing-0.255
 exerc-0.255
Token |
Token ID930
Feature activation-2.59e+00

Pos logit contributions
ngth+0.235
 participating+0.201
 gaining+0.190
 finishing+0.189
 exerc+0.188
Neg logit contributions
flat-0.236
alias-0.235
aque-0.229
yg-0.219
interstitial-0.219
Token When
Token ID1649
Feature activation-5.87e-01

Pos logit contributions
ngth+1.866
 participating+1.655
 gaining+1.559
 finishing+1.557
 exerc+1.530
Neg logit contributions
flat-1.788
alias-1.775
aque-1.742
interstitial-1.670
yg-1.652
Token did
Token ID750
Feature activation-1.41e-01

Pos logit contributions
ngth+0.394
 participating+0.338
 gaining+0.317
 finishing+0.316
 exerc+0.312
Neg logit contributions
flat-0.441
alias-0.440
aque-0.430
interstitial-0.412
yg-0.411
Token 7
Token ID767
Feature activation-9.75e-01

Pos logit contributions
ngth+0.097
 participating+0.084
 gaining+0.079
 finishing+0.079
 exerc+0.078
Neg logit contributions
flat-0.103
alias-0.103
aque-0.100
interstitial-0.096
yg-0.096
Tokening
Token ID278
Feature activation-7.59e-01

Pos logit contributions
ngth+0.702
 participating+0.612
 gaining+0.578
 finishing+0.576
 fulfilled+0.567
Neg logit contributions
flat-0.680
alias-0.677
aque-0.662
interstitial-0.635
yg-0.630
Token mad
Token ID8805
Feature activation-6.72e-01

Pos logit contributions
ngth+0.500
 participating+0.428
 gaining+0.402
 finishing+0.402
 exerc+0.395
Neg logit contributions
flat-0.578
alias-0.576
aque-0.563
interstitial-0.542
yg-0.539
Token Fri
Token ID19480
Feature activation-3.34e-01

Pos logit contributions
ngth+0.884
 participating+0.789
 gaining+0.752
 finishing+0.750
 exerc+0.743
Neg logit contributions
flat-0.581
alias-0.577
aque-0.561
interstitial-0.533
yg-0.528
Token,
Token ID11
Feature activation-7.84e-01

Pos logit contributions
ngth+0.257
 participating+0.218
 gaining+0.207
 finishing+0.207
 exerc+0.207
Neg logit contributions
flat-0.225
alias-0.223
aque-0.217
yg-0.207
-0.206
Token 28
Token ID2579
Feature activation-7.57e-01

Pos logit contributions
ngth+0.614
 participating+0.519
 gaining+0.494
 exerc+0.492
lished+0.490
Neg logit contributions
flat-0.521
alias-0.518
aque-0.495
yg-0.478
-0.476
Token July
Token ID2901
Feature activation-3.03e-01

Pos logit contributions
ngth+0.589
 participating+0.485
lished+0.464
 gaining+0.463
 exerc+0.461
Neg logit contributions
flat-0.518
alias-0.511
aque-0.487
yg-0.475
-0.474
Token 2017
Token ID2177
Feature activation-8.73e-01

Pos logit contributions
ngth+0.232
 participating+0.189
 finishing+0.180
 gaining+0.179
lished+0.179
Neg logit contributions
flat-0.210
alias-0.209
aque-0.199
yg-0.193
-0.191
Token |
Token ID930
Feature activation-2.91e+00

Pos logit contributions
ngth+0.656
 participating+0.544
 finishing+0.517
 gaining+0.516
 exerc+0.513
Neg logit contributions
flat-0.608
alias-0.605
aque-0.590
yg-0.567
-0.562
Token 00
Token ID3571
Feature activation-4.11e-01

Pos logit contributions
ngth+2.182
 participating+1.857
 gaining+1.763
 finishing+1.757
 exerc+1.747
Neg logit contributions
flat-1.999
alias-1.993
aque-1.920
yg-1.847
-1.845
Token:
Token ID25
Feature activation-2.37e-01

Pos logit contributions
ngth+0.269
 participating+0.212
lished+0.202
 gaining+0.200
 finishing+0.200
Neg logit contributions
flat-0.329
alias-0.322
aque-0.311
yg-0.311
FORE-0.304
Token32
Token ID2624
Feature activation-3.60e-01

Pos logit contributions
ngth+0.205
 participating+0.180
 gaining+0.172
 finishing+0.171
 exerc+0.171
Neg logit contributions
flat-0.134
alias-0.133
aque-0.129
yg-0.122
interstitial-0.121
Token
Token ID198
Feature activation-1.24e-01

Pos logit contributions
ngth+0.306
 participating+0.260
lished+0.252
 gaining+0.251
 exerc+0.250
Neg logit contributions
flat-0.216
alias-0.215
aque-0.205
yg-0.196
FORE-0.195
Token
Token ID198
Feature activation-2.50e-01

Pos logit contributions
ngth+0.109
 participating+0.092
lished+0.090
 exerc+0.089
 gaining+0.088
Neg logit contributions
flat-0.073
alias-0.073
aque-0.069
yg-0.066
FORE-0.066
Token story
Token ID1621
Feature activation+4.94e-01

Pos logit contributions
ngth+0.126
 participating+0.109
 gaining+0.103
 finishing+0.103
 exerc+0.102
Neg logit contributions
flat-0.115
alias-0.115
aque-0.112
yg-0.106
interstitial-0.106
Token.
Token ID13
Feature activation-1.53e-01

Pos logit contributions
flat+0.383
alias+0.382
aque+0.372
yg+0.357
interstitial+0.356
Neg logit contributions
ngth-0.320
 participating-0.269
 gaining-0.252
 exerc-0.252
 finishing-0.251
Token
Token ID198
Feature activation+2.24e-01

Pos logit contributions
ngth+0.117
 participating+0.102
 gaining+0.097
 finishing+0.096
 exerc+0.096
Neg logit contributions
flat-0.102
alias-0.101
aque-0.098
yg-0.094
interstitial-0.094
Token
Token ID198
Feature activation+1.78e-01

Pos logit contributions
flat+0.132
alias+0.131
aque+0.124
yg+0.119
ine+0.117
Neg logit contributions
ngth-0.196
 participating-0.165
lished-0.162
 exerc-0.161
 gaining-0.159
TokenRep
Token ID6207
Feature activation+6.70e-01

Pos logit contributions
flat+0.119
alias+0.118
aque+0.115
interstitial+0.110
yg+0.110
Neg logit contributions
ngth-0.133
 participating-0.116
 gaining-0.110
 finishing-0.110
 exerc-0.108
Tokenrint
Token ID22272
Feature activation-2.94e+00

Pos logit contributions
flat+0.438
alias+0.437
aque+0.427
yg+0.405
interstitial+0.403
Neg logit contributions
ngth-0.514
 participating-0.450
 gaining-0.427
 finishing-0.425
 exerc-0.424
Tokened
Token ID276
Feature activation-5.69e-01

Pos logit contributions
ngth+2.257
 participating+1.990
 finishing+1.890
 gaining+1.889
 fulfilled+1.859
Neg logit contributions
flat-1.908
alias-1.897
aque-1.857
interstitial-1.770
yg-1.766
Token in
Token ID287
Feature activation-9.55e-02

Pos logit contributions
ngth+0.365
 participating+0.309
 gaining+0.289
 finishing+0.288
 exerc+0.285
Neg logit contributions
flat-0.445
alias-0.443
aque-0.433
interstitial-0.416
yg-0.416
Token
Token ID198
Feature activation-6.19e-02

Pos logit contributions
ngth+0.065
 participating+0.056
 gaining+0.053
 finishing+0.052
 exerc+0.052
Neg logit contributions
flat-0.071
alias-0.070
aque-0.069
interstitial-0.066
yg-0.066
Token
Token ID198
Feature activation-2.31e-01

Pos logit contributions
ngth+0.054
 participating+0.046
 exerc+0.044
lished+0.044
 gaining+0.044
Neg logit contributions
flat-0.037
alias-0.037
aque-0.035
yg-0.033
-0.033
TokenIn
Token ID818
Feature activation-1.27e-02

Pos logit contributions
ngth+0.180
 participating+0.157
 gaining+0.149
 finishing+0.149
 exerc+0.147
Neg logit contributions
flat-0.147
alias-0.147
aque-0.143
interstitial-0.136
yg-0.136