TOP ACTIVATIONS
MAX = 22.670

the NDP in Alberta is gonna succeed by pitching a radical
I think there was an expectation gulf on what many thought
than Fear of the Heaven s and
ley . The hopes of people who regarded her
had ex cessive expectations of Congress .
Eastern Europe , and will NEVER apologise for being white ,
career ? Won t someone start giving out to
that due to the 5 years lifetime , what is
him , How would you vote ?
this match wasn 't going to last more than three days
of " This guy 's gonna let you create a custom
vehicle was destroyed in the April 1986 launch vehicle accident ,
that it wasn 't going to look good in the head
IP D warned that 2008 would be the UK 's worst
Pakistan cruise to 100 , lose 2 quick w ickets
why giant corps ( Google , Facebook , Netflix ) supported
larger image ). In the June 1 image , a large
going to have a hard time en r aging people about
t think she would be any less , or
more of the $ 1 billion in Kh ark iv loans

MIN ACTIVATIONS
MIN = -20.775

heavily using Angular JS . It 's a marvelous framework for
or just following along on social , the event promises to
so their teams are manufacturing all kinds of ways for them
abilia throughout the queue and other merchandise for sale in nearby
technical college assistance , among other changes . But
Angular JS . It 's a marvelous framework for modern web
of . In this post I 'll round up some got
with every new toy , there are some rough edges that
li > ul > body > You
it with one of your own . It
support , force protection and other related elements of logistics and
celebrities and posted them to social media sites including Reddit and
technology that become encoded in digital files are one such type
of normal fairness procedures in other C BA contexts . Industrial
near Alfred Lerner Hall . Other filming locations in Massachusetts were
plush ies , statues and other merchandise , as well as
on domestic spying legislation and other issues ." To put this
anti - Muslim content on social media as well , such
rog ate 's Court . Other sequences like the subway scene
knife . 3 . In a large bowl , combine the

INTERVAL 11.809 - 22.670
CONTAINS 1.102%

what s going to happen as we continue to
ALWAYS thought he had a chance to make it in the
e . g ., the 2015 referendum in Switzerland ), the
9 37 save percentage . For another thing ,
signaling that he has no intention of stopping until he is

INTERVAL 0.948 - 11.809
CONTAINS 42.071%

. If these were Ray Kelly s
M bo ] <|endoftext|> No one at Los Angeles International Airport
one format , if you were to believe ze ves and
: set dir = c \ :\ tmp : */ This
to smoking crack in a dr unk en st

INTERVAL -9.914 - 0.948
CONTAINS 54.437%

off the bat , we played the " What if the
and international co - operation . Just five years
on New York s waterfront , but the building
less tax , while a third , less - developed proposal
and best way to accomplish this ? Lever age your natural

INTERVAL -20.775 - -9.914
CONTAINS 2.390%

live harmon iously with your fellow men . I leave you
reg ains some continuity in his training .
what 's new on your favorite chess site :
take human rights ," he continued . La Rue
the nine team M WC . I will reiterate
Token the
Token ID262
Feature activation+1.40e+01

Pos logit contributions
 derail+8.487
 Dalai+7.810
 overthrow+7.633
 Clintons+7.553
 overturn+7.526
Neg logit contributions
worldly-13.882
 behavi-11.788
settings-10.879
properties-10.459
resources-10.222
Token NDP
Token ID20392
Feature activation+1.09e+01

Pos logit contributions
 derail+11.496
 Dalai+10.757
 overthrow+10.734
 Clintons+10.637
 Hague+10.213
Neg logit contributions
worldly-13.640
 behavi-11.157
settings-10.156
properties-9.844
resources-9.177
Token in
Token ID287
Feature activation+1.90e+00

Pos logit contributions
 derail+9.473
 Dalai+8.751
 overthrow+8.657
 Clintons+8.599
 overturn+8.494
Neg logit contributions
worldly-10.704
 behavi-8.362
settings-7.743
properties-7.449
resources-7.111
Token Alberta
Token ID15555
Feature activation+6.50e+00

Pos logit contributions
 derail+1.473
 overthrow+1.360
 Hague+1.319
 Dalai+1.318
 overturn+1.300
Neg logit contributions
worldly-2.285
 behavi-1.891
settings-1.726
properties-1.640
resources-1.595
Token is
Token ID318
Feature activation+8.09e+00

Pos logit contributions
 derail+5.891
 overthrow+5.456
 Dalai+5.441
 overturn+5.362
 Hague+5.334
Neg logit contributions
worldly-6.629
 behavi-5.186
settings-4.723
properties-4.476
resources-4.314
Token gonna
Token ID8066
Feature activation+2.17e+01

Pos logit contributions
 derail+6.553
 overthrow+5.990
 Dalai+5.943
 overturn+5.865
 Clintons+5.685
Neg logit contributions
worldly-8.907
 behavi-7.288
settings-6.778
properties-6.460
resources-6.267
Token succeed
Token ID6758
Feature activation+1.06e+01

Pos logit contributions
 derail+16.163
 overthrow+15.209
 overturn+14.380
 overtake+14.376
 assassinate+13.609
Neg logit contributions
worldly-21.074
 behavi-18.538
settings-16.156
properties-15.381
resources-15.278
Token by
Token ID416
Feature activation+6.79e+00

Pos logit contributions
 derail+7.961
 Dalai+7.307
 overthrow+7.246
 overturn+7.151
 Hague+6.961
Neg logit contributions
worldly-11.589
 behavi-9.355
settings-8.902
properties-8.474
resources-8.141
Token pitching
Token ID21088
Feature activation+1.61e+00

Pos logit contributions
 derail+5.566
 overthrow+5.143
 overturn+5.031
 Dalai+4.942
 Hague+4.868
Neg logit contributions
worldly-7.688
 behavi-6.274
settings-5.828
properties-5.490
resources-5.356
Token a
Token ID257
Feature activation-3.67e+00

Pos logit contributions
 derail+1.411
 overthrow+1.308
 Dalai+1.281
 Hague+1.262
 overturn+1.254
Neg logit contributions
worldly-1.782
 behavi-1.451
settings-1.334
properties-1.244
resources-1.218
Token radical
Token ID7702
Feature activation-2.12e-01

Pos logit contributions
worldly+3.926
 behavi+3.134
settings+2.871
properties+2.671
resources+2.603
Neg logit contributions
 derail-3.357
 overthrow-3.115
 Dalai-3.075
 Hague-3.042
 Clintons-2.986
TokenI
Token ID40
Feature activation-5.09e+00

Pos logit contributions
worldly+4.050
 behavi+3.256
settings+2.956
properties+2.752
versions+2.692
Neg logit contributions
 derail-3.371
 overthrow-3.120
 Dalai-3.086
 Hague-3.062
 Clintons-2.998
Token think
Token ID892
Feature activation+6.16e+00

Pos logit contributions
worldly+5.872
 behavi+4.768
settings+4.413
properties+4.126
versions+4.067
Neg logit contributions
 derail-4.115
 overthrow-3.791
 Dalai-3.731
 Hague-3.711
 Clintons-3.627
Token there
Token ID612
Feature activation-5.05e+00

Pos logit contributions
 derail+4.591
 overthrow+4.126
 overturn+4.063
 Hague+4.054
 Dalai+4.053
Neg logit contributions
worldly-7.407
 behavi-5.923
settings-5.609
properties-5.381
versions-5.075
Token was
Token ID373
Feature activation+5.18e+00

Pos logit contributions
worldly+5.624
 behavi+4.533
settings+4.166
properties+3.884
versions+3.816
Neg logit contributions
 derail-4.360
 overthrow-4.032
 Dalai-3.971
 Hague-3.940
 overtake-3.858
Token an
Token ID281
Feature activation+2.50e+00

Pos logit contributions
 derail+4.392
 overthrow+3.976
 Dalai+3.925
 overturn+3.891
 Hague+3.875
Neg logit contributions
worldly-5.762
 behavi-4.462
settings-4.253
properties-4.105
resources-3.850
Token expectation
Token ID17507
Feature activation+2.16e+01

Pos logit contributions
 derail+2.190
 overthrow+2.041
 Dalai+1.996
 overturn+1.961
 overtake+1.945
Neg logit contributions
worldly-2.767
 behavi-2.205
settings-2.078
properties-1.929
versions-1.896
Token gulf
Token ID41593
Feature activation+6.85e+00

Pos logit contributions
 derail+14.313
 overturn+12.960
 Dalai+12.821
 overthrow+12.737
 Hague+12.645
Neg logit contributions
worldly-20.186
 behavi-15.875
settings-15.413
properties-15.335
resources-14.034
Token on
Token ID319
Feature activation+4.26e+00

Pos logit contributions
 derail+4.843
 overthrow+4.315
 Dalai+4.299
 Hague+4.227
 overturn+4.197
Neg logit contributions
worldly-8.246
 behavi-6.716
settings-6.336
properties-6.069
resources-5.891
Token what
Token ID644
Feature activation+1.64e+00

Pos logit contributions
 derail+3.542
 overthrow+3.265
 Dalai+3.204
 overturn+3.185
 Hague+3.169
Neg logit contributions
worldly-4.815
 behavi-3.807
settings-3.580
properties-3.448
resources-3.252
Token many
Token ID867
Feature activation-1.34e+00

Pos logit contributions
 derail+1.378
 overthrow+1.259
 Dalai+1.253
 Hague+1.237
 Clintons+1.228
Neg logit contributions
worldly-1.865
 behavi-1.503
settings-1.397
properties-1.332
resources-1.278
Token thought
Token ID1807
Feature activation+6.96e+00

Pos logit contributions
worldly+1.667
 behavi+1.372
settings+1.278
properties+1.213
versions+1.181
Neg logit contributions
 derail-1.002
 overthrow-0.912
 Dalai-0.895
 Hague-0.886
 Clintons-0.871
Token than
Token ID621
Feature activation+1.77e+00

Pos logit contributions
 derail+0.562
 overthrow+0.516
 Dalai+0.508
 Hague+0.502
 overtake+0.498
Neg logit contributions
worldly-0.750
 behavi-0.625
settings-0.567
properties-0.535
versions-0.520
Token
Token ID564
Feature activation+5.77e+00

Pos logit contributions
 derail+1.427
 overthrow+1.301
 Dalai+1.292
 Hague+1.287
 overturn+1.252
Neg logit contributions
worldly-2.041
 behavi-1.708
settings-1.553
properties-1.470
resources-1.426
Token
Token ID246
Feature activation-5.46e-01

Pos logit contributions
 derail+4.319
 Dalai+4.001
 overthrow+3.947
 Hague+3.935
 Clintons+3.832
Neg logit contributions
worldly-6.713
 behavi-5.680
settings-5.213
properties-4.877
versions-4.845
TokenFear
Token ID37798
Feature activation+1.27e+01

Pos logit contributions
worldly+0.642
 behavi+0.534
settings+0.485
properties+0.455
versions+0.447
Neg logit contributions
 derail-0.435
 overthrow-0.397
 Dalai-0.394
 Hague-0.391
 overturn-0.380
Token of
Token ID286
Feature activation+1.76e+01

Pos logit contributions
 derail+10.120
 Dalai+9.354
 overthrow+9.128
 overtake+9.122
 Hague+8.965
Neg logit contributions
worldly-12.772
 behavi-10.549
settings-9.435
properties-8.916
versions-8.808
Token the
Token ID262
Feature activation+2.12e+01

Pos logit contributions
 derail+12.045
 Dalai+11.144
 Hague+10.897
 overthrow+10.629
 Clintons+10.550
Neg logit contributions
worldly-18.368
 behavi-15.595
settings-13.752
properties-13.104
selves-12.778
Token Heaven
Token ID11225
Feature activation+1.06e+01

Pos logit contributions
 derail+13.853
 Dalai+13.205
 Hague+12.776
 Clintons+12.632
 Massacre+12.500
Neg logit contributions
worldly-20.808
 behavi-17.957
settings-15.470
properties-15.339
selves-14.503
Tokens
Token ID82
Feature activation+1.22e+01

Pos logit contributions
 derail+8.989
 Dalai+8.713
 overthrow+8.520
 Hague+8.450
 Clintons+8.206
Neg logit contributions
worldly-10.743
 behavi-8.934
settings-7.749
properties-7.313
versions-7.192
Token
Token ID447
Feature activation+1.16e+00

Pos logit contributions
 derail+9.280
 Dalai+8.632
 overthrow+8.499
 Hague+8.360
 overtake+8.326
Neg logit contributions
worldly-12.515
 behavi-10.302
settings-9.194
properties-9.038
versions-8.585
Token
Token ID247
Feature activation-4.55e+00

Pos logit contributions
 derail+1.008
 overthrow+0.937
 Dalai+0.932
 Hague+0.921
 Clintons+0.903
Neg logit contributions
worldly-1.281
 behavi-1.035
settings-0.949
properties-0.885
versions-0.869
Token and
Token ID290
Feature activation+2.55e+00

Pos logit contributions
worldly+4.845
 behavi+3.861
settings+3.521
properties+3.274
versions+3.205
Neg logit contributions
 derail-4.167
 overthrow-3.866
 Dalai-3.818
 Hague-3.787
 overtake-3.711
Tokenley
Token ID1636
Feature activation+7.56e+00

Pos logit contributions
 derail+4.917
 overthrow+4.546
 Dalai+4.542
 Hague+4.393
 overturn+4.375
Neg logit contributions
worldly-8.929
 behavi-8.186
settings-6.958
properties-6.687
resources-6.580
Token.
Token ID13
Feature activation-1.13e-01

Pos logit contributions
 derail+5.819
 Dalai+5.402
 overthrow+5.243
 overtake+5.101
 overturn+5.085
Neg logit contributions
worldly-8.575
 behavi-7.049
settings-6.552
properties-6.282
resources-6.171
Token
Token ID447
Feature activation-4.46e+00

Pos logit contributions
worldly+0.131
 behavi+0.108
settings+0.098
properties+0.092
versions+0.091
Neg logit contributions
 derail-0.091
 Dalai-0.085
 overthrow-0.084
 Hague-0.083
 Clintons-0.082
Token
Token ID251
Feature activation-9.67e+00

Pos logit contributions
worldly+4.881
 behavi+3.909
settings+3.596
properties+3.344
versions+3.283
Neg logit contributions
 derail-3.923
 overthrow-3.626
 Dalai-3.571
 Hague-3.543
 overtake-3.479
Token The
Token ID383
Feature activation+8.77e+00

Pos logit contributions
worldly+11.245
 behavi+9.101
settings+8.613
properties+8.061
resources+7.973
Neg logit contributions
 derail-7.287
 overthrow-6.725
 overtake-6.412
 Dalai-6.353
 Hague-6.339
Token hopes
Token ID7176
Feature activation+2.12e+01

Pos logit contributions
 derail+7.566
 Dalai+7.041
 overthrow+7.026
 Clintons+6.855
 Hague+6.844
Neg logit contributions
worldly-9.083
 behavi-7.392
settings-6.756
properties-6.382
resources-6.139
Token of
Token ID286
Feature activation+1.07e+01

Pos logit contributions
 derail+15.293
 Dalai+13.831
 Hague+13.539
 Clintons+13.500
 overthrow+13.488
Neg logit contributions
worldly-19.337
 behavi-16.419
settings-15.171
properties-14.812
versions-13.742
Token people
Token ID661
Feature activation+4.78e+00

Pos logit contributions
 derail+8.817
 overturn+8.067
 overthrow+8.031
 Hague+7.774
 Dalai+7.762
Neg logit contributions
worldly-11.327
 behavi-9.109
settings-8.536
properties-8.029
versions-7.739
Token who
Token ID508
Feature activation+1.59e+00

Pos logit contributions
 derail+3.704
 overthrow+3.454
 overturn+3.343
 Dalai+3.330
 Hague+3.285
Neg logit contributions
worldly-5.549
 behavi-4.643
settings-4.312
properties-4.072
versions-3.973
Token regarded
Token ID11987
Feature activation+7.17e+00

Pos logit contributions
 derail+1.354
 overthrow+1.254
 Dalai+1.209
 overturn+1.201
 overtake+1.198
Neg logit contributions
worldly-1.796
 behavi-1.499
settings-1.357
properties-1.278
resources-1.249
Token her
Token ID607
Feature activation-3.15e+00

Pos logit contributions
 derail+5.656
 overthrow+5.190
 Hague+5.095
 Dalai+5.084
 overturn+5.014
Neg logit contributions
worldly-8.114
 behavi-6.708
settings-6.228
properties-5.861
versions-5.727
Token had
Token ID550
Feature activation+8.54e+00

Pos logit contributions
 derail+7.297
 overthrow+6.858
 overturn+6.802
 Hague+6.732
 Dalai+6.611
Neg logit contributions
worldly-9.330
 behavi-7.733
settings-7.030
properties-6.595
versions-6.520
Token
Token ID564
Feature activation+9.41e+00

Pos logit contributions
 derail+7.000
 overthrow+6.461
 overturn+6.332
 Dalai+6.155
 Hague+6.133
Neg logit contributions
worldly-9.296
 behavi-7.747
settings-7.144
properties-6.765
resources-6.642
Token
Token ID250
Feature activation+3.66e+00

Pos logit contributions
 derail+5.901
 overthrow+5.462
 Dalai+5.343
 Clintons+5.286
 overturn+5.221
Neg logit contributions
worldly-11.585
 behavi-9.985
settings-9.141
properties-8.749
versions-8.558
Tokenex
Token ID1069
Feature activation+6.98e-01

Pos logit contributions
 derail+3.278
 Dalai+3.045
 overthrow+3.033
 Hague+2.983
 overturn+2.971
Neg logit contributions
worldly-3.815
 behavi-3.129
settings-2.777
properties-2.621
versions-2.558
Tokencessive
Token ID45428
Feature activation+1.14e+01

Pos logit contributions
 derail+0.653
 overthrow+0.603
 Dalai+0.599
 Hague+0.593
 overturn+0.583
Neg logit contributions
worldly-0.718
 behavi-0.590
settings-0.522
properties-0.484
versions-0.469
Token expectations
Token ID9027
Feature activation+2.08e+01

Pos logit contributions
 derail+9.598
 overthrow+9.083
 overturn+8.858
 overtake+8.541
 Clintons+8.509
Neg logit contributions
worldly-11.653
 behavi-9.132
settings-8.407
properties-8.284
versions-7.792
Token
Token ID447
Feature activation+2.68e+00

Pos logit contributions
 derail+13.167
 overturn+12.278
 Clintons+11.915
 Dalai+11.896
 overthrow+11.794
Neg logit contributions
worldly-20.443
 behavi-16.338
settings-15.689
properties-15.255
versions-14.514
Token
Token ID251
Feature activation-4.43e+00

Pos logit contributions
 derail+2.213
 overthrow+2.053
 Dalai+2.016
 Hague+2.008
 overturn+1.981
Neg logit contributions
worldly-3.052
 behavi-2.524
settings-2.282
properties-2.137
resources-2.088
Token of
Token ID286
Feature activation+1.02e+01

Pos logit contributions
worldly+5.197
 behavi+4.238
settings+3.920
properties+3.672
versions+3.611
Neg logit contributions
 derail-3.550
 overthrow-3.262
 Dalai-3.210
 Hague-3.182
 overtake-3.111
Token Congress
Token ID3162
Feature activation+9.00e+00

Pos logit contributions
 derail+8.277
 overturn+7.731
 overthrow+7.634
 Clintons+7.426
 Hague+7.376
Neg logit contributions
worldly-11.108
 behavi-9.035
settings-8.423
properties-7.979
versions-7.591
Token.
Token ID13
Feature activation+4.81e-01

Pos logit contributions
 derail+7.069
 overturn+6.625
 overthrow+6.472
 Dalai+6.194
 Hague+6.183
Neg logit contributions
worldly-9.855
 behavi-8.023
settings-7.362
properties-7.043
versions-6.653
Token Eastern
Token ID8345
Feature activation-1.36e+00

Pos logit contributions
 derail+8.102
 overthrow+7.836
 Dalai+7.462
 overturn+7.419
 Hague+7.299
Neg logit contributions
worldly-10.896
 behavi-8.932
settings-8.249
properties-7.815
resources-7.395
Token Europe
Token ID2031
Feature activation+3.94e+00

Pos logit contributions
worldly+1.488
 behavi+1.193
settings+1.092
properties+1.025
resources+0.993
Neg logit contributions
 derail-1.211
 overthrow-1.126
 Dalai-1.110
 Hague-1.094
 overturn-1.077
Token,
Token ID11
Feature activation+8.69e+00

Pos logit contributions
 derail+3.195
 overthrow+2.972
 Dalai+2.905
 Hague+2.879
 overturn+2.856
Neg logit contributions
worldly-4.537
 behavi-3.754
settings-3.416
properties-3.214
versions-3.134
Token and
Token ID290
Feature activation+4.53e+00

Pos logit contributions
 derail+6.811
 overthrow+6.453
 overturn+6.134
 Dalai+6.115
 overtake+6.008
Neg logit contributions
worldly-9.997
 behavi-8.063
settings-7.527
properties-7.049
resources-6.861
Token will
Token ID481
Feature activation+1.20e+01

Pos logit contributions
 derail+3.611
 overthrow+3.376
 Dalai+3.231
 overturn+3.198
 overtake+3.175
Neg logit contributions
worldly-5.339
 behavi-4.399
settings-4.081
properties-3.844
resources-3.749
Token NEVER
Token ID33602
Feature activation+2.05e+01

Pos logit contributions
 derail+9.204
 overthrow+8.698
 overturn+8.124
 overtake+8.096
 assassinate+8.027
Neg logit contributions
worldly-13.652
 behavi-11.639
settings-10.324
properties-9.788
resources-9.602
Token apologise
Token ID39012
Feature activation+1.28e+01

Pos logit contributions
 derail+14.471
 overthrow+14.135
 overtake+13.300
 overturn+13.270
 assassinate+12.960
Neg logit contributions
worldly-20.161
 behavi-17.724
settings-15.461
properties-14.849
resources-14.415
Token for
Token ID329
Feature activation+7.32e+00

Pos logit contributions
 derail+7.917
 Dalai+7.276
 overthrow+7.187
 overturn+7.162
 Hague+7.145
Neg logit contributions
worldly-15.147
 behavi-12.544
settings-11.819
properties-11.321
versions-10.805
Token being
Token ID852
Feature activation+3.51e-01

Pos logit contributions
 derail+5.350
 overthrow+4.956
 Dalai+4.877
 Hague+4.809
 overturn+4.803
Neg logit contributions
worldly-8.792
 behavi-7.166
settings-6.772
properties-6.423
resources-6.259
Token white
Token ID2330
Feature activation-4.71e+00

Pos logit contributions
 derail+0.293
 overthrow+0.271
 Dalai+0.268
 Hague+0.263
 Clintons+0.259
Neg logit contributions
worldly-0.403
 behavi-0.327
settings-0.303
properties-0.283
resources-0.277
Token,
Token ID11
Feature activation+7.11e+00

Pos logit contributions
worldly+5.287
 behavi+4.262
settings+3.927
properties+3.664
versions+3.597
Neg logit contributions
 derail-3.984
 overthrow-3.673
 Dalai-3.618
 Hague-3.588
 Clintons-3.519
Token career
Token ID3451
Feature activation+4.94e+00

Pos logit contributions
worldly+2.192
 behavi+1.703
settings+1.548
properties+1.427
versions+1.390
Neg logit contributions
 derail-2.232
 overthrow-2.085
 Dalai-2.059
 Hague-2.041
 Clintons-2.006
Token?
Token ID30
Feature activation+2.59e-01

Pos logit contributions
 derail+3.598
 overthrow+3.214
 Dalai+3.110
 Hague+3.077
 overturn+3.071
Neg logit contributions
worldly-6.097
 behavi-4.927
settings-4.604
properties-4.430
versions-4.340
Token Won
Token ID23306
Feature activation+6.16e+00

Pos logit contributions
 derail+0.200
 Dalai+0.185
 overthrow+0.182
 Hague+0.181
 Clintons+0.176
Neg logit contributions
worldly-0.312
 behavi-0.257
settings-0.236
properties-0.222
versions-0.219
Token
Token ID447
Feature activation+9.35e+00

Pos logit contributions
 derail+3.052
 Dalai+2.662
 overthrow+2.632
 Hague+2.550
 overturn+2.509
Neg logit contributions
worldly-8.866
 behavi-7.502
settings-7.070
properties-6.799
versions-6.652
Token
Token ID247
Feature activation+1.99e+00

Pos logit contributions
 derail+6.306
 Dalai+6.028
 Hague+5.841
 overthrow+5.764
 overturn+5.606
Neg logit contributions
worldly-11.063
 behavi-9.837
settings-8.773
properties-8.105
versions-8.065
Tokent
Token ID83
Feature activation+2.02e+01

Pos logit contributions
 derail+1.666
 overthrow+1.544
 Dalai+1.534
 Hague+1.513
 overturn+1.477
Neg logit contributions
worldly-2.249
 behavi-1.839
settings-1.662
properties-1.552
versions-1.529
Token someone
Token ID2130
Feature activation+7.17e+00

Pos logit contributions
 derail+14.000
 Dalai+12.524
 Hague+12.504
 overturn+12.343
 overthrow+12.219
Neg logit contributions
worldly-19.488
 behavi-15.830
settings-15.014
properties-14.644
versions-14.135
Token start
Token ID923
Feature activation+6.04e+00

Pos logit contributions
 derail+5.525
 overthrow+4.989
 Dalai+4.840
 overturn+4.829
 overtake+4.784
Neg logit contributions
worldly-8.427
 behavi-7.102
settings-6.579
properties-6.276
resources-6.087
Token giving
Token ID3501
Feature activation+1.37e+00

Pos logit contributions
 derail+4.793
 overthrow+4.288
 Dalai+4.279
 overturn+4.277
 Hague+4.139
Neg logit contributions
worldly-6.991
 behavi-5.601
settings-5.281
properties-5.001
versions-4.901
Token out
Token ID503
Feature activation-3.42e-01

Pos logit contributions
 derail+0.846
 Dalai+0.752
 overthrow+0.748
 Hague+0.729
 overturn+0.707
Neg logit contributions
worldly-1.865
 behavi-1.572
settings-1.471
properties-1.410
versions-1.374
Token to
Token ID284
Feature activation+4.36e+00

Pos logit contributions
worldly+0.372
 behavi+0.297
settings+0.273
properties+0.257
versions+0.247
Neg logit contributions
 derail-0.307
 Dalai-0.285
 overthrow-0.283
 Hague-0.277
 overturn-0.273
Token that
Token ID326
Feature activation+3.50e+00

Pos logit contributions
 derail+4.781
 Dalai+4.317
 Hague+4.309
 overthrow+4.271
 overturn+4.167
Neg logit contributions
worldly-8.020
 behavi-6.628
settings-6.177
properties-5.829
versions-5.706
Token due
Token ID2233
Feature activation+7.67e-01

Pos logit contributions
 derail+2.892
 overthrow+2.662
 Dalai+2.638
 Hague+2.615
 overturn+2.583
Neg logit contributions
worldly-3.956
 behavi-3.229
settings-2.978
properties-2.785
resources-2.711
Token to
Token ID284
Feature activation+5.80e+00

Pos logit contributions
 derail+0.607
 overthrow+0.559
 Dalai+0.547
 Hague+0.545
 overturn+0.536
Neg logit contributions
worldly-0.907
 behavi-0.755
settings-0.688
properties-0.644
versions-0.635
Token the
Token ID262
Feature activation+9.46e+00

Pos logit contributions
 derail+5.066
 overthrow+4.654
 overturn+4.613
 Hague+4.601
 Dalai+4.580
Neg logit contributions
worldly-6.094
 behavi-4.839
settings-4.510
properties-4.140
resources-4.087
Token 5
Token ID642
Feature activation+1.35e+01

Pos logit contributions
 derail+8.958
 overthrow+8.339
 Dalai+8.222
 overturn+8.213
 Hague+8.192
Neg logit contributions
worldly-8.734
 behavi-6.870
settings-6.353
resources-5.779
properties-5.666
Token years
Token ID812
Feature activation+2.02e+01

Pos logit contributions
 derail+10.228
 Dalai+9.354
 overthrow+9.317
 Hague+9.147
 overturn+9.110
Neg logit contributions
worldly-13.577
 behavi-11.324
settings-10.170
properties-9.533
versions-9.329
Token lifetime
Token ID10869
Feature activation+1.43e+01

Pos logit contributions
 derail+15.803
 overthrow+14.326
 overturn+14.103
 Dalai+14.081
 Hague+13.895
Neg logit contributions
worldly-17.534
 behavi-14.189
settings-12.736
resources-11.947
versions-11.777
Token,
Token ID11
Feature activation+1.09e+01

Pos logit contributions
 derail+10.746
 Dalai+9.628
 overthrow+9.604
 Hague+9.532
 overturn+9.452
Neg logit contributions
worldly-15.116
 behavi-12.091
settings-11.294
resources-10.627
selves-10.469
Token what
Token ID644
Feature activation-3.48e-01

Pos logit contributions
 derail+8.615
 overthrow+7.871
 Dalai+7.856
 overturn+7.772
 Hague+7.703
Neg logit contributions
worldly-11.546
 behavi-9.223
settings-8.561
properties-7.847
selves-7.796
Token is
Token ID318
Feature activation+6.17e+00

Pos logit contributions
worldly+0.364
 behavi+0.292
settings+0.266
properties+0.247
versions+0.241
Neg logit contributions
 derail-0.324
 overthrow-0.300
 Dalai-0.298
 Hague-0.296
 overturn-0.290
Token
Token ID564
Feature activation+1.23e+01

Pos logit contributions
 derail+5.073
 Dalai+4.653
 overthrow+4.607
 Hague+4.557
 overturn+4.555
Neg logit contributions
worldly-6.783
 behavi-5.612
settings-5.143
properties-4.760
resources-4.672
Token him
Token ID683
Feature activation+1.57e+00

Pos logit contributions
 derail+7.601
 Dalai+7.019
 Hague+6.977
 overthrow+6.960
 Clintons+6.717
Neg logit contributions
worldly-11.025
 behavi-9.245
settings-8.531
properties-7.994
versions-7.849
Token,
Token ID11
Feature activation+1.15e+01

Pos logit contributions
 derail+1.220
 overthrow+1.114
 Dalai+1.110
 Hague+1.087
 overturn+1.066
Neg logit contributions
worldly-1.875
 behavi-1.557
settings-1.436
properties-1.349
resources-1.324
Token
Token ID564
Feature activation+5.03e+00

Pos logit contributions
 derail+8.381
 Dalai+7.817
 overthrow+7.694
 Hague+7.558
 overturn+7.359
Neg logit contributions
worldly-12.632
 behavi-10.576
settings-9.716
properties-9.077
resources-9.001
Token
Token ID246
Feature activation-3.98e-01

Pos logit contributions
 derail+3.472
 Dalai+3.206
 overthrow+3.190
 Hague+3.112
 overturn+3.085
Neg logit contributions
worldly-6.164
 behavi-5.305
settings-4.802
properties-4.498
versions-4.495
TokenHow
Token ID2437
Feature activation+7.38e+00

Pos logit contributions
worldly+0.454
 behavi+0.371
settings+0.339
properties+0.317
versions+0.312
Neg logit contributions
 derail-0.333
 overthrow-0.305
 Dalai-0.303
 Hague-0.301
 overtake-0.293
Token would
Token ID561
Feature activation+2.02e+01

Pos logit contributions
 derail+6.364
 Dalai+5.964
 overthrow+5.870
 overturn+5.701
 overtake+5.699
Neg logit contributions
worldly-7.595
 behavi-6.163
settings-5.827
properties-5.482
resources-5.322
Token you
Token ID345
Feature activation+1.33e+01

Pos logit contributions
 derail+13.944
 overthrow+12.743
 Dalai+12.634
 Hague+12.582
 overtake+12.292
Neg logit contributions
worldly-19.659
 behavi-15.794
settings-14.788
properties-14.182
versions-14.029
Token vote
Token ID3015
Feature activation+1.38e+01

Pos logit contributions
 derail+10.080
 overthrow+9.401
 overturn+8.911
 overtake+8.842
 assassinate+8.788
Neg logit contributions
worldly-14.778
 behavi-12.600
settings-11.042
resources-10.699
properties-10.586
Token?
Token ID30
Feature activation+5.09e+00

Pos logit contributions
 derail+9.277
 Dalai+8.628
 overthrow+8.310
 overturn+8.225
 overtake+8.111
Neg logit contributions
worldly-15.912
 behavi-13.065
settings-12.279
properties-11.999
resources-11.416
Token
Token ID447
Feature activation-4.30e+00

Pos logit contributions
 derail+3.633
 Dalai+3.415
 overthrow+3.301
 Hague+3.287
 Clintons+3.213
Neg logit contributions
worldly-6.213
 behavi-5.219
settings-4.749
properties-4.508
versions-4.446
Token
Token ID247
Feature activation-4.87e+00

Pos logit contributions
worldly+4.677
 behavi+3.745
settings+3.439
properties+3.198
versions+3.131
Neg logit contributions
 derail-3.801
 overthrow-3.511
 Dalai-3.456
 Hague-3.432
 overtake-3.372
Token this
Token ID428
Feature activation+3.48e-02

Pos logit contributions
 derail+4.425
 overthrow+4.028
 Dalai+3.931
 overturn+3.898
 Hague+3.872
Neg logit contributions
worldly-7.716
 behavi-6.562
settings-6.019
properties-5.730
versions-5.643
Token match
Token ID2872
Feature activation+3.60e+00

Pos logit contributions
 derail+0.034
 overthrow+0.032
 Dalai+0.031
 Hague+0.030
 overturn+0.030
Neg logit contributions
worldly-0.035
 behavi-0.028
settings-0.026
properties-0.024
resources-0.023
Token wasn
Token ID2492
Feature activation+4.35e+00

Pos logit contributions
 derail+3.269
 overthrow+3.071
 Dalai+2.974
 overturn+2.969
 overtake+2.962
Neg logit contributions
worldly-3.712
 behavi-3.075
settings-2.739
properties-2.559
resources-2.486
Token't
Token ID470
Feature activation+8.38e+00

Pos logit contributions
 derail+2.431
 overthrow+2.114
 Dalai+2.109
 Hague+2.101
 overturn+2.048
Neg logit contributions
worldly-6.001
 behavi-5.491
settings-4.790
properties-4.515
versions-4.472
Token going
Token ID1016
Feature activation+1.03e+01

Pos logit contributions
 derail+6.117
 overthrow+5.566
 overturn+5.434
 Dalai+5.313
 overtake+5.272
Neg logit contributions
worldly-9.782
 behavi-8.387
settings-7.625
properties-7.292
resources-7.013
Token to
Token ID284
Feature activation+2.01e+01

Pos logit contributions
 derail+7.177
 overthrow+6.524
 overtake+6.310
 overturn+6.249
 Hague+6.244
Neg logit contributions
worldly-12.412
 behavi-10.234
settings-9.556
properties-9.086
resources-8.882
Token last
Token ID938
Feature activation+6.71e+00

Pos logit contributions
 derail+12.385
 overthrow+10.823
 overtake+10.733
 overturn+10.487
 Hague+9.515
Neg logit contributions
worldly-22.798
 behavi-20.701
settings-18.107
properties-17.128
resources-17.035
Token more
Token ID517
Feature activation+4.24e+00

Pos logit contributions
 derail+5.796
 overthrow+5.338
 Dalai+5.256
 Hague+5.187
 overturn+5.165
Neg logit contributions
worldly-7.124
 behavi-5.825
settings-5.392
properties-4.939
resources-4.830
Token than
Token ID621
Feature activation+8.93e+00

Pos logit contributions
 derail+3.706
 overthrow+3.484
 Dalai+3.383
 Hague+3.361
 overtake+3.357
Neg logit contributions
worldly-4.418
 behavi-3.672
settings-3.345
properties-3.093
versions-2.993
Token three
Token ID1115
Feature activation+7.75e+00

Pos logit contributions
 derail+7.275
 overthrow+6.743
 Dalai+6.695
 Hague+6.629
 overturn+6.551
Neg logit contributions
worldly-9.460
 behavi-8.066
settings-7.210
properties-6.773
versions-6.505
Token days
Token ID1528
Feature activation+1.15e+01

Pos logit contributions
 derail+6.024
 overthrow+5.585
 overturn+5.420
 overtake+5.402
 Dalai+5.382
Neg logit contributions
worldly-8.494
 behavi-7.172
settings-6.521
properties-6.152
resources-5.913
Token of
Token ID286
Feature activation+2.09e+00

Pos logit contributions
 derail+5.163
 overthrow+4.755
 Dalai+4.671
 Hague+4.623
 overturn+4.618
Neg logit contributions
worldly-7.321
 behavi-6.170
settings-5.554
properties-5.288
versions-5.153
Token "
Token ID366
Feature activation+3.96e+00

Pos logit contributions
 derail+1.772
 overthrow+1.651
 Dalai+1.612
 Hague+1.605
 overturn+1.586
Neg logit contributions
worldly-2.337
 behavi-1.920
settings-1.754
properties-1.655
versions-1.607
TokenThis
Token ID1212
Feature activation+1.90e+00

Pos logit contributions
 derail+3.270
 overthrow+2.996
 Dalai+2.984
 Hague+2.961
 Clintons+2.870
Neg logit contributions
worldly-4.284
 behavi-3.678
settings-3.203
properties-2.989
versions-2.935
Token guy
Token ID3516
Feature activation+9.04e+00

Pos logit contributions
 derail+1.841
 overthrow+1.718
 Dalai+1.696
 Hague+1.680
 Clintons+1.651
Neg logit contributions
worldly-1.892
 behavi-1.503
settings-1.371
properties-1.276
resources-1.248
Token's
Token ID338
Feature activation+6.30e+00

Pos logit contributions
 derail+7.776
 overthrow+7.170
 Dalai+7.150
 overturn+7.004
 Hague+7.002
Neg logit contributions
worldly-9.324
 behavi-7.782
settings-6.922
properties-6.394
versions-6.327
Token gonna
Token ID8066
Feature activation+1.99e+01

Pos logit contributions
 derail+5.301
 overthrow+4.932
 Dalai+4.843
 Hague+4.731
 overturn+4.719
Neg logit contributions
worldly-6.824
 behavi-5.615
settings-5.181
properties-4.820
resources-4.733
Token let
Token ID1309
Feature activation+1.26e+01

Pos logit contributions
 derail+14.831
 overthrow+13.719
 assassinate+12.907
 overtake+12.767
 overturn+12.752
Neg logit contributions
worldly-19.898
 behavi-17.750
settings-15.361
resources-14.677
properties-14.295
Token you
Token ID345
Feature activation+7.33e+00

Pos logit contributions
 derail+8.318
 overthrow+7.546
 Dalai+7.462
 Hague+7.440
 Clintons+7.294
Neg logit contributions
worldly-14.577
 behavi-12.472
settings-11.496
properties-10.769
versions-10.736
Token create
Token ID2251
Feature activation+5.20e+00

Pos logit contributions
 derail+5.543
 overthrow+5.131
 overtake+4.836
 assassinate+4.800
 overturn+4.791
Neg logit contributions
worldly-8.678
 behavi-7.498
settings-6.730
resources-6.330
properties-6.310
Token a
Token ID257
Feature activation+2.38e-01

Pos logit contributions
 derail+4.367
 overthrow+4.014
 Dalai+3.967
 Hague+3.885
 Clintons+3.849
Neg logit contributions
worldly-5.730
 behavi-4.673
settings-4.280
properties-3.992
resources-3.901
Token custom
Token ID2183
Feature activation-1.53e+00

Pos logit contributions
 derail+0.225
 overthrow+0.208
 Dalai+0.206
 Hague+0.204
 Clintons+0.200
Neg logit contributions
worldly-0.248
 behavi-0.198
settings-0.180
properties-0.167
resources-0.163
Token vehicle
Token ID4038
Feature activation+2.87e+00

Pos logit contributions
 derail+3.764
 overthrow+3.475
 Dalai+3.416
 Hague+3.354
 overtake+3.344
Neg logit contributions
worldly-3.939
 behavi-3.138
settings-2.788
properties-2.617
resources-2.542
Token was
Token ID373
Feature activation+3.07e+00

Pos logit contributions
 derail+2.665
 overthrow+2.474
 Dalai+2.412
 Hague+2.398
 overturn+2.371
Neg logit contributions
worldly-3.038
 behavi-2.443
settings-2.192
properties-2.048
resources-2.002
Token destroyed
Token ID6572
Feature activation+4.05e+00

Pos logit contributions
 derail+2.809
 overthrow+2.597
 Dalai+2.543
 Hague+2.526
 overturn+2.502
Neg logit contributions
worldly-3.264
 behavi-2.702
settings-2.418
properties-2.256
resources-2.196
Token in
Token ID287
Feature activation-7.30e-02

Pos logit contributions
 derail+3.052
 Dalai+2.775
 overthrow+2.756
 Hague+2.729
 overturn+2.671
Neg logit contributions
worldly-4.866
 behavi-4.020
settings-3.731
properties-3.497
versions-3.441
Token the
Token ID262
Feature activation+8.78e+00

Pos logit contributions
worldly+0.094
 behavi+0.079
settings+0.074
properties+0.069
resources+0.068
Neg logit contributions
 derail-0.051
 overthrow-0.046
 Dalai-0.045
 Hague-0.045
 overturn-0.043
Token April
Token ID3035
Feature activation+1.99e+01

Pos logit contributions
 derail+6.696
 overthrow+6.201
 Dalai+5.994
 Hague+5.983
 overturn+5.796
Neg logit contributions
worldly-10.238
 behavi-8.634
settings-7.962
properties-7.438
resources-7.364
Token 1986
Token ID12113
Feature activation+1.15e+01

Pos logit contributions
 derail+13.729
 overthrow+13.117
 massacre+12.540
 Massacre+12.352
 Hague+12.106
Neg logit contributions
worldly-19.447
 behavi-17.492
settings-15.404
resources-14.327
properties-14.206
Token launch
Token ID4219
Feature activation+4.58e+00

Pos logit contributions
 derail+8.884
 overthrow+8.099
 Hague+7.965
 Dalai+7.796
 massacre+7.739
Neg logit contributions
worldly-13.052
 behavi-10.933
settings-10.072
properties-9.414
resources-9.247
Token vehicle
Token ID4038
Feature activation+4.50e+00

Pos logit contributions
 derail+3.639
 overthrow+3.311
 overturn+3.174
 Dalai+3.160
 Hague+3.111
Neg logit contributions
worldly-5.447
 behavi-4.462
settings-4.151
properties-3.895
versions-3.814
Token accident
Token ID5778
Feature activation+7.74e+00

Pos logit contributions
 derail+3.376
 overthrow+3.075
 overturn+2.919
 Dalai+2.907
 Hague+2.906
Neg logit contributions
worldly-5.614
 behavi-4.614
settings-4.326
properties-4.044
versions-3.974
Token,
Token ID11
Feature activation+7.99e+00

Pos logit contributions
 derail+5.666
 Hague+4.958
 Dalai+4.956
 overthrow+4.925
 overturn+4.919
Neg logit contributions
worldly-9.243
 behavi-7.662
settings-7.072
properties-6.704
versions-6.573
Token that
Token ID326
Feature activation+2.02e+00

Pos logit contributions
 derail+9.406
 overthrow+8.472
 overturn+8.470
 Hague+8.188
 overtake+8.103
Neg logit contributions
worldly-14.494
 behavi-12.302
settings-11.201
properties-10.375
resources-10.127
Token it
Token ID340
Feature activation+8.60e-01

Pos logit contributions
 derail+1.813
 overthrow+1.673
 overturn+1.626
 overtake+1.600
 Dalai+1.599
Neg logit contributions
worldly-2.193
 behavi-1.791
settings-1.643
properties-1.548
resources-1.502
Token wasn
Token ID2492
Feature activation+7.33e+00

Pos logit contributions
 derail+0.805
 overthrow+0.748
 overturn+0.727
 Dalai+0.725
 Hague+0.724
Neg logit contributions
worldly-0.901
 behavi-0.738
settings-0.660
properties-0.614
resources-0.600
Token't
Token ID470
Feature activation+1.00e+01

Pos logit contributions
 derail+3.518
 Hague+3.095
 Dalai+3.059
 overthrow+2.991
 overturn+2.945
Neg logit contributions
worldly-10.228
 behavi-9.944
settings-8.295
 redes-8.204
 practition-8.176
Token going
Token ID1016
Feature activation+9.92e+00

Pos logit contributions
 derail+8.513
 overthrow+7.653
 overturn+7.571
 Hague+7.281
 overtake+7.273
Neg logit contributions
worldly-10.531
 behavi-8.836
settings-8.053
properties-7.494
resources-7.232
Token to
Token ID284
Feature activation+1.99e+01

Pos logit contributions
 derail+7.081
 overthrow+6.340
 overtake+6.115
 overturn+6.062
 Hague+6.041
Neg logit contributions
worldly-12.018
 behavi-9.852
settings-9.232
properties-8.758
versions-8.603
Token look
Token ID804
Feature activation+9.41e+00

Pos logit contributions
 derail+14.625
 overthrow+12.609
 overturn+12.536
 overtake+12.290
 assassinate+11.380
Neg logit contributions
worldly-20.954
 behavi-18.657
settings-16.229
properties-15.180
resources-15.090
Token good
Token ID922
Feature activation+4.60e+00

Pos logit contributions
 derail+7.930
 overthrow+7.088
 Dalai+6.952
 Hague+6.917
 overturn+6.886
Neg logit contributions
worldly-9.806
 behavi-8.413
settings-7.715
properties-7.191
resources-7.017
Token in
Token ID287
Feature activation+9.31e-02

Pos logit contributions
 derail+3.674
 overthrow+3.338
 overturn+3.252
 Hague+3.214
 Dalai+3.213
Neg logit contributions
worldly-5.289
 behavi-4.408
settings-4.089
properties-3.861
resources-3.723
Token the
Token ID262
Feature activation+7.64e+00

Pos logit contributions
 derail+0.081
 overthrow+0.075
 Hague+0.073
 Dalai+0.072
 overturn+0.072
Neg logit contributions
worldly-0.104
 behavi-0.084
settings-0.077
properties-0.072
resources-0.070
Token head
Token ID1182
Feature activation+3.34e+00

Pos logit contributions
 derail+6.896
 overthrow+6.317
 overturn+6.183
 Hague+6.131
 Dalai+6.006
Neg logit contributions
worldly-7.866
 behavi-6.379
settings-5.803
properties-5.414
resources-5.305
TokenIP
Token ID4061
Feature activation+5.29e+00

Pos logit contributions
 derail+4.897
 overthrow+4.519
 Hague+4.417
 Dalai+4.376
 Clintons+4.333
Neg logit contributions
worldly-7.727
 behavi-6.337
settings-5.854
properties-5.544
versions-5.355
TokenD
Token ID35
Feature activation+4.95e+00

Pos logit contributions
 derail+4.283
 overthrow+3.972
 Dalai+3.915
 Hague+3.822
 overturn+3.802
Neg logit contributions
worldly-6.056
 behavi-4.893
settings-4.581
properties-4.329
resources-4.154
Token warned
Token ID7728
Feature activation+8.67e+00

Pos logit contributions
 derail+4.254
 overthrow+3.939
 Dalai+3.844
 Hague+3.815
 overturn+3.809
Neg logit contributions
worldly-5.441
 behavi-4.415
settings-4.040
properties-3.829
resources-3.672
Token that
Token ID326
Feature activation+6.40e+00

Pos logit contributions
 derail+6.779
 overthrow+6.081
 overturn+6.026
 Hague+6.012
 Dalai+5.925
Neg logit contributions
worldly-9.717
 behavi-8.117
settings-7.554
properties-7.066
versions-6.855
Token 2008
Token ID3648
Feature activation+1.48e+01

Pos logit contributions
 derail+5.447
 overthrow+4.972
 overturn+4.902
 Hague+4.873
 Dalai+4.839
Neg logit contributions
worldly-6.933
 behavi-5.667
settings-5.265
properties-4.905
resources-4.659
Token would
Token ID561
Feature activation+1.98e+01

Pos logit contributions
 derail+11.695
 overthrow+10.644
 overtake+10.564
 Hague+10.438
 overturn+10.351
Neg logit contributions
worldly-14.375
 behavi-11.769
settings-10.690
properties-10.195
resources-9.604
Token be
Token ID307
Feature activation+1.17e+01

Pos logit contributions
 derail+13.905
 overtake+12.726
 overthrow+12.319
 overturn+12.082
 assassinate+10.843
Neg logit contributions
worldly-21.397
 behavi-19.705
settings-16.858
resources-16.284
properties-16.261
Token the
Token ID262
Feature activation+1.44e+01

Pos logit contributions
 derail+9.433
 overtake+8.439
 overturn+8.414
 overthrow+8.401
 Dalai+8.133
Neg logit contributions
worldly-11.974
 behavi-10.084
settings-9.334
properties-8.673
resources-8.507
Token UK
Token ID3482
Feature activation+1.01e+01

Pos logit contributions
 derail+10.884
 overthrow+10.005
 Dalai+9.748
 overturn+9.692
 Hague+9.453
Neg logit contributions
worldly-14.985
 behavi-12.796
settings-11.653
properties-11.321
resources-10.741
Token's
Token ID338
Feature activation+7.32e+00

Pos logit contributions
 derail+7.689
 overthrow+7.001
 overtake+6.931
 overturn+6.820
 Hague+6.752
Neg logit contributions
worldly-11.226
 behavi-9.408
settings-8.542
properties-8.185
resources-7.723
Token worst
Token ID5290
Feature activation+6.60e+00

Pos logit contributions
 derail+5.228
 overthrow+4.795
 Dalai+4.608
 overturn+4.577
 overtake+4.540
Neg logit contributions
worldly-8.904
 behavi-7.471
settings-6.917
properties-6.670
resources-6.372
Token
Token ID246
Feature activation-1.14e-01

Pos logit contributions
 derail+3.725
 Dalai+3.375
 overthrow+3.367
 Hague+3.286
 Clintons+3.232
Neg logit contributions
worldly-7.092
 behavi-6.015
settings-5.609
properties-5.277
versions-5.237
TokenPakistan
Token ID38485
Feature activation+6.96e+00

Pos logit contributions
worldly+0.133
 behavi+0.109
settings+0.100
properties+0.093
versions+0.092
Neg logit contributions
 derail-0.094
 overthrow-0.086
 Dalai-0.085
 Hague-0.084
 overturn-0.083
Token cruise
Token ID20382
Feature activation+5.92e+00

Pos logit contributions
 derail+5.681
 overthrow+5.313
 overtake+5.200
 overturn+5.198
 Dalai+5.173
Neg logit contributions
worldly-7.574
 behavi-6.286
settings-5.676
properties-5.378
resources-5.221
Token to
Token ID284
Feature activation+1.42e+01

Pos logit contributions
 derail+4.947
 overthrow+4.584
 overtake+4.445
 overturn+4.430
 Dalai+4.410
Neg logit contributions
worldly-6.556
 behavi-5.400
settings-4.928
properties-4.650
resources-4.576
Token 100
Token ID1802
Feature activation+1.23e+01

Pos logit contributions
 derail+11.314
 overthrow+10.726
 overtake+10.516
 Dalai+10.447
 overturn+10.311
Neg logit contributions
worldly-14.006
 behavi-11.659
settings-10.957
properties-10.216
resources-9.820
Token,
Token ID11
Feature activation+1.97e+01

Pos logit contributions
 derail+9.747
 overthrow+9.162
 overturn+8.885
 overtake+8.847
 Dalai+8.808
Neg logit contributions
worldly-12.278
 behavi-10.222
settings-9.418
properties-9.023
resources-8.472
Token lose
Token ID4425
Feature activation+7.86e+00

Pos logit contributions
 derail+14.825
 overthrow+14.226
 overtake+14.036
 overturn+13.968
 Dalai+13.532
Neg logit contributions
worldly-18.213
 behavi-14.802
settings-13.253
properties-12.563
selves-12.174
Token 2
Token ID362
Feature activation+1.21e+01

Pos logit contributions
 derail+6.012
 overthrow+5.491
 overtake+5.398
 Dalai+5.380
 overturn+5.338
Neg logit contributions
worldly-8.940
 behavi-7.465
settings-6.851
properties-6.567
resources-6.402
Token quick
Token ID2068
Feature activation+1.57e+00

Pos logit contributions
 derail+9.317
 overthrow+8.767
 overtake+8.474
 overturn+8.455
 Dalai+8.412
Neg logit contributions
worldly-12.266
 behavi-10.377
settings-9.303
properties-8.912
resources-8.453
Token w
Token ID266
Feature activation+1.60e+00

Pos logit contributions
 derail+1.387
 overthrow+1.284
 Dalai+1.251
 Hague+1.244
 overtake+1.236
Neg logit contributions
worldly-1.710
 behavi-1.388
settings-1.260
properties-1.184
resources-1.155
Tokenickets
Token ID15970
Feature activation+9.07e+00

Pos logit contributions
 derail+1.414
 overthrow+1.320
 Hague+1.298
 Dalai+1.297
 Clintons+1.267
Neg logit contributions
worldly-1.730
 behavi-1.409
settings-1.248
properties-1.168
versions-1.139
Token why
Token ID1521
Feature activation+6.07e+00

Pos logit contributions
 derail+2.728
 Dalai+2.511
 overthrow+2.499
 Hague+2.444
 overturn+2.407
Neg logit contributions
worldly-3.984
 behavi-3.295
settings-3.058
properties-2.885
versions-2.795
Token giant
Token ID6175
Feature activation+3.02e+00

Pos logit contributions
 derail+4.212
 overthrow+3.841
 Dalai+3.838
 Hague+3.727
 Clintons+3.672
Neg logit contributions
worldly-7.471
 behavi-6.255
settings-5.846
properties-5.592
resources-5.442
Token corps
Token ID16812
Feature activation+7.19e+00

Pos logit contributions
 derail+2.454
 overthrow+2.268
 Dalai+2.212
 Hague+2.210
 Clintons+2.189
Neg logit contributions
worldly-3.500
 behavi-2.844
settings-2.638
properties-2.452
resources-2.416
Token (
Token ID357
Feature activation+5.04e+00

Pos logit contributions
 derail+5.443
 overthrow+4.998
 Dalai+4.868
 overturn+4.798
 overtake+4.738
Neg logit contributions
worldly-8.482
 behavi-6.926
settings-6.468
properties-6.109
resources-5.962
TokenGoogle
Token ID11708
Feature activation+8.52e+00

Pos logit contributions
 derail+4.839
 Dalai+4.516
 overthrow+4.511
 Hague+4.434
 Clintons+4.400
Neg logit contributions
worldly-4.795
 behavi-3.879
settings-3.349
properties-3.075
resources-2.968
Token,
Token ID11
Feature activation+1.97e+01

Pos logit contributions
 derail+6.095
 overthrow+5.717
 Dalai+5.652
 Clintons+5.462
 Hague+5.441
Neg logit contributions
worldly-10.175
 behavi-8.307
settings-7.736
properties-7.262
resources-7.070
Token Facebook
Token ID3203
Feature activation+3.97e+00

Pos logit contributions
 derail+14.755
 Clintons+14.040
 overthrow+13.980
 Dalai+13.853
 Hague+13.500
Neg logit contributions
worldly-18.101
 behavi-14.361
settings-13.233
properties-11.869
resources-11.638
Token,
Token ID11
Feature activation+1.48e+01

Pos logit contributions
 derail+2.900
 overthrow+2.662
 Dalai+2.620
 Hague+2.574
 Clintons+2.553
Neg logit contributions
worldly-4.889
 behavi-3.997
settings-3.731
properties-3.505
resources-3.442
Token Netflix
Token ID12074
Feature activation+5.27e+00

Pos logit contributions
 derail+11.909
 overthrow+11.234
 Dalai+11.093
 Clintons+10.938
 Hague+10.770
Neg logit contributions
worldly-14.566
 behavi-11.330
settings-10.458
properties-9.551
versions-9.361
Token)
Token ID8
Feature activation+4.96e+00

Pos logit contributions
 derail+3.629
 overthrow+3.325
 Dalai+3.265
 Hague+3.199
 Clintons+3.179
Neg logit contributions
worldly-6.644
 behavi-5.425
settings-5.105
properties-4.782
resources-4.714
Token supported
Token ID4855
Feature activation+1.14e+01

Pos logit contributions
 derail+4.453
 overthrow+4.128
 Dalai+4.011
 overturn+3.989
 overtake+3.956
Neg logit contributions
worldly-5.295
 behavi-4.196
settings-3.875
properties-3.623
resources-3.533
Token larger
Token ID4025
Feature activation+1.84e+00

Pos logit contributions
 derail+0.424
 overthrow+0.395
 Hague+0.387
 Dalai+0.386
 overturn+0.378
Neg logit contributions
worldly-0.484
 behavi-0.390
settings-0.355
properties-0.330
resources-0.323
Token image
Token ID2939
Feature activation+1.98e+00

Pos logit contributions
 derail+1.737
 overthrow+1.614
 Dalai+1.582
 Hague+1.581
 Clintons+1.550
Neg logit contributions
worldly-1.891
 behavi-1.537
settings-1.389
properties-1.291
resources-1.257
Token).
Token ID737
Feature activation-2.91e+00

Pos logit contributions
 derail+1.540
 overthrow+1.408
 Dalai+1.387
 Hague+1.375
 overturn+1.353
Neg logit contributions
worldly-2.352
 behavi-1.971
settings-1.770
properties-1.677
versions-1.622
Token In
Token ID554
Feature activation-3.43e+00

Pos logit contributions
worldly+2.896
 behavi+2.279
settings+2.048
properties+1.882
versions+1.845
Neg logit contributions
 derail-2.863
 overthrow-2.672
 Dalai-2.652
 Hague-2.633
 Clintons-2.583
Token the
Token ID262
Feature activation+6.20e+00

Pos logit contributions
worldly+3.784
 behavi+3.044
settings+2.788
properties+2.602
resources+2.542
Neg logit contributions
 derail-3.015
 overthrow-2.791
 Dalai-2.751
 Hague-2.730
 overtake-2.673
Token June
Token ID2795
Feature activation+1.97e+01

Pos logit contributions
 derail+5.782
 overthrow+5.381
 Dalai+5.323
 Hague+5.302
 Clintons+5.183
Neg logit contributions
worldly-6.190
 behavi-5.019
settings-4.478
properties-4.213
resources-4.049
Token 1
Token ID352
Feature activation+1.45e+01

Pos logit contributions
 derail+15.286
 overthrow+14.579
 Hague+14.013
 overturn+13.946
 massacre+13.937
Neg logit contributions
worldly-16.801
 behavi-14.627
settings-12.286
resources-11.272
properties-11.261
Token image
Token ID2939
Feature activation+6.89e+00

Pos logit contributions
 derail+12.063
 overthrow+11.388
 Dalai+11.026
 massacre+10.970
 Hague+10.968
Neg logit contributions
worldly-13.369
 behavi-11.008
settings-9.623
properties-8.866
versions-8.805
Token,
Token ID11
Feature activation+7.36e+00

Pos logit contributions
 derail+5.061
 Dalai+4.590
 overthrow+4.533
 Hague+4.463
 overturn+4.398
Neg logit contributions
worldly-8.137
 behavi-6.966
settings-6.260
properties-5.949
resources-5.777
Token a
Token ID257
Feature activation-2.28e+00

Pos logit contributions
 derail+6.072
 Dalai+5.585
 overthrow+5.545
 Hague+5.453
 overturn+5.408
Neg logit contributions
worldly-7.982
 behavi-6.577
settings-6.014
properties-5.567
resources-5.427
Token large
Token ID1588
Feature activation+7.94e-01

Pos logit contributions
worldly+2.396
 behavi+1.925
settings+1.756
properties+1.640
resources+1.596
Neg logit contributions
 derail-2.122
 overthrow-1.961
 Dalai-1.940
 Hague-1.920
 overturn-1.884
Token going
Token ID1016
Feature activation+1.12e+01

Pos logit contributions
 derail+8.979
 overthrow+8.098
 Clintons+8.058
 overturn+8.051
 Dalai+7.960
Neg logit contributions
worldly-11.319
 behavi-9.401
settings-8.732
properties-8.249
resources-8.018
Token to
Token ID284
Feature activation+1.76e+01

Pos logit contributions
 derail+8.279
 overthrow+7.788
 Clintons+7.461
 overturn+7.460
 Dalai+7.408
Neg logit contributions
worldly-12.242
 behavi-10.252
settings-9.642
properties-9.164
resources-8.946
Token have
Token ID423
Feature activation+8.70e+00

Pos logit contributions
 derail+14.032
 overthrow+12.862
 overturn+12.390
 overtake+12.305
 Hague+12.217
Neg logit contributions
worldly-17.447
 behavi-14.698
settings-13.515
properties-12.731
resources-12.508
Token a
Token ID257
Feature activation+4.76e+00

Pos logit contributions
 derail+6.749
 overthrow+6.107
 Dalai+6.020
 Clintons+5.953
 Hague+5.908
Neg logit contributions
worldly-9.946
 behavi-8.283
settings-7.622
properties-7.177
versions-7.068
Token hard
Token ID1327
Feature activation+6.29e+00

Pos logit contributions
 derail+4.294
 overthrow+3.955
 Dalai+3.936
 Hague+3.838
 Clintons+3.817
Neg logit contributions
worldly-5.104
 behavi-4.137
settings-3.812
properties-3.578
resources-3.451
Token time
Token ID640
Feature activation+1.97e+01

Pos logit contributions
 derail+5.782
 overthrow+5.433
 Dalai+5.235
 overtake+5.218
 overturn+5.093
Neg logit contributions
worldly-6.473
 behavi-5.229
settings-4.764
properties-4.519
versions-4.362
Token en
Token ID551
Feature activation+1.40e+00

Pos logit contributions
 derail+14.524
 overturn+13.804
 overthrow+13.261
 overtake+12.654
 Clintons+12.476
Neg logit contributions
worldly-18.817
 behavi-15.725
settings-14.872
properties-13.721
resources-13.519
Tokenr
Token ID81
Feature activation+1.13e+00

Pos logit contributions
 derail+1.408
 overthrow+1.317
 Dalai+1.296
 Hague+1.294
 overturn+1.280
Neg logit contributions
worldly-1.344
 behavi-1.061
settings-0.945
properties-0.874
resources-0.844
Tokenaging
Token ID3039
Feature activation+9.53e+00

Pos logit contributions
 derail+1.140
 overthrow+1.058
 Hague+1.039
 Dalai+1.038
 overturn+1.024
Neg logit contributions
worldly-1.098
 behavi-0.864
settings-0.772
properties-0.709
versions-0.696
Token people
Token ID661
Feature activation+5.23e+00

Pos logit contributions
 derail+7.163
 Clintons+6.533
 overthrow+6.532
 overturn+6.427
 Dalai+6.408
Neg logit contributions
worldly-10.869
 behavi-8.995
settings-8.453
properties-7.934
resources-7.680
Token about
Token ID546
Feature activation+6.41e+00

Pos logit contributions
 derail+4.071
 overthrow+3.769
 Dalai+3.696
 overturn+3.656
 Hague+3.631
Neg logit contributions
worldly-6.076
 behavi-5.075
settings-4.660
properties-4.426
versions-4.270
Token
Token ID447
Feature activation+1.11e+01

Pos logit contributions
 derail+1.582
 Dalai+1.385
 Hague+1.317
 overthrow+1.251
 overturn+1.201
Neg logit contributions
worldly-7.374
 behavi-6.794
settings-6.027
properties-5.737
 redes-5.702
Token
Token ID247
Feature activation+1.98e+00

Pos logit contributions
 derail+7.311
 Dalai+7.115
 Hague+6.744
 overthrow+6.644
 overturn+6.504
Neg logit contributions
worldly-12.838
 behavi-11.537
settings-10.032
versions-9.317
properties-9.299
Tokent
Token ID83
Feature activation+1.37e+01

Pos logit contributions
 derail+1.683
 overthrow+1.553
 Dalai+1.534
 Hague+1.504
 overturn+1.489
Neg logit contributions
worldly-2.249
 behavi-1.862
settings-1.642
properties-1.550
versions-1.504
Token think
Token ID892
Feature activation+1.35e+01

Pos logit contributions
 derail+9.738
 overthrow+8.791
 overturn+8.620
 assassinate+8.384
 overtake+8.209
Neg logit contributions
worldly-15.738
 behavi-13.934
settings-12.015
properties-11.746
resources-11.336
Token she
Token ID673
Feature activation+2.43e+00

Pos logit contributions
 derail+8.998
 overturn+8.170
 Clintons+8.147
 Hague+8.145
 Dalai+8.091
Neg logit contributions
worldly-15.538
 behavi-12.785
settings-11.969
properties-11.552
versions-11.001
Token would
Token ID561
Feature activation+1.95e+01

Pos logit contributions
 derail+2.227
 overthrow+2.057
 Dalai+2.036
 overturn+2.013
 Hague+1.981
Neg logit contributions
worldly-2.595
 behavi-2.091
settings-1.865
properties-1.774
resources-1.712
Token be
Token ID307
Feature activation+1.13e+01

Pos logit contributions
 derail+13.943
 overthrow+12.780
 overturn+12.700
 assassinate+11.836
 overtake+11.627
Neg logit contributions
worldly-20.049
 behavi-18.097
settings-14.997
properties-14.794
resources-14.346
Token any
Token ID597
Feature activation+3.57e+00

Pos logit contributions
 derail+9.471
 overthrow+8.742
 overturn+8.739
 Dalai+8.454
 Clintons+8.383
Neg logit contributions
worldly-11.436
 behavi-9.297
settings-8.826
properties-8.236
resources-8.024
Token less
Token ID1342
Feature activation+7.13e+00

Pos logit contributions
 derail+3.485
 overthrow+3.232
 Dalai+3.172
 Clintons+3.131
 Hague+3.105
Neg logit contributions
worldly-3.506
 behavi-2.769
settings-2.550
properties-2.369
resources-2.286
Token,
Token ID11
Feature activation+1.31e+01

Pos logit contributions
 derail+6.776
 overthrow+6.225
 overturn+6.144
 Dalai+6.121
 Clintons+6.101
Neg logit contributions
worldly-6.951
 behavi-5.577
settings-5.216
properties-4.785
resources-4.637
Token or
Token ID393
Feature activation+8.04e+00

Pos logit contributions
 derail+8.088
 Dalai+7.385
 Hague+7.175
 overthrow+7.038
 Clintons+6.960
Neg logit contributions
worldly-15.765
 behavi-12.963
settings-12.231
properties-11.702
versions-11.321
Token more
Token ID517
Feature activation+2.19e+00

Pos logit contributions
 derail+4.723
 overthrow+4.381
 Dalai+4.271
 Hague+4.254
 overturn+4.252
Neg logit contributions
worldly-5.701
 behavi-4.584
settings-4.299
properties-3.964
resources-3.830
Token of
Token ID286
Feature activation+3.40e+00

Pos logit contributions
 derail+2.050
 overthrow+1.914
 Dalai+1.866
 Hague+1.851
 Clintons+1.840
Neg logit contributions
worldly-2.265
 behavi-1.810
settings-1.667
properties-1.535
resources-1.490
Token the
Token ID262
Feature activation+8.08e+00

Pos logit contributions
 derail+2.798
 overthrow+2.586
 Dalai+2.546
 Hague+2.534
 Clintons+2.491
Neg logit contributions
worldly-3.829
 behavi-3.151
settings-2.916
properties-2.710
resources-2.646
Token $
Token ID720
Feature activation+8.55e+00

Pos logit contributions
 derail+7.463
 overthrow+7.032
 Dalai+6.867
 Clintons+6.863
 overturn+6.741
Neg logit contributions
worldly-7.824
 behavi-6.375
settings-5.932
properties-5.400
versions-5.202
Token1
Token ID16
Feature activation+1.14e+01

Pos logit contributions
 derail+4.543
 Dalai+4.083
 Hague+4.026
 overthrow+3.964
 Clintons+3.842
Neg logit contributions
worldly-11.167
 behavi-10.413
settings-8.802
properties-8.320
versions-8.196
Token billion
Token ID2997
Feature activation+1.95e+01

Pos logit contributions
 derail+8.749
 overthrow+8.119
 Dalai+7.807
 overtake+7.779
 Clintons+7.745
Neg logit contributions
worldly-12.216
 behavi-10.082
settings-9.360
properties-8.907
versions-8.538
Token in
Token ID287
Feature activation+5.38e+00

Pos logit contributions
 derail+15.300
 overthrow+14.306
 Clintons+14.099
 Dalai+13.878
 overturn+13.833
Neg logit contributions
worldly-16.843
 behavi-14.582
settings-13.089
properties-12.034
resources-11.637
Token Kh
Token ID5311
Feature activation+8.59e+00

Pos logit contributions
 derail+5.047
 overthrow+4.708
 Dalai+4.602
 Clintons+4.553
 Hague+4.542
Neg logit contributions
worldly-5.368
 behavi-4.336
settings-3.971
properties-3.641
resources-3.509
Tokenark
Token ID668
Feature activation+5.90e+00

Pos logit contributions
 derail+6.830
 overthrow+6.389
 Hague+6.311
 Dalai+6.241
 overturn+6.009
Neg logit contributions
worldly-9.127
 behavi-8.001
settings-6.906
properties-6.417
versions-6.176
Tokeniv
Token ID452
Feature activation+6.80e+00

Pos logit contributions
 derail+4.697
 overthrow+4.333
 Hague+4.241
 Dalai+4.230
 overturn+4.136
Neg logit contributions
worldly-6.734
 behavi-5.742
settings-5.005
properties-4.713
selves-4.569
Token loans
Token ID10021
Feature activation+7.04e+00

Pos logit contributions
 derail+6.193
 overthrow+5.730
 Hague+5.613
 Dalai+5.558
 overtake+5.536
Neg logit contributions
worldly-6.786
 behavi-5.549
settings-5.018
versions-4.595
properties-4.525
Token heavily
Token ID7272
Feature activation-6.42e+00

Pos logit contributions
worldly+5.922
 behavi+4.480
settings+4.013
properties+3.632
versions+3.566
Neg logit contributions
 derail-7.101
 overthrow-6.681
 Dalai-6.612
 Hague-6.567
 Clintons-6.468
Token using
Token ID1262
Feature activation-9.93e+00

Pos logit contributions
worldly+6.289
 behavi+4.834
settings+4.463
properties+4.085
versions+4.014
Neg logit contributions
 derail-6.294
 overthrow-5.881
 Dalai-5.813
 Hague-5.744
 overtake-5.656
Token Angular
Token ID28147
Feature activation-1.29e+01

Pos logit contributions
worldly+9.294
 behavi+7.187
settings+6.599
properties+6.007
versions+5.961
Neg logit contributions
 derail-9.604
 overthrow-8.974
 Dalai-8.903
 Hague-8.898
 Clintons-8.732
TokenJS
Token ID20120
Feature activation-1.33e+01

Pos logit contributions
worldly+13.691
 behavi+10.743
settings+10.380
properties+9.604
versions+9.569
Neg logit contributions
 derail-10.355
 Dalai-9.650
 overthrow-9.612
 Hague-9.472
 overturn-9.213
Token.
Token ID13
Feature activation-1.65e+01

Pos logit contributions
worldly+10.540
 behavi+8.331
settings+7.373
properties+6.596
resources+6.507
Neg logit contributions
 derail-13.484
 Dalai-12.787
 overthrow-12.678
 Hague-12.646
 Clintons-12.505
Token It
Token ID632
Feature activation-2.08e+01

Pos logit contributions
worldly+13.212
 behavi+9.772
settings+9.564
properties+8.965
resources+8.460
Neg logit contributions
 derail-15.769
 overthrow-14.795
 overtake-14.566
 assassinate-14.504
 Hague-14.350
Token's
Token ID338
Feature activation-1.26e+01

Pos logit contributions
worldly+14.757
settings+9.653
properties+9.022
self+8.426
resources+8.287
Neg logit contributions
 derail-20.418
 assassinate-19.681
 overthrow-19.415
 Clintons-19.215
 overtake-19.021
Token a
Token ID257
Feature activation-1.92e+01

Pos logit contributions
worldly+10.697
settings+6.950
 behavi+6.860
versions+6.279
properties+6.276
Neg logit contributions
 derail-12.784
 Dalai-12.037
 Hague-11.978
 overthrow-11.964
 Clintons-11.949
Token marvelous
Token ID43937
Feature activation-1.28e+01

Pos logit contributions
worldly+12.150
 behavi+6.919
settings+5.482
 webs+5.235
properties+5.072
Neg logit contributions
 derail-21.894
 Dalai-21.780
 Hague-21.459
 assassinate-20.998
 overthrow-20.952
Token framework
Token ID9355
Feature activation-1.08e+01

Pos logit contributions
worldly+10.859
 behavi+7.438
settings+7.076
 webs+6.593
resources+6.483
Neg logit contributions
 derail-13.114
 Dalai-12.589
 assassinate-12.355
 Hague-12.286
 Clintons-12.245
Token for
Token ID329
Feature activation-9.23e+00

Pos logit contributions
worldly+10.855
settings+7.753
 behavi+7.528
properties+7.257
resources+7.056
Neg logit contributions
 derail-9.545
 Dalai-8.900
 overthrow-8.894
 Hague-8.745
 Clintons-8.708
Token or
Token ID393
Feature activation+9.07e-01

Pos logit contributions
worldly+6.367
 behavi+5.320
settings+4.969
properties+4.698
versions+4.630
Neg logit contributions
 derail-3.212
 overthrow-2.903
 Dalai-2.841
 Hague-2.812
 overtake-2.732
Token just
Token ID655
Feature activation-4.58e+00

Pos logit contributions
 derail+0.693
 overthrow+0.629
 Dalai+0.620
 Hague+0.611
 overtake+0.602
Neg logit contributions
worldly-1.106
 behavi-0.916
settings-0.846
properties-0.800
resources-0.783
Token following
Token ID1708
Feature activation-8.25e+00

Pos logit contributions
worldly+5.129
 behavi+4.126
settings+3.806
properties+3.547
versions+3.488
Neg logit contributions
 derail-3.883
 overthrow-3.592
 Dalai-3.551
 Hague-3.518
 Clintons-3.437
Token along
Token ID1863
Feature activation-1.17e+01

Pos logit contributions
worldly+8.888
 behavi+6.988
settings+6.496
properties+5.991
versions+5.929
Neg logit contributions
 derail-7.144
 overthrow-6.677
 Dalai-6.519
 Hague-6.464
 Clintons-6.358
Token on
Token ID319
Feature activation-1.15e+01

Pos logit contributions
worldly+12.835
 behavi+10.253
settings+9.404
properties+8.666
versions+8.649
Neg logit contributions
 derail-9.223
 overthrow-8.683
 Hague-8.333
 Dalai-8.216
 Clintons-8.210
Token social
Token ID1919
Feature activation-2.06e+01

Pos logit contributions
worldly+9.928
 behavi+7.459
settings+6.608
versions+5.935
 webs+5.887
Neg logit contributions
 derail-11.690
 overthrow-11.131
 Hague-11.064
 Dalai-10.890
 Clintons-10.726
Token,
Token ID11
Feature activation-2.75e+00

Pos logit contributions
worldly+15.090
 behavi+12.364
settings+10.679
 webs+9.885
 endeavors+9.539
Neg logit contributions
 derail-18.149
 Hague-17.924
 overthrow-17.572
 Dalai-17.358
 Clintons-17.049
Token the
Token ID262
Feature activation+2.45e+00

Pos logit contributions
worldly+3.434
 behavi+2.838
settings+2.639
properties+2.487
versions+2.447
Neg logit contributions
 derail-2.002
 overthrow-1.822
 Dalai-1.791
 Hague-1.773
 overtake-1.728
Token event
Token ID1785
Feature activation-2.22e+00

Pos logit contributions
 derail+2.086
 Dalai+1.921
 overthrow+1.917
 Hague+1.873
 Clintons+1.842
Neg logit contributions
worldly-2.739
 behavi-2.239
settings-2.029
properties-1.909
versions-1.853
Token promises
Token ID10497
Feature activation-3.58e+00

Pos logit contributions
worldly+2.245
 behavi+1.771
settings+1.593
properties+1.481
versions+1.446
Neg logit contributions
 derail-2.165
 overthrow-2.009
 Dalai-1.990
 Hague-1.973
 overturn-1.947
Token to
Token ID284
Feature activation+5.36e+00

Pos logit contributions
worldly+4.073
 behavi+3.305
settings+3.037
properties+2.839
versions+2.794
Neg logit contributions
 derail-3.040
 overthrow-2.801
 Dalai-2.758
 Hague-2.733
 overturn-2.675
Token so
Token ID523
Feature activation-4.77e+00

Pos logit contributions
 derail+1.891
 overthrow+1.703
 Dalai+1.659
 Hague+1.643
 overturn+1.637
Neg logit contributions
worldly-3.314
 behavi-2.746
settings-2.557
properties-2.417
versions-2.360
Token their
Token ID511
Feature activation-9.48e+00

Pos logit contributions
worldly+5.911
 behavi+4.870
settings+4.532
properties+4.267
versions+4.205
Neg logit contributions
 derail-3.484
 overthrow-3.184
 Dalai-3.135
 Hague-3.101
 overtake-3.017
Token teams
Token ID3466
Feature activation-4.49e+00

Pos logit contributions
worldly+10.147
 behavi+8.004
settings+7.369
versions+6.850
properties+6.800
Neg logit contributions
 derail-7.999
 overthrow-7.450
 Hague-7.432
 Dalai-7.401
 assassinate-7.158
Token are
Token ID389
Feature activation-4.34e+00

Pos logit contributions
worldly+5.110
 behavi+4.070
settings+3.811
properties+3.543
versions+3.496
Neg logit contributions
 derail-3.662
 overthrow-3.383
 Dalai-3.381
 Hague-3.339
 Clintons-3.276
Token manufacturing
Token ID9138
Feature activation-1.07e+01

Pos logit contributions
worldly+4.852
 behavi+3.912
settings+3.598
properties+3.358
versions+3.294
Neg logit contributions
 derail-3.729
 overthrow-3.445
 Dalai-3.396
 Hague-3.367
 overtake-3.297
Token all
Token ID477
Feature activation-2.02e+01

Pos logit contributions
worldly+10.785
 behavi+8.455
settings+7.711
versions+7.114
properties+7.106
Neg logit contributions
 derail-9.660
 overthrow-9.073
 Dalai-8.961
 Hague-8.934
 overtake-8.684
Token kinds
Token ID6982
Feature activation-1.01e+01

Pos logit contributions
worldly+15.140
settings+9.430
 behavi+9.421
versions+9.274
 webs+8.905
Neg logit contributions
 derail-19.902
 overthrow-18.720
 assassinate-18.657
 Dalai-18.640
 overtake-18.502
Token of
Token ID286
Feature activation-1.63e+01

Pos logit contributions
worldly+12.057
settings+9.201
properties+8.704
 behavi+8.529
resources+8.453
Neg logit contributions
 derail-7.126
 overthrow-6.633
 assassinate-6.527
 Dalai-6.357
 overtake-6.332
Token ways
Token ID2842
Feature activation-4.22e+00

Pos logit contributions
worldly+12.782
 behavi+9.262
settings+8.260
versions+7.692
 webs+7.441
Neg logit contributions
 derail-16.739
 Hague-16.002
 Dalai-15.961
 overthrow-15.866
 assassinate-15.489
Token for
Token ID329
Feature activation-6.48e+00

Pos logit contributions
worldly+4.706
 behavi+3.779
settings+3.498
properties+3.251
versions+3.204
Neg logit contributions
 derail-3.521
 overthrow-3.285
 Dalai-3.241
 Hague-3.226
 overtake-3.134
Token them
Token ID606
Feature activation-1.35e+01

Pos logit contributions
worldly+6.303
 behavi+4.901
settings+4.467
properties+4.099
versions+4.067
Neg logit contributions
 derail-6.236
 overthrow-5.835
 Dalai-5.799
 Hague-5.767
 overtake-5.632
Tokenabilia
Token ID48249
Feature activation-4.51e+00

Pos logit contributions
worldly+3.677
 behavi+2.728
settings+2.463
properties+2.227
versions+2.171
Neg logit contributions
 derail-4.758
 overthrow-4.495
 Dalai-4.443
 Hague-4.418
 overtake-4.350
Token throughout
Token ID3690
Feature activation-1.16e+01

Pos logit contributions
worldly+5.285
 behavi+4.299
settings+3.976
properties+3.730
versions+3.663
Neg logit contributions
 derail-3.615
 overthrow-3.323
 Dalai-3.272
 Hague-3.245
 overtake-3.170
Token the
Token ID262
Feature activation-2.50e+00

Pos logit contributions
worldly+12.108
 behavi+9.416
settings+8.665
versions+8.172
properties+8.082
Neg logit contributions
 derail-9.796
 overthrow-9.256
 Dalai-9.213
 Hague-9.145
 Clintons-8.912
Token queue
Token ID16834
Feature activation-6.45e+00

Pos logit contributions
worldly+3.210
 behavi+2.691
settings+2.502
properties+2.366
resources+2.318
Neg logit contributions
 derail-1.764
 overthrow-1.590
 Dalai-1.554
 Hague-1.528
 overturn-1.502
Token and
Token ID290
Feature activation-3.19e+00

Pos logit contributions
worldly+7.346
 behavi+5.889
settings+5.506
properties+5.126
versions+5.061
Neg logit contributions
 derail-5.226
 overthrow-4.834
 Dalai-4.792
 Hague-4.749
 Clintons-4.653
Token other
Token ID584
Feature activation-1.99e+01

Pos logit contributions
worldly+3.486
 behavi+2.800
settings+2.567
properties+2.391
versions+2.337
Neg logit contributions
 derail-2.843
 overthrow-2.631
 Dalai-2.594
 Hague-2.568
 overtake-2.521
Token merchandise
Token ID20813
Feature activation-1.51e+01

Pos logit contributions
worldly+21.857
 behavi+16.347
settings+15.325
properties+14.542
versions+14.460
Neg logit contributions
 derail-13.166
 Hague-12.469
 Dalai-12.192
 overthrow-12.188
 overtake-11.816
Token for
Token ID329
Feature activation-8.59e+00

Pos logit contributions
worldly+15.319
 behavi+11.750
settings+10.879
properties+10.242
versions+10.130
Neg logit contributions
 derail-12.269
 overthrow-11.470
 Hague-11.393
 Dalai-11.371
 overtake-11.059
Token sale
Token ID5466
Feature activation-1.68e+01

Pos logit contributions
worldly+9.461
 behavi+7.531
settings+6.935
versions+6.485
properties+6.475
Neg logit contributions
 derail-7.110
 overthrow-6.592
 Hague-6.570
 Dalai-6.517
 overtake-6.338
Token in
Token ID287
Feature activation-1.19e+01

Pos logit contributions
worldly+16.080
 behavi+12.083
settings+11.429
properties+10.559
versions+10.438
Neg logit contributions
 derail-14.172
 overthrow-13.318
 Dalai-13.139
 Hague-13.135
 Clintons-12.843
Token nearby
Token ID6716
Feature activation-1.02e+01

Pos logit contributions
worldly+12.867
 behavi+10.062
settings+9.324
versions+8.923
properties+8.809
Neg logit contributions
 derail-9.637
 overthrow-9.040
 Hague-8.929
 Dalai-8.821
 overtake-8.620
Token technical
Token ID6276
Feature activation-8.53e+00

Pos logit contributions
 derail+2.822
 overthrow+2.637
 Hague+2.585
 Dalai+2.572
 overturn+2.562
Neg logit contributions
worldly-3.083
 behavi-2.449
settings-2.233
properties-2.088
versions-1.970
Token college
Token ID4152
Feature activation-7.01e+00

Pos logit contributions
worldly+8.871
 behavi+6.972
settings+6.392
properties+5.942
resources+5.882
Neg logit contributions
 derail-7.583
 Dalai-7.048
 overthrow-7.009
 Hague-6.924
 assassinate-6.794
Token assistance
Token ID6829
Feature activation-3.16e+00

Pos logit contributions
worldly+7.458
 behavi+5.960
settings+5.478
properties+5.058
versions+5.045
Neg logit contributions
 derail-6.184
 overthrow-5.747
 Dalai-5.691
 Hague-5.646
 overtake-5.506
Token,
Token ID11
Feature activation+2.03e+00

Pos logit contributions
worldly+3.554
 behavi+2.872
settings+2.637
properties+2.464
versions+2.414
Neg logit contributions
 derail-2.721
 overthrow-2.509
 Dalai-2.472
 Hague-2.449
 overturn-2.406
Token among
Token ID1871
Feature activation-1.24e+01

Pos logit contributions
 derail+1.618
 overthrow+1.469
 overturn+1.440
 Hague+1.434
 Dalai+1.430
Neg logit contributions
worldly-2.431
 behavi-1.953
settings-1.820
properties-1.708
resources-1.663
Token other
Token ID584
Feature activation-1.92e+01

Pos logit contributions
worldly+12.038
 behavi+9.470
settings+8.798
properties+8.146
versions+8.129
Neg logit contributions
 derail-11.106
 overthrow-10.369
 Dalai-10.355
 Hague-10.267
 overtake-10.098
Token changes
Token ID2458
Feature activation-3.84e+00

Pos logit contributions
worldly+18.528
 behavi+14.251
settings+13.391
properties+12.428
 endeavors+12.387
Neg logit contributions
 derail-14.422
 Dalai-13.705
 Hague-13.588
 overtake-13.427
 assassinate-13.348
Token.
Token ID13
Feature activation-7.99e+00

Pos logit contributions
worldly+3.870
 behavi+3.037
settings+2.759
properties+2.546
versions+2.491
Neg logit contributions
 derail-3.727
 overthrow-3.476
 Dalai-3.432
 Hague-3.407
 overtake-3.345
Token
Token ID198
Feature activation+7.66e+00

Pos logit contributions
worldly+8.663
 behavi+6.901
settings+6.484
properties+6.069
resources+5.937
Neg logit contributions
 derail-6.851
 overthrow-6.359
 Dalai-6.157
 overtake-6.131
 Hague-6.046
Token
Token ID198
Feature activation+2.36e+00

Pos logit contributions
 derail+6.556
 Dalai+6.157
 Hague+6.124
 overthrow+6.034
 overturn+5.952
Neg logit contributions
worldly-7.743
 behavi-7.545
 practition-5.775
 redes-5.752
settings-5.579
TokenBut
Token ID1537
Feature activation+1.44e+00

Pos logit contributions
 derail+1.979
 Dalai+1.822
 overthrow+1.808
 Hague+1.794
 Clintons+1.778
Neg logit contributions
worldly-2.629
 behavi-2.209
settings-1.910
properties-1.798
versions-1.741
Token Angular
Token ID28147
Feature activation-1.29e+01

Pos logit contributions
worldly+9.294
 behavi+7.187
settings+6.599
properties+6.007
versions+5.961
Neg logit contributions
 derail-9.604
 overthrow-8.974
 Dalai-8.903
 Hague-8.898
 Clintons-8.732
TokenJS
Token ID20120
Feature activation-1.33e+01

Pos logit contributions
worldly+13.691
 behavi+10.743
settings+10.380
properties+9.604
versions+9.569
Neg logit contributions
 derail-10.355
 Dalai-9.650
 overthrow-9.612
 Hague-9.472
 overturn-9.213
Token.
Token ID13
Feature activation-1.65e+01

Pos logit contributions
worldly+10.540
 behavi+8.331
settings+7.373
properties+6.596
resources+6.507
Neg logit contributions
 derail-13.484
 Dalai-12.787
 overthrow-12.678
 Hague-12.646
 Clintons-12.505
Token It
Token ID632
Feature activation-2.08e+01

Pos logit contributions
worldly+13.212
 behavi+9.772
settings+9.564
properties+8.965
resources+8.460
Neg logit contributions
 derail-15.769
 overthrow-14.795
 overtake-14.566
 assassinate-14.504
 Hague-14.350
Token's
Token ID338
Feature activation-1.26e+01

Pos logit contributions
worldly+14.757
settings+9.653
properties+9.022
self+8.426
resources+8.287
Neg logit contributions
 derail-20.418
 assassinate-19.681
 overthrow-19.415
 Clintons-19.215
 overtake-19.021
Token a
Token ID257
Feature activation-1.92e+01

Pos logit contributions
worldly+10.697
settings+6.950
 behavi+6.860
versions+6.279
properties+6.276
Neg logit contributions
 derail-12.784
 Dalai-12.037
 Hague-11.978
 overthrow-11.964
 Clintons-11.949
Token marvelous
Token ID43937
Feature activation-1.28e+01

Pos logit contributions
worldly+12.150
 behavi+6.919
settings+5.482
 webs+5.235
properties+5.072
Neg logit contributions
 derail-21.894
 Dalai-21.780
 Hague-21.459
 assassinate-20.998
 overthrow-20.952
Token framework
Token ID9355
Feature activation-1.08e+01

Pos logit contributions
worldly+10.859
 behavi+7.438
settings+7.076
 webs+6.593
resources+6.483
Neg logit contributions
 derail-13.114
 Dalai-12.589
 assassinate-12.355
 Hague-12.286
 Clintons-12.245
Token for
Token ID329
Feature activation-9.23e+00

Pos logit contributions
worldly+10.855
settings+7.753
 behavi+7.528
properties+7.257
resources+7.056
Neg logit contributions
 derail-9.545
 Dalai-8.900
 overthrow-8.894
 Hague-8.745
 Clintons-8.708
Token modern
Token ID3660
Feature activation-1.31e+01

Pos logit contributions
worldly+7.293
 behavi+4.921
settings+4.557
versions+4.087
properties+3.986
Neg logit contributions
 derail-10.586
 Dalai-10.049
 overthrow-9.935
 Hague-9.888
 overtake-9.739
Token web
Token ID3992
Feature activation-1.62e+01

Pos logit contributions
worldly+6.588
 webs+2.803
settings+2.667
 behavi+2.514
versions+1.799
Neg logit contributions
 derail-17.515
 Dalai-16.902
 assassinate-16.673
 overthrow-16.629
 Clintons-16.562
Token of
Token ID286
Feature activation-1.20e+01

Pos logit contributions
worldly+7.516
settings+5.846
 behavi+5.837
properties+5.528
versions+5.431
Neg logit contributions
 derail-3.805
 overthrow-3.459
 assassinate-3.332
 Dalai-3.330
 overtake-3.307
Token.
Token ID13
Feature activation-1.13e+01

Pos logit contributions
worldly+9.522
settings+6.398
 behavi+6.279
properties+5.696
resources+5.513
Neg logit contributions
 derail-12.837
 overthrow-12.181
 Clintons-12.044
 Dalai-11.991
 assassinate-11.894
Token In
Token ID554
Feature activation-1.38e+01

Pos logit contributions
worldly+8.817
 behavi+6.426
settings+6.093
properties+5.628
resources+5.322
Neg logit contributions
 derail-12.180
 overthrow-11.441
 overtake-11.251
 overturn-11.127
 Dalai-11.115
Token this
Token ID428
Feature activation-1.55e+01

Pos logit contributions
worldly+12.213
settings+8.388
 behavi+8.069
versions+7.946
properties+7.704
Neg logit contributions
 derail-13.046
 overthrow-12.255
 Dalai-12.190
 assassinate-12.095
 Hague-11.918
Token post
Token ID1281
Feature activation-9.38e+00

Pos logit contributions
worldly+11.660
settings+7.230
 behavi+6.972
 webs+6.921
versions+6.596
Neg logit contributions
 derail-15.278
 Clintons-14.743
 assassinate-14.733
 Dalai-14.522
 overtake-14.448
Token I
Token ID314
Feature activation-1.92e+01

Pos logit contributions
worldly+10.321
 behavi+7.759
settings+7.701
versions+7.188
resources+7.126
Neg logit contributions
 derail-7.432
 overthrow-6.919
 Clintons-6.740
 Hague-6.669
 Dalai-6.621
Token'll
Token ID1183
Feature activation-1.22e+00

Pos logit contributions
worldly+11.439
settings+6.576
versions+5.580
 webs+5.375
 endeav+5.350
Neg logit contributions
 derail-20.218
 Hague-20.178
 Clintons-19.803
 Dalai-19.690
 overthrow-19.329
Token round
Token ID2835
Feature activation-7.22e+00

Pos logit contributions
worldly+1.337
 behavi+1.069
settings+0.984
properties+0.916
versions+0.898
Neg logit contributions
 derail-1.079
 overthrow-0.999
 Dalai-0.987
 Hague-0.979
 Clintons-0.959
Token up
Token ID510
Feature activation-8.53e+00

Pos logit contributions
worldly+6.830
 behavi+4.879
settings+4.747
properties+4.334
versions+4.291
Neg logit contributions
 derail-7.159
 overthrow-6.688
 Dalai-6.669
 Hague-6.576
 Clintons-6.509
Token some
Token ID617
Feature activation-1.33e+01

Pos logit contributions
worldly+7.977
 behavi+5.918
settings+5.676
versions+5.195
properties+5.183
Neg logit contributions
 derail-8.387
 overthrow-7.906
 Hague-7.794
 Dalai-7.778
 overtake-7.631
Token got
Token ID1392
Feature activation-1.42e+01

Pos logit contributions
worldly+12.902
settings+9.406
 behavi+9.240
versions+8.518
resources+8.379
Neg logit contributions
 derail-11.395
 overthrow-10.527
 Hague-10.478
 overturn-10.394
 Dalai-10.356
Token with
Token ID351
Feature activation-1.12e+01

Pos logit contributions
worldly+8.510
settings+5.791
 behavi+5.784
versions+5.303
properties+5.299
Neg logit contributions
 derail-8.939
 overthrow-8.396
 Dalai-8.276
 Hague-8.236
 assassinate-8.168
Token every
Token ID790
Feature activation-1.58e+01

Pos logit contributions
worldly+9.143
 behavi+6.053
settings+5.833
versions+5.337
properties+5.297
Neg logit contributions
 derail-11.960
 Dalai-11.308
 overthrow-11.285
 Hague-11.231
 assassinate-11.012
Token new
Token ID649
Feature activation-1.20e+01

Pos logit contributions
worldly+9.751
settings+4.691
 behavi+4.395
versions+4.270
 webs+4.236
Neg logit contributions
 derail-18.674
 assassinate-18.039
 Dalai-17.901
 Hague-17.704
 overthrow-17.653
Token toy
Token ID13373
Feature activation-1.36e+01

Pos logit contributions
worldly+11.654
 behavi+8.346
settings+7.792
versions+7.314
 webs+7.151
Neg logit contributions
 derail-10.807
 Hague-10.304
 Dalai-10.270
 Clintons-10.121
 assassinate-10.111
Token,
Token ID11
Feature activation-9.63e+00

Pos logit contributions
worldly+12.401
settings+8.488
properties+7.911
versions+7.810
resources+7.613
Neg logit contributions
 derail-12.724
 assassinate-12.025
 overthrow-11.864
 overtake-11.791
 Dalai-11.575
Token there
Token ID612
Feature activation-1.91e+01

Pos logit contributions
worldly+9.596
settings+7.006
 behavi+6.980
properties+6.545
resources+6.437
Neg logit contributions
 derail-8.713
 overthrow-8.107
 Dalai-8.069
 Hague-7.922
 assassinate-7.886
Token are
Token ID389
Feature activation-1.13e+01

Pos logit contributions
worldly+12.051
settings+7.030
versions+6.569
 webs+6.369
resources+6.161
Neg logit contributions
 derail-20.149
 overthrow-19.679
 Dalai-19.116
 assassinate-19.064
 Clintons-18.928
Token some
Token ID617
Feature activation-1.37e+01

Pos logit contributions
worldly+9.601
settings+6.730
 behavi+6.649
versions+6.004
properties+5.906
Neg logit contributions
 derail-11.435
 overthrow-10.885
 Dalai-10.840
 Hague-10.757
 assassinate-10.529
Token rough
Token ID5210
Feature activation-1.16e+01

Pos logit contributions
worldly+9.305
settings+5.453
 behavi+5.410
properties+4.419
versions+4.415
Neg logit contributions
 derail-16.040
 Dalai-15.514
 Hague-15.193
 Clintons-15.159
 overthrow-15.060
Token edges
Token ID13015
Feature activation+2.14e-01

Pos logit contributions
worldly+10.652
settings+7.495
 behavi+7.346
versions+6.817
properties+6.771
Neg logit contributions
 derail-11.066
 assassinate-10.434
 Dalai-10.363
 overthrow-10.234
 Hague-10.031
Token that
Token ID326
Feature activation-7.13e+00

Pos logit contributions
 derail+0.159
 overthrow+0.144
 Dalai+0.141
 Hague+0.140
 overturn+0.138
Neg logit contributions
worldly-0.264
 behavi-0.222
settings-0.201
properties-0.190
versions-0.188
Tokenli
Token ID4528
Feature activation-7.62e+00

Pos logit contributions
worldly+12.369
settings+8.636
 behavi+8.047
properties+7.898
resources+7.769
Neg logit contributions
 derail-15.681
 overthrow-14.770
 Clintons-14.650
 Hague-14.612
 Dalai-14.521
Token>
Token ID29
Feature activation-7.48e+00

Pos logit contributions
worldly+7.854
 behavi+6.006
settings+5.686
properties+5.326
resources+5.136
Neg logit contributions
 derail-7.019
 overthrow-6.532
 Dalai-6.430
 Hague-6.391
 overtake-6.311
Token
Token ID7359
Feature activation-1.78e+01

Pos logit contributions
worldly+6.962
 behavi+5.431
settings+4.986
properties+4.562
resources+4.465
Neg logit contributions
 derail-7.437
 overthrow-6.939
 Hague-6.883
 Dalai-6.866
 overtake-6.720
Tokenul
Token ID377
Feature activation-9.40e+00

Pos logit contributions
worldly+12.824
settings+8.996
properties+8.101
resources+8.018
loc+7.236
Neg logit contributions
 derail-18.918
 Clintons-17.715
 assassinate-17.667
 overthrow-17.573
 Hague-17.497
Token>
Token ID29
Feature activation-1.04e+01

Pos logit contributions
worldly+8.952
settings+6.315
 behavi+6.046
properties+5.935
resources+5.700
Neg logit contributions
 derail-9.145
 overthrow-8.529
 Dalai-8.395
 Hague-8.330
 overtake-8.330
Token
Token ID7359
Feature activation-1.90e+01

Pos logit contributions
worldly+9.068
 behavi+7.087
settings+6.493
properties+5.859
resources+5.818
Neg logit contributions
 derail-10.394
 Hague-9.683
 overthrow-9.666
 Dalai-9.599
 overtake-9.428
Tokenbody
Token ID2618
Feature activation-1.31e+01

Pos logit contributions
worldly+11.743
settings+7.552
properties+6.612
resources+6.473
self+5.745
Neg logit contributions
 derail-22.328
 overtake-20.953
 overthrow-20.945
 Hague-20.933
 assassinate-20.926
Token>
Token ID29
Feature activation-1.21e+01

Pos logit contributions
worldly+11.168
settings+7.780
properties+7.342
resources+6.917
versions+6.657
Neg logit contributions
 derail-13.109
 overthrow-12.289
 assassinate-12.169
 overtake-12.120
 Dalai-12.022
Token
Token ID198
Feature activation-1.41e-01

Pos logit contributions
worldly+10.077
 behavi+7.856
settings+7.114
properties+6.419
resources+6.378
Neg logit contributions
 derail-12.049
 Hague-11.354
 overthrow-11.223
 Dalai-11.175
 overtake-11.050
Token
Token ID198
Feature activation-2.65e+00

Pos logit contributions
worldly+0.130
 behavi+0.105
settings+0.089
properties+0.081
versions+0.079
Neg logit contributions
 derail-0.149
 overthrow-0.140
 Dalai-0.139
 Hague-0.138
 overturn-0.135
TokenYou
Token ID1639
Feature activation-1.05e+01

Pos logit contributions
worldly+2.868
 behavi+2.347
settings+2.071
properties+1.938
versions+1.892
Neg logit contributions
 derail-2.386
 overthrow-2.206
 Dalai-2.180
 Hague-2.152
 overturn-2.113
Token it
Token ID340
Feature activation-7.68e+00

Pos logit contributions
worldly+1.522
 behavi+1.244
settings+1.142
properties+1.073
resources+1.045
Neg logit contributions
 derail-1.139
 overthrow-1.048
 Dalai-1.037
 Hague-1.022
 overturn-1.007
Token with
Token ID351
Feature activation-3.77e+00

Pos logit contributions
worldly+8.358
 behavi+6.635
settings+6.151
properties+5.683
versions+5.644
Neg logit contributions
 derail-6.586
 overthrow-6.107
 Dalai-6.006
 Hague-5.993
 Clintons-5.913
Token one
Token ID530
Feature activation-4.12e+00

Pos logit contributions
worldly+4.202
 behavi+3.381
settings+3.109
properties+2.898
versions+2.850
Neg logit contributions
 derail-3.250
 overthrow-3.004
 Dalai-2.960
 Hague-2.938
 Clintons-2.875
Token of
Token ID286
Feature activation-6.83e+00

Pos logit contributions
worldly+5.420
 behavi+4.494
settings+4.202
properties+3.965
versions+3.925
Neg logit contributions
 derail-2.699
 overthrow-2.439
 Dalai-2.382
 Hague-2.377
 Clintons-2.314
Token your
Token ID534
Feature activation-1.44e+01

Pos logit contributions
worldly+7.735
 behavi+6.190
settings+5.728
versions+5.335
properties+5.321
Neg logit contributions
 derail-5.616
 overthrow-5.195
 Hague-5.096
 Dalai-5.082
 Clintons-4.976
Token own
Token ID898
Feature activation-1.90e+01

Pos logit contributions
worldly+15.833
 behavi+12.300
settings+11.692
versions+11.127
selves+11.026
Neg logit contributions
 derail-10.435
 Hague-9.698
 overthrow-9.584
 Dalai-9.442
 Clintons-9.263
Token.
Token ID13
Feature activation-8.48e+00

Pos logit contributions
worldly+19.893
 behavi+14.945
settings+14.272
versions+13.698
selves+13.476
Neg logit contributions
 derail-13.061
 Hague-12.231
 overthrow-12.062
 Clintons-11.835
 Dalai-11.719
Token
Token ID198
Feature activation+1.00e+01

Pos logit contributions
worldly+8.281
 behavi+6.302
settings+5.880
properties+5.470
versions+5.291
Neg logit contributions
 derail-8.178
 overthrow-7.625
 Dalai-7.404
 overtake-7.400
 Hague-7.360
Token
Token ID198
Feature activation+5.22e+00

Pos logit contributions
 derail+7.925
 Dalai+7.431
 Hague+7.409
 overthrow+7.275
 overturn+7.201
Neg logit contributions
 behavi-10.317
worldly-10.207
 practition-8.347
 redes-8.018
 referen-7.726
TokenIt
Token ID1026
Feature activation-9.52e+00

Pos logit contributions
 derail+4.323
 Dalai+4.024
 overthrow+3.945
 Hague+3.875
 overturn+3.829
Neg logit contributions
worldly-5.574
 behavi-4.866
settings-4.045
properties-3.814
 webs-3.703
Token
Token ID447
Feature activation+5.21e+00

Pos logit contributions
worldly+12.889
 behavi+10.849
settings+10.248
properties+9.679
versions+9.572
Neg logit contributions
 derail-5.486
 overthrow-4.912
 Hague-4.738
 Dalai-4.733
 overtake-4.596
Token support
Token ID1104
Feature activation-3.05e+00

Pos logit contributions
worldly+2.453
 behavi+2.022
settings+1.876
properties+1.767
versions+1.740
Neg logit contributions
 derail-1.447
 overthrow-1.319
 Dalai-1.294
 Hague-1.285
 Clintons-1.251
Token,
Token ID11
Feature activation+3.09e+00

Pos logit contributions
worldly+3.762
 behavi+3.101
settings+2.873
properties+2.696
versions+2.656
Neg logit contributions
 derail-2.302
 overthrow-2.100
 Dalai-2.057
 Hague-2.040
 Clintons-1.996
Token force
Token ID2700
Feature activation+1.31e+00

Pos logit contributions
 derail+3.034
 overthrow+2.850
 Hague+2.775
 Dalai+2.755
 Clintons+2.717
Neg logit contributions
worldly-3.122
 behavi-2.438
settings-2.199
properties-2.008
versions-1.957
Token protection
Token ID4800
Feature activation-9.90e-01

Pos logit contributions
 derail+1.257
 overthrow+1.182
 Dalai+1.151
 Hague+1.142
 overturn+1.128
Neg logit contributions
worldly-1.354
 behavi-1.058
settings-0.956
properties-0.887
versions-0.866
Token and
Token ID290
Feature activation-1.54e+00

Pos logit contributions
worldly+1.146
 behavi+0.927
settings+0.854
properties+0.795
versions+0.787
Neg logit contributions
 derail-0.832
 overthrow-0.766
 Dalai-0.746
 Hague-0.742
 overturn-0.731
Token other
Token ID584
Feature activation-1.88e+01

Pos logit contributions
worldly+1.596
 behavi+1.259
settings+1.146
properties+1.062
versions+1.037
Neg logit contributions
 derail-1.472
 overthrow-1.373
 Dalai-1.343
 Hague-1.337
 overturn-1.315
Token related
Token ID3519
Feature activation-1.63e+01

Pos logit contributions
worldly+19.083
 behavi+14.736
settings+13.777
properties+13.242
resources+13.158
Neg logit contributions
 derail-13.895
 Dalai-12.860
 overtake-12.711
 overthrow-12.488
 Hague-12.302
Token elements
Token ID4847
Feature activation-6.53e+00

Pos logit contributions
worldly+16.104
 behavi+12.421
settings+11.413
properties+10.857
resources+10.828
Neg logit contributions
 derail-13.183
 Dalai-12.400
 overtake-12.335
 overthrow-12.140
 Hague-11.988
Token of
Token ID286
Feature activation-6.87e+00

Pos logit contributions
worldly+7.118
 behavi+5.691
settings+5.240
properties+4.890
versions+4.792
Neg logit contributions
 derail-5.672
 overthrow-5.245
 Dalai-5.170
 Hague-5.109
 overtake-5.039
Token logistics
Token ID26355
Feature activation-7.98e+00

Pos logit contributions
worldly+7.566
 behavi+6.069
settings+5.575
properties+5.215
versions+5.164
Neg logit contributions
 derail-5.826
 Dalai-5.355
 overthrow-5.345
 Hague-5.257
 overtake-5.200
Token and
Token ID290
Feature activation-5.09e+00

Pos logit contributions
worldly+9.148
 behavi+7.437
settings+6.849
properties+6.443
resources+6.316
Neg logit contributions
 derail-6.403
 Dalai-5.872
 overthrow-5.867
 Hague-5.773
 overtake-5.650
Token celebrities
Token ID21305
Feature activation-1.21e+00

Pos logit contributions
worldly+11.144
 behavi+8.475
settings+7.937
properties+7.357
versions+7.248
Neg logit contributions
 derail-8.390
 overthrow-7.844
 overtake-7.652
 Dalai-7.629
 Hague-7.621
Token and
Token ID290
Feature activation-5.01e+00

Pos logit contributions
worldly+1.401
 behavi+1.145
settings+1.052
properties+0.987
versions+0.969
Neg logit contributions
 derail-0.996
 overthrow-0.915
 Dalai-0.900
 Hague-0.891
 overturn-0.876
Token posted
Token ID4481
Feature activation-6.89e+00

Pos logit contributions
worldly+5.482
 behavi+4.391
settings+4.040
properties+3.755
versions+3.688
Neg logit contributions
 derail-4.389
 overthrow-4.070
 Dalai-4.014
 Hague-3.989
 overtake-3.902
Token them
Token ID606
Feature activation-1.24e+01

Pos logit contributions
worldly+6.951
 behavi+5.427
settings+4.993
properties+4.573
versions+4.526
Neg logit contributions
 derail-6.462
 overthrow-6.043
 Dalai-5.955
 Hague-5.947
 overtake-5.846
Token to
Token ID284
Feature activation-4.47e+00

Pos logit contributions
worldly+12.247
 behavi+9.249
settings+8.750
properties+7.940
selves+7.881
Neg logit contributions
 derail-10.829
 overthrow-10.186
 Hague-10.159
 Dalai-10.035
 Clintons-9.954
Token social
Token ID1919
Feature activation-1.86e+01

Pos logit contributions
worldly+4.742
 behavi+3.770
settings+3.451
properties+3.199
versions+3.139
Neg logit contributions
 derail-4.084
 overthrow-3.795
 Dalai-3.746
 Hague-3.721
 overtake-3.644
Token media
Token ID2056
Feature activation-1.39e+01

Pos logit contributions
worldly+16.047
 behavi+12.598
settings+11.983
 webs+11.010
resources+10.600
Neg logit contributions
 derail-15.015
 Hague-14.702
 Dalai-14.500
 overthrow-14.449
 Clintons-14.277
Token sites
Token ID5043
Feature activation-7.11e+00

Pos logit contributions
worldly+13.336
 behavi+10.297
settings+9.712
properties+8.759
 webs+8.656
Neg logit contributions
 derail-12.074
 Hague-11.607
 overthrow-11.538
 Dalai-11.460
 Clintons-11.263
Token including
Token ID1390
Feature activation-4.85e+00

Pos logit contributions
worldly+7.615
 behavi+6.059
settings+5.586
properties+5.172
versions+5.062
Neg logit contributions
 derail-6.260
 overthrow-5.821
 Dalai-5.739
 Hague-5.734
 Clintons-5.597
Token Reddit
Token ID10750
Feature activation-6.58e+00

Pos logit contributions
worldly+4.744
 behavi+3.679
settings+3.355
properties+3.073
versions+3.022
Neg logit contributions
 derail-4.758
 overthrow-4.461
 Dalai-4.404
 Hague-4.398
 overtake-4.298
Token and
Token ID290
Feature activation-2.80e+00

Pos logit contributions
worldly+7.486
 behavi+6.057
settings+5.631
properties+5.255
versions+5.169
Neg logit contributions
 derail-5.369
 overthrow-4.977
 Dalai-4.912
 Hague-4.884
 overtake-4.764
Token technology
Token ID3037
Feature activation-3.60e+00

Pos logit contributions
worldly+8.870
 behavi+7.111
settings+6.474
properties+6.025
versions+5.934
Neg logit contributions
 derail-7.140
 Dalai-6.679
 overthrow-6.645
 Hague-6.614
 Clintons-6.451
Token that
Token ID326
Feature activation+1.85e-01

Pos logit contributions
worldly+4.265
 behavi+3.485
settings+3.225
properties+3.025
versions+2.974
Neg logit contributions
 derail-2.845
 overthrow-2.610
 Dalai-2.572
 Hague-2.549
 Clintons-2.488
Token become
Token ID1716
Feature activation-3.12e+00

Pos logit contributions
 derail+0.146
 overthrow+0.133
 Dalai+0.130
 Hague+0.129
 overtake+0.127
Neg logit contributions
worldly-0.222
 behavi-0.182
settings-0.169
properties-0.159
versions-0.156
Token encoded
Token ID30240
Feature activation-1.03e+01

Pos logit contributions
worldly+3.345
 behavi+2.670
settings+2.447
properties+2.277
versions+2.224
Neg logit contributions
 derail-2.854
 overthrow-2.633
 Dalai-2.596
 Hague-2.575
 overtake-2.538
Token in
Token ID287
Feature activation-7.84e+00

Pos logit contributions
worldly+11.221
 behavi+9.034
settings+8.247
properties+7.664
versions+7.656
Neg logit contributions
 derail-8.546
 overthrow-7.944
 Hague-7.930
 Dalai-7.924
 Clintons-7.696
Token digital
Token ID4875
Feature activation-1.86e+01

Pos logit contributions
worldly+8.152
 behavi+6.423
settings+5.881
versions+5.430
properties+5.389
Neg logit contributions
 derail-7.122
 overthrow-6.653
 Dalai-6.592
 Hague-6.557
 Clintons-6.416
Token files
Token ID3696
Feature activation-7.80e+00

Pos logit contributions
worldly+18.880
 behavi+15.057
settings+14.067
versions+13.326
resources+12.923
Neg logit contributions
 derail-13.160
 Dalai-12.880
 Hague-12.736
 overthrow-12.397
 Clintons-12.229
Token are
Token ID389
Feature activation-4.83e+00

Pos logit contributions
worldly+9.181
 behavi+7.402
settings+6.958
properties+6.494
versions+6.450
Neg logit contributions
 derail-5.864
 Dalai-5.424
 overthrow-5.386
 Hague-5.379
 Clintons-5.255
Token one
Token ID530
Feature activation-7.33e+00

Pos logit contributions
worldly+5.222
 behavi+4.176
settings+3.827
properties+3.559
versions+3.488
Neg logit contributions
 derail-4.338
 overthrow-4.020
 Dalai-3.964
 Hague-3.932
 overtake-3.854
Token such
Token ID884
Feature activation-1.11e+01

Pos logit contributions
worldly+9.604
 behavi+7.951
settings+7.365
properties+6.971
versions+6.938
Neg logit contributions
 derail-4.684
 overthrow-4.250
 Hague-4.177
 Dalai-4.161
 Clintons-4.045
Token type
Token ID2099
Feature activation-7.25e+00

Pos logit contributions
worldly+13.755
 behavi+11.369
settings+10.315
versions+9.860
properties+9.794
Neg logit contributions
 derail-7.242
 Hague-6.777
 overthrow-6.694
 Dalai-6.657
 Clintons-6.540
Token of
Token ID286
Feature activation-8.38e+00

Pos logit contributions
worldly+9.594
 behavi+8.055
settings+7.490
properties+7.085
versions+6.985
Neg logit contributions
 derail-4.838
 Dalai-4.448
 overthrow-4.401
 Hague-4.306
 overtake-4.220
Token normal
Token ID3487
Feature activation-1.27e+01

Pos logit contributions
worldly+8.836
 behavi+7.030
settings+6.451
properties+5.994
versions+5.939
Neg logit contributions
 derail-7.330
 overthrow-6.833
 Dalai-6.785
 overtake-6.618
 Hague-6.598
Token fairness
Token ID22692
Feature activation-2.01e+00

Pos logit contributions
worldly+13.245
 behavi+10.574
settings+9.753
properties+9.072
versions+8.986
Neg logit contributions
 derail-10.183
 Dalai-9.653
 overthrow-9.504
 overtake-9.227
 Hague-9.214
Token procedures
Token ID9021
Feature activation-4.10e+00

Pos logit contributions
worldly+2.076
 behavi+1.622
settings+1.477
properties+1.370
versions+1.339
Neg logit contributions
 derail-1.934
 overthrow-1.796
 Hague-1.765
 Dalai-1.757
 overturn-1.739
Token in
Token ID287
Feature activation-8.14e+00

Pos logit contributions
worldly+4.361
 behavi+3.471
settings+3.169
properties+2.944
versions+2.881
Neg logit contributions
 derail-3.780
 overthrow-3.507
 Dalai-3.457
 Hague-3.439
 overturn-3.368
Token other
Token ID584
Feature activation-1.85e+01

Pos logit contributions
worldly+8.559
 behavi+6.784
settings+6.247
versions+5.828
properties+5.797
Neg logit contributions
 derail-7.175
 overthrow-6.672
 Dalai-6.636
 Hague-6.476
 overtake-6.427
Token C
Token ID327
Feature activation-8.39e+00

Pos logit contributions
worldly+18.681
 behavi+14.192
settings+13.406
 endeavors+12.589
versions+12.492
Neg logit contributions
 derail-14.030
 Dalai-13.272
 overthrow-13.026
 overtake-12.678
 assassinate-12.494
TokenBA
Token ID4339
Feature activation-7.14e+00

Pos logit contributions
worldly+8.804
 behavi+6.629
settings+6.437
versions+6.024
properties+5.976
Neg logit contributions
 derail-7.433
 overthrow-6.866
 Dalai-6.827
 overtake-6.681
 Hague-6.666
Token contexts
Token ID26307
Feature activation-8.60e+00

Pos logit contributions
worldly+7.102
 behavi+5.562
settings+5.067
properties+4.645
versions+4.590
Neg logit contributions
 derail-6.883
 overthrow-6.440
 Dalai-6.378
 Hague-6.279
 overtake-6.220
Token.
Token ID13
Feature activation-8.66e+00

Pos logit contributions
worldly+8.415
 behavi+6.529
settings+6.016
properties+5.471
resources+5.395
Neg logit contributions
 derail-8.176
 overthrow-7.650
 Dalai-7.607
 Hague-7.429
 Clintons-7.383
Token Industrial
Token ID19034
Feature activation-9.86e+00

Pos logit contributions
worldly+7.813
 behavi+5.815
settings+5.400
properties+5.019
resources+4.836
Neg logit contributions
 derail-8.941
 overthrow-8.439
 overtake-8.210
 Dalai-8.149
 assassinate-8.057
Token near
Token ID1474
Feature activation-4.78e+00

Pos logit contributions
worldly+8.724
 behavi+6.019
settings+5.741
properties+5.254
resources+5.121
Neg logit contributions
 derail-10.087
 overthrow-9.462
 Dalai-9.424
 Clintons-9.316
 Hague-9.220
Token Alfred
Token ID22044
Feature activation+2.06e+00

Pos logit contributions
worldly+6.358
 behavi+5.317
settings+4.976
properties+4.714
versions+4.645
Neg logit contributions
 derail-3.087
 overthrow-2.774
 Dalai-2.717
 Hague-2.686
 overtake-2.615
Token Lerner
Token ID49750
Feature activation+3.84e+00

Pos logit contributions
 derail+1.243
 Dalai+1.123
 overthrow+1.116
 Hague+1.103
 Clintons+1.057
Neg logit contributions
worldly-2.797
 behavi-2.429
settings-2.215
properties-2.086
versions-2.073
Token Hall
Token ID4789
Feature activation-6.39e+00

Pos logit contributions
 derail+1.883
 Hague+1.682
 Dalai+1.682
 overthrow+1.666
 Clintons+1.575
Neg logit contributions
worldly-5.546
 behavi-4.952
settings-4.483
properties-4.293
versions-4.255
Token.
Token ID13
Feature activation-1.03e+01

Pos logit contributions
worldly+5.678
 behavi+4.026
settings+3.912
properties+3.576
resources+3.464
Neg logit contributions
 derail-6.820
 overthrow-6.422
 Dalai-6.308
 overtake-6.236
 Hague-6.229
Token Other
Token ID3819
Feature activation-1.85e+01

Pos logit contributions
worldly+8.742
 behavi+6.438
settings+6.000
properties+5.516
resources+5.301
Neg logit contributions
 derail-11.150
 overthrow-10.465
 Dalai-10.145
 overtake-10.120
 Hague-10.024
Token filming
Token ID17691
Feature activation-8.78e+00

Pos logit contributions
worldly+15.715
settings+10.468
 behavi+9.857
versions+8.932
 endeav+8.851
Neg logit contributions
 derail-17.367
 overthrow-16.578
 Clintons-16.449
 overtake-16.176
 Dalai-16.018
Token locations
Token ID7064
Feature activation-4.10e+00

Pos logit contributions
worldly+4.867
 behavi+2.730
settings+2.484
properties+1.979
versions+1.806
Neg logit contributions
 derail-12.185
 overthrow-11.708
 Dalai-11.624
 Hague-11.536
 overtake-11.425
Token in
Token ID287
Feature activation-8.58e+00

Pos logit contributions
worldly+4.495
 behavi+3.525
settings+3.327
properties+3.097
versions+3.036
Neg logit contributions
 derail-3.576
 overthrow-3.318
 Dalai-3.275
 Hague-3.233
 Clintons-3.193
Token Massachusetts
Token ID10140
Feature activation-7.82e+00

Pos logit contributions
worldly+9.895
 behavi+8.083
settings+7.653
properties+7.137
versions+7.130
Neg logit contributions
 derail-6.631
 overthrow-6.130
 Dalai-6.021
 Hague-5.979
 overtake-5.852
Token were
Token ID547
Feature activation-5.51e+00

Pos logit contributions
worldly+7.282
 behavi+5.127
settings+5.028
properties+4.553
versions+4.487
Neg logit contributions
 derail-7.872
 overthrow-7.404
 Dalai-7.333
 Hague-7.187
 overtake-7.182
Token plush
Token ID46336
Feature activation-4.19e+00

Pos logit contributions
 derail+4.789
 Dalai+4.370
 overthrow+4.365
 Hague+4.242
 overturn+4.207
Neg logit contributions
worldly-5.324
 behavi-4.285
settings-3.833
properties-3.516
versions-3.446
Tokenies
Token ID444
Feature activation+2.32e+00

Pos logit contributions
worldly+4.318
 behavi+3.398
settings+3.113
properties+2.892
versions+2.831
Neg logit contributions
 derail-3.923
 overthrow-3.659
 Dalai-3.608
 Hague-3.590
 overtake-3.527
Token,
Token ID11
Feature activation+3.99e+00

Pos logit contributions
 derail+1.654
 overthrow+1.502
 Dalai+1.475
 Hague+1.456
 overturn+1.423
Neg logit contributions
worldly-2.928
 behavi-2.441
settings-2.256
properties-2.123
versions-2.086
Token statues
Token ID25827
Feature activation+2.70e+00

Pos logit contributions
 derail+3.714
 Dalai+3.395
 overthrow+3.391
 Hague+3.298
 overturn+3.260
Neg logit contributions
worldly-4.178
 behavi-3.360
settings-3.028
properties-2.787
versions-2.718
Token and
Token ID290
Feature activation-1.18e+00

Pos logit contributions
 derail+1.933
 overthrow+1.758
 Dalai+1.730
 Hague+1.709
 overturn+1.676
Neg logit contributions
worldly-3.385
 behavi-2.832
settings-2.616
properties-2.452
versions-2.410
Token other
Token ID584
Feature activation-1.85e+01

Pos logit contributions
worldly+1.271
 behavi+1.023
settings+0.933
properties+0.865
versions+0.846
Neg logit contributions
 derail-1.073
 overthrow-0.989
 Dalai-0.982
 Hague-0.966
 overturn-0.950
Token merchandise
Token ID20813
Feature activation-1.28e+01

Pos logit contributions
worldly+18.484
 behavi+13.406
settings+12.646
properties+12.135
resources+11.972
Neg logit contributions
 derail-14.454
 overthrow-13.418
 Hague-13.405
 overtake-13.241
 assassinate-13.001
Token,
Token ID11
Feature activation-2.48e+00

Pos logit contributions
worldly+12.670
 behavi+9.639
settings+8.951
properties+8.475
versions+8.269
Neg logit contributions
 derail-11.155
 overthrow-10.440
 overtake-10.205
 Dalai-10.194
 Hague-10.188
Token as
Token ID355
Feature activation-7.84e+00

Pos logit contributions
worldly+2.826
 behavi+2.290
settings+2.107
properties+1.967
versions+1.932
Neg logit contributions
 derail-2.082
 overthrow-1.919
 Dalai-1.889
 Hague-1.875
 overturn-1.835
Token well
Token ID880
Feature activation-9.87e+00

Pos logit contributions
worldly+6.103
 behavi+4.352
settings+3.886
properties+3.477
versions+3.394
Neg logit contributions
 derail-9.123
 overthrow-8.632
 Dalai-8.527
 Hague-8.481
 overtake-8.389
Token as
Token ID355
Feature activation-2.50e+00

Pos logit contributions
worldly+8.696
 behavi+6.154
settings+5.867
properties+5.388
resources+5.174
Neg logit contributions
 derail-10.050
 overthrow-9.539
 Dalai-9.459
 overtake-9.268
 Hague-9.260
Token on
Token ID319
Feature activation-3.58e+00

Pos logit contributions
 derail+2.545
 overthrow+2.317
 Hague+2.285
 Dalai+2.277
 overturn+2.260
Neg logit contributions
worldly-3.997
 behavi-3.325
settings-3.087
properties-2.881
resources-2.792
Token domestic
Token ID5928
Feature activation-7.62e+00

Pos logit contributions
worldly+3.238
 behavi+2.461
settings+2.200
properties+2.002
versions+1.948
Neg logit contributions
 derail-3.857
 overthrow-3.622
 Dalai-3.581
 Hague-3.558
 overtake-3.499
Token spying
Token ID23649
Feature activation-5.08e-01

Pos logit contributions
worldly+8.107
 behavi+6.475
settings+6.003
properties+5.538
versions+5.503
Neg logit contributions
 derail-6.598
 overthrow-6.083
 Dalai-6.069
 Hague-6.020
 overtake-5.887
Token legislation
Token ID5520
Feature activation-6.24e-01

Pos logit contributions
worldly+0.568
 behavi+0.458
settings+0.418
properties+0.391
versions+0.382
Neg logit contributions
 derail-0.443
 overthrow-0.409
 Dalai-0.401
 Hague-0.398
 overturn-0.395
Token and
Token ID290
Feature activation-2.22e+00

Pos logit contributions
worldly+0.743
 behavi+0.613
settings+0.564
properties+0.530
resources+0.518
Neg logit contributions
 derail-0.497
 overthrow-0.454
 Dalai-0.446
 Hague-0.443
 overturn-0.439
Token other
Token ID584
Feature activation-1.84e+01

Pos logit contributions
worldly+2.573
 behavi+2.091
settings+1.929
properties+1.806
versions+1.773
Neg logit contributions
 derail-1.823
 overthrow-1.677
 Dalai-1.649
 Hague-1.636
 overtake-1.602
Token issues
Token ID2428
Feature activation-5.38e+00

Pos logit contributions
worldly+19.625
 behavi+15.075
settings+14.357
versions+13.800
 endeavors+13.686
Neg logit contributions
 derail-12.465
 Dalai-11.726
 overthrow-11.469
 overtake-11.330
 assassinate-11.205
Token."
Token ID526
Feature activation-5.42e+00

Pos logit contributions
worldly+6.188
 behavi+4.983
settings+4.624
properties+4.316
versions+4.262
Neg logit contributions
 derail-4.370
 overthrow-4.046
 Dalai-3.971
 Hague-3.937
 overtake-3.865
Token To
Token ID1675
Feature activation+5.01e+00

Pos logit contributions
worldly+5.361
 behavi+4.175
settings+3.809
properties+3.526
versions+3.436
Neg logit contributions
 derail-5.317
 overthrow-4.963
 Dalai-4.882
 Hague-4.835
 overtake-4.784
Token put
Token ID1234
Feature activation-4.74e+00

Pos logit contributions
 derail+3.795
 overthrow+3.444
 overturn+3.326
 Hague+3.268
 overtake+3.249
Neg logit contributions
worldly-6.187
 behavi-5.140
settings-4.716
properties-4.465
resources-4.339
Token this
Token ID428
Feature activation-6.36e+00

Pos logit contributions
worldly+5.512
 behavi+4.486
settings+4.142
properties+3.879
versions+3.811
Neg logit contributions
 derail-3.862
 overthrow-3.551
 Dalai-3.496
 Hague-3.467
 overtake-3.389
Token anti
Token ID3098
Feature activation-4.79e+00

Pos logit contributions
worldly+6.480
 behavi+5.146
settings+4.705
properties+4.356
versions+4.297
Neg logit contributions
 derail-5.563
 overthrow-5.170
 Dalai-5.092
 Hague-5.081
 overtake-4.974
Token-
Token ID12
Feature activation+1.55e+00

Pos logit contributions
worldly+4.689
 behavi+3.660
settings+3.323
properties+3.046
resources+2.998
Neg logit contributions
 derail-4.626
 overthrow-4.316
 Dalai-4.254
 Hague-4.253
 overturn-4.136
TokenMuslim
Token ID17067
Feature activation-4.24e+00

Pos logit contributions
 derail+1.301
 overthrow+1.197
 Dalai+1.192
 Hague+1.167
 Clintons+1.161
Neg logit contributions
worldly-1.736
 behavi-1.445
settings-1.292
properties-1.209
versions-1.191
Token content
Token ID2695
Feature activation-6.89e+00

Pos logit contributions
worldly+4.543
 behavi+3.627
settings+3.318
properties+3.085
versions+3.024
Neg logit contributions
 derail-3.819
 overthrow-3.544
 Dalai-3.494
 Hague-3.473
 overtake-3.406
Token on
Token ID319
Feature activation-8.17e+00

Pos logit contributions
worldly+7.771
 behavi+6.251
settings+5.781
properties+5.393
versions+5.293
Neg logit contributions
 derail-5.720
 overthrow-5.283
 Dalai-5.182
 Hague-5.179
 overtake-5.069
Token social
Token ID1919
Feature activation-1.84e+01

Pos logit contributions
worldly+8.266
 behavi+6.464
settings+5.939
properties+5.455
versions+5.383
Neg logit contributions
 derail-7.583
 overthrow-7.071
 Hague-7.001
 Dalai-6.933
 overtake-6.834
Token media
Token ID2056
Feature activation-1.29e+01

Pos logit contributions
worldly+12.010
 behavi+9.603
settings+8.260
 webs+7.609
 endeavors+6.917
Neg logit contributions
 derail-17.952
 Hague-17.780
 Dalai-17.696
 overthrow-17.290
 Clintons-16.978
Token as
Token ID355
Feature activation-7.05e+00

Pos logit contributions
worldly+13.606
 behavi+10.873
settings+10.141
properties+9.298
versions+9.239
Neg logit contributions
 derail-10.537
 overthrow-9.895
 Hague-9.789
 Dalai-9.754
 Clintons-9.506
Token well
Token ID880
Feature activation-1.28e+01

Pos logit contributions
worldly+6.268
 behavi+4.735
settings+4.266
properties+3.875
versions+3.795
Neg logit contributions
 derail-7.390
 overthrow-6.961
 Dalai-6.862
 Hague-6.835
 overtake-6.764
Token,
Token ID11
Feature activation-2.44e+00

Pos logit contributions
worldly+12.606
 behavi+9.961
settings+9.060
properties+8.302
versions+8.202
Neg logit contributions
 derail-11.224
 overthrow-10.570
 Dalai-10.332
 overtake-10.231
 Hague-10.212
Token such
Token ID884
Feature activation-1.18e+01

Pos logit contributions
worldly+2.848
 behavi+2.313
settings+2.137
properties+2.002
versions+1.964
Neg logit contributions
 derail-2.006
 overthrow-1.844
 Dalai-1.814
 Hague-1.800
 overturn-1.764
Tokenrog
Token ID3828
Feature activation+1.48e+00

Pos logit contributions
 derail+2.259
 overthrow+2.109
 Hague+2.081
 Dalai+2.080
 overturn+2.043
Neg logit contributions
worldly-2.491
 behavi-2.133
settings-1.802
properties-1.666
versions-1.638
Tokenate
Token ID378
Feature activation+7.08e-01

Pos logit contributions
 derail+0.919
 overthrow+0.830
 Dalai+0.811
 Hague+0.810
 overturn+0.780
Neg logit contributions
worldly-1.984
 behavi-1.727
settings-1.555
properties-1.480
versions-1.458
Token's
Token ID338
Feature activation+2.12e+00

Pos logit contributions
 derail+0.621
 overthrow+0.574
 Dalai+0.574
 Hague+0.570
 Clintons+0.556
Neg logit contributions
worldly-0.772
 behavi-0.648
settings-0.569
properties-0.531
versions-0.522
Token Court
Token ID3078
Feature activation-8.90e-01

Pos logit contributions
 derail+1.081
 Hague+0.963
 overthrow+0.957
 Dalai+0.951
 overturn+0.901
Neg logit contributions
worldly-3.064
 behavi-2.682
settings-2.459
properties-2.348
versions-2.329
Token.
Token ID13
Feature activation-7.28e+00

Pos logit contributions
worldly+0.887
 behavi+0.715
settings+0.627
properties+0.577
versions+0.568
Neg logit contributions
 derail-0.875
 overthrow-0.814
 Dalai-0.807
 Hague-0.805
 overturn-0.786
Token Other
Token ID3819
Feature activation-1.83e+01

Pos logit contributions
worldly+6.360
 behavi+4.813
settings+4.377
properties+3.995
resources+3.868
Neg logit contributions
 derail-7.758
 overthrow-7.313
 Dalai-7.141
 Hague-7.087
 overtake-7.066
Token sequences
Token ID16311
Feature activation+5.27e-02

Pos logit contributions
worldly+12.583
settings+7.565
 behavi+7.148
versions+6.757
 webs+6.039
Neg logit contributions
 derail-19.563
 overthrow-19.158
 Hague-19.043
 Dalai-18.869
 overtake-18.664
Token like
Token ID588
Feature activation-4.29e+00

Pos logit contributions
 derail+0.042
 overthrow+0.038
 Dalai+0.038
 Hague+0.037
 overturn+0.037
Neg logit contributions
worldly-0.062
 behavi-0.052
settings-0.047
properties-0.045
resources-0.044
Token the
Token ID262
Feature activation-2.40e-01

Pos logit contributions
worldly+4.467
 behavi+3.418
settings+3.221
properties+2.971
versions+2.945
Neg logit contributions
 derail-3.984
 overthrow-3.713
 Dalai-3.659
 Hague-3.624
 overtake-3.572
Token subway
Token ID19612
Feature activation-1.34e+00

Pos logit contributions
worldly+0.274
 behavi+0.224
settings+0.204
properties+0.193
versions+0.189
Neg logit contributions
 derail-0.203
 overthrow-0.186
 Dalai-0.180
 Hague-0.178
 overtake-0.178
Token scene
Token ID3715
Feature activation+6.24e-01

Pos logit contributions
worldly+1.542
 behavi+1.256
settings+1.157
properties+1.083
versions+1.062
Neg logit contributions
 derail-1.112
 overthrow-1.023
 Dalai-1.005
 Hague-0.997
 overtake-0.976
Token knife
Token ID9845
Feature activation-9.61e+00

Pos logit contributions
worldly+14.285
settings+9.730
 webs+9.059
 behavi+9.003
versions+8.903
Neg logit contributions
 derail-11.908
 Clintons-11.540
 Dalai-11.261
 overthrow-11.187
 assassinate-11.095
Token.
Token ID13
Feature activation-1.21e+01

Pos logit contributions
worldly+7.892
settings+5.266
 behavi+5.102
properties+4.715
resources+4.551
Neg logit contributions
 derail-10.343
 overthrow-9.846
 Dalai-9.770
 Clintons-9.723
 Hague-9.607
Token 3
Token ID513
Feature activation-5.00e+00

Pos logit contributions
worldly+11.574
 behavi+9.377
settings+8.825
properties+8.158
resources+8.012
Neg logit contributions
 derail-10.196
 overthrow-9.705
 Dalai-9.667
 overtake-9.333
 overturn-9.327
Token.
Token ID13
Feature activation-7.25e+00

Pos logit contributions
worldly+4.586
 behavi+3.316
settings+3.165
properties+2.898
resources+2.825
Neg logit contributions
 derail-5.270
 overthrow-4.947
 Dalai-4.868
 Hague-4.815
 overtake-4.786
Token In
Token ID554
Feature activation-1.37e+01

Pos logit contributions
worldly+6.822
 behavi+5.009
settings+4.846
properties+4.449
resources+4.338
Neg logit contributions
 derail-7.190
 overthrow-6.756
 Dalai-6.633
 overtake-6.517
 overturn-6.471
Token a
Token ID257
Feature activation-1.83e+01

Pos logit contributions
worldly+11.147
settings+7.191
versions+6.624
 behavi+6.396
properties+6.396
Neg logit contributions
 derail-14.138
 Dalai-13.362
 overthrow-13.325
 Hague-13.251
 assassinate-13.164
Token large
Token ID1588
Feature activation-1.45e+01

Pos logit contributions
worldly+10.926
settings+5.365
 webs+4.550
resources+4.194
versions+4.173
Neg logit contributions
 derail-21.753
 overthrow-21.130
 Dalai-21.106
 assassinate-20.822
yip-20.503
Token bowl
Token ID9396
Feature activation-8.11e+00

Pos logit contributions
worldly+12.357
settings+7.842
 behavi+7.511
versions+7.233
 webs+7.088
Neg logit contributions
 derail-14.248
 Dalai-13.665
 assassinate-13.465
 overthrow-13.224
 Clintons-12.875
Token,
Token ID11
Feature activation-8.04e+00

Pos logit contributions
worldly+8.296
settings+5.992
 behavi+5.693
properties+5.562
resources+5.436
Neg logit contributions
 derail-7.219
 Dalai-6.794
 overthrow-6.726
 Hague-6.586
 assassinate-6.575
Token combine
Token ID12082
Feature activation-9.54e+00

Pos logit contributions
worldly+9.266
 behavi+7.017
settings+6.928
properties+6.394
versions+6.384
Neg logit contributions
 derail-6.098
 Dalai-5.796
 overthrow-5.646
 Hague-5.643
 Clintons-5.494
Token the
Token ID262
Feature activation-4.01e+00

Pos logit contributions
worldly+8.930
 behavi+6.051
settings+5.992
versions+5.595
properties+5.509
Neg logit contributions
 derail-9.266
 overthrow-8.721
 Dalai-8.691
 Hague-8.625
 assassinate-8.584
Token what
Token ID644
Feature activation+1.86e+00

Pos logit contributions
 derail+7.295
 Dalai+6.617
 overthrow+6.583
 Hague+6.522
 overturn+6.501
Neg logit contributions
worldly-10.951
 behavi-8.876
settings-8.269
properties-7.847
versions-7.515
Token
Token ID447
Feature activation+3.02e+00

Pos logit contributions
 derail+1.499
 overthrow+1.355
 Dalai+1.350
 Hague+1.341
 Clintons+1.314
Neg logit contributions
worldly-2.184
 behavi-1.798
settings-1.666
properties-1.570
resources-1.534
Token
Token ID247
Feature activation+2.73e+00

Pos logit contributions
 derail+2.483
 overthrow+2.304
 Dalai+2.288
 Hague+2.275
 overturn+2.233
Neg logit contributions
worldly-3.424
 behavi-2.843
settings-2.568
properties-2.409
versions-2.363
Tokens
Token ID82
Feature activation+4.21e+00

Pos logit contributions
 derail+2.435
 Dalai+2.261
 overthrow+2.232
 Hague+2.210
 Clintons+2.188
Neg logit contributions
worldly-2.893
 behavi-2.326
settings-2.113
properties-1.976
versions-1.912
Token going
Token ID1016
Feature activation+7.25e+00

Pos logit contributions
 derail+3.846
 Dalai+3.519
 overthrow+3.512
 overturn+3.459
 Hague+3.453
Neg logit contributions
worldly-4.410
 behavi-3.544
settings-3.245
properties-3.027
resources-2.938
Token to
Token ID284
Feature activation+1.74e+01

Pos logit contributions
 derail+5.066
 overthrow+4.635
 Dalai+4.581
 overtake+4.537
 overturn+4.499
Neg logit contributions
worldly-8.744
 behavi-7.208
settings-6.873
properties-6.465
resources-6.332
Token happen
Token ID1645
Feature activation+1.13e+01

Pos logit contributions
 derail+11.734
 overtake+10.181
 overthrow+10.120
 overturn+9.900
 Hague+9.067
Neg logit contributions
worldly-19.653
 behavi-16.826
settings-15.463
properties-14.796
resources-14.577
Token as
Token ID355
Feature activation+6.36e+00

Pos logit contributions
 derail+8.819
 Dalai+8.206
 overturn+8.048
 overthrow+8.003
 overtake+7.910
Neg logit contributions
worldly-11.624
 behavi-9.437
settings-8.652
properties-8.260
resources-7.830
Token we
Token ID356
Feature activation+2.05e+00

Pos logit contributions
 derail+5.511
 Dalai+5.019
 overthrow+5.012
 Hague+4.914
 overtake+4.911
Neg logit contributions
worldly-6.736
 behavi-5.396
settings-4.899
properties-4.610
resources-4.465
Token continue
Token ID2555
Feature activation+7.04e+00

Pos logit contributions
 derail+1.394
 overthrow+1.262
 overtake+1.215
 Dalai+1.203
 Hague+1.190
Neg logit contributions
worldly-2.688
 behavi-2.304
settings-2.097
properties-1.992
resources-1.963
Token to
Token ID284
Feature activation+9.60e+00

Pos logit contributions
 derail+5.092
 overthrow+4.607
 overturn+4.568
 Dalai+4.541
 Hague+4.492
Neg logit contributions
worldly-8.389
 behavi-6.814
settings-6.422
properties-6.106
resources-5.953
Token ALWAYS
Token ID49576
Feature activation+6.60e+00

Pos logit contributions
 derail+5.990
 Hague+5.576
 Dalai+5.556
 overthrow+5.413
 Clintons+5.292
Neg logit contributions
worldly-10.741
 behavi-9.131
settings-8.285
properties-7.929
versions-7.815
Token thought
Token ID1807
Feature activation+9.36e+00

Pos logit contributions
 derail+4.665
 Dalai+4.291
 Hague+4.211
 overthrow+4.210
 Clintons+4.111
Neg logit contributions
worldly-7.947
 behavi-6.880
settings-6.067
properties-5.811
versions-5.729
Token he
Token ID339
Feature activation+1.75e-01

Pos logit contributions
 derail+6.929
 Dalai+6.412
 overthrow+6.326
 Hague+6.286
 Clintons+6.279
Neg logit contributions
worldly-10.606
 behavi-8.886
settings-8.298
properties-7.793
versions-7.720
Token had
Token ID550
Feature activation+5.62e+00

Pos logit contributions
 derail+0.167
 overthrow+0.157
 Dalai+0.154
 Hague+0.153
 overturn+0.151
Neg logit contributions
worldly-0.179
 behavi-0.145
settings-0.130
properties-0.121
versions-0.119
Token a
Token ID257
Feature activation-1.75e+00

Pos logit contributions
 derail+4.674
 overthrow+4.349
 Dalai+4.222
 Hague+4.181
 overtake+4.167
Neg logit contributions
worldly-6.138
 behavi-5.106
settings-4.671
properties-4.347
versions-4.338
Token chance
Token ID2863
Feature activation+1.37e+01

Pos logit contributions
worldly+1.847
 behavi+1.481
settings+1.355
properties+1.259
resources+1.226
Neg logit contributions
 derail-1.618
 overthrow-1.501
 Dalai-1.485
 Hague-1.471
 Clintons-1.442
Token to
Token ID284
Feature activation+1.31e+01

Pos logit contributions
 derail+9.817
 overthrow+9.005
 Dalai+8.901
 overturn+8.800
 overtake+8.600
Neg logit contributions
worldly-14.283
 behavi-11.912
settings-11.199
properties-10.752
versions-10.488
Token make
Token ID787
Feature activation+2.38e+00

Pos logit contributions
 derail+8.970
 overthrow+8.177
 overtake+7.885
 overturn+7.816
 assassinate+7.387
Neg logit contributions
worldly-15.730
 behavi-14.170
settings-12.529
properties-11.956
resources-11.872
Token it
Token ID340
Feature activation+4.71e-01

Pos logit contributions
 derail+1.943
 overthrow+1.777
 Dalai+1.757
 Hague+1.734
 Clintons+1.710
Neg logit contributions
worldly-2.727
 behavi-2.261
settings-2.079
properties-1.938
resources-1.931
Token in
Token ID287
Feature activation-3.26e+00

Pos logit contributions
 derail+0.413
 overthrow+0.380
 Dalai+0.374
 Hague+0.367
 overtake+0.366
Neg logit contributions
worldly-0.517
 behavi-0.421
settings-0.390
properties-0.365
resources-0.356
Token the
Token ID262
Feature activation+7.66e+00

Pos logit contributions
worldly+3.835
 behavi+3.121
settings+2.885
properties+2.706
versions+2.664
Neg logit contributions
 derail-2.601
 overthrow-2.388
 Dalai-2.352
 Hague-2.333
 overtake-2.278
Tokene
Token ID68
Feature activation-6.32e+00

Pos logit contributions
worldly+0.956
 behavi+0.763
settings+0.682
properties+0.630
versions+0.613
Neg logit contributions
 derail-0.892
 overthrow-0.829
 Dalai-0.822
 Hague-0.816
 overturn-0.799
Token.
Token ID13
Feature activation-4.24e+00

Pos logit contributions
worldly+5.842
 behavi+4.356
settings+4.043
properties+3.713
versions+3.628
Neg logit contributions
 derail-6.608
 overthrow-6.196
 Dalai-6.097
 Hague-6.051
 overtake-5.982
Tokeng
Token ID70
Feature activation-1.06e+00

Pos logit contributions
worldly+4.245
 behavi+3.218
settings+3.034
properties+2.819
resources+2.746
Neg logit contributions
 derail-4.115
 overthrow-3.833
 Dalai-3.771
 Hague-3.729
 overtake-3.706
Token.,
Token ID1539
Feature activation+7.17e+00

Pos logit contributions
worldly+1.156
 behavi+0.925
settings+0.849
properties+0.790
versions+0.774
Neg logit contributions
 derail-0.943
 overthrow-0.874
 Dalai-0.862
 Hague-0.855
 overtake-0.838
Token the
Token ID262
Feature activation+9.77e+00

Pos logit contributions
 derail+5.918
 overthrow+5.481
 overturn+5.462
 Hague+5.418
 Dalai+5.363
Neg logit contributions
worldly-7.959
 behavi-6.385
settings-5.881
properties-5.454
resources-5.282
Token 2015
Token ID1853
Feature activation+1.37e+01

Pos logit contributions
 derail+7.932
 Hague+7.468
 overthrow+7.446
 Dalai+7.437
 overturn+7.266
Neg logit contributions
worldly-10.571
 behavi-8.750
settings-7.749
properties-7.393
resources-7.091
Token referendum
Token ID11467
Feature activation+1.07e+01

Pos logit contributions
 derail+10.817
 Hague+10.407
 overthrow+10.200
 overturn+10.141
 Dalai+10.095
Neg logit contributions
worldly-14.235
 behavi-11.787
settings-10.249
properties-9.726
resources-9.234
Token in
Token ID287
Feature activation+2.80e+00

Pos logit contributions
 derail+7.836
 overturn+7.294
 overthrow+7.170
 Hague+6.924
 Dalai+6.768
Neg logit contributions
worldly-12.313
 behavi-10.514
settings-9.361
properties-8.939
resources-8.600
Token Switzerland
Token ID14679
Feature activation+6.09e+00

Pos logit contributions
 derail+1.787
 overthrow+1.606
 Hague+1.589
 Dalai+1.578
 overturn+1.566
Neg logit contributions
worldly-3.743
 behavi-3.221
settings-2.951
properties-2.802
resources-2.753
Token),
Token ID828
Feature activation+9.10e-01

Pos logit contributions
 derail+4.698
 overturn+4.299
 overthrow+4.272
 Hague+4.230
 Dalai+4.183
Neg logit contributions
worldly-7.185
 behavi-5.978
settings-5.402
properties-5.092
resources-5.030
Token the
Token ID262
Feature activation+5.68e+00

Pos logit contributions
 derail+0.767
 overthrow+0.709
 Hague+0.698
 Dalai+0.692
 overturn+0.691
Neg logit contributions
worldly-1.048
 behavi-0.847
settings-0.767
properties-0.721
versions-0.700
Token9
Token ID24
Feature activation+5.40e+00

Pos logit contributions
 derail+0.153
 overthrow+0.132
 Dalai+0.130
 Hague+0.130
 overturn+0.121
Neg logit contributions
worldly-0.464
 behavi-0.402
settings-0.373
properties-0.355
versions-0.352
Token37
Token ID2718
Feature activation+8.25e+00

Pos logit contributions
 derail+2.628
 Hague+2.249
 Dalai+2.236
 overthrow+2.221
 overturn+2.100
Neg logit contributions
worldly-7.850
 behavi-6.991
settings-6.280
properties-6.032
versions-5.991
Token save
Token ID3613
Feature activation+1.22e+00

Pos logit contributions
 derail+3.661
 overthrow+3.303
 Hague+3.036
 overturn+3.009
 Dalai+2.946
Neg logit contributions
worldly-12.054
 behavi-10.396
settings-9.863
properties-9.605
resources-9.229
Token percentage
Token ID5873
Feature activation+1.56e-01

Pos logit contributions
 derail+1.037
 overthrow+0.962
 Dalai+0.943
 Hague+0.932
 overturn+0.921
Neg logit contributions
worldly-1.372
 behavi-1.110
settings-1.030
properties-0.967
versions-0.933
Token.
Token ID13
Feature activation-2.87e+00

Pos logit contributions
 derail+0.137
 overthrow+0.128
 Dalai+0.125
 Hague+0.125
 overturn+0.124
Neg logit contributions
worldly-0.170
 behavi-0.139
settings-0.127
properties-0.120
resources-0.116
Token
Token ID198
Feature activation+1.27e+01

Pos logit contributions
worldly+3.199
 behavi+2.586
settings+2.364
properties+2.201
versions+2.165
Neg logit contributions
 derail-2.495
 overthrow-2.305
 Dalai-2.287
 Hague-2.271
 Clintons-2.217
Token
Token ID198
Feature activation+9.18e+00

Pos logit contributions
 derail+8.807
 Dalai+8.575
 Hague+8.572
 overturn+8.062
 Clintons+8.028
Neg logit contributions
 behavi-13.197
worldly-12.919
 redes-10.588
 practition-10.524
 referen-9.832
TokenFor
Token ID1890
Feature activation+6.81e-01

Pos logit contributions
 derail+6.874
 Dalai+6.345
 Hague+6.262
 overthrow+6.182
 overturn+6.014
Neg logit contributions
worldly-9.758
 behavi-8.901
settings-7.199
 redes-6.881
properties-6.753
Token another
Token ID1194
Feature activation-2.06e+00

Pos logit contributions
 derail+0.632
 overthrow+0.584
 Dalai+0.576
 Hague+0.576
 overturn+0.565
Neg logit contributions
worldly-0.714
 behavi-0.574
settings-0.517
properties-0.484
versions-0.469
Token thing
Token ID1517
Feature activation+1.73e-01

Pos logit contributions
worldly+2.295
 behavi+1.861
settings+1.712
properties+1.596
versions+1.565
Neg logit contributions
 derail-1.777
 overthrow-1.643
 Dalai-1.618
 Hague-1.601
 overtake-1.570
Token,
Token ID11
Feature activation+7.44e+00

Pos logit contributions
 derail+0.137
 overthrow+0.125
 Dalai+0.124
 Hague+0.122
 overtake+0.119
Neg logit contributions
worldly-0.206
 behavi-0.169
settings-0.157
properties-0.147
versions-0.144
Token signaling
Token ID22049
Feature activation-5.70e+00

Pos logit contributions
worldly+1.672
 behavi+1.343
settings+1.238
properties+1.157
resources+1.129
Neg logit contributions
 derail-1.340
 overthrow-1.240
 Dalai-1.222
 Hague-1.212
 overturn-1.193
Token that
Token ID326
Feature activation-6.68e-01

Pos logit contributions
worldly+6.470
 behavi+5.228
settings+4.807
properties+4.478
versions+4.415
Neg logit contributions
 derail-4.723
 overthrow-4.344
 Dalai-4.296
 Hague-4.253
 overtake-4.181
Token he
Token ID339
Feature activation-5.25e+00

Pos logit contributions
worldly+0.850
 behavi+0.706
settings+0.660
properties+0.624
versions+0.611
Neg logit contributions
 derail-0.475
 overthrow-0.432
 Dalai-0.423
 Hague-0.421
 overturn-0.410
Token has
Token ID468
Feature activation-6.54e-02

Pos logit contributions
worldly+6.605
 behavi+5.469
settings+5.093
properties+4.798
versions+4.729
Neg logit contributions
 derail-3.768
 overthrow-3.423
 Dalai-3.361
 Hague-3.328
 overtake-3.247
Token no
Token ID645
Feature activation-6.89e-01

Pos logit contributions
worldly+0.073
 behavi+0.059
settings+0.055
properties+0.051
resources+0.050
Neg logit contributions
 derail-0.057
 overthrow-0.052
 Dalai-0.051
 Hague-0.051
 overturn-0.050
Token intention
Token ID6778
Feature activation+1.48e+01

Pos logit contributions
worldly+0.730
 behavi+0.586
settings+0.539
properties+0.501
resources+0.486
Neg logit contributions
 derail-0.640
 overthrow-0.596
 Dalai-0.587
 Hague-0.585
 Clintons-0.571
Token of
Token ID286
Feature activation+9.00e+00

Pos logit contributions
 derail+8.457
 overthrow+7.463
 Dalai+7.235
 overturn+7.231
 Hague+7.202
Neg logit contributions
worldly-18.769
 behavi-15.445
settings-14.886
properties-14.324
resources-13.818
Token stopping
Token ID12225
Feature activation+4.60e+00

Pos logit contributions
 derail+7.034
 overthrow+6.494
 overturn+6.411
 Hague+6.309
 Dalai+6.284
Neg logit contributions
worldly-10.342
 behavi-8.476
settings-7.915
properties-7.539
resources-7.269
Token until
Token ID1566
Feature activation+4.96e+00

Pos logit contributions
 derail+3.688
 overthrow+3.385
 Hague+3.367
 Dalai+3.300
 overturn+3.245
Neg logit contributions
worldly-5.321
 behavi-4.360
settings-4.077
properties-3.847
versions-3.709
Token he
Token ID339
Feature activation-2.58e+00

Pos logit contributions
 derail+3.842
 overthrow+3.589
 Hague+3.529
 Dalai+3.459
 overturn+3.409
Neg logit contributions
worldly-5.833
 behavi-4.909
settings-4.457
properties-4.238
versions-4.126
Token is
Token ID318
Feature activation+2.19e+00

Pos logit contributions
worldly+2.883
 behavi+2.326
settings+2.130
properties+1.994
versions+1.951
Neg logit contributions
 derail-2.238
 overthrow-2.071
 Dalai-2.040
 Hague-2.023
 overturn-1.983
Token.
Token ID13
Feature activation-3.96e+00

Pos logit contributions
 derail+1.591
 overthrow+1.465
 Dalai+1.451
 Hague+1.432
 overturn+1.427
Neg logit contributions
worldly-2.166
 behavi-1.761
settings-1.632
properties-1.523
resources-1.484
Token
Token ID1587
Feature activation-1.76e+00

Pos logit contributions
worldly+4.163
 behavi+3.308
settings+3.018
properties+2.798
versions+2.741
Neg logit contributions
 derail-3.664
 overthrow-3.405
 Dalai-3.363
 Hague-3.338
 Clintons-3.271
Token
Token ID241
Feature activation-3.71e+00

Pos logit contributions
worldly+1.919
 behavi+1.544
settings+1.413
properties+1.318
versions+1.291
Neg logit contributions
 derail-1.558
 overthrow-1.447
 Dalai-1.427
 Hague-1.419
 Clintons-1.388
TokenIf
Token ID1532
Feature activation+9.38e+00

Pos logit contributions
worldly+4.116
 behavi+3.316
settings+3.044
properties+2.840
versions+2.786
Neg logit contributions
 derail-3.218
 overthrow-2.975
 Dalai-2.936
 Hague-2.911
 Clintons-2.849
Token these
Token ID777
Feature activation+3.54e-01

Pos logit contributions
 derail+6.798
 Hague+6.196
 Dalai+6.185
 overthrow+6.160
 overturn+6.049
Neg logit contributions
worldly-10.639
 behavi-8.931
settings-8.174
properties-7.707
versions-7.535
Token were
Token ID547
Feature activation+4.88e+00

Pos logit contributions
 derail+0.325
 overthrow+0.303
 Dalai+0.298
 Hague+0.296
 overturn+0.292
Neg logit contributions
worldly-0.373
 behavi-0.301
settings-0.273
properties-0.255
versions-0.248
Token Ray
Token ID7760
Feature activation+6.69e+00

Pos logit contributions
 derail+4.221
 overthrow+3.914
 Dalai+3.878
 Hague+3.838
 overturn+3.801
Neg logit contributions
worldly-5.144
 behavi-4.326
settings-3.846
properties-3.618
resources-3.570
Token Kelly
Token ID9077
Feature activation+6.74e+00

Pos logit contributions
 derail+4.542
 Hague+4.232
 Dalai+4.134
 overthrow+4.130
 Clintons+4.050
Neg logit contributions
worldly-8.381
 behavi-7.120
settings-6.380
properties-6.033
resources-5.984
Token
Token ID447
Feature activation+2.06e+00

Pos logit contributions
 derail+5.501
 Dalai+5.083
 overthrow+5.044
 Hague+4.949
 overturn+4.918
Neg logit contributions
worldly-7.404
 behavi-6.027
settings-5.530
properties-5.187
resources-5.056
Token
Token ID247
Feature activation-2.14e+00

Pos logit contributions
 derail+1.760
 overthrow+1.632
 Dalai+1.625
 Hague+1.606
 Clintons+1.577
Neg logit contributions
worldly-2.292
 behavi-1.864
settings-1.700
properties-1.584
versions-1.550
Tokens
Token ID82
Feature activation-1.26e+00

Pos logit contributions
worldly+2.087
 behavi+1.637
settings+1.466
properties+1.346
versions+1.314
Neg logit contributions
 derail-2.160
 overthrow-2.018
 Dalai-1.997
 Hague-1.980
 overturn-1.947
TokenM
Token ID44
Feature activation+4.11e+00

Pos logit contributions
 derail+6.621
 overthrow+6.034
 Dalai+6.031
 Hague+6.026
 overturn+5.733
Neg logit contributions
worldly-10.168
 behavi-8.984
settings-7.592
properties-7.135
selves-6.999
Tokenbo
Token ID2127
Feature activation+2.25e+00

Pos logit contributions
 derail+3.178
 overthrow+2.914
 Hague+2.879
 Dalai+2.865
 overturn+2.771
Neg logit contributions
worldly-4.741
 behavi-4.050
settings-3.525
properties-3.319
versions-3.243
Token]
Token ID60
Feature activation-4.73e+00

Pos logit contributions
 derail+1.492
 overthrow+1.355
 Dalai+1.340
 Hague+1.321
 overturn+1.275
Neg logit contributions
worldly-2.936
 behavi-2.481
settings-2.276
properties-2.168
resources-2.122
Token<|endoftext|>
Token ID50256
Feature activation+1.94e+00

Pos logit contributions
worldly+4.264
 behavi+3.254
settings+2.927
properties+2.656
versions+2.596
Neg logit contributions
 derail-5.014
 overthrow-4.718
 Dalai-4.644
 Hague-4.633
 overtake-4.572
TokenNo
Token ID2949
Feature activation-8.50e-01

Pos logit contributions
 derail+1.638
 overthrow+1.503
 Dalai+1.496
 Hague+1.472
 overturn+1.440
Neg logit contributions
worldly-2.152
 behavi-1.809
settings-1.584
properties-1.481
versions-1.451
Token one
Token ID530
Feature activation+2.33e+00

Pos logit contributions
worldly+0.904
 behavi+0.727
settings+0.659
properties+0.611
versions+0.601
Neg logit contributions
 derail-0.778
 overthrow-0.721
 Dalai-0.711
 Hague-0.710
 Clintons-0.696
Token at
Token ID379
Feature activation+1.90e+00

Pos logit contributions
 derail+2.152
 overthrow+1.992
 Dalai+1.965
 overturn+1.933
 Hague+1.926
Neg logit contributions
worldly-2.431
 behavi-2.007
settings-1.801
properties-1.686
versions-1.652
Token Los
Token ID5401
Feature activation+3.84e+00

Pos logit contributions
 derail+1.518
 overthrow+1.387
 Dalai+1.374
 Hague+1.357
 Clintons+1.331
Neg logit contributions
worldly-2.242
 behavi-1.844
settings-1.702
properties-1.602
versions-1.567
Token Angeles
Token ID5652
Feature activation+5.44e+00

Pos logit contributions
 derail+2.308
 Hague+2.055
 overthrow+2.033
 Dalai+2.017
 Clintons+1.909
Neg logit contributions
worldly-5.198
 behavi-4.462
settings-4.131
properties-3.893
versions-3.855
Token International
Token ID4037
Feature activation+1.98e-01

Pos logit contributions
 derail+3.582
 overthrow+3.190
 Dalai+3.162
 Hague+3.133
 overturn+3.020
Neg logit contributions
worldly-7.043
 behavi-5.980
settings-5.474
properties-5.165
versions-5.158
Token Airport
Token ID12690
Feature activation-3.06e+00

Pos logit contributions
 derail+0.126
 overthrow+0.112
 Dalai+0.111
 Hague+0.109
 overtake+0.106
Neg logit contributions
worldly-0.266
 behavi-0.224
settings-0.209
properties-0.198
versions-0.195
Tokenone
Token ID505
Feature activation-2.64e+00

Pos logit contributions
worldly+2.312
 behavi+1.851
settings+1.618
properties+1.512
versions+1.478
Neg logit contributions
 derail-2.230
 overthrow-2.074
 Hague-2.048
 Dalai-2.047
 overtake-2.006
Token format
Token ID5794
Feature activation-3.37e+00

Pos logit contributions
worldly+2.664
 behavi+2.096
settings+1.887
properties+1.754
versions+1.713
Neg logit contributions
 derail-2.575
 overthrow-2.403
 Dalai-2.369
 Hague-2.360
 overturn-2.318
Token,
Token ID11
Feature activation-1.73e+00

Pos logit contributions
worldly+3.736
 behavi+3.007
settings+2.745
properties+2.570
versions+2.524
Neg logit contributions
 derail-2.949
 overthrow-2.726
 Dalai-2.681
 Hague-2.677
 overtake-2.624
Token if
Token ID611
Feature activation-2.50e-02

Pos logit contributions
worldly+2.093
 behavi+1.716
settings+1.585
properties+1.490
versions+1.471
Neg logit contributions
 derail-1.335
 overthrow-1.222
 Dalai-1.198
 Hague-1.194
 overturn-1.167
Token you
Token ID345
Feature activation-5.73e+00

Pos logit contributions
worldly+0.031
 behavi+0.026
settings+0.024
properties+0.023
versions+0.022
Neg logit contributions
 derail-0.018
 overthrow-0.017
 Dalai-0.016
 Hague-0.016
 overturn-0.016
Token were
Token ID547
Feature activation+3.66e+00

Pos logit contributions
worldly+6.657
 behavi+5.371
settings+5.060
properties+4.712
versions+4.621
Neg logit contributions
 derail-4.517
 Dalai-4.130
 overthrow-4.125
 Hague-4.093
 Clintons-4.003
Token to
Token ID284
Feature activation+5.20e+00

Pos logit contributions
 derail+2.877
 overthrow+2.632
 Hague+2.575
 Dalai+2.570
 overturn+2.549
Neg logit contributions
worldly-4.248
 behavi-3.516
settings-3.207
properties-3.026
resources-2.990
Token believe
Token ID1975
Feature activation-1.28e-02

Pos logit contributions
 derail+3.832
 overthrow+3.531
 overtake+3.383
 overturn+3.351
 Dalai+3.261
Neg logit contributions
worldly-6.483
 behavi-5.582
settings-4.943
properties-4.721
versions-4.667
Token ze
Token ID41271
Feature activation-3.84e+00

Pos logit contributions
worldly+0.015
 behavi+0.012
settings+0.012
properties+0.011
versions+0.011
Neg logit contributions
 derail-0.010
 overthrow-0.009
 Dalai-0.009
 Hague-0.009
 overturn-0.009
Tokenves
Token ID1158
Feature activation+2.58e+00

Pos logit contributions
worldly+4.801
 behavi+3.975
settings+3.702
properties+3.489
versions+3.434
Neg logit contributions
 derail-2.753
 overthrow-2.496
 Dalai-2.457
 Hague-2.431
 Clintons-2.374
Token and
Token ID290
Feature activation+4.16e+00

Pos logit contributions
 derail+2.227
 overthrow+2.055
 Hague+2.011
 Dalai+2.003
 overtake+1.966
Neg logit contributions
worldly-2.851
 behavi-2.298
settings-2.079
properties-1.954
versions-1.912
Token:
Token ID25
Feature activation+5.15e+00

Pos logit contributions
 derail+4.010
 overthrow+3.653
 Hague+3.514
 overtake+3.497
 Dalai+3.497
Neg logit contributions
worldly-7.229
 behavi-5.902
settings-5.304
properties-5.078
versions-5.032
Tokenset
Token ID2617
Feature activation+5.06e+00

Pos logit contributions
 derail+5.057
 overthrow+4.775
 overtake+4.656
 overturn+4.637
 Dalai+4.602
Neg logit contributions
worldly-4.849
 behavi-3.882
settings-3.100
properties-2.938
resources-2.830
Token dir
Token ID26672
Feature activation+8.18e+00

Pos logit contributions
 derail+4.235
 overthrow+3.929
 overturn+3.776
 overtake+3.762
 Dalai+3.710
Neg logit contributions
worldly-5.671
 behavi-4.571
settings-4.036
properties-3.826
resources-3.765
Token=
Token ID28
Feature activation+6.71e+00

Pos logit contributions
 derail+6.351
 overthrow+5.967
 overtake+5.656
 Dalai+5.637
 Hague+5.612
Neg logit contributions
worldly-8.936
 behavi-7.384
settings-6.265
resources-5.843
properties-5.842
Tokenc
Token ID66
Feature activation+2.40e+00

Pos logit contributions
 derail+5.719
 overthrow+5.284
 Dalai+5.150
 Hague+5.140
 overturn+5.090
Neg logit contributions
worldly-6.970
 behavi-5.815
settings-4.759
properties-4.519
selves-4.450
Token\
Token ID59
Feature activation+4.83e+00

Pos logit contributions
 derail+2.131
 overthrow+1.992
 Dalai+1.932
 Hague+1.924
 Clintons+1.898
Neg logit contributions
worldly-2.592
 behavi-2.079
settings-1.836
properties-1.721
resources-1.692
Token:\
Token ID7479
Feature activation+4.05e+00

Pos logit contributions
 derail+3.874
 overthrow+3.625
 Dalai+3.506
 Hague+3.492
 overturn+3.491
Neg logit contributions
worldly-5.467
 behavi-4.524
settings-3.863
properties-3.706
versions-3.578
Tokentmp
Token ID22065
Feature activation+3.83e+00

Pos logit contributions
 derail+3.335
 overthrow+3.105
 Dalai+3.018
 overturn+3.012
 Hague+3.003
Neg logit contributions
worldly-4.491
 behavi-3.700
settings-3.160
properties-3.018
versions-2.930
Token:
Token ID25
Feature activation+4.12e+00

Pos logit contributions
 derail+2.952
 overthrow+2.730
 Dalai+2.637
 overturn+2.612
 Hague+2.598
Neg logit contributions
worldly-4.510
 behavi-3.695
settings-3.263
properties-3.135
resources-3.040
Token */
Token ID9466
Feature activation-9.50e-01

Pos logit contributions
 derail+3.268
 overthrow+3.037
 Dalai+2.944
 overturn+2.920
 Hague+2.909
Neg logit contributions
worldly-4.724
 behavi-3.849
settings-3.373
properties-3.241
selves-3.133
Token This
Token ID770
Feature activation-4.82e+00

Pos logit contributions
worldly+0.966
 behavi+0.752
settings+0.673
properties+0.624
resources+0.610
Neg logit contributions
 derail-0.926
 overthrow-0.869
 Dalai-0.849
 Hague-0.849
 overturn-0.841
Token to
Token ID284
Feature activation+1.05e+00

Pos logit contributions
worldly+0.548
 behavi+0.446
settings+0.414
properties+0.392
versions+0.386
Neg logit contributions
 derail-0.356
 overthrow-0.325
 Dalai-0.318
 Hague-0.315
 overturn-0.312
Token smoking
Token ID9216
Feature activation-3.93e+00

Pos logit contributions
 derail+0.850
 overthrow+0.776
 Hague+0.758
 Dalai+0.755
 overturn+0.749
Neg logit contributions
worldly-1.255
 behavi-1.014
settings-0.936
properties-0.890
versions-0.864
Token crack
Token ID8469
Feature activation-4.84e+00

Pos logit contributions
worldly+4.987
 behavi+4.135
settings+3.850
properties+3.632
versions+3.574
Neg logit contributions
 derail-2.811
 overthrow-2.552
 Dalai-2.508
 Hague-2.480
 overtake-2.417
Token in
Token ID287
Feature activation-7.79e+00

Pos logit contributions
worldly+5.903
 behavi+4.853
settings+4.500
properties+4.233
versions+4.164
Neg logit contributions
 derail-3.686
 overthrow-3.367
 Dalai-3.311
 Hague-3.280
 overtake-3.204
Token a
Token ID257
Feature activation-9.55e+00

Pos logit contributions
worldly+8.663
 behavi+6.954
settings+6.448
versions+5.991
properties+5.979
Neg logit contributions
 derail-6.474
 overthrow-5.973
 Dalai-5.904
 Hague-5.839
 overtake-5.720
Token
Token ID564
Feature activation+2.32e+00

Pos logit contributions
worldly+11.105
 behavi+8.932
settings+8.353
versions+7.809
properties+7.703
Neg logit contributions
 derail-7.181
 overthrow-6.645
 Dalai-6.496
 Hague-6.395
 overtake-6.287
Token
Token ID250
Feature activation-9.06e+00

Pos logit contributions
 derail+1.740
 Dalai+1.592
 overthrow+1.586
 Hague+1.556
 overturn+1.526
Neg logit contributions
worldly-2.781
 behavi-2.387
settings-2.152
properties-2.027
versions-1.994
Tokendr
Token ID7109
Feature activation+2.71e+00

Pos logit contributions
worldly+10.061
 behavi+7.849
settings+7.472
properties+6.981
resources+6.845
Neg logit contributions
 derail-7.663
 overthrow-7.121
 Dalai-6.965
 Hague-6.882
 Clintons-6.818
Tokenunk
Token ID2954
Feature activation-2.00e+00

Pos logit contributions
 derail+1.984
 overthrow+1.797
 Dalai+1.796
 Hague+1.761
 overtake+1.731
Neg logit contributions
worldly-3.324
 behavi-2.783
settings-2.567
properties-2.438
versions-2.362
Tokenen
Token ID268
Feature activation+1.18e+00

Pos logit contributions
worldly+2.230
 behavi+1.795
settings+1.651
properties+1.544
versions+1.512
Neg logit contributions
 derail-1.726
 overthrow-1.593
 Dalai-1.572
 Hague-1.556
 overtake-1.530
Token st
Token ID336
Feature activation-2.67e+00

Pos logit contributions
 derail+0.999
 overthrow+0.923
 Dalai+0.908
 Hague+0.902
 overtake+0.889
Neg logit contributions
worldly-1.338
 behavi-1.097
settings-1.008
properties-0.953
resources-0.929
Token off
Token ID572
Feature activation-7.84e-01

Pos logit contributions
worldly+4.566
 behavi+3.697
settings+3.408
properties+3.187
versions+3.130
Neg logit contributions
 derail-3.329
 overthrow-3.064
 Dalai-3.021
 Hague-2.996
 overtake-2.929
Token the
Token ID262
Feature activation+3.04e+00

Pos logit contributions
worldly+0.861
 behavi+0.691
settings+0.635
properties+0.592
versions+0.580
Neg logit contributions
 derail-0.690
 overthrow-0.639
 Dalai-0.630
 Hague-0.624
 overtake-0.612
Token bat
Token ID7365
Feature activation+1.48e+00

Pos logit contributions
 derail+2.415
 Dalai+2.206
 overthrow+2.191
 Hague+2.123
 overturn+2.079
Neg logit contributions
worldly-3.609
 behavi-2.978
settings-2.747
properties-2.590
resources-2.540
Token,
Token ID11
Feature activation+5.83e+00

Pos logit contributions
 derail+1.120
 overthrow+1.019
 Dalai+1.011
 Hague+0.988
 overturn+0.971
Neg logit contributions
worldly-1.801
 behavi-1.493
settings-1.384
properties-1.305
versions-1.275
Token we
Token ID356
Feature activation-5.66e+00

Pos logit contributions
 derail+4.190
 overthrow+3.823
 Dalai+3.780
 Hague+3.680
 overturn+3.649
Neg logit contributions
worldly-7.125
 behavi-5.981
settings-5.506
properties-5.219
versions-5.087
Token played
Token ID2826
Feature activation-4.27e+00

Pos logit contributions
worldly+6.224
 behavi+4.881
settings+4.598
properties+4.266
versions+4.199
Neg logit contributions
 derail-4.790
 overthrow-4.420
 Dalai-4.406
 Hague-4.400
 Clintons-4.313
Token the
Token ID262
Feature activation+5.50e+00

Pos logit contributions
worldly+4.686
 behavi+3.760
settings+3.451
properties+3.215
versions+3.153
Neg logit contributions
 derail-3.744
 overthrow-3.463
 Dalai-3.414
 Hague-3.389
 overtake-3.318
Token "
Token ID366
Feature activation+5.70e+00

Pos logit contributions
 derail+4.838
 overthrow+4.534
 Dalai+4.503
 Hague+4.325
 Clintons+4.325
Neg logit contributions
worldly-5.810
 behavi-4.756
settings-4.354
properties-4.041
resources-3.968
TokenWhat
Token ID2061
Feature activation+1.82e+00

Pos logit contributions
 derail+5.034
 Dalai+4.643
 overthrow+4.642
 Hague+4.544
 Clintons+4.467
Neg logit contributions
worldly-5.656
 behavi-4.800
settings-4.063
properties-3.754
versions-3.661
Token if
Token ID611
Feature activation+1.66e+01

Pos logit contributions
 derail+1.476
 Dalai+1.365
 overthrow+1.359
 Hague+1.327
 Clintons+1.312
Neg logit contributions
worldly-2.057
 behavi-1.712
settings-1.574
properties-1.473
versions-1.429
Token the
Token ID262
Feature activation+1.51e+01

Pos logit contributions
 derail+11.558
 Dalai+11.087
 overthrow+10.648
 Hague+10.467
 Clintons+10.356
Neg logit contributions
worldly-16.388
 behavi-14.123
settings-12.591
properties-11.746
versions-11.494
Token and
Token ID290
Feature activation+9.15e-01

Pos logit contributions
 derail+2.309
 overthrow+2.106
 Dalai+2.078
 Hague+2.044
 overturn+2.035
Neg logit contributions
worldly-4.218
 behavi-3.521
settings-3.285
properties-3.114
versions-3.069
Token international
Token ID3230
Feature activation-1.04e+01

Pos logit contributions
 derail+0.874
 overthrow+0.810
 Dalai+0.808
 Hague+0.798
 overturn+0.790
Neg logit contributions
worldly-0.930
 behavi-0.741
settings-0.680
properties-0.631
versions-0.616
Token co
Token ID763
Feature activation-7.67e+00

Pos logit contributions
worldly+10.297
 behavi+8.200
settings+7.448
properties+6.872
resources+6.857
Neg logit contributions
 derail-9.313
 overthrow-8.702
 Dalai-8.652
 Hague-8.511
 overtake-8.447
Token-
Token ID12
Feature activation-1.03e+01

Pos logit contributions
worldly+7.520
 behavi+5.800
settings+5.383
properties+4.981
versions+4.947
Neg logit contributions
 derail-7.371
 overthrow-6.917
 Dalai-6.806
 Hague-6.636
 Clintons-6.591
Tokenoperation
Token ID27184
Feature activation-2.52e+00

Pos logit contributions
worldly+7.926
 behavi+5.524
settings+5.155
properties+4.695
versions+4.654
Neg logit contributions
 derail-11.726
 overthrow-11.201
 Dalai-11.023
 Clintons-10.797
 overtake-10.786
Token.
Token ID13
Feature activation-7.49e+00

Pos logit contributions
worldly+2.763
 behavi+2.218
settings+2.030
properties+1.892
versions+1.856
Neg logit contributions
 derail-2.242
 overthrow-2.074
 Dalai-2.046
 Hague-2.035
 overturn-1.995
Token
Token ID198
Feature activation+7.20e+00

Pos logit contributions
worldly+7.934
 behavi+6.231
settings+5.852
properties+5.496
resources+5.335
Neg logit contributions
 derail-6.665
 overthrow-6.187
 overtake-6.001
 Dalai-5.930
 assassinate-5.872
Token
Token ID198
Feature activation+1.79e+00

Pos logit contributions
 derail+6.253
 Dalai+5.862
 Hague+5.850
 overthrow+5.772
 overturn+5.715
Neg logit contributions
worldly-7.261
 behavi-7.031
 practition-5.350
 redes-5.277
settings-5.212
TokenJust
Token ID5703
Feature activation+3.48e-01

Pos logit contributions
 derail+1.560
 Dalai+1.465
 overthrow+1.434
 Hague+1.430
 overturn+1.394
Neg logit contributions
worldly-1.936
 behavi-1.643
settings-1.396
properties-1.301
versions-1.289
Token five
Token ID1936
Feature activation+6.88e+00

Pos logit contributions
 derail+0.303
 overthrow+0.282
 Dalai+0.275
 Hague+0.275
 overturn+0.272
Neg logit contributions
worldly-0.387
 behavi-0.312
settings-0.285
properties-0.263
versions-0.261
Token years
Token ID812
Feature activation+1.22e+01

Pos logit contributions
 derail+5.315
 Dalai+4.842
 overthrow+4.816
 Hague+4.767
 overturn+4.677
Neg logit contributions
worldly-7.828
 behavi-6.590
settings-6.035
properties-5.673
versions-5.641
Token on
Token ID319
Feature activation-6.56e+00

Pos logit contributions
worldly+1.009
 behavi+0.830
settings+0.764
properties+0.714
resources+0.700
Neg logit contributions
 derail-0.699
 overthrow-0.638
 Dalai-0.628
 Hague-0.626
 overturn-0.614
Token New
Token ID968
Feature activation-3.63e+00

Pos logit contributions
worldly+7.595
 behavi+6.126
settings+5.630
properties+5.281
versions+5.267
Neg logit contributions
 derail-5.225
 overthrow-4.824
 Dalai-4.769
 Hague-4.681
 overtake-4.622
Token York
Token ID1971
Feature activation+8.55e-02

Pos logit contributions
worldly+4.566
 behavi+3.742
settings+3.512
properties+3.320
versions+3.264
Neg logit contributions
 derail-2.596
 overthrow-2.365
 Dalai-2.318
 Hague-2.267
 overtake-2.241
Token
Token ID447
Feature activation+5.55e-01

Pos logit contributions
 derail+0.069
 overthrow+0.063
 Dalai+0.062
 Hague+0.062
 overturn+0.061
Neg logit contributions
worldly-0.100
 behavi-0.082
settings-0.076
properties-0.071
versions-0.070
Token
Token ID247
Feature activation-6.08e+00

Pos logit contributions
 derail+0.494
 overthrow+0.459
 Dalai+0.456
 Hague+0.452
 overturn+0.441
Neg logit contributions
worldly-0.599
 behavi-0.488
settings-0.442
properties-0.409
versions-0.402
Tokens
Token ID82
Feature activation-3.50e+00

Pos logit contributions
worldly+5.998
 behavi+4.653
settings+4.250
properties+3.924
resources+3.822
Neg logit contributions
 derail-5.984
 overthrow-5.593
 Dalai-5.520
 Hague-5.467
 Clintons-5.378
Token waterfront
Token ID40250
Feature activation-8.78e+00

Pos logit contributions
worldly+4.451
 behavi+3.681
settings+3.430
properties+3.241
versions+3.192
Neg logit contributions
 derail-2.456
 overthrow-2.227
 Dalai-2.191
 Hague-2.162
 overtake-2.113
Token,
Token ID11
Feature activation+7.30e-01

Pos logit contributions
worldly+9.182
 behavi+7.277
settings+6.702
properties+6.246
resources+6.165
Neg logit contributions
 derail-7.642
 overthrow-7.130
 Dalai-7.048
 Hague-7.028
 Clintons-6.823
Token but
Token ID475
Feature activation-3.33e+00

Pos logit contributions
 derail+0.591
 overthrow+0.540
 Dalai+0.533
 Hague+0.528
 overturn+0.527
Neg logit contributions
worldly-0.860
 behavi-0.698
settings-0.648
properties-0.603
versions-0.592
Token the
Token ID262
Feature activation+5.74e+00

Pos logit contributions
worldly+3.789
 behavi+3.072
settings+2.825
properties+2.637
versions+2.589
Neg logit contributions
 derail-2.809
 overthrow-2.588
 Dalai-2.552
 Hague-2.529
 Clintons-2.479
Token building
Token ID2615
Feature activation-3.25e+00

Pos logit contributions
 derail+5.217
 overthrow+4.828
 Dalai+4.758
 Clintons+4.748
 Hague+4.735
Neg logit contributions
worldly-5.922
 behavi-4.816
settings-4.405
properties-4.052
resources-3.975
Token less
Token ID1342
Feature activation-2.58e+00

Pos logit contributions
worldly+0.680
 behavi+0.563
settings+0.518
properties+0.477
resources+0.470
Neg logit contributions
 derail-0.481
 overthrow-0.442
 Dalai-0.435
 Hague-0.432
 Clintons-0.427
Token tax
Token ID1687
Feature activation-2.53e+00

Pos logit contributions
worldly+2.915
 behavi+2.381
settings+2.184
properties+2.024
resources+1.991
Neg logit contributions
 derail-2.205
 overthrow-2.040
 Dalai-2.001
 Hague-1.997
 Clintons-1.955
Token,
Token ID11
Feature activation+2.71e+00

Pos logit contributions
worldly+2.857
 behavi+2.322
settings+2.132
properties+1.988
resources+1.945
Neg logit contributions
 derail-2.169
 overthrow-2.003
 Dalai-1.962
 Hague-1.959
 overturn-1.922
Token while
Token ID981
Feature activation-3.36e+00

Pos logit contributions
 derail+2.231
 overthrow+2.030
 overturn+1.996
 Hague+1.978
 Dalai+1.965
Neg logit contributions
worldly-3.200
 behavi-2.573
settings-2.401
properties-2.231
resources-2.178
Token a
Token ID257
Feature activation-4.31e+00

Pos logit contributions
worldly+4.005
 behavi+3.279
settings+3.036
properties+2.850
versions+2.806
Neg logit contributions
 derail-2.626
 overthrow-2.411
 Dalai-2.374
 Hague-2.351
 overtake-2.296
Token third
Token ID2368
Feature activation-1.24e+00

Pos logit contributions
worldly+4.809
 behavi+3.876
settings+3.563
properties+3.325
versions+3.256
Neg logit contributions
 derail-3.717
 overthrow-3.433
 Dalai-3.384
 Hague-3.357
 overtake-3.285
Token,
Token ID11
Feature activation+8.25e+00

Pos logit contributions
worldly+1.336
 behavi+1.082
settings+0.989
properties+0.927
resources+0.902
Neg logit contributions
 derail-1.123
 overthrow-1.043
 Dalai-1.024
 Hague-1.016
 overturn-1.011
Token less
Token ID1342
Feature activation-2.12e+00

Pos logit contributions
 derail+7.060
 overturn+6.642
 overthrow+6.462
 Hague+6.356
 Dalai+6.295
Neg logit contributions
worldly-8.851
 behavi-6.959
settings-6.427
properties-6.042
selves-5.729
Token-
Token ID12
Feature activation-3.07e+00

Pos logit contributions
worldly+2.098
 behavi+1.664
settings+1.506
properties+1.394
resources+1.353
Neg logit contributions
 derail-2.116
 overthrow-1.978
 Dalai-1.938
 Hague-1.933
 overturn-1.923
Tokendeveloped
Token ID33082
Feature activation-3.40e+00

Pos logit contributions
worldly+2.916
 behavi+2.316
settings+2.021
properties+1.861
versions+1.797
Neg logit contributions
 derail-3.149
 overthrow-2.944
 Dalai-2.909
 Hague-2.894
 overturn-2.859
Token proposal
Token ID6961
Feature activation+4.89e+00

Pos logit contributions
worldly+3.907
 behavi+3.185
settings+2.927
properties+2.753
resources+2.684
Neg logit contributions
 derail-2.835
 overthrow-2.618
 Dalai-2.573
 Hague-2.559
 overturn-2.500
Token and
Token ID290
Feature activation+3.97e+00

Pos logit contributions
 derail+1.884
 overthrow+1.735
 Dalai+1.700
 overtake+1.667
 Hague+1.662
Neg logit contributions
worldly-2.116
 behavi-1.700
settings-1.555
properties-1.471
resources-1.412
Token best
Token ID1266
Feature activation-2.25e+00

Pos logit contributions
 derail+2.919
 overthrow+2.623
 Dalai+2.542
 Hague+2.510
 overturn+2.502
Neg logit contributions
worldly-4.898
 behavi-4.059
settings-3.792
properties-3.628
resources-3.453
Token way
Token ID835
Feature activation+1.98e+00

Pos logit contributions
worldly+2.257
 behavi+1.771
settings+1.606
properties+1.495
resources+1.445
Neg logit contributions
 derail-2.216
 overthrow-2.061
 Dalai-2.034
 Hague-2.007
 overtake-1.984
Token to
Token ID284
Feature activation+5.55e+00

Pos logit contributions
 derail+1.572
 overthrow+1.419
 Dalai+1.410
 Hague+1.387
 overturn+1.373
Neg logit contributions
worldly-2.343
 behavi-1.938
settings-1.774
properties-1.687
versions-1.643
Token accomplish
Token ID9989
Feature activation+1.19e+00

Pos logit contributions
 derail+3.823
 overthrow+3.417
 overtake+3.272
 overturn+3.219
 Dalai+3.131
Neg logit contributions
worldly-7.243
 behavi-6.167
settings-5.628
properties-5.390
resources-5.270
Token this
Token ID428
Feature activation-5.72e+00

Pos logit contributions
 derail+0.979
 overthrow+0.891
 Dalai+0.873
 Hague+0.860
 overturn+0.853
Neg logit contributions
worldly-1.392
 behavi-1.135
settings-1.047
properties-0.999
resources-0.961
Token?
Token ID30
Feature activation+7.04e-01

Pos logit contributions
worldly+5.620
 behavi+4.383
settings+3.951
properties+3.619
versions+3.574
Neg logit contributions
 derail-5.613
 overthrow-5.243
 Dalai-5.184
 Hague-5.174
 Clintons-5.081
Token Lever
Token ID42365
Feature activation+1.25e+00

Pos logit contributions
 derail+0.562
 overthrow+0.512
 Dalai+0.508
 Hague+0.504
 overtake+0.488
Neg logit contributions
worldly-0.838
 behavi-0.696
settings-0.630
properties-0.594
versions-0.582
Tokenage
Token ID496
Feature activation-1.33e+00

Pos logit contributions
 derail+1.014
 overthrow+0.930
 Dalai+0.907
 Hague+0.894
 overturn+0.884
Neg logit contributions
worldly-1.459
 behavi-1.194
settings-1.098
properties-1.037
versions-1.016
Token your
Token ID534
Feature activation-8.32e+00

Pos logit contributions
worldly+1.526
 behavi+1.235
settings+1.138
properties+1.078
resources+1.037
Neg logit contributions
 derail-1.125
 overthrow-1.027
 Dalai-1.011
 Hague-1.000
 overturn-0.987
Token natural
Token ID3288
Feature activation-7.24e+00

Pos logit contributions
worldly+8.937
 behavi+7.152
settings+6.592
versions+6.092
resources+6.046
Neg logit contributions
 derail-7.115
 overthrow-6.644
 Hague-6.576
 Dalai-6.565
 Clintons-6.405
Token live
Token ID2107
Feature activation-5.80e+00

Pos logit contributions
 derail+4.775
 overthrow+4.509
 overturn+4.268
 overtake+4.217
 Dalai+4.104
Neg logit contributions
worldly-7.679
 behavi-6.557
settings-5.980
properties-5.676
resources-5.546
Token harmon
Token ID25625
Feature activation-6.69e+00

Pos logit contributions
worldly+6.350
 behavi+5.084
settings+4.663
properties+4.333
versions+4.274
Neg logit contributions
 derail-5.067
 overthrow-4.668
 Dalai-4.614
 Hague-4.595
 Clintons-4.491
Tokeniously
Token ID6819
Feature activation-5.76e+00

Pos logit contributions
worldly+5.091
 behavi+3.445
settings+3.168
properties+2.789
versions+2.681
Neg logit contributions
 derail-8.096
 overthrow-7.617
 Dalai-7.583
 Hague-7.500
 Clintons-7.453
Token with
Token ID351
Feature activation-2.15e+00

Pos logit contributions
worldly+6.420
 behavi+5.170
settings+4.748
properties+4.429
versions+4.347
Neg logit contributions
 derail-4.963
 overthrow-4.578
 Dalai-4.515
 Hague-4.483
 overtake-4.388
Token your
Token ID534
Feature activation-8.91e+00

Pos logit contributions
worldly+2.521
 behavi+2.062
settings+1.909
properties+1.789
versions+1.750
Neg logit contributions
 derail-1.737
 overthrow-1.602
 Dalai-1.574
 Hague-1.558
 overtake-1.527
Token fellow
Token ID5891
Feature activation-1.10e+01

Pos logit contributions
worldly+9.814
 behavi+7.834
settings+7.169
properties+6.698
versions+6.681
Neg logit contributions
 derail-7.440
 overthrow-6.758
 Hague-6.704
 Dalai-6.663
 assassinate-6.537
Token men
Token ID1450
Feature activation-1.36e+00

Pos logit contributions
worldly+13.186
 behavi+10.413
settings+9.759
properties+9.275
versions+9.215
Neg logit contributions
 derail-7.984
 overthrow-7.271
 Hague-7.178
 Dalai-7.078
 overtake-6.955
Token.
Token ID13
Feature activation-3.92e+00

Pos logit contributions
worldly+1.623
 behavi+1.332
settings+1.233
properties+1.156
versions+1.137
Neg logit contributions
 derail-1.068
 overthrow-0.984
 Dalai-0.969
 Hague-0.956
 overturn-0.939
Token I
Token ID314
Feature activation-9.80e+00

Pos logit contributions
worldly+4.794
 behavi+3.936
settings+3.663
properties+3.451
versions+3.386
Neg logit contributions
 derail-2.946
 overthrow-2.687
 Dalai-2.635
 Hague-2.615
 overtake-2.557
Token leave
Token ID2666
Feature activation-3.53e+00

Pos logit contributions
worldly+11.981
 behavi+9.785
settings+9.111
properties+8.601
resources+8.467
Neg logit contributions
 derail-6.389
 Hague-5.950
 Dalai-5.720
 Clintons-5.684
 overthrow-5.683
Token you
Token ID345
Feature activation-6.57e+00

Pos logit contributions
worldly+4.683
 behavi+3.925
settings+3.665
properties+3.472
resources+3.421
Neg logit contributions
 derail-2.255
 overthrow-2.026
 Dalai-1.996
 Hague-1.971
 overtake-1.911
Token reg
Token ID842
Feature activation-2.28e+00

Pos logit contributions
worldly+4.535
 behavi+3.860
settings+3.625
properties+3.457
versions+3.409
Neg logit contributions
 derail-1.632
 overthrow-1.425
 Dalai-1.395
 Hague-1.372
 overturn-1.323
Tokenains
Token ID1299
Feature activation-2.23e+00

Pos logit contributions
worldly+2.220
 behavi+1.745
settings+1.543
properties+1.420
versions+1.390
Neg logit contributions
 derail-2.307
 overthrow-2.154
 Dalai-2.133
 Hague-2.113
 overtake-2.080
Token some
Token ID617
Feature activation-7.21e+00

Pos logit contributions
worldly+2.371
 behavi+1.886
settings+1.728
properties+1.608
versions+1.572
Neg logit contributions
 derail-2.059
 overthrow-1.909
 Dalai-1.880
 Hague-1.866
 overturn-1.842
Token continuity
Token ID24216
Feature activation+2.31e-01

Pos logit contributions
worldly+8.767
 behavi+7.198
settings+6.660
properties+6.261
resources+6.177
Neg logit contributions
 derail-5.296
 overthrow-4.862
 Dalai-4.807
 Hague-4.761
 Clintons-4.616
Token in
Token ID287
Feature activation-6.91e+00

Pos logit contributions
 derail+0.186
 overthrow+0.169
 Dalai+0.167
 Hague+0.165
 overtake+0.163
Neg logit contributions
worldly-0.272
 behavi-0.221
settings-0.205
properties-0.193
versions-0.191
Token his
Token ID465
Feature activation-1.10e+01

Pos logit contributions
worldly+7.316
 behavi+5.855
settings+5.343
properties+4.937
versions+4.878
Neg logit contributions
 derail-6.205
 overthrow-5.754
 Dalai-5.701
 Hague-5.649
 Clintons-5.553
Token training
Token ID3047
Feature activation-9.52e+00

Pos logit contributions
worldly+11.659
 behavi+9.485
settings+8.649
properties+7.955
versions+7.865
Neg logit contributions
 derail-9.084
 Dalai-8.471
 Hague-8.369
 overthrow-8.363
 Clintons-8.179
Token.
Token ID13
Feature activation-8.06e+00

Pos logit contributions
worldly+10.520
 behavi+8.681
settings+7.961
properties+7.318
versions+7.220
Neg logit contributions
 derail-7.699
 overthrow-7.193
 Dalai-7.165
 Hague-7.126
 Clintons-6.916
Token
Token ID198
Feature activation+5.07e+00

Pos logit contributions
worldly+11.497
 behavi+9.824
settings+9.307
properties+8.833
resources+8.714
Neg logit contributions
 derail-4.089
 overthrow-3.592
 overtake-3.344
 Dalai-3.328
 Hague-3.319
Token
Token ID198
Feature activation+2.27e+00

Pos logit contributions
 derail+4.664
 Dalai+4.390
 Hague+4.355
 overthrow+4.327
 overturn+4.247
Neg logit contributions
worldly-5.022
 behavi-4.668
settings-3.565
 redes-3.441
 practition-3.336
Token
Token ID447
Feature activation-4.39e+00

Pos logit contributions
 derail+1.215
 Dalai+1.077
 Hague+1.044
 overthrow+1.043
 overturn+0.991
Neg logit contributions
worldly-3.212
 behavi-2.797
settings-2.533
properties-2.433
versions-2.381
Token what
Token ID644
Feature activation-8.22e+00

Pos logit contributions
worldly+8.776
 behavi+6.966
settings+6.462
properties+5.982
versions+5.922
Neg logit contributions
 derail-7.222
 overthrow-6.729
 Dalai-6.650
 Hague-6.571
 Clintons-6.451
Token's
Token ID338
Feature activation-3.10e+00

Pos logit contributions
worldly+9.969
 behavi+8.116
settings+7.613
properties+7.133
versions+7.027
Neg logit contributions
 derail-6.085
 overthrow-5.598
 Dalai-5.492
 Hague-5.430
 Clintons-5.314
Token new
Token ID649
Feature activation-8.67e+00

Pos logit contributions
worldly+3.040
 behavi+2.377
settings+2.144
properties+1.973
versions+1.923
Neg logit contributions
 derail-3.099
 overthrow-2.894
 Dalai-2.855
 Hague-2.839
 overtake-2.789
Token on
Token ID319
Feature activation-1.02e+01

Pos logit contributions
worldly+9.685
 behavi+7.768
settings+7.220
properties+6.705
versions+6.642
Neg logit contributions
 derail-7.167
 overthrow-6.623
 Dalai-6.565
 Hague-6.524
 Clintons-6.351
Token your
Token ID534
Feature activation-1.46e+01

Pos logit contributions
worldly+10.924
 behavi+8.599
settings+8.000
versions+7.485
properties+7.314
Neg logit contributions
 derail-8.548
 overthrow-7.972
 Hague-7.882
 Dalai-7.838
 Clintons-7.666
Token favorite
Token ID4004
Feature activation-1.03e+01

Pos logit contributions
worldly+15.474
 behavi+12.242
settings+11.701
properties+10.820
versions+10.804
Neg logit contributions
 derail-11.174
 Hague-10.499
 Dalai-10.348
 overthrow-10.280
 overturn-10.071
Token chess
Token ID19780
Feature activation-7.93e+00

Pos logit contributions
worldly+10.528
 behavi+8.142
settings+7.682
versions+7.010
properties+7.001
Neg logit contributions
 derail-9.290
 Hague-8.657
 overthrow-8.652
 Dalai-8.596
 Clintons-8.351
Token site
Token ID2524
Feature activation-9.01e+00

Pos logit contributions
worldly+7.671
 behavi+5.878
settings+5.477
properties+4.976
versions+4.929
Neg logit contributions
 derail-7.715
 overthrow-7.250
 Dalai-7.229
 Hague-7.141
 Clintons-7.002
Token:
Token ID25
Feature activation-6.15e+00

Pos logit contributions
worldly+9.923
 behavi+7.895
settings+7.407
properties+6.868
versions+6.748
Neg logit contributions
 derail-7.516
 overthrow-6.978
 Dalai-6.907
 Hague-6.862
 overtake-6.674
Token
Token ID198
Feature activation+2.25e+00

Pos logit contributions
worldly+5.764
 behavi+4.442
settings+4.009
properties+3.665
versions+3.582
Neg logit contributions
 derail-6.332
 overthrow-5.935
 Dalai-5.866
 Hague-5.834
 Clintons-5.731
Token
Token ID198
Feature activation-3.88e-01

Pos logit contributions
 derail+2.233
 Dalai+2.087
 overthrow+2.085
 Hague+2.072
 overturn+2.028
Neg logit contributions
worldly-2.147
 behavi-1.858
settings-1.487
properties-1.362
versions-1.335
Token take
Token ID1011
Feature activation-8.03e-01

Pos logit contributions
 derail+5.474
 overthrow+5.111
 overturn+4.901
 overtake+4.758
 assassinate+4.687
Neg logit contributions
worldly-9.162
 behavi-7.952
settings-7.121
properties-6.754
versions-6.651
Token human
Token ID1692
Feature activation-8.98e+00

Pos logit contributions
worldly+0.897
 behavi+0.724
settings+0.668
properties+0.622
versions+0.610
Neg logit contributions
 derail-0.697
 overthrow-0.648
 Dalai-0.637
 Hague-0.636
 overturn-0.621
Token rights
Token ID2489
Feature activation+9.59e-01

Pos logit contributions
worldly+8.066
 behavi+6.177
settings+5.466
properties+5.044
resources+5.037
Neg logit contributions
 derail-9.141
 overthrow-8.525
 Dalai-8.506
 Clintons-8.394
 Hague-8.374
Token,"
Token ID553
Feature activation-2.14e+00

Pos logit contributions
 derail+0.810
 overthrow+0.756
 Dalai+0.738
 Hague+0.735
 overturn+0.716
Neg logit contributions
worldly-1.084
 behavi-0.883
settings-0.813
properties-0.766
versions-0.756
Token he
Token ID339
Feature activation-1.50e+01

Pos logit contributions
worldly+2.647
 behavi+2.183
settings+2.028
properties+1.909
versions+1.879
Neg logit contributions
 derail-1.584
 overthrow-1.443
 Dalai-1.420
 Hague-1.406
 overtake-1.370
Token continued
Token ID3767
Feature activation-1.45e+01

Pos logit contributions
worldly+12.105
 behavi+8.647
settings+8.061
properties+7.480
resources+7.250
Neg logit contributions
 derail-14.570
 overthrow-13.719
 Dalai-13.565
 overtake-13.506
 assassinate-13.353
Token.
Token ID13
Feature activation-1.05e+01

Pos logit contributions
worldly+11.782
 behavi+9.054
settings+8.024
resources+7.144
properties+7.088
Neg logit contributions
 derail-13.680
 overthrow-12.973
 Dalai-12.876
 overtake-12.792
 Hague-12.614
Token
Token ID198
Feature activation+6.93e+00

Pos logit contributions
worldly+8.832
 behavi+6.700
settings+6.168
properties+5.673
resources+5.554
Neg logit contributions
 derail-10.687
 overthrow-10.053
 overtake-9.982
 Dalai-9.901
 assassinate-9.727
Token
Token ID198
Feature activation+9.43e-01

Pos logit contributions
 derail+6.112
 Dalai+5.716
 Hague+5.709
 overthrow+5.637
 overturn+5.533
Neg logit contributions
worldly-6.954
 behavi-6.658
 redes-5.013
 practition-4.992
settings-4.985
TokenLa
Token ID14772
Feature activation-4.81e+00

Pos logit contributions
 derail+0.755
 Dalai+0.695
 overthrow+0.688
 Hague+0.682
 overturn+0.663
Neg logit contributions
worldly-1.094
 behavi-0.940
settings-0.811
properties-0.761
versions-0.749
Token Rue
Token ID45363
Feature activation-4.88e+00

Pos logit contributions
worldly+6.801
 behavi+5.645
settings+5.438
properties+5.152
resources+5.093
Neg logit contributions
 derail-2.582
 overthrow-2.273
 Dalai-2.221
 overtake-2.139
 Clintons-2.136
Token the
Token ID262
Feature activation-7.49e-01

Pos logit contributions
worldly+4.859
 behavi+3.910
settings+3.590
properties+3.345
versions+3.282
Neg logit contributions
 derail-3.835
 overthrow-3.546
 Dalai-3.497
 Hague-3.468
 overtake-3.397
Token nine
Token ID5193
Feature activation-5.94e-02

Pos logit contributions
worldly+0.904
 behavi+0.752
settings+0.692
properties+0.649
versions+0.639
Neg logit contributions
 derail-0.582
 overthrow-0.533
 Dalai-0.527
 Hague-0.520
 Clintons-0.511
Token team
Token ID1074
Feature activation-2.92e+00

Pos logit contributions
worldly+0.066
 behavi+0.054
settings+0.049
properties+0.046
versions+0.045
Neg logit contributions
 derail-0.052
 overthrow-0.047
 Dalai-0.047
 Hague-0.047
 Clintons-0.046
Token M
Token ID337
Feature activation-4.19e+00

Pos logit contributions
worldly+3.191
 behavi+2.570
settings+2.346
properties+2.184
resources+2.139
Neg logit contributions
 derail-2.610
 overthrow-2.414
 Dalai-2.378
 Hague-2.361
 Clintons-2.315
TokenWC
Token ID27353
Feature activation-1.81e+00

Pos logit contributions
worldly+4.442
 behavi+3.518
settings+3.248
properties+3.013
versions+2.945
Neg logit contributions
 derail-3.806
 overthrow-3.529
 Dalai-3.505
 Hague-3.446
 overtake-3.382
Token.
Token ID13
Feature activation-1.11e+01

Pos logit contributions
worldly+2.014
 behavi+1.639
settings+1.489
properties+1.390
resources+1.364
Neg logit contributions
 derail-1.593
 overthrow-1.463
 Dalai-1.444
 Hague-1.431
 Clintons-1.412
Token
Token ID198
Feature activation+4.46e+00

Pos logit contributions
worldly+11.268
 behavi+8.465
settings+8.309
properties+8.000
resources+7.590
Neg logit contributions
 derail-9.656
 overthrow-9.023
 overtake-8.805
 assassinate-8.690
 overturn-8.545
Token
Token ID198
Feature activation-6.07e-01

Pos logit contributions
 derail+4.247
 Dalai+3.979
 Hague+3.971
 overthrow+3.954
 overturn+3.871
Neg logit contributions
worldly-4.341
 behavi-3.924
settings-3.066
 redes-2.833
properties-2.797
TokenI
Token ID40
Feature activation-1.25e+01

Pos logit contributions
worldly+0.650
 behavi+0.540
settings+0.471
properties+0.435
versions+0.429
Neg logit contributions
 derail-0.545
 Dalai-0.503
 overthrow-0.503
 Hague-0.497
 Clintons-0.485
Token will
Token ID481
Feature activation+2.57e+00

Pos logit contributions
worldly+12.756
 behavi+9.586
settings+9.274
properties+8.627
resources+8.489
Neg logit contributions
 derail-10.326
 overthrow-9.584
 Dalai-9.554
 Hague-9.486
 Clintons-9.354
Token reiterate
Token ID49852
Feature activation-1.31e+00

Pos logit contributions
 derail+1.843
 overthrow+1.660
 Dalai+1.598
 Hague+1.596
 overturn+1.581
Neg logit contributions
worldly-3.291
 behavi-2.778
settings-2.539
properties-2.392
versions-2.364