TOP ACTIVATIONS
MAX = 4.161

" The Finnish constitution guarantees everyone the right to
States Constitution . Article IV , Section 1 reads , in
the unfounded claim that Hillary Clinton s aide ,
a decision or issued a written opinion on that issue .
the president didn 't sign a single bill into law limiting
be required to give effect to any public act , record
under Title VI of the Civil Rights Act of 1964 .
d tried to steal a political banner while visiting the country
to Prof V ire nder on 23 February .
, record , or judicial proceeding of any other State ,
as the unfounded claim that Hillary Clinton s aide
, who had filed a discrimination complaint against the club on
effect to any public act , record , or judicial proceeding
did not issue the total Loving v . Virginia victory that
hold that the Four teenth Amendment 's equal protection clause granted
US election ] and think Trump is in with all the
pussy .... I have never said that in my life to
2015 is " in violation of Pakistan law ." It seems
s aide , Huma Abedin , is connected to the Muslim
because the idea that the president simply says criminal laws that

MIN ACTIVATIONS
MIN = -4.231

declining sugar industry that can help rejuven ate both the local
- new terrain management technology help F - 150 Rapt or
a lot of money and help in achieve its climate and
aluminum - all oy body help save more than 500 pounds
was just too expensive for many theaters . Ironically
2009 in a bid to help economic recovery .
and the like to help the world 's two largest
Workers represents nearly 3 , 000 workers at Manitoba Hydro .
improvements leading up to it helped stop Firefox users from defect
individuals have come forward to help the affected . In many
wildlife habitats and urban environments can not only co - exist
enty of new airports but few passengers in China
S . military could help focus and shape a coherent
Z ander F ag erson will pack down in the front
destabil ize the grid could help that jump happen .
the quarterbacks , who continued to split first - team repet
- back pairing , they helped increase the pressure on the
creek . The wider approaches will require new retaining walls ,
were hefty investments expected to help the plants remain in operation
checked drivers and utilities will help with that .

INTERVAL 2.063 - 4.161
CONTAINS 1.414%

attempted to defend the remarks he made about his fellow GOP
you how much part of , you know , the global
- economic construct than a statistical category which could capture the
1 . The government cannot revoke or deny a permit based
previous releases of leaked secret US government documents , in July

INTERVAL -0.035 - 2.063
CONTAINS 48.831%

lead actress winner Emma Stone and La La Land
year , on October 7 , he succeeded in delivering cargo
and vice versa that does not invoke any biological sex or
Sexual orientation : Sexual orientation is about romantic and sexual attraction
November , candidates from each party need net positives of at

INTERVAL -2.133 - -0.035
CONTAINS 48.763%

presidential contender Donald Trump has echoed . The longer
rent , transport and medical costs , rates and rising utility
the public have been looted by ag it ators , vandal
mix to come to a boil , we chop a handful
the extra mile to ensure that your experience with us is

INTERVAL -4.231 - -2.133
CONTAINS 0.992%

that will no - d oubt reach fever pitch around next
being pro - European , many Conservatives are not , and
Wong . He wants greater leadership federally , especially
strength . Thousands of people gather at the final
other bl azes burning through the region , people were pictured
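
NOTE ON THE INTERVALS
The four INTERVAL blocks above appear to split the range from MIN (-4.231) to MAX (4.161) into equal-width bins, and their CONTAINS percentages add up to 100%. The sketch below only checks that reading against the printed numbers; the function name `equal_width_intervals` and the choice of four bins are assumptions made for illustration, not something the dashboard states.

```python
def equal_width_intervals(lo, hi, n_bins=4):
    """Split [lo, hi] into n_bins equal-width intervals, rounded to 3 decimals."""
    width = (hi - lo) / n_bins
    edges = [lo + i * width for i in range(n_bins + 1)]
    edges[-1] = hi  # pin the last edge to avoid floating-point drift
    return [(round(edges[i], 3), round(edges[i + 1], 3)) for i in range(n_bins)]


# MIN and MAX as printed in the dump (assumed to be the full activation range).
intervals = equal_width_intervals(-4.231, 4.161)

# CONTAINS percentages as printed, ordered from the lowest interval to the highest.
contains = [0.992, 48.763, 48.831, 1.414]
total = sum(contains)

for (start, end), pct in zip(intervals, contains):
    print(f"INTERVAL {start} - {end}  CONTAINS {pct}%")
```

Under this assumption each bin is about 2.098 wide, which reproduces the printed boundaries (-2.133, -0.035, 2.063). Roughly 97.6% of activations fall in the two middle bins.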
Token
Token ID198
Feature activation-2.57e-02

Pos logit contributions
 newcomers+7.474
 climbers+7.278
 passers+7.269
 youngsters+7.252
 commuters+7.100
Neg logit contributions
 violation-8.683
 coercion-7.840
 quotation-7.829
 ['-7.801
 memorandum-7.755
Token
Token ID198
Feature activation-6.85e-01

Pos logit contributions
 climbers+0.178
 newcomers+0.178
 passers+0.175
 youngsters+0.171
 commuters+0.170
Neg logit contributions
 violation-0.223
 ['-0.209
 quotation-0.200
 Declaration-0.199
 coercion-0.198
Token"
Token ID1
Feature activation+1.10e+00

Pos logit contributions
 newcomers+5.187
 passers+5.128
 climbers+5.080
 youngsters+5.071
 commuters+4.963
Neg logit contributions
 violation-5.203
 ['-4.713
 quotation-4.646
 memorandum-4.617
 Declaration-4.566
TokenThe
Token ID464
Feature activation+2.34e-01

Pos logit contributions
 violation+7.874
 ['+7.479
 Declaration+6.961
 quotation+6.891
 coercion+6.866
Neg logit contributions
 climbers-8.923
 newcomers-8.779
 passers-8.683
 commuters-8.521
 youngsters-8.411
Token Finnish
Token ID26838
Feature activation+1.06e+00

Pos logit contributions
 violation+1.700
 ['+1.594
 quotation+1.502
 Declaration+1.493
 coercion+1.479
Neg logit contributions
 newcomers-1.947
 climbers-1.944
 passers-1.913
 youngsters-1.896
 commuters-1.865
Token constitution
Token ID11784
Feature activation+4.16e+00

Pos logit contributions
 violation+8.412
 ['+7.852
 Declaration+7.508
 quotation+7.445
 coercion+7.396
Neg logit contributions
 climbers-7.917
 newcomers-7.864
 passers-7.736
 commuters-7.576
 youngsters-7.504
Token guarantees
Token ID19026
Feature activation+1.77e+00

Pos logit contributions
 violation+27.469
 ['+26.295
 paragraph+24.608
 memorandum+24.423
 forbids+24.283
Neg logit contributions
 passers-18.692
 climbers-18.407
 newcomers-18.224
heast-18.197
 onlook-17.647
Token everyone
Token ID2506
Feature activation+1.37e+00

Pos logit contributions
 violation+14.820
 ['+13.813
 coercion+13.176
 Declaration+13.074
 quotation+13.001
Neg logit contributions
 climbers-11.180
 newcomers-10.901
heast-10.756
 passers-10.740
 commuters-10.623
Token the
Token ID262
Feature activation+1.07e+00

Pos logit contributions
 violation+9.622
 ['+8.813
 Declaration+8.235
 coercion+8.215
 quotation+8.184
Neg logit contributions
 climbers-11.272
 newcomers-11.024
 passers-10.923
 commuters-10.712
heast-10.624
Token right
Token ID826
Feature activation+9.87e-01

Pos logit contributions
 violation+6.561
 ['+5.879
 Declaration+5.567
 quotation+5.496
 coercion+5.487
Neg logit contributions
 climbers-9.911
 newcomers-9.860
 passers-9.726
 commuters-9.562
 youngsters-9.469
Token to
Token ID284
Feature activation+1.35e+00

Pos logit contributions
 violation+7.519
 ['+6.962
 coercion+6.481
 memorandum+6.447
 quotation+6.436
Neg logit contributions
 climbers-7.525
 newcomers-7.421
 passers-7.334
 commuters-7.204
 onlook-7.041
Token States
Token ID1829
Feature activation+3.43e+00

Pos logit contributions
 Declaration+13.020
 violation+12.861
 ['+12.441
 Authorization+11.921
 paragraph+11.418
Neg logit contributions
 climbers-20.205
 passers-19.932
 newcomers-19.874
 commuters-19.441
heast-19.388
Token Constitution
Token ID7965
Feature activation+2.76e+00

Pos logit contributions
 Declaration+22.846
 violation+22.079
 ['+20.921
 memorandum+20.420
 Authorization+20.225
Neg logit contributions
 climbers-18.469
 passers-18.052
heastern-17.647
heast-17.636
zens-17.579
Token.
Token ID13
Feature activation+1.28e+00

Pos logit contributions
 violation+18.933
 ['+17.961
 Declaration+17.264
 paragraph+17.128
 quotation+16.998
Neg logit contributions
 climbers-17.375
 newcomers-17.153
 passers-17.086
 commuters-16.605
 youngsters-16.459
Token Article
Token ID10172
Feature activation+2.63e+00

Pos logit contributions
 violation+9.343
 ['+8.932
 Declaration+8.609
 Authorization+8.384
 quotation+8.346
Neg logit contributions
 climbers-10.210
 newcomers-10.046
 passers-9.993
 youngsters-9.719
 commuters-9.713
Token IV
Token ID8363
Feature activation+2.79e+00

Pos logit contributions
 violation+15.782
 ['+14.278
 quotation+13.960
 Declaration+13.842
 paragraph+13.659
Neg logit contributions
 climbers-19.459
 passers-19.328
 newcomers-18.989
 commuters-18.953
heastern-18.818
Token,
Token ID11
Feature activation+4.07e+00

Pos logit contributions
 violation+20.161
 paragraph+18.914
 ['+18.782
 Declaration+18.386
 memorandum+18.005
Neg logit contributions
 newcomers-15.407
 passers-15.367
zens-15.181
 youngsters-15.146
 climbers-15.029
Token Section
Token ID7275
Feature activation+2.49e+00

Pos logit contributions
 paragraph+25.860
 violation+25.691
 clause+24.684
 Clause+24.134
 Declaration+24.097
Neg logit contributions
 newcomers-19.690
zens-19.379
 passers-19.317
 commuters-19.217
 climbers-19.138
Token 1
Token ID352
Feature activation+2.08e+00

Pos logit contributions
 violation+13.876
 ['+12.929
 Declaration+12.482
 quotation+12.033
 paragraph+12.031
Neg logit contributions
 passers-19.795
 climbers-19.579
 newcomers-19.461
 commuters-19.198
 youngsters-19.027
Token reads
Token ID9743
Feature activation+6.96e-01

Pos logit contributions
 violation+14.687
 ['+13.784
 paragraph+13.448
 Declaration+13.107
 quotation+13.001
Neg logit contributions
 newcomers-14.505
 passers-14.357
 climbers-14.193
 youngsters-14.026
 commuters-13.782
Token,
Token ID11
Feature activation+2.29e+00

Pos logit contributions
 violation+5.234
 ['+4.937
 Declaration+4.641
 quotation+4.629
 coercion+4.573
Neg logit contributions
 climbers-5.502
 newcomers-5.486
 passers-5.379
 youngsters-5.295
 commuters-5.259
Token in
Token ID287
Feature activation+2.26e+00

Pos logit contributions
 violation+15.420
 ['+14.474
 Declaration+14.011
 paragraph+13.969
 quotation+13.855
Neg logit contributions
 newcomers-16.031
 climbers-15.847
 passers-15.571
 youngsters-15.275
 commuters-15.244
Token the
Token ID262
Feature activation+5.76e-01

Pos logit contributions
 violation+10.731
 ['+10.090
 quotation+9.618
 Declaration+9.607
 coercion+9.410
Neg logit contributions
 climbers-10.933
 newcomers-10.909
 passers-10.834
 youngsters-10.531
 commuters-10.452
Token unfounded
Token ID40530
Feature activation+1.69e+00

Pos logit contributions
 violation+4.281
 ['+3.978
 quotation+3.778
 Declaration+3.767
 coercion+3.720
Neg logit contributions
 climbers-4.674
 newcomers-4.662
 passers-4.596
 youngsters-4.509
 commuters-4.484
Token claim
Token ID1624
Feature activation+1.86e+00

Pos logit contributions
 violation+11.967
 ['+10.854
 quotation+10.707
 Declaration+10.665
 coercion+10.424
Neg logit contributions
 newcomers-12.747
 climbers-12.645
 passers-12.558
 youngsters-12.217
heast-12.091
Token that
Token ID326
Feature activation+1.78e+00

Pos logit contributions
 violation+12.168
 ['+11.287
 quotation+10.789
 Declaration+10.614
 coercion+10.428
Neg logit contributions
 newcomers-13.984
 climbers-13.851
 passers-13.667
 youngsters-13.288
 commuters-13.216
Token Hillary
Token ID5073
Feature activation+3.76e+00

Pos logit contributions
 violation+14.257
 ['+13.476
 quotation+12.883
 Declaration+12.780
 coercion+12.680
Neg logit contributions
 newcomers-10.943
 climbers-10.710
 passers-10.577
heast-10.359
 youngsters-10.328
Token Clinton
Token ID2605
Feature activation+4.06e+00

Pos logit contributions
 violation+24.903
 ['+23.254
 plagiar+22.885
 coercion+22.212
 DID+22.151
Neg logit contributions
 passers-17.943
 newcomers-17.921
 climbers-17.837
heast-17.116
zens-17.108
Token
Token ID447
Feature activation+2.61e-01

Pos logit contributions
 violation+26.569
 plagiar+25.725
 ['+25.474
 DID+24.312
 pretended+24.195
Neg logit contributions
 newcomers-17.872
heast-17.569
 climbers-17.548
 passers-17.264
orthern-17.068
Token
Token ID247
Feature activation-3.16e-02

Pos logit contributions
 violation+2.281
 ['+2.148
 quotation+2.053
 Declaration+2.044
 coercion+2.028
Neg logit contributions
 climbers-1.800
 newcomers-1.798
 passers-1.762
 youngsters-1.728
 commuters-1.714
Tokens
Token ID82
Feature activation+2.75e+00

Pos logit contributions
 newcomers+0.181
 climbers+0.181
 passers+0.177
 youngsters+0.173
 commuters+0.172
Neg logit contributions
 violation-0.308
 ['-0.295
 quotation-0.281
 Declaration-0.280
 coercion-0.279
Token aide
Token ID18727
Feature activation+3.50e+00

Pos logit contributions
 violation+22.383
 ['+20.248
 treason+19.448
 quotation+19.438
 Declaration+19.396
Neg logit contributions
 newcomers-14.244
 climbers-14.175
 passers-14.048
heastern-13.465
heast-13.403
Token,
Token ID11
Feature activation+3.30e+00

Pos logit contributions
 violation+20.819
 ['+18.906
 DID+17.882
 plagiar+17.686
 pretended+17.670
Neg logit contributions
 passers-22.323
 climbers-21.599
 newcomers-21.182
 onlook-20.577
ptoms-20.426
Token a
Token ID257
Feature activation+2.86e+00

Pos logit contributions
 violation+19.073
 quotation+17.519
 paragraph+16.914
 ['+16.798
 Declaration+16.674
Neg logit contributions
 climbers-16.045
 newcomers-16.031
 youngsters-15.747
 passers-15.714
heast-15.712
Token decision
Token ID2551
Feature activation+2.13e+00

Pos logit contributions
 violation+21.223
 quotation+19.598
 Declaration+18.978
 memorandum+18.852
 declaration+18.781
Neg logit contributions
 newcomers-17.102
 climbers-17.014
 youngsters-16.769
 passers-16.372
 commuters-16.265
Token or
Token ID393
Feature activation+2.50e+00

Pos logit contributions
 violation+15.295
 ['+14.547
 quotation+13.683
 memorandum+13.646
 paragraph+13.584
Neg logit contributions
 climbers-14.466
 newcomers-14.379
 passers-14.163
 youngsters-13.868
heast-13.611
Token issued
Token ID4884
Feature activation+3.29e+00

Pos logit contributions
 violation+19.203
 quotation+17.551
 memorandum+17.404
 ['+17.098
 paragraph+17.026
Neg logit contributions
 climbers-15.235
 newcomers-15.013
 youngsters-14.720
heast-14.661
 passers-14.628
Token a
Token ID257
Feature activation+3.43e+00

Pos logit contributions
 violation+22.140
 quotation+20.909
 memorandum+20.567
 paragraph+20.306
 Declaration+19.924
Neg logit contributions
 climbers-18.577
heastern-18.503
heast-18.274
 passers-17.910
 newcomers-17.881
Token written
Token ID3194
Feature activation+4.06e+00

Pos logit contributions
 violation+24.001
 memorandum+22.860
 quotation+22.684
 declaration+22.028
 Declaration+21.947
Neg logit contributions
 climbers-18.447
 youngsters-18.084
 newcomers-18.006
 passers-18.004
heastern-17.751
Token opinion
Token ID4459
Feature activation+2.77e+00

Pos logit contributions
 violation+26.924
 memorandum+25.566
 quotation+25.261
 Declaration+25.109
 declaration+25.049
Neg logit contributions
 climbers-19.433
heast-19.348
 youngsters-19.165
heastern-19.085
 newcomers-18.988
Token on
Token ID319
Feature activation+3.15e+00

Pos logit contributions
 violation+18.738
 ['+17.895
 quotation+17.131
 memorandum+17.010
 paragraph+16.994
Neg logit contributions
 climbers-17.313
 newcomers-17.087
 passers-16.903
 youngsters-16.753
zens-16.390
Token that
Token ID326
Feature activation+1.88e+00

Pos logit contributions
 violation+23.688
 paragraph+21.425
 quotation+21.361
 coercion+21.042
 Declaration+20.934
Neg logit contributions
 climbers-16.669
 newcomers-16.283
heastern-16.218
 passers-16.160
heast-16.075
Token issue
Token ID2071
Feature activation+1.24e+00

Pos logit contributions
 violation+14.001
 quotation+12.411
 ['+12.338
 Declaration+12.220
 paragraph+12.128
Neg logit contributions
 climbers-13.670
 newcomers-13.638
 passers-13.262
 commuters-13.193
 youngsters-13.132
Token.
Token ID13
Feature activation+8.77e-01

Pos logit contributions
 violation+10.901
 ['+10.336
 quotation+9.839
 memorandum+9.726
 paragraph+9.710
Neg logit contributions
 climbers-7.947
 newcomers-7.859
 passers-7.689
 youngsters-7.548
 commuters-7.434
Token the
Token ID262
Feature activation+3.36e-01

Pos logit contributions
 violation+3.124
 ['+2.888
 quotation+2.715
 Declaration+2.702
 coercion+2.677
Neg logit contributions
 climbers-4.091
 newcomers-4.079
 passers-4.018
 youngsters-3.957
 commuters-3.935
Token president
Token ID1893
Feature activation+2.46e+00

Pos logit contributions
 violation+2.487
 ['+2.318
 quotation+2.192
 Declaration+2.175
 coercion+2.166
Neg logit contributions
 climbers-2.746
 newcomers-2.745
 passers-2.695
 youngsters-2.657
 commuters-2.635
Token didn
Token ID1422
Feature activation+9.03e-01

Pos logit contributions
 violation+18.497
 ['+17.701
 Declaration+16.301
 quotation+16.218
 memorandum+16.214
Neg logit contributions
 climbers-15.133
 newcomers-15.050
 passers-15.007
 youngsters-14.355
 commuters-14.290
Token't
Token ID470
Feature activation+1.86e+00

Pos logit contributions
 violation+6.464
 ['+6.135
 Declaration+5.686
 quotation+5.672
 Authorization+5.612
Neg logit contributions
 climbers-7.447
 newcomers-7.353
 passers-7.279
 youngsters-7.107
 commuters-7.104
Token sign
Token ID1051
Feature activation+3.53e+00

Pos logit contributions
 violation+15.505
 ['+14.787
 memorandum+13.762
 coercion+13.603
 Declaration+13.581
Neg logit contributions
 newcomers-11.301
 climbers-11.212
 passers-11.153
 youngsters-10.822
 commuters-10.715
Token a
Token ID257
Feature activation+4.03e+00

Pos logit contributions
 ['+24.735
 violation+24.710
 Declaration+22.908
 paragraph+22.724
 Authorization+22.677
Neg logit contributions
 climbers-17.419
 passers-16.877
 onlook-16.390
 newcomers-16.361
 commuters-16.269
Token single
Token ID2060
Feature activation+2.55e+00

Pos logit contributions
 violation+27.992
 memorandum+26.361
 ['+26.250
 Declaration+25.708
 paragraph+25.173
Neg logit contributions
 passers-18.031
 climbers-17.929
 onlook-17.423
 newcomers-17.374
 commuters-17.140
Token bill
Token ID2855
Feature activation+3.06e+00

Pos logit contributions
 violation+19.136
 ['+17.490
 paragraph+16.999
 Declaration+16.986
 memorandum+16.943
Neg logit contributions
 climbers-15.890
 passers-15.622
 newcomers-15.449
 commuters-15.085
 youngsters-15.041
Token into
Token ID656
Feature activation+2.64e+00

Pos logit contributions
 violation+21.309
 ['+20.700
 paragraph+18.820
 memorandum+18.741
 Declaration+18.567
Neg logit contributions
 passers-17.180
 climbers-17.178
 newcomers-16.824
 youngsters-16.564
heast-16.422
Token law
Token ID1099
Feature activation+2.08e+00

Pos logit contributions
 violation+15.022
 ['+13.565
 paragraph+12.921
 quotation+12.461
 memorandum+12.454
Neg logit contributions
 climbers-20.855
 passers-20.318
 newcomers-20.223
heast-20.008
 youngsters-19.758
Token limiting
Token ID15637
Feature activation+1.87e+00

Pos logit contributions
 violation+15.435
 ['+14.822
 Declaration+13.500
 memorandum+13.477
 paragraph+13.427
Neg logit contributions
 climbers-14.136
 passers-13.983
 newcomers-13.910
 youngsters-13.452
 commuters-13.249
Token be
Token ID307
Feature activation+1.53e+00

Pos logit contributions
 violation+1.876
 ['+1.761
 quotation+1.675
 Declaration+1.673
 coercion+1.666
Neg logit contributions
 climbers-1.488
 newcomers-1.481
 passers-1.454
 youngsters-1.425
 commuters-1.417
Token required
Token ID2672
Feature activation+1.04e+00

Pos logit contributions
 violation+10.239
 ['+9.213
 coercion+8.861
 Declaration+8.696
 paragraph+8.526
Neg logit contributions
 climbers-12.685
 passers-12.544
 newcomers-12.520
 youngsters-12.243
 commuters-12.068
Token to
Token ID284
Feature activation+1.41e+00

Pos logit contributions
 violation+7.556
 ['+7.054
 Declaration+6.606
 coercion+6.563
 paragraph+6.513
Neg logit contributions
 climbers-8.157
 newcomers-7.926
 passers-7.805
 youngsters-7.772
 commuters-7.726
Token give
Token ID1577
Feature activation+2.13e+00

Pos logit contributions
 violation+11.730
 ['+11.027
 Declaration+10.446
 coercion+10.369
 paragraph+10.348
Neg logit contributions
 climbers-9.328
 newcomers-9.159
 passers-9.117
 youngsters-8.861
 commuters-8.851
Token effect
Token ID1245
Feature activation+1.60e+00

Pos logit contributions
 violation+16.705
 ['+14.981
 Declaration+14.941
 coercion+14.926
 paragraph+14.867
Neg logit contributions
heast-13.203
heastern-13.050
 climbers-13.035
 newcomers-12.521
orthern-12.367
Token to
Token ID284
Feature activation+4.02e+00

Pos logit contributions
 violation+10.788
 ['+10.166
 paragraph+9.606
 Declaration+9.475
 coercion+9.443
Neg logit contributions
 climbers-12.099
 newcomers-11.834
 passers-11.718
 youngsters-11.567
 commuters-11.440
Token any
Token ID597
Feature activation+3.22e+00

Pos logit contributions
 violation+26.163
 paragraph+25.781
 Declaration+24.372
 coercion+24.228
 ['+23.993
Neg logit contributions
 climbers-19.210
heast-19.069
heastern-18.575
 commuters-18.535
orthern-17.949
Token public
Token ID1171
Feature activation+3.11e+00

Pos logit contributions
 violation+23.900
 Declaration+22.451
 declaration+21.987
 paragraph+21.724
 coercion+21.583
Neg logit contributions
 climbers-16.252
heast-16.111
heastern-15.980
orthern-15.724
 commuters-15.718
Token act
Token ID719
Feature activation+3.44e+00

Pos logit contributions
 violation+22.179
 declaration+20.461
 coercion+20.235
 Declaration+20.031
 memorandum+20.010
Neg logit contributions
 climbers-17.756
heast-17.617
heastern-17.473
 commuters-17.139
 youngsters-17.126
Token,
Token ID11
Feature activation+3.74e+00

Pos logit contributions
 violation+22.600
 ['+21.497
 Declaration+19.626
 authorizing+19.354
 declaration+19.345
Neg logit contributions
 climbers-19.475
 newcomers-18.849
 youngsters-18.738
 passers-18.715
heast-18.622
Token record
Token ID1700
Feature activation+2.67e+00

Pos logit contributions
 violation+24.992
 memorandum+23.654
 Declaration+23.296
 declaration+23.200
 ['+22.131
Neg logit contributions
heast-20.891
zens-20.000
orthern-19.559
heastern-19.544
oppers-19.493
Token under
Token ID739
Feature activation+1.49e+00

Pos logit contributions
 newcomers+10.988
 climbers+10.826
 passers+10.714
 commuters+10.576
 youngsters+10.570
Neg logit contributions
 violation-7.809
 ['-7.454
 quotation-7.211
 Declaration-7.163
 Authorization-7.040
Token Title
Token ID11851
Feature activation+2.46e+00

Pos logit contributions
 violation+10.987
 ['+9.832
 coercion+9.502
 paragraph+9.385
 Declaration+9.371
Neg logit contributions
 climbers-11.614
 newcomers-11.454
 passers-11.306
 youngsters-11.126
 commuters-11.116
Token VI
Token ID13889
Feature activation+1.52e+00

Pos logit contributions
 violation+10.645
 ['+8.992
 Declaration+8.505
 Authorization+8.169
 quotation+8.143
Neg logit contributions
 passers-23.485
heastern-23.093
 climbers-22.827
 commuters-22.791
 newcomers-22.670
Token of
Token ID286
Feature activation+2.47e+00

Pos logit contributions
 violation+11.397
 ['+10.139
 coercion+9.554
 Authorization+9.486
 Declaration+9.461
Neg logit contributions
 climbers-11.414
 newcomers-11.088
 passers-11.016
 youngsters-10.931
 commuters-10.700
Token the
Token ID262
Feature activation+2.54e+00

Pos logit contributions
 violation+13.235
 ['+11.822
 paragraph+10.483
 coercion+10.438
 Authorization+10.273
Neg logit contributions
 climbers-21.272
 newcomers-20.901
 passers-20.732
 commuters-20.173
heastern-20.033
Token Civil
Token ID7511
Feature activation+4.00e+00

Pos logit contributions
 violation+15.396
 ['+14.091
 Declaration+13.921
 Authorization+13.314
 coercion+12.843
Neg logit contributions
 climbers-18.723
heastern-18.289
 newcomers-18.218
 passers-18.079
heast-18.043
Token Rights
Token ID6923
Feature activation+3.34e+00

Pos logit contributions
 violation+16.885
 ['+15.331
 Declaration+14.664
 Authorization+14.375
 Code+14.113
Neg logit contributions
 climbers-29.464
 newcomers-29.175
heast-28.764
 passers-28.724
 busiest-28.580
Token Act
Token ID2191
Feature activation+2.02e+00

Pos logit contributions
 violation+14.368
 Authorization+12.540
 Clause+11.849
 ['+11.773
 Declaration+11.634
Neg logit contributions
 commuters-28.172
 climbers-27.959
 passers-27.732
 youngsters-27.296
heast-27.290
Token of
Token ID286
Feature activation+3.23e+00

Pos logit contributions
 violation+14.139
 ['+12.995
 Declaration+12.035
 Authorization+11.915
 quotation+11.794
Neg logit contributions
 climbers-14.967
 passers-14.453
 newcomers-14.335
 youngsters-14.173
 commuters-14.008
Token 1964
Token ID17575
Feature activation+8.69e-01

Pos logit contributions
 violation+10.733
 ['+9.378
 1964+9.300
 1968+9.001
 1973+8.760
Neg logit contributions
 climbers-32.030
heast-31.405
heastern-30.628
 busiest-30.061
keyes-30.053
Token.
Token ID13
Feature activation-1.15e+00

Pos logit contributions
 violation+7.558
 ['+7.127
 Declaration+6.724
 quotation+6.692
 Authorization+6.683
Neg logit contributions
 climbers-5.866
 newcomers-5.764
 passers-5.720
 youngsters-5.588
 commuters-5.525
Tokend
Token ID67
Feature activation-3.56e-01

Pos logit contributions
 newcomers+3.065
 climbers+3.060
 passers+3.001
 youngsters+2.953
 commuters+2.886
Neg logit contributions
 violation-5.425
 ['-5.175
 quotation-5.001
 Declaration-4.966
 memorandum-4.931
Token tried
Token ID3088
Feature activation+1.09e+00

Pos logit contributions
 climbers+3.060
 newcomers+3.012
 passers+2.998
 youngsters+2.952
 commuters+2.917
Neg logit contributions
 violation-2.400
 ['-2.257
 quotation-2.153
 Declaration-2.134
 Authorization-2.087
Token to
Token ID284
Feature activation+5.41e-01

Pos logit contributions
 violation+8.436
 ['+7.728
 coercion+7.595
 quotation+7.366
 Declaration+7.282
Neg logit contributions
 newcomers-7.958
 climbers-7.936
 passers-7.745
 commuters-7.647
 youngsters-7.620
Token steal
Token ID8711
Feature activation+1.32e+00

Pos logit contributions
 violation+4.697
 ['+4.380
 coercion+4.185
 quotation+4.176
 Declaration+4.140
Neg logit contributions
 newcomers-3.703
 climbers-3.697
 passers-3.620
 youngsters-3.537
 commuters-3.532
Token a
Token ID257
Feature activation+1.70e+00

Pos logit contributions
 violation+10.869
 ['+9.964
 quotation+9.725
 coercion+9.675
 Declaration+9.602
Neg logit contributions
 newcomers-8.892
 climbers-8.770
 passers-8.618
 youngsters-8.451
heast-8.420
Token political
Token ID1964
Feature activation+3.88e+00

Pos logit contributions
 violation+14.523
 ['+13.177
 quotation+13.024
 Declaration+13.003
 memorandum+12.976
Neg logit contributions
 newcomers-10.666
 climbers-10.410
 passers-10.272
 youngsters-10.056
 commuters-9.975
Token banner
Token ID17625
Feature activation+1.87e+00

Pos logit contributions
 violation+28.355
 quotation+26.270
 memorandum+26.193
 confession+25.251
 coercion+25.195
Neg logit contributions
heast-17.787
heastern-17.422
ptoms-17.312
zens-16.900
orthern-16.693
Token while
Token ID981
Feature activation-2.41e-01

Pos logit contributions
 violation+14.356
 ['+13.186
 quotation+12.607
 Declaration+12.382
 paragraph+12.267
Neg logit contributions
 newcomers-12.821
 climbers-12.699
 passers-12.405
 youngsters-12.190
 commuters-12.096
Token visiting
Token ID10013
Feature activation+4.95e-01

Pos logit contributions
 climbers+1.978
 newcomers+1.964
 passers+1.948
 youngsters+1.914
 commuters+1.901
Neg logit contributions
 violation-1.773
 ['-1.663
 Declaration-1.576
 quotation-1.574
 Authorization-1.556
Token the
Token ID262
Feature activation-2.03e-01

Pos logit contributions
 violation+3.658
 ['+3.391
 quotation+3.217
 Declaration+3.197
 coercion+3.182
Neg logit contributions
 climbers-4.028
 newcomers-4.014
 passers-3.940
 youngsters-3.879
 commuters-3.859
Token country
Token ID1499
Feature activation+9.19e-01

Pos logit contributions
 climbers+1.582
 newcomers+1.581
 passers+1.544
 youngsters+1.528
 commuters+1.508
Neg logit contributions
 violation-1.571
 ['-1.488
 quotation-1.395
 Declaration-1.382
 coercion-1.377
Token to
Token ID284
Feature activation-7.30e-01

Pos logit contributions
 violation+16.146
 ['+14.451
 memorandum+14.186
 Declaration+13.887
 paragraph+13.816
Neg logit contributions
 climbers-14.924
 newcomers-14.833
 passers-14.194
 youngsters-14.065
 commuters-13.720
Token Prof
Token ID4415
Feature activation+5.51e-01

Pos logit contributions
 newcomers+6.488
 passers+6.457
 climbers+6.452
 commuters+6.429
 youngsters+6.383
Neg logit contributions
 violation-4.470
 ['-4.272
 quotation-3.983
 Authorization-3.936
 Declaration-3.935
Token V
Token ID569
Feature activation-2.61e-01

Pos logit contributions
 violation+3.865
 ['+3.581
 quotation+3.363
 Declaration+3.359
 coercion+3.315
Neg logit contributions
 climbers-4.690
 newcomers-4.660
 passers-4.591
 youngsters-4.508
 commuters-4.485
Tokenire
Token ID557
Feature activation-2.36e-01

Pos logit contributions
 newcomers+0.926
 climbers+0.920
 passers+0.903
 commuters+0.871
 youngsters+0.862
Neg logit contributions
 violation-3.087
 ['-2.954
 Declaration-2.882
 quotation-2.876
 coercion-2.859
Tokennder
Token ID681
Feature activation+1.56e+00

Pos logit contributions
 newcomers+1.260
 climbers+1.246
 passers+1.240
 youngsters+1.215
 commuters+1.201
Neg logit contributions
 violation-2.358
 ['-2.264
 Declaration-2.187
 quotation-2.176
 coercion-2.158
Token on
Token ID319
Feature activation+3.87e+00

Pos logit contributions
 violation+12.137
 ['+11.110
 Declaration+10.723
 quotation+10.645
 memorandum+10.580
Neg logit contributions
 climbers-11.053
 newcomers-11.030
 passers-10.808
 youngsters-10.533
 commuters-10.288
Token 23
Token ID2242
Feature activation+1.67e+00

Pos logit contributions
 violation+27.723
 paragraph+24.458
 ['+24.360
 Declaration+24.188
 coercion+24.023
Neg logit contributions
 climbers-17.627
 newcomers-17.564
keyes-16.775
heast-16.772
heastern-16.759
Token February
Token ID3945
Feature activation+7.24e-01

Pos logit contributions
 violation+12.346
 ['+11.326
 paragraph+10.763
 Declaration+10.760
 memorandum+10.682
Neg logit contributions
 climbers-11.875
 newcomers-11.860
 passers-11.567
 commuters-11.176
 youngsters-11.144
Token.
Token ID13
Feature activation+3.39e-02

Pos logit contributions
 violation+6.150
 ['+5.746
 quotation+5.484
 Declaration+5.478
 coercion+5.425
Neg logit contributions
 climbers-5.118
 newcomers-5.076
 passers-4.976
 youngsters-4.882
 commuters-4.827
Token
Token ID198
Feature activation-7.09e-02

Pos logit contributions
 violation+0.275
 ['+0.256
 quotation+0.245
 coercion+0.244
 memorandum+0.241
Neg logit contributions
 newcomers-0.252
 climbers-0.251
 passers-0.247
 youngsters-0.243
 commuters-0.242
Token
Token ID198
Feature activation-4.84e-02

Pos logit contributions
 climbers+0.485
 newcomers+0.484
 passers+0.474
 youngsters+0.466
 commuters+0.462
Neg logit contributions
 violation-0.618
 ['-0.579
 quotation-0.555
 Declaration-0.553
 coercion-0.551
Token,
Token ID11
Feature activation+3.74e+00

Pos logit contributions
 violation+22.600
 ['+21.497
 Declaration+19.626
 authorizing+19.354
 declaration+19.345
Neg logit contributions
 climbers-19.475
 newcomers-18.849
 youngsters-18.738
 passers-18.715
heast-18.622
Token record
Token ID1700
Feature activation+2.67e+00

Pos logit contributions
 violation+24.992
 memorandum+23.654
 Declaration+23.296
 declaration+23.200
 ['+22.131
Neg logit contributions
heast-20.891
zens-20.000
orthern-19.559
heastern-19.544
oppers-19.493
Token,
Token ID11
Feature activation+2.90e+00

Pos logit contributions
 violation+17.805
 ['+15.922
 memorandum+15.066
 Declaration+15.062
 paragraph+14.693
Neg logit contributions
 climbers-18.575
 newcomers-18.290
 passers-18.107
 youngsters-17.848
 commuters-17.624
Token or
Token ID393
Feature activation+2.08e+00

Pos logit contributions
 violation+21.613
 memorandum+20.593
 quotation+20.109
 Declaration+19.861
 paragraph+19.153
Neg logit contributions
heast-17.121
heastern-16.569
 climbers-16.467
 newcomers-16.456
 youngsters-16.435
Token judicial
Token ID13657
Feature activation+2.89e+00

Pos logit contributions
 violation+14.444
 memorandum+13.230
 Declaration+12.839
 quotation+12.809
 ['+12.716
Neg logit contributions
 climbers-15.486
 newcomers-15.439
heast-15.426
 youngsters-15.390
 passers-15.152
Token proceeding
Token ID18788
Feature activation+3.84e+00

Pos logit contributions
 violation+19.205
 memorandum+17.857
 declaration+17.694
 Declaration+17.363
 quotation+17.266
Neg logit contributions
zens-19.003
oppers-18.940
heast-18.888
 youngsters-18.849
 passers-18.743
Token of
Token ID286
Feature activation+2.95e+00

Pos logit contributions
 violation+24.814
 ['+23.294
 memorandum+21.217
 authorizing+21.177
 declaration+21.104
Neg logit contributions
heast-20.086
oppers-19.819
 climbers-19.818
zens-19.429
heastern-19.392
Token any
Token ID597
Feature activation+1.72e+00

Pos logit contributions
 violation+22.980
 coercion+19.564
 paragraph+19.245
 treason+19.185
 Declaration+19.144
Neg logit contributions
 climbers-16.688
heast-16.511
 newcomers-16.260
heastern-16.027
 commuters-15.907
Token other
Token ID584
Feature activation+1.80e+00

Pos logit contributions
 violation+11.371
 ['+9.645
 Declaration+9.602
 memorandum+9.323
 coercion+9.322
Neg logit contributions
 climbers-14.328
 newcomers-14.113
 youngsters-13.841
 passers-13.811
heast-13.694
Token State
Token ID1812
Feature activation+1.95e+00

Pos logit contributions
 violation+12.676
 ['+11.158
 Declaration+10.857
 coercion+10.631
 paragraph+10.564
Neg logit contributions
 climbers-14.077
 newcomers-14.009
 passers-13.844
 youngsters-13.748
 commuters-13.477
Token,
Token ID11
Feature activation+1.82e+00

Pos logit contributions
 violation+14.302
 ['+13.181
 Declaration+12.313
 coercion+12.305
 memorandum+12.287
Neg logit contributions
 climbers-13.799
 newcomers-13.755
 passers-13.627
 youngsters-13.379
 commuters-13.254
Token as
Token ID355
Feature activation+1.48e+00

Pos logit contributions
 violation+0.602
 ['+0.559
 quotation+0.527
 Declaration+0.526
 coercion+0.521
Neg logit contributions
 climbers-0.709
 newcomers-0.708
 passers-0.696
 youngsters-0.685
 commuters-0.681
Token the
Token ID262
Feature activation+5.76e-01

Pos logit contributions
 violation+10.731
 ['+10.090
 quotation+9.618
 Declaration+9.607
 coercion+9.410
Neg logit contributions
 climbers-10.933
 newcomers-10.909
 passers-10.834
 youngsters-10.531
 commuters-10.452
Token unfounded
Token ID40530
Feature activation+1.69e+00

Pos logit contributions
 violation+4.281
 ['+3.978
 quotation+3.778
 Declaration+3.767
 coercion+3.720
Neg logit contributions
 climbers-4.674
 newcomers-4.662
 passers-4.596
 youngsters-4.509
 commuters-4.484
Token claim
Token ID1624
Feature activation+1.86e+00

Pos logit contributions
 violation+11.967
 ['+10.854
 quotation+10.707
 Declaration+10.665
 coercion+10.424
Neg logit contributions
 newcomers-12.747
 climbers-12.645
 passers-12.558
 youngsters-12.217
heast-12.091
Token that
Token ID326
Feature activation+1.78e+00

Pos logit contributions
 violation+12.168
 ['+11.287
 quotation+10.789
 Declaration+10.614
 coercion+10.428
Neg logit contributions
 newcomers-13.984
 climbers-13.851
 passers-13.667
 youngsters-13.288
 commuters-13.216
Token Hillary
Token ID5073
Feature activation+3.76e+00

Pos logit contributions
 violation+14.257
 ['+13.476
 quotation+12.883
 Declaration+12.780
 coercion+12.680
Neg logit contributions
 newcomers-10.943
 climbers-10.710
 passers-10.577
heast-10.359
 youngsters-10.328
Token Clinton
Token ID2605
Feature activation+4.06e+00

Pos logit contributions
 violation+24.903
 ['+23.254
 plagiar+22.885
 coercion+22.212
 DID+22.151
Neg logit contributions
 passers-17.943
 newcomers-17.921
 climbers-17.837
heast-17.116
zens-17.108
Token
Token ID447
Feature activation+2.61e-01

Pos logit contributions
 violation+26.569
 plagiar+25.725
 ['+25.474
 DID+24.312
 pretended+24.195
Neg logit contributions
 newcomers-17.872
heast-17.569
 climbers-17.548
 passers-17.264
orthern-17.068
Token
Token ID247
Feature activation-3.16e-02

Pos logit contributions
 violation+2.281
 ['+2.148
 quotation+2.053
 Declaration+2.044
 coercion+2.028
Neg logit contributions
 climbers-1.800
 newcomers-1.798
 passers-1.762
 youngsters-1.728
 commuters-1.714
Tokens
Token ID82
Feature activation+2.75e+00

Pos logit contributions
 newcomers+0.181
 climbers+0.181
 passers+0.177
 youngsters+0.173
 commuters+0.172
Neg logit contributions
 violation-0.308
 ['-0.295
 quotation-0.281
 Declaration-0.280
 coercion-0.279
Token aide
Token ID18727
Feature activation+3.50e+00

Pos logit contributions
 violation+22.383
 ['+20.248
 treason+19.448
 quotation+19.438
 Declaration+19.396
Neg logit contributions
 newcomers-14.244
 climbers-14.175
 passers-14.048
heastern-13.465
heast-13.403
Token,
Token ID11
Feature activation+1.47e+00

Pos logit contributions
 violation+1.382
 ['+1.304
 quotation+1.247
 Declaration+1.243
 coercion+1.235
Neg logit contributions
 climbers-1.000
 newcomers-0.996
 passers-0.976
 youngsters-0.956
 commuters-0.949
Token who
Token ID508
Feature activation+3.01e-01

Pos logit contributions
 violation+10.940
 ['+10.095
 quotation+9.600
 coercion+9.566
 Declaration+9.456
Neg logit contributions
 climbers-11.133
 newcomers-11.060
 passers-10.786
 youngsters-10.695
 commuters-10.539
Token had
Token ID550
Feature activation+4.58e-01

Pos logit contributions
 violation+1.827
 ['+1.674
 quotation+1.563
 Declaration+1.555
 coercion+1.537
Neg logit contributions
 climbers-2.873
 newcomers-2.864
 passers-2.826
 youngsters-2.785
 commuters-2.773
Token filed
Token ID5717
Feature activation+1.89e+00

Pos logit contributions
 violation+3.344
 ['+3.099
 quotation+2.923
 Declaration+2.902
 coercion+2.901
Neg logit contributions
 climbers-3.796
 newcomers-3.781
 passers-3.724
 youngsters-3.661
 commuters-3.636
Token a
Token ID257
Feature activation+2.78e+00

Pos logit contributions
 violation+13.468
 ['+12.106
 coercion+11.531
 memorandum+11.372
 quotation+11.308
Neg logit contributions
 climbers-14.243
 newcomers-14.155
 passers-13.866
 youngsters-13.704
 commuters-13.541
Token discrimination
Token ID8839
Feature activation+3.76e+00

Pos logit contributions
 violation+21.116
 memorandum+18.464
 ['+18.206
 Declaration+17.934
 coercion+17.833
Neg logit contributions
 climbers-17.093
 passers-16.541
 newcomers-16.524
 youngsters-16.167
 commuters-15.990
Token complaint
Token ID8224
Feature activation+1.98e+00

Pos logit contributions
 violation+25.392
 memorandum+21.615
 declaration+20.595
 coercion+20.380
 Declaration+20.174
Neg logit contributions
 climbers-22.164
 newcomers-21.683
 passers-21.579
zens-21.300
 youngsters-21.223
Token against
Token ID1028
Feature activation+1.38e-01

Pos logit contributions
 violation+15.064
 ['+13.713
 coercion+12.988
 memorandum+12.835
 paragraph+12.817
Neg logit contributions
 climbers-13.599
 newcomers-13.290
 passers-13.065
 youngsters-12.804
 commuters-12.528
Token the
Token ID262
Feature activation-1.00e+00

Pos logit contributions
 violation+0.949
 ['+0.879
 Declaration+0.827
 quotation+0.826
 coercion+0.816
Neg logit contributions
 climbers-1.212
 newcomers-1.209
 passers-1.191
 youngsters-1.170
 commuters-1.163
Token club
Token ID3430
Feature activation+6.34e-01

Pos logit contributions
 newcomers+7.255
 climbers+7.152
 passers+7.025
 youngsters+6.947
 commuters+6.744
Neg logit contributions
 violation-7.497
 ['-7.201
 Declaration-6.865
 quotation-6.806
 coercion-6.795
Token on
Token ID319
Feature activation+2.70e+00

Pos logit contributions
 violation+5.621
 ['+5.270
 quotation+5.029
 Declaration+5.007
 coercion+4.990
Neg logit contributions
 climbers-4.257
 newcomers-4.229
 passers-4.156
 youngsters-4.068
 commuters-4.036
Token effect
Token ID1245
Feature activation+1.60e+00

Pos logit contributions
 violation+16.705
 ['+14.981
 Declaration+14.941
 coercion+14.926
 paragraph+14.867
Neg logit contributions
heast-13.203
heastern-13.050
 climbers-13.035
 newcomers-12.521
orthern-12.367
Token to
Token ID284
Feature activation+4.02e+00

Pos logit contributions
 violation+10.788
 ['+10.166
 paragraph+9.606
 Declaration+9.475
 coercion+9.443
Neg logit contributions
 climbers-12.099
 newcomers-11.834
 passers-11.718
 youngsters-11.567
 commuters-11.440
Token any
Token ID597
Feature activation+3.22e+00

Pos logit contributions
 violation+26.163
 paragraph+25.781
 Declaration+24.372
 coercion+24.228
 ['+23.993
Neg logit contributions
 climbers-19.210
heast-19.069
heastern-18.575
 commuters-18.535
orthern-17.949
Token public
Token ID1171
Feature activation+3.11e+00

Pos logit contributions
 violation+23.900
 Declaration+22.451
 declaration+21.987
 paragraph+21.724
 coercion+21.583
Neg logit contributions
 climbers-16.252
heast-16.111
heastern-15.980
orthern-15.724
 commuters-15.718
Token act
Token ID719
Feature activation+3.44e+00

Pos logit contributions
 violation+22.179
 declaration+20.461
 coercion+20.235
 Declaration+20.031
 memorandum+20.010
Neg logit contributions
 climbers-17.756
heast-17.617
heastern-17.473
 commuters-17.139
 youngsters-17.126
Token,
Token ID11
Feature activation+3.74e+00

Pos logit contributions
 violation+22.600
 ['+21.497
 Declaration+19.626
 authorizing+19.354
 declaration+19.345
Neg logit contributions
 climbers-19.475
 newcomers-18.849
 youngsters-18.738
 passers-18.715
heast-18.622
Token record
Token ID1700
Feature activation+2.67e+00

Pos logit contributions
 violation+24.992
 memorandum+23.654
 Declaration+23.296
 declaration+23.200
 ['+22.131
Neg logit contributions
heast-20.891
zens-20.000
orthern-19.559
heastern-19.544
oppers-19.493
Token,
Token ID11
Feature activation+2.90e+00

Pos logit contributions
 violation+17.805
 ['+15.922
 memorandum+15.066
 Declaration+15.062
 paragraph+14.693
Neg logit contributions
 climbers-18.575
 newcomers-18.290
 passers-18.107
 youngsters-17.848
 commuters-17.624
Token or
Token ID393
Feature activation+2.08e+00

Pos logit contributions
 violation+21.613
 memorandum+20.593
 quotation+20.109
 Declaration+19.861
 paragraph+19.153
Neg logit contributions
heast-17.121
heastern-16.569
 climbers-16.467
 newcomers-16.456
 youngsters-16.435
Token judicial
Token ID13657
Feature activation+2.89e+00

Pos logit contributions
 violation+14.444
 memorandum+13.230
 Declaration+12.839
 quotation+12.809
 ['+12.716
Neg logit contributions
 climbers-15.486
 newcomers-15.439
heast-15.426
 youngsters-15.390
 passers-15.152
Token proceeding
Token ID18788
Feature activation+3.84e+00

Pos logit contributions
 violation+19.205
 memorandum+17.857
 declaration+17.694
 Declaration+17.363
 quotation+17.266
Neg logit contributions
zens-19.003
oppers-18.940
heast-18.888
 youngsters-18.849
 passers-18.743
Token did
Token ID750
Feature activation+1.38e-01

Pos logit contributions
 violation+3.687
 ['+3.377
 quotation+3.142
 Declaration+3.125
 coercion+3.090
Neg logit contributions
 climbers-5.959
 newcomers-5.945
 passers-5.869
 youngsters-5.782
 commuters-5.750
Token not
Token ID407
Feature activation+1.31e-02

Pos logit contributions
 violation+0.999
 ['+0.931
 quotation+0.879
 Declaration+0.876
 coercion+0.869
Neg logit contributions
 climbers-1.148
 newcomers-1.143
 passers-1.125
 youngsters-1.106
 commuters-1.101
Token issue
Token ID2071
Feature activation+2.86e+00

Pos logit contributions
 violation+0.107
 ['+0.101
 quotation+0.096
 Declaration+0.095
 coercion+0.095
Neg logit contributions
 climbers-0.096
 newcomers-0.096
 passers-0.094
 youngsters-0.092
 commuters-0.092
Token the
Token ID262
Feature activation+1.97e+00

Pos logit contributions
 violation+20.423
 ['+18.957
 quotation+18.852
 Declaration+18.712
 paragraph+18.569
Neg logit contributions
 climbers-15.880
 newcomers-15.223
heastern-15.221
heast-15.178
 passers-15.175
Token total
Token ID2472
Feature activation+1.47e+00

Pos logit contributions
 violation+15.488
 Declaration+14.287
 ['+14.174
 memorandum+14.102
 quotation+13.996
Neg logit contributions
 climbers-12.367
 newcomers-12.336
 passers-12.278
 commuters-11.816
 youngsters-11.808
Token Loving
Token ID39202
Feature activation+3.69e+00

Pos logit contributions
 violation+11.562
 ['+10.449
 quotation+10.207
 Declaration+10.197
 memorandum+10.134
Neg logit contributions
 climbers-10.736
 newcomers-10.544
 passers-10.524
 youngsters-10.368
 commuters-10.210
Token v
Token ID410
Feature activation+1.47e+00

Pos logit contributions
 violation+27.369
 Declaration+25.041
 quotation+24.990
 memorandum+24.984
 ['+24.020
Neg logit contributions
heastern-16.994
heast-16.463
 climbers-16.268
zens-16.192
 passers-15.879
Token.
Token ID13
Feature activation+1.86e+00

Pos logit contributions
 violation+12.083
 ['+11.184
 Declaration+11.056
 quotation+10.681
 coercion+10.549
Neg logit contributions
 climbers-9.807
 newcomers-9.489
 passers-9.417
 youngsters-9.136
 onlook-8.927
Token Virginia
Token ID6025
Feature activation+3.56e+00

Pos logit contributions
 violation+10.141
 Declaration+9.387
 ['+8.871
 paragraph+8.278
 quotation+8.121
Neg logit contributions
 climbers-17.104
 newcomers-16.574
 youngsters-16.377
heastern-16.176
 passers-16.072
Token victory
Token ID5373
Feature activation+1.54e+00

Pos logit contributions
 violation+25.293
 Declaration+23.306
 memorandum+22.824
 ['+22.638
 declaration+22.619
Neg logit contributions
heastern-17.611
 climbers-17.402
heast-17.296
zens-17.131
orthern-16.737
Token that
Token ID326
Feature activation+9.14e-01

Pos logit contributions
 violation+12.412
 ['+11.470
 quotation+11.327
 memorandum+11.285
 Declaration+11.206
Neg logit contributions
 newcomers-10.543
 climbers-10.516
 passers-10.256
 youngsters-10.186
 commuters-9.943
Token hold
Token ID1745
Feature activation+1.07e+00

Pos logit contributions
 violation+0.014
 ['+0.014
 Declaration+0.013
 quotation+0.013
 coercion+0.013
Neg logit contributions
 newcomers-0.009
 climbers-0.009
 passers-0.009
 youngsters-0.009
 commuters-0.009
Token that
Token ID326
Feature activation+1.49e+00

Pos logit contributions
 violation+7.429
 ['+6.853
 quotation+6.588
 Declaration+6.463
 memorandum+6.378
Neg logit contributions
 climbers-8.925
 newcomers-8.836
 passers-8.725
 youngsters-8.618
 commuters-8.493
Token the
Token ID262
Feature activation+1.05e+00

Pos logit contributions
 violation+12.036
 ['+10.897
 quotation+10.739
 paragraph+10.593
 Declaration+10.559
Neg logit contributions
 climbers-10.367
 newcomers-10.219
 passers-10.037
 youngsters-9.900
 commuters-9.714
Token Four
Token ID6675
Feature activation+2.02e+00

Pos logit contributions
 violation+7.747
 ['+7.022
 quotation+6.856
 Declaration+6.851
 coercion+6.707
Neg logit contributions
 climbers-8.396
 newcomers-8.324
 passers-8.176
 youngsters-8.086
 commuters-8.002
Tokenteenth
Token ID20283
Feature activation+3.41e+00

Pos logit contributions
 violation+14.241
 ['+13.516
 Declaration+13.469
 paragraph+13.022
 quotation+12.829
Neg logit contributions
 climbers-14.263
 newcomers-13.820
 passers-13.638
 commuters-13.616
 youngsters-13.177
Token Amendment
Token ID8441
Feature activation+3.68e+00

Pos logit contributions
 violation+19.606
 Declaration+18.121
 paragraph+17.858
 Clause+17.198
 ['+17.139
Neg logit contributions
 climbers-22.030
 youngsters-21.771
 newcomers-21.584
 passers-21.514
heastern-21.375
Token's
Token ID338
Feature activation+2.04e+00

Pos logit contributions
 violation+28.729
 ['+25.221
 quotation+25.061
 paragraph+24.884
 coercion+24.716
Neg logit contributions
zens-15.937
heast-15.727
orthern-15.470
heastern-15.394
 youngsters-15.314
Token equal
Token ID4961
Feature activation+1.95e+00

Pos logit contributions
 violation+16.110
 ['+14.151
 quotation+14.130
 Declaration+14.045
 coercion+13.972
Neg logit contributions
 climbers-13.363
 newcomers-13.230
 passers-13.019
 youngsters-12.939
heastern-12.843
Token protection
Token ID4800
Feature activation+3.40e+00

Pos logit contributions
 violation+11.293
 coercion+9.168
 quotation+9.119
 ['+8.929
 paragraph+8.823
Neg logit contributions
 climbers-17.392
heast-17.329
 newcomers-17.150
 passers-17.143
heastern-16.971
Token clause
Token ID13444
Feature activation+3.29e+00

Pos logit contributions
 violation+25.482
 clause+23.178
 quotation+22.049
 paragraph+21.986
 coercion+21.765
Neg logit contributions
heast-17.435
heastern-17.335
 newcomers-16.994
 youngsters-16.767
zens-16.751
Token granted
Token ID7520
Feature activation+4.49e-01

Pos logit contributions
 violation+24.189
 ['+21.708
 paragraph+21.137
 quotation+20.756
 memorandum+20.513
Neg logit contributions
 newcomers-16.928
 passers-16.868
 climbers-16.746
zens-16.633
 youngsters-16.595
Token US
Token ID1294
Feature activation+1.79e+00

Pos logit contributions
 violation+9.851
 ['+9.147
 Declaration+8.813
 quotation+8.765
 memorandum+8.603
Neg logit contributions
 climbers-9.460
 newcomers-9.335
 passers-9.282
 youngsters-9.085
 commuters-9.024
Token election
Token ID3071
Feature activation+2.75e+00

Pos logit contributions
 violation+12.347
 ['+11.292
 Declaration+10.953
 quotation+10.697
 memorandum+10.623
Neg logit contributions
 climbers-13.587
 passers-13.392
 newcomers-13.375
 youngsters-12.970
 commuters-12.925
Token]
Token ID60
Feature activation+1.18e+00

Pos logit contributions
 violation+19.071
 ['+17.319
 quotation+16.741
 memorandum+16.296
 coercion+16.293
Neg logit contributions
 climbers-17.181
 newcomers-16.669
 passers-16.651
 commuters-16.213
 youngsters-16.143
Token and
Token ID290
Feature activation+1.35e+00

Pos logit contributions
 violation+8.040
 ['+7.442
 Declaration+6.978
 quotation+6.927
 coercion+6.876
Neg logit contributions
 climbers-9.889
 newcomers-9.790
 passers-9.668
 youngsters-9.494
 commuters-9.392
Token think
Token ID892
Feature activation+5.58e-01

Pos logit contributions
 violation+10.614
 ['+9.849
 Declaration+9.320
 quotation+9.295
 coercion+9.258
Neg logit contributions
 climbers-9.912
 newcomers-9.767
 passers-9.623
 youngsters-9.506
 commuters-9.428
Token Trump
Token ID1301
Feature activation+3.67e+00

Pos logit contributions
 violation+3.558
 ['+3.273
 quotation+3.064
 Declaration+3.050
 coercion+3.020
Neg logit contributions
 climbers-5.155
 newcomers-5.142
 passers-5.068
 youngsters-4.994
 commuters-4.967
Token is
Token ID318
Feature activation+2.05e+00

Pos logit contributions
 violation+27.576
 ['+25.579
 quotation+24.236
 coercion+24.132
 Declaration+23.860
Neg logit contributions
 climbers-15.649
 newcomers-15.534
 passers-14.970
 youngsters-14.759
heast-14.713
Token in
Token ID287
Feature activation+2.35e+00

Pos logit contributions
 violation+16.223
 ['+15.122
 treason+14.282
 coercion+14.067
 quotation+14.041
Neg logit contributions
 climbers-13.075
 newcomers-12.920
 passers-12.840
 youngsters-12.432
 commuters-12.373
Token with
Token ID351
Feature activation+1.97e+00

Pos logit contributions
 violation+19.800
 ['+17.183
 quotation+16.622
 treason+16.299
 coercion+16.204
Neg logit contributions
 climbers-14.254
 newcomers-14.207
 passers-14.120
 youngsters-13.889
 commuters-13.536
Token all
Token ID477
Feature activation+7.75e-01

Pos logit contributions
 violation+15.524
 ['+14.745
 quotation+13.884
 coercion+13.882
 treason+13.774
Neg logit contributions
 climbers-12.476
 newcomers-12.378
 passers-12.251
 youngsters-11.929
 commuters-11.850
Token the
Token ID262
Feature activation+1.24e-01

Pos logit contributions
 violation+5.797
 ['+5.412
 quotation+5.099
 Declaration+5.072
 coercion+5.026
Neg logit contributions
 climbers-6.150
 newcomers-6.105
 passers-6.032
 youngsters-5.920
 commuters-5.888
Token pussy
Token ID18185
Feature activation+9.11e-01

Pos logit contributions
 violation+10.603
 ['+10.035
 quotation+9.430
 Declaration+9.426
 coercion+9.316
Neg logit contributions
 newcomers-8.212
 climbers-8.168
 passers-8.009
 youngsters-7.752
 commuters-7.739
Token....
Token ID1106
Feature activation+1.24e+00

Pos logit contributions
 violation+7.791
 ['+7.357
 Declaration+6.930
 quotation+6.920
 coercion+6.864
Neg logit contributions
 newcomers-6.231
 climbers-6.223
 passers-6.079
 youngsters-5.925
 commuters-5.899
Token I
Token ID314
Feature activation+2.74e+00

Pos logit contributions
 violation+8.919
 ['+8.347
 Declaration+7.805
 quotation+7.787
 coercion+7.711
Neg logit contributions
 climbers-9.847
 newcomers-9.800
 passers-9.623
 youngsters-9.448
 commuters-9.369
Token have
Token ID423
Feature activation+1.62e+00

Pos logit contributions
 violation+19.673
 ['+18.539
 DID+17.302
 pretended+17.297
 coercion+17.135
Neg logit contributions
 newcomers-16.228
 passers-15.868
 climbers-15.569
 youngsters-15.079
 commuters-15.059
Token never
Token ID1239
Feature activation+2.55e+00

Pos logit contributions
 violation+12.713
 ['+11.807
 quotation+11.091
 coercion+11.048
 Declaration+10.841
Neg logit contributions
 newcomers-11.173
 climbers-11.033
 passers-10.840
 commuters-10.545
 youngsters-10.535
Token said
Token ID531
Feature activation+3.67e+00

Pos logit contributions
 violation+18.223
 ['+17.227
 pretended+15.896
 coercion+15.674
 plagiar+15.577
Neg logit contributions
 newcomers-16.092
 passers-15.644
 climbers-15.525
 commuters-14.963
 youngsters-14.933
Token that
Token ID326
Feature activation+2.65e+00

Pos logit contributions
 violation+25.409
 ['+24.129
 coercion+22.803
 quotation+22.097
 treason+21.786
Neg logit contributions
heast-17.988
 newcomers-17.955
 climbers-17.539
heastern-17.122
orthern-16.958
Token in
Token ID287
Feature activation+3.24e+00

Pos logit contributions
 violation+19.678
 ['+18.395
 quotation+17.181
 coercion+17.164
 paragraph+16.942
Neg logit contributions
 newcomers-16.038
 climbers-15.926
 passers-15.262
heast-14.968
 youngsters-14.839
Token my
Token ID616
Feature activation+2.32e+00

Pos logit contributions
 violation+24.819
 ['+22.329
 quotation+22.017
 paragraph+21.330
 Declaration+21.148
Neg logit contributions
 climbers-16.384
 newcomers-16.352
heastern-15.738
heast-15.649
 commuters-15.324
Token life
Token ID1204
Feature activation+1.96e+00

Pos logit contributions
 violation+18.153
 ['+16.836
 Declaration+16.272
 memorandum+16.057
 paragraph+16.010
Neg logit contributions
 newcomers-14.269
 climbers-13.837
 passers-13.525
 youngsters-13.049
 commuters-12.923
Token to
Token ID284
Feature activation+1.58e+00

Pos logit contributions
 violation+14.927
 ['+14.258
 quotation+13.135
 paragraph+13.069
 Declaration+12.990
Neg logit contributions
 newcomers-12.909
 climbers-12.908
 passers-12.570
 youngsters-12.039
 commuters-11.921
Token 2015
Token ID1853
Feature activation+1.62e+00

Pos logit contributions
 violation+22.409
 ['+20.637
 paragraph+19.926
 coercion+19.783
 Declaration+19.697
Neg logit contributions
 climbers-13.726
 newcomers-13.606
 passers-13.084
heast-13.031
 youngsters-12.832
Token is
Token ID318
Feature activation+1.94e+00

Pos logit contributions
 violation+11.492
 ['+10.698
 quotation+9.905
 paragraph+9.847
 memorandum+9.765
Neg logit contributions
 climbers-12.638
 newcomers-12.613
 passers-12.379
 youngsters-12.194
 commuters-12.033
Token "
Token ID366
Feature activation+1.84e+00

Pos logit contributions
 violation+15.044
 ['+14.110
 quotation+13.208
 paragraph+12.995
 coercion+12.986
Neg logit contributions
 climbers-13.039
 newcomers-12.908
 passers-12.714
 youngsters-12.338
 commuters-12.250
Tokenin
Token ID259
Feature activation+2.48e+00

Pos logit contributions
 violation+14.774
 ['+13.839
 quotation+13.126
 Declaration+13.030
 coercion+13.023
Neg logit contributions
 climbers-11.873
 newcomers-11.774
 passers-11.558
 youngsters-11.327
 commuters-11.183
Token violation
Token ID8747
Feature activation+1.86e+00

Pos logit contributions
 violation+20.056
 ['+17.438
 quotation+17.051
 paragraph+16.525
 coercion+16.247
Neg logit contributions
 newcomers-15.303
 climbers-15.285
 passers-15.115
 youngsters-14.978
 commuters-14.550
Token of
Token ID286
Feature activation+3.66e+00

Pos logit contributions
 violation+11.905
 ['+10.719
 paragraph+9.962
 Declaration+9.927
 memorandum+9.908
Neg logit contributions
 climbers-14.946
 newcomers-14.831
 passers-14.602
 commuters-14.503
 youngsters-14.284
Token Pakistan
Token ID7648
Feature activation+2.14e+00

Pos logit contributions
 violation+27.918
 ['+25.808
 paragraph+25.290
 Declaration+25.260
 memorandum+24.770
Neg logit contributions
 climbers-15.856
 passers-15.530
 newcomers-15.465
 commuters-15.078
heastern-14.921
Token law
Token ID1099
Feature activation+2.40e+00

Pos logit contributions
 violation+16.559
 ['+15.374
 Declaration+15.320
 memorandum+14.705
 Authorization+14.605
Neg logit contributions
 climbers-13.828
 passers-13.702
 newcomers-13.687
 youngsters-13.139
 commuters-13.129
Token."
Token ID526
Feature activation+9.14e-01

Pos logit contributions
 violation+17.986
 ['+16.483
 memorandum+15.325
 Declaration+15.187
 paragraph+15.124
Neg logit contributions
 newcomers-15.802
 climbers-15.800
 passers-15.545
 youngsters-14.958
 onlook-14.825
Token It
Token ID632
Feature activation+1.56e+00

Pos logit contributions
 violation+6.951
 ['+6.516
 Declaration+6.165
 quotation+6.137
 Authorization+6.062
Neg logit contributions
 climbers-7.220
 newcomers-7.167
 passers-7.066
 youngsters-6.925
 commuters-6.890
Token seems
Token ID2331
Feature activation-1.20e-02

Pos logit contributions
 violation+10.511
 ['+9.677
 quotation+9.024
 paragraph+8.872
 memorandum+8.792
Neg logit contributions
 newcomers-12.807
 climbers-12.745
 passers-12.506
 youngsters-12.311
 commuters-12.197
Tokens
Token ID82
Feature activation+2.75e+00

Pos logit contributions
 newcomers+0.181
 climbers+0.181
 passers+0.177
 youngsters+0.173
 commuters+0.172
Neg logit contributions
 violation-0.308
 ['-0.295
 quotation-0.281
 Declaration-0.280
 coercion-0.279
Token aide
Token ID18727
Feature activation+3.50e+00

Pos logit contributions
 violation+22.383
 ['+20.248
 treason+19.448
 quotation+19.438
 Declaration+19.396
Neg logit contributions
 newcomers-14.244
 climbers-14.175
 passers-14.048
heastern-13.465
heast-13.403
Token,
Token ID11
Feature activation+3.30e+00

Pos logit contributions
 violation+20.819
 ['+18.906
 DID+17.882
 plagiar+17.686
 pretended+17.670
Neg logit contributions
 passers-22.323
 climbers-21.599
 newcomers-21.182
 onlook-20.577
ptoms-20.426
Token Huma
Token ID49387
Feature activation+1.59e+00

Pos logit contributions
 violation+21.481
 ['+20.090
 Declaration+18.700
 paragraph+18.674
 memorandum+18.210
Neg logit contributions
 passers-19.868
 climbers-19.702
 newcomers-19.065
 onlook-18.672
 commuters-18.318
Token Abedin
Token ID39562
Feature activation+2.33e+00

Pos logit contributions
 violation+9.005
 ['+8.268
 memorandum+7.693
 Declaration+7.588
 coercion+7.533
Neg logit contributions
 passers-14.680
 climbers-14.640
 newcomers-14.451
 youngsters-14.140
 commuters-14.112
Token,
Token ID11
Feature activation+3.65e+00

Pos logit contributions
 violation+16.035
 ['+14.747
 plagiar+14.144
 coercion+13.878
 Declaration+13.857
Neg logit contributions
 climbers-15.580
 newcomers-15.300
 passers-15.196
 youngsters-14.663
 commuters-14.517
Token is
Token ID318
Feature activation+2.43e+00

Pos logit contributions
 violation+24.887
 ['+23.595
 plagiar+23.587
 treason+22.476
 DID+22.462
Neg logit contributions
 newcomers-17.626
 climbers-17.479
 passers-17.102
 youngsters-16.525
 onlook-16.380
Token connected
Token ID5884
Feature activation+1.13e+00

Pos logit contributions
 violation+17.685
 ['+16.352
 treason+16.060
 quotation+15.603
 coercion+15.544
Neg logit contributions
 newcomers-15.322
 passers-15.280
 climbers-15.205
heastern-14.582
 youngsters-14.494
Token to
Token ID284
Feature activation+3.10e+00

Pos logit contributions
 violation+8.136
 ['+7.479
 quotation+7.009
 paragraph+6.885
 treason+6.884
Neg logit contributions
 newcomers-8.605
 climbers-8.486
 passers-8.413
 youngsters-8.232
 commuters-8.122
Token the
Token ID262
Feature activation+2.06e+00

Pos logit contributions
 violation+22.804
 ['+20.951
 treason+20.826
 coercion+19.967
 Authorization+19.601
Neg logit contributions
 newcomers-16.157
 passers-15.896
 climbers-15.742
heast-15.537
 onlook-15.460
Token Muslim
Token ID3765
Feature activation+2.09e+00

Pos logit contributions
 violation+16.070
 ['+14.678
 Declaration+14.173
 Authorization+14.039
 treason+13.964
Neg logit contributions
 passers-13.435
 climbers-13.197
 newcomers-13.187
 onlook-12.856
 commuters-12.756
Token because
Token ID780
Feature activation+1.19e+00

Pos logit contributions
 violation+10.234
 ['+9.449
 Declaration+8.961
 coercion+8.932
 quotation+8.882
Neg logit contributions
 climbers-10.854
 newcomers-10.829
 passers-10.728
 youngsters-10.449
 commuters-10.388
Token the
Token ID262
Feature activation+3.48e-01

Pos logit contributions
 violation+8.296
 ['+7.659
 coercion+7.229
 Declaration+7.160
 quotation+7.157
Neg logit contributions
 climbers-9.755
 newcomers-9.643
 passers-9.547
 commuters-9.375
 youngsters-9.359
Token idea
Token ID2126
Feature activation+1.83e+00

Pos logit contributions
 violation+2.519
 ['+2.348
 quotation+2.216
 Declaration+2.206
 coercion+2.185
Neg logit contributions
 climbers-2.899
 newcomers-2.893
 passers-2.845
 youngsters-2.801
 commuters-2.782
Token that
Token ID326
Feature activation+2.15e+00

Pos logit contributions
 violation+12.071
 ['+11.231
 coercion+10.328
 quotation+10.321
 paragraph+10.090
Neg logit contributions
 climbers-14.477
 newcomers-14.342
 passers-14.147
 youngsters-13.907
 commuters-13.824
Token the
Token ID262
Feature activation+8.21e-01

Pos logit contributions
 violation+17.922
 ['+16.700
 coercion+16.180
 quotation+15.789
 Declaration+15.653
Neg logit contributions
 climbers-12.394
 newcomers-12.033
heast-11.984
 passers-11.912
 commuters-11.723
Token president
Token ID1893
Feature activation+3.64e+00

Pos logit contributions
 violation+6.053
 ['+5.581
 Declaration+5.291
 quotation+5.277
 coercion+5.252
Neg logit contributions
 climbers-6.673
 newcomers-6.603
 passers-6.531
 commuters-6.401
 youngsters-6.388
Token simply
Token ID2391
Feature activation+2.64e+00

Pos logit contributions
 violation+25.926
 ['+24.315
 coercion+23.203
 memorandum+22.838
 Authorization+22.524
Neg logit contributions
 newcomers-17.321
 climbers-17.234
 passers-17.018
 commuters-16.293
heast-16.225
Token says
Token ID1139
Feature activation+2.38e+00

Pos logit contributions
 violation+20.500
 ['+19.360
 coercion+18.252
 memorandum+18.167
 Authorization+17.699
Neg logit contributions
 newcomers-14.859
 climbers-14.578
 passers-14.278
 youngsters-14.125
heast-13.881
Token criminal
Token ID4301
Feature activation+1.82e+00

Pos logit contributions
 violation+17.787
 ['+17.040
 coercion+15.838
 Declaration+15.712
 quotation+15.458
Neg logit contributions
 newcomers-14.490
 climbers-14.487
heast-13.968
 passers-13.958
 youngsters-13.720
Token laws
Token ID3657
Feature activation+2.37e+00

Pos logit contributions
 violation+15.071
 ['+13.926
 coercion+13.506
 quotation+13.094
 memorandum+13.031
Neg logit contributions
 climbers-11.283
 passers-11.060
 newcomers-11.028
heast-10.895
 commuters-10.799
Token that
Token ID326
Feature activation+2.01e+00

Pos logit contributions
 violation+19.687
 ['+17.755
 coercion+16.890
 memorandum+16.506
 paragraph+16.366
Neg logit contributions
 newcomers-13.546
 climbers-13.446
 passers-13.279
 youngsters-12.805
heast-12.735
Token declining
Token ID18080
Feature activation-1.47e+00

Pos logit contributions
 newcomers+9.298
 climbers+9.193
 youngsters+9.026
 commuters+8.936
 passers+8.907
Neg logit contributions
 violation-7.269
 ['-7.010
 quotation-6.536
 Authorization-6.457
 coercion-6.339
Token sugar
Token ID7543
Feature activation-4.09e-01

Pos logit contributions
 climbers+10.695
 newcomers+10.558
 youngsters+10.396
 passers+10.320
 commuters+10.291
Neg logit contributions
 violation-10.058
 ['-9.930
 Inquiry-9.003
 quotation-8.999
 Authorization-8.972
Token industry
Token ID2831
Feature activation-4.97e-01

Pos logit contributions
 climbers+3.087
 newcomers+3.036
 passers+2.961
 youngsters+2.945
 commuters+2.939
Neg logit contributions
 violation-3.195
 ['-3.046
 Declaration-2.862
 quotation-2.838
 Authorization-2.827
Token that
Token ID326
Feature activation-1.12e+00

Pos logit contributions
 newcomers+3.921
 climbers+3.912
 passers+3.822
 youngsters+3.781
 commuters+3.766
Neg logit contributions
 violation-3.717
 ['-3.473
 Declaration-3.296
 quotation-3.291
 coercion-3.262
Token can
Token ID460
Feature activation-2.79e+00

Pos logit contributions
 climbers+10.587
 newcomers+10.566
 commuters+10.318
 youngsters+10.279
 passers+10.270
Neg logit contributions
 violation-5.927
 ['-5.469
 Declaration-4.997
 quotation-4.986
 Authorization-4.958
Token help
Token ID1037
Feature activation-4.23e+00

Pos logit contributions
 climbers+18.833
 newcomers+18.616
 upscale+18.422
 growers+18.214
 youngsters+18.059
Neg logit contributions
 violation-15.259
 Declaration-14.362
 ['-14.309
 Authorization-13.987
 memorandum-13.653
Token rejuven
Token ID46834
Feature activation-1.80e+00

Pos logit contributions
 growers+27.694
 newcomers+27.577
 climbers+27.183
 youngsters+26.854
 commuters+26.813
Neg logit contributions
 violation-16.312
 ['-16.260
 Authorization-15.070
 Declaration-14.377
 memorandum-14.164
Tokenate
Token ID378
Feature activation-2.20e+00

Pos logit contributions
 newcomers+6.174
 climbers+5.887
 upscale+5.790
 youngsters+5.777
 passers+5.773
Neg logit contributions
 violation-19.234
 quotation-17.864
 Declaration-17.805
 ['-17.676
 coercion-17.666
Token both
Token ID1111
Feature activation-2.07e+00

Pos logit contributions
 climbers+16.979
 newcomers+16.962
 growers+16.871
 youngsters+16.743
 commuters+16.556
Neg logit contributions
 violation-12.122
 ['-11.755
 Authorization-10.730
 quotation-10.608
 coercion-10.365
Token the
Token ID262
Feature activation-2.23e+00

Pos logit contributions
 newcomers+16.446
 climbers+16.215
 growers+16.117
 youngsters+16.057
 commuters+15.801
Neg logit contributions
 violation-11.528
 ['-11.310
 Authorization-10.700
 quotation-10.081
 Declaration-10.040
Token local
Token ID1957
Feature activation-2.03e+00

Pos logit contributions
 climbers+16.347
 newcomers+16.343
 growers+16.084
 youngsters+16.019
 upscale+15.602
Neg logit contributions
 violation-12.960
 ['-12.793
 Authorization-11.641
 quotation-11.429
 coercion-11.186
Token-
Token ID12
Feature activation-1.36e+00

Pos logit contributions
 newcomers+14.708
 climbers+14.027
 passers+13.600
 commuters+13.456
 upscale+13.334
Neg logit contributions
 violation-14.948
 ['-14.853
 Authorization-13.901
 coercion-13.720
 Declaration-13.666
Tokennew
Token ID3605
Feature activation-2.14e+00

Pos logit contributions
 newcomers+6.639
 climbers+6.394
 passers+6.147
 commuters+6.121
 youngsters+5.991
Neg logit contributions
 violation-13.315
 ['-12.707
 quotation-12.391
 Authorization-12.329
 Declaration-12.315
Token terrain
Token ID15510
Feature activation-1.08e+00

Pos logit contributions
 newcomers+16.791
 climbers+16.714
 upscale+16.285
 commuters+16.194
 passers+16.155
Neg logit contributions
 violation-11.430
 ['-11.127
 coercion-10.131
 Declaration-10.082
 Authorization-9.966
Token management
Token ID4542
Feature activation-7.25e-01

Pos logit contributions
 newcomers+8.566
 climbers+8.556
 passers+8.347
 commuters+8.229
 youngsters+8.156
Neg logit contributions
 violation-7.336
 ['-7.127
 Declaration-6.685
 memorandum-6.667
 quotation-6.629
Token technology
Token ID3037
Feature activation-2.12e+00

Pos logit contributions
 newcomers+6.446
 climbers+6.444
 passers+6.340
 commuters+6.221
 youngsters+6.220
Neg logit contributions
 violation-4.636
 ['-4.339
 quotation-4.060
 Declaration-4.054
 coercion-3.993
Token help
Token ID1037
Feature activation-4.18e+00

Pos logit contributions
 newcomers+16.207
 climbers+16.008
 passers+15.522
 commuters+15.471
 upscale+15.247
Neg logit contributions
 violation-12.577
 ['-11.678
 quotation-11.310
 coercion-11.191
 Declaration-11.076
Token F
Token ID376
Feature activation-8.89e-01

Pos logit contributions
 newcomers+28.812
 climbers+28.497
 commuters+27.707
 youngsters+27.107
 passers+27.081
Neg logit contributions
 ['-16.322
 violation-16.137
 memorandum-14.990
 Authorization-14.763
 Declaration-14.714
Token-
Token ID12
Feature activation-1.01e-01

Pos logit contributions
 newcomers+6.149
 climbers+6.100
 passers+6.024
 commuters+5.959
 youngsters+5.931
Neg logit contributions
 violation-7.167
 ['-6.826
 Declaration-6.621
 memorandum-6.535
 quotation-6.515
Token150
Token ID8628
Feature activation-2.39e+00

Pos logit contributions
 newcomers+0.764
 climbers+0.764
 passers+0.748
 youngsters+0.738
 commuters+0.734
Neg logit contributions
 violation-0.807
 ['-0.753
 quotation-0.720
 Declaration-0.714
 memorandum-0.709
Token Rapt
Token ID21301
Feature activation-1.91e-01

Pos logit contributions
 newcomers+18.781
 climbers+18.616
 commuters+18.282
 passers+18.217
 youngsters+17.788
Neg logit contributions
 violation-13.320
 ['-12.847
 memorandum-12.179
 Declaration-11.822
 quotation-11.808
Tokenor
Token ID273
Feature activation-3.30e+00

Pos logit contributions
 newcomers+1.513
 climbers+1.510
 passers+1.487
 youngsters+1.457
 commuters+1.454
Neg logit contributions
 violation-1.459
 ['-1.368
 quotation-1.302
 Declaration-1.301
 memorandum-1.285
Token a
Token ID257
Feature activation+5.19e-01

Pos logit contributions
 violation+4.095
 ['+3.797
 quotation+3.622
 Declaration+3.598
 coercion+3.571
Neg logit contributions
 climbers-4.238
 newcomers-4.228
 passers-4.155
 youngsters-4.064
 commuters-4.012
Token lot
Token ID1256
Feature activation-1.06e+00

Pos logit contributions
 violation+4.299
 ['+3.986
 quotation+3.832
 Declaration+3.816
 memorandum+3.796
Neg logit contributions
 climbers-3.784
 newcomers-3.767
 passers-3.701
 youngsters-3.621
 commuters-3.580
Token of
Token ID286
Feature activation-6.31e-01

Pos logit contributions
 newcomers+7.567
 climbers+7.567
 commuters+7.417
 youngsters+7.409
 passers+7.345
Neg logit contributions
 violation-8.252
 ['-7.775
 Declaration-7.464
 Authorization-7.434
 quotation-7.421
Token money
Token ID1637
Feature activation-1.07e+00

Pos logit contributions
 climbers+4.876
 newcomers+4.847
 commuters+4.810
 youngsters+4.752
 passers+4.730
Neg logit contributions
 violation-4.742
 ['-4.558
 Declaration-4.273
 Authorization-4.261
 quotation-4.236
Token and
Token ID290
Feature activation-1.79e+00

Pos logit contributions
 climbers+8.708
 newcomers+8.675
 commuters+8.567
 youngsters+8.496
 passers+8.462
Neg logit contributions
 violation-7.095
 ['-6.636
 Declaration-6.419
 quotation-6.371
 Authorization-6.321
Token help
Token ID1037
Feature activation-4.04e+00

Pos logit contributions
 commuters+15.019
 climbers+14.824
 newcomers+14.747
 youngsters+14.613
 passers+14.467
Neg logit contributions
 violation-9.857
 ['-9.801
 Declaration-8.876
 quotation-8.721
 Authorization-8.641
Token in
Token ID287
Feature activation-1.09e+00

Pos logit contributions
 commuters+27.536
 youngsters+26.739
 climbers+26.674
 newcomers+26.560
 pave+25.800
Neg logit contributions
 ['-16.314
 violation-15.868
 Authorization-15.298
 Declaration-14.564
 quotation-14.377
Token achieve
Token ID4620
Feature activation-1.56e-01

Pos logit contributions
 climbers+9.130
 newcomers+9.112
 commuters+9.039
 youngsters+8.909
 passers+8.863
Neg logit contributions
 violation-6.864
 ['-6.761
 Authorization-6.251
 Declaration-6.225
 quotation-6.174
Token its
Token ID663
Feature activation-9.46e-02

Pos logit contributions
 climbers+1.355
 newcomers+1.353
 passers+1.328
 commuters+1.317
 youngsters+1.315
Neg logit contributions
 violation-1.060
 ['-0.992
 quotation-0.926
 Declaration-0.919
 coercion-0.915
Token climate
Token ID4258
Feature activation+1.11e+00

Pos logit contributions
 climbers+0.717
 newcomers+0.713
 passers+0.700
 youngsters+0.692
 commuters+0.690
Neg logit contributions
 violation-0.752
 ['-0.709
 quotation-0.670
 Declaration-0.665
 coercion-0.664
Token and
Token ID290
Feature activation-9.59e-01

Pos logit contributions
 violation+8.818
 ['+8.069
 Declaration+7.795
 quotation+7.772
 memorandum+7.735
Neg logit contributions
 climbers-8.122
 newcomers-8.069
 passers-8.043
 youngsters-7.831
 commuters-7.745
Token aluminum
Token ID15955
Feature activation-1.24e+00

Pos logit contributions
 climbers+1.405
 newcomers+1.404
 passers+1.363
 commuters+1.343
 youngsters+1.340
Neg logit contributions
 violation-1.607
 ['-1.521
 Declaration-1.437
 quotation-1.435
 coercion-1.418
Token-
Token ID12
Feature activation-4.64e-01

Pos logit contributions
 climbers+8.805
 newcomers+8.685
 commuters+8.376
 passers+8.344
 youngsters+8.164
Neg logit contributions
 violation-9.242
 ['-9.040
 Declaration-8.517
 Authorization-8.364
 quotation-8.307
Tokenall
Token ID439
Feature activation-1.01e+00

Pos logit contributions
 climbers+3.304
 newcomers+3.237
 passers+3.170
 commuters+3.127
 youngsters+3.116
Neg logit contributions
 violation-3.834
 ['-3.624
 quotation-3.465
 Declaration-3.458
 Authorization-3.397
Tokenoy
Token ID726
Feature activation-7.91e-01

Pos logit contributions
 climbers+4.725
 newcomers+4.702
 passers+4.405
 commuters+4.323
 youngsters+4.283
Neg logit contributions
 violation-10.330
 ['-10.167
 Authorization-9.650
 Declaration-9.626
 quotation-9.587
Token body
Token ID1767
Feature activation-7.14e-01

Pos logit contributions
 climbers+5.908
 newcomers+5.866
 passers+5.689
 commuters+5.655
 youngsters+5.530
Neg logit contributions
 violation-5.953
 ['-5.740
 Declaration-5.427
 memorandum-5.336
 quotation-5.328
Token help
Token ID1037
Feature activation-3.99e+00

Pos logit contributions
 climbers+5.710
 newcomers+5.629
 passers+5.548
 commuters+5.453
 youngsters+5.382
Neg logit contributions
 violation-5.150
 ['-4.879
 Declaration-4.605
 quotation-4.562
 Authorization-4.534
Token save
Token ID3613
Feature activation-1.30e+00

Pos logit contributions
 newcomers+26.722
 climbers+26.520
 passers+25.275
 commuters+25.224
 upscale+25.176
Neg logit contributions
 ['-16.768
 violation-16.650
 Authorization-15.534
 Declaration-15.523
 memorandum-14.905
Token more
Token ID517
Feature activation-1.83e+00

Pos logit contributions
 climbers+11.107
 newcomers+10.948
 commuters+10.942
 passers+10.711
 youngsters+10.686
Neg logit contributions
 violation-7.990
 ['-7.841
 Declaration-7.193
 Authorization-7.045
 memorandum-7.000
Token than
Token ID621
Feature activation+1.08e+00

Pos logit contributions
 climbers+12.563
 commuters+12.206
 newcomers+11.989
 passers+11.850
 youngsters+11.777
Neg logit contributions
 violation-13.296
 ['-13.129
 Declaration-12.242
 memorandum-12.182
 Authorization-12.179
Token 500
Token ID5323
Feature activation-2.10e+00

Pos logit contributions
 violation+5.841
 ['+5.253
 quotation+4.949
 coercion+4.883
 Declaration+4.857
Neg logit contributions
 climbers-10.506
 newcomers-10.424
 passers-10.387
 youngsters-10.179
 commuters-10.079
Token pounds
Token ID8059
Feature activation-9.89e-01

Pos logit contributions
 climbers+15.374
 commuters+15.150
 newcomers+14.924
 youngsters+14.593
 passers+14.532
Neg logit contributions
 violation-13.149
 ['-12.890
 memorandum-11.926
 Declaration-11.913
 coercion-11.905
Token was
Token ID373
Feature activation+3.37e-01

Pos logit contributions
 climbers+4.335
 newcomers+4.315
 passers+4.256
 commuters+4.193
 youngsters+4.189
Neg logit contributions
 violation-2.175
 ['-1.991
 Declaration-1.846
 quotation-1.836
 coercion-1.794
Token just
Token ID655
Feature activation+7.59e-01

Pos logit contributions
 violation+2.619
 ['+2.442
 quotation+2.319
 Declaration+2.308
 coercion+2.293
Neg logit contributions
 climbers-2.635
 newcomers-2.626
 passers-2.584
 youngsters-2.538
 commuters-2.520
Token too
Token ID1165
Feature activation-1.37e+00

Pos logit contributions
 violation+6.033
 ['+5.587
 quotation+5.346
 coercion+5.301
 Declaration+5.297
Neg logit contributions
 climbers-5.713
 newcomers-5.701
 passers-5.590
 youngsters-5.482
 commuters-5.445
Token expensive
Token ID5789
Feature activation-1.60e+00

Pos logit contributions
 climbers+8.152
 newcomers+8.043
 youngsters+7.688
 commuters+7.647
 passers+7.633
Neg logit contributions
 violation-11.666
 ['-11.222
 Declaration-10.925
 Authorization-10.755
 memorandum-10.668
Token for
Token ID329
Feature activation-1.70e+00

Pos logit contributions
 newcomers+11.340
 climbers+11.223
 passers+11.111
 commuters+10.929
 youngsters+10.921
Neg logit contributions
 violation-11.389
 ['-10.852
 Declaration-10.519
 quotation-10.420
 Authorization-10.320
Token many
Token ID867
Feature activation-3.95e+00

Pos logit contributions
 newcomers+14.796
 passers+14.497
 climbers+14.418
 commuters+14.264
 youngsters+14.210
Neg logit contributions
 violation-9.349
 ['-8.994
 Declaration-8.275
 quotation-8.252
 Authorization-8.155
Token theaters
Token ID20550
Feature activation-2.07e+00

Pos logit contributions
 newcomers+28.512
 passers+28.105
 youngsters+27.784
 commuters+27.607
 climbers+27.280
Neg logit contributions
 violation-14.837
 ['-14.539
 Authorization-14.019
 Declaration-13.680
 memorandum-13.665
Token.
Token ID13
Feature activation-1.72e+00

Pos logit contributions
 newcomers+15.624
 passers+15.344
 climbers+15.198
 youngsters+15.020
 commuters+15.000
Neg logit contributions
 violation-12.848
 ['-11.970
 Declaration-11.750
 quotation-11.733
 coercion-11.504
Token
Token ID198
Feature activation-3.89e-01

Pos logit contributions
 newcomers+12.273
 climbers+11.951
 passers+11.709
 commuters+11.671
 youngsters+11.655
Neg logit contributions
 violation-11.921
 coercion-10.824
 quotation-10.632
 memorandum-10.576
 ['-10.471
Token
Token ID198
Feature activation-1.33e+00

Pos logit contributions
 newcomers+2.479
 climbers+2.451
 passers+2.401
 youngsters+2.376
 commuters+2.345
Neg logit contributions
 violation-3.520
 ['-3.279
 quotation-3.187
 coercion-3.178
 memorandum-3.165
TokenIronically
Token ID44850
Feature activation-1.78e+00

Pos logit contributions
 newcomers+9.433
 passers+9.283
 climbers+9.212
 youngsters+9.053
 commuters+9.005
Neg logit contributions
 violation-9.973
 ['-9.009
 quotation-8.943
 memorandum-8.840
 coercion-8.762
Token 2009
Token ID3717
Feature activation-7.41e-01

Pos logit contributions
 violation+5.005
 ['+4.699
 quotation+4.484
 Declaration+4.465
 coercion+4.432
Neg logit contributions
 climbers-3.987
 newcomers-3.945
 passers-3.890
 youngsters-3.805
 commuters-3.780
Token in
Token ID287
Feature activation+1.18e+00

Pos logit contributions
 newcomers+4.739
 climbers+4.735
 passers+4.582
 youngsters+4.538
 commuters+4.531
Neg logit contributions
 violation-6.519
 ['-6.182
 quotation-5.940
 Declaration-5.938
 coercion-5.875
Token a
Token ID257
Feature activation+7.91e-01

Pos logit contributions
 violation+8.810
 ['+7.805
 quotation+7.659
 coercion+7.446
 memorandum+7.436
Neg logit contributions
 climbers-9.226
 newcomers-9.216
 passers-9.033
 youngsters-8.881
heast-8.812
Token bid
Token ID8406
Feature activation-9.54e-01

Pos logit contributions
 violation+5.925
 ['+5.399
 quotation+5.195
 memorandum+5.191
 Declaration+5.155
Neg logit contributions
 climbers-6.372
 newcomers-6.341
 passers-6.247
 youngsters-6.132
 commuters-6.115
Token to
Token ID284
Feature activation-2.57e+00

Pos logit contributions
 climbers+5.976
 newcomers+5.966
 passers+5.854
 youngsters+5.754
 commuters+5.674
Neg logit contributions
 violation-8.501
 ['-8.135
 Declaration-7.785
 quotation-7.751
 Authorization-7.676
Token help
Token ID1037
Feature activation-3.90e+00

Pos logit contributions
 climbers+17.330
 newcomers+16.998
 youngsters+16.923
 passers+16.738
 commuters+16.563
Neg logit contributions
 violation-15.642
 ['-14.665
 Declaration-14.613
 Authorization-14.465
milo-13.872
Token economic
Token ID3034
Feature activation-6.04e-01

Pos logit contributions
 newcomers+26.195
 youngsters+26.052
 commuters+25.667
 climbers+25.584
 passers+24.862
Neg logit contributions
 violation-17.207
 Authorization-16.895
 ['-16.685
 Declaration-15.573
 plagiar-14.897
Token recovery
Token ID7628
Feature activation-1.82e+00

Pos logit contributions
 newcomers+4.094
 climbers+4.046
 youngsters+3.931
 passers+3.884
 commuters+3.852
Neg logit contributions
 violation-5.353
 ['-5.084
 Authorization-4.780
 Declaration-4.779
 quotation-4.749
Token.
Token ID13
Feature activation-1.23e+00

Pos logit contributions
 climbers+13.637
 newcomers+13.399
 youngsters+13.179
 passers+12.981
 commuters+12.951
Neg logit contributions
 violation-12.267
 ['-11.590
 Declaration-11.265
 Authorization-11.184
 quotation-11.064
Token
Token ID198
Feature activation-5.83e-01

Pos logit contributions
 newcomers+7.401
 climbers+7.306
 passers+7.076
 youngsters+7.063
 commuters+7.046
Neg logit contributions
 violation-10.933
 ['-10.105
 coercion-9.904
 quotation-9.781
 memorandum-9.652
Token
Token ID198
Feature activation-1.50e+00

Pos logit contributions
 newcomers+3.542
 climbers+3.473
 passers+3.413
 youngsters+3.394
 commuters+3.343
Neg logit contributions
 violation-5.391
 ['-4.989
 coercion-4.896
 quotation-4.885
 memorandum-4.881
Token and
Token ID290
Feature activation-7.32e-01

Pos logit contributions
 climbers+1.080
 newcomers+1.077
 passers+1.057
 youngsters+1.038
 commuters+1.032
Neg logit contributions
 violation-1.165
 ['-1.092
 quotation-1.039
 Declaration-1.034
 coercion-1.027
Token the
Token ID262
Feature activation-8.42e-01

Pos logit contributions
 newcomers+6.129
 climbers+6.121
 passers+5.983
 youngsters+5.896
 commuters+5.886
Neg logit contributions
 violation-4.956
 ['-4.742
 quotation-4.356
 Declaration-4.311
 Authorization-4.291
Token like
Token ID588
Feature activation-1.20e+00

Pos logit contributions
 newcomers+7.274
 climbers+7.261
 youngsters+6.989
 passers+6.946
 commuters+6.922
Neg logit contributions
 violation-5.272
 ['-5.125
 quotation-4.659
 coercion-4.584
 paragraph-4.488
Token
Token ID960
Feature activation-1.05e+00

Pos logit contributions
 newcomers+8.382
 climbers+8.248
 youngsters+8.067
 passers+8.034
 commuters+7.963
Neg logit contributions
 violation-9.476
 ['-8.891
 quotation-8.540
 coercion-8.504
 Declaration-8.444
Tokento
Token ID1462
Feature activation-2.34e+00

Pos logit contributions
 newcomers+7.875
 climbers+7.849
 commuters+7.614
 passers+7.586
 youngsters+7.585
Neg logit contributions
 violation-7.536
 ['-7.099
 quotation-6.856
 Declaration-6.703
 coercion-6.648
Token help
Token ID1037
Feature activation-3.86e+00

Pos logit contributions
 climbers+16.356
 newcomers+16.330
 commuters+15.671
 youngsters+15.670
 passers+15.528
Neg logit contributions
 violation-14.204
 ['-13.498
 Declaration-12.852
 quotation-12.828
 Authorization-12.563
Token the
Token ID262
Feature activation-2.26e+00

Pos logit contributions
 newcomers+26.733
 climbers+26.580
 commuters+25.973
 youngsters+25.667
 pave+25.335
Neg logit contributions
 ['-15.645
 violation-15.465
 quotation-14.267
 Authorization-13.764
statement-13.333
Token world
Token ID995
Feature activation-2.01e+00

Pos logit contributions
 newcomers+17.915
 climbers+17.270
 youngsters+16.767
 commuters+16.433
 passers+16.137
Neg logit contributions
 ['-12.319
 violation-12.235
 quotation-10.916
 coercion-10.780
 Authorization-10.729
Token's
Token ID338
Feature activation-1.92e+00

Pos logit contributions
 newcomers+15.366
 climbers+15.280
 commuters+14.710
 youngsters+14.652
 passers+14.590
Neg logit contributions
 violation-12.397
 ['-11.687
 quotation-11.295
 Authorization-11.231
 Inquiry-10.958
Token two
Token ID734
Feature activation-1.57e+00

Pos logit contributions
 newcomers+14.831
 climbers+14.498
 youngsters+14.081
 commuters+13.806
 passers+13.628
Neg logit contributions
 violation-12.136
 ['-11.998
 Authorization-11.627
 quotation-11.147
 Declaration-10.769
Token largest
Token ID4387
Feature activation-1.57e+00

Pos logit contributions
 newcomers+10.077
 climbers+10.020
 youngsters+9.368
 passers+9.275
 busiest+9.208
Neg logit contributions
 violation-12.559
 ['-12.229
 Authorization-11.805
 quotation-11.562
 Inquiry-11.426
Token Workers
Token ID16847
Feature activation+1.48e-01

Pos logit contributions
 violation+6.515
 ['+6.145
 Authorization+5.835
 Declaration+5.833
 quotation+5.803
Neg logit contributions
 newcomers-6.833
 climbers-6.775
 passers-6.720
 youngsters-6.598
 commuters-6.485
Token represents
Token ID6870
Feature activation-1.89e+00

Pos logit contributions
 violation+1.029
 ['+0.954
 quotation+0.898
 Declaration+0.893
 coercion+0.887
Neg logit contributions
 climbers-1.279
 newcomers-1.275
 passers-1.255
 youngsters-1.235
 commuters-1.230
Token nearly
Token ID3016
Feature activation+1.12e-01

Pos logit contributions
 newcomers+15.454
 climbers+15.318
 commuters+15.145
 passers+15.086
 youngsters+14.747
Neg logit contributions
 violation-11.113
 ['-10.942
 Declaration-9.823
 Authorization-9.788
 quotation-9.739
Token 3
Token ID513
Feature activation-2.10e+00

Pos logit contributions
 violation+0.683
 ['+0.625
 quotation+0.583
 Declaration+0.581
 coercion+0.572
Neg logit contributions
 climbers-1.077
 newcomers-1.076
 passers-1.057
 youngsters-1.044
 commuters-1.041
Token,
Token ID11
Feature activation+2.10e-01

Pos logit contributions
 climbers+10.941
 newcomers+10.811
 commuters+10.093
 passers+10.056
 youngsters+9.895
Neg logit contributions
 violation-17.680
 ['-17.317
 quotation-16.656
 Declaration-16.444
 coercion-16.384
Token000
Token ID830
Feature activation-3.82e+00

Pos logit contributions
 violation+1.497
 ['+1.384
 quotation+1.311
 Declaration+1.304
 coercion+1.295
Neg logit contributions
 climbers-1.789
 newcomers-1.784
 passers-1.752
 youngsters-1.728
 commuters-1.716
Token workers
Token ID3259
Feature activation-1.56e+00

Pos logit contributions
 newcomers+26.071
 commuters+25.748
 climbers+25.478
 passers+25.096
 residents+24.796
Neg logit contributions
 ['-16.317
 violation-16.076
 Authorization-14.993
 Zionism-14.623
 Declaration-14.385
Token at
Token ID379
Feature activation-8.25e-01

Pos logit contributions
 climbers+11.146
 newcomers+11.061
 commuters+10.970
 passers+10.865
 youngsters+10.598
Neg logit contributions
 violation-11.112
 ['-10.674
 Declaration-10.309
 Authorization-10.212
 quotation-10.023
Token Manitoba
Token ID28293
Feature activation-5.13e-01

Pos logit contributions
 newcomers+6.374
 climbers+6.320
 passers+6.207
 commuters+6.145
 youngsters+6.065
Neg logit contributions
 violation-6.128
 ['-5.880
 quotation-5.477
 Declaration-5.432
 Authorization-5.407
Token Hydro
Token ID32116
Feature activation-7.96e-01

Pos logit contributions
 newcomers+3.094
 climbers+3.091
 passers+3.012
 commuters+2.965
 youngsters+2.952
Neg logit contributions
 violation-4.839
 ['-4.606
 quotation-4.391
 coercion-4.341
 Declaration-4.328
Token.
Token ID13
Feature activation-1.46e+00

Pos logit contributions
 newcomers+6.618
 climbers+6.555
 passers+6.459
 commuters+6.443
 youngsters+6.363
Neg logit contributions
 violation-5.471
 ['-5.178
 quotation-4.816
 coercion-4.774
 Declaration-4.771
Token improvements
Token ID8561
Feature activation-1.71e+00

Pos logit contributions
 climbers+8.381
 newcomers+8.376
 passers+8.217
 commuters+8.069
 youngsters+8.051
Neg logit contributions
 violation-4.517
 ['-4.217
 Declaration-3.954
 Authorization-3.859
 quotation-3.818
Token leading
Token ID3756
Feature activation-2.10e+00

Pos logit contributions
 newcomers+13.530
 climbers+13.486
 passers+13.134
 youngsters+13.011
 commuters+12.874
Neg logit contributions
 violation-10.673
 ['-10.042
 Declaration-9.801
 coercion-9.444
 quotation-9.430
Token up
Token ID510
Feature activation-7.38e-01

Pos logit contributions
 newcomers+15.413
 climbers+15.071
 passers+14.890
 youngsters+14.712
 commuters+14.419
Neg logit contributions
 violation-14.396
 ['-13.635
 Declaration-12.813
 coercion-12.707
 memorandum-12.550
Token to
Token ID284
Feature activation+8.86e-01

Pos logit contributions
 newcomers+4.716
 climbers+4.670
 passers+4.581
 youngsters+4.496
 commuters+4.425
Neg logit contributions
 violation-6.643
 ['-6.320
 Declaration-6.004
 quotation-6.003
 coercion-5.956
Token it
Token ID340
Feature activation-1.06e+00

Pos logit contributions
 violation+6.041
 ['+5.506
 quotation+5.218
 Declaration+5.173
 paragraph+5.150
Neg logit contributions
 climbers-7.608
 newcomers-7.426
 passers-7.402
 youngsters-7.266
 commuters-7.252
Token helped
Token ID4193
Feature activation-3.80e+00

Pos logit contributions
 newcomers+8.377
 climbers+8.370
 passers+8.178
 youngsters+8.053
 commuters+7.984
Neg logit contributions
 violation-7.603
 ['-7.128
 Declaration-6.878
 quotation-6.717
 coercion-6.706
Token stop
Token ID2245
Feature activation-5.12e-01

Pos logit contributions
 newcomers+27.221
 climbers+26.172
 passers+25.918
 onlook+25.597
 youngsters+25.510
Neg logit contributions
 ['-15.649
 violation-15.379
 Authorization-14.171
 Declaration-13.801
 coercion-13.482
Token Firefox
Token ID16802
Feature activation+7.28e-01

Pos logit contributions
 newcomers+4.633
 climbers+4.573
 passers+4.515
 youngsters+4.465
 commuters+4.436
Neg logit contributions
 violation-3.236
 ['-3.118
 Declaration-2.901
 quotation-2.837
 memorandum-2.826
Token users
Token ID2985
Feature activation-1.28e-01

Pos logit contributions
 violation+5.034
 ['+4.613
 quotation+4.326
 Declaration+4.281
 coercion+4.279
Neg logit contributions
 climbers-6.220
 newcomers-6.165
 passers-6.116
 youngsters-6.022
 commuters-5.967
Token from
Token ID422
Feature activation+2.99e-01

Pos logit contributions
 climbers+1.084
 newcomers+1.083
 passers+1.061
 youngsters+1.046
 commuters+1.043
Neg logit contributions
 violation-0.908
 ['-0.849
 quotation-0.802
 Declaration-0.802
 coercion-0.787
Token defect
Token ID11855
Feature activation-2.91e-01

Pos logit contributions
 violation+2.193
 ['+2.034
 quotation+1.923
 Declaration+1.913
 coercion+1.902
Neg logit contributions
 climbers-2.478
 newcomers-2.467
 passers-2.431
 youngsters-2.391
 commuters-2.375
Token individuals
Token ID3925
Feature activation-1.73e+00

Pos logit contributions
 climbers+10.786
 passers+10.721
 newcomers+10.662
 commuters+10.546
 youngsters+10.541
Neg logit contributions
 violation-9.254
 ['-8.952
 Authorization-8.463
 quotation-8.435
 Declaration-8.413
Token have
Token ID423
Feature activation-1.41e+00

Pos logit contributions
 climbers+13.419
 newcomers+13.013
 passers+12.956
 commuters+12.907
 youngsters+12.848
Neg logit contributions
 violation-10.481
 ['-10.135
 quotation-9.733
 Declaration-9.700
 Authorization-9.566
Token come
Token ID1282
Feature activation-1.61e+00

Pos logit contributions
 climbers+11.971
 newcomers+11.727
 passers+11.704
 commuters+11.611
 youngsters+11.563
Neg logit contributions
 violation-8.270
 ['-7.978
 quotation-7.567
 Declaration-7.408
 Authorization-7.397
Token forward
Token ID2651
Feature activation-8.98e-01

Pos logit contributions
 climbers+10.164
 newcomers+9.902
 passers+9.708
 youngsters+9.573
 commuters+9.482
Neg logit contributions
 violation-12.848
 ['-11.965
 Authorization-11.690
 Declaration-11.686
 quotation-11.659
Token to
Token ID284
Feature activation-1.68e+00

Pos logit contributions
 climbers+7.320
 newcomers+7.268
 passers+7.251
 youngsters+7.082
 commuters+7.042
Neg logit contributions
 violation-6.161
 ['-5.775
 Declaration-5.514
 quotation-5.499
 Authorization-5.483
Token help
Token ID1037
Feature activation-3.79e+00

Pos logit contributions
 climbers+12.609
 commuters+12.283
 passers+12.258
 newcomers+12.229
 youngsters+11.986
Neg logit contributions
 violation-10.939
 ['-10.447
 Authorization-9.919
 Declaration-9.909
 quotation-9.893
Token the
Token ID262
Feature activation-2.78e+00

Pos logit contributions
 climbers+26.739
 commuters+26.629
 newcomers+26.402
 youngsters+25.932
 passers+25.902
Neg logit contributions
 violation-15.513
 ['-15.506
 Authorization-15.255
 Declaration-14.672
 quotation-14.646
Token affected
Token ID5676
Feature activation-1.90e+00

Pos logit contributions
 climbers+21.331
 newcomers+21.234
 commuters+21.108
 youngsters+21.052
 passers+20.865
Neg logit contributions
 ['-13.912
 Authorization-13.453
 violation-13.285
 quotation-12.873
 Declaration-12.517
Token.
Token ID13
Feature activation-1.71e+00

Pos logit contributions
 commuters+14.245
 climbers+14.102
 passers+13.990
 newcomers+13.972
 youngsters+13.960
Neg logit contributions
 violation-12.610
 ['-12.340
 Authorization-12.079
 Declaration-11.799
 quotation-11.768
Token In
Token ID554
Feature activation+2.08e-01

Pos logit contributions
 newcomers+12.101
 climbers+12.004
 youngsters+11.668
 commuters+11.666
 passers+11.585
Neg logit contributions
 violation-11.817
 coercion-10.680
 ['-10.611
 quotation-10.515
 memorandum-10.397
Token many
Token ID867
Feature activation-2.16e+00

Pos logit contributions
 violation+1.384
 ['+1.296
 quotation+1.212
 Declaration+1.206
 coercion+1.195
Neg logit contributions
 climbers-1.854
 newcomers-1.848
 passers-1.820
 youngsters-1.796
 commuters-1.788
Token wildlife
Token ID15599
Feature activation-7.15e-01

Pos logit contributions
 newcomers+7.448
 climbers+7.435
 passers+7.263
 youngsters+7.141
 commuters+7.085
Neg logit contributions
 violation-5.477
 ['-5.151
 Declaration-4.840
 Authorization-4.760
 quotation-4.747
Token habitats
Token ID35308
Feature activation-1.17e+00

Pos logit contributions
 climbers+6.153
 newcomers+6.135
 passers+5.977
 youngsters+5.884
 commuters+5.851
Neg logit contributions
 violation-4.790
 ['-4.518
 Declaration-4.285
 Authorization-4.224
 quotation-4.212
Token and
Token ID290
Feature activation-1.31e+00

Pos logit contributions
 climbers+9.634
 newcomers+9.619
 passers+9.402
 youngsters+9.252
 commuters+9.177
Neg logit contributions
 violation-7.735
 ['-7.216
 Declaration-6.890
 quotation-6.868
 Authorization-6.820
Token urban
Token ID7876
Feature activation-1.77e+00

Pos logit contributions
 newcomers+11.281
 climbers+11.244
 passers+10.939
 youngsters+10.748
 commuters+10.673
Neg logit contributions
 violation-7.948
 ['-7.703
 Declaration-7.189
 Authorization-7.074
 quotation-6.944
Token environments
Token ID12493
Feature activation-1.84e+00

Pos logit contributions
 newcomers+14.692
 climbers+14.652
 commuters+14.225
 passers+14.114
 youngsters+14.089
Neg logit contributions
 violation-10.117
 ['-9.707
 Authorization-9.115
 Declaration-9.056
 quotation-9.027
Token can
Token ID460
Feature activation-3.75e+00

Pos logit contributions
 newcomers+15.049
 climbers+15.005
 passers+14.601
 youngsters+14.442
 commuters+14.325
Neg logit contributions
 violation-10.734
 ['-9.881
 Declaration-9.567
 Authorization-9.525
 quotation-9.499
Token not
Token ID407
Feature activation-2.17e+00

Pos logit contributions
 climbers+23.386
 upscale+22.681
 newcomers+22.659
 pave+22.377
 youngsters+22.209
Neg logit contributions
 violation-18.011
 Declaration-17.244
 Authorization-17.065
 ['-16.979
 memorandum-16.234
Token only
Token ID691
Feature activation-3.36e+00

Pos logit contributions
 climbers+13.016
 newcomers+12.912
 youngsters+12.552
 upscale+12.544
 passers+12.283
Neg logit contributions
 violation-16.024
 Declaration-15.142
 ['-15.078
 quotation-14.735
 Authorization-14.707
Token co
Token ID763
Feature activation-1.57e+00

Pos logit contributions
 climbers+21.133
 upscale+20.660
 newcomers+20.520
 pave+20.451
 youngsters+20.309
Neg logit contributions
 violation-17.371
 Declaration-16.685
 ['-16.606
 Authorization-16.469
 quotation-15.792
Token-
Token ID12
Feature activation-1.96e+00

Pos logit contributions
 newcomers+9.367
 climbers+9.363
 passers+8.974
 commuters+8.860
 youngsters+8.712
Neg logit contributions
 violation-13.110
 ['-12.489
 Declaration-11.991
 quotation-11.970
 Authorization-11.900
Tokenexist
Token ID38476
Feature activation-1.87e+00

Pos logit contributions
 climbers+10.627
 newcomers+10.516
 commuters+10.066
 youngsters+10.016
 passers+9.975
Neg logit contributions
 violation-16.299
 ['-15.385
 Declaration-15.097
 quotation-15.028
 Authorization-14.987
Tokenenty
Token ID3787
Feature activation-1.85e+00

Pos logit contributions
 climbers+9.316
 newcomers+9.234
 passers+9.108
zens+9.012
 youngsters+8.955
Neg logit contributions
 violation-14.589
 quotation-13.773
 coercion-13.537
 ['-13.421
 Declaration-13.111
Token of
Token ID286
Feature activation-1.91e+00

Pos logit contributions
 newcomers+14.805
 climbers+14.728
 passers+14.443
 youngsters+14.364
 commuters+14.167
Neg logit contributions
 violation-11.588
 ['-10.665
 quotation-10.054
 Declaration-10.043
 memorandum-10.034
Token new
Token ID649
Feature activation-1.75e+00

Pos logit contributions
 newcomers+16.448
 climbers+16.397
 youngsters+15.940
 passers+15.918
 commuters+15.840
Neg logit contributions
 violation-10.282
 ['-9.843
 memorandum-8.857
 Authorization-8.696
 Declaration-8.684
Token airports
Token ID22600
Feature activation-1.49e+00

Pos logit contributions
 climbers+14.925
 newcomers+14.790
 youngsters+14.402
 passers+14.399
 commuters+14.347
Neg logit contributions
 violation-9.847
 ['-9.473
 coercion-8.472
 Declaration-8.446
 quotation-8.400
Token but
Token ID475
Feature activation-1.71e+00

Pos logit contributions
 climbers+11.888
 newcomers+11.855
 passers+11.602
 commuters+11.578
 youngsters+11.519
Neg logit contributions
 violation-9.529
 ['-9.054
 quotation-8.601
 Declaration-8.576
 coercion-8.558
Token few
Token ID1178
Feature activation-3.74e+00

Pos logit contributions
 newcomers+14.860
 climbers+14.557
 commuters+14.345
 passers+14.237
 youngsters+14.160
Neg logit contributions
 violation-9.405
 ['-8.841
 quotation-8.283
 coercion-8.184
 memorandum-8.103
Token passengers
Token ID10405
Feature activation-1.49e+00

Pos logit contributions
 newcomers+28.543
 commuters+27.397
 climbers+27.100
 passers+26.638
 youngsters+26.379
Neg logit contributions
 violation-13.655
 ['-13.531
 quotation-12.418
 coercion-12.414
 Authorization-12.406
Token in
Token ID287
Feature activation-1.08e+00

Pos logit contributions
 newcomers+11.912
 climbers+11.675
 passers+11.598
 commuters+11.539
 youngsters+11.402
Neg logit contributions
 violation-9.484
 ['-8.928
 quotation-8.672
 Declaration-8.593
 coercion-8.487
Token China
Token ID2807
Feature activation-1.13e+00

Pos logit contributions
 newcomers+9.687
 climbers+9.578
 commuters+9.420
 passers+9.391
 youngsters+9.339
Neg logit contributions
 violation-6.183
 ['-5.984
 Declaration-5.519
 quotation-5.512
 coercion-5.511
Token\n
Token ID198
Feature activation-1.54e-01

Pos logit contributions
 newcomers+9.293
 climbers+9.200
 commuters+9.070
 passers+8.982
 youngsters+8.920
Neg logit contributions
 violation-7.541
 ['-6.967
 quotation-6.711
 coercion-6.604
 Declaration-6.554
Token\n
Token ID198
Feature activation-7.34e-01

Pos logit contributions
 climbers+1.074
 newcomers+1.072
 passers+1.049
 youngsters+1.030
 commuters+1.025
Neg logit contributions
 violation-1.332
 ['-1.251
 quotation-1.196
 Declaration-1.190
 coercion-1.185
TokenS
Token ID50
Feature activation+2.38e-01

Pos logit contributions
 violation+4.343
 ['+4.091
 Declaration+3.775
 quotation+3.744
 Authorization+3.688
Neg logit contributions
 climbers-6.004
 newcomers-5.927
 passers-5.887
 youngsters-5.753
 commuters-5.747
Token.
Token ID13
Feature activation+4.25e-01

Pos logit contributions
 violation+2.216
 ['+2.087
 Declaration+2.004
 Authorization+1.978
 quotation+1.974
Neg logit contributions
 climbers-1.484
 passers-1.470
 newcomers-1.461
 youngsters-1.418
 commuters-1.409
Token military
Token ID2422
Feature activation+8.45e-01

Pos logit contributions
 violation+3.197
 ['+2.981
 Declaration+2.819
 quotation+2.815
 coercion+2.791
Neg logit contributions
 climbers-3.409
 newcomers-3.393
 passers-3.351
 youngsters-3.282
 commuters-3.266
Token
Token ID960
Feature activation+5.71e-01

Pos logit contributions
 violation+6.343
 ['+5.902
 coercion+5.595
 memorandum+5.581
 quotation+5.564
Neg logit contributions
 climbers-6.510
 newcomers-6.505
 passers-6.455
 youngsters-6.300
 commuters-6.245
Tokencould
Token ID24089
Feature activation-2.88e+00

Pos logit contributions
 violation+4.317
 ['+4.047
 quotation+3.813
 Declaration+3.805
 coercion+3.771
Neg logit contributions
 climbers-4.536
 newcomers-4.524
 passers-4.463
 youngsters-4.375
 commuters-4.339
Token help
Token ID1037
Feature activation-3.73e+00

Pos logit contributions
 pave+19.191
 climbers+19.037
 newcomers+18.610
 upscale+18.588
 commuters+18.276
Neg logit contributions
 violation-16.030
 ['-15.400
 Declaration-15.115
 coercion-14.649
 quotation-14.538
Token focus
Token ID2962
Feature activation-1.95e+00

Pos logit contributions
 climbers+26.273
 newcomers+26.232
 commuters+25.877
 youngsters+25.531
 pave+25.087
Neg logit contributions
 violation-15.193
 ['-15.167
 quotation-13.505
 Declaration-13.099
 coercion-12.948
Token and
Token ID290
Feature activation-1.90e+00

Pos logit contributions
 newcomers+15.581
 climbers+15.277
 youngsters+15.051
 commuters+14.889
 passers+14.580
Neg logit contributions
 violation-11.131
 ['-10.840
 quotation-9.872
 Declaration-9.721
 coercion-9.419
Token shape
Token ID5485
Feature activation-1.08e+00

Pos logit contributions
 climbers+14.035
 newcomers+13.958
 youngsters+13.577
 commuters+13.553
 upscale+13.506
Neg logit contributions
 violation-12.166
 ['-11.722
 Declaration-10.985
 quotation-10.790
 coercion-10.663
Token a
Token ID257
Feature activation-7.36e-02

Pos logit contributions
 newcomers+10.168
 climbers+10.017
 youngsters+9.811
 passers+9.781
 commuters+9.664
Neg logit contributions
 violation-5.798
 ['-5.492
 quotation-5.031
 coercion-4.813
 Declaration-4.759
Token coherent
Token ID24870
Feature activation+1.26e-01

Pos logit contributions
 newcomers+0.562
 climbers+0.560
 passers+0.544
 youngsters+0.541
 commuters+0.534
Neg logit contributions
 violation-0.578
 ['-0.549
 quotation-0.516
 coercion-0.514
 Declaration-0.507
Token Z
Token ID1168
Feature activation-1.18e+00

Pos logit contributions
 newcomers+8.499
 climbers+8.309
 youngsters+8.256
 passers+8.102
 commuters+7.851
Neg logit contributions
 violation-8.107
 ['-7.494
 quotation-7.244
 coercion-7.142
 memorandum-7.097
Tokenander
Token ID4066
Feature activation-3.72e-01

Pos logit contributions
 newcomers+6.712
 passers+6.660
 climbers+6.604
 youngsters+6.478
 commuters+6.346
Neg logit contributions
 violation-10.694
 ['-9.879
 coercion-9.836
 quotation-9.801
 Authorization-9.711
Token F
Token ID376
Feature activation-9.78e-01

Pos logit contributions
 climbers+2.800
 newcomers+2.798
 passers+2.745
 youngsters+2.717
 commuters+2.675
Neg logit contributions
 violation-2.990
 ['-2.743
 quotation-2.665
 coercion-2.626
 memorandum-2.600
Tokenag
Token ID363
Feature activation-6.35e-01

Pos logit contributions
 climbers+5.839
 newcomers+5.767
 passers+5.659
 youngsters+5.634
zens+5.612
Neg logit contributions
 violation-8.853
 ['-8.336
 quotation-8.049
 coercion-8.013
 Declaration-7.952
Tokenerson
Token ID882
Feature activation-3.24e+00

Pos logit contributions
 climbers+3.678
 newcomers+3.642
 passers+3.576
 youngsters+3.513
zens+3.500
Neg logit contributions
 violation-6.056
 ['-5.631
 quotation-5.552
 coercion-5.496
 Declaration-5.466
Token will
Token ID481
Feature activation-3.71e+00

Pos logit contributions
 newcomers+22.276
 climbers+22.242
 youngsters+21.786
 passers+20.631
 commuters+20.211
Neg logit contributions
 violation-16.848
 Authorization-15.102
 quotation-14.970
 ['-14.952
 Declaration-14.924
Token pack
Token ID2353
Feature activation-1.92e+00

Pos logit contributions
 newcomers+23.512
 youngsters+22.784
 climbers+22.725
 passers+21.620
 swell+21.522
Neg logit contributions
 violation-18.538
 Authorization-17.444
 Declaration-17.402
 coercion-16.961
 ['-16.899
Token down
Token ID866
Feature activation-1.72e+00

Pos logit contributions
 newcomers+15.239
 climbers+15.078
 youngsters+14.992
 passers+14.523
 commuters+14.297
Neg logit contributions
 violation-11.882
 ['-10.748
 Authorization-10.582
 Declaration-10.569
 quotation-10.342
Token in
Token ID287
Feature activation-9.57e-01

Pos logit contributions
 newcomers+14.175
 climbers+14.046
 youngsters+13.876
 passers+13.715
 commuters+13.450
Neg logit contributions
 violation-10.446
 ['-9.729
 quotation-9.187
 Authorization-9.135
 Declaration-9.072
Token the
Token ID262
Feature activation-2.11e+00

Pos logit contributions
 newcomers+8.548
 climbers+8.466
 youngsters+8.347
 passers+8.287
 commuters+8.161
Neg logit contributions
 violation-5.822
 ['-5.494
 Declaration-5.070
 quotation-5.058
 coercion-5.021
Token front
Token ID2166
Feature activation-1.58e+00

Pos logit contributions
 newcomers+16.227
 climbers+16.189
 youngsters+16.047
 passers+15.599
 commuters+15.159
Neg logit contributions
 violation-12.419
 ['-11.880
 quotation-11.136
 Declaration-10.898
 coercion-10.865
Token destabil
Token ID27397
Feature activation-1.20e+00

Pos logit contributions
 climbers+12.787
 newcomers+12.745
 passers+12.418
 upscale+12.398
 youngsters+12.231
Neg logit contributions
 violation-12.492
 ['-11.846
 Declaration-11.739
 Authorization-11.565
 quotation-11.392
Tokenize
Token ID1096
Feature activation-8.09e-01

Pos logit contributions
 newcomers+4.930
 climbers+4.818
 passers+4.636
 onlook+4.480
 youngsters+4.459
Neg logit contributions
 violation-12.680
 ['-12.060
 quotation-11.946
 Declaration-11.752
 Authorization-11.686
Token the
Token ID262
Feature activation-1.15e+00

Pos logit contributions
 newcomers+7.472
 climbers+7.432
 passers+7.338
 commuters+7.230
 youngsters+7.207
Neg logit contributions
 violation-4.745
 ['-4.478
 Authorization-4.157
 quotation-4.125
 Declaration-4.088
Token grid
Token ID10706
Feature activation-4.12e-01

Pos logit contributions
 newcomers+8.697
 climbers+8.634
 passers+8.399
 youngsters+8.315
 commuters+8.256
Neg logit contributions
 violation-8.171
 ['-7.886
 quotation-7.312
 Authorization-7.308
 coercion-7.122
Token could
Token ID714
Feature activation-2.61e+00

Pos logit contributions
 newcomers+3.445
 climbers+3.435
 passers+3.383
 commuters+3.341
 youngsters+3.339
Neg logit contributions
 violation-2.872
 ['-2.701
 Declaration-2.567
 quotation-2.552
 Authorization-2.525
Token help
Token ID1037
Feature activation-3.65e+00

Pos logit contributions
 climbers+18.093
 newcomers+17.904
 pave+17.802
 passers+17.716
 upscale+17.482
Neg logit contributions
 violation-15.135
 Declaration-14.444
 ['-14.377
 Authorization-13.792
 quotation-13.701
Token that
Token ID326
Feature activation-1.75e+00

Pos logit contributions
 newcomers+26.412
 climbers+26.067
 commuters+25.680
 youngsters+25.264
 passers+25.185
Neg logit contributions
 violation-15.143
 ['-15.010
 Authorization-13.510
 quotation-13.424
 Declaration-13.216
Token jump
Token ID4391
Feature activation-2.90e+00

Pos logit contributions
 newcomers+14.738
 climbers+14.569
 youngsters+14.291
 commuters+14.092
 passers+14.084
Neg logit contributions
 violation-9.635
 ['-9.165
 Authorization-8.396
 Declaration-8.339
 quotation-8.280
Token happen
Token ID1645
Feature activation-1.83e+00

Pos logit contributions
 climbers+21.013
 newcomers+20.350
 passers+19.879
 youngsters+19.876
 commuters+19.729
Neg logit contributions
 violation-14.655
 Declaration-13.887
 ['-13.510
 Authorization-13.076
 coercion-13.052
Token.
Token ID13
Feature activation-8.83e-01

Pos logit contributions
 newcomers+12.896
 climbers+12.803
 youngsters+12.627
 passers+12.623
 commuters+12.399
Neg logit contributions
 violation-12.691
 ['-11.692
 Authorization-11.519
 Declaration-11.508
 quotation-11.491
Token\n
Token ID198
Feature activation+1.84e-01

Pos logit contributions
 newcomers+6.278
 climbers+6.168
 youngsters+5.985
 commuters+5.946
 passers+5.911
Neg logit contributions
 violation-7.060
 coercion-6.342
 ['-6.297
 memorandum-6.239
 quotation-6.234
Token the
Token ID262
Feature activation-2.57e+00

Pos logit contributions
 newcomers+12.862
 passers+12.671
 youngsters+12.670
 climbers+12.597
 commuters+12.081
Neg logit contributions
 violation-9.001
 ['-8.532
 quotation-7.997
 Authorization-7.960
 Declaration-7.872
Token quarterbacks
Token ID23643
Feature activation-1.10e+00

Pos logit contributions
 newcomers+19.085
 youngsters+18.815
 passers+18.468
 climbers+18.203
 commuters+17.037
Neg logit contributions
 violation-14.044
 ['-13.803
 quotation-12.886
 Authorization-12.712
 Declaration-12.684
Token,
Token ID11
Feature activation-8.72e-01

Pos logit contributions
 newcomers+8.516
 climbers+8.495
 passers+8.429
 youngsters+8.343
 commuters+8.097
Neg logit contributions
 violation-7.868
 ['-7.306
 Declaration-7.086
 quotation-7.077
 Authorization-7.069
Token who
Token ID508
Feature activation-2.67e+00

Pos logit contributions
 newcomers+7.849
 climbers+7.824
 passers+7.782
 youngsters+7.662
 commuters+7.496
Neg logit contributions
 violation-5.443
 ['-5.091
 quotation-4.777
 Declaration-4.774
 Authorization-4.744
Token continued
Token ID3767
Feature activation-2.08e+00

Pos logit contributions
 climbers+20.383
 newcomers+20.381
 youngsters+20.273
 passers+20.122
 commuters+19.405
Neg logit contributions
 violation-13.164
 ['-12.126
 Authorization-11.836
 Declaration-11.765
 coercion-11.701
Token to
Token ID284
Feature activation-3.64e+00

Pos logit contributions
 newcomers+14.856
 climbers+14.815
 passers+14.717
 youngsters+14.465
 commuters+14.099
Neg logit contributions
 violation-13.496
 ['-13.135
 Authorization-12.551
 Declaration-12.403
 quotation-12.200
Token split
Token ID6626
Feature activation-1.47e+00

Pos logit contributions
 climbers+23.369
 youngsters+22.832
 newcomers+22.545
 upscale+22.313
 passers+22.298
Neg logit contributions
 violation-17.104
 Declaration-16.788
 Authorization-16.752
 ['-16.566
 coercion-15.843
Token first
Token ID717
Feature activation-8.07e-01

Pos logit contributions
 newcomers+11.614
 passers+11.599
 climbers+11.437
 youngsters+11.334
 commuters+11.236
Neg logit contributions
 violation-9.594
 ['-9.164
 Authorization-8.787
 quotation-8.503
 Declaration-8.462
Token-
Token ID12
Feature activation-9.05e-02

Pos logit contributions
 passers+6.206
 climbers+6.140
 newcomers+6.128
 youngsters+6.020
 commuters+5.870
Neg logit contributions
 violation-5.972
 ['-5.633
 quotation-5.412
 Authorization-5.383
 Declaration-5.377
Tokenteam
Token ID15097
Feature activation-4.31e-01

Pos logit contributions
 climbers+0.666
 newcomers+0.662
 passers+0.660
 youngsters+0.646
 commuters+0.636
Neg logit contributions
 violation-0.742
 ['-0.695
 quotation-0.670
 Declaration-0.664
 coercion-0.659
Token repet
Token ID46152
Feature activation-4.95e-01

Pos logit contributions
 passers+2.602
 newcomers+2.601
 climbers+2.577
 youngsters+2.501
 commuters+2.455
Neg logit contributions
 violation-4.038
 ['-3.839
 Declaration-3.710
 quotation-3.705
 memorandum-3.655
Token-
Token ID12
Feature activation-7.45e-01

Pos logit contributions
 climbers+8.557
 passers+8.438
 newcomers+8.378
 youngsters+8.193
 commuters+7.906
Neg logit contributions
 violation-9.300
 ['-8.756
 quotation-8.362
 Declaration-8.316
 Authorization-8.282
Tokenback
Token ID1891
Feature activation-1.43e+00

Pos logit contributions
 climbers+3.484
 passers+3.430
 newcomers+3.424
 youngsters+3.345
 commuters+3.236
Neg logit contributions
 violation-7.891
 ['-7.508
 quotation-7.371
 Declaration-7.330
 coercion-7.245
Token pairing
Token ID27356
Feature activation-1.35e+00

Pos logit contributions
 climbers+11.157
 passers+11.156
 newcomers+10.990
 youngsters+10.703
 commuters+10.355
Neg logit contributions
 violation-9.971
 ['-9.231
 Declaration-9.004
 Authorization-8.947
 quotation-8.922
Token,
Token ID11
Feature activation-6.77e-01

Pos logit contributions
 climbers+10.107
 newcomers+10.072
 passers+10.043
 youngsters+9.777
 commuters+9.504
Neg logit contributions
 violation-9.896
 ['-9.178
 Authorization-8.878
 quotation-8.853
 Declaration-8.840
Token they
Token ID484
Feature activation-1.83e+00

Pos logit contributions
 newcomers+5.784
 climbers+5.726
 passers+5.675
 youngsters+5.624
 commuters+5.476
Neg logit contributions
 violation-4.606
 ['-4.275
 quotation-4.074
 memorandum-4.031
 Declaration-4.030
Token helped
Token ID4193
Feature activation-3.63e+00

Pos logit contributions
 climbers+15.515
 newcomers+15.121
 youngsters+14.945
 passers+14.906
 commuters+14.444
Neg logit contributions
 violation-10.316
 Declaration-9.470
 ['-9.348
 Authorization-9.210
 quotation-9.200
Token increase
Token ID2620
Feature activation-2.10e+00

Pos logit contributions
 newcomers+25.481
 youngsters+24.949
 climbers+24.911
 passers+24.709
 pave+23.847
Neg logit contributions
 violation-15.201
 ['-14.731
 Authorization-14.320
 quotation-14.149
 Declaration-13.572
Token the
Token ID262
Feature activation-2.34e+00

Pos logit contributions
 newcomers+16.256
 passers+16.210
 climbers+16.014
 youngsters+15.991
 onlook+15.240
Neg logit contributions
 violation-11.436
 ['-11.119
 Authorization-10.815
 Declaration-10.748
 quotation-10.551
Token pressure
Token ID3833
Feature activation-1.31e+00

Pos logit contributions
 newcomers+17.876
 climbers+17.792
 passers+17.736
 youngsters+17.725
 onlook+16.652
Neg logit contributions
 violation-12.206
 ['-11.888
 Declaration-11.463
 Authorization-11.361
 quotation-11.214
Token on
Token ID319
Feature activation-1.16e+00

Pos logit contributions
 climbers+10.342
 newcomers+10.325
 passers+10.206
 youngsters+10.002
 commuters+9.869
Neg logit contributions
 violation-8.937
 ['-8.478
 quotation-8.114
 Authorization-8.092
 Declaration-8.054
Token the
Token ID262
Feature activation-2.22e+00

Pos logit contributions
 newcomers+10.306
 passers+10.132
 climbers+10.072
 youngsters+10.013
 commuters+9.690
Neg logit contributions
 violation-6.954
 ['-6.642
 quotation-6.267
 Declaration-6.187
 memorandum-6.134
Token creek
Token ID39459
Feature activation-1.51e+00

Pos logit contributions
 newcomers+16.511
 climbers+16.511
 youngsters+16.059
 passers+16.007
 commuters+15.952
Neg logit contributions
 violation-11.713
 ['-11.704
 coercion-10.601
 quotation-10.580
 Authorization-10.551
Token.
Token ID13
Feature activation-2.39e+00

Pos logit contributions
 climbers+12.697
 passers+12.539
 newcomers+12.505
 commuters+12.204
 youngsters+12.092
Neg logit contributions
 violation-9.036
 ['-8.620
 Declaration-8.171
 coercion-8.074
 Authorization-8.039
Token The
Token ID383
Feature activation-2.22e+00

Pos logit contributions
 newcomers+15.958
 climbers+15.912
 commuters+15.578
 passers+15.161
 youngsters+14.969
Neg logit contributions
 violation-15.303
 coercion-14.440
 ['-13.727
 quotation-13.693
 memorandum-13.318
Token wider
Token ID10595
Feature activation-1.82e+00

Pos logit contributions
 climbers+18.300
 newcomers+18.287
 commuters+17.617
 passers+17.597
 youngsters+17.546
Neg logit contributions
 ['-11.432
 violation-11.128
 coercion-10.074
 quotation-9.845
 Authorization-9.812
Token approaches
Token ID10581
Feature activation-1.44e+00

Pos logit contributions
 climbers+15.660
 passers+15.208
 newcomers+15.115
 commuters+14.964
 youngsters+14.688
Neg logit contributions
 violation-9.832
 ['-9.486
 Authorization-8.815
 coercion-8.753
 quotation-8.639
Token will
Token ID481
Feature activation-3.61e+00

Pos logit contributions
 climbers+12.622
 newcomers+12.336
 passers+12.248
 commuters+12.159
 youngsters+12.040
Neg logit contributions
 violation-8.233
 ['-7.819
 Declaration-7.620
 coercion-7.381
 quotation-7.326
Token require
Token ID2421
Feature activation-1.80e+00

Pos logit contributions
 climbers+23.107
 pave+22.628
 upscale+22.563
 newcomers+22.167
 passers+22.119
Neg logit contributions
 Declaration-17.990
 violation-17.602
 ['-16.830
 coercion-16.799
 Authorization-16.631
Token new
Token ID649
Feature activation-2.27e+00

Pos logit contributions
 climbers+15.209
 newcomers+14.886
 commuters+14.726
 passers+14.631
 youngsters+14.281
Neg logit contributions
 ['-10.441
 violation-10.331
 Declaration-9.353
 Authorization-9.168
 coercion-9.102
Token retaining
Token ID26645
Feature activation-5.71e-02

Pos logit contributions
 climbers+18.227
 passers+17.327
 commuters+17.218
 newcomers+17.090
 roofs+17.003
Neg logit contributions
 ['-12.420
 violation-12.315
 Declaration-11.037
 Authorization-10.891
 coercion-10.799
Token walls
Token ID7714
Feature activation-1.90e+00

Pos logit contributions
 climbers+0.302
 newcomers+0.300
 passers+0.293
 youngsters+0.285
 commuters+0.282
Neg logit contributions
 violation-0.589
 ['-0.561
 quotation-0.539
 Declaration-0.538
 coercion-0.534
Token,
Token ID11
Feature activation-1.43e+00

Pos logit contributions
 climbers+15.631
 newcomers+15.201
 passers+15.116
 commuters+14.877
 youngsters+14.714
Neg logit contributions
 violation-10.634
 ['-9.902
 Declaration-9.837
 coercion-9.551
 Authorization-9.407
Token were
Token ID547
Feature activation+1.04e-01

Pos logit contributions
 climbers+4.300
 newcomers+4.266
 passers+4.178
 commuters+4.129
 youngsters+4.093
Neg logit contributions
 violation-4.148
 ['-3.911
 quotation-3.697
 coercion-3.685
 Declaration-3.663
Token hefty
Token ID29784
Feature activation-1.85e+00

Pos logit contributions
 violation+0.685
 ['+0.632
 quotation+0.593
 Declaration+0.590
 coercion+0.585
Neg logit contributions
 climbers-0.934
 newcomers-0.931
 passers-0.917
 youngsters-0.903
 commuters-0.899
Token investments
Token ID11115
Feature activation-1.04e+00

Pos logit contributions
 newcomers+14.196
 climbers+14.073
 commuters+13.496
 passers+13.493
 youngsters+13.091
Neg logit contributions
 violation-11.527
 ['-10.948
 Declaration-10.432
 Authorization-10.305
 quotation-10.277
Token expected
Token ID2938
Feature activation-1.45e+00

Pos logit contributions
 newcomers+8.902
 climbers+8.862
 passers+8.656
 commuters+8.606
 youngsters+8.514
Neg logit contributions
 violation-6.682
 ['-6.168
 quotation-5.928
 Declaration-5.911
 coercion-5.825
Token to
Token ID284
Feature activation-2.61e+00

Pos logit contributions
 newcomers+10.472
 climbers+10.398
 passers+10.242
 commuters+10.125
 youngsters+10.052
Neg logit contributions
 violation-10.566
 ['-10.002
 Declaration-9.678
 quotation-9.575
 coercion-9.332
Token help
Token ID1037
Feature activation-3.61e+00

Pos logit contributions
 climbers+16.758
 newcomers+16.237
 upscale+16.221
 passers+15.926
 pave+15.888
Neg logit contributions
 violation-16.737
 ['-16.292
 Declaration-16.087
 Authorization-15.712
 coercion-15.561
Token the
Token ID262
Feature activation-2.29e+00

Pos logit contributions
 newcomers+24.759
 climbers+24.659
 commuters+24.240
 residents+23.666
 passers+23.658
Neg logit contributions
 violation-16.308
 ['-16.015
 Authorization-15.233
 quotation-14.738
 coercion-14.347
Token plants
Token ID6134
Feature activation-3.05e+00

Pos logit contributions
 newcomers+17.498
 climbers+17.287
 commuters+16.553
 passers+16.549
 youngsters+16.507
Neg logit contributions
 violation-13.008
 ['-12.797
 Authorization-11.987
 coercion-11.677
 quotation-11.565
Token remain
Token ID3520
Feature activation-2.33e+00

Pos logit contributions
 climbers+20.954
 newcomers+20.653
 upscale+20.414
 youngsters+20.114
 passers+19.991
Neg logit contributions
 violation-15.981
 Declaration-14.718
 Authorization-14.668
 ['-14.550
 coercion-14.471
Token in
Token ID287
Feature activation+1.58e-01

Pos logit contributions
 climbers+15.868
 newcomers+15.637
 commuters+15.403
 passers+15.385
 upscale+15.261
Neg logit contributions
 violation-15.037
 ['-14.299
 Declaration-13.934
 Authorization-13.813
 coercion-13.515
Token operation
Token ID4905
Feature activation-1.06e+00

Pos logit contributions
 violation+1.078
 ['+0.999
 quotation+0.939
 Declaration+0.936
 coercion+0.928
Neg logit contributions
 climbers-1.388
 newcomers-1.383
 passers-1.362
 youngsters-1.341
 commuters-1.335
Tokenchecked
Token ID26752
Feature activation+3.66e-01

Pos logit contributions
 violation+10.505
 ['+9.783
 Declaration+9.302
 quotation+9.269
 coercion+9.251
Neg logit contributions
 climbers-10.556
 newcomers-10.392
 passers-10.321
 youngsters-10.158
 commuters-10.007
Token drivers
Token ID6643
Feature activation+6.41e-01

Pos logit contributions
 violation+2.997
 ['+2.810
 quotation+2.673
 Declaration+2.663
 coercion+2.644
Neg logit contributions
 climbers-2.708
 newcomers-2.699
 passers-2.651
 youngsters-2.602
 commuters-2.585
Token and
Token ID290
Feature activation+4.38e-01

Pos logit contributions
 violation+4.804
 ['+4.462
 quotation+4.205
 Declaration+4.165
 coercion+4.149
Neg logit contributions
 climbers-5.131
 newcomers-5.114
 passers-5.053
 youngsters-4.931
 commuters-4.923
Token utilities
Token ID20081
Feature activation+4.65e-01

Pos logit contributions
 violation+4.088
 ['+3.858
 quotation+3.701
 Declaration+3.684
 coercion+3.662
Neg logit contributions
 climbers-2.741
 newcomers-2.724
 passers-2.671
 youngsters-2.612
 commuters-2.592
Token will
Token ID481
Feature activation-1.95e+00

Pos logit contributions
 violation+3.365
 ['+3.118
 quotation+2.936
 Declaration+2.916
 coercion+2.900
Neg logit contributions
 climbers-3.865
 newcomers-3.851
 passers-3.799
 youngsters-3.727
 commuters-3.713
Token help
Token ID1037
Feature activation-3.59e+00

Pos logit contributions
 climbers+14.303
 newcomers+14.188
 passers+13.914
 youngsters+13.739
 commuters+13.648
Neg logit contributions
 violation-12.220
 Declaration-11.771
 ['-11.639
 quotation-11.410
 Authorization-11.239
Token with
Token ID351
Feature activation-8.26e-01

Pos logit contributions
 newcomers+26.583
 climbers+25.747
 youngsters+25.198
 passers+24.880
 commuters+24.496
Neg logit contributions
 ['-14.967
 violation-14.900
 Authorization-13.978
 memorandum-13.797
 quotation-13.617
Token that
Token ID326
Feature activation-1.00e+00

Pos logit contributions
 newcomers+8.101
 climbers+7.961
 passers+7.794
 youngsters+7.774
 commuters+7.702
Neg logit contributions
 violation-4.380
 ['-4.283
 quotation-3.827
 memorandum-3.802
 Declaration-3.794
Token.
Token ID13
Feature activation-7.06e-01

Pos logit contributions
 newcomers+7.044
 climbers+7.012
 youngsters+6.771
 passers+6.771
 commuters+6.596
Neg logit contributions
 violation-7.762
 ['-7.533
 Declaration-7.168
 quotation-7.094
 memorandum-7.044
Token\n
Token ID198
Feature activation-9.57e-02

Pos logit contributions
 newcomers+4.802
 climbers+4.698
 youngsters+4.554
 passers+4.517
 commuters+4.466
Neg logit contributions
 violation-5.989
 ['-5.507
 quotation-5.398
 coercion-5.396
 memorandum-5.377
Token\n
Token ID198
Feature activation-4.07e-01

Pos logit contributions
 climbers+0.655
 newcomers+0.655
 passers+0.640
 youngsters+0.630
 commuters+0.624
Neg logit contributions
 violation-0.833
 ['-0.780
 quotation-0.749
 Declaration-0.745
 coercion-0.743
Token attempted
Token ID7482
Feature activation+5.90e-01

Pos logit contributions
 violation+5.307
 ['+4.902
 quotation+4.663
 Declaration+4.604
 memorandum+4.573
Neg logit contributions
 climbers-6.293
 newcomers-6.260
 passers-6.173
 youngsters-6.059
 commuters-6.024
Token to
Token ID284
Feature activation-5.98e-01

Pos logit contributions
 violation+4.646
 ['+4.201
 quotation+4.138
 coercion+4.075
 memorandum+4.036
Neg logit contributions
 newcomers-4.454
 climbers-4.450
 passers-4.326
 youngsters-4.319
 commuters-4.244
Token defend
Token ID4404
Feature activation+4.13e-01

Pos logit contributions
 climbers+4.296
 newcomers+4.277
 passers+4.203
 youngsters+4.122
 commuters+4.102
Neg logit contributions
 violation-4.864
 ['-4.632
 Declaration-4.402
 Authorization-4.335
 coercion-4.324
Token the
Token ID262
Feature activation-1.19e-01

Pos logit contributions
 violation+2.645
 ['+2.431
 quotation+2.280
 Declaration+2.267
 coercion+2.246
Neg logit contributions
 climbers-3.799
 newcomers-3.787
 passers-3.733
 youngsters-3.679
 commuters-3.659
Token remarks
Token ID10252
Feature activation+7.93e-01

Pos logit contributions
 newcomers+1.071
 climbers+1.057
 passers+1.038
 youngsters+1.028
 commuters+1.018
Neg logit contributions
 violation-0.777
 ['-0.741
 Declaration-0.673
 coercion-0.672
 quotation-0.672
Token he
Token ID339
Feature activation+2.47e+00

Pos logit contributions
 violation+6.497
 ['+6.091
 quotation+5.812
 Declaration+5.725
 coercion+5.709
Neg logit contributions
 climbers-5.751
 newcomers-5.663
 passers-5.543
 youngsters-5.474
 commuters-5.417
Token made
Token ID925
Feature activation+1.27e+00

Pos logit contributions
 violation+15.251
 ['+14.323
 quotation+13.950
 plagiar+13.756
 paragraph+13.117
Neg logit contributions
 climbers-18.326
 newcomers-17.953
 passers-17.618
 youngsters-17.440
heast-17.270
Token about
Token ID546
Feature activation+8.41e-01

Pos logit contributions
 violation+9.092
 ['+8.442
 quotation+8.012
 coercion+7.874
 paragraph+7.846
Neg logit contributions
 climbers-10.003
 newcomers-9.790
 passers-9.608
 youngsters-9.491
 commuters-9.456
Token his
Token ID465
Feature activation+9.22e-01

Pos logit contributions
 violation+6.688
 ['+6.213
 quotation+5.966
 coercion+5.916
 Declaration+5.914
Neg logit contributions
 climbers-6.262
 newcomers-6.181
 passers-6.082
 youngsters-5.969
 commuters-5.933
Token fellow
Token ID5891
Feature activation+9.88e-02

Pos logit contributions
 violation+8.062
 ['+7.480
 quotation+7.267
 Declaration+7.164
 coercion+7.124
Neg logit contributions
 climbers-6.168
 newcomers-6.059
 passers-5.988
 youngsters-5.850
 commuters-5.828
Token GOP
Token ID6796
Feature activation+5.64e-01

Pos logit contributions
 violation+0.824
 ['+0.775
 quotation+0.736
 Declaration+0.734
 coercion+0.731
Neg logit contributions
 climbers-0.720
 newcomers-0.720
 passers-0.705
 youngsters-0.691
 commuters-0.687
Token you
Token ID345
Feature activation+8.81e-01

Pos logit contributions
 violation+3.028
 ['+2.795
 quotation+2.606
 Declaration+2.580
 coercion+2.559
Neg logit contributions
 climbers-4.493
 newcomers-4.483
 passers-4.421
 youngsters-4.358
 commuters-4.334
Token how
Token ID703
Feature activation-8.61e-01

Pos logit contributions
 violation+5.734
 ['+5.328
 quotation+4.963
 Declaration+4.937
 coercion+4.883
Neg logit contributions
 climbers-7.774
 newcomers-7.767
 passers-7.667
 youngsters-7.513
 commuters-7.488
Token much
Token ID881
Feature activation-6.23e-01

Pos logit contributions
 climbers+6.571
 newcomers+6.521
 passers+6.408
 youngsters+6.285
 commuters+6.252
Neg logit contributions
 violation-6.374
 ['-5.951
 Declaration-5.738
 Authorization-5.714
 quotation-5.615
Token part
Token ID636
Feature activation+6.50e-01

Pos logit contributions
 climbers+5.498
 newcomers+5.450
 passers+5.332
 youngsters+5.296
 commuters+5.271
Neg logit contributions
 violation-3.923
 ['-3.616
 Declaration-3.459
 Authorization-3.452
 quotation-3.392
Token of
Token ID286
Feature activation+1.52e+00

Pos logit contributions
 violation+4.788
 ['+4.475
 quotation+4.218
 Declaration+4.190
 coercion+4.162
Neg logit contributions
 climbers-5.316
 newcomers-5.295
 passers-5.227
 youngsters-5.127
 commuters-5.100
Token,
Token ID11
Feature activation+2.27e+00

Pos logit contributions
 violation+11.225
 ['+10.619
 quotation+9.950
 Declaration+9.841
 coercion+9.790
Neg logit contributions
 newcomers-11.333
 climbers-11.308
 passers-11.279
 youngsters-10.988
 commuters-10.866
Token you
Token ID345
Feature activation+6.07e-01

Pos logit contributions
 violation+17.489
 ['+16.463
 coercion+15.554
 quotation+15.526
 memorandum+15.487
Neg logit contributions
 newcomers-14.261
 passers-14.123
 climbers-13.972
 youngsters-13.529
heast-13.465
Token know
Token ID760
Feature activation-5.34e-01

Pos logit contributions
 violation+4.712
 ['+4.413
 quotation+4.170
 Declaration+4.147
 memorandum+4.116
Neg logit contributions
 climbers-4.682
 newcomers-4.673
 passers-4.608
 youngsters-4.494
 commuters-4.488
Token,
Token ID11
Feature activation+1.28e+00

Pos logit contributions
 climbers+3.504
 newcomers+3.474
 passers+3.395
 youngsters+3.349
 commuters+3.326
Neg logit contributions
 violation-4.743
 ['-4.452
 quotation-4.270
 Declaration-4.263
 coercion-4.235
Token the
Token ID262
Feature activation+3.71e-02

Pos logit contributions
 violation+10.150
 ['+9.533
 quotation+9.072
 coercion+8.977
 Declaration+8.928
Neg logit contributions
 newcomers-9.294
 climbers-9.261
 passers-9.129
 youngsters-8.916
 commuters-8.832
Token global
Token ID3298
Feature activation+2.77e-01

Pos logit contributions
 violation+0.280
 ['+0.262
 Declaration+0.248
 quotation+0.248
 coercion+0.246
Neg logit contributions
 climbers-0.296
 newcomers-0.295
 passers-0.288
 youngsters-0.285
 commuters-0.283
Token-
Token ID12
Feature activation+1.18e+00

Pos logit contributions
 violation+9.969
 ['+9.138
 quotation+8.997
 coercion+8.879
 Declaration+8.826
Neg logit contributions
 passers-11.431
 youngsters-11.136
 climbers-11.125
 newcomers-11.105
 commuters-10.865
Tokeneconomic
Token ID17079
Feature activation+2.10e+00

Pos logit contributions
 violation+8.733
 ['+8.090
 Declaration+7.744
 quotation+7.725
 coercion+7.695
Neg logit contributions
 passers-9.101
 newcomers-9.088
 climbers-9.082
 youngsters-8.827
 commuters-8.693
Token construct
Token ID5678
Feature activation+4.89e-01

Pos logit contributions
 violation+16.829
 quotation+15.210
 coercion+15.009
 ['+14.941
 Declaration+14.810
Neg logit contributions
 passers-13.017
zens-12.585
 climbers-12.566
 newcomers-12.552
 youngsters-12.455
Token than
Token ID621
Feature activation+1.37e+00

Pos logit contributions
 violation+4.344
 ['+4.084
 quotation+3.905
 Declaration+3.881
 coercion+3.864
Neg logit contributions
 climbers-3.266
 newcomers-3.250
 passers-3.204
 youngsters-3.131
 commuters-3.107
Token a
Token ID257
Feature activation+1.85e+00

Pos logit contributions
 violation+9.285
 ['+8.506
 quotation+8.174
 coercion+8.133
 Declaration+8.039
Neg logit contributions
 climbers-11.292
 newcomers-11.093
 passers-11.024
 youngsters-10.804
 commuters-10.732
Token statistical
Token ID13905
Feature activation+2.11e+00

Pos logit contributions
 violation+13.917
 ['+12.584
 quotation+12.468
 Declaration+12.295
 coercion+12.224
Neg logit contributions
 climbers-13.299
 newcomers-12.941
 passers-12.925
 youngsters-12.699
 commuters-12.576
Token category
Token ID6536
Feature activation+1.31e+00

Pos logit contributions
 violation+16.071
 coercion+14.149
 ['+14.109
 quotation+14.074
 memorandum+13.869
Neg logit contributions
 climbers-13.901
 passers-13.766
 newcomers-13.757
 commuters-13.609
 youngsters-13.546
Token which
Token ID543
Feature activation+4.33e-01

Pos logit contributions
 violation+11.514
 ['+10.636
 quotation+10.202
 coercion+10.140
 memorandum+10.009
Neg logit contributions
 climbers-8.276
 passers-8.238
 newcomers-8.099
 youngsters-7.972
 commuters-7.942
Token could
Token ID714
Feature activation-7.94e-01

Pos logit contributions
 violation+2.934
 ['+2.712
 quotation+2.552
 Declaration+2.534
 coercion+2.522
Neg logit contributions
 climbers-3.792
 newcomers-3.776
 passers-3.727
 youngsters-3.667
 commuters-3.645
Token capture
Token ID8006
Feature activation+1.92e-01

Pos logit contributions
 climbers+5.819
 newcomers+5.780
 passers+5.686
 youngsters+5.552
 commuters+5.532
Neg logit contributions
 violation-6.206
 ['-5.863
 Declaration-5.709
 Authorization-5.590
 quotation-5.559
Token the
Token ID262
Feature activation-8.75e-01

Pos logit contributions
 violation+1.387
 ['+1.293
 quotation+1.218
 Declaration+1.215
 coercion+1.202
Neg logit contributions
 newcomers-1.606
 climbers-1.605
 passers-1.581
 youngsters-1.554
 commuters-1.546
Token1
Token ID16
Feature activation-6.62e-01

Pos logit contributions
 violation+4.268
 ['+3.988
 Declaration+3.756
 quotation+3.755
 coercion+3.710
Neg logit contributions
 climbers-4.763
 newcomers-4.735
 passers-4.654
 youngsters-4.582
 commuters-4.559
Token.
Token ID13
Feature activation+8.34e-01

Pos logit contributions
 climbers+4.782
 newcomers+4.755
 passers+4.679
 youngsters+4.601
 commuters+4.551
Neg logit contributions
 violation-5.223
 ['-4.973
 coercion-4.728
 quotation-4.720
 memorandum-4.677
Token The
Token ID383
Feature activation-1.86e-01

Pos logit contributions
 violation+5.882
 ['+5.497
 Declaration+5.187
 quotation+5.150
 Authorization+5.072
Neg logit contributions
 climbers-7.041
 newcomers-6.974
 passers-6.889
 youngsters-6.769
 commuters-6.737
Token government
Token ID1230
Feature activation+2.44e+00

Pos logit contributions
 newcomers+1.583
 climbers+1.572
 passers+1.550
 youngsters+1.529
 commuters+1.515
Neg logit contributions
 violation-1.301
 ['-1.223
 quotation-1.141
 coercion-1.130
 memorandum-1.126
Token cannot
Token ID2314
Feature activation+4.91e-01

Pos logit contributions
 violation+19.218
 ['+17.459
 coercion+17.379
 quotation+16.702
 Declaration+16.636
Neg logit contributions
 newcomers-14.249
 climbers-14.155
 passers-14.011
heast-13.965
 youngsters-13.735
Token revoke
Token ID39041
Feature activation+2.83e+00

Pos logit contributions
 violation+4.279
 ['+4.025
 quotation+3.827
 Declaration+3.808
 coercion+3.799
Neg logit contributions
 climbers-3.372
 newcomers-3.358
 passers-3.297
 youngsters-3.234
 commuters-3.216
Token or
Token ID393
Feature activation+2.05e+00

Pos logit contributions
 violation+23.532
 ['+22.223
 Declaration+21.959
 Authorization+21.484
 quotation+21.320
Neg logit contributions
heast-13.107
heastern-13.045
 climbers-12.828
orthern-12.469
 newcomers-12.466
Token deny
Token ID10129
Feature activation+2.44e+00

Pos logit contributions
 violation+16.845
 ['+15.425
 coercion+14.896
 quotation+14.701
 paragraph+14.674
Neg logit contributions
 newcomers-11.871
 climbers-11.857
 passers-11.850
 youngsters-11.556
heast-11.395
Token a
Token ID257
Feature activation+3.27e+00

Pos logit contributions
 violation+20.793
 ['+19.651
 Declaration+19.027
 Authorization+18.867
 quotation+18.696
Neg logit contributions
heast-11.790
 climbers-11.754
heastern-11.631
 newcomers-11.387
orthern-11.374
Token permit
Token ID8749
Feature activation+1.97e+00

Pos logit contributions
 violation+25.646
 Declaration+23.358
 ['+23.247
 quotation+22.796
 memorandum+22.786
Neg logit contributions
 climbers-15.196
 newcomers-15.134
heast-15.058
 youngsters-14.697
 passers-14.213
Token based
Token ID1912
Feature activation+1.61e+00

Pos logit contributions
 violation+14.690
 ['+13.365
 memorandum+12.545
 quotation+12.483
 Declaration+12.434
Neg logit contributions
 climbers-13.986
 newcomers-13.904
 passers-13.579
 youngsters-13.517
 commuters-13.184
Token previous
Token ID2180
Feature activation+3.58e-01

Pos logit contributions
 climbers+1.940
 newcomers+1.933
 passers+1.905
 youngsters+1.877
 commuters+1.863
Neg logit contributions
 violation-1.681
 ['-1.589
 Declaration-1.476
 quotation-1.473
 coercion-1.468
Token releases
Token ID10050
Feature activation+3.23e-01

Pos logit contributions
 violation+2.474
 ['+2.291
 quotation+2.158
 Declaration+2.148
 coercion+2.129
Neg logit contributions
 climbers-3.105
 newcomers-3.096
 passers-3.049
 youngsters-3.001
 commuters-2.984
Token of
Token ID286
Feature activation+1.40e+00

Pos logit contributions
 violation+2.639
 ['+2.473
 quotation+2.350
 Declaration+2.342
 coercion+2.326
Neg logit contributions
 climbers-2.396
 newcomers-2.387
 passers-2.345
 youngsters-2.300
 commuters-2.285
Token leaked
Token ID14109
Feature activation+1.66e+00

Pos logit contributions
 violation+10.795
 ['+9.844
 Declaration+9.524
 quotation+9.514
 memorandum+9.485
Neg logit contributions
 newcomers-10.321
 climbers-10.236
 passers-10.171
 youngsters-9.843
 commuters-9.762
Token secret
Token ID3200
Feature activation+1.92e+00

Pos logit contributions
 violation+11.475
 ['+10.308
 memorandum+10.196
 quotation+10.139
 Declaration+10.127
Neg logit contributions
 newcomers-12.850
 climbers-12.734
 passers-12.714
heast-12.212
 youngsters-12.179
Token US
Token ID1294
Feature activation+2.68e+00

Pos logit contributions
 violation+13.358
 memorandum+12.044
 ['+12.037
 quotation+11.897
 Declaration+11.878
Neg logit contributions
 newcomers-14.171
 climbers-14.122
 passers-13.993
 youngsters-13.507
 commuters-13.458
Token government
Token ID1230
Feature activation+2.85e+00

Pos logit contributions
 violation+18.719
 memorandum+17.153
 Declaration+17.003
 coercion+16.793
 ['+16.727
Neg logit contributions
 newcomers-16.915
 passers-16.902
 climbers-16.603
heast-15.897
 commuters-15.777
Token documents
Token ID4963
Feature activation+1.02e+00

Pos logit contributions
 violation+19.736
 memorandum+18.341
 coercion+17.794
 quotation+17.729
 Declaration+17.535
Neg logit contributions
 newcomers-17.197
 passers-17.088
heast-16.966
 climbers-16.733
heastern-16.309
Token,
Token ID11
Feature activation+8.85e-01

Pos logit contributions
 violation+8.400
 ['+7.826
 quotation+7.446
 Declaration+7.428
 memorandum+7.407
Neg logit contributions
 newcomers-7.144
 climbers-7.120
 passers-6.986
 youngsters-6.789
 commuters-6.779
Token in
Token ID287
Feature activation+2.58e+00

Pos logit contributions
 violation+6.568
 ['+6.088
 quotation+5.781
 Declaration+5.744
 coercion+5.723
Neg logit contributions
 newcomers-7.075
 climbers-7.065
 passers-6.956
 youngsters-6.822
 commuters-6.775
Token July
Token ID2901
Feature activation+9.91e-01

Pos logit contributions
 violation+19.130
 ['+16.725
 paragraph+16.312
 quotation+16.118
 Declaration+16.011
Neg logit contributions
 passers-16.489
 newcomers-16.339
 climbers-16.333
heast-16.316
heastern-15.820
Token lead
Token ID1085
Feature activation+8.22e-02

Pos logit contributions
 violation+5.491
 ['+5.091
 quotation+4.737
 Declaration+4.723
 coercion+4.657
Neg logit contributions
 climbers-7.700
 newcomers-7.638
 passers-7.538
 youngsters-7.437
 commuters-7.394
Token actress
Token ID14549
Feature activation+6.29e-01

Pos logit contributions
 violation+0.674
 ['+0.630
 quotation+0.602
 Declaration+0.600
 coercion+0.599
Neg logit contributions
 newcomers-0.611
 climbers-0.608
 passers-0.595
 youngsters-0.585
 commuters-0.579
Token winner
Token ID8464
Feature activation+7.19e-01

Pos logit contributions
 violation+4.646
 ['+4.351
 Declaration+4.091
 quotation+4.084
 coercion+4.028
Neg logit contributions
 climbers-5.124
 newcomers-5.075
 passers-5.020
 youngsters-4.929
 commuters-4.902
Token Emma
Token ID18966
Feature activation+1.07e+00

Pos logit contributions
 violation+5.325
 ['+5.001
 Declaration+4.705
 quotation+4.698
 coercion+4.623
Neg logit contributions
 climbers-5.806
 newcomers-5.748
 passers-5.693
 youngsters-5.579
 commuters-5.569
Token Stone
Token ID8026
Feature activation+3.17e-01

Pos logit contributions
 violation+7.940
 ['+7.594
 Declaration+7.163
 quotation+7.123
 coercion+7.050
Neg logit contributions
 climbers-8.142
 newcomers-8.082
 passers-7.986
 youngsters-7.846
 commuters-7.816
Token and
Token ID290
Feature activation+7.71e-01

Pos logit contributions
 violation+2.038
 ['+1.876
 quotation+1.758
 Declaration+1.749
 coercion+1.733
Neg logit contributions
 climbers-2.900
 newcomers-2.892
 passers-2.851
 youngsters-2.808
 commuters-2.794
Token
Token ID564
Feature activation+4.91e-01

Pos logit contributions
 violation+5.024
 ['+4.646
 quotation+4.357
 Declaration+4.339
 coercion+4.276
Neg logit contributions
 climbers-6.941
 newcomers-6.889
 passers-6.796
 youngsters-6.703
 commuters-6.681
Token
Token ID250
Feature activation+9.16e-02

Pos logit contributions
 violation+5.044
 ['+4.788
 quotation+4.606
 Declaration+4.577
 coercion+4.567
Neg logit contributions
 climbers-2.599
 newcomers-2.565
 passers-2.505
 youngsters-2.443
 commuters-2.422
TokenLa
Token ID14772
Feature activation+2.76e-01

Pos logit contributions
 violation+0.648
 ['+0.594
 quotation+0.563
 Declaration+0.561
 coercion+0.558
Neg logit contributions
 newcomers-0.783
 climbers-0.779
 passers-0.769
 youngsters-0.756
 commuters-0.749
Token La
Token ID4689
Feature activation+6.25e-01

Pos logit contributions
 violation+2.207
 ['+2.066
 quotation+1.963
 Declaration+1.955
 coercion+1.941
Neg logit contributions
 climbers-2.094
 newcomers-2.087
 passers-2.050
 youngsters-2.014
 commuters-2.001
Token Land
Token ID6379
Feature activation+2.14e-01

Pos logit contributions
 violation+5.353
 ['+5.066
 quotation+4.828
 Declaration+4.819
 coercion+4.755
Neg logit contributions
 climbers-4.353
 newcomers-4.332
 passers-4.249
 youngsters-4.177
 commuters-4.145
Token year
Token ID614
Feature activation-6.26e-01

Pos logit contributions
 violation+3.789
 ['+3.510
 quotation+3.322
 Declaration+3.303
 coercion+3.277
Neg logit contributions
 climbers-4.382
 newcomers-4.367
 passers-4.305
 youngsters-4.232
 commuters-4.210
Token,
Token ID11
Feature activation-1.86e-01

Pos logit contributions
 newcomers+4.585
 climbers+4.578
 passers+4.479
 youngsters+4.406
 commuters+4.399
Neg logit contributions
 violation-5.007
 ['-4.712
 Declaration-4.475
 quotation-4.449
 coercion-4.421
Token on
Token ID319
Feature activation+1.02e+00

Pos logit contributions
 climbers+1.588
 newcomers+1.580
 passers+1.552
 youngsters+1.531
 commuters+1.529
Neg logit contributions
 violation-1.298
 ['-1.216
 Declaration-1.138
 quotation-1.137
 coercion-1.125
Token October
Token ID3267
Feature activation+1.68e+00

Pos logit contributions
 violation+7.057
 ['+6.445
 Declaration+6.124
 quotation+6.105
 memorandum+6.013
Neg logit contributions
 climbers-8.655
 newcomers-8.632
 passers-8.541
 youngsters-8.369
 commuters-8.338
Token 7
Token ID767
Feature activation-8.60e-01

Pos logit contributions
 violation+7.358
 ['+6.826
 Declaration+6.286
 quotation+5.967
 paragraph+5.954
Neg logit contributions
 climbers-17.078
 newcomers-16.813
 passers-16.789
 commuters-16.488
 youngsters-16.467
Token,
Token ID11
Feature activation+3.48e-01

Pos logit contributions
 climbers+5.585
 newcomers+5.519
 passers+5.415
 commuters+5.290
 youngsters+5.288
Neg logit contributions
 violation-7.554
 ['-7.143
 coercion-6.791
 quotation-6.779
 Declaration-6.771
Token he
Token ID339
Feature activation+6.30e-01

Pos logit contributions
 violation+2.676
 ['+2.503
 quotation+2.369
 Declaration+2.361
 coercion+2.344
Neg logit contributions
 climbers-2.757
 newcomers-2.747
 passers-2.701
 youngsters-2.655
 commuters-2.641
Token succeeded
Token ID14131
Feature activation-4.33e-01

Pos logit contributions
 violation+4.098
 ['+3.781
 quotation+3.545
 Declaration+3.515
 coercion+3.480
Neg logit contributions
 climbers-5.677
 newcomers-5.669
 passers-5.592
 youngsters-5.504
 commuters-5.467
Token in
Token ID287
Feature activation+5.69e-01

Pos logit contributions
 climbers+3.152
 newcomers+3.151
 passers+3.093
 youngsters+3.040
 commuters+3.022
Neg logit contributions
 violation-3.537
 ['-3.323
 Declaration-3.158
 quotation-3.144
 coercion-3.120
Token delivering
Token ID13630
Feature activation+2.78e-01

Pos logit contributions
 violation+4.184
 ['+3.862
 quotation+3.674
 Declaration+3.640
 coercion+3.634
Neg logit contributions
 climbers-4.659
 newcomers-4.651
 passers-4.580
 youngsters-4.508
 commuters-4.456
Token cargo
Token ID15892
Feature activation+2.14e-01

Pos logit contributions
 violation+1.765
 ['+1.623
 quotation+1.520
 Declaration+1.512
 coercion+1.497
Neg logit contributions
 climbers-2.562
 newcomers-2.555
 passers-2.519
 youngsters-2.481
 commuters-2.468
Token and
Token ID290
Feature activation-7.93e-02

Pos logit contributions
 violation+2.348
 ['+2.204
 quotation+2.099
 Declaration+2.089
 coercion+2.081
Neg logit contributions
 climbers-1.847
 newcomers-1.836
 passers-1.796
 youngsters-1.764
 commuters-1.750
Token vice
Token ID7927
Feature activation+5.59e-02

Pos logit contributions
 climbers+0.742
 newcomers+0.741
 passers+0.734
 youngsters+0.719
 commuters+0.718
Neg logit contributions
 violation-0.488
 ['-0.458
 quotation-0.425
 Declaration-0.424
 coercion-0.415
Token versa
Token ID25470
Feature activation-8.05e-01

Pos logit contributions
 violation+0.359
 ['+0.328
 quotation+0.307
 Declaration+0.306
 coercion+0.303
Neg logit contributions
 climbers-0.515
 newcomers-0.512
 passers-0.505
 youngsters-0.499
 commuters-0.494
Token that
Token ID326
Feature activation+1.06e+00

Pos logit contributions
 newcomers+5.367
 climbers+5.330
 passers+5.294
 youngsters+5.176
 commuters+5.144
Neg logit contributions
 violation-6.759
 ['-6.472
 Declaration-6.227
 quotation-6.205
 coercion-6.123
Token does
Token ID857
Feature activation+3.73e-01

Pos logit contributions
 violation+8.222
 ['+7.506
 coercion+7.163
 quotation+7.151
 memorandum+7.093
Neg logit contributions
 climbers-7.906
 newcomers-7.844
 passers-7.753
 youngsters-7.595
 commuters-7.472
Token not
Token ID407
Feature activation+2.29e-01

Pos logit contributions
 violation+2.362
 ['+2.095
 quotation+1.974
 coercion+1.950
 paragraph+1.937
Neg logit contributions
 climbers-3.483
 newcomers-3.456
 passers-3.415
 youngsters-3.375
 commuters-3.335
Token invoke
Token ID26342
Feature activation+1.93e+00

Pos logit contributions
 violation+1.934
 ['+1.806
 quotation+1.719
 Declaration+1.699
 coercion+1.697
Neg logit contributions
 climbers-1.649
 newcomers-1.646
 passers-1.621
 youngsters-1.587
 commuters-1.574
Token any
Token ID597
Feature activation+1.52e+00

Pos logit contributions
 violation+16.070
 coercion+14.462
 quotation+14.222
 ['+14.186
 paragraph+14.179
Neg logit contributions
 climbers-11.910
heast-11.564
heastern-11.464
 newcomers-11.376
 passers-11.278
Token biological
Token ID10685
Feature activation+2.74e+00

Pos logit contributions
 violation+12.436
 coercion+10.937
 ['+10.849
 quotation+10.783
 paragraph+10.656
Neg logit contributions
 climbers-10.461
 newcomers-10.141
 passers-9.966
 commuters-9.908
 youngsters-9.905
Token sex
Token ID1714
Feature activation+1.93e+00

Pos logit contributions
 violation+22.470
 coercion+20.525
 memorandum+19.594
 paragraph+19.521
 quotation+19.377
Neg logit contributions
heast-14.956
 climbers-14.619
 newcomers-14.294
heastern-14.275
orthern-13.927
Token or
Token ID393
Feature activation+1.74e+00

Pos logit contributions
 violation+15.778
 coercion+13.943
 ['+13.803
 Declaration+13.424
 quotation+13.348
Neg logit contributions
 climbers-12.477
 newcomers-12.256
 passers-11.890
 youngsters-11.888
heast-11.789
TokenSexual
Token ID49161
Feature activation+1.67e+00

Pos logit contributions
 violation+3.357
 ['+3.112
 quotation+2.918
 Declaration+2.916
 coercion+2.888
Neg logit contributions
 climbers-4.316
 newcomers-4.292
 passers-4.234
 youngsters-4.162
 commuters-4.150
Token orientation
Token ID12852
Feature activation+1.39e+00

Pos logit contributions
 violation+12.579
 ['+11.426
 coercion+11.197
 Declaration+11.029
 Authorization+10.819
Neg logit contributions
 climbers-12.094
 newcomers-11.863
 passers-11.777
 commuters-11.552
 youngsters-11.454
Token:
Token ID25
Feature activation+1.07e+00

Pos logit contributions
 violation+9.721
 ['+8.845
 Declaration+8.406
 quotation+8.369
 coercion+8.324
Neg logit contributions
 climbers-11.141
 newcomers-10.942
 passers-10.880
 commuters-10.706
 youngsters-10.665
Token Sexual
Token ID19536
Feature activation+1.38e+00

Pos logit contributions
 violation+7.489
 ['+7.123
 Declaration+6.716
 quotation+6.547
 Authorization+6.539
Neg logit contributions
 climbers-8.731
 newcomers-8.602
 passers-8.576
 commuters-8.407
 youngsters-8.278
Token orientation
Token ID12852
Feature activation+1.30e+00

Pos logit contributions
 violation+8.019
 ['+7.077
 coercion+6.803
 quotation+6.513
 Declaration+6.508
Neg logit contributions
 climbers-12.705
 newcomers-12.585
 passers-12.476
 youngsters-12.196
 commuters-12.182
Token is
Token ID318
Feature activation+1.90e+00

Pos logit contributions
 violation+8.310
 ['+7.611
 quotation+7.147
 coercion+7.072
 Declaration+6.977
Neg logit contributions
 climbers-11.182
 newcomers-11.089
 passers-11.058
 youngsters-10.803
 commuters-10.786
Token about
Token ID546
Feature activation+1.22e+00

Pos logit contributions
 violation+13.556
 ['+12.551
 coercion+12.025
 quotation+11.889
 Declaration+11.877
Neg logit contributions
 climbers-13.525
 newcomers-13.348
 passers-13.178
 commuters-13.057
zens-13.007
Token romantic
Token ID14348
Feature activation+9.47e-01

Pos logit contributions
 violation+9.277
 ['+8.606
 coercion+8.210
 quotation+8.065
 Declaration+7.997
Neg logit contributions
 climbers-9.221
 newcomers-9.131
 passers-8.982
 commuters-8.855
 youngsters-8.826
Token and
Token ID290
Feature activation+1.75e-01

Pos logit contributions
 violation+7.854
 ['+7.276
 coercion+6.956
 quotation+6.952
 Declaration+6.837
Neg logit contributions
 climbers-6.738
 newcomers-6.656
 passers-6.551
 commuters-6.475
 youngsters-6.434
Token sexual
Token ID3206
Feature activation+4.67e-01

Pos logit contributions
 violation+1.030
 ['+0.954
 Declaration+0.883
 quotation+0.881
 Authorization+0.862
Neg logit contributions
 climbers-1.689
 newcomers-1.684
 passers-1.662
 youngsters-1.641
 commuters-1.627
Token attraction
Token ID17416
Feature activation+1.30e-01

Pos logit contributions
 violation+4.109
 ['+3.854
 quotation+3.679
 coercion+3.661
 Declaration+3.658
Neg logit contributions
 climbers-3.181
 newcomers-3.163
 passers-3.104
 youngsters-3.040
 commuters-3.026
Token November
Token ID3389
Feature activation+6.59e-01

Pos logit contributions
 violation+12.692
 ['+11.141
 quotation+10.774
 paragraph+10.680
 Declaration+10.571
Neg logit contributions
 climbers-15.320
 newcomers-14.890
 passers-14.873
 commuters-14.572
 youngsters-14.442
Token,
Token ID11
Feature activation+7.43e-01

Pos logit contributions
 violation+5.753
 ['+5.407
 Declaration+5.123
 quotation+5.094
 coercion+5.091
Neg logit contributions
 climbers-4.463
 newcomers-4.399
 passers-4.333
 youngsters-4.234
 commuters-4.197
Token candidates
Token ID5871
Feature activation+8.98e-01

Pos logit contributions
 violation+4.492
 ['+4.103
 quotation+3.822
 Declaration+3.800
 coercion+3.767
Neg logit contributions
 climbers-6.978
 newcomers-6.917
 passers-6.846
 youngsters-6.732
 commuters-6.718
Token from
Token ID422
Feature activation+4.23e-01

Pos logit contributions
 violation+6.816
 ['+6.273
 quotation+5.981
 Declaration+5.922
 memorandum+5.921
Neg logit contributions
 climbers-6.791
 passers-6.755
 newcomers-6.739
 youngsters-6.567
 commuters-6.559
Token each
Token ID1123
Feature activation+4.21e-01

Pos logit contributions
 violation+3.417
 ['+3.187
 quotation+3.032
 Declaration+3.020
 coercion+2.997
Neg logit contributions
 climbers-3.171
 newcomers-3.142
 passers-3.096
 youngsters-3.044
 commuters-3.027
Token party
Token ID2151
Feature activation+6.83e-01

Pos logit contributions
 violation+3.143
 ['+2.921
 quotation+2.766
 Declaration+2.756
 coercion+2.732
Neg logit contributions
 climbers-3.417
 newcomers-3.399
 passers-3.354
 youngsters-3.299
 commuters-3.278
Token need
Token ID761
Feature activation-1.34e+00

Pos logit contributions
 violation+5.055
 ['+4.671
 quotation+4.425
 Declaration+4.419
 memorandum+4.393
Neg logit contributions
 climbers-5.454
 passers-5.412
 newcomers-5.409
 youngsters-5.270
 commuters-5.234
Token net
Token ID2010
Feature activation-7.56e-01

Pos logit contributions
 newcomers+10.312
 climbers+10.224
 passers+10.014
 youngsters+9.848
 commuters+9.835
Neg logit contributions
 violation-9.326
 ['-8.922
 quotation-8.274
 Declaration-8.155
 memorandum-8.046
Token positives
Token ID38548
Feature activation-1.29e+00

Pos logit contributions
 newcomers+5.992
 climbers+5.881
 passers+5.847
 commuters+5.669
 youngsters+5.649
Neg logit contributions
 violation-5.535
 ['-5.319
 quotation-4.958
 Authorization-4.928
 Declaration-4.908
Token of
Token ID286
Feature activation+2.48e-01

Pos logit contributions
 newcomers+10.646
 climbers+10.433
 passers+10.331
 youngsters+10.251
 commuters+10.127
Neg logit contributions
 violation-8.324
 ['-7.617
 Declaration-7.322
 quotation-7.288
 coercion-7.265
Token at
Token ID379
Feature activation+4.59e-01

Pos logit contributions
 violation+1.921
 ['+1.795
 quotation+1.702
 Declaration+1.695
 coercion+1.682
Neg logit contributions
 climbers-1.945
 newcomers-1.938
 passers-1.906
 youngsters-1.873
 commuters-1.861
Token presidential
Token ID4787
Feature activation+1.75e+00

Pos logit contributions
 violation+11.158
 ['+10.455
 quotation+10.091
 Declaration+10.079
 coercion+9.978
Neg logit contributions
 climbers-7.053
 newcomers-6.783
 passers-6.713
 commuters-6.675
 youngsters-6.612
Token contender
Token ID24190
Feature activation+1.70e+00

Pos logit contributions
 violation+15.265
 ['+14.162
 Declaration+13.848
 quotation+13.733
 memorandum+13.597
Neg logit contributions
 climbers-9.577
 commuters-9.176
heast-9.166
 youngsters-9.068
 passers-9.064
Token Donald
Token ID3759
Feature activation+1.93e+00

Pos logit contributions
 violation+10.428
 ['+9.824
 Declaration+9.126
 quotation+9.125
 coercion+8.902
Neg logit contributions
 climbers-14.184
 commuters-13.783
 passers-13.637
 newcomers-13.533
heast-13.438
Token Trump
Token ID1301
Feature activation+1.56e+00

Pos logit contributions
 violation+8.716
 ['+8.258
 Declaration+7.399
 Authorization+7.368
 quotation+7.284
Neg logit contributions
 climbers-18.571
 passers-18.376
 newcomers-17.931
 youngsters-17.861
 commuters-17.758
Token has
Token ID468
Feature activation+1.98e-01

Pos logit contributions
 violation+12.040
 ['+11.190
 quotation+10.966
 Declaration+10.587
 memorandum+10.543
Neg logit contributions
 climbers-10.778
 newcomers-10.749
 passers-10.633
 youngsters-10.284
 commuters-10.242
Token echoed
Token ID22211
Feature activation-6.45e-01

Pos logit contributions
 violation+1.415
 ['+1.313
 quotation+1.240
 Declaration+1.232
 coercion+1.223
Neg logit contributions
 climbers-1.678
 newcomers-1.673
 passers-1.647
 youngsters-1.620
 commuters-1.611
Token.
Token ID13
Feature activation-8.71e-01

Pos logit contributions
 newcomers+4.550
 climbers+4.547
 passers+4.509
 youngsters+4.421
 commuters+4.373
Neg logit contributions
 violation-5.274
 ['-4.985
 Declaration-4.725
 Authorization-4.706
 coercion-4.696
Token
Token ID198
Feature activation-4.02e-01

Pos logit contributions
 newcomers+5.881
 climbers+5.739
 passers+5.657
 youngsters+5.604
 commuters+5.513
Neg logit contributions
 violation-7.248
 ['-6.584
 coercion-6.525
 memorandum-6.496
 quotation-6.462
Token
Token ID198
Feature activation-5.46e-01

Pos logit contributions
 newcomers+2.559
 climbers+2.530
 passers+2.480
 youngsters+2.454
 commuters+2.424
Neg logit contributions
 violation-3.654
 ['-3.392
 quotation-3.303
 coercion-3.298
 Declaration-3.282
TokenThe
Token ID464
Feature activation-1.42e+00

Pos logit contributions
 newcomers+4.213
 climbers+4.148
 passers+4.129
 youngsters+4.055
 commuters+4.010
Neg logit contributions
 violation-4.180
 ['-3.829
 quotation-3.699
 memorandum-3.670
 coercion-3.668
Token longer
Token ID2392
Feature activation-3.66e-01

Pos logit contributions
 newcomers+12.195
 climbers+11.700
 youngsters+11.628
 passers+11.538
 commuters+11.265
Neg logit contributions
 violation-8.210
 ['-8.065
 coercion-7.238
 quotation-7.092
 Authorization-7.056
Token rent
Token ID5602
Feature activation-3.74e-01

Pos logit contributions
 newcomers+3.762
 climbers+3.732
 passers+3.675
 youngsters+3.656
 commuters+3.633
Neg logit contributions
 violation-3.314
 ['-3.126
 Declaration-2.947
 Authorization-2.922
 quotation-2.922
Token,
Token ID11
Feature activation+2.40e-02

Pos logit contributions
 climbers+3.322
 newcomers+3.315
 passers+3.239
 youngsters+3.212
 commuters+3.187
Neg logit contributions
 violation-2.459
 ['-2.309
 Declaration-2.158
 quotation-2.145
 memorandum-2.131
Token transport
Token ID4839
Feature activation-2.28e-01

Pos logit contributions
 violation+0.199
 ['+0.189
 quotation+0.179
 Declaration+0.178
 coercion+0.176
Neg logit contributions
 newcomers-0.174
 climbers-0.174
 passers-0.170
 youngsters-0.169
 commuters-0.167
Token and
Token ID290
Feature activation-1.02e+00

Pos logit contributions
 newcomers+2.025
 climbers+2.023
 passers+1.987
 commuters+1.963
 youngsters+1.961
Neg logit contributions
 violation-1.515
 ['-1.412
 Declaration-1.319
 quotation-1.316
 coercion-1.296
Token medical
Token ID3315
Feature activation+4.75e-01

Pos logit contributions
 newcomers+8.239
 youngsters+8.038
 climbers+8.024
 passers+7.966
 commuters+7.955
Neg logit contributions
 violation-7.040
 ['-6.826
 Declaration-6.285
 Authorization-6.269
 quotation-6.252
Token costs
Token ID3484
Feature activation-1.85e+00

Pos logit contributions
 violation+3.856
 ['+3.593
 quotation+3.439
 Declaration+3.419
 coercion+3.416
Neg logit contributions
 climbers-3.517
 newcomers-3.491
 passers-3.438
 youngsters-3.364
 commuters-3.350
Token,
Token ID11
Feature activation-3.42e-01

Pos logit contributions
 climbers+14.667
 newcomers+14.537
 youngsters+14.219
 commuters+14.061
 passers+14.022
Neg logit contributions
 violation-11.160
 ['-10.517
 Declaration-10.300
 Authorization-9.870
 quotation-9.838
Token rates
Token ID3965
Feature activation+2.66e-01

Pos logit contributions
 newcomers+3.004
 climbers+2.980
 passers+2.934
 commuters+2.917
 youngsters+2.913
Neg logit contributions
 violation-2.284
 ['-2.144
 Declaration-2.012
 quotation-2.002
 Authorization-1.983
Token and
Token ID290
Feature activation-8.35e-01

Pos logit contributions
 violation+1.835
 ['+1.684
 quotation+1.595
 Declaration+1.578
 coercion+1.573
Neg logit contributions
 climbers-2.301
 newcomers-2.296
 passers-2.266
 youngsters-2.227
 commuters-2.210
Token rising
Token ID7396
Feature activation-1.80e+00

Pos logit contributions
 newcomers+6.689
 climbers+6.574
 passers+6.514
 youngsters+6.504
 commuters+6.467
Neg logit contributions
 violation-5.899
 ['-5.693
 Declaration-5.279
 Authorization-5.229
 quotation-5.202
Token utility
Token ID10361
Feature activation-1.54e-01

Pos logit contributions
 newcomers+14.036
 youngsters+13.912
 commuters+13.782
 climbers+13.609
 passers+13.602
Neg logit contributions
 violation-11.016
 ['-10.813
 Declaration-9.974
 Authorization-9.826
 quotation-9.634
Token the
Token ID262
Feature activation-1.09e+00

Pos logit contributions
 violation+1.131
 ['+1.070
 quotation+1.017
 Declaration+1.017
 coercion+1.003
Neg logit contributions
 climbers-0.969
 newcomers-0.964
 passers-0.952
 youngsters-0.933
 commuters-0.931
Token public
Token ID1171
Feature activation-2.03e-01

Pos logit contributions
 passers+8.502
 newcomers+8.492
 commuters+8.460
 youngsters+8.382
 climbers+8.375
Neg logit contributions
 violation-7.541
 ['-7.248
 Authorization-6.940
 quotation-6.842
 Declaration-6.823
Token have
Token ID423
Feature activation-7.16e-01

Pos logit contributions
 climbers+1.658
 newcomers+1.657
 passers+1.640
 youngsters+1.611
 commuters+1.606
Neg logit contributions
 violation-1.499
 ['-1.404
 quotation-1.332
 Declaration-1.329
 coercion-1.312
Token been
Token ID587
Feature activation-3.98e-01

Pos logit contributions
 climbers+5.117
 newcomers+5.114
 passers+5.108
 youngsters+4.987
 commuters+4.967
Neg logit contributions
 violation-5.760
 ['-5.411
 quotation-5.280
 Declaration-5.222
 coercion-5.200
Token looted
Token ID49575
Feature activation-2.65e-01

Pos logit contributions
 climbers+3.066
 newcomers+3.038
 passers+3.026
 youngsters+2.983
 commuters+2.960
Neg logit contributions
 violation-3.026
 ['-2.860
 quotation-2.773
 Declaration-2.756
 coercion-2.717
Token by
Token ID416
Feature activation-3.33e-01

Pos logit contributions
 climbers+2.128
 newcomers+2.124
 passers+2.106
 youngsters+2.062
 commuters+2.049
Neg logit contributions
 violation-1.965
 ['-1.840
 quotation-1.760
 Declaration-1.754
 coercion-1.734
Token ag
Token ID556
Feature activation-5.05e-01

Pos logit contributions
 newcomers+2.556
 climbers+2.554
 passers+2.543
 youngsters+2.489
 commuters+2.477
Neg logit contributions
 violation-2.565
 ['-2.447
 Declaration-2.337
 quotation-2.331
 Authorization-2.287
Tokenit
Token ID270
Feature activation-2.57e-02

Pos logit contributions
 climbers+3.341
 newcomers+3.266
 passers+3.224
 youngsters+3.156
 commuters+3.126
Neg logit contributions
 violation-4.465
 ['-4.198
 Declaration-4.093
 quotation-4.059
 coercion-4.013
Tokenators
Token ID2024
Feature activation-7.66e-01

Pos logit contributions
 climbers+0.130
 newcomers+0.129
 passers+0.126
 youngsters+0.122
 commuters+0.121
Neg logit contributions
 violation-0.272
 ['-0.259
 quotation-0.249
 Declaration-0.248
 coercion-0.247
Token,
Token ID11
Feature activation-3.70e-01

Pos logit contributions
 newcomers+6.049
 climbers+6.048
 passers+6.044
 youngsters+5.893
 commuters+5.860
Neg logit contributions
 violation-5.505
 ['-5.140
 quotation-4.990
 Declaration-4.975
 coercion-4.892
Token vandal
Token ID26271
Feature activation-3.06e-01

Pos logit contributions
 climbers+3.031
 newcomers+3.017
 passers+3.011
 youngsters+2.950
 commuters+2.939
Neg logit contributions
 violation-2.665
 ['-2.535
 Declaration-2.412
 quotation-2.402
 Authorization-2.373
Token mix
Token ID5022
Feature activation-6.85e-01

Pos logit contributions
 climbers+0.201
 newcomers+0.201
 passers+0.198
 youngsters+0.195
 commuters+0.194
Neg logit contributions
 violation-0.149
 ['-0.138
 Declaration-0.130
 quotation-0.129
 coercion-0.128
Token to
Token ID284
Feature activation-8.20e-01

Pos logit contributions
 climbers+4.661
 newcomers+4.644
 passers+4.550
 youngsters+4.495
 commuters+4.408
Neg logit contributions
 violation-5.854
 ['-5.496
 quotation-5.284
 Declaration-5.284
 coercion-5.258
Token come
Token ID1282
Feature activation-9.11e-01

Pos logit contributions
 climbers+5.334
 newcomers+5.251
 passers+5.165
 youngsters+5.111
 commuters+4.992
Neg logit contributions
 violation-7.145
 ['-6.625
 Declaration-6.525
 quotation-6.463
 coercion-6.408
Token to
Token ID284
Feature activation+4.08e-01

Pos logit contributions
 climbers+5.422
 newcomers+5.395
 passers+5.296
 youngsters+5.195
 commuters+5.109
Neg logit contributions
 violation-8.414
 Declaration-7.809
 ['-7.742
 quotation-7.595
 Authorization-7.571
Token a
Token ID257
Feature activation+2.94e-01

Pos logit contributions
 violation+2.547
 ['+2.340
 quotation+2.188
 Declaration+2.173
 coercion+2.154
Neg logit contributions
 climbers-3.810
 newcomers-3.802
 passers-3.746
 youngsters-3.693
 commuters-3.675
Token boil
Token ID20667
Feature activation-8.72e-01

Pos logit contributions
 violation+3.281
 ['+3.130
 quotation+3.020
 Declaration+3.012
 coercion+2.997
Neg logit contributions
 climbers-1.303
 newcomers-1.296
 passers-1.256
 youngsters-1.218
 commuters-1.205
Token,
Token ID11
Feature activation-2.35e-01

Pos logit contributions
 climbers+6.102
 newcomers+6.083
 passers+5.979
 youngsters+5.878
 commuters+5.771
Neg logit contributions
 violation-7.138
 ['-6.669
 Declaration-6.447
 quotation-6.416
 coercion-6.371
Token we
Token ID356
Feature activation-8.53e-02

Pos logit contributions
 climbers+1.720
 newcomers+1.714
 passers+1.682
 youngsters+1.655
 commuters+1.633
Neg logit contributions
 violation-1.925
 ['-1.806
 Declaration-1.730
 quotation-1.714
 coercion-1.706
Token chop
Token ID30506
Feature activation-6.02e-01

Pos logit contributions
 climbers+0.699
 newcomers+0.696
 passers+0.684
 youngsters+0.674
 commuters+0.668
Neg logit contributions
 violation-0.631
 ['-0.587
 Declaration-0.556
 quotation-0.556
 coercion-0.550
Token a
Token ID257
Feature activation+3.84e-01

Pos logit contributions
 climbers+5.194
 newcomers+5.144
 passers+5.048
 youngsters+5.019
 commuters+4.917
Neg logit contributions
 violation-4.039
 ['-3.680
 Declaration-3.562
 quotation-3.507
 memorandum-3.492
Token handful
Token ID10089
Feature activation-1.42e+00

Pos logit contributions
 violation+3.409
 ['+3.212
 quotation+3.069
 Declaration+3.058
 coercion+3.037
Neg logit contributions
 climbers-2.587
 newcomers-2.576
 passers-2.526
 youngsters-2.474
 commuters-2.459
Token the
Token ID262
Feature activation-5.57e-01

Pos logit contributions
 climbers+5.104
 newcomers+5.092
 passers+4.966
 youngsters+4.847
 commuters+4.836
Neg logit contributions
 violation-5.762
 ['-5.445
 Declaration-5.214
 quotation-5.168
 memorandum-5.143
Token extra
Token ID3131
Feature activation-2.34e-01

Pos logit contributions
 newcomers+3.389
 climbers+3.373
 passers+3.259
 youngsters+3.232
 commuters+3.189
Neg logit contributions
 violation-5.193
 ['-4.958
 Declaration-4.727
 quotation-4.687
 Authorization-4.672
Token mile
Token ID10591
Feature activation-9.03e-01

Pos logit contributions
 climbers+1.637
 newcomers+1.621
 passers+1.597
 youngsters+1.564
 commuters+1.559
Neg logit contributions
 violation-1.984
 ['-1.876
 Declaration-1.801
 quotation-1.781
 Authorization-1.771
Token to
Token ID284
Feature activation-9.14e-01

Pos logit contributions
 climbers+6.008
 newcomers+5.990
 passers+5.841
 commuters+5.740
 youngsters+5.715
Neg logit contributions
 violation-7.575
 ['-7.071
 Declaration-6.963
 Authorization-6.829
 quotation-6.803
Token ensure
Token ID4155
Feature activation-1.81e-01

Pos logit contributions
 newcomers+5.795
 climbers+5.780
 passers+5.591
 commuters+5.486
 youngsters+5.466
Neg logit contributions
 violation-8.088
 ['-7.592
 Declaration-7.389
 Authorization-7.300
 coercion-7.254
Token that
Token ID326
Feature activation-4.75e-01

Pos logit contributions
 newcomers+1.895
 climbers+1.885
 passers+1.862
 youngsters+1.832
 commuters+1.829
Neg logit contributions
 violation-0.932
 ['-0.847
 Declaration-0.767
 memorandum-0.767
 quotation-0.766
Token your
Token ID534
Feature activation-6.23e-02

Pos logit contributions
 newcomers+4.735
 climbers+4.700
 passers+4.639
 youngsters+4.549
 commuters+4.537
Neg logit contributions
 violation-2.638
 ['-2.414
 memorandum-2.219
 quotation-2.212
 Declaration-2.209
Token experience
Token ID1998
Feature activation+8.85e-01

Pos logit contributions
 newcomers+0.500
 climbers+0.499
 passers+0.489
 youngsters+0.481
 commuters+0.478
Neg logit contributions
 violation-0.469
 ['-0.441
 quotation-0.414
 coercion-0.412
 Declaration-0.412
Token with
Token ID351
Feature activation+5.80e-01

Pos logit contributions
 violation+5.733
 ['+5.263
 quotation+4.941
 Declaration+4.857
 memorandum+4.839
Neg logit contributions
 climbers-7.756
 newcomers-7.754
 passers-7.638
 youngsters-7.595
 commuters-7.474
Token us
Token ID514
Feature activation+1.67e-01

Pos logit contributions
 violation+4.101
 ['+3.800
 quotation+3.591
 Declaration+3.584
 coercion+3.536
Neg logit contributions
 climbers-4.904
 newcomers-4.869
 passers-4.810
 youngsters-4.731
 commuters-4.706
Token is
Token ID318
Feature activation+1.53e-01

Pos logit contributions
 violation+0.843
 ['+0.756
 quotation+0.695
 Declaration+0.694
 coercion+0.683
Neg logit contributions
 climbers-1.762
 newcomers-1.756
 passers-1.734
 youngsters-1.711
 commuters-1.705
Token that
Token ID326
Feature activation-2.18e-01

Pos logit contributions
 violation+10.971
 ['+10.240
 quotation+9.801
 coercion+9.787
 Declaration+9.736
Neg logit contributions
 climbers-10.180
 newcomers-10.080
 passers-9.969
 youngsters-9.728
 commuters-9.663
Token will
Token ID481
Feature activation-1.94e+00

Pos logit contributions
 newcomers+2.110
 climbers+2.110
 passers+2.067
 youngsters+2.054
 commuters+2.033
Neg logit contributions
 violation-1.258
 ['-1.144
 quotation-1.057
 Authorization-1.054
 Declaration-1.053
Token no
Token ID645
Feature activation-6.64e-01

Pos logit contributions
 climbers+14.853
 newcomers+14.543
 passers+14.533
 youngsters+14.137
 commuters+14.092
Neg logit contributions
 violation-11.444
 Declaration-10.548
 ['-10.509
 Authorization-10.335
 quotation-10.157
Token-
Token ID12
Feature activation-5.95e-02

Pos logit contributions
 newcomers+5.280
 climbers+5.240
 passers+5.153
 youngsters+5.091
 commuters+5.043
Neg logit contributions
 violation-4.853
 ['-4.583
 Declaration-4.335
 quotation-4.319
 memorandum-4.307
Tokend
Token ID67
Feature activation-5.09e-02

Pos logit contributions
 newcomers+0.453
 climbers+0.452
 passers+0.443
 youngsters+0.435
 commuters+0.432
Neg logit contributions
 violation-0.474
 ['-0.444
 quotation-0.423
 Declaration-0.421
 coercion-0.415
Tokenoubt
Token ID47675
Feature activation-2.93e+00

Pos logit contributions
 climbers+0.369
 newcomers+0.367
 passers+0.361
 youngsters+0.354
 commuters+0.351
Neg logit contributions
 violation-0.423
 ['-0.397
 quotation-0.380
 Declaration-0.377
 coercion-0.376
Token reach
Token ID3151
Feature activation-1.45e+00

Pos logit contributions
 climbers+21.326
 newcomers+21.011
 passers+20.841
 youngsters+20.525
 pave+20.327
Neg logit contributions
 violation-13.140
 ['-12.835
 Declaration-12.773
 Authorization-12.494
 quotation-11.951
Token fever
Token ID17372
Feature activation-1.62e+00

Pos logit contributions
 newcomers+12.433
 climbers+12.193
 passers+12.129
 youngsters+11.960
 commuters+11.799
Neg logit contributions
 violation-8.581
 ['-8.000
 memorandum-7.591
 Declaration-7.519
 quotation-7.414
Token pitch
Token ID7078
Feature activation-1.13e+00

Pos logit contributions
 climbers+9.225
 newcomers+8.994
 passers+8.982
 commuters+8.883
 youngsters+8.706
Neg logit contributions
 violation-13.888
 ['-13.234
 Declaration-12.800
 Authorization-12.765
 quotation-12.684
Token around
Token ID1088
Feature activation+2.17e-01

Pos logit contributions
 climbers+9.009
 newcomers+8.931
 passers+8.929
 youngsters+8.794
 commuters+8.711
Neg logit contributions
 violation-7.627
 ['-7.078
 Authorization-6.836
 Declaration-6.812
 quotation-6.766
Token next
Token ID1306
Feature activation+4.11e-01

Pos logit contributions
 violation+1.340
 ['+1.229
 quotation+1.144
 Declaration+1.136
 coercion+1.127
Neg logit contributions
 newcomers-2.043
 climbers-2.039
 passers-2.011
 youngsters-1.986
 commuters-1.974
Token being
Token ID852
Feature activation-7.14e-01

Pos logit contributions
 climbers+0.798
 newcomers+0.798
 passers+0.786
 youngsters+0.778
 commuters+0.774
Neg logit contributions
 violation-0.490
 ['-0.450
 Declaration-0.416
 quotation-0.414
 Authorization-0.411
Token pro
Token ID386
Feature activation+2.26e-01

Pos logit contributions
 climbers+5.970
 newcomers+5.959
 passers+5.813
 youngsters+5.802
 commuters+5.765
Neg logit contributions
 violation-4.970
 ['-4.618
 Authorization-4.358
 Declaration-4.332
 quotation-4.249
Token-
Token ID12
Feature activation-1.21e-01

Pos logit contributions
 violation+1.634
 ['+1.516
 quotation+1.441
 Declaration+1.431
 coercion+1.417
Neg logit contributions
 climbers-1.882
 newcomers-1.880
 passers-1.856
 youngsters-1.821
 commuters-1.812
TokenEuropean
Token ID22030
Feature activation+5.30e-01

Pos logit contributions
 newcomers+0.831
 climbers+0.830
 passers+0.805
 youngsters+0.800
 commuters+0.793
Neg logit contributions
 violation-1.055
 ['-0.989
 quotation-0.945
 Declaration-0.943
 Authorization-0.935
Token,
Token ID11
Feature activation+1.76e-02

Pos logit contributions
 violation+4.471
 ['+4.197
 Declaration+4.031
 quotation+4.022
 coercion+3.983
Neg logit contributions
 climbers-3.682
 newcomers-3.644
 passers-3.635
 commuters-3.509
 youngsters-3.493
Token many
Token ID867
Feature activation-2.57e+00

Pos logit contributions
 violation+0.113
 ['+0.105
 Declaration+0.098
 quotation+0.097
 Authorization+0.096
Neg logit contributions
 newcomers-0.162
 climbers-0.161
 passers-0.158
 youngsters-0.157
 commuters-0.156
Token Conservatives
Token ID17897
Feature activation-5.27e-01

Pos logit contributions
 newcomers+19.324
 youngsters+18.920
 climbers+18.733
 commuters+18.588
 passers+18.372
Neg logit contributions
 violation-14.797
 ['-13.896
 Authorization-13.331
 coercion-12.844
 Declaration-12.697
Token are
Token ID389
Feature activation-1.39e+00

Pos logit contributions
 newcomers+4.047
 climbers+4.040
 youngsters+3.953
 passers+3.937
 commuters+3.900
Neg logit contributions
 violation-4.066
 ['-3.825
 Declaration-3.618
 quotation-3.595
 coercion-3.588
Token not
Token ID407
Feature activation-1.51e+00

Pos logit contributions
 newcomers+11.674
 climbers+11.528
 youngsters+11.437
 passers+11.256
 commuters+11.220
Neg logit contributions
 violation-8.816
 ['-8.177
 Authorization-7.817
 Declaration-7.798
 coercion-7.550
Token,
Token ID11
Feature activation-2.31e-01

Pos logit contributions
 newcomers+11.782
 climbers+11.604
 passers+11.339
 youngsters+11.297
 commuters+11.227
Neg logit contributions
 violation-10.557
 ['-9.709
 Authorization-9.370
 coercion-9.234
 Declaration-9.213
Token and
Token ID290
Feature activation-1.09e+00

Pos logit contributions
 newcomers+2.072
 climbers+2.068
 passers+2.029
 youngsters+2.014
 commuters+2.005
Neg logit contributions
 violation-1.525
 ['-1.410
 quotation-1.316
 Declaration-1.316
 coercion-1.302
Token Wong
Token ID27247
Feature activation-2.27e+00

Pos logit contributions
 newcomers+12.897
 climbers+12.730
 passers+12.444
 youngsters+12.213
 commuters+12.207
Neg logit contributions
 violation-13.164
 ['-12.683
 quotation-11.980
 Declaration-11.867
 memorandum-11.748
Token.
Token ID13
Feature activation-4.30e-01

Pos logit contributions
 newcomers+15.267
 climbers+15.176
 passers+14.993
 commuters+14.763
 youngsters+14.705
Neg logit contributions
 violation-15.305
 ['-14.200
 coercion-13.547
 quotation-13.542
 Declaration-13.539
Token
Token ID198
Feature activation+6.34e-02

Pos logit contributions
 newcomers+3.608
 climbers+3.566
 passers+3.496
 youngsters+3.474
 commuters+3.450
Neg logit contributions
 violation-3.021
 ['-2.755
 quotation-2.643
 coercion-2.632
 memorandum-2.622
Token
Token ID198
Feature activation-9.18e-01

Pos logit contributions
 violation+0.545
 ['+0.512
 quotation+0.489
 Declaration+0.487
 coercion+0.485
Neg logit contributions
 climbers-0.444
 newcomers-0.442
 passers-0.434
 youngsters-0.426
 commuters-0.423
TokenHe
Token ID1544
Feature activation-5.00e-01

Pos logit contributions
 newcomers+6.699
 climbers+6.535
 passers+6.449
 youngsters+6.368
 commuters+6.332
Neg logit contributions
 violation-7.115
 ['-6.518
 quotation-6.387
 memorandum-6.337
 coercion-6.254
Token wants
Token ID3382
Feature activation-2.30e+00

Pos logit contributions
 climbers+4.509
 newcomers+4.491
 youngsters+4.349
 passers+4.341
 commuters+4.323
Neg logit contributions
 violation-3.195
 ['-2.881
 coercion-2.769
 Declaration-2.755
 quotation-2.719
Token greater
Token ID3744
Feature activation-2.30e+00

Pos logit contributions
 newcomers+18.471
 climbers+17.666
 youngsters+17.397
 passers+17.331
 commuters+17.204
Neg logit contributions
 violation-12.162
 ['-11.847
 Declaration-10.691
 quotation-10.556
 memorandum-10.497
Token leadership
Token ID5531
Feature activation-6.54e-01

Pos logit contributions
 newcomers+18.734
 climbers+18.297
 commuters+17.865
 passers+17.815
 youngsters+17.788
Neg logit contributions
 ['-11.001
 violation-10.870
 Declaration-9.726
 quotation-9.596
 memorandum-9.378
Token federally
Token ID33710
Feature activation-5.23e-01

Pos logit contributions
 climbers+5.125
 newcomers+5.109
 passers+4.958
 youngsters+4.932
 commuters+4.902
Neg logit contributions
 violation-4.899
 ['-4.580
 quotation-4.356
 Declaration-4.351
 coercion-4.289
Token,
Token ID11
Feature activation-1.13e-01

Pos logit contributions
 climbers+4.160
 newcomers+4.144
 passers+4.018
 youngsters+3.992
 commuters+3.968
Neg logit contributions
 violation-3.892
 ['-3.622
 quotation-3.478
 Declaration-3.440
 coercion-3.384
Token especially
Token ID2592
Feature activation-8.73e-01

Pos logit contributions
 climbers+1.000
 newcomers+1.000
 passers+0.977
 commuters+0.968
 youngsters+0.967
Neg logit contributions
 violation-0.751
 ['-0.699
 quotation-0.657
 Declaration-0.652
 memorandum-0.641
Token strength
Token ID4202
Feature activation-1.45e+00

Pos logit contributions
 climbers+4.318
 newcomers+4.285
 passers+4.258
 youngsters+4.186
 commuters+4.141
Neg logit contributions
 violation-3.473
 ['-3.335
 quotation-3.119
 memorandum-3.082
 Declaration-3.080
Token.
Token ID13
Feature activation-2.31e+00

Pos logit contributions
 passers+12.046
 climbers+11.936
 newcomers+11.931
 youngsters+11.734
 commuters+11.697
Neg logit contributions
 violation-8.864
 ['-8.260
 quotation-8.009
 Declaration-7.901
 memorandum-7.829
Token
Token ID198
Feature activation-3.75e-01

Pos logit contributions
 newcomers+15.595
 passers+15.546
 climbers+15.364
 youngsters+15.360
 commuters+15.199
Neg logit contributions
 violation-15.541
 quotation-14.002
 memorandum-13.963
 coercion-13.895
 ['-13.524
Token
Token ID198
Feature activation-1.99e+00

Pos logit contributions
 newcomers+2.400
 climbers+2.370
 passers+2.324
 youngsters+2.310
 commuters+2.275
Neg logit contributions
 violation-3.389
 ['-3.132
 quotation-3.063
 coercion-3.059
 memorandum-3.047
TokenThousands
Token ID37482
Feature activation-3.57e+00

Pos logit contributions
 passers+12.367
 newcomers+12.364
Residents+12.327
 youngsters+12.214
zens+12.190
Neg logit contributions
 violation-14.873
 quotation-13.616
 memorandum-13.486
 coercion-13.175
 ['-12.992
Token of
Token ID286
Feature activation-2.64e+00

Pos logit contributions
 passers+23.808
 onlook+23.576
 climbers+23.397
 commuters+23.366
 youngsters+23.365
Neg logit contributions
 violation-15.774
 quotation-15.042
 Authorization-14.541
 ['-14.451
 coercion-14.437
Token people
Token ID661
Feature activation-2.21e+00

Pos logit contributions
 passers+20.911
 commuters+20.601
 newcomers+20.397
 youngsters+20.386
 onlook+20.185
Neg logit contributions
 violation-13.332
 ['-12.587
 Authorization-12.074
 quotation-11.995
 Declaration-11.790
Token gather
Token ID6431
Feature activation-1.61e+00

Pos logit contributions
 passers+16.468
 climbers+16.374
 youngsters+16.251
 commuters+16.112
 newcomers+16.053
Neg logit contributions
 violation-12.669
 ['-11.779
 quotation-11.708
 Declaration-11.631
 Authorization-11.470
Token at
Token ID379
Feature activation-4.06e-01

Pos logit contributions
 passers+11.220
 climbers+11.064
 newcomers+10.901
 youngsters+10.874
 onlook+10.810
Neg logit contributions
 violation-11.648
 quotation-10.688
 ['-10.657
 memorandum-10.571
 Declaration-10.534
Token the
Token ID262
Feature activation-1.64e+00

Pos logit contributions
 newcomers+3.600
 climbers+3.594
 passers+3.594
 commuters+3.511
 youngsters+3.507
Neg logit contributions
 violation-2.653
 ['-2.445
 quotation-2.332
 Declaration-2.299
 coercion-2.275
Token final
Token ID2457
Feature activation-4.16e-01

Pos logit contributions
 newcomers+13.350
 passers+13.238
 climbers+13.135
 commuters+13.077
 youngsters+13.011
Neg logit contributions
 violation-9.469
 ['-8.920
 quotation-8.601
 coercion-8.261
 Declaration-8.117
Token other
Token ID584
Feature activation-2.20e+00

Pos logit contributions
 climbers+20.337
 passers+19.715
 newcomers+19.438
 onlook+19.388
 commuters+19.328
Neg logit contributions
 violation-12.535
 ['-12.269
 Authorization-11.608
 Declaration-11.231
 quotation-11.145
Token bl
Token ID698
Feature activation-1.49e+00

Pos logit contributions
 climbers+17.922
 newcomers+17.672
 passers+17.580
 commuters+17.208
 youngsters+17.173
Neg logit contributions
 ['-11.420
 violation-11.190
 Authorization-10.616
 quotation-10.298
 Declaration-10.254
Tokenazes
Token ID36096
Feature activation-2.50e+00

Pos logit contributions
 climbers+3.753
zens+3.626
 newcomers+3.385
 passers+3.339
 youngsters+3.174
Neg logit contributions
 violation-17.766
 coercion-16.983
 Declaration-16.903
 quotation-16.799
 Authorization-16.436
Token burning
Token ID9482
Feature activation-2.28e+00

Pos logit contributions
 climbers+19.138
 newcomers+18.556
 passers+18.540
 youngsters+18.501
 commuters+18.312
Neg logit contributions
 violation-13.149
 quotation-12.194
 Authorization-12.130
 ['-12.125
 coercion-11.908
Token through
Token ID832
Feature activation-2.14e+00

Pos logit contributions
 climbers+16.602
 passers+16.236
 newcomers+16.165
 commuters+15.891
 youngsters+15.812
Neg logit contributions
 violation-13.653
 ['-13.108
 quotation-12.803
 Authorization-12.712
 Declaration-12.632
Token the
Token ID262
Feature activation-2.35e+00

Pos logit contributions
 climbers+17.080
 passers+16.962
 newcomers+16.853
 youngsters+16.501
 commuters+16.464
Neg logit contributions
 violation-11.612
 ['-11.395
 Authorization-10.755
 Declaration-10.643
 memorandum-10.602
Token region
Token ID3814
Feature activation-1.97e+00

Pos logit contributions
 climbers+17.130
 newcomers+17.041
 upscale+16.644
 passers+16.546
 youngsters+16.375
Neg logit contributions
 violation-13.608
 ['-12.926
 Authorization-12.399
 coercion-12.147
 quotation-12.125
Token,
Token ID11
Feature activation-2.05e+00

Pos logit contributions
 climbers+15.597
 newcomers+15.303
 passers+15.190
 youngsters+15.087
 commuters+14.975
Neg logit contributions
 violation-12.056
 ['-11.013
 quotation-10.845
 Declaration-10.674
 coercion-10.592
Token people
Token ID661
Feature activation-2.21e+00

Pos logit contributions
 climbers+17.381
 newcomers+16.924
 passers+16.922
 youngsters+16.684
 commuters+16.629
Neg logit contributions
 violation-10.909
 ['-10.339
 Authorization-9.724
 quotation-9.653
 Declaration-9.604
Token were
Token ID547
Feature activation-1.80e+00

Pos logit contributions
 climbers+16.848
 newcomers+16.497
 youngsters+16.310
 passers+16.289
 commuters+16.082
Neg logit contributions
 violation-12.774
 Declaration-11.909
 ['-11.769
 coercion-11.487
 Authorization-11.453
Token pictured
Token ID17915
Feature activation-2.56e+00

Pos logit contributions
 climbers+14.405
 newcomers+14.186
 passers+14.102
 commuters+13.923
 youngsters+13.893
Neg logit contributions
 violation-11.121
 ['-10.266
 Declaration-10.041
 quotation-9.897
 Authorization-9.836