TOP ACTIVATIONS
MAX = 2.902

are exposed to , find out the social environment they are
and you can get a hint of what to expect .
for six years to figure out how to fix the walk
Council is trying to figure out what to do about the
lands , and then figure out how to acquire one ,
) re - d iscover the Green Line .
. 1998 Scientists perfect methods for growing human embryonic
ko and Sach dev already knew each other through Per imeter
Syrian refugees who have no idea that the donations come from
Wednesday in an attempt to confirm the spread of the disease
campaign is trying to figure out how to coordinate it all
them out , can find out how to catch these workshop
uit and Thomas M . Smith at the National Center for
are not mysterious find out the ideas that young people
adhere to this and work out a refund policy if applicable
t stupid . They figured out that the only way to
with the goal of figuring out why some Muslims become terrorists
Dallas has to decide almost immediately into the offseason whether they
which two families compete to guess the most popular answers to
v 2 . MOR PH _ BL ACK H AT ,

MIN ACTIVATIONS
MIN = -3.010

supreme Court , and all other Officers of the United States
the supreme Court , and all other Officers of the United
606 7 201 99 ; head water . com ), the
assadors , other public Ministers and Council s , Judges of
shall appoint Amb assadors , other public Ministers and Council s
some indie work in recent films such as Dead fall and
of the Senate , shall appoint Amb assadors , other public
of the supreme Court , and all other Officers of the
, her leadership of that powerful panel -- and how it
are hereby terminated and removed from office , effective immediately .
difficulties for other types of vessels , are significant factors .
But if Os so ff loses , expect the liberal hand
appoint Amb assadors , other public Ministers and Council s ,
in my career that has had this kind of reaction ,"
. " Senior public servants superv ise ang l
the Commonwealth Parliament does not have any specific power to legisl
crazy . I 've never done anything in my career that
they felt hopeless . L osing T 1 early against an
his first shot . Later in the dial , he gets
as well as longer - lived greenhouse gases such as nit

INTERVAL 1.424 - 2.902
CONTAINS 0.538%

employed similar schemes and also found success . Petty 's passer
had been abducted and held since 2010 . Pak
Norway : University of Ber gen . Google Scholar
has also been opened to determine how the error happened ."
punishment , that players know they can get on the field

INTERVAL -0.054 - 1.424
CONTAINS 52.595%

. It s 2016 . We should
D ines h was just talking about the beginnings of this
the school year , it occurred to me that perhaps this
moment that read () returns , the stack pointer is in
create a comic book - like appearance . The opening mission

INTERVAL -1.532 - -0.054
CONTAINS 46.661%

Marco Murray and Tony will be 35 next year . I
Congress would pass legislation to raise the ceiling in time to
are extremely close , really , really close . I love
night for entire house ; $ 24 for approximately 1 -
.[ 21 ] In the same way , Christianity would later

INTERVAL -3.010 - -1.532
CONTAINS 0.205%

fairly simple to mod , and partly due to bits of
) . He alot of times goes from genius to total
Pruitt s two federal PACs Liber ty 2
. 21 : " Board of Directors : John T .
parents to predators , natural disasters and hunger
Token are
Token ID389
Feature activation+4.03e-01

Pos logit contributions
 themselves+8.592
 Zuckerberg+7.904
 Mandela+7.796
 THEY+7.625
 Tsarnaev+7.614
Neg logit contributions
pite-10.495
laughs-10.001
ittee-9.774
restricted-9.490
crew-9.200
Token exposed
Token ID7362
Feature activation+2.76e-01

Pos logit contributions
 themselves+3.458
 Zuckerberg+3.441
 Mandela+3.419
 THEIR+3.327
 THEY+3.326
Neg logit contributions
pite-3.658
laughs-3.452
ittee-3.400
restricted-3.225
acia-3.169
Token to
Token ID284
Feature activation+9.46e-01

Pos logit contributions
 themselves+2.441
 Zuckerberg+2.435
 Mandela+2.428
 THEY+2.369
 THEIR+2.363
Neg logit contributions
pite-2.416
laughs-2.276
ittee-2.226
restricted-2.119
crew-2.078
Token,
Token ID11
Feature activation+1.12e+00

Pos logit contributions
 themselves+7.065
 Zuckerberg+6.665
 Mandela+6.587
 THEY+6.440
 THEIR+6.349
Neg logit contributions
pite-9.621
laughs-9.061
ittee-8.847
restricted-8.482
acia-8.347
Token find
Token ID1064
Feature activation+1.78e+00

Pos logit contributions
 themselves+9.601
 Zuckerberg+9.338
 Mandela+9.187
 THEY+9.020
 THEIR+8.988
Neg logit contributions
pite-9.655
laughs-8.816
ittee-8.790
restricted-8.402
crew-8.201
Token out
Token ID503
Feature activation+2.90e+00

Pos logit contributions
 themselves+11.964
 Zuckerberg+11.022
 THEIR+10.668
 their+10.514
 Mandela+10.446
Neg logit contributions
pite-17.087
ittee-16.281
laughs-15.629
restricted-15.168
acia-15.128
Token the
Token ID262
Feature activation+1.27e+00

Pos logit contributions
 themselves+19.671
 their+18.945
 they+18.939
 THEIR+17.996
 Zuckerberg+17.862
Neg logit contributions
pite-19.972
ittee-19.348
restricted-18.055
laughs-17.838
acia-17.358
Token social
Token ID1919
Feature activation+5.59e-01

Pos logit contributions
 themselves+10.614
 Zuckerberg+10.546
 Mandela+10.502
 Tsarnaev+10.198
 THEIR+9.912
Neg logit contributions
pite-10.450
ittee-10.341
laughs-9.898
restricted-9.300
acia-9.167
Token environment
Token ID2858
Feature activation+7.07e-01

Pos logit contributions
 themselves+4.824
 Mandela+4.779
 Zuckerberg+4.759
 THEY+4.637
 THEIR+4.612
Neg logit contributions
pite-4.949
laughs-4.702
ittee-4.671
restricted-4.423
acia-4.296
Token they
Token ID484
Feature activation+1.23e+00

Pos logit contributions
 themselves+5.481
 Zuckerberg+5.343
 Mandela+5.336
 THEY+5.284
 THEIR+5.184
Neg logit contributions
pite-7.004
laughs-6.668
ittee-6.578
restricted-6.208
acia-6.166
Token are
Token ID389
Feature activation+4.82e-01

Pos logit contributions
 themselves+9.311
 Zuckerberg+8.201
 Tsarnaev+8.038
 they+8.002
 Mandela+7.999
Neg logit contributions
pite-12.333
laughs-11.738
ittee-11.313
restricted-10.991
catentry-10.596
Token and
Token ID290
Feature activation+8.15e-01

Pos logit contributions
 themselves+5.183
 Zuckerberg+5.147
 Mandela+5.058
 THEY+5.011
 THEIR+4.979
Neg logit contributions
pite-6.651
laughs-6.278
ittee-6.190
restricted-5.956
crew-5.831
Token you
Token ID345
Feature activation+1.06e+00

Pos logit contributions
 themselves+6.165
 Zuckerberg+6.111
 THEY+5.968
 Mandela+5.956
 THEIR+5.882
Neg logit contributions
pite-8.036
laughs-7.536
ittee-7.491
restricted-7.245
crew-7.074
Token can
Token ID460
Feature activation+2.21e-01

Pos logit contributions
 themselves+7.144
 Zuckerberg+6.643
 THEY+6.525
 """+6.445
 Mandela+6.442
Neg logit contributions
pite-11.210
laughs-10.544
ittee-10.306
restricted-10.062
acia-9.900
Token get
Token ID651
Feature activation+3.94e-01

Pos logit contributions
 Zuckerberg+1.746
 Mandela+1.745
 themselves+1.734
 THEY+1.694
 THEIR+1.689
Neg logit contributions
pite-2.124
laughs-2.017
ittee-1.983
restricted-1.893
crew-1.859
Token a
Token ID257
Feature activation+7.35e-01

Pos logit contributions
 themselves+2.973
 Zuckerberg+2.953
 Mandela+2.940
 THEIR+2.868
 THEY+2.856
Neg logit contributions
pite-3.977
laughs-3.744
ittee-3.728
restricted-3.544
acia-3.476
Token hint
Token ID9254
Feature activation+2.83e+00

Pos logit contributions
 themselves+5.159
 Zuckerberg+5.135
 Mandela+5.084
 THEIR+4.955
 THEY+4.921
Neg logit contributions
pite-7.526
laughs-7.177
ittee-7.121
restricted-6.799
acia-6.752
Token of
Token ID286
Feature activation+1.56e+00

Pos logit contributions
 themselves+18.467
 they+17.522
 their+16.080
 Zuckerberg+16.067
 THEY+15.977
Neg logit contributions
pite-19.097
ittee-17.736
laughs-17.358
restricted-16.894
foreseen-16.699
Token what
Token ID644
Feature activation+7.64e-01

Pos logit contributions
 themselves+10.541
 THEIR+9.751
 Zuckerberg+9.694
 their+9.500
 Mandela+9.208
Neg logit contributions
pite-15.100
ittee-14.517
restricted-14.060
laughs-13.972
acia-13.351
Token to
Token ID284
Feature activation+3.36e-01

Pos logit contributions
 themselves+5.570
 Zuckerberg+5.515
 THEY+5.490
 THEIR+5.415
 Mandela+5.411
Neg logit contributions
pite-7.719
laughs-7.209
ittee-7.166
restricted-6.918
crew-6.755
Token expect
Token ID1607
Feature activation+1.31e+00

Pos logit contributions
 Zuckerberg+2.326
 Mandela+2.323
 themselves+2.311
 THEY+2.248
 THEIR+2.241
Neg logit contributions
pite-3.566
laughs-3.400
ittee-3.349
restricted-3.214
crew-3.160
Token.
Token ID13
Feature activation+3.85e-01

Pos logit contributions
 themselves+9.281
 THEY+8.380
 Zuckerberg+8.286
 THEIR+8.269
 they+8.243
Neg logit contributions
pite-13.178
ittee-12.348
laughs-12.226
restricted-11.762
acia-11.538
Token for
Token ID329
Feature activation+4.19e-02

Pos logit contributions
 Zuckerberg+2.562
 Mandela+2.559
 themselves+2.556
 THEY+2.487
 THEIR+2.484
Neg logit contributions
pite-2.967
laughs-2.816
ittee-2.762
restricted-2.638
crew-2.586
Token six
Token ID2237
Feature activation+8.57e-02

Pos logit contributions
 Mandela+0.310
 Zuckerberg+0.309
 themselves+0.305
 THEY+0.303
 THEIR+0.300
Neg logit contributions
pite-0.424
laughs-0.403
ittee-0.398
restricted-0.380
such-0.374
Token years
Token ID812
Feature activation+2.15e-01

Pos logit contributions
 Mandela+0.488
 Zuckerberg+0.485
 themselves+0.478
 THEY+0.470
 THEIR+0.468
Neg logit contributions
pite-1.014
laughs-0.974
ittee-0.963
restricted-0.924
crew-0.913
Token to
Token ID284
Feature activation+5.17e-01

Pos logit contributions
 Zuckerberg+1.905
 Mandela+1.903
 themselves+1.891
 THEY+1.855
 THEIR+1.850
Neg logit contributions
pite-1.865
laughs-1.758
ittee-1.729
restricted-1.640
crew-1.606
Token figure
Token ID3785
Feature activation+2.10e+00

Pos logit contributions
 themselves+3.526
 Zuckerberg+3.456
 Mandela+3.454
 THEY+3.350
 THEIR+3.339
Neg logit contributions
pite-5.559
laughs-5.332
ittee-5.205
restricted-5.034
crew-4.932
Token out
Token ID503
Feature activation+2.81e+00

Pos logit contributions
 themselves+11.966
 Zuckerberg+10.368
 their+10.226
 THEIR+10.200
 Mandela+9.960
Neg logit contributions
pite-20.554
laughs-20.036
ittee-19.175
acia-18.833
restricted-18.817
Token how
Token ID703
Feature activation+7.48e-01

Pos logit contributions
 themselves+16.697
 their+16.380
 Zuckerberg+15.622
 they+15.292
 THEIR+15.110
Neg logit contributions
laughs-21.005
pite-20.752
ittee-19.233
restricted-18.480
acia-18.474
Token to
Token ID284
Feature activation+2.61e-01

Pos logit contributions
 themselves+6.058
 Zuckerberg+5.998
 Mandela+5.961
 THEY+5.881
 THEIR+5.811
Neg logit contributions
pite-6.902
laughs-6.703
ittee-6.369
restricted-6.116
crew-6.048
Token fix
Token ID4259
Feature activation+1.20e+00

Pos logit contributions
 Zuckerberg+1.847
 Mandela+1.845
 themselves+1.837
 THEY+1.787
 THEIR+1.781
Neg logit contributions
pite-2.727
laughs-2.599
ittee-2.560
restricted-2.455
crew-2.413
Token the
Token ID262
Feature activation+7.88e-01

Pos logit contributions
 themselves+8.486
 Zuckerberg+8.155
 Mandela+8.107
 THEIR+7.841
 THEY+7.540
Neg logit contributions
pite-11.598
laughs-11.370
ittee-11.099
crew-10.474
restricted-10.244
Token walk
Token ID2513
Feature activation-8.51e-02

Pos logit contributions
 Zuckerberg+5.937
 Mandela+5.937
 themselves+5.928
 THEIR+5.647
 THEY+5.645
Neg logit contributions
pite-7.594
laughs-7.311
ittee-7.281
crew-6.777
restricted-6.745
Token Council
Token ID4281
Feature activation-2.55e-01

Pos logit contributions
pite+4.108
ittee+3.842
laughs+3.733
restricted+3.611
crew+3.570
Neg logit contributions
 Mandela-4.674
 Zuckerberg-4.612
-4.607
 THEIR-4.606
 THEY-4.554
Token is
Token ID318
Feature activation+1.01e-02

Pos logit contributions
pite+2.147
ittee+2.027
laughs+1.998
restricted+1.888
crew+1.860
Neg logit contributions
 Mandela-2.289
 Zuckerberg-2.270
 THEIR-2.236
 THEY-2.235
-2.216
Token trying
Token ID2111
Feature activation+8.29e-01

Pos logit contributions
 Mandela+0.085
 Zuckerberg+0.085
 themselves+0.084
 THEY+0.083
+0.083
Neg logit contributions
pite-0.092
laughs-0.087
ittee-0.085
restricted-0.081
crew-0.080
Token to
Token ID284
Feature activation+6.30e-01

Pos logit contributions
 themselves+6.682
 Zuckerberg+6.439
 Mandela+6.364
 THEIR+6.329
 THEY+6.198
Neg logit contributions
pite-7.719
laughs-7.404
ittee-6.933
restricted-6.867
crew-6.667
Token figure
Token ID3785
Feature activation+2.02e+00

Pos logit contributions
 themselves+4.507
 Mandela+4.376
 Zuckerberg+4.368
 THEIR+4.232
 THEY+4.222
Neg logit contributions
pite-6.524
laughs-6.256
ittee-6.099
restricted-5.931
acia-5.806
Token out
Token ID503
Feature activation+2.80e+00

Pos logit contributions
 themselves+12.354
 Zuckerberg+10.811
 their+10.712
 THEIR+10.622
 they+10.507
Neg logit contributions
pite-19.065
laughs-18.279
ittee-17.702
acia-17.422
restricted-17.286
Token what
Token ID644
Feature activation+1.05e+00

Pos logit contributions
 themselves+17.301
 their+16.759
 Zuckerberg+16.014
 THEIR+15.804
 they+15.684
Neg logit contributions
laughs-20.115
pite-19.776
ittee-18.334
restricted-17.991
acia-17.835
Token to
Token ID284
Feature activation+4.79e-01

Pos logit contributions
 themselves+8.567
 Zuckerberg+8.411
 Mandela+8.353
 THEY+8.326
 THEIR+8.257
Neg logit contributions
pite-9.243
laughs-8.983
ittee-8.460
restricted-8.240
acia-8.139
Token do
Token ID466
Feature activation+7.17e-01

Pos logit contributions
 themselves+3.065
 Mandela+3.056
 Zuckerberg+3.046
 THEY+2.937
 THEIR+2.931
Neg logit contributions
pite-5.341
laughs-5.118
ittee-5.005
restricted-4.827
crew-4.733
Token about
Token ID546
Feature activation+3.62e-01

Pos logit contributions
 themselves+4.723
 Zuckerberg+4.561
 Mandela+4.503
 THEY+4.388
 THEIR+4.357
Neg logit contributions
pite-7.842
laughs-7.492
ittee-7.275
restricted-7.005
acia-6.933
Token the
Token ID262
Feature activation+2.94e-01

Pos logit contributions
 Zuckerberg+2.745
 Mandela+2.745
 themselves+2.744
 THEIR+2.658
 THEY+2.653
Neg logit contributions
pite-3.593
laughs-3.433
ittee-3.363
restricted-3.211
crew-3.169
Token lands
Token ID8604
Feature activation-5.68e-01

Pos logit contributions
pite+4.041
laughs+3.818
ittee+3.742
restricted+3.706
acia+3.685
Neg logit contributions
 Zuckerberg-2.890
 Mandela-2.873
-2.864
 THEIR-2.838
IELD-2.830
Token,
Token ID11
Feature activation-4.08e-01

Pos logit contributions
pite+5.256
ittee+4.967
laughs+4.909
restricted+4.816
crew+4.697
Neg logit contributions
-4.377
 Mandela-4.359
 Zuckerberg-4.358
IELD-4.258
 THEIR-4.190
Token and
Token ID290
Feature activation-2.87e-01

Pos logit contributions
pite+3.784
laughs+3.515
ittee+3.496
restricted+3.385
such+3.384
Neg logit contributions
-3.201
 Mandela-3.180
 Zuckerberg-3.167
IELD-3.081
 THEY-3.076
Token then
Token ID788
Feature activation+2.98e-01

Pos logit contributions
pite+2.782
ittee+2.613
laughs+2.601
restricted+2.500
such+2.471
Neg logit contributions
 Mandela-2.125
-2.121
 Zuckerberg-2.117
 themselves-2.083
IELD-2.073
Token figure
Token ID3785
Feature activation+1.48e+00

Pos logit contributions
 Zuckerberg+2.282
 Mandela+2.279
 themselves+2.276
 THEY+2.216
 THEIR+2.207
Neg logit contributions
pite-2.943
laughs-2.797
ittee-2.749
restricted-2.629
crew-2.582
Token out
Token ID503
Feature activation+2.77e+00

Pos logit contributions
 themselves+9.195
 THEIR+8.168
 Zuckerberg+8.162
 THEY+8.126
 they+8.093
Neg logit contributions
pite-15.758
laughs-14.947
ittee-14.815
acia-14.123
restricted-14.113
Token how
Token ID703
Feature activation+6.01e-01

Pos logit contributions
 themselves+18.888
 they+17.912
 their+17.518
 THEIR+16.639
 THEY+16.360
Neg logit contributions
pite-20.033
laughs-19.074
ittee-18.896
acia-17.502
restricted-17.304
Token to
Token ID284
Feature activation+1.40e-01

Pos logit contributions
 themselves+5.034
 Zuckerberg+4.995
 Mandela+4.994
 THEY+4.922
 THEIR+4.882
Neg logit contributions
pite-5.507
laughs-5.242
ittee-5.086
restricted-4.836
acia-4.749
Token acquire
Token ID12831
Feature activation-1.72e-01

Pos logit contributions
 Mandela+1.025
 Zuckerberg+1.025
 themselves+1.001
+0.994
 THEY+0.990
Neg logit contributions
pite-1.421
laughs-1.352
ittee-1.333
restricted-1.277
crew-1.259
Token one
Token ID530
Feature activation+9.96e-01

Pos logit contributions
pite+1.702
laughs+1.609
ittee+1.583
restricted+1.560
such+1.533
Neg logit contributions
 Mandela-1.243
 Zuckerberg-1.242
-1.241
IELD-1.211
 THEY-1.203
Token,
Token ID11
Feature activation+4.71e-01

Pos logit contributions
 themselves+7.481
 Mandela+6.837
 Zuckerberg+6.835
 THEIR+6.705
 THEY+6.701
Neg logit contributions
pite-9.877
laughs-9.523
ittee-9.219
crew-8.809
restricted-8.734
Token
Token ID251
Feature activation+8.38e-02

Pos logit contributions
pite+1.613
laughs+1.533
ittee+1.499
restricted+1.427
such+1.402
Neg logit contributions
 Mandela-1.795
 Zuckerberg-1.780
 themselves-1.754
 THEIR-1.735
 THEY-1.735
Token)
Token ID8
Feature activation+1.94e+00

Pos logit contributions
 Zuckerberg+0.720
 Mandela+0.719
 themselves+0.709
 THEIR+0.700
 THEY+0.700
Neg logit contributions
pite-0.750
laughs-0.709
ittee-0.697
restricted-0.664
crew-0.653
Token re
Token ID302
Feature activation+1.52e+00

Pos logit contributions
 themselves+15.193
 they+13.720
 Tsarnaev+13.639
 Zuckerberg+13.616
 Mandela+13.577
Neg logit contributions
pite-14.891
laughs-13.802
ittee-13.522
restricted-13.122
acia-12.621
Token-
Token ID12
Feature activation+8.80e-01

Pos logit contributions
 themselves+13.029
 Mandela+12.591
 Zuckerberg+12.500
 THEY+12.262
 Tsarnaev+12.189
Neg logit contributions
pite-10.773
ittee-10.174
laughs-10.074
acia-9.271
restricted-9.231
Tokend
Token ID67
Feature activation+6.42e-01

Pos logit contributions
 themselves+8.149
 Mandela+8.068
 Zuckerberg+7.995
 """+7.803
 THEY+7.803
Neg logit contributions
pite-6.686
ittee-6.233
laughs-6.028
acia-5.655
restricted-5.538
Tokeniscover
Token ID29392
Feature activation+2.72e+00

Pos logit contributions
 themselves+5.868
 Mandela+5.826
 Zuckerberg+5.816
 THEY+5.649
 THEIR+5.616
Neg logit contributions
pite-5.158
laughs-5.021
ittee-4.725
restricted-4.551
acia-4.406
Token the
Token ID262
Feature activation+2.05e+00

Pos logit contributions
 themselves+20.998
 Mandela+19.866
 their+19.523
 they+18.930
 THEIR+18.479
Neg logit contributions
pite-16.459
ittee-15.851
laughs-15.709
esides-14.217
restricted-14.127
Token Green
Token ID3469
Feature activation+3.84e-01

Pos logit contributions
 Mandela+16.071
 themselves+15.281
 basics+14.857
 Mao+14.703
 Zuckerberg+14.683
Neg logit contributions
ittee-14.627
pite-14.479
laughs-14.392
crew-13.126
restricted-13.062
Token Line
Token ID6910
Feature activation+1.50e+00

Pos logit contributions
 Mandela+2.727
 Zuckerberg+2.720
 themselves+2.708
 THEY+2.626
 THEIR+2.617
Neg logit contributions
pite-4.008
laughs-3.820
ittee-3.756
restricted-3.604
crew-3.545
Token.
Token ID13
Feature activation+9.01e-01

Pos logit contributions
 themselves+11.449
 Mandela+11.404
 Zuckerberg+10.923
 """+10.642
 THEY+10.490
Neg logit contributions
pite-12.920
laughs-12.257
ittee-11.883
acia-11.315
restricted-11.295
Token
Token ID198
Feature activation+4.09e-01

Pos logit contributions
 Mandela+8.074
 Zuckerberg+7.873
 themselves+7.799
 THEY+7.750
 THEIR+7.596
Neg logit contributions
pite-7.693
laughs-7.163
ittee-7.139
restricted-6.779
acia-6.674
Token.
Token ID13
Feature activation+2.91e-01

Pos logit contributions
 themselves+3.124
 Zuckerberg+3.072
 Mandela+3.056
 THEY+2.990
 THEIR+2.962
Neg logit contributions
pite-4.066
laughs-3.884
ittee-3.808
restricted-3.629
acia-3.598
Token
Token ID198
Feature activation+2.19e-01

Pos logit contributions
 Zuckerberg+2.040
 Mandela+2.037
 themselves+2.031
 THEY+1.971
 THEIR+1.967
Neg logit contributions
pite-3.066
laughs-2.924
ittee-2.880
restricted-2.761
crew-2.712
Token
Token ID198
Feature activation+5.02e-01

Pos logit contributions
 themselves+1.106
 Zuckerberg+1.090
 Mandela+1.087
 THEY+1.041
 THEIR+1.029
Neg logit contributions
pite-2.747
ittee-2.622
laughs-2.614
restricted-2.508
acia-2.480
Token1998
Token ID21113
Feature activation+7.58e-01

Pos logit contributions
 themselves+2.716
 Zuckerberg+2.716
 Mandela+2.696
 THEY+2.622
 THEIR+2.613
Neg logit contributions
pite-5.987
laughs-5.705
ittee-5.660
restricted-5.439
acia-5.395
Token Scientists
Token ID23782
Feature activation+7.97e-01

Pos logit contributions
 Mandela+6.028
 Zuckerberg+5.986
 themselves+5.843
 THEY+5.768
 Tsarnaev+5.724
Neg logit contributions
pite-7.136
laughs-6.732
ittee-6.623
restricted-6.340
crew-6.295
Token perfect
Token ID2818
Feature activation+2.68e+00

Pos logit contributions
 themselves+5.932
 Mandela+5.770
 Zuckerberg+5.752
 Tsarnaev+5.465
 THEY+5.462
Neg logit contributions
pite-7.962
laughs-7.600
ittee-7.437
restricted-7.150
crew-7.031
Token methods
Token ID5050
Feature activation+8.20e-01

Pos logit contributions
 themselves+19.523
 their+18.271
 Mandela+17.751
 Zuckerberg+17.498
 THEIR+17.469
Neg logit contributions
pite-17.516
ittee-16.858
laughs-16.854
acia-15.838
esides-15.595
Token for
Token ID329
Feature activation+1.32e+00

Pos logit contributions
 themselves+6.102
 Zuckerberg+5.915
 Mandela+5.849
 THEY+5.783
 THEIR+5.639
Neg logit contributions
pite-8.101
laughs-7.821
ittee-7.630
acia-7.360
restricted-7.266
Token growing
Token ID3957
Feature activation+9.29e-01

Pos logit contributions
 themselves+10.520
 Zuckerberg+10.085
 Mandela+10.001
 THEIR+9.822
 Tsarnaev+9.686
Neg logit contributions
pite-11.610
ittee-10.982
laughs-10.935
acia-10.349
restricted-10.303
Token human
Token ID1692
Feature activation+4.36e-01

Pos logit contributions
 themselves+7.347
 Mandela+7.138
 Zuckerberg+7.063
 THEIR+6.871
 THEY+6.789
Neg logit contributions
pite-8.930
laughs-8.423
ittee-8.395
acia-7.799
restricted-7.778
Token embryonic
Token ID49710
Feature activation-1.02e-01

Pos logit contributions
 Mandela+3.273
 Zuckerberg+3.268
 themselves+3.268
 THEY+3.166
 THEIR+3.156
Neg logit contributions
pite-4.379
laughs-4.157
ittee-4.109
restricted-3.915
crew-3.839
Tokenko
Token ID7204
Feature activation+1.28e+00

Pos logit contributions
 themselves+2.932
 Zuckerberg+2.912
 Mandela+2.900
 THEY+2.820
 Tsarnaev+2.805
Neg logit contributions
pite-3.194
laughs-2.998
ittee-2.963
restricted-2.841
such-2.763
Token and
Token ID290
Feature activation-6.40e-02

Pos logit contributions
 themselves+10.439
 Zuckerberg+10.294
 Mandela+10.240
 Tsarnaev+9.837
 MIT+9.689
Neg logit contributions
pite-10.977
ittee-10.105
laughs-10.080
restricted-9.629
acia-9.611
Token Sach
Token ID20678
Feature activation+3.22e-01

Pos logit contributions
pite+0.674
laughs+0.646
ittee+0.635
restricted+0.607
crew+0.601
Neg logit contributions
 Mandela-0.436
 Zuckerberg-0.431
 themselves-0.429
-0.423
 THEY-0.423
Tokendev
Token ID7959
Feature activation+2.04e+00

Pos logit contributions
 themselves+3.021
 Zuckerberg+2.969
 Mandela+2.928
 """+2.868
 THEY+2.845
Neg logit contributions
pite-2.595
laughs-2.414
ittee-2.412
restricted-2.230
acia-2.224
Token already
Token ID1541
Feature activation+1.55e+00

Pos logit contributions
 themselves+16.360
 invented+14.779
 Zuckerberg+14.624
 they+14.568
 Mandela+14.488
Neg logit contributions
pite-14.934
laughs-13.721
acia-13.453
restricted-13.135
ittee-13.104
Token knew
Token ID2993
Feature activation+2.52e+00

Pos logit contributions
 themselves+10.875
 Zuckerberg+9.892
 Mandela+9.608
 MIT+9.573
 invented+9.555
Neg logit contributions
pite-14.675
laughs-14.070
ittee-13.740
acia-13.317
restricted-13.269
Token each
Token ID1123
Feature activation+1.42e-01

Pos logit contributions
 themselves+17.763
 they+17.520
 their+16.842
 Mandela+16.779
 Zuckerberg+16.532
Neg logit contributions
pite-18.521
ittee-17.637
laughs-17.111
restricted-16.603
acia-16.524
Token other
Token ID584
Feature activation+6.14e-01

Pos logit contributions
 Zuckerberg+1.368
 Mandela+1.367
 themselves+1.351
 THEY+1.339
 THEIR+1.333
Neg logit contributions
pite-1.121
laughs-1.044
ittee-1.026
restricted-0.968
such-0.954
Token through
Token ID832
Feature activation+7.90e-01

Pos logit contributions
 themselves+4.821
 Mandela+4.803
 Zuckerberg+4.803
 THEY+4.639
 THEIR+4.604
Neg logit contributions
pite-5.907
laughs-5.583
ittee-5.541
restricted-5.291
acia-5.210
Token Per
Token ID2448
Feature activation-1.24e-01

Pos logit contributions
 themselves+6.015
 Zuckerberg+5.955
 Mandela+5.919
 THEIR+5.752
 THEY+5.691
Neg logit contributions
pite-7.730
laughs-7.377
ittee-7.360
restricted-7.008
acia-6.817
Tokenimeter
Token ID16912
Feature activation+6.96e-01

Pos logit contributions
pite+1.047
laughs+0.972
ittee+0.959
restricted+0.914
such+0.894
Neg logit contributions
 Mandela-1.131
 Zuckerberg-1.129
-1.128
 themselves-1.127
 THEIR-1.118
Token Syrian
Token ID6318
Feature activation+6.35e-02

Pos logit contributions
 Mandela+3.030
 themselves+3.026
 Zuckerberg+3.014
 THEIR+2.881
 THEY+2.876
Neg logit contributions
pite-4.990
laughs-4.778
ittee-4.709
restricted-4.495
crew-4.447
Token refugees
Token ID8015
Feature activation+1.66e-02

Pos logit contributions
 Zuckerberg+0.436
 Mandela+0.435
 themselves+0.426
 THEIR+0.425
 THEY+0.425
Neg logit contributions
pite-0.675
laughs-0.639
ittee-0.635
restricted-0.612
acia-0.599
Token who
Token ID508
Feature activation+1.08e+00

Pos logit contributions
 Zuckerberg+0.125
 Mandela+0.125
 THEIR+0.121
+0.121
 themselves+0.121
Neg logit contributions
pite-0.166
laughs-0.156
ittee-0.153
restricted-0.148
such-0.145
Token have
Token ID423
Feature activation+9.33e-01

Pos logit contributions
 themselves+7.490
 Zuckerberg+7.004
 Mandela+6.967
 Tsarnaev+6.804
 THEY+6.727
Neg logit contributions
pite-11.050
laughs-10.975
ittee-10.423
restricted-10.014
crew-10.005
Token no
Token ID645
Feature activation+8.30e-01

Pos logit contributions
 themselves+6.845
 Mandela+6.475
 Zuckerberg+6.418
 Tsarnaev+6.175
 THEIR+6.137
Neg logit contributions
pite-9.517
laughs-9.171
ittee-9.035
acia-8.446
crew-8.430
Token idea
Token ID2126
Feature activation+2.50e+00

Pos logit contributions
 themselves+5.887
 Mandela+5.735
 Zuckerberg+5.731
 THEIR+5.453
 THEY+5.431
Neg logit contributions
pite-8.577
laughs-8.253
ittee-8.146
restricted-7.710
crew-7.560
Token that
Token ID326
Feature activation+1.56e+00

Pos logit contributions
 they+16.695
 themselves+16.517
 their+15.613
 Assad+14.920
 THEY+14.897
Neg logit contributions
laughs-19.674
ittee-19.614
pite-19.461
catentry-18.931
acia-18.566
Token the
Token ID262
Feature activation+6.35e-01

Pos logit contributions
 themselves+11.655
 they+11.432
 THEY+10.906
 Assad+10.899
 Zuckerberg+10.876
Neg logit contributions
laughs-13.991
pite-13.785
ittee-13.559
acia-12.847
crew-12.736
Token donations
Token ID10976
Feature activation+5.51e-01

Pos logit contributions
 themselves+4.817
 Mandela+4.792
 Zuckerberg+4.773
 Tsarnaev+4.580
 THEIR+4.552
Neg logit contributions
pite-6.217
laughs-5.999
ittee-5.906
crew-5.577
restricted-5.565
Token come
Token ID1282
Feature activation+2.39e-01

Pos logit contributions
 themselves+3.728
 Mandela+3.646
 Zuckerberg+3.643
 THEY+3.547
 THEIR+3.494
Neg logit contributions
pite-5.979
laughs-5.775
ittee-5.626
restricted-5.394
acia-5.343
Token from
Token ID422
Feature activation+5.18e-01

Pos logit contributions
 Mandela+1.811
 Zuckerberg+1.808
 themselves+1.782
 THEY+1.753
 THEIR+1.745
Neg logit contributions
pite-2.392
laughs-2.264
ittee-2.232
restricted-2.140
crew-2.102
Token Wednesday
Token ID3583
Feature activation+4.86e-01

Pos logit contributions
 Mandela+2.496
 Zuckerberg+2.496
 themselves+2.489
 THEIR+2.405
 THEY+2.401
Neg logit contributions
pite-3.885
laughs-3.700
ittee-3.647
restricted-3.487
crew-3.436
Token in
Token ID287
Feature activation+4.73e-01

Pos logit contributions
 themselves+3.686
 Zuckerberg+3.644
 Mandela+3.629
 THEY+3.541
 THEIR+3.519
Neg logit contributions
pite-4.837
laughs-4.611
ittee-4.551
restricted-4.329
crew-4.251
Token an
Token ID281
Feature activation+4.80e-01

Pos logit contributions
 themselves+3.087
 Zuckerberg+3.076
 Mandela+3.067
 THEIR+2.956
 THEY+2.920
Neg logit contributions
pite-5.193
laughs-5.001
ittee-4.945
restricted-4.713
crew-4.665
Token attempt
Token ID2230
Feature activation+2.07e-01

Pos logit contributions
 Zuckerberg+3.621
 Mandela+3.617
 themselves+3.607
 THEY+3.476
 THEIR+3.473
Neg logit contributions
pite-4.804
laughs-4.578
ittee-4.491
restricted-4.288
crew-4.234
Token to
Token ID284
Feature activation+6.15e-01

Pos logit contributions
 Zuckerberg+1.877
 Mandela+1.876
 themselves+1.862
 THEY+1.829
 THEIR+1.826
Neg logit contributions
pite-1.758
laughs-1.657
ittee-1.629
restricted-1.543
crew-1.509
Token confirm
Token ID6216
Feature activation+2.49e+00

Pos logit contributions
 themselves+4.853
 Mandela+4.750
 Zuckerberg+4.721
 THEIR+4.589
 THEY+4.569
Neg logit contributions
pite-5.944
laughs-5.643
ittee-5.590
restricted-5.346
crew-5.218
Token the
Token ID262
Feature activation+1.18e+00

Pos logit contributions
 themselves+17.908
 they+17.745
 their+17.232
 THEY+16.300
 THEIR+16.209
Neg logit contributions
pite-18.621
ittee-17.624
laughs-17.538
acia-16.329
cffffcc-16.154
Token spread
Token ID4104
Feature activation+5.89e-02

Pos logit contributions
 themselves+9.012
 Mandela+8.933
 Zuckerberg+8.845
 Mao+8.526
 Tsarnaev+8.448
Neg logit contributions
pite-10.716
ittee-10.335
laughs-10.250
restricted-9.511
acia-9.437
Token of
Token ID286
Feature activation+1.13e-01

Pos logit contributions
 Mandela+0.456
 Zuckerberg+0.456
 themselves+0.449
 THEIR+0.442
 THEY+0.440
Neg logit contributions
pite-0.576
laughs-0.545
ittee-0.538
restricted-0.515
crew-0.505
Token the
Token ID262
Feature activation+2.03e-01

Pos logit contributions
 Zuckerberg+0.876
 Mandela+0.875
 themselves+0.863
 THEY+0.849
 THEIR+0.845
Neg logit contributions
pite-1.092
laughs-1.034
ittee-1.019
restricted-0.980
such-0.964
Token disease
Token ID4369
Feature activation-4.44e-01

Pos logit contributions
 Zuckerberg+1.505
 Mandela+1.503
 themselves+1.495
 THEY+1.459
 THEIR+1.454
Neg logit contributions
pite-2.049
laughs-1.949
ittee-1.919
restricted-1.839
crew-1.806
Token campaign
Token ID1923
Feature activation-3.88e-01

Pos logit contributions
pite+0.917
laughs+0.868
ittee+0.857
restricted+0.818
crew+0.801
Neg logit contributions
 Mandela-0.720
 themselves-0.718
 THEY-0.715
-0.715
 Zuckerberg-0.713
Token is
Token ID318
Feature activation+6.07e-02

Pos logit contributions
pite+3.459
laughs+3.287
ittee+3.261
restricted+3.090
crew+3.090
Neg logit contributions
 Mandela-3.116
-3.097
IELD-3.089
 THEIR-3.080
 THEY-3.047
Token trying
Token ID2111
Feature activation+5.52e-01

Pos logit contributions
 Mandela+0.486
 Zuckerberg+0.485
 themselves+0.479
 THEY+0.475
 THEIR+0.472
Neg logit contributions
pite-0.576
laughs-0.547
ittee-0.538
restricted-0.514
crew-0.505
Token to
Token ID284
Feature activation-3.27e-03

Pos logit contributions
 themselves+4.689
 Zuckerberg+4.675
 Mandela+4.624
 THEIR+4.498
 THEY+4.486
Neg logit contributions
pite-5.016
laughs-4.709
ittee-4.556
restricted-4.377
acia-4.304
Token figure
Token ID3785
Feature activation+1.63e+00

Pos logit contributions
pite+0.032
laughs+0.030
ittee+0.030
restricted+0.028
crew+0.028
Neg logit contributions
 Mandela-0.025
 Zuckerberg-0.025
-0.025
 THEY-0.025
 themselves-0.025
Token out
Token ID503
Feature activation+2.44e+00

Pos logit contributions
 themselves+9.309
 Zuckerberg+8.944
 Mandela+8.389
 THEIR+8.027
 they+7.989
Neg logit contributions
pite-17.525
ittee-16.592
laughs-16.467
restricted-16.264
acia-16.137
Token how
Token ID703
Feature activation+2.81e-01

Pos logit contributions
 themselves+15.708
 Zuckerberg+15.506
 their+14.572
 Mandela+14.429
 they+14.229
Neg logit contributions
pite-19.762
ittee-18.636
laughs-18.515
acia-17.885
restricted-17.863
Token to
Token ID284
Feature activation-2.25e-01

Pos logit contributions
 Mandela+2.473
 Zuckerberg+2.472
 themselves+2.460
 THEY+2.409
 THEIR+2.404
Neg logit contributions
pite-2.453
laughs-2.312
ittee-2.277
restricted-2.159
crew-2.117
Token coordinate
Token ID20435
Feature activation-1.88e-01

Pos logit contributions
pite+2.140
laughs+2.049
ittee+2.015
crew+1.912
restricted+1.901
Neg logit contributions
-1.741
 Mandela-1.723
 Zuckerberg-1.704
 THEY-1.676
 Tsarnaev-1.667
Token it
Token ID340
Feature activation+7.13e-02

Pos logit contributions
pite+1.852
laughs+1.735
ittee+1.720
crew+1.659
such+1.639
Neg logit contributions
-1.407
 Mandela-1.396
IELD-1.369
 Zuckerberg-1.367
 THEY-1.361
Token all
Token ID477
Feature activation+8.22e-02

Pos logit contributions
 Mandela+0.541
 Zuckerberg+0.538
 themselves+0.526
 THEY+0.524
+0.523
Neg logit contributions
pite-0.708
laughs-0.672
ittee-0.663
restricted-0.633
crew-0.627
Token them
Token ID606
Feature activation+1.29e-01

Pos logit contributions
pite+3.890
laughs+3.683
ittee+3.660
crew+3.467
affected+3.427
Neg logit contributions
 Zuckerberg-3.044
 Mandela-3.038
-3.021
 THEY-2.963
 Tsarnaev-2.918
Token out
Token ID503
Feature activation+2.75e-01

Pos logit contributions
 Zuckerberg+0.974
 Mandela+0.971
 themselves+0.956
 THEY+0.942
 THEIR+0.938
Neg logit contributions
pite-1.289
laughs-1.228
ittee-1.207
restricted-1.150
crew-1.135
Token,
Token ID11
Feature activation+1.05e+00

Pos logit contributions
 Zuckerberg+2.182
 Mandela+2.181
 themselves+2.176
 THEY+2.120
 THEIR+2.114
Neg logit contributions
pite-2.644
laughs-2.508
ittee-2.467
restricted-2.358
crew-2.312
Token can
Token ID460
Feature activation+8.47e-01

Pos logit contributions
 themselves+7.534
 Zuckerberg+7.054
 Mandela+6.987
 THEY+6.935
 THEIR+6.832
Neg logit contributions
pite-10.429
laughs-9.881
ittee-9.875
restricted-9.577
crew-9.304
Token find
Token ID1064
Feature activation+1.47e+00

Pos logit contributions
 themselves+6.099
 Mandela+5.686
 Zuckerberg+5.641
 THEY+5.563
 THEIR+5.562
Neg logit contributions
pite-8.712
laughs-8.293
ittee-8.222
restricted-8.036
acia-7.793
Token out
Token ID503
Feature activation+2.43e+00

Pos logit contributions
 themselves+11.076
 Zuckerberg+9.749
 THEIR+9.700
 Mandela+9.608
 their+9.415
Neg logit contributions
pite-13.913
ittee-13.214
laughs-13.033
restricted-12.759
acia-12.639
Token how
Token ID703
Feature activation+1.92e-01

Pos logit contributions
 themselves+16.663
 they+15.278
 their+14.930
 THEIR+14.912
 Zuckerberg+14.768
Neg logit contributions
pite-19.037
ittee-18.281
restricted-17.501
laughs-17.340
acia-17.218
Token to
Token ID284
Feature activation+1.47e-01

Pos logit contributions
 Zuckerberg+1.624
 Mandela+1.624
 themselves+1.609
 THEY+1.575
 THEIR+1.573
Neg logit contributions
pite-1.740
laughs-1.641
ittee-1.614
restricted-1.538
crew-1.508
Token catch
Token ID4929
Feature activation+7.21e-01

Pos logit contributions
 Zuckerberg+1.068
 Mandela+1.066
 themselves+1.045
+1.033
 THEY+1.030
Neg logit contributions
pite-1.510
laughs-1.440
ittee-1.414
restricted-1.353
crew-1.335
Token these
Token ID777
Feature activation+3.98e-01

Pos logit contributions
 themselves+5.552
 Zuckerberg+5.370
 Mandela+5.324
 THEIR+5.219
 THEY+5.190
Neg logit contributions
pite-7.043
ittee-6.719
laughs-6.706
restricted-6.371
acia-6.233
Token workshop
Token ID20243
Feature activation+6.50e-01

Pos logit contributions
 themselves+3.106
 Mandela+3.092
 Zuckerberg+3.091
 THEY+3.006
 THEIR+2.996
Neg logit contributions
pite-3.865
laughs-3.661
ittee-3.626
restricted-3.462
crew-3.383
Tokenuit
Token ID5013
Feature activation+8.92e-01

Pos logit contributions
pite+0.599
laughs+0.558
ittee+0.551
acia+0.519
restricted+0.519
Neg logit contributions
 Mandela-0.649
 themselves-0.647
 Zuckerberg-0.646
 THEIR-0.641
 THEY-0.637
Token and
Token ID290
Feature activation+5.04e-02

Pos logit contributions
 themselves+7.575
 Zuckerberg+7.507
 Mandela+7.449
 Tsarnaev+7.147
 THEY+7.145
Neg logit contributions
pite-7.410
laughs-7.114
ittee-6.940
restricted-6.519
crew-6.395
Token Thomas
Token ID5658
Feature activation+7.39e-01

Pos logit contributions
 Mandela+0.353
 Zuckerberg+0.353
 themselves+0.351
 THEY+0.345
+0.343
Neg logit contributions
pite-0.527
laughs-0.502
ittee-0.493
restricted-0.474
such-0.466
Token M
Token ID337
Feature activation+6.55e-01

Pos logit contributions
 Zuckerberg+5.020
 Mandela+5.007
 themselves+4.859
 Tsarnaev+4.715
 THEY+4.677
Neg logit contributions
pite-7.807
laughs-7.478
ittee-7.447
restricted-7.129
crew-6.977
Token.
Token ID13
Feature activation+1.46e+00

Pos logit contributions
 Zuckerberg+5.209
 Mandela+5.141
 themselves+5.135
 Tsarnaev+4.952
 THEY+4.938
Neg logit contributions
pite-6.132
laughs-5.868
ittee-5.675
restricted-5.531
crew-5.372
Token Smith
Token ID4176
Feature activation+2.43e+00

Pos logit contributions
 Zuckerberg+10.267
 Mandela+10.211
 themselves+9.722
 Tsarnaev+9.648
 Mao+9.485
Neg logit contributions
pite-13.302
ittee-12.742
laughs-12.686
restricted-12.162
cffffcc-11.926
Token at
Token ID379
Feature activation+7.91e-01

Pos logit contributions
 themselves+17.791
 they+16.600
 Zuckerberg+16.133
 MIT+16.055
 """+15.888
Neg logit contributions
laughs-15.229
pite-15.025
esides-13.958
acia-13.848
ittee-13.526
Token the
Token ID262
Feature activation+9.23e-01

Pos logit contributions
 themselves+5.424
 Zuckerberg+5.403
 Mandela+5.327
 MIT+5.185
 THEIR+5.154
Neg logit contributions
pite-8.277
laughs-7.865
ittee-7.810
restricted-7.457
crew-7.379
Token National
Token ID2351
Feature activation-5.38e-01

Pos logit contributions
 Zuckerberg+5.881
 themselves+5.876
 Mandela+5.741
 MIT+5.663
 THEY+5.455
Neg logit contributions
pite-9.850
ittee-9.460
laughs-9.422
crew-8.995
restricted-8.987
Token Center
Token ID3337
Feature activation+3.25e-02

Pos logit contributions
pite+4.811
laughs+4.433
ittee+4.412
restricted+4.331
acia+4.282
Neg logit contributions
 THEIR-4.270
 Zuckerberg-4.251
 """-4.227
 Mandela-4.226
IELD-4.211
Token for
Token ID329
Feature activation-6.48e-01

Pos logit contributions
 Mandela+0.211
 Zuckerberg+0.208
 themselves+0.205
 THEIR+0.204
 THEY+0.203
Neg logit contributions
pite-0.360
ittee-0.342
laughs-0.342
restricted-0.325
crew-0.320
Token are
Token ID389
Feature activation+4.38e-01

Pos logit contributions
pite+3.085
laughs+2.927
ittee+2.897
 comprises+2.776
restricted+2.751
Neg logit contributions
 Mandela-3.536
IELD-3.529
-3.513
 THEIR-3.512
 Zuckerberg-3.501
Token not
Token ID407
Feature activation+7.05e-01

Pos logit contributions
 themselves+3.499
 Zuckerberg+3.491
 Mandela+3.470
 THEY+3.369
 THEIR+3.366
Neg logit contributions
pite-4.198
laughs-3.995
ittee-3.949
restricted-3.729
crew-3.670
Token mysterious
Token ID11428
Feature activation+3.26e-01

Pos logit contributions
 themselves+5.958
 Zuckerberg+5.882
 Mandela+5.800
 THEIR+5.656
 THEY+5.622
Neg logit contributions
pite-6.422
ittee-6.108
laughs-6.107
restricted-5.655
crew-5.595
Token
Token ID851
Feature activation+1.49e+00

Pos logit contributions
 Zuckerberg+2.616
 Mandela+2.610
 themselves+2.603
 THEY+2.535
 THEIR+2.528
Neg logit contributions
pite-3.107
laughs-2.945
ittee-2.901
restricted-2.764
crew-2.711
Token find
Token ID1064
Feature activation+1.25e+00

Pos logit contributions
 themselves+12.954
 Zuckerberg+12.849
 they+12.577
 Mandela+12.423
 THEY+12.409
Neg logit contributions
pite-11.344
ittee-10.491
laughs-10.395
restricted-9.759
acia-9.548
Token out
Token ID503
Feature activation+2.39e+00

Pos logit contributions
 themselves+9.194
 Zuckerberg+9.109
 Mandela+8.723
 Tsarnaev+8.608
 THEIR+8.458
Neg logit contributions
pite-12.091
ittee-11.434
laughs-11.224
restricted-10.617
acia-10.531
Token the
Token ID262
Feature activation+1.76e+00

Pos logit contributions
 themselves+16.315
 Zuckerberg+14.943
 they+14.544
 their+14.466
 THEIR+14.362
Neg logit contributions
pite-19.638
ittee-18.122
laughs-17.127
restricted-16.960
acia-16.847
Token ideas
Token ID4213
Feature activation+1.02e+00

Pos logit contributions
 Zuckerberg+13.542
 themselves+13.471
 Mandela+13.356
 Tsarnaev+13.255
 basics+13.091
Neg logit contributions
pite-13.837
ittee-13.775
laughs-12.862
acia-12.200
restricted-12.112
Token that
Token ID326
Feature activation+1.42e+00

Pos logit contributions
 themselves+8.229
 Zuckerberg+7.963
 Mandela+7.871
 THEY+7.714
 Tsarnaev+7.515
Neg logit contributions
pite-9.685
laughs-9.289
ittee-8.963
restricted-8.529
acia-8.425
Token young
Token ID1862
Feature activation-2.27e-01

Pos logit contributions
 themselves+11.569
 Zuckerberg+11.427
 Mandela+11.289
 THEY+11.155
 they+11.085
Neg logit contributions
pite-12.071
laughs-11.710
ittee-11.108
acia-10.575
restricted-10.493
Token people
Token ID661
Feature activation+1.09e+00

Pos logit contributions
pite+1.724
laughs+1.623
ittee+1.593
restricted+1.528
crew+1.523
Neg logit contributions
-2.137
 Mandela-2.132
 THEIR-2.113
 THEY-2.113
IELD-2.101
Token adhere
Token ID26325
Feature activation+7.28e-01

Pos logit contributions
 themselves+3.228
 Zuckerberg+3.164
 Mandela+3.158
 THEY+3.087
 THEIR+3.075
Neg logit contributions
pite-4.446
laughs-4.252
ittee-4.181
restricted-4.022
crew-3.932
Token to
Token ID284
Feature activation+2.64e-01

Pos logit contributions
 themselves+5.947
 Zuckerberg+5.678
 Mandela+5.583
 THEIR+5.566
 THEY+5.525
Neg logit contributions
pite-6.758
laughs-6.442
ittee-6.225
restricted-6.025
crew-5.863
Token this
Token ID428
Feature activation+9.97e-01

Pos logit contributions
 Zuckerberg+2.212
 Mandela+2.208
 themselves+2.203
 THEY+2.150
 THEIR+2.146
Neg logit contributions
pite-2.425
laughs-2.296
ittee-2.257
restricted-2.148
crew-2.108
Token and
Token ID290
Feature activation-1.82e-02

Pos logit contributions
 themselves+7.418
 Zuckerberg+7.100
 Mandela+7.038
 THEIR+6.879
 THEY+6.873
Neg logit contributions
pite-9.846
laughs-9.346
ittee-9.121
crew-8.774
acia-8.722
Token work
Token ID670
Feature activation+5.53e-01

Pos logit contributions
pite+0.180
laughs+0.170
ittee+0.168
restricted+0.160
such+0.158
Neg logit contributions
 Mandela-0.139
 Zuckerberg-0.138
 themselves-0.136
-0.135
 THEY-0.134
Token out
Token ID503
Feature activation+2.38e+00

Pos logit contributions
 themselves+4.171
 Mandela+4.039
 Zuckerberg+4.023
 THEIR+3.955
 THEY+3.936
Neg logit contributions
pite-5.548
laughs-5.323
ittee-5.225
restricted-5.002
acia-4.922
Token a
Token ID257
Feature activation+1.06e+00

Pos logit contributions
 themselves+17.540
 their+16.658
 they+16.280
 THEIR+15.939
 Zuckerberg+15.351
Neg logit contributions
ittee-17.223
pite-17.121
laughs-16.521
cffffcc-15.980
restricted-15.910
Token refund
Token ID12929
Feature activation+1.81e+00

Pos logit contributions
 themselves+7.490
 Zuckerberg+7.230
 Mandela+7.172
 THEIR+7.102
 """+7.053
Neg logit contributions
pite-10.478
ittee-10.033
laughs-9.955
acia-9.520
restricted-9.384
Token policy
Token ID2450
Feature activation+4.91e-01

Pos logit contributions
 themselves+12.836
 """+12.092
 they+11.898
 THEY+11.686
 Mandela+11.601
Neg logit contributions
pite-15.842
laughs-14.836
ittee-14.477
acia-13.943
restricted-13.839
Token if
Token ID611
Feature activation+4.17e-01

Pos logit contributions
 themselves+3.510
 Zuckerberg+3.445
 Mandela+3.415
 THEY+3.349
 THEIR+3.320
Neg logit contributions
pite-5.131
laughs-4.907
ittee-4.818
restricted-4.626
crew-4.553
Token applicable
Token ID9723
Feature activation+5.78e-03

Pos logit contributions
 themselves+2.717
 Zuckerberg+2.702
 Mandela+2.685
 THEY+2.636
 THEIR+2.609
Neg logit contributions
pite-4.608
laughs-4.422
ittee-4.354
restricted-4.174
crew-4.116
Tokent
Token ID83
Feature activation+1.75e-01

Pos logit contributions
pite+0.209
laughs+0.195
ittee+0.191
restricted+0.180
such+0.177
Neg logit contributions
 Zuckerberg-0.310
 Mandela-0.309
 themselves-0.303
 THEIR-0.302
 THEY-0.301
Token stupid
Token ID8531
Feature activation+6.26e-01

Pos logit contributions
 Zuckerberg+1.441
 Mandela+1.441
 themselves+1.429
 THEY+1.401
 THEIR+1.397
Neg logit contributions
pite-1.627
laughs-1.541
ittee-1.514
restricted-1.443
crew-1.416
Token.
Token ID13
Feature activation+1.05e+00

Pos logit contributions
 themselves+4.782
 Zuckerberg+4.638
 Mandela+4.577
 THEY+4.532
 THEIR+4.483
Neg logit contributions
pite-6.199
laughs-5.875
ittee-5.819
restricted-5.554
acia-5.454
Token They
Token ID1119
Feature activation+1.35e+00

Pos logit contributions
 Zuckerberg+8.898
 THEY+8.867
 themselves+8.774
 Mandela+8.761
 THEIR+8.585
Neg logit contributions
pite-9.218
ittee-8.444
laughs-8.326
restricted-8.003
acia-7.894
Token figured
Token ID11638
Feature activation+1.51e+00

Pos logit contributions
 themselves+10.024
 THEY+9.085
 """+8.997
ARE+8.987
 Zuckerberg+8.977
Neg logit contributions
pite-12.150
laughs-11.516
ittee-11.110
restricted-10.881
acia-10.730
Token out
Token ID503
Feature activation+2.37e+00

Pos logit contributions
 themselves+11.077
 they+10.475
 THEY+10.472
 Zuckerberg+10.422
 Mandela+10.257
Neg logit contributions
pite-13.862
ittee-13.336
laughs-12.933
acia-12.610
restricted-12.540
Token that
Token ID326
Feature activation+9.18e-01

Pos logit contributions
 themselves+17.485
 they+16.970
 Zuckerberg+16.233
 THEY+16.147
 Mandela+16.082
Neg logit contributions
pite-17.045
ittee-16.312
laughs-15.933
acia-15.571
restricted-15.556
Token the
Token ID262
Feature activation+2.30e-02

Pos logit contributions
 themselves+7.127
 Zuckerberg+7.008
 Mandela+6.994
 THEY+6.925
 THEIR+6.845
Neg logit contributions
pite-8.707
laughs-8.262
ittee-8.222
acia-7.780
restricted-7.756
Token only
Token ID691
Feature activation+4.16e-01

Pos logit contributions
 Zuckerberg+0.163
 Mandela+0.163
 themselves+0.162
 THEY+0.160
 THEIR+0.159
Neg logit contributions
pite-0.236
laughs-0.226
ittee-0.221
restricted-0.215
such-0.210
Token way
Token ID835
Feature activation+5.27e-01

Pos logit contributions
 Zuckerberg+2.738
 Mandela+2.732
 themselves+2.724
 THEY+2.626
 THEIR+2.612
Neg logit contributions
pite-4.580
laughs-4.366
ittee-4.322
restricted-4.125
acia-4.079
Token to
Token ID284
Feature activation-4.58e-02

Pos logit contributions
 Zuckerberg+4.332
 themselves+4.331
 Mandela+4.313
 THEY+4.236
 THEIR+4.192
Neg logit contributions
pite-4.906
laughs-4.640
ittee-4.578
restricted-4.337
acia-4.272
Token with
Token ID351
Feature activation+4.36e-01

Pos logit contributions
 Zuckerberg+0.373
 Mandela+0.372
 themselves+0.370
+0.368
 THEY+0.368
Neg logit contributions
pite-0.350
laughs-0.330
ittee-0.322
restricted-0.309
such-0.304
Token the
Token ID262
Feature activation+6.75e-01

Pos logit contributions
 Zuckerberg+3.411
 Mandela+3.395
 themselves+3.382
 Tsarnaev+3.261
 THEIR+3.241
Neg logit contributions
pite-4.261
laughs-4.059
ittee-4.003
restricted-3.800
crew-3.739
Token goal
Token ID3061
Feature activation+5.46e-01

Pos logit contributions
 Zuckerberg+5.577
 Mandela+5.563
 themselves+5.515
 Tsarnaev+5.446
 THEIR+5.212
Neg logit contributions
pite-6.179
laughs-5.914
ittee-5.850
restricted-5.485
crew-5.448
Token of
Token ID286
Feature activation+9.01e-01

Pos logit contributions
 themselves+3.826
 Zuckerberg+3.790
 Mandela+3.738
 THEY+3.576
 """+3.557
Neg logit contributions
pite-5.716
laughs-5.497
ittee-5.342
restricted-5.156
crew-5.080
Token figuring
Token ID22714
Feature activation+1.34e+00

Pos logit contributions
 themselves+7.538
 Zuckerberg+7.413
 Mandela+7.392
 Tsarnaev+7.152
 Mao+6.983
Neg logit contributions
pite-8.100
laughs-7.714
ittee-7.604
restricted-7.135
acia-7.098
Token out
Token ID503
Feature activation+2.36e+00

Pos logit contributions
 themselves+7.156
 Zuckerberg+6.768
 Mandela+6.452
 Tsarnaev+6.244
 THEY+6.131
Neg logit contributions
pite-15.627
ittee-14.812
laughs-14.729
restricted-14.098
acia-14.079
Token why
Token ID1521
Feature activation+1.04e+00

Pos logit contributions
 themselves+16.109
 Zuckerberg+15.154
 their+15.096
 they+14.953
 Mandela+14.704
Neg logit contributions
pite-18.471
ittee-18.169
laughs-17.399
restricted-16.493
acia-16.276
Token some
Token ID617
Feature activation+2.69e-01

Pos logit contributions
 themselves+8.252
 Zuckerberg+8.140
 Mandela+8.018
 THEY+7.878
 Tsarnaev+7.874
Neg logit contributions
pite-9.647
ittee-9.334
laughs-9.272
restricted-8.566
acia-8.538
Token Muslims
Token ID7045
Feature activation+8.22e-01

Pos logit contributions
 Zuckerberg+2.109
 Mandela+2.107
 themselves+2.096
 THEY+2.046
 THEIR+2.040
Neg logit contributions
pite-2.614
laughs-2.481
ittee-2.442
restricted-2.333
crew-2.290
Token become
Token ID1716
Feature activation-6.98e-02

Pos logit contributions
 themselves+6.260
 Zuckerberg+5.936
 Mandela+5.847
 THEY+5.718
 Tsarnaev+5.710
Neg logit contributions
pite-8.181
laughs-7.860
ittee-7.720
restricted-7.395
acia-7.222
Token terrorists
Token ID10509
Feature activation+4.52e-01

Pos logit contributions
pite+0.723
laughs+0.691
ittee+0.678
restricted+0.655
such+0.652
Neg logit contributions
 Mandela-0.486
 Zuckerberg-0.481
-0.477
 THEY-0.474
 themselves-0.473
Token Dallas
Token ID8533
Feature activation+4.77e-01

Pos logit contributions
 themselves+8.271
 Zuckerberg+8.191
 Mandela+7.944
 they+7.904
 THEY+7.900
Neg logit contributions
pite-9.904
laughs-9.534
ittee-9.452
acia-9.125
restricted-8.809
Token has
Token ID468
Feature activation+4.33e-01

Pos logit contributions
 themselves+4.261
 Zuckerberg+4.218
 Mandela+4.189
 THEY+4.100
 THEIR+4.081
Neg logit contributions
pite-4.130
laughs-3.894
ittee-3.849
restricted-3.605
acia-3.565
Token to
Token ID284
Feature activation+4.73e-01

Pos logit contributions
 themselves+3.355
 Zuckerberg+3.342
 Mandela+3.319
 THEIR+3.232
 THEY+3.223
Neg logit contributions
pite-4.267
laughs-4.045
ittee-3.986
restricted-3.770
acia-3.741
Token decide
Token ID5409
Feature activation+2.07e+00

Pos logit contributions
 themselves+3.462
 Zuckerberg+3.413
 Mandela+3.396
 THEY+3.315
 THEIR+3.307
Neg logit contributions
pite-4.859
laughs-4.631
ittee-4.554
restricted-4.356
acia-4.298
Token almost
Token ID2048
Feature activation+9.66e-01

Pos logit contributions
 themselves+14.692
 they+13.860
 Zuckerberg+13.301
 their+13.191
 THEY+13.114
Neg logit contributions
pite-17.406
laughs-16.737
ittee-16.408
acia-15.897
restricted-15.534
Token immediately
Token ID3393
Feature activation+2.34e+00

Pos logit contributions
 themselves+6.503
 Zuckerberg+6.304
 THEIR+6.134
 Mandela+6.131
 THEY+6.086
Neg logit contributions
pite-10.156
ittee-9.747
laughs-9.695
acia-9.227
restricted-9.225
Token into
Token ID656
Feature activation+1.25e+00

Pos logit contributions
 themselves+15.644
 they+14.800
 Zuckerberg+13.922
 their+13.899
 THEY+13.870
Neg logit contributions
pite-18.961
laughs-17.906
ittee-17.889
acia-17.091
restricted-16.981
Token the
Token ID262
Feature activation+8.87e-01

Pos logit contributions
 themselves+9.593
 Zuckerberg+9.162
 Mandela+8.956
 THEIR+8.919
 their+8.611
Neg logit contributions
pite-11.652
laughs-11.355
ittee-11.339
acia-10.629
restricted-10.391
Token offseason
Token ID16349
Feature activation+1.92e+00

Pos logit contributions
 themselves+6.066
 Zuckerberg+6.058
 Mandela+5.988
 THEY+5.755
 THEIR+5.742
Neg logit contributions
pite-9.117
ittee-8.814
laughs-8.796
acia-8.195
restricted-8.106
Token whether
Token ID1771
Feature activation+1.47e+00

Pos logit contributions
 themselves+13.311
 they+12.370
 Zuckerberg+12.158
 THEY+11.959
 THEIR+11.908
Neg logit contributions
pite-17.287
laughs-16.312
ittee-15.943
restricted-15.459
acia-15.403
Token they
Token ID484
Feature activation+1.38e+00

Pos logit contributions
 themselves+10.048
 they+9.861
 THEY+9.483
 Zuckerberg+9.357
 Mandela+9.237
Neg logit contributions
pite-14.215
laughs-13.742
ittee-13.255
acia-12.889
cffffcc-12.799
Token which
Token ID543
Feature activation-2.01e-01

Pos logit contributions
pite+1.325
laughs+1.248
ittee+1.221
restricted+1.178
acia+1.153
Neg logit contributions
 THEY-1.161
 Mandela-1.160
 Zuckerberg-1.158
 themselves-1.152
-1.147
Token two
Token ID734
Feature activation-7.73e-02

Pos logit contributions
pite+1.831
laughs+1.749
ittee+1.710
restricted+1.640
crew+1.629
Neg logit contributions
 Mandela-1.606
-1.599
 Zuckerberg-1.596
 THEIR-1.583
 themselves-1.579
Token families
Token ID4172
Feature activation-4.55e-04

Pos logit contributions
pite+0.697
laughs+0.664
ittee+0.649
restricted+0.623
crew+0.618
Neg logit contributions
 Zuckerberg-0.639
 Mandela-0.636
-0.632
 themselves-0.629
 THEIR-0.628
Token compete
Token ID9320
Feature activation+6.96e-01

Pos logit contributions
pite+0.004
laughs+0.004
ittee+0.004
restricted+0.004
crew+0.004
Neg logit contributions
 Mandela-0.003
 Zuckerberg-0.003
-0.003
 THEIR-0.003
 THEY-0.003
Token to
Token ID284
Feature activation+5.43e-01

Pos logit contributions
 themselves+5.247
 Zuckerberg+5.110
 Mandela+5.075
 THEY+4.952
 THEIR+4.924
Neg logit contributions
pite-6.984
laughs-6.682
ittee-6.567
restricted-6.302
acia-6.117
Token guess
Token ID4724
Feature activation+2.33e+00

Pos logit contributions
 themselves+3.786
 Zuckerberg+3.756
 Mandela+3.738
 THEY+3.612
 THEIR+3.611
Neg logit contributions
pite-5.759
laughs-5.495
ittee-5.412
restricted-5.235
crew-5.099
Token the
Token ID262
Feature activation+1.38e+00

Pos logit contributions
 themselves+16.078
 their+15.118
 Zuckerberg+14.981
 THEIR+14.852
 Mandela+14.820
Neg logit contributions
pite-18.548
laughs-17.567
ittee-17.522
restricted-17.123
acia-16.212
Token most
Token ID749
Feature activation+3.67e-01

Pos logit contributions
 Mandela+9.258
 Zuckerberg+9.202
 themselves+8.997
 THEIR+8.481
 THEY+8.413
Neg logit contributions
pite-13.238
ittee-12.879
laughs-12.726
restricted-12.254
affected-11.805
Token popular
Token ID2968
Feature activation+2.98e-01

Pos logit contributions
 Zuckerberg+3.088
 Mandela+3.085
 themselves+3.073
 THEY+3.001
 THEIR+2.993
Neg logit contributions
pite-3.357
laughs-3.177
ittee-3.125
restricted-2.975
crew-2.915
Token answers
Token ID7429
Feature activation+9.47e-01

Pos logit contributions
 Zuckerberg+2.372
 Mandela+2.370
 themselves+2.358
 THEY+2.304
 THEIR+2.297
Neg logit contributions
pite-2.855
laughs-2.708
ittee-2.663
restricted-2.543
crew-2.497
Token to
Token ID284
Feature activation+1.11e+00

Pos logit contributions
 themselves+7.267
 Zuckerberg+7.074
 Mandela+7.024
 THEY+6.891
 THEIR+6.699
Neg logit contributions
pite-9.139
laughs-8.584
ittee-8.533
restricted-8.144
acia-8.004
Tokenv
Token ID85
Feature activation+4.67e-01

Pos logit contributions
pite+6.909
laughs+6.706
such+6.318
restricted+6.247
ittee+6.188
Neg logit contributions
 Mandela-9.507
-9.463
IELD-9.453
 THEIR-9.444
 Zuckerberg-9.351
Token2
Token ID17
Feature activation+3.51e-01

Pos logit contributions
 themselves+3.318
 """+3.145
 Mandela+3.141
ARE+3.136
 Zuckerberg+3.134
Neg logit contributions
pite-4.863
ittee-4.610
 destro-4.544
 exha-4.488
laughs-4.464
Token.
Token ID764
Feature activation+1.32e+00

Pos logit contributions
 themselves+1.733
 Zuckerberg+1.670
 Mandela+1.655
 THEY+1.601
 """+1.591
Neg logit contributions
pite-4.465
ittee-4.269
laughs-4.213
restricted-4.071
acia-4.030
Token MOR
Token ID35208
Feature activation+1.43e+00

Pos logit contributions
 THEIR+5.601
 MIT+5.547
 themselves+5.462
 THEY+5.443
 """+5.424
Neg logit contributions
pite-16.379
ittee-15.492
laughs-15.400
restricted-14.824
acia-14.694
TokenPH
Token ID11909
Feature activation+1.93e+00

Pos logit contributions
ARE+5.044
WARE+4.886
IELD+4.735
 THEY+4.523
OX+4.495
Neg logit contributions
 destro-18.795
 exha-18.454
ortunately-17.895
pite-17.567
etheless-17.491
Token_
Token ID62
Feature activation+2.30e+00

Pos logit contributions
ARE+4.126
 themselves+4.101
IELD+3.799
 MIT+3.652
 THEY+3.470
Neg logit contributions
 exha-27.467
 destro-27.213
ortunately-26.476
 tremend-25.851
eatures-25.408
TokenBL
Token ID9148
Feature activation+1.32e+00

Pos logit contributions
ARE+8.449
LED+6.660
IELD+6.385
 THEIR+6.083
WARE+6.059
Neg logit contributions
 destro-29.427
 exha-28.232
 proble-27.639
 tremend-27.617
 obser-27.546
TokenACK
Token ID8120
Feature activation+1.44e+00

Pos logit contributions
ARE+4.831
IELD+4.617
 themselves+4.028
 Mandela+3.971
 THEY+3.960
Neg logit contributions
 destro-17.992
 exha-17.420
ittee-17.116
 obser-17.106
pite-17.055
TokenH
Token ID39
Feature activation+1.16e+00

Pos logit contributions
ARE+5.404
 THEY+5.099
IELD+5.037
 THEIR+4.838
WARE+4.668
Neg logit contributions
 destro-18.718
 exha-17.827
ittee-17.754
 obser-17.708
pite-17.621
TokenAT
Token ID1404
Feature activation+6.18e-01

Pos logit contributions
ARE+5.793
IELD+4.842
 THEY+4.791
 themselves+4.787
 THEIR+4.736
Neg logit contributions
 destro-14.730
 obser-14.058
pite-13.952
 exha-13.916
ittee-13.538
Token,
Token ID837
Feature activation+2.87e-01

Pos logit contributions
 themselves+3.673
 THEY+3.505
 Mandela+3.500
 Zuckerberg+3.478
 """+3.448
Neg logit contributions
pite-7.094
ittee-6.934
 destro-6.665
laughs-6.657
 exha-6.585
Token supreme
Token ID17700
Feature activation-1.42e+00

Pos logit contributions
pite+7.627
such+7.342
 Creatures+6.975
laughs+6.731
ittee+6.723
Neg logit contributions
-10.206
 Mandela-9.640
 Zuckerberg-9.623
 """-9.545
IELD-9.459
Token Court
Token ID3078
Feature activation-1.75e+00

Pos logit contributions
 Creatures+7.404
such+7.399
pite+7.285
 trib+6.959
 Matters+6.658
Neg logit contributions
-15.182
 Mandela-13.908
WARE-13.794
celona-13.179
 Tsarnaev-13.168
Token,
Token ID11
Feature activation-1.96e+00

Pos logit contributions
pite+12.193
ittee+11.498
such+10.785
affected+10.571
laughs+10.360
Neg logit contributions
-13.245
 Mandela-12.907
 Zuckerberg-12.600
ARE-12.294
 THEY-12.240
Token and
Token ID290
Feature activation-2.47e+00

Pos logit contributions
ittee+14.660
such+14.444
pite+14.425
 Creatures+13.585
 trib+13.428
Neg logit contributions
-13.312
 Mandela-12.027
 Tsarnaev-11.411
celona-11.343
IELD-11.237
Token all
Token ID477
Feature activation-2.80e+00

Pos logit contributions
such+18.328
ittee+18.193
pite+17.878
 Creatures+17.331
 trib+16.949
Neg logit contributions
-15.012
 Mandela-13.474
IELD-12.422
ARE-12.126
celona-12.124
Token other
Token ID584
Feature activation-3.01e+00

Pos logit contributions
 Creatures+18.882
such+18.823
 Matters+18.011
ittee+17.802
pite+17.579
Neg logit contributions
-17.023
 Mandela-15.203
ARE-14.404
WARE-14.108
 Zuckerberg-14.068
Token Officers
Token ID23512
Feature activation-1.61e+00

Pos logit contributions
 Creatures+18.457
such+18.225
 Matters+17.883
 notable+17.653
 trib+17.368
Neg logit contributions
-19.051
 Mandela-16.492
celona-16.289
WARE-16.173
 Zuckerberg-15.778
Token of
Token ID286
Feature activation-1.20e+00

Pos logit contributions
pite+11.154
ittee+10.513
 therefore+9.967
such+9.876
affected+9.768
Neg logit contributions
-13.316
 Mandela-12.506
 Zuckerberg-12.259
IELD-12.100
celona-12.069
Token the
Token ID262
Feature activation-1.22e+00

Pos logit contributions
pite+8.630
such+8.495
 tho+7.891
restricted+7.808
ittee+7.783
Neg logit contributions
-10.675
 Mandela-10.196
 Zuckerberg-10.035
 Tsarnaev-10.013
WARE-9.645
Token United
Token ID1578
Feature activation-1.20e+00

Pos logit contributions
pite+9.491
such+9.144
 Creatures+8.822
ittee+8.513
affected+8.430
Neg logit contributions
-10.361
 Mandela-9.614
WARE-9.478
 Zuckerberg-9.378
 Tsarnaev-9.274
Token States
Token ID1829
Feature activation-1.36e+00

Pos logit contributions
pite+6.465
laughs+5.966
ittee+5.883
such+5.771
restricted+5.420
Neg logit contributions
-13.272
 Mandela-12.300
WARE-12.189
 Zuckerberg-12.114
IELD-12.076
Token the
Token ID262
Feature activation-1.10e+00

Pos logit contributions
pite+8.408
such+8.104
ittee+7.814
restricted+7.748
laughs+7.639
Neg logit contributions
-8.363
 Zuckerberg-7.882
 Mandela-7.727
 """-7.676
 Tsarnaev-7.561
Token supreme
Token ID17700
Feature activation-1.42e+00

Pos logit contributions
pite+7.627
such+7.342
 Creatures+6.975
laughs+6.731
ittee+6.723
Neg logit contributions
-10.206
 Mandela-9.640
 Zuckerberg-9.623
 """-9.545
IELD-9.459
Token Court
Token ID3078
Feature activation-1.75e+00

Pos logit contributions
 Creatures+7.404
such+7.399
pite+7.285
 trib+6.959
 Matters+6.658
Neg logit contributions
-15.182
 Mandela-13.908
WARE-13.794
celona-13.179
 Tsarnaev-13.168
Token,
Token ID11
Feature activation-1.96e+00

Pos logit contributions
pite+12.193
ittee+11.498
such+10.785
affected+10.571
laughs+10.360
Neg logit contributions
-13.245
 Mandela-12.907
 Zuckerberg-12.600
ARE-12.294
 THEY-12.240
Token and
Token ID290
Feature activation-2.47e+00

Pos logit contributions
ittee+14.660
such+14.444
pite+14.425
 Creatures+13.585
 trib+13.428
Neg logit contributions
-13.312
 Mandela-12.027
 Tsarnaev-11.411
celona-11.343
IELD-11.237
Token all
Token ID477
Feature activation-2.80e+00

Pos logit contributions
such+18.328
ittee+18.193
pite+17.878
 Creatures+17.331
 trib+16.949
Neg logit contributions
-15.012
 Mandela-13.474
IELD-12.422
ARE-12.126
celona-12.124
Token other
Token ID584
Feature activation-3.01e+00

Pos logit contributions
 Creatures+18.882
such+18.823
 Matters+18.011
ittee+17.802
pite+17.579
Neg logit contributions
-17.023
 Mandela-15.203
ARE-14.404
WARE-14.108
 Zuckerberg-14.068
Token Officers
Token ID23512
Feature activation-1.61e+00

Pos logit contributions
 Creatures+18.457
such+18.225
 Matters+17.883
 notable+17.653
 trib+17.368
Neg logit contributions
-19.051
 Mandela-16.492
celona-16.289
WARE-16.173
 Zuckerberg-15.778
Token of
Token ID286
Feature activation-1.20e+00

Pos logit contributions
pite+11.154
ittee+10.513
 therefore+9.967
such+9.876
affected+9.768
Neg logit contributions
-13.316
 Mandela-12.506
 Zuckerberg-12.259
IELD-12.100
celona-12.069
Token the
Token ID262
Feature activation-1.22e+00

Pos logit contributions
pite+8.630
such+8.495
 tho+7.891
restricted+7.808
ittee+7.783
Neg logit contributions
-10.675
 Mandela-10.196
 Zuckerberg-10.035
 Tsarnaev-10.013
WARE-9.645
Token United
Token ID1578
Feature activation-1.20e+00

Pos logit contributions
pite+9.491
such+9.144
 Creatures+8.822
ittee+8.513
affected+8.430
Neg logit contributions
-10.361
 Mandela-9.614
WARE-9.478
 Zuckerberg-9.378
 Tsarnaev-9.274
Token606
Token ID33206
Feature activation+9.46e-01

Pos logit contributions
 Zuckerberg+0.859
 Mandela+0.855
 themselves+0.853
 THEY+0.834
 THEIR+0.832
Neg logit contributions
pite-1.033
laughs-0.980
ittee-0.956
restricted-0.920
crew-0.903
Token 7
Token ID767
Feature activation+2.98e-01

Pos logit contributions
 Mandela+6.109
 themselves+6.052
 Zuckerberg+5.987
 THEY+5.873
 THEIR+5.857
Neg logit contributions
pite-9.929
ittee-9.579
laughs-9.471
restricted-9.006
affected-8.826
Token201
Token ID1264
Feature activation+9.24e-01

Pos logit contributions
 Zuckerberg+2.304
 Mandela+2.303
 themselves+2.292
 THEY+2.237
 THEIR+2.230
Neg logit contributions
pite-2.916
laughs-2.768
ittee-2.732
restricted-2.607
crew-2.558
Token99
Token ID2079
Feature activation+1.16e+00

Pos logit contributions
 Mandela+6.152
 themselves+6.129
 Zuckerberg+6.017
 THEY+5.909
 THEIR+5.908
Neg logit contributions
pite-9.654
ittee-9.198
laughs-9.133
restricted-8.655
acia-8.547
Token;
Token ID26
Feature activation-1.39e+00

Pos logit contributions
 themselves+7.736
 Mandela+7.679
 Zuckerberg+7.476
 THEY+7.352
 THEIR+7.346
Neg logit contributions
pite-11.672
laughs-11.032
ittee-10.852
restricted-10.274
acia-10.112
Token head
Token ID1182
Feature activation-2.74e+00

Pos logit contributions
pite+11.160
crew+10.282
 comprises+10.101
laughs+10.084
ods+9.920
Neg logit contributions
 Mandela-9.901
IELD-9.812
 Zuckerberg-9.728
 Tsarnaev-9.586
ARE-9.432
Tokenwater
Token ID7050
Feature activation+2.10e-01

Pos logit contributions
crew+13.010
pite+12.587
laughs+12.535
restricted+12.480
such+12.293
Neg logit contributions
 Mandela-21.819
 Yanuk-20.836
 Zuckerberg-20.641
 Mao-20.508
IELD-20.388
Token.
Token ID13
Feature activation-7.28e-01

Pos logit contributions
 Mandela+1.738
 Zuckerberg+1.737
 themselves+1.723
 THEY+1.688
 THEIR+1.683
Neg logit contributions
pite-1.957
laughs-1.854
ittee-1.820
restricted-1.742
crew-1.713
Tokencom
Token ID785
Feature activation+2.75e-02

Pos logit contributions
pite+4.159
laughs+3.678
crew+3.665
ittee+3.553
restricted+3.547
Neg logit contributions
 Mandela-8.304
 Zuckerberg-8.218
-8.180
IELD-8.162
 themselves-8.148
Token),
Token ID828
Feature activation+3.69e-01

Pos logit contributions
 Mandela+0.256
 Zuckerberg+0.256
 themselves+0.252
 THEY+0.249
 THEIR+0.249
Neg logit contributions
pite-0.230
laughs-0.213
ittee-0.212
restricted-0.202
crew-0.199
Token the
Token ID262
Feature activation+3.76e-01

Pos logit contributions
 Zuckerberg+2.786
 Mandela+2.784
 themselves+2.777
 THEY+2.691
 THEIR+2.689
Neg logit contributions
pite-3.672
laughs-3.512
ittee-3.449
restricted-3.294
crew-3.228
Tokenassadors
Token ID33429
Feature activation-9.37e-01

Pos logit contributions
pite+4.939
ittee+4.893
acia+4.746
such+4.019
ods+3.880
Neg logit contributions
-18.572
 Mandela-17.656
WARE-16.913
IELD-16.890
 Tsarnaev-16.888
Token,
Token ID11
Feature activation-2.01e+00

Pos logit contributions
pite+7.744
ittee+7.397
such+6.992
restricted+6.979
laughs+6.769
Neg logit contributions
-7.823
 Mandela-7.670
IELD-7.387
 Zuckerberg-7.340
 THEY-7.172
Token other
Token ID584
Feature activation-2.62e+00

Pos logit contributions
such+15.176
pite+15.097
ittee+15.022
 Creatures+14.094
 trib+14.064
Neg logit contributions
-14.128
 Mandela-12.645
IELD-11.741
 Zuckerberg-11.444
celona-11.394
Token public
Token ID1171
Feature activation-2.31e+00

Pos logit contributions
such+16.686
 Creatures+16.247
 notable+15.679
ittee+15.580
pite+15.565
Neg logit contributions
-17.873
 Mandela-16.291
celona-15.585
 Zuckerberg-15.370
IELD-15.214
Token Ministers
Token ID36647
Feature activation-1.64e+00

Pos logit contributions
ittee+14.572
pite+13.981
ods+13.911
such+13.704
acia+13.526
Neg logit contributions
-17.576
 Mandela-16.533
 Tsarnaev-15.794
 Zuckerberg-15.601
ARE-15.185
Token and
Token ID290
Feature activation-2.73e+00

Pos logit contributions
pite+11.204
ittee+10.954
such+9.946
affected+9.822
restricted+9.740
Neg logit contributions
-13.572
 Mandela-13.082
 Zuckerberg-12.713
ARE-12.426
celona-12.388
Token Council
Token ID4281
Feature activation-8.64e-01

Pos logit contributions
ittee+18.667
such+18.664
 Creatures+18.121
pite+18.005
 trib+17.684
Neg logit contributions
-16.907
 Mandela-15.268
IELD-13.567
celona-13.525
 Tsarnaev-13.470
Tokens
Token ID82
Feature activation-1.70e+00

Pos logit contributions
pite+6.914
ittee+6.616
restricted+6.367
laughs+6.294
such+6.262
Neg logit contributions
-7.617
 Mandela-7.534
 Zuckerberg-7.224
 Tsarnaev-7.042
IELD-7.022
Token,
Token ID11
Feature activation-1.98e+00

Pos logit contributions
pite+12.852
ittee+12.540
such+11.776
affected+11.602
restricted+11.523
Neg logit contributions
-12.835
 Mandela-12.233
IELD-11.801
 Zuckerberg-11.403
 Tsarnaev-11.297
Token Judges
Token ID42495
Feature activation-1.27e+00

Pos logit contributions
ittee+15.102
pite+14.950
such+14.741
 Creatures+14.151
 trib+13.867
Neg logit contributions
-13.700
 Mandela-12.217
celona-11.464
 Tsarnaev-11.311
IELD-11.141
Token of
Token ID286
Feature activation-1.02e+00

Pos logit contributions
pite+9.130
ittee+8.762
such+8.091
restricted+7.976
crew+7.949
Neg logit contributions
-10.692
 Mandela-10.563
 Zuckerberg-10.359
 THEY-10.078
 THEIR-10.050
Token shall
Token ID2236
Feature activation-1.45e+00

Pos logit contributions
pite+10.199
 comprises+9.479
such+9.149
ittee+9.093
 distur+9.079
Neg logit contributions
-12.440
 Mandela-11.785
 Zuckerberg-11.635
IELD-11.191
celona-11.151
Token appoint
Token ID5393
Feature activation-2.53e+00

Pos logit contributions
pite+10.123
such+9.337
 therefore+9.293
 comprises+8.745
ittee+8.743
Neg logit contributions
-12.975
 Mandela-11.986
 Zuckerberg-11.881
WARE-11.479
IELD-11.477
Token Amb
Token ID12457
Feature activation-1.42e+00

Pos logit contributions
ittee+17.572
such+17.565
pite+16.329
 Creatures+16.103
 Matters+15.991
Neg logit contributions
-16.494
 Mandela-14.215
celona-13.836
IELD-13.462
WARE-13.381
Tokenassadors
Token ID33429
Feature activation-9.37e-01

Pos logit contributions
pite+4.939
ittee+4.893
acia+4.746
such+4.019
ods+3.880
Neg logit contributions
-18.572
 Mandela-17.656
WARE-16.913
IELD-16.890
 Tsarnaev-16.888
Token,
Token ID11
Feature activation-2.01e+00

Pos logit contributions
pite+7.744
ittee+7.397
such+6.992
restricted+6.979
laughs+6.769
Neg logit contributions
-7.823
 Mandela-7.670
IELD-7.387
 Zuckerberg-7.340
 THEY-7.172
Token other
Token ID584
Feature activation-2.62e+00

Pos logit contributions
such+15.176
pite+15.097
ittee+15.022
 Creatures+14.094
 trib+14.064
Neg logit contributions
-14.128
 Mandela-12.645
IELD-11.741
 Zuckerberg-11.444
celona-11.394
Token public
Token ID1171
Feature activation-2.31e+00

Pos logit contributions
such+16.686
 Creatures+16.247
 notable+15.679
ittee+15.580
pite+15.565
Neg logit contributions
-17.873
 Mandela-16.291
celona-15.585
 Zuckerberg-15.370
IELD-15.214
Token Ministers
Token ID36647
Feature activation-1.64e+00

Pos logit contributions
ittee+14.572
pite+13.981
ods+13.911
such+13.704
acia+13.526
Neg logit contributions
-17.576
 Mandela-16.533
 Tsarnaev-15.794
 Zuckerberg-15.601
ARE-15.185
Token and
Token ID290
Feature activation-2.73e+00

Pos logit contributions
pite+11.204
ittee+10.954
such+9.946
affected+9.822
restricted+9.740
Neg logit contributions
-13.572
 Mandela-13.082
 Zuckerberg-12.713
ARE-12.426
celona-12.388
Token Council
Token ID4281
Feature activation-8.64e-01

Pos logit contributions
ittee+18.667
such+18.664
 Creatures+18.121
pite+18.005
 trib+17.684
Neg logit contributions
-16.907
 Mandela-15.268
IELD-13.567
celona-13.525
 Tsarnaev-13.470
Tokens
Token ID82
Feature activation-1.70e+00

Pos logit contributions
pite+6.914
ittee+6.616
restricted+6.367
laughs+6.294
such+6.262
Neg logit contributions
-7.617
 Mandela-7.534
 Zuckerberg-7.224
 Tsarnaev-7.042
IELD-7.022
Token some
Token ID617
Feature activation-1.13e+00

Pos logit contributions
pite+11.134
laughs+10.322
ittee+10.229
such+10.127
crew+10.027
Neg logit contributions
-9.624
 Zuckerberg-9.451
IELD-9.257
 THEY-9.164
 """-9.158
Token indie
Token ID19907
Feature activation-5.41e-01

Pos logit contributions
pite+9.267
laughs+8.637
crew+8.436
 notable+8.295
 interacts+8.228
Neg logit contributions
-8.458
IELD-8.437
 """-8.045
 Zuckerberg-8.040
 THEIR-8.036
Token work
Token ID670
Feature activation-1.68e+00

Pos logit contributions
pite+4.546
laughs+4.318
ittee+4.109
crew+4.086
restricted+4.072
Neg logit contributions
 Zuckerberg-4.578
-4.491
 Mandela-4.464
 THEIR-4.399
 THEY-4.397
Token in
Token ID287
Feature activation-1.33e+00

Pos logit contributions
pite+11.952
such+11.857
laughs+11.569
crew+11.221
 comprises+11.065
Neg logit contributions
IELD-12.163
-11.656
 Zuckerberg-11.621
 THEIR-11.429
 Mandela-11.299
Token recent
Token ID2274
Feature activation-1.64e+00

Pos logit contributions
pite+11.780
laughs+11.251
such+10.935
restricted+10.581
crew+10.497
Neg logit contributions
-8.796
 THEY-8.202
 Zuckerberg-8.120
 """-8.115
 THEIR-8.002
Token films
Token ID7328
Feature activation-2.55e+00

Pos logit contributions
pite+12.457
laughs+11.805
ods+11.480
crew+11.395
such+11.331
Neg logit contributions
 Zuckerberg-11.141
IELD-11.086
 THEIR-10.974
 Mandela-10.952
ARE-10.919
Token such
Token ID884
Feature activation-1.70e+00

Pos logit contributions
such+18.210
pite+17.141
 Creatures+16.554
crew+16.400
ittee+16.133
Neg logit contributions
-13.576
IELD-13.391
celona-12.570
ARE-12.228
 THEIR-12.130
Token as
Token ID355
Feature activation-1.06e+00

Pos logit contributions
pite+9.314
such+9.178
laughs+8.769
ods+8.682
crew+8.482
Neg logit contributions
IELD-15.216
-15.148
 THEY-14.552
 THEIR-14.456
WARE-14.445
Token Dead
Token ID5542
Feature activation-3.58e-01

Pos logit contributions
pite+9.597
laughs+9.133
ittee+8.960
such+8.798
crew+8.764
Neg logit contributions
-6.886
IELD-6.660
 Zuckerberg-6.538
 themselves-6.527
ARE-6.440
Tokenfall
Token ID7207
Feature activation-2.99e-01

Pos logit contributions
pite+3.042
laughs+2.896
ittee+2.885
crew+2.818
restricted+2.724
Neg logit contributions
 Zuckerberg-3.043
 THEIR-3.023
-3.016
 Mandela-2.998
 THEY-2.968
Token and
Token ID290
Feature activation-1.64e+00

Pos logit contributions
pite+2.865
laughs+2.723
ittee+2.682
restricted+2.584
crew+2.568
Neg logit contributions
 Zuckerberg-2.271
 Mandela-2.268
 THEIR-2.231
 themselves-2.215
 THEY-2.215
Token of
Token ID286
Feature activation-1.86e+00

Pos logit contributions
pite+7.051
ittee+6.872
such+6.448
 comprises+6.343
acia+6.267
Neg logit contributions
-9.053
 Zuckerberg-8.855
 Mandela-8.732
 Tsarnaev-8.722
 Nguyen-8.388
Token the
Token ID262
Feature activation-1.89e+00

Pos logit contributions
such+12.157
ittee+11.587
pite+10.573
 therefore+10.495
 tho+10.478
Neg logit contributions
-16.383
 Mandela-14.872
WARE-14.605
 Tsarnaev-14.371
 Zuckerberg-14.351
Token Senate
Token ID3845
Feature activation-1.70e+00

Pos logit contributions
such+9.009
ittee+8.492
 Creatures+8.389
pite+8.111
esides+7.941
Neg logit contributions
-19.179
IELD-18.043
 Mandela-17.771
WARE-17.753
 Zuckerberg-17.496
Token,
Token ID11
Feature activation-1.41e+00

Pos logit contributions
ittee+11.830
pite+11.742
 comprises+11.616
such+11.019
acia+10.732
Neg logit contributions
-14.108
 Mandela-13.447
 Zuckerberg-13.282
IELD-12.834
 Tsarnaev-12.563
Token shall
Token ID2236
Feature activation-1.45e+00

Pos logit contributions
pite+10.199
 comprises+9.479
such+9.149
ittee+9.093
 distur+9.079
Neg logit contributions
-12.440
 Mandela-11.785
 Zuckerberg-11.635
IELD-11.191
celona-11.151
Token appoint
Token ID5393
Feature activation-2.53e+00

Pos logit contributions
pite+10.123
such+9.337
 therefore+9.293
 comprises+8.745
ittee+8.743
Neg logit contributions
-12.975
 Mandela-11.986
 Zuckerberg-11.881
WARE-11.479
IELD-11.477
Token Amb
Token ID12457
Feature activation-1.42e+00

Pos logit contributions
ittee+17.572
such+17.565
pite+16.329
 Creatures+16.103
 Matters+15.991
Neg logit contributions
-16.494
 Mandela-14.215
celona-13.836
IELD-13.462
WARE-13.381
Tokenassadors
Token ID33429
Feature activation-9.37e-01

Pos logit contributions
pite+4.939
ittee+4.893
acia+4.746
such+4.019
ods+3.880
Neg logit contributions
-18.572
 Mandela-17.656
WARE-16.913
IELD-16.890
 Tsarnaev-16.888
Token,
Token ID11
Feature activation-2.01e+00

Pos logit contributions
pite+7.744
ittee+7.397
such+6.992
restricted+6.979
laughs+6.769
Neg logit contributions
-7.823
 Mandela-7.670
IELD-7.387
 Zuckerberg-7.340
 THEY-7.172
Token other
Token ID584
Feature activation-2.62e+00

Pos logit contributions
such+15.176
pite+15.097
ittee+15.022
 Creatures+14.094
 trib+14.064
Neg logit contributions
-14.128
 Mandela-12.645
IELD-11.741
 Zuckerberg-11.444
celona-11.394
Token public
Token ID1171
Feature activation-2.31e+00

Pos logit contributions
such+16.686
 Creatures+16.247
 notable+15.679
ittee+15.580
pite+15.565
Neg logit contributions
-17.873
 Mandela-16.291
celona-15.585
 Zuckerberg-15.370
IELD-15.214
Token of
Token ID286
Feature activation-1.02e+00

Pos logit contributions
pite+9.130
ittee+8.762
such+8.091
restricted+7.976
crew+7.949
Neg logit contributions
-10.692
 Mandela-10.563
 Zuckerberg-10.359
 THEY-10.078
 THEIR-10.050
Token the
Token ID262
Feature activation-1.10e+00

Pos logit contributions
pite+8.408
such+8.104
ittee+7.814
restricted+7.748
laughs+7.639
Neg logit contributions
-8.363
 Zuckerberg-7.882
 Mandela-7.727
 """-7.676
 Tsarnaev-7.561
Token supreme
Token ID17700
Feature activation-1.42e+00

Pos logit contributions
pite+7.627
such+7.342
 Creatures+6.975
laughs+6.731
ittee+6.723
Neg logit contributions
-10.206
 Mandela-9.640
 Zuckerberg-9.623
 """-9.545
IELD-9.459
Token Court
Token ID3078
Feature activation-1.75e+00

Pos logit contributions
 Creatures+7.404
such+7.399
pite+7.285
 trib+6.959
 Matters+6.658
Neg logit contributions
-15.182
 Mandela-13.908
WARE-13.794
celona-13.179
 Tsarnaev-13.168
Token,
Token ID11
Feature activation-1.96e+00

Pos logit contributions
pite+12.193
ittee+11.498
such+10.785
affected+10.571
laughs+10.360
Neg logit contributions
-13.245
 Mandela-12.907
 Zuckerberg-12.600
ARE-12.294
 THEY-12.240
Token and
Token ID290
Feature activation-2.47e+00

Pos logit contributions
ittee+14.660
such+14.444
pite+14.425
 Creatures+13.585
 trib+13.428
Neg logit contributions
-13.312
 Mandela-12.027
 Tsarnaev-11.411
celona-11.343
IELD-11.237
Token all
Token ID477
Feature activation-2.80e+00

Pos logit contributions
such+18.328
ittee+18.193
pite+17.878
 Creatures+17.331
 trib+16.949
Neg logit contributions
-15.012
 Mandela-13.474
IELD-12.422
ARE-12.126
celona-12.124
Token other
Token ID584
Feature activation-3.01e+00

Pos logit contributions
 Creatures+18.882
such+18.823
 Matters+18.011
ittee+17.802
pite+17.579
Neg logit contributions
-17.023
 Mandela-15.203
ARE-14.404
WARE-14.108
 Zuckerberg-14.068
Token Officers
Token ID23512
Feature activation-1.61e+00

Pos logit contributions
 Creatures+18.457
such+18.225
 Matters+17.883
 notable+17.653
 trib+17.368
Neg logit contributions
-19.051
 Mandela-16.492
celona-16.289
WARE-16.173
 Zuckerberg-15.778
Token of
Token ID286
Feature activation-1.20e+00

Pos logit contributions
pite+11.154
ittee+10.513
 therefore+9.967
such+9.876
affected+9.768
Neg logit contributions
-13.316
 Mandela-12.506
 Zuckerberg-12.259
IELD-12.100
celona-12.069
Token the
Token ID262
Feature activation-1.22e+00

Pos logit contributions
pite+8.630
such+8.495
 tho+7.891
restricted+7.808
ittee+7.783
Neg logit contributions
-10.675
 Mandela-10.196
 Zuckerberg-10.035
 Tsarnaev-10.013
WARE-9.645
Token,
Token ID11
Feature activation-1.00e+00

Pos logit contributions
pite+8.923
ittee+8.323
laughs+8.002
restricted+7.766
such+7.715
Neg logit contributions
-8.470
IELD-8.441
 Mandela-8.266
 THEIR-8.133
ARE-7.938
Token her
Token ID607
Feature activation-1.03e+00

Pos logit contributions
pite+8.925
ittee+8.418
laughs+8.126
restricted+7.943
such+7.892
Neg logit contributions
-7.275
IELD-7.083
 THEIR-7.025
 Mandela-6.945
 THEY-6.872
Token leadership
Token ID5531
Feature activation-8.70e-01

Pos logit contributions
pite+9.112
ittee+8.820
crew+8.316
laughs+8.307
restricted+8.189
Neg logit contributions
-7.403
 Mandela-7.102
 THEY-7.028
IELD-6.942
 THEIR-6.906
Token of
Token ID286
Feature activation-1.53e+00

Pos logit contributions
pite+7.431
ittee+7.341
laughs+7.065
crew+6.969
 comprises+6.847
Neg logit contributions
IELD-6.693
 Mandela-6.607
 Zuckerberg-6.554
-6.474
 THEIR-6.427
Token that
Token ID326
Feature activation-1.72e+00

Pos logit contributions
ittee+12.228
pite+12.202
such+11.074
restricted+10.926
crew+10.911
Neg logit contributions
-10.461
 THEY-9.993
 Zuckerberg-9.945
 Mandela-9.926
 """-9.820
Token powerful
Token ID3665
Feature activation-2.42e+00

Pos logit contributions
ittee+13.687
pite+12.717
crew+11.735
restricted+11.676
committee+11.599
Neg logit contributions
-12.674
celona-11.545
IELD-11.476
 Tsarnaev-11.461
 Mandela-11.315
Token panel
Token ID6103
Feature activation-1.57e+00

Pos logit contributions
ittee+17.168
pite+15.251
committee+14.836
crew+14.808
 Subcommittee+14.689
Neg logit contributions
-16.084
celona-14.498
 Tsarnaev-13.929
IELD-13.796
 Mandela-13.649
Token --
Token ID1377
Feature activation-6.86e-01

Pos logit contributions
ittee+12.478
pite+11.971
 comprises+11.450
affected+11.444
restricted+11.426
Neg logit contributions
IELD-10.939
 Mandela-10.799
 THEIR-10.695
-10.690
ARE-10.674
Token and
Token ID290
Feature activation-1.14e+00

Pos logit contributions
pite+6.337
ittee+6.108
laughs+5.882
restricted+5.757
such+5.710
Neg logit contributions
-5.179
 Zuckerberg-5.105
 Mandela-5.076
 THEY-5.035
 THEIR-4.988
Token how
Token ID703
Feature activation-6.31e-01

Pos logit contributions
pite+10.567
ittee+10.044
laughs+9.529
 therefore+9.434
restricted+9.303
Neg logit contributions
-7.728
 THEY-7.405
IELD-7.392
 Mandela-7.225
 """-7.171
Token it
Token ID340
Feature activation-9.09e-01

Pos logit contributions
pite+6.169
ittee+5.801
laughs+5.637
such+5.512
restricted+5.504
Neg logit contributions
-4.545
 Mandela-4.407
 Zuckerberg-4.401
 Tsarnaev-4.350
 """-4.296
Token are
Token ID389
Feature activation-4.55e-01

Pos logit contributions
 Zuckerberg+1.373
 Mandela+1.372
 themselves+1.370
 THEY+1.331
 THEIR+1.326
Neg logit contributions
pite-1.957
laughs-1.863
ittee-1.832
restricted-1.758
crew-1.727
Token hereby
Token ID29376
Feature activation-6.97e-01

Pos logit contributions
pite+4.189
laughs+3.889
ittee+3.888
restricted+3.677
crew+3.662
Neg logit contributions
 Mandela-3.664
-3.644
 Zuckerberg-3.632
 THEY-3.498
IELD-3.495
Token terminated
Token ID23083
Feature activation-6.13e-01

Pos logit contributions
pite+5.375
ittee+5.081
laughs+4.843
restricted+4.737
crew+4.651
Neg logit contributions
 Mandela-6.434
-6.321
 Zuckerberg-6.255
IELD-6.088
 THEY-6.041
Token and
Token ID290
Feature activation-1.18e+00

Pos logit contributions
pite+5.279
ittee+4.985
laughs+4.893
restricted+4.704
crew+4.640
Neg logit contributions
 Mandela-5.114
 Zuckerberg-5.076
-5.024
 THEY-4.967
 THEIR-4.949
Token removed
Token ID4615
Feature activation-8.54e-01

Pos logit contributions
pite+10.313
ittee+9.919
 therefore+9.421
restricted+9.363
such+9.302
Neg logit contributions
-8.347
 Mandela-8.070
IELD-7.737
 THEY-7.694
 Zuckerberg-7.685
Token from
Token ID422
Feature activation-2.42e+00

Pos logit contributions
pite+6.788
ittee+6.627
laughs+6.343
restricted+6.040
crew+5.968
Neg logit contributions
 Mandela-7.280
 Zuckerberg-7.205
-7.201
 THEY-7.069
 THEIR-7.044
Token office
Token ID2607
Feature activation-8.91e-01

Pos logit contributions
acia+16.133
pite+15.943
such+15.799
ittee+15.724
forces+15.097
Neg logit contributions
-14.855
WARE-13.371
celona-13.322
 Mandela-13.267
ARE-12.940
Token,
Token ID11
Feature activation-4.03e-01

Pos logit contributions
pite+7.485
ittee+7.107
laughs+6.754
restricted+6.676
affected+6.523
Neg logit contributions
 Mandela-7.070
 Zuckerberg-7.064
-7.052
 THEIR-6.982
 THEY-6.935
Token effective
Token ID4050
Feature activation+1.35e-01

Pos logit contributions
pite+3.634
ittee+3.423
laughs+3.371
restricted+3.241
crew+3.204
Neg logit contributions
 Mandela-3.288
-3.269
 Zuckerberg-3.232
 themselves-3.228
 THEY-3.154
Token immediately
Token ID3393
Feature activation-2.78e-01

Pos logit contributions
 Mandela+0.803
 Zuckerberg+0.801
 themselves+0.793
 THEY+0.771
 THEIR+0.769
Neg logit contributions
pite-1.566
laughs-1.499
ittee-1.480
restricted-1.424
crew-1.406
Token.
Token ID13
Feature activation-4.99e-01

Pos logit contributions
pite+2.353
ittee+2.212
laughs+2.201
restricted+2.090
such+2.036
Neg logit contributions
 Mandela-2.478
 Zuckerberg-2.441
 themselves-2.402
 THEIR-2.402
 THEY-2.382
Token difficulties
Token ID13156
Feature activation-1.58e+00

Pos logit contributions
pite+4.507
laughs+4.167
ittee+4.078
crew+4.048
restricted+3.980
Neg logit contributions
 Zuckerberg-3.796
-3.791
 Mandela-3.728
IELD-3.716
 THEIR-3.625
Token for
Token ID329
Feature activation-1.18e+00

Pos logit contributions
pite+13.203
affected+11.541
 comprises+11.478
such+11.460
 necess+11.378
Neg logit contributions
-11.281
 Zuckerberg-11.202
 THEIR-10.860
IELD-10.780
 Mandela-10.612
Token other
Token ID584
Feature activation-1.43e+00

Pos logit contributions
pite+10.497
crew+9.561
laughs+9.307
such+9.298
restricted+9.176
Neg logit contributions
IELD-8.416
-8.370
 Zuckerberg-8.233
 Mandela-8.169
 THEY-7.974
Token types
Token ID3858
Feature activation-1.60e+00

Pos logit contributions
pite+11.520
crew+10.666
laughs+10.450
forces+10.220
restricted+10.166
Neg logit contributions
IELD-10.667
 Zuckerberg-10.516
 Mandela-10.495
-10.469
 THEY-10.122
Token of
Token ID286
Feature activation-1.76e+00

Pos logit contributions
pite+11.573
ittee+10.305
 comprises+10.087
crew+9.993
such+9.918
Neg logit contributions
 Zuckerberg-12.843
 Mandela-12.790
 THEIR-12.367
IELD-12.206
-12.166
Token vessels
Token ID14891
Feature activation-2.36e+00

Pos logit contributions
pite+12.823
crew+11.648
laughs+11.373
 engagements+11.320
restricted+11.287
Neg logit contributions
IELD-13.260
-13.128
 Zuckerberg-12.546
 Mandela-12.522
 THEIR-12.083
Token,
Token ID11
Feature activation-1.27e+00

Pos logit contributions
pite+17.189
 comprises+16.408
 necess+16.327
such+16.155
 therefore+16.155
Neg logit contributions
-14.396
IELD-14.157
 Zuckerberg-14.010
 THEIR-13.270
 Mandela-13.235
Token are
Token ID389
Feature activation-1.26e+00

Pos logit contributions
pite+11.883
 comprises+10.806
 distur+10.684
such+10.604
 detract+10.576
Neg logit contributions
-8.480
 Zuckerberg-8.283
 Mandela-8.019
 THEIR-8.010
IELD-7.961
Token significant
Token ID2383
Feature activation-8.43e-01

Pos logit contributions
pite+10.766
 therefore+9.969
such+9.827
laughs+9.723
restricted+9.597
Neg logit contributions
IELD-9.275
-9.238
 Zuckerberg-8.874
 Mandela-8.849
ARE-8.553
Token factors
Token ID5087
Feature activation-1.23e+00

Pos logit contributions
pite+7.303
laughs+6.872
ittee+6.420
restricted+6.399
 distur+6.318
Neg logit contributions
 Zuckerberg-6.863
 Mandela-6.783
IELD-6.724
-6.672
 THEIR-6.590
Token.
Token ID13
Feature activation-7.44e-01

Pos logit contributions
pite+10.138
crew+9.252
laughs+9.202
affected+9.083
 detract+9.020
Neg logit contributions
IELD-9.434
 Mandela-9.390
 Zuckerberg-9.307
-9.134
 THEIR-9.102
Token But
Token ID887
Feature activation+4.88e-02

Pos logit contributions
 Mandela+0.836
 Zuckerberg+0.834
 themselves+0.833
 THEY+0.812
 THEIR+0.811
Neg logit contributions
pite-0.894
laughs-0.849
ittee-0.830
restricted-0.790
such-0.779
Token if
Token ID611
Feature activation-2.35e-01

Pos logit contributions
 Mandela+0.343
 Zuckerberg+0.342
 themselves+0.339
 THEY+0.331
 THEIR+0.331
Neg logit contributions
pite-0.512
laughs-0.487
ittee-0.480
restricted-0.460
such-0.453
Token Os
Token ID8834
Feature activation-8.16e-01

Pos logit contributions
pite+2.569
laughs+2.445
ittee+2.422
such+2.344
restricted+2.323
Neg logit contributions
-1.463
 Mandela-1.458
 Zuckerberg-1.445
IELD-1.419
 themselves-1.416
Tokenso
Token ID568
Feature activation-1.44e-01

Pos logit contributions
pite+3.079
laughs+2.569
ittee+2.452
such+2.251
restricted+2.138
Neg logit contributions
-11.166
 Mandela-11.035
 Zuckerberg-10.937
 THEIR-10.928
 THEY-10.845
Tokenff
Token ID487
Feature activation+2.22e-01

Pos logit contributions
pite+1.193
laughs+1.122
ittee+1.100
restricted+1.042
crew+1.019
Neg logit contributions
 Zuckerberg-1.339
 Mandela-1.337
 themselves-1.333
 THEY-1.305
 THEIR-1.301
Token loses
Token ID14754
Feature activation-2.36e+00

Pos logit contributions
 Zuckerberg+2.007
 Mandela+2.005
 themselves+1.996
 THEY+1.955
 THEIR+1.951
Neg logit contributions
pite-1.882
laughs-1.773
ittee-1.740
restricted-1.651
crew-1.615
Token,
Token ID11
Feature activation-3.47e-01

Pos logit contributions
pite+16.937
 therefore+16.514
ittee+16.219
such+15.862
laughs+15.633
Neg logit contributions
-13.589
IELD-11.900
 """-11.754
 Mandela-11.521
ARE-11.464
Token expect
Token ID1607
Feature activation-7.24e-01

Pos logit contributions
pite+3.612
laughs+3.400
ittee+3.384
such+3.264
restricted+3.222
Neg logit contributions
-2.335
 Mandela-2.330
 Zuckerberg-2.271
IELD-2.264
 """-2.236
Token the
Token ID262
Feature activation-4.67e-01

Pos logit contributions
pite+6.640
laughs+6.296
ittee+6.205
such+6.062
restricted+5.946
Neg logit contributions
-5.338
IELD-5.159
 Mandela-5.149
 """-5.090
 THEY-5.020
Token liberal
Token ID7270
Feature activation+2.33e-01

Pos logit contributions
pite+4.521
laughs+4.244
ittee+4.152
restricted+4.017
such+3.965
Neg logit contributions
-3.426
 Mandela-3.395
 THEY-3.386
 Zuckerberg-3.351
 THEIR-3.343
Token hand
Token ID1021
Feature activation-4.56e-01

Pos logit contributions
 Zuckerberg+1.893
 Mandela+1.891
 themselves+1.882
 THEY+1.839
 THEIR+1.834
Neg logit contributions
pite-2.205
laughs-2.089
ittee-2.055
restricted-1.960
crew-1.923
Token appoint
Token ID5393
Feature activation-2.53e+00

Pos logit contributions
pite+10.123
such+9.337
 therefore+9.293
 comprises+8.745
ittee+8.743
Neg logit contributions
-12.975
 Mandela-11.986
 Zuckerberg-11.881
WARE-11.479
IELD-11.477
Token Amb
Token ID12457
Feature activation-1.42e+00

Pos logit contributions
ittee+17.572
such+17.565
pite+16.329
 Creatures+16.103
 Matters+15.991
Neg logit contributions
-16.494
 Mandela-14.215
celona-13.836
IELD-13.462
WARE-13.381
Tokenassadors
Token ID33429
Feature activation-9.37e-01

Pos logit contributions
pite+4.939
ittee+4.893
acia+4.746
such+4.019
ods+3.880
Neg logit contributions
-18.572
 Mandela-17.656
WARE-16.913
IELD-16.890
 Tsarnaev-16.888
Token,
Token ID11
Feature activation-2.01e+00

Pos logit contributions
pite+7.744
ittee+7.397
such+6.992
restricted+6.979
laughs+6.769
Neg logit contributions
-7.823
 Mandela-7.670
IELD-7.387
 Zuckerberg-7.340
 THEY-7.172
Token other
Token ID584
Feature activation-2.62e+00

Pos logit contributions
such+15.176
pite+15.097
ittee+15.022
 Creatures+14.094
 trib+14.064
Neg logit contributions
-14.128
 Mandela-12.645
IELD-11.741
 Zuckerberg-11.444
celona-11.394
Token public
Token ID1171
Feature activation-2.31e+00

Pos logit contributions
such+16.686
 Creatures+16.247
 notable+15.679
ittee+15.580
pite+15.565
Neg logit contributions
-17.873
 Mandela-16.291
celona-15.585
 Zuckerberg-15.370
IELD-15.214
Token Ministers
Token ID36647
Feature activation-1.64e+00

Pos logit contributions
ittee+14.572
pite+13.981
ods+13.911
such+13.704
acia+13.526
Neg logit contributions
-17.576
 Mandela-16.533
 Tsarnaev-15.794
 Zuckerberg-15.601
ARE-15.185
Token and
Token ID290
Feature activation-2.73e+00

Pos logit contributions
pite+11.204
ittee+10.954
such+9.946
affected+9.822
restricted+9.740
Neg logit contributions
-13.572
 Mandela-13.082
 Zuckerberg-12.713
ARE-12.426
celona-12.388
Token Council
Token ID4281
Feature activation-8.64e-01

Pos logit contributions
ittee+18.667
such+18.664
 Creatures+18.121
pite+18.005
 trib+17.684
Neg logit contributions
-16.907
 Mandela-15.268
IELD-13.567
celona-13.525
 Tsarnaev-13.470
Tokens
Token ID82
Feature activation-1.70e+00

Pos logit contributions
pite+6.914
ittee+6.616
restricted+6.367
laughs+6.294
such+6.262
Neg logit contributions
-7.617
 Mandela-7.534
 Zuckerberg-7.224
 Tsarnaev-7.042
IELD-7.022
Token,
Token ID11
Feature activation-1.98e+00

Pos logit contributions
pite+12.852
ittee+12.540
such+11.776
affected+11.602
restricted+11.523
Neg logit contributions
-12.835
 Mandela-12.233
IELD-11.801
 Zuckerberg-11.403
 Tsarnaev-11.297
Token in
Token ID287
Feature activation-1.43e+00

Pos logit contributions
laughs+13.684
such+13.217
pite+13.069
 noteworthy+12.886
 notable+12.859
Neg logit contributions
-14.749
IELD-14.308
ARE-13.105
WARE-12.707
 THEIR-12.603
Token my
Token ID616
Feature activation-1.04e+00

Pos logit contributions
laughs+11.639
pite+11.251
such+11.013
crew+10.364
restricted+10.353
Neg logit contributions
-10.215
ARE-9.798
 THEY-9.607
IELD-9.461
 Zuckerberg-9.444
Token career
Token ID3451
Feature activation-1.97e+00

Pos logit contributions
laughs+6.605
pite+6.599
ittee+6.016
crew+5.909
such+5.884
Neg logit contributions
-9.814
IELD-9.813
WARE-9.624
 Mandela-9.587
 Zuckerberg-9.561
Token that
Token ID326
Feature activation-1.56e+00

Pos logit contributions
laughs+13.628
such+12.471
pite+12.362
crew+12.121
 detract+12.105
Neg logit contributions
IELD-14.433
-13.877
 THEIR-13.028
ARE-12.776
 Zuckerberg-12.622
Token has
Token ID468
Feature activation-1.55e+00

Pos logit contributions
laughs+13.069
pite+12.762
 interacts+12.703
 distur+12.611
 comprises+12.610
Neg logit contributions
IELD-10.267
-9.907
WARE-9.530
ARE-9.376
 THEIR-9.089
Token had
Token ID550
Feature activation-2.31e+00

Pos logit contributions
laughs+11.518
such+10.958
pite+10.916
 effected+10.746
 affected+10.727
Neg logit contributions
-11.381
IELD-11.159
WARE-10.542
ARE-10.485
 Mandela-10.275
Token this
Token ID428
Feature activation-8.13e-01

Pos logit contributions
laughs+15.383
such+15.107
 comprises+14.135
crew+14.111
 cancell+14.027
Neg logit contributions
-14.932
IELD-13.925
WARE-13.413
celona-13.247
ARE-12.849
Token kind
Token ID1611
Feature activation-4.94e-01

Pos logit contributions
pite+6.555
laughs+6.335
ittee+5.984
such+5.913
crew+5.843
Neg logit contributions
-6.921
IELD-6.722
 THEY-6.489
 THEIR-6.486
 Zuckerberg-6.452
Token of
Token ID286
Feature activation-1.16e+00

Pos logit contributions
pite+4.374
laughs+4.207
ittee+4.018
crew+3.857
restricted+3.856
Neg logit contributions
-4.179
 Mandela-4.110
 Zuckerberg-4.075
 THEIR-4.032
IELD-4.026
Token reaction
Token ID6317
Feature activation-9.34e-01

Pos logit contributions
pite+9.196
laughs+8.870
crew+8.386
ittee+8.300
such+8.219
Neg logit contributions
-9.443
IELD-8.839
 THEY-8.361
WARE-8.336
 Mandela-8.308
Token,"
Token ID553
Feature activation-6.27e-01

Pos logit contributions
laughs+7.252
pite+7.216
ittee+6.770
such+6.641
crew+6.478
Neg logit contributions
-8.089
IELD-7.909
 Zuckerberg-7.577
 Mandela-7.553
 THEIR-7.523
Token.
Token ID13
Feature activation-2.27e-01

Pos logit contributions
pite+4.410
laughs+4.178
ittee+4.116
restricted+3.954
acia+3.889
Neg logit contributions
 Mandela-3.999
IELD-3.973
 Zuckerberg-3.953
 THEIR-3.935
 THEY-3.903
Token
Token ID198
Feature activation-2.99e-01

Pos logit contributions
pite+2.505
laughs+2.403
ittee+2.366
restricted+2.289
such+2.286
Neg logit contributions
 Mandela-1.407
 themselves-1.398
 Zuckerberg-1.391
-1.379
IELD-1.372
Token
Token ID198
Feature activation-6.91e-01

Pos logit contributions
pite+3.269
laughs+3.185
ittee+3.060
such+2.989
restricted+2.983
Neg logit contributions
 Mandela-1.910
 Zuckerberg-1.909
-1.884
 THEIR-1.840
 THEY-1.835
Token"
Token ID1
Feature activation-1.81e-01

Pos logit contributions
pite+5.924
laughs+5.731
such+5.479
restricted+5.423
ittee+5.383
Neg logit contributions
 Mandela-5.876
 Zuckerberg-5.818
 themselves-5.785
 THEY-5.724
 Tsarnaev-5.697
TokenSenior
Token ID31224
Feature activation-1.68e+00

Pos logit contributions
pite+1.734
laughs+1.686
ittee+1.600
such+1.571
restricted+1.561
Neg logit contributions
 Zuckerberg-1.433
 Mandela-1.430
 themselves-1.408
-1.388
 Tsarnaev-1.386
Token public
Token ID1171
Feature activation-2.29e+00

Pos logit contributions
pite+11.760
ittee+11.085
laughs+11.007
acia+10.882
forces+10.866
Neg logit contributions
-12.452
 Tsarnaev-11.848
 Zuckerberg-11.806
IELD-11.636
 """-11.461
Token servants
Token ID17523
Feature activation-7.26e-01

Pos logit contributions
pite+11.864
 engagements+11.428
ittee+11.293
forces+10.747
 comprises+10.706
Neg logit contributions
-18.472
 Tsarnaev-17.564
 Zuckerberg-17.362
IELD-17.145
WARE-16.769
Token superv
Token ID29745
Feature activation+1.38e-01

Pos logit contributions
pite+7.208
laughs+6.692
ittee+6.663
such+6.609
 interacts+6.508
Neg logit contributions
-5.235
IELD-4.889
 Zuckerberg-4.883
 Mandela-4.739
 THEIR-4.642
Tokenise
Token ID786
Feature activation-2.02e-01

Pos logit contributions
 Zuckerberg+1.045
 Mandela+1.045
 themselves+1.042
 THEY+1.013
 THEIR+1.010
Neg logit contributions
pite-1.371
laughs-1.306
ittee-1.282
restricted-1.229
crew-1.206
Token ang
Token ID3550
Feature activation-3.47e-01

Pos logit contributions
pite+2.243
laughs+2.118
ittee+2.103
restricted+2.047
crew+2.039
Neg logit contributions
 Zuckerberg-1.258
-1.254
 Mandela-1.243
IELD-1.222
 THEY-1.213
Tokenl
Token ID75
Feature activation+6.34e-02

Pos logit contributions
pite+2.590
ittee+2.417
laughs+2.391
restricted+2.308
crew+2.225
Neg logit contributions
 Mandela-3.410
 Zuckerberg-3.391
 THEY-3.375
 THEIR-3.373
-3.339
Token the
Token ID262
Feature activation-9.43e-01

Pos logit contributions
pite+7.645
ittee+7.310
restricted+7.144
such+7.113
laughs+7.099
Neg logit contributions
-3.612
 Zuckerberg-3.410
 Mandela-3.362
IELD-3.324
 """-3.283
Token Commonwealth
Token ID17165
Feature activation-1.19e+00

Pos logit contributions
pite+7.727
laughs+7.244
restricted+7.133
ittee+7.110
acia+6.911
Neg logit contributions
-7.635
 Zuckerberg-7.254
IELD-7.213
 Mandela-7.120
 THEIR-7.061
Token Parliament
Token ID8411
Feature activation-1.27e+00

Pos logit contributions
pite+10.276
 comprises+9.810
ittee+9.622
 interacts+9.517
restricted+9.361
Neg logit contributions
-8.575
IELD-8.154
 Zuckerberg-7.914
WARE-7.852
 Tsarnaev-7.790
Token does
Token ID857
Feature activation-1.59e+00

Pos logit contributions
 comprises+9.759
pite+9.637
 interacts+9.381
 therefore+9.269
ittee+9.121
Neg logit contributions
-10.352
IELD-9.595
 Zuckerberg-9.343
WARE-9.160
 THEIR-9.143
Token not
Token ID407
Feature activation-1.45e+00

Pos logit contributions
ittee+8.483
such+8.455
pite+8.413
acia+8.256
 therefore+8.023
Neg logit contributions
 Zuckerberg-14.381
-14.361
IELD-13.948
 Mandela-13.925
ARE-13.837
Token have
Token ID423
Feature activation-2.29e+00

Pos logit contributions
pite+10.247
 therefore+9.946
laughs+9.834
 comprises+9.754
such+9.629
Neg logit contributions
-12.721
IELD-11.877
 Zuckerberg-11.452
WARE-11.228
 Mandela-11.067
Token any
Token ID597
Feature activation-1.41e+00

Pos logit contributions
such+15.449
 comprises+14.854
ittee+14.594
restricted+14.545
 therefore+14.461
Neg logit contributions
-14.596
WARE-13.045
 Zuckerberg-13.001
IELD-12.919
ARE-12.716
Token specific
Token ID2176
Feature activation-1.02e+00

Pos logit contributions
such+9.853
pite+9.749
ittee+9.423
 comprises+9.390
restricted+9.301
Neg logit contributions
-11.401
IELD-10.737
WARE-10.562
 Zuckerberg-10.560
 Mandela-10.449
Token power
Token ID1176
Feature activation-1.34e+00

Pos logit contributions
pite+6.624
ittee+6.131
restricted+6.062
laughs+6.029
such+6.021
Neg logit contributions
-9.721
 Zuckerberg-9.443
IELD-9.383
 Mandela-9.286
 THEY-9.212
Token to
Token ID284
Feature activation-1.10e+00

Pos logit contributions
pite+9.116
laughs+8.902
restricted+8.739
such+8.593
ittee+8.410
Neg logit contributions
-11.550
IELD-11.133
 Zuckerberg-11.112
ARE-11.006
 Mandela-10.885
Token legisl
Token ID3560
Feature activation-1.88e-01

Pos logit contributions
pite+7.705
laughs+7.078
 detract+6.942
ittee+6.934
 therefore+6.841
Neg logit contributions
-10.385
IELD-9.596
 Zuckerberg-9.572
WARE-9.225
 Mandela-9.224
Token crazy
Token ID7165
Feature activation-1.35e-01

Pos logit contributions
pite+5.301
laughs+5.071
such+4.834
ittee+4.780
crew+4.696
Neg logit contributions
-6.147
IELD-5.920
 Zuckerberg-5.720
 Mandela-5.672
 THEIR-5.614
Token.
Token ID13
Feature activation-5.68e-01

Pos logit contributions
pite+1.162
laughs+1.102
ittee+1.066
restricted+1.018
crew+1.013
Neg logit contributions
 Zuckerberg-1.185
 Mandela-1.180
-1.174
 THEIR-1.155
 THEY-1.153
Token I
Token ID314
Feature activation-6.65e-01

Pos logit contributions
laughs+4.905
pite+4.794
ittee+4.456
such+4.388
crew+4.278
Neg logit contributions
-4.649
 themselves-4.474
IELD-4.457
 Zuckerberg-4.441
 Mandela-4.397
Token've
Token ID1053
Feature activation-6.97e-01

Pos logit contributions
laughs+5.698
pite+5.578
ittee+5.354
such+5.150
crew+5.035
Neg logit contributions
 Zuckerberg-5.292
IELD-5.274
 Mandela-5.262
 THEIR-5.237
-5.228
Token never
Token ID1239
Feature activation-1.08e+00

Pos logit contributions
laughs+5.669
pite+5.540
ittee+5.174
such+5.073
crew+5.065
Neg logit contributions
IELD-5.890
-5.738
 Zuckerberg-5.700
ARE-5.700
 Mandela-5.667
Token done
Token ID1760
Feature activation-2.26e+00

Pos logit contributions
laughs+7.457
pite+6.731
ittee+6.494
such+6.355
crew+6.226
Neg logit contributions
IELD-10.645
-10.134
ARE-9.940
 Zuckerberg-9.818
 Mandela-9.783
Token anything
Token ID1997
Feature activation-2.09e+00

Pos logit contributions
laughs+14.852
such+14.409
pite+13.978
crew+13.222
 engagements+13.202
Neg logit contributions
-15.362
IELD-14.279
WARE-13.973
ARE-13.772
celona-13.593
Token in
Token ID287
Feature activation-1.43e+00

Pos logit contributions
laughs+13.684
such+13.217
pite+13.069
 noteworthy+12.886
 notable+12.859
Neg logit contributions
-14.749
IELD-14.308
ARE-13.105
WARE-12.707
 THEIR-12.603
Token my
Token ID616
Feature activation-1.04e+00

Pos logit contributions
laughs+11.639
pite+11.251
such+11.013
crew+10.364
restricted+10.353
Neg logit contributions
-10.215
ARE-9.798
 THEY-9.607
IELD-9.461
 Zuckerberg-9.444
Token career
Token ID3451
Feature activation-1.97e+00

Pos logit contributions
laughs+6.605
pite+6.599
ittee+6.016
crew+5.909
such+5.884
Neg logit contributions
-9.814
IELD-9.813
WARE-9.624
 Mandela-9.587
 Zuckerberg-9.561
Token that
Token ID326
Feature activation-1.56e+00

Pos logit contributions
laughs+13.628
such+12.471
pite+12.362
crew+12.121
 detract+12.105
Neg logit contributions
IELD-14.433
-13.877
 THEIR-13.028
ARE-12.776
 Zuckerberg-12.622
Token they
Token ID484
Feature activation+7.71e-01

Pos logit contributions
pite+2.177
laughs+2.060
ittee+2.016
restricted+1.956
such+1.946
Neg logit contributions
 Mandela-1.135
 Zuckerberg-1.130
-1.088
 themselves-1.078
 Tsarnaev-1.065
Token felt
Token ID2936
Feature activation+2.42e-02

Pos logit contributions
 themselves+5.628
 Zuckerberg+5.299
 Mandela+5.258
 THEY+5.242
 THEIR+5.202
Neg logit contributions
pite-7.819
laughs-7.538
ittee-7.447
restricted-7.086
crew-6.994
Token hopeless
Token ID23292
Feature activation+5.43e-01

Pos logit contributions
 Zuckerberg+0.187
 Mandela+0.187
 themselves+0.183
+0.180
 THEY+0.179
Neg logit contributions
pite-0.238
laughs-0.225
ittee-0.220
restricted-0.212
such-0.209
Token.
Token ID13
Feature activation-8.34e-01

Pos logit contributions
 themselves+4.163
 Zuckerberg+4.028
 Mandela+4.020
 THEY+3.997
 THEIR+3.948
Neg logit contributions
pite-5.273
ittee-5.058
laughs-5.039
restricted-4.815
crew-4.715
Token L
Token ID406
Feature activation-1.65e+00

Pos logit contributions
pite+7.723
laughs+7.536
such+6.997
ittee+6.906
restricted+6.743
Neg logit contributions
 Mandela-6.033
 Zuckerberg-5.951
 themselves-5.860
-5.808
 Tsarnaev-5.764
Tokenosing
Token ID2752
Feature activation-2.26e+00

Pos logit contributions
pite+12.972
ods+11.931
acia+11.922
such+11.823
laughs+11.804
Neg logit contributions
 Zuckerberg-11.737
 Tsarnaev-11.333
 Mandela-11.315
-10.995
 THEIR-10.931
Token T
Token ID309
Feature activation-1.53e+00

Pos logit contributions
pite+18.334
laughs+16.821
such+16.420
 comprises+16.324
 therefore+16.300
Neg logit contributions
-12.275
IELD-11.792
WARE-11.496
 Mandela-11.206
celona-11.176
Token1
Token ID16
Feature activation-8.51e-01

Pos logit contributions
pite+13.065
laughs+12.510
such+12.372
acia+12.204
reads+11.936
Neg logit contributions
 Mandela-9.626
 Zuckerberg-9.618
 THEIR-9.511
-9.280
 Tsarnaev-9.253
Token early
Token ID1903
Feature activation-1.47e+00

Pos logit contributions
pite+8.419
laughs+7.891
ittee+7.473
acia+7.418
restricted+7.391
Neg logit contributions
 Mandela-5.828
 Zuckerberg-5.800
-5.568
 THEY-5.525
 THEIR-5.517
Token against
Token ID1028
Feature activation-1.98e+00

Pos logit contributions
pite+12.786
laughs+11.634
 interacts+11.144
acia+11.077
 detract+11.076
Neg logit contributions
-10.198
IELD-9.949
 Mandela-9.908
 Zuckerberg-9.731
ARE-9.298
Token an
Token ID281
Feature activation-1.18e+00

Pos logit contributions
pite+16.598
laughs+15.273
such+15.163
restricted+14.620
 tho+14.286
Neg logit contributions
-10.690
IELD-10.302
WARE-10.192
 Mandela-9.864
 Zuckerberg-9.754
Token his
Token ID465
Feature activation-1.24e+00

Pos logit contributions
pite+11.828
laughs+10.750
such+10.589
restricted+10.534
 Creatures+10.497
Neg logit contributions
-7.628
 Mandela-6.998
IELD-6.871
 Zuckerberg-6.573
 """-6.561
Token first
Token ID717
Feature activation-1.69e+00

Pos logit contributions
pite+12.513
laughs+11.502
 Creatures+11.166
restricted+11.146
crew+11.142
Neg logit contributions
 Mandela-7.167
-7.133
IELD-6.757
 Zuckerberg-6.515
celona-6.474
Token shot
Token ID2823
Feature activation-7.64e-01

Pos logit contributions
pite+13.339
 Creatures+11.629
crew+11.327
such+11.306
 interacts+11.238
Neg logit contributions
-12.489
 Mandela-11.829
IELD-11.459
celona-11.187
WARE-10.960
Token.
Token ID13
Feature activation-1.03e+00

Pos logit contributions
pite+7.011
laughs+6.449
ittee+6.256
such+6.114
restricted+6.057
Neg logit contributions
 Mandela-6.076
-6.045
 Zuckerberg-5.782
IELD-5.633
 THEIR-5.544
Token Later
Token ID11450
Feature activation-1.09e+00

Pos logit contributions
pite+8.825
laughs+8.615
 Creatures+8.083
such+8.026
restricted+7.797
Neg logit contributions
-7.557
IELD-7.555
 themselves-7.355
 Mandela-7.283
 Zuckerberg-7.111
Token in
Token ID287
Feature activation-2.25e+00

Pos logit contributions
pite+10.182
laughs+9.463
crew+9.024
ods+8.984
restricted+8.981
Neg logit contributions
IELD-7.449
 Mandela-7.319
 THEIR-7.221
-7.022
ARE-6.937
Token the
Token ID262
Feature activation-1.22e+00

Pos logit contributions
pite+17.757
laughs+16.464
 Creatures+15.599
such+15.154
ods+14.919
Neg logit contributions
IELD-12.680
-12.334
 THEY-12.319
 Mandela-12.144
ARE-11.712
Token dial
Token ID5980
Feature activation-3.92e-01

Pos logit contributions
pite+10.364
laughs+9.570
crew+9.041
restricted+8.964
ittee+8.891
Neg logit contributions
IELD-8.964
 Mandela-8.862
-8.836
 THEIR-8.836
 THEY-8.735
Token,
Token ID11
Feature activation-9.42e-01

Pos logit contributions
pite+3.493
laughs+3.243
ittee+3.182
restricted+3.099
such+3.019
Neg logit contributions
 Mandela-3.289
 Zuckerberg-3.241
 THEIR-3.233
IELD-3.217
-3.192
Token he
Token ID339
Feature activation-8.57e-01

Pos logit contributions
pite+9.380
laughs+8.697
such+8.475
ittee+8.352
restricted+8.341
Neg logit contributions
-6.019
IELD-5.937
 Mandela-5.818
 Zuckerberg-5.594
 themselves-5.594
Token gets
Token ID3011
Feature activation-1.56e+00

Pos logit contributions
pite+6.754
laughs+6.543
 interacts+6.201
restricted+6.100
such+6.031
Neg logit contributions
 Mandela-7.206
-7.118
 Zuckerberg-7.082
 THEIR-7.004
 THEY-6.995
Token as
Token ID355
Feature activation+5.44e-02

Pos logit contributions
pite+7.327
laughs+6.986
such+6.879
 interacts+6.774
 comprises+6.711
Neg logit contributions
-6.970
IELD-6.840
 Zuckerberg-6.665
 Mandela-6.613
 """-6.566
Token well
Token ID880
Feature activation-9.87e-01

Pos logit contributions
 Mandela+0.470
 Zuckerberg+0.469
 themselves+0.461
+0.457
 THEIR+0.455
Neg logit contributions
pite-0.484
laughs-0.455
ittee-0.446
restricted-0.426
crew-0.420
Token as
Token ID355
Feature activation-7.39e-01

Pos logit contributions
pite+5.972
laughs+5.591
such+5.413
ittee+5.301
restricted+5.298
Neg logit contributions
-10.160
 Zuckerberg-9.840
IELD-9.839
 Mandela-9.831
 THEY-9.678
Token longer
Token ID2392
Feature activation-8.22e-01

Pos logit contributions
pite+6.495
laughs+6.009
ittee+5.856
 interacts+5.807
such+5.804
Neg logit contributions
-5.828
IELD-5.689
 Zuckerberg-5.663
 Mandela-5.603
 """-5.456
Token-
Token ID12
Feature activation-1.42e+00

Pos logit contributions
pite+9.211
laughs+8.727
such+8.477
 comprises+8.446
ittee+8.433
Neg logit contributions
-4.311
IELD-4.157
 Zuckerberg-4.121
 THEY-4.021
 Mandela-4.021
Tokenlived
Token ID24489
Feature activation-2.22e+00

Pos logit contributions
pite+7.110
such+6.284
affected+6.283
laughs+6.250
restricted+6.247
Neg logit contributions
-15.517
 Zuckerberg-15.337
IELD-15.274
 Tsarnaev-14.960
 Mandela-14.910
Token greenhouse
Token ID16325
Feature activation-4.87e-01

Pos logit contributions
pite+14.496
 distur+14.483
 affects+14.255
 interacts+14.078
such+14.043
Neg logit contributions
-14.548
IELD-14.288
WARE-13.773
celona-13.496
 Zuckerberg-13.358
Token gases
Token ID21678
Feature activation-1.68e+00

Pos logit contributions
pite+2.700
laughs+2.580
ittee+2.380
such+2.376
acia+2.278
Neg logit contributions
 Zuckerberg-5.590
 Mandela-5.523
 THEY-5.374
 THEIR-5.356
IELD-5.326
Token such
Token ID884
Feature activation-1.56e+00

Pos logit contributions
such+12.833
pite+12.284
restricted+11.441
 interacts+11.408
laughs+11.276
Neg logit contributions
IELD-12.040
-11.523
 Zuckerberg-11.377
celona-11.237
 Mandela-11.037
Token as
Token ID355
Feature activation-1.29e+00

Pos logit contributions
pite+7.772
such+7.757
laughs+7.343
 comprises+7.120
 therefore+6.936
Neg logit contributions
-15.835
IELD-15.656
 Zuckerberg-14.852
WARE-14.796
 THEY-14.789
Token nit
Token ID13759
Feature activation-6.39e-02

Pos logit contributions
pite+8.462
such+8.301
 interacts+8.215
 comprises+8.047
laughs+7.966
Neg logit contributions
IELD-11.342
-11.290
 Zuckerberg-10.977
WARE-10.804
 Mandela-10.708
Token employed
Token ID9322
Feature activation+6.19e-01

Pos logit contributions
 themselves+13.649
 they+12.436
 Mandela+12.385
 Zuckerberg+12.264
 Tsarnaev+12.179
Neg logit contributions
pite-14.348
laughs-13.755
cffffcc-13.335
ittee-13.192
acia-12.904
Token similar
Token ID2092
Feature activation+7.03e-01

Pos logit contributions
 themselves+4.865
 Zuckerberg+4.794
 Mandela+4.758
 THEIR+4.660
 THEY+4.641
Neg logit contributions
pite-5.947
laughs-5.710
ittee-5.650
restricted-5.325
acia-5.301
Token schemes
Token ID16546
Feature activation+3.84e-01

Pos logit contributions
 themselves+5.668
 Zuckerberg+5.586
 Mandela+5.581
 THEY+5.440
 THEIR+5.390
Neg logit contributions
pite-6.535
laughs-6.228
ittee-6.219
restricted-5.821
acia-5.744
Token and
Token ID290
Feature activation+7.07e-01

Pos logit contributions
 themselves+3.038
 Zuckerberg+2.991
 Mandela+2.984
 THEY+2.918
 THEIR+2.893
Neg logit contributions
pite-3.718
laughs-3.535
ittee-3.484
restricted-3.307
acia-3.269
Token also
Token ID635
Feature activation+9.12e-01

Pos logit contributions
 themselves+5.883
 Zuckerberg+5.731
 Mandela+5.670
 THEY+5.656
 THEIR+5.597
Neg logit contributions
pite-6.437
laughs-6.177
ittee-6.095
acia-5.765
restricted-5.682
Token found
Token ID1043
Feature activation+1.46e+00

Pos logit contributions
 themselves+7.549
 Zuckerberg+7.348
 Mandela+7.282
 THEY+7.096
 THEIR+7.055
Neg logit contributions
pite-8.165
laughs-7.779
ittee-7.620
acia-7.284
restricted-7.058
Token success
Token ID1943
Feature activation+1.08e+00

Pos logit contributions
 themselves+12.569
 they+10.936
 Mandela+10.849
 Zuckerberg+10.842
 THEIR+10.841
Neg logit contributions
pite-12.164
ittee-11.837
laughs-11.455
acia-11.046
restricted-10.730
Token.
Token ID13
Feature activation+3.06e-01

Pos logit contributions
 themselves+7.794
 Mandela+7.166
 Zuckerberg+7.033
 THEY+6.942
 they+6.890
Neg logit contributions
pite-10.703
ittee-10.337
laughs-10.173
acia-9.632
restricted-9.623
Token Petty
Token ID41505
Feature activation+6.54e-02

Pos logit contributions
 Zuckerberg+2.607
 Mandela+2.603
 themselves+2.593
 THEY+2.546
 THEIR+2.533
Neg logit contributions
pite-2.766
laughs-2.605
ittee-2.580
restricted-2.447
crew-2.399
Token's
Token ID338
Feature activation+7.40e-02

Pos logit contributions
 Zuckerberg+0.538
 Mandela+0.537
 themselves+0.526
 THEIR+0.522
 THEY+0.522
Neg logit contributions
pite-0.609
laughs-0.578
ittee-0.567
restricted-0.542
crew-0.532
Token passer
Token ID30286
Feature activation+2.70e-01

Pos logit contributions
 Zuckerberg+0.559
 Mandela+0.558
 themselves+0.552
 THEY+0.541
 THEIR+0.539
Neg logit contributions
pite-0.741
laughs-0.702
ittee-0.690
restricted-0.663
crew-0.653
Token had
Token ID550
Feature activation+1.18e+00

Pos logit contributions
 themselves+13.468
 they+13.033
 Mandela+12.459
 THEY+12.311
 Tsarnaev+12.194
Neg logit contributions
laughs-14.339
pite-14.121
ittee-13.318
catentry-12.674
restricted-12.629
Token been
Token ID587
Feature activation+4.37e-01

Pos logit contributions
 themselves+9.151
 Mandela+8.696
 Tsarnaev+8.407
 THEIR+8.391
 Zuckerberg+8.381
Neg logit contributions
pite-11.104
ittee-10.715
laughs-10.712
acia-9.819
catentry-9.806
Token abducted
Token ID34682
Feature activation+9.02e-01

Pos logit contributions
 themselves+3.753
 Mandela+3.735
 Zuckerberg+3.722
 THEY+3.624
 THEIR+3.614
Neg logit contributions
pite-3.923
laughs-3.714
ittee-3.654
restricted-3.473
acia-3.390
Token and
Token ID290
Feature activation+6.89e-01

Pos logit contributions
 themselves+5.973
 Mandela+5.720
 Zuckerberg+5.526
 Tsarnaev+5.419
 THEY+5.394
Neg logit contributions
pite-9.706
laughs-9.401
ittee-9.217
restricted-8.855
acia-8.792
Token held
Token ID2714
Feature activation+5.92e-01

Pos logit contributions
 themselves+5.705
 Mandela+5.642
 Zuckerberg+5.569
 THEIR+5.429
 THEY+5.420
Neg logit contributions
pite-6.353
laughs-6.092
ittee-5.977
restricted-5.646
acia-5.556
Token since
Token ID1201
Feature activation+1.74e+00

Pos logit contributions
 themselves+4.305
 Mandela+4.283
 Zuckerberg+4.195
 THEY+4.116
 THEIR+4.095
Neg logit contributions
pite-6.039
laughs-5.799
ittee-5.760
restricted-5.447
acia-5.374
Token 2010
Token ID3050
Feature activation+5.94e-01

Pos logit contributions
 themselves+11.542
 they+11.366
 Mandela+11.006
 their+10.771
 Mao+10.489
Neg logit contributions
pite-15.878
ittee-15.835
laughs-15.738
restricted-15.043
acia-14.958
Token.
Token ID13
Feature activation+8.81e-01

Pos logit contributions
 themselves+4.257
 Mandela+4.211
 Zuckerberg+4.175
 THEY+4.037
 THEIR+4.018
Neg logit contributions
pite-6.115
laughs-5.851
ittee-5.781
restricted-5.514
acia-5.417
Token
Token ID198
Feature activation+2.67e-01

Pos logit contributions
 Mandela+5.908
 themselves+5.861
 Zuckerberg+5.844
 THEY+5.736
 THEIR+5.680
Neg logit contributions
pite-9.325
laughs-8.854
ittee-8.720
restricted-8.412
crew-8.251
Token
Token ID198
Feature activation+7.35e-01

Pos logit contributions
 themselves+1.336
 Zuckerberg+1.312
 Mandela+1.308
 THEY+1.260
 THEIR+1.246
Neg logit contributions
pite-3.379
ittee-3.214
laughs-3.205
restricted-3.080
acia-3.045
TokenPak
Token ID29675
Feature activation-2.01e-01

Pos logit contributions
 themselves+6.152
 Mandela+6.099
 Zuckerberg+6.046
 THEIR+6.005
ARE+5.998
Neg logit contributions
pite-6.442
ittee-6.016
laughs-5.975
restricted-5.650
acia-5.595
Token Norway
Token ID15238
Feature activation+7.73e-01

Pos logit contributions
 Zuckerberg+3.254
 Mandela+3.248
 themselves+3.237
 THEY+3.171
 THEIR+3.156
Neg logit contributions
pite-3.945
laughs-3.738
ittee-3.679
restricted-3.506
crew-3.449
Token :
Token ID1058
Feature activation-9.40e-02

Pos logit contributions
 Zuckerberg+4.230
 themselves+4.196
 Mandela+4.162
 THEY+4.021
 THEIR+3.994
Neg logit contributions
pite-9.252
laughs-8.800
ittee-8.580
restricted-8.221
crew-8.207
Token University
Token ID2059
Feature activation+9.05e-02

Pos logit contributions
pite+0.983
laughs+0.941
ittee+0.924
restricted+0.890
such+0.873
Neg logit contributions
 Zuckerberg-0.643
 themselves-0.641
 Mandela-0.640
 THEY-0.625
 THEIR-0.624
Token of
Token ID286
Feature activation-4.53e-01

Pos logit contributions
 Mandela+0.702
 themselves+0.702
 Zuckerberg+0.701
 THEIR+0.691
 THEY+0.688
Neg logit contributions
pite-0.883
laughs-0.835
ittee-0.826
restricted-0.790
acia-0.771
Token Ber
Token ID4312
Feature activation+1.72e-01

Pos logit contributions
pite+5.217
laughs+5.022
ittee+4.902
such+4.854
restricted+4.848
Neg logit contributions
-2.450
IELD-2.331
 Zuckerberg-2.324
 Mandela-2.316
 themselves-2.310
Tokengen
Token ID5235
Feature activation+1.51e+00

Pos logit contributions
 Mandela+1.406
 Zuckerberg+1.403
 themselves+1.393
 THEY+1.363
 THEIR+1.361
Neg logit contributions
pite-1.622
laughs-1.537
ittee-1.513
restricted-1.445
crew-1.415
Token.
Token ID764
Feature activation+5.65e-01

Pos logit contributions
 Zuckerberg+10.536
 themselves+10.375
 Mandela+10.328
 THEY+9.807
 MIT+9.800
Neg logit contributions
pite-13.484
laughs-13.151
ittee-12.644
crew-12.083
cffffcc-11.886
Token
Token ID198
Feature activation+3.14e-01

Pos logit contributions
 Zuckerberg+2.905
 Mandela+2.841
 themselves+2.809
 THEY+2.697
 THEIR+2.686
Neg logit contributions
pite-6.989
laughs-6.703
ittee-6.618
restricted-6.342
crew-6.320
Token
Token ID198
Feature activation+8.51e-01

Pos logit contributions
 themselves+1.497
 Zuckerberg+1.458
 Mandela+1.450
 THEY+1.386
 """+1.378
Neg logit contributions
pite-4.044
ittee-3.879
laughs-3.829
restricted-3.690
acia-3.656
TokenGoogle
Token ID11708
Feature activation+1.93e-01

Pos logit contributions
 Zuckerberg+3.183
 Mandela+2.908
ARE+2.891
 themselves+2.841
IELD+2.797
Neg logit contributions
pite-11.386
 destro-11.031
ittee-10.955
laughs-10.570
 exha-10.516
Token Scholar
Token ID11713
Feature activation+4.82e-01

Pos logit contributions
 Zuckerberg+1.830
 themselves+1.827
 Mandela+1.814
 THEY+1.771
 THEIR+1.757
Neg logit contributions
pite-1.591
laughs-1.462
ittee-1.454
restricted-1.375
crew-1.352
Token has
Token ID468
Feature activation+2.30e-01

Pos logit contributions
pite+6.560
ittee+6.352
 comprises+6.051
laughs+6.001
such+5.823
Neg logit contributions
-8.119
 Mandela-7.963
IELD-7.960
 Zuckerberg-7.739
 THEIR-7.600
Token also
Token ID635
Feature activation+2.65e-01

Pos logit contributions
 themselves+1.622
 Zuckerberg+1.622
 Mandela+1.619
 THEY+1.568
 THEIR+1.564
Neg logit contributions
pite-2.410
laughs-2.299
ittee-2.261
restricted-2.167
crew-2.132
Token been
Token ID587
Feature activation-1.37e-01

Pos logit contributions
 Zuckerberg+1.775
 themselves+1.773
 Mandela+1.769
 THEY+1.712
 THEIR+1.706
Neg logit contributions
pite-2.872
laughs-2.746
ittee-2.700
restricted-2.594
crew-2.550
Token opened
Token ID4721
Feature activation-2.83e-01

Pos logit contributions
pite+1.337
laughs+1.260
ittee+1.243
restricted+1.183
crew+1.175
Neg logit contributions
 Mandela-1.065
 Zuckerberg-1.055
-1.054
IELD-1.039
 THEY-1.034
Token to
Token ID284
Feature activation+2.90e-01

Pos logit contributions
pite+2.523
laughs+2.401
ittee+2.352
restricted+2.238
crew+2.208
Neg logit contributions
 Mandela-2.374
-2.373
 Zuckerberg-2.344
IELD-2.343
 THEY-2.293
Token determine
Token ID5004
Feature activation+1.73e+00

Pos logit contributions
 Zuckerberg+2.291
 Mandela+2.286
 themselves+2.282
 THEY+2.218
 THEIR+2.216
Neg logit contributions
pite-2.810
laughs-2.672
ittee-2.624
restricted-2.507
crew-2.460
Token how
Token ID703
Feature activation-1.86e-01

Pos logit contributions
 themselves+11.402
 Zuckerberg+10.908
 Mandela+10.863
 THEIR+10.653
 their+10.626
Neg logit contributions
pite-15.753
laughs-15.352
ittee-15.159
cffffcc-14.632
restricted-14.260
Token the
Token ID262
Feature activation-3.91e-01

Pos logit contributions
pite+1.699
ittee+1.559
laughs+1.546
restricted+1.507
such+1.501
Neg logit contributions
 Zuckerberg-1.522
 Mandela-1.512
-1.505
 themselves-1.502
IELD-1.485
Token error
Token ID4049
Feature activation+6.81e-02

Pos logit contributions
pite+3.680
laughs+3.454
ittee+3.411
restricted+3.330
crew+3.290
Neg logit contributions
-2.976
 Zuckerberg-2.940
IELD-2.921
 Mandela-2.913
 THEIR-2.898
Token happened
Token ID3022
Feature activation-7.52e-01

Pos logit contributions
 Zuckerberg+0.673
 Mandela+0.672
 themselves+0.666
 THEIR+0.656
 THEY+0.656
Neg logit contributions
pite-0.521
laughs-0.486
ittee-0.477
restricted-0.451
crew-0.440
Token."
Token ID526
Feature activation-2.70e-01

Pos logit contributions
pite+6.163
ittee+5.775
laughs+5.759
crew+5.459
restricted+5.432
Neg logit contributions
-6.334
 Mandela-6.231
 Zuckerberg-6.191
IELD-6.190
 THEIR-6.139
Token punishment
Token ID9837
Feature activation-3.32e-01

Pos logit contributions
 themselves+2.546
 Mandela+2.538
 Zuckerberg+2.535
 THEY+2.478
 THEIR+2.459
Neg logit contributions
pite-2.732
laughs-2.571
ittee-2.535
restricted-2.414
crew-2.360
Token,
Token ID11
Feature activation-2.93e-01

Pos logit contributions
pite+2.828
laughs+2.714
ittee+2.639
restricted+2.515
such+2.481
Neg logit contributions
 Zuckerberg-2.883
-2.856
 Mandela-2.834
IELD-2.834
 THEIR-2.792
Token that
Token ID326
Feature activation+1.65e-01

Pos logit contributions
pite+2.699
laughs+2.568
ittee+2.471
such+2.400
restricted+2.397
Neg logit contributions
-2.364
 Zuckerberg-2.356
 Mandela-2.338
 themselves-2.276
 THEIR-2.275
Token players
Token ID1938
Feature activation+4.30e-02

Pos logit contributions
 Zuckerberg+1.305
 Mandela+1.304
 themselves+1.294
 THEY+1.264
 THEIR+1.263
Neg logit contributions
pite-1.597
laughs-1.514
ittee-1.489
restricted-1.423
crew-1.397
Token know
Token ID760
Feature activation+1.98e+00

Pos logit contributions
 Zuckerberg+0.306
 Mandela+0.305
+0.296
 themselves+0.295
 THEY+0.294
Neg logit contributions
pite-0.449
laughs-0.427
ittee-0.420
restricted-0.402
such-0.398
Token they
Token ID484
Feature activation+1.54e+00

Pos logit contributions
 themselves+14.315
 they+14.221
 THEY+13.365
 their+13.081
 THEIR+12.870
Neg logit contributions
pite-16.056
laughs-15.752
ittee-15.653
acia-15.295
restricted-14.807
Token can
Token ID460
Feature activation+3.55e-01

Pos logit contributions
 themselves+10.771
 they+9.291
 THEY+9.270
 Mandela+9.145
 THEIR+8.989
Neg logit contributions
pite-14.851
laughs-14.284
ittee-13.956
restricted-13.637
catentry-13.386
Token get
Token ID651
Feature activation+5.65e-01

Pos logit contributions
 themselves+2.715
 Mandela+2.705
 Zuckerberg+2.702
 THEY+2.632
 THEIR+2.624
Neg logit contributions
pite-3.513
laughs-3.337
ittee-3.287
restricted-3.149
crew-3.088
Token on
Token ID319
Feature activation-2.31e-01

Pos logit contributions
 themselves+4.640
 Mandela+4.483
 Zuckerberg+4.464
 THEIR+4.402
 THEY+4.353
Neg logit contributions
pite-5.363
laughs-5.076
ittee-5.060
restricted-4.792
acia-4.740
Token the
Token ID262
Feature activation+1.11e-01

Pos logit contributions
pite+2.124
laughs+2.020
ittee+1.997
restricted+1.916
such+1.888
Neg logit contributions
-1.836
 Zuckerberg-1.819
 Mandela-1.811
 """-1.791
 THEY-1.789
Token field
Token ID2214
Feature activation+2.05e-01

Pos logit contributions
 Zuckerberg+0.817
 Mandela+0.811
 themselves+0.807
+0.794
 THEY+0.792
Neg logit contributions
pite-1.123
laughs-1.065
ittee-1.054
restricted-1.009
such-0.990
Token.
Token ID13
Feature activation+4.24e-01

Pos logit contributions
 Zuckerberg+4.547
 themselves+4.545
 Mandela+4.517
 THEY+4.384
 THEIR+4.327
Neg logit contributions
pite-5.833
laughs-5.554
ittee-5.411
restricted-5.195
acia-5.087
Token
Token ID564
Feature activation-1.31e-01

Pos logit contributions
 Zuckerberg+3.216
 Mandela+3.209
 themselves+3.181
 THEY+3.117
 THEIR+3.098
Neg logit contributions
pite-4.249
laughs-4.017
ittee-3.964
restricted-3.803
crew-3.727
Token
Token ID250
Feature activation-1.15e-01

Pos logit contributions
pite+1.047
laughs+0.992
ittee+0.982
restricted+0.914
such+0.903
Neg logit contributions
 Mandela-1.240
 Zuckerberg-1.231
 themselves-1.221
 THEY-1.196
 THEIR-1.193
TokenIt
Token ID1026
Feature activation-6.36e-02

Pos logit contributions
pite+1.231
laughs+1.188
ittee+1.166
restricted+1.117
such+1.104
Neg logit contributions
 Mandela-0.779
 Zuckerberg-0.772
 themselves-0.765
 THEIR-0.745
 THEY-0.743
Token
Token ID447
Feature activation-1.61e-01

Pos logit contributions
pite+0.664
laughs+0.635
ittee+0.629
restricted+0.597
crew+0.591
Neg logit contributions
 Zuckerberg-0.449
 Mandela-0.448
 themselves-0.436
 THEIR-0.434
 THEY-0.434
Token
Token ID247
Feature activation+5.35e-02

Pos logit contributions
pite+1.395
laughs+1.320
ittee+1.306
restricted+1.228
crew+1.213
Neg logit contributions
 Mandela-1.419
 Zuckerberg-1.418
 themselves-1.400
 THEIR-1.374
 THEY-1.373
Tokens
Token ID82
Feature activation-7.47e-02

Pos logit contributions
 Zuckerberg+0.476
 Mandela+0.475
 themselves+0.470
 THEIR+0.462
 THEY+0.462
Neg logit contributions
pite-0.463
laughs-0.438
ittee-0.431
restricted-0.408
crew-0.401
Token 2016
Token ID1584
Feature activation+2.47e-01

Pos logit contributions
pite+0.700
laughs+0.663
ittee+0.654
restricted+0.621
such+0.617
Neg logit contributions
 Zuckerberg-0.601
 Mandela-0.601
 themselves-0.587
 THEY-0.586
-0.584
Token.
Token ID13
Feature activation+7.79e-02

Pos logit contributions
 Zuckerberg+1.914
 Mandela+1.914
 themselves+1.902
 THEY+1.858
 THEIR+1.853
Neg logit contributions
pite-2.422
laughs-2.301
ittee-2.265
restricted-2.165
crew-2.127
Token We
Token ID775
Feature activation+2.55e-01

Pos logit contributions
 Zuckerberg+0.621
 Mandela+0.621
 themselves+0.621
 THEIR+0.602
 THEY+0.599
Neg logit contributions
pite-0.737
laughs-0.706
ittee-0.691
restricted-0.658
such-0.648
Token should
Token ID815
Feature activation+2.53e-01

Pos logit contributions
 Zuckerberg+1.873
 Mandela+1.871
 themselves+1.864
 THEY+1.814
 THEIR+1.807
Neg logit contributions
pite-2.607
laughs-2.481
ittee-2.441
restricted-2.339
crew-2.299
Token D
Token ID360
Feature activation-2.73e-01

Pos logit contributions
laughs+7.277
pite+7.236
ittee+6.670
restricted+6.537
such+6.513
Neg logit contributions
-5.372
 Mandela-5.308
 Zuckerberg-5.221
IELD-5.160
 THEIR-5.115
Tokenines
Token ID1127
Feature activation-3.42e-01

Pos logit contributions
pite+2.490
laughs+2.367
ittee+2.324
acia+2.301
such+2.205
Neg logit contributions
 Mandela-2.275
 Zuckerberg-2.253
 themselves-2.181
 THEIR-2.179
 THEY-2.165
Tokenh
Token ID71
Feature activation+3.25e-01

Pos logit contributions
pite+2.267
laughs+2.104
ittee+2.010
crew+1.908
such+1.884
Neg logit contributions
 Mandela-3.664
-3.643
 Zuckerberg-3.628
 THEIR-3.609
 THEY-3.606
Token was
Token ID373
Feature activation-4.03e-01

Pos logit contributions
 Zuckerberg+2.658
 Mandela+2.652
 themselves+2.643
 THEY+2.578
 THEIR+2.570
Neg logit contributions
pite-3.050
laughs-2.892
ittee-2.846
restricted-2.717
crew-2.660
Token just
Token ID655
Feature activation-1.83e-01

Pos logit contributions
pite+3.806
laughs+3.708
ittee+3.531
such+3.402
crew+3.394
Neg logit contributions
-3.055
 Mandela-3.032
 Zuckerberg-3.028
 THEY-2.978
IELD-2.969
Token talking
Token ID3375
Feature activation+7.86e-02

Pos logit contributions
pite+1.761
laughs+1.693
ittee+1.635
such+1.563
restricted+1.560
Neg logit contributions
 Mandela-1.408
 Zuckerberg-1.401
-1.382
 THEY-1.374
 themselves-1.366
Token about
Token ID546
Feature activation-7.24e-01

Pos logit contributions
 Zuckerberg+0.425
 Mandela+0.424
 themselves+0.415
 THEY+0.409
 THEIR+0.406
Neg logit contributions
pite-0.949
laughs-0.917
ittee-0.897
restricted-0.867
crew-0.859
Token the
Token ID262
Feature activation-1.03e-01

Pos logit contributions
pite+6.897
laughs+6.819
ittee+6.343
such+6.291
crew+6.234
Neg logit contributions
-5.041
 Mandela-4.890
 Zuckerberg-4.836
 THEY-4.836
IELD-4.785
Token beginnings
Token ID33875
Feature activation-9.65e-03

Pos logit contributions
pite+1.033
laughs+0.992
ittee+0.954
restricted+0.927
crew+0.914
Neg logit contributions
 Zuckerberg-0.754
 Mandela-0.751
 themselves-0.739
 THEY-0.739
 THEIR-0.734
Token of
Token ID286
Feature activation-1.01e+00

Pos logit contributions
pite+0.093
laughs+0.089
ittee+0.086
restricted+0.082
crew+0.081
Neg logit contributions
 Mandela-0.076
 Zuckerberg-0.076
 THEIR-0.074
-0.074
 themselves-0.074
Token this
Token ID428
Feature activation-5.63e-01

Pos logit contributions
pite+9.261
laughs+9.128
such+8.575
crew+8.484
ittee+8.425
Neg logit contributions
-6.831
 Mandela-6.631
 THEY-6.518
 Zuckerberg-6.429
ARE-6.317
Token the
Token ID262
Feature activation+1.88e-01

Pos logit contributions
pite+1.069
laughs+1.001
ittee+0.989
restricted+0.960
crew+0.931
Neg logit contributions
 Mandela-0.859
 Zuckerberg-0.855
 themselves-0.843
 THEY-0.842
-0.838
Token school
Token ID1524
Feature activation+2.73e-01

Pos logit contributions
 Zuckerberg+1.339
 Mandela+1.338
 themselves+1.329
 THEY+1.297
 THEIR+1.293
Neg logit contributions
pite-1.964
laughs-1.870
ittee-1.841
restricted-1.768
crew-1.737
Token year
Token ID614
Feature activation+1.06e-02

Pos logit contributions
 Zuckerberg+2.057
 Mandela+2.054
 themselves+2.048
 THEY+1.995
 THEIR+1.985
Neg logit contributions
pite-2.726
laughs-2.590
ittee-2.550
restricted-2.439
crew-2.396
Token,
Token ID11
Feature activation+1.42e-01

Pos logit contributions
 Mandela+0.094
 Zuckerberg+0.094
 themselves+0.093
 THEIR+0.092
 THEY+0.092
Neg logit contributions
pite-0.092
laughs-0.086
ittee-0.085
restricted-0.081
crew-0.079
Token it
Token ID340
Feature activation+1.95e-01

Pos logit contributions
 Mandela+1.144
 Zuckerberg+1.143
 themselves+1.136
 THEY+1.111
 THEIR+1.109
Neg logit contributions
pite-1.347
laughs-1.276
ittee-1.255
restricted-1.199
such-1.175
Token occurred
Token ID5091
Feature activation-1.72e-02

Pos logit contributions
 themselves+1.800
 Zuckerberg+1.800
 Mandela+1.799
 THEY+1.755
 THEIR+1.749
Neg logit contributions
pite-1.625
laughs-1.527
ittee-1.496
restricted-1.419
crew-1.386
Token to
Token ID284
Feature activation-3.47e-01

Pos logit contributions
pite+0.143
laughs+0.135
ittee+0.133
restricted+0.126
crew+0.123
Neg logit contributions
 Zuckerberg-0.157
 Mandela-0.157
 themselves-0.156
 THEY-0.153
 THEIR-0.153
Token me
Token ID502
Feature activation+2.88e-01

Pos logit contributions
pite+2.423
laughs+2.310
ittee+2.266
such+2.094
restricted+2.088
Neg logit contributions
 THEY-3.472
 Mandela-3.459
 THEIR-3.420
 Zuckerberg-3.412
-3.410
Token that
Token ID326
Feature activation+4.04e-01

Pos logit contributions
 Zuckerberg+2.535
 Mandela+2.534
 themselves+2.527
 THEY+2.471
 THEIR+2.461
Neg logit contributions
pite-2.522
laughs-2.377
ittee-2.335
restricted-2.220
crew-2.172
Token perhaps
Token ID3737
Feature activation+4.30e-01

Pos logit contributions
 themselves+3.222
 Zuckerberg+3.219
 Mandela+3.215
 THEY+3.144
 THEIR+3.126
Neg logit contributions
pite-3.856
laughs-3.659
ittee-3.613
restricted-3.432
crew-3.385
Token this
Token ID428
Feature activation+2.25e-01

Pos logit contributions
 themselves+3.494
 Zuckerberg+3.492
 Mandela+3.490
 THEY+3.414
 THEIR+3.394
Neg logit contributions
pite-4.028
laughs-3.826
ittee-3.776
restricted-3.575
crew-3.545
Token moment
Token ID2589
Feature activation+1.66e-03

Pos logit contributions
 Zuckerberg+3.823
 Mandela+3.817
 themselves+3.805
 """+3.653
 THEY+3.651
Neg logit contributions
pite-6.379
laughs-6.060
ittee-6.036
restricted-5.761
crew-5.722
Token that
Token ID326
Feature activation+5.14e-01

Pos logit contributions
 Mandela+0.014
 Zuckerberg+0.014
 themselves+0.014
 THEIR+0.013
 THEY+0.013
Neg logit contributions
pite-0.015
laughs-0.014
ittee-0.014
restricted-0.014
such-0.013
Token read
Token ID1100
Feature activation-1.71e-01

Pos logit contributions
 themselves+3.854
 Zuckerberg+3.841
 Mandela+3.839
 THEY+3.753
 THEIR+3.722
Neg logit contributions
pite-5.121
laughs-4.860
ittee-4.798
restricted-4.597
crew-4.510
Token()
Token ID3419
Feature activation+6.48e-01

Pos logit contributions
pite+1.852
laughs+1.770
ittee+1.750
restricted+1.687
crew+1.667
Neg logit contributions
 Zuckerberg-1.144
 Mandela-1.143
 THEY-1.096
 Tsarnaev-1.094
 themselves-1.090
Token returns
Token ID5860
Feature activation-3.52e-03

Pos logit contributions
 themselves+5.905
 Mandela+5.737
 Zuckerberg+5.737
 """+5.621
 THEY+5.612
Neg logit contributions
pite-5.392
laughs-5.107
ittee-5.020
restricted-4.725
crew-4.623
Token,
Token ID11
Feature activation+2.00e-01

Pos logit contributions
pite+0.036
laughs+0.034
ittee+0.034
restricted+0.032
such+0.032
Neg logit contributions
 Zuckerberg-0.026
 Mandela-0.026
 themselves-0.025
 THEY-0.025
-0.025
Token the
Token ID262
Feature activation+3.87e-01

Pos logit contributions
 Zuckerberg+1.498
 Mandela+1.497
 themselves+1.487
 THEY+1.451
 THEIR+1.447
Neg logit contributions
pite-2.018
laughs-1.917
ittee-1.887
restricted-1.807
crew-1.775
Token stack
Token ID8931
Feature activation-2.31e-01

Pos logit contributions
 Mandela+2.705
 Zuckerberg+2.705
 themselves+2.697
 THEY+2.616
 THEIR+2.608
Neg logit contributions
pite-4.071
laughs-3.880
ittee-3.827
restricted-3.663
crew-3.606
Token pointer
Token ID17562
Feature activation+5.26e-01

Pos logit contributions
pite+2.121
laughs+2.003
ittee+1.939
restricted+1.875
crew+1.859
Neg logit contributions
 Zuckerberg-1.916
 Mandela-1.915
 THEIR-1.852
 Tsarnaev-1.849
 THEY-1.839
Token is
Token ID318
Feature activation+6.50e-02

Pos logit contributions
 themselves+4.638
 Mandela+4.533
 Zuckerberg+4.531
 THEY+4.445
 """+4.421
Neg logit contributions
pite-4.563
laughs-4.350
ittee-4.287
restricted-4.057
acia-3.996
Token in
Token ID287
Feature activation+1.58e-01

Pos logit contributions
 Zuckerberg+0.473
 Mandela+0.471
 themselves+0.464
 THEY+0.458
 THEIR+0.454
Neg logit contributions
pite-0.667
laughs-0.632
ittee-0.622
restricted-0.599
such-0.589
Token create
Token ID2251
Feature activation-3.18e-02

Pos logit contributions
 Zuckerberg+1.469
 Mandela+1.468
 themselves+1.455
 THEY+1.430
 THEIR+1.424
Neg logit contributions
pite-1.726
laughs-1.637
ittee-1.610
restricted-1.534
crew-1.510
Token a
Token ID257
Feature activation+1.04e-02

Pos logit contributions
pite+0.318
laughs+0.301
ittee+0.295
restricted+0.284
such+0.282
Neg logit contributions
 Mandela-0.236
 Zuckerberg-0.236
-0.232
 themselves-0.231
 THEY-0.230
Token comic
Token ID9048
Feature activation+8.94e-01

Pos logit contributions
 Zuckerberg+0.083
 Mandela+0.083
 themselves+0.082
 THEY+0.081
+0.081
Neg logit contributions
pite-0.098
laughs-0.093
ittee-0.091
restricted-0.087
crew-0.086
Token book
Token ID1492
Feature activation+1.33e+00

Pos logit contributions
 themselves+7.259
 Zuckerberg+7.103
 Mandela+7.027
 THEY+6.920
 """+6.897
Neg logit contributions
pite-8.078
laughs-7.762
ittee-7.685
restricted-7.275
acia-7.042
Token-
Token ID12
Feature activation+1.30e+00

Pos logit contributions
 themselves+9.161
 Mandela+8.902
 Zuckerberg+8.838
 """+8.638
 THEY+8.533
Neg logit contributions
pite-12.604
ittee-12.236
laughs-12.030
restricted-11.382
acia-11.134
Tokenlike
Token ID2339
Feature activation+6.06e-01

Pos logit contributions
 themselves+9.754
 Zuckerberg+9.364
 Mandela+9.353
 """+9.290
 THEY+9.055
Neg logit contributions
pite-11.012
ittee-10.721
laughs-10.239
acia-9.816
esides-9.729
Token appearance
Token ID5585
Feature activation-6.74e-02

Pos logit contributions
 themselves+4.347
 Zuckerberg+4.338
 Mandela+4.328
 THEY+4.170
 """+4.104
Neg logit contributions
pite-6.197
laughs-5.916
ittee-5.908
restricted-5.600
acia-5.485
Token.
Token ID13
Feature activation-8.71e-02

Pos logit contributions
pite+0.646
laughs+0.620
ittee+0.606
restricted+0.580
crew+0.574
Neg logit contributions
 Mandela-0.531
 Zuckerberg-0.530
 THEIR-0.517
 themselves-0.511
-0.511
Token The
Token ID383
Feature activation-3.11e-01

Pos logit contributions
pite+0.817
laughs+0.787
ittee+0.759
such+0.724
restricted+0.722
Neg logit contributions
 Mandela-0.687
 Zuckerberg-0.687
 themselves-0.686
-0.671
 THEIR-0.670
Token opening
Token ID4756
Feature activation-9.91e-01

Pos logit contributions
pite+3.059
laughs+2.920
ittee+2.824
restricted+2.762
crew+2.747
Neg logit contributions
 Zuckerberg-2.306
 Mandela-2.280
 THEIR-2.273
 THEY-2.268
 themselves-2.242
Token mission
Token ID4365
Feature activation-4.86e-01

Pos logit contributions
pite+8.389
laughs+8.000
 comprises+7.863
 interacts+7.689
crew+7.571
Neg logit contributions
-7.737
IELD-7.271
 THEIR-7.194
 Zuckerberg-7.027
ARE-6.994
TokenMarco
Token ID37179
Feature activation+6.50e-02

Pos logit contributions
pite+2.272
laughs+2.172
ittee+2.074
restricted+2.001
such+1.948
Neg logit contributions
 Mandela-2.759
 Zuckerberg-2.727
 THEY-2.716
 THEIR-2.705
 themselves-2.677
Token Murray
Token ID12164
Feature activation-4.04e-01

Pos logit contributions
 Zuckerberg+0.455
 Mandela+0.454
 themselves+0.449
 THEY+0.441
 THEIR+0.441
Neg logit contributions
pite-0.683
laughs-0.655
ittee-0.645
restricted-0.618
crew-0.607
Token and
Token ID290
Feature activation-5.64e-01

Pos logit contributions
pite+3.795
laughs+3.670
ittee+3.536
restricted+3.400
crew+3.325
Neg logit contributions
 Zuckerberg-3.156
 Mandela-3.152
 THEIR-3.064
-3.061
 THEY-3.059
Token Tony
Token ID8832
Feature activation+2.17e-01

Pos logit contributions
pite+5.840
laughs+5.608
ittee+5.478
restricted+5.320
such+5.282
Neg logit contributions
-3.772
 Mandela-3.673
IELD-3.671
 Zuckerberg-3.621
 Tsarnaev-3.536
Token will
Token ID481
Feature activation-4.04e-02

Pos logit contributions
 Zuckerberg+1.168
 Mandela+1.164
 themselves+1.154
 THEY+1.119
 THEIR+1.113
Neg logit contributions
pite-2.636
laughs-2.529
ittee-2.495
restricted-2.409
crew-2.374
Token be
Token ID307
Feature activation-1.82e-01

Pos logit contributions
pite+0.417
laughs+0.400
ittee+0.393
restricted+0.376
crew+0.370
Neg logit contributions
 Mandela-0.291
 Zuckerberg-0.291
 THEY-0.280
-0.280
 themselves-0.279
Token 35
Token ID3439
Feature activation+6.98e-01

Pos logit contributions
pite+1.813
laughs+1.732
ittee+1.679
restricted+1.633
crew+1.606
Neg logit contributions
 Mandela-1.357
 Zuckerberg-1.351
-1.323
 THEY-1.311
IELD-1.311
Token next
Token ID1306
Feature activation-8.95e-01

Pos logit contributions
 themselves+5.023
 Mandela+4.799
 Zuckerberg+4.796
 THEY+4.730
 THEIR+4.677
Neg logit contributions
pite-7.168
laughs-6.736
ittee-6.691
restricted-6.379
crew-6.358
Token year
Token ID614
Feature activation-4.79e-01

Pos logit contributions
pite+6.363
laughs+6.267
ittee+5.859
restricted+5.776
crew+5.687
Neg logit contributions
 Zuckerberg-7.934
IELD-7.929
 Mandela-7.922
-7.875
 THEIR-7.664
Token.
Token ID13
Feature activation-2.47e-01

Pos logit contributions
pite+4.187
laughs+4.075
ittee+3.906
restricted+3.779
such+3.672
Neg logit contributions
 Mandela-3.992
 Zuckerberg-3.965
-3.906
IELD-3.873
 THEIR-3.857
Token I
Token ID314
Feature activation-1.16e+00

Pos logit contributions
pite+2.264
laughs+2.230
ittee+2.109
restricted+2.018
such+2.017
Neg logit contributions
-1.947
 Mandela-1.945
 Zuckerberg-1.942
 themselves-1.936
IELD-1.909
Token Congress
Token ID3162
Feature activation-5.23e-01

Pos logit contributions
pite+5.561
laughs+5.253
ittee+5.234
such+5.042
restricted+5.008
Neg logit contributions
 THEIR-4.888
 Mandela-4.859
IELD-4.806
 Zuckerberg-4.798
-4.792
Token would
Token ID561
Feature activation-3.67e-02

Pos logit contributions
pite+4.438
ittee+4.170
laughs+4.091
restricted+3.975
crew+3.901
Neg logit contributions
 Zuckerberg-4.475
 Mandela-4.464
-4.389
 THEIR-4.387
 THEY-4.345
Token pass
Token ID1208
Feature activation+2.14e-01

Pos logit contributions
pite+0.366
laughs+0.346
ittee+0.345
restricted+0.328
crew+0.322
Neg logit contributions
 Zuckerberg-0.278
 Mandela-0.276
-0.271
 THEY-0.267
IELD-0.266
Token legislation
Token ID5520
Feature activation-6.07e-01

Pos logit contributions
 Zuckerberg+1.594
 Mandela+1.593
 themselves+1.582
 THEY+1.546
 THEIR+1.540
Neg logit contributions
pite-2.164
laughs-2.057
ittee-2.027
restricted-1.941
crew-1.906
Token to
Token ID284
Feature activation-4.40e-01

Pos logit contributions
pite+5.753
ittee+5.485
laughs+5.445
restricted+5.313
such+5.264
Neg logit contributions
IELD-4.435
-4.368
 Mandela-4.337
 Zuckerberg-4.277
 THEIR-4.271
Token raise
Token ID5298
Feature activation-8.09e-02

Pos logit contributions
pite+3.911
laughs+3.625
ittee+3.599
restricted+3.414
such+3.376
Neg logit contributions
-3.668
 Zuckerberg-3.576
 Mandela-3.548
 THEY-3.530
 THEIR-3.454
Token the
Token ID262
Feature activation-1.04e-01

Pos logit contributions
pite+0.723
laughs+0.692
ittee+0.681
restricted+0.653
such+0.643
Neg logit contributions
 Mandela-0.677
 Zuckerberg-0.671
 THEY-0.662
 themselves-0.656
-0.655
Token ceiling
Token ID13387
Feature activation+7.01e-03

Pos logit contributions
pite+0.776
laughs+0.729
ittee+0.713
restricted+0.690
such+0.667
Neg logit contributions
 Mandela-1.020
 Zuckerberg-1.014
 THEY-1.006
 themselves-1.004
-1.003
Token in
Token ID287
Feature activation-1.34e-01

Pos logit contributions
 Mandela+0.048
 Zuckerberg+0.048
 themselves+0.047
 THEIR+0.047
 THEY+0.047
Neg logit contributions
pite-0.074
laughs-0.071
ittee-0.070
restricted-0.067
such-0.066
Token time
Token ID640
Feature activation+1.92e-01

Pos logit contributions
pite+1.395
laughs+1.309
ittee+1.299
restricted+1.251
such+1.241
Neg logit contributions
 Mandela-0.944
 Zuckerberg-0.933
 THEY-0.932
-0.931
 themselves-0.916
Token to
Token ID284
Feature activation+3.21e-01

Pos logit contributions
 Zuckerberg+1.266
 Mandela+1.265
 themselves+1.255
 THEY+1.221
 THEIR+1.218
Neg logit contributions
pite-2.099
laughs-2.004
ittee-1.978
restricted-1.900
crew-1.868
Token are
Token ID389
Feature activation-4.76e-01

Pos logit contributions
pite+1.740
laughs+1.686
ittee+1.657
restricted+1.572
crew+1.565
Neg logit contributions
 Zuckerberg-1.161
 Mandela-1.161
 THEIR-1.125
 THEY-1.121
-1.115
Token extremely
Token ID4457
Feature activation-6.04e-01

Pos logit contributions
pite+4.210
laughs+4.121
ittee+3.922
crew+3.865
such+3.812
Neg logit contributions
-3.815
 Mandela-3.789
 Zuckerberg-3.766
IELD-3.759
 THEY-3.677
Token close
Token ID1969
Feature activation-7.92e-01

Pos logit contributions
pite+4.940
laughs+4.700
ittee+4.415
crew+4.391
restricted+4.330
Neg logit contributions
-5.266
 Mandela-5.264
IELD-5.251
 Zuckerberg-5.172
 THEY-5.075
Token,
Token ID11
Feature activation-8.00e-02

Pos logit contributions
laughs+6.648
pite+6.546
crew+6.187
ittee+6.120
affected+6.017
Neg logit contributions
-6.558
 Mandela-6.339
IELD-6.324
 Zuckerberg-6.295
 THEIR-6.162
Token really
Token ID1107
Feature activation-1.04e+00

Pos logit contributions
pite+0.751
laughs+0.721
ittee+0.700
restricted+0.670
crew+0.661
Neg logit contributions
 Mandela-0.644
 Zuckerberg-0.642
-0.635
 themselves-0.635
 THEIR-0.625
Token,
Token ID11
Feature activation-3.02e-01

Pos logit contributions
laughs+8.154
pite+8.000
ittee+7.436
 interacts+7.361
crew+7.327
Neg logit contributions
-8.821
IELD-8.468
 Mandela-8.269
 Zuckerberg-8.228
 THEIR-7.947
Token really
Token ID1107
Feature activation-9.40e-01

Pos logit contributions
pite+2.766
laughs+2.710
ittee+2.571
restricted+2.469
crew+2.461
Neg logit contributions
-2.456
 Mandela-2.428
 Zuckerberg-2.416
 THEIR-2.380
 themselves-2.377
Token close
Token ID1969
Feature activation-8.93e-01

Pos logit contributions
pite+6.808
laughs+6.799
ittee+6.251
crew+6.205
 interacts+6.116
Neg logit contributions
-8.586
IELD-8.327
 Mandela-8.129
 Zuckerberg-8.093
 THEIR-7.825
Token.
Token ID13
Feature activation-5.23e-01

Pos logit contributions
laughs+7.370
pite+7.076
crew+6.804
ittee+6.618
affected+6.469
Neg logit contributions
-7.345
IELD-7.212
 Mandela-7.200
 Zuckerberg-7.148
 THEIR-6.942
Token I
Token ID314
Feature activation-8.04e-01

Pos logit contributions
laughs+4.708
pite+4.666
ittee+4.281
such+4.202
restricted+4.079
Neg logit contributions
-4.165
IELD-4.059
 themselves-4.006
 Zuckerberg-4.006
 Mandela-3.994
Token love
Token ID1842
Feature activation-8.01e-01

Pos logit contributions
laughs+6.646
pite+6.536
ittee+6.284
such+5.998
restricted+5.930
Neg logit contributions
 Mandela-6.575
 Zuckerberg-6.553
-6.490
 THEIR-6.482
IELD-6.422
Token night
Token ID1755
Feature activation+6.94e-01

Pos logit contributions
pite+2.421
laughs+2.280
ittee+2.254
restricted+2.182
such+2.134
Neg logit contributions
 Mandela-2.326
 themselves-2.298
 Zuckerberg-2.283
 THEIR-2.280
 THEY-2.276
Token for
Token ID329
Feature activation+4.96e-02

Pos logit contributions
 themselves+5.387
 Zuckerberg+5.264
 Mandela+5.225
 THEY+5.120
 THEIR+5.073
Neg logit contributions
pite-6.721
laughs-6.325
ittee-6.251
restricted-5.852
acia-5.816
Token entire
Token ID2104
Feature activation-5.04e-01

Pos logit contributions
 Mandela+0.348
 Zuckerberg+0.347
 THEY+0.342
 themselves+0.341
+0.337
Neg logit contributions
pite-0.516
laughs-0.489
ittee-0.481
restricted-0.470
crew-0.461
Token house
Token ID2156
Feature activation-5.52e-01

Pos logit contributions
pite+4.366
laughs+4.116
ittee+4.015
restricted+3.937
crew+3.845
Neg logit contributions
 Mandela-4.103
 THEY-4.103
-4.079
IELD-4.069
 THEIR-4.059
Token;
Token ID26
Feature activation-6.49e-03

Pos logit contributions
pite+4.668
laughs+4.476
ittee+4.354
restricted+4.333
crew+4.220
Neg logit contributions
 Mandela-4.657
 THEIR-4.611
 Zuckerberg-4.565
IELD-4.533
 THEY-4.524
Token $
Token ID720
Feature activation-1.79e-01

Pos logit contributions
pite+0.076
laughs+0.073
ittee+0.072
restricted+0.070
crew+0.069
Neg logit contributions
 Mandela-0.037
 Zuckerberg-0.036
 themselves-0.036
 THEY-0.035
 THEIR-0.035
Token24
Token ID1731
Feature activation+1.51e+00

Pos logit contributions
pite+1.621
laughs+1.557
ittee+1.481
restricted+1.442
such+1.439
Neg logit contributions
 Zuckerberg-1.512
 Mandela-1.499
 themselves-1.492
 THEIR-1.466
 Tsarnaev-1.463
Token for
Token ID329
Feature activation+8.99e-02

Pos logit contributions
 themselves+10.399
 Mandela+9.885
 Zuckerberg+9.858
 they+9.586
+9.460
Neg logit contributions
pite-13.355
laughs-13.127
ittee-13.012
affected-11.902
acia-11.875
Token approximately
Token ID6702
Feature activation+1.75e-01

Pos logit contributions
 Mandela+0.667
 Zuckerberg+0.666
 themselves+0.657
 THEY+0.654
 THEIR+0.648
Neg logit contributions
pite-0.908
laughs-0.857
ittee-0.843
restricted-0.818
crew-0.801
Token 1
Token ID352
Feature activation+1.35e-01

Pos logit contributions
 Zuckerberg+1.240
 Mandela+1.239
 themselves+1.229
 THEY+1.202
 THEIR+1.197
Neg logit contributions
pite-1.835
laughs-1.749
ittee-1.720
restricted-1.655
crew-1.628
Token-
Token ID12
Feature activation-1.43e-01

Pos logit contributions
 Zuckerberg+1.084
 Mandela+1.081
 themselves+1.074
 THEY+1.054
 THEIR+1.053
Neg logit contributions
pite-1.278
laughs-1.208
ittee-1.190
restricted-1.141
crew-1.120
Token.[
Token ID3693
Feature activation-2.49e-01

Pos logit contributions
 themselves+3.515
 Zuckerberg+3.425
 Mandela+3.412
 THEY+3.328
 THEIR+3.287
Neg logit contributions
pite-4.738
laughs-4.480
ittee-4.425
restricted-4.206
acia-4.154
Token21
Token ID2481
Feature activation-8.12e-03

Pos logit contributions
pite+2.211
laughs+2.137
ittee+2.028
restricted+1.987
such+1.982
Neg logit contributions
 Mandela-2.140
 Zuckerberg-2.129
 themselves-2.104
 THEY-2.067
 THEIR-2.064
Token]
Token ID60
Feature activation-2.96e-01

Pos logit contributions
pite+0.067
laughs+0.063
ittee+0.062
restricted+0.058
crew+0.057
Neg logit contributions
 Zuckerberg-0.076
 Mandela-0.076
 themselves-0.076
 THEY-0.074
 THEIR-0.074
Token In
Token ID554
Feature activation-4.79e-01

Pos logit contributions
pite+2.529
laughs+2.422
ittee+2.285
such+2.245
restricted+2.212
Neg logit contributions
 Zuckerberg-2.514
 themselves-2.511
 Mandela-2.498
IELD-2.481
 THEIR-2.464
Token the
Token ID262
Feature activation+6.92e-01

Pos logit contributions
pite+4.594
laughs+4.275
ittee+4.147
such+4.083
restricted+4.068
Neg logit contributions
 THEY-3.626
 Mandela-3.625
 Zuckerberg-3.624
-3.613
IELD-3.529
Token same
Token ID976
Feature activation-4.27e-01

Pos logit contributions
 themselves+5.044
 Zuckerberg+5.019
 Mandela+4.987
 Tsarnaev+4.797
 THEIR+4.736
Neg logit contributions
pite-6.774
laughs-6.563
ittee-6.482
crew-6.148
restricted-6.143
Token way
Token ID835
Feature activation-8.66e-01

Pos logit contributions
pite+4.691
laughs+4.443
ittee+4.394
restricted+4.222
such+4.188
Neg logit contributions
IELD-2.603
-2.594
 Mandela-2.589
 THEIR-2.589
 Zuckerberg-2.574
Token,
Token ID11
Feature activation+6.54e-03

Pos logit contributions
pite+7.023
laughs+6.531
ittee+6.345
such+6.323
crew+6.248
Neg logit contributions
 THEIR-6.919
IELD-6.915
 Zuckerberg-6.840
 Mandela-6.773
-6.770
Token Christianity
Token ID13624
Feature activation+3.34e-01

Pos logit contributions
 Zuckerberg+0.050
 Mandela+0.049
 themselves+0.049
 THEIR+0.048
 THEY+0.048
Neg logit contributions
pite-0.065
laughs-0.061
ittee-0.060
restricted-0.058
such-0.057
Token would
Token ID561
Feature activation+5.57e-01

Pos logit contributions
 themselves+2.864
 Mandela+2.835
 Zuckerberg+2.830
 THEY+2.749
 THEIR+2.737
Neg logit contributions
pite-3.009
laughs-2.851
ittee-2.799
restricted-2.653
crew-2.608
Token later
Token ID1568
Feature activation+7.52e-01

Pos logit contributions
 themselves+4.506
 Mandela+4.378
 Zuckerberg+4.324
 THEY+4.227
 THEIR+4.217
Neg logit contributions
pite-5.317
laughs-5.032
ittee-4.967
restricted-4.746
crew-4.646
Token fairly
Token ID6547
Feature activation+9.68e-03

Pos logit contributions
pite+3.263
laughs+3.130
ittee+3.026
such+2.966
restricted+2.957
Neg logit contributions
-2.561
 Zuckerberg-2.513
 Mandela-2.511
 THEY-2.500
IELD-2.488
Token simple
Token ID2829
Feature activation-2.71e-02

Pos logit contributions
 Mandela+0.073
 Zuckerberg+0.073
 themselves+0.072
 THEY+0.071
+0.071
Neg logit contributions
pite-0.097
laughs-0.091
ittee-0.089
restricted-0.086
crew-0.085
Token to
Token ID284
Feature activation-8.63e-02

Pos logit contributions
pite+0.239
laughs+0.227
ittee+0.219
restricted+0.212
crew+0.208
Neg logit contributions
 Mandela-0.236
 Zuckerberg-0.234
-0.231
 themselves-0.229
 THEY-0.229
Token mod
Token ID953
Feature activation-6.21e-01

Pos logit contributions
pite+0.876
laughs+0.835
ittee+0.813
crew+0.785
restricted+0.779
Neg logit contributions
 Mandela-0.625
 Zuckerberg-0.622
-0.612
 THEY-0.608
 THEIR-0.604
Token,
Token ID11
Feature activation-7.38e-01

Pos logit contributions
pite+5.672
laughs+5.473
ittee+5.213
such+5.131
restricted+5.079
Neg logit contributions
 Mandela-4.790
 THEY-4.669
-4.654
 Zuckerberg-4.645
IELD-4.580
Token and
Token ID290
Feature activation-1.54e+00

Pos logit contributions
pite+6.206
laughs+5.919
ittee+5.613
such+5.506
restricted+5.356
Neg logit contributions
-6.242
 THEIR-6.001
 Mandela-6.001
 THEY-5.894
IELD-5.873
Token partly
Token ID11476
Feature activation-8.39e-01

Pos logit contributions
pite+12.903
laughs+12.201
 comprises+11.782
 therefore+11.676
ittee+11.541
Neg logit contributions
-10.716
IELD-10.109
ARE-10.008
 THEIR-10.008
 Mandela-9.720
Token due
Token ID2233
Feature activation-9.05e-01

Pos logit contributions
pite+6.220
laughs+5.777
ittee+5.309
 comprises+5.118
restricted+5.045
Neg logit contributions
-7.904
ARE-7.718
IELD-7.605
 THEIR-7.532
 Mandela-7.508
Token to
Token ID284
Feature activation-8.42e-01

Pos logit contributions
pite+6.417
laughs+5.996
ittee+5.965
crew+5.344
ods+5.287
Neg logit contributions
 Mandela-8.384
 THEIR-8.325
ARE-8.323
-8.291
 THEY-8.250
Token bits
Token ID10340
Feature activation+3.79e-01

Pos logit contributions
pite+7.675
laughs+7.196
ittee+6.979
such+6.682
crew+6.647
Neg logit contributions
-6.125
 THEY-6.070
 Mandela-6.012
 Tsarnaev-5.982
ARE-5.976
Token of
Token ID286
Feature activation-3.70e-01

Pos logit contributions
 Zuckerberg+2.742
 Mandela+2.737
 themselves+2.729
 THEY+2.653
 THEIR+2.642
Neg logit contributions
pite-3.914
laughs-3.730
ittee-3.672
restricted-3.519
crew-3.454
Token )
Token ID1267
Feature activation-6.34e-02

Pos logit contributions
pite+1.905
laughs+1.779
ittee+1.727
restricted+1.667
such+1.655
Neg logit contributions
 Mandela-1.761
 Zuckerberg-1.757
-1.695
 Tsarnaev-1.687
 THEIR-1.686
Token.
Token ID764
Feature activation-4.44e-01

Pos logit contributions
pite+0.666
laughs+0.635
ittee+0.621
restricted+0.597
such+0.593
Neg logit contributions
 Zuckerberg-0.446
 Mandela-0.446
 themselves-0.432
-0.428
 THEY-0.425
TokenHe
Token ID1544
Feature activation-4.79e-01

Pos logit contributions
pite+4.391
laughs+4.277
ittee+4.073
restricted+3.960
such+3.953
Neg logit contributions
 Mandela-3.148
 Zuckerberg-3.147
 themselves-3.087
-3.083
IELD-3.023
Token alot
Token ID43158
Feature activation-1.28e+00

Pos logit contributions
pite+4.452
laughs+4.234
ittee+3.947
restricted+3.905
such+3.839
Neg logit contributions
 Mandela-3.827
 Zuckerberg-3.822
 THEIR-3.684
 THEY-3.679
 themselves-3.665
Token of
Token ID286
Feature activation-1.55e+00

Pos logit contributions
pite+12.248
laughs+11.348
ittee+10.925
such+10.667
crew+10.601
Neg logit contributions
 Zuckerberg-8.118
 Mandela-7.869
 Tsarnaev-7.808
-7.733
IELD-7.698
Token times
Token ID1661
Feature activation-1.57e+00

Pos logit contributions
pite+14.690
laughs+13.738
crew+12.977
such+12.926
ods+12.899
Neg logit contributions
 Zuckerberg-8.689
 Mandela-8.538
 Tsarnaev-8.409
IELD-8.354
-8.338
Token goes
Token ID2925
Feature activation-7.77e-01

Pos logit contributions
pite+14.420
laughs+13.790
 interacts+13.401
 comprises+12.898
such+12.839
Neg logit contributions
IELD-9.153
-9.022
 Mandela-8.775
 Zuckerberg-8.723
 Tsarnaev-8.607
Token from
Token ID422
Feature activation-8.93e-01

Pos logit contributions
pite+7.514
laughs+7.058
ittee+6.715
crew+6.634
such+6.570
Neg logit contributions
 Zuckerberg-5.543
 Mandela-5.510
IELD-5.359
-5.346
 Tsarnaev-5.257
Token genius
Token ID15632
Feature activation-2.17e-01

Pos logit contributions
pite+8.669
laughs+7.915
such+7.523
ittee+7.453
crew+7.433
Neg logit contributions
 Zuckerberg-6.219
 Mandela-6.140
-6.113
 Tsarnaev-6.033
 THEY-5.885
Token to
Token ID284
Feature activation-9.21e-01

Pos logit contributions
pite+1.892
laughs+1.772
ittee+1.706
restricted+1.627
crew+1.617
Neg logit contributions
 Zuckerberg-1.911
 Mandela-1.890
-1.873
 THEIR-1.854
 Tsarnaev-1.845
Token total
Token ID2472
Feature activation-4.67e-01

Pos logit contributions
pite+8.546
laughs+7.981
ittee+7.515
restricted+7.467
such+7.467
Neg logit contributions
-6.508
 Zuckerberg-6.426
 Mandela-6.294
 Tsarnaev-6.254
IELD-6.213
Token Pruitt
Token ID39637
Feature activation-1.05e+00

Pos logit contributions
pite+11.280
ittee+11.193
laughs+10.496
acia+10.217
such+10.141
Neg logit contributions
 THEY-9.704
-9.647
 THEIR-9.356
 Mandela-9.304
 """-9.174
Token
Token ID447
Feature activation+3.11e-01

Pos logit contributions
pite+9.881
ittee+9.630
laughs+9.370
restricted+9.055
such+9.025
Neg logit contributions
 Mandela-6.957
 THEIR-6.799
-6.790
 Zuckerberg-6.676
 THEY-6.638
Token
Token ID247
Feature activation-4.09e-01

Pos logit contributions
 themselves+2.642
 Zuckerberg+2.613
 Mandela+2.587
 THEY+2.537
 THEIR+2.530
Neg logit contributions
pite-2.799
laughs-2.627
ittee-2.585
restricted-2.455
crew-2.430
Tokens
Token ID82
Feature activation-1.54e+00

Pos logit contributions
pite+3.634
laughs+3.440
ittee+3.379
restricted+3.202
such+3.161
Neg logit contributions
 Mandela-3.531
 Zuckerberg-3.513
 THEIR-3.447
 THEY-3.426
-3.404
Token two
Token ID734
Feature activation-9.65e-01

Pos logit contributions
pite+13.771
ittee+13.383
laughs+12.495
crew+12.369
such+12.270
Neg logit contributions
-9.250
 Mandela-9.236
 THEY-8.649
 THEIR-8.538
 Zuckerberg-8.459
Token federal
Token ID2717
Feature activation-1.61e+00

Pos logit contributions
pite+8.896
ittee+8.456
laughs+8.225
such+7.971
crew+7.872
Neg logit contributions
 Mandela-6.814
 THEIR-6.581
-6.576
 THEY-6.527
 Zuckerberg-6.451
Token PACs
Token ID43838
Feature activation-1.59e+00

Pos logit contributions
pite+11.619
ittee+11.307
laughs+10.923
acia+10.878
 trib+10.577
Neg logit contributions
 Mandela-12.011
 THEIR-11.965
 Tsarnaev-11.924
 Zuckerberg-11.651
 THEY-11.565
Token
Token ID960
Feature activation-7.79e-01

Pos logit contributions
pite+12.202
ittee+12.076
crew+11.313
restricted+11.046
laughs+11.036
Neg logit contributions
IELD-11.396
-11.015
 THEIR-10.944
 Mandela-10.623
 Zuckerberg-10.507
TokenLiber
Token ID31199
Feature activation-8.08e-01

Pos logit contributions
pite+6.908
ittee+6.598
laughs+6.553
such+6.429
restricted+6.220
Neg logit contributions
 Mandela-6.300
 Zuckerberg-6.221
 Tsarnaev-6.051
-5.989
 THEY-5.942
Tokenty
Token ID774
Feature activation-8.85e-02

Pos logit contributions
pite+5.361
ittee+4.926
acia+4.690
laughs+4.657
crew+4.483
Neg logit contributions
 Mandela-8.370
 Zuckerberg-8.176
-8.129
 THEY-8.108
 themselves-8.090
Token 2
Token ID362
Feature activation-3.58e-01

Pos logit contributions
pite+0.761
laughs+0.713
ittee+0.712
restricted+0.671
crew+0.661
Neg logit contributions
 Zuckerberg-0.788
 Mandela-0.783
 themselves-0.779
 THEY-0.768
 THEIR-0.765
Token.
Token ID13
Feature activation+1.03e-02

Pos logit contributions
pite+2.294
laughs+2.193
ittee+2.148
restricted+2.047
acia+2.040
Neg logit contributions
 Mandela-2.011
 Zuckerberg-1.986
 themselves-1.976
 THEY-1.957
 THEIR-1.942
Token 21
Token ID2310
Feature activation+9.52e-01

Pos logit contributions
 Mandela+0.078
 Zuckerberg+0.078
 themselves+0.077
 THEY+0.075
 THEIR+0.075
Neg logit contributions
pite-0.103
laughs-0.099
ittee-0.096
restricted-0.092
such-0.091
Token:
Token ID25
Feature activation+1.36e-01

Pos logit contributions
 Zuckerberg+6.354
 themselves+6.272
 Mandela+6.243
 THEY+6.030
 """+6.013
Neg logit contributions
pite-9.824
ittee-9.277
laughs-9.231
restricted-8.896
crew-8.739
Token "
Token ID366
Feature activation-6.60e-02

Pos logit contributions
 themselves+0.638
 Mandela+0.638
 Zuckerberg+0.637
 THEY+0.608
 THEIR+0.607
Neg logit contributions
pite-1.752
laughs-1.685
ittee-1.662
restricted-1.604
crew-1.585
TokenBoard
Token ID29828
Feature activation-1.29e+00

Pos logit contributions
pite+0.631
laughs+0.601
ittee+0.585
restricted+0.561
such+0.556
Neg logit contributions
 themselves-0.523
 Zuckerberg-0.523
 Mandela-0.522
 THEY-0.513
 THEIR-0.510
Token of
Token ID286
Feature activation-1.70e+00

Pos logit contributions
pite+9.953
ittee+9.951
laughs+9.247
 comprises+9.178
crew+9.008
Neg logit contributions
 THEIR-9.837
-9.734
 THEY-9.634
IELD-9.488
 Mandela-9.388
Token Directors
Token ID29200
Feature activation-8.30e-01

Pos logit contributions
ittee+10.901
pite+10.525
acia+9.998
restricted+9.782
laughs+9.724
Neg logit contributions
 THEY-12.849
 """-12.492
WARE-12.367
 Mubarak-12.303
-12.264
Token:
Token ID25
Feature activation-7.69e-02

Pos logit contributions
pite+8.106
ittee+7.754
 comprises+7.441
laughs+7.408
acia+7.153
Neg logit contributions
-5.646
IELD-5.376
 THEIR-5.376
 THEY-5.347
 Zuckerberg-5.205
Token John
Token ID1757
Feature activation-1.95e-01

Pos logit contributions
pite+0.838
laughs+0.796
ittee+0.787
restricted+0.752
acia+0.744
Neg logit contributions
 themselves-0.500
-0.498
 Zuckerberg-0.497
 Mandela-0.493
 THEY-0.487
Token T
Token ID309
Feature activation-2.63e-01

Pos logit contributions
pite+2.101
laughs+2.003
ittee+1.966
restricted+1.892
acia+1.886
Neg logit contributions
 themselves-1.243
 THEIR-1.241
 THEY-1.234
 Mandela-1.220
-1.219
Token.
Token ID13
Feature activation+7.81e-01

Pos logit contributions
pite+2.421
ittee+2.290
laughs+2.269
such+2.172
acia+2.154
Neg logit contributions
 THEIR-2.139
 Mandela-2.132
 THEY-2.127
 themselves-2.106
 Zuckerberg-2.103
Token parents
Token ID3397
Feature activation-1.55e-01

Pos logit contributions
 Mandela+1.437
 Zuckerberg+1.436
 themselves+1.425
 THEY+1.395
 THEIR+1.391
Neg logit contributions
pite-1.692
laughs-1.602
ittee-1.576
restricted-1.505
crew-1.479
Token to
Token ID284
Feature activation-9.34e-02

Pos logit contributions
pite+1.272
laughs+1.187
ittee+1.178
restricted+1.121
such+1.099
Neg logit contributions
 Mandela-1.438
 Zuckerberg-1.437
-1.416
IELD-1.392
 THEIR-1.392
Token predators
Token ID23311
Feature activation-3.99e-02

Pos logit contributions
pite+0.809
laughs+0.759
ittee+0.742
restricted+0.712
such+0.704
Neg logit contributions
 Mandela-0.820
 Zuckerberg-0.814
-0.811
 THEY-0.792
 themselves-0.791
Token,
Token ID11
Feature activation-8.19e-01

Pos logit contributions
pite+0.374
laughs+0.353
ittee+0.346
restricted+0.332
such+0.331
Neg logit contributions
 Mandela-0.325
 Zuckerberg-0.325
-0.318
IELD-0.314
 themselves-0.314
Token natural
Token ID3288
Feature activation-1.07e+00

Pos logit contributions
pite+7.069
laughs+6.688
such+6.669
ittee+6.510
ods+6.283
Neg logit contributions
-6.310
 Mandela-6.231
 THEIR-6.220
IELD-6.190
 themselves-6.141
Token disasters
Token ID24193
Feature activation-1.63e+00

Pos logit contributions
pite+7.560
laughs+7.229
crew+6.954
 distur+6.946
ods+6.875
Neg logit contributions
 Mandela-9.422
-9.289
 Zuckerberg-9.263
 Tsarnaev-9.115
ARE-9.094
Token and
Token ID290
Feature activation-4.78e-01

Pos logit contributions
pite+11.301
such+11.206
crew+10.707
restricted+10.419
affected+10.345
Neg logit contributions
 Zuckerberg-11.930
-11.922
 Mandela-11.721
IELD-11.563
 THEIR-11.382
Token hunger
Token ID16460
Feature activation-2.93e-02

Pos logit contributions
pite+4.023
laughs+3.784
ittee+3.600
such+3.563
restricted+3.543
Neg logit contributions
-4.194
 Mandela-4.050
 Zuckerberg-4.015
IELD-4.010
ARE-3.909
Token
Token ID851
Feature activation+1.37e-01

Pos logit contributions
pite+0.281
laughs+0.268
ittee+0.260
restricted+0.252
such+0.247
Neg logit contributions
 Zuckerberg-0.234
 Mandela-0.232
 themselves-0.226
-0.224
 THEIR-0.224
Token
Token ID564
Feature activation-3.32e-01

Pos logit contributions
 Zuckerberg+1.042
 Mandela+1.042
 themselves+1.036
 THEY+1.011
 THEIR+1.008
Neg logit contributions
pite-1.355
laughs-1.287
ittee-1.267
restricted-1.212
crew-1.190
Token
Token ID250
Feature activation-2.64e-01

Pos logit contributions
pite+2.749
laughs+2.627
ittee+2.538
restricted+2.423
such+2.411
Neg logit contributions
 Mandela-3.052
 Zuckerberg-3.018
 themselves-2.985
 THEIR-2.952
 THEY-2.945