TOP ACTIVATIONS
MAX = 1.687

i ā Kal an ial i il oa a
Poly techn ic University , Hung Hom ( public square in
una o Kal an ial i il oa .
. Scroll down for video White hot
ā o Kal an ial i il oa .
5 min . walk from Hung Hom M TR Station (
.' Scroll down for video Hillary Clinton
ko Kal an ial i il oa po
emo ko Kal an ial i il oa a
. Draw ing . Font Style ] :: B old )
Pasadena , California . Hide Caption 12 of 15 Photos :
Podcast . Author of The Far rah W ether s Mysteries
System . Draw ing . Color ] :: From Ar gb
at medium - high heat until the onion is clear or
. What is different , HR W said in
H . M . S . Surprise , somewhere in the
20 , 2015 . Hide Caption 8 of 15 Photos :
July 6 , 2010 R iff Tra x Reviewer
pper does have that going for it . Y
. - July 6 , 2010 R iff Tra x

MIN ACTIVATIONS
MIN = -1.661

treated . https :// t . co / ni WB qv
sen Report https :// t . co / l g S
eker Ph yt ot itan vs Torch ling
War : https :// t . co / c Q 3
ap . add _ argument (" - i ", " --
https :// t . co / X h 7
The Simpsons and the mon or ail it put Brock
. sudo ch mod + x / etc /
and another for star Matthew Mc Con aug hey 's performance
Period . https :// t . co / P 9 gc
. N ec ess ities of Life
: 30 in . The lead was short lived though ,
on Facebook Share this comment on Twitter NY T
f inance , wh itt les ea - 375 7 ,
on Facebook Share this comment on Twitter NY T
on Facebook Share this comment on Twitter NY T
meme [ 10 : 47 ] < web wood > lol
p one . 00 96 275 . g 001 Pre -
isher " pic . twitter . com / Eh U g
On September 20 2013 07 : 44 Se eker wrote :

INTERVAL 0.850 - 1.687
CONTAINS 0.211%

quiet colonial town that attracted them . " It
и с л у а е
quick sweep across the bands to find out where the R
in Little Rock with meth on the menu . Heat and
2015 2017 ) [ edit ] 2015

INTERVAL 0.013 - 0.850
CONTAINS 48.037%

who sit younger ones might no longer get home first .
rate of such digestive cancers are due to the ingestion of
plan . The reform could therefore boost GDP , at the
the internet addresses to be blocked , and a lack of
lots of interesting det ours to explore much like Tal

INTERVAL -0.824 - 0.013
CONTAINS 51.495%

strength , military - grade , aluminum - all oy body
a device like the Love t ron 9 000 , he
experience in politics , and to become perhaps a bit more
Even so , as a diplomatic courtesy Cuba invited O
t serve a mission . In fact , a

INTERVAL -1.661 - -0.824
CONTAINS 0.258%

work better than anyone else does . Be very mindful of
plant these plots with a natural mix of wild ed ibles
v . ver tex . xy z += v . normal
? GO AL ! pic . twitter . com / E
a few seconds and let Will know by clicking the link
Token i
Token ID1312
Feature activation+6.50e-01

Pos logit contributions
yip+1.849
+1.707
 XT+1.648
 Junction+1.612
advertisement+1.574
Neg logit contributions
aque-1.197
tained-1.192
.",-1.185
":""},{"-1.139
 Richards-1.100
Tokenā
Token ID10235
Feature activation+6.19e-01

Pos logit contributions
yip+4.637
+4.319
 XT+4.155
 Junction+4.073
PLE+3.932
Neg logit contributions
tained-3.480
aque-3.469
.",-3.450
":""},{"-3.368
 Richards-3.232
Token Kal
Token ID12612
Feature activation+5.15e-01

Pos logit contributions
yip+4.594
+4.258
 XT+4.101
 Junction+4.010
PLE+3.891
Neg logit contributions
tained-3.174
aque-3.172
.",-3.150
":""},{"-3.084
 Richards-2.945
Tokenan
Token ID272
Feature activation+5.34e-01

Pos logit contributions
yip+4.399
+4.109
 XT+3.973
 Junction+3.898
advertisement+3.810
Neg logit contributions
tained-2.059
.",-2.038
aque-2.020
":""},{"-1.991
 Richards-1.878
Tokenial
Token ID498
Feature activation+3.00e-01

Pos logit contributions
yip+3.715
+3.357
 XT+3.243
 Junction+3.225
PLE+3.076
Neg logit contributions
tained-2.976
aque-2.976
.",-2.961
":""},{"-2.921
 Richards-2.817
Tokeni
Token ID72
Feature activation+1.69e+00

Pos logit contributions
yip+2.480
+2.304
 XT+2.235
 Junction+2.195
advertisement+2.136
Neg logit contributions
tained-1.315
aque-1.312
.",-1.300
":""},{"-1.256
 Richards-1.199
Token
Token ID134
Feature activation+6.83e-02

Pos logit contributions
yip+7.919
+7.396
 Junction+7.062
 XT+6.948
advertisement+6.524
Neg logit contributions
tained-11.288
aque-10.933
":""},{"-10.922
.",-10.876
tro-10.587
Token
Token ID119
Feature activation+5.04e-01

Pos logit contributions
yip+0.607
+0.570
 XT+0.553
 Junction+0.543
advertisement+0.534
Neg logit contributions
tained-0.254
aque-0.254
.",-0.251
":""},{"-0.240
 Richards-0.229
Tokenil
Token ID346
Feature activation+1.65e-01

Pos logit contributions
yip+4.131
+3.791
 XT+3.665
 Junction+3.604
advertisement+3.532
Neg logit contributions
tained-2.213
aque-2.212
.",-2.204
":""},{"-2.142
 Richards-2.070
Tokenoa
Token ID12162
Feature activation-1.53e-01

Pos logit contributions
yip+1.517
+1.422
 XT+1.382
 Junction+1.358
advertisement+1.329
Neg logit contributions
tained-0.568
aque-0.567
.",-0.560
":""},{"-0.535
 Richards-0.505
Token a
Token ID257
Feature activation+1.90e-01

Pos logit contributions
aque+0.753
.",+0.750
tained+0.743
":""},{"+0.705
upon+0.691
Neg logit contributions
yip-1.205
-1.102
 XT-1.056
 Junction-1.029
advertisement-1.020
Token Poly
Token ID12280
Feature activation+2.33e-01

Pos logit contributions
aque+0.608
tained+0.608
.",+0.602
":""},{"+0.587
 Richards+0.572
Neg logit contributions
yip-0.840
-0.769
 XT-0.728
 Junction-0.713
advertisement-0.702
Tokentechn
Token ID23873
Feature activation-1.95e-02

Pos logit contributions
yip+1.382
+1.248
 XT+1.191
 Junction+1.163
advertisement+1.118
Neg logit contributions
tained-1.569
aque-1.563
.",-1.558
":""},{"-1.520
 Richards-1.478
Tokenic
Token ID291
Feature activation-3.64e-01

Pos logit contributions
aque+0.114
tained+0.114
.",+0.113
":""},{"+0.109
 Richards+0.106
Neg logit contributions
yip-0.133
-0.122
 XT-0.117
 Junction-0.114
advertisement-0.111
Token University
Token ID2059
Feature activation+2.67e-02

Pos logit contributions
aque+2.178
tained+2.149
.",+2.145
":""},{"+2.084
 Richards+2.067
Neg logit contributions
yip-2.414
-2.198
 XT-2.068
advertisement-2.024
 Junction-1.973
Token,
Token ID11
Feature activation+1.27e-01

Pos logit contributions
yip+0.196
+0.180
 XT+0.173
 Junction+0.169
advertisement+0.165
Neg logit contributions
tained-0.143
aque-0.143
.",-0.141
":""},{"-0.137
 Richards-0.133
Token Hung
Token ID12149
Feature activation+1.65e+00

Pos logit contributions
yip+0.858
+0.786
 XT+0.748
 Junction+0.730
advertisement+0.714
Neg logit contributions
aque-0.750
tained-0.750
.",-0.745
":""},{"-0.723
 Richards-0.704
Token Hom
Token ID8074
Feature activation+3.31e-01

Pos logit contributions
yip+8.862
 Junction+8.062
 XT+8.014
+7.889
PLE+7.035
Neg logit contributions
tained-10.317
aque-10.101
.",-9.962
":""},{"-9.898
upon-9.446
Token (
Token ID357
Feature activation-5.31e-02

Pos logit contributions
yip+2.391
+2.204
 XT+2.129
 Junction+2.084
PLE+2.015
Neg logit contributions
tained-1.788
aque-1.787
.",-1.772
":""},{"-1.721
 Richards-1.660
Tokenpublic
Token ID11377
Feature activation+4.44e-01

Pos logit contributions
aque+0.309
tained+0.306
.",+0.303
":""},{"+0.295
upon+0.287
Neg logit contributions
yip-0.365
-0.334
 XT-0.322
 Junction-0.314
advertisement-0.303
Token square
Token ID6616
Feature activation-4.44e-01

Pos logit contributions
yip+3.321
+3.090
 XT+2.978
 Junction+2.911
advertisement+2.822
Neg logit contributions
aque-2.258
tained-2.241
.",-2.233
":""},{"-2.180
 Richards-2.088
Token in
Token ID287
Feature activation-2.10e-01

Pos logit contributions
tained+1.922
.",+1.913
aque+1.900
":""},{"+1.817
 Richards+1.739
Neg logit contributions
yip-3.688
-3.398
 XT-3.243
 Junction-3.184
advertisement-3.154
Tokenuna
Token ID9613
Feature activation+3.87e-01

Pos logit contributions
aque+0.504
tained+0.496
.",+0.495
":""},{"+0.470
upon+0.453
Neg logit contributions
yip-1.025
-0.961
 XT-0.935
 Junction-0.918
PLE-0.892
Token o
Token ID267
Feature activation+4.95e-02

Pos logit contributions
yip+2.916
+2.698
 XT+2.607
 Junction+2.551
advertisement+2.479
Neg logit contributions
tained-1.968
aque-1.965
.",-1.947
":""},{"-1.890
 Richards-1.817
Token Kal
Token ID12612
Feature activation+5.09e-01

Pos logit contributions
yip+0.379
+0.348
 XT+0.337
 Junction+0.329
advertisement+0.323
Neg logit contributions
tained-0.253
aque-0.253
.",-0.248
":""},{"-0.240
 Richards-0.232
Tokenan
Token ID272
Feature activation+5.38e-01

Pos logit contributions
yip+4.213
+3.934
 XT+3.778
 Junction+3.706
advertisement+3.632
Neg logit contributions
tained-2.155
.",-2.141
aque-2.097
":""},{"-2.095
 Richards-1.990
Tokenial
Token ID498
Feature activation+2.11e-01

Pos logit contributions
yip+3.425
+3.059
 Junction+2.950
 XT+2.925
advertisement+2.789
Neg logit contributions
aque-3.294
tained-3.292
.",-3.286
":""},{"-3.241
 Richards-3.158
Tokeni
Token ID72
Feature activation+1.63e+00

Pos logit contributions
yip+1.725
+1.600
 XT+1.553
 Junction+1.530
advertisement+1.489
Neg logit contributions
tained-0.935
aque-0.932
.",-0.925
":""},{"-0.891
 Richards-0.853
Token
Token ID134
Feature activation+4.26e-02

Pos logit contributions
yip+5.979
+5.634
 Junction+5.478
advertisement+5.216
 XT+5.197
Neg logit contributions
tained-12.439
aque-12.006
.",-11.856
":""},{"-11.591
tro-11.501
Token
Token ID119
Feature activation+4.18e-01

Pos logit contributions
yip+0.380
+0.357
 XT+0.347
 Junction+0.341
advertisement+0.334
Neg logit contributions
tained-0.158
aque-0.158
.",-0.155
":""},{"-0.149
 Richards-0.142
Tokenil
Token ID346
Feature activation-8.37e-02

Pos logit contributions
yip+3.413
+3.142
 XT+3.031
 Junction+2.977
advertisement+2.922
Neg logit contributions
aque-1.846
tained-1.845
.",-1.838
":""},{"-1.781
 Richards-1.727
Tokenoa
Token ID12162
Feature activation-7.43e-02

Pos logit contributions
aque+0.280
tained+0.279
.",+0.275
":""},{"+0.261
 Richards+0.246
Neg logit contributions
yip-0.781
-0.733
 XT-0.714
 Junction-0.701
advertisement-0.685
Token.
Token ID13
Feature activation-1.66e-01

Pos logit contributions
aque+0.373
.",+0.371
tained+0.368
":""},{"+0.349
upon+0.343
Neg logit contributions
yip-0.577
-0.527
 XT-0.508
 Junction-0.495
advertisement-0.488
Token.
Token ID13
Feature activation+1.81e-01

Pos logit contributions
yip+2.150
+1.995
 XT+1.931
 Junction+1.893
advertisement+1.844
Neg logit contributions
aque-1.318
tained-1.318
.",-1.300
":""},{"-1.264
 Richards-1.210
Token
Token ID198
Feature activation-1.13e-01

Pos logit contributions
yip+1.329
+1.226
 XT+1.183
 Junction+1.157
advertisement+1.125
Neg logit contributions
aque-0.955
tained-0.955
.",-0.946
":""},{"-0.917
 Richards-0.884
Token
Token ID198
Feature activation+1.28e-02

Pos logit contributions
tained+0.624
aque+0.623
.",+0.621
":""},{"+0.604
 Richards+0.583
Neg logit contributions
yip-0.817
-0.743
 XT-0.716
 Junction-0.698
PLE-0.675
TokenScroll
Token ID29261
Feature activation+1.28e-01

Pos logit contributions
yip+0.093
+0.084
 XT+0.081
 Junction+0.079
PLE+0.075
Neg logit contributions
tained-0.071
.",-0.071
aque-0.071
":""},{"-0.069
upon-0.066
Token down
Token ID866
Feature activation+3.09e-01

Pos logit contributions
yip+1.040
+0.964
 XT+0.927
 Junction+0.910
PLE+0.886
Neg logit contributions
aque-0.586
tained-0.584
.",-0.581
":""},{"-0.555
 Richards-0.538
Token for
Token ID329
Feature activation+1.60e+00

Pos logit contributions
yip+2.390
+2.219
 XT+2.152
 Junction+2.104
advertisement+2.057
Neg logit contributions
aque-1.512
tained-1.506
.",-1.487
":""},{"-1.450
 Richards-1.380
Token video
Token ID2008
Feature activation+5.78e-01

Pos logit contributions
yip+6.316
 XT+5.957
+5.717
 Advertisement+5.613
 Junction+5.487
Neg logit contributions
tained-11.816
.",-11.757
aque-11.633
":""},{"-11.570
upon-11.252
Token
Token ID198
Feature activation+4.79e-03

Pos logit contributions
yip+3.806
+3.507
 XT+3.398
 Junction+3.297
PLE+3.220
Neg logit contributions
aque-3.443
tained-3.423
.",-3.403
":""},{"-3.294
upon-3.201
Token
Token ID198
Feature activation+1.05e-01

Pos logit contributions
yip+0.033
+0.031
 XT+0.030
 Junction+0.029
advertisement+0.028
Neg logit contributions
tained-0.027
aque-0.027
.",-0.027
":""},{"-0.026
 Richards-0.025
TokenWhite
Token ID12256
Feature activation-2.51e-01

Pos logit contributions
yip+0.668
+0.593
 XT+0.576
 Junction+0.559
advertisement+0.529
Neg logit contributions
tained-0.669
aque-0.666
.",-0.665
":""},{"-0.649
upon-0.629
Token hot
Token ID3024
Feature activation-3.37e-02

Pos logit contributions
aque+1.343
tained+1.333
.",+1.333
":""},{"+1.280
 Richards+1.257
Neg logit contributions
yip-1.830
-1.669
 XT-1.606
advertisement-1.564
 Junction-1.549
Tokenā
Token ID10235
Feature activation+3.72e-01

Pos logit contributions
yip+1.317
+1.222
 XT+1.182
 Junction+1.156
advertisement+1.125
Neg logit contributions
aque-0.817
tained-0.816
.",-0.807
":""},{"-0.779
 Richards-0.749
Token o
Token ID267
Feature activation-2.03e-01

Pos logit contributions
yip+2.778
+2.565
 XT+2.476
 Junction+2.422
advertisement+2.357
Neg logit contributions
tained-1.931
aque-1.930
.",-1.912
":""},{"-1.851
 Richards-1.786
Token Kal
Token ID12612
Feature activation+5.71e-01

Pos logit contributions
tained+1.072
aque+1.066
.",+1.043
":""},{"+1.002
 Richards+0.979
Neg logit contributions
yip-1.522
-1.395
 XT-1.346
 Junction-1.306
advertisement-1.298
Tokenan
Token ID272
Feature activation+4.03e-01

Pos logit contributions
yip+4.506
+4.177
 XT+4.021
 Junction+3.955
advertisement+3.854
Neg logit contributions
tained-2.639
.",-2.614
aque-2.579
":""},{"-2.573
 Richards-2.441
Tokenial
Token ID498
Feature activation+1.13e-01

Pos logit contributions
yip+2.690
+2.426
 XT+2.335
 Junction+2.318
PLE+2.210
Neg logit contributions
aque-2.382
tained-2.381
.",-2.369
":""},{"-2.332
 Richards-2.257
Tokeni
Token ID72
Feature activation+1.59e+00

Pos logit contributions
yip+0.954
+0.891
 XT+0.863
 Junction+0.845
advertisement+0.826
Neg logit contributions
aque-0.482
tained-0.482
.",-0.476
":""},{"-0.457
 Richards-0.438
Token
Token ID134
Feature activation-2.64e-02

Pos logit contributions
yip+6.294
+5.867
 Junction+5.676
 XT+5.438
advertisement+5.377
Neg logit contributions
tained-11.900
aque-11.463
.",-11.374
":""},{"-11.201
tro-11.061
Token
Token ID119
Feature activation+3.77e-01

Pos logit contributions
tained+0.095
aque+0.095
.",+0.094
":""},{"+0.090
 Richards+0.085
Neg logit contributions
yip-0.239
-0.224
 XT-0.218
 Junction-0.214
advertisement-0.209
Tokenil
Token ID346
Feature activation-6.47e-02

Pos logit contributions
yip+3.113
+2.873
 XT+2.778
 Junction+2.729
advertisement+2.670
Neg logit contributions
aque-1.640
tained-1.640
.",-1.629
":""},{"-1.578
 Richards-1.521
Tokenoa
Token ID12162
Feature activation-2.07e-01

Pos logit contributions
aque+0.217
tained+0.216
.",+0.213
":""},{"+0.202
 Richards+0.191
Neg logit contributions
yip-0.604
-0.566
 XT-0.552
 Junction-0.542
advertisement-0.529
Token.
Token ID13
Feature activation-2.13e-03

Pos logit contributions
aque+1.032
.",+1.027
tained+1.018
":""},{"+0.961
upon+0.948
Neg logit contributions
yip-1.616
-1.476
 XT-1.414
 Junction-1.375
advertisement-1.363
Token 5
Token ID642
Feature activation-3.27e-01

Pos logit contributions
yip+1.812
+1.657
 XT+1.590
 Junction+1.551
advertisement+1.504
Neg logit contributions
tained-1.616
aque-1.616
.",-1.605
":""},{"-1.560
 Richards-1.513
Token min
Token ID949
Feature activation+3.56e-01

Pos logit contributions
tained+1.585
.",+1.579
aque+1.578
":""},{"+1.492
 Richards+1.474
Neg logit contributions
yip-2.517
-2.316
 XT-2.248
advertisement-2.183
 Junction-2.172
Token.
Token ID13
Feature activation+4.85e-01

Pos logit contributions
yip+2.649
+2.446
 XT+2.359
 Junction+2.308
advertisement+2.245
Neg logit contributions
tained-1.858
aque-1.857
.",-1.842
":""},{"-1.784
 Richards-1.721
Token walk
Token ID2513
Feature activation+7.88e-02

Pos logit contributions
yip+3.600
+3.301
 XT+3.214
 Junction+3.152
PLE+3.041
Neg logit contributions
aque-2.536
tained-2.531
.",-2.490
":""},{"-2.417
 Richards-2.323
Token from
Token ID422
Feature activation+9.09e-02

Pos logit contributions
yip+0.612
+0.569
 XT+0.541
 Junction+0.529
advertisement+0.521
Neg logit contributions
.",-0.390
aque-0.388
tained-0.388
":""},{"-0.372
 Richards-0.359
Token Hung
Token ID12149
Feature activation+1.58e+00

Pos logit contributions
yip+0.657
+0.605
 XT+0.580
 Junction+0.566
advertisement+0.554
Neg logit contributions
tained-0.494
aque-0.493
.",-0.490
":""},{"-0.475
 Richards-0.460
Token Hom
Token ID8074
Feature activation+4.97e-01

Pos logit contributions
yip+8.880
 Junction+8.081
 XT+7.989
+7.940
PLE+7.158
Neg logit contributions
tained-9.729
aque-9.497
":""},{"-9.368
.",-9.333
upon-8.818
Token M
Token ID337
Feature activation-2.11e-01

Pos logit contributions
yip+3.499
+3.232
 XT+3.137
 Junction+3.099
PLE+2.968
Neg logit contributions
aque-2.738
tained-2.733
.",-2.698
":""},{"-2.640
 Richards-2.541
TokenTR
Token ID5446
Feature activation+3.45e-01

Pos logit contributions
tained+1.349
aque+1.346
.",+1.331
":""},{"+1.287
oys+1.243
Neg logit contributions
yip-1.330
-1.206
 XT-1.153
 Junction-1.125
advertisement-1.093
Token Station
Token ID9327
Feature activation-1.99e-01

Pos logit contributions
yip+2.456
+2.261
 XT+2.181
 Junction+2.133
PLE+2.065
Neg logit contributions
tained-1.901
aque-1.900
.",-1.882
":""},{"-1.830
 Richards-1.766
Token (
Token ID357
Feature activation+9.89e-02

Pos logit contributions
.",+1.010
tained+1.007
aque+1.002
":""},{"+0.952
 Richards+0.932
Neg logit contributions
yip-1.517
-1.387
 XT-1.328
 Junction-1.286
advertisement-1.284
Token.'
Token ID2637
Feature activation-3.21e-01

Pos logit contributions
yip+1.901
+1.759
 XT+1.708
 Junction+1.673
advertisement+1.630
Neg logit contributions
tained-1.175
aque-1.173
.",-1.155
":""},{"-1.122
 Richards-1.074
Token
Token ID198
Feature activation-1.49e-01

Pos logit contributions
tained+1.715
aque+1.695
.",+1.680
":""},{"+1.632
 Richards+1.599
Neg logit contributions
yip-2.375
-2.156
 XT-2.070
 Junction-2.027
PLE-1.970
Token
Token ID198
Feature activation-4.56e-02

Pos logit contributions
tained+0.829
aque+0.827
.",+0.823
":""},{"+0.800
 Richards+0.774
Neg logit contributions
yip-1.066
-0.973
 XT-0.936
 Junction-0.914
PLE-0.885
TokenScroll
Token ID29261
Feature activation+7.68e-02

Pos logit contributions
tained+0.278
aque+0.276
.",+0.275
":""},{"+0.269
upon+0.262
Neg logit contributions
yip-0.305
-0.272
 XT-0.264
 Junction-0.258
PLE-0.242
Token down
Token ID866
Feature activation+3.12e-01

Pos logit contributions
yip+0.640
+0.592
 XT+0.566
 Junction+0.557
PLE+0.542
Neg logit contributions
aque-0.341
tained-0.339
.",-0.339
":""},{"-0.319
 Richards-0.313
Token for
Token ID329
Feature activation+1.55e+00

Pos logit contributions
yip+2.384
+2.215
 XT+2.153
 Junction+2.101
advertisement+2.054
Neg logit contributions
aque-1.548
tained-1.540
.",-1.521
":""},{"-1.492
 Richards-1.411
Token video
Token ID2008
Feature activation+7.36e-01

Pos logit contributions
yip+5.764
 XT+5.557
+5.272
 Advertisement+5.271
 Junction+5.056
Neg logit contributions
.",-11.705
tained-11.641
":""},{"-11.493
aque-11.469
.","-11.072
Token
Token ID198
Feature activation+7.27e-03

Pos logit contributions
yip+4.650
+4.280
 XT+4.214
 Junction+4.044
PLE+3.994
Neg logit contributions
tained-4.491
aque-4.473
.",-4.463
":""},{"-4.301
upon-4.187
Token
Token ID198
Feature activation+1.25e-01

Pos logit contributions
yip+0.050
+0.046
 XT+0.044
 Junction+0.043
advertisement+0.042
Neg logit contributions
aque-0.042
tained-0.042
.",-0.041
":""},{"-0.040
 Richards-0.039
TokenHillary
Token ID20397
Feature activation+2.53e-02

Pos logit contributions
yip+0.794
+0.716
 XT+0.689
 Junction+0.671
advertisement+0.642
Neg logit contributions
tained-0.794
aque-0.792
.",-0.787
":""},{"-0.769
 Richards-0.745
Token Clinton
Token ID2605
Feature activation-1.71e-01

Pos logit contributions
yip+0.141
+0.127
 XT+0.121
 Junction+0.117
advertisement+0.113
Neg logit contributions
tained-0.179
aque-0.179
.",-0.178
":""},{"-0.174
 Richards-0.170
Token
Token ID198
Feature activation+3.10e-01

Pos logit contributions
yip+1.443
+1.361
 XT+1.310
 Junction+1.283
advertisement+1.253
Neg logit contributions
aque-1.392
tained-1.386
.",-1.366
":""},{"-1.324
 Richards-1.288
Tokenko
Token ID7204
Feature activation+7.65e-01

Pos logit contributions
yip+2.007
+1.832
 XT+1.755
 Junction+1.710
advertisement+1.658
Neg logit contributions
tained-1.916
aque-1.915
.",-1.902
":""},{"-1.852
 Richards-1.798
Token Kal
Token ID12612
Feature activation+1.62e-01

Pos logit contributions
yip+5.506
+5.110
 XT+4.921
 Junction+4.837
PLE+4.680
Neg logit contributions
tained-4.001
.",-3.961
aque-3.948
":""},{"-3.851
 Richards-3.713
Tokenan
Token ID272
Feature activation+3.23e-01

Pos logit contributions
yip+1.452
+1.359
 XT+1.320
 Junction+1.297
advertisement+1.267
Neg logit contributions
aque-0.602
tained-0.602
.",-0.594
":""},{"-0.568
 Richards-0.539
Tokenial
Token ID498
Feature activation+1.24e-01

Pos logit contributions
yip+2.165
+1.963
 XT+1.887
 Junction+1.863
PLE+1.784
Neg logit contributions
aque-1.911
tained-1.908
.",-1.901
":""},{"-1.860
 Richards-1.802
Tokeni
Token ID72
Feature activation+1.53e+00

Pos logit contributions
yip+1.046
+0.977
 XT+0.947
 Junction+0.927
advertisement+0.907
Neg logit contributions
aque-0.526
tained-0.525
.",-0.519
":""},{"-0.498
 Richards-0.477
Token
Token ID134
Feature activation+4.56e-04

Pos logit contributions
yip+7.599
+7.020
 Junction+6.720
 XT+6.669
advertisement+6.085
Neg logit contributions
tained-10.182
.",-9.961
aque-9.876
":""},{"-9.839
tro-9.503
Token
Token ID119
Feature activation+4.16e-01

Pos logit contributions
yip+0.004
+0.004
 XT+0.004
 Junction+0.004
advertisement+0.004
Neg logit contributions
tained-0.002
aque-0.002
.",-0.002
":""},{"-0.002
 Richards-0.001
Tokenil
Token ID346
Feature activation-5.72e-02

Pos logit contributions
yip+3.429
+3.170
 XT+3.060
 Junction+3.005
PLE+2.944
Neg logit contributions
aque-1.816
tained-1.816
.",-1.811
":""},{"-1.747
 Richards-1.686
Tokenoa
Token ID12162
Feature activation-3.41e-01

Pos logit contributions
aque+0.189
tained+0.187
.",+0.185
":""},{"+0.175
 Richards+0.165
Neg logit contributions
yip-0.538
-0.504
 XT-0.492
 Junction-0.483
advertisement-0.472
Token po
Token ID745
Feature activation+9.84e-01

Pos logit contributions
aque+1.663
.",+1.643
tained+1.633
":""},{"+1.561
upon+1.531
Neg logit contributions
yip-2.678
-2.443
 XT-2.334
advertisement-2.275
 Junction-2.274
Tokenemo
Token ID41903
Feature activation+5.03e-01

Pos logit contributions
aque+0.060
tained+0.060
.",+0.059
":""},{"+0.057
upon+0.055
Neg logit contributions
yip-0.093
-0.087
 XT-0.085
 Junction-0.083
advertisement-0.080
Token ko
Token ID41727
Feature activation+8.93e-01

Pos logit contributions
yip+3.738
+3.461
 XT+3.337
 Junction+3.268
PLE+3.172
Neg logit contributions
tained-2.596
aque-2.589
.",-2.569
":""},{"-2.499
 Richards-2.407
Token Kal
Token ID12612
Feature activation+1.97e-01

Pos logit contributions
yip+5.646
+5.206
 XT+4.982
 Junction+4.842
PLE+4.678
Neg logit contributions
tained-5.368
.",-5.284
aque-5.256
":""},{"-5.196
 Richards-5.029
Tokenan
Token ID272
Feature activation+3.70e-01

Pos logit contributions
yip+1.645
+1.533
 XT+1.486
 Junction+1.457
advertisement+1.422
Neg logit contributions
aque-0.855
tained-0.852
.",-0.843
":""},{"-0.809
 Richards-0.775
Tokenial
Token ID498
Feature activation+6.71e-03

Pos logit contributions
yip+2.851
+2.629
 XT+2.544
 Junction+2.501
advertisement+2.422
Neg logit contributions
tained-1.821
aque-1.818
.",-1.805
":""},{"-1.755
 Richards-1.686
Tokeni
Token ID72
Feature activation+1.50e+00

Pos logit contributions
yip+0.057
+0.053
 XT+0.051
 Junction+0.050
advertisement+0.049
Neg logit contributions
aque-0.028
tained-0.028
.",-0.028
":""},{"-0.026
 Richards-0.025
Token
Token ID134
Feature activation-1.73e-01

Pos logit contributions
yip+8.269
+7.717
 Junction+7.418
 XT+7.357
advertisement+6.878
Neg logit contributions
tained-9.180
":""},{"-8.898
.",-8.872
aque-8.868
tro-8.604
Token
Token ID119
Feature activation+3.69e-01

Pos logit contributions
aque+0.584
tained+0.583
.",+0.576
":""},{"+0.549
 Richards+0.513
Neg logit contributions
yip-1.609
-1.503
 XT-1.466
 Junction-1.438
PLE-1.399
Tokenil
Token ID346
Feature activation+3.85e-03

Pos logit contributions
yip+3.082
+2.850
 XT+2.761
 Junction+2.712
advertisement+2.651
Neg logit contributions
tained-1.575
aque-1.573
.",-1.563
":""},{"-1.512
 Richards-1.451
Tokenoa
Token ID12162
Feature activation-2.81e-01

Pos logit contributions
yip+0.036
+0.034
 XT+0.033
 Junction+0.032
advertisement+0.031
Neg logit contributions
aque-0.013
tained-0.013
.",-0.013
":""},{"-0.012
 Richards-0.011
Token a
Token ID257
Feature activation-3.82e-01

Pos logit contributions
aque+1.350
.",+1.346
tained+1.324
":""},{"+1.260
upon+1.235
Neg logit contributions
yip-2.242
-2.059
 XT-1.962
 Junction-1.914
advertisement-1.902
Token.
Token ID13
Feature activation+3.23e-01

Pos logit contributions
yip+3.696
+3.603
advertisement+3.498
 XT+3.492
 Junction+3.455
Neg logit contributions
aque-4.051
tained-4.006
":""},{"-3.810
.",-3.785
 Richards-3.717
TokenDraw
Token ID25302
Feature activation+1.54e-01

Pos logit contributions
yip+2.018
+1.877
 XT+1.805
 Junction+1.764
advertisement+1.732
Neg logit contributions
aque-2.016
tained-2.011
.",-1.982
":""},{"-1.932
 Richards-1.876
Tokening
Token ID278
Feature activation+3.99e-01

Pos logit contributions
yip+1.124
+1.041
 XT+1.009
 Junction+0.992
advertisement+0.972
Neg logit contributions
aque-0.810
tained-0.809
.",-0.799
":""},{"-0.776
 Richards-0.750
Token.
Token ID13
Feature activation+4.23e-01

Pos logit contributions
yip+2.515
+2.423
 XT+2.340
 Junction+2.316
advertisement+2.290
Neg logit contributions
aque-2.362
tained-2.342
.",-2.243
":""},{"-2.227
 Richards-2.162
TokenFont
Token ID23252
Feature activation+8.96e-01

Pos logit contributions
yip+2.430
+2.217
 XT+2.115
 Junction+2.062
PLE+2.028
Neg logit contributions
tained-2.853
aque-2.842
.",-2.825
":""},{"-2.771
 Richards-2.701
TokenStyle
Token ID21466
Feature activation+1.49e+00

Pos logit contributions
yip+4.538
+4.221
 XT+3.989
 Junction+3.912
PLE+3.775
Neg logit contributions
tained-6.362
aque-6.354
.",-6.251
":""},{"-6.186
 Richards-6.050
Token]
Token ID60
Feature activation-1.11e-01

Pos logit contributions
+5.194
yip+5.068
 XT+5.029
spect+4.933
 Junction+4.897
Neg logit contributions
tained-11.583
aque-11.454
.",-11.259
oys-10.793
":""},{"-10.766
Token::
Token ID3712
Feature activation-4.10e-02

Pos logit contributions
tained+0.507
aque+0.507
.",+0.502
":""},{"+0.484
 Richards+0.465
Neg logit contributions
yip-0.893
-0.830
 XT-0.804
 Junction-0.788
advertisement-0.768
TokenB
Token ID33
Feature activation+5.10e-02

Pos logit contributions
aque+0.277
.",+0.275
tained+0.275
":""},{"+0.268
 Richards+0.260
Neg logit contributions
yip-0.248
-0.221
 XT-0.211
 Junction-0.204
advertisement-0.194
Tokenold
Token ID727
Feature activation+5.31e-01

Pos logit contributions
yip+0.386
+0.357
 XT+0.345
 Junction+0.338
advertisement+0.329
Neg logit contributions
tained-0.258
aque-0.257
.",-0.255
":""},{"-0.247
 Richards-0.238
Token)
Token ID8
Feature activation+1.16e-01

Pos logit contributions
yip+3.742
+3.505
 XT+3.417
 Junction+3.341
PLE+3.253
Neg logit contributions
.",-2.847
tained-2.818
aque-2.763
":""},{"-2.734
 Richards-2.639
Token Pasadena
Token ID41010
Feature activation+1.90e-01

Pos logit contributions
tained+0.980
.",+0.970
aque+0.968
 Richards+0.943
":""},{"+0.937
Neg logit contributions
yip-0.882
-0.792
 XT-0.730
advertisement-0.712
PLE-0.698
Token,
Token ID11
Feature activation-1.79e-01

Pos logit contributions
yip+1.371
+1.268
 XT+1.231
 Junction+1.208
PLE+1.161
Neg logit contributions
aque-1.026
tained-1.026
.",-1.015
":""},{"-0.996
 Richards-0.951
Token California
Token ID3442
Feature activation+1.18e-01

Pos logit contributions
tained+1.053
aque+1.047
.",+1.037
":""},{"+1.000
 Richards+0.994
Neg logit contributions
yip-1.237
-1.125
 XT-1.064
 Junction-1.029
advertisement-1.020
Token.
Token ID13
Feature activation-2.97e-01

Pos logit contributions
yip+0.860
+0.795
 XT+0.769
 Junction+0.752
advertisement+0.727
Neg logit contributions
aque-0.634
tained-0.634
.",-0.626
":""},{"-0.613
 Richards-0.588
Token Hide
Token ID10415
Feature activation+6.15e-01

Pos logit contributions
tained+1.672
.",+1.663
aque+1.653
 Richards+1.608
":""},{"+1.587
Neg logit contributions
yip-2.128
-1.884
 XT-1.775
 Junction-1.743
advertisement-1.727
Token Caption
Token ID11260
Feature activation+1.47e+00

Pos logit contributions
yip+3.237
+3.116
 XT+3.042
 Junction+3.036
advertisement+2.842
Neg logit contributions
tained-4.187
aque-4.180
.",-4.085
":""},{"-3.988
oys-3.972
Token 12
Token ID1105
Feature activation-2.24e-01

Pos logit contributions
 XT+3.687
 Junction+3.482
+3.338
yip+2.966
 Advertisement+2.889
Neg logit contributions
aque-12.988
oys-12.838
tained-12.832
.",-12.628
":""},{"-12.441
Token of
Token ID286
Feature activation+2.65e-01

Pos logit contributions
tained+1.193
aque+1.190
.",+1.186
":""},{"+1.152
 Richards+1.109
Neg logit contributions
yip-1.658
-1.518
 XT-1.462
 Junction-1.427
PLE-1.384
Token 15
Token ID1315
Feature activation-4.07e-01

Pos logit contributions
yip+1.251
+1.179
 XT+1.117
 Junction+1.077
advertisement+1.020
Neg logit contributions
aque-2.039
tained-2.020
.",-2.008
":""},{"-1.964
tro-1.907
Token Photos
Token ID9434
Feature activation+7.57e-02

Pos logit contributions
.",+2.755
tained+2.753
aque+2.748
":""},{"+2.694
 Richards+2.649
Neg logit contributions
yip-2.449
-2.146
 XT-2.024
 Junction-1.963
advertisement-1.903
Token:
Token ID25
Feature activation-2.57e-01

Pos logit contributions
yip+0.536
+0.506
 XT+0.490
 Junction+0.482
advertisement+0.469
Neg logit contributions
aque-0.408
tained-0.402
.",-0.400
":""},{"-0.385
 Richards-0.372
Token Podcast
Token ID16036
Feature activation-3.09e-01

Pos logit contributions
yip+0.906
+0.833
 XT+0.803
 Junction+0.785
advertisement+0.762
Neg logit contributions
tained-0.720
aque-0.720
.",-0.712
":""},{"-0.693
 Richards-0.670
Token.
Token ID13
Feature activation-1.05e-02

Pos logit contributions
.",+1.548
tained+1.526
aque+1.526
":""},{"+1.461
 Richards+1.421
Neg logit contributions
yip-2.375
-2.176
 XT-2.107
 Junction-2.066
advertisement-1.998
Token Author
Token ID6434
Feature activation+2.55e-01

Pos logit contributions
aque+0.037
tained+0.037
.",+0.037
":""},{"+0.035
 Richards+0.034
Neg logit contributions
yip-0.097
-0.090
 XT-0.087
 Junction-0.086
advertisement-0.084
Token of
Token ID286
Feature activation+2.43e-01

Pos logit contributions
yip+1.681
+1.570
 XT+1.516
 Junction+1.489
advertisement+1.451
Neg logit contributions
aque-1.504
tained-1.490
.",-1.463
":""},{"-1.414
 Richards-1.377
Token The
Token ID383
Feature activation+4.68e-01

Pos logit contributions
yip+1.982
+1.857
 XT+1.803
 Junction+1.771
advertisement+1.723
Neg logit contributions
aque-1.077
tained-1.074
.",-1.061
":""},{"-1.027
 Richards-0.979
Token Far
Token ID6755
Feature activation+1.46e+00

Pos logit contributions
yip+3.493
+3.321
 XT+3.239
 Junction+3.222
advertisement+3.061
Neg logit contributions
aque-2.324
tained-2.292
.",-2.254
":""},{"-2.194
upon-2.085
Tokenrah
Token ID11392
Feature activation+6.62e-01

Pos logit contributions
yip+7.879
+7.223
 Junction+6.766
 XT+6.673
PLE+6.636
Neg logit contributions
aque-8.933
tained-8.803
":""},{"-8.588
.",-8.531
 Richards-8.207
Token W
Token ID370
Feature activation+5.39e-01

Pos logit contributions
yip+3.560
 XT+3.371
+3.303
 Junction+3.288
advertisement+3.027
Neg logit contributions
aque-4.579
tained-4.490
.",-4.388
":""},{"-4.314
tro-4.153
Tokenether
Token ID6750
Feature activation+5.05e-01

Pos logit contributions
yip+4.505
+4.192
 XT+4.076
advertisement+4.048
 Junction+3.983
Neg logit contributions
tained-2.157
aque-2.112
.",-2.077
 Richards-2.063
":""},{"-1.998
Tokens
Token ID82
Feature activation+2.56e-01

Pos logit contributions
yip+2.762
+2.629
 XT+2.475
spect+2.448
 Junction+2.438
Neg logit contributions
tained-3.381
aque-3.331
.",-3.298
":""},{"-3.190
 Richards-3.104
Token Mysteries
Token ID28298
Feature activation+1.39e-01

Pos logit contributions
yip+0.977
+0.843
 XT+0.798
 Junction+0.767
advertisement+0.713
Neg logit contributions
aque-2.248
tained-2.246
.",-2.214
":""},{"-2.183
upon-2.126
TokenSystem
Token ID11964
Feature activation+6.15e-01

Pos logit contributions
tained+0.804
aque+0.804
.",+0.803
":""},{"+0.783
tro+0.755
Neg logit contributions
yip-0.791
-0.706
 XT-0.677
 Junction-0.656
advertisement-0.625
Token.
Token ID13
Feature activation+4.19e-01

Pos logit contributions
yip+3.535
+3.445
 XT+3.344
advertisement+3.334
 Junction+3.296
Neg logit contributions
aque-3.808
tained-3.771
":""},{"-3.587
.",-3.581
 Richards-3.490
TokenDraw
Token ID25302
Feature activation+2.02e-01

Pos logit contributions
yip+2.486
+2.336
 XT+2.259
 Junction+2.205
advertisement+2.191
Neg logit contributions
aque-2.694
tained-2.678
.",-2.626
":""},{"-2.559
 Richards-2.486
Tokening
Token ID278
Feature activation+3.35e-01

Pos logit contributions
yip+1.444
+1.335
 XT+1.296
 Junction+1.275
advertisement+1.255
Neg logit contributions
aque-1.075
tained-1.074
.",-1.060
":""},{"-1.030
 Richards-0.998
Token.
Token ID13
Feature activation+5.77e-01

Pos logit contributions
yip+2.121
+2.034
 XT+1.972
 Junction+1.941
advertisement+1.928
Neg logit contributions
aque-1.987
tained-1.967
.",-1.887
":""},{"-1.869
 Richards-1.814
TokenColor
Token ID10258
Feature activation+1.45e+00

Pos logit contributions
yip+2.236
+2.094
 XT+1.935
PLE+1.880
 Junction+1.879
Neg logit contributions
tained-4.835
aque-4.832
.",-4.738
":""},{"-4.675
 Richards-4.593
Token]
Token ID60
Feature activation+9.94e-02

Pos logit contributions
+4.864
 XT+4.823
 Junction+4.755
spect+4.730
yip+4.579
Neg logit contributions
tained-11.326
aque-11.290
.",-11.180
":""},{"-10.677
oys-10.572
Token::
Token ID3712
Feature activation+1.12e-01

Pos logit contributions
yip+0.755
+0.717
 XT+0.692
 Junction+0.677
advertisement+0.668
Neg logit contributions
aque-0.486
tained-0.483
.",-0.472
":""},{"-0.457
 Richards-0.445
TokenFrom
Token ID4863
Feature activation-1.39e-02

Pos logit contributions
yip+0.755
+0.699
 XT+0.671
 Junction+0.655
PLE+0.639
Neg logit contributions
aque-0.655
tained-0.654
.",-0.648
":""},{"-0.629
 Richards-0.611
TokenAr
Token ID3163
Feature activation-2.52e-01

Pos logit contributions
aque+0.081
tained+0.081
.",+0.080
":""},{"+0.078
 Richards+0.075
Neg logit contributions
yip-0.096
-0.087
 XT-0.083
 Junction-0.081
advertisement-0.078
Tokengb
Token ID22296
Feature activation+3.41e-01

Pos logit contributions
aque+1.335
tained+1.332
.",+1.319
":""},{"+1.276
 Richards+1.222
Neg logit contributions
yip-1.868
-1.718
 XT-1.658
 Junction-1.619
PLE-1.567
Token at
Token ID379
Feature activation+7.59e-01

Pos logit contributions
yip+6.097
+5.604
 XT+5.578
 Junction+5.359
PLE+5.160
Neg logit contributions
aque-5.140
tained-5.123
":""},{"-5.049
.",-4.918
 Richards-4.729
Token medium
Token ID7090
Feature activation+1.23e+00

Pos logit contributions
yip+4.846
+4.381
 XT+4.270
 Junction+4.138
spect+3.930
Neg logit contributions
aque-4.595
tained-4.594
.",-4.590
":""},{"-4.464
 Richards-4.282
Token-
Token ID12
Feature activation+7.13e-01

Pos logit contributions
yip+7.564
+6.906
 XT+6.749
 Junction+6.542
spect+6.250
Neg logit contributions
tained-7.394
aque-7.274
":""},{"-7.189
.",-7.073
 Richards-6.766
Tokenhigh
Token ID8929
Feature activation+1.23e+00

Pos logit contributions
yip+4.995
+4.583
 XT+4.386
 Junction+4.313
spect+4.227
Neg logit contributions
tained-3.909
aque-3.878
.",-3.855
":""},{"-3.802
 Richards-3.684
Token heat
Token ID4894
Feature activation+1.15e+00

Pos logit contributions
yip+7.320
+6.823
 XT+6.671
 Junction+6.519
spect+6.186
Neg logit contributions
tained-7.459
aque-7.438
":""},{"-7.272
.",-7.180
 Richards-6.857
Token until
Token ID1566
Feature activation+1.41e+00

Pos logit contributions
yip+7.114
+6.461
 XT+6.375
 Junction+6.234
PLE+6.012
Neg logit contributions
tained-6.790
aque-6.725
":""},{"-6.695
.",-6.423
tro-6.226
Token the
Token ID262
Feature activation+6.10e-01

Pos logit contributions
yip+8.653
+8.105
 XT+7.839
 Junction+7.535
PLE+7.396
Neg logit contributions
tained-7.910
aque-7.760
":""},{"-7.709
.",-7.541
tro-7.224
Token onion
Token ID21670
Feature activation+4.00e-01

Pos logit contributions
yip+3.899
+3.613
 XT+3.499
 Junction+3.397
PLE+3.271
Neg logit contributions
aque-3.685
tained-3.683
.",-3.608
":""},{"-3.544
 Richards-3.395
Token is
Token ID318
Feature activation+1.81e-01

Pos logit contributions
yip+2.955
+2.730
 XT+2.641
 Junction+2.583
PLE+2.505
Neg logit contributions
tained-2.100
aque-2.095
.",-2.075
":""},{"-2.017
 Richards-1.942
Token clear
Token ID1598
Feature activation+1.73e-01

Pos logit contributions
yip+1.273
+1.167
 XT+1.121
 Junction+1.095
advertisement+1.069
Neg logit contributions
tained-1.017
aque-1.017
.",-1.012
":""},{"-0.976
 Richards-0.948
Token or
Token ID393
Feature activation+7.43e-01

Pos logit contributions
yip+1.308
+1.205
 XT+1.164
 Junction+1.135
advertisement+1.116
Neg logit contributions
tained-0.892
aque-0.890
.",-0.890
":""},{"-0.846
 Richards-0.822
Token.
Token ID13
Feature activation-3.36e-01

Pos logit contributions
.",+2.768
aque+2.748
tained+2.711
 Richards+2.534
":""},{"+2.499
Neg logit contributions
yip-5.009
-4.654
 XT-4.422
advertisement-4.363
PLE-4.313
Token
Token ID198
Feature activation-1.77e-01

Pos logit contributions
tained+1.639
.",+1.614
aque+1.604
":""},{"+1.566
 Richards+1.562
Neg logit contributions
yip-2.675
-2.407
 XT-2.271
 Junction-2.228
advertisement-2.212
Token
Token ID198
Feature activation+6.51e-02

Pos logit contributions
tained+0.983
aque+0.980
.",+0.975
":""},{"+0.948
 Richards+0.916
Neg logit contributions
yip-1.271
-1.159
 XT-1.114
 Junction-1.088
advertisement-1.055
TokenWhat
Token ID2061
Feature activation+2.69e-01

Pos logit contributions
yip+0.442
+0.404
 XT+0.389
 Junction+0.379
advertisement+0.366
Neg logit contributions
tained-0.383
aque-0.382
.",-0.380
":""},{"-0.369
 Richards-0.357
Token is
Token ID318
Feature activation+6.61e-01

Pos logit contributions
yip+2.103
+1.946
 XT+1.890
 Junction+1.841
advertisement+1.790
Neg logit contributions
aque-1.288
tained-1.287
.",-1.275
":""},{"-1.237
 Richards-1.174
Token different
Token ID1180
Feature activation+1.39e+00

Pos logit contributions
yip+4.785
+4.444
 XT+4.378
 Junction+4.240
advertisement+4.131
Neg logit contributions
tained-3.422
aque-3.385
.",-3.296
":""},{"-3.271
 Richards-3.083
Token,
Token ID11
Feature activation+4.97e-02

Pos logit contributions
yip+8.194
 XT+7.545
+7.510
 Junction+7.211
spect+7.020
Neg logit contributions
tained-8.110
aque-7.842
.",-7.669
":""},{"-7.553
oys-7.485
Token HR
Token ID15172
Feature activation-4.09e-01

Pos logit contributions
yip+0.389
+0.361
 XT+0.348
 Junction+0.341
advertisement+0.333
Neg logit contributions
tained-0.240
aque-0.240
.",-0.238
":""},{"-0.230
 Richards-0.222
TokenW
Token ID54
Feature activation-7.04e-01

Pos logit contributions
.",+2.786
tained+2.771
aque+2.751
":""},{"+2.738
 Richards+2.645
Neg logit contributions
yip-2.472
-2.139
 XT-2.031
 Junction-1.966
advertisement-1.856
Token said
Token ID531
Feature activation-2.35e-01

Pos logit contributions
.",+2.637
tained+2.632
aque+2.624
":""},{"+2.515
 Richards+2.485
Neg logit contributions
yip-6.145
-5.736
 XT-5.392
 Junction-5.331
advertisement-5.319
Token in
Token ID287
Feature activation+1.23e-01

Pos logit contributions
aque+1.213
tained+1.207
.",+1.197
":""},{"+1.155
 Richards+1.117
Neg logit contributions
yip-1.772
-1.637
 XT-1.571
 Junction-1.540
advertisement-1.504
Token H
Token ID367
Feature activation+3.15e-01

Pos logit contributions
yip+0.697
+0.643
 XT+0.619
 Junction+0.605
advertisement+0.590
Neg logit contributions
tained-0.486
aque-0.485
.",-0.481
":""},{"-0.466
 Richards-0.452
Token.
Token ID13
Feature activation+7.01e-01

Pos logit contributions
yip+2.358
+2.177
 XT+2.105
 Junction+2.059
advertisement+2.002
Neg logit contributions
tained-1.628
aque-1.624
.",-1.610
":""},{"-1.563
 Richards-1.504
TokenM
Token ID44
Feature activation+2.67e-01

Pos logit contributions
yip+4.643
+4.403
 XT+4.341
 Junction+4.221
PLE+4.099
Neg logit contributions
aque-4.008
tained-3.962
.",-3.843
":""},{"-3.771
tro-3.594
Token.
Token ID13
Feature activation+1.10e+00

Pos logit contributions
yip+1.840
+1.720
 XT+1.667
 Junction+1.633
advertisement+1.585
Neg logit contributions
tained-1.496
aque-1.494
.",-1.446
":""},{"-1.430
 Richards-1.360
TokenS
Token ID50
Feature activation+3.73e-01

Pos logit contributions
yip+5.170
+5.003
 XT+4.747
 Junction+4.671
PLE+4.472
Neg logit contributions
tained-7.919
aque-7.867
.",-7.571
":""},{"-7.411
oys-7.275
Token.
Token ID13
Feature activation+1.37e+00

Pos logit contributions
yip+2.811
+2.592
 XT+2.533
 Junction+2.468
advertisement+2.399
Neg logit contributions
tained-1.877
aque-1.858
":""},{"-1.815
.",-1.782
tro-1.695
Token Surprise
Token ID47893
Feature activation+2.35e-01

Pos logit contributions
yip+7.592
 XT+6.818
+6.810
 Junction+6.721
PLE+6.270
Neg logit contributions
tained-8.760
aque-8.559
.",-8.282
":""},{"-8.200
oys-7.978
Token,
Token ID11
Feature activation+1.72e-01

Pos logit contributions
yip+1.734
+1.599
 XT+1.548
 Junction+1.517
advertisement+1.470
Neg logit contributions
tained-1.232
aque-1.230
.",-1.219
":""},{"-1.188
 Richards-1.139
Token somewhere
Token ID7382
Feature activation-3.30e-01

Pos logit contributions
yip+1.325
+1.227
 XT+1.189
 Junction+1.166
advertisement+1.133
Neg logit contributions
aque-0.853
tained-0.852
.",-0.844
":""},{"-0.818
 Richards-0.781
Token in
Token ID287
Feature activation-3.51e-01

Pos logit contributions
aque+1.460
.",+1.459
tained+1.457
 Richards+1.381
":""},{"+1.373
Neg logit contributions
yip-2.734
-2.542
 XT-2.421
 Junction-2.357
advertisement-2.342
Token the
Token ID262
Feature activation-5.57e-01

Pos logit contributions
tained+1.751
aque+1.734
.",+1.723
 Richards+1.663
":""},{"+1.660
Neg logit contributions
yip-2.678
-2.487
 XT-2.363
advertisement-2.300
PLE-2.276
Token 20
Token ID1160
Feature activation-6.00e-01

Pos logit contributions
.",+1.838
tained+1.828
aque+1.815
":""},{"+1.788
 Richards+1.752
Neg logit contributions
yip-1.909
-1.693
 XT-1.602
 Junction-1.564
PLE-1.520
Token,
Token ID11
Feature activation-1.07e-01

Pos logit contributions
.",+2.729
tained+2.681
aque+2.646
":""},{"+2.530
 Richards+2.523
Neg logit contributions
yip-5.003
-4.493
 XT-4.275
PLE-4.180
 Junction-4.177
Token 2015
Token ID1853
Feature activation-1.87e-01

Pos logit contributions
tained+0.598
.",+0.597
aque+0.593
":""},{"+0.574
 Richards+0.568
Neg logit contributions
yip-0.772
-0.694
 XT-0.663
 Junction-0.652
PLE-0.637
Token.
Token ID13
Feature activation-1.94e-01

Pos logit contributions
aque+0.953
.",+0.952
tained+0.951
":""},{"+0.905
 Richards+0.883
Neg logit contributions
yip-1.422
-1.303
 XT-1.256
 Junction-1.229
advertisement-1.199
Token Hide
Token ID10415
Feature activation+6.83e-01

Pos logit contributions
.",+1.109
tained+1.108
aque+1.099
 Richards+1.062
":""},{"+1.062
Neg logit contributions
yip-1.383
-1.229
 XT-1.166
 Junction-1.147
advertisement-1.113
Token Caption
Token ID11260
Feature activation+1.37e+00

Pos logit contributions
yip+3.453
+3.348
 XT+3.273
 Junction+3.263
advertisement+3.094
Neg logit contributions
tained-4.707
aque-4.700
.",-4.575
oys-4.482
":""},{"-4.474
Token 8
Token ID807
Feature activation-2.82e-01

Pos logit contributions
 XT+4.861
 Junction+4.490
 Advertisement+4.414
+4.410
advertisement+4.269
Neg logit contributions
aque-10.648
tained-10.551
.",-10.403
oys-10.396
 grateful-9.941
Token of
Token ID286
Feature activation+4.01e-01

Pos logit contributions
tained+1.472
.",+1.466
aque+1.466
":""},{"+1.425
 Richards+1.370
Neg logit contributions
yip-2.116
-1.928
 XT-1.857
 Junction-1.814
PLE-1.759
Token 15
Token ID1315
Feature activation-3.26e-01

Pos logit contributions
yip+1.770
+1.719
 XT+1.629
 Junction+1.568
advertisement+1.470
Neg logit contributions
aque-3.145
tained-3.104
.",-3.090
":""},{"-3.021
oys-2.945
Token Photos
Token ID9434
Feature activation+9.73e-02

Pos logit contributions
tained+2.244
aque+2.242
.",+2.240
":""},{"+2.190
 Richards+2.147
Neg logit contributions
yip-1.911
-1.685
 XT-1.595
 Junction-1.547
advertisement-1.494
Token:
Token ID25
Feature activation-2.64e-01

Pos logit contributions
yip+0.683
+0.645
 XT+0.625
 Junction+0.615
advertisement+0.599
Neg logit contributions
aque-0.527
tained-0.520
.",-0.518
":""},{"-0.498
 Richards-0.481
Token July
Token ID2901
Feature activation+3.03e-01

Pos logit contributions
yip+5.157
+4.839
 XT+4.816
 Junction+4.733
 Advertisement+4.520
Neg logit contributions
aque-6.084
tained-5.981
.",-5.879
tro-5.675
":""},{"-5.655
Token 6
Token ID718
Feature activation-2.46e-02

Pos logit contributions
yip+1.677
+1.569
 XT+1.528
 Junction+1.513
PLE+1.448
Neg logit contributions
tained-2.058
aque-2.056
.",-1.992
":""},{"-1.948
oys-1.894
Token,
Token ID11
Feature activation+8.84e-02

Pos logit contributions
aque+0.132
tained+0.132
.",+0.131
":""},{"+0.127
 Richards+0.123
Neg logit contributions
yip-0.177
-0.164
 XT-0.159
 Junction-0.155
advertisement-0.151
Token 2010
Token ID3050
Feature activation+1.34e+00

Pos logit contributions
yip+0.534
+0.494
 XT+0.487
 Junction+0.466
advertisement+0.452
Neg logit contributions
aque-0.571
tained-0.566
.",-0.553
":""},{"-0.541
tro-0.525
TokenR
Token ID49
Feature activation+8.42e-01

Pos logit contributions
yip+4.890
+4.553
PLE+4.422
advertisement+4.199
spect+4.107
Neg logit contributions
aque-10.916
tained-10.798
.",-10.395
":""},{"-10.359
 Richards-10.130
Tokeniff
Token ID733
Feature activation+1.36e+00

Pos logit contributions
yip+3.234
+2.367
 XT+2.340
 Junction+2.308
advertisement+2.305
Neg logit contributions
aque-7.129
tained-7.118
":""},{"-7.093
.",-7.011
 Richards-6.814
TokenTra
Token ID15721
Feature activation-1.63e-01

Pos logit contributions
yip+4.613
+4.323
PLE+3.960
 XT+3.840
advertisement+3.821
Neg logit contributions
tained-11.214
aque-11.121
.",-10.910
 Richards-10.657
":""},{"-10.587
Tokenx
Token ID87
Feature activation-1.15e-01

Pos logit contributions
aque+0.925
tained+0.924
.",+0.917
":""},{"+0.891
 Richards+0.863
Neg logit contributions
yip-1.136
-1.039
 XT-1.000
 Junction-0.980
PLE-0.944
Token
Token ID198
Feature activation-3.65e-02

Pos logit contributions
aque+0.606
tained+0.603
.",+0.599
":""},{"+0.583
 Richards+0.563
Neg logit contributions
yip-0.851
-0.781
 XT-0.741
 Junction-0.734
advertisement-0.715
Token
Token ID198
Feature activation+3.46e-01

Pos logit contributions
aque+0.208
tained+0.208
.",+0.206
":""},{"+0.200
 Richards+0.194
Neg logit contributions
yip-0.254
-0.233
 XT-0.225
 Junction-0.219
advertisement-0.213
TokenReviewer
Token ID35407
Feature activation+2.01e-01

Pos logit contributions
yip+1.542
+1.343
 XT+1.262
 Junction+1.205
PLE+1.162
Neg logit contributions
tained-2.826
aque-2.824
.",-2.815
":""},{"-2.757
 Richards-2.698
Tokenpper
Token ID2848
Feature activation-4.62e-01

Pos logit contributions
aque+2.675
.",+2.599
tained+2.591
":""},{"+2.553
upon+2.427
Neg logit contributions
yip-3.794
-3.438
 XT-3.406
 Junction-3.268
PLE-3.106
Token does
Token ID857
Feature activation-3.48e-01

Pos logit contributions
.",+2.163
aque+2.118
tained+2.105
":""},{"+2.011
 Richards+2.006
Neg logit contributions
yip-3.698
-3.378
 XT-3.143
 Junction-3.133
PLE-3.073
Token have
Token ID423
Feature activation+1.17e-01

Pos logit contributions
.",+1.664
aque+1.620
tained+1.613
":""},{"+1.555
 Richards+1.521
Neg logit contributions
yip-2.754
-2.546
 XT-2.414
 Junction-2.383
PLE-2.339
Token that
Token ID326
Feature activation+2.47e-01

Pos logit contributions
yip+0.917
+0.850
 XT+0.822
 Junction+0.805
advertisement+0.784
Neg logit contributions
tained-0.568
aque-0.567
.",-0.562
":""},{"-0.543
 Richards-0.522
Token going
Token ID1016
Feature activation+2.36e-01

Pos logit contributions
yip+1.822
+1.684
 XT+1.627
 Junction+1.590
advertisement+1.546
Neg logit contributions
tained-1.298
aque-1.298
.",-1.282
":""},{"-1.246
 Richards-1.200
Token for
Token ID329
Feature activation+1.35e+00

Pos logit contributions
yip+1.876
+1.738
 XT+1.681
 Junction+1.648
advertisement+1.606
Neg logit contributions
tained-1.109
aque-1.109
.",-1.100
":""},{"-1.061
 Richards-1.020
Token it
Token ID340
Feature activation+4.31e-01

Pos logit contributions
yip+7.355
+6.997
 XT+6.809
 Junction+6.411
advertisement+6.221
Neg logit contributions
tained-8.669
aque-8.559
":""},{"-8.253
.",-8.047
oys-7.869
Token.
Token ID13
Feature activation-3.71e-02

Pos logit contributions
yip+3.150
+2.916
 XT+2.813
 Junction+2.746
advertisement+2.673
Neg logit contributions
tained-2.286
aque-2.283
.",-2.245
":""},{"-2.201
 Richards-2.116
Token
Token ID198
Feature activation+1.69e-01

Pos logit contributions
.",+0.181
tained+0.180
aque+0.179
":""},{"+0.175
 Richards+0.172
Neg logit contributions
yip-0.294
-0.265
 XT-0.253
 Junction-0.248
advertisement-0.244
Token
Token ID198
Feature activation+3.26e-01

Pos logit contributions
yip+1.076
+1.015
 XT+0.975
 Junction+0.954
advertisement+0.932
Neg logit contributions
aque-1.022
tained-1.017
.",-1.001
":""},{"-0.972
 Richards-0.946
TokenY
Token ID56
Feature activation+1.97e-01

Pos logit contributions
yip+2.101
+1.924
 XT+1.842
 Junction+1.793
advertisement+1.747
Neg logit contributions
tained-2.012
aque-2.011
.",-1.993
":""},{"-1.940
 Richards-1.886
Token.
Token ID13
Feature activation+2.62e-01

Pos logit contributions
yip+0.095
+0.088
 XT+0.083
 Junction+0.082
advertisement+0.080
Neg logit contributions
.",-0.065
aque-0.065
tained-0.064
":""},{"-0.062
 Richards-0.061
Token -
Token ID532
Feature activation+9.33e-01

Pos logit contributions
yip+2.019
+1.870
 XT+1.809
 Junction+1.771
advertisement+1.721
Neg logit contributions
tained-1.291
aque-1.290
.",-1.277
":""},{"-1.236
 Richards-1.186
Token July
Token ID2901
Feature activation+3.03e-01

Pos logit contributions
yip+5.157
+4.839
 XT+4.816
 Junction+4.733
 Advertisement+4.520
Neg logit contributions
aque-6.084
tained-5.981
.",-5.879
tro-5.675
":""},{"-5.655
Token 6
Token ID718
Feature activation-2.46e-02

Pos logit contributions
yip+1.677
+1.569
 XT+1.528
 Junction+1.513
PLE+1.448
Neg logit contributions
tained-2.058
aque-2.056
.",-1.992
":""},{"-1.948
oys-1.894
Token,
Token ID11
Feature activation+8.84e-02

Pos logit contributions
aque+0.132
tained+0.132
.",+0.131
":""},{"+0.127
 Richards+0.123
Neg logit contributions
yip-0.177
-0.164
 XT-0.159
 Junction-0.155
advertisement-0.151
Token 2010
Token ID3050
Feature activation+1.34e+00

Pos logit contributions
yip+0.534
+0.494
 XT+0.487
 Junction+0.466
advertisement+0.452
Neg logit contributions
aque-0.571
tained-0.566
.",-0.553
":""},{"-0.541
tro-0.525
TokenR
Token ID49
Feature activation+8.42e-01

Pos logit contributions
yip+4.890
+4.553
PLE+4.422
advertisement+4.199
spect+4.107
Neg logit contributions
aque-10.916
tained-10.798
.",-10.395
":""},{"-10.359
 Richards-10.130
Tokeniff
Token ID733
Feature activation+1.36e+00

Pos logit contributions
yip+3.234
+2.367
 XT+2.340
 Junction+2.308
advertisement+2.305
Neg logit contributions
aque-7.129
tained-7.118
":""},{"-7.093
.",-7.011
 Richards-6.814
TokenTra
Token ID15721
Feature activation-1.63e-01

Pos logit contributions
yip+4.613
+4.323
PLE+3.960
 XT+3.840
advertisement+3.821
Neg logit contributions
tained-11.214
aque-11.121
.",-10.910
 Richards-10.657
":""},{"-10.587
Tokenx
Token ID87
Feature activation-1.15e-01

Pos logit contributions
aque+0.925
tained+0.924
.",+0.917
":""},{"+0.891
 Richards+0.863
Neg logit contributions
yip-1.136
-1.039
 XT-1.000
 Junction-0.980
PLE-0.944
Token
Token ID198
Feature activation-3.65e-02

Pos logit contributions
aque+0.606
tained+0.603
.",+0.599
":""},{"+0.583
 Richards+0.563
Neg logit contributions
yip-0.851
-0.781
 XT-0.741
 Junction-0.734
advertisement-0.715
Token treated
Token ID5716
Feature activation+1.51e-01

Pos logit contributions
yip+0.597
+0.549
 XT+0.525
 Junction+0.516
advertisement+0.502
Neg logit contributions
aque-0.355
.",-0.354
tained-0.354
":""},{"-0.337
 Richards-0.331
Token.
Token ID13
Feature activation-5.15e-01

Pos logit contributions
yip+1.152
+1.060
 XT+1.022
 Junction+1.003
advertisement+0.972
Neg logit contributions
tained-0.762
aque-0.761
.",-0.757
":""},{"-0.729
 Richards-0.704
Token https
Token ID3740
Feature activation-3.67e-01

Pos logit contributions
tained+2.430
aque+2.369
.",+2.364
 Richards+2.340
":""},{"+2.338
Neg logit contributions
yip-4.084
-3.717
 XT-3.525
advertisement-3.486
 Junction-3.407
Token://
Token ID1378
Feature activation-9.23e-01

Pos logit contributions
tained+1.256
.",+1.244
aque+1.244
":""},{"+1.196
 Richards+1.117
Neg logit contributions
yip-3.425
-3.164
 XT-3.088
 Junction-3.030
advertisement-2.957
Tokent
Token ID83
Feature activation-6.23e-01

Pos logit contributions
tained+3.355
aque+3.313
.",+3.270
":""},{"+3.159
tro+3.134
Neg logit contributions
yip-8.449
-7.665
 XT-7.564
 Junction-7.441
 Heist-7.130
Token.
Token ID13
Feature activation-1.66e+00

Pos logit contributions
.",+2.901
tained+2.821
aque+2.774
":""},{"+2.716
.","+2.624
Neg logit contributions
yip-5.149
-4.631
 XT-4.463
 Junction-4.377
PLE-4.238
Tokenco
Token ID1073
Feature activation-1.18e+00

Pos logit contributions
tained+2.876
.",+2.678
aque+2.484
":""},{"+2.416
tro+2.389
Neg logit contributions
yip-17.599
-16.026
 XT-15.980
 srf-15.925
senal-15.785
Token/
Token ID14
Feature activation-7.20e-01

Pos logit contributions
":""},{"+4.431
tained+4.427
.",+4.382
aque+4.196
upon+4.142
Neg logit contributions
yip-10.524
 srf-9.379
-9.213
 XT-9.111
senal-8.941
Tokenni
Token ID8461
Feature activation+1.61e-01

Pos logit contributions
aque+4.201
tained+4.072
.",+4.018
tro+3.979
":""},{"+3.942
Neg logit contributions
yip-4.619
-4.374
 XT-4.183
 Junction-4.156
advertisement-3.903
TokenWB
Token ID45607
Feature activation-9.42e-02

Pos logit contributions
yip+1.028
+0.936
 XT+0.898
 Junction+0.875
advertisement+0.844
Neg logit contributions
aque-1.007
tained-1.006
.",-0.999
":""},{"-0.973
 Richards-0.944
Tokenqv
Token ID44179
Feature activation-1.50e-01

Pos logit contributions
aque+0.556
tained+0.551
.",+0.545
":""},{"+0.530
upon+0.515
Neg logit contributions
yip-0.636
-0.587
 XT-0.564
 Junction-0.553
advertisement-0.531
Tokensen
Token ID6248
Feature activation+2.77e-01

Pos logit contributions
aque+0.755
tained+0.747
.",+0.733
":""},{"+0.717
 Richards+0.703
Neg logit contributions
yip-0.858
-0.790
 XT-0.758
 Junction-0.732
advertisement-0.726
Token Report
Token ID6358
Feature activation-5.68e-01

Pos logit contributions
yip+2.330
+2.173
 XT+2.110
 Junction+2.070
advertisement+2.017
Neg logit contributions
aque-1.175
tained-1.173
.",-1.161
":""},{"-1.118
 Richards-1.065
Token https
Token ID3740
Feature activation-1.25e-02

Pos logit contributions
tained+2.853
.",+2.796
aque+2.748
":""},{"+2.694
 Richards+2.609
Neg logit contributions
yip-4.302
-3.936
 XT-3.720
 Junction-3.667
advertisement-3.600
Token://
Token ID1378
Feature activation-5.39e-01

Pos logit contributions
aque+0.047
tained+0.047
.",+0.046
":""},{"+0.044
 Richards+0.042
Neg logit contributions
yip-0.110
-0.104
 XT-0.101
 Junction-0.099
advertisement-0.097
Tokent
Token ID83
Feature activation-7.13e-01

Pos logit contributions
tained+2.161
aque+2.137
.",+2.105
":""},{"+2.035
tro+1.989
Neg logit contributions
yip-4.727
-4.338
 XT-4.263
 Junction-4.163
PLE-4.014
Token.
Token ID13
Feature activation-1.66e+00

Pos logit contributions
.",+3.206
tained+3.082
aque+3.007
":""},{"+2.968
upon+2.891
Neg logit contributions
yip-6.099
-5.411
 XT-5.246
 Junction-5.135
 srf-5.039
Tokenco
Token ID1073
Feature activation-9.86e-01

Pos logit contributions
tained+3.002
.",+2.919
tro+2.699
aque+2.579
":""},{"+2.537
Neg logit contributions
yip-17.400
 srf-16.171
 earthqu-16.023
-15.936
senal-15.844
Token/
Token ID14
Feature activation-5.73e-01

Pos logit contributions
tained+3.928
.",+3.897
":""},{"+3.894
aque+3.719
upon+3.697
Neg logit contributions
yip-8.675
-7.624
 srf-7.521
 XT-7.489
 Junction-7.243
Tokenl
Token ID75
Feature activation-1.74e-01

Pos logit contributions
aque+3.361
tained+3.274
.",+3.237
tro+3.170
":""},{"+3.169
Neg logit contributions
yip-3.724
-3.511
 XT-3.354
 Junction-3.326
advertisement-3.138
Tokeng
Token ID70
Feature activation+1.08e-01

Pos logit contributions
aque+1.063
tained+1.052
.",+1.044
":""},{"+1.017
tro+0.990
Neg logit contributions
yip-1.129
-1.044
 XT-0.996
 Junction-0.981
advertisement-0.937
TokenS
Token ID50
Feature activation+1.26e-02

Pos logit contributions
yip+0.709
+0.649
 XT+0.622
 Junction+0.607
advertisement+0.586
Neg logit contributions
aque-0.663
tained-0.662
.",-0.657
":""},{"-0.640
 Richards-0.620
Tokeneker
Token ID28233
Feature activation+2.24e-02

Pos logit contributions
aque+2.083
.",+2.046
tained+2.040
":""},{"+1.912
upon+1.885
Neg logit contributions
yip-4.206
-3.992
 XT-3.815
 Junction-3.734
PLE-3.705
Token
Token ID198
Feature activation-2.19e-02

Pos logit contributions
yip+0.159
+0.146
 XT+0.140
 Junction+0.137
advertisement+0.133
Neg logit contributions
tained-0.125
aque-0.125
.",-0.124
":""},{"-0.120
 Richards-0.116
Token
Token ID198
Feature activation+7.19e-02

Pos logit contributions
aque+0.127
tained+0.127
.",+0.125
":""},{"+0.122
 Richards+0.118
Neg logit contributions
yip-0.150
-0.138
 XT-0.133
 Junction-0.130
advertisement-0.126
TokenPh
Token ID2725
Feature activation-1.70e-01

Pos logit contributions
yip+0.456
+0.410
 XT+0.396
 Junction+0.384
advertisement+0.367
Neg logit contributions
tained-0.457
aque-0.457
.",-0.454
":""},{"-0.443
 Richards-0.430
Tokenyt
Token ID20760
Feature activation-5.72e-01

Pos logit contributions
aque+0.879
tained+0.862
.",+0.860
":""},{"+0.811
tro+0.803
Neg logit contributions
yip-1.270
-1.205
 XT-1.167
 Junction-1.124
PLE-1.093
Tokenot
Token ID313
Feature activation-1.61e+00

Pos logit contributions
aque+2.561
.",+2.511
tained+2.487
":""},{"+2.364
oys+2.310
Neg logit contributions
yip-4.667
-4.315
 XT-4.202
 Junction-4.115
advertisement-4.033
Tokenitan
Token ID18642
Feature activation-1.25e-01

Pos logit contributions
.",+7.983
aque+7.818
tained+7.437
oys+7.344
 Richards+7.298
Neg logit contributions
yip-10.333
 XT-9.721
advertisement-9.704
 Junction-9.589
-9.560
Token vs
Token ID3691
Feature activation+1.35e-01

Pos logit contributions
aque+0.701
tained+0.698
.",+0.692
":""},{"+0.678
 Richards+0.660
Neg logit contributions
yip-0.884
-0.812
 XT-0.777
 Junction-0.754
PLE-0.746
Token Torch
Token ID34868
Feature activation+1.44e-01

Pos logit contributions
yip+0.926
+0.848
 XT+0.815
 Junction+0.794
advertisement+0.772
Neg logit contributions
tained-0.786
aque-0.786
.",-0.780
":""},{"-0.758
 Richards-0.735
Tokenling
Token ID1359
Feature activation+2.36e-02

Pos logit contributions
yip+1.035
+0.952
 XT+0.917
 Junction+0.895
advertisement+0.873
Neg logit contributions
tained-0.787
aque-0.786
.",-0.780
":""},{"-0.757
 Richards-0.734
Token
Token ID198
Feature activation+2.97e-02

Pos logit contributions
yip+0.163
+0.150
 XT+0.144
 Junction+0.139
PLE+0.137
Neg logit contributions
aque-0.135
tained-0.134
.",-0.133
":""},{"-0.129
 Richards-0.127
Token War
Token ID1810
Feature activation+4.45e-01

Pos logit contributions
yip+2.443
+2.252
 XT+2.174
 Junction+2.142
advertisement+2.001
Neg logit contributions
aque-3.012
tained-2.980
.",-2.959
":""},{"-2.892
 Richards-2.833
Token:
Token ID25
Feature activation-1.34e-01

Pos logit contributions
yip+3.156
+2.934
 XT+2.860
 Junction+2.809
advertisement+2.702
Neg logit contributions
aque-2.416
tained-2.403
.",-2.351
":""},{"-2.305
 Richards-2.220
Token https
Token ID3740
Feature activation+1.97e-01

Pos logit contributions
tained+0.689
.",+0.684
aque+0.682
":""},{"+0.662
 Richards+0.642
Neg logit contributions
yip-1.019
-0.928
 XT-0.894
 Junction-0.882
advertisement-0.850
Token://
Token ID1378
Feature activation-5.69e-01

Pos logit contributions
yip+1.608
+1.565
 XT+1.508
 Junction+1.477
advertisement+1.461
Neg logit contributions
aque-0.823
tained-0.803
.",-0.781
":""},{"-0.742
 Richards-0.725
Tokent
Token ID83
Feature activation-5.56e-01

Pos logit contributions
tained+2.285
aque+2.245
.",+2.213
":""},{"+2.127
tro+2.075
Neg logit contributions
yip-4.971
-4.598
 XT-4.512
 Junction-4.438
PLE-4.263
Token.
Token ID13
Feature activation-1.61e+00

Pos logit contributions
.",+2.616
tained+2.555
aque+2.512
":""},{"+2.452
upon+2.371
Neg logit contributions
yip-4.559
-4.128
 XT-3.969
 Junction-3.892
PLE-3.768
Tokenco
Token ID1073
Feature activation-1.11e+00

Pos logit contributions
tained+2.828
.",+2.694
aque+2.519
tro+2.511
":""},{"+2.386
Neg logit contributions
yip-16.988
 srf-15.596
-15.530
 XT-15.423
 earthqu-15.252
Token/
Token ID14
Feature activation-6.28e-01

Pos logit contributions
tained+4.171
":""},{"+4.136
.",+4.107
aque+3.924
upon+3.912
Neg logit contributions
yip-9.930
 srf-8.887
-8.683
 XT-8.609
senal-8.423
Tokenc
Token ID66
Feature activation-1.28e-01

Pos logit contributions
aque+3.672
tained+3.571
.",+3.524
tro+3.474
":""},{"+3.456
Neg logit contributions
yip-4.074
-3.848
 XT-3.685
 Junction-3.655
advertisement-3.436
TokenQ
Token ID48
Feature activation-5.42e-01

Pos logit contributions
aque+0.785
tained+0.781
.",+0.773
":""},{"+0.755
upon+0.732
Neg logit contributions
yip-0.828
-0.761
 XT-0.729
 Junction-0.715
advertisement-0.685
Token3
Token ID18
Feature activation-9.18e-02

Pos logit contributions
aque+3.186
tained+3.112
.",+3.070
":""},{"+2.998
tro+2.992
Neg logit contributions
yip-3.542
-3.334
 XT-3.185
 Junction-3.155
advertisement-2.988
Token ap
Token ID2471
Feature activation+5.12e-02

Pos logit contributions
.",+0.424
aque+0.424
tained+0.423
":""},{"+0.408
 Richards+0.394
Neg logit contributions
yip-0.598
-0.551
 XT-0.528
 Junction-0.518
advertisement-0.504
Token.
Token ID13
Feature activation-3.22e-01

Pos logit contributions
yip+0.363
+0.338
 XT+0.326
 Junction+0.318
advertisement+0.311
Neg logit contributions
aque-0.280
tained-0.280
.",-0.274
":""},{"-0.268
 Richards-0.259
Tokenadd
Token ID2860
Feature activation-3.17e-01

Pos logit contributions
aque+0.969
tained+0.943
.",+0.941
":""},{"+0.888
upon+0.828
Neg logit contributions
yip-3.132
-2.952
 XT-2.870
 Junction-2.834
PLE-2.755
Token_
Token ID62
Feature activation-7.54e-02

Pos logit contributions
tained+1.473
aque+1.465
.",+1.459
":""},{"+1.416
upon+1.350
Neg logit contributions
yip-2.578
-2.351
 XT-2.286
 Junction-2.228
advertisement-2.154
Tokenargument
Token ID49140
Feature activation-2.60e-01

Pos logit contributions
aque+0.332
tained+0.331
.",+0.328
":""},{"+0.316
 Richards+0.301
Neg logit contributions
yip-0.623
-0.580
 XT-0.562
 Junction-0.551
advertisement-0.536
Token("
Token ID7203
Feature activation-1.60e+00

Pos logit contributions
tained+1.502
.",+1.502
":""},{"+1.487
aque+1.483
upon+1.402
Neg logit contributions
yip-1.815
-1.642
 XT-1.571
 Junction-1.549
advertisement-1.488
Token-
Token ID12
Feature activation-2.30e-01

Pos logit contributions
.",+5.341
tained+5.110
aque+5.090
":""},{"+4.999
tro+4.970
Neg logit contributions
yip-14.094
-12.789
��-12.412
 XT-12.352
 srf-12.301
Tokeni
Token ID72
Feature activation-2.64e-01

Pos logit contributions
aque+0.919
.",+0.917
tained+0.916
":""},{"+0.875
upon+0.825
Neg logit contributions
yip-2.001
-1.877
 XT-1.826
 Junction-1.789
advertisement-1.728
Token",
Token ID1600
Feature activation-3.54e-01

Pos logit contributions
.",+0.994
tained+0.972
aque+0.971
":""},{"+0.939
 Richards+0.875
Neg logit contributions
yip-2.390
-2.214
 XT-2.157
 Junction-2.120
advertisement-2.062
Token "
Token ID366
Feature activation-1.20e+00

Pos logit contributions
tained+1.381
.",+1.378
aque+1.366
":""},{"+1.325
 Richards+1.251
Neg logit contributions
yip-3.130
-2.892
 XT-2.766
 Junction-2.725
advertisement-2.686
Token--
Token ID438
Feature activation-3.99e-01

Pos logit contributions
.",+2.557
tained+2.521
aque+2.501
":""},{"+2.270
upon+2.180
Neg logit contributions
yip-12.260
-11.479
 XT-11.077
 Junction-11.056
advertisement-10.687
Token
Token ID235
Feature activation-9.50e-01

Pos logit contributions
yip+0.312
+0.286
 XT+0.277
 Junction+0.271
advertisement+0.262
Neg logit contributions
tained-0.238
aque-0.236
.",-0.234
":""},{"-0.227
 Richards-0.219
Token
Token ID222
Feature activation-4.22e-01

Pos logit contributions
tained+5.265
aque+4.841
+4.823
.",+4.794
":""},{"+4.702
Neg logit contributions
yip-6.482
 XT-5.702
-5.699
 Junction-5.645
advertisement-5.450
Token https
Token ID3740
Feature activation+2.86e-02

Pos logit contributions
tained+1.944
aque+1.923
.",+1.917
":""},{"+1.859
 Richards+1.832
Neg logit contributions
yip-3.315
-3.062
 XT-2.966
 Junction-2.916
advertisement-2.892
Token://
Token ID1378
Feature activation-9.61e-01

Pos logit contributions
yip+0.247
+0.236
 XT+0.228
 Junction+0.224
advertisement+0.220
Neg logit contributions
aque-0.111
tained-0.110
.",-0.108
":""},{"-0.103
 Richards-0.099
Tokent
Token ID83
Feature activation-5.73e-01

Pos logit contributions
tained+3.414
aque+3.322
.",+3.270
tro+3.185
":""},{"+3.077
Neg logit contributions
yip-8.719
-7.984
 XT-7.979
 Junction-7.782
 Heist-7.494
Token.
Token ID13
Feature activation-1.55e+00

Pos logit contributions
.",+2.697
tained+2.642
aque+2.594
":""},{"+2.531
upon+2.450
Neg logit contributions
yip-4.662
-4.234
 XT-4.066
 Junction-3.991
PLE-3.877
Tokenco
Token ID1073
Feature activation-1.13e+00

Pos logit contributions
tained+2.802
.",+2.666
tro+2.540
":""},{"+2.424
aque+2.375
Neg logit contributions
yip-16.231
 XT-14.832
 srf-14.775
-14.665
 Junction-14.583
Token/
Token ID14
Feature activation-6.54e-01

Pos logit contributions
":""},{"+4.304
tained+4.296
.",+4.285
aque+4.054
upon+4.035
Neg logit contributions
yip-10.045
 srf-8.883
-8.807
 XT-8.656
senal-8.417
TokenX
Token ID55
Feature activation-2.92e-01

Pos logit contributions
aque+3.826
tained+3.718
.",+3.676
tro+3.621
":""},{"+3.604
Neg logit contributions
yip-4.228
-4.001
 XT-3.827
 Junction-3.796
advertisement-3.571
Tokenh
Token ID71
Feature activation+2.72e-01

Pos logit contributions
aque+1.726
tained+1.694
.",+1.679
":""},{"+1.638
tro+1.609
Neg logit contributions
yip-1.948
-1.815
 XT-1.743
 Junction-1.714
advertisement-1.635
Token7
Token ID22
Feature activation-2.18e-03

Pos logit contributions
yip+1.795
+1.634
 XT+1.571
 Junction+1.529
PLE+1.486
Neg logit contributions
tained-1.651
aque-1.647
.",-1.638
":""},{"-1.594
 Richards-1.546
Token The
Token ID383
Feature activation+3.27e-01

Pos logit contributions
yip+0.006
+0.006
 XT+0.006
 Junction+0.005
PLE+0.005
Neg logit contributions
tained-0.004
aque-0.004
.",-0.004
":""},{"-0.004
 Richards-0.004
Token Simpsons
Token ID34376
Feature activation+1.47e-01

Pos logit contributions
yip+2.186
+1.998
 XT+1.918
 Junction+1.870
advertisement+1.813
Neg logit contributions
tained-1.956
aque-1.956
.",-1.941
":""},{"-1.888
 Richards-1.831
Token and
Token ID290
Feature activation+1.67e-01

Pos logit contributions
yip+1.154
+1.065
 XT+1.027
 Junction+1.002
PLE+0.982
Neg logit contributions
tained-0.704
aque-0.703
.",-0.701
":""},{"-0.674
 Richards-0.650
Token the
Token ID262
Feature activation-5.53e-02

Pos logit contributions
yip+1.240
+1.144
 XT+1.104
 Junction+1.079
advertisement+1.050
Neg logit contributions
tained-0.870
aque-0.869
.",-0.862
":""},{"-0.835
 Richards-0.806
Token mon
Token ID937
Feature activation+5.88e-02

Pos logit contributions
tained+0.315
.",+0.314
aque+0.313
":""},{"+0.303
 Richards+0.299
Neg logit contributions
yip-0.386
-0.353
 XT-0.335
 Junction-0.325
PLE-0.321
Tokenor
Token ID273
Feature activation-1.54e+00

Pos logit contributions
yip+0.418
+0.385
 XT+0.370
 Junction+0.359
PLE+0.350
Neg logit contributions
aque-0.328
tained-0.327
.",-0.326
":""},{"-0.315
 Richards-0.304
Tokenail
Token ID603
Feature activation-2.81e-01

Pos logit contributions
.",+4.534
aque+4.451
tained+4.417
oys+4.161
iles+3.814
Neg logit contributions
yip-13.695
-12.794
 XT-12.716
advertisement-12.325
PLE-12.120
Token
Token ID960
Feature activation-7.42e-02

Pos logit contributions
aque+1.239
.",+1.234
tained+1.225
":""},{"+1.162
 Richards+1.138
Neg logit contributions
yip-2.301
-2.134
 XT-2.028
PLE-1.962
 Junction-1.961
Tokenit
Token ID270
Feature activation-5.88e-02

Pos logit contributions
aque+0.368
tained+0.364
.",+0.364
":""},{"+0.351
upon+0.341
Neg logit contributions
yip-0.573
-0.528
 XT-0.513
 Junction-0.499
advertisement-0.481
Token put
Token ID1234
Feature activation-5.17e-01

Pos logit contributions
aque+0.286
tained+0.283
.",+0.281
":""},{"+0.271
 Richards+0.261
Neg logit contributions
yip-0.463
-0.425
 XT-0.413
 Junction-0.405
advertisement-0.392
Token Brock
Token ID20501
Feature activation-2.52e-01

Pos logit contributions
.",+2.498
aque+2.489
tained+2.461
 Richards+2.375
upon+2.372
Neg logit contributions
yip-3.949
-3.549
 XT-3.488
 Junction-3.356
PLE-3.354
Token.
Token ID13
Feature activation+2.45e-01

Pos logit contributions
yip+1.452
+1.335
 XT+1.289
 Junction+1.267
advertisement+1.232
Neg logit contributions
tained-0.901
.",-0.900
aque-0.898
":""},{"-0.857
 Richards-0.827
Token
Token ID198
Feature activation+2.35e-03

Pos logit contributions
yip+1.770
+1.630
 XT+1.570
 Junction+1.535
advertisement+1.492
Neg logit contributions
tained-1.327
aque-1.326
.",-1.315
":""},{"-1.276
 Richards-1.233
Token
Token ID198
Feature activation-3.23e-02

Pos logit contributions
yip+0.016
+0.015
 XT+0.014
 Junction+0.014
advertisement+0.014
Neg logit contributions
aque-0.014
tained-0.014
.",-0.013
":""},{"-0.013
 Richards-0.013
Tokensudo
Token ID24032
Feature activation-5.40e-01

Pos logit contributions
aque+0.187
tained+0.186
.",+0.185
":""},{"+0.180
upon+0.174
Neg logit contributions
yip-0.222
-0.204
 XT-0.198
 Junction-0.194
advertisement-0.185
Token ch
Token ID442
Feature activation-5.12e-01

Pos logit contributions
.",+2.571
aque+2.538
tained+2.522
":""},{"+2.403
 Richards+2.374
Neg logit contributions
yip-4.185
-3.947
 XT-3.642
 Junction-3.625
advertisement-3.586
Tokenmod
Token ID4666
Feature activation-1.49e+00

Pos logit contributions
aque+2.030
tained+1.896
.",+1.863
tro+1.753
upon+1.666
Neg logit contributions
yip-4.566
-4.301
 XT-4.107
 Junction-4.042
PLE-4.023
Token +
Token ID1343
Feature activation-9.35e-01

Pos logit contributions
aque+7.297
.",+7.054
tained+6.850
":""},{"+6.785
+6.479
Neg logit contributions
yip-10.698
-9.575
PLE-9.072
spect-9.061
 Junction-8.939
Tokenx
Token ID87
Feature activation-5.69e-01

Pos logit contributions
aque+4.330
.",+4.197
tained+3.901
":""},{"+3.731
 Richards+3.725
Neg logit contributions
yip-7.066
-6.810
 XT-6.474
 Junction-6.470
advertisement-6.321
Token /
Token ID1220
Feature activation-5.08e-01

Pos logit contributions
.",+2.897
tained+2.832
aque+2.799
":""},{"+2.711
 Richards+2.605
Neg logit contributions
yip-4.210
-3.897
 Junction-3.670
advertisement-3.633
 XT-3.621
Tokenetc
Token ID14784
Feature activation-6.56e-02

Pos logit contributions
aque+2.288
.",+2.276
tained+2.254
":""},{"+2.123
tro+2.057
Neg logit contributions
yip-4.129
-3.819
 XT-3.649
 Junction-3.646
advertisement-3.502
Token/
Token ID14
Feature activation-8.21e-01

Pos logit contributions
tained+0.339
aque+0.338
.",+0.336
":""},{"+0.326
 Richards+0.313
Neg logit contributions
yip-0.493
-0.454
 XT-0.438
 Junction-0.428
advertisement-0.415
Tokenand
Token ID392
Feature activation-1.55e-01

Pos logit contributions
aque+2.060
.",+2.047
tained+2.036
upon+1.965
":""},{"+1.958
Neg logit contributions
yip-2.697
-2.410
 XT-2.349
 Junction-2.274
PLE-2.136
Token another
Token ID1194
Feature activation-4.24e-01

Pos logit contributions
tained+0.786
aque+0.783
.",+0.783
":""},{"+0.751
 Richards+0.729
Neg logit contributions
yip-1.190
-1.095
 XT-1.043
 Junction-1.021
PLE-0.999
Token for
Token ID329
Feature activation-1.44e-01

Pos logit contributions
tained+1.975
.",+1.971
aque+1.962
":""},{"+1.882
 Richards+1.828
Neg logit contributions
yip-3.440
-3.152
 XT-2.968
 Junction-2.916
PLE-2.874
Token star
Token ID3491
Feature activation-1.60e-01

Pos logit contributions
.",+0.732
tained+0.731
aque+0.728
":""},{"+0.701
 Richards+0.680
Neg logit contributions
yip-1.100
-1.011
 XT-0.957
 Junction-0.941
PLE-0.923
Token Matthew
Token ID9308
Feature activation-4.49e-01

Pos logit contributions
tained+0.822
.",+0.804
aque+0.801
":""},{"+0.772
 Richards+0.758
Neg logit contributions
yip-1.244
-1.129
 XT-1.071
 Junction-1.053
advertisement-1.023
Token Mc
Token ID1982
Feature activation-1.49e+00

Pos logit contributions
aque+2.256
.",+2.185
 Richards+2.172
tained+2.030
tro+2.002
Neg logit contributions
yip-3.664
-3.217
 XT-3.064
PLE-2.989
advertisement-2.987
TokenCon
Token ID3103
Feature activation-7.55e-02

Pos logit contributions
.",+6.286
aque+5.967
tained+5.593
 Richards+5.555
alus+5.426
Neg logit contributions
yip-12.694
 XT-11.379
-11.160
 Junction-10.235
utra-10.184
Tokenaug
Token ID7493
Feature activation-9.47e-01

Pos logit contributions
tained+0.227
.",+0.220
":""},{"+0.219
aque+0.219
upon+0.195
Neg logit contributions
yip-0.719
-0.679
 Junction-0.660
 XT-0.658
spect-0.638
Tokenhey
Token ID20342
Feature activation-2.59e-01

Pos logit contributions
aque+3.069
.",+2.969
tained+2.875
oys+2.644
alus+2.636
Neg logit contributions
yip-9.010
-8.432
 XT-8.109
advertisement-7.829
PLE-7.807
Token's
Token ID338
Feature activation-4.17e-01

Pos logit contributions
aque+1.122
tained+1.122
.",+1.111
":""},{"+1.047
 Richards+1.019
Neg logit contributions
yip-2.186
-2.006
 XT-1.931
 Junction-1.892
PLE-1.864
Token performance
Token ID2854
Feature activation-4.71e-01

Pos logit contributions
tained+2.198
.",+2.174
aque+2.166
":""},{"+2.075
 Richards+2.067
Neg logit contributions
yip-3.118
-2.842
 XT-2.663
 Junction-2.602
PLE-2.596
Token Period
Token ID18581
Feature activation+4.87e-01

Pos logit contributions
.",+1.332
tained+1.329
aque+1.322
":""},{"+1.299
 Richards+1.286
Neg logit contributions
yip-2.088
-1.874
 XT-1.761
advertisement-1.716
 Junction-1.714
Token.
Token ID13
Feature activation-2.05e-01

Pos logit contributions
yip+3.481
+3.206
 XT+3.099
 Junction+3.027
PLE+2.930
Neg logit contributions
tained-2.640
aque-2.636
.",-2.590
":""},{"-2.545
 Richards-2.457
Token https
Token ID3740
Feature activation+3.04e-02

Pos logit contributions
tained+1.065
aque+1.061
.",+1.060
":""},{"+1.038
 Richards+1.014
Neg logit contributions
yip-1.539
-1.397
 XT-1.316
 Junction-1.280
advertisement-1.278
Token://
Token ID1378
Feature activation-5.97e-01

Pos logit contributions
yip+0.262
+0.250
 XT+0.241
 Junction+0.238
advertisement+0.233
Neg logit contributions
aque-0.118
tained-0.117
.",-0.115
":""},{"-0.109
 Richards-0.105
Tokent
Token ID83
Feature activation-5.28e-01

Pos logit contributions
tained+2.350
aque+2.332
.",+2.243
tro+2.186
":""},{"+2.158
Neg logit contributions
yip-5.206
-4.854
 XT-4.815
 Junction-4.685
PLE-4.498
Token.
Token ID13
Feature activation-1.48e+00

Pos logit contributions
.",+2.476
tained+2.421
aque+2.392
":""},{"+2.311
upon+2.252
Neg logit contributions
yip-4.315
-3.941
 XT-3.793
 Junction-3.710
PLE-3.592
Tokenco
Token ID1073
Feature activation-1.05e+00

Pos logit contributions
tained+2.729
.",+2.576
aque+2.567
tro+2.462
upon+2.244
Neg logit contributions
yip-15.513
-14.356
 XT-14.233
 srf-13.982
 Junction-13.944
Token/
Token ID14
Feature activation-6.22e-01

Pos logit contributions
":""},{"+4.141
.",+4.110
tained+4.098
aque+3.917
upon+3.850
Neg logit contributions
yip-9.311
-8.159
 srf-8.152
 XT-8.032
 Junction-7.757
TokenP
Token ID47
Feature activation-2.99e-01

Pos logit contributions
aque+3.643
tained+3.540
.",+3.503
tro+3.445
":""},{"+3.429
Neg logit contributions
yip-4.022
-3.803
 XT-3.637
 Junction-3.605
advertisement-3.397
Token9
Token ID24
Feature activation-1.77e-01

Pos logit contributions
aque+1.751
tained+1.723
.",+1.703
":""},{"+1.656
upon+1.627
Neg logit contributions
yip-2.009
-1.879
 XT-1.794
 Junction-1.767
advertisement-1.693
Tokengc
Token ID36484
Feature activation-7.30e-02

Pos logit contributions
aque+1.050
tained+1.037
.",+1.026
":""},{"+0.999
tro+0.975
Neg logit contributions
yip-1.182
-1.101
 XT-1.054
 Junction-1.034
advertisement-0.991
Token.
Token ID13
Feature activation-2.26e-01

Pos logit contributions
yip+1.139
+1.048
 XT+1.011
 Junction+0.989
advertisement+0.960
Neg logit contributions
.",-0.766
tained-0.764
aque-0.762
":""},{"-0.735
 Richards-0.712
Token
Token ID198
Feature activation+4.89e-04

Pos logit contributions
tained+1.138
.",+1.130
aque+1.126
 Richards+1.102
":""},{"+1.102
Neg logit contributions
yip-1.777
-1.556
 XT-1.485
 Junction-1.449
advertisement-1.427
Token
Token ID198
Feature activation+7.90e-02

Pos logit contributions
yip+0.003
+0.003
 XT+0.003
 Junction+0.003
advertisement+0.003
Neg logit contributions
aque-0.003
tained-0.003
.",-0.003
":""},{"-0.003
 Richards-0.003
TokenN
Token ID45
Feature activation-3.32e-02

Pos logit contributions
yip+0.545
+0.496
 XT+0.479
 Junction+0.467
advertisement+0.449
Neg logit contributions
tained-0.458
aque-0.458
.",-0.455
":""},{"-0.443
 Richards-0.428
Tokenec
Token ID721
Feature activation-5.21e-01

Pos logit contributions
aque+0.175
tained+0.173
.",+0.172
":""},{"+0.166
 Richards+0.162
Neg logit contributions
yip-0.248
-0.229
 XT-0.220
 Junction-0.216
advertisement-0.208
Tokeness
Token ID408
Feature activation-1.44e+00

Pos logit contributions
aque+2.391
tained+2.299
.",+2.271
":""},{"+2.156
oys+2.139
Neg logit contributions
yip-4.222
-3.876
 XT-3.812
 Junction-3.761
PLE-3.588
Tokenities
Token ID871
Feature activation+7.45e-02

Pos logit contributions
.",+3.621
":""},{"+3.494
upon+3.317
aque+3.253
tained+3.234
Neg logit contributions
yip-13.628
-12.521
 XT-12.232
 Junction-11.989
OUP-11.708
Token of
Token ID286
Feature activation+1.19e-01

Pos logit contributions
yip+0.558
+0.516
 XT+0.497
 Junction+0.485
advertisement+0.473
Neg logit contributions
aque-0.385
tained-0.385
.",-0.382
":""},{"-0.370
 Richards-0.357
Token Life
Token ID5155
Feature activation-9.46e-02

Pos logit contributions
yip+0.872
+0.804
 XT+0.776
 Junction+0.759
advertisement+0.737
Neg logit contributions
aque-0.630
tained-0.630
.",-0.624
":""},{"-0.606
 Richards-0.583
Token
Token ID198
Feature activation-2.29e-01

Pos logit contributions
aque+0.491
tained+0.489
.",+0.486
":""},{"+0.474
 Richards+0.457
Neg logit contributions
yip-0.710
-0.655
 XT-0.626
 Junction-0.610
PLE-0.599
Token
Token ID198
Feature activation-1.38e-01

Pos logit contributions
tained+1.252
aque+1.247
.",+1.245
":""},{"+1.210
 Richards+1.168
Neg logit contributions
yip-1.674
-1.519
 XT-1.461
 Junction-1.426
PLE-1.380
Token:
Token ID25
Feature activation-1.63e-01

Pos logit contributions
tained+1.992
.",+1.971
aque+1.945
 Richards+1.885
upon+1.866
Neg logit contributions
yip-2.915
-2.663
 XT-2.571
advertisement-2.475
 Junction-2.462
Token30
Token ID1270
Feature activation-5.40e-01

Pos logit contributions
.",+0.762
tained+0.761
aque+0.757
":""},{"+0.735
 Richards+0.714
Neg logit contributions
yip-1.316
-1.201
 XT-1.158
 Junction-1.133
PLE-1.099
Token in
Token ID287
Feature activation+6.07e-02

Pos logit contributions
.",+2.398
aque+2.365
tained+2.345
 Richards+2.238
":""},{"+2.214
Neg logit contributions
yip-4.453
-4.053
 XT-3.893
 Junction-3.783
advertisement-3.762
Token.
Token ID13
Feature activation-6.95e-02

Pos logit contributions
yip+0.510
+0.473
 XT+0.455
 Junction+0.447
PLE+0.437
Neg logit contributions
tained-0.264
aque-0.262
.",-0.262
":""},{"-0.249
 Richards-0.242
Token The
Token ID383
Feature activation-3.45e-01

Pos logit contributions
tained+0.366
.",+0.365
aque+0.362
":""},{"+0.349
 Richards+0.348
Neg logit contributions
yip-0.519
-0.472
 XT-0.454
 Junction-0.441
advertisement-0.432
Token lead
Token ID1085
Feature activation-1.43e+00

Pos logit contributions
.",+1.895
tained+1.886
aque+1.880
 Richards+1.830
":""},{"+1.799
Neg logit contributions
yip-2.494
-2.279
 XT-2.149
advertisement-2.072
PLE-2.070
Token was
Token ID373
Feature activation-4.46e-01

Pos logit contributions
tained+5.988
.",+5.809
 Richards+5.656
aque+5.581
upon+5.568
Neg logit contributions
yip-11.446
-10.582
PLE-9.705
 XT-9.700
 Junction-9.680
Token short
Token ID1790
Feature activation-2.26e-01

Pos logit contributions
tained+2.003
.",+1.988
aque+1.933
 Richards+1.893
":""},{"+1.872
Neg logit contributions
yip-3.708
-3.428
 XT-3.197
 Junction-3.168
advertisement-3.124
Token lived
Token ID5615
Feature activation+4.89e-02

Pos logit contributions
tained+1.105
.",+1.094
aque+1.061
":""},{"+1.022
 Richards+1.009
Neg logit contributions
yip-1.791
-1.645
 XT-1.579
 Junction-1.542
advertisement-1.527
Token though
Token ID996
Feature activation+4.96e-02

Pos logit contributions
yip+0.388
+0.358
 XT+0.345
 Junction+0.337
PLE+0.330
Neg logit contributions
tained-0.233
.",-0.233
aque-0.232
":""},{"-0.223
 Richards-0.217
Token,
Token ID11
Feature activation+2.42e-02

Pos logit contributions
yip+0.377
+0.347
 XT+0.334
 Junction+0.326
advertisement+0.320
Neg logit contributions
tained-0.252
.",-0.252
aque-0.251
":""},{"-0.242
 Richards-0.236
Token on
Token ID319
Feature activation-8.57e-01

Pos logit contributions
.",+2.511
tained+2.441
":""},{"+2.398
aque+2.396
upon+2.387
Neg logit contributions
yip-6.083
-5.555
 XT-5.309
 Junction-5.197
PLE-5.125
Token Facebook
Token ID3203
Feature activation-4.99e-01

Pos logit contributions
.",+5.281
 Richards+5.272
tained+5.126
aque+5.125
":""},{"+5.031
Neg logit contributions
yip-5.676
-4.861
 XT-4.363
 srf-4.320
PLE-4.309
Token Share
Token ID8734
Feature activation-7.22e-01

Pos logit contributions
.",+2.474
aque+2.469
tained+2.463
 Richards+2.380
":""},{"+2.380
Neg logit contributions
yip-3.882
-3.500
 XT-3.314
 Junction-3.241
advertisement-3.228
Token this
Token ID428
Feature activation-6.53e-01

Pos logit contributions
.",+2.522
tained+2.459
aque+2.415
":""},{"+2.367
 Richards+2.321
Neg logit contributions
yip-6.624
-6.095
 XT-5.860
 Junction-5.832
PLE-5.681
Token comment
Token ID2912
Feature activation-8.40e-01

Pos logit contributions
.",+2.702
aque+2.699
tained+2.644
":""},{"+2.615
 Richards+2.535
Neg logit contributions
yip-5.508
-5.140
 XT-4.844
 Junction-4.716
PLE-4.679
Token on
Token ID319
Feature activation-1.38e+00

Pos logit contributions
.",+2.999
upon+2.911
tained+2.888
":""},{"+2.885
aque+2.850
Neg logit contributions
yip-7.719
-7.048
 XT-6.736
 Junction-6.590
 srf-6.505
Token Twitter
Token ID3009
Feature activation-1.01e+00

Pos logit contributions
.",+4.441
":""},{"+4.420
 Richards+4.367
tro+4.187
tained+4.075
Neg logit contributions
yip-12.763
-11.179
 srf-10.989
��-10.813
PLE-10.516
Token
Token ID198
Feature activation-6.07e-01

Pos logit contributions
aque+4.591
tained+4.517
 Richards+4.468
.",+4.419
":""},{"+4.219
Neg logit contributions
yip-7.749
-7.090
advertisement-6.683
 XT-6.620
 Junction-6.532
Token
Token ID198
Feature activation-3.77e-01

Pos logit contributions
.",+2.922
tained+2.896
aque+2.851
":""},{"+2.829
upon+2.734
Neg logit contributions
yip-4.936
-4.383
 XT-4.177
 Junction-4.073
 srf-3.966
TokenNY
Token ID12805
Feature activation+1.26e-01

Pos logit contributions
aque+2.410
tained+2.402
.",+2.385
":""},{"+2.308
tro+2.293
Neg logit contributions
yip-2.354
-2.107
 XT-2.054
 Junction-1.968
advertisement-1.858
TokenT
Token ID51
Feature activation-3.80e-01

Pos logit contributions
yip+0.647
+0.577
 XT+0.548
 Junction+0.530
PLE+0.507
Neg logit contributions
tained-0.948
aque-0.946
.",-0.940
":""},{"-0.919
 Richards-0.896
Tokenf
Token ID69
Feature activation-7.30e-01

Pos logit contributions
.",+1.954
aque+1.949
tained+1.945
":""},{"+1.861
 Richards+1.819
Neg logit contributions
yip-2.234
-2.050
 XT-1.990
 Junction-1.942
PLE-1.828
Tokeninance
Token ID14149
Feature activation-5.92e-01

Pos logit contributions
tained+3.803
aque+3.799
.",+3.769
oys+3.564
":""},{"+3.521
Neg logit contributions
yip-5.492
-5.204
 XT-4.958
 Junction-4.808
PLE-4.600
Token,
Token ID11
Feature activation-4.82e-01

Pos logit contributions
tained+2.618
aque+2.606
.",+2.584
":""},{"+2.553
 Richards+2.441
Neg logit contributions
yip-4.853
-4.495
 XT-4.236
PLE-4.163
 Junction-4.153
Token wh
Token ID348
Feature activation-2.60e-01

Pos logit contributions
.",+2.320
aque+2.283
tained+2.283
":""},{"+2.216
 Richards+2.188
Neg logit contributions
yip-3.737
-3.462
 XT-3.301
 Junction-3.254
PLE-3.210
Tokenitt
Token ID715
Feature activation-8.87e-01

Pos logit contributions
tained+0.934
.",+0.933
aque+0.930
":""},{"+0.838
oys+0.832
Neg logit contributions
yip-2.364
-2.275
 XT-2.168
 Junction-2.133
PLE-2.107
Tokenles
Token ID829
Feature activation-1.38e+00

Pos logit contributions
aque+2.579
tained+2.479
.",+2.462
":""},{"+2.326
upon+2.243
Neg logit contributions
yip-8.525
-8.253
 XT-7.994
 Junction-7.908
PLE-7.564
Tokenea
Token ID18213
Feature activation-5.23e-01

Pos logit contributions
aque+4.413
.",+4.038
tained+3.962
upon+3.892
oys+3.801
Neg logit contributions
yip-12.108
-11.986
 XT-11.722
 Junction-11.294
PLE-11.218
Token-
Token ID12
Feature activation-5.39e-01

Pos logit contributions
aque+2.265
.",+2.210
tained+2.162
":""},{"+2.161
upon+2.085
Neg logit contributions
yip-4.372
-3.998
 XT-3.811
advertisement-3.736
PLE-3.709
Token375
Token ID22318
Feature activation-7.39e-01

Pos logit contributions
aque+2.430
tained+2.406
.",+2.380
":""},{"+2.294
upon+2.215
Neg logit contributions
yip-4.286
-4.039
 XT-3.864
 Junction-3.761
PLE-3.733
Token7
Token ID22
Feature activation-7.18e-01

Pos logit contributions
aque+3.770
.",+3.728
tained+3.705
upon+3.596
":""},{"+3.587
Neg logit contributions
yip-5.561
-4.836
 XT-4.738
 Junction-4.560
advertisement-4.521
Token,
Token ID11
Feature activation-4.09e-01

Pos logit contributions
aque+3.133
tained+3.004
.",+2.986
":""},{"+2.872
upon+2.845
Neg logit contributions
yip-5.907
-5.398
 XT-5.173
advertisement-5.051
PLE-5.047
Token on
Token ID319
Feature activation-8.06e-01

Pos logit contributions
.",+2.582
tained+2.503
":""},{"+2.460
aque+2.459
upon+2.446
Neg logit contributions
yip-6.377
-5.821
 XT-5.559
 Junction-5.441
PLE-5.377
Token Facebook
Token ID3203
Feature activation-5.18e-01

Pos logit contributions
 Richards+4.971
.",+4.955
aque+4.862
tained+4.827
":""},{"+4.701
Neg logit contributions
yip-5.297
-4.598
 XT-4.083
PLE-4.050
 Junction-4.008
Token Share
Token ID8734
Feature activation-8.18e-01

Pos logit contributions
aque+2.537
.",+2.533
tained+2.517
 Richards+2.440
":""},{"+2.419
Neg logit contributions
yip-4.061
-3.673
 XT-3.472
 Junction-3.410
advertisement-3.404
Token this
Token ID428
Feature activation-7.15e-01

Pos logit contributions
.",+2.718
tained+2.636
aque+2.576
":""},{"+2.513
 Richards+2.499
Neg logit contributions
yip-7.613
-7.006
 Junction-6.721
 XT-6.702
PLE-6.544
Token comment
Token ID2912
Feature activation-8.28e-01

Pos logit contributions
.",+2.845
aque+2.821
tained+2.754
":""},{"+2.751
 Richards+2.658
Neg logit contributions
yip-6.118
-5.707
 XT-5.368
 Junction-5.237
PLE-5.202
Token on
Token ID319
Feature activation-1.37e+00

Pos logit contributions
.",+2.973
tained+2.867
upon+2.861
":""},{"+2.853
aque+2.829
Neg logit contributions
yip-7.598
-6.933
 XT-6.625
 Junction-6.479
PLE-6.403
Token Twitter
Token ID3009
Feature activation-9.48e-01

Pos logit contributions
 Richards+4.329
.",+4.313
":""},{"+4.255
tro+4.036
tained+3.926
Neg logit contributions
yip-12.757
-11.179
 srf-10.965
��-10.835
PLE-10.500
Token
Token ID198
Feature activation-6.71e-01

Pos logit contributions
aque+4.388
tained+4.335
 Richards+4.294
.",+4.266
":""},{"+4.074
Neg logit contributions
yip-7.229
-6.613
advertisement-6.244
 XT-6.165
 Junction-6.095
Token
Token ID198
Feature activation-3.73e-01

Pos logit contributions
.",+3.160
tained+3.123
aque+3.065
":""},{"+3.058
upon+2.964
Neg logit contributions
yip-5.545
-4.902
 XT-4.670
 Junction-4.548
 srf-4.521
TokenNY
Token ID12805
Feature activation+1.02e-01

Pos logit contributions
aque+2.400
tained+2.389
.",+2.374
":""},{"+2.296
tro+2.281
Neg logit contributions
yip-2.322
-2.072
 XT-2.025
 Junction-1.936
advertisement-1.829
TokenT
Token ID51
Feature activation-3.62e-01

Pos logit contributions
yip+0.530
+0.472
 XT+0.448
 Junction+0.433
advertisement+0.415
Neg logit contributions
tained-0.764
aque-0.763
.",-0.758
":""},{"-0.742
 Richards-0.724
Token on
Token ID319
Feature activation-7.95e-01

Pos logit contributions
.",+2.453
tained+2.391
aque+2.350
":""},{"+2.344
upon+2.321
Neg logit contributions
yip-5.767
-5.271
 XT-5.038
 Junction-4.934
PLE-4.863
Token Facebook
Token ID3203
Feature activation-4.54e-01

Pos logit contributions
.",+5.028
 Richards+4.975
tained+4.894
aque+4.890
":""},{"+4.807
Neg logit contributions
yip-5.166
-4.418
 XT-3.983
PLE-3.915
 Junction-3.902
Token Share
Token ID8734
Feature activation-6.98e-01

Pos logit contributions
.",+2.283
aque+2.279
tained+2.275
":""},{"+2.196
 Richards+2.188
Neg logit contributions
yip-3.504
-3.171
 XT-2.998
 Junction-2.936
advertisement-2.915
Token this
Token ID428
Feature activation-6.20e-01

Pos logit contributions
.",+2.468
tained+2.407
aque+2.366
":""},{"+2.305
 Richards+2.269
Neg logit contributions
yip-6.372
-5.881
 XT-5.649
 Junction-5.628
PLE-5.480
Token comment
Token ID2912
Feature activation-8.47e-01

Pos logit contributions
aque+2.627
.",+2.626
tained+2.579
":""},{"+2.536
 Richards+2.460
Neg logit contributions
yip-5.172
-4.825
 XT-4.554
 Junction-4.436
PLE-4.393
Token on
Token ID319
Feature activation-1.35e+00

Pos logit contributions
.",+3.044
upon+2.945
tained+2.928
":""},{"+2.915
aque+2.888
Neg logit contributions
yip-7.768
-7.102
 XT-6.780
 Junction-6.637
PLE-6.555
Token Twitter
Token ID3009
Feature activation-9.70e-01

Pos logit contributions
.",+4.462
":""},{"+4.409
 Richards+4.359
tro+4.206
tained+4.101
Neg logit contributions
yip-12.421
-10.912
 srf-10.601
��-10.450
PLE-10.255
Token
Token ID198
Feature activation-5.75e-01

Pos logit contributions
aque+4.452
tained+4.366
 Richards+4.319
.",+4.284
":""},{"+4.088
Neg logit contributions
yip-7.442
-6.826
advertisement-6.440
 XT-6.377
 Junction-6.299
Token
Token ID198
Feature activation-4.54e-01

Pos logit contributions
.",+2.804
tained+2.780
aque+2.741
":""},{"+2.714
 Richards+2.622
Neg logit contributions
yip-4.632
-4.125
 XT-3.933
 Junction-3.835
PLE-3.705
TokenNY
Token ID12805
Feature activation+1.35e-01

Pos logit contributions
aque+2.857
tained+2.842
.",+2.827
":""},{"+2.731
tro+2.721
Neg logit contributions
yip-2.870
-2.575
 XT-2.507
 Junction-2.407
advertisement-2.267
TokenT
Token ID51
Feature activation-2.87e-01

Pos logit contributions
yip+0.691
+0.617
 XT+0.586
 Junction+0.567
PLE+0.543
Neg logit contributions
tained-1.020
aque-1.018
.",-1.010
":""},{"-0.988
 Richards-0.963
Token meme
Token ID25336
Feature activation+6.04e-02

Pos logit contributions
.",+0.538
aque+0.535
tained+0.535
":""},{"+0.511
 Richards+0.504
Neg logit contributions
yip-0.739
-0.683
 XT-0.646
 Junction-0.634
PLE-0.621
Token [
Token ID685
Feature activation-5.69e-01

Pos logit contributions
yip+0.419
+0.386
 XT+0.369
 Junction+0.361
advertisement+0.351
Neg logit contributions
aque-0.344
tained-0.343
.",-0.343
":""},{"-0.331
 Richards-0.322
Token10
Token ID940
Feature activation-2.14e-01

Pos logit contributions
aque+2.279
.",+2.270
tained+2.258
":""},{"+2.169
tro+2.131
Neg logit contributions
yip-4.971
-4.556
 XT-4.415
 Junction-4.359
PLE-4.110
Token:
Token ID25
Feature activation-3.63e-01

Pos logit contributions
tained+1.040
aque+1.036
.",+1.030
":""},{"+1.000
 Richards+0.958
Neg logit contributions
yip-1.690
-1.551
 XT-1.497
 Junction-1.463
PLE-1.424
Token47
Token ID2857
Feature activation+2.59e-01

Pos logit contributions
aque+1.659
tained+1.656
.",+1.634
":""},{"+1.588
 Richards+1.511
Neg logit contributions
yip-2.921
-2.708
 XT-2.619
 Junction-2.550
PLE-2.514
Token]
Token ID60
Feature activation-1.31e+00

Pos logit contributions
yip+1.382
+1.270
 XT+1.222
 Junction+1.187
advertisement+1.149
Neg logit contributions
aque-1.847
tained-1.847
.",-1.829
":""},{"-1.779
 Richards-1.730
Token <
Token ID1279
Feature activation-7.58e-01

Pos logit contributions
tained+3.148
.",+3.045
":""},{"+2.963
aque+2.931
 Richards+2.918
Neg logit contributions
yip-12.251
-11.473
advertisement-10.769
spect-10.678
 Junction-10.666
Tokenweb
Token ID12384
Feature activation-1.59e-01

Pos logit contributions
aque+4.263
tained+4.258
.",+4.111
tro+4.064
":""},{"+4.012
Neg logit contributions
yip-5.009
-4.703
 XT-4.618
 Junction-4.557
 Advertisement-4.252
Tokenwood
Token ID3822
Feature activation-1.29e-01

Pos logit contributions
aque+0.844
tained+0.830
.",+0.810
":""},{"+0.792
upon+0.770
Neg logit contributions
yip-1.186
-1.095
 XT-1.063
 Junction-1.037
advertisement-1.010
Token>
Token ID29
Feature activation-2.65e-01

Pos logit contributions
aque+0.686
tained+0.686
.",+0.671
":""},{"+0.654
upon+0.638
Neg logit contributions
yip-0.933
-0.870
 XT-0.840
 Junction-0.823
advertisement-0.796
Token lol
Token ID19462
Feature activation-2.93e-01

Pos logit contributions
.",+1.338
aque+1.331
tained+1.326
":""},{"+1.283
 Richards+1.258
Neg logit contributions
yip-1.997
-1.856
 XT-1.749
 Junction-1.732
advertisement-1.713
Tokenp
Token ID79
Feature activation-2.50e-01

Pos logit contributions
tained+3.797
aque+3.792
.",+3.624
":""},{"+3.588
tro+3.468
Neg logit contributions
yip-5.099
-4.685
 XT-4.603
 Junction-4.465
 Advertisement-4.178
Tokenone
Token ID505
Feature activation-3.39e-01

Pos logit contributions
tained+0.833
aque+0.833
.",+0.823
":""},{"+0.782
 Richards+0.740
Neg logit contributions
yip-2.326
-2.184
 XT-2.124
 Junction-2.088
advertisement-2.044
Token.
Token ID13
Feature activation-5.80e-01

Pos logit contributions
tained+1.665
.",+1.665
aque+1.639
":""},{"+1.602
upon+1.534
Neg logit contributions
yip-2.653
-2.428
 XT-2.352
 Junction-2.297
PLE-2.221
Token00
Token ID405
Feature activation-6.77e-01

Pos logit contributions
tained+1.653
aque+1.608
.",+1.553
":""},{"+1.511
tro+1.373
Neg logit contributions
yip-5.674
-5.358
 XT-5.229
 Junction-5.136
PLE-4.953
Token96
Token ID4846
Feature activation-4.17e-01

Pos logit contributions
tained+3.092
.",+3.035
aque+3.032
":""},{"+2.944
tro+2.833
Neg logit contributions
yip-5.472
-4.941
 XT-4.789
 Junction-4.721
advertisement-4.525
Token275
Token ID23195
Feature activation-1.30e+00

Pos logit contributions
tained+2.125
.",+2.102
aque+2.077
":""},{"+2.041
 Richards+1.960
Neg logit contributions
yip-3.161
-2.873
 XT-2.789
 Junction-2.740
PLE-2.627
Token.
Token ID13
Feature activation-1.24e-01

Pos logit contributions
tained+4.212
.",+4.074
aque+4.005
.—+3.913
 Richards+3.779
Neg logit contributions
yip-12.162
-10.829
 XT-9.988
 Junction-9.896
 srf-9.865
Tokeng
Token ID70
Feature activation+4.85e-02

Pos logit contributions
tained+0.569
aque+0.565
.",+0.564
":""},{"+0.542
upon+0.519
Neg logit contributions
yip-0.988
-0.924
 XT-0.892
 Junction-0.874
advertisement-0.856
Token001
Token ID8298
Feature activation-3.72e-01

Pos logit contributions
yip+0.456
+0.432
 XT+0.419
 Junction+0.412
advertisement+0.406
Neg logit contributions
tained-0.154
aque-0.153
.",-0.150
":""},{"-0.142
 Richards-0.134
Token Pre
Token ID3771
Feature activation-3.25e-01

Pos logit contributions
tained+1.988
aque+1.982
.",+1.950
":""},{"+1.907
 Richards+1.875
Neg logit contributions
yip-2.809
-2.506
 XT-2.349
 Junction-2.266
advertisement-2.265
Token-
Token ID12
Feature activation+9.73e-02

Pos logit contributions
aque+1.765
tained+1.756
.",+1.683
":""},{"+1.641
tro+1.592
Neg logit contributions
yip-2.407
-2.170
 XT-2.126
 Junction-2.035
advertisement-1.992
Tokenisher
Token ID4828
Feature activation-4.33e-01

Pos logit contributions
yip+0.525
+0.490
 XT+0.476
 Junction+0.467
advertisement+0.456
Neg logit contributions
tained-0.240
aque-0.239
.",-0.236
":""},{"-0.227
 Richards-0.216
Token"
Token ID1
Feature activation+1.44e-01

Pos logit contributions
.",+2.092
tained+2.020
aque+2.001
":""},{"+1.955
 Richards+1.882
Neg logit contributions
yip-3.441
-3.183
 XT-3.014
 Junction-2.978
advertisement-2.940
Token pic
Token ID8301
Feature activation+6.16e-03

Pos logit contributions
yip+1.051
+0.967
 XT+0.931
 Junction+0.911
advertisement+0.886
Neg logit contributions
tained-0.769
aque-0.769
.",-0.763
":""},{"-0.740
 Richards-0.715
Token.
Token ID13
Feature activation-1.04e+00

Pos logit contributions
yip+0.045
+0.041
 XT+0.040
 Junction+0.039
advertisement+0.038
Neg logit contributions
aque-0.033
tained-0.033
.",-0.033
":""},{"-0.032
 Richards-0.031
Tokentwitter
Token ID6956
Feature activation-7.34e-01

Pos logit contributions
tained+4.244
.",+4.075
":""},{"+3.905
aque+3.870
tro+3.868
Neg logit contributions
yip-9.165
-8.062
 XT-8.058
 srf-7.921
 Junction-7.792
Token.
Token ID13
Feature activation-1.30e+00

Pos logit contributions
.",+3.271
tained+3.114
":""},{"+3.063
aque+3.044
upon+2.988
Neg logit contributions
yip-6.353
-5.581
 XT-5.420
 srf-5.340
 Junction-5.313
Tokencom
Token ID785
Feature activation-8.16e-01

Pos logit contributions
tained+3.354
upon+3.262
.",+3.259
":""},{"+3.043
aque+3.021
Neg logit contributions
yip-13.199
 srf-11.962
-11.644
 XT-11.543
senal-11.503
Token/
Token ID14
Feature activation-4.48e-01

Pos logit contributions
tained+3.443
.",+3.433
":""},{"+3.409
aque+3.311
upon+3.252
Neg logit contributions
yip-7.043
-6.179
 XT-6.053
 srf-5.944
 Junction-5.842
TokenEh
Token ID43894
Feature activation-9.85e-02

Pos logit contributions
aque+2.654
tained+2.593
.",+2.561
":""},{"+2.503
tro+2.492
Neg logit contributions
yip-2.942
-2.756
 XT-2.638
 Junction-2.609
advertisement-2.473
TokenU
Token ID52
Feature activation-1.94e-01

Pos logit contributions
aque+0.602
tained+0.599
.",+0.593
":""},{"+0.578
upon+0.562
Neg logit contributions
yip-0.643
-0.591
 XT-0.567
 Junction-0.556
advertisement-0.533
Tokeng
Token ID70
Feature activation-3.78e-02

Pos logit contributions
aque+1.130
tained+1.122
.",+1.109
":""},{"+1.080
upon+1.055
Neg logit contributions
yip-1.317
-1.225
 XT-1.173
 Junction-1.155
advertisement-1.106
TokenOn
Token ID2202
Feature activation-8.67e-01

Pos logit contributions
.",+2.470
aque+2.441
tained+2.409
upon+2.404
":""},{"+2.351
Neg logit contributions
yip-4.828
-4.458
 XT-4.368
 Junction-4.256
advertisement-4.023
Token September
Token ID2693
Feature activation-5.63e-01

Pos logit contributions
.",+3.704
tained+3.665
":""},{"+3.655
aque+3.508
 Richards+3.385
Neg logit contributions
yip-7.108
-6.674
 XT-6.180
advertisement-6.112
PLE-6.087
Token 20
Token ID1160
Feature activation-6.93e-01

Pos logit contributions
tained+3.312
.",+3.290
aque+3.271
":""},{"+3.164
 Richards+3.149
Neg logit contributions
yip-3.921
-3.407
 XT-3.236
PLE-3.136
 Junction-3.101
Token 2013
Token ID2211
Feature activation-5.53e-01

Pos logit contributions
.",+3.803
tained+3.774
":""},{"+3.724
aque+3.701
 Richards+3.581
Neg logit contributions
yip-5.175
-4.464
 XT-4.097
 Junction-4.051
PLE-4.039
Token 07
Token ID8753
Feature activation-6.17e-01

Pos logit contributions
.",+2.851
tained+2.820
aque+2.782
":""},{"+2.740
 Richards+2.682
Neg logit contributions
yip-4.217
-3.775
advertisement-3.487
 XT-3.473
PLE-3.445
Token:
Token ID25
Feature activation-1.29e+00

Pos logit contributions
tained+2.642
.",+2.603
aque+2.576
":""},{"+2.572
upon+2.472
Neg logit contributions
yip-5.338
-4.789
 XT-4.602
 Junction-4.486
PLE-4.385
Token44
Token ID2598
Feature activation+9.05e-03

Pos logit contributions
.",+4.226
":""},{"+3.958
tained+3.948
 Richards+3.898
upon+3.889
Neg logit contributions
yip-11.613
 srf-10.289
-10.258
 XT-10.064
 Junction-9.904
Token Se
Token ID1001
Feature activation+7.86e-02

Pos logit contributions
yip+0.066
+0.061
 XT+0.058
 Junction+0.057
advertisement+0.056
Neg logit contributions
tained-0.049
aque-0.048
.",-0.048
":""},{"-0.046
 Richards-0.045
Tokeneker
Token ID28233
Feature activation-3.67e-01

Pos logit contributions
yip+0.635
+0.589
 XT+0.570
 Junction+0.559
advertisement+0.546
Neg logit contributions
tained-0.358
aque-0.358
.",-0.355
":""},{"-0.342
 Richards-0.329
Token wrote
Token ID2630
Feature activation-9.90e-01

Pos logit contributions
tained+1.154
aque+1.141
.",+1.129
":""},{"+1.068
 Richards+1.013
Neg logit contributions
yip-3.513
-3.300
 XT-3.185
 Junction-3.139
advertisement-3.073
Token:
Token ID25
Feature activation-8.01e-01

Pos logit contributions
":""},{"+3.732
tained+3.728
.",+3.728
upon+3.607
aque+3.548
Neg logit contributions
yip-9.084
-8.008
 srf-7.720
 XT-7.682
 Junction-7.448
Token quiet
Token ID5897
Feature activation-3.22e-03

Pos logit contributions
yip+1.050
+0.968
 XT+0.933
 Junction+0.911
advertisement+0.888
Neg logit contributions
tained-0.733
aque-0.732
.",-0.727
":""},{"-0.703
 Richards-0.680
Token colonial
Token ID17091
Feature activation-8.90e-03

Pos logit contributions
tained+0.017
aque+0.017
.",+0.017
":""},{"+0.016
 Richards+0.016
Neg logit contributions
yip-0.024
-0.022
 XT-0.021
 Junction-0.020
PLE-0.020
Token town
Token ID3240
Feature activation-1.25e-02

Pos logit contributions
tained+0.047
.",+0.047
aque+0.047
":""},{"+0.045
 Richards+0.044
Neg logit contributions
yip-0.066
-0.061
 XT-0.058
 Junction-0.056
PLE-0.055
Token that
Token ID326
Feature activation+4.42e-01

Pos logit contributions
aque+0.062
tained+0.062
.",+0.062
":""},{"+0.059
 Richards+0.058
Neg logit contributions
yip-0.096
-0.089
 XT-0.086
 Junction-0.083
advertisement-0.082
Token attracted
Token ID12725
Feature activation+2.74e-01

Pos logit contributions
yip+3.346
+3.105
 XT+3.030
 Junction+2.977
advertisement+2.864
Neg logit contributions
aque-2.196
tained-2.181
.",-2.168
":""},{"-2.118
tro-1.990
Token them
Token ID606
Feature activation+1.09e+00

Pos logit contributions
yip+1.997
+1.840
 XT+1.779
 Junction+1.740
advertisement+1.688
Neg logit contributions
tained-1.469
aque-1.468
.",-1.455
":""},{"-1.412
 Richards-1.359
Token.
Token ID13
Feature activation+1.24e-01

Pos logit contributions
yip+6.837
+6.372
 XT+6.132
 Junction+6.031
PLE+5.824
Neg logit contributions
tained-6.300
aque-6.231
":""},{"-6.190
.",-6.150
tro-5.788
Token
Token ID198
Feature activation+3.27e-02

Pos logit contributions
yip+0.855
+0.787
 XT+0.759
 Junction+0.743
PLE+0.717
Neg logit contributions
tained-0.709
aque-0.708
.",-0.704
":""},{"-0.684
 Richards-0.658
Token
Token ID198
Feature activation+6.55e-03

Pos logit contributions
yip+0.215
+0.201
 XT+0.194
 Junction+0.189
advertisement+0.184
Neg logit contributions
aque-0.195
tained-0.194
.",-0.192
":""},{"-0.186
 Richards-0.181
Token"
Token ID1
Feature activation-2.41e-01

Pos logit contributions
yip+0.046
+0.042
 XT+0.040
 Junction+0.039
advertisement+0.037
Neg logit contributions
aque-0.037
tained-0.037
.",-0.037
":""},{"-0.036
 Richards-0.035
TokenIt
Token ID1026
Feature activation+3.42e-01

Pos logit contributions
.",+1.454
aque+1.446
tained+1.434
":""},{"+1.409
upon+1.375
Neg logit contributions
yip-1.619
-1.419
 XT-1.391
 Junction-1.343
advertisement-1.247
Tokenи
Token ID18849
Feature activation+3.26e-01

Pos logit contributions
.",+0.456
tained+0.455
aque+0.449
":""},{"+0.440
 Richards+0.426
Neg logit contributions
yip-0.451
-0.404
 XT-0.387
 Junction-0.378
advertisement-0.366
Token
Token ID141
Feature activation+2.18e-01

Pos logit contributions
yip+2.033
+1.850
 XT+1.771
 Junction+1.724
PLE+1.663
Neg logit contributions
aque-2.093
tained-2.093
.",-2.074
":""},{"-2.024
 Richards-1.966
Token
Token ID227
Feature activation+1.91e-01

Pos logit contributions
yip+1.653
+1.530
 XT+1.476
 Junction+1.445
advertisement+1.406
Neg logit contributions
aque-1.100
tained-1.100
.",-1.089
":""},{"-1.056
 Richards-1.017
Token
Token ID220
Feature activation+1.29e-02

Pos logit contributions
yip+1.415
+1.304
 XT+1.257
 Junction+1.230
advertisement+1.197
Neg logit contributions
tained-1.008
aque-1.007
.",-1.001
":""},{"-0.969
 Richards-0.935
Tokenс
Token ID21727
Feature activation+4.11e-01

Pos logit contributions
yip+0.082
+0.073
 XT+0.071
 Junction+0.069
advertisement+0.067
Neg logit contributions
.",-0.083
tained-0.083
aque-0.082
":""},{"-0.080
 Richards-0.078
Tokenл
Token ID30143
Feature activation+1.10e+00

Pos logit contributions
yip+2.529
+2.314
 XT+2.204
 Junction+2.141
PLE+2.068
Neg logit contributions
aque-2.649
tained-2.642
.",-2.617
":""},{"-2.557
 Richards-2.487
Tokenу
Token ID35072
Feature activation+1.88e-01

Pos logit contributions
yip+5.513
+5.275
 XT+4.878
 Junction+4.742
advertisement+4.587
Neg logit contributions
aque-7.707
tained-7.703
.",-7.579
":""},{"-7.508
 Richards-7.365
Token
Token ID141
Feature activation-2.13e-01

Pos logit contributions
yip+1.064
+0.956
 XT+0.911
 Junction+0.883
advertisement+0.849
Neg logit contributions
tained-1.319
aque-1.318
.",-1.310
":""},{"-1.280
 Richards-1.246
Token
Token ID229
Feature activation+3.45e-03

Pos logit contributions
aque+1.100
tained+1.094
.",+1.088
":""},{"+1.039
 Richards+1.005
Neg logit contributions
yip-1.617
-1.477
 XT-1.435
 Junction-1.396
PLE-1.362
Tokenа
Token ID16142
Feature activation+5.12e-01

Pos logit contributions
yip+0.022
+0.019
 XT+0.019
 Junction+0.018
advertisement+0.018
Neg logit contributions
tained-0.022
aque-0.022
.",-0.022
":""},{"-0.022
 Richards-0.021
Tokenе
Token ID16843
Feature activation+1.29e-02

Pos logit contributions
yip+3.015
+2.749
 XT+2.639
 Junction+2.572
PLE+2.463
Neg logit contributions
tained-3.391
aque-3.384
.",-3.321
":""},{"-3.266
 Richards-3.176
Token quick
Token ID2068
Feature activation+2.20e-01

Pos logit contributions
yip+1.371
+1.266
 XT+1.208
 Junction+1.182
advertisement+1.160
Neg logit contributions
tained-1.015
aque-1.012
.",-1.006
":""},{"-0.971
 Richards-0.946
Token sweep
Token ID16085
Feature activation+3.56e-01

Pos logit contributions
yip+1.552
+1.432
 XT+1.360
 Junction+1.331
advertisement+1.305
Neg logit contributions
tained-1.236
aque-1.232
.",-1.227
":""},{"-1.186
 Richards-1.157
Token across
Token ID1973
Feature activation+3.99e-01

Pos logit contributions
yip+2.700
+2.503
 XT+2.409
 Junction+2.358
advertisement+2.299
Neg logit contributions
tained-1.805
aque-1.803
.",-1.793
":""},{"-1.731
 Richards-1.668
Token the
Token ID262
Feature activation+2.66e-01

Pos logit contributions
yip+3.200
+2.968
 XT+2.893
 Junction+2.826
PLE+2.738
Neg logit contributions
aque-1.839
tained-1.827
.",-1.805
":""},{"-1.760
 Richards-1.672
Token bands
Token ID11760
Feature activation+3.19e-01

Pos logit contributions
yip+1.973
+1.822
 XT+1.765
 Junction+1.725
advertisement+1.670
Neg logit contributions
aque-1.387
tained-1.385
.",-1.371
":""},{"-1.334
 Richards-1.282
Token to
Token ID284
Feature activation+9.25e-01

Pos logit contributions
yip+2.403
+2.223
 XT+2.143
 Junction+2.098
advertisement+2.043
Neg logit contributions
tained-1.631
aque-1.630
.",-1.617
":""},{"-1.563
 Richards-1.508
Token find
Token ID1064
Feature activation+6.92e-01

Pos logit contributions
yip+5.788
 XT+5.227
+5.215
 Junction+5.060
PLE+4.817
Neg logit contributions
aque-5.663
tained-5.630
":""},{"-5.456
.",-5.434
tro-5.165
Token out
Token ID503
Feature activation+3.30e-01

Pos logit contributions
yip+5.164
+4.776
 XT+4.728
 Junction+4.587
PLE+4.392
Neg logit contributions
aque-3.487
tained-3.449
.",-3.360
":""},{"-3.323
tro-3.139
Token where
Token ID810
Feature activation+2.24e-01

Pos logit contributions
yip+2.513
+2.318
 XT+2.260
 Junction+2.208
PLE+2.136
Neg logit contributions
aque-1.665
tained-1.656
.",-1.627
":""},{"-1.593
 Richards-1.521
Token the
Token ID262
Feature activation+1.27e-01

Pos logit contributions
yip+1.729
+1.600
 XT+1.545
 Junction+1.513
advertisement+1.475
Neg logit contributions
tained-1.111
aque-1.110
.",-1.102
":""},{"-1.065
 Richards-1.025
Token R
Token ID371
Feature activation+4.22e-01

Pos logit contributions
yip+0.816
+0.744
 XT+0.707
 Junction+0.691
advertisement+0.675
Neg logit contributions
tained-0.786
aque-0.783
.",-0.783
":""},{"-0.759
 Richards-0.739
Token in
Token ID287
Feature activation-4.21e-02

Pos logit contributions
yip+0.752
+0.695
 XT+0.671
 Junction+0.656
advertisement+0.638
Neg logit contributions
aque-0.472
tained-0.472
.",-0.469
":""},{"-0.451
 Richards-0.435
Token Little
Token ID7703
Feature activation-3.41e-01

Pos logit contributions
tained+0.217
aque+0.216
.",+0.216
":""},{"+0.209
 Richards+0.203
Neg logit contributions
yip-0.317
-0.292
 XT-0.280
 Junction-0.272
PLE-0.267
Token Rock
Token ID4631
Feature activation-2.62e-02

Pos logit contributions
aque+1.640
.",+1.624
tained+1.624
 Richards+1.601
":""},{"+1.587
Neg logit contributions
yip-2.633
-2.415
 XT-2.313
advertisement-2.237
PLE-2.196
Token with
Token ID351
Feature activation-1.88e-01

Pos logit contributions
aque+0.125
.",+0.125
tained+0.125
":""},{"+0.119
 Richards+0.116
Neg logit contributions
yip-0.208
-0.192
 XT-0.185
 Junction-0.180
PLE-0.176
Token meth
Token ID11248
Feature activation+3.56e-02

Pos logit contributions
tained+0.942
.",+0.933
aque+0.923
":""},{"+0.900
 Richards+0.872
Neg logit contributions
yip-1.446
-1.326
 XT-1.275
 Junction-1.244
PLE-1.231
Token on
Token ID319
Feature activation+9.11e-01

Pos logit contributions
yip+0.269
+0.247
 XT+0.238
 Junction+0.233
PLE+0.227
Neg logit contributions
aque-0.185
tained-0.184
.",-0.183
":""},{"-0.176
 Richards-0.169
Token the
Token ID262
Feature activation+8.74e-01

Pos logit contributions
yip+6.494
+6.117
 XT+5.928
 Junction+5.818
advertisement+5.617
Neg logit contributions
aque-4.559
tained-4.479
.",-4.428
":""},{"-4.410
upon-4.221
Token menu
Token ID6859
Feature activation+6.04e-01

Pos logit contributions
yip+4.325
+3.946
 XT+3.795
 Junction+3.708
advertisement+3.535
Neg logit contributions
aque-6.397
tained-6.284
.",-6.256
":""},{"-6.179
 Richards-6.007
Token.
Token ID13
Feature activation-1.57e-01

Pos logit contributions
yip+4.262
+3.983
 XT+3.827
 Junction+3.743
advertisement+3.675
Neg logit contributions
aque-3.219
tained-3.215
.",-3.154
":""},{"-3.109
 Richards-3.007
Token Heat
Token ID12308
Feature activation-2.36e-01

Pos logit contributions
tained+0.777
.",+0.775
aque+0.765
":""},{"+0.754
 Richards+0.745
Neg logit contributions
yip-1.230
-1.099
 XT-1.051
 Junction-1.021
advertisement-1.006
Token and
Token ID290
Feature activation+2.15e-01

Pos logit contributions
aque+0.862
.",+0.846
tained+0.841
":""},{"+0.807
 Richards+0.774
Neg logit contributions
yip-2.122
-1.981
 XT-1.910
 Junction-1.867
PLE-1.851
Token2015
Token ID4626
Feature activation+2.22e-01

Pos logit contributions
aque+1.933
tained+1.882
.",+1.872
":""},{"+1.790
upon+1.786
Neg logit contributions
yip-2.648
-2.347
 XT-2.300
 Junction-2.231
advertisement-2.141
Token
Token ID1906
Feature activation+2.82e-01

Pos logit contributions
yip+1.605
+1.479
 XT+1.427
 Junction+1.394
advertisement+1.353
Neg logit contributions
aque-1.204
tained-1.204
.",-1.194
":""},{"-1.158
 Richards-1.117
Token2017
Token ID5539
Feature activation+5.24e-02

Pos logit contributions
yip+1.912
+1.769
 XT+1.701
 Junction+1.662
advertisement+1.614
Neg logit contributions
aque-1.633
tained-1.631
.",-1.621
":""},{"-1.573
 Richards-1.518
Token)
Token ID8
Feature activation+1.30e-01

Pos logit contributions
yip+0.418
+0.385
 XT+0.370
 Junction+0.363
advertisement+0.357
Neg logit contributions
tained-0.250
aque-0.249
.",-0.245
":""},{"-0.237
 Richards-0.232
Token [
Token ID685
Feature activation+5.54e-01

Pos logit contributions
yip+0.842
+0.763
 XT+0.728
 Junction+0.710
advertisement+0.688
Neg logit contributions
tained-0.815
aque-0.814
.",-0.807
":""},{"-0.787
 Richards-0.767
Token edit
Token ID4370
Feature activation+9.87e-01

Pos logit contributions
yip+5.182
+5.003
 XT+4.843
spect+4.807
 Junction+4.792
Neg logit contributions
aque-1.544
.",-1.528
tained-1.513
":""},{"-1.452
 Richards-1.354
Token ]
Token ID2361
Feature activation-1.60e-02

Pos logit contributions
yip+3.706
+3.651
 XT+3.545
 Junction+3.474
advertisement+3.452
Neg logit contributions
aque-7.795
tained-7.724
.",-7.700
":""},{"-7.440
oys-7.316
Token
Token ID198
Feature activation+1.02e-01

Pos logit contributions
tained+0.087
aque+0.087
.",+0.087
":""},{"+0.085
 Richards+0.082
Neg logit contributions
yip-0.117
-0.105
 XT-0.102
 Junction-0.099
advertisement-0.097
Token
Token ID198
Feature activation-2.63e-02

Pos logit contributions
yip+0.673
+0.627
 XT+0.604
 Junction+0.590
advertisement+0.575
Neg logit contributions
aque-0.603
tained-0.600
.",-0.593
":""},{"-0.577
 Richards-0.558
Token2015
Token ID4626
Feature activation+1.19e-01

Pos logit contributions
aque+0.155
tained+0.155
.",+0.154
":""},{"+0.150
upon+0.145
Neg logit contributions
yip-0.182
-0.163
 XT-0.158
 Junction-0.154
advertisement-0.146
Token
Token ID1906
Feature activation+4.49e-01

Pos logit contributions
yip+0.841
+0.769
 XT+0.737
 Junction+0.720
advertisement+0.700
Neg logit contributions
tained-0.672
aque-0.671
.",-0.664
":""},{"-0.647
 Richards-0.627
Token who
Token ID508
Feature activation-8.75e-02

Pos logit contributions
yip+1.401
+1.297
 XT+1.256
 Junction+1.231
advertisement+1.198
Neg logit contributions
tained-0.893
aque-0.890
.",-0.880
":""},{"-0.852
 Richards-0.819
Token sit
Token ID1650
Feature activation+3.06e-03

Pos logit contributions
aque+0.416
.",+0.414
tained+0.413
":""},{"+0.398
 Richards+0.383
Neg logit contributions
yip-0.695
-0.644
 XT-0.620
 Junction-0.604
PLE-0.591
Token younger
Token ID7099
Feature activation-5.42e-01

Pos logit contributions
yip+0.024
+0.022
 XT+0.022
 Junction+0.021
advertisement+0.021
Neg logit contributions
aque-0.015
tained-0.015
.",-0.015
":""},{"-0.014
 Richards-0.014
Token ones
Token ID3392
Feature activation-2.96e-01

Pos logit contributions
aque+2.390
.",+2.384
tained+2.308
":""},{"+2.248
upon+2.218
Neg logit contributions
yip-4.492
-4.141
 XT-3.917
 Junction-3.805
PLE-3.757
Token might
Token ID1244
Feature activation+8.78e-02

Pos logit contributions
aque+1.384
.",+1.383
tained+1.352
":""},{"+1.303
upon+1.285
Neg logit contributions
yip-2.395
-2.210
 XT-2.106
 Junction-2.050
PLE-2.009
Token no
Token ID645
Feature activation+3.20e-01

Pos logit contributions
yip+0.683
+0.633
 XT+0.612
 Junction+0.600
advertisement+0.584
Neg logit contributions
tained-0.428
aque-0.428
.",-0.423
":""},{"-0.409
 Richards-0.394
Token longer
Token ID2392
Feature activation-4.45e-01

Pos logit contributions
yip+2.384
+2.205
 XT+2.127
 Junction+2.105
PLE+2.033
Neg logit contributions
tained-1.628
aque-1.615
.",-1.614
":""},{"-1.568
 Richards-1.503
Token get
Token ID651
Feature activation-2.87e-01

Pos logit contributions
.",+2.016
aque+1.966
tained+1.954
":""},{"+1.912
upon+1.833
Neg logit contributions
yip-3.645
-3.396
 XT-3.248
 Junction-3.142
PLE-3.102
Token home
Token ID1363
Feature activation+1.08e-01

Pos logit contributions
tained+1.352
aque+1.350
.",+1.342
":""},{"+1.307
 Richards+1.272
Neg logit contributions
yip-2.306
-2.112
 XT-2.001
 Junction-1.969
PLE-1.939
Token first
Token ID717
Feature activation-3.12e-01

Pos logit contributions
yip+0.860
+0.798
 XT+0.771
 Junction+0.756
advertisement+0.737
Neg logit contributions
tained-0.511
aque-0.511
.",-0.506
":""},{"-0.488
 Richards-0.469
Token.
Token ID13
Feature activation-1.91e-01

Pos logit contributions
.",+1.493
aque+1.487
tained+1.483
":""},{"+1.413
oys+1.361
Neg logit contributions
yip-2.480
-2.295
 XT-2.204
 Junction-2.153
advertisement-2.103
Token rate
Token ID2494
Feature activation+1.54e-01

Pos logit contributions
yip+1.544
+1.426
 XT+1.375
 Junction+1.344
PLE+1.308
Neg logit contributions
aque-1.087
tained-1.085
.",-1.076
":""},{"-1.044
 Richards-1.007
Token of
Token ID286
Feature activation+2.17e-02

Pos logit contributions
yip+1.097
+1.007
 XT+0.969
 Junction+0.947
advertisement+0.922
Neg logit contributions
tained-0.855
aque-0.855
.",-0.848
":""},{"-0.819
 Richards-0.795
Token such
Token ID884
Feature activation+5.25e-02

Pos logit contributions
yip+0.162
+0.149
 XT+0.143
 Junction+0.140
advertisement+0.137
Neg logit contributions
aque-0.113
tained-0.113
.",-0.112
":""},{"-0.109
 Richards-0.105
Token digestive
Token ID34616
Feature activation+9.75e-02

Pos logit contributions
yip+0.399
+0.367
 XT+0.353
 Junction+0.345
PLE+0.337
Neg logit contributions
aque-0.267
tained-0.267
.",-0.266
":""},{"-0.257
 Richards-0.249
Token cancers
Token ID25217
Feature activation+2.47e-01

Pos logit contributions
yip+0.751
+0.687
 XT+0.660
 Junction+0.642
advertisement+0.633
Neg logit contributions
aque-0.494
tained-0.492
.",-0.490
":""},{"-0.475
 Richards-0.457
Token are
Token ID389
Feature activation+2.25e-01

Pos logit contributions
yip+1.889
+1.748
 XT+1.690
 Junction+1.654
advertisement+1.611
Neg logit contributions
tained-1.233
aque-1.231
.",-1.219
":""},{"-1.180
 Richards-1.137
Token due
Token ID2233
Feature activation+5.50e-02

Pos logit contributions
yip+1.750
+1.622
 XT+1.561
 Junction+1.530
PLE+1.489
Neg logit contributions
tained-1.103
aque-1.098
.",-1.094
":""},{"-1.058
 Richards-1.019
Token to
Token ID284
Feature activation-7.71e-02

Pos logit contributions
yip+0.413
+0.382
 XT+0.368
 Junction+0.360
advertisement+0.350
Neg logit contributions
aque-0.283
tained-0.283
.",-0.280
":""},{"-0.272
 Richards-0.262
Token the
Token ID262
Feature activation-2.16e-02

Pos logit contributions
aque+0.381
tained+0.380
.",+0.377
":""},{"+0.364
 Richards+0.351
Neg logit contributions
yip-0.596
-0.553
 XT-0.530
 Junction-0.519
advertisement-0.508
Token ingestion
Token ID38382
Feature activation+3.73e-01

Pos logit contributions
aque+0.106
tained+0.106
.",+0.105
":""},{"+0.101
 Richards+0.098
Neg logit contributions
yip-0.168
-0.156
 XT-0.150
 Junction-0.147
advertisement-0.143
Token of
Token ID286
Feature activation+8.52e-02

Pos logit contributions
yip+2.522
+2.330
 XT+2.250
 Junction+2.188
advertisement+2.126
Neg logit contributions
aque-2.147
tained-2.138
.",-2.115
":""},{"-2.061
 Richards-2.011
Token plan
Token ID1410
Feature activation-2.02e-01

Pos logit contributions
yip+3.926
+3.630
 XT+3.536
 Junction+3.445
PLE+3.322
Neg logit contributions
aque-2.931
tained-2.919
.",-2.891
":""},{"-2.794
upon-2.671
Token.
Token ID13
Feature activation+3.74e-02

Pos logit contributions
.",+0.948
tained+0.947
aque+0.937
":""},{"+0.889
 Richards+0.869
Neg logit contributions
yip-1.621
-1.488
 XT-1.424
 Junction-1.398
PLE-1.377
Token The
Token ID383
Feature activation+6.59e-03

Pos logit contributions
yip+0.283
+0.258
 XT+0.247
 Junction+0.242
advertisement+0.237
Neg logit contributions
tained-0.193
aque-0.192
.",-0.191
":""},{"-0.187
 Richards-0.183
Token reform
Token ID4975
Feature activation+2.09e-01

Pos logit contributions
yip+0.048
+0.044
 XT+0.042
 Junction+0.041
advertisement+0.040
Neg logit contributions
tained-0.036
aque-0.036
.",-0.036
":""},{"-0.035
 Richards-0.034
Token could
Token ID714
Feature activation+2.54e-01

Pos logit contributions
yip+1.663
+1.552
 XT+1.496
 Junction+1.468
advertisement+1.431
Neg logit contributions
tained-0.976
aque-0.973
.",-0.965
":""},{"-0.933
 Richards-0.897
Token therefore
Token ID4361
Feature activation+2.63e-01

Pos logit contributions
yip+1.996
+1.846
 XT+1.788
 Junction+1.752
advertisement+1.702
Neg logit contributions
aque-1.223
tained-1.222
.",-1.205
":""},{"-1.166
 Richards-1.120
Token boost
Token ID5750
Feature activation-3.75e-02

Pos logit contributions
yip+2.020
+1.864
 XT+1.804
 Junction+1.768
advertisement+1.714
Neg logit contributions
aque-1.317
tained-1.316
.",-1.298
":""},{"-1.257
 Richards-1.210
Token GDP
Token ID12396
Feature activation-1.04e-01

Pos logit contributions
tained+0.228
aque+0.228
.",+0.227
":""},{"+0.221
 Richards+0.216
Neg logit contributions
yip-0.246
-0.225
 XT-0.215
 Junction-0.210
PLE-0.204
Token,
Token ID11
Feature activation-3.59e-01

Pos logit contributions
.",+0.494
tained+0.489
aque+0.486
":""},{"+0.476
 Richards+0.456
Neg logit contributions
yip-0.818
-0.757
 XT-0.726
 Junction-0.717
PLE-0.698
Token at
Token ID379
Feature activation-6.58e-01

Pos logit contributions
tained+1.549
aque+1.544
.",+1.540
":""},{"+1.539
 Richards+1.460
Neg logit contributions
yip-2.931
-2.716
 XT-2.597
 Junction-2.579
PLE-2.535
Token the
Token ID262
Feature activation-1.60e-01

Pos logit contributions
tained+2.358
aque+2.330
.",+2.328
":""},{"+2.243
 Richards+2.146
Neg logit contributions
yip-5.943
-5.551
 XT-5.307
PLE-5.218
 Junction-5.206
Token the
Token ID262
Feature activation+5.27e-03

Pos logit contributions
yip+0.935
+0.858
 XT+0.817
 Junction+0.800
advertisement+0.777
Neg logit contributions
tained-0.803
aque-0.798
.",-0.796
":""},{"-0.771
 Richards-0.751
Token internet
Token ID5230
Feature activation+3.69e-01

Pos logit contributions
yip+0.036
+0.033
 XT+0.031
 Junction+0.030
advertisement+0.030
Neg logit contributions
tained-0.031
.",-0.031
aque-0.031
":""},{"-0.030
 Richards-0.029
Token addresses
Token ID9405
Feature activation+1.65e-01

Pos logit contributions
yip+2.690
+2.479
 XT+2.390
 Junction+2.337
advertisement+2.271
Neg logit contributions
tained-1.979
aque-1.978
.",-1.961
":""},{"-1.902
 Richards-1.836
Token to
Token ID284
Feature activation+3.44e-01

Pos logit contributions
yip+1.290
+1.195
 XT+1.153
 Junction+1.131
advertisement+1.100
Neg logit contributions
tained-0.806
aque-0.805
.",-0.800
":""},{"-0.771
 Richards-0.743
Token be
Token ID307
Feature activation+2.93e-01

Pos logit contributions
yip+2.509
+2.318
 XT+2.239
 Junction+2.188
PLE+2.120
Neg logit contributions
aque-1.843
tained-1.835
.",-1.815
":""},{"-1.766
 Richards-1.698
Token blocked
Token ID10226
Feature activation+2.64e-01

Pos logit contributions
yip+2.219
+2.055
 XT+1.988
 Junction+1.943
advertisement+1.889
Neg logit contributions
aque-1.482
tained-1.480
.",-1.466
":""},{"-1.422
 Richards-1.368
Token,
Token ID11
Feature activation-1.99e-02

Pos logit contributions
yip+1.982
+1.832
 XT+1.770
 Junction+1.731
advertisement+1.683
Neg logit contributions
tained-1.354
aque-1.354
.",-1.340
":""},{"-1.300
 Richards-1.253
Token and
Token ID290
Feature activation+6.93e-02

Pos logit contributions
aque+0.101
tained+0.101
.",+0.100
":""},{"+0.097
 Richards+0.094
Neg logit contributions
yip-0.152
-0.140
 XT-0.134
 Junction-0.131
PLE-0.127
Token a
Token ID257
Feature activation-3.19e-02

Pos logit contributions
yip+0.556
+0.517
 XT+0.497
 Junction+0.487
advertisement+0.476
Neg logit contributions
aque-0.323
tained-0.322
.",-0.319
":""},{"-0.308
 Richards-0.298
Token lack
Token ID3092
Feature activation-4.14e-02

Pos logit contributions
tained+0.150
aque+0.150
.",+0.148
":""},{"+0.143
 Richards+0.138
Neg logit contributions
yip-0.255
-0.237
 XT-0.227
 Junction-0.222
advertisement-0.218
Token of
Token ID286
Feature activation-1.55e-01

Pos logit contributions
aque+0.225
tained+0.225
.",+0.223
":""},{"+0.217
 Richards+0.209
Neg logit contributions
yip-0.299
-0.275
 XT-0.265
 Junction-0.259
advertisement-0.252
Token lots
Token ID6041
Feature activation+6.70e-02

Pos logit contributions
tained+0.172
.",+0.171
aque+0.171
":""},{"+0.164
 Richards+0.159
Neg logit contributions
yip-0.289
-0.266
 XT-0.257
 Junction-0.252
PLE-0.245
Token of
Token ID286
Feature activation+2.30e-01

Pos logit contributions
yip+0.484
+0.445
 XT+0.429
 Junction+0.420
advertisement+0.407
Neg logit contributions
tained-0.364
aque-0.364
.",-0.361
":""},{"-0.350
 Richards-0.338
Token interesting
Token ID3499
Feature activation-1.01e-02

Pos logit contributions
yip+1.741
+1.613
 XT+1.560
 Junction+1.526
advertisement+1.485
Neg logit contributions
aque-1.170
tained-1.169
.",-1.156
":""},{"-1.122
 Richards-1.077
Token det
Token ID1062
Feature activation-2.56e-02

Pos logit contributions
tained+0.052
.",+0.052
aque+0.052
":""},{"+0.050
 Richards+0.049
Neg logit contributions
yip-0.076
-0.070
 XT-0.067
 Junction-0.065
PLE-0.064
Tokenours
Token ID4662
Feature activation+2.42e-02

Pos logit contributions
aque+0.172
.",+0.171
tained+0.171
":""},{"+0.165
oys+0.164
Neg logit contributions
yip-0.154
-0.139
 XT-0.131
 Junction-0.128
advertisement-0.123
Token to
Token ID284
Feature activation+8.07e-01

Pos logit contributions
yip+0.188
+0.174
 XT+0.167
 Junction+0.164
PLE+0.160
Neg logit contributions
tained-0.119
aque-0.118
.",-0.118
":""},{"-0.113
 Richards-0.110
Token explore
Token ID7301
Feature activation+3.73e-01

Pos logit contributions
yip+5.621
+5.249
 XT+5.152
 Junction+5.077
PLE+4.830
Neg logit contributions
aque-4.332
tained-4.323
.",-4.183
":""},{"-4.131
upon-3.896
Token
Token ID851
Feature activation+3.98e-01

Pos logit contributions
yip+2.709
+2.515
 XT+2.433
 Junction+2.382
advertisement+2.308
Neg logit contributions
aque-1.988
tained-1.979
.",-1.955
":""},{"-1.913
 Richards-1.832
Token much
Token ID881
Feature activation-6.67e-02

Pos logit contributions
yip+3.017
+2.798
 XT+2.708
 Junction+2.660
advertisement+2.579
Neg logit contributions
aque-2.004
tained-2.003
.",-1.979
":""},{"-1.928
 Richards-1.833
Token like
Token ID588
Feature activation+8.03e-02

Pos logit contributions
aque+0.327
tained+0.326
.",+0.325
":""},{"+0.311
 Richards+0.303
Neg logit contributions
yip-0.521
-0.478
 XT-0.462
 Junction-0.452
PLE-0.439
Token Tal
Token ID7193
Feature activation+3.66e-01

Pos logit contributions
yip+0.543
+0.495
 XT+0.474
 Junction+0.461
advertisement+0.449
Neg logit contributions
tained-0.475
aque-0.474
.",-0.472
":""},{"-0.458
 Richards-0.446
Tokenstrength
Token ID41402
Feature activation-1.90e-01

Pos logit contributions
yip+5.120
+4.762
 XT+4.613
 Junction+4.508
advertisement+4.462
Neg logit contributions
tained-2.885
.",-2.860
aque-2.854
":""},{"-2.746
 Richards-2.672
Token,
Token ID11
Feature activation+3.13e-01

Pos logit contributions
aque+0.924
.",+0.922
tained+0.918
":""},{"+0.896
 Richards+0.874
Neg logit contributions
yip-1.506
-1.384
 XT-1.279
 Junction-1.276
advertisement-1.273
Token military
Token ID2422
Feature activation+1.48e-01

Pos logit contributions
yip+2.443
+2.266
 XT+2.199
 Junction+2.149
advertisement+2.091
Neg logit contributions
tained-1.509
aque-1.507
.",-1.492
":""},{"-1.442
 Richards-1.384
Token-
Token ID12
Feature activation+4.53e-01

Pos logit contributions
yip+1.214
+1.119
 XT+1.066
 Junction+1.054
advertisement+1.032
Neg logit contributions
aque-0.676
tained-0.675
.",-0.675
":""},{"-0.647
 Richards-0.625
Tokengrade
Token ID9526
Feature activation-2.17e-01

Pos logit contributions
yip+3.578
+3.323
 XT+3.228
 Junction+3.147
advertisement+3.093
Neg logit contributions
tained-2.113
aque-2.109
.",-2.096
":""},{"-2.027
 Richards-1.954
Token,
Token ID11
Feature activation-3.04e-03

Pos logit contributions
.",+1.050
aque+1.044
tained+1.039
":""},{"+1.020
 Richards+0.990
Neg logit contributions
yip-1.728
-1.577
 XT-1.457
 Junction-1.453
advertisement-1.450
Token aluminum
Token ID15955
Feature activation-2.78e-01

Pos logit contributions
aque+0.015
.",+0.015
tained+0.015
":""},{"+0.015
 Richards+0.014
Neg logit contributions
yip-0.023
-0.022
 XT-0.020
 Junction-0.020
advertisement-0.020
Token-
Token ID12
Feature activation+2.04e-01

Pos logit contributions
.",+1.432
aque+1.423
tained+1.417
":""},{"+1.369
 Richards+1.343
Neg logit contributions
yip-2.137
-1.926
 Junction-1.787
 XT-1.785
advertisement-1.759
Tokenall
Token ID439
Feature activation+2.21e-02

Pos logit contributions
yip+1.503
+1.387
 XT+1.331
 Junction+1.307
advertisement+1.266
Neg logit contributions
aque-1.094
tained-1.090
.",-1.079
":""},{"-1.046
 Richards-1.009
Tokenoy
Token ID726
Feature activation+4.12e-02

Pos logit contributions
yip+0.192
+0.177
 XT+0.169
 Junction+0.168
advertisement+0.164
Neg logit contributions
aque-0.092
tained-0.091
.",-0.090
":""},{"-0.085
oys-0.085
Token body
Token ID1767
Feature activation+3.55e-01

Pos logit contributions
yip+0.311
+0.286
 XT+0.272
 Junction+0.268
advertisement+0.262
Neg logit contributions
tained-0.214
aque-0.213
.",-0.213
":""},{"-0.205
 Richards-0.199
Token a
Token ID257
Feature activation+1.65e-02

Pos logit contributions
yip+1.252
+1.163
 XT+1.125
 Junction+1.102
advertisement+1.074
Neg logit contributions
tained-0.742
aque-0.741
.",-0.734
":""},{"-0.709
 Richards-0.680
Token device
Token ID3335
Feature activation-8.71e-02

Pos logit contributions
yip+0.130
+0.120
 XT+0.115
 Junction+0.113
advertisement+0.111
Neg logit contributions
aque-0.079
tained-0.079
.",-0.079
":""},{"-0.076
 Richards-0.074
Token like
Token ID588
Feature activation+1.27e-01

Pos logit contributions
aque+0.407
tained+0.405
.",+0.403
":""},{"+0.387
 Richards+0.374
Neg logit contributions
yip-0.699
-0.646
 XT-0.621
 Junction-0.613
PLE-0.595
Token the
Token ID262
Feature activation+9.40e-02

Pos logit contributions
yip+1.049
+0.977
 XT+0.948
 Junction+0.929
advertisement+0.906
Neg logit contributions
aque-0.554
tained-0.554
.",-0.548
":""},{"-0.529
 Richards-0.505
Token Love
Token ID5896
Feature activation-1.52e-01

Pos logit contributions
yip+0.528
+0.472
 XT+0.446
 Junction+0.435
advertisement+0.421
Neg logit contributions
tained-0.664
aque-0.663
.",-0.659
":""},{"-0.643
 Richards-0.629
Tokent
Token ID83
Feature activation-2.41e-03

Pos logit contributions
aque+0.666
tained+0.665
.",+0.657
":""},{"+0.636
tro+0.605
Neg logit contributions
yip-1.257
-1.166
 XT-1.129
 Junction-1.110
advertisement-1.081
Tokenron
Token ID1313
Feature activation-1.26e-01

Pos logit contributions
tained+0.010
aque+0.010
.",+0.010
":""},{"+0.010
 Richards+0.009
Neg logit contributions
yip-0.020
-0.019
 XT-0.018
 Junction-0.018
advertisement-0.018
Token9
Token ID24
Feature activation-2.37e-01

Pos logit contributions
tained+0.753
.",+0.752
aque+0.750
":""},{"+0.729
 Richards+0.711
Neg logit contributions
yip-0.843
-0.763
 XT-0.729
 Junction-0.713
advertisement-0.695
Token000
Token ID830
Feature activation-4.78e-01

Pos logit contributions
tained+0.552
aque+0.552
.",+0.540
":""},{"+0.503
 Richards+0.459
Neg logit contributions
yip-2.451
-2.313
 XT-2.252
 Junction-2.215
PLE-2.169
Token,
Token ID11
Feature activation-1.80e-01

Pos logit contributions
aque+2.201
tained+2.174
.",+2.166
":""},{"+2.043
 Richards+2.011
Neg logit contributions
yip-3.853
-3.492
 Junction-3.321
 XT-3.302
advertisement-3.263
Token he
Token ID339
Feature activation-5.70e-02

Pos logit contributions
tained+0.844
.",+0.840
aque+0.839
":""},{"+0.803
 Richards+0.793
Neg logit contributions
yip-1.433
-1.318
 XT-1.272
 Junction-1.254
advertisement-1.216
Token experience
Token ID1998
Feature activation-3.93e-01

Pos logit contributions
tained+0.917
.",+0.913
aque+0.908
":""},{"+0.874
 Richards+0.848
Neg logit contributions
yip-1.616
-1.486
 XT-1.430
 Junction-1.410
PLE-1.373
Token in
Token ID287
Feature activation-3.74e-01

Pos logit contributions
.",+1.871
aque+1.815
tained+1.793
":""},{"+1.761
 Richards+1.692
Neg logit contributions
yip-3.131
-2.929
 XT-2.798
 Junction-2.727
PLE-2.666
Token politics
Token ID4819
Feature activation+1.23e-02

Pos logit contributions
.",+1.880
tained+1.871
aque+1.827
":""},{"+1.800
 Richards+1.762
Neg logit contributions
yip-2.846
-2.618
 XT-2.497
 Junction-2.438
PLE-2.391
Token,
Token ID11
Feature activation-3.07e-01

Pos logit contributions
yip+0.094
+0.086
 XT+0.083
 Junction+0.081
PLE+0.079
Neg logit contributions
.",-0.063
tained-0.062
aque-0.062
":""},{"-0.060
 Richards-0.058
Token and
Token ID290
Feature activation-3.41e-01

Pos logit contributions
.",+1.473
tained+1.467
aque+1.462
":""},{"+1.442
 Richards+1.389
Neg logit contributions
yip-2.391
-2.212
 XT-2.126
 Junction-2.086
advertisement-2.028
Token to
Token ID284
Feature activation-4.29e-01

Pos logit contributions
.",+1.643
aque+1.632
tained+1.625
":""},{"+1.594
 Richards+1.542
Neg logit contributions
yip-2.657
-2.454
 XT-2.363
 Junction-2.309
advertisement-2.249
Token become
Token ID1716
Feature activation-1.70e-01

Pos logit contributions
.",+2.032
tained+1.990
aque+1.982
":""},{"+1.954
 Richards+1.898
Neg logit contributions
yip-3.379
-3.092
 XT-2.989
 Junction-2.928
advertisement-2.849
Token perhaps
Token ID3737
Feature activation+2.26e-01

Pos logit contributions
.",+0.784
aque+0.778
tained+0.777
":""},{"+0.756
 Richards+0.730
Neg logit contributions
yip-1.376
-1.264
 XT-1.216
 Junction-1.200
advertisement-1.166
Token a
Token ID257
Feature activation-9.91e-02

Pos logit contributions
yip+1.852
+1.726
 XT+1.673
 Junction+1.638
advertisement+1.597
Neg logit contributions
tained-1.009
aque-1.008
.",-0.994
":""},{"-0.958
 Richards-0.916
Token bit
Token ID1643
Feature activation-1.44e-01

Pos logit contributions
tained+0.460
aque+0.460
.",+0.457
":""},{"+0.443
 Richards+0.427
Neg logit contributions
yip-0.793
-0.732
 XT-0.710
 Junction-0.694
advertisement-0.681
Token more
Token ID517
Feature activation-7.97e-02

Pos logit contributions
aque+0.563
.",+0.554
tained+0.547
":""},{"+0.534
 Richards+0.508
Neg logit contributions
yip-1.266
-1.176
 XT-1.141
 Junction-1.123
advertisement-1.096
Token
Token ID198
Feature activation-6.63e-02

Pos logit contributions
tained+1.142
aque+1.139
.",+1.136
":""},{"+1.103
 Richards+1.065
Neg logit contributions
yip-1.512
-1.375
 XT-1.323
 Junction-1.292
advertisement-1.251
TokenEven
Token ID6104
Feature activation-1.22e-01

Pos logit contributions
aque+0.381
tained+0.381
.",+0.378
":""},{"+0.368
 Richards+0.355
Neg logit contributions
yip-0.461
-0.420
 XT-0.405
 Junction-0.396
advertisement-0.380
Token so
Token ID523
Feature activation-5.88e-02

Pos logit contributions
aque+0.588
tained+0.587
.",+0.583
":""},{"+0.561
 Richards+0.548
Neg logit contributions
yip-0.958
-0.891
 XT-0.853
 Junction-0.841
advertisement-0.820
Token,
Token ID11
Feature activation-4.28e-01

Pos logit contributions
tained+0.310
aque+0.310
.",+0.308
":""},{"+0.298
 Richards+0.288
Neg logit contributions
yip-0.433
-0.400
 XT-0.386
 Junction-0.377
advertisement-0.366
Token as
Token ID355
Feature activation-3.75e-01

Pos logit contributions
aque+2.055
.",+2.042
tained+2.025
":""},{"+1.955
 Richards+1.942
Neg logit contributions
yip-3.334
-3.117
 XT-2.927
 Junction-2.899
advertisement-2.860
Token a
Token ID257
Feature activation+8.22e-03

Pos logit contributions
aque+1.821
tained+1.792
.",+1.778
":""},{"+1.728
 Richards+1.705
Neg logit contributions
yip-2.911
-2.722
 XT-2.568
 Junction-2.532
advertisement-2.497
Token diplomatic
Token ID13093
Feature activation-3.36e-01

Pos logit contributions
yip+0.061
+0.057
 XT+0.054
 Junction+0.053
advertisement+0.052
Neg logit contributions
tained-0.043
aque-0.043
.",-0.042
":""},{"-0.041
 Richards-0.040
Token courtesy
Token ID12537
Feature activation-3.15e-01

Pos logit contributions
aque+1.681
tained+1.660
.",+1.642
":""},{"+1.617
 Richards+1.564
Neg logit contributions
yip-2.576
-2.394
 XT-2.236
advertisement-2.205
 Junction-2.200
Token Cuba
Token ID14159
Feature activation-6.46e-01

Pos logit contributions
tained+1.542
aque+1.537
.",+1.511
":""},{"+1.481
 Richards+1.432
Neg logit contributions
yip-2.412
-2.270
 XT-2.167
 Junction-2.130
advertisement-2.078
Token invited
Token ID9392
Feature activation-3.64e-01

Pos logit contributions
aque+2.918
tained+2.862
.",+2.816
":""},{"+2.738
 Richards+2.674
Neg logit contributions
yip-5.222
-4.875
 XT-4.628
 Junction-4.558
advertisement-4.535
Token O
Token ID440
Feature activation-4.61e-01

Pos logit contributions
aque+1.751
.",+1.750
tained+1.743
":""},{"+1.678
 Richards+1.657
Neg logit contributions
yip-2.859
-2.648
 XT-2.502
 Junction-2.481
advertisement-2.424
Token
Token ID447
Feature activation-1.55e-02

Pos logit contributions
yip+0.563
+0.525
 XT+0.503
 Junction+0.491
advertisement+0.480
Neg logit contributions
aque-0.469
tained-0.468
.",-0.463
":""},{"-0.448
 Richards-0.437
Token
Token ID247
Feature activation-7.90e-02

Pos logit contributions
tained+0.092
aque+0.092
.",+0.091
":""},{"+0.089
 Richards+0.086
Neg logit contributions
yip-0.104
-0.095
 XT-0.091
 Junction-0.089
PLE-0.086
Tokent
Token ID83
Feature activation-2.32e-01

Pos logit contributions
tained+0.344
aque+0.343
.",+0.340
":""},{"+0.326
 Richards+0.314
Neg logit contributions
yip-0.661
-0.612
 XT-0.593
 Junction-0.583
advertisement-0.566
Token serve
Token ID4691
Feature activation+7.87e-02

Pos logit contributions
.",+1.121
tained+1.089
aque+1.080
":""},{"+1.056
 Richards+1.017
Neg logit contributions
yip-1.842
-1.694
 XT-1.626
 Junction-1.597
advertisement-1.558
Token a
Token ID257
Feature activation+1.88e-01

Pos logit contributions
yip+0.617
+0.564
 XT+0.541
 Junction+0.532
advertisement+0.519
Neg logit contributions
.",-0.388
tained-0.383
aque-0.383
":""},{"-0.370
 Richards-0.359
Token mission
Token ID4365
Feature activation-2.34e-01

Pos logit contributions
yip+1.386
+1.264
 XT+1.220
 Junction+1.191
advertisement+1.158
Neg logit contributions
tained-1.005
aque-1.004
.",-1.002
":""},{"-0.967
 Richards-0.938
Token.
Token ID13
Feature activation+8.80e-02

Pos logit contributions
.",+1.140
tained+1.112
aque+1.108
":""},{"+1.064
 Richards+1.030
Neg logit contributions
yip-1.843
-1.696
 XT-1.636
 Junction-1.602
PLE-1.550
Token In
Token ID554
Feature activation-2.41e-01

Pos logit contributions
yip+0.671
+0.611
 XT+0.589
 Junction+0.576
advertisement+0.562
Neg logit contributions
tained-0.449
.",-0.449
aque-0.447
":""},{"-0.434
 Richards-0.423
Token fact
Token ID1109
Feature activation+7.99e-01

Pos logit contributions
tained+1.173
.",+1.162
aque+1.149
":""},{"+1.105
 Richards+1.083
Neg logit contributions
yip-1.889
-1.719
 XT-1.660
 Junction-1.634
PLE-1.585
Token,
Token ID11
Feature activation+4.89e-01

Pos logit contributions
yip+5.304
+4.996
 XT+4.791
 Junction+4.642
PLE+4.542
Neg logit contributions
aque-4.572
tained-4.557
":""},{"-4.419
.",-4.372
 Richards-4.152
Token a
Token ID257
Feature activation+1.32e-01

Pos logit contributions
yip+3.569
+3.316
 XT+3.212
 Junction+3.132
PLE+3.049
Neg logit contributions
aque-2.592
tained-2.588
.",-2.518
":""},{"-2.473
 Richards-2.350
Token work
Token ID670
Feature activation-1.96e-01

Pos logit contributions
tained+0.052
.",+0.052
aque+0.051
":""},{"+0.049
 Richards+0.048
Neg logit contributions
yip-0.071
-0.064
 XT-0.062
 Junction-0.061
PLE-0.059
Token better
Token ID1365
Feature activation-3.15e-01

Pos logit contributions
.",+0.966
tained+0.951
aque+0.940
":""},{"+0.903
upon+0.881
Neg logit contributions
yip-1.533
-1.404
 XT-1.353
 Junction-1.322
PLE-1.293
Token than
Token ID621
Feature activation-5.18e-01

Pos logit contributions
.",+1.449
tained+1.434
aque+1.404
":""},{"+1.352
upon+1.314
Neg logit contributions
yip-2.562
-2.359
 XT-2.277
 Junction-2.253
PLE-2.176
Token anyone
Token ID2687
Feature activation-2.52e-01

Pos logit contributions
.",+2.432
tained+2.361
aque+2.315
upon+2.254
":""},{"+2.243
Neg logit contributions
yip-4.110
-3.800
 Junction-3.569
 XT-3.565
PLE-3.451
Token else
Token ID2073
Feature activation-9.79e-01

Pos logit contributions
.",+1.069
aque+1.038
tained+1.030
":""},{"+0.991
upon+0.956
Neg logit contributions
yip-2.145
-1.996
 XT-1.919
 Junction-1.887
PLE-1.839
Token does
Token ID857
Feature activation-9.64e-01

Pos logit contributions
.",+4.388
aque+4.057
tained+3.975
.","+3.917
upon+3.901
Neg logit contributions
yip-7.850
-7.268
 XT-6.954
 Junction-6.775
PLE-6.588
Token.
Token ID13
Feature activation+6.15e-02

Pos logit contributions
.",+4.356
tained+4.022
aque+3.980
.","+3.930
":""},{"+3.835
Neg logit contributions
yip-7.705
-7.065
 XT-6.795
 Junction-6.650
PLE-6.438
Token Be
Token ID1355
Feature activation-2.43e-02

Pos logit contributions
yip+0.473
+0.434
 XT+0.417
 Junction+0.408
advertisement+0.398
Neg logit contributions
tained-0.308
aque-0.307
.",-0.306
":""},{"-0.296
 Richards-0.286
Token very
Token ID845
Feature activation+1.43e-01

Pos logit contributions
aque+0.116
.",+0.115
tained+0.114
":""},{"+0.110
 Richards+0.106
Neg logit contributions
yip-0.195
-0.179
 XT-0.173
 Junction-0.171
advertisement-0.165
Token mindful
Token ID33798
Feature activation+3.85e-02

Pos logit contributions
yip+1.105
+1.023
 XT+0.988
 Junction+0.967
advertisement+0.941
Neg logit contributions
aque-0.706
tained-0.706
.",-0.700
":""},{"-0.676
 Richards-0.651
Token of
Token ID286
Feature activation-1.93e-01

Pos logit contributions
yip+0.290
+0.268
 XT+0.258
 Junction+0.253
advertisement+0.245
Neg logit contributions
.",-0.197
aque-0.197
tained-0.196
":""},{"-0.190
 Richards-0.182
Token plant
Token ID4618
Feature activation+1.50e-01

Pos logit contributions
aque+0.436
tained+0.435
.",+0.435
":""},{"+0.418
 Richards+0.404
Neg logit contributions
yip-0.703
-0.646
 XT-0.626
 Junction-0.613
advertisement-0.597
Token these
Token ID777
Feature activation-1.79e-01

Pos logit contributions
yip+1.154
+1.072
 XT+1.036
 Junction+1.012
advertisement+0.987
Neg logit contributions
aque-0.744
tained-0.743
.",-0.735
":""},{"-0.713
 Richards-0.685
Token plots
Token ID21528
Feature activation-5.39e-02

Pos logit contributions
tained+0.775
.",+0.774
aque+0.772
":""},{"+0.734
 Richards+0.713
Neg logit contributions
yip-1.510
-1.381
 XT-1.344
 Junction-1.320
PLE-1.284
Token with
Token ID351
Feature activation-3.19e-01

Pos logit contributions
tained+0.262
aque+0.261
.",+0.261
":""},{"+0.251
 Richards+0.242
Neg logit contributions
yip-0.421
-0.389
 XT-0.375
 Junction-0.368
advertisement-0.358
Token a
Token ID257
Feature activation-3.11e-01

Pos logit contributions
tained+1.515
aque+1.497
.",+1.484
":""},{"+1.445
 Richards+1.391
Neg logit contributions
yip-2.539
-2.330
 XT-2.239
 Junction-2.210
advertisement-2.159
Token natural
Token ID3288
Feature activation-8.31e-01

Pos logit contributions
tained+1.471
aque+1.465
.",+1.455
":""},{"+1.422
 Richards+1.354
Neg logit contributions
yip-2.457
-2.261
 XT-2.173
 Junction-2.135
advertisement-2.104
Token mix
Token ID5022
Feature activation-2.39e-01

Pos logit contributions
aque+3.957
tained+3.949
.",+3.924
":""},{"+3.854
 Richards+3.793
Neg logit contributions
yip-6.284
-5.677
 XT-5.495
 Junction-5.399
advertisement-5.331
Token of
Token ID286
Feature activation-3.60e-01

Pos logit contributions
aque+1.186
.",+1.185
tained+1.182
":""},{"+1.146
upon+1.090
Neg logit contributions
yip-1.819
-1.679
 XT-1.628
 Junction-1.615
PLE-1.551
Token wild
Token ID4295
Feature activation-1.76e-01

Pos logit contributions
tained+1.681
aque+1.674
.",+1.666
":""},{"+1.608
 Richards+1.567
Neg logit contributions
yip-2.894
-2.645
 XT-2.538
 Junction-2.508
advertisement-2.454
Token ed
Token ID1225
Feature activation-9.67e-02

Pos logit contributions
tained+0.891
aque+0.887
.",+0.876
":""},{"+0.846
 Richards+0.824
Neg logit contributions
yip-1.350
-1.220
 XT-1.186
 Junction-1.169
advertisement-1.146
Tokenibles
Token ID18764
Feature activation-1.41e-01

Pos logit contributions
aque+0.259
tained+0.259
.",+0.255
":""},{"+0.239
 Richards+0.222
Neg logit contributions
yip-0.965
-0.910
 XT-0.886
 Junction-0.873
advertisement-0.855
Token v
Token ID410
Feature activation+4.34e-01

Pos logit contributions
yip+0.739
+0.680
 XT+0.645
 Junction+0.632
advertisement+0.620
Neg logit contributions
aque-0.562
tained-0.557
.",-0.557
":""},{"-0.539
 Richards-0.522
Token.
Token ID13
Feature activation-7.99e-01

Pos logit contributions
yip+3.183
+2.944
 XT+2.848
 Junction+2.783
advertisement+2.708
Neg logit contributions
tained-2.290
aque-2.284
.",-2.263
":""},{"-2.198
 Richards-2.124
Tokenver
Token ID332
Feature activation-5.51e-01

Pos logit contributions
aque+4.341
.",+4.089
tained+4.081
":""},{"+3.993
upon+3.911
Neg logit contributions
yip-5.616
-5.112
 XT-4.768
 Junction-4.743
advertisement-4.694
Tokentex
Token ID16886
Feature activation-1.15e-02

Pos logit contributions
aque+1.390
tained+1.333
.",+1.297
":""},{"+1.217
tro+1.151
Neg logit contributions
yip-5.576
-5.239
 Junction-4.963
 XT-4.955
PLE-4.877
Token.
Token ID13
Feature activation-8.12e-01

Pos logit contributions
aque+0.065
tained+0.065
.",+0.064
":""},{"+0.062
upon+0.060
Neg logit contributions
yip-0.082
-0.075
 XT-0.071
 Junction-0.070
advertisement-0.067
Tokenxy
Token ID5431
Feature activation-9.07e-01

Pos logit contributions
aque+4.373
.",+4.159
tained+4.092
":""},{"+4.035
upon+3.950
Neg logit contributions
yip-5.749
-5.286
 XT-4.936
 Junction-4.923
PLE-4.787
Tokenz
Token ID89
Feature activation-2.90e-01

Pos logit contributions
.",+3.977
aque+3.957
tained+3.862
":""},{"+3.802
tro+3.664
Neg logit contributions
yip-7.180
-6.518
 Junction-6.104
 XT-6.099
advertisement-5.998
Token +=
Token ID15853
Feature activation-8.10e-01

Pos logit contributions
aque+1.459
.",+1.451
tained+1.441
":""},{"+1.400
 Richards+1.344
Neg logit contributions
yip-2.229
-2.004
 XT-1.914
 Junction-1.889
advertisement-1.834
Token v
Token ID410
Feature activation+3.98e-01

Pos logit contributions
.",+4.140
tained+4.031
aque+3.997
":""},{"+3.891
.","+3.787
Neg logit contributions
yip-5.944
-5.539
 XT-4.935
advertisement-4.912
 Junction-4.902
Token.
Token ID13
Feature activation-6.05e-01

Pos logit contributions
yip+2.836
+2.614
 XT+2.532
 Junction+2.480
advertisement+2.415
Neg logit contributions
tained-2.164
aque-2.159
.",-2.137
":""},{"-2.095
 Richards-2.022
Tokennormal
Token ID11265
Feature activation-1.23e-01

Pos logit contributions
aque+3.294
tained+3.177
.",+3.154
":""},{"+3.008
tro+2.983
Neg logit contributions
yip-4.296
-4.018
 XT-3.799
 Junction-3.760
PLE-3.660
Token?
Token ID5633
Feature activation-1.30e-01

Pos logit contributions
.",+3.033
tained+2.990
aque+2.960
":""},{"+2.956
 Richards+2.870
Neg logit contributions
yip-3.533
-3.093
 XT-2.891
 Junction-2.863
spect-2.771
Token GO
Token ID10351
Feature activation-3.45e-01

Pos logit contributions
tained+0.704
.",+0.703
aque+0.697
":""},{"+0.682
 Richards+0.663
Neg logit contributions
yip-0.936
-0.854
 XT-0.816
 Junction-0.801
advertisement-0.773
TokenAL
Token ID1847
Feature activation-2.61e-01

Pos logit contributions
tained+1.996
.",+1.982
":""},{"+1.943
aque+1.938
 Richards+1.895
Neg logit contributions
yip-2.377
-2.117
 XT-1.970
 Junction-1.942
advertisement-1.913
Token!
Token ID0
Feature activation-1.32e-01

Pos logit contributions
tained+1.574
.",+1.563
aque+1.542
":""},{"+1.519
 Richards+1.490
Neg logit contributions
yip-1.726
-1.560
 XT-1.470
 Junction-1.442
advertisement-1.391
Token pic
Token ID8301
Feature activation-6.68e-02

Pos logit contributions
tained+0.701
.",+0.696
aque+0.693
":""},{"+0.681
 Richards+0.664
Neg logit contributions
yip-0.970
-0.873
 XT-0.841
 Junction-0.821
advertisement-0.800
Token.
Token ID13
Feature activation-9.64e-01

Pos logit contributions
tained+0.348
aque+0.347
.",+0.346
":""},{"+0.335
 Richards+0.323
Neg logit contributions
yip-0.500
-0.459
 XT-0.443
 Junction-0.433
PLE-0.420
Tokentwitter
Token ID6956
Feature activation-6.66e-01

Pos logit contributions
tained+4.013
.",+3.769
aque+3.698
":""},{"+3.697
tro+3.628
Neg logit contributions
yip-8.383
 XT-7.443
-7.392
 Junction-7.196
 srf-7.032
Token.
Token ID13
Feature activation-1.11e+00

Pos logit contributions
.",+3.062
tained+2.943
":""},{"+2.905
aque+2.879
upon+2.791
Neg logit contributions
yip-5.673
-4.973
 XT-4.840
 Junction-4.741
 srf-4.584
Tokencom
Token ID785
Feature activation-6.89e-01

Pos logit contributions
tained+3.234
.",+3.074
upon+3.016
":""},{"+2.972
aque+2.965
Neg logit contributions
yip-11.012
-9.747
 XT-9.724
 srf-9.514
 Junction-9.497
Token/
Token ID14
Feature activation-4.54e-01

Pos logit contributions
tained+3.092
.",+3.058
":""},{"+3.049
aque+2.986
upon+2.868
Neg logit contributions
yip-5.793
-5.076
 XT-5.007
 Junction-4.842
PLE-4.700
TokenE
Token ID36
Feature activation-5.07e-01

Pos logit contributions
aque+2.691
tained+2.632
.",+2.602
":""},{"+2.544
tro+2.528
Neg logit contributions
yip-2.973
-2.783
 XT-2.667
 Junction-2.635
advertisement-2.496
Token a
Token ID257
Feature activation-7.25e-02

Pos logit contributions
yip+1.482
+1.355
 XT+1.310
 Junction+1.279
advertisement+1.243
Neg logit contributions
aque-1.085
tained-1.085
.",-1.079
":""},{"-1.043
 Richards-1.012
Token few
Token ID1178
Feature activation+7.82e-02

Pos logit contributions
.",+0.457
aque+0.454
tained+0.453
":""},{"+0.438
 Richards+0.434
Neg logit contributions
yip-0.465
-0.421
 XT-0.402
 Junction-0.390
advertisement-0.375
Token seconds
Token ID4201
Feature activation-1.95e-01

Pos logit contributions
yip+0.579
+0.526
 XT+0.507
 Junction+0.496
advertisement+0.481
Neg logit contributions
aque-0.418
.",-0.417
tained-0.417
":""},{"-0.400
 Richards-0.392
Token and
Token ID290
Feature activation+3.89e-01

Pos logit contributions
.",+0.923
tained+0.911
aque+0.904
":""},{"+0.864
upon+0.843
Neg logit contributions
yip-1.560
-1.451
 XT-1.392
 Junction-1.375
PLE-1.315
Token let
Token ID1309
Feature activation-4.10e-01

Pos logit contributions
yip+2.690
+2.475
 XT+2.404
 Junction+2.348
PLE+2.277
Neg logit contributions
aque-2.220
tained-2.210
.",-2.151
":""},{"-2.125
 Richards-2.039
Token Will
Token ID2561
Feature activation-8.88e-01

Pos logit contributions
.",+1.510
tained+1.468
aque+1.456
 Richards+1.380
":""},{"+1.366
Neg logit contributions
yip-3.663
-3.338
 Junction-3.263
 XT-3.252
advertisement-3.163
Token know
Token ID760
Feature activation-7.20e-01

Pos logit contributions
.",+4.231
 Richards+4.095
aque+4.049
tained+4.022
+3.861
Neg logit contributions
yip-6.766
-6.108
 XT-5.893
 Junction-5.786
advertisement-5.738
Token by
Token ID416
Feature activation-2.82e-01

Pos logit contributions
.",+3.009
tained+2.827
aque+2.806
 Richards+2.724
upon+2.692
Neg logit contributions
yip-6.146
-5.648
 XT-5.383
 Junction-5.341
advertisement-5.185
Token clicking
Token ID12264
Feature activation-3.22e-01

Pos logit contributions
.",+1.492
tained+1.463
aque+1.458
":""},{"+1.387
 Richards+1.385
Neg logit contributions
yip-2.127
-1.927
 XT-1.834
 Junction-1.816
advertisement-1.741
Token the
Token ID262
Feature activation-2.32e-01

Pos logit contributions
.",+1.468
tained+1.458
aque+1.441
":""},{"+1.375
 Richards+1.350
Neg logit contributions
yip-2.612
-2.396
 XT-2.308
 Junction-2.287
PLE-2.212
Token link
Token ID2792
Feature activation-5.14e-01

Pos logit contributions
tained+1.192
.",+1.187
aque+1.179
":""},{"+1.125
 Richards+1.104
Neg logit contributions
yip-1.740
-1.575
 XT-1.524
 Junction-1.508
PLE-1.458