TOP ACTIVATIONS
MAX = 16.583

dollar note all colours and value are worth 1000 , 000
Michigan mud , sand and snow Ford , America
, characterized by oval or poly gon al crater iform pap
Carl Hag elin and Nick Bon ino . No mention of
с к а я е д а л ь
of widespread Syrian espionage and sub version came to light as
members of Ford Nation and par rot Ford s
d iss ident , and pro - im perial ist sc
ι ρ π α γ
she added , he will also be
onen ( who may or may not take an actual roster
sentence against Naval ny and Py ot r Of its er
said , adding salaries and overtime often match what the market
India level , rural and urban combined , the median household
м е л а е с т о
stories , prose , and non - fiction . Cos player
ly anti - democratic or pro - author itarian . In
offering to sell it or possibly trade it in for another
Do use or do not use social media ...
is marginal , and most likely only as scrap .

MIN ACTIVATIONS
MIN = -15.167

on May 25 . ( John McDonnell / The Washington Post
Develop ed by : Relic Entertainment - Sierra Studios (
, the team was 63 - 3 when leading after eight
practice May 25 . ( John McDonnell / The Washington Post
will be eliminated entirely . Keep in mind that this has
lf appeared in the " Keep Fish in '" music video
." " That didn 't work in the 1930 s ,
( Photo by REUTERS / John S omm ers II )
your job isn 't to keep Red Hat or CentOS servers
Last year 's Walking Dead maze was split between
, 055 complaints filed against Wells Fargo which is accused
www . bel fast te legraph . co . uk /
2011 , it was 62 - 3 . It
It was an explosive 80 - yard sprint , and McF
five touchdowns in a 41 - 27 Vanderbilt win that broke
graph ). For the 17 - 40 and 16 - 35
feels like it struggles to keep up with the two top
New - Object System . Draw ing . Rect angle F
the prime challenge Belfast Te legraph . co . uk Dis
2012 , it was 68 - 2 . In 2011 ,

INTERVAL 8.645 - 16.583
CONTAINS 0.452%

said of NHS staff : They care
They are now expected to face a run -
s John Dick erson that pre - existing conditions were going
people ) and other gender non con forming ( G NC
My advice to Donald Trump is simply to turn off

INTERVAL 0.708 - 8.645
CONTAINS 40.046%

election results with granddaughter Kara Trib by , 7 , in
x . Wire x , a provider of bitcoin
picture shoot was delayed by almost an hour .[ 12 ]
a man that tall and heavy and turn back to
The boundary between Europe and Asia is , in fact ,

INTERVAL -7.230 - 0.708
CONTAINS 58.448%

discovery . ... Metroid v ani as are at their best
. To fight . The tension came at the last possible
not , Bounty gate affects every aspect of league punishment and
you asked a Hillary supporter , who s living
that Putin has created . They are acting according to the

INTERVAL -15.167 - -7.230
CONTAINS 1.055%

ru itt has also failed to take legal action to protect
are soldiers marking the 65 th anniversary of the country 's
, DA VID SL ACK Bill Walk
ko . " It really speaks to the synergy we have
a Devil , collecting 142 goals ( 12 th in club
Token dollar
Token ID8872
Feature activation+6.62e+00

Pos logit contributions
skirts+2.844
lik+2.162
haps+2.157
arthy+2.042
EStream+2.032
Neg logit contributions
advertisement-2.792
ENSE-2.314
INT-2.314
ARB-2.243
gerald-2.224
Token note
Token ID3465
Feature activation-2.18e-01

Pos logit contributions
skirts+5.773
lik+4.512
haps+4.406
 vo+4.254
arthy+4.184
Neg logit contributions
advertisement-5.310
ENSE-4.405
INT-4.390
ARB-4.294
gerald-4.236
Token all
Token ID477
Feature activation+2.48e+00

Pos logit contributions
advertisement+0.179
INT+0.147
ENSE+0.145
ARB+0.141
gerald+0.140
Neg logit contributions
skirts-0.195
haps-0.150
lik-0.148
EStream-0.142
arthy-0.142
Token colours
Token ID18915
Feature activation+3.68e+00

Pos logit contributions
skirts+2.181
haps+1.661
lik+1.655
EStream+1.578
arthy+1.573
Neg logit contributions
advertisement-2.067
INT-1.713
ENSE-1.706
ARB-1.653
gerald-1.642
Token and
Token ID290
Feature activation+4.82e+00

Pos logit contributions
skirts+3.241
lik+2.496
haps+2.481
arthy+2.349
 vo+2.335
Neg logit contributions
advertisement-3.029
ENSE-2.495
INT-2.494
ARB-2.418
gerald-2.402
Token value
Token ID1988
Feature activation+1.66e+01

Pos logit contributions
skirts+3.898
lik+2.919
haps+2.896
arthy+2.723
 vo+2.708
Neg logit contributions
advertisement-4.324
ENSE-3.624
INT-3.621
ARB-3.523
gerald-3.504
Token are
Token ID389
Feature activation+2.62e+00

Pos logit contributions
skirts+13.441
lik+10.797
 vo+10.186
haps+10.144
arthy+9.712
Neg logit contributions
advertisement-12.084
ENSE-9.834
gerald-9.787
ARB-9.657
INT-9.609
Token worth
Token ID2861
Feature activation+5.35e+00

Pos logit contributions
skirts+2.301
lik+1.757
haps+1.754
arthy+1.656
EStream+1.654
Neg logit contributions
advertisement-2.188
INT-1.812
ENSE-1.806
ARB-1.754
gerald-1.735
Token 1000
Token ID8576
Feature activation+1.05e+00

Pos logit contributions
skirts+4.492
lik+3.407
haps+3.376
arthy+3.173
 vo+3.162
Neg logit contributions
advertisement-4.626
ENSE-3.842
INT-3.827
ARB-3.732
gerald-3.709
Token,
Token ID11
Feature activation-2.78e+00

Pos logit contributions
skirts+0.977
haps+0.761
lik+0.753
arthy+0.725
EStream+0.717
Neg logit contributions
advertisement-0.824
INT-0.677
ENSE-0.671
ARB-0.652
Db-0.645
Token000
Token ID830
Feature activation+1.98e+00

Pos logit contributions
advertisement+1.550
INT+1.168
ENSE+1.142
ARB+1.079
Db+1.070
Neg logit contributions
skirts-3.169
lik-2.582
haps-2.582
EStream-2.504
arthy-2.495
Token Michigan
Token ID7055
Feature activation-4.48e-01

Pos logit contributions
advertisement+0.860
INT+0.713
ENSE+0.707
ARB+0.694
gerald+0.679
Neg logit contributions
skirts-0.935
lik-0.720
haps-0.715
EStream-0.679
arthy-0.677
Token mud
Token ID17492
Feature activation+2.64e+00

Pos logit contributions
advertisement+0.342
INT+0.279
ENSE+0.276
ARB+0.271
Db+0.264
Neg logit contributions
skirts-0.422
lik-0.331
haps-0.330
EStream-0.314
arthy-0.313
Token,
Token ID11
Feature activation+2.76e+00

Pos logit contributions
skirts+2.304
lik+1.763
haps+1.759
arthy+1.664
EStream+1.655
Neg logit contributions
advertisement-2.210
INT-1.829
ENSE-1.820
ARB-1.776
gerald-1.756
Token sand
Token ID6450
Feature activation+9.17e+00

Pos logit contributions
skirts+2.409
lik+1.842
haps+1.834
arthy+1.734
EStream+1.720
Neg logit contributions
advertisement-2.310
INT-1.911
ENSE-1.909
ARB-1.850
gerald-1.836
Token and
Token ID290
Feature activation+6.30e+00

Pos logit contributions
skirts+8.806
lik+7.068
haps+6.919
 vo+6.733
nels+6.694
Neg logit contributions
advertisement-6.392
ENSE-5.267
INT-5.026
gerald-4.821
INESS-4.756
Token snow
Token ID6729
Feature activation+1.54e+01

Pos logit contributions
skirts+5.096
lik+3.874
haps+3.849
 vo+3.674
arthy+3.589
Neg logit contributions
advertisement-5.520
INT-4.588
ENSE-4.583
gerald-4.439
Db-4.395
Token
Token ID198
Feature activation+6.37e-01

Pos logit contributions
skirts+12.571
lik+9.521
haps+9.145
 vo+8.893
cigarettes+8.786
Neg logit contributions
advertisement-11.720
ENSE-10.223
INT-9.578
gerald-9.385
APTER-9.308
Token
Token ID198
Feature activation-3.25e+00

Pos logit contributions
skirts+0.447
haps+0.314
lik+0.312
EStream+0.292
arthy+0.289
Neg logit contributions
advertisement-0.648
INT-0.555
ENSE-0.552
ARB-0.540
gerald-0.533
TokenFord
Token ID37308
Feature activation-4.19e-01

Pos logit contributions
advertisement+2.090
INT+1.632
ENSE+1.566
ARB+1.553
Bench+1.511
Neg logit contributions
skirts-3.511
haps-2.850
lik-2.808
EStream-2.764
 vo-2.755
Token,
Token ID11
Feature activation-4.38e-02

Pos logit contributions
advertisement+0.328
INT+0.270
ENSE+0.268
ARB+0.261
gerald+0.256
Neg logit contributions
skirts-0.387
lik-0.301
haps-0.299
EStream-0.287
arthy-0.284
Token America
Token ID2253
Feature activation+3.23e+00

Pos logit contributions
advertisement+0.035
INT+0.028
ENSE+0.028
ARB+0.027
gerald+0.027
Neg logit contributions
skirts-0.041
lik-0.031
haps-0.031
EStream-0.030
arthy-0.030
Token,
Token ID11
Feature activation-3.58e-01

Pos logit contributions
advertisement+1.467
INT+1.192
ENSE+1.177
ARB+1.159
Db+1.123
Neg logit contributions
skirts-1.778
haps-1.362
lik-1.352
EStream-1.310
arthy-1.296
Token characterized
Token ID16264
Feature activation+6.75e-01

Pos logit contributions
advertisement+0.301
INT+0.248
ENSE+0.247
ARB+0.243
Db+0.236
Neg logit contributions
skirts-0.315
haps-0.237
lik-0.236
EStream-0.226
arthy-0.225
Token by
Token ID416
Feature activation-6.61e-01

Pos logit contributions
skirts+0.604
lik+0.462
haps+0.460
arthy+0.437
EStream+0.434
Neg logit contributions
advertisement-0.553
INT-0.454
ENSE-0.453
ARB-0.440
gerald-0.434
Token oval
Token ID41186
Feature activation+3.33e-01

Pos logit contributions
advertisement+0.532
INT+0.440
ENSE+0.438
ARB+0.424
Db+0.416
Neg logit contributions
skirts-0.597
haps-0.460
lik-0.459
EStream-0.445
arthy-0.434
Token or
Token ID393
Feature activation+2.81e+00

Pos logit contributions
skirts+0.300
lik+0.231
haps+0.230
EStream+0.220
arthy+0.218
Neg logit contributions
advertisement-0.270
INT-0.222
ENSE-0.221
ARB-0.214
gerald-0.211
Token poly
Token ID7514
Feature activation+1.52e+01

Pos logit contributions
skirts+2.504
lik+1.928
haps+1.920
arthy+1.819
EStream+1.803
Neg logit contributions
advertisement-2.299
INT-1.892
ENSE-1.891
ARB-1.832
gerald-1.816
Tokengon
Token ID14520
Feature activation+7.94e+00

Pos logit contributions
skirts+12.404
haps+9.850
lik+9.532
arthy+9.212
nels+9.157
Neg logit contributions
advertisement-11.247
ENSE-9.783
INESS-9.515
ARB-9.302
INT-9.247
Tokenal
Token ID282
Feature activation+4.40e+00

Pos logit contributions
skirts+6.481
lik+5.001
haps+4.959
arthy+4.660
nels+4.643
Neg logit contributions
advertisement-6.727
ENSE-5.674
INT-5.625
gerald-5.473
INESS-5.459
Token crater
Token ID32593
Feature activation+5.86e+00

Pos logit contributions
skirts+3.703
lik+2.808
haps+2.798
 vo+2.652
arthy+2.639
Neg logit contributions
advertisement-3.751
ENSE-3.128
INT-3.127
gerald-3.034
ARB-3.030
Tokeniform
Token ID6933
Feature activation+1.16e+00

Pos logit contributions
skirts+4.969
lik+3.836
haps+3.796
arthy+3.575
 vo+3.566
Neg logit contributions
advertisement-4.920
ENSE-4.115
INT-4.103
gerald-3.983
ARB-3.959
Token pap
Token ID20461
Feature activation+6.33e+00

Pos logit contributions
skirts+0.972
haps+0.730
lik+0.730
EStream+0.691
arthy+0.688
Neg logit contributions
advertisement-1.021
INT-0.851
ENSE-0.850
ARB-0.828
gerald-0.814
Token Carl
Token ID8124
Feature activation-9.15e+00

Pos logit contributions
skirts+1.503
lik+1.163
haps+1.160
arthy+1.106
EStream+1.099
Neg logit contributions
advertisement-1.296
INT-1.060
ENSE-1.058
ARB-1.025
gerald-1.015
Token Hag
Token ID21375
Feature activation-2.05e+00

Pos logit contributions
advertisement+5.962
INT+4.919
ENSE+4.862
ARB+4.809
Db+4.766
Neg logit contributions
skirts-8.841
haps-7.066
lik-6.973
EStream-6.784
accompan-6.601
Tokenelin
Token ID27176
Feature activation+4.01e-02

Pos logit contributions
advertisement+1.747
INT+1.477
ENSE+1.467
ARB+1.454
gerald+1.416
Neg logit contributions
skirts-1.725
haps-1.319
lik-1.255
EStream-1.248
cigarettes-1.232
Token and
Token ID290
Feature activation+1.40e+00

Pos logit contributions
skirts+0.040
haps+0.032
lik+0.032
arthy+0.030
EStream+0.030
Neg logit contributions
advertisement-0.029
INT-0.023
ENSE-0.023
ARB-0.022
gerald-0.021
Token Nick
Token ID8047
Feature activation+9.41e-01

Pos logit contributions
skirts+1.414
lik+1.126
haps+1.123
arthy+1.072
EStream+1.066
Neg logit contributions
advertisement-0.974
INT-0.773
ENSE-0.771
ARB-0.742
gerald-0.734
Token Bon
Token ID7979
Feature activation+1.52e+01

Pos logit contributions
skirts+0.795
haps+0.600
lik+0.600
EStream+0.564
arthy+0.563
Neg logit contributions
advertisement-0.812
INT-0.678
ENSE-0.677
ARB-0.657
gerald-0.654
Tokenino
Token ID2879
Feature activation+6.33e+00

Pos logit contributions
skirts+10.167
lik+7.964
haps+7.748
 vo+7.440
arthy+7.094
Neg logit contributions
advertisement-13.049
allery-11.883
ENSE-11.707
INT-11.678
ARB-11.461
Token.
Token ID13
Feature activation+3.11e+00

Pos logit contributions
skirts+6.073
lik+4.853
haps+4.782
arthy+4.571
 vo+4.551
Neg logit contributions
advertisement-4.582
ENSE-3.703
INT-3.692
ARB-3.584
gerald-3.524
Token No
Token ID1400
Feature activation+9.97e-02

Pos logit contributions
skirts+2.302
lik+1.659
haps+1.658
arthy+1.540
EStream+1.530
Neg logit contributions
advertisement-3.022
INT-2.572
ENSE-2.566
ARB-2.503
gerald-2.485
Token mention
Token ID3068
Feature activation-1.84e+00

Pos logit contributions
skirts+0.092
haps+0.071
lik+0.070
arthy+0.068
EStream+0.067
Neg logit contributions
advertisement-0.079
INT-0.065
ENSE-0.065
ARB-0.062
gerald-0.061
Token of
Token ID286
Feature activation-5.08e-01

Pos logit contributions
advertisement+1.625
INT+1.360
ENSE+1.342
ARB+1.323
gerald+1.287
Neg logit contributions
skirts-1.521
haps-1.124
lik-1.100
arthy-1.071
EStream-1.065
Tokenс
Token ID21727
Feature activation+5.31e+00

Pos logit contributions
skirts+4.324
lik+3.144
haps+3.092
 vo+2.894
EStream+2.846
Neg logit contributions
advertisement-5.879
ENSE-4.996
INT-4.985
ARB-4.875
gerald-4.861
Tokenк
Token ID31583
Feature activation+9.14e+00

Pos logit contributions
skirts+4.587
lik+3.582
haps+3.514
arthy+3.340
 vo+3.295
Neg logit contributions
advertisement-4.403
INT-3.591
ENSE-3.580
gerald-3.542
ARB-3.493
Tokenа
Token ID16142
Feature activation+3.96e+00

Pos logit contributions
skirts+5.022
lik+3.360
haps+3.276
arthy+2.957
 vo+2.955
Neg logit contributions
advertisement-10.099
ENSE-8.848
INT-8.770
gerald-8.651
ARB-8.593
Tokenя
Token ID40623
Feature activation+2.11e+00

Pos logit contributions
skirts+3.059
lik+2.256
haps+2.231
arthy+2.095
 vo+2.080
Neg logit contributions
advertisement-3.700
ENSE-3.119
INT-3.118
ARB-3.038
gerald-3.021
Token
Token ID12466
Feature activation+6.10e+00

Pos logit contributions
skirts+1.727
haps+1.291
lik+1.286
arthy+1.213
EStream+1.199
Neg logit contributions
advertisement-1.880
INT-1.578
ENSE-1.575
ARB-1.529
gerald-1.513
Token
Token ID120
Feature activation+1.49e+01

Pos logit contributions
skirts+4.540
lik+3.315
haps+3.297
arthy+3.057
EStream+3.050
Neg logit contributions
advertisement-5.808
ENSE-4.927
INT-4.904
ARB-4.797
gerald-4.769
Tokenе
Token ID16843
Feature activation+4.83e+00

Pos logit contributions
skirts+9.218
lik+6.741
haps+6.507
 vo+6.150
arthy+5.986
Neg logit contributions
advertisement-14.102
INT-12.177
ENSE-12.167
ARB-12.041
gerald-11.975
Tokenд
Token ID43666
Feature activation+4.51e+00

Pos logit contributions
skirts+4.061
lik+3.090
haps+3.061
arthy+2.885
 vo+2.874
Neg logit contributions
advertisement-4.162
ENSE-3.463
INT-3.456
ARB-3.358
gerald-3.329
Tokenа
Token ID16142
Feature activation+1.30e+00

Pos logit contributions
skirts+3.364
lik+2.454
haps+2.432
arthy+2.268
 vo+2.248
Neg logit contributions
advertisement-4.330
ENSE-3.673
INT-3.672
ARB-3.579
gerald-3.557
Tokenл
Token ID30143
Feature activation-1.37e-01

Pos logit contributions
skirts+1.113
haps+0.843
lik+0.836
arthy+0.794
EStream+0.788
Neg logit contributions
advertisement-1.107
INT-0.927
ENSE-0.921
ARB-0.894
gerald-0.885
Tokenь
Token ID45367
Feature activation+1.90e+00

Pos logit contributions
advertisement+0.121
INT+0.101
ENSE+0.101
ARB+0.098
gerald+0.097
Neg logit contributions
skirts-0.115
haps-0.086
lik-0.085
arthy-0.081
EStream-0.080
Token of
Token ID286
Feature activation+3.31e+00

Pos logit contributions
skirts+1.684
lik+1.213
haps+1.212
EStream+1.139
arthy+1.132
Neg logit contributions
advertisement-2.093
INT-1.774
ENSE-1.771
ARB-1.731
gerald-1.707
Token widespread
Token ID10095
Feature activation+3.57e+00

Pos logit contributions
skirts+2.559
lik+1.880
haps+1.871
arthy+1.752
EStream+1.735
Neg logit contributions
advertisement-3.092
INT-2.613
ENSE-2.611
ARB-2.543
gerald-2.524
Token Syrian
Token ID6318
Feature activation+8.29e-01

Pos logit contributions
skirts+2.892
lik+2.160
haps+2.149
arthy+2.021
EStream+2.001
Neg logit contributions
advertisement-3.209
INT-2.692
ENSE-2.690
ARB-2.618
gerald-2.597
Token espionage
Token ID30113
Feature activation-2.55e-01

Pos logit contributions
skirts+0.781
lik+0.603
haps+0.603
EStream+0.574
arthy+0.570
Neg logit contributions
advertisement-0.647
INT-0.529
ENSE-0.525
ARB-0.504
gerald-0.497
Token and
Token ID290
Feature activation+1.31e+00

Pos logit contributions
advertisement+0.223
INT+0.187
ENSE+0.186
ARB+0.179
gerald+0.178
Neg logit contributions
skirts-0.216
haps-0.160
lik-0.159
EStream-0.156
arthy-0.152
Token sub
Token ID850
Feature activation+1.49e+01

Pos logit contributions
skirts+1.125
haps+0.847
lik+0.845
EStream+0.799
arthy+0.799
Neg logit contributions
advertisement-1.125
INT-0.936
ENSE-0.930
ARB-0.903
gerald-0.893
Tokenversion
Token ID9641
Feature activation+8.10e+00

Pos logit contributions
skirts+9.837
lik+7.409
haps+7.336
rights+6.570
ropri+6.534
Neg logit contributions
advertisement-13.742
ENSE-11.903
Db-11.677
INESS-11.598
ARB-11.530
Token came
Token ID1625
Feature activation+2.29e-01

Pos logit contributions
skirts+6.016
lik+4.576
haps+4.466
 vo+4.256
arthy+4.191
Neg logit contributions
advertisement-7.447
ENSE-6.332
INT-6.323
ARB-6.192
gerald-6.171
Token to
Token ID284
Feature activation-2.33e+00

Pos logit contributions
skirts+0.196
haps+0.146
lik+0.146
EStream+0.139
arthy+0.139
Neg logit contributions
advertisement-0.196
INT-0.165
ENSE-0.164
ARB-0.159
gerald-0.156
Token light
Token ID1657
Feature activation+2.42e+00

Pos logit contributions
advertisement+1.715
ENSE+1.429
INT+1.420
ARB+1.359
gerald+1.324
Neg logit contributions
skirts-2.253
haps-1.735
lik-1.729
EStream-1.692
arthy-1.652
Token as
Token ID355
Feature activation+8.83e-01

Pos logit contributions
skirts+2.441
lik+1.940
haps+1.935
arthy+1.848
EStream+1.838
Neg logit contributions
advertisement-1.698
INT-1.348
ENSE-1.346
ARB-1.295
gerald-1.280
Token members
Token ID1866
Feature activation+4.60e+00

Pos logit contributions
skirts+1.166
lik+0.858
haps+0.853
arthy+0.800
EStream+0.798
Neg logit contributions
advertisement-1.366
INT-1.154
ENSE-1.152
ARB-1.119
gerald-1.114
Token of
Token ID286
Feature activation+6.44e+00

Pos logit contributions
skirts+3.165
lik+2.268
haps+2.226
arthy+2.093
EStream+2.077
Neg logit contributions
advertisement-4.579
ENSE-3.958
INT-3.945
gerald-3.859
ARB-3.857
Token Ford
Token ID8092
Feature activation+4.74e+00

Pos logit contributions
skirts+5.307
lik+4.033
haps+4.008
arthy+3.784
 vo+3.780
Neg logit contributions
advertisement-5.584
ENSE-4.653
INT-4.639
ARB-4.523
gerald-4.499
Token Nation
Token ID8741
Feature activation+2.39e+00

Pos logit contributions
skirts+4.221
lik+3.278
haps+3.245
arthy+3.077
 vo+3.044
Neg logit contributions
advertisement-3.845
ENSE-3.162
INT-3.153
ARB-3.060
gerald-3.034
Token and
Token ID290
Feature activation+2.82e+00

Pos logit contributions
skirts+2.432
lik+1.939
haps+1.934
arthy+1.848
EStream+1.837
Neg logit contributions
advertisement-1.655
INT-1.308
ENSE-1.307
ARB-1.257
gerald-1.243
Token par
Token ID1582
Feature activation+1.47e+01

Pos logit contributions
skirts+2.555
lik+1.971
haps+1.963
arthy+1.863
EStream+1.854
Neg logit contributions
advertisement-2.262
INT-1.854
ENSE-1.853
ARB-1.794
gerald-1.778
Tokenrot
Token ID10599
Feature activation+2.03e+00

Pos logit contributions
skirts+9.403
haps+7.483
lik+7.075
 vo+6.570
ro+6.298
Neg logit contributions
advertisement-13.747
INESS-11.843
ENSE-11.767
INT-11.701
ARB-11.555
Token Ford
Token ID8092
Feature activation+5.34e+00

Pos logit contributions
skirts+1.799
lik+1.377
haps+1.370
arthy+1.300
EStream+1.295
Neg logit contributions
advertisement-1.682
INT-1.387
ENSE-1.383
ARB-1.341
gerald-1.331
Token
Token ID447
Feature activation+1.43e+00

Pos logit contributions
skirts+4.705
lik+3.640
haps+3.606
arthy+3.424
 vo+3.376
Neg logit contributions
advertisement-4.356
ENSE-3.605
INT-3.586
ARB-3.500
gerald-3.469
Token
Token ID247
Feature activation-2.52e+00

Pos logit contributions
skirts+1.005
lik+0.710
haps+0.705
arthy+0.658
EStream+0.649
Neg logit contributions
advertisement-1.444
INT-1.234
ENSE-1.228
ARB-1.203
gerald-1.193
Tokens
Token ID82
Feature activation-1.51e+00

Pos logit contributions
advertisement+1.446
INT+1.057
ENSE+1.043
gerald+0.989
ARB+0.968
Neg logit contributions
skirts-2.881
haps-2.349
lik-2.327
EStream-2.290
 vo-2.254
Tokend
Token ID67
Feature activation-2.98e+00

Pos logit contributions
skirts+1.487
haps+1.154
lik+1.150
arthy+1.100
 vo+1.090
Neg logit contributions
advertisement-1.305
INT-1.073
ENSE-1.062
ARB-1.032
gerald-1.018
Tokeniss
Token ID747
Feature activation-4.79e+00

Pos logit contributions
advertisement+2.567
ENSE+2.196
INT+2.167
ARB+2.091
MK+2.079
Neg logit contributions
skirts-2.506
lik-1.864
haps-1.846
EStream-1.813
cigarettes-1.766
Tokenident
Token ID738
Feature activation+7.07e-01

Pos logit contributions
advertisement+4.276
ENSE+3.631
INT+3.610
ARB+3.485
MK+3.457
Neg logit contributions
skirts-3.809
EStream-2.735
haps-2.712
cigarettes-2.702
lik-2.701
Token,
Token ID11
Feature activation+1.43e+00

Pos logit contributions
skirts+0.670
haps+0.522
lik+0.520
arthy+0.496
EStream+0.495
Neg logit contributions
advertisement-0.539
INT-0.440
ENSE-0.438
ARB-0.423
gerald-0.418
Token and
Token ID290
Feature activation+2.14e+00

Pos logit contributions
skirts+1.298
lik+0.999
haps+0.999
arthy+0.946
EStream+0.942
Neg logit contributions
advertisement-1.152
INT-0.950
ENSE-0.945
ARB-0.914
gerald-0.906
Token pro
Token ID386
Feature activation+1.45e+01

Pos logit contributions
skirts+1.878
lik+1.429
haps+1.428
arthy+1.350
EStream+1.343
Neg logit contributions
advertisement-1.781
INT-1.477
ENSE-1.473
ARB-1.426
gerald-1.416
Token-
Token ID12
Feature activation+3.05e+00

Pos logit contributions
skirts+9.429
lik+7.066
haps+6.762
rights+6.487
 vo+6.452
Neg logit contributions
advertisement-13.386
INESS-11.479
ENSE-11.362
ARB-11.182
allery-11.138
Tokenim
Token ID320
Feature activation+1.50e+00

Pos logit contributions
skirts+3.013
lik+2.387
haps+2.378
arthy+2.268
EStream+2.254
Neg logit contributions
advertisement-2.193
INT-1.752
ENSE-1.751
ARB-1.688
gerald-1.670
Tokenperial
Token ID7629
Feature activation+2.94e+00

Pos logit contributions
skirts+1.417
lik+1.108
haps+1.104
arthy+1.049
EStream+1.043
Neg logit contributions
advertisement-1.151
INT-0.934
ENSE-0.933
ARB-0.901
gerald-0.893
Tokenist
Token ID396
Feature activation+3.15e+00

Pos logit contributions
skirts+1.942
lik+1.334
haps+1.329
arthy+1.219
EStream+1.206
Neg logit contributions
advertisement-3.085
INT-2.664
ENSE-2.661
ARB-2.595
gerald-2.582
Token sc
Token ID629
Feature activation+3.63e+00

Pos logit contributions
skirts+2.917
lik+2.273
haps+2.262
arthy+2.149
 vo+2.134
Neg logit contributions
advertisement-2.464
INT-2.006
ENSE-2.004
ARB-1.942
gerald-1.923
Tokenι
Token ID29945
Feature activation+1.74e+00

Pos logit contributions
skirts+5.916
lik+4.488
haps+4.451
arthy+4.206
 vo+4.149
Neg logit contributions
advertisement-6.400
ENSE-5.344
INT-5.308
gerald-5.194
ARB-5.182
Token
Token ID157
Feature activation+6.48e+00

Pos logit contributions
skirts+1.488
haps+1.123
lik+1.122
arthy+1.062
EStream+1.056
Neg logit contributions
advertisement-1.487
INT-1.238
ENSE-1.232
ARB-1.198
gerald-1.187
Token
Token ID122
Feature activation+6.17e+00

Pos logit contributions
skirts+5.489
lik+4.210
haps+4.187
arthy+3.924
 vo+3.922
Neg logit contributions
advertisement-5.482
ENSE-4.558
INT-4.489
ARB-4.379
gerald-4.363
Token
Token ID121
Feature activation+3.76e+00

Pos logit contributions
skirts+3.425
lik+2.176
haps+2.173
arthy+1.940
 vo+1.934
Neg logit contributions
advertisement-7.013
ENSE-6.123
INT-6.093
ARB-6.000
gerald-5.975
Token
Token ID28053
Feature activation+7.73e+00

Pos logit contributions
skirts+2.975
lik+2.202
haps+2.194
arthy+2.057
EStream+2.042
Neg logit contributions
advertisement-3.445
INT-2.903
ENSE-2.900
ARB-2.821
gerald-2.798
Token
Token ID120
Feature activation+1.45e+01

Pos logit contributions
skirts+4.944
lik+3.474
haps+3.427
EStream+3.089
arthy+3.045
Neg logit contributions
advertisement-7.992
ENSE-6.897
INT-6.808
ARB-6.750
gerald-6.691
Token
Token ID223
Feature activation+6.90e+00

Pos logit contributions
skirts+9.989
lik+7.598
haps+7.540
 vo+7.107
arthy+6.910
Neg logit contributions
advertisement-12.593
INT-10.825
ENSE-10.708
ARB-10.593
allery-10.427
Tokenρ
Token ID33643
Feature activation+5.30e+00

Pos logit contributions
skirts+4.680
lik+3.311
haps+3.292
arthy+3.023
 vo+3.017
Neg logit contributions
advertisement-6.966
ENSE-5.933
INT-5.887
ARB-5.819
gerald-5.799
Tokenπ
Token ID46582
Feature activation+3.12e+00

Pos logit contributions
skirts+4.108
lik+3.038
haps+3.018
arthy+2.821
 vo+2.805
Neg logit contributions
advertisement-4.903
ENSE-4.120
INT-4.112
ARB-4.016
gerald-4.000
Tokenα
Token ID17394
Feature activation+3.71e+00

Pos logit contributions
skirts+2.549
lik+1.904
haps+1.897
arthy+1.785
EStream+1.772
Neg logit contributions
advertisement-2.784
INT-2.337
ENSE-2.334
ARB-2.267
gerald-2.246
Tokenγ
Token ID42063
Feature activation+5.64e+00

Pos logit contributions
skirts+2.923
lik+2.162
haps+2.152
arthy+2.017
EStream+1.998
Neg logit contributions
advertisement-3.412
INT-2.874
ENSE-2.872
ARB-2.797
gerald-2.776
Token
Token ID447
Feature activation+2.90e+00

Pos logit contributions
skirts+5.255
lik+4.069
haps+4.049
 vo+3.829
arthy+3.806
Neg logit contributions
advertisement-4.865
INT-4.023
ENSE-4.016
ARB-3.890
gerald-3.872
Token
Token ID251
Feature activation+6.09e+00

Pos logit contributions
skirts+2.693
lik+2.098
haps+2.091
arthy+1.986
EStream+1.971
Neg logit contributions
advertisement-2.257
INT-1.838
ENSE-1.837
ARB-1.776
gerald-1.760
Token she
Token ID673
Feature activation+3.24e+00

Pos logit contributions
skirts+5.415
lik+4.200
haps+4.164
arthy+3.935
 vo+3.919
Neg logit contributions
advertisement-4.914
INT-4.041
ENSE-4.032
ARB-3.915
gerald-3.889
Token added
Token ID2087
Feature activation+6.02e+00

Pos logit contributions
skirts+2.732
lik+2.056
haps+2.048
arthy+1.935
EStream+1.929
Neg logit contributions
advertisement-2.817
INT-2.346
ENSE-2.342
ARB-2.277
gerald-2.257
Token,
Token ID11
Feature activation+8.76e+00

Pos logit contributions
skirts+5.844
lik+4.647
haps+4.604
 vo+4.428
arthy+4.411
Neg logit contributions
advertisement-4.307
INT-3.432
ENSE-3.419
ARB-3.324
gerald-3.304
Token
Token ID564
Feature activation+1.44e+01

Pos logit contributions
skirts+7.177
lik+5.455
haps+5.425
 vo+5.099
arthy+5.065
Neg logit contributions
advertisement-7.437
INT-6.256
ENSE-6.220
gerald-6.049
ARB-6.035
Token
Token ID250
Feature activation+1.08e+01

Pos logit contributions
skirts+11.136
haps+8.608
lik+8.593
 vo+8.238
rights+7.800
Neg logit contributions
advertisement-11.083
ENSE-9.591
allery-9.580
INT-9.345
gerald-9.194
Tokenhe
Token ID258
Feature activation+4.26e+00

Pos logit contributions
skirts+9.997
lik+7.996
haps+7.793
cigarettes+7.440
rights+7.343
Neg logit contributions
advertisement-7.753
ENSE-6.363
INT-6.240
ARB-6.144
allery-6.102
Token will
Token ID481
Feature activation+5.46e+00

Pos logit contributions
skirts+3.185
lik+2.304
haps+2.296
arthy+2.142
EStream+2.127
Neg logit contributions
advertisement-4.103
INT-3.487
ENSE-3.482
ARB-3.395
gerald-3.371
Token also
Token ID635
Feature activation+2.13e+00

Pos logit contributions
skirts+4.663
lik+3.580
haps+3.564
arthy+3.362
 vo+3.355
Neg logit contributions
advertisement-4.593
ENSE-3.808
INT-3.798
ARB-3.703
gerald-3.673
Token be
Token ID307
Feature activation+2.95e+00

Pos logit contributions
skirts+1.931
lik+1.479
haps+1.477
EStream+1.410
arthy+1.399
Neg logit contributions
advertisement-1.727
INT-1.420
ENSE-1.412
ARB-1.365
gerald-1.352
Tokenonen
Token ID34481
Feature activation+4.84e+00

Pos logit contributions
skirts+6.598
lik+4.955
haps+4.800
arthy+4.497
cigarettes+4.229
Neg logit contributions
advertisement-9.730
ENSE-8.298
INT-8.193
ARB-8.141
Db-8.040
Token (
Token ID357
Feature activation+2.21e-01

Pos logit contributions
skirts+4.757
lik+3.804
haps+3.736
arthy+3.585
 vo+3.562
Neg logit contributions
advertisement-3.521
INT-2.767
ENSE-2.738
ARB-2.682
INESS-2.635
Tokenwho
Token ID8727
Feature activation-1.76e+00

Pos logit contributions
skirts+0.189
haps+0.144
lik+0.142
arthy+0.137
EStream+0.137
Neg logit contributions
advertisement-0.187
INT-0.157
ENSE-0.154
ARB-0.151
gerald-0.150
Token may
Token ID743
Feature activation+5.19e+00

Pos logit contributions
advertisement+1.564
INT+1.350
ENSE+1.323
gerald+1.319
ARB+1.310
Neg logit contributions
skirts-1.400
haps-1.008
lik-1.004
EStream-0.999
arthy-0.978
Token or
Token ID393
Feature activation+7.42e+00

Pos logit contributions
skirts+4.161
haps+3.184
lik+3.142
 vo+2.986
arthy+2.923
Neg logit contributions
advertisement-4.643
ENSE-3.887
INT-3.871
ARB-3.784
INESS-3.751
Token may
Token ID743
Feature activation+1.40e+01

Pos logit contributions
skirts+3.612
lik+2.269
haps+2.220
 vo+1.934
arthy+1.878
Neg logit contributions
advertisement-8.969
ENSE-7.827
INT-7.774
ARB-7.659
gerald-7.653
Token not
Token ID407
Feature activation+6.45e+00

Pos logit contributions
skirts+6.550
haps+4.848
lik+4.823
 vo+4.398
nels+4.073
Neg logit contributions
advertisement-15.482
ENSE-13.366
INT-13.234
ARB-13.069
allery-13.066
Token take
Token ID1011
Feature activation-1.79e+00

Pos logit contributions
skirts+4.664
haps+3.497
lik+3.490
 vo+3.354
arthy+3.170
Neg logit contributions
advertisement-6.155
ENSE-5.237
INT-5.204
INESS-5.120
ARB-5.108
Token an
Token ID281
Feature activation-1.06e+00

Pos logit contributions
advertisement+1.415
INT+1.185
ENSE+1.172
ARB+1.140
gerald+1.123
Neg logit contributions
skirts-1.631
haps-1.255
lik-1.239
arthy-1.193
EStream-1.186
Token actual
Token ID4036
Feature activation+1.10e+00

Pos logit contributions
advertisement+0.807
INT+0.658
ENSE+0.657
ARB+0.635
gerald+0.624
Neg logit contributions
skirts-1.004
haps-0.776
lik-0.773
arthy-0.745
EStream-0.745
Token roster
Token ID9354
Feature activation+2.38e-01

Pos logit contributions
skirts+0.996
haps+0.766
lik+0.766
arthy+0.727
EStream+0.720
Neg logit contributions
advertisement-0.886
INT-0.729
ENSE-0.727
ARB-0.705
gerald-0.698
Token sentence
Token ID6827
Feature activation+1.78e+00

Pos logit contributions
advertisement+0.440
INT+0.369
ENSE+0.363
ARB+0.350
INESS+0.345
Neg logit contributions
skirts-0.449
haps-0.341
lik-0.337
EStream-0.322
arthy-0.320
Token against
Token ID1028
Feature activation+1.13e+00

Pos logit contributions
skirts+1.580
lik+1.209
haps+1.205
arthy+1.145
EStream+1.137
Neg logit contributions
advertisement-1.470
INT-1.219
ENSE-1.211
ARB-1.174
Db-1.158
Token Naval
Token ID20818
Feature activation+2.71e-01

Pos logit contributions
skirts+1.060
lik+0.827
haps+0.822
arthy+0.786
EStream+0.780
Neg logit contributions
advertisement-0.869
INT-0.711
ENSE-0.706
ARB-0.682
gerald-0.674
Tokenny
Token ID3281
Feature activation+3.26e+00

Pos logit contributions
skirts+0.258
haps+0.198
lik+0.198
EStream+0.192
arthy+0.189
Neg logit contributions
advertisement-0.205
INT-0.166
ENSE-0.166
gerald-0.159
ARB-0.158
Token and
Token ID290
Feature activation+1.97e+00

Pos logit contributions
skirts+3.222
lik+2.553
haps+2.544
arthy+2.426
EStream+2.409
Neg logit contributions
advertisement-2.351
INT-1.879
ENSE-1.877
ARB-1.809
gerald-1.791
Token Py
Token ID9485
Feature activation+1.39e+01

Pos logit contributions
skirts+1.666
lik+1.245
haps+1.242
arthy+1.180
EStream+1.176
Neg logit contributions
advertisement-1.705
INT-1.434
ENSE-1.428
ARB-1.388
gerald-1.369
Tokenot
Token ID313
Feature activation+1.03e+01

Pos logit contributions
skirts+10.545
lik+8.621
haps+8.317
hemer+7.695
arthy+7.618
Neg logit contributions
advertisement-11.339
ENSE-9.807
INT-9.470
INESS-9.360
ARB-9.217
Tokenr
Token ID81
Feature activation+3.97e+00

Pos logit contributions
skirts+6.395
lik+4.770
haps+4.458
 vo+4.389
ropri+4.105
Neg logit contributions
advertisement-10.685
ENSE-9.163
INT-9.010
INESS-8.835
Db-8.811
Token Of
Token ID3226
Feature activation+7.11e+00

Pos logit contributions
skirts+3.695
lik+2.891
haps+2.880
arthy+2.731
 vo+2.711
Neg logit contributions
advertisement-3.086
INT-2.505
ENSE-2.504
ARB-2.420
gerald-2.400
Tokenits
Token ID896
Feature activation+3.80e+00

Pos logit contributions
skirts+6.160
lik+4.790
haps+4.694
arthy+4.459
 vo+4.384
Neg logit contributions
advertisement-5.921
ENSE-4.903
INT-4.893
ARB-4.726
INESS-4.686
Tokener
Token ID263
Feature activation+6.74e+00

Pos logit contributions
skirts+3.455
lik+2.687
haps+2.671
arthy+2.538
 vo+2.511
Neg logit contributions
advertisement-3.024
INT-2.475
ENSE-2.472
ARB-2.393
gerald-2.374
Token said
Token ID531
Feature activation+3.43e+00

Pos logit contributions
skirts+2.351
lik+1.805
haps+1.797
arthy+1.701
 vo+1.686
Neg logit contributions
advertisement-2.216
INT-1.829
ENSE-1.828
ARB-1.771
gerald-1.756
Token,
Token ID11
Feature activation+3.38e+00

Pos logit contributions
skirts+3.286
lik+2.603
haps+2.594
EStream+2.484
arthy+2.477
Neg logit contributions
advertisement-2.503
INT-2.018
ENSE-2.007
gerald-1.942
ARB-1.932
Token adding
Token ID4375
Feature activation+4.40e+00

Pos logit contributions
skirts+3.391
lik+2.705
haps+2.696
arthy+2.571
 vo+2.560
Neg logit contributions
advertisement-2.368
INT-1.884
ENSE-1.883
ARB-1.808
gerald-1.796
Token salaries
Token ID17058
Feature activation+7.04e+00

Pos logit contributions
skirts+3.722
lik+2.821
haps+2.820
 vo+2.652
arthy+2.651
Neg logit contributions
advertisement-3.763
INT-3.131
ENSE-3.126
ARB-3.028
gerald-3.018
Token and
Token ID290
Feature activation+8.41e+00

Pos logit contributions
skirts+5.696
haps+4.262
lik+4.261
 vo+4.072
arthy+3.974
Neg logit contributions
advertisement-6.099
ENSE-5.180
INT-5.159
ARB-5.025
gerald-5.000
Token overtime
Token ID17820
Feature activation+1.38e+01

Pos logit contributions
skirts+6.776
lik+5.255
haps+5.078
 vo+5.048
arthy+4.794
Neg logit contributions
advertisement-6.999
ENSE-5.905
INT-5.892
Db-5.757
ARB-5.749
Token often
Token ID1690
Feature activation+6.50e+00

Pos logit contributions
skirts+10.190
lik+7.500
 vo+7.292
haps+7.197
rights+6.927
Neg logit contributions
advertisement-11.743
ENSE-10.086
INT-9.992
ARB-9.978
allery-9.924
Token match
Token ID2872
Feature activation+4.28e+00

Pos logit contributions
skirts+5.671
haps+4.379
lik+4.354
 vo+4.204
arthy+4.087
Neg logit contributions
advertisement-5.252
INT-4.394
ENSE-4.372
ARB-4.266
gerald-4.209
Token what
Token ID644
Feature activation+1.93e+00

Pos logit contributions
skirts+3.776
lik+2.917
haps+2.903
 vo+2.738
arthy+2.735
Neg logit contributions
advertisement-3.465
ENSE-2.878
INT-2.874
ARB-2.785
gerald-2.758
Token the
Token ID262
Feature activation+3.94e+00

Pos logit contributions
skirts+1.613
lik+1.212
haps+1.203
EStream+1.142
arthy+1.140
Neg logit contributions
advertisement-1.700
INT-1.417
ENSE-1.416
ARB-1.380
gerald-1.363
Token market
Token ID1910
Feature activation+9.66e+00

Pos logit contributions
skirts+3.119
lik+2.317
haps+2.305
arthy+2.164
 vo+2.148
Neg logit contributions
advertisement-3.602
ENSE-3.035
INT-3.032
ARB-2.948
gerald-2.928
TokenIndia
Token ID21569
Feature activation-5.19e+00

Pos logit contributions
advertisement+3.500
INT+2.911
ENSE+2.775
Bench+2.735
Db+2.724
Neg logit contributions
skirts-4.356
EStream-3.506
haps-3.480
 vo-3.343
lik-3.343
Token level
Token ID1241
Feature activation-7.78e-02

Pos logit contributions
advertisement+3.888
INT+3.307
ENSE+3.232
ARB+3.127
Bench+3.093
Neg logit contributions
skirts-4.848
haps-3.654
EStream-3.595
lik-3.562
arthy-3.472
Token,
Token ID11
Feature activation+3.60e-02

Pos logit contributions
advertisement+0.052
INT+0.041
ENSE+0.041
ARB+0.039
gerald+0.038
Neg logit contributions
skirts-0.081
lik-0.065
haps-0.065
arthy-0.062
EStream-0.062
Token rural
Token ID10016
Feature activation+5.43e+00

Pos logit contributions
skirts+0.032
haps+0.024
lik+0.024
EStream+0.023
arthy+0.023
Neg logit contributions
advertisement-0.030
INT-0.025
ENSE-0.025
ARB-0.024
Db-0.023
Token and
Token ID290
Feature activation+3.82e+00

Pos logit contributions
skirts+4.393
lik+3.307
haps+3.285
 vo+3.096
arthy+3.084
Neg logit contributions
advertisement-4.823
ENSE-4.035
INT-3.986
gerald-3.915
ARB-3.914
Token urban
Token ID7876
Feature activation+1.38e+01

Pos logit contributions
skirts+3.384
lik+2.604
haps+2.591
arthy+2.456
 vo+2.450
Neg logit contributions
advertisement-3.124
ENSE-2.577
INT-2.572
ARB-2.488
gerald-2.479
Token combined
Token ID5929
Feature activation+5.75e+00

Pos logit contributions
skirts+9.526
lik+7.135
haps+6.886
 vo+6.650
arthy+6.433
Neg logit contributions
advertisement-12.513
ENSE-10.579
gerald-10.494
INT-10.214
ARB-10.205
Token,
Token ID11
Feature activation+3.93e+00

Pos logit contributions
skirts+5.248
lik+4.102
haps+4.052
 vo+3.862
arthy+3.850
Neg logit contributions
advertisement-4.509
ENSE-3.663
INT-3.654
ARB-3.553
gerald-3.551
Token the
Token ID262
Feature activation+2.72e+00

Pos logit contributions
skirts+3.465
lik+2.664
haps+2.651
arthy+2.510
 vo+2.491
Neg logit contributions
advertisement-3.230
INT-2.664
ENSE-2.663
ARB-2.580
gerald-2.565
Token median
Token ID14288
Feature activation+6.90e+00

Pos logit contributions
skirts+2.404
haps+1.838
lik+1.836
arthy+1.738
EStream+1.735
Neg logit contributions
advertisement-2.245
INT-1.855
ENSE-1.850
ARB-1.793
gerald-1.773
Token household
Token ID6641
Feature activation+3.05e+00

Pos logit contributions
skirts+4.999
lik+3.689
haps+3.570
 vo+3.410
arthy+3.364
Neg logit contributions
advertisement-6.572
ENSE-5.575
INT-5.564
gerald-5.463
ARB-5.445
Tokenм
Token ID43108
Feature activation+1.10e+01

Pos logit contributions
skirts+6.502
lik+4.871
haps+4.750
 vo+4.536
arthy+4.463
Neg logit contributions
advertisement-8.406
ENSE-7.069
INT-7.024
ARB-6.923
gerald-6.921
Tokenе
Token ID16843
Feature activation+5.42e+00

Pos logit contributions
skirts+7.730
lik+5.665
haps+5.528
 vo+5.303
arthy+5.212
Neg logit contributions
advertisement-10.285
INT-8.749
ENSE-8.695
ARB-8.609
gerald-8.592
Tokenл
Token ID30143
Feature activation+1.54e+00

Pos logit contributions
skirts+4.453
lik+3.368
haps+3.327
arthy+3.136
 vo+3.120
Neg logit contributions
advertisement-4.784
ENSE-3.994
INT-3.985
ARB-3.885
gerald-3.846
Tokenа
Token ID16142
Feature activation+2.03e+00

Pos logit contributions
skirts+1.180
haps+0.861
lik+0.855
arthy+0.801
EStream+0.798
Neg logit contributions
advertisement-1.448
INT-1.231
ENSE-1.230
ARB-1.197
gerald-1.181
Token
Token ID12466
Feature activation+7.20e+00

Pos logit contributions
skirts+1.673
haps+1.253
lik+1.244
arthy+1.172
EStream+1.162
Neg logit contributions
advertisement-1.809
INT-1.524
ENSE-1.521
ARB-1.475
gerald-1.458
Token
Token ID120
Feature activation+1.38e+01

Pos logit contributions
skirts+5.442
lik+4.007
haps+3.993
arthy+3.700
EStream+3.693
Neg logit contributions
advertisement-6.714
ENSE-5.677
INT-5.633
ARB-5.531
gerald-5.487
Tokenе
Token ID16843
Feature activation+4.04e+00

Pos logit contributions
skirts+8.579
lik+6.165
haps+5.948
 vo+5.613
arthy+5.591
Neg logit contributions
advertisement-13.240
ENSE-11.441
INT-11.419
ARB-11.265
gerald-11.244
Tokenс
Token ID21727
Feature activation+6.08e+00

Pos logit contributions
skirts+3.401
lik+2.570
haps+2.559
arthy+2.413
EStream+2.393
Neg logit contributions
advertisement-3.510
INT-2.925
ENSE-2.923
ARB-2.838
gerald-2.815
Tokenт
Token ID20375
Feature activation+5.11e+00

Pos logit contributions
skirts+4.917
lik+3.734
haps+3.667
arthy+3.454
 vo+3.441
Neg logit contributions
advertisement-5.366
ENSE-4.462
INT-4.450
ARB-4.356
gerald-4.339
Tokenо
Token ID15166
Feature activation+1.84e+00

Pos logit contributions
skirts+3.669
lik+2.638
haps+2.613
arthy+2.429
 vo+2.408
Neg logit contributions
advertisement-5.028
ENSE-4.291
INT-4.289
ARB-4.179
gerald-4.156
Token
Token ID12466
Feature activation+7.04e+00

Pos logit contributions
skirts+1.553
haps+1.168
lik+1.161
arthy+1.099
EStream+1.091
Neg logit contributions
advertisement-1.591
INT-1.342
ENSE-1.338
ARB-1.294
gerald-1.282
Token stories
Token ID3923
Feature activation-4.80e+00

Pos logit contributions
advertisement+0.008
INT+0.007
ENSE+0.006
ARB+0.006
Db+0.006
Neg logit contributions
skirts-0.006
lik-0.004
haps-0.004
EStream-0.004
arthy-0.004
Token,
Token ID11
Feature activation+2.22e-02

Pos logit contributions
advertisement+2.758
INT+2.023
ENSE+1.975
ARB+1.934
Db+1.874
Neg logit contributions
skirts-5.461
haps-4.498
lik-4.397
EStream-4.369
arthy-4.282
Token prose
Token ID27149
Feature activation-1.39e-01

Pos logit contributions
skirts+0.015
lik+0.011
haps+0.011
EStream+0.010
arthy+0.010
Neg logit contributions
advertisement-0.023
ENSE-0.019
INT-0.019
ARB-0.019
gerald-0.019
Token,
Token ID11
Feature activation+1.30e+00

Pos logit contributions
advertisement+0.091
INT+0.071
ENSE+0.071
ARB+0.068
gerald+0.067
Neg logit contributions
skirts-0.147
lik-0.118
haps-0.118
arthy-0.113
EStream-0.112
Token and
Token ID290
Feature activation+1.93e+00

Pos logit contributions
skirts+1.439
lik+1.175
haps+1.170
arthy+1.123
 vo+1.118
Neg logit contributions
advertisement-0.775
ENSE-0.588
INT-0.587
ARB-0.560
gerald-0.554
Token non
Token ID1729
Feature activation+1.37e+01

Pos logit contributions
skirts+1.544
lik+1.152
haps+1.146
arthy+1.074
 vo+1.068
Neg logit contributions
advertisement-1.744
INT-1.465
ENSE-1.464
ARB-1.425
gerald-1.415
Token-
Token ID12
Feature activation+3.06e+00

Pos logit contributions
skirts+8.020
lik+6.164
haps+5.406
rights+5.248
nels+5.221
Neg logit contributions
advertisement-12.803
allery-11.847
ENSE-11.519
ARB-11.382
INESS-11.297
Tokenfiction
Token ID24046
Feature activation+1.75e+00

Pos logit contributions
skirts+2.241
lik+1.624
haps+1.606
arthy+1.479
cigarettes+1.472
Neg logit contributions
advertisement-2.965
ENSE-2.535
INT-2.531
ARB-2.487
gerald-2.454
Token.
Token ID13
Feature activation+4.40e-01

Pos logit contributions
skirts+1.757
lik+1.404
haps+1.389
arthy+1.332
 vo+1.323
Neg logit contributions
advertisement-1.217
ENSE-0.971
INT-0.968
ARB-0.935
gerald-0.927
Token Cos
Token ID10437
Feature activation+2.19e+00

Pos logit contributions
skirts+0.378
lik+0.287
haps+0.285
EStream+0.271
arthy+0.268
Neg logit contributions
advertisement-0.375
INT-0.312
ENSE-0.311
ARB-0.300
Db-0.298
Tokenplayer
Token ID7829
Feature activation-3.41e+00

Pos logit contributions
skirts+2.256
lik+1.811
haps+1.804
arthy+1.722
 vo+1.709
Neg logit contributions
advertisement-1.480
INT-1.173
ENSE-1.171
ARB-1.135
gerald-1.110
Tokenly
Token ID306
Feature activation-4.43e+00

Pos logit contributions
skirts+0.816
lik+0.624
haps+0.620
arthy+0.591
EStream+0.591
Neg logit contributions
advertisement-0.756
INT-0.623
ENSE-0.622
ARB-0.601
gerald-0.594
Token anti
Token ID3098
Feature activation+5.57e+00

Pos logit contributions
advertisement+3.376
INT+2.707
ENSE+2.695
ARB+2.549
Db+2.522
Neg logit contributions
skirts-4.181
haps-3.166
EStream-3.166
lik-3.129
izons-3.073
Token-
Token ID12
Feature activation-1.69e+00

Pos logit contributions
skirts+4.346
haps+3.276
lik+3.246
arthy+3.055
 vo+3.000
Neg logit contributions
advertisement-5.006
ENSE-4.240
INT-4.188
ARB-4.152
INESS-4.139
Tokendemocratic
Token ID41232
Feature activation-7.12e-01

Pos logit contributions
advertisement+1.558
INT+1.315
ENSE+1.280
ARB+1.254
gerald+1.250
Neg logit contributions
skirts-1.329
haps-0.988
lik-0.980
EStream-0.932
 vo-0.925
Token or
Token ID393
Feature activation+8.74e-01

Pos logit contributions
advertisement+0.459
INT+0.355
ENSE+0.353
ARB+0.335
gerald+0.332
Neg logit contributions
skirts-0.759
haps-0.610
lik-0.608
arthy-0.585
EStream-0.584
Token pro
Token ID386
Feature activation+1.37e+01

Pos logit contributions
skirts+0.799
lik+0.612
haps+0.611
arthy+0.580
EStream+0.579
Neg logit contributions
advertisement-0.702
ENSE-0.574
INT-0.573
ARB-0.553
gerald-0.549
Token-
Token ID12
Feature activation+2.78e+00

Pos logit contributions
skirts+8.839
haps+6.648
lik+6.641
 vo+6.182
rights+6.142
Neg logit contributions
advertisement-12.640
INESS-10.871
ENSE-10.656
ARB-10.586
gerald-10.504
Tokenauthor
Token ID9800
Feature activation-7.57e-01

Pos logit contributions
skirts+2.313
lik+1.743
haps+1.734
arthy+1.634
EStream+1.619
Neg logit contributions
advertisement-2.435
ENSE-2.034
INT-2.031
ARB-1.974
gerald-1.958
Tokenitarian
Token ID8353
Feature activation+2.83e+00

Pos logit contributions
advertisement+0.665
INT+0.566
ENSE+0.562
ARB+0.543
Db+0.536
Neg logit contributions
skirts-0.617
lik-0.458
haps-0.456
EStream-0.439
arthy-0.436
Token.
Token ID13
Feature activation-5.28e-01

Pos logit contributions
skirts+2.898
lik+2.316
haps+2.309
arthy+2.207
EStream+2.193
Neg logit contributions
advertisement-1.938
INT-1.529
ENSE-1.527
ARB-1.468
gerald-1.452
Token In
Token ID554
Feature activation-1.93e+00

Pos logit contributions
advertisement+0.462
INT+0.384
ENSE+0.381
ARB+0.369
Db+0.368
Neg logit contributions
skirts-0.443
haps-0.341
lik-0.335
EStream-0.321
cigarettes-0.313
Token offering
Token ID6011
Feature activation+5.45e+00

Pos logit contributions
skirts+2.853
lik+2.245
haps+2.237
arthy+2.130
EStream+2.115
Neg logit contributions
advertisement-2.209
INT-1.781
ENSE-1.779
ARB-1.717
gerald-1.700
Token to
Token ID284
Feature activation+3.23e+00

Pos logit contributions
skirts+4.888
lik+3.805
haps+3.771
 vo+3.594
arthy+3.554
Neg logit contributions
advertisement-4.341
ENSE-3.549
INT-3.544
ARB-3.447
Db-3.407
Token sell
Token ID3677
Feature activation+5.77e+00

Pos logit contributions
skirts+3.027
lik+2.372
haps+2.367
arthy+2.243
 vo+2.240
Neg logit contributions
advertisement-2.479
ENSE-2.009
INT-2.005
ARB-1.942
gerald-1.917
Token it
Token ID340
Feature activation-9.25e-01

Pos logit contributions
skirts+4.954
lik+3.833
haps+3.793
 vo+3.648
rights+3.555
Neg logit contributions
advertisement-4.753
ENSE-3.914
INT-3.852
ARB-3.777
Db-3.776
Token or
Token ID393
Feature activation+5.34e+00

Pos logit contributions
advertisement+0.700
INT+0.576
ENSE+0.565
gerald+0.549
ARB+0.548
Neg logit contributions
skirts-0.884
lik-0.685
haps-0.683
arthy-0.661
EStream-0.651
Token possibly
Token ID5457
Feature activation+1.37e+01

Pos logit contributions
skirts+4.775
lik+3.764
haps+3.744
 vo+3.550
arthy+3.496
Neg logit contributions
advertisement-4.248
ENSE-3.468
INT-3.453
ARB-3.347
Db-3.319
Token trade
Token ID3292
Feature activation+7.09e+00

Pos logit contributions
skirts+10.654
lik+8.659
haps+8.637
 vo+8.484
nels+8.098
Neg logit contributions
advertisement-10.615
ENSE-8.745
INT-8.603
Db-8.580
INESS-8.547
Token it
Token ID340
Feature activation+2.28e+00

Pos logit contributions
skirts+5.719
lik+4.433
haps+4.341
 vo+4.162
rights+4.069
Neg logit contributions
advertisement-6.060
ENSE-5.066
INT-4.982
Db-4.876
ARB-4.875
Token in
Token ID287
Feature activation+7.92e+00

Pos logit contributions
skirts+2.053
lik+1.579
haps+1.574
arthy+1.494
EStream+1.481
Neg logit contributions
advertisement-1.855
INT-1.527
ENSE-1.522
ARB-1.474
gerald-1.463
Token for
Token ID329
Feature activation+5.19e+00

Pos logit contributions
skirts+6.318
lik+4.940
haps+4.882
 vo+4.705
EStream+4.622
Neg logit contributions
advertisement-6.767
ENSE-5.692
INT-5.587
ARB-5.466
INESS-5.463
Token another
Token ID1194
Feature activation+4.85e+00

Pos logit contributions
skirts+4.320
lik+3.295
haps+3.252
 vo+3.086
arthy+3.056
Neg logit contributions
advertisement-4.486
ENSE-3.738
INT-3.714
ARB-3.623
Db-3.585
Token
Token ID198
Feature activation+6.39e-01

Pos logit contributions
skirts+3.101
lik+2.079
haps+2.044
arthy+1.888
 vo+1.853
Neg logit contributions
advertisement-5.572
ENSE-4.822
INT-4.819
gerald-4.703
ARB-4.695
Token
Token ID198
Feature activation-1.84e+00

Pos logit contributions
skirts+0.456
haps+0.322
lik+0.318
EStream+0.303
arthy+0.297
Neg logit contributions
advertisement-0.644
INT-0.551
ENSE-0.548
ARB-0.535
Db-0.529
TokenDo
Token ID5211
Feature activation-2.70e-01

Pos logit contributions
advertisement+1.219
INT+0.963
ENSE+0.938
ARB+0.905
Db+0.889
Neg logit contributions
skirts-1.948
haps-1.562
lik-1.550
EStream-1.519
arthy-1.511
Token use
Token ID779
Feature activation+1.00e+00

Pos logit contributions
advertisement+0.254
INT+0.216
ENSE+0.216
ARB+0.209
gerald+0.208
Neg logit contributions
skirts-0.206
lik-0.151
haps-0.150
arthy-0.141
EStream-0.140
Token or
Token ID393
Feature activation+6.39e+00

Pos logit contributions
skirts+0.819
haps+0.603
lik+0.599
EStream+0.576
arthy+0.574
Neg logit contributions
advertisement-0.906
INT-0.761
ENSE-0.756
ARB-0.732
gerald-0.726
Token do
Token ID466
Feature activation+1.37e+01

Pos logit contributions
skirts+5.715
lik+4.516
haps+4.488
 vo+4.176
arthy+4.165
Neg logit contributions
advertisement-5.107
ENSE-4.180
INT-4.175
ARB-4.071
gerald-4.009
Token not
Token ID407
Feature activation+7.61e+00

Pos logit contributions
skirts+8.888
haps+6.812
lik+6.682
 vo+6.358
 grav+5.822
Neg logit contributions
advertisement-12.960
ENSE-10.935
ARB-10.882
INT-10.785
gerald-10.672
Token use
Token ID779
Feature activation+5.57e+00

Pos logit contributions
skirts+5.183
lik+3.891
haps+3.863
 vo+3.676
 grav+3.452
Neg logit contributions
advertisement-7.378
ENSE-6.213
INT-6.213
ARB-6.182
gerald-6.139
Token social
Token ID1919
Feature activation+6.00e+00

Pos logit contributions
skirts+4.243
lik+3.150
haps+3.109
 vo+2.934
arthy+2.890
Neg logit contributions
advertisement-5.175
ENSE-4.387
INT-4.372
ARB-4.274
gerald-4.246
Token media
Token ID2056
Feature activation+5.33e+00

Pos logit contributions
skirts+3.970
lik+2.807
haps+2.734
 vo+2.591
cigarettes+2.505
Neg logit contributions
advertisement-6.172
ENSE-5.297
INT-5.264
gerald-5.206
ARB-5.200
Token...
Token ID2644
Feature activation+6.75e+00

Pos logit contributions
skirts+4.376
lik+3.299
haps+3.258
 vo+3.074
arthy+3.056
Neg logit contributions
advertisement-4.671
ENSE-3.903
INT-3.880
ARB-3.803
gerald-3.776
Token is
Token ID318
Feature activation+1.83e+00

Pos logit contributions
skirts+8.473
lik+6.637
 vo+6.467
haps+6.452
nels+6.023
Neg logit contributions
advertisement-9.456
ENSE-7.835
INT-7.775
gerald-7.679
ARB-7.631
Token marginal
Token ID14461
Feature activation+6.14e+00

Pos logit contributions
skirts+1.636
haps+1.240
lik+1.240
EStream+1.188
arthy+1.178
Neg logit contributions
advertisement-1.519
INT-1.257
ENSE-1.251
ARB-1.216
gerald-1.197
Token,
Token ID11
Feature activation+5.54e-01

Pos logit contributions
skirts+6.103
lik+4.943
haps+4.887
 vo+4.675
arthy+4.622
Neg logit contributions
advertisement-4.233
ENSE-3.366
INT-3.348
ARB-3.231
gerald-3.208
Token and
Token ID290
Feature activation+1.56e+00

Pos logit contributions
skirts+0.602
lik+0.486
haps+0.484
arthy+0.467
EStream+0.466
Neg logit contributions
advertisement-0.349
INT-0.268
ENSE-0.267
ARB-0.258
gerald-0.253
Token most
Token ID749
Feature activation+6.91e+00

Pos logit contributions
skirts+1.379
lik+1.052
haps+1.045
arthy+0.998
EStream+0.995
Neg logit contributions
advertisement-1.301
ENSE-1.076
INT-1.075
ARB-1.047
gerald-1.031
Token likely
Token ID1884
Feature activation+1.36e+01

Pos logit contributions
skirts+5.111
haps+3.777
lik+3.775
 vo+3.556
arthy+3.483
Neg logit contributions
advertisement-6.483
INT-5.498
ENSE-5.462
gerald-5.319
ARB-5.311
Token only
Token ID691
Feature activation+5.36e+00

Pos logit contributions
skirts+10.260
 vo+8.077
lik+7.981
haps+7.838
 grav+7.474
Neg logit contributions
advertisement-11.167
ENSE-9.385
INT-9.270
gerald-9.086
INESS-8.977
Token as
Token ID355
Feature activation+2.84e-01

Pos logit contributions
skirts+4.625
lik+3.555
haps+3.528
 vo+3.341
arthy+3.323
Neg logit contributions
advertisement-4.479
ENSE-3.710
INT-3.690
ARB-3.592
gerald-3.575
Token scrap
Token ID15881
Feature activation+4.35e+00

Pos logit contributions
skirts+0.251
lik+0.189
haps+0.189
EStream+0.180
arthy+0.180
Neg logit contributions
advertisement-0.237
INT-0.196
ENSE-0.196
ARB-0.190
gerald-0.188
Token.
Token ID13
Feature activation+1.47e+00

Pos logit contributions
skirts+4.116
lik+3.259
haps+3.229
arthy+3.077
 vo+3.064
Neg logit contributions
advertisement-3.273
ENSE-2.657
INT-2.638
ARB-2.555
gerald-2.534
Token
Token ID198
Feature activation+1.37e+00

Pos logit contributions
skirts+1.209
haps+0.908
lik+0.907
EStream+0.851
arthy+0.845
Neg logit contributions
advertisement-1.315
INT-1.103
ENSE-1.098
ARB-1.069
Db-1.051
Token on
Token ID319
Feature activation+1.88e+00

Pos logit contributions
skirts+0.509
haps+0.399
lik+0.398
arthy+0.381
EStream+0.377
Neg logit contributions
advertisement-0.398
ENSE-0.321
INT-0.321
ARB-0.310
gerald-0.304
Token May
Token ID1737
Feature activation+5.32e+00

Pos logit contributions
skirts+1.561
lik+1.174
haps+1.170
arthy+1.103
EStream+1.095
Neg logit contributions
advertisement-1.653
ENSE-1.382
INT-1.382
ARB-1.342
gerald-1.330
Token 25
Token ID1679
Feature activation+2.05e+00

Pos logit contributions
skirts+4.026
lik+2.972
haps+2.926
 vo+2.751
arthy+2.739
Neg logit contributions
advertisement-4.952
ENSE-4.208
INT-4.194
ARB-4.101
gerald-4.056
Token.
Token ID13
Feature activation+4.74e-01

Pos logit contributions
skirts+2.090
lik+1.668
haps+1.667
arthy+1.592
EStream+1.579
Neg logit contributions
advertisement-1.413
INT-1.116
ENSE-1.115
ARB-1.073
gerald-1.060
Token (
Token ID357
Feature activation-5.26e+00

Pos logit contributions
skirts+0.408
haps+0.317
lik+0.308
EStream+0.294
cigarettes+0.293
Neg logit contributions
advertisement-0.402
ENSE-0.338
INT-0.336
ARB-0.323
gerald-0.322
TokenJohn
Token ID7554
Feature activation-1.52e+01

Pos logit contributions
advertisement+2.960
INT+2.232
ENSE+2.190
ARB+2.084
MK+2.034
Neg logit contributions
skirts-6.032
haps-4.975
lik-4.858
EStream-4.797
arthy-4.714
Token McDonnell
Token ID39441
Feature activation-4.35e+00

Pos logit contributions
advertisement+7.397
 Payton+5.840
INT+5.739
STON+5.536
ARB+5.489
Neg logit contributions
skirts-15.878
EStream-13.641
accompan-13.399
haps-13.387
cigarettes-12.947
Token/
Token ID14
Feature activation-5.68e+00

Pos logit contributions
advertisement+2.526
INT+1.912
ENSE+1.897
ARB+1.763
Db+1.740
Neg logit contributions
skirts-4.941
haps-4.049
lik-3.934
EStream-3.880
accompan-3.862
TokenThe
Token ID464
Feature activation-4.46e+00

Pos logit contributions
advertisement+3.503
INT+2.764
ENSE+2.689
ARB+2.587
Bench+2.546
Neg logit contributions
skirts-6.291
haps-5.091
lik-4.949
EStream-4.886
 constitu-4.877
Token Washington
Token ID2669
Feature activation-4.15e-01

Pos logit contributions
advertisement+2.407
ENSE+1.772
INT+1.769
gerald+1.638
ARB+1.629
Neg logit contributions
skirts-5.172
haps-4.324
lik-4.194
EStream-4.143
cigarettes-4.111
Token Post
Token ID2947
Feature activation-3.88e+00

Pos logit contributions
advertisement+0.352
INT+0.293
ENSE+0.291
ARB+0.282
gerald+0.280
Neg logit contributions
skirts-0.359
haps-0.274
lik-0.272
EStream-0.257
arthy-0.257
Token
Token ID198
Feature activation-4.68e+00

Pos logit contributions
skirts+0.290
haps+0.205
lik+0.203
EStream+0.191
arthy+0.188
Neg logit contributions
advertisement-0.413
INT-0.354
ENSE-0.352
ARB-0.344
Db-0.340
TokenDevelop
Token ID19246
Feature activation-7.46e+00

Pos logit contributions
advertisement+2.927
INT+2.279
ENSE+2.172
ARB+2.163
Db+2.114
Neg logit contributions
skirts-5.096
haps-4.146
lik-4.139
EStream-4.025
arthy-4.009
Tokened
Token ID276
Feature activation-5.83e+00

Pos logit contributions
advertisement+5.342
ENSE+4.251
INT+4.135
Bench+4.101
Db+4.092
Neg logit contributions
skirts-6.945
haps-5.642
lik-5.615
EStream-5.507
arthy-5.470
Token by
Token ID416
Feature activation-6.64e+00

Pos logit contributions
advertisement+4.175
INT+3.429
ENSE+3.334
ARB+3.270
Db+3.228
Neg logit contributions
skirts-5.469
lik-4.358
haps-4.344
arthy-4.211
EStream-4.171
Token:
Token ID25
Feature activation-3.72e+00

Pos logit contributions
advertisement+5.019
INT+4.087
ENSE+3.985
ARB+3.888
gerald+3.827
Neg logit contributions
skirts-6.045
lik-4.736
haps-4.712
EStream-4.706
arthy-4.548
Token Relic
Token ID43437
Feature activation-1.36e+01

Pos logit contributions
advertisement+3.021
INT+2.484
ENSE+2.453
ARB+2.370
gerald+2.338
Neg logit contributions
skirts-3.308
lik-2.556
haps-2.544
EStream-2.504
arthy-2.423
Token Entertainment
Token ID11058
Feature activation-3.54e+00

Pos logit contributions
advertisement+7.679
ENSE+6.323
INT+6.308
Db+5.850
INESS+5.791
Neg logit contributions
skirts-13.299
haps-10.676
lik-10.614
arthy-10.374
EStream-10.373
Token -
Token ID532
Feature activation-6.01e+00

Pos logit contributions
advertisement+3.204
INT+2.720
ENSE+2.719
ARB+2.650
Db+2.610
Neg logit contributions
skirts-2.798
lik-2.074
haps-2.053
EStream-1.979
arthy-1.949
Token Sierra
Token ID17559
Feature activation-6.11e+00

Pos logit contributions
advertisement+4.639
ENSE+3.792
INT+3.788
ARB+3.637
Db+3.599
Neg logit contributions
skirts-5.418
lik-4.238
haps-4.127
EStream-4.117
arthy-3.992
Token Studios
Token ID13799
Feature activation-5.32e+00

Pos logit contributions
advertisement+3.254
INT+2.494
ENSE+2.407
Db+2.306
ARB+2.270
Neg logit contributions
skirts-6.896
haps-5.719
lik-5.681
EStream-5.499
arthy-5.460
Token (
Token ID357
Feature activation-6.09e+00

Pos logit contributions
advertisement+4.699
INT+4.048
ENSE+4.031
ARB+3.934
Db+3.873
Neg logit contributions
skirts-4.192
haps-3.102
lik-3.096
EStream-2.987
arthy-2.940
Token,
Token ID11
Feature activation-1.48e+00

Pos logit contributions
advertisement+1.260
ENSE+0.992
INT+0.989
ARB+0.939
gerald+0.934
Neg logit contributions
skirts-1.997
lik-1.591
haps-1.584
arthy-1.532
EStream-1.532
Token the
Token ID262
Feature activation+5.83e-01

Pos logit contributions
advertisement+1.187
ENSE+0.983
INT+0.974
ARB+0.935
gerald+0.931
Neg logit contributions
skirts-1.337
lik-1.021
haps-1.016
EStream-0.987
arthy-0.982
Token team
Token ID1074
Feature activation+1.61e+00

Pos logit contributions
skirts+0.538
lik+0.414
haps+0.413
arthy+0.395
EStream+0.392
Neg logit contributions
advertisement-0.464
INT-0.380
ENSE-0.379
ARB-0.365
gerald-0.363
Token was
Token ID373
Feature activation-1.57e+00

Pos logit contributions
skirts+1.340
lik+0.997
haps+0.997
EStream+0.949
arthy+0.948
Neg logit contributions
advertisement-1.430
INT-1.191
ENSE-1.190
ARB-1.156
gerald-1.143
Token 63
Token ID8093
Feature activation-5.91e+00

Pos logit contributions
advertisement+1.299
INT+1.064
ENSE+1.063
ARB+1.024
gerald+1.010
Neg logit contributions
skirts-1.400
lik-1.051
haps-1.049
arthy-1.013
EStream-1.008
Token-
Token ID12
Feature activation-1.32e+01

Pos logit contributions
advertisement+4.348
ENSE+3.578
INT+3.578
Db+3.423
ARB+3.377
Neg logit contributions
skirts-5.535
lik-4.430
haps-4.282
arthy-4.181
EStream-4.115
Token3
Token ID18
Feature activation-7.94e+00

Pos logit contributions
advertisement+5.553
INT+4.142
ENSE+3.967
gerald+3.367
Db+3.307
Neg logit contributions
skirts-15.224
lik-12.724
haps-12.489
arthy-12.463
 vo-12.443
Token when
Token ID618
Feature activation-5.04e+00

Pos logit contributions
advertisement+5.084
INT+3.970
ENSE+3.947
ARB+3.709
Db+3.651
Neg logit contributions
skirts-8.207
lik-6.460
haps-6.391
arthy-6.246
cigarettes-6.188
Token leading
Token ID3756
Feature activation-6.29e+00

Pos logit contributions
advertisement+3.636
ENSE+2.898
INT+2.855
ARB+2.733
Bench+2.700
Neg logit contributions
skirts-4.928
lik-3.849
haps-3.826
arthy-3.733
EStream-3.687
Token after
Token ID706
Feature activation-2.37e+00

Pos logit contributions
advertisement+4.140
INT+3.241
ENSE+3.233
ARB+3.085
Bench+2.992
Neg logit contributions
skirts-6.438
lik-5.116
haps-5.059
arthy-4.978
cigarettes-4.892
Token eight
Token ID3624
Feature activation-3.54e+00

Pos logit contributions
advertisement+2.008
ENSE+1.664
INT+1.660
ARB+1.586
gerald+1.578
Neg logit contributions
skirts-2.035
lik-1.547
haps-1.537
arthy-1.484
EStream-1.482
Token practice
Token ID3357
Feature activation+1.00e+00

Pos logit contributions
advertisement+4.450
ENSE+3.651
INT+3.629
ARB+3.444
Bench+3.393
Neg logit contributions
skirts-5.511
haps-4.241
lik-4.199
arthy-4.070
EStream-4.015
Token May
Token ID1737
Feature activation+5.72e+00

Pos logit contributions
skirts+0.889
lik+0.682
haps+0.682
arthy+0.646
EStream+0.640
Neg logit contributions
advertisement-0.827
INT-0.680
ENSE-0.680
ARB-0.658
gerald-0.651
Token 25
Token ID1679
Feature activation+4.32e+00

Pos logit contributions
skirts+4.455
lik+3.327
haps+3.267
 vo+3.081
arthy+3.050
Neg logit contributions
advertisement-5.187
ENSE-4.393
INT-4.370
ARB-4.290
gerald-4.228
Token.
Token ID13
Feature activation+1.52e+00

Pos logit contributions
skirts+4.261
lik+3.374
haps+3.330
EStream+3.180
 vo+3.177
Neg logit contributions
advertisement-3.080
ENSE-2.475
INT-2.470
ARB-2.384
gerald-2.353
Token (
Token ID357
Feature activation-4.70e+00

Pos logit contributions
skirts+1.292
haps+0.990
lik+0.975
arthy+0.918
EStream+0.917
Neg logit contributions
advertisement-1.317
ENSE-1.100
INT-1.097
ARB-1.058
gerald-1.051
TokenJohn
Token ID7554
Feature activation-1.30e+01

Pos logit contributions
advertisement+2.799
INT+2.140
ENSE+2.098
ARB+2.006
Db+1.977
Neg logit contributions
skirts-5.269
haps-4.293
lik-4.233
EStream-4.147
arthy-4.089
Token McDonnell
Token ID39441
Feature activation-2.83e+00

Pos logit contributions
advertisement+7.566
INT+6.215
ARB+6.171
allery+6.143
 Payton+6.084
Neg logit contributions
skirts-12.015
EStream-10.520
haps-9.845
cigarettes-9.634
lik-9.575
Token/
Token ID14
Feature activation-4.79e+00

Pos logit contributions
advertisement+1.707
INT+1.318
ENSE+1.317
ARB+1.226
gerald+1.192
Neg logit contributions
skirts-3.116
haps-2.565
lik-2.500
cigarettes-2.421
EStream-2.411
TokenThe
Token ID464
Feature activation-4.18e+00

Pos logit contributions
advertisement+2.998
INT+2.365
ENSE+2.331
ARB+2.276
Bench+2.219
Neg logit contributions
skirts-5.213
haps-4.129
lik-4.087
EStream-3.976
 vo-3.967
Token Washington
Token ID2669
Feature activation-4.56e-01

Pos logit contributions
advertisement+2.386
ENSE+1.743
INT+1.734
gerald+1.650
ARB+1.644
Neg logit contributions
skirts-4.803
haps-3.923
lik-3.859
arthy-3.804
EStream-3.762
Token Post
Token ID2947
Feature activation-2.91e+00

Pos logit contributions
advertisement+0.391
INT+0.325
ENSE+0.324
ARB+0.314
gerald+0.312
Neg logit contributions
skirts-0.392
haps-0.297
lik-0.296
EStream-0.279
arthy-0.279
Token will
Token ID481
Feature activation+4.99e-01

Pos logit contributions
skirts+1.069
haps+0.793
lik+0.792
arthy+0.741
EStream+0.741
Neg logit contributions
advertisement-1.225
INT-1.034
ENSE-1.027
ARB-0.999
gerald-0.986
Token be
Token ID307
Feature activation-1.53e-01

Pos logit contributions
skirts+0.498
lik+0.393
haps+0.390
EStream+0.378
arthy+0.374
Neg logit contributions
advertisement-0.356
INT-0.286
ENSE-0.285
ARB-0.272
gerald-0.268
Token eliminated
Token ID15254
Feature activation+3.04e-01

Pos logit contributions
advertisement+0.125
INT+0.104
ENSE+0.104
ARB+0.101
Db+0.099
Neg logit contributions
skirts-0.136
lik-0.103
haps-0.103
arthy-0.098
cigarettes-0.097
Token entirely
Token ID5000
Feature activation-3.32e+00

Pos logit contributions
skirts+0.299
haps+0.236
lik+0.235
arthy+0.225
EStream+0.224
Neg logit contributions
advertisement-0.222
INT-0.178
ENSE-0.177
ARB-0.170
Db-0.168
Token.
Token ID13
Feature activation-2.00e+00

Pos logit contributions
advertisement+2.159
INT+1.683
ENSE+1.668
Db+1.587
ARB+1.582
Neg logit contributions
skirts-3.462
haps-2.797
lik-2.756
arthy-2.673
EStream-2.659
Token Keep
Token ID9175
Feature activation-1.30e+01

Pos logit contributions
advertisement+1.748
INT+1.460
ENSE+1.451
ARB+1.390
Db+1.389
Neg logit contributions
skirts-1.675
haps-1.295
lik-1.273
EStream-1.208
arthy-1.176
Token in
Token ID287
Feature activation-3.46e+00

Pos logit contributions
advertisement+8.835
ENSE+7.124
INT+6.822
Bench+6.451
ARB+6.313
Neg logit contributions
skirts-11.652
haps-9.769
EStream-9.745
lik-9.702
cigarettes-9.128
Token mind
Token ID2000
Feature activation-4.31e+00

Pos logit contributions
advertisement+1.845
ENSE+1.371
INT+1.353
ARB+1.255
INESS+1.210
Neg logit contributions
skirts-4.038
lik-3.358
haps-3.334
cigarettes-3.200
EStream-3.197
Token that
Token ID326
Feature activation-2.54e+00

Pos logit contributions
advertisement+3.063
ENSE+2.466
INT+2.455
ARB+2.326
Db+2.285
Neg logit contributions
skirts-4.200
lik-3.375
haps-3.355
arthy-3.207
EStream-3.190
Token this
Token ID428
Feature activation-5.23e+00

Pos logit contributions
advertisement+2.011
ENSE+1.650
INT+1.650
ARB+1.582
Db+1.555
Neg logit contributions
skirts-2.302
lik-1.781
haps-1.776
EStream-1.724
arthy-1.713
Token has
Token ID468
Feature activation-4.48e+00

Pos logit contributions
advertisement+4.490
INT+3.659
ENSE+3.643
ARB+3.484
Bench+3.448
Neg logit contributions
skirts-4.407
haps-3.351
lik-3.308
EStream-3.286
arthy-3.169
Tokenlf
Token ID1652
Feature activation+1.11e+00

Pos logit contributions
advertisement+1.426
INT+1.147
ENSE+1.133
gerald+1.123
ARB+1.101
Neg logit contributions
skirts-1.809
haps-1.396
EStream-1.362
lik-1.361
cigarettes-1.326
Token appeared
Token ID4120
Feature activation+1.20e+00

Pos logit contributions
skirts+0.938
lik+0.702
haps+0.700
EStream+0.663
arthy+0.662
Neg logit contributions
advertisement-0.971
INT-0.814
ENSE-0.809
ARB-0.788
gerald-0.784
Token in
Token ID287
Feature activation+9.55e-01

Pos logit contributions
skirts+1.099
haps+0.849
lik+0.849
arthy+0.808
EStream+0.805
Neg logit contributions
advertisement-0.954
INT-0.785
ENSE-0.779
ARB-0.753
gerald-0.749
Token the
Token ID262
Feature activation-7.74e-01

Pos logit contributions
skirts+0.841
lik+0.645
haps+0.641
arthy+0.609
EStream+0.606
Neg logit contributions
advertisement-0.790
INT-0.655
ENSE-0.652
ARB-0.634
gerald-0.630
Token "
Token ID366
Feature activation-3.43e-01

Pos logit contributions
advertisement+0.584
INT+0.476
ENSE+0.470
ARB+0.456
gerald+0.454
Neg logit contributions
skirts-0.740
lik-0.578
haps-0.574
EStream-0.550
arthy-0.550
TokenKeep
Token ID15597
Feature activation-1.30e+01

Pos logit contributions
advertisement+0.228
INT+0.179
ENSE+0.176
ARB+0.171
Db+0.168
Neg logit contributions
skirts-0.359
haps-0.289
lik-0.288
arthy-0.277
EStream-0.276
Token Fish
Token ID13388
Feature activation-3.68e+00

Pos logit contributions
advertisement+8.756
INT+7.037
ENSE+7.015
Bench+7.010
Db+6.965
Neg logit contributions
skirts-10.896
haps-9.033
EStream-8.868
lik-8.712
 constitu-8.519
Tokenin
Token ID259
Feature activation-9.58e-01

Pos logit contributions
advertisement+2.702
INT+2.213
ENSE+2.180
gerald+2.126
Db+2.113
Neg logit contributions
skirts-3.480
haps-2.750
lik-2.694
EStream-2.613
 vo-2.564
Token'"
Token ID29653
Feature activation-3.57e+00

Pos logit contributions
advertisement+0.618
INT+0.487
ENSE+0.487
ARB+0.466
gerald+0.465
Neg logit contributions
skirts-1.009
lik-0.813
haps-0.811
arthy-0.778
EStream-0.774
Token music
Token ID2647
Feature activation+8.68e-01

Pos logit contributions
advertisement+2.627
INT+2.127
ENSE+2.081
ARB+2.014
gerald+2.009
Neg logit contributions
skirts-3.439
haps-2.681
lik-2.680
EStream-2.605
arthy-2.591
Token video
Token ID2008
Feature activation+3.68e+00

Pos logit contributions
skirts+0.823
lik+0.643
haps+0.639
arthy+0.611
EStream+0.609
Neg logit contributions
advertisement-0.667
INT-0.538
ENSE-0.537
ARB-0.520
gerald-0.516
Token."
Token ID526
Feature activation+1.32e-01

Pos logit contributions
advertisement+1.045
INT+0.817
ENSE+0.815
ARB+0.783
gerald+0.768
Neg logit contributions
skirts-1.642
haps-1.327
lik-1.312
EStream-1.259
arthy-1.255
Token "
Token ID366
Feature activation+1.08e+00

Pos logit contributions
skirts+0.083
haps+0.056
lik+0.056
arthy+0.051
 vo+0.051
Neg logit contributions
advertisement-0.142
INT-0.122
ENSE-0.122
ARB-0.119
gerald-0.119
TokenThat
Token ID2504
Feature activation-8.91e+00

Pos logit contributions
skirts+1.171
haps+0.948
lik+0.943
EStream+0.906
arthy+0.906
Neg logit contributions
advertisement-0.682
INT-0.528
ENSE-0.522
ARB-0.502
Db-0.490
Token didn
Token ID1422
Feature activation-6.02e+00

Pos logit contributions
advertisement+6.576
ENSE+5.569
INT+5.398
gerald+5.188
ARB+5.175
Neg logit contributions
skirts-8.075
EStream-6.216
haps-6.101
arthy-5.965
lik-5.900
Token't
Token ID470
Feature activation-1.04e+01

Pos logit contributions
advertisement+5.295
INT+4.355
ENSE+4.253
ARB+4.101
Db+4.078
Neg logit contributions
skirts-5.059
haps-3.882
EStream-3.853
accompan-3.844
lik-3.683
Token work
Token ID670
Feature activation-1.30e+01

Pos logit contributions
advertisement+6.591
ENSE+5.225
INT+5.081
 itch+4.950
Db+4.657
Neg logit contributions
skirts-10.689
EStream-8.931
lik-8.131
accompan-8.079
izons-8.028
Token in
Token ID287
Feature activation-4.98e+00

Pos logit contributions
advertisement+7.687
INT+6.321
ENSE+5.954
gerald+5.476
ARB+5.378
Neg logit contributions
skirts-13.276
EStream-10.266
haps-10.141
accompan-10.078
arthy-10.016
Token the
Token ID262
Feature activation-4.34e+00

Pos logit contributions
advertisement+3.671
INT+3.086
ENSE+3.048
ARB+2.948
gerald+2.895
Neg logit contributions
skirts-4.700
lik-3.620
haps-3.605
EStream-3.557
arthy-3.447
Token 1930
Token ID15533
Feature activation-1.16e+01

Pos logit contributions
advertisement+3.471
INT+2.893
ENSE+2.852
ARB+2.792
gerald+2.726
Neg logit contributions
skirts-3.877
haps-3.010
lik-2.945
EStream-2.915
arthy-2.822
Tokens
Token ID82
Feature activation-1.02e+01

Pos logit contributions
advertisement+5.183
INT+3.641
ENSE+3.574
ARB+3.344
Db+3.285
Neg logit contributions
skirts-13.487
haps-11.469
lik-11.193
arthy-10.976
EStream-10.912
Token,
Token ID11
Feature activation-7.24e+00

Pos logit contributions
advertisement+5.798
INT+4.494
ENSE+4.407
gerald+4.176
ARB+4.126
Neg logit contributions
skirts-10.854
haps-8.747
lik-8.537
EStream-8.466
arthy-8.421
Token (
Token ID357
Feature activation-5.42e+00

Pos logit contributions
advertisement+2.005
INT+1.660
ENSE+1.646
ARB+1.583
gerald+1.572
Neg logit contributions
skirts-2.184
haps-1.723
lik-1.671
EStream-1.609
cigarettes-1.571
TokenPhoto
Token ID6191
Feature activation-7.88e+00

Pos logit contributions
advertisement+3.248
INT+2.519
ENSE+2.449
ARB+2.385
gerald+2.325
Neg logit contributions
skirts-6.002
haps-4.887
lik-4.804
EStream-4.718
arthy-4.671
Token by
Token ID416
Feature activation-2.13e+00

Pos logit contributions
advertisement+5.557
INT+4.495
ENSE+4.492
gerald+4.385
MK+4.374
Neg logit contributions
skirts-7.458
haps-5.906
lik-5.822
nels-5.605
EStream-5.553
Token REUTERS
Token ID15862
Feature activation-9.62e+00

Pos logit contributions
advertisement+1.411
INT+1.090
ENSE+1.086
ARB+1.049
gerald+1.042
Neg logit contributions
skirts-2.241
lik-1.798
haps-1.796
EStream-1.727
arthy-1.714
Token/
Token ID14
Feature activation-7.49e+00

Pos logit contributions
advertisement+5.147
ENSE+4.178
INT+4.115
gerald+3.871
INESS+3.785
Neg logit contributions
skirts-10.373
haps-8.364
lik-8.286
EStream-8.273
arthy-8.128
TokenJohn
Token ID7554
Feature activation-1.29e+01

Pos logit contributions
advertisement+3.834
INT+2.853
ENSE+2.725
ARB+2.624
Db+2.571
Neg logit contributions
skirts-8.992
haps-7.384
lik-7.251
EStream-7.227
cigarettes-7.067
Token S
Token ID311
Feature activation-7.34e+00

Pos logit contributions
advertisement+7.383
INT+6.097
ENSE+6.005
Db+5.995
 Payton+5.862
Neg logit contributions
skirts-12.602
EStream-10.555
cigarettes-9.936
lik-9.874
haps-9.827
Tokenomm
Token ID2002
Feature activation-1.24e+00

Pos logit contributions
advertisement+5.870
ENSE+4.990
INT+4.884
ARB+4.817
allery+4.710
Neg logit contributions
skirts-6.287
haps-4.884
lik-4.797
EStream-4.790
cigarettes-4.617
Tokeners
Token ID364
Feature activation-2.82e+00

Pos logit contributions
advertisement+0.635
ENSE+0.457
INT+0.457
ARB+0.431
gerald+0.422
Neg logit contributions
skirts-1.481
lik-1.221
haps-1.220
arthy-1.177
EStream-1.173
Token II
Token ID2873
Feature activation-1.48e+00

Pos logit contributions
advertisement+1.931
INT+1.574
ENSE+1.567
gerald+1.504
ARB+1.486
Neg logit contributions
skirts-2.824
haps-2.230
lik-2.208
EStream-2.134
cigarettes-2.132
Token)
Token ID8
Feature activation+2.55e+00

Pos logit contributions
advertisement+1.051
ENSE+0.848
INT+0.846
ARB+0.805
gerald+0.803
Neg logit contributions
skirts-1.478
haps-1.167
lik-1.163
arthy-1.111
EStream-1.110
Token your
Token ID534
Feature activation+1.20e+00

Pos logit contributions
skirts+2.158
lik+1.646
haps+1.640
arthy+1.551
EStream+1.538
Neg logit contributions
advertisement-2.095
INT-1.736
ENSE-1.734
ARB-1.682
gerald-1.667
Token job
Token ID1693
Feature activation-4.49e-01

Pos logit contributions
skirts+1.125
haps+0.873
lik+0.870
arthy+0.829
EStream+0.829
Neg logit contributions
advertisement-0.927
INT-0.758
ENSE-0.756
ARB-0.725
gerald-0.719
Token isn
Token ID2125
Feature activation-1.69e+00

Pos logit contributions
advertisement+0.399
INT+0.335
ENSE+0.332
ARB+0.321
INESS+0.319
Neg logit contributions
skirts-0.372
haps-0.276
lik-0.276
EStream-0.264
arthy-0.262
Token't
Token ID470
Feature activation-4.09e+00

Pos logit contributions
advertisement+1.607
INT+1.348
ENSE+1.325
ARB+1.295
Db+1.281
Neg logit contributions
skirts-1.316
haps-0.974
lik-0.947
EStream-0.935
arthy-0.897
Token to
Token ID284
Feature activation-7.37e+00

Pos logit contributions
advertisement+3.199
INT+2.624
ENSE+2.599
ARB+2.474
gerald+2.466
Neg logit contributions
skirts-3.766
haps-2.899
lik-2.886
EStream-2.820
cigarettes-2.792
Token keep
Token ID1394
Feature activation-1.29e+01

Pos logit contributions
advertisement+5.055
INT+4.021
ENSE+3.970
gerald+3.808
Bench+3.737
Neg logit contributions
skirts-7.266
EStream-5.738
lik-5.594
haps-5.578
cigarettes-5.549
Token Red
Token ID2297
Feature activation-3.35e+00

Pos logit contributions
advertisement+8.741
ENSE+7.410
INT+7.108
ARB+6.843
Bench+6.708
Neg logit contributions
skirts-11.257
EStream-9.156
haps-9.013
lik-8.829
arthy-8.561
Token Hat
Token ID10983
Feature activation-5.76e+00

Pos logit contributions
advertisement+2.590
INT+2.203
ENSE+2.175
Bench+2.105
Db+2.100
Neg logit contributions
skirts-3.037
lik-2.304
haps-2.290
EStream-2.266
 vo-2.214
Token or
Token ID393
Feature activation-1.98e+00

Pos logit contributions
advertisement+4.095
ENSE+3.382
INT+3.305
ARB+3.214
Bench+3.184
Neg logit contributions
skirts-5.562
haps-4.361
EStream-4.311
lik-4.304
cigarettes-4.119
Token CentOS
Token ID32418
Feature activation-2.02e+00

Pos logit contributions
advertisement+1.513
INT+1.241
ENSE+1.241
ARB+1.197
gerald+1.176
Neg logit contributions
skirts-1.870
lik-1.434
haps-1.408
EStream-1.387
arthy-1.385
Token servers
Token ID9597
Feature activation-4.49e+00

Pos logit contributions
advertisement+1.486
ENSE+1.203
INT+1.192
ARB+1.152
gerald+1.137
Neg logit contributions
skirts-1.972
lik-1.542
haps-1.541
EStream-1.499
arthy-1.466
Token
Token ID198
Feature activation-6.83e-01

Pos logit contributions
advertisement+1.524
INT+1.263
ENSE+1.253
ARB+1.225
Db+1.217
Neg logit contributions
skirts-1.502
haps-1.148
lik-1.130
EStream-1.079
cigarettes-1.052
Token
Token ID198
Feature activation-3.06e+00

Pos logit contributions
advertisement+0.677
INT+0.577
ENSE+0.571
ARB+0.560
Db+0.553
Neg logit contributions
skirts-0.504
haps-0.361
lik-0.354
EStream-0.343
arthy-0.332
TokenLast
Token ID5956
Feature activation-8.21e+00

Pos logit contributions
advertisement+2.017
INT+1.543
ENSE+1.499
ARB+1.481
Db+1.436
Neg logit contributions
skirts-3.298
haps-2.644
lik-2.622
EStream-2.563
 vo-2.546
Token year
Token ID614
Feature activation+1.01e+00

Pos logit contributions
advertisement+5.814
INT+4.724
ENSE+4.584
ARB+4.536
Db+4.498
Neg logit contributions
skirts-7.745
arthy-5.878
haps-5.874
lik-5.848
EStream-5.785
Token's
Token ID338
Feature activation-2.89e+00

Pos logit contributions
skirts+0.980
lik+0.770
haps+0.769
arthy+0.733
EStream+0.729
Neg logit contributions
advertisement-0.750
INT-0.604
ENSE-0.603
ARB-0.584
gerald-0.577
Token Walking
Token ID21276
Feature activation-1.28e+01

Pos logit contributions
advertisement+2.227
INT+1.790
ENSE+1.781
ARB+1.730
gerald+1.716
Neg logit contributions
skirts-2.702
haps-2.086
lik-2.083
EStream-2.015
arthy-2.005
Token Dead
Token ID5542
Feature activation-6.03e+00

Pos logit contributions
advertisement+7.565
INT+5.654
Db+5.583
Bench+5.458
ENSE+5.339
Neg logit contributions
skirts-12.940
haps-11.098
EStream-10.900
accompan-10.743
lik-10.419
Token maze
Token ID31237
Feature activation-4.19e+00

Pos logit contributions
advertisement+4.975
INT+4.024
ENSE+3.996
MK+3.925
ARB+3.910
Neg logit contributions
skirts-5.153
haps-3.943
EStream-3.832
lik-3.817
arthy-3.715
Token was
Token ID373
Feature activation-3.60e+00

Pos logit contributions
advertisement+3.549
INT+2.914
ENSE+2.893
ARB+2.826
gerald+2.811
Neg logit contributions
skirts-3.547
haps-2.693
lik-2.648
EStream-2.591
arthy-2.579
Token split
Token ID6626
Feature activation+9.54e-02

Pos logit contributions
advertisement+2.872
INT+2.365
ENSE+2.344
ARB+2.305
gerald+2.265
Neg logit contributions
skirts-3.222
lik-2.449
haps-2.438
EStream-2.369
arthy-2.356
Token between
Token ID1022
Feature activation-8.59e-01

Pos logit contributions
skirts+0.091
lik+0.071
haps+0.070
arthy+0.067
EStream+0.067
Neg logit contributions
advertisement-0.073
INT-0.059
ENSE-0.059
ARB-0.057
gerald-0.056
Token,
Token ID11
Feature activation-6.58e+00

Pos logit contributions
advertisement+5.289
ARB+4.207
INT+4.171
INESS+4.162
ENSE+4.157
Neg logit contributions
skirts-6.660
lik-5.197
arthy-5.094
haps-5.062
EStream-4.954
Token055
Token ID47838
Feature activation-5.45e+00

Pos logit contributions
advertisement+3.599
INT+2.688
ENSE+2.675
ARB+2.508
INESS+2.475
Neg logit contributions
skirts-7.351
lik-6.070
haps-5.950
arthy-5.761
 vo-5.729
Token complaints
Token ID9687
Feature activation-5.79e+00

Pos logit contributions
advertisement+4.383
INT+3.561
ARB+3.555
ENSE+3.521
Db+3.468
Neg logit contributions
skirts-4.855
haps-3.648
lik-3.636
arthy-3.533
EStream-3.523
Token filed
Token ID5717
Feature activation-2.78e+00

Pos logit contributions
advertisement+4.294
INT+3.523
ARB+3.478
ENSE+3.458
Db+3.389
Neg logit contributions
skirts-5.482
haps-4.179
lik-4.139
arthy-4.085
EStream-4.041
Token against
Token ID1028
Feature activation-2.43e+00

Pos logit contributions
advertisement+2.013
INT+1.622
ENSE+1.605
ARB+1.559
gerald+1.549
Neg logit contributions
skirts-2.740
lik-2.142
haps-2.125
arthy-2.069
EStream-2.040
Token Wells
Token ID18292
Feature activation-1.28e+01

Pos logit contributions
advertisement+1.777
INT+1.423
ENSE+1.411
ARB+1.399
gerald+1.369
Neg logit contributions
skirts-2.367
lik-1.845
haps-1.828
arthy-1.780
EStream-1.770
Token Fargo
Token ID28125
Feature activation-4.24e+00

Pos logit contributions
advertisement+5.034
INT+3.729
ENSE+3.555
ARB+3.316
Db+3.311
Neg logit contributions
skirts-15.219
haps-12.499
EStream-12.492
lik-12.491
nels-12.219
Token
Token ID960
Feature activation-4.52e+00

Pos logit contributions
advertisement+2.613
INT+2.037
ENSE+2.022
ARB+1.957
gerald+1.931
Neg logit contributions
skirts-4.570
haps-3.667
lik-3.645
arthy-3.515
EStream-3.512
Tokenwhich
Token ID4758
Feature activation-5.75e+00

Pos logit contributions
advertisement+3.355
INT+2.655
ARB+2.574
ENSE+2.545
gerald+2.510
Neg logit contributions
skirts-4.265
haps-3.352
lik-3.328
EStream-3.315
arthy-3.297
Token is
Token ID318
Feature activation-3.60e+00

Pos logit contributions
advertisement+4.670
INT+3.880
ENSE+3.847
ARB+3.833
gerald+3.782
Neg logit contributions
skirts-4.902
lik-3.702
EStream-3.661
haps-3.647
arthy-3.580
Token accused
Token ID5371
Feature activation+1.84e+00

Pos logit contributions
advertisement+2.935
ENSE+2.381
INT+2.381
ARB+2.343
gerald+2.274
Neg logit contributions
skirts-3.229
lik-2.445
haps-2.428
arthy-2.376
EStream-2.356
Tokenwww
Token ID2503
Feature activation-1.77e+00

Pos logit contributions
advertisement+3.109
INT+2.372
gerald+2.327
ENSE+2.326
Db+2.225
Neg logit contributions
skirts-4.987
EStream-4.050
haps-3.904
 constitu-3.887
lik-3.812
Token.
Token ID13
Feature activation-1.67e+00

Pos logit contributions
advertisement+1.117
INT+0.839
ENSE+0.819
ARB+0.782
Db+0.771
Neg logit contributions
skirts-1.955
haps-1.598
lik-1.564
EStream-1.546
arthy-1.513
Tokenbel
Token ID6667
Feature activation-7.08e+00

Pos logit contributions
advertisement+1.899
INT+1.625
gerald+1.624
ENSE+1.607
ARB+1.581
Neg logit contributions
skirts-0.990
haps-0.596
EStream-0.592
lik-0.585
arthy-0.559
Tokenfast
Token ID7217
Feature activation-7.62e+00

Pos logit contributions
advertisement+2.100
INT+1.018
gerald+0.955
ENSE+0.798
Bench+0.793
Neg logit contributions
skirts-9.686
EStream-8.232
 constitu-8.195
lik-8.067
arthy-7.966
Tokente
Token ID660
Feature activation-8.43e+00

Pos logit contributions
advertisement+5.031
gerald+3.802
INT+3.742
ENSE+3.610
MK+3.549
Neg logit contributions
skirts-7.745
EStream-6.204
 constitu-6.167
haps-5.944
arthy-5.879
Tokenlegraph
Token ID16606
Feature activation-1.26e+01

Pos logit contributions
advertisement+2.957
INT+1.419
gerald+1.303
ENSE+1.245
ARB+1.194
Neg logit contributions
skirts-11.510
EStream-9.644
arthy-9.378
 constitu-9.367
lik-9.356
Token.
Token ID13
Feature activation-3.25e+00

Pos logit contributions
advertisement+5.750
INT+3.529
MK+3.198
Db+3.154
ENSE+3.130
Neg logit contributions
skirts-15.092
accompan-13.258
 constitu-12.967
EStream-12.856
 occas-12.711
Tokenco
Token ID1073
Feature activation-7.64e+00

Pos logit contributions
advertisement+2.683
INT+2.166
gerald+2.121
ENSE+2.096
MK+2.077
Neg logit contributions
skirts-2.897
haps-2.214
EStream-2.202
lik-2.131
arthy-2.116
Token.
Token ID13
Feature activation-3.79e+00

Pos logit contributions
advertisement+4.119
INT+2.832
ENSE+2.723
MK+2.603
Db+2.584
Neg logit contributions
skirts-8.972
accompan-7.548
EStream-7.450
haps-7.408
 constitu-7.310
Tokenuk
Token ID2724
Feature activation+1.40e+00

Pos logit contributions
advertisement+3.640
INT+3.043
ENSE+3.012
MK+2.983
ARB+2.926
Neg logit contributions
skirts-2.857
haps-2.065
EStream-2.014
lik-1.988
arthy-1.966
Token/
Token ID14
Feature activation-4.10e+00

Pos logit contributions
skirts+1.449
lik+1.159
haps+1.157
arthy+1.105
EStream+1.099
Neg logit contributions
advertisement-0.953
INT-0.750
ENSE-0.748
ARB-0.719
gerald-0.711
Token 2011
Token ID2813
Feature activation-5.98e+00

Pos logit contributions
advertisement+3.528
ENSE+2.947
INT+2.922
gerald+2.790
ARB+2.739
Neg logit contributions
skirts-5.222
lik-4.149
haps-4.072
EStream-4.016
arthy-3.993
Token,
Token ID11
Feature activation-4.20e+00

Pos logit contributions
advertisement+3.749
ENSE+2.929
INT+2.899
ARB+2.761
gerald+2.733
Neg logit contributions
skirts-6.312
lik-5.073
haps-5.062
arthy-4.879
EStream-4.836
Token it
Token ID340
Feature activation-3.28e+00

Pos logit contributions
advertisement+3.064
ENSE+2.505
INT+2.462
ARB+2.366
gerald+2.329
Neg logit contributions
skirts-4.055
lik-3.165
haps-3.151
EStream-3.042
arthy-3.028
Token was
Token ID373
Feature activation-1.74e+00

Pos logit contributions
advertisement+2.891
INT+2.445
ENSE+2.439
ARB+2.415
gerald+2.358
Neg logit contributions
skirts-2.661
lik-1.922
haps-1.920
arthy-1.861
EStream-1.846
Token 62
Token ID8190
Feature activation-5.49e+00

Pos logit contributions
advertisement+1.476
ENSE+1.232
INT+1.224
ARB+1.190
gerald+1.175
Neg logit contributions
skirts-1.509
haps-1.123
lik-1.121
EStream-1.092
arthy-1.076
Token-
Token ID12
Feature activation-1.25e+01

Pos logit contributions
advertisement+3.954
INT+3.179
ENSE+3.125
Db+3.022
ARB+2.981
Neg logit contributions
skirts-5.364
haps-4.196
EStream-4.179
lik-4.139
accompan-4.060
Token3
Token ID18
Feature activation-7.97e+00

Pos logit contributions
advertisement+5.080
INT+3.683
ENSE+3.276
MK+3.047
Db+2.934
Neg logit contributions
skirts-14.830
EStream-12.590
haps-12.261
 constitu-12.130
lik-12.094
Token.
Token ID13
Feature activation-2.76e+00

Pos logit contributions
advertisement+4.917
ENSE+3.860
INT+3.812
ARB+3.599
Db+3.542
Neg logit contributions
skirts-8.279
lik-6.622
haps-6.610
arthy-6.392
cigarettes-6.339
Token
Token ID198
Feature activation-5.14e-01

Pos logit contributions
advertisement+2.281
INT+1.914
ENSE+1.897
ARB+1.814
Db+1.808
Neg logit contributions
skirts-2.436
haps-1.890
lik-1.857
EStream-1.759
nels-1.731
Token
Token ID198
Feature activation-3.43e+00

Pos logit contributions
advertisement+0.511
INT+0.434
ENSE+0.430
ARB+0.421
Db+0.417
Neg logit contributions
skirts-0.378
haps-0.270
lik-0.266
EStream-0.256
arthy-0.249
TokenIt
Token ID1026
Feature activation-6.62e+00

Pos logit contributions
advertisement+2.267
INT+1.754
ENSE+1.690
ARB+1.647
Bench+1.629
Neg logit contributions
skirts-3.703
haps-2.987
lik-2.951
EStream-2.882
arthy-2.878
TokenIt
Token ID1026
Feature activation-8.35e+00

Pos logit contributions
advertisement+2.936
INT+2.355
ENSE+2.297
ARB+2.191
Db+2.175
Neg logit contributions
skirts-4.635
haps-3.743
lik-3.692
arthy-3.587
EStream-3.579
Token was
Token ID373
Feature activation-5.76e+00

Pos logit contributions
advertisement+6.977
INT+6.050
ARB+5.956
ENSE+5.865
gerald+5.830
Neg logit contributions
skirts-6.395
lik-4.671
EStream-4.618
arthy-4.566
haps-4.502
Token an
Token ID281
Feature activation-6.15e+00

Pos logit contributions
advertisement+4.403
INT+3.714
ENSE+3.675
gerald+3.498
ARB+3.468
Neg logit contributions
skirts-5.231
haps-4.017
lik-3.997
EStream-3.875
arthy-3.870
Token explosive
Token ID13835
Feature activation-6.75e+00

Pos logit contributions
advertisement+4.198
ENSE+3.495
INT+3.488
ARB+3.232
gerald+3.122
Neg logit contributions
skirts-6.225
lik-4.839
EStream-4.838
haps-4.826
arthy-4.797
Token 80
Token ID4019
Feature activation-6.00e+00

Pos logit contributions
advertisement+4.873
INT+4.118
ENSE+4.063
gerald+3.830
ARB+3.746
Neg logit contributions
skirts-6.536
haps-5.086
lik-4.980
arthy-4.926
cigarettes-4.876
Token-
Token ID12
Feature activation-1.25e+01

Pos logit contributions
advertisement+4.406
INT+3.756
ENSE+3.694
ARB+3.493
Db+3.455
Neg logit contributions
skirts-5.678
lik-4.380
haps-4.362
EStream-4.228
arthy-4.192
Tokenyard
Token ID9413
Feature activation-1.14e+01

Pos logit contributions
advertisement+5.607
INT+4.561
ENSE+4.058
gerald+3.730
ARB+3.683
Neg logit contributions
skirts-14.228
 constitu-11.750
EStream-11.715
haps-11.627
arthy-11.614
Token sprint
Token ID18553
Feature activation-7.62e+00

Pos logit contributions
advertisement+7.383
INT+6.616
ENSE+6.149
gerald+5.868
ARB+5.855
Neg logit contributions
skirts-10.799
lik-8.438
haps-8.388
cigarettes-8.265
izons-8.143
Token,
Token ID11
Feature activation-5.65e+00

Pos logit contributions
advertisement+5.471
INT+4.584
ENSE+4.376
gerald+4.267
ARB+4.193
Neg logit contributions
skirts-7.151
haps-5.676
lik-5.483
cigarettes-5.418
izons-5.410
Token and
Token ID290
Feature activation-3.65e+00

Pos logit contributions
advertisement+3.996
INT+3.247
ENSE+3.218
ARB+3.099
Bench+3.014
Neg logit contributions
skirts-5.524
lik-4.310
haps-4.286
arthy-4.184
EStream-4.121
Token McF
Token ID38222
Feature activation+2.69e+00

Pos logit contributions
advertisement+2.764
INT+2.319
ENSE+2.307
ARB+2.208
Db+2.173
Neg logit contributions
skirts-3.366
haps-2.606
lik-2.581
EStream-2.525
arthy-2.520
Token five
Token ID1936
Feature activation-7.58e-01

Pos logit contributions
advertisement+2.265
ENSE+1.892
INT+1.887
gerald+1.812
ARB+1.781
Neg logit contributions
skirts-2.577
lik-1.981
haps-1.966
arthy-1.905
EStream-1.893
Token touchdowns
Token ID17692
Feature activation-7.44e+00

Pos logit contributions
advertisement+0.649
INT+0.545
ENSE+0.543
ARB+0.523
gerald+0.522
Neg logit contributions
skirts-0.647
lik-0.490
haps-0.490
arthy-0.466
EStream-0.460
Token in
Token ID287
Feature activation-3.17e+00

Pos logit contributions
advertisement+4.595
INT+3.737
ENSE+3.660
ARB+3.387
gerald+3.310
Neg logit contributions
skirts-7.790
haps-6.210
lik-6.105
arthy-5.994
EStream-5.904
Token a
Token ID257
Feature activation-3.37e+00

Pos logit contributions
advertisement+2.451
INT+2.055
ENSE+2.035
gerald+1.949
ARB+1.924
Neg logit contributions
skirts-2.898
lik-2.251
haps-2.247
arthy-2.161
EStream-2.143
Token 41
Token ID6073
Feature activation-7.74e+00

Pos logit contributions
advertisement+2.673
INT+2.226
ENSE+2.200
ARB+2.099
gerald+2.089
Neg logit contributions
skirts-3.090
haps-2.358
lik-2.344
arthy-2.261
cigarettes-2.223
Token-
Token ID12
Feature activation-1.25e+01

Pos logit contributions
advertisement+5.714
INT+4.770
ENSE+4.759
ARB+4.481
Db+4.383
Neg logit contributions
skirts-7.250
lik-5.649
haps-5.567
EStream-5.480
arthy-5.441
Token27
Token ID1983
Feature activation-1.02e+01

Pos logit contributions
advertisement+6.139
INT+4.658
ENSE+4.060
Db+3.791
ARB+3.696
Neg logit contributions
skirts-13.896
 constitu-11.830
EStream-11.794
arthy-11.519
haps-11.513
Token Vanderbilt
Token ID38684
Feature activation-2.01e+00

Pos logit contributions
advertisement+5.994
ENSE+4.816
INT+4.804
ARB+4.497
Db+4.306
Neg logit contributions
skirts-10.770
lik-8.612
haps-8.534
arthy-8.461
cigarettes-8.393
Token win
Token ID1592
Feature activation-2.42e+00

Pos logit contributions
advertisement+1.268
INT+0.992
ENSE+0.991
ARB+0.931
gerald+0.908
Neg logit contributions
skirts-2.167
haps-1.750
lik-1.741
arthy-1.681
EStream-1.674
Token that
Token ID326
Feature activation-5.38e+00

Pos logit contributions
advertisement+1.713
INT+1.375
ENSE+1.361
ARB+1.311
gerald+1.278
Neg logit contributions
skirts-2.411
haps-1.918
lik-1.906
arthy-1.833
EStream-1.815
Token broke
Token ID6265
Feature activation-7.30e+00

Pos logit contributions
advertisement+4.301
ENSE+3.596
INT+3.546
ARB+3.482
gerald+3.406
Neg logit contributions
skirts-4.711
lik-3.611
haps-3.567
EStream-3.501
arthy-3.454
Token graph
Token ID4823
Feature activation-1.20e-01

Pos logit contributions
skirts+0.423
haps+0.327
lik+0.327
EStream+0.311
arthy+0.310
Neg logit contributions
advertisement-0.361
INT-0.297
ENSE-0.294
ARB-0.285
gerald-0.282
Token).
Token ID737
Feature activation+2.59e+00

Pos logit contributions
advertisement+0.089
INT+0.072
ENSE+0.071
ARB+0.069
Db+0.068
Neg logit contributions
skirts-0.115
haps-0.090
lik-0.090
arthy-0.087
EStream-0.086
Token For
Token ID1114
Feature activation-2.62e+00

Pos logit contributions
skirts+2.207
lik+1.672
haps+1.670
arthy+1.571
EStream+1.566
Neg logit contributions
advertisement-2.233
INT-1.860
ENSE-1.855
ARB-1.803
gerald-1.780
Token the
Token ID262
Feature activation-1.69e+00

Pos logit contributions
advertisement+2.047
INT+1.736
ENSE+1.690
ARB+1.647
Db+1.615
Neg logit contributions
skirts-2.353
haps-1.818
lik-1.811
arthy-1.753
EStream-1.750
Token 17
Token ID1596
Feature activation-5.05e+00

Pos logit contributions
advertisement+1.344
INT+1.133
ENSE+1.104
ARB+1.073
Bench+1.055
Neg logit contributions
skirts-1.527
haps-1.185
lik-1.165
EStream-1.126
arthy-1.120
Token-
Token ID12
Feature activation-1.23e+01

Pos logit contributions
advertisement+4.009
INT+3.371
ENSE+3.330
ARB+3.230
Db+3.183
Neg logit contributions
skirts-4.535
haps-3.463
lik-3.429
EStream-3.295
arthy-3.252
Token40
Token ID1821
Feature activation-8.05e+00

Pos logit contributions
advertisement+5.876
INT+4.805
ENSE+4.387
Db+4.313
ARB+4.300
Neg logit contributions
skirts-13.787
haps-11.285
 vo-11.148
nels-10.878
EStream-10.865
Token and
Token ID290
Feature activation-4.34e+00

Pos logit contributions
advertisement+5.746
INT+4.943
ENSE+4.866
MK+4.684
ARB+4.672
Neg logit contributions
skirts-7.346
haps-5.677
lik-5.524
cigarettes-5.490
arthy-5.466
Token 16
Token ID1467
Feature activation+4.24e-01

Pos logit contributions
advertisement+3.271
INT+2.786
ENSE+2.668
ARB+2.635
gerald+2.551
Neg logit contributions
skirts-4.025
haps-3.102
lik-3.038
EStream-2.957
arthy-2.957
Token-
Token ID12
Feature activation-5.86e+00

Pos logit contributions
skirts+0.378
haps+0.288
lik+0.286
EStream+0.275
arthy+0.271
Neg logit contributions
advertisement-0.350
INT-0.291
ENSE-0.288
ARB-0.281
gerald-0.277
Token35
Token ID2327
Feature activation-2.48e+00

Pos logit contributions
advertisement+3.212
INT+2.461
ARB+2.327
ENSE+2.258
MK+2.163
Neg logit contributions
skirts-6.834
haps-5.483
EStream-5.439
 constitu-5.418
 vo-5.374
Token feels
Token ID5300
Feature activation-1.29e+00

Pos logit contributions
skirts+2.031
lik+1.483
haps+1.478
arthy+1.386
EStream+1.373
Neg logit contributions
advertisement-2.494
INT-2.109
ENSE-2.105
ARB-2.051
gerald-2.035
Token like
Token ID588
Feature activation+1.85e+00

Pos logit contributions
advertisement+1.034
ENSE+0.854
INT+0.853
ARB+0.818
gerald+0.810
Neg logit contributions
skirts-1.176
haps-0.910
lik-0.893
EStream-0.874
arthy-0.867
Token it
Token ID340
Feature activation+4.04e+00

Pos logit contributions
skirts+1.635
lik+1.249
haps+1.246
arthy+1.186
EStream+1.175
Neg logit contributions
advertisement-1.537
INT-1.270
ENSE-1.270
ARB-1.228
gerald-1.217
Token struggles
Token ID12766
Feature activation-2.43e+00

Pos logit contributions
skirts+3.001
lik+2.169
haps+2.159
arthy+2.014
EStream+1.995
Neg logit contributions
advertisement-3.904
INT-3.319
ENSE-3.316
ARB-3.232
gerald-3.209
Token to
Token ID284
Feature activation-8.34e+00

Pos logit contributions
advertisement+1.981
INT+1.628
ENSE+1.618
INESS+1.549
gerald+1.535
Neg logit contributions
skirts-2.185
haps-1.674
lik-1.635
arthy-1.601
EStream-1.585
Token keep
Token ID1394
Feature activation-1.23e+01

Pos logit contributions
advertisement+5.413
INT+4.325
ENSE+4.214
Db+3.967
Bench+3.889
Neg logit contributions
skirts-8.680
EStream-6.797
haps-6.669
lik-6.622
arthy-6.571
Token up
Token ID510
Feature activation-3.73e-01

Pos logit contributions
advertisement+7.498
ENSE+6.445
INT+6.177
Db+5.856
ARB+5.783
Neg logit contributions
skirts-11.929
haps-9.696
EStream-9.530
arthy-9.057
lik-8.939
Token with
Token ID351
Feature activation-5.95e+00

Pos logit contributions
advertisement+0.283
ENSE+0.229
INT+0.228
ARB+0.218
Db+0.217
Neg logit contributions
skirts-0.357
haps-0.279
lik-0.276
arthy-0.266
EStream-0.265
Token the
Token ID262
Feature activation-2.83e+00

Pos logit contributions
advertisement+4.542
INT+3.854
ENSE+3.764
Db+3.595
ARB+3.586
Neg logit contributions
skirts-5.380
haps-4.129
lik-4.095
arthy-4.064
EStream-4.037
Token two
Token ID734
Feature activation-5.70e-02

Pos logit contributions
advertisement+2.313
INT+1.925
ENSE+1.901
Db+1.840
ARB+1.811
Neg logit contributions
skirts-2.534
lik-1.901
haps-1.894
EStream-1.835
arthy-1.823
Token top
Token ID1353
Feature activation+3.02e+00

Pos logit contributions
advertisement+0.047
INT+0.039
ENSE+0.039
Db+0.038
ARB+0.037
Neg logit contributions
skirts-0.051
lik-0.038
haps-0.038
EStream-0.037
arthy-0.037
Token New
Token ID968
Feature activation-1.85e+00

Pos logit contributions
advertisement+2.059
INT+1.610
ENSE+1.601
ARB+1.532
Db+1.523
Neg logit contributions
skirts-3.255
haps-2.630
lik-2.588
arthy-2.489
cigarettes-2.479
Token-
Token ID12
Feature activation-6.84e+00

Pos logit contributions
advertisement+1.474
INT+1.212
ENSE+1.187
ARB+1.155
Db+1.146
Neg logit contributions
skirts-1.718
haps-1.335
lik-1.306
EStream-1.292
arthy-1.249
TokenObject
Token ID10267
Feature activation-6.58e+00

Pos logit contributions
advertisement+3.165
INT+2.283
Db+2.048
ARB+1.986
ENSE+1.950
Neg logit contributions
skirts-8.609
haps-7.330
EStream-7.101
lik-7.095
accompan-7.090
Token System
Token ID4482
Feature activation-7.09e+00

Pos logit contributions
advertisement+3.548
INT+2.669
ENSE+2.579
Db+2.498
ARB+2.437
Neg logit contributions
skirts-7.551
haps-6.325
lik-6.180
accompan-6.078
arthy-5.984
Token.
Token ID13
Feature activation-6.97e+00

Pos logit contributions
advertisement+3.924
INT+2.795
ENSE+2.604
Db+2.541
ARB+2.493
Neg logit contributions
skirts-8.238
haps-6.930
accompan-6.897
EStream-6.837
 constitu-6.701
TokenDraw
Token ID25302
Feature activation-1.22e+01

Pos logit contributions
advertisement+3.518
INT+2.585
ENSE+2.412
ARB+2.389
Db+2.381
Neg logit contributions
skirts-8.369
haps-7.196
lik-6.951
EStream-6.888
 constitu-6.876
Tokening
Token ID278
Feature activation-9.13e+00

Pos logit contributions
advertisement+7.761
INT+5.730
Db+5.575
ENSE+5.298
Bench+5.280
Neg logit contributions
skirts-12.375
accompan-10.652
EStream-10.452
 constitu-10.391
 occas-10.135
Token.
Token ID13
Feature activation-6.54e+00

Pos logit contributions
advertisement+4.792
INT+3.300
ENSE+3.056
Db+3.043
Bench+3.022
Neg logit contributions
skirts-10.720
accompan-9.246
haps-9.131
EStream-8.944
 constitu-8.861
TokenRect
Token ID45474
Feature activation-8.23e+00

Pos logit contributions
advertisement+2.752
INT+1.892
ARB+1.780
Db+1.648
Bench+1.646
Neg logit contributions
skirts-8.428
haps-7.225
lik-7.075
 constitu-6.929
accompan-6.909
Tokenangle
Token ID9248
Feature activation-1.85e+00

Pos logit contributions
advertisement+2.183
INT+0.984
ARB+0.800
ENSE+0.703
gerald+0.632
Neg logit contributions
skirts-11.593
EStream-9.905
 constitu-9.818
accompan-9.710
lik-9.694
TokenF
Token ID37
Feature activation-2.63e+00

Pos logit contributions
advertisement+1.107
INT+0.843
ENSE+0.839
ARB+0.803
gerald+0.788
Neg logit contributions
skirts-2.049
lik-1.668
haps-1.666
arthy-1.597
 vo-1.586
Token the
Token ID262
Feature activation+2.22e+00

Pos logit contributions
skirts+0.851
haps+0.654
lik+0.652
EStream+0.625
arthy+0.622
Neg logit contributions
advertisement-0.767
INT-0.625
ENSE-0.623
ARB-0.607
gerald-0.599
Token prime
Token ID6994
Feature activation+4.23e+00

Pos logit contributions
skirts+1.964
haps+1.499
lik+1.496
arthy+1.422
EStream+1.412
Neg logit contributions
advertisement-1.846
INT-1.523
ENSE-1.515
ARB-1.473
gerald-1.459
Token challenge
Token ID4427
Feature activation+1.43e-01

Pos logit contributions
skirts+3.865
lik+3.028
haps+3.009
 vo+2.849
arthy+2.835
Neg logit contributions
advertisement-3.284
ENSE-2.704
INT-2.690
ARB-2.609
gerald-2.586
Token Belfast
Token ID29713
Feature activation-4.16e+00

Pos logit contributions
skirts+0.122
haps+0.093
lik+0.093
EStream+0.088
arthy+0.088
Neg logit contributions
advertisement-0.123
INT-0.102
ENSE-0.100
ARB-0.098
gerald-0.097
TokenTe
Token ID6767
Feature activation-7.46e+00

Pos logit contributions
advertisement+2.975
ENSE+2.366
INT+2.335
INESS+2.285
ARB+2.277
Neg logit contributions
skirts-4.092
haps-3.232
lik-3.156
arthy-3.130
EStream-3.070
Tokenlegraph
Token ID16606
Feature activation-1.22e+01

Pos logit contributions
advertisement+2.409
INT+1.637
ARB+1.345
ENSE+1.262
Db+1.152
Neg logit contributions
skirts-10.169
lik-8.627
arthy-8.533
EStream-8.476
anism-8.323
Token.
Token ID13
Feature activation+1.03e-01

Pos logit contributions
advertisement+5.328
Db+3.296
MK+3.214
INT+3.091
INESS+3.089
Neg logit contributions
skirts-14.881
accompan-12.980
haps-12.745
EStream-12.344
lik-12.035
Tokenco
Token ID1073
Feature activation-4.83e+00

Pos logit contributions
skirts+0.085
haps+0.064
lik+0.064
EStream+0.062
arthy+0.061
Neg logit contributions
advertisement-0.091
INT-0.076
ENSE-0.076
ARB-0.074
Db-0.073
Token.
Token ID13
Feature activation-1.36e+00

Pos logit contributions
advertisement+2.751
INT+1.993
ENSE+1.911
ARB+1.855
Db+1.845
Neg logit contributions
skirts-5.515
accompan-4.526
haps-4.517
EStream-4.467
 constitu-4.414
Tokenuk
Token ID2724
Feature activation+3.68e+00

Pos logit contributions
advertisement+1.325
INT+1.121
ENSE+1.111
ARB+1.091
MK+1.078
Neg logit contributions
skirts-1.004
haps-0.719
lik-0.698
EStream-0.693
arthy-0.681
Token Dis
Token ID3167
Feature activation-4.02e-01

Pos logit contributions
skirts+3.233
lik+2.483
haps+2.457
arthy+2.340
 vo+2.323
Neg logit contributions
advertisement-3.014
ENSE-2.497
INT-2.488
ARB-2.414
gerald-2.401
Token 2012
Token ID2321
Feature activation-4.77e+00

Pos logit contributions
advertisement+4.158
INT+3.447
ENSE+3.430
gerald+3.299
ARB+3.204
Neg logit contributions
skirts-5.260
haps-4.122
lik-4.088
arthy-4.056
EStream-3.989
Token,
Token ID11
Feature activation-3.77e+00

Pos logit contributions
advertisement+3.009
ENSE+2.338
INT+2.329
ARB+2.209
Db+2.189
Neg logit contributions
skirts-5.058
lik-4.077
haps-4.065
arthy-3.902
EStream-3.876
Token it
Token ID340
Feature activation-3.63e+00

Pos logit contributions
advertisement+2.974
ENSE+2.467
INT+2.438
ARB+2.337
Db+2.321
Neg logit contributions
skirts-3.419
lik-2.632
haps-2.624
arthy-2.519
EStream-2.511
Token was
Token ID373
Feature activation-2.79e+00

Pos logit contributions
advertisement+3.164
INT+2.666
ENSE+2.653
ARB+2.635
gerald+2.569
Neg logit contributions
skirts-2.989
lik-2.166
haps-2.158
arthy-2.116
EStream-2.105
Token 68
Token ID8257
Feature activation-3.90e+00

Pos logit contributions
advertisement+2.246
INT+1.854
ENSE+1.845
gerald+1.767
ARB+1.766
Neg logit contributions
skirts-2.523
haps-1.911
lik-1.908
arthy-1.842
EStream-1.839
Token-
Token ID12
Feature activation-1.22e+01

Pos logit contributions
advertisement+2.949
INT+2.385
ENSE+2.351
Db+2.277
ARB+2.259
Neg logit contributions
skirts-3.720
haps-2.875
lik-2.873
EStream-2.808
arthy-2.742
Token2
Token ID17
Feature activation-7.33e+00

Pos logit contributions
advertisement+5.170
INT+3.868
ENSE+3.429
MK+3.176
ARB+3.117
Neg logit contributions
skirts-14.243
EStream-11.977
haps-11.691
lik-11.682
arthy-11.613
Token.
Token ID13
Feature activation-1.87e+00

Pos logit contributions
advertisement+4.755
ENSE+3.734
INT+3.717
ARB+3.497
Db+3.461
Neg logit contributions
skirts-7.471
lik-5.926
haps-5.912
arthy-5.710
cigarettes-5.677
Token In
Token ID554
Feature activation-5.27e+00

Pos logit contributions
advertisement+1.577
INT+1.320
ENSE+1.309
ARB+1.253
Db+1.248
Neg logit contributions
skirts-1.634
haps-1.264
lik-1.240
EStream-1.175
arthy-1.157
Token 2011
Token ID2813
Feature activation-5.98e+00

Pos logit contributions
advertisement+3.528
ENSE+2.947
INT+2.922
gerald+2.790
ARB+2.739
Neg logit contributions
skirts-5.222
lik-4.149
haps-4.072
EStream-4.016
arthy-3.993
Token,
Token ID11
Feature activation-4.20e+00

Pos logit contributions
advertisement+3.749
ENSE+2.929
INT+2.899
ARB+2.761
gerald+2.733
Neg logit contributions
skirts-6.312
lik-5.073
haps-5.062
arthy-4.879
EStream-4.836
Token said
Token ID531
Feature activation+5.08e+00

Pos logit contributions
skirts+3.649
lik+2.782
haps+2.770
arthy+2.617
 vo+2.592
Neg logit contributions
advertisement-3.600
INT-2.987
ENSE-2.985
ARB-2.896
gerald-2.872
Token of
Token ID286
Feature activation+6.67e+00

Pos logit contributions
skirts+4.781
lik+3.751
haps+3.745
arthy+3.554
 vo+3.531
Neg logit contributions
advertisement-3.861
INT-3.138
ENSE-3.136
ARB-3.022
gerald-3.002
Token NHS
Token ID18183
Feature activation-1.91e+00

Pos logit contributions
skirts+5.577
haps+4.264
lik+4.248
 vo+3.997
arthy+3.979
Neg logit contributions
advertisement-5.624
ENSE-4.740
INT-4.724
ARB-4.529
INESS-4.526
Token staff
Token ID3085
Feature activation+1.21e+00

Pos logit contributions
advertisement+1.686
INT+1.410
ENSE+1.394
ARB+1.346
gerald+1.346
Neg logit contributions
skirts-1.597
lik-1.166
haps-1.163
EStream-1.121
arthy-1.105
Token:
Token ID25
Feature activation+6.01e+00

Pos logit contributions
skirts+1.163
lik+0.906
haps+0.905
EStream+0.863
arthy+0.861
Neg logit contributions
advertisement-0.907
INT-0.728
ENSE-0.727
ARB-0.700
gerald-0.697
Token
Token ID198
Feature activation+8.88e+00

Pos logit contributions
skirts+3.700
haps+2.468
lik+2.459
arthy+2.259
EStream+2.232
Neg logit contributions
advertisement-6.451
INT-5.634
ENSE-5.630
ARB-5.492
gerald-5.485
Token
Token ID198
Feature activation+4.72e+00

Pos logit contributions
skirts+4.497
lik+2.956
haps+2.723
arthy+2.573
 vo+2.540
Neg logit contributions
advertisement-9.746
allery-8.841
ENSE-8.748
gerald-8.634
INT-8.612
Token
Token ID447
Feature activation+1.02e+00

Pos logit contributions
skirts+4.714
lik+3.755
haps+3.743
arthy+3.562
cigarettes+3.534
Neg logit contributions
advertisement-3.308
ENSE-2.652
INT-2.642
ARB-2.543
gerald-2.519
Token
Token ID246
Feature activation+3.85e+00

Pos logit contributions
skirts+0.803
lik+0.595
haps+0.590
arthy+0.554
EStream+0.548
Neg logit contributions
advertisement-0.938
INT-0.789
ENSE-0.788
ARB-0.767
gerald-0.761
TokenThey
Token ID2990
Feature activation-1.94e+00

Pos logit contributions
skirts+3.853
lik+3.063
haps+3.053
arthy+2.913
EStream+2.892
Neg logit contributions
advertisement-2.721
INT-2.166
ENSE-2.165
ARB-2.084
gerald-2.062
Token care
Token ID1337
Feature activation+5.86e-01

Pos logit contributions
advertisement+1.739
ENSE+1.442
INT+1.431
ARB+1.396
gerald+1.392
Neg logit contributions
skirts-1.573
haps-1.146
EStream-1.141
lik-1.138
cigarettes-1.072
Token
Token ID198
Feature activation+2.27e+00

Pos logit contributions
advertisement+2.586
INT+2.174
ENSE+2.140
Db+2.091
ARB+2.088
Neg logit contributions
skirts-2.355
haps-1.793
lik-1.754
EStream-1.701
arthy-1.633
Token
Token ID198
Feature activation-1.46e+00

Pos logit contributions
skirts+1.510
lik+1.046
haps+1.038
arthy+0.957
 vo+0.944
Neg logit contributions
advertisement-2.362
ENSE-2.034
INT-2.034
ARB-1.986
gerald-1.976
TokenThey
Token ID2990
Feature activation-3.75e+00

Pos logit contributions
advertisement+0.972
INT+0.755
ENSE+0.731
ARB+0.715
Bench+0.705
Neg logit contributions
skirts-1.556
haps-1.253
lik-1.243
EStream-1.211
arthy-1.203
Token are
Token ID389
Feature activation+2.50e-01

Pos logit contributions
advertisement+3.208
ENSE+2.736
INT+2.712
gerald+2.673
ARB+2.662
Neg logit contributions
skirts-3.062
EStream-2.263
lik-2.229
haps-2.204
arthy-2.103
Token now
Token ID783
Feature activation+3.37e+00

Pos logit contributions
skirts+0.220
lik+0.168
haps+0.167
EStream+0.161
arthy+0.159
Neg logit contributions
advertisement-0.207
INT-0.171
ENSE-0.171
ARB-0.166
gerald-0.164
Token expected
Token ID2938
Feature activation+8.81e+00

Pos logit contributions
skirts+2.996
lik+2.310
haps+2.300
arthy+2.176
 vo+2.166
Neg logit contributions
advertisement-2.751
INT-2.263
ENSE-2.263
ARB-2.191
gerald-2.175
Token to
Token ID284
Feature activation+9.25e-01

Pos logit contributions
skirts+5.916
lik+4.189
haps+4.127
 vo+3.886
arthy+3.787
Neg logit contributions
advertisement-8.713
ENSE-7.452
INT-7.373
ARB-7.301
gerald-7.218
Token face
Token ID1986
Feature activation-1.50e+00

Pos logit contributions
skirts+0.908
lik+0.715
haps+0.709
EStream+0.688
arthy+0.680
Neg logit contributions
advertisement-0.675
INT-0.543
ENSE-0.541
ARB-0.519
gerald-0.515
Token a
Token ID257
Feature activation+5.83e-02

Pos logit contributions
advertisement+1.210
INT+0.992
ENSE+0.989
ARB+0.954
gerald+0.947
Neg logit contributions
skirts-1.337
haps-1.031
lik-1.023
EStream-0.997
arthy-0.989
Token run
Token ID1057
Feature activation+5.02e-01

Pos logit contributions
skirts+0.054
haps+0.043
lik+0.042
arthy+0.040
EStream+0.040
Neg logit contributions
advertisement-0.045
INT-0.037
ENSE-0.037
ARB-0.035
gerald-0.035
Token-
Token ID12
Feature activation-1.79e+00

Pos logit contributions
skirts+0.430
haps+0.326
lik+0.324
EStream+0.308
arthy+0.308
Neg logit contributions
advertisement-0.428
INT-0.356
ENSE-0.355
ARB-0.345
gerald-0.341
Tokens
Token ID82
Feature activation-6.68e+00

Pos logit contributions
advertisement+2.924
INT+2.171
ENSE+2.163
ARB+1.994
gerald+1.993
Neg logit contributions
skirts-6.621
haps-5.413
lik-5.317
EStream-5.278
nels-5.190
Token John
Token ID1757
Feature activation-7.76e+00

Pos logit contributions
advertisement+4.576
ENSE+3.683
INT+3.672
ARB+3.525
Db+3.452
Neg logit contributions
skirts-6.659
haps-5.100
nels-5.075
EStream-5.036
lik-5.005
Token Dick
Token ID11740
Feature activation-4.91e+00

Pos logit contributions
advertisement+4.641
INT+3.778
ENSE+3.714
ARB+3.609
Bench+3.571
Neg logit contributions
skirts-8.232
EStream-6.490
haps-6.490
lik-6.480
nels-6.251
Tokenerson
Token ID882
Feature activation-1.25e-01

Pos logit contributions
advertisement+3.829
INT+3.299
ENSE+3.262
ARB+3.106
gerald+3.057
Neg logit contributions
skirts-4.427
haps-3.365
lik-3.284
EStream-3.200
arthy-3.191
Token that
Token ID326
Feature activation+1.71e+00

Pos logit contributions
advertisement+0.097
ENSE+0.079
INT+0.078
ARB+0.076
gerald+0.075
Neg logit contributions
skirts-0.118
haps-0.092
lik-0.091
arthy-0.087
EStream-0.087
Token pre
Token ID662
Feature activation+9.65e+00

Pos logit contributions
skirts+1.522
lik+1.163
haps+1.160
arthy+1.100
EStream+1.096
Neg logit contributions
advertisement-1.416
ENSE-1.166
INT-1.164
ARB-1.131
gerald-1.117
Token-
Token ID12
Feature activation+3.06e+00

Pos logit contributions
skirts+7.454
haps+5.679
lik+5.646
 vo+5.229
arthy+5.140
Neg logit contributions
advertisement-8.563
ENSE-7.293
INT-7.203
ARB-7.183
INESS-7.047
Tokenexisting
Token ID25687
Feature activation-8.23e-01

Pos logit contributions
skirts+2.695
lik+2.071
haps+2.057
arthy+1.946
EStream+1.929
Neg logit contributions
advertisement-2.521
ENSE-2.085
INT-2.081
ARB-2.018
gerald-2.004
Token conditions
Token ID3403
Feature activation+3.34e+00

Pos logit contributions
advertisement+0.833
ENSE+0.718
INT+0.712
ARB+0.689
gerald+0.688
Neg logit contributions
skirts-0.582
haps-0.405
lik-0.402
EStream-0.376
arthy-0.369
Token were
Token ID547
Feature activation+2.88e+00

Pos logit contributions
skirts+2.526
lik+1.845
haps+1.832
arthy+1.713
 vo+1.699
Neg logit contributions
advertisement-3.174
INT-2.694
ENSE-2.692
ARB-2.622
gerald-2.606
Token going
Token ID1016
Feature activation+5.54e+00

Pos logit contributions
skirts+2.390
lik+1.799
haps+1.791
arthy+1.687
EStream+1.672
Neg logit contributions
advertisement-2.528
INT-2.112
ENSE-2.110
ARB-2.050
gerald-2.034
Token people
Token ID661
Feature activation+1.57e+00

Pos logit contributions
skirts+0.042
haps+0.032
lik+0.032
EStream+0.031
arthy+0.030
Neg logit contributions
advertisement-0.038
INT-0.031
ENSE-0.031
ARB-0.030
Db-0.030
Token)
Token ID8
Feature activation+4.53e-01

Pos logit contributions
skirts+1.386
lik+1.056
haps+1.055
EStream+1.001
arthy+1.000
Neg logit contributions
advertisement-1.296
INT-1.068
ENSE-1.067
ARB-1.033
gerald-1.022
Token and
Token ID290
Feature activation-6.37e-01

Pos logit contributions
skirts+0.457
haps+0.358
lik+0.357
arthy+0.344
EStream+0.344
Neg logit contributions
advertisement-0.320
INT-0.254
ENSE-0.252
ARB-0.246
Db-0.240
Token other
Token ID584
Feature activation+5.60e+00

Pos logit contributions
advertisement+0.527
INT+0.437
ENSE+0.431
ARB+0.426
gerald+0.414
Neg logit contributions
skirts-0.566
haps-0.425
lik-0.424
EStream-0.418
arthy-0.404
Token gender
Token ID5279
Feature activation+4.97e+00

Pos logit contributions
skirts+4.228
lik+3.121
haps+3.082
arthy+2.893
 vo+2.885
Neg logit contributions
advertisement-5.281
ENSE-4.479
INT-4.462
ARB-4.349
gerald-4.335
Token non
Token ID1729
Feature activation+9.01e+00

Pos logit contributions
skirts+4.046
lik+3.091
haps+3.048
arthy+2.869
 vo+2.856
Neg logit contributions
advertisement-4.381
ENSE-3.684
INT-3.665
ARB-3.560
gerald-3.555
Tokencon
Token ID1102
Feature activation+3.37e+00

Pos logit contributions
skirts+7.985
lik+6.408
haps+6.258
arthy+5.944
rights+5.939
Neg logit contributions
advertisement-6.970
ENSE-5.814
INT-5.701
ARB-5.578
gerald-5.488
Tokenforming
Token ID15464
Feature activation+1.18e+00

Pos logit contributions
skirts+1.932
lik+1.232
haps+1.225
arthy+1.106
EStream+1.096
Neg logit contributions
advertisement-3.834
INT-3.346
ENSE-3.342
ARB-3.268
gerald-3.254
Token (
Token ID357
Feature activation+1.91e-01

Pos logit contributions
skirts+0.892
haps+0.642
lik+0.638
EStream+0.603
arthy+0.597
Neg logit contributions
advertisement-1.130
INT-0.962
ENSE-0.954
ARB-0.935
gerald-0.922
TokenG
Token ID38
Feature activation-5.09e+00

Pos logit contributions
skirts+0.179
haps+0.140
lik+0.139
EStream+0.134
arthy+0.133
Neg logit contributions
advertisement-0.148
INT-0.122
ENSE-0.119
ARB-0.116
Db-0.115
TokenNC
Token ID7792
Feature activation-4.10e-01

Pos logit contributions
advertisement+2.888
ENSE+2.288
INT+2.264
ARB+2.186
Db+2.072
Neg logit contributions
skirts-5.789
haps-4.589
EStream-4.577
lik-4.572
cigarettes-4.463
Token
Token ID198
Feature activation-1.51e+00

Pos logit contributions
skirts+0.420
haps+0.297
lik+0.293
EStream+0.279
arthy+0.273
Neg logit contributions
advertisement-0.592
INT-0.505
ENSE-0.501
ARB-0.491
Db-0.485
TokenMy
Token ID3666
Feature activation-3.18e+00

Pos logit contributions
advertisement+1.069
INT+0.837
ENSE+0.813
ARB+0.801
Bench+0.774
Neg logit contributions
skirts-1.555
haps-1.236
lik-1.225
EStream-1.192
arthy-1.191
Token advice
Token ID5608
Feature activation-2.17e+00

Pos logit contributions
advertisement+2.438
INT+2.010
ENSE+1.995
ARB+1.932
gerald+1.915
Neg logit contributions
skirts-2.959
lik-2.270
haps-2.266
arthy-2.199
EStream-2.177
Token to
Token ID284
Feature activation-3.14e+00

Pos logit contributions
advertisement+1.870
ENSE+1.534
INT+1.529
ARB+1.476
Db+1.471
Neg logit contributions
skirts-1.864
lik-1.395
haps-1.381
arthy-1.337
EStream-1.298
Token Donald
Token ID3759
Feature activation+2.50e+00

Pos logit contributions
advertisement+2.426
INT+1.966
ENSE+1.964
ARB+1.910
gerald+1.877
Neg logit contributions
skirts-2.908
lik-2.253
haps-2.215
EStream-2.167
arthy-2.151
Token Trump
Token ID1301
Feature activation+8.82e+00

Pos logit contributions
skirts+2.029
lik+1.520
haps+1.512
arthy+1.424
EStream+1.411
Neg logit contributions
advertisement-2.230
INT-1.867
ENSE-1.867
ARB-1.813
gerald-1.801
Token is
Token ID318
Feature activation+1.70e+00

Pos logit contributions
skirts+7.363
lik+5.725
haps+5.653
arthy+5.300
cigarettes+5.279
Neg logit contributions
advertisement-7.134
ENSE-5.987
INT-5.952
ARB-5.751
gerald-5.738
Token simply
Token ID2391
Feature activation+6.70e-01

Pos logit contributions
skirts+1.524
lik+1.170
haps+1.166
arthy+1.113
EStream+1.103
Neg logit contributions
advertisement-1.382
ENSE-1.140
INT-1.137
ARB-1.095
gerald-1.087
Token to
Token ID284
Feature activation+4.51e-01

Pos logit contributions
skirts+0.601
haps+0.460
lik+0.459
arthy+0.442
EStream+0.437
Neg logit contributions
advertisement-0.545
ENSE-0.452
INT-0.449
ARB-0.431
gerald-0.428
Token turn
Token ID1210
Feature activation-1.05e+00

Pos logit contributions
skirts+0.428
lik+0.332
haps+0.331
EStream+0.321
arthy+0.318
Neg logit contributions
advertisement-0.344
ENSE-0.280
INT-0.280
ARB-0.268
gerald-0.266
Token off
Token ID572
Feature activation+4.36e+00

Pos logit contributions
advertisement+0.769
INT+0.632
ENSE+0.629
ARB+0.602
Db+0.590
Neg logit contributions
skirts-1.014
haps-0.797
lik-0.792
EStream-0.768
arthy-0.762
Token election
Token ID3071
Feature activation+8.13e+00

Pos logit contributions
skirts+1.762
lik+1.341
haps+1.338
arthy+1.264
EStream+1.257
Neg logit contributions
advertisement-1.728
INT-1.431
ENSE-1.430
ARB-1.388
gerald-1.375
Token results
Token ID2482
Feature activation+1.39e+00

Pos logit contributions
skirts+6.361
lik+4.953
haps+4.713
 vo+4.650
arthy+4.499
Neg logit contributions
advertisement-7.082
ENSE-6.045
INT-6.007
ARB-5.886
gerald-5.844
Token with
Token ID351
Feature activation-9.58e-01

Pos logit contributions
skirts+1.253
haps+0.961
lik+0.959
arthy+0.910
EStream+0.906
Neg logit contributions
advertisement-1.136
INT-0.934
ENSE-0.929
ARB-0.902
gerald-0.893
Token granddaughter
Token ID46458
Feature activation-9.21e-01

Pos logit contributions
advertisement+0.789
INT+0.656
ENSE+0.649
ARB+0.637
gerald+0.630
Neg logit contributions
skirts-0.847
haps-0.644
lik-0.642
arthy-0.618
EStream-0.617
Token Kara
Token ID34856
Feature activation-3.23e-01

Pos logit contributions
advertisement+0.677
INT+0.548
ENSE+0.541
ARB+0.527
gerald+0.519
Neg logit contributions
skirts-0.894
haps-0.701
lik-0.694
EStream-0.673
arthy-0.670
Token Trib
Token ID10841
Feature activation+3.78e+00

Pos logit contributions
advertisement+0.239
INT+0.194
ENSE+0.193
ARB+0.187
gerald+0.186
Neg logit contributions
skirts-0.311
haps-0.243
lik-0.243
arthy-0.231
EStream-0.231
Tokenby
Token ID1525
Feature activation+3.77e+00

Pos logit contributions
skirts+3.436
lik+2.669
haps+2.643
arthy+2.510
 vo+2.481
Neg logit contributions
advertisement-3.034
INT-2.486
ENSE-2.481
ARB-2.402
gerald-2.366
Token,
Token ID11
Feature activation+1.36e+00

Pos logit contributions
skirts+3.665
lik+2.901
haps+2.876
arthy+2.748
 vo+2.726
Neg logit contributions
advertisement-2.772
ENSE-2.231
INT-2.225
ARB-2.145
Db-2.112
Token 7
Token ID767
Feature activation+1.92e+00

Pos logit contributions
skirts+1.180
haps+0.890
lik+0.889
EStream+0.850
arthy+0.848
Neg logit contributions
advertisement-1.154
INT-0.954
ENSE-0.952
ARB-0.927
gerald-0.918
Token,
Token ID11
Feature activation+1.38e+00

Pos logit contributions
skirts+1.997
lik+1.602
haps+1.598
arthy+1.528
EStream+1.519
Neg logit contributions
advertisement-1.286
INT-1.008
ENSE-1.007
ARB-0.967
gerald-0.955
Token in
Token ID287
Feature activation+2.90e+00

Pos logit contributions
skirts+1.285
lik+0.997
haps+0.997
arthy+0.949
EStream+0.947
Neg logit contributions
advertisement-1.073
INT-0.872
ENSE-0.866
ARB-0.839
gerald-0.832
Tokenx
Token ID87
Feature activation+3.45e-01

Pos logit contributions
advertisement+3.942
INT+3.151
ENSE+3.046
gerald+2.958
Db+2.942
Neg logit contributions
skirts-6.124
haps-4.982
lik-4.885
EStream-4.879
arthy-4.765
Token.
Token ID13
Feature activation+1.70e+00

Pos logit contributions
skirts+0.316
haps+0.244
lik+0.243
EStream+0.232
arthy+0.230
Neg logit contributions
advertisement-0.275
INT-0.226
ENSE-0.225
ARB-0.218
gerald-0.215
Token
Token ID198
Feature activation+1.47e+00

Pos logit contributions
skirts+1.430
haps+1.080
lik+1.079
arthy+1.013
EStream+1.010
Neg logit contributions
advertisement-1.482
INT-1.236
ENSE-1.233
ARB-1.197
gerald-1.185
Token
Token ID198
Feature activation-1.27e+00

Pos logit contributions
skirts+1.007
lik+0.701
haps+0.700
arthy+0.644
EStream+0.643
Neg logit contributions
advertisement-1.520
INT-1.306
ENSE-1.302
ARB-1.273
gerald-1.261
TokenWire
Token ID29451
Feature activation-4.66e+00

Pos logit contributions
advertisement+0.843
INT+0.657
ENSE+0.642
ARB+0.624
Bench+0.606
Neg logit contributions
skirts-1.353
haps-1.089
lik-1.082
EStream-1.049
arthy-1.045
Tokenx
Token ID87
Feature activation+1.49e+00

Pos logit contributions
advertisement+2.974
INT+2.349
ENSE+2.231
gerald+2.208
Db+2.175
Neg logit contributions
skirts-4.856
EStream-3.944
haps-3.876
lik-3.819
arthy-3.786
Token,
Token ID11
Feature activation-3.47e-02

Pos logit contributions
skirts+1.267
lik+0.957
haps+0.956
EStream+0.903
arthy+0.901
Neg logit contributions
advertisement-1.276
INT-1.065
ENSE-1.062
ARB-1.033
gerald-1.022
Token a
Token ID257
Feature activation-3.50e-02

Pos logit contributions
advertisement+0.027
INT+0.022
ENSE+0.022
ARB+0.022
gerald+0.021
Neg logit contributions
skirts-0.032
lik-0.025
haps-0.025
EStream-0.024
arthy-0.024
Token provider
Token ID10131
Feature activation+4.16e-01

Pos logit contributions
advertisement+0.029
INT+0.024
ENSE+0.023
ARB+0.023
gerald+0.023
Neg logit contributions
skirts-0.031
haps-0.024
lik-0.024
EStream-0.023
arthy-0.023
Token of
Token ID286
Feature activation+5.30e-01

Pos logit contributions
skirts+0.305
lik+0.220
haps+0.218
arthy+0.203
 vo+0.201
Neg logit contributions
advertisement-0.406
ENSE-0.346
INT-0.346
ARB-0.337
gerald-0.336
Token bitcoin
Token ID8550
Feature activation+2.69e+00

Pos logit contributions
skirts+0.469
lik+0.358
haps+0.357
EStream+0.343
arthy+0.340
Neg logit contributions
advertisement-0.440
INT-0.366
ENSE-0.360
ARB-0.350
gerald-0.346
Token picture
Token ID4286
Feature activation+2.39e+00

Pos logit contributions
skirts+1.740
haps+1.347
lik+1.345
arthy+1.281
EStream+1.271
Neg logit contributions
advertisement-1.463
INT-1.205
ENSE-1.194
ARB-1.152
gerald-1.143
Token shoot
Token ID2686
Feature activation+2.91e+00

Pos logit contributions
skirts+1.895
haps+1.399
lik+1.397
arthy+1.312
EStream+1.306
Neg logit contributions
advertisement-2.204
INT-1.866
ENSE-1.858
ARB-1.818
gerald-1.799
Token was
Token ID373
Feature activation+2.28e+00

Pos logit contributions
skirts+2.422
lik+1.819
haps+1.814
arthy+1.709
EStream+1.696
Neg logit contributions
advertisement-2.559
INT-2.142
ENSE-2.137
ARB-2.079
gerald-2.064
Token delayed
Token ID11038
Feature activation+1.71e+00

Pos logit contributions
skirts+1.972
lik+1.503
haps+1.498
arthy+1.415
EStream+1.406
Neg logit contributions
advertisement-1.925
INT-1.598
ENSE-1.594
ARB-1.548
gerald-1.534
Token by
Token ID416
Feature activation-1.78e+00

Pos logit contributions
skirts+1.654
haps+1.295
lik+1.293
arthy+1.233
EStream+1.225
Neg logit contributions
advertisement-1.284
INT-1.040
ENSE-1.035
ARB-0.997
gerald-0.984
Token almost
Token ID2048
Feature activation+3.14e+00

Pos logit contributions
advertisement+1.436
INT+1.204
ENSE+1.181
ARB+1.131
gerald+1.124
Neg logit contributions
skirts-1.596
haps-1.234
lik-1.208
EStream-1.181
arthy-1.169
Token an
Token ID281
Feature activation+1.94e+00

Pos logit contributions
skirts+2.633
lik+1.984
haps+1.977
arthy+1.869
EStream+1.860
Neg logit contributions
advertisement-2.748
INT-2.294
ENSE-2.289
ARB-2.221
gerald-2.200
Token hour
Token ID1711
Feature activation+9.28e-01

Pos logit contributions
skirts+1.885
lik+1.481
haps+1.475
arthy+1.417
EStream+1.415
Neg logit contributions
advertisement-1.444
INT-1.165
ENSE-1.162
ARB-1.116
gerald-1.094
Token.[
Token ID3693
Feature activation-4.95e+00

Pos logit contributions
skirts+0.930
haps+0.736
lik+0.735
arthy+0.702
EStream+0.701
Neg logit contributions
advertisement-0.660
INT-0.529
ENSE-0.526
ARB-0.507
Db-0.497
Token12
Token ID1065
Feature activation-3.80e+00

Pos logit contributions
advertisement+2.384
INT+1.724
ENSE+1.572
ARB+1.555
MK+1.534
Neg logit contributions
skirts-6.004
haps-5.002
lik-4.931
EStream-4.879
arthy-4.795
Token]
Token ID60
Feature activation-6.12e-01

Pos logit contributions
advertisement+2.581
INT+2.058
ENSE+2.051
ARB+1.983
Db+1.942
Neg logit contributions
skirts-3.856
lik-3.098
haps-3.088
arthy-2.950
EStream-2.938
Token a
Token ID257
Feature activation-2.20e+00

Pos logit contributions
advertisement+3.538
INT+2.952
ENSE+2.918
gerald+2.792
ARB+2.775
Neg logit contributions
skirts-4.475
haps-3.516
lik-3.438
arthy-3.348
EStream-3.313
Token man
Token ID582
Feature activation-2.16e+00

Pos logit contributions
advertisement+1.692
INT+1.402
ENSE+1.392
gerald+1.345
ARB+1.310
Neg logit contributions
skirts-2.072
haps-1.623
lik-1.569
arthy-1.511
cigarettes-1.511
Token that
Token ID326
Feature activation-4.11e+00

Pos logit contributions
advertisement+1.627
ENSE+1.328
INT+1.317
ARB+1.259
Bench+1.249
Neg logit contributions
skirts-2.077
haps-1.600
lik-1.561
arthy-1.542
EStream-1.520
Token tall
Token ID7331
Feature activation-2.89e+00

Pos logit contributions
advertisement+3.611
INT+3.086
ENSE+3.043
Bench+2.977
Db+2.962
Neg logit contributions
skirts-3.241
haps-2.404
lik-2.343
arthy-2.307
EStream-2.281
Token and
Token ID290
Feature activation-2.93e+00

Pos logit contributions
advertisement+1.890
ENSE+1.510
INT+1.505
Bench+1.405
gerald+1.403
Neg logit contributions
skirts-2.984
haps-2.415
lik-2.400
arthy-2.313
EStream-2.283
Token heavy
Token ID4334
Feature activation+3.34e+00

Pos logit contributions
advertisement+2.220
ENSE+1.818
INT+1.804
gerald+1.707
Bench+1.700
Neg logit contributions
skirts-2.777
haps-2.191
lik-2.124
EStream-2.081
cigarettes-2.044
Token
Token ID851
Feature activation+1.00e+00

Pos logit contributions
skirts+3.316
lik+2.633
haps+2.618
arthy+2.502
 vo+2.486
Neg logit contributions
advertisement-2.384
INT-1.900
ENSE-1.894
ARB-1.835
gerald-1.810
Token and
Token ID290
Feature activation-2.36e-01

Pos logit contributions
skirts+1.067
haps+0.857
lik+0.856
arthy+0.821
EStream+0.820
Neg logit contributions
advertisement-0.648
INT-0.504
ENSE-0.501
ARB-0.478
gerald-0.473
Token turn
Token ID1210
Feature activation+2.90e+00

Pos logit contributions
advertisement+0.183
INT+0.151
ENSE+0.150
ARB+0.144
gerald+0.143
Neg logit contributions
skirts-0.221
haps-0.171
lik-0.170
EStream-0.165
arthy-0.163
Token back
Token ID736
Feature activation+2.01e+00

Pos logit contributions
skirts+2.636
lik+2.045
haps+2.031
arthy+1.927
 vo+1.916
Neg logit contributions
advertisement-2.320
ENSE-1.891
INT-1.891
ARB-1.835
gerald-1.819
Token to
Token ID284
Feature activation-9.99e-01

Pos logit contributions
skirts+1.780
lik+1.366
haps+1.364
arthy+1.293
EStream+1.283
Neg logit contributions
advertisement-1.648
INT-1.363
ENSE-1.360
ARB-1.317
gerald-1.305
Token The
Token ID383
Feature activation-5.98e-01

Pos logit contributions
skirts+1.208
lik+0.919
haps+0.918
EStream+0.869
arthy+0.860
Neg logit contributions
advertisement-1.200
INT-0.995
ENSE-0.987
ARB-0.961
gerald-0.950
Token boundary
Token ID18645
Feature activation+3.51e+00

Pos logit contributions
advertisement+0.490
INT+0.404
ENSE+0.402
ARB+0.392
gerald+0.387
Neg logit contributions
skirts-0.531
lik-0.403
haps-0.399
arthy-0.382
EStream-0.382
Token between
Token ID1022
Feature activation+1.34e+00

Pos logit contributions
skirts+2.969
lik+2.247
haps+2.238
arthy+2.111
EStream+2.094
Neg logit contributions
advertisement-3.031
INT-2.523
ENSE-2.520
ARB-2.448
gerald-2.427
Token Europe
Token ID2031
Feature activation-1.78e+00

Pos logit contributions
skirts+1.167
lik+0.886
haps+0.880
EStream+0.838
arthy+0.835
Neg logit contributions
advertisement-1.132
INT-0.939
ENSE-0.935
ARB-0.910
gerald-0.897
Token and
Token ID290
Feature activation-1.71e+00

Pos logit contributions
advertisement+1.007
INT+0.748
ENSE+0.744
ARB+0.711
gerald+0.700
Neg logit contributions
skirts-2.019
haps-1.646
lik-1.646
arthy-1.582
EStream-1.575
Token Asia
Token ID7229
Feature activation+3.09e+00

Pos logit contributions
advertisement+1.345
INT+1.108
ENSE+1.100
ARB+1.080
Db+1.045
Neg logit contributions
skirts-1.576
lik-1.224
haps-1.195
EStream-1.169
cigarettes-1.153
Token is
Token ID318
Feature activation-4.55e-01

Pos logit contributions
skirts+2.491
lik+1.851
haps+1.842
arthy+1.732
EStream+1.720
Neg logit contributions
advertisement-2.800
INT-2.353
ENSE-2.347
ARB-2.287
gerald-2.268
Token,
Token ID11
Feature activation+5.64e+00

Pos logit contributions
advertisement+0.379
INT+0.310
ENSE+0.310
ARB+0.304
gerald+0.300
Neg logit contributions
skirts-0.402
lik-0.305
haps-0.302
EStream-0.292
arthy-0.290
Token in
Token ID287
Feature activation+1.51e+00

Pos logit contributions
skirts+5.304
lik+4.191
haps+4.191
 vo+3.962
arthy+3.941
Neg logit contributions
advertisement-4.253
INT-3.455
ENSE-3.453
ARB-3.324
gerald-3.317
Token fact
Token ID1109
Feature activation+5.44e-01

Pos logit contributions
skirts+1.389
lik+1.072
haps+1.062
arthy+1.018
EStream+1.014
Neg logit contributions
advertisement-1.197
INT-0.981
ENSE-0.978
ARB-0.951
gerald-0.938
Token,
Token ID11
Feature activation+3.87e+00

Pos logit contributions
skirts+0.581
lik+0.465
haps+0.465
arthy+0.446
EStream+0.445
Neg logit contributions
advertisement-0.353
INT-0.274
ENSE-0.273
ARB-0.262
gerald-0.258
Token discovery
Token ID9412
Feature activation-8.13e-01

Pos logit contributions
skirts+0.438
haps+0.336
lik+0.332
EStream+0.319
arthy+0.318
Neg logit contributions
advertisement-0.398
INT-0.327
ENSE-0.327
ARB-0.315
gerald-0.311
Token.
Token ID13
Feature activation-1.01e+00

Pos logit contributions
advertisement+0.557
INT+0.436
ENSE+0.434
ARB+0.417
Db+0.408
Neg logit contributions
skirts-0.842
haps-0.669
lik-0.664
arthy-0.644
EStream-0.644
Token...
Token ID2644
Feature activation-1.84e-01

Pos logit contributions
advertisement+0.849
INT+0.695
ENSE+0.686
Db+0.663
ARB+0.662
Neg logit contributions
skirts-0.894
haps-0.691
lik-0.682
EStream-0.660
arthy-0.649
Token Metroid
Token ID45430
Feature activation-3.78e+00

Pos logit contributions
advertisement+0.158
INT+0.131
ENSE+0.130
ARB+0.125
Db+0.125
Neg logit contributions
skirts-0.159
haps-0.121
lik-0.120
arthy-0.115
EStream-0.115
Tokenv
Token ID85
Feature activation+2.31e+00

Pos logit contributions
advertisement+3.155
INT+2.633
ENSE+2.576
gerald+2.534
Db+2.505
Neg logit contributions
skirts-3.246
lik-2.459
haps-2.457
arthy-2.372
EStream-2.354
Tokenani
Token ID3216
Feature activation+3.28e-01

Pos logit contributions
skirts+2.736
lik+2.280
haps+2.271
arthy+2.192
 vo+2.167
Neg logit contributions
advertisement-1.177
ENSE-0.848
INT-0.842
gerald-0.793
ARB-0.790
Tokenas
Token ID292
Feature activation+4.38e-01

Pos logit contributions
skirts+0.263
lik+0.195
haps+0.191
arthy+0.183
EStream+0.181
Neg logit contributions
advertisement-0.300
INT-0.254
ENSE-0.251
ARB-0.248
Db-0.243
Token are
Token ID389
Feature activation-7.39e-01

Pos logit contributions
skirts+0.368
lik+0.277
haps+0.276
arthy+0.261
EStream+0.260
Neg logit contributions
advertisement-0.383
INT-0.319
ENSE-0.318
ARB-0.309
gerald-0.307
Token at
Token ID379
Feature activation+2.33e+00

Pos logit contributions
advertisement+0.598
INT+0.489
ENSE+0.488
ARB+0.472
gerald+0.467
Neg logit contributions
skirts-0.669
lik-0.511
haps-0.510
EStream-0.496
arthy-0.493
Token their
Token ID511
Feature activation-2.86e+00

Pos logit contributions
skirts+1.920
lik+1.431
haps+1.424
arthy+1.349
EStream+1.339
Neg logit contributions
advertisement-2.075
INT-1.741
ENSE-1.738
ARB-1.690
gerald-1.675
Token best
Token ID1266
Feature activation-2.37e+00

Pos logit contributions
advertisement+2.175
INT+1.797
ENSE+1.793
Db+1.738
ARB+1.723
Neg logit contributions
skirts-2.673
lik-2.045
haps-2.036
EStream-1.988
arthy-1.987
Token.
Token ID13
Feature activation-1.54e-01

Pos logit contributions
advertisement+1.636
ENSE+1.347
INT+1.340
ARB+1.293
gerald+1.269
Neg logit contributions
skirts-1.995
lik-1.545
haps-1.544
EStream-1.499
arthy-1.496
Token To
Token ID1675
Feature activation-5.91e+00

Pos logit contributions
advertisement+0.133
INT+0.110
ENSE+0.109
ARB+0.106
Db+0.105
Neg logit contributions
skirts-0.131
haps-0.100
lik-0.099
 vo-0.093
arthy-0.093
Token fight
Token ID1907
Feature activation-6.76e+00

Pos logit contributions
advertisement+4.117
INT+3.295
ENSE+3.254
ARB+3.016
gerald+3.010
Neg logit contributions
skirts-5.917
EStream-4.657
lik-4.489
haps-4.437
cigarettes-4.390
Token.
Token ID13
Feature activation-1.83e-01

Pos logit contributions
advertisement+4.285
INT+3.411
ENSE+3.334
INESS+3.186
ARB+3.135
Neg logit contributions
skirts-6.983
EStream-5.451
haps-5.437
arthy-5.416
lik-5.410
Token The
Token ID383
Feature activation-2.56e+00

Pos logit contributions
advertisement+0.157
INT+0.130
ENSE+0.129
ARB+0.125
Db+0.124
Neg logit contributions
skirts-0.157
haps-0.121
lik-0.120
 vo-0.113
EStream-0.112
Token tension
Token ID12097
Feature activation-1.73e+00

Pos logit contributions
advertisement+2.072
INT+1.726
ENSE+1.709
ARB+1.648
gerald+1.618
Neg logit contributions
skirts-2.291
haps-1.757
lik-1.732
EStream-1.666
arthy-1.661
Token came
Token ID1625
Feature activation-5.95e+00

Pos logit contributions
advertisement+1.478
INT+1.235
ENSE+1.223
ARB+1.191
Db+1.172
Neg logit contributions
skirts-1.490
haps-1.133
lik-1.107
EStream-1.081
arthy-1.060
Token at
Token ID379
Feature activation-4.75e-01

Pos logit contributions
advertisement+4.239
ENSE+3.523
INT+3.518
ARB+3.334
Db+3.279
Neg logit contributions
skirts-5.713
haps-4.417
EStream-4.363
lik-4.340
arthy-4.250
Token the
Token ID262
Feature activation-8.00e-01

Pos logit contributions
advertisement+0.391
ENSE+0.326
INT+0.326
ARB+0.315
gerald+0.308
Neg logit contributions
skirts-0.423
lik-0.321
haps-0.317
EStream-0.307
arthy-0.303
Token last
Token ID938
Feature activation-1.09e+00

Pos logit contributions
advertisement+0.611
ENSE+0.498
INT+0.497
ARB+0.472
Db+0.467
Neg logit contributions
skirts-0.761
lik-0.596
haps-0.590
EStream-0.564
arthy-0.561
Token possible
Token ID1744
Feature activation-1.22e+00

Pos logit contributions
advertisement+0.733
INT+0.586
ENSE+0.583
ARB+0.555
Db+0.541
Neg logit contributions
skirts-1.134
haps-0.900
lik-0.894
arthy-0.866
EStream-0.857
Token not
Token ID407
Feature activation+4.77e+00

Pos logit contributions
skirts+0.251
haps+0.184
lik+0.184
arthy+0.176
EStream+0.175
Neg logit contributions
advertisement-0.275
INT-0.232
ENSE-0.229
ARB-0.225
gerald-0.223
Token,
Token ID11
Feature activation+1.42e+00

Pos logit contributions
skirts+4.263
lik+3.316
haps+3.311
 vo+3.125
arthy+3.107
Neg logit contributions
advertisement-3.807
ENSE-3.131
INT-3.118
ARB-3.029
gerald-3.006
Token Bounty
Token ID38166
Feature activation+8.09e-01

Pos logit contributions
skirts+1.240
lik+0.940
haps+0.936
arthy+0.893
EStream+0.889
Neg logit contributions
advertisement-1.199
INT-0.996
ENSE-0.995
ARB-0.966
gerald-0.955
Tokengate
Token ID10494
Feature activation+3.36e+00

Pos logit contributions
skirts+0.850
haps+0.684
lik+0.684
arthy+0.657
EStream+0.656
Neg logit contributions
advertisement-0.531
INT-0.412
ENSE-0.412
gerald-0.396
ARB-0.393
Token affects
Token ID10975
Feature activation+3.40e+00

Pos logit contributions
skirts+2.617
lik+1.925
haps+1.917
arthy+1.797
EStream+1.784
Neg logit contributions
advertisement-3.133
INT-2.648
ENSE-2.644
ARB-2.575
gerald-2.555
Token every
Token ID790
Feature activation-3.27e-01

Pos logit contributions
skirts+2.922
lik+2.227
haps+2.217
arthy+2.092
 vo+2.079
Neg logit contributions
advertisement-2.883
INT-2.389
ENSE-2.388
ARB-2.319
gerald-2.299
Token aspect
Token ID4843
Feature activation-1.39e+00

Pos logit contributions
advertisement+0.236
INT+0.190
ENSE+0.189
ARB+0.182
gerald+0.180
Neg logit contributions
skirts-0.326
haps-0.256
lik-0.255
arthy-0.245
EStream-0.244
Token of
Token ID286
Feature activation+4.54e-01

Pos logit contributions
advertisement+1.325
INT+1.126
ENSE+1.123
ARB+1.094
gerald+1.081
Neg logit contributions
skirts-1.053
lik-0.765
haps-0.763
arthy-0.713
EStream-0.705
Token league
Token ID4652
Feature activation-1.18e+00

Pos logit contributions
skirts+0.387
haps+0.292
lik+0.289
arthy+0.277
EStream+0.275
Neg logit contributions
advertisement-0.393
INT-0.327
ENSE-0.327
ARB-0.317
gerald-0.315
Token punishment
Token ID9837
Feature activation+1.68e+00

Pos logit contributions
advertisement+1.038
ENSE+0.866
INT+0.866
ARB+0.838
INESS+0.829
Neg logit contributions
skirts-0.998
haps-0.742
lik-0.729
EStream-0.714
arthy-0.703
Token and
Token ID290
Feature activation-7.40e-01

Pos logit contributions
skirts+1.713
haps+1.357
lik+1.354
arthy+1.302
EStream+1.298
Neg logit contributions
advertisement-1.168
INT-0.923
ENSE-0.920
ARB-0.883
gerald-0.867
Token you
Token ID345
Feature activation+3.05e+00

Pos logit contributions
advertisement+0.917
ENSE+0.752
INT+0.750
ARB+0.735
Db+0.716
Neg logit contributions
skirts-1.076
lik-0.827
haps-0.821
arthy-0.790
EStream-0.789
Token asked
Token ID1965
Feature activation-6.98e-01

Pos logit contributions
skirts+2.598
lik+1.970
haps+1.962
arthy+1.853
EStream+1.841
Neg logit contributions
advertisement-2.608
INT-2.168
ENSE-2.166
ARB-2.102
gerald-2.084
Token a
Token ID257
Feature activation+1.42e+00

Pos logit contributions
advertisement+0.545
INT+0.444
ENSE+0.444
ARB+0.427
Db+0.418
Neg logit contributions
skirts-0.652
lik-0.506
haps-0.502
EStream-0.486
arthy-0.480
Token Hillary
Token ID5073
Feature activation-3.52e-01

Pos logit contributions
skirts+1.202
haps+0.902
lik+0.901
EStream+0.842
arthy+0.842
Neg logit contributions
advertisement-1.238
INT-1.035
ENSE-1.029
ARB-0.994
gerald-0.989
Token supporter
Token ID15525
Feature activation+8.82e-01

Pos logit contributions
advertisement+0.342
INT+0.293
ENSE+0.290
ARB+0.283
gerald+0.282
Neg logit contributions
skirts-0.260
haps-0.187
lik-0.186
cigarettes-0.173
arthy-0.172
Token,
Token ID11
Feature activation-2.15e-02

Pos logit contributions
skirts+0.858
lik+0.668
haps+0.668
arthy+0.638
EStream+0.637
Neg logit contributions
advertisement-0.660
INT-0.530
ENSE-0.530
ARB-0.509
gerald-0.502
Token who
Token ID508
Feature activation-1.37e+00

Pos logit contributions
advertisement+0.017
ENSE+0.014
INT+0.014
ARB+0.013
Db+0.013
Neg logit contributions
skirts-0.020
lik-0.015
haps-0.015
EStream-0.015
arthy-0.015
Token
Token ID447
Feature activation+1.84e+00

Pos logit contributions
advertisement+1.208
INT+1.018
ENSE+1.003
ARB+0.991
Db+0.967
Neg logit contributions
skirts-1.124
lik-0.821
haps-0.818
EStream-0.807
arthy-0.786
Token
Token ID247
Feature activation-2.46e+00

Pos logit contributions
skirts+1.317
lik+0.938
haps+0.928
arthy+0.868
EStream+0.855
Neg logit contributions
advertisement-1.840
INT-1.571
ENSE-1.565
ARB-1.527
gerald-1.519
Tokens
Token ID82
Feature activation-2.20e+00

Pos logit contributions
advertisement+1.519
INT+1.123
ENSE+1.091
gerald+1.078
Db+1.022
Neg logit contributions
skirts-2.702
haps-2.161
lik-2.145
EStream-2.138
 vo-2.094
Token living
Token ID2877
Feature activation+8.14e-01

Pos logit contributions
advertisement+1.804
ENSE+1.505
INT+1.494
ARB+1.439
Db+1.427
Neg logit contributions
skirts-1.946
haps-1.466
lik-1.453
EStream-1.423
cigarettes-1.393
Token that
Token ID326
Feature activation-7.64e-01

Pos logit contributions
advertisement+1.037
INT+0.873
ENSE+0.866
ARB+0.845
Db+0.835
Neg logit contributions
skirts-0.961
haps-0.714
lik-0.700
arthy-0.675
cigarettes-0.674
Token Putin
Token ID8144
Feature activation+7.92e+00

Pos logit contributions
advertisement+0.662
ENSE+0.555
INT+0.555
ARB+0.539
Db+0.529
Neg logit contributions
skirts-0.639
lik-0.478
haps-0.477
EStream-0.456
arthy-0.450
Token has
Token ID468
Feature activation+2.91e+00

Pos logit contributions
skirts+6.223
lik+4.772
haps+4.695
 vo+4.440
arthy+4.399
Neg logit contributions
advertisement-7.087
ENSE-5.952
INT-5.887
ARB-5.778
gerald-5.721
Token created
Token ID2727
Feature activation+2.61e+00

Pos logit contributions
skirts+2.454
lik+1.850
haps+1.845
arthy+1.740
EStream+1.732
Neg logit contributions
advertisement-2.516
INT-2.096
ENSE-2.093
ARB-2.032
gerald-2.015
Token.
Token ID13
Feature activation+5.99e-01

Pos logit contributions
skirts+2.565
lik+2.021
haps+2.018
arthy+1.924
EStream+1.915
Neg logit contributions
advertisement-1.909
INT-1.530
ENSE-1.526
ARB-1.469
gerald-1.452
Token They
Token ID1119
Feature activation-3.03e+00

Pos logit contributions
skirts+0.524
haps+0.402
lik+0.401
EStream+0.381
cigarettes+0.374
Neg logit contributions
advertisement-0.504
INT-0.418
ENSE-0.417
ARB-0.399
Db-0.398
Token are
Token ID389
Feature activation-1.48e-01

Pos logit contributions
advertisement+2.626
ENSE+2.214
INT+2.184
ARB+2.128
gerald+2.110
Neg logit contributions
skirts-2.508
EStream-1.827
lik-1.826
haps-1.789
cigarettes-1.720
Token acting
Token ID7205
Feature activation-2.07e+00

Pos logit contributions
advertisement+0.122
ENSE+0.101
INT+0.101
ARB+0.098
gerald+0.096
Neg logit contributions
skirts-0.130
lik-0.098
haps-0.098
EStream-0.095
arthy-0.093
Token according
Token ID1864
Feature activation-2.64e+00

Pos logit contributions
advertisement+1.560
ENSE+1.286
INT+1.283
ARB+1.215
Db+1.201
Neg logit contributions
skirts-1.943
haps-1.493
lik-1.482
EStream-1.449
arthy-1.446
Token to
Token ID284
Feature activation-3.33e-01

Pos logit contributions
advertisement+2.309
ENSE+1.939
INT+1.929
ARB+1.865
Db+1.836
Neg logit contributions
skirts-2.176
haps-1.625
lik-1.614
arthy-1.540
cigarettes-1.518
Token the
Token ID262
Feature activation+4.58e-01

Pos logit contributions
advertisement+0.278
INT+0.231
ENSE+0.230
ARB+0.222
Db+0.218
Neg logit contributions
skirts-0.293
haps-0.221
lik-0.221
arthy-0.210
cigarettes-0.209
Tokenru
Token ID622
Feature activation+3.26e+00

Pos logit contributions
skirts+2.337
lik+1.662
haps+1.659
arthy+1.545
 vo+1.508
Neg logit contributions
advertisement-3.315
INT-2.825
ENSE-2.816
ARB-2.753
gerald-2.748
Tokenitt
Token ID715
Feature activation+8.67e-01

Pos logit contributions
skirts+2.490
lik+1.850
haps+1.840
arthy+1.723
 vo+1.698
Neg logit contributions
advertisement-3.067
INT-2.583
ENSE-2.575
ARB-2.533
gerald-2.511
Token has
Token ID468
Feature activation+1.45e+00

Pos logit contributions
skirts+0.686
haps+0.505
lik+0.502
EStream+0.474
arthy+0.472
Neg logit contributions
advertisement-0.800
INT-0.673
ENSE-0.671
ARB-0.658
gerald-0.654
Token also
Token ID635
Feature activation-7.44e-01

Pos logit contributions
skirts+1.260
haps+0.951
lik+0.949
EStream+0.912
arthy+0.901
Neg logit contributions
advertisement-1.228
ENSE-1.011
INT-1.011
ARB-0.983
gerald-0.970
Token failed
Token ID4054
Feature activation-2.01e+00

Pos logit contributions
advertisement+0.643
INT+0.531
ENSE+0.530
ARB+0.515
gerald+0.510
Neg logit contributions
skirts-0.635
haps-0.470
lik-0.466
EStream-0.461
arthy-0.446
Token to
Token ID284
Feature activation-7.99e+00

Pos logit contributions
advertisement+1.764
INT+1.478
ENSE+1.473
ARB+1.439
INESS+1.417
Neg logit contributions
skirts-1.666
haps-1.244
lik-1.235
arthy-1.178
EStream-1.167
Token take
Token ID1011
Feature activation-6.89e+00

Pos logit contributions
advertisement+5.204
ENSE+4.243
INT+4.080
ARB+3.852
gerald+3.762
Neg logit contributions
skirts-8.186
EStream-6.682
lik-6.190
cigarettes-6.189
haps-6.187
Token legal
Token ID2742
Feature activation-4.41e-01

Pos logit contributions
advertisement+5.274
ENSE+4.504
INT+4.387
ARB+4.310
INESS+4.106
Neg logit contributions
skirts-6.198
haps-4.765
EStream-4.751
lik-4.572
arthy-4.524
Token action
Token ID2223
Feature activation-3.26e+00

Pos logit contributions
advertisement+0.348
ENSE+0.285
INT+0.284
ARB+0.277
gerald+0.268
Neg logit contributions
skirts-0.406
haps-0.315
lik-0.313
arthy-0.299
EStream-0.298
Token to
Token ID284
Feature activation-5.30e+00

Pos logit contributions
advertisement+2.223
INT+1.745
ENSE+1.709
ARB+1.680
Db+1.621
Neg logit contributions
skirts-3.345
haps-2.653
lik-2.600
EStream-2.559
arthy-2.558
Token protect
Token ID1805
Feature activation-2.73e+00

Pos logit contributions
advertisement+3.392
ENSE+2.746
INT+2.743
ARB+2.616
Db+2.537
Neg logit contributions
skirts-5.507
haps-4.378
EStream-4.346
lik-4.311
arthy-4.192
Token are
Token ID389
Feature activation-4.30e-01

Pos logit contributions
advertisement+1.462
INT+1.188
ENSE+1.183
Db+1.128
gerald+1.127
Neg logit contributions
skirts-1.678
haps-1.311
lik-1.308
EStream-1.250
arthy-1.244
Token soldiers
Token ID5795
Feature activation-1.86e+00

Pos logit contributions
advertisement+0.361
INT+0.301
ENSE+0.298
gerald+0.286
ARB+0.286
Neg logit contributions
skirts-0.372
haps-0.283
lik-0.282
EStream-0.274
arthy-0.269
Token marking
Token ID18730
Feature activation-2.45e+00

Pos logit contributions
advertisement+1.456
INT+1.210
ENSE+1.176
gerald+1.127
Db+1.121
Neg logit contributions
skirts-1.712
haps-1.322
lik-1.304
EStream-1.280
izons-1.261
Token the
Token ID262
Feature activation-2.74e-01

Pos logit contributions
advertisement+1.958
INT+1.601
ENSE+1.597
ARB+1.537
gerald+1.508
Neg logit contributions
skirts-2.176
lik-1.689
haps-1.677
arthy-1.631
EStream-1.628
Token 65
Token ID6135
Feature activation-4.24e+00

Pos logit contributions
advertisement+0.237
INT+0.196
ENSE+0.195
ARB+0.188
gerald+0.187
Neg logit contributions
skirts-0.232
lik-0.174
haps-0.174
EStream-0.166
arthy-0.165
Tokenth
Token ID400
Feature activation-7.37e+00

Pos logit contributions
advertisement+3.312
INT+2.745
ENSE+2.708
Db+2.623
MK+2.618
Neg logit contributions
skirts-3.771
lik-2.899
haps-2.894
arthy-2.815
EStream-2.794
Token anniversary
Token ID11162
Feature activation-3.17e+00

Pos logit contributions
advertisement+6.171
INT+5.213
ENSE+5.173
ARB+4.977
MK+4.967
Neg logit contributions
skirts-5.935
lik-4.418
arthy-4.288
haps-4.285
EStream-4.242
Token of
Token ID286
Feature activation-2.83e+00

Pos logit contributions
advertisement+2.982
INT+2.519
ENSE+2.501
ARB+2.433
gerald+2.414
Neg logit contributions
skirts-2.420
haps-1.777
lik-1.764
arthy-1.650
cigarettes-1.631
Token the
Token ID262
Feature activation-2.43e+00

Pos logit contributions
advertisement+2.111
INT+1.752
ENSE+1.741
Db+1.649
ARB+1.647
Neg logit contributions
skirts-2.652
lik-2.075
haps-2.048
arthy-1.975
EStream-1.957
Token country
Token ID1499
Feature activation-1.26e+00

Pos logit contributions
advertisement+1.887
INT+1.584
ENSE+1.550
ARB+1.483
Db+1.472
Neg logit contributions
skirts-2.240
lik-1.704
haps-1.683
EStream-1.616
cigarettes-1.613
Token's
Token ID338
Feature activation-5.81e+00

Pos logit contributions
advertisement+0.942
INT+0.762
ENSE+0.756
ARB+0.722
Db+0.716
Neg logit contributions
skirts-1.208
haps-0.945
lik-0.943
arthy-0.899
EStream-0.892
Token,
Token ID11
Feature activation-8.65e-01

Pos logit contributions
skirts+2.148
lik+1.713
haps+1.709
arthy+1.632
EStream+1.623
Neg logit contributions
advertisement-1.467
INT-1.160
ENSE-1.158
ARB-1.114
gerald-1.102
Token
Token ID198
Feature activation+3.42e+00

Pos logit contributions
advertisement+0.541
INT+0.415
ENSE+0.410
ARB+0.395
gerald+0.387
Neg logit contributions
skirts-0.945
haps-0.762
lik-0.761
EStream-0.734
arthy-0.730
Token
Token ID198
Feature activation+5.20e-01

Pos logit contributions
skirts+2.202
lik+1.521
haps+1.498
arthy+1.381
 vo+1.366
Neg logit contributions
advertisement-3.596
ENSE-3.114
INT-3.106
ARB-3.041
gerald-3.032
TokenDA
Token ID5631
Feature activation-1.58e+00

Pos logit contributions
skirts+0.602
haps+0.492
lik+0.492
arthy+0.475
EStream+0.473
Neg logit contributions
advertisement-0.293
INT-0.218
ENSE-0.214
ARB-0.205
gerald-0.199
TokenVID
Token ID11008
Feature activation-4.41e+00

Pos logit contributions
advertisement+0.969
INT+0.781
ENSE+0.761
ARB+0.735
INESS+0.702
Neg logit contributions
skirts-1.746
haps-1.429
lik-1.406
EStream-1.354
arthy-1.339
Token SL
Token ID12419
Feature activation-7.43e+00

Pos logit contributions
advertisement+2.955
INT+2.405
ENSE+2.381
ARB+2.331
INESS+2.266
Neg logit contributions
skirts-4.517
haps-3.691
lik-3.615
EStream-3.459
arthy-3.439
TokenACK
Token ID8120
Feature activation-3.32e-01

Pos logit contributions
advertisement+2.040
INT+1.183
ENSE+1.059
ARB+1.052
INESS+0.835
Neg logit contributions
skirts-10.655
haps-9.206
lik-9.119
EStream-8.967
accompan-8.859
Token
Token ID198
Feature activation+3.83e+00

Pos logit contributions
advertisement+0.276
INT+0.230
ENSE+0.229
ARB+0.222
gerald+0.219
Neg logit contributions
skirts-0.293
haps-0.223
lik-0.222
arthy-0.209
EStream-0.208
Token
Token ID198
Feature activation+1.13e+00

Pos logit contributions
skirts+2.423
lik+1.666
haps+1.634
arthy+1.506
 vo+1.492
Neg logit contributions
advertisement-4.055
ENSE-3.522
INT-3.510
ARB-3.440
gerald-3.438
TokenBill
Token ID17798
Feature activation-2.37e+00

Pos logit contributions
skirts+1.168
lik+0.931
haps+0.929
arthy+0.893
EStream+0.891
Neg logit contributions
advertisement-0.773
INT-0.608
ENSE-0.600
ARB-0.580
Db-0.567
Token Walk
Token ID6857
Feature activation-3.82e+00

Pos logit contributions
advertisement+1.814
INT+1.473
ENSE+1.451
ARB+1.437
Db+1.411
Neg logit contributions
skirts-2.199
lik-1.724
haps-1.720
EStream-1.653
cigarettes-1.640
Tokenko
Token ID7204
Feature activation+9.82e-01

Pos logit contributions
advertisement+0.134
INT+0.112
ENSE+0.112
gerald+0.109
ARB+0.108
Neg logit contributions
skirts-0.120
haps-0.089
lik-0.088
EStream-0.085
arthy-0.084
Token.
Token ID13
Feature activation+8.29e-01

Pos logit contributions
skirts+0.999
lik+0.799
haps+0.795
arthy+0.761
EStream+0.757
Neg logit contributions
advertisement-0.676
INT-0.534
ENSE-0.534
ARB-0.513
gerald-0.508
Token "
Token ID366
Feature activation+2.52e-01

Pos logit contributions
skirts+0.507
lik+0.344
haps+0.340
arthy+0.306
EStream+0.304
Neg logit contributions
advertisement-0.909
ENSE-0.789
INT-0.786
ARB-0.765
gerald-0.760
TokenIt
Token ID1026
Feature activation-6.13e+00

Pos logit contributions
skirts+0.261
haps+0.208
lik+0.208
EStream+0.200
arthy+0.200
Neg logit contributions
advertisement-0.173
INT-0.137
ENSE-0.136
ARB-0.130
Db-0.128
Token really
Token ID1107
Feature activation-3.34e+00

Pos logit contributions
advertisement+4.842
INT+3.957
ENSE+3.949
ARB+3.844
gerald+3.758
Neg logit contributions
skirts-5.417
EStream-4.198
lik-3.949
haps-3.922
arthy-3.921
Token speaks
Token ID9209
Feature activation-7.38e+00

Pos logit contributions
advertisement+2.840
ENSE+2.342
INT+2.335
ARB+2.228
Db+2.206
Neg logit contributions
skirts-2.862
EStream-2.160
haps-2.108
lik-2.108
arthy-2.066
Token to
Token ID284
Feature activation-6.77e+00

Pos logit contributions
advertisement+5.794
ENSE+4.750
INT+4.652
ARB+4.520
INESS+4.490
Neg logit contributions
skirts-6.533
haps-4.905
lik-4.870
arthy-4.810
EStream-4.745
Token the
Token ID262
Feature activation-3.33e+00

Pos logit contributions
advertisement+4.828
INT+4.015
ENSE+4.012
ARB+3.741
Db+3.652
Neg logit contributions
skirts-6.421
EStream-4.929
lik-4.876
haps-4.867
arthy-4.860
Token synergy
Token ID42193
Feature activation+7.12e-01

Pos logit contributions
advertisement+2.486
INT+2.029
ENSE+1.993
ARB+1.903
gerald+1.889
Neg logit contributions
skirts-3.190
haps-2.480
EStream-2.446
lik-2.420
arthy-2.380
Token we
Token ID356
Feature activation-8.25e-01

Pos logit contributions
skirts+0.619
haps+0.470
lik+0.467
arthy+0.444
EStream+0.443
Neg logit contributions
advertisement-0.603
INT-0.501
ENSE-0.499
ARB-0.485
gerald-0.477
Token have
Token ID423
Feature activation-4.42e-01

Pos logit contributions
advertisement+0.711
INT+0.605
ENSE+0.601
ARB+0.584
gerald+0.581
Neg logit contributions
skirts-0.696
haps-0.510
EStream-0.509
lik-0.506
arthy-0.486
Token a
Token ID257
Feature activation+9.31e-01

Pos logit contributions
advertisement+2.482
ENSE+2.081
INT+2.075
ARB+1.996
gerald+1.978
Neg logit contributions
skirts-2.956
haps-2.297
lik-2.249
arthy-2.165
EStream-2.158
Token Devil
Token ID10489
Feature activation-5.46e+00

Pos logit contributions
skirts+0.806
haps+0.616
lik+0.612
arthy+0.575
EStream+0.573
Neg logit contributions
advertisement-0.785
INT-0.659
ENSE-0.654
ARB-0.634
gerald-0.631
Token,
Token ID11
Feature activation-4.87e+00

Pos logit contributions
advertisement+3.895
INT+3.275
ENSE+3.237
ARB+3.118
Bench+3.049
Neg logit contributions
skirts-5.357
haps-4.210
lik-4.146
EStream-3.971
arthy-3.952
Token collecting
Token ID13157
Feature activation-2.32e+00

Pos logit contributions
advertisement+3.484
ENSE+2.810
INT+2.798
ARB+2.722
Db+2.623
Neg logit contributions
skirts-4.754
lik-3.708
haps-3.704
arthy-3.567
EStream-3.536
Token 142
Token ID25181
Feature activation-3.69e+00

Pos logit contributions
advertisement+1.860
INT+1.563
ENSE+1.558
ARB+1.494
gerald+1.486
Neg logit contributions
skirts-2.045
haps-1.573
lik-1.564
EStream-1.527
arthy-1.504
Token goals
Token ID4661
Feature activation-7.52e+00

Pos logit contributions
advertisement+2.757
INT+2.319
ENSE+2.252
ARB+2.173
Db+2.144
Neg logit contributions
skirts-3.432
haps-2.710
lik-2.707
EStream-2.582
cigarettes-2.536
Token (
Token ID357
Feature activation-7.74e+00

Pos logit contributions
advertisement+3.882
INT+3.081
ENSE+3.029
gerald+2.780
ARB+2.777
Neg logit contributions
skirts-8.565
lik-6.860
haps-6.796
arthy-6.666
 constitu-6.570
Token12
Token ID1065
Feature activation-9.16e+00

Pos logit contributions
advertisement+5.112
INT+4.014
ENSE+3.867
ARB+3.855
gerald+3.777
Neg logit contributions
skirts-7.574
haps-6.033
lik-6.022
EStream-5.996
 vo-5.987
Tokenth
Token ID400
Feature activation-1.10e+01

Pos logit contributions
advertisement+6.461
INT+5.729
ARB+5.273
ENSE+5.252
Db+5.162
Neg logit contributions
skirts-8.256
lik-6.647
haps-6.530
EStream-6.457
hemer-6.326
Token in
Token ID287
Feature activation-6.10e+00

Pos logit contributions
advertisement+6.695
INT+5.725
ENSE+5.541
ARB+5.338
MK+5.105
Neg logit contributions
skirts-10.821
lik-8.556
EStream-8.542
haps-8.426
izons-8.299
Token club
Token ID3430
Feature activation+8.26e-01

Pos logit contributions
advertisement+4.318
INT+3.612
ENSE+3.531
ARB+3.413
Db+3.340
Neg logit contributions
skirts-5.850
lik-4.568
haps-4.510
EStream-4.422
arthy-4.347