TOP ACTIVATIONS
MAX = 24.348

the Congress may by general Laws prescribe the Man ner in
the senator has been in this situation before , having survived
my mouth and her tongue sl ither s into mine .
we must place it front and center . Dec ades ago
Nebraska , has been in this post since February 2013
NHL ers like Derrick P oul iot and Mat hew Dumb
State . And the Congress may by general Laws prescribe the
great leader . Cl arks on added :
. All Rights Reserved . This material may not be published
goals , 10 assists in 31 games this year . He
Obviously , we condemn what he said .
He ath Led ger Sh r ink - RAM ONE 2008
z r IV Thermal Design with Dust Removal Technology . The
she almost has that Batman female origin . She shares that
flagged by the US Anti - D oping Agency once again
to a Stanley Cup . Alex O ve ch kin hasn
tight ends with at least 50 touches on the year .
, so I wanted to make sure people weren
has been charged with conspiracy to bribe a foreign public official
idea to put him in the book but when I saw

MIN ACTIVATIONS
MIN = -22.344

celebrities and posted them to social media sites including Reddit and
not all of them are C eph aly x and their
movie in the age of social media , and it 's
. Estimates from human rights groups and witnesses range from several
gave police drones his off - the - c uff endorsement
Ranger 6 lifted off on its first attempt on January 30
Do not use social media to set a bad
simplistic approach to social commentary . On
Do not use social media to ... actually ,
Do not use social media to constantly post photos
multiple fractures and a partially severed right hand in a crash
: Sherman suffered a rupt ured Achilles tendon in Thursday 's
in Mountain View . ( Justin Sullivan / Getty Images )
, Facebook , the dominant social network with more than 500
, Calif . Photo : Justin Sullivan / Getty Images
week after recovering from a broken leg in round 18 last
Do not use social media to post an unfl
Do not use social media to let people know
detailed images of the comet 's surface . Such images were
Do not use social media to show millions of

INTERVAL 12.675 - 24.348
CONTAINS 0.162%

of 20 70 689 Ontario Ltd ., a Toronto - based
intersection of Y onge and E gl inton will feature large
companies pay me for collecting and arranging transport of the specimen
character (< Space > or < Tab >) { vi :
at the time . " Jan would get her pencil s

INTERVAL 1.002 - 12.675
CONTAINS 40.051%

ve talked about a lot here because he s
, S aki broke into his apartment and murdered Tang Shen
di as pora , we represent a second army of the
main takeaway is that , contrary to all expectations , the
shifts my focus away from learning and toward frequent input ting

INTERVAL -10.671 - 1.002
CONTAINS 59.234%

ored , Bio Shock , or several similar games . It
asure , R . H ., Al lee , W .
posted of her . She likened the unauthorized use of photos
only way to go past them is to go inside them
found out the bombers were Che chens , and they started

INTERVAL -22.344 - -10.671
CONTAINS 0.553%

It comes time to plea - b arg ain . Your
is at least 100 , 000 light - years across .
one less thing for you to deal with . If you
appearing very skinny , in his 30 s , standing 5
Years ) to reach the Andromeda galaxy ( 2
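
NOTE ON THE INTERVALS ABOVE
The four interval boundaries are consistent with splitting the range from MIN (-22.344) to MAX (24.348) into four equal-width bins of width 11.673, and the four CONTAINS percentages sum to 100%. This is an inference from the printed numbers, not something the dump states. The sketch below reproduces the boundaries under that assumption; the function name and the choice of four bins are illustrative, not taken from the tool that produced this output.

```python
def equal_width_intervals(lo, hi, n_bins):
    """Split [lo, hi] into n_bins equal-width intervals, listed highest first.

    Assumption: the dashboard bins activations by equal width between the
    observed MIN and MAX. The dump itself does not say how bins are chosen.
    """
    width = (hi - lo) / n_bins
    edges = [lo + i * width for i in range(n_bins)] + [hi]
    # Round to three decimals to match the precision printed in the dump.
    return [
        (round(edges[i], 3), round(edges[i + 1], 3))
        for i in reversed(range(n_bins))
    ]


# MIN and MAX as printed under MIN ACTIVATIONS and TOP ACTIVATIONS.
intervals = equal_width_intervals(-22.344, 24.348, 4)

# CONTAINS percentages as printed, in the same highest-first order.
contains_pct = [0.162, 40.051, 59.234, 0.553]

for (low, high), pct in zip(intervals, contains_pct):
    print(f"INTERVAL {low} - {high}  CONTAINS {pct}%")
```

Read this way, about 99.3% of sampled activations fall in the two middle bins (-10.671 to 12.675), so the contexts listed under TOP ACTIVATIONS and MIN ACTIVATIONS come from the sparse tails of the distribution.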
Token the
Token ID262
Feature activation+5.13e+00

Pos logit contributions
umber+2.461
quartered+2.240
esson+2.121
wark+2.059
lav+2.027
Neg logit contributions
opsy-2.450
roma-2.445
verse-2.242
 noses-2.211
 manufact-2.140
Token Congress
Token ID3162
Feature activation+1.41e+01

Pos logit contributions
umber+3.494
quartered+3.205
esson+3.031
wark+2.960
Constructed+2.951
Neg logit contributions
roma-3.535
opsy-3.504
verse-3.183
 noses-3.141
 manufact-3.075
Token may
Token ID743
Feature activation+2.03e+01

Pos logit contributions
umber+6.666
quartered+5.319
esson+4.596
Constructed+4.550
lav+4.506
Neg logit contributions
opsy-11.889
roma-11.605
 noses-10.967
verse-10.779
isoft-10.535
Token by
Token ID416
Feature activation+8.96e+00

Pos logit contributions
umber+9.480
Constructed+7.347
quartered+6.836
+6.779
esson+6.748
Neg logit contributions
opsy-15.478
roma-14.379
rique-14.116
 noses-14.071
 atmosp-13.521
Token general
Token ID2276
Feature activation+9.31e+00

Pos logit contributions
umber+5.550
quartered+4.873
Constructed+4.617
esson+4.607
wark+4.534
Neg logit contributions
opsy-6.463
roma-6.425
 noses-5.885
verse-5.745
iq-5.690
Token Laws
Token ID21153
Feature activation+2.43e+01

Pos logit contributions
umber+4.570
quartered+3.827
esson+3.726
+3.565
izo+3.433
Neg logit contributions
opsy-7.904
roma-7.768
 noses-7.203
iq-7.202
rique-7.184
Token prescribe
Token ID32651
Feature activation+1.34e+01

Pos logit contributions
umber+12.349
esson+9.358
Constructed+9.061
 regard+8.869
inc+8.790
Neg logit contributions
opsy-15.769
roma-15.334
 noses-15.196
rique-14.480
verse-14.399
Token the
Token ID262
Feature activation+9.54e+00

Pos logit contributions
umber+7.740
quartered+5.978
esson+5.812
lav+5.686
Constructed+5.644
Neg logit contributions
opsy-10.047
roma-9.954
 noses-9.080
verse-8.867
enz-8.783
Token Man
Token ID1869
Feature activation+6.47e+00

Pos logit contributions
umber+5.866
quartered+5.145
esson+4.886
wark+4.687
Constructed+4.672
Neg logit contributions
roma-7.012
opsy-6.936
 noses-6.137
verse-6.104
enz-5.974
Tokenner
Token ID1008
Feature activation+1.07e-01

Pos logit contributions
umber+3.153
quartered+2.594
esson+2.404
izo+2.330
wark+2.271
Neg logit contributions
opsy-5.870
roma-5.803
 noses-5.388
verse-5.298
 manufact-5.201
Token in
Token ID287
Feature activation+6.68e+00

Pos logit contributions
umber+0.075
quartered+0.069
esson+0.066
netflix+0.063
wark+0.063
Neg logit contributions
roma-0.075
opsy-0.074
verse-0.068
 noses-0.067
 manufact-0.065
Token the
Token ID262
Feature activation+5.45e+00

Pos logit contributions
roma+0.081
opsy+0.080
verse+0.074
 noses+0.073
 manufact+0.071
Neg logit contributions
umber-0.078
quartered-0.072
esson-0.068
netflix-0.066
wark-0.065
Token senator
Token ID12329
Feature activation+3.20e+00

Pos logit contributions
umber+3.756
quartered+3.493
esson+3.382
lav+3.189
wark+3.181
Neg logit contributions
roma-3.725
opsy-3.620
verse-3.269
 noses-3.221
 manufact-3.182
Token has
Token ID468
Feature activation-6.89e+00

Pos logit contributions
umber+2.265
quartered+2.074
esson+2.006
lav+1.900
wark+1.900
Neg logit contributions
roma-2.178
opsy-2.136
verse-1.968
 noses-1.935
 manufact-1.894
Token been
Token ID587
Feature activation-1.40e+00

Pos logit contributions
roma+4.193
opsy+4.096
 noses+3.952
verse+3.945
 manufact+3.853
Neg logit contributions
umber-5.146
quartered-4.801
esson-4.510
wark-4.449
netflix-4.437
Token in
Token ID287
Feature activation+3.97e+00

Pos logit contributions
roma+0.923
opsy+0.909
verse+0.845
 noses+0.830
 manufact+0.808
Neg logit contributions
umber-1.026
quartered-0.957
esson-0.907
netflix-0.871
lav-0.869
Token this
Token ID428
Feature activation+2.22e+01

Pos logit contributions
umber+2.858
quartered+2.626
esson+2.546
wark+2.421
netflix+2.400
Neg logit contributions
roma-2.654
opsy-2.594
verse-2.344
 noses-2.328
 manufact-2.291
Token situation
Token ID3074
Feature activation+3.24e+00

Pos logit contributions
umber+9.088
esson+8.575
 regard+7.920
quartered+7.674
Constructed+7.545
Neg logit contributions
roma-16.662
opsy-15.995
enz-14.122
dim-13.994
 noses-13.932
Token before
Token ID878
Feature activation+5.11e+00

Pos logit contributions
umber+2.134
quartered+1.939
esson+1.867
netflix+1.771
lav+1.754
Neg logit contributions
roma-2.384
opsy-2.345
verse-2.123
 noses-2.099
 manufact-2.061
Token,
Token ID11
Feature activation+4.76e+00

Pos logit contributions
umber+3.402
quartered+3.022
esson+2.910
wark+2.778
lav+2.766
Neg logit contributions
roma-3.754
opsy-3.667
verse-3.375
 noses-3.315
 manufact-3.260
Token having
Token ID1719
Feature activation+1.62e+00

Pos logit contributions
umber+3.228
quartered+2.885
esson+2.839
wark+2.738
netflix+2.680
Neg logit contributions
roma-3.367
opsy-3.303
verse-3.021
 noses-2.946
 manufact-2.904
Token survived
Token ID11803
Feature activation-1.83e+00

Pos logit contributions
umber+1.141
quartered+1.051
esson+1.002
wark+0.959
netflix+0.959
Neg logit contributions
roma-1.118
opsy-1.108
verse-1.014
 noses-0.991
 manufact-0.972
Token my
Token ID616
Feature activation-5.93e+00

Pos logit contributions
roma+1.773
opsy+1.758
verse+1.649
 noses+1.642
 manufact+1.562
Neg logit contributions
umber-1.757
quartered-1.645
esson-1.558
netflix-1.493
lav-1.487
Token mouth
Token ID5422
Feature activation+6.85e-01

Pos logit contributions
roma+3.198
opsy+3.130
 noses+3.051
verse+2.965
 manufact+2.768
Neg logit contributions
umber-4.866
quartered-4.544
esson-4.473
lav-4.282
netflix-4.266
Token and
Token ID290
Feature activation-2.57e+00

Pos logit contributions
umber+0.488
quartered+0.452
esson+0.429
netflix+0.411
wark+0.411
Neg logit contributions
roma-0.468
opsy-0.461
verse-0.424
 noses-0.417
 manufact-0.408
Token her
Token ID607
Feature activation-1.02e+00

Pos logit contributions
roma+1.612
opsy+1.593
verse+1.476
 noses+1.472
 manufact+1.397
Neg logit contributions
umber-1.936
quartered-1.822
esson-1.736
netflix-1.672
wark-1.665
Token tongue
Token ID11880
Feature activation+2.32e+00

Pos logit contributions
roma+0.635
opsy+0.625
verse+0.576
 noses+0.570
 manufact+0.547
Neg logit contributions
umber-0.788
quartered-0.737
esson-0.704
netflix-0.680
wark-0.676
Token sl
Token ID1017
Feature activation+2.18e+01

Pos logit contributions
umber+1.700
quartered+1.559
esson+1.479
lav+1.430
wark+1.429
Neg logit contributions
roma-1.536
opsy-1.511
verse-1.376
 noses-1.330
 manufact-1.328
Tokenither
Token ID1555
Feature activation+3.54e+00

Pos logit contributions
umber+15.593
lav+12.129
ins+11.840
izo+11.773
wark+11.608
Neg logit contributions
 metic-12.407
roma-12.055
 unden-11.716
 manufact-11.663
 dime-11.099
Tokens
Token ID82
Feature activation-6.24e-01

Pos logit contributions
umber+2.509
quartered+2.309
esson+2.176
lav+2.110
netflix+2.102
Neg logit contributions
roma-2.409
opsy-2.382
verse-2.167
 manufact-2.103
 noses-2.080
Token into
Token ID656
Feature activation+6.65e-01

Pos logit contributions
roma+0.421
opsy+0.415
verse+0.384
 noses+0.380
 manufact+0.368
Neg logit contributions
umber-0.446
quartered-0.415
esson-0.394
netflix-0.378
wark-0.376
Token mine
Token ID6164
Feature activation+4.25e+00

Pos logit contributions
umber+0.445
quartered+0.411
esson+0.388
netflix+0.371
wark+0.371
Neg logit contributions
roma-0.481
opsy-0.474
verse-0.439
 noses-0.432
 manufact-0.423
Token.
Token ID13
Feature activation+5.62e+00

Pos logit contributions
umber+2.812
quartered+2.545
esson+2.434
lav+2.331
wark+2.320
Neg logit contributions
roma-3.105
opsy-3.070
verse-2.787
 manufact-2.717
 noses-2.713
Token we
Token ID356
Feature activation-1.08e+00

Pos logit contributions
umber+3.643
quartered+3.341
esson+3.174
wark+3.058
Constructed+3.037
Neg logit contributions
roma-3.696
opsy-3.693
verse-3.334
 noses-3.310
iq-3.224
Token must
Token ID1276
Feature activation+2.97e+00

Pos logit contributions
roma+0.763
opsy+0.754
verse+0.697
 noses+0.688
 manufact+0.670
Neg logit contributions
umber-0.734
quartered-0.683
esson-0.644
netflix-0.620
lav-0.615
Token place
Token ID1295
Feature activation-6.76e-01

Pos logit contributions
umber+2.052
quartered+1.872
esson+1.783
wark+1.727
lav+1.694
Neg logit contributions
roma-2.073
opsy-2.060
verse-1.883
 noses-1.866
 manufact-1.830
Token it
Token ID340
Feature activation-5.89e+00

Pos logit contributions
roma+0.456
opsy+0.449
verse+0.417
 noses+0.410
 manufact+0.399
Neg logit contributions
umber-0.484
quartered-0.451
esson-0.427
netflix-0.411
wark-0.408
Token front
Token ID2166
Feature activation-2.01e+00

Pos logit contributions
roma+3.702
opsy+3.544
 noses+3.409
verse+3.371
 manufact+3.218
Neg logit contributions
umber-4.345
quartered-4.076
esson-3.907
netflix-3.763
EStream-3.702
Token and
Token ID290
Feature activation+2.16e+01

Pos logit contributions
roma+1.286
opsy+1.278
verse+1.173
 noses+1.159
 manufact+1.120
Neg logit contributions
umber-1.496
quartered-1.386
esson-1.340
netflix-1.281
wark-1.274
Token center
Token ID3641
Feature activation+6.59e-01

Pos logit contributions
umber+5.501
quartered+4.393
Constructed+4.357
esson+3.776
 Rowe+3.224
Neg logit contributions
opsy-20.596
roma-20.096
ccording-18.449
isoft-18.320
rique-18.250
Token.
Token ID13
Feature activation+5.17e+00

Pos logit contributions
umber+0.460
quartered+0.425
esson+0.404
wark+0.386
netflix+0.386
Neg logit contributions
roma-0.458
opsy-0.452
verse-0.417
 noses-0.410
 manufact-0.401
Token Dec
Token ID4280
Feature activation+9.89e-01

Pos logit contributions
umber+3.444
quartered+3.024
esson+3.021
Constructed+2.957
wark+2.921
Neg logit contributions
opsy-3.657
roma-3.650
 noses-3.331
 manufact-3.270
verse-3.262
Tokenades
Token ID2367
Feature activation-1.15e+00

Pos logit contributions
umber+0.698
quartered+0.646
esson+0.612
wark+0.585
netflix+0.585
Neg logit contributions
roma-0.680
opsy-0.671
verse-0.619
 noses-0.613
 manufact-0.596
Token ago
Token ID2084
Feature activation+1.79e+00

Pos logit contributions
roma+0.764
opsy+0.754
verse+0.696
 noses+0.686
 manufact+0.664
Neg logit contributions
umber-0.825
quartered-0.770
esson-0.732
netflix-0.701
wark-0.701
Token Nebraska
Token ID18329
Feature activation+3.35e+00

Pos logit contributions
umber+0.223
quartered+0.209
esson+0.199
netflix+0.192
wark+0.191
Neg logit contributions
roma-0.178
opsy-0.176
verse-0.160
 noses-0.158
 manufact-0.153
Token,
Token ID11
Feature activation-1.63e+00

Pos logit contributions
umber+2.233
quartered+2.046
esson+1.945
wark+1.867
netflix+1.853
Neg logit contributions
roma-2.438
opsy-2.391
verse-2.198
 noses-2.175
 manufact-2.148
Token has
Token ID468
Feature activation-1.81e+00

Pos logit contributions
roma+1.070
opsy+1.058
verse+0.982
 noses+0.969
 manufact+0.937
Neg logit contributions
umber-1.184
quartered-1.109
esson-1.044
netflix-1.009
wark-1.003
Token been
Token ID587
Feature activation+1.23e+00

Pos logit contributions
roma+1.185
opsy+1.166
verse+1.091
 noses+1.077
 manufact+1.042
Neg logit contributions
umber-1.320
quartered-1.228
esson-1.163
netflix-1.124
wark-1.123
Token in
Token ID287
Feature activation+9.92e+00

Pos logit contributions
umber+0.858
quartered+0.794
esson+0.750
netflix+0.721
wark+0.719
Neg logit contributions
roma-0.855
opsy-0.844
verse-0.774
 noses-0.764
 manufact-0.743
Token this
Token ID428
Feature activation+2.05e+01

Pos logit contributions
umber+5.895
quartered+5.256
esson+5.209
wark+4.841
netflix+4.799
Neg logit contributions
roma-7.435
opsy-7.212
 noses-6.470
verse-6.431
 manufact-6.410
Token post
Token ID1281
Feature activation+7.85e+00

Pos logit contributions
umber+10.112
 regard+9.082
esson+8.893
lav+8.826
quartered+8.737
Neg logit contributions
roma-14.547
opsy-13.951
 noses-12.303
Prop-12.049
enz-11.640
Token since
Token ID1201
Feature activation+2.53e+00

Pos logit contributions
umber+4.947
quartered+4.546
esson+4.352
netflix+4.107
wark+4.049
Neg logit contributions
roma-5.865
opsy-5.655
 noses-5.223
 manufact-5.127
verse-5.053
Token February
Token ID3945
Feature activation+2.27e+00

Pos logit contributions
umber+1.910
quartered+1.759
esson+1.682
netflix+1.625
lav+1.622
Neg logit contributions
roma-1.617
opsy-1.598
verse-1.446
 noses-1.421
 manufact-1.386
Token 2013
Token ID2211
Feature activation+5.01e+00

Pos logit contributions
umber+1.592
quartered+1.465
esson+1.398
wark+1.340
netflix+1.332
Neg logit contributions
roma-1.576
opsy-1.557
verse-1.429
 noses-1.404
 manufact-1.383
Token
Token ID198
Feature activation+5.10e+00

Pos logit contributions
umber+3.211
quartered+2.923
esson+2.767
wark+2.675
lav+2.665
Neg logit contributions
roma-3.738
opsy-3.676
verse-3.392
 noses-3.381
 manufact-3.312
Token NHL
Token ID9481
Feature activation+1.52e+01

Pos logit contributions
umber+4.694
quartered+4.300
esson+4.057
lav+3.964
wark+3.926
Neg logit contributions
roma-4.289
opsy-4.200
verse-3.733
 noses-3.664
 manufact-3.646
Tokeners
Token ID364
Feature activation+1.05e+01

Pos logit contributions
umber+9.398
quartered+8.632
esson+8.014
lav+7.651
+7.529
Neg logit contributions
roma-10.436
opsy-10.048
 manufact-8.702
verse-8.631
 noses-8.468
Token like
Token ID588
Feature activation+6.13e+00

Pos logit contributions
umber+6.946
quartered+6.361
esson+6.121
lav+5.943
wark+5.731
Neg logit contributions
roma-7.145
opsy-6.933
verse-6.187
 manufact-5.917
 noses-5.911
Token Derrick
Token ID32979
Feature activation+6.79e+00

Pos logit contributions
umber+4.368
quartered+3.977
esson+3.841
wark+3.673
lav+3.669
Neg logit contributions
roma-4.093
opsy-4.014
verse-3.560
 noses-3.490
 manufact-3.481
Token P
Token ID350
Feature activation+5.48e+00

Pos logit contributions
umber+4.696
quartered+4.234
esson+4.132
lav+3.941
wark+3.939
Neg logit contributions
opsy-4.581
roma-4.543
 manufact-4.123
 noses-4.074
verse-4.070
Tokenoul
Token ID2852
Feature activation+2.04e+01

Pos logit contributions
umber+2.907
esson+2.438
quartered+2.430
izo+2.213
lav+2.201
Neg logit contributions
roma-4.693
opsy-4.634
 noses-4.479
verse-4.403
 manufact-4.390
Tokeniot
Token ID5151
Feature activation+3.44e+00

Pos logit contributions
umber+7.095
esson+5.605
inka+4.227
izo+4.226
wark+4.034
Neg logit contributions
opsy-19.405
roma-18.718
 manufact-18.503
 noses-18.021
 dime-17.819
Token and
Token ID290
Feature activation+7.88e+00

Pos logit contributions
umber+2.375
quartered+2.173
esson+2.094
lav+2.013
izo+1.976
Neg logit contributions
roma-2.397
opsy-2.364
verse-2.186
 noses-2.144
 manufact-2.116
Token Mat
Token ID6550
Feature activation+1.16e+01

Pos logit contributions
umber+5.538
quartered+5.010
esson+4.900
lav+4.668
wark+4.643
Neg logit contributions
roma-5.226
opsy-5.129
verse-4.591
 noses-4.473
 manufact-4.445
Tokenhew
Token ID6391
Feature activation+2.59e+00

Pos logit contributions
umber+7.146
izo+6.085
quartered+6.083
esson+6.072
inka+5.999
Neg logit contributions
roma-8.065
opsy-8.035
 noses-7.717
 manufact-7.585
verse-7.531
Token Dumb
Token ID42989
Feature activation+3.82e+00

Pos logit contributions
umber+1.813
quartered+1.625
esson+1.569
lav+1.501
wark+1.482
Neg logit contributions
roma-1.799
opsy-1.790
verse-1.642
 noses-1.615
 manufact-1.598
Token State
Token ID1812
Feature activation+5.04e+00

Pos logit contributions
umber+1.698
quartered+1.523
esson+1.410
wark+1.356
Constructed+1.339
Neg logit contributions
roma-2.539
opsy-2.526
 noses-2.328
verse-2.323
 manufact-2.241
Token.
Token ID13
Feature activation+2.30e+00

Pos logit contributions
umber+3.386
quartered+3.055
esson+2.851
wark+2.802
izo+2.735
Neg logit contributions
roma-3.638
opsy-3.615
 noses-3.302
verse-3.297
 manufact-3.133
Token And
Token ID843
Feature activation+3.56e+00

Pos logit contributions
umber+1.547
quartered+1.409
esson+1.337
wark+1.293
Constructed+1.280
Neg logit contributions
roma-1.643
opsy-1.643
verse-1.498
 noses-1.489
 manufact-1.450
Token the
Token ID262
Feature activation+5.13e+00

Pos logit contributions
umber+2.461
quartered+2.240
esson+2.121
wark+2.059
lav+2.027
Neg logit contributions
opsy-2.450
roma-2.445
verse-2.242
 noses-2.211
 manufact-2.140
Token Congress
Token ID3162
Feature activation+1.41e+01

Pos logit contributions
umber+3.494
quartered+3.205
esson+3.031
wark+2.960
Constructed+2.951
Neg logit contributions
roma-3.535
opsy-3.504
verse-3.183
 noses-3.141
 manufact-3.075
Token may
Token ID743
Feature activation+2.03e+01

Pos logit contributions
umber+6.666
quartered+5.319
esson+4.596
Constructed+4.550
lav+4.506
Neg logit contributions
opsy-11.889
roma-11.605
 noses-10.967
verse-10.779
isoft-10.535
Token by
Token ID416
Feature activation+8.96e+00

Pos logit contributions
umber+9.480
Constructed+7.347
quartered+6.836
+6.779
esson+6.748
Neg logit contributions
opsy-15.478
roma-14.379
rique-14.116
 noses-14.071
 atmosp-13.521
Token general
Token ID2276
Feature activation+9.31e+00

Pos logit contributions
umber+5.550
quartered+4.873
Constructed+4.617
esson+4.607
wark+4.534
Neg logit contributions
opsy-6.463
roma-6.425
 noses-5.885
verse-5.745
iq-5.690
Token Laws
Token ID21153
Feature activation+2.43e+01

Pos logit contributions
umber+4.570
quartered+3.827
esson+3.726
+3.565
izo+3.433
Neg logit contributions
opsy-7.904
roma-7.768
 noses-7.203
iq-7.202
rique-7.184
Token prescribe
Token ID32651
Feature activation+1.34e+01

Pos logit contributions
umber+12.349
esson+9.358
Constructed+9.061
 regard+8.869
inc+8.790
Neg logit contributions
opsy-15.769
roma-15.334
 noses-15.196
rique-14.480
verse-14.399
Token the
Token ID262
Feature activation+9.54e+00

Pos logit contributions
umber+7.740
quartered+5.978
esson+5.812
lav+5.686
Constructed+5.644
Neg logit contributions
opsy-10.047
roma-9.954
 noses-9.080
verse-8.867
enz-8.783
Token great
Token ID1049
Feature activation-5.52e+00

Pos logit contributions
roma+3.860
opsy+3.737
verse+3.706
 noses+3.447
 manufact+3.412
Neg logit contributions
umber-4.810
quartered-4.508
esson-4.282
wark-4.157
netflix-4.153
Token leader
Token ID3554
Feature activation+5.63e-01

Pos logit contributions
opsy+3.003
roma+2.956
verse+2.907
 noses+2.734
 manufact+2.670
Neg logit contributions
umber-4.488
quartered-4.175
esson-3.992
netflix-3.964
wark-3.885
Token.
Token ID13
Feature activation-5.07e+00

Pos logit contributions
umber+0.399
quartered+0.371
esson+0.352
netflix+0.337
wark+0.336
Neg logit contributions
roma-0.386
opsy-0.380
verse-0.350
 noses-0.345
 manufact-0.336
Token
Token ID447
Feature activation-1.25e+00

Pos logit contributions
roma+3.035
opsy+2.956
verse+2.820
 noses+2.681
iq+2.593
Neg logit contributions
umber-3.908
quartered-3.707
esson-3.420
netflix-3.342
wark-3.298
Token
Token ID251
Feature activation+5.02e-01

Pos logit contributions
roma+0.965
opsy+0.953
verse+0.889
 noses+0.874
 manufact+0.853
Neg logit contributions
umber-0.782
quartered-0.718
esson-0.674
netflix-0.646
wark-0.641
TokenCl
Token ID2601
Feature activation+1.99e+01

Pos logit contributions
umber+0.379
quartered+0.354
esson+0.337
netflix+0.324
wark+0.324
Neg logit contributions
roma-0.319
opsy-0.315
verse-0.288
 noses-0.284
 manufact-0.276
Tokenarks
Token ID5558
Feature activation+1.13e+01

Pos logit contributions
umber+4.521
esson+2.460
arity+1.991
lav+1.577
arks+1.531
Neg logit contributions
 metic-21.857
 Guan-21.398
 noses-21.353
roma-21.086
 unden-20.786
Tokenon
Token ID261
Feature activation+1.04e+00

Pos logit contributions
umber+5.779
quartered+5.221
esson+4.860
iewicz+4.545
lav+4.493
Neg logit contributions
roma-9.465
opsy-9.305
 noses-8.684
verse-8.633
 atmosp-8.363
Token added
Token ID2087
Feature activation+9.88e-01

Pos logit contributions
umber+0.812
quartered+0.758
esson+0.725
lav+0.695
netflix+0.695
Neg logit contributions
roma-0.636
opsy-0.627
verse-0.570
 noses-0.560
 manufact-0.546
Token:
Token ID25
Feature activation-2.48e+00

Pos logit contributions
umber+0.691
quartered+0.642
esson+0.607
lav+0.581
netflix+0.581
Neg logit contributions
roma-0.683
opsy-0.675
verse-0.623
 noses-0.613
 manufact-0.599
Token
Token ID564
Feature activation-1.59e+00

Pos logit contributions
roma+1.527
opsy+1.500
verse+1.388
 noses+1.351
 manufact+1.302
Neg logit contributions
umber-1.907
quartered-1.782
esson-1.694
netflix-1.636
wark-1.632
Token.
Token ID13
Feature activation-7.01e+00

Pos logit contributions
roma+5.582
opsy+5.263
verse+5.108
 noses+4.736
iq+4.720
Neg logit contributions
umber-9.676
quartered-9.475
esson-8.791
Constructed-8.781
netflix-8.630
Token All
Token ID1439
Feature activation-9.08e-01

Pos logit contributions
roma+4.189
verse+4.079
opsy+4.055
 noses+3.679
 Aut+3.601
Neg logit contributions
umber-5.313
quartered-5.098
esson-4.664
netflix-4.527
lav-4.525
Token Rights
Token ID6923
Feature activation-6.14e-01

Pos logit contributions
roma+0.552
opsy+0.543
verse+0.501
 noses+0.490
 manufact+0.475
Neg logit contributions
umber-0.709
quartered-0.661
esson-0.632
netflix-0.615
wark-0.611
Token Reserved
Token ID33876
Feature activation-8.48e-01

Pos logit contributions
roma+0.382
opsy+0.377
verse+0.348
 noses+0.340
 manufact+0.330
Neg logit contributions
umber-0.470
quartered-0.439
esson-0.419
netflix-0.405
wark-0.402
Token.
Token ID13
Feature activation+2.98e+00

Pos logit contributions
roma+0.561
opsy+0.553
verse+0.513
 noses+0.500
 manufact+0.487
Neg logit contributions
umber-0.616
quartered-0.576
esson-0.544
netflix-0.524
wark-0.522
Token This
Token ID770
Feature activation+1.99e+01

Pos logit contributions
umber+1.973
quartered+1.794
esson+1.703
wark+1.651
netflix+1.641
Neg logit contributions
roma-2.179
opsy-2.153
 noses-1.968
verse-1.953
 manufact-1.931
Token material
Token ID2587
Feature activation+9.25e+00

Pos logit contributions
umber+4.281
quartered+2.516
esson+1.899
+1.891
izo+1.871
Neg logit contributions
opsy-20.533
roma-20.233
rique-18.990
 noses-18.824
 unden-18.664
Token may
Token ID743
Feature activation-4.63e-01

Pos logit contributions
umber+3.657
quartered+2.861
esson+2.694
iewicz+2.532
netflix+2.526
Neg logit contributions
roma-9.042
opsy-8.779
 metic-8.329
 noses-8.265
 manufact-8.091
Token not
Token ID407
Feature activation+2.75e+00

Pos logit contributions
roma+0.319
opsy+0.315
verse+0.292
 noses+0.286
 manufact+0.279
Neg logit contributions
umber-0.325
quartered-0.301
esson-0.286
netflix-0.274
wark-0.274
Token be
Token ID307
Feature activation-1.49e+00

Pos logit contributions
umber+1.894
quartered+1.712
esson+1.633
lav+1.567
wark+1.561
Neg logit contributions
roma-1.966
opsy-1.907
 noses-1.760
verse-1.758
 manufact-1.725
Token published
Token ID3199
Feature activation-3.62e-01

Pos logit contributions
roma+1.194
opsy+1.188
verse+1.112
 noses+1.086
 manufact+1.071
Neg logit contributions
umber-0.875
quartered-0.806
esson-0.754
netflix-0.726
wark-0.717
Token goals
Token ID4661
Feature activation-3.52e+00

Pos logit contributions
umber+2.773
quartered+2.520
esson+2.404
lav+2.268
+2.265
Neg logit contributions
roma-3.141
opsy-3.119
verse-2.814
 noses-2.768
 manufact-2.730
Token,
Token ID11
Feature activation+4.51e-01

Pos logit contributions
roma+2.210
opsy+2.171
verse+1.995
 noses+1.963
 manufact+1.896
Neg logit contributions
umber-2.617
quartered-2.482
esson-2.330
netflix-2.260
wark-2.242
Token 10
Token ID838
Feature activation+8.24e+00

Pos logit contributions
umber+0.307
quartered+0.284
esson+0.269
netflix+0.257
wark+0.257
Neg logit contributions
roma-0.321
opsy-0.317
verse-0.293
 noses-0.288
 manufact-0.281
Token assists
Token ID13648
Feature activation-1.71e+00

Pos logit contributions
umber+5.549
esson+5.086
quartered+4.995
wark+4.612
lav+4.612
Neg logit contributions
opsy-5.743
roma-5.658
 manufact-4.942
verse-4.884
 noses-4.884
Token in
Token ID287
Feature activation-2.43e+00

Pos logit contributions
roma+1.115
opsy+1.098
verse+1.015
 noses+1.000
 manufact+0.968
Neg logit contributions
umber-1.254
quartered-1.177
esson-1.107
netflix-1.070
lav-1.061
Token 31
Token ID3261
Feature activation+1.94e+01

Pos logit contributions
roma+1.549
opsy+1.528
verse+1.414
 noses+1.399
 manufact+1.346
Neg logit contributions
umber-1.806
quartered-1.699
esson-1.597
netflix-1.548
lav-1.541
Token games
Token ID1830
Feature activation+1.67e+00

Pos logit contributions
umber+9.364
esson+8.404
quartered+8.189
+7.293
lav+7.148
Neg logit contributions
roma-14.639
opsy-14.098
verse-12.942
 noses-12.621
 manufact-12.245
Token this
Token ID428
Feature activation+5.77e+00

Pos logit contributions
umber+1.164
quartered+1.080
esson+1.024
wark+0.977
netflix+0.977
Neg logit contributions
roma-1.168
opsy-1.151
verse-1.060
 noses-1.046
 manufact-1.016
Token year
Token ID614
Feature activation-4.15e-01

Pos logit contributions
umber+3.933
quartered+3.642
esson+3.484
netflix+3.304
Constructed+3.272
Neg logit contributions
roma-4.045
opsy-3.934
 noses-3.569
verse-3.558
 manufact-3.494
Token.
Token ID13
Feature activation-2.07e+00

Pos logit contributions
roma+0.278
opsy+0.275
verse+0.253
 noses+0.249
 manufact+0.243
Neg logit contributions
umber-0.299
quartered-0.278
esson-0.264
netflix-0.254
wark-0.253
Token He
Token ID679
Feature activation+3.06e+00

Pos logit contributions
roma+1.262
opsy+1.238
verse+1.150
 noses+1.116
 manufact+1.075
Neg logit contributions
umber-1.610
quartered-1.506
esson-1.424
netflix-1.376
wark-1.371
TokenObviously
Token ID20670
Feature activation-6.97e-01

Pos logit contributions
umber+2.558
quartered+2.323
esson+2.244
Constructed+2.173
netflix+2.156
Neg logit contributions
roma-2.595
opsy-2.568
 noses-2.362
verse-2.306
 manufact-2.290
Token,
Token ID11
Feature activation+1.87e+00

Pos logit contributions
roma+0.467
opsy+0.462
verse+0.426
 noses+0.420
 manufact+0.408
Neg logit contributions
umber-0.502
quartered-0.467
esson-0.442
netflix-0.425
wark-0.424
Token we
Token ID356
Feature activation-4.16e-01

Pos logit contributions
umber+1.325
quartered+1.226
esson+1.168
netflix+1.118
wark+1.116
Neg logit contributions
roma-1.277
opsy-1.256
verse-1.156
 noses-1.136
 manufact-1.109
Token condemn
Token ID15402
Feature activation+1.19e+00

Pos logit contributions
roma+0.289
opsy+0.285
verse+0.264
 noses+0.260
 manufact+0.253
Neg logit contributions
umber-0.290
quartered-0.269
esson-0.254
netflix-0.244
wark-0.243
Token what
Token ID644
Feature activation+8.52e+00

Pos logit contributions
umber+0.835
quartered+0.771
esson+0.733
netflix+0.705
wark+0.702
Neg logit contributions
roma-0.819
opsy-0.808
verse-0.742
 noses-0.734
 manufact-0.713
Token he
Token ID339
Feature activation+1.89e+01

Pos logit contributions
umber+5.483
quartered+4.882
esson+4.713
netflix+4.471
+4.465
Neg logit contributions
roma-6.128
opsy-5.925
 noses-5.503
verse-5.494
 manufact-5.377
Token said
Token ID531
Feature activation+6.49e+00

Pos logit contributions
umber+9.433
esson+8.358
iewicz+7.515
wark+7.510
izo+7.442
Neg logit contributions
opsy-13.548
roma-13.018
 noses-12.442
rique-12.027
rises-11.711
Token.
Token ID13
Feature activation-2.20e+00

Pos logit contributions
umber+4.244
quartered+3.756
esson+3.681
wark+3.476
lav+3.433
Neg logit contributions
roma-4.709
opsy-4.692
 noses-4.279
verse-4.239
 manufact-4.128
Token
Token ID447
Feature activation-2.73e-02

Pos logit contributions
roma+1.448
opsy+1.426
verse+1.334
 noses+1.286
 manufact+1.255
Neg logit contributions
umber-1.596
quartered-1.511
esson-1.400
lav-1.357
netflix-1.353
Token
Token ID251
Feature activation-2.74e+00

Pos logit contributions
roma+0.021
opsy+0.020
verse+0.019
 noses+0.019
 manufact+0.018
Neg logit contributions
umber-0.017
quartered-0.016
esson-0.015
netflix-0.014
wark-0.014
Token
Token ID198
Feature activation-4.79e+00

Pos logit contributions
roma+1.810
opsy+1.779
verse+1.672
 noses+1.609
 manufact+1.567
Neg logit contributions
umber-1.979
quartered-1.864
esson-1.732
netflix-1.680
lav-1.671
TokenHe
Token ID1544
Feature activation+1.05e+00

Pos logit contributions
umber+3.335
quartered+2.990
esson+2.925
netflix+2.841
Constructed+2.834
Neg logit contributions
roma-2.990
opsy-2.944
 noses-2.708
 manufact-2.663
verse-2.650
Tokenath
Token ID776
Feature activation+8.20e-01

Pos logit contributions
umber+0.785
quartered+0.729
esson+0.695
netflix+0.668
lav+0.665
Neg logit contributions
roma-0.676
opsy-0.667
verse-0.614
 noses-0.606
 manufact-0.590
Token Led
Token ID22964
Feature activation+3.79e+00

Pos logit contributions
umber+0.670
quartered+0.628
esson+0.600
netflix+0.579
wark+0.577
Neg logit contributions
roma-0.471
opsy-0.464
verse-0.420
 noses-0.415
 manufact-0.402
Tokenger
Token ID1362
Feature activation-4.55e+00

Pos logit contributions
umber+2.241
quartered+2.021
esson+1.913
wark+1.843
netflix+1.809
Neg logit contributions
roma-3.018
opsy-2.983
 noses-2.805
 manufact-2.748
verse-2.743
Token Sh
Token ID911
Feature activation+5.20e+00

Pos logit contributions
opsy+3.020
roma+3.014
verse+2.813
 noses+2.727
 manufact+2.612
Neg logit contributions
umber-3.210
quartered-3.068
esson-2.861
netflix-2.749
wark-2.748
Tokenr
Token ID81
Feature activation+1.89e+01

Pos logit contributions
umber+3.867
quartered+3.472
esson+3.383
lav+3.281
izo+3.265
Neg logit contributions
roma-3.263
opsy-3.214
verse-2.998
 manufact-2.966
 noses-2.947
Tokenink
Token ID676
Feature activation-2.98e+00

Pos logit contributions
umber+10.247
esson+7.806
izo+7.683
quartered+7.620
ins+7.459
Neg logit contributions
roma-13.533
opsy-12.940
 manufact-12.853
 dime-12.541
verse-12.489
Token-
Token ID12
Feature activation+5.06e+00

Pos logit contributions
roma+1.968
opsy+1.961
verse+1.808
 noses+1.768
 manufact+1.708
Neg logit contributions
umber-2.120
quartered-1.992
esson-1.890
netflix-1.838
wark-1.808
TokenRAM
Token ID24115
Feature activation-2.43e+00

Pos logit contributions
umber+3.671
quartered+3.281
esson+3.213
Constructed+3.119
netflix+3.118
Neg logit contributions
roma-3.302
opsy-3.253
 noses-3.004
 manufact-2.954
verse-2.919
TokenONE
Token ID11651
Feature activation-1.04e+01

Pos logit contributions
roma+1.581
opsy+1.568
verse+1.444
 noses+1.415
 manufact+1.366
Neg logit contributions
umber-1.782
quartered-1.684
esson-1.580
netflix-1.533
wark-1.517
Token 2008
Token ID3648
Feature activation-7.94e-01

Pos logit contributions
opsy+6.284
roma+6.259
verse+6.005
 noses+5.689
iq+5.546
Neg logit contributions
quartered-7.290
umber-7.229
esson-6.481
netflix-6.371
lav-6.271
Tokenz
Token ID89
Feature activation-1.20e+00

Pos logit contributions
umber+1.604
quartered+1.476
esson+1.398
lav+1.338
izo+1.336
Neg logit contributions
roma-1.893
opsy-1.872
 noses-1.731
verse-1.720
 manufact-1.689
Tokenr
Token ID81
Feature activation+5.23e+00

Pos logit contributions
roma+0.881
opsy+0.865
verse+0.807
 noses+0.791
 manufact+0.773
Neg logit contributions
umber-0.795
quartered-0.740
esson-0.693
netflix-0.667
wark-0.662
Token IV
Token ID8363
Feature activation+6.44e+00

Pos logit contributions
umber+4.077
quartered+3.781
esson+3.585
Constructed+3.485
lav+3.475
Neg logit contributions
roma-3.154
opsy-3.117
verse-2.748
 noses-2.723
 manufact-2.551
Token Thermal
Token ID41590
Feature activation-2.13e+00

Pos logit contributions
umber+4.413
quartered+4.021
esson+3.725
wark+3.663
Constructed+3.653
Neg logit contributions
opsy-4.436
roma-4.418
verse-3.983
 noses-3.902
 manufact-3.720
Token Design
Token ID8495
Feature activation+4.66e+00

Pos logit contributions
roma+1.102
opsy+1.087
verse+0.984
 noses+0.956
 manufact+0.930
Neg logit contributions
umber-1.850
quartered-1.751
esson-1.685
netflix-1.632
wark-1.619
Token with
Token ID351
Feature activation+1.82e+01

Pos logit contributions
umber+3.289
quartered+3.015
esson+2.809
wark+2.756
lav+2.741
Neg logit contributions
roma-3.170
opsy-3.170
verse-2.862
 noses-2.831
 manufact-2.730
Token Dust
Token ID16240
Feature activation+7.20e+00

Pos logit contributions
umber+11.072
quartered+9.740
Constructed+9.495
lav+9.391
wark+9.297
Neg logit contributions
opsy-11.072
roma-10.720
verse-9.509
 noses-9.359
rique-9.287
Token Removal
Token ID46209
Feature activation+7.70e+00

Pos logit contributions
umber+4.337
quartered+3.902
esson+3.639
lav+3.602
Constructed+3.543
Neg logit contributions
opsy-5.407
roma-5.389
verse-4.934
 noses-4.907
 manufact-4.724
Token Technology
Token ID8987
Feature activation+5.81e+00

Pos logit contributions
umber+5.015
quartered+4.519
Constructed+4.175
wark+4.161
esson+4.151
Neg logit contributions
opsy-5.478
roma-5.424
verse-4.931
 noses-4.812
iq-4.721
Token.
Token ID13
Feature activation+4.17e+00

Pos logit contributions
umber+3.777
quartered+3.447
esson+3.180
wark+3.150
Constructed+3.113
Neg logit contributions
roma-4.219
opsy-4.201
verse-3.835
 noses-3.752
 manufact-3.631
TokenThe
Token ID464
Feature activation+2.27e+00

Pos logit contributions
umber+2.821
quartered+2.572
esson+2.447
Constructed+2.407
wark+2.377
Neg logit contributions
opsy-2.947
roma-2.935
verse-2.659
 noses-2.658
 manufact-2.558
Token she
Token ID673
Feature activation+3.24e+00

Pos logit contributions
umber+7.091
quartered+6.269
esson+6.092
netflix+5.826
wark+5.754
Neg logit contributions
roma-7.429
opsy-7.052
verse-6.276
 manufact-6.245
 noses-6.239
Token almost
Token ID2048
Feature activation+6.01e+00

Pos logit contributions
umber+2.184
quartered+1.976
esson+1.911
wark+1.821
netflix+1.815
Neg logit contributions
roma-2.307
opsy-2.272
verse-2.085
 noses-2.034
 manufact-2.015
Token has
Token ID468
Feature activation+8.03e+00

Pos logit contributions
umber+4.037
quartered+3.650
esson+3.540
lav+3.383
wark+3.373
Neg logit contributions
roma-4.220
opsy-4.163
verse-3.705
 manufact-3.575
iq-3.557
Token that
Token ID326
Feature activation+1.39e+01

Pos logit contributions
umber+5.053
quartered+4.525
esson+4.268
netflix+4.114
izo+4.099
Neg logit contributions
roma-5.935
opsy-5.791
verse-5.143
 manufact-5.061
 noses-5.039
Token Batman
Token ID9827
Feature activation+1.28e+01

Pos logit contributions
umber+9.190
quartered+8.202
esson+7.780
wark+7.534
lav+7.432
Neg logit contributions
roma-8.810
opsy-8.725
 noses-7.340
rique-7.327
Prop-7.245
Token female
Token ID4048
Feature activation+1.82e+01

Pos logit contributions
umber+8.442
quartered+7.582
esson+7.491
wark+7.139
netflix+7.128
Neg logit contributions
roma-7.985
opsy-7.940
 noses-6.912
rique-6.743
 manufact-6.706
Token origin
Token ID8159
Feature activation+9.54e+00

Pos logit contributions
umber+10.879
quartered+9.762
izo+9.322
esson+9.275
wark+9.245
Neg logit contributions
roma-11.125
opsy-11.099
Prop-9.348
rique-9.304
 noses-9.148
Token.
Token ID13
Feature activation+7.64e+00

Pos logit contributions
umber+5.835
quartered+5.124
esson+5.020
wark+4.793
izo+4.720
Neg logit contributions
roma-7.044
opsy-6.808
 noses-6.261
 manufact-6.140
verse-6.008
Token She
Token ID1375
Feature activation+3.93e+00

Pos logit contributions
umber+4.670
esson+4.077
quartered+3.849
netflix+3.831
wark+3.823
Neg logit contributions
roma-5.777
opsy-5.735
 manufact-5.201
 noses-5.175
verse-5.002
Token shares
Token ID7303
Feature activation+6.41e+00

Pos logit contributions
umber+2.618
quartered+2.361
esson+2.296
lav+2.163
wark+2.163
Neg logit contributions
roma-2.818
opsy-2.791
verse-2.541
 manufact-2.475
 noses-2.474
Token that
Token ID326
Feature activation+1.46e+01

Pos logit contributions
umber+4.201
quartered+3.726
esson+3.566
izo+3.400
wark+3.400
Neg logit contributions
roma-4.646
opsy-4.588
verse-4.098
 manufact-4.051
 noses-4.014
Token flagged
Token ID34060
Feature activation+8.19e-01

Pos logit contributions
umber+3.087
quartered+2.837
esson+2.666
wark+2.586
netflix+2.581
Neg logit contributions
opsy-3.055
roma-3.039
verse-2.728
 noses-2.691
 manufact-2.625
Token by
Token ID416
Feature activation+1.46e+00

Pos logit contributions
umber+0.569
quartered+0.526
esson+0.498
netflix+0.479
wark+0.477
Neg logit contributions
roma-0.573
opsy-0.564
verse-0.520
 noses-0.513
 manufact-0.501
Token the
Token ID262
Feature activation-1.13e+00

Pos logit contributions
umber+1.011
quartered+0.933
esson+0.888
netflix+0.851
wark+0.848
Neg logit contributions
roma-1.018
opsy-1.002
verse-0.920
 noses-0.908
 manufact-0.889
Token US
Token ID1294
Feature activation+5.86e-01

Pos logit contributions
roma+0.768
opsy+0.759
verse+0.705
 noses+0.693
 manufact+0.672
Neg logit contributions
umber-0.806
quartered-0.753
esson-0.711
netflix-0.683
wark-0.681
Token Anti
Token ID12297
Feature activation+5.47e+00

Pos logit contributions
umber+0.349
quartered+0.318
esson+0.298
netflix+0.284
wark+0.282
Neg logit contributions
roma-0.467
opsy-0.461
verse-0.430
 noses-0.425
 manufact-0.416
Token-
Token ID12
Feature activation+1.81e+01

Pos logit contributions
umber+3.784
quartered+3.366
esson+3.326
netflix+3.170
wark+3.164
Neg logit contributions
roma-3.867
opsy-3.665
 noses-3.391
 manufact-3.319
verse-3.307
TokenD
Token ID35
Feature activation+3.73e+00

Pos logit contributions
umber+6.886
netflix+5.534
esson+5.507
Desk+5.061
quartered+5.046
Neg logit contributions
roma-15.782
 noses-15.395
opsy-15.300
 sugg-14.475
 manufact-14.444
Tokenoping
Token ID15816
Feature activation+7.57e+00

Pos logit contributions
umber+2.601
quartered+2.307
esson+2.260
netflix+2.174
lav+2.156
Neg logit contributions
roma-2.567
opsy-2.533
 noses-2.388
verse-2.308
 manufact-2.291
Token Agency
Token ID7732
Feature activation+5.09e+00

Pos logit contributions
umber+4.475
quartered+3.978
esson+3.717
wark+3.688
izo+3.540
Neg logit contributions
roma-5.985
opsy-5.894
verse-5.325
iq-5.210
 noses-5.189
Token once
Token ID1752
Feature activation-6.18e+00

Pos logit contributions
umber+3.404
quartered+3.113
esson+2.981
wark+2.891
netflix+2.874
Neg logit contributions
roma-3.617
opsy-3.526
verse-3.220
 noses-3.168
 manufact-3.146
Token again
Token ID757
Feature activation-1.11e+00

Pos logit contributions
opsy+2.719
roma+2.683
verse+2.472
 noses+2.432
 manufact+2.285
Neg logit contributions
umber-5.568
quartered-5.400
esson-5.111
wark-5.031
lav-4.972
Token to
Token ID284
Feature activation+2.22e+00

Pos logit contributions
umber+3.258
quartered+3.000
esson+2.778
wark+2.708
lav+2.694
Neg logit contributions
roma-3.944
opsy-3.869
verse-3.560
 noses-3.442
 manufact-3.394
Token a
Token ID257
Feature activation-3.12e-01

Pos logit contributions
umber+1.545
quartered+1.430
esson+1.358
wark+1.299
lav+1.295
Neg logit contributions
roma-1.543
opsy-1.517
verse-1.403
 noses-1.373
 manufact-1.333
Token Stanley
Token ID14167
Feature activation+7.50e+00

Pos logit contributions
roma+0.211
opsy+0.209
verse+0.193
 noses+0.190
 manufact+0.185
Neg logit contributions
umber-0.222
quartered-0.207
esson-0.196
netflix-0.188
wark-0.188
Token Cup
Token ID5454
Feature activation+1.50e+00

Pos logit contributions
umber+4.114
quartered+3.479
esson+3.411
izo+3.235
lav+3.184
Neg logit contributions
roma-6.201
opsy-6.064
verse-5.679
 noses-5.660
 manufact-5.544
Token.
Token ID13
Feature activation+4.87e+00

Pos logit contributions
umber+1.236
quartered+1.153
esson+1.109
lav+1.068
wark+1.067
Neg logit contributions
roma-0.844
opsy-0.827
verse-0.751
 noses-0.738
 manufact-0.710
Token Alex
Token ID4422
Feature activation+1.81e+01

Pos logit contributions
umber+3.303
esson+2.911
quartered+2.895
wark+2.762
Constructed+2.750
Neg logit contributions
roma-3.455
opsy-3.414
 noses-3.069
 manufact-3.036
verse-3.023
Token O
Token ID440
Feature activation+4.97e+00

Pos logit contributions
umber+9.208
esson+7.946
lav+7.935
quartered+7.560
+7.439
Neg logit contributions
roma-13.037
opsy-12.763
 manufact-11.926
 sugg-11.643
=-=-=-=-=-=-=-=--11.562
Tokenve
Token ID303
Feature activation-1.28e-01

Pos logit contributions
umber+4.107
esson+3.737
quartered+3.715
izo+3.619
wark+3.583
Neg logit contributions
roma-2.784
opsy-2.656
 noses-2.480
verse-2.378
 manufact-2.362
Tokench
Token ID354
Feature activation+5.07e+00

Pos logit contributions
roma+0.075
opsy+0.074
verse+0.067
 noses+0.066
 manufact+0.064
Neg logit contributions
umber-0.103
quartered-0.097
esson-0.092
netflix-0.089
wark-0.089
Tokenkin
Token ID5116
Feature activation+1.39e+00

Pos logit contributions
umber+5.110
quartered+4.720
esson+4.692
lav+4.633
inka+4.508
Neg logit contributions
roma-1.903
opsy-1.901
 noses-1.580
verse-1.578
 manufact-1.572
Token hasn
Token ID5818
Feature activation-1.04e+00

Pos logit contributions
umber+0.992
quartered+0.914
esson+0.872
lav+0.833
netflix+0.830
Neg logit contributions
roma-0.945
opsy-0.932
verse-0.857
 noses-0.839
 manufact-0.821
Token tight
Token ID5381
Feature activation-2.21e+00

Pos logit contributions
umber+4.877
quartered+4.474
esson+4.235
Constructed+4.130
wark+4.019
Neg logit contributions
opsy-5.906
roma-5.863
verse-5.173
 manufact-5.047
iq-5.022
Token ends
Token ID5645
Feature activation+4.68e+00

Pos logit contributions
roma+1.214
opsy+1.197
verse+1.093
 noses+1.086
 manufact+1.027
Neg logit contributions
umber-1.851
quartered-1.737
esson-1.661
netflix-1.614
wark-1.609
Token with
Token ID351
Feature activation+1.06e+01

Pos logit contributions
umber+3.114
quartered+2.789
esson+2.692
wark+2.581
lav+2.539
Neg logit contributions
roma-3.427
opsy-3.366
verse-3.102
 noses-3.007
 manufact-2.988
Token at
Token ID379
Feature activation+3.42e+00

Pos logit contributions
umber+6.218
quartered+5.563
esson+5.263
Constructed+5.119
netflix+5.104
Neg logit contributions
opsy-7.827
roma-7.785
verse-7.034
 manufact-6.846
 noses-6.651
Token least
Token ID1551
Feature activation+6.41e+00

Pos logit contributions
umber+1.995
quartered+1.775
esson+1.682
lav+1.632
netflix+1.612
Neg logit contributions
roma-2.747
opsy-2.722
verse-2.517
 noses-2.470
 manufact-2.454
Token 50
Token ID2026
Feature activation+1.80e+01

Pos logit contributions
umber+3.950
quartered+3.519
esson+3.408
netflix+3.279
Constructed+3.263
Neg logit contributions
roma-4.850
opsy-4.782
verse-4.316
 manufact-4.208
 noses-4.194
Token touches
Token ID18105
Feature activation+9.61e+00

Pos logit contributions
umber+9.549
quartered+8.629
esson+8.541
Constructed+8.356
izo+8.044
Neg logit contributions
opsy-12.250
roma-11.900
verse-10.747
iq-10.307
 manufact-10.286
Token on
Token ID319
Feature activation+2.37e+00

Pos logit contributions
umber+5.647
quartered+4.891
esson+4.796
Constructed+4.772
+4.631
Neg logit contributions
opsy-7.361
roma-7.343
verse-6.564
 manufact-6.323
 noses-6.301
Token the
Token ID262
Feature activation+9.18e+00

Pos logit contributions
umber+1.639
quartered+1.503
esson+1.424
lav+1.366
Constructed+1.366
Neg logit contributions
roma-1.667
opsy-1.648
verse-1.511
 noses-1.471
 manufact-1.453
Token year
Token ID614
Feature activation-1.25e+00

Pos logit contributions
umber+4.923
quartered+4.571
esson+4.097
lav+4.035
netflix+3.987
Neg logit contributions
roma-7.610
opsy-7.390
iq-6.688
verse-6.560
 manufact-6.502
Token.
Token ID13
Feature activation+5.15e+00

Pos logit contributions
roma+0.818
opsy+0.806
verse+0.744
 noses+0.733
 manufact+0.709
Neg logit contributions
umber-0.910
quartered-0.852
esson-0.807
netflix-0.777
lav-0.773
Token,
Token ID11
Feature activation-4.47e-01

Pos logit contributions
umber+2.164
quartered+1.972
esson+1.895
lav+1.809
izo+1.797
Neg logit contributions
roma-2.325
opsy-2.296
verse-2.102
 noses-2.066
 manufact-2.039
Token so
Token ID523
Feature activation+9.94e+00

Pos logit contributions
roma+0.290
opsy+0.287
verse+0.263
 noses+0.259
 manufact+0.252
Neg logit contributions
umber-0.332
quartered-0.309
esson-0.294
netflix-0.283
wark-0.282
Token I
Token ID314
Feature activation+4.35e+00

Pos logit contributions
umber+6.203
quartered+5.644
esson+5.370
lav+5.185
izo+5.078
Neg logit contributions
roma-7.170
opsy-6.952
verse-6.504
 noses-6.195
 manufact-6.139
Token wanted
Token ID2227
Feature activation+1.16e+01

Pos logit contributions
umber+3.010
quartered+2.752
esson+2.656
lav+2.520
wark+2.509
Neg logit contributions
roma-3.000
opsy-2.960
verse-2.743
 noses-2.643
 manufact-2.595
Token to
Token ID284
Feature activation+3.10e+00

Pos logit contributions
umber+6.341
quartered+5.661
lav+5.419
esson+5.388
+5.267
Neg logit contributions
roma-8.677
opsy-8.604
verse-8.212
 manufact-7.811
 noses-7.706
Token make
Token ID787
Feature activation+1.78e+01

Pos logit contributions
umber+2.196
quartered+2.008
esson+1.927
wark+1.855
lav+1.849
Neg logit contributions
roma-2.107
opsy-2.079
verse-1.898
 noses-1.857
 manufact-1.830
Token sure
Token ID1654
Feature activation+1.32e+01

Pos logit contributions
umber+8.690
izo+7.614
lav+7.559
quartered+7.490
+7.356
Neg logit contributions
roma-12.638
opsy-12.478
verse-11.499
 manufact-11.031
dim-10.905
Token people
Token ID661
Feature activation+5.72e+00

Pos logit contributions
umber+7.484
quartered+6.899
lav+6.680
esson+6.500
wark+6.098
Neg logit contributions
roma-9.401
opsy-9.110
verse-8.542
 noses-8.191
 manufact-8.169
Token weren
Token ID6304
Feature activation+3.55e+00

Pos logit contributions
umber+3.668
quartered+3.336
esson+3.225
lav+3.074
netflix+3.038
Neg logit contributions
roma-4.175
opsy-4.079
verse-3.787
 noses-3.664
 manufact-3.623
Token
Token ID447
Feature activation+1.53e+00

Pos logit contributions
umber+3.050
quartered+2.745
esson+2.741
Constructed+2.650
wark+2.632
Neg logit contributions
roma-1.901
opsy-1.873
 noses-1.665
 manufact-1.649
verse-1.634
Token
Token ID247
Feature activation-1.64e+00

Pos logit contributions
umber+0.690
quartered+0.602
esson+0.558
lav+0.517
wark+0.513
Neg logit contributions
roma-1.438
opsy-1.427
verse-1.350
 noses-1.333
 manufact-1.313
Token has
Token ID468
Feature activation-1.07e+00

Pos logit contributions
roma+0.533
opsy+0.526
verse+0.485
 noses+0.478
 manufact+0.465
Neg logit contributions
umber-0.603
quartered-0.563
esson-0.533
netflix-0.514
wark-0.511
Token been
Token ID587
Feature activation-6.16e-02

Pos logit contributions
roma+0.685
opsy+0.673
verse+0.623
 noses+0.616
 manufact+0.596
Neg logit contributions
umber-0.806
quartered-0.751
esson-0.713
netflix-0.689
lav-0.686
Token charged
Token ID5047
Feature activation+5.66e-02

Pos logit contributions
roma+0.041
opsy+0.041
verse+0.038
 noses+0.037
 manufact+0.036
Neg logit contributions
umber-0.044
quartered-0.041
esson-0.039
netflix-0.037
wark-0.037
Token with
Token ID351
Feature activation+7.50e+00

Pos logit contributions
umber+0.040
quartered+0.037
esson+0.035
netflix+0.034
wark+0.033
Neg logit contributions
roma-0.039
opsy-0.038
verse-0.036
 noses-0.035
 manufact-0.034
Token conspiracy
Token ID10086
Feature activation+9.24e+00

Pos logit contributions
umber+4.885
quartered+4.304
esson+4.240
wark+4.023
lav+3.924
Neg logit contributions
roma-5.406
opsy-5.287
verse-4.790
 noses-4.653
iq-4.543
Token to
Token ID284
Feature activation+1.78e+01

Pos logit contributions
umber+5.454
quartered+4.960
esson+4.525
netflix+4.397
wark+4.366
Neg logit contributions
roma-7.104
opsy-6.964
verse-6.492
 noses-6.245
 manufact-6.102
Token bribe
Token ID40879
Feature activation+1.11e+01

Pos logit contributions
umber+9.676
esson+8.204
 regard+7.910
wark+7.905
izo+7.626
Neg logit contributions
roma-12.137
opsy-11.959
rises-10.791
rique-10.604
pheus-10.579
Token a
Token ID257
Feature activation+2.94e+00

Pos logit contributions
umber+6.833
quartered+6.012
esson+5.899
wark+5.581
lav+5.535
Neg logit contributions
roma-7.867
opsy-7.846
verse-7.079
 noses-6.807
iq-6.613
Token foreign
Token ID3215
Feature activation+9.97e+00

Pos logit contributions
umber+2.017
quartered+1.857
esson+1.749
wark+1.692
netflix+1.687
Neg logit contributions
roma-2.060
opsy-2.045
verse-1.851
 noses-1.845
 manufact-1.765
Token public
Token ID1171
Feature activation+7.39e-01

Pos logit contributions
umber+5.991
quartered+5.391
esson+4.934
wark+4.880
lav+4.871
Neg logit contributions
roma-7.538
opsy-7.258
verse-6.583
 noses-6.514
iq-6.321
Token official
Token ID1743
Feature activation+5.06e+00

Pos logit contributions
umber+0.481
quartered+0.442
esson+0.418
wark+0.398
netflix+0.397
Neg logit contributions
roma-0.549
opsy-0.541
verse-0.502
 noses-0.496
 manufact-0.483
Token idea
Token ID2126
Feature activation+3.75e+00

Pos logit contributions
umber+3.456
quartered+3.147
esson+3.027
lav+2.883
wark+2.869
Neg logit contributions
roma-3.283
opsy-3.221
verse-2.882
 noses-2.837
 manufact-2.792
Token to
Token ID284
Feature activation+3.83e+00

Pos logit contributions
umber+2.516
quartered+2.268
esson+2.195
netflix+2.066
wark+2.063
Neg logit contributions
roma-2.713
opsy-2.661
verse-2.439
 noses-2.407
 manufact-2.374
Token put
Token ID1234
Feature activation+8.59e+00

Pos logit contributions
umber+2.667
quartered+2.411
esson+2.318
wark+2.251
izo+2.218
Neg logit contributions
roma-2.646
opsy-2.607
verse-2.360
 noses-2.323
 manufact-2.292
Token him
Token ID683
Feature activation+3.01e+00

Pos logit contributions
umber+5.352
quartered+4.651
esson+4.567
netflix+4.336
wark+4.329
Neg logit contributions
roma-6.332
opsy-6.300
verse-5.618
 manufact-5.450
iq-5.431
Token in
Token ID287
Feature activation+1.27e+01

Pos logit contributions
umber+1.981
quartered+1.796
esson+1.701
netflix+1.644
wark+1.641
Neg logit contributions
roma-2.198
opsy-2.180
verse-2.000
 noses-1.951
 manufact-1.929
Token the
Token ID262
Feature activation+1.78e+01

Pos logit contributions
umber+7.571
quartered+6.718
esson+6.452
wark+6.088
lav+5.929
Neg logit contributions
roma-9.203
opsy-9.059
iq-7.943
 manufact-7.863
rique-7.795
Token book
Token ID1492
Feature activation+1.02e+01

Pos logit contributions
umber+10.952
quartered+9.542
esson+9.444
izo+8.995
wark+8.707
Neg logit contributions
opsy-11.040
roma-10.942
rique-9.268
enz-9.022
iq-9.008
Token but
Token ID475
Feature activation+2.23e+00

Pos logit contributions
umber+6.593
quartered+5.788
esson+5.730
wark+5.340
lav+5.223
Neg logit contributions
roma-7.109
opsy-7.048
 noses-6.420
 manufact-6.243
iq-6.109
Token when
Token ID618
Feature activation+4.90e+00

Pos logit contributions
umber+1.554
quartered+1.433
esson+1.359
netflix+1.298
lav+1.297
Neg logit contributions
roma-1.558
opsy-1.531
verse-1.402
 noses-1.379
 manufact-1.345
Token I
Token ID314
Feature activation-3.30e+00

Pos logit contributions
umber+3.264
quartered+2.935
esson+2.810
netflix+2.686
wark+2.671
Neg logit contributions
roma-3.564
opsy-3.483
verse-3.157
 noses-3.128
 manufact-3.058
Token saw
Token ID2497
Feature activation+9.19e+00

Pos logit contributions
roma+2.053
opsy+2.019
verse+1.849
 noses+1.827
 manufact+1.762
Neg logit contributions
umber-2.525
quartered-2.422
esson-2.245
netflix-2.188
-2.186
Token celebrities
Token ID21305
Feature activation-8.19e-01

Pos logit contributions
umber+5.824
quartered+5.388
esson+5.022
netflix+4.923
lav+4.880
Neg logit contributions
roma-7.339
opsy-7.274
rique-6.443
 noses-6.313
verse-6.256
Token and
Token ID290
Feature activation+4.11e-01

Pos logit contributions
roma+0.554
opsy+0.546
verse+0.507
 noses+0.499
 manufact+0.485
Neg logit contributions
umber-0.585
quartered-0.544
esson-0.514
netflix-0.494
wark-0.493
Token posted
Token ID4481
Feature activation-2.49e+00

Pos logit contributions
umber+0.288
quartered+0.267
esson+0.254
netflix+0.243
wark+0.242
Neg logit contributions
roma-0.284
opsy-0.280
verse-0.258
 noses-0.254
 manufact-0.247
Token them
Token ID606
Feature activation-9.87e-01

Pos logit contributions
roma+1.624
opsy+1.584
verse+1.494
 noses+1.459
 manufact+1.407
Neg logit contributions
umber-1.845
quartered-1.727
esson-1.629
wark-1.562
izo-1.561
Token to
Token ID284
Feature activation-7.76e-01

Pos logit contributions
roma+0.722
opsy+0.712
verse+0.667
 noses+0.655
 manufact+0.638
Neg logit contributions
umber-0.649
quartered-0.601
esson-0.567
netflix-0.539
wark-0.538
Token social
Token ID1919
Feature activation-2.23e+01

Pos logit contributions
roma+0.530
opsy+0.522
verse+0.484
 noses+0.475
 manufact+0.462
Neg logit contributions
umber-0.551
quartered-0.512
esson-0.485
netflix-0.464
wark-0.464
Token media
Token ID2056
Feature activation-4.12e+00

Pos logit contributions
opsy+9.120
verse+8.835
roma+8.589
ose+8.025
iq+7.704
Neg logit contributions
umber-16.915
quartered-16.585
esson-15.986
EStream-15.362
Constructed-14.973
Token sites
Token ID5043
Feature activation-6.18e-01

Pos logit contributions
roma+2.399
opsy+2.339
verse+2.219
 noses+2.147
 manufact+2.060
Neg logit contributions
umber-3.325
quartered-3.111
esson-2.977
netflix-2.864
wark-2.839
Token including
Token ID1390
Feature activation-2.60e+00

Pos logit contributions
roma+0.428
opsy+0.422
verse+0.391
 noses+0.384
 manufact+0.374
Neg logit contributions
umber-0.432
quartered-0.401
esson-0.379
netflix-0.364
wark-0.362
Token Reddit
Token ID10750
Feature activation-3.08e+00

Pos logit contributions
roma+1.787
opsy+1.750
verse+1.643
 noses+1.610
 manufact+1.556
Neg logit contributions
umber-1.823
quartered-1.707
esson-1.609
wark-1.529
lav-1.529
Token and
Token ID290
Feature activation-3.07e+00

Pos logit contributions
roma+2.006
opsy+1.954
verse+1.838
 noses+1.784
 manufact+1.715
Neg logit contributions
umber-2.285
quartered-2.141
esson-2.021
Constructed-1.939
netflix-1.933
Token not
Token ID407
Feature activation+2.74e+00

Pos logit contributions
roma+0.630
opsy+0.624
verse+0.577
 noses+0.569
 manufact+0.554
Neg logit contributions
umber-0.651
quartered-0.605
esson-0.574
netflix-0.551
wark-0.551
Token all
Token ID477
Feature activation-4.90e-01

Pos logit contributions
umber+1.940
quartered+1.800
esson+1.697
netflix+1.640
lav+1.636
Neg logit contributions
roma-1.870
opsy-1.840
verse-1.681
 noses-1.650
 manufact-1.608
Token of
Token ID286
Feature activation+1.08e-02

Pos logit contributions
roma+0.332
opsy+0.328
verse+0.303
 noses+0.299
 manufact+0.291
Neg logit contributions
umber-0.349
quartered-0.323
esson-0.307
netflix-0.295
wark-0.294
Token them
Token ID606
Feature activation+2.11e+00

Pos logit contributions
umber+0.008
quartered+0.007
esson+0.007
netflix+0.006
wark+0.006
Neg logit contributions
roma-0.007
opsy-0.007
verse-0.007
 noses-0.007
 manufact-0.006
Token are
Token ID389
Feature activation+1.73e+00

Pos logit contributions
umber+1.481
quartered+1.374
esson+1.297
wark+1.246
lav+1.245
Neg logit contributions
roma-1.453
opsy-1.430
verse-1.314
 noses-1.291
 manufact-1.259
Token C
Token ID327
Feature activation-2.19e+01

Pos logit contributions
umber+1.226
quartered+1.149
esson+1.082
lav+1.037
wark+1.034
Neg logit contributions
roma-1.167
opsy-1.154
verse-1.056
 noses-1.035
 manufact-1.002
Tokeneph
Token ID27446
Feature activation-5.30e+00

Pos logit contributions
roma+13.785
opsy+13.178
verse+12.561
ops+12.506
ose+12.061
Neg logit contributions
EStream-11.261
quartered-10.978
netflix-10.669
umber-10.520
-9.903
Tokenaly
Token ID3400
Feature activation+8.62e-01

Pos logit contributions
roma+3.539
opsy+3.487
verse+3.180
 noses+3.108
 manufact+3.093
Neg logit contributions
umber-3.749
quartered-3.569
esson-3.273
netflix-3.267
wark-3.221
Tokenx
Token ID87
Feature activation-3.86e+00

Pos logit contributions
umber+0.644
quartered+0.601
esson+0.572
netflix+0.549
wark+0.549
Neg logit contributions
roma-0.555
opsy-0.546
verse-0.501
 noses-0.492
 manufact-0.478
Token and
Token ID290
Feature activation-2.46e+00

Pos logit contributions
opsy+2.440
roma+2.435
verse+2.261
 noses+2.228
 manufact+2.182
Neg logit contributions
umber-2.868
quartered-2.672
esson-2.538
netflix-2.495
wark-2.457
Token their
Token ID511
Feature activation-3.55e+00

Pos logit contributions
roma+1.635
opsy+1.620
verse+1.497
 noses+1.475
 manufact+1.445
Neg logit contributions
umber-1.780
quartered-1.649
esson-1.569
netflix-1.521
wark-1.503
Token movie
Token ID3807
Feature activation-5.12e+00

Pos logit contributions
roma+1.923
opsy+1.878
verse+1.769
 noses+1.697
 manufact+1.671
Neg logit contributions
umber-2.400
quartered-2.237
esson-2.125
wark-2.059
netflix-2.045
Token in
Token ID287
Feature activation+3.62e+00

Pos logit contributions
roma+3.169
opsy+3.094
verse+2.969
 noses+2.821
 manufact+2.791
Neg logit contributions
umber-3.902
quartered-3.567
esson-3.416
wark-3.315
lav-3.314
Token the
Token ID262
Feature activation+5.07e+00

Pos logit contributions
umber+2.430
quartered+2.239
esson+2.114
netflix+2.052
wark+1.999
Neg logit contributions
roma-2.604
opsy-2.576
verse-2.329
 noses-2.314
 manufact-2.253
Token age
Token ID2479
Feature activation-2.43e+00

Pos logit contributions
umber+3.469
quartered+3.226
esson+3.035
netflix+2.979
Constructed+2.926
Neg logit contributions
roma-3.487
opsy-3.463
 noses-3.063
verse-3.045
 manufact-2.963
Token of
Token ID286
Feature activation-1.55e+00

Pos logit contributions
roma+1.595
opsy+1.567
verse+1.465
 noses+1.423
 manufact+1.391
Neg logit contributions
umber-1.772
quartered-1.641
esson-1.569
wark-1.500
lav-1.494
Token social
Token ID1919
Feature activation-2.13e+01

Pos logit contributions
roma+1.052
opsy+1.034
verse+0.964
 noses+0.947
 manufact+0.921
Neg logit contributions
umber-1.104
quartered-1.024
esson-0.972
wark-0.933
lav-0.928
Token media
Token ID2056
Feature activation-9.11e+00

Pos logit contributions
opsy+7.529
verse+7.494
roma+7.371
 atmosp+6.793
 propri+6.781
Neg logit contributions
umber-17.507
quartered-16.859
lav-15.826
wark-15.704
esson-15.704
Token,
Token ID11
Feature activation-3.07e+00

Pos logit contributions
roma+4.824
opsy+4.781
verse+4.657
 noses+4.446
 manufact+4.383
Neg logit contributions
umber-7.276
quartered-6.914
esson-6.474
wark-6.336
lav-6.320
Token and
Token ID290
Feature activation-5.43e+00

Pos logit contributions
roma+1.911
opsy+1.871
verse+1.763
 noses+1.718
 manufact+1.673
Neg logit contributions
umber-2.335
quartered-2.189
esson-2.068
wark-1.992
lav-1.985
Token it
Token ID340
Feature activation-1.03e+00

Pos logit contributions
roma+3.272
opsy+3.199
verse+3.061
 noses+2.973
 manufact+2.923
Neg logit contributions
umber-4.127
quartered-3.861
esson-3.662
wark-3.567
Constructed-3.529
Token's
Token ID338
Feature activation+5.26e-01

Pos logit contributions
roma+0.686
opsy+0.676
verse+0.627
 noses+0.618
 manufact+0.599
Neg logit contributions
umber-0.747
quartered-0.697
esson-0.657
netflix-0.632
wark-0.630
Token.
Token ID13
Feature activation-2.26e+00

Pos logit contributions
roma+0.959
opsy+0.944
verse+0.876
 noses+0.864
 manufact+0.836
Neg logit contributions
umber-1.055
quartered-0.985
esson-0.938
netflix-0.901
wark-0.893
Token Estimates
Token ID47052
Feature activation-1.53e+01

Pos logit contributions
roma+1.458
opsy+1.435
verse+1.337
 noses+1.293
 manufact+1.247
Neg logit contributions
umber-1.672
quartered-1.579
esson-1.478
netflix-1.427
lav-1.427
Token from
Token ID422
Feature activation-2.50e+00

Pos logit contributions
roma+7.044
opsy+6.788
 noses+6.620
verse+6.420
 sugg+6.284
Neg logit contributions
umber-11.727
EStream-11.370
quartered-11.231
esson-10.791
-10.778
Token human
Token ID1692
Feature activation-8.34e+00

Pos logit contributions
roma+1.614
opsy+1.595
verse+1.473
 noses+1.459
 manufact+1.402
Neg logit contributions
umber-1.846
quartered-1.742
esson-1.651
netflix-1.585
lav-1.581
Token rights
Token ID2489
Feature activation-2.77e+00

Pos logit contributions
opsy+3.910
roma+3.844
 noses+3.632
verse+3.520
 manufact+3.316
Neg logit contributions
umber-7.233
quartered-7.030
esson-6.662
lav-6.492
wark-6.481
Token groups
Token ID2628
Feature activation-2.11e+01

Pos logit contributions
roma+1.389
opsy+1.378
verse+1.239
 noses+1.226
 manufact+1.168
Neg logit contributions
umber-2.449
quartered-2.299
esson-2.212
netflix-2.150
wark-2.142
Token and
Token ID290
Feature activation+1.22e+00

Pos logit contributions
roma+9.068
opsy+8.995
 sugg+8.800
 dictate+8.721
 noses+8.712
Neg logit contributions
EStream-14.771
umber-14.329
quartered-14.107
Redditor-13.812
-13.775
Token witnesses
Token ID11432
Feature activation-8.98e+00

Pos logit contributions
umber+0.861
quartered+0.797
esson+0.755
netflix+0.724
wark+0.722
Neg logit contributions
roma-0.841
opsy-0.830
verse-0.765
 noses-0.750
 manufact-0.733
Token range
Token ID2837
Feature activation-1.72e+00

Pos logit contributions
roma+4.980
opsy+4.860
 noses+4.639
verse+4.576
 sugg+4.416
Neg logit contributions
umber-6.816
quartered-6.496
EStream-6.309
esson-6.174
netflix-6.110
Token from
Token ID422
Feature activation+8.02e-01

Pos logit contributions
roma+1.132
opsy+1.122
verse+1.028
 noses+1.020
 manufact+0.983
Neg logit contributions
umber-1.246
quartered-1.170
esson-1.110
netflix-1.063
wark-1.059
Token several
Token ID1811
Feature activation-2.94e+00

Pos logit contributions
umber+0.552
quartered+0.508
esson+0.481
netflix+0.461
wark+0.460
Neg logit contributions
roma-0.566
opsy-0.558
verse-0.516
 noses-0.506
 manufact-0.495
Token gave
Token ID2921
Feature activation-6.91e+00

Pos logit contributions
roma+1.191
opsy+1.165
verse+1.086
 noses+1.069
 manufact+1.038
Neg logit contributions
umber-1.343
quartered-1.261
esson-1.178
netflix-1.150
wark-1.138
Token police
Token ID1644
Feature activation-2.01e+00

Pos logit contributions
roma+4.154
opsy+4.127
verse+3.900
 noses+3.786
 manufact+3.590
Neg logit contributions
umber-5.146
quartered-4.955
netflix-4.561
esson-4.556
wark-4.506
Token drones
Token ID15382
Feature activation-1.89e+00

Pos logit contributions
roma+1.298
opsy+1.286
verse+1.196
 noses+1.185
 manufact+1.140
Neg logit contributions
umber-1.472
quartered-1.382
esson-1.307
wark-1.264
netflix-1.262
Token his
Token ID465
Feature activation-6.64e+00

Pos logit contributions
roma+1.224
opsy+1.208
verse+1.124
 noses+1.113
 manufact+1.075
Neg logit contributions
umber-1.387
quartered-1.295
esson-1.236
wark-1.187
netflix-1.186
Token off
Token ID572
Feature activation-7.96e+00

Pos logit contributions
roma+3.040
opsy+3.013
 noses+2.795
verse+2.721
 manufact+2.541
Neg logit contributions
umber-5.976
quartered-5.746
netflix-5.306
wark-5.264
Constructed-5.256
Token-
Token ID12
Feature activation-2.10e+01

Pos logit contributions
opsy+4.183
roma+4.182
verse+3.817
 noses+3.814
 manufact+3.535
Neg logit contributions
umber-6.448
quartered-6.255
netflix-5.828
Constructed-5.813
izo-5.761
Tokenthe
Token ID1169
Feature activation-1.06e+01

Pos logit contributions
verse+8.675
opsy+8.667
roma+8.605
Prop+7.887
ose+7.635
Neg logit contributions
quartered-15.124
izo-15.076
umber-14.987
[undecodable bytes]-14.911
rencies-14.477
Token-
Token ID12
Feature activation-1.54e+01

Pos logit contributions
roma+4.818
opsy+4.737
verse+4.676
 noses+4.449
 manufact+4.006
Neg logit contributions
umber-8.898
quartered-8.829
esson-8.160
lav-8.033
netflix-8.033
Tokenc
Token ID66
Feature activation-8.08e+00

Pos logit contributions
roma+7.889
opsy+7.626
verse+7.461
ops+6.940
 noses+6.772
Neg logit contributions
quartered-11.257
umber-11.075
izo-10.645
[undecodable bytes]-10.531
Constructed-10.479
Tokenuff
Token ID1648
Feature activation-1.13e+00

Pos logit contributions
opsy+4.086
roma+3.984
verse+3.460
ops+3.343
 noses+3.239
Neg logit contributions
quartered-6.854
umber-6.510
esson-6.344
-6.255
[undecodable bytes]-6.201
Token endorsement
Token ID17819
Feature activation+5.30e+00

Pos logit contributions
roma+0.721
opsy+0.712
verse+0.656
 noses+0.647
 manufact+0.627
Neg logit contributions
umber-0.855
quartered-0.799
esson-0.755
netflix-0.733
wark-0.729
Token Ranger
Token ID21888
Feature activation+8.46e-01

Pos logit contributions
roma+2.475
opsy+2.443
verse+2.292
 noses+2.270
 manufact+2.200
Neg logit contributions
umber-2.038
quartered-1.897
esson-1.758
netflix-1.719
wark-1.690
Token 6
Token ID718
Feature activation+5.36e+00

Pos logit contributions
umber+0.575
quartered+0.530
esson+0.502
wark+0.479
netflix+0.479
Neg logit contributions
roma-0.604
opsy-0.595
verse-0.551
 noses-0.542
 manufact-0.529
Token lifted
Token ID13663
Feature activation-3.66e+00

Pos logit contributions
umber+3.775
quartered+3.464
esson+3.283
Constructed+3.187
wark+3.178
Neg logit contributions
roma-3.592
opsy-3.513
verse-3.241
 noses-3.177
 manufact-3.100
Token off
Token ID572
Feature activation-7.66e+00

Pos logit contributions
roma+2.105
opsy+2.060
verse+1.915
 noses+1.907
 manufact+1.822
Neg logit contributions
umber-2.946
quartered-2.777
esson-2.647
netflix-2.569
wark-2.535
Token on
Token ID319
Feature activation-5.34e+00

Pos logit contributions
roma+4.514
opsy+4.348
 noses+4.201
verse+4.129
 manufact+3.933
Neg logit contributions
umber-5.775
quartered-5.427
esson-5.233
netflix-5.081
wark-4.987
Token its
Token ID663
Feature activation-2.04e+01

Pos logit contributions
roma+3.480
opsy+3.449
verse+3.196
 noses+3.196
 manufact+3.077
Neg logit contributions
umber-3.812
quartered-3.592
esson-3.384
netflix-3.333
wark-3.247
Token first
Token ID717
Feature activation-1.31e+01

Pos logit contributions
 noses+10.360
 dime+10.039
opsy+10.005
roma+9.965
verse+9.579
Neg logit contributions
umber-13.532
quartered-13.170
rencies-12.742
esson-12.532
netflix-12.440
Token attempt
Token ID2230
Feature activation-4.97e+00

Pos logit contributions
opsy+7.165
roma+7.104
 noses+6.840
verse+6.772
 dime+6.641
Neg logit contributions
umber-9.706
quartered-9.165
esson-8.921
rencies-8.527
netflix-8.441
Token on
Token ID319
Feature activation-6.77e+00

Pos logit contributions
roma+3.091
opsy+3.057
 noses+2.826
verse+2.803
 manufact+2.688
Neg logit contributions
umber-3.691
quartered-3.475
esson-3.314
netflix-3.199
wark-3.168
Token January
Token ID3269
Feature activation+1.36e+00

Pos logit contributions
roma+4.284
opsy+4.225
 noses+3.923
verse+3.915
 manufact+3.780
Neg logit contributions
umber-4.866
quartered-4.629
esson-4.335
netflix-4.280
wark-4.189
Token 30
Token ID1542
Feature activation+4.71e-02

Pos logit contributions
umber+0.921
quartered+0.841
esson+0.804
netflix+0.767
wark+0.765
Neg logit contributions
roma-0.980
opsy-0.969
verse-0.889
 noses-0.880
 manufact-0.862
Token
Token ID198
Feature activation-1.40e+00

Pos logit contributions
umber+0.671
quartered+0.617
esson+0.589
netflix+0.564
wark+0.562
Neg logit contributions
roma-0.680
opsy-0.672
verse-0.618
 noses-0.611
 manufact-0.597
Token
Token ID198
Feature activation+1.91e+00

Pos logit contributions
roma+0.921
opsy+0.908
verse+0.854
 noses+0.823
 manufact+0.796
Neg logit contributions
umber-1.018
quartered-0.972
esson-0.902
netflix-0.868
lav-0.864
TokenDo
Token ID5211
Feature activation+3.63e-02

Pos logit contributions
umber+1.431
quartered+1.315
esson+1.264
netflix+1.225
Constructed+1.223
Neg logit contributions
roma-1.218
opsy-1.202
 noses-1.093
verse-1.085
 manufact-1.061
Token not
Token ID407
Feature activation-2.56e-01

Pos logit contributions
umber+0.025
quartered+0.023
esson+0.022
netflix+0.021
wark+0.021
Neg logit contributions
roma-0.025
opsy-0.025
verse-0.023
 noses-0.023
 manufact-0.022
Token use
Token ID779
Feature activation-7.30e+00

Pos logit contributions
roma+0.179
opsy+0.176
verse+0.164
 noses+0.161
 manufact+0.157
Neg logit contributions
umber-0.177
quartered-0.164
esson-0.155
netflix-0.149
wark-0.149
Token social
Token ID1919
Feature activation-2.04e+01

Pos logit contributions
roma+4.228
opsy+4.148
verse+4.019
 noses+3.852
 manufact+3.702
Neg logit contributions
umber-5.635
quartered-5.404
esson-4.988
lav-4.861
wark-4.832
Token media
Token ID2056
Feature activation-6.14e+00

Pos logit contributions
verse+6.997
 noses+5.675
opsy+5.576
roma+5.526
 atmosp+5.408
Neg logit contributions
quartered-18.622
umber-18.019
[undecodable bytes]-16.454
EStream-16.450
lav-16.423
Token to
Token ID284
Feature activation-2.32e+00

Pos logit contributions
roma+3.636
opsy+3.588
verse+3.380
 noses+3.285
 manufact+3.134
Neg logit contributions
umber-4.736
quartered-4.466
esson-4.202
netflix-4.107
Constructed-4.058
Token set
Token ID900
Feature activation-4.99e+00

Pos logit contributions
roma+1.532
opsy+1.506
verse+1.408
 noses+1.377
 manufact+1.341
Neg logit contributions
umber-1.682
quartered-1.582
esson-1.491
netflix-1.442
lav-1.431
Token a
Token ID257
Feature activation+2.03e+00

Pos logit contributions
roma+2.930
opsy+2.899
 noses+2.780
verse+2.744
 manufact+2.648
Neg logit contributions
umber-3.868
quartered-3.665
esson-3.457
wark-3.361
netflix-3.350
Token bad
Token ID2089
Feature activation-7.34e+00

Pos logit contributions
umber+1.453
quartered+1.344
esson+1.273
netflix+1.236
lav+1.225
Neg logit contributions
roma-1.361
opsy-1.346
verse-1.218
 noses-1.203
 manufact-1.171
Token
Token ID447
Feature activation+1.53e+00

Pos logit contributions
roma+2.224
opsy+2.183
verse+2.024
 noses+2.007
 manufact+1.940
Neg logit contributions
umber-2.567
quartered-2.415
esson-2.263
netflix-2.218
wark-2.198
Token
Token ID247
Feature activation-8.31e+00

Pos logit contributions
umber+0.700
quartered+0.615
esson+0.568
wark+0.527
lav+0.526
Neg logit contributions
roma-1.419
opsy-1.408
verse-1.329
 noses-1.313
 manufact-1.295
Token simplistic
Token ID35010
Feature activation-1.81e-01

Pos logit contributions
roma+4.943
opsy+4.786
verse+4.601
 noses+4.352
 manufact+4.300
Neg logit contributions
umber-6.171
quartered-5.830
netflix-5.447
wark-5.409
esson-5.406
Token approach
Token ID3164
Feature activation+6.11e-01

Pos logit contributions
roma+0.117
opsy+0.115
verse+0.106
 noses+0.104
 manufact+0.101
Neg logit contributions
umber-0.135
quartered-0.126
esson-0.120
netflix-0.115
wark-0.115
Token to
Token ID284
Feature activation-2.30e+00

Pos logit contributions
umber+0.436
quartered+0.404
esson+0.384
netflix+0.368
wark+0.367
Neg logit contributions
roma-0.415
opsy-0.409
verse-0.377
 noses-0.371
 manufact-0.362
Token social
Token ID1919
Feature activation-2.00e+01

Pos logit contributions
roma+1.462
opsy+1.443
verse+1.343
 noses+1.317
 manufact+1.280
Neg logit contributions
umber-1.728
quartered-1.611
esson-1.529
netflix-1.492
lav-1.478
Token commentary
Token ID14604
Feature activation-3.33e+00

Pos logit contributions
opsy+9.412
verse+9.274
roma+9.044
 atmosp+8.710
 propri+8.636
Neg logit contributions
umber-14.326
quartered-13.857
netflix-12.992
lav-12.965
wark-12.640
Token.
Token ID13
Feature activation+1.24e+00

Pos logit contributions
roma+2.116
opsy+2.093
verse+1.940
 noses+1.911
 manufact+1.846
Neg logit contributions
umber-2.461
quartered-2.321
esson-2.167
netflix-2.133
lav-2.114
Token
Token ID198
Feature activation-6.88e-01

Pos logit contributions
umber+0.861
quartered+0.788
esson+0.758
wark+0.724
netflix+0.721
Neg logit contributions
roma-0.868
opsy-0.857
verse-0.784
 noses-0.780
 manufact-0.762
Token
Token ID198
Feature activation+9.47e-01

Pos logit contributions
roma+0.464
opsy+0.458
verse+0.426
 noses+0.416
 manufact+0.404
Neg logit contributions
umber-0.491
quartered-0.462
esson-0.433
netflix-0.417
wark-0.415
TokenOn
Token ID2202
Feature activation+4.27e+00

Pos logit contributions
umber+0.696
quartered+0.645
esson+0.617
netflix+0.591
Constructed+0.590
Neg logit contributions
roma-0.621
opsy-0.613
verse-0.560
 noses-0.558
 manufact-0.542
Token
Token ID198
Feature activation-7.33e-01

Pos logit contributions
roma+1.106
opsy+1.093
verse+1.010
 noses+0.988
 manufact+0.961
Neg logit contributions
umber-1.197
quartered-1.125
esson-1.057
netflix-1.019
wark-1.015
Token
Token ID198
Feature activation+1.76e+00

Pos logit contributions
roma+0.494
opsy+0.487
verse+0.454
 noses+0.442
 manufact+0.429
Neg logit contributions
umber-0.525
quartered-0.494
esson-0.463
netflix-0.445
wark-0.443
TokenDo
Token ID5211
Feature activation+3.90e-01

Pos logit contributions
umber+1.334
quartered+1.230
esson+1.181
netflix+1.142
Constructed+1.138
Neg logit contributions
roma-1.112
opsy-1.095
 noses-0.995
verse-0.994
 manufact-0.967
Token not
Token ID407
Feature activation-1.00e-01

Pos logit contributions
umber+0.270
quartered+0.250
esson+0.237
netflix+0.227
wark+0.226
Neg logit contributions
roma-0.273
opsy-0.269
verse-0.248
 noses-0.245
 manufact-0.239
Token use
Token ID779
Feature activation-7.36e+00

Pos logit contributions
roma+0.070
opsy+0.069
verse+0.064
 noses+0.063
 manufact+0.061
Neg logit contributions
umber-0.069
quartered-0.064
esson-0.061
netflix-0.058
wark-0.058
Token social
Token ID1919
Feature activation-1.99e+01

Pos logit contributions
roma+4.192
opsy+4.125
verse+3.996
 noses+3.837
 manufact+3.685
Neg logit contributions
umber-5.726
quartered-5.516
esson-5.067
lav-4.954
wark-4.929
Token media
Token ID2056
Feature activation-5.58e+00

Pos logit contributions
verse+6.938
opsy+5.662
 noses+5.646
roma+5.558
 atmosp+5.442
Neg logit contributions
quartered-18.157
umber-17.614
EStream-16.038
[undecodable bytes]-16.000
lav-15.971
Token to
Token ID284
Feature activation-1.82e+00

Pos logit contributions
roma+3.351
opsy+3.299
verse+3.103
 noses+3.038
 manufact+2.894
Neg logit contributions
umber-4.290
quartered-4.056
esson-3.806
netflix-3.718
Constructed-3.675
Token...
Token ID2644
Feature activation-8.22e+00

Pos logit contributions
roma+1.220
opsy+1.201
verse+1.120
 noses+1.098
 manufact+1.069
Neg logit contributions
umber-1.309
quartered-1.226
esson-1.157
netflix-1.117
lav-1.108
Token actually
Token ID1682
Feature activation-5.85e+00

Pos logit contributions
roma+5.315
opsy+5.221
verse+4.915
 noses+4.775
 manufact+4.663
Neg logit contributions
umber-5.742
quartered-5.455
esson-5.010
netflix-4.927
-4.912
Token,
Token ID11
Feature activation-3.42e+00

Pos logit contributions
roma+3.661
opsy+3.605
verse+3.409
 noses+3.333
 manufact+3.252
Neg logit contributions
umber-4.331
quartered-4.141
esson-3.835
lav-3.725
-3.724
Token
Token ID198
Feature activation-2.92e+00

Pos logit contributions
roma+1.030
opsy+1.017
verse+0.940
 noses+0.920
 manufact+0.894
Neg logit contributions
umber-1.111
quartered-1.043
esson-0.982
netflix-0.947
lav-0.939
Token
Token ID198
Feature activation+2.56e-01

Pos logit contributions
roma+1.827
opsy+1.797
verse+1.723
 noses+1.619
iq+1.568
Neg logit contributions
umber-2.199
quartered-2.160
esson-1.964
netflix-1.896
lav-1.891
TokenDo
Token ID5211
Feature activation+8.53e-01

Pos logit contributions
umber+0.194
quartered+0.180
esson+0.172
netflix+0.165
wark+0.165
Neg logit contributions
roma-0.163
opsy-0.160
verse-0.147
 noses-0.145
 manufact-0.141
Token not
Token ID407
Feature activation-2.64e-01

Pos logit contributions
umber+0.587
quartered+0.542
esson+0.515
netflix+0.493
wark+0.490
Neg logit contributions
roma-0.600
opsy-0.593
verse-0.546
 noses-0.539
 manufact-0.526
Token use
Token ID779
Feature activation-7.23e+00

Pos logit contributions
roma+0.182
opsy+0.179
verse+0.166
 noses+0.163
 manufact+0.159
Neg logit contributions
umber-0.186
quartered-0.173
esson-0.164
netflix-0.157
wark-0.157
Token social
Token ID1919
Feature activation-1.97e+01

Pos logit contributions
roma+4.224
opsy+4.151
verse+4.012
 noses+3.859
 manufact+3.702
Neg logit contributions
umber-5.540
quartered-5.314
esson-4.899
lav-4.759
wark-4.741
Token media
Token ID2056
Feature activation-4.44e+00

Pos logit contributions
verse+6.861
opsy+5.692
 noses+5.675
roma+5.557
 atmosp+5.410
Neg logit contributions
quartered-17.937
umber-17.525
EStream-15.879
lav-15.840
[undecodable bytes]-15.786
Token to
Token ID284
Feature activation-2.01e+00

Pos logit contributions
roma+2.741
opsy+2.699
verse+2.540
 noses+2.481
 manufact+2.380
Neg logit contributions
umber-3.378
quartered-3.174
esson-2.994
netflix-2.919
Constructed-2.875
Token constantly
Token ID7558
Feature activation-3.16e+00

Pos logit contributions
roma+1.343
opsy+1.322
verse+1.234
 noses+1.209
 manufact+1.176
Neg logit contributions
umber-1.448
quartered-1.358
esson-1.283
netflix-1.238
lav-1.229
Token post
Token ID1281
Feature activation+2.09e+00

Pos logit contributions
roma+2.005
opsy+1.975
verse+1.858
 noses+1.819
 manufact+1.759
Neg logit contributions
umber-2.351
quartered-2.240
esson-2.102
netflix-2.036
-2.025
Token photos
Token ID5205
Feature activation+4.34e+00

Pos logit contributions
umber+1.485
quartered+1.371
esson+1.310
netflix+1.255
lav+1.251
Neg logit contributions
roma-1.417
opsy-1.403
verse-1.270
 noses-1.254
 manufact-1.222
Token multiple
Token ID3294
Feature activation-2.77e+00

Pos logit contributions
roma+2.746
opsy+2.721
verse+2.570
 noses+2.548
 manufact+2.415
Neg logit contributions
umber-3.400
quartered-3.227
esson-3.011
wark-2.931
netflix-2.918
Token fractures
Token ID39381
Feature activation-3.85e+00

Pos logit contributions
roma+1.754
opsy+1.738
verse+1.636
 noses+1.627
 manufact+1.558
Neg logit contributions
umber-2.073
quartered-1.950
esson-1.844
wark-1.785
netflix-1.783
Token and
Token ID290
Feature activation-1.93e+00

Pos logit contributions
roma+2.405
opsy+2.379
verse+2.234
 noses+2.220
 manufact+2.117
Neg logit contributions
umber-2.874
quartered-2.710
esson-2.572
netflix-2.469
wark-2.466
Token a
Token ID257
Feature activation-6.07e+00

Pos logit contributions
roma+1.267
opsy+1.253
verse+1.169
 noses+1.161
 manufact+1.119
Neg logit contributions
umber-1.414
quartered-1.318
esson-1.247
netflix-1.207
wark-1.202
Token partially
Token ID12387
Feature activation+3.57e-01

Pos logit contributions
roma+3.817
opsy+3.688
verse+3.563
 noses+3.450
 manufact+3.314
Neg logit contributions
umber-4.530
quartered-4.288
esson-4.048
wark-3.941
netflix-3.863
Token severed
Token ID32160
Feature activation-1.97e+01

Pos logit contributions
umber+0.247
quartered+0.228
esson+0.216
netflix+0.207
wark+0.206
Neg logit contributions
roma-0.250
opsy-0.246
verse-0.227
 noses-0.224
 manufact-0.218
Token right
Token ID826
Feature activation-1.34e+01

Pos logit contributions
 noses+8.807
verse+8.600
roma+8.150
opsy+7.939
 manufact+7.884
Neg logit contributions
quartered-14.483
umber-14.315
esson-13.740
Redditor-13.470
EStream-13.373
Token hand
Token ID1021
Feature activation+2.84e+00

Pos logit contributions
verse+6.954
opsy+6.931
roma+6.901
 noses+6.751
 manufact+6.511
Neg logit contributions
umber-9.644
quartered-9.589
esson-9.195
rencies-8.796
-8.721
Token in
Token ID287
Feature activation+2.60e+00

Pos logit contributions
umber+1.945
quartered+1.788
esson+1.698
lav+1.632
wark+1.622
Neg logit contributions
roma-2.001
opsy-1.967
verse-1.800
 noses-1.787
 manufact-1.741
Token a
Token ID257
Feature activation-9.97e+00

Pos logit contributions
umber+1.775
quartered+1.626
esson+1.552
wark+1.482
lav+1.481
Neg logit contributions
roma-1.841
opsy-1.808
verse-1.657
 noses-1.635
 manufact-1.601
Token crash
Token ID7014
Feature activation-3.99e+00

Pos logit contributions
roma+5.912
opsy+5.841
verse+5.670
 noses+5.616
 manufact+5.386
Neg logit contributions
umber-7.287
quartered-7.114
esson-6.637
wark-6.340
netflix-6.305
Token:
Token ID25
Feature activation+3.71e+00

Pos logit contributions
roma+0.867
opsy+0.852
verse+0.792
 noses+0.778
 manufact+0.752
Neg logit contributions
umber-0.931
quartered-0.871
esson-0.819
netflix-0.790
lav-0.785
Token Sherman
Token ID23198
Feature activation+4.12e+00

Pos logit contributions
umber+2.614
quartered+2.367
esson+2.293
wark+2.206
Constructed+2.173
Neg logit contributions
roma-2.527
opsy-2.508
verse-2.262
 noses-2.247
 manufact-2.203
Token suffered
Token ID6989
Feature activation-3.54e-02

Pos logit contributions
umber+2.794
quartered+2.533
esson+2.443
wark+2.326
izo+2.324
Neg logit contributions
roma-2.910
opsy-2.897
verse-2.590
 noses-2.549
 manufact-2.515
Token a
Token ID257
Feature activation-4.76e+00

Pos logit contributions
roma+0.024
opsy+0.024
verse+0.022
 noses+0.022
 manufact+0.021
Neg logit contributions
umber-0.025
quartered-0.023
esson-0.022
netflix-0.021
wark-0.021
Token rupt
Token ID31580
Feature activation+4.96e+00

Pos logit contributions
roma+3.081
opsy+2.996
verse+2.894
 noses+2.821
 manufact+2.693
Neg logit contributions
umber-3.505
quartered-3.303
esson-3.119
wark-3.017
netflix-2.974
Tokenured
Token ID1522
Feature activation-1.96e+01

Pos logit contributions
umber+3.125
quartered+2.623
esson+2.557
lav+2.508
wark+2.470
Neg logit contributions
roma-3.770
opsy-3.673
 noses-3.569
 manufact-3.490
verse-3.452
Token Achilles
Token ID39561
Feature activation-4.17e+00

Pos logit contributions
opsy+8.235
verse+8.186
 noses+8.144
roma+7.754
 dime+7.715
Neg logit contributions
esson-14.346
quartered-14.178
umber-14.069
Redditor-13.358
rencies-13.332
Token tendon
Token ID42205
Feature activation+4.16e+00

Pos logit contributions
roma+2.137
opsy+2.126
verse+1.937
 noses+1.870
 manufact+1.801
Neg logit contributions
umber-3.594
quartered-3.461
esson-3.278
netflix-3.207
-3.152
Token in
Token ID287
Feature activation+4.95e+00

Pos logit contributions
umber+2.770
quartered+2.490
esson+2.398
wark+2.305
lav+2.299
Neg logit contributions
roma-3.009
opsy-2.964
verse-2.682
 noses-2.661
 manufact-2.627
Token Thursday
Token ID3635
Feature activation-7.12e+00

Pos logit contributions
umber+3.487
quartered+3.207
esson+3.032
wark+2.948
Constructed+2.909
Neg logit contributions
roma-3.334
opsy-3.313
 noses-2.955
verse-2.933
 manufact-2.893
Token's
Token ID338
Feature activation+5.81e+00

Pos logit contributions
roma+3.849
opsy+3.812
verse+3.601
 noses+3.507
 manufact+3.290
Neg logit contributions
umber-5.706
quartered-5.516
esson-5.188
netflix-5.138
lav-5.005
Token in
Token ID287
Feature activation-6.72e+00

Pos logit contributions
roma+1.737
opsy+1.710
verse+1.602
 noses+1.551
 manufact+1.524
Neg logit contributions
umber-1.907
quartered-1.768
esson-1.693
netflix-1.629
lav-1.608
Token Mountain
Token ID9638
Feature activation-8.16e+00

Pos logit contributions
opsy+4.136
roma+4.086
verse+3.892
 manufact+3.724
 noses+3.717
Neg logit contributions
umber-4.922
quartered-4.610
esson-4.585
netflix-4.314
izo-4.258
Token View
Token ID3582
Feature activation-3.06e+00

Pos logit contributions
opsy+4.350
roma+4.310
verse+4.096
iq+3.731
 noses+3.730
Neg logit contributions
umber-6.595
quartered-6.394
esson-6.194
netflix-5.924
-5.874
Token.
Token ID13
Feature activation-1.06e+01

Pos logit contributions
roma+1.960
opsy+1.938
verse+1.798
 noses+1.745
 manufact+1.701
Neg logit contributions
umber-2.278
quartered-2.116
esson-2.025
netflix-1.941
lav-1.929
Token (
Token ID357
Feature activation-8.42e+00

Pos logit contributions
roma+6.170
opsy+5.893
verse+5.889
 Aut+5.373
Prop+5.290
Neg logit contributions
umber-7.882
quartered-7.617
esson-6.922
lav-6.711
netflix-6.665
TokenJustin
Token ID33229
Feature activation-1.96e+01

Pos logit contributions
roma+4.979
opsy+4.778
verse+4.679
Prop+4.268
iq+4.209
Neg logit contributions
umber-6.378
quartered-6.008
esson-5.649
netflix-5.495
wark-5.489
Token Sullivan
Token ID18501
Feature activation+4.37e+00

Pos logit contributions
roma+7.657
opsy+7.497
verse+7.370
 Aut+7.015
iq+6.901
Neg logit contributions
umber-15.734
quartered-15.493
netflix-14.872
esson-14.731
wark-14.408
Token/
Token ID14
Feature activation-6.56e+00

Pos logit contributions
umber+2.642
quartered+2.447
esson+2.317
wark+2.219
+2.205
Neg logit contributions
roma-3.434
opsy-3.341
 noses-3.093
verse-3.074
 manufact-2.980
TokenGetty
Token ID6633
Feature activation-8.50e-01

Pos logit contributions
roma+4.517
opsy+4.502
verse+4.293
iq+4.082
 noses+3.967
Neg logit contributions
umber-4.374
quartered-4.108
esson-3.987
lav-3.754
netflix-3.703
Token Images
Token ID5382
Feature activation-3.57e+00

Pos logit contributions
roma+0.551
opsy+0.544
verse+0.502
 noses+0.491
 manufact+0.477
Neg logit contributions
umber-0.631
quartered-0.588
esson-0.558
netflix-0.538
lav-0.535
Token)
Token ID8
Feature activation-1.74e+00

Pos logit contributions
roma+2.179
opsy+2.163
verse+2.008
 noses+1.923
 manufact+1.886
Neg logit contributions
umber-2.729
quartered-2.572
esson-2.413
netflix-2.344
lav-2.333
Token,
Token ID11
Feature activation+4.88e-01

Pos logit contributions
roma+0.966
opsy+0.951
verse+0.887
 noses+0.857
 manufact+0.835
Neg logit contributions
umber-1.061
quartered-0.997
esson-0.934
lav-0.898
netflix-0.898
Token Facebook
Token ID3203
Feature activation-3.31e+00

Pos logit contributions
umber+0.351
quartered+0.326
esson+0.310
netflix+0.297
wark+0.296
Neg logit contributions
roma-0.328
opsy-0.324
verse-0.298
 noses-0.294
 manufact-0.286
Token,
Token ID11
Feature activation+2.64e+00

Pos logit contributions
roma+2.236
opsy+2.213
verse+2.070
 noses+2.005
 manufact+1.968
Neg logit contributions
umber-2.337
quartered-2.184
esson-2.047
netflix-1.976
wark-1.964
Token the
Token ID262
Feature activation-3.74e+00

Pos logit contributions
umber+1.770
quartered+1.619
esson+1.550
netflix+1.485
wark+1.466
Neg logit contributions
roma-1.922
opsy-1.901
verse-1.735
 noses-1.727
 manufact-1.679
Token dominant
Token ID11410
Feature activation-5.51e+00

Pos logit contributions
roma+2.453
opsy+2.409
verse+2.311
 noses+2.228
 manufact+2.186
Neg logit contributions
umber-2.698
quartered-2.516
esson-2.405
lav-2.284
Constructed-2.283
Token social
Token ID1919
Feature activation-1.96e+01

Pos logit contributions
roma+3.526
opsy+3.486
verse+3.457
 manufact+3.282
 noses+3.252
Neg logit contributions
umber-3.903
quartered-3.654
esson-3.515
lav-3.386
netflix-3.347
Token network
Token ID3127
Feature activation-3.43e+00

Pos logit contributions
verse+7.455
opsy+6.692
roma+6.604
 manufact+6.335
 dime+6.135
Neg logit contributions
umber-16.189
quartered-15.546
esson-15.137
lav-15.080
EStream-14.991
Token with
Token ID351
Feature activation+5.23e+00

Pos logit contributions
roma+2.208
opsy+2.176
verse+2.045
 noses+1.975
 manufact+1.951
Neg logit contributions
umber-2.536
quartered-2.324
esson-2.221
netflix-2.165
lav-2.162
Token more
Token ID517
Feature activation-1.49e+00

Pos logit contributions
umber+3.373
quartered+3.120
esson+2.887
netflix+2.803
lav+2.788
Neg logit contributions
roma-3.829
opsy-3.797
verse-3.400
 noses-3.387
 manufact-3.280
Token than
Token ID621
Feature activation-1.93e+00

Pos logit contributions
roma+1.092
opsy+1.077
verse+1.016
 noses+1.000
 manufact+0.975
Neg logit contributions
umber-0.970
quartered-0.896
esson-0.851
netflix-0.810
wark-0.809
Token 500
Token ID5323
Feature activation+1.30e+00

Pos logit contributions
roma+1.282
opsy+1.264
verse+1.184
 noses+1.159
 manufact+1.127
Neg logit contributions
umber-1.394
quartered-1.299
esson-1.239
wark-1.185
netflix-1.180
Token,
Token ID11
Feature activation-4.00e+00

Pos logit contributions
roma+3.244
opsy+3.191
verse+2.918
 noses+2.841
 manufact+2.749
Neg logit contributions
umber-3.885
quartered-3.650
esson-3.455
netflix-3.399
Constructed-3.344
Token Calif
Token ID3192
Feature activation-4.72e+00

Pos logit contributions
roma+2.399
opsy+2.361
verse+2.175
 noses+2.113
 manufact+2.054
Neg logit contributions
umber-3.115
quartered-2.953
esson-2.794
netflix-2.726
lav-2.689
Token.
Token ID13
Feature activation-2.68e+00

Pos logit contributions
roma+2.923
opsy+2.878
verse+2.652
 noses+2.562
 manufact+2.498
Neg logit contributions
umber-3.541
quartered-3.361
esson-3.148
netflix-3.080
Constructed-3.055
Token Photo
Token ID5555
Feature activation+5.69e+00

Pos logit contributions
roma+1.740
opsy+1.716
verse+1.593
 noses+1.544
 manufact+1.493
Neg logit contributions
umber-1.977
quartered-1.858
esson-1.752
netflix-1.696
lav-1.690
Token:
Token ID25
Feature activation-4.15e+00

Pos logit contributions
umber+3.598
quartered+3.309
esson+3.134
lav+2.983
wark+2.973
Neg logit contributions
roma-4.158
opsy-4.093
 noses-3.836
verse-3.796
 manufact-3.737
Token Justin
Token ID10799
Feature activation-1.95e+01

Pos logit contributions
roma+2.673
opsy+2.622
verse+2.449
 noses+2.365
 manufact+2.302
Neg logit contributions
umber-3.063
quartered-2.883
esson-2.712
netflix-2.617
lav-2.596
Token Sullivan
Token ID18501
Feature activation+3.43e+00

Pos logit contributions
opsy+6.999
verse+6.914
roma+6.858
 Aut+6.641
 Miranda+6.544
Neg logit contributions
umber-16.419
quartered-16.180
netflix-15.792
wark-15.310
esson-15.011
Token/
Token ID14
Feature activation-6.71e+00

Pos logit contributions
umber+2.188
quartered+2.013
esson+1.928
wark+1.846
+1.835
Neg logit contributions
roma-2.585
opsy-2.532
 noses-2.329
verse-2.326
 manufact-2.259
TokenGetty
Token ID6633
Feature activation+1.02e+00

Pos logit contributions
roma+4.653
opsy+4.550
verse+4.392
iq+4.099
 noses+4.055
Neg logit contributions
umber-4.509
quartered-4.229
esson-4.075
netflix-3.778
lav-3.774
Token Images
Token ID5382
Feature activation-1.02e+00

Pos logit contributions
umber+0.729
quartered+0.676
esson+0.645
netflix+0.618
wark+0.616
Neg logit contributions
roma-0.687
opsy-0.678
verse-0.623
 noses-0.616
 manufact-0.599
Token
Token ID198
Feature activation-4.65e+00

Pos logit contributions
roma+0.691
opsy+0.682
verse+0.631
 noses+0.617
 manufact+0.602
Neg logit contributions
umber-0.733
quartered-0.684
esson-0.645
netflix-0.621
lav-0.618
Token week
Token ID1285
Feature activation-5.09e-01

Pos logit contributions
umber+4.422
quartered+4.074
esson+3.997
lav+3.816
Constructed+3.782
Neg logit contributions
roma-3.897
opsy-3.801
verse-3.410
 noses-3.346
 manufact-3.294
Token after
Token ID706
Feature activation+5.21e+00

Pos logit contributions
roma+0.341
opsy+0.336
verse+0.311
 noses+0.306
 manufact+0.297
Neg logit contributions
umber-0.366
quartered-0.341
esson-0.323
netflix-0.311
wark-0.309
Token recovering
Token ID20222
Feature activation-3.31e+00

Pos logit contributions
umber+3.289
quartered+2.972
esson+2.873
lav+2.732
wark+2.730
Neg logit contributions
roma-3.872
opsy-3.826
verse-3.487
 noses-3.404
 manufact-3.394
Token from
Token ID422
Feature activation+3.58e+00

Pos logit contributions
roma+2.137
opsy+2.103
verse+1.957
 noses+1.937
 manufact+1.855
Neg logit contributions
umber-2.430
quartered-2.288
esson-2.160
netflix-2.096
wark-2.069
Token a
Token ID257
Feature activation-6.30e+00

Pos logit contributions
umber+2.527
quartered+2.317
esson+2.250
lav+2.158
netflix+2.124
Neg logit contributions
roma-2.426
opsy-2.384
verse-2.170
 noses-2.121
 manufact-2.086
Token broken
Token ID5445
Feature activation-1.94e+01

Pos logit contributions
roma+3.921
opsy+3.850
verse+3.690
 noses+3.623
 manufact+3.451
Neg logit contributions
umber-4.698
quartered-4.483
esson-4.137
wark-4.045
netflix-4.005
Token leg
Token ID1232
Feature activation-5.20e-01

Pos logit contributions
 noses+9.300
verse+8.771
opsy+8.449
roma+7.948
 manufact+7.784
Neg logit contributions
umber-13.634
quartered-13.417
Redditor-12.709
esson-12.685
EStream-12.669
Token in
Token ID287
Feature activation-3.96e-01

Pos logit contributions
roma+0.352
opsy+0.348
verse+0.322
 noses+0.316
 manufact+0.308
Neg logit contributions
umber-0.371
quartered-0.345
esson-0.327
netflix-0.315
wark-0.313
Token round
Token ID2835
Feature activation+1.95e+00

Pos logit contributions
roma+0.260
opsy+0.256
verse+0.236
 noses+0.232
 manufact+0.226
Neg logit contributions
umber-0.291
quartered-0.271
esson-0.257
netflix-0.248
wark-0.247
Token 18
Token ID1248
Feature activation+6.98e+00

Pos logit contributions
umber+1.299
quartered+1.186
esson+1.130
wark+1.071
netflix+1.069
Neg logit contributions
roma-1.426
opsy-1.397
verse-1.289
 noses-1.276
 manufact-1.259
Token last
Token ID938
Feature activation+2.43e+00

Pos logit contributions
umber+4.553
quartered+4.064
esson+3.917
wark+3.753
lav+3.707
Neg logit contributions
roma-5.102
opsy-4.973
verse-4.651
 manufact-4.459
 noses-4.444
Token
Token ID198
Feature activation+1.96e+00

Pos logit contributions
umber+1.935
quartered+1.735
esson+1.694
netflix+1.614
wark+1.608
Neg logit contributions
roma-2.122
opsy-2.100
 noses-1.906
verse-1.902
 manufact-1.868
Token
Token ID198
Feature activation+1.05e+00

Pos logit contributions
umber+1.295
quartered+1.145
esson+1.123
Constructed+1.077
netflix+1.073
Neg logit contributions
roma-1.432
opsy-1.417
 noses-1.296
verse-1.278
 manufact-1.273
TokenDo
Token ID5211
Feature activation+5.69e-01

Pos logit contributions
umber+0.790
quartered+0.731
esson+0.700
netflix+0.676
Constructed+0.672
Neg logit contributions
roma-0.673
opsy-0.664
verse-0.603
 noses-0.602
 manufact-0.584
Token not
Token ID407
Feature activation+1.11e-01

Pos logit contributions
umber+0.393
quartered+0.364
esson+0.345
netflix+0.330
wark+0.329
Neg logit contributions
roma-0.399
opsy-0.394
verse-0.363
 noses-0.359
 manufact-0.350
Token use
Token ID779
Feature activation-6.05e+00

Pos logit contributions
umber+0.078
quartered+0.072
esson+0.068
netflix+0.065
wark+0.065
Neg logit contributions
roma-0.077
opsy-0.076
verse-0.070
 noses-0.069
 manufact-0.068
Token social
Token ID1919
Feature activation-1.94e+01

Pos logit contributions
roma+3.671
opsy+3.611
verse+3.468
 noses+3.342
 manufact+3.221
Neg logit contributions
umber-4.577
quartered-4.352
esson-4.030
lav-3.916
wark-3.902
Token media
Token ID2056
Feature activation-4.82e+00

Pos logit contributions
verse+6.875
 noses+5.643
opsy+5.561
roma+5.455
 atmosp+5.392
Neg logit contributions
quartered-17.676
umber-17.237
EStream-15.682
lav-15.642
[undecodable bytes]-15.621
Token to
Token ID284
Feature activation-1.17e+00

Pos logit contributions
roma+2.941
opsy+2.903
verse+2.734
 noses+2.663
 manufact+2.554
Neg logit contributions
umber-3.677
quartered-3.454
esson-3.254
netflix-3.183
Constructed-3.133
Token post
Token ID1281
Feature activation+8.04e-01

Pos logit contributions
roma+0.788
opsy+0.777
verse+0.722
 noses+0.708
 manufact+0.690
Neg logit contributions
umber-0.840
quartered-0.785
esson-0.742
netflix-0.715
lav-0.711
Token an
Token ID281
Feature activation+5.40e+00

Pos logit contributions
umber+0.576
quartered+0.535
esson+0.509
netflix+0.488
wark+0.486
Neg logit contributions
roma-0.542
opsy-0.536
verse-0.490
 noses-0.483
 manufact-0.470
Token unfl
Token ID42880
Feature activation+7.74e+00

Pos logit contributions
umber+3.734
quartered+3.423
esson+3.270
Constructed+3.135
lav+3.121
Neg logit contributions
roma-3.782
opsy-3.622
verse-3.340
 noses-3.293
 manufact-3.172
Token
Token ID198
Feature activation+2.44e+00

Pos logit contributions
umber+1.552
quartered+1.408
esson+1.362
netflix+1.306
wark+1.291
Neg logit contributions
roma-1.643
opsy-1.628
verse-1.489
 noses-1.485
 manufact-1.457
Token
Token ID198
Feature activation+1.19e+00

Pos logit contributions
umber+1.594
quartered+1.392
esson+1.378
Constructed+1.326
netflix+1.315
Neg logit contributions
roma-1.812
opsy-1.793
 noses-1.643
 manufact-1.617
verse-1.610
TokenDo
Token ID5211
Feature activation+8.00e-01

Pos logit contributions
umber+0.888
quartered+0.820
esson+0.785
netflix+0.759
Constructed+0.755
Neg logit contributions
roma-0.767
opsy-0.758
verse-0.688
 noses-0.687
 manufact-0.667
Token not
Token ID407
Feature activation+6.56e-01

Pos logit contributions
umber+0.550
quartered+0.509
esson+0.483
netflix+0.462
wark+0.460
Neg logit contributions
roma-0.563
opsy-0.556
verse-0.512
 noses-0.506
 manufact-0.493
Token use
Token ID779
Feature activation-5.82e+00

Pos logit contributions
umber+0.455
quartered+0.420
esson+0.399
netflix+0.382
wark+0.381
Neg logit contributions
roma-0.458
opsy-0.453
verse-0.417
 noses-0.411
 manufact-0.401
Token social
Token ID1919
Feature activation-1.94e+01

Pos logit contributions
roma+3.581
opsy+3.502
verse+3.363
 noses+3.253
 manufact+3.136
Neg logit contributions
umber-4.385
quartered-4.160
esson-3.853
lav-3.739
wark-3.716
Token media
Token ID2056
Feature activation-4.41e+00

Pos logit contributions
verse+6.787
 noses+5.756
opsy+5.711
roma+5.593
 atmosp+5.455
Neg logit contributions
quartered-17.427
umber-17.112
lav-15.499
EStream-15.477
esson-15.413
Token to
Token ID284
Feature activation-4.07e-01

Pos logit contributions
roma+2.726
opsy+2.681
verse+2.530
 noses+2.464
 manufact+2.373
Neg logit contributions
umber-3.343
quartered-3.132
esson-2.960
netflix-2.892
Constructed-2.839
Token let
Token ID1309
Feature activation-9.62e+00

Pos logit contributions
roma+0.277
opsy+0.273
verse+0.253
 noses+0.249
 manufact+0.242
Neg logit contributions
umber-0.290
quartered-0.270
esson-0.255
netflix-0.245
lav-0.244
Token people
Token ID661
Feature activation-7.09e+00

Pos logit contributions
roma+5.522
opsy+5.464
verse+5.373
 noses+5.264
 manufact+5.012
Neg logit contributions
umber-7.203
quartered-6.800
esson-6.516
izo-6.286
netflix-6.247
Token know
Token ID760
Feature activation-6.36e+00

Pos logit contributions
roma+4.132
opsy+4.077
verse+3.890
 noses+3.782
 manufact+3.677
Neg logit contributions
umber-5.435
quartered-5.225
-4.877
esson-4.852
EStream-4.820
Token detailed
Token ID6496
Feature activation-6.58e+00

Pos logit contributions
roma+6.160
verse+5.729
opsy+5.724
 noses+5.716
 manufact+5.644
Neg logit contributions
umber-8.624
quartered-8.146
wark-7.703
esson-7.605
netflix-7.489
Token images
Token ID4263
Feature activation-5.69e-01

Pos logit contributions
roma+4.166
opsy+4.114
 noses+3.926
verse+3.894
 manufact+3.828
Neg logit contributions
umber-4.778
quartered-4.497
esson-4.186
wark-4.149
netflix-4.090
Token of
Token ID286
Feature activation-9.41e+00

Pos logit contributions
roma+0.386
opsy+0.381
verse+0.353
 noses+0.347
 manufact+0.337
Neg logit contributions
umber-0.405
quartered-0.377
esson-0.357
netflix-0.343
wark-0.342
Token the
Token ID262
Feature activation-1.13e+01

Pos logit contributions
roma+5.833
opsy+5.483
 noses+5.280
verse+5.249
 manufact+5.105
Neg logit contributions
umber-6.853
quartered-6.476
esson-6.052
netflix-6.013
wark-5.910
Token comet
Token ID31733
Feature activation-8.17e+00

Pos logit contributions
roma+6.719
opsy+6.275
 noses+6.159
verse+6.075
 manufact+6.019
Neg logit contributions
umber-8.523
quartered-7.896
netflix-7.424
esson-7.405
wark-7.281
Token's
Token ID338
Feature activation-1.94e+01

Pos logit contributions
roma+4.881
opsy+4.612
verse+4.523
 noses+4.421
 manufact+4.261
Neg logit contributions
umber-6.233
quartered-5.929
esson-5.500
netflix-5.352
-5.327
Token surface
Token ID4417
Feature activation-2.34e+00

Pos logit contributions
roma+9.908
 atmosp+9.699
 noses+9.344
verse+9.108
 manufact+8.911
Neg logit contributions
umber-14.532
quartered-13.270
netflix-12.521
wark-12.429
esson-12.403
Token.
Token ID13
Feature activation-3.38e+00

Pos logit contributions
roma+1.511
opsy+1.487
verse+1.382
 noses+1.358
 manufact+1.319
Neg logit contributions
umber-1.741
quartered-1.622
esson-1.537
netflix-1.482
wark-1.475
Token Such
Token ID8013
Feature activation-2.48e+00

Pos logit contributions
roma+2.186
opsy+2.117
verse+1.990
 noses+1.930
 manufact+1.865
Neg logit contributions
umber-2.509
quartered-2.373
esson-2.211
netflix-2.160
lav-2.120
Token images
Token ID4263
Feature activation-7.16e-01

Pos logit contributions
roma+1.740
opsy+1.716
verse+1.607
 noses+1.594
 manufact+1.551
Neg logit contributions
umber-1.707
quartered-1.582
esson-1.490
netflix-1.441
wark-1.435
Token were
Token ID547
Feature activation-4.92e-01

Pos logit contributions
roma+0.478
opsy+0.472
verse+0.435
 noses+0.429
 manufact+0.416
Neg logit contributions
umber-0.518
quartered-0.482
esson-0.457
netflix-0.440
wark-0.438
Token
Token ID198
Feature activation-1.85e+00

Pos logit contributions
umber+1.775
quartered+1.596
esson+1.556
netflix+1.485
wark+1.476
Neg logit contributions
roma-1.888
opsy-1.871
verse-1.702
 noses-1.701
 manufact-1.675
Token
Token ID198
Feature activation+1.74e+00

Pos logit contributions
roma+1.200
opsy+1.182
verse+1.118
 noses+1.069
 manufact+1.030
Neg logit contributions
umber-1.364
quartered-1.314
esson-1.212
netflix-1.167
lav-1.161
TokenDo
Token ID5211
Feature activation-8.73e-01

Pos logit contributions
umber+1.275
quartered+1.170
esson+1.122
netflix+1.084
Constructed+1.081
Neg logit contributions
roma-1.153
opsy-1.134
 noses-1.036
verse-1.034
 manufact-1.009
Token not
Token ID407
Feature activation-1.15e+00

Pos logit contributions
roma+0.599
opsy+0.591
verse+0.548
 noses+0.538
 manufact+0.523
Neg logit contributions
umber-0.615
quartered-0.572
esson-0.541
netflix-0.519
wark-0.519
Token use
Token ID779
Feature activation-7.29e+00

Pos logit contributions
roma+0.784
opsy+0.772
verse+0.719
 noses+0.704
 manufact+0.685
Neg logit contributions
umber-0.810
quartered-0.759
esson-0.714
netflix-0.688
wark-0.684
Token social
Token ID1919
Feature activation-1.92e+01

Pos logit contributions
roma+4.181
opsy+4.108
verse+3.979
 noses+3.818
 manufact+3.671
Neg logit contributions
umber-5.646
quartered-5.435
esson-4.994
lav-4.890
wark-4.871
Token media
Token ID2056
Feature activation-5.89e+00

Pos logit contributions
verse+6.892
opsy+5.615
roma+5.613
 noses+5.612
 atmosp+5.431
Neg logit contributions
quartered-17.525
umber-16.995
lav-15.438
EStream-15.422
��-15.406
Token to
Token ID284
Feature activation-3.32e+00

Pos logit contributions
roma+3.499
opsy+3.446
verse+3.250
 noses+3.178
 manufact+3.019
Neg logit contributions
umber-4.531
quartered-4.288
esson-4.036
netflix-3.932
Constructed-3.881
Token show
Token ID905
Feature activation-6.31e+00

Pos logit contributions
roma+2.135
opsy+2.098
verse+1.967
 noses+1.923
 manufact+1.868
Neg logit contributions
umber-2.446
quartered-2.314
esson-2.180
netflix-2.114
lav-2.095
Token millions
Token ID5242
Feature activation-3.64e+00

Pos logit contributions
opsy+3.748
roma+3.746
 noses+3.567
verse+3.508
 manufact+3.334
Neg logit contributions
umber-4.778
quartered-4.542
esson-4.251
netflix-4.140
Constructed-4.114
Token of
Token ID286
Feature activation-2.85e+00

Pos logit contributions
opsy+2.316
roma+2.308
verse+2.143
 noses+2.132
 manufact+2.013
Neg logit contributions
umber-2.663
quartered-2.523
esson-2.373
Constructed-2.291
wark-2.287
Token of
Token ID286
Feature activation+5.45e+00

Pos logit contributions
umber+5.567
quartered+5.134
esson+4.694
wark+4.428
lav+4.368
Neg logit contributions
roma-7.717
opsy-7.583
 noses-6.934
verse-6.827
 manufact-6.668
Token 20
Token ID1160
Feature activation+1.20e+01

Pos logit contributions
umber+3.634
quartered+3.349
esson+3.167
netflix+3.045
lav+3.031
Neg logit contributions
roma-3.859
opsy-3.830
verse-3.461
 noses-3.412
 manufact-3.355
Token70
Token ID2154
Feature activation+1.17e+01

Pos logit contributions
umber+7.842
quartered+7.231
esson+6.663
lav+6.412
+6.248
Neg logit contributions
opsy-8.171
roma-8.058
verse-6.991
 noses-6.945
enz-6.783
Token689
Token ID40523
Feature activation+8.63e+00

Pos logit contributions
umber+8.362
quartered+7.537
esson+7.232
lav+6.884
wark+6.819
Neg logit contributions
roma-7.441
opsy-7.283
 noses-6.448
 manufact-6.259
verse-6.228
Token Ontario
Token ID10553
Feature activation+4.10e+00

Pos logit contributions
umber+6.048
quartered+5.490
esson+5.248
lav+5.053
wark+4.958
Neg logit contributions
roma-5.766
opsy-5.708
 noses-5.074
verse-5.027
 manufact-4.934
Token Ltd
Token ID12052
Feature activation+1.29e+01

Pos logit contributions
umber+2.941
quartered+2.717
esson+2.552
netflix+2.449
lav+2.446
Neg logit contributions
roma-2.792
opsy-2.739
verse-2.476
 noses-2.469
 manufact-2.387
Token.,
Token ID1539
Feature activation+5.13e+00

Pos logit contributions
umber+7.624
quartered+6.722
esson+6.368
wark+5.974
izo+5.934
Neg logit contributions
opsy-9.565
roma-9.550
verse-8.774
 noses-8.692
 manufact-8.512
Token a
Token ID257
Feature activation+4.00e+00

Pos logit contributions
umber+3.389
quartered+3.126
esson+2.943
netflix+2.829
wark+2.810
Neg logit contributions
roma-3.725
opsy-3.719
verse-3.351
 noses-3.311
 manufact-3.223
Token Toronto
Token ID6586
Feature activation+4.54e+00

Pos logit contributions
umber+2.783
quartered+2.572
esson+2.406
netflix+2.332
wark+2.296
Neg logit contributions
opsy-2.775
roma-2.767
verse-2.462
 noses-2.453
 manufact-2.356
Token-
Token ID12
Feature activation+7.42e+00

Pos logit contributions
umber+3.196
quartered+3.014
esson+2.751
netflix+2.659
Constructed+2.652
Neg logit contributions
roma-3.131
opsy-3.094
verse-2.772
 noses-2.760
 manufact-2.642
Tokenbased
Token ID3106
Feature activation+3.07e+00

Pos logit contributions
umber+5.125
quartered+4.897
lav+4.328
Constructed+4.226
esson+4.205
Neg logit contributions
roma-5.178
opsy-5.060
 noses-4.709
 manufact-4.496
verse-4.408
Token intersection
Token ID16246
Feature activation-3.20e+00

Pos logit contributions
roma+0.676
opsy+0.666
verse+0.616
 noses+0.606
 manufact+0.591
Neg logit contributions
umber-0.732
quartered-0.680
esson-0.648
netflix-0.624
wark-0.619
Token of
Token ID286
Feature activation+1.69e+00

Pos logit contributions
roma+2.062
opsy+2.031
verse+1.905
 noses+1.864
 manufact+1.805
Neg logit contributions
umber-2.363
quartered-2.198
esson-2.106
netflix-2.038
wark-1.999
Token Y
Token ID575
Feature activation-7.48e+00

Pos logit contributions
umber+1.197
quartered+1.102
esson+1.042
wark+1.004
lav+1.001
Neg logit contributions
roma-1.159
opsy-1.141
verse-1.051
 noses-1.031
 manufact-1.005
Tokenonge
Token ID14220
Feature activation+9.16e+00

Pos logit contributions
roma+4.384
opsy+4.326
verse+3.975
iq+3.753
 noses+3.732
Neg logit contributions
umber-5.524
quartered-5.506
esson-5.142
-5.060
netflix-5.018
Token and
Token ID290
Feature activation+1.43e+01

Pos logit contributions
umber+5.781
quartered+4.905
wark+4.780
lav+4.588
esson+4.556
Neg logit contributions
roma-6.902
opsy-6.719
verse-6.142
 noses-5.969
 manufact-5.969
Token E
Token ID412
Feature activation+1.54e+01

Pos logit contributions
umber+8.912
wark+7.144
quartered+7.061
lav+6.903
esson+6.864
Neg logit contributions
roma-10.252
opsy-9.946
verse-9.022
 noses-8.758
enz-8.683
Tokengl
Token ID4743
Feature activation+3.45e+00

Pos logit contributions
umber+7.466
wark+5.555
izo+5.357
esson+5.255
quartered+5.056
Neg logit contributions
opsy-12.900
roma-12.466
 noses-11.919
 manufact-11.877
 sugg-11.645
Tokeninton
Token ID2371
Feature activation+3.17e+00

Pos logit contributions
umber+2.311
quartered+2.021
esson+1.986
lav+1.871
wark+1.862
Neg logit contributions
roma-2.521
opsy-2.484
 noses-2.318
verse-2.294
 manufact-2.234
Token will
Token ID481
Feature activation+7.22e-01

Pos logit contributions
umber+2.312
quartered+2.118
esson+2.004
wark+1.966
lav+1.950
Neg logit contributions
roma-2.108
opsy-2.075
verse-1.901
 noses-1.839
 manufact-1.813
Token feature
Token ID3895
Feature activation+5.86e+00

Pos logit contributions
umber+0.509
quartered+0.471
esson+0.447
wark+0.429
netflix+0.427
Neg logit contributions
roma-0.497
opsy-0.491
verse-0.451
 noses-0.445
 manufact-0.434
Token large
Token ID1588
Feature activation+1.70e+00

Pos logit contributions
umber+3.954
quartered+3.595
esson+3.368
lav+3.305
Constructed+3.275
Neg logit contributions
opsy-4.083
roma-4.058
verse-3.690
 noses-3.642
 manufact-3.471
Token companies
Token ID2706
Feature activation+1.09e-01

Pos logit contributions
opsy+2.052
roma+2.029
verse+1.900
 noses+1.872
 manufact+1.829
Neg logit contributions
umber-2.464
quartered-2.306
esson-2.171
netflix-2.126
wark-2.119
Token pay
Token ID1414
Feature activation+6.17e+00

Pos logit contributions
umber+0.078
quartered+0.073
esson+0.069
netflix+0.066
wark+0.066
Neg logit contributions
roma-0.074
opsy-0.073
verse-0.067
 noses-0.066
 manufact-0.064
Token me
Token ID502
Feature activation+7.92e+00

Pos logit contributions
umber+3.971
quartered+3.647
esson+3.474
lav+3.264
netflix+3.252
Neg logit contributions
roma-4.527
opsy-4.319
verse-3.985
 noses-3.905
 manufact-3.837
Token for
Token ID329
Feature activation+9.87e+00

Pos logit contributions
umber+4.878
quartered+4.445
esson+4.268
lav+4.041
izo+3.961
Neg logit contributions
roma-5.916
opsy-5.716
verse-5.248
 noses-5.109
rique-5.018
Token collecting
Token ID13157
Feature activation+6.64e+00

Pos logit contributions
umber+5.952
quartered+5.272
esson+5.228
lav+4.929
wark+4.804
Neg logit contributions
roma-7.279
opsy-6.887
verse-6.257
rique-6.127
enz-6.052
Token and
Token ID290
Feature activation+1.42e+01

Pos logit contributions
umber+4.389
quartered+3.971
esson+3.877
lav+3.666
netflix+3.618
Neg logit contributions
roma-4.651
opsy-4.517
verse-4.101
 manufact-3.986
rique-3.967
Token arranging
Token ID41878
Feature activation+9.91e+00

Pos logit contributions
umber+7.825
quartered+7.056
esson+6.817
wark+6.240
izo+6.044
Neg logit contributions
roma-10.167
opsy-9.553
rique-9.061
enz-8.724
 noses-8.719
Token transport
Token ID4839
Feature activation+8.14e+00

Pos logit contributions
umber+5.990
quartered+5.435
esson+5.254
lav+4.950
iewicz+4.874
Neg logit contributions
roma-7.155
opsy-6.821
verse-6.280
rique-6.149
 manufact-6.019
Token of
Token ID286
Feature activation+3.27e+00

Pos logit contributions
umber+4.852
quartered+4.335
esson+4.059
lav+3.975
+3.911
Neg logit contributions
roma-6.282
opsy-6.029
verse-5.607
 manufact-5.418
 noses-5.408
Token the
Token ID262
Feature activation+1.02e+01

Pos logit contributions
umber+2.377
quartered+2.205
esson+2.104
lav+2.014
netflix+2.000
Neg logit contributions
roma-2.155
opsy-2.092
verse-1.915
 noses-1.855
 manufact-1.838
Token specimen
Token ID31674
Feature activation+4.10e+00

Pos logit contributions
umber+6.327
quartered+5.897
esson+5.582
lav+5.338
izo+5.280
Neg logit contributions
roma-7.099
opsy-6.711
verse-6.153
rique-5.975
 manufact-5.975
Token character
Token ID2095
Feature activation-1.04e+00

Pos logit contributions
umber+2.398
quartered+2.155
esson+2.031
izo+1.926
+1.924
Neg logit contributions
roma-3.012
opsy-2.975
 noses-2.763
verse-2.752
 manufact-2.664
Token (<
Token ID38155
Feature activation+2.20e+00

Pos logit contributions
roma+0.724
opsy+0.718
verse+0.662
 noses+0.651
 manufact+0.635
Neg logit contributions
umber-0.725
quartered-0.675
esson-0.639
netflix-0.613
wark-0.612
TokenSpace
Token ID14106
Feature activation-1.80e+00

Pos logit contributions
umber+1.386
quartered+1.252
esson+1.172
netflix+1.126
lav+1.119
Neg logit contributions
roma-1.686
opsy-1.663
 noses-1.553
verse-1.540
 manufact-1.512
Token>
Token ID29
Feature activation-7.58e+00

Pos logit contributions
roma+1.081
opsy+1.075
verse+0.988
 noses+0.955
 manufact+0.926
Neg logit contributions
umber-1.411
quartered-1.341
esson-1.259
netflix-1.220
wark-1.214
Token or
Token ID393
Feature activation+9.08e+00

Pos logit contributions
roma+4.254
opsy+4.214
verse+3.838
 noses+3.642
 manufact+3.598
Neg logit contributions
umber-5.865
quartered-5.745
esson-5.367
wark-5.163
netflix-5.148
Token <
Token ID1279
Feature activation+1.49e+01

Pos logit contributions
umber+5.791
quartered+4.975
esson+4.831
Constructed+4.700
izo+4.670
Neg logit contributions
roma-6.727
opsy-6.462
 noses-5.965
 metic-5.894
verse-5.802
TokenTab
Token ID33349
Feature activation+6.17e+00

Pos logit contributions
umber+7.137
Constructed+6.067
Desk+5.581
izo+5.309
netflix+5.296
Neg logit contributions
 metic-12.859
 noses-11.990
roma-11.899
opsy-11.854
 unden-11.739
Token>)
Token ID43734
Feature activation-1.04e+00

Pos logit contributions
umber+3.569
quartered+3.148
esson+3.013
izo+2.887
wark+2.873
Neg logit contributions
roma-4.926
opsy-4.784
 noses-4.496
 manufact-4.476
verse-4.452
Token {
Token ID1391
Feature activation+2.52e+00

Pos logit contributions
roma+0.688
opsy+0.678
verse+0.625
 noses+0.615
 manufact+0.599
Neg logit contributions
umber-0.754
quartered-0.707
esson-0.669
netflix-0.645
wark-0.642
Tokenvi
Token ID8903
Feature activation-1.84e+00

Pos logit contributions
umber+1.597
quartered+1.451
esson+1.376
lav+1.317
netflix+1.310
Neg logit contributions
roma-1.890
opsy-1.873
 noses-1.735
verse-1.727
 manufact-1.695
Token:
Token ID25
Feature activation-3.18e+00

Pos logit contributions
roma+1.222
opsy+1.205
verse+1.125
 noses+1.086
 manufact+1.054
Neg logit contributions
umber-1.325
quartered-1.271
esson-1.176
netflix-1.133
lav-1.128
Token at
Token ID379
Feature activation-1.14e+01

Pos logit contributions
umber+2.993
quartered+2.707
esson+2.602
netflix+2.487
wark+2.450
Neg logit contributions
roma-3.287
opsy-3.239
verse-2.912
 noses-2.871
 manufact-2.866
Token the
Token ID262
Feature activation-4.72e+00

Pos logit contributions
roma+5.807
opsy+5.773
 noses+5.291
verse+5.171
 Aut+4.799
Neg logit contributions
umber-8.987
quartered-8.737
izo-8.306
netflix-8.156
-7.949
Token time
Token ID640
Feature activation-8.97e-01

Pos logit contributions
roma+2.944
opsy+2.917
verse+2.709
 noses+2.683
 manufact+2.590
Neg logit contributions
umber-3.530
quartered-3.298
esson-3.130
netflix-3.070
izo-3.054
Token.
Token ID13
Feature activation+6.97e-01

Pos logit contributions
roma+0.599
opsy+0.592
verse+0.546
 noses+0.537
 manufact+0.521
Neg logit contributions
umber-0.648
quartered-0.603
esson-0.572
netflix-0.551
wark-0.548
Token "
Token ID366
Feature activation+5.83e+00

Pos logit contributions
umber+0.489
quartered+0.452
esson+0.431
netflix+0.413
wark+0.413
Neg logit contributions
roma-0.480
opsy-0.474
verse-0.435
 noses-0.431
 manufact-0.421
TokenJan
Token ID12128
Feature activation+1.29e+01

Pos logit contributions
umber+3.809
Constructed+3.327
quartered+3.305
esson+3.303
netflix+3.185
Neg logit contributions
roma-4.223
opsy-4.162
 noses-3.845
 manufact-3.785
 sugg-3.703
Token would
Token ID561
Feature activation+2.35e+00

Pos logit contributions
umber+9.109
quartered+8.137
esson+7.996
iewicz+7.705
izo+7.552
Neg logit contributions
roma-7.839
opsy-7.575
 noses-6.949
 manufact-6.898
verse-6.802
Token get
Token ID651
Feature activation+4.46e+00

Pos logit contributions
umber+1.630
quartered+1.472
esson+1.419
netflix+1.358
wark+1.358
Neg logit contributions
roma-1.637
opsy-1.609
verse-1.483
 noses-1.458
 manufact-1.436
Token her
Token ID607
Feature activation+7.90e+00

Pos logit contributions
umber+2.995
quartered+2.720
esson+2.574
Constructed+2.480
netflix+2.462
Neg logit contributions
roma-3.194
opsy-3.136
verse-2.837
 noses-2.760
 manufact-2.728
Token pencil
Token ID21613
Feature activation+6.94e+00

Pos logit contributions
umber+5.552
quartered+4.996
esson+4.826
lav+4.542
netflix+4.540
Neg logit contributions
roma-5.295
opsy-5.043
verse-4.528
iq-4.369
rique-4.295
Tokens
Token ID82
Feature activation+5.73e+00

Pos logit contributions
umber+4.676
quartered+4.231
esson+4.010
wark+3.819
lav+3.803
Neg logit contributions
roma-4.878
opsy-4.805
verse-4.342
 noses-4.196
 manufact-4.145
Tokenve
Token ID303
Feature activation+3.23e+00

Pos logit contributions
umber+2.483
quartered+2.336
esson+2.244
lav+2.187
wark+2.163
Neg logit contributions
roma-1.572
opsy-1.567
 noses-1.392
verse-1.385
 manufact-1.337
Token talked
Token ID6619
Feature activation+3.27e-02

Pos logit contributions
umber+2.153
quartered+1.980
esson+1.890
lav+1.807
izo+1.798
Neg logit contributions
roma-2.308
opsy-2.298
verse-2.092
 noses-2.060
 manufact-2.014
Token about
Token ID546
Feature activation+8.86e-01

Pos logit contributions
umber+0.024
quartered+0.022
esson+0.021
netflix+0.021
wark+0.020
Neg logit contributions
roma-0.021
opsy-0.021
verse-0.019
 noses-0.019
 manufact-0.019
Token a
Token ID257
Feature activation-1.93e+00

Pos logit contributions
umber+0.610
quartered+0.562
esson+0.534
wark+0.510
netflix+0.510
Neg logit contributions
roma-0.625
opsy-0.617
verse-0.569
 noses-0.560
 manufact-0.546
Token lot
Token ID1256
Feature activation+2.52e+00

Pos logit contributions
roma+1.388
opsy+1.369
verse+1.289
 noses+1.253
 manufact+1.228
Neg logit contributions
umber-1.294
quartered-1.207
esson-1.132
netflix-1.089
wark-1.086
Token here
Token ID994
Feature activation+1.33e+00

Pos logit contributions
umber+1.738
quartered+1.588
esson+1.512
wark+1.445
lav+1.444
Neg logit contributions
roma-1.789
opsy-1.754
verse-1.606
 noses-1.589
 manufact-1.553
Token because
Token ID780
Feature activation-3.12e+00

Pos logit contributions
umber+0.928
quartered+0.857
esson+0.816
wark+0.780
lav+0.777
Neg logit contributions
roma-0.926
opsy-0.913
verse-0.840
 noses-0.831
 manufact-0.811
Token he
Token ID339
Feature activation+1.15e+00

Pos logit contributions
roma+1.973
opsy+1.940
verse+1.801
 noses+1.761
 manufact+1.729
Neg logit contributions
umber-2.321
quartered-2.179
esson-2.071
netflix-2.003
wark-1.983
Token
Token ID447
Feature activation+8.90e-01

Pos logit contributions
umber+0.833
quartered+0.772
esson+0.737
wark+0.703
netflix+0.702
Neg logit contributions
roma-0.769
opsy-0.759
verse-0.698
 noses-0.686
 manufact-0.669
Token
Token ID247
Feature activation+2.82e+00

Pos logit contributions
umber+0.389
quartered+0.340
esson+0.313
lav+0.289
netflix+0.288
Neg logit contributions
roma-0.849
opsy-0.843
verse-0.796
 noses-0.788
 manufact-0.775
Tokens
Token ID82
Feature activation-2.22e-01

Pos logit contributions
umber+2.040
quartered+1.891
esson+1.819
lav+1.742
netflix+1.733
Neg logit contributions
roma-1.853
opsy-1.840
verse-1.671
 noses-1.662
 manufact-1.608
Token,
Token ID11
Feature activation+6.37e+00

Pos logit contributions
roma+0.547
opsy+0.540
verse+0.499
 noses+0.492
 manufact+0.478
Neg logit contributions
umber-0.588
quartered-0.548
esson-0.521
netflix-0.500
wark-0.497
Token S
Token ID311
Feature activation-6.22e+00

Pos logit contributions
umber+4.303
quartered+3.886
esson+3.720
netflix+3.539
wark+3.524
Neg logit contributions
roma-4.490
opsy-4.483
verse-4.106
 noses-3.923
 manufact-3.870
Tokenaki
Token ID8182
Feature activation-1.28e+00

Pos logit contributions
roma+1.652
opsy+1.585
verse+1.297
iq+1.183
 noses+1.097
Neg logit contributions
umber-6.727
quartered-6.658
-6.266
netflix-6.239
esson-6.224
Token broke
Token ID6265
Feature activation-4.11e+00

Pos logit contributions
roma+0.839
opsy+0.828
verse+0.765
 noses+0.755
 manufact+0.732
Neg logit contributions
umber-0.941
quartered-0.881
esson-0.834
netflix-0.805
lav-0.798
Token into
Token ID656
Feature activation+4.93e+00

Pos logit contributions
roma+2.587
opsy+2.553
 noses+2.433
verse+2.392
 manufact+2.314
Neg logit contributions
umber-2.994
quartered-2.817
esson-2.690
netflix-2.585
izo-2.573
Token his
Token ID465
Feature activation+6.26e+00

Pos logit contributions
umber+3.484
quartered+3.136
esson+2.977
wark+2.870
netflix+2.866
Neg logit contributions
roma-3.437
opsy-3.388
verse-3.089
 noses-2.977
 manufact-2.896
Token apartment
Token ID7962
Feature activation+1.00e+00

Pos logit contributions
umber+4.410
quartered+4.095
esson+3.818
wark+3.725
lav+3.687
Neg logit contributions
roma-4.238
opsy-4.207
verse-3.698
 noses-3.585
iq-3.531
Token and
Token ID290
Feature activation-1.78e+00

Pos logit contributions
umber+0.707
quartered+0.654
esson+0.622
netflix+0.598
wark+0.595
Neg logit contributions
roma-0.687
opsy-0.679
verse-0.622
 noses-0.612
 manufact-0.597
Token murdered
Token ID12864
Feature activation+5.27e+00

Pos logit contributions
roma+1.132
opsy+1.121
verse+1.034
 noses+1.023
 manufact+0.993
Neg logit contributions
umber-1.332
quartered-1.241
esson-1.179
netflix-1.138
lav-1.132
Token Tang
Token ID18816
Feature activation-1.72e+00

Pos logit contributions
umber+3.749
quartered+3.365
esson+3.230
netflix+3.126
wark+3.095
Neg logit contributions
roma-3.603
opsy-3.568
verse-3.250
 noses-3.116
 manufact-3.044
Token Shen
Token ID22323
Feature activation-5.68e-01

Pos logit contributions
roma+1.099
opsy+1.078
verse+0.995
 noses+0.979
 manufact+0.949
Neg logit contributions
umber-1.286
quartered-1.216
esson-1.139
netflix-1.106
Constructed-1.099
Token di
Token ID2566
Feature activation-7.48e+00

Pos logit contributions
umber+3.019
quartered+2.821
esson+2.660
wark+2.593
netflix+2.544
Neg logit contributions
roma-2.885
opsy-2.861
 noses-2.581
verse-2.530
 manufact-2.454
Tokenas
Token ID292
Feature activation+2.08e+00

Pos logit contributions
opsy+4.398
roma+4.291
verse+4.162
 noses+3.761
iq+3.685
Neg logit contributions
umber-5.578
quartered-5.443
esson-4.993
-4.951
lav-4.941
Tokenpora
Token ID38851
Feature activation+6.52e-02

Pos logit contributions
umber+1.459
quartered+1.344
esson+1.280
wark+1.223
lav+1.222
Neg logit contributions
roma-1.445
opsy-1.421
verse-1.301
 noses-1.299
 manufact-1.255
Token,
Token ID11
Feature activation+5.29e+00

Pos logit contributions
umber+0.046
quartered+0.043
esson+0.041
netflix+0.039
wark+0.039
Neg logit contributions
roma-0.045
opsy-0.044
verse-0.041
 noses-0.040
 manufact-0.039
Token we
Token ID356
Feature activation+3.70e+00

Pos logit contributions
umber+3.333
quartered+3.044
esson+2.846
netflix+2.747
wark+2.728
Neg logit contributions
roma-3.984
opsy-3.887
verse-3.559
 noses-3.514
 manufact-3.450
Token represent
Token ID2380
Feature activation+1.40e+00

Pos logit contributions
umber+2.542
quartered+2.319
esson+2.215
wark+2.140
netflix+2.112
Neg logit contributions
roma-2.596
opsy-2.541
verse-2.342
 noses-2.300
 manufact-2.246
Token a
Token ID257
Feature activation+4.16e+00

Pos logit contributions
umber+0.956
quartered+0.880
esson+0.835
netflix+0.799
wark+0.798
Neg logit contributions
roma-0.999
opsy-0.985
verse-0.907
 noses-0.891
 manufact-0.871
Token second
Token ID1218
Feature activation+1.86e+00

Pos logit contributions
umber+2.987
quartered+2.769
esson+2.639
wark+2.576
+2.525
Neg logit contributions
roma-2.749
opsy-2.703
 noses-2.444
verse-2.407
 manufact-2.328
Token army
Token ID5428
Feature activation+6.71e+00

Pos logit contributions
umber+1.339
quartered+1.231
esson+1.171
wark+1.134
netflix+1.126
Neg logit contributions
roma-1.247
opsy-1.230
verse-1.130
 noses-1.113
 manufact-1.084
Token of
Token ID286
Feature activation+2.88e+00

Pos logit contributions
umber+4.306
quartered+4.039
esson+3.762
wark+3.654
Constructed+3.600
Neg logit contributions
roma-4.935
opsy-4.727
verse-4.365
 noses-4.328
enz-4.177
Token the
Token ID262
Feature activation+7.81e+00

Pos logit contributions
umber+1.985
quartered+1.836
esson+1.741
wark+1.664
lav+1.659
Neg logit contributions
roma-2.013
opsy-1.977
verse-1.812
 noses-1.777
 manufact-1.740
Token main
Token ID1388
Feature activation+2.28e+00

Pos logit contributions
roma+0.648
opsy+0.638
verse+0.593
 noses+0.583
 manufact+0.566
Neg logit contributions
umber-0.712
quartered-0.664
esson-0.630
netflix-0.606
lav-0.602
Token takeaway
Token ID43678
Feature activation-6.86e+00

Pos logit contributions
umber+1.750
quartered+1.632
esson+1.560
wark+1.503
netflix+1.493
Neg logit contributions
roma-1.424
opsy-1.402
verse-1.248
 noses-1.243
 manufact-1.203
Token is
Token ID318
Feature activation+1.20e+00

Pos logit contributions
roma+3.880
opsy+3.827
verse+3.660
 noses+3.494
 manufact+3.418
Neg logit contributions
umber-5.375
quartered-5.061
esson-4.856
netflix-4.728
lav-4.707
Token that
Token ID326
Feature activation-2.92e+00

Pos logit contributions
umber+0.832
quartered+0.767
esson+0.729
netflix+0.697
wark+0.697
Neg logit contributions
roma-0.839
opsy-0.830
verse-0.762
 noses-0.751
 manufact-0.731
Token,
Token ID11
Feature activation-1.74e+00

Pos logit contributions
roma+1.865
opsy+1.831
verse+1.723
 noses+1.697
 manufact+1.655
Neg logit contributions
umber-2.143
quartered-2.033
esson-1.911
netflix-1.854
lav-1.843
Token contrary
Token ID10388
Feature activation+7.93e+00

Pos logit contributions
roma+1.172
opsy+1.155
verse+1.077
 noses+1.063
 manufact+1.032
Neg logit contributions
umber-1.241
quartered-1.161
esson-1.096
netflix-1.055
wark-1.047
Token to
Token ID284
Feature activation+6.65e+00

Pos logit contributions
umber+4.611
esson+3.881
quartered+3.840
netflix+3.727
Constructed+3.703
Neg logit contributions
roma-6.274
opsy-6.158
 metic-5.756
 noses-5.638
 manufact-5.588
Token all
Token ID477
Feature activation+2.77e+00

Pos logit contributions
umber+4.201
esson+3.646
quartered+3.610
wark+3.497
netflix+3.446
Neg logit contributions
roma-4.951
opsy-4.865
verse-4.335
 noses-4.266
 manufact-4.181
Token expectations
Token ID9027
Feature activation-6.29e+00

Pos logit contributions
umber+1.905
quartered+1.724
esson+1.668
netflix+1.592
wark+1.592
Neg logit contributions
roma-1.954
opsy-1.917
verse-1.749
 noses-1.720
 manufact-1.682
Token,
Token ID11
Feature activation-1.95e+00

Pos logit contributions
roma+3.680
opsy+3.624
verse+3.429
 noses+3.328
 manufact+3.196
Neg logit contributions
umber-4.826
quartered-4.587
esson-4.291
lav-4.201
netflix-4.195
Token the
Token ID262
Feature activation-1.47e+00

Pos logit contributions
roma+1.242
opsy+1.219
verse+1.138
 noses+1.118
 manufact+1.088
Neg logit contributions
umber-1.448
quartered-1.368
esson-1.294
netflix-1.250
lav-1.244
Token shifts
Token ID15381
Feature activation+5.16e-01

Pos logit contributions
roma+1.336
opsy+1.318
verse+1.228
 noses+1.224
 manufact+1.177
Neg logit contributions
umber-1.441
quartered-1.351
esson-1.272
netflix-1.227
wark-1.216
Token my
Token ID616
Feature activation+3.56e+00

Pos logit contributions
umber+0.367
quartered+0.340
esson+0.323
netflix+0.309
wark+0.309
Neg logit contributions
roma-0.352
opsy-0.347
verse-0.320
 noses-0.314
 manufact-0.306
Token focus
Token ID2962
Feature activation-1.64e-01

Pos logit contributions
umber+2.424
quartered+2.236
esson+2.115
Constructed+2.008
lav+2.006
Neg logit contributions
roma-2.550
opsy-2.477
verse-2.277
 noses-2.205
 manufact-2.157
Token away
Token ID1497
Feature activation-4.22e+00

Pos logit contributions
roma+0.111
opsy+0.110
verse+0.101
 noses+0.100
 manufact+0.097
Neg logit contributions
umber-0.117
quartered-0.109
esson-0.104
netflix-0.100
wark-0.099
Token from
Token ID422
Feature activation-1.49e-01

Pos logit contributions
roma+2.530
opsy+2.498
verse+2.335
 noses+2.276
 manufact+2.209
Neg logit contributions
umber-3.229
quartered-3.092
esson-2.895
netflix-2.804
wark-2.801
Token learning
Token ID4673
Feature activation+1.60e+00

Pos logit contributions
roma+0.101
opsy+0.100
verse+0.092
 noses+0.091
 manufact+0.088
Neg logit contributions
umber-0.107
quartered-0.099
esson-0.094
netflix-0.090
wark-0.090
Token and
Token ID290
Feature activation+1.75e+00

Pos logit contributions
umber+1.124
quartered+1.032
esson+0.988
netflix+0.936
lav+0.936
Neg logit contributions
roma-1.104
opsy-1.088
verse-1.004
 noses-0.984
 manufact-0.956
Token toward
Token ID3812
Feature activation+4.47e+00

Pos logit contributions
umber+1.187
quartered+1.092
esson+1.040
wark+0.991
netflix+0.986
Neg logit contributions
roma-1.247
opsy-1.231
verse-1.137
 noses-1.110
 manufact-1.081
Token frequent
Token ID10792
Feature activation+4.88e+00

Pos logit contributions
umber+3.142
quartered+2.883
esson+2.771
wark+2.641
lav+2.634
Neg logit contributions
roma-3.052
opsy-3.001
verse-2.720
 noses-2.650
iq-2.571
Token input
Token ID5128
Feature activation+8.77e-01

Pos logit contributions
umber+3.460
quartered+3.175
esson+3.058
izo+2.920
lav+2.914
Neg logit contributions
roma-3.302
opsy-3.207
verse-2.890
 noses-2.847
iq-2.735
Tokenting
Token ID889
Feature activation-4.96e-01

Pos logit contributions
umber+0.621
quartered+0.572
esson+0.545
wark+0.520
netflix+0.520
Neg logit contributions
roma-0.603
opsy-0.594
verse-0.547
 noses-0.539
 manufact-0.526
Tokenored
Token ID1850
Feature activation-1.88e+00

Pos logit contributions
umber+0.515
quartered+0.479
esson+0.456
lav+0.437
netflix+0.436
Neg logit contributions
roma-0.444
opsy-0.440
verse-0.402
 noses-0.397
 manufact-0.385
Token,
Token ID11
Feature activation-2.89e+00

Pos logit contributions
roma+1.221
opsy+1.205
verse+1.113
 noses+1.093
 manufact+1.057
Neg logit contributions
umber-1.399
quartered-1.311
esson-1.233
netflix-1.198
lav-1.190
Token Bio
Token ID16024
Feature activation-1.55e+00

Pos logit contributions
roma+1.947
opsy+1.919
verse+1.777
 noses+1.741
 manufact+1.697
Neg logit contributions
umber-2.067
quartered-1.927
esson-1.802
netflix-1.753
lav-1.746
TokenShock
Token ID31646
Feature activation+4.13e-01

Pos logit contributions
roma+0.685
opsy+0.674
verse+0.596
 noses+0.568
 manufact+0.543
Neg logit contributions
umber-1.476
quartered-1.406
esson-1.341
netflix-1.305
lav-1.302
Token,
Token ID11
Feature activation-7.93e+00

Pos logit contributions
umber+0.289
quartered+0.268
esson+0.255
netflix+0.244
wark+0.243
Neg logit contributions
roma-0.285
opsy-0.281
verse-0.260
 noses-0.256
 manufact-0.249
Token or
Token ID393
Feature activation-3.95e+00

Pos logit contributions
roma+4.577
opsy+4.530
verse+4.202
 noses+4.067
 manufact+3.908
Neg logit contributions
umber-6.119
quartered-5.883
esson-5.393
netflix-5.294
lav-5.214
Token several
Token ID1811
Feature activation-1.69e+00

Pos logit contributions
roma+2.656
opsy+2.632
verse+2.465
 noses+2.399
 manufact+2.333
Neg logit contributions
umber-2.792
quartered-2.597
esson-2.439
netflix-2.390
wark-2.347
Token similar
Token ID2092
Feature activation-6.32e+00

Pos logit contributions
roma+1.138
opsy+1.123
verse+1.045
 noses+1.022
 manufact+1.000
Neg logit contributions
umber-1.205
quartered-1.119
esson-1.061
netflix-1.026
wark-1.023
Token games
Token ID1830
Feature activation-4.24e+00

Pos logit contributions
roma+3.710
opsy+3.689
verse+3.489
 noses+3.411
 manufact+3.329
Neg logit contributions
umber-4.852
quartered-4.513
esson-4.284
netflix-4.239
wark-4.220
Token.
Token ID13
Feature activation+3.97e-01

Pos logit contributions
roma+2.718
opsy+2.687
verse+2.480
 noses+2.443
 manufact+2.379
Neg logit contributions
umber-3.119
quartered-2.901
esson-2.720
netflix-2.695
wark-2.647
Token It
Token ID632
Feature activation-4.15e+00

Pos logit contributions
umber+0.272
quartered+0.251
esson+0.239
netflix+0.228
wark+0.228
Neg logit contributions
roma-0.280
opsy-0.277
verse-0.255
 noses-0.252
 manufact-0.246
Tokenasure
Token ID5015
Feature activation-2.61e+00

Pos logit contributions
roma+5.988
opsy+5.885
verse+5.469
iq+5.301
enz+5.246
Neg logit contributions
umber-5.293
quartered-5.202
netflix-4.850
Constructed-4.743
-4.726
Token,
Token ID11
Feature activation-4.90e+00

Pos logit contributions
roma+1.717
opsy+1.697
verse+1.578
 noses+1.535
 manufact+1.493
Neg logit contributions
umber-1.883
quartered-1.769
esson-1.660
netflix-1.624
Constructed-1.600
Token R
Token ID371
Feature activation+2.32e-01

Pos logit contributions
roma+3.099
opsy+3.073
verse+2.942
 noses+2.853
 manufact+2.726
Neg logit contributions
umber-3.538
quartered-3.468
esson-3.124
netflix-3.086
lav-3.045
Token.
Token ID13
Feature activation-1.16e-01

Pos logit contributions
umber+0.165
quartered+0.153
esson+0.146
netflix+0.140
wark+0.139
Neg logit contributions
roma-0.158
opsy-0.155
verse-0.143
 noses-0.141
 manufact-0.138
Token H
Token ID367
Feature activation-3.66e+00

Pos logit contributions
roma+0.080
opsy+0.079
verse+0.073
 noses+0.072
 manufact+0.070
Neg logit contributions
umber-0.081
quartered-0.075
esson-0.071
netflix-0.068
wark-0.068
Token.,
Token ID1539
Feature activation-8.05e+00

Pos logit contributions
roma+2.353
opsy+2.310
verse+2.122
 noses+2.047
 manufact+2.006
Neg logit contributions
umber-2.676
quartered-2.596
esson-2.379
Constructed-2.344
netflix-2.341
Token Al
Token ID978
Feature activation-1.43e+00

Pos logit contributions
roma+5.246
opsy+5.152
verse+4.954
 noses+4.709
 manufact+4.570
Neg logit contributions
quartered-5.410
umber-5.401
esson-4.796
Constructed-4.735
netflix-4.684
Tokenlee
Token ID7197
Feature activation-1.29e+00

Pos logit contributions
roma+0.928
opsy+0.912
verse+0.843
 noses+0.819
 manufact+0.797
Neg logit contributions
umber-1.053
quartered-0.989
esson-0.931
netflix-0.903
Constructed-0.898
Token,
Token ID11
Feature activation+3.99e+00

Pos logit contributions
roma+0.864
opsy+0.856
verse+0.790
 noses+0.769
 manufact+0.749
Neg logit contributions
umber-0.927
quartered-0.866
esson-0.817
netflix-0.788
Constructed-0.783
Token W
Token ID370
Feature activation-3.92e+00

Pos logit contributions
umber+2.615
esson+2.287
quartered+2.276
wark+2.175
Constructed+2.146
Neg logit contributions
roma-2.960
opsy-2.913
 noses-2.609
verse-2.593
 manufact-2.573
Token.
Token ID13
Feature activation+1.42e-01

Pos logit contributions
roma+2.465
opsy+2.433
verse+2.231
 noses+2.168
 manufact+2.098
Neg logit contributions
umber-2.902
quartered-2.783
netflix-2.572
esson-2.565
Constructed-2.532
Token posted
Token ID4481
Feature activation-3.85e+00

Pos logit contributions
roma+2.494
opsy+2.424
verse+2.314
 noses+2.269
 manufact+2.214
Neg logit contributions
umber-2.693
quartered-2.525
esson-2.384
lav-2.287
netflix-2.285
Token of
Token ID286
Feature activation-4.11e+00

Pos logit contributions
roma+2.592
opsy+2.522
verse+2.410
 noses+2.356
 manufact+2.264
Neg logit contributions
umber-2.707
quartered-2.531
esson-2.402
netflix-2.290
-2.282
Token her
Token ID607
Feature activation+7.03e+00

Pos logit contributions
roma+2.387
opsy+2.331
verse+2.221
 noses+2.175
 manufact+2.087
Neg logit contributions
umber-3.264
quartered-3.077
esson-2.930
wark-2.836
izo-2.827
Token.
Token ID13
Feature activation+4.22e+00

Pos logit contributions
umber+4.355
quartered+3.932
esson+3.755
lav+3.648
wark+3.606
Neg logit contributions
roma-5.212
opsy-5.178
verse-4.632
 noses-4.556
 manufact-4.489
Token She
Token ID1375
Feature activation-1.05e+00

Pos logit contributions
umber+2.734
quartered+2.437
esson+2.412
netflix+2.319
wark+2.304
Neg logit contributions
opsy-3.084
roma-3.065
 noses-2.814
 manufact-2.753
verse-2.733
Token likened
Token ID41679
Feature activation-4.75e+00

Pos logit contributions
roma+0.718
opsy+0.707
verse+0.658
 noses+0.648
 manufact+0.629
Neg logit contributions
umber-0.735
quartered-0.685
esson-0.645
netflix-0.620
lav-0.617
Token the
Token ID262
Feature activation-4.64e+00

Pos logit contributions
roma+2.940
opsy+2.869
verse+2.754
 noses+2.691
 manufact+2.567
Neg logit contributions
umber-3.566
quartered-3.348
esson-3.168
-3.064
izo-3.028
Token unauthorized
Token ID22959
Feature activation+2.69e+00

Pos logit contributions
roma+3.015
opsy+2.960
verse+2.847
 noses+2.796
 manufact+2.665
Neg logit contributions
umber-3.380
quartered-3.139
esson-2.959
netflix-2.870
izo-2.851
Token use
Token ID779
Feature activation-2.45e+00

Pos logit contributions
umber+1.902
quartered+1.748
esson+1.673
netflix+1.609
lav+1.595
Neg logit contributions
roma-1.830
opsy-1.816
verse-1.632
 noses-1.603
 manufact-1.568
Token of
Token ID286
Feature activation-4.55e+00

Pos logit contributions
roma+1.626
opsy+1.599
verse+1.487
 noses+1.453
 manufact+1.412
Neg logit contributions
umber-1.770
quartered-1.653
esson-1.565
wark-1.498
netflix-1.498
Token photos
Token ID5205
Feature activation+1.12e+00

Pos logit contributions
roma+2.883
opsy+2.786
verse+2.687
 noses+2.630
 manufact+2.521
Neg logit contributions
umber-3.404
quartered-3.207
esson-3.022
wark-2.886
netflix-2.884
Token only
Token ID691
Feature activation-2.13e+00

Pos logit contributions
umber+0.798
quartered+0.737
esson+0.700
lav+0.668
wark+0.668
Neg logit contributions
roma-0.762
opsy-0.753
verse-0.688
 noses-0.675
 manufact-0.662
Token way
Token ID835
Feature activation-3.16e-01

Pos logit contributions
roma+1.366
opsy+1.347
verse+1.269
 noses+1.248
 manufact+1.200
Neg logit contributions
umber-1.586
quartered-1.486
esson-1.404
netflix-1.376
lav-1.355
Token to
Token ID284
Feature activation+3.03e+00

Pos logit contributions
roma+0.212
opsy+0.209
verse+0.193
 noses+0.190
 manufact+0.185
Neg logit contributions
umber-0.228
quartered-0.212
esson-0.201
netflix-0.193
wark-0.193
Token go
Token ID467
Feature activation-4.73e-03

Pos logit contributions
umber+2.146
quartered+1.949
esson+1.870
wark+1.817
izo+1.795
Neg logit contributions
roma-2.068
opsy-2.054
verse-1.844
 noses-1.812
 manufact-1.794
Token past
Token ID1613
Feature activation+3.30e-01

Pos logit contributions
roma+0.003
opsy+0.003
verse+0.003
 noses+0.003
 manufact+0.003
Neg logit contributions
umber-0.004
quartered-0.003
esson-0.003
netflix-0.003
wark-0.003
Token them
Token ID606
Feature activation+3.14e-01

Pos logit contributions
umber+0.237
quartered+0.220
esson+0.209
netflix+0.200
wark+0.200
Neg logit contributions
roma-0.223
opsy-0.220
verse-0.203
 noses-0.199
 manufact-0.194
Token is
Token ID318
Feature activation+3.07e+00

Pos logit contributions
umber+0.236
quartered+0.220
esson+0.210
netflix+0.202
wark+0.201
Neg logit contributions
roma-0.200
opsy-0.198
verse-0.181
 noses-0.178
 manufact-0.173
Token to
Token ID284
Feature activation+7.97e+00

Pos logit contributions
umber+2.068
quartered+1.882
esson+1.778
wark+1.709
netflix+1.702
Neg logit contributions
roma-2.215
opsy-2.178
verse-2.004
 noses-1.940
 manufact-1.931
Token go
Token ID467
Feature activation-9.43e-02

Pos logit contributions
umber+5.446
quartered+4.757
wark+4.534
esson+4.518
izo+4.451
Neg logit contributions
opsy-5.402
roma-5.391
verse-4.726
 manufact-4.695
rique-4.571
Token inside
Token ID2641
Feature activation+1.19e+00

Pos logit contributions
roma+0.065
opsy+0.064
verse+0.059
 noses+0.058
 manufact+0.057
Neg logit contributions
umber-0.066
quartered-0.061
esson-0.058
netflix-0.056
wark-0.056
Token them
Token ID606
Feature activation-1.10e+00

Pos logit contributions
umber+0.848
quartered+0.785
esson+0.745
wark+0.713
netflix+0.712
Neg logit contributions
roma-0.815
opsy-0.803
verse-0.736
 noses-0.723
 manufact-0.709
Token found
Token ID1043
Feature activation+2.61e+00

Pos logit contributions
roma+0.500
opsy+0.494
verse+0.455
 noses+0.450
 manufact+0.435
Neg logit contributions
umber-0.530
quartered-0.495
esson-0.466
netflix-0.449
wark-0.447
Token out
Token ID503
Feature activation+2.55e+00

Pos logit contributions
umber+1.605
quartered+1.444
esson+1.388
netflix+1.318
lav+1.307
Neg logit contributions
roma-2.030
opsy-1.995
verse-1.845
 noses-1.799
 manufact-1.782
Token the
Token ID262
Feature activation+7.79e-01

Pos logit contributions
umber+1.742
quartered+1.596
esson+1.530
netflix+1.457
lav+1.447
Neg logit contributions
roma-1.813
opsy-1.771
verse-1.634
 noses-1.608
 manufact-1.584
Token bombers
Token ID26127
Feature activation-1.07e+00

Pos logit contributions
umber+0.564
quartered+0.522
esson+0.497
netflix+0.477
wark+0.475
Neg logit contributions
roma-0.522
opsy-0.514
verse-0.472
 noses-0.464
 manufact-0.453
Token were
Token ID547
Feature activation+5.35e-01

Pos logit contributions
roma+0.704
opsy+0.696
verse+0.642
 noses+0.633
 manufact+0.613
Neg logit contributions
umber-0.781
quartered-0.728
esson-0.689
netflix-0.663
wark-0.663
Token Che
Token ID2580
Feature activation-1.99e+00

Pos logit contributions
umber+0.386
quartered+0.359
esson+0.341
netflix+0.328
lav+0.326
Neg logit contributions
roma-0.358
opsy-0.354
verse-0.325
 noses-0.320
 manufact-0.312
Tokenchens
Token ID29937
Feature activation+2.57e+00

Pos logit contributions
roma+1.235
opsy+1.200
verse+1.105
 noses+1.076
 manufact+1.042
Neg logit contributions
umber-1.537
quartered-1.450
esson-1.362
Constructed-1.319
netflix-1.318
Token,
Token ID11
Feature activation-4.25e-02

Pos logit contributions
umber+1.823
quartered+1.672
esson+1.603
netflix+1.540
lav+1.529
Neg logit contributions
roma-1.749
opsy-1.721
verse-1.584
 noses-1.559
 manufact-1.535
Token and
Token ID290
Feature activation-2.66e+00

Pos logit contributions
roma+0.029
opsy+0.028
verse+0.026
 noses+0.026
 manufact+0.025
Neg logit contributions
umber-0.031
quartered-0.028
esson-0.027
netflix-0.026
wark-0.026
Token they
Token ID484
Feature activation+8.25e-01

Pos logit contributions
roma+1.708
opsy+1.694
verse+1.566
 noses+1.549
 manufact+1.496
Neg logit contributions
umber-1.970
quartered-1.835
esson-1.728
wark-1.679
netflix-1.677
Token started
Token ID2067
Feature activation-6.95e-01

Pos logit contributions
umber+0.581
quartered+0.538
esson+0.512
netflix+0.491
wark+0.489
Neg logit contributions
roma-0.566
opsy-0.559
verse-0.516
 noses-0.507
 manufact-0.496
TokenIt
Token ID1026
Feature activation-1.12e+00

Pos logit contributions
roma+0.705
opsy+0.696
verse+0.646
 noses+0.624
 manufact+0.607
Neg logit contributions
umber-0.804
quartered-0.756
esson-0.712
netflix-0.684
wark-0.683
Token comes
Token ID2058
Feature activation-7.56e-01

Pos logit contributions
roma+0.738
opsy+0.727
verse+0.671
 noses+0.663
 manufact+0.641
Neg logit contributions
umber-0.821
quartered-0.769
esson-0.725
netflix-0.699
wark-0.695
Token time
Token ID640
Feature activation-5.96e-01

Pos logit contributions
roma+0.492
opsy+0.485
verse+0.448
 noses+0.442
 manufact+0.428
Neg logit contributions
umber-0.557
quartered-0.523
esson-0.496
netflix-0.476
lav-0.474
Token to
Token ID284
Feature activation+3.62e-01

Pos logit contributions
roma+0.396
opsy+0.392
verse+0.361
 noses+0.356
 manufact+0.345
Neg logit contributions
umber-0.432
quartered-0.403
esson-0.382
netflix-0.367
wark-0.366
Token plea
Token ID16720
Feature activation-1.43e+01

Pos logit contributions
umber+0.262
quartered+0.243
esson+0.231
netflix+0.222
wark+0.222
Neg logit contributions
roma-0.242
opsy-0.239
verse-0.220
 noses-0.216
 manufact-0.211
Token-
Token ID12
Feature activation-1.26e+01

Pos logit contributions
 noses+6.349
opsy+6.344
verse+6.277
roma+5.973
 dime+5.730
Neg logit contributions
umber-11.387
quartered-11.111
lav-10.492
esson-10.402
netflix-10.240
Tokenb
Token ID65
Feature activation-8.03e+00

Pos logit contributions
verse+4.716
opsy+4.293
roma+4.161
lit+3.867
dim+3.694
Neg logit contributions
umber-11.664
iewicz-11.249
esson-10.969
quartered-10.947
netflix-10.730
Tokenarg
Token ID853
Feature activation-9.83e-01

Pos logit contributions
opsy+2.576
verse+2.403
roma+2.371
 noses+1.943
enz+1.823
Neg logit contributions
quartered-8.192
umber-8.118
esson-7.682
Constructed-7.642
-7.619
Tokenain
Token ID391
Feature activation-4.90e+00

Pos logit contributions
roma+0.565
opsy+0.556
verse+0.506
 noses+0.499
 manufact+0.478
Neg logit contributions
umber-0.798
quartered-0.752
esson-0.716
netflix-0.695
wark-0.690
Token.
Token ID13
Feature activation-2.98e-01

Pos logit contributions
opsy+2.927
roma+2.903
verse+2.771
 noses+2.724
 manufact+2.629
Neg logit contributions
umber-3.681
quartered-3.526
esson-3.299
netflix-3.241
lav-3.198
Token Your
Token ID3406
Feature activation-1.24e+00

Pos logit contributions
roma+0.206
opsy+0.203
verse+0.188
 noses+0.185
 manufact+0.180
Neg logit contributions
umber-0.209
quartered-0.195
esson-0.184
netflix-0.177
lav-0.176
Token is
Token ID318
Feature activation+3.42e+00

Pos logit contributions
umber+1.354
quartered+1.261
esson+1.198
lav+1.150
wark+1.145
Neg logit contributions
roma-1.199
opsy-1.197
verse-1.085
 noses-1.067
 manufact-1.040
Token at
Token ID379
Feature activation-4.24e+00

Pos logit contributions
umber+2.335
quartered+2.170
esson+2.041
netflix+1.956
Constructed+1.949
Neg logit contributions
opsy-2.391
roma-2.377
verse-2.174
 noses-2.141
 manufact-2.079
Token least
Token ID1551
Feature activation-3.82e+00

Pos logit contributions
roma+2.703
opsy+2.610
verse+2.440
 noses+2.412
 manufact+2.335
Neg logit contributions
umber-3.125
quartered-2.966
esson-2.815
netflix-2.716
wark-2.712
Token 100
Token ID1802
Feature activation-1.02e+01

Pos logit contributions
roma+2.469
opsy+2.404
verse+2.273
 noses+2.237
 manufact+2.178
Neg logit contributions
umber-2.780
quartered-2.624
esson-2.477
wark-2.389
lav-2.383
Token,
Token ID11
Feature activation-1.76e+00

Pos logit contributions
roma+5.585
opsy+5.096
verse+4.937
 noses+4.851
 atmosp+4.777
Neg logit contributions
umber-8.052
quartered-7.474
netflix-7.330
esson-7.246
wark-7.048
Token000
Token ID830
Feature activation-1.19e+01

Pos logit contributions
roma+1.215
opsy+1.197
verse+1.116
 noses+1.090
 manufact+1.060
Neg logit contributions
umber-1.216
quartered-1.145
esson-1.073
netflix-1.029
wark-1.028
Token light
Token ID1657
Feature activation-4.03e+00

Pos logit contributions
roma+3.654
opsy+3.018
verse+3.018
 noses+2.902
 atmosp+2.894
Neg logit contributions
umber-12.052
quartered-11.458
netflix-11.195
esson-11.092
izo-10.994
Token-
Token ID12
Feature activation-1.01e+01

Pos logit contributions
roma+2.247
opsy+2.149
verse+2.060
 noses+2.025
 manufact+1.902
Neg logit contributions
umber-3.297
quartered-3.139
esson-2.990
wark-2.912
izo-2.901
Tokenyears
Token ID19002
Feature activation-5.18e+00

Pos logit contributions
roma+1.864
verse+1.627
opsy+1.573
dim+1.465
lit+1.234
Neg logit contributions
umber-11.308
quartered-11.244
esson-10.768
izo-10.723
��-10.586
Token across
Token ID1973
Feature activation-2.77e+00

Pos logit contributions
roma+3.210
opsy+3.034
verse+2.901
 noses+2.838
 manufact+2.724
Neg logit contributions
umber-3.971
quartered-3.720
esson-3.533
netflix-3.440
wark-3.435
Token.
Token ID13
Feature activation+5.53e+00

Pos logit contributions
roma+1.773
opsy+1.733
verse+1.623
 noses+1.584
 manufact+1.532
Neg logit contributions
umber-2.071
quartered-1.939
esson-1.839
netflix-1.784
wark-1.767
Token one
Token ID530
Feature activation-3.28e-01

Pos logit contributions
roma+0.315
opsy+0.311
verse+0.287
 noses+0.282
 manufact+0.275
Neg logit contributions
umber-0.337
quartered-0.314
esson-0.297
netflix-0.286
wark-0.285
Token less
Token ID1342
Feature activation-2.58e+00

Pos logit contributions
roma+0.222
opsy+0.219
verse+0.202
 noses+0.198
 manufact+0.193
Neg logit contributions
umber-0.235
quartered-0.218
esson-0.207
netflix-0.199
wark-0.198
Token thing
Token ID1517
Feature activation-5.99e+00

Pos logit contributions
roma+1.542
opsy+1.522
verse+1.413
 noses+1.347
 manufact+1.328
Neg logit contributions
umber-2.026
quartered-1.908
esson-1.819
netflix-1.772
lav-1.757
Token for
Token ID329
Feature activation-3.13e+00

Pos logit contributions
roma+3.612
opsy+3.539
verse+3.295
 noses+3.221
 manufact+3.123
Neg logit contributions
umber-4.534
quartered-4.288
esson-4.028
netflix-3.923
izo-3.881
Token you
Token ID345
Feature activation-4.71e+00

Pos logit contributions
roma+1.964
opsy+1.926
verse+1.809
 noses+1.766
 manufact+1.728
Neg logit contributions
umber-2.357
quartered-2.201
esson-2.083
netflix-2.021
lav-2.000
Token to
Token ID284
Feature activation-1.11e+01

Pos logit contributions
roma+2.930
opsy+2.843
verse+2.639
 noses+2.588
 manufact+2.501
Neg logit contributions
umber-3.541
quartered-3.351
esson-3.165
netflix-3.099
lav-3.033
Token deal
Token ID1730
Feature activation-4.39e+00

Pos logit contributions
roma+5.780
opsy+5.463
verse+5.238
iq+4.900
 noses+4.895
Neg logit contributions
umber-8.755
quartered-8.651
netflix-8.197
lav-8.000
esson-7.935
Token with
Token ID351
Feature activation-1.46e+00

Pos logit contributions
roma+2.734
opsy+2.689
verse+2.500
 noses+2.438
 manufact+2.350
Neg logit contributions
umber-3.325
quartered-3.127
esson-2.940
netflix-2.878
lav-2.857
Token.
Token ID13
Feature activation+9.65e-01

Pos logit contributions
roma+0.935
opsy+0.921
verse+0.851
 noses+0.835
 manufact+0.809
Neg logit contributions
umber-1.083
quartered-1.021
esson-0.963
netflix-0.932
lav-0.924
Token If
Token ID1002
Feature activation-2.73e-02

Pos logit contributions
umber+0.673
quartered+0.620
esson+0.592
wark+0.567
netflix+0.565
Neg logit contributions
roma-0.670
opsy-0.663
verse-0.608
 noses-0.602
 manufact-0.586
Token you
Token ID345
Feature activation-2.18e+00

Pos logit contributions
roma+0.018
opsy+0.018
verse+0.017
 noses+0.016
 manufact+0.016
Neg logit contributions
umber-0.020
quartered-0.018
esson-0.017
netflix-0.017
wark-0.017
Token appearing
Token ID12655
Feature activation-1.04e+01

Pos logit contributions
roma+6.007
opsy+5.676
 noses+5.132
verse+5.118
iq+4.834
Neg logit contributions
umber-7.721
quartered-7.440
esson-7.116
netflix-7.049
wark-6.983
Token very
Token ID845
Feature activation-1.12e+01

Pos logit contributions
roma+5.857
opsy+5.626
 noses+5.255
verse+5.223
 dime+4.834
Neg logit contributions
umber-7.998
quartered-7.759
esson-7.270
wark-7.056
netflix-7.044
Token skinny
Token ID29494
Feature activation+1.37e+00

Pos logit contributions
roma+6.478
opsy+5.907
 noses+5.873
verse+5.820
 manufact+5.342
Neg logit contributions
umber-8.547
quartered-8.429
esson-7.863
EStream-7.691
wark-7.661
Token,
Token ID11
Feature activation-1.23e+01

Pos logit contributions
umber+0.950
quartered+0.876
esson+0.830
wark+0.797
netflix+0.797
Neg logit contributions
roma-0.950
opsy-0.940
verse-0.863
 noses-0.849
 manufact-0.835
Token in
Token ID287
Feature activation-1.57e+01

Pos logit contributions
roma+6.522
opsy+6.263
 noses+5.971
verse+5.775
iq+5.462
Neg logit contributions
umber-9.382
quartered-9.331
netflix-8.767
esson-8.742
EStream-8.668
Token his
Token ID465
Feature activation-1.44e+01

Pos logit contributions
roma+6.960
 noses+6.217
opsy+6.210
verse+5.971
 dime+5.522
Neg logit contributions
quartered-12.804
umber-12.527
esson-12.189
netflix-12.134
EStream-12.105
Token 30
Token ID1542
Feature activation-3.80e+00

Pos logit contributions
roma+6.995
verse+6.700
opsy+6.673
 dime+6.566
 noses+6.494
Neg logit contributions
quartered-10.875
umber-10.828
esson-10.632
EStream-10.372
netflix-10.345
Tokens
Token ID82
Feature activation-4.51e+00

Pos logit contributions
roma+2.180
opsy+2.163
verse+1.981
 noses+1.916
 manufact+1.852
Neg logit contributions
umber-3.012
quartered-2.914
esson-2.743
netflix-2.687
lav-2.674
Token,
Token ID11
Feature activation-1.51e+01

Pos logit contributions
roma+2.792
opsy+2.729
verse+2.533
 noses+2.517
iq+2.379
Neg logit contributions
umber-3.414
quartered-3.228
esson-3.095
netflix-2.995
wark-2.951
Token standing
Token ID5055
Feature activation-2.07e+00

Pos logit contributions
roma+7.853
opsy+7.359
 noses+7.174
verse+7.029
 manufact+6.525
Neg logit contributions
umber-11.178
quartered-11.097
netflix-10.479
esson-10.427
EStream-10.305
Token 5
Token ID642
Feature activation-7.00e-01

Pos logit contributions
roma+1.326
opsy+1.303
verse+1.203
 noses+1.195
 manufact+1.144
Neg logit contributions
umber-1.546
quartered-1.458
esson-1.382
netflix-1.332
wark-1.328
Token Years
Token ID13212
Feature activation-6.20e+00

Pos logit contributions
roma+1.284
opsy+1.247
verse+1.145
 noses+1.111
 manufact+1.049
Neg logit contributions
umber-2.250
quartered-2.147
esson-2.049
wark-1.986
netflix-1.982
Token)
Token ID8
Feature activation-8.63e-01

Pos logit contributions
roma+3.664
opsy+3.551
verse+3.332
 noses+3.268
 manufact+3.126
Neg logit contributions
umber-4.788
quartered-4.550
esson-4.313
wark-4.175
netflix-4.151
Token
Token ID198
Feature activation+9.15e-01

Pos logit contributions
roma+0.587
opsy+0.578
verse+0.535
 noses+0.526
 manufact+0.513
Neg logit contributions
umber-0.612
quartered-0.572
esson-0.540
netflix-0.519
wark-0.518
Token
Token ID198
Feature activation-2.23e+00

Pos logit contributions
umber+0.626
quartered+0.570
esson+0.548
netflix+0.524
wark+0.523
Neg logit contributions
roma-0.648
opsy-0.640
verse-0.586
 noses-0.584
 manufact-0.571
Tokento
Token ID1462
Feature activation+8.41e-01

Pos logit contributions
roma+1.368
opsy+1.342
verse+1.247
 noses+1.196
 manufact+1.168
Neg logit contributions
umber-1.717
quartered-1.620
esson-1.544
wark-1.480
netflix-1.480
Token reach
Token ID3151
Feature activation-1.31e+01

Pos logit contributions
umber+0.604
quartered+0.560
esson+0.532
netflix+0.511
wark+0.510
Neg logit contributions
roma-0.565
opsy-0.558
verse-0.512
 noses-0.505
 manufact-0.492
Token the
Token ID262
Feature activation-1.32e+01

Pos logit contributions
roma+7.801
opsy+7.081
verse+6.840
 manufact+6.771
 atmosp+6.665
Neg logit contributions
umber-9.321
quartered-9.101
netflix-8.449
esson-8.375
EStream-8.117
Token Andromeda
Token ID34542
Feature activation-5.56e+00

Pos logit contributions
roma+8.358
opsy+7.587
 manufact+7.374
verse+7.303
 noses+7.265
Neg logit contributions
umber-9.108
quartered-8.541
netflix-8.216
esson-7.982
iewicz-7.898
Token galaxy
Token ID16161
Feature activation-7.17e+00

Pos logit contributions
roma+4.202
opsy+4.056
verse+3.887
 noses+3.748
 manufact+3.713
Neg logit contributions
umber-3.569
quartered-3.259
esson-3.063
netflix-2.995
iewicz-2.922
Token (
Token ID357
Feature activation-2.20e+00

Pos logit contributions
roma+3.914
opsy+3.805
verse+3.523
 noses+3.399
 manufact+3.303
Neg logit contributions
umber-5.862
quartered-5.540
esson-5.238
wark-5.089
izo-5.067
Token2
Token ID17
Feature activation-7.40e+00

Pos logit contributions
roma+1.513
opsy+1.482
verse+1.388
 noses+1.342
 manufact+1.305
Neg logit contributions
umber-1.537
quartered-1.431
esson-1.353
wark-1.301
netflix-1.295