TOP ACTIVATIONS
MAX = 1.033

and Policy Prior ities ( CB PP ) offered further analysis
Marine and Atmospheric Sciences ( So MAS ).
. x . 108 10 46 358 x . x .
' a , whereas with Atomic Ptr all bets are off
= c v 2 . get Struct uring Element ( cv
functions using the archived file from S 3 bucket . It
of your very own starship ! Dec orate it , expand
Some of the outer land m asses are so tiny and
via remote control or rear - mounted buttons , and its
applied on Line 26 to reduce high frequency noise . We
said she risked severe infection and scar ring if she didn
x ic Karen Carpenter 2008 01 15 O & A -
15 Photo : Le a Suzuki , The Chronicle 2 of
with another very fast TN Film panel and 144 Hz refresh
. ( Photo : Kare em El g azz ar )
, urged kids to get the best education they can and
qu ered flag ! Race . Shoot . Win .
Fight With Each other 200 80 205 O & A -
, c v 2 . COLOR _ B GR 2 G
. Nate Ter ning , AF RL rapid reaction projects director

MIN ACTIVATIONS
MIN = -1.057

Name Add a comment here Ver ification Send Email Cancel
debate Bernie : " it seems inappropriate that I would debate
but it is containing everything and beyond the universe of the
at medium - high heat until the onion is clear or
b ] Personal best : [/ b ] 1 : 07
= True , help =" path to images directory ") args
of the films . It is also interesting the Tob in
PF Bentley / Civil Beat Good win
[ b ] Console : [/ b ] Xbox 360 (
. You can read our Privacy Policy here . Sign up
--+ + -------------------- --+ < -- pop / pop / pop
complex Layer 7 attacks ( See a recommended D DoS Mit
art ext = ($ text - split "\ r
++) { if ($ i - eq 1 ) { $
--+ + -------------------- --+ < -- Start of read () '
op said . F ul op had hoped to
to Christopher Nolan s Batman Begins .) Ar on
moments of heightened attention like o ases in a broad desert
6 { ( 44 s )} 22 . R c 3
making of the films . It is also interesting the Tob

INTERVAL 0.510 - 1.033
CONTAINS 0.431%

ing and the one that I think reduces immersion more on
wear iness and confusion to the minds of readers .
or in one of the dinner conversations where connections are made
, and one glass door , taped shut , leads directly
years imprisonment and / or a $ 1 . 1 million

INTERVAL -0.012 - 0.510
CONTAINS 53.005%

a brick wall . I was broke , lonely , and
. That information has provided scientists a library of comparisons for
to show that there is a lot of something ( p
, she was an assistant director of the Women 's Army
n ovation FT W ! To some , this

INTERVAL -0.535 - -0.012
CONTAINS 46.257%

. I love the refreshing light taste of these cookies ,
Club and a new sponsor is for the Premier League season
s second - ranked guard , then par lay ed
, 27 - 21 . With quarterback Kyle Morris
ms was still alive - a music critic for the New

INTERVAL -1.057 - -0.535
CONTAINS 0.307%

you view the Seahawks historically if they repeat as Super Bowl
against Rep . Bill Cassidy , standing
int f d | < -- Stack pointer + -------------------- --+
k A lf ax Ob pg after my finals and would
Club 's decision makers is a very positive step .
Token and
Token ID290
Feature activation-4.17e-02

Pos logit contributions
netflix+2.266
voy+2.203
��+2.203
+2.192
��+2.165
Neg logit contributions
rador-1.746
 Fil-1.695
psey-1.651
 NEXT-1.633
ername-1.558
Token Policy
Token ID7820
Feature activation+2.54e-01

Pos logit contributions
 Fil+0.281
 NEXT+0.260
rador+0.255
psey+0.251
 Cle+0.243
Neg logit contributions
netflix-0.544
��-0.530
-0.529
voy-0.526
��-0.524
Token Prior
Token ID14481
Feature activation-4.60e-01

Pos logit contributions
voy+2.235
netflix+2.227
��+2.207
+2.166
��+2.142
Neg logit contributions
rador-2.647
psey-2.518
 Fil-2.511
 NEXT-2.460
ername-2.410
Tokenities
Token ID871
Feature activation-2.60e-01

Pos logit contributions
 Fil+1.933
 NEXT+1.761
rador+1.601
 Cle+1.528
psey+1.508
Neg logit contributions
netflix-7.039
��-7.024
��-6.954
 Solitaire-6.808
-6.805
Token (
Token ID357
Feature activation+7.11e-01

Pos logit contributions
 Fil+2.047
 NEXT+1.953
rador+1.943
 Cle+1.825
psey+1.756
Neg logit contributions
netflix-3.040
��-2.971
��-2.930
-2.920
voy-2.861
TokenCB
Token ID23199
Feature activation+1.03e+00

Pos logit contributions
netflix+4.505
PROV+4.500
voy+4.499
��+4.492
+4.417
Neg logit contributions
rador-8.692
psey-8.518
 Fil-8.460
 NEXT-8.174
 Ake-8.161
TokenPP
Token ID10246
Feature activation+5.06e-01

Pos logit contributions
+0.788
netflix+0.469
PROV+0.352
voy+0.262
PI+0.230
Neg logit contributions
rador-18.067
psey-17.534
 Ake-17.218
 Fil-16.880
perty-16.842
Token)
Token ID8
Feature activation-3.31e-01

Pos logit contributions
voy+4.715
netflix+4.698
+4.685
��+4.672
��+4.665
Neg logit contributions
rador-4.725
 Fil-4.495
psey-4.484
 NEXT-4.336
ername-4.255
Token offered
Token ID4438
Feature activation-3.86e-01

Pos logit contributions
 Fil+2.632
rador+2.522
 NEXT+2.521
 Cle+2.365
psey+2.324
Neg logit contributions
netflix-3.789
��-3.697
-3.656
��-3.648
voy-3.560
Token further
Token ID2252
Feature activation-4.09e-01

Pos logit contributions
 Fil+2.818
rador+2.691
 NEXT+2.670
 Cle+2.571
 Ake+2.490
Neg logit contributions
netflix-4.616
-4.523
voy-4.488
��-4.477
��-4.434
Token analysis
Token ID3781
Feature activation-1.45e-01

Pos logit contributions
 Fil+3.057
rador+2.902
 NEXT+2.857
psey+2.753
 Cle+2.751
Neg logit contributions
netflix-4.833
-4.685
��-4.667
��-4.666
voy-4.549
Token Marine
Token ID11000
Feature activation-1.46e-01

Pos logit contributions
netflix+0.208
��+0.204
+0.201
voy+0.199
��+0.198
Neg logit contributions
 Fil-0.182
rador-0.174
 NEXT-0.173
psey-0.167
 Cle-0.164
Token and
Token ID290
Feature activation-2.79e-01

Pos logit contributions
 Fil+1.134
rador+1.062
 NEXT+1.048
 Ake+1.007
 Cle+0.996
Neg logit contributions
��-1.758
netflix-1.742
-1.726
��-1.691
voy-1.652
Token Atmospheric
Token ID41516
Feature activation-2.10e-01

Pos logit contributions
 Fil+2.411
 NEXT+2.241
 Cle+2.178
rador+2.133
psey+2.129
Neg logit contributions
netflix-3.128
��-3.088
-3.005
��-2.996
voy-2.933
Token Sciences
Token ID13473
Feature activation+2.37e-01

Pos logit contributions
 Fil+1.858
 NEXT+1.747
 Cle+1.722
rador+1.696
 Ake+1.644
Neg logit contributions
netflix-2.376
��-2.282
-2.235
��-2.218
voy-2.179
Token (
Token ID357
Feature activation+7.35e-01

Pos logit contributions
netflix+2.625
��+2.561
+2.559
voy+2.557
��+2.520
Neg logit contributions
rador-1.940
 Fil-1.919
 NEXT-1.840
psey-1.835
ername-1.731
TokenSo
Token ID2396
Feature activation+1.03e+00

Pos logit contributions
netflix+5.931
+5.719
voy+5.658
zzo+5.556
PROV+5.432
Neg logit contributions
rador-7.866
psey-7.724
 Fil-7.686
 NEXT-7.495
ername-7.371
TokenMAS
Token ID31180
Feature activation+5.51e-01

Pos logit contributions
zzo+10.668
netflix+10.658
asters+10.244
voy+10.183
+10.143
Neg logit contributions
rador-8.073
 Fil-7.532
psey-7.527
 NEXT-7.402
perty-7.098
Token).
Token ID737
Feature activation+2.57e-01

Pos logit contributions
netflix+5.385
+5.340
voy+5.313
��+5.233
zzo+5.155
Neg logit contributions
rador-5.003
 Fil-4.822
psey-4.730
 NEXT-4.668
perty-4.513
Token
Token ID198
Feature activation+1.08e-01

Pos logit contributions
netflix+2.819
+2.759
voy+2.750
��+2.739
��+2.702
Neg logit contributions
rador-2.140
 Fil-2.095
psey-2.033
 NEXT-2.016
outube-1.906
Token
Token ID198
Feature activation+2.15e-01

Pos logit contributions
netflix+1.245
��+1.219
+1.212
voy+1.211
��+1.200
Neg logit contributions
rador-0.832
 Fil-0.829
 NEXT-0.791
psey-0.780
ername-0.737
Token
Token ID447
Feature activation-2.18e-01

Pos logit contributions
netflix+2.172
��+2.120
voy+2.105
+2.102
��+2.084
Neg logit contributions
rador-1.968
 Fil-1.957
 NEXT-1.880
psey-1.862
ername-1.779
Token.
Token ID13
Feature activation-1.79e-01

Pos logit contributions
 Fil+0.133
rador+0.130
 NEXT+0.126
psey+0.119
 Ake+0.116
Neg logit contributions
netflix-0.214
��-0.214
��-0.210
-0.209
voy-0.207
Tokenx
Token ID87
Feature activation-1.65e-01

Pos logit contributions
 Fil+1.404
rador+1.382
 NEXT+1.319
yg+1.274
psey+1.245
Neg logit contributions
��-2.067
netflix-2.050
-2.014
��-1.997
 Solitaire-1.968
Token.
Token ID13
Feature activation+3.66e-02

Pos logit contributions
 Fil+1.305
rador+1.262
 NEXT+1.231
 Ake+1.157
yg+1.149
Neg logit contributions
��-1.906
netflix-1.905
-1.851
��-1.846
voy-1.818
Token108
Token ID15711
Feature activation+1.39e-01

Pos logit contributions
netflix+0.305
��+0.297
+0.294
voy+0.291
��+0.286
Neg logit contributions
 Fil-0.404
rador-0.403
 NEXT-0.390
psey-0.383
 Ake-0.373
Token 10
Token ID838
Feature activation+3.79e-01

Pos logit contributions
netflix+1.416
��+1.392
+1.377
voy+1.373
��+1.363
Neg logit contributions
 Fil-1.278
rador-1.259
 NEXT-1.237
psey-1.185
 Ake-1.150
Token46
Token ID3510
Feature activation+9.71e-01

Pos logit contributions
netflix+3.054
voy+2.969
+2.968
��+2.913
zzo+2.883
Neg logit contributions
rador-4.248
 Fil-4.167
psey-4.103
 NEXT-4.040
perty-3.902
Token358
Token ID31128
Feature activation+1.33e-01

Pos logit contributions
netflix+6.335
+6.245
voy+6.063
zzo+5.845
��+5.656
Neg logit contributions
rador-11.341
psey-11.083
 Fil-10.727
perty-10.669
 NEXT-10.539
Token x
Token ID2124
Feature activation+6.11e-02

Pos logit contributions
netflix+1.222
��+1.189
+1.181
voy+1.181
��+1.164
Neg logit contributions
rador-1.350
 Fil-1.346
 NEXT-1.300
psey-1.286
ername-1.232
Token.
Token ID13
Feature activation+1.87e-02

Pos logit contributions
netflix+0.709
��+0.700
+0.691
voy+0.688
��+0.688
Neg logit contributions
 Fil-0.471
rador-0.468
 NEXT-0.449
psey-0.436
ername-0.415
Tokenx
Token ID87
Feature activation+3.68e-02

Pos logit contributions
netflix+0.204
��+0.204
��+0.200
+0.200
voy+0.197
Neg logit contributions
 Fil-0.155
rador-0.154
 NEXT-0.148
psey-0.144
ername-0.139
Token.
Token ID13
Feature activation-5.20e-02

Pos logit contributions
netflix+0.432
��+0.429
+0.422
��+0.422
voy+0.419
Neg logit contributions
 Fil-0.280
rador-0.276
 NEXT-0.266
psey-0.255
ername-0.245
Token '
Token ID705
Feature activation+4.57e-01

Pos logit contributions
netflix+2.913
��+2.842
+2.832
voy+2.832
��+2.792
Neg logit contributions
rador-2.326
 Fil-2.299
 NEXT-2.206
psey-2.200
ername-2.083
Tokena
Token ID64
Feature activation+2.45e-01

Pos logit contributions
netflix+3.351
��+3.249
voy+3.238
+3.220
��+3.156
Neg logit contributions
rador-5.405
 Fil-5.280
psey-5.231
 NEXT-5.132
perty-4.985
Token,
Token ID837
Feature activation+4.25e-01

Pos logit contributions
netflix+2.712
��+2.646
voy+2.638
+2.638
��+2.603
Neg logit contributions
rador-2.008
 Fil-1.990
 NEXT-1.907
psey-1.896
ername-1.790
Token whereas
Token ID9472
Feature activation+3.67e-01

Pos logit contributions
netflix+4.631
��+4.516
+4.501
voy+4.487
zzo+4.428
Neg logit contributions
rador-3.542
 Fil-3.436
psey-3.346
 NEXT-3.290
ername-3.168
Token with
Token ID351
Feature activation+1.41e-01

Pos logit contributions
netflix+3.955
��+3.854
+3.846
voy+3.832
��+3.781
Neg logit contributions
rador-3.118
 Fil-3.062
psey-2.955
 NEXT-2.927
outube-2.787
Token Atomic
Token ID28976
Feature activation+9.65e-01

Pos logit contributions
netflix+1.511
��+1.472
voy+1.470
+1.466
��+1.445
Neg logit contributions
 Fil-1.220
rador-1.220
 NEXT-1.180
psey-1.151
 Ake-1.097
TokenPtr
Token ID46745
Feature activation+4.91e-01

Pos logit contributions
��+8.256
+8.196
netflix+8.151
zzo+7.934
��+7.915
Neg logit contributions
psey-8.938
rador-8.868
 Fil-8.612
 NEXT-8.475
perty-8.257
Token all
Token ID477
Feature activation+5.82e-02

Pos logit contributions
netflix+4.939
+4.880
��+4.862
voy+4.797
zzo+4.762
Neg logit contributions
rador-4.378
 Fil-4.284
psey-4.236
 NEXT-4.118
outube-3.951
Token bets
Token ID29222
Feature activation+2.39e-01

Pos logit contributions
netflix+0.650
��+0.630
voy+0.630
+0.628
��+0.622
Neg logit contributions
 Fil-0.483
rador-0.479
 NEXT-0.466
psey-0.450
 Ake-0.431
Token are
Token ID389
Feature activation+2.85e-01

Pos logit contributions
netflix+2.633
��+2.566
voy+2.562
+2.561
��+2.526
Neg logit contributions
rador-1.973
 Fil-1.956
 NEXT-1.877
psey-1.863
ername-1.760
Token off
Token ID572
Feature activation+3.99e-01

Pos logit contributions
netflix+3.533
��+3.476
+3.447
voy+3.445
��+3.417
Neg logit contributions
rador-1.952
 Fil-1.912
psey-1.818
 NEXT-1.816
ername-1.689
Token =
Token ID796
Feature activation-2.67e-01

Pos logit contributions
netflix+4.364
+4.295
��+4.285
voy+4.270
��+4.210
Neg logit contributions
rador-2.916
 Fil-2.862
psey-2.774
 NEXT-2.741
perty-2.596
Token c
Token ID269
Feature activation+3.73e-01

Pos logit contributions
 Fil+2.240
 NEXT+2.133
rador+2.095
psey+1.948
 Cle+1.944
Neg logit contributions
netflix-3.020
��-2.919
voy-2.892
��-2.867
-2.864
Tokenv
Token ID85
Feature activation-1.27e-01

Pos logit contributions
netflix+3.360
voy+3.348
��+3.258
+3.254
zzo+3.214
Neg logit contributions
rador-3.784
 Fil-3.720
 NEXT-3.637
psey-3.623
 Ake-3.474
Token2
Token ID17
Feature activation+2.56e-01

Pos logit contributions
 Fil+1.097
rador+1.090
 NEXT+1.038
psey+1.012
ername+0.987
Neg logit contributions
netflix-1.340
��-1.319
-1.312
��-1.305
voy-1.303
Token.
Token ID13
Feature activation+9.23e-02

Pos logit contributions
netflix+2.795
��+2.730
voy+2.725
+2.725
��+2.681
Neg logit contributions
rador-2.137
 Fil-2.112
 NEXT-2.029
psey-2.021
ername-1.907
Tokenget
Token ID1136
Feature activation+9.64e-01

Pos logit contributions
netflix+0.924
��+0.889
voy+0.885
+0.885
��+0.877
Neg logit contributions
 Fil-0.866
rador-0.864
 NEXT-0.826
psey-0.816
ername-0.780
TokenStruct
Token ID44909
Feature activation+2.59e-01

Pos logit contributions
+7.531
voy+7.436
netflix+7.396
��+7.335
zzo+7.252
Neg logit contributions
rador-9.776
 NEXT-9.560
psey-9.497
 Fil-9.465
perty-9.303
Tokenuring
Token ID870
Feature activation+5.23e-01

Pos logit contributions
netflix+2.924
��+2.853
voy+2.847
+2.846
��+2.807
Neg logit contributions
rador-2.068
 Fil-2.050
 NEXT-1.966
psey-1.951
ername-1.837
TokenElement
Token ID20180
Feature activation+4.05e-01

Pos logit contributions
netflix+4.826
+4.773
��+4.713
voy+4.691
zzo+4.599
Neg logit contributions
rador-5.137
 Fil-5.035
psey-4.966
 NEXT-4.930
perty-4.697
Token(
Token ID7
Feature activation+1.15e-01

Pos logit contributions
netflix+3.974
��+3.887
+3.877
voy+3.874
��+3.798
Neg logit contributions
rador-3.787
 Fil-3.743
 NEXT-3.628
psey-3.611
perty-3.422
Tokencv
Token ID33967
Feature activation+1.63e-01

Pos logit contributions
netflix+1.062
��+1.036
+1.025
voy+1.024
��+1.016
Neg logit contributions
rador-1.158
 Fil-1.155
 NEXT-1.112
psey-1.106
ername-1.058
Token functions
Token ID5499
Feature activation+1.58e-01

Pos logit contributions
 Fil+2.021
 NEXT+1.889
rador+1.860
 Ake+1.728
 Cle+1.724
Neg logit contributions
netflix-2.935
��-2.894
-2.864
��-2.859
voy-2.844
Token using
Token ID1262
Feature activation+1.17e-01

Pos logit contributions
netflix+1.760
voy+1.713
��+1.712
+1.709
��+1.682
Neg logit contributions
rador-1.289
 Fil-1.271
 NEXT-1.219
psey-1.216
ername-1.147
Token the
Token ID262
Feature activation+1.54e-01

Pos logit contributions
netflix+1.300
voy+1.267
+1.265
��+1.265
��+1.245
Neg logit contributions
rador-0.955
 Fil-0.941
psey-0.904
 NEXT-0.903
ername-0.850
Token archived
Token ID33962
Feature activation-7.01e-02

Pos logit contributions
netflix+1.555
��+1.512
+1.505
voy+1.504
��+1.482
Neg logit contributions
rador-1.423
 Fil-1.414
 NEXT-1.357
psey-1.349
outube-1.283
Token file
Token ID2393
Feature activation+4.43e-01

Pos logit contributions
 Fil+0.563
rador+0.544
 NEXT+0.522
psey+0.502
 Ake+0.488
Neg logit contributions
netflix-0.809
��-0.804
-0.790
��-0.787
voy-0.784
Token from
Token ID422
Feature activation+9.35e-01

Pos logit contributions
netflix+4.343
voy+4.273
+4.165
zzo+4.147
 Solitaire+4.125
Neg logit contributions
rador-4.109
psey-4.062
 Fil-3.886
perty-3.864
 NEXT-3.789
Token S
Token ID311
Feature activation+4.50e-01

Pos logit contributions
 Solitaire+6.366
+6.359
netflix+6.264
zzo+6.142
��+6.083
Neg logit contributions
rador-10.327
psey-9.872
redo-9.624
perty-9.333
arning-9.333
Token3
Token ID18
Feature activation+9.69e-02

Pos logit contributions
netflix+3.733
voy+3.575
zzo+3.541
+3.533
PROV+3.395
Neg logit contributions
rador-4.903
psey-4.750
 Fil-4.729
 NEXT-4.630
perty-4.606
Token bucket
Token ID19236
Feature activation+1.56e-01

Pos logit contributions
netflix+1.257
��+1.232
+1.228
voy+1.227
��+1.215
Neg logit contributions
rador-0.618
 Fil-0.617
 NEXT-0.581
psey-0.571
outube-0.530
Token.
Token ID13
Feature activation+1.42e-01

Pos logit contributions
netflix+1.731
voy+1.685
+1.682
��+1.678
zzo+1.651
Neg logit contributions
rador-1.274
 Fil-1.251
psey-1.209
 NEXT-1.201
ername-1.132
Token It
Token ID632
Feature activation+2.59e-01

Pos logit contributions
netflix+1.336
��+1.300
voy+1.293
+1.292
��+1.275
Neg logit contributions
rador-1.404
 Fil-1.400
 NEXT-1.350
psey-1.336
ername-1.278
Token of
Token ID286
Feature activation+9.08e-02

Pos logit contributions
netflix+2.680
voy+2.619
��+2.614
+2.608
��+2.563
Neg logit contributions
rador-1.879
 Fil-1.842
psey-1.769
 NEXT-1.768
outube-1.666
Token your
Token ID534
Feature activation+1.18e-01

Pos logit contributions
netflix+0.977
��+0.958
+0.954
voy+0.948
��+0.942
Neg logit contributions
 Fil-0.778
rador-0.773
 NEXT-0.746
psey-0.733
 Ake-0.695
Token very
Token ID845
Feature activation+9.54e-02

Pos logit contributions
netflix+1.255
��+1.224
+1.220
voy+1.209
��+1.205
Neg logit contributions
 Fil-1.025
rador-1.024
 NEXT-0.990
psey-0.970
 Ake-0.919
Token own
Token ID898
Feature activation-1.67e-02

Pos logit contributions
netflix+1.143
��+1.114
+1.111
voy+1.103
��+1.102
Neg logit contributions
rador-0.706
 Fil-0.704
 NEXT-0.680
psey-0.661
ername-0.621
Token starship
Token ID48744
Feature activation-4.75e-02

Pos logit contributions
 Fil+0.148
rador+0.146
 NEXT+0.143
psey+0.137
 Ake+0.133
Neg logit contributions
netflix-0.175
��-0.172
-0.170
��-0.170
voy-0.169
Token!
Token ID0
Feature activation+9.01e-01

Pos logit contributions
 Fil+0.382
rador+0.374
 NEXT+0.368
psey+0.351
 Ake+0.340
Neg logit contributions
netflix-0.541
��-0.529
��-0.525
-0.521
voy-0.515
Token Dec
Token ID4280
Feature activation-9.28e-02

Pos logit contributions
 Solitaire+8.270
voy+8.178
netflix+8.176
+8.063
zzo+7.849
Neg logit contributions
rador-8.395
psey-8.108
outube-7.551
perty-7.436
arning-7.391
Tokenorate
Token ID16262
Feature activation+8.25e-02

Pos logit contributions
rador+0.349
 Fil+0.346
 NEXT+0.320
psey+0.319
estine+0.281
Neg logit contributions
��-1.439
netflix-1.434
-1.429
��-1.423
voy-1.391
Token it
Token ID340
Feature activation+2.44e-01

Pos logit contributions
netflix+0.866
��+0.852
+0.844
voy+0.840
��+0.835
Neg logit contributions
 Fil-0.733
rador-0.722
 NEXT-0.703
psey-0.684
 Ake-0.655
Token,
Token ID11
Feature activation+3.67e-01

Pos logit contributions
netflix+2.680
voy+2.609
��+2.606
+2.602
��+2.560
Neg logit contributions
rador-2.033
 Fil-2.000
psey-1.920
 NEXT-1.917
outube-1.810
Token expand
Token ID4292
Feature activation+4.38e-01

Pos logit contributions
netflix+4.043
voy+3.942
��+3.938
+3.921
��+3.868
Neg logit contributions
rador-3.038
 Fil-2.975
psey-2.853
 NEXT-2.852
outube-2.702
Token Some
Token ID2773
Feature activation-9.90e-02

Pos logit contributions
 Fil+0.804
 NEXT+0.757
rador+0.746
 Cle+0.708
 Ake+0.707
Neg logit contributions
netflix-1.084
��-1.080
��-1.043
voy-1.038
-1.035
Token of
Token ID286
Feature activation-1.78e-01

Pos logit contributions
 Fil+0.831
rador+0.805
 NEXT+0.784
psey+0.769
 Cle+0.738
Neg logit contributions
netflix-1.099
��-1.070
-1.056
voy-1.052
��-1.051
Token the
Token ID262
Feature activation+1.57e-01

Pos logit contributions
 Fil+1.327
rador+1.260
 NEXT+1.231
psey+1.200
 Ake+1.179
Neg logit contributions
netflix-2.141
��-2.111
voy-2.072
-2.067
��-2.061
Token outer
Token ID12076
Feature activation+1.95e-01

Pos logit contributions
netflix+1.738
��+1.692
+1.678
voy+1.676
��+1.664
Neg logit contributions
 Fil-1.322
rador-1.313
 NEXT-1.262
psey-1.245
 Ake-1.175
Token land
Token ID1956
Feature activation+1.10e-01

Pos logit contributions
netflix+2.019
��+1.967
voy+1.962
+1.962
��+1.933
Neg logit contributions
rador-1.734
 Fil-1.718
 NEXT-1.654
psey-1.645
ername-1.561
Tokenm
Token ID76
Feature activation+8.96e-01

Pos logit contributions
netflix+1.190
��+1.160
+1.156
voy+1.155
��+1.140
Neg logit contributions
rador-0.929
 Fil-0.921
 NEXT-0.884
psey-0.878
ername-0.830
Tokenasses
Token ID13978
Feature activation-1.79e-01

Pos logit contributions
netflix+3.641
asters+3.527
voy+3.392
zzo+3.262
adr+3.007
Neg logit contributions
rador-12.482
 Fil-12.466
outube-12.309
 NEXT-12.230
psey-12.140
Token are
Token ID389
Feature activation-1.50e-01

Pos logit contributions
 Fil+1.353
rador+1.335
 NEXT+1.287
psey+1.260
estine+1.196
Neg logit contributions
netflix-2.104
��-2.083
��-2.047
-2.045
voy-2.033
Token so
Token ID523
Feature activation+1.21e-01

Pos logit contributions
 Fil+1.201
rador+1.166
 NEXT+1.128
psey+1.101
 Cle+1.055
Neg logit contributions
netflix-1.736
��-1.688
voy-1.672
-1.666
��-1.643
Token tiny
Token ID7009
Feature activation+2.07e-01

Pos logit contributions
netflix+1.459
��+1.426
voy+1.421
+1.418
��+1.402
Neg logit contributions
rador-0.890
 Fil-0.886
 NEXT-0.838
psey-0.834
outube-0.776
Token and
Token ID290
Feature activation-2.21e-02

Pos logit contributions
netflix+2.161
��+2.105
voy+2.097
+2.096
��+2.068
Neg logit contributions
rador-1.830
 Fil-1.818
 NEXT-1.747
psey-1.736
ername-1.646
Token via
Token ID2884
Feature activation-3.92e-02

Pos logit contributions
netflix+3.671
voy+3.608
+3.585
��+3.570
zzo+3.538
Neg logit contributions
rador-2.758
 Fil-2.696
psey-2.619
 NEXT-2.586
perty-2.460
Token remote
Token ID6569
Feature activation-1.89e-01

Pos logit contributions
 Fil+0.324
rador+0.315
 NEXT+0.306
psey+0.297
 Ake+0.285
Neg logit contributions
netflix-0.444
��-0.434
-0.428
voy-0.425
��-0.423
Token control
Token ID1630
Feature activation+9.58e-02

Pos logit contributions
 Fil+1.431
rador+1.336
 NEXT+1.310
psey+1.248
 Ake+1.228
Neg logit contributions
netflix-2.304
��-2.272
-2.230
��-2.220
voy-2.199
Token or
Token ID393
Feature activation+4.62e-03

Pos logit contributions
netflix+1.095
��+1.070
+1.063
voy+1.060
��+1.051
Neg logit contributions
rador-0.759
 Fil-0.759
 NEXT-0.723
psey-0.714
ername-0.676
Token rear
Token ID8286
Feature activation+3.78e-01

Pos logit contributions
netflix+0.053
��+0.052
+0.051
voy+0.051
��+0.051
Neg logit contributions
 Fil-0.037
rador-0.037
 NEXT-0.035
psey-0.034
 Ake-0.033
Token-
Token ID12
Feature activation+8.94e-01

Pos logit contributions
netflix+3.944
voy+3.918
+3.899
zzo+3.836
��+3.827
Neg logit contributions
rador-3.248
 Fil-3.150
psey-3.121
 NEXT-3.086
ername-2.960
Tokenmounted
Token ID29728
Feature activation-1.92e-01

Pos logit contributions
voy+7.793
zzo+7.479
+7.437
netflix+7.428
asters+7.024
Neg logit contributions
psey-8.505
rador-8.498
 NEXT-8.272
 Fil-8.270
ername-7.977
Token buttons
Token ID12163
Feature activation-2.50e-03

Pos logit contributions
 Fil+1.586
 NEXT+1.484
rador+1.476
 Ake+1.392
 Cle+1.376
Neg logit contributions
netflix-2.206
��-2.180
-2.122
��-2.119
voy-2.098
Token,
Token ID11
Feature activation+1.81e-02

Pos logit contributions
 Fil+0.019
rador+0.019
 NEXT+0.019
psey+0.018
ername+0.017
Neg logit contributions
netflix-0.029
��-0.028
-0.028
voy-0.028
��-0.028
Token and
Token ID290
Feature activation-8.49e-02

Pos logit contributions
netflix+0.214
��+0.211
+0.208
��+0.207
voy+0.206
Neg logit contributions
 Fil-0.139
rador-0.136
 NEXT-0.130
psey-0.128
 Ake-0.121
Token its
Token ID663
Feature activation+1.03e-01

Pos logit contributions
 Fil+0.694
rador+0.664
 NEXT+0.649
psey+0.620
 Ake+0.607
Neg logit contributions
netflix-0.981
��-0.960
-0.946
��-0.938
voy-0.934
Token applied
Token ID5625
Feature activation+5.43e-01

Pos logit contributions
netflix+2.215
��+2.155
voy+2.150
+2.148
��+2.121
Neg logit contributions
rador-1.706
 Fil-1.702
 NEXT-1.630
psey-1.613
ername-1.526
Token on
Token ID319
Feature activation+3.85e-01

Pos logit contributions
netflix+5.484
voy+5.373
��+5.356
zzo+5.345
+5.313
Neg logit contributions
rador-4.789
psey-4.584
 Fil-4.479
 NEXT-4.333
outube-4.285
Token Line
Token ID6910
Feature activation-7.88e-02

Pos logit contributions
netflix+4.086
+3.986
voy+3.985
��+3.973
��+3.926
Neg logit contributions
rador-3.293
 Fil-3.201
psey-3.128
 NEXT-3.102
ername-2.972
Token 26
Token ID2608
Feature activation+2.24e-02

Pos logit contributions
 Fil+0.688
 NEXT+0.656
rador+0.644
 Cle+0.608
 Ake+0.600
Neg logit contributions
netflix-0.865
��-0.847
-0.840
��-0.834
voy-0.828
Token to
Token ID284
Feature activation+4.87e-01

Pos logit contributions
netflix+0.265
��+0.256
voy+0.256
+0.255
��+0.252
Neg logit contributions
 Fil-0.172
rador-0.169
 NEXT-0.164
psey-0.159
outube-0.150
Token reduce
Token ID4646
Feature activation+8.94e-01

Pos logit contributions
netflix+5.294
��+5.201
+5.195
voy+5.170
��+5.141
Neg logit contributions
rador-3.957
 Fil-3.767
psey-3.753
 NEXT-3.671
ername-3.526
Token high
Token ID1029
Feature activation+3.29e-01

Pos logit contributions
 HDR+8.137
voy+7.994
 Solitaire+7.965
zzo+7.951
netflix+7.936
Neg logit contributions
rador-7.964
psey-7.548
ername-7.442
perty-7.340
 NEXT-7.285
Token frequency
Token ID8373
Feature activation+3.38e-01

Pos logit contributions
netflix+3.340
voy+3.268
+3.262
��+3.246
��+3.197
Neg logit contributions
rador-2.949
 Fil-2.878
psey-2.806
 NEXT-2.805
ername-2.690
Token noise
Token ID7838
Feature activation+4.45e-02

Pos logit contributions
netflix+3.591
voy+3.507
+3.498
��+3.490
zzo+3.436
Neg logit contributions
rador-2.896
 Fil-2.798
psey-2.744
 NEXT-2.714
ername-2.606
Token.
Token ID13
Feature activation+1.23e-01

Pos logit contributions
netflix+0.514
��+0.505
voy+0.501
+0.499
��+0.496
Neg logit contributions
 Fil-0.351
rador-0.343
 NEXT-0.329
psey-0.324
 Ake-0.305
Token We
Token ID775
Feature activation+1.23e-01

Pos logit contributions
netflix+1.300
��+1.265
voy+1.260
+1.256
��+1.238
Neg logit contributions
 Fil-1.109
rador-1.084
 NEXT-1.054
psey-1.013
 Ake-0.980
Token said
Token ID531
Feature activation+5.02e-01

Pos logit contributions
netflix+3.209
voy+3.124
+3.074
zzo+3.073
��+3.023
Neg logit contributions
rador-3.318
 Fil-3.173
psey-3.158
 NEXT-3.091
perty-3.056
Token she
Token ID673
Feature activation+2.31e-01

Pos logit contributions
netflix+4.699
voy+4.567
zzo+4.448
+4.446
 Solitaire+4.372
Neg logit contributions
rador-4.962
psey-4.705
 Fil-4.605
perty-4.561
 NEXT-4.558
Token risked
Token ID42179
Feature activation-7.67e-02

Pos logit contributions
netflix+2.186
��+2.129
+2.121
voy+2.117
��+2.090
Neg logit contributions
rador-2.269
 Fil-2.261
 NEXT-2.181
psey-2.162
ername-2.061
Token severe
Token ID6049
Feature activation+3.30e-01

Pos logit contributions
 Fil+0.467
rador+0.455
 NEXT+0.430
psey+0.414
 Ake+0.392
Neg logit contributions
netflix-1.034
��-1.025
-1.016
��-1.008
zzo-0.990
Token infection
Token ID10280
Feature activation+1.17e-01

Pos logit contributions
netflix+3.327
voy+3.251
+3.176
��+3.159
zzo+3.126
Neg logit contributions
rador-3.024
 Fil-2.922
psey-2.888
 NEXT-2.849
perty-2.776
Token and
Token ID290
Feature activation+8.81e-01

Pos logit contributions
netflix+1.332
��+1.304
+1.299
voy+1.297
��+1.283
Neg logit contributions
rador-0.928
 Fil-0.923
 NEXT-0.882
psey-0.874
ername-0.824
Token scar
Token ID10153
Feature activation-1.42e-01

Pos logit contributions
netflix+5.714
voy+5.127
zzo+4.823
 Solitaire+4.786
 HDR+4.772
Neg logit contributions
perty-10.547
rador-10.460
psey-10.281
 Fil-10.222
ername-10.219
Tokenring
Token ID1806
Feature activation+2.46e-01

Pos logit contributions
 Fil+1.074
rador+1.058
 NEXT+0.989
psey+0.967
arning+0.962
Neg logit contributions
��-1.681
netflix-1.660
-1.656
��-1.653
zzo-1.609
Token if
Token ID611
Feature activation+2.58e-01

Pos logit contributions
netflix+2.798
voy+2.729
+2.711
��+2.697
��+2.671
Neg logit contributions
rador-1.956
 Fil-1.902
psey-1.853
 NEXT-1.834
ername-1.732
Token she
Token ID673
Feature activation+3.96e-01

Pos logit contributions
netflix+2.654
voy+2.571
+2.537
��+2.531
zzo+2.517
Neg logit contributions
rador-2.350
 Fil-2.269
psey-2.228
 NEXT-2.208
perty-2.125
Token didn
Token ID1422
Feature activation+7.48e-02

Pos logit contributions
netflix+3.693
voy+3.617
zzo+3.538
��+3.460
+3.450
Neg logit contributions
rador-3.900
psey-3.793
 Fil-3.752
 NEXT-3.657
perty-3.612
Tokenx
Token ID87
Feature activation-9.90e-02

Pos logit contributions
 Fil+1.961
rador+1.915
 NEXT+1.869
psey+1.811
 Cle+1.762
Neg logit contributions
netflix-2.087
��-2.013
-2.004
��-2.002
voy-1.935
Tokenic
Token ID291
Feature activation+3.12e-01

Pos logit contributions
 Fil+1.063
rador+1.034
 NEXT+1.020
psey+0.993
 Ake+0.966
Neg logit contributions
netflix-0.863
-0.859
��-0.850
��-0.837
voy-0.820
Token Karen
Token ID18678
Feature activation+1.01e-01

Pos logit contributions
netflix+3.267
voy+3.187
��+3.163
+3.156
zzo+3.114
Neg logit contributions
rador-2.739
 Fil-2.658
psey-2.609
 NEXT-2.580
ername-2.456
Token Carpenter
Token ID32659
Feature activation+8.94e-02

Pos logit contributions
netflix+1.094
��+1.068
+1.064
voy+1.059
��+1.052
Neg logit contributions
 Fil-0.865
rador-0.863
 NEXT-0.826
psey-0.815
 Ake-0.773
Token 2008
Token ID3648
Feature activation+1.15e-01

Pos logit contributions
netflix+0.956
��+0.932
+0.929
voy+0.929
��+0.916
Neg logit contributions
rador-0.770
 Fil-0.764
 NEXT-0.733
psey-0.728
ername-0.689
Token01
Token ID486
Feature activation+8.77e-01

Pos logit contributions
netflix+1.291
��+1.264
voy+1.257
+1.256
��+1.243
Neg logit contributions
rador-0.937
 Fil-0.930
 NEXT-0.892
psey-0.882
ername-0.835
Token15
Token ID1314
Feature activation-3.01e-01

Pos logit contributions
+5.434
voy+5.085
��+4.998
PROV+4.986
netflix+4.976
Neg logit contributions
psey-10.586
rador-10.561
 Fil-9.936
 NEXT-9.790
estine-9.723
Token O
Token ID440
Feature activation-3.85e-01

Pos logit contributions
 Fil+2.104
 NEXT+1.967
 Ake+1.813
 Cle+1.796
rador+1.746
Neg logit contributions
��-3.925
��-3.895
netflix-3.853
-3.756
voy-3.728
Token&
Token ID5
Feature activation+1.01e-01

Pos logit contributions
 Fil+2.301
 NEXT+2.138
rador+2.117
 Ake+1.935
yg+1.921
Neg logit contributions
��-5.188
��-5.186
netflix-5.074
voy-4.900
-4.880
TokenA
Token ID32
Feature activation+1.58e-01

Pos logit contributions
netflix+1.104
��+1.079
voy+1.071
+1.063
��+1.062
Neg logit contributions
 Fil-0.851
rador-0.839
 NEXT-0.812
psey-0.780
 Ake-0.761
Token-
Token ID12
Feature activation+2.48e-02

Pos logit contributions
netflix+1.695
��+1.653
voy+1.647
+1.647
��+1.625
Neg logit contributions
rador-1.350
 Fil-1.339
 NEXT-1.286
psey-1.277
ername-1.209
Token 15
Token ID1315
Feature activation+6.92e-01

Pos logit contributions
netflix+3.517
+3.399
voy+3.396
zzo+3.383
��+3.346
Neg logit contributions
rador-3.937
psey-3.755
 Fil-3.688
 NEXT-3.574
perty-3.544
Token Photo
Token ID5555
Feature activation+1.35e-01

Pos logit contributions
netflix+6.742
voy+6.579
+6.576
zzo+6.538
 Solitaire+6.448
Neg logit contributions
rador-6.329
psey-6.065
 Fil-5.623
perty-5.509
ername-5.500
Token:
Token ID25
Feature activation+2.75e-01

Pos logit contributions
netflix+1.539
��+1.502
voy+1.498
+1.498
��+1.478
Neg logit contributions
rador-1.070
 Fil-1.060
 NEXT-1.014
psey-1.008
ername-0.949
Token Le
Token ID1004
Feature activation+2.69e-01

Pos logit contributions
netflix+2.914
+2.835
��+2.832
voy+2.825
��+2.784
Neg logit contributions
rador-2.381
 Fil-2.316
psey-2.259
 NEXT-2.238
ername-2.135
Tokena
Token ID64
Feature activation+3.51e-01

Pos logit contributions
netflix+2.203
voy+2.157
��+2.135
+2.123
zzo+2.112
Neg logit contributions
rador-2.962
 Fil-2.910
 NEXT-2.847
psey-2.826
ername-2.724
Token Suzuki
Token ID35807
Feature activation+8.68e-01

Pos logit contributions
netflix+2.559
voy+2.514
+2.488
��+2.485
zzo+2.435
Neg logit contributions
rador-4.166
psey-4.045
 Fil-4.033
 NEXT-3.973
ername-3.869
Token,
Token ID11
Feature activation+4.21e-01

Pos logit contributions
zzo+7.327
netflix+7.247
PROV+7.171
��+7.140
+7.113
Neg logit contributions
rador-8.346
psey-8.315
perty-7.900
arning-7.732
 NEXT-7.690
Token The
Token ID383
Feature activation+1.73e-01

Pos logit contributions
netflix+4.139
voy+4.012
+4.011
��+3.996
zzo+3.957
Neg logit contributions
rador-3.888
 Fil-3.752
psey-3.746
 NEXT-3.680
ername-3.546
Token Chronicle
Token ID14160
Feature activation+4.33e-01

Pos logit contributions
netflix+0.842
voy+0.796
+0.795
��+0.786
zzo+0.759
Neg logit contributions
rador-2.493
 Fil-2.459
psey-2.423
 NEXT-2.407
ername-2.338
Token 2
Token ID362
Feature activation+4.30e-01

Pos logit contributions
netflix+3.681
��+3.560
voy+3.546
+3.521
zzo+3.489
Neg logit contributions
rador-4.593
psey-4.446
 Fil-4.433
 NEXT-4.349
perty-4.211
Token of
Token ID286
Feature activation+3.18e-01

Pos logit contributions
netflix+4.238
voy+4.174
+4.137
zzo+4.123
��+4.119
Neg logit contributions
rador-4.020
 Fil-3.812
psey-3.782
 NEXT-3.660
perty-3.631
Token with
Token ID351
Feature activation+1.99e-01

Pos logit contributions
 Fil+0.129
rador+0.127
 NEXT+0.122
psey+0.118
outube+0.113
Neg logit contributions
netflix-0.173
��-0.168
voy-0.168
-0.167
��-0.167
Token another
Token ID1194
Feature activation+4.16e-01

Pos logit contributions
netflix+2.240
��+2.187
+2.179
voy+2.176
��+2.153
Neg logit contributions
 Fil-1.607
rador-1.596
 NEXT-1.529
psey-1.508
outube-1.424
Token very
Token ID845
Feature activation+4.70e-01

Pos logit contributions
netflix+4.215
voy+4.098
+4.094
��+4.087
��+4.018
Neg logit contributions
rador-3.780
 Fil-3.687
psey-3.578
 NEXT-3.562
ername-3.406
Token fast
Token ID3049
Feature activation+5.78e-01

Pos logit contributions
netflix+4.369
voy+4.270
��+4.254
+4.245
zzo+4.174
Neg logit contributions
rador-4.594
 Fil-4.448
psey-4.375
 NEXT-4.309
ername-4.216
Token TN
Token ID29025
Feature activation+3.62e-01

Pos logit contributions
netflix+4.902
voy+4.732
+4.695
��+4.678
 Solitaire+4.651
Neg logit contributions
rador-6.112
 Fil-5.803
psey-5.797
 NEXT-5.670
ername-5.561
Token Film
Token ID13741
Feature activation+8.67e-01

Pos logit contributions
netflix+2.797
voy+2.660
+2.655
��+2.645
��+2.640
Neg logit contributions
rador-4.178
psey-3.996
 NEXT-3.924
 Fil-3.919
ername-3.895
Token panel
Token ID6103
Feature activation+3.35e-01

Pos logit contributions
netflix+6.926
 HDR+6.750
voy+6.666
 Solitaire+6.634
+6.598
Neg logit contributions
rador-9.070
psey-8.620
ername-8.378
 Fil-8.249
 NEXT-8.223
Token and
Token ID290
Feature activation+2.83e-01

Pos logit contributions
netflix+3.632
voy+3.536
+3.525
��+3.520
zzo+3.460
Neg logit contributions
rador-2.812
 Fil-2.759
psey-2.666
 NEXT-2.652
ername-2.531
Token 144
Token ID20224
Feature activation+2.64e-01

Pos logit contributions
netflix+3.046
��+2.968
+2.959
voy+2.958
��+2.919
Neg logit contributions
rador-2.429
 Fil-2.418
 NEXT-2.320
psey-2.299
ername-2.176
TokenHz
Token ID7399
Feature activation+5.70e-01

Pos logit contributions
netflix+2.163
voy+2.101
+2.092
��+2.089
��+2.043
Neg logit contributions
rador-2.920
 Fil-2.873
 NEXT-2.796
psey-2.789
ername-2.686
Token refresh
Token ID14976
Feature activation+3.29e-01

Pos logit contributions
netflix+6.000
voy+5.929
+5.884
��+5.855
 Solitaire+5.798
Neg logit contributions
rador-4.826
psey-4.490
 Fil-4.480
 NEXT-4.385
ername-4.254
Token.
Token ID13
Feature activation+1.11e-01

Pos logit contributions
 Fil+0.016
rador+0.016
 NEXT+0.015
psey+0.015
 Ake+0.014
Neg logit contributions
netflix-0.025
��-0.024
voy-0.024
-0.024
��-0.024
Token (
Token ID357
Feature activation+2.98e-02

Pos logit contributions
netflix+1.206
��+1.176
voy+1.171
+1.167
��+1.154
Neg logit contributions
 Fil-0.940
rador-0.934
 NEXT-0.901
psey-0.879
 Ake-0.844
TokenPhoto
Token ID6191
Feature activation-2.39e-01

Pos logit contributions
netflix+0.284
��+0.276
voy+0.274
��+0.271
+0.269
Neg logit contributions
 Fil-0.294
rador-0.290
 NEXT-0.282
psey-0.273
 Ake-0.270
Token:
Token ID25
Feature activation+1.72e-01

Pos logit contributions
 Fil+1.666
rador+1.612
 NEXT+1.580
psey+1.474
 Cle+1.452
Neg logit contributions
netflix-2.960
��-2.898
voy-2.885
-2.838
��-2.829
Token Kare
Token ID49396
Feature activation+8.58e-02

Pos logit contributions
netflix+1.851
��+1.805
voy+1.799
+1.797
��+1.773
Neg logit contributions
rador-1.476
 Fil-1.467
 NEXT-1.408
psey-1.396
ername-1.322
Tokenem
Token ID368
Feature activation+8.64e-01

Pos logit contributions
netflix+0.835
��+0.810
+0.805
voy+0.800
��+0.796
Neg logit contributions
 Fil-0.827
rador-0.827
 NEXT-0.795
psey-0.791
outube-0.757
Token El
Token ID2574
Feature activation+4.99e-01

Pos logit contributions
zzo+6.979
voy+6.788
 Mason+6.745
netflix+6.712
asters+6.671
Neg logit contributions
rador-9.420
terday-8.434
ername-8.433
estine-8.272
arning-8.117
Tokeng
Token ID70
Feature activation-1.26e-01

Pos logit contributions
netflix+5.478
voy+5.392
zzo+5.345
��+5.270
+5.185
Neg logit contributions
rador-4.089
psey-3.759
 NEXT-3.711
 Fil-3.702
perty-3.683
Tokenazz
Token ID8101
Feature activation-3.20e-02

Pos logit contributions
rador+0.880
 Fil+0.873
 NEXT+0.825
psey+0.818
ername+0.766
Neg logit contributions
netflix-1.545
��-1.518
-1.510
voy-1.509
��-1.493
Tokenar
Token ID283
Feature activation+8.26e-02

Pos logit contributions
rador+0.220
 Fil+0.218
psey+0.204
 NEXT+0.200
redo+0.191
Neg logit contributions
netflix-0.402
��-0.399
-0.394
��-0.392
voy-0.391
Token)
Token ID8
Feature activation+9.16e-02

Pos logit contributions
netflix+0.915
��+0.894
+0.890
voy+0.890
��+0.879
Neg logit contributions
rador-0.682
 Fil-0.679
 NEXT-0.647
psey-0.641
ername-0.606
Token,
Token ID11
Feature activation+1.09e-01

Pos logit contributions
netflix+1.716
��+1.673
voy+1.668
+1.668
��+1.646
Neg logit contributions
rador-1.257
 Fil-1.246
 NEXT-1.193
psey-1.185
ername-1.119
Token urged
Token ID11643
Feature activation-9.85e-02

Pos logit contributions
netflix+1.207
��+1.176
+1.173
voy+1.170
��+1.161
Neg logit contributions
 Fil-0.894
rador-0.894
 NEXT-0.856
psey-0.845
 Ake-0.796
Token kids
Token ID3988
Feature activation-7.07e-02

Pos logit contributions
 Fil+0.741
rador+0.721
 NEXT+0.692
psey+0.676
 Cle+0.645
Neg logit contributions
netflix-1.188
��-1.164
-1.157
��-1.154
voy-1.151
Token to
Token ID284
Feature activation-1.77e-01

Pos logit contributions
rador+0.541
 Fil+0.536
 NEXT+0.512
psey+0.508
ername+0.476
Neg logit contributions
netflix-0.823
��-0.804
voy-0.802
-0.801
��-0.792
Token get
Token ID651
Feature activation+1.26e-01

Pos logit contributions
 Fil+1.209
rador+1.204
 NEXT+1.143
psey+1.116
 Cle+1.043
Neg logit contributions
netflix-2.237
��-2.189
��-2.189
voy-2.174
-2.174
Token the
Token ID262
Feature activation+8.55e-01

Pos logit contributions
netflix+1.357
+1.324
voy+1.319
��+1.313
zzo+1.303
Neg logit contributions
rador-1.065
 Fil-1.045
psey-1.003
 NEXT-1.000
ername-0.952
Token best
Token ID1266
Feature activation+6.39e-01

Pos logit contributions
netflix+6.736
+6.696
voy+6.657
zzo+6.421
PROV+6.366
Neg logit contributions
rador-8.672
 Fil-8.199
psey-8.173
ername-8.035
outube-8.026
Token education
Token ID3707
Feature activation+3.14e-01

Pos logit contributions
voy+5.308
netflix+5.302
+5.265
zzo+5.208
��+5.052
Neg logit contributions
rador-6.508
 Fil-6.370
psey-6.198
 NEXT-6.100
ername-6.010
Token they
Token ID484
Feature activation+2.30e-01

Pos logit contributions
netflix+2.857
+2.755
voy+2.734
��+2.728
zzo+2.677
Neg logit contributions
rador-3.175
 Fil-3.090
psey-3.030
 NEXT-3.024
ername-2.941
Token can
Token ID460
Feature activation+1.48e-01

Pos logit contributions
netflix+2.346
��+2.280
voy+2.273
+2.271
��+2.232
Neg logit contributions
rador-2.082
 Fil-2.061
 NEXT-1.990
psey-1.984
ername-1.878
Token and
Token ID290
Feature activation+9.94e-02

Pos logit contributions
netflix+1.654
��+1.612
voy+1.609
+1.608
��+1.584
Neg logit contributions
rador-1.194
 Fil-1.181
 NEXT-1.131
psey-1.127
ername-1.063
Tokenqu
Token ID421
Feature activation+1.88e-01

Pos logit contributions
rador+0.170
 Fil+0.168
psey+0.162
 NEXT+0.161
ername+0.154
Neg logit contributions
netflix-0.218
��-0.215
-0.214
voy-0.210
��-0.209
Tokenered
Token ID1068
Feature activation+4.15e-02

Pos logit contributions
netflix+1.800
voy+1.745
��+1.743
+1.739
��+1.711
Neg logit contributions
rador-1.831
 Fil-1.814
 NEXT-1.753
psey-1.742
outube-1.659
Token flag
Token ID6056
Feature activation+1.88e-01

Pos logit contributions
netflix+0.625
��+0.616
voy+0.615
+0.613
��+0.609
Neg logit contributions
 Fil-0.175
rador-0.174
 NEXT-0.160
psey-0.155
ername-0.139
Token!
Token ID0
Feature activation+5.82e-01

Pos logit contributions
netflix+2.078
��+2.027
voy+2.023
+2.023
��+1.994
Neg logit contributions
rador-1.543
 Fil-1.527
 NEXT-1.464
psey-1.457
ername-1.375
Token Race
Token ID12588
Feature activation+4.36e-01

Pos logit contributions
netflix+5.505
+5.442
voy+5.425
 Solitaire+5.368
��+5.318
Neg logit contributions
rador-5.437
psey-5.280
 Fil-5.058
perty-4.911
ername-4.898
Token.
Token ID13
Feature activation+8.50e-01

Pos logit contributions
netflix+4.532
��+4.460
voy+4.437
+4.423
 Solitaire+4.380
Neg logit contributions
rador-3.786
 Fil-3.618
psey-3.566
 NEXT-3.468
outube-3.348
Token Shoot
Token ID17011
Feature activation+2.91e-01

Pos logit contributions
 Solitaire+8.163
netflix+8.155
+8.127
��+8.050
voy+8.029
Neg logit contributions
rador-7.345
psey-7.106
 Fil-6.792
perty-6.597
outube-6.573
Token.
Token ID13
Feature activation+6.74e-01

Pos logit contributions
netflix+3.134
��+3.064
+3.052
voy+3.050
��+3.018
Neg logit contributions
rador-2.437
 Fil-2.395
psey-2.321
 NEXT-2.289
outube-2.177
Token Win
Token ID7178
Feature activation+9.79e-02

Pos logit contributions
 Solitaire+6.438
netflix+6.388
+6.256
voy+6.242
zzo+6.228
Neg logit contributions
rador-6.254
psey-5.988
perty-5.628
 Fil-5.625
arning-5.597
Token.
Token ID13
Feature activation+6.24e-01

Pos logit contributions
netflix+1.108
��+1.079
voy+1.077
+1.076
��+1.061
Neg logit contributions
rador-0.785
 Fil-0.781
 NEXT-0.751
psey-0.739
ername-0.700
Token
Token ID198
Feature activation-5.15e-02

Pos logit contributions
netflix+5.940
 Solitaire+5.867
+5.818
voy+5.791
zzo+5.779
Neg logit contributions
rador-5.810
psey-5.557
 Fil-5.321
perty-5.229
arning-5.152
Token Fight
Token ID10480
Feature activation+2.96e-01

Pos logit contributions
netflix+1.854
��+1.806
voy+1.801
+1.800
��+1.775
Neg logit contributions
rador-1.538
 Fil-1.526
 NEXT-1.466
psey-1.457
ername-1.381
Token With
Token ID2080
Feature activation+2.56e-01

Pos logit contributions
netflix+3.080
voy+2.993
��+2.986
+2.978
��+2.940
Neg logit contributions
rador-2.631
 Fil-2.564
psey-2.496
 NEXT-2.480
ername-2.355
Token Each
Token ID5501
Feature activation+2.15e-01

Pos logit contributions
netflix+2.756
��+2.678
voy+2.677
+2.672
��+2.630
Neg logit contributions
rador-2.191
 Fil-2.153
psey-2.079
 NEXT-2.075
ername-1.959
Tokenother
Token ID847
Feature activation+3.45e-01

Pos logit contributions
netflix+1.768
voy+1.696
+1.693
��+1.677
zzo+1.659
Neg logit contributions
rador-2.374
 Fil-2.295
psey-2.257
 NEXT-2.232
outube-2.157
Token 200
Token ID939
Feature activation+3.91e-01

Pos logit contributions
netflix+3.445
voy+3.321
+3.301
��+3.270
zzo+3.256
Neg logit contributions
rador-3.182
 Fil-3.044
psey-3.025
 NEXT-2.957
ername-2.882
Token80
Token ID1795
Feature activation+8.49e-01

Pos logit contributions
netflix+2.175
+2.019
voy+2.016
��+1.994
zzo+1.983
Neg logit contributions
rador-5.370
 Fil-5.287
psey-5.170
 NEXT-5.114
perty-4.980
Token205
Token ID21261
Feature activation-2.70e-01

Pos logit contributions
+3.672
��+3.554
netflix+3.545
voy+3.507
PROV+3.340
Neg logit contributions
rador-12.357
psey-12.167
perty-11.981
 Fil-11.428
 NEXT-11.288
Token O
Token ID440
Feature activation-4.01e-01

Pos logit contributions
 Fil+1.856
 NEXT+1.761
 Ake+1.607
 Cle+1.581
rador+1.568
Neg logit contributions
��-3.527
��-3.472
netflix-3.465
voy-3.370
-3.358
Token&
Token ID5
Feature activation+6.17e-03

Pos logit contributions
 Fil+2.375
 NEXT+2.196
rador+2.161
yg+1.982
 Ake+1.979
Neg logit contributions
��-5.463
��-5.452
netflix-5.288
-5.126
 pestic-5.125
TokenA
Token ID32
Feature activation+1.41e-01

Pos logit contributions
netflix+0.070
��+0.069
voy+0.068
��+0.068
+0.067
Neg logit contributions
 Fil-0.050
rador-0.048
 NEXT-0.048
 Ake-0.045
psey-0.044
Token-
Token ID12
Feature activation-1.79e-01

Pos logit contributions
netflix+1.528
��+1.495
+1.484
voy+1.484
��+1.470
Neg logit contributions
rador-1.197
 Fil-1.193
 NEXT-1.143
psey-1.128
ername-1.071
Token,
Token ID11
Feature activation+1.77e-01

Pos logit contributions
netflix+0.791
��+0.770
voy+0.768
+0.767
��+0.757
Neg logit contributions
rador-0.650
 Fil-0.647
 NEXT-0.620
psey-0.616
ername-0.585
Token c
Token ID269
Feature activation+5.91e-01

Pos logit contributions
netflix+1.872
��+1.824
voy+1.815
+1.815
��+1.794
Neg logit contributions
 Fil-1.556
rador-1.554
 NEXT-1.490
psey-1.469
outube-1.395
Tokenv
Token ID85
Feature activation+1.43e-01

Pos logit contributions
voy+5.073
netflix+5.040
zzo+4.890
��+4.868
+4.842
Neg logit contributions
rador-6.111
 Fil-5.904
 NEXT-5.890
psey-5.774
 Ake-5.601
Token2
Token ID17
Feature activation+2.51e-01

Pos logit contributions
netflix+1.392
��+1.355
+1.349
voy+1.346
��+1.331
Neg logit contributions
rador-1.370
 Fil-1.364
 NEXT-1.308
psey-1.303
ername-1.245
Token.
Token ID13
Feature activation+4.18e-01

Pos logit contributions
netflix+2.821
��+2.753
+2.748
voy+2.748
��+2.708
Neg logit contributions
rador-2.021
 Fil-2.000
 NEXT-1.917
psey-1.907
ername-1.796
TokenCOLOR
Token ID46786
Feature activation+8.41e-01

Pos logit contributions
netflix+3.965
voy+3.878
+3.876
��+3.875
��+3.786
Neg logit contributions
rador-4.055
 Fil-3.992
psey-3.875
 NEXT-3.857
ername-3.690
Token_
Token ID62
Feature activation+7.94e-01

Pos logit contributions
+6.669
netflix+6.575
��+6.526
voy+6.489
��+6.317
Neg logit contributions
rador-8.858
psey-8.660
 Fil-8.313
 NEXT-8.164
 Ake-8.128
TokenB
Token ID33
Feature activation+5.77e-01

Pos logit contributions
netflix+6.845
PROV+6.841
��+6.789
+6.759
voy+6.646
Neg logit contributions
rador-8.088
psey-7.943
 Fil-7.591
perty-7.394
ername-7.280
TokenGR
Token ID10761
Feature activation+3.77e-01

Pos logit contributions
netflix+4.585
��+4.445
+4.361
��+4.343
voy+4.333
Neg logit contributions
rador-6.329
psey-6.211
 Fil-6.199
 NEXT-6.035
 Ake-5.920
Token2
Token ID17
Feature activation+3.39e-01

Pos logit contributions
netflix+3.675
��+3.599
+3.575
voy+3.572
zzo+3.532
Neg logit contributions
rador-3.584
 Fil-3.399
psey-3.382
 NEXT-3.333
perty-3.211
TokenG
Token ID38
Feature activation+4.58e-01

Pos logit contributions
netflix+3.396
��+3.330
+3.320
voy+3.301
��+3.262
Neg logit contributions
rador-3.130
 Fil-3.069
psey-2.982
 NEXT-2.960
ername-2.824
Token.
Token ID13
Feature activation+9.23e-02

Pos logit contributions
netflix+0.350
��+0.343
+0.342
voy+0.340
��+0.337
Neg logit contributions
rador-0.243
 Fil-0.242
 NEXT-0.230
psey-0.228
ername-0.215
Token Nate
Token ID23486
Feature activation-3.62e-01

Pos logit contributions
netflix+0.982
��+0.965
voy+0.952
+0.951
��+0.944
Neg logit contributions
 Fil-0.816
rador-0.799
 NEXT-0.776
psey-0.751
 Ake-0.735
Token Ter
Token ID3813
Feature activation-9.33e-02

Pos logit contributions
 Fil+3.456
 Cle+3.153
 NEXT+3.138
 Ake+3.088
rador+3.051
Neg logit contributions
netflix-3.756
��-3.688
��-3.603
-3.554
voy-3.469
Tokenning
Token ID768
Feature activation+1.32e-01

Pos logit contributions
rador+0.821
 Fil+0.817
psey+0.788
 NEXT+0.770
ername+0.739
Neg logit contributions
��-0.979
netflix-0.977
-0.967
��-0.952
voy-0.936
Token,
Token ID11
Feature activation+5.89e-02

Pos logit contributions
netflix+1.472
��+1.436
+1.431
voy+1.430
��+1.412
Neg logit contributions
rador-1.081
 Fil-1.072
 NEXT-1.026
psey-1.020
ername-0.963
Token AF
Token ID12341
Feature activation+8.40e-01

Pos logit contributions
netflix+0.648
��+0.629
+0.625
voy+0.620
��+0.619
Neg logit contributions
 Fil-0.502
rador-0.498
 NEXT-0.481
psey-0.474
 Ake-0.448
TokenRL
Token ID7836
Feature activation+1.10e-01

Pos logit contributions
netflix+4.751
+4.703
PROV+4.618
voy+4.604
zzo+4.485
Neg logit contributions
rador-10.228
 Fil-10.187
psey-10.163
ername-9.897
perty-9.861
Token rapid
Token ID5801
Feature activation-3.00e-02

Pos logit contributions
netflix+1.208
��+1.177
+1.166
voy+1.164
��+1.158
Neg logit contributions
rador-0.919
 Fil-0.916
 NEXT-0.878
psey-0.867
outube-0.819
Token reaction
Token ID6317
Feature activation-9.89e-02

Pos logit contributions
rador+0.267
 Fil+0.264
 NEXT+0.256
psey+0.252
ername+0.238
Neg logit contributions
netflix-0.314
��-0.306
-0.304
voy-0.303
��-0.300
Token projects
Token ID4493
Feature activation-2.12e-01

Pos logit contributions
rador+0.780
 Fil+0.780
 NEXT+0.755
psey+0.719
 Ake+0.696
Neg logit contributions
netflix-1.161
��-1.126
��-1.107
-1.103
voy-1.084
Token director
Token ID3437
Feature activation+9.27e-02

Pos logit contributions
 Fil+1.712
rador+1.705
 NEXT+1.646
psey+1.607
 Cle+1.539
Neg logit contributions
netflix-2.436
��-2.370
��-2.365
-2.341
voy-2.276
Token Name
Token ID6530
Feature activation-1.07e-01

Pos logit contributions
netflix+3.181
voy+3.165
��+3.099
+3.063
��+2.986
Neg logit contributions
rador-5.642
psey-5.309
perty-5.114
arning-4.937
 NEXT-4.934
Token Add
Token ID3060
Feature activation+2.04e-01

Pos logit contributions
 Fil+0.527
 NEXT+0.484
ername+0.421
 Cle+0.414
 Ake+0.405
Neg logit contributions
netflix-1.631
-1.605
��-1.599
��-1.593
voy-1.551
Token a
Token ID257
Feature activation+1.96e-01

Pos logit contributions
netflix+1.978
voy+1.954
��+1.936
+1.934
��+1.901
Neg logit contributions
rador-1.941
 Fil-1.857
psey-1.846
 NEXT-1.818
perty-1.741
Token comment
Token ID2912
Feature activation-3.38e-02

Pos logit contributions
netflix+1.856
voy+1.804
��+1.803
+1.795
��+1.763
Neg logit contributions
rador-1.937
 Fil-1.886
 NEXT-1.830
psey-1.819
ername-1.745
Token here
Token ID994
Feature activation+3.15e-02

Pos logit contributions
rador+0.266
 Fil+0.253
psey+0.247
 NEXT+0.244
perty+0.237
Neg logit contributions
netflix-0.389
voy-0.377
-0.377
��-0.373
zzo-0.372
Token Ver
Token ID4643
Feature activation-1.06e+00

Pos logit contributions
netflix+0.412
voy+0.408
+0.404
��+0.403
��+0.401
Neg logit contributions
rador-0.198
 Fil-0.186
psey-0.178
 NEXT-0.176
perty-0.165
Tokenification
Token ID2649
Feature activation-5.35e-01

Pos logit contributions
 NEXT+2.453
 Fil+2.120
ername+1.871
 Ake+1.773
 Cle+1.620
Neg logit contributions
-16.867
 Ukrain-16.811
voy-16.767
��-16.719
 conflic-16.697
Token Send
Token ID16290
Feature activation-1.21e-03

Pos logit contributions
 Fil+1.439
 NEXT+1.055
 Cle+0.946
 Equal+0.802
rador+0.712
Neg logit contributions
-9.427
��-9.248
netflix-9.151
voy-9.042
��-9.020
Token Email
Token ID9570
Feature activation-1.22e-01

Pos logit contributions
 Fil+0.010
rador+0.010
 NEXT+0.010
psey+0.010
ername+0.009
Neg logit contributions
netflix-0.013
-0.013
��-0.013
voy-0.013
��-0.013
Token Cancel
Token ID27910
Feature activation+1.54e-01

Pos logit contributions
 Fil+0.705
 NEXT+0.673
rador+0.659
psey+0.606
 Cle+0.576
Neg logit contributions
netflix-1.684
-1.652
��-1.650
voy-1.637
��-1.629
Token
Token ID198
Feature activation-2.58e-01

Pos logit contributions
netflix+1.655
voy+1.620
��+1.615
+1.596
��+1.588
Neg logit contributions
rador-1.328
 Fil-1.248
psey-1.235
 NEXT-1.193
perty-1.161
Token debate
Token ID4384
Feature activation+5.20e-02

Pos logit contributions
netflix+0.284
��+0.273
��+0.273
+0.271
voy+0.271
Neg logit contributions
 Fil-0.226
 NEXT-0.221
rador-0.218
psey-0.209
 Cle-0.200
Token Bernie
Token ID10477
Feature activation-1.08e-01

Pos logit contributions
netflix+0.575
��+0.560
+0.559
voy+0.555
��+0.555
Neg logit contributions
 Fil-0.438
rador-0.428
 NEXT-0.420
psey-0.408
 Cle-0.389
Token:
Token ID25
Feature activation+1.92e-01

Pos logit contributions
 Fil+0.999
rador+0.977
 NEXT+0.970
psey+0.937
 Cle+0.900
Neg logit contributions
netflix-1.093
��-1.055
voy-1.054
-1.050
��-1.047
Token "
Token ID366
Feature activation+4.03e-02

Pos logit contributions
netflix+2.037
��+1.985
voy+1.979
+1.978
��+1.950
Neg logit contributions
rador-1.663
 Fil-1.649
 NEXT-1.584
psey-1.575
ername-1.492
Token it
Token ID340
Feature activation+4.34e-02

Pos logit contributions
netflix+0.419
��+0.410
��+0.409
voy+0.405
+0.405
Neg logit contributions
 Fil-0.360
rador-0.355
 NEXT-0.344
psey-0.329
ername-0.320
Token seems
Token ID2331
Feature activation-1.02e+00

Pos logit contributions
netflix+0.491
��+0.484
+0.479
voy+0.476
��+0.476
Neg logit contributions
 Fil-0.350
rador-0.347
 NEXT-0.331
psey-0.322
outube-0.309
Token inappropriate
Token ID15679
Feature activation-4.67e-02

Pos logit contributions
 Fil+7.101
 NEXT+6.458
 Cle+6.404
rador+6.092
 Ake+6.028
Neg logit contributions
netflix-10.980
-10.937
��-10.760
voy-10.752
��-10.727
Token that
Token ID326
Feature activation-7.62e-02

Pos logit contributions
 Fil+0.372
rador+0.364
 NEXT+0.354
psey+0.342
 Ake+0.328
Neg logit contributions
netflix-0.530
��-0.521
-0.519
��-0.515
voy-0.515
Token I
Token ID314
Feature activation+4.32e-02

Pos logit contributions
 Fil+0.632
rador+0.621
 NEXT+0.602
psey+0.585
outube+0.561
Neg logit contributions
netflix-0.840
��-0.825
-0.824
voy-0.815
��-0.813
Token would
Token ID561
Feature activation+1.50e-01

Pos logit contributions
netflix+0.426
��+0.415
+0.413
voy+0.412
��+0.407
Neg logit contributions
rador-0.407
 Fil-0.405
 NEXT-0.390
psey-0.387
outube-0.368
Token debate
Token ID4384
Feature activation+8.74e-02

Pos logit contributions
netflix+1.295
��+1.247
+1.241
voy+1.235
��+1.229
Neg logit contributions
rador-1.616
 Fil-1.610
 NEXT-1.564
psey-1.547
ername-1.485
Token but
Token ID475
Feature activation-3.39e-01

Pos logit contributions
 Fil+0.587
rador+0.573
 NEXT+0.546
psey+0.528
 Ake+0.500
Neg logit contributions
��-1.084
netflix-1.081
-1.065
��-1.063
voy-1.052
Token it
Token ID340
Feature activation-7.47e-01

Pos logit contributions
 Fil+2.755
 NEXT+2.556
rador+2.553
 Cle+2.355
estine+2.316
Neg logit contributions
��-3.939
��-3.862
netflix-3.776
-3.734
voy-3.646
Token is
Token ID318
Feature activation-8.14e-01

Pos logit contributions
 Fil+4.092
 NEXT+3.434
rador+3.406
 Cle+3.183
 Equal+2.963
Neg logit contributions
��-10.741
��-10.413
 pestic-10.144
-9.924
netflix-9.888
Tokencontaining
Token ID38301
Feature activation-6.48e-01

Pos logit contributions
 Fil+5.830
rador+5.176
 NEXT+5.165
yg+5.062
outube+4.993
Neg logit contributions
��-9.475
��-9.371
netflix-9.105
PROV-8.828
voy-8.744
Token everything
Token ID2279
Feature activation-4.40e-01

Pos logit contributions
 Fil+4.495
rador+4.326
 NEXT+4.283
yg+3.927
estine+3.909
Neg logit contributions
��-7.808
voy-7.363
netflix-7.354
��-7.294
-7.284
Token and
Token ID290
Feature activation-9.86e-01

Pos logit contributions
 Fil+2.740
 NEXT+2.550
rador+2.465
 Cle+2.277
 Ake+2.228
Neg logit contributions
��-5.881
��-5.807
-5.582
netflix-5.571
voy-5.466
Token beyond
Token ID3675
Feature activation-5.61e-01

Pos logit contributions
 Fil+3.161
 NEXT+3.144
yg+2.305
 fork+2.174
 ‎+2.058
Neg logit contributions
��-15.555
��-15.197
-14.677
netflix-14.574
voy-14.451
Token the
Token ID262
Feature activation-5.53e-01

Pos logit contributions
 Fil+3.293
 NEXT+3.030
rador+2.839
 Cle+2.649
 Ake+2.635
Neg logit contributions
��-7.714
��-7.594
netflix-7.398
 pestic-7.307
-7.212
Token universe
Token ID6881
Feature activation-3.06e-01

Pos logit contributions
 NEXT+4.591
 Fil+4.415
rador+4.303
redo+4.070
psey+4.042
Neg logit contributions
��-5.976
��-5.858
netflix-5.780
zzo-5.616
 Solitaire-5.560
Token of
Token ID286
Feature activation-2.21e-01

Pos logit contributions
 Fil+1.895
 NEXT+1.763
rador+1.702
 Cle+1.586
 Ake+1.550
Neg logit contributions
��-4.120
��-4.095
netflix-4.006
-3.926
voy-3.867
Token the
Token ID262
Feature activation-1.22e-01

Pos logit contributions
 Fil+1.577
rador+1.490
 NEXT+1.484
psey+1.378
 Ake+1.348
Neg logit contributions
��-2.731
netflix-2.694
��-2.691
-2.632
voy-2.603
Token at
Token ID379
Feature activation-2.22e-01

Pos logit contributions
 Fil+2.793
 NEXT+2.642
rador+2.467
 Cle+2.409
 fork+2.369
Neg logit contributions
-4.904
��-4.828
netflix-4.819
��-4.731
voy-4.680
Token medium
Token ID7090
Feature activation-2.76e-01

Pos logit contributions
 Fil+1.632
rador+1.604
 NEXT+1.527
psey+1.444
perty+1.393
Neg logit contributions
netflix-2.689
-2.641
voy-2.575
��-2.556
zzo-2.555
Token-
Token ID12
Feature activation-2.88e-01

Pos logit contributions
 Fil+2.416
rador+2.346
 NEXT+2.260
psey+2.223
 Cle+2.141
Neg logit contributions
netflix-2.908
-2.901
��-2.793
voy-2.775
��-2.759
Tokenhigh
Token ID8929
Feature activation-3.97e-01

Pos logit contributions
rador+2.369
 Fil+2.337
 NEXT+2.233
psey+2.218
redo+2.168
Neg logit contributions
-3.147
netflix-3.105
��-3.021
 Solitaire-2.984
��-2.969
Token heat
Token ID4894
Feature activation-1.83e-01

Pos logit contributions
 Fil+3.462
rador+3.323
 NEXT+3.243
psey+3.090
 Cle+3.070
Neg logit contributions
-4.170
netflix-4.156
��-3.980
��-3.965
voy-3.901
Token until
Token ID1566
Feature activation-9.83e-01

Pos logit contributions
 Fil+1.348
rador+1.286
 NEXT+1.278
psey+1.194
 Cle+1.174
Neg logit contributions
netflix-2.191
-2.166
��-2.162
��-2.121
voy-2.119
Token the
Token ID262
Feature activation-5.34e-01

Pos logit contributions
 Fil+6.358
 fork+6.055
rador+5.445
 Cle+5.426
redo+5.369
Neg logit contributions
-11.662
netflix-11.528
voy-11.267
��-11.188
zzo-10.807
Token onion
Token ID21670
Feature activation-7.01e-02

Pos logit contributions
 Fil+3.838
 NEXT+3.449
rador+3.432
 fork+3.309
 Cle+3.259
Neg logit contributions
-6.477
netflix-6.413
��-6.322
��-6.261
voy-6.177
Token is
Token ID318
Feature activation-8.16e-01

Pos logit contributions
 Fil+0.555
rador+0.550
 NEXT+0.528
psey+0.511
redo+0.492
Neg logit contributions
netflix-0.797
-0.787
��-0.783
voy-0.778
��-0.767
Token clear
Token ID1598
Feature activation-3.83e-01

Pos logit contributions
 Fil+5.355
 fork+5.280
rador+4.835
 Cle+4.691
redo+4.587
Neg logit contributions
netflix-9.713
-9.686
voy-9.684
��-9.257
 Solitaire-9.232
Token or
Token ID393
Feature activation-4.63e-01

Pos logit contributions
 Fil+2.678
rador+2.577
 NEXT+2.422
 Cle+2.319
psey+2.282
Neg logit contributions
netflix-4.637
��-4.635
-4.603
voy-4.569
��-4.534
Tokenb
Token ID65
Feature activation+1.37e-01

Pos logit contributions
netflix+0.716
��+0.699
+0.695
voy+0.691
��+0.686
Neg logit contributions
rador-0.663
 Fil-0.662
 NEXT-0.637
psey-0.628
ername-0.604
Token]
Token ID60
Feature activation+1.24e-01

Pos logit contributions
netflix+1.480
voy+1.442
+1.441
��+1.437
��+1.418
Neg logit contributions
rador-1.157
 Fil-1.145
 NEXT-1.100
psey-1.097
perty-1.033
TokenPersonal
Token ID30228
Feature activation+2.62e-01

Pos logit contributions
netflix+1.218
��+1.182
+1.176
voy+1.174
��+1.164
Neg logit contributions
 Fil-1.177
rador-1.172
 NEXT-1.134
psey-1.117
ername-1.073
Token best
Token ID1266
Feature activation+2.14e-01

Pos logit contributions
netflix+2.802
��+2.737
voy+2.735
+2.728
��+2.690
Neg logit contributions
rador-2.254
 Fil-2.206
 NEXT-2.127
psey-2.121
ername-2.012
Token:
Token ID25
Feature activation+1.70e-01

Pos logit contributions
netflix+2.374
��+2.316
+2.309
voy+2.309
��+2.278
Neg logit contributions
rador-1.752
 Fil-1.737
 NEXT-1.665
psey-1.653
ername-1.561
Token[/
Token ID13412
Feature activation-9.78e-01

Pos logit contributions
netflix+1.679
��+1.633
voy+1.628
+1.627
��+1.603
Neg logit contributions
rador-1.606
 Fil-1.597
 NEXT-1.538
psey-1.528
ername-1.455
Tokenb
Token ID65
Feature activation-3.41e-01

Pos logit contributions
bet+4.187
 Fil+4.128
bin+4.113
bending+4.112
rador+3.845
Neg logit contributions
��-13.606
��-13.231
-13.135
 pestic-13.092
 conflic-12.729
Token]
Token ID60
Feature activation-1.03e-02

Pos logit contributions
 Fil+2.413
rador+2.348
 NEXT+2.248
psey+2.159
ername+2.115
Neg logit contributions
netflix-4.115
��-4.057
-4.006
voy-3.985
��-3.936
Token 1
Token ID352
Feature activation+1.95e-01

Pos logit contributions
 Fil+0.092
rador+0.089
 NEXT+0.088
psey+0.086
 Ake+0.082
Neg logit contributions
netflix-0.108
-0.104
��-0.104
voy-0.104
��-0.103
Token:
Token ID25
Feature activation-1.03e-01

Pos logit contributions
netflix+2.107
+2.067
voy+2.059
��+2.058
��+2.033
Neg logit contributions
rador-1.636
 Fil-1.604
psey-1.548
 NEXT-1.540
ername-1.464
Token07
Token ID2998
Feature activation-1.28e-01

Pos logit contributions
 Fil+0.938
 NEXT+0.897
rador+0.885
 Cle+0.841
ername+0.825
Neg logit contributions
netflix-1.082
��-1.072
��-1.068
-1.039
voy-1.020
Token=
Token ID28
Feature activation-6.35e-01

Pos logit contributions
 Fil+1.155
 NEXT+0.995
rador+0.966
 Cle+0.862
ername+0.845
Neg logit contributions
��-3.814
��-3.792
netflix-3.710
-3.645
voy-3.630
TokenTrue
Token ID17821
Feature activation-3.04e-01

Pos logit contributions
 Fil+2.595
Fil+2.382
Sum+2.315
 NEXT+2.188
 Equal+2.139
Neg logit contributions
 pestic-9.218
��-9.218
��-9.109
netflix-9.007
 conflic-8.980
Token,
Token ID11
Feature activation-4.82e-01

Pos logit contributions
 Fil+2.039
rador+1.872
 NEXT+1.869
 Cle+1.746
ername+1.704
Neg logit contributions
��-3.888
��-3.884
netflix-3.785
-3.775
voy-3.699
Token help
Token ID1037
Feature activation-6.15e-01

Pos logit contributions
 Fil+3.845
 NEXT+3.632
rador+3.464
 Cle+3.411
psey+3.355
Neg logit contributions
��-5.540
��-5.531
netflix-5.330
-5.320
voy-5.254
Token="
Token ID2625
Feature activation-7.70e-01

Pos logit contributions
 Fil+3.167
 NEXT+2.777
rador+2.640
Fil+2.526
 Equal+2.472
Neg logit contributions
��-8.718
��-8.605
 pestic-8.473
netflix-8.267
 Solitaire-8.101
Tokenpath
Token ID6978
Feature activation-9.77e-01

Pos logit contributions
 Fil+5.721
rador+5.701
bending+5.472
redo+5.414
ername+5.387
Neg logit contributions
��-8.456
��-8.333
 pestic-8.091
 Solitaire-7.987
-7.847
Token to
Token ID284
Feature activation-8.07e-01

Pos logit contributions
 Fil+4.131
 NEXT+3.397
rador+3.039
 Cle+3.035
 Ake+2.774
Neg logit contributions
��-14.104
��-14.044
 pestic-13.690
netflix-13.435
-13.382
Token images
Token ID4263
Feature activation-3.42e-01

Pos logit contributions
 Fil+4.614
 NEXT+4.042
rador+3.748
 Dat+3.739
estine+3.507
Neg logit contributions
��-10.800
��-10.794
netflix-10.468
-10.310
voy-10.227
Token directory
Token ID8619
Feature activation-7.25e-01

Pos logit contributions
 Fil+3.213
rador+3.081
 NEXT+3.025
psey+2.962
 fork+2.924
Neg logit contributions
��-3.404
netflix-3.365
��-3.335
-3.265
voy-3.204
Token")
Token ID4943
Feature activation-3.58e-01

Pos logit contributions
 Fil+2.353
 NEXT+2.010
rador+1.747
 Cle+1.738
ername+1.708
Neg logit contributions
��-11.281
��-11.148
 pestic-10.889
-10.770
���-10.546
Token args
Token ID26498
Feature activation-4.09e-01

Pos logit contributions
 Fil+3.390
rador+3.214
 NEXT+3.160
psey+2.991
yg+2.955
Neg logit contributions
��-3.620
netflix-3.566
��-3.564
-3.493
voy-3.459
Token of
Token ID286
Feature activation-3.26e-01

Pos logit contributions
 Fil+2.619
 NEXT+2.331
rador+2.205
 Cle+2.171
 Ake+2.069
Neg logit contributions
��-6.663
��-6.628
netflix-6.354
 pestic-6.256
-6.227
Token the
Token ID262
Feature activation-5.37e-01

Pos logit contributions
 Fil+2.220
 NEXT+2.025
rador+2.002
 Cle+1.845
 Ake+1.845
Neg logit contributions
��-4.239
��-4.154
netflix-4.117
-4.020
voy-3.950
Token films
Token ID7328
Feature activation-5.24e-01

Pos logit contributions
 Fil+4.743
 NEXT+4.266
rador+4.127
Fil+4.113
outube+3.887
Neg logit contributions
��-5.832
��-5.781
-5.485
netflix-5.444
 Solitaire-5.329
Token.
Token ID13
Feature activation-5.24e-01

Pos logit contributions
 Fil+3.077
 NEXT+2.854
rador+2.656
 Cle+2.534
Fil+2.508
Neg logit contributions
��-7.090
��-7.059
 pestic-6.804
netflix-6.729
voy-6.632
TokenIt
Token ID1026
Feature activation-8.79e-01

Pos logit contributions
 Fil+4.338
rador+4.092
 NEXT+4.042
Fil+3.819
outube+3.783
Neg logit contributions
��-5.476
netflix-5.453
��-5.442
voy-5.378
zzo-5.336
Token is
Token ID318
Feature activation-9.52e-01

Pos logit contributions
 Fil+4.664
rador+4.103
 NEXT+3.866
 Cle+3.557
estine+3.524
Neg logit contributions
��-12.072
��-11.759
-11.246
 pestic-11.228
netflix-11.191
Token also
Token ID635
Feature activation-7.94e-01

Pos logit contributions
 Fil+6.077
 NEXT+5.771
rador+5.542
estine+5.244
redo+5.225
Neg logit contributions
��-10.861
��-10.688
netflix-10.412
voy-10.144
-10.128
Token interesting
Token ID3499
Feature activation-4.00e-01

Pos logit contributions
 Fil+3.214
rador+3.049
 NEXT+2.830
urious+2.807
yg+2.484
Neg logit contributions
��-11.438
��-11.125
netflix-11.119
-10.914
PROV-10.699
Token the
Token ID262
Feature activation-6.51e-01

Pos logit contributions
 Fil+2.673
rador+2.523
 NEXT+2.478
 Cle+2.290
psey+2.290
Neg logit contributions
��-5.078
netflix-4.972
-4.949
��-4.937
voy-4.825
Token Tob
Token ID13695
Feature activation-2.36e-01

Pos logit contributions
 Fil+5.571
 NEXT+5.282
rador+5.218
outube+4.977
psey+4.857
Neg logit contributions
��-6.679
��-6.500
netflix-6.261
-6.252
zzo-6.201
Tokenin
Token ID259
Feature activation-1.27e-01

Pos logit contributions
rador+1.909
 Fil+1.892
psey+1.732
 NEXT+1.731
outube+1.646
Neg logit contributions
��-2.757
netflix-2.688
��-2.654
-2.630
voy-2.612
Token
Token ID198
Feature activation+2.16e-01

Pos logit contributions
 Fil+0.185
rador+0.180
 NEXT+0.177
psey+0.169
 Ake+0.163
Neg logit contributions
netflix-0.277
��-0.273
voy-0.270
-0.269
��-0.267
Token
Token ID198
Feature activation+1.04e-01

Pos logit contributions
netflix+2.391
voy+2.338
+2.323
��+2.308
zzo+2.285
Neg logit contributions
rador-1.773
 Fil-1.724
psey-1.693
 NEXT-1.661
perty-1.591
TokenPF
Token ID42668
Feature activation-2.87e-01

Pos logit contributions
netflix+1.068
��+1.043
voy+1.034
��+1.031
+1.027
Neg logit contributions
 Fil-0.927
rador-0.925
 NEXT-0.888
psey-0.869
ername-0.839
Token Bentley
Token ID38881
Feature activation-2.95e-01

Pos logit contributions
 Fil+2.606
 NEXT+2.525
 Ake+2.450
rador+2.427
 Cle+2.359
Neg logit contributions
netflix-3.092
��-3.006
��-2.906
-2.871
voy-2.866
Token/
Token ID14
Feature activation-7.06e-01

Pos logit contributions
 Fil+2.352
 NEXT+2.176
rador+2.120
 Ake+2.047
 Cle+2.028
Neg logit contributions
��-3.503
netflix-3.478
��-3.457
-3.372
voy-3.278
TokenCivil
Token ID32610
Feature activation-9.44e-01

Pos logit contributions
 Fil+5.913
Fil+5.671
Sum+5.539
 NEXT+5.514
 Equal+5.332
Neg logit contributions
netflix-7.399
��-6.969
��-6.883
 pestic-6.849
zzo-6.792
Token Beat
Token ID12568
Feature activation-2.34e-01

Pos logit contributions
 Fil+6.442
 NEXT+5.471
 Cle+5.407
 Equal+5.386
 Ake+5.380
Neg logit contributions
��-12.007
-11.702
 pestic-11.526
netflix-11.522
��-11.463
Token
Token ID198
Feature activation-1.85e-02

Pos logit contributions
 Fil+1.727
rador+1.724
 NEXT+1.647
psey+1.593
 Ake+1.516
Neg logit contributions
netflix-2.783
��-2.727
voy-2.697
-2.693
��-2.667
Token
Token ID198
Feature activation-5.17e-02

Pos logit contributions
 Fil+0.134
rador+0.129
 NEXT+0.126
psey+0.117
 Ake+0.116
Neg logit contributions
netflix-0.225
��-0.224
��-0.222
-0.220
voy-0.218
TokenGood
Token ID10248
Feature activation-2.13e-01

Pos logit contributions
 Fil+0.445
rador+0.439
 NEXT+0.426
psey+0.404
ername+0.401
Neg logit contributions
netflix-0.543
��-0.533
��-0.531
voy-0.526
zzo-0.522
Tokenwin
Token ID5404
Feature activation-2.38e-01

Pos logit contributions
rador+1.585
 Fil+1.546
 NEXT+1.486
psey+1.475
redo+1.415
Neg logit contributions
netflix-2.534
��-2.497
-2.490
��-2.434
voy-2.397
Token [
Token ID685
Feature activation-1.72e-01

Pos logit contributions
netflix+3.320
��+3.244
voy+3.236
+3.229
��+3.174
Neg logit contributions
rador-3.238
 Fil-3.166
psey-3.067
 NEXT-3.054
ername-2.912
Tokenb
Token ID65
Feature activation-2.38e-01

Pos logit contributions
 Fil+1.359
rador+1.351
 NEXT+1.297
psey+1.266
outube+1.228
Neg logit contributions
netflix-1.934
��-1.901
-1.885
voy-1.858
��-1.857
Token]
Token ID60
Feature activation+1.29e-01

Pos logit contributions
 Fil+1.648
rador+1.576
 NEXT+1.555
ername+1.437
psey+1.414
Neg logit contributions
��-2.985
netflix-2.940
��-2.926
-2.887
voy-2.827
TokenConsole
Token ID47581
Feature activation+6.03e-03

Pos logit contributions
netflix+1.230
��+1.197
+1.191
voy+1.185
��+1.179
Neg logit contributions
 Fil-1.267
rador-1.262
 NEXT-1.217
psey-1.202
outube-1.156
Token:
Token ID25
Feature activation-6.45e-02

Pos logit contributions
netflix+0.069
��+0.067
+0.066
��+0.065
voy+0.065
Neg logit contributions
 Fil-0.050
rador-0.048
 NEXT-0.047
psey-0.046
 Cle-0.044
Token[/
Token ID13412
Feature activation-9.30e-01

Pos logit contributions
 Fil+0.586
rador+0.577
 NEXT+0.560
psey+0.550
outube+0.528
Neg logit contributions
netflix-0.660
��-0.644
voy-0.640
-0.638
��-0.634
Tokenb
Token ID65
Feature activation-3.37e-01

Pos logit contributions
bet+4.021
bending+3.982
 Fil+3.965
bin+3.959
redo+3.643
Neg logit contributions
��-13.238
��-12.889
 pestic-12.818
-12.688
 conflic-12.471
Token]
Token ID60
Feature activation-9.13e-02

Pos logit contributions
 Fil+2.398
rador+2.343
 NEXT+2.234
psey+2.159
outube+2.104
Neg logit contributions
netflix-4.053
��-4.001
-3.959
voy-3.932
��-3.885
Token Xbox
Token ID9445
Feature activation-6.75e-02

Pos logit contributions
 Fil+0.798
rador+0.768
 NEXT+0.763
psey+0.736
outube+0.714
Neg logit contributions
netflix-0.974
��-0.952
-0.947
voy-0.942
��-0.941
Token 360
Token ID11470
Feature activation+6.23e-02

Pos logit contributions
 Fil+0.666
rador+0.651
 NEXT+0.639
psey+0.628
outube+0.604
Neg logit contributions
netflix-0.646
voy-0.625
-0.622
��-0.622
zzo-0.613
Token (
Token ID357
Feature activation-1.18e-01

Pos logit contributions
netflix+0.640
��+0.620
+0.617
voy+0.616
��+0.612
Neg logit contributions
 Fil-0.568
rador-0.562
 NEXT-0.544
psey-0.536
outube-0.514
Token.
Token ID13
Feature activation+1.37e-02

Pos logit contributions
 Fil+2.305
 NEXT+2.166
rador+2.075
 Cle+1.963
 Ake+1.927
Neg logit contributions
��-4.870
��-4.831
netflix-4.760
-4.608
 pestic-4.572
Token You
Token ID921
Feature activation+1.44e-01

Pos logit contributions
netflix+0.119
��+0.116
+0.114
voy+0.114
��+0.114
Neg logit contributions
 Fil-0.149
rador-0.146
 NEXT-0.142
psey-0.139
ername-0.135
Token can
Token ID460
Feature activation-1.57e-01

Pos logit contributions
netflix+1.503
��+1.464
+1.459
voy+1.459
��+1.439
Neg logit contributions
rador-1.282
 Fil-1.273
 NEXT-1.223
psey-1.215
ername-1.153
Token read
Token ID1100
Feature activation-6.33e-02

Pos logit contributions
 Fil+1.072
rador+1.030
 NEXT+1.007
psey+0.971
 Cle+0.920
Neg logit contributions
netflix-1.976
��-1.945
-1.943
��-1.928
voy-1.901
Token our
Token ID674
Feature activation-3.79e-01

Pos logit contributions
 Fil+0.544
rador+0.531
 NEXT+0.520
psey+0.505
 Cle+0.483
Neg logit contributions
netflix-0.684
-0.674
��-0.669
��-0.666
voy-0.663
Token Privacy
Token ID16777
Feature activation-9.16e-01

Pos logit contributions
 Fil+2.269
 NEXT+2.080
psey+1.981
 Cle+1.928
rador+1.923
Neg logit contributions
netflix-5.189
��-5.085
��-5.066
-4.999
voy-4.995
Token Policy
Token ID7820
Feature activation-3.48e-01

Pos logit contributions
 Fil+4.055
 Cle+3.733
 NEXT+3.578
 Dat+2.888
 WATCH+2.718
Neg logit contributions
��-13.763
netflix-13.522
��-13.427
 pestic-13.179
 conflic-12.878
Token here
Token ID994
Feature activation-2.93e-01

Pos logit contributions
 NEXT+1.786
 Fil+1.785
rador+1.623
 Cle+1.539
outube+1.507
Neg logit contributions
netflix-4.840
-4.783
��-4.763
��-4.759
voy-4.658
Token.
Token ID13
Feature activation-1.42e-01

Pos logit contributions
 Fil+1.972
 NEXT+1.875
rador+1.812
 Ake+1.668
 Cle+1.665
Neg logit contributions
��-3.721
��-3.696
netflix-3.665
-3.628
voy-3.546
Token Sign
Token ID5865
Feature activation+1.25e-01

Pos logit contributions
 Fil+1.019
 NEXT+0.952
rador+0.939
psey+0.879
 Cle+0.856
Neg logit contributions
netflix-1.762
��-1.732
��-1.719
-1.713
voy-1.702
Token up
Token ID510
Feature activation+1.61e-01

Pos logit contributions
netflix+1.157
��+1.132
+1.124
voy+1.120
��+1.109
Neg logit contributions
rador-1.248
 Fil-1.246
 NEXT-1.201
psey-1.191
ername-1.138
Token--+
Token ID44785
Feature activation+2.86e-02

Pos logit contributions
 Fil+1.617
rador+1.591
 NEXT+1.567
 Ake+1.498
psey+1.495
Neg logit contributions
netflix-1.766
��-1.725
-1.681
voy-1.679
��-1.662
Token +
Token ID1343
Feature activation-1.67e-01

Pos logit contributions
netflix+0.287
��+0.283
+0.279
voy+0.278
��+0.278
Neg logit contributions
 Fil-0.267
rador-0.263
 NEXT-0.257
psey-0.247
 Ake-0.241
Token--------------------
Token ID19351
Feature activation-2.24e-01

Pos logit contributions
 Fil+1.634
rador+1.613
 NEXT+1.602
psey+1.521
yg+1.479
Neg logit contributions
netflix-1.575
��-1.543
��-1.542
-1.514
voy-1.504
Token--+
Token ID44785
Feature activation-4.78e-02

Pos logit contributions
 Fil+2.040
rador+1.998
 NEXT+1.972
 Ake+1.898
psey+1.868
Neg logit contributions
netflix-2.299
��-2.249
-2.185
voy-2.179
��-2.161
Token <
Token ID1279
Feature activation-2.16e-01

Pos logit contributions
rador+0.496
 Fil+0.496
 NEXT+0.480
psey+0.474
ername+0.455
Neg logit contributions
netflix-0.424
��-0.413
voy-0.412
-0.411
��-0.404
Token--
Token ID438
Feature activation-9.15e-01

Pos logit contributions
 Fil+1.632
 NEXT+1.547
rador+1.534
redo+1.414
ername+1.394
Neg logit contributions
��-2.565
��-2.561
netflix-2.550
-2.505
voy-2.438
Token pop
Token ID1461
Feature activation+7.28e-02

Pos logit contributions
 Fil+9.172
 NEXT+8.804
 Cle+8.229
rador+8.110
 Equal+8.082
Neg logit contributions
netflix-8.053
voy-7.932
��-7.815
-7.642
��-7.459
Token/
Token ID14
Feature activation-1.30e-01

Pos logit contributions
netflix+0.712
+0.690
��+0.689
voy+0.687
��+0.678
Neg logit contributions
rador-0.694
 Fil-0.686
psey-0.661
 NEXT-0.659
outube-0.630
Tokenpop
Token ID12924
Feature activation+2.72e-01

Pos logit contributions
rador+1.160
 Fil+1.143
 NEXT+1.108
psey+1.082
redo+1.061
Neg logit contributions
��-1.361
��-1.348
-1.328
netflix-1.321
voy-1.298
Token/
Token ID14
Feature activation+7.13e-02

Pos logit contributions
netflix+2.590
voy+2.527
+2.500
��+2.484
zzo+2.463
Neg logit contributions
rador-2.642
 Fil-2.593
psey-2.523
 NEXT-2.504
perty-2.418
Tokenpop
Token ID12924
Feature activation+2.82e-01

Pos logit contributions
netflix+0.688
��+0.676
+0.671
voy+0.667
��+0.664
Neg logit contributions
rador-0.686
 Fil-0.680
 NEXT-0.657
psey-0.652
ername-0.622
Token complex
Token ID3716
Feature activation-2.80e-01

Pos logit contributions
 Fil+0.628
rador+0.626
 NEXT+0.587
psey+0.569
 Ake+0.542
Neg logit contributions
netflix-1.045
��-1.023
-1.015
��-1.009
voy-1.008
Token Layer
Token ID34398
Feature activation+8.80e-02

Pos logit contributions
 Fil+2.431
 NEXT+2.255
rador+2.244
psey+2.127
 Dat+2.087
Neg logit contributions
��-3.114
netflix-3.022
voy-2.987
��-2.973
-2.949
Token 7
Token ID767
Feature activation-2.54e-01

Pos logit contributions
netflix+0.897
��+0.885
+0.871
voy+0.869
��+0.864
Neg logit contributions
 Fil-0.816
rador-0.800
 NEXT-0.781
psey-0.756
 Ake-0.728
Token attacks
Token ID3434
Feature activation+7.49e-02

Pos logit contributions
 Fil+2.047
rador+1.923
 NEXT+1.915
psey+1.798
 Cle+1.767
Neg logit contributions
netflix-2.905
��-2.892
voy-2.814
��-2.805
-2.782
Token (
Token ID357
Feature activation-2.14e-01

Pos logit contributions
netflix+0.864
��+0.845
voy+0.840
+0.834
��+0.833
Neg logit contributions
 Fil-0.590
rador-0.588
 NEXT-0.559
psey-0.548
ername-0.519
TokenSee
Token ID6214
Feature activation-9.07e-01

Pos logit contributions
 Fil+1.900
rador+1.862
 NEXT+1.778
psey+1.722
redo+1.670
Neg logit contributions
netflix-2.184
��-2.180
��-2.153
voy-2.104
 Solitaire-2.098
Token a
Token ID257
Feature activation-5.14e-01

Pos logit contributions
 Fil+6.331
 NEXT+5.810
 Dat+5.608
 Cle+5.380
rador+5.279
Neg logit contributions
��-10.332
��-10.259
netflix-10.218
voy-10.173
-10.088
Token recommended
Token ID7151
Feature activation-5.09e-01

Pos logit contributions
 Fil+4.147
 NEXT+3.900
rador+3.788
 Dat+3.709
 Cle+3.676
Neg logit contributions
��-5.770
��-5.634
netflix-5.624
voy-5.528
-5.490
Token D
Token ID360
Feature activation-1.13e-01

Pos logit contributions
 Fil+4.252
 NEXT+4.002
 Cle+3.773
 Dat+3.761
rador+3.727
Neg logit contributions
��-5.705
netflix-5.637
��-5.556
voy-5.425
-5.423
TokenDoS
Token ID46498
Feature activation-4.66e-01

Pos logit contributions
rador+1.025
 Fil+1.018
 NEXT+0.973
psey+0.967
outube+0.940
Neg logit contributions
��-1.181
netflix-1.149
��-1.131
-1.128
voy-1.122
Token Mit
Token ID11707
Feature activation-3.70e-02

Pos logit contributions
 Fil+2.563
 NEXT+2.144
rador+2.093
 Cle+2.047
 Ake+1.966
Neg logit contributions
��-6.631
netflix-6.569
��-6.420
voy-6.395
-6.307
Tokenart
Token ID433
Feature activation-1.97e-01

Pos logit contributions
 Fil+3.362
rador+3.232
 NEXT+3.090
redo+3.074
ername+3.024
Neg logit contributions
netflix-3.577
��-3.553
��-3.528
-3.519
voy-3.439
Tokenext
Token ID2302
Feature activation-3.92e-01

Pos logit contributions
 Fil+1.538
rador+1.501
 NEXT+1.396
psey+1.392
ername+1.354
Neg logit contributions
netflix-2.267
��-2.233
-2.222
��-2.209
voy-2.194
Token =
Token ID796
Feature activation-5.59e-01

Pos logit contributions
 Fil+2.427
rador+2.319
 NEXT+2.166
psey+2.070
ername+2.063
Neg logit contributions
netflix-5.049
��-5.048
voy-4.963
��-4.959
-4.909
Token ($
Token ID7198
Feature activation-2.68e-01

Pos logit contributions
 Fil+4.628
 NEXT+4.289
rador+4.050
 Equal+4.004
 Cle+3.956
Neg logit contributions
netflix-6.108
voy-6.067
��-6.009
��-6.004
-5.957
Tokentext
Token ID5239
Feature activation-1.37e-01

Pos logit contributions
 Fil+2.551
rador+2.504
 NEXT+2.379
ername+2.341
psey+2.329
Neg logit contributions
netflix-2.537
��-2.483
voy-2.461
��-2.456
-2.442
Token -
Token ID532
Feature activation-9.04e-01

Pos logit contributions
 Fil+1.161
rador+1.136
 NEXT+1.082
psey+1.060
ername+1.039
Neg logit contributions
netflix-1.495
��-1.449
voy-1.446
-1.442
��-1.439
Tokensplit
Token ID35312
Feature activation-1.74e-01

Pos logit contributions
 Fil+7.854
 Equal+6.936
 NEXT+6.823
rador+6.787
 Cle+6.777
Neg logit contributions
netflix-8.829
��-8.575
��-8.541
-8.391
voy-8.376
Token "\
Token ID37082
Feature activation-4.88e-02

Pos logit contributions
 Fil+1.480
rador+1.394
 NEXT+1.370
psey+1.324
ername+1.315
Neg logit contributions
netflix-1.925
��-1.885
-1.882
voy-1.882
��-1.852
Tokenr
Token ID81
Feature activation+1.40e-01

Pos logit contributions
 Fil+0.431
rador+0.423
 NEXT+0.409
psey+0.403
ername+0.390
Neg logit contributions
netflix-0.516
��-0.498
-0.498
voy-0.496
��-0.491
Token
Token ID198
Feature activation-9.62e-02

Pos logit contributions
netflix+1.612
��+1.577
voy+1.569
+1.569
��+1.553
Neg logit contributions
rador-1.092
 Fil-1.088
 NEXT-1.039
psey-1.029
ername-0.970
Token
Token ID198
Feature activation-1.19e-01

Pos logit contributions
 Fil+0.684
rador+0.662
 NEXT+0.642
psey+0.603
ername+0.587
Neg logit contributions
��-1.182
netflix-1.178
��-1.173
-1.156
voy-1.138
Token++)
Token ID29577
Feature activation-3.17e-01

Pos logit contributions
 Fil+2.645
 NEXT+2.553
rador+2.499
ername+2.397
psey+2.352
Neg logit contributions
netflix-3.695
-3.643
voy-3.626
��-3.576
��-3.576
Token {
Token ID1391
Feature activation-1.55e-01

Pos logit contributions
 Fil+2.396
 NEXT+2.228
rador+2.222
 Cle+2.052
psey+2.052
Neg logit contributions
-3.757
netflix-3.748
��-3.717
voy-3.709
��-3.671
Token if
Token ID611
Feature activation-6.25e-01

Pos logit contributions
 Fil+1.216
 NEXT+1.130
rador+1.123
psey+1.053
 Cle+1.040
Neg logit contributions
netflix-1.812
-1.807
��-1.798
voy-1.778
��-1.769
Token ($
Token ID7198
Feature activation-5.97e-02

Pos logit contributions
 Fil+3.484
 NEXT+3.160
rador+3.050
ername+2.826
 Equal+2.819
Neg logit contributions
-8.269
voy-8.187
��-8.185
��-8.168
netflix-8.162
Tokeni
Token ID72
Feature activation-3.86e-01

Pos logit contributions
 Fil+0.567
rador+0.561
 NEXT+0.540
psey+0.533
ername+0.522
Neg logit contributions
netflix-0.576
��-0.568
-0.565
��-0.563
voy-0.561
Token -
Token ID532
Feature activation-8.96e-01

Pos logit contributions
 Fil+3.009
 NEXT+2.846
rador+2.789
ername+2.644
 Cle+2.638
Neg logit contributions
netflix-4.384
-4.361
voy-4.359
��-4.351
��-4.265
Tokeneq
Token ID27363
Feature activation-1.31e-01

Pos logit contributions
 Fil+5.910
ername+5.580
rador+5.476
redo+5.463
 NEXT+5.452
Neg logit contributions
��-9.897
-9.862
��-9.684
voy-9.601
netflix-9.518
Token 1
Token ID352
Feature activation+2.09e-01

Pos logit contributions
 Fil+1.023
rador+0.978
 NEXT+0.967
psey+0.918
ername+0.886
Neg logit contributions
netflix-1.513
��-1.506
-1.495
voy-1.493
��-1.481
Token)
Token ID8
Feature activation-1.22e-01

Pos logit contributions
netflix+2.264
+2.194
��+2.194
voy+2.189
��+2.159
Neg logit contributions
rador-1.777
 Fil-1.753
psey-1.683
 NEXT-1.678
ername-1.589
Token {
Token ID1391
Feature activation-8.59e-02

Pos logit contributions
 Fil+0.975
rador+0.945
 NEXT+0.922
psey+0.885
 Cle+0.845
Neg logit contributions
netflix-1.396
-1.384
��-1.382
voy-1.379
��-1.366
Token $
Token ID720
Feature activation+3.60e-02

Pos logit contributions
 Fil+0.690
rador+0.655
 NEXT+0.648
psey+0.615
 Cle+0.594
Neg logit contributions
netflix-0.986
-0.977
��-0.976
voy-0.969
��-0.962
Token--+
Token ID44785
Feature activation+1.49e-01

Pos logit contributions
 Fil+2.119
rador+2.089
 NEXT+2.051
 Ake+1.965
psey+1.946
Neg logit contributions
netflix-2.431
��-2.374
-2.315
voy-2.309
��-2.283
Token +
Token ID1343
Feature activation-7.80e-02

Pos logit contributions
netflix+1.449
��+1.409
voy+1.405
+1.404
��+1.383
Neg logit contributions
rador-1.419
 Fil-1.409
 NEXT-1.359
psey-1.351
ername-1.287
Token--------------------
Token ID19351
Feature activation-1.57e-01

Pos logit contributions
 Fil+0.785
rador+0.780
 NEXT+0.766
psey+0.740
outube+0.714
Neg logit contributions
netflix-0.715
��-0.699
��-0.695
-0.689
voy-0.685
Token--+
Token ID44785
Feature activation+1.13e-01

Pos logit contributions
 Fil+1.470
rador+1.452
 NEXT+1.418
 Ake+1.361
psey+1.361
Neg logit contributions
netflix-1.579
��-1.543
-1.506
voy-1.502
��-1.486
Token <
Token ID1279
Feature activation-5.60e-02

Pos logit contributions
netflix+1.063
+1.025
��+1.020
voy+1.016
��+1.006
Neg logit contributions
rador-1.126
 Fil-1.101
psey-1.078
 NEXT-1.060
outube-1.026
Token--
Token ID438
Feature activation-8.94e-01

Pos logit contributions
 Fil+0.462
rador+0.451
 NEXT+0.441
psey+0.420
redo+0.409
Neg logit contributions
netflix-0.622
��-0.618
��-0.612
-0.609
voy-0.600
Token Start
Token ID7253
Feature activation-2.21e-01

Pos logit contributions
 Fil+7.899
 NEXT+7.399
 Cle+6.827
 Dat+6.764
 Equal+6.729
Neg logit contributions
netflix-9.171
voy-8.996
��-8.827
-8.792
��-8.701
Token of
Token ID286
Feature activation-3.96e-01

Pos logit contributions
 Fil+1.458
rador+1.407
 NEXT+1.372
psey+1.260
 Cle+1.242
Neg logit contributions
��-2.840
netflix-2.825
��-2.798
-2.773
voy-2.754
Token read
Token ID1100
Feature activation+1.60e-01

Pos logit contributions
 Fil+2.736
 NEXT+2.696
rador+2.685
psey+2.460
 fork+2.397
Neg logit contributions
netflix-4.895
��-4.856
voy-4.843
��-4.765
-4.756
Token()
Token ID3419
Feature activation+1.01e-01

Pos logit contributions
netflix+1.387
��+1.344
+1.338
voy+1.338
��+1.316
Neg logit contributions
rador-1.696
 Fil-1.684
 NEXT-1.631
psey-1.621
ername-1.553
Token'
Token ID6
Feature activation+3.10e-01

Pos logit contributions
netflix+1.072
��+1.047
+1.042
voy+1.040
��+1.030
Neg logit contributions
rador-0.882
 Fil-0.880
 NEXT-0.845
psey-0.834
ername-0.792
Tokenop
Token ID404
Feature activation-8.07e-02

Pos logit contributions
netflix+0.032
��+0.032
+0.031
��+0.031
voy+0.031
Neg logit contributions
rador-0.031
 Fil-0.031
 NEXT-0.029
psey-0.029
ername-0.028
Token said
Token ID531
Feature activation-3.86e-02

Pos logit contributions
 Fil+0.652
rador+0.642
 NEXT+0.618
psey+0.602
 Cle+0.579
Neg logit contributions
netflix-0.913
-0.888
��-0.887
��-0.877
voy-0.874
Token.
Token ID13
Feature activation-1.03e-01

Pos logit contributions
 Fil+0.288
rador+0.286
 NEXT+0.275
psey+0.267
outube+0.253
Neg logit contributions
netflix-0.461
��-0.449
-0.446
voy-0.444
��-0.442
Token
Token ID198
Feature activation+2.51e-02

Pos logit contributions
 Fil+0.836
rador+0.836
 NEXT+0.797
psey+0.786
ername+0.742
Neg logit contributions
netflix-1.157
��-1.136
voy-1.125
-1.125
��-1.112
Token
Token ID198
Feature activation+1.34e-01

Pos logit contributions
netflix+0.299
��+0.297
��+0.292
+0.292
voy+0.290
Neg logit contributions
 Fil-0.186
rador-0.183
 NEXT-0.176
psey-0.168
 Ake-0.162
TokenF
Token ID37
Feature activation-8.90e-01

Pos logit contributions
netflix+1.338
��+1.311
��+1.287
voy+1.287
+1.286
Neg logit contributions
rador-1.249
 Fil-1.245
 NEXT-1.193
psey-1.171
ername-1.127
Tokenul
Token ID377
Feature activation-6.10e-02

Pos logit contributions
rador+5.587
 Fil+5.439
psey+5.303
ife+5.171
urious+5.026
Neg logit contributions
��-11.278
��-11.157
netflix-10.968
-10.961
 Solitaire-10.732
Tokenop
Token ID404
Feature activation-3.62e-02

Pos logit contributions
rador+0.562
 Fil+0.560
 NEXT+0.535
psey+0.534
redo+0.505
Neg logit contributions
��-0.614
netflix-0.611
-0.603
��-0.599
voy-0.589
Token had
Token ID550
Feature activation+9.81e-02

Pos logit contributions
 Fil+0.291
rador+0.285
 NEXT+0.275
psey+0.266
 Cle+0.257
Neg logit contributions
netflix-0.415
��-0.403
-0.402
��-0.401
voy-0.395
Token hoped
Token ID10719
Feature activation+1.54e-02

Pos logit contributions
netflix+1.052
��+1.022
voy+1.018
+1.017
��+1.012
Neg logit contributions
 Fil-0.853
rador-0.852
 NEXT-0.812
psey-0.801
 Cle-0.759
Token to
Token ID284
Feature activation-3.16e-01

Pos logit contributions
netflix+0.180
��+0.177
+0.175
voy+0.175
��+0.173
Neg logit contributions
 Fil-0.119
rador-0.118
 NEXT-0.112
psey-0.111
 Ake-0.104
Token to
Token ID284
Feature activation-3.81e-01

Pos logit contributions
 Fil+3.514
 NEXT+3.217
rador+3.181
psey+3.021
 Cle+2.966
Neg logit contributions
��-5.721
voy-5.693
netflix-5.685
-5.613
��-5.611
Token Christopher
Token ID12803
Feature activation-3.37e-01

Pos logit contributions
 Fil+3.040
 NEXT+2.784
rador+2.769
psey+2.652
 Cle+2.615
Neg logit contributions
netflix-4.319
��-4.298
voy-4.254
-4.252
��-4.247
Token Nolan
Token ID27788
Feature activation-2.78e-01

Pos logit contributions
 Fil+2.679
 NEXT+2.336
 Cle+2.329
rador+2.278
psey+2.223
Neg logit contributions
netflix-4.132
��-4.034
��-4.025
voy-3.871
-3.778
Token
Token ID447
Feature activation-2.44e-01

Pos logit contributions
 Fil+2.061
rador+1.904
 NEXT+1.866
psey+1.833
arning+1.733
Neg logit contributions
netflix-3.371
��-3.251
��-3.249
voy-3.200
-3.188
Token
Token ID247
Feature activation-2.79e-01

Pos logit contributions
rador+2.780
 Fil+2.757
 NEXT+2.628
psey+2.570
ername+2.569
Neg logit contributions
netflix-1.972
voy-1.872
-1.869
 Solitaire-1.814
zzo-1.735
Tokens
Token ID82
Feature activation-8.88e-01

Pos logit contributions
 Fil+2.072
rador+1.964
 NEXT+1.918
psey+1.891
arning+1.789
Neg logit contributions
��-3.342
netflix-3.327
��-3.293
-3.240
voy-3.146
Token Batman
Token ID9827
Feature activation-2.10e-01

Pos logit contributions
 Fil+6.899
 NEXT+6.345
rador+5.902
 Cle+5.824
psey+5.822
Neg logit contributions
netflix-9.775
��-9.441
��-9.408
-9.293
voy-8.979
Token Begins
Token ID48139
Feature activation-2.12e-02

Pos logit contributions
 Fil+1.781
 NEXT+1.697
rador+1.694
psey+1.633
 Cle+1.540
Neg logit contributions
netflix-2.276
��-2.270
-2.242
��-2.235
voy-2.209
Token.)
Token ID2014
Feature activation-9.98e-02

Pos logit contributions
 Fil+0.170
rador+0.166
 NEXT+0.162
psey+0.158
outube+0.148
Neg logit contributions
netflix-0.242
��-0.238
voy-0.235
-0.235
��-0.234
Token Ar
Token ID943
Feature activation-9.36e-02

Pos logit contributions
 Fil+0.902
 NEXT+0.843
rador+0.830
 Cle+0.784
psey+0.783
Neg logit contributions
netflix-1.069
��-1.069
��-1.046
voy-1.038
-1.034
Tokenon
Token ID261
Feature activation+1.72e-01

Pos logit contributions
rador+0.634
 Fil+0.625
 NEXT+0.590
psey+0.586
outube+0.548
Neg logit contributions
��-1.172
netflix-1.165
-1.163
��-1.151
voy-1.133
Token moments
Token ID7188
Feature activation+2.87e-01

Pos logit contributions
netflix+1.715
��+1.670
+1.664
voy+1.660
��+1.641
Neg logit contributions
rador-1.336
 Fil-1.332
 NEXT-1.275
psey-1.264
ername-1.193
Token of
Token ID286
Feature activation+2.99e-01

Pos logit contributions
netflix+3.152
voy+3.088
+3.066
��+3.059
��+3.022
Neg logit contributions
rador-2.355
 Fil-2.320
 NEXT-2.239
psey-2.229
ername-2.103
Token heightened
Token ID25556
Feature activation+2.29e-01

Pos logit contributions
netflix+3.221
voy+3.144
��+3.126
+3.125
��+3.080
Neg logit contributions
rador-2.521
 Fil-2.473
psey-2.387
 NEXT-2.387
ername-2.262
Token attention
Token ID3241
Feature activation+1.57e-01

Pos logit contributions
netflix+2.466
voy+2.403
��+2.399
+2.395
��+2.360
Neg logit contributions
rador-1.949
 Fil-1.927
 NEXT-1.852
psey-1.843
ername-1.746
Token like
Token ID588
Feature activation+2.10e-01

Pos logit contributions
netflix+1.715
��+1.672
voy+1.668
+1.667
��+1.644
Neg logit contributions
rador-1.307
 Fil-1.295
 NEXT-1.243
psey-1.235
ername-1.167
Token o
Token ID267
Feature activation-8.87e-01

Pos logit contributions
netflix+2.304
��+2.248
+2.240
voy+2.240
��+2.209
Neg logit contributions
rador-1.752
 Fil-1.739
 NEXT-1.667
psey-1.654
ername-1.563
Tokenases
Token ID1386
Feature activation+1.47e-01

Pos logit contributions
rador+7.213
 Fil+7.143
arning+6.873
yg+6.854
psey+6.764
Neg logit contributions
��-8.782
��-8.651
netflix-8.476
-8.348
 Solitaire-8.001
Token in
Token ID287
Feature activation+1.53e-01

Pos logit contributions
netflix+1.650
��+1.614
+1.604
voy+1.602
��+1.585
Neg logit contributions
rador-1.185
 Fil-1.180
 NEXT-1.128
psey-1.119
ername-1.056
Token a
Token ID257
Feature activation+3.59e-01

Pos logit contributions
netflix+1.649
��+1.611
+1.603
voy+1.601
��+1.580
Neg logit contributions
rador-1.311
 Fil-1.309
 NEXT-1.252
psey-1.239
ername-1.172
Token broad
Token ID3154
Feature activation+2.02e-01

Pos logit contributions
netflix+3.920
voy+3.805
��+3.793
+3.787
��+3.746
Neg logit contributions
rador-2.987
 Fil-2.911
psey-2.835
 NEXT-2.821
ername-2.696
Token desert
Token ID10326
Feature activation+1.54e-01

Pos logit contributions
netflix+2.178
��+2.123
voy+2.117
+2.117
��+2.087
Neg logit contributions
rador-1.726
 Fil-1.711
 NEXT-1.643
psey-1.633
ername-1.545
Token6
Token ID21
Feature activation-4.87e-01

Pos logit contributions
 Fil+0.217
rador+0.216
 NEXT+0.205
psey+0.205
outube+0.196
Neg logit contributions
netflix-0.258
��-0.251
-0.248
��-0.246
voy-0.245
Token {
Token ID1391
Feature activation+9.36e-02

Pos logit contributions
 Fil+3.277
 NEXT+3.068
rador+2.855
 Cle+2.787
 Ake+2.743
Neg logit contributions
��-6.357
��-6.135
netflix-6.125
-6.031
voy-5.903
Token(
Token ID7
Feature activation+3.45e-01

Pos logit contributions
netflix+1.011
��+0.994
+0.982
voy+0.982
��+0.979
Neg logit contributions
 Fil-0.792
rador-0.790
 NEXT-0.759
psey-0.743
ername-0.711
Token44
Token ID2598
Feature activation+1.44e-01

Pos logit contributions
netflix+2.937
+2.839
voy+2.830
��+2.825
��+2.766
Neg logit contributions
rador-3.710
 Fil-3.663
psey-3.560
 NEXT-3.549
perty-3.390
Tokens
Token ID82
Feature activation-1.50e-01

Pos logit contributions
netflix+1.508
+1.470
��+1.467
voy+1.465
��+1.442
Neg logit contributions
rador-1.261
 Fil-1.237
psey-1.192
 NEXT-1.190
ername-1.135
Token)}
Token ID38165
Feature activation-8.87e-01

Pos logit contributions
 Fil+1.211
rador+1.189
 NEXT+1.160
psey+1.078
 Ake+1.069
Neg logit contributions
netflix-1.678
��-1.670
��-1.665
-1.640
voy-1.624
Token 22
Token ID2534
Feature activation-1.74e-01

Pos logit contributions
 Fil+5.414
 NEXT+5.042
 Cle+4.856
 Ake+4.747
 Equal+4.699
Neg logit contributions
netflix-10.942
��-10.924
-10.793
voy-10.653
��-10.322
Token.
Token ID13
Feature activation+4.39e-02

Pos logit contributions
 Fil+1.217
 NEXT+1.139
rador+1.129
 Cle+1.043
 Ake+1.018
Neg logit contributions
��-2.227
netflix-2.194
��-2.191
-2.134
voy-2.111
Token R
Token ID371
Feature activation+4.86e-01

Pos logit contributions
netflix+0.492
��+0.479
voy+0.475
+0.475
��+0.470
Neg logit contributions
 Fil-0.369
rador-0.360
 NEXT-0.351
psey-0.337
 Ake-0.329
Tokenc
Token ID66
Feature activation-5.91e-03

Pos logit contributions
netflix+4.947
voy+4.906
+4.867
zzo+4.863
��+4.779
Neg logit contributions
rador-4.196
 Fil-4.141
 NEXT-4.051
psey-4.024
outube-3.775
Token3
Token ID18
Feature activation-2.90e-01

Pos logit contributions
 Fil+0.052
rador+0.052
 NEXT+0.049
psey+0.048
redo+0.047
Neg logit contributions
netflix-0.064
��-0.061
-0.061
voy-0.060
��-0.060
Token making
Token ID1642
Feature activation-4.67e-01

Pos logit contributions
 Fil+2.720
 NEXT+2.390
yg+2.284
rador+2.261
 Ake+2.174
Neg logit contributions
��-6.443
��-6.303
netflix-6.288
-6.023
zzo-5.930
Token of
Token ID286
Feature activation-3.26e-01

Pos logit contributions
 Fil+2.619
 NEXT+2.331
rador+2.205
 Cle+2.171
 Ake+2.069
Neg logit contributions
��-6.663
��-6.628
netflix-6.354
 pestic-6.256
-6.227
Token the
Token ID262
Feature activation-5.37e-01

Pos logit contributions
 Fil+2.220
 NEXT+2.025
rador+2.002
 Cle+1.845
 Ake+1.845
Neg logit contributions
��-4.239
��-4.154
netflix-4.117
-4.020
voy-3.950
Token films
Token ID7328
Feature activation-5.24e-01

Pos logit contributions
 Fil+4.743
 NEXT+4.266
rador+4.127
Fil+4.113
outube+3.887
Neg logit contributions
��-5.832
��-5.781
-5.485
netflix-5.444
 Solitaire-5.329
Token.
Token ID13
Feature activation-5.24e-01

Pos logit contributions
 Fil+3.077
 NEXT+2.854
rador+2.656
 Cle+2.534
Fil+2.508
Neg logit contributions
��-7.090
��-7.059
 pestic-6.804
netflix-6.729
voy-6.632
TokenIt
Token ID1026
Feature activation-8.79e-01

Pos logit contributions
 Fil+4.338
rador+4.092
 NEXT+4.042
Fil+3.819
outube+3.783
Neg logit contributions
��-5.476
netflix-5.453
��-5.442
voy-5.378
zzo-5.336
Token is
Token ID318
Feature activation-9.52e-01

Pos logit contributions
 Fil+4.664
rador+4.103
 NEXT+3.866
 Cle+3.557
estine+3.524
Neg logit contributions
��-12.072
��-11.759
-11.246
 pestic-11.228
netflix-11.191
Token also
Token ID635
Feature activation-7.94e-01

Pos logit contributions
 Fil+6.077
 NEXT+5.771
rador+5.542
estine+5.244
redo+5.225
Neg logit contributions
��-10.861
��-10.688
netflix-10.412
voy-10.144
-10.128
Token interesting
Token ID3499
Feature activation-4.00e-01

Pos logit contributions
 Fil+3.214
rador+3.049
 NEXT+2.830
urious+2.807
yg+2.484
Neg logit contributions
��-11.438
��-11.125
netflix-11.119
-10.914
PROV-10.699
Token the
Token ID262
Feature activation-6.51e-01

Pos logit contributions
 Fil+2.673
rador+2.523
 NEXT+2.478
 Cle+2.290
psey+2.290
Neg logit contributions
��-5.078
netflix-4.972
-4.949
��-4.937
voy-4.825
Token Tob
Token ID13695
Feature activation-2.36e-01

Pos logit contributions
 Fil+5.571
 NEXT+5.282
rador+5.218
outube+4.977
psey+4.857
Neg logit contributions
��-6.679
��-6.500
netflix-6.261
-6.252
zzo-6.201
Tokening
Token ID278
Feature activation+1.04e-01

Pos logit contributions
 Fil+0.031
rador+0.030
 NEXT+0.029
psey+0.029
outube+0.027
Neg logit contributions
netflix-0.049
��-0.049
-0.048
��-0.048
voy-0.047
Token and
Token ID290
Feature activation+9.96e-02

Pos logit contributions
netflix+1.171
��+1.146
+1.136
voy+1.135
��+1.128
Neg logit contributions
 Fil-0.836
rador-0.834
 NEXT-0.799
psey-0.787
ername-0.745
Token the
Token ID262
Feature activation+4.26e-01

Pos logit contributions
netflix+1.097
��+1.074
+1.066
voy+1.060
��+1.058
Neg logit contributions
 Fil-0.842
rador-0.827
 NEXT-0.803
psey-0.777
 Ake-0.748
Token one
Token ID530
Feature activation+3.57e-01

Pos logit contributions
netflix+4.336
voy+4.245
+4.228
��+4.205
zzo+4.133
Neg logit contributions
rador-3.843
 Fil-3.705
psey-3.632
 NEXT-3.577
ername-3.436
Token that
Token ID326
Feature activation+3.75e-01

Pos logit contributions
netflix+3.597
voy+3.505
+3.491
��+3.483
��+3.407
Neg logit contributions
rador-3.281
 Fil-3.186
psey-3.121
 NEXT-3.073
ername-2.944
Token I
Token ID314
Feature activation+5.20e-01

Pos logit contributions
netflix+3.910
voy+3.811
+3.797
��+3.790
zzo+3.723
Neg logit contributions
rador-3.299
 Fil-3.197
psey-3.149
 NEXT-3.095
perty-2.949
Token think
Token ID892
Feature activation+2.86e-01

Pos logit contributions
netflix+5.162
zzo+5.088
voy+5.086
��+5.002
+4.968
Neg logit contributions
rador-4.666
 Fil-4.441
psey-4.416
 NEXT-4.327
ername-4.205
Token reduces
Token ID12850
Feature activation+5.52e-01

Pos logit contributions
netflix+3.034
voy+2.950
��+2.948
+2.944
��+2.894
Neg logit contributions
rador-2.471
 Fil-2.421
psey-2.347
 NEXT-2.329
ername-2.210
Token immersion
Token ID36375
Feature activation+3.51e-01

Pos logit contributions
voy+5.286
netflix+5.284
 Solitaire+5.215
zzo+5.183
+5.173
Neg logit contributions
rador-5.105
 Fil-4.772
psey-4.746
 NEXT-4.633
ername-4.595
Token more
Token ID517
Feature activation+2.90e-01

Pos logit contributions
netflix+3.733
voy+3.668
+3.633
��+3.602
zzo+3.563
Neg logit contributions
rador-2.999
 Fil-2.911
psey-2.803
 NEXT-2.797
perty-2.667
Token on
Token ID319
Feature activation-1.85e-02

Pos logit contributions
netflix+3.408
voy+3.364
+3.344
��+3.341
zzo+3.285
Neg logit contributions
rador-2.137
 Fil-2.092
psey-1.998
 NEXT-1.992
ername-1.884
Token wear
Token ID5806
Feature activation-1.54e-01

Pos logit contributions
 Fil+0.453
rador+0.444
 NEXT+0.429
psey+0.420
 Ake+0.402
Neg logit contributions
netflix-0.626
��-0.621
voy-0.607
-0.607
��-0.607
Tokeniness
Token ID1272
Feature activation-2.91e-01

Pos logit contributions
 Fil+1.145
rador+1.116
 NEXT+1.080
psey+1.048
ername+1.011
Neg logit contributions
��-1.820
netflix-1.803
��-1.785
-1.782
voy-1.766
Token and
Token ID290
Feature activation-4.11e-01

Pos logit contributions
 Fil+2.032
rador+1.968
 NEXT+1.912
psey+1.860
 Cle+1.769
Neg logit contributions
��-3.611
��-3.535
netflix-3.531
-3.461
voy-3.417
Token confusion
Token ID10802
Feature activation-2.55e-01

Pos logit contributions
 Fil+3.416
rador+3.252
 NEXT+3.170
psey+3.073
 Cle+3.061
Neg logit contributions
��-4.556
netflix-4.550
��-4.439
-4.386
voy-4.297
Token to
Token ID284
Feature activation+3.07e-01

Pos logit contributions
 Fil+1.800
rador+1.725
 NEXT+1.694
psey+1.599
 Cle+1.567
Neg logit contributions
��-3.141
netflix-3.113
��-3.060
-3.017
voy-2.977
Token the
Token ID262
Feature activation+5.81e-01

Pos logit contributions
netflix+3.272
voy+3.225
+3.214
��+3.189
zzo+3.157
Neg logit contributions
rador-2.605
 Fil-2.548
psey-2.465
 NEXT-2.462
ername-2.337
Token minds
Token ID9017
Feature activation-1.51e-01

Pos logit contributions
voy+5.598
+5.467
zzo+5.391
netflix+5.371
��+5.370
Neg logit contributions
rador-5.346
 Fil-5.092
psey-5.050
 NEXT-4.962
ername-4.859
Token of
Token ID286
Feature activation+2.40e-01

Pos logit contributions
rador+1.066
 Fil+1.065
 NEXT+1.015
psey+0.999
ername+0.935
Neg logit contributions
netflix-1.838
��-1.808
��-1.773
-1.772
voy-1.763
Token readers
Token ID7183
Feature activation+5.43e-02

Pos logit contributions
netflix+2.645
voy+2.581
��+2.579
+2.577
��+2.540
Neg logit contributions
rador-1.975
 Fil-1.950
 NEXT-1.872
psey-1.865
ername-1.763
Token.
Token ID13
Feature activation+4.27e-02

Pos logit contributions
netflix+0.624
��+0.611
+0.605
voy+0.603
��+0.601
Neg logit contributions
rador-0.426
 Fil-0.426
 NEXT-0.406
psey-0.401
ername-0.377
Token
Token ID198
Feature activation-1.20e-02

Pos logit contributions
netflix+0.470
��+0.462
voy+0.454
��+0.451
+0.449
Neg logit contributions
 Fil-0.371
rador-0.354
 NEXT-0.353
psey-0.330
 Cle-0.329
Token or
Token ID393
Feature activation+4.45e-02

Pos logit contributions
 Fil+0.687
rador+0.659
 NEXT+0.643
psey+0.626
 Cle+0.599
Neg logit contributions
netflix-0.966
-0.950
��-0.950
voy-0.948
��-0.940
Token in
Token ID287
Feature activation+2.00e-01

Pos logit contributions
netflix+0.504
��+0.497
+0.492
voy+0.491
��+0.488
Neg logit contributions
 Fil-0.360
rador-0.351
 NEXT-0.340
psey-0.330
outube-0.317
Token one
Token ID530
Feature activation-1.98e-01

Pos logit contributions
netflix+2.120
��+2.068
+2.060
voy+2.059
��+2.031
Neg logit contributions
rador-1.743
 Fil-1.733
 NEXT-1.663
psey-1.651
ername-1.563
Token of
Token ID286
Feature activation+1.66e-01

Pos logit contributions
 Fil+1.426
rador+1.358
 NEXT+1.317
psey+1.285
 Cle+1.234
Neg logit contributions
netflix-2.392
��-2.361
-2.345
voy-2.344
��-2.327
Token the
Token ID262
Feature activation+1.43e-01

Pos logit contributions
netflix+1.835
��+1.794
+1.787
voy+1.787
��+1.762
Neg logit contributions
 Fil-1.375
rador-1.374
 NEXT-1.313
psey-1.300
outube-1.226
Token dinner
Token ID8073
Feature activation+5.42e-01

Pos logit contributions
netflix+1.567
��+1.536
+1.529
voy+1.524
��+1.510
Neg logit contributions
 Fil-1.204
rador-1.193
 NEXT-1.152
psey-1.132
outube-1.072
Token conversations
Token ID10275
Feature activation+2.64e-01

Pos logit contributions
netflix+5.617
voy+5.548
+5.524
��+5.440
��+5.339
Neg logit contributions
rador-4.673
 Fil-4.451
psey-4.378
 NEXT-4.298
ername-4.260
Token where
Token ID810
Feature activation-9.14e-03

Pos logit contributions
netflix+2.863
��+2.792
voy+2.787
+2.783
��+2.746
Neg logit contributions
rador-2.232
 Fil-2.192
 NEXT-2.109
psey-2.107
ername-1.994
Token connections
Token ID8787
Feature activation-1.39e-01

Pos logit contributions
 Fil+0.074
rador+0.071
 NEXT+0.069
psey+0.068
 Ake+0.065
Neg logit contributions
netflix-0.105
��-0.103
-0.102
voy-0.101
��-0.101
Token are
Token ID389
Feature activation+2.02e-01

Pos logit contributions
 Fil+1.060
rador+1.029
 NEXT+0.994
psey+0.969
 Ake+0.921
Neg logit contributions
netflix-1.619
��-1.603
-1.584
voy-1.568
��-1.560
Token made
Token ID925
Feature activation+1.74e-01

Pos logit contributions
netflix+2.239
��+2.180
+2.172
voy+2.172
��+2.142
Neg logit contributions
rador-1.666
 Fil-1.657
 NEXT-1.586
psey-1.573
ername-1.485
Token,
Token ID11
Feature activation+1.80e-01

Pos logit contributions
netflix+1.633
��+1.594
voy+1.588
+1.588
��+1.568
Neg logit contributions
rador-1.178
 Fil-1.167
 NEXT-1.119
psey-1.109
ername-1.046
Token and
Token ID290
Feature activation+7.57e-02

Pos logit contributions
netflix+2.059
��+2.011
voy+2.004
+2.002
��+1.977
Neg logit contributions
rador-1.424
 Fil-1.415
 NEXT-1.353
psey-1.341
ername-1.260
Token one
Token ID530
Feature activation+7.28e-02

Pos logit contributions
netflix+0.851
��+0.829
voy+0.822
+0.820
��+0.814
Neg logit contributions
 Fil-0.625
rador-0.617
 NEXT-0.600
psey-0.583
 Ake-0.558
Token glass
Token ID5405
Feature activation+4.75e-02

Pos logit contributions
netflix+0.851
��+0.833
voy+0.824
+0.823
��+0.819
Neg logit contributions
 Fil-0.562
rador-0.560
 NEXT-0.534
psey-0.523
 Ake-0.497
Token door
Token ID3420
Feature activation+2.27e-01

Pos logit contributions
netflix+0.510
��+0.500
+0.493
voy+0.492
��+0.491
Neg logit contributions
rador-0.411
 Fil-0.410
 NEXT-0.392
psey-0.386
perty-0.364
Token,
Token ID11
Feature activation+5.17e-01

Pos logit contributions
netflix+2.545
voy+2.483
+2.483
��+2.476
��+2.438
Neg logit contributions
rador-1.816
 Fil-1.793
psey-1.723
 NEXT-1.720
ername-1.622
Token taped
Token ID36114
Feature activation+1.64e-01

Pos logit contributions
netflix+5.568
+5.496
voy+5.486
zzo+5.403
��+5.389
Neg logit contributions
rador-4.190
 Fil-3.999
psey-3.966
 NEXT-3.880
ername-3.825
Token shut
Token ID4423
Feature activation+2.12e-01

Pos logit contributions
netflix+2.064
��+2.022
+2.014
voy+2.013
��+1.993
Neg logit contributions
rador-1.112
 Fil-1.104
 NEXT-1.048
psey-1.035
ername-0.962
Token,
Token ID11
Feature activation+4.46e-01

Pos logit contributions
netflix+2.343
��+2.285
voy+2.283
+2.282
��+2.245
Neg logit contributions
rador-1.739
 Fil-1.716
psey-1.644
 NEXT-1.643
ername-1.549
Token leads
Token ID5983
Feature activation+2.31e-01

Pos logit contributions
netflix+4.792
voy+4.692
+4.689
zzo+4.640
��+4.635
Neg logit contributions
rador-3.687
 Fil-3.545
psey-3.542
 NEXT-3.440
ername-3.348
Token directly
Token ID3264
Feature activation+5.77e-01

Pos logit contributions
netflix+2.553
��+2.487
+2.485
voy+2.483
��+2.445
Neg logit contributions
rador-1.905
 Fil-1.880
 NEXT-1.804
psey-1.799
ername-1.696
Token years
Token ID812
Feature activation-1.52e-02

Pos logit contributions
 Fil+0.134
rador+0.132
 NEXT+0.125
psey+0.121
 Ake+0.113
Neg logit contributions
netflix-0.246
��-0.240
voy-0.240
��-0.238
-0.237
Token imprisonment
Token ID16510
Feature activation+7.43e-02

Pos logit contributions
rador+0.112
 Fil+0.112
 NEXT+0.107
psey+0.103
outube+0.099
Neg logit contributions
netflix-0.182
��-0.177
voy-0.176
-0.176
��-0.176
Token and
Token ID290
Feature activation+4.30e-01

Pos logit contributions
netflix+0.849
��+0.829
voy+0.825
+0.825
��+0.816
Neg logit contributions
rador-0.588
 Fil-0.583
 NEXT-0.558
psey-0.551
ername-0.521
Token/
Token ID14
Feature activation-2.92e-01

Pos logit contributions
netflix+4.085
+3.996
voy+3.980
��+3.959
zzo+3.914
Neg logit contributions
rador-4.020
 Fil-3.969
psey-3.929
 NEXT-3.836
ername-3.753
Tokenor
Token ID273
Feature activation+4.47e-01

Pos logit contributions
 Fil+1.950
rador+1.911
 NEXT+1.829
redo+1.755
yg+1.707
Neg logit contributions
��-3.583
��-3.566
netflix-3.566
-3.546
 Solitaire-3.473
Token a
Token ID257
Feature activation+7.47e-01

Pos logit contributions
netflix+4.156
+4.073
voy+4.058
��+4.045
zzo+4.005
Neg logit contributions
rador-4.194
 Fil-4.173
psey-4.140
 NEXT-4.040
ername-3.971
Token $
Token ID720
Feature activation+2.50e-01

Pos logit contributions
+6.059
netflix+6.045
��+5.975
voy+5.904
zzo+5.773
Neg logit contributions
rador-7.491
 Fil-7.340
psey-7.321
ername-7.231
 NEXT-7.069
Token1
Token ID16
Feature activation-1.25e-02

Pos logit contributions
netflix+2.427
��+2.360
voy+2.351
+2.351
��+2.315
Neg logit contributions
rador-2.402
 Fil-2.385
 NEXT-2.300
psey-2.286
ername-2.179
Token.
Token ID13
Feature activation-1.46e-01

Pos logit contributions
rador+0.102
 Fil+0.101
psey+0.096
 NEXT+0.096
perty+0.091
Neg logit contributions
netflix-0.140
voy-0.135
��-0.135
-0.134
��-0.132
Token1
Token ID16
Feature activation+1.18e-01

Pos logit contributions
 Fil+1.261
rador+1.227
 NEXT+1.202
 Cle+1.131
ername+1.110
Neg logit contributions
netflix-1.597
��-1.561
voy-1.541
��-1.539
-1.526
Token million
Token ID1510
Feature activation+2.30e-01

Pos logit contributions
netflix+1.125
��+1.091
voy+1.087
+1.085
��+1.070
Neg logit contributions
rador-1.164
 Fil-1.155
 NEXT-1.114
psey-1.108
ername-1.055
Token a
Token ID257
Feature activation-2.06e-02

Pos logit contributions
 Fil+0.990
rador+0.936
 NEXT+0.914
psey+0.889
 Cle+0.866
Neg logit contributions
netflix-1.394
��-1.384
voy-1.370
-1.361
��-1.336
Token brick
Token ID17214
Feature activation-2.02e-01

Pos logit contributions
 Fil+0.176
rador+0.171
 NEXT+0.166
psey+0.162
 Cle+0.156
Neg logit contributions
netflix-0.225
��-0.225
-0.221
voy-0.220
��-0.218
Token wall
Token ID3355
Feature activation-1.64e-01

Pos logit contributions
 Fil+1.676
rador+1.643
 NEXT+1.622
psey+1.562
 Cle+1.495
Neg logit contributions
netflix-2.223
��-2.181
��-2.137
voy-2.136
-2.118
Token.
Token ID13
Feature activation-8.51e-02

Pos logit contributions
 Fil+1.227
rador+1.205
 NEXT+1.172
psey+1.124
 Cle+1.071
Neg logit contributions
netflix-1.949
��-1.936
��-1.897
voy-1.886
-1.877
Token I
Token ID314
Feature activation+1.12e-02

Pos logit contributions
 Fil+0.760
 NEXT+0.706
rador+0.682
 Cle+0.666
 Ake+0.656
Neg logit contributions
netflix-0.936
��-0.936
��-0.913
voy-0.904
-0.892
Token was
Token ID373
Feature activation+2.35e-03

Pos logit contributions
netflix+0.126
��+0.123
+0.123
��+0.123
voy+0.122
Neg logit contributions
 Fil-0.092
rador-0.090
 NEXT-0.086
psey-0.085
 Cle-0.080
Token broke
Token ID6265
Feature activation+7.30e-02

Pos logit contributions
netflix+0.027
��+0.026
+0.026
��+0.026
voy+0.026
Neg logit contributions
 Fil-0.019
rador-0.019
 NEXT-0.018
psey-0.018
 Cle-0.017
Token,
Token ID11
Feature activation+1.50e-02

Pos logit contributions
netflix+0.831
��+0.811
+0.805
voy+0.805
��+0.800
Neg logit contributions
rador-0.581
 Fil-0.580
 NEXT-0.554
psey-0.546
ername-0.515
Token lonely
Token ID21757
Feature activation+8.33e-02

Pos logit contributions
netflix+0.173
��+0.169
+0.169
voy+0.168
��+0.167
Neg logit contributions
 Fil-0.119
rador-0.117
 NEXT-0.113
psey-0.110
 Ake-0.105
Token,
Token ID11
Feature activation+3.79e-02

Pos logit contributions
netflix+0.951
��+0.931
voy+0.925
+0.924
��+0.917
Neg logit contributions
 Fil-0.659
rador-0.657
 NEXT-0.626
psey-0.617
ername-0.583
Token and
Token ID290
Feature activation+5.24e-02

Pos logit contributions
netflix+0.441
��+0.432
+0.431
voy+0.429
��+0.427
Neg logit contributions
 Fil-0.294
rador-0.292
 NEXT-0.278
psey-0.272
 Ake-0.258
Token.
Token ID13
Feature activation-1.12e-01

Pos logit contributions
netflix+3.224
voy+3.164
+3.148
��+3.133
zzo+3.096
Neg logit contributions
rador-2.516
 Fil-2.460
psey-2.374
 NEXT-2.371
outube-2.252
Token That
Token ID1320
Feature activation-1.49e-01

Pos logit contributions
 Fil+0.903
rador+0.865
 NEXT+0.862
 Ake+0.803
 Cle+0.800
Neg logit contributions
netflix-1.297
��-1.272
voy-1.249
��-1.243
-1.236
Token information
Token ID1321
Feature activation-9.80e-02

Pos logit contributions
 Fil+1.187
rador+1.150
 NEXT+1.143
psey+1.071
 Ake+1.063
Neg logit contributions
netflix-1.734
��-1.689
-1.683
��-1.674
voy-1.655
Token has
Token ID468
Feature activation-1.01e-01

Pos logit contributions
 Fil+0.781
rador+0.762
 NEXT+0.745
psey+0.714
 Cle+0.697
Neg logit contributions
netflix-1.135
��-1.103
-1.096
��-1.083
voy-1.075
Token provided
Token ID2810
Feature activation+5.32e-02

Pos logit contributions
 Fil+0.829
rador+0.819
 NEXT+0.792
psey+0.766
 Cle+0.740
Neg logit contributions
netflix-1.141
��-1.102
-1.092
��-1.087
voy-1.083
Token scientists
Token ID5519
Feature activation+2.23e-01

Pos logit contributions
netflix+0.589
��+0.577
voy+0.569
+0.568
��+0.564
Neg logit contributions
 Fil-0.444
rador-0.440
 NEXT-0.426
psey-0.419
 Ake-0.397
Token a
Token ID257
Feature activation+1.64e-01

Pos logit contributions
netflix+2.421
+2.361
voy+2.359
��+2.356
��+2.324
Neg logit contributions
rador-1.871
 Fil-1.844
 NEXT-1.768
psey-1.761
ername-1.670
Token library
Token ID5888
Feature activation+3.05e-01

Pos logit contributions
netflix+1.830
��+1.794
+1.779
voy+1.774
��+1.760
Neg logit contributions
rador-1.339
 Fil-1.334
 NEXT-1.280
psey-1.266
outube-1.187
Token of
Token ID286
Feature activation-1.06e-02

Pos logit contributions
netflix+3.183
voy+3.137
+3.128
��+3.060
��+3.021
Neg logit contributions
rador-2.584
 Fil-2.575
psey-2.504
 NEXT-2.451
outube-2.408
Token comparisons
Token ID17909
Feature activation+1.11e-01

Pos logit contributions
 Fil+0.086
rador+0.085
 NEXT+0.082
psey+0.081
 Cle+0.076
Neg logit contributions
netflix-0.121
��-0.119
-0.117
��-0.116
voy-0.116
Token for
Token ID329
Feature activation+1.26e-02

Pos logit contributions
netflix+1.286
��+1.260
+1.247
voy+1.246
��+1.236
Neg logit contributions
 Fil-0.853
rador-0.849
 NEXT-0.813
psey-0.799
ername-0.754
Token to
Token ID284
Feature activation+3.49e-01

Pos logit contributions
netflix+3.461
voy+3.369
��+3.369
+3.363
��+3.325
Neg logit contributions
rador-2.711
 Fil-2.666
psey-2.587
 NEXT-2.582
ername-2.445
Token show
Token ID905
Feature activation+2.26e-01

Pos logit contributions
netflix+3.807
voy+3.704
��+3.698
+3.696
��+3.636
Neg logit contributions
rador-2.904
 Fil-2.855
psey-2.755
 NEXT-2.750
ername-2.594
Token that
Token ID326
Feature activation+1.78e-01

Pos logit contributions
netflix+2.504
��+2.446
voy+2.436
+2.436
��+2.402
Neg logit contributions
rador-1.858
 Fil-1.846
 NEXT-1.765
psey-1.748
ername-1.652
Token there
Token ID612
Feature activation+3.99e-02

Pos logit contributions
netflix+1.916
��+1.871
+1.863
voy+1.862
��+1.836
Neg logit contributions
rador-1.523
 Fil-1.521
 NEXT-1.454
psey-1.435
outube-1.360
Token is
Token ID318
Feature activation+1.80e-01

Pos logit contributions
netflix+0.452
voy+0.439
��+0.439
+0.438
��+0.433
Neg logit contributions
rador-0.319
 Fil-0.319
 NEXT-0.304
psey-0.299
outube-0.285
Token a
Token ID257
Feature activation+3.03e-01

Pos logit contributions
netflix+1.948
��+1.902
+1.898
voy+1.892
��+1.867
Neg logit contributions
rador-1.538
 Fil-1.537
 NEXT-1.471
psey-1.447
outube-1.373
Token lot
Token ID1256
Feature activation+1.89e-01

Pos logit contributions
netflix+3.310
��+3.218
voy+3.215
+3.210
��+3.168
Neg logit contributions
rador-2.531
 Fil-2.500
 NEXT-2.403
psey-2.397
ername-2.269
Token of
Token ID286
Feature activation+3.40e-01

Pos logit contributions
netflix+2.116
��+2.061
voy+2.055
+2.053
��+2.027
Neg logit contributions
rador-1.533
 Fil-1.511
 NEXT-1.451
psey-1.451
ername-1.366
Token something
Token ID1223
Feature activation+6.63e-02

Pos logit contributions
netflix+3.612
voy+3.508
��+3.505
+3.494
��+3.450
Neg logit contributions
rador-2.923
 Fil-2.871
psey-2.787
 NEXT-2.768
ername-2.650
Token (
Token ID357
Feature activation+2.26e-01

Pos logit contributions
netflix+0.753
��+0.736
voy+0.733
+0.731
��+0.724
Neg logit contributions
 Fil-0.530
rador-0.527
 NEXT-0.506
psey-0.494
outube-0.469
Tokenp
Token ID79
Feature activation-2.90e-01

Pos logit contributions
netflix+2.337
��+2.279
+2.270
voy+2.268
��+2.238
Neg logit contributions
rador-2.018
 Fil-2.002
 NEXT-1.925
psey-1.912
ername-1.816
Token,
Token ID11
Feature activation-1.47e-01

Pos logit contributions
rador+0.142
 Fil+0.142
 NEXT+0.135
psey+0.133
ername+0.126
Neg logit contributions
netflix-0.204
��-0.201
voy-0.200
-0.200
��-0.198
Token she
Token ID673
Feature activation-1.39e-01

Pos logit contributions
 Fil+1.194
rador+1.138
 NEXT+1.106
psey+1.081
 Cle+1.064
Neg logit contributions
netflix-1.667
��-1.646
-1.639
��-1.633
voy-1.623
Token was
Token ID373
Feature activation-2.64e-01

Pos logit contributions
 Fil+1.025
rador+1.001
 NEXT+0.948
psey+0.909
 Cle+0.890
Neg logit contributions
netflix-1.678
��-1.648
-1.640
voy-1.635
��-1.633
Token an
Token ID281
Feature activation-9.82e-02

Pos logit contributions
 Fil+2.119
rador+2.045
 NEXT+1.946
psey+1.881
 Cle+1.866
Neg logit contributions
netflix-3.061
��-2.979
-2.952
��-2.950
voy-2.909
Token assistant
Token ID8796
Feature activation+2.15e-01

Pos logit contributions
 Fil+0.809
rador+0.780
 NEXT+0.759
psey+0.736
 Ake+0.728
Neg logit contributions
netflix-1.122
��-1.074
-1.066
voy-1.064
��-1.054
Token director
Token ID3437
Feature activation+1.28e-01

Pos logit contributions
netflix+2.360
��+2.301
voy+2.300
+2.294
��+2.264
Neg logit contributions
rador-1.776
 Fil-1.758
 NEXT-1.688
psey-1.678
ername-1.587
Token of
Token ID286
Feature activation+3.24e-02

Pos logit contributions
netflix+1.469
��+1.435
+1.430
voy+1.429
��+1.411
Neg logit contributions
rador-0.998
 Fil-0.989
 NEXT-0.945
psey-0.939
ername-0.883
Token the
Token ID262
Feature activation+4.22e-02

Pos logit contributions
netflix+0.373
��+0.364
+0.362
voy+0.361
��+0.358
Neg logit contributions
 Fil-0.258
rador-0.257
 NEXT-0.246
psey-0.242
 Ake-0.228
Token Women
Token ID6926
Feature activation-2.93e-01

Pos logit contributions
netflix+0.450
��+0.439
+0.438
voy+0.433
��+0.431
Neg logit contributions
 Fil-0.372
rador-0.368
 NEXT-0.355
psey-0.347
 Ake-0.333
Token's
Token ID338
Feature activation-1.09e-01

Pos logit contributions
 Fil+2.123
rador+2.014
 NEXT+1.966
 Cle+1.852
 Equal+1.851
Neg logit contributions
netflix-3.558
��-3.511
-3.510
��-3.505
voy-3.397
Token Army
Token ID5407
Feature activation-1.88e-02

Pos logit contributions
 Fil+1.004
rador+0.955
 NEXT+0.943
 Cle+0.906
psey+0.904
Neg logit contributions
netflix-1.136
��-1.120
-1.107
��-1.098
voy-1.083
Tokenn
Token ID77
Feature activation-2.99e-01

Pos logit contributions
 Fil+0.400
rador+0.400
 NEXT+0.382
psey+0.371
outube+0.350
Neg logit contributions
netflix-0.625
��-0.613
-0.606
��-0.601
voy-0.600
Tokenovation
Token ID17882
Feature activation+3.27e-02

Pos logit contributions
 Fil+2.725
rador+2.642
 NEXT+2.638
psey+2.559
estine+2.460
Neg logit contributions
netflix-3.070
��-2.969
��-2.895
-2.887
voy-2.874
Token FT
Token ID19446
Feature activation+5.42e-01

Pos logit contributions
netflix+0.371
��+0.360
voy+0.356
+0.356
��+0.352
Neg logit contributions
 Fil-0.267
rador-0.264
 NEXT-0.255
psey-0.248
ername-0.237
TokenW
Token ID54
Feature activation+1.91e-01

Pos logit contributions
voy+4.116
+4.039
��+3.983
��+3.977
netflix+3.966
Neg logit contributions
rador-6.010
psey-5.911
 Fil-5.775
 NEXT-5.703
ername-5.575
Token!
Token ID0
Feature activation+2.72e-01

Pos logit contributions
netflix+1.958
��+1.907
voy+1.904
+1.904
��+1.873
Neg logit contributions
rador-1.734
 Fil-1.716
 NEXT-1.651
psey-1.647
ername-1.562
Token
Token ID198
Feature activation+1.92e-01

Pos logit contributions
netflix+2.862
voy+2.789
+2.786
��+2.782
��+2.736
Neg logit contributions
rador-2.385
 Fil-2.346
psey-2.264
 NEXT-2.257
ername-2.141
Token
Token ID198
Feature activation+2.45e-01

Pos logit contributions
netflix+2.129
voy+2.079
+2.067
��+2.052
zzo+2.032
Neg logit contributions
rador-1.582
 Fil-1.534
psey-1.512
 NEXT-1.479
perty-1.420
TokenTo
Token ID2514
Feature activation+4.64e-01

Pos logit contributions
netflix+2.432
��+2.365
voy+2.358
+2.358
��+2.321
Neg logit contributions
rador-2.298
 Fil-2.280
 NEXT-2.197
psey-2.186
ername-2.079
Token some
Token ID617
Feature activation+2.66e-01

Pos logit contributions
netflix+4.865
voy+4.731
+4.727
��+4.677
zzo+4.672
Neg logit contributions
rador-3.984
 Fil-3.867
psey-3.719
 NEXT-3.700
perty-3.560
Token,
Token ID11
Feature activation+3.23e-02

Pos logit contributions
netflix+2.923
voy+2.855
+2.845
��+2.843
��+2.796
Neg logit contributions
rador-2.188
 Fil-2.155
 NEXT-2.077
psey-2.066
ername-1.948
Token this
Token ID428
Feature activation-1.58e-02

Pos logit contributions
netflix+0.365
��+0.357
+0.353
��+0.350
voy+0.349
Neg logit contributions
 Fil-0.264
rador-0.262
 NEXT-0.251
psey-0.246
 Ake-0.235
Token.
Token ID13
Feature activation-2.32e-01

Pos logit contributions
netflix+0.701
��+0.685
+0.684
voy+0.680
��+0.675
Neg logit contributions
 Fil-0.496
rador-0.495
 NEXT-0.479
psey-0.466
ername-0.442
Token I
Token ID314
Feature activation-1.16e-01

Pos logit contributions
 Fil+2.067
 NEXT+1.968
rador+1.854
 Cle+1.836
 Ake+1.793
Neg logit contributions
netflix-2.542
��-2.520
��-2.452
-2.429
voy-2.414
Token love
Token ID1842
Feature activation+1.60e-01

Pos logit contributions
 Fil+0.936
rador+0.902
 NEXT+0.883
psey+0.847
 Cle+0.812
Neg logit contributions
netflix-1.330
-1.310
��-1.307
��-1.293
voy-1.266
Token the
Token ID262
Feature activation-1.63e-01

Pos logit contributions
netflix+1.754
��+1.713
+1.707
voy+1.703
��+1.682
Neg logit contributions
rador-1.340
 Fil-1.339
 NEXT-1.281
psey-1.265
ername-1.194
Token refreshing
Token ID23056
Feature activation-1.40e-01

Pos logit contributions
 Fil+1.308
rador+1.242
 NEXT+1.229
psey+1.165
 Cle+1.159
Neg logit contributions
netflix-1.873
��-1.847
-1.825
��-1.801
voy-1.791
Token light
Token ID1657
Feature activation-2.59e-01

Pos logit contributions
 Fil+1.111
rador+1.068
 NEXT+1.032
 Cle+1.001
psey+0.986
Neg logit contributions
netflix-1.602
��-1.588
-1.575
��-1.550
voy-1.543
Token taste
Token ID6938
Feature activation-1.30e-01

Pos logit contributions
 Fil+2.079
rador+2.012
 NEXT+1.942
 Cle+1.885
psey+1.855
Neg logit contributions
��-2.952
netflix-2.882
-2.859
��-2.832
zzo-2.788
Token of
Token ID286
Feature activation-6.96e-02

Pos logit contributions
 Fil+0.946
rador+0.930
 NEXT+0.896
psey+0.868
 Cle+0.837
Neg logit contributions
netflix-1.570
��-1.540
-1.523
voy-1.521
��-1.518
Token these
Token ID777
Feature activation-3.19e-02

Pos logit contributions
 Fil+0.582
rador+0.568
 NEXT+0.550
psey+0.527
 Cle+0.515
Neg logit contributions
netflix-0.774
��-0.764
-0.756
voy-0.748
��-0.744
Token cookies
Token ID14746
Feature activation+1.39e-01

Pos logit contributions
 Fil+0.260
rador+0.253
 NEXT+0.246
psey+0.241
 Cle+0.231
Neg logit contributions
netflix-0.363
��-0.355
��-0.351
voy-0.350
-0.350
Token,
Token ID11
Feature activation+4.97e-02

Pos logit contributions
netflix+1.555
��+1.519
+1.513
voy+1.513
��+1.495
Neg logit contributions
rador-1.133
 Fil-1.126
 NEXT-1.079
psey-1.069
ername-1.008
Token Club
Token ID6289
Feature activation+9.56e-02

Pos logit contributions
netflix+1.502
��+1.468
+1.462
voy+1.455
��+1.449
Neg logit contributions
 Fil-1.203
rador-1.193
 NEXT-1.149
psey-1.129
 Cle-1.074
Token and
Token ID290
Feature activation-2.67e-01

Pos logit contributions
netflix+1.099
��+1.074
+1.070
voy+1.069
��+1.059
Neg logit contributions
rador-0.748
 Fil-0.745
 NEXT-0.712
psey-0.704
outube-0.662
Token a
Token ID257
Feature activation+1.11e-01

Pos logit contributions
 Fil+2.254
 NEXT+2.170
rador+2.145
 Cle+2.029
psey+2.008
Neg logit contributions
��-2.913
netflix-2.906
��-2.888
-2.843
voy-2.824
Token new
Token ID649
Feature activation-4.80e-02

Pos logit contributions
netflix+1.199
��+1.183
+1.171
��+1.168
voy+1.163
Neg logit contributions
 Fil-0.941
rador-0.937
 NEXT-0.908
psey-0.888
 Cle-0.840
Token sponsor
Token ID17487
Feature activation+5.78e-04

Pos logit contributions
 Fil+0.397
rador+0.394
 NEXT+0.379
psey+0.368
 Ake+0.354
Neg logit contributions
netflix-0.536
��-0.530
��-0.521
-0.520
voy-0.517
Token is
Token ID318
Feature activation-4.42e-02

Pos logit contributions
netflix+0.006
��+0.006
+0.006
voy+0.006
��+0.006
Neg logit contributions
 Fil-0.005
rador-0.005
 NEXT-0.005
psey-0.004
 Ake-0.004
Token for
Token ID329
Feature activation-4.59e-02

Pos logit contributions
 Fil+0.346
rador+0.335
 NEXT+0.331
psey+0.319
 Ake+0.308
Neg logit contributions
netflix-0.520
��-0.500
-0.497
voy-0.497
��-0.493
Token the
Token ID262
Feature activation+2.22e-01

Pos logit contributions
 Fil+0.379
rador+0.370
 NEXT+0.365
psey+0.353
 Cle+0.338
Neg logit contributions
netflix-0.510
��-0.505
-0.501
voy-0.496
��-0.495
Token Premier
Token ID9952
Feature activation-2.03e-01

Pos logit contributions
netflix+2.483
��+2.423
+2.416
voy+2.415
��+2.384
Neg logit contributions
rador-1.798
 Fil-1.786
 NEXT-1.711
psey-1.697
ername-1.600
Token League
Token ID4041
Feature activation+1.53e-01

Pos logit contributions
 Fil+1.876
rador+1.784
 NEXT+1.773
psey+1.743
 Cle+1.689
Neg logit contributions
netflix-2.070
��-2.013
-2.007
voy-1.991
��-1.978
Token season
Token ID1622
Feature activation-1.07e-01

Pos logit contributions
netflix+1.665
��+1.622
+1.616
voy+1.614
��+1.594
Neg logit contributions
rador-1.291
 Fil-1.289
 NEXT-1.237
psey-1.222
ername-1.156
Token
Token ID247
Feature activation-1.19e-01

Pos logit contributions
rador+0.125
 Fil+0.123
 NEXT+0.120
psey+0.118
ername+0.117
Neg logit contributions
netflix-0.075
voy-0.072
-0.071
 Solitaire-0.068
zzo-0.068
Tokens
Token ID82
Feature activation+1.21e-01

Pos logit contributions
 Fil+0.913
rador+0.894
 NEXT+0.880
psey+0.857
ername+0.817
Neg logit contributions
netflix-1.402
��-1.353
-1.340
��-1.338
voy-1.333
Token second
Token ID1218
Feature activation-3.45e-01

Pos logit contributions
netflix+1.356
+1.314
��+1.312
voy+1.307
��+1.294
Neg logit contributions
 Fil-1.004
rador-0.986
 NEXT-0.968
psey-0.936
 Cle-0.892
Token-
Token ID12
Feature activation-2.54e-01

Pos logit contributions
 Fil+2.498
 NEXT+2.322
rador+2.316
psey+2.201
 Cle+2.191
Neg logit contributions
netflix-4.227
��-4.107
��-4.092
voy-4.076
-4.057
Tokenranked
Token ID28282
Feature activation-9.12e-02

Pos logit contributions
 Fil+2.213
rador+2.148
 NEXT+2.069
psey+2.039
arning+1.995
Neg logit contributions
netflix-2.652
��-2.609
-2.582
��-2.579
 Solitaire-2.561
Token guard
Token ID4860
Feature activation-9.01e-02

Pos logit contributions
 Fil+0.668
rador+0.638
 NEXT+0.637
psey+0.605
 Cle+0.589
Neg logit contributions
netflix-1.142
-1.098
��-1.090
voy-1.088
��-1.077
Token,
Token ID11
Feature activation-1.22e-01

Pos logit contributions
 Fil+0.664
rador+0.651
 NEXT+0.645
psey+0.614
 Cle+0.586
Neg logit contributions
netflix-1.088
��-1.059
-1.045
voy-1.043
��-1.040
Token then
Token ID788
Feature activation+1.50e-01

Pos logit contributions
 Fil+0.875
 NEXT+0.832
rador+0.832
psey+0.789
 Cle+0.771
Neg logit contributions
netflix-1.478
��-1.469
voy-1.459
-1.450
��-1.444
Token par
Token ID1582
Feature activation-1.59e-01

Pos logit contributions
netflix+1.744
��+1.703
voy+1.700
+1.694
��+1.675
Neg logit contributions
 Fil-1.166
rador-1.155
 NEXT-1.111
psey-1.090
 Ake-1.029
Tokenlay
Token ID10724
Feature activation+5.00e-02

Pos logit contributions
 Fil+1.362
rador+1.348
 NEXT+1.321
psey+1.304
 Ake+1.241
Neg logit contributions
netflix-1.696
��-1.645
voy-1.638
-1.636
��-1.625
Tokened
Token ID276
Feature activation-1.37e-02

Pos logit contributions
netflix+0.476
��+0.462
voy+0.457
+0.455
��+0.450
Neg logit contributions
 Fil-0.492
rador-0.491
 NEXT-0.475
psey-0.469
ername-0.449
Token,
Token ID11
Feature activation-1.87e-01

Pos logit contributions
 Fil+1.065
rador+1.034
 NEXT+1.030
psey+0.984
 Ake+0.937
Neg logit contributions
netflix-1.751
��-1.684
-1.671
��-1.664
voy-1.664
Token 27
Token ID2681
Feature activation-3.75e-01

Pos logit contributions
 Fil+1.383
rador+1.314
 NEXT+1.289
psey+1.243
 Ake+1.217
Neg logit contributions
netflix-2.287
��-2.237
voy-2.204
-2.191
��-2.183
Token-
Token ID12
Feature activation-1.09e-01

Pos logit contributions
 Fil+2.585
 NEXT+2.490
rador+2.463
psey+2.228
 Ake+2.215
Neg logit contributions
netflix-4.772
��-4.571
��-4.553
voy-4.462
-4.432
Token21
Token ID2481
Feature activation+2.03e-01

Pos logit contributions
 Fil+0.995
rador+0.980
 NEXT+0.951
psey+0.907
ername+0.898
Neg logit contributions
��-1.123
netflix-1.116
��-1.112
-1.087
voy-1.062
Token.
Token ID13
Feature activation+9.38e-02

Pos logit contributions
netflix+2.214
+2.171
��+2.169
voy+2.165
��+2.137
Neg logit contributions
rador-1.688
 Fil-1.656
psey-1.593
 NEXT-1.587
ername-1.502
Token
Token ID198
Feature activation-6.11e-02

Pos logit contributions
netflix+1.057
��+1.034
voy+1.027
+1.025
��+1.015
Neg logit contributions
 Fil-0.755
rador-0.753
 NEXT-0.724
psey-0.709
ername-0.673
Token
Token ID198
Feature activation-1.15e-01

Pos logit contributions
 Fil+0.435
rador+0.414
 NEXT+0.408
 Cle+0.373
psey+0.373
Neg logit contributions
netflix-0.754
��-0.753
��-0.745
-0.736
voy-0.727
TokenWith
Token ID3152
Feature activation-1.54e-01

Pos logit contributions
 Fil+1.029
rador+0.998
 NEXT+0.984
psey+0.933
ername+0.918
Neg logit contributions
netflix-1.180
��-1.164
��-1.157
voy-1.134
 Solitaire-1.114
Token quarterback
Token ID7880
Feature activation+1.19e-01

Pos logit contributions
 Fil+1.301
 NEXT+1.238
rador+1.229
psey+1.173
 Ake+1.163
Neg logit contributions
netflix-1.755
��-1.697
-1.680
voy-1.672
��-1.651
Token Kyle
Token ID14316
Feature activation-9.74e-02

Pos logit contributions
netflix+1.293
��+1.258
+1.251
voy+1.247
��+1.234
Neg logit contributions
 Fil-1.021
rador-1.006
 NEXT-0.974
psey-0.956
 Ake-0.920
Token Morris
Token ID14433
Feature activation+1.38e-01

Pos logit contributions
 Fil+0.889
rador+0.841
 NEXT+0.826
 Ake+0.801
psey+0.801
Neg logit contributions
netflix-1.046
��-1.020
-1.012
��-0.988
voy-0.988
Tokenms
Token ID907
Feature activation+6.00e-02

Pos logit contributions
netflix+1.707
voy+1.629
+1.593
��+1.592
zzo+1.576
Neg logit contributions
rador-3.609
 Fil-3.578
 NEXT-3.491
psey-3.484
ername-3.358
Token was
Token ID373
Feature activation-1.54e-01

Pos logit contributions
netflix+0.691
��+0.676
+0.671
��+0.668
voy+0.666
Neg logit contributions
 Fil-0.479
rador-0.470
 NEXT-0.450
psey-0.442
 Ake-0.421
Token still
Token ID991
Feature activation-2.82e-01

Pos logit contributions
 Fil+1.272
rador+1.198
 NEXT+1.157
psey+1.129
 Cle+1.122
Neg logit contributions
netflix-1.755
��-1.732
��-1.715
-1.710
voy-1.680
Token alive
Token ID6776
Feature activation-4.51e-02

Pos logit contributions
 Fil+2.144
rador+1.970
 NEXT+1.874
 Cle+1.843
 Ake+1.823
Neg logit contributions
��-3.393
netflix-3.389
-3.354
��-3.343
voy-3.246
Token -
Token ID532
Feature activation+1.99e-01

Pos logit contributions
rador+0.362
 Fil+0.361
 NEXT+0.345
psey+0.341
ername+0.323
Neg logit contributions
netflix-0.506
��-0.497
voy-0.494
-0.494
��-0.489
Token a
Token ID257
Feature activation-1.34e-01

Pos logit contributions
netflix+2.146
��+2.095
+2.086
voy+2.086
��+2.059
Neg logit contributions
rador-1.697
 Fil-1.686
 NEXT-1.615
psey-1.605
ername-1.517
Token music
Token ID2647
Feature activation+5.67e-02

Pos logit contributions
 Fil+1.116
rador+1.080
 NEXT+1.014
 Cle+0.985
psey+0.984
Neg logit contributions
��-1.513
netflix-1.496
��-1.482
-1.472
voy-1.435
Token critic
Token ID4014
Feature activation+8.47e-02

Pos logit contributions
netflix+0.587
��+0.574
+0.568
��+0.566
voy+0.561
Neg logit contributions
rador-0.514
 Fil-0.511
 NEXT-0.489
psey-0.485
outube-0.464
Token for
Token ID329
Feature activation+2.86e-02

Pos logit contributions
netflix+0.912
+0.891
��+0.890
��+0.877
voy+0.876
Neg logit contributions
 Fil-0.733
rador-0.728
 NEXT-0.693
psey-0.687
 Ake-0.655
Token the
Token ID262
Feature activation+1.53e-01

Pos logit contributions
netflix+0.321
��+0.315
+0.314
voy+0.313
��+0.309
Neg logit contributions
 Fil-0.236
rador-0.232
 NEXT-0.224
psey-0.218
 Ake-0.208
Token New
Token ID968
Feature activation-3.72e-02

Pos logit contributions
netflix+1.664
��+1.625
+1.618
voy+1.610
��+1.595
Neg logit contributions
 Fil-1.311
rador-1.306
 NEXT-1.254
psey-1.230
outube-1.170
Token you
Token ID345
Feature activation+1.98e-01

Pos logit contributions
 Fil+0.297
rador+0.289
 NEXT+0.284
psey+0.275
outube+0.264
Neg logit contributions
netflix-0.312
��-0.307
-0.305
��-0.303
voy-0.303
Token view
Token ID1570
Feature activation-1.97e-01

Pos logit contributions
netflix+2.065
��+2.008
+2.002
voy+1.999
��+1.972
Neg logit contributions
rador-1.763
 Fil-1.755
 NEXT-1.689
psey-1.670
ername-1.587
Token the
Token ID262
Feature activation-1.70e-01

Pos logit contributions
 Fil+1.601
 NEXT+1.503
rador+1.474
psey+1.421
 Cle+1.395
Neg logit contributions
��-2.227
netflix-2.226
-2.212
��-2.190
voy-2.185
Token Seahawks
Token ID16732
Feature activation-3.29e-01

Pos logit contributions
 Fil+1.360
 NEXT+1.308
rador+1.271
psey+1.239
 Cle+1.189
Neg logit contributions
netflix-1.967
-1.941
��-1.934
��-1.908
voy-1.906
Token historically
Token ID15074
Feature activation-1.42e-01

Pos logit contributions
 Fil+2.435
 NEXT+2.388
rador+2.312
psey+2.193
 Cle+2.138
Neg logit contributions
netflix-3.897
-3.788
��-3.773
voy-3.718
��-3.716
Token if
Token ID611
Feature activation-5.55e-01

Pos logit contributions
 Fil+1.025
rador+1.004
 NEXT+0.992
psey+0.925
 Cle+0.901
Neg logit contributions
netflix-1.722
��-1.684
��-1.681
-1.676
voy-1.658
Token they
Token ID484
Feature activation-3.76e-01

Pos logit contributions
 Fil+4.496
 NEXT+4.148
 Cle+4.060
 Ake+3.998
rador+3.986
Neg logit contributions
netflix-6.135
-6.033
��-5.944
��-5.928
voy-5.845
Token repeat
Token ID9585
Feature activation-4.53e-01

Pos logit contributions
 Fil+2.716
 NEXT+2.633
rador+2.505
psey+2.395
 Ake+2.347
Neg logit contributions
netflix-4.510
-4.430
��-4.400
��-4.361
voy-4.352
Token as
Token ID355
Feature activation-3.52e-01

Pos logit contributions
 Fil+3.243
 NEXT+3.049
 Cle+2.920
rador+2.868
 Ake+2.842
Neg logit contributions
-5.384
netflix-5.381
voy-5.378
��-5.359
��-5.352
Token Super
Token ID3115
Feature activation-3.99e-01

Pos logit contributions
 Fil+2.682
 NEXT+2.560
rador+2.460
 Cle+2.382
psey+2.366
Neg logit contributions
netflix-4.163
-4.096
��-4.042
��-3.991
voy-3.958
Token Bowl
Token ID8693
Feature activation-3.51e-01

Pos logit contributions
 Fil+2.448
rador+2.335
 NEXT+2.294
psey+2.193
 Cle+2.123
Neg logit contributions
netflix-5.256
-5.182
��-5.075
voy-5.012
��-5.010
Token against
Token ID1028
Feature activation+3.19e-02

Pos logit contributions
netflix+0.628
��+0.611
+0.609
voy+0.609
��+0.597
Neg logit contributions
 Fil-0.488
rador-0.487
 NEXT-0.469
psey-0.461
 Ake-0.436
Token Rep
Token ID1432
Feature activation+1.55e-01

Pos logit contributions
netflix+0.360
+0.351
��+0.351
voy+0.348
��+0.345
Neg logit contributions
 Fil-0.266
rador-0.257
 NEXT-0.251
psey-0.243
 Ake-0.234
Token.
Token ID13
Feature activation+2.15e-01

Pos logit contributions
netflix+1.727
voy+1.682
��+1.680
+1.679
��+1.654
Neg logit contributions
rador-1.253
 Fil-1.237
 NEXT-1.187
psey-1.186
ername-1.116
Token Bill
Token ID3941
Feature activation-3.47e-01

Pos logit contributions
netflix+2.362
��+2.303
voy+2.297
+2.296
��+2.265
Neg logit contributions
rador-1.796
 Fil-1.776
 NEXT-1.705
psey-1.697
ername-1.603
Token Cassidy
Token ID41373
Feature activation-1.86e-01

Pos logit contributions
 Fil+3.153
 NEXT+2.878
 Cle+2.854
rador+2.786
 Ake+2.675
Neg logit contributions
netflix-3.922
��-3.743
��-3.692
-3.690
voy-3.556
Token,
Token ID11
Feature activation-5.77e-01

Pos logit contributions
 Fil+1.425
rador+1.377
 NEXT+1.372
psey+1.263
 Ake+1.257
Neg logit contributions
netflix-2.169
��-2.136
-2.106
voy-2.099
��-2.084
Token
Token ID564
Feature activation-2.14e-01

Pos logit contributions
 Fil+4.504
 NEXT+4.223
rador+4.077
 Cle+3.932
 Ake+3.896
Neg logit contributions
netflix-6.533
-6.458
��-6.382
��-6.271
voy-6.260
Token
Token ID250
Feature activation-2.58e-01

Pos logit contributions
rador+2.272
 Fil+2.234
 NEXT+2.177
arning+2.093
ername+2.092
Neg logit contributions
netflix-1.877
voy-1.795
-1.768
 Solitaire-1.748
zzo-1.718
Tokenstanding
Token ID5646
Feature activation-2.29e-01

Pos logit contributions
 Fil+2.242
rador+2.196
 NEXT+2.130
ername+2.052
psey+2.046
Neg logit contributions
netflix-2.721
��-2.671
-2.620
��-2.604
voy-2.579
Token
Token ID447
Feature activation+2.63e-01

Pos logit contributions
 Fil+1.755
 NEXT+1.660
rador+1.637
psey+1.565
ername+1.557
Neg logit contributions
netflix-2.704
��-2.668
-2.666
voy-2.658
��-2.645
Token
Token ID251
Feature activation-1.79e-01

Pos logit contributions
netflix+2.181
��+2.142
+2.113
voy+2.111
��+2.095
Neg logit contributions
rador-2.870
 Fil-2.849
 NEXT-2.764
psey-2.759
outube-2.633
Token int
Token ID493
Feature activation+3.10e-01

Pos logit contributions
netflix+1.788
��+1.738
voy+1.735
+1.733
��+1.713
Neg logit contributions
rador-1.456
 Fil-1.427
psey-1.385
 NEXT-1.365
perty-1.307
Token f
Token ID277
Feature activation-3.38e-02

Pos logit contributions
netflix+3.047
voy+2.958
+2.928
��+2.901
zzo+2.892
Neg logit contributions
rador-2.912
psey-2.805
 Fil-2.794
 NEXT-2.721
perty-2.677
Tokend
Token ID67
Feature activation+1.32e-02

Pos logit contributions
 Fil+0.235
rador+0.235
 NEXT+0.221
psey+0.218
ername+0.206
Neg logit contributions
netflix-0.416
��-0.414
-0.410
��-0.406
voy-0.402
Token |
Token ID930
Feature activation-4.08e-01

Pos logit contributions
netflix+0.137
��+0.136
��+0.134
+0.134
voy+0.133
Neg logit contributions
 Fil-0.119
rador-0.117
 NEXT-0.114
psey-0.109
ername-0.106
Token <
Token ID1279
Feature activation-1.35e-01

Pos logit contributions
 Fil+3.777
 NEXT+3.711
rador+3.642
psey+3.437
 Cle+3.389
Neg logit contributions
netflix-4.020
��-3.990
voy-3.981
-3.958
��-3.872
Token--
Token ID438
Feature activation-5.56e-01

Pos logit contributions
 Fil+1.072
rador+1.038
 NEXT+1.022
psey+0.963
redo+0.947
Neg logit contributions
netflix-1.529
��-1.528
��-1.516
-1.501
voy-1.466
Token Stack
Token ID23881
Feature activation+1.15e-01

Pos logit contributions
 Fil+4.343
 NEXT+3.986
rador+3.784
 Cle+3.700
 Dat+3.632
Neg logit contributions
netflix-6.488
voy-6.471
-6.467
��-6.384
��-6.269
Token pointer
Token ID17562
Feature activation+2.45e-02

Pos logit contributions
netflix+1.113
��+1.082
voy+1.079
+1.078
��+1.061
Neg logit contributions
rador-1.110
 Fil-1.101
 NEXT-1.062
psey-1.057
ername-1.007
Token +
Token ID1343
Feature activation+1.80e-02

Pos logit contributions
netflix+0.247
��+0.244
+0.240
��+0.239
voy+0.239
Neg logit contributions
 Fil-0.228
rador-0.225
 NEXT-0.220
psey-0.212
 Ake-0.206
Token--------------------
Token ID19351
Feature activation-1.60e-01

Pos logit contributions
netflix+0.159
��+0.154
+0.153
��+0.152
voy+0.152
Neg logit contributions
 Fil-0.189
rador-0.189
 NEXT-0.184
psey-0.180
ername-0.174
Token--+
Token ID44785
Feature activation+2.14e-01

Pos logit contributions
 Fil+1.511
rador+1.497
 NEXT+1.461
psey+1.405
 Ake+1.396
Neg logit contributions
netflix-1.606
��-1.556
-1.529
voy-1.521
��-1.511
Tokenk
Token ID74
Feature activation-4.43e-01

Pos logit contributions
rador+2.280
 Fil+2.269
yg+2.168
psey+2.149
 NEXT+2.127
Neg logit contributions
netflix-2.495
��-2.449
��-2.440
-2.372
 Solitaire-2.334
TokenA
Token ID32
Feature activation-5.21e-01

Pos logit contributions
 Fil+4.025
rador+4.018
yg+3.864
psey+3.799
 NEXT+3.759
Neg logit contributions
netflix-4.237
��-4.178
��-4.159
-3.984
 Solitaire-3.980
Tokenlf
Token ID1652
Feature activation-4.91e-01

Pos logit contributions
 Fil+4.420
rador+4.374
yg+4.257
psey+4.184
ername+4.080
Neg logit contributions
��-5.361
��-5.280
netflix-5.258
-4.957
 Solitaire-4.943
Tokenax
Token ID897
Feature activation-6.83e-01

Pos logit contributions
 Fil+4.377
rador+4.367
yg+4.264
psey+4.116
 NEXT+4.097
Neg logit contributions
netflix-4.750
��-4.702
��-4.673
-4.454
 Solitaire-4.418
TokenOb
Token ID5944
Feature activation-4.06e-01

Pos logit contributions
 Fil+5.942
yg+5.847
rador+5.799
 NEXT+5.558
outube+5.489
Neg logit contributions
netflix-6.451
��-6.378
��-6.345
-6.022
 Solitaire-5.952
Tokenpg
Token ID6024
Feature activation-5.45e-01

Pos logit contributions
 Fil+3.663
rador+3.659
yg+3.499
 NEXT+3.439
psey+3.436
Neg logit contributions
netflix-4.003
��-3.889
��-3.866
-3.740
 Solitaire-3.693
Token after
Token ID706
Feature activation-3.31e-01

Pos logit contributions
 Fil+4.904
rador+4.680
 NEXT+4.652
yg+4.537
outube+4.478
Neg logit contributions
netflix-5.372
��-5.137
��-5.101
voy-4.966
-4.965
Token my
Token ID616
Feature activation-2.07e-01

Pos logit contributions
 Fil+2.627
 NEXT+2.557
rador+2.459
psey+2.397
 Cle+2.346
Neg logit contributions
netflix-3.797
��-3.700
voy-3.629
-3.612
��-3.570
Token finals
Token ID18576
Feature activation-2.32e-01

Pos logit contributions
 Fil+1.655
 NEXT+1.612
rador+1.572
psey+1.507
outube+1.481
Neg logit contributions
netflix-2.390
��-2.302
-2.302
voy-2.283
��-2.243
Token and
Token ID290
Feature activation+1.71e-01

Pos logit contributions
 Fil+1.768
 NEXT+1.727
rador+1.693
psey+1.603
outube+1.565
Neg logit contributions
netflix-2.756
��-2.659
voy-2.640
-2.638
��-2.592
Token would
Token ID561
Feature activation+1.57e-01

Pos logit contributions
netflix+1.937
��+1.890
+1.882
voy+1.880
��+1.857
Neg logit contributions
rador-1.370
 Fil-1.367
 NEXT-1.309
psey-1.293
outube-1.221
Token Club
Token ID6289
Feature activation-3.00e-01

Pos logit contributions
netflix+1.690
��+1.646
voy+1.640
+1.638
��+1.623
Neg logit contributions
 Fil-1.325
rador-1.323
 NEXT-1.270
psey-1.258
ername-1.186
Token's
Token ID338
Feature activation+9.24e-02

Pos logit contributions
 Fil+2.195
rador+2.126
 NEXT+2.075
 Cle+1.937
psey+1.924
Neg logit contributions
netflix-3.633
��-3.550
��-3.522
-3.520
voy-3.490
Token decision
Token ID2551
Feature activation-2.74e-01

Pos logit contributions
netflix+1.053
��+1.026
voy+1.022
+1.020
��+1.013
Neg logit contributions
 Fil-0.744
rador-0.737
 NEXT-0.710
psey-0.701
 Cle-0.657
Token makers
Token ID14429
Feature activation-2.30e-01

Pos logit contributions
 Fil+1.984
rador+1.965
 NEXT+1.836
psey+1.826
arning+1.727
Neg logit contributions
netflix-3.346
��-3.292
��-3.236
voy-3.184
-3.173
Token is
Token ID318
Feature activation-6.58e-01

Pos logit contributions
 Fil+1.732
rador+1.706
 NEXT+1.642
psey+1.561
 Cle+1.538
Neg logit contributions
netflix-2.707
��-2.678
-2.654
��-2.641
voy-2.636
Token a
Token ID257
Feature activation-5.37e-01

Pos logit contributions
 Fil+4.170
 NEXT+3.864
 Cle+3.774
 demonstr+3.766
rador+3.650
Neg logit contributions
netflix-8.413
��-8.052
��-7.972
voy-7.912
-7.739
Token very
Token ID845
Feature activation-2.14e-01

Pos logit contributions
 Fil+3.431
 NEXT+3.223
rador+3.124
 Cle+3.088
 fork+2.969
Neg logit contributions
��-6.911
netflix-6.784
��-6.753
-6.589
voy-6.553
Token positive
Token ID3967
Feature activation-7.22e-01

Pos logit contributions
 Fil+1.387
rador+1.317
 NEXT+1.308
 Cle+1.232
psey+1.229
Neg logit contributions
netflix-2.792
��-2.736
-2.702
��-2.695
voy-2.664
Token step
Token ID2239
Feature activation-8.89e-02

Pos logit contributions
 Fil+3.928
 NEXT+3.889
 Ake+3.425
 Cle+3.421
 demonstr+3.385
Neg logit contributions
��-9.942
netflix-9.704
��-9.541
-9.367
zzo-9.182
Token.
Token ID13
Feature activation-1.90e-01

Pos logit contributions
 Fil+0.664
rador+0.659
 NEXT+0.636
psey+0.617
 Ake+0.582
Neg logit contributions
netflix-1.047
��-1.039
��-1.034
-1.024
voy-1.016
Token
Token ID198
Feature activation-2.32e-01

Pos logit contributions
 Fil+1.428
 NEXT+1.341
rador+1.292
 Cle+1.262
 Ake+1.230
Neg logit contributions
��-2.324
netflix-2.303
��-2.277
voy-2.262
-2.210