TOP ACTIVATIONS
MAX = 9.589

. And the game lightly discour ages players from using the
of its orders from the propag and ists at Fox News
secretary - general of the Diplom atic Council of Catalonia (
owned by Alex Jones that propag ates conspiracy theories like the
we investigated the in vivo pharmac ological profile of N -
problem . They also thought strategically and designed and optimized their
diplomats who hail not from antagon istic states , but from
against Kul ik was widely propag and ized , which created
s also quite a bit swe eter so you may want
Intelligent , Techn ophile , Diplom atic - Susp
at one point Mrs . Obama urged kids to
their behavior when miners behave strategically is an open question .
asks Stephen , examining a peac ock 's tail feathers in
great things : Decre ases blood glucose .
child is transgender , society inst ills in them rules about
Has bro , Barbie , Lego , Toys R Us ,
Virginia . Di plom atic cables are internal documents
, investors risk paying an artificially high price .
it is possible to perform pharmac ological and elect roph ys
difference is that Bac ile indirectly and inadvertently infl amed people

MIN ACTIVATIONS
MIN = -10.694

the Today show 's Savannah Guth rie who brought up that
ross . Carlton L . Guth rie . Marion T .
by NBC News ' Savannah Guth rie ( right ) to
ist has cut Mr . Gian fort e s
while . So I 'm sitting there and thinking like ,
colleague , Bed rett in Gund uz , who had helped
shooting last December -- were seated in the VIP section near
here . -- Seth Abram ov itch
the King 's judgment )." Sah ih al - B uk
vm ash ship yard at Sever od v insk , which
last week that Mr . Gian fort e has sent mixed
, many people were already seated and a few were checking
to film producer Sub hash Gh ai , the Supreme Court
private property was sat ir ical .
fish debuted at Sund ance , was acquired by
This was are you sitting down ? a lie
it 's non stop awesome grappling hook aw es omen ess
March case , and Yuri Kul iy s case
and gender identities © 2014 Seth Jam ison Rain ess
Jackson , who was sitting next to him on the

INTERVAL 4.518 - 9.589
CONTAINS 0.430%

if the process was more gradual ? This is the more
View and just have some accidentally - ve gan Heat Wave
step would produce trans uran ics such
cus , and My c opl asma hom inis which replace
some communities seeing double - digit gains , as buyers are

INTERVAL -0.552 - 4.518
CONTAINS 70.450%

) was sent by one of the A 330 's three
last two decades , and he had never heard of the
a reasonable contract for the next three years , he
Grey ? Let us know in the comments below …. <|endoftext|>
it ? It s Ins ane Clown Pos se

INTERVAL -5.623 - -0.552
CONTAINS 28.974%

own without fish as McDonald 's is the official chip maker
a massive traffic jam . Then , while they
bright clear wedge of lime rises up and through . K
с к а я е д а л ь
He gave up soccer for good when he was about 13

INTERVAL -10.694 - -5.623
CONTAINS 0.146%

of what that is . So yeah , just finding things
blurry photo of a man sitting in a lawn chair .
el an . So if you 're interested in
worlds seemed to stop . So much so , that we
training entails . So it s time
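
Note on the INTERVAL ranges above: the printed edges look like four equal-width bins spanning the reported MIN (-10.694) to MAX (9.589). This is an inference from the numbers, not something the dump states. A minimal Python sketch under that assumption follows; the function name `equal_width_edges` and the bin count of 4 are illustrative assumptions, not part of the original tool.

```python
def equal_width_edges(lo, hi, n_bins):
    """Return n_bins + 1 evenly spaced bin edges from lo to hi."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]


# MIN and MAX are copied from the dump; n_bins=4 is an assumption
# inferred from the four INTERVAL blocks.
edges = equal_width_edges(-10.694, 9.589, 4)

# Print each bin in the same "INTERVAL a - b" layout the dump uses.
for left, right in zip(edges[:-1], edges[1:]):
    print(f"INTERVAL {left:.3f} - {right:.3f}")
```

The computed edges agree with the printed ones to within rounding of the last digit. The CONTAINS percentages (0.430, 70.450, 28.974, 0.146) sum to 100, which is consistent with each sampled activation falling in exactly one bin.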
Token.
Token ID13
Feature activation-4.27e-01

Pos logit contributions
+0.144
OPLE+0.134
 citiz+0.131
tv+0.129
 refunds+0.126
Neg logit contributions
ophen-0.114
onent-0.112
cial-0.111
reated-0.106
hemat-0.104
Token And
Token ID843
Feature activation-3.05e+00

Pos logit contributions
+0.303
OPLE+0.280
 citiz+0.270
tv+0.269
 refunds+0.260
Neg logit contributions
ophen-0.319
onent-0.313
cial-0.312
reated-0.302
hemat-0.295
Token the
Token ID262
Feature activation-3.52e-01

Pos logit contributions
+2.117
OPLE+1.951
 citiz+1.921
tv+1.881
 refunds+1.854
Neg logit contributions
ophen-2.276
onent-2.253
cial-2.237
reated-2.160
hemat-2.125
Token game
Token ID983
Feature activation-4.58e-01

Pos logit contributions
+0.244
OPLE+0.224
 citiz+0.219
tv+0.215
 refunds+0.210
Neg logit contributions
ophen-0.268
onent-0.264
cial-0.263
reated-0.254
hemat-0.249
Token lightly
Token ID15376
Feature activation+2.87e+00

Pos logit contributions
+0.335
OPLE+0.309
 citiz+0.302
tv+0.298
 refunds+0.292
Neg logit contributions
ophen-0.331
onent-0.325
cial-0.323
reated-0.312
hemat-0.306
Token discour
Token ID15117
Feature activation+9.59e+00

Pos logit contributions
ophen+2.215
onent+2.181
cial+2.167
reated+2.106
hemat+2.056
Neg logit contributions
-1.977
OPLE-1.814
 citiz-1.766
tv-1.713
pour-1.661
Tokenages
Token ID1095
Feature activation+5.45e+00

Pos logit contributions
cial+1.970
ophen+1.950
onent+1.874
reated+1.793
hemat+1.666
Neg logit contributions
-12.014
 citiz-11.372
OPLE-11.302
PDATE-10.970
tv-10.771
Token players
Token ID1938
Feature activation-6.07e-01

Pos logit contributions
ophen+4.087
onent+4.043
cial+3.987
reated+3.882
pite+3.809
Neg logit contributions
-3.754
OPLE-3.444
 citiz-3.326
tv-3.257
mbuds-3.167
Token from
Token ID422
Feature activation+4.77e-01

Pos logit contributions
+0.459
OPLE+0.424
 citiz+0.415
tv+0.407
 refunds+0.399
Neg logit contributions
ophen-0.427
onent-0.420
cial-0.417
reated-0.401
hemat-0.393
Token using
Token ID1262
Feature activation-1.83e+00

Pos logit contributions
ophen+0.358
onent+0.353
cial+0.351
reated+0.339
hemat+0.332
Neg logit contributions
-0.337
OPLE-0.310
 citiz-0.303
tv-0.296
 refunds-0.289
Token the
Token ID262
Feature activation+5.36e-01

Pos logit contributions
+1.265
OPLE+1.160
 citiz+1.136
tv+1.114
 refunds+1.100
Neg logit contributions
ophen-1.393
onent-1.374
cial-1.364
reated-1.319
hemat-1.297
Token of
Token ID286
Feature activation-2.77e-01

Pos logit contributions
+1.873
OPLE+1.750
 citiz+1.712
tv+1.684
 refunds+1.679
Neg logit contributions
ophen-2.015
onent-1.988
cial-1.959
reated-1.917
pite-1.867
Token its
Token ID663
Feature activation-1.90e+00

Pos logit contributions
+0.192
OPLE+0.175
 citiz+0.170
tv+0.166
 refunds+0.162
Neg logit contributions
ophen-0.213
onent-0.210
cial-0.209
reated-0.202
hemat-0.198
Token orders
Token ID6266
Feature activation+2.33e+00

Pos logit contributions
+1.169
OPLE+1.068
 citiz+1.050
tv+1.027
 refunds+1.015
Neg logit contributions
ophen-1.590
onent-1.563
cial-1.556
reated-1.513
hemat-1.489
Token from
Token ID422
Feature activation-1.29e-01

Pos logit contributions
ophen+1.621
onent+1.589
cial+1.586
reated+1.517
hemat+1.493
Neg logit contributions
-1.784
OPLE-1.654
 citiz-1.610
tv-1.577
 refunds-1.530
Token the
Token ID262
Feature activation-2.41e-01

Pos logit contributions
+0.089
OPLE+0.082
 citiz+0.080
tv+0.078
 refunds+0.076
Neg logit contributions
ophen-0.099
onent-0.097
cial-0.097
reated-0.094
hemat-0.092
Token propag
Token ID8928
Feature activation+9.29e+00

Pos logit contributions
+0.158
OPLE+0.145
 citiz+0.141
tv+0.139
 refunds+0.135
Neg logit contributions
ophen-0.192
onent-0.189
cial-0.188
reated-0.182
hemat-0.179
Tokenand
Token ID392
Feature activation-1.11e-01

Pos logit contributions
ophen+3.902
onent+3.881
cial+3.820
reated+3.560
hemat+3.442
Neg logit contributions
-9.492
 citiz-8.893
OPLE-8.788
 refunds-8.488
 sugg-8.413
Tokenists
Token ID1023
Feature activation+9.70e-01

Pos logit contributions
+0.111
OPLE+0.105
 citiz+0.103
tv+0.102
 refunds+0.100
Neg logit contributions
ophen-0.051
onent-0.050
cial-0.049
reated-0.046
hemat-0.045
Token at
Token ID379
Feature activation+5.81e-01

Pos logit contributions
ophen+0.631
onent+0.620
cial+0.618
reated+0.591
hemat+0.578
Neg logit contributions
-0.783
OPLE-0.729
 citiz-0.713
tv-0.699
 refunds-0.685
Token Fox
Token ID5426
Feature activation-1.77e+00

Pos logit contributions
ophen+0.434
cial+0.427
onent+0.427
reated+0.410
hemat+0.402
Neg logit contributions
-0.415
OPLE-0.380
 citiz-0.372
tv-0.363
 refunds-0.355
Token News
Token ID3000
Feature activation+2.63e-01

Pos logit contributions
+1.307
OPLE+1.212
 citiz+1.185
tv+1.173
 refunds+1.140
Neg logit contributions
ophen-1.264
onent-1.244
cial-1.233
reated-1.191
pite-1.169
Token secretary
Token ID7705
Feature activation+4.65e+00

Pos logit contributions
ophen+0.139
onent+0.137
cial+0.136
reated+0.131
hemat+0.129
Neg logit contributions
-0.136
OPLE-0.126
 citiz-0.123
tv-0.120
 refunds-0.117
Token-
Token ID12
Feature activation-1.72e-01

Pos logit contributions
ophen+3.121
cial+3.060
onent+3.046
reated+2.879
pite+2.844
Neg logit contributions
-3.634
OPLE-3.325
 citiz-3.214
tv-3.184
 refunds-3.124
Token general
Token ID2276
Feature activation+1.29e+00

Pos logit contributions
+0.144
OPLE+0.134
 citiz+0.132
tv+0.130
 refunds+0.127
Neg logit contributions
ophen-0.106
onent-0.104
cial-0.103
reated-0.099
hemat-0.096
Token of
Token ID286
Feature activation+3.71e-01

Pos logit contributions
ophen+0.728
onent+0.711
cial+0.709
reated+0.672
pite+0.655
Neg logit contributions
-1.159
OPLE-1.082
 citiz-1.060
tv-1.045
 refunds-1.028
Token the
Token ID262
Feature activation-1.34e-01

Pos logit contributions
ophen+0.262
onent+0.257
cial+0.257
reated+0.247
hemat+0.241
Neg logit contributions
-0.280
OPLE-0.258
 citiz-0.252
tv-0.247
 refunds-0.242
Token Diplom
Token ID34905
Feature activation+9.05e+00

Pos logit contributions
+0.102
OPLE+0.094
 citiz+0.092
tv+0.090
 refunds+0.088
Neg logit contributions
ophen-0.093
onent-0.092
cial-0.091
reated-0.088
pite-0.086
Tokenatic
Token ID1512
Feature activation+4.88e+00

Pos logit contributions
ophen+4.395
cial+4.377
onent+4.267
reated+3.998
hemat+3.888
Neg logit contributions
-8.737
OPLE-8.105
 citiz-7.969
 refunds-7.822
tv-7.706
Token Council
Token ID4281
Feature activation+2.55e-01

Pos logit contributions
ophen+3.307
cial+3.250
onent+3.234
reated+3.101
pite+3.024
Neg logit contributions
-3.814
OPLE-3.492
 citiz-3.386
tv-3.326
 refunds-3.277
Token of
Token ID286
Feature activation+6.95e-02

Pos logit contributions
ophen+0.155
onent+0.152
cial+0.151
reated+0.145
hemat+0.141
Neg logit contributions
-0.217
OPLE-0.203
 citiz-0.199
tv-0.195
 refunds-0.192
Token Catalonia
Token ID33859
Feature activation-2.89e+00

Pos logit contributions
ophen+0.047
onent+0.046
cial+0.046
reated+0.044
hemat+0.043
Neg logit contributions
-0.055
OPLE-0.051
 citiz-0.050
tv-0.049
 refunds-0.048
Token (
Token ID357
Feature activation-1.10e+00

Pos logit contributions
+2.193
OPLE+2.066
 citiz+2.022
tv+1.982
 refunds+1.935
Neg logit contributions
ophen-1.974
onent-1.946
cial-1.923
reated-1.850
hemat-1.820
Token owned
Token ID6898
Feature activation+1.51e+00

Pos logit contributions
+0.219
OPLE+0.201
 citiz+0.196
tv+0.192
 refunds+0.187
Neg logit contributions
ophen-0.236
onent-0.232
cial-0.231
reated-0.223
hemat-0.218
Token by
Token ID416
Feature activation+6.39e-01

Pos logit contributions
ophen+1.001
onent+0.996
cial+0.982
reated+0.943
hemat+0.924
Neg logit contributions
-1.183
OPLE-1.108
 citiz-1.076
tv-1.048
 refunds-1.037
Token Alex
Token ID4422
Feature activation-1.09e+00

Pos logit contributions
ophen+0.461
onent+0.454
cial+0.452
reated+0.433
hemat+0.425
Neg logit contributions
-0.472
OPLE-0.435
 citiz-0.426
tv-0.412
 refunds-0.405
Token Jones
Token ID5437
Feature activation+1.31e-01

Pos logit contributions
+0.927
OPLE+0.866
 citiz+0.848
tv+0.831
 refunds+0.817
Neg logit contributions
ophen-0.665
onent-0.652
cial-0.649
reated-0.620
hemat-0.605
Token that
Token ID326
Feature activation-4.35e-01

Pos logit contributions
ophen+0.083
onent+0.082
cial+0.081
reated+0.078
hemat+0.076
Neg logit contributions
-0.107
OPLE-0.100
 citiz-0.098
tv-0.096
 refunds-0.094
Token propag
Token ID8928
Feature activation+9.04e+00

Pos logit contributions
+0.332
OPLE+0.307
 citiz+0.300
tv+0.293
 refunds+0.287
Neg logit contributions
ophen-0.304
onent-0.299
cial-0.297
reated-0.286
hemat-0.280
Tokenates
Token ID689
Feature activation+3.12e+00

Pos logit contributions
onent+2.746
ophen+2.694
reated+2.542
cial+2.513
hemat+2.304
Neg logit contributions
-10.446
 citiz-9.950
OPLE-9.757
PDATE-9.417
 sugg-9.332
Token conspiracy
Token ID10086
Feature activation-4.20e+00

Pos logit contributions
ophen+2.502
onent+2.469
cial+2.449
reated+2.361
pite+2.323
Neg logit contributions
-2.062
OPLE-1.867
 citiz-1.837
tv-1.720
pour-1.714
Token theories
Token ID10946
Feature activation-2.51e+00

Pos logit contributions
+2.041
OPLE+1.817
tv+1.749
 citiz+1.737
 refunds+1.680
Neg logit contributions
ophen-4.015
onent-3.968
cial-3.937
reated-3.855
pite-3.809
Token like
Token ID588
Feature activation-2.98e+00

Pos logit contributions
+1.945
OPLE+1.814
 citiz+1.767
tv+1.763
 refunds+1.725
Neg logit contributions
ophen-1.688
onent-1.661
cial-1.646
reated-1.583
pite-1.551
Token the
Token ID262
Feature activation+2.00e-01

Pos logit contributions
+2.095
OPLE+1.931
tv+1.881
 citiz+1.880
 refunds+1.831
Neg logit contributions
ophen-2.213
onent-2.190
cial-2.175
reated-2.106
pite-2.058
Token we
Token ID356
Feature activation+1.27e+00

Pos logit contributions
ophen+0.037
onent+0.037
cial+0.037
reated+0.035
hemat+0.035
Neg logit contributions
-0.032
OPLE-0.029
 citiz-0.029
tv-0.028
 refunds-0.027
Token investigated
Token ID12565
Feature activation+7.41e-01

Pos logit contributions
ophen+1.000
onent+0.985
cial+0.981
reated+0.949
hemat+0.930
Neg logit contributions
-0.851
OPLE-0.777
 citiz-0.758
tv-0.739
 refunds-0.721
Token the
Token ID262
Feature activation+8.35e-01

Pos logit contributions
ophen+0.549
onent+0.540
cial+0.538
reated+0.519
hemat+0.508
Neg logit contributions
-0.531
OPLE-0.489
 citiz-0.477
tv-0.467
 refunds-0.456
Token in
Token ID287
Feature activation+1.77e+00

Pos logit contributions
ophen+0.574
onent+0.564
cial+0.561
reated+0.539
hemat+0.528
Neg logit contributions
-0.645
OPLE-0.598
 citiz-0.584
tv-0.573
 refunds-0.561
Token vivo
Token ID34714
Feature activation+4.07e+00

Pos logit contributions
ophen+0.914
onent+0.892
cial+0.887
reated+0.841
hemat+0.816
Neg logit contributions
-1.678
OPLE-1.577
 citiz-1.548
tv-1.521
 refunds-1.498
Token pharmac
Token ID25702
Feature activation+8.67e+00

Pos logit contributions
ophen+2.786
cial+2.730
onent+2.707
reated+2.607
hemat+2.536
Neg logit contributions
-3.157
OPLE-2.904
 citiz-2.846
tv-2.787
wine-2.728
Tokenological
Token ID2770
Feature activation+4.46e+00

Pos logit contributions
ophen+4.128
cial+3.738
onent+3.724
reated+3.530
hemat+3.390
Neg logit contributions
-8.645
OPLE-8.205
 citiz-8.130
tv-7.835
 refunds-7.809
Token profile
Token ID7034
Feature activation-1.89e+00

Pos logit contributions
ophen+3.131
onent+3.026
cial+3.019
reated+2.902
pite+2.829
Neg logit contributions
-3.405
OPLE-3.136
 citiz-3.041
tv-2.981
wine-2.899
Token of
Token ID286
Feature activation+9.81e-01

Pos logit contributions
+1.607
OPLE+1.488
 citiz+1.476
tv+1.445
 refunds+1.421
Neg logit contributions
ophen-1.120
onent-1.117
cial-1.106
reated-1.059
hemat-1.039
Token N
Token ID399
Feature activation-3.58e-01

Pos logit contributions
ophen+0.761
cial+0.749
onent+0.748
reated+0.723
hemat+0.706
Neg logit contributions
-0.672
OPLE-0.614
 citiz-0.598
tv-0.585
 refunds-0.571
Token-
Token ID12
Feature activation+6.36e-01

Pos logit contributions
+0.288
OPLE+0.268
 citiz+0.262
tv+0.259
 refunds+0.252
Neg logit contributions
ophen-0.233
onent-0.228
cial-0.227
reated-0.218
hemat-0.213
Token problem
Token ID1917
Feature activation-1.54e+00

Pos logit contributions
+0.233
OPLE+0.213
 citiz+0.208
tv+0.203
 refunds+0.199
Neg logit contributions
ophen-0.275
onent-0.271
cial-0.270
reated-0.261
hemat-0.256
Token.
Token ID13
Feature activation+2.04e-02

Pos logit contributions
+1.244
OPLE+1.157
 citiz+1.130
tv+1.111
 refunds+1.091
Neg logit contributions
ophen-1.007
onent-0.989
cial-0.982
reated-0.946
hemat-0.921
Token They
Token ID1119
Feature activation+3.96e-01

Pos logit contributions
ophen+0.015
onent+0.015
cial+0.015
reated+0.014
pite+0.014
Neg logit contributions
-0.015
OPLE-0.013
 citiz-0.013
tv-0.013
 refunds-0.012
Token also
Token ID635
Feature activation-5.80e-02

Pos logit contributions
ophen+0.302
onent+0.297
cial+0.296
reated+0.286
hemat+0.280
Neg logit contributions
-0.276
OPLE-0.254
 citiz-0.247
tv-0.242
 refunds-0.236
Token thought
Token ID1807
Feature activation+1.40e+00

Pos logit contributions
+0.039
OPLE+0.036
 citiz+0.035
tv+0.034
 refunds+0.033
Neg logit contributions
ophen-0.046
onent-0.045
cial-0.045
reated-0.043
hemat-0.043
Token strategically
Token ID34235
Feature activation+8.44e+00

Pos logit contributions
ophen+0.996
onent+0.979
cial+0.974
reated+0.936
hemat+0.921
Neg logit contributions
-1.047
OPLE-0.966
 citiz-0.943
tv-0.923
 refunds-0.902
Token and
Token ID290
Feature activation+4.66e-01

Pos logit contributions
ophen+5.003
cial+4.968
onent+4.950
reated+4.645
hemat+4.536
Neg logit contributions
-6.975
OPLE-6.468
 citiz-6.344
tv-6.201
mbuds-6.082
Token designed
Token ID3562
Feature activation+3.39e+00

Pos logit contributions
ophen+0.335
onent+0.330
cial+0.328
reated+0.316
hemat+0.310
Neg logit contributions
-0.344
OPLE-0.318
 citiz-0.310
tv-0.304
 refunds-0.298
Token and
Token ID290
Feature activation+2.86e-01

Pos logit contributions
ophen+2.495
onent+2.460
cial+2.456
reated+2.354
hemat+2.319
Neg logit contributions
-2.443
OPLE-2.249
 citiz-2.187
tv-2.142
pour-2.082
Token optimized
Token ID23392
Feature activation+1.75e+00

Pos logit contributions
ophen+0.200
onent+0.196
cial+0.195
reated+0.188
hemat+0.184
Neg logit contributions
-0.216
OPLE-0.200
 citiz-0.195
tv-0.192
 refunds-0.188
Token their
Token ID511
Feature activation-4.19e-01

Pos logit contributions
ophen+1.307
onent+1.288
cial+1.283
reated+1.235
hemat+1.212
Neg logit contributions
-1.244
OPLE-1.145
 citiz-1.116
tv-1.092
 refunds-1.060
Token diplomats
Token ID24836
Feature activation+2.56e+00

Pos logit contributions
ophen+1.633
cial+1.612
onent+1.609
reated+1.544
pite+1.510
Neg logit contributions
-1.625
OPLE-1.491
 citiz-1.446
tv-1.419
pour-1.380
Token who
Token ID508
Feature activation+6.91e-02

Pos logit contributions
ophen+1.697
cial+1.690
onent+1.656
reated+1.617
hemat+1.573
Neg logit contributions
-2.009
OPLE-1.890
 citiz-1.829
tv-1.786
 refunds-1.741
Token hail
Token ID32405
Feature activation-2.07e+00

Pos logit contributions
ophen+0.052
onent+0.052
cial+0.051
reated+0.049
hemat+0.049
Neg logit contributions
-0.048
OPLE-0.045
 citiz-0.043
tv-0.043
 refunds-0.042
Token not
Token ID407
Feature activation-3.93e-01

Pos logit contributions
+1.507
OPLE+1.395
 citiz+1.364
tv+1.339
 refunds+1.314
Neg logit contributions
ophen-1.505
onent-1.486
cial-1.463
reated-1.416
hemat-1.391
Token from
Token ID422
Feature activation-1.89e-01

Pos logit contributions
+0.277
OPLE+0.255
 citiz+0.249
tv+0.244
 refunds+0.238
Neg logit contributions
ophen-0.294
onent-0.291
cial-0.289
reated-0.278
hemat-0.273
Token antagon
Token ID23775
Feature activation+8.27e+00

Pos logit contributions
+0.137
OPLE+0.126
 citiz+0.123
tv+0.121
 refunds+0.118
Neg logit contributions
ophen-0.139
onent-0.137
cial-0.136
reated-0.131
hemat-0.129
Tokenistic
Token ID2569
Feature activation+3.41e+00

Pos logit contributions
cial+2.785
ophen+2.753
onent+2.548
reated+2.322
obic+2.320
Neg logit contributions
-9.070
OPLE-8.757
 citiz-8.573
tv-8.354
 refunds-8.346
Token states
Token ID2585
Feature activation-1.34e-02

Pos logit contributions
ophen+2.822
cial+2.795
onent+2.782
reated+2.689
pite+2.632
Neg logit contributions
-2.133
OPLE-1.921
 citiz-1.856
tv-1.814
 refunds-1.765
Token,
Token ID11
Feature activation-3.07e-02

Pos logit contributions
+0.010
OPLE+0.009
 citiz+0.009
tv+0.008
 refunds+0.008
Neg logit contributions
ophen-0.010
onent-0.010
cial-0.010
reated-0.010
hemat-0.009
Token but
Token ID475
Feature activation+3.56e-01

Pos logit contributions
+0.021
OPLE+0.020
 citiz+0.019
tv+0.019
 refunds+0.018
Neg logit contributions
ophen-0.024
onent-0.023
cial-0.023
reated-0.022
hemat-0.022
Token from
Token ID422
Feature activation-3.69e-02

Pos logit contributions
ophen+0.252
onent+0.248
cial+0.247
reated+0.238
hemat+0.233
Neg logit contributions
-0.266
OPLE-0.246
 citiz-0.241
tv-0.236
 refunds-0.231
Token against
Token ID1028
Feature activation+2.99e-01

Pos logit contributions
ophen+1.151
onent+1.131
cial+1.125
reated+1.082
hemat+1.058
Neg logit contributions
-1.292
OPLE-1.197
 citiz-1.170
tv-1.148
 refunds-1.125
Token Kul
Token ID50024
Feature activation-7.43e+00

Pos logit contributions
ophen+0.212
onent+0.209
cial+0.208
reated+0.200
hemat+0.196
Neg logit contributions
-0.224
OPLE-0.207
 citiz-0.202
tv-0.198
 refunds-0.194
Tokenik
Token ID1134
Feature activation-8.86e-01

Pos logit contributions
+5.696
tv+5.633
OPLE+5.480
 citiz+5.358
pour+5.316
Neg logit contributions
ophen-4.386
cial-4.282
onent-4.278
reated-4.142
ulet-4.011
Token was
Token ID373
Feature activation-7.40e-01

Pos logit contributions
+0.796
OPLE+0.753
 citiz+0.737
tv+0.731
pour+0.713
Neg logit contributions
ophen-0.486
onent-0.476
cial-0.474
reated-0.452
hemat-0.438
Token widely
Token ID6768
Feature activation+7.12e-02

Pos logit contributions
+0.528
OPLE+0.487
 citiz+0.478
tv+0.473
 refunds+0.460
Neg logit contributions
ophen-0.544
onent-0.536
cial-0.530
reated-0.518
hemat-0.505
Token propag
Token ID8928
Feature activation+8.26e+00

Pos logit contributions
ophen+0.052
onent+0.051
cial+0.051
reated+0.049
hemat+0.048
Neg logit contributions
-0.052
OPLE-0.048
 citiz-0.046
tv-0.046
 refunds-0.045
Tokenand
Token ID392
Feature activation-7.46e-01

Pos logit contributions
onent+2.515
ophen+2.505
cial+2.367
reated+2.243
hemat+2.053
Neg logit contributions
-9.601
 citiz-9.285
OPLE-9.181
 refunds-8.749
PDATE-8.676
Tokenized
Token ID1143
Feature activation-4.47e-01

Pos logit contributions
+0.847
OPLE+0.803
tv+0.791
 citiz+0.785
pour+0.770
Neg logit contributions
ophen-0.231
onent-0.225
cial-0.216
reated-0.200
pite-0.197
Token,
Token ID11
Feature activation-4.96e-01

Pos logit contributions
+0.356
OPLE+0.332
 citiz+0.325
tv+0.321
 refunds+0.314
Neg logit contributions
ophen-0.294
onent-0.287
cial-0.285
reated-0.275
hemat-0.269
Token which
Token ID543
Feature activation+1.02e+00

Pos logit contributions
+0.361
OPLE+0.335
 citiz+0.326
tv+0.322
 refunds+0.314
Neg logit contributions
ophen-0.359
onent-0.353
cial-0.351
reated-0.340
hemat-0.332
Token created
Token ID2727
Feature activation-5.06e-01

Pos logit contributions
ophen+0.709
onent+0.697
cial+0.693
reated+0.668
hemat+0.653
Neg logit contributions
-0.781
OPLE-0.724
 citiz-0.709
tv-0.695
 refunds-0.682
Tokens
Token ID82
Feature activation+6.56e-01

Pos logit contributions
+0.274
OPLE+0.247
 citiz+0.239
tv+0.238
 refunds+0.230
Neg logit contributions
ophen-0.392
onent-0.387
cial-0.385
reated-0.373
hemat-0.367
Token also
Token ID635
Feature activation-2.29e-01

Pos logit contributions
ophen+0.499
onent+0.492
cial+0.489
reated+0.473
hemat+0.464
Neg logit contributions
-0.457
OPLE-0.419
 citiz-0.409
tv-0.401
 refunds-0.393
Token quite
Token ID2407
Feature activation-1.12e+00

Pos logit contributions
+0.157
OPLE+0.144
 citiz+0.141
tv+0.138
 refunds+0.135
Neg logit contributions
ophen-0.176
onent-0.173
cial-0.172
reated-0.167
hemat-0.164
Token a
Token ID257
Feature activation-8.43e-02

Pos logit contributions
+0.761
OPLE+0.690
 citiz+0.680
tv+0.669
 refunds+0.666
Neg logit contributions
ophen-0.852
onent-0.840
cial-0.827
reated-0.813
hemat-0.795
Token bit
Token ID1643
Feature activation+6.81e-01

Pos logit contributions
+0.061
OPLE+0.056
 citiz+0.055
tv+0.054
 refunds+0.053
Neg logit contributions
ophen-0.061
onent-0.060
cial-0.059
reated-0.058
hemat-0.057
Token swe
Token ID3490
Feature activation+8.24e+00

Pos logit contributions
ophen+0.490
onent+0.483
cial+0.480
reated+0.464
hemat+0.455
Neg logit contributions
-0.499
OPLE-0.461
 citiz-0.451
tv-0.442
 refunds-0.434
Tokeneter
Token ID2357
Feature activation+1.63e+00

Pos logit contributions
ophen+2.637
cial+2.502
onent+2.481
reated+2.213
pite+2.180
Neg logit contributions
-9.563
 citiz-9.133
OPLE-9.074
PDATE-8.623
 refunds-8.614
Token so
Token ID523
Feature activation-4.54e+00

Pos logit contributions
ophen+1.019
onent+1.001
cial+0.996
reated+0.952
hemat+0.929
Neg logit contributions
-1.362
OPLE-1.271
 citiz-1.243
tv-1.220
 refunds-1.197
Token you
Token ID345
Feature activation-9.13e-01

Pos logit contributions
+2.793
OPLE+2.540
 citiz+2.452
tv+2.449
 refunds+2.434
Neg logit contributions
ophen-3.698
onent-3.674
cial-3.628
reated-3.529
hemat-3.484
Token may
Token ID743
Feature activation-2.82e+00

Pos logit contributions
+0.607
OPLE+0.552
 citiz+0.538
tv+0.530
 refunds+0.517
Neg logit contributions
ophen-0.723
onent-0.712
cial-0.709
reated-0.686
pite-0.673
Token want
Token ID765
Feature activation+1.00e+00

Pos logit contributions
+1.338
OPLE+1.177
tv+1.130
 citiz+1.120
 refunds+1.081
Neg logit contributions
ophen-2.729
cial-2.703
onent-2.701
reated-2.625
pite-2.593
Token Intelligent
Token ID49452
Feature activation+1.20e+00

Pos logit contributions
ophen+0.885
onent+0.870
cial+0.866
reated+0.833
hemat+0.814
Neg logit contributions
-0.983
OPLE-0.910
 citiz-0.889
tv-0.873
 refunds-0.855
Token,
Token ID11
Feature activation+3.04e-01

Pos logit contributions
ophen+0.780
onent+0.768
cial+0.762
reated+0.732
hemat+0.714
Neg logit contributions
-0.973
OPLE-0.906
 citiz-0.881
tv-0.870
 refunds-0.854
Token Techn
Token ID5429
Feature activation+7.29e+00

Pos logit contributions
ophen+0.207
onent+0.204
cial+0.203
reated+0.195
hemat+0.190
Neg logit contributions
-0.236
OPLE-0.218
 citiz-0.214
tv-0.210
 refunds-0.206
Tokenophile
Token ID37161
Feature activation-8.45e-01

Pos logit contributions
ophen+3.430
cial+3.362
onent+3.258
reated+3.087
obic+3.021
Neg logit contributions
-7.261
OPLE-6.869
 citiz-6.746
 refunds-6.563
tv-6.446
Token,
Token ID11
Feature activation+2.44e-01

Pos logit contributions
+0.659
OPLE+0.611
 citiz+0.598
tv+0.586
 refunds+0.574
Neg logit contributions
ophen-0.573
onent-0.563
cial-0.560
reated-0.538
hemat-0.526
Token Diplom
Token ID34905
Feature activation+8.23e+00

Pos logit contributions
ophen+0.167
onent+0.164
cial+0.163
reated+0.157
hemat+0.153
Neg logit contributions
-0.189
OPLE-0.175
 citiz-0.172
tv-0.169
 refunds-0.165
Tokenatic
Token ID1512
Feature activation+4.66e+00

Pos logit contributions
ophen+4.590
onent+4.546
cial+4.513
reated+4.234
hemat+4.142
Neg logit contributions
-7.452
OPLE-6.994
 citiz-6.802
 refunds-6.618
tv-6.511
Token
Token ID198
Feature activation+7.45e-01

Pos logit contributions
ophen+3.029
cial+2.989
onent+2.989
reated+2.827
pite+2.774
Neg logit contributions
-3.725
OPLE-3.465
 citiz-3.354
tv-3.301
 refunds-3.272
Token
Token ID198
Feature activation+7.90e-01

Pos logit contributions
ophen+0.556
onent+0.546
cial+0.541
reated+0.525
hemat+0.514
Neg logit contributions
-0.518
OPLE-0.483
tv-0.473
pour-0.455
 refunds-0.453
Token-
Token ID12
Feature activation+6.01e-02

Pos logit contributions
ophen+0.496
onent+0.487
cial+0.484
reated+0.464
hemat+0.453
Neg logit contributions
-0.656
OPLE-0.611
 citiz-0.598
tv-0.588
 refunds-0.577
Token Susp
Token ID31922
Feature activation-1.67e+00

Pos logit contributions
ophen+0.041
onent+0.041
cial+0.040
reated+0.039
hemat+0.038
Neg logit contributions
-0.046
OPLE-0.043
 citiz-0.042
tv-0.041
 refunds-0.040
Token at
Token ID379
Feature activation+2.00e+00

Pos logit contributions
ophen+0.738
onent+0.727
cial+0.724
reated+0.696
hemat+0.681
Neg logit contributions
-0.795
OPLE-0.735
 citiz-0.717
tv-0.703
 refunds-0.688
Token one
Token ID530
Feature activation+2.21e+00

Pos logit contributions
ophen+1.385
onent+1.362
cial+1.355
reated+1.304
pite+1.275
Neg logit contributions
-1.528
OPLE-1.411
 citiz-1.380
tv-1.351
 refunds-1.322
Token point
Token ID966
Feature activation+1.43e-01

Pos logit contributions
ophen+1.639
onent+1.614
cial+1.604
reated+1.549
hemat+1.521
Neg logit contributions
-1.581
OPLE-1.458
 citiz-1.423
tv-1.395
 refunds-1.365
Token
Token ID198
Feature activation+1.54e+00

Pos logit contributions
ophen+0.092
onent+0.090
cial+0.090
reated+0.086
hemat+0.084
Neg logit contributions
-0.116
OPLE-0.108
 citiz-0.106
tv-0.104
 refunds-0.102
Token
Token ID198
Feature activation+1.27e+00

Pos logit contributions
ophen+1.118
onent+1.099
cial+1.092
reated+1.054
hemat+1.032
Neg logit contributions
-1.129
OPLE-1.045
 citiz-1.013
tv-1.005
 refunds-0.980
TokenMrs
Token ID27034
Feature activation+8.23e+00

Pos logit contributions
ophen+0.902
onent+0.887
cial+0.882
reated+0.849
hemat+0.831
Neg logit contributions
-0.956
OPLE-0.884
 citiz-0.863
tv-0.847
 refunds-0.829
Token.
Token ID13
Feature activation+4.26e-01

Pos logit contributions
ophen+4.160
onent+4.125
cial+4.096
reated+3.846
hemat+3.751
Neg logit contributions
-7.705
OPLE-7.034
 citiz-6.845
mbuds-6.676
 sugg-6.654
Token Obama
Token ID2486
Feature activation+3.24e+00

Pos logit contributions
ophen+0.270
onent+0.265
cial+0.263
reated+0.252
hemat+0.246
Neg logit contributions
-0.353
OPLE-0.328
 citiz-0.321
tv-0.316
 refunds-0.310
Token urged
Token ID11643
Feature activation+3.72e+00

Pos logit contributions
ophen+2.013
cial+1.970
onent+1.967
reated+1.886
pite+1.832
Neg logit contributions
-2.712
OPLE-2.526
 citiz-2.462
tv-2.405
 refunds-2.334
Token kids
Token ID3988
Feature activation+2.30e+00

Pos logit contributions
ophen+2.712
cial+2.696
onent+2.694
reated+2.590
pite+2.522
Neg logit contributions
-2.708
OPLE-2.413
 citiz-2.360
tv-2.313
mbuds-2.253
Token to
Token ID284
Feature activation+1.08e+00

Pos logit contributions
ophen+1.419
onent+1.388
cial+1.386
reated+1.329
hemat+1.293
Neg logit contributions
-1.931
OPLE-1.795
 citiz-1.762
tv-1.719
 refunds-1.686
Token their
Token ID511
Feature activation-3.75e-01

Pos logit contributions
+1.626
OPLE+1.524
 citiz+1.504
tv+1.478
 refunds+1.469
Neg logit contributions
ophen-1.407
onent-1.384
cial-1.378
reated-1.330
hemat-1.294
Token behavior
Token ID4069
Feature activation-4.75e-01

Pos logit contributions
+0.238
OPLE+0.216
 citiz+0.211
tv+0.206
 refunds+0.203
Neg logit contributions
ophen-0.308
onent-0.305
cial-0.303
reated-0.293
hemat-0.288
Token when
Token ID618
Feature activation+2.46e+00

Pos logit contributions
+0.358
OPLE+0.331
 citiz+0.325
tv+0.319
 refunds+0.315
Neg logit contributions
ophen-0.332
onent-0.326
cial-0.324
reated-0.313
hemat-0.306
Token miners
Token ID18295
Feature activation-4.30e+00

Pos logit contributions
ophen+1.888
onent+1.860
cial+1.856
reated+1.790
pite+1.757
Neg logit contributions
-1.699
OPLE-1.552
 citiz-1.515
tv-1.474
pour-1.438
Token behave
Token ID17438
Feature activation+6.32e-01

Pos logit contributions
+2.490
OPLE+2.327
tv+2.304
 refunds+2.270
 citiz+2.228
Neg logit contributions
ophen-3.565
cial-3.490
onent-3.487
reated-3.372
hemat-3.356
Token strategically
Token ID34235
Feature activation+8.18e+00

Pos logit contributions
ophen+0.477
onent+0.469
cial+0.468
reated+0.451
hemat+0.442
Neg logit contributions
-0.445
OPLE-0.409
 citiz-0.399
tv-0.391
 refunds-0.384
Token is
Token ID318
Feature activation+7.67e-01

Pos logit contributions
ophen+5.169
onent+5.112
cial+5.089
reated+4.846
pite+4.844
Neg logit contributions
-6.598
OPLE-6.085
 citiz-5.888
tv-5.703
mbuds-5.622
Token an
Token ID281
Feature activation-3.12e-01

Pos logit contributions
ophen+0.571
onent+0.563
cial+0.560
reated+0.540
hemat+0.529
Neg logit contributions
-0.546
OPLE-0.504
 citiz-0.491
tv-0.482
 refunds-0.471
Token open
Token ID1280
Feature activation-2.37e+00

Pos logit contributions
+0.208
OPLE+0.191
 citiz+0.185
tv+0.182
 refunds+0.177
Neg logit contributions
ophen-0.247
onent-0.243
cial-0.242
reated-0.234
hemat-0.230
Token question
Token ID1808
Feature activation-2.54e+00

Pos logit contributions
+1.703
OPLE+1.574
 citiz+1.532
tv+1.517
pour+1.477
Neg logit contributions
ophen-1.717
onent-1.696
cial-1.675
reated-1.628
pite-1.592
Token.
Token ID13
Feature activation-1.02e+00

Pos logit contributions
+1.985
OPLE+1.880
 citiz+1.817
tv+1.799
 refunds+1.764
Neg logit contributions
ophen-1.668
onent-1.634
cial-1.627
reated-1.574
pite-1.531
Token asks
Token ID7893
Feature activation+2.75e+00

Pos logit contributions
ophen+1.273
onent+1.246
cial+1.242
reated+1.188
hemat+1.162
Neg logit contributions
-1.578
OPLE-1.462
 citiz-1.433
tv-1.402
 refunds-1.374
Token Stephen
Token ID7970
Feature activation+9.94e-01

Pos logit contributions
ophen+1.809
cial+1.760
onent+1.760
reated+1.678
hemat+1.644
Neg logit contributions
-2.213
OPLE-2.042
 citiz-1.999
tv-1.954
 refunds-1.918
Token,
Token ID11
Feature activation+6.82e-01

Pos logit contributions
ophen+0.659
onent+0.648
cial+0.644
reated+0.618
hemat+0.605
Neg logit contributions
-0.790
OPLE-0.734
 citiz-0.718
tv-0.705
 refunds-0.691
Token examining
Token ID17247
Feature activation-1.75e+00

Pos logit contributions
ophen+0.505
onent+0.497
cial+0.495
reated+0.477
hemat+0.468
Neg logit contributions
-0.488
OPLE-0.450
 citiz-0.440
tv-0.431
 refunds-0.422
Token a
Token ID257
Feature activation+1.07e-01

Pos logit contributions
+1.290
OPLE+1.205
 citiz+1.180
tv+1.166
 refunds+1.142
Neg logit contributions
ophen-1.224
onent-1.217
cial-1.202
reated-1.166
hemat-1.138
Token peac
Token ID41961
Feature activation+8.17e+00

Pos logit contributions
ophen+0.082
onent+0.081
cial+0.080
reated+0.078
hemat+0.076
Neg logit contributions
-0.074
OPLE-0.068
 citiz-0.066
tv-0.065
 refunds-0.063
Tokenock
Token ID735
Feature activation+2.79e+00

Pos logit contributions
ophen+4.627
onent+4.541
cial+4.459
reated+4.381
ulet+4.182
Neg logit contributions
-7.378
OPLE-6.878
 citiz-6.824
 refunds-6.471
 sugg-6.448
Token's
Token ID338
Feature activation-6.42e-01

Pos logit contributions
ophen+1.960
cial+1.937
onent+1.928
reated+1.845
hemat+1.811
Neg logit contributions
-2.115
OPLE-1.950
 citiz-1.907
tv-1.834
 refunds-1.806
Token tail
Token ID7894
Feature activation-3.24e-02

Pos logit contributions
+0.438
OPLE+0.402
 citiz+0.394
tv+0.388
 refunds+0.380
Neg logit contributions
ophen-0.495
onent-0.489
cial-0.484
reated-0.467
hemat-0.459
Token feathers
Token ID27400
Feature activation-5.23e-01

Pos logit contributions
+0.026
OPLE+0.024
 citiz+0.023
tv+0.023
 refunds+0.022
Neg logit contributions
ophen-0.021
onent-0.021
cial-0.021
reated-0.020
hemat-0.020
Token in
Token ID287
Feature activation+6.75e-01

Pos logit contributions
+0.411
OPLE+0.381
 citiz+0.374
tv+0.370
 refunds+0.362
Neg logit contributions
ophen-0.349
onent-0.343
cial-0.340
reated-0.327
hemat-0.320
Token great
Token ID1049
Feature activation-8.77e-01

Pos logit contributions
+0.948
OPLE+0.878
 citiz+0.866
tv+0.851
 refunds+0.844
Neg logit contributions
ophen-0.716
onent-0.702
cial-0.687
reated-0.666
hemat-0.651
Token things
Token ID1243
Feature activation+3.48e-01

Pos logit contributions
+0.598
OPLE+0.542
 citiz+0.531
tv+0.522
 refunds+0.515
Neg logit contributions
ophen-0.678
onent-0.672
cial-0.655
reated-0.641
pite-0.630
Token:
Token ID25
Feature activation+6.55e-01

Pos logit contributions
ophen+0.213
onent+0.209
cial+0.207
reated+0.199
hemat+0.194
Neg logit contributions
-0.294
OPLE-0.275
 citiz-0.269
tv-0.265
 refunds-0.260
Token
Token ID198
Feature activation+7.72e-01

Pos logit contributions
ophen+0.471
onent+0.463
cial+0.460
reated+0.444
hemat+0.435
Neg logit contributions
-0.483
OPLE-0.446
 citiz-0.436
tv-0.427
 refunds-0.419
Token
Token ID198
Feature activation+8.44e-01

Pos logit contributions
ophen+0.586
onent+0.574
cial+0.566
reated+0.553
pite+0.542
Neg logit contributions
-0.525
OPLE-0.492
tv-0.484
pour-0.464
 refunds-0.462
TokenDecre
Token ID43198
Feature activation+8.10e+00

Pos logit contributions
ophen+0.569
onent+0.557
cial+0.552
reated+0.533
pite+0.520
Neg logit contributions
-0.660
OPLE-0.615
 citiz-0.594
tv-0.592
 refunds-0.579
Tokenases
Token ID1386
Feature activation+3.95e+00

Pos logit contributions
onent+1.042
cial+0.996
ophen+0.957
reated+0.744
hemat+0.643
Neg logit contributions
-10.842
 citiz-10.257
OPLE-10.215
tv-9.913
 sugg-9.808
Token blood
Token ID2910
Feature activation-1.08e+00

Pos logit contributions
ophen+2.755
cial+2.738
onent+2.734
reated+2.620
pite+2.557
Neg logit contributions
-2.958
OPLE-2.732
 citiz-2.657
tv-2.612
wine-2.546
Token glucose
Token ID15701
Feature activation-2.89e+00

Pos logit contributions
+0.707
OPLE+0.644
 citiz+0.630
tv+0.618
 refunds+0.603
Neg logit contributions
ophen-0.866
onent-0.853
cial-0.847
reated-0.819
hemat-0.805
Token.
Token ID13
Feature activation+3.88e-01

Pos logit contributions
+1.972
OPLE+1.828
 citiz+1.777
tv+1.767
 refunds+1.732
Neg logit contributions
ophen-2.188
onent-2.137
cial-2.133
reated-2.061
pite-2.032
Token
Token ID198
Feature activation+2.20e-01

Pos logit contributions
ophen+0.273
onent+0.269
cial+0.267
reated+0.257
hemat+0.252
Neg logit contributions
-0.292
OPLE-0.270
 citiz-0.263
tv-0.259
 refunds-0.253
Token child
Token ID1200
Feature activation+2.06e+00

Pos logit contributions
ophen+0.413
onent+0.407
cial+0.407
reated+0.389
hemat+0.382
Neg logit contributions
-0.470
OPLE-0.439
 citiz-0.428
tv-0.422
 refunds-0.414
Token is
Token ID318
Feature activation+1.31e+00

Pos logit contributions
ophen+1.448
onent+1.424
cial+1.418
reated+1.363
hemat+1.332
Neg logit contributions
-1.558
OPLE-1.439
 citiz-1.405
tv-1.371
 refunds-1.341
Token transgender
Token ID10637
Feature activation+1.13e+00

Pos logit contributions
ophen+0.969
onent+0.954
cial+0.950
reated+0.916
hemat+0.898
Neg logit contributions
-0.932
OPLE-0.860
 citiz-0.838
tv-0.822
 refunds-0.804
Token,
Token ID11
Feature activation+3.64e-01

Pos logit contributions
ophen+0.767
onent+0.753
cial+0.749
reated+0.720
hemat+0.704
Neg logit contributions
-0.886
OPLE-0.822
 citiz-0.803
tv-0.788
 refunds-0.772
Token society
Token ID3592
Feature activation-1.77e+00

Pos logit contributions
ophen+0.264
onent+0.259
cial+0.258
reated+0.249
hemat+0.243
Neg logit contributions
-0.267
OPLE-0.247
 citiz-0.241
tv-0.237
 refunds-0.232
Token inst
Token ID916
Feature activation+8.09e+00

Pos logit contributions
+1.250
OPLE+1.162
 citiz+1.141
tv+1.132
 refunds+1.124
Neg logit contributions
ophen-1.294
onent-1.272
cial-1.263
reated-1.219
hemat-1.199
Tokenills
Token ID2171
Feature activation+5.53e+00

Pos logit contributions
onent+4.482
ophen+4.470
reated+4.317
cial+4.306
hemat+4.200
Neg logit contributions
-7.310
 citiz-6.845
OPLE-6.739
PDATE-6.333
tv-6.209
Token in
Token ID287
Feature activation+7.54e-01

Pos logit contributions
ophen+3.897
cial+3.857
onent+3.854
reated+3.757
pite+3.603
Neg logit contributions
-4.125
OPLE-3.754
 citiz-3.650
tv-3.542
pour-3.527
Token them
Token ID606
Feature activation+1.52e+00

Pos logit contributions
ophen+0.516
onent+0.507
cial+0.505
reated+0.485
hemat+0.475
Neg logit contributions
-0.583
OPLE-0.540
 citiz-0.528
tv-0.518
 refunds-0.507
Token rules
Token ID3173
Feature activation+1.32e+00

Pos logit contributions
ophen+1.108
onent+1.090
cial+1.090
reated+1.049
hemat+1.023
Neg logit contributions
-1.116
OPLE-1.027
 citiz-1.000
tv-0.978
pour-0.953
Token about
Token ID546
Feature activation+1.49e-01

Pos logit contributions
ophen+0.926
onent+0.910
cial+0.906
reated+0.872
hemat+0.853
Neg logit contributions
-0.997
OPLE-0.922
 citiz-0.901
tv-0.882
 refunds-0.863
Token Has
Token ID7875
Feature activation+2.81e+00

Pos logit contributions
+0.012
OPLE+0.011
 citiz+0.011
tv+0.011
 refunds+0.011
Neg logit contributions
ophen-0.009
onent-0.009
cial-0.009
reated-0.009
hemat-0.009
Tokenbro
Token ID7957
Feature activation+3.97e+00

Pos logit contributions
ophen+1.405
cial+1.371
onent+1.370
reated+1.283
hemat+1.251
Neg logit contributions
-2.700
OPLE-2.540
 citiz-2.495
tv-2.441
 refunds-2.412
Token,
Token ID11
Feature activation-3.26e-01

Pos logit contributions
ophen+2.482
cial+2.449
onent+2.421
reated+2.294
hemat+2.257
Neg logit contributions
-3.303
OPLE-3.055
 citiz-3.014
tv-2.904
 refunds-2.881
Token Barbie
Token ID46829
Feature activation+4.03e+00

Pos logit contributions
+0.266
OPLE+0.248
 citiz+0.242
tv+0.239
 refunds+0.234
Neg logit contributions
ophen-0.209
onent-0.205
cial-0.204
reated-0.196
hemat-0.191
Token,
Token ID11
Feature activation+1.94e-01

Pos logit contributions
ophen+2.481
cial+2.446
onent+2.429
reated+2.279
hemat+2.258
Neg logit contributions
-3.382
OPLE-3.105
 citiz-3.094
tv-2.953
 refunds-2.949
Token Lego
Token ID33198
Feature activation+8.05e+00

Pos logit contributions
ophen+0.130
onent+0.128
cial+0.127
reated+0.122
hemat+0.119
Neg logit contributions
-0.153
OPLE-0.142
 citiz-0.139
tv-0.136
 refunds-0.133
Token,
Token ID11
Feature activation+2.39e-01

Pos logit contributions
ophen+4.570
onent+4.472
cial+4.438
reated+4.184
hemat+4.132
Neg logit contributions
-6.926
 citiz-6.283
OPLE-6.272
 refunds-6.082
pour-6.069
Token Toys
Token ID35397
Feature activation+5.53e+00

Pos logit contributions
ophen+0.160
onent+0.157
cial+0.157
reated+0.150
hemat+0.147
Neg logit contributions
-0.188
OPLE-0.175
 citiz-0.171
tv-0.168
 refunds-0.164
Token R
Token ID371
Feature activation-8.54e-01

Pos logit contributions
ophen+3.733
cial+3.712
onent+3.663
reated+3.545
pite+3.424
Neg logit contributions
-4.351
OPLE-3.911
 citiz-3.870
 refunds-3.729
 sugg-3.702
Token Us
Token ID4021
Feature activation+2.68e+00

Pos logit contributions
+0.638
OPLE+0.596
 citiz+0.578
tv+0.573
 refunds+0.559
Neg logit contributions
ophen-0.604
onent-0.588
cial-0.585
reated-0.562
pite-0.554
Token,
Token ID11
Feature activation+4.22e-01

Pos logit contributions
ophen+1.687
cial+1.652
onent+1.650
reated+1.570
hemat+1.539
Neg logit contributions
-2.216
OPLE-2.058
 citiz-2.023
tv-1.971
 refunds-1.943
Token Virginia
Token ID6025
Feature activation-3.16e-01

Pos logit contributions
ophen+0.112
onent+0.110
cial+0.109
reated+0.104
pite+0.102
Neg logit contributions
-0.169
OPLE-0.156
 citiz-0.154
tv-0.150
 refunds-0.147
Token.
Token ID13
Feature activation-6.62e-02

Pos logit contributions
+0.241
OPLE+0.225
 citiz+0.221
tv+0.217
 refunds+0.213
Neg logit contributions
ophen-0.217
onent-0.214
cial-0.212
reated-0.204
hemat-0.201
Token
Token ID198
Feature activation+1.03e-01

Pos logit contributions
+0.048
OPLE+0.045
 citiz+0.043
tv+0.043
 refunds+0.041
Neg logit contributions
ophen-0.048
onent-0.048
cial-0.047
reated-0.046
hemat-0.045
Token
Token ID198
Feature activation+1.95e-01

Pos logit contributions
ophen+0.078
onent+0.077
cial+0.076
reated+0.074
pite+0.073
Neg logit contributions
-0.069
OPLE-0.065
tv-0.064
pour-0.062
 refunds-0.061
TokenDi
Token ID18683
Feature activation+2.57e-01

Pos logit contributions
ophen+0.149
onent+0.147
cial+0.146
reated+0.141
hemat+0.138
Neg logit contributions
-0.136
OPLE-0.125
 citiz-0.120
tv-0.120
 refunds-0.116
Tokenplom
Token ID7302
Feature activation+7.99e+00

Pos logit contributions
ophen+0.105
onent+0.102
cial+0.101
reated+0.094
hemat+0.091
Neg logit contributions
-0.269
OPLE-0.255
 citiz-0.251
tv-0.247
 refunds-0.243
Tokenatic
Token ID1512
Feature activation+5.25e+00

Pos logit contributions
ophen+3.947
cial+3.917
onent+3.851
reated+3.603
hemat+3.559
Neg logit contributions
-7.754
OPLE-7.176
 citiz-7.063
tv-6.816
 refunds-6.800
Token cables
Token ID18018
Feature activation+1.15e+00

Pos logit contributions
ophen+4.296
cial+4.251
onent+4.229
reated+4.119
pite+4.002
Neg logit contributions
-3.391
OPLE-3.031
 citiz-2.910
tv-2.829
pour-2.723
Token are
Token ID389
Feature activation+6.46e-01

Pos logit contributions
ophen+0.786
onent+0.772
cial+0.770
reated+0.739
hemat+0.724
Neg logit contributions
-0.892
OPLE-0.826
 citiz-0.807
tv-0.791
 refunds-0.774
Token internal
Token ID5387
Feature activation+1.02e-01

Pos logit contributions
ophen+0.480
onent+0.473
cial+0.469
reated+0.454
hemat+0.445
Neg logit contributions
-0.459
OPLE-0.424
 citiz-0.414
tv-0.406
 refunds-0.398
Token documents
Token ID4963
Feature activation-9.62e-01

Pos logit contributions
ophen+0.081
onent+0.080
cial+0.079
reated+0.077
hemat+0.075
Neg logit contributions
-0.068
OPLE-0.063
 citiz-0.061
tv-0.060
 refunds-0.058
Token,
Token ID11
Feature activation+4.98e-01

Pos logit contributions
ophen+3.028
onent+2.974
cial+2.973
reated+2.827
pite+2.767
Neg logit contributions
-3.889
OPLE-3.672
 citiz-3.534
tv-3.461
 refunds-3.386
Token investors
Token ID7713
Feature activation-7.60e-02

Pos logit contributions
ophen+0.375
onent+0.369
cial+0.367
reated+0.355
hemat+0.348
Neg logit contributions
-0.350
OPLE-0.322
 citiz-0.315
tv-0.308
 refunds-0.302
Token risk
Token ID2526
Feature activation-3.10e-01

Pos logit contributions
+0.055
OPLE+0.051
 citiz+0.050
tv+0.049
 refunds+0.048
Neg logit contributions
ophen-0.055
onent-0.054
cial-0.054
reated-0.052
hemat-0.051
Token paying
Token ID5989
Feature activation-1.64e+00

Pos logit contributions
+0.225
OPLE+0.207
 citiz+0.203
tv+0.200
 refunds+0.198
Neg logit contributions
ophen-0.226
onent-0.222
cial-0.220
reated-0.213
hemat-0.209
Token an
Token ID281
Feature activation+4.20e-01

Pos logit contributions
+1.155
OPLE+1.057
 citiz+1.046
tv+1.029
 refunds+1.019
Neg logit contributions
ophen-1.228
onent-1.204
cial-1.192
reated-1.158
hemat-1.135
Token artificially
Token ID32455
Feature activation+7.90e+00

Pos logit contributions
ophen+0.316
onent+0.311
cial+0.309
reated+0.299
hemat+0.293
Neg logit contributions
-0.298
OPLE-0.273
 citiz-0.267
tv-0.261
 refunds-0.255
Token high
Token ID1029
Feature activation-1.64e+00

Pos logit contributions
ophen+6.030
cial+6.009
onent+5.960
reated+5.838
pite+5.710
Neg logit contributions
-5.269
OPLE-4.935
 citiz-4.654
tv-4.500
pour-4.436
Token price
Token ID2756
Feature activation+3.86e-01

Pos logit contributions
+1.184
OPLE+1.089
 citiz+1.073
tv+1.065
 refunds+1.047
Neg logit contributions
ophen-1.192
onent-1.167
cial-1.161
reated-1.115
hemat-1.102
Token.
Token ID13
Feature activation+2.58e-01

Pos logit contributions
ophen+0.259
onent+0.255
cial+0.254
reated+0.243
hemat+0.238
Neg logit contributions
-0.304
OPLE-0.282
 citiz-0.276
tv-0.271
 refunds-0.265
Token
Token ID198
Feature activation+1.15e-01

Pos logit contributions
ophen+0.197
onent+0.194
cial+0.192
reated+0.187
pite+0.182
Neg logit contributions
-0.177
OPLE-0.164
tv-0.159
 citiz-0.158
 refunds-0.153
Token
Token ID198
Feature activation+2.26e-01

Pos logit contributions
ophen+0.088
onent+0.086
cial+0.085
reated+0.083
pite+0.081
Neg logit contributions
-0.078
OPLE-0.073
tv-0.072
pour-0.069
 refunds-0.069
Token it
Token ID340
Feature activation+2.95e-01

Pos logit contributions
ophen+0.977
onent+0.959
cial+0.957
reated+0.921
hemat+0.902
Neg logit contributions
-0.945
OPLE-0.869
 citiz-0.849
tv-0.830
 refunds-0.810
Token is
Token ID318
Feature activation+1.28e+00

Pos logit contributions
ophen+0.185
onent+0.182
cial+0.181
reated+0.174
hemat+0.170
Neg logit contributions
-0.243
OPLE-0.227
 citiz-0.222
tv-0.219
 refunds-0.215
Token possible
Token ID1744
Feature activation+1.48e+00

Pos logit contributions
ophen+0.977
onent+0.962
cial+0.958
reated+0.924
hemat+0.906
Neg logit contributions
-0.892
OPLE-0.820
 citiz-0.799
tv-0.782
 refunds-0.764
Token to
Token ID284
Feature activation+3.11e-01

Pos logit contributions
ophen+0.862
onent+0.844
cial+0.840
reated+0.797
pite+0.775
Neg logit contributions
-1.301
OPLE-1.214
 citiz-1.189
tv-1.168
 refunds-1.148
Token perform
Token ID1620
Feature activation-5.98e-01

Pos logit contributions
ophen+0.246
onent+0.243
cial+0.242
reated+0.234
hemat+0.230
Neg logit contributions
-0.206
OPLE-0.189
 citiz-0.183
tv-0.180
 refunds-0.176
Token pharmac
Token ID25702
Feature activation+7.90e+00

Pos logit contributions
+0.432
OPLE+0.399
 citiz+0.388
tv+0.381
 refunds+0.377
Neg logit contributions
ophen-0.435
onent-0.431
cial-0.426
reated-0.413
hemat-0.406
Tokenological
Token ID2770
Feature activation+3.90e+00

Pos logit contributions
ophen+3.777
cial+3.544
onent+3.487
reated+3.299
hemat+3.209
Neg logit contributions
-7.831
OPLE-7.401
 citiz-7.351
tv-7.065
 refunds-7.012
Token and
Token ID290
Feature activation+6.43e-01

Pos logit contributions
ophen+3.074
cial+3.012
onent+2.995
reated+2.899
pite+2.831
Neg logit contributions
-2.629
OPLE-2.399
 citiz-2.341
tv-2.265
pour-2.195
Token elect
Token ID1742
Feature activation+1.49e+00

Pos logit contributions
ophen+0.425
onent+0.418
cial+0.416
reated+0.400
hemat+0.390
Neg logit contributions
-0.511
OPLE-0.475
 citiz-0.465
tv-0.457
 refunds-0.448
Tokenroph
Token ID10051
Feature activation+4.00e+00

Pos logit contributions
ophen+0.620
onent+0.606
cial+0.599
reated+0.565
pite+0.544
Neg logit contributions
-1.548
OPLE-1.463
 citiz-1.438
tv-1.428
 refunds-1.398
Tokenys
Token ID893
Feature activation+2.16e+00

Pos logit contributions
ophen+1.771
cial+1.736
onent+1.733
reated+1.620
hemat+1.572
Neg logit contributions
-4.047
OPLE-3.830
 citiz-3.786
tv-3.695
 refunds-3.659
Token difference
Token ID3580
Feature activation+3.10e+00

Pos logit contributions
+0.165
OPLE+0.152
 citiz+0.148
tv+0.146
 refunds+0.142
Neg logit contributions
ophen-0.165
onent-0.162
cial-0.161
reated-0.155
hemat-0.152
Token is
Token ID318
Feature activation+1.09e+00

Pos logit contributions
ophen+1.848
cial+1.810
onent+1.803
reated+1.706
hemat+1.668
Neg logit contributions
-2.673
OPLE-2.489
 citiz-2.444
tv-2.384
 refunds-2.334
Token that
Token ID326
Feature activation+1.48e-01

Pos logit contributions
ophen+0.835
cial+0.821
onent+0.820
reated+0.788
hemat+0.774
Neg logit contributions
-0.763
OPLE-0.698
 citiz-0.681
tv-0.663
 refunds-0.647
Token Bac
Token ID35583
Feature activation-1.93e+00

Pos logit contributions
ophen+0.107
onent+0.106
cial+0.105
reated+0.101
hemat+0.099
Neg logit contributions
-0.109
OPLE-0.100
 citiz-0.098
tv-0.096
 refunds-0.094
Tokenile
Token ID576
Feature activation-1.92e-01

Pos logit contributions
+2.241
OPLE+2.147
tv+2.143
pour+2.122
wine+2.105
Neg logit contributions
ophen-0.483
onent-0.467
cial-0.449
reated-0.397
hemat-0.392
Token indirectly
Token ID20762
Feature activation+7.86e+00

Pos logit contributions
+0.143
OPLE+0.133
 citiz+0.130
tv+0.128
 refunds+0.126
Neg logit contributions
ophen-0.136
onent-0.134
cial-0.133
reated-0.128
hemat-0.126
Token and
Token ID290
Feature activation+8.89e-01

Pos logit contributions
ophen+5.270
onent+5.194
cial+5.168
reated+5.075
pport+4.855
Neg logit contributions
-6.134
OPLE-5.647
 citiz-5.370
tv-5.177
mbuds-5.144
Token inadvertently
Token ID29243
Feature activation+4.89e+00

Pos logit contributions
ophen+0.673
onent+0.662
cial+0.660
reated+0.637
hemat+0.624
Neg logit contributions
-0.620
OPLE-0.572
 citiz-0.557
tv-0.547
 refunds-0.535
Token infl
Token ID28472
Feature activation+4.97e+00

Pos logit contributions
ophen+3.365
onent+3.293
cial+3.292
reated+3.196
hemat+3.068
Neg logit contributions
-3.785
OPLE-3.494
 citiz-3.378
tv-3.284
mbuds-3.193
Tokenamed
Token ID2434
Feature activation-6.58e-01

Pos logit contributions
ophen+2.689
onent+2.631
cial+2.587
reated+2.483
hemat+2.415
Neg logit contributions
-4.623
OPLE-4.305
 citiz-4.267
tv-4.140
 refunds-4.086
Token people
Token ID661
Feature activation-2.16e+00

Pos logit contributions
+0.450
OPLE+0.414
 citiz+0.404
tv+0.397
 refunds+0.388
Neg logit contributions
ophen-0.508
onent-0.500
cial-0.498
reated-0.481
hemat-0.472
Token the
Token ID262
Feature activation+5.16e-01

Pos logit contributions
onent+1.985
ophen+1.980
cial+1.978
reated+1.891
pite+1.844
Neg logit contributions
-1.611
OPLE-1.408
 citiz-1.403
tv-1.321
 refunds-1.312
Token Today
Token ID6288
Feature activation-1.16e-01

Pos logit contributions
ophen+0.350
onent+0.347
cial+0.344
reated+0.330
pite+0.322
Neg logit contributions
-0.402
OPLE-0.370
 citiz-0.363
tv-0.354
 refunds-0.347
Token show
Token ID905
Feature activation-2.12e+00

Pos logit contributions
+0.074
OPLE+0.068
 citiz+0.066
tv+0.064
 refunds+0.062
Neg logit contributions
ophen-0.095
onent-0.093
cial-0.093
reated-0.090
hemat-0.088
Token's
Token ID338
Feature activation-1.46e+00

Pos logit contributions
+1.654
OPLE+1.552
 citiz+1.507
tv+1.500
pour+1.462
Neg logit contributions
ophen-1.420
onent-1.388
cial-1.373
reated-1.332
pite-1.301
Token Savannah
Token ID40695
Feature activation-3.83e+00

Pos logit contributions
+1.080
OPLE+0.993
 citiz+0.970
tv+0.949
 refunds+0.929
Neg logit contributions
ophen-1.046
onent-1.030
cial-1.026
reated-0.986
pite-0.964
Token Guth
Token ID44003
Feature activation-1.07e+01

Pos logit contributions
+1.592
OPLE+1.396
 citiz+1.354
tv+1.299
pour+1.271
Neg logit contributions
ophen-3.959
onent-3.899
cial-3.868
reated-3.808
pite-3.720
Tokenrie
Token ID5034
Feature activation-1.59e+00

Pos logit contributions
+3.951
pour+3.891
tv+3.747
wine+3.415
OPLE+3.348
Neg logit contributions
ophen-10.763
onent-10.540
cial-10.414
reated-10.201
pite-10.186
Token who
Token ID508
Feature activation-2.71e-02

Pos logit contributions
+1.269
OPLE+1.187
 citiz+1.155
tv+1.144
pour+1.118
Neg logit contributions
ophen-1.040
onent-1.023
cial-1.014
reated-0.978
pite-0.953
Token brought
Token ID3181
Feature activation-1.36e+00

Pos logit contributions
+0.022
OPLE+0.021
 citiz+0.020
tv+0.020
 refunds+0.020
Neg logit contributions
ophen-0.017
onent-0.017
cial-0.017
reated-0.016
hemat-0.016
Token up
Token ID510
Feature activation-8.16e-01

Pos logit contributions
+0.904
OPLE+0.824
 citiz+0.805
tv+0.782
 refunds+0.762
Neg logit contributions
ophen-1.086
onent-1.070
cial-1.067
reated-1.027
hemat-1.009
Token that
Token ID326
Feature activation-6.52e-01

Pos logit contributions
+0.595
OPLE+0.548
 citiz+0.535
tv+0.523
 refunds+0.511
Neg logit contributions
ophen-0.597
onent-0.588
cial-0.585
reated-0.563
hemat-0.552
Tokenross
Token ID1214
Feature activation-1.46e+00

Pos logit contributions
+1.739
OPLE+1.635
tv+1.601
 citiz+1.582
pour+1.551
Neg logit contributions
ophen-1.283
cial-1.252
onent-1.250
reated-1.195
hemat-1.161
Token.
Token ID13
Feature activation-5.88e-01

Pos logit contributions
+1.223
OPLE+1.144
 citiz+1.117
tv+1.107
 refunds+1.083
Neg logit contributions
ophen-0.891
onent-0.873
cial-0.866
reated-0.831
hemat-0.811
Token Carlton
Token ID44348
Feature activation-1.89e+00

Pos logit contributions
+0.448
OPLE+0.418
 citiz+0.404
tv+0.404
 refunds+0.392
Neg logit contributions
ophen-0.406
onent-0.399
cial-0.395
reated-0.383
hemat-0.374
Token L
Token ID406
Feature activation+9.59e-02

Pos logit contributions
+1.254
OPLE+1.163
tv+1.132
 citiz+1.118
pour+1.092
Neg logit contributions
ophen-1.472
onent-1.455
cial-1.442
reated-1.393
hemat-1.366
Token.
Token ID13
Feature activation-1.18e-01

Pos logit contributions
ophen+0.056
onent+0.055
cial+0.054
reated+0.052
hemat+0.050
Neg logit contributions
-0.085
OPLE-0.079
 citiz-0.078
tv-0.076
 refunds-0.075
Token Guth
Token ID44003
Feature activation-1.03e+01

Pos logit contributions
+0.084
OPLE+0.077
 citiz+0.075
tv+0.074
 refunds+0.072
Neg logit contributions
ophen-0.089
onent-0.087
cial-0.087
reated-0.084
hemat-0.082
Tokenrie
Token ID5034
Feature activation-1.16e+00

Pos logit contributions
+5.275
tv+5.246
wine+5.126
pour+4.877
OPLE+4.773
Neg logit contributions
ophen-8.382
onent-8.277
cial-8.133
reated-7.885
pite-7.882
Token.
Token ID13
Feature activation-8.32e-01

Pos logit contributions
+0.947
OPLE+0.885
 citiz+0.864
tv+0.857
 refunds+0.836
Neg logit contributions
ophen-0.732
onent-0.718
cial-0.710
reated-0.684
hemat-0.667
Token Marion
Token ID32473
Feature activation-1.42e+00

Pos logit contributions
+0.631
OPLE+0.590
tv+0.571
 citiz+0.569
 refunds+0.553
Neg logit contributions
ophen-0.575
onent-0.565
cial-0.559
reated-0.542
hemat-0.529
Token T
Token ID309
Feature activation-4.93e-01

Pos logit contributions
+0.958
OPLE+0.887
tv+0.863
 citiz+0.858
pour+0.833
Neg logit contributions
ophen-1.084
onent-1.071
cial-1.062
reated-1.027
hemat-1.006
Token.
Token ID13
Feature activation-3.68e-01

Pos logit contributions
+0.432
OPLE+0.404
 citiz+0.396
tv+0.389
 refunds+0.382
Neg logit contributions
ophen-0.288
onent-0.282
cial-0.280
reated-0.267
hemat-0.260
Token by
Token ID416
Feature activation+8.81e-01

Pos logit contributions
+0.244
OPLE+0.227
 citiz+0.221
tv+0.218
 refunds+0.213
Neg logit contributions
ophen-0.230
onent-0.226
cial-0.225
reated-0.217
hemat-0.212
Token NBC
Token ID9716
Feature activation-3.94e-02

Pos logit contributions
ophen+0.714
onent+0.711
cial+0.705
reated+0.681
hemat+0.667
Neg logit contributions
-0.570
OPLE-0.509
 citiz-0.501
tv-0.479
 refunds-0.474
Token News
Token ID3000
Feature activation+1.06e+00

Pos logit contributions
+0.028
OPLE+0.026
 citiz+0.026
tv+0.025
 refunds+0.024
Neg logit contributions
ophen-0.029
onent-0.029
cial-0.028
reated-0.027
hemat-0.027
Token'
Token ID6
Feature activation-1.42e+00

Pos logit contributions
ophen+0.797
onent+0.787
cial+0.782
reated+0.753
hemat+0.740
Neg logit contributions
-0.748
OPLE-0.685
 citiz-0.668
tv-0.651
 refunds-0.638
Token Savannah
Token ID40695
Feature activation-4.89e+00

Pos logit contributions
+1.092
OPLE+1.001
 citiz+0.979
tv+0.953
 refunds+0.936
Neg logit contributions
ophen-0.986
onent-0.975
cial-0.968
reated-0.929
pite-0.908
Token Guth
Token ID44003
Feature activation-1.02e+01

Pos logit contributions
+1.875
OPLE+1.636
tv+1.595
 citiz+1.558
pour+1.526
Neg logit contributions
ophen-5.156
onent-5.057
cial-5.023
reated-4.917
pite-4.836
Tokenrie
Token ID5034
Feature activation-1.62e+00

Pos logit contributions
tv+3.823
pour+3.724
+3.430
OPLE+3.212
wine+3.156
Neg logit contributions
ophen-9.743
onent-9.671
cial-9.502
hemat-9.372
accompan-9.364
Token (
Token ID357
Feature activation-5.56e-01

Pos logit contributions
+1.166
OPLE+1.091
 citiz+1.062
tv+1.053
 refunds+1.029
Neg logit contributions
ophen-1.177
onent-1.154
cial-1.148
reated-1.116
pite-1.089
Tokenright
Token ID3506
Feature activation-2.02e+00

Pos logit contributions
+0.373
OPLE+0.343
 citiz+0.332
tv+0.329
 refunds+0.318
Neg logit contributions
ophen-0.437
onent-0.430
cial-0.427
reated-0.413
hemat-0.405
Token)
Token ID8
Feature activation+2.34e+00

Pos logit contributions
+1.696
OPLE+1.582
 citiz+1.547
tv+1.529
 refunds+1.497
Neg logit contributions
ophen-1.243
onent-1.220
cial-1.211
reated-1.156
hemat-1.131
Token to
Token ID284
Feature activation+3.67e-01

Pos logit contributions
ophen+1.615
onent+1.593
cial+1.587
reated+1.514
hemat+1.488
Neg logit contributions
-1.808
OPLE-1.654
 citiz-1.619
tv-1.572
 refunds-1.540
Tokenist
Token ID396
Feature activation+6.89e-01

Pos logit contributions
+2.014
tv+1.984
OPLE+1.934
pour+1.933
wine+1.885
Neg logit contributions
ophen-1.279
onent-1.241
cial-1.227
reated-1.215
hemat-1.176
Token has
Token ID468
Feature activation+1.50e+00

Pos logit contributions
ophen+0.467
onent+0.458
cial+0.456
reated+0.439
hemat+0.428
Neg logit contributions
-0.539
OPLE-0.499
 citiz-0.488
tv-0.480
 refunds-0.471
Token cut
Token ID2005
Feature activation+2.15e+00

Pos logit contributions
ophen+1.026
onent+1.009
cial+1.004
reated+0.964
hemat+0.943
Neg logit contributions
-1.165
OPLE-1.081
 citiz-1.056
tv-1.036
 refunds-1.014
Token Mr
Token ID1770
Feature activation+3.22e+00

Pos logit contributions
ophen+1.540
onent+1.521
cial+1.518
reated+1.458
pite+1.423
Neg logit contributions
-1.592
OPLE-1.466
 citiz-1.432
tv-1.397
pour-1.360
Token.
Token ID13
Feature activation+1.58e-01

Pos logit contributions
ophen+1.823
onent+1.788
cial+1.781
reated+1.688
hemat+1.644
Neg logit contributions
-2.894
OPLE-2.701
 citiz-2.679
tv-2.578
 refunds-2.552
Token Gian
Token ID30851
Feature activation-9.12e+00

Pos logit contributions
ophen+0.114
onent+0.112
cial+0.111
reated+0.107
hemat+0.105
Neg logit contributions
-0.118
OPLE-0.109
 citiz-0.106
tv-0.104
 refunds-0.101
Tokenfort
Token ID3319
Feature activation+2.59e+00

Pos logit contributions
+7.361
tv+7.065
OPLE+6.994
pour+6.900
 citiz+6.802
Neg logit contributions
onent-5.245
ophen-5.210
cial-5.054
reated-4.922
hemat-4.808
Tokene
Token ID68
Feature activation-7.97e-01

Pos logit contributions
ophen+1.541
onent+1.510
cial+1.502
reated+1.434
hemat+1.398
Neg logit contributions
-2.234
OPLE-2.088
 citiz-2.046
tv-2.013
 refunds-1.976
Token
Token ID447
Feature activation-3.18e-01

Pos logit contributions
+0.573
OPLE+0.529
 citiz+0.516
tv+0.509
 refunds+0.503
Neg logit contributions
ophen-0.586
onent-0.574
cial-0.571
reated-0.552
hemat-0.541
Token
Token ID247
Feature activation-8.34e-01

Pos logit contributions
+0.189
OPLE+0.171
 citiz+0.166
tv+0.162
 refunds+0.157
Neg logit contributions
ophen-0.274
onent-0.270
cial-0.269
reated-0.261
hemat-0.257
Tokens
Token ID82
Feature activation-1.40e-01

Pos logit contributions
+0.495
OPLE+0.448
 citiz+0.434
tv+0.433
 refunds+0.419
Neg logit contributions
ophen-0.716
onent-0.705
cial-0.698
reated-0.678
hemat-0.670
Token while
Token ID981
Feature activation-7.56e-01

Pos logit contributions
ophen+0.056
onent+0.055
cial+0.055
reated+0.053
hemat+0.052
Neg logit contributions
-0.054
OPLE-0.050
 citiz-0.049
tv-0.048
 refunds-0.047
Token.
Token ID13
Feature activation-5.40e-01

Pos logit contributions
+0.615
OPLE+0.575
 citiz+0.563
tv+0.558
 refunds+0.547
Neg logit contributions
ophen-0.481
onent-0.471
cial-0.467
reated-0.452
hemat-0.440
Token So
Token ID1406
Feature activation-5.98e+00

Pos logit contributions
+0.352
OPLE+0.325
tv+0.312
 citiz+0.312
 refunds+0.302
Neg logit contributions
ophen-0.433
onent-0.426
cial-0.423
reated-0.411
pite-0.403
Token I
Token ID314
Feature activation-1.15e+00

Pos logit contributions
+3.629
OPLE+3.480
 refunds+3.410
tv+3.354
 citiz+3.316
Neg logit contributions
ophen-4.699
onent-4.662
cial-4.525
reated-4.510
pite-4.404
Token'm
Token ID1101
Feature activation-2.22e+00

Pos logit contributions
+0.803
OPLE+0.737
 citiz+0.715
tv+0.711
 refunds+0.696
Neg logit contributions
ophen-0.864
onent-0.854
cial-0.848
reated-0.817
pite-0.806
Token sitting
Token ID5586
Feature activation-8.93e+00

Pos logit contributions
+1.479
OPLE+1.358
 citiz+1.315
tv+1.314
 refunds+1.293
Neg logit contributions
ophen-1.728
onent-1.713
cial-1.697
reated-1.647
pite-1.616
Token there
Token ID612
Feature activation+3.12e-01

Pos logit contributions
+5.097
OPLE+4.732
 refunds+4.731
tv+4.675
 citiz+4.574
Neg logit contributions
ophen-7.206
cial-7.125
onent-7.078
reated-6.878
hemat-6.739
Token and
Token ID290
Feature activation-4.40e-01

Pos logit contributions
ophen+0.226
onent+0.223
cial+0.222
reated+0.214
hemat+0.209
Neg logit contributions
-0.230
OPLE-0.212
 citiz-0.207
tv-0.202
 refunds-0.197
Token thinking
Token ID3612
Feature activation-4.60e-01

Pos logit contributions
+0.299
OPLE+0.275
 citiz+0.267
tv+0.263
 refunds+0.257
Neg logit contributions
ophen-0.342
onent-0.336
cial-0.335
reated-0.324
hemat-0.317
Token like
Token ID588
Feature activation-2.53e+00

Pos logit contributions
+0.364
OPLE+0.338
 citiz+0.330
tv+0.325
 refunds+0.319
Neg logit contributions
ophen-0.307
onent-0.302
cial-0.301
reated-0.289
hemat-0.282
Token,
Token ID11
Feature activation-2.59e-01

Pos logit contributions
+1.884
OPLE+1.751
 citiz+1.706
tv+1.684
 refunds+1.658
Neg logit contributions
ophen-1.786
onent-1.758
cial-1.747
reated-1.691
pite-1.648
Token colleague
Token ID16008
Feature activation-2.55e+00

Pos logit contributions
+2.058
OPLE+1.907
 citiz+1.878
tv+1.839
 refunds+1.783
Neg logit contributions
ophen-2.512
onent-2.456
cial-2.440
reated-2.384
hemat-2.331
Token,
Token ID11
Feature activation-2.31e-01

Pos logit contributions
+1.946
OPLE+1.819
 citiz+1.792
tv+1.770
pour+1.730
Neg logit contributions
ophen-1.723
onent-1.712
cial-1.676
reated-1.633
pite-1.592
Token Bed
Token ID15585
Feature activation-1.70e+00

Pos logit contributions
+0.164
OPLE+0.151
 citiz+0.147
tv+0.143
 refunds+0.140
Neg logit contributions
ophen-0.174
onent-0.171
cial-0.171
reated-0.164
hemat-0.161
Tokenrett
Token ID11489
Feature activation-8.80e-01

Pos logit contributions
+1.312
OPLE+1.222
tv+1.188
 citiz+1.188
pour+1.157
Neg logit contributions
ophen-1.148
onent-1.129
cial-1.119
reated-1.074
hemat-1.059
Tokenin
Token ID259
Feature activation-2.21e-01

Pos logit contributions
+0.640
OPLE+0.591
 citiz+0.579
tv+0.569
pour+0.554
Neg logit contributions
ophen-0.638
onent-0.630
cial-0.625
reated-0.603
hemat-0.590
Token Gund
Token ID23323
Feature activation-8.93e+00

Pos logit contributions
+0.152
OPLE+0.140
 citiz+0.136
tv+0.133
 refunds+0.130
Neg logit contributions
ophen-0.170
onent-0.167
cial-0.167
reated-0.161
hemat-0.158
Tokenuz
Token ID10277
Feature activation-3.34e+00

Pos logit contributions
+6.555
tv+6.388
pour+6.208
wine+6.048
OPLE+6.045
Neg logit contributions
ophen-5.522
onent-5.499
cial-5.421
reated-5.244
hemat-5.228
Token,
Token ID11
Feature activation-6.30e-01

Pos logit contributions
+2.502
OPLE+2.331
tv+2.301
 citiz+2.297
pour+2.260
Neg logit contributions
ophen-2.256
onent-2.250
cial-2.201
reated-2.147
hemat-2.084
Token who
Token ID508
Feature activation-7.25e-02

Pos logit contributions
+0.455
OPLE+0.420
 citiz+0.412
tv+0.405
 refunds+0.395
Neg logit contributions
ophen-0.460
onent-0.454
cial-0.449
reated-0.436
hemat-0.426
Token had
Token ID550
Feature activation-1.73e-01

Pos logit contributions
+0.058
OPLE+0.054
 citiz+0.053
tv+0.052
 refunds+0.051
Neg logit contributions
ophen-0.047
onent-0.046
cial-0.046
reated-0.044
hemat-0.043
Token helped
Token ID4193
Feature activation+1.85e+00

Pos logit contributions
+0.132
OPLE+0.123
 citiz+0.120
tv+0.118
 refunds+0.116
Neg logit contributions
ophen-0.119
onent-0.118
cial-0.117
reated-0.112
hemat-0.110
Token shooting
Token ID4395
Feature activation-3.80e+00

Pos logit contributions
ophen+1.585
onent+1.562
cial+1.555
reated+1.504
hemat+1.477
Neg logit contributions
-1.264
OPLE-1.153
 citiz-1.122
tv-1.097
 refunds-1.070
Token last
Token ID938
Feature activation+2.37e+00

Pos logit contributions
+2.507
OPLE+2.305
 citiz+2.248
tv+2.235
 refunds+2.201
Neg logit contributions
ophen-2.979
onent-2.922
cial-2.883
reated-2.796
hemat-2.774
Token December
Token ID3426
Feature activation+5.92e-01

Pos logit contributions
ophen+1.597
onent+1.569
cial+1.562
reated+1.499
hemat+1.467
Neg logit contributions
-1.863
OPLE-1.726
 citiz-1.679
tv-1.642
 refunds-1.608
Token --
Token ID1377
Feature activation+1.09e-01

Pos logit contributions
ophen+0.481
onent+0.474
cial+0.473
reated+0.457
hemat+0.448
Neg logit contributions
-0.383
OPLE-0.350
 citiz-0.340
tv-0.331
 refunds-0.323
Token were
Token ID547
Feature activation-3.12e-01

Pos logit contributions
ophen+0.078
onent+0.077
cial+0.076
reated+0.073
hemat+0.072
Neg logit contributions
-0.081
OPLE-0.075
 citiz-0.073
tv-0.072
 refunds-0.071
Token seated
Token ID21639
Feature activation-8.83e+00

Pos logit contributions
+0.233
OPLE+0.217
 citiz+0.213
tv+0.211
 refunds+0.209
Neg logit contributions
ophen-0.218
onent-0.215
cial-0.212
reated-0.205
hemat-0.202
Token in
Token ID287
Feature activation-5.33e-01

Pos logit contributions
+5.010
 refunds+4.916
 reservations+4.695
OPLE+4.667
tv+4.661
Neg logit contributions
cial-7.032
ophen-7.018
onent-6.978
hemat-6.686
reated-6.639
Token the
Token ID262
Feature activation-6.08e-01

Pos logit contributions
+0.372
OPLE+0.343
 citiz+0.334
tv+0.327
 refunds+0.320
Neg logit contributions
ophen-0.404
onent-0.398
cial-0.396
reated-0.382
hemat-0.375
Token VIP
Token ID24791
Feature activation+1.44e-02

Pos logit contributions
+0.416
OPLE+0.382
 citiz+0.372
tv+0.364
 refunds+0.355
Neg logit contributions
ophen-0.471
onent-0.463
cial-0.461
reated-0.446
hemat-0.437
Token section
Token ID2665
Feature activation-1.00e+00

Pos logit contributions
ophen+0.010
onent+0.010
cial+0.010
reated+0.010
hemat+0.010
Neg logit contributions
-0.011
OPLE-0.010
 citiz-0.010
tv-0.009
 refunds-0.009
Token near
Token ID1474
Feature activation+1.11e+00

Pos logit contributions
+0.831
OPLE+0.776
 citiz+0.762
tv+0.754
 refunds+0.743
Neg logit contributions
ophen-0.624
onent-0.615
cial-0.609
reated-0.584
hemat-0.571
Token here
Token ID994
Feature activation-1.26e-01

Pos logit contributions
ophen+1.021
onent+1.003
cial+1.001
reated+0.960
hemat+0.940
Neg logit contributions
-1.030
OPLE-0.941
 citiz-0.926
tv-0.897
 refunds-0.881
Token.
Token ID13
Feature activation-5.63e-01

Pos logit contributions
+0.109
OPLE+0.102
 citiz+0.100
tv+0.098
 refunds+0.096
Neg logit contributions
ophen-0.075
onent-0.074
cial-0.073
reated-0.070
hemat-0.068
Token
Token ID198
Feature activation-1.45e-01

Pos logit contributions
+0.445
OPLE+0.413
 citiz+0.404
tv+0.397
 refunds+0.389
Neg logit contributions
ophen-0.376
onent-0.369
cial-0.367
reated-0.353
hemat-0.345
Token
Token ID198
Feature activation-3.03e-03

Pos logit contributions
+0.100
OPLE+0.094
tv+0.092
pour+0.088
 refunds+0.088
Neg logit contributions
ophen-0.109
onent-0.107
cial-0.106
reated-0.103
hemat-0.101
Token--
Token ID438
Feature activation+1.45e+00

Pos logit contributions
+0.002
OPLE+0.002
 citiz+0.002
tv+0.002
 refunds+0.002
Neg logit contributions
ophen-0.002
onent-0.002
cial-0.002
reated-0.002
hemat-0.002
Token Seth
Token ID20194
Feature activation-8.82e+00

Pos logit contributions
ophen+1.056
cial+1.043
onent+1.038
reated+0.998
hemat+0.978
Neg logit contributions
-1.062
OPLE-0.968
 citiz-0.958
tv-0.920
 refunds-0.911
Token Abram
Token ID39631
Feature activation+4.71e-01

Pos logit contributions
+5.853
tv+5.447
 citiz+5.418
OPLE+5.344
wine+5.172
Neg logit contributions
ophen-6.405
onent-6.353
cial-6.333
reated-5.944
pite-5.909
Tokenov
Token ID709
Feature activation+1.30e+00

Pos logit contributions
ophen+0.208
onent+0.202
cial+0.201
reated+0.188
hemat+0.182
Neg logit contributions
-0.481
OPLE-0.453
 citiz-0.446
tv-0.438
 refunds-0.433
Tokenitch
Token ID2007
Feature activation-4.24e-01

Pos logit contributions
ophen+0.859
onent+0.843
cial+0.839
reated+0.805
hemat+0.786
Neg logit contributions
-1.038
OPLE-0.962
 citiz-0.942
tv-0.922
 refunds-0.905
Token
Token ID198
Feature activation+2.74e-01

Pos logit contributions
+0.354
OPLE+0.331
 citiz+0.324
tv+0.318
 refunds+0.312
Neg logit contributions
ophen-0.264
onent-0.259
cial-0.258
reated-0.247
hemat-0.241
Token
Token ID198
Feature activation+3.54e-01

Pos logit contributions
ophen+0.206
onent+0.203
cial+0.200
reated+0.195
hemat+0.191
Neg logit contributions
-0.187
OPLE-0.175
tv-0.172
pour-0.166
 refunds-0.164
Token the
Token ID262
Feature activation+2.05e-01

Pos logit contributions
ophen+0.965
onent+0.948
cial+0.947
reated+0.913
hemat+0.893
Neg logit contributions
-0.965
OPLE-0.893
 citiz-0.870
tv-0.850
 refunds-0.828
Token King
Token ID2677
Feature activation+4.22e+00

Pos logit contributions
ophen+0.155
onent+0.153
cial+0.152
reated+0.147
hemat+0.144
Neg logit contributions
-0.143
OPLE-0.131
 citiz-0.128
tv-0.126
 refunds-0.123
Token's
Token ID338
Feature activation-4.60e-01

Pos logit contributions
ophen+2.751
cial+2.711
onent+2.660
reated+2.593
pite+2.521
Neg logit contributions
-3.354
OPLE-3.167
 citiz-3.067
tv-2.973
 refunds-2.939
Token judgment
Token ID8492
Feature activation-1.92e+00

Pos logit contributions
+0.287
OPLE+0.260
 citiz+0.253
tv+0.247
 refunds+0.241
Neg logit contributions
ophen-0.384
onent-0.378
cial-0.377
reated-0.365
hemat-0.358
Token)."
Token ID21387
Feature activation+1.50e+00

Pos logit contributions
+1.526
OPLE+1.409
 citiz+1.374
tv+1.367
 refunds+1.336
Neg logit contributions
ophen-1.268
onent-1.250
cial-1.234
reated-1.187
pite-1.161
TokenSah
Token ID32194
Feature activation-8.75e+00

Pos logit contributions
ophen+1.099
onent+1.078
cial+1.074
reated+1.036
hemat+1.016
Neg logit contributions
-1.082
OPLE-0.997
 citiz-0.979
tv-0.950
 refunds-0.934
Tokenih
Token ID4449
Feature activation-3.61e+00

Pos logit contributions
+5.304
OPLE+4.837
tv+4.819
 citiz+4.746
pour+4.700
Neg logit contributions
onent-6.730
ophen-6.660
cial-6.610
reated-6.339
obic-6.295
Token al
Token ID435
Feature activation-1.60e+00

Pos logit contributions
+2.591
OPLE+2.364
 citiz+2.304
tv+2.300
pour+2.253
Neg logit contributions
ophen-2.604
cial-2.594
onent-2.576
reated-2.476
pport-2.445
Token-
Token ID12
Feature activation-1.64e+00

Pos logit contributions
+1.316
OPLE+1.223
 citiz+1.198
tv+1.181
 refunds+1.157
Neg logit contributions
ophen-1.018
onent-0.998
cial-0.994
reated-0.949
hemat-0.926
TokenB
Token ID33
Feature activation-1.12e+00

Pos logit contributions
+1.028
OPLE+0.931
 citiz+0.905
tv+0.892
 refunds+0.864
Neg logit contributions
ophen-1.361
onent-1.343
cial-1.332
reated-1.291
pite-1.267
Tokenuk
Token ID2724
Feature activation-1.62e+00

Pos logit contributions
+1.082
OPLE+1.019
 citiz+1.000
tv+0.989
 refunds+0.970
Neg logit contributions
ophen-0.550
onent-0.538
cial-0.533
reated-0.504
hemat-0.488
Tokenvm
Token ID14761
Feature activation+1.21e-01

Pos logit contributions
+1.150
OPLE+1.076
tv+1.044
 citiz+1.044
 refunds+1.011
Neg logit contributions
ophen-0.942
onent-0.926
cial-0.911
reated-0.886
hemat-0.859
Tokenash
Token ID1077
Feature activation-3.29e+00

Pos logit contributions
ophen+0.080
onent+0.079
cial+0.078
reated+0.075
hemat+0.073
Neg logit contributions
-0.096
OPLE-0.089
 citiz-0.088
tv-0.086
 refunds-0.084
Token ship
Token ID4074
Feature activation+1.74e+00

Pos logit contributions
+2.493
OPLE+2.334
tv+2.274
 citiz+2.271
pour+2.196
Neg logit contributions
ophen-2.254
onent-2.202
cial-2.168
reated-2.117
hemat-2.053
Tokenyard
Token ID9413
Feature activation+2.24e+00

Pos logit contributions
ophen+1.306
cial+1.290
onent+1.284
reated+1.236
hemat+1.220
Neg logit contributions
-1.227
OPLE-1.120
 citiz-1.101
tv-1.065
 refunds-1.045
Token at
Token ID379
Feature activation+2.47e-01

Pos logit contributions
ophen+1.487
onent+1.468
cial+1.467
reated+1.396
pite+1.375
Neg logit contributions
-1.786
OPLE-1.646
 citiz-1.614
tv-1.570
 refunds-1.543
Token Sever
Token ID26434
Feature activation-8.71e+00

Pos logit contributions
ophen+0.184
onent+0.181
cial+0.180
reated+0.174
hemat+0.170
Neg logit contributions
-0.177
OPLE-0.163
 citiz-0.159
tv-0.155
 refunds-0.152
Tokenod
Token ID375
Feature activation+1.09e+00

Pos logit contributions
+6.424
tv+6.237
OPLE+6.006
 citiz+5.871
 refunds+5.849
Neg logit contributions
onent-5.526
ophen-5.465
cial-5.464
hemat-5.394
reated-5.254
Tokenv
Token ID85
Feature activation-5.62e-01

Pos logit contributions
ophen+0.839
cial+0.817
onent+0.816
reated+0.782
pite+0.776
Neg logit contributions
-0.773
OPLE-0.702
 citiz-0.692
 refunds-0.661
tv-0.649
Tokeninsk
Token ID35803
Feature activation+1.40e+00

Pos logit contributions
+0.517
OPLE+0.483
 citiz+0.474
tv+0.465
 refunds+0.459
Neg logit contributions
ophen-0.305
onent-0.299
cial-0.297
reated-0.282
pite-0.274
Token,
Token ID11
Feature activation-7.97e-01

Pos logit contributions
ophen+0.894
cial+0.880
onent+0.880
reated+0.838
pite+0.821
Neg logit contributions
-1.150
OPLE-1.062
 citiz-1.040
tv-1.017
 refunds-1.003
Token which
Token ID543
Feature activation+6.22e-01

Pos logit contributions
+0.580
OPLE+0.535
 citiz+0.523
tv+0.512
 refunds+0.501
Neg logit contributions
ophen-0.582
onent-0.573
cial-0.570
reated-0.550
hemat-0.539
Token last
Token ID938
Feature activation+1.93e+00

Pos logit contributions
+0.540
OPLE+0.498
 citiz+0.486
tv+0.477
 refunds+0.467
Neg logit contributions
ophen-0.546
onent-0.537
cial-0.534
reated-0.515
hemat-0.504
Token week
Token ID1285
Feature activation-8.86e-01

Pos logit contributions
ophen+1.407
onent+1.385
cial+1.379
reated+1.325
hemat+1.303
Neg logit contributions
-1.410
OPLE-1.301
 citiz-1.267
tv-1.237
 refunds-1.209
Token that
Token ID326
Feature activation+7.87e-02

Pos logit contributions
+0.583
OPLE+0.533
 citiz+0.519
tv+0.508
 refunds+0.495
Neg logit contributions
ophen-0.709
onent-0.698
cial-0.695
reated-0.672
hemat-0.659
Token Mr
Token ID1770
Feature activation+3.96e+00

Pos logit contributions
ophen+0.054
onent+0.053
cial+0.053
reated+0.051
hemat+0.050
Neg logit contributions
-0.061
OPLE-0.056
 citiz-0.055
tv-0.054
 refunds-0.052
Token.
Token ID13
Feature activation+1.46e-01

Pos logit contributions
ophen+2.202
onent+2.159
cial+2.153
reated+2.028
hemat+1.980
Neg logit contributions
-3.626
 citiz-3.376
OPLE-3.372
tv-3.193
 refunds-3.183
Token Gian
Token ID30851
Feature activation-8.61e+00

Pos logit contributions
ophen+0.101
onent+0.100
cial+0.099
reated+0.095
hemat+0.093
Neg logit contributions
-0.112
OPLE-0.104
 citiz-0.102
tv-0.099
 refunds-0.097
Tokenfort
Token ID3319
Feature activation+2.47e+00

Pos logit contributions
tv+6.241
pour+6.184
+6.128
wine+5.919
OPLE+5.846
Neg logit contributions
ophen-5.263
cial-5.130
onent-5.110
reated-5.094
hemat-4.834
Tokene
Token ID68
Feature activation-7.55e-01

Pos logit contributions
ophen+1.483
onent+1.454
cial+1.446
reated+1.380
hemat+1.349
Neg logit contributions
-2.114
OPLE-1.976
 citiz-1.930
tv-1.913
 refunds-1.874
Token has
Token ID468
Feature activation+1.31e+00

Pos logit contributions
+0.597
OPLE+0.552
 citiz+0.539
tv+0.533
 refunds+0.524
Neg logit contributions
ophen-0.503
onent-0.494
cial-0.493
reated-0.472
hemat-0.461
Token sent
Token ID1908
Feature activation+2.97e+00

Pos logit contributions
ophen+0.920
onent+0.905
cial+0.900
reated+0.866
hemat+0.848
Neg logit contributions
-0.983
OPLE-0.910
 citiz-0.888
tv-0.870
 refunds-0.852
Token mixed
Token ID7668
Feature activation-3.02e+00

Pos logit contributions
ophen+2.216
cial+2.188
onent+2.185
reated+2.104
pite+2.055
Neg logit contributions
-2.120
OPLE-1.920
 citiz-1.885
tv-1.829
pour-1.772
Token,
Token ID11
Feature activation-7.03e-01

Pos logit contributions
+1.182
OPLE+1.091
 citiz+1.066
tv+1.055
 refunds+1.034
Neg logit contributions
ophen-1.191
onent-1.173
cial-1.158
reated-1.119
hemat-1.102
Token many
Token ID867
Feature activation-1.24e+00

Pos logit contributions
+0.516
OPLE+0.476
 citiz+0.465
tv+0.458
 refunds+0.447
Neg logit contributions
ophen-0.509
onent-0.501
cial-0.498
reated-0.480
hemat-0.470
Token people
Token ID661
Feature activation-3.03e+00

Pos logit contributions
+0.818
OPLE+0.751
 citiz+0.732
tv+0.718
 refunds+0.701
Neg logit contributions
ophen-0.983
onent-0.970
cial-0.964
reated-0.931
hemat-0.916
Token were
Token ID547
Feature activation-9.04e-01

Pos logit contributions
+2.232
OPLE+2.080
tv+2.029
 citiz+2.021
 refunds+1.983
Neg logit contributions
ophen-2.138
onent-2.109
cial-2.067
reated-2.003
hemat-1.968
Token already
Token ID1541
Feature activation-2.48e+00

Pos logit contributions
+0.638
OPLE+0.589
 citiz+0.573
tv+0.566
 refunds+0.553
Neg logit contributions
ophen-0.677
onent-0.669
cial-0.662
reated-0.640
hemat-0.628
Token seated
Token ID21639
Feature activation-8.40e+00

Pos logit contributions
+1.731
OPLE+1.611
tv+1.571
 citiz+1.558
 refunds+1.535
Neg logit contributions
ophen-1.855
onent-1.834
cial-1.799
reated-1.748
hemat-1.726
Token and
Token ID290
Feature activation-6.68e-01

Pos logit contributions
+5.507
OPLE+5.075
tv+5.033
 citiz+5.010
 refunds+5.000
Neg logit contributions
ophen-6.157
onent-6.090
cial-6.009
reated-5.738
hemat-5.679
Token a
Token ID257
Feature activation-8.09e-01

Pos logit contributions
+0.462
OPLE+0.426
 citiz+0.416
tv+0.408
 refunds+0.400
Neg logit contributions
ophen-0.510
onent-0.503
cial-0.500
reated-0.482
hemat-0.474
Token few
Token ID1178
Feature activation-6.24e-01

Pos logit contributions
+0.506
OPLE+0.463
 citiz+0.451
tv+0.441
 refunds+0.429
Neg logit contributions
ophen-0.670
onent-0.662
cial-0.656
reated-0.636
hemat-0.626
Token were
Token ID547
Feature activation-9.18e-01

Pos logit contributions
+0.437
OPLE+0.403
 citiz+0.393
tv+0.385
 refunds+0.377
Neg logit contributions
ophen-0.472
onent-0.465
cial-0.463
reated-0.446
hemat-0.438
Token checking
Token ID10627
Feature activation-2.59e+00

Pos logit contributions
+0.620
OPLE+0.572
 citiz+0.555
tv+0.548
 refunds+0.536
Neg logit contributions
ophen-0.714
onent-0.705
cial-0.699
reated-0.676
hemat-0.664
Token to
Token ID284
Feature activation+9.91e-01

Pos logit contributions
ophen+0.808
onent+0.794
cial+0.789
reated+0.757
hemat+0.739
Neg logit contributions
-1.007
OPLE-0.937
 citiz-0.917
tv-0.901
 refunds-0.883
Token film
Token ID2646
Feature activation+4.57e-01

Pos logit contributions
ophen+0.723
onent+0.711
cial+0.708
reated+0.683
pite+0.668
Neg logit contributions
-0.723
OPLE-0.665
 citiz-0.650
tv-0.635
 refunds-0.619
Token producer
Token ID9920
Feature activation-1.87e+00

Pos logit contributions
ophen+0.381
onent+0.375
cial+0.374
reated+0.362
hemat+0.355
Neg logit contributions
-0.286
OPLE-0.260
 citiz-0.253
tv-0.247
 refunds-0.240
Token Sub
Token ID3834
Feature activation-5.66e-01

Pos logit contributions
+1.322
OPLE+1.227
 citiz+1.193
tv+1.177
 refunds+1.154
Neg logit contributions
ophen-1.391
onent-1.369
cial-1.359
reated-1.312
hemat-1.288
Tokenhash
Token ID17831
Feature activation-2.95e+00

Pos logit contributions
+0.429
OPLE+0.397
 citiz+0.386
tv+0.376
 refunds+0.370
Neg logit contributions
ophen-0.399
onent-0.392
cial-0.390
reated-0.376
hemat-0.367
Token Gh
Token ID11972
Feature activation-8.37e+00

Pos logit contributions
+2.060
OPLE+1.934
 citiz+1.881
tv+1.876
pour+1.821
Neg logit contributions
ophen-2.165
onent-2.138
cial-2.131
reated-2.048
hemat-1.995
Tokenai
Token ID1872
Feature activation+2.18e-01

Pos logit contributions
+6.094
OPLE+5.951
tv+5.922
pour+5.896
wine+5.754
Neg logit contributions
ophen-5.043
cial-5.005
onent-4.846
reated-4.716
hemat-4.616
Token,
Token ID11
Feature activation-2.80e-01

Pos logit contributions
ophen+0.155
onent+0.153
cial+0.152
reated+0.147
hemat+0.143
Neg logit contributions
-0.162
OPLE-0.150
 citiz-0.146
tv-0.143
 refunds-0.140
Token the
Token ID262
Feature activation-1.13e-01

Pos logit contributions
+0.204
OPLE+0.188
 citiz+0.184
tv+0.180
 refunds+0.176
Neg logit contributions
ophen-0.205
onent-0.202
cial-0.201
reated-0.193
hemat-0.189
Token Supreme
Token ID5617
Feature activation-1.92e+00

Pos logit contributions
+0.080
OPLE+0.074
 citiz+0.072
tv+0.070
 refunds+0.069
Neg logit contributions
ophen-0.084
onent-0.083
cial-0.083
reated-0.080
hemat-0.078
Token Court
Token ID3078
Feature activation+5.04e-02

Pos logit contributions
+1.192
OPLE+1.087
tv+1.070
 citiz+1.070
 refunds+1.041
Neg logit contributions
ophen-1.574
onent-1.558
cial-1.551
reated-1.500
hemat-1.471
Token private
Token ID2839
Feature activation+1.05e+00

Pos logit contributions
+0.796
OPLE+0.736
 citiz+0.716
tv+0.703
 refunds+0.688
Neg logit contributions
ophen-0.851
onent-0.838
cial-0.832
reated-0.804
hemat-0.790
Token property
Token ID3119
Feature activation+1.19e+00

Pos logit contributions
ophen+0.690
onent+0.680
cial+0.680
reated+0.651
hemat+0.634
Neg logit contributions
-0.845
OPLE-0.781
 citiz-0.763
tv-0.747
 refunds-0.733
Token was
Token ID373
Feature activation+5.56e-01

Pos logit contributions
ophen+0.777
onent+0.763
cial+0.760
reated+0.729
hemat+0.711
Neg logit contributions
-0.962
OPLE-0.894
 citiz-0.875
tv-0.858
 refunds-0.840
Token
Token ID564
Feature activation-7.29e-01

Pos logit contributions
ophen+0.396
onent+0.390
cial+0.388
reated+0.374
hemat+0.366
Neg logit contributions
-0.414
OPLE-0.383
 citiz-0.374
tv-0.367
 refunds-0.360
Token
Token ID250
Feature activation-9.46e-01

Pos logit contributions
+0.456
OPLE+0.414
 citiz+0.402
tv+0.394
 refunds+0.383
Neg logit contributions
ophen-0.606
onent-0.598
cial-0.596
reated-0.576
hemat-0.566
Tokensat
Token ID49720
Feature activation-8.37e+00

Pos logit contributions
+0.720
OPLE+0.662
 citiz+0.646
tv+0.643
pour+0.629
Neg logit contributions
ophen-0.654
onent-0.645
cial-0.637
reated-0.608
hemat-0.601
Tokenir
Token ID343
Feature activation-2.60e+00

Pos logit contributions
+5.913
tv+5.371
OPLE+5.350
 citiz+5.293
 refunds+5.188
Neg logit contributions
ophen-5.843
cial-5.780
onent-5.742
reated-5.437
pite-5.402
Tokenical
Token ID605
Feature activation-1.19e+00

Pos logit contributions
+2.928
OPLE+2.756
tv+2.711
 citiz+2.691
pour+2.643
Neg logit contributions
ophen-0.815
onent-0.814
cial-0.764
reated-0.726
pite-0.687
Token
Token ID447
Feature activation-1.77e+00

Pos logit contributions
+0.947
OPLE+0.879
 citiz+0.859
tv+0.852
 refunds+0.831
Neg logit contributions
ophen-0.789
onent-0.776
cial-0.769
reated-0.740
hemat-0.721
Token
Token ID251
Feature activation+1.36e-02

Pos logit contributions
+0.886
OPLE+0.784
 citiz+0.753
tv+0.742
 refunds+0.709
Neg logit contributions
ophen-1.686
onent-1.662
cial-1.660
reated-1.608
hemat-1.590
Token.
Token ID13
Feature activation-7.75e-01

Pos logit contributions
ophen+0.009
onent+0.009
cial+0.009
reated+0.008
hemat+0.008
Neg logit contributions
-0.011
OPLE-0.010
 citiz-0.010
tv-0.010
 refunds-0.009
Tokenfish
Token ID11084
Feature activation-3.16e+00

Pos logit contributions
+0.824
OPLE+0.768
tv+0.754
 citiz+0.750
pour+0.733
Neg logit contributions
ophen-0.567
onent-0.554
cial-0.552
reated-0.527
hemat-0.514
Token
Token ID447
Feature activation-5.67e-01

Pos logit contributions
+2.420
OPLE+2.255
 citiz+2.195
tv+2.190
 refunds+2.146
Neg logit contributions
ophen-2.158
onent-2.108
cial-2.105
reated-2.012
pite-1.992
Token
Token ID251
Feature activation+7.13e-01

Pos logit contributions
+0.298
OPLE+0.266
 citiz+0.256
tv+0.250
 refunds+0.242
Neg logit contributions
ophen-0.529
onent-0.522
cial-0.520
reated-0.505
hemat-0.498
Token debuted
Token ID26376
Feature activation-7.45e-01

Pos logit contributions
ophen+0.509
onent+0.500
cial+0.498
reated+0.481
hemat+0.469
Neg logit contributions
-0.531
OPLE-0.491
 citiz-0.480
tv-0.468
 refunds-0.458
Token at
Token ID379
Feature activation-2.12e-02

Pos logit contributions
+0.525
OPLE+0.482
 citiz+0.470
tv+0.461
 refunds+0.451
Neg logit contributions
ophen-0.562
onent-0.553
cial-0.550
reated-0.531
hemat-0.521
Token Sund
Token ID3309
Feature activation-8.32e+00

Pos logit contributions
+0.016
OPLE+0.015
 citiz+0.014
tv+0.014
 refunds+0.014
Neg logit contributions
ophen-0.015
onent-0.015
cial-0.015
reated-0.014
hemat-0.014
Tokenance
Token ID590
Feature activation-3.62e+00

Pos logit contributions
+4.589
tv+4.220
OPLE+4.190
wine+3.900
 citiz+3.869
Neg logit contributions
ophen-7.238
cial-7.092
onent-7.080
pite-6.946
hemat-6.871
Token,
Token ID11
Feature activation-4.78e-01

Pos logit contributions
+2.697
OPLE+2.510
 citiz+2.448
tv+2.443
 refunds+2.369
Neg logit contributions
ophen-2.544
onent-2.497
cial-2.478
reated-2.381
pite-2.348
Token was
Token ID373
Feature activation-1.06e+00

Pos logit contributions
+0.350
OPLE+0.323
 citiz+0.315
tv+0.311
 refunds+0.303
Neg logit contributions
ophen-0.347
onent-0.341
cial-0.339
reated-0.327
hemat-0.321
Token acquired
Token ID9477
Feature activation-2.31e+00

Pos logit contributions
+0.766
OPLE+0.707
 citiz+0.690
tv+0.682
 refunds+0.664
Neg logit contributions
ophen-0.779
onent-0.765
cial-0.760
reated-0.736
pite-0.722
Token by
Token ID416
Feature activation+5.51e-01

Pos logit contributions
+1.676
OPLE+1.544
 citiz+1.516
tv+1.490
 refunds+1.454
Neg logit contributions
ophen-1.682
onent-1.642
cial-1.641
reated-1.588
hemat-1.554
Token This
Token ID770
Feature activation+7.31e-01

Pos logit contributions
+0.237
OPLE+0.220
 citiz+0.212
tv+0.211
 refunds+0.205
Neg logit contributions
ophen-0.246
onent-0.242
cial-0.241
reated-0.233
hemat-0.228
Token was
Token ID373
Feature activation-1.09e+00

Pos logit contributions
ophen+0.502
onent+0.493
cial+0.490
reated+0.471
hemat+0.461
Neg logit contributions
-0.564
OPLE-0.523
 citiz-0.510
tv-0.501
 refunds-0.490
Token
Token ID960
Feature activation+3.73e-01

Pos logit contributions
+0.780
OPLE+0.721
 citiz+0.703
tv+0.692
 refunds+0.677
Neg logit contributions
ophen-0.801
onent-0.790
cial-0.785
reated-0.759
hemat-0.744
Tokenare
Token ID533
Feature activation+9.75e-01

Pos logit contributions
ophen+0.261
onent+0.257
cial+0.256
reated+0.246
hemat+0.241
Neg logit contributions
-0.283
OPLE-0.261
 citiz-0.256
tv-0.250
 refunds-0.245
Token you
Token ID345
Feature activation-1.58e+00

Pos logit contributions
ophen+0.696
cial+0.684
onent+0.680
reated+0.655
hemat+0.645
Neg logit contributions
-0.724
OPLE-0.669
 citiz-0.656
tv-0.636
 refunds-0.625
Token sitting
Token ID5586
Feature activation-8.31e+00

Pos logit contributions
+0.977
OPLE+0.888
 citiz+0.862
tv+0.850
 refunds+0.822
Neg logit contributions
ophen-1.319
onent-1.304
cial-1.297
reated-1.258
hemat-1.232
Token down
Token ID866
Feature activation-1.74e+00

Pos logit contributions
+5.229
OPLE+4.788
tv+4.665
 citiz+4.651
 refunds+4.602
Neg logit contributions
ophen-6.416
cial-6.387
onent-6.374
reated-6.126
hemat-6.029
Token?
Token ID30
Feature activation+3.29e-01

Pos logit contributions
+1.396
OPLE+1.299
 citiz+1.270
tv+1.253
 refunds+1.225
Neg logit contributions
ophen-1.139
onent-1.120
cial-1.113
reated-1.070
hemat-1.047
Token
Token ID960
Feature activation+1.05e+00

Pos logit contributions
ophen+0.230
onent+0.226
cial+0.225
reated+0.216
hemat+0.212
Neg logit contributions
-0.250
OPLE-0.231
 citiz-0.226
tv-0.221
 refunds-0.217
Tokena
Token ID64
Feature activation+8.36e-01

Pos logit contributions
ophen+0.750
onent+0.737
cial+0.737
reated+0.707
hemat+0.694
Neg logit contributions
-0.790
OPLE-0.729
 citiz-0.715
tv-0.691
 refunds-0.682
Token lie
Token ID6486
Feature activation-2.37e+00

Pos logit contributions
ophen+0.642
onent+0.631
cial+0.630
reated+0.607
hemat+0.595
Neg logit contributions
-0.578
OPLE-0.530
 citiz-0.516
tv-0.505
 refunds-0.493
Token it
Token ID340
Feature activation-1.25e+00

Pos logit contributions
+3.185
OPLE+2.909
tv+2.852
 refunds+2.848
 citiz+2.837
Neg logit contributions
ophen-3.857
onent-3.802
cial-3.769
reated-3.666
hemat-3.631
Token's
Token ID338
Feature activation-9.11e-01

Pos logit contributions
+0.893
OPLE+0.822
 citiz+0.804
tv+0.798
 refunds+0.785
Neg logit contributions
ophen-0.924
onent-0.906
cial-0.902
reated-0.869
hemat-0.854
Token non
Token ID1729
Feature activation+2.40e+00

Pos logit contributions
+0.624
OPLE+0.571
 citiz+0.558
tv+0.551
 refunds+0.540
Neg logit contributions
ophen-0.699
onent-0.690
cial-0.687
reated-0.663
hemat-0.652
Tokenstop
Token ID11338
Feature activation+1.79e+00

Pos logit contributions
ophen+1.473
onent+1.445
cial+1.441
reated+1.366
hemat+1.342
Neg logit contributions
-2.021
OPLE-1.901
 citiz-1.851
tv-1.804
 refunds-1.766
Token awesome
Token ID7427
Feature activation-1.75e+00

Pos logit contributions
ophen+1.252
onent+1.233
cial+1.230
reated+1.181
pite+1.154
Neg logit contributions
-1.351
OPLE-1.254
 citiz-1.222
tv-1.189
 refunds-1.167
Token grappling
Token ID36762
Feature activation-8.28e+00

Pos logit contributions
+1.403
OPLE+1.302
 citiz+1.276
tv+1.271
 refunds+1.233
Neg logit contributions
ophen-1.142
onent-1.117
cial-1.107
reated-1.068
hemat-1.045
Token hook
Token ID8011
Feature activation-3.80e-01

Pos logit contributions
+5.782
tv+5.320
OPLE+5.287
 citiz+5.185
 devs+5.066
Neg logit contributions
ophen-5.899
onent-5.736
cial-5.625
hemat-5.532
reated-5.516
Token aw
Token ID3253
Feature activation-4.50e-01

Pos logit contributions
+0.287
OPLE+0.266
 citiz+0.260
tv+0.255
 refunds+0.249
Neg logit contributions
ophen-0.267
onent-0.263
cial-0.262
reated-0.252
hemat-0.246
Tokenes
Token ID274
Feature activation-1.33e+00

Pos logit contributions
+0.316
OPLE+0.291
 citiz+0.282
tv+0.274
 refunds+0.270
Neg logit contributions
ophen-0.342
onent-0.336
cial-0.335
reated-0.323
hemat-0.317
Tokenomen
Token ID3674
Feature activation+1.18e+00

Pos logit contributions
+1.224
OPLE+1.150
 citiz+1.124
tv+1.115
pour+1.097
Neg logit contributions
ophen-0.697
onent-0.690
cial-0.680
reated-0.653
hemat-0.633
Tokeness
Token ID408
Feature activation+6.87e-02

Pos logit contributions
ophen+0.765
onent+0.752
cial+0.748
reated+0.719
hemat+0.704
Neg logit contributions
-0.949
OPLE-0.884
 citiz-0.861
tv-0.854
 refunds-0.836
Token March
Token ID2805
Feature activation+1.28e+00

Pos logit contributions
+0.931
OPLE+0.864
 citiz+0.844
tv+0.831
 refunds+0.811
Neg logit contributions
ophen-0.831
onent-0.815
cial-0.809
reated-0.779
pite-0.760
Token case
Token ID1339
Feature activation-2.40e+00

Pos logit contributions
ophen+0.963
onent+0.948
cial+0.945
reated+0.911
hemat+0.894
Neg logit contributions
-0.905
OPLE-0.830
 citiz-0.809
tv-0.788
 refunds-0.773
Token,
Token ID11
Feature activation-1.41e-01

Pos logit contributions
+1.935
OPLE+1.795
 citiz+1.760
tv+1.742
 refunds+1.703
Neg logit contributions
ophen-1.557
onent-1.523
cial-1.499
reated-1.451
pite-1.418
Token and
Token ID290
Feature activation-3.72e-02

Pos logit contributions
+0.104
OPLE+0.096
 citiz+0.093
tv+0.091
 refunds+0.089
Neg logit contributions
ophen-0.102
onent-0.101
cial-0.100
reated-0.096
hemat-0.095
Token Yuri
Token ID38450
Feature activation-6.68e-01

Pos logit contributions
+0.028
OPLE+0.026
 citiz+0.026
tv+0.025
 refunds+0.025
Neg logit contributions
ophen-0.026
onent-0.025
cial-0.025
reated-0.024
hemat-0.024
Token Kul
Token ID50024
Feature activation-8.25e+00

Pos logit contributions
+0.464
OPLE+0.428
 citiz+0.418
tv+0.411
 refunds+0.399
Neg logit contributions
ophen-0.508
onent-0.500
cial-0.498
reated-0.480
pite-0.469
Tokeniy
Token ID7745
Feature activation-3.09e+00

Pos logit contributions
tv+6.240
+6.188
OPLE+5.969
pour+5.961
wine+5.917
Neg logit contributions
ophen-4.815
onent-4.690
cial-4.631
reated-4.538
pport-4.331
Token
Token ID447
Feature activation-6.93e-01

Pos logit contributions
+2.286
OPLE+2.164
tv+2.157
 citiz+2.119
pour+2.084
Neg logit contributions
ophen-2.074
onent-2.057
cial-2.029
reated-1.973
hemat-1.921
Token
Token ID247
Feature activation-9.04e-01

Pos logit contributions
+0.397
OPLE+0.357
 citiz+0.347
tv+0.338
 refunds+0.328
Neg logit contributions
ophen-0.614
onent-0.605
cial-0.603
reated-0.585
hemat-0.576
Tokens
Token ID82
Feature activation-5.14e-01

Pos logit contributions
+0.598
OPLE+0.548
 citiz+0.533
tv+0.527
 refunds+0.509
Neg logit contributions
ophen-0.717
onent-0.707
cial-0.700
reated-0.679
hemat-0.666
Token case
Token ID1339
Feature activation-2.42e+00

Pos logit contributions
+0.358
OPLE+0.330
 citiz+0.322
tv+0.315
 refunds+0.308
Neg logit contributions
ophen-0.390
onent-0.384
cial-0.382
reated-0.369
hemat-0.362
Token and
Token ID290
Feature activation+5.32e-01

Pos logit contributions
ophen+0.477
onent+0.470
cial+0.468
reated+0.453
hemat+0.445
Neg logit contributions
-0.375
OPLE-0.342
 citiz-0.333
tv-0.325
 refunds-0.317
Token gender
Token ID5279
Feature activation+6.80e-01

Pos logit contributions
ophen+0.347
onent+0.341
cial+0.339
reated+0.325
hemat+0.318
Neg logit contributions
-0.428
OPLE-0.398
 citiz-0.389
tv-0.383
 refunds-0.375
Token identities
Token ID18413
Feature activation-7.95e-01

Pos logit contributions
ophen+0.577
onent+0.568
cial+0.566
reated+0.548
hemat+0.539
Neg logit contributions
-0.417
OPLE-0.379
 citiz-0.367
tv-0.357
 refunds-0.348
Token ©
Token ID10673
Feature activation+1.65e+00

Pos logit contributions
+0.571
OPLE+0.528
 citiz+0.515
tv+0.506
 refunds+0.494
Neg logit contributions
ophen-0.584
onent-0.577
cial-0.573
reated-0.552
hemat-0.541
Token 2014
Token ID1946
Feature activation+1.01e+00

Pos logit contributions
ophen+1.131
onent+1.110
cial+1.104
reated+1.062
hemat+1.038
Neg logit contributions
-1.272
OPLE-1.177
 citiz-1.152
tv-1.125
 refunds-1.105
Token Seth
Token ID20194
Feature activation-8.20e+00

Pos logit contributions
ophen+0.700
onent+0.688
cial+0.685
reated+0.659
hemat+0.644
Neg logit contributions
-0.769
OPLE-0.710
 citiz-0.695
tv-0.680
 refunds-0.667
Token Jam
Token ID9986
Feature activation-3.78e+00

Pos logit contributions
+5.589
tv+5.235
OPLE+5.216
 citiz+5.121
wine+5.079
Neg logit contributions
ophen-5.732
onent-5.684
cial-5.623
reated-5.368
pite-5.286
Tokenison
Token ID1653
Feature activation-1.48e+00

Pos logit contributions
+3.235
OPLE+3.063
tv+3.045
 citiz+2.990
pour+2.979
Neg logit contributions
ophen-2.116
onent-2.052
cial-2.023
reated-1.957
hemat-1.907
Token Rain
Token ID10301
Feature activation-3.56e+00

Pos logit contributions
+1.176
OPLE+1.107
 citiz+1.070
tv+1.068
 refunds+1.039
Neg logit contributions
ophen-0.968
onent-0.956
cial-0.945
reated-0.910
hemat-0.888
Tokeness
Token ID408
Feature activation+1.05e+00

Pos logit contributions
+2.697
OPLE+2.505
tv+2.455
 citiz+2.434
pour+2.388
Neg logit contributions
ophen-2.423
onent-2.394
cial-2.368
reated-2.267
hemat-2.235
Token
Token ID198
Feature activation+2.96e-01

Pos logit contributions
ophen+0.678
onent+0.665
cial+0.662
reated+0.635
hemat+0.620
Neg logit contributions
-0.848
OPLE-0.788
 citiz-0.772
tv-0.757
 refunds-0.743
Token
Token ID198
Feature activation-1.57e-01

Pos logit contributions
+0.220
OPLE+0.206
tv+0.206
pour+0.196
 refunds+0.195
Neg logit contributions
ophen-0.254
onent-0.250
cial-0.247
reated-0.241
pite-0.236
TokenJackson
Token ID31270
Feature activation-4.25e-01

Pos logit contributions
+0.110
OPLE+0.103
tv+0.098
 citiz+0.096
pour+0.094
Neg logit contributions
ophen-0.119
onent-0.117
cial-0.116
reated-0.112
pite-0.110
Token,
Token ID11
Feature activation+3.53e-01

Pos logit contributions
+0.336
OPLE+0.315
 citiz+0.307
tv+0.304
 refunds+0.300
Neg logit contributions
ophen-0.279
onent-0.275
cial-0.273
reated-0.261
hemat-0.258
Token who
Token ID508
Feature activation+4.56e-01

Pos logit contributions
ophen+0.235
onent+0.231
cial+0.229
reated+0.220
hemat+0.216
Neg logit contributions
-0.279
OPLE-0.260
 citiz-0.254
tv-0.251
 refunds-0.246
Token was
Token ID373
Feature activation+4.36e-04

Pos logit contributions
ophen+0.300
onent+0.295
cial+0.293
reated+0.281
hemat+0.277
Neg logit contributions
-0.363
OPLE-0.338
 citiz-0.332
tv-0.326
 refunds-0.322
Token sitting
Token ID5586
Feature activation-8.11e+00

Pos logit contributions
ophen+0.000
onent+0.000
cial+0.000
reated+0.000
hemat+0.000
Neg logit contributions
-0.000
OPLE-0.000
 citiz-0.000
tv-0.000
 refunds-0.000
Token next
Token ID1306
Feature activation+1.04e+00

Pos logit contributions
+4.459
OPLE+4.139
 refunds+4.129
tv+4.116
 citiz+4.103
Neg logit contributions
cial-6.674
ophen-6.652
onent-6.603
hemat-6.384
reated-6.367
Token to
Token ID284
Feature activation+5.42e-02

Pos logit contributions
ophen+0.625
onent+0.612
cial+0.609
reated+0.583
hemat+0.569
Neg logit contributions
-0.889
OPLE-0.832
 citiz-0.812
tv-0.807
 refunds-0.791
Token him
Token ID683
Feature activation+1.64e-01

Pos logit contributions
ophen+0.035
onent+0.035
cial+0.035
reated+0.033
hemat+0.032
Neg logit contributions
-0.043
OPLE-0.041
 citiz-0.040
tv-0.039
 refunds-0.038
Token on
Token ID319
Feature activation-2.13e+00

Pos logit contributions
ophen+0.116
onent+0.115
cial+0.114
reated+0.110
hemat+0.107
Neg logit contributions
-0.123
OPLE-0.114
 citiz-0.111
tv-0.109
 refunds-0.107
Token the
Token ID262
Feature activation-7.00e-02

Pos logit contributions
+1.536
OPLE+1.428
 citiz+1.390
tv+1.375
 refunds+1.339
Neg logit contributions
ophen-1.557
onent-1.536
cial-1.527
reated-1.469
hemat-1.445
Token if
Token ID611
Feature activation+2.27e+00

Pos logit contributions
ophen+0.077
onent+0.076
cial+0.076
reated+0.073
hemat+0.072
Neg logit contributions
-0.074
OPLE-0.068
 citiz-0.067
tv-0.065
 refunds-0.064
Token the
Token ID262
Feature activation+7.54e-01

Pos logit contributions
ophen+1.690
onent+1.665
cial+1.655
reated+1.591
hemat+1.563
Neg logit contributions
-1.629
OPLE-1.482
 citiz-1.446
tv-1.407
 refunds-1.373
Token process
Token ID1429
Feature activation+7.26e-01

Pos logit contributions
ophen+0.593
onent+0.584
cial+0.581
reated+0.561
hemat+0.551
Neg logit contributions
-0.507
OPLE-0.464
 citiz-0.452
tv-0.441
 refunds-0.431
Token was
Token ID373
Feature activation+3.25e-01

Pos logit contributions
ophen+0.461
onent+0.453
cial+0.451
reated+0.431
hemat+0.422
Neg logit contributions
-0.596
OPLE-0.556
 citiz-0.544
tv-0.535
 refunds-0.526
Token more
Token ID517
Feature activation+2.76e+00

Pos logit contributions
ophen+0.206
onent+0.202
cial+0.201
reated+0.192
pite+0.189
Neg logit contributions
-0.266
OPLE-0.249
 citiz-0.243
tv-0.241
 refunds-0.237
Token gradual
Token ID24972
Feature activation+5.27e+00

Pos logit contributions
ophen+1.921
onent+1.888
cial+1.878
reated+1.807
hemat+1.766
Neg logit contributions
-2.098
OPLE-1.943
 citiz-1.898
tv-1.859
 refunds-1.815
Token?
Token ID30
Feature activation+9.57e-01

Pos logit contributions
ophen+2.989
cial+2.954
onent+2.942
reated+2.788
pite+2.687
Neg logit contributions
-4.706
OPLE-4.377
 citiz-4.306
tv-4.139
mbuds-4.073
Token This
Token ID770
Feature activation+1.05e+00

Pos logit contributions
ophen+0.681
onent+0.672
cial+0.668
reated+0.641
hemat+0.630
Neg logit contributions
-0.718
OPLE-0.658
 citiz-0.650
tv-0.627
 refunds-0.618
Token is
Token ID318
Feature activation+3.34e-01

Pos logit contributions
ophen+0.723
onent+0.711
cial+0.707
reated+0.681
hemat+0.666
Neg logit contributions
-0.810
OPLE-0.751
 citiz-0.734
tv-0.722
 refunds-0.708
Token the
Token ID262
Feature activation-9.37e-03

Pos logit contributions
ophen+0.245
onent+0.242
cial+0.240
reated+0.232
hemat+0.227
Neg logit contributions
-0.242
OPLE-0.223
 citiz-0.217
tv-0.213
 refunds-0.208
Token more
Token ID517
Feature activation+2.73e+00

Pos logit contributions
+0.006
OPLE+0.006
 citiz+0.006
tv+0.006
 refunds+0.006
Neg logit contributions
ophen-0.007
onent-0.007
cial-0.007
reated-0.007
hemat-0.007
TokenView
Token ID7680
Feature activation-9.11e-01

Pos logit contributions
+0.429
OPLE+0.383
 citiz+0.368
tv+0.359
 refunds+0.346
Neg logit contributions
ophen-0.787
onent-0.777
cial-0.774
reated-0.753
hemat-0.741
Token and
Token ID290
Feature activation-3.74e-01

Pos logit contributions
+0.664
OPLE+0.613
 citiz+0.599
tv+0.590
 refunds+0.579
Neg logit contributions
ophen-0.662
onent-0.651
cial-0.647
reated-0.623
hemat-0.613
Token just
Token ID655
Feature activation-2.41e+00

Pos logit contributions
+0.268
OPLE+0.248
 citiz+0.242
tv+0.238
 refunds+0.234
Neg logit contributions
ophen-0.276
onent-0.272
cial-0.270
reated-0.261
hemat-0.256
Token have
Token ID423
Feature activation+2.46e-01

Pos logit contributions
+1.622
OPLE+1.504
 citiz+1.459
tv+1.450
 refunds+1.430
Neg logit contributions
ophen-1.847
onent-1.826
cial-1.814
reated-1.743
hemat-1.724
Token some
Token ID617
Feature activation-5.24e-01

Pos logit contributions
ophen+0.183
onent+0.180
cial+0.179
reated+0.173
hemat+0.169
Neg logit contributions
-0.176
OPLE-0.162
 citiz-0.158
tv-0.155
 refunds-0.152
Token accidentally
Token ID14716
Feature activation+4.73e+00

Pos logit contributions
+0.362
OPLE+0.333
 citiz+0.325
tv+0.318
 refunds+0.312
Neg logit contributions
ophen-0.401
onent-0.395
cial-0.393
reated-0.379
hemat-0.373
Token-
Token ID12
Feature activation-4.11e-01

Pos logit contributions
ophen+3.560
cial+3.500
onent+3.494
reated+3.380
pite+3.271
Neg logit contributions
-3.320
OPLE-3.056
 citiz-2.951
tv-2.869
 refunds-2.777
Tokenve
Token ID303
Feature activation+2.11e+00

Pos logit contributions
+0.312
OPLE+0.288
tv+0.283
 citiz+0.281
pour+0.276
Neg logit contributions
ophen-0.283
onent-0.278
cial-0.275
reated-0.264
pite-0.259
Tokengan
Token ID1030
Feature activation-1.14e+00

Pos logit contributions
ophen+1.539
onent+1.516
cial+1.513
reated+1.457
hemat+1.425
Neg logit contributions
-1.536
OPLE-1.417
 citiz-1.383
tv-1.349
 refunds-1.321
Token Heat
Token ID12308
Feature activation+1.97e-01

Pos logit contributions
+0.804
OPLE+0.743
 citiz+0.724
tv+0.712
 refunds+0.698
Neg logit contributions
ophen-0.848
onent-0.835
cial-0.829
reated-0.802
hemat-0.790
Token Wave
Token ID17084
Feature activation+4.79e+00

Pos logit contributions
ophen+0.134
cial+0.132
onent+0.132
reated+0.126
hemat+0.124
Neg logit contributions
-0.152
OPLE-0.140
 citiz-0.137
tv-0.136
 refunds-0.133
Token step
Token ID2239
Feature activation+3.36e-01

Pos logit contributions
ophen+1.411
onent+1.388
cial+1.381
reated+1.327
hemat+1.299
Neg logit contributions
-1.568
OPLE-1.454
 citiz-1.421
tv-1.395
 refunds-1.366
Token would
Token ID561
Feature activation-3.73e+00

Pos logit contributions
ophen+0.224
onent+0.221
cial+0.220
reated+0.212
hemat+0.206
Neg logit contributions
-0.264
OPLE-0.245
 citiz-0.239
tv-0.236
 refunds-0.231
Token produce
Token ID4439
Feature activation+6.87e-01

Pos logit contributions
+2.024
OPLE+1.803
 citiz+1.729
tv+1.713
pour+1.656
Neg logit contributions
ophen-3.347
onent-3.330
cial-3.324
reated-3.238
pite-3.187
Token
Token ID564
Feature activation+5.39e-01

Pos logit contributions
ophen+0.509
onent+0.500
cial+0.498
reated+0.480
hemat+0.471
Neg logit contributions
-0.494
OPLE-0.455
 citiz-0.443
tv-0.434
 refunds-0.424
Token
Token ID250
Feature activation+2.22e-01

Pos logit contributions
ophen+0.508
onent+0.501
cial+0.499
reated+0.486
hemat+0.478
Neg logit contributions
-0.279
OPLE-0.249
 citiz-0.240
tv-0.232
 refunds-0.225
Tokentrans
Token ID7645
Feature activation+4.67e+00

Pos logit contributions
ophen+0.158
onent+0.155
cial+0.154
reated+0.148
hemat+0.145
Neg logit contributions
-0.166
OPLE-0.154
 citiz-0.150
tv-0.148
 refunds-0.144
Tokenuran
Token ID42211
Feature activation-2.88e+00

Pos logit contributions
ophen+3.647
onent+3.600
cial+3.599
reated+3.474
pite+3.407
Neg logit contributions
-3.190
OPLE-2.881
 citiz-2.835
tv-2.672
 refunds-2.622
Tokenics
Token ID873
Feature activation+7.02e-01

Pos logit contributions
+2.278
OPLE+2.159
 citiz+2.073
tv+2.072
pour+2.041
Neg logit contributions
ophen-1.838
onent-1.826
cial-1.799
reated-1.756
pite-1.710
Token
Token ID447
Feature activation-3.02e-01

Pos logit contributions
ophen+0.467
onent+0.459
cial+0.456
reated+0.438
hemat+0.428
Neg logit contributions
-0.557
OPLE-0.518
 citiz-0.506
tv-0.497
 refunds-0.487
Token
Token ID251
Feature activation+1.19e+00

Pos logit contributions
+0.195
OPLE+0.178
 citiz+0.173
tv+0.169
 refunds+0.165
Neg logit contributions
ophen-0.245
onent-0.242
cial-0.240
reated-0.233
hemat-0.228
Token such
Token ID884
Feature activation-4.50e+00

Pos logit contributions
ophen+0.827
onent+0.808
cial+0.807
reated+0.778
hemat+0.761
Neg logit contributions
-0.911
OPLE-0.844
 citiz-0.823
tv-0.804
 refunds-0.789
Tokencus
Token ID9042
Feature activation+1.26e+00

Pos logit contributions
ophen+1.207
onent+1.186
cial+1.184
reated+1.136
hemat+1.115
Neg logit contributions
-1.207
OPLE-1.112
 citiz-1.089
tv-1.057
 refunds-1.040
Token,
Token ID11
Feature activation-3.06e-01

Pos logit contributions
ophen+0.836
onent+0.817
cial+0.817
reated+0.782
hemat+0.768
Neg logit contributions
-1.005
OPLE-0.932
 citiz-0.912
tv-0.892
 refunds-0.875
Token and
Token ID290
Feature activation-1.09e-01

Pos logit contributions
+0.225
OPLE+0.208
 citiz+0.204
tv+0.200
 refunds+0.195
Neg logit contributions
ophen-0.220
onent-0.216
cial-0.215
reated-0.208
pite-0.204
Token My
Token ID2011
Feature activation-4.27e-01

Pos logit contributions
+0.078
OPLE+0.072
 citiz+0.070
tv+0.069
 refunds+0.067
Neg logit contributions
ophen-0.081
onent-0.079
cial-0.079
reated-0.076
pite-0.075
Tokenc
Token ID66
Feature activation+1.46e+00

Pos logit contributions
+0.315
OPLE+0.291
 citiz+0.284
tv+0.281
 refunds+0.273
Neg logit contributions
ophen-0.305
onent-0.300
cial-0.298
reated-0.288
hemat-0.282
Tokenopl
Token ID20106
Feature activation+5.43e+00

Pos logit contributions
ophen+0.674
onent+0.655
cial+0.651
reated+0.612
hemat+0.592
Neg logit contributions
-1.461
OPLE-1.377
 citiz-1.356
tv-1.329
 refunds-1.313
Tokenasma
Token ID11797
Feature activation-1.98e+00

Pos logit contributions
ophen+2.913
onent+2.810
cial+2.786
reated+2.660
hemat+2.606
Neg logit contributions
-5.140
OPLE-4.794
 citiz-4.779
 refunds-4.559
 sugg-4.534
Token hom
Token ID3488
Feature activation+2.83e+00

Pos logit contributions
+1.549
OPLE+1.441
 citiz+1.406
tv+1.390
 refunds+1.360
Neg logit contributions
ophen-1.326
onent-1.308
cial-1.298
reated-1.247
pite-1.221
Tokeninis
Token ID16661
Feature activation-3.37e-01

Pos logit contributions
ophen+2.251
onent+2.215
cial+2.204
reated+2.132
hemat+2.091
Neg logit contributions
-1.889
OPLE-1.725
 citiz-1.686
tv-1.637
 refunds-1.604
Token which
Token ID543
Feature activation+1.59e+00

Pos logit contributions
+0.279
OPLE+0.260
 citiz+0.255
tv+0.250
 refunds+0.246
Neg logit contributions
ophen-0.212
onent-0.208
cial-0.207
reated-0.198
hemat-0.193
Token replace
Token ID6330
Feature activation-3.65e-01

Pos logit contributions
ophen+1.166
onent+1.145
cial+1.142
reated+1.099
hemat+1.077
Neg logit contributions
-1.163
OPLE-1.072
 citiz-1.046
tv-1.023
 refunds-0.999
Token some
Token ID617
Feature activation+3.90e-01

Pos logit contributions
+1.051
OPLE+0.974
 citiz+0.954
tv+0.942
 refunds+0.938
Neg logit contributions
ophen-1.134
onent-1.117
cial-1.101
reated-1.067
hemat-1.051
Token communities
Token ID5348
Feature activation-1.43e+00

Pos logit contributions
ophen+0.294
onent+0.290
cial+0.288
reated+0.278
hemat+0.273
Neg logit contributions
-0.271
OPLE-0.252
 citiz-0.246
tv-0.241
 refunds-0.239
Token seeing
Token ID4379
Feature activation-5.33e-01

Pos logit contributions
+1.002
OPLE+0.917
 citiz+0.897
tv+0.892
 refunds+0.884
Neg logit contributions
ophen-1.072
onent-1.060
cial-1.051
reated-1.019
hemat-1.001
Token double
Token ID4274
Feature activation+1.33e+00

Pos logit contributions
+0.364
OPLE+0.335
 citiz+0.328
 refunds+0.324
tv+0.324
Neg logit contributions
ophen-0.411
onent-0.404
cial-0.400
reated-0.386
hemat-0.380
Token-
Token ID12
Feature activation-1.94e-03

Pos logit contributions
ophen+0.848
onent+0.832
cial+0.827
reated+0.793
hemat+0.774
Neg logit contributions
-1.088
OPLE-1.013
 citiz-0.992
tv-0.974
 refunds-0.956
Tokendigit
Token ID27003
Feature activation+4.89e+00

Pos logit contributions
+0.002
OPLE+0.002
tv+0.002
 citiz+0.002
 refunds+0.002
Neg logit contributions
ophen-0.001
onent-0.001
cial-0.001
reated-0.001
hemat-0.001
Token gains
Token ID8810
Feature activation+1.22e+00

Pos logit contributions
ophen+2.977
cial+2.947
onent+2.936
reated+2.790
hemat+2.726
Neg logit contributions
-4.146
OPLE-3.838
 citiz-3.764
tv-3.657
pour-3.578
Token,
Token ID11
Feature activation-3.56e-02

Pos logit contributions
ophen+0.863
onent+0.848
cial+0.843
reated+0.811
hemat+0.796
Neg logit contributions
-0.921
OPLE-0.852
 citiz-0.833
tv-0.819
 refunds-0.803
Token as
Token ID355
Feature activation-1.92e+00

Pos logit contributions
+0.026
OPLE+0.024
 citiz+0.023
tv+0.023
 refunds+0.023
Neg logit contributions
ophen-0.026
onent-0.026
cial-0.025
reated-0.025
hemat-0.024
Token buyers
Token ID14456
Feature activation+6.29e-01

Pos logit contributions
+1.232
OPLE+1.141
 citiz+1.113
tv+1.100
 refunds+1.085
Neg logit contributions
ophen-1.537
onent-1.519
cial-1.491
reated-1.462
hemat-1.435
Token are
Token ID389
Feature activation+6.32e-01

Pos logit contributions
ophen+0.480
onent+0.473
cial+0.470
reated+0.454
hemat+0.446
Neg logit contributions
-0.434
OPLE-0.400
 citiz-0.389
tv-0.383
 refunds-0.376
Token)
Token ID8
Feature activation+2.27e+00

Pos logit contributions
+1.410
OPLE+1.317
 citiz+1.276
tv+1.269
 refunds+1.246
Neg logit contributions
ophen-1.210
onent-1.188
cial-1.184
reated-1.147
pite-1.120
Token was
Token ID373
Feature activation-2.23e-01

Pos logit contributions
ophen+1.517
cial+1.489
onent+1.486
reated+1.422
hemat+1.392
Neg logit contributions
-1.794
OPLE-1.668
 citiz-1.624
tv-1.586
pour-1.544
Token sent
Token ID1908
Feature activation+2.22e+00

Pos logit contributions
+0.160
OPLE+0.148
 citiz+0.144
tv+0.142
 refunds+0.139
Neg logit contributions
ophen-0.165
onent-0.162
cial-0.161
reated-0.156
hemat-0.152
Token by
Token ID416
Feature activation+1.27e+00

Pos logit contributions
ophen+1.380
onent+1.351
cial+1.349
reated+1.284
hemat+1.255
Neg logit contributions
-1.869
OPLE-1.744
 citiz-1.707
tv-1.672
 refunds-1.635
Token one
Token ID530
Feature activation+1.66e+00

Pos logit contributions
ophen+0.919
onent+0.905
cial+0.902
reated+0.868
hemat+0.849
Neg logit contributions
-0.938
OPLE-0.863
 citiz-0.844
tv-0.826
 refunds-0.804
Token of
Token ID286
Feature activation+5.29e-01

Pos logit contributions
ophen+1.145
onent+1.127
cial+1.122
reated+1.077
hemat+1.053
Neg logit contributions
-1.280
OPLE-1.184
 citiz-1.158
tv-1.134
 refunds-1.108
Token the
Token ID262
Feature activation+4.42e-02

Pos logit contributions
ophen+0.394
onent+0.388
cial+0.387
reated+0.373
hemat+0.365
Neg logit contributions
-0.378
OPLE-0.347
 citiz-0.339
tv-0.331
 refunds-0.322
Token A
Token ID317
Feature activation-2.67e-02

Pos logit contributions
ophen+0.034
onent+0.034
cial+0.033
reated+0.032
hemat+0.032
Neg logit contributions
-0.030
OPLE-0.028
 citiz-0.027
tv-0.027
 refunds-0.026
Token330
Token ID26073
Feature activation-2.51e+00

Pos logit contributions
+0.017
OPLE+0.016
 citiz+0.015
tv+0.015
 refunds+0.014
Neg logit contributions
ophen-0.022
onent-0.022
cial-0.022
reated-0.021
hemat-0.021
Token's
Token ID338
Feature activation-1.05e+00

Pos logit contributions
+1.590
OPLE+1.461
 citiz+1.410
tv+1.391
 refunds+1.366
Neg logit contributions
ophen-2.052
onent-2.026
cial-1.999
reated-1.950
pite-1.917
Token three
Token ID1115
Feature activation+9.22e-01

Pos logit contributions
+0.665
OPLE+0.610
 citiz+0.594
tv+0.582
 refunds+0.574
Neg logit contributions
ophen-0.858
onent-0.845
cial-0.837
reated-0.813
hemat-0.798
Token last
Token ID938
Feature activation+1.99e+00

Pos logit contributions
+0.117
OPLE+0.109
 citiz+0.106
tv+0.105
 refunds+0.103
Neg logit contributions
ophen-0.093
onent-0.091
cial-0.090
reated-0.087
hemat-0.085
Token two
Token ID734
Feature activation+2.03e+00

Pos logit contributions
ophen+1.301
onent+1.278
cial+1.273
reated+1.218
hemat+1.190
Neg logit contributions
-1.596
OPLE-1.483
 citiz-1.451
tv-1.422
 refunds-1.394
Token decades
Token ID4647
Feature activation+1.40e-01

Pos logit contributions
ophen+1.266
onent+1.242
cial+1.236
reated+1.182
hemat+1.154
Neg logit contributions
-1.700
OPLE-1.585
 citiz-1.552
tv-1.525
 refunds-1.496
Token,
Token ID11
Feature activation-1.91e-01

Pos logit contributions
ophen+0.093
onent+0.091
cial+0.091
reated+0.087
hemat+0.085
Neg logit contributions
-0.111
OPLE-0.103
 citiz-0.101
tv-0.099
 refunds-0.097
Token and
Token ID290
Feature activation-1.35e-01

Pos logit contributions
+0.151
OPLE+0.140
 citiz+0.137
tv+0.135
 refunds+0.132
Neg logit contributions
ophen-0.127
onent-0.125
cial-0.124
reated-0.119
hemat-0.117
Token he
Token ID339
Feature activation-6.49e-02

Pos logit contributions
+0.105
OPLE+0.098
 citiz+0.096
tv+0.094
 refunds+0.092
Neg logit contributions
ophen-0.091
onent-0.089
cial-0.089
reated-0.086
hemat-0.084
Token had
Token ID550
Feature activation-7.63e-01

Pos logit contributions
+0.053
OPLE+0.049
 citiz+0.049
tv+0.048
 refunds+0.047
Neg logit contributions
ophen-0.041
onent-0.040
cial-0.040
reated-0.038
hemat-0.038
Token never
Token ID1239
Feature activation+1.24e+00

Pos logit contributions
+0.570
OPLE+0.527
 citiz+0.515
tv+0.510
 refunds+0.502
Neg logit contributions
ophen-0.538
onent-0.530
cial-0.524
reated-0.508
hemat-0.498
Token heard
Token ID2982
Feature activation+1.56e+00

Pos logit contributions
ophen+0.756
onent+0.742
cial+0.738
reated+0.705
hemat+0.687
Neg logit contributions
-1.056
OPLE-0.986
 citiz-0.966
tv-0.948
 refunds-0.931
Token of
Token ID286
Feature activation-1.43e-02

Pos logit contributions
ophen+0.878
cial+0.856
onent+0.854
reated+0.812
pite+0.789
Neg logit contributions
-1.397
OPLE-1.313
 citiz-1.287
tv-1.259
 refunds-1.238
Token the
Token ID262
Feature activation-3.48e-01

Pos logit contributions
+0.011
OPLE+0.010
 citiz+0.010
tv+0.009
 refunds+0.009
Neg logit contributions
ophen-0.010
onent-0.010
cial-0.010
reated-0.010
hemat-0.010
Token a
Token ID257
Feature activation-4.12e-01

Pos logit contributions
+1.682
OPLE+1.556
 citiz+1.520
tv+1.492
 refunds+1.461
Neg logit contributions
ophen-1.713
onent-1.680
cial-1.671
reated-1.610
hemat-1.587
Token reasonable
Token ID6397
Feature activation-1.91e+00

Pos logit contributions
+0.294
OPLE+0.273
 citiz+0.267
tv+0.262
 refunds+0.257
Neg logit contributions
ophen-0.304
onent-0.298
cial-0.296
reated-0.285
hemat-0.282
Token contract
Token ID2775
Feature activation-1.85e+00

Pos logit contributions
+1.180
OPLE+1.087
 citiz+1.062
tv+1.041
 refunds+1.024
Neg logit contributions
ophen-1.582
onent-1.549
cial-1.544
reated-1.489
hemat-1.480
Token for
Token ID329
Feature activation-4.19e-01

Pos logit contributions
+1.413
OPLE+1.306
 citiz+1.285
tv+1.261
 refunds+1.241
Neg logit contributions
ophen-1.275
onent-1.245
cial-1.241
reated-1.195
hemat-1.170
Token the
Token ID262
Feature activation-1.44e-01

Pos logit contributions
+0.318
OPLE+0.296
 citiz+0.289
tv+0.285
 refunds+0.280
Neg logit contributions
ophen-0.290
onent-0.285
cial-0.284
reated-0.273
hemat-0.268
Token next
Token ID1306
Feature activation+1.22e+00

Pos logit contributions
+0.107
OPLE+0.099
 citiz+0.097
tv+0.095
 refunds+0.093
Neg logit contributions
ophen-0.102
onent-0.100
cial-0.100
reated-0.096
hemat-0.095
Token three
Token ID1115
Feature activation+1.16e+00

Pos logit contributions
ophen+0.817
onent+0.800
cial+0.797
reated+0.765
hemat+0.751
Neg logit contributions
-0.959
OPLE-0.892
 citiz-0.873
tv-0.859
 refunds-0.842
Token years
Token ID812
Feature activation-2.63e-01

Pos logit contributions
ophen+0.822
onent+0.807
cial+0.803
reated+0.774
hemat+0.758
Neg logit contributions
-0.875
OPLE-0.809
 citiz-0.791
tv-0.776
 refunds-0.760
Token,
Token ID11
Feature activation-3.53e-01

Pos logit contributions
+0.209
OPLE+0.195
 citiz+0.190
tv+0.187
 refunds+0.183
Neg logit contributions
ophen-0.173
onent-0.170
cial-0.170
reated-0.163
hemat-0.159
Token he
Token ID339
Feature activation+4.09e-01

Pos logit contributions
+0.266
OPLE+0.246
 citiz+0.241
tv+0.237
 refunds+0.232
Neg logit contributions
ophen-0.248
onent-0.244
cial-0.242
reated-0.234
hemat-0.229
Token
Token ID447
Feature activation+6.11e-01

Pos logit contributions
ophen+0.278
onent+0.273
cial+0.272
reated+0.261
hemat+0.256
Neg logit contributions
-0.319
OPLE-0.296
 citiz-0.289
tv-0.284
 refunds-0.278
Token Grey
Token ID13980
Feature activation-1.21e+00

Pos logit contributions
+1.558
OPLE+1.439
 citiz+1.396
tv+1.382
 refunds+1.344
Neg logit contributions
ophen-1.807
onent-1.796
cial-1.776
reated-1.723
hemat-1.694
Token?
Token ID30
Feature activation+9.25e-01

Pos logit contributions
+0.918
OPLE+0.854
 citiz+0.831
tv+0.828
 refunds+0.801
Neg logit contributions
ophen-0.840
onent-0.827
cial-0.821
reated-0.794
hemat-0.777
Token Let
Token ID3914
Feature activation-1.12e-01

Pos logit contributions
ophen+0.683
onent+0.671
cial+0.670
reated+0.644
hemat+0.632
Neg logit contributions
-0.666
OPLE-0.612
 citiz-0.602
tv-0.581
 refunds-0.574
Token us
Token ID514
Feature activation+7.85e-01

Pos logit contributions
+0.077
OPLE+0.070
 citiz+0.068
tv+0.067
 refunds+0.065
Neg logit contributions
ophen-0.086
onent-0.085
cial-0.085
reated-0.082
hemat-0.080
Token know
Token ID760
Feature activation+2.18e+00

Pos logit contributions
ophen+0.743
onent+0.736
cial+0.731
reated+0.710
pite+0.697
Neg logit contributions
-0.405
OPLE-0.359
 citiz-0.350
tv-0.330
 refunds-0.323
Token in
Token ID287
Feature activation+3.73e-01

Pos logit contributions
ophen+1.513
onent+1.498
cial+1.486
reated+1.422
hemat+1.391
Neg logit contributions
-1.656
OPLE-1.526
 citiz-1.494
tv-1.448
 refunds-1.429
Token the
Token ID262
Feature activation-2.65e-01

Pos logit contributions
ophen+0.271
onent+0.267
cial+0.266
reated+0.256
hemat+0.251
Neg logit contributions
-0.273
OPLE-0.252
 citiz-0.246
tv-0.241
 refunds-0.236
Token comments
Token ID3651
Feature activation+1.23e+00

Pos logit contributions
+0.180
OPLE+0.164
 citiz+0.160
tv+0.156
 refunds+0.153
Neg logit contributions
ophen-0.206
onent-0.203
cial-0.202
reated-0.195
hemat-0.191
Token below
Token ID2174
Feature activation-6.71e-01

Pos logit contributions
ophen+0.810
onent+0.797
cial+0.793
reated+0.759
hemat+0.743
Neg logit contributions
-0.990
OPLE-0.918
 citiz-0.899
tv-0.880
 refunds-0.865
Token….
Token ID11580
Feature activation+3.07e+00

Pos logit contributions
+0.542
OPLE+0.506
 citiz+0.495
tv+0.487
 refunds+0.477
Neg logit contributions
ophen-0.436
onent-0.427
cial-0.424
reated-0.408
hemat-0.398
Token<|endoftext|>
Token ID50256
Feature activation+4.56e-01

Pos logit contributions
ophen+1.795
cial+1.750
onent+1.741
reated+1.655
hemat+1.618
Neg logit contributions
-2.679
OPLE-2.483
 citiz-2.456
tv-2.364
 refunds-2.360
Token it
Token ID340
Feature activation-1.16e+00

Pos logit contributions
+0.025
OPLE+0.023
 citiz+0.023
tv+0.022
 refunds+0.022
Neg logit contributions
ophen-0.027
onent-0.026
cial-0.026
reated-0.025
hemat-0.025
Token?
Token ID30
Feature activation+5.75e-01

Pos logit contributions
+0.974
OPLE+0.910
 citiz+0.890
tv+0.879
 refunds+0.861
Neg logit contributions
ophen-0.713
onent-0.701
cial-0.695
reated-0.666
hemat-0.650
Token It
Token ID632
Feature activation-1.35e+00

Pos logit contributions
ophen+0.432
onent+0.425
cial+0.425
reated+0.409
hemat+0.401
Neg logit contributions
-0.407
OPLE-0.374
 citiz-0.366
tv-0.355
 refunds-0.349
Token
Token ID447
Feature activation+3.44e-01

Pos logit contributions
+1.002
OPLE+0.925
 citiz+0.904
tv+0.891
 refunds+0.872
Neg logit contributions
ophen-0.960
onent-0.943
cial-0.938
reated-0.903
hemat-0.883
Token
Token ID247
Feature activation-6.39e-02

Pos logit contributions
ophen+0.298
onent+0.294
cial+0.293
reated+0.284
hemat+0.279
Neg logit contributions
-0.205
OPLE-0.186
 citiz-0.180
tv-0.174
 refunds-0.170
Tokens
Token ID82
Feature activation+5.50e-01

Pos logit contributions
+0.040
OPLE+0.036
 citiz+0.035
tv+0.035
 refunds+0.034
Neg logit contributions
ophen-0.053
onent-0.052
cial-0.052
reated-0.050
hemat-0.049
Token Ins
Token ID7088
Feature activation+3.86e+00

Pos logit contributions
ophen+0.422
onent+0.414
cial+0.414
reated+0.398
hemat+0.392
Neg logit contributions
-0.382
OPLE-0.350
 citiz-0.342
tv-0.332
 refunds-0.324
Tokenane
Token ID1531
Feature activation+2.35e-01

Pos logit contributions
ophen+1.974
onent+1.918
cial+1.916
reated+1.806
pite+1.774
Neg logit contributions
-3.679
OPLE-3.460
 citiz-3.402
tv-3.307
 refunds-3.273
Token Clown
Token ID36887
Feature activation-3.24e+00

Pos logit contributions
ophen+0.176
onent+0.174
cial+0.173
reated+0.167
hemat+0.163
Neg logit contributions
-0.166
OPLE-0.153
 citiz-0.149
tv-0.146
 refunds-0.143
Token Pos
Token ID18574
Feature activation-2.88e+00

Pos logit contributions
+1.433
OPLE+1.251
 citiz+1.197
tv+1.160
 refunds+1.113
Neg logit contributions
ophen-3.287
onent-3.248
cial-3.236
reated-3.156
hemat-3.112
Tokense
Token ID325
Feature activation-2.11e+00

Pos logit contributions
+2.192
OPLE+2.032
 citiz+1.985
tv+1.954
 refunds+1.911
Neg logit contributions
ophen-2.000
onent-1.970
cial-1.956
reated-1.884
hemat-1.847
Token own
Token ID898
Feature activation-8.41e-01

Pos logit contributions
+0.384
OPLE+0.344
 citiz+0.333
tv+0.324
 refunds+0.314
Neg logit contributions
ophen-0.643
onent-0.635
cial-0.633
reated-0.614
hemat-0.604
Token without
Token ID1231
Feature activation+8.17e-01

Pos logit contributions
+0.499
OPLE+0.453
 citiz+0.440
tv+0.429
 refunds+0.418
Neg logit contributions
ophen-0.726
onent-0.715
cial-0.712
reated-0.691
hemat-0.679
Token fish
Token ID5916
Feature activation-1.56e+00

Pos logit contributions
ophen+0.619
onent+0.610
cial+0.607
reated+0.586
hemat+0.574
Neg logit contributions
-0.572
OPLE-0.526
 citiz-0.513
tv-0.502
 refunds-0.491
Token as
Token ID355
Feature activation-1.38e+00

Pos logit contributions
+1.109
OPLE+1.026
 citiz+0.995
tv+0.985
 refunds+0.964
Neg logit contributions
ophen-1.154
onent-1.143
cial-1.133
reated-1.093
hemat-1.069
Token McDonald
Token ID14115
Feature activation+5.56e-01

Pos logit contributions
+0.956
OPLE+0.882
 citiz+0.859
tv+0.844
 refunds+0.828
Neg logit contributions
ophen-1.044
onent-1.032
cial-1.021
reated-0.991
hemat-0.970
Token's
Token ID338
Feature activation-7.68e-01

Pos logit contributions
ophen+0.413
onent+0.406
cial+0.404
reated+0.390
hemat+0.382
Neg logit contributions
-0.398
OPLE-0.366
 citiz-0.357
tv-0.350
 refunds-0.342
Token is
Token ID318
Feature activation+2.81e-01

Pos logit contributions
+0.556
OPLE+0.514
 citiz+0.503
tv+0.493
 refunds+0.492
Neg logit contributions
ophen-0.556
onent-0.551
cial-0.546
reated-0.528
hemat-0.520
Token the
Token ID262
Feature activation-1.16e-01

Pos logit contributions
ophen+0.217
onent+0.214
cial+0.213
reated+0.206
hemat+0.202
Neg logit contributions
-0.192
OPLE-0.176
 citiz-0.171
tv-0.168
 refunds-0.165
Token official
Token ID1743
Feature activation+2.28e+00

Pos logit contributions
+0.084
OPLE+0.078
 citiz+0.076
tv+0.075
 refunds+0.073
Neg logit contributions
ophen-0.084
onent-0.083
cial-0.082
reated-0.079
hemat-0.078
Token chip
Token ID11594
Feature activation-2.83e+00

Pos logit contributions
ophen+1.733
onent+1.706
cial+1.699
reated+1.638
hemat+1.601
Neg logit contributions
-1.590
OPLE-1.459
 citiz-1.421
tv-1.389
 refunds-1.354
Token maker
Token ID16009
Feature activation-1.08e+00

Pos logit contributions
+1.760
OPLE+1.627
 citiz+1.558
tv+1.543
 refunds+1.525
Neg logit contributions
ophen-2.311
onent-2.286
cial-2.267
reated-2.207
hemat-2.179
Token a
Token ID257
Feature activation+4.20e-01

Pos logit contributions
ophen+0.903
onent+0.890
cial+0.886
reated+0.855
hemat+0.837
Neg logit contributions
-0.870
OPLE-0.798
 citiz-0.777
tv-0.761
 refunds-0.739
Token massive
Token ID4858
Feature activation-1.16e-01

Pos logit contributions
ophen+0.376
onent+0.371
cial+0.369
reated+0.358
hemat+0.353
Neg logit contributions
-0.235
OPLE-0.212
 citiz-0.206
tv-0.200
 refunds-0.195
Token traffic
Token ID4979
Feature activation+6.47e-01

Pos logit contributions
+0.074
OPLE+0.068
 citiz+0.067
tv+0.065
 refunds+0.064
Neg logit contributions
ophen-0.094
onent-0.093
cial-0.092
reated-0.089
hemat-0.088
Token jam
Token ID16853
Feature activation-2.85e+00

Pos logit contributions
ophen+0.619
onent+0.612
cial+0.609
reated+0.592
hemat+0.584
Neg logit contributions
-0.322
OPLE-0.287
 citiz-0.277
tv-0.268
 refunds-0.260
Token.
Token ID13
Feature activation-6.43e-01

Pos logit contributions
+2.210
OPLE+2.051
 citiz+2.011
tv+1.988
pour+1.957
Neg logit contributions
ophen-1.916
onent-1.876
cial-1.856
reated-1.790
hemat-1.747
Token Then
Token ID3244
Feature activation-1.78e+00

Pos logit contributions
+0.444
OPLE+0.409
 citiz+0.396
tv+0.392
 refunds+0.381
Neg logit contributions
ophen-0.493
onent-0.485
cial-0.482
reated-0.466
hemat-0.457
Token,
Token ID11
Feature activation-5.66e-01

Pos logit contributions
+1.318
OPLE+1.228
 citiz+1.198
tv+1.174
 refunds+1.156
Neg logit contributions
ophen-1.269
onent-1.253
cial-1.242
reated-1.197
hemat-1.173
Token while
Token ID981
Feature activation-1.29e+00

Pos logit contributions
+0.413
OPLE+0.382
 citiz+0.373
tv+0.365
 refunds+0.358
Neg logit contributions
ophen-0.412
onent-0.405
cial-0.403
reated-0.388
hemat-0.381
Token they
Token ID484
Feature activation+6.96e-01

Pos logit contributions
+0.894
OPLE+0.824
 citiz+0.803
tv+0.785
 refunds+0.773
Neg logit contributions
ophen-0.980
onent-0.965
cial-0.960
reated-0.927
hemat-0.909
Token
Token ID447
Feature activation+5.20e-01

Pos logit contributions
ophen+0.468
onent+0.460
cial+0.457
reated+0.440
hemat+0.430
Neg logit contributions
-0.547
OPLE-0.509
 citiz-0.497
tv-0.487
 refunds-0.476
Token
Token ID247
Feature activation+1.65e-01

Pos logit contributions
ophen+0.462
onent+0.456
cial+0.454
reated+0.441
hemat+0.433
Neg logit contributions
-0.297
OPLE-0.268
 citiz-0.260
tv-0.252
 refunds-0.245
Token bright
Token ID6016
Feature activation+1.26e+00

Pos logit contributions
ophen+0.270
onent+0.266
cial+0.263
reated+0.254
hemat+0.250
Neg logit contributions
-0.267
OPLE-0.246
 citiz-0.242
tv-0.238
 refunds-0.232
Token clear
Token ID1598
Feature activation+3.61e-01

Pos logit contributions
ophen+0.903
onent+0.889
cial+0.884
reated+0.851
hemat+0.833
Neg logit contributions
-0.930
OPLE-0.859
 citiz-0.839
tv-0.824
 refunds-0.805
Token wedge
Token ID35901
Feature activation+1.57e+00

Pos logit contributions
ophen+0.260
onent+0.256
cial+0.254
reated+0.245
pite+0.240
Neg logit contributions
-0.264
OPLE-0.244
 citiz-0.239
tv-0.235
pour-0.229
Token of
Token ID286
Feature activation-2.38e-01

Pos logit contributions
ophen+0.977
onent+0.959
cial+0.953
reated+0.913
hemat+0.892
Neg logit contributions
-1.312
OPLE-1.224
 citiz-1.199
tv-1.180
 refunds-1.159
Token lime
Token ID28738
Feature activation-1.85e+00

Pos logit contributions
+0.174
OPLE+0.161
 citiz+0.157
tv+0.154
 refunds+0.151
Neg logit contributions
ophen-0.172
onent-0.170
cial-0.169
reated-0.163
hemat-0.159
Token rises
Token ID16736
Feature activation-8.86e-01

Pos logit contributions
+1.313
OPLE+1.224
tv+1.195
 citiz+1.193
 refunds+1.165
Neg logit contributions
ophen-1.346
onent-1.327
cial-1.313
reated-1.266
hemat-1.252
Token up
Token ID510
Feature activation-3.39e-01

Pos logit contributions
+0.610
OPLE+0.562
 citiz+0.552
tv+0.542
 refunds+0.532
Neg logit contributions
ophen-0.672
onent-0.663
cial-0.662
reated-0.637
hemat-0.627
Token and
Token ID290
Feature activation-5.49e-02

Pos logit contributions
+0.252
OPLE+0.232
 citiz+0.228
tv+0.225
 refunds+0.220
Neg logit contributions
ophen-0.240
onent-0.237
cial-0.235
reated-0.227
hemat-0.222
Token through
Token ID832
Feature activation+2.26e-01

Pos logit contributions
+0.038
OPLE+0.035
 citiz+0.035
tv+0.034
 refunds+0.033
Neg logit contributions
ophen-0.042
onent-0.041
cial-0.041
reated-0.039
hemat-0.039
Token.
Token ID13
Feature activation-8.57e-01

Pos logit contributions
ophen+0.162
onent+0.160
cial+0.159
reated+0.153
hemat+0.150
Neg logit contributions
-0.168
OPLE-0.155
 citiz-0.151
tv-0.148
 refunds-0.145
Token K
Token ID509
Feature activation-4.23e-01

Pos logit contributions
+0.578
OPLE+0.534
tv+0.520
 citiz+0.510
pour+0.500
Neg logit contributions
ophen-0.663
onent-0.652
cial-0.647
reated-0.629
hemat-0.616
Tokenс
Token ID21727
Feature activation-2.25e+00

Pos logit contributions
+2.988
OPLE+2.714
tv+2.656
 citiz+2.625
 refunds+2.525
Neg logit contributions
ophen-2.880
onent-2.863
cial-2.797
reated-2.726
hemat-2.639
Tokenк
Token ID31583
Feature activation-1.02e+00

Pos logit contributions
+1.423
OPLE+1.266
 citiz+1.237
tv+1.236
pour+1.186
Neg logit contributions
ophen-1.862
onent-1.843
cial-1.832
reated-1.773
hemat-1.741
Tokenа
Token ID16142
Feature activation-2.32e+00

Pos logit contributions
+0.765
OPLE+0.708
 citiz+0.690
tv+0.680
 refunds+0.663
Neg logit contributions
ophen-0.711
onent-0.700
cial-0.698
reated-0.671
hemat-0.657
Tokenя
Token ID40623
Feature activation-2.98e+00

Pos logit contributions
+1.781
OPLE+1.640
 citiz+1.598
tv+1.593
 refunds+1.532
Neg logit contributions
ophen-1.596
onent-1.573
cial-1.560
reated-1.510
pite-1.467
Token
Token ID12466
Feature activation+1.06e-01

Pos logit contributions
+2.371
OPLE+2.190
tv+2.134
 citiz+2.129
 refunds+2.054
Neg logit contributions
ophen-1.957
onent-1.915
cial-1.894
reated-1.846
pite-1.790
Token
Token ID120
Feature activation-1.81e+00

Pos logit contributions
ophen+0.089
onent+0.087
cial+0.087
reated+0.084
hemat+0.083
Neg logit contributions
-0.066
OPLE-0.060
 citiz-0.058
tv-0.057
 refunds-0.055
Tokenе
Token ID16843
Feature activation-2.89e+00

Pos logit contributions
+1.277
OPLE+1.158
tv+1.125
 citiz+1.120
 refunds+1.078
Neg logit contributions
ophen-1.355
onent-1.334
cial-1.325
reated-1.284
hemat-1.257
Tokenд
Token ID43666
Feature activation-1.75e+00

Pos logit contributions
+1.750
OPLE+1.558
tv+1.526
 citiz+1.503
 refunds+1.439
Neg logit contributions
ophen-2.460
onent-2.427
cial-2.392
reated-2.341
hemat-2.287
Tokenа
Token ID16142
Feature activation-1.75e+00

Pos logit contributions
+1.263
OPLE+1.151
tv+1.126
 citiz+1.116
pour+1.074
Neg logit contributions
ophen-1.282
onent-1.260
cial-1.247
reated-1.212
hemat-1.182
Tokenл
Token ID30143
Feature activation+1.76e+00

Pos logit contributions
+1.135
OPLE+1.027
tv+0.994
 citiz+0.988
 refunds+0.946
Neg logit contributions
ophen-1.416
onent-1.395
cial-1.384
reated-1.347
hemat-1.315
Tokenь
Token ID45367
Feature activation-2.24e+00

Pos logit contributions
ophen+1.039
onent+1.021
cial+1.014
reated+0.965
hemat+0.941
Neg logit contributions
-1.523
OPLE-1.428
 citiz-1.402
tv-1.368
 refunds-1.352
Token He
Token ID679
Feature activation-9.12e-02

Pos logit contributions
+0.139
OPLE+0.129
tv+0.126
 citiz+0.124
 refunds+0.120
Neg logit contributions
ophen-0.143
onent-0.141
cial-0.140
reated-0.136
pite-0.133
Token gave
Token ID2921
Feature activation-2.88e-01

Pos logit contributions
+0.069
OPLE+0.064
 citiz+0.063
tv+0.062
 refunds+0.061
Neg logit contributions
ophen-0.063
onent-0.062
cial-0.062
reated-0.059
hemat-0.058
Token up
Token ID510
Feature activation+3.90e-01

Pos logit contributions
+0.203
OPLE+0.187
 citiz+0.183
tv+0.179
 refunds+0.175
Neg logit contributions
ophen-0.216
onent-0.213
cial-0.212
reated-0.205
hemat-0.201
Token soccer
Token ID11783
Feature activation+2.36e-01

Pos logit contributions
ophen+0.290
onent+0.286
cial+0.284
reated+0.274
hemat+0.269
Neg logit contributions
-0.279
OPLE-0.256
 citiz-0.250
tv-0.245
 refunds-0.240
Token for
Token ID329
Feature activation-2.15e-01

Pos logit contributions
ophen+0.167
onent+0.165
cial+0.163
reated+0.158
hemat+0.155
Neg logit contributions
-0.176
OPLE-0.162
 citiz-0.159
tv-0.157
 refunds-0.154
Token good
Token ID922
Feature activation-2.10e+00

Pos logit contributions
+0.147
OPLE+0.136
 citiz+0.132
tv+0.130
 refunds+0.128
Neg logit contributions
ophen-0.164
onent-0.162
cial-0.161
reated-0.156
hemat-0.153
Token when
Token ID618
Feature activation+2.42e+00

Pos logit contributions
+1.554
OPLE+1.426
 citiz+1.405
tv+1.386
 refunds+1.360
Neg logit contributions
ophen-1.495
onent-1.473
cial-1.447
reated-1.413
pite-1.375
Token he
Token ID339
Feature activation+1.06e+00

Pos logit contributions
ophen+1.487
onent+1.457
cial+1.455
reated+1.387
hemat+1.352
Neg logit contributions
-2.045
OPLE-1.900
 citiz-1.864
tv-1.824
pour-1.789
Token was
Token ID373
Feature activation+5.25e-01

Pos logit contributions
ophen+0.698
onent+0.686
cial+0.683
reated+0.654
hemat+0.642
Neg logit contributions
-0.841
OPLE-0.783
 citiz-0.767
tv-0.755
 refunds-0.741
Token about
Token ID546
Feature activation+9.20e-02

Pos logit contributions
ophen+0.340
onent+0.333
cial+0.331
reated+0.317
hemat+0.311
Neg logit contributions
-0.423
OPLE-0.396
 citiz-0.388
tv-0.382
 refunds-0.375
Token 13
Token ID1511
Feature activation+2.74e-01

Pos logit contributions
ophen+0.059
onent+0.058
cial+0.057
reated+0.055
hemat+0.054
Neg logit contributions
-0.074
OPLE-0.070
 citiz-0.068
tv-0.067
 refunds-0.066
Token of
Token ID286
Feature activation+1.53e-01

Pos logit contributions
ophen+0.564
onent+0.553
cial+0.551
reated+0.525
hemat+0.513
Neg logit contributions
-0.800
OPLE-0.747
 citiz-0.732
tv-0.717
 refunds-0.705
Token what
Token ID644
Feature activation-8.65e-01

Pos logit contributions
ophen+0.112
onent+0.110
cial+0.110
reated+0.106
hemat+0.103
Neg logit contributions
-0.111
OPLE-0.103
 citiz-0.100
tv-0.098
 refunds-0.096
Token that
Token ID326
Feature activation+3.45e-01

Pos logit contributions
+0.602
OPLE+0.553
 citiz+0.539
tv+0.528
 refunds+0.516
Neg logit contributions
ophen-0.659
onent-0.649
cial-0.646
reated-0.624
hemat-0.611
Token is
Token ID318
Feature activation+1.34e+00

Pos logit contributions
ophen+0.245
onent+0.241
cial+0.240
reated+0.231
hemat+0.226
Neg logit contributions
-0.259
OPLE-0.239
 citiz-0.234
tv-0.229
 refunds-0.224
Token.
Token ID13
Feature activation-4.45e-01

Pos logit contributions
ophen+0.840
cial+0.823
onent+0.822
reated+0.785
hemat+0.769
Neg logit contributions
-1.106
OPLE-1.034
 citiz-1.013
tv-0.990
 refunds-0.973
Token So
Token ID1406
Feature activation-6.43e+00

Pos logit contributions
+0.265
OPLE+0.242
 citiz+0.232
tv+0.229
 refunds+0.222
Neg logit contributions
ophen-0.382
onent-0.377
cial-0.375
reated-0.364
hemat-0.357
Token yeah
Token ID10194
Feature activation-3.63e+00

Pos logit contributions
+4.047
OPLE+3.850
tv+3.663
 citiz+3.660
 refunds+3.622
Neg logit contributions
ophen-4.972
onent-4.962
cial-4.812
reated-4.776
pite-4.650
Token,
Token ID11
Feature activation-1.08e+00

Pos logit contributions
+2.797
OPLE+2.584
 citiz+2.521
tv+2.496
 refunds+2.439
Neg logit contributions
ophen-2.485
onent-2.456
cial-2.419
reated-2.334
pite-2.284
Token just
Token ID655
Feature activation-2.32e+00

Pos logit contributions
+0.727
OPLE+0.668
 citiz+0.651
tv+0.639
 refunds+0.624
Neg logit contributions
ophen-0.847
onent-0.836
cial-0.829
reated-0.804
hemat-0.787
Token finding
Token ID4917
Feature activation-3.75e+00

Pos logit contributions
+1.551
OPLE+1.427
 citiz+1.384
tv+1.367
 refunds+1.336
Neg logit contributions
ophen-1.809
onent-1.791
cial-1.766
reated-1.711
hemat-1.678
Token things
Token ID1243
Feature activation+1.02e-01

Pos logit contributions
+2.633
OPLE+2.450
 citiz+2.363
tv+2.349
 refunds+2.321
Neg logit contributions
ophen-2.765
onent-2.744
cial-2.700
reated-2.635
pite-2.570
Token blurry
Token ID44701
Feature activation-1.74e+00

Pos logit contributions
+0.526
OPLE+0.482
 citiz+0.472
tv+0.462
 refunds+0.452
Neg logit contributions
ophen-0.603
onent-0.597
cial-0.588
reated-0.572
hemat-0.561
Token photo
Token ID4590
Feature activation+2.14e-02

Pos logit contributions
+1.214
OPLE+1.121
 citiz+1.097
tv+1.078
 refunds+1.049
Neg logit contributions
ophen-1.313
onent-1.293
cial-1.284
reated-1.241
hemat-1.218
Token of
Token ID286
Feature activation+1.29e-01

Pos logit contributions
ophen+0.012
onent+0.012
cial+0.012
reated+0.012
hemat+0.011
Neg logit contributions
-0.019
OPLE-0.017
 citiz-0.017
tv-0.017
 refunds-0.017
Token a
Token ID257
Feature activation-4.41e-01

Pos logit contributions
ophen+0.092
onent+0.090
cial+0.090
reated+0.086
hemat+0.084
Neg logit contributions
-0.097
OPLE-0.090
 citiz-0.088
tv-0.086
 refunds-0.084
Token man
Token ID582
Feature activation-1.33e+00

Pos logit contributions
+0.313
OPLE+0.288
 citiz+0.283
tv+0.276
 refunds+0.270
Neg logit contributions
ophen-0.329
onent-0.325
cial-0.323
reated-0.312
hemat-0.305
Token sitting
Token ID5586
Feature activation-7.67e+00

Pos logit contributions
+0.927
OPLE+0.852
 citiz+0.828
tv+0.818
 refunds+0.795
Neg logit contributions
ophen-1.006
onent-0.993
cial-0.984
reated-0.954
hemat-0.935
Token in
Token ID287
Feature activation-3.71e-01

Pos logit contributions
+4.662
OPLE+4.215
 citiz+4.175
tv+4.153
 refunds+4.059
Neg logit contributions
cial-6.092
ophen-6.075
onent-6.017
reated-5.794
hemat-5.719
Token a
Token ID257
Feature activation-6.49e-01

Pos logit contributions
+0.275
OPLE+0.254
 citiz+0.248
tv+0.243
 refunds+0.237
Neg logit contributions
ophen-0.266
onent-0.262
cial-0.260
reated-0.251
hemat-0.246
Token lawn
Token ID21133
Feature activation-1.71e-01

Pos logit contributions
+0.428
OPLE+0.392
 citiz+0.383
tv+0.375
 refunds+0.365
Neg logit contributions
ophen-0.517
onent-0.509
cial-0.506
reated-0.490
hemat-0.481
Tokenchair
Token ID16337
Feature activation-1.23e+00

Pos logit contributions
+0.099
OPLE+0.090
 citiz+0.087
tv+0.085
 refunds+0.082
Neg logit contributions
ophen-0.149
onent-0.147
cial-0.147
reated-0.142
hemat-0.140
Token.
Token ID13
Feature activation-8.22e-01

Pos logit contributions
+0.934
OPLE+0.862
 citiz+0.844
tv+0.832
 refunds+0.813
Neg logit contributions
ophen-0.853
onent-0.838
cial-0.834
reated-0.800
hemat-0.785
Tokenel
Token ID417
Feature activation-2.07e-02

Pos logit contributions
+0.175
OPLE+0.163
 citiz+0.159
tv+0.157
 refunds+0.154
Neg logit contributions
ophen-0.135
onent-0.132
cial-0.131
reated-0.126
hemat-0.123
Tokenan
Token ID272
Feature activation-1.31e+00

Pos logit contributions
+0.016
OPLE+0.014
 citiz+0.014
tv+0.014
 refunds+0.014
Neg logit contributions
ophen-0.015
onent-0.014
cial-0.014
reated-0.014
hemat-0.013
Token.
Token ID13
Feature activation-9.29e-01

Pos logit contributions
+0.967
OPLE+0.894
 citiz+0.872
tv+0.855
 refunds+0.837
Neg logit contributions
ophen-0.944
onent-0.929
cial-0.924
reated-0.890
hemat-0.871
Token
Token ID198
Feature activation-3.35e-01

Pos logit contributions
+0.625
OPLE+0.576
 citiz+0.554
tv+0.554
 refunds+0.534
Neg logit contributions
ophen-0.727
onent-0.715
cial-0.710
reated-0.690
pite-0.674
Token
Token ID198
Feature activation-2.55e-01

Pos logit contributions
+0.223
OPLE+0.210
tv+0.209
pour+0.199
 refunds+0.197
Neg logit contributions
ophen-0.257
onent-0.252
cial-0.248
reated-0.243
pite-0.238
TokenSo
Token ID2396
Feature activation-5.98e+00

Pos logit contributions
+0.168
OPLE+0.154
 citiz+0.148
tv+0.147
 refunds+0.142
Neg logit contributions
ophen-0.204
onent-0.201
cial-0.200
reated-0.194
hemat-0.190
Token if
Token ID611
Feature activation+7.03e-01

Pos logit contributions
+3.829
OPLE+3.625
tv+3.462
 citiz+3.451
 refunds+3.442
Neg logit contributions
ophen-4.608
onent-4.592
cial-4.454
reated-4.444
hemat-4.311
Token you
Token ID345
Feature activation-8.56e-01

Pos logit contributions
ophen+0.551
onent+0.542
cial+0.541
reated+0.521
hemat+0.513
Neg logit contributions
-0.475
OPLE-0.433
 citiz-0.423
tv-0.412
 refunds-0.402
Token're
Token ID821
Feature activation-1.32e+00

Pos logit contributions
+0.550
OPLE+0.507
 citiz+0.490
tv+0.486
 refunds+0.474
Neg logit contributions
ophen-0.689
onent-0.679
cial-0.676
reated-0.655
pite-0.646
Token interested
Token ID4609
Feature activation-4.47e-01

Pos logit contributions
+0.919
OPLE+0.849
 citiz+0.825
tv+0.815
 refunds+0.794
Neg logit contributions
ophen-0.994
onent-0.981
cial-0.974
reated-0.944
pite-0.923
Token in
Token ID287
Feature activation-1.96e-01

Pos logit contributions
+0.331
OPLE+0.306
 citiz+0.299
tv+0.293
 refunds+0.286
Neg logit contributions
ophen-0.320
onent-0.315
cial-0.314
reated-0.302
hemat-0.296
Token worlds
Token ID11621
Feature activation-1.22e+00

Pos logit contributions
ophen+0.961
onent+0.944
cial+0.942
reated+0.910
hemat+0.891
Neg logit contributions
-0.858
OPLE-0.785
 citiz-0.763
tv-0.746
 refunds-0.727
Token seemed
Token ID3947
Feature activation+1.68e-01

Pos logit contributions
+0.939
OPLE+0.869
 citiz+0.854
tv+0.843
 refunds+0.825
Neg logit contributions
ophen-0.831
onent-0.822
cial-0.813
reated-0.783
pite-0.765
Token to
Token ID284
Feature activation-6.17e-02

Pos logit contributions
ophen+0.120
onent+0.118
cial+0.118
reated+0.113
hemat+0.111
Neg logit contributions
-0.124
OPLE-0.115
 citiz-0.112
tv-0.110
 refunds-0.108
Token stop
Token ID2245
Feature activation+1.30e+00

Pos logit contributions
+0.039
OPLE+0.036
 citiz+0.035
tv+0.034
 refunds+0.033
Neg logit contributions
ophen-0.051
onent-0.050
cial-0.050
reated-0.048
hemat-0.047
Token.
Token ID13
Feature activation-1.02e-01

Pos logit contributions
ophen+0.936
onent+0.920
cial+0.917
reated+0.882
hemat+0.864
Neg logit contributions
-0.963
OPLE-0.889
 citiz-0.868
tv-0.850
 refunds-0.832
Token So
Token ID1406
Feature activation-6.00e+00

Pos logit contributions
+0.071
OPLE+0.066
 citiz+0.064
tv+0.063
 refunds+0.061
Neg logit contributions
ophen-0.077
onent-0.076
cial-0.075
reated-0.073
hemat-0.072
Token much
Token ID881
Feature activation-8.74e-01

Pos logit contributions
+3.952
OPLE+3.792
tv+3.660
 citiz+3.602
 refunds+3.588
Neg logit contributions
onent-4.439
ophen-4.401
cial-4.277
hemat-4.216
reated-4.205
Token so
Token ID523
Feature activation-5.02e+00

Pos logit contributions
+0.631
OPLE+0.583
 citiz+0.569
tv+0.562
 refunds+0.550
Neg logit contributions
ophen-0.639
onent-0.631
cial-0.625
reated-0.604
hemat-0.594
Token,
Token ID11
Feature activation-6.90e-01

Pos logit contributions
+3.236
OPLE+2.989
 citiz+2.911
tv+2.898
 refunds+2.803
Neg logit contributions
onent-3.943
ophen-3.931
cial-3.856
reated-3.719
hemat-3.709
Token that
Token ID326
Feature activation-1.01e+00

Pos logit contributions
+0.466
OPLE+0.428
 citiz+0.417
tv+0.408
 refunds+0.398
Neg logit contributions
ophen-0.539
onent-0.531
cial-0.528
reated-0.511
hemat-0.501
Token we
Token ID356
Feature activation+1.32e+00

Pos logit contributions
+0.730
OPLE+0.677
 citiz+0.659
tv+0.648
 refunds+0.634
Neg logit contributions
ophen-0.745
onent-0.735
cial-0.729
reated-0.705
hemat-0.691
Token training
Token ID3047
Feature activation+1.21e+00

Pos logit contributions
ophen+1.094
onent+1.078
cial+1.071
reated+1.034
hemat+1.013
Neg logit contributions
-1.110
OPLE-1.026
 citiz-1.001
tv-0.983
 refunds-0.961
Token entails
Token ID35445
Feature activation-7.57e-01

Pos logit contributions
ophen+0.866
onent+0.851
cial+0.848
reated+0.815
hemat+0.798
Neg logit contributions
-0.906
OPLE-0.837
 citiz-0.818
tv-0.801
 refunds-0.783
Token.
Token ID13
Feature activation-3.14e-01

Pos logit contributions
+0.637
OPLE+0.593
 citiz+0.583
tv+0.574
 refunds+0.564
Neg logit contributions
ophen-0.465
onent-0.457
cial-0.453
reated-0.434
hemat-0.423
Token
Token ID198
Feature activation-2.42e-01

Pos logit contributions
+0.222
OPLE+0.206
 citiz+0.199
tv+0.197
 refunds+0.191
Neg logit contributions
ophen-0.235
onent-0.231
cial-0.229
reated-0.222
hemat-0.217
Token
Token ID198
Feature activation-2.20e-01

Pos logit contributions
+0.161
OPLE+0.151
tv+0.150
pour+0.144
 refunds+0.142
Neg logit contributions
ophen-0.186
onent-0.183
cial-0.180
reated-0.176
pite-0.172
TokenSo
Token ID2396
Feature activation-5.88e+00

Pos logit contributions
+0.143
OPLE+0.132
tv+0.126
 citiz+0.124
pour+0.121
Neg logit contributions
ophen-0.177
onent-0.176
cial-0.173
reated-0.169
pite-0.165
Token it
Token ID340
Feature activation-1.65e+00

Pos logit contributions
+3.885
OPLE+3.721
tv+3.530
 citiz+3.507
 refunds+3.477
Neg logit contributions
onent-4.350
ophen-4.312
cial-4.194
reated-4.189
hemat-4.086
Token
Token ID447
Feature activation+2.55e-01

Pos logit contributions
+1.271
OPLE+1.174
 citiz+1.150
tv+1.139
 refunds+1.117
Neg logit contributions
ophen-1.122
onent-1.106
cial-1.094
reated-1.054
pite-1.029
Token
Token ID247
Feature activation-2.67e-01

Pos logit contributions
ophen+0.223
onent+0.221
cial+0.220
reated+0.214
hemat+0.209
Neg logit contributions
-0.148
OPLE-0.134
 citiz-0.131
tv-0.126
 refunds-0.123
Tokens
Token ID82
Feature activation+5.22e-01

Pos logit contributions
+0.159
OPLE+0.143
 citiz+0.138
tv+0.138
 refunds+0.133
Neg logit contributions
ophen-0.229
onent-0.227
cial-0.225
reated-0.218
pite-0.214
Token time
Token ID640
Feature activation+6.82e-01

Pos logit contributions
ophen+0.395
onent+0.389
cial+0.387
reated+0.373
hemat+0.366
Neg logit contributions
-0.366
OPLE-0.337
 citiz-0.328
tv-0.321
 refunds-0.313