TOP ACTIVATIONS
MAX = 1.623

) and 70 . 2 ( 1 ) of the Criminal
so - called 501 ( c )( 3 ) nonprofit that
the D ugg ars told Fox News in June that it
laptop , a cooler or AC and a refrigerator it
't take the exchange rate into account , but even adjusting
atch 21 B atch 20 B atch
Get our daily newsletter Upgrade your inbox and get
I was qualified to do both . I did not expect
by either producing a valid Emirates ID or passport with a
t necessarily take into account all of the possible
terms of a qu id pro quo , it
our daily newsletter Upgrade your inbox and get our Daily Dispatch
be dismissed more easily both factors likely play a role
Get our daily newsletter Upgrade your inbox and get
false accounting under Section 83 ( 1 ) of the Crimes
s a little bit of both , but for sure I
of visitors the city should welcome while preserving its historic vibe
use a model ine for Vim before version 5 . 7
recently for his proposal to hand over the maintenance costs ,
our daily newsletter Upgrade your inbox and get our Daily Dispatch

MIN ACTIVATIONS
MIN = -1.690

game ge ek . com / thread / 54 33 95
aj s 41 ), K ond rl í k ,
ley under ground . com / 2016 / 09 / 08
money for convenience in making exact payments ". He observed that
ible - limit - tr ading - sub s idy -
there silently for almost an hour . Then he opened his
and floors me and makes me feel connected to my fellow
change depending on who 's making them , but justice is
k ] }} for k = 1 , 2 , ..
t make the same mistake . The high point
ripping them off . The information in the center
a . dot . gov / people / in jury /
namely Tony Ferguson versus Kh ab ib Nur mag omed
that first became aware in late 2015 of
l inson , John O ' K ane , J ovan
a . dot . gov / pub s / 8 11
:// www . eng in . um ich . edu /.
the defensive line . Bo z eman snaps it and takes
castle . The cracked concrete became slick as a heavy fog
himself . His face cont orted slightly . So

INTERVAL 0.795 - 1.623
CONTAINS 0.548%

The 67 - year - old is now an adviser to
is using an H VM AM I , to help differentiate
can interpret what s going on better or what
for the cuts is a further demonstration of priorities : allegiance
of 15 million days s ocked away and reduces their

INTERVAL -0.033 - 0.795
CONTAINS 54.441%

I think most of us feel an emotional
America . They ll just write off the other
less thing for you to deal with . If you really
the death of the Dru ze police officer .
Sachs Came to Rule the World .

INTERVAL -0.861 - -0.033
CONTAINS 44.663%

complaints against the Trump campaign for solic iting foreign nationals via
I ve h iked in rain fore
Sing , Bird of Pre y have crafted a unique debut
sunk en fireplace . After that , he turned his back
rix bones to the newly discovered million year old specimens .

INTERVAL -1.690 - -0.861
CONTAINS 0.348%

09 campaign , is being re - established in order to
emb ry o <|endoftext|> Over the weekend Fairfax media reported that
She pulls away and I immediately miss the contact of her
the truth prevail . OK , you can stop laughing at
- independence flag - the est el ada - to the
Token)
Token ID8
Feature activation+6.48e-01

Pos logit contributions
)].+5.439
 Direction+4.930
 alike+4.525
 ≡+4.474
 mL+4.402
Neg logit contributions
iris-5.651
thens-5.196
agra-4.991
ements-4.970
rs-4.915
Token and
Token ID290
Feature activation+5.99e-01

Pos logit contributions
)].+3.934
 Direction+3.820
 ≡+3.536
 alike+3.452
aukee+3.321
Neg logit contributions
iris-4.156
thens-3.812
agra-3.616
 Wolver-3.556
rs-3.538
Token 70
Token ID4317
Feature activation+7.72e-01

Pos logit contributions
)].+3.186
 Direction+3.096
 alike+2.837
 ≡+2.765
 mL+2.657
Neg logit contributions
iris-4.325
thens-4.048
agra-3.868
ements-3.772
download-3.737
Token.
Token ID13
Feature activation+9.17e-01

Pos logit contributions
)].+4.612
 Direction+4.432
 alike+4.129
 ≡+3.998
 mL+3.919
Neg logit contributions
iris-5.046
thens-4.586
agra-4.385
rs-4.312
ffield-4.265
Token2
Token ID17
Feature activation+1.07e+00

Pos logit contributions
)].+4.397
 Direction+4.313
 alike+3.768
 ≡+3.657
 mL+3.501
Neg logit contributions
iris-6.867
thens-6.454
agra-6.235
ffield-6.148
ements-6.065
Token(
Token ID7
Feature activation+1.62e+00

Pos logit contributions
)].+6.021
 Direction+5.790
 alike+5.377
 ≡+5.159
 mL+4.934
Neg logit contributions
iris-7.283
thens-6.580
agra-6.292
ements-6.140
 Wolver-6.113
Token1
Token ID16
Feature activation+9.27e-01

Pos logit contributions
)].+7.667
 Direction+6.686
 alike+6.118
 mL+5.911
opez+5.740
Neg logit contributions
iris-10.822
thens-10.309
 Wolver-10.096
agra-9.967
ffield-9.873
Token)
Token ID8
Feature activation+5.63e-01

Pos logit contributions
)].+5.833
 Direction+5.194
 alike+4.868
 ≡+4.725
 mL+4.666
Neg logit contributions
iris-5.917
thens-5.500
agra-5.291
ements-5.191
 Wolver-5.109
Token of
Token ID286
Feature activation+6.91e-01

Pos logit contributions
)].+3.146
 Direction+3.053
 alike+2.844
 ≡+2.786
aukee+2.627
Neg logit contributions
iris-3.939
thens-3.583
agra-3.427
rs-3.416
ements-3.398
Token the
Token ID262
Feature activation+6.29e-01

Pos logit contributions
)].+3.876
 Direction+3.869
 alike+3.508
 ≡+3.496
 mL+3.292
Neg logit contributions
iris-4.644
thens-4.255
download-4.035
rs-4.030
agra-4.030
Token Criminal
Token ID15531
Feature activation+3.18e-02

Pos logit contributions
 Direction+3.197
)].+3.149
 alike+2.810
 ≡+2.770
 Upper+2.575
Neg logit contributions
iris-4.649
thens-4.308
rs-4.098
ffield-4.072
ements-4.068
Token so
Token ID523
Feature activation-2.14e-02

Pos logit contributions
iris+4.569
thens+4.147
rs+4.088
ements+4.001
agra+3.970
Neg logit contributions
)].-4.408
 Direction-4.223
 alike-3.972
 ≡-3.919
aukee-3.912
Token-
Token ID12
Feature activation-5.47e-01

Pos logit contributions
iris+0.105
thens+0.092
rs+0.090
ements+0.087
agra+0.087
Neg logit contributions
)].-0.164
 Direction-0.159
 alike-0.151
aukee-0.150
 ≡-0.150
Tokencalled
Token ID7174
Feature activation-2.51e-01

Pos logit contributions
iris+3.603
thens+3.263
rs+3.222
ements+3.175
agra+3.147
Neg logit contributions
)].-3.328
 Direction-3.143
 alike-2.956
aukee-2.909
 ≡-2.903
Token 501
Token ID24555
Feature activation+5.79e-01

Pos logit contributions
iris+1.599
thens+1.454
rs+1.426
ements+1.408
agra+1.384
Neg logit contributions
)].-1.576
 Direction-1.522
 alike-1.424
 ≡-1.410
aukee-1.387
Token(
Token ID7
Feature activation+9.97e-01

Pos logit contributions
)].+3.645
 Direction+3.536
 ≡+3.327
 alike+3.251
aukee+3.236
Neg logit contributions
iris-3.506
thens-3.280
agra-3.111
rs-3.097
ements-3.087
Tokenc
Token ID66
Feature activation+1.59e+00

Pos logit contributions
)].+4.945
 Direction+4.458
 alike+4.107
 ≡+4.074
 mL+4.019
Neg logit contributions
iris-7.149
thens-6.715
 Wolver-6.614
ffield-6.554
agra-6.374
Token)(
Token ID5769
Feature activation+1.04e+00

Pos logit contributions
)].+8.257
 Direction+7.227
 ≡+6.905
aukee+6.812
 alike+6.774
Neg logit contributions
iris-10.972
thens-10.183
agra-9.807
ffield-9.693
 Wolver-9.635
Token3
Token ID18
Feature activation+6.82e-01

Pos logit contributions
)].+4.273
 Direction+3.849
 ≡+3.590
 alike+3.577
 mL+3.512
Neg logit contributions
iris-8.183
thens-8.001
ements-7.739
agra-7.679
 Wolver-7.671
Token)
Token ID8
Feature activation+5.33e-01

Pos logit contributions
)].+4.379
 Direction+4.078
 alike+3.869
 ≡+3.799
aukee+3.764
Neg logit contributions
iris-4.174
thens-3.861
agra-3.705
ements-3.693
download-3.592
Token nonprofit
Token ID15346
Feature activation+4.62e-01

Pos logit contributions
)].+2.916
 Direction+2.826
 alike+2.637
 ≡+2.560
aukee+2.532
Neg logit contributions
iris-3.655
thens-3.421
ements-3.285
ffield-3.232
agra-3.216
Token that
Token ID326
Feature activation+7.67e-02

Pos logit contributions
)].+2.574
 Direction+2.471
 alike+2.316
 ≡+2.236
 mL+2.184
Neg logit contributions
iris-3.214
thens-3.000
ements-2.883
agra-2.857
ffield-2.830
Token the
Token ID262
Feature activation+5.16e-02

Pos logit contributions
 Direction+2.273
)].+2.269
 alike+2.085
 ≡+1.983
 interviewer+1.950
Neg logit contributions
iris-2.939
thens-2.609
ffield-2.533
agra-2.530
ements-2.489
Token D
Token ID360
Feature activation+2.92e-01

Pos logit contributions
)].+0.303
 Direction+0.296
 alike+0.268
aukee+0.264
 ≡+0.261
Neg logit contributions
iris-0.355
thens-0.318
rs-0.308
agra-0.306
ffield-0.305
Tokenugg
Token ID6837
Feature activation-9.00e-02

Pos logit contributions
)].+1.690
 Direction+1.548
aukee+1.470
opez+1.468
 alike+1.445
Neg logit contributions
iris-1.958
thens-1.774
 Wolver-1.704
ffield-1.703
ements-1.700
Tokenars
Token ID945
Feature activation+1.54e-01

Pos logit contributions
iris+0.369
thens+0.324
download+0.306
rs+0.305
ements+0.303
Neg logit contributions
)].-0.753
 Direction-0.722
 alike-0.695
 ≡-0.688
aukee-0.688
Token told
Token ID1297
Feature activation+4.73e-01

Pos logit contributions
)].+0.863
 Direction+0.809
 alike+0.785
 ≡+0.748
aukee+0.736
Neg logit contributions
iris-1.093
thens-1.015
agra-0.970
download-0.967
rs-0.964
Token Fox
Token ID5426
Feature activation+1.48e+00

Pos logit contributions
)].+2.617
 Direction+2.599
 alike+2.402
 ≡+2.305
 interviewer+2.281
Neg logit contributions
iris-3.340
thens-2.952
agra-2.904
rs-2.874
download-2.871
Token News
Token ID3000
Feature activation+5.94e-01

Pos logit contributions
)].+6.427
 Direction+6.342
 alike+5.945
 interviewer+5.733
 ≡+5.601
Neg logit contributions
iris-11.054
thens-10.075
agra-9.882
ements-9.593
download-9.485
Token in
Token ID287
Feature activation+3.78e-01

Pos logit contributions
)].+3.185
 Direction+3.071
 alike+2.935
 ≡+2.765
 interviewer+2.745
Neg logit contributions
iris-4.288
thens-3.815
agra-3.801
ements-3.731
ffield-3.713
Token June
Token ID2795
Feature activation+7.90e-02

Pos logit contributions
)].+2.492
 Direction+2.407
 alike+2.256
 ≡+2.217
aukee+2.183
Neg logit contributions
iris-2.295
thens-2.016
rs-1.968
agra-1.944
ffield-1.931
Token that
Token ID326
Feature activation+6.72e-02

Pos logit contributions
)].+0.496
 Direction+0.474
 alike+0.453
 ≡+0.439
aukee+0.436
Neg logit contributions
iris-0.508
thens-0.457
agra-0.446
ements-0.441
rs-0.439
Token it
Token ID340
Feature activation-1.67e-01

Pos logit contributions
)].+0.414
 Direction+0.401
 alike+0.378
 ≡+0.365
aukee+0.354
Neg logit contributions
iris-0.443
thens-0.395
agra-0.381
rs-0.381
ements-0.379
Token laptop
Token ID13224
Feature activation+7.82e-01

Pos logit contributions
)].+3.367
 Direction+3.107
 alike+3.016
 ≡+2.933
 mL+2.832
Neg logit contributions
iris-4.390
thens-4.097
rs-3.950
ements-3.917
agra-3.914
Token,
Token ID11
Feature activation+2.37e-01

Pos logit contributions
)].+4.629
 alike+4.368
 Direction+4.362
 ≡+4.101
aukee+3.986
Neg logit contributions
iris-5.073
thens-4.654
ements-4.450
ffield-4.424
rs-4.417
Token a
Token ID257
Feature activation+2.86e-01

Pos logit contributions
)].+1.310
 Direction+1.250
 alike+1.187
 ≡+1.158
aukee+1.140
Neg logit contributions
iris-1.664
thens-1.537
rs-1.479
agra-1.472
ements-1.469
Token cooler
Token ID19346
Feature activation+7.78e-01

Pos logit contributions
)].+1.278
 Direction+1.229
 alike+1.144
 ≡+1.094
aukee+1.081
Neg logit contributions
iris-2.309
thens-2.149
agra-2.081
rs-2.078
ements-2.075
Token or
Token ID393
Feature activation+4.83e-01

Pos logit contributions
)].+5.137
 Direction+4.910
 alike+4.707
 ≡+4.625
aukee+4.576
Neg logit contributions
iris-4.543
thens-4.123
rs-3.975
ements-3.931
agra-3.892
Token AC
Token ID7125
Feature activation+1.47e+00

Pos logit contributions
)].+2.259
 Direction+2.138
 alike+1.980
 ≡+1.937
aukee+1.896
Neg logit contributions
iris-3.791
thens-3.540
rs-3.438
ements-3.411
agra-3.390
Token and
Token ID290
Feature activation+6.04e-02

Pos logit contributions
)].+8.061
 alike+7.487
 Direction+7.447
 ≡+7.130
 mL+6.748
Neg logit contributions
iris-9.206
thens-8.608
agra-8.197
ffield-8.176
ements-8.065
Token a
Token ID257
Feature activation+5.40e-03

Pos logit contributions
)].+0.272
 Direction+0.254
 alike+0.235
 ≡+0.231
aukee+0.228
Neg logit contributions
iris-0.491
thens-0.456
rs-0.447
ements-0.442
agra-0.440
Token refrigerator
Token ID30500
Feature activation+4.84e-01

Pos logit contributions
)].+0.024
 Direction+0.023
 alike+0.021
 ≡+0.020
aukee+0.020
Neg logit contributions
iris-0.044
thens-0.041
rs-0.040
agra-0.040
ements-0.040
Token
Token ID960
Feature activation+4.11e-02

Pos logit contributions
)].+2.953
 Direction+2.818
 alike+2.759
 ≡+2.634
aukee+2.581
Neg logit contributions
iris-3.091
thens-2.808
rs-2.723
ements-2.710
agra-2.707
Tokenit
Token ID270
Feature activation+3.72e-01

Pos logit contributions
)].+0.310
 Direction+0.297
 alike+0.288
 ≡+0.281
aukee+0.279
Neg logit contributions
iris-0.200
thens-0.176
ements-0.173
agra-0.172
 Wolver-0.172
Token't
Token ID470
Feature activation-2.57e-01

Pos logit contributions
iris+1.195
thens+1.096
ffield+1.050
agra+1.046
 Wolver+1.036
Neg logit contributions
)].-1.157
 Direction-1.057
 alike-1.003
 mL-0.964
 ≡-0.947
Token take
Token ID1011
Feature activation+2.34e-01

Pos logit contributions
iris+1.871
thens+1.720
rs+1.675
ements+1.651
agra+1.647
Neg logit contributions
)].-1.395
 Direction-1.320
 alike-1.263
 ≡-1.211
aukee-1.210
Token the
Token ID262
Feature activation+5.57e-01

Pos logit contributions
)].+0.868
 Direction+0.845
 alike+0.788
aukee+0.693
 ≡+0.682
Neg logit contributions
iris-2.074
thens-1.938
rs-1.885
download-1.876
ffield-1.869
Token exchange
Token ID5163
Feature activation-3.25e-01

Pos logit contributions
 Direction+3.048
)].+3.016
 alike+2.831
 interviewer+2.732
 mL+2.659
Neg logit contributions
iris-3.769
thens-3.520
rs-3.382
agra-3.355
download-3.332
Token rate
Token ID2494
Feature activation+6.44e-02

Pos logit contributions
iris+2.028
thens+1.841
rs+1.781
ements+1.760
agra+1.754
Neg logit contributions
)].-2.093
 Direction-2.008
 alike-1.905
 ≡-1.861
aukee-1.848
Token into
Token ID656
Feature activation+1.46e+00

Pos logit contributions
)].+0.336
 Direction+0.324
 alike+0.312
 ≡+0.289
aukee+0.287
Neg logit contributions
iris-0.478
thens-0.442
agra-0.425
rs-0.420
download-0.420
Token account
Token ID1848
Feature activation-2.97e-01

Pos logit contributions
)].+4.497
 Direction+4.332
 alike+3.778
aukee+3.496
 ≡+3.426
Neg logit contributions
iris-12.822
thens-12.191
ements-11.665
rent-11.649
rs-11.630
Token,
Token ID11
Feature activation-5.01e-01

Pos logit contributions
iris+1.882
thens+1.715
rs+1.670
ements+1.647
agra+1.639
Neg logit contributions
)].-1.857
 Direction-1.782
 alike-1.704
aukee-1.657
 ≡-1.647
Token but
Token ID475
Feature activation-5.79e-01

Pos logit contributions
iris+3.101
thens+2.813
rs+2.741
ements+2.691
agra+2.677
Neg logit contributions
)].-3.227
 Direction-3.106
 alike-2.954
 ≡-2.878
aukee-2.866
Token even
Token ID772
Feature activation+2.18e-01

Pos logit contributions
iris+3.995
thens+3.647
rs+3.570
ements+3.512
agra+3.508
Neg logit contributions
)].-3.349
 Direction-3.205
 alike-3.017
 ≡-2.931
aukee-2.897
Token adjusting
Token ID22000
Feature activation-3.83e-01

Pos logit contributions
)].+1.160
 Direction+1.111
 alike+1.064
 ≡+0.994
 mL+0.984
Neg logit contributions
iris-1.602
thens-1.442
agra-1.396
rs-1.390
download-1.389
Tokenatch
Token ID963
Feature activation+1.37e+00

Pos logit contributions
)].+2.997
 Direction+2.634
 alike+2.433
opez+2.344
 mL+2.336
Neg logit contributions
iris-4.084
thens-3.793
 Wolver-3.731
ements-3.596
ffield-3.528
Token 21
Token ID2310
Feature activation+4.87e-02

Pos logit contributions
 Direction+6.727
)].+6.051
 alike+5.950
 ≡+5.589
 Upper+5.344
Neg logit contributions
iris-9.763
thens-8.690
ffield-8.580
ederation-8.414
agra-8.378
Token
Token ID198
Feature activation+4.44e-01

Pos logit contributions
)].+0.383
 Direction+0.366
 alike+0.350
aukee+0.349
 ≡+0.346
Neg logit contributions
iris-0.231
thens-0.203
rs-0.201
download-0.194
ements-0.193
Token
Token ID198
Feature activation+2.42e-01

Pos logit contributions
)].+2.664
 Direction+2.511
 alike+2.357
 mL+2.291
 Upper+2.169
Neg logit contributions
iris-2.939
thens-2.729
ffield-2.669
icter-2.606
agra-2.583
TokenB
Token ID33
Feature activation+6.09e-01

Pos logit contributions
)].+1.507
 Direction+1.433
 alike+1.338
aukee+1.328
 ≡+1.324
Neg logit contributions
iris-1.538
thens-1.398
rs-1.361
ements-1.348
agra-1.340
Tokenatch
Token ID963
Feature activation+1.45e+00

Pos logit contributions
)].+3.199
 Direction+2.814
 alike+2.595
 mL+2.490
opez+2.486
Neg logit contributions
iris-4.408
thens-4.094
 Wolver-4.046
ements-3.882
ffield-3.823
Token 20
Token ID1160
Feature activation+8.36e-02

Pos logit contributions
 Direction+7.452
)].+6.681
 ≡+6.485
 alike+6.396
 Upper+6.041
Neg logit contributions
iris-9.044
thens-8.681
rs-8.641
ffield-8.566
ldon-8.257
Token
Token ID198
Feature activation+4.70e-01

Pos logit contributions
)].+0.656
 Direction+0.629
 alike+0.600
aukee+0.599
 ≡+0.594
Neg logit contributions
iris-0.396
thens-0.349
rs-0.344
download-0.331
ements-0.331
Token
Token ID198
Feature activation+2.42e-01

Pos logit contributions
)].+2.814
 Direction+2.645
 alike+2.482
 mL+2.411
 Upper+2.284
Neg logit contributions
iris-3.116
thens-2.895
ffield-2.837
icter-2.773
agra-2.740
TokenB
Token ID33
Feature activation+6.16e-01

Pos logit contributions
)].+1.493
 Direction+1.416
 alike+1.321
aukee+1.312
 ≡+1.307
Neg logit contributions
iris-1.548
thens-1.410
rs-1.369
ements-1.359
agra-1.352
Tokenatch
Token ID963
Feature activation+9.97e-01

Pos logit contributions
)].+3.239
 Direction+2.856
 alike+2.633
 mL+2.527
opez+2.520
Neg logit contributions
iris-4.452
thens-4.129
 Wolver-4.086
ements-3.914
ffield-3.864
Token
Token ID198
Feature activation-1.95e-01

Pos logit contributions
)].+1.549
 Direction+1.547
 alike+1.430
 ≡+1.418
aukee+1.362
Neg logit contributions
iris-1.488
thens-1.361
ements-1.299
agra-1.293
rs-1.283
Token
Token ID198
Feature activation+3.33e-02

Pos logit contributions
iris+1.110
thens+1.014
agra+0.960
ffield+0.954
ements+0.949
Neg logit contributions
)].-1.368
 Direction-1.319
 alike-1.240
 mL-1.200
 ≡-1.188
TokenGet
Token ID3855
Feature activation-1.67e-01

Pos logit contributions
)].+0.194
 Direction+0.182
 alike+0.169
 ≡+0.165
aukee+0.164
Neg logit contributions
iris-0.226
thens-0.209
ements-0.202
agra-0.200
 Wolver-0.199
Token our
Token ID674
Feature activation+1.62e-01

Pos logit contributions
iris+1.273
thens+1.180
rs+1.154
agra+1.136
ements+1.135
Neg logit contributions
)].-0.834
 Direction-0.795
 alike-0.728
 ≡-0.721
aukee-0.709
Token daily
Token ID4445
Feature activation+8.02e-01

Pos logit contributions
)].+0.633
 Direction+0.610
 ≡+0.535
 alike+0.527
 mL+0.504
Neg logit contributions
iris-1.381
thens-1.321
rs-1.283
agra-1.281
ements-1.264
Token newsletter
Token ID13129
Feature activation+1.44e+00

Pos logit contributions
)].+2.214
 Direction+1.992
 mL+1.822
 ≡+1.699
 interviewer+1.478
Neg logit contributions
iris-7.807
thens-7.332
agra-7.196
ements-6.997
rs-6.965
Token Upgrade
Token ID24236
Feature activation+1.00e+00

Pos logit contributions
)].+1.564
 Direction+0.916
 ≡+0.739
 Upper+0.516
 mL+0.472
Neg logit contributions
iris-15.734
thens-14.537
agra-14.508
ements-14.296
ffield-14.082
Token your
Token ID534
Feature activation+1.24e-01

Pos logit contributions
)].+4.645
 Direction+4.450
 alike+4.046
 Upper+3.765
 ≡+3.757
Neg logit contributions
iris-7.426
thens-6.944
ffield-6.819
agra-6.626
ederation-6.466
Token inbox
Token ID13734
Feature activation+1.42e+00

Pos logit contributions
)].+0.438
 Direction+0.422
 ≡+0.370
 alike+0.359
 mL+0.346
Neg logit contributions
iris-1.099
thens-1.037
agra-1.024
rs-1.006
rent-1.002
Token and
Token ID290
Feature activation+4.06e-01

Pos logit contributions
)].+6.402
 Direction+5.544
 mL+5.358
 ≡+5.249
 alike+5.106
Neg logit contributions
iris-10.602
agra-9.464
thens-9.377
ederation-9.249
ements-9.023
Token get
Token ID651
Feature activation+2.70e-01

Pos logit contributions
)].+2.371
 Direction+2.140
 alike+2.088
 mL+1.993
 ≡+1.982
Neg logit contributions
iris-2.642
thens-2.588
agra-2.499
rs-2.439
ements-2.437
Token I
Token ID314
Feature activation+4.56e-01

Pos logit contributions
)].+1.693
 Direction+1.632
 alike+1.554
 ≡+1.468
aukee+1.448
Neg logit contributions
iris-2.304
thens-2.092
agra-2.022
ements-2.010
ffield-1.993
Token was
Token ID373
Feature activation+3.02e-01

Pos logit contributions
)].+2.438
 Direction+2.278
 alike+2.200
 ≡+2.173
aukee+2.111
Neg logit contributions
iris-3.251
thens-3.034
agra-2.884
ements-2.884
 Wolver-2.856
Token qualified
Token ID10617
Feature activation-1.38e-01

Pos logit contributions
)].+1.704
 Direction+1.649
 alike+1.602
aukee+1.512
 ≡+1.512
Neg logit contributions
iris-2.081
thens-1.881
ffield-1.788
ements-1.787
rs-1.784
Token to
Token ID284
Feature activation+5.12e-01

Pos logit contributions
iris+0.818
thens+0.738
rs+0.719
ements+0.708
agra+0.702
Neg logit contributions
)].-0.923
 Direction-0.887
 alike-0.845
 ≡-0.831
aukee-0.826
Token do
Token ID466
Feature activation+5.06e-01

Pos logit contributions
)].+2.716
 Direction+2.546
 alike+2.437
 ≡+2.348
aukee+2.320
Neg logit contributions
iris-3.728
thens-3.489
ements-3.348
agra-3.289
rs-3.263
Token both
Token ID1111
Feature activation+1.43e+00

Pos logit contributions
)].+2.744
 Direction+2.651
 alike+2.511
aukee+2.423
 ≡+2.375
Neg logit contributions
iris-3.544
thens-3.301
ements-3.181
rs-3.172
ffield-3.133
Token.
Token ID13
Feature activation+2.08e-01

Pos logit contributions
)].+7.296
 Direction+6.965
 alike+6.961
aukee+6.402
 ≡+6.303
Neg logit contributions
iris-9.534
thens-8.822
agra-8.610
ffield-8.371
download-8.291
Token I
Token ID314
Feature activation+2.01e-01

Pos logit contributions
)].+1.103
 Direction+1.096
 alike+0.973
 ≡+0.959
 mL+0.925
Neg logit contributions
iris-1.539
thens-1.420
ffield-1.369
agra-1.365
ements-1.354
Token did
Token ID750
Feature activation+3.88e-01

Pos logit contributions
)].+1.200
 Direction+1.127
 alike+1.107
 ≡+1.065
aukee+1.025
Neg logit contributions
iris-1.338
thens-1.234
ements-1.174
agra-1.171
download-1.162
Token not
Token ID407
Feature activation+8.15e-02

Pos logit contributions
)].+2.270
 Direction+2.153
 alike+2.086
 ≡+1.965
aukee+1.960
Neg logit contributions
iris-2.604
thens-2.415
ements-2.297
agra-2.294
rs-2.289
Token expect
Token ID1607
Feature activation+1.27e-01

Pos logit contributions
)].+0.470
 Direction+0.437
 alike+0.433
 ≡+0.410
aukee+0.401
Neg logit contributions
iris-0.565
thens-0.522
agra-0.501
rs-0.495
ements-0.495
Token by
Token ID416
Feature activation-3.29e-01

Pos logit contributions
iris+0.263
thens+0.242
agra+0.236
rs+0.232
download+0.231
Neg logit contributions
)].-0.214
 Direction-0.202
 alike-0.196
 ≡-0.192
aukee-0.184
Token either
Token ID2035
Feature activation+5.10e-02

Pos logit contributions
iris+2.081
thens+1.883
rs+1.833
agra+1.812
ements+1.807
Neg logit contributions
)].-2.078
 Direction-2.010
 alike-1.913
 ≡-1.875
aukee-1.819
Token producing
Token ID9194
Feature activation+4.09e-01

Pos logit contributions
)].+0.306
 Direction+0.297
 alike+0.281
 ≡+0.276
 mL+0.263
Neg logit contributions
iris-0.337
thens-0.309
agra-0.301
ements-0.295
rs-0.294
Token a
Token ID257
Feature activation+2.27e-01

Pos logit contributions
)].+2.319
 Direction+2.251
 ≡+2.111
 alike+2.054
aukee+1.991
Neg logit contributions
iris-2.745
thens-2.565
agra-2.448
ements-2.428
rs-2.428
Token valid
Token ID4938
Feature activation+2.58e-01

Pos logit contributions
 Direction+1.296
)].+1.295
 ≡+1.198
 alike+1.171
aukee+1.126
Neg logit contributions
iris-1.509
thens-1.415
agra-1.353
rs-1.347
ements-1.340
Token Emirates
Token ID24880
Feature activation+1.43e+00

Pos logit contributions
)].+1.407
 Direction+1.379
 ≡+1.260
 alike+1.245
aukee+1.211
Neg logit contributions
iris-1.789
thens-1.694
agra-1.618
rs-1.616
download-1.603
Token ID
Token ID4522
Feature activation+2.37e-01

Pos logit contributions
 Direction+5.723
)].+5.463
 ≡+4.848
 alike+4.673
aukee+4.427
Neg logit contributions
iris-10.891
thens-10.829
ements-10.231
ffield-10.160
agra-10.084
Token or
Token ID393
Feature activation+2.23e-01

Pos logit contributions
)].+1.462
 Direction+1.404
 ≡+1.325
 alike+1.315
aukee+1.261
Neg logit contributions
iris-1.512
thens-1.398
ements-1.329
rs-1.328
agra-1.316
Token passport
Token ID17981
Feature activation+1.37e-01

Pos logit contributions
)].+1.288
 Direction+1.240
 alike+1.166
 ≡+1.162
aukee+1.105
Neg logit contributions
iris-1.490
thens-1.395
rs-1.342
agra-1.341
ements-1.335
Token with
Token ID351
Feature activation-2.92e-01

Pos logit contributions
)].+0.843
 Direction+0.807
 alike+0.777
 ≡+0.762
aukee+0.722
Neg logit contributions
iris-0.876
thens-0.813
agra-0.772
rs-0.766
ements-0.765
Token a
Token ID257
Feature activation-2.44e-01

Pos logit contributions
iris+1.872
thens+1.706
rs+1.667
ements+1.635
agra+1.630
Neg logit contributions
)].-1.810
 Direction-1.752
 alike-1.636
 ≡-1.622
aukee-1.606
Token
Token ID447
Feature activation+4.18e-02

Pos logit contributions
iris+0.488
thens+0.450
agra+0.436
ffield+0.433
ements+0.430
Neg logit contributions
)].-0.319
 Direction-0.297
 alike-0.275
 ≡-0.264
 mL-0.263
Token
Token ID247
Feature activation-6.90e-02

Pos logit contributions
)].+0.176
 Direction+0.159
 alike+0.147
aukee+0.142
 ≡+0.142
Neg logit contributions
iris-0.361
thens-0.328
agra-0.326
ements-0.323
ffield-0.322
Tokent
Token ID83
Feature activation-8.55e-02

Pos logit contributions
iris+0.491
agra+0.432
thens+0.427
 Wolver+0.427
download+0.420
Neg logit contributions
)].-0.397
 Direction-0.367
 alike-0.344
aukee-0.335
 ≡-0.327
Token necessarily
Token ID6646
Feature activation-2.88e-02

Pos logit contributions
iris+0.619
thens+0.565
rs+0.548
agra+0.542
ements+0.540
Neg logit contributions
)].-0.472
 Direction-0.446
 alike-0.429
 ≡-0.409
aukee-0.404
Token take
Token ID1011
Feature activation+2.90e-01

Pos logit contributions
iris+0.212
thens+0.194
rs+0.189
agra+0.187
ements+0.186
Neg logit contributions
)].-0.155
 Direction-0.146
 alike-0.139
 ≡-0.134
aukee-0.132
Token into
Token ID656
Feature activation+1.43e+00

Pos logit contributions
)].+1.177
 Direction+1.142
 alike+1.071
aukee+0.975
 ≡+0.972
Neg logit contributions
iris-2.471
thens-2.302
ements-2.227
rs-2.219
download-2.195
Token account
Token ID1848
Feature activation-5.10e-02

Pos logit contributions
)].+5.646
 Direction+5.440
 alike+4.977
 ≡+4.846
aukee+4.790
Neg logit contributions
iris-11.405
thens-10.761
ements-10.310
rs-10.302
rent-10.285
Token all
Token ID477
Feature activation+2.88e-02

Pos logit contributions
iris+0.337
thens+0.304
rs+0.294
agra+0.293
ements+0.292
Neg logit contributions
)].-0.309
 Direction-0.297
 alike-0.283
 ≡-0.276
aukee-0.273
Token of
Token ID286
Feature activation-2.72e-02

Pos logit contributions
)].+0.175
 Direction+0.170
 alike+0.161
 ≡+0.157
aukee+0.156
Neg logit contributions
iris-0.189
thens-0.173
rs-0.164
agra-0.164
ements-0.162
Token the
Token ID262
Feature activation+2.79e-02

Pos logit contributions
iris+0.167
thens+0.150
rs+0.147
ements+0.145
agra+0.144
Neg logit contributions
)].-0.176
 Direction-0.169
 alike-0.160
 ≡-0.157
aukee-0.157
Token possible
Token ID1744
Feature activation-2.62e-02

Pos logit contributions
)].+0.165
 Direction+0.158
 alike+0.149
 ≡+0.145
aukee+0.143
Neg logit contributions
iris-0.192
thens-0.174
rs-0.167
agra-0.165
ements-0.165
Token terms
Token ID2846
Feature activation-1.25e-01

Pos logit contributions
)].+0.067
 Direction+0.064
 alike+0.060
 ≡+0.059
aukee+0.058
Neg logit contributions
iris-0.078
thens-0.070
rs-0.068
ements-0.067
agra-0.066
Token of
Token ID286
Feature activation-3.17e-01

Pos logit contributions
iris+0.884
thens+0.818
agra+0.783
ffield+0.776
rs+0.772
Neg logit contributions
)].-0.721
 Direction-0.679
 alike-0.654
aukee-0.602
 mL-0.602
Token a
Token ID257
Feature activation-2.99e-01

Pos logit contributions
iris+2.098
thens+1.901
rs+1.858
ements+1.823
agra+1.814
Neg logit contributions
)].-1.933
 Direction-1.852
 alike-1.742
aukee-1.702
 ≡-1.700
Token qu
Token ID627
Feature activation-6.30e-01

Pos logit contributions
iris+1.923
thens+1.747
rs+1.700
ements+1.662
agra+1.660
Neg logit contributions
)].-1.886
 Direction-1.814
 alike-1.703
 ≡-1.670
aukee-1.656
Tokenid
Token ID312
Feature activation-1.26e-01

Pos logit contributions
iris+3.966
thens+3.580
rs+3.544
ements+3.466
agra+3.442
Neg logit contributions
)].-3.953
 Direction-3.792
 alike-3.577
 ≡-3.529
aukee-3.511
Token pro
Token ID386
Feature activation+1.42e+00

Pos logit contributions
iris+0.739
thens+0.664
rs+0.652
ements+0.637
download+0.632
Neg logit contributions
)].-0.843
 Direction-0.813
 alike-0.769
 ≡-0.760
aukee-0.759
Token quo
Token ID18658
Feature activation-3.73e-01

Pos logit contributions
 Direction+5.338
 alike+5.076
)].+4.748
 ≡+4.368
 mL+4.345
Neg logit contributions
iris-10.553
rs-10.169
thens-10.074
ements-9.941
 Wolver-9.881
Token,
Token ID11
Feature activation+5.43e-02

Pos logit contributions
iris+2.293
thens+2.068
rs+2.019
ements+1.991
agra+1.972
Neg logit contributions
)].-2.432
 Direction-2.335
 alike-2.209
 ≡-2.170
aukee-2.161
Token it
Token ID340
Feature activation+2.62e-01

Pos logit contributions
)].+0.312
 Direction+0.299
 alike+0.282
 ≡+0.273
aukee+0.265
Neg logit contributions
iris-0.376
thens-0.340
rs-0.331
ements-0.330
agra-0.330
Token
Token ID447
Feature activation+1.76e-01

Pos logit contributions
)].+1.521
 alike+1.423
 Direction+1.411
 ≡+1.353
 mL+1.308
Neg logit contributions
iris-1.734
thens-1.582
download-1.544
ements-1.534
rent-1.533
Token
Token ID247
Feature activation+3.92e-02

Pos logit contributions
)].+0.725
 Direction+0.650
 alike+0.615
 ≡+0.581
 mL+0.579
Neg logit contributions
iris-1.513
thens-1.381
agra-1.378
ements-1.366
ffield-1.364
Token our
Token ID674
Feature activation+1.62e-01

Pos logit contributions
iris+1.273
thens+1.180
rs+1.154
agra+1.136
ements+1.135
Neg logit contributions
)].-0.834
 Direction-0.795
 alike-0.728
 ≡-0.721
aukee-0.709
Token daily
Token ID4445
Feature activation+8.02e-01

Pos logit contributions
)].+0.633
 Direction+0.610
 ≡+0.535
 alike+0.527
 mL+0.504
Neg logit contributions
iris-1.381
thens-1.321
rs-1.283
agra-1.281
ements-1.264
Token newsletter
Token ID13129
Feature activation+1.44e+00

Pos logit contributions
)].+2.214
 Direction+1.992
 mL+1.822
 ≡+1.699
 interviewer+1.478
Neg logit contributions
iris-7.807
thens-7.332
agra-7.196
ements-6.997
rs-6.965
Token Upgrade
Token ID24236
Feature activation+1.00e+00

Pos logit contributions
)].+1.564
 Direction+0.916
 ≡+0.739
 Upper+0.516
 mL+0.472
Neg logit contributions
iris-15.734
thens-14.537
agra-14.508
ements-14.296
ffield-14.082
Token your
Token ID534
Feature activation+1.24e-01

Pos logit contributions
)].+4.645
 Direction+4.450
 alike+4.046
 Upper+3.765
 ≡+3.757
Neg logit contributions
iris-7.426
thens-6.944
ffield-6.819
agra-6.626
ederation-6.466
Token inbox
Token ID13734
Feature activation+1.42e+00

Pos logit contributions
)].+0.438
 Direction+0.422
 ≡+0.370
 alike+0.359
 mL+0.346
Neg logit contributions
iris-1.099
thens-1.037
agra-1.024
rs-1.006
rent-1.002
Token and
Token ID290
Feature activation+4.06e-01

Pos logit contributions
)].+6.402
 Direction+5.544
 mL+5.358
 ≡+5.249
 alike+5.106
Neg logit contributions
iris-10.602
agra-9.464
thens-9.377
ederation-9.249
ements-9.023
Token get
Token ID651
Feature activation+2.70e-01

Pos logit contributions
)].+2.371
 Direction+2.140
 alike+2.088
 mL+1.993
 ≡+1.982
Neg logit contributions
iris-2.642
thens-2.588
agra-2.499
rs-2.439
ements-2.437
Token our
Token ID674
Feature activation+3.09e-01

Pos logit contributions
)].+1.131
 Direction+1.052
 alike+1.008
 ≡+0.930
 mL+0.891
Neg logit contributions
iris-2.281
thens-2.143
agra-2.066
rs-2.027
download-2.024
Token Daily
Token ID6714
Feature activation+6.90e-01

Pos logit contributions
)].+0.924
 Direction+0.873
 alike+0.746
 ≡+0.690
 mL+0.688
Neg logit contributions
iris-2.919
thens-2.801
agra-2.718
rs-2.706
ements-2.706
Token Dispatch
Token ID35934
Feature activation+1.08e+00

Pos logit contributions
)].+2.359
 Direction+2.220
 mL+1.844
 alike+1.808
 Piece+1.774
Neg logit contributions
iris-6.401
agra-5.939
thens-5.865
ffield-5.586
ements-5.560
Token be
Token ID307
Feature activation+2.28e-01

Pos logit contributions
)].+0.731
 Direction+0.669
 alike+0.661
 ≡+0.634
aukee+0.619
Neg logit contributions
iris-0.784
thens-0.729
agra-0.692
rs-0.692
ements-0.686
Token dismissed
Token ID11126
Feature activation+4.46e-01

Pos logit contributions
)].+1.354
 Direction+1.288
 alike+1.272
aukee+1.194
 ≡+1.183
Neg logit contributions
iris-1.516
thens-1.411
download-1.335
ements-1.309
rs-1.300
Token more
Token ID517
Feature activation+4.77e-02

Pos logit contributions
)].+2.492
 alike+2.386
 Direction+2.304
 ≡+2.219
aukee+2.214
Neg logit contributions
iris-3.034
thens-2.836
download-2.696
agra-2.684
rent-2.673
Token easily
Token ID3538
Feature activation+1.94e-01

Pos logit contributions
)].+0.251
 Direction+0.234
 alike+0.232
aukee+0.219
 ≡+0.218
Neg logit contributions
iris-0.352
thens-0.325
agra-0.313
rs-0.311
download-0.311
Token
Token ID960
Feature activation+3.21e-01

Pos logit contributions
)].+1.127
 Direction+1.066
 alike+1.029
 ≡+0.991
aukee+0.983
Neg logit contributions
iris-1.321
thens-1.210
ements-1.170
agra-1.167
rs-1.155
Tokenboth
Token ID16885
Feature activation+1.41e+00

Pos logit contributions
)].+2.017
 Direction+1.945
 alike+1.863
aukee+1.833
 ≡+1.801
Neg logit contributions
iris-1.923
thens-1.776
agra-1.696
ements-1.694
ffield-1.676
Token factors
Token ID5087
Feature activation+1.21e+00

Pos logit contributions
)].+7.344
 alike+7.184
 Direction+7.007
aukee+6.527
 ≡+6.430
Neg logit contributions
iris-9.152
thens-8.614
download-8.202
agra-8.194
ffield-7.871
Token likely
Token ID1884
Feature activation+7.89e-01

Pos logit contributions
)].+6.283
 alike+5.819
 Direction+5.770
 ≡+5.320
 mL+5.143
Neg logit contributions
iris-8.552
thens-7.829
agra-7.646
download-7.629
ements-7.544
Token play
Token ID711
Feature activation+1.35e+00

Pos logit contributions
)].+4.461
 alike+4.053
 Direction+4.038
 ≡+3.799
aukee+3.761
Neg logit contributions
iris-5.456
thens-5.058
download-4.849
agra-4.833
 Wolver-4.801
Token a
Token ID257
Feature activation+1.14e+00

Pos logit contributions
)].+6.171
 alike+5.835
 Direction+5.745
aukee+5.389
 ≡+5.228
Neg logit contributions
iris-10.090
thens-9.401
download-8.945
agra-8.862
ffield-8.851
Token role
Token ID2597
Feature activation+6.69e-01

Pos logit contributions
)].+4.333
 Direction+4.222
 alike+3.886
aukee+3.438
 mL+3.426
Neg logit contributions
iris-9.634
thens-9.032
download-8.614
ffield-8.591
agra-8.480
Token
Token ID198
Feature activation-1.47e-02

Pos logit contributions
)].+1.005
 Direction+0.990
 alike+0.905
 ≡+0.896
 mL+0.860
Neg logit contributions
iris-1.123
thens-1.026
ffield-0.990
ements-0.984
agra-0.980
Token
Token ID198
Feature activation+4.57e-02

Pos logit contributions
iris+0.088
thens+0.081
ffield+0.077
agra+0.077
ements+0.075
Neg logit contributions
)].-0.099
 Direction-0.095
 alike-0.089
 mL-0.086
 ≡-0.084
TokenGet
Token ID3855
Feature activation-1.59e-01

Pos logit contributions
)].+0.259
 Direction+0.240
 alike+0.223
aukee+0.218
 ≡+0.218
Neg logit contributions
iris-0.317
thens-0.292
 Wolver-0.284
ements-0.283
ffield-0.279
Token our
Token ID674
Feature activation+2.60e-01

Pos logit contributions
iris+1.221
thens+1.135
rs+1.105
download+1.090
agra+1.088
Neg logit contributions
)].-0.777
 Direction-0.742
 ≡-0.675
 alike-0.673
aukee-0.658
Token daily
Token ID4445
Feature activation+9.06e-01

Pos logit contributions
)].+0.955
 Direction+0.915
 ≡+0.796
 mL+0.764
 alike+0.763
Neg logit contributions
iris-2.275
thens-2.171
rs-2.112
agra-2.098
ements-2.093
Token newsletter
Token ID13129
Feature activation+1.41e+00

Pos logit contributions
)].+2.321
 Direction+2.071
 mL+1.938
 ≡+1.721
 interviewer+1.563
Neg logit contributions
iris-8.977
thens-8.457
agra-8.181
ements-8.058
rs-8.006
Token Upgrade
Token ID24236
Feature activation+1.03e+00

Pos logit contributions
)].+1.275
 Direction+0.518
 ≡+0.424
 mL+0.292
 Upper+0.237
Neg logit contributions
iris-15.761
thens-14.562
agra-14.507
ements-14.424
ffield-14.121
Token your
Token ID534
Feature activation+1.14e-01

Pos logit contributions
)].+4.635
 Direction+4.460
 alike+4.039
 Upper+3.774
 ≡+3.682
Neg logit contributions
iris-7.641
thens-7.182
ffield-7.122
agra-6.837
ederation-6.702
Token inbox
Token ID13734
Feature activation+1.37e+00

Pos logit contributions
)].+0.401
 Direction+0.386
 ≡+0.333
 alike+0.326
 mL+0.314
Neg logit contributions
iris-1.022
thens-0.964
agra-0.951
rs-0.935
rent-0.930
Token and
Token ID290
Feature activation+4.34e-01

Pos logit contributions
)].+6.297
 Direction+5.361
 mL+5.208
 ≡+5.019
 alike+4.958
Neg logit contributions
iris-10.260
thens-9.103
agra-9.071
ederation-8.953
 Wolver-8.815
Token get
Token ID651
Feature activation+3.04e-01

Pos logit contributions
)].+2.477
 Direction+2.220
 alike+2.189
 mL+2.091
 ≡+2.056
Neg logit contributions
iris-2.868
thens-2.820
agra-2.713
rs-2.644
ements-2.639
Token false
Token ID3991
Feature activation+9.01e-02

Pos logit contributions
iris+0.143
thens+0.130
rs+0.124
agra+0.122
ements+0.122
Neg logit contributions
)].-0.156
 Direction-0.152
 alike-0.142
 ≡-0.140
aukee-0.137
Token accounting
Token ID14317
Feature activation-1.62e-01

Pos logit contributions
)].+0.530
 Direction+0.516
 alike+0.475
 ≡+0.471
aukee+0.461
Neg logit contributions
iris-0.604
thens-0.559
rs-0.540
agra-0.538
download-0.523
Token under
Token ID739
Feature activation-2.59e-01

Pos logit contributions
iris+1.008
thens+0.912
rs+0.881
agra+0.873
download+0.866
Neg logit contributions
)].-1.051
 Direction-1.008
 alike-0.953
 ≡-0.933
aukee-0.927
Token Section
Token ID7275
Feature activation+7.96e-01

Pos logit contributions
iris+1.550
thens+1.395
rs+1.351
agra+1.333
ements+1.330
Neg logit contributions
)].-1.729
 Direction-1.688
 alike-1.572
 ≡-1.554
aukee-1.526
Token 83
Token ID9698
Feature activation+7.98e-01

Pos logit contributions
)].+4.253
 Direction+4.163
 alike+3.679
 ≡+3.640
 mL+3.590
Neg logit contributions
iris-5.636
thens-5.262
ffield-5.030
agra-5.011
ements-4.948
Token(
Token ID7
Feature activation+1.40e+00

Pos logit contributions
)].+4.565
 Direction+4.451
 alike+4.062
 ≡+3.995
 mL+3.860
Neg logit contributions
iris-5.377
thens-4.927
agra-4.681
ffield-4.569
rs-4.557
Token1
Token ID16
Feature activation+8.38e-01

Pos logit contributions
)].+6.346
 Direction+5.546
 alike+4.968
 mL+4.846
opez+4.725
Neg logit contributions
iris-9.787
thens-9.393
 Wolver-9.126
agra-9.007
ffield-8.980
Token)
Token ID8
Feature activation+4.83e-01

Pos logit contributions
)].+5.437
 Direction+4.990
 alike+4.607
 ≡+4.543
 mL+4.475
Neg logit contributions
iris-5.203
thens-4.774
agra-4.562
ements-4.500
rs-4.468
Token of
Token ID286
Feature activation+4.28e-01

Pos logit contributions
)].+2.775
 Direction+2.705
 alike+2.478
 ≡+2.464
aukee+2.364
Neg logit contributions
iris-3.324
thens-3.021
agra-2.873
ements-2.868
rs-2.865
Token the
Token ID262
Feature activation+6.15e-01

Pos logit contributions
)].+2.593
 Direction+2.545
 ≡+2.333
 alike+2.330
 mL+2.229
Neg logit contributions
iris-2.741
thens-2.492
rs-2.392
ements-2.385
download-2.369
Token Crimes
Token ID32655
Feature activation-4.70e-03

Pos logit contributions
 Direction+3.096
)].+3.046
 ≡+2.656
 alike+2.651
 Upper+2.484
Neg logit contributions
iris-4.571
thens-4.250
download-4.027
ffield-4.026
rs-4.023
Tokens
Token ID82
Feature activation+6.15e-01

Pos logit contributions
)].+0.727
 Direction+0.698
 alike+0.671
 ≡+0.655
aukee+0.652
Neg logit contributions
iris-0.630
thens-0.548
agra-0.539
download-0.535
 Wolver-0.535
Token a
Token ID257
Feature activation-4.64e-03

Pos logit contributions
)].+3.498
 Direction+3.306
 alike+3.242
 ≡+3.007
aukee+2.989
Neg logit contributions
iris-4.154
thens-3.849
ffield-3.643
download-3.630
ements-3.620
Token little
Token ID1310
Feature activation+5.14e-01

Pos logit contributions
iris+0.031
thens+0.028
rs+0.027
ements+0.027
download+0.027
Neg logit contributions
)].-0.028
 Direction-0.027
 alike-0.026
 ≡-0.025
aukee-0.025
Token bit
Token ID1643
Feature activation+4.42e-01

Pos logit contributions
)].+3.169
 alike+3.001
 Direction+2.987
 ≡+2.715
aukee+2.703
Neg logit contributions
iris-3.337
thens-2.980
rs-2.847
ements-2.841
download-2.823
Token of
Token ID286
Feature activation+8.01e-01

Pos logit contributions
)].+2.558
 alike+2.423
 Direction+2.408
 ≡+2.205
aukee+2.201
Neg logit contributions
iris-3.001
thens-2.744
ements-2.619
rs-2.619
agra-2.610
Token both
Token ID1111
Feature activation+1.40e+00

Pos logit contributions
)].+2.630
 alike+2.347
 Direction+2.333
aukee+1.979
 ≡+1.949
Neg logit contributions
iris-7.288
thens-6.886
rs-6.680
ements-6.677
ffield-6.588
Token,
Token ID11
Feature activation+1.68e-01

Pos logit contributions
)].+6.886
 alike+6.696
 Direction+6.621
 ≡+5.845
 mL+5.796
Neg logit contributions
iris-9.433
thens-8.818
agra-8.313
ffield-8.219
download-8.190
Token but
Token ID475
Feature activation+2.67e-01

Pos logit contributions
)].+0.997
 Direction+0.977
 alike+0.931
 ≡+0.876
 mL+0.863
Neg logit contributions
iris-1.113
thens-1.025
agra-0.976
ements-0.974
rs-0.964
Token for
Token ID329
Feature activation+1.94e-01

Pos logit contributions
)].+1.444
 Direction+1.397
 alike+1.338
 ≡+1.238
aukee+1.219
Neg logit contributions
iris-1.901
thens-1.750
agra-1.694
download-1.682
rs-1.673
Token sure
Token ID1654
Feature activation+3.47e-01

Pos logit contributions
)].+1.092
 Direction+1.046
 alike+0.998
 ≡+0.935
 mL+0.927
Neg logit contributions
iris-1.364
thens-1.232
agra-1.184
download-1.183
rs-1.181
Token I
Token ID314
Feature activation+1.57e-01

Pos logit contributions
)].+1.982
 Direction+1.890
 alike+1.820
 ≡+1.718
aukee+1.692
Neg logit contributions
iris-2.372
thens-2.183
agra-2.117
download-2.080
ements-2.075
Token of
Token ID286
Feature activation+6.70e-01

Pos logit contributions
)].+4.500
 Direction+4.307
 alike+4.055
 ≡+4.008
aukee+3.999
Neg logit contributions
iris-4.441
thens-4.018
rs-3.968
ements-3.881
download-3.844
Token visitors
Token ID9692
Feature activation+2.64e-01

Pos logit contributions
 Direction+3.708
)].+3.652
 alike+3.511
aukee+3.361
 ≡+3.298
Neg logit contributions
iris-4.508
thens-4.255
rs-4.004
agra-3.994
ements-3.990
Token the
Token ID262
Feature activation+2.87e-02

Pos logit contributions
)].+1.517
 Direction+1.482
 alike+1.425
aukee+1.347
 ≡+1.345
Neg logit contributions
iris-1.797
thens-1.669
agra-1.603
ements-1.598
rs-1.575
Token city
Token ID1748
Feature activation-3.77e-02

Pos logit contributions
)].+0.163
 Direction+0.157
 alike+0.149
 ≡+0.144
aukee+0.143
Neg logit contributions
iris-0.201
thens-0.184
rs-0.176
ements-0.175
agra-0.175
Token should
Token ID815
Feature activation+2.38e-01

Pos logit contributions
iris+0.253
thens+0.236
agra+0.226
ements+0.225
rs+0.224
Neg logit contributions
)].-0.222
 Direction-0.210
 alike-0.203
 ≡-0.195
aukee-0.191
Token welcome
Token ID7062
Feature activation+1.39e+00

Pos logit contributions
)].+1.391
 Direction+1.288
 alike+1.258
 ≡+1.204
aukee+1.184
Neg logit contributions
iris-1.613
thens-1.522
rs-1.437
ements-1.430
agra-1.428
Token while
Token ID981
Feature activation+1.37e-01

Pos logit contributions
 alike+6.808
)].+6.328
 Direction+6.208
aukee+5.707
 ≡+5.656
Neg logit contributions
iris-9.388
thens-9.029
agra-8.613
ffield-8.539
ements-8.508
Token preserving
Token ID23934
Feature activation-5.40e-01

Pos logit contributions
)].+0.794
 Direction+0.764
 alike+0.738
aukee+0.700
 ≡+0.697
Neg logit contributions
iris-0.936
thens-0.851
agra-0.825
rs-0.820
ements-0.815
Token its
Token ID663
Feature activation-8.60e-02

Pos logit contributions
iris+3.167
thens+2.848
rs+2.797
ements+2.732
agra+2.713
Neg logit contributions
)].-3.643
 Direction-3.505
 alike-3.320
 ≡-3.272
aukee-3.269
Token historic
Token ID9566
Feature activation+1.28e-01

Pos logit contributions
iris+0.620
thens+0.576
rs+0.552
agra+0.548
download+0.545
Neg logit contributions
)].-0.471
 Direction-0.446
 alike-0.422
aukee-0.409
 ≡-0.406
Token vibe
Token ID27426
Feature activation+7.55e-01

Pos logit contributions
)].+0.694
 Direction+0.661
 alike+0.628
aukee+0.608
 ≡+0.595
Neg logit contributions
iris-0.919
thens-0.862
agra-0.826
rs-0.812
ements-0.810
Token use
Token ID779
Feature activation-3.15e-01

Pos logit contributions
iris+0.124
thens+0.113
rs+0.111
agra+0.109
 Wolver+0.108
Neg logit contributions
)].-0.102
 Direction-0.097
 alike-0.091
 ≡-0.090
 mL-0.086
Token a
Token ID257
Feature activation-1.19e-01

Pos logit contributions
iris+2.202
thens+2.015
rs+1.989
ements+1.951
agra+1.937
Neg logit contributions
)].-1.767
 Direction-1.686
 alike-1.575
 ≡-1.551
aukee-1.549
Token model
Token ID2746
Feature activation+2.20e-01

Pos logit contributions
iris+0.862
thens+0.794
rs+0.773
ements+0.760
agra+0.758
Neg logit contributions
)].-0.650
 Direction-0.624
 alike-0.580
 ≡-0.573
aukee-0.556
Tokenine
Token ID500
Feature activation+3.16e-01

Pos logit contributions
)].+1.163
 Direction+1.067
 alike+1.025
 ≡+0.973
 mL+0.951
Neg logit contributions
iris-1.623
thens-1.520
 Wolver-1.464
rs-1.457
agra-1.439
Token for
Token ID329
Feature activation+1.37e-01

Pos logit contributions
)].+1.901
 Direction+1.809
 alike+1.754
 ≡+1.704
 mL+1.622
Neg logit contributions
iris-2.063
thens-1.898
rs-1.822
ements-1.808
ffield-1.806
Token Vim
Token ID36645
Feature activation+1.39e+00

Pos logit contributions
)].+0.659
 Direction+0.628
 alike+0.577
 ≡+0.567
 mL+0.538
Neg logit contributions
iris-1.072
thens-0.991
rs-0.965
agra-0.953
ffield-0.950
Token before
Token ID878
Feature activation+2.34e-01

Pos logit contributions
)].+6.318
 Direction+6.227
 ≡+5.714
 alike+5.652
 mL+5.367
Neg logit contributions
iris-10.024
thens-9.131
ffield-8.891
 Wolver-8.781
rs-8.720
Token version
Token ID2196
Feature activation+3.53e-01

Pos logit contributions
)].+1.117
 Direction+1.064
 ≡+0.964
 alike+0.957
 mL+0.908
Neg logit contributions
iris-1.836
thens-1.689
rs-1.652
agra-1.630
ffield-1.624
Token 5
Token ID642
Feature activation+3.43e-02

Pos logit contributions
)].+1.853
 Direction+1.769
 ≡+1.665
 alike+1.664
 mL+1.549
Neg logit contributions
iris-2.584
thens-2.355
rs-2.287
 Wolver-2.273
ffield-2.263
Token.
Token ID13
Feature activation+6.12e-01

Pos logit contributions
)].+0.209
 Direction+0.200
 alike+0.189
 ≡+0.187
aukee+0.181
Neg logit contributions
iris-0.223
thens-0.204
rs-0.197
ements-0.195
agra-0.194
Token7
Token ID22
Feature activation+3.54e-01

Pos logit contributions
)].+3.079
 Direction+2.921
 alike+2.716
 ≡+2.609
 mL+2.550
Neg logit contributions
iris-4.458
thens-4.178
ffield-4.078
agra-4.049
 Wolver-3.967
Token recently
Token ID2904
Feature activation-1.31e-01

Pos logit contributions
iris+1.644
thens+1.508
rs+1.462
ements+1.448
agra+1.440
Neg logit contributions
)].-1.563
 Direction-1.494
 alike-1.426
 ≡-1.391
aukee-1.390
Token for
Token ID329
Feature activation-1.44e-01

Pos logit contributions
iris+0.811
thens+0.747
rs+0.712
ements+0.712
agra+0.703
Neg logit contributions
)].-0.837
 Direction-0.801
 alike-0.774
aukee-0.747
 ≡-0.747
Token his
Token ID465
Feature activation-5.41e-01

Pos logit contributions
iris+0.945
thens+0.856
rs+0.828
ements+0.820
agra+0.809
Neg logit contributions
)].-0.892
 Direction-0.852
 alike-0.806
aukee-0.789
 ≡-0.789
Token proposal
Token ID6961
Feature activation-2.76e-02

Pos logit contributions
iris+3.559
thens+3.239
rs+3.188
ements+3.123
agra+3.100
Neg logit contributions
)].-3.266
 Direction-3.128
 alike-2.933
 ≡-2.897
aukee-2.888
Token to
Token ID284
Feature activation+2.25e-01

Pos logit contributions
iris+0.163
thens+0.147
rs+0.143
ements+0.141
agra+0.139
Neg logit contributions
)].-0.184
 Direction-0.177
 alike-0.168
 ≡-0.166
aukee-0.166
Token hand
Token ID1021
Feature activation+1.38e+00

Pos logit contributions
)].+1.286
 Direction+1.216
 alike+1.167
 ≡+1.143
aukee+1.115
Neg logit contributions
iris-1.576
thens-1.438
ements-1.380
rs-1.375
agra-1.348
Token over
Token ID625
Feature activation+6.13e-01

Pos logit contributions
)].+6.502
 Direction+6.381
 alike+6.078
aukee+6.062
 ≡+5.942
Neg logit contributions
iris-9.676
thens-8.934
rs-8.385
 Wolver-8.371
agra-8.364
Token the
Token ID262
Feature activation+3.29e-01

Pos logit contributions
)].+3.422
 Direction+3.309
 alike+3.091
 ≡+3.047
aukee+2.996
Neg logit contributions
iris-4.241
thens-3.934
rs-3.724
ffield-3.682
agra-3.642
Token maintenance
Token ID9262
Feature activation+1.17e-02

Pos logit contributions
)].+1.876
 Direction+1.807
 ≡+1.664
 alike+1.642
aukee+1.633
Neg logit contributions
iris-2.280
thens-2.113
rs-2.052
ements-1.997
download-1.985
Token costs
Token ID3484
Feature activation+5.99e-01

Pos logit contributions
)].+0.073
 Direction+0.070
 alike+0.065
 ≡+0.065
aukee+0.064
Neg logit contributions
iris-0.075
thens-0.069
rs-0.066
agra-0.066
ements-0.065
Token,
Token ID11
Feature activation+2.34e-02

Pos logit contributions
)].+3.416
 Direction+3.253
 alike+3.153
 ≡+3.039
aukee+2.938
Neg logit contributions
iris-4.002
thens-3.702
agra-3.603
ements-3.516
download-3.501
Token our
Token ID674
Feature activation+2.60e-01

Pos logit contributions
iris+1.221
thens+1.135
rs+1.105
download+1.090
agra+1.088
Neg logit contributions
)].-0.777
 Direction-0.742
 ≡-0.675
 alike-0.673
aukee-0.658
Token daily
Token ID4445
Feature activation+9.06e-01

Pos logit contributions
)].+0.955
 Direction+0.915
 ≡+0.796
 mL+0.764
 alike+0.763
Neg logit contributions
iris-2.275
thens-2.171
rs-2.112
agra-2.098
ements-2.093
Token newsletter
Token ID13129
Feature activation+1.41e+00

Pos logit contributions
)].+2.321
 Direction+2.071
 mL+1.938
 ≡+1.721
 interviewer+1.563
Neg logit contributions
iris-8.977
thens-8.457
agra-8.181
ements-8.058
rs-8.006
Token Upgrade
Token ID24236
Feature activation+1.03e+00

Pos logit contributions
)].+1.275
 Direction+0.518
 ≡+0.424
 mL+0.292
 Upper+0.237
Neg logit contributions
iris-15.761
thens-14.562
agra-14.507
ements-14.424
ffield-14.121
Token your
Token ID534
Feature activation+1.14e-01

Pos logit contributions
)].+4.635
 Direction+4.460
 alike+4.039
 Upper+3.774
 ≡+3.682
Neg logit contributions
iris-7.641
thens-7.182
ffield-7.122
agra-6.837
ederation-6.702
Token inbox
Token ID13734
Feature activation+1.37e+00

Pos logit contributions
)].+0.401
 Direction+0.386
 ≡+0.333
 alike+0.326
 mL+0.314
Neg logit contributions
iris-1.022
thens-0.964
agra-0.951
rs-0.935
rent-0.930
Token and
Token ID290
Feature activation+4.34e-01

Pos logit contributions
)].+6.297
 Direction+5.361
 mL+5.208
 ≡+5.019
 alike+4.958
Neg logit contributions
iris-10.260
thens-9.103
agra-9.071
ederation-8.953
 Wolver-8.815
Token get
Token ID651
Feature activation+3.04e-01

Pos logit contributions
)].+2.477
 Direction+2.220
 alike+2.189
 mL+2.091
 ≡+2.056
Neg logit contributions
iris-2.868
thens-2.820
agra-2.713
rs-2.644
ements-2.639
Token our
Token ID674
Feature activation+3.59e-01

Pos logit contributions
)].+1.274
 Direction+1.155
 alike+1.115
 ≡+1.017
 mL+0.983
Neg logit contributions
iris-2.595
thens-2.433
agra-2.354
rs-2.311
download-2.303
Token Daily
Token ID6714
Feature activation+7.07e-01

Pos logit contributions
)].+1.039
 Direction+0.973
 alike+0.832
 mL+0.769
 ≡+0.759
Neg logit contributions
iris-3.426
thens-3.275
agra-3.186
rs-3.171
rent-3.169
Token Dispatch
Token ID35934
Feature activation+1.09e+00

Pos logit contributions
)].+2.313
 Direction+2.175
 mL+1.805
 alike+1.756
 Piece+1.720
Neg logit contributions
iris-6.670
agra-6.176
thens-6.075
ffield-5.832
ements-5.781
Tokengame
Token ID6057
Feature activation+3.52e-01

Pos logit contributions
)].+3.292
 Direction+2.975
 alike+2.851
 mL+2.688
opez+2.660
Neg logit contributions
iris-4.105
thens-3.977
 Wolver-3.866
ffield-3.754
ements-3.708
Tokenge
Token ID469
Feature activation+4.68e-01

Pos logit contributions
)].+1.260
 Direction+1.150
 alike+1.063
 ≡+0.991
aukee+0.984
Neg logit contributions
iris-3.095
thens-2.921
 Wolver-2.846
ements-2.825
rs-2.811
Tokenek
Token ID988
Feature activation+7.01e-01

Pos logit contributions
)].+3.076
 Direction+2.775
aukee+2.700
 alike+2.657
 mL+2.596
Neg logit contributions
iris-2.679
thens-2.565
 Wolver-2.511
ffield-2.301
download-2.298
Token.
Token ID13
Feature activation-2.65e-01

Pos logit contributions
)].+3.291
 Direction+2.721
 alike+2.705
 mL+2.573
opez+2.311
Neg logit contributions
iris-5.444
thens-5.228
ffield-5.114
 Wolver-4.999
agra-4.987
Tokencom
Token ID785
Feature activation-6.00e-01

Pos logit contributions
iris+1.601
thens+1.474
agra+1.399
 Wolver+1.382
ffield+1.380
Neg logit contributions
)].-1.772
 Direction-1.682
 alike-1.596
 ≡-1.529
 mL-1.526
Token/
Token ID14
Feature activation-1.69e+00

Pos logit contributions
iris+3.045
thens+2.702
rs+2.572
ements+2.558
agra+2.532
Neg logit contributions
)].-4.610
 Direction-4.412
 alike-4.199
 ≡-4.121
aukee-4.107
Tokenthread
Token ID16663
Feature activation-5.91e-01

Pos logit contributions
download+6.696
iris+6.565
rs+6.220
rent+5.756
thens+5.710
Neg logit contributions
)].-12.673
 Direction-12.218
aukee-11.603
 ≡-11.570
 alike-11.469
Token/
Token ID14
Feature activation-5.70e-01

Pos logit contributions
iris+3.069
thens+2.721
ements+2.564
rs+2.556
agra+2.554
Neg logit contributions
)].-4.509
 Direction-4.282
 alike-4.079
 ≡-3.957
aukee-3.943
Token54
Token ID4051
Feature activation-9.77e-02

Pos logit contributions
iris+3.527
thens+3.187
rs+3.143
ements+3.071
agra+3.047
Neg logit contributions
)].-3.647
 Direction-3.501
 alike-3.301
 ≡-3.259
aukee-3.254
Token33
Token ID2091
Feature activation-5.46e-01

Pos logit contributions
iris+0.594
thens+0.539
rs+0.520
ements+0.517
agra+0.510
Neg logit contributions
)].-0.647
 Direction-0.610
 alike-0.580
 ≡-0.569
aukee-0.560
Token95
Token ID3865
Feature activation-5.20e-01

Pos logit contributions
iris+2.538
rs+2.249
thens+2.237
download+2.159
ements+2.125
Neg logit contributions
)].-4.207
 Direction-4.107
aukee-3.920
 alike-3.903
 ≡-3.874
Tokenaj
Token ID1228
Feature activation+2.68e-01

Pos logit contributions
iris+0.026
thens+0.024
download+0.023
rs+0.023
ements+0.023
Neg logit contributions
)].-0.029
 Direction-0.027
aukee-0.026
 alike-0.026
 ≡-0.025
Tokens
Token ID82
Feature activation+1.14e-01

Pos logit contributions
)].+1.826
 Direction+1.719
aukee+1.611
 alike+1.606
 ≡+1.604
Neg logit contributions
iris-1.499
thens-1.381
ements-1.339
 Wolver-1.330
download-1.307
Token 41
Token ID6073
Feature activation-1.54e-01

Pos logit contributions
)].+0.730
 Direction+0.688
 alike+0.642
 ≡+0.642
aukee+0.639
Neg logit contributions
iris-0.698
thens-0.656
ements-0.628
download-0.619
rs-0.618
Token),
Token ID828
Feature activation-2.16e-02

Pos logit contributions
iris+0.938
thens+0.846
rs+0.836
ements+0.814
download+0.808
Neg logit contributions
)].-0.995
 Direction-0.957
 alike-0.903
 ≡-0.892
aukee-0.892
Token K
Token ID509
Feature activation-1.42e-01

Pos logit contributions
iris+0.141
thens+0.129
rs+0.124
ements+0.124
agra+0.123
Neg logit contributions
)].-0.132
 Direction-0.127
 alike-0.118
 ≡-0.116
aukee-0.115
Tokenond
Token ID623
Feature activation-1.69e+00

Pos logit contributions
iris+0.885
thens+0.806
rs+0.782
ements+0.773
download+0.771
Neg logit contributions
)].-0.901
 Direction-0.866
 alike-0.812
aukee-0.805
 ≡-0.800
Tokenrl
Token ID45895
Feature activation-3.09e-01

Pos logit contributions
iris+9.286
rs+8.883
rent+8.175
thens+7.970
agra+7.969
Neg logit contributions
)].-10.369
 Direction-10.185
 alike-9.511
aukee-9.385
PDATE-9.340
Tokení
Token ID8836
Feature activation-1.31e-01

Pos logit contributions
iris+1.940
rs+1.738
thens+1.728
agra+1.672
ements+1.670
Neg logit contributions
)].-1.950
 Direction-1.872
 alike-1.762
 ≡-1.740
aukee-1.730
Tokenk
Token ID74
Feature activation+3.96e-02

Pos logit contributions
iris+0.769
thens+0.694
rs+0.693
agra+0.661
download+0.661
Neg logit contributions
)].-0.877
 Direction-0.843
 alike-0.797
 ≡-0.789
aukee-0.783
Token,
Token ID11
Feature activation-1.27e-01

Pos logit contributions
)].+0.259
 Direction+0.249
 alike+0.235
 ≡+0.232
aukee+0.231
Neg logit contributions
iris-0.240
thens-0.217
rs-0.212
ements-0.209
agra-0.207
Token
Token ID25370
Feature activation-3.95e-01

Pos logit contributions
iris+0.831
thens+0.758
rs+0.736
ements+0.728
download+0.724
Neg logit contributions
)].-0.772
 Direction-0.744
 alike-0.690
 ≡-0.680
aukee-0.677
Tokenley
Token ID1636
Feature activation-3.52e-01

Pos logit contributions
iris+2.005
thens+1.815
rs+1.786
ements+1.746
agra+1.736
Neg logit contributions
)].-2.198
 Direction-2.099
 alike-2.001
aukee-1.973
 ≡-1.956
Tokenunder
Token ID4625
Feature activation-1.89e-01

Pos logit contributions
iris+2.237
thens+2.029
rs+2.000
ements+1.957
agra+1.942
Neg logit contributions
)].-2.193
 Direction-2.102
 alike-1.979
 ≡-1.952
aukee-1.949
Tokenground
Token ID2833
Feature activation-5.49e-01

Pos logit contributions
iris+1.674
thens+1.569
rs+1.545
ements+1.531
agra+1.525
Neg logit contributions
)].-0.698
 Direction-0.649
 alike-0.584
 ≡-0.565
aukee-0.556
Token.
Token ID13
Feature activation-1.05e+00

Pos logit contributions
iris+3.149
rs+2.847
download+2.802
thens+2.778
rent+2.747
Neg logit contributions
)].-3.774
 Direction-3.610
aukee-3.406
 ≡-3.382
 alike-3.314
Tokencom
Token ID785
Feature activation-6.97e-01

Pos logit contributions
iris+5.059
rs+4.434
thens+4.322
download+4.264
rent+4.166
Neg logit contributions
)].-8.135
 Direction-7.788
 ≡-7.419
 alike-7.380
aukee-7.373
Token/
Token ID14
Feature activation-1.43e+00

Pos logit contributions
iris+2.709
rs+2.384
download+2.350
thens+2.270
agra+2.197
Neg logit contributions
)].-5.895
 Direction-5.637
aukee-5.389
 alike-5.387
 ≡-5.311
Token2016
Token ID5304
Feature activation-3.61e-01

Pos logit contributions
iris+8.925
download+8.196
rs+8.099
thens+7.843
rent+7.766
Neg logit contributions
)].-8.607
 Direction-8.094
 alike-7.618
aukee-7.586
 ≡-7.526
Token/
Token ID14
Feature activation-5.84e-01

Pos logit contributions
iris+1.471
rs+1.263
download+1.248
thens+1.212
rent+1.201
Neg logit contributions
)].-3.028
 Direction-2.941
 alike-2.820
aukee-2.815
 ≡-2.811
Token09
Token ID2931
Feature activation-3.56e-02

Pos logit contributions
iris+3.841
thens+3.495
rs+3.446
ements+3.376
agra+3.350
Neg logit contributions
)].-3.511
 Direction-3.360
 alike-3.155
 ≡-3.111
aukee-3.105
Token/
Token ID14
Feature activation-7.06e-01

Pos logit contributions
iris+0.207
thens+0.188
ffield+0.176
agra+0.174
 Wolver+0.174
Neg logit contributions
)].-0.255
 Direction-0.233
 alike-0.223
 mL-0.217
 ≡-0.211
Token08
Token ID2919
Feature activation-3.06e-01

Pos logit contributions
iris+4.159
thens+3.721
rs+3.720
download+3.602
ements+3.557
Neg logit contributions
)].-4.699
 Direction-4.539
 alike-4.272
 ≡-4.256
aukee-4.243
Token money
Token ID1637
Feature activation-8.13e-01

Pos logit contributions
iris+3.597
thens+3.273
rs+3.204
ements+3.148
download+3.136
Neg logit contributions
)].-3.380
 Direction-3.241
 alike-3.067
 ≡-3.002
aukee-2.979
Token for
Token ID329
Feature activation-7.09e-01

Pos logit contributions
iris+5.046
thens+4.548
rs+4.534
ements+4.409
agra+4.360
Neg logit contributions
)].-5.128
 Direction-4.935
 alike-4.608
aukee-4.598
 ≡-4.582
Token convenience
Token ID15607
Feature activation-5.53e-01

Pos logit contributions
iris+4.430
thens+4.020
rs+3.968
ements+3.877
agra+3.843
Neg logit contributions
)].-4.448
 Direction-4.270
 alike-4.002
aukee-3.971
 ≡-3.970
Token in
Token ID287
Feature activation-4.61e-01

Pos logit contributions
iris+3.348
thens+3.020
rs+2.982
ements+2.910
agra+2.883
Neg logit contributions
)].-3.590
 Direction-3.457
 alike-3.249
 ≡-3.218
aukee-3.217
Token making
Token ID1642
Feature activation-8.68e-01

Pos logit contributions
iris+3.019
thens+2.738
rs+2.691
ements+2.639
agra+2.622
Neg logit contributions
)].-2.817
 Direction-2.697
 alike-2.545
 ≡-2.501
aukee-2.490
Token exact
Token ID2748
Feature activation-1.43e+00

Pos logit contributions
iris+5.381
thens+4.878
rs+4.834
ements+4.735
agra+4.670
Neg logit contributions
)].-5.419
 Direction-5.196
aukee-4.865
 alike-4.848
 ≡-4.840
Token payments
Token ID7524
Feature activation-4.93e-01

Pos logit contributions
iris+8.092
rs+7.415
ements+7.319
thens+7.159
rent+7.117
Neg logit contributions
)].-9.005
 Direction-8.754
aukee-8.270
 ≡-8.113
 alike-7.986
Token".
Token ID1911
Feature activation+1.29e-01

Pos logit contributions
iris+3.146
thens+2.856
rs+2.800
ements+2.746
agra+2.736
Neg logit contributions
)].-3.085
 Direction-2.946
 alike-2.794
 ≡-2.741
aukee-2.729
Token He
Token ID679
Feature activation-1.29e-01

Pos logit contributions
)].+0.726
 Direction+0.714
 alike+0.646
 ≡+0.633
 mL+0.614
Neg logit contributions
iris-0.919
thens-0.846
agra-0.816
ements-0.810
ffield-0.809
Token observed
Token ID6515
Feature activation+1.99e-01

Pos logit contributions
iris+0.935
thens+0.860
download+0.839
rs+0.834
agra+0.827
Neg logit contributions
)].-0.690
 Direction-0.659
 alike-0.629
 ≡-0.601
aukee-0.588
Token that
Token ID326
Feature activation+1.11e-01

Pos logit contributions
)].+1.175
 Direction+1.136
 alike+1.083
 ≡+1.045
aukee+1.042
Neg logit contributions
iris-1.323
thens-1.203
rs-1.169
agra-1.162
ements-1.161
Tokenible
Token ID856
Feature activation-8.56e-01

Pos logit contributions
iris+3.676
rs+3.286
thens+3.248
ements+3.127
download+3.104
Neg logit contributions
)].-5.663
 Direction-5.520
aukee-5.252
 alike-5.244
 ≡-5.186
Token-
Token ID12
Feature activation-6.73e-01

Pos logit contributions
iris+3.818
rs+3.337
thens+3.270
download+3.177
ements+3.151
Neg logit contributions
)].-6.686
 Direction-6.478
aukee-6.264
 alike-6.255
 ≡-6.239
Tokenlimit
Token ID32374
Feature activation-1.06e+00

Pos logit contributions
iris+4.216
thens+3.809
rs+3.773
ements+3.673
download+3.663
Neg logit contributions
)].-4.255
 Direction-4.076
 alike-3.848
 ≡-3.804
aukee-3.793
Token-
Token ID12
Feature activation-6.05e-01

Pos logit contributions
iris+4.931
rs+4.452
thens+4.368
ements+4.285
rent+4.183
Neg logit contributions
)].-7.952
 Direction-7.710
aukee-7.470
 alike-7.367
 ≡-7.361
Tokentr
Token ID2213
Feature activation+4.11e-03

Pos logit contributions
iris+3.536
thens+3.177
rs+3.130
ements+3.053
download+3.028
Neg logit contributions
)].-4.077
 Direction-3.920
 alike-3.709
 ≡-3.666
aukee-3.659
Tokenading
Token ID4980
Feature activation-1.43e+00

Pos logit contributions
)].+0.026
 Direction+0.024
 alike+0.023
 ≡+0.022
aukee+0.022
Neg logit contributions
iris-0.026
thens-0.024
rs-0.023
ements-0.023
 Wolver-0.023
Token-
Token ID12
Feature activation-7.57e-01

Pos logit contributions
iris+6.914
rs+6.364
thens+6.189
rent+6.076
ements+6.059
Neg logit contributions
)].-9.866
 Direction-9.519
aukee-9.437
 alike-9.184
 ≡-9.161
Tokensub
Token ID7266
Feature activation-3.20e-01

Pos logit contributions
iris+4.548
thens+4.103
rs+4.076
ements+3.952
download+3.942
Neg logit contributions
)].-4.950
 Direction-4.741
 alike-4.502
 ≡-4.456
aukee-4.440
Tokens
Token ID82
Feature activation-5.07e-01

Pos logit contributions
iris+1.944
thens+1.753
rs+1.721
ements+1.689
agra+1.674
Neg logit contributions
)].-2.091
 Direction-2.006
 alike-1.894
 ≡-1.867
aukee-1.860
Tokenidy
Token ID19325
Feature activation-8.44e-01

Pos logit contributions
iris+2.887
thens+2.589
rs+2.544
ements+2.484
agra+2.462
Neg logit contributions
)].-3.489
 Direction-3.357
 alike-3.179
 ≡-3.142
aukee-3.134
Token-
Token ID12
Feature activation-5.98e-01

Pos logit contributions
iris+4.453
rs+3.997
thens+3.993
ements+3.855
download+3.839
Neg logit contributions
)].-5.925
 Direction-5.733
aukee-5.447
 alike-5.436
 ≡-5.401
Token there
Token ID612
Feature activation-9.92e-01

Pos logit contributions
iris+2.562
thens+2.382
rs+2.333
download+2.304
ements+2.300
Neg logit contributions
)].-1.687
 Direction-1.604
 alike-1.534
 ≡-1.449
aukee-1.438
Token silently
Token ID24595
Feature activation-6.88e-01

Pos logit contributions
iris+5.895
rs+5.239
thens+5.203
ements+5.043
agra+5.033
Neg logit contributions
)].-6.526
 Direction-6.266
 ≡-5.864
aukee-5.862
 alike-5.737
Token for
Token ID329
Feature activation-6.36e-01

Pos logit contributions
iris+4.033
thens+3.615
rs+3.573
ements+3.481
agra+3.453
Neg logit contributions
)].-4.621
 Direction-4.444
 alike-4.180
 ≡-4.154
aukee-4.150
Token almost
Token ID2048
Feature activation-8.21e-01

Pos logit contributions
iris+4.204
thens+3.828
rs+3.764
ements+3.696
download+3.669
Neg logit contributions
)].-3.814
 Direction-3.651
 alike-3.431
 ≡-3.376
aukee-3.364
Token an
Token ID281
Feature activation-6.05e-01

Pos logit contributions
iris+5.638
thens+5.154
rs+5.099
ements+4.989
agra+4.953
Neg logit contributions
)].-4.656
 Direction-4.446
 alike-4.142
aukee-4.116
 ≡-4.103
Token hour
Token ID1711
Feature activation-1.43e+00

Pos logit contributions
iris+3.961
thens+3.598
rs+3.552
ements+3.471
agra+3.451
Neg logit contributions
)].-3.649
 Direction-3.495
 alike-3.274
 ≡-3.244
aukee-3.240
Token.
Token ID13
Feature activation-4.99e-01

Pos logit contributions
iris+7.684
rs+6.934
thens+6.743
ements+6.501
download+6.438
Neg logit contributions
)].-9.614
 Direction-9.472
 ≡-8.840
aukee-8.773
PDATE-8.521
Token Then
Token ID3244
Feature activation-5.84e-01

Pos logit contributions
iris+3.082
thens+2.788
rs+2.727
ements+2.684
agra+2.663
Neg logit contributions
)].-3.224
 Direction-3.110
 alike-2.917
 ≡-2.880
aukee-2.857
Token he
Token ID339
Feature activation-2.95e-01

Pos logit contributions
iris+3.639
thens+3.294
rs+3.243
ements+3.176
agra+3.150
Neg logit contributions
)].-3.718
 Direction-3.567
 alike-3.363
 ≡-3.317
aukee-3.309
Token opened
Token ID4721
Feature activation-1.64e-01

Pos logit contributions
iris+1.978
thens+1.820
download+1.757
rs+1.750
ements+1.744
Neg logit contributions
)].-1.761
 Direction-1.672
 alike-1.599
 ≡-1.542
aukee-1.517
Token his
Token ID465
Feature activation-1.53e-01

Pos logit contributions
iris+0.964
thens+0.867
rs+0.848
ements+0.834
agra+0.829
Neg logit contributions
)].-1.108
 Direction-1.066
 alike-1.011
 ≡-0.991
aukee-0.988
Token and
Token ID290
Feature activation-4.16e-01

Pos logit contributions
iris+4.007
thens+3.680
rs+3.608
ements+3.565
download+3.531
Neg logit contributions
)].-3.175
 Direction-3.018
 alike-2.841
 ≡-2.780
aukee-2.770
Token floors
Token ID18570
Feature activation+1.52e-01

Pos logit contributions
iris+2.800
thens+2.533
rs+2.483
download+2.453
ements+2.452
Neg logit contributions
)].-2.472
 Direction-2.363
 alike-2.231
 ≡-2.159
aukee-2.157
Token me
Token ID502
Feature activation-3.85e-01

Pos logit contributions
)].+0.831
 Direction+0.793
 alike+0.762
 ≡+0.724
aukee+0.712
Neg logit contributions
iris-1.074
thens-0.986
ements-0.959
rs-0.958
agra-0.955
Token and
Token ID290
Feature activation-2.81e-01

Pos logit contributions
iris+2.546
thens+2.323
rs+2.281
ements+2.247
agra+2.227
Neg logit contributions
)].-2.312
 Direction-2.202
 alike-2.090
 ≡-2.042
aukee-2.036
Token makes
Token ID1838
Feature activation-7.39e-01

Pos logit contributions
iris+1.779
thens+1.600
rs+1.567
download+1.551
ements+1.541
Neg logit contributions
)].-1.776
 Direction-1.698
 alike-1.630
aukee-1.562
 ≡-1.561
Token me
Token ID502
Feature activation-1.42e+00

Pos logit contributions
iris+4.762
thens+4.317
rs+4.282
ements+4.181
download+4.150
Neg logit contributions
)].-4.488
 Direction-4.294
 alike-4.010
 ≡-3.993
aukee-3.987
Token feel
Token ID1254
Feature activation-4.39e-01

Pos logit contributions
iris+7.883
rs+7.042
thens+6.780
ements+6.752
agra+6.720
Neg logit contributions
)].-9.227
 Direction-9.013
 ≡-8.538
aukee-8.369
PDATE-8.281
Token connected
Token ID5884
Feature activation-2.10e-01

Pos logit contributions
iris+2.805
thens+2.560
rs+2.487
agra+2.448
ements+2.446
Neg logit contributions
)].-2.745
 Direction-2.643
 alike-2.531
 ≡-2.442
aukee-2.433
Token to
Token ID284
Feature activation+3.89e-02

Pos logit contributions
iris+1.284
thens+1.168
rs+1.132
ements+1.121
agra+1.115
Neg logit contributions
)].-1.376
 Direction-1.305
 alike-1.256
 ≡-1.221
aukee-1.218
Token my
Token ID616
Feature activation+3.98e-02

Pos logit contributions
)].+0.224
 Direction+0.214
 alike+0.202
 ≡+0.196
aukee+0.192
Neg logit contributions
iris-0.269
thens-0.248
agra-0.239
ements-0.239
rs-0.238
Token fellow
Token ID5891
Feature activation+1.58e-01

Pos logit contributions
)].+0.227
 Direction+0.219
 alike+0.204
aukee+0.197
 ≡+0.195
Neg logit contributions
iris-0.277
thens-0.257
agra-0.245
ements-0.245
rs-0.244
Token change
Token ID1487
Feature activation-3.18e-01

Pos logit contributions
iris+0.898
thens+0.820
rs+0.804
agra+0.783
ements+0.783
Neg logit contributions
)].-0.873
 Direction-0.849
 alike-0.805
 ≡-0.787
aukee-0.777
Token depending
Token ID6906
Feature activation-3.59e-01

Pos logit contributions
iris+2.027
thens+1.843
rs+1.800
agra+1.766
ements+1.761
Neg logit contributions
)].-1.977
 Direction-1.913
 alike-1.819
 ≡-1.772
aukee-1.739
Token on
Token ID319
Feature activation+1.23e-01

Pos logit contributions
iris+2.114
rs+1.922
thens+1.898
ements+1.874
rent+1.850
Neg logit contributions
)].-2.410
 Direction-2.266
aukee-2.184
 ≡-2.118
 alike-2.104
Token who
Token ID508
Feature activation+2.96e-01

Pos logit contributions
)].+0.737
 Direction+0.735
 alike+0.689
 ≡+0.664
 mL+0.660
Neg logit contributions
iris-0.805
thens-0.739
rs-0.709
agra-0.705
ements-0.703
Token's
Token ID338
Feature activation-8.91e-02

Pos logit contributions
)].+1.736
 Direction+1.727
 alike+1.634
 ≡+1.583
 interviewer+1.541
Neg logit contributions
iris-1.866
thens-1.711
agra-1.667
rs-1.651
ements-1.625
Token making
Token ID1642
Feature activation-1.40e+00

Pos logit contributions
iris+0.596
thens+0.553
rs+0.526
ements+0.524
agra+0.522
Neg logit contributions
)].-0.520
 Direction-0.509
 alike-0.487
 ≡-0.465
aukee-0.457
Token them
Token ID606
Feature activation-9.63e-01

Pos logit contributions
iris+7.407
rs+6.499
thens+6.471
ements+6.348
download+6.326
Neg logit contributions
)].-9.850
 Direction-9.345
aukee-9.062
 ≡-8.745
 alike-8.731
Token,
Token ID11
Feature activation-1.05e-01

Pos logit contributions
iris+5.700
rs+5.150
thens+5.147
ements+5.003
download+4.961
Neg logit contributions
)].-6.199
 Direction-5.862
aukee-5.597
 alike-5.511
 ≡-5.458
Token but
Token ID475
Feature activation-1.50e-01

Pos logit contributions
iris+0.679
thens+0.613
rs+0.598
agra+0.591
ements+0.588
Neg logit contributions
)].-0.638
 Direction-0.627
 alike-0.594
 ≡-0.573
 mL-0.557
Token justice
Token ID5316
Feature activation-4.28e-01

Pos logit contributions
iris+1.026
thens+0.934
rs+0.905
agra+0.905
ements+0.895
Neg logit contributions
)].-0.869
 Direction-0.848
 alike-0.806
 ≡-0.768
 mL-0.752
Token is
Token ID318
Feature activation-2.17e-01

Pos logit contributions
iris+2.719
thens+2.469
rs+2.407
ements+2.357
download+2.356
Neg logit contributions
)].-2.703
 Direction-2.585
 alike-2.451
 ≡-2.420
aukee-2.384
Token k
Token ID479
Feature activation-3.30e-01

Pos logit contributions
iris+3.119
thens+2.840
rs+2.800
ements+2.743
agra+2.723
Neg logit contributions
)].-2.829
 Direction-2.707
 alike-2.540
 ≡-2.505
aukee-2.499
Token]
Token ID60
Feature activation-4.83e-01

Pos logit contributions
iris+1.898
thens+1.703
rs+1.682
ements+1.631
download+1.618
Neg logit contributions
)].-2.246
 Direction-2.169
 alike-2.055
aukee-2.036
 ≡-2.025
Token}}
Token ID11709
Feature activation-5.38e-01

Pos logit contributions
iris+2.913
thens+2.630
rs+2.604
ements+2.535
download+2.505
Neg logit contributions
)].-3.139
 Direction-3.017
 alike-2.854
aukee-2.822
 ≡-2.808
Token for
Token ID329
Feature activation+2.27e-01

Pos logit contributions
iris+3.425
thens+3.105
rs+3.063
ements+2.996
agra+2.971
Neg logit contributions
)].-3.344
 Direction-3.206
 alike-3.018
 ≡-2.977
aukee-2.973
Token k
Token ID479
Feature activation-2.78e-01

Pos logit contributions
)].+1.287
 Direction+1.245
 alike+1.150
 ≡+1.134
aukee+1.108
Neg logit contributions
iris-1.546
thens-1.431
rs-1.396
agra-1.386
ements-1.382
Token=
Token ID28
Feature activation-1.40e+00

Pos logit contributions
iris+1.863
thens+1.699
rs+1.670
ements+1.642
agra+1.636
Neg logit contributions
)].-1.644
 Direction-1.569
 alike-1.472
 ≡-1.455
aukee-1.445
Token1
Token ID16
Feature activation-7.02e-01

Pos logit contributions
iris+9.246
rs+8.466
thens+8.261
download+8.028
rent+7.877
Neg logit contributions
)].-7.621
 Direction-7.382
 alike-7.062
aukee-7.034
 ≡-6.946
Token,
Token ID11
Feature activation-3.86e-01

Pos logit contributions
iris+4.006
thens+3.579
rs+3.535
ements+3.428
download+3.412
Neg logit contributions
)].-4.776
 Direction-4.612
 alike-4.373
aukee-4.333
 ≡-4.311
Token2
Token ID17
Feature activation-4.66e-01

Pos logit contributions
iris+2.668
thens+2.448
rs+2.398
ements+2.370
agra+2.363
Neg logit contributions
)].-2.202
 Direction-2.113
 alike-1.968
 ≡-1.942
aukee-1.924
Token,
Token ID11
Feature activation+3.66e-02

Pos logit contributions
iris+2.728
thens+2.450
rs+2.417
ements+2.353
download+2.335
Neg logit contributions
)].-3.120
 Direction-3.005
 alike-2.840
aukee-2.811
 ≡-2.804
Token..
Token ID492
Feature activation-3.79e-01

Pos logit contributions
)].+0.193
 Direction+0.183
 alike+0.169
 ≡+0.167
aukee+0.165
Neg logit contributions
iris-0.267
thens-0.247
ements-0.241
agra-0.240
 Wolver-0.240
Token
Token ID447
Feature activation+2.59e-03

Pos logit contributions
)].+0.983
 Direction+0.865
 alike+0.825
 mL+0.804
 ≡+0.769
Neg logit contributions
iris-1.123
thens-1.031
ffield-1.007
agra-0.991
 Wolver-0.975
Token
Token ID247
Feature activation-8.83e-02

Pos logit contributions
)].+0.012
 Direction+0.012
 alike+0.011
aukee+0.010
 ≡+0.010
Neg logit contributions
iris-0.021
thens-0.019
agra-0.018
ements-0.018
 Wolver-0.018
Tokent
Token ID83
Feature activation-2.29e-01

Pos logit contributions
iris+0.619
thens+0.549
agra+0.546
 Wolver+0.545
ffield+0.535
Neg logit contributions
)].-0.512
 Direction-0.467
 alike-0.444
aukee-0.437
 mL-0.427
Token make
Token ID787
Feature activation-9.07e-01

Pos logit contributions
iris+1.555
thens+1.430
rs+1.386
agra+1.370
ements+1.364
Neg logit contributions
)].-1.335
 Direction-1.271
 alike-1.216
 ≡-1.178
aukee-1.168
Token the
Token ID262
Feature activation-6.67e-01

Pos logit contributions
iris+5.693
thens+5.151
rs+5.115
ements+5.006
agra+4.953
Neg logit contributions
)].-5.640
 Direction-5.369
 alike-5.026
aukee-5.009
 ≡-4.996
Token same
Token ID976
Feature activation-1.39e+00

Pos logit contributions
iris+4.505
thens+4.118
rs+4.051
ements+3.967
agra+3.943
Neg logit contributions
)].-3.911
 Direction-3.743
 alike-3.508
 ≡-3.454
aukee-3.448
Token mistake
Token ID7457
Feature activation-2.98e-01

Pos logit contributions
iris+7.493
rs+6.681
ements+6.590
 Wolver+6.460
thens+6.453
Neg logit contributions
)].-9.232
 Direction-8.831
 ≡-8.418
 alike-8.362
aukee-8.302
Token.
Token ID13
Feature activation+7.18e-02

Pos logit contributions
iris+1.955
thens+1.781
rs+1.740
ements+1.712
agra+1.705
Neg logit contributions
)].-1.806
 Direction-1.738
 alike-1.636
 ≡-1.603
aukee-1.598
Token The
Token ID383
Feature activation+6.08e-04

Pos logit contributions
)].+0.434
 Direction+0.428
 ≡+0.388
 alike+0.388
 mL+0.374
Neg logit contributions
iris-0.480
thens-0.440
ffield-0.422
agra-0.419
rs-0.417
Token high
Token ID1029
Feature activation+1.38e-01

Pos logit contributions
)].+0.003
 Direction+0.003
 alike+0.003
 ≡+0.003
aukee+0.003
Neg logit contributions
iris-0.004
thens-0.004
rs-0.004
ements-0.004
download-0.004
Token point
Token ID966
Feature activation-4.69e-01

Pos logit contributions
)].+0.906
 Direction+0.889
 alike+0.823
aukee+0.822
 ≡+0.812
Neg logit contributions
iris-0.842
thens-0.769
rs-0.725
agra-0.718
ements-0.717
Token ripping
Token ID34759
Feature activation-2.55e-01

Pos logit contributions
iris+3.022
thens+2.727
rs+2.651
ements+2.625
agra+2.609
Neg logit contributions
)].-3.049
 Direction-2.920
 alike-2.788
 ≡-2.703
aukee-2.690
Token them
Token ID606
Feature activation+2.21e-01

Pos logit contributions
iris+2.228
thens+2.070
rs+2.036
ements+2.018
agra+2.011
Neg logit contributions
)].-0.999
 Direction-0.934
 alike-0.857
 ≡-0.819
aukee-0.813
Token off
Token ID572
Feature activation-9.09e-02

Pos logit contributions
)].+0.527
 Direction+0.437
 alike+0.407
 ≡+0.348
aukee+0.338
Neg logit contributions
iris-2.283
thens-2.141
ements-2.108
agra-2.086
download-2.077
Token.
Token ID13
Feature activation-1.23e-01

Pos logit contributions
iris+0.586
thens+0.533
rs+0.521
ements+0.513
agra+0.511
Neg logit contributions
)].-0.559
 Direction-0.534
 alike-0.509
 ≡-0.497
aukee-0.495
Token The
Token ID383
Feature activation-5.06e-01

Pos logit contributions
iris+0.899
thens+0.824
ements+0.798
ffield+0.797
agra+0.795
Neg logit contributions
)].-0.666
 Direction-0.646
 alike-0.585
 ≡-0.579
aukee-0.555
Token information
Token ID1321
Feature activation-1.39e+00

Pos logit contributions
iris+3.489
thens+3.175
rs+3.118
ements+3.073
agra+3.057
Neg logit contributions
)].-2.944
 Direction-2.808
 alike-2.613
 ≡-2.572
aukee-2.566
Token in
Token ID287
Feature activation-6.94e-01

Pos logit contributions
iris+8.343
rs+7.632
thens+7.616
ements+7.323
download+7.304
Neg logit contributions
)].-8.614
 Direction-8.236
aukee-7.755
 ≡-7.739
 alike-7.678
Token the
Token ID262
Feature activation-4.84e-01

Pos logit contributions
iris+4.134
thens+3.726
rs+3.692
download+3.599
ements+3.597
Neg logit contributions
)].-4.511
 Direction-4.343
 alike-4.106
 ≡-4.056
aukee-4.047
Token center
Token ID3641
Feature activation-5.51e-01

Pos logit contributions
iris+3.326
thens+3.043
rs+2.980
ements+2.937
agra+2.922
Neg logit contributions
)].-2.803
 Direction-2.679
 alike-2.502
 ≡-2.460
aukee-2.454
Token
Token ID447
Feature activation+2.02e-01

Pos logit contributions
iris+3.342
thens+3.011
rs+2.934
ements+2.884
agra+2.876
Neg logit contributions
)].-3.640
 Direction-3.489
 alike-3.327
 ≡-3.252
aukee-3.231
Token
Token ID247
Feature activation-7.53e-02

Pos logit contributions
)].+0.991
 Direction+0.934
 alike+0.868
 ≡+0.855
aukee+0.848
Neg logit contributions
iris-1.556
thens-1.424
rs-1.407
ements-1.393
agra-1.388
Tokena
Token ID64
Feature activation+3.84e-01

Pos logit contributions
iris+0.128
thens+0.119
 Wolver+0.115
ements+0.113
ffield+0.109
Neg logit contributions
)].-0.131
 Direction-0.120
 alike-0.115
aukee-0.114
 ≡-0.110
Token.
Token ID13
Feature activation-2.65e-01

Pos logit contributions
)].+1.962
 Direction+1.721
 alike+1.647
 mL+1.575
opez+1.484
Neg logit contributions
iris-2.900
thens-2.739
ffield-2.637
 Wolver-2.622
agra-2.583
Tokendot
Token ID26518
Feature activation+5.63e-01

Pos logit contributions
iris+1.925
rs+1.773
thens+1.770
download+1.724
agra+1.703
Neg logit contributions
)].-1.414
 Direction-1.336
 alike-1.252
 ≡-1.243
aukee-1.230
Token.
Token ID13
Feature activation-3.17e-01

Pos logit contributions
)].+2.827
 Direction+2.502
 alike+2.336
 mL+2.258
opez+2.029
Neg logit contributions
iris-4.322
thens-4.126
ffield-4.007
agra-3.871
 Wolver-3.843
Tokengov
Token ID9567
Feature activation-5.39e-01

Pos logit contributions
iris+1.944
thens+1.783
agra+1.687
ffield+1.681
 Wolver+1.680
Neg logit contributions
)].-2.093
 Direction-1.989
 alike-1.858
aukee-1.823
 ≡-1.795
Token/
Token ID14
Feature activation-1.39e+00

Pos logit contributions
iris+2.779
thens+2.467
rs+2.350
ements+2.330
agra+2.310
Neg logit contributions
)].-4.107
 Direction-3.934
 alike-3.725
 ≡-3.657
aukee-3.652
Tokenpeople
Token ID15332
Feature activation-3.90e-01

Pos logit contributions
iris+7.029
rs+6.375
download+6.316
thens+6.048
rent+5.946
Neg logit contributions
)].-9.733
 Direction-9.415
 alike-9.028
 ≡-8.919
aukee-8.749
Token/
Token ID14
Feature activation-7.97e-01

Pos logit contributions
iris+1.969
thens+1.738
rs+1.708
ements+1.659
download+1.643
Neg logit contributions
)].-2.934
 Direction-2.834
 alike-2.696
 ≡-2.668
aukee-2.662
Tokenin
Token ID259
Feature activation+1.51e-01

Pos logit contributions
iris+4.822
rs+4.325
thens+4.309
download+4.197
ements+4.146
Neg logit contributions
)].-5.177
 Direction-4.975
 alike-4.695
 ≡-4.657
aukee-4.598
Tokenjury
Token ID21871
Feature activation-8.99e-01

Pos logit contributions
)].+1.207
 Direction+1.159
 alike+1.110
aukee+1.105
 ≡+1.090
Neg logit contributions
iris-0.678
thens-0.593
 Wolver-0.563
ements-0.549
rs-0.546
Token/
Token ID14
Feature activation-1.11e+00

Pos logit contributions
iris+4.662
rs+4.200
thens+4.156
download+4.064
ements+3.986
Neg logit contributions
)].-6.463
 Direction-6.283
 ≡-5.937
 alike-5.923
aukee-5.868
Token
Token ID784
Feature activation-4.68e-01

Pos logit contributions
iris+3.214
thens+2.892
rs+2.855
ements+2.787
download+2.762
Neg logit contributions
)].-3.512
 Direction-3.373
 alike-3.178
 ≡-3.146
aukee-3.143
Token namely
Token ID14811
Feature activation-3.78e-02

Pos logit contributions
iris+3.074
thens+2.804
rs+2.746
ements+2.690
agra+2.684
Neg logit contributions
)].-2.833
 Direction-2.725
 alike-2.574
 ≡-2.532
aukee-2.499
Token Tony
Token ID8832
Feature activation-5.24e-01

Pos logit contributions
iris+0.236
thens+0.215
rs+0.209
agra+0.208
download+0.206
Neg logit contributions
)].-0.241
 Direction-0.232
 alike-0.221
 ≡-0.217
aukee-0.211
Token Ferguson
Token ID15031
Feature activation-2.27e-01

Pos logit contributions
iris+1.780
thens+1.463
rs+1.424
ements+1.359
download+1.339
Neg logit contributions
)].-4.805
 Direction-4.666
 alike-4.484
aukee-4.454
 ≡-4.449
Token versus
Token ID9051
Feature activation-8.30e-01

Pos logit contributions
iris+1.359
thens+1.231
rs+1.204
ements+1.181
agra+1.172
Neg logit contributions
)].-1.496
 Direction-1.439
 alike-1.365
 ≡-1.350
aukee-1.338
Token Kh
Token ID5311
Feature activation-1.38e+00

Pos logit contributions
iris+5.026
thens+4.533
rs+4.485
ements+4.379
download+4.340
Neg logit contributions
)].-5.328
 Direction-5.098
 alike-4.813
aukee-4.798
 ≡-4.755
Tokenab
Token ID397
Feature activation-8.50e-01

Pos logit contributions
iris+7.918
rs+7.127
thens+6.909
rent+6.855
agra+6.756
Neg logit contributions
)].-9.182
 Direction-9.069
 ≡-8.550
aukee-8.474
 alike-8.462
Tokenib
Token ID571
Feature activation-5.33e-01

Pos logit contributions
iris+4.452
rs+3.986
thens+3.741
rent+3.719
agra+3.690
Neg logit contributions
)].-6.211
 Direction-6.051
 ≡-5.793
aukee-5.771
 alike-5.659
Token Nur
Token ID49281
Feature activation-4.23e-02

Pos logit contributions
iris+3.082
thens+2.733
rs+2.721
ements+2.624
download+2.621
Neg logit contributions
)].-3.621
 Direction-3.491
 alike-3.298
aukee-3.280
 ≡-3.264
Tokenmag
Token ID19726
Feature activation+8.07e-02

Pos logit contributions
iris+0.329
thens+0.309
ements+0.302
ffield+0.298
agra+0.297
Neg logit contributions
)].-0.199
 Direction-0.189
 alike-0.177
 ≡-0.175
opez-0.163
Tokenomed
Token ID12657
Feature activation+1.98e-01

Pos logit contributions
)].+0.577
 Direction+0.539
 alike+0.522
 ≡+0.508
aukee+0.503
Neg logit contributions
iris-0.420
thens-0.393
ements-0.383
agra-0.374
ffield-0.373
Token
Token ID960
Feature activation-3.45e-01

Pos logit contributions
)].+1.049
 Direction+1.012
 alike+0.957
 ≡+0.937
aukee+0.923
Neg logit contributions
iris-1.070
thens-0.972
rs-0.948
ements-0.940
agra-0.936
Tokenthat
Token ID5562
Feature activation-2.11e-01

Pos logit contributions
iris+2.354
thens+2.152
rs+2.118
ements+2.084
agra+2.075
Neg logit contributions
)].-1.982
 Direction-1.895
 alike-1.773
 ≡-1.739
aukee-1.739
Token
Token ID564
Feature activation-1.87e-01

Pos logit contributions
iris+1.381
thens+1.257
rs+1.219
agra+1.215
ements+1.208
Neg logit contributions
)].-1.292
 Direction-1.244
 alike-1.184
 ≡-1.150
aukee-1.117
Token
Token ID250
Feature activation-2.58e-01

Pos logit contributions
iris+1.340
thens+1.228
rs+1.211
ements+1.191
agra+1.184
Neg logit contributions
)].-1.015
 Direction-0.967
 alike-0.902
 ≡-0.889
aukee-0.884
Tokenfirst
Token ID11085
Feature activation-6.02e-01

Pos logit contributions
iris+1.590
thens+1.433
agra+1.410
rs+1.401
ements+1.393
Neg logit contributions
)].-1.653
 Direction-1.577
 alike-1.495
 ≡-1.459
aukee-1.458
Token became
Token ID2627
Feature activation-1.38e+00

Pos logit contributions
iris+4.035
thens+3.682
rs+3.620
ements+3.551
agra+3.537
Neg logit contributions
)].-3.559
 Direction-3.405
 alike-3.200
 ≡-3.143
aukee-3.128
Token aware
Token ID3910
Feature activation-4.69e-01

Pos logit contributions
iris+6.933
rs+6.196
thens+6.017
agra+5.848
ements+5.817
Neg logit contributions
)].-9.832
 Direction-9.472
aukee-9.111
 ≡-8.912
 alike-8.853
Token in
Token ID287
Feature activation-3.42e-01

Pos logit contributions
iris+2.810
thens+2.555
rs+2.554
ements+2.461
download+2.443
Neg logit contributions
)].-3.040
 Direction-2.903
aukee-2.746
 ≡-2.713
 alike-2.712
Token late
Token ID2739
Feature activation+1.91e-01

Pos logit contributions
iris+2.311
thens+2.087
rs+2.042
agra+2.013
ements+2.003
Neg logit contributions
)].-2.039
 Direction-1.966
 alike-1.839
 ≡-1.812
aukee-1.779
Token 2015
Token ID1853
Feature activation-2.42e-02

Pos logit contributions
)].+1.292
 Direction+1.247
 alike+1.200
 ≡+1.147
 mL+1.114
Neg logit contributions
iris-1.136
thens-1.009
agra-1.001
ffield-0.972
rent-0.947
Token of
Token ID286
Feature activation+2.04e-02

Pos logit contributions
iris+0.161
thens+0.146
rs+0.143
agra+0.141
ements+0.141
Neg logit contributions
)].-0.144
 Direction-0.138
 alike-0.130
 ≡-0.127
aukee-0.126
Tokenl
Token ID75
Feature activation-3.47e-01

Pos logit contributions
iris+4.627
rs+4.220
thens+4.158
download+3.991
ements+3.991
Neg logit contributions
)].-5.247
 Direction-4.995
 ≡-4.721
 alike-4.701
aukee-4.671
Tokeninson
Token ID7899
Feature activation-2.57e-01

Pos logit contributions
iris+2.608
thens+2.430
ements+2.365
rs+2.351
download+2.336
Neg logit contributions
)].-1.712
 Direction-1.675
 alike-1.568
aukee-1.496
 ≡-1.467
Token,
Token ID11
Feature activation-3.95e-01

Pos logit contributions
iris+1.398
thens+1.243
rs+1.238
ements+1.183
 Wolver+1.178
Neg logit contributions
)].-1.834
 Direction-1.757
 alike-1.665
aukee-1.658
 ≡-1.656
Token John
Token ID1757
Feature activation-6.30e-01

Pos logit contributions
iris+2.524
thens+2.290
rs+2.239
ements+2.207
agra+2.196
Neg logit contributions
)].-2.458
 Direction-2.384
 alike-2.243
 ≡-2.174
aukee-2.156
Token O
Token ID440
Feature activation-4.40e-01

Pos logit contributions
iris+3.959
rs+3.586
thens+3.560
 Wolver+3.446
ements+3.431
Neg logit contributions
)].-3.936
 Direction-3.736
 alike-3.557
 ≡-3.542
aukee-3.523
Token'
Token ID6
Feature activation-1.37e+00

Pos logit contributions
iris+2.881
thens+2.617
rs+2.588
ements+2.530
download+2.505
Neg logit contributions
)].-2.661
 Direction-2.543
 alike-2.381
 ≡-2.370
aukee-2.356
TokenK
Token ID42
Feature activation-7.05e-01

Pos logit contributions
iris+7.627
rs+7.093
thens+6.901
 Wolver+6.823
download+6.762
Neg logit contributions
)].-9.002
 Direction-8.519
aukee-8.354
 ≡-8.291
 alike-8.143
Tokenane
Token ID1531
Feature activation-8.30e-01

Pos logit contributions
iris+3.952
rs+3.526
thens+3.467
ements+3.378
agra+3.371
Neg logit contributions
)].-4.893
 Direction-4.724
 ≡-4.469
 alike-4.428
aukee-4.387
Token,
Token ID11
Feature activation-4.70e-01

Pos logit contributions
iris+4.273
rs+3.802
thens+3.711
 Wolver+3.585
rent+3.581
Neg logit contributions
)].-6.007
 Direction-5.717
aukee-5.458
 ≡-5.423
 alike-5.415
Token J
Token ID449
Feature activation-3.42e-01

Pos logit contributions
iris+2.989
thens+2.711
rs+2.657
ements+2.612
agra+2.597
Neg logit contributions
)].-2.941
 Direction-2.841
 alike-2.674
 ≡-2.608
aukee-2.592
Tokenovan
Token ID22590
Feature activation-6.90e-01

Pos logit contributions
iris+2.272
thens+2.083
rs+2.042
ements+2.010
download+1.998
Neg logit contributions
)].-2.024
 Direction-1.941
 alike-1.812
aukee-1.795
 ≡-1.778
Tokena
Token ID64
Feature activation-3.05e-01

Pos logit contributions
iris+3.189
thens+2.895
rs+2.791
ements+2.762
agra+2.737
Neg logit contributions
)].-3.586
 Direction-3.414
 alike-3.248
aukee-3.200
 ≡-3.183
Token.
Token ID13
Feature activation-7.95e-01

Pos logit contributions
iris+2.083
thens+1.924
rs+1.840
 Wolver+1.831
ements+1.827
Neg logit contributions
)].-1.792
 Direction-1.677
 alike-1.586
aukee-1.527
 ≡-1.511
Tokendot
Token ID26518
Feature activation-1.24e-01

Pos logit contributions
iris+4.476
rs+4.030
thens+4.004
download+3.869
ements+3.825
Neg logit contributions
)].-5.499
 Direction-5.263
 alike-4.996
 ≡-4.961
aukee-4.947
Token.
Token ID13
Feature activation-7.39e-01

Pos logit contributions
iris+0.849
thens+0.789
ffield+0.750
ements+0.747
rs+0.743
Neg logit contributions
)].-0.718
 Direction-0.675
 alike-0.631
aukee-0.607
 mL-0.606
Tokengov
Token ID9567
Feature activation-7.53e-01

Pos logit contributions
iris+4.138
thens+3.701
rs+3.629
ements+3.546
agra+3.510
Neg logit contributions
)].-5.175
 Direction-4.986
 alike-4.723
 ≡-4.665
aukee-4.657
Token/
Token ID14
Feature activation-1.37e+00

Pos logit contributions
iris+2.689
rs+2.404
download+2.394
thens+2.237
rent+2.144
Neg logit contributions
)].-6.424
 Direction-6.203
 alike-6.025
 ≡-5.993
aukee-5.868
Tokenpub
Token ID12984
Feature activation-5.91e-01

Pos logit contributions
iris+8.318
rs+7.536
download+7.444
thens+7.298
rent+7.063
Neg logit contributions
)].-8.467
 Direction-8.147
 alike-7.769
 ≡-7.718
aukee-7.458
Tokens
Token ID82
Feature activation-5.11e-01

Pos logit contributions
iris+2.822
rs+2.508
thens+2.406
download+2.405
rent+2.308
Neg logit contributions
)].-4.543
 Direction-4.476
 ≡-4.250
 alike-4.244
aukee-4.230
Token/
Token ID14
Feature activation-9.87e-01

Pos logit contributions
iris+2.045
rs+1.825
download+1.743
thens+1.682
rent+1.621
Neg logit contributions
)].-4.324
 Direction-4.278
 ≡-4.051
aukee-4.047
 alike-4.024
Token8
Token ID23
Feature activation-4.30e-01

Pos logit contributions
iris+6.030
rs+5.427
thens+5.370
download+5.289
ements+5.171
Neg logit contributions
)].-6.312
 Direction-6.079
 alike-5.751
 ≡-5.712
aukee-5.625
Token11
Token ID1157
Feature activation-4.04e-01

Pos logit contributions
iris+2.480
thens+2.229
rs+2.182
ements+2.141
agra+2.121
Neg logit contributions
)].-2.932
 Direction-2.817
 alike-2.667
 ≡-2.630
aukee-2.629
Token://
Token ID1378
Feature activation-8.53e-01

Pos logit contributions
iris+2.175
thens+1.950
 Wolver+1.835
agra+1.834
ffield+1.831
Neg logit contributions
)].-2.924
 Direction-2.723
 alike-2.625
aukee-2.549
 mL-2.539
Tokenwww
Token ID2503
Feature activation-4.25e-01

Pos logit contributions
iris+5.263
thens+4.708
rs+4.689
download+4.562
ements+4.533
Neg logit contributions
)].-5.478
 Direction-5.268
 alike-4.964
 ≡-4.938
aukee-4.832
Token.
Token ID13
Feature activation-9.54e-01

Pos logit contributions
iris+2.910
thens+2.676
agra+2.571
ements+2.570
rs+2.565
Neg logit contributions
)].-2.508
 Direction-2.329
 alike-2.201
aukee-2.112
 ≡-2.109
Tokeneng
Token ID1516
Feature activation-3.06e-01

Pos logit contributions
iris+6.552
rs+5.921
thens+5.907
download+5.728
ements+5.713
Neg logit contributions
)].-5.443
 Direction-5.238
 alike-4.894
 ≡-4.892
aukee-4.715
Tokenin
Token ID259
Feature activation-8.87e-01

Pos logit contributions
iris+1.746
thens+1.561
rs+1.525
agra+1.497
ements+1.495
Neg logit contributions
)].-2.091
 Direction-2.000
 alike-1.902
aukee-1.875
 ≡-1.868
Token.
Token ID13
Feature activation-1.37e+00

Pos logit contributions
iris+5.340
rs+4.801
thens+4.791
ements+4.645
download+4.612
Neg logit contributions
)].-5.823
 Direction-5.634
 alike-5.282
 ≡-5.219
aukee-5.158
Tokenum
Token ID388
Feature activation-6.24e-01

Pos logit contributions
iris+8.713
rs+7.997
thens+7.669
 Wolver+7.604
download+7.570
Neg logit contributions
)].-8.247
 Direction-7.863
 ≡-7.498
PDATE-7.386
 alike-7.355
Tokenich
Token ID488
Feature activation-1.70e-02

Pos logit contributions
iris+3.681
thens+3.309
rs+3.260
ements+3.183
agra+3.156
Neg logit contributions
)].-4.178
 Direction-4.017
 alike-3.797
 ≡-3.752
aukee-3.744
Token.
Token ID13
Feature activation-1.13e+00

Pos logit contributions
iris+0.105
thens+0.094
rs+0.094
ements+0.091
download+0.091
Neg logit contributions
)].-0.110
 Direction-0.105
 alike-0.099
 ≡-0.098
aukee-0.098
Tokenedu
Token ID15532
Feature activation-7.54e-01

Pos logit contributions
iris+3.872
rs+3.357
thens+3.043
 Wolver+3.041
download+3.037
Neg logit contributions
)].-10.184
 Direction-9.778
 ≡-9.410
 alike-9.396
aukee-9.307
Token/.
Token ID11757
Feature activation-6.18e-01

Pos logit contributions
iris+3.810
rs+3.388
thens+3.361
download+3.282
 Wolver+3.239
Neg logit contributions
)].-5.497
 Direction-5.256
 alike-5.053
 ≡-4.978
aukee-4.938
Token the
Token ID262
Feature activation-2.06e-01

Pos logit contributions
iris+1.648
thens+1.481
rs+1.461
ements+1.422
agra+1.409
Neg logit contributions
)].-1.962
 Direction-1.883
 alike-1.786
aukee-1.767
 ≡-1.765
Token defensive
Token ID6110
Feature activation+7.85e-02

Pos logit contributions
iris+1.319
thens+1.197
rs+1.161
ements+1.157
download+1.138
Neg logit contributions
)].-1.284
 Direction-1.244
 alike-1.168
aukee-1.150
 ≡-1.146
Token line
Token ID1627
Feature activation-4.40e-01

Pos logit contributions
)].+0.478
 Direction+0.463
 alike+0.433
 ≡+0.427
aukee+0.425
Neg logit contributions
iris-0.520
thens-0.471
rs-0.455
ements-0.451
agra-0.446
Token.
Token ID13
Feature activation+6.55e-02

Pos logit contributions
iris+2.758
thens+2.498
rs+2.462
ements+2.408
agra+2.389
Neg logit contributions
)].-2.779
 Direction-2.665
 alike-2.510
 ≡-2.478
aukee-2.474
Token Bo
Token ID3248
Feature activation-3.26e-01

Pos logit contributions
)].+0.352
 Direction+0.349
 alike+0.308
 ≡+0.308
 mL+0.298
Neg logit contributions
iris-0.476
thens-0.444
agra-0.428
ffield-0.425
ements-0.424
Tokenz
Token ID89
Feature activation-1.37e+00

Pos logit contributions
iris+2.044
thens+1.855
rs+1.826
ements+1.787
agra+1.772
Neg logit contributions
)].-2.058
 Direction-1.975
 alike-1.861
 ≡-1.833
aukee-1.832
Tokeneman
Token ID8463
Feature activation-5.40e-01

Pos logit contributions
iris+8.624
rs+7.652
thens+7.483
agra+7.447
download+7.289
Neg logit contributions
)].-8.214
 Direction-7.819
 alike-7.461
 ≡-7.434
PDATE-7.215
Token snaps
Token ID23429
Feature activation-1.43e-01

Pos logit contributions
iris+3.242
thens+2.922
rs+2.870
ements+2.809
agra+2.794
Neg logit contributions
)].-3.576
 Direction-3.441
 alike-3.257
 ≡-3.218
aukee-3.187
Token it
Token ID340
Feature activation-6.62e-02

Pos logit contributions
iris+0.986
thens+0.899
rs+0.878
ements+0.867
download+0.864
Neg logit contributions
)].-0.828
 Direction-0.794
 alike-0.749
 ≡-0.724
aukee-0.721
Token and
Token ID290
Feature activation-4.90e-01

Pos logit contributions
iris+0.445
thens+0.406
ements+0.392
rs+0.392
download+0.389
Neg logit contributions
)].-0.387
 Direction-0.376
 alike-0.362
 ≡-0.347
aukee-0.342
Token takes
Token ID2753
Feature activation+1.63e-01

Pos logit contributions
iris+3.039
thens+2.735
rs+2.686
ements+2.641
agra+2.635
Neg logit contributions
)].-3.175
 Direction-3.056
 alike-2.893
 ≡-2.837
aukee-2.792
Token castle
Token ID16669
Feature activation-8.54e-02

Pos logit contributions
)].+2.128
 Direction+2.074
 alike+1.933
 ≡+1.849
aukee+1.835
Neg logit contributions
iris-2.907
thens-2.674
rs-2.563
ements-2.523
download-2.509
Token.
Token ID13
Feature activation-4.66e-02

Pos logit contributions
iris+0.568
thens+0.521
rs+0.505
ements+0.500
agra+0.497
Neg logit contributions
)].-0.510
 Direction-0.488
 alike-0.466
 ≡-0.453
aukee-0.448
Token The
Token ID383
Feature activation-2.10e-01

Pos logit contributions
iris+0.316
thens+0.290
ements+0.278
agra+0.277
rs+0.276
Neg logit contributions
)].-0.273
 Direction-0.270
 alike-0.247
 ≡-0.242
aukee-0.237
Token cracked
Token ID21368
Feature activation-1.62e-01

Pos logit contributions
iris+1.420
thens+1.295
rs+1.250
download+1.244
ements+1.241
Neg logit contributions
)].-1.247
 Direction-1.208
 alike-1.134
aukee-1.105
 ≡-1.091
Token concrete
Token ID10017
Feature activation-4.57e-01

Pos logit contributions
iris+1.085
thens+0.988
rs+0.965
agra+0.950
ements+0.945
Neg logit contributions
)].-0.960
 Direction-0.923
 alike-0.873
aukee-0.852
 ≡-0.845
Token became
Token ID2627
Feature activation-1.36e+00

Pos logit contributions
iris+3.111
thens+2.836
rs+2.788
ements+2.730
download+2.723
Neg logit contributions
)].-2.659
 Direction-2.548
 alike-2.399
 ≡-2.348
aukee-2.343
Token slick
Token ID29395
Feature activation-7.11e-01

Pos logit contributions
iris+8.298
rs+7.532
thens+7.461
ements+7.382
 Wolver+7.336
Neg logit contributions
)].-8.318
 Direction-7.922
 ≡-7.404
aukee-7.372
 alike-7.245
Token as
Token ID355
Feature activation-7.33e-01

Pos logit contributions
iris+4.403
thens+3.982
rs+3.923
ements+3.837
agra+3.806
Neg logit contributions
)].-4.540
 Direction-4.356
 alike-4.105
 ≡-4.053
aukee-4.045
Token a
Token ID257
Feature activation-9.12e-01

Pos logit contributions
iris+4.698
thens+4.276
rs+4.236
ements+4.145
 Wolver+4.125
Neg logit contributions
)].-4.469
 Direction-4.231
 alike-3.980
 ≡-3.959
aukee-3.948
Token heavy
Token ID4334
Feature activation-5.24e-02

Pos logit contributions
iris+6.083
thens+5.545
rs+5.491
ements+5.403
 Wolver+5.376
Neg logit contributions
)].-5.263
 Direction-4.973
 alike-4.669
 ≡-4.664
aukee-4.630
Token fog
Token ID19558
Feature activation-1.74e-01

Pos logit contributions
iris+0.355
thens+0.322
rs+0.315
ements+0.309
agra+0.309
Neg logit contributions
)].-0.311
 Direction-0.298
 alike-0.282
aukee-0.271
 ≡-0.268
Token himself
Token ID2241
Feature activation+1.42e-01

Pos logit contributions
)].+2.257
 Direction+2.181
 alike+2.118
 ≡+2.030
aukee+1.996
Neg logit contributions
iris-2.611
thens-2.413
ements-2.312
ffield-2.306
agra-2.300
Token.
Token ID13
Feature activation-1.04e-01

Pos logit contributions
)].+0.845
 Direction+0.809
 alike+0.782
 ≡+0.756
aukee+0.740
Neg logit contributions
iris-0.934
thens-0.856
ements-0.828
rs-0.821
agra-0.820
Token His
Token ID2399
Feature activation-9.55e-02

Pos logit contributions
iris+0.741
thens+0.683
agra+0.661
rs+0.659
ements+0.655
Neg logit contributions
)].-0.574
 Direction-0.559
 alike-0.513
 ≡-0.508
aukee-0.486
Token face
Token ID1986
Feature activation-4.71e-01

Pos logit contributions
iris+0.634
thens+0.586
rs+0.569
agra+0.562
ements+0.562
Neg logit contributions
)].-0.570
 Direction-0.554
 alike-0.514
 ≡-0.506
aukee-0.498
Token cont
Token ID542
Feature activation-5.89e-01

Pos logit contributions
iris+2.908
thens+2.638
rs+2.580
ements+2.529
download+2.517
Neg logit contributions
)].-3.030
 Direction-2.907
 alike-2.773
 ≡-2.710
aukee-2.690
Tokenorted
Token ID9741
Feature activation-1.36e+00

Pos logit contributions
iris+2.715
thens+2.354
rs+2.343
ements+2.260
agra+2.230
Neg logit contributions
)].-4.681
 Direction-4.552
 alike-4.298
aukee-4.280
 ≡-4.273
Token slightly
Token ID4622
Feature activation-1.25e+00

Pos logit contributions
iris+8.901
rs+8.083
thens+8.032
ements+7.809
download+7.691
Neg logit contributions
)].-7.828
 Direction-7.452
aukee-6.908
 ≡-6.819
PDATE-6.665
Token.
Token ID13
Feature activation-4.17e-01

Pos logit contributions
iris+7.227
thens+6.481
rs+6.479
ements+6.164
download+6.121
Neg logit contributions
)].-8.250
 Direction-7.884
aukee-7.416
 ≡-7.361
 alike-7.166
Token
Token ID564
Feature activation-1.20e-01

Pos logit contributions
iris+2.920
thens+2.678
rs+2.624
ements+2.584
agra+2.579
Neg logit contributions
)].-2.359
 Direction-2.266
 alike-2.107
 ≡-2.079
aukee-2.048
Token
Token ID250
Feature activation+2.15e-01

Pos logit contributions
iris+0.728
thens+0.658
rs+0.648
ements+0.633
download+0.628
Neg logit contributions
)].-0.780
 Direction-0.748
 alike-0.705
aukee-0.697
 ≡-0.696
TokenSo
Token ID2396
Feature activation+4.72e-03

Pos logit contributions
)].+1.155
 Direction+1.087
 alike+0.990
 ≡+0.966
aukee+0.962
Neg logit contributions
iris-1.523
thens-1.403
agra-1.374
 Wolver-1.371
ements-1.369
Token The
Token ID383
Feature activation+1.18e-01

Pos logit contributions
)].+1.624
 Direction+1.559
 alike+1.490
aukee+1.481
 ≡+1.476
Neg logit contributions
iris-1.006
thens-0.885
rs-0.873
ements-0.842
agra-0.832
Token 67
Token ID8275
Feature activation+8.51e-02

Pos logit contributions
)].+0.694
 Direction+0.690
 alike+0.624
 ≡+0.617
aukee+0.607
Neg logit contributions
iris-0.780
thens-0.715
rs-0.689
ements-0.680
download-0.680
Token-
Token ID12
Feature activation+6.03e-01

Pos logit contributions
)].+0.662
 Direction+0.639
aukee+0.612
 alike+0.610
 ≡+0.608
Neg logit contributions
iris-0.411
rs-0.359
thens-0.359
ements-0.345
 Wolver-0.338
Tokenyear
Token ID1941
Feature activation+2.15e-01

Pos logit contributions
)].+2.929
 Direction+2.580
 alike+2.382
 interviewer+2.202
 mL+2.195
Neg logit contributions
iris-4.316
thens-4.106
agra-4.056
ffield-4.000
 Wolver-3.967
Token-
Token ID12
Feature activation+2.50e-01

Pos logit contributions
)].+1.398
 Direction+1.319
 alike+1.266
 interviewer+1.188
 mL+1.185
Neg logit contributions
iris-1.321
thens-1.209
agra-1.149
download-1.137
ements-1.127
Tokenold
Token ID727
Feature activation+8.39e-01

Pos logit contributions
)].+1.299
 Direction+1.165
 alike+1.124
 mL+1.046
opez+1.038
Neg logit contributions
iris-1.792
thens-1.678
 Wolver-1.643
ffield-1.623
ements-1.592
Token is
Token ID318
Feature activation+2.21e-01

Pos logit contributions
 Direction+4.662
)].+4.649
 alike+4.344
 interviewer+4.111
 ≡+4.091
Neg logit contributions
iris-5.485
thens-5.023
agra-4.969
download-4.884
ements-4.849
Token now
Token ID783
Feature activation+4.15e-01

Pos logit contributions
)].+1.179
 Direction+1.164
 alike+1.091
 ≡+1.018
 mL+1.006
Neg logit contributions
iris-1.579
thens-1.453
ements-1.424
download-1.404
agra-1.400
Token an
Token ID281
Feature activation+2.59e-01

Pos logit contributions
 Direction+2.262
)].+2.244
 alike+2.101
 ≡+1.986
aukee+1.975
Neg logit contributions
iris-2.844
thens-2.611
ements-2.577
download-2.528
agra-2.505
Token adviser
Token ID12534
Feature activation+4.45e-02

Pos logit contributions
)].+1.443
 Direction+1.436
 alike+1.335
aukee+1.300
 interviewer+1.288
Neg logit contributions
iris-1.735
thens-1.606
rs-1.574
ements-1.564
download-1.562
Token to
Token ID284
Feature activation-4.40e-01

Pos logit contributions
)].+0.285
 Direction+0.276
 alike+0.262
 ≡+0.256
aukee+0.255
Neg logit contributions
iris-0.276
thens-0.248
rs-0.240
agra-0.237
ements-0.236
Token is
Token ID318
Feature activation-2.03e-01

Pos logit contributions
iris+1.979
thens+1.794
rs+1.752
ements+1.719
agra+1.715
Neg logit contributions
)].-1.937
 Direction-1.886
 alike-1.773
 ≡-1.741
aukee-1.719
Token using
Token ID1262
Feature activation-5.72e-01

Pos logit contributions
iris+1.368
thens+1.249
rs+1.220
ements+1.202
agra+1.200
Neg logit contributions
)].-1.182
 Direction-1.153
 alike-1.093
 ≡-1.055
aukee-1.035
Token an
Token ID281
Feature activation+1.25e-01

Pos logit contributions
iris+4.080
thens+3.738
rs+3.699
ements+3.628
agra+3.597
Neg logit contributions
)].-3.113
 Direction-2.957
 alike-2.756
 ≡-2.721
aukee-2.714
Token H
Token ID367
Feature activation+9.29e-02

Pos logit contributions
)].+0.660
 Direction+0.635
 alike+0.604
aukee+0.584
 ≡+0.576
Neg logit contributions
iris-0.895
thens-0.826
rs-0.807
ements-0.798
download-0.795
TokenVM
Token ID15996
Feature activation-2.73e-01

Pos logit contributions
)].+0.565
 Direction+0.541
 alike+0.508
 ≡+0.502
aukee+0.501
Neg logit contributions
iris-0.604
thens-0.549
rs-0.542
ements-0.530
agra-0.526
Token AM
Token ID3001
Feature activation+8.29e-01

Pos logit contributions
iris+1.905
thens+1.750
rs+1.695
ements+1.687
agra+1.665
Neg logit contributions
)].-1.547
 Direction-1.488
 alike-1.400
 ≡-1.362
aukee-1.353
TokenI
Token ID40
Feature activation-4.83e-02

Pos logit contributions
)].+3.517
 Direction+3.308
 alike+3.063
aukee+2.924
 ≡+2.924
Neg logit contributions
iris-6.724
thens-6.329
ements-6.152
rs-6.145
agra-6.054
Token,
Token ID11
Feature activation-3.12e-01

Pos logit contributions
iris+0.316
thens+0.290
rs+0.276
ements+0.276
agra+0.272
Neg logit contributions
)].-0.295
 Direction-0.286
 alike-0.273
 ≡-0.264
aukee-0.260
Token to
Token ID284
Feature activation-2.61e-01

Pos logit contributions
iris+2.100
thens+1.926
rs+1.880
ements+1.854
agra+1.851
Neg logit contributions
)].-1.828
 Direction-1.768
 alike-1.659
 ≡-1.622
aukee-1.607
Token help
Token ID1037
Feature activation-3.30e-01

Pos logit contributions
iris+1.681
thens+1.541
rs+1.507
ements+1.464
agra+1.464
Neg logit contributions
)].-1.612
 Direction-1.549
 alike-1.463
 ≡-1.439
aukee-1.417
Token differentiate
Token ID28754
Feature activation-1.55e-01

Pos logit contributions
iris+2.281
thens+2.086
rs+2.044
ements+2.009
agra+2.005
Neg logit contributions
)].-1.894
 Direction-1.821
 alike-1.714
 ≡-1.668
aukee-1.664
Token can
Token ID460
Feature activation-5.90e-01

Pos logit contributions
iris+3.782
thens+3.425
rs+3.382
ements+3.306
agra+3.279
Neg logit contributions
)].-3.735
 Direction-3.580
 alike-3.368
 ≡-3.326
aukee-3.320
Token interpret
Token ID6179
Feature activation+4.68e-01

Pos logit contributions
iris+3.837
thens+3.486
rs+3.439
ements+3.366
agra+3.340
Neg logit contributions
)].-3.591
 Direction-3.439
 alike-3.231
 ≡-3.188
aukee-3.182
Token what
Token ID644
Feature activation+7.13e-01

Pos logit contributions
)].+2.630
 Direction+2.566
 alike+2.448
 ≡+2.332
aukee+2.225
Neg logit contributions
iris-3.204
thens-2.951
download-2.841
rs-2.832
agra-2.822
Token
Token ID447
Feature activation-2.79e-01

Pos logit contributions
)].+3.946
 Direction+3.848
 alike+3.487
 ≡+3.459
 mL+3.331
Neg logit contributions
iris-4.886
thens-4.484
rs-4.316
ements-4.308
download-4.307
Token
Token ID247
Feature activation-9.79e-02

Pos logit contributions
iris+2.233
thens+2.049
rs+2.023
ements+2.006
agra+1.995
Neg logit contributions
)].-1.330
 Direction-1.242
 alike-1.151
 ≡-1.131
aukee-1.116
Tokens
Token ID82
Feature activation+8.02e-01

Pos logit contributions
iris+0.570
thens+0.505
rs+0.495
agra+0.490
ements+0.489
Neg logit contributions
)].-0.666
 Direction-0.642
 alike-0.607
 ≡-0.595
aukee-0.591
Token going
Token ID1016
Feature activation-1.99e-01

Pos logit contributions
)].+3.734
 Direction+3.530
 alike+3.376
 ≡+3.159
 interviewer+3.051
Neg logit contributions
iris-6.104
thens-5.789
ffield-5.493
agra-5.446
ements-5.436
Token on
Token ID319
Feature activation-5.22e-02

Pos logit contributions
iris+1.237
rs+1.100
thens+1.095
ements+1.077
agra+1.060
Neg logit contributions
)].-1.265
 Direction-1.214
 ≡-1.134
 alike-1.133
aukee-1.122
Token better
Token ID1365
Feature activation+3.78e-02

Pos logit contributions
iris+0.345
thens+0.321
ements+0.309
rs+0.307
agra+0.307
Neg logit contributions
)].-0.313
 Direction-0.298
 alike-0.284
 ≡-0.274
aukee-0.272
Token or
Token ID393
Feature activation-2.52e-01

Pos logit contributions
)].+0.230
 Direction+0.221
 alike+0.211
 ≡+0.203
aukee+0.201
Neg logit contributions
iris-0.246
thens-0.227
rs-0.218
ements-0.216
agra-0.216
Token what
Token ID644
Feature activation+3.82e-02

Pos logit contributions
iris+1.640
thens+1.496
rs+1.453
ements+1.431
agra+1.430
Neg logit contributions
)].-1.558
 Direction-1.482
 alike-1.414
 ≡-1.362
aukee-1.352
Token for
Token ID329
Feature activation+2.53e-01

Pos logit contributions
)].+1.836
 Direction+1.798
 alike+1.723
 ≡+1.671
aukee+1.658
Neg logit contributions
iris-1.776
thens-1.602
rs-1.528
ements-1.526
download-1.519
Token the
Token ID262
Feature activation+1.44e-01

Pos logit contributions
)].+1.508
 Direction+1.439
 alike+1.375
 ≡+1.354
aukee+1.310
Neg logit contributions
iris-1.685
thens-1.537
ements-1.469
download-1.468
ffield-1.461
Token cuts
Token ID6630
Feature activation+1.94e-01

Pos logit contributions
)].+0.854
 Direction+0.816
 alike+0.767
 ≡+0.764
aukee+0.748
Neg logit contributions
iris-0.967
thens-0.885
rs-0.851
download-0.846
ements-0.840
Token is
Token ID318
Feature activation+4.24e-01

Pos logit contributions
)].+1.211
 Direction+1.146
 alike+1.112
 ≡+1.085
aukee+1.061
Neg logit contributions
iris-1.223
thens-1.119
download-1.082
agra-1.076
ements-1.072
Token a
Token ID257
Feature activation-6.71e-02

Pos logit contributions
)].+2.525
 Direction+2.366
 alike+2.304
 ≡+2.203
aukee+2.143
Neg logit contributions
iris-2.815
thens-2.630
download-2.500
ements-2.486
agra-2.467
Token further
Token ID2252
Feature activation+8.67e-01

Pos logit contributions
iris+0.439
thens+0.402
rs+0.383
download+0.378
agra+0.378
Neg logit contributions
)].-0.417
 Direction-0.401
 alike-0.379
 ≡-0.370
aukee-0.363
Token demonstration
Token ID13646
Feature activation-7.87e-02

Pos logit contributions
)].+5.079
 Direction+4.867
 alike+4.528
 ≡+4.261
aukee+4.235
Neg logit contributions
iris-5.629
thens-5.115
agra-4.910
download-4.890
rs-4.817
Token of
Token ID286
Feature activation-3.08e-01

Pos logit contributions
iris+0.487
rs+0.443
thens+0.438
ements+0.424
download+0.420
Neg logit contributions
)].-0.501
 Direction-0.478
 alike-0.444
aukee-0.443
 ≡-0.443
Token priorities
Token ID15369
Feature activation-6.78e-02

Pos logit contributions
iris+1.891
thens+1.711
rs+1.673
ements+1.643
agra+1.632
Neg logit contributions
)].-1.993
 Direction-1.915
 alike-1.808
 ≡-1.785
aukee-1.777
Token:
Token ID25
Feature activation+3.39e-01

Pos logit contributions
iris+0.429
thens+0.392
rs+0.378
ements+0.375
agra+0.373
Neg logit contributions
)].-0.426
 Direction-0.411
 alike-0.390
 ≡-0.382
aukee-0.379
Token allegiance
Token ID25696
Feature activation-1.78e-01

Pos logit contributions
)].+1.900
 Direction+1.879
 alike+1.760
 ≡+1.724
aukee+1.633
Neg logit contributions
iris-2.337
thens-2.126
agra-2.048
ements-2.043
rs-2.042
Token of
Token ID286
Feature activation+9.09e-02

Pos logit contributions
)].+0.036
 Direction+0.034
 alike+0.033
 ≡+0.032
aukee+0.031
Neg logit contributions
iris-0.040
thens-0.036
rs-0.035
agra-0.034
ements-0.034
Token 15
Token ID1315
Feature activation-2.51e-01

Pos logit contributions
)].+0.528
 Direction+0.501
 ≡+0.473
 alike+0.471
aukee+0.452
Neg logit contributions
iris-0.630
thens-0.574
agra-0.551
rs-0.550
ements-0.549
Token million
Token ID1510
Feature activation+3.50e-01

Pos logit contributions
iris+1.612
thens+1.470
rs+1.427
agra+1.405
ements+1.398
Neg logit contributions
)].-1.566
 Direction-1.502
 alike-1.420
 ≡-1.400
aukee-1.387
Token days
Token ID1528
Feature activation-2.06e-01

Pos logit contributions
)].+1.963
 Direction+1.853
 alike+1.794
 ≡+1.748
aukee+1.701
Neg logit contributions
iris-2.427
thens-2.259
agra-2.177
download-2.137
rs-2.129
Token s
Token ID264
Feature activation-2.98e-01

Pos logit contributions
iris+1.374
thens+1.253
rs+1.227
ements+1.210
agra+1.205
Neg logit contributions
)].-1.230
 Direction-1.172
 alike-1.106
 ≡-1.089
aukee-1.086
Tokenocked
Token ID3543
Feature activation+8.12e-01

Pos logit contributions
iris+1.899
thens+1.720
rs+1.700
ements+1.661
agra+1.649
Neg logit contributions
)].-1.846
 Direction-1.769
 alike-1.664
 ≡-1.641
aukee-1.639
Token away
Token ID1497
Feature activation-2.12e-01

Pos logit contributions
)].+3.587
 Direction+3.255
 alike+3.238
 ≡+3.044
aukee+3.028
Neg logit contributions
iris-6.387
thens-5.946
agra-5.824
ffield-5.815
ements-5.783
Token
Token ID851
Feature activation+3.18e-01

Pos logit contributions
iris+1.445
thens+1.321
rs+1.289
ements+1.278
agra+1.270
Neg logit contributions
)].-1.235
 Direction-1.172
 alike-1.109
 ≡-1.092
aukee-1.081
Token and
Token ID290
Feature activation-1.34e-01

Pos logit contributions
)].+1.987
 Direction+1.923
 alike+1.828
 ≡+1.812
aukee+1.775
Neg logit contributions
iris-1.987
thens-1.795
agra-1.728
ements-1.727
rs-1.726
Token reduces
Token ID12850
Feature activation-3.70e-01

Pos logit contributions
iris+0.841
thens+0.751
rs+0.730
ements+0.726
download+0.725
Neg logit contributions
)].-0.858
 Direction-0.822
 alike-0.778
 ≡-0.768
aukee-0.753
Token their
Token ID511
Feature activation-1.36e-01

Pos logit contributions
iris+2.303
thens+2.085
rs+2.036
agra+2.003
ements+1.992
Neg logit contributions
)].-2.386
 Direction-2.295
 alike-2.169
aukee-2.131
 ≡-2.126
Token
Token ID198
Feature activation+2.55e-01

Pos logit contributions
iris+0.591
thens+0.538
ffield+0.517
agra+0.509
ements+0.499
Neg logit contributions
)].-0.679
 Direction-0.643
 alike-0.604
 mL-0.587
 ≡-0.572
Token
Token ID447
Feature activation+7.95e-02

Pos logit contributions
)].+1.338
 Direction+1.266
 alike+1.162
 ≡+1.158
aukee+1.143
Neg logit contributions
iris-1.863
thens-1.719
ements-1.675
rs-1.669
agra-1.661
Token
Token ID246
Feature activation+8.99e-02

Pos logit contributions
)].+0.413
 Direction+0.390
 alike+0.362
aukee+0.355
 ≡+0.353
Neg logit contributions
iris-0.586
thens-0.542
rs-0.537
ements-0.525
download-0.521
TokenI
Token ID40
Feature activation+2.18e-02

Pos logit contributions
)].+0.508
 Direction+0.482
 alike+0.445
 ≡+0.439
aukee+0.433
Neg logit contributions
iris-0.620
thens-0.571
ements-0.559
rs-0.554
agra-0.552
Token think
Token ID892
Feature activation-3.15e-01

Pos logit contributions
)].+0.129
 Direction+0.121
 alike+0.117
 ≡+0.114
aukee+0.110
Neg logit contributions
iris-0.145
thens-0.136
ements-0.131
agra-0.127
rs-0.127
Token most
Token ID749
Feature activation+3.53e-01

Pos logit contributions
iris+2.148
thens+1.958
rs+1.919
ements+1.897
agra+1.884
Neg logit contributions
)].-1.836
 Direction-1.756
 alike-1.656
 ≡-1.614
aukee-1.598
Token of
Token ID286
Feature activation-2.53e-01

Pos logit contributions
)].+1.909
 Direction+1.831
 alike+1.755
 ≡+1.684
aukee+1.634
Neg logit contributions
iris-2.528
thens-2.321
agra-2.248
ements-2.246
rs-2.229
Token us
Token ID514
Feature activation-4.63e-02

Pos logit contributions
iris+1.805
thens+1.655
rs+1.630
ements+1.608
agra+1.592
Neg logit contributions
)].-1.381
 Direction-1.317
 alike-1.231
 ≡-1.208
aukee-1.200
Token feel
Token ID1254
Feature activation-2.04e-01

Pos logit contributions
iris+0.322
thens+0.299
ements+0.289
rs+0.286
agra+0.285
Neg logit contributions
)].-0.264
 Direction-0.247
 alike-0.241
 ≡-0.229
aukee-0.225
Token an
Token ID281
Feature activation+6.63e-02

Pos logit contributions
iris+1.352
thens+1.232
rs+1.201
download+1.186
ements+1.186
Neg logit contributions
)].-1.225
 Direction-1.178
 alike-1.133
 ≡-1.082
aukee-1.066
Token emotional
Token ID7016
Feature activation-5.34e-02

Pos logit contributions
)].+0.352
 Direction+0.332
 alike+0.315
aukee+0.304
 ≡+0.301
Neg logit contributions
iris-0.483
thens-0.444
rs-0.438
ements-0.432
download-0.431
Token America
Token ID2253
Feature activation-1.92e-02

Pos logit contributions
iris+0.047
thens+0.043
rs+0.042
agra+0.041
download+0.040
Neg logit contributions
)].-0.039
 Direction-0.038
 alike-0.036
 ≡-0.035
aukee-0.034
Token.
Token ID13
Feature activation-2.63e-01

Pos logit contributions
iris+0.124
thens+0.113
rs+0.110
ements+0.108
agra+0.108
Neg logit contributions
)].-0.118
 Direction-0.114
 alike-0.108
 ≡-0.106
aukee-0.104
Token They
Token ID1119
Feature activation+1.15e-01

Pos logit contributions
iris+1.849
thens+1.696
agra+1.642
ements+1.625
rs+1.624
Neg logit contributions
)].-1.507
 Direction-1.480
 alike-1.356
 ≡-1.335
 mL-1.287
Token
Token ID447
Feature activation-7.60e-02

Pos logit contributions
)].+0.693
 Direction+0.658
 alike+0.638
 ≡+0.609
aukee+0.599
Neg logit contributions
iris-0.750
thens-0.695
ements-0.665
rs-0.662
agra-0.657
Token
Token ID247
Feature activation+5.16e-02

Pos logit contributions
iris+0.638
thens+0.584
rs+0.573
agra+0.573
ements+0.572
Neg logit contributions
)].-0.336
 Direction-0.306
 alike-0.287
 ≡-0.277
aukee-0.269
Tokenll
Token ID297
Feature activation+2.13e-01

Pos logit contributions
)].+0.343
 Direction+0.326
 alike+0.316
 ≡+0.304
aukee+0.300
Neg logit contributions
iris-0.309
thens-0.274
agra-0.268
ements-0.264
 Wolver-0.263
Token just
Token ID655
Feature activation-5.33e-02

Pos logit contributions
)].+1.288
 Direction+1.200
 alike+1.197
 ≡+1.118
 mL+1.097
Neg logit contributions
iris-1.402
thens-1.289
ements-1.233
agra-1.221
rs-1.213
Token write
Token ID3551
Feature activation+5.09e-01

Pos logit contributions
iris+0.344
thens+0.317
rs+0.301
ements+0.301
agra+0.301
Neg logit contributions
)].-0.336
 Direction-0.318
 alike-0.311
 ≡-0.293
aukee-0.290
Token off
Token ID572
Feature activation+4.89e-01

Pos logit contributions
)].+2.792
 Direction+2.688
 alike+2.604
 ≡+2.444
 mL+2.347
Neg logit contributions
iris-3.545
thens-3.300
agra-3.194
rs-3.177
ements-3.175
Token the
Token ID262
Feature activation+4.22e-01

Pos logit contributions
)].+2.627
 Direction+2.551
 alike+2.481
 ≡+2.290
 mL+2.229
Neg logit contributions
iris-3.426
thens-3.174
agra-3.098
ements-3.060
download-3.054
Token other
Token ID584
Feature activation+4.49e-01

Pos logit contributions
)].+2.258
 Direction+2.206
 alike+2.054
 interviewer+1.959
 ≡+1.941
Neg logit contributions
iris-2.999
thens-2.787
agra-2.689
rs-2.672
download-2.671
Token less
Token ID1342
Feature activation+5.35e-01

Pos logit contributions
)].+2.506
 Direction+2.441
 alike+2.266
 ≡+2.227
aukee+2.183
Neg logit contributions
iris-2.764
thens-2.494
rs-2.461
ements-2.420
download-2.399
Token thing
Token ID1517
Feature activation+8.66e-02

Pos logit contributions
)].+3.082
 Direction+3.014
 alike+2.747
 ≡+2.697
aukee+2.651
Neg logit contributions
iris-3.694
rs-3.274
agra-3.259
ements-3.257
thens-3.241
Token for
Token ID329
Feature activation+2.18e-01

Pos logit contributions
)].+0.544
 Direction+0.528
 alike+0.502
 ≡+0.491
aukee+0.483
Neg logit contributions
iris-0.542
thens-0.488
rs-0.479
ements-0.475
agra-0.473
Token you
Token ID345
Feature activation+2.85e-01

Pos logit contributions
)].+1.336
 Direction+1.287
 alike+1.211
 ≡+1.199
aukee+1.154
Neg logit contributions
iris-1.419
thens-1.270
rs-1.257
ements-1.255
agra-1.249
Token to
Token ID284
Feature activation+2.50e-01

Pos logit contributions
)].+1.879
 Direction+1.807
 alike+1.713
 ≡+1.694
aukee+1.679
Neg logit contributions
iris-1.706
thens-1.530
rs-1.495
agra-1.484
ements-1.479
Token deal
Token ID1730
Feature activation+4.84e-01

Pos logit contributions
)].+1.294
 Direction+1.228
 ≡+1.150
 alike+1.143
aukee+1.098
Neg logit contributions
iris-1.885
thens-1.712
ements-1.693
agra-1.687
rs-1.676
Token with
Token ID351
Feature activation+3.49e-01

Pos logit contributions
)].+2.598
 Direction+2.486
 alike+2.327
 ≡+2.292
aukee+2.214
Neg logit contributions
iris-3.469
thens-3.151
rs-3.045
ements-3.041
ffield-3.041
Token.
Token ID13
Feature activation-2.60e-01

Pos logit contributions
)].+1.999
 Direction+1.937
 alike+1.868
 ≡+1.822
aukee+1.750
Neg logit contributions
iris-2.326
thens-2.121
ements-2.077
agra-2.074
rs-2.069
Token If
Token ID1002
Feature activation-1.59e-01

Pos logit contributions
iris+1.738
thens+1.590
rs+1.541
ements+1.534
agra+1.525
Neg logit contributions
)].-1.554
 Direction-1.517
 alike-1.401
 ≡-1.389
aukee-1.346
Token you
Token ID345
Feature activation-6.71e-02

Pos logit contributions
iris+0.965
thens+0.872
rs+0.857
ements+0.843
agra+0.833
Neg logit contributions
)].-1.042
 Direction-1.005
 alike-0.947
 ≡-0.936
aukee-0.927
Token really
Token ID1107
Feature activation-4.11e-01

Pos logit contributions
iris+0.446
thens+0.406
ements+0.397
agra+0.392
rs+0.390
Neg logit contributions
)].-0.403
 Direction-0.389
 ≡-0.368
 alike-0.367
aukee-0.352
Token the
Token ID262
Feature activation-1.64e-01

Pos logit contributions
)].+0.598
 Direction+0.561
 alike+0.538
 ≡+0.526
aukee+0.509
Neg logit contributions
iris-0.665
thens-0.609
ements-0.585
download-0.584
rs-0.583
Token death
Token ID1918
Feature activation-1.18e-01

Pos logit contributions
iris+1.061
thens+0.971
rs+0.944
agra+0.927
ements+0.922
Neg logit contributions
)].-1.016
 Direction-0.973
 alike-0.923
 ≡-0.904
aukee-0.891
Token of
Token ID286
Feature activation+4.25e-01

Pos logit contributions
iris+0.677
rs+0.618
thens+0.604
ements+0.599
download+0.586
Neg logit contributions
)].-0.794
 Direction-0.776
aukee-0.733
 alike-0.717
 ≡-0.717
Token the
Token ID262
Feature activation+3.18e-01

Pos logit contributions
)].+2.333
 Direction+2.201
 alike+2.098
 ≡+2.021
 mL+1.904
Neg logit contributions
iris-3.026
thens-2.792
ffield-2.689
agra-2.684
rs-2.673
Token Dru
Token ID21295
Feature activation+2.78e-01

Pos logit contributions
)].+1.834
 Direction+1.748
 alike+1.639
 ≡+1.609
 interviewer+1.544
Neg logit contributions
iris-2.160
thens-1.998
ements-1.939
agra-1.929
rs-1.927
Tokenze
Token ID2736
Feature activation+5.31e-01

Pos logit contributions
)].+1.196
 Direction+1.117
 alike+1.022
aukee+0.982
 ≡+0.980
Neg logit contributions
iris-2.264
thens-2.127
ements-2.077
rs-2.076
 Wolver-2.064
Token police
Token ID1644
Feature activation+3.51e-01

Pos logit contributions
)].+3.137
 Direction+2.968
 alike+2.905
 ≡+2.824
aukee+2.720
Neg logit contributions
iris-3.511
thens-3.232
ements-3.163
agra-3.094
ffield-3.061
Token officer
Token ID3818
Feature activation+1.07e-01

Pos logit contributions
)].+2.153
 Direction+2.052
 alike+1.997
 ≡+1.928
aukee+1.924
Neg logit contributions
iris-2.258
thens-2.068
agra-1.986
ements-1.967
rs-1.966
Token.
Token ID13
Feature activation+3.95e-01

Pos logit contributions
)].+0.648
 Direction+0.611
 alike+0.595
 ≡+0.572
aukee+0.556
Neg logit contributions
iris-0.707
thens-0.644
rs-0.628
agra-0.628
ements-0.622
Token
Token ID198
Feature activation-3.52e-01

Pos logit contributions
 Direction+2.248
)].+2.246
 alike+2.036
 ≡+2.032
 mL+1.954
Neg logit contributions
iris-2.745
thens-2.552
ffield-2.459
agra-2.419
ements-2.405
Token
Token ID198
Feature activation-1.19e-01

Pos logit contributions
iris+1.979
thens+1.789
agra+1.693
ffield+1.690
ements+1.674
Neg logit contributions
)].-2.521
 Direction-2.416
 alike-2.286
 mL-2.203
 ≡-2.190
Token Sachs
Token ID25158
Feature activation+2.10e-02

Pos logit contributions
)].+0.950
 Direction+0.930
 ≡+0.853
 alike+0.849
aukee+0.830
Neg logit contributions
iris-1.060
thens-0.954
rs-0.922
download-0.915
agra-0.907
Token Came
Token ID32653
Feature activation+7.78e-03

Pos logit contributions
)].+0.126
 Direction+0.123
 alike+0.113
 ≡+0.113
aukee+0.110
Neg logit contributions
iris-0.141
thens-0.128
rs-0.124
download-0.123
agra-0.122
Token to
Token ID284
Feature activation-1.39e-01

Pos logit contributions
)].+0.049
 Direction+0.048
 alike+0.045
 ≡+0.044
aukee+0.043
Neg logit contributions
iris-0.049
thens-0.044
rs-0.043
agra-0.043
download-0.042
Token Rule
Token ID14330
Feature activation+2.48e-01

Pos logit contributions
iris+0.934
thens+0.848
rs+0.831
ements+0.820
agra+0.814
Neg logit contributions
)].-0.831
 Direction-0.798
 alike-0.744
 ≡-0.736
aukee-0.730
Token the
Token ID262
Feature activation+1.57e-01

Pos logit contributions
)].+1.399
 Direction+1.362
 alike+1.266
 ≡+1.241
aukee+1.174
Neg logit contributions
iris-1.750
thens-1.582
rs-1.533
download-1.514
ements-1.512
Token World
Token ID2159
Feature activation+1.75e-01

Pos logit contributions
)].+0.930
 Direction+0.928
 alike+0.828
 ≡+0.818
 Piece+0.770
Neg logit contributions
iris-1.083
thens-0.985
ffield-0.943
rs-0.937
download-0.930
Token.
Token ID13
Feature activation+1.37e-01

Pos logit contributions
)].+1.047
 Direction+1.007
 alike+0.943
 ≡+0.931
aukee+0.907
Neg logit contributions
iris-1.164
thens-1.063
rs-1.030
ements-1.025
agra-1.018
Token
Token ID447
Feature activation-5.61e-02

Pos logit contributions
)].+0.883
 Direction+0.856
 alike+0.796
 ≡+0.794
aukee+0.769
Neg logit contributions
iris-0.848
thens-0.768
rs-0.737
ements-0.737
agra-0.735
Token
Token ID251
Feature activation+2.27e-01

Pos logit contributions
iris+0.347
thens+0.316
rs+0.311
ements+0.303
download+0.302
Neg logit contributions
)].-0.357
 Direction-0.342
 alike-0.323
aukee-0.319
 ≡-0.318
Token
Token ID564
Feature activation-1.74e-01

Pos logit contributions
)].+1.407
 Direction+1.368
 alike+1.265
 ≡+1.256
aukee+1.208
Neg logit contributions
iris-1.471
thens-1.333
ffield-1.294
agra-1.289
ements-1.278
Token
Token ID250
Feature activation+3.05e-01

Pos logit contributions
iris+1.005
thens+0.905
rs+0.892
ements+0.866
download+0.862
Neg logit contributions
)].-1.174
 Direction-1.131
 alike-1.069
aukee-1.058
 ≡-1.055
Token complaints
Token ID9687
Feature activation-4.10e-01

Pos logit contributions
)].+3.100
 Direction+2.973
 alike+2.798
 ≡+2.791
aukee+2.711
Neg logit contributions
iris-3.168
thens-2.907
agra-2.751
rs-2.712
ffield-2.690
Token against
Token ID1028
Feature activation-1.95e-01

Pos logit contributions
iris+2.578
thens+2.326
rs+2.269
ements+2.232
agra+2.220
Neg logit contributions
)].-2.620
 Direction-2.509
 alike-2.385
 ≡-2.333
aukee-2.331
Token the
Token ID262
Feature activation-9.86e-02

Pos logit contributions
iris+1.372
thens+1.252
rs+1.227
agra+1.210
ements+1.207
Neg logit contributions
)].-1.094
 Direction-1.051
 alike-0.980
 ≡-0.965
aukee-0.947
Token Trump
Token ID1301
Feature activation+7.03e-02

Pos logit contributions
iris+0.725
thens+0.669
rs+0.655
agra+0.646
ements+0.643
Neg logit contributions
)].-0.522
 Direction-0.504
 alike-0.461
 ≡-0.455
aukee-0.449
Token campaign
Token ID1923
Feature activation+1.38e-01

Pos logit contributions
)].+0.371
 Direction+0.365
 alike+0.329
 ≡+0.329
aukee+0.325
Neg logit contributions
iris-0.509
thens-0.475
rs-0.458
agra-0.456
ements-0.454
Token for
Token ID329
Feature activation-1.87e-01

Pos logit contributions
)].+0.813
 Direction+0.783
 alike+0.754
 ≡+0.726
aukee+0.717
Neg logit contributions
iris-0.927
thens-0.847
agra-0.805
ements-0.804
rs-0.800
Token solic
Token ID25806
Feature activation+1.79e-01

Pos logit contributions
iris+1.193
thens+1.072
rs+1.037
agra+1.025
ements+1.025
Neg logit contributions
)].-1.191
 Direction-1.138
 alike-1.081
 ≡-1.066
aukee-1.050
Tokeniting
Token ID1780
Feature activation+8.23e-03

Pos logit contributions
)].+1.055
 Direction+1.005
 ≡+0.958
aukee+0.954
 alike+0.950
Neg logit contributions
iris-1.153
thens-1.068
rs-1.050
 Wolver-1.021
ements-1.012
Token foreign
Token ID3215
Feature activation-1.88e-01

Pos logit contributions
)].+0.048
 Direction+0.046
 alike+0.044
 ≡+0.043
aukee+0.042
Neg logit contributions
iris-0.056
thens-0.052
rs-0.050
ements-0.049
agra-0.049
Token nationals
Token ID22970
Feature activation-2.71e-01

Pos logit contributions
iris+1.258
thens+1.162
rs+1.120
download+1.093
ements+1.088
Neg logit contributions
)].-1.108
 Direction-1.070
 alike-1.021
 ≡-0.991
aukee-0.991
Token via
Token ID2884
Feature activation-1.32e-01

Pos logit contributions
iris+1.676
thens+1.534
rs+1.492
ements+1.472
agra+1.464
Neg logit contributions
)].-1.737
 Direction-1.670
 alike-1.603
 ≡-1.568
aukee-1.555
Token
Token ID564
Feature activation-3.95e-02

Pos logit contributions
)].+1.182
 Direction+1.173
 alike+1.066
 ≡+1.062
 mL+1.023
Neg logit contributions
iris-1.273
thens-1.159
agra-1.120
ffield-1.119
ements-1.111
Token
Token ID250
Feature activation+2.57e-01

Pos logit contributions
iris+0.237
thens+0.214
rs+0.211
ements+0.206
download+0.204
Neg logit contributions
)].-0.260
 Direction-0.250
 alike-0.236
 ≡-0.233
aukee-0.232
TokenI
Token ID40
Feature activation-1.35e-02

Pos logit contributions
)].+1.501
 Direction+1.445
 alike+1.350
 ≡+1.313
aukee+1.310
Neg logit contributions
iris-1.708
thens-1.544
ements-1.511
agra-1.509
ffield-1.505
Token
Token ID447
Feature activation-4.45e-01

Pos logit contributions
iris+0.086
thens+0.079
ements+0.076
rs+0.074
agra+0.074
Neg logit contributions
)].-0.084
 Direction-0.081
 alike-0.077
 ≡-0.075
aukee-0.074
Token
Token ID247
Feature activation+2.45e-02

Pos logit contributions
iris+3.471
thens+3.201
rs+3.160
ements+3.114
agra+3.094
Neg logit contributions
)].-2.158
 Direction-2.036
 alike-1.883
 ≡-1.851
aukee-1.837
Tokenve
Token ID303
Feature activation-3.64e-01

Pos logit contributions
)].+0.143
 Direction+0.136
 alike+0.129
 ≡+0.126
aukee+0.125
Neg logit contributions
iris-0.165
thens-0.151
ements-0.146
agra-0.145
rs-0.144
Token h
Token ID289
Feature activation-1.18e-01

Pos logit contributions
iris+2.374
thens+2.171
rs+2.102
ements+2.076
agra+2.059
Neg logit contributions
)].-2.253
 Direction-2.156
 alike-2.025
 ≡-1.987
aukee-1.972
Tokeniked
Token ID17951
Feature activation+1.75e-01

Pos logit contributions
iris+0.846
thens+0.784
rs+0.759
ements+0.757
download+0.746
Neg logit contributions
)].-0.637
 Direction-0.606
 alike-0.571
 ≡-0.562
aukee-0.560
Token in
Token ID287
Feature activation+2.54e-01

Pos logit contributions
)].+1.001
 Direction+0.969
 alike+0.924
 ≡+0.896
aukee+0.884
Neg logit contributions
iris-1.194
thens-1.101
ements-1.053
rs-1.052
rent-1.045
Token rain
Token ID6290
Feature activation-2.71e-01

Pos logit contributions
)].+1.442
 Direction+1.377
 alike+1.297
 ≡+1.278
aukee+1.269
Neg logit contributions
iris-1.774
thens-1.612
rs-1.562
ements-1.528
download-1.527
Tokenfore
Token ID754
Feature activation+2.90e-01

Pos logit contributions
iris+1.462
thens+1.314
rs+1.275
ements+1.249
agra+1.238
Neg logit contributions
)].-1.939
 Direction-1.880
 alike-1.793
aukee-1.764
 ≡-1.762
TokenSing
Token ID29974
Feature activation-2.46e-01

Pos logit contributions
iris+2.739
thens+2.508
rs+2.464
ements+2.428
agra+2.412
Neg logit contributions
)].-2.242
 Direction-2.144
 alike-2.004
 ≡-1.968
aukee-1.957
Token,
Token ID11
Feature activation+1.43e-02

Pos logit contributions
iris+1.343
thens+1.209
rs+1.200
ements+1.157
download+1.141
Neg logit contributions
)].-1.738
 Direction-1.646
aukee-1.580
 alike-1.571
 ≡-1.561
Token Bird
Token ID14506
Feature activation+1.39e-01

Pos logit contributions
)].+0.074
 Direction+0.069
 alike+0.064
aukee+0.064
 ≡+0.064
Neg logit contributions
iris-0.106
thens-0.098
rs-0.098
download-0.095
agra-0.095
Token of
Token ID286
Feature activation-1.34e-01

Pos logit contributions
)].+0.919
 Direction+0.871
aukee+0.824
 alike+0.821
 ≡+0.814
Neg logit contributions
iris-0.830
thens-0.754
rs-0.751
ements-0.726
download-0.724
Token Pre
Token ID3771
Feature activation+3.63e-01

Pos logit contributions
iris+0.952
thens+0.866
ements+0.838
rs+0.833
agra+0.829
Neg logit contributions
)].-0.766
 Direction-0.742
 alike-0.680
 ≡-0.661
aukee-0.642
Tokeny
Token ID88
Feature activation-3.46e-01

Pos logit contributions
)].+2.354
 Direction+2.168
 alike+2.066
 mL+2.004
aukee+1.968
Neg logit contributions
iris-2.171
thens-2.023
 Wolver-1.926
ements-1.924
agra-1.904
Token have
Token ID423
Feature activation+1.82e-01

Pos logit contributions
iris+2.098
thens+1.892
rs+1.858
ements+1.821
agra+1.807
Neg logit contributions
)].-2.271
 Direction-2.185
 alike-2.062
 ≡-2.039
aukee-2.023
Token crafted
Token ID18025
Feature activation+1.17e-01

Pos logit contributions
)].+1.119
 Direction+1.071
 alike+1.013
 ≡+0.990
aukee+0.958
Neg logit contributions
iris-1.182
thens-1.094
rs-1.039
agra-1.033
download-1.032
Token a
Token ID257
Feature activation-1.55e-01

Pos logit contributions
)].+0.660
 Direction+0.632
 alike+0.595
 ≡+0.582
aukee+0.565
Neg logit contributions
iris-0.824
thens-0.760
rs-0.733
agra-0.724
download-0.721
Token unique
Token ID3748
Feature activation+1.56e-01

Pos logit contributions
iris+1.039
thens+0.954
rs+0.928
ements+0.909
agra+0.905
Neg logit contributions
)].-0.921
 Direction-0.893
 alike-0.836
 ≡-0.817
aukee-0.806
Token debut
Token ID8886
Feature activation+7.48e-01

Pos logit contributions
)].+0.939
 Direction+0.921
 alike+0.849
 ≡+0.822
aukee+0.814
Neg logit contributions
iris-1.031
thens-0.959
rs-0.916
agra-0.907
ements-0.905
Token sunk
Token ID24790
Feature activation-3.38e-01

Pos logit contributions
iris+2.035
thens+1.869
rs+1.824
ements+1.798
agra+1.796
Neg logit contributions
)].-1.692
 Direction-1.625
 alike-1.517
 ≡-1.485
aukee-1.481
Tokenen
Token ID268
Feature activation+9.21e-02

Pos logit contributions
iris+1.754
thens+1.547
rs+1.508
agra+1.468
download+1.464
Neg logit contributions
)].-2.527
 Direction-2.429
 alike-2.317
 ≡-2.268
aukee-2.261
Token fireplace
Token ID44250
Feature activation-3.88e-02

Pos logit contributions
)].+0.630
 Direction+0.608
 alike+0.581
 ≡+0.566
aukee+0.562
Neg logit contributions
iris-0.529
thens-0.479
rs-0.458
agra-0.456
ements-0.451
Token.
Token ID13
Feature activation-1.30e-01

Pos logit contributions
iris+0.259
thens+0.239
agra+0.232
rs+0.231
ements+0.230
Neg logit contributions
)].-0.227
 Direction-0.218
 alike-0.209
 ≡-0.201
aukee-0.200
Token After
Token ID2293
Feature activation+6.32e-02

Pos logit contributions
iris+0.839
thens+0.764
rs+0.740
agra+0.736
ements+0.734
Neg logit contributions
)].-0.806
 Direction-0.793
 alike-0.727
 ≡-0.722
aukee-0.701
Token that
Token ID326
Feature activation-3.61e-01

Pos logit contributions
)].+0.360
 Direction+0.344
 alike+0.326
 ≡+0.314
aukee+0.313
Neg logit contributions
iris-0.431
thens-0.396
ements-0.390
rs-0.388
agra-0.383
Token,
Token ID11
Feature activation-1.12e-01

Pos logit contributions
iris+1.891
thens+1.641
rs+1.630
ements+1.566
download+1.547
Neg logit contributions
)].-2.642
 Direction-2.533
aukee-2.422
 ≡-2.404
 alike-2.385
Token he
Token ID339
Feature activation+3.90e-01

Pos logit contributions
iris+0.735
thens+0.675
rs+0.664
ements+0.654
agra+0.651
Neg logit contributions
)].-0.673
 Direction-0.648
 alike-0.614
 ≡-0.595
aukee-0.583
Token turned
Token ID2900
Feature activation-9.91e-02

Pos logit contributions
)].+2.222
 Direction+2.062
 alike+2.060
 ≡+1.916
 mL+1.863
Neg logit contributions
iris-2.609
thens-2.438
agra-2.364
download-2.357
rs-2.328
Token his
Token ID465
Feature activation-5.14e-02

Pos logit contributions
iris+0.644
thens+0.594
rs+0.580
ements+0.575
download+0.571
Neg logit contributions
)].-0.602
 Direction-0.578
 alike-0.548
 ≡-0.531
aukee-0.518
Token back
Token ID736
Feature activation+5.49e-01

Pos logit contributions
iris+0.350
thens+0.327
rs+0.315
ements+0.311
download+0.310
Neg logit contributions
)].-0.296
 Direction-0.287
 alike-0.269
 ≡-0.259
aukee-0.255
Tokenrix
Token ID8609
Feature activation+1.28e-01

Pos logit contributions
)].+2.155
 Direction+2.060
 alike+1.980
opez+1.935
aukee+1.918
Neg logit contributions
iris-1.842
thens-1.712
ffield-1.658
 Wolver-1.650
download-1.608
Token bones
Token ID11945
Feature activation+1.21e-01

Pos logit contributions
)].+0.766
 Direction+0.737
 alike+0.714
 ≡+0.677
aukee+0.674
Neg logit contributions
iris-0.839
thens-0.775
download-0.734
ements-0.734
agra-0.731
Token to
Token ID284
Feature activation-1.58e-01

Pos logit contributions
)].+0.700
 Direction+0.659
 alike+0.658
 ≡+0.609
aukee+0.602
Neg logit contributions
iris-0.824
thens-0.758
agra-0.721
download-0.719
ements-0.719
Token the
Token ID262
Feature activation-1.88e-01

Pos logit contributions
iris+1.074
thens+0.982
rs+0.945
ements+0.941
download+0.938
Neg logit contributions
)].-0.925
 Direction-0.889
 alike-0.845
aukee-0.819
 ≡-0.817
Token newly
Token ID8308
Feature activation-3.32e-01

Pos logit contributions
iris+1.315
thens+1.198
rs+1.154
download+1.150
agra+1.145
Neg logit contributions
)].-1.071
 Direction-1.036
 alike-0.982
 ≡-0.941
aukee-0.937
Token discovered
Token ID5071
Feature activation-1.16e-01

Pos logit contributions
iris+2.367
thens+2.167
rs+2.108
ements+2.081
download+2.076
Neg logit contributions
)].-1.836
 Direction-1.749
 alike-1.660
aukee-1.607
 ≡-1.592
Token million
Token ID1510
Feature activation+1.83e-01

Pos logit contributions
iris+0.752
thens+0.680
agra+0.652
rs+0.652
download+0.652
Neg logit contributions
)].-0.717
 Direction-0.692
 alike-0.657
 ≡-0.633
aukee-0.631
Token year
Token ID614
Feature activation-6.31e-01

Pos logit contributions
)].+1.241
 Direction+1.195
 alike+1.130
 ≡+1.117
aukee+1.094
Neg logit contributions
iris-1.067
thens-0.958
rs-0.918
agra-0.905
download-0.905
Token old
Token ID1468
Feature activation-9.26e-02

Pos logit contributions
iris+3.540
rs+3.170
thens+3.166
ements+3.067
agra+3.021
Neg logit contributions
)].-4.339
 Direction-4.189
aukee-3.940
 alike-3.939
 ≡-3.921
Token specimens
Token ID27569
Feature activation-9.32e-03

Pos logit contributions
iris+0.609
thens+0.553
rs+0.532
agra+0.529
download+0.529
Neg logit contributions
)].-0.564
 Direction-0.543
 alike-0.515
 ≡-0.498
aukee-0.495
Token.
Token ID13
Feature activation+5.00e-02

Pos logit contributions
iris+0.064
thens+0.059
agra+0.056
ements+0.056
rs+0.056
Neg logit contributions
)].-0.054
 Direction-0.051
 alike-0.050
 ≡-0.047
aukee-0.046
Token09
Token ID2931
Feature activation-1.68e-02

Pos logit contributions
iris+1.550
thens+1.440
agra+1.389
ements+1.364
rs+1.354
Neg logit contributions
)].-1.187
 Direction-1.160
 alike-1.083
 ≡-1.034
aukee-1.023
Token campaign
Token ID1923
Feature activation-5.67e-02

Pos logit contributions
iris+0.118
thens+0.107
rs+0.104
agra+0.103
download+0.103
Neg logit contributions
)].-0.095
 Direction-0.091
 alike-0.086
 ≡-0.084
aukee-0.082
Token,
Token ID11
Feature activation-9.52e-02

Pos logit contributions
iris+0.346
thens+0.310
rs+0.300
ements+0.299
agra+0.299
Neg logit contributions
)].-0.371
 Direction-0.360
 alike-0.342
 ≡-0.335
aukee-0.330
Token is
Token ID318
Feature activation-2.07e-01

Pos logit contributions
iris+0.627
thens+0.575
agra+0.547
rs+0.545
download+0.543
Neg logit contributions
)].-0.574
 Direction-0.558
 alike-0.529
 ≡-0.516
 mL-0.494
Token being
Token ID852
Feature activation-2.33e-01

Pos logit contributions
iris+1.535
thens+1.404
rs+1.352
download+1.352
ements+1.348
Neg logit contributions
)].-1.100
 Direction-1.065
 alike-1.008
 ≡-0.957
aukee-0.952
Token re
Token ID302
Feature activation-1.03e+00

Pos logit contributions
iris+1.601
thens+1.484
download+1.417
rs+1.413
ements+1.403
Neg logit contributions
)].-1.345
 Direction-1.284
 alike-1.230
 ≡-1.181
aukee-1.180
Token-
Token ID12
Feature activation-1.55e-01

Pos logit contributions
iris+5.002
rs+4.363
thens+4.289
download+4.154
agra+4.109
Neg logit contributions
)].-7.802
 Direction-7.525
 alike-7.212
 ≡-7.178
aukee-7.101
Tokenestablished
Token ID27718
Feature activation-1.93e-01

Pos logit contributions
iris+0.994
thens+0.924
ements+0.891
rs+0.888
agra+0.880
Neg logit contributions
)].-0.935
 Direction-0.891
aukee-0.829
 alike-0.829
 ≡-0.816
Token in
Token ID287
Feature activation+4.60e-04

Pos logit contributions
iris+1.320
thens+1.210
agra+1.159
rs+1.154
ements+1.151
Neg logit contributions
)].-1.137
 Direction-1.094
 alike-1.040
 ≡-0.993
aukee-0.982
Token order
Token ID1502
Feature activation-8.50e-01

Pos logit contributions
)].+0.003
 Direction+0.003
 alike+0.002
 ≡+0.002
aukee+0.002
Neg logit contributions
iris-0.003
thens-0.003
rs-0.003
agra-0.003
download-0.003
Token to
Token ID284
Feature activation-2.64e-01

Pos logit contributions
iris+3.709
rs+3.305
ements+3.302
agra+3.180
 Wolver+3.141
Neg logit contributions
)].-6.343
 Direction-6.178
 ≡-6.040
aukee-5.808
 alike-5.800
Tokenemb
Token ID24419
Feature activation-2.53e-01

Pos logit contributions
iris+0.122
thens+0.105
rs+0.104
 Wolver+0.103
ements+0.101
Neg logit contributions
)].-0.204
 Direction-0.194
 alike-0.187
aukee-0.185
 ≡-0.184
Tokenry
Token ID563
Feature activation-2.80e-01

Pos logit contributions
iris+1.361
thens+1.243
 Wolver+1.185
ements+1.167
rs+1.164
Neg logit contributions
)].-1.839
 Direction-1.715
 alike-1.650
aukee-1.617
 ≡-1.604
Tokeno
Token ID78
Feature activation-8.94e-01

Pos logit contributions
iris+1.616
thens+1.441
rs+1.389
 Wolver+1.383
ements+1.374
Neg logit contributions
)].-1.958
 Direction-1.845
 alike-1.749
aukee-1.720
opez-1.703
Token<|endoftext|>
Token ID50256
Feature activation+2.50e-01

Pos logit contributions
iris+3.719
rs+3.304
download+3.220
thens+3.159
agra+3.082
Neg logit contributions
)].-7.145
 Direction-6.746
aukee-6.585
 alike-6.487
 ≡-6.456
TokenOver
Token ID5886
Feature activation-3.69e-01

Pos logit contributions
)].+1.448
 Direction+1.358
 alike+1.262
aukee+1.249
 ≡+1.230
Neg logit contributions
iris-1.666
thens-1.538
 Wolver-1.488
ements-1.487
agra-1.469
Token the
Token ID262
Feature activation-9.21e-01

Pos logit contributions
iris+2.256
thens+2.042
rs+2.007
ements+1.968
agra+1.951
Neg logit contributions
)].-2.386
 Direction-2.298
 alike-2.165
 ≡-2.137
aukee-2.125
Token weekend
Token ID5041
Feature activation+4.23e-01

Pos logit contributions
iris+6.346
thens+5.776
rs+5.749
 Wolver+5.662
ements+5.630
Neg logit contributions
)].-4.993
 Direction-4.686
aukee-4.430
 alike-4.420
 ≡-4.411
Token Fairfax
Token ID33514
Feature activation-4.67e-02

Pos logit contributions
)].+2.614
 Direction+2.533
 alike+2.398
 ≡+2.328
aukee+2.311
Neg logit contributions
iris-2.678
thens-2.434
rs-2.342
agra-2.335
ements-2.315
Token media
Token ID2056
Feature activation+1.33e-01

Pos logit contributions
iris+0.424
thens+0.398
rs+0.391
ements+0.386
agra+0.386
Neg logit contributions
)].-0.165
 Direction-0.154
 alike-0.137
 ≡-0.133
aukee-0.132
Token reported
Token ID2098
Feature activation+3.96e-01

Pos logit contributions
)].+0.731
 Direction+0.701
 alike+0.665
 ≡+0.635
aukee+0.628
Neg logit contributions
iris-0.937
thens-0.875
agra-0.837
rs-0.834
download-0.828
Token that
Token ID326
Feature activation+3.92e-01

Pos logit contributions
)].+2.170
 Direction+2.131
 alike+1.985
 ≡+1.922
aukee+1.909
Neg logit contributions
iris-2.731
thens-2.508
rs-2.425
agra-2.415
ements-2.414
Token She
Token ID1375
Feature activation-6.60e-02

Pos logit contributions
iris+3.115
thens+2.812
rs+2.761
ements+2.712
agra+2.690
Neg logit contributions
)].-3.201
 Direction-3.098
 alike-2.902
 ≡-2.867
aukee-2.849
Token pulls
Token ID16194
Feature activation+7.20e-02

Pos logit contributions
iris+0.389
thens+0.345
download+0.339
ements+0.335
rs+0.335
Neg logit contributions
)].-0.435
 Direction-0.417
 alike-0.408
 ≡-0.389
 mL-0.387
Token away
Token ID1497
Feature activation-5.60e-01

Pos logit contributions
)].+0.366
 Direction+0.350
 alike+0.333
 ≡+0.311
aukee+0.310
Neg logit contributions
iris-0.537
thens-0.489
rs-0.484
ements-0.482
agra-0.480
Token and
Token ID290
Feature activation-4.41e-01

Pos logit contributions
iris+3.502
thens+3.168
rs+3.118
ements+3.056
agra+3.031
Neg logit contributions
)].-3.551
 Direction-3.406
 alike-3.218
 ≡-3.166
aukee-3.158
Token I
Token ID314
Feature activation-5.47e-01

Pos logit contributions
iris+2.931
thens+2.636
rs+2.596
ements+2.583
download+2.576
Neg logit contributions
)].-2.649
 Direction-2.555
 alike-2.422
 ≡-2.339
aukee-2.301
Token immediately
Token ID3393
Feature activation-9.43e-01

Pos logit contributions
iris+3.503
thens+3.189
rs+3.092
ements+3.086
download+3.044
Neg logit contributions
)].-3.415
 Direction-3.251
 alike-3.117
 ≡-3.026
aukee-2.994
Token miss
Token ID2051
Feature activation+1.14e-01

Pos logit contributions
iris+5.939
thens+5.373
rs+5.349
ements+5.192
agra+5.167
Neg logit contributions
)].-5.835
 Direction-5.612
 alike-5.231
aukee-5.212
 ≡-5.210
Token the
Token ID262
Feature activation+1.53e-01

Pos logit contributions
)].+0.693
 Direction+0.674
 alike+0.637
aukee+0.621
 ≡+0.616
Neg logit contributions
iris-0.737
thens-0.668
rs-0.647
agra-0.640
ements-0.640
Token contact
Token ID2800
Feature activation-1.86e-01

Pos logit contributions
)].+0.924
 Direction+0.892
aukee+0.832
 alike+0.829
 ≡+0.806
Neg logit contributions
iris-0.999
thens-0.902
rs-0.873
ements-0.870
agra-0.858
Token of
Token ID286
Feature activation+4.14e-01

Pos logit contributions
iris+1.206
thens+1.096
rs+1.064
ements+1.053
agra+1.049
Neg logit contributions
)].-1.135
 Direction-1.095
 alike-1.042
 ≡-1.009
aukee-1.005
Token her
Token ID607
Feature activation+5.80e-01

Pos logit contributions
)].+2.295
 Direction+2.277
 alike+2.112
aukee+2.006
 mL+2.005
Neg logit contributions
iris-2.817
thens-2.540
ements-2.511
rs-2.505
download-2.453
Token the
Token ID262
Feature activation+9.69e-02

Pos logit contributions
)].+0.667
 Direction+0.645
 alike+0.619
aukee+0.583
 ≡+0.582
Neg logit contributions
iris-0.803
thens-0.739
ements-0.703
rs-0.703
download-0.693
Token truth
Token ID3872
Feature activation-7.49e-01

Pos logit contributions
)].+0.563
 Direction+0.540
 alike+0.516
 ≡+0.493
 interviewer+0.483
Neg logit contributions
iris-0.654
thens-0.603
rs-0.577
ements-0.576
download-0.570
Token prevail
Token ID28615
Feature activation-1.48e-01

Pos logit contributions
iris+4.522
thens+4.061
rs+4.046
ements+3.929
agra+3.902
Neg logit contributions
)].-4.861
 Direction-4.677
 alike-4.378
 ≡-4.371
aukee-4.347
Token.
Token ID13
Feature activation-4.99e-02

Pos logit contributions
iris+0.945
thens+0.866
rs+0.836
ements+0.828
agra+0.828
Neg logit contributions
)].-0.911
 Direction-0.871
 alike-0.840
aukee-0.813
 ≡-0.811
Token OK
Token ID7477
Feature activation-2.76e-01

Pos logit contributions
iris+0.331
thens+0.304
ements+0.293
agra+0.291
ffield+0.291
Neg logit contributions
)].-0.300
 Direction-0.297
 alike-0.273
 ≡-0.266
 mL-0.259
Token,
Token ID11
Feature activation-9.73e-01

Pos logit contributions
iris+1.514
thens+1.353
rs+1.346
ements+1.295
download+1.289
Neg logit contributions
)].-1.959
 Direction-1.870
 ≡-1.777
aukee-1.768
 alike-1.764
Token you
Token ID345
Feature activation+3.59e-02

Pos logit contributions
iris+6.263
thens+5.709
rs+5.654
ements+5.491
download+5.462
Neg logit contributions
)].-5.904
 Direction-5.604
aukee-5.265
 ≡-5.245
 alike-5.241
Token can
Token ID460
Feature activation+7.77e-02

Pos logit contributions
)].+0.191
 Direction+0.183
 alike+0.174
 ≡+0.165
aukee+0.162
Neg logit contributions
iris-0.258
thens-0.241
agra-0.234
ements-0.232
rs-0.230
Token stop
Token ID2245
Feature activation+4.27e-01

Pos logit contributions
)].+0.465
 Direction+0.435
 alike+0.428
 ≡+0.401
 mL+0.397
Neg logit contributions
iris-0.518
thens-0.475
ements-0.462
agra-0.457
rs-0.457
Token laughing
Token ID14376
Feature activation-3.76e-02

Pos logit contributions
)].+2.511
 Direction+2.398
 alike+2.391
 mL+2.184
 interviewer+2.160
Neg logit contributions
iris-2.785
thens-2.509
agra-2.443
ements-2.441
rs-2.412
Token at
Token ID379
Feature activation-2.16e-01

Pos logit contributions
iris+0.244
thens+0.220
rs+0.213
agra+0.213
download+0.213
Neg logit contributions
)].-0.228
 Direction-0.219
 alike-0.211
 ≡-0.204
aukee-0.201
Token-
Token ID12
Feature activation+2.06e-01

Pos logit contributions
)].+2.740
 Direction+2.603
 alike+2.458
opez+2.273
 mL+2.269
Neg logit contributions
iris-2.635
thens-2.430
 Wolver-2.308
ffield-2.275
agra-2.260
Tokenindependence
Token ID39894
Feature activation+3.09e-01

Pos logit contributions
)].+0.944
 Direction+0.873
 alike+0.814
opez+0.750
aukee+0.732
Neg logit contributions
iris-1.581
 Wolver-1.513
thens-1.501
ffield-1.449
rs-1.434
Token flag
Token ID6056
Feature activation+6.01e-03

Pos logit contributions
)].+1.335
 Direction+1.295
 alike+1.228
 mL+1.078
 interviewer+1.065
Neg logit contributions
iris-2.550
thens-2.414
ements-2.324
agra-2.297
download-2.285
Token -
Token ID532
Feature activation-3.07e-01

Pos logit contributions
)].+0.042
 Direction+0.041
aukee+0.038
 alike+0.038
 ≡+0.038
Neg logit contributions
iris-0.033
thens-0.030
rs-0.029
ements-0.028
agra-0.028
Tokenthe
Token ID1169
Feature activation-4.80e-01

Pos logit contributions
iris+2.000
thens+1.826
rs+1.820
ements+1.759
agra+1.758
Neg logit contributions
)].-1.817
 Direction-1.741
aukee-1.645
 alike-1.617
 ≡-1.613
Token est
Token ID1556
Feature activation-9.92e-01

Pos logit contributions
iris+2.669
thens+2.380
rs+2.362
ements+2.287
download+2.259
Neg logit contributions
)].-3.318
 Direction-3.156
aukee-3.033
 alike-3.024
 ≡-3.020
Tokenel
Token ID417
Feature activation+3.17e-01

Pos logit contributions
iris+3.147
thens+2.534
rs+2.489
ements+2.365
agra+2.318
Neg logit contributions
)].-9.281
 Direction-9.053
 alike-8.704
 ≡-8.654
aukee-8.621
Tokenada
Token ID4763
Feature activation+7.67e-02

Pos logit contributions
)].+1.408
 Direction+1.281
 alike+1.156
opez+1.133
aukee+1.127
Neg logit contributions
iris-2.506
thens-2.385
rs-2.346
 Wolver-2.339
ffield-2.294
Token -
Token ID532
Feature activation-2.49e-01

Pos logit contributions
)].+0.515
 Direction+0.501
 alike+0.473
 ≡+0.465
aukee+0.461
Neg logit contributions
iris-0.449
thens-0.404
rs-0.394
ements-0.391
agra-0.385
Token to
Token ID284
Feature activation-8.37e-01

Pos logit contributions
iris+1.438
thens+1.290
rs+1.270
ements+1.239
agra+1.228
Neg logit contributions
)].-1.702
 Direction-1.637
 alike-1.549
 ≡-1.531
aukee-1.529
Token the
Token ID262
Feature activation+6.52e-02

Pos logit contributions
iris+4.024
thens+3.573
rs+3.464
ements+3.427
 Wolver+3.338
Neg logit contributions
)].-6.216
 Direction-5.810
aukee-5.737
 ≡-5.655
 alike-5.647