TOP ACTIVATIONS
MAX = 2.194

QB Tom Brady Contract : A two - year
DE Cameron Wake Contract : A two - year
Bil al Powell Contract : A three - year
$ font 2 , $ brush 2 , $ rect 2
only to manage their medi ocr ity , perhaps with small
Inc ogn ito Contract : A three - year
stack frame | void * arg | + -------------------- --+ |
' s stack frame | void * arg | + --------------------
manually with the sl s invoke command . If the function
path to images directory ") args = v ars ( ap
, the relationship is still going strong at Aut - O
Department of Health advised the NHS to move away from the
. Students , workers , employ ees seemed to have risen
PNG larger image larg er image T IFF original
Bon net ( Equ ipe Cycl iste FD J - Big
s frame | void * arg | + -------------------- --+ |
fractured collar bone in Foot sc ray s win
16 , 1981 Birth place : Long Beach ,
a major Oscars cont ender , potentially picking up
. list _ images ( args [ " images " ]

MIN ACTIVATIONS
MIN = -2.124

a recent article for World Net Daily , for example ,
That is a far cry from gu arant
States 6 12 Posts # 17 Good to hear .
States 3 472 Posts # 11 On September 20 2013 07
States 228 65 Posts # 13 On September 20 2013 07
T 235 21 Posts # 18 Good stuff ! Great to
Ahmed of Mars abit National Park in Kenya in the early
being there whether I set foot in it or not ."
States 5 667 Posts # 12 I 'm glad there aren
worth a hell of a lot more than what he
minions . IGN : Sure . Before we move
States 21 50 Posts # 16 Glad to hear everything worked
She doesn 't want to alien ate the Europeans when she
: And it varies a bit along the way .
States 10 53 Posts # 15 honestly if we just had
on Twitter @ T ara Per kins . The
policy . US AT OD AY OP IN ION
] WHERE path LIKE ' % project . cl j '
the low orbit of the space station as safely as possible
but can you talk a little bit about what you responded

INTERVAL 1.114 - 2.194
CONTAINS 0.271%

Conference in H ilo , Hawaii June 10 . Rep .
part , the government blames what it calls an
Passage M ael strom Pulse vs Paradise M antle
Never in my wild est imagination , with the political capital
spread the word about this campaign to your friends , family

INTERVAL 0.035 - 1.114
CONTAINS 45.609%

/ pop / pop / ret , which we 'll call
and 1949 , the Rams and their former city existed in
as well as all Mercenary units and Mercenary character sol os
into my bag , I take out my letter from her
ve got in store for well , the Store !

INTERVAL -1.044 - 0.035
CONTAINS 53.665%

seasons ( and produced one movie ), in which Sed aris
them . " You would tend to go off the circuit
U . S . President Donald Trump waves as
The reason : it requires the server to care about when
into a perennial powerhouse . I m sure I

INTERVAL -2.124 - -1.044
CONTAINS 0.454%

Facebook Twitter Pinterest Lake Ur mia Photograph : K ave
oe ing in Yosemite National Park under frost ed sequ o
hour edition network 's World News Tonight will be dedicated to
vandal ized and set on fire in an organized manner
and he did not fight back . Jal en was doing
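
Note on the INTERVAL boundaries above: they match four equal-width bins between MIN = -2.124 and MAX = 2.194, and the CONTAINS lines would then be the share of activations landing in each bin. The Python sketch below checks that reading. It is an assumption inferred from the numbers, not the dashboard's documented method, and the function names equal_width_edges and bin_shares are hypothetical.

```python
def equal_width_edges(lo, hi, n_bins):
    """Return n_bins + 1 evenly spaced bin edges from lo to hi."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]


def bin_shares(values, edges):
    """Percentage of values in each [edge_i, edge_{i+1}) bin.

    The last bin also includes values equal to the final edge.
    Values outside the edges fall in no bin but still count toward the total.
    """
    counts = [0] * (len(edges) - 1)
    for v in values:
        for i in range(len(counts)):
            is_last = i == len(counts) - 1
            if edges[i] <= v < edges[i + 1] or (is_last and v == edges[-1]):
                counts[i] += 1
                break
    total = len(values)
    if total == 0:
        return [0.0] * len(counts)
    return [100.0 * c / total for c in counts]


# MIN and MAX are taken from the dump; 4 bins is the assumed bin count.
edges = equal_width_edges(-2.124, 2.194, 4)
print([round(e, 3) for e in edges])
```

Under this reading the interior edges land within 0.001 of the listed boundaries -1.044, 0.035, and 1.114. The four CONTAINS percentages sum to 99.999, which fits bins that together cover the whole range.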
Token QB
Token ID16135
Feature activation-1.70e-01

Pos logit contributions
 FAR+0.833
senal+0.776
 Ig+0.757
atican+0.738
IOR+0.706
Neg logit contributions
hire-0.768
ortun-0.689
icked-0.675
image-0.667
anced-0.622
Token Tom
Token ID4186
Feature activation+7.23e-02

Pos logit contributions
 FAR+0.649
senal+0.635
 Ig+0.595
atican+0.591
lishes+0.558
Neg logit contributions
hire-0.590
ortun-0.525
image-0.519
icked-0.513
anced-0.474
Token Brady
Token ID15260
Feature activation-5.55e-01

Pos logit contributions
hire+0.250
ortun+0.222
image+0.218
icked+0.215
anced+0.200
Neg logit contributions
 FAR-0.277
senal-0.266
 Ig-0.254
atican-0.251
foundland-0.240
Token\n
Token ID198
Feature activation+2.60e-01

Pos logit contributions
 FAR+2.253
senal+2.225
atican+2.079
 Ig+2.067
foundland+1.977
Neg logit contributions
hire-1.786
ortun-1.584
image-1.557
icked-1.545
anced-1.414
Token\n
Token ID198
Feature activation+3.85e-01

Pos logit contributions
hire+0.933
ortun+0.848
icked+0.834
image+0.798
anced+0.770
Neg logit contributions
 FAR-0.964
 Ig-0.882
atican-0.866
IOR-0.821
 SAT-0.803
TokenContract
Token ID45845
Feature activation+2.19e+00

Pos logit contributions
hire+0.873
ortun+0.720
image+0.719
icked+0.693
anced+0.596
Neg logit contributions
 FAR-1.943
senal-1.939
atican-1.820
 Ig-1.815
foundland-1.757
Token:
Token ID25
Feature activation+2.59e-01

Pos logit contributions
hire+5.675
image+4.874
ortun+4.699
icked+4.516
anced+4.004
Neg logit contributions
senal-10.644
 FAR-9.865
 practition-9.395
ikuman-9.337
atican-9.321
Token A
Token ID317
Feature activation+3.42e-01

Pos logit contributions
hire+0.940
ortun+0.850
image+0.832
icked+0.827
anced+0.769
Neg logit contributions
 FAR-0.952
senal-0.925
 Ig-0.868
atican-0.863
foundland-0.814
Token two
Token ID734
Feature activation+2.01e-01

Pos logit contributions
hire+1.209
ortun+1.086
image+1.069
icked+1.058
anced+0.980
Neg logit contributions
 FAR-1.282
senal-1.258
 Ig-1.170
atican-1.168
foundland-1.105
Token-
Token ID12
Feature activation+1.34e-01

Pos logit contributions
hire+0.661
ortun+0.599
icked+0.585
image+0.572
anced+0.540
Neg logit contributions
 FAR-0.795
 Ig-0.735
atican-0.713
senal-0.693
IOR-0.684
Tokenyear
Token ID1941
Feature activation+2.02e-01

Pos logit contributions
hire+0.426
ortun+0.391
icked+0.385
image+0.372
anced+0.360
Neg logit contributions
 FAR-0.520
 Ig-0.477
atican-0.467
senal-0.464
IOR-0.454
Token DE
Token ID5550
Feature activation-3.97e-01

Pos logit contributions
 FAR+0.031
senal+0.030
 Ig+0.028
atican+0.028
IOR+0.026
Neg logit contributions
hire-0.031
ortun-0.028
icked-0.028
image-0.028
anced-0.026
Token Cameron
Token ID10821
Feature activation+1.91e-01

Pos logit contributions
 FAR+1.392
senal+1.350
 Ig+1.271
atican+1.256
IOR+1.184
Neg logit contributions
hire-1.495
ortun-1.352
icked-1.328
image-1.326
anced-1.234
Token Wake
Token ID20441
Feature activation+5.04e-01

Pos logit contributions
hire+0.682
ortun+0.620
icked+0.605
image+0.600
anced+0.563
Neg logit contributions
 FAR-0.716
senal-0.684
 Ig-0.663
atican-0.651
foundland-0.607
Token\n
Token ID198
Feature activation+2.21e-01

Pos logit contributions
hire+1.547
image+1.348
ortun+1.347
icked+1.322
anced+1.200
Neg logit contributions
senal-2.118
 FAR-2.100
atican-1.969
 Ig-1.916
foundland-1.861
Token\n
Token ID198
Feature activation+5.56e-01

Pos logit contributions
hire+0.795
ortun+0.723
icked+0.712
image+0.679
anced+0.656
Neg logit contributions
 FAR-0.816
 Ig-0.748
atican-0.732
IOR-0.695
 SAT-0.680
TokenContract
Token ID45845
Feature activation+2.04e+00

Pos logit contributions
hire+1.220
image+1.001
ortun+0.984
icked+0.947
anced+0.809
Neg logit contributions
senal-2.857
 FAR-2.851
atican-2.669
 Ig-2.659
foundland-2.585
Token:
Token ID25
Feature activation+1.43e-01

Pos logit contributions
hire+5.365
image+4.599
ortun+4.485
icked+4.309
anced+3.821
Neg logit contributions
senal-9.692
 FAR-9.145
atican-8.607
ikuman-8.556
 practition-8.513
Token A
Token ID317
Feature activation+3.25e-01

Pos logit contributions
hire+0.534
ortun+0.486
image+0.473
icked+0.470
anced+0.442
Neg logit contributions
 FAR-0.515
senal-0.490
 Ig-0.470
atican-0.461
IOR-0.435
Token two
Token ID734
Feature activation+2.14e-01

Pos logit contributions
hire+1.164
ortun+1.046
image+1.030
icked+1.018
anced+0.946
Neg logit contributions
 FAR-1.206
senal-1.182
 Ig-1.101
atican-1.096
foundland-1.037
Token-
Token ID12
Feature activation+2.15e-01

Pos logit contributions
hire+0.701
ortun+0.630
icked+0.616
image+0.606
anced+0.571
Neg logit contributions
 FAR-0.849
 Ig-0.784
atican-0.763
senal-0.761
IOR-0.730
Tokenyear
Token ID1941
Feature activation+3.30e-01

Pos logit contributions
hire+0.621
ortun+0.539
image+0.532
icked+0.523
anced+0.468
Neg logit contributions
 FAR-0.953
senal-0.951
atican-0.892
 Ig-0.883
foundland-0.847
Token Bil
Token ID24207
Feature activation-1.39e-01

Pos logit contributions
 FAR+0.746
senal+0.729
 Ig+0.677
atican+0.674
foundland+0.634
Neg logit contributions
hire-0.816
ortun-0.737
image-0.725
icked-0.722
anced-0.671
Tokenal
Token ID282
Feature activation+8.41e-01

Pos logit contributions
 FAR+0.511
senal+0.501
 Ig+0.470
atican+0.468
foundland+0.441
Neg logit contributions
hire-0.500
ortun-0.449
image-0.442
icked-0.437
anced-0.405
Token Powell
Token ID19675
Feature activation-5.30e-01

Pos logit contributions
hire+3.385
ortun+3.103
image+3.077
icked+3.038
anced+2.842
Neg logit contributions
 FAR-2.703
senal-2.693
atican-2.417
 Ig-2.348
IOR-2.283
Token\n
Token ID198
Feature activation+2.48e-01

Pos logit contributions
 FAR+2.192
senal+2.168
atican+2.027
 Ig+2.000
foundland+1.934
Neg logit contributions
hire-1.676
ortun-1.479
image-1.457
icked-1.449
anced-1.325
Token\n
Token ID198
Feature activation+4.12e-01

Pos logit contributions
hire+0.891
ortun+0.810
icked+0.798
image+0.762
anced+0.736
Neg logit contributions
 FAR-0.918
 Ig-0.841
atican-0.824
IOR-0.783
 SAT-0.765
TokenContract
Token ID45845
Feature activation+2.02e+00

Pos logit contributions
hire+0.921
image+0.757
ortun+0.753
icked+0.724
anced+0.623
Neg logit contributions
senal-2.097
 FAR-2.095
atican-1.963
 Ig-1.957
foundland-1.899
Token:
Token ID25
Feature activation+1.50e-01

Pos logit contributions
hire+5.298
image+4.552
ortun+4.410
icked+4.244
anced+3.780
Neg logit contributions
senal-9.695
 FAR-9.080
atican-8.545
 practition-8.532
ikuman-8.518
Token A
Token ID317
Feature activation+2.25e-01

Pos logit contributions
hire+0.555
ortun+0.504
image+0.491
icked+0.488
anced+0.456
Neg logit contributions
 FAR-0.544
senal-0.522
 Ig-0.497
atican-0.489
IOR-0.461
Token three
Token ID1115
Feature activation+6.69e-01

Pos logit contributions
hire+0.813
ortun+0.732
image+0.720
icked+0.712
anced+0.662
Neg logit contributions
 FAR-0.825
senal-0.806
 Ig-0.752
atican-0.747
foundland-0.706
Token-
Token ID12
Feature activation+2.74e-01

Pos logit contributions
hire+2.113
ortun+1.887
icked+1.839
image+1.823
anced+1.687
Neg logit contributions
 FAR-2.757
senal-2.561
 Ig-2.549
atican-2.512
IOR-2.394
Tokenyear
Token ID1941
Feature activation+2.73e-01

Pos logit contributions
hire+0.869
ortun+0.790
icked+0.780
image+0.754
anced+0.724
Neg logit contributions
 FAR-1.086
 Ig-0.996
senal-0.990
atican-0.981
IOR-0.948
Token $
Token ID720
Feature activation+9.76e-01

Pos logit contributions
hire+1.663
ortun+1.523
icked+1.488
image+1.478
anced+1.398
Neg logit contributions
 FAR-1.237
senal-1.145
 Ig-1.102
atican-1.088
IOR-1.017
Tokenfont
Token ID10331
Feature activation+9.88e-01

Pos logit contributions
hire+3.515
image+3.204
ortun+3.106
icked+3.080
anced+2.794
Neg logit contributions
senal-3.631
 FAR-3.586
 Ig-3.264
atican-3.250
ikuman-3.107
Token2
Token ID17
Feature activation+5.99e-01

Pos logit contributions
hire+3.230
ortun+2.873
image+2.811
icked+2.801
anced+2.567
Neg logit contributions
 FAR-3.960
senal-3.880
atican-3.648
 Ig-3.634
foundland-3.453
Token,
Token ID11
Feature activation+4.97e-01

Pos logit contributions
hire+1.716
image+1.499
ortun+1.456
icked+1.428
anced+1.294
Neg logit contributions
senal-2.641
 FAR-2.608
 Ig-2.428
atican-2.422
foundland-2.348
Token $
Token ID720
Feature activation+8.90e-01

Pos logit contributions
hire+2.081
ortun+1.910
icked+1.866
image+1.849
anced+1.756
Neg logit contributions
 FAR-1.559
senal-1.442
 Ig-1.381
atican-1.379
IOR-1.286
Tokenbrush
Token ID32680
Feature activation+2.00e+00

Pos logit contributions
hire+2.094
image+1.773
ortun+1.717
icked+1.702
anced+1.447
Neg logit contributions
 FAR-4.405
senal-4.395
 Ig-4.112
atican-4.099
ikuman-3.949
Token2
Token ID17
Feature activation+9.11e-01

Pos logit contributions
hire+5.702
image+5.105
ortun+4.900
icked+4.714
anced+4.261
Neg logit contributions
senal-9.162
 FAR-8.726
 practition-8.120
ikuman-8.085
 Ig-8.079
Token,
Token ID11
Feature activation+5.78e-01

Pos logit contributions
hire+2.529
image+2.195
ortun+2.133
icked+2.081
anced+1.888
Neg logit contributions
senal-4.050
 FAR-3.972
atican-3.718
 Ig-3.705
foundland-3.620
Token $
Token ID720
Feature activation+9.23e-01

Pos logit contributions
hire+2.430
ortun+2.229
icked+2.178
image+2.162
anced+2.048
Neg logit contributions
 FAR-1.807
senal-1.662
 Ig-1.605
atican-1.593
IOR-1.487
Tokenrect
Token ID2554
Feature activation+1.29e+00

Pos logit contributions
hire+3.054
image+2.778
ortun+2.709
icked+2.639
anced+2.411
Neg logit contributions
 FAR-3.668
senal-3.655
 Ig-3.381
atican-3.351
ikuman-3.202
Token2
Token ID17
Feature activation+4.59e-01

Pos logit contributions
hire+4.155
ortun+3.689
image+3.629
icked+3.591
anced+3.288
Neg logit contributions
 FAR-5.217
senal-5.167
atican-4.807
 Ig-4.793
foundland-4.570
Token only
Token ID691
Feature activation+8.73e-01

Pos logit contributions
hire+0.721
ortun+0.658
image+0.649
icked+0.645
anced+0.603
Neg logit contributions
 FAR-0.577
senal-0.558
 Ig-0.518
atican-0.514
foundland-0.478
Token to
Token ID284
Feature activation+2.23e-01

Pos logit contributions
hire+3.554
ortun+3.237
image+3.192
icked+3.173
anced+2.966
Neg logit contributions
 FAR-2.804
senal-2.753
atican-2.526
 Ig-2.516
foundland-2.360
Token manage
Token ID6687
Feature activation+4.69e-01

Pos logit contributions
hire+0.863
ortun+0.789
image+0.781
icked+0.771
anced+0.722
Neg logit contributions
 FAR-0.756
senal-0.728
 Ig-0.680
atican-0.680
IOR-0.630
Token their
Token ID511
Feature activation-1.05e-01

Pos logit contributions
hire+1.715
ortun+1.547
image+1.519
icked+1.509
anced+1.400
Neg logit contributions
 FAR-1.689
senal-1.673
atican-1.550
 Ig-1.537
foundland-1.463
Token medi
Token ID16957
Feature activation+6.24e-01

Pos logit contributions
 FAR+0.359
senal+0.345
 Ig+0.324
atican+0.319
IOR+0.298
Neg logit contributions
hire-0.408
ortun-0.368
icked-0.364
image-0.363
anced-0.340
Tokenocr
Token ID1696
Feature activation+2.00e+00

Pos logit contributions
hire+1.821
ortun+1.597
image+1.558
icked+1.553
anced+1.409
Neg logit contributions
 FAR-2.740
senal-2.706
atican-2.523
 Ig-2.522
foundland-2.421
Tokenity
Token ID414
Feature activation+1.92e-01

Pos logit contributions
hire+2.794
ortun+2.146
image+2.046
icked+1.992
anced+1.479
Neg logit contributions
senal-11.825
 FAR-11.684
 Ig-10.989
atican-10.986
ikuman-10.708
Token,
Token ID11
Feature activation+1.78e-01

Pos logit contributions
hire+0.699
ortun+0.631
image+0.620
icked+0.615
anced+0.570
Neg logit contributions
 FAR-0.702
senal-0.681
 Ig-0.638
atican-0.638
foundland-0.600
Token perhaps
Token ID3737
Feature activation+1.91e-01

Pos logit contributions
hire+0.694
ortun+0.630
image+0.621
icked+0.617
anced+0.575
Neg logit contributions
 FAR-0.604
senal-0.593
atican-0.546
 Ig-0.545
foundland-0.512
Token with
Token ID351
Feature activation+1.84e-01

Pos logit contributions
hire+0.742
ortun+0.675
image+0.667
icked+0.663
anced+0.619
Neg logit contributions
 FAR-0.651
senal-0.625
 Ig-0.588
atican-0.582
foundland-0.545
Token small
Token ID1402
Feature activation-2.49e-01

Pos logit contributions
hire+0.699
ortun+0.631
image+0.622
icked+0.618
anced+0.576
Neg logit contributions
 FAR-0.646
senal-0.629
 Ig-0.585
atican-0.582
foundland-0.545
Token Inc
Token ID3457
Feature activation-3.01e-01

Pos logit contributions
hire+1.080
ortun+0.969
image+0.955
icked+0.948
anced+0.875
Neg logit contributions
 FAR-1.144
senal-1.130
atican-1.050
 Ig-1.042
foundland-0.992
Tokenogn
Token ID2360
Feature activation+6.92e-02

Pos logit contributions
 FAR+0.656
senal+0.618
atican+0.567
 Ig+0.560
IOR+0.493
Neg logit contributions
hire-1.535
ortun-1.420
image-1.405
icked-1.404
anced-1.332
Tokenito
Token ID10094
Feature activation+3.03e-01

Pos logit contributions
hire+0.248
ortun+0.223
image+0.220
icked+0.218
anced+0.202
Neg logit contributions
 FAR-0.255
senal-0.251
atican-0.233
 Ig-0.233
foundland-0.220
Token\n
Token ID198
Feature activation+3.16e-01

Pos logit contributions
hire+1.078
ortun+0.968
image+0.953
icked+0.946
anced+0.874
Neg logit contributions
 FAR-1.128
senal-1.115
atican-1.033
 Ig-1.028
foundland-0.977
Token\n
Token ID198
Feature activation+2.35e-01

Pos logit contributions
hire+1.132
ortun+1.027
icked+1.011
image+0.966
anced+0.931
Neg logit contributions
 FAR-1.177
 Ig-1.081
atican-1.059
IOR-1.004
 SAT-0.985
TokenContract
Token ID45845
Feature activation+1.99e+00

Pos logit contributions
hire+0.812
ortun+0.731
icked+0.714
image+0.708
anced+0.662
Neg logit contributions
 FAR-0.894
senal-0.862
 Ig-0.819
atican-0.818
IOR-0.781
Token:
Token ID25
Feature activation+1.68e-01

Pos logit contributions
hire+6.782
image+5.941
ortun+5.919
icked+5.784
anced+5.313
Neg logit contributions
senal-7.361
 FAR-7.282
atican-6.788
 Ig-6.582
ikuman-6.509
Token A
Token ID317
Feature activation+1.89e-01

Pos logit contributions
hire+0.675
ortun+0.617
image+0.602
icked+0.598
anced+0.565
Neg logit contributions
 FAR-0.556
senal-0.538
 Ig-0.502
atican-0.493
IOR-0.464
Token three
Token ID1115
Feature activation+2.63e-01

Pos logit contributions
hire+0.688
ortun+0.620
image+0.610
icked+0.606
anced+0.561
Neg logit contributions
 FAR-0.690
senal-0.679
atican-0.630
 Ig-0.627
foundland-0.594
Token-
Token ID12
Feature activation+2.19e-01

Pos logit contributions
hire+0.703
ortun+0.612
image+0.598
icked+0.586
anced+0.524
Neg logit contributions
senal-1.226
 FAR-1.203
atican-1.139
 Ig-1.115
foundland-1.085
Tokenyear
Token ID1941
Feature activation+2.65e-01

Pos logit contributions
hire+0.647
ortun+0.568
image+0.557
icked+0.551
anced+0.498
Neg logit contributions
 FAR-0.948
senal-0.938
atican-0.880
 Ig-0.877
foundland-0.838
Token stack
Token ID8931
Feature activation+8.82e-02

Pos logit contributions
 FAR+2.901
senal+2.842
atican+2.619
 Ig+2.618
foundland+2.456
Neg logit contributions
hire-3.346
ortun-3.031
image-2.981
icked-2.966
anced-2.762
Token frame
Token ID5739
Feature activation-6.21e-02

Pos logit contributions
hire+0.233
image+0.203
ortun+0.199
icked+0.194
anced+0.177
Neg logit contributions
 FAR-0.405
senal-0.404
atican-0.383
 Ig-0.380
foundland-0.367
Token |
Token ID930
Feature activation+2.74e-01

Pos logit contributions
senal+0.302
 FAR+0.301
atican+0.285
 Ig+0.281
foundland+0.274
Neg logit contributions
hire-0.149
ortun-0.127
image-0.127
icked-0.122
anced-0.108
Token void
Token ID7951
Feature activation+1.86e+00

Pos logit contributions
hire+0.841
ortun+0.742
image+0.731
icked+0.721
anced+0.656
Neg logit contributions
 FAR-1.147
senal-1.140
atican-1.064
 Ig-1.057
foundland-1.015
Token *
Token ID1635
Feature activation+7.84e-01

Pos logit contributions
hire+4.533
image+4.209
ortun+4.064
icked+3.823
anced+3.580
Neg logit contributions
senal-8.127
 FAR-7.983
atican-7.736
foundland-7.466
 Ig-7.360
Tokenarg
Token ID853
Feature activation+1.92e+00

Pos logit contributions
hire+1.356
image+1.130
ortun+1.069
icked+1.006
anced+0.832
Neg logit contributions
senal-4.391
 FAR-4.323
atican-4.083
 Ig-4.076
lishes-3.959
Token |
Token ID930
Feature activation+7.94e-01

Pos logit contributions
hire+3.265
image+2.831
icked+2.641
ortun+2.614
anced+2.137
Neg logit contributions
senal-10.067
 FAR-9.828
atican-9.319
 Ig-9.135
foundland-9.110
Token +
Token ID1343
Feature activation+3.38e-01

Pos logit contributions
hire+2.460
image+2.192
ortun+2.184
icked+2.111
anced+1.934
Neg logit contributions
senal-3.276
 FAR-3.220
atican-3.032
 Ig-2.962
foundland-2.902
Token--------------------
Token ID19351
Feature activation-5.18e-02

Pos logit contributions
hire+1.241
ortun+1.119
image+1.101
icked+1.092
anced+1.012
Neg logit contributions
 FAR-1.219
senal-1.202
atican-1.115
 Ig-1.108
foundland-1.049
Token--+
Token ID44785
Feature activation+8.53e-01

Pos logit contributions
 FAR+0.163
senal+0.149
atican+0.147
 Ig+0.147
IOR+0.140
Neg logit contributions
hire-0.210
ortun-0.191
icked-0.189
image-0.187
anced-0.175
Token |
Token ID930
Feature activation+1.81e-02

Pos logit contributions
hire+2.830
ortun+2.544
image+2.542
icked+2.455
anced+2.274
Neg logit contributions
senal-3.322
 FAR-3.246
atican-3.082
 Ig-2.966
foundland-2.915
Token'
Token ID6
Feature activation-1.03e+00

Pos logit contributions
 FAR+4.457
senal+4.339
 Ig+4.058
atican+4.047
IOR+3.860
Neg logit contributions
hire-3.946
ortun-3.527
icked-3.438
image-3.420
anced-3.158
Tokens
Token ID82
Feature activation-8.58e-01

Pos logit contributions
 FAR+3.578
 Ig+3.265
atican+3.197
IOR+3.130
ftime+3.014
Neg logit contributions
hire-3.635
ortun-3.320
icked-3.229
anced-3.049
image-3.017
Token stack
Token ID8931
Feature activation+8.82e-02

Pos logit contributions
 FAR+2.901
senal+2.842
atican+2.619
 Ig+2.618
foundland+2.456
Neg logit contributions
hire-3.346
ortun-3.031
image-2.981
icked-2.966
anced-2.762
Token frame
Token ID5739
Feature activation-6.21e-02

Pos logit contributions
hire+0.233
image+0.203
ortun+0.199
icked+0.194
anced+0.177
Neg logit contributions
 FAR-0.405
senal-0.404
atican-0.383
 Ig-0.380
foundland-0.367
Token |
Token ID930
Feature activation+2.74e-01

Pos logit contributions
senal+0.302
 FAR+0.301
atican+0.285
 Ig+0.281
foundland+0.274
Neg logit contributions
hire-0.149
ortun-0.127
image-0.127
icked-0.122
anced-0.108
Token void
Token ID7951
Feature activation+1.86e+00

Pos logit contributions
hire+0.841
ortun+0.742
image+0.731
icked+0.721
anced+0.656
Neg logit contributions
 FAR-1.147
senal-1.140
atican-1.064
 Ig-1.057
foundland-1.015
Token *
Token ID1635
Feature activation+7.84e-01

Pos logit contributions
hire+4.533
image+4.209
ortun+4.064
icked+3.823
anced+3.580
Neg logit contributions
senal-8.127
 FAR-7.983
atican-7.736
foundland-7.466
 Ig-7.360
Tokenarg
Token ID853
Feature activation+1.92e+00

Pos logit contributions
hire+1.356
image+1.130
ortun+1.069
icked+1.006
anced+0.832
Neg logit contributions
senal-4.391
 FAR-4.323
atican-4.083
 Ig-4.076
lishes-3.959
Token |
Token ID930
Feature activation+7.94e-01

Pos logit contributions
hire+3.265
image+2.831
icked+2.641
ortun+2.614
anced+2.137
Neg logit contributions
senal-10.067
 FAR-9.828
atican-9.319
 Ig-9.135
foundland-9.110
Token +
Token ID1343
Feature activation+3.38e-01

Pos logit contributions
hire+2.460
image+2.192
ortun+2.184
icked+2.111
anced+1.934
Neg logit contributions
senal-3.276
 FAR-3.220
atican-3.032
 Ig-2.962
foundland-2.902
Token--------------------
Token ID19351
Feature activation-5.18e-02

Pos logit contributions
hire+1.241
ortun+1.119
image+1.101
icked+1.092
anced+1.012
Neg logit contributions
 FAR-1.219
senal-1.202
atican-1.115
 Ig-1.108
foundland-1.049
Token manually
Token ID14500
Feature activation-1.16e-01

Pos logit contributions
hire+0.126
ortun+0.115
image+0.113
icked+0.112
anced+0.105
Neg logit contributions
 FAR-0.102
senal-0.095
 Ig-0.091
atican-0.089
IOR-0.085
Token with
Token ID351
Feature activation+3.37e-01

Pos logit contributions
 FAR+0.413
senal+0.390
 Ig+0.374
atican+0.369
IOR+0.353
Neg logit contributions
hire-0.431
ortun-0.389
image-0.381
icked-0.379
anced-0.352
Token the
Token ID262
Feature activation-9.99e-03

Pos logit contributions
hire+1.259
ortun+1.132
image+1.115
icked+1.109
anced+1.031
Neg logit contributions
 FAR-1.200
senal-1.176
 Ig-1.090
atican-1.087
foundland-1.024
Token sl
Token ID1017
Feature activation+4.98e-01

Pos logit contributions
 FAR+0.035
senal+0.034
 Ig+0.032
atican+0.031
IOR+0.030
Neg logit contributions
hire-0.038
ortun-0.034
image-0.033
icked-0.033
anced-0.031
Tokens
Token ID82
Feature activation+1.31e-01

Pos logit contributions
hire+1.659
ortun+1.477
image+1.450
icked+1.438
anced+1.322
Neg logit contributions
 FAR-1.964
senal-1.933
atican-1.807
 Ig-1.801
foundland-1.709
Token invoke
Token ID26342
Feature activation+1.85e+00

Pos logit contributions
hire+0.446
ortun+0.397
icked+0.390
image+0.389
anced+0.358
Neg logit contributions
 FAR-0.508
senal-0.499
atican-0.467
 Ig-0.466
IOR-0.440
Token command
Token ID3141
Feature activation+9.30e-02

Pos logit contributions
hire+5.739
ortun+5.289
image+5.187
icked+5.049
anced+4.665
Neg logit contributions
senal-7.272
 FAR-7.176
atican-6.741
 Ig-6.578
ikuman-6.426
Token.
Token ID13
Feature activation+1.24e-01

Pos logit contributions
hire+0.307
ortun+0.272
image+0.266
icked+0.265
anced+0.244
Neg logit contributions
 FAR-0.370
senal-0.360
 Ig-0.340
atican-0.338
IOR-0.323
Token If
Token ID1002
Feature activation-2.86e-01

Pos logit contributions
hire+0.453
ortun+0.407
image+0.397
icked+0.397
anced+0.370
Neg logit contributions
 FAR-0.457
senal-0.441
 Ig-0.417
atican-0.412
foundland-0.389
Token the
Token ID262
Feature activation+1.43e-01

Pos logit contributions
 FAR+0.957
senal+0.917
 Ig+0.863
atican+0.846
foundland+0.797
Neg logit contributions
hire-1.128
ortun-1.020
image-1.003
icked-1.002
anced-0.938
Token function
Token ID2163
Feature activation+4.05e-01

Pos logit contributions
hire+0.494
ortun+0.441
image+0.433
icked+0.430
anced+0.399
Neg logit contributions
 FAR-0.548
senal-0.532
 Ig-0.499
atican-0.496
foundland-0.470
Tokenpath
Token ID6978
Feature activation-5.74e-01

Pos logit contributions
hire+0.910
ortun+0.829
icked+0.820
image+0.789
anced+0.766
Neg logit contributions
 FAR-0.705
senal-0.642
atican-0.631
 Ig-0.626
IOR-0.597
Token to
Token ID284
Feature activation-3.90e-01

Pos logit contributions
 FAR+1.747
 Ig+1.546
atican+1.468
 SAT+1.412
IOR+1.405
Neg logit contributions
hire-2.384
ortun-2.181
icked-2.175
image-2.095
odder-2.011
Token images
Token ID4263
Feature activation+1.67e-01

Pos logit contributions
 FAR+1.464
 Ig+1.332
senal+1.275
atican+1.262
 SAT+1.230
Neg logit contributions
hire-1.439
ortun-1.245
icked-1.233
image-1.142
anced-1.118
Token directory
Token ID8619
Feature activation+3.09e-01

Pos logit contributions
hire+0.670
ortun+0.610
icked+0.602
image+0.584
anced+0.558
Neg logit contributions
 FAR-0.560
senal-0.507
atican-0.493
 Ig-0.492
lishes-0.459
Token")
Token ID4943
Feature activation+3.22e-01

Pos logit contributions
hire+1.144
ortun+1.032
icked+1.016
image+1.000
anced+0.940
Neg logit contributions
 FAR-1.113
atican-1.002
 Ig-0.997
senal-0.991
IOR-0.940
Token args
Token ID26498
Feature activation+1.85e+00

Pos logit contributions
hire+0.888
ortun+0.773
image+0.758
icked+0.748
anced+0.674
Neg logit contributions
 FAR-1.447
senal-1.437
atican-1.349
 Ig-1.341
foundland-1.290
Token =
Token ID796
Feature activation+2.00e-01

Pos logit contributions
hire+4.805
image+4.187
ortun+4.115
icked+3.931
anced+3.560
Neg logit contributions
senal-8.918
 FAR-8.469
ikuman-7.861
atican-7.855
 practition-7.845
Token v
Token ID410
Feature activation+8.71e-01

Pos logit contributions
hire+0.948
ortun+0.872
icked+0.860
image+0.846
anced+0.801
Neg logit contributions
 FAR-0.527
 Ig-0.453
atican-0.448
senal-0.447
IOR-0.409
Tokenars
Token ID945
Feature activation-2.79e-02

Pos logit contributions
hire+3.050
ortun+2.730
icked+2.680
image+2.670
anced+2.450
Neg logit contributions
 FAR-3.295
senal-3.136
atican-3.024
 Ig-2.991
IOR-2.835
Token(
Token ID7
Feature activation-5.31e-02

Pos logit contributions
 FAR+0.089
 Ig+0.079
atican+0.079
IOR+0.074
 SAT+0.070
Neg logit contributions
hire-0.114
ortun-0.105
icked-0.104
image-0.100
anced-0.097
Tokenap
Token ID499
Feature activation+1.18e+00

Pos logit contributions
 FAR+0.190
atican+0.178
 Ig+0.173
senal+0.165
IOR+0.164
Neg logit contributions
hire-0.187
ortun-0.167
icked-0.166
image-0.158
anced-0.150
Token,
Token ID11
Feature activation+8.71e-02

Pos logit contributions
hire+0.196
ortun+0.173
image+0.170
icked+0.169
anced+0.155
Neg logit contributions
 FAR-0.240
senal-0.238
atican-0.222
 Ig-0.220
foundland-0.211
Token the
Token ID262
Feature activation+8.65e-02

Pos logit contributions
hire+0.334
ortun+0.303
image+0.299
icked+0.297
anced+0.276
Neg logit contributions
 FAR-0.300
senal-0.292
 Ig-0.272
atican-0.270
foundland-0.253
Token relationship
Token ID2776
Feature activation+3.05e-01

Pos logit contributions
hire+0.334
ortun+0.300
image+0.296
icked+0.295
anced+0.275
Neg logit contributions
 FAR-0.300
senal-0.288
 Ig-0.273
atican-0.269
foundland-0.251
Token is
Token ID318
Feature activation+6.24e-02

Pos logit contributions
hire+1.167
ortun+1.056
image+1.040
icked+1.031
anced+0.957
Neg logit contributions
 FAR-1.061
senal-1.029
 Ig-0.960
atican-0.959
foundland-0.897
Token still
Token ID991
Feature activation+9.46e-01

Pos logit contributions
hire+0.256
ortun+0.235
image+0.230
icked+0.230
anced+0.213
Neg logit contributions
 FAR-0.202
senal-0.190
 Ig-0.180
atican-0.178
foundland-0.164
Token going
Token ID1016
Feature activation+1.84e+00

Pos logit contributions
hire+3.603
ortun+3.256
image+3.213
icked+3.189
anced+2.966
Neg logit contributions
 FAR-3.271
senal-3.236
atican-2.981
 Ig-2.963
foundland-2.805
Token strong
Token ID1913
Feature activation+1.39e+00

Pos logit contributions
hire+5.243
ortun+4.506
image+4.378
icked+4.341
anced+3.899
Neg logit contributions
senal-7.818
 FAR-7.652
atican-7.329
 Ig-7.143
foundland-7.042
Token at
Token ID379
Feature activation+9.33e-02

Pos logit contributions
hire+4.395
ortun+3.802
image+3.786
icked+3.729
anced+3.416
Neg logit contributions
senal-5.666
 FAR-5.532
atican-5.163
 Ig-5.080
ikuman-5.029
Token Aut
Token ID5231
Feature activation-2.30e-01

Pos logit contributions
hire+0.336
ortun+0.303
image+0.298
icked+0.296
anced+0.274
Neg logit contributions
 FAR-0.343
senal-0.337
atican-0.313
 Ig-0.312
foundland-0.295
Token-
Token ID12
Feature activation-2.25e-01

Pos logit contributions
 FAR+0.888
 Ig+0.836
atican+0.816
senal+0.795
IOR+0.775
Neg logit contributions
hire-0.758
ortun-0.669
icked-0.658
image-0.652
anced-0.616
TokenO
Token ID46
Feature activation-1.02e-01

Pos logit contributions
 FAR+0.648
 Ig+0.603
atican+0.594
IOR+0.557
senal+0.556
Neg logit contributions
hire-0.946
ortun-0.866
icked-0.857
image-0.834
anced-0.808
Token Department
Token ID2732
Feature activation-2.84e-01

Pos logit contributions
 FAR+0.163
senal+0.156
 Ig+0.148
atican+0.146
foundland+0.137
Neg logit contributions
hire-0.168
ortun-0.152
image-0.149
icked-0.148
anced-0.139
Token of
Token ID286
Feature activation+4.38e-03

Pos logit contributions
 FAR+0.940
senal+0.923
atican+0.852
 Ig+0.842
foundland+0.803
Neg logit contributions
hire-1.138
ortun-1.027
image-1.017
icked-1.012
anced-0.938
Token Health
Token ID3893
Feature activation+2.47e-02

Pos logit contributions
hire+0.016
ortun+0.014
icked+0.014
image+0.014
anced+0.013
Neg logit contributions
 FAR-0.016
senal-0.016
 Ig-0.015
atican-0.014
foundland-0.014
Token advised
Token ID13030
Feature activation-3.28e-01

Pos logit contributions
hire+0.092
ortun+0.084
image+0.082
icked+0.082
anced+0.076
Neg logit contributions
 FAR-0.087
senal-0.085
 Ig-0.079
atican-0.079
foundland-0.074
Token the
Token ID262
Feature activation+6.92e-02

Pos logit contributions
 FAR+1.101
senal+1.053
 Ig+1.003
atican+0.971
foundland+0.922
Neg logit contributions
hire-1.273
ortun-1.176
image-1.149
icked-1.140
anced-1.067
Token NHS
Token ID18183
Feature activation+1.84e+00

Pos logit contributions
hire+0.244
ortun+0.220
image+0.215
icked+0.214
anced+0.199
Neg logit contributions
 FAR-0.261
senal-0.252
 Ig-0.238
atican-0.236
foundland-0.223
Token to
Token ID284
Feature activation-1.36e-01

Pos logit contributions
hire+6.653
image+5.808
icked+5.791
ortun+5.738
anced+5.271
Neg logit contributions
senal-6.394
 FAR-6.369
atican-5.964
ikuman-5.760
 Ig-5.703
Token move
Token ID1445
Feature activation-4.63e-01

Pos logit contributions
 FAR+0.446
senal+0.407
 Ig+0.402
atican+0.392
IOR+0.365
Neg logit contributions
hire-0.534
ortun-0.503
image-0.487
icked-0.485
anced-0.454
Token away
Token ID1497
Feature activation-1.23e+00

Pos logit contributions
 FAR+1.557
senal+1.484
 Ig+1.402
atican+1.365
foundland+1.296
Neg logit contributions
hire-1.810
ortun-1.664
icked-1.630
image-1.617
anced-1.508
Token from
Token ID422
Feature activation+3.99e-01

Pos logit contributions
 FAR+2.960
 Ig+2.586
atican+2.414
 SAT+2.327
IOR+2.162
Neg logit contributions
hire-5.776
ortun-5.440
icked-5.305
image-5.185
anced-5.041
Token the
Token ID262
Feature activation-2.24e-01

Pos logit contributions
hire+1.527
ortun+1.379
image+1.365
icked+1.354
anced+1.256
Neg logit contributions
 FAR-1.371
senal-1.361
atican-1.255
 Ig-1.236
foundland-1.175
Token.
Token ID13
Feature activation+8.92e-02

Pos logit contributions
 FAR+1.152
senal+1.123
 Ig+1.049
atican+1.046
foundland+0.988
Neg logit contributions
hire-1.166
ortun-1.044
image-1.029
icked-1.026
anced-0.946
TokenStudents
Token ID28239
Feature activation-8.05e-02

Pos logit contributions
hire+0.309
ortun+0.277
icked+0.270
image+0.270
anced+0.249
Neg logit contributions
 FAR-0.341
senal-0.333
 Ig-0.312
atican-0.312
foundland-0.295
Token,
Token ID11
Feature activation+1.38e-01

Pos logit contributions
 FAR+0.296
senal+0.287
atican+0.271
 Ig+0.270
foundland+0.254
Neg logit contributions
hire-0.286
ortun-0.260
image-0.255
icked-0.253
anced-0.234
Tokenworkers
Token ID22896
Feature activation+8.62e-01

Pos logit contributions
hire+0.514
ortun+0.464
image+0.457
icked+0.454
anced+0.421
Neg logit contributions
 FAR-0.491
senal-0.483
atican-0.447
 Ig-0.446
foundland-0.421
Token,
Token ID11
Feature activation+1.48e-01

Pos logit contributions
hire+2.855
ortun+2.508
image+2.454
icked+2.445
anced+2.236
Neg logit contributions
senal-3.432
 FAR-3.403
atican-3.145
 Ig-3.110
ikuman-3.002
Tokenemploy
Token ID7033
Feature activation+1.83e+00

Pos logit contributions
hire+0.524
ortun+0.470
image+0.463
icked+0.459
anced+0.424
Neg logit contributions
 FAR-0.553
senal-0.545
atican-0.506
 Ig-0.504
foundland-0.478
Tokenees
Token ID2841
Feature activation+2.95e-01

Pos logit contributions
hire+4.828
ortun+4.337
icked+4.000
image+3.982
anced+3.696
Neg logit contributions
senal-8.308
 FAR-8.145
atican-7.582
 Ig-7.538
ikuman-7.395
Token seemed
Token ID3947
Feature activation+2.22e-01

Pos logit contributions
hire+1.004
ortun+0.896
image+0.880
icked+0.873
anced+0.804
Neg logit contributions
 FAR-1.147
senal-1.134
atican-1.053
 Ig-1.049
foundland-0.998
Token to
Token ID284
Feature activation+6.33e-01

Pos logit contributions
hire+0.825
ortun+0.745
image+0.734
icked+0.728
anced+0.676
Neg logit contributions
 FAR-0.792
senal-0.783
atican-0.725
 Ig-0.720
foundland-0.681
Token have
Token ID423
Feature activation+5.58e-01

Pos logit contributions
hire+2.377
ortun+2.142
image+2.117
icked+2.097
anced+1.947
Neg logit contributions
 FAR-2.227
senal-2.217
atican-2.031
 Ig-2.017
foundland-1.915
Token risen
Token ID17450
Feature activation+3.49e-01

Pos logit contributions
hire+2.144
ortun+1.948
image+1.918
icked+1.903
anced+1.775
Neg logit contributions
 FAR-1.904
senal-1.888
atican-1.736
 Ig-1.720
foundland-1.625
Token PNG
Token ID36182
Feature activation-1.14e-03

Pos logit contributions
hire+1.843
ortun+1.697
image+1.676
icked+1.670
anced+1.586
Neg logit contributions
 FAR-0.923
senal-0.887
 Ig-0.801
atican-0.798
foundland-0.720
Token larger
Token ID4025
Feature activation+8.70e-01

Pos logit contributions
 FAR+0.004
senal+0.004
 Ig+0.004
atican+0.004
IOR+0.004
Neg logit contributions
hire-0.004
ortun-0.004
icked-0.004
image-0.004
anced-0.003
Token image
Token ID2939
Feature activation+2.22e-01

Pos logit contributions
hire+2.693
ortun+2.339
icked+2.274
image+2.174
anced+2.053
Neg logit contributions
 FAR-3.744
senal-3.522
 Ig-3.462
atican-3.405
foundland-3.243
Token
Token ID198
Feature activation+4.35e-01

Pos logit contributions
hire+0.803
ortun+0.718
icked+0.714
image+0.672
anced+0.646
Neg logit contributions
 FAR-0.845
 Ig-0.776
atican-0.740
senal-0.738
IOR-0.714
Token
Token ID198
Feature activation+6.21e-01

Pos logit contributions
hire+1.520
ortun+1.365
icked+1.348
image+1.300
anced+1.239
Neg logit contributions
 FAR-1.663
 Ig-1.525
atican-1.502
senal-1.431
IOR-1.424
Tokenlarg
Token ID15521
Feature activation+1.81e+00

Pos logit contributions
hire+2.310
ortun+2.072
icked+2.040
image+2.010
anced+1.886
Neg logit contributions
 FAR-2.209
senal-2.160
atican-2.012
 Ig-2.004
IOR-1.906
Tokener
Token ID263
Feature activation+1.38e+00

Pos logit contributions
hire+5.491
ortun+4.868
image+4.809
icked+4.704
anced+4.287
Neg logit contributions
senal-7.796
 FAR-7.630
atican-7.108
 Ig-7.023
foundland-6.797
Token image
Token ID2939
Feature activation+8.53e-01

Pos logit contributions
hire+3.893
ortun+3.394
image+3.318
icked+3.291
anced+2.965
Neg logit contributions
 FAR-6.126
senal-6.043
atican-5.688
 Ig-5.673
foundland-5.428
Token T
Token ID309
Feature activation+2.84e-01

Pos logit contributions
hire+3.173
ortun+2.869
image+2.834
icked+2.805
anced+2.604
Neg logit contributions
 FAR-3.017
senal-2.988
atican-2.761
 Ig-2.734
foundland-2.597
TokenIFF
Token ID29267
Feature activation-1.51e-01

Pos logit contributions
hire+0.697
ortun+0.579
icked+0.553
image+0.550
anced+0.490
Neg logit contributions
 FAR-1.393
senal-1.346
 Ig-1.292
atican-1.292
IOR-1.238
Token original
Token ID2656
Feature activation-5.10e-02

Pos logit contributions
 FAR+0.428
senal+0.383
 Ig+0.379
atican+0.372
IOR+0.354
Neg logit contributions
hire-0.685
ortun-0.612
icked-0.605
image-0.590
anced-0.569
Token Bon
Token ID7979
Feature activation-4.38e-01

Pos logit contributions
hire+0.551
ortun+0.497
image+0.488
icked+0.486
anced+0.452
Neg logit contributions
 FAR-0.523
senal-0.507
 Ig-0.477
atican-0.476
foundland-0.446
Tokennet
Token ID3262
Feature activation-3.53e-01

Pos logit contributions
 FAR+1.597
senal+1.555
atican+1.464
 Ig+1.460
foundland+1.378
Neg logit contributions
hire-1.574
ortun-1.419
image-1.387
icked-1.382
anced-1.273
Token (
Token ID357
Feature activation-6.22e-02

Pos logit contributions
 FAR+1.195
senal+1.193
atican+1.088
 Ig+1.068
foundland+1.027
Neg logit contributions
hire-1.378
ortun-1.247
image-1.230
icked-1.221
anced-1.133
TokenEqu
Token ID23588
Feature activation+8.88e-01

Pos logit contributions
 FAR+0.222
senal+0.213
atican+0.203
 Ig+0.202
IOR+0.193
Neg logit contributions
hire-0.227
ortun-0.207
icked-0.203
image-0.200
anced-0.189
Tokenipe
Token ID3757
Feature activation+5.27e-01

Pos logit contributions
hire+2.677
ortun+2.354
image+2.318
icked+2.287
anced+2.087
Neg logit contributions
 FAR-3.766
senal-3.663
atican-3.512
 Ig-3.487
foundland-3.316
Token Cycl
Token ID28007
Feature activation+1.81e+00

Pos logit contributions
hire+1.813
ortun+1.628
image+1.595
icked+1.591
anced+1.478
Neg logit contributions
 FAR-2.041
senal-1.925
 Ig-1.888
atican-1.856
foundland-1.743
Tokeniste
Token ID40833
Feature activation+7.27e-01

Pos logit contributions
hire+4.345
ortun+3.587
image+3.561
icked+3.561
anced+3.042
Neg logit contributions
senal-9.085
 FAR-8.817
atican-8.241
 Ig-8.213
ikuman-8.044
Token FD
Token ID30002
Feature activation+4.28e-02

Pos logit contributions
hire+2.560
ortun+2.247
image+2.214
icked+2.213
ester+1.999
Neg logit contributions
senal-2.718
 FAR-2.658
atican-2.467
ikuman-2.358
 Ig-2.355
TokenJ
Token ID41
Feature activation+4.77e-01

Pos logit contributions
hire+0.177
ortun+0.163
icked+0.161
image+0.159
anced+0.152
Neg logit contributions
 FAR-0.132
 Ig-0.122
atican-0.117
IOR-0.117
senal-0.108
Token -
Token ID532
Feature activation+2.18e-01

Pos logit contributions
hire+1.561
ortun+1.419
icked+1.388
image+1.351
anced+1.267
Neg logit contributions
 FAR-1.924
 Ig-1.773
atican-1.754
senal-1.747
IOR-1.659
Token Big
Token ID4403
Feature activation+5.07e-01

Pos logit contributions
hire+0.832
ortun+0.748
icked+0.738
image+0.736
anced+0.682
Neg logit contributions
senal-0.749
 FAR-0.741
atican-0.679
 Ig-0.655
foundland-0.643
Tokens
Token ID82
Feature activation-7.60e-01

Pos logit contributions
 FAR+3.628
 Ig+3.296
atican+3.229
IOR+3.151
 SAT+3.042
Neg logit contributions
hire-3.760
ortun-3.454
icked-3.377
anced-3.187
image-3.136
Token frame
Token ID5739
Feature activation-2.04e-01

Pos logit contributions
 FAR+2.504
senal+2.420
 Ig+2.253
atican+2.236
IOR+2.103
Neg logit contributions
hire-3.034
ortun-2.749
image-2.697
icked-2.693
anced-2.510
Token |
Token ID930
Feature activation+3.69e-01

Pos logit contributions
senal+0.998
 FAR+0.992
atican+0.938
 Ig+0.923
foundland+0.905
Neg logit contributions
hire-0.489
image-0.420
ortun-0.416
icked-0.401
anced-0.359
Token void
Token ID7951
Feature activation+1.61e+00

Pos logit contributions
hire+1.163
ortun+1.030
image+1.018
icked+1.000
anced+0.915
Neg logit contributions
 FAR-1.514
senal-1.507
atican-1.402
 Ig-1.392
foundland-1.338
Token *
Token ID1635
Feature activation+7.71e-01

Pos logit contributions
hire+4.663
image+4.220
ortun+4.169
icked+3.999
anced+3.667
Neg logit contributions
senal-6.973
 FAR-6.647
atican-6.319
foundland-6.123
ikuman-6.097
Tokenarg
Token ID853
Feature activation+1.79e+00

Pos logit contributions
hire+1.542
image+1.324
ortun+1.260
icked+1.200
anced+1.020
Neg logit contributions
senal-4.092
 FAR-4.058
 Ig-3.800
atican-3.794
lishes-3.693
Token |
Token ID930
Feature activation+9.70e-01

Pos logit contributions
hire+4.032
image+3.431
ortun+3.366
icked+3.263
anced+2.811
Neg logit contributions
senal-9.243
 FAR-8.715
atican-8.241
ikuman-8.174
 practition-8.167
Token +
Token ID1343
Feature activation+9.40e-01

Pos logit contributions
hire+2.831
image+2.574
ortun+2.501
icked+2.420
anced+2.223
Neg logit contributions
senal-4.156
 FAR-4.010
atican-3.823
foundland-3.694
 Ig-3.690
Token--------------------
Token ID19351
Feature activation+2.41e-01

Pos logit contributions
hire+2.809
ortun+2.469
image+2.415
icked+2.405
anced+2.180
Neg logit contributions
 FAR-4.038
senal-3.943
atican-3.729
 Ig-3.725
foundland-3.555
Token--+
Token ID44785
Feature activation+9.28e-01

Pos logit contributions
hire+0.968
ortun+0.882
icked+0.871
image+0.860
anced+0.804
Neg logit contributions
 FAR-0.783
senal-0.717
atican-0.706
 Ig-0.701
IOR-0.670
Token |
Token ID930
Feature activation-9.07e-02

Pos logit contributions
hire+2.451
ortun+2.114
image+2.069
icked+2.045
anced+1.825
Neg logit contributions
 FAR-4.297
senal-4.258
atican-4.007
 Ig-3.992
foundland-3.833
Token fractured
Token ID34051
Feature activation-2.11e-01

Pos logit contributions
hire+0.810
ortun+0.728
image+0.716
icked+0.712
anced+0.661
Neg logit contributions
 FAR-0.755
senal-0.725
 Ig-0.685
atican-0.682
IOR-0.637
Token collar
Token ID19908
Feature activation+9.03e-01

Pos logit contributions
 FAR+0.772
senal+0.721
 Ig+0.706
atican+0.693
foundland+0.651
Neg logit contributions
hire-0.777
ortun-0.696
icked-0.682
image-0.680
anced-0.625
Token bone
Token ID9970
Feature activation-2.05e-01

Pos logit contributions
hire+2.639
ortun+2.296
image+2.284
icked+2.241
anced+2.029
Neg logit contributions
 FAR-3.915
senal-3.911
atican-3.630
 Ig-3.612
foundland-3.472
Token in
Token ID287
Feature activation+6.27e-02

Pos logit contributions
 FAR+0.680
senal+0.643
 Ig+0.616
atican+0.607
foundland+0.567
Neg logit contributions
hire-0.818
ortun-0.742
icked-0.726
image-0.725
anced-0.677
Token Foot
Token ID7870
Feature activation-3.26e-01

Pos logit contributions
hire+0.236
ortun+0.214
image+0.210
icked+0.209
anced+0.194
Neg logit contributions
 FAR-0.220
senal-0.216
atican-0.200
 Ig-0.199
foundland-0.188
Tokensc
Token ID1416
Feature activation+1.79e+00

Pos logit contributions
 FAR+1.131
senal+1.111
 Ig+1.033
atican+1.025
foundland+0.966
Neg logit contributions
hire-1.232
ortun-1.116
icked-1.091
image-1.089
anced-1.020
Tokenray
Token ID2433
Feature activation+2.12e-01

Pos logit contributions
hire+3.506
ortun+3.087
image+2.930
icked+2.831
anced+2.321
Neg logit contributions
senal-9.460
 FAR-9.330
atican-8.639
 Ig-8.632
foundland-8.511
Token
Token ID447
Feature activation+3.74e-01

Pos logit contributions
hire+0.735
ortun+0.656
image+0.648
icked+0.641
anced+0.591
Neg logit contributions
 FAR-0.808
senal-0.799
atican-0.744
 Ig-0.737
foundland-0.703
Token
Token ID247
Feature activation+4.50e-01

Pos logit contributions
hire+1.502
ortun+1.357
image+1.344
icked+1.332
anced+1.236
Neg logit contributions
 FAR-1.233
senal-1.206
atican-1.110
 Ig-1.104
foundland-1.039
Tokens
Token ID82
Feature activation+7.90e-02

Pos logit contributions
hire+1.569
ortun+1.405
image+1.385
icked+1.371
anced+1.264
Neg logit contributions
 FAR-1.706
senal-1.684
atican-1.564
 Ig-1.556
foundland-1.478
Token win
Token ID1592
Feature activation-6.58e-01

Pos logit contributions
hire+0.306
ortun+0.277
image+0.272
icked+0.271
anced+0.253
Neg logit contributions
 FAR-0.270
senal-0.263
 Ig-0.245
atican-0.243
foundland-0.228
Token 16
Token ID1467
Feature activation+9.66e-01

Pos logit contributions
hire+0.464
ortun+0.425
icked+0.417
image+0.406
anced+0.392
Neg logit contributions
 FAR-0.346
 Ig-0.314
atican-0.304
senal-0.288
 SAT-0.281
Token,
Token ID11
Feature activation+6.61e-01

Pos logit contributions
hire+2.938
image+2.565
ortun+2.561
icked+2.501
anced+2.297
Neg logit contributions
senal-4.077
 FAR-4.012
atican-3.688
 Ig-3.669
ikuman-3.594
Token 1981
Token ID14745
Feature activation+3.08e-01

Pos logit contributions
hire+3.160
ortun+2.929
icked+2.872
image+2.866
anced+2.715
Neg logit contributions
 FAR-1.694
senal-1.573
 Ig-1.478
atican-1.469
IOR-1.326
Token
Token ID198
Feature activation+6.93e-01

Pos logit contributions
hire+0.926
ortun+0.813
image+0.798
icked+0.793
anced+0.713
Neg logit contributions
senal-1.318
 FAR-1.301
atican-1.203
 Ig-1.188
lishes-1.158
Token
Token ID198
Feature activation+6.18e-01

Pos logit contributions
hire+2.395
ortun+2.159
icked+2.112
image+2.056
anced+1.947
Neg logit contributions
 FAR-2.689
 Ig-2.470
atican-2.442
senal-2.365
IOR-2.309
TokenBirth
Token ID38480
Feature activation+1.77e+00

Pos logit contributions
hire+1.980
ortun+1.769
icked+1.723
image+1.710
anced+1.583
Neg logit contributions
 FAR-2.508
senal-2.441
atican-2.323
 Ig-2.310
IOR-2.213
Tokenplace
Token ID5372
Feature activation+5.41e-01

Pos logit contributions
hire+3.877
ortun+3.235
image+3.215
icked+3.049
anced+2.579
Neg logit contributions
senal-8.954
 FAR-8.822
atican-8.272
 Ig-8.199
ikuman-8.053
Token:
Token ID25
Feature activation-3.16e-01

Pos logit contributions
hire+1.706
ortun+1.528
icked+1.485
image+1.469
anced+1.362
Neg logit contributions
 FAR-2.263
senal-2.096
 Ig-2.094
atican-2.075
IOR-1.958
Token Long
Token ID5882
Feature activation-8.96e-02

Pos logit contributions
 FAR+1.060
senal+1.001
 Ig+0.962
atican+0.952
foundland+0.883
Neg logit contributions
hire-1.246
ortun-1.136
image-1.116
icked-1.112
anced-1.043
Token Beach
Token ID8511
Feature activation+6.43e-01

Pos logit contributions
 FAR+0.334
senal+0.327
atican+0.305
 Ig+0.305
foundland+0.289
Neg logit contributions
hire-0.317
ortun-0.286
image-0.279
icked-0.279
anced-0.258
Token,
Token ID11
Feature activation+3.93e-01

Pos logit contributions
hire+1.974
ortun+1.731
image+1.709
icked+1.685
anced+1.543
Neg logit contributions
senal-2.702
 FAR-2.669
atican-2.483
 Ig-2.454
lishes-2.361
Token a
Token ID257
Feature activation-1.67e-01

Pos logit contributions
hire+0.025
ortun+0.023
image+0.022
icked+0.022
anced+0.021
Neg logit contributions
 FAR-0.024
senal-0.023
 Ig-0.021
atican-0.021
IOR-0.020
Token major
Token ID1688
Feature activation-1.05e-01

Pos logit contributions
 FAR+0.564
senal+0.542
 Ig+0.510
atican+0.506
IOR+0.471
Neg logit contributions
hire-0.657
ortun-0.596
image-0.586
icked-0.583
anced-0.545
Token Oscars
Token ID46485
Feature activation+5.41e-01

Pos logit contributions
 FAR+0.368
senal+0.351
 Ig+0.339
atican+0.334
IOR+0.310
Neg logit contributions
hire-0.397
ortun-0.358
image-0.349
icked-0.349
anced-0.327
Token
Token ID198
Feature activation-9.76e-02

Pos logit contributions
hire+2.057
ortun+1.860
image+1.836
icked+1.820
anced+1.691
Neg logit contributions
 FAR-1.878
senal-1.857
atican-1.710
 Ig-1.696
foundland-1.612
Token
Token ID198
Feature activation+6.86e-02

Pos logit contributions
 FAR+0.353
 Ig+0.321
atican+0.314
IOR+0.299
 SAT+0.293
Neg logit contributions
hire-0.356
ortun-0.326
icked-0.321
image-0.304
anced-0.295
Tokencont
Token ID3642
Feature activation+1.73e+00

Pos logit contributions
hire+0.234
ortun+0.212
icked+0.207
image+0.204
anced+0.190
Neg logit contributions
 FAR-0.260
senal-0.250
atican-0.239
 Ig-0.238
IOR-0.228
Tokenender
Token ID2194
Feature activation+6.92e-01

Pos logit contributions
hire+6.398
image+6.121
icked+5.970
ortun+5.889
ester+5.569
Neg logit contributions
senal-6.240
 FAR-5.985
atican-5.208
ikuman-5.173
 practition-5.091
Token,
Token ID11
Feature activation+1.28e-01

Pos logit contributions
hire+2.382
ortun+2.141
image+2.109
icked+2.086
anced+1.931
Neg logit contributions
 FAR-2.642
senal-2.629
atican-2.429
 Ig-2.402
foundland-2.302
Token potentially
Token ID6196
Feature activation-6.17e-02

Pos logit contributions
hire+0.500
ortun+0.453
image+0.446
icked+0.443
anced+0.413
Neg logit contributions
 FAR-0.434
senal-0.423
 Ig-0.392
atican-0.391
foundland-0.365
Token picking
Token ID10868
Feature activation+2.16e-01

Pos logit contributions
 FAR+0.213
senal+0.204
 Ig+0.192
atican+0.189
IOR+0.179
Neg logit contributions
hire-0.240
ortun-0.216
image-0.212
icked-0.211
anced-0.198
Token up
Token ID510
Feature activation-3.22e-01

Pos logit contributions
hire+0.764
ortun+0.686
image+0.674
icked+0.671
anced+0.618
Neg logit contributions
 FAR-0.806
senal-0.798
atican-0.740
 Ig-0.735
foundland-0.699
Token.
Token ID764
Feature activation+6.81e-01

Pos logit contributions
hire+1.661
ortun+1.498
icked+1.470
image+1.444
anced+1.340
Neg logit contributions
 FAR-1.833
 Ig-1.674
atican-1.641
senal-1.640
IOR-1.556
Token list
Token ID1351
Feature activation+8.73e-01

Pos logit contributions
hire+2.704
image+2.494
ortun+2.457
icked+2.424
anced+2.270
Neg logit contributions
senal-2.241
 FAR-2.195
atican-2.029
 Ig-1.984
foundland-1.907
Token_
Token ID62
Feature activation+1.73e-01

Pos logit contributions
hire+2.450
ortun+2.137
image+2.068
icked+2.067
anced+1.857
Neg logit contributions
 FAR-3.925
senal-3.799
atican-3.638
 Ig-3.633
IOR-3.465
Tokenimages
Token ID17566
Feature activation-1.50e-02

Pos logit contributions
hire+0.288
ortun+0.214
icked+0.209
image+0.164
anced+0.161
Neg logit contributions
 FAR-0.966
 Ig-0.905
atican-0.905
senal-0.896
IOR-0.886
Token (
Token ID357
Feature activation+2.31e-01

Pos logit contributions
 FAR+0.044
 Ig+0.039
atican+0.038
senal+0.037
IOR+0.036
Neg logit contributions
hire-0.066
ortun-0.061
icked-0.059
image-0.058
anced-0.055
Token args
Token ID26498
Feature activation+1.73e+00

Pos logit contributions
hire+0.711
ortun+0.626
icked+0.603
image+0.601
anced+0.541
Neg logit contributions
 FAR-0.995
senal-0.935
 Ig-0.910
atican-0.900
IOR-0.868
Token [
Token ID685
Feature activation-1.01e-01

Pos logit contributions
hire+6.190
image+5.547
ortun+5.480
icked+5.379
anced+5.058
Neg logit contributions
senal-6.553
 FAR-6.172
ikuman-5.749
atican-5.646
 practition-5.602
Token "
Token ID366
Feature activation-1.63e-01

Pos logit contributions
 FAR+0.402
senal+0.392
atican+0.368
 Ig+0.368
IOR+0.349
Neg logit contributions
hire-0.337
ortun-0.300
icked-0.294
image-0.294
anced-0.270
Tokenimages
Token ID17566
Feature activation+3.79e-01

Pos logit contributions
 FAR+0.907
atican+0.852
 Ig+0.851
senal+0.846
IOR+0.830
Neg logit contributions
hire-0.270
ortun-0.198
icked-0.194
image-0.155
anced-0.152
Token"
Token ID1
Feature activation-5.35e-01

Pos logit contributions
hire+1.149
ortun+1.024
icked+0.992
image+0.971
anced+0.891
Neg logit contributions
 FAR-1.602
atican-1.480
 Ig-1.475
senal-1.471
IOR-1.404
Token ]
Token ID2361
Feature activation+1.41e-01

Pos logit contributions
 FAR+2.181
 Ig+1.996
senal+1.994
atican+1.977
IOR+1.890
Neg logit contributions
hire-1.707
ortun-1.535
icked-1.486
image-1.448
anced-1.355
Token a
Token ID257
Feature activation-1.00e-01

Pos logit contributions
 FAR+0.517
senal+0.500
 Ig+0.469
atican+0.461
foundland+0.432
Neg logit contributions
hire-0.598
ortun-0.543
icked-0.530
image-0.527
anced-0.494
Token recent
Token ID2274
Feature activation-1.32e-01

Pos logit contributions
 FAR+0.328
senal+0.321
atican+0.296
 Ig+0.295
foundland+0.276
Neg logit contributions
hire-0.401
ortun-0.365
image-0.359
icked-0.357
anced-0.334
Token article
Token ID2708
Feature activation-4.85e-01

Pos logit contributions
 FAR+0.457
senal+0.440
 Ig+0.414
atican+0.410
foundland+0.383
Neg logit contributions
hire-0.509
ortun-0.459
image-0.450
icked-0.450
anced-0.420
Token for
Token ID329
Feature activation-3.36e-01

Pos logit contributions
 FAR+1.623
senal+1.583
 Ig+1.468
atican+1.459
foundland+1.365
Neg logit contributions
hire-1.911
ortun-1.738
image-1.704
icked-1.693
anced-1.582
Token World
Token ID2159
Feature activation-1.89e-01

Pos logit contributions
 FAR+1.188
senal+1.138
 Ig+1.077
atican+1.058
foundland+0.991
Neg logit contributions
hire-1.275
ortun-1.151
image-1.123
icked-1.118
anced-1.049
TokenNet
Token ID7934
Feature activation-2.12e+00

Pos logit contributions
 FAR+0.722
senal+0.707
 Ig+0.661
atican+0.659
IOR+0.616
Neg logit contributions
hire-0.659
ortun-0.584
image-0.569
icked-0.563
anced-0.528
TokenDaily
Token ID28545
Feature activation-6.02e-01

Pos logit contributions
 FAR+6.144
senal+5.872
 Ig+5.732
atican+5.481
IOR+5.011
Neg logit contributions
hire-8.643
ortun-7.814
icked-7.505
anced-7.494
image-7.438
Token,
Token ID11
Feature activation-1.09e-02

Pos logit contributions
 FAR+2.304
senal+2.272
 Ig+2.120
atican+2.113
foundland+2.000
Neg logit contributions
hire-2.068
ortun-1.849
image-1.811
icked-1.800
anced-1.661
Token for
Token ID329
Feature activation-2.62e-01

Pos logit contributions
 FAR+0.038
senal+0.037
atican+0.035
 Ig+0.035
foundland+0.033
Neg logit contributions
hire-0.041
ortun-0.037
image-0.037
icked-0.036
anced-0.034
Token example
Token ID1672
Feature activation+1.31e-01

Pos logit contributions
 FAR+0.947
senal+0.922
 Ig+0.865
atican+0.856
foundland+0.807
Neg logit contributions
hire-0.959
ortun-0.868
image-0.844
icked-0.843
anced-0.785
Token,
Token ID11
Feature activation+3.54e-02

Pos logit contributions
hire+0.438
ortun+0.394
image+0.384
icked+0.384
anced+0.354
Neg logit contributions
 FAR-0.511
senal-0.493
 Ig-0.471
atican-0.470
foundland-0.441
Token
Token ID198
Feature activation-9.25e-02

Pos logit contributions
 FAR+0.381
 Ig+0.347
atican+0.338
IOR+0.321
 SAT+0.315
Neg logit contributions
hire-0.390
ortun-0.359
icked-0.354
image-0.334
anced-0.326
TokenThat
Token ID2504
Feature activation-4.49e-01

Pos logit contributions
 FAR+0.345
senal+0.317
atican+0.315
 Ig+0.314
IOR+0.303
Neg logit contributions
hire-0.324
ortun-0.294
icked-0.288
image-0.277
anced-0.266
Token is
Token ID318
Feature activation+3.11e-02

Pos logit contributions
 FAR+1.584
senal+1.491
 Ig+1.438
atican+1.423
IOR+1.317
Neg logit contributions
hire-1.690
ortun-1.511
icked-1.486
image-1.484
anced-1.386
Token a
Token ID257
Feature activation-3.84e-01

Pos logit contributions
hire+0.126
ortun+0.115
image+0.113
icked+0.112
anced+0.105
Neg logit contributions
 FAR-0.102
senal-0.098
 Ig-0.091
atican-0.090
IOR-0.084
Token far
Token ID1290
Feature activation-9.36e-01

Pos logit contributions
 FAR+1.260
senal+1.190
 Ig+1.115
atican+1.110
IOR+1.026
Neg logit contributions
hire-1.556
ortun-1.414
image-1.395
icked-1.385
anced-1.298
Token cry
Token ID3960
Feature activation-1.96e+00

Pos logit contributions
 FAR+2.042
senal+1.844
atican+1.686
 Ig+1.664
foundland+1.479
Neg logit contributions
hire-4.799
ortun-4.465
image-4.398
icked-4.358
anced-4.132
Token from
Token ID422
Feature activation-6.46e-01

Pos logit contributions
 FAR+5.756
senal+5.564
atican+5.137
 Ig+5.056
IOR+4.714
Neg logit contributions
hire-8.295
ortun-7.703
image-7.520
icked-7.513
anced-7.044
Token
Token ID564
Feature activation-4.28e-01

Pos logit contributions
 FAR+2.090
senal+1.935
 Ig+1.902
atican+1.835
IOR+1.694
Neg logit contributions
hire-2.604
icked-2.347
ortun-2.345
image-2.303
anced-2.166
Token
Token ID250
Feature activation-1.11e-01

Pos logit contributions
 FAR+1.587
senal+1.546
 Ig+1.452
atican+1.450
IOR+1.376
Neg logit contributions
hire-1.492
ortun-1.353
icked-1.315
image-1.310
anced-1.219
Tokengu
Token ID5162
Feature activation-2.05e-03

Pos logit contributions
 FAR+0.413
senal+0.402
atican+0.380
 Ig+0.378
foundland+0.359
Neg logit contributions
hire-0.385
ortun-0.347
icked-0.341
image-0.336
anced-0.316
Tokenarant
Token ID4741
Feature activation+1.84e-01

Pos logit contributions
 FAR+0.009
senal+0.008
atican+0.008
 Ig+0.008
foundland+0.008
Neg logit contributions
hire-0.006
ortun-0.006
image-0.005
icked-0.005
anced-0.005
Token States
Token ID1829
Feature activation-5.51e-02

Pos logit contributions
hire+0.829
ortun+0.746
image+0.728
icked+0.720
anced+0.661
Neg logit contributions
senal-1.043
 FAR-1.029
atican-0.952
 Ig-0.934
ftime-0.912
Token 6
Token ID718
Feature activation+3.35e-01

Pos logit contributions
 FAR+0.181
 Ig+0.162
senal+0.161
atican+0.159
IOR+0.146
Neg logit contributions
hire-0.225
ortun-0.208
icked-0.202
image-0.199
anced-0.190
Token12
Token ID1065
Feature activation+5.01e-02

Pos logit contributions
hire+1.292
ortun+1.175
icked+1.148
image+1.146
anced+1.070
Neg logit contributions
 FAR-1.152
senal-1.114
atican-1.046
 Ig-1.039
foundland-0.975
Token Posts
Token ID12043
Feature activation+1.64e-01

Pos logit contributions
hire+0.150
ortun+0.134
icked+0.130
image+0.124
anced+0.119
Neg logit contributions
 FAR-0.221
 Ig-0.204
atican-0.201
senal-0.201
IOR-0.191
Token #
Token ID1303
Feature activation-4.65e-01

Pos logit contributions
hire+0.442
image+0.383
ortun+0.381
icked+0.368
anced+0.319
Neg logit contributions
 FAR-0.745
senal-0.743
atican-0.697
 Ig-0.678
foundland-0.675
Token17
Token ID1558
Feature activation-1.94e+00

Pos logit contributions
 FAR+1.128
 Ig+0.952
atican+0.938
IOR+0.927
enburg+0.858
Neg logit contributions
hire-2.158
icked-2.035
ortun-2.030
anced-1.914
image-1.903
Token Good
Token ID4599
Feature activation+1.94e-01

Pos logit contributions
 FAR+6.195
 Ig+5.824
senal+5.452
atican+5.223
 SAT+5.061
Neg logit contributions
hire-7.323
ortun-6.631
icked-6.510
image-6.498
anced-6.210
Token to
Token ID284
Feature activation-2.97e-01

Pos logit contributions
hire+0.719
ortun+0.648
image+0.637
icked+0.634
anced+0.588
Neg logit contributions
 FAR-0.691
senal-0.679
 Ig-0.629
atican-0.628
foundland-0.593
Token hear
Token ID3285
Feature activation-2.77e-01

Pos logit contributions
 FAR+1.047
senal+0.994
 Ig+0.952
atican+0.936
IOR+0.870
Neg logit contributions
hire-1.119
ortun-1.010
image-1.003
icked-0.988
anced-0.930
Token.
Token ID13
Feature activation+9.65e-02

Pos logit contributions
 FAR+0.906
senal+0.862
 Ig+0.819
atican+0.798
foundland+0.750
Neg logit contributions
hire-1.118
ortun-1.015
image-1.003
icked-0.998
anced-0.934
Token
Token ID198
Feature activation+4.41e-02

Pos logit contributions
hire+0.355
ortun+0.319
image+0.313
icked+0.312
anced+0.290
Neg logit contributions
 FAR-0.350
senal-0.339
 Ig-0.320
atican-0.316
foundland-0.299
Token States
Token ID1829
Feature activation-3.68e-03

Pos logit contributions
hire+1.639
ortun+1.496
image+1.458
icked+1.427
anced+1.296
Neg logit contributions
senal-2.274
 FAR-2.228
atican-2.078
 Ig-2.039
ftime-2.007
Token 3
Token ID513
Feature activation+1.38e-01

Pos logit contributions
 FAR+0.012
senal+0.011
 Ig+0.011
atican+0.011
IOR+0.010
Neg logit contributions
hire-0.015
ortun-0.014
icked-0.013
image-0.013
anced-0.012
Token472
Token ID37856
Feature activation+9.10e-02

Pos logit contributions
hire+0.498
ortun+0.451
icked+0.438
image+0.436
anced+0.406
Neg logit contributions
 FAR-0.512
senal-0.493
atican-0.470
 Ig-0.465
foundland-0.439
Token Posts
Token ID12043
Feature activation+2.10e-01

Pos logit contributions
hire+0.274
ortun+0.248
icked+0.241
image+0.227
anced+0.220
Neg logit contributions
 FAR-0.398
 Ig-0.370
atican-0.359
senal-0.345
IOR-0.344
Token #
Token ID1303
Feature activation-7.20e-01

Pos logit contributions
hire+0.585
image+0.509
ortun+0.507
icked+0.494
anced+0.430
Neg logit contributions
 FAR-0.933
senal-0.932
atican-0.871
 Ig-0.850
foundland-0.846
Token11
Token ID1157
Feature activation-1.93e+00

Pos logit contributions
 FAR+2.020
 Ig+1.848
atican+1.770
IOR+1.724
enburg+1.640
Neg logit contributions
hire-3.043
ortun-2.829
icked-2.777
anced-2.620
image-2.592
Token On
Token ID1550
Feature activation-7.13e-01

Pos logit contributions
 FAR+6.307
 Ig+5.893
senal+5.617
atican+5.336
 SAT+5.200
Neg logit contributions
hire-7.244
ortun-6.533
image-6.350
icked-6.334
anced-6.061
Token September
Token ID2693
Feature activation-1.11e+00

Pos logit contributions
 FAR+2.498
senal+2.378
 Ig+2.302
atican+2.221
IOR+2.126
Neg logit contributions
hire-2.652
ortun-2.397
icked-2.341
image-2.313
anced-2.191
Token 20
Token ID1160
Feature activation+3.19e-01

Pos logit contributions
 FAR+3.059
 Ig+2.784
atican+2.528
 SAT+2.495
 Ley+2.351
Neg logit contributions
hire-4.969
ortun-4.584
icked-4.470
anced-4.346
image-4.275
Token 2013
Token ID2211
Feature activation-1.91e-01

Pos logit contributions
hire+1.391
ortun+1.276
image+1.265
icked+1.255
anced+1.173
Neg logit contributions
 FAR-0.922
senal-0.918
atican-0.824
 Ig-0.812
foundland-0.767
Token 07
Token ID8753
Feature activation-1.73e-01

Pos logit contributions
 FAR+0.561
 Ig+0.503
atican+0.482
senal+0.464
 SAT+0.446
Neg logit contributions
hire-0.842
ortun-0.782
icked-0.759
image-0.750
anced-0.729
Token States
Token ID1829
Feature activation-1.99e-01

Pos logit contributions
hire+0.668
ortun+0.602
image+0.588
icked+0.581
anced+0.533
Neg logit contributions
senal-0.833
 FAR-0.828
atican-0.765
 Ig-0.750
ftime-0.730
Token 228
Token ID29041
Feature activation-3.14e-02

Pos logit contributions
 FAR+0.654
 Ig+0.587
senal+0.583
atican+0.575
IOR+0.526
Neg logit contributions
hire-0.810
ortun-0.752
icked-0.731
image-0.717
anced-0.685
Token65
Token ID2996
Feature activation+6.63e-02

Pos logit contributions
 FAR+0.112
senal+0.107
 Ig+0.102
atican+0.102
IOR+0.095
Neg logit contributions
hire-0.118
ortun-0.107
icked-0.104
image-0.103
anced-0.097
Token Posts
Token ID12043
Feature activation+1.42e-01

Pos logit contributions
hire+0.195
ortun+0.177
icked+0.170
image+0.163
anced+0.156
Neg logit contributions
 FAR-0.293
 Ig-0.271
atican-0.266
senal-0.259
IOR-0.252
Token #
Token ID1303
Feature activation-4.79e-01

Pos logit contributions
hire+0.389
image+0.337
ortun+0.336
icked+0.327
anced+0.283
Neg logit contributions
 FAR-0.635
senal-0.633
atican-0.591
 Ig-0.579
foundland-0.576
Token13
Token ID1485
Feature activation-1.93e+00

Pos logit contributions
 FAR+1.226
 Ig+1.058
IOR+1.020
atican+0.996
 SAT+0.952
Neg logit contributions
hire-2.110
ortun-2.056
icked-2.015
anced-1.934
umbnail-1.914
Token On
Token ID1550
Feature activation-6.26e-01

Pos logit contributions
 FAR+6.166
 Ig+5.777
senal+5.358
atican+5.199
 SAT+5.003
Neg logit contributions
hire-7.370
ortun-6.729
icked-6.536
image-6.503
anced-6.238
Token September
Token ID2693
Feature activation-9.18e-01

Pos logit contributions
 FAR+1.902
 Ig+1.725
atican+1.596
senal+1.594
 SAT+1.539
Neg logit contributions
hire-2.584
ortun-2.400
icked-2.357
image-2.316
-2.208
Token 20
Token ID1160
Feature activation+4.08e-01

Pos logit contributions
 FAR+2.731
senal+2.599
 Ig+2.423
atican+2.392
IOR+2.215
Neg logit contributions
hire-3.947
ortun-3.613
image-3.547
icked-3.535
anced-3.339
Token 2013
Token ID2211
Feature activation+1.48e-02

Pos logit contributions
hire+1.891
ortun+1.760
icked+1.713
image+1.711
anced+1.629
Neg logit contributions
 FAR-1.094
senal-0.969
 Ig-0.963
atican-0.944
IOR-0.869
Token 07
Token ID8753
Feature activation+1.16e-01

Pos logit contributions
hire+0.064
ortun+0.059
image+0.057
icked+0.057
anced+0.055
Neg logit contributions
 FAR-0.045
senal-0.041
 Ig-0.040
atican-0.039
IOR-0.036
TokenT
Token ID51
Feature activation+3.44e-01

Pos logit contributions
hire+1.717
ortun+1.536
image+1.491
icked+1.477
anced+1.388
Neg logit contributions
 FAR-1.764
senal-1.678
 Ig-1.605
atican-1.582
IOR-1.524
Token 235
Token ID28878
Feature activation+3.75e-01

Pos logit contributions
hire+1.300
ortun+1.176
image+1.156
icked+1.148
anced+1.071
Neg logit contributions
 FAR-1.206
senal-1.178
atican-1.092
 Ig-1.092
foundland-1.028
Token21
Token ID2481
Feature activation+3.97e-01

Pos logit contributions
hire+1.399
ortun+1.264
image+1.243
icked+1.236
anced+1.147
Neg logit contributions
 FAR-1.328
senal-1.306
atican-1.209
 Ig-1.205
foundland-1.138
Token Posts
Token ID12043
Feature activation+7.60e-02

Pos logit contributions
hire+1.118
ortun+1.002
icked+0.963
image+0.929
anced+0.872
Neg logit contributions
 FAR-1.820
 Ig-1.679
senal-1.662
atican-1.653
IOR-1.582
Token #
Token ID1303
Feature activation-4.80e-01

Pos logit contributions
hire+0.208
image+0.180
ortun+0.180
icked+0.174
anced+0.152
Neg logit contributions
 FAR-0.341
senal-0.340
atican-0.319
 Ig-0.312
foundland-0.309
Token18
Token ID1507
Feature activation-1.90e+00

Pos logit contributions
 FAR+1.093
atican+0.928
 Ig+0.922
IOR+0.902
 SAT+0.827
Neg logit contributions
hire-2.314
icked-2.146
ortun-2.105
odder-1.988
ドラ-1.963
Token Good
Token ID4599
Feature activation+2.70e-01

Pos logit contributions
 FAR+6.191
 Ig+5.780
senal+5.425
atican+5.140
 SAT+5.025
Neg logit contributions
hire-7.182
ortun-6.514
icked-6.399
image-6.335
anced-6.072
Token stuff
Token ID3404
Feature activation-8.98e-02

Pos logit contributions
hire+0.991
ortun+0.893
image+0.878
icked+0.873
anced+0.809
Neg logit contributions
 FAR-0.975
senal-0.959
atican-0.888
 Ig-0.887
foundland-0.838
Token!
Token ID0
Feature activation-5.79e-02

Pos logit contributions
 FAR+0.341
senal+0.333
 Ig+0.313
atican+0.311
foundland+0.295
Neg logit contributions
hire-0.313
ortun-0.279
image-0.275
icked-0.273
anced-0.253
Token Great
Token ID3878
Feature activation+4.52e-01

Pos logit contributions
 FAR+0.212
senal+0.204
 Ig+0.194
atican+0.191
foundland+0.181
Neg logit contributions
hire-0.211
ortun-0.190
image-0.186
icked-0.186
anced-0.173
Token to
Token ID284
Feature activation-5.26e-01

Pos logit contributions
hire+1.688
ortun+1.524
image+1.500
icked+1.490
anced+1.384
Neg logit contributions
 FAR-1.599
senal-1.574
atican-1.456
 Ig-1.450
foundland-1.370
Token Ahmed
Token ID21157
Feature activation-1.04e-01

Pos logit contributions
 FAR+0.126
senal+0.124
atican+0.114
 Ig+0.113
foundland+0.107
Neg logit contributions
hire-0.148
ortun-0.135
image-0.133
icked-0.132
anced-0.123
Token of
Token ID286
Feature activation-2.06e-01

Pos logit contributions
 FAR+0.346
senal+0.338
 Ig+0.313
atican+0.312
foundland+0.291
Neg logit contributions
hire-0.410
ortun-0.372
image-0.365
icked-0.364
anced-0.340
Token Mars
Token ID8706
Feature activation-7.14e-01

Pos logit contributions
 FAR+0.677
senal+0.660
 Ig+0.612
atican+0.611
foundland+0.572
Neg logit contributions
hire-0.819
ortun-0.745
image-0.734
icked-0.730
anced-0.682
Tokenabit
Token ID29968
Feature activation-2.00e-01

Pos logit contributions
 FAR+2.860
senal+2.798
 Ig+2.629
atican+2.613
foundland+2.500
Neg logit contributions
hire-2.285
ortun-2.050
image-2.011
icked-2.007
anced-1.844
Token National
Token ID2351
Feature activation-4.28e-02

Pos logit contributions
 FAR+0.778
senal+0.764
 Ig+0.714
atican+0.713
foundland+0.678
Neg logit contributions
hire-0.676
ortun-0.605
image-0.595
icked-0.591
anced-0.545
Token Park
Token ID3250
Feature activation-1.89e+00

Pos logit contributions
 FAR+0.176
senal+0.174
 Ig+0.164
atican+0.163
foundland+0.155
Neg logit contributions
hire-0.134
ortun-0.119
image-0.116
icked-0.115
anced-0.106
Token in
Token ID287
Feature activation-3.25e-01

Pos logit contributions
 FAR+6.682
senal+6.304
 Ig+6.066
atican+5.815
foundland+5.782
Neg logit contributions
hire-6.433
ortun-5.877
image-5.804
icked-5.732
anced-5.382
Token Kenya
Token ID21506
Feature activation+4.04e-01

Pos logit contributions
 FAR+0.987
senal+0.929
 Ig+0.891
atican+0.871
foundland+0.813
Neg logit contributions
hire-1.383
ortun-1.267
icked-1.243
image-1.242
anced-1.169
Token in
Token ID287
Feature activation-8.41e-02

Pos logit contributions
hire+1.288
ortun+1.138
image+1.120
icked+1.111
anced+1.014
Neg logit contributions
 FAR-1.652
senal-1.639
atican-1.529
 Ig-1.520
foundland-1.452
Token the
Token ID262
Feature activation+2.40e-01

Pos logit contributions
 FAR+0.241
senal+0.223
 Ig+0.216
atican+0.210
foundland+0.192
Neg logit contributions
hire-0.375
ortun-0.346
icked-0.339
image-0.336
anced-0.319
Token early
Token ID1903
Feature activation+1.10e-01

Pos logit contributions
hire+0.995
ortun+0.903
icked+0.888
image+0.885
anced+0.829
Neg logit contributions
 FAR-0.771
senal-0.732
 Ig-0.694
atican-0.688
foundland-0.633
Token being
Token ID852
Feature activation+5.77e-02

Pos logit contributions
 FAR+1.450
senal+1.336
 Ig+1.286
atican+1.238
IOR+1.170
Neg logit contributions
hire-1.802
ortun-1.655
image-1.624
icked-1.613
anced-1.495
Token there
Token ID612
Feature activation+7.92e-02

Pos logit contributions
hire+0.231
ortun+0.210
image+0.208
icked+0.203
anced+0.189
Neg logit contributions
 FAR-0.192
senal-0.179
 Ig-0.172
atican-0.167
lishes-0.157
Token whether
Token ID1771
Feature activation-1.02e+00

Pos logit contributions
hire+0.326
ortun+0.298
image+0.293
icked+0.291
anced+0.271
Neg logit contributions
 FAR-0.253
senal-0.237
 Ig-0.227
atican-0.223
foundland-0.208
Token I
Token ID314
Feature activation+1.22e-01

Pos logit contributions
 FAR+3.279
senal+3.035
 Ig+2.965
atican+2.827
foundland+2.715
Neg logit contributions
hire-4.044
ortun-3.682
image-3.654
icked-3.610
anced-3.349
Token set
Token ID900
Feature activation+7.85e-01

Pos logit contributions
hire+0.438
image+0.398
ortun+0.398
icked+0.389
+0.358
Neg logit contributions
 FAR-0.442
 Ig-0.400
senal-0.397
atican-0.392
lishes-0.368
Token foot
Token ID2366
Feature activation-1.88e+00

Pos logit contributions
hire+2.929
ortun+2.653
image+2.607
icked+2.591
anced+2.410
Neg logit contributions
 FAR-2.756
senal-2.750
atican-2.529
 Ig-2.500
foundland-2.385
Token in
Token ID287
Feature activation-4.17e-01

Pos logit contributions
 FAR+5.270
 Ig+4.671
senal+4.512
atican+4.430
foundland+4.155
Neg logit contributions
hire-8.084
ortun-7.367
image-7.248
icked-7.149
umbnail-6.761
Token it
Token ID340
Feature activation-7.79e-02

Pos logit contributions
 FAR+1.432
senal+1.372
 Ig+1.311
atican+1.273
foundland+1.204
Neg logit contributions
hire-1.598
ortun-1.437
image-1.418
icked-1.413
anced-1.304
Token or
Token ID393
Feature activation-1.48e-01

Pos logit contributions
 FAR+0.267
senal+0.254
 Ig+0.241
atican+0.239
foundland+0.225
Neg logit contributions
hire-0.300
ortun-0.272
image-0.268
icked-0.266
anced-0.246
Token not
Token ID407
Feature activation+3.93e-01

Pos logit contributions
 FAR+0.396
senal+0.369
 Ig+0.348
atican+0.339
IOR+0.310
Neg logit contributions
hire-0.678
ortun-0.627
image-0.622
icked-0.612
anced-0.575
Token."
Token ID526
Feature activation+2.50e-01

Pos logit contributions
hire+1.240
ortun+1.092
image+1.074
icked+1.062
anced+0.973
Neg logit contributions
senal-1.624
 FAR-1.617
atican-1.502
 Ig-1.488
foundland-1.428
Token States
Token ID1829
Feature activation-1.96e-01

Pos logit contributions
hire+0.637
ortun+0.574
image+0.559
icked+0.552
anced+0.505
Neg logit contributions
senal-0.830
 FAR-0.819
atican-0.759
 Ig-0.744
ftime-0.728
Token 5
Token ID642
Feature activation+3.20e-01

Pos logit contributions
 FAR+0.644
 Ig+0.580
senal+0.576
atican+0.567
IOR+0.519
Neg logit contributions
hire-0.798
ortun-0.738
icked-0.718
image-0.704
anced-0.674
Token667
Token ID28933
Feature activation+3.06e-01

Pos logit contributions
hire+1.227
ortun+1.112
image+1.091
icked+1.087
anced+1.012
Neg logit contributions
 FAR-1.102
senal-1.078
atican-1.001
 Ig-0.996
foundland-0.938
Token Posts
Token ID12043
Feature activation+1.82e-01

Pos logit contributions
hire+0.861
ortun+0.763
icked+0.736
image+0.717
anced+0.668
Neg logit contributions
 FAR-1.388
 Ig-1.294
senal-1.273
atican-1.273
IOR-1.209
Token #
Token ID1303
Feature activation-6.81e-01

Pos logit contributions
hire+0.502
image+0.437
ortun+0.434
icked+0.422
anced+0.366
Neg logit contributions
 FAR-0.817
senal-0.812
atican-0.758
 Ig-0.741
foundland-0.739
Token12
Token ID1065
Feature activation-1.83e+00

Pos logit contributions
 FAR+1.714
 Ig+1.567
atican+1.471
IOR+1.454
 SAT+1.366
Neg logit contributions
hire-3.092
icked-2.880
ortun-2.879
anced-2.719
odder-2.661
Token I
Token ID314
Feature activation-2.56e-02

Pos logit contributions
 FAR+5.886
 Ig+5.497
senal+5.194
atican+4.970
 SAT+4.761
Neg logit contributions
hire-6.995
ortun-6.402
icked-6.196
image-6.157
anced-5.921
Token'm
Token ID1101
Feature activation-5.99e-02

Pos logit contributions
 FAR+0.089
senal+0.083
 Ig+0.080
atican+0.079
IOR+0.075
Neg logit contributions
hire-0.098
ortun-0.089
image-0.087
icked-0.086
anced-0.081
Token glad
Token ID9675
Feature activation+8.40e-02

Pos logit contributions
 FAR+0.194
senal+0.185
 Ig+0.173
atican+0.171
foundland+0.159
Neg logit contributions
hire-0.245
ortun-0.223
image-0.220
icked-0.218
anced-0.204
Token there
Token ID612
Feature activation-4.68e-01

Pos logit contributions
hire+0.331
ortun+0.301
image+0.296
icked+0.294
anced+0.275
Neg logit contributions
 FAR-0.281
senal-0.274
 Ig-0.254
atican-0.253
foundland-0.238
Token aren
Token ID3588
Feature activation-2.94e-01

Pos logit contributions
 FAR+1.697
senal+1.636
 Ig+1.549
atican+1.540
foundland+1.432
Neg logit contributions
hire-1.703
ortun-1.523
image-1.510
icked-1.494
anced-1.396
Token worth
Token ID2861
Feature activation-1.60e-01

Pos logit contributions
 FAR+0.898
senal+0.855
 Ig+0.816
atican+0.780
IOR+0.754
Neg logit contributions
hire-0.983
ortun-0.886
image-0.872
icked-0.863
anced-0.815
Token a
Token ID257
Feature activation-1.27e-01

Pos logit contributions
 FAR+0.541
senal+0.509
 Ig+0.488
atican+0.472
IOR+0.448
Neg logit contributions
hire-0.628
ortun-0.568
image-0.559
icked-0.555
anced-0.518
Token hell
Token ID5968
Feature activation-4.83e-01

Pos logit contributions
 FAR+0.434
senal+0.412
 Ig+0.392
atican+0.385
IOR+0.361
Neg logit contributions
hire-0.494
ortun-0.446
image-0.442
icked-0.435
anced-0.408
Token of
Token ID286
Feature activation-7.76e-01

Pos logit contributions
 FAR+1.960
senal+1.952
atican+1.807
 Ig+1.805
foundland+1.737
Neg logit contributions
hire-1.565
ortun-1.380
icked-1.364
image-1.361
anced-1.247
Token a
Token ID257
Feature activation-6.51e-01

Pos logit contributions
 FAR+2.833
senal+2.689
 Ig+2.621
atican+2.552
IOR+2.443
Neg logit contributions
hire-2.752
ortun-2.447
image-2.446
icked-2.387
anced-2.253
Token lot
Token ID1256
Feature activation-1.83e+00

Pos logit contributions
 FAR+1.900
senal+1.714
 Ig+1.682
atican+1.616
IOR+1.521
Neg logit contributions
hire-2.844
ortun-2.588
image-2.585
icked-2.536
anced-2.413
Token more
Token ID517
Feature activation-6.27e-01

Pos logit contributions
 FAR+5.767
senal+5.165
 Ig+5.150
atican+4.915
IOR+4.692
Neg logit contributions
hire-7.147
ortun-6.488
image-6.438
icked-6.286
anced-5.829
Token than
Token ID621
Feature activation-4.90e-01

Pos logit contributions
 FAR+2.214
senal+2.090
 Ig+2.027
atican+1.962
IOR+1.881
Neg logit contributions
hire-2.330
ortun-2.092
image-2.068
icked-2.038
anced-1.884
Token what
Token ID644
Feature activation+3.04e-01

Pos logit contributions
 FAR+1.596
senal+1.501
 Ig+1.472
atican+1.396
IOR+1.333
Neg logit contributions
hire-1.954
ortun-1.777
image-1.749
icked-1.742
anced-1.654
Token he
Token ID339
Feature activation+9.54e-02

Pos logit contributions
hire+1.226
ortun+1.115
image+1.103
icked+1.093
anced+1.026
Neg logit contributions
 FAR-0.989
senal-0.961
 Ig-0.896
atican-0.885
IOR-0.827
Token
Token ID447
Feature activation-2.04e-01

Pos logit contributions
hire+0.365
ortun+0.331
image+0.325
icked+0.321
anced+0.300
Neg logit contributions
 FAR-0.330
senal-0.316
atican-0.298
 Ig-0.298
IOR-0.278
Token minions
Token ID22811
Feature activation-1.47e-01

Pos logit contributions
hire+1.173
ortun+1.063
image+1.054
icked+1.047
anced+0.977
Neg logit contributions
 FAR-0.978
senal-0.974
atican-0.886
 Ig-0.878
foundland-0.836
Token.
Token ID13
Feature activation-5.36e-01

Pos logit contributions
 FAR+0.589
senal+0.576
 Ig+0.541
atican+0.540
foundland+0.512
Neg logit contributions
hire-0.480
ortun-0.428
image-0.419
icked-0.416
anced-0.381
Token
Token ID198
Feature activation-5.50e-02

Pos logit contributions
 FAR+1.941
 Ig+1.778
senal+1.750
atican+1.703
IOR+1.619
Neg logit contributions
hire-1.989
ortun-1.816
icked-1.760
image-1.732
anced-1.640
Token
Token ID198
Feature activation-1.29e-01

Pos logit contributions
 FAR+0.197
 Ig+0.180
atican+0.175
IOR+0.167
 SAT+0.164
Neg logit contributions
hire-0.203
ortun-0.186
icked-0.184
image-0.173
anced-0.170
TokenIGN
Token ID16284
Feature activation-1.37e+00

Pos logit contributions
 FAR+0.424
senal+0.412
atican+0.383
 Ig+0.383
IOR+0.361
Neg logit contributions
hire-0.511
ortun-0.467
icked-0.456
image-0.454
anced-0.427
Token:
Token ID25
Feature activation-1.82e+00

Pos logit contributions
 FAR+4.435
 Ig+4.106
atican+3.830
IOR+3.747
 SAT+3.660
Neg logit contributions
hire-5.122
ortun-4.880
icked-4.782
anced-4.406
uyomi-4.379
Token Sure
Token ID10889
Feature activation-7.60e-01

Pos logit contributions
 FAR+5.642
 Ig+5.268
 Oo+4.685
atican+4.661
 Ley+4.592
Neg logit contributions
hire-7.317
ortun-6.765
icked-6.495
image-6.217
odder-6.200
Token.
Token ID13
Feature activation-4.33e-01

Pos logit contributions
 FAR+3.000
senal+2.928
 Ig+2.789
atican+2.758
foundland+2.613
Neg logit contributions
hire-2.472
ortun-2.198
icked-2.140
image-2.136
anced-1.950
Token Before
Token ID7413
Feature activation-1.07e-01

Pos logit contributions
 FAR+1.525
 Ig+1.404
senal+1.357
atican+1.338
IOR+1.251
Neg logit contributions
hire-1.659
ortun-1.512
icked-1.472
image-1.438
anced-1.365
Token we
Token ID356
Feature activation+3.18e-01

Pos logit contributions
 FAR+0.359
senal+0.349
 Ig+0.326
atican+0.321
foundland+0.302
Neg logit contributions
hire-0.422
ortun-0.383
image-0.376
icked-0.375
anced-0.349
Token move
Token ID1445
Feature activation-3.15e-01

Pos logit contributions
hire+1.226
ortun+1.119
image+1.106
icked+1.088
anced+1.013
Neg logit contributions
 FAR-1.085
senal-1.042
 Ig-0.979
atican-0.975
foundland-0.911
Token States
Token ID1829
Feature activation-1.68e-02

Pos logit contributions
hire+1.326
ortun+1.199
image+1.163
icked+1.154
anced+1.051
Neg logit contributions
senal-1.739
 FAR-1.718
atican-1.589
 Ig-1.556
ftime-1.522
Token 21
Token ID2310
Feature activation+1.32e-01

Pos logit contributions
 FAR+0.055
senal+0.050
 Ig+0.050
atican+0.049
IOR+0.045
Neg logit contributions
hire-0.068
ortun-0.063
icked-0.061
image-0.060
anced-0.057
Token50
Token ID1120
Feature activation+8.05e-02

Pos logit contributions
hire+0.495
ortun+0.449
icked+0.437
image+0.432
anced+0.407
Neg logit contributions
 FAR-0.479
senal-0.449
 Ig-0.434
atican-0.433
IOR-0.405
Token Posts
Token ID12043
Feature activation+3.34e-02

Pos logit contributions
hire+0.237
ortun+0.212
icked+0.204
image+0.195
anced+0.187
Neg logit contributions
 FAR-0.360
 Ig-0.332
atican-0.326
senal-0.316
IOR-0.310
Token #
Token ID1303
Feature activation-4.20e-01

Pos logit contributions
hire+0.093
image+0.080
ortun+0.080
icked+0.077
anced+0.067
Neg logit contributions
 FAR-0.149
senal-0.149
atican-0.140
 Ig-0.136
foundland-0.135
Token16
Token ID1433
Feature activation-1.82e+00

Pos logit contributions
 FAR+0.929
 Ig+0.797
atican+0.785
IOR+0.742
ftime+0.699
Neg logit contributions
hire-2.047
ortun-1.962
icked-1.924
anced-1.821
image-1.786
Token Glad
Token ID28442
Feature activation+2.88e-01

Pos logit contributions
 FAR+5.686
 Ig+5.314
senal+4.985
atican+4.773
 SAT+4.567
Neg logit contributions
hire-7.080
ortun-6.469
icked-6.313
image-6.288
anced-6.020
Token to
Token ID284
Feature activation+2.39e-01

Pos logit contributions
hire+1.102
ortun+0.998
image+0.983
icked+0.977
anced+0.909
Neg logit contributions
 FAR-0.992
senal-0.972
 Ig-0.901
atican-0.898
foundland-0.846
Token hear
Token ID3285
Feature activation+6.74e-03

Pos logit contributions
hire+0.892
ortun+0.807
image+0.797
icked+0.789
anced+0.736
Neg logit contributions
 FAR-0.848
senal-0.820
 Ig-0.770
atican-0.768
foundland-0.717
Token everything
Token ID2279
Feature activation+1.24e-01

Pos logit contributions
hire+0.026
ortun+0.024
image+0.023
icked+0.023
anced+0.022
Neg logit contributions
 FAR-0.023
senal-0.022
 Ig-0.021
atican-0.021
foundland-0.019
Token worked
Token ID3111
Feature activation+2.74e-01

Pos logit contributions
hire+0.442
ortun+0.395
image+0.390
icked+0.386
anced+0.357
Neg logit contributions
 FAR-0.463
senal-0.447
 Ig-0.424
atican-0.421
foundland-0.396
TokenShe
Token ID3347
Feature activation-3.01e-01

Pos logit contributions
 FAR+0.927
atican+0.845
 Ig+0.845
senal+0.839
IOR+0.825
Neg logit contributions
hire-0.822
ortun-0.742
icked-0.733
image-0.691
anced-0.674
Token doesn
Token ID1595
Feature activation+6.23e-02

Pos logit contributions
 FAR+1.082
senal+0.988
atican+0.986
 Ig+0.977
lishes+0.903
Neg logit contributions
hire-1.113
ortun-1.012
image-0.991
icked-0.975
anced-0.901
Token't
Token ID470
Feature activation+1.76e-01

Pos logit contributions
hire+0.291
ortun+0.273
icked+0.266
image+0.260
anced+0.252
Neg logit contributions
 FAR-0.162
 Ig-0.141
atican-0.140
IOR-0.129
 SAT-0.122
Token want
Token ID765
Feature activation+1.88e-01

Pos logit contributions
hire+0.644
ortun+0.589
image+0.578
icked+0.576
anced+0.536
Neg logit contributions
 FAR-0.637
senal-0.579
atican-0.575
 Ig-0.575
IOR-0.533
Token to
Token ID284
Feature activation-5.12e-01

Pos logit contributions
hire+0.681
ortun+0.613
image+0.603
icked+0.599
anced+0.554
Neg logit contributions
 FAR-0.687
senal-0.675
atican-0.627
 Ig-0.625
foundland-0.591
Token alien
Token ID8756
Feature activation-1.81e+00

Pos logit contributions
 FAR+1.579
atican+1.400
 Ig+1.393
senal+1.352
IOR+1.265
Neg logit contributions
hire-2.105
ortun-1.967
image-1.929
icked-1.914
anced-1.797
Tokenate
Token ID378
Feature activation-8.06e-01

Pos logit contributions
 FAR+4.180
atican+3.955
 Ig+3.851
IOR+3.641
ftime+3.463
Neg logit contributions
hire-7.927
ortun-7.182
icked-7.177
image-7.080
odder-6.806
Token the
Token ID262
Feature activation-3.78e-01

Pos logit contributions
 FAR+2.629
 Ig+2.382
senal+2.356
atican+2.327
IOR+2.150
Neg logit contributions
hire-3.200
ortun-2.877
image-2.863
icked-2.860
odder-2.625
Token Europeans
Token ID20006
Feature activation-1.54e-01

Pos logit contributions
 FAR+1.267
senal+1.156
 Ig+1.143
atican+1.120
IOR+1.038
Neg logit contributions
hire-1.499
ortun-1.351
icked-1.331
image-1.321
anced-1.241
Token when
Token ID618
Feature activation-2.34e-03

Pos logit contributions
 FAR+0.568
senal+0.532
 Ig+0.517
atican+0.513
IOR+0.478
Neg logit contributions
hire-0.554
ortun-0.500
image-0.491
icked-0.489
anced-0.448
Token she
Token ID673
Feature activation-4.42e-01

Pos logit contributions
 FAR+0.007
senal+0.007
 Ig+0.007
atican+0.007
foundland+0.006
Neg logit contributions
hire-0.010
ortun-0.009
image-0.009
icked-0.009
anced-0.008
Token:
Token ID25
Feature activation-7.01e-01

Pos logit contributions
hire+2.637
ortun+2.400
image+2.361
icked+2.338
anced+2.168
Neg logit contributions
 FAR-2.730
senal-2.687
atican-2.511
 Ig-2.475
foundland-2.374
Token And
Token ID843
Feature activation-8.41e-01

Pos logit contributions
 FAR+2.430
 Ig+2.206
senal+2.177
atican+2.130
IOR+1.978
Neg logit contributions
hire-2.694
ortun-2.473
icked-2.394
image-2.347
anced-2.236
Token it
Token ID340
Feature activation-6.41e-01

Pos logit contributions
 FAR+2.712
senal+2.463
 Ig+2.441
atican+2.362
 SAT+2.189
Neg logit contributions
hire-3.390
ortun-3.076
icked-3.038
image-3.017
anced-2.812
Token varies
Token ID17806
Feature activation-5.50e-01

Pos logit contributions
 FAR+2.268
 Ig+2.042
senal+2.028
atican+2.024
lishes+1.888
Neg logit contributions
hire-2.406
ortun-2.166
image-2.122
icked-2.092
anced-1.936
Token a
Token ID257
Feature activation-7.87e-01

Pos logit contributions
 FAR+1.810
senal+1.608
 Ig+1.600
atican+1.541
IOR+1.446
Neg logit contributions
hire-2.234
ortun-2.028
icked-1.998
image-1.994
anced-1.838
Token bit
Token ID1643
Feature activation-1.81e+00

Pos logit contributions
 FAR+2.378
senal+2.198
 Ig+2.068
atican+2.034
lishes+1.881
Neg logit contributions
hire-3.376
ortun-3.071
image-3.042
icked-2.971
anced-2.818
Token along
Token ID1863
Feature activation-9.23e-01

Pos logit contributions
 FAR+5.588
 Ig+4.940
atican+4.656
senal+4.553
 SAT+4.393
Neg logit contributions
hire-7.141
ortun-6.586
icked-6.458
image-6.364
ドラ-5.943
Token the
Token ID262
Feature activation-2.47e-01

Pos logit contributions
 FAR+3.069
senal+2.897
 Ig+2.781
atican+2.734
lishes+2.547
Neg logit contributions
hire-3.634
ortun-3.312
icked-3.221
image-3.215
anced-3.006
Token way
Token ID835
Feature activation-4.40e-01

Pos logit contributions
 FAR+0.651
senal+0.626
 Ig+0.569
atican+0.569
foundland+0.520
Neg logit contributions
hire-1.154
ortun-1.065
image-1.050
icked-1.045
anced-0.987
Token.
Token ID13
Feature activation-6.57e-01

Pos logit contributions
 FAR+1.778
senal+1.716
 Ig+1.635
atican+1.631
foundland+1.536
Neg logit contributions
hire-1.410
ortun-1.266
image-1.231
icked-1.230
anced-1.115
Token
Token ID198
Feature activation-3.39e-01

Pos logit contributions
 FAR+2.280
 Ig+2.087
atican+1.990
senal+1.985
 SAT+1.874
Neg logit contributions
hire-2.544
ortun-2.351
icked-2.282
image-2.195
anced-2.119
Token States
Token ID1829
Feature activation-2.76e-01

Pos logit contributions
hire+0.251
ortun+0.227
image+0.221
icked+0.219
anced+0.200
Neg logit contributions
senal-0.313
 FAR-0.310
atican-0.287
 Ig-0.282
ftime-0.272
Token 10
Token ID838
Feature activation+1.37e-02

Pos logit contributions
 FAR+0.901
 Ig+0.809
senal+0.798
atican+0.790
IOR+0.723
Neg logit contributions
hire-1.129
ortun-1.049
icked-1.019
image-0.997
anced-0.957
Token53
Token ID4310
Feature activation-2.78e-01

Pos logit contributions
hire+0.053
ortun+0.049
icked+0.047
image+0.047
anced+0.044
Neg logit contributions
 FAR-0.047
senal-0.045
atican-0.043
 Ig-0.042
foundland-0.040
Token Posts
Token ID12043
Feature activation+1.10e-01

Pos logit contributions
 FAR+1.201
 Ig+1.108
atican+1.068
IOR+1.038
senal+1.022
Neg logit contributions
hire-0.858
ortun-0.777
icked-0.745
image-0.699
anced-0.690
Token #
Token ID1303
Feature activation-6.24e-01

Pos logit contributions
hire+0.306
image+0.266
ortun+0.265
icked+0.257
anced+0.224
Neg logit contributions
 FAR-0.489
senal-0.487
atican-0.457
 Ig-0.445
foundland-0.442
Token15
Token ID1314
Feature activation-1.81e+00

Pos logit contributions
 FAR+1.254
 Ig+1.042
atican+0.992
IOR+0.967
 SAT+0.905
Neg logit contributions
hire-3.113
ortun-3.014
icked-2.948
umbnail-2.813
anced-2.798
Token honestly
Token ID12698
Feature activation+9.95e-01

Pos logit contributions
 FAR+5.717
 Ig+5.389
senal+5.093
atican+4.887
 SAT+4.637
Neg logit contributions
hire-6.972
ortun-6.336
icked-6.178
image-6.145
anced-5.871
Token if
Token ID611
Feature activation-9.59e-02

Pos logit contributions
hire+3.558
ortun+3.201
image+3.171
icked+3.141
anced+2.894
Neg logit contributions
senal-3.629
 FAR-3.609
atican-3.336
 Ig-3.243
ikuman-3.136
Token we
Token ID356
Feature activation+4.39e-01

Pos logit contributions
 FAR+0.325
senal+0.317
 Ig+0.295
atican+0.293
foundland+0.275
Neg logit contributions
hire-0.373
ortun-0.339
image-0.333
icked-0.332
anced-0.310
Token just
Token ID655
Feature activation+6.34e-01

Pos logit contributions
hire+1.692
ortun+1.549
image+1.530
icked+1.510
anced+1.409
Neg logit contributions
 FAR-1.499
senal-1.429
 Ig-1.351
atican-1.343
foundland-1.257
Token had
Token ID550
Feature activation+5.54e-01

Pos logit contributions
hire+2.458
ortun+2.239
image+2.210
icked+2.189
anced+2.038
Neg logit contributions
 FAR-2.160
senal-2.083
 Ig-1.950
atican-1.940
foundland-1.818
Token on
Token ID319
Feature activation+6.08e-01

Pos logit contributions
hire+1.608
ortun+1.443
image+1.417
icked+1.412
anced+1.308
Neg logit contributions
 FAR-1.644
senal-1.631
atican-1.513
 Ig-1.494
foundland-1.436
Token Twitter
Token ID3009
Feature activation+1.26e-01

Pos logit contributions
hire+1.947
ortun+1.728
image+1.699
icked+1.673
anced+1.528
Neg logit contributions
 FAR-2.448
senal-2.441
atican-2.283
 Ig-2.240
ikuman-2.184
Token @
Token ID2488
Feature activation+9.58e-01

Pos logit contributions
hire+0.417
ortun+0.370
image+0.364
icked+0.361
anced+0.331
Neg logit contributions
 FAR-0.502
senal-0.497
atican-0.463
 Ig-0.460
foundland-0.439
TokenT
Token ID51
Feature activation-2.06e-01

Pos logit contributions
hire+2.990
ortun+2.603
image+2.588
icked+2.541
anced+2.321
Neg logit contributions
 FAR-3.991
senal-3.971
 Ig-3.716
atican-3.672
foundland-3.514
Tokenara
Token ID3301
Feature activation-2.54e-01

Pos logit contributions
 FAR+0.649
senal+0.631
atican+0.593
 Ig+0.574
IOR+0.548
Neg logit contributions
hire-0.835
ortun-0.761
image-0.749
icked-0.745
anced-0.695
TokenPer
Token ID5990
Feature activation-1.80e+00

Pos logit contributions
 FAR+1.049
senal+1.028
atican+0.968
 Ig+0.961
IOR+0.924
Neg logit contributions
hire-0.793
ortun-0.699
image-0.686
icked-0.679
anced-0.619
Tokenkins
Token ID5331
Feature activation-3.95e-01

Pos logit contributions
 FAR+3.693
atican+3.505
ftime+3.030
IOR+2.980
enburg+2.937
Neg logit contributions
hire-8.354
image-7.654
icked-7.602
ortun-7.517
ドラ-7.394
Token.
Token ID13
Feature activation+8.06e-02

Pos logit contributions
 FAR+1.647
senal+1.618
atican+1.517
 Ig+1.514
foundland+1.448
Neg logit contributions
hire-1.220
ortun-1.083
image-1.061
icked-1.056
anced-0.961
Token
Token ID198
Feature activation+2.94e-01

Pos logit contributions
hire+0.266
ortun+0.237
image+0.234
icked+0.231
anced+0.212
Neg logit contributions
 FAR-0.319
senal-0.316
atican-0.295
 Ig-0.293
foundland-0.279
Token
Token ID198
Feature activation+2.06e-01

Pos logit contributions
hire+1.031
ortun+0.933
icked+0.917
image+0.885
anced+0.841
Neg logit contributions
 FAR-1.113
 Ig-1.016
atican-1.008
IOR-0.960
senal-0.959
TokenThe
Token ID464
Feature activation+2.31e-01

Pos logit contributions
hire+0.708
ortun+0.638
icked+0.623
image+0.617
anced+0.574
Neg logit contributions
 FAR-0.785
senal-0.759
atican-0.718
 Ig-0.714
IOR-0.684
Token policy
Token ID2450
Feature activation-3.47e-01

Pos logit contributions
 FAR+2.294
senal+2.107
 Ig+1.980
atican+1.899
IOR+1.735
Neg logit contributions
hire-5.899
icked-5.443
ortun-5.421
image-5.363
anced-5.072
Token.
Token ID13
Feature activation-3.32e-01

Pos logit contributions
 FAR+1.458
senal+1.423
 Ig+1.341
atican+1.335
IOR+1.270
Neg logit contributions
hire-1.072
ortun-0.941
icked-0.922
image-0.917
anced-0.831
Token
Token ID198
Feature activation-1.46e-01

Pos logit contributions
 FAR+1.245
senal+1.155
 Ig+1.145
atican+1.131
IOR+1.046
Neg logit contributions
hire-1.190
ortun-1.076
icked-1.048
image-1.032
anced-0.971
Token
Token ID198
Feature activation-3.24e-02

Pos logit contributions
 FAR+0.520
 Ig+0.475
atican+0.462
IOR+0.437
 SAT+0.430
Neg logit contributions
hire-0.541
ortun-0.498
icked-0.490
image-0.463
anced-0.451
TokenUS
Token ID2937
Feature activation-2.03e-01

Pos logit contributions
 FAR+0.121
atican+0.112
senal+0.111
 Ig+0.111
IOR+0.107
Neg logit contributions
hire-0.113
ortun-0.103
icked-0.101
image-0.097
anced-0.093
TokenAT
Token ID1404
Feature activation-1.80e+00

Pos logit contributions
 FAR+0.706
senal+0.688
atican+0.640
 Ig+0.639
IOR+0.601
Neg logit contributions
hire-0.771
ortun-0.700
image-0.686
icked-0.683
anced-0.636
TokenOD
Token ID3727
Feature activation-2.72e-01

Pos logit contributions
 FAR+5.590
senal+5.201
 Ig+5.034
atican+4.872
IOR+4.820
Neg logit contributions
hire-7.405
ortun-6.671
icked-6.429
image-6.251
anced-6.097
TokenAY
Token ID4792
Feature activation-6.23e-01

Pos logit contributions
 FAR+0.661
senal+0.582
atican+0.573
 Ig+0.546
IOR+0.542
Neg logit contributions
hire-1.320
ortun-1.228
icked-1.211
image-1.171
anced-1.136
Token OP
Token ID13349
Feature activation+1.41e-01

Pos logit contributions
 FAR+2.470
senal+2.374
 Ig+2.247
atican+2.236
IOR+2.140
Neg logit contributions
hire-2.076
ortun-1.850
icked-1.809
image-1.790
anced-1.678
TokenIN
Token ID1268
Feature activation-2.71e-01

Pos logit contributions
hire+0.492
ortun+0.440
image+0.432
icked+0.430
anced+0.396
Neg logit contributions
 FAR-0.535
senal-0.526
atican-0.490
 Ig-0.489
foundland-0.463
TokenION
Token ID2849
Feature activation-4.90e-01

Pos logit contributions
 FAR+0.875
senal+0.826
 Ig+0.775
atican+0.774
IOR+0.749
Neg logit contributions
hire-1.113
ortun-0.999
icked-0.985
image-0.983
anced-0.915
Token ]
Token ID2361
Feature activation+1.45e-01

Pos logit contributions
hire+0.908
image+0.782
ortun+0.758
icked+0.743
anced+0.669
Neg logit contributions
senal-1.653
 FAR-1.624
 Ig-1.535
atican-1.534
foundland-1.498
Token WHERE
Token ID33411
Feature activation+5.69e-01

Pos logit contributions
hire+0.445
ortun+0.389
image+0.388
icked+0.379
anced+0.346
Neg logit contributions
senal-0.608
 FAR-0.598
atican-0.565
 Ig-0.560
foundland-0.545
Token path
Token ID3108
Feature activation+5.45e-01

Pos logit contributions
hire+2.096
ortun+1.893
image+1.888
icked+1.869
anced+1.725
Neg logit contributions
senal-2.036
 FAR-1.972
atican-1.860
 Ig-1.813
foundland-1.767
Token LIKE
Token ID34178
Feature activation-3.61e-01

Pos logit contributions
hire+1.789
image+1.603
ortun+1.561
icked+1.553
anced+1.421
Neg logit contributions
senal-2.168
 FAR-2.087
atican-1.991
 Ig-1.976
foundland-1.928
Token '
Token ID705
Feature activation-3.84e-01

Pos logit contributions
 FAR+1.139
senal+1.075
 Ig+0.997
atican+0.991
IOR+0.927
Neg logit contributions
hire-1.506
ortun-1.379
image-1.349
icked-1.347
anced-1.264
Token%
Token ID4
Feature activation-1.80e+00

Pos logit contributions
 FAR+1.438
senal+1.408
atican+1.313
 Ig+1.308
foundland+1.238
Neg logit contributions
hire-1.359
ortun-1.223
image-1.196
icked-1.191
anced-1.104
Tokenproject
Token ID16302
Feature activation+1.68e-01

Pos logit contributions
 FAR+6.567
senal+5.999
atican+5.893
 Ig+5.881
ftime+5.677
Neg logit contributions
hire-5.991
ortun-5.404
icked-5.257
image-5.077
anced-4.972
Token.
Token ID13
Feature activation-1.07e+00

Pos logit contributions
hire+0.545
ortun+0.483
image+0.477
icked+0.471
anced+0.432
Neg logit contributions
 FAR-0.675
senal-0.669
atican-0.623
 Ig-0.621
foundland-0.592
Tokencl
Token ID565
Feature activation-1.71e-01

Pos logit contributions
 FAR+4.898
senal+4.747
atican+4.557
 Ig+4.554
lishes+4.362
Neg logit contributions
hire-2.736
ortun-2.411
icked-2.315
image-2.236
anced-2.094
Tokenj
Token ID73
Feature activation+1.16e+00

Pos logit contributions
 FAR+0.765
senal+0.761
atican+0.706
 Ig+0.705
foundland+0.678
Neg logit contributions
hire-0.488
ortun-0.422
image-0.419
icked-0.416
anced-0.372
Token'
Token ID6
Feature activation+9.90e-03

Pos logit contributions
hire+3.689
image+3.326
ortun+3.202
icked+3.171
anced+2.884
Neg logit contributions
senal-4.722
 FAR-4.508
 Ig-4.246
atican-4.244
foundland-4.126
Token the
Token ID262
Feature activation+3.54e-01

Pos logit contributions
hire+1.616
ortun+1.471
image+1.450
icked+1.440
anced+1.344
Neg logit contributions
 FAR-1.328
senal-1.318
atican-1.208
 Ig-1.193
foundland-1.129
Token low
Token ID1877
Feature activation+2.54e-01

Pos logit contributions
hire+1.363
ortun+1.245
image+1.222
icked+1.214
anced+1.130
Neg logit contributions
senal-1.204
 FAR-1.194
atican-1.098
 Ig-1.078
foundland-1.030
Token orbit
Token ID13066
Feature activation-5.69e-01

Pos logit contributions
hire+0.903
ortun+0.814
image+0.799
icked+0.791
anced+0.732
Neg logit contributions
senal-0.938
 FAR-0.936
atican-0.865
 Ig-0.853
ikuman-0.812
Token of
Token ID286
Feature activation-6.20e-01

Pos logit contributions
 FAR+2.161
senal+2.057
 Ig+1.966
atican+1.932
IOR+1.833
Neg logit contributions
hire-2.010
ortun-1.788
icked-1.754
image-1.751
anced-1.612
Token the
Token ID262
Feature activation-5.50e-01

Pos logit contributions
 FAR+1.882
senal+1.774
 Ig+1.692
atican+1.624
foundland+1.531
Neg logit contributions
hire-2.648
ortun-2.409
image-2.380
icked-2.360
anced-2.239
Token space
Token ID2272
Feature activation-1.79e+00

Pos logit contributions
 FAR+1.741
senal+1.639
 Ig+1.561
atican+1.528
foundland+1.429
Neg logit contributions
hire-2.290
ortun-2.069
image-2.053
icked-2.037
anced-1.912
Token station
Token ID4429
Feature activation-8.11e-01

Pos logit contributions
 FAR+3.924
senal+3.423
 Ig+3.350
atican+3.209
foundland+2.984
Neg logit contributions
hire-8.866
ortun-8.042
icked-7.960
image-7.904
anced-7.614
Token as
Token ID355
Feature activation+5.43e-01

Pos logit contributions
 FAR+3.248
senal+3.032
 Ig+2.967
atican+2.884
IOR+2.785
Neg logit contributions
hire-2.669
ortun-2.353
icked-2.318
image-2.307
anced-2.114
Token safely
Token ID11512
Feature activation-1.89e-01

Pos logit contributions
hire+1.913
ortun+1.728
image+1.698
icked+1.682
anced+1.556
Neg logit contributions
 FAR-2.013
senal-2.010
atican-1.862
 Ig-1.839
foundland-1.761
Token as
Token ID355
Feature activation+3.55e-01

Pos logit contributions
 FAR+0.689
senal+0.665
 Ig+0.624
atican+0.621
IOR+0.587
Neg logit contributions
hire-0.686
ortun-0.621
image-0.607
icked-0.605
anced-0.557
Token possible
Token ID1744
Feature activation+1.91e-01

Pos logit contributions
hire+1.280
ortun+1.153
image+1.133
icked+1.124
anced+1.040
Neg logit contributions
 FAR-1.302
senal-1.290
atican-1.195
 Ig-1.185
foundland-1.128
Token but
Token ID475
Feature activation+4.28e-01

Pos logit contributions
hire+0.710
ortun+0.644
image+0.632
icked+0.627
anced+0.583
Neg logit contributions
 FAR-0.651
senal-0.627
 Ig-0.590
atican-0.586
foundland-0.548
Token can
Token ID460
Feature activation-1.20e-01

Pos logit contributions
hire+1.674
ortun+1.520
image+1.496
icked+1.488
anced+1.384
Neg logit contributions
 FAR-1.450
senal-1.411
 Ig-1.308
atican-1.307
foundland-1.225
Token you
Token ID345
Feature activation-1.45e-01

Pos logit contributions
 FAR+0.405
senal+0.393
 Ig+0.366
atican+0.365
foundland+0.343
Neg logit contributions
hire-0.466
ortun-0.423
image-0.415
icked-0.412
anced-0.384
Token talk
Token ID1561
Feature activation-4.49e-01

Pos logit contributions
 FAR+0.486
 Ig+0.432
senal+0.428
atican+0.421
IOR+0.398
Neg logit contributions
hire-0.574
ortun-0.525
icked-0.509
image-0.509
anced-0.475
Token a
Token ID257
Feature activation-3.12e-01

Pos logit contributions
 FAR+1.691
senal+1.617
 Ig+1.533
atican+1.518
IOR+1.437
Neg logit contributions
hire-1.600
ortun-1.426
image-1.400
icked-1.394
anced-1.272
Token little
Token ID1310
Feature activation-1.79e+00

Pos logit contributions
 FAR+1.053
senal+1.020
atican+0.946
 Ig+0.945
IOR+0.887
Neg logit contributions
hire-1.223
ortun-1.109
image-1.091
icked-1.081
anced-1.007
Token bit
Token ID1643
Feature activation-1.52e+00

Pos logit contributions
 FAR+5.016
 Ig+4.350
senal+4.200
atican+4.060
 Ley+3.872
Neg logit contributions
hire-8.025
ortun-7.262
image-7.114
icked-7.072
umbnail-6.629
Token about
Token ID546
Feature activation-3.69e-01

Pos logit contributions
 FAR+5.140
 Ig+4.578
senal+4.434
atican+4.389
IOR+4.141
Neg logit contributions
hire-5.933
ortun-5.315
image-5.203
icked-5.135
umbnail-4.732
Token what
Token ID644
Feature activation+3.08e-01

Pos logit contributions
 FAR+1.186
senal+1.123
 Ig+1.072
atican+1.037
IOR+0.970
Neg logit contributions
hire-1.513
ortun-1.367
icked-1.349
image-1.342
anced-1.251
Token you
Token ID345
Feature activation-4.57e-01

Pos logit contributions
hire+1.198
ortun+1.080
image+1.062
icked+1.056
anced+0.981
Neg logit contributions
 FAR-1.061
senal-1.022
 Ig-0.962
atican-0.950
foundland-0.891
Token responded
Token ID7082
Feature activation-6.62e-01

Pos logit contributions
 FAR+1.470
 Ig+1.299
senal+1.298
atican+1.298
IOR+1.191
Neg logit contributions
hire-1.834
ortun-1.689
image-1.667
icked-1.637
-1.521
Token Conference
Token ID8785
Feature activation-1.66e-01

Pos logit contributions
 FAR+1.141
senal+1.103
 Ig+1.043
atican+1.038
foundland+0.982
Neg logit contributions
hire-1.059
ortun-0.952
image-0.935
icked-0.931
anced-0.859
Token in
Token ID287
Feature activation+4.35e-03

Pos logit contributions
 FAR+0.556
senal+0.529
 Ig+0.503
atican+0.498
IOR+0.465
Neg logit contributions
hire-0.650
ortun-0.590
image-0.580
icked-0.577
anced-0.542
Token H
Token ID367
Feature activation+2.10e-01

Pos logit contributions
hire+0.017
ortun+0.016
image+0.015
icked+0.015
anced+0.014
Neg logit contributions
 FAR-0.015
senal-0.014
 Ig-0.013
atican-0.013
IOR-0.012
Tokenilo
Token ID18526
Feature activation+2.14e-01

Pos logit contributions
hire+0.808
ortun+0.728
image+0.717
icked+0.712
anced+0.662
Neg logit contributions
 FAR-0.720
senal-0.700
atican-0.656
 Ig-0.650
IOR-0.610
Token,
Token ID11
Feature activation+4.44e-01

Pos logit contributions
hire+0.804
ortun+0.728
image+0.715
icked+0.711
anced+0.661
Neg logit contributions
 FAR-0.758
senal-0.737
 Ig-0.689
atican-0.686
foundland-0.645
Token Hawaii
Token ID13708
Feature activation+1.24e+00

Pos logit contributions
hire+1.752
ortun+1.591
image+1.568
icked+1.558
anced+1.453
Neg logit contributions
 FAR-1.480
senal-1.455
atican-1.339
 Ig-1.334
foundland-1.255
Token June
Token ID2795
Feature activation-6.34e-01

Pos logit contributions
hire+4.138
ortun+3.695
image+3.648
icked+3.604
anced+3.309
Neg logit contributions
senal-4.825
 FAR-4.762
atican-4.418
 Ig-4.356
ikuman-4.205
Token 10
Token ID838
Feature activation+3.57e-01

Pos logit contributions
 FAR+1.880
 Ig+1.714
atican+1.598
 SAT+1.541
IOR+1.491
Neg logit contributions
hire-2.739
ortun-2.530
icked-2.506
image-2.384
anced-2.357
Token.
Token ID13
Feature activation+1.53e-01

Pos logit contributions
hire+1.086
ortun+0.954
image+0.944
icked+0.924
anced+0.842
Neg logit contributions
senal-1.507
 FAR-1.506
atican-1.395
 Ig-1.390
ikuman-1.335
Token Rep
Token ID1432
Feature activation+3.81e-01

Pos logit contributions
hire+0.555
ortun+0.500
icked+0.487
image+0.487
anced+0.451
Neg logit contributions
 FAR-0.562
senal-0.546
atican-0.511
 Ig-0.511
IOR-0.480
Token.
Token ID13
Feature activation-4.59e-02

Pos logit contributions
hire+0.998
ortun+0.877
image+0.845
icked+0.832
anced+0.747
Neg logit contributions
 FAR-1.790
senal-1.770
 Ig-1.648
atican-1.648
foundland-1.595
Token part
Token ID636
Feature activation-5.35e-01

Pos logit contributions
 FAR+0.834
senal+0.804
 Ig+0.749
atican+0.743
foundland+0.696
Neg logit contributions
hire-0.960
ortun-0.874
icked-0.859
image-0.854
anced-0.800
Token,
Token ID11
Feature activation-1.07e-01

Pos logit contributions
 FAR+2.155
senal+2.124
atican+1.985
 Ig+1.979
foundland+1.883
Neg logit contributions
hire-1.738
ortun-1.546
image-1.517
icked-1.506
anced-1.379
Token the
Token ID262
Feature activation-5.35e-02

Pos logit contributions
 FAR+0.369
senal+0.363
atican+0.335
 Ig+0.333
foundland+0.315
Neg logit contributions
hire-0.412
ortun-0.373
image-0.367
icked-0.365
anced-0.339
Token government
Token ID1230
Feature activation+2.51e-01

Pos logit contributions
 FAR+0.179
senal+0.174
 Ig+0.161
atican+0.161
foundland+0.150
Neg logit contributions
hire-0.212
ortun-0.192
image-0.189
icked-0.188
anced-0.176
Token blames
Token ID34061
Feature activation-8.69e-02

Pos logit contributions
hire+0.925
ortun+0.833
image+0.821
icked+0.815
anced+0.756
Neg logit contributions
 FAR-0.903
senal-0.891
atican-0.824
 Ig-0.820
foundland-0.777
Token what
Token ID644
Feature activation+1.30e+00

Pos logit contributions
 FAR+0.296
senal+0.288
 Ig+0.268
atican+0.266
foundland+0.249
Neg logit contributions
hire-0.339
ortun-0.307
image-0.301
icked-0.300
anced-0.279
Token it
Token ID340
Feature activation+1.86e-01

Pos logit contributions
hire+4.396
ortun+3.991
image+3.935
icked+3.904
anced+3.566
Neg logit contributions
senal-4.858
 FAR-4.830
atican-4.487
 Ig-4.328
ikuman-4.308
Token calls
Token ID3848
Feature activation+3.50e-01

Pos logit contributions
hire+0.729
ortun+0.664
image+0.649
icked+0.648
anced+0.603
Neg logit contributions
 FAR-0.630
senal-0.601
atican-0.568
 Ig-0.567
foundland-0.526
Token an
Token ID281
Feature activation+4.78e-01

Pos logit contributions
hire+1.328
ortun+1.201
image+1.185
icked+1.178
anced+1.097
Neg logit contributions
 FAR-1.203
senal-1.199
atican-1.102
 Ig-1.089
foundland-1.039
Token
Token ID564
Feature activation+4.58e-02

Pos logit contributions
hire+1.883
ortun+1.712
image+1.690
icked+1.677
anced+1.566
Neg logit contributions
 FAR-1.576
senal-1.567
atican-1.441
 Ig-1.417
foundland-1.352
Token
Token ID250
Feature activation+2.67e-01

Pos logit contributions
hire+0.180
ortun+0.164
image+0.160
icked+0.160
anced+0.150
Neg logit contributions
 FAR-0.152
senal-0.149
 Ig-0.138
atican-0.138
foundland-0.129
Token Passage
Token ID46169
Feature activation+2.59e-01

Pos logit contributions
hire+0.330
ortun+0.297
image+0.293
icked+0.291
anced+0.269
Neg logit contributions
 FAR-0.322
senal-0.318
atican-0.294
 Ig-0.293
foundland-0.277
Token
Token ID198
Feature activation+3.59e-02

Pos logit contributions
hire+0.940
ortun+0.839
image+0.827
icked+0.818
anced+0.756
Neg logit contributions
senal-0.946
 FAR-0.944
atican-0.869
 Ig-0.851
foundland-0.819
Token
Token ID198
Feature activation-1.16e-01

Pos logit contributions
hire+0.131
ortun+0.120
icked+0.118
image+0.113
anced+0.109
Neg logit contributions
 FAR-0.131
 Ig-0.120
atican-0.116
IOR-0.110
 SAT-0.109
TokenM
Token ID44
Feature activation-7.95e-02

Pos logit contributions
 FAR+0.433
senal+0.423
atican+0.397
 Ig+0.396
IOR+0.377
Neg logit contributions
hire-0.406
ortun-0.366
icked-0.358
image-0.357
anced-0.331
Tokenael
Token ID3010
Feature activation+7.02e-01

Pos logit contributions
 FAR+0.302
senal+0.295
atican+0.279
 Ig+0.277
foundland+0.262
Neg logit contributions
hire-0.273
ortun-0.245
image-0.240
icked-0.238
anced-0.220
Tokenstrom
Token ID20282
Feature activation+1.15e+00

Pos logit contributions
hire+3.417
image+3.138
ortun+3.122
icked+3.079
anced+2.878
Neg logit contributions
 FAR-1.735
senal-1.711
atican-1.479
 Ig-1.438
lishes-1.354
Token Pulse
Token ID25062
Feature activation+1.15e+00

Pos logit contributions
hire+4.214
ortun+3.760
image+3.728
icked+3.642
+3.293
Neg logit contributions
senal-4.166
 FAR-4.065
ikuman-3.724
atican-3.683
lishes-3.657
Token vs
Token ID3691
Feature activation-2.05e-02

Pos logit contributions
hire+4.663
image+4.116
icked+4.073
ortun+4.041
anced+3.811
Neg logit contributions
senal-3.604
 FAR-3.560
atican-3.262
foundland-3.135
ikuman-3.131
Token Paradise
Token ID20494
Feature activation-5.07e-02

Pos logit contributions
 FAR+0.069
senal+0.068
atican+0.062
 Ig+0.062
foundland+0.059
Neg logit contributions
hire-0.080
ortun-0.073
image-0.072
icked-0.071
anced-0.066
Token M
Token ID337
Feature activation+5.09e-01

Pos logit contributions
 FAR+0.177
senal+0.176
atican+0.161
 Ig+0.160
foundland+0.152
Neg logit contributions
hire-0.192
ortun-0.173
image-0.171
icked-0.170
anced-0.157
Tokenantle
Token ID16941
Feature activation+1.32e+00

Pos logit contributions
hire+1.680
ortun+1.495
image+1.471
icked+1.458
anced+1.339
Neg logit contributions
 FAR-2.025
senal-2.005
atican-1.859
 Ig-1.857
foundland-1.772
TokenNever
Token ID12295
Feature activation+6.95e-02

Pos logit contributions
hire+0.110
ortun+0.100
icked+0.098
image+0.094
anced+0.091
Neg logit contributions
 FAR-0.131
senal-0.121
 Ig-0.119
atican-0.118
IOR-0.116
Token in
Token ID287
Feature activation-5.25e-02

Pos logit contributions
hire+0.273
ortun+0.248
image+0.244
icked+0.243
anced+0.227
Neg logit contributions
 FAR-0.233
senal-0.225
 Ig-0.209
atican-0.209
foundland-0.197
Token my
Token ID616
Feature activation-2.11e-01

Pos logit contributions
 FAR+0.182
senal+0.173
 Ig+0.163
atican+0.161
foundland+0.153
Neg logit contributions
hire-0.202
ortun-0.184
image-0.181
icked-0.180
anced-0.167
Token wild
Token ID4295
Feature activation-4.30e-01

Pos logit contributions
 FAR+0.662
senal+0.629
 Ig+0.584
atican+0.581
foundland+0.540
Neg logit contributions
hire-0.881
ortun-0.806
image-0.793
icked-0.791
anced-0.739
Tokenest
Token ID395
Feature activation+2.97e-01

Pos logit contributions
 FAR+1.740
senal+1.701
atican+1.606
 Ig+1.598
foundland+1.526
Neg logit contributions
hire-1.380
ortun-1.224
image-1.202
icked-1.192
anced-1.091
Token imagination
Token ID13843
Feature activation+1.17e+00

Pos logit contributions
hire+0.789
ortun+0.685
image+0.669
icked+0.661
anced+0.591
Neg logit contributions
 FAR-1.365
senal-1.354
atican-1.272
 Ig-1.267
foundland-1.216
Token,
Token ID11
Feature activation+2.27e-01

Pos logit contributions
hire+4.166
ortun+3.727
image+3.679
icked+3.656
anced+3.389
Neg logit contributions
senal-4.358
 FAR-4.283
atican-3.954
 Ig-3.911
ikuman-3.754
Token with
Token ID351
Feature activation+1.35e-01

Pos logit contributions
hire+0.880
ortun+0.800
image+0.787
icked+0.780
anced+0.727
Neg logit contributions
 FAR-0.780
senal-0.751
 Ig-0.704
atican-0.699
foundland-0.655
Token the
Token ID262
Feature activation-1.95e-01

Pos logit contributions
hire+0.529
ortun+0.480
image+0.473
icked+0.469
anced+0.437
Neg logit contributions
 FAR-0.458
senal-0.442
 Ig-0.411
atican-0.409
foundland-0.385
Token political
Token ID1964
Feature activation+2.31e-01

Pos logit contributions
 FAR+0.663
senal+0.626
 Ig+0.594
atican+0.587
foundland+0.548
Neg logit contributions
hire-0.762
ortun-0.689
image-0.680
icked-0.679
anced-0.634
Token capital
Token ID3139
Feature activation-6.47e-01

Pos logit contributions
hire+0.894
ortun+0.810
image+0.798
icked+0.793
anced+0.738
Neg logit contributions
 FAR-0.790
senal-0.776
atican-0.717
 Ig-0.714
foundland-0.673
Token spread
Token ID4104
Feature activation+2.96e-01

Pos logit contributions
hire+0.016
ortun+0.014
image+0.014
icked+0.014
anced+0.013
Neg logit contributions
 FAR-0.017
senal-0.016
 Ig-0.016
atican-0.015
IOR-0.015
Token the
Token ID262
Feature activation+4.11e-01

Pos logit contributions
hire+1.100
ortun+0.994
image+0.979
icked+0.971
anced+0.902
Neg logit contributions
 FAR-1.050
senal-1.035
atican-0.958
 Ig-0.953
foundland-0.901
Token word
Token ID1573
Feature activation-2.08e-01

Pos logit contributions
hire+1.652
ortun+1.502
image+1.478
icked+1.470
anced+1.373
Neg logit contributions
 FAR-1.345
senal-1.319
atican-1.213
 Ig-1.211
foundland-1.135
Token about
Token ID546
Feature activation+3.67e-01

Pos logit contributions
 FAR+0.743
senal+0.714
 Ig+0.670
atican+0.663
foundland+0.625
Neg logit contributions
hire-0.777
ortun-0.704
image-0.689
icked-0.686
anced-0.634
Token this
Token ID428
Feature activation+3.46e-01

Pos logit contributions
hire+1.317
ortun+1.185
image+1.167
icked+1.158
anced+1.070
Neg logit contributions
 FAR-1.349
senal-1.335
atican-1.240
 Ig-1.226
foundland-1.169
Token campaign
Token ID1923
Feature activation+1.31e+00

Pos logit contributions
hire+1.325
ortun+1.199
image+1.177
icked+1.173
anced+1.090
Neg logit contributions
 FAR-1.204
senal-1.173
 Ig-1.088
atican-1.085
foundland-1.018
Token to
Token ID284
Feature activation-9.97e-02

Pos logit contributions
hire+4.593
ortun+4.116
image+4.071
icked+4.028
anced+3.740
Neg logit contributions
senal-4.828
 FAR-4.646
atican-4.469
 Ig-4.318
ikuman-4.224
Token your
Token ID534
Feature activation+8.87e-02

Pos logit contributions
 FAR+0.318
senal+0.299
 Ig+0.284
atican+0.280
IOR+0.260
Neg logit contributions
hire-0.411
ortun-0.374
image-0.369
icked-0.368
anced-0.343
Token friends
Token ID2460
Feature activation-1.16e-01

Pos logit contributions
hire+0.323
ortun+0.290
image+0.286
icked+0.284
anced+0.262
Neg logit contributions
 FAR-0.325
senal-0.312
 Ig-0.294
atican-0.293
foundland-0.275
Token,
Token ID11
Feature activation+2.10e-01

Pos logit contributions
 FAR+0.419
senal+0.407
 Ig+0.379
atican+0.379
foundland+0.357
Neg logit contributions
hire-0.425
ortun-0.385
image-0.378
icked-0.376
anced-0.347
Token family
Token ID1641
Feature activation-3.23e-01

Pos logit contributions
hire+0.783
ortun+0.707
image+0.696
icked+0.691
anced+0.642
Neg logit contributions
 FAR-0.745
senal-0.734
atican-0.679
 Ig-0.676
foundland-0.639
Token/
Token ID14
Feature activation+8.78e-02

Pos logit contributions
 FAR+0.280
senal+0.276
atican+0.258
 Ig+0.257
foundland+0.245
Neg logit contributions
hire-0.218
ortun-0.193
image-0.189
icked-0.188
anced-0.172
Tokenpop
Token ID12924
Feature activation+1.64e-01

Pos logit contributions
hire+0.289
ortun+0.257
image+0.256
icked+0.250
anced+0.229
Neg logit contributions
 FAR-0.352
senal-0.350
 Ig-0.324
atican-0.324
foundland-0.307
Token/
Token ID14
Feature activation-6.15e-02

Pos logit contributions
hire+0.508
ortun+0.450
image+0.445
icked+0.438
anced+0.399
Neg logit contributions
 FAR-0.682
senal-0.679
atican-0.634
 Ig-0.629
foundland-0.600
Tokenpop
Token ID12924
Feature activation+6.17e-02

Pos logit contributions
 FAR+0.236
senal+0.233
atican+0.217
 Ig+0.216
foundland+0.205
Neg logit contributions
hire-0.212
ortun-0.189
image-0.187
icked-0.185
anced-0.170
Token/
Token ID14
Feature activation-1.87e-01

Pos logit contributions
hire+0.205
ortun+0.183
image+0.179
icked+0.178
anced+0.163
Neg logit contributions
 FAR-0.244
senal-0.241
atican-0.225
 Ig-0.224
foundland-0.213
Tokenret
Token ID1186
Feature activation+1.50e-01

Pos logit contributions
 FAR+0.714
senal+0.704
atican+0.655
 Ig+0.653
foundland+0.620
Neg logit contributions
hire-0.646
ortun-0.579
image-0.568
icked-0.565
anced-0.521
Token,
Token ID837
Feature activation+2.61e-01

Pos logit contributions
hire+0.524
ortun+0.469
image+0.460
icked+0.457
anced+0.422
Neg logit contributions
 FAR-0.567
senal-0.556
atican-0.518
 Ig-0.517
foundland-0.490
Token which
Token ID543
Feature activation+2.16e-01

Pos logit contributions
hire+1.046
ortun+0.947
image+0.933
icked+0.926
anced+0.867
Neg logit contributions
 FAR-0.863
senal-0.833
 Ig-0.775
atican-0.769
IOR-0.723
Token we
Token ID356
Feature activation+3.76e-01

Pos logit contributions
hire+0.825
ortun+0.737
icked+0.725
image+0.724
anced+0.673
Neg logit contributions
 FAR-0.769
senal-0.721
 Ig-0.693
atican-0.687
IOR-0.644
Token'll
Token ID1183
Feature activation-2.36e-01

Pos logit contributions
hire+1.350
ortun+1.220
image+1.206
icked+1.184
anced+1.100
Neg logit contributions
 FAR-1.398
senal-1.309
 Ig-1.272
atican-1.258
IOR-1.187
Token call
Token ID869
Feature activation+2.22e-01

Pos logit contributions
 FAR+0.795
 Ig+0.707
atican+0.691
senal+0.691
IOR+0.647
Neg logit contributions
hire-0.935
ortun-0.855
image-0.838
icked-0.833
anced-0.787
Token and
Token ID290
Feature activation+2.40e-01

Pos logit contributions
 FAR+1.168
senal+1.145
atican+1.061
 Ig+1.058
foundland+0.997
Neg logit contributions
hire-1.249
ortun-1.130
image-1.112
icked-1.106
anced-1.025
Token 1949
Token ID24977
Feature activation+3.12e-01

Pos logit contributions
hire+1.015
ortun+0.929
image+0.915
icked+0.911
anced+0.854
Neg logit contributions
 FAR-0.736
senal-0.716
atican-0.658
 Ig-0.657
foundland-0.609
Token,
Token ID11
Feature activation+7.34e-01

Pos logit contributions
hire+0.980
ortun+0.866
image+0.851
icked+0.839
anced+0.774
Neg logit contributions
 FAR-1.269
senal-1.268
atican-1.186
 Ig-1.167
foundland-1.133
Token the
Token ID262
Feature activation+3.54e-01

Pos logit contributions
hire+2.811
ortun+2.542
image+2.515
icked+2.490
anced+2.327
Neg logit contributions
senal-2.481
 FAR-2.479
atican-2.291
 Ig-2.248
foundland-2.163
Token Rams
Token ID16077
Feature activation+3.34e-03

Pos logit contributions
hire+1.392
ortun+1.264
image+1.245
icked+1.237
anced+1.154
Neg logit contributions
 FAR-1.183
senal-1.162
atican-1.070
 Ig-1.067
foundland-1.002
Token and
Token ID290
Feature activation+1.05e-01

Pos logit contributions
hire+0.013
ortun+0.011
image+0.011
icked+0.011
anced+0.010
Neg logit contributions
 FAR-0.012
senal-0.011
atican-0.011
 Ig-0.011
foundland-0.010
Token their
Token ID511
Feature activation-4.31e-02

Pos logit contributions
hire+0.424
ortun+0.386
image+0.380
icked+0.378
anced+0.353
Neg logit contributions
 FAR-0.343
senal-0.336
atican-0.309
 Ig-0.309
foundland-0.289
Token former
Token ID1966
Feature activation-3.69e-01

Pos logit contributions
 FAR+0.145
senal+0.140
 Ig+0.131
atican+0.130
foundland+0.122
Neg logit contributions
hire-0.168
ortun-0.153
image-0.151
icked-0.150
anced-0.140
Token city
Token ID1748
Feature activation+2.81e-01

Pos logit contributions
 FAR+1.247
senal+1.170
 Ig+1.131
atican+1.100
foundland+1.032
Neg logit contributions
hire-1.427
ortun-1.308
icked-1.287
image-1.286
anced-1.197
Token existed
Token ID11196
Feature activation+2.48e-01

Pos logit contributions
hire+1.024
ortun+0.923
image+0.908
icked+0.902
anced+0.836
Neg logit contributions
 FAR-1.017
senal-1.001
atican-0.928
 Ig-0.925
foundland-0.875
Token in
Token ID287
Feature activation+2.57e-02

Pos logit contributions
hire+1.004
ortun+0.914
image+0.900
icked+0.895
anced+0.835
Neg logit contributions
 FAR-0.806
senal-0.789
atican-0.726
 Ig-0.724
foundland-0.679
Token as
Token ID355
Feature activation+5.85e-01

Pos logit contributions
 FAR+0.572
senal+0.554
 Ig+0.513
atican+0.508
foundland+0.481
Neg logit contributions
hire-0.651
ortun-0.596
icked-0.584
image-0.583
anced-0.541
Token well
Token ID880
Feature activation+1.62e-01

Pos logit contributions
hire+2.224
ortun+2.015
image+1.989
icked+1.966
anced+1.832
Neg logit contributions
senal-2.010
 FAR-2.008
atican-1.845
 Ig-1.826
foundland-1.739
Token as
Token ID355
Feature activation+1.59e-02

Pos logit contributions
hire+0.524
ortun+0.461
image+0.455
icked+0.447
anced+0.418
Neg logit contributions
 FAR-0.650
senal-0.643
atican-0.607
 Ig-0.597
foundland-0.578
Token all
Token ID477
Feature activation+7.88e-01

Pos logit contributions
hire+0.061
ortun+0.055
image+0.055
icked+0.054
anced+0.051
Neg logit contributions
 FAR-0.055
senal-0.053
 Ig-0.050
atican-0.049
IOR-0.046
Token Mercenary
Token ID49969
Feature activation+4.35e-01

Pos logit contributions
hire+2.924
ortun+2.630
image+2.588
icked+2.565
anced+2.386
Neg logit contributions
 FAR-2.784
senal-2.766
atican-2.558
 Ig-2.523
foundland-2.394
Token units
Token ID4991
Feature activation+7.42e-02

Pos logit contributions
hire+1.767
ortun+1.606
image+1.588
icked+1.574
anced+1.473
Neg logit contributions
 FAR-1.390
senal-1.374
atican-1.263
 Ig-1.246
foundland-1.175
Token and
Token ID290
Feature activation-1.74e-01

Pos logit contributions
hire+0.284
ortun+0.258
image+0.253
icked+0.252
anced+0.234
Neg logit contributions
 FAR-0.257
senal-0.249
 Ig-0.232
atican-0.231
foundland-0.217
Token Mercenary
Token ID49969
Feature activation+3.43e-01

Pos logit contributions
 FAR+0.564
senal+0.543
 Ig+0.508
atican+0.499
foundland+0.468
Neg logit contributions
hire-0.705
ortun-0.642
icked-0.633
image-0.633
anced-0.589
Token character
Token ID2095
Feature activation-2.08e-02

Pos logit contributions
hire+1.517
ortun+1.391
image+1.376
icked+1.366
anced+1.287
Neg logit contributions
 FAR-0.973
senal-0.960
atican-0.871
 Ig-0.859
foundland-0.803
Token sol
Token ID1540
Feature activation-4.29e-01

Pos logit contributions
 FAR+0.059
senal+0.058
atican+0.052
 Ig+0.052
foundland+0.048
Neg logit contributions
hire-0.092
ortun-0.085
image-0.084
icked-0.083
anced-0.078
Tokenos
Token ID418
Feature activation-3.28e-01

Pos logit contributions
 FAR+1.511
senal+1.471
atican+1.374
 Ig+1.368
IOR+1.286
Neg logit contributions
hire-1.603
ortun-1.439
image-1.420
icked-1.410
anced-1.302
Token into
Token ID656
Feature activation+3.58e-01

Pos logit contributions
hire+0.343
ortun+0.311
image+0.306
icked+0.305
anced+0.284
Neg logit contributions
 FAR-0.299
senal-0.294
atican-0.271
 Ig-0.270
foundland-0.254
Token my
Token ID616
Feature activation+2.20e-01

Pos logit contributions
hire+1.276
ortun+1.157
image+1.134
icked+1.125
anced+1.043
Neg logit contributions
 FAR-1.316
senal-1.306
atican-1.212
 Ig-1.192
foundland-1.135
Token bag
Token ID6131
Feature activation+5.84e-01

Pos logit contributions
hire+0.851
ortun+0.769
image+0.758
icked+0.754
anced+0.701
Neg logit contributions
 FAR-0.755
senal-0.736
 Ig-0.682
atican-0.681
foundland-0.641
Token,
Token ID11
Feature activation+1.30e-01

Pos logit contributions
hire+2.002
ortun+1.806
image+1.768
icked+1.766
anced+1.620
Neg logit contributions
 FAR-2.227
senal-2.226
atican-2.054
 Ig-2.028
ikuman-1.938
Token I
Token ID314
Feature activation+8.20e-02

Pos logit contributions
hire+0.493
ortun+0.445
image+0.439
icked+0.435
anced+0.405
Neg logit contributions
 FAR-0.455
senal-0.446
atican-0.413
 Ig-0.413
foundland-0.389
Token take
Token ID1011
Feature activation+5.36e-01

Pos logit contributions
hire+0.305
ortun+0.277
image+0.273
icked+0.271
anced+0.253
Neg logit contributions
 FAR-0.287
 Ig-0.263
senal-0.262
atican-0.258
IOR-0.242
Token out
Token ID503
Feature activation-1.52e-01

Pos logit contributions
hire+1.947
ortun+1.754
image+1.725
icked+1.714
anced+1.587
Neg logit contributions
 FAR-1.951
senal-1.923
atican-1.782
 Ig-1.774
foundland-1.681
Token my
Token ID616
Feature activation-2.17e-01

Pos logit contributions
 FAR+0.563
senal+0.553
 Ig+0.514
atican+0.514
foundland+0.486
Neg logit contributions
hire-0.541
ortun-0.485
image-0.477
icked-0.475
anced-0.438
Token letter
Token ID3850
Feature activation-8.54e-03

Pos logit contributions
 FAR+0.781
senal+0.756
 Ig+0.717
atican+0.700
foundland+0.662
Neg logit contributions
hire-0.802
ortun-0.717
icked-0.705
image-0.704
anced-0.651
Token from
Token ID422
Feature activation+7.06e-01

Pos logit contributions
 FAR+0.031
senal+0.030
atican+0.028
 Ig+0.028
foundland+0.027
Neg logit contributions
hire-0.031
ortun-0.028
image-0.028
icked-0.028
anced-0.026
Token her
Token ID607
Feature activation-5.73e-02

Pos logit contributions
hire+2.673
ortun+2.431
image+2.396
icked+2.367
anced+2.200
Neg logit contributions
 FAR-2.429
senal-2.422
atican-2.234
 Ig-2.186
foundland-2.090
Tokenve
Token ID303
Feature activation-8.24e-02

Pos logit contributions
hire+1.044
ortun+0.948
image+0.915
icked+0.913
anced+0.833
Neg logit contributions
 FAR-1.390
senal-1.361
atican-1.285
 Ig-1.280
foundland-1.221
Token got
Token ID1392
Feature activation+5.46e-04

Pos logit contributions
 FAR+0.252
senal+0.237
 Ig+0.227
atican+0.222
IOR+0.206
Neg logit contributions
hire-0.347
ortun-0.317
image-0.314
icked-0.310
anced-0.292
Token in
Token ID287
Feature activation-7.10e-02

Pos logit contributions
hire+0.002
ortun+0.002
image+0.002
icked+0.002
anced+0.002
Neg logit contributions
 FAR-0.002
senal-0.002
 Ig-0.002
atican-0.002
foundland-0.001
Token store
Token ID3650
Feature activation+2.19e-02

Pos logit contributions
 FAR+0.249
senal+0.243
 Ig+0.227
atican+0.224
foundland+0.211
Neg logit contributions
hire-0.268
ortun-0.242
image-0.237
icked-0.236
anced-0.221
Token for
Token ID329
Feature activation+3.22e-02

Pos logit contributions
hire+0.088
ortun+0.080
image+0.079
icked+0.079
anced+0.074
Neg logit contributions
 FAR-0.071
senal-0.069
atican-0.064
 Ig-0.064
foundland-0.060
Token
Token ID1399
Feature activation+4.39e-01

Pos logit contributions
hire+0.123
ortun+0.112
image+0.110
icked+0.109
anced+0.102
Neg logit contributions
 FAR-0.111
senal-0.109
 Ig-0.101
atican-0.100
foundland-0.095
Token well
Token ID880
Feature activation-3.79e-01

Pos logit contributions
hire+1.482
ortun+1.315
image+1.296
icked+1.279
anced+1.172
Neg logit contributions
 FAR-1.715
senal-1.699
atican-1.589
 Ig-1.571
foundland-1.500
Token,
Token ID11
Feature activation-3.81e-01

Pos logit contributions
 FAR+1.519
senal+1.501
atican+1.402
 Ig+1.395
foundland+1.330
Neg logit contributions
hire-1.236
ortun-1.099
image-1.079
icked-1.070
anced-0.980
Token the
Token ID262
Feature activation+1.78e-02

Pos logit contributions
 FAR+1.254
senal+1.196
 Ig+1.128
atican+1.099
foundland+1.040
Neg logit contributions
hire-1.516
ortun-1.380
image-1.369
icked-1.360
anced-1.270
Token Store
Token ID9363
Feature activation-3.83e-01

Pos logit contributions
hire+0.070
ortun+0.064
image+0.063
icked+0.063
anced+0.058
Neg logit contributions
 FAR-0.059
senal-0.058
 Ig-0.053
atican-0.053
foundland-0.050
Token!
Token ID0
Feature activation-2.64e-01

Pos logit contributions
 FAR+1.581
senal+1.550
 Ig+1.454
atican+1.452
foundland+1.378
Neg logit contributions
hire-1.202
ortun-1.066
icked-1.040
image-1.037
anced-0.948
Token seasons
Token ID7028
Feature activation+6.31e-01

Pos logit contributions
hire+0.036
ortun+0.032
image+0.032
icked+0.032
anced+0.029
Neg logit contributions
 FAR-0.041
senal-0.039
 Ig-0.037
atican-0.037
foundland-0.035
Token (
Token ID357
Feature activation+7.06e-02

Pos logit contributions
hire+2.124
ortun+1.889
image+1.858
icked+1.851
anced+1.701
Neg logit contributions
 FAR-2.462
senal-2.444
atican-2.269
 Ig-2.252
foundland-2.142
Tokenand
Token ID392
Feature activation+8.18e-02

Pos logit contributions
hire+0.251
ortun+0.228
icked+0.221
image+0.219
anced+0.207
Neg logit contributions
 FAR-0.259
senal-0.253
atican-0.237
 Ig-0.237
foundland-0.225
Token produced
Token ID4635
Feature activation-4.60e-01

Pos logit contributions
hire+0.322
ortun+0.293
image+0.289
icked+0.287
anced+0.268
Neg logit contributions
 FAR-0.273
senal-0.265
 Ig-0.247
atican-0.244
foundland-0.230
Token one
Token ID530
Feature activation-7.31e-02

Pos logit contributions
 FAR+1.574
senal+1.504
 Ig+1.429
atican+1.386
foundland+1.314
Neg logit contributions
hire-1.769
ortun-1.617
image-1.590
icked-1.572
anced-1.470
Token movie
Token ID3807
Feature activation-1.79e-01

Pos logit contributions
 FAR+0.261
senal+0.251
 Ig+0.239
atican+0.235
foundland+0.221
Neg logit contributions
hire-0.271
ortun-0.246
image-0.241
icked-0.239
anced-0.223
Token),
Token ID828
Feature activation+6.23e-03

Pos logit contributions
 FAR+0.631
senal+0.604
 Ig+0.574
atican+0.567
foundland+0.538
Neg logit contributions
hire-0.669
ortun-0.609
image-0.598
icked-0.592
anced-0.550
Token in
Token ID287
Feature activation+2.36e-01

Pos logit contributions
hire+0.025
ortun+0.022
image+0.022
icked+0.022
anced+0.020
Neg logit contributions
 FAR-0.021
senal-0.020
 Ig-0.019
atican-0.019
foundland-0.017
Token which
Token ID543
Feature activation+5.27e-01

Pos logit contributions
hire+0.992
ortun+0.907
image+0.893
icked+0.888
anced+0.833
Neg logit contributions
 FAR-0.730
senal-0.710
 Ig-0.654
atican-0.653
foundland-0.609
Token Sed
Token ID22710
Feature activation-5.19e-01

Pos logit contributions
hire+2.096
ortun+1.909
image+1.876
icked+1.869
anced+1.749
Neg logit contributions
senal-1.716
 FAR-1.714
atican-1.573
 Ig-1.535
foundland-1.468
Tokenaris
Token ID20066
Feature activation+6.54e-03

Pos logit contributions
 FAR+1.291
atican+1.157
 Ig+1.121
lishes+1.087
IOR+1.053
Neg logit contributions
hire-2.355
icked-2.196
ortun-2.167
image-2.149
ドラ-2.038
Token them
Token ID606
Feature activation-2.18e-01

Pos logit contributions
 FAR+1.102
senal+1.048
 Ig+1.002
atican+0.986
IOR+0.932
Neg logit contributions
hire-1.133
ortun-1.025
image-1.009
icked-1.008
anced-0.930
Token.
Token ID13
Feature activation+2.92e-01

Pos logit contributions
 FAR+0.853
senal+0.813
 Ig+0.779
atican+0.774
IOR+0.734
Neg logit contributions
hire-0.731
ortun-0.656
image-0.644
icked-0.639
anced-0.584
Token "
Token ID366
Feature activation+2.90e-01

Pos logit contributions
hire+1.142
ortun+1.053
icked+1.027
image+1.010
anced+0.958
Neg logit contributions
 FAR-0.997
senal-0.922
 Ig-0.904
atican-0.884
IOR-0.830
TokenYou
Token ID1639
Feature activation-1.61e-01

Pos logit contributions
hire+0.974
ortun+0.891
icked+0.868
image+0.838
anced+0.800
Neg logit contributions
 FAR-1.135
senal-1.057
 Ig-1.045
atican-1.030
IOR-1.005
Token would
Token ID561
Feature activation-6.97e-01

Pos logit contributions
 FAR+0.537
 Ig+0.478
atican+0.472
senal+0.469
IOR+0.439
Neg logit contributions
hire-0.623
ortun-0.579
image-0.579
icked-0.559
-0.525
Token tend
Token ID4327
Feature activation-3.77e-02

Pos logit contributions
 FAR+2.191
 Ig+1.905
atican+1.883
senal+1.792
IOR+1.769
Neg logit contributions
hire-2.775
ortun-2.607
image-2.584
icked-2.499
ドラ-2.399
Token to
Token ID284
Feature activation-4.33e-01

Pos logit contributions
 FAR+0.146
senal+0.144
atican+0.136
 Ig+0.133
foundland+0.128
Neg logit contributions
hire-0.129
ortun-0.115
image-0.112
icked-0.112
anced-0.105
Token go
Token ID467
Feature activation-1.12e-01

Pos logit contributions
 FAR+1.404
 Ig+1.256
senal+1.246
atican+1.219
IOR+1.160
Neg logit contributions
hire-1.721
ortun-1.596
image-1.571
icked-1.542
anced-1.459
Token off
Token ID572
Feature activation+2.69e-01

Pos logit contributions
 FAR+0.367
senal+0.341
 Ig+0.326
atican+0.323
IOR+0.302
Neg logit contributions
hire-0.453
ortun-0.413
image-0.408
icked-0.403
anced-0.378
Token the
Token ID262
Feature activation-6.90e-01

Pos logit contributions
hire+1.030
ortun+0.932
image+0.918
icked+0.912
anced+0.848
Neg logit contributions
 FAR-0.931
senal-0.910
atican-0.843
 Ig-0.842
foundland-0.792
Token circuit
Token ID10349
Feature activation+4.25e-01

Pos logit contributions
 FAR+2.441
senal+2.320
 Ig+2.199
atican+2.194
IOR+2.055
Neg logit contributions
hire-2.570
ortun-2.319
image-2.291
icked-2.262
anced-2.093
Token
Token ID198
Feature activation-4.16e-01

Pos logit contributions
 FAR+0.045
senal+0.044
atican+0.041
 Ig+0.041
foundland+0.039
Neg logit contributions
hire-0.042
ortun-0.038
image-0.037
icked-0.037
anced-0.034
Token
Token ID198
Feature activation-3.63e-01

Pos logit contributions
 FAR+1.429
 Ig+1.307
atican+1.261
IOR+1.204
 SAT+1.178
Neg logit contributions
hire-1.571
ortun-1.458
icked-1.443
image-1.348
anced-1.329
TokenU
Token ID52
Feature activation-1.83e-01

Pos logit contributions
 FAR+1.224
senal+1.146
atican+1.116
 Ig+1.098
IOR+1.050
Neg logit contributions
hire-1.399
ortun-1.276
icked-1.248
image-1.230
anced-1.159
Token.
Token ID13
Feature activation-1.88e-01

Pos logit contributions
 FAR+0.829
senal+0.819
atican+0.769
 Ig+0.767
foundland+0.736
Neg logit contributions
hire-0.507
ortun-0.441
image-0.430
icked-0.428
anced-0.384
TokenS
Token ID50
Feature activation-3.81e-01

Pos logit contributions
 FAR+0.481
 Ig+0.430
atican+0.419
IOR+0.400
senal+0.397
Neg logit contributions
hire-0.884
ortun-0.816
icked-0.813
image-0.796
anced-0.774
Token.
Token ID13
Feature activation-5.57e-01

Pos logit contributions
 FAR+1.556
senal+1.447
 Ig+1.441
atican+1.438
IOR+1.368
Neg logit contributions
hire-1.197
ortun-1.056
image-1.041
icked-1.024
anced-0.940
Token President
Token ID1992
Feature activation-1.25e+00

Pos logit contributions
 FAR+2.077
senal+1.985
 Ig+1.894
atican+1.866
IOR+1.774
Neg logit contributions
hire-1.973
ortun-1.773
image-1.729
icked-1.718
anced-1.583
Token Donald
Token ID3759
Feature activation-1.30e+00

Pos logit contributions
 FAR+3.446
senal+3.201
 Ig+3.053
atican+2.994
IOR+2.773
Neg logit contributions
hire-5.460
ortun-4.981
image-4.924
icked-4.890
anced-4.622
Token Trump
Token ID1301
Feature activation-1.02e+00

Pos logit contributions
 FAR+3.259
 Ig+2.861
atican+2.663
 SAT+2.511
IOR+2.496
Neg logit contributions
hire-6.056
ortun-5.582
icked-5.540
image-5.485
anced-5.232
Token waves
Token ID9813
Feature activation-1.17e-01

Pos logit contributions
 FAR+3.645
 Ig+3.335
atican+3.302
senal+3.299
IOR+3.056
Neg logit contributions
hire-3.643
ortun-3.349
image-3.276
icked-3.240
anced-2.982
Token as
Token ID355
Feature activation-3.72e-01

Pos logit contributions
 FAR+0.421
senal+0.412
 Ig+0.385
atican+0.384
foundland+0.362
Neg logit contributions
hire-0.430
ortun-0.388
image-0.381
icked-0.380
anced-0.351
Token The
Token ID383
Feature activation+8.91e-02

Pos logit contributions
 FAR+0.566
senal+0.519
 Ig+0.511
atican+0.500
IOR+0.477
Neg logit contributions
hire-0.541
ortun-0.490
icked-0.477
image-0.471
anced-0.441
Token reason
Token ID1738
Feature activation-3.32e-01

Pos logit contributions
hire+0.364
ortun+0.330
image+0.324
icked+0.323
anced+0.302
Neg logit contributions
 FAR-0.290
senal-0.276
 Ig-0.259
atican-0.256
IOR-0.239
Token :
Token ID1058
Feature activation+5.38e-01

Pos logit contributions
 FAR+1.107
senal+1.033
 Ig+0.987
atican+0.973
IOR+0.914
Neg logit contributions
hire-1.326
ortun-1.210
icked-1.185
image-1.185
anced-1.104
Token it
Token ID340
Feature activation+3.92e-01

Pos logit contributions
hire+2.039
ortun+1.845
image+1.818
icked+1.804
anced+1.677
Neg logit contributions
 FAR-1.867
senal-1.850
atican-1.705
 Ig-1.694
foundland-1.604
Token requires
Token ID4433
Feature activation-2.08e-01

Pos logit contributions
hire+1.484
ortun+1.342
image+1.320
icked+1.312
anced+1.218
Neg logit contributions
 FAR-1.375
senal-1.348
atican-1.249
 Ig-1.245
foundland-1.173
Token the
Token ID262
Feature activation-7.39e-02

Pos logit contributions
 FAR+0.729
senal+0.688
 Ig+0.650
atican+0.641
IOR+0.612
Neg logit contributions
hire-0.794
ortun-0.720
image-0.705
icked-0.703
anced-0.649
Token server
Token ID4382
Feature activation+2.99e-01

Pos logit contributions
 FAR+0.274
senal+0.261
 Ig+0.247
atican+0.244
IOR+0.234
Neg logit contributions
hire-0.267
ortun-0.241
image-0.236
icked-0.235
anced-0.217
Token to
Token ID284
Feature activation-1.32e-02

Pos logit contributions
hire+1.059
ortun+0.951
image+0.941
icked+0.929
anced+0.860
Neg logit contributions
 FAR-1.104
senal-1.101
atican-1.020
 Ig-1.010
foundland-0.967
Token care
Token ID1337
Feature activation+8.23e-02

Pos logit contributions
 FAR+0.045
senal+0.041
 Ig+0.040
atican+0.040
IOR+0.038
Neg logit contributions
hire-0.051
ortun-0.047
icked-0.046
image-0.046
anced-0.043
Token about
Token ID546
Feature activation+7.46e-01

Pos logit contributions
hire+0.282
ortun+0.253
image+0.249
icked+0.247
anced+0.228
Neg logit contributions
 FAR-0.315
senal-0.312
atican-0.290
 Ig-0.289
foundland-0.275
Token when
Token ID618
Feature activation+1.80e-01

Pos logit contributions
hire+2.797
ortun+2.533
image+2.500
icked+2.475
anced+2.303
Neg logit contributions
senal-2.606
 FAR-2.597
atican-2.397
 Ig-2.366
foundland-2.256
Token into
Token ID656
Feature activation-3.97e-01

Pos logit contributions
hire+0.277
ortun+0.253
image+0.249
icked+0.248
anced+0.232
Neg logit contributions
 FAR-0.201
senal-0.196
atican-0.179
 Ig-0.179
foundland-0.167
Token a
Token ID257
Feature activation-4.53e-01

Pos logit contributions
 FAR+1.432
senal+1.370
 Ig+1.305
atican+1.287
foundland+1.206
Neg logit contributions
hire-1.475
ortun-1.321
image-1.300
icked-1.290
anced-1.204
Token perennial
Token ID39983
Feature activation+2.30e-01

Pos logit contributions
 FAR+1.549
senal+1.454
 Ig+1.401
atican+1.373
 SAT+1.272
Neg logit contributions
hire-1.773
ortun-1.605
image-1.568
icked-1.551
anced-1.466
Token powerhouse
Token ID34886
Feature activation+3.14e-01

Pos logit contributions
hire+0.891
ortun+0.807
image+0.792
icked+0.786
anced+0.733
Neg logit contributions
 FAR-0.790
senal-0.768
atican-0.714
 Ig-0.713
foundland-0.666
Token.
Token ID13
Feature activation+2.21e-01

Pos logit contributions
hire+1.053
ortun+0.940
image+0.922
icked+0.914
anced+0.841
Neg logit contributions
 FAR-1.235
senal-1.210
atican-1.133
 Ig-1.131
foundland-1.072
Token I
Token ID314
Feature activation-1.76e-01

Pos logit contributions
hire+0.827
ortun+0.752
icked+0.733
image+0.728
anced+0.682
Neg logit contributions
 FAR-0.798
senal-0.750
 Ig-0.730
atican-0.718
IOR-0.675
Token\xe2\x80
Token ID447
Feature activation+3.82e-01

Pos logit contributions
 FAR+0.607
senal+0.587
 Ig+0.551
atican+0.550
IOR+0.519
Neg logit contributions
hire-0.668
ortun-0.605
image-0.595
icked-0.586
anced-0.545
Token\x99
Token ID247
Feature activation+5.70e-01

Pos logit contributions
hire+1.457
ortun+1.331
image+1.293
icked+1.278
anced+1.197
Neg logit contributions
 FAR-1.312
senal-1.280
 Ig-1.198
atican-1.195
IOR-1.121
Tokenm
Token ID76
Feature activation-1.36e-02

Pos logit contributions
hire+1.912
ortun+1.711
image+1.668
icked+1.653
anced+1.519
Neg logit contributions
 FAR-2.241
senal-2.194
 Ig-2.064
atican-2.059
foundland-1.949
Token sure
Token ID1654
Feature activation+4.42e-03

Pos logit contributions
 FAR+0.043
senal+0.042
 Ig+0.039
atican+0.039
foundland+0.036
Neg logit contributions
hire-0.056
ortun-0.051
image-0.050
icked-0.050
anced-0.046
Token I
Token ID314
Feature activation-7.16e-01

Pos logit contributions
hire+0.018
ortun+0.016
image+0.016
icked+0.016
anced+0.015
Neg logit contributions
 FAR-0.014
senal-0.014
 Ig-0.013
atican-0.013
IOR-0.012
Token\n
Token ID198
Feature activation+8.48e-02

Pos logit contributions
hire+0.380
ortun+0.346
icked+0.341
image+0.326
anced+0.315
Neg logit contributions
 FAR-0.388
 Ig-0.356
atican-0.348
IOR-0.330
 SAT-0.323
TokenFacebook
Token ID12025
Feature activation+9.07e-01

Pos logit contributions
hire+0.298
ortun+0.269
icked+0.264
image+0.257
anced+0.243
Neg logit contributions
 FAR-0.316
senal-0.299
atican-0.292
 Ig-0.289
IOR-0.279
Token Twitter
Token ID3009
Feature activation+1.19e+00

Pos logit contributions
hire+3.194
ortun+2.877
icked+2.798
image+2.795
anced+2.590
Neg logit contributions
 FAR-3.432
senal-3.277
 Ig-3.133
atican-3.131
IOR-2.933
Token Pinterest
Token ID17334
Feature activation-2.14e-03

Pos logit contributions
hire+5.426
ortun+4.999
image+4.932
icked+4.910
anced+4.629
Neg logit contributions
 FAR-3.205
senal-3.131
atican-2.827
 Ig-2.815
foundland-2.601
Token Lake
Token ID6233
Feature activation-2.99e-01

Pos logit contributions
 FAR+0.007
senal+0.007
atican+0.006
 Ig+0.006
foundland+0.006
Neg logit contributions
hire-0.008
ortun-0.008
image-0.008
icked-0.007
anced-0.007
Token Ur
Token ID8799
Feature activation-1.14e+00

Pos logit contributions
 FAR+0.882
senal+0.869
atican+0.782
 Ig+0.766
foundland+0.725
Neg logit contributions
hire-1.298
ortun-1.194
image-1.183
icked-1.163
anced-1.098
Tokenmia
Token ID20730
Feature activation-3.83e-01

Pos logit contributions
 FAR+2.661
atican+2.494
 Ig+2.487
lishes+2.297
ftime+2.275
Neg logit contributions
hire-5.140
ortun-4.757
icked-4.732
image-4.517
ドラ-4.373
Token Photograph
Token ID18903
Feature activation+6.75e-01

Pos logit contributions
 FAR+1.415
senal+1.374
 Ig+1.306
atican+1.300
foundland+1.226
Neg logit contributions
hire-1.365
ortun-1.228
icked-1.203
image-1.188
anced-1.104
Token:
Token ID25
Feature activation+2.99e-01

Pos logit contributions
hire+2.027
ortun+1.793
image+1.744
icked+1.737
anced+1.588
Neg logit contributions
 FAR-2.879
senal-2.789
atican-2.660
 Ig-2.655
IOR-2.519
Token K
Token ID509
Feature activation+3.96e-01

Pos logit contributions
hire+1.184
ortun+1.078
image+1.066
icked+1.054
anced+0.984
Neg logit contributions
 FAR-0.983
senal-0.980
atican-0.894
 Ig-0.885
foundland-0.841
Tokenave
Token ID1015
Feature activation-5.99e-01

Pos logit contributions
hire+1.461
ortun+1.318
image+1.297
icked+1.289
anced+1.192
Neg logit contributions
 FAR-1.421
senal-1.401
atican-1.296
 Ig-1.292
foundland-1.223
Tokenoe
Token ID2577
Feature activation+7.50e-01

Pos logit contributions
hire+1.584
ortun+1.432
image+1.407
icked+1.400
anced+1.286
Neg logit contributions
 FAR-1.627
senal-1.617
atican-1.483
 Ig-1.482
foundland-1.403
Tokening
Token ID278
Feature activation-5.61e-01

Pos logit contributions
hire+2.663
ortun+2.364
image+2.329
icked+2.327
anced+2.133
Neg logit contributions
senal-2.808
 FAR-2.784
atican-2.586
 Ig-2.543
ikuman-2.422
Token in
Token ID287
Feature activation-1.29e-01

Pos logit contributions
 FAR+1.983
senal+1.896
 Ig+1.801
atican+1.777
foundland+1.682
Neg logit contributions
hire-2.095
ortun-1.896
image-1.870
icked-1.859
anced-1.724
Token Yosemite
Token ID43188
Feature activation-3.20e-01

Pos logit contributions
 FAR+0.444
senal+0.434
 Ig+0.403
atican+0.402
foundland+0.378
Neg logit contributions
hire-0.492
ortun-0.446
image-0.439
icked-0.436
anced-0.407
Token National
Token ID2351
Feature activation-2.86e-02

Pos logit contributions
 FAR+1.206
senal+1.184
 Ig+1.104
atican+1.099
foundland+1.043
Neg logit contributions
hire-1.120
ortun-1.009
image-0.990
icked-0.984
anced-0.908
Token Park
Token ID3250
Feature activation-1.16e+00

Pos logit contributions
 FAR+0.128
senal+0.126
 Ig+0.120
atican+0.118
foundland+0.113
Neg logit contributions
hire-0.080
ortun-0.070
image-0.067
icked-0.067
anced-0.061
Token under
Token ID739
Feature activation+1.34e-01

Pos logit contributions
 FAR+4.063
senal+3.811
 Ig+3.685
atican+3.591
foundland+3.432
Neg logit contributions
hire-4.269
ortun-3.906
image-3.855
icked-3.837
anced-3.566
Token frost
Token ID21682
Feature activation-2.44e-01

Pos logit contributions
hire+0.502
ortun+0.453
image+0.446
icked+0.443
anced+0.411
Neg logit contributions
 FAR-0.473
senal-0.467
atican-0.431
 Ig-0.429
foundland-0.406
Tokened
Token ID276
Feature activation-5.85e-01

Pos logit contributions
 FAR+0.915
senal+0.900
atican+0.836
 Ig+0.835
foundland+0.791
Neg logit contributions
hire-0.862
ortun-0.774
image-0.760
icked-0.754
anced-0.698
Token sequ
Token ID4726
Feature activation+3.01e-01

Pos logit contributions
 FAR+2.086
senal+1.982
 Ig+1.907
atican+1.862
foundland+1.766
Neg logit contributions
hire-2.179
ortun-1.958
image-1.931
icked-1.914
anced-1.790
Tokeno
Token ID78
Feature activation+1.30e-01

Pos logit contributions
hire+1.035
ortun+0.928
image+0.910
icked+0.905
anced+0.831
Neg logit contributions
 FAR-1.161
senal-1.156
atican-1.065
 Ig-1.062
foundland-1.013
Tokenhour
Token ID9769
Feature activation-3.40e-01

Pos logit contributions
 FAR+0.114
senal+0.111
atican+0.106
 Ig+0.106
foundland+0.101
Neg logit contributions
hire-0.073
ortun-0.064
icked-0.062
image-0.062
anced-0.056
Token edition
Token ID8313
Feature activation-2.71e-01

Pos logit contributions
 FAR+1.192
senal+1.132
 Ig+1.094
atican+1.071
foundland+1.000
Neg logit contributions
hire-1.276
ortun-1.152
icked-1.142
image-1.139
anced-1.054
Token network
Token ID3127
Feature activation-1.50e-01

Pos logit contributions
 FAR+1.041
senal+0.987
 Ig+0.957
atican+0.946
lishes+0.895
Neg logit contributions
hire-0.939
ortun-0.831
image-0.817
icked-0.814
anced-0.744
Token's
Token ID338
Feature activation-2.65e-01

Pos logit contributions
 FAR+0.534
senal+0.516
 Ig+0.486
atican+0.483
IOR+0.453
Neg logit contributions
hire-0.561
ortun-0.505
image-0.497
icked-0.494
anced-0.457
Token World
Token ID2159
Feature activation-4.39e-01

Pos logit contributions
 FAR+0.880
senal+0.853
 Ig+0.794
atican+0.783
foundland+0.732
Neg logit contributions
hire-1.058
ortun-0.958
icked-0.944
image-0.940
anced-0.876
Token News
Token ID3000
Feature activation-1.70e+00

Pos logit contributions
 FAR+1.634
senal+1.607
atican+1.497
 Ig+1.496
IOR+1.398
Neg logit contributions
hire-1.563
ortun-1.393
image-1.355
icked-1.352
anced-1.256
Token Tonight
Token ID27117
Feature activation-5.76e-01

Pos logit contributions
 FAR+5.486
senal+4.988
 Ig+4.952
atican+4.785
ftime+4.525
Neg logit contributions
hire-6.534
ortun-5.927
icked-5.892
image-5.855
anced-5.537
Token will
Token ID481
Feature activation-4.95e-01

Pos logit contributions
 FAR+2.138
senal+2.032
 Ig+1.967
atican+1.930
lishes+1.832
Neg logit contributions
hire-2.037
ortun-1.814
icked-1.791
image-1.789
anced-1.646
Token be
Token ID307
Feature activation-2.98e-02

Pos logit contributions
 FAR+1.600
senal+1.447
 Ig+1.429
atican+1.404
IOR+1.323
Neg logit contributions
hire-1.981
ortun-1.815
icked-1.789
image-1.781
anced-1.662
Token dedicated
Token ID7256
Feature activation-1.09e+00

Pos logit contributions
 FAR+0.107
senal+0.102
 Ig+0.097
atican+0.095
lishes+0.090
Neg logit contributions
hire-0.111
ortun-0.100
image-0.098
icked-0.097
anced-0.090
Token to
Token ID284
Feature activation-2.53e-01

Pos logit contributions
 FAR+3.832
senal+3.758
 Ig+3.481
atican+3.476
foundland+3.270
Neg logit contributions
hire-4.071
ortun-3.662
image-3.602
icked-3.585
anced-3.316
Token vandal
Token ID26271
Feature activation-2.34e-01

Pos logit contributions
hire+0.566
ortun+0.515
image+0.508
icked+0.505
anced+0.472
Neg logit contributions
 FAR-0.458
senal-0.448
 Ig-0.412
atican-0.411
foundland-0.384
Tokenized
Token ID1143
Feature activation-8.72e-01

Pos logit contributions
 FAR+1.191
senal+1.174
atican+1.116
 Ig+1.115
foundland+1.069
Neg logit contributions
hire-0.515
ortun-0.430
image-0.417
icked-0.409
anced-0.356
Token and
Token ID290
Feature activation-3.15e-01

Pos logit contributions
 FAR+3.114
senal+3.009
 Ig+2.849
atican+2.814
foundland+2.640
Neg logit contributions
hire-3.195
ortun-2.895
image-2.840
icked-2.827
anced-2.616
Token set
Token ID900
Feature activation+2.65e-01

Pos logit contributions
 FAR+1.020
senal+0.965
 Ig+0.918
atican+0.900
IOR+0.835
Neg logit contributions
hire-1.278
ortun-1.174
image-1.151
icked-1.143
anced-1.068
Token on
Token ID319
Feature activation-4.09e-01

Pos logit contributions
hire+0.999
ortun+0.902
image+0.888
icked+0.883
anced+0.820
Neg logit contributions
 FAR-0.931
senal-0.917
atican-0.847
 Ig-0.843
foundland-0.797
Token fire
Token ID2046
Feature activation-1.17e+00

Pos logit contributions
 FAR+1.516
senal+1.468
 Ig+1.392
atican+1.376
foundland+1.293
Neg logit contributions
hire-1.427
ortun-1.313
image-1.284
icked-1.258
anced-1.175
Token in
Token ID287
Feature activation-5.22e-01

Pos logit contributions
 FAR+4.662
senal+4.480
 Ig+4.335
atican+4.246
IOR+4.030
Neg logit contributions
hire-3.658
ortun-3.324
icked-3.212
image-3.199
anced-2.928
Token an
Token ID281
Feature activation-4.82e-01

Pos logit contributions
 FAR+1.789
senal+1.688
 Ig+1.639
atican+1.587
IOR+1.476
Neg logit contributions
hire-2.010
ortun-1.841
icked-1.798
image-1.792
anced-1.669
Token organized
Token ID8389
Feature activation-3.89e-01

Pos logit contributions
 FAR+1.617
 Ig+1.501
senal+1.493
atican+1.428
IOR+1.341
Neg logit contributions
hire-1.892
ortun-1.720
icked-1.690
image-1.669
anced-1.557
Token manner
Token ID5642
Feature activation-8.15e-01

Pos logit contributions
 FAR+1.275
senal+1.194
 Ig+1.156
atican+1.133
IOR+1.041
Neg logit contributions
hire-1.557
ortun-1.435
image-1.403
icked-1.398
anced-1.308
Token\xe2\x80
Token ID447
Feature activation-3.50e-02

Pos logit contributions
 FAR+3.374
senal+3.280
 Ig+3.128
atican+3.099
IOR+2.936
Neg logit contributions
hire-2.504
ortun-2.253
image-2.181
icked-2.175
anced-1.986
Token and
Token ID290
Feature activation-6.32e-01

Pos logit contributions
 FAR+1.723
senal+1.582
 Ig+1.545
atican+1.494
IOR+1.401
Neg logit contributions
hire-2.169
ortun-1.977
image-1.941
icked-1.903
anced-1.774
Token he
Token ID339
Feature activation-6.74e-01

Pos logit contributions
 FAR+1.937
 Ig+1.735
senal+1.727
atican+1.613
IOR+1.538
Neg logit contributions
hire-2.700
ortun-2.461
image-2.420
icked-2.398
anced-2.235
Token did
Token ID750
Feature activation-3.51e-01

Pos logit contributions
 FAR+2.132
 Ig+1.861
atican+1.816
senal+1.733
 SAT+1.677
Neg logit contributions
hire-2.809
ortun-2.554
image-2.528
icked-2.460
odder-2.302
Token not
Token ID407
Feature activation-4.30e-01

Pos logit contributions
 FAR+0.973
senal+0.897
 Ig+0.851
atican+0.828
IOR+0.771
Neg logit contributions
hire-1.586
ortun-1.460
image-1.448
icked-1.439
anced-1.354
Token fight
Token ID1907
Feature activation-8.33e-02

Pos logit contributions
 FAR+1.389
 Ig+1.236
atican+1.206
senal+1.181
IOR+1.124
Neg logit contributions
hire-1.736
ortun-1.600
image-1.577
icked-1.558
anced-1.464
Token back
Token ID736
Feature activation-1.06e+00

Pos logit contributions
 FAR+0.291
senal+0.270
 Ig+0.263
atican+0.255
IOR+0.243
Neg logit contributions
hire-0.321
ortun-0.292
image-0.285
icked-0.284
anced-0.263
Token.
Token ID13
Feature activation-3.02e-01

Pos logit contributions
 FAR+3.945
 Ig+3.626
senal+3.594
atican+3.476
IOR+3.367
Neg logit contributions
hire-3.698
ortun-3.302
image-3.246
icked-3.204
anced-2.931
Token Jal
Token ID28649
Feature activation-1.70e-01

Pos logit contributions
 FAR+0.996
 Ig+0.904
senal+0.877
atican+0.865
IOR+0.819
Neg logit contributions
hire-1.220
ortun-1.125
icked-1.101
image-1.068
anced-1.035
Tokenen
Token ID268
Feature activation-4.49e-01

Pos logit contributions
 FAR+0.400
atican+0.351
senal+0.349
 Ig+0.348
ftime+0.317
Neg logit contributions
hire-0.816
ortun-0.765
image-0.747
icked-0.746
anced-0.708
Token was
Token ID373
Feature activation-6.48e-02

Pos logit contributions
 FAR+1.560
senal+1.444
 Ig+1.424
atican+1.372
IOR+1.296
Neg logit contributions
hire-1.716
ortun-1.564
image-1.542
icked-1.518
anced-1.416
Token doing
Token ID1804
Feature activation+2.36e-01

Pos logit contributions
 FAR+0.217
senal+0.204
 Ig+0.194
atican+0.189
IOR+0.179
Neg logit contributions
hire-0.260
ortun-0.235
image-0.232
icked-0.230
anced-0.214