TOP ACTIVATIONS
MAX = 0.991

. Part 4 . S ucker Waves
} -> " S ", { 2 , 15 } ->
heart l xp Profile Joined September 2010 United States
all these explanations have in common is that they basically come
starch ( which is ground dehyd rated k uz u root
adds to delays . At worst , it leads to cheating
ific ... Flag Flag Reply 14 Recommend
. In the five examples above , T read well lines
everything and Vietnamese people work hard I m
Share this comment on Twitter NY T Pick M
K ada ver BB Profile Blog Joined June 2009 Germany
zip file . file . A S 3 bucket
: 50 AM ET Fri , 28 July 2017 | 00
to gr ays cale image = c v 2 . im
a climate that is at worst hostile and at best indifferent
movie adaptation of Charlie and the Chocolate Factory initially met similar
group . Guys are working hard and we have a physical
this comment on Facebook Share this comment on Twitter
- out - the - vote efforts , and populist Sen
at worst hostile and at best indifferent to pressing urban priorities

MIN ACTIVATIONS
MIN = -1.041

: 47 ] < mom per > lol m 2 k
19 . Brian Al ain u ' uese ( 6
: 48 ] < min ion 13 37 > le l
: 47 ] < mom per > lol m 2 k
: 48 ] < mom per > lol m 2 k
: 47 ] < mom per > lol m 2 k
& mut self ) { let prev : Board = self
( 13 , 5 )) sq K ernel = c v
School Psychology International , 25 , 287 - 300 .
Crim in ology , 35 , 177 - 202 .
- January 4 , 2010 User reviews are a bad idea
= count y : * & key =
6 0 2016 7 1 Total 121 24 International
6 ) F ours qu are # of
head over to the passenger side and lift up the e
General Chuck Ye ager Interview ,
shield . Plug this loop into your receiver and use it
In Hum bers ide a mosque chairman described how
. twitter . com / u I ba H Ex TT
) 6 . Rob Harley ( 152 ) (

INTERVAL 0.483 - 0.991
CONTAINS 0.432%

vertex . x yz += v . normal * 1 ;
gave her final speech as first lady on Friday , during
men , they were cur t ailing holiday decorations to match
407 x . x . x . 99 10 88 350
cited the Sessions - R osen stein recommendation to explain the

INTERVAL -0.025 - 0.483
CONTAINS 56.367%

N em ov confirmed his ownership of Olymp Oil but
issue will face a greater burden of proof than most ballot
at 3 / 4 s . I also recommend upgrading to
Black stone bought Sea World for $ 2 . 3 billion
AFL 's longest active streak of games at 130 , were

INTERVAL -0.533 - -0.025
CONTAINS 42.962%

The city is in preliminary talks with Hudson County officials to
more clueless about the potential fallout . Those around him say
the F - 150 lineup . It will be paired with
app d great is you - the user - and the
Do not use social media to randomly criticize a contemporary (

INTERVAL -1.041 - -0.533
CONTAINS 0.239%

these sites in Hum bers ide and we will take action
0 and the sum f of all un am pl
$ font 2 ) $ rect 2 = New - Object
ank _ k 10 > ke k m 2 k [
, 2 , .. *) graph List = Table [ {
Token.
Token ID13
Feature activation+7.25e-02

Pos logit contributions
 Worker+1.236
 competitor+1.169
 logger+1.154
roid+1.154
 refugees+1.139
Neg logit contributions
nown-1.506
Jac-1.414
)-1.405
bay-1.373
ffic-1.360
Token
Token ID198
Feature activation+1.26e-01

Pos logit contributions
 Worker+0.359
 competitor+0.338
roid+0.335
 logger+0.333
 subordinates+0.332
Neg logit contributions
nown-0.398
Jac-0.370
)-0.369
bay-0.361
cape-0.357
Token
Token ID198
Feature activation+9.55e-02

Pos logit contributions
 Worker+0.591
 competitor+0.551
roid+0.551
 logger+0.542
 refugees+0.541
Neg logit contributions
nown-0.732
)-0.670
Jac-0.666
bay-0.655
cape-0.651
TokenPart
Token ID7841
Feature activation+2.60e-01

Pos logit contributions
 Worker+0.552
 competitor+0.527
roid+0.526
 logger+0.518
 subordinates+0.517
Neg logit contributions
nown-0.440
)-0.401
Jac-0.400
bay-0.389
cape-0.385
Token 4
Token ID604
Feature activation+6.02e-01

Pos logit contributions
 Worker+1.269
 competitor+1.198
roid+1.190
 logger+1.179
 subordinates+1.172
Neg logit contributions
nown-1.446
Jac-1.339
)-1.336
bay-1.314
cape-1.297
Token.
Token ID13
Feature activation+9.91e-01

Pos logit contributions
 Worker+2.874
 competitor+2.656
roid+2.630
 logger+2.628
 refugees+2.621
Neg logit contributions
nown-3.417
Jac-3.183
)-3.155
bay-3.088
cape-3.042
Token S
Token ID311
Feature activation+1.74e-01

Pos logit contributions
 Worker+4.678
 Developer+4.259
roid+4.111
 competitor+4.105
 refugees+4.074
Neg logit contributions
nown-5.778
Jac-5.231
)-5.222
bay-5.123
ffic-4.960
Tokenucker
Token ID12603
Feature activation+7.75e-02

Pos logit contributions
 Worker+0.930
roid+0.883
 competitor+0.882
 logger+0.873
 subordinates+0.871
Neg logit contributions
nown-0.875
Jac-0.809
)-0.807
bay-0.783
cape-0.770
Token Waves
Token ID42733
Feature activation+1.23e-01

Pos logit contributions
 Worker+0.369
 competitor+0.346
roid+0.346
 subordinates+0.340
 logger+0.340
Neg logit contributions
nown-0.442
Jac-0.409
)-0.409
bay-0.397
cape-0.394
Token
Token ID198
Feature activation+9.14e-02

Pos logit contributions
 Worker+0.592
 competitor+0.555
roid+0.553
 logger+0.548
 subordinates+0.546
Neg logit contributions
nown-0.690
Jac-0.643
)-0.641
bay-0.626
ffic-0.615
Token
Token ID198
Feature activation+1.36e-01

Pos logit contributions
 Worker+0.435
 competitor+0.407
roid+0.406
 logger+0.400
 refugees+0.399
Neg logit contributions
nown-0.522
)-0.480
Jac-0.478
bay-0.469
cape-0.465
Token}
Token ID92
Feature activation-3.32e-01

Pos logit contributions
nown+0.103
Jac+0.096
)+0.096
bay+0.094
cape+0.093
Neg logit contributions
 Worker-0.077
 competitor-0.073
roid-0.072
 logger-0.071
 subordinates-0.071
Token ->
Token ID4613
Feature activation+2.41e-01

Pos logit contributions
nown+1.575
)+1.460
Jac+1.456
bay+1.416
cape+1.402
Neg logit contributions
 Worker-1.858
 competitor-1.777
roid-1.770
 subordinates-1.758
 logger-1.753
Token"
Token ID1
Feature activation-3.80e-02

Pos logit contributions
 Worker+0.829
 competitor+0.760
roid+0.749
 logger+0.745
 refugees+0.736
Neg logit contributions
nown-1.689
Jac-1.584
)-1.578
bay-1.556
cape-1.545
TokenS
Token ID50
Feature activation+3.81e-01

Pos logit contributions
nown+0.167
Jac+0.150
)+0.150
bay+0.146
cape+0.144
Neg logit contributions
 Worker-0.229
 competitor-0.219
roid-0.218
 logger-0.216
 subordinates-0.216
Token",
Token ID1600
Feature activation+4.33e-01

Pos logit contributions
 Worker+1.528
roid+1.423
 competitor+1.420
 logger+1.398
 subordinates+1.385
Neg logit contributions
nown-2.430
)-2.280
Jac-2.271
bay-2.222
cape-2.206
Token {
Token ID1391
Feature activation+9.76e-01

Pos logit contributions
 Worker+2.145
 competitor+2.025
 logger+2.004
roid+1.990
 refugees+1.976
Neg logit contributions
nown-2.387
Jac-2.186
)-2.175
bay-2.145
cape-2.128
Token2
Token ID17
Feature activation+6.77e-01

Pos logit contributions
 Worker+5.766
roid+5.556
 competitor+5.553
 refugees+5.410
 logger+5.401
Neg logit contributions
nown-4.132
)-3.663
Jac-3.650
cape-3.512
ffic-3.488
Token,
Token ID11
Feature activation+2.52e-01

Pos logit contributions
 Worker+2.458
roid+2.274
 competitor+2.213
 refugees+2.179
 Developer+2.155
Neg logit contributions
nown-4.605
Jac-4.155
)-4.141
ffic-4.123
foreseen-4.120
Token 15
Token ID1315
Feature activation+2.34e-01

Pos logit contributions
 Worker+1.426
 competitor+1.354
roid+1.342
 logger+1.335
 refugees+1.328
Neg logit contributions
nown-1.212
Jac-1.096
)-1.089
bay-1.077
cape-1.063
Token}
Token ID92
Feature activation-1.49e-01

Pos logit contributions
 Worker+0.991
roid+0.924
 competitor+0.919
 logger+0.911
 refugees+0.900
Neg logit contributions
nown-1.451
Jac-1.344
)-1.330
bay-1.317
cape-1.310
Token ->
Token ID4613
Feature activation+3.30e-01

Pos logit contributions
nown+0.739
Jac+0.677
)+0.675
bay+0.661
cape+0.652
Neg logit contributions
 Worker-0.814
 competitor-0.773
roid-0.769
 logger-0.764
 subordinates-0.761
Token
Token ID198
Feature activation+2.25e-01

Pos logit contributions
nown+0.784
Jac+0.727
)+0.725
bay+0.710
cape+0.703
Neg logit contributions
 Worker-0.730
 competitor-0.692
roid-0.688
 logger-0.682
 subordinates-0.681
Token
Token ID198
Feature activation+5.01e-02

Pos logit contributions
 Worker+1.036
roid+0.961
 competitor+0.954
 logger+0.939
 refugees+0.939
Neg logit contributions
nown-1.335
)-1.220
Jac-1.209
bay-1.190
cape-1.181
Tokenheart
Token ID11499
Feature activation-3.36e-01

Pos logit contributions
 Worker+0.284
roid+0.273
 competitor+0.271
 logger+0.266
 subordinates+0.266
Neg logit contributions
nown-0.236
)-0.216
Jac-0.214
bay-0.208
cape-0.207
Tokenl
Token ID75
Feature activation-1.31e-01

Pos logit contributions
nown+1.713
Jac+1.582
)+1.572
bay+1.551
cape+1.528
Neg logit contributions
 Worker-1.776
 competitor-1.689
roid-1.669
 subordinates-1.668
 logger-1.667
Tokenxp
Token ID42372
Feature activation-2.22e-02

Pos logit contributions
nown+0.650
)+0.598
Jac+0.597
bay+0.581
cape+0.576
Neg logit contributions
 Worker-0.716
 competitor-0.680
roid-0.679
 logger-0.672
 subordinates-0.670
Token Profile
Token ID13118
Feature activation+9.04e-01

Pos logit contributions
nown+0.100
)+0.092
Jac+0.091
bay+0.089
cape+0.088
Neg logit contributions
 Worker-0.131
 competitor-0.125
roid-0.125
 logger-0.123
 subordinates-0.123
Token Joined
Token ID16709
Feature activation+4.56e-01

Pos logit contributions
 Worker+4.398
 Developer+4.102
 logger+4.024
 competitor+4.024
 refugees+3.957
Neg logit contributions
nown-4.978
Jac-4.622
)-4.517
bay-4.491
cape-4.430
Token September
Token ID2693
Feature activation+4.94e-01

Pos logit contributions
 Worker+2.171
 competitor+2.013
 Developer+1.993
 logger+1.976
 refugees+1.974
Neg logit contributions
nown-2.640
)-2.408
Jac-2.406
bay-2.343
cape-2.341
Token 2010
Token ID3050
Feature activation+3.31e-01

Pos logit contributions
 Worker+2.674
 Developer+2.487
 competitor+2.481
 refugees+2.468
roid+2.468
Neg logit contributions
nown-2.541
)-2.290
Jac-2.288
cape-2.206
bay-2.198
Token United
Token ID1578
Feature activation+4.73e-02

Pos logit contributions
 Worker+1.570
 competitor+1.447
 refugees+1.433
roid+1.426
 Developer+1.425
Neg logit contributions
nown-1.931
)-1.771
Jac-1.755
bay-1.718
cape-1.703
Token States
Token ID1829
Feature activation-8.72e-02

Pos logit contributions
 Worker+0.193
 competitor+0.179
roid+0.177
 logger+0.175
 refugees+0.174
Neg logit contributions
nown-0.303
Jac-0.283
)-0.283
bay-0.277
cape-0.276
Token all
Token ID477
Feature activation+2.15e-01

Pos logit contributions
 Worker+0.211
 competitor+0.199
roid+0.197
 logger+0.196
 subordinates+0.196
Neg logit contributions
nown-0.274
Jac-0.255
)-0.255
bay-0.250
cape-0.248
Token these
Token ID777
Feature activation-3.79e-02

Pos logit contributions
 Worker+0.959
 competitor+0.905
 refugees+0.896
roid+0.896
 logger+0.893
Neg logit contributions
nown-1.272
)-1.183
Jac-1.180
bay-1.157
cape-1.146
Token explanations
Token ID18681
Feature activation-3.31e-01

Pos logit contributions
nown+0.243
Jac+0.228
)+0.227
bay+0.224
cape+0.222
Neg logit contributions
 Worker-0.152
 competitor-0.142
roid-0.141
 logger-0.139
 subordinates-0.139
Token have
Token ID423
Feature activation-5.31e-02

Pos logit contributions
nown+1.922
Jac+1.793
)+1.789
bay+1.758
cape+1.745
Neg logit contributions
 Worker-1.516
 competitor-1.428
roid-1.416
 logger-1.402
 subordinates-1.395
Token in
Token ID287
Feature activation+3.52e-01

Pos logit contributions
nown+0.281
Jac+0.259
)+0.259
bay+0.254
cape+0.251
Neg logit contributions
 Worker-0.271
 competitor-0.257
roid-0.256
 logger-0.253
 subordinates-0.252
Token common
Token ID2219
Feature activation+9.01e-01

Pos logit contributions
 Worker+0.862
 competitor+0.769
 logger+0.762
roid+0.752
 subordinates+0.742
Neg logit contributions
nown-2.796
)-2.658
Jac-2.648
bay-2.619
cape-2.600
Token is
Token ID318
Feature activation+1.27e-01

Pos logit contributions
 Worker+3.523
 competitor+3.300
 subordinates+3.281
 logger+3.277
roid+3.269
Neg logit contributions
nown-5.653
)-5.382
Jac-5.351
ffic-5.211
bay-5.188
Token that
Token ID326
Feature activation-4.67e-02

Pos logit contributions
 Worker+0.544
 competitor+0.510
roid+0.505
 logger+0.502
 subordinates+0.500
Neg logit contributions
nown-0.782
)-0.731
Jac-0.730
bay-0.717
cape-0.710
Token they
Token ID484
Feature activation-2.35e-01

Pos logit contributions
nown+0.284
Jac+0.265
)+0.265
bay+0.260
cape+0.258
Neg logit contributions
 Worker-0.201
 competitor-0.189
roid-0.188
 logger-0.186
 subordinates-0.186
Token basically
Token ID6209
Feature activation-1.89e-01

Pos logit contributions
nown+1.369
Jac+1.275
)+1.273
bay+1.251
cape+1.239
Neg logit contributions
 Worker-1.079
 competitor-1.015
roid-1.009
 logger-0.999
 subordinates-0.994
Token come
Token ID1282
Feature activation+1.86e-02

Pos logit contributions
nown+1.116
Jac+1.041
)+1.039
bay+1.019
cape+1.009
Neg logit contributions
 Worker-0.847
 competitor-0.798
roid-0.792
 logger-0.786
 subordinates-0.784
Token starch
Token ID39515
Feature activation+3.64e-01

Pos logit contributions
 Worker+1.273
 competitor+1.199
roid+1.177
 logger+1.170
 subordinates+1.167
Neg logit contributions
nown-2.028
Jac-1.904
)-1.899
bay-1.850
cape-1.843
Token (
Token ID357
Feature activation+4.63e-01

Pos logit contributions
 Worker+1.608
 competitor+1.522
roid+1.499
 logger+1.487
 subordinates+1.481
Neg logit contributions
nown-2.183
)-2.041
Jac-2.039
bay-1.979
ffic-1.971
Tokenwhich
Token ID4758
Feature activation+1.21e-01

Pos logit contributions
 Worker+2.490
roid+2.372
 competitor+2.365
 logger+2.312
 subordinates+2.305
Neg logit contributions
nown-2.299
)-2.106
Jac-2.085
ffic-2.012
cape-2.008
Token is
Token ID318
Feature activation+4.95e-02

Pos logit contributions
 Worker+0.541
 competitor+0.509
roid+0.505
 logger+0.501
 subordinates+0.500
Neg logit contributions
nown-0.716
Jac-0.668
)-0.667
bay-0.654
cape-0.648
Token ground
Token ID2323
Feature activation+3.95e-02

Pos logit contributions
 Worker+0.208
 competitor+0.194
roid+0.194
 logger+0.192
 subordinates+0.191
Neg logit contributions
nown-0.306
)-0.287
Jac-0.286
bay-0.282
cape-0.279
Token dehyd
Token ID28151
Feature activation+9.00e-01

Pos logit contributions
 Worker+0.155
 competitor+0.145
roid+0.143
 logger+0.142
 subordinates+0.141
Neg logit contributions
nown-0.256
Jac-0.240
)-0.240
bay-0.236
cape-0.234
Tokenrated
Token ID4111
Feature activation+2.38e-01

Pos logit contributions
roid+3.491
 Worker+3.474
 competitor+3.258
 refugees+3.216
 logger+3.160
Neg logit contributions
nown-5.572
Jac-5.339
)-5.271
bay-5.146
cape-5.146
Token k
Token ID479
Feature activation+1.18e-01

Pos logit contributions
 Worker+0.920
 competitor+0.858
roid+0.847
 logger+0.842
 subordinates+0.837
Neg logit contributions
nown-1.564
Jac-1.466
)-1.456
bay-1.433
cape-1.424
Tokenuz
Token ID10277
Feature activation+1.56e-02

Pos logit contributions
 Worker+0.682
roid+0.653
 competitor+0.648
 subordinates+0.641
 logger+0.639
Neg logit contributions
nown-0.541
Jac-0.498
)-0.494
bay-0.476
cape-0.475
Tokenu
Token ID84
Feature activation-2.00e-02

Pos logit contributions
 Worker+0.101
 competitor+0.097
roid+0.097
 subordinates+0.096
 logger+0.096
Neg logit contributions
nown-0.061
Jac-0.055
)-0.055
bay-0.053
cape-0.052
Token root
Token ID6808
Feature activation+1.33e-01

Pos logit contributions
nown+0.122
Jac+0.114
)+0.114
bay+0.112
cape+0.111
Neg logit contributions
 Worker-0.086
 competitor-0.081
roid-0.080
 logger-0.080
 subordinates-0.080
Token adds
Token ID6673
Feature activation-7.17e-02

Pos logit contributions
 Worker+0.389
 competitor+0.365
roid+0.362
 logger+0.358
 subordinates+0.357
Neg logit contributions
nown-0.564
Jac-0.528
)-0.527
bay-0.519
cape-0.514
Token to
Token ID284
Feature activation+7.88e-02

Pos logit contributions
nown+0.430
Jac+0.402
)+0.402
bay+0.394
cape+0.390
Neg logit contributions
 Worker-0.313
 competitor-0.294
roid-0.293
 logger-0.290
 subordinates-0.290
Token delays
Token ID16119
Feature activation-9.25e-02

Pos logit contributions
 Worker+0.360
 competitor+0.340
roid+0.336
 logger+0.334
 subordinates+0.333
Neg logit contributions
nown-0.462
Jac-0.430
)-0.430
bay-0.421
cape-0.416
Token.
Token ID13
Feature activation-1.99e-01

Pos logit contributions
nown+0.521
Jac+0.484
)+0.484
bay+0.476
cape+0.473
Neg logit contributions
 Worker-0.438
 competitor-0.415
roid-0.412
 logger-0.407
 subordinates-0.407
Token At
Token ID1629
Feature activation+2.58e-01

Pos logit contributions
nown+1.051
Jac+0.978
)+0.975
bay+0.954
cape+0.944
Neg logit contributions
 Worker-1.007
 competitor-0.960
roid-0.956
 logger-0.948
 subordinates-0.947
Token worst
Token ID5290
Feature activation+8.95e-01

Pos logit contributions
 Worker+1.150
 competitor+1.089
 logger+1.079
 subordinates+1.062
roid+1.061
Neg logit contributions
nown-1.537
Jac-1.438
)-1.435
bay-1.412
cape-1.388
Token,
Token ID11
Feature activation+8.70e-02

Pos logit contributions
 Worker+3.330
 competitor+3.202
 logger+3.108
 subordinates+3.095
 refugees+3.093
Neg logit contributions
nown-5.904
Jac-5.511
)-5.480
bay-5.389
ffic-5.273
Token it
Token ID340
Feature activation-1.63e-02

Pos logit contributions
 Worker+0.400
 competitor+0.379
roid+0.373
 logger+0.372
 subordinates+0.371
Neg logit contributions
nown-0.507
Jac-0.472
)-0.471
bay-0.462
cape-0.456
Token leads
Token ID5983
Feature activation+7.35e-02

Pos logit contributions
nown+0.105
Jac+0.099
)+0.099
bay+0.097
cape+0.097
Neg logit contributions
 Worker-0.064
 competitor-0.059
roid-0.059
 logger-0.058
 subordinates-0.058
Token to
Token ID284
Feature activation-4.50e-03

Pos logit contributions
 Worker+0.347
 competitor+0.328
roid+0.325
 subordinates+0.324
 logger+0.323
Neg logit contributions
nown-0.419
Jac-0.390
)-0.389
bay-0.380
cape-0.377
Token cheating
Token ID21608
Feature activation-2.82e-01

Pos logit contributions
nown+0.029
Jac+0.027
)+0.027
bay+0.027
cape+0.026
Neg logit contributions
 Worker-0.018
 competitor-0.017
roid-0.017
 logger-0.016
 subordinates-0.016
Tokenific
Token ID811
Feature activation-1.52e-01

Pos logit contributions
nown+0.553
Jac+0.499
)+0.498
bay+0.487
cape+0.483
Neg logit contributions
 Worker-0.697
 competitor-0.663
roid-0.663
 logger-0.655
 refugees-0.652
Token...
Token ID986
Feature activation-3.61e-01

Pos logit contributions
nown+0.856
Jac+0.795
)+0.794
bay+0.776
cape+0.768
Neg logit contributions
 Worker-0.736
 competitor-0.694
roid-0.689
 logger-0.683
 subordinates-0.680
Token Flag
Token ID19762
Feature activation-9.07e-02

Pos logit contributions
nown+1.852
Jac+1.722
)+1.707
bay+1.684
cape+1.666
Neg logit contributions
 Worker-1.862
 competitor-1.786
roid-1.767
 subordinates-1.764
 logger-1.763
Token
Token ID198
Feature activation+1.99e-01

Pos logit contributions
nown+0.479
)+0.446
Jac+0.446
bay+0.433
ffic+0.430
Neg logit contributions
 Worker-0.456
roid-0.437
 competitor-0.433
 subordinates-0.430
 logger-0.430
Token
Token ID198
Feature activation+3.88e-01

Pos logit contributions
 Worker+0.921
roid+0.855
 competitor+0.848
 refugees+0.835
 logger+0.833
Neg logit contributions
nown-1.167
)-1.063
Jac-1.049
bay-1.040
cape-1.026
TokenFlag
Token ID34227
Feature activation+8.94e-01

Pos logit contributions
 Worker+2.219
roid+2.123
 competitor+2.115
 logger+2.070
 subordinates+2.068
Neg logit contributions
nown-1.808
)-1.634
Jac-1.609
bay-1.582
ffic-1.549
Token Reply
Token ID14883
Feature activation+5.45e-01

Pos logit contributions
 Worker+5.573
 competitor+5.273
 logger+5.209
 Developer+5.205
 refugees+5.132
Neg logit contributions
nown-3.870
)-3.363
Jac-3.362
bay-3.314
cape-3.218
Token
Token ID198
Feature activation+3.05e-01

Pos logit contributions
 Worker+2.484
 competitor+2.285
 Developer+2.274
 refugees+2.250
 logger+2.246
Neg logit contributions
nown-3.315
)-3.017
bay-2.955
Jac-2.943
cape-2.934
Token
Token ID198
Feature activation+3.02e-01

Pos logit contributions
 Worker+1.373
roid+1.262
 competitor+1.251
 refugees+1.242
 Developer+1.239
Neg logit contributions
nown-1.851
)-1.674
Jac-1.654
bay-1.637
ffic-1.622
Token14
Token ID1415
Feature activation+3.02e-01

Pos logit contributions
 Worker+1.818
 competitor+1.730
roid+1.728
 logger+1.696
 Developer+1.691
Neg logit contributions
nown-1.348
)-1.203
Jac-1.174
bay-1.165
cape-1.147
Token Recommend
Token ID30361
Feature activation+2.53e-01

Pos logit contributions
 Worker+1.435
 competitor+1.353
roid+1.328
 logger+1.318
 subordinates+1.317
Neg logit contributions
nown-1.741
)-1.603
Jac-1.589
bay-1.568
cape-1.547
Token.
Token ID13
Feature activation+4.67e-03

Pos logit contributions
 Worker+0.247
 competitor+0.234
roid+0.233
 logger+0.231
 subordinates+0.230
Neg logit contributions
nown-0.275
Jac-0.255
)-0.255
bay-0.249
cape-0.247
Token In
Token ID554
Feature activation+1.67e-01

Pos logit contributions
 Worker+0.024
 competitor+0.022
roid+0.022
 logger+0.022
 subordinates+0.022
Neg logit contributions
nown-0.025
)-0.023
Jac-0.023
bay-0.023
cape-0.022
Token the
Token ID262
Feature activation+1.12e-01

Pos logit contributions
 Worker+0.774
 competitor+0.732
roid+0.721
 logger+0.716
 subordinates+0.716
Neg logit contributions
nown-0.969
)-0.902
Jac-0.898
bay-0.877
cape-0.871
Token five
Token ID1936
Feature activation+1.20e-02

Pos logit contributions
 Worker+0.465
 competitor+0.437
roid+0.432
 logger+0.428
 subordinates+0.427
Neg logit contributions
nown-0.704
Jac-0.659
)-0.658
bay-0.647
cape-0.641
Token examples
Token ID6096
Feature activation+1.87e-02

Pos logit contributions
 Worker+0.049
 competitor+0.045
roid+0.045
 logger+0.045
 subordinates+0.044
Neg logit contributions
nown-0.076
Jac-0.071
)-0.071
bay-0.070
cape-0.069
Token above
Token ID2029
Feature activation+8.82e-01

Pos logit contributions
 Worker+0.071
 competitor+0.066
roid+0.065
 logger+0.065
 subordinates+0.064
Neg logit contributions
nown-0.124
Jac-0.116
)-0.116
bay-0.114
cape-0.113
Token,
Token ID11
Feature activation+2.13e-01

Pos logit contributions
 Worker+3.256
 competitor+3.031
 logger+3.013
 refugees+2.995
roid+2.973
Neg logit contributions
nown-5.650
)-5.472
Jac-5.319
cape-5.227
ffic-5.207
Token T
Token ID309
Feature activation+9.17e-02

Pos logit contributions
 Worker+0.954
 competitor+0.903
roid+0.886
 logger+0.886
 subordinates+0.883
Neg logit contributions
nown-1.255
)-1.178
Jac-1.172
bay-1.148
cape-1.141
Tokenread
Token ID961
Feature activation-4.27e-03

Pos logit contributions
 Worker+0.447
roid+0.427
 competitor+0.426
 logger+0.416
 subordinates+0.415
Neg logit contributions
nown-0.501
Jac-0.466
)-0.466
cape-0.454
bay-0.451
Tokenwell
Token ID4053
Feature activation-7.73e-03

Pos logit contributions
nown+0.024
Jac+0.022
)+0.022
bay+0.021
cape+0.021
Neg logit contributions
 Worker-0.021
 competitor-0.020
roid-0.020
 logger-0.019
 subordinates-0.019
Token lines
Token ID3951
Feature activation+7.43e-02

Pos logit contributions
nown+0.044
Jac+0.041
)+0.041
bay+0.040
cape+0.040
Neg logit contributions
 Worker-0.036
 competitor-0.034
roid-0.034
 logger-0.034
 subordinates-0.034
Token everything
Token ID2279
Feature activation+9.13e-02

Pos logit contributions
 Worker+1.085
 competitor+1.018
roid+1.004
 logger+0.995
 subordinates+0.994
Neg logit contributions
nown-1.659
Jac-1.549
)-1.549
bay-1.521
cape-1.506
Token and
Token ID290
Feature activation-2.32e-02

Pos logit contributions
 Worker+0.428
 competitor+0.404
roid+0.402
 logger+0.399
 subordinates+0.398
Neg logit contributions
nown-0.519
Jac-0.485
)-0.484
bay-0.474
cape-0.470
Token Vietnamese
Token ID23618
Feature activation-1.30e-01

Pos logit contributions
nown+0.139
Jac+0.130
)+0.130
bay+0.127
cape+0.126
Neg logit contributions
 Worker-0.102
roid-0.095
 competitor-0.095
 logger-0.094
 subordinates-0.094
Token people
Token ID661
Feature activation+2.56e-01

Pos logit contributions
nown+0.906
Jac+0.857
)+0.856
bay+0.843
cape+0.835
Neg logit contributions
 Worker-0.438
roid-0.403
 competitor-0.402
 logger-0.398
 subordinates-0.394
Token work
Token ID670
Feature activation+1.14e-01

Pos logit contributions
 Worker+1.191
 competitor+1.129
roid+1.117
 subordinates+1.111
 refugees+1.108
Neg logit contributions
nown-1.472
Jac-1.368
)-1.358
bay-1.335
cape-1.317
Token hard
Token ID1327
Feature activation+8.79e-01

Pos logit contributions
 Worker+0.435
 competitor+0.406
roid+0.401
 logger+0.397
 subordinates+0.397
Neg logit contributions
nown-0.754
Jac-0.707
)-0.706
bay-0.694
cape-0.687
Token
Token ID851
Feature activation+2.08e-01

Pos logit contributions
 Worker+3.885
 competitor+3.685
 subordinates+3.609
 refugees+3.587
roid+3.572
Neg logit contributions
nown-5.155
Jac-4.839
)-4.768
bay-4.618
ffic-4.608
Token I
Token ID314
Feature activation+4.44e-02

Pos logit contributions
 Worker+0.917
 competitor+0.863
roid+0.852
 logger+0.846
 subordinates+0.845
Neg logit contributions
nown-1.254
Jac-1.168
)-1.166
bay-1.145
cape-1.134
Token
Token ID447
Feature activation-1.66e-01

Pos logit contributions
 Worker+0.217
 competitor+0.206
roid+0.204
 logger+0.203
 subordinates+0.202
Neg logit contributions
nown-0.245
Jac-0.227
)-0.226
bay-0.222
cape-0.220
Token
Token ID247
Feature activation-1.82e-01

Pos logit contributions
nown+1.211
Jac+1.145
)+1.143
bay+1.126
cape+1.117
Neg logit contributions
 Worker-0.515
 competitor-0.471
roid-0.467
 logger-0.460
 subordinates-0.459
Tokenm
Token ID76
Feature activation-8.89e-02

Pos logit contributions
nown+0.847
Jac+0.774
)+0.774
bay+0.756
cape+0.746
Neg logit contributions
 Worker-1.043
 competitor-0.996
roid-0.989
 logger-0.985
 subordinates-0.983
Token Share
Token ID8734
Feature activation+7.89e-01

Pos logit contributions
 Worker+2.298
 Developer+2.139
 shares+2.131
 competitor+2.130
 logger+2.089
Neg logit contributions
nown-2.474
Jac-2.193
)-2.193
foreseen-2.112
bay-2.099
Token this
Token ID428
Feature activation+8.31e-01

Pos logit contributions
 Worker+3.090
 competitor+2.948
roid+2.910
 shares+2.900
 logger+2.878
Neg logit contributions
nown-5.096
)-4.654
Jac-4.624
bay-4.575
ffic-4.537
Token comment
Token ID2912
Feature activation+5.30e-01

Pos logit contributions
 Worker+1.847
 competitor+1.750
 logger+1.701
 reviewer+1.650
roid+1.626
Neg logit contributions
nown-6.846
Jac-6.391
)-6.349
cape-6.251
bay-6.224
Token on
Token ID319
Feature activation+5.54e-01

Pos logit contributions
 Worker+2.221
 competitor+2.065
roid+2.054
 logger+2.042
 refugees+2.037
Neg logit contributions
nown-3.355
Jac-3.041
)-3.040
bay-2.972
cape-2.943
Token Twitter
Token ID3009
Feature activation+5.30e-01

Pos logit contributions
 Worker+2.527
 competitor+2.331
 Developer+2.308
 logger+2.308
 refugees+2.288
Neg logit contributions
nown-3.348
)-3.048
Jac-3.024
cape-2.964
bay-2.946
Token
Token ID198
Feature activation+8.75e-01

Pos logit contributions
 Worker+2.476
 competitor+2.305
 logger+2.289
roid+2.280
 refugees+2.263
Neg logit contributions
nown-3.093
)-2.861
Jac-2.780
bay-2.738
cape-2.722
Token
Token ID198
Feature activation+7.28e-01

Pos logit contributions
 Worker+3.481
 Developer+3.123
roid+3.113
 shares+3.087
 refugees+3.074
Neg logit contributions
nown-5.734
-5.186
)-5.155
foreseen-5.040
Jac-5.002
TokenNY
Token ID12805
Feature activation+4.91e-01

Pos logit contributions
 Worker+3.411
roid+3.262
 competitor+3.234
 logger+3.135
 refugees+3.113
Neg logit contributions
nown-4.035
)-3.734
ffic-3.549
Jac-3.539
cape-3.506
TokenT
Token ID51
Feature activation+5.55e-01

Pos logit contributions
 Worker+3.034
roid+2.913
 competitor+2.897
 logger+2.840
 Developer+2.829
Neg logit contributions
nown-2.095
)-1.842
Jac-1.779
bay-1.772
ffic-1.756
Token Pick
Token ID12346
Feature activation+6.18e-01

Pos logit contributions
 Worker+1.887
 competitor+1.713
roid+1.681
 logger+1.666
 refugees+1.638
Neg logit contributions
nown-3.959
Jac-3.708
)-3.703
bay-3.615
cape-3.592
Token M
Token ID337
Feature activation+1.24e-01

Pos logit contributions
 Worker+3.152
 competitor+2.965
 logger+2.940
 refugees+2.925
roid+2.902
Neg logit contributions
nown-3.306
)-3.035
Jac-2.967
bay-2.924
cape-2.888
Token
Token ID198
Feature activation+7.99e-02

Pos logit contributions
 Worker+0.592
roid+0.553
 competitor+0.553
 logger+0.544
 refugees+0.543
Neg logit contributions
nown-0.723
)-0.664
Jac-0.662
bay-0.648
cape-0.643
TokenK
Token ID42
Feature activation-9.72e-02

Pos logit contributions
 Worker+0.458
roid+0.438
 competitor+0.437
 logger+0.429
 subordinates+0.428
Neg logit contributions
nown-0.371
)-0.339
Jac-0.336
bay-0.326
cape-0.324
Tokenada
Token ID4763
Feature activation+1.62e-01

Pos logit contributions
nown+0.482
)+0.443
Jac+0.443
bay+0.433
cape+0.428
Neg logit contributions
 Worker-0.529
 competitor-0.503
roid-0.502
 logger-0.496
 subordinates-0.495
Tokenver
Token ID332
Feature activation-1.04e-01

Pos logit contributions
 Worker+0.911
 competitor+0.867
roid+0.863
 logger+0.851
 subordinates+0.850
Neg logit contributions
nown-0.777
)-0.713
Jac-0.706
cape-0.691
bay-0.690
TokenBB
Token ID15199
Feature activation-1.82e-01

Pos logit contributions
nown+0.502
Jac+0.461
)+0.460
bay+0.448
cape+0.443
Neg logit contributions
 Worker-0.584
 competitor-0.557
roid-0.554
 logger-0.549
 subordinates-0.548
Token Profile
Token ID13118
Feature activation+8.71e-01

Pos logit contributions
nown+0.853
Jac+0.780
)+0.779
bay+0.758
cape+0.750
Neg logit contributions
 Worker-1.047
 competitor-0.998
roid-0.994
 logger-0.986
 subordinates-0.984
Token Blog
Token ID14001
Feature activation+2.15e-01

Pos logit contributions
 Worker+4.177
 Developer+3.872
 logger+3.847
 competitor+3.833
roid+3.760
Neg logit contributions
nown-4.837
Jac-4.493
)-4.434
bay-4.349
cape-4.304
Token Joined
Token ID16709
Feature activation+1.92e-02

Pos logit contributions
 Worker+1.140
 competitor+1.062
 logger+1.055
 Developer+1.050
 refugees+1.049
Neg logit contributions
nown-1.113
Jac-1.021
)-1.015
cape-0.991
bay-0.985
Token June
Token ID2795
Feature activation+1.66e-01

Pos logit contributions
 Worker+0.096
 competitor+0.091
roid+0.090
 logger+0.090
 subordinates+0.089
Neg logit contributions
nown-0.104
Jac-0.096
)-0.096
bay-0.094
cape-0.093
Token 2009
Token ID3717
Feature activation+1.85e-01

Pos logit contributions
 Worker+0.947
 competitor+0.896
roid+0.890
 logger+0.885
 refugees+0.884
Neg logit contributions
nown-0.796
)-0.721
Jac-0.720
bay-0.697
cape-0.692
Token Germany
Token ID4486
Feature activation-1.52e-01

Pos logit contributions
 Worker+0.894
 competitor+0.825
 refugees+0.818
 Developer+0.815
roid+0.814
Neg logit contributions
nown-1.053
)-0.974
Jac-0.967
bay-0.947
cape-0.941
Tokenzip
Token ID13344
Feature activation+2.43e-01

Pos logit contributions
nown+0.371
Jac+0.342
)+0.341
bay+0.336
cape+0.331
Neg logit contributions
 Worker-0.367
 competitor-0.350
roid-0.346
 subordinates-0.344
 logger-0.343
Token file
Token ID2393
Feature activation+2.77e-01

Pos logit contributions
 Worker+0.913
 competitor+0.849
roid+0.843
 logger+0.836
 subordinates+0.824
Neg logit contributions
nown-1.620
Jac-1.522
)-1.521
bay-1.492
cape-1.480
Token.
Token ID13
Feature activation-3.40e-02

Pos logit contributions
 Worker+1.321
 competitor+1.247
roid+1.238
 logger+1.231
 subordinates+1.224
Neg logit contributions
nown-1.562
Jac-1.452
)-1.451
bay-1.420
cape-1.405
Token
Token ID198
Feature activation+2.34e-01

Pos logit contributions
nown+0.180
Jac+0.167
)+0.166
bay+0.163
cape+0.161
Neg logit contributions
 Worker-0.172
 competitor-0.164
roid-0.163
 subordinates-0.161
 logger-0.161
Token
Token ID198
Feature activation+3.87e-01

Pos logit contributions
 Worker+1.085
roid+1.006
 competitor+1.005
 logger+0.989
 refugees+0.989
Neg logit contributions
nown-1.372
)-1.254
Jac-1.246
bay-1.223
cape-1.216
Tokenfile
Token ID7753
Feature activation+8.64e-01

Pos logit contributions
 Worker+2.176
roid+2.073
 competitor+2.071
 logger+2.044
 subordinates+2.029
Neg logit contributions
nown-1.834
)-1.670
Jac-1.648
bay-1.613
cape-1.601
Token.
Token ID13
Feature activation+4.25e-01

Pos logit contributions
 Worker+3.758
 logger+3.509
roid+3.497
 competitor+3.472
 shares+3.398
Neg logit contributions
nown-5.129
Jac-4.724
)-4.720
bay-4.658
cape-4.628
Token A
Token ID317
Feature activation-1.75e-01

Pos logit contributions
 Worker+2.048
 competitor+1.908
 logger+1.901
 subordinates+1.882
roid+1.875
Neg logit contributions
nown-2.400
Jac-2.236
)-2.227
bay-2.185
cape-2.158
Token S
Token ID311
Feature activation+9.43e-04

Pos logit contributions
nown+1.115
)+1.049
Jac+1.049
bay+1.028
cape+1.020
Neg logit contributions
 Worker-0.687
 competitor-0.642
roid-0.641
 subordinates-0.639
 refugees-0.628
Token3
Token ID18
Feature activation+6.52e-02

Pos logit contributions
 Worker+0.005
 competitor+0.005
roid+0.005
 subordinates+0.005
 logger+0.005
Neg logit contributions
nown-0.005
Jac-0.004
)-0.004
bay-0.004
cape-0.004
Token bucket
Token ID19236
Feature activation-2.95e-01

Pos logit contributions
 Worker+0.238
 competitor+0.221
roid+0.219
 subordinates+0.216
 logger+0.216
Neg logit contributions
nown-0.440
Jac-0.414
)-0.414
bay-0.407
cape-0.403
Token:
Token ID25
Feature activation-1.75e-01

Pos logit contributions
nown+0.469
Jac+0.432
)+0.431
bay+0.422
cape+0.417
Neg logit contributions
 Worker-0.497
 competitor-0.473
roid-0.471
 logger-0.467
 subordinates-0.466
Token50
Token ID1120
Feature activation+9.89e-02

Pos logit contributions
nown+0.766
Jac+0.703
)+0.701
bay+0.682
cape+0.674
Neg logit contributions
 Worker-1.048
 competitor-1.006
roid-0.999
 logger-0.995
 subordinates-0.994
Token AM
Token ID3001
Feature activation+2.14e-01

Pos logit contributions
 Worker+0.569
 competitor+0.537
 logger+0.532
roid+0.527
 subordinates+0.526
Neg logit contributions
nown-0.472
)-0.428
Jac-0.425
bay-0.418
cape-0.411
Token ET
Token ID12152
Feature activation+3.20e-01

Pos logit contributions
 Worker+0.999
 competitor+0.933
 logger+0.918
roid+0.913
 refugees+0.908
Neg logit contributions
nown-1.269
)-1.168
Jac-1.154
bay-1.139
cape-1.123
Token Fri
Token ID19480
Feature activation+2.20e-01

Pos logit contributions
 Worker+1.973
 Developer+1.850
 competitor+1.849
 shares+1.823
 refugees+1.819
Neg logit contributions
nown-1.418
)-1.248
Jac-1.221
bay-1.196
ffic-1.193
Token,
Token ID11
Feature activation+8.59e-01

Pos logit contributions
 Worker+0.886
 competitor+0.829
roid+0.817
 refugees+0.798
 logger+0.797
Neg logit contributions
nown-1.423
)-1.313
Jac-1.312
bay-1.284
cape-1.284
Token 28
Token ID2579
Feature activation+3.96e-01

Pos logit contributions
 Worker+4.366
 Developer+4.009
 competitor+4.004
 refugees+3.951
 shares+3.926
Neg logit contributions
nown-4.689
Jac-4.077
foreseen-4.074
)-4.071
ffic-4.052
Token July
Token ID2901
Feature activation+1.63e-01

Pos logit contributions
 Worker+2.091
 competitor+1.956
 refugees+1.924
 Developer+1.921
roid+1.912
Neg logit contributions
nown-2.098
)-1.863
Jac-1.859
ffic-1.815
bay-1.814
Token 2017
Token ID2177
Feature activation+8.65e-02

Pos logit contributions
 Worker+0.908
 competitor+0.869
roid+0.862
 logger+0.852
 subordinates+0.850
Neg logit contributions
nown-0.782
)-0.710
Jac-0.709
bay-0.700
cape-0.686
Token |
Token ID930
Feature activation+4.45e-01

Pos logit contributions
 Worker+0.440
 competitor+0.415
roid+0.411
 logger+0.409
 subordinates+0.408
Neg logit contributions
nown-0.465
)-0.427
Jac-0.426
bay-0.419
cape-0.413
Token 00
Token ID3571
Feature activation+1.14e-01

Pos logit contributions
 Worker+3.073
 Developer+2.940
 shares+2.907
 competitor+2.884
 refugees+2.877
Neg logit contributions
nown-1.683
bay-1.389
)-1.371
foreseen-1.358
ffic-1.346
Token to
Token ID284
Feature activation+1.36e-02

Pos logit contributions
 Worker+0.096
 competitor+0.091
roid+0.090
 logger+0.089
 subordinates+0.089
Neg logit contributions
nown-0.111
Jac-0.103
)-0.103
bay-0.101
cape-0.100
Token gr
Token ID1036
Feature activation-2.28e-02

Pos logit contributions
 Worker+0.052
 competitor+0.048
roid+0.048
 logger+0.047
 subordinates+0.047
Neg logit contributions
nown-0.090
Jac-0.085
)-0.085
bay-0.083
cape-0.082
Tokenays
Token ID592
Feature activation+2.41e-01

Pos logit contributions
nown+0.072
Jac+0.063
)+0.063
bay+0.060
cape+0.060
Neg logit contributions
 Worker-0.164
roid-0.158
 competitor-0.158
 subordinates-0.157
 logger-0.156
Tokencale
Token ID38765
Feature activation+1.18e-01

Pos logit contributions
 Worker+0.873
roid+0.820
 competitor+0.810
 refugees+0.790
 subordinates+0.790
Neg logit contributions
nown-1.613
Jac-1.523
)-1.520
bay-1.480
ffic-1.466
Token image
Token ID2939
Feature activation-5.42e-02

Pos logit contributions
 Worker+0.502
 competitor+0.475
 logger+0.468
roid+0.465
 subordinates+0.459
Neg logit contributions
nown-0.728
)-0.681
Jac-0.681
bay-0.665
cape-0.656
Token =
Token ID796
Feature activation+8.56e-01

Pos logit contributions
nown+0.235
)+0.212
Jac+0.208
bay+0.204
cape+0.200
Neg logit contributions
 Worker-0.332
 competitor-0.316
roid-0.314
 logger-0.314
 refugees-0.312
Token c
Token ID269
Feature activation+2.08e-01

Pos logit contributions
 Worker+4.272
 logger+4.197
 competitor+4.124
 refugees+4.055
 shares+4.017
Neg logit contributions
nown-4.489
Jac-4.192
)-4.188
bay-4.124
ffic-4.056
Tokenv
Token ID85
Feature activation+3.25e-01

Pos logit contributions
 Worker+1.130
roid+1.080
 competitor+1.067
 refugees+1.055
 logger+1.051
Neg logit contributions
nown-1.029
)-0.954
Jac-0.941
bay-0.905
cape-0.897
Token2
Token ID17
Feature activation+1.39e-01

Pos logit contributions
 Worker+2.057
roid+1.973
 competitor+1.964
 refugees+1.929
 logger+1.927
Neg logit contributions
nown-1.338
)-1.186
Jac-1.165
bay-1.131
cape-1.122
Token.
Token ID764
Feature activation+3.21e-01

Pos logit contributions
 Worker+0.700
 competitor+0.658
roid+0.655
 logger+0.651
 refugees+0.644
Neg logit contributions
nown-0.758
)-0.696
Jac-0.691
bay-0.676
cape-0.668
Token im
Token ID545
Feature activation+2.83e-01

Pos logit contributions
 Worker+1.496
 competitor+1.448
 logger+1.445
 refugees+1.417
 subordinates+1.399
Neg logit contributions
nown-1.837
Jac-1.731
)-1.718
bay-1.694
cape-1.637
Token a
Token ID257
Feature activation-7.35e-02

Pos logit contributions
 Worker+0.693
 competitor+0.650
roid+0.639
 subordinates+0.637
 logger+0.637
Neg logit contributions
nown-1.028
)-0.961
Jac-0.958
bay-0.938
cape-0.928
Token climate
Token ID4258
Feature activation-1.64e-01

Pos logit contributions
nown+0.468
Jac+0.441
)+0.439
bay+0.432
cape+0.429
Neg logit contributions
 Worker-0.294
roid-0.274
 competitor-0.274
 logger-0.271
 subordinates-0.271
Token that
Token ID326
Feature activation+1.48e-01

Pos logit contributions
nown+0.963
Jac+0.900
)+0.897
bay+0.882
cape+0.876
Neg logit contributions
 Worker-0.741
 competitor-0.695
roid-0.694
 logger-0.685
 subordinates-0.685
Token is
Token ID318
Feature activation+6.48e-02

Pos logit contributions
 Worker+0.628
 competitor+0.589
roid+0.585
 logger+0.579
 subordinates+0.578
Neg logit contributions
nown-0.912
Jac-0.853
)-0.852
bay-0.837
cape-0.829
Token at
Token ID379
Feature activation-1.23e-01

Pos logit contributions
 Worker+0.274
 competitor+0.257
roid+0.256
 logger+0.253
 subordinates+0.252
Neg logit contributions
nown-0.399
Jac-0.374
)-0.373
bay-0.366
cape-0.363
Token worst
Token ID5290
Feature activation+8.53e-01

Pos logit contributions
nown+0.723
Jac+0.674
)+0.672
bay+0.660
cape+0.653
Neg logit contributions
 Worker-0.557
 competitor-0.525
roid-0.521
 logger-0.517
 subordinates-0.515
Token hostile
Token ID12524
Feature activation-1.10e-01

Pos logit contributions
 Worker+3.348
 competitor+3.205
 refugees+3.127
 logger+3.082
 subordinates+3.046
Neg logit contributions
nown-5.519
)-5.158
Jac-5.129
bay-5.012
ffic-4.986
Token and
Token ID290
Feature activation+9.96e-02

Pos logit contributions
nown+0.626
Jac+0.581
)+0.581
bay+0.572
cape+0.566
Neg logit contributions
 Worker-0.513
 competitor-0.483
roid-0.480
 logger-0.478
 subordinates-0.475
Token at
Token ID379
Feature activation+2.03e-01

Pos logit contributions
 Worker+0.462
 competitor+0.435
roid+0.433
 logger+0.429
 subordinates+0.427
Neg logit contributions
nown-0.574
Jac-0.535
)-0.533
bay-0.523
cape-0.518
Token best
Token ID1266
Feature activation+8.27e-01

Pos logit contributions
 Worker+0.880
 competitor+0.842
 logger+0.838
 refugees+0.826
 subordinates+0.818
Neg logit contributions
nown-1.234
)-1.153
Jac-1.150
bay-1.131
cape-1.116
Token indifferent
Token ID31655
Feature activation+2.52e-02

Pos logit contributions
 Worker+3.157
 competitor+3.069
 refugees+2.960
 subordinates+2.958
 logger+2.927
Neg logit contributions
nown-5.406
)-5.041
Jac-5.013
bay-4.924
ffic-4.867
Token movie
Token ID3807
Feature activation+8.50e-02

Pos logit contributions
 Worker+0.076
 competitor+0.071
roid+0.070
 logger+0.069
 subordinates+0.069
Neg logit contributions
nown-0.124
Jac-0.117
)-0.116
bay-0.114
cape-0.113
Token adaptation
Token ID16711
Feature activation-8.24e-02

Pos logit contributions
 Worker+0.384
 competitor+0.362
roid+0.357
 logger+0.355
 subordinates+0.354
Neg logit contributions
nown-0.502
Jac-0.468
)-0.468
bay-0.458
cape-0.454
Token of
Token ID286
Feature activation-1.81e-02

Pos logit contributions
nown+0.471
Jac+0.438
)+0.437
bay+0.429
cape+0.425
Neg logit contributions
 Worker-0.386
 competitor-0.364
roid-0.362
 logger-0.359
 subordinates-0.358
Token Charlie
Token ID11526
Feature activation+3.22e-01

Pos logit contributions
nown+0.104
Jac+0.097
)+0.097
bay+0.095
cape+0.094
Neg logit contributions
 Worker-0.084
 competitor-0.080
roid-0.079
 logger-0.078
 subordinates-0.078
Token and
Token ID290
Feature activation+5.10e-01

Pos logit contributions
 Worker+1.297
 competitor+1.204
 refugees+1.181
 subordinates+1.178
 logger+1.177
Neg logit contributions
nown-2.059
)-1.928
Jac-1.923
bay-1.889
ffic-1.873
Token the
Token ID262
Feature activation+8.51e-01

Pos logit contributions
 Worker+2.257
 competitor+2.102
roid+2.070
 refugees+2.061
 logger+2.060
Neg logit contributions
nown-3.078
)-2.846
Jac-2.818
bay-2.781
cape-2.778
Token Chocolate
Token ID24777
Feature activation+2.84e-01

Pos logit contributions
 Worker+1.680
 competitor+1.357
 Developer+1.339
 logger+1.311
 refugees+1.309
Neg logit contributions
nown-7.243
Jac-6.853
)-6.784
bay-6.768
ffic-6.687
Token Factory
Token ID19239
Feature activation-2.86e-02

Pos logit contributions
 Worker+0.919
 competitor+0.832
roid+0.825
 logger+0.809
 subordinates+0.807
Neg logit contributions
nown-2.060
Jac-1.937
)-1.936
bay-1.905
cape-1.887
Token initially
Token ID7317
Feature activation-1.86e-01

Pos logit contributions
nown+0.163
Jac+0.152
)+0.152
bay+0.149
cape+0.147
Neg logit contributions
 Worker-0.134
 competitor-0.127
roid-0.126
 logger-0.125
 subordinates-0.125
Token met
Token ID1138
Feature activation-2.24e-02

Pos logit contributions
nown+1.072
Jac+0.998
)+0.995
bay+0.977
cape+0.968
Neg logit contributions
 Worker-0.866
 competitor-0.816
roid-0.811
 logger-0.804
 subordinates-0.802
Token similar
Token ID2092
Feature activation-2.29e-01

Pos logit contributions
nown+0.135
Jac+0.126
)+0.126
bay+0.123
cape+0.122
Neg logit contributions
 Worker-0.099
 competitor-0.093
roid-0.092
 logger-0.091
 subordinates-0.091
Token group
Token ID1448
Feature activation-1.39e-01

Pos logit contributions
nown+0.734
Jac+0.690
)+0.688
bay+0.678
cape+0.670
Neg logit contributions
 Worker-0.480
roid-0.449
 competitor-0.443
 subordinates-0.442
 logger-0.442
Token.
Token ID13
Feature activation+1.92e-03

Pos logit contributions
nown+0.793
Jac+0.738
)+0.736
bay+0.723
cape+0.719
Neg logit contributions
 Worker-0.647
 competitor-0.607
roid-0.606
 subordinates-0.600
 logger-0.600
Token Guys
Token ID29899
Feature activation+7.68e-02

Pos logit contributions
 Worker+0.010
 competitor+0.009
roid+0.009
 logger+0.009
 subordinates+0.009
Neg logit contributions
nown-0.010
)-0.010
Jac-0.010
bay-0.009
cape-0.009
Token are
Token ID389
Feature activation-1.31e-01

Pos logit contributions
 Worker+0.342
 competitor+0.324
roid+0.320
 logger+0.317
 subordinates+0.316
Neg logit contributions
nown-0.457
Jac-0.426
)-0.426
bay-0.418
ffic-0.413
Token working
Token ID1762
Feature activation-5.21e-02

Pos logit contributions
nown+0.835
Jac+0.790
)+0.786
bay+0.773
cape+0.769
Neg logit contributions
 Worker-0.522
roid-0.486
 competitor-0.480
 logger-0.478
 subordinates-0.477
Token hard
Token ID1327
Feature activation+8.37e-01

Pos logit contributions
nown+0.320
Jac+0.302
)+0.300
bay+0.298
cape+0.294
Neg logit contributions
 Worker-0.218
roid-0.203
 competitor-0.203
 logger-0.201
 subordinates-0.200
Token and
Token ID290
Feature activation+1.19e-01

Pos logit contributions
 Worker+3.519
 competitor+3.322
 logger+3.244
roid+3.224
 refugees+3.207
Neg logit contributions
nown-5.097
Jac-4.754
)-4.744
ffic-4.576
bay-4.529
Token we
Token ID356
Feature activation-4.77e-03

Pos logit contributions
 Worker+0.500
 competitor+0.470
roid+0.465
 logger+0.461
 subordinates+0.460
Neg logit contributions
nown-0.735
Jac-0.686
)-0.686
bay-0.673
cape-0.667
Token have
Token ID423
Feature activation+2.12e-01

Pos logit contributions
nown+0.027
Jac+0.025
)+0.025
bay+0.024
cape+0.024
Neg logit contributions
 Worker-0.023
 competitor-0.022
roid-0.022
 logger-0.022
 subordinates-0.021
Token a
Token ID257
Feature activation+1.35e-03

Pos logit contributions
 Worker+0.904
 competitor+0.849
roid+0.842
 logger+0.834
 subordinates+0.832
Neg logit contributions
nown-1.298
Jac-1.213
)-1.211
bay-1.189
cape-1.178
Token physical
Token ID3518
Feature activation+2.25e-01

Pos logit contributions
 Worker+0.005
roid+0.005
 competitor+0.005
 subordinates+0.005
 logger+0.005
Neg logit contributions
nown-0.009
Jac-0.008
)-0.008
bay-0.008
cape-0.008
Token this
Token ID428
Feature activation+4.43e-01

Pos logit contributions
 Worker+1.363
 competitor+1.289
roid+1.274
 logger+1.265
 subordinates+1.258
Neg logit contributions
nown-1.839
)-1.698
Jac-1.693
bay-1.661
ffic-1.643
Token comment
Token ID2912
Feature activation+4.02e-01

Pos logit contributions
 Worker+1.139
 competitor+1.042
 logger+1.011
roid+1.008
 subordinates+0.989
Neg logit contributions
nown-3.508
Jac-3.290
)-3.286
bay-3.229
cape-3.222
Token on
Token ID319
Feature activation+4.59e-01

Pos logit contributions
 Worker+1.715
 competitor+1.584
 logger+1.569
roid+1.568
 refugees+1.561
Neg logit contributions
nown-2.530
)-2.291
Jac-2.283
bay-2.243
cape-2.225
Token Facebook
Token ID3203
Feature activation+4.45e-01

Pos logit contributions
 Worker+1.345
 competitor+1.198
 logger+1.184
roid+1.179
 refugees+1.164
Neg logit contributions
nown-3.484
)-3.256
Jac-3.249
bay-3.201
cape-3.187
Token Share
Token ID8734
Feature activation+7.89e-01

Pos logit contributions
 Worker+2.298
 Developer+2.139
 shares+2.131
 competitor+2.130
 logger+2.089
Neg logit contributions
nown-2.474
Jac-2.193
)-2.193
foreseen-2.112
bay-2.099
Token this
Token ID428
Feature activation+8.31e-01

Pos logit contributions
 Worker+3.090
 competitor+2.948
roid+2.910
 shares+2.900
 logger+2.878
Neg logit contributions
nown-5.096
)-4.654
Jac-4.624
bay-4.575
ffic-4.537
Token comment
Token ID2912
Feature activation+5.30e-01

Pos logit contributions
 Worker+1.847
 competitor+1.750
 logger+1.701
 reviewer+1.650
roid+1.626
Neg logit contributions
nown-6.846
Jac-6.391
)-6.349
cape-6.251
bay-6.224
Token on
Token ID319
Feature activation+5.54e-01

Pos logit contributions
 Worker+2.221
 competitor+2.065
roid+2.054
 logger+2.042
 refugees+2.037
Neg logit contributions
nown-3.355
Jac-3.041
)-3.040
bay-2.972
cape-2.943
Token Twitter
Token ID3009
Feature activation+5.30e-01

Pos logit contributions
 Worker+2.527
 competitor+2.331
 Developer+2.308
 logger+2.308
 refugees+2.288
Neg logit contributions
nown-3.348
)-3.048
Jac-3.024
cape-2.964
bay-2.946
Token
Token ID198
Feature activation+8.75e-01

Pos logit contributions
 Worker+2.476
 competitor+2.305
 logger+2.289
roid+2.280
 refugees+2.263
Neg logit contributions
nown-3.093
)-2.861
Jac-2.780
bay-2.738
cape-2.722
Token
Token ID198
Feature activation+7.28e-01

Pos logit contributions
 Worker+3.481
 Developer+3.123
roid+3.113
 shares+3.087
 refugees+3.074
Neg logit contributions
nown-5.734
-5.186
)-5.155
foreseen-5.040
Jac-5.002
Token-
Token ID12
Feature activation+7.50e-02

Pos logit contributions
 Worker+0.178
 competitor+0.169
roid+0.166
 subordinates+0.165
 logger+0.165
Neg logit contributions
nown-0.214
)-0.199
Jac-0.199
bay-0.194
cape-0.193
Tokenout
Token ID448
Feature activation-8.13e-02

Pos logit contributions
 Worker+0.322
roid+0.302
 competitor+0.301
 logger+0.300
 subordinates+0.296
Neg logit contributions
nown-0.455
)-0.427
Jac-0.424
cape-0.412
bay-0.412
Token-
Token ID12
Feature activation-2.61e-02

Pos logit contributions
nown+0.456
)+0.423
Jac+0.423
bay+0.413
cape+0.410
Neg logit contributions
 Worker-0.391
 competitor-0.371
roid-0.367
 logger-0.364
 subordinates-0.363
Tokenthe
Token ID1169
Feature activation-1.85e-02

Pos logit contributions
nown+0.115
)+0.104
Jac+0.103
bay+0.100
cape+0.100
Neg logit contributions
 Worker-0.157
 competitor-0.150
roid-0.149
 logger-0.148
 subordinates-0.148
Token-
Token ID12
Feature activation-1.37e-01

Pos logit contributions
nown+0.105
Jac+0.097
)+0.097
bay+0.095
cape+0.094
Neg logit contributions
 Worker-0.088
 competitor-0.084
roid-0.083
 logger-0.082
 subordinates-0.082
Tokenvote
Token ID27257
Feature activation+8.29e-01

Pos logit contributions
nown+0.640
)+0.582
Jac+0.581
bay+0.565
cape+0.560
Neg logit contributions
 Worker-0.789
 competitor-0.751
roid-0.749
 logger-0.741
 subordinates-0.741
Token efforts
Token ID4040
Feature activation-9.01e-03

Pos logit contributions
 Worker+3.060
 competitor+2.922
 refugees+2.813
 logger+2.789
 subordinates+2.781
Neg logit contributions
nown-5.622
)-5.206
Jac-5.176
bay-5.055
cape-4.992
Token,
Token ID11
Feature activation+1.12e-01

Pos logit contributions
nown+0.052
Jac+0.048
)+0.048
bay+0.047
cape+0.047
Neg logit contributions
 Worker-0.042
 competitor-0.040
roid-0.039
 logger-0.039
 subordinates-0.039
Token and
Token ID290
Feature activation-3.26e-02

Pos logit contributions
 Worker+0.527
 competitor+0.498
roid+0.490
 logger+0.489
 subordinates+0.489
Neg logit contributions
nown-0.641
)-0.594
Jac-0.593
bay-0.581
cape-0.575
Token populist
Token ID26815
Feature activation-1.81e-01

Pos logit contributions
nown+0.191
Jac+0.178
)+0.178
bay+0.175
cape+0.173
Neg logit contributions
 Worker-0.148
 competitor-0.140
roid-0.139
 logger-0.137
 subordinates-0.137
Token Sen
Token ID2311
Feature activation-1.70e-01

Pos logit contributions
nown+1.141
Jac+1.084
)+1.078
bay+1.063
cape+1.052
Neg logit contributions
 Worker-0.720
roid-0.679
 competitor-0.667
 logger-0.666
 subordinates-0.664
Token at
Token ID379
Feature activation-1.23e-01

Pos logit contributions
 Worker+0.274
 competitor+0.257
roid+0.256
 logger+0.253
 subordinates+0.252
Neg logit contributions
nown-0.399
Jac-0.374
)-0.373
bay-0.366
cape-0.363
Token worst
Token ID5290
Feature activation+8.53e-01

Pos logit contributions
nown+0.723
Jac+0.674
)+0.672
bay+0.660
cape+0.653
Neg logit contributions
 Worker-0.557
 competitor-0.525
roid-0.521
 logger-0.517
 subordinates-0.515
Token hostile
Token ID12524
Feature activation-1.10e-01

Pos logit contributions
 Worker+3.348
 competitor+3.205
 refugees+3.127
 logger+3.082
 subordinates+3.046
Neg logit contributions
nown-5.519
)-5.158
Jac-5.129
bay-5.012
ffic-4.986
Token and
Token ID290
Feature activation+9.96e-02

Pos logit contributions
nown+0.626
Jac+0.581
)+0.581
bay+0.572
cape+0.566
Neg logit contributions
 Worker-0.513
 competitor-0.483
roid-0.480
 logger-0.478
 subordinates-0.475
Token at
Token ID379
Feature activation+2.03e-01

Pos logit contributions
 Worker+0.462
 competitor+0.435
roid+0.433
 logger+0.429
 subordinates+0.427
Neg logit contributions
nown-0.574
Jac-0.535
)-0.533
bay-0.523
cape-0.518
Token best
Token ID1266
Feature activation+8.27e-01

Pos logit contributions
 Worker+0.880
 competitor+0.842
 logger+0.838
 refugees+0.826
 subordinates+0.818
Neg logit contributions
nown-1.234
)-1.153
Jac-1.150
bay-1.131
cape-1.116
Token indifferent
Token ID31655
Feature activation+2.52e-02

Pos logit contributions
 Worker+3.157
 competitor+3.069
 refugees+2.960
 subordinates+2.958
 logger+2.927
Neg logit contributions
nown-5.406
)-5.041
Jac-5.013
bay-4.924
ffic-4.867
Token to
Token ID284
Feature activation+1.66e-01

Pos logit contributions
 Worker+0.126
 competitor+0.119
roid+0.118
 logger+0.117
 subordinates+0.117
Neg logit contributions
nown-0.137
Jac-0.127
)-0.126
bay-0.124
cape-0.122
Token pressing
Token ID12273
Feature activation+3.51e-02

Pos logit contributions
 Worker+0.662
 competitor+0.618
roid+0.614
 logger+0.608
 subordinates+0.605
Neg logit contributions
nown-1.064
Jac-0.998
)-0.996
bay-0.979
cape-0.971
Token urban
Token ID7876
Feature activation+1.32e-01

Pos logit contributions
 Worker+0.122
 competitor+0.113
roid+0.112
 logger+0.110
 subordinates+0.110
Neg logit contributions
nown-0.243
Jac-0.229
)-0.229
bay-0.225
cape-0.224
Token priorities
Token ID15369
Feature activation-1.26e-01

Pos logit contributions
 Worker+0.407
 competitor+0.373
roid+0.366
 logger+0.363
 refugees+0.362
Neg logit contributions
nown-0.974
)-0.919
Jac-0.914
bay-0.901
cape-0.890
Token:
Token ID25
Feature activation-4.28e-01

Pos logit contributions
 Worker+0.049
roid+0.046
 competitor+0.046
 logger+0.046
 refugees+0.045
Neg logit contributions
nown-0.044
Jac-0.040
)-0.040
bay-0.039
cape-0.039
Token47
Token ID2857
Feature activation-2.44e-01

Pos logit contributions
nown+1.998
Jac+1.869
)+1.845
bay+1.813
cape+1.773
Neg logit contributions
 Worker-2.392
 competitor-2.327
 subordinates-2.322
roid-2.314
 logger-2.301
Token]
Token ID60
Feature activation-3.78e-01

Pos logit contributions
nown+1.320
Jac+1.231
)+1.231
bay+1.204
cape+1.189
Neg logit contributions
 Worker-1.214
 competitor-1.155
roid-1.143
 subordinates-1.138
 logger-1.137
Token <
Token ID1279
Feature activation-5.78e-01

Pos logit contributions
nown+2.344
)+2.213
Jac+2.198
bay+2.154
ffic+2.130
Neg logit contributions
 Worker-1.559
 competitor-1.459
roid-1.454
 subordinates-1.437
 logger-1.422
Tokenmom
Token ID32542
Feature activation-7.93e-01

Pos logit contributions
nown+2.897
Jac+2.688
)+2.662
bay+2.648
cape+2.593
Neg logit contributions
 Worker-3.110
 competitor-2.960
 subordinates-2.936
 logger-2.929
 refugees-2.906
Tokenper
Token ID525
Feature activation-1.04e+00

Pos logit contributions
nown+4.984
)+4.644
bay+4.623
Jac+4.601
ffic+4.552
Neg logit contributions
 Worker-3.262
 logger-3.081
 subordinates-3.067
 competitor-3.061
 refugees-3.013
Token>
Token ID29
Feature activation-6.15e-01

Pos logit contributions
nown+4.507
)+4.348
Jac+4.317
bay+4.221
cape+4.155
Neg logit contributions
 Worker-5.962
 competitor-5.843
 subordinates-5.827
 strikers-5.778
 logger-5.756
Token lol
Token ID19462
Feature activation-4.82e-01

Pos logit contributions
nown+3.334
)+3.128
Jac+3.093
bay+3.038
cape+3.014
Neg logit contributions
 Worker-2.948
 competitor-2.797
roid-2.797
 subordinates-2.768
 logger-2.746
Token m
Token ID285
Feature activation-5.86e-01

Pos logit contributions
nown+2.451
)+2.278
Jac+2.259
bay+2.220
cape+2.179
Neg logit contributions
 Worker-2.515
 competitor-2.384
roid-2.374
 subordinates-2.352
 logger-2.349
Token2
Token ID17
Feature activation-4.87e-01

Pos logit contributions
nown+1.927
)+1.693
Jac+1.688
bay+1.676
cape+1.622
Neg logit contributions
 Worker-4.141
 competitor-3.990
 logger-3.988
 subordinates-3.966
 refugees-3.937
Tokenk
Token ID74
Feature activation-2.02e-01

Pos logit contributions
nown+2.104
Jac+1.934
)+1.923
bay+1.918
cape+1.865
Neg logit contributions
 Worker-2.925
 competitor-2.816
 subordinates-2.805
 logger-2.800
roid-2.783
Token
Token ID198
Feature activation-1.32e-01

Pos logit contributions
nown+0.935
Jac+0.867
)+0.865
bay+0.848
cape+0.839
Neg logit contributions
 Worker-0.843
 competitor-0.798
roid-0.793
 logger-0.787
 subordinates-0.785
Token19
Token ID1129
Feature activation-1.49e-01

Pos logit contributions
nown+0.418
)+0.359
Jac+0.355
bay+0.345
cape+0.340
Neg logit contributions
 Worker-0.954
 competitor-0.924
roid-0.916
 subordinates-0.909
 logger-0.908
Token.
Token ID13
Feature activation-2.50e-01

Pos logit contributions
nown+0.824
Jac+0.757
)+0.757
bay+0.742
cape+0.736
Neg logit contributions
 Worker-0.737
 competitor-0.694
roid-0.691
 logger-0.684
 subordinates-0.681
Token Brian
Token ID8403
Feature activation-5.72e-01

Pos logit contributions
nown+1.318
Jac+1.227
)+1.221
bay+1.196
cape+1.181
Neg logit contributions
 Worker-1.275
 competitor-1.212
roid-1.206
 logger-1.195
 subordinates-1.195
Token Al
Token ID978
Feature activation-6.29e-01

Pos logit contributions
nown+3.193
Jac+2.980
)+2.966
bay+2.932
cape+2.879
Neg logit contributions
 Worker-2.685
 competitor-2.558
 subordinates-2.529
 logger-2.519
roid-2.519
Tokenain
Token ID391
Feature activation-1.03e+00

Pos logit contributions
nown+2.135
Jac+1.900
)+1.879
bay+1.861
cape+1.792
Neg logit contributions
 Worker-4.385
 competitor-4.212
 subordinates-4.197
 logger-4.188
roid-4.165
Tokenu
Token ID84
Feature activation-7.45e-01

Pos logit contributions
nown+3.048
)+2.776
bay+2.758
Jac+2.734
cape+2.642
Neg logit contributions
 Worker-7.366
 competitor-7.191
 subordinates-7.154
 logger-7.117
 strikers-7.096
Token'
Token ID6
Feature activation-2.90e-01

Pos logit contributions
nown+3.679
Jac+3.429
)+3.414
bay+3.351
cape+3.302
Neg logit contributions
 Worker-3.920
roid-3.738
 competitor-3.729
 subordinates-3.722
 logger-3.707
Tokenuese
Token ID20506
Feature activation-5.98e-01

Pos logit contributions
nown+1.639
Jac+1.524
)+1.522
bay+1.487
cape+1.474
Neg logit contributions
 Worker-1.386
 competitor-1.304
 logger-1.296
 subordinates-1.289
roid-1.287
Token (
Token ID357
Feature activation-5.71e-01

Pos logit contributions
nown+3.285
Jac+3.110
)+3.050
bay+3.038
cape+2.975
Neg logit contributions
 Worker-2.821
 competitor-2.664
roid-2.662
 subordinates-2.652
 logger-2.637
Token6
Token ID21
Feature activation-6.07e-02

Pos logit contributions
nown+2.425
Jac+2.281
)+2.229
bay+2.198
cape+2.158
Neg logit contributions
 Worker-3.461
 competitor-3.320
 logger-3.302
 subordinates-3.297
roid-3.278
Token:
Token ID25
Feature activation-3.42e-01

Pos logit contributions
 Worker+0.258
roid+0.243
 competitor+0.243
 logger+0.240
 refugees+0.240
Neg logit contributions
nown-0.236
Jac-0.213
)-0.213
bay-0.208
cape-0.206
Token48
Token ID2780
Feature activation-5.77e-02

Pos logit contributions
nown+1.699
Jac+1.586
)+1.574
bay+1.528
cape+1.489
Neg logit contributions
 Worker-1.826
 competitor-1.770
 logger-1.744
roid-1.744
 subordinates-1.743
Token]
Token ID60
Feature activation-3.74e-01

Pos logit contributions
nown+0.325
Jac+0.300
)+0.299
bay+0.293
cape+0.290
Neg logit contributions
 Worker-0.278
 competitor-0.261
roid-0.260
 logger-0.258
 subordinates-0.257
Token <
Token ID1279
Feature activation-5.92e-01

Pos logit contributions
nown+2.382
)+2.254
Jac+2.243
bay+2.202
cape+2.184
Neg logit contributions
 Worker-1.483
 competitor-1.391
roid-1.383
 subordinates-1.369
 logger-1.360
Tokenmin
Token ID1084
Feature activation-4.46e-01

Pos logit contributions
nown+2.979
Jac+2.760
)+2.734
bay+2.728
cape+2.671
Neg logit contributions
 Worker-3.170
 competitor-3.017
 subordinates-2.988
 logger-2.984
 refugees-2.962
Tokenion
Token ID295
Feature activation-1.01e+00

Pos logit contributions
nown+2.537
Jac+2.350
)+2.342
bay+2.308
cape+2.285
Neg logit contributions
 Worker-2.113
 competitor-2.003
 subordinates-1.974
 logger-1.970
 refugees-1.956
Token13
Token ID1485
Feature activation-6.89e-01

Pos logit contributions
nown+2.253
Jac+1.968
bay+1.948
)+1.871
cape+1.747
Neg logit contributions
 Worker-8.144
 subordinates-7.891
 logger-7.886
 competitor-7.885
 refugees-7.724
Token37
Token ID2718
Feature activation-4.73e-01

Pos logit contributions
nown+3.007
Jac+2.766
)+2.752
bay+2.741
ffic+2.681
Neg logit contributions
 Worker-4.072
 competitor-3.926
 subordinates-3.921
 logger-3.880
roid-3.874
Token>
Token ID29
Feature activation-4.88e-01

Pos logit contributions
nown+2.306
)+2.172
Jac+2.166
bay+2.109
cape+2.094
Neg logit contributions
 Worker-2.562
 competitor-2.463
 subordinates-2.439
 logger-2.434
roid-2.415
Token le
Token ID443
Feature activation-4.90e-01

Pos logit contributions
nown+2.717
)+2.546
Jac+2.524
bay+2.471
cape+2.457
Neg logit contributions
 Worker-2.299
roid-2.180
 competitor-2.179
 subordinates-2.151
 logger-2.141
Tokenl
Token ID75
Feature activation-5.70e-01

Pos logit contributions
nown+3.210
)+3.021
Jac+3.012
bay+2.980
cape+2.961
Neg logit contributions
 Worker-1.848
 competitor-1.720
 logger-1.693
 subordinates-1.693
roid-1.684
Token:
Token ID25
Feature activation-3.96e-01

Pos logit contributions
 Worker+0.150
roid+0.142
 competitor+0.142
 logger+0.140
 refugees+0.140
Neg logit contributions
nown-0.136
Jac-0.124
)-0.123
bay-0.120
cape-0.119
Token47
Token ID2857
Feature activation-2.01e-01

Pos logit contributions
nown+1.870
Jac+1.749
)+1.726
bay+1.695
cape+1.658
Neg logit contributions
 Worker-2.200
 competitor-2.134
 subordinates-2.126
roid-2.123
 logger-2.111
Token]
Token ID60
Feature activation-3.45e-01

Pos logit contributions
nown+1.094
Jac+1.018
)+1.016
bay+0.995
cape+0.983
Neg logit contributions
 Worker-0.990
 competitor-0.939
roid-0.931
 logger-0.925
 subordinates-0.925
Token <
Token ID1279
Feature activation-5.50e-01

Pos logit contributions
nown+2.150
)+2.029
Jac+2.017
bay+1.976
cape+1.956
Neg logit contributions
 Worker-1.419
 competitor-1.327
roid-1.322
 subordinates-1.306
 logger-1.295
Tokenmom
Token ID32542
Feature activation-6.29e-01

Pos logit contributions
nown+2.764
Jac+2.562
)+2.536
bay+2.523
cape+2.477
Neg logit contributions
 Worker-2.956
 competitor-2.813
 subordinates-2.790
 logger-2.783
 refugees-2.763
Tokenper
Token ID525
Feature activation-1.01e+00

Pos logit contributions
nown+3.911
)+3.648
Jac+3.638
bay+3.610
cape+3.561
Neg logit contributions
 Worker-2.624
 competitor-2.464
 logger-2.440
 subordinates-2.435
 refugees-2.412
Token>
Token ID29
Feature activation-5.47e-01

Pos logit contributions
nown+4.550
)+4.248
Jac+4.235
bay+4.217
cape+4.125
Neg logit contributions
 Worker-5.747
 competitor-5.588
 subordinates-5.563
 logger-5.507
 strikers-5.481
Token lol
Token ID19462
Feature activation-5.37e-01

Pos logit contributions
nown+3.042
)+2.852
Jac+2.828
bay+2.776
cape+2.757
Neg logit contributions
 Worker-2.567
roid-2.436
 competitor-2.430
 subordinates-2.405
 logger-2.389
Token m
Token ID285
Feature activation-6.36e-01

Pos logit contributions
nown+2.825
)+2.630
Jac+2.612
bay+2.569
cape+2.528
Neg logit contributions
 Worker-2.705
 competitor-2.556
roid-2.550
 subordinates-2.527
 logger-2.520
Token2
Token ID17
Feature activation-5.20e-01

Pos logit contributions
nown+1.934
Jac+1.683
)+1.679
bay+1.653
cape+1.602
Neg logit contributions
 Worker-4.630
 competitor-4.455
 logger-4.439
 subordinates-4.417
 refugees-4.404
Tokenk
Token ID74
Feature activation-2.00e-01

Pos logit contributions
nown+2.242
Jac+2.056
)+2.045
bay+2.043
cape+1.985
Neg logit contributions
 Worker-3.126
 competitor-3.004
 subordinates-2.997
 logger-2.989
roid-2.973
Token:
Token ID25
Feature activation-2.35e-01

Pos logit contributions
 Worker+0.962
roid+0.905
 competitor+0.899
 refugees+0.890
 logger+0.890
Neg logit contributions
nown-0.928
)-0.833
Jac-0.833
bay-0.813
cape-0.804
Token48
Token ID2780
Feature activation+1.03e-01

Pos logit contributions
nown+1.217
Jac+1.128
)+1.120
bay+1.095
cape+1.072
Neg logit contributions
 Worker-1.218
 competitor-1.164
roid-1.154
 logger-1.149
 subordinates-1.145
Token]
Token ID60
Feature activation-3.39e-01

Pos logit contributions
 Worker+0.481
roid+0.449
 competitor+0.447
 logger+0.442
 refugees+0.441
Neg logit contributions
nown-0.602
Jac-0.550
)-0.548
bay-0.539
cape-0.536
Token <
Token ID1279
Feature activation-5.41e-01

Pos logit contributions
nown+2.134
)+2.014
Jac+2.006
bay+1.965
cape+1.945
Neg logit contributions
 Worker-1.378
 competitor-1.288
roid-1.282
 subordinates-1.267
 logger-1.257
Tokenmom
Token ID32542
Feature activation-7.11e-01

Pos logit contributions
nown+2.704
Jac+2.506
)+2.485
bay+2.470
cape+2.422
Neg logit contributions
 Worker-2.914
 competitor-2.774
 subordinates-2.748
 logger-2.745
 refugees-2.726
Tokenper
Token ID525
Feature activation-9.84e-01

Pos logit contributions
nown+4.495
)+4.200
bay+4.171
Jac+4.165
ffic+4.117
Neg logit contributions
 Worker-2.912
 logger-2.740
 competitor-2.724
 subordinates-2.721
 refugees-2.677
Token>
Token ID29
Feature activation-7.18e-01

Pos logit contributions
nown+4.293
)+4.137
Jac+4.121
bay+4.010
cape+3.961
Neg logit contributions
 Worker-5.618
 competitor-5.487
 subordinates-5.463
 strikers-5.420
 logger-5.409
Token lol
Token ID19462
Feature activation-4.75e-01

Pos logit contributions
nown+3.848
)+3.612
Jac+3.557
bay+3.500
cape+3.474
Neg logit contributions
 Worker-3.468
roid-3.293
 competitor-3.290
 subordinates-3.257
 logger-3.230
Token m
Token ID285
Feature activation-4.85e-01

Pos logit contributions
nown+2.379
)+2.204
Jac+2.186
bay+2.146
cape+2.109
Neg logit contributions
 Worker-2.517
 competitor-2.386
roid-2.379
 subordinates-2.354
 logger-2.354
Token2
Token ID17
Feature activation-5.89e-01

Pos logit contributions
nown+1.609
Jac+1.425
)+1.424
bay+1.403
cape+1.359
Neg logit contributions
 Worker-3.419
 logger-3.297
 competitor-3.296
 subordinates-3.273
 strikers-3.249
Tokenk
Token ID74
Feature activation-3.12e-01

Pos logit contributions
nown+2.484
Jac+2.286
bay+2.277
)+2.275
cape+2.205
Neg logit contributions
 Worker-3.572
 competitor-3.445
 subordinates-3.439
 logger-3.428
roid-3.402
Token:
Token ID25
Feature activation-4.05e-01

Pos logit contributions
 Worker+0.456
roid+0.430
 competitor+0.429
 logger+0.424
 refugees+0.423
Neg logit contributions
nown-0.423
Jac-0.381
)-0.381
bay-0.372
cape-0.368
Token47
Token ID2857
Feature activation-1.82e-01

Pos logit contributions
nown+1.911
Jac+1.790
)+1.763
bay+1.739
cape+1.700
Neg logit contributions
 Worker-2.248
 competitor-2.180
 subordinates-2.174
roid-2.168
 logger-2.159
Token]
Token ID60
Feature activation-3.52e-01

Pos logit contributions
nown+0.994
Jac+0.924
)+0.923
bay+0.904
cape+0.894
Neg logit contributions
 Worker-0.895
 competitor-0.849
roid-0.842
 logger-0.836
 subordinates-0.836
Token <
Token ID1279
Feature activation-5.50e-01

Pos logit contributions
nown+2.193
)+2.069
Jac+2.057
bay+2.016
cape+1.995
Neg logit contributions
 Worker-1.446
 competitor-1.352
roid-1.345
 subordinates-1.330
 logger-1.319
Tokenmom
Token ID32542
Feature activation-6.42e-01

Pos logit contributions
nown+2.762
Jac+2.561
)+2.536
bay+2.524
cape+2.475
Neg logit contributions
 Worker-2.949
 competitor-2.807
 subordinates-2.781
 logger-2.777
 refugees-2.758
Tokenper
Token ID525
Feature activation-9.62e-01

Pos logit contributions
nown+4.114
)+3.838
Jac+3.820
bay+3.811
ffic+3.759
Neg logit contributions
 Worker-2.575
 logger-2.415
 competitor-2.408
 subordinates-2.401
 refugees-2.364
Token>
Token ID29
Feature activation-6.83e-01

Pos logit contributions
nown+4.239
)+4.069
Jac+4.054
bay+3.952
cape+3.902
Neg logit contributions
 Worker-5.470
 competitor-5.341
 subordinates-5.322
 strikers-5.274
 logger-5.267
Token lol
Token ID19462
Feature activation-4.18e-01

Pos logit contributions
nown+3.670
)+3.447
Jac+3.399
bay+3.340
cape+3.319
Neg logit contributions
 Worker-3.287
 competitor-3.126
roid-3.124
 subordinates-3.095
 logger-3.066
Token m
Token ID285
Feature activation-5.20e-01

Pos logit contributions
nown+2.142
)+1.987
Jac+1.975
bay+1.937
cape+1.907
Neg logit contributions
 Worker-2.176
 competitor-2.065
roid-2.055
 logger-2.037
 subordinates-2.037
Token2
Token ID17
Feature activation-4.43e-01

Pos logit contributions
nown+1.732
Jac+1.526
)+1.526
bay+1.506
cape+1.461
Neg logit contributions
 Worker-3.659
 logger-3.526
 competitor-3.524
 subordinates-3.503
 refugees-3.478
Tokenk
Token ID74
Feature activation-1.77e-01

Pos logit contributions
nown+1.934
Jac+1.777
)+1.769
bay+1.757
cape+1.713
Neg logit contributions
 Worker-2.641
 competitor-2.539
 subordinates-2.525
 logger-2.523
roid-2.512
Token &
Token ID1222
Feature activation-1.24e-01

Pos logit contributions
 Worker+0.349
 competitor+0.332
roid+0.329
 logger+0.327
 subordinates+0.326
Neg logit contributions
nown-0.349
Jac-0.322
)-0.321
bay-0.315
cape-0.311
Token mut
Token ID4517
Feature activation-4.44e-01

Pos logit contributions
nown+0.613
Jac+0.563
)+0.562
bay+0.549
cape+0.543
Neg logit contributions
 Worker-0.678
 competitor-0.645
roid-0.642
 logger-0.637
 subordinates-0.636
Token self
Token ID2116
Feature activation-1.45e-01

Pos logit contributions
nown+2.536
)+2.363
Jac+2.357
bay+2.297
cape+2.295
Neg logit contributions
 Worker-2.029
roid-1.922
 competitor-1.909
 subordinates-1.895
 logger-1.885
Token )
Token ID1267
Feature activation+1.72e-01

Pos logit contributions
nown+0.642
)+0.582
Jac+0.582
bay+0.565
cape+0.558
Neg logit contributions
 Worker-0.871
 competitor-0.831
roid-0.828
 logger-0.822
 subordinates-0.821
Token {
Token ID1391
Feature activation+7.82e-02

Pos logit contributions
 Worker+0.887
 competitor+0.839
roid+0.830
 logger+0.830
 subordinates+0.823
Neg logit contributions
nown-0.912
Jac-0.838
)-0.833
bay-0.819
cape-0.809
Token let
Token ID1309
Feature activation-9.11e-01

Pos logit contributions
 Worker+0.313
 competitor+0.293
roid+0.292
 logger+0.288
 subordinates+0.288
Neg logit contributions
nown-0.499
Jac-0.467
)-0.467
bay-0.459
cape-0.455
Token prev
Token ID8654
Feature activation-6.58e-01

Pos logit contributions
nown+6.328
)+5.949
Jac+5.907
bay+5.799
cape+5.797
Neg logit contributions
 Worker-2.828
roid-2.608
 Developer-2.522
 logger-2.509
 strikers-2.502
Token :
Token ID1058
Feature activation+2.15e-01

Pos logit contributions
nown+2.518
Jac+2.315
)+2.276
bay+2.236
ffic+2.175
Neg logit contributions
 Worker-4.242
 competitor-4.101
 subordinates-4.094
 logger-4.080
 strikers-4.051
Token Board
Token ID5926
Feature activation-1.15e-01

Pos logit contributions
 Worker+1.080
 competitor+1.022
roid+1.014
 logger+1.006
 subordinates+1.005
Neg logit contributions
nown-1.157
Jac-1.074
)-1.072
bay-1.049
cape-1.038
Token =
Token ID796
Feature activation+2.89e-01

Pos logit contributions
nown+0.553
Jac+0.508
)+0.504
bay+0.492
cape+0.488
Neg logit contributions
 Worker-0.643
 competitor-0.614
roid-0.611
 subordinates-0.607
 logger-0.607
Token self
Token ID2116
Feature activation+5.91e-02

Pos logit contributions
 Worker+1.273
 competitor+1.195
roid+1.180
 logger+1.171
 subordinates+1.169
Neg logit contributions
nown-1.738
Jac-1.627
)-1.623
bay-1.592
cape-1.578
Token (
Token ID357
Feature activation-3.00e-01

Pos logit contributions
nown+0.800
Jac+0.744
)+0.744
bay+0.730
cape+0.723
Neg logit contributions
 Worker-0.634
 competitor-0.598
roid-0.594
 subordinates-0.588
 logger-0.588
Token13
Token ID1485
Feature activation-1.52e-01

Pos logit contributions
nown+1.404
Jac+1.285
)+1.280
bay+1.264
cape+1.244
Neg logit contributions
 Worker-1.708
 competitor-1.632
 subordinates-1.614
 logger-1.613
roid-1.606
Token,
Token ID11
Feature activation-1.51e-01

Pos logit contributions
nown+0.857
)+0.797
Jac+0.797
bay+0.780
cape+0.772
Neg logit contributions
 Worker-0.728
 competitor-0.688
roid-0.683
 logger-0.677
 subordinates-0.676
Token 5
Token ID642
Feature activation-2.72e-01

Pos logit contributions
nown+0.688
Jac+0.628
)+0.626
bay+0.611
cape+0.602
Neg logit contributions
 Worker-0.886
 competitor-0.846
roid-0.842
 logger-0.836
 subordinates-0.835
Token))
Token ID4008
Feature activation-9.08e-02

Pos logit contributions
nown+1.353
)+1.259
Jac+1.250
bay+1.222
cape+1.203
Neg logit contributions
 Worker-1.467
 competitor-1.397
roid-1.385
 subordinates-1.377
 logger-1.375
Token sq
Token ID19862
Feature activation-9.03e-01

Pos logit contributions
nown+0.540
Jac+0.504
)+0.503
bay+0.494
cape+0.489
Neg logit contributions
 Worker-0.404
 competitor-0.380
roid-0.377
 logger-0.374
 subordinates-0.373
TokenK
Token ID42
Feature activation-7.21e-02

Pos logit contributions
nown+4.771
Jac+4.438
bay+4.369
ffic+4.287
cape+4.250
Neg logit contributions
 Worker-4.445
 logger-4.263
 competitor-4.254
 subordinates-4.241
 reviewer-4.173
Tokenernel
Token ID7948
Feature activation+3.37e-02

Pos logit contributions
nown+0.578
Jac+0.548
)+0.547
bay+0.540
cape+0.536
Neg logit contributions
 Worker-0.173
 competitor-0.155
 logger-0.150
 subordinates-0.150
roid-0.149
Token =
Token ID796
Feature activation+5.16e-01

Pos logit contributions
 Worker+0.197
 competitor+0.188
roid+0.187
 logger+0.186
 subordinates+0.185
Neg logit contributions
nown-0.154
)-0.140
Jac-0.140
bay-0.136
cape-0.135
Token c
Token ID269
Feature activation+8.61e-02

Pos logit contributions
 Worker+2.765
 competitor+2.639
 logger+2.631
 refugees+2.595
roid+2.589
Neg logit contributions
nown-2.609
Jac-2.419
)-2.408
bay-2.330
ffic-2.319
Tokenv
Token ID85
Feature activation+1.58e-01

Pos logit contributions
 Worker+0.469
roid+0.450
 competitor+0.443
 refugees+0.439
 logger+0.436
Neg logit contributions
nown-0.422
)-0.390
Jac-0.389
bay-0.370
ffic-0.369
Token School
Token ID3961
Feature activation+2.77e-01

Pos logit contributions
nown+2.896
Jac+2.760
)+2.725
bay+2.688
cape+2.667
Neg logit contributions
 Worker-2.038
roid-1.983
 competitor-1.980
 logger-1.954
 subordinates-1.943
Token Psychology
Token ID18430
Feature activation+3.89e-01

Pos logit contributions
 Worker+1.201
 competitor+1.119
roid+1.107
 subordinates+1.095
 logger+1.093
Neg logit contributions
nown-1.693
)-1.579
Jac-1.577
bay-1.542
cape-1.525
Token International
Token ID4037
Feature activation+1.51e-01

Pos logit contributions
 Worker+1.659
 competitor+1.524
roid+1.509
 refugees+1.489
 logger+1.482
Neg logit contributions
nown-2.437
)-2.270
Jac-2.266
bay-2.210
ffic-2.183
Token,
Token ID11
Feature activation-2.09e-01

Pos logit contributions
 Worker+0.656
 competitor+0.610
roid+0.601
 logger+0.596
 subordinates+0.595
Neg logit contributions
nown-0.927
)-0.861
Jac-0.859
bay-0.839
cape-0.832
Token 25
Token ID1679
Feature activation-2.43e-01

Pos logit contributions
nown+1.020
Jac+0.939
)+0.938
bay+0.916
cape+0.905
Neg logit contributions
 Worker-1.147
 competitor-1.093
roid-1.089
 logger-1.080
 subordinates-1.079
Token,
Token ID11
Feature activation-9.00e-01

Pos logit contributions
nown+1.420
Jac+1.327
)+1.319
bay+1.299
cape+1.282
Neg logit contributions
 Worker-1.097
 competitor-1.034
roid-1.028
 logger-1.020
 subordinates-1.018
Token 287
Token ID38721
Feature activation-4.05e-01

Pos logit contributions
nown+3.871
Jac+3.644
)+3.605
bay+3.509
cape+3.463
Neg logit contributions
 Worker-5.129
roid-4.997
 competitor-4.931
 subordinates-4.923
 logger-4.879
Token -
Token ID532
Feature activation-1.92e-01

Pos logit contributions
nown+1.874
Jac+1.727
)+1.716
bay+1.678
cape+1.655
Neg logit contributions
 Worker-2.324
 competitor-2.224
roid-2.214
 subordinates-2.199
 logger-2.197
Token 300
Token ID5867
Feature activation-3.84e-01

Pos logit contributions
nown+0.810
Jac+0.739
)+0.736
bay+0.715
cape+0.710
Neg logit contributions
 Worker-1.180
roid-1.132
 competitor-1.131
 logger-1.120
 subordinates-1.119
Token.
Token ID764
Feature activation-5.69e-01

Pos logit contributions
nown+1.892
Jac+1.759
)+1.745
bay+1.713
cape+1.692
Neg logit contributions
 Worker-2.077
 competitor-1.987
roid-1.967
 subordinates-1.961
 logger-1.960
Token
Token ID198
Feature activation-2.39e-01

Pos logit contributions
nown+2.863
Jac+2.723
)+2.691
bay+2.644
cape+2.603
Neg logit contributions
 Worker-2.955
 competitor-2.858
 subordinates-2.822
 logger-2.814
roid-2.808
Token Crim
Token ID9822
Feature activation+2.08e-01

Pos logit contributions
nown+2.939
Jac+2.798
)+2.765
bay+2.724
cape+2.698
Neg logit contributions
 Worker-2.074
 competitor-2.015
roid-2.009
 logger-1.987
 subordinates-1.975
Tokenin
Token ID259
Feature activation-2.92e-01

Pos logit contributions
 Worker+1.072
roid+1.018
 competitor+1.011
 logger+0.996
 subordinates+0.996
Neg logit contributions
nown-1.098
)-1.012
Jac-1.004
bay-0.983
cape-0.977
Tokenology
Token ID1435
Feature activation+5.26e-04

Pos logit contributions
nown+2.017
Jac+1.922
)+1.900
bay+1.890
ffic+1.864
Neg logit contributions
 Worker-0.997
 competitor-0.933
 subordinates-0.916
 logger-0.910
roid-0.905
Token,
Token ID11
Feature activation-3.02e-01

Pos logit contributions
 Worker+0.002
 competitor+0.002
roid+0.002
 logger+0.002
 subordinates+0.002
Neg logit contributions
nown-0.003
Jac-0.003
)-0.003
bay-0.003
cape-0.003
Token 35
Token ID3439
Feature activation-3.20e-01

Pos logit contributions
nown+1.473
Jac+1.369
)+1.364
bay+1.334
cape+1.315
Neg logit contributions
 Worker-1.642
 competitor-1.571
roid-1.566
 logger-1.553
 subordinates-1.552
Token,
Token ID11
Feature activation-8.91e-01

Pos logit contributions
nown+1.892
Jac+1.769
)+1.756
bay+1.731
cape+1.708
Neg logit contributions
 Worker-1.431
 competitor-1.347
roid-1.337
 logger-1.329
 subordinates-1.328
Token 177
Token ID26607
Feature activation-4.69e-01

Pos logit contributions
nown+3.852
Jac+3.638
)+3.593
bay+3.500
cape+3.467
Neg logit contributions
 Worker-5.049
roid-4.910
 competitor-4.851
 subordinates-4.835
 logger-4.792
Token -
Token ID532
Feature activation-2.04e-01

Pos logit contributions
nown+2.138
Jac+1.981
)+1.959
bay+1.915
cape+1.889
Neg logit contributions
 Worker-2.711
 competitor-2.599
roid-2.584
 subordinates-2.577
 logger-2.566
Token 202
Token ID22131
Feature activation-4.44e-01

Pos logit contributions
nown+0.769
Jac+0.684
)+0.683
bay+0.662
cape+0.649
Neg logit contributions
 Worker-1.357
 competitor-1.304
roid-1.292
 logger-1.288
 subordinates-1.287
Token.
Token ID764
Feature activation-5.39e-01

Pos logit contributions
nown+2.147
Jac+2.009
)+1.990
bay+1.953
cape+1.931
Neg logit contributions
 Worker-2.429
 competitor-2.330
 subordinates-2.306
roid-2.302
 logger-2.299
Token
Token ID198
Feature activation-2.15e-01

Pos logit contributions
nown+2.734
Jac+2.595
)+2.565
bay+2.521
cape+2.484
Neg logit contributions
 Worker-2.784
 competitor-2.685
 subordinates-2.652
 logger-2.646
roid-2.641
Token -
Token ID532
Feature activation+4.87e-03

Pos logit contributions
 Worker+0.203
 competitor+0.192
roid+0.190
 logger+0.189
 subordinates+0.189
Neg logit contributions
nown-0.207
)-0.190
Jac-0.190
bay-0.185
cape-0.184
Token January
Token ID3269
Feature activation-1.10e-01

Pos logit contributions
 Worker+0.026
 competitor+0.025
roid+0.025
 logger+0.024
 subordinates+0.024
Neg logit contributions
nown-0.025
)-0.023
Jac-0.022
bay-0.022
cape-0.022
Token 4
Token ID604
Feature activation+3.12e-02

Pos logit contributions
nown+0.587
)+0.545
Jac+0.544
bay+0.533
cape+0.526
Neg logit contributions
 Worker-0.553
 competitor-0.527
roid-0.522
 logger-0.520
 subordinates-0.519
Token,
Token ID11
Feature activation-2.57e-01

Pos logit contributions
 Worker+0.131
 competitor+0.122
roid+0.122
 logger+0.120
 subordinates+0.120
Neg logit contributions
nown-0.195
Jac-0.181
)-0.181
bay-0.178
cape-0.176
Token 2010
Token ID3050
Feature activation-1.74e-01

Pos logit contributions
nown+1.172
)+1.080
Jac+1.074
bay+1.039
cape+1.031
Neg logit contributions
 Worker-1.473
 competitor-1.422
roid-1.411
 logger-1.405
 subordinates-1.404
TokenUser
Token ID12982
Feature activation-8.88e-01

Pos logit contributions
nown+0.713
Jac+0.644
)+0.643
bay+0.625
cape+0.615
Neg logit contributions
 Worker-1.092
 competitor-1.047
roid-1.041
 logger-1.036
 subordinates-1.035
Token reviews
Token ID8088
Feature activation-1.39e-01

Pos logit contributions
nown+6.434
)+6.258
Jac+6.250
bay+6.240
ffic+6.137
Neg logit contributions
 Worker-2.502
roid-2.163
 logger-2.121
 competitor-2.069
 subordinates-2.002
Token are
Token ID389
Feature activation-3.64e-01

Pos logit contributions
nown+0.726
Jac+0.670
)+0.670
bay+0.656
cape+0.650
Neg logit contributions
 Worker-0.723
 competitor-0.685
roid-0.681
 logger-0.676
 subordinates-0.675
Token a
Token ID257
Feature activation-3.19e-01

Pos logit contributions
nown+2.119
)+1.991
Jac+1.982
bay+1.951
cape+1.928
Neg logit contributions
 Worker-1.653
 competitor-1.550
roid-1.545
 subordinates-1.528
 logger-1.526
Token bad
Token ID2089
Feature activation-6.51e-01

Pos logit contributions
nown+2.268
)+2.154
Jac+2.144
bay+2.124
cape+2.096
Neg logit contributions
 Worker-1.033
roid-0.933
 competitor-0.930
 subordinates-0.923
 logger-0.913
Token idea
Token ID2126
Feature activation-5.28e-01

Pos logit contributions
nown+2.923
)+2.676
Jac+2.661
bay+2.552
ffic+2.513
Neg logit contributions
 Worker-3.770
roid-3.563
 competitor-3.553
 subordinates-3.539
 strikers-3.533
Token=
Token ID28
Feature activation-1.88e-01

Pos logit contributions
nown+2.474
Jac+2.293
)+2.252
bay+2.241
ffic+2.178
Neg logit contributions
 Worker-3.193
 subordinates-3.062
 competitor-3.062
roid-3.061
 logger-3.049
Tokencount
Token ID9127
Feature activation-7.08e-01

Pos logit contributions
nown+0.882
Jac+0.810
)+0.808
bay+0.788
cape+0.778
Neg logit contributions
 Worker-1.073
 competitor-1.024
roid-1.017
 logger-1.012
 subordinates-1.011
Tokeny
Token ID88
Feature activation-3.48e-01

Pos logit contributions
nown+3.966
)+3.765
Jac+3.737
bay+3.680
cape+3.638
Neg logit contributions
 Worker-3.265
 logger-3.113
 competitor-3.107
 subordinates-3.053
 refugees-3.013
Token:
Token ID25
Feature activation-3.10e-02

Pos logit contributions
nown+1.664
Jac+1.544
)+1.539
bay+1.505
cape+1.484
Neg logit contributions
 Worker-1.924
 competitor-1.843
roid-1.825
 logger-1.821
 subordinates-1.819
Token*
Token ID9
Feature activation-2.95e-01

Pos logit contributions
nown+0.144
Jac+0.131
)+0.131
bay+0.128
cape+0.126
Neg logit contributions
 Worker-0.179
 competitor-0.170
roid-0.169
 logger-0.168
 subordinates-0.168
Token&
Token ID5
Feature activation-8.81e-01

Pos logit contributions
nown+1.417
Jac+1.313
)+1.308
bay+1.282
cape+1.261
Neg logit contributions
 Worker-1.627
 competitor-1.559
roid-1.550
 logger-1.542
 subordinates-1.541
Tokenkey
Token ID2539
Feature activation+2.02e-01

Pos logit contributions
nown+4.313
Jac+4.008
bay+3.991
)+3.953
cape+3.886
Neg logit contributions
 Worker-4.659
 competitor-4.474
 logger-4.448
 subordinates-4.426
 refugees-4.397
Token=
Token ID28
Feature activation+7.20e-02

Pos logit contributions
 Worker+1.162
 competitor+1.109
roid+1.105
 logger+1.096
 subordinates+1.085
Neg logit contributions
nown-0.940
Jac-0.858
)-0.858
bay-0.835
cape-0.821
Token
Token ID1399
Feature activation-2.84e-01

Pos logit contributions
 Worker+0.418
 competitor+0.399
roid+0.398
 logger+0.394
 subordinates+0.391
Neg logit contributions
nown-0.332
)-0.299
Jac-0.299
bay-0.292
cape-0.289
Token
Token ID198
Feature activation-2.48e-01

Pos logit contributions
nown+1.427
Jac+1.322
)+1.319
bay+1.295
cape+1.276
Neg logit contributions
 Worker-1.511
 competitor-1.444
roid-1.436
 subordinates-1.426
 logger-1.424
Token
Token ID198
Feature activation-1.00e-01

Pos logit contributions
nown+1.329
Jac+1.241
)+1.236
bay+1.212
cape+1.197
Neg logit contributions
 Worker-1.246
 competitor-1.185
roid-1.175
 logger-1.169
 subordinates-1.168
Token 6
Token ID718
Feature activation+1.89e-01

Pos logit contributions
nown+0.287
Jac+0.264
)+0.264
bay+0.258
cape+0.255
Neg logit contributions
 Worker-0.296
 competitor-0.281
roid-0.279
 logger-0.277
 subordinates-0.276
Token 0
Token ID657
Feature activation+3.39e-01

Pos logit contributions
 Worker+1.077
 competitor+1.026
roid+1.015
 logger+1.008
 refugees+1.006
Neg logit contributions
nown-0.899
Jac-0.812
)-0.808
bay-0.791
cape-0.782
Token 2016
Token ID1584
Feature activation-5.85e-02

Pos logit contributions
 Worker+1.763
 competitor+1.669
roid+1.654
 logger+1.643
 subordinates+1.640
Neg logit contributions
nown-1.774
Jac-1.629
)-1.627
bay-1.597
cape-1.577
Token 7
Token ID767
Feature activation+1.65e-01

Pos logit contributions
nown+0.307
Jac+0.283
)+0.282
bay+0.277
cape+0.273
Neg logit contributions
 Worker-0.303
 competitor-0.287
roid-0.285
 logger-0.283
 subordinates-0.282
Token 1
Token ID352
Feature activation+3.41e-01

Pos logit contributions
 Worker+0.948
 competitor+0.903
roid+0.896
 logger+0.889
 subordinates+0.887
Neg logit contributions
nown-0.771
Jac-0.700
)-0.696
bay-0.682
cape-0.673
Token Total
Token ID7472
Feature activation-8.67e-01

Pos logit contributions
 Worker+1.851
 competitor+1.747
 refugees+1.710
roid+1.706
 subordinates+1.703
Neg logit contributions
nown-1.730
Jac-1.592
)-1.584
bay-1.544
cape-1.531
Token 121
Token ID20416
Feature activation-2.63e-01

Pos logit contributions
nown+4.615
)+4.370
Jac+4.296
bay+4.252
ffic+4.244
Neg logit contributions
 Worker-4.042
roid-3.869
 subordinates-3.802
 logger-3.798
 competitor-3.796
Token 24
Token ID1987
Feature activation+1.24e-01

Pos logit contributions
nown+1.273
Jac+1.177
)+1.176
bay+1.148
cape+1.133
Neg logit contributions
 Worker-1.442
 competitor-1.376
roid-1.372
 logger-1.361
 subordinates-1.360
Token
Token ID198
Feature activation-1.48e-01

Pos logit contributions
 Worker+0.677
 competitor+0.641
roid+0.637
 logger+0.632
 subordinates+0.631
Neg logit contributions
nown-0.616
Jac-0.563
)-0.562
bay-0.549
cape-0.543
Token
Token ID198
Feature activation-9.30e-02

Pos logit contributions
nown+0.806
Jac+0.748
)+0.746
bay+0.731
cape+0.723
Neg logit contributions
 Worker-0.729
 competitor-0.691
roid-0.686
 logger-0.681
 subordinates-0.680
TokenInternational
Token ID24274
Feature activation+1.39e-01

Pos logit contributions
nown+0.453
Jac+0.417
)+0.415
bay+0.406
cape+0.401
Neg logit contributions
 Worker-0.515
 competitor-0.490
roid-0.487
 logger-0.485
 subordinates-0.483
Token
Token ID198
Feature activation+2.00e-02

Pos logit contributions
 Worker+0.059
 competitor+0.056
roid+0.055
 logger+0.055
 subordinates+0.055
Neg logit contributions
nown-0.068
)-0.063
Jac-0.063
bay-0.062
cape-0.061
Token6
Token ID21
Feature activation+1.74e-01

Pos logit contributions
 Worker+0.112
 competitor+0.107
roid+0.107
 logger+0.106
 subordinates+0.106
Neg logit contributions
nown-0.095
)-0.087
Jac-0.087
bay-0.085
cape-0.084
Token)
Token ID8
Feature activation+4.84e-01

Pos logit contributions
 Worker+0.885
 competitor+0.838
 logger+0.824
roid+0.823
 subordinates+0.822
Neg logit contributions
nown-0.932
Jac-0.856
)-0.851
bay-0.836
cape-0.832
Token F
Token ID376
Feature activation-2.43e-01

Pos logit contributions
 Worker+2.350
 competitor+2.199
roid+2.158
 Developer+2.151
 logger+2.148
Neg logit contributions
nown-2.727
)-2.502
Jac-2.498
bay-2.452
cape-2.429
Tokenours
Token ID4662
Feature activation+1.09e-02

Pos logit contributions
nown+1.181
Jac+1.084
)+1.081
bay+1.056
cape+1.042
Neg logit contributions
 Worker-1.347
 competitor-1.283
roid-1.273
 logger-1.270
 subordinates-1.267
Tokenqu
Token ID421
Feature activation-8.58e-01

Pos logit contributions
 Worker+0.073
 competitor+0.070
roid+0.070
 logger+0.069
 subordinates+0.069
Neg logit contributions
nown-0.041
Jac-0.036
)-0.036
bay-0.035
cape-0.034
Tokenare
Token ID533
Feature activation-2.64e-01

Pos logit contributions
nown+1.929
Jac+1.634
)+1.632
bay+1.559
ffic+1.479
Neg logit contributions
 Worker-6.898
 competitor-6.659
 logger-6.647
 subordinates-6.633
roid-6.583
Token
Token ID198
Feature activation+3.21e-02

Pos logit contributions
nown+1.460
)+1.358
Jac+1.357
bay+1.333
cape+1.318
Neg logit contributions
 Worker-1.263
roid-1.195
 competitor-1.189
 logger-1.184
 subordinates-1.180
Token
Token ID198
Feature activation+1.96e-01

Pos logit contributions
 Worker+0.154
 competitor+0.145
roid+0.145
 logger+0.143
 refugees+0.142
Neg logit contributions
nown-0.181
)-0.167
Jac-0.166
bay-0.163
cape-0.161
Token#
Token ID2
Feature activation+9.26e-02

Pos logit contributions
 Worker+1.136
 competitor+1.091
roid+1.089
 subordinates+1.067
 refugees+1.065
Neg logit contributions
nown-0.893
)-0.812
Jac-0.803
bay-0.784
cape-0.777
Token of
Token ID286
Feature activation+5.29e-02

Pos logit contributions
 Worker+0.551
 competitor+0.527
roid+0.524
 logger+0.521
 subordinates+0.520
Neg logit contributions
nown-0.413
Jac-0.376
)-0.375
bay-0.365
cape-0.360
Token head
Token ID1182
Feature activation-1.35e-01

Pos logit contributions
nown+1.283
)+1.199
Jac+1.196
bay+1.181
cape+1.169
Neg logit contributions
 Worker-0.931
roid-0.872
 competitor-0.872
 subordinates-0.865
 logger-0.864
Token over
Token ID625
Feature activation-3.26e-01

Pos logit contributions
nown+0.788
Jac+0.734
)+0.733
bay+0.719
cape+0.712
Neg logit contributions
 Worker-0.615
 competitor-0.579
roid-0.575
 logger-0.571
 subordinates-0.569
Token to
Token ID284
Feature activation-4.26e-01

Pos logit contributions
nown+1.755
Jac+1.625
)+1.624
bay+1.596
cape+1.569
Neg logit contributions
 Worker-1.628
 competitor-1.539
roid-1.535
 subordinates-1.526
 logger-1.520
Token the
Token ID262
Feature activation-7.65e-02

Pos logit contributions
nown+2.340
Jac+2.173
)+2.171
bay+2.147
cape+2.103
Neg logit contributions
 Worker-2.047
roid-1.946
 competitor-1.939
 subordinates-1.935
 logger-1.921
Token passenger
Token ID11849
Feature activation-4.30e-01

Pos logit contributions
nown+0.478
Jac+0.447
)+0.447
bay+0.440
cape+0.435
Neg logit contributions
 Worker-0.317
 competitor-0.297
roid-0.295
 logger-0.292
 subordinates-0.292
Token side
Token ID1735
Feature activation-8.58e-01

Pos logit contributions
nown+2.289
)+2.128
Jac+2.126
bay+2.101
cape+2.057
Neg logit contributions
 Worker-2.155
roid-2.037
 competitor-2.035
 subordinates-2.011
 logger-2.004
Token and
Token ID290
Feature activation-7.62e-02

Pos logit contributions
nown+4.793
bay+4.530
)+4.524
Jac+4.483
cape+4.406
Neg logit contributions
 Worker-3.920
 subordinates-3.693
roid-3.675
 competitor-3.654
 refugees-3.622
Token lift
Token ID10303
Feature activation-4.27e-02

Pos logit contributions
nown+0.481
)+0.450
Jac+0.450
bay+0.443
cape+0.438
Neg logit contributions
 Worker-0.311
 competitor-0.290
roid-0.289
 logger-0.286
 subordinates-0.285
Token up
Token ID510
Feature activation-3.32e-02

Pos logit contributions
nown+0.247
Jac+0.230
)+0.229
bay+0.224
cape+0.223
Neg logit contributions
 Worker-0.198
 competitor-0.186
roid-0.185
 logger-0.184
 subordinates-0.182
Token the
Token ID262
Feature activation-2.27e-01

Pos logit contributions
nown+0.188
Jac+0.175
)+0.175
bay+0.170
cape+0.169
Neg logit contributions
 Worker-0.158
 competitor-0.150
roid-0.148
 logger-0.147
 subordinates-0.146
Token e
Token ID304
Feature activation-5.61e-02

Pos logit contributions
nown+1.407
)+1.317
Jac+1.316
bay+1.296
cape+1.281
Neg logit contributions
 Worker-0.947
 competitor-0.887
roid-0.884
 subordinates-0.873
 logger-0.872
Token
Token ID198
Feature activation+2.23e-02

Pos logit contributions
 Worker+1.084
 competitor+1.023
roid+1.017
 logger+1.009
 refugees+1.001
Neg logit contributions
nown-1.116
)-1.025
Jac-1.022
bay-1.000
cape-0.994
Token
Token ID198
Feature activation-6.06e-02

Pos logit contributions
 Worker+0.107
 competitor+0.100
roid+0.100
 logger+0.099
 subordinates+0.099
Neg logit contributions
nown-0.126
)-0.116
Jac-0.116
bay-0.114
cape-0.113
Token
Token ID447
Feature activation-6.67e-02

Pos logit contributions
nown+0.269
Jac+0.245
)+0.245
bay+0.238
cape+0.235
Neg logit contributions
 Worker-0.361
 competitor-0.345
roid-0.344
 logger-0.341
 subordinates-0.341
Token
Token ID250
Feature activation-3.46e-01

Pos logit contributions
nown+0.420
Jac+0.395
)+0.394
bay+0.387
cape+0.382
Neg logit contributions
 Worker-0.273
 competitor-0.256
roid-0.253
 logger-0.252
 subordinates-0.251
TokenGeneral
Token ID12218
Feature activation-3.49e-01

Pos logit contributions
nown+1.657
Jac+1.549
)+1.523
bay+1.496
cape+1.468
Neg logit contributions
 Worker-1.935
 competitor-1.847
 logger-1.837
roid-1.828
 subordinates-1.826
Token Chuck
Token ID14663
Feature activation-8.47e-01

Pos logit contributions
nown+1.856
Jac+1.736
)+1.726
bay+1.696
cape+1.669
Neg logit contributions
 Worker-1.740
 competitor-1.660
roid-1.646
 logger-1.638
 subordinates-1.633
Token Ye
Token ID11609
Feature activation-4.34e-01

Pos logit contributions
nown+4.038
)+3.748
Jac+3.735
bay+3.657
ffic+3.595
Neg logit contributions
 Worker-4.445
 competitor-4.324
 logger-4.308
roid-4.231
 blocker-4.215
Tokenager
Token ID3536
Feature activation-2.40e-01

Pos logit contributions
nown+2.418
)+2.228
Jac+2.226
bay+2.203
ffic+2.163
Neg logit contributions
 Worker-2.075
 competitor-1.972
 logger-1.957
 subordinates-1.955
 refugees-1.931
Token Interview
Token ID19371
Feature activation-2.91e-02

Pos logit contributions
nown+1.288
Jac+1.194
)+1.193
bay+1.168
cape+1.152
Neg logit contributions
 Worker-1.206
 competitor-1.145
roid-1.141
 logger-1.131
 subordinates-1.127
Token,
Token ID11
Feature activation-3.15e-01

Pos logit contributions
nown+0.157
Jac+0.145
)+0.145
bay+0.142
cape+0.140
Neg logit contributions
 Worker-0.147
 competitor-0.139
roid-0.138
 logger-0.137
 subordinates-0.137
Token
Token ID447
Feature activation+4.88e-02

Pos logit contributions
nown+1.514
Jac+1.395
)+1.391
bay+1.357
cape+1.336
Neg logit contributions
 Worker-1.755
 competitor-1.675
roid-1.668
 logger-1.655
 subordinates-1.652
Token shield
Token ID7614
Feature activation+2.52e-02

Pos logit contributions
nown+1.332
)+1.246
Jac+1.244
bay+1.225
cape+1.213
Neg logit contributions
 Worker-0.954
 competitor-0.894
roid-0.893
 logger-0.881
 subordinates-0.880
Token.
Token ID13
Feature activation-6.53e-02

Pos logit contributions
 Worker+0.115
 competitor+0.109
roid+0.108
 logger+0.107
 subordinates+0.107
Neg logit contributions
nown-0.147
Jac-0.137
)-0.137
bay-0.134
cape-0.133
Token Plug
Token ID22689
Feature activation-2.58e-01

Pos logit contributions
nown+0.357
Jac+0.331
)+0.331
bay+0.324
cape+0.320
Neg logit contributions
 Worker-0.322
 competitor-0.305
roid-0.303
 logger-0.300
 subordinates-0.300
Token this
Token ID428
Feature activation+1.91e-02

Pos logit contributions
nown+1.428
Jac+1.326
)+1.325
bay+1.301
cape+1.285
Neg logit contributions
 Worker-1.250
 competitor-1.182
roid-1.177
 logger-1.164
 subordinates-1.164
Token loop
Token ID9052
Feature activation-2.72e-01

Pos logit contributions
 Worker+0.087
 competitor+0.082
roid+0.081
 logger+0.080
 subordinates+0.080
Neg logit contributions
nown-0.112
Jac-0.104
)-0.104
bay-0.102
cape-0.101
Token into
Token ID656
Feature activation-8.38e-01

Pos logit contributions
nown+1.478
)+1.366
Jac+1.355
bay+1.346
cape+1.331
Neg logit contributions
 Worker-1.338
 competitor-1.264
roid-1.259
 subordinates-1.251
 logger-1.241
Token your
Token ID534
Feature activation-3.16e-01

Pos logit contributions
nown+4.370
)+4.068
bay+4.046
Jac+4.019
cape+3.922
Neg logit contributions
 Worker-4.082
roid-3.946
 subordinates-3.882
 competitor-3.845
 refugees-3.806
Token receiver
Token ID9733
Feature activation-2.23e-01

Pos logit contributions
nown+1.939
Jac+1.813
)+1.812
bay+1.788
cape+1.766
Neg logit contributions
 Worker-1.325
roid-1.246
 competitor-1.236
 subordinates-1.224
 logger-1.217
Token and
Token ID290
Feature activation-2.57e-01

Pos logit contributions
nown+1.322
Jac+1.231
)+1.230
bay+1.218
cape+1.202
Neg logit contributions
 Worker-0.984
 competitor-0.924
roid-0.922
 subordinates-0.913
 logger-0.908
Token use
Token ID779
Feature activation-1.75e-01

Pos logit contributions
nown+1.579
)+1.482
Jac+1.471
bay+1.459
cape+1.442
Neg logit contributions
 Worker-1.078
roid-1.010
 competitor-1.010
 subordinates-0.996
 logger-0.991
Token it
Token ID340
Feature activation-1.78e-01

Pos logit contributions
nown+1.010
Jac+0.941
)+0.941
bay+0.925
cape+0.914
Neg logit contributions
 Worker-0.805
 competitor-0.758
roid-0.756
 subordinates-0.747
 logger-0.746
Token
Token ID198
Feature activation-1.05e-02

Pos logit contributions
 Worker+1.058
 competitor+0.986
roid+0.980
 refugees+0.976
 subordinates+0.971
Neg logit contributions
nown-1.164
)-1.069
Jac-1.065
bay-1.044
cape-1.036
Token
Token ID198
Feature activation-1.51e-03

Pos logit contributions
nown+0.059
)+0.054
Jac+0.054
bay+0.053
cape+0.053
Neg logit contributions
 Worker-0.050
 competitor-0.047
roid-0.047
 logger-0.047
 refugees-0.046
TokenIn
Token ID818
Feature activation-1.67e-01

Pos logit contributions
nown+0.007
)+0.006
Jac+0.006
bay+0.006
cape+0.006
Neg logit contributions
 Worker-0.009
 competitor-0.008
roid-0.008
 logger-0.008
 subordinates-0.008
Token Hum
Token ID11054
Feature activation-1.40e-01

Pos logit contributions
nown+0.969
Jac+0.907
)+0.903
bay+0.891
cape+0.880
Neg logit contributions
 Worker-0.757
roid-0.716
 competitor-0.714
 subordinates-0.703
 logger-0.702
Tokenbers
Token ID1213
Feature activation+2.31e-01

Pos logit contributions
nown+0.951
Jac+0.895
)+0.894
bay+0.879
cape+0.872
Neg logit contributions
 Worker-0.508
 competitor-0.470
roid-0.466
 logger-0.461
 subordinates-0.459
Tokenide
Token ID485
Feature activation-8.38e-01

Pos logit contributions
 Worker+1.712
 competitor+1.651
roid+1.633
 logger+1.629
 refugees+1.626
Neg logit contributions
nown-0.692
Jac-0.610
)-0.597
bay-0.556
ffic-0.545
Token a
Token ID257
Feature activation+8.90e-02

Pos logit contributions
nown+4.794
bay+4.572
)+4.501
Jac+4.479
ffic+4.470
Neg logit contributions
 Worker-3.529
roid-3.440
 competitor-3.382
 subordinates-3.355
 logger-3.342
Token mosque
Token ID18575
Feature activation+1.65e-01

Pos logit contributions
 Worker+0.346
 competitor+0.322
roid+0.318
 logger+0.316
 subordinates+0.314
Neg logit contributions
nown-0.582
Jac-0.546
)-0.546
bay-0.536
cape-0.531
Token chairman
Token ID8900
Feature activation+1.37e-01

Pos logit contributions
 Worker+0.769
 competitor+0.727
roid+0.715
 logger+0.714
 refugees+0.709
Neg logit contributions
nown-0.952
)-0.885
Jac-0.883
bay-0.862
cape-0.853
Token described
Token ID3417
Feature activation+1.02e-01

Pos logit contributions
 Worker+0.722
 competitor+0.686
roid+0.679
 logger+0.678
 refugees+0.674
Neg logit contributions
nown-0.704
)-0.647
Jac-0.646
bay-0.630
cape-0.625
Token how
Token ID703
Feature activation+8.04e-02

Pos logit contributions
 Worker+0.466
 competitor+0.439
roid+0.435
 logger+0.433
 subordinates+0.431
Neg logit contributions
nown-0.602
Jac-0.560
)-0.559
bay-0.548
cape-0.543
Token.
Token ID13
Feature activation-2.95e-01

Pos logit contributions
nown+0.164
)+0.150
Jac+0.150
bay+0.147
cape+0.146
Neg logit contributions
 Worker-0.144
 competitor-0.135
roid-0.135
 logger-0.133
 refugees-0.133
Tokentwitter
Token ID6956
Feature activation-2.28e-01

Pos logit contributions
nown+1.408
Jac+1.299
)+1.298
bay+1.273
cape+1.252
Neg logit contributions
 Worker-1.655
 competitor-1.582
roid-1.560
 subordinates-1.559
 logger-1.559
Token.
Token ID13
Feature activation-2.81e-01

Pos logit contributions
nown+1.233
Jac+1.141
)+1.139
bay+1.115
cape+1.103
Neg logit contributions
 Worker-1.142
 competitor-1.082
roid-1.076
 logger-1.067
 subordinates-1.064
Tokencom
Token ID785
Feature activation-3.62e-01

Pos logit contributions
nown+1.035
Jac+0.930
)+0.926
bay+0.899
cape+0.884
Neg logit contributions
 Worker-1.883
 competitor-1.811
roid-1.796
 logger-1.793
 subordinates-1.792
Token/
Token ID14
Feature activation-5.28e-01

Pos logit contributions
nown+1.851
Jac+1.721
)+1.714
bay+1.679
cape+1.659
Neg logit contributions
 Worker-1.901
 competitor-1.812
roid-1.796
 subordinates-1.788
 logger-1.788
Tokenu
Token ID84
Feature activation-8.37e-01

Pos logit contributions
nown+2.430
Jac+2.249
)+2.212
bay+2.184
cape+2.151
Neg logit contributions
 Worker-3.058
 competitor-2.926
 logger-2.900
 subordinates-2.891
roid-2.866
TokenI
Token ID40
Feature activation-5.58e-01

Pos logit contributions
nown+3.663
Jac+3.454
)+3.323
bay+3.299
ffic+3.248
Neg logit contributions
 Worker-4.951
 competitor-4.754
 logger-4.718
 subordinates-4.703
 refugees-4.649
Tokenba
Token ID7012
Feature activation-4.69e-01

Pos logit contributions
nown+2.470
Jac+2.270
)+2.228
bay+2.219
cape+2.171
Neg logit contributions
 Worker-3.320
 competitor-3.189
 logger-3.159
 subordinates-3.153
 refugees-3.127
TokenH
Token ID39
Feature activation-3.83e-01

Pos logit contributions
nown+2.052
Jac+1.895
)+1.867
bay+1.828
cape+1.795
Neg logit contributions
 Worker-2.814
 competitor-2.697
 logger-2.674
 subordinates-2.665
roid-2.653
TokenEx
Token ID3109
Feature activation-5.07e-01

Pos logit contributions
nown+1.735
Jac+1.593
)+1.577
bay+1.551
cape+1.524
Neg logit contributions
 Worker-2.253
 competitor-2.154
 logger-2.132
 subordinates-2.129
roid-2.122
TokenTT
Token ID15751
Feature activation-2.32e-01

Pos logit contributions
nown+2.310
Jac+2.132
)+2.103
bay+2.071
cape+2.039
Neg logit contributions
 Worker-2.955
 competitor-2.826
 logger-2.802
 subordinates-2.796
roid-2.778
Token)
Token ID8
Feature activation-1.39e-01

Pos logit contributions
 Worker+0.812
 competitor+0.764
roid+0.761
 logger+0.753
 refugees+0.751
Neg logit contributions
nown-0.866
Jac-0.789
)-0.774
bay-0.770
cape-0.768
Token
Token ID198
Feature activation-2.14e-01

Pos logit contributions
nown+0.744
Jac+0.689
)+0.688
bay+0.674
cape+0.666
Neg logit contributions
 Worker-0.696
 competitor-0.660
roid-0.656
 logger-0.651
 subordinates-0.649
Token
Token ID198
Feature activation-7.46e-02

Pos logit contributions
nown+1.155
Jac+1.075
)+1.072
bay+1.051
cape+1.038
Neg logit contributions
 Worker-1.062
 competitor-1.009
roid-1.001
 logger-0.995
 subordinates-0.994
Token6
Token ID21
Feature activation-3.95e-02

Pos logit contributions
nown+0.324
Jac+0.295
)+0.294
bay+0.286
cape+0.282
Neg logit contributions
 Worker-0.453
 competitor-0.433
roid-0.431
 logger-0.428
 subordinates-0.427
Token.
Token ID13
Feature activation-2.16e-01

Pos logit contributions
nown+0.221
)+0.202
Jac+0.202
bay+0.198
cape+0.197
Neg logit contributions
 Worker-0.193
 competitor-0.181
roid-0.180
 logger-0.179
 refugees-0.178
Token Rob
Token ID3851
Feature activation-8.35e-01

Pos logit contributions
nown+1.139
Jac+1.063
)+1.057
bay+1.035
cape+1.022
Neg logit contributions
 Worker-1.097
 competitor-1.043
roid-1.037
 logger-1.029
 subordinates-1.029
Token Harley
Token ID27440
Feature activation-2.08e-01

Pos logit contributions
nown+3.988
Jac+3.676
)+3.670
bay+3.665
ffic+3.536
Neg logit contributions
 Worker-4.482
 competitor-4.337
 subordinates-4.329
roid-4.294
 logger-4.275
Token (
Token ID357
Feature activation-5.08e-01

Pos logit contributions
nown+1.156
Jac+1.080
)+1.078
bay+1.057
cape+1.045
Neg logit contributions
 Worker-0.997
 competitor-0.940
roid-0.937
 subordinates-0.928
 logger-0.927
Token152
Token ID17827
Feature activation-6.86e-02

Pos logit contributions
nown+2.249
Jac+2.123
)+2.080
bay+2.053
cape+2.015
Neg logit contributions
 Worker-2.986
 competitor-2.861
 logger-2.844
 subordinates-2.839
roid-2.827
Token)
Token ID8
Feature activation-2.52e-01

Pos logit contributions
nown+0.356
Jac+0.328
)+0.326
bay+0.320
cape+0.317
Neg logit contributions
 Worker-0.359
 competitor-0.340
roid-0.338
 logger-0.336
 subordinates-0.335
Token (
Token ID357
Feature activation-4.02e-01

Pos logit contributions
nown+1.480
Jac+1.384
)+1.381
bay+1.355
cape+1.341
Neg logit contributions
 Worker-1.131
 competitor-1.067
roid-1.061
 logger-1.051
 subordinates-1.049
Token vertex
Token ID37423
Feature activation+7.00e-01

Pos logit contributions
 Worker+2.096
 competitor+2.034
 logger+1.973
 refugees+1.967
 subordinates+1.958
Neg logit contributions
nown-3.722
Jac-3.605
)-3.577
bay-3.456
ffic-3.383
Token.
Token ID764
Feature activation+6.06e-01

Pos logit contributions
 Worker+3.180
 competitor+2.956
 logger+2.952
 Developer+2.914
 shares+2.913
Neg logit contributions
nown-4.126
)-3.889
Jac-3.728
foreseen-3.700
bay-3.675
Token x
Token ID2124
Feature activation+3.20e-01

Pos logit contributions
 Worker+2.883
 competitor+2.789
 logger+2.747
roid+2.730
 refugees+2.689
Neg logit contributions
nown-3.324
Jac-3.194
)-3.184
bay-3.022
cape-2.981
Tokenyz
Token ID45579
Feature activation+1.27e-01

Pos logit contributions
 Worker+1.890
roid+1.815
 competitor+1.806
 subordinates+1.786
 logger+1.785
Neg logit contributions
nown-1.430
)-1.333
Jac-1.307
bay-1.247
cape-1.230
Token +=
Token ID15853
Feature activation+5.63e-01

Pos logit contributions
 Worker+0.905
 competitor+0.871
roid+0.871
 logger+0.867
 subordinates+0.864
Neg logit contributions
nown-0.422
)-0.378
Jac-0.371
bay-0.357
cape-0.344
Token v
Token ID410
Feature activation+6.04e-01

Pos logit contributions
 Worker+2.756
 competitor+2.660
 logger+2.611
 refugees+2.568
 survivor+2.552
Neg logit contributions
nown-3.115
)-2.930
Jac-2.926
bay-2.843
foreseen-2.772
Token.
Token ID764
Feature activation+7.69e-01

Pos logit contributions
 Worker+2.789
roid+2.607
 competitor+2.593
 logger+2.585
 shares+2.549
Neg logit contributions
nown-3.537
)-3.249
Jac-3.164
foreseen-3.123
bay-3.116
Token normal
Token ID3487
Feature activation+6.87e-01

Pos logit contributions
 Worker+1.917
 competitor+1.821
roid+1.778
 logger+1.731
 subordinates+1.728
Neg logit contributions
nown-5.923
Jac-5.743
)-5.734
bay-5.589
cape-5.480
Token *
Token ID1635
Feature activation+4.60e-01

Pos logit contributions
 Worker+2.521
 competitor+2.345
 logger+2.319
roid+2.311
 Developer+2.258
Neg logit contributions
nown-4.614
)-4.347
Jac-4.281
bay-4.231
foreseen-4.176
Token 1
Token ID352
Feature activation+4.57e-01

Pos logit contributions
 Worker+2.518
 competitor+2.406
roid+2.347
 logger+2.339
 refugees+2.337
Neg logit contributions
nown-2.258
Jac-2.085
)-2.079
bay-2.021
cape-2.008
Token ;
Token ID2162
Feature activation+5.50e-01

Pos logit contributions
 Worker+2.227
 competitor+2.092
 logger+2.074
roid+2.061
 refugees+2.040
Neg logit contributions
nown-2.553
)-2.364
Jac-2.332
bay-2.284
cape-2.252
Token gave
Token ID2921
Feature activation-1.96e-02

Pos logit contributions
 Worker+1.032
 competitor+0.970
roid+0.963
 subordinates+0.959
 refugees+0.955
Neg logit contributions
nown-1.235
Jac-1.140
)-1.136
bay-1.110
cape-1.103
Token her
Token ID607
Feature activation+1.89e-01

Pos logit contributions
nown+0.121
Jac+0.113
)+0.113
bay+0.111
cape+0.110
Neg logit contributions
 Worker-0.083
 competitor-0.078
roid-0.078
 logger-0.077
 subordinates-0.076
Token final
Token ID2457
Feature activation+1.48e-01

Pos logit contributions
 Worker+0.787
 competitor+0.742
 subordinates+0.728
roid+0.726
 logger+0.724
Neg logit contributions
nown-1.194
)-1.107
Jac-1.107
bay-1.089
cape-1.075
Token speech
Token ID4046
Feature activation+7.08e-02

Pos logit contributions
 Worker+0.598
 competitor+0.561
roid+0.551
 logger+0.550
 subordinates+0.549
Neg logit contributions
nown-0.950
Jac-0.884
)-0.884
bay-0.871
cape-0.857
Token as
Token ID355
Feature activation+9.98e-02

Pos logit contributions
 Worker+0.323
 competitor+0.305
roid+0.302
 logger+0.300
 subordinates+0.299
Neg logit contributions
nown-0.414
Jac-0.386
)-0.384
bay-0.377
cape-0.373
Token first
Token ID717
Feature activation+5.41e-01

Pos logit contributions
 Worker+0.400
 competitor+0.375
roid+0.367
 logger+0.367
 subordinates+0.366
Neg logit contributions
nown-0.641
Jac-0.599
)-0.599
bay-0.589
cape-0.583
Token lady
Token ID10846
Feature activation-7.45e-02

Pos logit contributions
 Worker+1.162
 competitor+1.039
 refugees+0.991
roid+0.974
 survivor+0.965
Neg logit contributions
nown-4.484
)-4.236
Jac-4.235
cape-4.173
bay-4.171
Token on
Token ID319
Feature activation+1.66e-01

Pos logit contributions
nown+0.427
Jac+0.397
)+0.397
bay+0.389
cape+0.385
Neg logit contributions
 Worker-0.349
 competitor-0.329
roid-0.327
 logger-0.324
 subordinates-0.323
Token Friday
Token ID3217
Feature activation+3.60e-02

Pos logit contributions
 Worker+0.734
 competitor+0.688
roid+0.679
 logger+0.677
 subordinates+0.676
Neg logit contributions
nown-0.997
Jac-0.927
)-0.927
bay-0.909
cape-0.899
Token,
Token ID11
Feature activation+2.40e-01

Pos logit contributions
 Worker+0.160
 competitor+0.151
roid+0.149
 logger+0.148
 subordinates+0.148
Neg logit contributions
nown-0.214
Jac-0.200
)-0.200
bay-0.196
cape-0.194
Token during
Token ID1141
Feature activation-1.93e-02

Pos logit contributions
 Worker+1.092
 competitor+1.027
 logger+1.012
 subordinates+1.012
roid+1.011
Neg logit contributions
nown-1.406
Jac-1.308
)-1.305
bay-1.283
cape-1.270
Tokenmen
Token ID3653
Feature activation+2.79e-01

Pos logit contributions
nown+0.877
Jac+0.820
)+0.819
bay+0.804
cape+0.796
Neg logit contributions
 Worker-0.622
 competitor-0.584
roid-0.579
 logger-0.574
 subordinates-0.572
Token,
Token ID11
Feature activation-9.90e-02

Pos logit contributions
 Worker+1.184
 competitor+1.114
roid+1.101
 refugees+1.097
 logger+1.096
Neg logit contributions
nown-1.719
)-1.609
Jac-1.602
bay-1.572
cape-1.551
Token they
Token ID484
Feature activation-7.46e-02

Pos logit contributions
nown+0.589
Jac+0.552
)+0.547
bay+0.540
cape+0.534
Neg logit contributions
 Worker-0.437
 competitor-0.411
roid-0.411
 logger-0.404
 subordinates-0.402
Token were
Token ID547
Feature activation-1.28e-01

Pos logit contributions
nown+0.421
Jac+0.392
)+0.391
bay+0.384
cape+0.381
Neg logit contributions
 Worker-0.354
 competitor-0.334
roid-0.332
 logger-0.329
 subordinates-0.328
Token cur
Token ID1090
Feature activation+9.78e-02

Pos logit contributions
nown+0.797
Jac+0.752
)+0.747
bay+0.736
cape+0.731
Neg logit contributions
 Worker-0.526
 competitor-0.490
roid-0.489
 logger-0.482
 subordinates-0.479
Tokent
Token ID83
Feature activation+5.72e-01

Pos logit contributions
 Worker+0.484
roid+0.459
 competitor+0.459
 logger+0.452
 subordinates+0.452
Neg logit contributions
nown-0.530
Jac-0.493
)-0.492
bay-0.482
cape-0.477
Tokenailing
Token ID11608
Feature activation+2.31e-01

Pos logit contributions
 Worker+2.277
roid+2.183
 competitor+2.149
 logger+2.110
 refugees+2.105
Neg logit contributions
nown-3.576
Jac-3.360
)-3.342
bay-3.294
cape-3.286
Token holiday
Token ID9912
Feature activation-7.03e-02

Pos logit contributions
 Worker+1.027
 competitor+0.968
roid+0.954
 logger+0.951
 subordinates+0.949
Neg logit contributions
nown-1.383
)-1.291
Jac-1.290
bay-1.264
cape-1.251
Token decorations
Token ID39011
Feature activation-1.43e-01

Pos logit contributions
nown+0.459
Jac+0.432
)+0.429
bay+0.425
cape+0.422
Neg logit contributions
 Worker-0.269
roid-0.250
 competitor-0.249
 logger-0.246
 subordinates-0.245
Token to
Token ID284
Feature activation+4.77e-02

Pos logit contributions
nown+0.795
Jac+0.737
)+0.731
bay+0.724
cape+0.722
Neg logit contributions
 Worker-0.693
roid-0.655
 competitor-0.654
 logger-0.646
 subordinates-0.642
Token match
Token ID2872
Feature activation-6.42e-02

Pos logit contributions
 Worker+0.209
 competitor+0.196
roid+0.195
 logger+0.193
 subordinates+0.193
Neg logit contributions
nown-0.288
Jac-0.269
)-0.268
bay-0.263
cape-0.261
Token407
Token ID30120
Feature activation+4.71e-01

Pos logit contributions
nown+0.413
Jac+0.381
)+0.381
bay+0.372
cape+0.368
Neg logit contributions
 Worker-0.442
 competitor-0.421
roid-0.418
 logger-0.415
 subordinates-0.414
Token x
Token ID2124
Feature activation-8.01e-02

Pos logit contributions
 Worker+2.340
 competitor+2.181
 logger+2.148
roid+2.142
 Developer+2.134
Neg logit contributions
nown-2.633
)-2.373
Jac-2.361
bay-2.337
cape-2.300
Token.
Token ID13
Feature activation-1.69e-03

Pos logit contributions
nown+0.443
)+0.407
Jac+0.407
bay+0.398
cape+0.394
Neg logit contributions
 Worker-0.394
 competitor-0.371
roid-0.370
 logger-0.365
 subordinates-0.364
Tokenx
Token ID87
Feature activation+3.90e-01

Pos logit contributions
nown+0.008
)+0.007
Jac+0.007
bay+0.007
cape+0.007
Neg logit contributions
 Worker-0.010
roid-0.009
 competitor-0.009
 logger-0.009
 subordinates-0.009
Token.
Token ID13
Feature activation+1.36e-01

Pos logit contributions
 Worker+1.745
roid+1.646
 competitor+1.602
 refugees+1.587
 Developer+1.574
Neg logit contributions
nown-2.344
)-2.122
Jac-2.109
bay-2.058
ffic-2.054
Tokenx
Token ID87
Feature activation+4.91e-01

Pos logit contributions
 Worker+0.752
roid+0.721
 competitor+0.712
 refugees+0.696
 logger+0.693
Neg logit contributions
nown-0.665
)-0.608
Jac-0.602
bay-0.581
cape-0.575
Token.
Token ID13
Feature activation+1.54e-01

Pos logit contributions
 Worker+2.124
roid+2.007
 competitor+1.941
 refugees+1.918
 Developer+1.909
Neg logit contributions
nown-2.987
)-2.711
Jac-2.694
ffic-2.634
cape-2.620
Token99
Token ID2079
Feature activation+3.89e-03

Pos logit contributions
 Worker+1.114
 competitor+1.085
roid+1.072
 refugees+1.057
 subordinates+1.053
Neg logit contributions
nown-0.475
)-0.420
Jac-0.409
bay-0.388
ffic-0.386
Token 10
Token ID838
Feature activation+2.15e-02

Pos logit contributions
 Worker+0.023
 competitor+0.022
roid+0.022
 logger+0.022
 subordinates+0.022
Neg logit contributions
nown-0.017
Jac-0.016
)-0.016
bay-0.015
cape-0.015
Token88
Token ID3459
Feature activation+2.56e-02

Pos logit contributions
 Worker+0.122
 competitor+0.116
roid+0.115
 logger+0.114
 subordinates+0.114
Neg logit contributions
nown-0.102
Jac-0.093
)-0.093
bay-0.091
cape-0.090
Token350
Token ID14877
Feature activation+5.48e-01

Pos logit contributions
 Worker+0.142
 competitor+0.136
roid+0.135
 logger+0.134
 subordinates+0.133
Neg logit contributions
nown-0.124
Jac-0.114
)-0.114
bay-0.111
cape-0.109
Token cited
Token ID9181
Feature activation+1.83e-01

Pos logit contributions
 Worker+1.157
 competitor+1.098
roid+1.091
 subordinates+1.085
 logger+1.082
Neg logit contributions
nown-1.373
)-1.276
Jac-1.274
bay-1.247
cape-1.236
Token the
Token ID262
Feature activation-6.25e-02

Pos logit contributions
 Worker+0.783
 competitor+0.736
roid+0.729
 logger+0.724
 subordinates+0.723
Neg logit contributions
nown-1.116
Jac-1.043
)-1.042
bay-1.023
cape-1.013
Token Sessions
Token ID15184
Feature activation+9.01e-02

Pos logit contributions
nown+0.396
Jac+0.371
)+0.370
bay+0.362
cape+0.360
Neg logit contributions
 Worker-0.253
roid-0.236
 competitor-0.235
 logger-0.232
 subordinates-0.229
Token-
Token ID12
Feature activation+1.71e-01

Pos logit contributions
 Worker+0.373
 competitor+0.350
roid+0.347
 logger+0.344
 subordinates+0.343
Neg logit contributions
nown-0.565
Jac-0.529
)-0.528
bay-0.519
cape-0.514
TokenR
Token ID49
Feature activation+4.18e-01

Pos logit contributions
 Worker+0.911
roid+0.870
 competitor+0.869
 subordinates+0.860
 logger+0.855
Neg logit contributions
nown-0.865
)-0.796
Jac-0.789
bay-0.774
cape-0.766
Tokenosen
Token ID5233
Feature activation+6.45e-01

Pos logit contributions
 Worker+1.277
 competitor+1.173
roid+1.170
 logger+1.144
 subordinates+1.143
Neg logit contributions
nown-3.061
Jac-2.896
)-2.889
bay-2.842
cape-2.829
Tokenstein
Token ID5714
Feature activation+4.38e-01

Pos logit contributions
 Worker+2.972
roid+2.861
 competitor+2.798
 subordinates+2.740
 refugees+2.737
Neg logit contributions
nown-3.712
)-3.453
Jac-3.445
bay-3.323
cape-3.305
Token recommendation
Token ID15602
Feature activation+3.32e-01

Pos logit contributions
 Worker+1.672
 competitor+1.577
 subordinates+1.559
roid+1.559
 logger+1.544
Neg logit contributions
nown-2.869
)-2.692
Jac-2.684
bay-2.641
cape-2.613
Token to
Token ID284
Feature activation-3.01e-02

Pos logit contributions
 Worker+1.473
 competitor+1.394
 subordinates+1.376
roid+1.375
 logger+1.372
Neg logit contributions
nown-1.973
Jac-1.843
)-1.841
bay-1.799
cape-1.782
Token explain
Token ID4727
Feature activation-3.33e-01

Pos logit contributions
nown+0.184
Jac+0.172
)+0.171
bay+0.169
cape+0.167
Neg logit contributions
 Worker-0.129
 competitor-0.121
roid-0.120
 logger-0.119
 subordinates-0.118
Token the
Token ID262
Feature activation-1.31e-01

Pos logit contributions
nown+2.058
Jac+1.938
)+1.925
bay+1.898
cape+1.885
Neg logit contributions
 Worker-1.366
roid-1.281
 competitor-1.265
 logger-1.244
 subordinates-1.228
Token
Token ID198
Feature activation-1.29e-02

Pos logit contributions
 Worker+0.406
 competitor+0.380
roid+0.379
 logger+0.374
 refugees+0.372
Neg logit contributions
nown-0.495
)-0.454
Jac-0.453
bay-0.445
cape-0.441
TokenN
Token ID45
Feature activation-1.22e-01

Pos logit contributions
nown+0.059
Jac+0.055
)+0.054
bay+0.053
cape+0.052
Neg logit contributions
 Worker-0.075
 competitor-0.071
roid-0.071
 logger-0.071
 subordinates-0.070
Tokenem
Token ID368
Feature activation-3.14e-01

Pos logit contributions
nown+0.536
Jac+0.489
)+0.487
bay+0.479
cape+0.470
Neg logit contributions
 Worker-0.732
 competitor-0.698
 logger-0.692
roid-0.691
 subordinates-0.690
Tokenov
Token ID709
Feature activation+1.33e-01

Pos logit contributions
nown+1.308
Jac+1.178
)+1.171
bay+1.164
cape+1.129
Neg logit contributions
 Worker-1.958
 competitor-1.886
 subordinates-1.868
 logger-1.867
roid-1.837
Token confirmed
Token ID4999
Feature activation+2.26e-01

Pos logit contributions
 Worker+0.601
 competitor+0.566
roid+0.562
 logger+0.558
 subordinates+0.555
Neg logit contributions
nown-0.782
Jac-0.730
)-0.728
bay-0.715
cape-0.708
Token his
Token ID465
Feature activation+1.05e-03

Pos logit contributions
 Worker+1.003
 competitor+0.944
roid+0.932
 subordinates+0.927
 logger+0.926
Neg logit contributions
nown-1.352
)-1.261
Jac-1.260
bay-1.232
cape-1.224
Token ownership
Token ID9238
Feature activation+1.82e-03

Pos logit contributions
 Worker+0.004
roid+0.004
 competitor+0.004
 logger+0.004
 subordinates+0.004
Neg logit contributions
nown-0.007
Jac-0.006
)-0.006
bay-0.006
cape-0.006
Token of
Token ID286
Feature activation-1.63e-02

Pos logit contributions
 Worker+0.008
 competitor+0.008
roid+0.008
 logger+0.008
 subordinates+0.008
Neg logit contributions
nown-0.011
Jac-0.010
)-0.010
bay-0.010
cape-0.010
Token Olymp
Token ID7001
Feature activation-4.91e-02

Pos logit contributions
nown+0.094
Jac+0.088
)+0.087
bay+0.086
cape+0.085
Neg logit contributions
 Worker-0.075
 competitor-0.071
roid-0.071
 logger-0.070
 subordinates-0.070
Token Oil
Token ID11474
Feature activation-1.76e-01

Pos logit contributions
nown+0.325
Jac+0.305
)+0.304
bay+0.300
cape+0.297
Neg logit contributions
 Worker-0.185
 competitor-0.173
roid-0.172
 logger-0.170
 subordinates-0.169
Token but
Token ID475
Feature activation-2.09e-03

Pos logit contributions
nown+0.996
Jac+0.923
)+0.923
bay+0.907
cape+0.896
Neg logit contributions
 Worker-0.828
roid-0.783
 competitor-0.779
 logger-0.774
 subordinates-0.769
Token issue
Token ID2071
Feature activation-1.61e-01

Pos logit contributions
nown+1.254
Jac+1.176
)+1.175
bay+1.163
cape+1.149
Neg logit contributions
 Worker-0.756
roid-0.700
 competitor-0.697
 logger-0.691
 subordinates-0.689
Token will
Token ID481
Feature activation+4.13e-03

Pos logit contributions
nown+0.893
Jac+0.832
)+0.827
bay+0.817
cape+0.812
Neg logit contributions
 Worker-0.774
roid-0.728
 competitor-0.728
 subordinates-0.718
 logger-0.717
Token face
Token ID1986
Feature activation+1.10e-01

Pos logit contributions
 Worker+0.017
 competitor+0.016
roid+0.016
 logger+0.016
 subordinates+0.015
Neg logit contributions
nown-0.026
Jac-0.024
)-0.024
bay-0.024
cape-0.024
Token a
Token ID257
Feature activation+4.39e-02

Pos logit contributions
 Worker+0.474
 competitor+0.444
roid+0.442
 logger+0.437
 subordinates+0.436
Neg logit contributions
nown-0.674
Jac-0.630
)-0.629
bay-0.618
cape-0.612
Token greater
Token ID3744
Feature activation-7.10e-02

Pos logit contributions
 Worker+0.169
 competitor+0.158
roid+0.157
 logger+0.155
 subordinates+0.155
Neg logit contributions
nown-0.287
Jac-0.270
)-0.269
bay-0.265
cape-0.262
Token burden
Token ID10538
Feature activation+3.33e-02

Pos logit contributions
nown+0.440
Jac+0.412
)+0.412
bay+0.405
cape+0.402
Neg logit contributions
 Worker-0.295
roid-0.274
 subordinates-0.271
 competitor-0.271
 logger-0.268
Token of
Token ID286
Feature activation+2.58e-01

Pos logit contributions
 Worker+0.153
 competitor+0.145
roid+0.144
 logger+0.142
 subordinates+0.142
Neg logit contributions
nown-0.193
Jac-0.179
)-0.179
bay-0.176
cape-0.174
Token proof
Token ID6617
Feature activation-4.95e-02

Pos logit contributions
 Worker+1.048
 competitor+0.987
 logger+0.970
roid+0.970
 subordinates+0.962
Neg logit contributions
nown-1.642
Jac-1.539
)-1.538
bay-1.512
cape-1.496
Token than
Token ID621
Feature activation-5.52e-02

Pos logit contributions
nown+0.290
Jac+0.270
)+0.269
bay+0.265
cape+0.262
Neg logit contributions
 Worker-0.225
 competitor-0.212
roid-0.210
 logger-0.209
 subordinates-0.208
Token most
Token ID749
Feature activation+2.50e-01

Pos logit contributions
nown+0.327
Jac+0.305
)+0.304
bay+0.299
cape+0.296
Neg logit contributions
 Worker-0.246
roid-0.231
 competitor-0.230
 logger-0.228
 subordinates-0.227
Token ballot
Token ID11100
Feature activation+7.24e-02

Pos logit contributions
 Worker+1.065
 competitor+1.007
roid+0.988
 logger+0.986
 refugees+0.983
Neg logit contributions
nown-1.537
)-1.439
Jac-1.438
bay-1.409
cape-1.396
Token at
Token ID379
Feature activation+8.42e-02

Pos logit contributions
 Worker+0.556
 competitor+0.522
roid+0.520
 logger+0.514
 subordinates+0.513
Neg logit contributions
nown-0.715
Jac-0.669
)-0.668
bay-0.655
cape-0.648
Token 3
Token ID513
Feature activation-1.28e-01

Pos logit contributions
 Worker+0.407
 competitor+0.385
roid+0.382
 logger+0.379
 subordinates+0.378
Neg logit contributions
nown-0.469
Jac-0.435
)-0.435
bay-0.426
cape-0.421
Token/
Token ID14
Feature activation+1.76e-02

Pos logit contributions
nown+0.692
Jac+0.641
)+0.640
bay+0.627
cape+0.620
Neg logit contributions
 Worker-0.640
 competitor-0.606
roid-0.603
 logger-0.598
 subordinates-0.597
Token4
Token ID19
Feature activation-2.87e-01

Pos logit contributions
 Worker+0.117
 competitor+0.112
roid+0.112
 logger+0.111
 subordinates+0.111
Neg logit contributions
nown-0.066
Jac-0.059
)-0.059
bay-0.057
cape-0.056
Tokens
Token ID82
Feature activation-5.32e-02

Pos logit contributions
nown+1.705
)+1.604
Jac+1.601
bay+1.583
cape+1.550
Neg logit contributions
 Worker-1.267
 competitor-1.188
roid-1.185
 subordinates-1.172
 logger-1.170
Token.
Token ID13
Feature activation+3.49e-02

Pos logit contributions
nown+0.304
Jac+0.282
)+0.282
bay+0.277
cape+0.274
Neg logit contributions
 Worker-0.250
 competitor-0.236
roid-0.234
 subordinates-0.232
 logger-0.232
Token I
Token ID314
Feature activation+3.55e-02

Pos logit contributions
 Worker+0.177
 competitor+0.168
roid+0.166
 logger+0.165
 subordinates+0.164
Neg logit contributions
nown-0.186
Jac-0.172
)-0.172
bay-0.167
cape-0.166
Token also
Token ID635
Feature activation+2.67e-02

Pos logit contributions
 Worker+0.166
 competitor+0.157
roid+0.156
 logger+0.155
 subordinates+0.154
Neg logit contributions
nown-0.203
Jac-0.189
)-0.188
bay-0.185
cape-0.183
Token recommend
Token ID4313
Feature activation+5.55e-02

Pos logit contributions
 Worker+0.120
 competitor+0.112
roid+0.112
 logger+0.111
 subordinates+0.110
Neg logit contributions
nown-0.158
Jac-0.147
)-0.147
bay-0.144
cape-0.143
Token upgrading
Token ID25485
Feature activation+1.16e-01

Pos logit contributions
 Worker+0.232
 competitor+0.218
roid+0.215
 logger+0.214
 subordinates+0.213
Neg logit contributions
nown-0.347
Jac-0.324
)-0.323
bay-0.318
cape-0.315
Token to
Token ID284
Feature activation-7.37e-02

Pos logit contributions
 Worker+0.545
 competitor+0.516
roid+0.511
 logger+0.510
 subordinates+0.503
Neg logit contributions
nown-0.671
Jac-0.622
)-0.619
bay-0.603
cape-0.601
TokenBlack
Token ID9915
Feature activation-1.05e-01

Pos logit contributions
nown+0.200
Jac+0.183
)+0.183
bay+0.178
cape+0.176
Neg logit contributions
 Worker-0.228
 competitor-0.217
roid-0.216
 logger-0.215
 subordinates-0.214
Tokenstone
Token ID6440
Feature activation+1.71e-01

Pos logit contributions
nown+0.586
Jac+0.545
)+0.544
bay+0.534
cape+0.528
Neg logit contributions
 Worker-0.503
 competitor-0.475
roid-0.472
 logger-0.468
 subordinates-0.467
Token bought
Token ID5839
Feature activation+9.36e-02

Pos logit contributions
 Worker+0.818
 competitor+0.776
roid+0.765
 subordinates+0.761
 logger+0.759
Neg logit contributions
nown-0.967
Jac-0.897
)-0.893
bay-0.877
cape-0.867
Token Sea
Token ID6896
Feature activation-6.80e-02

Pos logit contributions
 Worker+0.434
 competitor+0.410
roid+0.406
 logger+0.404
 subordinates+0.402
Neg logit contributions
nown-0.540
Jac-0.502
)-0.501
bay-0.492
cape-0.487
TokenWorld
Token ID10603
Feature activation-2.67e-01

Pos logit contributions
nown+0.374
Jac+0.347
)+0.347
bay+0.340
cape+0.336
Neg logit contributions
 Worker-0.333
 competitor-0.315
roid-0.313
 logger-0.311
 subordinates-0.310
Token for
Token ID329
Feature activation+2.24e-01

Pos logit contributions
nown+1.422
Jac+1.315
)+1.314
bay+1.290
cape+1.284
Neg logit contributions
 Worker-1.345
roid-1.273
 competitor-1.262
 subordinates-1.259
 logger-1.259
Token $
Token ID720
Feature activation-1.85e-01

Pos logit contributions
 Worker+1.135
 competitor+1.079
roid+1.067
 logger+1.062
 subordinates+1.059
Neg logit contributions
nown-1.201
Jac-1.110
)-1.105
bay-1.086
cape-1.072
Token2
Token ID17
Feature activation-2.40e-01

Pos logit contributions
nown+0.772
Jac+0.702
)+0.699
bay+0.680
cape+0.671
Neg logit contributions
 Worker-1.146
 competitor-1.098
roid-1.091
 logger-1.087
 subordinates-1.085
Token.
Token ID13
Feature activation-1.34e-01

Pos logit contributions
nown+1.333
Jac+1.242
)+1.241
bay+1.219
cape+1.204
Neg logit contributions
 Worker-1.161
 competitor-1.095
roid-1.090
 logger-1.079
 subordinates-1.078
Token3
Token ID18
Feature activation-4.43e-02

Pos logit contributions
nown+0.539
Jac+0.485
)+0.484
bay+0.470
cape+0.463
Neg logit contributions
 Worker-0.852
 competitor-0.816
roid-0.812
 logger-0.807
 subordinates-0.806
Token billion
Token ID2997
Feature activation-1.99e-01

Pos logit contributions
nown+0.277
Jac+0.259
)+0.258
bay+0.254
cape+0.251
Neg logit contributions
 Worker-0.185
 competitor-0.173
roid-0.172
 logger-0.170
 subordinates-0.170
Token AFL
Token ID25363
Feature activation-1.03e-01

Pos logit contributions
 Worker+0.280
 competitor+0.264
 logger+0.260
roid+0.259
 subordinates+0.257
Neg logit contributions
nown-0.396
)-0.369
Jac-0.369
bay-0.362
cape-0.359
Token's
Token ID338
Feature activation+1.61e-01

Pos logit contributions
nown+0.650
Jac+0.608
)+0.607
bay+0.596
cape+0.591
Neg logit contributions
 Worker-0.427
 competitor-0.400
roid-0.397
 logger-0.393
 subordinates-0.392
Token longest
Token ID14069
Feature activation+2.14e-02

Pos logit contributions
 Worker+0.738
 competitor+0.698
 logger+0.687
roid+0.684
 subordinates+0.680
Neg logit contributions
nown-0.944
Jac-0.875
)-0.874
bay-0.858
cape-0.851
Token active
Token ID4075
Feature activation+3.15e-01

Pos logit contributions
 Worker+0.092
 competitor+0.086
roid+0.085
 logger+0.085
 subordinates+0.084
Neg logit contributions
nown-0.131
Jac-0.122
)-0.122
bay-0.120
cape-0.119
Token streak
Token ID15113
Feature activation+8.82e-02

Pos logit contributions
 Worker+1.288
 competitor+1.207
 logger+1.192
roid+1.170
 refugees+1.166
Neg logit contributions
nown-2.018
Jac-1.878
)-1.875
bay-1.839
cape-1.828
Token of
Token ID286
Feature activation+2.99e-01

Pos logit contributions
 Worker+0.401
 competitor+0.379
roid+0.374
 logger+0.373
 subordinates+0.371
Neg logit contributions
nown-0.518
Jac-0.482
)-0.481
bay-0.472
cape-0.467
Token games
Token ID1830
Feature activation+1.46e-01

Pos logit contributions
 Worker+1.447
 competitor+1.384
 logger+1.362
 subordinates+1.356
 refugees+1.352
Neg logit contributions
nown-1.688
)-1.546
Jac-1.546
bay-1.520
cape-1.499
Token at
Token ID379
Feature activation+7.26e-02

Pos logit contributions
 Worker+0.732
 competitor+0.695
 logger+0.689
roid+0.687
 subordinates+0.683
Neg logit contributions
nown-0.789
Jac-0.729
)-0.728
bay-0.714
cape-0.702
Token 130
Token ID11323
Feature activation-2.50e-01

Pos logit contributions
 Worker+0.386
 competitor+0.368
 logger+0.364
roid+0.363
 subordinates+0.362
Neg logit contributions
nown-0.372
Jac-0.341
)-0.341
bay-0.333
cape-0.329
Token,
Token ID11
Feature activation-2.95e-01

Pos logit contributions
nown+1.496
Jac+1.397
)+1.396
bay+1.373
cape+1.360
Neg logit contributions
 Worker-1.093
 competitor-1.026
roid-1.021
 logger-1.008
 subordinates-1.007
Token were
Token ID547
Feature activation-1.11e-01

Pos logit contributions
nown+1.644
Jac+1.529
)+1.523
bay+1.500
cape+1.483
Neg logit contributions
 Worker-1.421
 competitor-1.340
roid-1.336
 subordinates-1.319
 logger-1.317
TokenThe
Token ID464
Feature activation+2.63e-02

Pos logit contributions
nown+0.452
)+0.412
Jac+0.411
bay+0.401
cape+0.396
Neg logit contributions
 Worker-0.559
 competitor-0.534
roid-0.532
 logger-0.527
 subordinates-0.526
Token city
Token ID1748
Feature activation-7.22e-02

Pos logit contributions
 Worker+0.112
 competitor+0.105
roid+0.104
 logger+0.103
 subordinates+0.103
Neg logit contributions
nown-0.162
)-0.152
Jac-0.152
bay-0.149
cape-0.147
Token is
Token ID318
Feature activation-6.01e-02

Pos logit contributions
nown+0.423
Jac+0.394
)+0.393
bay+0.386
cape+0.383
Neg logit contributions
 Worker-0.328
 competitor-0.309
roid-0.307
 logger-0.304
 subordinates-0.303
Token in
Token ID287
Feature activation-7.99e-02

Pos logit contributions
nown+0.358
)+0.334
Jac+0.334
bay+0.326
cape+0.323
Neg logit contributions
 Worker-0.268
 competitor-0.253
roid-0.250
 logger-0.249
 subordinates-0.248
Token preliminary
Token ID15223
Feature activation-3.64e-01

Pos logit contributions
nown+0.520
Jac+0.488
)+0.487
bay+0.479
cape+0.475
Neg logit contributions
 Worker-0.312
 competitor-0.290
roid-0.288
 logger-0.284
 subordinates-0.284
Token talks
Token ID6130
Feature activation-1.98e-01

Pos logit contributions
nown+2.490
Jac+2.364
)+2.340
bay+2.334
cape+2.318
Neg logit contributions
 Worker-1.248
roid-1.138
 competitor-1.124
 subordinates-1.119
 logger-1.106
Token with
Token ID351
Feature activation-2.13e-01

Pos logit contributions
nown+1.066
Jac+0.987
)+0.975
bay+0.964
cape+0.962
Neg logit contributions
 Worker-0.993
roid-0.935
 competitor-0.935
 subordinates-0.922
 logger-0.920
Token Hudson
Token ID19995
Feature activation-1.02e-01

Pos logit contributions
nown+1.324
Jac+1.242
)+1.236
bay+1.217
cape+1.207
Neg logit contributions
 Worker-0.892
 competitor-0.833
roid-0.832
 logger-0.821
 subordinates-0.819
Token County
Token ID3418
Feature activation-5.86e-02

Pos logit contributions
nown+0.622
Jac+0.580
)+0.580
bay+0.568
cape+0.563
Neg logit contributions
 Worker-0.443
 competitor-0.416
roid-0.412
 logger-0.409
 subordinates-0.407
Token officials
Token ID2828
Feature activation-1.32e-02

Pos logit contributions
nown+0.342
Jac+0.319
)+0.318
bay+0.313
cape+0.310
Neg logit contributions
 Worker-0.266
 competitor-0.251
roid-0.249
 logger-0.247
 subordinates-0.247
Token to
Token ID284
Feature activation+2.70e-01

Pos logit contributions
nown+0.076
Jac+0.071
)+0.071
bay+0.069
cape+0.069
Neg logit contributions
 Worker-0.062
 competitor-0.058
roid-0.058
 logger-0.057
 subordinates-0.057
Token more
Token ID517
Feature activation-3.28e-01

Pos logit contributions
 Worker+0.430
 competitor+0.404
roid+0.399
 subordinates+0.396
 logger+0.396
Neg logit contributions
nown-0.653
Jac-0.611
)-0.611
bay-0.600
cape-0.594
Token clueless
Token ID44865
Feature activation-4.31e-02

Pos logit contributions
nown+2.076
Jac+1.949
)+1.943
bay+1.913
cape+1.897
Neg logit contributions
 Worker-1.323
roid-1.231
 competitor-1.220
 logger-1.197
 subordinates-1.192
Token about
Token ID546
Feature activation+1.93e-01

Pos logit contributions
nown+0.255
Jac+0.237
)+0.237
bay+0.233
cape+0.231
Neg logit contributions
 Worker-0.194
 competitor-0.182
roid-0.181
 logger-0.179
 subordinates-0.178
Token the
Token ID262
Feature activation-1.33e-02

Pos logit contributions
 Worker+0.795
 competitor+0.747
roid+0.737
 subordinates+0.733
 logger+0.732
Neg logit contributions
nown-1.211
Jac-1.134
)-1.134
bay-1.114
cape-1.102
Token potential
Token ID2785
Feature activation+1.48e-01

Pos logit contributions
nown+0.083
Jac+0.078
)+0.078
bay+0.076
cape+0.076
Neg logit contributions
 Worker-0.055
 competitor-0.051
roid-0.051
 logger-0.051
 subordinates-0.050
Token fallout
Token ID27707
Feature activation-8.71e-02

Pos logit contributions
 Worker+0.649
 competitor+0.613
roid+0.605
 logger+0.601
 subordinates+0.601
Neg logit contributions
nown-0.887
Jac-0.829
)-0.826
bay-0.813
cape-0.802
Token.
Token ID13
Feature activation-4.71e-02

Pos logit contributions
nown+0.476
Jac+0.439
)+0.439
bay+0.431
cape+0.428
Neg logit contributions
 Worker-0.431
 competitor-0.407
roid-0.406
 logger-0.401
 subordinates-0.400
Token Those
Token ID5845
Feature activation-1.83e-01

Pos logit contributions
nown+0.248
Jac+0.228
)+0.228
bay+0.223
cape+0.221
Neg logit contributions
 Worker-0.244
 competitor-0.230
roid-0.229
 logger-0.227
 subordinates-0.227
Token around
Token ID1088
Feature activation-9.71e-02

Pos logit contributions
nown+1.115
Jac+1.043
)+1.040
bay+1.020
cape+1.011
Neg logit contributions
 Worker-0.783
 competitor-0.733
roid-0.731
 logger-0.720
 subordinates-0.716
Token him
Token ID683
Feature activation-1.20e-01

Pos logit contributions
nown+0.652
Jac+0.613
)+0.612
bay+0.602
cape+0.597
Neg logit contributions
 Worker-0.359
 competitor-0.333
roid-0.330
 logger-0.327
 subordinates-0.325
Token say
Token ID910
Feature activation+1.88e-01

Pos logit contributions
nown+0.671
Jac+0.624
)+0.623
bay+0.610
cape+0.603
Neg logit contributions
 Worker-0.578
 competitor-0.547
roid-0.543
 logger-0.539
 subordinates-0.538
Token the
Token ID262
Feature activation-5.71e-03

Pos logit contributions
 Worker+0.476
 competitor+0.451
roid+0.443
 logger+0.442
 subordinates+0.439
Neg logit contributions
nown-0.634
Jac-0.591
)-0.590
bay-0.577
cape-0.573
Token F
Token ID376
Feature activation+4.51e-02

Pos logit contributions
nown+0.035
Jac+0.032
)+0.032
bay+0.032
cape+0.031
Neg logit contributions
 Worker-0.025
 competitor-0.023
roid-0.023
 logger-0.023
 subordinates-0.023
Token-
Token ID12
Feature activation-1.34e-01

Pos logit contributions
 Worker+0.212
 competitor+0.201
roid+0.199
 logger+0.196
 refugees+0.195
Neg logit contributions
nown-0.259
)-0.239
Jac-0.239
bay-0.233
cape-0.232
Token150
Token ID8628
Feature activation+2.90e-02

Pos logit contributions
nown+0.543
)+0.488
Jac+0.487
bay+0.471
cape+0.464
Neg logit contributions
 Worker-0.850
 competitor-0.817
roid-0.811
 logger-0.806
 subordinates-0.804
Token lineup
Token ID12750
Feature activation+1.46e-01

Pos logit contributions
 Worker+0.125
 competitor+0.117
roid+0.117
 logger+0.116
 subordinates+0.116
Neg logit contributions
nown-0.176
Jac-0.164
)-0.164
bay-0.161
cape-0.160
Token.
Token ID13
Feature activation-1.17e-01

Pos logit contributions
 Worker+0.695
 competitor+0.656
roid+0.652
 logger+0.646
 subordinates+0.645
Neg logit contributions
nown-0.821
Jac-0.763
)-0.762
bay-0.747
cape-0.740
Token It
Token ID632
Feature activation-2.13e-02

Pos logit contributions
nown+0.636
Jac+0.589
)+0.588
bay+0.576
cape+0.569
Neg logit contributions
 Worker-0.587
 competitor-0.556
roid-0.552
 logger-0.548
 subordinates-0.547
Token will
Token ID481
Feature activation-1.25e-02

Pos logit contributions
nown+0.129
Jac+0.122
)+0.121
bay+0.120
cape+0.118
Neg logit contributions
 Worker-0.091
 competitor-0.085
roid-0.084
 logger-0.083
 refugees-0.082
Token be
Token ID307
Feature activation-1.18e-01

Pos logit contributions
nown+0.076
)+0.072
Jac+0.071
bay+0.070
cape+0.070
Neg logit contributions
 Worker-0.054
roid-0.050
 competitor-0.050
 subordinates-0.049
 logger-0.049
Token paired
Token ID20312
Feature activation-9.29e-05

Pos logit contributions
nown+0.775
Jac+0.732
)+0.731
bay+0.721
cape+0.712
Neg logit contributions
 Worker-0.441
roid-0.408
 competitor-0.405
 logger-0.401
 subordinates-0.401
Token with
Token ID351
Feature activation-2.06e-01

Pos logit contributions
nown+0.001
Jac+0.000
)+0.000
bay+0.000
cape+0.000
Neg logit contributions
 Worker-0.000
 competitor-0.000
roid-0.000
 logger-0.000
 subordinates-0.000
Tokenapp
Token ID1324
Feature activation-6.19e-02

Pos logit contributions
nown+0.653
)+0.601
Jac+0.598
bay+0.593
cape+0.586
Neg logit contributions
 Worker-0.613
 competitor-0.583
 subordinates-0.580
 logger-0.578
roid-0.574
Tokend
Token ID67
Feature activation-5.56e-02

Pos logit contributions
nown+0.325
Jac+0.301
)+0.300
bay+0.294
cape+0.290
Neg logit contributions
 Worker-0.318
 competitor-0.302
roid-0.300
 logger-0.298
 subordinates-0.298
Token great
Token ID1049
Feature activation-4.51e-02

Pos logit contributions
nown+0.349
Jac+0.327
)+0.326
bay+0.321
cape+0.318
Neg logit contributions
 Worker-0.229
 competitor-0.214
roid-0.213
 logger-0.211
 subordinates-0.210
Token is
Token ID318
Feature activation+1.13e-01

Pos logit contributions
nown+0.269
Jac+0.251
)+0.251
bay+0.246
cape+0.244
Neg logit contributions
 Worker-0.200
 competitor-0.188
roid-0.187
 logger-0.185
 subordinates-0.185
Token you
Token ID345
Feature activation+5.40e-02

Pos logit contributions
 Worker+0.492
 competitor+0.463
roid+0.458
 logger+0.454
 subordinates+0.452
Neg logit contributions
nown-0.683
Jac-0.637
)-0.636
bay-0.625
cape-0.618
Token -
Token ID532
Feature activation-1.73e-01

Pos logit contributions
 Worker+0.236
 competitor+0.222
roid+0.220
 logger+0.218
 subordinates+0.218
Neg logit contributions
nown-0.327
Jac-0.305
)-0.304
bay-0.299
cape-0.296
Token the
Token ID262
Feature activation+3.76e-02

Pos logit contributions
nown+1.014
Jac+0.946
)+0.944
bay+0.924
cape+0.915
Neg logit contributions
 Worker-0.786
 competitor-0.740
roid-0.736
 logger-0.730
 subordinates-0.728
Token user
Token ID2836
Feature activation-1.11e-01

Pos logit contributions
 Worker+0.145
 competitor+0.135
roid+0.133
 logger+0.132
 subordinates+0.131
Neg logit contributions
nown-0.248
Jac-0.232
)-0.231
bay-0.228
cape-0.226
Token -
Token ID532
Feature activation-1.55e-01

Pos logit contributions
nown+0.595
Jac+0.552
)+0.550
bay+0.540
cape+0.534
Neg logit contributions
 Worker-0.555
 competitor-0.526
roid-0.523
 logger-0.519
 subordinates-0.518
Token and
Token ID290
Feature activation+1.27e-01

Pos logit contributions
nown+0.930
Jac+0.869
)+0.867
bay+0.852
cape+0.844
Neg logit contributions
 Worker-0.679
 competitor-0.638
roid-0.633
 logger-0.628
 subordinates-0.626
Token the
Token ID262
Feature activation+5.16e-02

Pos logit contributions
 Worker+0.586
 competitor+0.553
roid+0.549
 logger+0.544
 subordinates+0.543
Neg logit contributions
nown-0.732
Jac-0.681
)-0.680
bay-0.667
cape-0.661
TokenDo
Token ID5211
Feature activation-4.89e-02

Pos logit contributions
nown+1.457
Jac+1.358
)+1.340
bay+1.310
cape+1.292
Neg logit contributions
 Worker-1.858
 competitor-1.765
 logger-1.763
roid-1.750
 subordinates-1.750
Token not
Token ID407
Feature activation-2.67e-01

Pos logit contributions
nown+0.308
Jac+0.289
)+0.288
bay+0.283
cape+0.281
Neg logit contributions
 Worker-0.200
 competitor-0.188
roid-0.186
 logger-0.184
 subordinates-0.184
Token use
Token ID779
Feature activation-2.94e-01

Pos logit contributions
nown+1.676
)+1.579
Jac+1.571
bay+1.547
cape+1.534
Neg logit contributions
 Worker-1.083
 competitor-1.002
roid-1.000
 logger-0.988
 subordinates-0.985
Token social
Token ID1919
Feature activation-2.58e-01

Pos logit contributions
nown+1.827
Jac+1.722
)+1.718
bay+1.688
cape+1.675
Neg logit contributions
 Worker-1.209
roid-1.131
 competitor-1.113
 logger-1.105
 subordinates-1.100
Token media
Token ID2056
Feature activation-5.09e-01

Pos logit contributions
nown+1.556
Jac+1.453
)+1.451
bay+1.429
cape+1.412
Neg logit contributions
 Worker-1.117
 competitor-1.046
roid-1.045
 logger-1.028
 subordinates-1.027
Token to
Token ID284
Feature activation-8.67e-02

Pos logit contributions
nown+2.660
Jac+2.460
)+2.444
bay+2.425
ffic+2.402
Neg logit contributions
 Worker-2.567
roid-2.435
 competitor-2.402
 logger-2.381
 subordinates-2.364
Token randomly
Token ID15456
Feature activation+1.60e-01

Pos logit contributions
nown+0.527
Jac+0.492
)+0.492
bay+0.483
cape+0.479
Neg logit contributions
 Worker-0.374
 competitor-0.349
roid-0.348
 logger-0.344
 subordinates-0.343
Token criticize
Token ID24628
Feature activation-1.06e-01

Pos logit contributions
 Worker+0.672
 competitor+0.636
roid+0.629
 logger+0.625
 subordinates+0.621
Neg logit contributions
nown-0.996
Jac-0.936
)-0.932
bay-0.912
cape-0.903
Token a
Token ID257
Feature activation-1.12e-01

Pos logit contributions
nown+0.653
Jac+0.615
)+0.612
bay+0.600
cape+0.596
Neg logit contributions
 Worker-0.448
roid-0.418
 competitor-0.413
 logger-0.413
 subordinates-0.406
Token contemporary
Token ID11811
Feature activation-1.84e-01

Pos logit contributions
nown+0.785
Jac+0.747
)+0.741
bay+0.729
cape+0.727
Neg logit contributions
 Worker-0.371
roid-0.343
 logger-0.332
 subordinates-0.332
 refugees-0.329
Token (
Token ID357
Feature activation-4.37e-02

Pos logit contributions
nown+1.439
Jac+1.373
)+1.365
bay+1.345
cape+1.342
Neg logit contributions
 Worker-0.459
roid-0.413
 subordinates-0.399
 logger-0.398
 refugees-0.392
Token these
Token ID777
Feature activation-2.46e-01

Pos logit contributions
nown+0.311
)+0.292
Jac+0.291
bay+0.286
cape+0.283
Neg logit contributions
 Worker-0.239
roid-0.225
 competitor-0.224
 logger-0.221
 subordinates-0.220
Token sites
Token ID5043
Feature activation+5.69e-02

Pos logit contributions
nown+1.505
)+1.427
Jac+1.416
bay+1.395
cape+1.387
Neg logit contributions
 Worker-1.026
roid-0.966
 competitor-0.954
 subordinates-0.935
 logger-0.934
Token in
Token ID287
Feature activation+1.38e-01

Pos logit contributions
 Worker+0.229
 competitor+0.214
roid+0.212
 logger+0.210
 subordinates+0.210
Neg logit contributions
nown-0.363
Jac-0.340
)-0.340
bay-0.334
cape-0.331
Token Hum
Token ID11054
Feature activation+2.69e-02

Pos logit contributions
 Worker+0.592
 competitor+0.556
roid+0.550
 logger+0.547
 subordinates+0.545
Neg logit contributions
nown-0.847
Jac-0.791
)-0.790
bay-0.776
cape-0.769
Tokenbers
Token ID1213
Feature activation-1.87e-01

Pos logit contributions
 Worker+0.093
 competitor+0.086
roid+0.085
 logger+0.084
 subordinates+0.084
Neg logit contributions
nown-0.186
Jac-0.176
)-0.175
bay-0.172
cape-0.171
Tokenide
Token ID485
Feature activation-6.38e-01

Pos logit contributions
nown+0.513
Jac+0.439
)+0.437
bay+0.418
cape+0.408
Neg logit contributions
 Worker-1.435
 competitor-1.386
roid-1.380
 logger-1.373
 subordinates-1.372
Token and
Token ID290
Feature activation+5.15e-02

Pos logit contributions
nown+3.525
bay+3.328
)+3.302
Jac+3.263
ffic+3.260
Neg logit contributions
 Worker-2.895
 competitor-2.810
roid-2.792
 logger-2.789
 subordinates-2.770
Token we
Token ID356
Feature activation+7.07e-02

Pos logit contributions
 Worker+0.239
 competitor+0.225
roid+0.224
 logger+0.222
 subordinates+0.222
Neg logit contributions
nown-0.296
Jac-0.276
)-0.276
bay-0.270
cape-0.268
Token will
Token ID481
Feature activation+2.41e-01

Pos logit contributions
 Worker+0.337
 competitor+0.319
roid+0.317
 logger+0.314
 subordinates+0.313
Neg logit contributions
nown-0.399
Jac-0.370
)-0.369
bay-0.362
cape-0.358
Token take
Token ID1011
Feature activation-2.09e-02

Pos logit contributions
 Worker+0.984
 competitor+0.920
roid+0.912
 logger+0.904
 subordinates+0.901
Neg logit contributions
nown-1.525
Jac-1.431
)-1.423
bay-1.397
cape-1.388
Token action
Token ID2223
Feature activation+3.63e-02

Pos logit contributions
nown+0.133
Jac+0.125
)+0.125
bay+0.123
cape+0.122
Neg logit contributions
 Worker-0.083
roid-0.078
 competitor-0.077
 logger-0.076
 subordinates-0.076
Token
Token ID119
Feature activation-2.67e-01

Pos logit contributions
nown+1.575
Jac+1.497
)+1.492
bay+1.474
cape+1.461
Neg logit contributions
 Worker-0.553
 competitor-0.496
roid-0.492
 logger-0.488
 subordinates-0.479
Token 0
Token ID657
Feature activation-3.91e-01

Pos logit contributions
nown+1.318
Jac+1.214
)+1.210
bay+1.183
cape+1.169
Neg logit contributions
 Worker-1.460
 competitor-1.387
roid-1.378
 logger-1.370
 subordinates-1.366
Token and
Token ID290
Feature activation-1.02e-01

Pos logit contributions
nown+2.096
Jac+1.954
)+1.932
bay+1.906
cape+1.889
Neg logit contributions
 Worker-1.947
 competitor-1.835
roid-1.820
 logger-1.804
 subordinates-1.794
Token the
Token ID262
Feature activation-7.40e-02

Pos logit contributions
nown+0.561
Jac+0.521
)+0.520
bay+0.509
cape+0.504
Neg logit contributions
 Worker-0.501
 competitor-0.474
roid-0.471
 logger-0.467
 subordinates-0.466
Token sum
Token ID2160
Feature activation-3.67e-01

Pos logit contributions
nown+0.455
Jac+0.426
)+0.425
bay+0.418
cape+0.414
Neg logit contributions
 Worker-0.313
 competitor-0.293
roid-0.292
 logger-0.288
 subordinates-0.288
Token f
Token ID277
Feature activation-6.53e-01

Pos logit contributions
nown+2.087
Jac+1.947
)+1.938
bay+1.904
cape+1.891
Neg logit contributions
 Worker-1.719
 competitor-1.610
 logger-1.586
roid-1.585
 subordinates-1.582
Token of
Token ID286
Feature activation-1.58e-01

Pos logit contributions
nown+3.516
Jac+3.313
)+3.266
bay+3.200
cape+3.175
Neg logit contributions
 Worker-3.183
 competitor-2.982
 logger-2.949
 subordinates-2.925
roid-2.924
Token all
Token ID477
Feature activation-2.24e-01

Pos logit contributions
nown+0.906
Jac+0.844
)+0.841
bay+0.826
cape+0.818
Neg logit contributions
 Worker-0.741
 competitor-0.697
roid-0.694
 logger-0.687
 subordinates-0.685
Token un
Token ID555
Feature activation-2.25e-01

Pos logit contributions
nown+1.325
Jac+1.243
)+1.239
bay+1.215
cape+1.205
Neg logit contributions
 Worker-0.997
roid-0.930
 competitor-0.930
 logger-0.914
 subordinates-0.912
Tokenam
Token ID321
Feature activation+9.44e-02

Pos logit contributions
nown+1.128
Jac+1.043
)+1.035
bay+1.019
cape+1.006
Neg logit contributions
 Worker-1.213
 competitor-1.155
roid-1.145
 logger-1.140
 subordinates-1.134
Tokenpl
Token ID489
Feature activation-7.74e-02

Pos logit contributions
 Worker+0.433
 competitor+0.408
roid+0.406
 logger+0.402
 subordinates+0.401
Neg logit contributions
nown-0.549
)-0.512
Jac-0.511
bay-0.500
cape-0.496
Token $
Token ID720
Feature activation-3.69e-01

Pos logit contributions
nown+1.644
Jac+1.527
)+1.525
bay+1.486
cape+1.471
Neg logit contributions
 Worker-1.773
 competitor-1.697
roid-1.689
 subordinates-1.683
 logger-1.676
Tokenfont
Token ID10331
Feature activation-3.39e-01

Pos logit contributions
nown+1.499
Jac+1.357
)+1.348
bay+1.319
cape+1.300
Neg logit contributions
 Worker-2.337
 competitor-2.238
roid-2.221
 logger-2.219
 subordinates-2.218
Token2
Token ID17
Feature activation+5.16e-03

Pos logit contributions
nown+1.163
)+1.042
Jac+1.038
bay+1.006
cape+0.992
Neg logit contributions
 Worker-2.353
 competitor-2.264
 logger-2.252
 subordinates-2.247
roid-2.240
Token)
Token ID8
Feature activation-5.89e-02

Pos logit contributions
 Worker+0.026
 competitor+0.024
roid+0.024
 subordinates+0.024
 logger+0.024
Neg logit contributions
nown-0.028
)-0.026
Jac-0.026
bay-0.025
cape-0.025
Token $
Token ID720
Feature activation-4.30e-01

Pos logit contributions
nown+0.320
Jac+0.298
)+0.297
bay+0.290
cape+0.288
Neg logit contributions
 Worker-0.289
 competitor-0.275
roid-0.275
 subordinates-0.272
 logger-0.270
Tokenrect
Token ID2554
Feature activation-5.58e-01

Pos logit contributions
nown+2.281
Jac+2.123
)+2.093
bay+2.075
cape+2.060
Neg logit contributions
 Worker-2.184
 competitor-2.072
 logger-2.054
 subordinates-2.050
 refugees-2.023
Token2
Token ID17
Feature activation-2.77e-01

Pos logit contributions
nown+1.809
Jac+1.612
)+1.601
cape+1.543
bay+1.540
Neg logit contributions
 Worker-3.965
 competitor-3.815
 logger-3.789
 subordinates-3.789
roid-3.771
Token =
Token ID796
Feature activation+1.70e-01

Pos logit contributions
nown+1.095
Jac+0.999
)+0.993
bay+0.964
cape+0.952
Neg logit contributions
 Worker-1.778
 competitor-1.712
roid-1.696
 logger-1.694
 subordinates-1.694
Token New
Token ID968
Feature activation+6.41e-02

Pos logit contributions
 Worker+0.570
 competitor+0.514
roid+0.508
 logger+0.500
 refugees+0.498
Neg logit contributions
nown-1.212
Jac-1.137
)-1.135
bay-1.124
ffic-1.109
Token-
Token ID12
Feature activation-2.10e-01

Pos logit contributions
 Worker+0.302
 competitor+0.282
roid+0.282
 logger+0.278
 refugees+0.277
Neg logit contributions
nown-0.369
)-0.341
Jac-0.339
bay-0.333
cape-0.329
TokenObject
Token ID10267
Feature activation-4.93e-02

Pos logit contributions
nown+0.953
Jac+0.894
)+0.879
bay+0.857
cape+0.853
Neg logit contributions
 Worker-1.211
 competitor-1.166
 subordinates-1.157
 logger-1.152
roid-1.149
Tokenank
Token ID962
Feature activation-3.69e-01

Pos logit contributions
nown+2.391
Jac+2.162
)+2.152
bay+2.117
cape+2.062
Neg logit contributions
 Worker-3.514
 competitor-3.372
 logger-3.335
 subordinates-3.323
roid-3.312
Token_
Token ID62
Feature activation-4.08e-01

Pos logit contributions
nown+2.156
)+2.001
Jac+2.000
bay+1.974
cape+1.950
Neg logit contributions
 Worker-1.695
 competitor-1.597
 subordinates-1.574
 logger-1.573
roid-1.559
Tokenk
Token ID74
Feature activation-1.65e-01

Pos logit contributions
nown+2.033
Jac+1.873
)+1.866
bay+1.841
cape+1.812
Neg logit contributions
 Worker-2.218
 competitor-2.107
 subordinates-2.081
 logger-2.081
roid-2.075
Token10
Token ID940
Feature activation-3.19e-01

Pos logit contributions
nown+0.832
Jac+0.766
)+0.765
bay+0.747
cape+0.739
Neg logit contributions
 Worker-0.886
 competitor-0.843
roid-0.836
 logger-0.832
 subordinates-0.830
Token>
Token ID29
Feature activation-2.95e-01

Pos logit contributions
nown+1.617
Jac+1.489
)+1.487
bay+1.458
cape+1.440
Neg logit contributions
 Worker-1.706
 competitor-1.622
roid-1.606
 logger-1.603
 subordinates-1.600
Token ke
Token ID885
Feature activation-5.36e-01

Pos logit contributions
nown+1.681
)+1.569
Jac+1.565
bay+1.532
cape+1.521
Neg logit contributions
 Worker-1.378
 competitor-1.301
roid-1.299
 subordinates-1.282
 logger-1.282
Tokenk
Token ID74
Feature activation-2.31e-01

Pos logit contributions
nown+2.610
Jac+2.385
)+2.358
bay+2.347
cape+2.316
Neg logit contributions
 Worker-2.934
 competitor-2.802
 subordinates-2.777
 logger-2.776
 refugees-2.757
Token m
Token ID285
Feature activation-3.94e-01

Pos logit contributions
nown+1.401
Jac+1.306
)+1.304
bay+1.283
cape+1.270
Neg logit contributions
 Worker-0.999
 competitor-0.935
roid-0.930
 logger-0.923
 subordinates-0.920
Token2
Token ID17
Feature activation-3.39e-01

Pos logit contributions
nown+1.365
Jac+1.207
)+1.202
bay+1.178
cape+1.144
Neg logit contributions
 Worker-2.723
 competitor-2.617
 logger-2.601
roid-2.588
 subordinates-2.586
Tokenk
Token ID74
Feature activation-1.15e-01

Pos logit contributions
nown+1.514
)+1.381
Jac+1.377
bay+1.361
cape+1.334
Neg logit contributions
 Worker-1.998
 competitor-1.911
roid-1.900
 subordinates-1.896
 logger-1.892
Token [
Token ID685
Feature activation-1.02e-01

Pos logit contributions
nown+0.732
Jac+0.686
)+0.684
bay+0.673
cape+0.667
Neg logit contributions
 Worker-0.467
 competitor-0.436
roid-0.434
 logger-0.429
 subordinates-0.429
Token,
Token ID11
Feature activation-2.45e-01

Pos logit contributions
nown+1.569
)+1.467
Jac+1.457
bay+1.419
cape+1.398
Neg logit contributions
 Worker-1.410
 competitor-1.332
roid-1.326
 subordinates-1.317
 logger-1.316
Token2
Token ID17
Feature activation-9.32e-02

Pos logit contributions
nown+1.303
Jac+1.211
)+1.203
bay+1.179
cape+1.160
Neg logit contributions
 Worker-1.230
 competitor-1.167
roid-1.161
 subordinates-1.156
 logger-1.155
Token,
Token ID11
Feature activation-6.14e-02

Pos logit contributions
nown+0.514
)+0.479
Jac+0.478
bay+0.466
cape+0.461
Neg logit contributions
 Worker-0.455
 competitor-0.430
roid-0.428
 logger-0.424
 subordinates-0.423
Token..
Token ID492
Feature activation-1.11e-01

Pos logit contributions
nown+0.219
Jac+0.195
)+0.194
bay+0.188
cape+0.185
Neg logit contributions
 Worker-0.420
 competitor-0.403
roid-0.401
 logger-0.399
 subordinates-0.399
Token *)
Token ID31936
Feature activation-7.98e-02

Pos logit contributions
nown+0.572
Jac+0.529
)+0.527
bay+0.516
cape+0.509
Neg logit contributions
 Worker-0.587
 competitor-0.558
roid-0.554
 logger-0.551
 subordinates-0.551
Token graph
Token ID4823
Feature activation-5.69e-01

Pos logit contributions
nown+0.465
Jac+0.434
)+0.433
bay+0.425
cape+0.421
Neg logit contributions
 Worker-0.363
 competitor-0.342
roid-0.341
 logger-0.337
 subordinates-0.337
TokenList
Token ID8053
Feature activation-2.84e-01

Pos logit contributions
nown+3.188
Jac+2.999
)+2.965
cape+2.923
bay+2.913
Neg logit contributions
 Worker-2.658
 competitor-2.496
 subordinates-2.492
roid-2.484
 logger-2.480
Token =
Token ID796
Feature activation+2.31e-01

Pos logit contributions
nown+1.691
Jac+1.591
)+1.578
bay+1.553
cape+1.545
Neg logit contributions
 Worker-1.248
 competitor-1.171
roid-1.160
 logger-1.158
 subordinates-1.157
Token Table
Token ID8655
Feature activation-2.30e-01

Pos logit contributions
 Worker+1.042
 competitor+0.981
roid+0.967
 logger+0.963
 subordinates+0.958
Neg logit contributions
nown-1.375
Jac-1.279
)-1.277
bay-1.255
cape-1.240
Token[
Token ID58
Feature activation-2.16e-01

Pos logit contributions
nown+1.732
Jac+1.646
)+1.640
bay+1.617
cape+1.608
Neg logit contributions
 Worker-0.661
 competitor-0.597
roid-0.590
 logger-0.585
 subordinates-0.582
Token {
Token ID1391
Feature activation-4.94e-02

Pos logit contributions
nown+1.176
Jac+1.098
)+1.088
bay+1.070
cape+1.056
Neg logit contributions
 Worker-1.063
 competitor-1.003
 logger-0.997
roid-0.997
 subordinates-0.993