TOP ACTIVATIONS
MAX = 3.454

David Duke . ( Joe Bur bank / Or lando Sentinel
] According to Mub rak p uri , only
. ( Photo : Jason Bah r / Getty Images )
Int rig uing ly , the few
cr acker doll named Colonel Nut ley . ( The nut
days . R S enth il Kumar / Press
organisations . R S enth il Kumar / Press
United Supporters Trust ( N UST ), N UFC Fans United
: ( Example : Th urb ion the Paladin became a
a cocktail a Rob Roy and eating with his
the American mind . Rachel Salt z c alls it "
Netflix is probably the best streaming help in the world .
among many . <|endoftext|> B ud we iser Cly des d
pleasures of " The Three Mus ket eers " or Sherlock
SAR 0 . 1 3 Brune i Dar uss alam 2
you have some metal wire laying around ) If your c
in Rub en " R UB INO " Vill arro el
ac ism Vs Xbox - Lou C k Bob & Jimmy
who qu its his pro dig iously paying job to pursue
the growing number of r umb lings , and the prag

MIN ACTIVATIONS
MIN = -2.885

mimic that pattern . When finished tie a knot on the
this girl who has won Miss Chinatown ; she
LO EB / AFP / Getty Images ORIG FILE ID :
Mellon University in Pittsburgh , Pennsylvania , who is a co
mysteries in the history of Herm osa Beach , officials announced
dominant palm opl ant ar ker at oder ma with variable
<|endoftext|> Donald Mir alle / Getty Images The latest
ker at oder ma , inherited as an autos omal dominant
Ke ans burg , New Jersey , following a massive fish
/ Los Angeles Times / Getty Images In a
a black hat and rain coat , with the words "
ococ cus , Gard n erella vaginal is , Mobil un
to push hemp across the finish line , not only for
23 Republican - held districts won by Clinton last year ,
his landslide re - election victory last night , New Jersey
been diagnosed with post - traumatic stress disorder and , with
executions took place in Chong jin in North Ham gy ong
is q ars / f atal . html . CDC .
no longer professional enough or unpre jud iced enough to publish
Photo by John Moore / Getty Images ) Jeff Sessions Sen

INTERVAL 1.869 - 3.454
CONTAINS 0.123%

down every day and then mineral izes , forming a hard
National Post reading " ISIS Ur ges Jihad ists To Attack
expand the program to include broadband internet access .
for comments about the mother grizz ly killing a person in
. 3 Texas at this point . Especially if

INTERVAL 0.285 - 1.869
CONTAINS 28.100%

the Singapore an version is similar to the current Port rait
! ( 9 ) Surv ive the manual override
its . He has expressed his sadness and displeasure at how
you re craft beer new bie or a veteran
than the Dolphins . Only three quarterbacks have scored at least

INTERVAL -1.300 - 0.285
CONTAINS 70.623%

Desc ends on Large , Conf used , Nation
it ; this gives the Yes campaign ample political space to
icester have an excellent home record in this competition and we
< num Part icles ; ++ u Part icle ) {
Sad ain noted that an education department policy states that

INTERVAL -2.885 - -1.300
CONTAINS 1.154%

, can no longer be avoided as the effects of a
has had an easy run of things lately . Owner Sylvia
Wait ... Did that Pokémon just talk ?
In 2002 , N ana won the Sh og ak uk
though . [ Photo : YouTube screen grab ]
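
The four INTERVAL blocks above appear to be equal-width bins spanning MIN = -2.885 to MAX = 3.454 (each is about 1.585 wide), and their CONTAINS percentages sum to 100%. The sketch below shows one way such a breakdown could be computed. It is an illustration under that assumption, not the dashboard's actual code, and the names `equal_width_edges` and `interval_breakdown` are hypothetical.

```python
def equal_width_edges(lo, hi, n_bins=4):
    """Return n_bins + 1 evenly spaced bin edges from lo to hi."""
    step = (hi - lo) / n_bins
    return [lo + i * step for i in range(n_bins)] + [hi]


def interval_breakdown(activations, n_bins=4):
    """Bucket activations into equal-width intervals between their min and max.

    Returns (low_edge, high_edge, percent) tuples, highest interval first,
    matching the order of the INTERVAL blocks in the dump.
    """
    lo, hi = min(activations), max(activations)
    edges = equal_width_edges(lo, hi, n_bins)
    width = (hi - lo) / n_bins
    counts = [0] * n_bins
    for a in activations:
        # The maximum value falls on the last edge, so clamp it into the top bin.
        idx = min(int((a - lo) / width), n_bins - 1) if width > 0 else 0
        counts[idx] += 1
    total = len(activations)
    return [
        (edges[i], edges[i + 1], 100.0 * counts[i] / total)
        for i in reversed(range(n_bins))
    ]


# Edges implied by the MIN and MAX values reported in this dump.
edges = equal_width_edges(-2.885, 3.454)

# Toy activations (made up for illustration, not taken from the dump).
rows = interval_breakdown([0.0, 1.0, 2.0, 3.0, 4.0])
```

With the reported MIN and MAX, the computed edges agree with the printed boundaries (-1.300, 0.285, 1.869) to within rounding.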
Token David
Token ID3271
Feature activation+4.50e-01

Pos logit contributions
jud+0.886
andre+0.849
ca+0.826
imposed+0.786
traumatic+0.770
Neg logit contributions
idge-0.951
eering-0.950
IBLE-0.787
 nodd-0.781
rients-0.779
Token Duke
Token ID11083
Feature activation+5.65e-02

Pos logit contributions
eering+1.725
idge+1.690
IBLE+1.441
 nodd+1.400
yssey+1.396
Neg logit contributions
jud-1.397
andre-1.323
ca-1.293
imposed-1.243
traumatic-1.214
Token.
Token ID13
Feature activation-1.67e-01

Pos logit contributions
eering+0.247
idge+0.243
IBLE+0.211
 nodd+0.207
yssey+0.206
Neg logit contributions
jud-0.145
andre-0.137
ca-0.133
imposed-0.126
traumatic-0.122
Token (
Token ID357
Feature activation+1.34e+00

Pos logit contributions
jud+0.560
andre+0.535
ca+0.529
imposed+0.500
traumatic+0.491
Neg logit contributions
eering-0.594
idge-0.586
 nodd-0.491
IBLE-0.489
yssey-0.483
TokenJoe
Token ID19585
Feature activation+8.65e-02

Pos logit contributions
eering+5.508
idge+5.422
IBLE+4.703
 nodd+4.558
yssey+4.548
Neg logit contributions
jud-3.636
andre-3.445
ca-3.285
imposed-3.181
traumatic-3.117
Token Bur
Token ID5481
Feature activation+3.45e+00

Pos logit contributions
eering+0.328
idge+0.323
IBLE+0.274
 nodd+0.267
yssey+0.266
Neg logit contributions
jud-0.272
andre-0.259
ca-0.253
imposed-0.242
traumatic-0.236
Tokenbank
Token ID17796
Feature activation+6.76e-01

Pos logit contributions
idge+10.492
eering+9.989
yssey+8.308
rient+8.242
achev+7.937
Neg logit contributions
jud-10.926
andre-10.773
imposed-10.105
traumatic-10.050
{\-9.931
Token/
Token ID14
Feature activation-1.15e+00

Pos logit contributions
eering+3.146
idge+3.103
IBLE+2.715
yssey+2.660
 nodd+2.650
Neg logit contributions
jud-1.538
andre-1.430
ca-1.368
imposed-1.304
traumatic-1.263
TokenOr
Token ID5574
Feature activation-1.37e+00

Pos logit contributions
jud+3.183
andre+2.980
ca+2.901
imposed+2.755
cas+2.597
Neg logit contributions
eering-4.808
idge-4.685
rients-4.030
 nodd-4.012
IBLE-4.007
Tokenlando
Token ID11993
Feature activation-4.10e-02

Pos logit contributions
jud+4.254
ca+4.119
andre+4.049
cas+3.813
activated+3.647
Neg logit contributions
eering-5.146
idge-4.932
IBLE-4.357
rients-4.262
 nodd-4.203
Token Sentinel
Token ID26716
Feature activation-6.94e-01

Pos logit contributions
jud+0.077
andre+0.071
ca+0.068
imposed+0.063
traumatic+0.061
Neg logit contributions
eering-0.207
idge-0.204
IBLE-0.181
 nodd-0.178
yssey-0.177
Token]
Token ID60
Feature activation+1.01e+00

Pos logit contributions
eering+4.154
idge+4.102
IBLE+3.444
yssey+3.319
 nodd+3.314
Neg logit contributions
jud-3.584
andre-3.396
ca-3.242
imposed-3.191
traumatic-3.108
Token
Token ID198
Feature activation+3.23e-01

Pos logit contributions
eering+3.737
idge+3.672
IBLE+3.110
rients+2.986
yssey+2.977
Neg logit contributions
jud-3.233
andre-3.095
ca-2.957
imposed-2.906
traumatic-2.834
Token
Token ID198
Feature activation-1.47e-01

Pos logit contributions
eering+1.203
idge+1.183
IBLE+0.996
rients+0.954
yssey+0.946
Neg logit contributions
jud-1.020
andre-0.980
imposed-0.931
ca-0.925
traumatic-0.904
TokenAccording
Token ID4821
Feature activation+5.37e-01

Pos logit contributions
jud+0.394
andre+0.371
ca+0.362
imposed+0.342
traumatic+0.333
Neg logit contributions
eering-0.631
idge-0.621
IBLE-0.537
 nodd-0.530
yssey-0.525
Token to
Token ID284
Feature activation-2.18e-01

Pos logit contributions
eering+2.141
idge+2.114
IBLE+1.808
yssey+1.767
rients+1.753
Neg logit contributions
jud-1.556
andre-1.482
ca-1.434
imposed-1.387
traumatic-1.359
Token Mub
Token ID40052
Feature activation+3.05e+00

Pos logit contributions
jud+0.698
andre+0.665
ca+0.652
imposed+0.617
traumatic+0.606
Neg logit contributions
eering-0.818
idge-0.803
IBLE-0.681
 nodd-0.664
rients-0.662
Tokenrak
Token ID17716
Feature activation+4.63e-01

Pos logit contributions
idge+12.250
eering+12.207
yssey+10.398
rients+10.280
rient+10.045
Neg logit contributions
jud-7.090
andre-6.675
imposed-6.530
traumatic-6.316
ca-6.255
Tokenp
Token ID79
Feature activation-4.97e-01

Pos logit contributions
eering+1.923
idge+1.891
IBLE+1.631
 nodd+1.596
yssey+1.589
Neg logit contributions
jud-1.291
andre-1.219
ca-1.186
imposed-1.128
traumatic-1.100
Tokenuri
Token ID9900
Feature activation-3.40e-01

Pos logit contributions
jud+1.484
andre+1.394
ca+1.380
imposed+1.292
traumatic+1.252
Neg logit contributions
eering-1.988
idge-1.912
IBLE-1.687
 nodd-1.650
rients-1.622
Token,
Token ID11
Feature activation-3.11e-01

Pos logit contributions
jud+0.937
andre+0.875
ca+0.861
imposed+0.806
traumatic+0.786
Neg logit contributions
eering-1.436
idge-1.405
IBLE-1.224
 nodd-1.194
yssey-1.188
Token only
Token ID691
Feature activation-6.19e-01

Pos logit contributions
jud+0.999
andre+0.943
ca+0.938
imposed+0.892
traumatic+0.867
Neg logit contributions
eering-1.170
idge-1.143
IBLE-0.972
 nodd-0.946
yssey-0.945
Token.
Token ID13
Feature activation+3.14e-01

Pos logit contributions
eering+3.279
idge+3.233
IBLE+2.780
 nodd+2.713
rients+2.707
Neg logit contributions
jud-2.108
andre-1.988
ca-1.908
imposed-1.846
traumatic-1.787
Token (
Token ID357
Feature activation+1.17e+00

Pos logit contributions
eering+1.255
idge+1.237
IBLE+1.060
 nodd+1.041
yssey+1.034
Neg logit contributions
jud-0.921
andre-0.872
ca-0.853
imposed-0.809
traumatic-0.791
TokenPhoto
Token ID6191
Feature activation-1.22e+00

Pos logit contributions
eering+4.717
idge+4.668
IBLE+4.047
yssey+3.863
 nodd+3.840
Neg logit contributions
jud-3.186
andre-2.989
ca-2.872
imposed-2.805
traumatic-2.757
Token:
Token ID25
Feature activation+5.45e-01

Pos logit contributions
jud+3.333
andre+3.144
ca+3.111
imposed+2.888
traumatic+2.809
Neg logit contributions
eering-5.024
idge-4.963
 nodd-4.346
IBLE-4.322
yssey-4.224
Token Jason
Token ID8982
Feature activation+2.19e-01

Pos logit contributions
eering+1.992
idge+1.960
IBLE+1.651
 nodd+1.605
yssey+1.596
Neg logit contributions
jud-1.781
andre-1.693
ca-1.652
imposed-1.595
traumatic-1.563
Token Bah
Token ID13081
Feature activation+2.98e+00

Pos logit contributions
eering+0.741
idge+0.728
IBLE+0.604
 nodd+0.593
rients+0.587
Neg logit contributions
jud-0.779
andre-0.749
ca-0.733
imposed-0.699
traumatic-0.689
Tokenr
Token ID81
Feature activation-2.69e-01

Pos logit contributions
eering+11.154
idge+10.785
yssey+9.120
rients+8.976
rient+8.894
Neg logit contributions
jud-8.125
andre-7.452
imposed-7.203
traumatic-7.168
enforcement-6.926
Token/
Token ID14
Feature activation-7.83e-01

Pos logit contributions
jud+0.588
andre+0.555
ca+0.536
imposed+0.490
traumatic+0.476
Neg logit contributions
eering-1.279
idge-1.250
IBLE-1.112
 nodd-1.094
rients-1.086
TokenGetty
Token ID6633
Feature activation-2.26e+00

Pos logit contributions
jud+2.130
andre+2.019
ca+1.966
imposed+1.830
cas+1.766
Neg logit contributions
eering-3.310
idge-3.247
 nodd-2.796
IBLE-2.792
rients-2.757
Token Images
Token ID5382
Feature activation-8.73e-01

Pos logit contributions
jud+4.574
andre+4.301
ca+4.213
imposed+3.842
traumatic+3.685
Neg logit contributions
eering-10.363
idge-10.239
yip-9.123
 nodd-9.079
yssey-8.998
Token)
Token ID8
Feature activation+2.06e-01

Pos logit contributions
jud+2.784
andre+2.652
ca+2.606
imposed+2.467
traumatic+2.410
Neg logit contributions
eering-3.239
idge-3.192
IBLE-2.688
 nodd-2.642
rients-2.642
Token
Token ID198
Feature activation+1.53e-01

Pos logit contributions
eering+0.148
idge+0.146
IBLE+0.126
 nodd+0.124
yssey+0.123
Neg logit contributions
jud-0.102
andre-0.097
ca-0.096
imposed-0.089
traumatic-0.088
Token
Token ID198
Feature activation-1.61e-01

Pos logit contributions
eering+0.581
idge+0.572
IBLE+0.481
rients+0.465
yssey+0.462
Neg logit contributions
jud-0.476
andre-0.457
imposed-0.433
ca-0.432
traumatic-0.419
Token
Token ID447
Feature activation-1.30e-02

Pos logit contributions
jud+0.433
andre+0.407
ca+0.400
imposed+0.374
traumatic+0.367
Neg logit contributions
eering-0.688
idge-0.675
 nodd-0.585
IBLE-0.584
yssey-0.572
Token
Token ID250
Feature activation+8.61e-01

Pos logit contributions
jud+0.049
andre+0.047
ca+0.046
imposed+0.045
traumatic+0.044
Neg logit contributions
eering-0.041
idge-0.040
IBLE-0.033
 nodd-0.032
rients-0.031
TokenInt
Token ID5317
Feature activation+4.41e-01

Pos logit contributions
eering+3.573
idge+3.535
IBLE+3.043
yssey+2.953
rients+2.952
Neg logit contributions
jud-2.358
andre-2.228
ca-2.144
imposed-2.073
traumatic-2.012
Tokenrig
Token ID4359
Feature activation+2.95e+00

Pos logit contributions
eering+2.036
idge+2.012
IBLE+1.752
rients+1.724
yssey+1.723
Neg logit contributions
jud-1.006
andre-0.946
ca-0.911
imposed-0.863
traumatic-0.840
Tokenuing
Token ID4250
Feature activation+1.72e-01

Pos logit contributions
idge+7.174
eering+7.133
IBLE+5.408
yssey+5.207
rients+4.955
Neg logit contributions
jud-12.088
andre-11.886
ca-11.666
imposed-11.358
traumatic-11.264
Tokenly
Token ID306
Feature activation+7.15e-02

Pos logit contributions
eering+0.301
idge+0.287
IBLE+0.194
 nodd+0.180
rients+0.176
Neg logit contributions
jud-0.898
andre-0.867
ca-0.860
imposed-0.832
traumatic-0.823
Token,
Token ID11
Feature activation-1.31e-01

Pos logit contributions
eering+0.309
idge+0.305
IBLE+0.264
 nodd+0.258
yssey+0.257
Neg logit contributions
jud-0.187
andre-0.176
ca-0.170
imposed-0.162
traumatic-0.158
Token the
Token ID262
Feature activation-8.05e-02

Pos logit contributions
jud+0.400
andre+0.378
ca+0.372
imposed+0.352
traumatic+0.345
Neg logit contributions
eering-0.509
idge-0.499
IBLE-0.428
 nodd-0.416
yssey-0.416
Token few
Token ID1178
Feature activation+7.09e-01

Pos logit contributions
jud+0.286
andre+0.273
ca+0.270
imposed+0.256
traumatic+0.252
Neg logit contributions
eering-0.274
idge-0.266
IBLE-0.222
yssey-0.216
rients-0.215
Tokencr
Token ID6098
Feature activation+9.39e-01

Pos logit contributions
eering+9.108
idge+8.967
IBLE+7.674
rients+7.592
 nodd+7.553
Neg logit contributions
jud-5.991
andre-5.729
ca-5.260
imposed-5.228
traumatic-5.088
Tokenacker
Token ID10735
Feature activation+7.47e-01

Pos logit contributions
idge+2.938
eering+2.909
IBLE+2.306
yssey+2.250
rients+2.238
Neg logit contributions
jud-3.518
andre-3.402
ca-3.284
imposed-3.220
traumatic-3.159
Token doll
Token ID3654
Feature activation-8.48e-01

Pos logit contributions
eering+3.091
idge+3.048
IBLE+2.619
 nodd+2.565
yssey+2.548
Neg logit contributions
jud-2.081
andre-1.969
ca-1.909
imposed-1.822
traumatic-1.781
Token named
Token ID3706
Feature activation+7.00e-01

Pos logit contributions
jud+2.250
andre+2.116
ca+2.082
imposed+1.971
traumatic+1.874
Neg logit contributions
eering-3.617
idge-3.516
IBLE-3.108
rients-3.062
 nodd-3.011
Token Colonel
Token ID21636
Feature activation+1.03e+00

Pos logit contributions
eering+2.583
idge+2.542
IBLE+2.140
 nodd+2.077
yssey+2.071
Neg logit contributions
jud-2.267
andre-2.155
ca-2.099
imposed-2.031
traumatic-1.985
Token Nut
Token ID11959
Feature activation+2.95e+00

Pos logit contributions
eering+3.665
idge+3.607
IBLE+3.007
rients+2.911
 nodd+2.899
Neg logit contributions
jud-3.447
andre-3.286
ca-3.186
imposed-3.117
traumatic-3.039
Tokenley
Token ID1636
Feature activation+4.05e-01

Pos logit contributions
idge+12.225
eering+11.967
rients+10.517
rient+10.233
yssey+10.069
Neg logit contributions
jud-7.131
andre-6.820
imposed-6.178
ca-6.059
traumatic-5.970
Token.
Token ID13
Feature activation+5.39e-02

Pos logit contributions
eering+1.740
idge+1.713
IBLE+1.486
 nodd+1.453
yssey+1.449
Neg logit contributions
jud-1.068
andre-1.006
ca-0.977
imposed-0.927
traumatic-0.902
Token (
Token ID357
Feature activation+1.41e+00

Pos logit contributions
eering+0.210
idge+0.207
IBLE+0.176
 nodd+0.176
yssey+0.173
Neg logit contributions
jud-0.163
andre-0.155
ca-0.153
imposed-0.144
traumatic-0.141
TokenThe
Token ID464
Feature activation+4.66e-01

Pos logit contributions
eering+5.760
idge+5.714
IBLE+5.012
yssey+4.743
rients+4.742
Neg logit contributions
jud-3.775
andre-3.621
ca-3.382
imposed-3.342
traumatic-3.241
Token nut
Token ID6701
Feature activation+2.12e+00

Pos logit contributions
eering+1.612
idge+1.580
IBLE+1.317
yssey+1.280
 nodd+1.278
Neg logit contributions
jud-1.626
andre-1.551
ca-1.520
imposed-1.460
traumatic-1.432
Token days
Token ID1528
Feature activation+9.87e-01

Pos logit contributions
eering+4.113
idge+4.013
IBLE+3.452
 nodd+3.397
rients+3.337
Neg logit contributions
jud-2.995
andre-2.877
ca-2.762
imposed-2.626
traumatic-2.610
Token.
Token ID13
Feature activation-1.99e-01

Pos logit contributions
eering+4.117
idge+4.047
IBLE+3.472
 nodd+3.398
yssey+3.394
Neg logit contributions
jud-2.671
andre-2.522
ca-2.420
imposed-2.337
traumatic-2.283
Token
Token ID198
Feature activation-7.58e-02

Pos logit contributions
jud+0.579
andre+0.550
ca+0.542
imposed+0.509
traumatic+0.498
Neg logit contributions
eering-0.796
idge-0.784
IBLE-0.670
 nodd-0.667
yssey-0.657
Token
Token ID198
Feature activation-2.90e-01

Pos logit contributions
jud+0.231
andre+0.221
ca+0.211
imposed+0.208
traumatic+0.202
Neg logit contributions
eering-0.294
idge-0.289
IBLE-0.246
rients-0.236
yssey-0.236
TokenR
Token ID49
Feature activation+1.08e+00

Pos logit contributions
jud+0.776
andre+0.732
ca+0.714
imposed+0.674
traumatic+0.656
Neg logit contributions
eering-1.242
idge-1.222
IBLE-1.055
 nodd-1.050
yssey-1.033
Token S
Token ID311
Feature activation+2.89e+00

Pos logit contributions
eering+4.161
idge+4.132
IBLE+3.486
yssey+3.387
rients+3.380
Neg logit contributions
jud-3.319
andre-3.150
ca-3.063
imposed-2.953
traumatic-2.892
Tokenenth
Token ID7944
Feature activation+1.40e+00

Pos logit contributions
idge+10.736
eering+10.608
IBLE+8.995
rients+8.950
rient+8.690
Neg logit contributions
jud-7.878
andre-7.491
ca-7.324
imposed-7.232
traumatic-6.920
Tokenil
Token ID346
Feature activation+8.22e-02

Pos logit contributions
eering+5.427
idge+5.421
IBLE+4.548
rients+4.478
yssey+4.461
Neg logit contributions
jud-4.143
andre-3.926
ca-3.831
imposed-3.692
traumatic-3.589
Token Kumar
Token ID26105
Feature activation+3.00e-01

Pos logit contributions
eering+0.329
idge+0.321
IBLE+0.277
 nodd+0.271
yssey+0.269
Neg logit contributions
jud-0.243
andre-0.231
ca-0.225
imposed-0.214
traumatic-0.209
Token/
Token ID14
Feature activation-1.09e+00

Pos logit contributions
eering+1.212
idge+1.193
IBLE+1.025
 nodd+1.001
yssey+0.997
Neg logit contributions
jud-0.865
andre-0.820
ca-0.798
imposed-0.761
traumatic-0.743
TokenPress
Token ID13800
Feature activation-6.52e-01

Pos logit contributions
jud+2.930
andre+2.740
ca+2.659
imposed+2.491
qs+2.381
Neg logit contributions
eering-4.633
idge-4.524
IBLE-3.898
 nodd-3.873
rients-3.834
Token organisations
Token ID16435
Feature activation+2.84e-01

Pos logit contributions
eering+2.096
idge+2.059
IBLE+1.702
 nodd+1.655
rients+1.642
Neg logit contributions
jud-2.277
andre-2.179
ca-2.124
imposed-2.063
traumatic-2.026
Token.
Token ID13
Feature activation+1.68e-01

Pos logit contributions
eering+1.209
idge+1.191
IBLE+1.031
 nodd+1.007
rients+1.003
Neg logit contributions
jud-0.762
andre-0.718
ca-0.696
imposed-0.663
traumatic-0.647
Token
Token ID198
Feature activation+1.99e-01

Pos logit contributions
eering+0.663
idge+0.653
IBLE+0.557
 nodd+0.548
yssey+0.543
Neg logit contributions
jud-0.505
andre-0.479
ca-0.470
imposed-0.445
traumatic-0.436
Token
Token ID198
Feature activation-9.75e-02

Pos logit contributions
eering+0.752
idge+0.738
IBLE+0.623
rients+0.597
yssey+0.595
Neg logit contributions
jud-0.623
andre-0.599
imposed-0.568
ca-0.566
traumatic-0.551
TokenR
Token ID49
Feature activation+9.80e-01

Pos logit contributions
jud+0.263
andre+0.248
ca+0.241
imposed+0.229
traumatic+0.223
Neg logit contributions
eering-0.414
idge-0.407
IBLE-0.352
 nodd-0.346
yssey-0.343
Token S
Token ID311
Feature activation+2.79e+00

Pos logit contributions
eering+3.788
idge+3.746
IBLE+3.176
yssey+3.090
 nodd+3.089
Neg logit contributions
jud-2.987
andre-2.834
ca-2.756
imposed-2.653
traumatic-2.594
Tokenenth
Token ID7944
Feature activation+6.96e-01

Pos logit contributions
idge+4.609
eering+4.287
rients+2.614
yssey+2.608
IBLE+2.605
Neg logit contributions
jud-13.416
ca-13.058
andre-13.051
imposed-12.988
cas-12.771
Tokenil
Token ID346
Feature activation+7.85e-02

Pos logit contributions
idge+2.204
eering+2.202
IBLE+1.755
rients+1.723
yssey+1.716
Neg logit contributions
jud-2.572
andre-2.463
ca-2.397
imposed-2.356
traumatic-2.311
Token Kumar
Token ID26105
Feature activation-3.89e-02

Pos logit contributions
eering+0.295
idge+0.290
IBLE+0.245
 nodd+0.238
yssey+0.238
Neg logit contributions
jud-0.250
andre-0.238
ca-0.232
imposed-0.223
traumatic-0.218
Token/
Token ID14
Feature activation-9.51e-01

Pos logit contributions
jud+0.086
andre+0.079
ca+0.076
imposed+0.072
traumatic+0.069
Neg logit contributions
eering-0.184
idge-0.182
IBLE-0.159
 nodd-0.156
yssey-0.156
Token Press
Token ID4332
Feature activation-8.48e-01

Pos logit contributions
jud+2.367
andre+2.246
ca+2.182
imposed+2.032
traumatic+1.954
Neg logit contributions
eering-4.208
idge-4.119
IBLE-3.582
 nodd-3.573
rients-3.513
Token United
Token ID1578
Feature activation-6.71e-01

Pos logit contributions
jud+1.406
andre+1.322
ca+1.316
imposed+1.228
traumatic+1.191
Neg logit contributions
eering-2.089
idge-2.044
IBLE-1.789
 nodd-1.745
yssey-1.739
Token Supporters
Token ID39995
Feature activation-1.55e+00

Pos logit contributions
jud+2.011
andre+1.914
ca+1.911
imposed+1.775
traumatic+1.729
Neg logit contributions
eering-2.594
idge-2.548
IBLE-2.216
 nodd-2.163
yssey-2.148
Token Trust
Token ID9870
Feature activation+7.41e-01

Pos logit contributions
jud+4.011
ca+3.765
andre+3.732
imposed+3.438
qs+3.348
Neg logit contributions
eering-6.479
idge-6.310
IBLE-5.579
 nodd-5.544
yssey-5.410
Token (
Token ID357
Feature activation+7.58e-02

Pos logit contributions
eering+3.124
idge+3.084
IBLE+2.655
 nodd+2.591
rients+2.588
Neg logit contributions
jud-2.005
andre-1.895
ca-1.829
imposed-1.753
traumatic-1.707
TokenN
Token ID45
Feature activation+5.79e-01

Pos logit contributions
eering+0.316
idge+0.311
IBLE+0.268
 nodd+0.262
yssey+0.261
Neg logit contributions
jud-0.211
andre-0.199
ca-0.194
imposed-0.185
traumatic-0.180
TokenUST
Token ID7759
Feature activation+2.79e+00

Pos logit contributions
eering+2.256
idge+2.223
IBLE+1.895
 nodd+1.844
yssey+1.841
Neg logit contributions
jud-1.753
andre-1.664
ca-1.621
imposed-1.552
traumatic-1.518
Token),
Token ID828
Feature activation+4.16e-01

Pos logit contributions
idge+10.059
eering+10.023
IBLE+8.666
yssey+8.103
rients+8.083
Neg logit contributions
jud-8.089
andre-7.836
ca-7.363
imposed-7.314
traumatic-7.173
Token N
Token ID399
Feature activation-1.08e-03

Pos logit contributions
eering+1.455
idge+1.429
IBLE+1.194
 nodd+1.162
yssey+1.157
Neg logit contributions
jud-1.428
andre-1.364
ca-1.336
imposed-1.284
traumatic-1.258
TokenUFC
Token ID44534
Feature activation+1.19e+00

Pos logit contributions
jud+0.004
andre+0.004
ca+0.004
imposed+0.003
traumatic+0.003
Neg logit contributions
eering-0.004
idge-0.004
IBLE-0.003
 nodd-0.003
rients-0.003
Token Fans
Token ID24964
Feature activation-1.09e+00

Pos logit contributions
eering+4.797
idge+4.741
IBLE+4.048
yssey+3.936
rients+3.932
Neg logit contributions
jud-3.381
andre-3.209
ca-3.074
imposed-2.963
traumatic-2.896
Token United
Token ID1578
Feature activation+1.76e-02

Pos logit contributions
jud+2.912
andre+2.726
ca+2.723
imposed+2.507
traumatic+2.454
Neg logit contributions
eering-4.550
idge-4.436
IBLE-3.942
 nodd-3.887
yssey-3.788
Token:
Token ID25
Feature activation+2.85e-01

Pos logit contributions
eering+0.797
idge+0.784
IBLE+0.675
 nodd+0.659
yssey+0.656
Neg logit contributions
jud-0.573
andre-0.543
ca-0.530
imposed-0.504
traumatic-0.493
Token (
Token ID357
Feature activation+6.83e-01

Pos logit contributions
eering+1.138
idge+1.121
IBLE+0.960
 nodd+0.937
yssey+0.934
Neg logit contributions
jud-0.842
andre-0.797
ca-0.780
imposed-0.743
traumatic-0.726
TokenExample
Token ID16281
Feature activation-7.36e-01

Pos logit contributions
eering+2.707
idge+2.669
IBLE+2.290
yssey+2.221
 nodd+2.214
Neg logit contributions
jud-1.990
andre-1.901
ca-1.815
imposed-1.760
traumatic-1.717
Token:
Token ID25
Feature activation+7.31e-01

Pos logit contributions
jud+2.037
andre+1.912
ca+1.891
imposed+1.777
traumatic+1.748
Neg logit contributions
eering-3.034
idge-2.994
IBLE-2.579
 nodd-2.525
yssey-2.522
Token Th
Token ID536
Feature activation+8.78e-01

Pos logit contributions
eering+2.556
idge+2.505
IBLE+2.083
 nodd+2.023
yssey+2.023
Neg logit contributions
jud-2.499
andre-2.394
ca-2.319
imposed-2.249
traumatic-2.205
Tokenurb
Token ID5945
Feature activation+2.75e+00

Pos logit contributions
eering+2.818
idge+2.774
IBLE+2.259
rients+2.192
yssey+2.185
Neg logit contributions
jud-3.238
andre-3.106
ca-3.034
imposed-2.945
traumatic-2.899
Tokenion
Token ID295
Feature activation+3.75e-02

Pos logit contributions
idge+11.143
eering+10.866
IBLE+9.121
yssey+9.059
rients+8.972
Neg logit contributions
jud-7.292
andre-7.048
imposed-6.606
ca-6.567
traumatic-6.360
Token the
Token ID262
Feature activation+2.41e-02

Pos logit contributions
eering+0.142
idge+0.140
IBLE+0.120
 nodd+0.116
yssey+0.116
Neg logit contributions
jud-0.118
andre-0.112
ca-0.110
imposed-0.105
traumatic-0.102
Token Paladin
Token ID26693
Feature activation+2.53e-01

Pos logit contributions
eering+0.087
idge+0.086
IBLE+0.072
 nodd+0.070
yssey+0.070
Neg logit contributions
jud-0.080
andre-0.076
ca-0.075
imposed-0.071
traumatic-0.070
Token became
Token ID2627
Feature activation+1.64e-01

Pos logit contributions
eering+0.950
idge+0.934
IBLE+0.793
 nodd+0.772
yssey+0.768
Neg logit contributions
jud-0.807
andre-0.768
ca-0.750
imposed-0.718
traumatic-0.703
Token a
Token ID257
Feature activation-1.16e-01

Pos logit contributions
eering+0.619
idge+0.608
IBLE+0.515
 nodd+0.503
yssey+0.502
Neg logit contributions
jud-0.516
andre-0.491
ca-0.479
imposed-0.459
traumatic-0.450
Token a
Token ID257
Feature activation+8.68e-02

Pos logit contributions
eering+1.552
idge+1.524
IBLE+1.293
yssey+1.257
 nodd+1.257
Neg logit contributions
jud-1.330
andre-1.264
ca-1.237
imposed-1.185
traumatic-1.159
Token cocktail
Token ID24554
Feature activation-6.09e-01

Pos logit contributions
eering+0.297
idge+0.289
IBLE+0.243
rients+0.236
yssey+0.235
Neg logit contributions
jud-0.308
andre-0.291
ca-0.287
imposed-0.275
traumatic-0.269
Token
Token ID851
Feature activation+5.71e-01

Pos logit contributions
jud+1.600
andre+1.484
ca+1.455
imposed+1.381
traumatic+1.323
Neg logit contributions
eering-2.626
idge-2.558
IBLE-2.261
rients-2.214
yssey-2.202
Token a
Token ID257
Feature activation+5.46e-01

Pos logit contributions
eering+2.181
idge+2.145
IBLE+1.822
 nodd+1.776
yssey+1.770
Neg logit contributions
jud-1.778
andre-1.690
ca-1.648
imposed-1.579
traumatic-1.544
Token Rob
Token ID3851
Feature activation-8.87e-02

Pos logit contributions
eering+1.924
idge+1.890
IBLE+1.581
 nodd+1.537
yssey+1.531
Neg logit contributions
jud-1.866
andre-1.782
ca-1.742
imposed-1.676
traumatic-1.642
Token Roy
Token ID9817
Feature activation+2.70e+00

Pos logit contributions
jud+0.268
andre+0.254
ca+0.250
imposed+0.234
traumatic+0.229
Neg logit contributions
eering-0.350
idge-0.340
IBLE-0.297
 nodd-0.290
rients-0.285
Token
Token ID851
Feature activation+7.45e-01

Pos logit contributions
idge+11.016
eering+10.903
IBLE+9.096
yssey+9.090
 nodd+9.020
Neg logit contributions
jud-6.778
andre-6.462
imposed-5.906
traumatic-5.905
ca-5.821
Token and
Token ID290
Feature activation+1.29e-01

Pos logit contributions
eering+3.055
idge+3.011
IBLE+2.577
 nodd+2.524
yssey+2.512
Neg logit contributions
jud-2.103
andre-1.995
ca-1.929
imposed-1.842
traumatic-1.807
Token eating
Token ID6600
Feature activation+3.28e-01

Pos logit contributions
eering+0.504
idge+0.497
IBLE+0.424
yssey+0.413
rients+0.412
Neg logit contributions
jud-0.392
andre-0.371
ca-0.363
imposed-0.347
traumatic-0.338
Token with
Token ID351
Feature activation+1.02e-01

Pos logit contributions
eering+1.193
idge+1.171
IBLE+0.989
yssey+0.961
 nodd+0.959
Neg logit contributions
jud-1.081
andre-1.029
ca-1.007
imposed-0.966
traumatic-0.946
Token his
Token ID465
Feature activation+3.45e-01

Pos logit contributions
eering+0.356
idge+0.349
IBLE+0.292
 nodd+0.285
yssey+0.285
Neg logit contributions
jud-0.353
andre-0.338
ca-0.330
imposed-0.315
traumatic-0.310
Token the
Token ID262
Feature activation+1.80e-01

Pos logit contributions
eering+1.308
idge+1.287
IBLE+1.088
 nodd+1.057
yssey+1.055
Neg logit contributions
jud-1.145
andre-1.089
ca-1.065
imposed-1.019
traumatic-0.995
Token American
Token ID1605
Feature activation+2.64e-01

Pos logit contributions
eering+0.667
idge+0.657
IBLE+0.557
yssey+0.541
rients+0.538
Neg logit contributions
jud-0.584
andre-0.553
ca-0.541
imposed-0.518
traumatic-0.507
Token mind
Token ID2000
Feature activation+1.47e-01

Pos logit contributions
eering+0.983
idge+0.966
IBLE+0.818
yssey+0.795
 nodd+0.794
Neg logit contributions
jud-0.852
andre-0.808
ca-0.791
imposed-0.758
traumatic-0.741
Token.
Token ID13
Feature activation-1.40e-02

Pos logit contributions
eering+0.645
idge+0.637
IBLE+0.555
 nodd+0.542
yssey+0.540
Neg logit contributions
jud-0.377
andre-0.353
ca-0.344
imposed-0.325
traumatic-0.316
Token Rachel
Token ID15984
Feature activation+3.88e-01

Pos logit contributions
jud+0.044
andre+0.042
ca+0.042
imposed+0.039
traumatic+0.038
Neg logit contributions
eering-0.053
idge-0.052
IBLE-0.044
 nodd-0.044
yssey-0.043
Token Salt
Token ID13754
Feature activation+2.66e+00

Pos logit contributions
eering+1.344
idge+1.316
IBLE+1.100
 nodd+1.067
yssey+1.061
Neg logit contributions
jud-1.355
andre-1.297
ca-1.269
imposed-1.211
traumatic-1.193
Tokenz
Token ID89
Feature activation+6.27e-01

Pos logit contributions
idge+10.209
eering+9.802
rients+8.061
IBLE+8.035
yssey+7.995
Neg logit contributions
jud-7.521
andre-7.172
imposed-6.858
ca-6.854
traumatic-6.653
Tokenc
Token ID66
Feature activation-1.12e+00

Pos logit contributions
eering+2.487
idge+2.461
IBLE+2.089
 nodd+2.042
yssey+2.036
Neg logit contributions
jud-1.844
andre-1.748
ca-1.695
imposed-1.630
traumatic-1.593
Tokenalls
Token ID5691
Feature activation-4.37e-01

Pos logit contributions
jud+4.249
andre+4.053
ca+4.017
imposed+3.740
qs+3.703
Neg logit contributions
eering-3.567
idge-3.298
IBLE-2.897
rients-2.758
 nodd-2.754
Token it
Token ID340
Feature activation-8.19e-02

Pos logit contributions
jud+1.327
andre+1.240
ca+1.232
imposed+1.153
traumatic+1.126
Neg logit contributions
eering-1.708
idge-1.661
IBLE-1.449
 nodd-1.403
yssey-1.394
Token "
Token ID366
Feature activation-4.51e-01

Pos logit contributions
jud+0.266
andre+0.253
ca+0.248
imposed+0.236
traumatic+0.232
Neg logit contributions
eering-0.302
idge-0.296
IBLE-0.251
 nodd-0.244
yssey-0.243
Token Netflix
Token ID12074
Feature activation+2.05e+00

Pos logit contributions
eering+2.930
idge+2.884
IBLE+2.468
 nodd+2.404
yssey+2.401
Neg logit contributions
jud-2.198
andre-2.088
ca-2.022
imposed-1.944
traumatic-1.897
Token is
Token ID318
Feature activation+2.95e-01

Pos logit contributions
eering+6.590
idge+6.523
IBLE+5.348
yssey+5.219
 nodd+5.219
Neg logit contributions
jud-7.166
andre-6.871
ca-6.600
imposed-6.510
traumatic-6.345
Token probably
Token ID2192
Feature activation-4.69e-01

Pos logit contributions
eering+1.087
idge+1.069
IBLE+0.901
 nodd+0.878
yssey+0.875
Neg logit contributions
jud-0.962
andre-0.917
ca-0.896
imposed-0.860
traumatic-0.841
Token the
Token ID262
Feature activation-5.06e-02

Pos logit contributions
jud+1.433
andre+1.359
ca+1.339
imposed+1.266
traumatic+1.233
Neg logit contributions
eering-1.807
idge-1.783
IBLE-1.517
rients-1.486
yssey-1.476
Token best
Token ID1266
Feature activation+3.52e-01

Pos logit contributions
jud+0.185
andre+0.177
ca+0.174
imposed+0.166
traumatic+0.163
Neg logit contributions
eering-0.167
idge-0.164
IBLE-0.135
rients-0.131
yssey-0.131
Token streaming
Token ID11305
Feature activation+2.66e+00

Pos logit contributions
eering+1.162
idge+1.140
IBLE+0.941
 nodd+0.912
yssey+0.909
Neg logit contributions
jud-1.280
andre-1.225
ca-1.199
imposed-1.157
traumatic-1.135
Token help
Token ID1037
Feature activation-1.60e-01

Pos logit contributions
eering+8.043
idge+7.850
IBLE+6.542
 nodd+6.323
yssey+6.097
Neg logit contributions
jud-9.403
andre-9.101
ca-8.649
imposed-8.558
traumatic-8.450
Token in
Token ID287
Feature activation+3.30e-01

Pos logit contributions
jud+0.523
andre+0.497
ca+0.488
imposed+0.464
traumatic+0.453
Neg logit contributions
eering-0.586
idge-0.576
IBLE-0.483
 nodd-0.474
rients-0.471
Token the
Token ID262
Feature activation-4.14e-02

Pos logit contributions
eering+1.201
idge+1.180
IBLE+0.993
 nodd+0.967
yssey+0.963
Neg logit contributions
jud-1.089
andre-1.039
ca-1.014
imposed-0.975
traumatic-0.955
Token world
Token ID995
Feature activation+6.87e-01

Pos logit contributions
jud+0.162
andre+0.155
ca+0.153
imposed+0.147
traumatic+0.144
Neg logit contributions
eering-0.125
idge-0.123
IBLE-0.100
rients-0.096
 nodd-0.095
Token.
Token ID13
Feature activation+3.06e-02

Pos logit contributions
eering+2.865
idge+2.824
IBLE+2.433
 nodd+2.377
yssey+2.373
Neg logit contributions
jud-1.881
andre-1.776
ca-1.714
imposed-1.644
traumatic-1.610
Token among
Token ID1871
Feature activation+5.43e-01

Pos logit contributions
jud+0.484
andre+0.455
ca+0.446
imposed+0.420
traumatic+0.408
Neg logit contributions
eering-0.749
idge-0.739
IBLE-0.639
 nodd-0.628
yssey-0.626
Token many
Token ID867
Feature activation+1.71e-01

Pos logit contributions
eering+2.200
idge+2.165
IBLE+1.858
 nodd+1.815
yssey+1.807
Neg logit contributions
jud-1.565
andre-1.482
ca-1.439
imposed-1.376
traumatic-1.344
Token.
Token ID13
Feature activation+3.94e-01

Pos logit contributions
eering+0.652
idge+0.643
IBLE+0.545
yssey+0.532
 nodd+0.531
Neg logit contributions
jud-0.532
andre-0.505
ca-0.496
imposed-0.474
traumatic-0.462
Token<|endoftext|>
Token ID50256
Feature activation-4.78e-01

Pos logit contributions
eering+1.490
idge+1.465
IBLE+1.243
 nodd+1.207
yssey+1.204
Neg logit contributions
jud-1.242
andre-1.182
ca-1.150
imposed-1.107
traumatic-1.082
TokenB
Token ID33
Feature activation+1.35e+00

Pos logit contributions
jud+1.297
andre+1.218
ca+1.193
imposed+1.120
traumatic+1.095
Neg logit contributions
eering-2.035
idge-2.000
 nodd-1.725
IBLE-1.721
rients-1.690
Tokenud
Token ID463
Feature activation+2.65e+00

Pos logit contributions
eering+5.097
idge+5.053
IBLE+4.298
rients+4.143
yssey+4.141
Neg logit contributions
jud-4.160
andre-3.956
ca-3.844
imposed-3.705
traumatic-3.636
Tokenwe
Token ID732
Feature activation+3.01e-02

Pos logit contributions
idge+9.915
eering+9.523
IBLE+8.252
apest+8.169
yssey+8.141
Neg logit contributions
jud-7.694
andre-7.396
imposed-7.011
ca-6.980
traumatic-6.786
Tokeniser
Token ID5847
Feature activation-1.07e+00

Pos logit contributions
eering+0.117
idge+0.115
IBLE+0.099
 nodd+0.096
yssey+0.096
Neg logit contributions
jud-0.092
andre-0.087
ca-0.085
imposed-0.081
traumatic-0.079
Token Cly
Token ID29133
Feature activation-1.58e+00

Pos logit contributions
jud+3.494
ca+3.283
andre+3.255
imposed+3.085
traumatic+3.012
Neg logit contributions
eering-3.823
idge-3.748
IBLE-3.246
yssey-3.149
rients-3.135
Tokendes
Token ID8906
Feature activation-8.58e-01

Pos logit contributions
jud+2.267
ca+1.931
andre+1.822
imposed+1.491
cas+1.430
Neg logit contributions
eering-8.883
idge-8.519
IBLE-8.004
 nodd-7.872
ADRA-7.803
Tokend
Token ID67
Feature activation-7.55e-01

Pos logit contributions
jud+3.135
andre+3.030
ca+2.978
imposed+2.725
neg+2.713
Neg logit contributions
eering-2.856
idge-2.713
IBLE-2.343
 nodd-2.290
rients-2.290
Token pleasures
Token ID35396
Feature activation+1.41e-01

Pos logit contributions
jud+0.624
andre+0.591
ca+0.580
imposed+0.552
traumatic+0.539
Neg logit contributions
eering-0.771
idge-0.758
IBLE-0.650
 nodd-0.631
rients-0.628
Token of
Token ID286
Feature activation-1.01e+00

Pos logit contributions
eering+0.582
idge+0.574
IBLE+0.493
 nodd+0.483
yssey+0.481
Neg logit contributions
jud-0.392
andre-0.371
ca-0.359
imposed-0.343
traumatic-0.336
Token "
Token ID366
Feature activation-1.05e+00

Pos logit contributions
jud+3.174
andre+3.000
ca+2.969
imposed+2.795
cas+2.734
Neg logit contributions
eering-3.695
idge-3.615
IBLE-3.139
rients-3.012
yssey-2.995
TokenThe
Token ID464
Feature activation-2.71e-01

Pos logit contributions
jud+2.852
andre+2.636
ca+2.617
imposed+2.435
traumatic+2.401
Neg logit contributions
eering-4.415
idge-4.344
IBLE-3.818
 nodd-3.733
rients-3.685
Token Three
Token ID7683
Feature activation+6.35e-01

Pos logit contributions
jud+0.929
andre+0.885
ca+0.869
imposed+0.824
traumatic+0.815
Neg logit contributions
eering-0.948
idge-0.933
IBLE-0.789
 nodd-0.763
rients-0.756
Token Mus
Token ID2629
Feature activation+2.63e+00

Pos logit contributions
eering+2.288
idge+2.248
IBLE+1.877
 nodd+1.828
yssey+1.828
Neg logit contributions
jud-2.100
andre-2.018
ca-1.958
imposed-1.898
traumatic-1.856
Tokenket
Token ID7126
Feature activation+9.74e-02

Pos logit contributions
eering+7.981
idge+7.694
rients+6.322
yssey+6.264
IBLE+6.069
Neg logit contributions
jud-9.033
andre-8.731
imposed-8.251
ca-8.081
traumatic-8.038
Tokeneers
Token ID47619
Feature activation+3.82e-01

Pos logit contributions
eering+0.142
idge+0.139
IBLE+0.088
 nodd+0.079
yssey+0.076
Neg logit contributions
jud-0.530
andre-0.514
ca-0.509
imposed-0.494
traumatic-0.489
Token"
Token ID1
Feature activation+4.11e-01

Pos logit contributions
eering+1.684
idge+1.658
IBLE+1.438
 nodd+1.411
yssey+1.407
Neg logit contributions
jud-0.965
andre-0.908
ca-0.877
imposed-0.834
traumatic-0.811
Token or
Token ID393
Feature activation-9.85e-01

Pos logit contributions
eering+1.687
idge+1.661
IBLE+1.425
 nodd+1.395
yssey+1.390
Neg logit contributions
jud-1.163
andre-1.101
ca-1.068
imposed-1.020
traumatic-0.997
Token Sherlock
Token ID25730
Feature activation+2.66e-01

Pos logit contributions
jud+3.180
andre+3.026
ca+3.026
imposed+2.822
cas+2.772
Neg logit contributions
eering-3.547
idge-3.462
IBLE-2.967
rients-2.863
yssey-2.855
Token SAR
Token ID47341
Feature activation+5.74e-02

Pos logit contributions
jud+0.263
andre+0.251
ca+0.246
imposed+0.236
traumatic+0.232
Neg logit contributions
eering-0.251
idge-0.246
IBLE-0.204
 nodd-0.199
yssey-0.198
Token 0
Token ID657
Feature activation+1.81e-01

Pos logit contributions
eering+0.228
idge+0.224
IBLE+0.192
 nodd+0.187
yssey+0.187
Neg logit contributions
jud-0.170
andre-0.161
ca-0.158
imposed-0.150
traumatic-0.147
Token.
Token ID13
Feature activation+4.20e-01

Pos logit contributions
eering+0.668
idge+0.656
IBLE+0.554
 nodd+0.539
yssey+0.537
Neg logit contributions
jud-0.585
andre-0.557
ca-0.542
imposed-0.522
traumatic-0.511
Token1
Token ID16
Feature activation+5.42e-01

Pos logit contributions
eering+1.932
idge+1.906
IBLE+1.673
rients+1.627
yssey+1.624
Neg logit contributions
jud-0.975
andre-0.910
ca-0.867
imposed-0.836
traumatic-0.805
Token 3
Token ID513
Feature activation+6.45e-01

Pos logit contributions
eering+2.014
idge+1.980
IBLE+1.671
 nodd+1.629
rients+1.624
Neg logit contributions
jud-1.734
andre-1.654
ca-1.603
imposed-1.543
traumatic-1.513
Token Brune
Token ID46070
Feature activation+2.62e+00

Pos logit contributions
eering+2.409
idge+2.364
IBLE+2.008
rients+1.940
yssey+1.938
Neg logit contributions
jud-2.040
andre-1.948
ca-1.877
imposed-1.827
traumatic-1.790
Tokeni
Token ID72
Feature activation-7.37e-01

Pos logit contributions
idge+6.824
eering+6.595
rients+5.169
rient+5.144
IBLE+5.106
Neg logit contributions
jud-10.208
andre-10.080
imposed-9.591
traumatic-9.433
enforcement-9.068
Token Dar
Token ID7491
Feature activation-4.91e-01

Pos logit contributions
jud+2.389
andre+2.240
ca+2.215
imposed+2.092
traumatic+2.057
Neg logit contributions
eering-2.747
idge-2.692
IBLE-2.261
rients-2.215
yssey-2.195
Tokenuss
Token ID1046
Feature activation+4.58e-01

Pos logit contributions
jud+1.069
andre+1.004
ca+0.952
imposed+0.912
traumatic+0.882
Neg logit contributions
eering-2.319
idge-2.296
IBLE-2.013
yssey-1.980
rients-1.966
Tokenalam
Token ID44949
Feature activation-4.06e-01

Pos logit contributions
eering+1.765
idge+1.744
IBLE+1.478
yssey+1.437
rients+1.434
Neg logit contributions
jud-1.403
andre-1.333
ca-1.296
imposed-1.246
traumatic-1.219
Token 2
Token ID362
Feature activation-1.64e-01

Pos logit contributions
jud+1.169
andre+1.102
ca+1.086
imposed+1.025
traumatic+1.003
Neg logit contributions
eering-1.646
idge-1.618
IBLE-1.393
 nodd-1.365
yssey-1.356
Token you
Token ID345
Feature activation-6.43e-03

Pos logit contributions
jud+0.537
andre+0.506
ca+0.497
imposed+0.470
traumatic+0.454
Neg logit contributions
eering-0.796
idge-0.780
IBLE-0.670
yssey-0.659
rients-0.657
Token have
Token ID423
Feature activation-7.48e-01

Pos logit contributions
jud+0.021
andre+0.020
ca+0.020
imposed+0.019
traumatic+0.018
Neg logit contributions
eering-0.023
idge-0.023
IBLE-0.019
rients-0.019
 nodd-0.019
Token some
Token ID617
Feature activation+1.07e-01

Pos logit contributions
jud+2.271
andre+2.150
ca+2.119
imposed+1.984
traumatic+1.955
Neg logit contributions
eering-2.844
idge-2.810
IBLE-2.398
yssey-2.354
rients-2.347
Token metal
Token ID6147
Feature activation+2.36e-01

Pos logit contributions
eering+0.356
idge+0.350
IBLE+0.290
yssey+0.282
rients+0.281
Neg logit contributions
jud-0.384
andre-0.366
ca-0.360
imposed-0.344
traumatic-0.338
Token wire
Token ID6503
Feature activation-5.49e-01

Pos logit contributions
eering+0.891
idge+0.877
IBLE+0.742
yssey+0.727
 nodd+0.722
Neg logit contributions
jud-0.743
andre-0.707
ca-0.692
imposed-0.659
traumatic-0.646
Token laying
Token ID16299
Feature activation+2.62e+00

Pos logit contributions
jud+1.613
andre+1.539
ca+1.499
imposed+1.418
traumatic+1.381
Neg logit contributions
eering-2.168
idge-2.147
IBLE-1.824
yssey-1.804
 nodd-1.790
Token around
Token ID1088
Feature activation+7.53e-01

Pos logit contributions
eering+7.970
idge+7.583
IBLE+6.238
 nodd+5.980
rients+5.970
Neg logit contributions
jud-9.370
andre-9.114
ca-8.702
traumatic-8.562
imposed-8.556
Token)
Token ID8
Feature activation-4.51e-01

Pos logit contributions
eering+2.909
idge+2.864
IBLE+2.442
 nodd+2.377
yssey+2.364
Neg logit contributions
jud-2.298
andre-2.184
ca-2.122
imposed-2.039
traumatic-1.997
TokenIf
Token ID1532
Feature activation-9.11e-01

Pos logit contributions
jud+1.215
andre+1.144
ca+1.126
imposed+1.061
traumatic+1.020
Neg logit contributions
eering-1.899
idge-1.874
IBLE-1.618
 nodd-1.579
yssey-1.576
Token your
Token ID534
Feature activation-4.96e-02

Pos logit contributions
jud+2.374
andre+2.248
ca+2.213
imposed+2.060
activated+1.975
Neg logit contributions
eering-3.874
idge-3.775
IBLE-3.277
yssey-3.229
rients-3.214
Token c
Token ID269
Feature activation-5.96e-01

Pos logit contributions
jud+0.169
andre+0.160
ca+0.158
imposed+0.151
traumatic+0.147
Neg logit contributions
eering-0.174
idge-0.171
IBLE-0.144
rients-0.140
yssey-0.140
Token in
Token ID287
Feature activation+7.29e-01

Pos logit contributions
eering+1.266
idge+1.246
IBLE+1.068
 nodd+1.042
yssey+1.039
Neg logit contributions
jud-0.930
andre-0.882
ca-0.860
imposed-0.820
traumatic-0.801
Token Rub
Token ID6256
Feature activation+1.32e+00

Pos logit contributions
eering+2.663
idge+2.626
IBLE+2.209
 nodd+2.155
yssey+2.142
Neg logit contributions
jud-2.374
andre-2.254
ca-2.199
imposed-2.125
traumatic-2.081
Tokenen
Token ID268
Feature activation-9.39e-02

Pos logit contributions
eering+4.709
idge+4.699
IBLE+3.908
yssey+3.852
 nodd+3.782
Neg logit contributions
jud-4.346
andre-4.122
ca-3.996
imposed-3.927
traumatic-3.831
Token "
Token ID366
Feature activation-4.92e-01

Pos logit contributions
jud+0.334
andre+0.321
ca+0.314
imposed+0.302
traumatic+0.296
Neg logit contributions
eering-0.318
idge-0.310
IBLE-0.257
 nodd-0.251
yssey-0.249
TokenR
Token ID49
Feature activation+5.72e-01

Pos logit contributions
jud+1.275
andre+1.196
ca+1.171
imposed+1.092
traumatic+1.067
Neg logit contributions
eering-2.161
idge-2.107
IBLE-1.840
 nodd-1.821
rients-1.797
TokenUB
Token ID10526
Feature activation+2.62e+00

Pos logit contributions
eering+2.214
idge+2.169
IBLE+1.853
 nodd+1.811
rients+1.799
Neg logit contributions
jud-1.766
andre-1.678
ca-1.638
imposed-1.562
traumatic-1.525
TokenINO
Token ID46016
Feature activation+1.95e-01

Pos logit contributions
eering+9.543
idge+9.509
IBLE+8.252
yssey+7.833
ADRA+7.741
Neg logit contributions
jud-7.659
andre-7.404
imposed-6.949
ca-6.800
traumatic-6.798
Token"
Token ID1
Feature activation+8.73e-01

Pos logit contributions
eering+0.883
idge+0.872
IBLE+0.761
 nodd+0.745
yssey+0.743
Neg logit contributions
jud-0.470
andre-0.440
ca-0.425
imposed-0.402
traumatic-0.390
Token Vill
Token ID9757
Feature activation-3.90e-01

Pos logit contributions
eering+3.160
idge+3.112
IBLE+2.639
rients+2.536
 nodd+2.535
Neg logit contributions
jud-2.867
andre-2.714
ca-2.636
imposed-2.579
traumatic-2.522
Tokenarro
Token ID34852
Feature activation-1.38e-01

Pos logit contributions
jud+1.095
andre+1.037
ca+1.014
imposed+0.955
traumatic+0.926
Neg logit contributions
eering-1.626
idge-1.584
IBLE-1.382
 nodd-1.352
yssey-1.334
Tokenel
Token ID417
Feature activation-3.96e-01

Pos logit contributions
jud+0.346
andre+0.324
ca+0.314
imposed+0.298
traumatic+0.289
Neg logit contributions
eering-0.614
idge-0.605
IBLE-0.527
 nodd-0.516
yssey-0.514
Tokenac
Token ID330
Feature activation+8.88e-01

Pos logit contributions
eering+2.239
idge+2.192
IBLE+1.874
 nodd+1.831
yssey+1.821
Neg logit contributions
jud-1.781
andre-1.691
ca-1.650
imposed-1.575
traumatic-1.541
Tokenism
Token ID1042
Feature activation-2.06e-01

Pos logit contributions
eering+4.025
idge+3.976
IBLE+3.441
 nodd+3.379
yssey+3.379
Neg logit contributions
jud-2.088
andre-1.957
ca-1.882
imposed-1.803
traumatic-1.749
Token Vs
Token ID31280
Feature activation+7.74e-02

Pos logit contributions
jud+0.615
andre+0.585
ca+0.575
imposed+0.539
traumatic+0.530
Neg logit contributions
eering-0.803
idge-0.796
IBLE-0.680
 nodd-0.675
yssey-0.668
Token Xbox
Token ID9445
Feature activation-6.30e-02

Pos logit contributions
eering+0.283
idge+0.279
IBLE+0.235
 nodd+0.229
yssey+0.229
Neg logit contributions
jud-0.253
andre-0.242
ca-0.236
imposed-0.226
traumatic-0.222
Token-
Token ID12
Feature activation+2.75e-01

Pos logit contributions
jud+0.190
andre+0.180
ca+0.176
imposed+0.167
traumatic+0.164
Neg logit contributions
eering-0.246
idge-0.243
IBLE-0.209
 nodd-0.205
rients-0.203
TokenLou
Token ID24016
Feature activation+2.62e+00

Pos logit contributions
eering+1.179
idge+1.162
IBLE+1.007
 nodd+0.981
yssey+0.980
Neg logit contributions
jud-0.724
andre-0.682
ca-0.661
imposed-0.630
traumatic-0.613
Token C
Token ID327
Feature activation+1.35e+00

Pos logit contributions
eering+8.280
idge+8.167
yssey+6.584
rients+6.558
IBLE+6.471
Neg logit contributions
jud-8.888
andre-8.525
imposed-8.300
traumatic-7.979
enforcement-7.941
Tokenk
Token ID74
Feature activation-6.03e-02

Pos logit contributions
eering+5.252
idge+5.214
IBLE+4.383
yssey+4.260
rients+4.259
Neg logit contributions
jud-3.980
andre-3.765
ca-3.638
imposed-3.578
traumatic-3.493
Token Bob
Token ID5811
Feature activation+1.22e+00

Pos logit contributions
jud+0.181
andre+0.173
ca+0.168
imposed+0.158
traumatic+0.155
Neg logit contributions
eering-0.237
idge-0.233
IBLE-0.202
 nodd-0.197
rients-0.195
Token &
Token ID1222
Feature activation+1.11e+00

Pos logit contributions
eering+4.819
idge+4.779
IBLE+4.001
yssey+3.936
rients+3.926
Neg logit contributions
jud-3.537
andre-3.347
ca-3.155
imposed-3.126
traumatic-3.029
Token Jimmy
Token ID12963
Feature activation+1.62e-01

Pos logit contributions
eering+4.033
idge+3.980
IBLE+3.328
 nodd+3.228
yssey+3.223
Neg logit contributions
jud-3.634
andre-3.455
ca-3.343
imposed-3.286
traumatic-3.191
Token who
Token ID508
Feature activation-3.81e-01

Pos logit contributions
jud+1.516
andre+1.425
ca+1.396
imposed+1.330
traumatic+1.296
Neg logit contributions
eering-2.055
idge-2.014
IBLE-1.749
rients-1.704
 nodd-1.699
Token qu
Token ID627
Feature activation-1.91e-01

Pos logit contributions
jud+1.263
andre+1.209
ca+1.180
imposed+1.124
traumatic+1.103
Neg logit contributions
eering-1.378
idge-1.349
IBLE-1.150
 nodd-1.115
rients-1.100
Tokenits
Token ID896
Feature activation-5.95e-01

Pos logit contributions
jud+0.579
andre+0.551
ca+0.533
imposed+0.508
traumatic+0.494
Neg logit contributions
eering-0.752
idge-0.729
IBLE-0.637
 nodd-0.626
rients-0.608
Token his
Token ID465
Feature activation-4.85e-01

Pos logit contributions
jud+1.943
andre+1.824
ca+1.813
imposed+1.734
traumatic+1.686
Neg logit contributions
eering-2.172
idge-2.150
IBLE-1.815
yssey-1.772
rients-1.764
Token pro
Token ID386
Feature activation-8.88e-01

Pos logit contributions
jud+1.577
andre+1.486
ca+1.464
imposed+1.422
traumatic+1.359
Neg logit contributions
eering-1.759
idge-1.745
IBLE-1.493
rients-1.448
yip-1.434
Tokendig
Token ID12894
Feature activation+2.61e+00

Pos logit contributions
jud+2.286
andre+2.139
ca+2.100
imposed+1.961
cas+1.889
Neg logit contributions
eering-3.853
idge-3.785
IBLE-3.354
 nodd-3.275
yssey-3.232
Tokeniously
Token ID6819
Feature activation-1.26e-01

Pos logit contributions
eering+8.869
idge+8.549
yssey+7.061
IBLE+6.941
rients+6.766
Neg logit contributions
jud-8.478
andre-8.112
traumatic-7.741
ca-7.713
imposed-7.617
Token paying
Token ID5989
Feature activation+8.49e-01

Pos logit contributions
jud+0.395
andre+0.372
ca+0.364
imposed+0.352
traumatic+0.341
Neg logit contributions
eering-0.477
idge-0.473
IBLE-0.400
rients-0.390
 nodd-0.389
Token job
Token ID1693
Feature activation-8.84e-01

Pos logit contributions
eering+3.054
idge+2.992
IBLE+2.514
 nodd+2.448
yssey+2.435
Neg logit contributions
jud-2.817
andre-2.697
ca-2.617
imposed-2.530
traumatic-2.483
Token to
Token ID284
Feature activation+1.99e-01

Pos logit contributions
jud+2.335
andre+2.168
ca+2.138
imposed+2.008
traumatic+1.950
Neg logit contributions
eering-3.723
idge-3.722
IBLE-3.255
 nodd-3.168
yssey-3.164
Token pursue
Token ID10660
Feature activation+5.55e-01

Pos logit contributions
eering+0.635
idge+0.624
IBLE+0.513
 nodd+0.496
yssey+0.493
Neg logit contributions
jud-0.743
andre-0.711
ca-0.698
imposed-0.673
traumatic-0.659
Token the
Token ID262
Feature activation-2.97e-01

Pos logit contributions
jud+1.469
andre+1.386
ca+1.357
imposed+1.305
traumatic+1.248
Neg logit contributions
eering-1.863
idge-1.835
IBLE-1.558
yssey-1.542
 nodd-1.527
Token growing
Token ID3957
Feature activation+1.60e+00

Pos logit contributions
jud+0.972
andre+0.923
ca+0.904
imposed+0.876
traumatic+0.847
Neg logit contributions
eering-1.079
idge-1.066
IBLE-0.901
yssey-0.879
rients-0.876
Token number
Token ID1271
Feature activation-5.20e-01

Pos logit contributions
eering+5.497
idge+5.364
IBLE+4.467
 nodd+4.385
rients+4.270
Neg logit contributions
jud-5.417
andre-5.154
ca-5.009
imposed-4.808
traumatic-4.781
Token of
Token ID286
Feature activation-3.87e-01

Pos logit contributions
jud+1.395
andre+1.316
ca+1.283
imposed+1.221
traumatic+1.186
Neg logit contributions
eering-2.206
idge-2.173
IBLE-1.880
yssey-1.836
 nodd-1.831
Token r
Token ID374
Feature activation+2.96e-01

Pos logit contributions
jud+1.276
andre+1.210
ca+1.200
imposed+1.147
traumatic+1.111
Neg logit contributions
eering-1.392
idge-1.379
IBLE-1.148
yssey-1.134
rients-1.123
Tokenumb
Token ID2178
Feature activation+2.61e+00

Pos logit contributions
eering+1.099
idge+1.077
IBLE+0.911
 nodd+0.890
yssey+0.884
Neg logit contributions
jud-0.960
andre-0.913
ca-0.891
imposed-0.854
traumatic-0.834
Tokenlings
Token ID17783
Feature activation+8.45e-01

Pos logit contributions
eering+4.238
idge+4.158
yssey+2.315
IBLE+2.311
 nodd+2.194
Neg logit contributions
jud-13.398
andre-13.121
ca-12.666
imposed-12.588
traumatic-12.479
Token,
Token ID11
Feature activation-2.25e-01

Pos logit contributions
eering+3.224
idge+3.133
IBLE+2.649
 nodd+2.596
yssey+2.569
Neg logit contributions
jud-2.623
andre-2.514
ca-2.409
imposed-2.323
traumatic-2.290
Token and
Token ID290
Feature activation+1.26e-01

Pos logit contributions
jud+0.638
andre+0.598
ca+0.588
imposed+0.563
traumatic+0.540
Neg logit contributions
eering-0.921
idge-0.910
IBLE-0.781
yssey-0.770
rients-0.765
Token the
Token ID262
Feature activation-2.15e-01

Pos logit contributions
eering+0.482
idge+0.474
IBLE+0.402
yssey+0.392
 nodd+0.392
Neg logit contributions
jud-0.395
andre-0.376
ca-0.367
imposed-0.353
traumatic-0.343
Token prag
Token ID23864
Feature activation-3.64e-01

Pos logit contributions
jud+0.672
andre+0.635
ca+0.623
imposed+0.602
traumatic+0.581
Neg logit contributions
eering-0.813
idge-0.804
IBLE-0.681
rients-0.667
yssey-0.667
Token mimic
Token ID26332
Feature activation+1.13e-01

Pos logit contributions
eering+0.862
idge+0.849
IBLE+0.714
rients+0.689
 nodd+0.689
Neg logit contributions
jud-0.842
andre-0.799
ca-0.788
imposed-0.752
traumatic-0.736
Token that
Token ID326
Feature activation-3.87e-01

Pos logit contributions
eering+0.424
idge+0.418
IBLE+0.354
yssey+0.347
 nodd+0.344
Neg logit contributions
jud-0.359
andre-0.340
ca-0.334
imposed-0.320
traumatic-0.310
Token pattern
Token ID3912
Feature activation-2.20e-01

Pos logit contributions
jud+1.261
andre+1.183
ca+1.172
imposed+1.126
traumatic+1.091
Neg logit contributions
eering-1.420
idge-1.391
IBLE-1.181
rients-1.162
yssey-1.151
Token.
Token ID13
Feature activation+1.57e-01

Pos logit contributions
jud+0.568
andre+0.532
ca+0.522
imposed+0.492
traumatic+0.477
Neg logit contributions
eering-0.949
idge-0.944
IBLE-0.820
 nodd-0.803
rients-0.802
Token When
Token ID1649
Feature activation-2.69e-01

Pos logit contributions
eering+0.559
idge+0.551
IBLE+0.463
 nodd+0.453
yssey+0.450
Neg logit contributions
jud-0.530
andre-0.504
ca-0.498
imposed-0.473
traumatic-0.465
Token finished
Token ID5201
Feature activation-2.88e+00

Pos logit contributions
jud+0.772
andre+0.724
ca+0.717
imposed+0.681
traumatic+0.659
Neg logit contributions
eering-1.086
idge-1.067
IBLE-0.917
yssey-0.901
rients-0.900
Token tie
Token ID9839
Feature activation-9.73e-01

Pos logit contributions
jud+7.164
ca+6.979
cas+6.305
andre+6.295
imposed+6.251
Neg logit contributions
eering-10.767
idge-10.627
IBLE-9.690
yssey-9.510
rients-9.485
Token a
Token ID257
Feature activation-4.63e-02

Pos logit contributions
jud+2.935
andre+2.721
ca+2.695
imposed+2.550
traumatic+2.486
Neg logit contributions
eering-3.736
idge-3.666
IBLE-3.172
yssey-3.113
rients-3.089
Token knot
Token ID29654
Feature activation-9.30e-02

Pos logit contributions
jud+0.168
andre+0.159
ca+0.158
imposed+0.151
traumatic+0.148
Neg logit contributions
eering-0.153
idge-0.150
IBLE-0.125
rients-0.122
yssey-0.121
Token on
Token ID319
Feature activation+3.56e-02

Pos logit contributions
jud+0.261
andre+0.244
ca+0.241
imposed+0.227
traumatic+0.222
Neg logit contributions
eering-0.385
idge-0.378
IBLE-0.327
yssey-0.319
 nodd-0.318
Token the
Token ID262
Feature activation+1.89e-01

Pos logit contributions
eering+0.136
idge+0.133
IBLE+0.114
yssey+0.111
rients+0.111
Neg logit contributions
jud-0.111
andre-0.105
ca-0.103
imposed-0.098
traumatic-0.096
Token this
Token ID428
Feature activation+2.82e-02

Pos logit contributions
jud+1.413
andre+1.340
ca+1.306
imposed+1.241
traumatic+1.212
Neg logit contributions
eering-1.912
idge-1.877
IBLE-1.611
 nodd-1.572
rients-1.569
Token girl
Token ID2576
Feature activation-8.63e-01

Pos logit contributions
eering+0.113
idge+0.110
IBLE+0.095
 nodd+0.092
rients+0.092
Neg logit contributions
jud-0.083
andre-0.079
ca-0.077
imposed-0.073
traumatic-0.072
Token who
Token ID508
Feature activation-6.49e-01

Pos logit contributions
jud+2.454
andre+2.349
ca+2.300
imposed+2.153
traumatic+2.115
Neg logit contributions
eering-3.478
idge-3.374
IBLE-2.954
rients-2.874
 nodd-2.858
Token has
Token ID468
Feature activation-8.83e-01

Pos logit contributions
jud+2.036
andre+1.965
ca+1.918
imposed+1.812
traumatic+1.775
Neg logit contributions
eering-2.441
idge-2.374
IBLE-2.045
 nodd-1.989
rients-1.968
Token won
Token ID1839
Feature activation-1.72e+00

Pos logit contributions
jud+2.604
andre+2.469
ca+2.430
imposed+2.294
traumatic+2.264
Neg logit contributions
eering-3.482
idge-3.413
IBLE-2.952
rients-2.849
 nodd-2.848
Token Miss
Token ID4544
Feature activation-2.87e+00

Pos logit contributions
jud+5.018
andre+4.711
ca+4.697
imposed+4.353
cas+4.281
Neg logit contributions
eering-6.627
idge-6.507
IBLE-5.615
yssey-5.502
rients-5.437
Token Chinatown
Token ID47043
Feature activation+5.51e-01

Pos logit contributions
jud+8.111
ca+7.583
andre+7.341
imposed+6.801
ippi+6.745
Neg logit contributions
eering-10.799
idge-10.413
 nodd-9.213
 rall-8.906
yip-8.809
Token;
Token ID26
Feature activation-4.41e-01

Pos logit contributions
eering+2.226
idge+2.185
IBLE+1.866
 nodd+1.830
yssey+1.825
Neg logit contributions
jud-1.583
andre-1.509
ca-1.456
imposed-1.405
traumatic-1.370
Token she
Token ID673
Feature activation-2.92e-01

Pos logit contributions
jud+1.306
andre+1.240
ca+1.212
imposed+1.148
traumatic+1.120
Neg logit contributions
eering-1.755
idge-1.721
IBLE-1.476
 nodd-1.443
rients-1.440
Token
Token ID447
Feature activation-1.79e-01

Pos logit contributions
jud+0.909
andre+0.868
ca+0.845
imposed+0.803
traumatic+0.785
Neg logit contributions
eering-1.119
idge-1.093
IBLE-0.939
 nodd-0.912
rients-0.905
Token
Token ID247
Feature activation+8.83e-02

Pos logit contributions
jud+0.569
andre+0.541
ca+0.529
imposed+0.505
traumatic+0.496
Neg logit contributions
eering-0.671
idge-0.657
IBLE-0.558
 nodd-0.547
yssey-0.542
Token LO
Token ID17579
Feature activation+3.60e-01

Pos logit contributions
eering+0.443
idge+0.436
IBLE+0.374
 nodd+0.366
yssey+0.365
Neg logit contributions
jud-0.313
andre-0.297
ca-0.289
imposed-0.275
traumatic-0.269
TokenEB
Token ID30195
Feature activation+4.39e-01

Pos logit contributions
eering+1.489
idge+1.466
IBLE+1.265
 nodd+1.231
yssey+1.229
Neg logit contributions
jud-1.005
andre-0.950
ca-0.922
imposed-0.881
traumatic-0.858
Token/
Token ID14
Feature activation-1.17e+00

Pos logit contributions
eering+1.799
idge+1.774
IBLE+1.531
yssey+1.481
 nodd+1.479
Neg logit contributions
jud-1.249
andre-1.181
ca-1.138
imposed-1.097
traumatic-1.071
TokenAFP
Token ID17449
Feature activation-1.04e+00

Pos logit contributions
jud+2.369
andre+2.218
ca+2.115
imposed+1.957
rh+1.798
Neg logit contributions
eering-5.700
idge-5.633
 nodd-4.962
IBLE-4.957
rients-4.910
Token/
Token ID14
Feature activation-1.49e+00

Pos logit contributions
jud+2.020
andre+1.877
ca+1.839
imposed+1.653
traumatic+1.618
Neg logit contributions
eering-5.163
idge-5.114
 nodd-4.457
IBLE-4.454
rients-4.418
TokenGetty
Token ID6633
Feature activation-2.78e+00

Pos logit contributions
jud+2.771
andre+2.548
ca+2.527
imposed+2.215
rh+2.069
Neg logit contributions
eering-7.519
idge-7.455
 nodd-6.707
rients-6.475
yssey-6.469
Token Images
Token ID5382
Feature activation-1.25e+00

Pos logit contributions
jud+5.521
andre+5.166
ca+5.015
imposed+4.611
traumatic+4.475
Neg logit contributions
eering-12.808
idge-12.602
yip-11.101
yssey-11.064
 nodd-10.980
Token ORIG
Token ID43901
Feature activation-3.01e-01

Pos logit contributions
jud+3.380
andre+3.217
ca+3.148
imposed+2.904
traumatic+2.807
Neg logit contributions
eering-5.161
idge-5.104
yssey-4.403
 nodd-4.385
IBLE-4.356
Token FILE
Token ID45811
Feature activation-7.46e-01

Pos logit contributions
jud+1.295
andre+1.266
ca+1.242
imposed+1.205
traumatic+1.182
Neg logit contributions
eering-0.785
idge-0.763
 nodd-0.573
yssey-0.569
rients-0.568
Token ID
Token ID4522
Feature activation-3.87e-01

Pos logit contributions
jud+2.222
andre+2.109
ca+2.087
imposed+1.946
traumatic+1.919
Neg logit contributions
eering-2.910
idge-2.858
IBLE-2.449
 nodd-2.406
yssey-2.400
Token:
Token ID25
Feature activation-9.28e-02

Pos logit contributions
jud+1.067
andre+1.016
ca+1.006
imposed+0.933
traumatic+0.912
Neg logit contributions
eering-1.599
idge-1.571
IBLE-1.340
 nodd-1.338
rients-1.331
Token Mellon
Token ID49808
Feature activation-2.47e+00

Pos logit contributions
jud+1.875
andre+1.778
ca+1.732
imposed+1.621
traumatic+1.546
Neg logit contributions
eering-3.506
idge-3.360
IBLE-2.977
rients-2.951
 nodd-2.943
Token University
Token ID2059
Feature activation-9.56e-01

Pos logit contributions
jud+6.609
ca+6.276
andre+6.158
imposed+5.728
traumatic+5.559
Neg logit contributions
eering-9.760
idge-9.387
IBLE-8.352
yip-8.280
rients-8.278
Token in
Token ID287
Feature activation-9.73e-01

Pos logit contributions
jud+2.569
andre+2.421
ca+2.383
imposed+2.243
traumatic+2.146
Neg logit contributions
eering-4.017
idge-3.914
IBLE-3.436
 nodd-3.376
rients-3.355
Token Pittsburgh
Token ID12767
Feature activation-1.23e+00

Pos logit contributions
jud+2.205
ca+2.087
andre+2.028
imposed+1.868
activated+1.847
Neg logit contributions
eering-4.492
idge-4.426
IBLE-3.929
rients-3.907
 nodd-3.897
Token,
Token ID11
Feature activation-1.46e+00

Pos logit contributions
jud+3.088
andre+2.913
ca+2.902
imposed+2.691
traumatic+2.585
Neg logit contributions
eering-5.338
idge-5.175
IBLE-4.580
 nodd-4.580
rients-4.502
Token Pennsylvania
Token ID9589
Feature activation-2.76e+00

Pos logit contributions
jud+3.995
ca+3.890
andre+3.706
imposed+3.635
activated+3.387
Neg logit contributions
eering-5.916
idge-5.706
 nodd-4.946
IBLE-4.946
yip-4.944
Token,
Token ID11
Feature activation-9.89e-01

Pos logit contributions
jud+5.428
andre+5.148
ca+5.078
imposed+4.514
traumatic+4.271
Neg logit contributions
eering-12.623
idge-12.056
 nodd-11.043
yip-10.877
IBLE-10.849
Token who
Token ID508
Feature activation-4.60e-01

Pos logit contributions
jud+2.929
andre+2.787
ca+2.767
imposed+2.623
traumatic+2.494
Neg logit contributions
eering-3.894
idge-3.764
IBLE-3.271
 nodd-3.203
rients-3.194
Token is
Token ID318
Feature activation-1.57e-01

Pos logit contributions
jud+1.381
andre+1.301
ca+1.277
imposed+1.218
traumatic+1.168
Neg logit contributions
eering-1.811
idge-1.778
IBLE-1.541
 nodd-1.498
rients-1.481
Token a
Token ID257
Feature activation-2.92e-02

Pos logit contributions
jud+0.506
andre+0.479
ca+0.470
imposed+0.451
traumatic+0.436
Neg logit contributions
eering-0.585
idge-0.576
IBLE-0.488
 nodd-0.474
rients-0.474
Token co
Token ID763
Feature activation-1.08e+00

Pos logit contributions
jud+0.104
andre+0.099
ca+0.097
imposed+0.093
traumatic+0.090
Neg logit contributions
eering-0.099
idge-0.098
IBLE-0.082
 nodd-0.079
rients-0.079
Token mysteries
Token ID26045
Feature activation-4.37e-01

Pos logit contributions
eering+1.590
idge+1.562
IBLE+1.329
 nodd+1.296
yssey+1.290
Neg logit contributions
jud-1.257
andre-1.199
ca-1.161
imposed-1.117
traumatic-1.088
Token in
Token ID287
Feature activation+9.66e-02

Pos logit contributions
jud+1.197
andre+1.118
ca+1.105
imposed+1.042
traumatic+1.018
Neg logit contributions
eering-1.813
idge-1.802
IBLE-1.556
 nodd-1.528
rients-1.522
Token the
Token ID262
Feature activation-2.53e-01

Pos logit contributions
eering+0.367
idge+0.361
IBLE+0.307
 nodd+0.299
yssey+0.298
Neg logit contributions
jud-0.304
andre-0.288
ca-0.282
imposed-0.269
traumatic-0.264
Token history
Token ID2106
Feature activation-7.57e-02

Pos logit contributions
jud+0.830
andre+0.786
ca+0.771
imposed+0.736
traumatic+0.725
Neg logit contributions
eering-0.929
idge-0.916
IBLE-0.774
 nodd-0.752
rients-0.751
Token of
Token ID286
Feature activation-4.39e-01

Pos logit contributions
jud+0.220
andre+0.211
ca+0.201
imposed+0.196
traumatic+0.192
Neg logit contributions
eering-0.306
idge-0.300
IBLE-0.254
yssey-0.248
rients-0.247
Token Herm
Token ID18113
Feature activation-2.76e+00

Pos logit contributions
jud+1.358
andre+1.279
ca+1.256
imposed+1.190
traumatic+1.165
Neg logit contributions
eering-1.688
idge-1.667
IBLE-1.419
rients-1.387
yssey-1.387
Tokenosa
Token ID8546
Feature activation-1.90e+00

Pos logit contributions
jud+3.779
andre+3.531
ca+3.421
cas+2.794
ippi+2.658
Neg logit contributions
eering-14.660
idge-14.022
 nodd-13.548
 rall-13.247
IBLE-13.043
Token Beach
Token ID8511
Feature activation-1.09e+00

Pos logit contributions
jud+4.751
ca+4.359
andre+4.300
qs+3.869
traumatic+3.843
Neg logit contributions
eering-8.033
idge-7.938
 nodd-7.091
IBLE-6.991
yip-6.894
Token,
Token ID11
Feature activation-5.50e-01

Pos logit contributions
jud+3.072
andre+2.845
ca+2.841
traumatic+2.635
imposed+2.623
Neg logit contributions
eering-4.417
idge-4.390
IBLE-3.772
rients-3.711
 nodd-3.690
Token officials
Token ID2828
Feature activation-7.22e-01

Pos logit contributions
jud+1.712
andre+1.603
ca+1.590
imposed+1.511
traumatic+1.476
Neg logit contributions
eering-2.103
idge-2.085
IBLE-1.765
rients-1.740
yssey-1.724
Token announced
Token ID3414
Feature activation+1.93e-02

Pos logit contributions
jud+2.293
andre+2.151
ca+2.131
imposed+2.028
traumatic+1.968
Neg logit contributions
eering-2.684
idge-2.662
IBLE-2.268
rients-2.244
 nodd-2.200
Token dominant
Token ID11410
Feature activation-6.94e-01

Pos logit contributions
jud+1.313
andre+1.257
ca+1.252
imposed+1.204
traumatic+1.186
Neg logit contributions
eering-0.877
idge-0.858
IBLE-0.680
rients-0.659
 nodd-0.656
Token palm
Token ID18057
Feature activation+4.25e-01

Pos logit contributions
jud+2.169
ca+2.054
andre+2.042
imposed+1.909
traumatic+1.893
Neg logit contributions
eering-2.623
idge-2.593
IBLE-2.193
yssey-2.147
 nodd-2.132
Tokenopl
Token ID20106
Feature activation-1.26e+00

Pos logit contributions
eering+1.682
idge+1.651
IBLE+1.417
 nodd+1.378
yssey+1.377
Neg logit contributions
jud-1.277
andre-1.210
ca-1.180
imposed-1.122
traumatic-1.103
Tokenant
Token ID415
Feature activation-1.13e-01

Pos logit contributions
jud+3.373
ca+3.271
andre+3.257
imposed+2.932
cas+2.901
Neg logit contributions
eering-5.255
idge-4.897
 nodd-4.471
IBLE-4.442
 rall-4.303
Tokenar
Token ID283
Feature activation-4.92e-01

Pos logit contributions
jud+0.334
ca+0.313
andre+0.312
imposed+0.294
traumatic+0.289
Neg logit contributions
eering-0.451
idge-0.436
IBLE-0.381
 nodd-0.378
rients-0.368
Token ker
Token ID41927
Feature activation-2.73e+00

Pos logit contributions
jud+1.466
ca+1.381
andre+1.378
imposed+1.294
traumatic+1.281
Neg logit contributions
eering-1.969
idge-1.881
IBLE-1.648
 nodd-1.596
yssey-1.590
Tokenat
Token ID265
Feature activation-2.63e-01

Pos logit contributions
jud+6.228
ca+5.988
andre+5.590
traumatic+5.355
cas+5.222
Neg logit contributions
eering-12.201
idge-11.429
IBLE-10.616
 nodd-10.569
 rall-10.197
Tokenoder
Token ID12342
Feature activation-9.58e-01

Pos logit contributions
jud+0.677
andre+0.633
ca+0.621
imposed+0.579
traumatic+0.570
Neg logit contributions
eering-1.147
idge-1.128
IBLE-0.996
 nodd-0.964
yssey-0.962
Tokenma
Token ID2611
Feature activation-1.37e+00

Pos logit contributions
jud+1.965
ca+1.814
andre+1.724
imposed+1.610
rh+1.597
Neg logit contributions
eering-4.734
idge-4.525
IBLE-4.075
 nodd-4.057
yssey-3.995
Token with
Token ID351
Feature activation-6.24e-01

Pos logit contributions
jud+3.649
ca+3.463
andre+3.406
imposed+3.150
traumatic+3.139
Neg logit contributions
eering-5.655
idge-5.541
IBLE-4.846
yssey-4.787
rients-4.768
Token variable
Token ID7885
Feature activation-5.04e-01

Pos logit contributions
jud+1.939
andre+1.841
ca+1.829
imposed+1.746
traumatic+1.716
Neg logit contributions
eering-2.365
idge-2.317
IBLE-1.979
yssey-1.932
rients-1.913
Token<|endoftext|>
Token ID50256
Feature activation-4.82e-01

Pos logit contributions
jud+2.708
ca+2.545
andre+2.500
imposed+2.270
traumatic+2.209
Neg logit contributions
eering-5.107
idge-4.986
 nodd-4.389
IBLE-4.301
yssey-4.295
TokenDonald
Token ID7371
Feature activation+6.70e-01

Pos logit contributions
jud+1.305
andre+1.225
ca+1.200
imposed+1.127
traumatic+1.102
Neg logit contributions
eering-2.053
idge-2.016
 nodd-1.741
IBLE-1.737
rients-1.705
Token Mir
Token ID7381
Feature activation+8.35e-01

Pos logit contributions
eering+2.063
idge+2.012
IBLE+1.624
 nodd+1.577
rients+1.567
Neg logit contributions
jud-2.570
andre-2.466
ca-2.420
imposed-2.350
traumatic-2.307
Tokenalle
Token ID6765
Feature activation+8.14e-01

Pos logit contributions
eering+3.599
idge+3.559
IBLE+3.068
rients+2.997
yssey+2.997
Neg logit contributions
jud-2.171
andre-2.041
ca-1.972
imposed-1.888
traumatic-1.840
Token/
Token ID14
Feature activation-1.37e+00

Pos logit contributions
eering+3.312
idge+3.269
IBLE+2.784
rients+2.727
 nodd+2.726
Neg logit contributions
jud-2.310
andre-2.184
ca-2.101
imposed-2.036
traumatic-1.990
TokenGetty
Token ID6633
Feature activation-2.71e+00

Pos logit contributions
jud+4.120
andre+3.859
ca+3.857
imposed+3.575
cas+3.442
Neg logit contributions
eering-5.358
idge-5.185
 nodd-4.443
IBLE-4.423
rients-4.383
Token Images
Token ID5382
Feature activation-1.23e+00

Pos logit contributions
jud+5.568
ca+5.185
andre+5.124
imposed+4.630
traumatic+4.562
Neg logit contributions
eering-11.937
idge-11.668
 nodd-10.476
yssey-10.443
yip-10.323
Token\n
Token ID198
Feature activation-6.77e-02

Pos logit contributions
jud+3.386
andre+3.170
ca+3.148
imposed+2.930
traumatic+2.863
Neg logit contributions
eering-5.012
idge-4.934
 nodd-4.276
IBLE-4.261
rients-4.235
Token\n
Token ID198
Feature activation-7.37e-02

Pos logit contributions
jud+0.209
andre+0.200
ca+0.190
imposed+0.188
traumatic+0.183
Neg logit contributions
eering-0.259
idge-0.256
IBLE-0.217
rients-0.207
yssey-0.207
TokenThe
Token ID464
Feature activation-1.57e-01

Pos logit contributions
jud+0.201
andre+0.189
ca+0.184
imposed+0.175
traumatic+0.171
Neg logit contributions
eering-0.311
idge-0.306
IBLE-0.264
 nodd-0.259
yssey-0.258
Token latest
Token ID3452
Feature activation-3.28e-01

Pos logit contributions
jud+0.534
andre+0.508
ca+0.495
imposed+0.473
traumatic+0.467
Neg logit contributions
eering-0.559
idge-0.550
IBLE-0.462
 nodd-0.450
rients-0.449
Token ker
Token ID41927
Feature activation-2.25e+00

Pos logit contributions
jud+2.384
ca+2.261
andre+2.260
imposed+2.106
traumatic+2.095
Neg logit contributions
eering-3.139
idge-3.081
IBLE-2.654
yssey-2.632
rients-2.608
Tokenat
Token ID265
Feature activation+3.36e-01

Pos logit contributions
jud+4.388
ca+4.269
andre+4.123
traumatic+3.708
cas+3.632
Neg logit contributions
eering-10.683
idge-10.261
IBLE-9.714
 nodd-9.696
 rall-9.442
Tokenoder
Token ID12342
Feature activation-1.23e-01

Pos logit contributions
eering+1.380
idge+1.360
IBLE+1.167
 nodd+1.141
yssey+1.138
Neg logit contributions
jud-0.948
andre-0.896
ca-0.870
imposed-0.832
traumatic-0.810
Tokenma
Token ID2611
Feature activation-7.83e-01

Pos logit contributions
jud+0.259
andre+0.238
ca+0.236
imposed+0.218
traumatic+0.210
Neg logit contributions
eering-0.596
idge-0.585
IBLE-0.520
 nodd-0.511
rients-0.507
Token,
Token ID11
Feature activation+6.79e-02

Pos logit contributions
jud+2.155
ca+2.032
andre+2.002
imposed+1.896
traumatic+1.844
Neg logit contributions
eering-3.237
idge-3.184
IBLE-2.787
yssey-2.733
 nodd-2.729
Token inherited
Token ID19552
Feature activation-2.69e+00

Pos logit contributions
eering+0.272
idge+0.267
IBLE+0.229
 nodd+0.224
yssey+0.224
Neg logit contributions
jud-0.200
andre-0.188
ca-0.187
imposed-0.177
traumatic-0.173
Token as
Token ID355
Feature activation+2.12e-02

Pos logit contributions
jud+6.717
ca+6.347
andre+6.035
imposed+5.736
traumatic+5.573
Neg logit contributions
eering-10.858
idge-10.810
IBLE-9.599
yssey-9.529
 nodd-9.353
Token an
Token ID281
Feature activation-4.55e-01

Pos logit contributions
eering+0.081
idge+0.079
IBLE+0.067
yssey+0.066
 nodd+0.066
Neg logit contributions
jud-0.066
andre-0.063
ca-0.062
imposed-0.059
traumatic-0.058
Token autos
Token ID44619
Feature activation-3.40e-01

Pos logit contributions
jud+1.455
andre+1.391
ca+1.377
imposed+1.307
traumatic+1.282
Neg logit contributions
eering-1.700
idge-1.654
IBLE-1.412
yssey-1.377
 nodd-1.376
Tokenomal
Token ID18048
Feature activation+3.03e-01

Pos logit contributions
jud+1.060
andre+1.007
ca+0.982
imposed+0.942
traumatic+0.921
Neg logit contributions
eering-1.298
idge-1.278
IBLE-1.084
yssey-1.054
 nodd-1.053
Token dominant
Token ID11410
Feature activation-5.27e-01

Pos logit contributions
eering+0.778
idge+0.757
IBLE+0.585
 nodd+0.567
yssey+0.556
Neg logit contributions
jud-1.321
andre-1.272
ca-1.246
imposed-1.211
traumatic-1.195
Token Ke
Token ID3873
Feature activation-1.40e+00

Pos logit contributions
jud+2.341
andre+2.211
ca+2.164
imposed+2.053
traumatic+2.021
Neg logit contributions
eering-2.761
idge-2.731
IBLE-2.338
rients-2.280
yssey-2.276
Tokenans
Token ID504
Feature activation-5.49e-01

Pos logit contributions
jud+5.062
andre+4.984
ca+4.816
imposed+4.590
cas+4.480
Neg logit contributions
eering-4.507
idge-4.339
 nodd-3.877
IBLE-3.781
 rall-3.623
Tokenburg
Token ID7423
Feature activation-9.38e-01

Pos logit contributions
jud+1.153
andre+1.063
ca+1.055
imposed+0.958
traumatic+0.926
Neg logit contributions
eering-2.653
idge-2.600
IBLE-2.318
 nodd-2.285
rients-2.274
Token,
Token ID11
Feature activation-1.08e+00

Pos logit contributions
jud+2.539
andre+2.354
ca+2.351
imposed+2.191
traumatic+2.129
Neg logit contributions
eering-3.943
idge-3.865
IBLE-3.415
 nodd-3.322
rients-3.304
Token New
Token ID968
Feature activation-2.01e+00

Pos logit contributions
jud+3.450
ca+3.250
andre+3.195
imposed+3.081
traumatic+2.958
Neg logit contributions
idge-3.982
eering-3.952
IBLE-3.330
rients-3.323
yssey-3.303
Token Jersey
Token ID8221
Feature activation-2.67e+00

Pos logit contributions
jud+4.043
andre+3.698
ca+3.693
imposed+3.326
neg+3.186
Neg logit contributions
eering-9.360
idge-9.356
IBLE-8.486
ADRA-8.429
 nodd-8.333
Token,
Token ID11
Feature activation-9.78e-01

Pos logit contributions
jud+6.395
ca+5.883
andre+5.628
imposed+5.256
traumatic+5.095
Neg logit contributions
eering-11.259
idge-10.992
IBLE-9.992
 nodd-9.772
rients-9.718
Token following
Token ID1708
Feature activation-5.33e-02

Pos logit contributions
jud+2.760
ca+2.592
andre+2.563
imposed+2.416
traumatic+2.332
Neg logit contributions
eering-3.979
idge-3.934
IBLE-3.424
 nodd-3.322
rients-3.316
Token a
Token ID257
Feature activation-5.48e-02

Pos logit contributions
jud+0.166
andre+0.158
ca+0.155
imposed+0.148
traumatic+0.145
Neg logit contributions
eering-0.203
idge-0.201
IBLE-0.171
rients-0.166
 nodd-0.166
Token massive
Token ID4858
Feature activation-4.18e-01

Pos logit contributions
jud+0.184
andre+0.174
ca+0.172
imposed+0.164
traumatic+0.161
Neg logit contributions
eering-0.196
idge-0.194
IBLE-0.164
rients-0.159
 nodd-0.159
Token fish
Token ID5916
Feature activation+6.86e-01

Pos logit contributions
jud+1.416
andre+1.345
ca+1.338
imposed+1.264
traumatic+1.242
Neg logit contributions
eering-1.460
idge-1.443
IBLE-1.221
rients-1.187
 nodd-1.182
Token/
Token ID14
Feature activation-1.11e+00

Pos logit contributions
jud+2.544
andre+2.440
ca+2.373
imposed+2.191
traumatic+2.120
Neg logit contributions
eering-4.560
idge-4.528
IBLE-3.966
 nodd-3.925
yssey-3.874
TokenLos
Token ID28903
Feature activation-6.16e-01

Pos logit contributions
jud+3.330
andre+3.191
ca+3.117
imposed+2.922
cas+2.811
Neg logit contributions
eering-4.298
idge-4.218
IBLE-3.594
 nodd-3.557
rients-3.528
Token Angeles
Token ID5652
Feature activation-3.07e-01

Pos logit contributions
jud+1.457
andre+1.365
ca+1.325
imposed+1.211
traumatic+1.184
Neg logit contributions
eering-2.801
idge-2.775
 nodd-2.437
IBLE-2.437
rients-2.386
Token Times
Token ID3782
Feature activation-9.54e-01

Pos logit contributions
jud+0.894
andre+0.846
ca+0.826
imposed+0.780
traumatic+0.768
Neg logit contributions
eering-1.237
idge-1.217
IBLE-1.044
 nodd-1.023
rients-1.020
Token/
Token ID14
Feature activation-1.26e+00

Pos logit contributions
jud+2.489
andre+2.403
ca+2.340
imposed+2.174
traumatic+2.143
Neg logit contributions
eering-4.050
idge-3.982
IBLE-3.456
 nodd-3.421
rients-3.391
TokenGetty
Token ID6633
Feature activation-2.66e+00

Pos logit contributions
jud+3.090
andre+2.924
ca+2.855
imposed+2.623
rh+2.515
Neg logit contributions
eering-5.618
idge-5.487
rients-4.758
 nodd-4.754
IBLE-4.752
Token Images
Token ID5382
Feature activation-1.15e+00

Pos logit contributions
jud+5.102
andre+4.939
ca+4.787
traumatic+4.293
imposed+4.282
Neg logit contributions
eering-12.139
idge-11.895
 nodd-10.719
yip-10.701
yssey-10.532
Token\n
Token ID198
Feature activation-7.23e-02

Pos logit contributions
jud+2.930
andre+2.819
ca+2.734
imposed+2.532
traumatic+2.504
Neg logit contributions
eering-4.900
idge-4.823
IBLE-4.191
 nodd-4.172
rients-4.136
Token\n
Token ID198
Feature activation-3.91e-01

Pos logit contributions
jud+0.221
andre+0.212
ca+0.202
imposed+0.200
traumatic+0.194
Neg logit contributions
eering-0.279
idge-0.275
IBLE-0.233
rients-0.225
yssey-0.224
TokenIn
Token ID818
Feature activation-2.93e-01

Pos logit contributions
jud+1.016
andre+0.957
ca+0.934
imposed+0.874
traumatic+0.852
Neg logit contributions
eering-1.700
idge-1.670
 nodd-1.456
IBLE-1.445
yssey-1.420
Token a
Token ID257
Feature activation-4.00e-02

Pos logit contributions
jud+0.910
andre+0.863
ca+0.847
imposed+0.802
traumatic+0.786
Neg logit contributions
eering-1.122
idge-1.102
IBLE-0.943
 nodd-0.918
yssey-0.912
Token a
Token ID257
Feature activation+2.13e-01

Pos logit contributions
jud+0.264
andre+0.250
ca+0.248
imposed+0.238
traumatic+0.228
Neg logit contributions
eering-0.314
idge-0.311
IBLE-0.262
rients-0.258
yssey-0.258
Token black
Token ID2042
Feature activation+4.50e-01

Pos logit contributions
eering+0.756
idge+0.745
IBLE+0.623
rients+0.608
yssey+0.607
Neg logit contributions
jud-0.719
andre-0.684
ca-0.671
imposed-0.645
traumatic-0.630
Token hat
Token ID6877
Feature activation-2.68e-01

Pos logit contributions
eering+1.589
idge+1.561
IBLE+1.306
 nodd+1.271
yssey+1.258
Neg logit contributions
jud-1.532
andre-1.463
ca-1.427
imposed-1.374
traumatic-1.349
Token and
Token ID290
Feature activation+9.09e-02

Pos logit contributions
jud+0.693
andre+0.652
ca+0.641
imposed+0.604
traumatic+0.581
Neg logit contributions
eering-1.159
idge-1.147
IBLE-0.994
yssey-0.978
rients-0.975
Token rain
Token ID6290
Feature activation+9.28e-01

Pos logit contributions
eering+0.326
idge+0.322
IBLE+0.270
yssey+0.263
rients+0.263
Neg logit contributions
jud-0.303
andre-0.289
ca-0.284
imposed-0.273
traumatic-0.266
Tokencoat
Token ID31434
Feature activation-2.66e+00

Pos logit contributions
eering+3.606
idge+3.515
IBLE+3.022
 nodd+2.921
yssey+2.887
Neg logit contributions
jud-2.753
andre-2.636
ca-2.478
imposed-2.452
traumatic-2.405
Token,
Token ID11
Feature activation+9.45e-02

Pos logit contributions
jud+6.434
ca+6.182
andre+5.959
imposed+5.786
cas+5.383
Neg logit contributions
idge-10.605
eering-10.355
yssey-9.390
rients-9.329
ADRA-9.149
Token with
Token ID351
Feature activation+1.40e-01

Pos logit contributions
eering+0.369
idge+0.364
IBLE+0.310
rients+0.304
yssey+0.304
Neg logit contributions
jud-0.286
andre-0.270
ca-0.267
imposed-0.255
traumatic-0.245
Token the
Token ID262
Feature activation+1.50e-01

Pos logit contributions
eering+0.509
idge+0.502
IBLE+0.422
yssey+0.412
 nodd+0.411
Neg logit contributions
jud-0.457
andre-0.435
ca-0.426
imposed-0.410
traumatic-0.400
Token words
Token ID2456
Feature activation-4.71e-03

Pos logit contributions
eering+0.546
idge+0.538
IBLE+0.452
 nodd+0.441
yssey+0.441
Neg logit contributions
jud-0.492
andre-0.466
ca-0.458
imposed-0.440
traumatic-0.429
Token "
Token ID366
Feature activation-4.98e-01

Pos logit contributions
jud+0.015
andre+0.014
ca+0.014
imposed+0.014
traumatic+0.013
Neg logit contributions
eering-0.017
idge-0.017
IBLE-0.014
 nodd-0.014
yssey-0.014
Tokenococ
Token ID34403
Feature activation-1.46e+00

Pos logit contributions
jud+1.822
andre+1.738
ca+1.715
imposed+1.625
traumatic+1.611
Neg logit contributions
eering-1.846
idge-1.795
IBLE-1.547
 nodd-1.520
rients-1.465
Tokencus
Token ID9042
Feature activation-1.78e+00

Pos logit contributions
jud+2.549
ca+2.430
andre+2.113
cas+2.022
neg+1.860
Neg logit contributions
eering-7.672
idge-7.384
IBLE-6.832
 nodd-6.633
ADRA-6.429
Token,
Token ID11
Feature activation-3.08e-01

Pos logit contributions
jud+4.112
ca+3.951
andre+3.885
imposed+3.465
traumatic+3.428
Neg logit contributions
eering-8.035
idge-7.784
IBLE-6.872
yssey-6.825
 nodd-6.777
Token Gard
Token ID12790
Feature activation-7.17e-01

Pos logit contributions
jud+0.917
ca+0.865
andre+0.865
imposed+0.806
traumatic+0.797
Neg logit contributions
eering-1.224
idge-1.199
IBLE-1.030
yssey-1.003
 nodd-1.000
Tokenn
Token ID77
Feature activation-5.39e-01

Pos logit contributions
jud+2.029
ca+1.929
andre+1.915
imposed+1.774
traumatic+1.740
Neg logit contributions
eering-2.932
idge-2.852
IBLE-2.517
 nodd-2.467
yssey-2.391
Tokenerella
Token ID36648
Feature activation-2.63e+00

Pos logit contributions
jud+2.132
andre+2.050
ca+2.030
imposed+1.929
traumatic+1.900
Neg logit contributions
eering-1.618
idge-1.506
IBLE-1.300
 nodd-1.277
rients-1.230
Token vaginal
Token ID30131
Feature activation-1.15e+00

Pos logit contributions
jud+6.128
ca+6.120
andre+5.517
rh+5.285
neg+5.237
Neg logit contributions
eering-11.353
idge-10.927
IBLE-9.799
yssey-9.750
 nodd-9.478
Tokenis
Token ID271
Feature activation-1.57e+00

Pos logit contributions
jud+2.402
andre+2.275
ca+2.191
imposed+1.977
traumatic+1.913
Neg logit contributions
eering-5.471
idge-5.342
IBLE-4.859
yssey-4.727
yip-4.727
Token,
Token ID11
Feature activation-4.66e-01

Pos logit contributions
jud+3.803
ca+3.689
andre+3.653
imposed+3.268
traumatic+3.214
Neg logit contributions
eering-6.890
idge-6.661
IBLE-5.864
yssey-5.824
 nodd-5.762
Token Mobil
Token ID45951
Feature activation+1.53e-01

Pos logit contributions
jud+1.393
ca+1.317
andre+1.314
imposed+1.223
traumatic+1.213
Neg logit contributions
eering-1.845
idge-1.803
IBLE-1.552
yssey-1.509
 nodd-1.502
Tokenun
Token ID403
Feature activation-2.11e-01

Pos logit contributions
eering+0.615
idge+0.603
IBLE+0.520
 nodd+0.507
yssey+0.504
Neg logit contributions
jud-0.447
andre-0.423
ca-0.413
imposed-0.393
traumatic-0.384
Token to
Token ID284
Feature activation-1.10e-01

Pos logit contributions
jud+1.560
andre+1.447
ca+1.418
imposed+1.332
traumatic+1.302
Neg logit contributions
eering-2.704
idge-2.659
IBLE-2.331
 nodd-2.313
yssey-2.284
Token push
Token ID4574
Feature activation-6.73e-01

Pos logit contributions
jud+0.416
andre+0.399
ca+0.393
imposed+0.376
traumatic+0.369
Neg logit contributions
eering-0.346
idge-0.339
IBLE-0.279
 nodd-0.271
rients-0.270
Token hemp
Token ID28246
Feature activation+7.07e-01

Pos logit contributions
jud+2.175
andre+2.042
ca+2.027
imposed+1.912
traumatic+1.888
Neg logit contributions
eering-2.450
idge-2.434
IBLE-2.041
 nodd-2.012
yssey-2.008
Token across
Token ID1973
Feature activation-4.54e-01

Pos logit contributions
eering+2.555
idge+2.508
IBLE+2.108
 nodd+2.047
yssey+2.041
Neg logit contributions
jud-2.347
andre-2.241
ca-2.184
imposed-2.104
traumatic-2.060
Token the
Token ID262
Feature activation-8.40e-01

Pos logit contributions
jud+1.408
andre+1.321
ca+1.305
imposed+1.243
traumatic+1.211
Neg logit contributions
eering-1.741
idge-1.712
IBLE-1.444
 nodd-1.424
yssey-1.419
Token finish
Token ID5461
Feature activation-2.62e+00

Pos logit contributions
jud+2.609
andre+2.446
ca+2.420
imposed+2.285
traumatic+2.221
Neg logit contributions
eering-3.181
idge-3.142
IBLE-2.655
yssey-2.629
rients-2.590
Token line
Token ID1627
Feature activation-1.42e+00

Pos logit contributions
jud+7.568
andre+6.982
ca+6.807
imposed+6.490
neg+6.301
Neg logit contributions
eering-9.881
idge-9.600
ADRA-8.458
IBLE-8.408
yssey-8.389
Token,
Token ID11
Feature activation-2.90e-01

Pos logit contributions
jud+3.640
andre+3.360
ca+3.330
imposed+3.129
traumatic+2.997
Neg logit contributions
eering-6.026
idge-5.975
IBLE-5.229
 nodd-5.206
yssey-5.129
Token not
Token ID407
Feature activation+1.70e-01

Pos logit contributions
jud+0.788
andre+0.743
ca+0.733
imposed+0.687
traumatic+0.669
Neg logit contributions
eering-1.228
idge-1.206
IBLE-1.048
 nodd-1.027
yssey-1.021
Token only
Token ID691
Feature activation-1.21e+00

Pos logit contributions
eering+0.698
idge+0.690
IBLE+0.594
 nodd+0.582
rients+0.577
Neg logit contributions
jud-0.483
andre-0.453
ca-0.445
imposed-0.422
traumatic-0.407
Token for
Token ID329
Feature activation-9.59e-02

Pos logit contributions
jud+3.195
ca+3.052
andre+2.965
imposed+2.762
traumatic+2.688
Neg logit contributions
idge-5.007
eering-4.981
IBLE-4.351
yssey-4.288
 nodd-4.255
Token 23
Token ID2242
Feature activation-7.50e-03

Pos logit contributions
jud+0.441
andre+0.419
ca+0.411
imposed+0.392
traumatic+0.383
Neg logit contributions
eering-0.476
idge-0.470
IBLE-0.395
yssey-0.384
 nodd-0.384
Token Republican
Token ID3415
Feature activation+5.08e-01

Pos logit contributions
jud+0.024
andre+0.023
ca+0.023
imposed+0.022
traumatic+0.021
Neg logit contributions
eering-0.028
idge-0.027
IBLE-0.023
yssey-0.023
 nodd-0.022
Token-
Token ID12
Feature activation-3.42e-02

Pos logit contributions
eering+1.889
idge+1.857
IBLE+1.566
 nodd+1.528
yssey+1.521
Neg logit contributions
jud-1.630
andre-1.557
ca-1.514
imposed-1.456
traumatic-1.428
Tokenheld
Token ID10217
Feature activation+2.79e-01

Pos logit contributions
jud+0.094
andre+0.088
ca+0.086
imposed+0.082
traumatic+0.080
Neg logit contributions
eering-0.144
idge-0.141
IBLE-0.122
 nodd-0.120
yssey-0.119
Token districts
Token ID12815
Feature activation-3.85e-01

Pos logit contributions
eering+1.048
idge+1.029
IBLE+0.871
 nodd+0.850
yssey+0.845
Neg logit contributions
jud-0.887
andre-0.845
ca-0.824
imposed-0.791
traumatic-0.775
Token won
Token ID1839
Feature activation-2.61e+00

Pos logit contributions
jud+1.162
andre+1.094
ca+1.076
imposed+1.025
traumatic+0.991
Neg logit contributions
eering-1.504
idge-1.486
IBLE-1.263
yssey-1.241
 nodd-1.241
Token by
Token ID416
Feature activation-3.36e-01

Pos logit contributions
jud+5.641
andre+5.362
ca+5.154
imposed+4.759
't+4.601
Neg logit contributions
eering-11.483
idge-11.318
 nodd-10.028
IBLE-9.906
rients-9.856
Token Clinton
Token ID2605
Feature activation-7.79e-01

Pos logit contributions
jud+1.036
andre+0.990
ca+0.963
imposed+0.915
traumatic+0.890
Neg logit contributions
eering-1.285
idge-1.265
IBLE-1.079
yssey-1.057
 nodd-1.055
Token last
Token ID938
Feature activation-8.38e-03

Pos logit contributions
jud+2.052
andre+1.951
ca+1.884
imposed+1.768
traumatic+1.723
Neg logit contributions
eering-3.304
idge-3.240
IBLE-2.846
 nodd-2.794
yssey-2.773
Token year
Token ID614
Feature activation+4.32e-01

Pos logit contributions
jud+0.029
andre+0.027
ca+0.027
imposed+0.026
traumatic+0.025
Neg logit contributions
eering-0.030
idge-0.029
IBLE-0.024
 nodd-0.024
rients-0.024
Token,
Token ID11
Feature activation-1.88e-01

Pos logit contributions
eering+1.840
idge+1.816
IBLE+1.569
 nodd+1.530
yssey+1.528
Neg logit contributions
jud-1.151
andre-1.084
ca-1.050
imposed-1.003
traumatic-0.977
Token his
Token ID465
Feature activation+1.56e-01

Pos logit contributions
jud+0.798
andre+0.757
ca+0.746
imposed+0.703
traumatic+0.689
Neg logit contributions
eering-0.997
idge-0.980
IBLE-0.838
 nodd-0.817
yssey-0.812
Token landslide
Token ID36348
Feature activation-5.33e-01

Pos logit contributions
eering+0.542
idge+0.533
IBLE+0.446
 nodd+0.433
rients+0.432
Neg logit contributions
jud-0.538
andre-0.513
ca-0.502
imposed-0.481
traumatic-0.474
Token re
Token ID302
Feature activation-1.45e+00

Pos logit contributions
jud+1.641
andre+1.554
ca+1.527
imposed+1.451
traumatic+1.413
Neg logit contributions
eering-2.020
idge-2.008
IBLE-1.729
rients-1.686
 nodd-1.679
Token-
Token ID12
Feature activation-9.18e-01

Pos logit contributions
jud+2.922
andre+2.633
ca+2.518
imposed+2.439
traumatic+2.207
Neg logit contributions
eering-7.046
idge-6.941
IBLE-6.328
rients-6.185
 nodd-6.151
Tokenelection
Token ID14300
Feature activation-5.90e-01

Pos logit contributions
jud+1.237
andre+1.049
ca+1.016
imposed+0.922
traumatic+0.809
Neg logit contributions
eering-5.140
idge-5.066
IBLE-4.628
 nodd-4.548
rients-4.541
Token victory
Token ID5373
Feature activation-2.61e+00

Pos logit contributions
jud+2.141
andre+2.064
ca+2.023
imposed+1.946
traumatic+1.900
Neg logit contributions
idge-1.878
eering-1.862
IBLE-1.562
rients-1.524
 nodd-1.516
Token last
Token ID938
Feature activation-6.10e-01

Pos logit contributions
jud+6.087
andre+5.748
ca+5.719
imposed+5.432
traumatic+5.046
Neg logit contributions
idge-10.782
eering-10.387
rients-9.478
IBLE-9.423
 nodd-9.348
Token night
Token ID1755
Feature activation+7.96e-01

Pos logit contributions
jud+1.999
andre+1.888
ca+1.855
imposed+1.779
traumatic+1.727
Neg logit contributions
eering-2.218
idge-2.186
IBLE-1.865
rients-1.816
yssey-1.806
Token,
Token ID11
Feature activation-5.79e-01

Pos logit contributions
eering+3.290
idge+3.239
IBLE+2.780
 nodd+2.705
yssey+2.695
Neg logit contributions
jud-2.191
andre-2.068
ca-2.001
imposed-1.912
traumatic-1.883
Token New
Token ID968
Feature activation-1.45e+00

Pos logit contributions
jud+1.925
andre+1.823
ca+1.790
imposed+1.714
traumatic+1.658
Neg logit contributions
eering-2.091
idge-2.058
IBLE-1.746
rients-1.703
yssey-1.696
Token Jersey
Token ID8221
Feature activation-1.76e+00

Pos logit contributions
jud+4.074
andre+3.845
ca+3.808
imposed+3.546
neg+3.377
Neg logit contributions
eering-5.787
idge-5.649
IBLE-5.096
yssey-4.922
 nodd-4.886
Token been
Token ID587
Feature activation-2.27e-01

Pos logit contributions
jud+1.511
andre+1.422
ca+1.399
imposed+1.338
traumatic+1.305
Neg logit contributions
eering-1.978
idge-1.965
IBLE-1.681
rients-1.645
 nodd-1.635
Token diagnosed
Token ID14641
Feature activation-7.42e-01

Pos logit contributions
jud+0.700
andre+0.662
ca+0.651
imposed+0.623
traumatic+0.609
Neg logit contributions
eering-0.869
idge-0.862
IBLE-0.732
rients-0.714
 nodd-0.712
Token with
Token ID351
Feature activation-4.93e-01

Pos logit contributions
jud+1.973
andre+1.852
ca+1.819
imposed+1.723
traumatic+1.688
Neg logit contributions
eering-3.129
idge-3.092
IBLE-2.702
rients-2.626
yssey-2.625
Token post
Token ID1281
Feature activation-1.54e+00

Pos logit contributions
jud+1.464
andre+1.396
ca+1.376
traumatic+1.291
imposed+1.288
Neg logit contributions
eering-1.925
idge-1.907
IBLE-1.649
yssey-1.610
rients-1.592
Token-
Token ID12
Feature activation-1.72e+00

Pos logit contributions
jud+3.184
ca+2.942
andre+2.884
traumatic+2.851
imposed+2.634
Neg logit contributions
idge-7.353
eering-7.327
IBLE-6.555
yssey-6.345
rients-6.284
Tokentraumatic
Token ID41521
Feature activation-2.58e+00

Pos logit contributions
jud+1.810
traumatic+1.519
ca+1.492
andre+1.472
imposed+1.193
Neg logit contributions
eering-9.971
idge-9.861
IBLE-9.042
 nodd-8.869
rients-8.853
Token stress
Token ID5503
Feature activation-1.50e+00

Pos logit contributions
jud+5.225
ca+4.771
traumatic+4.620
andre+4.497
imposed+4.057
Neg logit contributions
eering-11.865
idge-11.801
IBLE-10.833
yssey-10.431
yip-10.374
Token disorder
Token ID8967
Feature activation-1.60e+00

Pos logit contributions
jud+4.393
ca+4.174
andre+4.103
traumatic+3.972
imposed+3.863
Neg logit contributions
eering-5.894
idge-5.798
IBLE-4.981
yssey-4.888
rients-4.779
Token and
Token ID290
Feature activation-5.45e-01

Pos logit contributions
jud+3.787
ca+3.499
andre+3.472
imposed+3.250
traumatic+3.238
Neg logit contributions
eering-7.018
idge-6.936
IBLE-6.120
 nodd-6.042
yssey-6.041
Token,
Token ID11
Feature activation-6.19e-01

Pos logit contributions
jud+1.693
andre+1.596
ca+1.580
imposed+1.497
traumatic+1.489
Neg logit contributions
eering-2.070
idge-2.051
IBLE-1.752
rients-1.702
 nodd-1.694
Token with
Token ID351
Feature activation-1.23e-01

Pos logit contributions
jud+1.809
ca+1.680
andre+1.675
imposed+1.577
traumatic+1.555
Neg logit contributions
eering-2.462
idge-2.462
IBLE-2.117
rients-2.071
 nodd-2.063
Token executions
Token ID30632
Feature activation-6.78e-01

Pos logit contributions
jud+2.063
andre+1.938
ca+1.935
imposed+1.848
traumatic+1.803
Neg logit contributions
eering-2.177
idge-2.163
IBLE-1.806
yssey-1.776
rients-1.756
Token took
Token ID1718
Feature activation-8.05e-01

Pos logit contributions
jud+2.005
andre+1.919
ca+1.898
imposed+1.807
traumatic+1.742
Neg logit contributions
idge-2.638
eering-2.631
IBLE-2.248
 nodd-2.197
rients-2.197
Token place
Token ID1295
Feature activation-3.70e-01

Pos logit contributions
jud+2.971
andre+2.838
ca+2.790
imposed+2.678
traumatic+2.612
Neg logit contributions
eering-2.577
idge-2.567
IBLE-2.091
 nodd-2.029
yssey-2.028
Token in
Token ID287
Feature activation-2.81e-04

Pos logit contributions
jud+1.091
andre+1.034
ca+1.010
imposed+0.963
traumatic+0.937
Neg logit contributions
eering-1.473
idge-1.455
IBLE-1.246
 nodd-1.217
yssey-1.215
Token Chong
Token ID46892
Feature activation-6.20e-01

Pos logit contributions
jud+0.001
andre+0.001
ca+0.001
imposed+0.001
traumatic+0.001
Neg logit contributions
eering-0.001
idge-0.001
IBLE-0.001
yssey-0.001
 nodd-0.001
Tokenjin
Token ID18594
Feature activation-2.55e+00

Pos logit contributions
jud+1.239
andre+1.111
ca+1.095
imposed+0.982
traumatic+0.947
Neg logit contributions
eering-3.086
idge-3.058
IBLE-2.717
 nodd-2.667
yssey-2.645
Token in
Token ID287
Feature activation-7.71e-01

Pos logit contributions
jud+6.902
ca+6.446
andre+6.215
ju+5.864
cas+5.754
Neg logit contributions
eering-9.925
idge-9.901
IBLE-8.765
 nodd-8.723
yssey-8.533
Token North
Token ID2258
Feature activation-8.07e-01

Pos logit contributions
jud+2.547
andre+2.392
ca+2.369
imposed+2.223
activated+2.189
Neg logit contributions
eering-2.789
idge-2.765
IBLE-2.346
 nodd-2.283
rients-2.261
Token Ham
Token ID4345
Feature activation+4.59e-01

Pos logit contributions
jud+2.530
ca+2.336
andre+2.326
imposed+2.201
cas+2.130
Neg logit contributions
eering-3.095
idge-3.036
IBLE-2.635
 nodd-2.588
rients-2.519
Tokengy
Token ID1360
Feature activation-1.39e-01

Pos logit contributions
eering+1.754
idge+1.722
IBLE+1.430
yssey+1.424
rients+1.411
Neg logit contributions
jud-1.398
andre-1.345
ca-1.290
imposed-1.248
traumatic-1.226
Tokenong
Token ID506
Feature activation-1.37e+00

Pos logit contributions
jud+0.449
andre+0.428
ca+0.419
imposed+0.396
traumatic+0.390
Neg logit contributions
eering-0.518
idge-0.509
IBLE-0.441
 nodd-0.430
rients-0.418
Tokenis
Token ID271
Feature activation-1.43e+00

Pos logit contributions
jud+3.448
andre+3.234
ca+3.201
imposed+2.970
qs+2.932
Neg logit contributions
eering-6.014
idge-5.828
IBLE-5.241
 nodd-5.230
 rall-5.088
Tokenq
Token ID80
Feature activation+6.68e-02

Pos logit contributions
jud+1.864
andre+1.648
ca+1.585
qs+1.388
imposed+1.344
Neg logit contributions
eering-7.965
idge-7.862
IBLE-7.099
 nodd-7.018
rients-6.905
Tokenars
Token ID945
Feature activation-4.36e-01

Pos logit contributions
eering+0.266
idge+0.261
IBLE+0.223
yssey+0.218
rients+0.217
Neg logit contributions
jud-0.197
andre-0.187
ca-0.181
imposed-0.174
traumatic-0.170
Token/
Token ID14
Feature activation-9.11e-01

Pos logit contributions
jud+1.047
andre+0.985
ca+0.955
imposed+0.898
traumatic+0.875
Neg logit contributions
eering-1.964
idge-1.942
IBLE-1.699
 nodd-1.674
yssey-1.660
Tokenf
Token ID69
Feature activation-5.29e-01

Pos logit contributions
jud+2.385
andre+2.211
ca+2.199
imposed+2.049
traumatic+2.026
Neg logit contributions
eering-3.905
idge-3.859
IBLE-3.378
 nodd-3.326
yssey-3.285
Tokenatal
Token ID10254
Feature activation-2.55e+00

Pos logit contributions
jud+1.496
andre+1.406
ca+1.395
imposed+1.305
traumatic+1.270
Neg logit contributions
eering-2.176
idge-2.134
IBLE-1.851
 nodd-1.834
yssey-1.791
Token.
Token ID13
Feature activation+4.92e-01

Pos logit contributions
jud+5.533
andre+5.179
ca+5.144
imposed+4.593
cas+4.552
Neg logit contributions
eering-11.522
idge-11.367
IBLE-10.266
 nodd-10.241
 rall-9.969
Tokenhtml
Token ID6494
Feature activation-2.71e-01

Pos logit contributions
eering+1.937
idge+1.905
IBLE+1.624
 nodd+1.583
yssey+1.580
Neg logit contributions
jud-1.471
andre-1.399
ca-1.355
imposed-1.303
traumatic-1.270
Token.
Token ID13
Feature activation+1.16e-01

Pos logit contributions
jud+0.742
andre+0.700
ca+0.683
imposed+0.645
traumatic+0.634
Neg logit contributions
eering-1.135
idge-1.118
IBLE-0.965
 nodd-0.947
yssey-0.944
Token CDC
Token ID20434
Feature activation+1.05e+00

Pos logit contributions
eering+0.428
idge+0.421
IBLE+0.355
 nodd+0.347
yssey+0.345
Neg logit contributions
jud-0.376
andre-0.358
ca-0.351
imposed-0.335
traumatic-0.329
Token.
Token ID13
Feature activation+1.73e-01

Pos logit contributions
eering+4.398
idge+4.310
IBLE+3.737
rients+3.630
 nodd+3.621
Neg logit contributions
jud-2.824
andre-2.673
ca-2.580
imposed-2.503
traumatic-2.387
Token no
Token ID645
Feature activation-5.29e-01

Pos logit contributions
jud+1.419
andre+1.331
ca+1.319
imposed+1.252
traumatic+1.211
Neg logit contributions
eering-1.738
idge-1.722
IBLE-1.461
rients-1.424
yssey-1.416
Token longer
Token ID2392
Feature activation-1.30e-01

Pos logit contributions
jud+1.659
andre+1.561
ca+1.529
imposed+1.448
traumatic+1.402
Neg logit contributions
eering-2.027
idge-1.979
IBLE-1.689
 nodd-1.657
rients-1.633
Token professional
Token ID4708
Feature activation-1.24e+00

Pos logit contributions
jud+0.426
andre+0.404
ca+0.398
imposed+0.380
traumatic+0.370
Neg logit contributions
eering-0.476
idge-0.467
IBLE-0.391
yssey-0.383
 nodd-0.382
Token enough
Token ID1576
Feature activation-9.45e-01

Pos logit contributions
jud+3.635
andre+3.333
ca+3.329
imposed+3.129
cas+3.020
Neg logit contributions
eering-4.936
idge-4.813
IBLE-4.093
yssey-4.089
yip-4.077
Token or
Token ID393
Feature activation-1.01e+00

Pos logit contributions
jud+2.451
andre+2.272
ca+2.218
imposed+2.084
traumatic+2.020
Neg logit contributions
eering-4.076
idge-4.016
IBLE-3.481
 nodd-3.419
yssey-3.416
Token unpre
Token ID11771
Feature activation-2.54e+00

Pos logit contributions
jud+3.267
andre+3.079
ca+3.069
imposed+2.890
cas+2.837
Neg logit contributions
eering-3.708
idge-3.657
IBLE-3.065
rients-3.020
 nodd-3.002
Tokenjud
Token ID10456
Feature activation-2.60e-01

Pos logit contributions
jud+4.829
ca+3.849
andre+3.535
imposed+3.074
condition+3.011
Neg logit contributions
eering-13.383
idge-12.898
IBLE-11.800
 nodd-11.590
ADRA-11.488
Tokeniced
Token ID3711
Feature activation-1.10e+00

Pos logit contributions
jud+0.667
andre+0.626
ca+0.607
imposed+0.576
traumatic+0.560
Neg logit contributions
eering-1.136
idge-1.120
IBLE-0.973
 nodd-0.954
yssey-0.949
Token enough
Token ID1576
Feature activation-4.26e-01

Pos logit contributions
jud+2.558
andre+2.338
ca+2.300
imposed+2.112
traumatic+2.053
Neg logit contributions
eering-5.033
idge-4.972
IBLE-4.342
 nodd-4.301
yssey-4.285
Token to
Token ID284
Feature activation+7.15e-02

Pos logit contributions
jud+1.159
andre+1.089
ca+1.061
imposed+1.005
traumatic+0.980
Neg logit contributions
eering-1.796
idge-1.770
IBLE-1.528
 nodd-1.498
yssey-1.493
Token publish
Token ID7715
Feature activation-5.03e-01

Pos logit contributions
eering+0.225
idge+0.221
IBLE+0.180
 nodd+0.175
yssey+0.174
Neg logit contributions
jud-0.271
andre-0.259
ca-0.254
imposed-0.245
traumatic-0.241
TokenPhoto
Token ID6191
Feature activation-1.21e+00

Pos logit contributions
eering+5.121
idge+5.065
IBLE+4.441
yssey+4.230
 nodd+4.215
Neg logit contributions
jud-3.411
andre-3.209
ca-3.068
imposed-3.018
traumatic-2.938
Token by
Token ID416
Feature activation+3.85e-01

Pos logit contributions
jud+2.471
andre+2.297
ca+2.227
imposed+2.064
traumatic+1.967
Neg logit contributions
eering-5.840
idge-5.783
IBLE-5.097
rients-5.015
yssey-5.000
Token John
Token ID1757
Feature activation-4.08e-01

Pos logit contributions
eering+1.475
idge+1.452
IBLE+1.232
 nodd+1.202
yssey+1.196
Neg logit contributions
jud-1.196
andre-1.134
ca-1.105
imposed-1.062
traumatic-1.040
Token Moore
Token ID8877
Feature activation+6.51e-01

Pos logit contributions
jud+1.361
andre+1.305
ca+1.280
imposed+1.216
traumatic+1.193
Neg logit contributions
eering-1.458
idge-1.442
IBLE-1.229
 nodd-1.188
yssey-1.177
Token/
Token ID14
Feature activation-1.18e+00

Pos logit contributions
eering+2.995
idge+2.954
IBLE+2.567
yssey+2.522
rients+2.511
Neg logit contributions
jud-1.525
andre-1.414
ca-1.357
imposed-1.296
traumatic-1.255
TokenGetty
Token ID6633
Feature activation-2.53e+00

Pos logit contributions
jud+2.959
andre+2.779
ca+2.693
imposed+2.526
neg+2.363
Neg logit contributions
eering-5.180
idge-5.045
IBLE-4.410
rients-4.376
 nodd-4.362
Token Images
Token ID5382
Feature activation-9.66e-01

Pos logit contributions
jud+4.871
andre+4.548
ca+4.517
imposed+4.082
traumatic+3.880
Neg logit contributions
eering-11.672
idge-11.540
 nodd-10.183
yssey-10.173
yip-10.145
Token)
Token ID8
Feature activation+5.28e-01

Pos logit contributions
jud+3.081
andre+2.949
ca+2.890
imposed+2.741
traumatic+2.676
Neg logit contributions
eering-3.566
idge-3.514
IBLE-2.946
rients-2.895
yssey-2.888
Token Jeff
Token ID5502
Feature activation+7.41e-01

Pos logit contributions
eering+1.880
idge+1.847
IBLE+1.549
 nodd+1.501
yssey+1.496
Neg logit contributions
jud-1.780
andre-1.700
ca-1.658
imposed-1.599
traumatic-1.566
Token Sessions
Token ID15184
Feature activation+7.24e-01

Pos logit contributions
eering+2.099
idge+2.047
IBLE+1.639
 nodd+1.566
rients+1.556
Neg logit contributions
jud-3.033
andre-2.916
ca-2.851
imposed-2.776
traumatic-2.728
Token Sen
Token ID2311
Feature activation-5.38e-01

Pos logit contributions
eering+2.750
idge+2.702
IBLE+2.280
 nodd+2.221
yssey+2.207
Neg logit contributions
jud-2.250
andre-2.147
ca-2.094
imposed-2.016
traumatic-1.973
Token down
Token ID866
Feature activation+9.90e-01

Pos logit contributions
eering+3.645
idge+3.584
IBLE+3.036
rients+2.951
 nodd+2.935
Neg logit contributions
jud-3.112
andre-2.982
ca-2.883
imposed-2.795
traumatic-2.741
Token every
Token ID790
Feature activation+9.51e-01

Pos logit contributions
eering+3.877
idge+3.826
IBLE+3.269
rients+3.191
 nodd+3.163
Neg logit contributions
jud-2.905
andre-2.789
ca-2.674
imposed-2.577
traumatic-2.531
Token day
Token ID1110
Feature activation+9.81e-01

Pos logit contributions
eering+3.510
idge+3.444
IBLE+2.894
 nodd+2.831
rients+2.813
Neg logit contributions
jud-3.042
andre-2.917
ca-2.811
imposed-2.717
traumatic-2.664
Token and
Token ID290
Feature activation+2.99e-01

Pos logit contributions
eering+4.081
idge+4.024
IBLE+3.475
rients+3.391
 nodd+3.372
Neg logit contributions
jud-2.661
andre-2.543
ca-2.420
imposed-2.334
traumatic-2.281
Token then
Token ID788
Feature activation+6.92e-01

Pos logit contributions
eering+1.066
idge+1.049
IBLE+0.880
 nodd+0.857
yssey+0.851
Neg logit contributions
jud-1.007
andre-0.961
ca-0.943
imposed-0.904
traumatic-0.883
Token mineral
Token ID18352
Feature activation+2.16e+00

Pos logit contributions
eering+2.484
idge+2.442
IBLE+2.047
 nodd+1.992
rients+1.985
Neg logit contributions
jud-2.312
andre-2.204
ca-2.150
imposed-2.070
traumatic-2.031
Tokenizes
Token ID4340
Feature activation+5.36e-01

Pos logit contributions
idge+7.367
eering+7.324
rients+6.194
IBLE+6.061
rient+5.822
Neg logit contributions
jud-7.096
andre-6.848
ca-6.499
traumatic-6.392
imposed-6.367
Token,
Token ID11
Feature activation-1.75e-03

Pos logit contributions
eering+2.185
idge+2.151
IBLE+1.848
 nodd+1.802
rients+1.797
Neg logit contributions
jud-1.531
andre-1.450
ca-1.408
imposed-1.346
traumatic-1.313
Token forming
Token ID14583
Feature activation+4.58e-01

Pos logit contributions
jud+0.005
andre+0.005
ca+0.005
imposed+0.005
traumatic+0.005
Neg logit contributions
eering-0.007
idge-0.007
IBLE-0.006
 nodd-0.006
yssey-0.006
Token a
Token ID257
Feature activation+3.10e-01

Pos logit contributions
eering+1.660
idge+1.630
IBLE+1.373
 nodd+1.336
rients+1.329
Neg logit contributions
jud-1.514
andre-1.443
ca-1.409
imposed-1.355
traumatic-1.327
Token hard
Token ID1327
Feature activation-2.52e-01

Pos logit contributions
eering+1.085
idge+1.070
IBLE+0.892
yssey+0.867
 nodd+0.866
Neg logit contributions
jud-1.063
andre-1.013
ca-0.994
imposed-0.953
traumatic-0.938
Token National
Token ID2351
Feature activation-7.90e-01

Pos logit contributions
eering+0.666
idge+0.654
IBLE+0.549
 nodd+0.532
rients+0.530
Neg logit contributions
jud-0.661
andre-0.631
ca-0.620
imposed-0.592
traumatic-0.581
Token Post
Token ID2947
Feature activation+8.79e-02

Pos logit contributions
jud+2.495
ca+2.393
andre+2.381
imposed+2.204
traumatic+2.165
Neg logit contributions
eering-2.947
idge-2.870
IBLE-2.488
 nodd-2.438
yssey-2.391
Token reading
Token ID3555
Feature activation+8.22e-01

Pos logit contributions
eering+0.355
idge+0.349
IBLE+0.300
 nodd+0.293
yssey+0.292
Neg logit contributions
jud-0.255
andre-0.241
ca-0.236
imposed-0.224
traumatic-0.219
Token "
Token ID366
Feature activation-1.06e+00

Pos logit contributions
eering+3.158
idge+3.106
IBLE+2.640
 nodd+2.575
rients+2.551
Neg logit contributions
jud-2.515
andre-2.398
ca-2.314
imposed-2.237
traumatic-2.192
TokenISIS
Token ID29322
Feature activation-6.77e-01

Pos logit contributions
jud+2.811
ca+2.606
andre+2.594
imposed+2.402
traumatic+2.343
Neg logit contributions
eering-4.550
idge-4.431
 nodd-3.882
IBLE-3.831
rients-3.811
Token Ur
Token ID8799
Feature activation+2.37e+00

Pos logit contributions
jud+2.267
andre+2.155
ca+2.142
imposed+2.032
traumatic+1.991
Neg logit contributions
eering-2.375
idge-2.355
IBLE-1.978
yssey-1.933
 nodd-1.930
Tokenges
Token ID3212
Feature activation+5.71e-01

Pos logit contributions
idge+8.609
eering+8.470
rients+7.216
IBLE+7.189
yssey+7.098
Neg logit contributions
jud-7.043
andre-6.732
ca-6.299
imposed-6.293
traumatic-6.204
Token Jihad
Token ID35243
Feature activation-4.34e-01

Pos logit contributions
eering+2.085
idge+2.043
IBLE+1.717
 nodd+1.664
yssey+1.660
Neg logit contributions
jud-1.872
andre-1.789
ca-1.735
imposed-1.684
traumatic-1.647
Tokenists
Token ID1023
Feature activation-5.08e-01

Pos logit contributions
jud+1.102
andre+1.032
ca+1.016
imposed+0.942
traumatic+0.918
Neg logit contributions
eering-1.869
idge-1.864
IBLE-1.633
 nodd-1.615
yssey-1.596
Token To
Token ID1675
Feature activation+2.03e-01

Pos logit contributions
jud+1.441
andre+1.365
ca+1.343
imposed+1.261
traumatic+1.234
Neg logit contributions
eering-2.049
idge-2.029
IBLE-1.753
 nodd-1.729
yssey-1.713
Token Attack
Token ID8307
Feature activation+1.91e-01

Pos logit contributions
eering+0.689
idge+0.679
IBLE+0.567
 nodd+0.553
yssey+0.550
Neg logit contributions
jud-0.717
andre-0.685
ca-0.673
imposed-0.643
traumatic-0.631
Token expand
Token ID4292
Feature activation+9.90e-01

Pos logit contributions
eering+1.031
idge+1.011
IBLE+0.830
 nodd+0.804
rients+0.800
Neg logit contributions
jud-1.213
andre-1.156
ca-1.138
imposed-1.101
traumatic-1.073
Token the
Token ID262
Feature activation+1.59e-01

Pos logit contributions
eering+3.378
idge+3.315
IBLE+2.758
 nodd+2.673
rients+2.658
Neg logit contributions
jud-3.433
andre-3.308
ca-3.220
imposed-3.105
traumatic-3.063
Token program
Token ID1430
Feature activation+3.75e-01

Pos logit contributions
eering+0.548
idge+0.538
IBLE+0.448
 nodd+0.436
yssey+0.436
Neg logit contributions
jud-0.558
andre-0.529
ca-0.520
imposed-0.500
traumatic-0.488
Token to
Token ID284
Feature activation+4.52e-01

Pos logit contributions
eering+1.543
idge+1.518
IBLE+1.304
 nodd+1.270
yssey+1.266
Neg logit contributions
jud-1.053
andre-1.002
ca-0.969
imposed-0.923
traumatic-0.905
Token include
Token ID2291
Feature activation+1.93e-01

Pos logit contributions
eering+1.485
idge+1.456
IBLE+1.201
 nodd+1.165
yssey+1.159
Neg logit contributions
jud-1.650
andre-1.581
ca-1.545
imposed-1.494
traumatic-1.467
Token broadband
Token ID18729
Feature activation+2.13e+00

Pos logit contributions
eering+0.694
idge+0.683
IBLE+0.573
 nodd+0.557
yssey+0.557
Neg logit contributions
jud-0.645
andre-0.615
ca-0.602
imposed-0.577
traumatic-0.565
Token internet
Token ID5230
Feature activation+2.05e+00

Pos logit contributions
eering+7.670
idge+7.518
IBLE+6.428
 nodd+6.187
rients+6.079
Neg logit contributions
jud-6.705
andre-6.413
ca-6.115
traumatic-5.915
imposed-5.915
Token access
Token ID1895
Feature activation+6.77e-01

Pos logit contributions
eering+6.851
idge+6.600
IBLE+5.567
 nodd+5.326
rients+5.272
Neg logit contributions
jud-7.058
andre-6.697
ca-6.454
traumatic-6.298
imposed-6.263
Token.
Token ID13
Feature activation+6.15e-02

Pos logit contributions
eering+2.751
idge+2.707
IBLE+2.333
 nodd+2.265
rients+2.258
Neg logit contributions
jud-1.933
andre-1.832
ca-1.777
imposed-1.699
traumatic-1.663
Token\n
Token ID198
Feature activation+4.05e-01

Pos logit contributions
eering+0.226
idge+0.223
IBLE+0.188
 nodd+0.187
yssey+0.184
Neg logit contributions
jud-0.200
andre-0.190
ca-0.187
imposed-0.178
traumatic-0.174
Token\n
Token ID198
Feature activation+2.19e-01

Pos logit contributions
eering+1.496
idge+1.475
IBLE+1.226
rients+1.181
yssey+1.174
Neg logit contributions
jud-1.291
andre-1.247
imposed-1.183
ca-1.168
traumatic-1.151
Token for
Token ID329
Feature activation+1.75e-01

Pos logit contributions
eering+5.542
idge+5.520
IBLE+4.703
 nodd+4.522
rients+4.508
Neg logit contributions
jud-4.463
andre-4.241
ca-4.111
imposed-4.020
traumatic-3.865
Token comments
Token ID3651
Feature activation+2.43e-01

Pos logit contributions
eering+0.644
idge+0.630
IBLE+0.532
yssey+0.521
 nodd+0.521
Neg logit contributions
jud-0.572
andre-0.547
ca-0.536
imposed-0.510
traumatic-0.499
Token about
Token ID546
Feature activation+7.76e-01

Pos logit contributions
eering+0.985
idge+0.970
IBLE+0.832
 nodd+0.813
yssey+0.810
Neg logit contributions
jud-0.699
andre-0.662
ca-0.644
imposed-0.615
traumatic-0.600
Token the
Token ID262
Feature activation+3.97e-01

Pos logit contributions
eering+2.861
idge+2.818
IBLE+2.372
 nodd+2.312
rients+2.296
Neg logit contributions
jud-2.509
andre-2.388
ca-2.326
imposed-2.246
traumatic-2.195
Token mother
Token ID2802
Feature activation-4.23e-01

Pos logit contributions
eering+1.399
idge+1.375
IBLE+1.150
 nodd+1.119
yssey+1.115
Neg logit contributions
jud-1.357
andre-1.295
ca-1.270
imposed-1.219
traumatic-1.195
Token grizz
Token ID46307
Feature activation+2.21e+00

Pos logit contributions
jud+1.235
andre+1.162
ca+1.150
imposed+1.099
traumatic+1.059
Neg logit contributions
eering-1.684
idge-1.653
IBLE-1.434
rients-1.401
yssey-1.399
Tokenly
Token ID306
Feature activation-3.80e-01

Pos logit contributions
eering+4.857
idge+4.776
IBLE+3.494
yssey+3.263
 nodd+3.242
Neg logit contributions
jud-10.194
andre-9.850
ca-9.474
imposed-9.411
traumatic-9.306
Token killing
Token ID5170
Feature activation-3.26e-01

Pos logit contributions
jud+1.330
ca+1.284
andre+1.274
imposed+1.214
traumatic+1.187
Neg logit contributions
eering-1.287
idge-1.250
IBLE-1.050
rients-1.028
 nodd-1.019
Token a
Token ID257
Feature activation-1.75e-02

Pos logit contributions
jud+0.941
andre+0.888
ca+0.887
imposed+0.835
traumatic+0.813
Neg logit contributions
eering-1.303
idge-1.286
IBLE-1.105
rients-1.087
yssey-1.082
Token person
Token ID1048
Feature activation-1.57e-01

Pos logit contributions
jud+0.063
andre+0.060
ca+0.059
imposed+0.056
traumatic+0.055
Neg logit contributions
eering-0.059
idge-0.057
IBLE-0.048
yssey-0.047
rients-0.047
Token in
Token ID287
Feature activation+1.06e-01

Pos logit contributions
jud+0.443
andre+0.417
ca+0.410
imposed+0.389
traumatic+0.379
Neg logit contributions
eering-0.646
idge-0.636
IBLE-0.550
yssey-0.538
rients-0.537
Token.
Token ID13
Feature activation+6.79e-01

Pos logit contributions
jud+0.778
andre+0.732
ca+0.706
imposed+0.676
traumatic+0.656
Neg logit contributions
eering-1.322
idge-1.307
IBLE-1.132
yssey-1.103
rients-1.100
Token 3
Token ID513
Feature activation+1.60e+00

Pos logit contributions
eering+2.557
idge+2.514
IBLE+2.130
yssey+2.046
rients+2.043
Neg logit contributions
jud-2.141
andre-2.047
ca-1.939
imposed-1.928
traumatic-1.864
Token Texas
Token ID3936
Feature activation+2.95e-01

Pos logit contributions
eering+5.856
idge+5.781
IBLE+4.874
 nodd+4.722
yssey+4.653
Neg logit contributions
jud-5.051
andre-4.878
ca-4.673
imposed-4.472
traumatic-4.456
Token at
Token ID379
Feature activation+7.69e-01

Pos logit contributions
eering+1.158
idge+1.138
IBLE+0.970
 nodd+0.946
yssey+0.944
Neg logit contributions
jud-0.892
andre-0.846
ca-0.823
imposed-0.788
traumatic-0.770
Token this
Token ID428
Feature activation+2.97e-01

Pos logit contributions
eering+2.676
idge+2.633
IBLE+2.196
 nodd+2.129
yssey+2.119
Neg logit contributions
jud-2.649
andre-2.533
ca-2.470
imposed-2.383
traumatic-2.334
Token point
Token ID966
Feature activation+2.04e+00

Pos logit contributions
eering+0.966
idge+0.946
IBLE+0.778
 nodd+0.754
yssey+0.753
Neg logit contributions
jud-1.096
andre-1.049
ca-1.026
imposed-0.992
traumatic-0.971
Token.
Token ID13
Feature activation-1.28e-01

Pos logit contributions
eering+7.543
idge+7.451
IBLE+6.273
rients+6.204
 nodd+6.075
Neg logit contributions
jud-5.984
andre-5.805
ca-5.455
imposed-5.392
traumatic-5.215
Token\n
Token ID198
Feature activation+3.22e-01

Pos logit contributions
jud+0.388
andre+0.368
ca+0.364
imposed+0.342
traumatic+0.334
Neg logit contributions
eering-0.501
idge-0.494
 nodd-0.421
IBLE-0.420
yssey-0.413
Token\n
Token ID198
Feature activation+8.93e-02

Pos logit contributions
eering+1.190
idge+1.173
IBLE+0.984
rients+0.938
yssey+0.932
Neg logit contributions
jud-1.028
andre-0.990
imposed-0.942
ca-0.931
traumatic-0.915
TokenEspecially
Token ID48464
Feature activation-5.37e-04

Pos logit contributions
eering+0.379
idge+0.373
IBLE+0.323
 nodd+0.315
yssey+0.314
Neg logit contributions
jud-0.240
andre-0.226
ca-0.220
imposed-0.209
traumatic-0.204
Token if
Token ID611
Feature activation-8.51e-01

Pos logit contributions
jud+0.002
andre+0.001
ca+0.001
imposed+0.001
traumatic+0.001
Neg logit contributions
eering-0.002
idge-0.002
IBLE-0.002
 nodd-0.002
yssey-0.002
Token the
Token ID262
Feature activation+4.37e-01

Pos logit contributions
eering+1.954
idge+1.920
IBLE+1.620
 nodd+1.577
yssey+1.572
Neg logit contributions
jud-1.721
andre-1.641
ca-1.598
imposed-1.540
traumatic-1.510
Token Singapore
Token ID12551
Feature activation+1.82e+00

Pos logit contributions
eering+1.495
idge+1.468
IBLE+1.221
 nodd+1.186
yssey+1.182
Neg logit contributions
jud-1.537
andre-1.468
ca-1.440
imposed-1.384
traumatic-1.355
Tokenan
Token ID272
Feature activation-9.37e-01

Pos logit contributions
eering+7.033
idge+6.914
IBLE+5.874
 nodd+5.709
yssey+5.709
Neg logit contributions
jud-5.197
andre-4.981
traumatic-4.636
ca-4.629
imposed-4.587
Token version
Token ID2196
Feature activation-3.96e-01

Pos logit contributions
jud+3.071
ca+2.903
andre+2.886
imposed+2.743
cas+2.639
Neg logit contributions
eering-3.365
idge-3.306
IBLE-2.804
rients-2.749
yssey-2.728
Token is
Token ID318
Feature activation+4.22e-01

Pos logit contributions
jud+1.301
andre+1.235
ca+1.225
imposed+1.162
traumatic+1.132
Neg logit contributions
eering-1.427
idge-1.407
IBLE-1.195
 nodd-1.162
yssey-1.158
Token similar
Token ID2092
Feature activation+3.54e-01

Pos logit contributions
eering+1.595
idge+1.568
IBLE+1.329
 nodd+1.296
yssey+1.291
Neg logit contributions
jud-1.334
andre-1.269
ca-1.238
imposed-1.187
traumatic-1.162
Token to
Token ID284
Feature activation+4.74e-01

Pos logit contributions
eering+1.454
idge+1.432
IBLE+1.233
 nodd+1.203
yssey+1.198
Neg logit contributions
jud-1.002
andre-0.948
ca-0.920
imposed-0.879
traumatic-0.858
Token the
Token ID262
Feature activation+4.50e-01

Pos logit contributions
eering+1.758
idge+1.728
IBLE+1.460
 nodd+1.422
yssey+1.416
Neg logit contributions
jud-1.527
andre-1.454
ca-1.419
imposed-1.363
traumatic-1.335
Token current
Token ID1459
Feature activation+1.00e+00

Pos logit contributions
eering+1.587
idge+1.560
IBLE+1.304
 nodd+1.269
yssey+1.264
Neg logit contributions
jud-1.535
andre-1.464
ca-1.433
imposed-1.377
traumatic-1.347
Token Port
Token ID4347
Feature activation+2.10e-01

Pos logit contributions
eering+3.550
idge+3.475
IBLE+2.910
 nodd+2.840
yssey+2.801
Neg logit contributions
jud-3.339
andre-3.213
ca-3.101
imposed-3.007
traumatic-2.967
Tokenrait
Token ID12907
Feature activation-1.53e+00

Pos logit contributions
eering+0.756
idge+0.742
IBLE+0.631
 nodd+0.614
yssey+0.608
Neg logit contributions
jud-0.697
andre-0.663
ca-0.654
imposed-0.624
traumatic-0.610
Token!
Token ID0
Feature activation+2.42e-01

Pos logit contributions
jud+0.051
andre+0.048
ca+0.048
imposed+0.045
traumatic+0.044
Neg logit contributions
eering-0.071
idge-0.069
IBLE-0.059
 nodd-0.058
rients-0.058
Token (
Token ID357
Feature activation+1.01e+00

Pos logit contributions
eering+0.987
idge+0.970
IBLE+0.832
 nodd+0.816
yssey+0.813
Neg logit contributions
jud-0.690
andre-0.651
ca-0.637
imposed-0.605
traumatic-0.589
Token9
Token ID24
Feature activation+7.02e-01

Pos logit contributions
eering+4.514
idge+4.493
IBLE+3.923
rients+3.788
yssey+3.779
Neg logit contributions
jud-2.287
andre-2.168
imposed-2.012
ca-1.987
traumatic-1.921
Token)
Token ID8
Feature activation+1.07e+00

Pos logit contributions
eering+2.350
idge+2.310
IBLE+1.916
yssey+1.829
rients+1.829
Neg logit contributions
jud-2.478
andre-2.382
ca-2.280
imposed-2.268
traumatic-2.220
Token\n
Token ID198
Feature activation+7.45e-01

Pos logit contributions
eering+4.079
idge+4.033
IBLE+3.446
 nodd+3.348
rients+3.316
Neg logit contributions
jud-3.263
andre-3.106
ca-2.979
imposed-2.893
traumatic-2.831
Token\n
Token ID198
Feature activation+3.18e-01

Pos logit contributions
eering+2.659
idge+2.607
IBLE+2.171
rients+2.059
yssey+2.044
Neg logit contributions
jud-2.430
andre-2.358
imposed-2.268
ca-2.190
traumatic-2.189
TokenSurv
Token ID34652
Feature activation-3.01e-01

Pos logit contributions
eering+1.314
idge+1.295
IBLE+1.115
yssey+1.084
rients+1.082
Neg logit contributions
jud-0.887
andre-0.841
ca-0.814
imposed-0.780
traumatic-0.761
Tokenive
Token ID425
Feature activation-4.51e-01

Pos logit contributions
jud+0.867
andre+0.811
ca+0.794
imposed+0.750
traumatic+0.735
Neg logit contributions
eering-1.224
idge-1.192
 nodd-1.029
IBLE-1.028
rients-1.012
Token the
Token ID262
Feature activation-6.40e-01

Pos logit contributions
jud+1.384
andre+1.301
ca+1.287
imposed+1.222
traumatic+1.208
Neg logit contributions
eering-1.726
idge-1.701
IBLE-1.453
yssey-1.421
 nodd-1.416
Token manual
Token ID10107
Feature activation-5.81e-01

Pos logit contributions
jud+2.170
andre+2.062
ca+2.034
imposed+1.928
traumatic+1.913
Neg logit contributions
eering-2.220
idge-2.193
IBLE-1.837
rients-1.797
 nodd-1.787
Token override
Token ID20957
Feature activation-3.39e-01

Pos logit contributions
jud+1.957
andre+1.842
ca+1.834
imposed+1.745
traumatic+1.716
Neg logit contributions
eering-2.032
idge-2.010
IBLE-1.695
 nodd-1.653
rients-1.650
Tokenits
Token ID896
Feature activation+1.17e-02

Pos logit contributions
eering+0.286
idge+0.286
IBLE+0.243
yssey+0.238
rients+0.238
Neg logit contributions
jud-0.189
andre-0.180
ca-0.173
imposed-0.166
traumatic-0.163
Token.
Token ID13
Feature activation+3.04e-01

Pos logit contributions
eering+0.050
idge+0.049
IBLE+0.042
 nodd+0.041
rients+0.041
Neg logit contributions
jud-0.032
andre-0.030
ca-0.029
imposed-0.028
traumatic-0.027
Token He
Token ID679
Feature activation+3.49e-01

Pos logit contributions
eering+1.138
idge+1.120
IBLE+0.947
 nodd+0.926
yssey+0.921
Neg logit contributions
jud-0.970
andre-0.922
ca-0.902
imposed-0.863
traumatic-0.844
Token has
Token ID468
Feature activation+1.84e-01

Pos logit contributions
eering+1.254
idge+1.228
IBLE+1.037
 nodd+1.008
yssey+1.000
Neg logit contributions
jud-1.170
andre-1.121
ca-1.095
imposed-1.051
traumatic-1.024
Token expressed
Token ID6241
Feature activation+1.09e+00

Pos logit contributions
eering+0.703
idge+0.692
IBLE+0.588
 nodd+0.573
yssey+0.571
Neg logit contributions
jud-0.575
andre-0.546
ca-0.533
imposed-0.511
traumatic-0.499
Token his
Token ID465
Feature activation+6.83e-01

Pos logit contributions
eering+3.970
idge+3.877
IBLE+3.270
 nodd+3.214
yssey+3.160
Neg logit contributions
jud-3.512
andre-3.357
ca-3.264
imposed-3.135
traumatic-3.080
Token sadness
Token ID25303
Feature activation+9.35e-02

Pos logit contributions
eering+2.422
idge+2.374
IBLE+1.991
 nodd+1.938
yssey+1.927
Neg logit contributions
jud-2.303
andre-2.200
ca-2.148
imposed-2.064
traumatic-2.028
Token and
Token ID290
Feature activation-3.20e-01

Pos logit contributions
eering+0.392
idge+0.386
IBLE+0.333
 nodd+0.326
rients+0.325
Neg logit contributions
jud-0.256
andre-0.242
ca-0.235
imposed-0.224
traumatic-0.218
Token displeasure
Token ID47313
Feature activation+8.34e-02

Pos logit contributions
jud+0.994
andre+0.942
ca+0.924
imposed+0.888
traumatic+0.861
Neg logit contributions
eering-1.220
idge-1.206
IBLE-1.026
rients-1.003
 nodd-0.995
Token at
Token ID379
Feature activation+3.37e-01

Pos logit contributions
eering+0.346
idge+0.341
IBLE+0.294
 nodd+0.287
yssey+0.286
Neg logit contributions
jud-0.233
andre-0.220
ca-0.214
imposed-0.204
traumatic-0.198
Token how
Token ID703
Feature activation-4.32e-01

Pos logit contributions
eering+1.253
idge+1.231
IBLE+1.041
 nodd+1.014
yssey+1.009
Neg logit contributions
jud-1.079
andre-1.029
ca-1.003
imposed-0.963
traumatic-0.944
Token you
Token ID345
Feature activation-1.33e-01

Pos logit contributions
eering+0.373
idge+0.367
IBLE+0.315
 nodd+0.308
yssey+0.307
Neg logit contributions
jud-0.265
andre-0.251
ca-0.244
imposed-0.233
traumatic-0.228
Token
Token ID447
Feature activation-2.40e-01

Pos logit contributions
jud+0.400
andre+0.378
ca+0.372
imposed+0.352
traumatic+0.344
Neg logit contributions
eering-0.523
idge-0.516
IBLE-0.443
rients-0.430
 nodd-0.430
Token
Token ID247
Feature activation-4.01e-02

Pos logit contributions
jud+0.761
andre+0.724
ca+0.709
imposed+0.674
traumatic+0.664
Neg logit contributions
eering-0.907
idge-0.886
IBLE-0.753
 nodd-0.747
yssey-0.733
Tokenre
Token ID260
Feature activation-1.04e+00

Pos logit contributions
jud+0.104
andre+0.098
ca+0.096
imposed+0.090
traumatic+0.087
Neg logit contributions
eering-0.175
idge-0.171
IBLE-0.150
 nodd-0.147
rients-0.146
Token craft
Token ID5977
Feature activation-3.71e-02

Pos logit contributions
jud+3.133
ca+2.952
andre+2.941
imposed+2.748
cas+2.721
Neg logit contributions
eering-3.939
idge-3.879
IBLE-3.346
rients-3.272
yssey-3.255
Token beer
Token ID6099
Feature activation+5.22e-01

Pos logit contributions
jud+0.148
andre+0.142
ca+0.140
imposed+0.135
traumatic+0.133
Neg logit contributions
eering-0.109
idge-0.107
IBLE-0.086
 nodd-0.083
rients-0.083
Token new
Token ID649
Feature activation-5.55e-01

Pos logit contributions
eering+2.077
idge+2.045
IBLE+1.744
 nodd+1.708
yssey+1.697
Neg logit contributions
jud-1.537
andre-1.461
ca-1.417
imposed-1.361
traumatic-1.328
Tokenbie
Token ID12590
Feature activation-1.87e-02

Pos logit contributions
jud+2.089
andre+2.006
ca+1.994
imposed+1.891
cas+1.858
Neg logit contributions
eering-1.762
idge-1.696
IBLE-1.426
yssey-1.368
rients-1.356
Token or
Token ID393
Feature activation+5.32e-02

Pos logit contributions
jud+0.048
andre+0.045
ca+0.044
imposed+0.042
traumatic+0.041
Neg logit contributions
eering-0.081
idge-0.080
IBLE-0.069
 nodd-0.068
yssey-0.068
Token a
Token ID257
Feature activation+1.24e-01

Pos logit contributions
eering+0.197
idge+0.194
IBLE+0.164
 nodd+0.159
rients+0.159
Neg logit contributions
jud-0.171
andre-0.163
ca-0.160
imposed-0.153
traumatic-0.149
Token veteran
Token ID9298
Feature activation+1.08e-01

Pos logit contributions
eering+0.429
idge+0.421
IBLE+0.352
 nodd+0.341
rients+0.341
Neg logit contributions
jud-0.433
andre-0.413
ca-0.405
imposed-0.388
traumatic-0.380
Token than
Token ID621
Feature activation+1.00e-01

Pos logit contributions
eering+0.217
idge+0.214
IBLE+0.185
 nodd+0.180
yssey+0.180
Neg logit contributions
jud-0.144
andre-0.136
ca-0.132
imposed-0.126
traumatic-0.123
Token the
Token ID262
Feature activation+2.45e-01

Pos logit contributions
eering+0.384
idge+0.378
IBLE+0.321
 nodd+0.313
yssey+0.312
Neg logit contributions
jud-0.313
andre-0.298
ca-0.290
imposed-0.278
traumatic-0.272
Token Dolphins
Token ID21668
Feature activation-2.43e-01

Pos logit contributions
eering+0.885
idge+0.870
IBLE+0.732
yssey+0.713
 nodd+0.713
Neg logit contributions
jud-0.818
andre-0.781
ca-0.759
imposed-0.731
traumatic-0.714
Token.
Token ID13
Feature activation+1.40e-01

Pos logit contributions
jud+0.632
andre+0.595
ca+0.577
imposed+0.546
traumatic+0.532
Neg logit contributions
eering-1.051
idge-1.035
IBLE-0.901
 nodd-0.882
rients-0.880
Token Only
Token ID5514
Feature activation+9.42e-02

Pos logit contributions
eering+0.495
idge+0.487
IBLE+0.407
 nodd+0.399
yssey+0.396
Neg logit contributions
jud-0.473
andre-0.453
ca-0.443
imposed-0.424
traumatic-0.416
Token three
Token ID1115
Feature activation+1.24e+00

Pos logit contributions
eering+0.361
idge+0.355
IBLE+0.302
 nodd+0.294
rients+0.294
Neg logit contributions
jud-0.292
andre-0.278
ca-0.271
imposed-0.259
traumatic-0.254
Token quarterbacks
Token ID23643
Feature activation+5.90e-01

Pos logit contributions
eering+4.099
idge+4.047
IBLE+3.353
 nodd+3.260
rients+3.187
Neg logit contributions
jud-4.435
andre-4.212
ca-4.125
imposed-3.988
traumatic-3.927
Token have
Token ID423
Feature activation+9.34e-01

Pos logit contributions
eering+2.078
idge+2.047
IBLE+1.701
yssey+1.656
 nodd+1.651
Neg logit contributions
jud-2.004
andre-1.916
ca-1.861
imposed-1.802
traumatic-1.768
Token scored
Token ID7781
Feature activation+1.03e-01

Pos logit contributions
eering+3.532
idge+3.483
IBLE+2.945
 nodd+2.858
yssey+2.846
Neg logit contributions
jud-2.910
andre-2.768
ca-2.625
imposed-2.562
traumatic-2.530
Token at
Token ID379
Feature activation+4.63e-01

Pos logit contributions
eering+0.383
idge+0.377
IBLE+0.319
 nodd+0.311
yssey+0.309
Neg logit contributions
jud-0.329
andre-0.313
ca-0.305
imposed-0.293
traumatic-0.287
Token least
Token ID1551
Feature activation-1.23e+00

Pos logit contributions
eering+1.657
idge+1.630
IBLE+1.367
 nodd+1.329
yssey+1.324
Neg logit contributions
jud-1.554
andre-1.482
ca-1.447
imposed-1.394
traumatic-1.364
Token Desc
Token ID39373
Feature activation+1.91e-01

Pos logit contributions
eering+3.351
idge+3.290
IBLE+2.786
yssey+2.712
 nodd+2.712
Neg logit contributions
jud-2.743
andre-2.620
ca-2.530
imposed-2.450
traumatic-2.389
Tokenends
Token ID2412
Feature activation+1.24e-01

Pos logit contributions
eering+0.584
idge+0.571
IBLE+0.463
 nodd+0.450
yssey+0.445
Neg logit contributions
jud-0.743
andre-0.716
ca-0.702
imposed-0.676
traumatic-0.666
Token on
Token ID319
Feature activation-1.77e-01

Pos logit contributions
eering+0.478
idge+0.470
IBLE+0.400
 nodd+0.391
yssey+0.389
Neg logit contributions
jud-0.386
andre-0.366
ca-0.358
imposed-0.342
traumatic-0.335
Token Large
Token ID13601
Feature activation-4.78e-01

Pos logit contributions
jud+0.573
andre+0.545
ca+0.537
imposed+0.508
traumatic+0.500
Neg logit contributions
eering-0.650
idge-0.639
IBLE-0.541
 nodd-0.531
yssey-0.527
Token,
Token ID11
Feature activation-6.45e-02

Pos logit contributions
jud+1.575
andre+1.494
ca+1.484
imposed+1.399
traumatic+1.379
Neg logit contributions
eering-1.723
idge-1.695
IBLE-1.444
 nodd-1.420
yssey-1.408
Token Conf
Token ID7326
Feature activation-3.29e-01

Pos logit contributions
jud+0.218
andre+0.208
ca+0.205
imposed+0.195
traumatic+0.192
Neg logit contributions
eering-0.228
idge-0.224
IBLE-0.188
 nodd-0.185
yssey-0.184
Tokenused
Token ID1484
Feature activation-9.48e-01

Pos logit contributions
jud+0.957
andre+0.906
ca+0.885
imposed+0.836
traumatic+0.815
Neg logit contributions
eering-1.333
idge-1.304
IBLE-1.127
 nodd-1.113
yssey-1.107
Token,
Token ID11
Feature activation-5.95e-01

Pos logit contributions
jud+3.040
ca+2.870
andre+2.861
imposed+2.643
traumatic+2.636
Neg logit contributions
eering-3.435
idge-3.406
IBLE-2.895
 nodd-2.880
yssey-2.855
Token Nation
Token ID8741
Feature activation+5.75e-01

Pos logit contributions
jud+1.922
andre+1.815
ca+1.809
imposed+1.697
traumatic+1.683
Neg logit contributions
eering-2.180
idge-2.159
IBLE-1.823
 nodd-1.806
yssey-1.791
Token\n
Token ID198
Feature activation+2.75e-01

Pos logit contributions
eering+2.048
idge+2.012
IBLE+1.687
 nodd+1.641
yssey+1.634
Neg logit contributions
jud-1.939
andre-1.850
ca-1.808
imposed-1.739
traumatic-1.704
Token\n
Token ID198
Feature activation-9.81e-02

Pos logit contributions
eering+1.074
idge+1.058
IBLE+0.914
rients+0.873
yssey+0.872
Neg logit contributions
jud-0.825
andre-0.787
ca-0.751
imposed-0.735
traumatic-0.713
Token it
Token ID340
Feature activation+1.43e-01

Pos logit contributions
eering+4.045
idge+3.932
IBLE+3.321
 nodd+3.228
rients+3.207
Neg logit contributions
jud-3.477
andre-3.298
ca-3.187
imposed-3.075
traumatic-3.028
Token;
Token ID26
Feature activation-5.25e-01

Pos logit contributions
eering+0.613
idge+0.605
IBLE+0.524
 nodd+0.512
yssey+0.511
Neg logit contributions
jud-0.382
andre-0.360
ca-0.351
imposed-0.332
traumatic-0.323
Token this
Token ID428
Feature activation-1.41e-01

Pos logit contributions
jud+1.532
andre+1.457
ca+1.432
imposed+1.357
traumatic+1.318
Neg logit contributions
eering-2.080
idge-2.056
IBLE-1.776
yssey-1.731
rients-1.727
Token gives
Token ID3607
Feature activation-2.44e-01

Pos logit contributions
jud+0.467
andre+0.445
ca+0.436
imposed+0.419
traumatic+0.408
Neg logit contributions
eering-0.509
idge-0.505
IBLE-0.426
rients-0.413
yssey-0.412
Token the
Token ID262
Feature activation+2.19e-02

Pos logit contributions
jud+0.817
andre+0.783
ca+0.767
imposed+0.731
traumatic+0.716
Neg logit contributions
eering-0.870
idge-0.855
IBLE-0.722
yssey-0.704
rients-0.700
Token Yes
Token ID3363
Feature activation-8.05e-01

Pos logit contributions
eering+0.079
idge+0.078
IBLE+0.066
 nodd+0.064
yssey+0.064
Neg logit contributions
jud-0.073
andre-0.069
ca-0.068
imposed-0.065
traumatic-0.064
Token campaign
Token ID1923
Feature activation-5.27e-01

Pos logit contributions
jud+3.199
andre+3.097
ca+3.089
cas+2.889
imposed+2.886
Neg logit contributions
eering-2.307
idge-2.219
IBLE-1.823
rients-1.758
yssey-1.726
Token ample
Token ID23933
Feature activation-9.00e-02

Pos logit contributions
jud+1.630
andre+1.555
ca+1.526
imposed+1.459
traumatic+1.417
Neg logit contributions
eering-1.990
idge-1.975
IBLE-1.680
yssey-1.642
rients-1.637
Token political
Token ID1964
Feature activation-8.43e-01

Pos logit contributions
jud+0.267
andre+0.252
ca+0.247
imposed+0.235
traumatic+0.229
Neg logit contributions
eering-0.355
idge-0.351
IBLE-0.300
yssey-0.294
 nodd-0.293
Token space
Token ID2272
Feature activation+2.59e-01

Pos logit contributions
jud+2.706
ca+2.585
andre+2.583
imposed+2.393
traumatic+2.377
Neg logit contributions
idge-3.025
eering-2.990
IBLE-2.543
yssey-2.499
 nodd-2.487
Token to
Token ID284
Feature activation+2.57e-02

Pos logit contributions
eering+1.076
idge+1.059
IBLE+0.912
 nodd+0.891
yssey+0.888
Neg logit contributions
jud-0.718
andre-0.678
ca-0.659
imposed-0.628
traumatic-0.613
Tokenicester
Token ID26382
Feature activation-6.38e-01

Pos logit contributions
jud+1.291
andre+1.231
ca+1.194
imposed+1.088
cas+1.053
Neg logit contributions
eering-2.299
idge-2.271
IBLE-1.990
 nodd-1.969
rients-1.944
Token have
Token ID423
Feature activation-2.06e-01

Pos logit contributions
jud+1.953
andre+1.844
ca+1.821
imposed+1.717
traumatic+1.688
Neg logit contributions
eering-2.459
idge-2.417
IBLE-2.068
 nodd-2.017
rients-2.008
Token an
Token ID281
Feature activation+2.39e-02

Pos logit contributions
jud+0.615
andre+0.580
ca+0.569
imposed+0.543
traumatic+0.529
Neg logit contributions
eering-0.817
idge-0.804
IBLE-0.687
 nodd-0.671
yssey-0.670
Token excellent
Token ID6275
Feature activation+6.60e-01

Pos logit contributions
eering+0.087
idge+0.085
IBLE+0.072
 nodd+0.070
rients+0.070
Neg logit contributions
jud-0.078
andre-0.075
ca-0.073
imposed-0.070
traumatic-0.069
Token home
Token ID1363
Feature activation-9.85e-01

Pos logit contributions
eering+2.209
idge+2.171
IBLE+1.793
 nodd+1.741
yssey+1.734
Neg logit contributions
jud-2.362
andre-2.261
ca-2.212
imposed-2.136
traumatic-2.095
Token record
Token ID1700
Feature activation-3.84e-01

Pos logit contributions
jud+3.367
ca+3.210
andre+3.196
imposed+2.996
traumatic+2.910
Neg logit contributions
eering-3.396
idge-3.266
IBLE-2.818
 nodd-2.753
rients-2.752
Token in
Token ID287
Feature activation-5.98e-03

Pos logit contributions
jud+1.026
andre+0.974
ca+0.944
imposed+0.889
traumatic+0.868
Neg logit contributions
eering-1.632
idge-1.607
IBLE-1.401
 nodd-1.375
rients-1.368
Token this
Token ID428
Feature activation-7.96e-02

Pos logit contributions
jud+0.018
andre+0.018
ca+0.017
imposed+0.016
traumatic+0.016
Neg logit contributions
eering-0.023
idge-0.023
IBLE-0.019
 nodd-0.019
yssey-0.019
Token competition
Token ID5449
Feature activation+7.68e-01

Pos logit contributions
jud+0.254
andre+0.241
ca+0.237
imposed+0.226
traumatic+0.219
Neg logit contributions
eering-0.296
idge-0.292
IBLE-0.248
 nodd-0.241
rients-0.241
Token and
Token ID290
Feature activation-4.07e-02

Pos logit contributions
eering+3.208
idge+3.156
IBLE+2.712
yssey+2.646
 nodd+2.645
Neg logit contributions
jud-2.098
andre-1.981
ca-1.913
imposed-1.840
traumatic-1.794
Token we
Token ID356
Feature activation+3.03e-01

Pos logit contributions
jud+0.122
andre+0.116
ca+0.113
imposed+0.107
traumatic+0.105
Neg logit contributions
eering-0.161
idge-0.158
IBLE-0.135
 nodd-0.132
yssey-0.131
Token <
Token ID1279
Feature activation-3.84e-01

Pos logit contributions
eering+0.578
idge+0.570
IBLE+0.498
 nodd+0.489
yssey+0.487
Neg logit contributions
jud-0.298
andre-0.278
ca-0.270
imposed-0.254
traumatic-0.246
Token num
Token ID997
Feature activation-7.12e-01

Pos logit contributions
jud+1.198
andre+1.128
ca+1.121
imposed+1.065
traumatic+1.035
Neg logit contributions
eering-1.466
idge-1.438
IBLE-1.215
yssey-1.190
 nodd-1.181
TokenPart
Token ID7841
Feature activation+1.07e+00

Pos logit contributions
jud+1.411
andre+1.286
ca+1.251
imposed+1.163
traumatic+1.114
Neg logit contributions
eering-3.537
idge-3.476
IBLE-3.066
 nodd-3.025
yssey-3.022
Tokenicles
Token ID2983
Feature activation+6.38e-02

Pos logit contributions
idge+4.321
eering+4.237
rients+3.627
IBLE+3.613
yssey+3.505
Neg logit contributions
jud-3.045
andre-2.916
ca-2.674
imposed-2.669
traumatic-2.591
Token ;
Token ID2162
Feature activation+1.50e-01

Pos logit contributions
eering+0.284
idge+0.280
IBLE+0.244
 nodd+0.239
yssey+0.238
Neg logit contributions
jud-0.158
andre-0.149
ca-0.144
imposed-0.136
traumatic-0.133
Token ++
Token ID19969
Feature activation-1.10e+00

Pos logit contributions
eering+0.647
idge+0.637
IBLE+0.552
 nodd+0.540
yssey+0.539
Neg logit contributions
jud-0.392
andre-0.368
ca-0.358
imposed-0.340
traumatic-0.330
Token u
Token ID334
Feature activation+4.64e-01

Pos logit contributions
jud+2.819
ca+2.553
andre+2.549
imposed+2.383
traumatic+2.301
Neg logit contributions
eering-4.868
idge-4.704
 nodd-4.091
IBLE-4.090
rients-4.018
TokenPart
Token ID7841
Feature activation+8.11e-01

Pos logit contributions
eering+2.152
idge+2.140
IBLE+1.886
rients+1.834
yssey+1.825
Neg logit contributions
jud-1.031
andre-0.973
ca-0.918
imposed-0.890
traumatic-0.856
Tokenicle
Token ID1548
Feature activation-1.15e-01

Pos logit contributions
idge+2.811
eering+2.769
IBLE+2.268
yssey+2.207
rients+2.203
Neg logit contributions
jud-2.770
andre-2.680
imposed-2.515
ca-2.506
traumatic-2.455
Token )
Token ID1267
Feature activation+5.76e-02

Pos logit contributions
jud+0.377
andre+0.358
ca+0.351
imposed+0.336
traumatic+0.328
Neg logit contributions
eering-0.423
idge-0.414
IBLE-0.348
 nodd-0.341
rients-0.338
Token {
Token ID1391
Feature activation-2.65e-02

Pos logit contributions
eering+0.253
idge+0.249
IBLE+0.216
 nodd+0.211
yssey+0.211
Neg logit contributions
jud-0.147
andre-0.137
ca-0.134
imposed-0.127
traumatic-0.123
Token
Token ID198
Feature activation+4.22e-02

Pos logit contributions
eering+0.800
idge+0.787
IBLE+0.662
rients+0.637
yssey+0.635
Neg logit contributions
jud-0.658
andre-0.635
imposed-0.599
ca-0.598
traumatic-0.582
TokenSad
Token ID26699
Feature activation-6.95e-01

Pos logit contributions
eering+0.179
idge+0.176
IBLE+0.152
 nodd+0.149
yssey+0.148
Neg logit contributions
jud-0.114
andre-0.107
ca-0.104
imposed-0.099
traumatic-0.096
Tokenain
Token ID391
Feature activation+4.09e-01

Pos logit contributions
jud+2.051
andre+1.941
ca+1.906
imposed+1.793
traumatic+1.751
Neg logit contributions
eering-2.778
idge-2.711
IBLE-2.365
 nodd-2.344
rients-2.268
Token noted
Token ID4367
Feature activation+3.40e-01

Pos logit contributions
eering+1.499
idge+1.471
IBLE+1.242
 nodd+1.211
yssey+1.204
Neg logit contributions
jud-1.339
andre-1.275
ca-1.247
imposed-1.196
traumatic-1.171
Token that
Token ID326
Feature activation-3.46e-01

Pos logit contributions
eering+1.292
idge+1.271
IBLE+1.079
 nodd+1.051
yssey+1.048
Neg logit contributions
jud-1.068
andre-1.014
ca-0.991
imposed-0.950
traumatic-0.928
Token an
Token ID281
Feature activation+1.15e-01

Pos logit contributions
jud+1.049
andre+0.979
ca+0.973
imposed+0.929
traumatic+0.897
Neg logit contributions
eering-1.353
idge-1.341
IBLE-1.145
rients-1.114
yssey-1.113
Token education
Token ID3707
Feature activation+1.75e+00

Pos logit contributions
eering+0.413
idge+0.404
IBLE+0.339
 nodd+0.332
rients+0.331
Neg logit contributions
jud-0.388
andre-0.366
ca-0.362
imposed-0.348
traumatic-0.338
Token department
Token ID5011
Feature activation+3.74e-01

Pos logit contributions
eering+5.803
idge+5.600
IBLE+4.669
yssey+4.470
 nodd+4.407
Neg logit contributions
jud-6.224
andre-6.040
ca-5.822
traumatic-5.618
imposed-5.608
Token policy
Token ID2450
Feature activation-3.89e-01

Pos logit contributions
eering+1.380
idge+1.357
IBLE+1.145
 nodd+1.115
yssey+1.111
Neg logit contributions
jud-1.214
andre-1.155
ca-1.129
imposed-1.085
traumatic-1.060
Token states
Token ID2585
Feature activation+5.21e-01

Pos logit contributions
jud+1.245
andre+1.174
ca+1.165
imposed+1.117
traumatic+1.076
Neg logit contributions
eering-1.432
idge-1.430
IBLE-1.219
 nodd-1.187
yssey-1.174
Token that
Token ID326
Feature activation-1.79e-01

Pos logit contributions
eering+1.958
idge+1.924
IBLE+1.629
 nodd+1.588
yssey+1.581
Neg logit contributions
jud-1.656
andre-1.577
ca-1.537
imposed-1.475
traumatic-1.444
Token,
Token ID11
Feature activation-7.19e-01

Pos logit contributions
jud+1.485
andre+1.392
ca+1.357
imposed+1.279
traumatic+1.239
Neg logit contributions
eering-2.666
idge-2.624
IBLE-2.289
yssey-2.239
 nodd-2.239
Token can
Token ID460
Feature activation-9.03e-01

Pos logit contributions
jud+2.236
ca+2.125
andre+2.112
imposed+1.997
traumatic+1.935
Neg logit contributions
eering-2.711
idge-2.681
IBLE-2.295
 nodd-2.239
rients-2.209
Token no
Token ID645
Feature activation-1.87e-01

Pos logit contributions
jud+3.420
ca+3.280
andre+3.279
imposed+3.084
traumatic+3.013
Neg logit contributions
eering-2.809
idge-2.731
IBLE-2.253
 nodd-2.180
rients-2.172
Token longer
Token ID2392
Feature activation+7.14e-02

Pos logit contributions
jud+0.574
andre+0.541
ca+0.531
imposed+0.505
traumatic+0.491
Neg logit contributions
eering-0.734
idge-0.718
IBLE-0.609
rients-0.597
 nodd-0.596
Token be
Token ID307
Feature activation-4.78e-01

Pos logit contributions
eering+0.190
idge+0.185
IBLE+0.145
 nodd+0.139
rients+0.139
Neg logit contributions
jud-0.306
andre-0.295
ca-0.290
imposed-0.281
traumatic-0.276
Token avoided
Token ID13941
Feature activation-1.30e+00

Pos logit contributions
jud+1.474
andre+1.386
ca+1.375
imposed+1.322
traumatic+1.265
Neg logit contributions
eering-1.847
idge-1.812
IBLE-1.535
 nodd-1.510
yssey-1.510
Token as
Token ID355
Feature activation-4.46e-02

Pos logit contributions
jud+3.030
andre+2.803
ca+2.793
imposed+2.571
traumatic+2.478
Neg logit contributions
eering-5.835
idge-5.758
IBLE-5.089
 nodd-5.048
yssey-5.028
Token the
Token ID262
Feature activation-2.78e-01

Pos logit contributions
jud+0.142
andre+0.135
ca+0.132
imposed+0.127
traumatic+0.123
Neg logit contributions
eering-0.167
idge-0.164
IBLE-0.139
yssey-0.136
 nodd-0.136
Token effects
Token ID3048
Feature activation-1.43e+00

Pos logit contributions
jud+0.932
andre+0.883
ca+0.868
imposed+0.827
traumatic+0.813
Neg logit contributions
eering-0.993
idge-0.976
IBLE-0.821
 nodd-0.804
yssey-0.804
Token of
Token ID286
Feature activation-1.03e+00

Pos logit contributions
jud+3.568
ca+3.339
andre+3.328
imposed+3.117
traumatic+3.003
Neg logit contributions
eering-6.071
idge-6.055
IBLE-5.249
 nodd-5.154
rients-5.149
Token a
Token ID257
Feature activation-2.41e-01

Pos logit contributions
jud+3.089
ca+2.935
andre+2.924
imposed+2.793
traumatic+2.699
Neg logit contributions
eering-3.860
idge-3.755
IBLE-3.290
yssey-3.228
 nodd-3.191
Token has
Token ID468
Feature activation-1.81e-01

Pos logit contributions
jud+3.069
andre+2.909
ca+2.891
imposed+2.716
traumatic+2.659
Neg logit contributions
eering-3.839
idge-3.755
IBLE-3.274
rients-3.156
yssey-3.153
Token had
Token ID550
Feature activation-6.88e-01

Pos logit contributions
jud+0.553
andre+0.524
ca+0.514
imposed+0.491
traumatic+0.478
Neg logit contributions
eering-0.702
idge-0.691
IBLE-0.589
yssey-0.574
rients-0.574
Token an
Token ID281
Feature activation-4.93e-01

Pos logit contributions
jud+2.088
andre+1.982
ca+1.978
imposed+1.859
traumatic+1.829
Neg logit contributions
eering-2.617
idge-2.593
IBLE-2.207
rients-2.178
yssey-2.171
Token easy
Token ID2562
Feature activation-1.17e+00

Pos logit contributions
jud+1.552
andre+1.474
ca+1.450
imposed+1.397
traumatic+1.366
Neg logit contributions
eering-1.854
idge-1.797
IBLE-1.526
rients-1.522
 nodd-1.501
Token run
Token ID1057
Feature activation-1.10e+00

Pos logit contributions
jud+3.646
andre+3.435
ca+3.409
imposed+3.241
traumatic+3.173
Neg logit contributions
eering-4.356
idge-4.225
IBLE-3.672
rients-3.623
 nodd-3.545
Token of
Token ID286
Feature activation-1.41e+00

Pos logit contributions
jud+2.877
ca+2.661
andre+2.652
imposed+2.506
traumatic+2.415
Neg logit contributions
eering-4.606
idge-4.587
IBLE-4.005
rients-3.940
 nodd-3.911
Token things
Token ID1243
Feature activation+2.18e-01

Pos logit contributions
jud+4.194
ca+3.997
andre+3.966
imposed+3.744
cas+3.679
Neg logit contributions
eering-5.323
idge-5.187
IBLE-4.407
rients-4.404
 nodd-4.366
Token lately
Token ID16537
Feature activation+9.74e-01

Pos logit contributions
eering+0.918
idge+0.904
IBLE+0.780
 nodd+0.763
yssey+0.761
Neg logit contributions
jud-0.593
andre-0.560
ca-0.544
imposed-0.518
traumatic-0.505
Token.
Token ID13
Feature activation+1.21e-01

Pos logit contributions
eering+4.054
idge+3.997
IBLE+3.428
 nodd+3.336
yssey+3.333
Neg logit contributions
jud-2.626
andre-2.508
ca-2.390
imposed-2.318
traumatic-2.253
Token Owner
Token ID23853
Feature activation+6.51e-01

Pos logit contributions
eering+0.425
idge+0.418
IBLE+0.349
 nodd+0.344
yssey+0.340
Neg logit contributions
jud-0.411
andre-0.392
ca-0.385
imposed-0.368
traumatic-0.361
Token Sylvia
Token ID45425
Feature activation+6.18e-02

Pos logit contributions
eering+2.410
idge+2.368
IBLE+1.986
 nodd+1.928
yssey+1.924
Neg logit contributions
jud-2.096
andre-1.990
ca-1.939
imposed-1.880
traumatic-1.835
Token Wait
Token ID16314
Feature activation-3.07e-01

Pos logit contributions
eering+5.105
idge+5.072
IBLE+4.284
 nodd+4.174
yssey+4.165
Neg logit contributions
jud-3.957
andre-3.767
ca-3.612
imposed-3.507
traumatic-3.437
Token...
Token ID986
Feature activation+4.36e-01

Pos logit contributions
jud+0.783
andre+0.731
ca+0.720
imposed+0.669
traumatic+0.650
Neg logit contributions
eering-1.346
idge-1.320
IBLE-1.155
 nodd-1.131
yssey-1.122
Token
Token ID198
Feature activation+4.75e-01

Pos logit contributions
eering+1.667
idge+1.641
IBLE+1.394
 nodd+1.359
yssey+1.354
Neg logit contributions
jud-1.352
andre-1.286
ca-1.253
imposed-1.201
traumatic-1.175
Token
Token ID198
Feature activation+1.95e-01

Pos logit contributions
eering+1.737
idge+1.707
IBLE+1.434
rients+1.362
yssey+1.354
Neg logit contributions
jud-1.531
andre-1.473
imposed-1.411
ca-1.381
traumatic-1.365
TokenDid
Token ID11633
Feature activation-1.01e+00

Pos logit contributions
eering+0.835
idge+0.823
IBLE+0.712
 nodd+0.696
yssey+0.695
Neg logit contributions
jud-0.518
andre-0.488
ca-0.473
imposed-0.451
traumatic-0.439
Token that
Token ID326
Feature activation-1.43e+00

Pos logit contributions
jud+2.701
ca+2.556
andre+2.551
imposed+2.311
cas+2.275
Neg logit contributions
eering-4.246
idge-4.104
IBLE-3.592
 nodd-3.545
rients-3.537
Token Pokémon
Token ID9887
Feature activation-3.70e-01

Pos logit contributions
jud+5.027
andre+4.710
ca+4.644
imposed+4.400
cas+4.320
Neg logit contributions
eering-4.753
idge-4.543
IBLE-3.916
rients-3.863
yssey-3.789
Token just
Token ID655
Feature activation-1.86e-02

Pos logit contributions
jud+1.299
andre+1.235
ca+1.209
imposed+1.149
traumatic+1.125
Neg logit contributions
eering-1.252
idge-1.232
IBLE-1.047
rients-1.012
 nodd-1.003
Token talk
Token ID1561
Feature activation+6.47e-01

Pos logit contributions
jud+0.068
andre+0.065
ca+0.064
imposed+0.061
traumatic+0.059
Neg logit contributions
eering-0.061
idge-0.060
IBLE-0.050
rients-0.048
 nodd-0.048
Token?
Token ID30
Feature activation+3.16e-01

Pos logit contributions
eering+2.657
idge+2.620
IBLE+2.247
 nodd+2.200
yssey+2.188
Neg logit contributions
jud-1.827
andre-1.722
ca-1.673
imposed-1.606
traumatic-1.566
Token
Token ID198
Feature activation+4.94e-01

Pos logit contributions
eering+1.227
idge+1.207
IBLE+1.028
 nodd+1.002
yssey+0.999
Neg logit contributions
jud-0.964
andre-0.915
ca-0.891
imposed-0.854
traumatic-0.835
Token In
Token ID554
Feature activation-4.86e-02

Pos logit contributions
eering+3.058
idge+3.009
IBLE+2.547
rients+2.457
yssey+2.454
Neg logit contributions
jud-2.675
andre-2.557
ca-2.464
imposed-2.405
traumatic-2.342
Token 2002
Token ID6244
Feature activation+2.92e-04

Pos logit contributions
jud+0.155
andre+0.147
ca+0.144
imposed+0.137
traumatic+0.134
Neg logit contributions
eering-0.184
idge-0.180
IBLE-0.153
 nodd-0.149
yssey-0.148
Token,
Token ID11
Feature activation-4.17e-03

Pos logit contributions
eering+0.001
idge+0.001
IBLE+0.001
 nodd+0.001
yssey+0.001
Neg logit contributions
jud-0.001
andre-0.001
ca-0.001
imposed-0.001
traumatic-0.001
Token N
Token ID399
Feature activation+5.38e-02

Pos logit contributions
jud+0.013
andre+0.012
ca+0.012
imposed+0.011
traumatic+0.011
Neg logit contributions
eering-0.016
idge-0.016
IBLE-0.014
 nodd-0.013
rients-0.013
Tokenana
Token ID2271
Feature activation+2.49e-02

Pos logit contributions
eering+0.218
idge+0.209
IBLE+0.181
 nodd+0.178
rients+0.177
Neg logit contributions
jud-0.159
andre-0.149
ca-0.148
imposed-0.140
traumatic-0.135
Token won
Token ID1839
Feature activation-1.36e+00

Pos logit contributions
eering+0.095
idge+0.093
IBLE+0.080
 nodd+0.078
rients+0.077
Neg logit contributions
jud-0.078
andre-0.074
ca-0.073
imposed-0.069
traumatic-0.067
Token the
Token ID262
Feature activation-7.42e-01

Pos logit contributions
jud+4.088
andre+3.863
ca+3.836
imposed+3.591
traumatic+3.429
Neg logit contributions
eering-5.200
idge-5.063
IBLE-4.342
 nodd-4.280
rients-4.269
Token Sh
Token ID911
Feature activation-1.43e-01

Pos logit contributions
jud+2.422
andre+2.292
ca+2.269
imposed+2.129
traumatic+2.058
Neg logit contributions
eering-2.714
idge-2.642
IBLE-2.234
 nodd-2.213
rients-2.208
Tokenog
Token ID519
Feature activation+6.32e-01

Pos logit contributions
jud+0.433
andre+0.409
ca+0.400
imposed+0.381
traumatic+0.369
Neg logit contributions
eering-0.562
idge-0.542
IBLE-0.471
 nodd-0.464
rients-0.457
Tokenak
Token ID461
Feature activation-2.43e-02

Pos logit contributions
eering+2.470
idge+2.428
IBLE+2.065
yssey+2.023
 nodd+2.008
Neg logit contributions
jud-1.887
andre-1.801
ca-1.746
imposed-1.690
traumatic-1.653
Tokenuk
Token ID2724
Feature activation-2.37e-03

Pos logit contributions
jud+0.071
andre+0.065
ca+0.065
imposed+0.060
traumatic+0.058
Neg logit contributions
eering-0.102
idge-0.098
IBLE-0.085
rients-0.083
 nodd-0.083
Token though
Token ID996
Feature activation+4.33e-01

Pos logit contributions
jud+0.145
andre+0.138
ca+0.135
imposed+0.129
traumatic+0.126
Neg logit contributions
eering-0.175
idge-0.173
IBLE-0.148
rients-0.144
yssey-0.144
Token.
Token ID13
Feature activation+3.17e-01

Pos logit contributions
eering+1.674
idge+1.643
IBLE+1.397
 nodd+1.363
yssey+1.351
Neg logit contributions
jud-1.322
andre-1.260
ca-1.223
imposed-1.174
traumatic-1.151
Token
Token ID198
Feature activation+2.52e-01

Pos logit contributions
eering+1.162
idge+1.143
IBLE+0.963
 nodd+0.941
yssey+0.936
Neg logit contributions
jud-1.037
andre-0.988
ca-0.966
imposed-0.926
traumatic-0.907
Token
Token ID198
Feature activation-2.45e-02

Pos logit contributions
eering+0.945
idge+0.928
IBLE+0.783
rients+0.751
yssey+0.747
Neg logit contributions
jud-0.792
andre-0.762
imposed-0.723
ca-0.718
traumatic-0.702
Token[
Token ID58
Feature activation+6.55e-01

Pos logit contributions
jud+0.067
andre+0.064
ca+0.062
imposed+0.059
traumatic+0.057
Neg logit contributions
eering-0.103
idge-0.101
IBLE-0.087
 nodd-0.086
yssey-0.085
TokenPhoto
Token ID6191
Feature activation-1.45e+00

Pos logit contributions
eering+2.816
idge+2.775
IBLE+2.407
 nodd+2.344
yssey+2.343
Neg logit contributions
jud-1.709
andre-1.611
ca-1.550
imposed-1.487
traumatic-1.447
Token:
Token ID25
Feature activation+2.94e-01

Pos logit contributions
jud+3.559
andre+3.401
ca+3.367
imposed+3.075
traumatic+3.008
Neg logit contributions
eering-6.223
idge-6.134
IBLE-5.441
yssey-5.343
rients-5.321
Token YouTube
Token ID7444
Feature activation+5.46e-01

Pos logit contributions
eering+1.098
idge+1.079
IBLE+0.914
 nodd+0.890
yssey+0.888
Neg logit contributions
jud-0.940
andre-0.895
ca-0.875
imposed-0.838
traumatic-0.819
Token screen
Token ID3159
Feature activation-5.13e-01

Pos logit contributions
eering+2.269
idge+2.238
IBLE+1.923
 nodd+1.875
yssey+1.868
Neg logit contributions
jud-1.510
andre-1.422
ca-1.372
imposed-1.322
traumatic-1.287
Token grab
Token ID5552
Feature activation-8.91e-01

Pos logit contributions
jud+1.383
ca+1.294
andre+1.292
imposed+1.208
traumatic+1.154
Neg logit contributions
eering-2.166
idge-2.143
IBLE-1.878
rients-1.826
 nodd-1.822
Token]
Token ID60
Feature activation+3.75e-02

Pos logit contributions
jud+2.275
andre+2.150
ca+2.149
imposed+1.980
traumatic+1.928
Neg logit contributions
eering-3.825
idge-3.769
IBLE-3.302
 nodd-3.248
yssey-3.235