TOP ACTIVATIONS
MAX = 2.472

21 22 23 24 SELECT REG EXP _ EX TR ACT
. y ) ; return normal ; # endif } If
) * Get Volume () ; return f Mass ; }
insurers in this market are reg aining profitability ,
ensity + ambient D ensity ) * Get Volume () ;
, would not work " pro per ly " with the
city manager to pro pose guidelines to prevent this
members of Ford Nation and par rot Ford s
iod (" P ulse period ", Float ) = 1 //
" P ulse amount " , Range ( 0 , 0
ulate D ensity Grid ( Uniform Grid < float > &
Volume () ; return f Mass ; } Ass
Of Position ( r Position ) ; // Note that the
{ ... prev _ state : Option < Arc < Board
it in a high - tem perature environment . They have
per cent . Ad vertising The I
> & density Grid , const Vector < Part icle >
your actions at length , spec ulating on the nature of
positive ( tem perature - am pl ifying ) and negative
ity of it alone lends cred ence to the idea .

MIN ACTIVATIONS
MIN = -2.681

{ ( 244 s )} Be 7 { ( 8 s
deep fried . 6 . Drain on a paper towel -
said J PL 's Luther Be eg le , Curiosity sampling
a referendum on whether to exit the EU .
crew and two more powerful Be cht el A 1 B
rather - than - bre xit - is - the -
future . One photo shows Be b op and Rock stead
what we re gonna do . The
However , the vast majority of early investors have
paired with a developing Nathan Be aul ieu , will mean
over there ," said Sandy Be ale , one of the
explored in 3 D with To make this possible , in
either ; we re the only species that really
Ark t ika decom mission ed in 1992 and 2008 ,
the name they 'd given Be hemoth now seemed downright optimistic
Columbia Pictures With Ghostbusters we learned that if you find a
, says facility director Tony Be asley . At
the sugar cane plantations , Be eg le noted . <|endoftext|>
's Basketball Player of the Year by both the ACC coaches
which shares some characteristics with Be eth oven s

INTERVAL 1.184 - 2.472
CONTAINS 0.760%

the Melbourne Mag istrates Court this afternoon , where he was
from here . Anton has put in a lot of effort
New American Translation , Brown Jud aic Studies , no .
Width ), ($ SR . Height ))) ) } } Catch
. Now the kids think there are going to

INTERVAL -0.104 - 1.184
CONTAINS 58.458%

member of the Senate Environment and Public Works Committee , displayed
of fun with it . I think people love him there
up in foster care . This gift means nont
takes a step to the play side . B ugg s
community activism , a massive infusion of provincial funds and ,

INTERVAL -1.393 - -0.104
CONTAINS 40.605%

effects . T ape to back things up works
we got back home , we weren 't allowed to chat
idea here is that any distribution which provides the Snap framework
and the respect of the more extreme pun ks in return
:// forums . s gc afe . com Source

INTERVAL -2.681 - -1.393
CONTAINS 0.177%

xit - is - the - prime - chall enge -
) that tore the country apart . Likewise , the neoliberal
longer there because it was ready to loan him to Bel
/ ret , which we 'll call " pp pr "
( 536 s )} N xd 5 { ( 106 s
Token 21
Token ID2310
Feature activation+6.65e-01

Pos logit contributions
Alert+4.037
ulate+3.964
ents+3.845
aic+3.805
PLIED+3.748
Neg logit contributions
cloth-4.155
omas-4.053
 interviewer-3.924
leneck-3.907
bsite-3.787
Token 22
Token ID2534
Feature activation+6.79e-01

Pos logit contributions
Alert+3.829
ulate+3.784
ents+3.647
aic+3.614
PLIED+3.576
Neg logit contributions
cloth-3.976
omas-3.915
 interviewer-3.786
leneck-3.767
bsite-3.661
Token 23
Token ID2242
Feature activation+5.33e-01

Pos logit contributions
Alert+3.942
ulate+3.905
ents+3.771
aic+3.738
PLIED+3.684
Neg logit contributions
cloth-4.029
omas-3.980
 interviewer-3.831
leneck-3.812
 filler-3.723
Token 24
Token ID1987
Feature activation+4.83e-01

Pos logit contributions
Alert+3.235
ulate+3.212
ents+3.105
aic+3.070
PLIED+3.024
Neg logit contributions
cloth-3.038
omas-2.981
 interviewer-2.894
leneck-2.879
 filler-2.793
Token SELECT
Token ID33493
Feature activation+1.04e+00

Pos logit contributions
Alert+2.644
ulate+2.625
ents+2.529
aic+2.519
imum+2.475
Neg logit contributions
cloth-3.046
omas-3.000
 interviewer-2.918
leneck-2.896
 filler-2.832
Token REG
Token ID23337
Feature activation+2.47e+00

Pos logit contributions
ulate+5.589
Alert+5.580
ents+5.492
imum+5.414
aic+5.343
Neg logit contributions
cloth-6.225
omas-5.922
leneck-5.885
 interviewer-5.814
auc-5.750
TokenEXP
Token ID49864
Feature activation+1.15e+00

Pos logit contributions
ulate+11.128
ents+10.820
Alert+10.129
dict+10.106
ulatory+9.901
Neg logit contributions
cloth-14.915
auc-14.460
 interviewer-14.437
leneck-14.311
omas-14.152
Token_
Token ID62
Feature activation+6.37e-01

Pos logit contributions
ulate+5.840
Alert+5.814
ents+5.786
aic+5.646
imum+5.642
Neg logit contributions
cloth-7.215
omas-7.010
 interviewer-6.995
leneck-6.883
bsite-6.707
TokenEX
Token ID6369
Feature activation+1.05e+00

Pos logit contributions
Alert+3.120
ulate+3.061
ents+2.956
aic+2.934
imum+2.893
Neg logit contributions
cloth-4.364
omas-4.320
 interviewer-4.248
leneck-4.212
 filler-4.137
TokenTR
Token ID5446
Feature activation+2.18e-01

Pos logit contributions
Alert+4.562
ulate+4.479
ents+4.359
PLIED+4.319
imum+4.238
Neg logit contributions
cloth-7.613
omas-7.507
leneck-7.418
 interviewer-7.330
 filler-7.198
TokenACT
Token ID10659
Feature activation+1.79e+00

Pos logit contributions
Alert+1.036
ulate+1.034
aic+0.983
ents+0.973
imum+0.961
Neg logit contributions
cloth-1.522
omas-1.516
 interviewer-1.478
leneck-1.471
 filler-1.453
Token.
Token ID764
Feature activation+4.43e-01

Pos logit contributions
Alert+1.342
ulate+1.323
aic+1.288
ents+1.282
imum+1.277
Neg logit contributions
cloth-1.140
omas-1.125
 interviewer-1.092
leneck-1.071
 filler-1.067
Token y
Token ID331
Feature activation+5.41e-01

Pos logit contributions
Alert+3.024
ulate+2.994
ents+2.914
aic+2.899
imum+2.881
Neg logit contributions
cloth-2.194
omas-2.153
 interviewer-2.085
leneck-2.071
 filler-2.023
Token )
Token ID1267
Feature activation+1.19e+00

Pos logit contributions
Alert+3.145
ulate+3.122
ents+3.018
aic+3.007
imum+2.978
Neg logit contributions
cloth-3.226
omas-3.175
 interviewer-3.106
leneck-3.079
 filler-3.027
Token ;
Token ID2162
Feature activation+1.00e+00

Pos logit contributions
Alert+7.334
ulate+7.305
ents+7.140
imum+6.959
aic+6.938
Neg logit contributions
cloth-6.123
omas-6.007
leneck-5.861
bsite-5.743
 interviewer-5.728
Token return
Token ID1441
Feature activation+8.40e-01

Pos logit contributions
Alert+6.817
ulate+6.808
ents+6.585
aic+6.460
imum+6.412
Neg logit contributions
cloth-4.655
omas-4.556
leneck-4.451
 interviewer-4.335
bsite-4.302
Token normal
Token ID3487
Feature activation+2.39e+00

Pos logit contributions
Alert+4.767
ulate+4.761
ents+4.618
aic+4.524
imum+4.480
Neg logit contributions
cloth-4.957
omas-4.895
leneck-4.811
 interviewer-4.683
bsite-4.625
Token ;
Token ID2162
Feature activation+1.21e+00

Pos logit contributions
Alert+12.505
ulate+12.330
imum+11.842
ents+11.734
dict+11.730
Neg logit contributions
cloth-11.932
leneck-11.890
 interviewer-11.678
omas-11.475
bsite-11.244
Token #
Token ID1303
Feature activation-8.01e-02

Pos logit contributions
Alert+5.179
ulate+5.132
ents+5.042
aic+4.768
imum+4.705
Neg logit contributions
cloth-8.523
omas-8.257
leneck-8.223
bsite-8.041
 interviewer-8.038
Tokenendif
Token ID32088
Feature activation+4.19e-01

Pos logit contributions
cloth+0.395
omas+0.385
 interviewer+0.372
leneck+0.370
 filler+0.367
Neg logit contributions
Alert-0.549
ulate-0.542
aic-0.533
imum-0.528
ents-0.528
Token }
Token ID1782
Feature activation+8.95e-01

Pos logit contributions
Alert+2.463
ulate+2.448
ents+2.359
aic+2.340
imum+2.328
Neg logit contributions
cloth-2.468
omas-2.427
leneck-2.363
 interviewer-2.359
bsite-2.284
Token If
Token ID1002
Feature activation+4.04e-01

Pos logit contributions
Alert+5.263
ulate+5.225
ents+5.025
aic+4.930
imum+4.868
Neg logit contributions
cloth-5.081
omas-5.039
leneck-4.901
 interviewer-4.837
bsite-4.763
Token )
Token ID1267
Feature activation+2.30e+00

Pos logit contributions
Alert+4.989
ulate+4.698
ents+4.639
imum+4.444
aic+4.369
Neg logit contributions
cloth-11.540
omas-11.500
 interviewer-11.337
leneck-11.299
bsite-11.161
Token *
Token ID1635
Feature activation+1.75e+00

Pos logit contributions
Alert+8.751
ulate+8.712
imum+8.565
ents+8.285
dict+8.026
Neg logit contributions
leneck-14.142
cloth-14.134
omas-14.133
bsite-13.721
 interviewer-13.585
Token Get
Token ID3497
Feature activation+2.51e-01

Pos logit contributions
ulate+8.918
Alert+8.721
ents+8.547
imum+8.409
aic+8.167
Neg logit contributions
omas-9.920
cloth-9.905
leneck-9.839
bsite-9.429
 interviewer-9.428
TokenVolume
Token ID31715
Feature activation+1.07e+00

Pos logit contributions
Alert+1.217
ulate+1.217
aic+1.175
ents+1.169
imum+1.148
Neg logit contributions
cloth-1.746
omas-1.715
 interviewer-1.661
leneck-1.659
 filler-1.638
Token()
Token ID3419
Feature activation+1.32e+00

Pos logit contributions
Alert+5.516
ulate+5.369
ents+5.113
imum+5.043
aic+5.042
Neg logit contributions
omas-6.912
cloth-6.883
 interviewer-6.748
leneck-6.691
bsite-6.642
Token ;
Token ID2162
Feature activation+2.34e+00

Pos logit contributions
Alert+5.591
ulate+5.510
ents+5.208
imum+5.172
aic+5.050
Neg logit contributions
cloth-9.295
omas-9.259
leneck-9.107
 interviewer-9.029
bsite-8.908
Token return
Token ID1441
Feature activation+1.51e+00

Pos logit contributions
ulate+10.831
Alert+10.299
ents+9.952
imum+9.731
dict+9.428
Neg logit contributions
leneck-13.289
cloth-13.179
omas-13.002
bsite-12.684
kamp-12.311
Token f
Token ID277
Feature activation+1.28e+00

Pos logit contributions
ulate+7.361
Alert+7.326
ents+7.141
imum+7.068
aic+6.679
Neg logit contributions
cloth-9.284
leneck-9.266
omas-9.248
bsite-8.961
 interviewer-8.915
TokenMass
Token ID20273
Feature activation+2.19e+00

Pos logit contributions
Alert+5.120
ulate+4.868
imum+4.683
ents+4.638
PLIED+4.475
Neg logit contributions
cloth-9.539
omas-9.498
 interviewer-9.495
leneck-9.274
 filler-9.209
Token ;
Token ID2162
Feature activation+1.53e+00

Pos logit contributions
Alert+9.244
ulate+8.847
imum+8.782
ents+8.569
aic+8.348
Neg logit contributions
omas-13.560
leneck-13.474
cloth-13.390
 interviewer-13.300
bsite-13.226
Token }
Token ID1782
Feature activation+8.55e-01

Pos logit contributions
Alert+5.049
ulate+5.037
ents+4.872
imum+4.527
PLIED+4.476
Neg logit contributions
cloth-11.967
leneck-11.637
omas-11.582
 interviewer-11.507
bsite-11.473
Token insurers
Token ID24668
Feature activation+4.80e-01

Pos logit contributions
Alert+5.035
ulate+4.996
ents+4.912
imum+4.776
aic+4.757
Neg logit contributions
cloth-4.931
omas-4.879
leneck-4.689
 interviewer-4.623
 filler-4.511
Token in
Token ID287
Feature activation-1.22e-01

Pos logit contributions
Alert+2.878
ulate+2.860
ents+2.769
aic+2.753
imum+2.709
Neg logit contributions
cloth-2.750
omas-2.712
 interviewer-2.633
leneck-2.618
 filler-2.552
Token this
Token ID428
Feature activation+6.10e-01

Pos logit contributions
cloth+0.655
omas+0.647
 interviewer+0.627
leneck+0.624
 filler+0.613
Neg logit contributions
Alert-0.789
ulate-0.776
aic-0.759
ents-0.753
PLIED-0.746
Token market
Token ID1910
Feature activation+8.31e-01

Pos logit contributions
Alert+3.739
ulate+3.731
ents+3.608
aic+3.601
imum+3.580
Neg logit contributions
cloth-3.354
omas-3.313
 interviewer-3.204
leneck-3.174
 filler-3.104
Token are
Token ID389
Feature activation+8.85e-01

Pos logit contributions
Alert+4.834
ulate+4.828
ents+4.661
aic+4.629
imum+4.532
Neg logit contributions
cloth-4.725
omas-4.708
 interviewer-4.589
leneck-4.528
 filler-4.424
Token reg
Token ID842
Feature activation+2.34e+00

Pos logit contributions
Alert+5.000
ulate+4.951
ents+4.938
imum+4.715
aic+4.667
Neg logit contributions
cloth-5.166
omas-5.118
leneck-4.964
 interviewer-4.913
kamp-4.729
Tokenaining
Token ID1397
Feature activation+6.79e-01

Pos logit contributions
ents+9.528
ulate+9.478
ulatory+8.037
imum+8.000
dict+7.896
Neg logit contributions
cloth-15.768
 interviewer-14.994
 filler-14.724
kamp-14.703
omas-14.593
Token profitability
Token ID37129
Feature activation+5.80e-01

Pos logit contributions
ulate+4.006
Alert+3.983
ents+3.850
imum+3.837
aic+3.833
Neg logit contributions
cloth-3.924
omas-3.870
leneck-3.751
 interviewer-3.695
 filler-3.595
Token,
Token ID11
Feature activation+9.86e-01

Pos logit contributions
Alert+3.514
ulate+3.499
ents+3.396
aic+3.367
imum+3.360
Neg logit contributions
cloth-3.247
omas-3.197
 interviewer-3.140
leneck-3.098
 filler-3.048
Token
Token ID447
Feature activation-3.42e-01

Pos logit contributions
Alert+5.620
ulate+5.581
ents+5.415
aic+5.324
imum+5.238
Neg logit contributions
cloth-5.761
omas-5.668
 interviewer-5.412
leneck-5.362
 filler-5.296
Token
Token ID251
Feature activation+2.50e-01

Pos logit contributions
cloth+1.570
omas+1.555
 interviewer+1.509
 filler+1.480
leneck+1.478
Neg logit contributions
ulate-2.413
Alert-2.396
aic-2.349
imum-2.340
ents-2.308
Tokenensity
Token ID6377
Feature activation+1.28e+00

Pos logit contributions
ulate+4.155
Alert+4.102
imum+3.989
ents+3.960
aic+3.908
Neg logit contributions
cloth-6.567
omas-6.432
 interviewer-6.308
leneck-6.303
 filler-6.206
Token +
Token ID1343
Feature activation+1.48e+00

Pos logit contributions
Alert+6.020
ulate+5.780
ents+5.613
imum+5.571
aic+5.469
Neg logit contributions
omas-8.499
cloth-8.495
leneck-8.292
 interviewer-8.291
bsite-8.188
Token ambient
Token ID25237
Feature activation+1.29e+00

Pos logit contributions
ulate+7.603
Alert+7.512
ents+7.405
imum+7.280
aic+7.069
Neg logit contributions
omas-8.702
cloth-8.623
leneck-8.530
 interviewer-8.318
bsite-8.103
TokenD
Token ID35
Feature activation+9.61e-01

Pos logit contributions
Alert+5.861
ulate+5.535
ents+5.432
imum+5.372
aic+5.295
Neg logit contributions
omas-8.867
cloth-8.811
 interviewer-8.785
leneck-8.507
 filler-8.460
Tokenensity
Token ID6377
Feature activation+1.49e+00

Pos logit contributions
ulate+3.152
imum+3.011
ents+3.008
Alert+2.915
aic+2.900
Neg logit contributions
cloth-7.909
 interviewer-7.882
 filler-7.793
omas-7.753
 Ukrain-7.584
Token )
Token ID1267
Feature activation+2.30e+00

Pos logit contributions
Alert+4.989
ulate+4.698
ents+4.639
imum+4.444
aic+4.369
Neg logit contributions
cloth-11.540
omas-11.500
 interviewer-11.337
leneck-11.299
bsite-11.161
Token *
Token ID1635
Feature activation+1.75e+00

Pos logit contributions
Alert+8.751
ulate+8.712
imum+8.565
ents+8.285
dict+8.026
Neg logit contributions
leneck-14.142
cloth-14.134
omas-14.133
bsite-13.721
 interviewer-13.585
Token Get
Token ID3497
Feature activation+2.51e-01

Pos logit contributions
ulate+8.918
Alert+8.721
ents+8.547
imum+8.409
aic+8.167
Neg logit contributions
omas-9.920
cloth-9.905
leneck-9.839
bsite-9.429
 interviewer-9.428
TokenVolume
Token ID31715
Feature activation+1.07e+00

Pos logit contributions
Alert+1.217
ulate+1.217
aic+1.175
ents+1.169
imum+1.148
Neg logit contributions
cloth-1.746
omas-1.715
 interviewer-1.661
leneck-1.659
 filler-1.638
Token()
Token ID3419
Feature activation+1.32e+00

Pos logit contributions
Alert+5.516
ulate+5.369
ents+5.113
imum+5.043
aic+5.042
Neg logit contributions
omas-6.912
cloth-6.883
 interviewer-6.748
leneck-6.691
bsite-6.642
Token ;
Token ID2162
Feature activation+2.34e+00

Pos logit contributions
Alert+5.591
ulate+5.510
ents+5.208
imum+5.172
aic+5.050
Neg logit contributions
cloth-9.295
omas-9.259
leneck-9.107
 interviewer-9.029
bsite-8.908
Token,
Token ID11
Feature activation-2.40e-01

Pos logit contributions
cloth+4.088
omas+4.076
 interviewer+3.898
 filler+3.859
leneck+3.778
Neg logit contributions
Alert-3.230
ulate-3.190
aic-3.145
PLIED-3.123
imum-3.078
Token would
Token ID561
Feature activation-6.19e-01

Pos logit contributions
cloth+1.389
omas+1.368
 interviewer+1.336
 filler+1.312
leneck+1.312
Neg logit contributions
Alert-1.439
ulate-1.409
aic-1.377
imum-1.370
ents-1.362
Token not
Token ID407
Feature activation+4.07e-02

Pos logit contributions
omas+2.847
cloth+2.838
 interviewer+2.706
leneck+2.671
 filler+2.655
Neg logit contributions
Alert-4.463
ulate-4.283
aic-4.251
imum-4.246
PLIED-4.206
Token work
Token ID670
Feature activation-7.55e-02

Pos logit contributions
Alert+0.278
ulate+0.275
aic+0.268
ents+0.267
imum+0.265
Neg logit contributions
cloth-0.203
omas-0.199
 interviewer-0.193
leneck-0.191
 filler-0.188
Token "
Token ID366
Feature activation-2.03e-01

Pos logit contributions
cloth+0.418
omas+0.411
 interviewer+0.399
leneck+0.393
 filler+0.392
Neg logit contributions
Alert-0.471
ulate-0.468
aic-0.455
ents-0.452
imum-0.449
Tokenpro
Token ID1676
Feature activation+2.26e+00

Pos logit contributions
cloth+1.004
omas+0.970
leneck+0.930
 interviewer+0.928
 filler+0.903
Neg logit contributions
Alert-1.386
ulate-1.380
aic-1.341
ents-1.335
imum-1.332
Tokenper
Token ID525
Feature activation+1.28e-01

Pos logit contributions
Alert+11.004
ulate+10.337
ents+9.977
imum+9.955
dict+9.914
Neg logit contributions
leneck-12.611
 interviewer-12.586
omas-12.332
cloth-12.084
bsite-11.767
Tokenly
Token ID306
Feature activation+5.41e-01

Pos logit contributions
Alert+0.995
ulate+0.988
aic+0.963
ents+0.962
imum+0.955
Neg logit contributions
cloth-0.516
omas-0.505
 interviewer-0.485
leneck-0.479
 filler-0.469
Token"
Token ID1
Feature activation+4.33e-01

Pos logit contributions
Alert+3.124
ulate+3.094
ents+2.989
aic+2.976
imum+2.940
Neg logit contributions
cloth-3.200
omas-3.165
leneck-3.101
 interviewer-3.088
 filler-2.985
Token with
Token ID351
Feature activation+3.37e-01

Pos logit contributions
Alert+2.496
ulate+2.485
ents+2.403
aic+2.380
imum+2.348
Neg logit contributions
cloth-2.568
omas-2.549
leneck-2.467
 interviewer-2.462
 filler-2.394
Token the
Token ID262
Feature activation-1.52e-01

Pos logit contributions
Alert+1.927
ulate+1.919
ents+1.847
aic+1.847
imum+1.821
Neg logit contributions
cloth-2.046
omas-2.015
leneck-1.953
 interviewer-1.942
 filler-1.899
Token city
Token ID1748
Feature activation-4.12e-01

Pos logit contributions
cloth+2.591
 interviewer+2.582
omas+2.528
leneck+2.452
 filler+2.425
Neg logit contributions
Alert-2.718
ulate-2.678
aic-2.619
imum-2.610
PLIED-2.598
Token manager
Token ID4706
Feature activation-1.73e-01

Pos logit contributions
cloth+2.152
 interviewer+2.089
omas+2.070
leneck+2.015
 filler+1.981
Neg logit contributions
Alert-2.669
ulate-2.606
aic-2.590
imum-2.567
PLIED-2.557
Token to
Token ID284
Feature activation-3.42e-01

Pos logit contributions
cloth+0.938
omas+0.926
 interviewer+0.905
leneck+0.887
 filler+0.874
Neg logit contributions
Alert-1.094
ulate-1.082
aic-1.063
PLIED-1.047
imum-1.046
Token
Token ID564
Feature activation-2.43e-01

Pos logit contributions
cloth+1.864
omas+1.837
 interviewer+1.793
leneck+1.760
 filler+1.739
Neg logit contributions
Alert-2.168
ulate-2.137
aic-2.083
ents-2.076
imum-2.072
Token
Token ID250
Feature activation-7.99e-01

Pos logit contributions
cloth+1.235
omas+1.213
 interviewer+1.175
leneck+1.161
 filler+1.141
Neg logit contributions
Alert-1.615
ulate-1.609
aic-1.568
ents-1.558
imum-1.551
Tokenpro
Token ID1676
Feature activation+2.23e+00

Pos logit contributions
cloth+4.144
omas+3.929
leneck+3.793
 interviewer+3.733
auc+3.724
Neg logit contributions
Alert-5.107
PLIED-5.082
ulate-5.048
imum-5.045
aic-4.957
Tokenpose
Token ID3455
Feature activation+9.53e-02

Pos logit contributions
ulate+11.867
Alert+11.193
dict+11.191
ents+10.022
aic+9.943
Neg logit contributions
omas-11.900
 interviewer-11.813
leneck-11.752
 filler-11.646
cloth-11.453
Token guidelines
Token ID9949
Feature activation+4.02e-01

Pos logit contributions
Alert+0.628
ulate+0.622
aic+0.604
ents+0.603
imum+0.598
Neg logit contributions
cloth-0.497
omas-0.488
 interviewer-0.474
leneck-0.470
 filler-0.462
Token to
Token ID284
Feature activation+1.06e-01

Pos logit contributions
Alert+2.426
ulate+2.408
ents+2.335
aic+2.314
imum+2.285
Neg logit contributions
cloth-2.303
omas-2.278
 interviewer-2.217
leneck-2.215
 filler-2.160
Token prevent
Token ID2948
Feature activation+7.36e-02

Pos logit contributions
Alert+0.617
ulate+0.610
aic+0.590
ents+0.590
imum+0.584
Neg logit contributions
cloth-0.632
omas-0.623
 interviewer-0.606
leneck-0.601
 filler-0.592
Token this
Token ID428
Feature activation-4.18e-01

Pos logit contributions
Alert+0.461
ulate+0.456
aic+0.443
ents+0.442
imum+0.439
Neg logit contributions
cloth-0.408
omas-0.402
 interviewer-0.391
leneck-0.386
 filler-0.381
Token members
Token ID1866
Feature activation+4.54e-01

Pos logit contributions
Alert+1.670
ulate+1.654
aic+1.599
ents+1.598
imum+1.584
Neg logit contributions
cloth-1.622
omas-1.598
 interviewer-1.554
leneck-1.542
 filler-1.519
Token of
Token ID286
Feature activation+2.60e-01

Pos logit contributions
Alert+2.558
ulate+2.540
ents+2.457
aic+2.442
imum+2.423
Neg logit contributions
cloth-2.773
omas-2.729
 interviewer-2.666
leneck-2.648
 filler-2.600
Token Ford
Token ID8092
Feature activation+4.92e-02

Pos logit contributions
Alert+1.622
ulate+1.605
aic+1.554
ents+1.553
imum+1.541
Neg logit contributions
cloth-1.447
omas-1.425
 interviewer-1.382
leneck-1.371
 filler-1.350
Token Nation
Token ID8741
Feature activation+4.08e-01

Pos logit contributions
Alert+0.300
ulate+0.298
aic+0.288
ents+0.286
imum+0.285
Neg logit contributions
cloth-0.283
omas-0.277
leneck-0.267
 interviewer-0.266
 filler-0.262
Token and
Token ID290
Feature activation+7.70e-01

Pos logit contributions
Alert+2.530
ulate+2.507
ents+2.430
aic+2.427
imum+2.402
Neg logit contributions
cloth-2.279
omas-2.241
 interviewer-2.185
leneck-2.167
 filler-2.130
Token par
Token ID1582
Feature activation+2.23e+00

Pos logit contributions
ulate+4.627
Alert+4.606
aic+4.447
ents+4.441
imum+4.300
Neg logit contributions
cloth-4.324
omas-4.185
leneck-4.110
 interviewer-4.062
 filler-3.965
Tokenrot
Token ID10599
Feature activation+1.01e+00

Pos logit contributions
ulate+13.692
aic+13.033
ents+12.904
Alert+12.838
imum+12.518
Neg logit contributions
cloth-10.246
leneck-9.626
 interviewer-9.510
 filler-9.183
bsite-9.074
Token Ford
Token ID8092
Feature activation+2.02e-01

Pos logit contributions
ulate+5.643
Alert+5.523
aic+5.437
ents+5.318
dict+5.106
Neg logit contributions
cloth-5.935
leneck-5.772
omas-5.748
bsite-5.602
 interviewer-5.555
Token
Token ID447
Feature activation+2.84e-01

Pos logit contributions
Alert+1.217
ulate+1.206
aic+1.166
ents+1.165
imum+1.153
Neg logit contributions
cloth-1.161
omas-1.143
 interviewer-1.111
leneck-1.104
 filler-1.086
Token
Token ID247
Feature activation+4.62e-01

Pos logit contributions
Alert+1.911
ulate+1.897
ents+1.843
aic+1.818
imum+1.806
Neg logit contributions
cloth-1.428
omas-1.408
 interviewer-1.356
leneck-1.352
 filler-1.329
Tokens
Token ID82
Feature activation+1.15e-01

Pos logit contributions
Alert+1.864
ulate+1.856
ents+1.761
aic+1.761
imum+1.717
Neg logit contributions
cloth-3.564
omas-3.517
 interviewer-3.449
leneck-3.432
 filler-3.392
Tokeniod
Token ID2101
Feature activation+1.83e+00

Pos logit contributions
Alert+5.706
imum+5.498
ulate+5.490
aic+5.478
dict+5.389
Neg logit contributions
 interviewer-6.304
 filler-6.290
cloth-6.268
omas-6.223
leneck-6.125
Token ("
Token ID5855
Feature activation+7.87e-01

Pos logit contributions
ulate+9.117
Alert+8.973
imum+8.784
ents+8.705
aic+8.146
Neg logit contributions
cloth-10.872
omas-10.573
 interviewer-10.522
leneck-10.121
bsite-9.992
TokenP
Token ID47
Feature activation+7.27e-01

Pos logit contributions
Alert+4.301
ulate+4.224
ents+4.035
imum+4.024
aic+4.009
Neg logit contributions
cloth-4.958
omas-4.908
 interviewer-4.817
leneck-4.748
 filler-4.684
Tokenulse
Token ID9615
Feature activation+5.25e-01

Pos logit contributions
ulate+1.834
Alert+1.722
aic+1.602
imum+1.594
ents+1.523
Neg logit contributions
cloth-6.798
 interviewer-6.647
 filler-6.614
omas-6.564
 Ukrain-6.474
Token period
Token ID2278
Feature activation+1.83e+00

Pos logit contributions
Alert+3.049
ulate+3.031
ents+2.909
aic+2.909
imum+2.909
Neg logit contributions
cloth-3.138
omas-3.078
 interviewer-2.979
leneck-2.974
 filler-2.893
Token",
Token ID1600
Feature activation+2.22e+00

Pos logit contributions
ulate+7.916
imum+7.796
Alert+7.444
ents+7.426
aic+7.233
Neg logit contributions
cloth-12.226
 interviewer-11.735
omas-11.697
sbm-11.463
bsite-11.430
Token Float
Token ID48436
Feature activation+1.11e+00

Pos logit contributions
ulate+7.937
Alert+7.641
imum+7.485
ents+7.041
dict+6.642
Neg logit contributions
cloth-15.485
omas-15.261
 interviewer-14.642
bsite-14.408
leneck-14.350
Token)
Token ID8
Feature activation+1.56e+00

Pos logit contributions
Alert+6.241
ulate+6.141
ents+5.904
imum+5.874
dict+5.694
Neg logit contributions
cloth-6.687
omas-6.610
 interviewer-6.482
leneck-6.388
bsite-6.213
Token =
Token ID796
Feature activation+1.24e+00

Pos logit contributions
ulate+6.332
Alert+6.264
imum+6.083
ents+6.074
PLIED+5.724
Neg logit contributions
omas-10.891
cloth-10.885
 interviewer-10.571
bsite-10.300
leneck-10.264
Token 1
Token ID352
Feature activation+1.11e+00

Pos logit contributions
ulate+6.207
Alert+6.086
imum+5.804
ents+5.782
aic+5.597
Neg logit contributions
cloth-8.030
omas-7.924
leneck-7.798
 interviewer-7.584
bsite-7.469
Token //
Token ID3373
Feature activation+8.00e-01

Pos logit contributions
ulate+6.638
Alert+6.601
ents+6.423
imum+6.310
PLIED+6.229
Neg logit contributions
cloth-6.080
omas-5.987
 interviewer-5.862
leneck-5.823
bsite-5.660
Token "
Token ID366
Feature activation+3.32e-01

Pos logit contributions
ulate+7.210
imum+6.748
Alert+6.737
ents+6.568
PLIED+6.361
Neg logit contributions
cloth-10.628
omas-10.345
 interviewer-10.076
leneck-10.010
bsite-9.933
TokenP
Token ID47
Feature activation+7.19e-01

Pos logit contributions
Alert+1.566
ulate+1.513
ents+1.416
aic+1.414
imum+1.406
Neg logit contributions
omas-2.379
cloth-2.375
 interviewer-2.327
leneck-2.322
 filler-2.289
Tokenulse
Token ID9615
Feature activation+4.96e-01

Pos logit contributions
ulate+1.680
Alert+1.504
aic+1.409
imum+1.395
ents+1.358
Neg logit contributions
cloth-6.887
 interviewer-6.798
 filler-6.766
omas-6.652
 Ukrain-6.602
Token amount
Token ID2033
Feature activation+1.26e+00

Pos logit contributions
Alert+3.080
ulate+3.039
imum+2.952
ents+2.872
aic+2.862
Neg logit contributions
cloth-2.753
omas-2.710
leneck-2.671
bsite-2.571
 interviewer-2.562
Token"
Token ID1
Feature activation+1.86e+00

Pos logit contributions
ulate+4.243
Alert+4.072
imum+4.011
aic+3.957
ents+3.922
Neg logit contributions
 interviewer-9.462
omas-9.448
bsite-9.437
cloth-9.365
leneck-9.358
Token,
Token ID837
Feature activation+2.22e+00

Pos logit contributions
ulate+4.644
imum+4.295
ents+4.011
Alert+3.941
dict+3.873
Neg logit contributions
omas-15.360
bsite-14.935
 interviewer-14.898
cloth-14.786
leneck-14.359
Token Range
Token ID13667
Feature activation+5.91e-01

Pos logit contributions
ulate+5.661
imum+5.030
Alert+4.977
ents+4.704
dict+4.306
Neg logit contributions
omas-17.772
cloth-17.603
bsite-16.788
cember-16.774
 interviewer-16.751
Token (
Token ID357
Feature activation+8.06e-01

Pos logit contributions
Alert+2.325
ulate+2.213
ents+2.125
imum+2.105
aic+2.063
Neg logit contributions
omas-4.606
cloth-4.432
leneck-4.429
 interviewer-4.386
bsite-4.380
Token 0
Token ID657
Feature activation+9.54e-01

Pos logit contributions
ulate+3.230
Alert+3.134
ents+3.028
imum+2.935
PLIED+2.823
Neg logit contributions
omas-6.089
cloth-5.960
leneck-5.806
bsite-5.754
 interviewer-5.743
Token,
Token ID837
Feature activation+1.04e+00

Pos logit contributions
Alert+2.854
ulate+2.775
ents+2.765
imum+2.630
aic+2.564
Neg logit contributions
omas-7.898
 challeng-7.833
bsite-7.805
 interviewer-7.780
leneck-7.700
Token 0
Token ID657
Feature activation+4.77e-01

Pos logit contributions
ulate+3.930
Alert+3.881
ents+3.728
imum+3.576
PLIED+3.504
Neg logit contributions
omas-7.918
cloth-7.738
leneck-7.583
bsite-7.505
 interviewer-7.501
Tokenulate
Token ID5039
Feature activation+1.53e+00

Pos logit contributions
ulate+2.857
imum+0.509
aic+0.454
Alert+0.440
ents+0.327
Neg logit contributions
 interviewer-16.506
 filler-16.451
 Ukrain-16.391
cloth-16.390
 simultane-16.113
TokenD
Token ID35
Feature activation+1.13e+00

Pos logit contributions
Alert+7.057
ulate+6.900
imum+6.470
ents+6.455
aic+6.311
Neg logit contributions
 interviewer-10.241
omas-10.229
cloth-10.165
 filler-9.847
leneck-9.831
Tokenensity
Token ID6377
Feature activation+7.16e-01

Pos logit contributions
ulate+3.618
ents+3.479
imum+3.298
aic+3.295
Alert+3.221
Neg logit contributions
 interviewer-9.417
cloth-9.273
 filler-9.223
leneck-9.023
omas-8.962
TokenGrid
Token ID41339
Feature activation+1.12e+00

Pos logit contributions
Alert+4.982
ulate+4.913
ents+4.755
aic+4.747
imum+4.718
Neg logit contributions
cloth-3.423
omas-3.407
 interviewer-3.320
leneck-3.243
 filler-3.208
Token(
Token ID7
Feature activation+1.80e+00

Pos logit contributions
ulate+5.446
Alert+5.367
ents+5.160
aic+5.080
imum+5.060
Neg logit contributions
omas-7.385
cloth-7.368
 interviewer-7.269
leneck-7.193
bsite-7.066
Token Uniform
Token ID35712
Feature activation+2.20e+00

Pos logit contributions
ulate+9.891
Alert+9.628
ents+9.175
imum+9.173
dict+9.080
Neg logit contributions
omas-9.926
cloth-9.782
 interviewer-9.740
leneck-9.603
 filler-9.519
TokenGrid
Token ID41339
Feature activation+1.73e+00

Pos logit contributions
Alert+12.065
ulate+12.015
ents+11.291
imum+11.164
dict+11.150
Neg logit contributions
omas-11.409
 interviewer-11.146
cloth-10.962
sbm-10.607
bsite-10.590
Token<
Token ID27
Feature activation+1.29e+00

Pos logit contributions
ulate+9.968
Alert+9.796
ents+9.609
aic+9.344
imum+9.311
Neg logit contributions
omas-8.773
 interviewer-8.565
leneck-8.543
cloth-8.532
sbm-8.314
Token float
Token ID12178
Feature activation+1.30e+00

Pos logit contributions
ulate+7.047
Alert+6.982
ents+6.602
imum+6.593
aic+6.480
Neg logit contributions
omas-7.664
cloth-7.642
 interviewer-7.587
leneck-7.454
 filler-7.346
Token >
Token ID1875
Feature activation+1.80e+00

Pos logit contributions
ulate+6.384
Alert+6.344
ents+6.226
imum+6.002
aic+5.958
Neg logit contributions
cloth-8.330
omas-8.305
 interviewer-8.197
leneck-8.046
bsite-7.873
Token &
Token ID1222
Feature activation+9.99e-01

Pos logit contributions
ulate+10.604
ents+10.240
Alert+10.107
imum+9.953
aic+9.763
Neg logit contributions
omas-8.788
 interviewer-8.646
cloth-8.601
leneck-8.591
sbm-8.171
TokenVolume
Token ID31715
Feature activation+1.07e+00

Pos logit contributions
Alert+1.217
ulate+1.217
aic+1.175
ents+1.169
imum+1.148
Neg logit contributions
cloth-1.746
omas-1.715
 interviewer-1.661
leneck-1.659
 filler-1.638
Token()
Token ID3419
Feature activation+1.32e+00

Pos logit contributions
Alert+5.516
ulate+5.369
ents+5.113
imum+5.043
aic+5.042
Neg logit contributions
omas-6.912
cloth-6.883
 interviewer-6.748
leneck-6.691
bsite-6.642
Token ;
Token ID2162
Feature activation+2.34e+00

Pos logit contributions
Alert+5.591
ulate+5.510
ents+5.208
imum+5.172
aic+5.050
Neg logit contributions
cloth-9.295
omas-9.259
leneck-9.107
 interviewer-9.029
bsite-8.908
Token return
Token ID1441
Feature activation+1.51e+00

Pos logit contributions
ulate+10.831
Alert+10.299
ents+9.952
imum+9.731
dict+9.428
Neg logit contributions
leneck-13.289
cloth-13.179
omas-13.002
bsite-12.684
kamp-12.311
Token f
Token ID277
Feature activation+1.28e+00

Pos logit contributions
ulate+7.361
Alert+7.326
ents+7.141
imum+7.068
aic+6.679
Neg logit contributions
cloth-9.284
leneck-9.266
omas-9.248
bsite-8.961
 interviewer-8.915
TokenMass
Token ID20273
Feature activation+2.19e+00

Pos logit contributions
Alert+5.120
ulate+4.868
imum+4.683
ents+4.638
PLIED+4.475
Neg logit contributions
cloth-9.539
omas-9.498
 interviewer-9.495
leneck-9.274
 filler-9.209
Token ;
Token ID2162
Feature activation+1.53e+00

Pos logit contributions
Alert+9.244
ulate+8.847
imum+8.782
ents+8.569
aic+8.348
Neg logit contributions
omas-13.560
leneck-13.474
cloth-13.390
 interviewer-13.300
bsite-13.226
Token }
Token ID1782
Feature activation+8.55e-01

Pos logit contributions
Alert+5.049
ulate+5.037
ents+4.872
imum+4.527
PLIED+4.476
Neg logit contributions
cloth-11.967
leneck-11.637
omas-11.582
 interviewer-11.507
bsite-11.473
Token
Token ID198
Feature activation-4.39e-01

Pos logit contributions
ulate+5.980
Alert+5.970
ents+5.770
imum+5.723
aic+5.667
Neg logit contributions
cloth-3.932
omas-3.881
leneck-3.745
 interviewer-3.680
bsite-3.582
Token
Token ID198
Feature activation+2.61e-01

Pos logit contributions
cloth+1.416
omas+1.383
 interviewer+1.320
leneck+1.310
 filler+1.270
Neg logit contributions
Alert-3.777
ulate-3.748
ents-3.664
aic-3.658
imum-3.625
TokenAss
Token ID8021
Feature activation+6.47e-01

Pos logit contributions
Alert+1.492
ulate+1.473
ents+1.421
aic+1.420
imum+1.406
Neg logit contributions
cloth-1.587
omas-1.566
 interviewer-1.529
leneck-1.518
 filler-1.495
TokenOf
Token ID5189
Feature activation+7.71e-01

Pos logit contributions
Alert+7.370
ulate+7.013
ents+6.809
imum+6.754
aic+6.603
Neg logit contributions
omas-8.950
cloth-8.882
 interviewer-8.815
leneck-8.735
 filler-8.550
TokenPosition
Token ID26545
Feature activation+1.54e+00

Pos logit contributions
Alert+4.015
ulate+3.926
ents+3.784
imum+3.777
aic+3.745
Neg logit contributions
cloth-5.013
omas-4.965
 interviewer-4.882
leneck-4.843
 filler-4.793
Token(
Token ID7
Feature activation+1.04e+00

Pos logit contributions
Alert+7.489
ulate+6.867
ents+6.820
imum+6.670
PLIED+6.632
Neg logit contributions
leneck-9.783
omas-9.765
 interviewer-9.705
cloth-9.663
 filler-9.531
Token r
Token ID374
Feature activation+5.94e-01

Pos logit contributions
ulate+5.363
Alert+5.169
ents+5.095
imum+5.051
aic+4.950
Neg logit contributions
cloth-6.568
omas-6.450
leneck-6.353
 interviewer-6.349
bsite-6.251
TokenPosition
Token ID26545
Feature activation+1.78e+00

Pos logit contributions
Alert+2.822
ulate+2.794
ents+2.671
aic+2.637
imum+2.635
Neg logit contributions
cloth-4.163
omas-4.114
 interviewer-4.031
leneck-4.001
 filler-3.951
Token )
Token ID1267
Feature activation+2.15e+00

Pos logit contributions
Alert+7.229
ents+6.649
ulate+6.619
imum+6.400
PLIED+6.277
Neg logit contributions
omas-12.321
leneck-12.201
 interviewer-12.200
cloth-12.068
bsite-11.873
Token ;
Token ID2162
Feature activation+1.75e+00

Pos logit contributions
Alert+7.761
ulate+7.293
ents+6.961
imum+6.738
PLIED+6.555
Neg logit contributions
cloth-14.687
omas-14.532
 interviewer-14.395
leneck-14.296
bsite-14.235
Token //
Token ID3373
Feature activation+1.10e+00

Pos logit contributions
ulate+8.750
Alert+8.408
ents+8.117
imum+7.916
aic+7.703
Neg logit contributions
cloth-10.381
leneck-10.251
bsite-10.233
omas-10.167
 interviewer-9.763
Token Note
Token ID5740
Feature activation-1.52e-01

Pos logit contributions
ulate+6.277
Alert+6.198
ents+5.882
imum+5.864
aic+5.753
Neg logit contributions
omas-6.369
cloth-6.366
leneck-6.258
 interviewer-6.152
bsite-6.076
Token that
Token ID326
Feature activation+5.92e-01

Pos logit contributions
cloth+0.824
omas+0.813
 interviewer+0.788
 filler+0.784
leneck+0.770
Neg logit contributions
Alert-0.955
ulate-0.938
aic-0.924
ents-0.913
PLIED-0.911
Token the
Token ID262
Feature activation+3.03e-01

Pos logit contributions
ulate+3.457
Alert+3.442
ents+3.315
imum+3.278
aic+3.260
Neg logit contributions
cloth-3.473
omas-3.433
leneck-3.348
 interviewer-3.318
bsite-3.263
Token {
Token ID1391
Feature activation+8.66e-01

Pos logit contributions
Alert+3.730
ulate+3.581
ents+3.378
aic+3.328
imum+3.301
Neg logit contributions
cloth-7.815
omas-7.778
 interviewer-7.705
leneck-7.648
bsite-7.543
Token...
Token ID2644
Feature activation+1.14e+00

Pos logit contributions
ulate+5.237
Alert+5.118
ents+5.052
aic+4.896
imum+4.869
Neg logit contributions
cloth-4.731
omas-4.615
leneck-4.519
bsite-4.449
 interviewer-4.447
Token prev
Token ID8654
Feature activation+1.45e+00

Pos logit contributions
ulate+6.854
Alert+6.777
ents+6.694
dict+6.486
imum+6.385
Neg logit contributions
cloth-6.028
omas-5.818
leneck-5.815
 interviewer-5.700
bsite-5.618
Token_
Token ID62
Feature activation+5.12e-01

Pos logit contributions
ulate+6.358
Alert+6.284
ents+6.207
dict+5.784
imum+5.671
Neg logit contributions
cloth-10.034
omas-9.751
 interviewer-9.727
 filler-9.566
leneck-9.511
Tokenstate
Token ID5219
Feature activation+1.21e+00

Pos logit contributions
ulate+2.401
Alert+2.392
ents+2.297
aic+2.261
imum+2.237
Neg logit contributions
cloth-3.596
omas-3.555
 interviewer-3.505
leneck-3.450
 filler-3.435
Token :
Token ID1058
Feature activation+2.14e+00

Pos logit contributions
Alert+5.245
ulate+5.187
ents+5.043
aic+4.783
dict+4.772
Neg logit contributions
cloth-8.577
omas-8.348
 interviewer-8.315
leneck-8.235
bsite-8.109
Token Option
Token ID16018
Feature activation+1.00e+00

Pos logit contributions
Alert+8.918
ulate+8.837
ents+8.338
dict+8.140
imum+8.139
Neg logit contributions
cloth-13.243
bsite-12.686
omas-12.609
leneck-12.489
auc-12.280
Token <
Token ID1279
Feature activation+1.87e+00

Pos logit contributions
Alert+5.686
ulate+5.630
ents+5.376
aic+5.345
imum+5.279
Neg logit contributions
cloth-5.956
omas-5.883
 interviewer-5.787
 filler-5.607
leneck-5.598
Token Arc
Token ID10173
Feature activation+1.65e+00

Pos logit contributions
Alert+8.820
ulate+8.544
ents+8.135
aic+7.899
dict+7.824
Neg logit contributions
cloth-10.935
omas-10.844
auc-10.761
leneck-10.699
 interviewer-10.687
Token <
Token ID1279
Feature activation+1.00e+00

Pos logit contributions
Alert+7.307
ulate+7.268
ents+6.984
aic+6.692
imum+6.600
Neg logit contributions
cloth-11.009
omas-10.846
 interviewer-10.638
bsite-10.507
auc-10.347
Token Board
Token ID5926
Feature activation+1.06e+00

Pos logit contributions
Alert+5.476
ulate+5.472
ents+5.240
aic+5.139
imum+5.054
Neg logit contributions
cloth-6.036
omas-5.903
leneck-5.821
 interviewer-5.784
bsite-5.718
Token it
Token ID340
Feature activation+5.64e-01

Pos logit contributions
Alert+2.294
ulate+2.273
aic+2.197
ents+2.196
imum+2.174
Neg logit contributions
cloth-2.248
omas-2.212
 interviewer-2.152
leneck-2.137
 filler-2.103
Token in
Token ID287
Feature activation+4.73e-01

Pos logit contributions
ulate+3.308
Alert+3.275
ents+3.213
aic+3.180
imum+3.132
Neg logit contributions
cloth-3.238
omas-3.198
leneck-3.127
 interviewer-3.087
bsite-3.047
Token a
Token ID257
Feature activation+8.89e-01

Pos logit contributions
ulate+2.850
Alert+2.847
ents+2.750
aic+2.736
imum+2.719
Neg logit contributions
cloth-2.711
omas-2.640
leneck-2.579
 interviewer-2.567
bsite-2.507
Token high
Token ID1029
Feature activation+7.20e-01

Pos logit contributions
ulate+5.468
Alert+5.410
ents+5.240
imum+5.215
aic+5.150
Neg logit contributions
cloth-4.733
omas-4.727
leneck-4.567
 interviewer-4.536
bsite-4.413
Token-
Token ID12
Feature activation+7.38e-01

Pos logit contributions
Alert+4.485
ulate+4.482
imum+4.342
ents+4.318
aic+4.316
Neg logit contributions
cloth-3.871
omas-3.781
 interviewer-3.700
leneck-3.698
bsite-3.569
Tokentem
Token ID11498
Feature activation+2.13e+00

Pos logit contributions
ulate+4.927
Alert+4.906
ents+4.769
aic+4.768
imum+4.754
Neg logit contributions
cloth-3.621
 interviewer-3.618
omas-3.597
leneck-3.474
 filler-3.439
Tokenperature
Token ID21069
Feature activation+2.42e-01

Pos logit contributions
ents+6.545
ulate+6.406
Alert+6.147
PLIED+6.011
aic+5.995
Neg logit contributions
cloth-15.447
omas-15.342
 interviewer-15.226
 filler-15.179
sbm-15.166
Token environment
Token ID2858
Feature activation+8.07e-01

Pos logit contributions
Alert+1.532
ulate+1.520
ents+1.472
aic+1.471
imum+1.459
Neg logit contributions
cloth-1.323
omas-1.302
 interviewer-1.262
leneck-1.254
 filler-1.231
Token.
Token ID13
Feature activation+3.22e-01

Pos logit contributions
Alert+4.535
ulate+4.518
ents+4.447
aic+4.412
imum+4.358
Neg logit contributions
cloth-4.720
omas-4.689
 interviewer-4.541
leneck-4.540
bsite-4.445
Token They
Token ID1119
Feature activation-2.32e-01

Pos logit contributions
Alert+2.019
ulate+1.988
ents+1.937
aic+1.929
imum+1.911
Neg logit contributions
cloth-1.775
omas-1.766
leneck-1.711
 interviewer-1.699
bsite-1.658
Token have
Token ID423
Feature activation-4.32e-01

Pos logit contributions
cloth+1.064
omas+1.045
 interviewer+1.011
leneck+0.999
 filler+0.983
Neg logit contributions
Alert-1.683
ulate-1.653
aic-1.607
ents-1.606
imum-1.603
Token per
Token ID583
Feature activation+5.81e-01

Pos logit contributions
ulate+4.878
Alert+4.856
ents+4.750
aic+4.629
imum+4.584
Neg logit contributions
cloth-5.458
omas-5.403
leneck-5.330
 interviewer-5.294
bsite-5.145
Token cent
Token ID1247
Feature activation+7.60e-01

Pos logit contributions
Alert+2.513
ulate+2.512
aic+2.433
ents+2.408
imum+2.393
Neg logit contributions
cloth-4.145
leneck-4.084
omas-4.060
bsite-3.937
 interviewer-3.917
Token.
Token ID13
Feature activation+4.47e-01

Pos logit contributions
ulate+4.466
Alert+4.394
aic+4.281
ents+4.254
imum+4.169
Neg logit contributions
omas-4.362
cloth-4.337
leneck-4.179
 interviewer-4.173
bsite-4.068
Token
Token ID198
Feature activation+5.89e-01

Pos logit contributions
Alert+3.522
ulate+3.481
ents+3.402
aic+3.394
imum+3.367
Neg logit contributions
cloth-1.740
omas-1.708
 interviewer-1.636
leneck-1.617
 filler-1.581
Token
Token ID198
Feature activation+6.98e-01

Pos logit contributions
Alert+4.242
ulate+4.118
ents+4.084
aic+4.016
dict+3.968
Neg logit contributions
 challeng-2.691
leneck-2.605
omas-2.572
 interviewer-2.528
 filler-2.528
TokenAd
Token ID2782
Feature activation+2.13e+00

Pos logit contributions
Alert+3.682
ulate+3.543
aic+3.408
ents+3.395
imum+3.329
Neg logit contributions
cloth-4.555
omas-4.536
 interviewer-4.495
leneck-4.424
 filler-4.402
Tokenvertising
Token ID31809
Feature activation+3.63e-01

Pos logit contributions
ulate+13.057
Alert+12.648
ents+12.378
imum+12.196
aic+12.172
Neg logit contributions
cloth-9.296
 interviewer-9.285
 filler-8.925
omas-8.632
leneck-8.477
Token
Token ID198
Feature activation+4.53e-01

Pos logit contributions
Alert+2.739
ulate+2.723
aic+2.638
ents+2.625
imum+2.597
Neg logit contributions
cloth-1.568
omas-1.518
 interviewer-1.467
leneck-1.441
 filler-1.408
Token
Token ID198
Feature activation+6.87e-01

Pos logit contributions
Alert+3.292
ulate+3.234
ents+3.201
aic+3.139
dict+3.109
Neg logit contributions
 challeng-2.008
leneck-1.965
omas-1.955
 interviewer-1.926
cloth-1.923
TokenThe
Token ID464
Feature activation+1.68e-01

Pos logit contributions
Alert+3.612
ulate+3.484
aic+3.343
ents+3.337
imum+3.276
Neg logit contributions
cloth-4.492
omas-4.467
 interviewer-4.428
leneck-4.368
 filler-4.335
Token I
Token ID314
Feature activation+8.36e-01

Pos logit contributions
Alert+1.042
ulate+1.033
aic+0.999
ents+0.999
imum+0.989
Neg logit contributions
cloth-0.939
omas-0.924
 interviewer-0.897
leneck-0.891
 filler-0.876
Token >
Token ID1875
Feature activation+1.80e+00

Pos logit contributions
ulate+6.384
Alert+6.344
ents+6.226
imum+6.002
aic+5.958
Neg logit contributions
cloth-8.330
omas-8.305
 interviewer-8.197
leneck-8.046
bsite-7.873
Token &
Token ID1222
Feature activation+9.99e-01

Pos logit contributions
ulate+10.604
ents+10.240
Alert+10.107
imum+9.953
aic+9.763
Neg logit contributions
omas-8.788
 interviewer-8.646
cloth-8.601
leneck-8.591
sbm-8.171
Token density
Token ID12109
Feature activation+1.24e+00

Pos logit contributions
ulate+7.202
Alert+7.162
ents+7.052
imum+6.922
aic+6.904
Neg logit contributions
omas-4.276
cloth-4.239
 interviewer-4.211
leneck-4.080
 filler-3.940
TokenGrid
Token ID41339
Feature activation+1.52e+00

Pos logit contributions
Alert+6.731
ulate+6.581
ents+6.416
imum+6.270
aic+6.247
Neg logit contributions
 interviewer-7.492
cloth-7.458
omas-7.457
leneck-7.258
 filler-7.142
Token,
Token ID837
Feature activation+1.96e+00

Pos logit contributions
Alert+6.475
ulate+6.291
ents+6.199
imum+5.897
aic+5.858
Neg logit contributions
 interviewer-10.597
cloth-10.565
omas-10.543
leneck-10.382
 filler-10.177
Token const
Token ID1500
Feature activation+2.12e+00

Pos logit contributions
ulate+10.517
Alert+9.923
ents+9.800
imum+9.628
dict+9.387
Neg logit contributions
cloth-10.647
omas-10.490
leneck-10.276
bsite-10.272
 interviewer-10.044
Token Vector
Token ID20650
Feature activation+1.34e+00

Pos logit contributions
ulate+10.607
Alert+10.161
ents+10.050
imum+9.678
dict+9.417
Neg logit contributions
omas-11.715
cloth-11.688
bsite-11.427
leneck-11.324
 interviewer-11.113
Token<
Token ID27
Feature activation+1.57e+00

Pos logit contributions
Alert+10.241
ulate+10.018
ents+9.818
imum+9.723
aic+9.524
Neg logit contributions
cloth-4.944
omas-4.810
leneck-4.698
 interviewer-4.569
bsite-4.426
Token Part
Token ID2142
Feature activation+1.06e+00

Pos logit contributions
ulate+6.720
Alert+6.451
ents+6.098
imum+6.087
dict+5.850
Neg logit contributions
cloth-10.772
omas-10.756
 interviewer-10.614
leneck-10.573
bsite-10.445
Tokenicle
Token ID1548
Feature activation+1.37e+00

Pos logit contributions
ulate+7.080
Alert+6.988
aic+6.723
ents+6.722
imum+6.695
Neg logit contributions
cloth-5.378
omas-5.229
 interviewer-5.157
leneck-5.039
 filler-4.929
Token >
Token ID1875
Feature activation+1.66e+00

Pos logit contributions
ulate+7.769
Alert+7.683
ents+7.303
imum+7.142
aic+7.083
Neg logit contributions
cloth-7.842
omas-7.696
 interviewer-7.617
leneck-7.518
bsite-7.359
Token your
Token ID534
Feature activation+5.76e-01

Pos logit contributions
Alert+0.762
ulate+0.754
aic+0.729
ents+0.729
imum+0.722
Neg logit contributions
cloth-0.762
omas-0.750
 interviewer-0.730
leneck-0.724
 filler-0.714
Token actions
Token ID4028
Feature activation+3.41e-01

Pos logit contributions
Alert+3.699
ulate+3.679
ents+3.562
imum+3.525
aic+3.520
Neg logit contributions
cloth-3.057
omas-2.956
leneck-2.871
 interviewer-2.822
 filler-2.765
Token at
Token ID379
Feature activation+5.57e-01

Pos logit contributions
Alert+2.152
ulate+2.133
ents+2.068
aic+2.063
imum+2.046
Neg logit contributions
cloth-1.854
omas-1.826
 interviewer-1.778
leneck-1.767
 filler-1.732
Token length
Token ID4129
Feature activation-1.39e-02

Pos logit contributions
Alert+3.577
ulate+3.526
aic+3.430
ents+3.418
imum+3.390
Neg logit contributions
cloth-2.981
omas-2.908
leneck-2.812
 interviewer-2.797
 filler-2.759
Token,
Token ID11
Feature activation+3.00e-01

Pos logit contributions
cloth+0.082
omas+0.080
 interviewer+0.078
leneck+0.078
 filler+0.077
Neg logit contributions
Alert-0.082
ulate-0.081
aic-0.079
ents-0.079
imum-0.078
Token spec
Token ID1020
Feature activation+2.11e+00

Pos logit contributions
Alert+1.917
ulate+1.897
ents+1.838
aic+1.828
imum+1.809
Neg logit contributions
cloth-1.615
omas-1.590
leneck-1.531
 interviewer-1.526
 filler-1.491
Tokenulating
Token ID8306
Feature activation+6.44e-01

Pos logit contributions
ulate+8.825
ents+8.265
Alert+7.764
iously+7.722
dict+7.568
Neg logit contributions
leneck-14.095
cloth-14.046
 interviewer-13.919
omas-13.876
hower-13.675
Token on
Token ID319
Feature activation+2.46e-01

Pos logit contributions
ulate+3.737
Alert+3.718
ents+3.582
aic+3.541
imum+3.481
Neg logit contributions
cloth-3.777
omas-3.714
leneck-3.617
 interviewer-3.598
bsite-3.522
Token the
Token ID262
Feature activation+5.73e-01

Pos logit contributions
Alert+1.437
ulate+1.428
ents+1.379
aic+1.374
imum+1.361
Neg logit contributions
cloth-1.461
omas-1.434
 interviewer-1.393
leneck-1.390
 filler-1.360
Token nature
Token ID3450
Feature activation+4.36e-02

Pos logit contributions
ulate+3.538
Alert+3.535
ents+3.452
aic+3.427
imum+3.405
Neg logit contributions
cloth-3.102
omas-3.043
leneck-2.926
 interviewer-2.872
bsite-2.850
Token of
Token ID286
Feature activation-1.72e-02

Pos logit contributions
Alert+0.267
ulate+0.264
aic+0.255
ents+0.254
imum+0.252
Neg logit contributions
cloth-0.249
omas-0.245
 interviewer-0.238
leneck-0.237
 filler-0.233
Token positive
Token ID3967
Feature activation+3.70e-01

Pos logit contributions
cloth+0.371
omas+0.363
 interviewer+0.356
leneck+0.354
 filler+0.349
Neg logit contributions
Alert-0.447
ulate-0.436
ents-0.425
aic-0.425
imum-0.419
Token (
Token ID357
Feature activation+6.06e-01

Pos logit contributions
Alert+2.394
ulate+2.380
aic+2.303
ents+2.301
imum+2.284
Neg logit contributions
cloth-1.966
omas-1.931
 interviewer-1.868
leneck-1.855
 filler-1.817
Tokentem
Token ID11498
Feature activation+2.01e+00

Pos logit contributions
Alert+3.908
ulate+3.894
aic+3.775
ents+3.768
imum+3.735
Neg logit contributions
cloth-3.171
omas-3.165
 interviewer-3.080
leneck-3.022
 filler-2.999
Tokenperature
Token ID21069
Feature activation+8.25e-01

Pos logit contributions
ulate+7.436
ents+7.115
aic+6.987
Alert+6.977
imum+6.814
Neg logit contributions
cloth-13.796
omas-13.785
 interviewer-13.622
 filler-13.434
leneck-13.367
Token-
Token ID12
Feature activation+1.10e+00

Pos logit contributions
Alert+4.665
ulate+4.619
aic+4.462
ents+4.444
imum+4.411
Neg logit contributions
cloth-4.943
omas-4.915
 interviewer-4.726
leneck-4.693
bsite-4.596
Tokenam
Token ID321
Feature activation+2.09e+00

Pos logit contributions
Alert+7.250
ulate+7.215
aic+6.986
ents+6.923
imum+6.886
Neg logit contributions
omas-5.428
 interviewer-5.266
cloth-5.222
 filler-5.112
leneck-5.038
Tokenpl
Token ID489
Feature activation+1.36e+00

Pos logit contributions
ulate+10.639
dict+9.715
ents+9.712
imum+9.667
PLIED+9.556
Neg logit contributions
omas-11.567
cloth-11.561
leneck-11.267
 interviewer-11.265
 filler-11.220
Tokenifying
Token ID4035
Feature activation+7.69e-01

Pos logit contributions
ulate+4.063
imum+3.795
aic+3.788
Alert+3.710
ents+3.638
Neg logit contributions
cloth-11.623
omas-11.533
 interviewer-11.404
 filler-11.187
leneck-11.112
Token)
Token ID8
Feature activation+4.77e-01

Pos logit contributions
ulate+3.024
Alert+3.017
ents+2.840
aic+2.831
imum+2.799
Neg logit contributions
cloth-5.975
omas-5.924
 interviewer-5.788
leneck-5.758
 filler-5.647
Token and
Token ID290
Feature activation+1.32e-01

Pos logit contributions
ulate+3.033
Alert+3.001
aic+2.910
ents+2.902
imum+2.882
Neg logit contributions
cloth-2.569
omas-2.545
 interviewer-2.436
leneck-2.423
bsite-2.360
Token negative
Token ID4633
Feature activation+5.92e-01

Pos logit contributions
Alert+1.101
ulate+1.089
ents+1.065
aic+1.064
imum+1.057
Neg logit contributions
cloth-0.454
omas-0.445
 interviewer-0.426
leneck-0.421
 filler-0.413
Tokenity
Token ID414
Feature activation+2.32e-02

Pos logit contributions
cloth+0.569
omas+0.554
 interviewer+0.551
leneck+0.536
 filler+0.534
Neg logit contributions
Alert-0.716
ulate-0.701
ents-0.685
aic-0.679
imum-0.676
Token of
Token ID286
Feature activation+2.46e-01

Pos logit contributions
Alert+0.145
ulate+0.143
aic+0.139
imum+0.139
ents+0.139
Neg logit contributions
cloth-0.130
omas-0.126
 interviewer-0.125
leneck-0.123
 filler-0.122
Token it
Token ID340
Feature activation+6.04e-01

Pos logit contributions
Alert+1.467
ulate+1.458
ents+1.407
aic+1.404
imum+1.390
Neg logit contributions
cloth-1.434
omas-1.414
 interviewer-1.371
leneck-1.367
 filler-1.338
Token alone
Token ID3436
Feature activation+6.08e-01

Pos logit contributions
Alert+3.648
ulate+3.614
ents+3.541
aic+3.474
imum+3.436
Neg logit contributions
cloth-3.368
omas-3.347
leneck-3.242
 interviewer-3.234
 filler-3.138
Token lends
Token ID37733
Feature activation+1.30e+00

Pos logit contributions
Alert+3.670
ulate+3.627
ents+3.579
aic+3.514
imum+3.435
Neg logit contributions
cloth-3.385
omas-3.372
leneck-3.283
 interviewer-3.257
 filler-3.166
Token cred
Token ID2600
Feature activation+2.08e+00

Pos logit contributions
ulate+6.784
Alert+6.646
ents+6.620
aic+6.470
imum+6.381
Neg logit contributions
omas-7.825
cloth-7.811
leneck-7.569
bsite-7.458
sbm-7.444
Tokenence
Token ID594
Feature activation+1.05e+00

Pos logit contributions
ents+5.959
ulate+5.340
iously+4.703
aic+4.597
imum+4.317
Neg logit contributions
 interviewer-17.087
leneck-16.764
cloth-16.717
 Ukrain-16.565
 filler-16.457
Token to
Token ID284
Feature activation+6.02e-01

Pos logit contributions
ulate+6.135
Alert+6.080
ents+6.007
aic+5.882
imum+5.805
Neg logit contributions
cloth-5.908
omas-5.819
leneck-5.701
 interviewer-5.700
bsite-5.578
Token the
Token ID262
Feature activation+5.67e-01

Pos logit contributions
ulate+3.522
Alert+3.520
ents+3.392
aic+3.373
imum+3.326
Neg logit contributions
cloth-3.548
omas-3.488
leneck-3.389
 interviewer-3.370
 filler-3.299
Token idea
Token ID2126
Feature activation+5.58e-01

Pos logit contributions
ulate+3.414
Alert+3.401
aic+3.278
ents+3.276
imum+3.220
Neg logit contributions
cloth-3.228
omas-3.158
leneck-3.070
 interviewer-3.015
bsite-2.966
Token.
Token ID13
Feature activation+5.81e-01

Pos logit contributions
Alert+3.379
ulate+3.355
ents+3.246
aic+3.241
imum+3.203
Neg logit contributions
cloth-3.195
omas-3.148
 interviewer-3.054
leneck-3.037
 filler-2.985
Token {
Token ID1391
Feature activation+5.95e-01

Pos logit contributions
ulate+2.644
Alert+2.640
aic+2.344
ents+2.330
imum+2.176
Neg logit contributions
cloth-8.160
omas-8.135
leneck-8.087
 interviewer-8.005
 filler-7.907
Token(
Token ID7
Feature activation-6.02e-01

Pos logit contributions
Alert+2.545
ulate+2.382
aic+2.303
ents+2.297
dict+2.200
Neg logit contributions
omas-4.462
leneck-4.430
 interviewer-4.417
 filler-4.401
cloth-4.343
Token244
Token ID25707
Feature activation+1.01e+00

Pos logit contributions
cloth+2.330
omas+2.208
 interviewer+2.090
leneck+2.048
 filler+2.016
Neg logit contributions
Alert-4.690
ulate-4.684
aic-4.573
imum-4.539
ents-4.509
Tokens
Token ID82
Feature activation+7.68e-01

Pos logit contributions
ents+2.422
Alert+2.323
ulate+2.298
aic+2.060
dict+2.017
Neg logit contributions
leneck-9.022
cloth-8.971
 interviewer-8.903
omas-8.840
 filler-8.795
Token)}
Token ID38165
Feature activation+2.00e-01

Pos logit contributions
Alert+1.087
ulate+0.908
ents+0.875
aic+0.782
PLIED+0.697
Neg logit contributions
leneck-7.824
omas-7.781
cloth-7.706
 interviewer-7.694
 filler-7.601
Token Be
Token ID1355
Feature activation-2.68e+00

Pos logit contributions
Alert+1.093
ulate+1.087
ents+1.045
aic+1.042
imum+1.026
Neg logit contributions
cloth-1.264
omas-1.248
leneck-1.215
 interviewer-1.213
 filler-1.188
Token7
Token ID22
Feature activation+7.57e-01

Pos logit contributions
auc+6.690
omas+6.539
cloth+6.524
7+6.378
6+6.335
Neg logit contributions
Alert-19.629
PLIED-19.376
imum-18.949
ulate-18.694
ulatory-18.541
Token {
Token ID1391
Feature activation+3.87e-01

Pos logit contributions
Alert+2.038
ulate+2.014
aic+1.823
ents+1.817
imum+1.753
Neg logit contributions
cloth-6.798
omas-6.744
 interviewer-6.663
leneck-6.647
 filler-6.581
Token(
Token ID7
Feature activation-8.24e-01

Pos logit contributions
Alert+1.768
ulate+1.656
ents+1.610
aic+1.609
imum+1.525
Neg logit contributions
omas-2.826
leneck-2.793
 interviewer-2.787
cloth-2.776
 filler-2.774
Token8
Token ID23
Feature activation+1.67e-01

Pos logit contributions
cloth+3.419
omas+3.222
 interviewer+3.069
leneck+3.000
auc+2.977
Neg logit contributions
ulate-6.089
Alert-6.073
aic-5.934
imum-5.881
ents-5.837
Tokens
Token ID82
Feature activation+3.25e-01

Pos logit contributions
Alert+0.607
ulate+0.598
aic+0.568
ents+0.559
imum+0.558
Neg logit contributions
cloth-1.368
omas-1.354
 interviewer-1.323
leneck-1.314
 filler-1.305
Token deep
Token ID2769
Feature activation-3.39e-01

Pos logit contributions
cloth+2.124
omas+2.083
 interviewer+2.001
leneck+1.972
 filler+1.944
Neg logit contributions
Alert-3.735
ulate-3.703
aic-3.609
ents-3.596
imum-3.582
Token fried
Token ID23018
Feature activation-4.90e-01

Pos logit contributions
cloth+2.012
omas+1.980
 interviewer+1.928
leneck+1.914
 filler+1.886
Neg logit contributions
Alert-1.990
ulate-1.971
aic-1.904
ents-1.903
imum-1.885
Token.
Token ID13
Feature activation-4.87e-01

Pos logit contributions
cloth+2.535
omas+2.490
 interviewer+2.413
leneck+2.394
 filler+2.351
Neg logit contributions
Alert-3.255
ulate-3.228
aic-3.131
ents-3.130
imum-3.103
Token 6
Token ID718
Feature activation-4.81e-01

Pos logit contributions
cloth+2.026
omas+1.963
 filler+1.904
 interviewer+1.890
leneck+1.876
Neg logit contributions
Alert-3.687
ulate-3.604
ents-3.538
aic-3.534
imum-3.518
Token.
Token ID13
Feature activation-8.07e-01

Pos logit contributions
cloth+2.490
omas+2.447
 interviewer+2.372
leneck+2.354
 filler+2.311
Neg logit contributions
Alert-3.188
ulate-3.161
aic-3.065
ents-3.065
imum-3.037
Token Drain
Token ID36024
Feature activation-2.22e+00

Pos logit contributions
cloth+3.578
 interviewer+3.318
omas+3.312
 filler+3.186
auc+3.180
Neg logit contributions
Alert-5.795
ulate-5.713
ents-5.656
aic-5.652
imum-5.641
Token on
Token ID319
Feature activation-1.45e+00

Pos logit contributions
cloth+6.975
 interviewer+6.196
auc+6.018
omas+6.010
cho+5.779
Neg logit contributions
isSpecial-16.027
imum-16.009
itime-15.882
Alert-15.845
ulate-15.762
Token a
Token ID257
Feature activation-8.01e-01

Pos logit contributions
cloth+5.595
 interviewer+5.253
 filler+5.181
omas+5.175
auc+4.916
Neg logit contributions
Alert-10.807
imum-10.560
ents-10.543
ulate-10.446
PLIED-10.297
Token paper
Token ID3348
Feature activation-1.55e-01

Pos logit contributions
cloth+3.944
omas+3.728
 interviewer+3.694
 filler+3.645
leneck+3.514
Neg logit contributions
Alert-5.535
ulate-5.383
ents-5.341
aic-5.316
imum-5.296
Token towel
Token ID24808
Feature activation-8.27e-01

Pos logit contributions
omas+0.600
cloth+0.595
leneck+0.574
 interviewer+0.563
 filler+0.538
Neg logit contributions
ulate-1.234
Alert-1.230
ents-1.178
aic-1.176
imum-1.175
Token-
Token ID12
Feature activation+5.51e-01

Pos logit contributions
cloth+3.620
omas+3.429
 interviewer+3.371
 filler+3.291
leneck+3.276
Neg logit contributions
Alert-6.159
ulate-6.073
imum-5.947
aic-5.946
ents-5.929
Token said
Token ID531
Feature activation-7.07e-01

Pos logit contributions
cloth+1.619
omas+1.607
 interviewer+1.583
leneck+1.552
 filler+1.523
Neg logit contributions
Alert-1.953
ulate-1.928
aic-1.893
imum-1.869
ents-1.869
Token J
Token ID449
Feature activation+5.51e-01

Pos logit contributions
 interviewer+3.825
omas+3.782
cloth+3.750
leneck+3.618
 filler+3.608
Neg logit contributions
Alert-4.324
ulate-4.260
aic-4.190
ents-4.160
imum-4.153
TokenPL
Token ID6489
Feature activation+8.67e-02

Pos logit contributions
Alert+2.867
ulate+2.820
ents+2.780
aic+2.721
imum+2.696
Neg logit contributions
cloth-3.609
omas-3.500
 interviewer-3.474
leneck-3.472
 filler-3.423
Token's
Token ID338
Feature activation-2.67e-01

Pos logit contributions
Alert+0.562
ulate+0.557
aic+0.539
ents+0.538
imum+0.535
Neg logit contributions
cloth-0.461
omas-0.455
 interviewer-0.444
leneck-0.436
 filler-0.431
Token Luther
Token ID17461
Feature activation+5.35e-01

Pos logit contributions
cloth+1.513
omas+1.506
 interviewer+1.478
leneck+1.441
 filler+1.431
Neg logit contributions
Alert-1.616
ulate-1.597
aic-1.555
ents-1.548
imum-1.545
Token Be
Token ID1355
Feature activation-2.19e+00

Pos logit contributions
Alert+3.033
ulate+2.987
aic+2.907
ents+2.902
imum+2.867
Neg logit contributions
cloth-3.248
omas-3.178
 interviewer-3.105
leneck-3.102
 filler-3.061
Tokeneg
Token ID1533
Feature activation-6.68e-01

Pos logit contributions
omas+11.150
auc+11.089
cloth+10.995
itiz+10.298
ilet+10.251
Neg logit contributions
PLIED-11.361
Alert-11.131
ulate-10.587
 sidx-10.277
imum-10.217
Tokenle
Token ID293
Feature activation-4.58e-01

Pos logit contributions
cloth+3.613
omas+3.476
leneck+3.361
kamp+3.346
 interviewer+3.301
Neg logit contributions
Alert-4.146
ulate-4.033
PLIED-4.014
aic-3.958
imum-3.871
Token,
Token ID11
Feature activation-3.54e-01

Pos logit contributions
cloth+2.697
omas+2.658
 interviewer+2.602
leneck+2.526
kamp+2.526
Neg logit contributions
Alert-2.661
ulate-2.624
ents-2.531
aic-2.502
imum-2.483
Token Curiosity
Token ID39736
Feature activation-3.07e-01

Pos logit contributions
cloth+2.060
omas+2.050
 interviewer+2.038
 filler+1.975
leneck+1.958
Neg logit contributions
Alert-2.074
ulate-2.041
aic-1.983
ents-1.977
imum-1.974
Token sampling
Token ID19232
Feature activation-1.16e+00

Pos logit contributions
cloth+1.647
omas+1.630
 interviewer+1.630
 filler+1.577
leneck+1.555
Neg logit contributions
Alert-1.940
ulate-1.927
aic-1.863
ents-1.858
imum-1.849
Token a
Token ID257
Feature activation-3.04e-01

Pos logit contributions
cloth+1.628
omas+1.614
 interviewer+1.572
 filler+1.540
leneck+1.528
Neg logit contributions
Alert-1.997
ulate-1.937
aic-1.895
imum-1.891
ents-1.888
Token referendum
Token ID11467
Feature activation-9.37e-01

Pos logit contributions
cloth+1.590
omas+1.565
 interviewer+1.530
leneck+1.499
 filler+1.499
Neg logit contributions
Alert-1.999
ulate-1.963
aic-1.913
ents-1.911
imum-1.902
Token on
Token ID319
Feature activation-3.57e-01

Pos logit contributions
omas+4.439
 interviewer+4.428
cloth+4.423
 filler+4.307
leneck+4.173
Neg logit contributions
Alert-6.282
ulate-6.140
aic-6.046
imum-6.011
PLIED-6.005
Token whether
Token ID1771
Feature activation-1.08e+00

Pos logit contributions
cloth+2.079
omas+2.074
 interviewer+2.030
 filler+1.979
leneck+1.968
Neg logit contributions
Alert-2.138
ulate-2.054
aic-2.036
PLIED-2.015
ents-1.998
Token to
Token ID284
Feature activation-1.14e+00

Pos logit contributions
omas+5.889
cloth+5.724
 interviewer+5.682
 filler+5.572
 disconnect+5.380
Neg logit contributions
Alert-6.552
aic-6.149
PLIED-6.116
ulate-6.095
imum-6.028
Token exit
Token ID8420
Feature activation-2.18e+00

Pos logit contributions
omas+5.771
 interviewer+5.553
 filler+5.329
cloth+5.329
 disconnect+5.301
Neg logit contributions
Alert-7.579
imum-7.191
PLIED-7.166
ents-7.120
aic-7.110
Token the
Token ID262
Feature activation-1.09e+00

Pos logit contributions
omas+8.166
cloth+7.939
 interviewer+7.550
 filler+7.235
 disconnect+7.174
Neg logit contributions
Alert-14.293
aic-13.610
PLIED-13.555
imum-13.489
rious-13.373
Token EU
Token ID4576
Feature activation-1.35e+00

Pos logit contributions
cloth+4.999
omas+4.962
 interviewer+4.900
 filler+4.710
 inability+4.610
Neg logit contributions
Alert-7.561
ulate-7.177
imum-7.068
PLIED-7.034
ents-6.982
Token.
Token ID13
Feature activation-1.19e-02

Pos logit contributions
cloth+6.047
omas+5.791
 filler+5.659
 interviewer+5.585
leneck+5.304
Neg logit contributions
Alert-9.037
imum-8.836
ulate-8.822
aic-8.777
PLIED-8.672
Token
Token ID198
Feature activation+6.35e-01

Pos logit contributions
cloth+0.043
omas+0.042
 interviewer+0.040
leneck+0.039
 filler+0.038
Neg logit contributions
Alert-0.097
ulate-0.097
aic-0.095
ents-0.094
imum-0.094
Token
Token ID198
Feature activation+5.48e-01

Pos logit contributions
Alert+4.472
ulate+4.366
ents+4.314
aic+4.244
dict+4.182
Neg logit contributions
 challeng-2.981
leneck-2.865
omas-2.789
 destro-2.771
cloth-2.765
Token crew
Token ID5462
Feature activation+3.32e-02

Pos logit contributions
Alert+0.217
ulate+0.214
aic+0.207
imum+0.207
ents+0.207
Neg logit contributions
cloth-0.178
omas-0.177
 interviewer-0.171
 filler-0.169
leneck-0.167
Token and
Token ID290
Feature activation-2.83e-01

Pos logit contributions
Alert+0.207
ulate+0.205
aic+0.198
ents+0.197
imum+0.197
Neg logit contributions
cloth-0.186
omas-0.183
 interviewer-0.177
leneck-0.175
 filler-0.175
Token two
Token ID734
Feature activation-3.86e-01

Pos logit contributions
cloth+1.478
omas+1.457
 interviewer+1.416
 filler+1.409
leneck+1.384
Neg logit contributions
Alert-1.845
ulate-1.819
imum-1.776
aic-1.767
ents-1.751
Token more
Token ID517
Feature activation-1.17e-01

Pos logit contributions
cloth+2.097
omas+2.076
 interviewer+2.039
 filler+2.021
leneck+1.958
Neg logit contributions
Alert-2.407
ulate-2.380
imum-2.310
aic-2.296
ents-2.285
Token powerful
Token ID3665
Feature activation-1.76e-01

Pos logit contributions
cloth+0.641
omas+0.634
 interviewer+0.619
 filler+0.609
leneck+0.604
Neg logit contributions
Alert-0.731
ulate-0.721
imum-0.699
ents-0.697
aic-0.696
Token Be
Token ID1355
Feature activation-2.12e+00

Pos logit contributions
cloth+0.959
omas+0.949
 interviewer+0.928
 filler+0.915
leneck+0.896
Neg logit contributions
Alert-1.103
ulate-1.095
imum-1.062
aic-1.055
ents-1.055
Tokencht
Token ID21474
Feature activation-8.16e-01

Pos logit contributions
cloth+9.615
omas+9.298
auc+9.016
cho+8.258
bsite+8.245
Neg logit contributions
imum-12.489
ulate-12.096
PLIED-11.948
Alert-11.917
 sidx-11.425
Tokenel
Token ID417
Feature activation-3.54e-01

Pos logit contributions
cloth+3.343
omas+3.224
leneck+3.203
kamp+2.978
auc+2.928
Neg logit contributions
Alert-6.110
imum-6.041
PLIED-5.932
ulate-5.860
aic-5.681
Token A
Token ID317
Feature activation-3.10e-01

Pos logit contributions
cloth+1.859
omas+1.833
 interviewer+1.766
 filler+1.755
leneck+1.723
Neg logit contributions
Alert-2.289
ulate-2.276
imum-2.208
aic-2.193
ents-2.187
Token1
Token ID16
Feature activation-9.47e-02

Pos logit contributions
omas+1.313
cloth+1.302
 filler+1.208
 interviewer+1.204
auc+1.190
Neg logit contributions
Alert-2.313
ulate-2.308
imum-2.233
PLIED-2.226
aic-2.223
TokenB
Token ID33
Feature activation-4.38e-01

Pos logit contributions
cloth+0.557
omas+0.550
 interviewer+0.532
 filler+0.527
leneck+0.523
Neg logit contributions
ulate-0.554
Alert-0.554
imum-0.531
aic-0.529
ents-0.529
Tokenrather
Token ID34330
Feature activation-9.46e-01

Pos logit contributions
cloth+4.738
omas+4.594
leneck+4.464
auc+4.329
sole+4.281
Neg logit contributions
Alert-6.238
PLIED-5.874
ulate-5.608
imum-5.558
aic-5.530
Token-
Token ID12
Feature activation-1.21e+00

Pos logit contributions
cloth+4.106
omas+3.987
 interviewer+3.742
leneck+3.710
auc+3.627
Neg logit contributions
Alert-7.005
ulate-6.891
PLIED-6.739
aic-6.731
ents-6.639
Tokenthan
Token ID14813
Feature activation-6.69e-01

Pos logit contributions
cloth+5.658
omas+5.409
auc+5.240
cho+5.201
kamp+5.029
Neg logit contributions
Alert-8.372
PLIED-8.361
ulate-7.928
imum-7.752
aic-7.659
Token-
Token ID12
Feature activation-9.49e-01

Pos logit contributions
cloth+3.097
omas+3.036
 interviewer+2.918
leneck+2.885
 filler+2.826
Neg logit contributions
Alert-4.796
ulate-4.741
aic-4.616
ents-4.610
imum-4.580
Tokenbre
Token ID4679
Feature activation-1.23e+00

Pos logit contributions
cloth+4.621
omas+4.489
auc+4.260
leneck+4.182
kamp+4.115
Neg logit contributions
Alert-6.539
PLIED-6.346
ulate-6.271
imum-6.108
aic-6.080
Tokenxit
Token ID10198
Feature activation-2.11e+00

Pos logit contributions
cloth+5.371
omas+5.313
bsite+4.840
cho+4.820
auc+4.731
Neg logit contributions
Alert-8.908
PLIED-8.641
ulate-8.479
ents-8.421
imum-8.237
Token-
Token ID12
Feature activation-1.40e+00

Pos logit contributions
cloth+6.610
omas+6.356
auc+5.520
bsite+5.183
kamp+5.019
Neg logit contributions
Alert-16.187
ulate-15.766
aic-15.719
PLIED-15.617
isSpecial-15.440
Tokenis
Token ID271
Feature activation-1.41e+00

Pos logit contributions
cloth+4.718
omas+4.659
auc+4.184
bsite+3.944
leneck+3.881
Neg logit contributions
Alert-11.310
PLIED-11.260
ulate-10.892
aic-10.610
imum-10.604
Token-
Token ID12
Feature activation-1.55e+00

Pos logit contributions
cloth+5.307
omas+5.143
auc+4.670
leneck+4.577
 interviewer+4.528
Neg logit contributions
Alert-10.822
ulate-10.466
PLIED-10.417
aic-10.303
imum-10.156
Tokenthe
Token ID1169
Feature activation-1.04e+00

Pos logit contributions
cloth+6.620
omas+6.494
kamp+6.036
auc+6.022
sole+6.000
Neg logit contributions
Alert-11.257
PLIED-10.937
ulate-10.431
aic-10.190
 sidx-10.148
Token-
Token ID12
Feature activation-1.52e+00

Pos logit contributions
cloth+4.239
omas+4.202
 interviewer+4.040
leneck+3.883
 filler+3.867
Neg logit contributions
Alert-7.854
ulate-7.672
aic-7.529
ents-7.471
imum-7.451
Token future
Token ID2003
Feature activation+6.34e-01

Pos logit contributions
cloth+1.721
omas+1.690
 interviewer+1.643
 filler+1.624
leneck+1.590
Neg logit contributions
Alert-2.719
ulate-2.644
aic-2.608
imum-2.583
PLIED-2.564
Token.
Token ID13
Feature activation+2.78e-01

Pos logit contributions
Alert+3.874
ulate+3.839
ents+3.731
aic+3.706
imum+3.660
Neg logit contributions
cloth-3.551
omas-3.492
 interviewer-3.395
leneck-3.389
 filler-3.296
Token One
Token ID1881
Feature activation+3.17e-02

Pos logit contributions
Alert+2.013
ulate+1.988
ents+1.941
aic+1.936
imum+1.920
Neg logit contributions
cloth-1.267
omas-1.245
 interviewer-1.195
leneck-1.193
 filler-1.162
Token photo
Token ID4590
Feature activation-8.17e-01

Pos logit contributions
Alert+0.195
ulate+0.192
aic+0.186
ents+0.186
imum+0.184
Neg logit contributions
cloth-0.181
omas-0.177
 interviewer-0.174
leneck-0.171
 filler-0.170
Token shows
Token ID2523
Feature activation-5.88e-01

Pos logit contributions
 interviewer+4.070
cloth+4.064
omas+4.012
 filler+4.010
leneck+3.868
Neg logit contributions
Alert-5.243
ulate-5.191
imum-5.113
aic-5.007
ents-4.998
Token Be
Token ID1355
Feature activation-2.05e+00

Pos logit contributions
cloth+3.074
omas+2.992
 interviewer+2.987
 filler+2.908
leneck+2.871
Neg logit contributions
Alert-3.768
ulate-3.670
imum-3.634
aic-3.578
ents-3.577
Tokenb
Token ID65
Feature activation-1.06e+00

Pos logit contributions
omas+10.035
cloth+9.913
auc+9.835
cho+9.330
ilet+9.233
Neg logit contributions
PLIED-11.237
ulate-10.821
Alert-10.768
imum-10.765
 sidx-10.383
Tokenop
Token ID404
Feature activation+4.19e-02

Pos logit contributions
omas+5.564
cloth+5.482
leneck+5.300
bsite+5.280
kamp+5.212
Neg logit contributions
PLIED-6.505
Alert-6.253
aic-6.231
ulate-6.156
imum-6.022
Token and
Token ID290
Feature activation+1.48e-02

Pos logit contributions
Alert+0.252
ulate+0.249
aic+0.240
ents+0.240
imum+0.239
Neg logit contributions
cloth-0.244
omas-0.240
 interviewer-0.233
leneck-0.232
 filler-0.229
Token Rock
Token ID4631
Feature activation-1.66e-01

Pos logit contributions
Alert+0.088
ulate+0.087
aic+0.085
ents+0.084
imum+0.084
Neg logit contributions
cloth-0.086
omas-0.085
 interviewer-0.083
leneck-0.082
 filler-0.081
Tokenstead
Token ID28044
Feature activation+1.23e-01

Pos logit contributions
cloth+0.952
omas+0.926
 interviewer+0.906
leneck+0.891
 filler+0.882
Neg logit contributions
Alert-1.007
ulate-0.996
aic-0.967
ents-0.961
PLIED-0.955
Token what
Token ID644
Feature activation-5.00e-01

Pos logit contributions
cloth+3.192
omas+3.122
 interviewer+3.107
 filler+3.049
leneck+3.018
Neg logit contributions
Alert-3.351
ulate-3.226
aic-3.175
ents-3.147
imum-3.136
Token we
Token ID356
Feature activation-1.09e+00

Pos logit contributions
cloth+2.677
omas+2.673
 interviewer+2.637
 filler+2.562
leneck+2.547
Neg logit contributions
Alert-3.102
ulate-3.079
aic-3.029
imum-2.960
PLIED-2.960
Token
Token ID447
Feature activation-2.99e-01

Pos logit contributions
cloth+4.928
 interviewer+4.909
bsite+4.880
omas+4.868
leneck+4.726
Neg logit contributions
Alert-7.506
ulate-7.154
PLIED-7.094
imum-7.058
aic-6.958
Token
Token ID247
Feature activation+2.95e-02

Pos logit contributions
cloth+1.454
omas+1.427
 interviewer+1.384
leneck+1.367
 filler+1.342
Neg logit contributions
Alert-2.063
ulate-2.048
aic-1.994
ents-1.988
imum-1.977
Tokenre
Token ID260
Feature activation-1.07e+00

Pos logit contributions
Alert+0.199
ulate+0.197
aic+0.191
ents+0.191
imum+0.190
Neg logit contributions
cloth-0.149
omas-0.146
 interviewer-0.142
leneck-0.140
 filler-0.138
Token gonna
Token ID8066
Feature activation-2.03e+00

Pos logit contributions
cloth+4.732
omas+4.614
bsite+4.548
leneck+4.497
 interviewer+4.478
Neg logit contributions
ulate-7.308
Alert-7.259
aic-7.151
ents-6.989
PLIED-6.970
Token do
Token ID466
Feature activation-6.27e-01

Pos logit contributions
omas+8.351
 interviewer+8.286
bsite+8.150
 dich+8.016
cloth+7.914
Neg logit contributions
Alert-13.505
PLIED-12.839
aic-12.255
Grade-12.248
imum-12.168
Token.
Token ID13
Feature activation+2.20e-01

Pos logit contributions
cloth+3.216
omas+3.111
 interviewer+3.029
leneck+3.021
 filler+2.957
Neg logit contributions
Alert-4.135
ulate-4.074
aic-4.003
PLIED-3.974
ents-3.945
Token
Token ID198
Feature activation-9.17e-02

Pos logit contributions
Alert+1.684
ulate+1.672
ents+1.632
aic+1.629
imum+1.615
Neg logit contributions
cloth-0.905
omas-0.885
 interviewer-0.849
leneck-0.843
 filler-0.821
Token
Token ID198
Feature activation+2.87e-01

Pos logit contributions
omas+0.336
cloth+0.336
leneck+0.327
 interviewer+0.325
 filler+0.318
Neg logit contributions
Alert-0.744
ulate-0.737
ents-0.724
aic-0.718
imum-0.705
TokenThe
Token ID464
Feature activation-2.40e-02

Pos logit contributions
Alert+1.598
ulate+1.574
ents+1.520
aic+1.518
imum+1.498
Neg logit contributions
cloth-1.797
omas-1.775
 interviewer-1.739
leneck-1.722
 filler-1.701
Token
Token ID198
Feature activation-4.51e-02

Pos logit contributions
cloth+0.815
omas+0.802
 interviewer+0.774
leneck+0.764
 filler+0.747
Neg logit contributions
Alert-1.259
ulate-1.256
aic-1.225
ents-1.215
imum-1.214
Token
Token ID198
Feature activation+1.65e-02

Pos logit contributions
cloth+0.168
omas+0.167
leneck+0.163
 interviewer+0.162
 filler+0.159
Neg logit contributions
Alert-0.364
ulate-0.358
ents-0.352
aic-0.350
imum-0.344
TokenHowever
Token ID4864
Feature activation+3.86e-02

Pos logit contributions
ulate+0.087
Alert+0.087
aic+0.084
ents+0.084
imum+0.083
Neg logit contributions
cloth-0.107
omas-0.105
 interviewer-0.102
leneck-0.102
 filler-0.100
Token,
Token ID11
Feature activation-2.48e-01

Pos logit contributions
Alert+0.185
ulate+0.181
aic+0.176
ents+0.175
imum+0.174
Neg logit contributions
cloth-0.270
omas-0.267
 interviewer-0.262
leneck-0.260
 filler-0.258
Token the
Token ID262
Feature activation-6.86e-01

Pos logit contributions
cloth+1.379
omas+1.357
 interviewer+1.335
leneck+1.310
 filler+1.297
Neg logit contributions
Alert-1.542
ulate-1.513
aic-1.478
ents-1.471
imum-1.466
Token vast
Token ID5909
Feature activation-2.03e+00

Pos logit contributions
 interviewer+3.593
cloth+3.510
omas+3.498
 filler+3.428
 inability+3.373
Neg logit contributions
Alert-4.405
ulate-4.274
aic-4.215
imum-4.191
PLIED-4.145
Token majority
Token ID3741
Feature activation-1.68e-01

Pos logit contributions
 disconnect+7.402
 interviewer+7.327
cloth+7.282
 inability+7.200
omas+7.009
Neg logit contributions
Alert-13.931
PLIED-13.516
ulate-13.308
ents-13.048
imum-12.986
Token of
Token ID286
Feature activation+2.16e-02

Pos logit contributions
cloth+0.937
omas+0.921
 interviewer+0.904
leneck+0.890
 filler+0.879
Neg logit contributions
Alert-1.043
ulate-1.011
ents-0.988
imum-0.983
aic-0.979
Token early
Token ID1903
Feature activation+3.21e-01

Pos logit contributions
Alert+0.130
ulate+0.128
aic+0.124
ents+0.124
imum+0.123
Neg logit contributions
cloth-0.125
omas-0.123
 interviewer-0.121
leneck-0.119
 filler-0.118
Token investors
Token ID7713
Feature activation+1.06e-01

Pos logit contributions
Alert+2.064
ulate+2.051
ents+1.990
aic+1.989
imum+1.961
Neg logit contributions
cloth-1.721
omas-1.682
leneck-1.632
 interviewer-1.625
 filler-1.589
Token have
Token ID423
Feature activation-1.58e-01

Pos logit contributions
Alert+0.658
ulate+0.652
aic+0.631
ents+0.631
imum+0.625
Neg logit contributions
cloth-0.591
omas-0.582
 interviewer-0.565
leneck-0.561
 filler-0.552
Token paired
Token ID20312
Feature activation+1.11e-02

Pos logit contributions
Alert+1.875
ulate+1.866
ents+1.808
aic+1.800
imum+1.779
Neg logit contributions
cloth-1.813
omas-1.783
 interviewer-1.728
leneck-1.724
 filler-1.682
Token with
Token ID351
Feature activation-3.29e-01

Pos logit contributions
Alert+0.059
ulate+0.059
ents+0.057
aic+0.057
imum+0.056
Neg logit contributions
cloth-0.072
omas-0.071
 interviewer-0.069
leneck-0.068
 filler-0.067
Token a
Token ID257
Feature activation+6.73e-02

Pos logit contributions
omas+1.723
cloth+1.716
 interviewer+1.687
 filler+1.650
leneck+1.645
Neg logit contributions
Alert-2.126
ulate-2.082
aic-2.048
ents-2.028
PLIED-2.010
Token developing
Token ID5922
Feature activation+5.86e-02

Pos logit contributions
Alert+0.427
ulate+0.424
ents+0.410
aic+0.410
imum+0.407
Neg logit contributions
cloth-0.367
omas-0.361
 interviewer-0.350
leneck-0.348
 filler-0.342
Token Nathan
Token ID18106
Feature activation+1.90e-01

Pos logit contributions
Alert+0.384
ulate+0.380
aic+0.368
ents+0.368
imum+0.365
Neg logit contributions
cloth-0.309
omas-0.304
 interviewer-0.295
leneck-0.292
 filler-0.288
Token Be
Token ID1355
Feature activation-2.02e+00

Pos logit contributions
Alert+1.213
ulate+1.203
aic+1.164
ents+1.163
imum+1.150
Neg logit contributions
cloth-1.025
omas-1.010
 interviewer-0.980
leneck-0.971
 filler-0.954
Tokenaul
Token ID2518
Feature activation-8.12e-01

Pos logit contributions
auc+3.952
omas+3.486
ilet+3.328
cloth+3.323
ater+2.821
Neg logit contributions
PLIED-19.216
Alert-19.086
 sidx-18.217
ents-17.676
ulate-17.495
Tokenieu
Token ID22304
Feature activation+4.45e-01

Pos logit contributions
cloth+5.864
omas+5.701
 interviewer+5.547
auc+5.439
leneck+5.402
Neg logit contributions
ulate-3.669
Alert-3.630
PLIED-3.422
ents-3.364
dict-3.302
Token,
Token ID11
Feature activation+2.76e-01

Pos logit contributions
Alert+2.452
ulate+2.439
ents+2.358
aic+2.343
imum+2.331
Neg logit contributions
cloth-2.768
omas-2.725
 interviewer-2.655
leneck-2.642
 filler-2.586
Token will
Token ID481
Feature activation-5.68e-02

Pos logit contributions
Alert+1.654
ulate+1.641
ents+1.589
aic+1.586
imum+1.570
Neg logit contributions
cloth-1.604
omas-1.578
 interviewer-1.534
leneck-1.527
 filler-1.493
Token mean
Token ID1612
Feature activation-3.98e-01

Pos logit contributions
cloth+0.336
omas+0.332
 interviewer+0.323
leneck+0.321
 filler+0.317
Neg logit contributions
Alert-0.334
ulate-0.330
aic-0.319
ents-0.319
imum-0.316
Token over
Token ID625
Feature activation-6.01e-03

Pos logit contributions
cloth+1.736
omas+1.691
 interviewer+1.640
 filler+1.629
leneck+1.613
Neg logit contributions
Alert-1.936
ulate-1.906
aic-1.879
imum-1.868
ents-1.854
Token there
Token ID612
Feature activation-7.45e-02

Pos logit contributions
cloth+0.031
omas+0.031
 interviewer+0.030
leneck+0.030
 filler+0.029
Neg logit contributions
Alert-0.040
ulate-0.039
aic-0.038
ents-0.038
imum-0.038
Token,"
Token ID553
Feature activation-2.52e-01

Pos logit contributions
cloth+0.442
omas+0.432
 interviewer+0.420
leneck+0.417
 filler+0.415
Neg logit contributions
Alert-0.438
ulate-0.435
aic-0.423
ents-0.419
imum-0.419
Token said
Token ID531
Feature activation-6.70e-01

Pos logit contributions
cloth+1.364
omas+1.344
 interviewer+1.322
leneck+1.289
 filler+1.279
Neg logit contributions
Alert-1.593
ulate-1.576
aic-1.531
imum-1.529
ents-1.522
Token Sandy
Token ID17581
Feature activation+1.54e-01

Pos logit contributions
cloth+3.705
omas+3.701
 interviewer+3.692
 filler+3.538
leneck+3.504
Neg logit contributions
Alert-4.007
ulate-3.955
imum-3.878
aic-3.844
ents-3.827
Token Be
Token ID1355
Feature activation-2.01e+00

Pos logit contributions
Alert+0.878
ulate+0.871
aic+0.840
ents+0.839
imum+0.832
Neg logit contributions
cloth-0.942
omas-0.927
 interviewer-0.902
leneck-0.895
 filler-0.881
Tokenale
Token ID1000
Feature activation-6.70e-02

Pos logit contributions
auc+10.490
omas+10.458
cloth+10.304
bsite+9.577
ilet+9.549
Neg logit contributions
PLIED-10.723
Alert-10.277
ulate-10.030
 sidx-9.704
imum-9.694
Token,
Token ID11
Feature activation+3.29e-01

Pos logit contributions
cloth+0.459
omas+0.450
 interviewer+0.435
leneck+0.433
 filler+0.427
Neg logit contributions
Alert-0.336
ulate-0.331
aic-0.319
ents-0.318
imum-0.318
Token one
Token ID530
Feature activation+4.78e-01

Pos logit contributions
Alert+1.925
ulate+1.909
ents+1.845
aic+1.843
imum+1.804
Neg logit contributions
cloth-1.944
omas-1.924
leneck-1.854
 interviewer-1.840
 filler-1.812
Token of
Token ID286
Feature activation+4.67e-01

Pos logit contributions
Alert+2.746
ulate+2.724
aic+2.628
ents+2.625
imum+2.596
Neg logit contributions
cloth-2.892
omas-2.847
 interviewer-2.756
leneck-2.749
 filler-2.707
Token the
Token ID262
Feature activation+3.43e-01

Pos logit contributions
Alert+2.850
ulate+2.818
aic+2.726
ents+2.725
imum+2.672
Neg logit contributions
cloth-2.674
omas-2.624
 interviewer-2.523
leneck-2.519
 filler-2.474
Token explored
Token ID18782
Feature activation-6.48e-01

Pos logit contributions
cloth+4.391
omas+4.280
 interviewer+4.241
 filler+4.188
bsite+4.160
Neg logit contributions
Alert-3.543
ulate-3.468
aic-3.400
imum-3.397
ents-3.309
Token in
Token ID287
Feature activation-2.99e-01

Pos logit contributions
cloth+3.349
omas+3.220
 interviewer+3.136
leneck+3.083
 filler+3.072
Neg logit contributions
Alert-4.254
ulate-4.183
aic-4.107
imum-4.079
ents-4.057
Token 3
Token ID513
Feature activation+1.75e-01

Pos logit contributions
cloth+1.593
omas+1.558
 interviewer+1.514
 filler+1.497
leneck+1.481
Neg logit contributions
Alert-1.921
ulate-1.900
aic-1.859
ents-1.851
imum-1.836
TokenD
Token ID35
Feature activation-7.18e-01

Pos logit contributions
Alert+0.880
ulate+0.869
aic+0.833
ents+0.831
imum+0.824
Neg logit contributions
cloth-1.191
omas-1.176
 interviewer-1.146
leneck-1.138
 filler-1.127
Token with
Token ID351
Feature activation-1.35e-01

Pos logit contributions
cloth+3.804
omas+3.695
 filler+3.547
 interviewer+3.541
leneck+3.485
Neg logit contributions
Alert-4.540
ulate-4.428
aic-4.414
imum-4.361
ents-4.320
Token To
Token ID1675
Feature activation-2.01e+00

Pos logit contributions
cloth+0.726
omas+0.709
 interviewer+0.695
leneck+0.683
 filler+0.678
Neg logit contributions
Alert-0.862
ulate-0.848
aic-0.827
ents-0.824
imum-0.824
Token make
Token ID787
Feature activation-1.18e+00

Pos logit contributions
omas+8.262
ilet+7.959
cho+7.328
cloth+7.286
leneck+7.196
Neg logit contributions
Alert-13.092
PLIED-12.975
ents-12.589
aic-12.445
icum-12.377
Token this
Token ID428
Feature activation-3.43e-01

Pos logit contributions
cloth+5.686
omas+5.540
 interviewer+5.404
 filler+5.210
leneck+5.134
Neg logit contributions
Alert-7.755
PLIED-7.514
ulate-7.484
imum-7.465
aic-7.457
Token possible
Token ID1744
Feature activation-5.84e-01

Pos logit contributions
cloth+2.223
omas+2.193
 interviewer+2.137
leneck+2.127
 filler+2.094
Neg logit contributions
Alert-1.829
ulate-1.818
aic-1.742
ents-1.740
imum-1.723
Token,
Token ID11
Feature activation-1.34e+00

Pos logit contributions
cloth+3.967
omas+3.886
 interviewer+3.775
 filler+3.706
leneck+3.706
Neg logit contributions
Alert-2.853
ulate-2.842
aic-2.742
ents-2.720
imum-2.709
Tokenin
Token ID259
Feature activation-1.47e+00

Pos logit contributions
cloth+5.660
 interviewer+5.442
omas+5.438
 filler+5.150
leneck+4.994
Neg logit contributions
Alert-9.487
ulate-9.410
imum-9.184
aic-9.141
PLIED-9.081
Token either
Token ID2035
Feature activation-8.93e-02

Pos logit contributions
ulate+5.641
Alert+5.624
aic+5.343
ents+5.327
imum+5.266
Neg logit contributions
cloth-5.839
omas-5.772
leneck-5.578
bsite-5.511
 interviewer-5.482
Token;
Token ID26
Feature activation-3.60e-01

Pos logit contributions
cloth+0.478
omas+0.469
 interviewer+0.458
leneck+0.452
 filler+0.447
Neg logit contributions
Alert-0.576
ulate-0.569
aic-0.556
ents-0.555
imum-0.548
Token we
Token ID356
Feature activation-1.23e+00

Pos logit contributions
cloth+1.992
omas+1.964
 interviewer+1.923
leneck+1.883
 filler+1.868
Neg logit contributions
Alert-2.245
ulate-2.208
ents-2.149
imum-2.146
aic-2.143
Token
Token ID447
Feature activation-4.27e-01

Pos logit contributions
cloth+5.781
omas+5.646
leneck+5.631
bsite+5.601
 interviewer+5.595
Neg logit contributions
Alert-8.128
PLIED-7.888
imum-7.814
ents-7.766
ulate-7.744
Token
Token ID247
Feature activation-1.05e-01

Pos logit contributions
cloth+1.942
omas+1.902
 interviewer+1.846
leneck+1.820
 filler+1.785
Neg logit contributions
Alert-3.072
ulate-3.039
aic-2.985
ents-2.953
imum-2.952
Tokenre
Token ID260
Feature activation-1.99e+00

Pos logit contributions
cloth+0.535
omas+0.525
 interviewer+0.509
leneck+0.507
 filler+0.496
Neg logit contributions
Alert-0.706
ulate-0.694
aic-0.675
ents-0.672
imum-0.671
Token the
Token ID262
Feature activation-3.72e-01

Pos logit contributions
cloth+9.527
omas+9.231
 inability+9.092
 interviewer+9.029
leneck+8.968
Neg logit contributions
ulate-11.042
Alert-10.944
aic-10.807
ents-10.799
imum-10.797
Token only
Token ID691
Feature activation-7.36e-01

Pos logit contributions
cloth+1.869
omas+1.863
 interviewer+1.827
leneck+1.789
 filler+1.778
Neg logit contributions
Alert-2.482
ulate-2.451
aic-2.365
ents-2.365
imum-2.356
Token species
Token ID4693
Feature activation-3.28e-01

Pos logit contributions
 interviewer+3.706
cloth+3.606
omas+3.599
 filler+3.479
leneck+3.438
Neg logit contributions
Alert-4.831
ulate-4.719
aic-4.648
imum-4.643
ents-4.621
Token that
Token ID326
Feature activation-1.67e-01

Pos logit contributions
cloth+1.848
omas+1.813
 interviewer+1.781
leneck+1.763
 filler+1.735
Neg logit contributions
Alert-2.013
ulate-1.967
aic-1.923
imum-1.896
ents-1.892
Token really
Token ID1107
Feature activation+3.05e-01

Pos logit contributions
cloth+0.860
omas+0.850
 interviewer+0.824
leneck+0.818
 filler+0.800
Neg logit contributions
Alert-1.102
ulate-1.086
aic-1.059
ents-1.055
imum-1.047
Token Ark
Token ID9128
Feature activation-7.88e-01

Pos logit contributions
cloth+2.492
omas+2.474
 interviewer+2.410
 filler+2.372
leneck+2.357
Neg logit contributions
Alert-2.577
ulate-2.554
imum-2.478
ents-2.472
aic-2.464
Tokent
Token ID83
Feature activation-9.87e-01

Pos logit contributions
cloth+5.274
omas+5.229
leneck+5.017
 interviewer+4.990
 filler+4.969
Neg logit contributions
Alert-3.776
ulate-3.680
PLIED-3.653
imum-3.552
aic-3.539
Tokenika
Token ID9232
Feature activation-5.75e-01

Pos logit contributions
omas+3.719
cloth+3.509
leneck+3.377
auc+3.252
 interviewer+3.176
Neg logit contributions
Alert-7.683
PLIED-7.441
ulate-7.417
ents-7.179
imum-7.121
Token decom
Token ID26969
Feature activation-8.25e-02

Pos logit contributions
cloth+3.169
omas+3.167
 filler+3.084
 interviewer+3.078
leneck+2.980
Neg logit contributions
Alert-3.466
ulate-3.435
imum-3.337
aic-3.322
ents-3.303
Tokenmission
Token ID3411
Feature activation-4.50e-01

Pos logit contributions
cloth+0.389
omas+0.383
leneck+0.365
 interviewer+0.363
 filler+0.356
Neg logit contributions
Alert-0.584
ulate-0.576
aic-0.565
imum-0.561
ents-0.560
Tokened
Token ID276
Feature activation-1.99e+00

Pos logit contributions
cloth+2.344
omas+2.325
 interviewer+2.231
leneck+2.212
 filler+2.205
Neg logit contributions
Alert-2.921
ulate-2.862
aic-2.793
PLIED-2.760
imum-2.756
Token in
Token ID287
Feature activation-1.59e+00

Pos logit contributions
cloth+7.559
omas+7.438
 filler+7.348
 interviewer+7.270
 incomp+7.002
Neg logit contributions
Alert-13.150
PLIED-12.940
imum-12.696
ulate-12.657
aic-12.601
Token 1992
Token ID9768
Feature activation-6.52e-01

Pos logit contributions
 interviewer+6.081
 filler+5.891
cloth+5.868
omas+5.851
 inability+5.466
Neg logit contributions
Alert-11.224
PLIED-11.074
ents-10.969
ulate-10.921
imum-10.905
Token and
Token ID290
Feature activation-6.43e-01

Pos logit contributions
cloth+3.122
omas+3.106
 interviewer+2.960
 filler+2.917
leneck+2.896
Neg logit contributions
Alert-4.435
ulate-4.382
aic-4.252
PLIED-4.230
ents-4.209
Token 2008
Token ID3648
Feature activation-6.47e-01

Pos logit contributions
cloth+3.272
omas+3.239
 interviewer+3.219
 filler+3.135
leneck+3.114
Neg logit contributions
Alert-4.154
ulate-4.116
imum-4.030
ents-3.991
aic-3.991
Token,
Token ID11
Feature activation-9.57e-01

Pos logit contributions
cloth+2.532
omas+2.529
 interviewer+2.419
 filler+2.375
leneck+2.359
Neg logit contributions
Alert-4.966
ulate-4.874
aic-4.734
PLIED-4.723
ents-4.718
Token the
Token ID262
Feature activation-3.51e-01

Pos logit contributions
Alert+1.898
ulate+1.886
ents+1.832
aic+1.809
imum+1.791
Neg logit contributions
cloth-1.872
omas-1.862
leneck-1.808
 interviewer-1.786
bsite-1.741
Token name
Token ID1438
Feature activation-4.88e-01

Pos logit contributions
cloth+1.857
 interviewer+1.823
omas+1.820
 filler+1.771
leneck+1.755
Neg logit contributions
Alert-2.279
ulate-2.234
aic-2.177
ents-2.166
imum-2.162
Token they
Token ID484
Feature activation-6.28e-02

Pos logit contributions
cloth+2.574
omas+2.438
 interviewer+2.403
 filler+2.378
leneck+2.338
Neg logit contributions
Alert-3.189
ulate-3.186
aic-3.085
imum-3.082
ents-3.059
Token'd
Token ID1549
Feature activation-4.13e-01

Pos logit contributions
cloth+0.329
omas+0.322
 interviewer+0.313
leneck+0.310
 filler+0.306
Neg logit contributions
Alert-0.414
ulate-0.410
aic-0.397
ents-0.397
imum-0.395
Token given
Token ID1813
Feature activation+8.01e-01

Pos logit contributions
cloth+2.444
omas+2.417
 interviewer+2.360
leneck+2.323
 filler+2.320
Neg logit contributions
Alert-2.430
ulate-2.369
imum-2.306
aic-2.283
ents-2.279
Token Be
Token ID1355
Feature activation-1.99e+00

Pos logit contributions
ulate+4.548
Alert+4.523
ents+4.393
aic+4.326
imum+4.293
Neg logit contributions
omas-4.672
cloth-4.616
leneck-4.570
 interviewer-4.467
bsite-4.395
Tokenhemoth
Token ID34394
Feature activation-1.03e-01

Pos logit contributions
cloth+8.813
auc+8.786
omas+8.729
bsite+8.051
itiz+8.050
Neg logit contributions
Alert-11.611
PLIED-11.556
 sidx-11.181
ulate-11.167
imum-11.086
Token now
Token ID783
Feature activation+7.08e-03

Pos logit contributions
cloth+0.515
omas+0.500
 interviewer+0.484
 filler+0.477
leneck+0.476
Neg logit contributions
Alert-0.696
ulate-0.694
ents-0.671
aic-0.670
imum-0.670
Token seemed
Token ID3947
Feature activation-1.98e-01

Pos logit contributions
Alert+0.045
ulate+0.045
aic+0.044
ents+0.043
imum+0.043
Neg logit contributions
cloth-0.038
omas-0.038
 interviewer-0.037
leneck-0.036
 filler-0.036
Token downright
Token ID31464
Feature activation-2.32e-01

Pos logit contributions
cloth+1.073
omas+1.056
 interviewer+1.028
 filler+1.018
leneck+1.010
Neg logit contributions
Alert-1.257
ulate-1.245
aic-1.209
ents-1.207
imum-1.204
Token optimistic
Token ID16915
Feature activation+2.34e-01

Pos logit contributions
cloth+1.362
omas+1.338
 interviewer+1.311
 filler+1.298
leneck+1.279
Neg logit contributions
Alert-1.369
ulate-1.360
ents-1.316
aic-1.315
imum-1.310
Token Columbia
Token ID9309
Feature activation+7.44e-01

Pos logit contributions
cloth+1.774
omas+1.746
 interviewer+1.702
leneck+1.679
 filler+1.664
Neg logit contributions
Alert-1.754
ulate-1.737
aic-1.680
ents-1.669
imum-1.666
Token Pictures
Token ID20051
Feature activation-3.54e-01

Pos logit contributions
Alert+3.754
ulate+3.652
ents+3.577
aic+3.516
imum+3.438
Neg logit contributions
cloth-4.946
omas-4.907
leneck-4.766
 interviewer-4.753
 filler-4.715
Token With
Token ID2080
Feature activation-1.05e-01

Pos logit contributions
cloth+1.443
omas+1.401
 interviewer+1.358
leneck+1.330
 filler+1.301
Neg logit contributions
ulate-2.695
Alert-2.684
aic-2.608
imum-2.607
ents-2.594
Token Ghostbusters
Token ID50037
Feature activation-3.57e-01

Pos logit contributions
cloth+0.592
omas+0.587
 interviewer+0.570
leneck+0.565
 filler+0.559
Neg logit contributions
Alert-0.650
ulate-0.638
aic-0.625
imum-0.618
ents-0.616
Token we
Token ID356
Feature activation-6.75e-01

Pos logit contributions
cloth+1.789
omas+1.725
 interviewer+1.723
 filler+1.688
leneck+1.642
Neg logit contributions
ulate-2.384
Alert-2.360
aic-2.313
imum-2.312
ents-2.274
Token learned
Token ID4499
Feature activation-1.98e+00

Pos logit contributions
cloth+4.175
omas+4.064
 interviewer+3.913
leneck+3.882
 filler+3.847
Neg logit contributions
ulate-3.763
Alert-3.735
imum-3.644
aic-3.571
PLIED-3.532
Token that
Token ID326
Feature activation-3.36e-01

Pos logit contributions
cloth+7.382
omas+7.046
 interviewer+6.839
 filler+6.502
 disconnect+6.380
Neg logit contributions
PLIED-13.695
aic-13.587
Alert-13.358
ents-13.314
itime-13.289
Token if
Token ID611
Feature activation-6.81e-01

Pos logit contributions
cloth+2.443
omas+2.411
 interviewer+2.360
leneck+2.345
 filler+2.318
Neg logit contributions
Alert-1.526
ulate-1.507
aic-1.442
ents-1.440
imum-1.422
Token you
Token ID345
Feature activation-1.71e-01

Pos logit contributions
cloth+3.981
omas+3.925
 interviewer+3.809
 filler+3.737
leneck+3.709
Neg logit contributions
Alert-4.001
ulate-3.949
aic-3.812
imum-3.807
ents-3.797
Token find
Token ID1064
Feature activation-1.18e+00

Pos logit contributions
cloth+1.009
omas+0.986
leneck+0.975
 interviewer+0.969
 filler+0.948
Neg logit contributions
ulate-1.010
Alert-1.006
aic-0.970
ents-0.970
imum-0.950
Token a
Token ID257
Feature activation-1.02e-01

Pos logit contributions
cloth+5.004
omas+4.854
 interviewer+4.754
 filler+4.626
leneck+4.470
Neg logit contributions
Alert-8.559
ulate-8.380
imum-8.289
ents-8.284
aic-8.194
Token,
Token ID11
Feature activation-5.53e-01

Pos logit contributions
omas+3.003
cloth+2.932
 interviewer+2.851
 filler+2.819
leneck+2.779
Neg logit contributions
Alert-4.076
ulate-4.050
aic-3.974
PLIED-3.965
imum-3.943
Token says
Token ID1139
Feature activation-6.87e-01

Pos logit contributions
cloth+3.101
omas+3.046
 interviewer+3.011
 filler+2.948
leneck+2.940
Neg logit contributions
Alert-3.298
ulate-3.259
imum-3.192
aic-3.185
ents-3.181
Token facility
Token ID6841
Feature activation-4.94e-01

Pos logit contributions
omas+4.108
cloth+4.097
 interviewer+4.056
 filler+3.885
leneck+3.878
Neg logit contributions
Alert-3.824
ulate-3.731
aic-3.619
imum-3.606
ents-3.592
Token director
Token ID3437
Feature activation-3.08e-01

Pos logit contributions
cloth+2.401
omas+2.398
 interviewer+2.371
leneck+2.260
 filler+2.245
Neg logit contributions
Alert-3.337
ulate-3.304
aic-3.237
ents-3.214
imum-3.200
Token Tony
Token ID8832
Feature activation+1.15e+00

Pos logit contributions
cloth+1.706
omas+1.701
 interviewer+1.638
leneck+1.602
 filler+1.582
Neg logit contributions
Alert-1.908
ulate-1.893
aic-1.840
ents-1.820
PLIED-1.818
Token Be
Token ID1355
Feature activation-1.95e+00

Pos logit contributions
Alert+6.368
ulate+6.226
aic+6.160
ents+6.081
imum+5.947
Neg logit contributions
cloth-6.753
 interviewer-6.507
leneck-6.476
omas-6.456
 filler-6.438
Tokenasley
Token ID30705
Feature activation-2.93e-01

Pos logit contributions
omas+9.901
cloth+9.831
auc+9.831
ilet+9.038
bsite+8.981
Neg logit contributions
PLIED-10.882
Alert-10.620
ulate-9.988
 sidx-9.788
imum-9.746
Token.
Token ID13
Feature activation+3.46e-01

Pos logit contributions
cloth+1.556
omas+1.522
 interviewer+1.462
leneck+1.452
 filler+1.426
Neg logit contributions
Alert-1.886
ulate-1.877
aic-1.818
ents-1.808
imum-1.805
Token
Token ID198
Feature activation+5.97e-01

Pos logit contributions
Alert+2.365
ulate+2.323
ents+2.271
aic+2.264
imum+2.235
Neg logit contributions
cloth-1.719
omas-1.693
leneck-1.642
 interviewer-1.638
 filler-1.604
Token
Token ID198
Feature activation+1.06e-01

Pos logit contributions
Alert+4.236
ulate+4.129
ents+4.105
aic+4.035
dict+3.969
Neg logit contributions
 challeng-2.861
 destro-2.663
leneck-2.647
omas-2.598
 filler-2.596
TokenAt
Token ID2953
Feature activation+4.39e-01

Pos logit contributions
Alert+0.564
ulate+0.561
ents+0.540
aic+0.539
imum+0.535
Neg logit contributions
cloth-0.686
omas-0.675
 interviewer-0.657
leneck-0.653
 filler-0.644
Token the
Token ID262
Feature activation-1.53e-01

Pos logit contributions
cloth+0.865
omas+0.845
 interviewer+0.827
 filler+0.813
leneck+0.813
Neg logit contributions
Alert-0.935
ulate-0.915
ents-0.895
aic-0.888
imum-0.886
Token sugar
Token ID7543
Feature activation-2.21e-01

Pos logit contributions
cloth+0.886
omas+0.872
 interviewer+0.854
leneck+0.841
 filler+0.839
Neg logit contributions
Alert-0.913
ulate-0.895
ents-0.874
aic-0.868
imum-0.866
Token cane
Token ID33009
Feature activation-5.91e-01

Pos logit contributions
cloth+1.240
omas+1.199
 interviewer+1.158
 filler+1.152
leneck+1.137
Neg logit contributions
Alert-1.381
ulate-1.357
ents-1.333
aic-1.327
imum-1.320
Token plantations
Token ID49441
Feature activation+2.15e-01

Pos logit contributions
cloth+3.267
omas+3.178
 filler+3.121
 interviewer+3.089
leneck+2.952
Neg logit contributions
Alert-3.684
ulate-3.570
ents-3.552
imum-3.514
aic-3.508
Token,
Token ID11
Feature activation-5.48e-01

Pos logit contributions
Alert+1.332
ulate+1.325
aic+1.281
ents+1.281
imum+1.266
Neg logit contributions
cloth-1.201
omas-1.183
 interviewer-1.150
leneck-1.142
 filler-1.118
Token Be
Token ID1355
Feature activation-1.94e+00

Pos logit contributions
cloth+2.959
omas+2.892
 interviewer+2.819
 filler+2.801
leneck+2.799
Neg logit contributions
Alert-3.455
ulate-3.356
ents-3.293
aic-3.276
imum-3.258
Tokeneg
Token ID1533
Feature activation-5.95e-01

Pos logit contributions
cloth+6.920
auc+6.893
omas+6.649
bsite+6.136
itiz+6.126
Neg logit contributions
Alert-13.557
PLIED-13.445
ulate-13.016
 sidx-12.918
imum-12.787
Tokenle
Token ID293
Feature activation-4.12e-01

Pos logit contributions
cloth+3.098
omas+3.020
leneck+2.920
 interviewer+2.913
 filler+2.832
Neg logit contributions
Alert-3.933
ulate-3.864
aic-3.762
ents-3.759
imum-3.747
Token noted
Token ID4367
Feature activation-4.56e-01

Pos logit contributions
cloth+2.206
omas+2.180
 interviewer+2.109
leneck+2.105
 filler+2.049
Neg logit contributions
Alert-2.621
ulate-2.576
aic-2.513
ents-2.493
imum-2.482
Token.
Token ID13
Feature activation-1.22e-01

Pos logit contributions
cloth+2.176
omas+2.115
 interviewer+2.066
leneck+2.050
 filler+2.009
Neg logit contributions
Alert-3.146
ulate-3.122
aic-3.053
imum-3.028
ents-3.022
Token<|endoftext|>
Token ID50256
Feature activation-3.29e-01

Pos logit contributions
cloth+0.452
omas+0.439
 interviewer+0.420
leneck+0.416
 filler+0.406
Neg logit contributions
Alert-0.980
ulate-0.975
aic-0.950
ents-0.949
imum-0.944
Token's
Token ID338
Feature activation-3.91e-01

Pos logit contributions
Alert+0.727
ulate+0.719
aic+0.698
ents+0.695
imum+0.681
Neg logit contributions
cloth-0.856
omas-0.845
leneck-0.822
 interviewer-0.820
 filler-0.814
Token Basketball
Token ID25911
Feature activation+7.33e-02

Pos logit contributions
cloth+2.408
omas+2.383
 interviewer+2.353
 filler+2.291
leneck+2.276
Neg logit contributions
Alert-2.150
ulate-2.144
ents-2.075
PLIED-2.066
imum-2.050
Token Player
Token ID7853
Feature activation-2.05e-01

Pos logit contributions
Alert+0.464
ulate+0.459
aic+0.446
ents+0.445
imum+0.440
Neg logit contributions
cloth-0.402
omas-0.396
leneck-0.382
 interviewer-0.381
 filler-0.373
Token of
Token ID286
Feature activation-3.77e-01

Pos logit contributions
cloth+1.154
omas+1.122
 interviewer+1.113
leneck+1.090
 filler+1.082
Neg logit contributions
Alert-1.251
ulate-1.245
ents-1.207
aic-1.198
PLIED-1.191
Token the
Token ID262
Feature activation+2.27e-02

Pos logit contributions
cloth+2.102
omas+2.070
 interviewer+2.009
leneck+1.996
 filler+1.962
Neg logit contributions
Alert-2.352
ulate-2.330
ents-2.255
aic-2.255
imum-2.232
Token Year
Token ID6280
Feature activation-1.93e+00

Pos logit contributions
Alert+0.099
ulate+0.098
aic+0.093
ents+0.093
imum+0.092
Neg logit contributions
cloth-0.168
omas-0.166
 interviewer-0.163
leneck-0.161
 filler-0.160
Token by
Token ID416
Feature activation-3.18e-01

Pos logit contributions
 interviewer+8.167
cloth+8.126
omas+8.039
 filler+7.927
 dich+7.595
Neg logit contributions
ulate-11.761
Alert-11.659
PLIED-11.636
imum-11.614
aic-11.492
Token both
Token ID1111
Feature activation-1.90e-01

Pos logit contributions
cloth+1.977
omas+1.959
 interviewer+1.953
leneck+1.884
 filler+1.875
Neg logit contributions
Alert-1.746
ulate-1.723
aic-1.649
ents-1.649
imum-1.635
Token the
Token ID262
Feature activation+2.18e-01

Pos logit contributions
cloth+1.206
omas+1.188
 interviewer+1.176
leneck+1.142
 filler+1.139
Neg logit contributions
Alert-1.024
ulate-1.014
ents-0.969
aic-0.966
imum-0.964
Token ACC
Token ID15859
Feature activation+6.20e-01

Pos logit contributions
Alert+1.170
ulate+1.156
aic+1.118
ents+1.114
imum+1.100
Neg logit contributions
cloth-1.397
omas-1.378
leneck-1.338
 interviewer-1.335
 filler-1.315
Token coaches
Token ID11070
Feature activation+1.87e-01

Pos logit contributions
Alert+3.731
ulate+3.661
aic+3.586
ents+3.570
imum+3.475
Neg logit contributions
cloth-3.535
omas-3.460
leneck-3.369
 interviewer-3.320
 filler-3.316
Token which
Token ID543
Feature activation+2.98e-01

Pos logit contributions
Alert+0.424
ulate+0.419
aic+0.408
ents+0.408
imum+0.407
Neg logit contributions
cloth-0.285
omas-0.280
 interviewer-0.272
leneck-0.268
 filler-0.265
Token shares
Token ID7303
Feature activation-5.84e-01

Pos logit contributions
Alert+1.980
ulate+1.967
aic+1.906
ents+1.905
imum+1.888
Neg logit contributions
cloth-1.530
omas-1.501
 interviewer-1.454
leneck-1.445
 filler-1.415
Token some
Token ID617
Feature activation+1.92e-01

Pos logit contributions
cloth+2.967
omas+2.955
 interviewer+2.881
 filler+2.876
leneck+2.836
Neg logit contributions
Alert-3.773
imum-3.645
ulate-3.638
aic-3.606
ents-3.591
Token characteristics
Token ID9695
Feature activation+6.20e-02

Pos logit contributions
Alert+1.315
ulate+1.304
aic+1.266
ents+1.266
imum+1.256
Neg logit contributions
cloth-0.957
omas-0.939
 interviewer-0.910
leneck-0.901
 filler-0.886
Token with
Token ID351
Feature activation-7.33e-02

Pos logit contributions
Alert+0.357
ulate+0.351
aic+0.338
ents+0.338
imum+0.337
Neg logit contributions
cloth-0.376
omas-0.371
 interviewer-0.361
leneck-0.357
 filler-0.354
Token Be
Token ID1355
Feature activation-1.93e+00

Pos logit contributions
cloth+0.407
omas+0.401
 interviewer+0.392
leneck+0.388
 filler+0.384
Neg logit contributions
Alert-0.455
ulate-0.449
ents-0.438
aic-0.437
imum-0.434
Tokeneth
Token ID2788
Feature activation-1.28e+00

Pos logit contributions
auc+9.607
omas+9.319
cloth+8.998
bsite+8.891
cho+8.786
Neg logit contributions
PLIED-10.586
 sidx-10.436
Alert-10.273
ulate-10.104
imum-9.923
Tokenoven
Token ID16206
Feature activation+3.72e-01

Pos logit contributions
omas+5.103
auc+5.078
leneck+4.873
cloth+4.770
kamp+4.408
Neg logit contributions
Alert-9.123
PLIED-8.944
ulate-8.852
imum-8.649
dict-8.624
Token
Token ID447
Feature activation-3.70e-01

Pos logit contributions
Alert+2.306
ulate+2.286
ents+2.215
aic+2.211
imum+2.190
Neg logit contributions
cloth-2.079
omas-2.049
 interviewer-1.989
leneck-1.973
 filler-1.938
Token
Token ID247
Feature activation+1.59e-01

Pos logit contributions
cloth+1.425
omas+1.381
 interviewer+1.340
leneck+1.329
 filler+1.294
Neg logit contributions
Alert-2.884
ulate-2.854
aic-2.827
imum-2.801
ents-2.783
Tokens
Token ID82
Feature activation-9.55e-02

Pos logit contributions
Alert+0.951
ulate+0.939
aic+0.910
ents+0.906
imum+0.904
Neg logit contributions
cloth-0.924
omas-0.909
 interviewer-0.883
leneck-0.880
 filler-0.865
Token the
Token ID262
Feature activation+7.94e-01

Pos logit contributions
Alert+0.822
ulate+0.813
ents+0.786
aic+0.785
imum+0.779
Neg logit contributions
cloth-0.842
omas-0.829
 interviewer-0.809
leneck-0.801
 filler-0.790
Token Melbourne
Token ID14819
Feature activation+9.80e-01

Pos logit contributions
Alert+4.003
ulate+3.965
aic+3.803
ents+3.786
imum+3.668
Neg logit contributions
cloth-5.297
omas-5.230
leneck-5.060
 interviewer-5.013
bsite-4.954
Token Mag
Token ID2944
Feature activation+1.45e+00

Pos logit contributions
Alert+4.335
ulate+4.264
aic+4.086
ents+4.024
PLIED+3.925
Neg logit contributions
cloth-7.018
omas-7.008
leneck-6.780
bsite-6.681
 interviewer-6.653
Tokenistrates
Token ID37909
Feature activation+2.38e-01

Pos logit contributions
ents+1.004
aic+0.888
istrates+0.794
ulate+0.782
Alert+0.752
Neg logit contributions
cloth-15.013
leneck-14.903
omas-14.888
bsite-14.729
 filler-14.703
Token Court
Token ID3078
Feature activation+6.77e-01

Pos logit contributions
Alert+1.383
ulate+1.369
aic+1.322
ents+1.317
imum+1.310
Neg logit contributions
cloth-1.431
omas-1.408
 interviewer-1.376
leneck-1.363
 filler-1.345
Token this
Token ID428
Feature activation+1.53e+00

Pos logit contributions
Alert+3.875
ulate+3.826
ents+3.722
aic+3.691
PLIED+3.598
Neg logit contributions
cloth-3.981
omas-3.961
leneck-3.858
 interviewer-3.817
bsite-3.754
Token afternoon
Token ID6672
Feature activation+1.47e+00

Pos logit contributions
ulate+2.643
Alert+2.600
ents+2.273
dict+2.169
PLIED+2.138
Neg logit contributions
cloth-14.370
omas-14.203
leneck-14.122
bsite-13.916
 interviewer-13.670
Token,
Token ID11
Feature activation+1.00e+00

Pos logit contributions
Alert+7.579
ents+7.477
ulate+7.401
dict+7.157
PLIED+7.093
Neg logit contributions
omas-8.640
cloth-8.417
leneck-8.409
 interviewer-8.254
bsite-8.254
Token where
Token ID810
Feature activation-1.50e-01

Pos logit contributions
Alert+5.401
ulate+5.348
ents+5.254
PLIED+5.108
dict+5.093
Neg logit contributions
omas-6.020
cloth-5.921
leneck-5.802
 interviewer-5.620
bsite-5.618
Token he
Token ID339
Feature activation+7.43e-01

Pos logit contributions
cloth+0.577
omas+0.561
 interviewer+0.549
leneck+0.528
 filler+0.524
Neg logit contributions
Alert-1.199
ulate-1.184
imum-1.158
aic-1.157
ents-1.151
Token was
Token ID373
Feature activation+1.19e+00

Pos logit contributions
ulate+4.504
Alert+4.499
ents+4.466
aic+4.281
PLIED+4.271
Neg logit contributions
omas-3.982
cloth-3.971
leneck-3.894
 interviewer-3.824
 filler-3.750
Token from
Token ID422
Feature activation+4.71e-01

Pos logit contributions
Alert+0.681
ulate+0.674
aic+0.656
ents+0.654
imum+0.650
Neg logit contributions
cloth-0.542
omas-0.532
 interviewer-0.515
leneck-0.510
 filler-0.503
Token here
Token ID994
Feature activation+4.76e-01

Pos logit contributions
Alert+3.005
ulate+3.003
ents+2.892
aic+2.888
imum+2.862
Neg logit contributions
cloth-2.503
omas-2.471
leneck-2.400
 interviewer-2.379
 filler-2.313
Token.
Token ID13
Feature activation+6.00e-01

Pos logit contributions
Alert+2.971
ulate+2.967
ents+2.872
aic+2.848
imum+2.827
Neg logit contributions
cloth-2.612
omas-2.577
 interviewer-2.506
leneck-2.488
 filler-2.435
Token Anton
Token ID9261
Feature activation+2.33e-01

Pos logit contributions
Alert+3.489
ulate+3.372
ents+3.336
aic+3.293
imum+3.261
Neg logit contributions
omas-3.519
cloth-3.491
leneck-3.453
 interviewer-3.389
 filler-3.354
Token has
Token ID468
Feature activation+2.36e-01

Pos logit contributions
Alert+1.517
ulate+1.504
aic+1.457
ents+1.457
imum+1.444
Neg logit contributions
cloth-1.230
omas-1.209
 interviewer-1.172
leneck-1.163
 filler-1.143
Token put
Token ID1234
Feature activation+1.41e+00

Pos logit contributions
Alert+1.422
ulate+1.416
ents+1.369
aic+1.363
imum+1.350
Neg logit contributions
cloth-1.355
omas-1.335
leneck-1.292
 interviewer-1.291
 filler-1.257
Token in
Token ID287
Feature activation-1.58e-02

Pos logit contributions
ulate+7.368
Alert+7.008
ents+6.997
aic+6.761
imum+6.703
Neg logit contributions
omas-8.424
cloth-8.407
leneck-8.129
sbm-7.989
 interviewer-7.977
Token a
Token ID257
Feature activation+7.46e-01

Pos logit contributions
cloth+0.089
omas+0.088
 interviewer+0.085
leneck+0.084
 filler+0.083
Neg logit contributions
Alert-0.098
ulate-0.097
aic-0.094
ents-0.094
imum-0.093
Token lot
Token ID1256
Feature activation+3.74e-01

Pos logit contributions
ulate+4.578
Alert+4.518
ents+4.379
imum+4.350
aic+4.288
Neg logit contributions
cloth-4.091
omas-4.019
leneck-3.922
 interviewer-3.815
auc-3.718
Token of
Token ID286
Feature activation+6.61e-01

Pos logit contributions
Alert+2.158
ulate+2.142
ents+2.066
aic+2.063
imum+2.043
Neg logit contributions
cloth-2.241
omas-2.210
 interviewer-2.150
leneck-2.139
 filler-2.095
Token effort
Token ID3626
Feature activation+5.48e-01

Pos logit contributions
ulate+3.749
Alert+3.689
ents+3.613
aic+3.565
imum+3.509
Neg logit contributions
cloth-3.966
omas-3.889
leneck-3.821
 interviewer-3.714
auc-3.666
Token New
Token ID968
Feature activation+9.51e-01

Pos logit contributions
Alert+0.378
ulate+0.375
ents+0.362
aic+0.360
imum+0.357
Neg logit contributions
cloth-0.406
omas-0.398
 interviewer-0.388
leneck-0.383
 filler-0.381
Token American
Token ID1605
Feature activation+3.80e-01

Pos logit contributions
Alert+5.378
ulate+5.236
aic+5.170
imum+5.017
ents+4.967
Neg logit contributions
omas-5.544
leneck-5.535
cloth-5.526
 interviewer-5.359
 filler-5.303
Token Translation
Token ID33322
Feature activation+1.45e-01

Pos logit contributions
Alert+2.124
ulate+2.104
aic+2.049
ents+2.023
imum+2.015
Neg logit contributions
cloth-2.352
omas-2.313
leneck-2.260
 interviewer-2.251
 filler-2.206
Token,
Token ID11
Feature activation+3.93e-01

Pos logit contributions
Alert+0.856
ulate+0.846
ents+0.817
aic+0.816
imum+0.810
Neg logit contributions
cloth-0.862
omas-0.846
 interviewer-0.824
leneck-0.813
 filler-0.806
Token Brown
Token ID4373
Feature activation+5.55e-01

Pos logit contributions
Alert+2.265
ulate+2.261
aic+2.183
ents+2.175
imum+2.150
Neg logit contributions
cloth-2.325
omas-2.304
leneck-2.260
 interviewer-2.227
 filler-2.173
Token Jud
Token ID4794
Feature activation+1.22e+00

Pos logit contributions
Alert+3.259
ulate+3.236
ents+3.132
aic+3.120
imum+3.081
Neg logit contributions
cloth-3.231
omas-3.222
 interviewer-3.135
leneck-3.116
 filler-3.066
Tokenaic
Token ID18452
Feature activation+8.16e-01

Pos logit contributions
aic+6.514
Alert+6.120
ents+6.047
ulate+5.875
imum+5.725
Neg logit contributions
cloth-7.816
leneck-7.578
 filler-7.529
 interviewer-7.503
omas-7.450
Token Studies
Token ID10422
Feature activation+1.94e-01

Pos logit contributions
Alert+4.309
ulate+4.288
aic+4.195
ents+4.129
imum+4.054
Neg logit contributions
cloth-5.146
omas-5.059
leneck-5.057
 interviewer-4.960
 filler-4.890
Token,
Token ID11
Feature activation+3.39e-01

Pos logit contributions
Alert+1.052
ulate+1.043
aic+1.004
ents+1.001
imum+0.994
Neg logit contributions
cloth-1.236
omas-1.213
 interviewer-1.187
leneck-1.170
 filler-1.160
Token no
Token ID645
Feature activation-8.21e-01

Pos logit contributions
Alert+2.209
ulate+2.192
aic+2.124
ents+2.120
imum+2.097
Neg logit contributions
cloth-1.776
omas-1.758
leneck-1.713
 interviewer-1.686
 filler-1.642
Token.
Token ID13
Feature activation+4.06e-02

Pos logit contributions
cloth+3.479
omas+3.465
 interviewer+3.373
 filler+3.313
bsite+3.177
Neg logit contributions
Alert-6.038
ulate-5.900
aic-5.772
ents-5.694
PLIED-5.646
TokenWidth
Token ID30916
Feature activation+1.14e+00

Pos logit contributions
Alert+5.562
ulate+5.269
ents+5.002
aic+4.784
imum+4.746
Neg logit contributions
omas-10.173
 interviewer-10.080
leneck-9.780
cloth-9.763
 filler-9.649
Token),
Token ID828
Feature activation+8.10e-01

Pos logit contributions
Alert+4.738
ulate+4.582
ents+4.480
aic+4.386
imum+4.318
Neg logit contributions
omas-8.239
cloth-8.191
 interviewer-8.123
leneck-8.084
bsite-7.899
Token($
Token ID16763
Feature activation-9.87e-02

Pos logit contributions
Alert+5.258
ulate+5.180
ents+5.059
aic+5.003
imum+4.893
Neg logit contributions
omas-4.173
cloth-4.150
 interviewer-4.034
leneck-3.994
 filler-3.896
TokenSR
Token ID12562
Feature activation+6.43e-01

Pos logit contributions
cloth+0.799
omas+0.781
leneck+0.765
 interviewer+0.764
bsite+0.755
Neg logit contributions
ulate-0.361
Alert-0.360
ents-0.345
aic-0.344
imum-0.340
Token.
Token ID13
Feature activation+9.89e-01

Pos logit contributions
Alert+3.321
ulate+3.186
ents+3.159
aic+3.052
dict+3.000
Neg logit contributions
 challeng-4.336
omas-4.181
 interviewer-4.104
leneck-4.086
 filler-4.054
TokenHeight
Token ID23106
Feature activation+1.53e+00

Pos logit contributions
Alert+3.412
ulate+3.140
ents+3.028
imum+2.891
aic+2.873
Neg logit contributions
omas-8.213
cloth-8.119
 interviewer-8.096
 filler-7.919
leneck-7.880
Token)))
Token ID22305
Feature activation+1.34e+00

Pos logit contributions
Alert+5.312
ulate+5.077
ents+4.891
aic+4.816
PLIED+4.652
Neg logit contributions
omas-11.607
cloth-11.598
 interviewer-11.465
leneck-11.435
bsite-11.176
Token )
Token ID1267
Feature activation+1.47e+00

Pos logit contributions
Alert+5.972
ulate+5.894
ents+5.803
aic+5.561
imum+5.494
Neg logit contributions
cloth-8.976
omas-8.881
leneck-8.880
 interviewer-8.875
bsite-8.631
Token }
Token ID1782
Feature activation+8.04e-01

Pos logit contributions
Alert+6.805
ulate+6.793
ents+6.653
aic+6.383
imum+6.261
Neg logit contributions
cloth-9.478
leneck-9.266
omas-9.243
 interviewer-9.206
bsite-8.950
Token }
Token ID1782
Feature activation+8.59e-01

Pos logit contributions
Alert+4.437
ulate+4.416
ents+4.253
aic+4.172
imum+4.054
Neg logit contributions
cloth-4.901
omas-4.861
leneck-4.768
 interviewer-4.709
bsite-4.561
Token Catch
Token ID25750
Feature activation+7.06e-01

Pos logit contributions
Alert+4.914
ulate+4.874
ents+4.671
aic+4.570
dict+4.477
Neg logit contributions
cloth-5.063
omas-5.050
leneck-4.922
 interviewer-4.840
bsite-4.688
Token.
Token ID13
Feature activation+4.19e-01

Pos logit contributions
Alert+2.426
ulate+2.410
ents+2.322
aic+2.322
imum+2.288
Neg logit contributions
cloth-2.270
omas-2.238
 interviewer-2.174
leneck-2.168
 filler-2.119
Token
Token ID198
Feature activation+5.61e-02

Pos logit contributions
Alert+2.917
ulate+2.856
ents+2.798
aic+2.785
imum+2.748
Neg logit contributions
cloth-2.015
omas-2.000
leneck-1.933
 interviewer-1.913
 filler-1.868
Token
Token ID198
Feature activation+3.80e-01

Pos logit contributions
Alert+0.450
ulate+0.442
ents+0.436
aic+0.432
imum+0.424
Neg logit contributions
omas-0.212
cloth-0.211
leneck-0.208
 interviewer-0.205
 filler-0.202
TokenNow
Token ID3844
Feature activation+7.39e-02

Pos logit contributions
Alert+1.838
ulate+1.775
ents+1.713
aic+1.709
imum+1.676
Neg logit contributions
cloth-2.663
omas-2.647
 interviewer-2.595
leneck-2.583
 filler-2.553
Token the
Token ID262
Feature activation-5.69e-02

Pos logit contributions
Alert+0.395
ulate+0.389
aic+0.377
ents+0.374
imum+0.374
Neg logit contributions
cloth-0.477
omas-0.468
 interviewer-0.461
leneck-0.453
 filler-0.450
Token kids
Token ID3988
Feature activation+1.24e+00

Pos logit contributions
cloth+0.292
omas+0.286
 interviewer+0.282
leneck+0.275
 filler+0.274
Neg logit contributions
Alert-0.380
ulate-0.376
aic-0.365
ents-0.364
imum-0.363
Token think
Token ID892
Feature activation+1.47e+00

Pos logit contributions
ulate+6.989
Alert+6.898
ents+6.785
aic+6.608
dict+6.303
Neg logit contributions
cloth-6.943
omas-6.889
leneck-6.868
auc-6.562
 interviewer-6.557
Token there
Token ID612
Feature activation+1.14e-01

Pos logit contributions
ulate+7.486
Alert+7.450
ents+7.147
aic+7.008
imum+6.923
Neg logit contributions
leneck-8.817
cloth-8.786
omas-8.725
sbm-8.349
auc-8.282
Token are
Token ID389
Feature activation+2.09e-01

Pos logit contributions
Alert+0.711
ulate+0.705
aic+0.683
ents+0.681
imum+0.676
Neg logit contributions
cloth-0.636
omas-0.625
 interviewer-0.608
leneck-0.602
 filler-0.594
Token going
Token ID1016
Feature activation-1.53e-01

Pos logit contributions
Alert+1.304
ulate+1.296
aic+1.253
ents+1.253
imum+1.238
Neg logit contributions
cloth-1.165
omas-1.145
 interviewer-1.109
leneck-1.107
 filler-1.079
Token to
Token ID284
Feature activation-4.94e-01

Pos logit contributions
cloth+0.816
omas+0.803
leneck+0.782
 interviewer+0.779
 filler+0.759
Neg logit contributions
Alert-0.992
ulate-0.985
ents-0.960
aic-0.954
imum-0.943
Token member
Token ID2888
Feature activation-2.62e-01

Pos logit contributions
cloth+0.469
omas+0.459
 interviewer+0.452
leneck+0.442
 filler+0.438
Neg logit contributions
Alert-0.604
ulate-0.598
aic-0.581
ents-0.580
imum-0.577
Token of
Token ID286
Feature activation+7.97e-02

Pos logit contributions
cloth+1.633
omas+1.621
leneck+1.574
 interviewer+1.560
 filler+1.536
Neg logit contributions
Alert-1.459
ulate-1.450
ents-1.397
aic-1.388
imum-1.376
Token the
Token ID262
Feature activation+2.17e-01

Pos logit contributions
Alert+0.506
ulate+0.501
ents+0.486
aic+0.486
imum+0.482
Neg logit contributions
cloth-0.434
omas-0.426
 interviewer-0.416
leneck-0.411
 filler-0.405
Token Senate
Token ID3845
Feature activation-1.53e-01

Pos logit contributions
Alert+1.293
ulate+1.281
ents+1.239
aic+1.234
imum+1.214
Neg logit contributions
cloth-1.274
omas-1.256
leneck-1.216
 interviewer-1.212
 filler-1.184
Token Environment
Token ID9344
Feature activation-7.22e-01

Pos logit contributions
cloth+0.837
omas+0.820
 interviewer+0.803
leneck+0.788
 filler+0.785
Neg logit contributions
Alert-0.960
ulate-0.954
imum-0.928
aic-0.926
ents-0.919
Token and
Token ID290
Feature activation+2.24e-01

Pos logit contributions
omas+3.097
cloth+3.036
 interviewer+2.980
leneck+2.939
 filler+2.936
Neg logit contributions
ulate-5.174
aic-5.129
Alert-5.126
imum-5.020
ents-5.002
Token Public
Token ID5094
Feature activation+1.98e-01

Pos logit contributions
Alert+1.009
ulate+1.001
aic+0.932
ents+0.930
imum+0.898
Neg logit contributions
cloth-1.649
omas-1.626
leneck-1.601
 interviewer-1.591
 filler-1.548
Token Works
Token ID10933
Feature activation-6.91e-01

Pos logit contributions
Alert+1.072
ulate+1.054
ents+1.003
aic+1.001
imum+0.992
Neg logit contributions
cloth-1.286
omas-1.259
leneck-1.243
 interviewer-1.234
 filler-1.204
Token Committee
Token ID4606
Feature activation-2.74e-01

Pos logit contributions
cloth+2.942
omas+2.876
 interviewer+2.782
 filler+2.733
leneck+2.671
Neg logit contributions
Alert-5.042
ulate-4.998
aic-4.932
imum-4.928
PLIED-4.911
Token,
Token ID11
Feature activation-1.06e+00

Pos logit contributions
cloth+1.836
omas+1.805
 interviewer+1.782
leneck+1.747
 filler+1.739
Neg logit contributions
Alert-1.362
ulate-1.358
aic-1.325
imum-1.301
PLIED-1.286
Token displayed
Token ID9066
Feature activation-3.15e-01

Pos logit contributions
cloth+5.550
 interviewer+5.501
omas+5.351
 filler+5.238
auc+5.217
Neg logit contributions
Alert-6.541
ulate-6.464
imum-6.371
aic-6.350
PLIED-6.227
Token of
Token ID286
Feature activation+4.11e-01

Pos logit contributions
cloth+0.165
omas+0.161
 interviewer+0.158
leneck+0.155
 filler+0.155
Neg logit contributions
Alert-0.183
ulate-0.179
aic-0.174
ents-0.173
imum-0.173
Token fun
Token ID1257
Feature activation+3.13e-01

Pos logit contributions
ulate+2.494
Alert+2.470
ents+2.412
aic+2.392
imum+2.359
Neg logit contributions
cloth-2.334
omas-2.296
leneck-2.234
 interviewer-2.182
bsite-2.144
Token with
Token ID351
Feature activation+3.06e-01

Pos logit contributions
Alert+1.835
ulate+1.827
ents+1.769
aic+1.754
imum+1.730
Neg logit contributions
cloth-1.834
omas-1.816
 interviewer-1.755
leneck-1.752
 filler-1.709
Token it
Token ID340
Feature activation+4.12e-01

Pos logit contributions
Alert+1.802
ulate+1.795
ents+1.737
aic+1.727
imum+1.710
Neg logit contributions
cloth-1.802
omas-1.772
 interviewer-1.716
leneck-1.713
 filler-1.680
Token.
Token ID13
Feature activation+3.49e-01

Pos logit contributions
Alert+2.591
ulate+2.577
ents+2.490
aic+2.484
imum+2.447
Neg logit contributions
cloth-2.240
omas-2.210
 interviewer-2.137
leneck-2.134
 filler-2.081
Token I
Token ID314
Feature activation+8.25e-01

Pos logit contributions
Alert+2.254
ulate+2.204
ents+2.164
aic+2.147
imum+2.122
Neg logit contributions
cloth-1.862
omas-1.852
leneck-1.791
 interviewer-1.780
 filler-1.747
Token think
Token ID892
Feature activation+1.55e-01

Pos logit contributions
ulate+4.540
Alert+4.411
ents+4.366
aic+4.329
dict+4.157
Neg logit contributions
omas-4.892
cloth-4.890
leneck-4.758
sbm-4.672
 interviewer-4.666
Token people
Token ID661
Feature activation+8.39e-02

Pos logit contributions
Alert+1.106
ulate+1.100
ents+1.069
aic+1.067
imum+1.057
Neg logit contributions
cloth-0.726
omas-0.712
 interviewer-0.685
leneck-0.683
 filler-0.666
Token love
Token ID1842
Feature activation-1.75e-01

Pos logit contributions
Alert+0.504
ulate+0.501
ents+0.484
aic+0.484
imum+0.477
Neg logit contributions
cloth-0.485
omas-0.478
 interviewer-0.463
leneck-0.462
 filler-0.453
Token him
Token ID683
Feature activation+1.98e-01

Pos logit contributions
cloth+0.900
omas+0.893
 interviewer+0.872
 filler+0.850
leneck+0.841
Neg logit contributions
Alert-1.159
ulate-1.140
aic-1.117
imum-1.110
ents-1.105
Token there
Token ID612
Feature activation+5.72e-02

Pos logit contributions
Alert+1.236
ulate+1.226
ents+1.186
aic+1.185
imum+1.173
Neg logit contributions
cloth-1.104
omas-1.087
 interviewer-1.055
leneck-1.048
 filler-1.029
Token up
Token ID510
Feature activation-5.35e-01

Pos logit contributions
cloth+2.922
omas+2.864
 interviewer+2.811
 filler+2.786
leneck+2.780
Neg logit contributions
Alert-2.786
imum-2.727
ulate-2.713
aic-2.687
ents-2.652
Token in
Token ID287
Feature activation+2.44e-02

Pos logit contributions
cloth+2.666
 interviewer+2.542
omas+2.529
 filler+2.483
leneck+2.429
Neg logit contributions
ulate-3.616
Alert-3.608
imum-3.552
aic-3.501
ents-3.445
Token foster
Token ID17016
Feature activation+1.14e-01

Pos logit contributions
Alert+0.150
ulate+0.149
ents+0.144
aic+0.144
imum+0.143
Neg logit contributions
cloth-0.138
omas-0.136
 interviewer-0.132
leneck-0.131
 filler-0.129
Token care
Token ID1337
Feature activation-1.27e-01

Pos logit contributions
Alert+0.713
ulate+0.707
aic+0.685
ents+0.683
imum+0.679
Neg logit contributions
cloth-0.634
omas-0.624
 interviewer-0.605
leneck-0.600
 filler-0.592
Token.
Token ID13
Feature activation+1.16e-01

Pos logit contributions
cloth+0.657
omas+0.646
 interviewer+0.619
leneck+0.612
 filler+0.608
Neg logit contributions
Alert-0.853
ulate-0.843
aic-0.822
imum-0.818
ents-0.813
Token
Token ID564
Feature activation+1.43e-01

Pos logit contributions
Alert+0.873
ulate+0.866
aic+0.843
ents+0.843
imum+0.836
Neg logit contributions
cloth-0.493
omas-0.482
 interviewer-0.464
leneck-0.460
 filler-0.450
Token
Token ID250
Feature activation-5.30e-01

Pos logit contributions
Alert+0.947
ulate+0.945
aic+0.914
ents+0.911
imum+0.908
Neg logit contributions
cloth-0.736
omas-0.726
 interviewer-0.701
leneck-0.695
 filler-0.685
TokenThis
Token ID1212
Feature activation+1.45e-01

Pos logit contributions
cloth+3.264
omas+3.176
leneck+3.059
 interviewer+3.030
auc+2.999
Neg logit contributions
ulate-2.926
imum-2.847
Alert-2.827
ents-2.815
aic-2.809
Token gift
Token ID6979
Feature activation+6.57e-01

Pos logit contributions
Alert+0.963
ulate+0.955
ents+0.926
aic+0.925
imum+0.916
Neg logit contributions
cloth-0.745
omas-0.732
 interviewer-0.708
leneck-0.704
 filler-0.690
Token means
Token ID1724
Feature activation-4.88e-01

Pos logit contributions
Alert+3.811
ulate+3.774
ents+3.669
aic+3.580
imum+3.549
Neg logit contributions
cloth-3.807
omas-3.783
leneck-3.691
 interviewer-3.645
 filler-3.572
Token nont
Token ID45930
Feature activation-1.37e-01

Pos logit contributions
cloth+2.544
omas+2.501
 interviewer+2.435
 filler+2.385
leneck+2.367
Neg logit contributions
Alert-3.168
ulate-3.128
aic-3.073
imum-3.037
ents-3.037
Token takes
Token ID2753
Feature activation+9.68e-02

Pos logit contributions
Alert+2.842
ulate+2.816
ents+2.739
aic+2.703
imum+2.698
Neg logit contributions
cloth-2.566
omas-2.524
leneck-2.488
 interviewer-2.444
 filler-2.371
Token a
Token ID257
Feature activation-2.87e-02

Pos logit contributions
Alert+0.615
ulate+0.609
aic+0.590
ents+0.590
imum+0.585
Neg logit contributions
cloth-0.529
omas-0.520
 interviewer-0.504
leneck-0.501
 filler-0.492
Token step
Token ID2239
Feature activation+5.26e-01

Pos logit contributions
cloth+0.144
omas+0.142
 interviewer+0.138
leneck+0.136
 filler+0.135
Neg logit contributions
Alert-0.195
ulate-0.193
aic-0.188
ents-0.187
imum-0.186
Token to
Token ID284
Feature activation+1.20e-01

Pos logit contributions
Alert+3.218
ulate+3.188
ents+3.099
aic+3.073
imum+3.063
Neg logit contributions
cloth-2.909
omas-2.898
 interviewer-2.798
leneck-2.786
 filler-2.715
Token the
Token ID262
Feature activation-5.02e-01

Pos logit contributions
Alert+0.750
ulate+0.742
aic+0.720
ents+0.719
imum+0.713
Neg logit contributions
cloth-0.670
omas-0.661
 interviewer-0.642
leneck-0.635
 filler-0.627
Token play
Token ID711
Feature activation-6.54e-03

Pos logit contributions
cloth+2.540
 interviewer+2.494
omas+2.442
 filler+2.431
leneck+2.401
Neg logit contributions
Alert-3.348
ulate-3.268
aic-3.213
imum-3.189
PLIED-3.183
Token side
Token ID1735
Feature activation-3.59e-01

Pos logit contributions
cloth+0.037
omas+0.036
 interviewer+0.035
leneck+0.035
 filler+0.034
Neg logit contributions
Alert-0.041
ulate-0.040
ents-0.039
aic-0.039
imum-0.039
Token.
Token ID13
Feature activation-8.16e-03

Pos logit contributions
cloth+2.000
omas+1.912
 interviewer+1.890
 filler+1.866
leneck+1.863
Neg logit contributions
Alert-2.249
ulate-2.196
aic-2.156
ents-2.148
PLIED-2.135
Token B
Token ID347
Feature activation-5.12e-01

Pos logit contributions
cloth+0.049
omas+0.048
 interviewer+0.047
leneck+0.046
 filler+0.046
Neg logit contributions
Alert-0.047
ulate-0.047
aic-0.045
ents-0.045
imum-0.045
Tokenugg
Token ID6837
Feature activation+4.00e-01

Pos logit contributions
cloth+1.679
omas+1.678
leneck+1.549
 interviewer+1.532
auc+1.522
Neg logit contributions
Alert-4.271
ulate-4.217
aic-4.178
PLIED-4.155
ents-4.108
Tokens
Token ID82
Feature activation-5.09e-02

Pos logit contributions
Alert+1.119
ulate+1.114
ents+1.104
aic+1.036
imum+1.028
Neg logit contributions
cloth-3.549
omas-3.478
 interviewer-3.454
leneck-3.426
 filler-3.402
Token community
Token ID2055
Feature activation-5.34e-02

Pos logit contributions
cloth+0.420
omas+0.412
 interviewer+0.402
leneck+0.399
 filler+0.392
Neg logit contributions
Alert-0.486
ulate-0.482
aic-0.468
ents-0.468
imum-0.465
Token activism
Token ID23034
Feature activation+1.76e-01

Pos logit contributions
cloth+0.267
omas+0.261
 interviewer+0.254
leneck+0.251
 filler+0.248
Neg logit contributions
Alert-0.363
ulate-0.360
ents-0.351
aic-0.350
imum-0.349
Token,
Token ID11
Feature activation-1.18e-01

Pos logit contributions
Alert+1.093
ulate+1.082
aic+1.049
ents+1.047
imum+1.043
Neg logit contributions
cloth-0.993
omas-0.975
 interviewer-0.946
leneck-0.941
 filler-0.927
Token a
Token ID257
Feature activation+3.69e-02

Pos logit contributions
cloth+0.585
omas+0.574
 interviewer+0.558
leneck+0.553
 filler+0.542
Neg logit contributions
Alert-0.810
ulate-0.802
ents-0.782
aic-0.780
imum-0.776
Token massive
Token ID4858
Feature activation-6.36e-01

Pos logit contributions
Alert+0.236
ulate+0.233
ents+0.226
aic+0.226
imum+0.224
Neg logit contributions
cloth-0.200
omas-0.196
 interviewer-0.191
leneck-0.189
 filler-0.186
Token infusion
Token ID35547
Feature activation+7.25e-01

Pos logit contributions
cloth+3.250
 interviewer+3.193
omas+3.117
 filler+3.084
leneck+3.051
Neg logit contributions
Alert-4.110
ulate-4.050
ents-4.042
PLIED-3.996
imum-3.990
Token of
Token ID286
Feature activation+6.21e-01

Pos logit contributions
ulate+3.843
Alert+3.825
aic+3.681
ents+3.636
imum+3.619
Neg logit contributions
cloth-4.593
omas-4.584
 interviewer-4.418
leneck-4.384
 filler-4.338
Token provincial
Token ID15818
Feature activation-1.89e-01

Pos logit contributions
ulate+3.771
Alert+3.751
aic+3.596
ents+3.554
imum+3.526
Neg logit contributions
cloth-3.470
omas-3.462
leneck-3.327
 interviewer-3.297
bsite-3.224
Token funds
Token ID5153
Feature activation-4.22e-02

Pos logit contributions
cloth+0.998
omas+0.972
 interviewer+0.947
leneck+0.930
 filler+0.929
Neg logit contributions
Alert-1.231
ulate-1.211
ents-1.198
aic-1.188
imum-1.182
Token and
Token ID290
Feature activation-1.69e-01

Pos logit contributions
cloth+0.243
omas+0.238
leneck+0.230
 interviewer+0.230
 filler+0.227
Neg logit contributions
Alert-0.257
ulate-0.252
aic-0.245
imum-0.244
ents-0.244
Token,
Token ID11
Feature activation-1.19e-01

Pos logit contributions
cloth+0.913
omas+0.895
 interviewer+0.878
leneck+0.863
 filler+0.855
Neg logit contributions
Alert-1.078
ulate-1.063
ents-1.040
aic-1.036
imum-1.033
Token effects
Token ID3048
Feature activation-2.29e-01

Pos logit contributions
cloth+0.490
omas+0.480
 interviewer+0.465
leneck+0.461
 filler+0.455
Neg logit contributions
Alert-0.574
ulate-0.569
aic-0.552
ents-0.551
imum-0.548
Token.
Token ID13
Feature activation-4.03e-01

Pos logit contributions
cloth+1.161
omas+1.142
 interviewer+1.106
leneck+1.094
 filler+1.082
Neg logit contributions
Alert-1.522
ulate-1.507
aic-1.466
ents-1.458
imum-1.451
Token
Token ID198
Feature activation-6.29e-01

Pos logit contributions
cloth+1.960
omas+1.900
 interviewer+1.861
leneck+1.821
 filler+1.805
Neg logit contributions
Alert-2.748
ulate-2.738
aic-2.672
ents-2.664
imum-2.649
Token
Token ID198
Feature activation-2.31e-01

Pos logit contributions
cloth+1.849
omas+1.781
 interviewer+1.683
leneck+1.649
 filler+1.598
Neg logit contributions
Alert-5.559
ulate-5.527
aic-5.406
ents-5.396
imum-5.376
TokenT
Token ID51
Feature activation+3.30e-01

Pos logit contributions
cloth+1.401
omas+1.367
 interviewer+1.328
leneck+1.323
 filler+1.298
Neg logit contributions
ulate-1.310
Alert-1.305
aic-1.269
ents-1.266
imum-1.256
Tokenape
Token ID1758
Feature activation-1.38e-01

Pos logit contributions
Alert+2.216
ulate+2.205
ents+2.154
aic+2.139
imum+2.118
Neg logit contributions
cloth-1.671
omas-1.634
 interviewer-1.596
leneck-1.573
 filler-1.556
Token to
Token ID284
Feature activation-3.18e-01

Pos logit contributions
cloth+0.727
omas+0.712
 interviewer+0.694
leneck+0.690
 filler+0.679
Neg logit contributions
Alert-0.892
ulate-0.884
aic-0.862
ents-0.862
imum-0.852
Token back
Token ID736
Feature activation-8.25e-02

Pos logit contributions
cloth+1.658
omas+1.619
 interviewer+1.607
leneck+1.563
 filler+1.559
Neg logit contributions
Alert-2.069
ulate-2.042
aic-2.000
ents-1.985
PLIED-1.984
Token things
Token ID1243
Feature activation+4.62e-02

Pos logit contributions
cloth+0.475
omas+0.465
 interviewer+0.453
leneck+0.450
 filler+0.443
Neg logit contributions
Alert-0.501
ulate-0.493
aic-0.480
ents-0.477
imum-0.476
Token up
Token ID510
Feature activation-1.17e-01

Pos logit contributions
Alert+0.240
ulate+0.238
ents+0.228
aic+0.228
imum+0.225
Neg logit contributions
cloth-0.306
omas-0.302
 interviewer-0.294
leneck-0.293
 filler-0.288
Token works
Token ID2499
Feature activation-2.17e-01

Pos logit contributions
cloth+0.639
omas+0.627
 interviewer+0.610
leneck+0.604
 filler+0.596
Neg logit contributions
Alert-0.737
ulate-0.730
aic-0.709
ents-0.708
imum-0.702
Token we
Token ID356
Feature activation-3.50e-01

Pos logit contributions
cloth+2.314
 interviewer+2.288
omas+2.288
 filler+2.187
leneck+2.153
Neg logit contributions
Alert-2.933
ulate-2.878
imum-2.833
aic-2.830
ents-2.791
Token got
Token ID1392
Feature activation-1.49e-01

Pos logit contributions
cloth+1.925
omas+1.876
 interviewer+1.847
 filler+1.791
leneck+1.786
Neg logit contributions
Alert-2.229
ulate-2.177
imum-2.146
aic-2.141
ents-2.093
Token back
Token ID736
Feature activation-1.97e-01

Pos logit contributions
cloth+0.774
omas+0.755
 interviewer+0.741
 filler+0.722
leneck+0.720
Neg logit contributions
Alert-0.987
ulate-0.969
aic-0.956
imum-0.947
ents-0.944
Token home
Token ID1363
Feature activation-3.58e-01

Pos logit contributions
cloth+1.106
omas+1.092
 interviewer+1.058
 filler+1.035
leneck+1.034
Neg logit contributions
Alert-1.217
ulate-1.191
aic-1.171
ents-1.153
imum-1.153
Token,
Token ID11
Feature activation-6.18e-01

Pos logit contributions
cloth+2.221
omas+2.180
 interviewer+2.133
 filler+2.085
leneck+2.070
Neg logit contributions
Alert-2.000
ulate-1.964
imum-1.926
aic-1.922
ents-1.893
Token we
Token ID356
Feature activation-5.89e-01

Pos logit contributions
cloth+3.182
omas+3.093
 interviewer+3.088
 filler+2.994
leneck+2.923
Neg logit contributions
Alert-4.051
ulate-3.953
imum-3.918
aic-3.909
ents-3.844
Token weren
Token ID6304
Feature activation-1.99e-01

Pos logit contributions
cloth+3.208
omas+3.078
 interviewer+3.028
 filler+2.950
bsite+2.927
Neg logit contributions
Alert-3.781
ulate-3.701
imum-3.699
aic-3.647
PLIED-3.554
Token't
Token ID470
Feature activation+3.63e-01

Pos logit contributions
cloth+0.804
omas+0.793
 interviewer+0.766
leneck+0.766
 filler+0.747
Neg logit contributions
Alert-1.552
ulate-1.538
ents-1.504
aic-1.490
imum-1.475
Token allowed
Token ID3142
Feature activation+5.47e-01

Pos logit contributions
Alert+2.047
ulate+2.009
ents+1.950
aic+1.921
imum+1.879
Neg logit contributions
cloth-2.216
omas-2.201
leneck-2.150
 interviewer-2.125
 filler-2.076
Token to
Token ID284
Feature activation-1.27e-01

Pos logit contributions
ulate+3.416
Alert+3.385
ents+3.290
aic+3.260
imum+3.205
Neg logit contributions
cloth-3.003
omas-2.951
leneck-2.881
 interviewer-2.855
 filler-2.763
Token chat
Token ID8537
Feature activation+5.48e-01

Pos logit contributions
cloth+0.687
omas+0.675
 interviewer+0.656
leneck+0.649
 filler+0.640
Neg logit contributions
Alert-0.816
ulate-0.807
aic-0.784
ents-0.782
imum-0.777
Token idea
Token ID2126
Feature activation+4.37e-01

Pos logit contributions
cloth+1.574
omas+1.544
 interviewer+1.539
 filler+1.482
leneck+1.480
Neg logit contributions
Alert-2.193
ulate-2.151
aic-2.095
ents-2.092
imum-2.081
Token here
Token ID994
Feature activation+8.15e-01

Pos logit contributions
Alert+2.923
ulate+2.907
ents+2.820
aic+2.820
imum+2.795
Neg logit contributions
cloth-2.219
omas-2.179
 interviewer-2.111
leneck-2.099
 filler-2.056
Token is
Token ID318
Feature activation+5.59e-02

Pos logit contributions
ulate+5.105
Alert+5.095
ents+4.964
aic+4.937
imum+4.918
Neg logit contributions
cloth-4.296
omas-4.199
 interviewer-4.140
leneck-4.122
 filler-4.022
Token that
Token ID326
Feature activation+2.96e-02

Pos logit contributions
Alert+0.358
ulate+0.353
aic+0.343
ents+0.343
imum+0.340
Neg logit contributions
cloth-0.302
omas-0.296
 interviewer-0.289
leneck-0.284
 filler-0.283
Token any
Token ID597
Feature activation-5.25e-01

Pos logit contributions
Alert+0.186
ulate+0.184
aic+0.178
ents+0.178
imum+0.177
Neg logit contributions
cloth-0.164
omas-0.161
 interviewer-0.157
leneck-0.155
 filler-0.153
Token distribution
Token ID6082
Feature activation-3.88e-01

Pos logit contributions
cloth+2.267
omas+2.206
 interviewer+2.205
 filler+2.131
leneck+2.067
Neg logit contributions
Alert-3.891
ulate-3.801
ents-3.742
imum-3.718
aic-3.702
Token which
Token ID543
Feature activation+9.97e-02

Pos logit contributions
cloth+2.072
 interviewer+2.013
omas+2.000
 filler+1.969
leneck+1.916
Neg logit contributions
Alert-2.505
ulate-2.431
aic-2.379
imum-2.371
ents-2.349
Token provides
Token ID3769
Feature activation+1.00e-01

Pos logit contributions
Alert+0.639
ulate+0.638
ents+0.618
aic+0.616
imum+0.610
Neg logit contributions
cloth-0.535
omas-0.525
 interviewer-0.508
leneck-0.508
 filler-0.496
Token the
Token ID262
Feature activation+1.14e-01

Pos logit contributions
Alert+0.645
ulate+0.640
aic+0.620
ents+0.620
imum+0.613
Neg logit contributions
cloth-0.540
omas-0.531
 interviewer-0.514
leneck-0.512
 filler-0.501
Token Snap
Token ID16026
Feature activation-4.92e-01

Pos logit contributions
Alert+0.746
ulate+0.741
aic+0.718
ents+0.718
imum+0.711
Neg logit contributions
cloth-0.601
omas-0.591
 interviewer-0.570
leneck-0.570
 filler-0.556
Token framework
Token ID9355
Feature activation+2.75e-01

Pos logit contributions
cloth+2.195
omas+2.083
 interviewer+2.055
 filler+2.031
leneck+2.001
Neg logit contributions
Alert-3.542
ulate-3.509
ents-3.431
aic-3.423
imum-3.390
Token and
Token ID290
Feature activation-9.08e-02

Pos logit contributions
cloth+2.298
omas+2.261
 interviewer+2.183
 filler+2.176
leneck+2.124
Neg logit contributions
Alert-2.957
ulate-2.865
aic-2.810
ents-2.800
imum-2.795
Token the
Token ID262
Feature activation-1.50e-01

Pos logit contributions
cloth+0.491
omas+0.487
 interviewer+0.476
 filler+0.466
leneck+0.465
Neg logit contributions
Alert-0.579
ulate-0.572
aic-0.556
ents-0.555
imum-0.553
Token respect
Token ID2461
Feature activation+6.39e-01

Pos logit contributions
cloth+0.790
omas+0.786
 interviewer+0.767
leneck+0.748
 filler+0.745
Neg logit contributions
Alert-0.975
ulate-0.962
ents-0.934
aic-0.933
imum-0.927
Token of
Token ID286
Feature activation-1.25e-03

Pos logit contributions
ulate+3.767
Alert+3.760
ents+3.650
aic+3.640
imum+3.587
Neg logit contributions
cloth-3.714
omas-3.633
 interviewer-3.551
leneck-3.530
 filler-3.462
Token the
Token ID262
Feature activation-6.08e-02

Pos logit contributions
cloth+0.008
omas+0.008
 interviewer+0.007
leneck+0.007
 filler+0.007
Neg logit contributions
Alert-0.007
ulate-0.007
aic-0.007
ents-0.007
imum-0.007
Token more
Token ID517
Feature activation-6.17e-01

Pos logit contributions
cloth+0.326
omas+0.322
 interviewer+0.315
leneck+0.308
 filler+0.306
Neg logit contributions
Alert-0.392
ulate-0.386
aic-0.375
ents-0.374
imum-0.372
Token extreme
Token ID3257
Feature activation-2.83e-01

Pos logit contributions
cloth+3.533
omas+3.500
 interviewer+3.489
 filler+3.424
leneck+3.355
Neg logit contributions
Alert-3.602
ulate-3.526
imum-3.512
aic-3.417
PLIED-3.408
Token pun
Token ID4000
Feature activation+1.17e+00

Pos logit contributions
omas+1.540
cloth+1.538
 interviewer+1.510
 filler+1.479
leneck+1.450
Neg logit contributions
Alert-1.793
ulate-1.746
aic-1.706
imum-1.704
ents-1.685
Tokenks
Token ID591
Feature activation+2.04e-01

Pos logit contributions
ents+7.101
ulate+7.008
aic+6.997
Alert+6.779
dict+6.743
Neg logit contributions
cloth-5.999
 interviewer-5.909
omas-5.793
 filler-5.750
leneck-5.656
Token in
Token ID287
Feature activation-3.71e-01

Pos logit contributions
Alert+1.248
ulate+1.238
ents+1.197
aic+1.197
imum+1.185
Neg logit contributions
cloth-1.162
omas-1.143
 interviewer-1.111
leneck-1.104
 filler-1.085
Token return
Token ID1441
Feature activation+1.30e-01

Pos logit contributions
omas+2.098
cloth+2.094
 interviewer+2.058
 filler+2.011
leneck+1.982
Neg logit contributions
Alert-2.247
ulate-2.188
aic-2.164
ents-2.150
imum-2.137
Token://
Token ID1378
Feature activation+2.29e-02

Pos logit contributions
Alert+0.950
ulate+0.939
aic+0.908
ents+0.903
imum+0.894
Neg logit contributions
cloth-1.033
omas-1.027
 interviewer-1.011
leneck-1.008
 filler-0.993
Tokenforums
Token ID37141
Feature activation-2.86e-01

Pos logit contributions
Alert+0.134
ulate+0.132
ents+0.127
aic+0.127
imum+0.127
Neg logit contributions
cloth-0.137
omas-0.135
 interviewer-0.131
leneck-0.130
 filler-0.128
Token.
Token ID13
Feature activation-9.75e-02

Pos logit contributions
cloth+1.368
omas+1.338
leneck+1.275
 interviewer+1.266
auc+1.249
Neg logit contributions
Alert-2.005
ulate-1.963
aic-1.908
imum-1.907
PLIED-1.894
Tokens
Token ID82
Feature activation-1.11e+00

Pos logit contributions
cloth+0.750
omas+0.743
leneck+0.719
 interviewer+0.719
 filler+0.709
Neg logit contributions
Alert-0.405
ulate-0.395
ents-0.370
aic-0.370
imum-0.368
Tokengc
Token ID36484
Feature activation-6.39e-01

Pos logit contributions
omas+5.038
cloth+4.995
auc+4.645
leneck+4.498
bsite+4.465
Neg logit contributions
Alert-7.858
PLIED-7.404
ulate-7.402
imum-7.195
ents-7.121
Tokenafe
Token ID8635
Feature activation-7.81e-01

Pos logit contributions
omas+2.871
cloth+2.866
auc+2.659
bsite+2.649
 interviewer+2.631
Neg logit contributions
Alert-4.579
ulate-4.450
PLIED-4.403
aic-4.329
ents-4.284
Token.
Token ID13
Feature activation+4.55e-02

Pos logit contributions
cloth+3.492
omas+3.394
bsite+3.183
 interviewer+3.179
auc+3.160
Neg logit contributions
Alert-5.611
ulate-5.548
PLIED-5.401
aic-5.338
imum-5.290
Tokencom
Token ID785
Feature activation+4.87e-01

Pos logit contributions
Alert+0.327
ulate+0.325
ents+0.315
aic+0.314
imum+0.313
Neg logit contributions
cloth-0.211
omas-0.207
 interviewer-0.198
leneck-0.197
 filler-0.193
Token
Token ID198
Feature activation+3.37e-02

Pos logit contributions
Alert+2.851
ulate+2.818
ents+2.741
aic+2.729
imum+2.691
Neg logit contributions
cloth-2.874
omas-2.813
 interviewer-2.741
leneck-2.730
 filler-2.677
Token
Token ID198
Feature activation+2.98e-01

Pos logit contributions
Alert+0.272
ulate+0.268
ents+0.263
aic+0.261
imum+0.256
Neg logit contributions
cloth-0.125
omas-0.125
leneck-0.122
 interviewer-0.121
 filler-0.118
TokenSource
Token ID7416
Feature activation-2.09e-01

Pos logit contributions
Alert+1.709
ulate+1.675
aic+1.620
ents+1.619
imum+1.595
Neg logit contributions
cloth-1.813
omas-1.790
 interviewer-1.746
leneck-1.738
 filler-1.707
Tokenxit
Token ID10198
Feature activation-2.11e+00

Pos logit contributions
cloth+5.371
omas+5.313
bsite+4.840
cho+4.820
auc+4.731
Neg logit contributions
Alert-8.908
PLIED-8.641
ulate-8.479
ents-8.421
imum-8.237
Token-
Token ID12
Feature activation-1.40e+00

Pos logit contributions
cloth+6.610
omas+6.356
auc+5.520
bsite+5.183
kamp+5.019
Neg logit contributions
Alert-16.187
ulate-15.766
aic-15.719
PLIED-15.617
isSpecial-15.440
Tokenis
Token ID271
Feature activation-1.41e+00

Pos logit contributions
cloth+4.718
omas+4.659
auc+4.184
bsite+3.944
leneck+3.881
Neg logit contributions
Alert-11.310
PLIED-11.260
ulate-10.892
aic-10.610
imum-10.604
Token-
Token ID12
Feature activation-1.55e+00

Pos logit contributions
cloth+5.307
omas+5.143
auc+4.670
leneck+4.577
 interviewer+4.528
Neg logit contributions
Alert-10.822
ulate-10.466
PLIED-10.417
aic-10.303
imum-10.156
Tokenthe
Token ID1169
Feature activation-1.04e+00

Pos logit contributions
cloth+6.620
omas+6.494
kamp+6.036
auc+6.022
sole+6.000
Neg logit contributions
Alert-11.257
PLIED-10.937
ulate-10.431
aic-10.190
 sidx-10.148
Token-
Token ID12
Feature activation-1.52e+00

Pos logit contributions
cloth+4.239
omas+4.202
 interviewer+4.040
leneck+3.883
 filler+3.867
Neg logit contributions
Alert-7.854
ulate-7.672
aic-7.529
ents-7.471
imum-7.451
Tokenprime
Token ID35505
Feature activation+5.16e-02

Pos logit contributions
cloth+9.770
omas+9.608
kamp+9.326
sole+9.299
auc+9.240
Neg logit contributions
Alert-7.683
PLIED-7.128
ulate-7.008
aic-6.665
 sidx-6.652
Token-
Token ID12
Feature activation-8.78e-01

Pos logit contributions
Alert+0.329
ulate+0.328
ents+0.315
aic+0.313
imum+0.310
Neg logit contributions
cloth-0.279
omas-0.273
 interviewer-0.264
 filler-0.263
leneck-0.263
Tokenchall
Token ID36747
Feature activation-7.47e-01

Pos logit contributions
cloth+2.021
omas+1.902
leneck+1.773
auc+1.646
 interviewer+1.640
Neg logit contributions
Alert-8.240
PLIED-8.059
ulate-8.040
ents-7.925
aic-7.912
Tokenenge
Token ID3540
Feature activation-6.48e-01

Pos logit contributions
cloth+2.116
omas+2.075
leneck+2.015
 interviewer+1.959
bsite+1.815
Neg logit contributions
Alert-6.541
ulate-6.536
PLIED-6.431
aic-6.358
ents-6.305
Token-
Token ID12
Feature activation-7.27e-01

Pos logit contributions
cloth+2.874
omas+2.829
 interviewer+2.696
 filler+2.640
bsite+2.635
Neg logit contributions
ulate-4.631
Alert-4.631
aic-4.541
imum-4.520
ents-4.425
Token)
Token ID8
Feature activation-3.75e-01

Pos logit contributions
Alert+1.770
ulate+1.750
aic+1.679
ents+1.678
imum+1.661
Neg logit contributions
cloth-2.423
omas-2.389
 interviewer-2.335
leneck-2.321
 filler-2.290
Token that
Token ID326
Feature activation-3.32e-01

Pos logit contributions
cloth+1.833
omas+1.777
 interviewer+1.734
leneck+1.712
 filler+1.689
Neg logit contributions
Alert-2.589
ulate-2.571
aic-2.504
imum-2.495
ents-2.484
Token tore
Token ID26197
Feature activation+4.35e-01

Pos logit contributions
cloth+1.775
omas+1.761
 interviewer+1.712
leneck+1.685
 filler+1.675
Neg logit contributions
Alert-2.116
ulate-2.092
aic-2.052
imum-2.040
ents-2.008
Token the
Token ID262
Feature activation+2.34e-02

Pos logit contributions
ulate+2.783
Alert+2.756
ents+2.694
aic+2.666
imum+2.632
Neg logit contributions
cloth-2.305
omas-2.294
leneck-2.203
 interviewer-2.196
 filler-2.166
Token country
Token ID1499
Feature activation-7.91e-01

Pos logit contributions
Alert+0.149
ulate+0.146
aic+0.142
ents+0.141
imum+0.141
Neg logit contributions
cloth-0.130
omas-0.127
 interviewer-0.124
leneck-0.122
 filler-0.120
Token apart
Token ID5475
Feature activation-1.82e+00

Pos logit contributions
cloth+2.695
omas+2.381
 interviewer+2.371
 filler+2.312
leneck+2.271
Neg logit contributions
Alert-6.636
ulate-6.493
PLIED-6.438
imum-6.437
aic-6.413
Token.
Token ID13
Feature activation-3.52e-01

Pos logit contributions
cloth+7.455
omas+6.780
leneck+6.547
 interviewer+6.526
kamp+6.361
Neg logit contributions
Alert-12.315
ulate-12.112
PLIED-12.033
imum-12.000
aic-11.782
Token Likewise
Token ID22660
Feature activation-3.29e-01

Pos logit contributions
cloth+1.800
omas+1.761
 interviewer+1.705
leneck+1.678
 filler+1.642
Neg logit contributions
Alert-2.342
ulate-2.340
aic-2.266
imum-2.253
ents-2.241
Token,
Token ID11
Feature activation-4.16e-01

Pos logit contributions
cloth+2.155
omas+2.145
 interviewer+2.092
leneck+2.065
 filler+2.046
Neg logit contributions
Alert-1.731
ulate-1.651
aic-1.629
imum-1.625
PLIED-1.593
Token the
Token ID262
Feature activation-4.27e-01

Pos logit contributions
cloth+2.257
omas+2.241
 interviewer+2.194
leneck+2.146
 filler+2.123
Neg logit contributions
Alert-2.646
ulate-2.563
imum-2.514
aic-2.504
ents-2.479
Token neoliberal
Token ID31972
Feature activation-2.87e-01

Pos logit contributions
omas+2.332
cloth+2.324
 interviewer+2.312
leneck+2.232
 filler+2.216
Neg logit contributions
Alert-2.707
ulate-2.603
imum-2.542
aic-2.529
ents-2.527
Token longer
Token ID2392
Feature activation+6.68e-01

Pos logit contributions
cloth+2.357
omas+2.328
 interviewer+2.326
leneck+2.287
 filler+2.280
Neg logit contributions
Alert-3.334
ulate-3.302
aic-3.243
ents-3.237
imum-3.188
Token there
Token ID612
Feature activation-4.03e-01

Pos logit contributions
Alert+4.306
ulate+4.283
ents+4.134
aic+4.102
imum+4.052
Neg logit contributions
cloth-3.505
omas-3.500
leneck-3.346
 interviewer-3.288
bsite-3.210
Token because
Token ID780
Feature activation-4.84e-01

Pos logit contributions
cloth+2.051
omas+2.002
 interviewer+1.966
 filler+1.930
leneck+1.916
Neg logit contributions
Alert-2.651
ulate-2.624
aic-2.583
ents-2.543
imum-2.530
Token it
Token ID340
Feature activation-6.31e-01

Pos logit contributions
cloth+2.458
omas+2.416
 interviewer+2.406
 filler+2.325
bsite+2.282
Neg logit contributions
Alert-3.211
ulate-3.144
aic-3.095
PLIED-3.092
ents-3.051
Token was
Token ID373
Feature activation-4.53e-01

Pos logit contributions
cloth+2.703
omas+2.658
leneck+2.570
 interviewer+2.531
 filler+2.489
Neg logit contributions
Alert-4.660
ulate-4.640
aic-4.516
PLIED-4.451
ents-4.435
Token ready
Token ID3492
Feature activation-1.45e+00

Pos logit contributions
cloth+2.340
omas+2.305
 interviewer+2.263
 filler+2.233
leneck+2.194
Neg logit contributions
Alert-2.992
ulate-2.925
aic-2.873
ents-2.839
imum-2.817
Token to
Token ID284
Feature activation-9.31e-01

Pos logit contributions
cloth+6.533
 interviewer+6.232
 filler+6.190
omas+6.145
leneck+5.852
Neg logit contributions
ulate-9.597
Alert-9.551
aic-9.263
PLIED-9.070
imum-9.036
Token loan
Token ID8063
Feature activation-1.99e-01

Pos logit contributions
 interviewer+4.443
omas+4.441
cloth+4.418
 filler+4.374
leneck+4.265
Neg logit contributions
Alert-6.209
aic-5.982
ulate-5.936
imum-5.928
ents-5.882
Token him
Token ID683
Feature activation-2.91e-01

Pos logit contributions
cloth+0.976
omas+0.957
 interviewer+0.921
leneck+0.911
 filler+0.901
Neg logit contributions
Alert-1.385
ulate-1.364
aic-1.332
ents-1.330
imum-1.315
Token to
Token ID284
Feature activation-2.67e-01

Pos logit contributions
cloth+1.496
omas+1.471
 interviewer+1.425
 filler+1.399
leneck+1.399
Neg logit contributions
Alert-1.951
ulate-1.899
aic-1.877
ents-1.852
imum-1.839
Token Bel
Token ID3944
Feature activation-1.21e+00

Pos logit contributions
cloth+1.489
omas+1.474
 interviewer+1.433
 filler+1.411
leneck+1.405
Neg logit contributions
Alert-1.662
ulate-1.623
aic-1.582
PLIED-1.574
ents-1.572
Token/
Token ID14
Feature activation+4.37e-02

Pos logit contributions
ulate+3.695
Alert+3.639
ents+3.518
imum+3.456
aic+3.449
Neg logit contributions
cloth-3.988
omas-3.925
 interviewer-3.809
leneck-3.767
bsite-3.685
Tokenret
Token ID1186
Feature activation+4.81e-01

Pos logit contributions
Alert+0.250
ulate+0.246
aic+0.239
ents+0.237
imum+0.236
Neg logit contributions
cloth-0.266
omas-0.262
 interviewer-0.254
leneck-0.253
 filler-0.250
Token,
Token ID837
Feature activation+1.46e-03

Pos logit contributions
Alert+2.639
ulate+2.622
ents+2.530
aic+2.514
imum+2.498
Neg logit contributions
cloth-3.021
omas-2.977
 interviewer-2.899
leneck-2.875
 filler-2.815
Token which
Token ID543
Feature activation+2.60e-01

Pos logit contributions
Alert+0.010
ulate+0.010
aic+0.009
ents+0.009
imum+0.009
Neg logit contributions
cloth-0.008
omas-0.007
 interviewer-0.007
leneck-0.007
 filler-0.007
Token we
Token ID356
Feature activation-5.25e-01

Pos logit contributions
Alert+1.653
ulate+1.649
ents+1.592
aic+1.586
imum+1.572
Neg logit contributions
cloth-1.401
omas-1.379
leneck-1.335
 interviewer-1.332
 filler-1.289
Token'll
Token ID1183
Feature activation-1.46e+00

Pos logit contributions
cloth+2.877
omas+2.825
 interviewer+2.780
 filler+2.736
leneck+2.699
Neg logit contributions
Alert-3.301
ulate-3.207
imum-3.145
aic-3.129
ents-3.113
Token call
Token ID869
Feature activation+4.41e-01

Pos logit contributions
omas+6.835
 interviewer+6.600
cloth+6.581
 filler+6.526
 disconnect+6.484
Neg logit contributions
Alert-9.818
PLIED-9.388
imum-9.203
aic-9.149
ulate-9.113
Token "
Token ID366
Feature activation-1.72e-01

Pos logit contributions
ulate+2.696
Alert+2.662
ents+2.581
aic+2.552
imum+2.518
Neg logit contributions
cloth-2.493
omas-2.469
leneck-2.393
 interviewer-2.365
bsite-2.311
Tokenpp
Token ID381
Feature activation+4.10e-01

Pos logit contributions
cloth+0.939
omas+0.913
 interviewer+0.888
leneck+0.884
 filler+0.876
Neg logit contributions
Alert-1.083
ulate-1.068
aic-1.046
imum-1.036
ents-1.034
Tokenpr
Token ID1050
Feature activation+6.82e-01

Pos logit contributions
Alert+2.683
ulate+2.672
ents+2.585
aic+2.579
imum+2.553
Neg logit contributions
cloth-2.140
omas-2.098
 interviewer-2.051
leneck-2.020
 filler-1.987
Token"
Token ID1
Feature activation+3.70e-01

Pos logit contributions
ulate+4.305
Alert+4.277
ents+4.173
imum+4.127
aic+4.102
Neg logit contributions
cloth-3.653
omas-3.576
 interviewer-3.481
leneck-3.453
 filler-3.396
Token(
Token ID7
Feature activation-6.64e-01

Pos logit contributions
Alert+2.507
ulate+2.339
aic+2.268
ents+2.239
dict+2.148
Neg logit contributions
omas-4.431
leneck-4.418
 interviewer-4.399
 challeng-4.394
 filler-4.381
Token536
Token ID44468
Feature activation+9.53e-01

Pos logit contributions
cloth+2.714
omas+2.578
 interviewer+2.448
leneck+2.401
auc+2.362
Neg logit contributions
Alert-5.027
ulate-5.022
aic-4.880
ents-4.846
imum-4.844
Tokens
Token ID82
Feature activation+6.10e-01

Pos logit contributions
ents+2.163
Alert+2.098
ulate+2.042
aic+1.880
dict+1.820
Neg logit contributions
leneck-8.706
cloth-8.669
 interviewer-8.631
 filler-8.549
omas-8.506
Token)}
Token ID38165
Feature activation+5.35e-01

Pos logit contributions
Alert+0.908
ulate+0.796
ents+0.745
aic+0.683
PLIED+0.629
Neg logit contributions
leneck-6.174
omas-6.167
cloth-6.121
 interviewer-6.099
 filler-6.022
Token N
Token ID399
Feature activation+2.34e-01

Pos logit contributions
ulate+2.715
Alert+2.713
ents+2.591
aic+2.553
imum+2.496
Neg logit contributions
cloth-3.511
omas-3.503
leneck-3.450
 interviewer-3.365
 filler-3.301
Tokenxd
Token ID24954
Feature activation-1.62e+00

Pos logit contributions
Alert+1.498
ulate+1.489
ents+1.446
aic+1.438
imum+1.417
Neg logit contributions
cloth-1.259
omas-1.240
 interviewer-1.223
leneck-1.208
 filler-1.188
Token5
Token ID20
Feature activation+9.38e-01

Pos logit contributions
omas+2.866
cloth+2.730
auc+2.229
 interviewer+2.170
kamp+2.107
Neg logit contributions
Alert-14.753
ulate-14.720
imum-14.206
aic-14.142
PLIED-14.083
Token {
Token ID1391
Feature activation+3.28e-01

Pos logit contributions
Alert+1.877
ulate+1.705
aic+1.478
ents+1.448
imum+1.301
Neg logit contributions
leneck-8.968
cloth-8.869
omas-8.851
bsite-8.675
 filler-8.663
Token(
Token ID7
Feature activation-9.46e-01

Pos logit contributions
Alert+1.497
ulate+1.407
aic+1.363
ents+1.356
imum+1.296
Neg logit contributions
omas-2.400
leneck-2.386
 interviewer-2.376
cloth-2.369
 filler-2.357
Token106
Token ID15801
Feature activation-2.52e-02

Pos logit contributions
cloth+4.013
omas+3.797
 interviewer+3.589
leneck+3.513
auc+3.493
Neg logit contributions
ulate-6.829
Alert-6.814
aic-6.625
imum-6.564
ents-6.556
Tokens
Token ID82
Feature activation+1.83e-01

Pos logit contributions
cloth+0.217
omas+0.214
 interviewer+0.212
leneck+0.211
 filler+0.209
Neg logit contributions
Alert-0.080
ulate-0.079
ents-0.076
aic-0.074
imum-0.072