TOP ACTIVATIONS
MAX = 5.669

15 , 2010 Subject : Que er magnet ?
6 , 2010 Subject : R iff Tra x
13 , 2010 Subject : boys beware This film
5 , 2009 Subject : Que ers are Here
4 , 2010 Subject : User reviews are a
9 , 2010 Subject : R ipe for M
26 , 2009 Subject : THE IR ONY OF
with the words " F --- YOU " written across his
es - Jones tried to dial down the risk , and
cohort of patients ( n = 91 ) only received trans
http :// www . A chieve ment . org / aut
www . bel fast te legraph . co . uk /
terms of micro - and macro - level retrieval of detail
Military sources later told the BBC that the overnight air strikes
the prime challenge Belfast Te legraph . co . uk Dis
lot of something ( p oth oles in New Jersey ,
1 Res a ech says , ' As you
P ause . " O uch , what loss ," he
bb 9 d 970 b L isted below are links to
Also , anyone who would like to learn all of the

MIN ACTIVATIONS
MIN = -4.704

.) War ner Bros / Courtesy Everett Collection Amy
very sad 1 %. We wonder if those in charge of
Act on ) In an unpublished article written around May of
The built - in cl ojure . test was by far
people , protecting affordable health care and Medicare , and demanding
notice of proposed rule making for civil -
of the Bernie Bros . Apparently ,
this query to the cl ojure . files table SELECT *
uck ley NBA <|endoftext|> Ted Nug ent promised Thursday that he
and how . Take a moment to digest his thoughts :
, fight for affordable health care and support tax cuts for
HI , born at Gu adal upe Valley Hospital on July
Gunn 's GU ARD I ANS OF THE G AL AX
then gathered herself for a moment . I don
when I fed the Dev ial ets bass - heavy music
Donald Trump . WATCH : T ucker
evidence . We rate this Pants on Fire .
PA agreed to a Collective Barg aining Agreement ( C BA
B V ), an over growth of an aer obic pathogens
George - 8 Track Porn Pt 4 - Ott os Old

INTERVAL 3.076 - 5.669
CONTAINS 0.074%

things like m eth cookies appear as
M ATE 16 . 04 + at all , I
the season . The score less first period was quite an
to the report from Xin hua . The code
extra level of safety ! Donald J . Trump (@

INTERVAL 0.483 - 3.076
CONTAINS 25.567%

the only human left in Springfield . Join the
schools assess student and teacher performance . The investment was made
from the Ben ih ana Burn ! According to
level , I thought you fit in nicely with all these
and posting pictures on popular social networking websites Facebook and Twitter

INTERVAL -2.111 - 0.483
CONTAINS 73.571%

you re also waiting for doom . I
) week . I hope they make the walk . I
170 percent Increase of adherents during a Buddhist bo
fair for users , developers and stakeholders . Exper
that certain parts of the Eve Radio website will look a

INTERVAL -4.704 - -2.111
CONTAINS 0.788%

how do they place that risk . Our entire goal is
begin to see those things then it may be an indication
findings at the 2012 Western Pacific Ge oph ysics Meeting in
Whereas normal kidney cells ( left ) are arranged next to
day deadlines and instead require that complaints be processed
Token 15
Token ID1315
Feature activation+9.52e-01

Pos logit contributions
uri+0.326
itime+0.326
ocious+0.325
iegel+0.324
 stranded+0.322
Neg logit contributions
catentry-0.503
 Slater-0.495
imov-0.492
 Accountability-0.488
isite-0.488
Token,
Token ID11
Feature activation-1.09e-01

Pos logit contributions
uri+0.518
iegel+0.503
ocious+0.498
itime+0.496
 stranded+0.496
Neg logit contributions
catentry-0.756
 Slater-0.744
 relapse-0.738
isite-0.738
imov-0.737
Token 2010
Token ID3050
Feature activation+9.36e-03

Pos logit contributions
 Slater+0.069
 Accountability+0.068
catentry+0.068
imov+0.067
isite+0.067
Neg logit contributions
ocious-0.077
itime-0.076
 stranded-0.075
iegel-0.075
uri-0.074
Token
Token ID198
Feature activation+3.63e-01

Pos logit contributions
uri+0.006
ocious+0.006
iegel+0.006
itime+0.006
 stranded+0.006
Neg logit contributions
catentry-0.006
 Slater-0.006
imov-0.006
 Accountability-0.006
 relapse-0.006
Token
Token ID198
Feature activation+1.66e-01

Pos logit contributions
uri+0.233
iegel+0.231
ocious+0.230
 stranded+0.229
itime+0.228
Neg logit contributions
catentry-0.253
 Slater-0.248
imov-0.246
 Accountability-0.244
isite-0.244
TokenSubject
Token ID19776
Feature activation+5.67e+00

Pos logit contributions
ocious+0.073
itime+0.073
iegel+0.072
uri+0.072
 stranded+0.072
Neg logit contributions
 Slater-0.149
catentry-0.148
 Accountability-0.147
imov-0.146
 relapse-0.145
Token:
Token ID25
Feature activation+1.47e-01

Pos logit contributions
uri+1.863
iegel+1.816
ocious+1.782
enged+1.712
wrong+1.690
Neg logit contributions
catentry-5.680
 Slater-5.565
imov-5.503
 Accountability-5.481
 relapse-5.475
Token Que
Token ID4670
Feature activation-3.49e-01

Pos logit contributions
ocious+0.095
itime+0.095
iegel+0.094
 stranded+0.094
uri+0.094
Neg logit contributions
 Slater-0.102
catentry-0.101
 Accountability-0.100
imov-0.099
 relapse-0.098
Tokener
Token ID263
Feature activation+1.52e+00

Pos logit contributions
 Slater+0.242
catentry+0.241
 Accountability+0.238
imov+0.237
 relapse+0.234
Neg logit contributions
ocious-0.225
itime-0.225
iegel-0.223
 stranded-0.222
uri-0.221
Token magnet
Token ID19972
Feature activation+6.39e-01

Pos logit contributions
ocious+0.963
itime+0.957
 stranded+0.953
iegel+0.951
uri+0.944
Neg logit contributions
catentry-1.070
 Slater-1.064
imov-1.045
 Accountability-1.041
pron-1.031
Token?
Token ID30
Feature activation+3.81e-01

Pos logit contributions
ocious+0.406
itime+0.405
iegel+0.402
 stranded+0.400
uri+0.398
Neg logit contributions
 Slater-0.449
catentry-0.448
 Accountability-0.441
imov-0.439
 relapse-0.434
Token 6
Token ID718
Feature activation+7.47e-01

Pos logit contributions
uri+0.646
itime+0.642
iegel+0.639
ocious+0.638
 stranded+0.636
Neg logit contributions
catentry-1.032
imov-1.008
 Slater-1.007
isite-0.999
 Accountability-0.997
Token,
Token ID11
Feature activation-4.50e-01

Pos logit contributions
uri+0.409
iegel+0.397
itime+0.393
ocious+0.393
 stranded+0.391
Neg logit contributions
catentry-0.592
 Slater-0.583
imov-0.578
 relapse-0.576
isite-0.575
Token 2010
Token ID3050
Feature activation+1.00e-01

Pos logit contributions
 Slater+0.287
catentry+0.284
 Accountability+0.284
imov+0.279
isite+0.278
Neg logit contributions
ocious-0.317
itime-0.315
iegel-0.311
 stranded-0.309
uri-0.309
Token
Token ID198
Feature activation+3.90e-01

Pos logit contributions
uri+0.066
ocious+0.065
iegel+0.065
 stranded+0.065
itime+0.065
Neg logit contributions
catentry-0.068
 Slater-0.067
imov-0.066
 Accountability-0.066
 relapse-0.066
Token
Token ID198
Feature activation+2.27e-01

Pos logit contributions
uri+0.250
iegel+0.248
ocious+0.247
 stranded+0.246
itime+0.245
Neg logit contributions
catentry-0.271
 Slater-0.266
imov-0.263
 Accountability-0.262
isite-0.261
TokenSubject
Token ID19776
Feature activation+5.65e+00

Pos logit contributions
ocious+0.106
itime+0.105
iegel+0.104
 stranded+0.103
uri+0.102
Neg logit contributions
 Slater-0.199
catentry-0.198
 Accountability-0.196
imov-0.195
pron-0.193
Token:
Token ID25
Feature activation+2.68e-01

Pos logit contributions
uri+1.884
iegel+1.848
ocious+1.814
enged+1.727
itime+1.716
Neg logit contributions
catentry-5.617
 Slater-5.522
imov-5.468
 Accountability-5.435
 relapse-5.430
Token R
Token ID371
Feature activation+9.20e-01

Pos logit contributions
ocious+0.171
itime+0.171
iegel+0.169
 stranded+0.169
uri+0.168
Neg logit contributions
 Slater-0.187
catentry-0.186
 Accountability-0.184
imov-0.183
 relapse-0.181
Tokeniff
Token ID733
Feature activation+1.87e+00

Pos logit contributions
ocious+0.541
itime+0.540
iegel+0.536
 stranded+0.531
uri+0.531
Neg logit contributions
 Slater-0.691
catentry-0.691
 Accountability-0.680
imov-0.675
 relapse-0.669
TokenTra
Token ID15721
Feature activation+1.00e+00

Pos logit contributions
ocious+1.116
itime+1.114
iegel+1.106
 stranded+1.100
uri+1.095
Neg logit contributions
 Slater-1.383
catentry-1.380
 Accountability-1.357
imov-1.353
 relapse-1.335
Tokenx
Token ID87
Feature activation+3.41e-01

Pos logit contributions
ocious+0.720
itime+0.720
uri+0.719
iegel+0.713
 stranded+0.707
Neg logit contributions
catentry-0.629
 Slater-0.616
 Accountability-0.611
imov-0.608
 relapse-0.596
Token 13
Token ID1511
Feature activation+3.98e-01

Pos logit contributions
uri+0.755
itime+0.749
iegel+0.745
ocious+0.743
 stranded+0.743
Neg logit contributions
catentry-1.154
 Slater-1.130
imov-1.127
isite-1.117
 Accountability-1.112
Token,
Token ID11
Feature activation-4.12e-01

Pos logit contributions
uri+0.226
iegel+0.222
 stranded+0.222
itime+0.222
ocious+0.222
Neg logit contributions
catentry-0.305
 Slater-0.303
imov-0.300
 relapse-0.298
 Accountability-0.297
Token 2010
Token ID3050
Feature activation+1.47e-02

Pos logit contributions
 Slater+0.279
catentry+0.278
 Accountability+0.273
imov+0.272
 relapse+0.269
Neg logit contributions
ocious-0.272
itime-0.272
iegel-0.270
 stranded-0.269
uri-0.268
Token
Token ID198
Feature activation+9.68e-01

Pos logit contributions
uri+0.010
ocious+0.009
iegel+0.009
 stranded+0.009
itime+0.009
Neg logit contributions
catentry-0.010
 Slater-0.010
imov-0.010
isite-0.010
 relapse-0.010
Token
Token ID198
Feature activation+6.46e-01

Pos logit contributions
uri+0.615
iegel+0.607
ocious+0.605
 stranded+0.603
itime+0.599
Neg logit contributions
catentry-0.678
 Slater-0.666
imov-0.660
isite-0.655
 Accountability-0.654
TokenSubject
Token ID19776
Feature activation+5.63e+00

Pos logit contributions
itime+0.335
ocious+0.333
iegel+0.329
 stranded+0.328
abwe+0.316
Neg logit contributions
catentry-0.535
 Slater-0.528
 Accountability-0.524
imov-0.521
pron-0.519
Token:
Token ID25
Feature activation+1.24e-01

Pos logit contributions
uri+1.881
iegel+1.858
ocious+1.829
 stranded+1.748
itime+1.729
Neg logit contributions
catentry-5.545
 Slater-5.478
imov-5.442
 Accountability-5.382
 relapse-5.365
Token boys
Token ID6510
Feature activation-1.18e+00

Pos logit contributions
ocious+0.080
itime+0.080
iegel+0.079
 stranded+0.079
uri+0.078
Neg logit contributions
 Slater-0.086
catentry-0.086
 Accountability-0.085
imov-0.084
 relapse-0.083
Token beware
Token ID40600
Feature activation-2.78e-01

Pos logit contributions
 Slater+0.862
catentry+0.857
 Accountability+0.848
imov+0.844
 relapse+0.838
Neg logit contributions
itime-0.715
ocious-0.714
iegel-0.705
 stranded-0.701
uri-0.700
Token This
Token ID770
Feature activation+1.77e+00

Pos logit contributions
 Slater+0.195
catentry+0.194
 Accountability+0.191
imov+0.190
 relapse+0.188
Neg logit contributions
ocious-0.177
itime-0.177
iegel-0.175
 stranded-0.174
uri-0.173
Token film
Token ID2646
Feature activation+9.97e-01

Pos logit contributions
ocious+1.104
iegel+1.101
 stranded+1.100
itime+1.099
uri+1.089
Neg logit contributions
 Slater-1.261
catentry-1.260
imov-1.237
 Accountability-1.237
pron-1.216
Token 5
Token ID642
Feature activation+1.16e+00

Pos logit contributions
uri+0.694
iegel+0.688
itime+0.688
 stranded+0.683
ocious+0.683
Neg logit contributions
catentry-1.102
imov-1.075
 Slater-1.075
isite-1.065
 Accountability-1.060
Token,
Token ID11
Feature activation-1.71e-02

Pos logit contributions
uri+0.623
iegel+0.608
itime+0.597
ocious+0.596
 stranded+0.595
Neg logit contributions
catentry-0.926
 Slater-0.910
isite-0.900
imov-0.900
 relapse-0.898
Token 2009
Token ID3717
Feature activation+9.15e-01

Pos logit contributions
 Slater+0.011
 Accountability+0.011
catentry+0.011
isite+0.011
imov+0.011
Neg logit contributions
ocious-0.012
itime-0.012
 stranded-0.012
iegel-0.012
uri-0.012
Token
Token ID198
Feature activation+1.53e-01

Pos logit contributions
uri+0.586
iegel+0.581
 stranded+0.579
ocious+0.578
itime+0.574
Neg logit contributions
catentry-0.636
 Slater-0.622
imov-0.617
isite-0.613
 relapse-0.610
Token
Token ID198
Feature activation+5.47e-02

Pos logit contributions
uri+0.099
iegel+0.098
ocious+0.098
 stranded+0.097
itime+0.097
Neg logit contributions
catentry-0.106
 Slater-0.104
imov-0.103
 Accountability-0.102
isite-0.102
TokenSubject
Token ID19776
Feature activation+5.61e+00

Pos logit contributions
ocious+0.023
itime+0.023
uri+0.023
iegel+0.023
 stranded+0.023
Neg logit contributions
 Slater-0.050
catentry-0.049
imov-0.049
 Accountability-0.049
 relapse-0.048
Token:
Token ID25
Feature activation+3.79e-01

Pos logit contributions
uri+1.861
iegel+1.820
ocious+1.777
enged+1.719
 stranded+1.701
Neg logit contributions
catentry-5.614
 Slater-5.498
imov-5.427
 Accountability-5.423
isite-5.403
Token Que
Token ID4670
Feature activation-8.35e-02

Pos logit contributions
ocious+0.246
itime+0.245
iegel+0.243
 stranded+0.243
uri+0.241
Neg logit contributions
 Slater-0.261
catentry-0.260
 Accountability-0.256
imov-0.256
 relapse-0.252
Tokeners
Token ID364
Feature activation+1.16e+00

Pos logit contributions
 Slater+0.057
catentry+0.056
 Accountability+0.056
imov+0.055
 relapse+0.055
Neg logit contributions
ocious-0.055
itime-0.055
iegel-0.055
 stranded-0.054
uri-0.054
Token are
Token ID389
Feature activation+4.31e-01

Pos logit contributions
ocious+0.707
itime+0.704
 stranded+0.703
iegel+0.700
uri+0.694
Neg logit contributions
catentry-0.850
 Slater-0.847
 Accountability-0.829
imov-0.829
pron-0.821
Token Here
Token ID3423
Feature activation+8.97e-01

Pos logit contributions
ocious+0.258
itime+0.256
 stranded+0.256
iegel+0.255
uri+0.253
Neg logit contributions
catentry-0.318
 Slater-0.317
 Accountability-0.312
imov-0.312
isite-0.308
Token 4
Token ID604
Feature activation+8.69e-01

Pos logit contributions
itime+0.394
uri+0.394
ocious+0.393
iegel+0.392
 stranded+0.390
Neg logit contributions
catentry-0.588
 Slater-0.577
imov-0.575
 Accountability-0.570
isite-0.569
Token,
Token ID11
Feature activation+3.25e-01

Pos logit contributions
uri+0.472
iegel+0.462
ocious+0.456
itime+0.455
 stranded+0.453
Neg logit contributions
catentry-0.690
 Slater-0.677
imov-0.671
 relapse-0.670
isite-0.670
Token 2010
Token ID3050
Feature activation+4.52e-01

Pos logit contributions
itime+0.207
ocious+0.207
iegel+0.207
 stranded+0.206
uri+0.205
Neg logit contributions
catentry-0.226
 Slater-0.225
imov-0.221
 Accountability-0.220
 relapse-0.218
Token
Token ID198
Feature activation+2.12e-01

Pos logit contributions
uri+0.293
iegel+0.291
ocious+0.290
 stranded+0.289
itime+0.289
Neg logit contributions
catentry-0.310
 Slater-0.305
imov-0.302
isite-0.299
 Accountability-0.299
Token
Token ID198
Feature activation+8.98e-02

Pos logit contributions
uri+0.137
iegel+0.136
ocious+0.135
 stranded+0.135
itime+0.134
Neg logit contributions
catentry-0.147
 Slater-0.144
imov-0.143
 Accountability-0.142
isite-0.142
TokenSubject
Token ID19776
Feature activation+5.59e+00

Pos logit contributions
ocious+0.039
itime+0.039
iegel+0.039
uri+0.038
 stranded+0.038
Neg logit contributions
 Slater-0.081
catentry-0.080
imov-0.079
 Accountability-0.079
 relapse-0.079
Token:
Token ID25
Feature activation+2.46e-01

Pos logit contributions
uri+1.870
iegel+1.829
ocious+1.794
enged+1.721
 stranded+1.711
Neg logit contributions
catentry-5.578
 Slater-5.471
imov-5.403
 Accountability-5.390
isite-5.369
Token User
Token ID11787
Feature activation-3.94e-01

Pos logit contributions
ocious+0.159
itime+0.159
iegel+0.157
 stranded+0.156
uri+0.156
Neg logit contributions
 Slater-0.170
catentry-0.169
 Accountability-0.167
imov-0.166
 relapse-0.164
Token reviews
Token ID8088
Feature activation-1.23e+00

Pos logit contributions
 Slater+0.281
catentry+0.279
 Accountability+0.275
imov+0.274
 relapse+0.271
Neg logit contributions
ocious-0.247
itime-0.246
iegel-0.244
 stranded-0.244
uri-0.242
Token are
Token ID389
Feature activation-1.06e-01

Pos logit contributions
 Slater+0.882
catentry+0.869
 Accountability+0.867
imov+0.858
 relapse+0.857
Neg logit contributions
itime-0.765
ocious-0.762
iegel-0.750
uri-0.742
 stranded-0.742
Token a
Token ID257
Feature activation-1.32e-01

Pos logit contributions
 Slater+0.077
catentry+0.077
 Accountability+0.076
imov+0.076
 relapse+0.075
Neg logit contributions
ocious-0.065
itime-0.064
iegel-0.064
 stranded-0.064
uri-0.063
Token 9
Token ID860
Feature activation+1.23e+00

Pos logit contributions
uri+0.292
itime+0.291
ocious+0.291
iegel+0.290
 stranded+0.288
Neg logit contributions
catentry-0.441
 Slater-0.432
imov-0.431
isite-0.427
 Accountability-0.427
Token,
Token ID11
Feature activation-4.15e-01

Pos logit contributions
uri+0.662
iegel+0.644
ocious+0.634
 stranded+0.631
itime+0.630
Neg logit contributions
catentry-0.987
 Slater-0.966
isite-0.961
imov-0.961
 relapse-0.959
Token 2010
Token ID3050
Feature activation+9.46e-04

Pos logit contributions
 Slater+0.265
catentry+0.262
 Accountability+0.262
imov+0.259
isite+0.257
Neg logit contributions
ocious-0.292
itime-0.290
iegel-0.286
 stranded-0.285
uri-0.285
Token
Token ID198
Feature activation+2.31e-02

Pos logit contributions
uri+0.001
ocious+0.001
iegel+0.001
itime+0.001
 stranded+0.001
Neg logit contributions
catentry-0.001
 Slater-0.001
imov-0.001
 Accountability-0.001
 relapse-0.001
Token
Token ID198
Feature activation-5.58e-02

Pos logit contributions
uri+0.015
iegel+0.015
ocious+0.015
 stranded+0.015
itime+0.015
Neg logit contributions
catentry-0.016
 Slater-0.016
imov-0.015
 Accountability-0.015
isite-0.015
TokenSubject
Token ID19776
Feature activation+5.57e+00

Pos logit contributions
 Slater+0.050
catentry+0.049
 Accountability+0.049
imov+0.049
 relapse+0.048
Neg logit contributions
ocious-0.025
itime-0.025
iegel-0.025
 stranded-0.025
uri-0.024
Token:
Token ID25
Feature activation+2.29e-01

Pos logit contributions
uri+1.854
iegel+1.818
ocious+1.783
enged+1.699
wrong+1.688
Neg logit contributions
catentry-5.552
 Slater-5.445
imov-5.389
 Accountability-5.366
 relapse-5.355
Token R
Token ID371
Feature activation+9.39e-01

Pos logit contributions
ocious+0.147
itime+0.147
iegel+0.146
 stranded+0.145
uri+0.144
Neg logit contributions
 Slater-0.159
catentry-0.158
 Accountability-0.156
imov-0.155
 relapse-0.153
Tokenipe
Token ID3757
Feature activation+2.14e+00

Pos logit contributions
ocious+0.603
itime+0.603
iegel+0.598
 stranded+0.593
uri+0.593
Neg logit contributions
 Slater-0.654
catentry-0.653
 Accountability-0.642
imov-0.638
 relapse-0.632
Token for
Token ID329
Feature activation+1.60e+00

Pos logit contributions
ocious+1.388
 stranded+1.371
itime+1.367
iegel+1.366
uri+1.362
Neg logit contributions
catentry-1.477
 Slater-1.464
imov-1.453
 Accountability-1.429
isite-1.423
Token M
Token ID337
Feature activation-1.76e-01

Pos logit contributions
ocious+1.013
 stranded+1.004
iegel+1.004
itime+1.002
uri+0.991
Neg logit contributions
catentry-1.132
 Slater-1.119
imov-1.104
 Accountability-1.095
isite-1.089
Token 26
Token ID2608
Feature activation-1.15e+00

Pos logit contributions
uri+0.610
itime+0.605
iegel+0.605
ocious+0.604
 stranded+0.600
Neg logit contributions
catentry-0.946
 Slater-0.926
imov-0.922
isite-0.916
 Accountability-0.914
Token,
Token ID11
Feature activation-4.65e-01

Pos logit contributions
catentry+0.868
 Slater+0.863
imov+0.850
 Accountability+0.848
 relapse+0.844
Neg logit contributions
uri-0.666
ocious-0.664
itime-0.663
iegel-0.662
 stranded-0.656
Token 2009
Token ID3717
Feature activation+1.16e-02

Pos logit contributions
 Slater+0.297
catentry+0.294
 Accountability+0.293
imov+0.289
isite+0.288
Neg logit contributions
ocious-0.328
itime-0.324
iegel-0.321
 stranded-0.319
uri-0.317
Token
Token ID198
Feature activation-1.39e-01

Pos logit contributions
uri+0.008
ocious+0.008
iegel+0.008
 stranded+0.008
itime+0.008
Neg logit contributions
catentry-0.008
 Slater-0.008
imov-0.008
 Accountability-0.008
isite-0.008
Token
Token ID198
Feature activation-2.89e-01

Pos logit contributions
catentry+0.095
 Slater+0.094
imov+0.093
 Accountability+0.092
isite+0.092
Neg logit contributions
uri-0.090
iegel-0.090
ocious-0.090
 stranded-0.089
itime-0.089
TokenSubject
Token ID19776
Feature activation+5.26e+00

Pos logit contributions
 Slater+0.261
catentry+0.259
imov+0.257
 Accountability+0.256
 relapse+0.256
Neg logit contributions
ocious-0.125
itime-0.124
uri-0.123
iegel-0.123
 stranded-0.122
Token:
Token ID25
Feature activation+2.80e-01

Pos logit contributions
uri+1.784
iegel+1.743
ocious+1.707
enged+1.646
itime+1.639
Neg logit contributions
catentry-5.220
 Slater-5.127
imov-5.057
 Accountability-5.052
isite-5.038
Token THE
Token ID3336
Feature activation-8.20e-01

Pos logit contributions
ocious+0.182
itime+0.182
iegel+0.180
 stranded+0.180
uri+0.179
Neg logit contributions
 Slater-0.192
catentry-0.191
 Accountability-0.188
imov-0.188
 relapse-0.185
Token IR
Token ID14826
Feature activation-1.64e+00

Pos logit contributions
 Slater+0.524
catentry+0.521
 Accountability+0.512
imov+0.510
 relapse+0.504
Neg logit contributions
ocious-0.575
itime-0.574
iegel-0.569
 stranded-0.567
uri-0.564
TokenONY
Token ID40508
Feature activation-9.25e-01

Pos logit contributions
 Slater+1.056
catentry+1.043
 Accountability+1.034
imov+1.025
 relapse+1.015
Neg logit contributions
itime-1.150
ocious-1.144
iegel-1.136
 stranded-1.132
uri-1.124
Token OF
Token ID3963
Feature activation+2.46e-01

Pos logit contributions
 Slater+0.607
catentry+0.602
 Accountability+0.594
imov+0.591
 relapse+0.585
Neg logit contributions
ocious-0.632
itime-0.632
iegel-0.626
 stranded-0.623
uri-0.620
Token with
Token ID351
Feature activation-4.43e-02

Pos logit contributions
 stranded+0.193
itime+0.189
ocious+0.189
iegel+0.189
uri+0.187
Neg logit contributions
catentry-0.231
 Slater-0.228
 Accountability-0.226
imov-0.225
isite-0.224
Token the
Token ID262
Feature activation-2.56e-01

Pos logit contributions
 Slater+0.030
catentry+0.030
 Accountability+0.030
imov+0.030
 relapse+0.030
Neg logit contributions
ocious-0.029
itime-0.029
 stranded-0.028
iegel-0.028
uri-0.028
Token words
Token ID2456
Feature activation-1.38e+00

Pos logit contributions
 Slater+0.182
catentry+0.181
 Accountability+0.178
imov+0.178
 relapse+0.176
Neg logit contributions
ocious-0.161
itime-0.161
iegel-0.159
 stranded-0.159
uri-0.158
Token "
Token ID366
Feature activation+6.67e-01

Pos logit contributions
 Slater+0.796
catentry+0.787
 Accountability+0.777
imov+0.774
 relapse+0.762
Neg logit contributions
ocious-1.048
itime-1.048
iegel-1.038
 stranded-1.031
uri-1.029
TokenF
Token ID37
Feature activation+5.38e-01

Pos logit contributions
ocious+0.436
itime+0.435
iegel+0.431
 stranded+0.429
uri+0.429
Neg logit contributions
 Slater-0.456
catentry-0.453
 Accountability-0.448
imov-0.445
 relapse-0.442
Token---
Token ID6329
Feature activation+4.77e+00

Pos logit contributions
ocious+0.309
itime+0.309
iegel+0.305
 stranded+0.304
uri+0.302
Neg logit contributions
 Slater-0.412
catentry-0.409
 Accountability-0.404
imov-0.403
 relapse-0.399
Token YOU
Token ID7013
Feature activation+2.91e-01

Pos logit contributions
ocious+2.987
 stranded+2.978
itime+2.978
iegel+2.974
uri+2.966
Neg logit contributions
catentry-3.355
 Slater-3.253
imov-3.210
 Accountability-3.208
isite-3.196
Token"
Token ID1
Feature activation+4.56e-01

Pos logit contributions
ocious+0.177
itime+0.177
iegel+0.175
 stranded+0.174
uri+0.173
Neg logit contributions
 Slater-0.212
catentry-0.211
 Accountability-0.208
imov-0.207
 relapse-0.205
Token written
Token ID3194
Feature activation-5.78e-01

Pos logit contributions
 stranded+0.307
ocious+0.302
iegel+0.300
itime+0.299
uri+0.296
Neg logit contributions
catentry-0.311
 Slater-0.303
 Accountability-0.302
imov-0.298
isite-0.297
Token across
Token ID1973
Feature activation-5.84e-01

Pos logit contributions
 Slater+0.448
catentry+0.444
 Accountability+0.439
imov+0.438
 relapse+0.434
Neg logit contributions
ocious-0.326
itime-0.326
iegel-0.322
 stranded-0.320
uri-0.319
Token his
Token ID465
Feature activation-2.66e-01

Pos logit contributions
 Slater+0.371
catentry+0.369
 Accountability+0.363
imov+0.361
 relapse+0.357
Neg logit contributions
ocious-0.410
itime-0.410
iegel-0.407
 stranded-0.406
uri-0.403
Tokenes
Token ID274
Feature activation+5.14e-01

Pos logit contributions
iegel+0.118
ocious+0.118
itime+0.118
uri+0.117
 stranded+0.116
Neg logit contributions
catentry-0.155
 Slater-0.155
 Accountability-0.153
imov-0.151
 relapse-0.150
Token-
Token ID12
Feature activation-1.30e-01

Pos logit contributions
ocious+0.210
itime+0.209
iegel+0.209
 stranded+0.207
uri+0.206
Neg logit contributions
catentry-0.476
 Slater-0.475
 Accountability-0.469
imov-0.467
isite-0.465
TokenJones
Token ID25784
Feature activation+4.66e-01

Pos logit contributions
 Slater+0.057
catentry+0.057
 Accountability+0.056
imov+0.056
 relapse+0.055
Neg logit contributions
ocious-0.115
iegel-0.115
itime-0.115
uri-0.115
 stranded-0.114
Token tried
Token ID3088
Feature activation-3.00e-01

Pos logit contributions
 stranded+0.266
itime+0.264
ocious+0.264
iegel+0.263
uri+0.261
Neg logit contributions
catentry-0.360
 Slater-0.355
imov-0.350
 Accountability-0.349
isite-0.349
Token to
Token ID284
Feature activation+9.18e-01

Pos logit contributions
 Slater+0.227
catentry+0.226
 Accountability+0.223
imov+0.222
 relapse+0.220
Neg logit contributions
ocious-0.175
itime-0.175
iegel-0.173
 stranded-0.172
uri-0.171
Token dial
Token ID5980
Feature activation+4.71e+00

Pos logit contributions
 stranded+0.555
iegel+0.550
itime+0.550
uri+0.543
ocious+0.541
Neg logit contributions
 Slater-0.674
catentry-0.672
 Accountability-0.660
isite-0.659
imov-0.658
Token down
Token ID866
Feature activation+2.10e+00

Pos logit contributions
 stranded+2.593
ocious+2.590
itime+2.583
iegel+2.576
uri+2.530
Neg logit contributions
catentry-3.672
 Slater-3.613
imov-3.595
 Accountability-3.571
isite-3.571
Token the
Token ID262
Feature activation+5.71e-01

Pos logit contributions
ocious+1.408
itime+1.406
 stranded+1.404
iegel+1.399
uri+1.390
Neg logit contributions
catentry-1.389
 Slater-1.379
imov-1.361
 Accountability-1.356
isite-1.341
Token risk
Token ID2526
Feature activation+2.15e-01

Pos logit contributions
ocious+0.361
itime+0.360
 stranded+0.359
iegel+0.358
uri+0.355
Neg logit contributions
catentry-0.401
 Slater-0.400
imov-0.394
 Accountability-0.393
isite-0.388
Token,
Token ID11
Feature activation+7.96e-01

Pos logit contributions
itime+0.126
ocious+0.125
iegel+0.123
 stranded+0.123
uri+0.122
Neg logit contributions
 Slater-0.163
catentry-0.160
 Accountability-0.160
imov-0.159
 relapse-0.158
Token and
Token ID290
Feature activation+4.13e-01

Pos logit contributions
 stranded+0.483
iegel+0.481
itime+0.481
ocious+0.478
uri+0.473
Neg logit contributions
catentry-0.578
 Slater-0.576
imov-0.570
 Accountability-0.564
isite-0.562
Token cohort
Token ID20812
Feature activation+6.99e-01

Pos logit contributions
catentry+0.046
 Slater+0.045
 Accountability+0.045
imov+0.045
pron+0.044
Neg logit contributions
 stranded-0.047
iegel-0.047
ocious-0.047
itime-0.047
uri-0.046
Token of
Token ID286
Feature activation-5.56e-01

Pos logit contributions
ocious+0.425
 stranded+0.424
iegel+0.423
itime+0.422
uri+0.414
Neg logit contributions
catentry-0.514
 Slater-0.507
imov-0.501
 Accountability-0.499
pron-0.494
Token patients
Token ID3871
Feature activation-7.02e-01

Pos logit contributions
 Slater+0.356
catentry+0.356
 Accountability+0.349
imov+0.348
pron+0.342
Neg logit contributions
ocious-0.387
itime-0.386
iegel-0.383
 stranded-0.383
uri-0.380
Token (
Token ID357
Feature activation-2.76e-01

Pos logit contributions
 Slater+0.511
catentry+0.507
 Accountability+0.501
imov+0.498
 relapse+0.495
Neg logit contributions
ocious-0.429
itime-0.429
iegel-0.424
 stranded-0.422
uri-0.420
Tokenn
Token ID77
Feature activation+3.34e-04

Pos logit contributions
 Slater+0.187
catentry+0.186
 Accountability+0.183
imov+0.182
 relapse+0.180
Neg logit contributions
ocious-0.183
itime-0.182
iegel-0.181
 stranded-0.180
uri-0.179
Token=
Token ID28
Feature activation+4.52e+00

Pos logit contributions
itime+0.000
ocious+0.000
iegel+0.000
 stranded+0.000
uri+0.000
Neg logit contributions
 Slater-0.000
catentry-0.000
 Accountability-0.000
imov-0.000
 relapse-0.000
Token91
Token ID6420
Feature activation+1.17e-02

Pos logit contributions
ocious+2.761
itime+2.747
uri+2.737
iegel+2.729
 stranded+2.695
Neg logit contributions
catentry-3.241
 Slater-3.190
 Accountability-3.171
imov-3.143
isite-3.099
Token)
Token ID8
Feature activation-2.53e-01

Pos logit contributions
ocious+0.006
itime+0.006
iegel+0.006
 stranded+0.006
uri+0.006
Neg logit contributions
 Slater-0.009
catentry-0.009
 Accountability-0.009
imov-0.009
 relapse-0.009
Token only
Token ID691
Feature activation-6.33e-01

Pos logit contributions
catentry+0.192
 Slater+0.191
 Accountability+0.188
imov+0.188
pron+0.185
Neg logit contributions
ocious-0.147
itime-0.147
 stranded-0.146
iegel-0.146
uri-0.144
Token received
Token ID2722
Feature activation-7.82e-01

Pos logit contributions
catentry+0.446
 Slater+0.444
 Accountability+0.437
imov+0.436
pron+0.430
Neg logit contributions
ocious-0.402
itime-0.401
 stranded-0.398
iegel-0.398
uri-0.394
Token trans
Token ID1007
Feature activation-7.65e-02

Pos logit contributions
catentry+0.501
 Slater+0.498
 Accountability+0.490
imov+0.489
pron+0.481
Neg logit contributions
ocious-0.545
itime-0.545
 stranded-0.540
iegel-0.540
uri-0.536
Token http
Token ID2638
Feature activation+1.82e-01

Pos logit contributions
 Slater+0.100
catentry+0.100
imov+0.098
 Accountability+0.098
 relapse+0.097
Neg logit contributions
ocious-0.090
itime-0.090
iegel-0.090
 stranded-0.089
uri-0.089
Token://
Token ID1378
Feature activation+1.16e+00

Pos logit contributions
uri+0.087
ocious+0.087
iegel+0.086
itime+0.086
 stranded+0.085
Neg logit contributions
catentry-0.157
 Slater-0.155
 Accountability-0.153
imov-0.153
 relapse-0.153
Tokenwww
Token ID2503
Feature activation+5.05e-01

Pos logit contributions
ocious+0.693
itime+0.692
iegel+0.685
 stranded+0.680
uri+0.680
Neg logit contributions
 Slater-0.859
catentry-0.853
 Accountability-0.842
imov-0.838
 relapse-0.832
Token.
Token ID13
Feature activation+9.42e-01

Pos logit contributions
uri+0.198
ocious+0.197
iegel+0.197
itime+0.197
 stranded+0.192
Neg logit contributions
catentry-0.477
 Slater-0.473
 Accountability-0.468
imov-0.466
 relapse-0.465
TokenA
Token ID32
Feature activation+1.19e+00

Pos logit contributions
ocious+0.584
itime+0.583
iegel+0.577
 stranded+0.574
uri+0.573
Neg logit contributions
 Slater-0.678
catentry-0.674
 Accountability-0.665
imov-0.662
 relapse-0.656
Tokenchieve
Token ID24957
Feature activation+4.44e+00

Pos logit contributions
itime+0.722
ocious+0.722
iegel+0.714
 stranded+0.711
uri+0.710
Neg logit contributions
 Slater-0.875
catentry-0.872
 Accountability-0.858
imov-0.855
 relapse-0.848
Tokenment
Token ID434
Feature activation+3.53e+00

Pos logit contributions
ocious+2.511
itime+2.489
uri+2.460
iegel+2.451
 stranded+2.432
Neg logit contributions
 Slater-3.434
catentry-3.420
 relapse-3.345
imov-3.329
isite-3.303
Token.
Token ID13
Feature activation+1.60e+00

Pos logit contributions
ocious+2.152
itime+2.150
iegel+2.136
uri+2.132
 stranded+2.097
Neg logit contributions
 Slater-2.557
catentry-2.549
 relapse-2.492
imov-2.490
 Accountability-2.478
Tokenorg
Token ID2398
Feature activation+3.49e-01

Pos logit contributions
ocious+0.544
itime+0.542
iegel+0.531
uri+0.529
 stranded+0.524
Neg logit contributions
 Slater-1.604
catentry-1.597
 Accountability-1.577
imov-1.573
 relapse-1.569
Token/
Token ID14
Feature activation-6.13e-01

Pos logit contributions
ocious+0.181
itime+0.181
iegel+0.179
 stranded+0.178
uri+0.177
Neg logit contributions
 Slater-0.285
catentry-0.284
 Accountability-0.280
imov-0.279
 relapse-0.277
Tokenaut
Token ID2306
Feature activation-2.00e-01

Pos logit contributions
 Slater+0.421
catentry+0.420
 Accountability+0.414
imov+0.413
pron+0.409
Neg logit contributions
ocious-0.398
itime-0.397
iegel-0.393
 stranded-0.393
uri-0.388
Tokenwww
Token ID2503
Feature activation+3.91e-01

Pos logit contributions
iegel+0.548
ocious+0.542
uri+0.537
itime+0.535
 stranded+0.521
Neg logit contributions
 Slater-0.842
 Accountability-0.834
catentry-0.820
imov-0.807
 relapse-0.802
Token.
Token ID13
Feature activation+5.14e-01

Pos logit contributions
uri+0.147
iegel+0.146
ocious+0.144
itime+0.143
 stranded+0.141
Neg logit contributions
catentry-0.377
 Slater-0.371
 Accountability-0.369
imov-0.368
 relapse-0.367
Tokenbel
Token ID6667
Feature activation+9.66e-01

Pos logit contributions
ocious+0.312
iegel+0.311
itime+0.310
uri+0.307
 stranded+0.303
Neg logit contributions
 Slater-0.382
 Accountability-0.377
catentry-0.376
imov-0.368
 relapse-0.367
Tokenfast
Token ID7217
Feature activation-9.29e-02

Pos logit contributions
iegel+0.396
itime+0.390
ocious+0.387
uri+0.381
 stranded+0.377
Neg logit contributions
 Slater-0.907
catentry-0.902
 Accountability-0.899
imov-0.883
 relapse-0.882
Tokente
Token ID660
Feature activation+1.29e+00

Pos logit contributions
 Slater+0.075
catentry+0.074
 Accountability+0.074
imov+0.073
 relapse+0.072
Neg logit contributions
iegel-0.050
ocious-0.050
itime-0.050
uri-0.049
 stranded-0.049
Tokenlegraph
Token ID16606
Feature activation+4.42e+00

Pos logit contributions
uri+0.610
iegel+0.608
ocious+0.601
itime+0.595
 stranded+0.576
Neg logit contributions
 Slater-1.123
 Accountability-1.123
catentry-1.118
imov-1.097
 relapse-1.092
Token.
Token ID13
Feature activation-1.14e-01

Pos logit contributions
uri+1.466
iegel+1.438
ocious+1.336
itime+1.332
enged+1.321
Neg logit contributions
catentry-4.510
 Accountability-4.381
 Slater-4.348
 relapse-4.334
isite-4.320
Tokenco
Token ID1073
Feature activation-1.40e+00

Pos logit contributions
 Slater+0.055
catentry+0.054
 Accountability+0.054
 relapse+0.053
imov+0.053
Neg logit contributions
uri-0.098
ocious-0.097
iegel-0.097
itime-0.097
 stranded-0.095
Token.
Token ID13
Feature activation-6.29e-01

Pos logit contributions
 Slater+1.260
catentry+1.251
 Accountability+1.240
imov+1.236
pron+1.225
Neg logit contributions
itime-0.617
ocious-0.616
iegel-0.605
 stranded-0.604
uri-0.594
Tokenuk
Token ID2724
Feature activation+6.63e-01

Pos logit contributions
 Slater+0.460
catentry+0.459
 Accountability+0.454
imov+0.450
 relapse+0.448
Neg logit contributions
ocious-0.380
itime-0.378
uri-0.378
iegel-0.377
 stranded-0.372
Token/
Token ID14
Feature activation-1.06e+00

Pos logit contributions
uri+0.304
iegel+0.299
itime+0.296
ocious+0.296
 stranded+0.288
Neg logit contributions
catentry-0.586
 Slater-0.578
 relapse-0.572
 Accountability-0.571
imov-0.569
Token terms
Token ID2846
Feature activation-2.20e+00

Pos logit contributions
ocious+0.104
itime+0.104
iegel+0.103
 stranded+0.103
uri+0.102
Neg logit contributions
 Slater-0.110
catentry-0.110
 Accountability-0.108
imov-0.108
 relapse-0.106
Token of
Token ID286
Feature activation-3.75e-01

Pos logit contributions
 Slater+1.529
catentry+1.516
 Accountability+1.498
imov+1.490
 relapse+1.475
Neg logit contributions
itime-1.427
ocious-1.426
iegel-1.409
 stranded-1.403
uri-1.394
Token micro
Token ID4580
Feature activation+8.02e-01

Pos logit contributions
 Slater+0.260
catentry+0.258
 Accountability+0.255
imov+0.254
 relapse+0.251
Neg logit contributions
ocious-0.243
itime-0.242
iegel-0.240
 stranded-0.240
uri-0.238
Token-
Token ID12
Feature activation+1.21e+00

Pos logit contributions
ocious+0.475
itime+0.475
iegel+0.470
 stranded+0.467
uri+0.463
Neg logit contributions
 Slater-0.599
catentry-0.595
 Accountability-0.587
imov-0.585
pron-0.580
Token and
Token ID290
Feature activation+6.98e-01

Pos logit contributions
ocious+0.808
itime+0.807
iegel+0.799
 stranded+0.796
uri+0.795
Neg logit contributions
 Slater-0.819
catentry-0.814
 Accountability-0.802
imov-0.798
 relapse-0.790
Token macro
Token ID15021
Feature activation+4.24e+00

Pos logit contributions
itime+0.471
ocious+0.470
iegel+0.468
 stranded+0.467
uri+0.463
Neg logit contributions
 Slater-0.463
catentry-0.461
 Accountability-0.453
imov-0.452
isite-0.445
Token-
Token ID12
Feature activation+1.92e-01

Pos logit contributions
ocious+2.111
itime+2.105
uri+2.081
iegel+2.076
 stranded+2.074
Neg logit contributions
 Slater-3.531
catentry-3.523
imov-3.488
 Accountability-3.482
 relapse-3.433
Tokenlevel
Token ID5715
Feature activation-4.98e-01

Pos logit contributions
ocious+0.128
itime+0.128
iegel+0.127
 stranded+0.126
uri+0.126
Neg logit contributions
 Slater-0.129
catentry-0.128
 Accountability-0.126
imov-0.125
 relapse-0.124
Token retrieval
Token ID45069
Feature activation+1.55e+00

Pos logit contributions
 Slater+0.343
catentry+0.341
 Accountability+0.337
imov+0.334
 relapse+0.332
Neg logit contributions
ocious-0.324
itime-0.324
iegel-0.320
 stranded-0.319
uri-0.317
Token of
Token ID286
Feature activation+1.28e-01

Pos logit contributions
itime+0.910
ocious+0.908
iegel+0.907
 stranded+0.902
uri+0.899
Neg logit contributions
catentry-1.159
 Slater-1.153
imov-1.136
 Accountability-1.131
pron-1.119
Token detail
Token ID3703
Feature activation+9.87e-01

Pos logit contributions
ocious+0.077
itime+0.077
iegel+0.076
 stranded+0.076
uri+0.075
Neg logit contributions
 Slater-0.095
catentry-0.095
 Accountability-0.093
imov-0.093
 relapse-0.092
TokenMilitary
Token ID37837
Feature activation+1.73e-01

Pos logit contributions
ocious+0.386
itime+0.385
iegel+0.380
uri+0.380
 stranded+0.378
Neg logit contributions
 Slater-0.455
catentry-0.453
 Accountability-0.446
imov-0.444
 relapse-0.442
Token sources
Token ID4237
Feature activation-2.01e-01

Pos logit contributions
ocious+0.103
itime+0.103
 stranded+0.102
iegel+0.102
uri+0.101
Neg logit contributions
 Slater-0.128
catentry-0.128
imov-0.126
 Accountability-0.126
pron-0.124
Token later
Token ID1568
Feature activation-5.09e-01

Pos logit contributions
 Slater+0.149
catentry+0.149
 Accountability+0.146
imov+0.146
isite+0.144
Neg logit contributions
itime-0.120
ocious-0.120
iegel-0.119
 stranded-0.119
uri-0.118
Token told
Token ID1297
Feature activation+5.17e-01

Pos logit contributions
 Slater+0.370
catentry+0.369
 Accountability+0.363
imov+0.362
pron+0.358
Neg logit contributions
itime-0.311
ocious-0.310
iegel-0.308
 stranded-0.307
uri-0.305
Token the
Token ID262
Feature activation+7.23e-01

Pos logit contributions
itime+0.338
ocious+0.338
iegel+0.335
 stranded+0.334
uri+0.332
Neg logit contributions
 Slater-0.354
catentry-0.354
 Accountability-0.346
imov-0.345
 relapse-0.342
Token BBC
Token ID7823
Feature activation+4.24e+00

Pos logit contributions
itime+0.459
ocious+0.457
iegel+0.455
 stranded+0.454
uri+0.451
Neg logit contributions
catentry-0.509
 Slater-0.507
imov-0.497
 Accountability-0.496
 relapse-0.491
Token that
Token ID326
Feature activation-6.56e-01

Pos logit contributions
 stranded+2.617
itime+2.584
ocious+2.578
uri+2.575
iegel+2.575
Neg logit contributions
catentry-3.099
 Slater-3.011
imov-2.983
pron-2.961
 Accountability-2.948
Token the
Token ID262
Feature activation-2.07e-01

Pos logit contributions
 Slater+0.485
catentry+0.480
 Accountability+0.476
imov+0.474
 relapse+0.470
Neg logit contributions
ocious-0.393
itime-0.393
iegel-0.389
 stranded-0.385
uri-0.384
Token overnight
Token ID13417
Feature activation-7.15e-02

Pos logit contributions
catentry+0.166
 Slater+0.166
 Accountability+0.163
imov+0.162
pron+0.161
Neg logit contributions
ocious-0.111
itime-0.111
 stranded-0.111
iegel-0.110
uri-0.109
Token air
Token ID1633
Feature activation+1.55e+00

Pos logit contributions
 Slater+0.052
catentry+0.051
 Accountability+0.051
imov+0.051
 relapse+0.050
Neg logit contributions
ocious-0.044
itime-0.044
iegel-0.043
 stranded-0.043
uri-0.043
Token strikes
Token ID8956
Feature activation+1.27e+00

Pos logit contributions
itime+0.832
ocious+0.832
 stranded+0.822
iegel+0.821
uri+0.813
Neg logit contributions
 Slater-1.245
catentry-1.240
 Accountability-1.226
imov-1.219
pron-1.206
Token the
Token ID262
Feature activation+6.89e-03

Pos logit contributions
 Slater+0.158
catentry+0.157
 Accountability+0.155
imov+0.154
 relapse+0.153
Neg logit contributions
ocious-0.137
itime-0.137
iegel-0.136
 stranded-0.136
uri-0.135
Token prime
Token ID6994
Feature activation+7.05e-01

Pos logit contributions
 stranded+0.004
ocious+0.004
itime+0.004
iegel+0.004
uri+0.004
Neg logit contributions
 Slater-0.005
catentry-0.005
imov-0.005
 Accountability-0.005
pron-0.005
Token challenge
Token ID4427
Feature activation-2.17e-01

Pos logit contributions
 stranded+0.472
itime+0.471
ocious+0.470
iegel+0.467
uri+0.464
Neg logit contributions
 Slater-0.470
catentry-0.469
imov-0.459
 Accountability-0.456
pron-0.454
Token Belfast
Token ID29713
Feature activation+7.38e-01

Pos logit contributions
 Slater+0.165
catentry+0.164
 Accountability+0.162
imov+0.161
 relapse+0.160
Neg logit contributions
ocious-0.126
itime-0.126
iegel-0.124
uri-0.123
 stranded-0.122
TokenTe
Token ID6767
Feature activation+2.33e+00

Pos logit contributions
 stranded+0.482
iegel+0.479
ocious+0.479
itime+0.475
uri+0.468
Neg logit contributions
catentry-0.504
imov-0.499
 Slater-0.497
 Accountability-0.486
pron-0.485
Tokenlegraph
Token ID16606
Feature activation+4.18e+00

Pos logit contributions
uri+1.021
itime+0.994
ocious+0.992
iegel+0.986
 stranded+0.983
Neg logit contributions
catentry-2.107
imov-2.095
 Accountability-2.067
 Slater-2.067
pron-2.043
Token.
Token ID13
Feature activation+2.51e-01

Pos logit contributions
iegel+1.543
itime+1.494
 stranded+1.485
uri+1.482
ocious+1.475
Neg logit contributions
catentry-4.069
imov-3.951
pron-3.946
 Accountability-3.943
 relapse-3.925
Tokenco
Token ID1073
Feature activation-7.11e-01

Pos logit contributions
itime+0.208
ocious+0.206
iegel+0.206
uri+0.205
 stranded+0.204
Neg logit contributions
catentry-0.130
 Slater-0.129
imov-0.127
 relapse-0.123
 Accountability-0.123
Token.
Token ID13
Feature activation-1.53e-01

Pos logit contributions
 Slater+0.646
catentry+0.643
 Accountability+0.636
imov+0.634
 relapse+0.629
Neg logit contributions
ocious-0.305
itime-0.305
iegel-0.301
 stranded-0.299
uri-0.297
Tokenuk
Token ID2724
Feature activation+2.23e+00

Pos logit contributions
catentry+0.114
 Slater+0.113
 Accountability+0.113
imov+0.112
 relapse+0.111
Neg logit contributions
ocious-0.091
uri-0.090
itime-0.090
iegel-0.090
 stranded-0.089
Token Dis
Token ID3167
Feature activation+1.34e+00

Pos logit contributions
itime+1.413
ocious+1.405
iegel+1.404
 stranded+1.401
uri+1.392
Neg logit contributions
catentry-1.569
 Slater-1.523
imov-1.523
 relapse-1.511
isite-1.496
Token lot
Token ID1256
Feature activation-1.92e-01

Pos logit contributions
itime+0.147
ocious+0.146
 stranded+0.146
iegel+0.145
uri+0.144
Neg logit contributions
catentry-0.165
 Slater-0.165
 Accountability-0.162
imov-0.162
isite-0.160
Token of
Token ID286
Feature activation+5.42e-01

Pos logit contributions
 Slater+0.133
catentry+0.132
 Accountability+0.130
imov+0.129
 relapse+0.128
Neg logit contributions
ocious-0.124
itime-0.123
iegel-0.122
uri-0.121
 stranded-0.121
Token something
Token ID1223
Feature activation+2.37e-01

Pos logit contributions
itime+0.332
ocious+0.332
 stranded+0.331
iegel+0.329
uri+0.326
Neg logit contributions
catentry-0.391
 Slater-0.390
 Accountability-0.384
imov-0.384
isite-0.379
Token (
Token ID357
Feature activation-5.59e-01

Pos logit contributions
ocious+0.144
itime+0.144
iegel+0.142
 stranded+0.141
uri+0.141
Neg logit contributions
 Slater-0.174
catentry-0.173
 Accountability-0.170
imov-0.170
 relapse-0.168
Tokenp
Token ID79
Feature activation-5.40e-01

Pos logit contributions
 Slater+0.383
catentry+0.382
 Accountability+0.375
imov+0.375
pron+0.371
Neg logit contributions
ocious-0.365
itime-0.364
iegel-0.361
 stranded-0.360
uri-0.357
Tokenoth
Token ID849
Feature activation+4.16e+00

Pos logit contributions
 Slater+0.437
catentry+0.432
imov+0.429
 Accountability+0.428
 relapse+0.423
Neg logit contributions
ocious-0.286
itime-0.284
 stranded-0.283
iegel-0.281
uri-0.277
Tokenoles
Token ID4316
Feature activation+5.56e-01

Pos logit contributions
ocious+2.170
itime+2.169
uri+2.138
iegel+2.137
 stranded+2.084
Neg logit contributions
catentry-3.440
 Slater-3.392
 Accountability-3.361
imov-3.312
 relapse-3.291
Token in
Token ID287
Feature activation+9.02e-01

Pos logit contributions
ocious+0.328
itime+0.327
 stranded+0.325
iegel+0.324
uri+0.322
Neg logit contributions
catentry-0.417
 Slater-0.416
 Accountability-0.411
imov-0.408
isite-0.403
Token New
Token ID968
Feature activation+4.47e-01

Pos logit contributions
ocious+0.547
 stranded+0.547
itime+0.546
iegel+0.543
uri+0.539
Neg logit contributions
catentry-0.658
 Slater-0.651
 Accountability-0.642
imov-0.642
isite-0.635
Token Jersey
Token ID8221
Feature activation-3.79e-01

Pos logit contributions
itime+0.257
ocious+0.255
iegel+0.254
 stranded+0.254
uri+0.253
Neg logit contributions
catentry-0.341
 Slater-0.341
 Accountability-0.335
imov-0.334
 relapse-0.332
Token,
Token ID11
Feature activation-6.22e-02

Pos logit contributions
 Slater+0.279
catentry+0.277
 Accountability+0.273
imov+0.272
 relapse+0.270
Neg logit contributions
ocious-0.229
itime-0.228
iegel-0.226
 stranded-0.225
uri-0.224
Token 1
Token ID352
Feature activation+9.03e-01

Pos logit contributions
ocious+0.349
itime+0.347
iegel+0.344
uri+0.344
 stranded+0.343
Neg logit contributions
 Slater-0.324
catentry-0.323
 Accountability-0.315
imov-0.314
 relapse-0.313
Token
Token ID198
Feature activation-7.04e-01

Pos logit contributions
ocious+0.535
itime+0.532
 stranded+0.530
iegel+0.527
uri+0.524
Neg logit contributions
 Slater-0.672
catentry-0.671
 Accountability-0.659
imov-0.656
 relapse-0.650
Token
Token ID198
Feature activation-4.25e-01

Pos logit contributions
 Slater+0.452
catentry+0.450
 Accountability+0.443
imov+0.441
 relapse+0.436
Neg logit contributions
ocious-0.489
itime-0.488
iegel-0.485
 stranded-0.482
uri-0.481
TokenRes
Token ID4965
Feature activation-2.32e-01

Pos logit contributions
 Slater+0.294
catentry+0.293
imov+0.288
 Accountability+0.288
pron+0.285
Neg logit contributions
itime-0.274
ocious-0.274
iegel-0.272
 stranded-0.271
uri-0.269
Tokena
Token ID64
Feature activation+9.52e-01

Pos logit contributions
 Slater+0.175
catentry+0.174
 Accountability+0.172
imov+0.171
 relapse+0.169
Neg logit contributions
ocious-0.136
itime-0.136
iegel-0.135
 stranded-0.134
uri-0.134
Tokenech
Token ID3055
Feature activation+4.10e+00

Pos logit contributions
ocious+0.577
itime+0.574
iegel+0.568
 stranded+0.568
uri+0.566
Neg logit contributions
 Slater-0.699
catentry-0.697
 Accountability-0.686
imov-0.683
 relapse-0.678
Token says
Token ID1139
Feature activation-3.61e-01

Pos logit contributions
uri+2.549
ocious+2.544
 stranded+2.523
iegel+2.511
itime+2.508
Neg logit contributions
catentry-2.940
 Slater-2.878
 Accountability-2.846
imov-2.811
 relapse-2.808
Token,
Token ID11
Feature activation-8.49e-01

Pos logit contributions
 Slater+0.261
catentry+0.259
 Accountability+0.256
imov+0.255
 relapse+0.252
Neg logit contributions
ocious-0.223
itime-0.223
iegel-0.220
 stranded-0.219
uri-0.218
Token '
Token ID705
Feature activation-7.08e-01

Pos logit contributions
 Slater+0.459
catentry+0.458
 Accountability+0.448
imov+0.446
 relapse+0.439
Neg logit contributions
ocious-0.677
itime-0.676
iegel-0.671
 stranded-0.671
uri-0.667
Token As
Token ID1081
Feature activation-5.47e-01

Pos logit contributions
 Slater+0.489
catentry+0.486
 Accountability+0.479
imov+0.477
 relapse+0.472
Neg logit contributions
ocious-0.458
itime-0.458
iegel-0.454
 stranded-0.452
uri-0.449
Token you
Token ID345
Feature activation+4.99e-01

Pos logit contributions
 Slater+0.378
catentry+0.376
 Accountability+0.370
imov+0.368
 relapse+0.365
Neg logit contributions
ocious-0.354
itime-0.354
iegel-0.351
 stranded-0.350
uri-0.348
Token P
Token ID350
Feature activation+7.73e-01

Pos logit contributions
ocious+0.268
itime+0.268
iegel+0.266
uri+0.265
 stranded+0.265
Neg logit contributions
catentry-0.282
 Slater-0.280
imov-0.276
 Accountability-0.275
 relapse-0.273
Tokenause
Token ID682
Feature activation+3.14e+00

Pos logit contributions
ocious+0.457
itime+0.457
iegel+0.452
 stranded+0.449
uri+0.448
Neg logit contributions
 Slater-0.578
catentry-0.576
 Accountability-0.568
imov-0.565
 relapse-0.560
Token.
Token ID13
Feature activation+1.46e+00

Pos logit contributions
 stranded+1.574
itime+1.569
ocious+1.567
iegel+1.563
uri+1.548
Neg logit contributions
catentry-2.640
 Slater-2.599
 Accountability-2.566
imov-2.551
isite-2.543
Token "
Token ID366
Feature activation+2.06e-01

Pos logit contributions
ocious+1.098
itime+1.096
uri+1.096
iegel+1.092
 stranded+1.089
Neg logit contributions
catentry-0.841
 Slater-0.815
imov-0.807
 relapse-0.800
isite-0.799
TokenO
Token ID46
Feature activation+1.82e+00

Pos logit contributions
ocious+0.131
itime+0.131
iegel+0.129
 stranded+0.129
uri+0.128
Neg logit contributions
 Slater-0.146
catentry-0.145
 Accountability-0.143
imov-0.142
 relapse-0.141
Tokenuch
Token ID794
Feature activation+4.10e+00

Pos logit contributions
ocious+1.021
itime+1.018
iegel+1.007
uri+1.005
 stranded+1.001
Neg logit contributions
 Slater-1.414
catentry-1.408
 Accountability-1.399
imov-1.379
 relapse-1.368
Token,
Token ID11
Feature activation+1.51e+00

Pos logit contributions
ocious+2.131
itime+2.126
 stranded+2.107
uri+2.103
iegel+2.102
Neg logit contributions
 Slater-3.300
catentry-3.293
 Accountability-3.264
imov-3.229
 relapse-3.210
Token what
Token ID644
Feature activation+5.89e-01

Pos logit contributions
ocious+0.931
 stranded+0.931
itime+0.929
iegel+0.925
uri+0.916
Neg logit contributions
catentry-1.081
 Slater-1.077
 Accountability-1.061
imov-1.058
isite-1.046
Token loss
Token ID2994
Feature activation+4.43e-01

Pos logit contributions
ocious+0.328
itime+0.328
iegel+0.324
 stranded+0.322
uri+0.320
Neg logit contributions
 Slater-0.461
catentry-0.459
 Accountability-0.453
imov-0.451
 relapse-0.447
Token,"
Token ID553
Feature activation+3.78e-01

Pos logit contributions
ocious+0.281
itime+0.280
iegel+0.278
 stranded+0.277
uri+0.275
Neg logit contributions
 Slater-0.312
catentry-0.311
 Accountability-0.306
imov-0.304
 relapse-0.301
Token he
Token ID339
Feature activation-3.79e-01

Pos logit contributions
ocious+0.210
itime+0.210
iegel+0.208
 stranded+0.207
uri+0.206
Neg logit contributions
 Slater-0.296
catentry-0.294
 Accountability-0.291
imov-0.290
 relapse-0.287
Tokenbb
Token ID11848
Feature activation-1.23e+00

Pos logit contributions
ocious+0.509
itime+0.508
iegel+0.504
uri+0.502
 stranded+0.501
Neg logit contributions
 Slater-0.528
catentry-0.527
 Accountability-0.518
imov-0.515
isite-0.509
Token9
Token ID24
Feature activation+1.23e-01

Pos logit contributions
 Slater+0.832
catentry+0.827
 Accountability+0.814
imov+0.813
 relapse+0.804
Neg logit contributions
ocious-0.811
itime-0.811
iegel-0.802
 stranded-0.801
uri-0.794
Tokend
Token ID67
Feature activation-9.91e-01

Pos logit contributions
ocious+0.080
itime+0.080
iegel+0.079
 stranded+0.079
uri+0.079
Neg logit contributions
 Slater-0.084
catentry-0.084
 Accountability-0.082
imov-0.082
 relapse-0.081
Token970
Token ID43587
Feature activation+2.26e-01

Pos logit contributions
 Slater+0.678
catentry+0.674
 Accountability+0.664
imov+0.662
 relapse+0.654
Neg logit contributions
ocious-0.648
itime-0.648
iegel-0.641
 stranded-0.639
uri-0.635
Tokenb
Token ID65
Feature activation-1.55e+00

Pos logit contributions
ocious+0.145
itime+0.145
iegel+0.143
 stranded+0.143
uri+0.142
Neg logit contributions
 Slater-0.158
catentry-0.157
 Accountability-0.155
imov-0.154
 relapse-0.152
Token L
Token ID406
Feature activation+4.07e+00

Pos logit contributions
 Slater+1.057
catentry+1.049
 Accountability+1.035
imov+1.033
 relapse+1.021
Neg logit contributions
itime-1.014
ocious-1.014
iegel-1.003
 stranded-1.001
uri-0.992
Tokenisted
Token ID6347
Feature activation+1.56e+00

Pos logit contributions
itime+2.517
uri+2.505
ocious+2.491
 stranded+2.468
iegel+2.466
Neg logit contributions
catentry-2.915
 Slater-2.889
 Accountability-2.869
imov-2.834
isite-2.829
Token below
Token ID2174
Feature activation+8.84e-01

Pos logit contributions
itime+0.960
ocious+0.957
iegel+0.950
 stranded+0.948
uri+0.943
Neg logit contributions
 Slater-1.118
catentry-1.113
 Accountability-1.098
imov-1.095
pron-1.090
Token are
Token ID389
Feature activation+4.64e-01

Pos logit contributions
itime+0.498
ocious+0.495
iegel+0.491
 stranded+0.491
uri+0.487
Neg logit contributions
 Slater-0.683
catentry-0.682
 Accountability-0.672
imov-0.669
pron-0.665
Token links
Token ID6117
Feature activation+7.53e-02

Pos logit contributions
itime+0.290
ocious+0.290
iegel+0.288
 stranded+0.287
uri+0.285
Neg logit contributions
 Slater-0.330
catentry-0.328
 Accountability-0.324
imov-0.322
 relapse-0.319
Token to
Token ID284
Feature activation-7.18e-02

Pos logit contributions
ocious+0.040
itime+0.040
iegel+0.040
 stranded+0.039
uri+0.039
Neg logit contributions
 Slater-0.061
catentry-0.060
 Accountability-0.060
imov-0.059
 relapse-0.059
Token Also
Token ID4418
Feature activation+1.16e+00

Pos logit contributions
itime+0.294
ocious+0.294
uri+0.292
iegel+0.290
 stranded+0.289
Neg logit contributions
catentry-0.322
 Slater-0.317
imov-0.313
 relapse-0.313
 Accountability-0.310
Token,
Token ID11
Feature activation+2.35e+00

Pos logit contributions
itime+0.695
ocious+0.693
 stranded+0.688
iegel+0.687
uri+0.682
Neg logit contributions
 Slater-0.850
catentry-0.850
 Accountability-0.837
imov-0.832
 relapse-0.825
Token anyone
Token ID2687
Feature activation+1.43e-01

Pos logit contributions
 stranded+1.395
itime+1.383
iegel+1.371
ocious+1.367
uri+1.365
Neg logit contributions
catentry-1.745
 Slater-1.712
imov-1.692
 Accountability-1.688
isite-1.679
Token who
Token ID508
Feature activation+1.01e+00

Pos logit contributions
ocious+0.077
itime+0.077
 stranded+0.076
iegel+0.076
uri+0.076
Neg logit contributions
 Slater-0.114
catentry-0.114
 Accountability-0.112
imov-0.112
 relapse-0.110
Token would
Token ID561
Feature activation+3.79e-01

Pos logit contributions
 stranded+0.512
ocious+0.504
itime+0.503
iegel+0.503
uri+0.495
Neg logit contributions
catentry-0.848
 Slater-0.825
imov-0.819
 Accountability-0.818
isite-0.807
Token like
Token ID588
Feature activation+4.05e+00

Pos logit contributions
itime+0.242
ocious+0.240
 stranded+0.240
iegel+0.240
uri+0.237
Neg logit contributions
catentry-0.268
 Slater-0.265
 Accountability-0.262
imov-0.260
isite-0.257
Token to
Token ID284
Feature activation+8.89e-01

Pos logit contributions
itime+1.996
 stranded+1.979
ocious+1.968
iegel+1.933
uri+1.916
Neg logit contributions
catentry-3.412
 Slater-3.374
imov-3.337
 Accountability-3.320
pron-3.308
Token learn
Token ID2193
Feature activation-2.48e-01

Pos logit contributions
 stranded+0.499
itime+0.494
iegel+0.492
uri+0.489
ocious+0.484
Neg logit contributions
catentry-0.694
 Slater-0.684
 Accountability-0.677
imov-0.673
isite-0.671
Token all
Token ID477
Feature activation+3.38e-01

Pos logit contributions
 Slater+0.179
catentry+0.179
 Accountability+0.176
imov+0.175
 relapse+0.173
Neg logit contributions
ocious-0.152
itime-0.152
iegel-0.151
 stranded-0.150
uri-0.149
Token of
Token ID286
Feature activation-6.05e-01

Pos logit contributions
itime+0.191
ocious+0.191
 stranded+0.189
iegel+0.189
uri+0.188
Neg logit contributions
catentry-0.260
 Slater-0.259
 Accountability-0.255
imov-0.254
 relapse-0.252
Token the
Token ID262
Feature activation+2.55e-01

Pos logit contributions
catentry+0.416
 Slater+0.416
 Accountability+0.409
imov+0.407
 relapse+0.403
Neg logit contributions
ocious-0.391
itime-0.390
 stranded-0.387
iegel-0.387
uri-0.385
Token.)
Token ID2014
Feature activation+1.13e+00

Pos logit contributions
catentry+0.981
 Slater+0.976
 Accountability+0.961
imov+0.960
isite+0.943
Neg logit contributions
itime-0.924
ocious-0.921
iegel-0.917
 stranded-0.917
uri-0.913
Token
Token ID198
Feature activation-3.67e-01

Pos logit contributions
uri+0.739
ocious+0.737
itime+0.733
iegel+0.732
 stranded+0.729
Neg logit contributions
catentry-0.775
 Slater-0.756
imov-0.749
 relapse-0.745
isite-0.741
Token
Token ID198
Feature activation-4.41e-01

Pos logit contributions
catentry+0.247
 Slater+0.244
 Accountability+0.240
imov+0.240
 relapse+0.238
Neg logit contributions
ocious-0.242
uri-0.242
itime-0.241
iegel-0.241
 stranded-0.239
TokenWar
Token ID13195
Feature activation-3.65e-01

Pos logit contributions
 Slater+0.313
catentry+0.311
 Accountability+0.307
imov+0.305
 relapse+0.303
Neg logit contributions
ocious-0.277
itime-0.277
iegel-0.274
 stranded-0.273
uri-0.273
Tokenner
Token ID1008
Feature activation-1.78e+00

Pos logit contributions
 Slater+0.220
catentry+0.219
 Accountability+0.215
imov+0.214
 relapse+0.211
Neg logit contributions
ocious-0.269
itime-0.269
iegel-0.266
 stranded-0.265
uri-0.264
Token Bros
Token ID14266
Feature activation-4.70e+00

Pos logit contributions
 Slater+1.048
 Accountability+1.023
catentry+1.018
imov+1.005
 relapse+0.996
Neg logit contributions
itime-1.343
ocious-1.342
iegel-1.327
 stranded-1.325
uri-1.322
Token /
Token ID1220
Feature activation-1.84e+00

Pos logit contributions
 Slater+3.933
 Accountability+3.862
catentry+3.789
imov+3.785
pron+3.772
Neg logit contributions
itime-2.374
ocious-2.336
iegel-2.265
 stranded-2.256
uri-2.233
Token Courtesy
Token ID22984
Feature activation-4.13e-01

Pos logit contributions
 Slater+1.140
catentry+1.134
 Accountability+1.115
imov+1.110
 relapse+1.097
Neg logit contributions
ocious-1.319
itime-1.318
iegel-1.307
 stranded-1.302
uri-1.296
Token Everett
Token ID36815
Feature activation-4.89e-01

Pos logit contributions
catentry+0.288
 Slater+0.287
imov+0.281
 Accountability+0.281
 relapse+0.279
Neg logit contributions
ocious-0.264
itime-0.263
iegel-0.261
 stranded-0.261
uri-0.259
Token Collection
Token ID12251
Feature activation-1.43e+00

Pos logit contributions
 Slater+0.274
catentry+0.270
 Accountability+0.267
imov+0.265
 relapse+0.261
Neg logit contributions
itime-0.382
ocious-0.382
iegel-0.378
 stranded-0.377
uri-0.375
Token Amy
Token ID14235
Feature activation-8.59e-01

Pos logit contributions
 Slater+0.932
catentry+0.916
 Accountability+0.914
imov+0.906
 relapse+0.894
Neg logit contributions
itime-0.989
ocious-0.988
iegel-0.975
 stranded-0.970
uri-0.965
Token very
Token ID845
Feature activation+1.49e-01

Pos logit contributions
 Slater+0.035
catentry+0.035
 Accountability+0.034
imov+0.034
 relapse+0.034
Neg logit contributions
ocious-0.030
itime-0.030
iegel-0.030
 stranded-0.030
uri-0.029
Token sad
Token ID6507
Feature activation-5.56e-01

Pos logit contributions
ocious+0.095
itime+0.094
iegel+0.094
 stranded+0.093
uri+0.093
Neg logit contributions
 Slater-0.104
catentry-0.104
 Accountability-0.102
imov-0.102
 relapse-0.101
Token 1
Token ID352
Feature activation-6.50e-01

Pos logit contributions
 Slater+0.398
catentry+0.396
 Accountability+0.391
imov+0.389
 relapse+0.385
Neg logit contributions
ocious-0.346
itime-0.346
iegel-0.343
 stranded-0.341
uri-0.340
Token%.
Token ID7225
Feature activation-9.07e-01

Pos logit contributions
 Slater+0.442
catentry+0.438
 Accountability+0.432
imov+0.431
 relapse+0.426
Neg logit contributions
itime-0.428
ocious-0.428
iegel-0.424
 stranded-0.421
uri-0.420
Token We
Token ID775
Feature activation-5.54e-01

Pos logit contributions
 Slater+0.603
catentry+0.598
 Accountability+0.590
imov+0.587
 relapse+0.580
Neg logit contributions
ocious-0.612
itime-0.612
iegel-0.606
 stranded-0.604
uri-0.601
Token wonder
Token ID4240
Feature activation-4.55e+00

Pos logit contributions
 Slater+0.403
catentry+0.402
 Accountability+0.396
imov+0.394
 relapse+0.390
Neg logit contributions
ocious-0.338
itime-0.337
iegel-0.334
 stranded-0.333
uri-0.331
Token if
Token ID611
Feature activation-1.09e+00

Pos logit contributions
 Slater+3.026
 Accountability+2.932
imov+2.912
 relapse+2.891
pron+2.888
Neg logit contributions
itime-3.091
ocious-3.084
iegel-3.019
uri-2.987
asio-2.953
Token those
Token ID883
Feature activation-2.56e-01

Pos logit contributions
 Slater+0.732
catentry+0.722
 Accountability+0.716
imov+0.711
 relapse+0.705
Neg logit contributions
itime-0.732
ocious-0.731
iegel-0.723
 stranded-0.718
uri-0.717
Token in
Token ID287
Feature activation+7.35e-01

Pos logit contributions
 Slater+0.192
catentry+0.191
 Accountability+0.189
imov+0.188
 relapse+0.186
Neg logit contributions
itime-0.151
ocious-0.151
iegel-0.149
 stranded-0.148
uri-0.148
Token charge
Token ID3877
Feature activation+9.85e-02

Pos logit contributions
ocious+0.468
itime+0.465
 stranded+0.464
iegel+0.463
uri+0.459
Neg logit contributions
catentry-0.518
 Slater-0.513
 Accountability-0.506
imov-0.503
 relapse-0.499
Token of
Token ID286
Feature activation-5.44e-01

Pos logit contributions
itime+0.060
ocious+0.060
 stranded+0.059
iegel+0.059
uri+0.059
Neg logit contributions
catentry-0.072
 Slater-0.072
imov-0.070
 Accountability-0.070
isite-0.070
Token Act
Token ID2191
Feature activation-1.30e+00

Pos logit contributions
itime+0.688
iegel+0.686
ocious+0.681
 stranded+0.678
uri+0.676
Neg logit contributions
catentry-0.730
 Slater-0.706
isite-0.703
 relapse-0.701
 Accountability-0.698
Tokenon
Token ID261
Feature activation-1.10e+00

Pos logit contributions
 Slater+1.085
catentry+1.076
 Accountability+1.066
imov+1.064
 relapse+1.052
Neg logit contributions
ocious-0.652
itime-0.649
iegel-0.641
 stranded-0.639
uri-0.632
Token)
Token ID8
Feature activation+1.11e+00

Pos logit contributions
 Slater+0.810
catentry+0.807
 Accountability+0.795
imov+0.792
 relapse+0.785
Neg logit contributions
ocious-0.656
itime-0.655
iegel-0.649
 stranded-0.646
uri-0.642
TokenIn
Token ID818
Feature activation-8.46e-01

Pos logit contributions
itime+0.726
ocious+0.724
 stranded+0.716
iegel+0.716
uri+0.715
Neg logit contributions
catentry-0.747
 Slater-0.743
imov-0.729
 relapse-0.728
isite-0.725
Token an
Token ID281
Feature activation+3.23e-01

Pos logit contributions
 Slater+0.550
catentry+0.548
 Accountability+0.539
imov+0.536
 relapse+0.531
Neg logit contributions
ocious-0.582
itime-0.582
iegel-0.577
 stranded-0.575
uri-0.572
Token unpublished
Token ID42686
Feature activation-4.34e+00

Pos logit contributions
ocious+0.229
itime+0.229
iegel+0.228
 stranded+0.227
uri+0.226
Neg logit contributions
 Slater-0.203
catentry-0.202
imov-0.198
 Accountability-0.197
pron-0.196
Token article
Token ID2708
Feature activation-2.13e+00

Pos logit contributions
 Slater+2.608
catentry+2.558
 Accountability+2.547
 relapse+2.511
imov+2.506
Neg logit contributions
ocious-3.215
itime-3.204
iegel-3.167
uri-3.138
 stranded-3.124
Token written
Token ID3194
Feature activation-2.48e-01

Pos logit contributions
 Slater+1.499
catentry+1.482
 Accountability+1.467
imov+1.461
 relapse+1.441
Neg logit contributions
ocious-1.354
itime-1.354
iegel-1.340
 stranded-1.333
uri-1.327
Token around
Token ID1088
Feature activation-6.87e-01

Pos logit contributions
 Slater+0.186
catentry+0.185
 Accountability+0.183
imov+0.182
 relapse+0.180
Neg logit contributions
ocious-0.146
itime-0.146
iegel-0.145
 stranded-0.144
uri-0.143
Token May
Token ID1737
Feature activation+1.76e-01

Pos logit contributions
 Slater+0.483
catentry+0.481
 Accountability+0.474
imov+0.472
 relapse+0.467
Neg logit contributions
ocious-0.437
itime-0.437
iegel-0.432
 stranded-0.431
uri-0.428
Token of
Token ID286
Feature activation-8.03e-01

Pos logit contributions
itime+0.109
ocious+0.109
iegel+0.108
 stranded+0.107
uri+0.107
Neg logit contributions
 Slater-0.126
catentry-0.126
 Accountability-0.124
imov-0.123
 relapse-0.122
Token The
Token ID383
Feature activation+2.73e-01

Pos logit contributions
ocious+0.168
itime+0.167
uri+0.166
iegel+0.166
 stranded+0.166
Neg logit contributions
catentry-0.161
 Slater-0.160
imov-0.158
 Accountability-0.157
 relapse-0.157
Token built
Token ID3170
Feature activation+9.97e-01

Pos logit contributions
ocious+0.183
itime+0.182
 stranded+0.181
iegel+0.181
uri+0.180
Neg logit contributions
 Slater-0.182
catentry-0.182
imov-0.179
 Accountability-0.179
 relapse-0.176
Token-
Token ID12
Feature activation-5.37e-01

Pos logit contributions
ocious+0.465
itime+0.462
 stranded+0.455
iegel+0.453
uri+0.453
Neg logit contributions
 Slater-0.872
catentry-0.870
 Accountability-0.857
imov-0.855
 relapse-0.846
Tokenin
Token ID259
Feature activation+4.79e-01

Pos logit contributions
 Slater+0.269
catentry+0.266
 Accountability+0.261
imov+0.258
 relapse+0.255
Neg logit contributions
ocious-0.453
itime-0.452
uri-0.448
iegel-0.447
 stranded-0.447
Token cl
Token ID537
Feature activation-1.11e+00

Pos logit contributions
ocious+0.344
itime+0.343
 stranded+0.343
iegel+0.341
uri+0.341
Neg logit contributions
catentry-0.295
 Slater-0.293
imov-0.288
 Accountability-0.287
isite-0.283
Tokenojure
Token ID32511
Feature activation-4.21e+00

Pos logit contributions
 Slater+0.824
catentry+0.810
imov+0.809
 Accountability+0.802
pron+0.801
Neg logit contributions
ocious-0.651
itime-0.649
iegel-0.645
 stranded-0.642
uri-0.636
Token.
Token ID13
Feature activation-8.68e-02

Pos logit contributions
 Slater+3.344
pron+3.291
 Accountability+3.282
imov+3.281
catentry+3.276
Neg logit contributions
ocious-2.224
itime-2.218
iegel-2.186
 stranded-2.154
uri-2.136
Tokentest
Token ID9288
Feature activation-2.64e-01

Pos logit contributions
 Slater+0.060
catentry+0.060
 Accountability+0.059
imov+0.059
 relapse+0.058
Neg logit contributions
ocious-0.056
itime-0.056
iegel-0.055
 stranded-0.055
uri-0.055
Token was
Token ID373
Feature activation+3.41e-01

Pos logit contributions
 Slater+0.187
catentry+0.186
 Accountability+0.183
imov+0.182
 relapse+0.180
Neg logit contributions
ocious-0.166
itime-0.166
iegel-0.165
 stranded-0.164
uri-0.163
Token by
Token ID416
Feature activation+1.18e-01

Pos logit contributions
ocious+0.215
itime+0.213
 stranded+0.213
iegel+0.212
uri+0.211
Neg logit contributions
catentry-0.242
 Slater-0.241
imov-0.237
 Accountability-0.237
 relapse-0.234
Token far
Token ID1290
Feature activation-4.89e-01

Pos logit contributions
ocious+0.082
itime+0.081
iegel+0.081
 stranded+0.081
uri+0.081
Neg logit contributions
catentry-0.075
 Slater-0.075
 Accountability-0.074
imov-0.074
 relapse-0.073
Token people
Token ID661
Feature activation+1.90e-01

Pos logit contributions
 Slater+0.296
 Accountability+0.292
catentry+0.291
imov+0.290
 relapse+0.287
Neg logit contributions
itime-0.219
ocious-0.218
iegel-0.216
uri-0.214
 stranded-0.212
Token,
Token ID11
Feature activation-6.03e-01

Pos logit contributions
ocious+0.115
itime+0.115
iegel+0.114
 stranded+0.113
uri+0.113
Neg logit contributions
 Slater-0.139
catentry-0.138
 Accountability-0.137
imov-0.136
 relapse-0.135
Token protecting
Token ID10192
Feature activation-3.87e-02

Pos logit contributions
 Slater+0.431
catentry+0.426
 Accountability+0.423
imov+0.418
 relapse+0.415
Neg logit contributions
itime-0.377
ocious-0.376
iegel-0.372
 stranded-0.370
uri-0.369
Token affordable
Token ID10935
Feature activation-1.25e+00

Pos logit contributions
catentry+0.027
 Slater+0.026
imov+0.026
 Accountability+0.026
 relapse+0.026
Neg logit contributions
ocious-0.025
itime-0.025
 stranded-0.025
iegel-0.025
uri-0.025
Token health
Token ID1535
Feature activation-1.45e+00

Pos logit contributions
 Slater+0.723
catentry+0.710
 Accountability+0.710
 relapse+0.695
imov+0.694
Neg logit contributions
ocious-0.957
itime-0.955
iegel-0.950
uri-0.942
 stranded-0.939
Token care
Token ID1337
Feature activation-4.18e+00

Pos logit contributions
 Slater+0.952
 Accountability+0.937
catentry+0.929
 relapse+0.921
imov+0.912
Neg logit contributions
ocious-0.993
itime-0.989
iegel-0.975
 stranded-0.968
uri-0.964
Token and
Token ID290
Feature activation-8.19e-01

Pos logit contributions
 Slater+2.859
 Accountability+2.823
catentry+2.770
 relapse+2.742
pron+2.716
Neg logit contributions
itime-2.745
ocious-2.734
iegel-2.679
 stranded-2.660
uri-2.640
Token Medicare
Token ID13771
Feature activation+4.56e-01

Pos logit contributions
 Slater+0.559
catentry+0.552
 Accountability+0.549
imov+0.541
 relapse+0.539
Neg logit contributions
itime-0.540
ocious-0.539
iegel-0.533
uri-0.528
 stranded-0.528
Token,
Token ID11
Feature activation+1.09e-01

Pos logit contributions
ocious+0.240
itime+0.240
iegel+0.237
 stranded+0.236
uri+0.235
Neg logit contributions
 Slater-0.369
catentry-0.369
 Accountability-0.363
imov-0.363
 relapse-0.359
Token and
Token ID290
Feature activation+4.13e-01

Pos logit contributions
ocious+0.069
itime+0.069
iegel+0.068
 stranded+0.068
uri+0.068
Neg logit contributions
 Slater-0.077
catentry-0.077
 Accountability-0.076
imov-0.075
 relapse-0.075
Token demanding
Token ID11334
Feature activation+8.24e-02

Pos logit contributions
ocious+0.267
itime+0.267
iegel+0.264
 stranded+0.264
uri+0.262
Neg logit contributions
 Slater-0.286
catentry-0.285
 Accountability-0.280
imov-0.280
 relapse-0.276
Token
Token ID250
Feature activation-8.66e-01

Pos logit contributions
 Slater+1.123
catentry+1.117
 Accountability+1.103
imov+1.098
 relapse+1.089
Neg logit contributions
ocious-0.831
itime-0.829
iegel-0.820
 stranded-0.817
uri-0.813
Tokennotice
Token ID42138
Feature activation-3.15e+00

Pos logit contributions
 Slater+0.580
catentry+0.575
 Accountability+0.566
imov+0.565
 relapse+0.559
Neg logit contributions
ocious-0.581
itime-0.580
iegel-0.574
uri-0.571
 stranded-0.571
Token of
Token ID286
Feature activation+3.29e-01

Pos logit contributions
 Slater+2.352
catentry+2.342
 Accountability+2.332
pron+2.290
imov+2.290
Neg logit contributions
itime-1.852
ocious-1.847
iegel-1.820
 stranded-1.803
uri-1.784
Token proposed
Token ID5150
Feature activation-5.95e-01

Pos logit contributions
ocious+0.211
itime+0.209
 stranded+0.209
iegel+0.208
uri+0.207
Neg logit contributions
 Slater-0.229
catentry-0.229
imov-0.224
 Accountability-0.223
pron-0.221
Token rule
Token ID3896
Feature activation+8.36e-01

Pos logit contributions
 Slater+0.382
catentry+0.382
imov+0.374
 Accountability+0.371
pron+0.367
Neg logit contributions
ocious-0.412
itime-0.411
 stranded-0.409
iegel-0.408
uri-0.405
Tokenmaking
Token ID8601
Feature activation-4.14e+00

Pos logit contributions
ocious+0.724
uri+0.721
iegel+0.718
 stranded+0.718
itime+0.718
Neg logit contributions
 Slater-0.391
catentry-0.380
imov-0.379
 Accountability-0.372
 relapse-0.370
Token
Token ID447
Feature activation-8.24e-01

Pos logit contributions
 Slater+3.104
catentry+3.088
 Accountability+3.076
imov+3.017
 relapse+3.002
Neg logit contributions
itime-2.441
ocious-2.428
iegel-2.404
 stranded-2.379
uri-2.356
Token
Token ID251
Feature activation-8.99e-01

Pos logit contributions
 Slater+0.536
catentry+0.533
 Accountability+0.525
imov+0.522
 relapse+0.517
Neg logit contributions
ocious-0.567
itime-0.566
iegel-0.561
 stranded-0.559
uri-0.556
Token for
Token ID329
Feature activation+5.64e-01

Pos logit contributions
 Slater+0.650
catentry+0.647
imov+0.637
 Accountability+0.636
 relapse+0.630
Neg logit contributions
ocious-0.553
itime-0.551
iegel-0.546
 stranded-0.545
uri-0.543
Token civil
Token ID3026
Feature activation-3.31e-01

Pos logit contributions
ocious+0.357
itime+0.355
iegel+0.353
 stranded+0.353
uri+0.350
Neg logit contributions
catentry-0.397
 Slater-0.397
imov-0.389
 Accountability-0.388
 relapse-0.384
Token-
Token ID12
Feature activation-1.40e+00

Pos logit contributions
 Slater+0.288
 Accountability+0.284
catentry+0.284
 relapse+0.281
pron+0.280
Neg logit contributions
ocious-0.158
itime-0.157
iegel-0.153
uri-0.153
 stranded-0.151
Token of
Token ID286
Feature activation-4.11e-01

Pos logit contributions
catentry+0.047
 Slater+0.046
 Accountability+0.046
imov+0.046
 relapse+0.045
Neg logit contributions
ocious-0.040
itime-0.040
iegel-0.040
 stranded-0.040
uri-0.040
Token the
Token ID262
Feature activation-4.86e-01

Pos logit contributions
catentry+0.296
 Slater+0.296
 Accountability+0.290
imov+0.290
 relapse+0.287
Neg logit contributions
ocious-0.253
itime-0.253
iegel-0.251
 stranded-0.251
uri-0.249
Token
Token ID564
Feature activation-9.86e-01

Pos logit contributions
catentry+0.346
 Slater+0.345
imov+0.339
 Accountability+0.337
 relapse+0.336
Neg logit contributions
itime-0.303
ocious-0.303
 stranded-0.302
iegel-0.301
uri-0.298
Token
Token ID250
Feature activation-4.48e-01

Pos logit contributions
 Slater+0.568
catentry+0.565
 Accountability+0.555
imov+0.552
 relapse+0.544
Neg logit contributions
ocious-0.753
itime-0.752
iegel-0.746
 stranded-0.743
uri-0.739
TokenBernie
Token ID33433
Feature activation-1.36e+00

Pos logit contributions
 Slater+0.320
catentry+0.319
 Accountability+0.314
imov+0.313
 relapse+0.311
Neg logit contributions
itime-0.279
ocious-0.279
iegel-0.276
 stranded-0.275
uri-0.273
Token Bros
Token ID14266
Feature activation-4.11e+00

Pos logit contributions
 Slater+0.963
catentry+0.950
 Accountability+0.945
imov+0.935
 relapse+0.927
Neg logit contributions
ocious-0.866
itime-0.866
iegel-0.856
 stranded-0.850
uri-0.843
Token.
Token ID13
Feature activation+5.15e-01

Pos logit contributions
 Slater+2.923
 Accountability+2.886
catentry+2.855
pron+2.814
imov+2.811
Neg logit contributions
itime-2.597
ocious-2.571
iegel-2.538
 stranded-2.527
asio-2.481
Token
Token ID447
Feature activation-8.91e-01

Pos logit contributions
ocious+0.318
 stranded+0.313
uri+0.313
itime+0.312
iegel+0.312
Neg logit contributions
catentry-0.371
 Slater-0.368
imov-0.365
 relapse-0.361
 Accountability-0.359
Token
Token ID251
Feature activation+2.04e-01

Pos logit contributions
 Slater+0.543
catentry+0.540
 Accountability+0.531
imov+0.528
 relapse+0.522
Neg logit contributions
itime-0.650
ocious-0.650
iegel-0.644
 stranded-0.641
uri-0.637
Token Apparently
Token ID18626
Feature activation+2.43e+00

Pos logit contributions
ocious+0.125
itime+0.124
iegel+0.123
uri+0.123
 stranded+0.122
Neg logit contributions
 Slater-0.149
catentry-0.148
imov-0.146
 Accountability-0.146
 relapse-0.144
Token,
Token ID11
Feature activation+3.76e-01

Pos logit contributions
 stranded+1.445
ocious+1.442
iegel+1.436
uri+1.433
itime+1.427
Neg logit contributions
catentry-1.800
 Slater-1.759
imov-1.751
 Accountability-1.736
 relapse-1.733
Token this
Token ID428
Feature activation+1.86e-01

Pos logit contributions
ocious+0.693
 stranded+0.692
iegel+0.690
itime+0.689
uri+0.683
Neg logit contributions
 Slater-0.734
catentry-0.734
 Accountability-0.721
imov-0.717
 relapse-0.709
Token query
Token ID12405
Feature activation+1.62e-01

Pos logit contributions
ocious+0.118
itime+0.117
 stranded+0.117
iegel+0.117
uri+0.116
Neg logit contributions
 Slater-0.131
catentry-0.130
 Accountability-0.128
imov-0.128
 relapse-0.126
Token to
Token ID284
Feature activation+5.07e-01

Pos logit contributions
ocious+0.093
itime+0.093
iegel+0.092
 stranded+0.092
uri+0.091
Neg logit contributions
 Slater-0.124
catentry-0.124
 Accountability-0.122
imov-0.122
 relapse-0.121
Token the
Token ID262
Feature activation+6.94e-01

Pos logit contributions
ocious+0.317
itime+0.316
 stranded+0.315
iegel+0.314
uri+0.311
Neg logit contributions
catentry-0.361
 Slater-0.360
 Accountability-0.354
imov-0.353
 relapse-0.349
Token cl
Token ID537
Feature activation-1.10e+00

Pos logit contributions
itime+0.457
ocious+0.457
 stranded+0.457
iegel+0.454
uri+0.451
Neg logit contributions
catentry-0.469
 Slater-0.466
 Accountability-0.459
imov-0.458
pron-0.452
Tokenojure
Token ID32511
Feature activation-4.08e+00

Pos logit contributions
 Slater+0.821
catentry+0.809
imov+0.808
isite+0.799
 Accountability+0.798
Neg logit contributions
ocious-0.648
itime-0.643
iegel-0.640
 stranded-0.639
uri-0.632
Token.
Token ID13
Feature activation-6.16e-02

Pos logit contributions
 Slater+3.389
 Accountability+3.313
isite+3.307
catentry+3.303
pron+3.298
Neg logit contributions
ocious-2.057
itime-2.005
iegel-1.989
 stranded-1.954
uri-1.950
Tokenfiles
Token ID16624
Feature activation-7.93e-01

Pos logit contributions
 Slater+0.043
catentry+0.043
 Accountability+0.042
imov+0.042
 relapse+0.042
Neg logit contributions
ocious-0.040
itime-0.040
iegel-0.039
 stranded-0.039
uri-0.039
Token table
Token ID3084
Feature activation-3.77e-01

Pos logit contributions
 Slater+0.584
catentry+0.583
 Accountability+0.573
imov+0.570
 relapse+0.565
Neg logit contributions
ocious-0.478
itime-0.478
iegel-0.474
 stranded-0.472
uri-0.470
Token SELECT
Token ID33493
Feature activation+5.42e-01

Pos logit contributions
 Slater+0.262
catentry+0.261
 Accountability+0.257
imov+0.256
 relapse+0.253
Neg logit contributions
ocious-0.242
itime-0.242
iegel-0.240
 stranded-0.239
uri-0.238
Token *
Token ID1635
Feature activation+6.04e-03

Pos logit contributions
itime+0.381
ocious+0.380
 stranded+0.378
iegel+0.377
uri+0.376
Neg logit contributions
 Slater-0.344
catentry-0.343
 Accountability-0.338
imov-0.336
 relapse-0.331
Tokenuck
Token ID1347
Feature activation+1.13e-01

Pos logit contributions
ocious+0.937
iegel+0.936
itime+0.933
uri+0.923
 stranded+0.912
Neg logit contributions
catentry-1.118
 Slater-1.116
 Accountability-1.105
imov-1.089
 relapse-1.086
Tokenley
Token ID1636
Feature activation+9.27e-01

Pos logit contributions
ocious+0.066
itime+0.066
iegel+0.065
 stranded+0.065
uri+0.065
Neg logit contributions
 Slater-0.085
catentry-0.085
 Accountability-0.084
imov-0.083
 relapse-0.083
TokenNBA
Token ID32470
Feature activation+1.43e+00

Pos logit contributions
ocious+0.587
itime+0.586
iegel+0.581
 stranded+0.577
uri+0.576
Neg logit contributions
 Slater-0.653
catentry-0.652
 Accountability-0.642
imov-0.638
 relapse-0.632
Token<|endoftext|>
Token ID50256
Feature activation+2.46e-01

Pos logit contributions
ocious+0.881
itime+0.879
iegel+0.872
 stranded+0.867
uri+0.863
Neg logit contributions
 Slater-1.029
catentry-1.026
 Accountability-1.011
imov-1.006
 relapse-0.996
TokenTed
Token ID38972
Feature activation-1.21e+00

Pos logit contributions
ocious+0.161
itime+0.161
iegel+0.159
 stranded+0.158
uri+0.158
Neg logit contributions
 Slater-0.169
catentry-0.168
 Accountability-0.166
imov-0.165
 relapse-0.163
Token Nug
Token ID45777
Feature activation-4.04e+00

Pos logit contributions
 Slater+0.739
catentry+0.721
 Accountability+0.712
imov+0.706
 relapse+0.700
Neg logit contributions
ocious-0.882
itime-0.881
iegel-0.875
 stranded-0.868
uri-0.861
Tokenent
Token ID298
Feature activation-1.64e+00

Pos logit contributions
 Slater+2.087
pron+2.036
 Accountability+2.033
isite+2.016
 relapse+2.004
Neg logit contributions
itime-3.285
 stranded-3.225
ocious-3.220
iegel-3.173
abwe-3.156
Token promised
Token ID8072
Feature activation-1.10e-01

Pos logit contributions
 Slater+1.226
catentry+1.212
 Accountability+1.203
imov+1.194
 relapse+1.185
Neg logit contributions
itime-0.975
ocious-0.973
iegel-0.965
 stranded-0.956
uri-0.951
Token Thursday
Token ID3635
Feature activation+8.49e-02

Pos logit contributions
 Slater+0.082
catentry+0.081
 Accountability+0.081
imov+0.080
 relapse+0.079
Neg logit contributions
itime-0.065
ocious-0.065
iegel-0.064
 stranded-0.063
uri-0.063
Token that
Token ID326
Feature activation-4.12e-01

Pos logit contributions
itime+0.053
ocious+0.053
iegel+0.053
 stranded+0.052
uri+0.052
Neg logit contributions
 Slater-0.061
catentry-0.060
 Accountability-0.059
imov-0.059
 relapse-0.059
Token he
Token ID339
Feature activation-1.23e+00

Pos logit contributions
 Slater+0.308
catentry+0.305
 Accountability+0.302
imov+0.300
 relapse+0.298
Neg logit contributions
itime-0.244
ocious-0.244
iegel-0.241
 stranded-0.239
uri-0.239
Token and
Token ID290
Feature activation+1.19e-01

Pos logit contributions
 Slater+0.458
catentry+0.455
 Accountability+0.450
imov+0.448
 relapse+0.444
Neg logit contributions
ocious-0.328
itime-0.328
iegel-0.323
 stranded-0.322
uri-0.320
Token how
Token ID703
Feature activation-5.46e-01

Pos logit contributions
ocious+0.078
itime+0.078
iegel+0.078
 stranded+0.078
uri+0.077
Neg logit contributions
catentry-0.080
 Slater-0.080
imov-0.079
 Accountability-0.078
isite-0.077
Token.
Token ID13
Feature activation+1.95e-01

Pos logit contributions
 Slater+0.375
catentry+0.370
 Accountability+0.367
imov+0.365
 relapse+0.362
Neg logit contributions
ocious-0.359
itime-0.358
iegel-0.353
 stranded-0.350
uri-0.349
Token Take
Token ID7214
Feature activation-1.58e-01

Pos logit contributions
itime+0.134
ocious+0.134
iegel+0.133
 stranded+0.132
uri+0.132
Neg logit contributions
catentry-0.127
 Slater-0.126
imov-0.124
 Accountability-0.124
 relapse-0.123
Token a
Token ID257
Feature activation-1.15e-01

Pos logit contributions
 Slater+0.114
catentry+0.112
 Accountability+0.111
imov+0.111
 relapse+0.110
Neg logit contributions
ocious-0.099
itime-0.098
iegel-0.097
 stranded-0.097
uri-0.096
Token moment
Token ID2589
Feature activation-4.01e+00

Pos logit contributions
 Slater+0.077
catentry+0.076
 Accountability+0.075
imov+0.075
 relapse+0.074
Neg logit contributions
itime-0.078
ocious-0.078
iegel-0.077
 stranded-0.077
uri-0.076
Token to
Token ID284
Feature activation+1.03e-01

Pos logit contributions
 Slater+3.031
 Accountability+2.945
catentry+2.942
imov+2.924
pron+2.924
Neg logit contributions
ocious-2.321
itime-2.303
 stranded-2.258
iegel-2.249
uri-2.236
Token digest
Token ID16274
Feature activation-2.98e-01

Pos logit contributions
 stranded+0.057
itime+0.057
iegel+0.057
uri+0.056
ocious+0.056
Neg logit contributions
catentry-0.082
 Slater-0.081
 Accountability-0.080
imov-0.079
isite-0.079
Token his
Token ID465
Feature activation-3.40e-02

Pos logit contributions
 Slater+0.200
catentry+0.197
 Accountability+0.195
imov+0.194
 relapse+0.193
Neg logit contributions
ocious-0.200
itime-0.200
 stranded-0.197
iegel-0.197
uri-0.196
Token thoughts
Token ID6066
Feature activation+5.79e-01

Pos logit contributions
 Slater+0.022
catentry+0.022
 Accountability+0.022
imov+0.022
isite+0.022
Neg logit contributions
ocious-0.023
itime-0.023
iegel-0.023
 stranded-0.023
uri-0.023
Token:
Token ID25
Feature activation+4.19e-02

Pos logit contributions
itime+0.294
ocious+0.294
iegel+0.293
 stranded+0.291
uri+0.288
Neg logit contributions
catentry-0.480
 Slater-0.478
 Accountability-0.471
imov-0.470
 relapse-0.465
Token,
Token ID11
Feature activation-3.46e-02

Pos logit contributions
 Slater+0.393
catentry+0.389
 Accountability+0.387
imov+0.383
 relapse+0.380
Neg logit contributions
ocious-0.269
itime-0.269
iegel-0.265
 stranded-0.263
uri-0.261
Token fight
Token ID1907
Feature activation-7.68e-01

Pos logit contributions
 Slater+0.025
catentry+0.025
 Accountability+0.025
imov+0.025
 relapse+0.025
Neg logit contributions
ocious-0.021
itime-0.021
iegel-0.021
 stranded-0.021
uri-0.020
Token for
Token ID329
Feature activation-1.16e-01

Pos logit contributions
 Slater+0.602
catentry+0.590
 Accountability+0.589
 relapse+0.583
imov+0.582
Neg logit contributions
ocious-0.431
itime-0.431
iegel-0.425
uri-0.420
 stranded-0.417
Token affordable
Token ID10935
Feature activation-3.21e-01

Pos logit contributions
catentry+0.085
 Slater+0.085
imov+0.083
 Accountability+0.083
 relapse+0.082
Neg logit contributions
ocious-0.070
itime-0.070
 stranded-0.070
iegel-0.070
uri-0.069
Token health
Token ID1535
Feature activation-4.57e-01

Pos logit contributions
 Slater+0.233
catentry+0.231
 Accountability+0.229
imov+0.226
 relapse+0.226
Neg logit contributions
ocious-0.197
itime-0.197
iegel-0.195
uri-0.193
 stranded-0.193
Token care
Token ID1337
Feature activation-3.99e+00

Pos logit contributions
 Slater+0.311
catentry+0.306
 Accountability+0.305
imov+0.301
 relapse+0.300
Neg logit contributions
ocious-0.303
itime-0.302
iegel-0.299
 stranded-0.297
uri-0.296
Token and
Token ID290
Feature activation-5.25e-01

Pos logit contributions
 Slater+2.776
 Accountability+2.742
catentry+2.712
 relapse+2.676
pron+2.642
Neg logit contributions
itime-2.576
ocious-2.564
iegel-2.512
 stranded-2.489
uri-2.482
Token support
Token ID1104
Feature activation-1.30e+00

Pos logit contributions
 Slater+0.386
catentry+0.383
 Accountability+0.379
imov+0.376
 relapse+0.374
Neg logit contributions
ocious-0.317
itime-0.317
iegel-0.314
 stranded-0.312
uri-0.311
Token tax
Token ID1687
Feature activation+1.70e-01

Pos logit contributions
 Slater+0.944
catentry+0.929
 Accountability+0.926
imov+0.915
 relapse+0.909
Neg logit contributions
itime-0.801
ocious-0.799
iegel-0.791
uri-0.783
 stranded-0.780
Token cuts
Token ID6630
Feature activation-1.19e+00

Pos logit contributions
itime+0.106
ocious+0.105
iegel+0.105
 stranded+0.104
uri+0.104
Neg logit contributions
 Slater-0.121
catentry-0.121
 Accountability-0.119
imov-0.119
 relapse-0.117
Token for
Token ID329
Feature activation-1.96e-01

Pos logit contributions
 Slater+0.959
 Accountability+0.941
catentry+0.939
imov+0.929
 relapse+0.923
Neg logit contributions
itime-0.639
ocious-0.634
iegel-0.623
 stranded-0.617
uri-0.615
Token HI
Token ID36210
Feature activation-2.03e+00

Pos logit contributions
 Slater+0.130
catentry+0.130
 Accountability+0.128
imov+0.128
 relapse+0.127
Neg logit contributions
ocious-0.109
itime-0.108
 stranded-0.107
iegel-0.107
uri-0.106
Token,
Token ID11
Feature activation-1.07e+00

Pos logit contributions
 Slater+1.564
catentry+1.539
 Accountability+1.524
imov+1.521
pron+1.504
Neg logit contributions
itime-1.160
ocious-1.155
iegel-1.150
 stranded-1.136
uri-1.128
Token born
Token ID4642
Feature activation+8.70e-01

Pos logit contributions
 Slater+0.784
catentry+0.779
 Accountability+0.769
imov+0.765
 relapse+0.757
Neg logit contributions
itime-0.652
ocious-0.651
iegel-0.645
 stranded-0.641
uri-0.638
Token at
Token ID379
Feature activation-2.11e-01

Pos logit contributions
ocious+0.600
itime+0.596
 stranded+0.593
iegel+0.593
uri+0.588
Neg logit contributions
catentry-0.565
 Slater-0.563
 Accountability-0.553
imov-0.552
 relapse-0.547
Token Gu
Token ID1962
Feature activation-1.13e+00

Pos logit contributions
catentry+0.152
 Slater+0.150
imov+0.149
 Accountability+0.148
 relapse+0.146
Neg logit contributions
ocious-0.131
itime-0.131
 stranded-0.130
iegel-0.130
uri-0.129
Tokenadal
Token ID31682
Feature activation-3.99e+00

Pos logit contributions
 Slater+0.707
catentry+0.703
 Accountability+0.691
imov+0.688
 relapse+0.680
Neg logit contributions
ocious-0.808
itime-0.808
iegel-0.801
 stranded-0.798
uri-0.794
Tokenupe
Token ID48722
Feature activation-1.38e+00

Pos logit contributions
 Slater+2.565
catentry+2.552
imov+2.527
 Accountability+2.520
pron+2.509
Neg logit contributions
ocious-2.699
itime-2.688
iegel-2.686
 stranded-2.684
uri-2.627
Token Valley
Token ID6916
Feature activation-1.45e+00

Pos logit contributions
 Slater+0.910
catentry+0.897
 Accountability+0.888
imov+0.881
pron+0.872
Neg logit contributions
ocious-0.943
itime-0.939
iegel-0.931
 stranded-0.926
uri-0.919
Token Hospital
Token ID9256
Feature activation-9.02e-01

Pos logit contributions
 Slater+0.855
catentry+0.843
 Accountability+0.834
imov+0.825
 relapse+0.816
Neg logit contributions
ocious-1.088
itime-1.085
iegel-1.076
 stranded-1.071
uri-1.065
Token on
Token ID319
Feature activation-1.72e+00

Pos logit contributions
 Slater+0.676
catentry+0.673
 Accountability+0.664
imov+0.661
 relapse+0.655
Neg logit contributions
ocious-0.531
itime-0.531
iegel-0.525
 stranded-0.523
uri-0.520
Token July
Token ID2901
Feature activation+1.25e-01

Pos logit contributions
 Slater+0.954
catentry+0.952
 Accountability+0.931
imov+0.929
 relapse+0.916
Neg logit contributions
ocious-1.349
itime-1.347
iegel-1.338
 stranded-1.335
uri-1.328
Token Gunn
Token ID40837
Feature activation-1.25e+00

Pos logit contributions
 Slater+0.280
catentry+0.278
 Accountability+0.274
imov+0.273
 relapse+0.270
Neg logit contributions
itime-0.256
ocious-0.256
iegel-0.253
 stranded-0.252
uri-0.250
Token's
Token ID338
Feature activation-2.69e-01

Pos logit contributions
 Slater+1.045
catentry+1.039
 Accountability+1.028
imov+1.024
 relapse+1.015
Neg logit contributions
itime-0.628
ocious-0.627
iegel-0.618
 stranded-0.615
uri-0.611
Token GU
Token ID19348
Feature activation-1.64e+00

Pos logit contributions
catentry+0.187
 Slater+0.186
 Accountability+0.183
imov+0.182
isite+0.182
Neg logit contributions
ocious-0.173
itime-0.173
 stranded-0.172
iegel-0.172
uri-0.171
TokenARD
Token ID9795
Feature activation-3.56e+00

Pos logit contributions
 Slater+1.216
catentry+1.208
 Accountability+1.194
imov+1.189
 relapse+1.178
Neg logit contributions
ocious-0.977
itime-0.973
iegel-0.965
 stranded-0.958
uri-0.955
TokenI
Token ID40
Feature activation-1.98e+00

Pos logit contributions
 Slater+2.196
 Accountability+2.138
imov+2.130
 relapse+2.108
catentry+2.104
Neg logit contributions
ocious-2.547
itime-2.546
iegel-2.523
uri-2.514
 stranded-2.497
TokenANS
Token ID15037
Feature activation-3.97e+00

Pos logit contributions
 Slater+1.579
 Accountability+1.547
imov+1.542
catentry+1.517
 relapse+1.507
Neg logit contributions
iegel-1.088
itime-1.084
ocious-1.054
asio-1.041
abwe-1.036
Token OF
Token ID3963
Feature activation-1.68e+00

Pos logit contributions
 Slater+2.408
 Accountability+2.363
catentry+2.354
imov+2.342
 relapse+2.306
Neg logit contributions
itime-2.909
ocious-2.908
iegel-2.890
uri-2.860
asio-2.829
Token THE
Token ID3336
Feature activation-1.40e+00

Pos logit contributions
catentry+1.172
 Slater+1.154
 relapse+1.140
 Accountability+1.136
imov+1.127
Neg logit contributions
 stranded-1.095
uri-1.085
iegel-1.084
itime-1.076
ocious-1.071
Token G
Token ID402
Feature activation-1.21e+00

Pos logit contributions
catentry+0.965
 Slater+0.965
 Accountability+0.945
imov+0.942
 relapse+0.935
Neg logit contributions
ocious-0.908
itime-0.907
iegel-0.901
 stranded-0.900
uri-0.892
TokenAL
Token ID1847
Feature activation-3.69e-01

Pos logit contributions
 Slater+1.131
imov+1.105
catentry+1.100
 Accountability+1.097
pron+1.096
Neg logit contributions
ocious-0.504
itime-0.504
 stranded-0.485
iegel-0.476
uri-0.475
TokenAX
Token ID25922
Feature activation-4.11e-01

Pos logit contributions
 Slater+0.259
 Accountability+0.252
catentry+0.252
imov+0.252
 relapse+0.250
Neg logit contributions
itime-0.235
ocious-0.234
iegel-0.233
uri-0.230
 stranded-0.228
Token then
Token ID788
Feature activation-3.26e-01

Pos logit contributions
ocious+0.060
 stranded+0.060
itime+0.060
iegel+0.059
uri+0.059
Neg logit contributions
catentry-0.069
 Slater-0.069
imov-0.068
 Accountability-0.068
isite-0.067
Token gathered
Token ID9272
Feature activation-7.91e-01

Pos logit contributions
 Slater+0.241
catentry+0.241
 Accountability+0.237
imov+0.236
 relapse+0.234
Neg logit contributions
ocious-0.195
itime-0.194
iegel-0.193
 stranded-0.192
uri-0.191
Token herself
Token ID5223
Feature activation-2.48e-01

Pos logit contributions
 Slater+0.588
catentry+0.584
 Accountability+0.578
imov+0.574
 relapse+0.569
Neg logit contributions
itime-0.471
ocious-0.471
iegel-0.465
 stranded-0.462
uri-0.461
Token for
Token ID329
Feature activation-7.27e-01

Pos logit contributions
 Slater+0.187
catentry+0.186
 Accountability+0.183
imov+0.182
 relapse+0.181
Neg logit contributions
ocious-0.146
itime-0.146
iegel-0.144
 stranded-0.143
uri-0.143
Token a
Token ID257
Feature activation-7.35e-01

Pos logit contributions
catentry+0.504
 Slater+0.504
 Accountability+0.495
imov+0.494
isite+0.488
Neg logit contributions
ocious-0.468
itime-0.466
iegel-0.463
 stranded-0.462
uri-0.458
Token moment
Token ID2589
Feature activation-3.96e+00

Pos logit contributions
catentry+0.488
 Slater+0.487
 Accountability+0.478
imov+0.478
isite+0.470
Neg logit contributions
ocious-0.495
itime-0.494
 stranded-0.491
iegel-0.490
uri-0.487
Token.
Token ID13
Feature activation+7.35e-02

Pos logit contributions
 Slater+2.883
 Accountability+2.808
catentry+2.807
pron+2.782
imov+2.774
Neg logit contributions
itime-2.430
ocious-2.418
iegel-2.381
uri-2.349
 stranded-2.341
Token
Token ID564
Feature activation+9.61e-02

Pos logit contributions
ocious+0.050
itime+0.050
iegel+0.049
 stranded+0.049
uri+0.049
Neg logit contributions
catentry-0.048
 Slater-0.048
imov-0.047
 Accountability-0.047
 relapse-0.047
Token
Token ID250
Feature activation+5.33e-01

Pos logit contributions
ocious+0.062
itime+0.062
iegel+0.062
 stranded+0.062
uri+0.061
Neg logit contributions
 Slater-0.066
catentry-0.066
 Accountability-0.065
imov-0.065
 relapse-0.064
TokenI
Token ID40
Feature activation-1.84e-01

Pos logit contributions
ocious+0.339
itime+0.339
iegel+0.335
 stranded+0.334
uri+0.333
Neg logit contributions
 Slater-0.374
catentry-0.372
 Accountability-0.366
imov-0.365
 relapse-0.361
Token don
Token ID836
Feature activation+1.71e-02

Pos logit contributions
 Slater+0.135
catentry+0.134
 Accountability+0.132
imov+0.132
 relapse+0.130
Neg logit contributions
ocious-0.111
itime-0.111
iegel-0.110
 stranded-0.109
uri-0.109
Token when
Token ID618
Feature activation+1.41e-01

Pos logit contributions
ocious+0.110
itime+0.110
iegel+0.108
 stranded+0.108
uri+0.107
Neg logit contributions
 Slater-0.121
catentry-0.120
 Accountability-0.119
imov-0.118
 relapse-0.117
Token I
Token ID314
Feature activation+1.86e-01

Pos logit contributions
ocious+0.091
itime+0.091
iegel+0.090
 stranded+0.089
uri+0.089
Neg logit contributions
 Slater-0.099
catentry-0.098
 Accountability-0.097
imov-0.096
 relapse-0.095
Token fed
Token ID11672
Feature activation+7.14e-01

Pos logit contributions
 stranded+0.109
itime+0.109
ocious+0.109
iegel+0.108
uri+0.107
Neg logit contributions
catentry-0.140
 Slater-0.139
 Accountability-0.137
imov-0.136
isite-0.135
Token the
Token ID262
Feature activation-3.42e-02

Pos logit contributions
ocious+0.459
itime+0.459
iegel+0.454
 stranded+0.452
uri+0.450
Neg logit contributions
 Slater-0.498
catentry-0.494
 Accountability-0.488
imov-0.485
 relapse-0.481
Token Dev
Token ID6245
Feature activation-1.11e+00

Pos logit contributions
 Slater+0.023
catentry+0.023
 Accountability+0.023
imov+0.023
 relapse+0.023
Neg logit contributions
ocious-0.022
itime-0.022
iegel-0.022
 stranded-0.022
uri-0.022
Tokenial
Token ID498
Feature activation-3.96e+00

Pos logit contributions
 Slater+0.796
catentry+0.790
 Accountability+0.779
imov+0.777
pron+0.768
Neg logit contributions
ocious-0.693
itime-0.692
iegel-0.685
 stranded-0.683
uri-0.677
Tokenets
Token ID1039
Feature activation-4.75e-01

Pos logit contributions
 Slater+2.831
 Accountability+2.765
imov+2.765
catentry+2.760
pron+2.759
Neg logit contributions
ocious-2.417
itime-2.407
 stranded-2.375
iegel-2.367
uri-2.334
Token bass
Token ID12702
Feature activation+2.58e-01

Pos logit contributions
 Slater+0.345
catentry+0.341
 Accountability+0.338
imov+0.337
 relapse+0.334
Neg logit contributions
itime-0.292
ocious-0.292
iegel-0.288
 stranded-0.286
uri-0.285
Token-
Token ID12
Feature activation+1.40e-01

Pos logit contributions
ocious+0.164
itime+0.164
iegel+0.163
 stranded+0.163
uri+0.162
Neg logit contributions
catentry-0.180
 Slater-0.180
 Accountability-0.177
imov-0.176
 relapse-0.174
Tokenheavy
Token ID23701
Feature activation+9.20e-01

Pos logit contributions
ocious+0.090
itime+0.090
iegel+0.089
 stranded+0.089
uri+0.089
Neg logit contributions
 Slater-0.097
catentry-0.097
 Accountability-0.095
imov-0.095
 relapse-0.094
Token music
Token ID2647
Feature activation-9.19e-01

Pos logit contributions
ocious+0.609
itime+0.609
 stranded+0.606
iegel+0.604
uri+0.601
Neg logit contributions
catentry-0.620
 Slater-0.616
 Accountability-0.607
imov-0.607
 relapse-0.597
Token Donald
Token ID3759
Feature activation-1.02e+00

Pos logit contributions
ocious+0.576
itime+0.575
iegel+0.570
 stranded+0.567
uri+0.565
Neg logit contributions
catentry-0.846
 Slater-0.843
 Accountability-0.829
imov-0.828
 relapse-0.822
Token Trump
Token ID1301
Feature activation-7.46e-02

Pos logit contributions
 Slater+0.686
catentry+0.676
 Accountability+0.669
imov+0.667
 relapse+0.657
Neg logit contributions
itime-0.689
ocious-0.688
iegel-0.681
 stranded-0.678
uri-0.673
Token.
Token ID13
Feature activation-7.22e-02

Pos logit contributions
 Slater+0.061
catentry+0.060
 Accountability+0.060
imov+0.059
 relapse+0.059
Neg logit contributions
itime-0.039
ocious-0.039
iegel-0.038
 stranded-0.038
uri-0.038
Token
Token ID198
Feature activation-1.66e-01

Pos logit contributions
catentry+0.047
 Slater+0.047
imov+0.046
 Accountability+0.046
 relapse+0.046
Neg logit contributions
ocious-0.049
itime-0.049
iegel-0.049
uri-0.048
 stranded-0.048
Token
Token ID198
Feature activation-1.72e-02

Pos logit contributions
catentry+0.112
 Slater+0.111
imov+0.109
 Accountability+0.109
 relapse+0.108
Neg logit contributions
ocious-0.110
uri-0.110
iegel-0.110
itime-0.110
 stranded-0.109
TokenWATCH
Token ID35192
Feature activation-3.95e+00

Pos logit contributions
 Slater+0.013
catentry+0.013
 Accountability+0.013
imov+0.013
 relapse+0.013
Neg logit contributions
ocious-0.010
itime-0.010
iegel-0.010
 stranded-0.010
uri-0.010
Token:
Token ID25
Feature activation+5.10e-02

Pos logit contributions
 Slater+3.270
catentry+3.165
 Accountability+3.148
imov+3.140
 relapse+3.102
Neg logit contributions
itime-2.078
ocious-2.068
iegel-2.011
 stranded-2.011
uri-1.990
Token
Token ID198
Feature activation+6.20e-02

Pos logit contributions
ocious+0.035
itime+0.035
iegel+0.035
 stranded+0.035
uri+0.035
Neg logit contributions
catentry-0.033
 Slater-0.033
 Accountability-0.032
imov-0.032
 relapse-0.032
Token
Token ID198
Feature activation+1.50e-01

Pos logit contributions
uri+0.040
ocious+0.040
iegel+0.040
itime+0.040
 stranded+0.040
Neg logit contributions
catentry-0.042
 Slater-0.042
imov-0.041
 Accountability-0.041
 relapse-0.041
TokenT
Token ID51
Feature activation-3.06e-01

Pos logit contributions
ocious+0.087
itime+0.087
iegel+0.086
 stranded+0.086
uri+0.086
Neg logit contributions
 Slater-0.114
catentry-0.113
 Accountability-0.112
imov-0.111
 relapse-0.110
Tokenucker
Token ID12603
Feature activation-7.19e-01

Pos logit contributions
 Slater+0.315
catentry+0.312
 Accountability+0.309
imov+0.309
 relapse+0.306
Neg logit contributions
itime-0.095
ocious-0.095
 stranded-0.093
iegel-0.092
uri-0.090
Token evidence
Token ID2370
Feature activation-5.80e-01

Pos logit contributions
 stranded+1.114
ocious+1.112
iegel+1.109
itime+1.104
uri+1.100
Neg logit contributions
catentry-1.030
 Slater-1.017
imov-1.003
 Accountability-0.987
 relapse-0.985
Token.
Token ID13
Feature activation-5.42e-01

Pos logit contributions
 Slater+0.472
 Accountability+0.463
catentry+0.462
imov+0.458
 relapse+0.456
Neg logit contributions
itime-0.310
ocious-0.308
iegel-0.303
 stranded-0.299
uri-0.299
Token
Token ID198
Feature activation-3.30e-02

Pos logit contributions
catentry+0.364
 Slater+0.361
imov+0.355
 Accountability+0.353
 relapse+0.352
Neg logit contributions
itime-0.360
ocious-0.360
iegel-0.357
uri-0.356
 stranded-0.355
Token
Token ID198
Feature activation+1.19e-01

Pos logit contributions
catentry+0.023
 Slater+0.022
imov+0.022
 Accountability+0.022
isite+0.022
Neg logit contributions
uri-0.022
iegel-0.021
ocious-0.021
itime-0.021
 stranded-0.021
TokenWe
Token ID1135
Feature activation+1.88e-01

Pos logit contributions
ocious+0.075
itime+0.075
iegel+0.075
uri+0.074
 stranded+0.074
Neg logit contributions
 Slater-0.084
catentry-0.084
 Accountability-0.083
imov-0.082
 relapse-0.082
Token rate
Token ID2494
Feature activation-3.94e+00

Pos logit contributions
iegel+0.129
itime+0.129
 stranded+0.129
ocious+0.128
uri+0.128
Neg logit contributions
catentry-0.122
 Slater-0.122
 Accountability-0.119
isite-0.118
imov-0.117
Token this
Token ID428
Feature activation+1.05e+00

Pos logit contributions
 Slater+2.541
catentry+2.481
 Accountability+2.477
imov+2.438
 relapse+2.427
Neg logit contributions
itime-2.753
ocious-2.749
iegel-2.694
 stranded-2.688
uri-2.674
Token Pants
Token ID41689
Feature activation-7.33e-01

Pos logit contributions
iegel+0.759
 stranded+0.756
itime+0.754
ocious+0.753
uri+0.751
Neg logit contributions
catentry-0.641
 Slater-0.627
imov-0.623
isite-0.618
 Accountability-0.611
Token on
Token ID319
Feature activation+2.26e-01

Pos logit contributions
 Slater+0.565
catentry+0.559
 Accountability+0.555
imov+0.554
 relapse+0.549
Neg logit contributions
ocious-0.418
itime-0.414
iegel-0.408
 stranded-0.406
uri-0.402
Token Fire
Token ID3764
Feature activation+1.68e+00

Pos logit contributions
ocious+0.107
itime+0.106
iegel+0.106
 stranded+0.106
uri+0.104
Neg logit contributions
catentry-0.195
 Slater-0.194
imov-0.191
 relapse-0.190
 Accountability-0.190
Token.
Token ID13
Feature activation-5.92e-01

Pos logit contributions
 stranded+0.842
iegel+0.840
ocious+0.840
itime+0.839
uri+0.834
Neg logit contributions
catentry-1.402
 Slater-1.394
imov-1.371
isite-1.364
 Accountability-1.359
TokenPA
Token ID4537
Feature activation-8.26e-01

Pos logit contributions
 Slater+0.324
catentry+0.324
imov+0.317
 Accountability+0.315
 relapse+0.313
Neg logit contributions
itime-0.296
ocious-0.295
iegel-0.294
 stranded-0.293
uri-0.291
Token agreed
Token ID4987
Feature activation+3.08e-01

Pos logit contributions
 Slater+0.638
catentry+0.636
 Accountability+0.626
imov+0.625
 relapse+0.619
Neg logit contributions
ocious-0.467
itime-0.467
iegel-0.463
 stranded-0.461
uri-0.458
Token to
Token ID284
Feature activation-7.36e-01

Pos logit contributions
itime+0.165
ocious+0.165
iegel+0.163
 stranded+0.162
uri+0.161
Neg logit contributions
 Slater-0.247
catentry-0.246
 Accountability-0.242
imov-0.242
 relapse-0.240
Token a
Token ID257
Feature activation+6.61e-01

Pos logit contributions
 Slater+0.538
catentry+0.536
 Accountability+0.527
imov+0.526
 relapse+0.521
Neg logit contributions
itime-0.446
ocious-0.445
iegel-0.441
 stranded-0.440
uri-0.438
Token Collective
Token ID29128
Feature activation-2.17e+00

Pos logit contributions
itime+0.410
 stranded+0.408
ocious+0.407
iegel+0.406
uri+0.405
Neg logit contributions
catentry-0.471
 Slater-0.470
imov-0.463
 Accountability-0.461
isite-0.459
Token Barg
Token ID50062
Feature activation-3.93e+00

Pos logit contributions
 Accountability+1.579
 Slater+1.559
catentry+1.557
imov+1.511
pron+1.497
Neg logit contributions
itime-1.356
ocious-1.351
iegel-1.332
uri-1.315
 stranded-1.314
Tokenaining
Token ID1397
Feature activation-2.46e+00

Pos logit contributions
 Slater+2.764
 Accountability+2.762
isite+2.752
catentry+2.735
imov+2.714
Neg logit contributions
 stranded-2.471
itime-2.445
ocious-2.416
asio-2.401
iegel-2.391
Token Agreement
Token ID12729
Feature activation+7.96e-01

Pos logit contributions
 Accountability+1.963
 Slater+1.922
catentry+1.906
 relapse+1.886
imov+1.878
Neg logit contributions
itime-1.368
ocious-1.346
iegel-1.319
 stranded-1.319
abwe-1.305
Token (
Token ID357
Feature activation-2.45e+00

Pos logit contributions
ocious+0.473
iegel+0.472
itime+0.472
 stranded+0.469
uri+0.466
Neg logit contributions
catentry-0.587
 Slater-0.587
pron-0.574
imov-0.574
 Accountability-0.573
TokenC
Token ID34
Feature activation-1.70e+00

Pos logit contributions
catentry+1.525
 Slater+1.515
 Accountability+1.493
imov+1.489
pron+1.470
Neg logit contributions
ocious-1.755
itime-1.752
iegel-1.733
 stranded-1.732
uri-1.710
TokenBA
Token ID4339
Feature activation+1.18e+00

Pos logit contributions
 Slater+1.194
catentry+1.187
imov+1.182
 Accountability+1.176
 relapse+1.154
Neg logit contributions
ocious-1.083
itime-1.081
 stranded-1.074
iegel-1.054
uri-1.047
TokenB
Token ID33
Feature activation-1.93e+00

Pos logit contributions
 Slater+0.534
catentry+0.532
 Accountability+0.524
imov+0.521
 relapse+0.515
Neg logit contributions
ocious-0.566
itime-0.565
iegel-0.560
 stranded-0.558
uri-0.556
TokenV
Token ID53
Feature activation+1.92e+00

Pos logit contributions
 Slater+0.928
catentry+0.904
imov+0.894
 Accountability+0.892
 relapse+0.881
Neg logit contributions
ocious-1.663
itime-1.660
 stranded-1.643
iegel-1.643
uri-1.628
Token),
Token ID828
Feature activation-1.34e-01

Pos logit contributions
itime+1.075
ocious+1.073
 stranded+1.066
iegel+1.063
uri+1.053
Neg logit contributions
catentry-1.496
 Slater-1.479
 Accountability-1.462
imov-1.457
pron-1.439
Token an
Token ID281
Feature activation-3.58e-01

Pos logit contributions
catentry+0.090
 Slater+0.089
imov+0.088
 Accountability+0.088
isite+0.087
Neg logit contributions
 stranded-0.089
ocious-0.089
iegel-0.088
itime-0.088
uri-0.087
Token over
Token ID625
Feature activation-6.33e-01

Pos logit contributions
 Slater+0.238
catentry+0.237
imov+0.232
 Accountability+0.232
pron+0.228
Neg logit contributions
ocious-0.243
iegel-0.240
 stranded-0.240
itime-0.240
uri-0.238
Tokengrowth
Token ID27922
Feature activation-3.92e+00

Pos logit contributions
 Slater+0.316
catentry+0.316
 Accountability+0.308
imov+0.305
 relapse+0.300
Neg logit contributions
ocious-0.532
itime-0.530
iegel-0.527
 stranded-0.527
uri-0.524
Token of
Token ID286
Feature activation-3.50e-01

Pos logit contributions
 Slater+2.609
catentry+2.569
 relapse+2.562
 Accountability+2.538
imov+2.530
Neg logit contributions
itime-2.634
ocious-2.600
iegel-2.574
uri-2.552
 stranded-2.535
Token an
Token ID281
Feature activation-2.73e-02

Pos logit contributions
catentry+0.231
 Slater+0.230
 Accountability+0.226
imov+0.226
isite+0.222
Neg logit contributions
ocious-0.238
itime-0.237
 stranded-0.237
iegel-0.236
uri-0.234
Tokenaer
Token ID25534
Feature activation-3.25e-01

Pos logit contributions
 Slater+0.018
catentry+0.018
 Accountability+0.018
imov+0.018
 relapse+0.018
Neg logit contributions
ocious-0.018
itime-0.018
 stranded-0.018
iegel-0.018
uri-0.018
Tokenobic
Token ID20803
Feature activation-4.38e-01

Pos logit contributions
 Slater+0.242
catentry+0.239
imov+0.237
 Accountability+0.236
pron+0.234
Neg logit contributions
ocious-0.193
itime-0.192
iegel-0.191
 stranded-0.190
uri-0.188
Token pathogens
Token ID38366
Feature activation-8.58e-01

Pos logit contributions
catentry+0.267
 Slater+0.267
 Accountability+0.262
imov+0.261
 relapse+0.257
Neg logit contributions
ocious-0.318
itime-0.318
 stranded-0.316
iegel-0.316
uri-0.314
Token George
Token ID4502
Feature activation+3.47e-01

Pos logit contributions
 Slater+0.925
catentry+0.915
 Accountability+0.906
imov+0.900
 relapse+0.891
Neg logit contributions
itime-0.837
ocious-0.837
iegel-0.828
 stranded-0.824
uri-0.819
Token-
Token ID12
Feature activation-9.42e-01

Pos logit contributions
ocious+0.214
itime+0.214
iegel+0.212
 stranded+0.211
uri+0.210
Neg logit contributions
catentry-0.249
 Slater-0.249
 Accountability-0.245
imov-0.244
 relapse-0.241
Token8
Token ID23
Feature activation+6.35e-01

Pos logit contributions
 Slater+0.612
catentry+0.608
 Accountability+0.599
imov+0.596
 relapse+0.589
Neg logit contributions
ocious-0.650
itime-0.649
iegel-0.643
 stranded-0.641
uri-0.637
Token Track
Token ID17762
Feature activation+1.30e-01

Pos logit contributions
ocious+0.425
itime+0.425
iegel+0.422
 stranded+0.420
uri+0.418
Neg logit contributions
catentry-0.424
 Slater-0.423
imov-0.414
 Accountability-0.413
 relapse-0.409
Token Porn
Token ID29629
Feature activation-4.91e-02

Pos logit contributions
ocious+0.086
itime+0.086
iegel+0.085
 stranded+0.084
uri+0.084
Neg logit contributions
catentry-0.088
 Slater-0.088
imov-0.086
 Accountability-0.086
 relapse-0.085
Token Pt
Token ID34352
Feature activation-3.91e+00

Pos logit contributions
 Slater+0.036
catentry+0.036
 Accountability+0.036
imov+0.036
 relapse+0.035
Neg logit contributions
ocious-0.029
itime-0.029
iegel-0.029
 stranded-0.029
uri-0.029
Token 4
Token ID604
Feature activation+5.42e-01

Pos logit contributions
 Slater+2.554
 Accountability+2.522
catentry+2.464
imov+2.463
isite+2.456
Neg logit contributions
itime-2.668
ocious-2.645
iegel-2.631
 stranded-2.631
uri-2.567
Token-
Token ID12
Feature activation-2.36e-01

Pos logit contributions
ocious+0.282
itime+0.282
iegel+0.278
 stranded+0.277
uri+0.275
Neg logit contributions
 Slater-0.444
catentry-0.441
 Accountability-0.437
imov-0.435
 relapse-0.432
TokenOtt
Token ID49092
Feature activation-2.34e-01

Pos logit contributions
 Slater+0.152
catentry+0.151
 Accountability+0.149
imov+0.148
 relapse+0.147
Neg logit contributions
ocious-0.163
itime-0.163
iegel-0.162
 stranded-0.161
uri-0.161
Tokenos
Token ID418
Feature activation-2.49e+00

Pos logit contributions
 Slater+0.185
catentry+0.185
 Accountability+0.182
imov+0.181
 relapse+0.180
Neg logit contributions
ocious-0.128
itime-0.128
iegel-0.127
 stranded-0.126
uri-0.126
Token Old
Token ID5706
Feature activation-6.30e-01

Pos logit contributions
 Slater+1.692
catentry+1.667
 Accountability+1.659
imov+1.652
pron+1.630
Neg logit contributions
itime-1.644
ocious-1.640
iegel-1.625
 stranded-1.621
uri-1.604
Token things
Token ID1243
Feature activation+1.60e-01

Pos logit contributions
ocious+1.017
 stranded+1.016
iegel+1.015
itime+1.014
uri+1.011
Neg logit contributions
catentry-1.107
 Slater-1.094
imov-1.079
 Accountability-1.074
isite-1.069
Token like
Token ID588
Feature activation-2.34e-01

Pos logit contributions
ocious+0.106
itime+0.106
iegel+0.106
 stranded+0.105
uri+0.105
Neg logit contributions
catentry-0.107
 Slater-0.107
 Accountability-0.105
imov-0.105
isite-0.103
Token
Token ID564
Feature activation-9.43e-01

Pos logit contributions
catentry+0.158
 Slater+0.158
imov+0.155
 Accountability+0.155
isite+0.153
Neg logit contributions
itime-0.155
ocious-0.155
iegel-0.153
 stranded-0.153
uri-0.152
Token
Token ID250
Feature activation-4.88e-01

Pos logit contributions
 Slater+0.703
catentry+0.699
 Accountability+0.690
imov+0.687
 relapse+0.681
Neg logit contributions
ocious-0.559
itime-0.559
iegel-0.553
 stranded-0.551
uri-0.548
Tokenm
Token ID76
Feature activation-1.91e-01

Pos logit contributions
 Slater+0.335
catentry+0.333
 Accountability+0.328
imov+0.326
 relapse+0.323
Neg logit contributions
ocious-0.318
itime-0.318
iegel-0.315
 stranded-0.314
uri-0.313
Tokeneth
Token ID2788
Feature activation+3.12e+00

Pos logit contributions
 Slater+0.149
catentry+0.148
 Accountability+0.146
imov+0.146
 relapse+0.145
Neg logit contributions
ocious-0.106
itime-0.106
 stranded-0.105
iegel-0.105
uri-0.103
Token cookies
Token ID14746
Feature activation-1.04e+00

Pos logit contributions
ocious+1.880
iegel+1.877
itime+1.873
uri+1.863
 stranded+1.856
Neg logit contributions
catentry-2.282
 Slater-2.268
imov-2.244
 Accountability-2.241
isite-2.201
Token
Token ID447
Feature activation-8.20e-01

Pos logit contributions
 Slater+0.758
catentry+0.753
 Accountability+0.744
imov+0.740
 relapse+0.734
Neg logit contributions
itime-0.641
ocious-0.641
iegel-0.634
 stranded-0.630
uri-0.626
Token
Token ID251
Feature activation-1.21e+00

Pos logit contributions
 Slater+0.495
catentry+0.493
 Accountability+0.488
imov+0.485
 relapse+0.477
Neg logit contributions
itime-0.602
ocious-0.601
iegel-0.596
 stranded-0.592
uri-0.590
Token appear
Token ID1656
Feature activation+4.66e-01

Pos logit contributions
 Slater+0.880
catentry+0.875
 Accountability+0.863
imov+0.860
 relapse+0.854
Neg logit contributions
itime-0.740
ocious-0.739
iegel-0.730
 stranded-0.726
uri-0.722
Token as
Token ID355
Feature activation-1.16e-02

Pos logit contributions
ocious+0.274
itime+0.273
 stranded+0.271
iegel+0.271
uri+0.269
Neg logit contributions
catentry-0.350
 Slater-0.349
 Accountability-0.344
imov-0.343
 relapse-0.339
Token M
Token ID337
Feature activation+9.20e-01

Pos logit contributions
itime+0.323
ocious+0.322
iegel+0.319
 stranded+0.319
uri+0.316
Neg logit contributions
 Slater-0.423
catentry-0.422
 Accountability-0.416
imov-0.415
 relapse-0.410
TokenATE
Token ID6158
Feature activation-1.23e-01

Pos logit contributions
itime+0.430
ocious+0.430
iegel+0.424
 stranded+0.423
uri+0.420
Neg logit contributions
 Slater-0.801
catentry-0.798
 Accountability-0.789
imov-0.786
 relapse-0.779
Token 16
Token ID1467
Feature activation+1.18e+00

Pos logit contributions
 Slater+0.091
catentry+0.090
 Accountability+0.089
imov+0.089
 relapse+0.088
Neg logit contributions
itime-0.074
ocious-0.074
iegel-0.074
 stranded-0.073
uri-0.073
Token.
Token ID13
Feature activation+7.02e-01

Pos logit contributions
ocious+0.514
itime+0.513
iegel+0.506
 stranded+0.503
uri+0.499
Neg logit contributions
 Slater-1.068
catentry-1.064
 Accountability-1.052
imov-1.049
 relapse-1.041
Token04
Token ID3023
Feature activation+2.91e-01

Pos logit contributions
ocious+0.543
itime+0.542
 stranded+0.540
uri+0.537
iegel+0.535
Neg logit contributions
 Slater-0.393
catentry-0.385
imov-0.384
 relapse-0.377
isite-0.372
Token+
Token ID10
Feature activation+3.22e+00

Pos logit contributions
itime+0.179
ocious+0.179
iegel+0.177
 stranded+0.177
uri+0.176
Neg logit contributions
 Slater-0.210
catentry-0.210
 Accountability-0.206
imov-0.206
 relapse-0.203
Token at
Token ID379
Feature activation-5.23e-02

Pos logit contributions
 stranded+1.849
itime+1.830
ocious+1.830
uri+1.817
iegel+1.807
Neg logit contributions
catentry-2.409
 Slater-2.408
 Accountability-2.398
imov-2.388
 relapse-2.336
Token all
Token ID477
Feature activation+3.49e-01

Pos logit contributions
 Slater+0.036
catentry+0.036
 Accountability+0.035
imov+0.035
 relapse+0.035
Neg logit contributions
itime-0.034
ocious-0.034
iegel-0.034
 stranded-0.034
uri-0.033
Token,
Token ID11
Feature activation+5.04e-01

Pos logit contributions
itime+0.207
ocious+0.206
iegel+0.205
 stranded+0.205
uri+0.204
Neg logit contributions
 Slater-0.259
catentry-0.258
 Accountability-0.255
imov-0.255
isite-0.251
Token I
Token ID314
Feature activation+1.81e-01

Pos logit contributions
itime+0.306
ocious+0.305
 stranded+0.304
iegel+0.303
uri+0.301
Neg logit contributions
catentry-0.367
 Slater-0.367
 Accountability-0.361
imov-0.360
pron-0.356
Token
Token ID447
Feature activation+2.10e-01

Pos logit contributions
itime+0.103
 stranded+0.103
ocious+0.102
iegel+0.102
uri+0.102
Neg logit contributions
catentry-0.139
 Slater-0.138
 Accountability-0.136
imov-0.135
isite-0.134
Token the
Token ID262
Feature activation-4.94e-01

Pos logit contributions
catentry+0.337
 Slater+0.336
 Accountability+0.330
imov+0.329
isite+0.327
Neg logit contributions
ocious-0.295
 stranded-0.293
itime-0.293
iegel-0.291
uri-0.290
Token season
Token ID1622
Feature activation-1.73e-01

Pos logit contributions
catentry+0.217
 Slater+0.217
 Accountability+0.211
imov+0.209
isite+0.206
Neg logit contributions
ocious-0.443
itime-0.443
 stranded-0.441
iegel-0.439
uri-0.438
Token.
Token ID13
Feature activation-1.13e-01

Pos logit contributions
 Slater+0.141
catentry+0.141
 Accountability+0.139
imov+0.139
 relapse+0.137
Neg logit contributions
ocious-0.091
itime-0.091
iegel-0.090
 stranded-0.089
uri-0.089
Token The
Token ID383
Feature activation+2.74e-01

Pos logit contributions
catentry+0.079
 Slater+0.077
imov+0.076
 relapse+0.076
isite+0.076
Neg logit contributions
ocious-0.073
itime-0.072
 stranded-0.072
iegel-0.072
uri-0.072
Token score
Token ID4776
Feature activation+6.18e-01

Pos logit contributions
ocious+0.179
itime+0.178
 stranded+0.177
iegel+0.176
uri+0.175
Neg logit contributions
catentry-0.189
 Slater-0.188
 Accountability-0.185
imov-0.184
isite-0.182
Tokenless
Token ID1203
Feature activation+3.32e+00

Pos logit contributions
ocious+0.353
itime+0.352
iegel+0.350
 stranded+0.349
uri+0.347
Neg logit contributions
 Slater-0.474
catentry-0.474
 Accountability-0.467
imov-0.465
 relapse-0.459
Token first
Token ID717
Feature activation-6.00e-01

Pos logit contributions
 stranded+2.052
ocious+1.992
uri+1.990
iegel+1.979
itime+1.967
Neg logit contributions
catentry-2.495
 Accountability-2.378
 Slater-2.355
isite-2.350
imov-2.348
Token period
Token ID2278
Feature activation+3.32e+00

Pos logit contributions
 Slater+0.361
catentry+0.360
 Accountability+0.354
imov+0.352
 relapse+0.347
Neg logit contributions
ocious-0.442
itime-0.441
iegel-0.438
 stranded-0.437
uri-0.435
Token was
Token ID373
Feature activation+4.83e-01

Pos logit contributions
 stranded+1.899
uri+1.790
itime+1.765
iegel+1.765
ocious+1.762
Neg logit contributions
catentry-2.652
imov-2.547
isite-2.545
 Slater-2.522
 Accountability-2.509
Token quite
Token ID2407
Feature activation-3.61e-01

Pos logit contributions
ocious+0.301
 stranded+0.300
itime+0.299
iegel+0.297
uri+0.295
Neg logit contributions
catentry-0.346
 Slater-0.343
 Accountability-0.337
imov-0.337
isite-0.333
Token an
Token ID281
Feature activation-9.55e-02

Pos logit contributions
 Slater+0.259
catentry+0.257
 Accountability+0.254
imov+0.253
 relapse+0.250
Neg logit contributions
itime-0.225
ocious-0.225
iegel-0.222
 stranded-0.221
uri-0.220
Token to
Token ID284
Feature activation-2.45e-02

Pos logit contributions
iegel+0.952
ocious+0.948
 stranded+0.948
uri+0.948
itime+0.945
Neg logit contributions
catentry-1.731
 Slater-1.714
imov-1.699
isite-1.697
 relapse-1.695
Token the
Token ID262
Feature activation-2.69e-01

Pos logit contributions
 Slater+0.018
catentry+0.018
imov+0.017
 Accountability+0.017
 relapse+0.017
Neg logit contributions
itime-0.015
ocious-0.015
iegel-0.015
 stranded-0.015
uri-0.015
Token report
Token ID989
Feature activation-6.20e-01

Pos logit contributions
 Slater+0.192
catentry+0.191
imov+0.188
 Accountability+0.187
 relapse+0.186
Neg logit contributions
itime-0.168
ocious-0.168
iegel-0.167
 stranded-0.166
uri-0.165
Token from
Token ID422
Feature activation+8.80e-01

Pos logit contributions
 Slater+0.463
catentry+0.462
 Accountability+0.454
imov+0.453
 relapse+0.450
Neg logit contributions
ocious-0.366
itime-0.366
iegel-0.362
 stranded-0.361
uri-0.359
Token Xin
Token ID25426
Feature activation+1.48e+00

Pos logit contributions
itime+0.537
ocious+0.536
iegel+0.531
 stranded+0.530
uri+0.524
Neg logit contributions
catentry-0.642
 Slater-0.635
imov-0.625
 relapse-0.620
 Accountability-0.618
Tokenhua
Token ID33061
Feature activation+3.20e+00

Pos logit contributions
itime+0.501
ocious+0.496
iegel+0.487
 stranded+0.482
uri+0.480
Neg logit contributions
 Slater-1.480
catentry-1.478
 Accountability-1.458
imov-1.457
 relapse-1.447
Token.
Token ID764
Feature activation-1.20e+00

Pos logit contributions
itime+1.885
ocious+1.874
iegel+1.870
 stranded+1.842
uri+1.838
Neg logit contributions
catentry-2.400
 Slater-2.365
imov-2.339
 relapse-2.324
 Accountability-2.315
Token
Token ID198
Feature activation-7.66e-01

Pos logit contributions
 Slater+0.775
catentry+0.769
 Accountability+0.759
imov+0.755
 relapse+0.746
Neg logit contributions
ocious-0.832
itime-0.831
iegel-0.824
 stranded-0.821
uri-0.816
Token
Token ID198
Feature activation-4.93e-01

Pos logit contributions
catentry+0.506
 Slater+0.504
 Accountability+0.494
imov+0.493
 relapse+0.490
Neg logit contributions
ocious-0.516
itime-0.514
iegel-0.513
uri-0.513
 stranded-0.510
TokenThe
Token ID464
Feature activation-1.97e-01

Pos logit contributions
 Slater+0.353
catentry+0.351
 Accountability+0.346
imov+0.344
 relapse+0.342
Neg logit contributions
ocious-0.307
itime-0.307
iegel-0.303
 stranded-0.302
uri-0.301
Token code
Token ID2438
Feature activation+5.88e-02

Pos logit contributions
 Slater+0.139
catentry+0.139
 Accountability+0.136
imov+0.136
 relapse+0.135
Neg logit contributions
ocious-0.124
itime-0.124
iegel-0.123
 stranded-0.123
uri-0.122
Token extra
Token ID3131
Feature activation-2.90e-01

Pos logit contributions
 Slater+0.247
catentry+0.245
imov+0.242
 Accountability+0.241
pron+0.239
Neg logit contributions
ocious-0.165
iegel-0.164
 stranded-0.164
itime-0.163
uri-0.163
Token level
Token ID1241
Feature activation-1.78e-02

Pos logit contributions
 Slater+0.229
catentry+0.228
imov+0.225
 Accountability+0.225
 relapse+0.221
Neg logit contributions
ocious-0.158
 stranded-0.157
itime-0.157
uri-0.156
iegel-0.156
Token of
Token ID286
Feature activation-1.53e-01

Pos logit contributions
catentry+0.013
 Slater+0.013
 Accountability+0.013
imov+0.013
isite+0.013
Neg logit contributions
iegel-0.010
uri-0.010
ocious-0.010
 stranded-0.010
itime-0.010
Token safety
Token ID3747
Feature activation-7.30e-01

Pos logit contributions
 Slater+0.093
catentry+0.093
imov+0.092
isite+0.091
 Accountability+0.091
Neg logit contributions
 stranded-0.110
ocious-0.109
iegel-0.109
itime-0.109
uri-0.109
Token!
Token ID0
Feature activation-3.57e-01

Pos logit contributions
 Slater+0.583
catentry+0.575
 Accountability+0.574
imov+0.567
pron+0.561
Neg logit contributions
ocious-0.399
itime-0.397
iegel-0.393
 stranded-0.390
uri-0.386
Token
Token ID851
Feature activation+3.68e+00

Pos logit contributions
 Slater+0.270
catentry+0.268
 Accountability+0.265
imov+0.263
 relapse+0.261
Neg logit contributions
ocious-0.209
itime-0.209
iegel-0.207
 stranded-0.206
uri-0.204
Token Donald
Token ID3759
Feature activation+2.11e-01

Pos logit contributions
ocious+1.463
itime+1.434
 stranded+1.432
uri+1.427
iegel+1.392
Neg logit contributions
catentry-3.460
 Slater-3.425
imov-3.410
isite-3.377
 Accountability-3.375
Token J
Token ID449
Feature activation-3.88e-01

Pos logit contributions
ocious+0.123
itime+0.123
iegel+0.122
 stranded+0.122
uri+0.121
Neg logit contributions
catentry-0.158
 Slater-0.158
 Accountability-0.155
imov-0.155
 relapse-0.153
Token.
Token ID13
Feature activation-6.15e-01

Pos logit contributions
 Slater+0.352
catentry+0.350
 Accountability+0.346
imov+0.345
 relapse+0.342
Neg logit contributions
ocious-0.168
itime-0.168
iegel-0.165
 stranded-0.164
uri-0.163
Token Trump
Token ID1301
Feature activation+9.19e-01

Pos logit contributions
catentry+0.435
 Slater+0.428
imov+0.425
isite+0.424
 Accountability+0.424
Neg logit contributions
ocious-0.384
itime-0.383
iegel-0.382
uri-0.382
 stranded-0.380
Token (@
Token ID4275
Feature activation-5.55e-01

Pos logit contributions
ocious+0.693
itime+0.692
 stranded+0.690
iegel+0.688
uri+0.686
Neg logit contributions
catentry-0.531
 Slater-0.529
imov-0.520
 Accountability-0.515
 relapse-0.514
Token the
Token ID262
Feature activation+5.07e-02

Pos logit contributions
itime+0.180
ocious+0.180
iegel+0.178
 stranded+0.176
uri+0.176
Neg logit contributions
 Slater-0.218
catentry-0.216
 Accountability-0.214
imov-0.213
 relapse-0.211
Token only
Token ID691
Feature activation+1.35e-01

Pos logit contributions
ocious+0.031
itime+0.031
 stranded+0.031
iegel+0.031
uri+0.031
Neg logit contributions
catentry-0.036
 Slater-0.036
 Accountability-0.036
imov-0.036
 relapse-0.035
Token human
Token ID1692
Feature activation-1.79e-01

Pos logit contributions
ocious+0.082
itime+0.082
 stranded+0.081
iegel+0.081
uri+0.081
Neg logit contributions
 Slater-0.099
catentry-0.098
 Accountability-0.097
imov-0.096
 relapse-0.095
Token left
Token ID1364
Feature activation-1.67e+00

Pos logit contributions
 Slater+0.132
catentry+0.131
 Accountability+0.130
imov+0.129
 relapse+0.128
Neg logit contributions
ocious-0.108
itime-0.108
iegel-0.107
 stranded-0.106
uri-0.106
Token in
Token ID287
Feature activation-3.25e-01

Pos logit contributions
 Slater+1.304
 Accountability+1.274
catentry+1.264
imov+1.262
 relapse+1.257
Neg logit contributions
ocious-0.938
itime-0.934
iegel-0.915
uri-0.905
asio-0.899
Token Springfield
Token ID27874
Feature activation+1.47e+00

Pos logit contributions
catentry+0.217
 Slater+0.217
 Accountability+0.213
imov+0.212
 relapse+0.210
Neg logit contributions
ocious-0.218
itime-0.218
 stranded-0.216
iegel-0.216
uri-0.214
Token.
Token ID13
Feature activation+6.85e-01

Pos logit contributions
 stranded+0.822
itime+0.819
ocious+0.813
iegel+0.811
uri+0.808
Neg logit contributions
catentry-1.156
 Slater-1.133
imov-1.117
 Accountability-1.114
isite-1.109
Token
Token ID198
Feature activation-9.75e-02

Pos logit contributions
ocious+0.444
itime+0.443
 stranded+0.441
iegel+0.441
uri+0.440
Neg logit contributions
catentry-0.471
 Slater-0.465
imov-0.457
 relapse-0.454
 Accountability-0.454
Token
Token ID198
Feature activation-2.19e-01

Pos logit contributions
catentry+0.066
 Slater+0.065
imov+0.064
 Accountability+0.064
isite+0.064
Neg logit contributions
uri-0.064
ocious-0.064
iegel-0.064
itime-0.064
 stranded-0.063
TokenJoin
Token ID18234
Feature activation+1.34e-01

Pos logit contributions
 Slater+0.153
catentry+0.152
 Accountability+0.150
imov+0.149
 relapse+0.147
Neg logit contributions
ocious-0.140
itime-0.140
iegel-0.139
 stranded-0.138
uri-0.137
Token the
Token ID262
Feature activation+9.99e-01

Pos logit contributions
ocious+0.089
itime+0.089
iegel+0.088
 stranded+0.088
uri+0.088
Neg logit contributions
 Slater-0.091
catentry-0.090
 Accountability-0.089
imov-0.088
 relapse-0.087
Token schools
Token ID4266
Feature activation+5.71e-02

Pos logit contributions
 Slater+0.602
catentry+0.599
 Accountability+0.591
imov+0.589
 relapse+0.583
Neg logit contributions
ocious-0.467
itime-0.466
iegel-0.461
 stranded-0.458
uri-0.456
Token assess
Token ID4659
Feature activation-4.54e-01

Pos logit contributions
itime+0.034
ocious+0.034
 stranded+0.034
iegel+0.034
uri+0.034
Neg logit contributions
 Slater-0.042
catentry-0.042
imov-0.041
 Accountability-0.041
isite-0.041
Token student
Token ID3710
Feature activation+2.76e-01

Pos logit contributions
 Slater+0.322
catentry+0.320
 Accountability+0.315
imov+0.314
 relapse+0.311
Neg logit contributions
ocious-0.286
itime-0.286
iegel-0.283
 stranded-0.282
uri-0.280
Token and
Token ID290
Feature activation+8.33e-01

Pos logit contributions
ocious+0.181
itime+0.181
iegel+0.179
 stranded+0.178
uri+0.177
Neg logit contributions
 Slater-0.189
catentry-0.188
 Accountability-0.185
imov-0.184
 relapse-0.183
Token teacher
Token ID4701
Feature activation-1.71e-01

Pos logit contributions
ocious+0.464
itime+0.463
 stranded+0.459
iegel+0.459
uri+0.454
Neg logit contributions
 Slater-0.649
catentry-0.647
 Accountability-0.637
imov-0.637
pron-0.629
Token performance
Token ID2854
Feature activation+5.90e-01

Pos logit contributions
 Slater+0.112
catentry+0.112
 Accountability+0.110
imov+0.109
 relapse+0.108
Neg logit contributions
ocious-0.117
itime-0.117
iegel-0.116
 stranded-0.116
uri-0.115
Token.
Token ID13
Feature activation+1.14e-01

Pos logit contributions
ocious+0.311
itime+0.309
iegel+0.308
 stranded+0.306
uri+0.305
Neg logit contributions
 Slater-0.477
catentry-0.477
imov-0.469
 Accountability-0.468
 relapse-0.463
Token The
Token ID383
Feature activation+2.14e-01

Pos logit contributions
ocious+0.076
itime+0.076
iegel+0.076
 stranded+0.075
uri+0.075
Neg logit contributions
catentry-0.076
 Slater-0.075
imov-0.074
 Accountability-0.074
 relapse-0.073
Token investment
Token ID4896
Feature activation+5.29e-01

Pos logit contributions
ocious+0.120
itime+0.119
iegel+0.119
 stranded+0.118
uri+0.117
Neg logit contributions
catentry-0.165
 Slater-0.165
imov-0.163
 Accountability-0.162
 relapse-0.161
Token was
Token ID373
Feature activation+7.33e-01

Pos logit contributions
 stranded+0.297
ocious+0.295
uri+0.294
itime+0.293
iegel+0.292
Neg logit contributions
catentry-0.413
 Slater-0.407
imov-0.404
 Accountability-0.403
isite-0.398
Token made
Token ID925
Feature activation+2.09e-01

Pos logit contributions
 stranded+0.484
ocious+0.483
iegel+0.481
itime+0.480
uri+0.477
Neg logit contributions
catentry-0.496
 Slater-0.488
imov-0.483
 Accountability-0.481
isite-0.476
Token from
Token ID422
Feature activation+7.04e-01

Pos logit contributions
ocious+0.220
itime+0.219
iegel+0.217
 stranded+0.216
uri+0.216
Neg logit contributions
 Slater-0.327
catentry-0.326
 Accountability-0.322
imov-0.321
 relapse-0.318
Token the
Token ID262
Feature activation-1.60e-01

Pos logit contributions
 stranded+0.457
ocious+0.454
iegel+0.450
itime+0.449
uri+0.448
Neg logit contributions
catentry-0.487
imov-0.478
 Slater-0.477
 Accountability-0.477
isite-0.471
Token Ben
Token ID3932
Feature activation-4.69e-01

Pos logit contributions
catentry+0.110
 Slater+0.110
 Accountability+0.108
imov+0.108
isite+0.106
Neg logit contributions
ocious-0.104
itime-0.103
 stranded-0.103
iegel-0.103
uri-0.102
Tokenih
Token ID4449
Feature activation-1.60e+00

Pos logit contributions
 Slater+0.244
catentry+0.242
 Accountability+0.237
imov+0.235
 relapse+0.232
Neg logit contributions
ocious-0.384
itime-0.384
iegel-0.381
 stranded-0.380
uri-0.378
Tokenana
Token ID2271
Feature activation-1.07e+00

Pos logit contributions
 Slater+1.094
catentry+1.087
 Accountability+1.072
imov+1.068
 relapse+1.056
Neg logit contributions
ocious-1.045
itime-1.044
iegel-1.032
 stranded-1.031
uri-1.023
Token Burn
Token ID8942
Feature activation+1.11e+00

Pos logit contributions
 Slater+0.765
catentry+0.760
 Accountability+0.750
imov+0.747
 relapse+0.740
Neg logit contributions
ocious-0.668
itime-0.667
iegel-0.660
 stranded-0.657
uri-0.654
Token!
Token ID0
Feature activation-5.46e-01

Pos logit contributions
ocious+0.693
 stranded+0.691
iegel+0.691
itime+0.689
uri+0.685
Neg logit contributions
catentry-0.781
 Slater-0.778
imov-0.765
 Accountability-0.762
pron-0.754
Token
Token ID198
Feature activation+1.19e+00

Pos logit contributions
 Slater+0.376
catentry+0.375
 Accountability+0.369
imov+0.368
 relapse+0.364
Neg logit contributions
ocious-0.354
itime-0.353
iegel-0.350
 stranded-0.349
uri-0.347
Token
Token ID198
Feature activation+1.41e+00

Pos logit contributions
uri+0.763
ocious+0.758
iegel+0.757
itime+0.751
 stranded+0.751
Neg logit contributions
catentry-0.830
 Slater-0.811
imov-0.804
 Accountability-0.800
isite-0.798
TokenAccording
Token ID4821
Feature activation+1.25e+00

Pos logit contributions
ocious+0.908
itime+0.901
uri+0.896
iegel+0.891
 stranded+0.890
Neg logit contributions
 Slater-0.977
catentry-0.976
 Accountability-0.970
imov-0.962
 relapse-0.956
Token to
Token ID284
Feature activation-2.25e-01

Pos logit contributions
ocious+0.603
iegel+0.602
 stranded+0.601
itime+0.598
uri+0.598
Neg logit contributions
catentry-1.062
 Slater-1.053
imov-1.046
 Accountability-1.043
isite-1.036
Token level
Token ID1241
Feature activation+3.56e-01

Pos logit contributions
itime+0.981
ocious+0.981
 stranded+0.977
iegel+0.973
uri+0.970
Neg logit contributions
catentry-1.254
 Slater-1.246
imov-1.232
 Accountability-1.229
isite-1.211
Token,
Token ID11
Feature activation+1.49e+00

Pos logit contributions
ocious+0.203
itime+0.203
iegel+0.202
 stranded+0.201
uri+0.199
Neg logit contributions
catentry-0.272
 Slater-0.272
 Accountability-0.268
imov-0.267
isite-0.264
Token I
Token ID314
Feature activation+4.40e-01

Pos logit contributions
 stranded+0.909
iegel+0.900
itime+0.899
ocious+0.896
uri+0.889
Neg logit contributions
catentry-1.079
 Slater-1.054
imov-1.049
 Accountability-1.044
isite-1.040
Token thought
Token ID1807
Feature activation+8.85e-01

Pos logit contributions
itime+0.259
ocious+0.259
 stranded+0.257
iegel+0.256
uri+0.255
Neg logit contributions
catentry-0.329
 Slater-0.328
 Accountability-0.322
imov-0.321
isite-0.319
Token you
Token ID345
Feature activation-3.75e-01

Pos logit contributions
 stranded+0.553
ocious+0.551
itime+0.551
iegel+0.550
uri+0.545
Neg logit contributions
catentry-0.629
 Slater-0.617
imov-0.611
 Accountability-0.609
isite-0.606
Token fit
Token ID4197
Feature activation+6.00e-01

Pos logit contributions
 Slater+0.287
catentry+0.286
 Accountability+0.282
imov+0.281
 relapse+0.278
Neg logit contributions
ocious-0.214
itime-0.214
iegel-0.212
 stranded-0.211
uri-0.210
Token in
Token ID287
Feature activation+1.18e+00

Pos logit contributions
ocious+0.366
itime+0.366
iegel+0.363
 stranded+0.362
uri+0.359
Neg logit contributions
 Slater-0.435
catentry-0.434
 Accountability-0.428
imov-0.426
 relapse-0.422
Token nicely
Token ID16576
Feature activation+6.31e-01

Pos logit contributions
ocious+0.682
itime+0.681
iegel+0.676
 stranded+0.675
uri+0.670
Neg logit contributions
catentry-0.888
 Slater-0.886
 Accountability-0.872
imov-0.868
 relapse-0.861
Token with
Token ID351
Feature activation-2.78e-01

Pos logit contributions
ocious+0.352
itime+0.352
iegel+0.348
 stranded+0.347
uri+0.345
Neg logit contributions
 Slater-0.491
catentry-0.490
 Accountability-0.483
imov-0.481
 relapse-0.477
Token all
Token ID477
Feature activation+4.20e-01

Pos logit contributions
catentry+0.194
 Slater+0.194
 Accountability+0.191
imov+0.190
 relapse+0.188
Neg logit contributions
ocious-0.178
itime-0.177
iegel-0.176
 stranded-0.176
uri-0.174
Token these
Token ID777
Feature activation+6.97e-01

Pos logit contributions
ocious+0.266
itime+0.264
iegel+0.263
 stranded+0.263
uri+0.261
Neg logit contributions
catentry-0.295
 Slater-0.294
 Accountability-0.290
imov-0.288
isite-0.286
Token and
Token ID290
Feature activation-8.71e-01

Pos logit contributions
ocious+0.213
itime+0.213
iegel+0.212
 stranded+0.211
uri+0.209
Neg logit contributions
 Slater-0.310
catentry-0.310
 Accountability-0.305
imov-0.304
 relapse-0.301
Token posting
Token ID10754
Feature activation-1.29e+00

Pos logit contributions
 Slater+0.591
catentry+0.588
 Accountability+0.579
imov+0.577
 relapse+0.570
Neg logit contributions
ocious-0.575
itime-0.574
iegel-0.569
 stranded-0.568
uri-0.565
Token pictures
Token ID5986
Feature activation-5.62e-01

Pos logit contributions
 Slater+0.918
catentry+0.917
 Accountability+0.900
imov+0.899
 relapse+0.886
Neg logit contributions
ocious-0.806
itime-0.804
iegel-0.799
 stranded-0.798
uri-0.792
Token on
Token ID319
Feature activation-2.56e-01

Pos logit contributions
 Slater+0.430
catentry+0.428
 Accountability+0.422
imov+0.421
 relapse+0.416
Neg logit contributions
ocious-0.322
itime-0.322
iegel-0.319
 stranded-0.318
uri-0.316
Token popular
Token ID2968
Feature activation-1.68e-01

Pos logit contributions
catentry+0.180
 Slater+0.179
 Accountability+0.176
imov+0.176
 relapse+0.173
Neg logit contributions
ocious-0.162
itime-0.162
 stranded-0.161
iegel-0.161
uri-0.159
Token social
Token ID1919
Feature activation+1.20e+00

Pos logit contributions
 Slater+0.121
catentry+0.121
 Accountability+0.119
imov+0.118
 relapse+0.117
Neg logit contributions
ocious-0.103
itime-0.103
iegel-0.102
 stranded-0.102
uri-0.102
Token networking
Token ID19140
Feature activation-4.29e-01

Pos logit contributions
 stranded+0.726
ocious+0.725
iegel+0.723
itime+0.721
uri+0.719
Neg logit contributions
catentry-0.880
 Slater-0.877
imov-0.867
 Accountability-0.862
isite-0.851
Token websites
Token ID9293
Feature activation+2.47e-01

Pos logit contributions
 Slater+0.365
catentry+0.361
 Accountability+0.358
imov+0.357
 relapse+0.355
Neg logit contributions
itime-0.211
ocious-0.211
iegel-0.207
 stranded-0.205
uri-0.204
Token Facebook
Token ID3203
Feature activation-5.44e-01

Pos logit contributions
ocious+0.133
itime+0.133
iegel+0.131
 stranded+0.130
uri+0.130
Neg logit contributions
 Slater-0.198
catentry-0.197
 Accountability-0.194
imov-0.193
 relapse-0.192
Token and
Token ID290
Feature activation+6.86e-01

Pos logit contributions
 Slater+0.395
catentry+0.392
 Accountability+0.387
imov+0.385
 relapse+0.382
Neg logit contributions
ocious-0.334
itime-0.334
iegel-0.330
 stranded-0.328
uri-0.327
Token Twitter
Token ID3009
Feature activation+2.70e+00

Pos logit contributions
ocious+0.513
itime+0.513
iegel+0.510
 stranded+0.509
uri+0.507
Neg logit contributions
catentry-0.403
 Slater-0.403
 Accountability-0.394
imov-0.393
 relapse-0.387
Token
Token ID851
Feature activation+2.45e-01

Pos logit contributions
ocious+0.032
itime+0.032
 stranded+0.032
iegel+0.032
uri+0.031
Neg logit contributions
 Slater-0.041
catentry-0.041
 Accountability-0.040
imov-0.040
isite-0.039
Token you
Token ID345
Feature activation-7.61e-01

Pos logit contributions
ocious+0.172
itime+0.171
 stranded+0.170
iegel+0.170
uri+0.169
Neg logit contributions
catentry-0.156
 Slater-0.156
 Accountability-0.152
imov-0.152
 relapse-0.150
Token
Token ID447
Feature activation-1.20e-01

Pos logit contributions
 Slater+0.555
catentry+0.553
 Accountability+0.544
imov+0.543
 relapse+0.536
Neg logit contributions
ocious-0.464
itime-0.464
iegel-0.459
 stranded-0.458
uri-0.456
Token
Token ID247
Feature activation-1.62e-01

Pos logit contributions
 Slater+0.092
catentry+0.091
 Accountability+0.090
imov+0.090
isite+0.089
Neg logit contributions
ocious-0.068
itime-0.067
iegel-0.067
uri-0.067
 stranded-0.067
Tokenre
Token ID260
Feature activation-1.83e-01

Pos logit contributions
 Slater+0.085
catentry+0.084
 Accountability+0.082
imov+0.082
 relapse+0.080
Neg logit contributions
itime-0.133
ocious-0.132
 stranded-0.131
iegel-0.131
uri-0.131
Token also
Token ID635
Feature activation+1.02e-01

Pos logit contributions
 Slater+0.130
catentry+0.129
 Accountability+0.127
imov+0.127
 relapse+0.125
Neg logit contributions
ocious-0.115
itime-0.115
iegel-0.114
 stranded-0.114
uri-0.113
Token waiting
Token ID4953
Feature activation-2.21e+00

Pos logit contributions
ocious+0.066
itime+0.066
 stranded+0.066
iegel+0.065
uri+0.065
Neg logit contributions
 Slater-0.070
catentry-0.070
 Accountability-0.069
imov-0.069
 relapse-0.068
Token for
Token ID329
Feature activation-5.28e-01

Pos logit contributions
 Slater+1.782
catentry+1.757
 Accountability+1.753
imov+1.740
 relapse+1.727
Neg logit contributions
ocious-1.177
itime-1.176
iegel-1.165
 stranded-1.148
uri-1.148
Token doom
Token ID27666
Feature activation-2.28e-01

Pos logit contributions
 Slater+0.366
catentry+0.366
imov+0.358
 Accountability+0.358
isite+0.353
Neg logit contributions
ocious-0.340
itime-0.338
 stranded-0.336
iegel-0.336
uri-0.333
Token.
Token ID13
Feature activation+5.86e-01

Pos logit contributions
 Slater+0.191
catentry+0.190
 Accountability+0.188
imov+0.187
 relapse+0.185
Neg logit contributions
ocious-0.114
itime-0.114
iegel-0.112
 stranded-0.112
uri-0.111
Token I
Token ID314
Feature activation+2.48e-01

Pos logit contributions
uri+0.369
ocious+0.368
itime+0.366
iegel+0.364
 stranded+0.362
Neg logit contributions
catentry-0.414
 Slater-0.404
imov-0.402
 relapse-0.400
isite-0.399
Token)
Token ID8
Feature activation-6.96e-01

Pos logit contributions
 Slater+0.202
catentry+0.201
 Accountability+0.198
imov+0.197
 relapse+0.194
Neg logit contributions
ocious-0.231
itime-0.231
iegel-0.229
 stranded-0.228
uri-0.227
Token week
Token ID1285
Feature activation-5.25e-01

Pos logit contributions
 Slater+0.493
catentry+0.492
 Accountability+0.485
imov+0.483
 relapse+0.478
Neg logit contributions
ocious-0.436
itime-0.436
iegel-0.433
 stranded-0.432
uri-0.431
Token.
Token ID13
Feature activation-6.00e-01

Pos logit contributions
 Slater+0.442
catentry+0.440
 Accountability+0.435
imov+0.434
 relapse+0.430
Neg logit contributions
ocious-0.260
itime-0.260
iegel-0.257
 stranded-0.255
uri-0.253
Token I
Token ID314
Feature activation-5.81e-01

Pos logit contributions
 Slater+0.410
catentry+0.408
 Accountability+0.401
imov+0.400
 relapse+0.397
Neg logit contributions
ocious-0.393
itime-0.392
iegel-0.389
 stranded-0.387
uri-0.386
Token hope
Token ID2911
Feature activation-2.10e+00

Pos logit contributions
 Slater+0.420
catentry+0.419
 Accountability+0.412
imov+0.410
 relapse+0.406
Neg logit contributions
itime-0.357
ocious-0.357
iegel-0.353
 stranded-0.352
uri-0.351
Token they
Token ID484
Feature activation-7.56e-01

Pos logit contributions
 Slater+1.500
catentry+1.468
 Accountability+1.460
imov+1.453
pron+1.441
Neg logit contributions
itime-1.329
ocious-1.321
iegel-1.305
 stranded-1.296
uri-1.285
Token make
Token ID787
Feature activation-8.59e-01

Pos logit contributions
 Slater+0.559
catentry+0.559
 Accountability+0.548
imov+0.547
isite+0.542
Neg logit contributions
ocious-0.451
itime-0.451
iegel-0.447
 stranded-0.447
uri-0.446
Token the
Token ID262
Feature activation-7.57e-01

Pos logit contributions
catentry+0.608
 Slater+0.606
 Accountability+0.596
imov+0.595
 relapse+0.588
Neg logit contributions
ocious-0.541
itime-0.538
iegel-0.535
 stranded-0.535
uri-0.532
Token walk
Token ID2513
Feature activation-5.01e-01

Pos logit contributions
catentry+0.532
 Slater+0.532
 Accountability+0.522
imov+0.521
 relapse+0.514
Neg logit contributions
ocious-0.480
itime-0.479
iegel-0.475
 stranded-0.475
uri-0.471
Token.
Token ID13
Feature activation-3.45e-01

Pos logit contributions
 Slater+0.405
catentry+0.403
 Accountability+0.398
imov+0.397
 relapse+0.393
Neg logit contributions
ocious-0.266
itime-0.265
iegel-0.262
 stranded-0.261
uri-0.259
Token I
Token ID314
Feature activation-6.31e-01

Pos logit contributions
 Slater+0.235
catentry+0.235
imov+0.230
 Accountability+0.230
 relapse+0.228
Neg logit contributions
ocious-0.226
itime-0.226
iegel-0.224
uri-0.223
 stranded-0.223
Token170
Token ID17279
Feature activation-7.78e-02

Pos logit contributions
ocious+0.266
itime+0.265
iegel+0.263
 stranded+0.262
uri+0.261
Neg logit contributions
 Slater-0.270
catentry-0.268
 Accountability-0.264
imov-0.263
 relapse-0.260
Token percent
Token ID1411
Feature activation+1.59e+00

Pos logit contributions
 Slater+0.056
catentry+0.055
 Accountability+0.055
imov+0.054
 relapse+0.054
Neg logit contributions
ocious-0.048
itime-0.048
iegel-0.048
 stranded-0.048
uri-0.048
Token Increase
Token ID25285
Feature activation-1.00e+00

Pos logit contributions
ocious+0.967
itime+0.964
 stranded+0.962
iegel+0.959
uri+0.951
Neg logit contributions
catentry-1.163
 Slater-1.152
imov-1.136
 Accountability-1.127
isite-1.122
Token of
Token ID286
Feature activation-9.24e-02

Pos logit contributions
 Slater+0.741
catentry+0.737
 Accountability+0.727
imov+0.724
 relapse+0.717
Neg logit contributions
itime-0.600
ocious-0.599
iegel-0.592
 stranded-0.589
uri-0.587
Token adherents
Token ID46372
Feature activation-5.28e-01

Pos logit contributions
 Slater+0.067
catentry+0.067
 Accountability+0.066
imov+0.066
 relapse+0.065
Neg logit contributions
ocious-0.056
itime-0.056
iegel-0.056
 stranded-0.056
uri-0.055
Token during
Token ID1141
Feature activation+7.34e-02

Pos logit contributions
 Slater+0.392
catentry+0.389
 Accountability+0.384
imov+0.382
 relapse+0.379
Neg logit contributions
itime-0.316
ocious-0.316
iegel-0.312
 stranded-0.310
uri-0.309
Token a
Token ID257
Feature activation-1.73e-02

Pos logit contributions
ocious+0.046
itime+0.046
iegel+0.046
 stranded+0.046
uri+0.045
Neg logit contributions
 Slater-0.052
catentry-0.052
 Accountability-0.051
imov-0.051
 relapse-0.050
Token Buddhist
Token ID20838
Feature activation-3.35e-01

Pos logit contributions
catentry+0.012
 Slater+0.012
 Accountability+0.012
imov+0.012
isite+0.012
Neg logit contributions
ocious-0.011
itime-0.011
 stranded-0.011
iegel-0.011
uri-0.011
Token
Token ID564
Feature activation-1.05e+00

Pos logit contributions
 Slater+0.233
catentry+0.230
 Accountability+0.228
imov+0.226
 relapse+0.225
Neg logit contributions
itime-0.217
ocious-0.217
iegel-0.215
 stranded-0.213
uri-0.213
Token
Token ID250
Feature activation+9.87e-02

Pos logit contributions
 Slater+0.671
catentry+0.669
 Accountability+0.659
imov+0.655
 relapse+0.647
Neg logit contributions
itime-0.728
ocious-0.727
iegel-0.722
 stranded-0.717
uri-0.711
Tokenbo
Token ID2127
Feature activation-2.74e-01

Pos logit contributions
ocious+0.065
itime+0.065
iegel+0.064
 stranded+0.064
uri+0.064
Neg logit contributions
 Slater-0.067
catentry-0.067
 Accountability-0.066
imov-0.066
 relapse-0.065
Token fair
Token ID3148
Feature activation+3.52e-01

Pos logit contributions
catentry+0.147
 Slater+0.147
imov+0.144
 Accountability+0.144
isite+0.142
Neg logit contributions
itime-0.140
ocious-0.140
 stranded-0.139
iegel-0.139
uri-0.137
Token for
Token ID329
Feature activation+1.62e-01

Pos logit contributions
ocious+0.179
itime+0.179
iegel+0.177
 stranded+0.176
uri+0.175
Neg logit contributions
 Slater-0.293
catentry-0.291
 Accountability-0.288
imov-0.287
 relapse-0.284
Token users
Token ID2985
Feature activation-8.10e-01

Pos logit contributions
itime+0.085
ocious+0.085
 stranded+0.084
iegel+0.084
uri+0.083
Neg logit contributions
catentry-0.132
 Slater-0.131
imov-0.129
 Accountability-0.129
isite-0.128
Token,
Token ID11
Feature activation-5.35e-01

Pos logit contributions
 Slater+0.640
catentry+0.635
 Accountability+0.629
imov+0.625
 relapse+0.620
Neg logit contributions
ocious-0.445
itime-0.444
iegel-0.438
 stranded-0.435
uri-0.433
Token developers
Token ID6505
Feature activation-8.43e-01

Pos logit contributions
catentry+0.388
 Slater+0.388
imov+0.382
 Accountability+0.381
 relapse+0.377
Neg logit contributions
ocious-0.325
itime-0.325
iegel-0.323
 stranded-0.323
uri-0.319
Token and
Token ID290
Feature activation-4.79e-01

Pos logit contributions
 Slater+0.607
catentry+0.603
 Accountability+0.595
imov+0.593
 relapse+0.587
Neg logit contributions
ocious-0.521
itime-0.521
iegel-0.516
 stranded-0.513
uri-0.511
Token stakeholders
Token ID26138
Feature activation+5.18e-01

Pos logit contributions
 Slater+0.375
catentry+0.375
imov+0.368
 Accountability+0.368
isite+0.364
Neg logit contributions
itime-0.265
ocious-0.265
 stranded-0.264
iegel-0.263
uri-0.260
Token.
Token ID13
Feature activation+7.63e-02

Pos logit contributions
ocious+0.281
itime+0.280
iegel+0.277
 stranded+0.276
uri+0.275
Neg logit contributions
 Slater-0.413
catentry-0.411
 Accountability-0.406
imov-0.405
 relapse-0.401
Token
Token ID198
Feature activation-3.06e-01

Pos logit contributions
ocious+0.050
itime+0.050
iegel+0.050
uri+0.050
 stranded+0.050
Neg logit contributions
catentry-0.051
 Slater-0.051
imov-0.050
 relapse-0.050
isite-0.050
Token
Token ID198
Feature activation-1.68e-01

Pos logit contributions
catentry+0.207
 Slater+0.205
imov+0.201
 Accountability+0.201
 relapse+0.200
Neg logit contributions
uri-0.201
ocious-0.201
iegel-0.201
itime-0.200
 stranded-0.199
TokenExper
Token ID20468
Feature activation-2.08e+00

Pos logit contributions
 Slater+0.122
catentry+0.121
 Accountability+0.120
imov+0.119
 relapse+0.118
Neg logit contributions
ocious-0.104
itime-0.103
iegel-0.103
uri-0.102
 stranded-0.102
Token that
Token ID326
Feature activation-8.48e-02

Pos logit contributions
 Slater+1.307
catentry+1.290
 Accountability+1.276
imov+1.270
 relapse+1.261
Neg logit contributions
ocious-1.242
itime-1.239
iegel-1.225
uri-1.215
 stranded-1.213
Token certain
Token ID1728
Feature activation-9.69e-01

Pos logit contributions
catentry+0.060
 Slater+0.060
 Accountability+0.059
imov+0.059
isite+0.058
Neg logit contributions
itime-0.053
ocious-0.053
 stranded-0.053
iegel-0.053
uri-0.052
Token parts
Token ID3354
Feature activation-1.09e-01

Pos logit contributions
 Slater+0.680
catentry+0.679
 Accountability+0.668
imov+0.665
 relapse+0.657
Neg logit contributions
itime-0.615
ocious-0.614
 stranded-0.609
iegel-0.609
uri-0.604
Token of
Token ID286
Feature activation-8.54e-01

Pos logit contributions
 Slater+0.077
catentry+0.077
 Accountability+0.076
imov+0.075
 relapse+0.074
Neg logit contributions
itime-0.069
ocious-0.069
iegel-0.068
 stranded-0.068
uri-0.068
Token the
Token ID262
Feature activation-8.78e-01

Pos logit contributions
 Slater+0.585
catentry+0.585
 Accountability+0.575
imov+0.573
 relapse+0.566
Neg logit contributions
itime-0.555
ocious-0.554
 stranded-0.551
iegel-0.550
uri-0.546
Token Eve
Token ID12882
Feature activation-1.28e+00

Pos logit contributions
 Slater+0.608
catentry+0.607
 Accountability+0.598
imov+0.596
 relapse+0.587
Neg logit contributions
itime-0.565
ocious-0.563
 stranded-0.562
iegel-0.559
uri-0.555
Token Radio
Token ID8829
Feature activation-6.72e-02

Pos logit contributions
 Slater+0.877
catentry+0.873
 Accountability+0.860
imov+0.856
 relapse+0.847
Neg logit contributions
ocious-0.840
itime-0.840
iegel-0.832
 stranded-0.829
uri-0.825
Token website
Token ID3052
Feature activation-6.37e-02

Pos logit contributions
catentry+0.045
 Slater+0.045
imov+0.044
 Accountability+0.044
pron+0.043
Neg logit contributions
itime-0.045
ocious-0.045
 stranded-0.044
iegel-0.044
uri-0.044
Token will
Token ID481
Feature activation-1.32e+00

Pos logit contributions
catentry+0.046
 Slater+0.046
 Accountability+0.045
imov+0.045
pron+0.045
Neg logit contributions
itime-0.039
ocious-0.039
iegel-0.038
 stranded-0.038
uri-0.038
Token look
Token ID804
Feature activation-8.05e-01

Pos logit contributions
 Slater+0.954
catentry+0.952
 Accountability+0.937
imov+0.932
isite+0.920
Neg logit contributions
itime-0.814
ocious-0.813
iegel-0.807
 stranded-0.807
uri-0.800
Token a
Token ID257
Feature activation+3.34e-01

Pos logit contributions
 Slater+0.558
catentry+0.555
 Accountability+0.547
imov+0.545
 relapse+0.539
Neg logit contributions
ocious-0.519
itime-0.519
iegel-0.514
 stranded-0.513
uri-0.510
Token how
Token ID703
Feature activation-5.80e-01

Pos logit contributions
 Slater+0.342
catentry+0.341
 Accountability+0.335
imov+0.334
 relapse+0.330
Neg logit contributions
ocious-0.330
itime-0.330
iegel-0.327
 stranded-0.326
uri-0.324
Token do
Token ID466
Feature activation+6.03e-01

Pos logit contributions
 Slater+0.411
catentry+0.409
 Accountability+0.403
imov+0.402
 relapse+0.397
Neg logit contributions
ocious-0.365
itime-0.365
iegel-0.361
 stranded-0.360
uri-0.358
Token they
Token ID484
Feature activation-8.40e-01

Pos logit contributions
ocious+0.408
itime+0.408
iegel+0.406
 stranded+0.404
uri+0.402
Neg logit contributions
 Slater-0.397
catentry-0.395
 Accountability-0.388
imov-0.387
 relapse-0.383
Token place
Token ID1295
Feature activation-8.89e-01

Pos logit contributions
 Slater+0.635
catentry+0.633
imov+0.621
 Accountability+0.620
isite+0.615
Neg logit contributions
itime-0.489
 stranded-0.489
iegel-0.486
ocious-0.485
uri-0.483
Token that
Token ID326
Feature activation-6.70e-01

Pos logit contributions
catentry+0.584
 Slater+0.581
imov+0.569
 Accountability+0.569
 relapse+0.562
Neg logit contributions
 stranded-0.604
itime-0.604
ocious-0.603
iegel-0.600
uri-0.595
Token risk
Token ID2526
Feature activation-2.24e+00

Pos logit contributions
catentry+0.461
 Slater+0.459
imov+0.453
 Accountability+0.451
pron+0.446
Neg logit contributions
 stranded-0.433
itime-0.431
ocious-0.430
iegel-0.429
uri-0.425
Token.
Token ID13
Feature activation+2.69e-02

Pos logit contributions
 Slater+1.679
catentry+1.654
 Accountability+1.642
imov+1.635
 relapse+1.621
Neg logit contributions
ocious-1.335
itime-1.328
iegel-1.310
 stranded-1.303
uri-1.297
Token Our
Token ID3954
Feature activation+6.15e-02

Pos logit contributions
ocious+0.017
itime+0.017
iegel+0.017
uri+0.017
 stranded+0.017
Neg logit contributions
catentry-0.019
 Slater-0.019
 relapse-0.018
imov-0.018
 Accountability-0.018
Token entire
Token ID2104
Feature activation+3.70e-01

Pos logit contributions
ocious+0.043
itime+0.043
 stranded+0.043
iegel+0.043
uri+0.043
Neg logit contributions
 Slater-0.039
catentry-0.039
imov-0.038
 Accountability-0.038
isite-0.038
Token goal
Token ID3061
Feature activation-1.49e+00

Pos logit contributions
ocious+0.246
itime+0.246
 stranded+0.244
iegel+0.244
uri+0.242
Neg logit contributions
 Slater-0.249
catentry-0.248
imov-0.244
 Accountability-0.243
isite-0.240
Token is
Token ID318
Feature activation-5.32e-01

Pos logit contributions
 Slater+1.112
catentry+1.106
 Accountability+1.092
imov+1.087
 relapse+1.077
Neg logit contributions
ocious-0.876
itime-0.876
iegel-0.866
 stranded-0.862
uri-0.857
Token begin
Token ID2221
Feature activation+1.51e-02

Pos logit contributions
 stranded+0.313
ocious+0.312
iegel+0.312
itime+0.311
uri+0.309
Neg logit contributions
catentry-0.399
 Slater-0.397
 Accountability-0.390
imov-0.387
isite-0.385
Token to
Token ID284
Feature activation+1.13e+00

Pos logit contributions
ocious+0.009
itime+0.009
 stranded+0.009
iegel+0.009
uri+0.009
Neg logit contributions
 Slater-0.011
catentry-0.011
 Accountability-0.011
imov-0.011
isite-0.011
Token see
Token ID766
Feature activation-6.74e-01

Pos logit contributions
 stranded+0.670
iegel+0.664
ocious+0.661
itime+0.659
uri+0.656
Neg logit contributions
catentry-0.842
 Slater-0.842
 Accountability-0.828
isite-0.818
imov-0.818
Token those
Token ID883
Feature activation-3.38e-01

Pos logit contributions
 Slater+0.452
catentry+0.450
 Accountability+0.443
imov+0.441
 relapse+0.435
Neg logit contributions
ocious-0.450
itime-0.449
iegel-0.446
 stranded-0.446
uri-0.442
Token things
Token ID1243
Feature activation-8.78e-01

Pos logit contributions
 Slater+0.235
catentry+0.234
 Accountability+0.231
imov+0.229
isite+0.227
Neg logit contributions
 stranded-0.216
ocious-0.216
itime-0.215
iegel-0.215
uri-0.213
Token then
Token ID788
Feature activation-2.32e+00

Pos logit contributions
 Slater+0.619
catentry+0.618
 Accountability+0.609
imov+0.606
isite+0.599
Neg logit contributions
ocious-0.555
itime-0.553
iegel-0.550
 stranded-0.549
uri-0.544
Token it
Token ID340
Feature activation-2.02e-01

Pos logit contributions
 Slater+1.502
catentry+1.494
 Accountability+1.468
imov+1.464
 relapse+1.452
Neg logit contributions
ocious-1.609
itime-1.609
iegel-1.590
 stranded-1.578
uri-1.576
Token may
Token ID743
Feature activation-5.61e-01

Pos logit contributions
 Slater+0.149
catentry+0.148
 Accountability+0.146
imov+0.145
 relapse+0.144
Neg logit contributions
ocious-0.123
itime-0.122
iegel-0.121
 stranded-0.121
uri-0.120
Token be
Token ID307
Feature activation+3.73e-01

Pos logit contributions
 Slater+0.415
catentry+0.413
 Accountability+0.408
imov+0.406
 relapse+0.401
Neg logit contributions
ocious-0.335
itime-0.335
iegel-0.332
 stranded-0.331
uri-0.329
Token an
Token ID281
Feature activation+2.31e-01

Pos logit contributions
 stranded+0.231
ocious+0.230
itime+0.229
iegel+0.228
uri+0.227
Neg logit contributions
 Slater-0.267
catentry-0.266
 Accountability-0.263
imov-0.261
isite-0.259
Token indication
Token ID12955
Feature activation+1.21e+00

Pos logit contributions
ocious+0.151
itime+0.150
iegel+0.150
 stranded+0.150
uri+0.149
Neg logit contributions
 Slater-0.159
catentry-0.156
imov-0.155
 Accountability-0.155
pron-0.153
Token findings
Token ID6373
Feature activation+5.69e-01

Pos logit contributions
ocious+0.781
 stranded+0.779
itime+0.767
iegel+0.766
uri+0.764
Neg logit contributions
catentry-0.638
 Slater-0.637
imov-0.619
 Accountability-0.618
pron-0.613
Token at
Token ID379
Feature activation-1.71e-02

Pos logit contributions
ocious+0.324
itime+0.323
iegel+0.322
 stranded+0.321
uri+0.318
Neg logit contributions
catentry-0.438
 Slater-0.436
 Accountability-0.429
imov-0.428
isite-0.424
Token the
Token ID262
Feature activation-9.82e-02

Pos logit contributions
catentry+0.012
 Slater+0.012
 Accountability+0.012
imov+0.012
 relapse+0.012
Neg logit contributions
ocious-0.011
itime-0.011
iegel-0.011
 stranded-0.011
uri-0.011
Token 2012
Token ID2321
Feature activation-5.02e-01

Pos logit contributions
catentry+0.071
 Slater+0.071
imov+0.069
 Accountability+0.069
 relapse+0.069
Neg logit contributions
ocious-0.060
itime-0.060
 stranded-0.060
iegel-0.060
uri-0.059
Token Western
Token ID4885
Feature activation-9.34e-01

Pos logit contributions
catentry+0.361
 Slater+0.360
 Accountability+0.353
imov+0.353
 relapse+0.349
Neg logit contributions
ocious-0.312
itime-0.311
iegel-0.308
 stranded-0.308
uri-0.306
Token Pacific
Token ID8211
Feature activation-2.30e+00

Pos logit contributions
 Slater+0.633
catentry+0.630
 Accountability+0.620
imov+0.618
 relapse+0.611
Neg logit contributions
ocious-0.617
itime-0.616
iegel-0.611
 stranded-0.608
uri-0.605
Token Ge
Token ID2269
Feature activation-1.70e-01

Pos logit contributions
 Slater+1.518
 Accountability+1.482
catentry+1.476
imov+1.452
 relapse+1.444
Neg logit contributions
itime-1.570
ocious-1.565
iegel-1.548
 stranded-1.535
uri-1.526
Tokenoph
Token ID2522
Feature activation+6.34e-01

Pos logit contributions
catentry+0.114
 Slater+0.114
 Accountability+0.112
imov+0.111
 relapse+0.110
Neg logit contributions
ocious-0.114
itime-0.113
iegel-0.112
uri-0.112
 stranded-0.112
Tokenysics
Token ID23154
Feature activation-3.41e-01

Pos logit contributions
itime+0.466
ocious+0.466
iegel+0.463
uri+0.460
 stranded+0.459
Neg logit contributions
 Slater-0.381
catentry-0.381
 Accountability-0.373
imov-0.371
 relapse-0.368
Token Meeting
Token ID22244
Feature activation-1.17e+00

Pos logit contributions
 Slater+0.242
catentry+0.242
 Accountability+0.237
imov+0.237
 relapse+0.234
Neg logit contributions
ocious-0.214
itime-0.213
iegel-0.212
 stranded-0.211
uri-0.210
Token in
Token ID287
Feature activation-3.64e-01

Pos logit contributions
 Slater+0.895
catentry+0.888
 Accountability+0.879
imov+0.875
 relapse+0.865
Neg logit contributions
ocious-0.673
itime-0.673
iegel-0.665
 stranded-0.661
uri-0.657
Token Whereas
Token ID28016
Feature activation-2.39e-01

Pos logit contributions
catentry+0.522
 Slater+0.516
imov+0.507
 Accountability+0.504
 relapse+0.504
Neg logit contributions
ocious-0.520
itime-0.519
 stranded-0.518
iegel-0.517
uri-0.516
Token normal
Token ID3487
Feature activation-1.96e+00

Pos logit contributions
catentry+0.172
 Slater+0.172
 Accountability+0.169
imov+0.168
isite+0.166
Neg logit contributions
 stranded-0.147
ocious-0.147
itime-0.147
iegel-0.146
uri-0.145
Token kidney
Token ID21919
Feature activation+1.17e+00

Pos logit contributions
 Slater+1.350
catentry+1.339
 Accountability+1.323
imov+1.317
 relapse+1.307
Neg logit contributions
ocious-1.272
itime-1.268
iegel-1.255
uri-1.243
 stranded-1.243
Token cells
Token ID4778
Feature activation-9.97e-01

Pos logit contributions
ocious+0.735
itime+0.732
 stranded+0.730
iegel+0.727
uri+0.724
Neg logit contributions
catentry-0.829
 Slater-0.829
imov-0.815
 Accountability-0.812
pron-0.800
Token (
Token ID357
Feature activation-1.36e+00

Pos logit contributions
catentry+0.716
 Slater+0.712
 Accountability+0.700
imov+0.698
isite+0.691
Neg logit contributions
 stranded-0.620
ocious-0.618
itime-0.618
iegel-0.617
uri-0.610
Tokenleft
Token ID9464
Feature activation-2.36e+00

Pos logit contributions
 Slater+0.916
catentry+0.911
 Accountability+0.895
imov+0.894
pron+0.884
Neg logit contributions
itime-0.909
ocious-0.909
iegel-0.900
 stranded-0.896
uri-0.890
Token)
Token ID8
Feature activation-1.13e+00

Pos logit contributions
 Slater+1.840
catentry+1.814
 Accountability+1.794
 relapse+1.790
imov+1.789
Neg logit contributions
ocious-1.320
itime-1.310
iegel-1.293
uri-1.278
 stranded-1.269
Token are
Token ID389
Feature activation-9.86e-01

Pos logit contributions
catentry+0.817
 Slater+0.814
 Accountability+0.800
imov+0.797
isite+0.787
Neg logit contributions
ocious-0.696
itime-0.696
 stranded-0.695
iegel-0.693
uri-0.686
Token arranged
Token ID14921
Feature activation-1.48e-01

Pos logit contributions
 Slater+0.690
catentry+0.689
 Accountability+0.678
imov+0.676
 relapse+0.667
Neg logit contributions
ocious-0.627
itime-0.626
 stranded-0.624
iegel-0.622
uri-0.617
Token next
Token ID1306
Feature activation+7.57e-01

Pos logit contributions
catentry+0.114
 Slater+0.113
 Accountability+0.112
imov+0.111
isite+0.110
Neg logit contributions
 stranded-0.085
ocious-0.084
itime-0.084
iegel-0.084
uri-0.083
Token to
Token ID284
Feature activation-3.53e-01

Pos logit contributions
ocious+0.396
itime+0.396
iegel+0.391
 stranded+0.389
uri+0.387
Neg logit contributions
 Slater-0.617
catentry-0.614
 Accountability-0.606
imov-0.604
 relapse-0.599
Tokenday
Token ID820
Feature activation+4.58e-01

Pos logit contributions
uri+0.420
iegel+0.417
ocious+0.417
itime+0.417
 stranded+0.414
Neg logit contributions
 Slater-0.532
catentry-0.530
 Accountability-0.525
imov-0.518
 relapse-0.517
Token deadlines
Token ID36773
Feature activation+1.33e+00

Pos logit contributions
itime+0.333
ocious+0.332
 stranded+0.332
iegel+0.330
uri+0.327
Neg logit contributions
 Slater-0.279
catentry-0.278
imov-0.272
 Accountability-0.270
pron-0.268
Token and
Token ID290
Feature activation+2.18e-01

Pos logit contributions
ocious+0.710
 stranded+0.710
itime+0.708
iegel+0.706
uri+0.701
Neg logit contributions
catentry-1.064
 Slater-1.059
imov-1.041
 Accountability-1.040
isite-1.034
Token instead
Token ID2427
Feature activation-7.95e-01

Pos logit contributions
 stranded+0.136
itime+0.136
ocious+0.136
iegel+0.135
uri+0.134
Neg logit contributions
 Slater-0.154
catentry-0.153
imov-0.151
 Accountability-0.150
pron-0.150
Token require
Token ID2421
Feature activation-7.08e-01

Pos logit contributions
 Slater+0.573
catentry+0.572
imov+0.563
 Accountability+0.561
pron+0.556
Neg logit contributions
itime-0.488
ocious-0.488
 stranded-0.486
iegel-0.484
uri-0.480
Token that
Token ID326
Feature activation-2.28e+00

Pos logit contributions
 Slater+0.495
catentry+0.494
 Accountability+0.485
imov+0.485
 relapse+0.479
Neg logit contributions
ocious-0.451
itime-0.450
iegel-0.447
 stranded-0.445
uri-0.443
Token complaints
Token ID9687
Feature activation-1.66e+00

Pos logit contributions
 Slater+1.608
 Accountability+1.587
catentry+1.586
imov+1.552
 relapse+1.551
Neg logit contributions
itime-1.457
ocious-1.450
iegel-1.434
uri-1.415
 stranded-1.414
Token be
Token ID307
Feature activation-1.89e+00

Pos logit contributions
 Slater+1.257
catentry+1.251
 Accountability+1.234
imov+1.229
 relapse+1.217
Neg logit contributions
ocious-0.965
itime-0.964
iegel-0.954
 stranded-0.949
uri-0.944
Token processed
Token ID13686
Feature activation-2.88e+00

Pos logit contributions
 Slater+1.338
catentry+1.335
imov+1.309
 Accountability+1.309
 relapse+1.291
Neg logit contributions
itime-1.190
ocious-1.189
 stranded-1.178
iegel-1.178
uri-1.170
Token
Token ID564
Feature activation-6.26e-01

Pos logit contributions
 Slater+1.972
catentry+1.942
 Accountability+1.939
imov+1.917
 relapse+1.906
Neg logit contributions
ocious-1.883
itime-1.878
iegel-1.860
 stranded-1.841
uri-1.841
Token
Token ID250
Feature activation-2.74e-01

Pos logit contributions
 Slater+0.490
catentry+0.488
 Accountability+0.482
imov+0.479
 relapse+0.476
Neg logit contributions
ocious-0.347
itime-0.346
 stranded-0.342
iegel-0.342
uri-0.341