TOP ACTIVATIONS
MAX = 4.902

how is the Court of Appeals going to know from looking
dead ," one of the officers told Di allo , the
said in response to the commissioner 's report , released on
best education they can and then use it to ' lead
to the media about the diplomatic talks , said the sides
client when it ’s done , optionally providing results
is occasionally dod gy and occasionally brilliant , hits one out
information , how is the Court of Appeals going to know
- his philosophy , is basically an isolation ist ," Cheney
a very complicated ( and sometimes volatile ) one , especially
Milo 's comments in the video " very troubling and upsetting
I wasn ’t really sure what that actually meant
the prophet Mohammed , was not because I am against the
She questioned if it was absolutely necessary but doctors said she
to engage in an open and straightforward communication with staff ,
, and I 'm a little bit angry at him for
( the library ), the server does the work then not
why she had allowed a parliamentary debate in July on continuing
so it 's always a little bit strange ."
off to entertain ( or be entertained ) elsewhere .

MIN ACTIVATIONS
MIN = -5.226

there are many industries that didn 't even exist when the
Annie Hall and Inter iors didn ’t already signify
osexual act ", because it doesn 't involve the opposite sex
. While Wallace didn ’t struggle getting
medicinal uses of marijuana that don ’t have the
work is when the result doesn ’t feel like
to be here , I didn ’t know if
- ed cigarettes don ’t kill
nominated . Theodore Be ale didn ’t just beat
Good win says he doesn ’t plan to
Pat rik L aine hasn ’t exactly been
wall where natural geographic barriers don ’t already exist
Still , while I didn ’t entirely get
plays within that . It doesn ’t just play
will prefer spectator activities that don ’t require much
must admit that RPC simply doesn ’t fit this
Good win doesn ’t want to
2013 This game didn ’t lack for
That was me ! I didn ’t realize then
your Even though the president didn 't sign a single bill

INTERVAL 2.370 - 4.902
CONTAINS 0.671%

, and it ’s just funny , it
, to the explosive , in the case of candid assessments
, then she ’s not going to be able
- California , in a radio debate hosted by La Opinion
late Friday hunted for another teen suspected of beating an 88

INTERVAL -0.162 - 2.370
CONTAINS 56.607%

moving downward , the authors write .
) and so on . Clan - based personal names did
to do , Warren said . I
and I guess maybe I will go out under another Not
, a brand new , fully integrated point of sale system

INTERVAL -2.694 - -0.162
CONTAINS 42.441%

of Earth . He ’s the most primitive man
characteristics of both . Al ma from Ex Lib
As a result , a National Sleep Foundation poll
and has no idea of the complexities out there ."
is five years . Head water ( 01 606

INTERVAL -5.226 - -2.694
CONTAINS 0.280%

- old company Path acqu ired back in June . Talk
the brush . I personally didn ’t find painting
plumbing culprit s couldn ’t be accused
s privacy and I don ’t like their
But their tour didn ’t mark the
Token how
Token ID703
Feature activation+8.29e-01

Pos logit contributions
iencies+0.987
nces+0.980
adoes+0.924
uilt+0.918
ortium+0.918
Neg logit contributions
 proceeding-0.507
 pretending-0.504
 parting-0.504
 inaction-0.488
 questioning-0.479
Token is
Token ID318
Feature activation+2.20e+00

Pos logit contributions
 proceeding+4.641
 pretending+4.611
 parting+4.566
 questioning+4.514
 inaction+4.510
Neg logit contributions
iencies-4.631
nces-4.593
adoes-4.238
uilt-4.236
ortium-4.174
Token the
Token ID262
Feature activation+8.18e-01

Pos logit contributions
 proceeding+8.567
 pretending+8.359
 questioning+8.347
 parting+8.112
 inaction+8.076
Neg logit contributions
iencies-15.542
nces-15.526
adoes-14.642
uilt-14.580
ortium-14.348
Token Court
Token ID3078
Feature activation+4.28e+00

Pos logit contributions
 proceeding+3.707
 pretending+3.694
 parting+3.643
 questioning+3.595
 inaction+3.591
Neg logit contributions
iencies-5.414
nces-5.398
uilt-5.052
adoes-5.019
ortium-4.991
Token of
Token ID286
Feature activation+2.02e+00

Pos logit contributions
 proceeding+20.244
 questioning+19.074
 pretending+18.918
 telling+18.897
 asking+18.875
Neg logit contributions
nces-20.175
iencies-20.032
uilt-18.821
adoes-18.670
ortium-18.164
Token Appeals
Token ID20172
Feature activation+4.90e+00

Pos logit contributions
 proceeding+8.797
 pretending+8.389
 inaction+8.274
 asking+8.269
 questioning+8.256
Neg logit contributions
iencies-12.975
nces-12.937
uilt-12.651
ortium-12.060
emale-11.685
Token going
Token ID1016
Feature activation+4.02e-01

Pos logit contributions
 proceeding+21.358
 questioning+20.500
 asking+20.347
 telling+20.321
 confirming+20.143
Neg logit contributions
nces-24.154
iencies-23.909
uilt-22.490
ortium-22.329
emale-22.143
Token to
Token ID284
Feature activation+2.20e+00

Pos logit contributions
 proceeding+2.168
 pretending+2.154
 parting+2.142
 inaction+2.118
 questioning+2.112
Neg logit contributions
iencies-2.321
nces-2.306
adoes-2.133
uilt-2.131
ortium-2.099
Token know
Token ID760
Feature activation+2.53e-01

Pos logit contributions
 proceeding+8.888
 questioning+8.607
 pretending+8.540
 confirming+8.358
 parting+8.271
Neg logit contributions
nces-15.029
iencies-14.922
uilt-14.200
ortium-14.062
adoes-13.763
Token from
Token ID422
Feature activation+2.04e+00

Pos logit contributions
 proceeding+0.890
 pretending+0.883
 parting+0.876
 inaction+0.861
 questioning+0.851
Neg logit contributions
iencies-1.902
nces-1.889
adoes-1.789
uilt-1.777
ortium-1.766
Token looking
Token ID2045
Feature activation+1.11e-01

Pos logit contributions
 proceeding+7.941
 pretending+7.753
 questioning+7.741
 parting+7.595
 inaction+7.532
Neg logit contributions
iencies-14.290
nces-14.240
uilt-13.503
adoes-13.395
ortium-13.318
Token dead
Token ID2636
Feature activation+1.05e+00

Pos logit contributions
 proceeding+5.045
 pretending+5.000
 parting+4.945
 questioning+4.911
 inaction+4.887
Neg logit contributions
iencies-5.855
nces-5.835
uilt-5.426
adoes-5.396
ortium-5.343
Token,"
Token ID553
Feature activation+1.41e+00

Pos logit contributions
 proceeding+5.072
 pretending+5.027
 parting+4.959
 questioning+4.926
 inaction+4.895
Neg logit contributions
iencies-6.711
nces-6.684
uilt-6.249
adoes-6.214
ortium-6.155
Token one
Token ID530
Feature activation+1.98e+00

Pos logit contributions
 proceeding+6.853
 pretending+6.789
 parting+6.697
 questioning+6.683
 inaction+6.597
Neg logit contributions
iencies-8.827
nces-8.795
uilt-8.203
adoes-8.143
ortium-8.081
Token of
Token ID286
Feature activation+1.04e+00

Pos logit contributions
 proceeding+7.866
 pretending+7.712
 questioning+7.689
 parting+7.666
 inaction+7.373
Neg logit contributions
iencies-14.035
nces-14.028
uilt-13.185
adoes-13.089
ortium-13.045
Token the
Token ID262
Feature activation+1.33e+00

Pos logit contributions
 proceeding+3.676
 pretending+3.630
 parting+3.569
 questioning+3.524
 inaction+3.509
Neg logit contributions
iencies-8.014
nces-7.995
uilt-7.549
adoes-7.530
ortium-7.467
Token officers
Token ID3790
Feature activation+4.72e+00

Pos logit contributions
 proceeding+5.809
 pretending+5.731
 questioning+5.673
 parting+5.668
 inaction+5.573
Neg logit contributions
iencies-9.088
nces-9.056
uilt-8.518
adoes-8.423
ortium-8.405
Token told
Token ID1297
Feature activation+2.63e+00

Pos logit contributions
 shouting+19.049
 asking+19.011
 questioning+18.860
 proceeding+18.851
 saying+18.809
Neg logit contributions
nces-23.799
iencies-23.152
ortium-22.581
uilt-22.518
emale-21.385
Token Di
Token ID6031
Feature activation+1.03e+00

Pos logit contributions
 proceeding+13.181
 questioning+12.964
 pretending+12.905
 parting+12.681
 asking+12.445
Neg logit contributions
iencies-14.755
nces-14.506
uilt-13.696
ortium-13.286
adoes-13.202
Tokenallo
Token ID49457
Feature activation+1.40e+00

Pos logit contributions
 proceeding+6.780
 pretending+6.682
 questioning+6.592
 inaction+6.568
 hiding+6.525
Neg logit contributions
nces-4.661
iencies-4.537
uilt-4.164
adoes-4.100
ortium-4.008
Token,
Token ID11
Feature activation+4.55e-01

Pos logit contributions
 proceeding+6.177
 pretending+6.146
 questioning+6.032
 parting+6.023
 inaction+5.904
Neg logit contributions
iencies-9.455
nces-9.385
uilt-8.820
adoes-8.733
ortium-8.699
Token the
Token ID262
Feature activation+2.40e-01

Pos logit contributions
 proceeding+2.404
 parting+2.373
 pretending+2.371
 inaction+2.370
 questioning+2.328
Neg logit contributions
iencies-2.623
nces-2.608
adoes-2.437
uilt-2.420
ortium-2.396
Token said
Token ID531
Feature activation+1.01e+00

Pos logit contributions
 proceeding+2.338
 pretending+2.316
 parting+2.283
 questioning+2.258
 inaction+2.255
Neg logit contributions
iencies-4.060
nces-4.047
uilt-3.803
adoes-3.795
ortium-3.755
Token in
Token ID287
Feature activation+2.48e+00

Pos logit contributions
 proceeding+4.371
 pretending+4.322
 parting+4.264
 questioning+4.234
 inaction+4.199
Neg logit contributions
iencies-6.982
nces-6.956
uilt-6.538
adoes-6.503
ortium-6.448
Token response
Token ID2882
Feature activation+2.41e+00

Pos logit contributions
 questioning+10.117
 proceeding+10.042
 parting+9.791
 confirming+9.773
 pretending+9.622
Neg logit contributions
nces-16.449
iencies-16.410
uilt-15.780
ortium-15.459
adoes-15.404
Token to
Token ID284
Feature activation+2.74e+00

Pos logit contributions
 questioning+9.714
 proceeding+9.407
 confirming+9.311
 asking+9.279
 pretending+9.228
Neg logit contributions
iencies-15.152
nces-14.951
uilt-14.481
adoes-14.118
ortium-14.116
Token the
Token ID262
Feature activation+1.22e+00

Pos logit contributions
 questioning+12.776
 proceeding+12.395
 asking+12.019
 parting+11.998
 inaction+11.970
Neg logit contributions
nces-15.886
iencies-15.838
uilt-15.241
adoes-15.160
ortium-14.492
Token commissioner
Token ID16248
Feature activation+4.43e+00

Pos logit contributions
 proceeding+5.996
 pretending+5.860
 parting+5.843
 questioning+5.836
 inaction+5.790
Neg logit contributions
nces-7.618
iencies-7.617
uilt-7.129
adoes-7.084
ortium-7.003
Token's
Token ID338
Feature activation+1.91e+00

Pos logit contributions
 asking+18.615
 questioning+18.575
 proceeding+18.572
 confirming+18.275
 saying+17.912
Neg logit contributions
iencies-21.960
uilt-21.486
nces-21.263
emale-20.202
adoes-20.079
Token report
Token ID989
Feature activation+1.84e+00

Pos logit contributions
 proceeding+8.732
 questioning+8.656
 parting+8.519
 inaction+8.507
 pretending+8.368
Neg logit contributions
iencies-12.198
nces-12.171
uilt-11.482
adoes-11.418
ortium-11.343
Token,
Token ID11
Feature activation+5.50e-01

Pos logit contributions
 proceeding+8.134
 questioning+8.003
 pretending+7.945
 parting+7.849
 confirming+7.806
Neg logit contributions
iencies-12.188
nces-12.172
uilt-11.502
adoes-11.350
ortium-11.295
Token released
Token ID2716
Feature activation+8.36e-01

Pos logit contributions
 proceeding+2.498
 pretending+2.488
 parting+2.459
 inaction+2.437
 questioning+2.406
Neg logit contributions
iencies-3.621
nces-3.601
uilt-3.373
adoes-3.356
ortium-3.338
Token on
Token ID319
Feature activation+1.54e+00

Pos logit contributions
 proceeding+3.224
 pretending+3.175
 parting+3.129
 questioning+3.118
 inaction+3.074
Neg logit contributions
iencies-6.128
nces-6.109
uilt-5.782
adoes-5.739
ortium-5.703
Token best
Token ID1266
Feature activation+1.51e+00

Pos logit contributions
 proceeding+5.351
 pretending+5.271
 parting+5.220
 questioning+5.209
 inaction+5.102
Neg logit contributions
iencies-7.690
nces-7.605
uilt-7.158
ortium-7.106
adoes-7.093
Token education
Token ID3707
Feature activation+2.21e+00

Pos logit contributions
 proceeding+6.293
 pretending+6.127
 parting+6.073
 questioning+6.067
 inaction+5.903
Neg logit contributions
iencies-10.596
nces-10.449
uilt-9.908
adoes-9.800
ortium-9.787
Token they
Token ID484
Feature activation+1.38e-01

Pos logit contributions
 proceeding+6.741
 pretending+6.674
 questioning+6.434
 parting+6.434
 hiding+6.322
Neg logit contributions
iencies-16.149
nces-15.929
adoes-15.542
uilt-15.484
ortium-15.317
Token can
Token ID460
Feature activation+1.83e-01

Pos logit contributions
 proceeding+0.830
 pretending+0.823
 parting+0.820
 inaction+0.814
 questioning+0.811
Neg logit contributions
iencies-0.716
nces-0.712
uilt-0.651
adoes-0.648
ortium-0.639
Token and
Token ID290
Feature activation+2.19e+00

Pos logit contributions
 proceeding+1.233
 pretending+1.218
 parting+1.217
 inaction+1.209
 questioning+1.205
Neg logit contributions
nces-0.815
iencies-0.811
uilt-0.731
adoes-0.727
ortium-0.720
Token then
Token ID788
Feature activation+4.36e+00

Pos logit contributions
 pretending+6.463
 proceeding+6.390
 questioning+6.046
 parting+6.036
 hiding+5.933
Neg logit contributions
iencies-17.064
nces-16.720
uilt-15.950
adoes-15.943
ortium-15.857
Token use
Token ID779
Feature activation+1.55e+00

Pos logit contributions
 pretending+14.433
 proceeding+14.108
 questioning+13.656
 inaction+13.618
 asking+13.615
Neg logit contributions
iencies-25.400
nces-24.167
ortium-24.069
uilt-23.748
adoes-23.174
Token it
Token ID340
Feature activation+1.42e+00

Pos logit contributions
 proceeding+4.244
 pretending+4.214
 questioning+4.086
 inaction+4.059
 parting+4.007
Neg logit contributions
iencies-12.975
nces-12.925
uilt-12.448
ortium-12.267
adoes-12.218
Token to
Token ID284
Feature activation+1.54e+00

Pos logit contributions
 pretending+6.508
 proceeding+6.495
 questioning+6.396
 parting+6.284
 inaction+6.244
Neg logit contributions
iencies-9.105
nces-9.037
uilt-8.764
ortium-8.566
adoes-8.425
Token '
Token ID705
Feature activation+1.77e+00

Pos logit contributions
 proceeding+6.447
 pretending+6.405
 questioning+6.297
 parting+6.291
 inaction+6.239
Neg logit contributions
iencies-10.585
nces-10.492
uilt-9.911
ortium-9.825
adoes-9.819
Tokenlead
Token ID28230
Feature activation+1.66e+00

Pos logit contributions
 proceeding+10.175
 pretending+9.984
 questioning+9.927
 inaction+9.740
 confirming+9.692
Neg logit contributions
iencies-8.959
nces-8.816
uilt-8.315
adoes-8.022
ortium-8.002
Token to
Token ID284
Feature activation+1.50e+00

Pos logit contributions
 proceeding+5.211
 pretending+5.138
 questioning+5.076
 parting+5.055
 confirming+4.967
Neg logit contributions
nces-8.309
iencies-8.268
uilt-7.839
adoes-7.711
ortium-7.643
Token the
Token ID262
Feature activation+1.06e+00

Pos logit contributions
 proceeding+7.876
 pretending+7.789
 questioning+7.717
 parting+7.666
 inaction+7.608
Neg logit contributions
nces-8.814
iencies-8.763
uilt-8.184
adoes-8.094
ortium-7.912
Token media
Token ID2056
Feature activation+1.83e+00

Pos logit contributions
 proceeding+4.175
 pretending+4.120
 parting+4.059
 questioning+4.032
 inaction+3.998
Neg logit contributions
iencies-7.672
nces-7.655
uilt-7.211
adoes-7.171
ortium-7.104
Token about
Token ID546
Feature activation+1.86e+00

Pos logit contributions
 proceeding+9.180
 pretending+9.067
 questioning+9.007
 confirming+8.935
 parting+8.779
Neg logit contributions
nces-10.884
iencies-10.808
uilt-10.272
adoes-10.065
ortium-9.866
Token the
Token ID262
Feature activation+8.68e-01

Pos logit contributions
 proceeding+7.495
 parting+7.289
 pretending+7.234
 questioning+7.209
 inaction+7.130
Neg logit contributions
nces-13.133
iencies-12.928
uilt-12.312
adoes-12.278
ortium-12.056
Token diplomatic
Token ID13093
Feature activation+4.34e+00

Pos logit contributions
 proceeding+3.711
 pretending+3.676
 parting+3.623
 questioning+3.595
 inaction+3.572
Neg logit contributions
iencies-5.996
nces-5.973
uilt-5.611
adoes-5.586
ortium-5.539
Token talks
Token ID6130
Feature activation+2.12e+00

Pos logit contributions
 proceeding+20.652
 parting+19.807
 inaction+19.282
 questioning+18.546
 pretending+18.418
Neg logit contributions
nces-20.958
uilt-20.348
iencies-19.861
ortium-19.281
emale-18.639
Token,
Token ID11
Feature activation+6.04e-02

Pos logit contributions
 proceeding+10.065
 pretending+9.816
 confirming+9.707
 questioning+9.668
 parting+9.610
Neg logit contributions
nces-13.060
iencies-13.043
uilt-12.232
adoes-12.126
ortium-11.869
Token said
Token ID531
Feature activation+9.00e-01

Pos logit contributions
 proceeding+0.264
 pretending+0.264
 inaction+0.264
 parting+0.261
 bothering+0.256
Neg logit contributions
iencies-0.404
nces-0.397
adoes-0.373
uilt-0.370
ortium-0.370
Token the
Token ID262
Feature activation+3.95e-01

Pos logit contributions
 proceeding+3.968
 pretending+3.933
 parting+3.876
 questioning+3.844
 inaction+3.823
Neg logit contributions
iencies-6.102
nces-6.077
uilt-5.697
adoes-5.674
ortium-5.628
Token sides
Token ID5389
Feature activation+1.82e+00

Pos logit contributions
 pretending+1.831
 proceeding+1.830
 parting+1.784
 questioning+1.783
 inaction+1.772
Neg logit contributions
iencies-2.567
nces-2.537
uilt-2.379
adoes-2.377
ortium-2.361
Token client
Token ID5456
Feature activation+3.03e+00

Pos logit contributions
 proceeding+5.117
 pretending+5.016
 parting+4.943
 questioning+4.865
 inaction+4.815
Neg logit contributions
iencies-11.742
nces-11.727
uilt-11.066
adoes-11.027
ortium-10.877
Token when
Token ID618
Feature activation+2.57e+00

Pos logit contributions
 proceeding+14.248
 pretending+14.117
 asking+14.040
 confirming+14.029
 parting+13.753
Neg logit contributions
iencies-17.145
nces-16.677
adoes-15.716
uilt-15.637
ortium-15.478
Token it
Token ID340
Feature activation+7.94e-02

Pos logit contributions
 proceeding+10.985
 parting+10.572
 confirming+10.533
 inaction+10.516
 pretending+10.510
Neg logit contributions
iencies-16.429
nces-16.384
adoes-15.679
uilt-15.302
ortium-15.220
Token
Token ID447
Feature activation+3.07e-01

Pos logit contributions
 pretending+0.508
 proceeding+0.508
 parting+0.502
 questioning+0.500
 inaction+0.495
Neg logit contributions
iencies-0.378
nces-0.373
uilt-0.341
adoes-0.337
-0.330
Token
Token ID247
Feature activation+2.82e-01

Pos logit contributions
 proceeding+1.851
 pretending+1.842
 parting+1.822
 questioning+1.807
 inaction+1.800
Neg logit contributions
iencies-1.586
nces-1.578
uilt-1.450
adoes-1.436
ortium-1.417
Tokens
Token ID82
Feature activation+4.34e+00

Pos logit contributions
 proceeding+1.496
 pretending+1.489
 parting+1.477
 questioning+1.462
 inaction+1.462
Neg logit contributions
nces-1.675
iencies-1.673
uilt-1.536
adoes-1.534
ortium-1.507
Token done
Token ID1760
Feature activation+1.25e+00

Pos logit contributions
 proceeding+17.267
 asking+16.658
 confirming+16.330
 pretending+16.325
 doing+16.300
Neg logit contributions
iencies-24.032
nces-23.791
adoes-22.745
ortium-22.303
uilt-22.210
Token,
Token ID11
Feature activation+1.11e+00

Pos logit contributions
 proceeding+6.379
 pretending+6.317
 parting+6.228
 questioning+6.175
 inaction+6.143
Neg logit contributions
iencies-7.555
nces-7.523
uilt-7.006
adoes-6.976
ortium-6.887
Token optionally
Token ID42976
Feature activation+3.36e+00

Pos logit contributions
 proceeding+6.707
 pretending+6.645
 parting+6.565
 questioning+6.523
 inaction+6.520
Neg logit contributions
nces-5.725
iencies-5.722
uilt-5.240
adoes-5.213
ortium-5.141
Token providing
Token ID4955
Feature activation+2.17e+00

Pos logit contributions
 proceeding+15.111
 confirming+14.885
 pretending+14.785
 asking+14.632
 hiding+14.307
Neg logit contributions
nces-19.403
iencies-19.283
uilt-18.599
ortium-18.108
adoes-18.086
Token results
Token ID2482
Feature activation+2.28e+00

Pos logit contributions
 proceeding+11.081
 pretending+10.922
 inaction+10.879
 parting+10.797
 confirming+10.747
Neg logit contributions
iencies-12.152
nces-12.083
uilt-11.436
adoes-11.347
ortium-11.289
Token is
Token ID318
Feature activation+7.38e-01

Pos logit contributions
iencies+2.311
nces+2.297
uilt+2.130
adoes+2.126
ortium+2.094
Neg logit contributions
 proceeding-2.028
 inaction-2.021
 parting-2.015
 pretending-2.005
 questioning-1.959
Token occasionally
Token ID10491
Feature activation+2.26e+00

Pos logit contributions
 proceeding+3.569
 pretending+3.541
 parting+3.506
 questioning+3.469
 inaction+3.469
Neg logit contributions
iencies-4.676
nces-4.673
uilt-4.362
adoes-4.339
ortium-4.295
Token dod
Token ID20764
Feature activation-6.22e-01

Pos logit contributions
 proceeding+9.612
 pretending+9.388
 questioning+9.281
 parting+9.247
 hiding+9.036
Neg logit contributions
iencies-14.809
nces-14.522
uilt-13.853
adoes-13.688
ortium-13.550
Tokengy
Token ID1360
Feature activation-8.59e-01

Pos logit contributions
iencies+3.443
nces+3.365
uilt+3.173
adoes+3.126
ortium+3.039
Neg logit contributions
 proceeding-3.510
 inaction-3.466
 parting-3.465
 pretending-3.451
 questioning-3.389
Token and
Token ID290
Feature activation+1.68e+00

Pos logit contributions
nces+4.981
iencies+4.911
adoes+4.599
uilt+4.581
ortium+4.454
Neg logit contributions
 proceeding-4.268
 parting-4.250
 inaction-4.238
 pretending-4.154
 questioning-4.143
Token occasionally
Token ID10491
Feature activation+4.30e+00

Pos logit contributions
 proceeding+8.334
 pretending+8.316
 questioning+8.181
 parting+8.135
 inaction+8.028
Neg logit contributions
iencies-10.130
nces-10.064
uilt-9.474
ortium-9.356
adoes-9.311
Token brilliant
Token ID10457
Feature activation+1.19e-01

Pos logit contributions
 proceeding+18.982
 pretending+18.853
 telling+18.738
 questioning+18.492
 asking+18.396
Neg logit contributions
iencies-20.705
nces-20.299
uilt-19.118
ortium-18.980
emale-18.088
Token,
Token ID11
Feature activation-5.93e-01

Pos logit contributions
 proceeding+0.580
 pretending+0.570
 parting+0.569
 inaction+0.567
 questioning+0.561
Neg logit contributions
iencies-0.751
nces-0.750
uilt-0.697
adoes-0.697
ortium-0.682
Token hits
Token ID7127
Feature activation-7.23e-01

Pos logit contributions
iencies+3.397
nces+3.374
adoes+3.154
uilt+3.129
ortium+3.061
Neg logit contributions
 inaction-3.080
 proceeding-3.076
 pretending-3.010
 parting-3.003
 questioning-2.983
Token one
Token ID530
Feature activation+1.25e+00

Pos logit contributions
nces+5.086
iencies+5.059
adoes+4.808
uilt+4.717
ortium+4.601
Neg logit contributions
 proceeding-2.795
 inaction-2.706
 pretending-2.694
 questioning-2.665
 parting-2.663
Token out
Token ID503
Feature activation+9.84e-01

Pos logit contributions
 proceeding+5.301
 pretending+5.244
 parting+5.178
 questioning+5.131
 inaction+5.097
Neg logit contributions
iencies-8.659
nces-8.634
uilt-8.111
adoes-8.069
ortium-8.006
Token information
Token ID1321
Feature activation+1.74e-01

Pos logit contributions
 proceeding+3.355
 pretending+3.332
 parting+3.282
 inaction+3.246
 questioning+3.246
Neg logit contributions
iencies-4.770
nces-4.759
uilt-4.437
adoes-4.429
ortium-4.377
Token,
Token ID11
Feature activation-1.37e-01

Pos logit contributions
 proceeding+0.747
 parting+0.741
 pretending+0.740
 inaction+0.735
 questioning+0.713
Neg logit contributions
iencies-1.168
nces-1.165
adoes-1.091
uilt-1.080
ortium-1.076
Token how
Token ID703
Feature activation+8.29e-01

Pos logit contributions
iencies+0.987
nces+0.980
adoes+0.924
uilt+0.918
ortium+0.918
Neg logit contributions
 proceeding-0.507
 pretending-0.504
 parting-0.504
 inaction-0.488
 questioning-0.479
Token is
Token ID318
Feature activation+2.20e+00

Pos logit contributions
 proceeding+4.641
 pretending+4.611
 parting+4.566
 questioning+4.514
 inaction+4.510
Neg logit contributions
iencies-4.631
nces-4.593
adoes-4.238
uilt-4.236
ortium-4.174
Token the
Token ID262
Feature activation+8.18e-01

Pos logit contributions
 proceeding+8.567
 pretending+8.359
 questioning+8.347
 parting+8.112
 inaction+8.076
Neg logit contributions
iencies-15.542
nces-15.526
adoes-14.642
uilt-14.580
ortium-14.348
Token Court
Token ID3078
Feature activation+4.28e+00

Pos logit contributions
 proceeding+3.707
 pretending+3.694
 parting+3.643
 questioning+3.595
 inaction+3.591
Neg logit contributions
iencies-5.414
nces-5.398
uilt-5.052
adoes-5.019
ortium-4.991
Token of
Token ID286
Feature activation+2.02e+00

Pos logit contributions
 proceeding+20.244
 questioning+19.074
 pretending+18.918
 telling+18.897
 asking+18.875
Neg logit contributions
nces-20.175
iencies-20.032
uilt-18.821
adoes-18.670
ortium-18.164
Token Appeals
Token ID20172
Feature activation+4.90e+00

Pos logit contributions
 proceeding+8.797
 pretending+8.389
 inaction+8.274
 asking+8.269
 questioning+8.256
Neg logit contributions
iencies-12.975
nces-12.937
uilt-12.651
ortium-12.060
emale-11.685
Token going
Token ID1016
Feature activation+4.02e-01

Pos logit contributions
 proceeding+21.358
 questioning+20.500
 asking+20.347
 telling+20.321
 confirming+20.143
Neg logit contributions
nces-24.154
iencies-23.909
uilt-22.490
ortium-22.329
emale-22.143
Token to
Token ID284
Feature activation+2.20e+00

Pos logit contributions
 proceeding+2.168
 pretending+2.154
 parting+2.142
 inaction+2.118
 questioning+2.112
Neg logit contributions
iencies-2.321
nces-2.306
adoes-2.133
uilt-2.131
ortium-2.099
Token know
Token ID760
Feature activation+2.53e-01

Pos logit contributions
 proceeding+8.888
 questioning+8.607
 pretending+8.540
 confirming+8.358
 parting+8.271
Neg logit contributions
nces-15.029
iencies-14.922
uilt-14.200
ortium-14.062
adoes-13.763
Token -
Token ID532
Feature activation+2.27e+00

Pos logit contributions
 proceeding+2.118
 pretending+2.112
 parting+2.069
 inaction+2.036
 questioning+2.035
Neg logit contributions
iencies-2.969
nces-2.958
adoes-2.771
uilt-2.753
ortium-2.716
Token his
Token ID465
Feature activation+1.18e+00

Pos logit contributions
 questioning+11.195
 proceeding+11.166
 pretending+11.075
 parting+10.990
 inaction+10.971
Neg logit contributions
iencies-13.010
nces-12.988
uilt-12.080
adoes-12.008
ortium-11.927
Token philosophy
Token ID8876
Feature activation+8.84e-01

Pos logit contributions
 proceeding+5.721
 pretending+5.668
 parting+5.615
 questioning+5.586
 inaction+5.555
Neg logit contributions
iencies-7.472
nces-7.450
uilt-6.957
adoes-6.898
ortium-6.854
Token,
Token ID11
Feature activation+9.00e-01

Pos logit contributions
 proceeding+4.705
 pretending+4.663
 parting+4.612
 questioning+4.589
 inaction+4.554
Neg logit contributions
iencies-5.181
nces-5.159
uilt-4.796
adoes-4.765
ortium-4.717
Token is
Token ID318
Feature activation+2.89e+00

Pos logit contributions
 proceeding+3.892
 pretending+3.853
 parting+3.799
 questioning+3.776
 inaction+3.746
Neg logit contributions
iencies-6.173
nces-6.156
uilt-5.779
adoes-5.750
ortium-5.707
Token basically
Token ID6209
Feature activation+4.15e+00

Pos logit contributions
 pretending+12.821
 proceeding+12.671
 questioning+12.637
 inaction+12.329
 hiding+12.252
Neg logit contributions
iencies-17.494
nces-17.379
ortium-16.393
uilt-16.084
adoes-15.929
Token an
Token ID281
Feature activation+2.57e+00

Pos logit contributions
 pretending+18.067
 questioning+17.669
 saying+17.653
 proceeding+17.589
 trying+17.489
Neg logit contributions
iencies-21.386
nces-21.341
ortium-19.927
uilt-19.403
adoes-19.188
Token isolation
Token ID15133
Feature activation+1.29e+00

Pos logit contributions
 pretending+8.468
 inaction+8.430
 proceeding+8.294
 questioning+8.226
 parting+8.209
Neg logit contributions
nces-18.556
iencies-18.407
ortium-17.413
uilt-17.393
adoes-16.926
Tokenist
Token ID396
Feature activation+6.87e-01

Pos logit contributions
 proceeding+6.368
 pretending+6.348
 questioning+6.199
 parting+6.191
 inaction+6.136
Neg logit contributions
nces-8.096
iencies-8.029
uilt-7.507
ortium-7.435
adoes-7.393
Token,"
Token ID553
Feature activation+2.02e-01

Pos logit contributions
 proceeding+3.209
 pretending+3.168
 parting+3.144
 questioning+3.107
 inaction+3.103
Neg logit contributions
iencies-4.452
nces-4.448
uilt-4.156
adoes-4.147
ortium-4.095
Token Cheney
Token ID31287
Feature activation+1.86e+00

Pos logit contributions
 proceeding+1.081
 pretending+1.070
 parting+1.058
 inaction+1.055
 questioning+1.038
Neg logit contributions
nces-1.158
iencies-1.155
uilt-1.080
ortium-1.069
adoes-1.069
Token a
Token ID257
Feature activation+1.54e+00

Pos logit contributions
 proceeding+7.570
 pretending+7.408
 parting+7.323
 questioning+7.322
 bothering+7.086
Neg logit contributions
iencies-12.594
nces-12.471
uilt-11.728
ortium-11.636
adoes-11.569
Token very
Token ID845
Feature activation+1.93e+00

Pos logit contributions
 proceeding+6.688
 pretending+6.580
 parting+6.572
 questioning+6.479
 inaction+6.353
Neg logit contributions
iencies-10.482
nces-10.437
uilt-9.755
adoes-9.677
ortium-9.614
Token complicated
Token ID8253
Feature activation-4.33e-01

Pos logit contributions
 proceeding+7.778
 parting+7.688
 pretending+7.630
 questioning+7.535
 confirming+7.272
Neg logit contributions
iencies-13.676
nces-13.546
uilt-12.699
ortium-12.540
adoes-12.486
Token (
Token ID357
Feature activation-6.10e-01

Pos logit contributions
nces+3.037
iencies+3.025
adoes+2.961
uilt+2.957
ortium+2.890
Neg logit contributions
 pretending-1.517
 inaction-1.499
 confirming-1.476
 proceeding-1.452
 questioning-1.443
Tokenand
Token ID392
Feature activation+2.93e+00

Pos logit contributions
nces+2.650
iencies+2.611
adoes+2.417
uilt+2.372
+2.342
Neg logit contributions
 parting-4.015
 proceeding-3.982
 pretending-3.951
 inaction-3.945
 questioning-3.874
Token sometimes
Token ID3360
Feature activation+4.12e+00

Pos logit contributions
 proceeding+13.393
 questioning+13.029
 pretending+12.997
 parting+12.976
 bothering+12.596
Neg logit contributions
iencies-17.623
nces-17.360
uilt-16.276
ortium-15.974
adoes-15.751
Token volatile
Token ID22750
Feature activation+4.87e-03

Pos logit contributions
 proceeding+17.189
 questioning+16.966
 pretending+16.688
 parting+16.601
 bothering+16.492
Neg logit contributions
iencies-23.338
nces-22.681
uilt-21.613
ortium-20.848
emale-20.456
Token)
Token ID8
Feature activation+6.87e-01

Pos logit contributions
 proceeding+0.025
 pretending+0.025
 parting+0.024
 inaction+0.024
 questioning+0.024
Neg logit contributions
nces-0.029
iencies-0.029
uilt-0.027
adoes-0.027
ortium-0.027
Token one
Token ID530
Feature activation+2.21e-01

Pos logit contributions
 proceeding+2.375
 pretending+2.372
 parting+2.312
 inaction+2.305
 questioning+2.291
Neg logit contributions
iencies-5.258
nces-5.253
uilt-4.994
adoes-4.972
ortium-4.922
Token,
Token ID11
Feature activation-1.08e+00

Pos logit contributions
 inaction+1.032
 proceeding+1.030
 pretending+1.029
 parting+1.021
 questioning+1.006
Neg logit contributions
nces-1.401
iencies-1.392
uilt-1.308
adoes-1.308
ortium-1.301
Token especially
Token ID2592
Feature activation+3.25e-01

Pos logit contributions
iencies+5.761
nces+5.684
adoes+5.393
uilt+5.342
ortium+5.317
Neg logit contributions
 inaction-5.461
 parting-5.318
 proceeding-5.296
 pretending-5.283
 questioning-5.121
Token Milo
Token ID32662
Feature activation+2.21e+00

Pos logit contributions
nces+1.024
iencies+1.012
ortium+0.956
adoes+0.953
uilt+0.945
Neg logit contributions
 proceeding-0.658
 pretending-0.646
 inaction-0.633
 parting-0.633
 questioning-0.617
Token's
Token ID338
Feature activation+4.98e-01

Pos logit contributions
 proceeding+9.917
 parting+9.916
 pretending+9.914
 questioning+9.884
 inaction+9.660
Neg logit contributions
iencies-14.160
nces-14.013
uilt-13.368
adoes-12.879
ortium-12.853
Token comments
Token ID3651
Feature activation+2.43e+00

Pos logit contributions
 pretending+2.005
 proceeding+2.000
 parting+1.919
 confirming+1.905
 questioning+1.904
Neg logit contributions
nces-3.493
iencies-3.493
adoes-3.306
uilt-3.281
ortium-3.258
Token in
Token ID287
Feature activation+2.83e+00

Pos logit contributions
 questioning+9.875
 proceeding+9.704
 pretending+9.673
 parting+9.483
 confirming+9.386
Neg logit contributions
iencies-16.538
nces-16.434
uilt-15.816
ortium-15.207
adoes-15.028
Token the
Token ID262
Feature activation+1.44e+00

Pos logit contributions
 questioning+14.104
 proceeding+13.786
 pretending+13.781
 parting+13.768
 confirming+13.481
Neg logit contributions
iencies-15.846
nces-15.493
uilt-14.561
adoes-14.507
ortium-14.014
Token video
Token ID2008
Feature activation+4.07e+00

Pos logit contributions
 proceeding+6.136
 pretending+5.982
 parting+5.978
 questioning+5.967
 inaction+5.811
Neg logit contributions
nces-9.967
iencies-9.966
uilt-9.286
adoes-9.214
ortium-9.121
Token "
Token ID366
Feature activation+1.01e+00

Pos logit contributions
 questioning+17.895
 proceeding+17.189
 pretending+17.170
 asking+16.838
 telling+16.773
Neg logit contributions
iencies-22.290
nces-21.805
uilt-21.213
ortium-20.273
emale-19.856
Tokenvery
Token ID548
Feature activation+1.59e+00

Pos logit contributions
 proceeding+5.529
 pretending+5.481
 parting+5.440
 questioning+5.384
 inaction+5.367
Neg logit contributions
iencies-5.735
nces-5.729
uilt-5.299
adoes-5.273
ortium-5.220
Token troubling
Token ID20294
Feature activation-7.17e-01

Pos logit contributions
 proceeding+5.640
 pretending+5.566
 parting+5.484
 questioning+5.474
 inaction+5.348
Neg logit contributions
iencies-12.053
nces-11.940
uilt-11.351
adoes-11.236
ortium-11.175
Token and
Token ID290
Feature activation+2.30e+00

Pos logit contributions
nces+4.584
iencies+4.493
adoes+4.223
uilt+4.199
ortium+4.130
Neg logit contributions
 proceeding-3.071
 parting-3.052
 pretending-3.041
 inaction-2.958
 bothering-2.926
Token upsetting
Token ID38423
Feature activation+4.64e-02

Pos logit contributions
 proceeding+9.834
 questioning+9.761
 pretending+9.686
 parting+9.542
 confirming+9.387
Neg logit contributions
iencies-15.311
nces-15.084
uilt-14.417
adoes-14.169
ortium-14.131
Token I
Token ID314
Feature activation+6.01e-01

Pos logit contributions
 proceeding+9.555
 pretending+9.481
 parting+9.366
 questioning+9.363
 inaction+9.228
Neg logit contributions
iencies-12.065
nces-12.027
adoes-11.208
ortium-11.174
uilt-11.121
Token wasn
Token ID2492
Feature activation-1.66e+00

Pos logit contributions
 proceeding+3.114
 pretending+3.087
 parting+3.052
 questioning+3.033
 inaction+3.013
Neg logit contributions
iencies-3.610
nces-3.597
uilt-3.346
adoes-3.326
ortium-3.295
Token
Token ID447
Feature activation+1.05e+00

Pos logit contributions
iencies+4.856
+4.827
nces+4.504
adoes+4.349
ados+3.965
Neg logit contributions
 parting-12.485
 pretending-12.211
 questioning-12.183
 inaction-12.129
 proceeding-12.076
Token
Token ID247
Feature activation+1.63e+00

Pos logit contributions
 proceeding+5.728
 pretending+5.678
 parting+5.623
 questioning+5.597
 inaction+5.548
Neg logit contributions
iencies-5.899
nces-5.877
uilt-5.439
adoes-5.417
ortium-5.377
Tokent
Token ID83
Feature activation+3.38e+00

Pos logit contributions
 proceeding+8.248
 pretending+8.186
 questioning+8.063
 parting+8.057
 confirming+7.907
Neg logit contributions
iencies-9.634
nces-9.510
adoes-8.886
uilt-8.812
ortium-8.806
Token really
Token ID1107
Feature activation+3.96e+00

Pos logit contributions
 pretending+14.232
 proceeding+14.181
 questioning+14.038
 parting+13.840
 asking+13.813
Neg logit contributions
iencies-21.002
nces-20.283
uilt-19.173
ortium-19.164
adoes-18.381
Token sure
Token ID1654
Feature activation+9.31e-01

Pos logit contributions
 questioning+16.560
 pretending+16.543
 proceeding+16.382
 bothering+16.247
 asking+16.203
Neg logit contributions
iencies-23.114
nces-21.743
ortium-20.979
uilt-20.847
emale-20.447
Token what
Token ID644
Feature activation+8.05e-01

Pos logit contributions
 proceeding+3.218
 pretending+3.178
 parting+3.125
 questioning+3.092
 inaction+3.077
Neg logit contributions
iencies-7.185
nces-7.173
uilt-6.781
adoes-6.757
ortium-6.699
Token that
Token ID326
Feature activation+1.76e+00

Pos logit contributions
 proceeding+3.725
 pretending+3.691
 parting+3.640
 questioning+3.612
 inaction+3.599
Neg logit contributions
iencies-5.278
nces-5.265
uilt-4.938
adoes-4.902
ortium-4.852
Token actually
Token ID1682
Feature activation+1.11e+00

Pos logit contributions
 proceeding+7.883
 pretending+7.846
 parting+7.800
 questioning+7.765
 inaction+7.653
Neg logit contributions
iencies-11.655
nces-11.492
adoes-10.734
uilt-10.649
ortium-10.644
Token meant
Token ID4001
Feature activation+6.01e-01

Pos logit contributions
 proceeding+4.767
 pretending+4.718
 parting+4.652
 questioning+4.624
 inaction+4.572
Neg logit contributions
iencies-7.697
nces-7.651
uilt-7.179
adoes-7.168
ortium-7.094
Token the
Token ID262
Feature activation-6.41e-01

Pos logit contributions
 proceeding+2.205
 pretending+2.171
 parting+2.156
 questioning+2.121
 inaction+2.115
Neg logit contributions
iencies-2.765
nces-2.763
adoes-2.578
uilt-2.553
ortium-2.525
Token prophet
Token ID22298
Feature activation+5.02e-01

Pos logit contributions
iencies+3.830
nces+3.803
adoes+3.617
uilt+3.511
ortium+3.483
Neg logit contributions
 pretending-3.017
 proceeding-3.015
 parting-2.966
 bothering-2.936
 confirming-2.911
Token Mohammed
Token ID19773
Feature activation+1.07e+00

Pos logit contributions
 proceeding+2.471
 pretending+2.432
 parting+2.416
 inaction+2.410
 questioning+2.388
Neg logit contributions
iencies-3.142
nces-3.132
adoes-2.924
uilt-2.919
ortium-2.874
Token,
Token ID11
Feature activation+3.94e-01

Pos logit contributions
 proceeding+5.361
 pretending+5.324
 parting+5.250
 questioning+5.225
 inaction+5.180
Neg logit contributions
iencies-6.591
nces-6.565
uilt-6.106
adoes-6.066
ortium-6.015
Token was
Token ID373
Feature activation+3.03e+00

Pos logit contributions
 proceeding+2.079
 pretending+2.056
 parting+2.046
 inaction+2.028
 questioning+2.008
Neg logit contributions
iencies-2.307
nces-2.300
adoes-2.135
uilt-2.131
ortium-2.108
Token not
Token ID407
Feature activation+3.96e+00

Pos logit contributions
 proceeding+13.791
 pretending+13.653
 questioning+13.635
 asking+13.245
 confirming+13.225
Neg logit contributions
iencies-18.040
nces-17.393
uilt-16.505
ortium-16.409
adoes-16.005
Token because
Token ID780
Feature activation+1.45e+00

Pos logit contributions
 proceeding+17.450
 questioning+17.046
 pretending+17.035
 bothering+16.974
 asking+16.849
Neg logit contributions
iencies-22.145
nces-20.991
uilt-20.343
ortium-19.607
emale-19.116
Token I
Token ID314
Feature activation+1.92e-01

Pos logit contributions
 proceeding+6.999
 pretending+6.950
 parting+6.849
 questioning+6.833
 inaction+6.779
Neg logit contributions
iencies-9.131
nces-9.106
uilt-8.523
adoes-8.458
ortium-8.395
Token am
Token ID716
Feature activation+2.09e+00

Pos logit contributions
 proceeding+0.951
 pretending+0.932
 parting+0.932
 inaction+0.926
 questioning+0.908
Neg logit contributions
iencies-1.187
nces-1.184
uilt-1.099
adoes-1.099
ortium-1.087
Token against
Token ID1028
Feature activation+1.01e+00

Pos logit contributions
 pretending+9.078
 questioning+8.979
 proceeding+8.959
 parting+8.728
 confirming+8.624
Neg logit contributions
iencies-13.911
nces-13.782
uilt-13.012
ortium-12.817
adoes-12.802
Token the
Token ID262
Feature activation-1.08e-01

Pos logit contributions
 proceeding+4.614
 pretending+4.562
 parting+4.512
 questioning+4.460
 inaction+4.448
Neg logit contributions
iencies-6.695
nces-6.666
uilt-6.247
adoes-6.220
ortium-6.152
TokenShe
Token ID3347
Feature activation+1.70e+00

Pos logit contributions
 proceeding+8.631
 pretending+8.511
 confirming+8.349
 questioning+8.349
 inaction+8.323
Neg logit contributions
iencies-12.049
nces-11.929
uilt-11.339
ortium-11.050
adoes-11.041
Token questioned
Token ID11434
Feature activation+1.80e+00

Pos logit contributions
 questioning+5.032
 proceeding+4.959
 pretending+4.820
 parting+4.650
 confirming+4.565
Neg logit contributions
nces-13.755
iencies-13.753
uilt-13.141
adoes-13.067
ortium-12.818
Token if
Token ID611
Feature activation+2.93e+00

Pos logit contributions
 proceeding+5.965
 questioning+5.953
 pretending+5.882
 parting+5.673
 confirming+5.670
Neg logit contributions
iencies-13.899
nces-13.821
uilt-13.293
ortium-13.216
adoes-13.074
Token it
Token ID340
Feature activation+1.24e+00

Pos logit contributions
 proceeding+6.072
 pretending+5.686
 questioning+5.598
 asking+5.522
 confirming+5.405
Neg logit contributions
nces-25.606
uilt-24.715
iencies-24.683
ortium-24.164
emale-24.137
Token was
Token ID373
Feature activation+2.22e+00

Pos logit contributions
 proceeding+5.993
 questioning+5.864
 pretending+5.857
 parting+5.750
 confirming+5.703
Neg logit contributions
nces-7.984
iencies-7.882
uilt-7.472
ortium-7.244
adoes-7.228
Token absolutely
Token ID5543
Feature activation+3.90e+00

Pos logit contributions
 proceeding+6.407
 questioning+6.088
 pretending+5.966
 confirming+5.820
 parting+5.817
Neg logit contributions
nces-17.647
iencies-17.475
adoes-16.703
ortium-16.594
uilt-16.504
Token necessary
Token ID3306
Feature activation+7.37e-01

Pos logit contributions
 proceeding+5.483
 questioning+4.907
 asking+4.718
 telling+4.113
 pleading+3.891
Neg logit contributions
nces-33.870
uilt-33.378
emale-32.526
ortium-32.081
iencies-31.664
Token but
Token ID475
Feature activation+2.11e+00

Pos logit contributions
 proceeding+4.277
 pretending+4.239
 parting+4.190
 questioning+4.190
 inaction+4.124
Neg logit contributions
iencies-3.976
nces-3.972
uilt-3.718
ortium-3.618
adoes-3.613
Token doctors
Token ID7519
Feature activation+2.83e+00

Pos logit contributions
 pretending+7.783
 questioning+7.664
 proceeding+7.634
 confirming+7.514
 asking+7.459
Neg logit contributions
nces-14.903
iencies-14.784
uilt-14.299
ortium-14.273
emale-13.901
Token said
Token ID531
Feature activation+1.17e+00

Pos logit contributions
 saying+7.736
 proceeding+7.569
 pretending+7.565
 hiding+7.421
 questioning+7.402
Neg logit contributions
nces-21.737
iencies-21.510
uilt-21.421
emale-21.384
ortium-21.088
Token she
Token ID673
Feature activation+1.17e+00

Pos logit contributions
 proceeding+2.846
 pretending+2.781
 questioning+2.709
 parting+2.699
 inaction+2.634
Neg logit contributions
nces-10.189
iencies-10.166
uilt-9.777
ortium-9.655
adoes-9.642
Token to
Token ID284
Feature activation+1.34e+00

Pos logit contributions
 proceeding+1.623
 pretending+1.608
 parting+1.592
 questioning+1.579
 inaction+1.572
Neg logit contributions
iencies-1.817
nces-1.811
uilt-1.681
adoes-1.670
ortium-1.653
Token engage
Token ID8209
Feature activation+5.59e-01

Pos logit contributions
 proceeding+5.562
 pretending+5.475
 questioning+5.407
 parting+5.375
 inaction+5.303
Neg logit contributions
nces-9.268
iencies-9.262
uilt-8.714
adoes-8.653
ortium-8.640
Token in
Token ID287
Feature activation+2.38e+00

Pos logit contributions
 proceeding+2.636
 pretending+2.612
 parting+2.578
 questioning+2.563
 inaction+2.547
Neg logit contributions
iencies-3.612
nces-3.600
uilt-3.374
adoes-3.353
ortium-3.325
Token an
Token ID281
Feature activation+2.59e+00

Pos logit contributions
 proceeding+8.090
 pretending+8.078
 questioning+8.067
 inaction+7.830
 parting+7.813
Neg logit contributions
nces-16.891
iencies-16.684
adoes-16.100
uilt-16.047
ortium-15.814
Token open
Token ID1280
Feature activation+2.18e+00

Pos logit contributions
 proceeding+7.044
 asking+6.850
 questioning+6.847
 pretending+6.777
 inaction+6.666
Neg logit contributions
nces-20.083
iencies-19.662
ortium-19.032
uilt-18.972
adoes-18.721
Token and
Token ID290
Feature activation+3.90e+00

Pos logit contributions
 proceeding+12.711
 questioning+12.602
 pretending+12.306
 parting+12.226
 asking+12.181
Neg logit contributions
nces-11.016
iencies-10.782
uilt-10.622
adoes-10.007
ortium-9.963
Token straightforward
Token ID15836
Feature activation+1.81e+00

Pos logit contributions
 questioning+9.782
 proceeding+9.651
 pretending+9.067
 asking+8.933
 pleading+8.854
Neg logit contributions
nces-28.491
uilt-27.464
iencies-27.378
emale-26.871
ortium-26.706
Token communication
Token ID6946
Feature activation+2.14e+00

Pos logit contributions
 proceeding+6.834
 questioning+6.583
 parting+6.529
 pretending+6.476
 inaction+6.390
Neg logit contributions
nces-13.255
iencies-13.144
uilt-12.798
adoes-12.551
ortium-12.518
Token with
Token ID351
Feature activation+1.30e+00

Pos logit contributions
 proceeding+8.817
 pretending+8.749
 questioning+8.637
 confirming+8.543
 asking+8.472
Neg logit contributions
iencies-13.787
nces-13.583
uilt-13.266
ortium-13.119
adoes-13.084
Token staff
Token ID3085
Feature activation+9.96e-01

Pos logit contributions
 proceeding+4.104
 pretending+4.060
 questioning+4.033
 parting+4.011
 inaction+3.938
Neg logit contributions
iencies-10.250
nces-10.226
uilt-9.807
ortium-9.680
adoes-9.677
Token,
Token ID11
Feature activation-4.53e-01

Pos logit contributions
 proceeding+5.264
 pretending+5.233
 parting+5.169
 questioning+5.147
 inaction+5.096
Neg logit contributions
iencies-5.842
nces-5.832
uilt-5.410
adoes-5.392
ortium-5.332
Token,
Token ID11
Feature activation-7.25e-01

Pos logit contributions
 proceeding+1.965
 inaction+1.950
 pretending+1.937
 parting+1.928
 questioning+1.911
Neg logit contributions
nces-2.965
iencies-2.961
uilt-2.814
adoes-2.786
ortium-2.765
Token and
Token ID290
Feature activation+1.30e+00

Pos logit contributions
iencies+4.316
nces+4.244
uilt+4.054
adoes+4.049
ortium+3.983
Neg logit contributions
 inaction-3.319
 parting-3.262
 proceeding-3.261
 pretending-3.209
 questioning-3.175
Token I
Token ID314
Feature activation-3.35e-01

Pos logit contributions
 proceeding+5.350
 pretending+5.299
 parting+5.224
 questioning+5.181
 inaction+5.120
Neg logit contributions
iencies-9.219
nces-9.185
uilt-8.624
adoes-8.597
ortium-8.518
Token'm
Token ID1101
Feature activation+1.26e+00

Pos logit contributions
nces+2.029
iencies+2.011
adoes+1.882
uilt+1.877
ortium+1.856
Neg logit contributions
 inaction-1.648
 proceeding-1.647
 pretending-1.608
 parting-1.607
 questioning-1.578
Token a
Token ID257
Feature activation+1.38e+00

Pos logit contributions
 proceeding+4.613
 pretending+4.567
 parting+4.485
 questioning+4.449
 inaction+4.381
Neg logit contributions
iencies-9.514
nces-9.480
uilt-8.943
adoes-8.892
ortium-8.843
Token little
Token ID1310
Feature activation+3.87e+00

Pos logit contributions
 proceeding+6.268
 pretending+6.255
 parting+6.204
 questioning+6.144
 inaction+6.042
Neg logit contributions
iencies-9.090
nces-9.071
uilt-8.454
adoes-8.395
ortium-8.355
Token bit
Token ID1643
Feature activation+3.44e+00

Pos logit contributions
 questioning+15.751
 parting+15.425
 pretending+15.326
 asking+14.969
 concerned+14.945
Neg logit contributions
nces-22.580
iencies-22.386
ortium-21.146
uilt-21.046
emale-20.465
Token angry
Token ID7954
Feature activation+1.80e-01

Pos logit contributions
 questioning+13.668
 parting+13.497
 pretending+13.439
 proceeding+13.259
 wishing+13.074
Neg logit contributions
iencies-21.654
nces-21.402
ortium-20.188
uilt-20.169
emale-19.561
Token at
Token ID379
Feature activation+1.14e+00

Pos logit contributions
 proceeding+0.746
 inaction+0.727
 pretending+0.723
 parting+0.719
 questioning+0.706
Neg logit contributions
iencies-1.243
nces-1.239
adoes-1.166
uilt-1.153
ortium-1.144
Token him
Token ID683
Feature activation+1.33e-01

Pos logit contributions
 proceeding+5.086
 pretending+5.044
 parting+4.981
 questioning+4.939
 inaction+4.905
Neg logit contributions
iencies-7.658
nces-7.633
uilt-7.153
adoes-7.120
ortium-7.052
Token for
Token ID329
Feature activation+2.09e+00

Pos logit contributions
 proceeding+0.677
 inaction+0.666
 pretending+0.659
 parting+0.658
 questioning+0.643
Neg logit contributions
nces-0.791
iencies-0.780
adoes-0.726
ortium-0.726
uilt-0.721
Token (
Token ID357
Feature activation-2.30e-01

Pos logit contributions
 proceeding+14.552
 pretending+14.385
 asking+14.082
 parting+14.004
 inaction+13.976
Neg logit contributions
iencies-15.739
nces-15.441
adoes-14.503
uilt-14.213
ortium-14.199
Tokenthe
Token ID1169
Feature activation+1.40e-01

Pos logit contributions
iencies+1.220
nces+1.219
uilt+1.124
adoes+1.124
+1.109
Neg logit contributions
 parting-1.295
 pretending-1.294
 proceeding-1.292
 questioning-1.281
 inaction-1.266
Token library
Token ID5888
Feature activation+2.14e+00

Pos logit contributions
 pretending+0.580
 proceeding+0.579
 parting+0.569
 questioning+0.567
 inaction+0.561
Neg logit contributions
iencies-0.963
nces-0.960
uilt-0.905
adoes-0.902
ortium-0.897
Token),
Token ID828
Feature activation+7.51e-01

Pos logit contributions
 proceeding+11.823
 pretending+11.685
 parting+11.459
 inaction+11.321
 questioning+11.310
Neg logit contributions
nces-11.581
iencies-11.394
uilt-10.482
adoes-10.375
ortium-10.319
Token the
Token ID262
Feature activation+1.19e+00

Pos logit contributions
 proceeding+4.492
 pretending+4.462
 parting+4.425
 questioning+4.399
 inaction+4.375
Neg logit contributions
iencies-3.896
nces-3.873
uilt-3.567
adoes-3.541
ortium-3.501
Token server
Token ID4382
Feature activation+3.85e+00

Pos logit contributions
 proceeding+5.103
 pretending+5.042
 parting+4.979
 questioning+4.935
 inaction+4.907
Neg logit contributions
iencies-8.152
nces-8.133
uilt-7.629
adoes-7.595
ortium-7.527
Token does
Token ID857
Feature activation-1.35e-01

Pos logit contributions
 proceeding+19.113
 pretending+18.840
 asking+18.709
 telling+18.362
 parting+18.355
Neg logit contributions
nces-17.748
iencies-17.685
uilt-16.756
ortium-16.496
adoes-16.149
Token the
Token ID262
Feature activation+8.31e-01

Pos logit contributions
iencies+1.014
nces+1.008
uilt+0.961
ortium+0.948
+0.947
Neg logit contributions
 proceeding-0.465
 pretending-0.459
 parting-0.458
 questioning-0.451
 inaction-0.438
Token work
Token ID670
Feature activation+1.59e+00

Pos logit contributions
 proceeding+4.088
 pretending+4.054
 parting+4.015
 questioning+3.976
 inaction+3.962
Neg logit contributions
iencies-5.182
nces-5.165
uilt-4.830
adoes-4.788
ortium-4.751
Token then
Token ID788
Feature activation+2.17e+00

Pos logit contributions
 proceeding+8.157
 pretending+8.077
 parting+7.869
 questioning+7.815
 confirming+7.791
Neg logit contributions
iencies-9.490
nces-9.469
adoes-8.800
uilt-8.794
ortium-8.647
Token not
Token ID407
Feature activation+2.43e+00

Pos logit contributions
 proceeding+11.901
 pretending+11.751
 parting+11.505
 confirming+11.462
 inaction+11.442
Neg logit contributions
iencies-11.682
nces-11.646
uilt-10.818
adoes-10.632
ortium-10.579
Token why
Token ID1521
Feature activation+3.61e-01

Pos logit contributions
iencies+0.119
nces+0.118
adoes+0.111
uilt+0.110
ortium+0.110
Neg logit contributions
 pretending-0.048
 proceeding-0.048
 parting-0.047
 questioning-0.045
 confirming-0.045
Token she
Token ID673
Feature activation-3.12e-01

Pos logit contributions
 pretending+1.214
 proceeding+1.213
 parting+1.198
 inaction+1.147
 questioning+1.146
Neg logit contributions
nces-2.792
iencies-2.788
uilt-2.629
adoes-2.609
ortium-2.596
Token had
Token ID550
Feature activation-4.36e-01

Pos logit contributions
iencies+1.727
nces+1.683
adoes+1.571
uilt+1.548
ortium+1.542
Neg logit contributions
 proceeding-1.675
 pretending-1.668
 parting-1.664
 inaction-1.638
 confirming-1.621
Token allowed
Token ID3142
Feature activation-6.19e-01

Pos logit contributions
iencies+2.854
nces+2.823
adoes+2.662
ortium+2.650
uilt+2.645
Neg logit contributions
 proceeding-1.827
 parting-1.826
 pretending-1.821
 confirming-1.765
 inaction-1.758
Token a
Token ID257
Feature activation+2.98e-01

Pos logit contributions
nces+4.265
iencies+4.261
uilt+4.001
adoes+3.926
ortium+3.925
Neg logit contributions
 pretending-2.377
 parting-2.373
 proceeding-2.361
 confirming-2.250
 bothering-2.237
Token parliamentary
Token ID17521
Feature activation+3.84e+00

Pos logit contributions
 pretending+1.278
 proceeding+1.258
 parting+1.250
 inaction+1.231
 questioning+1.220
Neg logit contributions
iencies-2.001
nces-1.990
uilt-1.876
adoes-1.856
ortium-1.851
Token debate
Token ID4384
Feature activation+1.25e+00

Pos logit contributions
 proceeding+17.525
 questioning+16.861
 inaction+16.299
 parting+16.039
 pretending+15.931
Neg logit contributions
nces-21.726
iencies-21.131
uilt-21.056
ortium-20.149
adoes-19.736
Token in
Token ID287
Feature activation+1.68e+00

Pos logit contributions
 proceeding+6.040
 pretending+5.974
 parting+5.897
 questioning+5.868
 inaction+5.820
Neg logit contributions
iencies-7.905
nces-7.879
uilt-7.363
adoes-7.329
ortium-7.257
Token July
Token ID2901
Feature activation+5.12e-01

Pos logit contributions
 proceeding+9.009
 pretending+8.897
 questioning+8.820
 parting+8.806
 inaction+8.737
Neg logit contributions
nces-9.623
iencies-9.556
uilt-8.875
adoes-8.854
ortium-8.766
Token on
Token ID319
Feature activation+8.06e-01

Pos logit contributions
 proceeding+2.569
 pretending+2.554
 parting+2.534
 inaction+2.508
 questioning+2.486
Neg logit contributions
iencies-3.118
nces-3.097
uilt-2.900
adoes-2.875
ortium-2.842
Token continuing
Token ID8282
Feature activation-3.66e-01

Pos logit contributions
 proceeding+3.587
 pretending+3.558
 parting+3.515
 questioning+3.465
 inaction+3.464
Neg logit contributions
iencies-5.421
nces-5.395
uilt-5.061
adoes-5.030
ortium-4.981
Token so
Token ID523
Feature activation+2.33e+00

Pos logit contributions
 proceeding+3.086
 pretending+3.050
 parting+3.036
 inaction+3.016
 questioning+3.006
Neg logit contributions
iencies-2.687
nces-2.666
uilt-2.452
adoes-2.450
ortium-2.406
Token it
Token ID340
Feature activation-2.83e-01

Pos logit contributions
 proceeding+8.944
 pretending+8.912
 parting+8.751
 questioning+8.735
 inaction+8.458
Neg logit contributions
nces-16.257
iencies-16.248
uilt-15.290
ortium-15.204
adoes-14.985
Token's
Token ID338
Feature activation+1.84e+00

Pos logit contributions
nces+1.650
iencies+1.642
adoes+1.548
uilt+1.490
ortium+1.459
Neg logit contributions
 inaction-1.461
 proceeding-1.447
 pretending-1.428
 parting-1.413
 questioning-1.380
Token always
Token ID1464
Feature activation+2.74e+00

Pos logit contributions
 proceeding+6.738
 pretending+6.654
 questioning+6.548
 parting+6.545
 bothering+6.345
Neg logit contributions
iencies-13.681
nces-13.648
uilt-12.889
ortium-12.773
adoes-12.694
Token a
Token ID257
Feature activation+2.42e+00

Pos logit contributions
 proceeding+10.531
 questioning+10.515
 pretending+10.452
 parting+10.361
 bothering+10.265
Neg logit contributions
iencies-18.915
nces-18.803
uilt-17.621
ortium-17.500
emale-17.330
Token little
Token ID1310
Feature activation+3.81e+00

Pos logit contributions
 proceeding+10.231
 parting+10.229
 questioning+10.209
 pretending+10.040
 bothering+9.733
Neg logit contributions
iencies-15.950
nces-15.903
uilt-14.791
ortium-14.628
adoes-14.511
Token bit
Token ID1643
Feature activation+3.33e+00

Pos logit contributions
 questioning+14.232
 parting+13.815
 proceeding+13.614
 asking+13.517
 hiding+13.467
Neg logit contributions
iencies-23.774
nces-23.541
uilt-22.654
ortium-22.643
emale-22.548
Token strange
Token ID6283
Feature activation+8.68e-01

Pos logit contributions
 questioning+12.442
 parting+12.143
 proceeding+12.128
 pretending+11.916
 asking+11.804
Neg logit contributions
iencies-22.046
nces-21.606
uilt-20.816
ortium-20.587
emale-20.449
Token."
Token ID526
Feature activation+7.60e-01

Pos logit contributions
 proceeding+4.085
 pretending+4.035
 parting+3.992
 inaction+3.970
 questioning+3.954
Neg logit contributions
iencies-5.592
nces-5.582
uilt-5.214
adoes-5.207
ortium-5.128
Token
Token ID198
Feature activation+1.63e-01

Pos logit contributions
 proceeding+3.477
 pretending+3.437
 parting+3.389
 questioning+3.368
 inaction+3.347
Neg logit contributions
iencies-5.011
nces-4.999
uilt-4.704
adoes-4.656
ortium-4.617
Token
Token ID198
Feature activation+5.83e-01

Pos logit contributions
 proceeding+0.796
 pretending+0.792
 parting+0.790
 questioning+0.778
 inaction+0.773
Neg logit contributions
iencies-1.023
nces-1.016
adoes-0.948
uilt-0.934
ortium-0.929
Token off
Token ID572
Feature activation+1.10e+00

Pos logit contributions
 proceeding+2.281
 pretending+2.252
 parting+2.231
 questioning+2.209
 inaction+2.207
Neg logit contributions
iencies-3.197
nces-3.190
uilt-2.972
adoes-2.972
ortium-2.929
Token to
Token ID284
Feature activation+1.71e+00

Pos logit contributions
 proceeding+5.416
 pretending+5.373
 parting+5.304
 questioning+5.269
 inaction+5.229
Neg logit contributions
iencies-6.922
nces-6.891
uilt-6.436
adoes-6.399
ortium-6.341
Token entertain
Token ID8204
Feature activation+4.74e-01

Pos logit contributions
 pretending+7.615
 proceeding+7.600
 parting+7.486
 questioning+7.447
 inaction+7.342
Neg logit contributions
iencies-11.222
nces-11.192
adoes-10.413
uilt-10.405
ortium-10.387
Token (
Token ID357
Feature activation-8.90e-02

Pos logit contributions
 proceeding+2.043
 pretending+2.004
 parting+1.979
 inaction+1.966
 questioning+1.965
Neg logit contributions
nces-3.267
iencies-3.262
adoes-3.050
uilt-3.049
ortium-3.014
Tokenor
Token ID273
Feature activation+2.94e+00

Pos logit contributions
nces+0.465
iencies+0.461
adoes+0.426
uilt+0.424
+0.421
Neg logit contributions
 proceeding-0.519
 parting-0.518
 pretending-0.512
 inaction-0.511
 questioning-0.505
Token be
Token ID307
Feature activation+3.79e+00

Pos logit contributions
 pretending+13.763
 questioning+13.323
 proceeding+13.246
 parting+13.090
 inaction+13.017
Neg logit contributions
iencies-16.812
nces-16.596
ortium-15.765
uilt-15.503
adoes-15.029
Token entertained
Token ID34988
Feature activation+5.88e-01

Pos logit contributions
 pretending+19.713
 questioning+18.755
 proceeding+18.739
 hiding+18.699
 parting+18.662
Neg logit contributions
iencies-17.928
nces-16.843
ortium-15.928
uilt-15.829
adoes-15.490
Token)
Token ID8
Feature activation+7.80e-01

Pos logit contributions
 proceeding+3.023
 pretending+2.990
 parting+2.965
 questioning+2.940
 inaction+2.930
Neg logit contributions
iencies-3.549
nces-3.549
uilt-3.290
adoes-3.278
ortium-3.243
Token elsewhere
Token ID8057
Feature activation+1.24e+00

Pos logit contributions
 proceeding+3.387
 pretending+3.357
 parting+3.311
 questioning+3.284
 inaction+3.259
Neg logit contributions
iencies-5.332
nces-5.313
uilt-4.989
adoes-4.961
ortium-4.922
Token.
Token ID13
Feature activation+7.91e-01

Pos logit contributions
 proceeding+5.476
 pretending+5.449
 parting+5.339
 questioning+5.312
 inaction+5.244
Neg logit contributions
iencies-8.365
nces-8.295
uilt-7.817
adoes-7.774
ortium-7.715
Token
Token ID198
Feature activation-2.66e-01

Pos logit contributions
 proceeding+3.902
 pretending+3.866
 parting+3.822
 questioning+3.796
 inaction+3.776
Neg logit contributions
iencies-4.945
nces-4.928
uilt-4.590
adoes-4.574
ortium-4.528
Token there
Token ID612
Feature activation-2.17e+00

Pos logit contributions
nces+4.033
iencies+3.940
uilt+3.684
adoes+3.680
+3.595
Neg logit contributions
 proceeding-2.378
 parting-2.347
 pretending-2.340
 questioning-2.274
 inaction-2.269
Token are
Token ID389
Feature activation-1.47e+00

Pos logit contributions
nces+9.241
iencies+8.671
adoes+8.093
uilt+7.832
+7.710
Neg logit contributions
 parting-12.393
 pretending-12.258
 inaction-12.208
 proceeding-12.195
 bothering-11.976
Token many
Token ID867
Feature activation-2.54e-01

Pos logit contributions
nces+9.096
iencies+8.878
adoes+8.216
uilt+8.085
+7.924
Neg logit contributions
 proceeding-6.225
 pretending-6.167
 parting-6.119
 bothering-6.020
 inaction-6.017
Token industries
Token ID11798
Feature activation+1.56e-03

Pos logit contributions
nces+1.485
iencies+1.481
adoes+1.357
uilt+1.356
ortium+1.316
Neg logit contributions
 proceeding-1.312
 pretending-1.305
 parting-1.283
 inaction-1.275
 bothering-1.255
Token that
Token ID326
Feature activation-2.14e+00

Pos logit contributions
 proceeding+0.006
 inaction+0.006
 parting+0.006
 pretending+0.006
 questioning+0.006
Neg logit contributions
nces-0.011
iencies-0.011
uilt-0.010
adoes-0.010
ortium-0.010
Token didn
Token ID1422
Feature activation-5.23e+00

Pos logit contributions
nces+11.564
iencies+11.477
uilt+10.627
ortium+10.417
+10.401
Neg logit contributions
 parting-9.263
 proceeding-9.211
 pretending-9.186
 inaction-9.024
 confirming-8.979
Token't
Token ID470
Feature activation-1.68e+00

Pos logit contributions
+15.569
iencies+14.246
ados+13.179
adoes+12.945
nces+12.010
Neg logit contributions
 parting-25.913
 questioning-24.901
 exha-24.776
 inaction-24.575
 pretending-24.528
Token even
Token ID772
Feature activation-6.45e-01

Pos logit contributions
iencies+9.396
nces+9.289
adoes+8.905
+8.742
uilt+8.645
Neg logit contributions
 inaction-7.747
 pretending-7.292
 confirming-7.291
 parting-7.228
 proceeding-7.210
Token exist
Token ID2152
Feature activation-2.62e+00

Pos logit contributions
iencies+3.681
nces+3.664
adoes+3.462
uilt+3.428
ortium+3.400
Neg logit contributions
 inaction-3.273
 proceeding-3.219
 parting-3.201
 pretending-3.174
 questioning-3.154
Token when
Token ID618
Feature activation-2.14e-03

Pos logit contributions
nces+13.482
iencies+13.197
adoes+12.481
+12.338
ences+12.215
Neg logit contributions
 inaction-10.415
 parting-10.036
 proceeding-9.927
 questioning-9.912
 pretending-9.675
Token the
Token ID262
Feature activation-9.41e-01

Pos logit contributions
nces+0.015
iencies+0.015
uilt+0.014
adoes+0.014
ortium+0.013
Neg logit contributions
 pretending-0.009
 proceeding-0.009
 parting-0.009
 inaction-0.009
 questioning-0.009
Token Annie
Token ID25860
Feature activation+1.81e-01

Pos logit contributions
nces+2.677
iencies+2.662
uilt+2.557
ortium+2.534
adoes+2.520
Neg logit contributions
 proceeding-1.271
 pretending-1.263
 inaction-1.226
 parting-1.222
 questioning-1.219
Token Hall
Token ID4789
Feature activation-3.01e-01

Pos logit contributions
 proceeding+0.991
 pretending+0.983
 parting+0.973
 questioning+0.969
 inaction+0.965
Neg logit contributions
iencies-1.028
nces-1.022
uilt-0.950
adoes-0.949
ortium-0.938
Token and
Token ID290
Feature activation+2.73e-01

Pos logit contributions
iencies+1.615
nces+1.614
uilt+1.509
adoes+1.502
ortium+1.470
Neg logit contributions
 proceeding-1.703
 pretending-1.688
 parting-1.682
 inaction-1.666
 questioning-1.659
Token Inter
Token ID4225
Feature activation+4.57e-01

Pos logit contributions
 proceeding+1.437
 pretending+1.424
 parting+1.406
 inaction+1.394
 questioning+1.393
Neg logit contributions
iencies-1.599
nces-1.583
uilt-1.496
adoes-1.482
ortium-1.474
Tokeniors
Token ID12706
Feature activation-8.15e-01

Pos logit contributions
 proceeding+3.384
 pretending+3.355
 parting+3.337
 questioning+3.324
 inaction+3.312
Neg logit contributions
nces-1.698
iencies-1.698
uilt-1.509
adoes-1.495
ortium-1.477
Token didn
Token ID1422
Feature activation-5.19e+00

Pos logit contributions
iencies+4.506
nces+4.492
adoes+4.226
uilt+4.203
+4.139
Neg logit contributions
 proceeding-4.225
 inaction-4.197
 pretending-4.191
 parting-4.145
 questioning-4.087
Token
Token ID447
Feature activation-6.43e-01

Pos logit contributions
+19.275
iencies+16.598
adoes+15.558
ados+15.305
nces+15.206
Neg logit contributions
 parting-23.143
 pretending-22.294
 inaction-22.002
 questioning-21.870
 proceeding-21.395
Token
Token ID247
Feature activation-9.45e-01

Pos logit contributions
iencies+2.874
nces+2.810
+2.602
uilt+2.566
adoes+2.502
Neg logit contributions
 pretending-4.242
 parting-4.205
 proceeding-4.183
 inaction-4.100
 questioning-4.097
Tokent
Token ID83
Feature activation-3.29e-01

Pos logit contributions
nces+4.973
iencies+4.897
+4.502
adoes+4.456
uilt+4.402
Neg logit contributions
 parting-5.463
 proceeding-5.421
 pretending-5.379
 inaction-5.359
 questioning-5.283
Token already
Token ID1541
Feature activation-4.32e-01

Pos logit contributions
nces+2.148
iencies+2.138
adoes+2.024
uilt+2.005
ortium+1.983
Neg logit contributions
 inaction-1.463
 proceeding-1.461
 pretending-1.453
 parting-1.429
 questioning-1.409
Token signify
Token ID44078
Feature activation-6.08e-01

Pos logit contributions
iencies+2.724
nces+2.721
adoes+2.571
uilt+2.550
ortium+2.522
Neg logit contributions
 inaction-2.005
 proceeding-1.973
 pretending-1.968
 parting-1.935
 questioning-1.889
Tokenosexual
Token ID8542
Feature activation+1.30e+00

Pos logit contributions
 proceeding+1.329
 parting+1.322
 pretending+1.320
 inaction+1.312
 questioning+1.298
Neg logit contributions
iencies-1.050
nces-1.016
uilt-0.957
adoes-0.956
ortium-0.940
Token act
Token ID719
Feature activation+1.40e-02

Pos logit contributions
 proceeding+5.159
 pretending+5.091
 parting+5.014
 questioning+4.977
 inaction+4.941
Neg logit contributions
iencies-9.383
nces-9.373
uilt-8.817
adoes-8.771
ortium-8.700
Token",
Token ID1600
Feature activation-4.17e-01

Pos logit contributions
 parting+0.059
 proceeding+0.059
 inaction+0.059
 pretending+0.059
 questioning+0.058
Neg logit contributions
iencies-0.094
nces-0.094
uilt-0.087
adoes-0.087
ortium-0.087
Token because
Token ID780
Feature activation+4.77e-01

Pos logit contributions
iencies+2.551
nces+2.515
adoes+2.376
uilt+2.369
ortium+2.328
Neg logit contributions
 proceeding-1.909
 pretending-1.895
 parting-1.895
 inaction-1.877
 questioning-1.834
Token it
Token ID340
Feature activation-1.48e+00

Pos logit contributions
 proceeding+1.822
 parting+1.783
 pretending+1.773
 questioning+1.741
 inaction+1.740
Neg logit contributions
iencies-3.452
nces-3.450
adoes-3.255
uilt-3.227
ortium-3.178
Token doesn
Token ID1595
Feature activation-5.06e+00

Pos logit contributions
iencies+8.033
nces+7.990
adoes+7.657
+7.124
ortium+7.085
Neg logit contributions
 inaction-6.900
 parting-6.784
 proceeding-6.715
 pretending-6.663
 bothering-6.436
Token't
Token ID470
Feature activation-9.46e-01

Pos logit contributions
+15.626
iencies+13.798
adoes+12.844
ados+12.779
%%+12.406
Neg logit contributions
 parting-25.734
 pretending-24.208
 questioning-24.180
 exha-24.143
 inaction-23.895
Token involve
Token ID6211
Feature activation+5.43e-01

Pos logit contributions
iencies+5.750
nces+5.670
adoes+5.425
ortium+5.306
+5.289
Neg logit contributions
 inaction-4.264
 proceeding-4.079
 pretending-4.040
 parting-4.033
 questioning-3.894
Token the
Token ID262
Feature activation-4.27e-01

Pos logit contributions
 proceeding+2.158
 pretending+2.106
 parting+2.093
 inaction+2.087
 questioning+2.068
Neg logit contributions
iencies-3.865
nces-3.854
adoes-3.610
uilt-3.588
ortium-3.536
Token opposite
Token ID6697
Feature activation+2.52e-01

Pos logit contributions
iencies+2.877
nces+2.833
adoes+2.694
uilt+2.670
ortium+2.629
Neg logit contributions
 proceeding-1.685
 pretending-1.654
 inaction-1.650
 parting-1.611
 questioning-1.590
Token sex
Token ID1714
Feature activation+6.58e-01

Pos logit contributions
 proceeding+1.045
 pretending+1.033
 parting+1.029
 inaction+1.012
 bothering+1.001
Neg logit contributions
iencies-1.737
nces-1.722
adoes-1.632
uilt-1.612
-1.593
Token.
Token ID13
Feature activation+3.25e-01

Pos logit contributions
 proceeding+7.231
 pretending+7.166
 parting+7.062
 questioning+7.028
 inaction+6.954
Neg logit contributions
iencies-8.449
nces-8.447
uilt-7.834
adoes-7.718
ortium-7.708
Token
Token ID198
Feature activation-3.67e-01

Pos logit contributions
 proceeding+1.725
 pretending+1.714
 questioning+1.692
 parting+1.691
 inaction+1.674
Neg logit contributions
iencies-1.897
nces-1.888
adoes-1.761
uilt-1.742
ortium-1.726
Token
Token ID198
Feature activation-2.80e-02

Pos logit contributions
iencies+2.118
nces+2.083
adoes+1.956
+1.953
ortium+1.876
Neg logit contributions
 parting-1.946
 pretending-1.924
 proceeding-1.916
 questioning-1.897
 inaction-1.882
TokenWhile
Token ID3633
Feature activation-4.07e-01

Pos logit contributions
iencies+0.157
nces+0.156
adoes+0.147
+0.146
uilt+0.145
Neg logit contributions
 pretending-0.153
 proceeding-0.153
 parting-0.153
 questioning-0.152
 inaction-0.148
Token Wallace
Token ID17161
Feature activation-1.01e+00

Pos logit contributions
iencies+2.602
nces+2.597
adoes+2.467
uilt+2.450
ortium+2.387
Neg logit contributions
 proceeding-1.794
 pretending-1.766
 inaction-1.729
 parting-1.727
 questioning-1.719
Token didn
Token ID1422
Feature activation-4.85e+00

Pos logit contributions
iencies+5.615
nces+5.549
adoes+5.315
uilt+5.312
+5.217
Neg logit contributions
 proceeding-5.105
 pretending-5.054
 parting-5.027
 inaction-4.968
 confirming-4.959
Token
Token ID447
Feature activation-3.60e-01

Pos logit contributions
+18.063
iencies+15.767
adoes+14.933
ados+14.424
nces+14.325
Neg logit contributions
 parting-22.729
 pretending-21.944
 questioning-21.922
 inaction-21.246
 proceeding-21.168
Token
Token ID247
Feature activation-4.91e-01

Pos logit contributions
iencies+1.746
nces+1.722
uilt+1.588
+1.553
adoes+1.542
Neg logit contributions
 pretending-2.282
 proceeding-2.264
 parting-2.262
 inaction-2.215
 questioning-2.211
Tokent
Token ID83
Feature activation-4.33e-01

Pos logit contributions
iencies+2.680
nces+2.673
adoes+2.433
uilt+2.420
+2.396
Neg logit contributions
 proceeding-2.810
 parting-2.803
 pretending-2.774
 inaction-2.758
 questioning-2.733
Token struggle
Token ID6531
Feature activation-1.36e+00

Pos logit contributions
nces+2.788
iencies+2.775
adoes+2.649
uilt+2.598
+2.573
Neg logit contributions
 proceeding-1.933
 inaction-1.917
 pretending-1.904
 parting-1.902
 confirming-1.874
Token getting
Token ID1972
Feature activation-6.24e-01

Pos logit contributions
nces+8.587
iencies+8.376
adoes+8.067
uilt+7.852
+7.838
Neg logit contributions
 inaction-5.318
 proceeding-5.313
 pretending-5.187
 parting-5.112
 questioning-5.047
Token medicinal
Token ID30381
Feature activation+1.72e-01

Pos logit contributions
iencies+3.612
nces+3.583
adoes+3.363
uilt+3.247
+3.232
Neg logit contributions
 proceeding-2.515
 pretending-2.501
 parting-2.464
 questioning-2.426
 inaction-2.423
Token uses
Token ID3544
Feature activation-1.10e+00

Pos logit contributions
 proceeding+1.117
 pretending+1.114
 parting+1.105
 inaction+1.095
 questioning+1.092
Neg logit contributions
iencies-0.808
nces-0.786
adoes-0.720
uilt-0.695
ortium-0.689
Token of
Token ID286
Feature activation-4.07e-01

Pos logit contributions
iencies+6.529
nces+6.416
adoes+5.997
+5.938
ortium+5.832
Neg logit contributions
 parting-5.042
 inaction-4.933
 proceeding-4.912
 pretending-4.855
 questioning-4.774
Token marijuana
Token ID5727
Feature activation+2.88e-01

Pos logit contributions
iencies+2.280
nces+2.244
adoes+2.094
+2.074
uilt+2.056
Neg logit contributions
 proceeding-2.098
 pretending-2.068
 parting-2.061
 inaction-2.037
 questioning-2.027
Token that
Token ID326
Feature activation-1.12e+00

Pos logit contributions
 proceeding+1.380
 parting+1.369
 pretending+1.369
 inaction+1.344
 questioning+1.339
Neg logit contributions
iencies-1.818
nces-1.796
adoes-1.676
uilt-1.675
ortium-1.660
Token don
Token ID836
Feature activation-4.77e+00

Pos logit contributions
nces+6.886
iencies+6.861
adoes+6.465
+6.438
ortium+6.393
Neg logit contributions
 parting-4.790
 pretending-4.765
 proceeding-4.732
 confirming-4.595
 questioning-4.551
Token
Token ID447
Feature activation-3.74e-01

Pos logit contributions
+17.750
iencies+15.869
adoes+14.300
nces+14.038
ados+13.821
Neg logit contributions
 parting-23.128
 pretending-22.128
 questioning-21.625
 proceeding-21.427
 inaction-21.348
Token
Token ID247
Feature activation-1.96e-01

Pos logit contributions
iencies+1.840
nces+1.812
uilt+1.668
+1.626
adoes+1.617
Neg logit contributions
 pretending-2.342
 proceeding-2.331
 parting-2.322
 questioning-2.274
 inaction-2.274
Tokent
Token ID83
Feature activation-6.21e-02

Pos logit contributions
iencies+1.074
nces+1.065
uilt+0.969
adoes+0.966
+0.960
Neg logit contributions
 proceeding-1.131
 pretending-1.120
 parting-1.119
 inaction-1.104
 questioning-1.102
Token have
Token ID423
Feature activation-6.00e-01

Pos logit contributions
iencies+0.407
nces+0.403
adoes+0.377
uilt+0.372
ortium+0.372
Neg logit contributions
 proceeding-0.282
 pretending-0.281
 parting-0.280
 inaction-0.278
 questioning-0.274
Token the
Token ID262
Feature activation-1.06e+00

Pos logit contributions
iencies+4.046
nces+3.981
adoes+3.786
+3.722
ortium+3.700
Neg logit contributions
 proceeding-2.433
 parting-2.418
 pretending-2.379
 inaction-2.332
 questioning-2.314
Token work
Token ID670
Feature activation-1.66e+00

Pos logit contributions
nces+5.493
iencies+5.473
adoes+5.238
+5.075
uilt+5.017
Neg logit contributions
 proceeding-4.313
 pretending-4.270
 inaction-4.201
 parting-4.187
 confirming-4.181
Token is
Token ID318
Feature activation+3.44e-02

Pos logit contributions
nces+9.872
iencies+9.756
adoes+9.167
+9.030
uilt+8.991
Neg logit contributions
 inaction-6.941
 proceeding-6.799
 parting-6.753
 pretending-6.702
 confirming-6.599
Token when
Token ID618
Feature activation+2.60e-01

Pos logit contributions
 parting+0.127
 proceeding+0.127
 pretending+0.127
 inaction+0.126
 questioning+0.123
Neg logit contributions
nces-0.249
iencies-0.248
adoes-0.235
uilt-0.234
-0.230
Token the
Token ID262
Feature activation-1.12e+00

Pos logit contributions
 proceeding+0.849
 pretending+0.838
 parting+0.818
 inaction+0.814
 questioning+0.813
Neg logit contributions
nces-2.043
iencies-2.038
adoes-1.928
uilt-1.928
ortium-1.896
Token result
Token ID1255
Feature activation-4.24e-01

Pos logit contributions
iencies+7.032
nces+6.939
adoes+6.607
+6.529
uilt+6.473
Neg logit contributions
 pretending-4.656
 proceeding-4.608
 inaction-4.485
 confirming-4.478
 bothering-4.399
Token doesn
Token ID1595
Feature activation-4.76e+00

Pos logit contributions
iencies+2.404
nces+2.372
adoes+2.211
uilt+2.185
+2.181
Neg logit contributions
 pretending-2.246
 proceeding-2.246
 inaction-2.216
 parting-2.206
 questioning-2.197
Token
Token ID447
Feature activation-1.06e+00

Pos logit contributions
+17.550
iencies+15.615
adoes+14.370
ados+14.078
nces+13.895
Neg logit contributions
 parting-22.498
 pretending-21.848
 questioning-21.541
 inaction-21.247
 proceeding-20.987
Token
Token ID247
Feature activation-7.10e-01

Pos logit contributions
iencies+4.425
nces+4.304
+3.879
uilt+3.834
ortium+3.734
Neg logit contributions
 pretending-7.183
 parting-7.118
 proceeding-7.039
 inaction-6.924
 questioning-6.874
Tokent
Token ID83
Feature activation+1.66e-01

Pos logit contributions
nces+3.673
iencies+3.671
uilt+3.380
+3.333
adoes+3.303
Neg logit contributions
 proceeding-4.109
 parting-4.092
 inaction-4.092
 pretending-4.091
 questioning-3.987
Token feel
Token ID1254
Feature activation+6.00e-01

Pos logit contributions
 proceeding+0.717
 pretending+0.714
 inaction+0.705
 parting+0.703
 questioning+0.695
Neg logit contributions
iencies-1.129
nces-1.126
adoes-1.059
uilt-1.053
ortium-1.044
Token like
Token ID588
Feature activation+9.40e-01

Pos logit contributions
 proceeding+2.600
 pretending+2.577
 parting+2.546
 questioning+2.516
 inaction+2.510
Neg logit contributions
iencies-4.075
nces-4.068
adoes-3.827
uilt-3.824
ortium-3.768
Token to
Token ID284
Feature activation-2.47e-01

Pos logit contributions
nces+8.993
iencies+8.571
adoes+8.479
uilt+8.418
+8.374
Neg logit contributions
 proceeding-8.897
 pretending-8.642
 parting-8.602
 confirming-8.562
 inaction-8.505
Token be
Token ID307
Feature activation+9.19e-01

Pos logit contributions
nces+1.704
iencies+1.695
adoes+1.620
uilt+1.616
+1.582
Neg logit contributions
 proceeding-0.989
 inaction-0.987
 pretending-0.985
 parting-0.969
 questioning-0.950
Token here
Token ID994
Feature activation-5.16e-01

Pos logit contributions
 proceeding+3.304
 pretending+3.248
 parting+3.208
 inaction+3.165
 questioning+3.157
Neg logit contributions
nces-6.970
iencies-6.964
uilt-6.586
adoes-6.553
ortium-6.495
Token,
Token ID11
Feature activation-1.25e+00

Pos logit contributions
nces+3.051
iencies+2.972
uilt+2.824
adoes+2.823
ortium+2.813
Neg logit contributions
 proceeding-2.525
 inaction-2.506
 parting-2.467
 pretending-2.453
 questioning-2.431
Token I
Token ID314
Feature activation-9.82e-01

Pos logit contributions
iencies+7.315
nces+7.227
uilt+7.138
+6.941
adoes+6.836
Neg logit contributions
 proceeding-5.442
 inaction-5.424
 parting-5.386
 pretending-5.224
 questioning-5.211
Token didn
Token ID1422
Feature activation-4.75e+00

Pos logit contributions
nces+6.015
iencies+5.952
uilt+5.755
+5.679
adoes+5.526
Neg logit contributions
 proceeding-4.479
 inaction-4.397
 parting-4.328
 pretending-4.291
 confirming-4.252
Token
Token ID447
Feature activation-1.19e+00

Pos logit contributions
+17.253
iencies+14.599
adoes+13.389
ados+13.097
%%+12.939
Neg logit contributions
 parting-22.847
 pretending-22.341
 questioning-22.024
 proceeding-21.958
 inaction-21.392
Token
Token ID247
Feature activation-9.85e-01

Pos logit contributions
iencies+4.973
nces+4.869
+4.521
uilt+4.419
ortium+4.185
Neg logit contributions
 pretending-7.921
 parting-7.898
 proceeding-7.786
 inaction-7.635
 questioning-7.621
Tokent
Token ID83
Feature activation-2.08e-01

Pos logit contributions
nces+4.844
iencies+4.818
+4.426
uilt+4.382
adoes+4.222
Neg logit contributions
 inaction-5.941
 proceeding-5.922
 parting-5.876
 pretending-5.843
 questioning-5.718
Token know
Token ID760
Feature activation-5.82e-01

Pos logit contributions
nces+1.341
iencies+1.329
adoes+1.261
uilt+1.254
+1.234
Neg logit contributions
 proceeding-0.936
 inaction-0.923
 pretending-0.912
 parting-0.906
 confirming-0.898
Token if
Token ID611
Feature activation+6.94e-01

Pos logit contributions
nces+4.215
iencies+4.195
adoes+4.050
uilt+4.002
+3.983
Neg logit contributions
 proceeding-1.955
 pretending-1.933
 inaction-1.906
 parting-1.881
 questioning-1.861
Token-
Token ID12
Feature activation+7.36e-01

Pos logit contributions
 proceeding+1.565
 pretending+1.527
 inaction+1.522
 parting+1.507
 bothering+1.457
Neg logit contributions
iencies-3.874
nces-3.836
adoes-3.691
uilt-3.684
ortium-3.655
Tokened
Token ID276
Feature activation+1.50e+00

Pos logit contributions
 proceeding+4.011
 pretending+3.978
 parting+3.936
 questioning+3.913
 inaction+3.891
Neg logit contributions
iencies-4.223
nces-4.211
uilt-3.899
adoes-3.875
ortium-3.834
Token
Token ID564
Feature activation+1.16e+00

Pos logit contributions
 proceeding+7.253
 pretending+7.243
 questioning+7.154
 parting+7.095
 confirming+6.965
Neg logit contributions
iencies-9.479
nces-9.403
uilt-8.743
adoes-8.689
ortium-8.575
Token
Token ID250
Feature activation+1.14e+00

Pos logit contributions
 proceeding+6.411
 pretending+6.349
 parting+6.298
 questioning+6.268
 inaction+6.219
Neg logit contributions
iencies-6.457
nces-6.444
uilt-5.964
adoes-5.930
ortium-5.871
Tokencigarettes
Token ID32529
Feature activation+2.95e-02

Pos logit contributions
 proceeding+6.406
 pretending+6.363
 parting+6.279
 questioning+6.265
 inaction+6.217
Neg logit contributions
iencies-6.320
nces-6.275
uilt-5.808
adoes-5.768
ortium-5.708
Token don
Token ID836
Feature activation-4.73e+00

Pos logit contributions
 proceeding+0.162
 pretending+0.160
 parting+0.159
 inaction+0.158
 questioning+0.157
Neg logit contributions
nces-0.165
iencies-0.164
uilt-0.153
adoes-0.152
ortium-0.151
Token
Token ID447
Feature activation+1.03e-01

Pos logit contributions
+18.604
iencies+16.630
adoes+15.271
nces+15.196
ados+14.817
Neg logit contributions
 parting-21.325
 pretending-20.332
 proceeding-19.929
 questioning-19.604
 inaction-19.469
Token
Token ID247
Feature activation-6.23e-01

Pos logit contributions
 pretending+0.625
 proceeding+0.625
 parting+0.620
 inaction+0.609
 questioning+0.608
Neg logit contributions
iencies-0.524
nces-0.520
uilt-0.478
adoes-0.470
ortium-0.465
Tokent
Token ID83
Feature activation-1.45e+00

Pos logit contributions
nces+3.361
iencies+3.329
uilt+3.026
adoes+3.025
+2.989
Neg logit contributions
 proceeding-3.541
 parting-3.489
 pretending-3.479
 inaction-3.420
 questioning-3.403
Token kill
Token ID1494
Feature activation-1.68e+00

Pos logit contributions
iencies+8.767
nces+8.637
+8.482
adoes+8.327
uilt+8.120
Neg logit contributions
 proceeding-6.075
 parting-5.927
 pretending-5.919
 inaction-5.873
 confirming-5.815
Token
Token ID447
Feature activation+7.49e-01

Pos logit contributions
iencies+10.884
nces+10.824
adoes+10.781
+10.255
uilt+9.934
Neg logit contributions
 parting-6.151
 proceeding-6.025
 questioning-5.961
 pretending-5.953
 pleading-5.867
Token nominated
Token ID19332
Feature activation-9.41e-02

Pos logit contributions
 proceeding+7.527
 pretending+7.511
 parting+7.356
 questioning+7.325
 inaction+7.300
Neg logit contributions
iencies-8.301
nces-8.252
uilt-7.679
adoes-7.625
ortium-7.544
Token.
Token ID13
Feature activation+1.55e-01

Pos logit contributions
nces+0.584
iencies+0.576
adoes+0.538
uilt+0.535
ortium+0.530
Neg logit contributions
 proceeding-0.462
 parting-0.455
 pretending-0.453
 inaction-0.450
 questioning-0.448
Token Theodore
Token ID36494
Feature activation-8.84e-01

Pos logit contributions
 proceeding+0.804
 pretending+0.795
 questioning+0.793
 parting+0.792
 inaction+0.780
Neg logit contributions
nces-0.915
iencies-0.912
adoes-0.849
-0.838
ortium-0.837
Token Be
Token ID1355
Feature activation+8.22e-01

Pos logit contributions
iencies+4.623
nces+4.548
adoes+4.278
+4.259
ortium+4.196
Neg logit contributions
 proceeding-4.758
 inaction-4.731
 parting-4.710
 pretending-4.667
 questioning-4.660
Tokenale
Token ID1000
Feature activation-1.23e+00

Pos logit contributions
 proceeding+4.444
 pretending+4.407
 parting+4.349
 questioning+4.330
 inaction+4.310
Neg logit contributions
iencies-4.737
nces-4.711
uilt-4.366
adoes-4.348
ortium-4.297
Token didn
Token ID1422
Feature activation-4.70e+00

Pos logit contributions
nces+6.928
iencies+6.767
+6.494
adoes+6.432
uilt+6.399
Neg logit contributions
 proceeding-5.817
 parting-5.741
 inaction-5.671
 questioning-5.635
 pretending-5.611
Token
Token ID447
Feature activation-6.16e-01

Pos logit contributions
+18.266
iencies+16.361
adoes+15.398
nces+15.046
ados+14.945
Neg logit contributions
 parting-21.228
 pretending-20.412
 questioning-20.286
 inaction-19.968
 proceeding-19.864
Token
Token ID247
Feature activation-6.06e-01

Pos logit contributions
iencies+3.138
nces+3.109
uilt+2.907
+2.882
adoes+2.811
Neg logit contributions
 pretending-3.644
 parting-3.620
 proceeding-3.618
 inaction-3.531
 questioning-3.519
Tokent
Token ID83
Feature activation-2.08e-01

Pos logit contributions
nces+3.202
iencies+3.173
+2.884
uilt+2.884
adoes+2.870
Neg logit contributions
 proceeding-3.528
 parting-3.526
 inaction-3.495
 pretending-3.476
 questioning-3.440
Token just
Token ID655
Feature activation-4.61e-01

Pos logit contributions
iencies+1.398
nces+1.396
adoes+1.318
uilt+1.306
+1.297
Neg logit contributions
 proceeding-0.886
 parting-0.870
 inaction-0.865
 pretending-0.860
 questioning-0.847
Token beat
Token ID4405
Feature activation-4.44e-01

Pos logit contributions
iencies+3.127
nces+3.112
adoes+2.958
uilt+2.951
+2.922
Neg logit contributions
 proceeding-1.866
 inaction-1.823
 parting-1.821
 pretending-1.796
 questioning-1.777
Token
Token ID198
Feature activation+6.67e-02

Pos logit contributions
iencies+1.812
nces+1.790
adoes+1.675
+1.661
ortium+1.620
Neg logit contributions
 parting-1.590
 pretending-1.579
 proceeding-1.577
 questioning-1.556
 inaction-1.543
TokenGood
Token ID10248
Feature activation-1.41e+00

Pos logit contributions
 proceeding+0.362
 pretending+0.362
 parting+0.361
 questioning+0.355
 inaction+0.349
Neg logit contributions
iencies-0.382
nces-0.381
adoes-0.355
-0.353
uilt-0.350
Tokenwin
Token ID5404
Feature activation-1.51e+00

Pos logit contributions
iencies+6.692
nces+6.430
uilt+5.897
adoes+5.830
ortium+5.769
Neg logit contributions
 pretending-8.492
 proceeding-8.332
 parting-8.297
 inaction-8.210
 bothering-8.149
Token says
Token ID1139
Feature activation-3.96e-01

Pos logit contributions
iencies+8.424
nces+8.290
adoes+7.852
+7.774
uilt+7.758
Neg logit contributions
 proceeding-7.196
 inaction-7.180
 pretending-7.154
 parting-7.122
 confirming-6.967
Token he
Token ID339
Feature activation-2.19e+00

Pos logit contributions
iencies+3.096
nces+3.075
adoes+2.938
uilt+2.924
+2.861
Neg logit contributions
 proceeding-1.170
 pretending-1.161
 parting-1.137
 confirming-1.111
 inaction-1.110
Token doesn
Token ID1595
Feature activation-4.70e+00

Pos logit contributions
iencies+12.106
nces+11.982
+11.590
adoes+11.270
uilt+11.266
Neg logit contributions
 proceeding-9.487
 inaction-9.440
 pretending-9.337
 confirming-9.295
 parting-9.246
Token
Token ID447
Feature activation-8.04e-01

Pos logit contributions
+17.440
iencies+15.099
adoes+13.843
%%+13.735
ados+13.670
Neg logit contributions
 parting-22.521
 pretending-21.713
 questioning-21.354
 proceeding-21.140
 inaction-21.023
Token
Token ID247
Feature activation-6.65e-01

Pos logit contributions
iencies+3.591
nces+3.511
uilt+3.179
+3.172
ortium+3.059
Neg logit contributions
 pretending-5.286
 parting-5.233
 proceeding-5.219
 inaction-5.117
 questioning-5.089
Tokent
Token ID83
Feature activation-8.27e-01

Pos logit contributions
iencies+3.502
nces+3.454
uilt+3.165
adoes+3.149
+3.139
Neg logit contributions
 proceeding-3.883
 inaction-3.847
 parting-3.845
 pretending-3.832
 questioning-3.743
Token plan
Token ID1410
Feature activation-2.01e+00

Pos logit contributions
iencies+5.172
nces+5.043
adoes+4.827
+4.815
uilt+4.766
Neg logit contributions
 inaction-3.726
 proceeding-3.690
 pretending-3.584
 confirming-3.564
 parting-3.557
Token to
Token ID284
Feature activation-4.74e-01

Pos logit contributions
iencies+11.110
nces+10.644
adoes+10.337
uilt+10.312
+10.140
Neg logit contributions
 pretending-9.179
 proceeding-9.056
 parting-9.005
 questioning-8.916
 confirming-8.903
Token
Token ID198
Feature activation-4.17e-01

Pos logit contributions
iencies+4.881
nces+4.768
+4.610
adoes+4.495
ados+4.195
Neg logit contributions
 parting-5.497
 pretending-5.365
 questioning-5.340
 proceeding-5.309
 inaction-5.253
TokenPat
Token ID12130
Feature activation-4.63e-02

Pos logit contributions
nces+2.287
iencies+2.279
+2.193
adoes+2.129
uilt+2.098
Neg logit contributions
 pretending-2.261
 parting-2.259
 proceeding-2.243
 questioning-2.238
 inaction-2.177
Tokenrik
Token ID12602
Feature activation-6.73e-01

Pos logit contributions
iencies+0.204
nces+0.203
uilt+0.186
adoes+0.182
ortium+0.180
Neg logit contributions
 proceeding-0.310
 pretending-0.309
 inaction-0.307
 questioning-0.306
 parting-0.305
Token L
Token ID406
Feature activation-1.09e+00

Pos logit contributions
iencies+2.796
nces+2.776
uilt+2.524
+2.499
adoes+2.498
Neg logit contributions
 inaction-4.402
 pretending-4.377
 proceeding-4.366
 questioning-4.363
 parting-4.281
Tokenaine
Token ID5718
Feature activation-2.07e+00

Pos logit contributions
iencies+3.835
nces+3.683
uilt+3.497
adoes+3.355
+3.328
Neg logit contributions
 proceeding-8.034
 parting-8.003
 bothering-7.906
 pretending-7.862
 inaction-7.842
Token hasn
Token ID5818
Feature activation-4.66e+00

Pos logit contributions
iencies+11.235
nces+11.162
+10.967
uilt+10.825
adoes+10.651
Neg logit contributions
 inaction-8.535
 proceeding-8.522
 pretending-8.400
 parting-8.365
 questioning-8.307
Token
Token ID447
Feature activation-1.04e+00

Pos logit contributions
+18.204
iencies+15.816
nces+14.827
adoes+14.475
tera+14.161
Neg logit contributions
 parting-21.153
 pretending-20.440
 proceeding-20.047
 questioning-19.948
 inaction-19.907
Token
Token ID247
Feature activation-1.17e+00

Pos logit contributions
iencies+4.197
nces+4.155
+3.772
uilt+3.750
adoes+3.584
Neg logit contributions
 pretending-7.053
 proceeding-6.968
 parting-6.934
 inaction-6.810
 questioning-6.804
Tokent
Token ID83
Feature activation-1.01e+00

Pos logit contributions
nces+5.871
iencies+5.743
uilt+5.299
+5.277
ortium+5.148
Neg logit contributions
 proceeding-6.923
 parting-6.919
 inaction-6.897
 pretending-6.881
 questioning-6.749
Token exactly
Token ID3446
Feature activation-5.46e-01

Pos logit contributions
nces+6.289
iencies+6.184
adoes+5.829
uilt+5.827
ortium+5.753
Neg logit contributions
 inaction-4.424
 parting-4.323
 proceeding-4.319
 pretending-4.258
 confirming-4.156
Token been
Token ID587
Feature activation+5.38e-01

Pos logit contributions
nces+3.504
iencies+3.458
uilt+3.302
adoes+3.250
ortium+3.205
Neg logit contributions
 proceeding-2.418
 inaction-2.416
 parting-2.390
 pretending-2.379
 questioning-2.309
Token wall
Token ID3355
Feature activation+4.27e-01

Pos logit contributions
iencies+6.931
nces+6.910
adoes+6.590
uilt+6.536
ortium+6.456
Neg logit contributions
 pretending-4.426
 proceeding-4.414
 parting-4.334
 bothering-4.305
 questioning-4.261
Token where
Token ID810
Feature activation-7.91e-01

Pos logit contributions
 proceeding+2.429
 pretending+2.416
 parting+2.389
 questioning+2.374
 inaction+2.365
Neg logit contributions
iencies-2.323
nces-2.314
uilt-2.149
adoes-2.139
ortium-2.096
Token natural
Token ID3288
Feature activation-6.60e-02

Pos logit contributions
nces+5.312
iencies+5.293
adoes+5.067
uilt+4.938
+4.935
Neg logit contributions
 pretending-3.102
 proceeding-3.085
 parting-3.035
 bothering-3.019
 confirming-2.968
Token geographic
Token ID22987
Feature activation-2.98e-01

Pos logit contributions
iencies+0.393
nces+0.385
adoes+0.367
ortium+0.349
uilt+0.348
Neg logit contributions
 pretending-0.338
 proceeding-0.330
 bothering-0.327
 inaction-0.327
 parting-0.326
Token barriers
Token ID14725
Feature activation-1.01e+00

Pos logit contributions
iencies+1.526
nces+1.478
adoes+1.381
uilt+1.317
ortium+1.307
Neg logit contributions
 pretending-1.770
 proceeding-1.761
 bothering-1.735
 questioning-1.729
 parting-1.728
Token don
Token ID836
Feature activation-4.66e+00

Pos logit contributions
iencies+5.754
nces+5.591
adoes+5.377
uilt+5.272
+5.247
Neg logit contributions
 proceeding-4.912
 pretending-4.828
 questioning-4.800
 inaction-4.780
 parting-4.762
Token
Token ID447
Feature activation-1.07e+00

Pos logit contributions
+18.222
iencies+17.286
adoes+15.639
nces+15.444
ados+15.444
Neg logit contributions
 parting-20.550
 pretending-20.259
 questioning-19.789
 proceeding-19.432
 inaction-19.249
Token
Token ID247
Feature activation-5.75e-01

Pos logit contributions
iencies+4.799
nces+4.630
uilt+4.309
+4.281
ortium+4.089
Neg logit contributions
 pretending-6.871
 proceeding-6.746
 parting-6.745
 inaction-6.620
 bothering-6.563
Tokent
Token ID83
Feature activation-6.01e-01

Pos logit contributions
iencies+3.217
nces+3.210
uilt+2.938
adoes+2.904
ortium+2.889
Neg logit contributions
 proceeding-3.144
 pretending-3.128
 parting-3.106
 inaction-3.062
 questioning-3.048
Token already
Token ID1541
Feature activation-5.07e-01

Pos logit contributions
iencies+3.192
nces+3.151
adoes+3.001
uilt+2.938
+2.919
Neg logit contributions
 proceeding-3.263
 pretending-3.253
 inaction-3.242
 questioning-3.220
 parting-3.186
Token exist
Token ID2152
Feature activation-2.09e+00

Pos logit contributions
iencies+2.247
nces+2.221
adoes+2.074
uilt+2.025
ortium+1.996
Neg logit contributions
 pretending-3.222
 proceeding-3.200
 questioning-3.183
 parting-3.180
 inaction-3.176
Token
Token ID198
Feature activation+2.59e-01

Pos logit contributions
iencies+2.459
nces+2.427
+2.277
adoes+2.271
ortium+2.179
Neg logit contributions
 parting-2.317
 pretending-2.288
 proceeding-2.283
 questioning-2.257
 inaction-2.242
TokenStill
Token ID9590
Feature activation+6.41e-01

Pos logit contributions
 proceeding+1.417
 parting+1.416
 pretending+1.405
 questioning+1.393
 inaction+1.370
Neg logit contributions
nces-1.477
iencies-1.466
-1.361
adoes-1.359
uilt-1.347
Token,
Token ID11
Feature activation-5.43e-02

Pos logit contributions
 proceeding+2.898
 pretending+2.865
 parting+2.834
 questioning+2.805
 inaction+2.796
Neg logit contributions
nces-4.269
iencies-4.262
uilt-3.988
adoes-3.967
ortium-3.936
Token while
Token ID981
Feature activation+3.17e-01

Pos logit contributions
nces+0.384
iencies+0.375
uilt+0.356
adoes+0.355
ortium+0.353
Neg logit contributions
 proceeding-0.214
 parting-0.209
 pretending-0.209
 inaction-0.206
 questioning-0.202
Token I
Token ID314
Feature activation-1.70e+00

Pos logit contributions
 proceeding+1.388
 pretending+1.354
 parting+1.352
 inaction+1.334
 questioning+1.320
Neg logit contributions
nces-2.142
iencies-2.112
adoes-1.981
uilt-1.971
ortium-1.952
Token didn
Token ID1422
Feature activation-4.62e+00

Pos logit contributions
nces+10.156
iencies+10.067
+9.895
ortium+9.568
uilt+9.432
Neg logit contributions
 proceeding-7.059
 inaction-6.955
 parting-6.879
 pretending-6.859
 confirming-6.680
Token
Token ID447
Feature activation-2.89e-01

Pos logit contributions
+18.066
iencies+15.842
adoes+14.265
%%+14.157
nces+14.149
Neg logit contributions
 parting-21.362
 pretending-20.910
 proceeding-20.328
 questioning-20.300
 inaction-20.258
Token
Token ID247
Feature activation-6.57e-01

Pos logit contributions
iencies+1.365
nces+1.343
+1.218
uilt+1.217
adoes+1.203
Neg logit contributions
 pretending-1.857
 proceeding-1.844
 parting-1.839
 questioning-1.806
 inaction-1.801
Tokent
Token ID83
Feature activation+1.87e-01

Pos logit contributions
iencies+3.389
nces+3.375
+3.133
uilt+3.064
adoes+3.022
Neg logit contributions
 proceeding-3.867
 parting-3.845
 inaction-3.837
 pretending-3.825
 questioning-3.745
Token entirely
Token ID5000
Feature activation+1.41e+00

Pos logit contributions
 proceeding+0.848
 pretending+0.836
 inaction+0.834
 parting+0.830
 questioning+0.811
Neg logit contributions
iencies-1.237
nces-1.232
adoes-1.156
uilt-1.146
ortium-1.131
Token get
Token ID651
Feature activation+1.23e-01

Pos logit contributions
 proceeding+6.060
 pretending+6.028
 questioning+5.992
 parting+5.972
 confirming+5.799
Neg logit contributions
nces-9.520
iencies-9.510
uilt-9.010
ortium-8.852
adoes-8.806
Token plays
Token ID5341
Feature activation-9.18e-01

Pos logit contributions
iencies+1.619
nces+1.593
uilt+1.467
adoes+1.465
+1.422
Neg logit contributions
 proceeding-1.434
 pretending-1.410
 inaction-1.399
 parting-1.388
 confirming-1.385
Token within
Token ID1626
Feature activation+1.01e-01

Pos logit contributions
nces+5.998
iencies+5.972
+5.617
adoes+5.615
uilt+5.590
Neg logit contributions
 proceeding-3.636
 parting-3.607
 inaction-3.607
 pretending-3.566
 questioning-3.478
Token that
Token ID326
Feature activation+1.03e-01

Pos logit contributions
 proceeding+0.471
 pretending+0.464
 parting+0.464
 inaction+0.454
 questioning+0.452
Neg logit contributions
iencies-0.645
nces-0.642
adoes-0.604
uilt-0.602
ortium-0.595
Token.
Token ID13
Feature activation+5.30e-01

Pos logit contributions
 pretending+0.491
 proceeding+0.491
 parting+0.484
 inaction+0.480
 confirming+0.475
Neg logit contributions
iencies-0.635
nces-0.634
adoes-0.597
uilt-0.595
ortium-0.588
Token It
Token ID632
Feature activation-1.88e+00

Pos logit contributions
 proceeding+2.650
 pretending+2.626
 parting+2.601
 questioning+2.580
 inaction+2.567
Neg logit contributions
iencies-3.282
nces-3.272
adoes-3.039
uilt-3.035
ortium-3.005
Token doesn
Token ID1595
Feature activation-4.61e+00

Pos logit contributions
nces+9.861
iencies+9.717
uilt+9.308
adoes+9.263
+9.258
Neg logit contributions
 parting-8.975
 inaction-8.963
 proceeding-8.928
 pretending-8.889
 bothering-8.574
Token
Token ID447
Feature activation-9.27e-01

Pos logit contributions
+17.057
iencies+15.251
adoes+14.189
ados+13.993
nces+13.912
Neg logit contributions
 parting-22.104
 pretending-21.271
 questioning-20.758
 proceeding-20.627
 inaction-20.621
Token
Token ID247
Feature activation-7.70e-01

Pos logit contributions
iencies+4.656
nces+4.607
uilt+4.258
+4.233
ortium+4.134
Neg logit contributions
 pretending-5.481
 parting-5.428
 proceeding-5.416
 inaction-5.315
 questioning-5.257
Tokent
Token ID83
Feature activation-5.70e-01

Pos logit contributions
nces+3.952
iencies+3.924
adoes+3.535
uilt+3.520
+3.513
Neg logit contributions
 proceeding-4.559
 parting-4.556
 pretending-4.507
 inaction-4.506
 questioning-4.387
Token just
Token ID655
Feature activation-8.44e-01

Pos logit contributions
nces+3.819
iencies+3.787
adoes+3.618
+3.556
uilt+3.534
Neg logit contributions
 proceeding-2.343
 inaction-2.315
 parting-2.305
 pretending-2.286
 confirming-2.239
Token play
Token ID711
Feature activation-7.66e-01

Pos logit contributions
nces+5.412
iencies+5.318
adoes+5.138
+5.070
uilt+5.058
Neg logit contributions
 inaction-3.559
 proceeding-3.547
 parting-3.536
 pretending-3.502
 confirming-3.359
Token will
Token ID481
Feature activation-8.77e-01

Pos logit contributions
iencies+7.556
nces+7.412
adoes+7.073
uilt+7.058
+7.003
Neg logit contributions
 proceeding-5.682
 inaction-5.666
 parting-5.655
 pretending-5.644
 bothering-5.530
Token prefer
Token ID4702
Feature activation-3.70e-01

Pos logit contributions
nces+5.611
iencies+5.578
adoes+5.344
uilt+5.280
+5.252
Neg logit contributions
 inaction-3.605
 proceeding-3.572
 pretending-3.558
 parting-3.518
 confirming-3.462
Token spectator
Token ID43893
Feature activation+2.25e-01

Pos logit contributions
nces+2.324
iencies+2.313
adoes+2.192
uilt+2.186
+2.139
Neg logit contributions
 proceeding-1.672
 pretending-1.622
 parting-1.613
 questioning-1.595
 inaction-1.584
Token activities
Token ID4568
Feature activation-1.15e+00

Pos logit contributions
 proceeding+1.077
 pretending+1.050
 parting+1.049
 questioning+1.027
 inaction+1.023
Neg logit contributions
nces-1.435
iencies-1.426
adoes-1.348
uilt-1.322
-1.300
Token that
Token ID326
Feature activation-1.32e+00

Pos logit contributions
nces+7.219
iencies+7.108
uilt+6.728
adoes+6.596
+6.558
Neg logit contributions
 inaction-4.757
 proceeding-4.692
 parting-4.645
 pretending-4.521
 questioning-4.486
Token don
Token ID836
Feature activation-4.58e+00

Pos logit contributions
nces+8.091
iencies+8.031
uilt+7.530
adoes+7.499
+7.487
Neg logit contributions
 pretending-5.478
 proceeding-5.456
 parting-5.442
 confirming-5.350
 questioning-5.337
Token
Token ID447
Feature activation-6.26e-01

Pos logit contributions
+17.553
iencies+16.093
adoes+14.594
nces+14.590
ados+14.306
Neg logit contributions
 parting-21.564
 pretending-20.632
 questioning-20.203
 inaction-20.120
 proceeding-20.056
Token
Token ID247
Feature activation-6.54e-01

Pos logit contributions
iencies+2.934
nces+2.868
uilt+2.633
+2.616
adoes+2.562
Neg logit contributions
 pretending-4.013
 proceeding-3.984
 parting-3.978
 inaction-3.901
 questioning-3.883
Tokent
Token ID83
Feature activation+8.48e-02

Pos logit contributions
nces+3.412
iencies+3.395
uilt+3.100
+3.066
adoes+3.052
Neg logit contributions
 proceeding-3.845
 parting-3.823
 pretending-3.798
 inaction-3.798
 questioning-3.742
Token require
Token ID2421
Feature activation+3.13e-02

Pos logit contributions
 proceeding+0.413
 pretending+0.409
 parting+0.407
 inaction+0.407
 questioning+0.400
Neg logit contributions
iencies-0.525
nces-0.524
adoes-0.488
uilt-0.487
ortium-0.481
Token much
Token ID881
Feature activation-2.32e-01

Pos logit contributions
 proceeding+0.121
 parting+0.119
 pretending+0.118
 inaction+0.116
 questioning+0.115
Neg logit contributions
nces-0.221
iencies-0.219
adoes-0.208
uilt-0.207
-0.204
Token must
Token ID1276
Feature activation+5.41e-01

Pos logit contributions
 pretending+1.735
 proceeding+1.735
 parting+1.707
 questioning+1.702
 inaction+1.692
Neg logit contributions
iencies-1.778
nces-1.754
adoes-1.633
uilt-1.631
ortium-1.607
Token admit
Token ID9159
Feature activation+1.11e+00

Pos logit contributions
 proceeding+2.358
 pretending+2.332
 parting+2.307
 questioning+2.285
 inaction+2.279
Neg logit contributions
iencies-3.692
nces-3.680
uilt-3.450
adoes-3.444
ortium-3.403
Token that
Token ID326
Feature activation+5.58e-01

Pos logit contributions
 proceeding+3.520
 pretending+3.468
 parting+3.406
 questioning+3.366
 inaction+3.340
Neg logit contributions
iencies-8.889
nces-8.867
uilt-8.401
adoes-8.367
ortium-8.306
Token RPC
Token ID39400
Feature activation+3.05e-01

Pos logit contributions
 proceeding+2.294
 pretending+2.273
 parting+2.245
 questioning+2.234
 inaction+2.203
Neg logit contributions
iencies-3.917
nces-3.912
uilt-3.678
adoes-3.660
ortium-3.615
Token simply
Token ID2391
Feature activation-1.12e+00

Pos logit contributions
 proceeding+1.728
 pretending+1.713
 parting+1.703
 questioning+1.687
 inaction+1.677
Neg logit contributions
iencies-1.674
nces-1.665
uilt-1.533
adoes-1.532
ortium-1.505
Token doesn
Token ID1595
Feature activation-4.56e+00

Pos logit contributions
iencies+5.547
nces+5.528
adoes+5.173
+5.087
uilt+4.990
Neg logit contributions
 proceeding-6.214
 parting-6.198
 questioning-6.159
 pretending-6.128
 inaction-6.074
Token
Token ID447
Feature activation-8.44e-01

Pos logit contributions
+16.864
iencies+14.858
adoes+13.704
ados+13.514
nces+13.346
Neg logit contributions
 parting-22.666
 pretending-21.703
 questioning-21.307
 inaction-21.252
 proceeding-21.098
Token
Token ID247
Feature activation-9.15e-01

Pos logit contributions
iencies+3.715
nces+3.664
uilt+3.341
+3.332
adoes+3.218
Neg logit contributions
 pretending-5.560
 parting-5.516
 proceeding-5.503
 inaction-5.412
 questioning-5.344
Tokent
Token ID83
Feature activation-1.49e-01

Pos logit contributions
nces+4.573
iencies+4.545
+4.140
uilt+4.127
adoes+4.075
Neg logit contributions
 parting-5.529
 proceeding-5.477
 inaction-5.419
 pretending-5.406
 questioning-5.319
Token fit
Token ID4197
Feature activation-1.34e+00

Pos logit contributions
nces+0.938
iencies+0.937
adoes+0.881
uilt+0.875
+0.866
Neg logit contributions
 proceeding-0.699
 pretending-0.692
 parting-0.691
 inaction-0.689
 questioning-0.681
Token this
Token ID428
Feature activation+3.89e-01

Pos logit contributions
nces+9.121
iencies+8.948
+8.541
adoes+8.522
uilt+8.434
Neg logit contributions
 proceeding-4.619
 pretending-4.583
 parting-4.580
 inaction-4.524
 questioning-4.467
Token
Token ID251
Feature activation+1.30e+00

Pos logit contributions
 proceeding+7.121
 pretending+7.076
 parting+7.012
 questioning+6.994
 inaction+6.954
Neg logit contributions
nces-6.150
iencies-6.136
uilt-5.669
adoes-5.639
ortium-5.607
Token
Token ID198
Feature activation+1.68e-03

Pos logit contributions
 proceeding+5.899
 pretending+5.856
 parting+5.771
 questioning+5.736
 inaction+5.712
Neg logit contributions
iencies-8.436
nces-8.435
uilt-7.925
adoes-7.837
ortium-7.796
Token
Token ID198
Feature activation+5.97e-01

Pos logit contributions
 proceeding+0.008
 parting+0.008
 pretending+0.008
 questioning+0.008
 inaction+0.008
Neg logit contributions
iencies-0.010
nces-0.010
adoes-0.010
uilt-0.009
-0.009
TokenGood
Token ID10248
Feature activation-9.22e-01

Pos logit contributions
 proceeding+3.418
 pretending+3.392
 parting+3.369
 questioning+3.338
 inaction+3.312
Neg logit contributions
iencies-3.284
nces-3.271
uilt-3.014
adoes-3.010
ortium-2.964
Tokenwin
Token ID5404
Feature activation-1.35e+00

Pos logit contributions
iencies+4.711
nces+4.577
uilt+4.210
adoes+4.192
ortium+4.127
Neg logit contributions
 pretending-5.417
 proceeding-5.362
 parting-5.313
 inaction-5.242
 bothering-5.192
Token doesn
Token ID1595
Feature activation-4.56e+00

Pos logit contributions
iencies+7.739
nces+7.610
adoes+7.209
+7.171
uilt+7.141
Neg logit contributions
 proceeding-6.419
 pretending-6.367
 inaction-6.302
 parting-6.289
 bothering-6.196
Token
Token ID447
Feature activation-6.18e-01

Pos logit contributions
+17.489
iencies+15.534
adoes+14.471
nces+14.070
ados+14.053
Neg logit contributions
 parting-21.339
 pretending-20.708
 proceeding-20.209
 questioning-20.201
 inaction-20.008
Token
Token ID247
Feature activation-4.59e-01

Pos logit contributions
iencies+2.811
nces+2.743
+2.484
uilt+2.480
adoes+2.425
Neg logit contributions
 pretending-4.054
 parting-4.042
 proceeding-4.019
 questioning-3.933
 inaction-3.928
Tokent
Token ID83
Feature activation-8.04e-01

Pos logit contributions
iencies+2.442
nces+2.424
uilt+2.219
adoes+2.214
+2.200
Neg logit contributions
 proceeding-2.654
 parting-2.638
 pretending-2.628
 inaction-2.606
 questioning-2.560
Token want
Token ID765
Feature activation-9.16e-01

Pos logit contributions
iencies+4.855
nces+4.808
adoes+4.582
+4.546
uilt+4.482
Neg logit contributions
 proceeding-3.685
 inaction-3.654
 parting-3.570
 pretending-3.562
 confirming-3.547
Token to
Token ID284
Feature activation+8.79e-02

Pos logit contributions
iencies+5.328
nces+5.258
adoes+5.037
uilt+4.981
+4.944
Neg logit contributions
 proceeding-4.445
 pretending-4.373
 parting-4.308
 confirming-4.300
 questioning-4.190
Token 2013
Token ID2211
Feature activation+2.86e-01

Pos logit contributions
iencies+4.630
nces+4.427
adoes+4.393
+4.168
 Rates+3.730
Neg logit contributions
 pretending-11.908
 questioning-11.800
 inaction-11.798
 parting-11.768
 proceeding-11.727
Token
Token ID198
Feature activation+1.56e-01

Pos logit contributions
 proceeding+1.408
 pretending+1.398
 parting+1.387
 questioning+1.374
 inaction+1.364
Neg logit contributions
iencies-1.785
nces-1.779
adoes-1.661
uilt-1.639
ortium-1.629
Token
Token ID198
Feature activation+7.51e-01

Pos logit contributions
 proceeding+0.767
 pretending+0.763
 parting+0.762
 questioning+0.751
 inaction+0.746
Neg logit contributions
iencies-0.972
nces-0.964
adoes-0.901
uilt-0.884
ortium-0.882
TokenThis
Token ID1212
Feature activation-1.04e+00

Pos logit contributions
 proceeding+4.364
 pretending+4.329
 parting+4.301
 questioning+4.268
 inaction+4.243
Neg logit contributions
iencies-4.046
nces-4.045
uilt-3.728
adoes-3.714
ortium-3.659
Token game
Token ID983
Feature activation-2.19e-01

Pos logit contributions
nces+6.246
iencies+6.157
adoes+6.042
uilt+5.956
+5.733
Neg logit contributions
 pretending-4.485
 proceeding-4.394
 confirming-4.359
 inaction-4.347
 questioning-4.303
Token didn
Token ID1422
Feature activation-4.55e+00

Pos logit contributions
nces+1.245
iencies+1.245
adoes+1.169
uilt+1.160
+1.127
Neg logit contributions
 proceeding-1.154
 pretending-1.147
 inaction-1.138
 parting-1.133
 questioning-1.117
Token
Token ID447
Feature activation-1.07e+00

Pos logit contributions
+16.321
iencies+14.091
adoes+13.047
nces+12.691
ados+12.662
Neg logit contributions
 parting-23.325
 pretending-22.377
 questioning-22.134
 proceeding-21.682
 inaction-21.453
Token
Token ID247
Feature activation-8.52e-01

Pos logit contributions
iencies+4.507
nces+4.441
+4.140
uilt+4.061
adoes+3.876
Neg logit contributions
 pretending-7.123
 parting-7.047
 proceeding-6.996
 inaction-6.872
 questioning-6.848
Tokent
Token ID83
Feature activation+1.94e-01

Pos logit contributions
nces+4.401
iencies+4.318
+3.980
uilt+3.968
adoes+3.929
Neg logit contributions
 proceeding-5.013
 parting-4.986
 pretending-4.977
 inaction-4.920
 questioning-4.852
Token lack
Token ID3092
Feature activation-9.56e-01

Pos logit contributions
 proceeding+0.848
 pretending+0.842
 parting+0.831
 inaction+0.830
 questioning+0.820
Neg logit contributions
iencies-1.309
nces-1.309
adoes-1.229
uilt-1.224
ortium-1.207
Token for
Token ID329
Feature activation+3.72e-01

Pos logit contributions
nces+5.859
iencies+5.809
adoes+5.473
+5.342
uilt+5.313
Neg logit contributions
 proceeding-4.315
 pretending-4.253
 parting-4.236
 questioning-4.211
 confirming-4.180
Token That
Token ID1320
Feature activation+3.96e-01

Pos logit contributions
nces+0.281
iencies+0.278
adoes+0.258
+0.255
uilt+0.254
Neg logit contributions
 proceeding-0.317
 pretending-0.314
 parting-0.312
 questioning-0.311
 inaction-0.307
Token was
Token ID373
Feature activation+9.39e-01

Pos logit contributions
 proceeding+1.856
 pretending+1.821
 parting+1.789
 inaction+1.787
 questioning+1.772
Neg logit contributions
nces-2.531
iencies-2.514
uilt-2.376
adoes-2.376
ortium-2.313
Token me
Token ID502
Feature activation-1.78e-01

Pos logit contributions
 proceeding+3.956
 pretending+3.912
 parting+3.859
 questioning+3.826
 inaction+3.805
Neg logit contributions
iencies-6.546
nces-6.531
uilt-6.135
adoes-6.111
ortium-6.053
Token!
Token ID0
Feature activation-6.06e-01

Pos logit contributions
nces+1.107
iencies+1.082
adoes+1.032
uilt+1.021
ortium+1.015
Neg logit contributions
 proceeding-0.840
 inaction-0.820
 parting-0.810
 pretending-0.808
 questioning-0.805
Token I
Token ID314
Feature activation-1.10e+00

Pos logit contributions
nces+3.467
iencies+3.421
+3.285
adoes+3.212
uilt+3.170
Neg logit contributions
 proceeding-3.066
 questioning-3.021
 pretending-3.020
 parting-2.993
 inaction-2.960
Token didn
Token ID1422
Feature activation-4.53e+00

Pos logit contributions
nces+6.487
iencies+6.327
uilt+6.082
adoes+6.064
+6.029
Neg logit contributions
 proceeding-5.024
 inaction-5.002
 parting-4.838
 pretending-4.818
 questioning-4.791
Token
Token ID447
Feature activation-1.11e+00

Pos logit contributions
+16.847
iencies+14.764
adoes+13.772
ados+13.408
%%+13.037
Neg logit contributions
 parting-21.896
 pretending-20.950
 questioning-20.764
 proceeding-20.452
 inaction-20.193
Token
Token ID247
Feature activation-7.24e-01

Pos logit contributions
iencies+4.774
nces+4.688
+4.367
uilt+4.205
ortium+4.155
Neg logit contributions
 pretending-6.994
 parting-6.942
 proceeding-6.917
 inaction-6.794
 bothering-6.786
Tokent
Token ID83
Feature activation+4.66e-02

Pos logit contributions
nces+3.625
iencies+3.569
+3.333
uilt+3.265
adoes+3.181
Neg logit contributions
 proceeding-4.330
 parting-4.268
 inaction-4.257
 pretending-4.238
 questioning-4.209
Token realize
Token ID6537
Feature activation-9.66e-01

Pos logit contributions
 proceeding+0.205
 pretending+0.202
 inaction+0.201
 parting+0.200
 questioning+0.198
Neg logit contributions
nces-0.308
iencies-0.308
adoes-0.290
uilt-0.289
-0.284
Token then
Token ID788
Feature activation+1.31e-01

Pos logit contributions
nces+6.995
iencies+6.946
adoes+6.723
+6.632
uilt+6.530
Neg logit contributions
 proceeding-2.955
 parting-2.857
 pretending-2.814
 inaction-2.811
 questioning-2.803
Token your
Token ID534
Feature activation-4.17e-01

Pos logit contributions
iencies+0.011
nces+0.011
adoes+0.010
uilt+0.010
+0.010
Neg logit contributions
 proceeding-0.006
 pretending-0.006
 parting-0.006
 questioning-0.006
 inaction-0.006
Token Even
Token ID3412
Feature activation-6.54e-01

Pos logit contributions
iencies+2.630
nces+2.614
adoes+2.489
uilt+2.451
+2.365
Neg logit contributions
 pretending-1.880
 proceeding-1.867
 parting-1.830
 confirming-1.785
 questioning-1.766
Token though
Token ID996
Feature activation-3.83e-01

Pos logit contributions
nces+3.964
iencies+3.932
adoes+3.732
uilt+3.700
+3.646
Neg logit contributions
 proceeding-3.058
 parting-2.993
 pretending-2.973
 inaction-2.921
 questioning-2.916
Token the
Token ID262
Feature activation-1.69e+00

Pos logit contributions
nces+2.836
iencies+2.801
adoes+2.626
uilt+2.621
+2.550
Neg logit contributions
 proceeding-1.366
 parting-1.342
 pretending-1.324
 questioning-1.278
 bothering-1.261
Token president
Token ID1893
Feature activation+1.04e-01

Pos logit contributions
nces+10.230
iencies+10.161
adoes+9.395
uilt+9.391
ortium+9.039
Neg logit contributions
 pretending-6.784
 proceeding-6.633
 bothering-6.490
 parting-6.469
 confirming-6.391
Token didn
Token ID1422
Feature activation-4.53e+00

Pos logit contributions
 proceeding+0.572
 parting+0.567
 pretending+0.566
 questioning+0.555
 inaction+0.555
Neg logit contributions
iencies-0.585
nces-0.584
adoes-0.540
uilt-0.536
ortium-0.522
Token't
Token ID470
Feature activation-6.08e-01

Pos logit contributions
+17.223
iencies+14.975
adoes+13.906
ados+13.652
nces+13.611
Neg logit contributions
 parting-21.290
 pretending-20.976
 questioning-20.665
 proceeding-20.126
 inaction-20.091
Token sign
Token ID1051
Feature activation-8.21e-01

Pos logit contributions
iencies+4.131
nces+4.077
adoes+3.866
uilt+3.836
+3.808
Neg logit contributions
 proceeding-2.416
 parting-2.413
 pretending-2.382
 inaction-2.380
 questioning-2.317
Token a
Token ID257
Feature activation-4.21e-01

Pos logit contributions
nces+6.000
iencies+5.974
adoes+5.606
uilt+5.536
ortium+5.402
Neg logit contributions
 proceeding-2.911
 parting-2.817
 pretending-2.808
 questioning-2.684
 pleading-2.673
Token single
Token ID2060
Feature activation+3.20e-01

Pos logit contributions
iencies+2.782
nces+2.769
uilt+2.614
adoes+2.589
ortium+2.527
Neg logit contributions
 pretending-1.762
 proceeding-1.756
 questioning-1.706
 parting-1.704
 inaction-1.680
Token bill
Token ID2855
Feature activation+4.49e-02

Pos logit contributions
 pretending+1.375
 proceeding+1.374
 parting+1.348
 questioning+1.329
 inaction+1.306
Neg logit contributions
iencies-2.185
nces-2.173
uilt-2.049
adoes-2.027
ortium-1.981
Token,
Token ID11
Feature activation+3.77e-01

Pos logit contributions
 proceeding+1.202
 pretending+1.177
 parting+1.175
 inaction+1.172
 questioning+1.163
Neg logit contributions
nces-1.511
iencies-1.505
uilt-1.398
adoes-1.396
ortium-1.376
Token and
Token ID290
Feature activation+2.03e+00

Pos logit contributions
 proceeding+2.124
 pretending+2.094
 parting+2.086
 inaction+2.073
 questioning+2.069
Neg logit contributions
iencies-2.085
nces-2.073
uilt-1.912
adoes-1.910
ortium-1.880
Token it
Token ID340
Feature activation-5.73e-01

Pos logit contributions
 pretending+8.928
 proceeding+8.835
 parting+8.698
 questioning+8.659
 asking+8.443
Neg logit contributions
iencies-13.226
nces-13.063
uilt-12.464
ortium-12.297
adoes-12.214
Token
Token ID447
Feature activation-4.34e-01

Pos logit contributions
nces+3.333
iencies+3.285
adoes+3.057
+3.030
uilt+3.013
Neg logit contributions
 proceeding-2.976
 inaction-2.960
 pretending-2.921
 parting-2.905
 questioning-2.851
Token
Token ID247
Feature activation+3.10e-02

Pos logit contributions
iencies+1.947
nces+1.942
uilt+1.736
+1.702
adoes+1.697
Neg logit contributions
 pretending-2.877
 proceeding-2.833
 parting-2.833
 inaction-2.790
 questioning-2.778
Tokens
Token ID82
Feature activation+2.44e+00

Pos logit contributions
 proceeding+0.187
 pretending+0.186
 parting+0.185
 inaction+0.185
 questioning+0.183
Neg logit contributions
nces-0.161
iencies-0.160
uilt-0.145
adoes-0.145
ortium-0.142
Token just
Token ID655
Feature activation+1.99e+00

Pos logit contributions
 proceeding+12.190
 pretending+12.175
 questioning+11.964
 parting+11.943
 bothering+11.891
Neg logit contributions
iencies-13.652
nces-13.440
ortium-12.657
uilt-12.648
adoes-12.414
Token funny
Token ID8258
Feature activation+2.74e-01

Pos logit contributions
 proceeding+9.192
 pretending+9.185
 parting+9.015
 questioning+8.933
 inaction+8.795
Neg logit contributions
iencies-12.612
nces-12.517
uilt-11.721
ortium-11.651
adoes-11.605
Token,
Token ID11
Feature activation+3.17e-02

Pos logit contributions
 proceeding+1.263
 pretending+1.236
 parting+1.232
 inaction+1.221
 questioning+1.214
Neg logit contributions
nces-1.799
iencies-1.782
adoes-1.673
uilt-1.668
-1.630
Token it
Token ID340
Feature activation-6.57e-01

Pos logit contributions
 proceeding+0.142
 parting+0.140
 pretending+0.139
 inaction+0.138
 questioning+0.137
Neg logit contributions
iencies-0.208
nces-0.208
uilt-0.194
adoes-0.194
ortium-0.191
Token
Token ID447
Feature activation-2.51e-01

Pos logit contributions
nces+3.844
iencies+3.747
adoes+3.515
+3.515
uilt+3.491
Neg logit contributions
 proceeding-3.360
 inaction-3.306
 parting-3.273
 pretending-3.261
 confirming-3.189
Token,
Token ID11
Feature activation-1.07e+00

Pos logit contributions
 proceeding+3.043
 pretending+3.010
 parting+2.995
 inaction+2.967
 questioning+2.961
Neg logit contributions
iencies-3.427
nces-3.414
uilt-3.168
adoes-3.153
ortium-3.130
Token to
Token ID284
Feature activation+1.16e+00

Pos logit contributions
iencies+5.977
nces+5.813
uilt+5.440
adoes+5.387
ortium+5.356
Neg logit contributions
 proceeding-5.310
 parting-5.291
 pretending-5.218
 inaction-5.163
 bothering-5.130
Token the
Token ID262
Feature activation+6.62e-01

Pos logit contributions
 proceeding+5.286
 pretending+5.234
 parting+5.169
 questioning+5.123
 inaction+5.097
Neg logit contributions
iencies-7.731
nces-7.709
uilt-7.218
adoes-7.185
ortium-7.121
Token explosive
Token ID13835
Feature activation-8.29e-01

Pos logit contributions
 proceeding+3.060
 pretending+3.060
 parting+3.026
 inaction+2.978
 questioning+2.966
Neg logit contributions
iencies-4.319
nces-4.306
uilt-4.010
adoes-4.009
ortium-3.958
Token,
Token ID11
Feature activation-7.07e-01

Pos logit contributions
nces+5.042
iencies+5.019
adoes+4.690
uilt+4.591
ortium+4.524
Neg logit contributions
 pretending-3.759
 proceeding-3.742
 parting-3.727
 inaction-3.663
 bothering-3.600
Token in
Token ID287
Feature activation+2.55e+00

Pos logit contributions
iencies+4.288
nces+4.258
uilt+3.982
adoes+3.965
ortium+3.937
Neg logit contributions
 parting-3.168
 proceeding-3.135
 pretending-3.135
 inaction-3.108
 bothering-2.973
Token the
Token ID262
Feature activation+6.84e-01

Pos logit contributions
 proceeding+9.521
 questioning+9.485
 pretending+9.375
 confirming+9.212
 inaction+9.078
Neg logit contributions
iencies-17.071
nces-17.067
adoes-16.081
uilt-16.035
ortium-15.724
Token case
Token ID1339
Feature activation+1.35e+00

Pos logit contributions
 proceeding+2.246
 pretending+2.242
 parting+2.199
 inaction+2.155
 questioning+2.151
Neg logit contributions
iencies-5.384
nces-5.358
uilt-5.077
adoes-5.060
ortium-5.020
Token of
Token ID286
Feature activation+1.30e+00

Pos logit contributions
 proceeding+5.055
 pretending+4.932
 questioning+4.912
 parting+4.813
 inaction+4.805
Neg logit contributions
nces-9.978
iencies-9.950
uilt-9.609
ortium-9.392
adoes-9.326
Token candid
Token ID2960
Feature activation+9.56e-02

Pos logit contributions
 proceeding+6.464
 pretending+6.404
 parting+6.322
 questioning+6.317
 inaction+6.257
Neg logit contributions
iencies-8.088
nces-8.054
uilt-7.522
adoes-7.473
ortium-7.398
Token assessments
Token ID21837
Feature activation-3.63e-01

Pos logit contributions
 proceeding+0.470
 pretending+0.467
 parting+0.459
 inaction+0.453
 bothering+0.450
Neg logit contributions
nces-0.592
iencies-0.591
adoes-0.547
uilt-0.541
ortium-0.539
Token,
Token ID11
Feature activation+1.09e-01

Pos logit contributions
 proceeding+7.294
 pretending+7.173
 parting+7.071
 questioning+7.071
 inaction+7.015
Neg logit contributions
iencies-8.452
nces-8.425
uilt-7.790
adoes-7.776
ortium-7.726
Token then
Token ID788
Feature activation+1.86e+00

Pos logit contributions
 proceeding+0.381
 pretending+0.375
 parting+0.375
 inaction+0.364
 questioning+0.363
Neg logit contributions
iencies-0.817
nces-0.815
uilt-0.765
adoes-0.765
ortium-0.760
Token she
Token ID673
Feature activation+3.41e-02

Pos logit contributions
 proceeding+8.090
 pretending+8.035
 questioning+7.911
 parting+7.854
 inaction+7.805
Neg logit contributions
iencies-12.389
nces-12.301
uilt-11.648
adoes-11.478
ortium-11.437
Token
Token ID447
Feature activation-2.16e-01

Pos logit contributions
 proceeding+0.198
 pretending+0.196
 parting+0.195
 inaction+0.194
 questioning+0.193
Neg logit contributions
iencies-0.183
nces-0.182
adoes-0.167
uilt-0.167
ortium-0.163
Token
Token ID247
Feature activation-3.76e-01

Pos logit contributions
nces+1.040
iencies+1.037
uilt+0.943
adoes+0.924
ortium+0.918
Neg logit contributions
 pretending-1.371
 proceeding-1.362
 parting-1.359
 inaction-1.333
 questioning-1.333
Tokens
Token ID82
Feature activation+2.47e+00

Pos logit contributions
nces+2.150
iencies+2.139
adoes+1.943
uilt+1.911
ortium+1.887
Neg logit contributions
 proceeding-2.067
 parting-2.059
 pretending-2.040
 inaction-2.036
 questioning-2.019
Token not
Token ID407
Feature activation+3.31e+00

Pos logit contributions
 proceeding+10.595
 pretending+10.556
 questioning+10.347
 parting+10.196
 asking+10.100
Neg logit contributions
nces-16.094
iencies-16.057
uilt-15.109
ortium-15.062
adoes-14.869
Token going
Token ID1016
Feature activation+4.53e-01

Pos logit contributions
 proceeding+14.147
 pretending+14.139
 questioning+13.940
 bothering+13.657
 asking+13.592
Neg logit contributions
iencies-20.130
nces-20.085
uilt-19.166
ortium-18.691
adoes-18.307
Token to
Token ID284
Feature activation+1.47e+00

Pos logit contributions
 proceeding+2.393
 pretending+2.367
 parting+2.346
 questioning+2.329
 inaction+2.318
Neg logit contributions
iencies-2.670
nces-2.663
uilt-2.469
adoes-2.463
ortium-2.428
Token be
Token ID307
Feature activation+3.01e+00

Pos logit contributions
 proceeding+6.139
 pretending+6.068
 questioning+5.972
 parting+5.941
 inaction+5.858
Neg logit contributions
iencies-10.134
nces-10.115
uilt-9.527
ortium-9.438
adoes-9.382
Token able
Token ID1498
Feature activation-1.38e-01

Pos logit contributions
 proceeding+12.718
 pretending+12.558
 questioning+12.486
 asking+12.126
 parting+12.081
Neg logit contributions
iencies-19.204
nces-18.923
uilt-18.213
ortium-18.065
adoes-17.610
Token-
Token ID12
Feature activation+4.04e-02

Pos logit contributions
 proceeding+3.600
 pretending+3.482
 questioning+3.319
 inaction+3.283
 confirming+3.239
Neg logit contributions
nces-13.951
iencies-13.871
uilt-13.604
ortium-13.280
adoes-13.067
TokenCalifornia
Token ID25284
Feature activation+1.53e+00

Pos logit contributions
 parting+0.274
 pretending+0.269
 proceeding+0.268
 inaction+0.265
 bothering+0.262
Neg logit contributions
nces-0.178
iencies-0.178
adoes-0.160
-0.157
uilt-0.152
Token,
Token ID11
Feature activation+1.37e-01

Pos logit contributions
 pretending+6.245
 proceeding+6.219
 questioning+6.129
 parting+6.103
 confirming+5.950
Neg logit contributions
iencies-10.419
nces-10.402
uilt-9.874
adoes-9.663
ortium-9.558
Token in
Token ID287
Feature activation+1.27e+00

Pos logit contributions
 proceeding+0.651
 pretending+0.639
 parting+0.636
 inaction+0.633
 questioning+0.622
Neg logit contributions
iencies-0.871
nces-0.867
adoes-0.805
uilt-0.804
ortium-0.795
Token a
Token ID257
Feature activation+5.29e-01

Pos logit contributions
 proceeding+7.221
 pretending+7.152
 parting+7.087
 questioning+7.055
 inaction+6.998
Neg logit contributions
iencies-6.912
nces-6.882
uilt-6.380
adoes-6.311
ortium-6.254
Token radio
Token ID5243
Feature activation+3.05e+00

Pos logit contributions
 pretending+2.639
 proceeding+2.634
 inaction+2.595
 parting+2.579
 questioning+2.564
Neg logit contributions
iencies-3.253
nces-3.250
adoes-3.017
uilt-3.013
ortium-2.967
Token debate
Token ID4384
Feature activation+1.86e+00

Pos logit contributions
 proceeding+13.922
 questioning+13.444
 pretending+13.280
 parting+13.160
 asking+13.035
Neg logit contributions
nces-18.108
iencies-17.970
ortium-17.208
uilt-17.009
adoes-16.740
Token hosted
Token ID12007
Feature activation-3.02e-01

Pos logit contributions
 proceeding+8.916
 pretending+8.797
 questioning+8.690
 parting+8.626
 confirming+8.444
Neg logit contributions
iencies-11.603
nces-11.534
uilt-10.865
adoes-10.776
ortium-10.748
Token by
Token ID416
Feature activation+6.68e-01

Pos logit contributions
iencies+1.587
nces+1.577
adoes+1.445
uilt+1.429
ortium+1.422
Neg logit contributions
 proceeding-1.750
 parting-1.735
 pretending-1.725
 inaction-1.721
 questioning-1.691
Token La
Token ID4689
Feature activation+1.14e+00

Pos logit contributions
 proceeding+3.752
 pretending+3.730
 parting+3.687
 questioning+3.655
 inaction+3.655
Neg logit contributions
iencies-3.718
nces-3.713
uilt-3.421
adoes-3.408
ortium-3.385
Token Opinion
Token ID29525
Feature activation-2.43e-01

Pos logit contributions
 proceeding+6.061
 pretending+5.986
 parting+5.923
 questioning+5.877
 inaction+5.876
Neg logit contributions
iencies-6.678
nces-6.635
uilt-6.183
adoes-6.120
ortium-6.062
Token late
Token ID2739
Feature activation+1.53e+00

Pos logit contributions
 proceeding+7.375
 pretending+7.304
 questioning+7.247
 parting+7.223
 inaction+7.105
Neg logit contributions
iencies-8.185
nces-8.131
uilt-7.570
adoes-7.514
ortium-7.469
Token Friday
Token ID3217
Feature activation+2.34e+00

Pos logit contributions
 proceeding+4.739
 pretending+4.654
 parting+4.576
 questioning+4.568
 inaction+4.468
Neg logit contributions
iencies-12.326
nces-12.248
uilt-11.643
adoes-11.578
ortium-11.506
Token hunted
Token ID34275
Feature activation-3.87e-01

Pos logit contributions
 proceeding+10.781
 pretending+10.611
 questioning+10.610
 parting+10.524
 confirming+10.329
Neg logit contributions
iencies-14.471
nces-14.354
uilt-13.379
adoes-13.317
ortium-13.306
Token for
Token ID329
Feature activation+5.18e-02

Pos logit contributions
nces+2.454
iencies+2.402
adoes+2.270
uilt+2.239
ortium+2.204
Neg logit contributions
 proceeding-1.804
 parting-1.771
 pretending-1.757
 inaction-1.732
 questioning-1.699
Token another
Token ID1194
Feature activation+3.33e-01

Pos logit contributions
 proceeding+0.215
 pretending+0.210
 parting+0.209
 inaction+0.205
 bothering+0.200
Neg logit contributions
nces-0.363
iencies-0.358
adoes-0.338
uilt-0.333
ortium-0.332
Token teen
Token ID6036
Feature activation+2.38e+00

Pos logit contributions
 proceeding+1.584
 parting+1.552
 pretending+1.550
 inaction+1.547
 questioning+1.515
Neg logit contributions
nces-2.127
iencies-2.090
adoes-1.968
uilt-1.962
ortium-1.919
Token suspected
Token ID9885
Feature activation-7.55e-02

Pos logit contributions
 pretending+11.180
 proceeding+10.948
 questioning+10.876
 hiding+10.689
 parting+10.681
Neg logit contributions
iencies-14.855
nces-14.560
uilt-13.778
ortium-13.413
adoes-13.376
Token of
Token ID286
Feature activation+6.14e-01

Pos logit contributions
iencies+0.517
nces+0.516
uilt+0.485
adoes+0.484
ortium+0.480
Neg logit contributions
 proceeding-0.324
 pretending-0.319
 parting-0.317
 questioning-0.312
 inaction-0.311
Token beating
Token ID11226
Feature activation+5.89e-01

Pos logit contributions
 proceeding+2.023
 pretending+1.965
 inaction+1.956
 parting+1.948
 questioning+1.912
Neg logit contributions
nces-4.779
iencies-4.761
adoes-4.526
uilt-4.455
ortium-4.434
Token an
Token ID281
Feature activation+9.56e-01

Pos logit contributions
 proceeding+3.222
 pretending+3.194
 parting+3.162
 questioning+3.138
 inaction+3.126
Neg logit contributions
iencies-3.367
nces-3.359
uilt-3.106
adoes-3.094
ortium-3.056
Token 88
Token ID9193
Feature activation+1.69e-01

Pos logit contributions
 proceeding+5.301
 pretending+5.272
 parting+5.187
 questioning+5.180
 inaction+5.147
Neg logit contributions
iencies-5.383
nces-5.352
uilt-4.970
adoes-4.908
ortium-4.905
Token moving
Token ID3867
Feature activation-4.30e-01

Pos logit contributions
 proceeding+5.459
 pretending+5.332
 parting+5.319
 questioning+5.211
 inaction+5.123
Neg logit contributions
iencies-10.210
nces-10.187
uilt-9.641
ortium-9.543
adoes-9.504
Token downward
Token ID20841
Feature activation-8.66e-01

Pos logit contributions
iencies+2.768
nces+2.761
adoes+2.628
uilt+2.565
+2.520
Neg logit contributions
 pretending-1.898
 proceeding-1.885
 parting-1.840
 questioning-1.840
 inaction-1.829
Token,
Token ID11
Feature activation-1.21e+00

Pos logit contributions
nces+4.840
iencies+4.761
adoes+4.494
+4.431
uilt+4.428
Neg logit contributions
 inaction-4.332
 proceeding-4.307
 pretending-4.301
 parting-4.289
 questioning-4.272
Token
Token ID447
Feature activation+2.59e-01

Pos logit contributions
iencies+8.621
nces+8.418
adoes+8.271
+8.071
uilt+7.970
Neg logit contributions
 inaction-3.759
 pretending-3.672
 questioning-3.619
 proceeding-3.567
 parting-3.511
Token
Token ID251
Feature activation+8.43e-01

Pos logit contributions
 pretending+1.556
 proceeding+1.548
 parting+1.530
 inaction+1.521
 questioning+1.509
Neg logit contributions
iencies-1.341
nces-1.329
uilt-1.221
adoes-1.210
ortium-1.192
Token the
Token ID262
Feature activation-1.29e-01

Pos logit contributions
 proceeding+4.494
 pretending+4.446
 parting+4.404
 questioning+4.370
 inaction+4.341
Neg logit contributions
iencies-4.938
nces-4.924
uilt-4.569
adoes-4.537
ortium-4.501
Token authors
Token ID7035
Feature activation+4.35e-01

Pos logit contributions
iencies+0.835
nces+0.828
adoes+0.787
uilt+0.772
ortium+0.767
Neg logit contributions
 pretending-0.579
 proceeding-0.570
 questioning-0.558
 inaction-0.557
 parting-0.556
Token write
Token ID3551
Feature activation-3.63e-01

Pos logit contributions
 proceeding+1.992
 pretending+1.976
 parting+1.963
 inaction+1.956
 questioning+1.938
Neg logit contributions
nces-2.853
iencies-2.846
uilt-2.667
adoes-2.665
ortium-2.644
Token.
Token ID13
Feature activation-1.05e-02

Pos logit contributions
nces+2.273
iencies+2.220
adoes+2.107
uilt+2.102
ortium+2.087
Neg logit contributions
 parting-1.697
 proceeding-1.694
 pretending-1.691
 inaction-1.689
 questioning-1.653
Token
Token ID198
Feature activation-9.05e-02

Pos logit contributions
nces+0.062
iencies+0.062
adoes+0.059
+0.058
uilt+0.057
Neg logit contributions
 pretending-0.054
 proceeding-0.053
 questioning-0.053
 parting-0.053
 inaction-0.052
Token
Token ID198
Feature activation+2.47e-01

Pos logit contributions
iencies+0.542
nces+0.535
adoes+0.502
+0.495
ortium+0.486
Neg logit contributions
 parting-0.463
 pretending-0.460
 proceeding-0.459
 questioning-0.453
 inaction-0.450
Token)
Token ID8
Feature activation-1.64e+00

Pos logit contributions
nces+6.276
iencies+6.250
uilt+6.029
ortium+5.919
+5.811
Neg logit contributions
 inaction-4.668
 pretending-4.579
 parting-4.552
 proceeding-4.550
 bothering-4.550
Token and
Token ID290
Feature activation-2.75e-01

Pos logit contributions
iencies+8.093
nces+7.923
uilt+7.714
ortium+7.583
+7.303
Neg logit contributions
 inaction-8.875
 parting-8.704
 proceeding-8.667
 pretending-8.569
 questioning-8.548
Token so
Token ID523
Feature activation+5.15e-01

Pos logit contributions
iencies+1.396
nces+1.388
uilt+1.294
adoes+1.282
ortium+1.257
Neg logit contributions
 proceeding-1.636
 pretending-1.628
 parting-1.607
 inaction-1.606
 questioning-1.600
Token on
Token ID319
Feature activation-5.33e-01

Pos logit contributions
 proceeding+2.158
 pretending+2.132
 parting+2.104
 questioning+2.086
 inaction+2.068
Neg logit contributions
iencies-3.593
nces-3.585
uilt-3.369
adoes-3.353
ortium-3.322
Token.
Token ID13
Feature activation+2.73e-01

Pos logit contributions
nces+3.248
iencies+3.246
uilt+3.042
adoes+3.027
ortium+3.018
Neg logit contributions
 inaction-2.559
 proceeding-2.558
 pretending-2.550
 parting-2.526
 questioning-2.517
Token Clan
Token ID22467
Feature activation+1.20e+00

Pos logit contributions
 proceeding+1.477
 pretending+1.466
 parting+1.451
 questioning+1.444
 inaction+1.437
Neg logit contributions
iencies-1.574
nces-1.570
uilt-1.449
adoes-1.447
ortium-1.432
Token-
Token ID12
Feature activation+1.38e+00

Pos logit contributions
 proceeding+6.180
 pretending+6.089
 parting+6.035
 questioning+6.005
 inaction+5.920
Neg logit contributions
iencies-7.106
nces-7.101
adoes-6.607
uilt-6.591
ortium-6.500
Tokenbased
Token ID3106
Feature activation-6.22e-01

Pos logit contributions
 proceeding+7.065
 pretending+6.948
 parting+6.842
 questioning+6.837
 confirming+6.713
Neg logit contributions
iencies-8.157
nces-8.088
adoes-7.562
uilt-7.455
ortium-7.390
Token personal
Token ID2614
Feature activation+9.58e-01

Pos logit contributions
iencies+3.334
nces+3.326
uilt+3.039
adoes+3.024
ortium+2.957
Neg logit contributions
 pretending-3.383
 proceeding-3.379
 inaction-3.362
 parting-3.311
 questioning-3.288
Token names
Token ID3891
Feature activation-3.77e-01

Pos logit contributions
 proceeding+4.988
 pretending+4.939
 parting+4.894
 questioning+4.864
 inaction+4.792
Neg logit contributions
nces-5.668
iencies-5.664
uilt-5.284
adoes-5.247
ortium-5.191
Token did
Token ID750
Feature activation-2.01e+00

Pos logit contributions
iencies+2.071
nces+2.058
uilt+1.914
adoes+1.900
ortium+1.898
Neg logit contributions
 inaction-2.038
 proceeding-2.034
 pretending-2.022
 parting-2.020
 questioning-1.990
Token to
Token ID284
Feature activation+5.81e-01

Pos logit contributions
iencies+3.047
nces+3.003
uilt+2.807
adoes+2.805
ortium+2.752
Neg logit contributions
 proceeding-2.969
 pretending-2.937
 parting-2.909
 questioning-2.892
 inaction-2.875
Token do
Token ID466
Feature activation-1.02e+00

Pos logit contributions
 proceeding+2.415
 pretending+2.393
 parting+2.350
 questioning+2.341
 inaction+2.321
Neg logit contributions
iencies-4.081
nces-4.059
uilt-3.836
adoes-3.817
ortium-3.772
Token,
Token ID11
Feature activation-1.64e+00

Pos logit contributions
nces+6.070
iencies+5.946
+5.615
adoes+5.572
uilt+5.568
Neg logit contributions
 proceeding-4.615
 inaction-4.577
 pretending-4.571
 parting-4.553
 confirming-4.484
Token
Token ID447
Feature activation+9.05e-01

Pos logit contributions
iencies+8.356
nces+8.207
+8.123
uilt+7.943
adoes+7.726
Neg logit contributions
 proceeding-8.031
 parting-7.916
 pretending-7.876
 inaction-7.810
 questioning-7.665
Token
Token ID251
Feature activation+1.02e+00

Pos logit contributions
 proceeding+5.409
 pretending+5.367
 parting+5.316
 questioning+5.284
 inaction+5.262
Neg logit contributions
iencies-4.716
nces-4.698
uilt-4.318
adoes-4.290
ortium-4.240
Token Warren
Token ID11328
Feature activation+1.07e+00

Pos logit contributions
 proceeding+5.624
 pretending+5.594
 parting+5.534
 questioning+5.512
 inaction+5.457
Neg logit contributions
iencies-5.795
nces-5.764
uilt-5.320
adoes-5.306
ortium-5.247
Token said
Token ID531
Feature activation+3.04e-01

Pos logit contributions
 proceeding+4.855
 pretending+4.816
 parting+4.744
 questioning+4.736
 inaction+4.667
Neg logit contributions
iencies-7.050
nces-7.017
uilt-6.545
adoes-6.526
ortium-6.485
Token.
Token ID13
Feature activation+3.23e-01

Pos logit contributions
 proceeding+1.353
 pretending+1.333
 parting+1.327
 inaction+1.312
 questioning+1.301
Neg logit contributions
nces-2.020
iencies-2.008
uilt-1.884
adoes-1.883
ortium-1.864
Token
Token ID564
Feature activation+1.77e+00

Pos logit contributions
 proceeding+1.623
 pretending+1.606
 parting+1.596
 questioning+1.583
 inaction+1.561
Neg logit contributions
iencies-1.987
nces-1.985
adoes-1.847
uilt-1.828
-1.820
Token
Token ID250
Feature activation+1.33e+00

Pos logit contributions
 proceeding+9.063
 pretending+8.959
 parting+8.906
 questioning+8.854
 inaction+8.813
Neg logit contributions
iencies-10.033
nces-9.998
uilt-9.423
adoes-9.391
ortium-9.362
TokenI
Token ID40
Feature activation-1.01e+00

Pos logit contributions
 proceeding+6.213
 pretending+6.155
 inaction+6.017
 parting+6.009
 questioning+6.007
Neg logit contributions
iencies-8.535
nces-8.489
uilt-7.986
adoes-7.888
ortium-7.865
Token and
Token ID290
Feature activation+9.26e-01

Pos logit contributions
 proceeding+0.411
 pretending+0.409
 parting+0.408
 inaction+0.400
 bothering+0.396
Neg logit contributions
iencies-0.601
nces-0.599
adoes-0.566
uilt-0.564
ortium-0.552
Token I
Token ID314
Feature activation+5.13e-01

Pos logit contributions
 proceeding+4.205
 pretending+4.159
 parting+4.115
 questioning+4.075
 inaction+4.058
Neg logit contributions
iencies-6.155
nces-6.134
uilt-5.750
adoes-5.721
ortium-5.664
Token guess
Token ID4724
Feature activation+1.40e+00

Pos logit contributions
 proceeding+2.552
 pretending+2.516
 parting+2.501
 inaction+2.487
 questioning+2.472
Neg logit contributions
iencies-3.185
nces-3.182
uilt-2.961
adoes-2.947
ortium-2.912
Token maybe
Token ID3863
Feature activation+1.66e+00

Pos logit contributions
 proceeding+6.078
 pretending+6.058
 parting+5.933
 questioning+5.919
 inaction+5.854
Neg logit contributions
iencies-9.473
nces-9.429
uilt-8.829
adoes-8.783
ortium-8.766
Token I
Token ID314
Feature activation-4.39e-01

Pos logit contributions
 pretending+8.020
 proceeding+7.996
 questioning+7.833
 parting+7.822
 inaction+7.738
Neg logit contributions
iencies-10.288
nces-10.242
ortium-9.486
uilt-9.479
adoes-9.430
Token will
Token ID481
Feature activation+1.61e-01

Pos logit contributions
nces+2.526
iencies+2.514
adoes+2.339
uilt+2.334
ortium+2.283
Neg logit contributions
 proceeding-2.279
 inaction-2.252
 parting-2.219
 pretending-2.205
 questioning-2.175
Token go
Token ID467
Feature activation-6.82e-01

Pos logit contributions
 proceeding+0.679
 pretending+0.667
 inaction+0.664
 parting+0.661
 questioning+0.651
Neg logit contributions
nces-1.101
iencies-1.098
uilt-1.035
adoes-1.035
ortium-1.014
Token out
Token ID503
Feature activation+8.51e-01

Pos logit contributions
nces+4.433
iencies+4.396
adoes+4.176
uilt+4.171
+4.084
Neg logit contributions
 proceeding-2.894
 parting-2.833
 inaction-2.802
 pretending-2.747
 confirming-2.738
Token under
Token ID739
Feature activation+2.22e+00

Pos logit contributions
 proceeding+4.406
 pretending+4.367
 parting+4.318
 questioning+4.289
 inaction+4.267
Neg logit contributions
iencies-5.112
nces-5.094
uilt-4.737
adoes-4.711
ortium-4.665
Token another
Token ID1194
Feature activation+1.14e+00

Pos logit contributions
 pretending+10.730
 proceeding+10.698
 questioning+10.644
 parting+10.465
 inaction+10.400
Neg logit contributions
iencies-13.097
nces-12.804
ortium-12.003
adoes-11.947
uilt-11.919
Token Not
Token ID1892
Feature activation+1.44e+00

Pos logit contributions
 proceeding+5.894
 pretending+5.843
 parting+5.775
 questioning+5.738
 inaction+5.707
Neg logit contributions
iencies-6.866
nces-6.838
uilt-6.357
adoes-6.326
ortium-6.265
Token,
Token ID11
Feature activation-1.88e+00

Pos logit contributions
iencies+4.533
nces+4.458
uilt+4.218
ortium+4.114
adoes+4.108
Neg logit contributions
 parting-3.960
 proceeding-3.932
 pretending-3.921
 questioning-3.903
 inaction-3.865
Token a
Token ID257
Feature activation-4.76e-01

Pos logit contributions
iencies+11.672
nces+11.421
uilt+11.366
+11.119
ortium+10.993
Neg logit contributions
 inaction-6.788
 pretending-6.606
 bothering-6.599
 proceeding-6.571
 questioning-6.488
Token brand
Token ID4508
Feature activation+7.44e-01

Pos logit contributions
iencies+2.980
nces+2.967
uilt+2.832
ortium+2.792
adoes+2.774
Neg logit contributions
 pretending-2.150
 proceeding-2.133
 inaction-2.110
 questioning-2.081
 parting-2.067
Token new
Token ID649
Feature activation-3.57e-01

Pos logit contributions
 proceeding+3.855
 pretending+3.826
 parting+3.766
 inaction+3.756
 questioning+3.749
Neg logit contributions
iencies-4.481
nces-4.458
uilt-4.174
adoes-4.122
ortium-4.110
Token,
Token ID11
Feature activation-3.56e-01

Pos logit contributions
iencies+2.378
nces+2.365
uilt+2.267
adoes+2.230
ortium+2.225
Neg logit contributions
 pretending-1.511
 proceeding-1.506
 inaction-1.472
 parting-1.468
 questioning-1.459
Token fully
Token ID3938
Feature activation+1.88e+00

Pos logit contributions
iencies+2.352
nces+2.333
uilt+2.220
adoes+2.188
ortium+2.170
Neg logit contributions
 proceeding-1.546
 pretending-1.544
 parting-1.507
 inaction-1.503
 questioning-1.492
Token integrated
Token ID11521
Feature activation-1.62e-01

Pos logit contributions
 proceeding+9.103
 pretending+8.927
 parting+8.866
 questioning+8.852
 inaction+8.664
Neg logit contributions
iencies-11.498
nces-11.456
adoes-10.541
uilt-10.485
ortium-10.318
Token point
Token ID966
Feature activation+1.13e+00

Pos logit contributions
iencies+1.034
nces+1.022
uilt+0.969
adoes+0.961
ortium+0.950
Neg logit contributions
 proceeding-0.753
 pretending-0.751
 parting-0.734
 inaction-0.733
 questioning-0.730
Token of
Token ID286
Feature activation+5.10e-01

Pos logit contributions
 proceeding+4.824
 pretending+4.772
 parting+4.708
 questioning+4.669
 inaction+4.646
Neg logit contributions
iencies-7.842
nces-7.815
uilt-7.349
adoes-7.308
ortium-7.248
Token sale
Token ID5466
Feature activation-1.34e-01

Pos logit contributions
 proceeding+2.185
 pretending+2.180
 parting+2.117
 inaction+2.088
 questioning+2.085
Neg logit contributions
iencies-3.481
nces-3.448
uilt-3.273
adoes-3.243
ortium-3.228
Token system
Token ID1080
Feature activation-1.00e-01

Pos logit contributions
iencies+0.873
nces+0.864
uilt+0.813
adoes+0.805
ortium+0.803
Neg logit contributions
 proceeding-0.604
 pretending-0.597
 inaction-0.594
 parting-0.592
 questioning-0.585
Token of
Token ID286
Feature activation+4.17e-01

Pos logit contributions
 parting+1.416
 proceeding+1.414
 pretending+1.408
 inaction+1.388
 questioning+1.387
Neg logit contributions
iencies-1.975
nces-1.974
adoes-1.859
uilt-1.822
ortium-1.801
Token Earth
Token ID3668
Feature activation+1.77e-01

Pos logit contributions
 proceeding+2.182
 pretending+2.161
 parting+2.149
 inaction+2.116
 questioning+2.114
Neg logit contributions
iencies-2.480
nces-2.459
adoes-2.307
uilt-2.275
ortium-2.233
Token.
Token ID13
Feature activation-3.03e-01

Pos logit contributions
 proceeding+0.870
 parting+0.862
 pretending+0.855
 inaction+0.854
 questioning+0.845
Neg logit contributions
nces-1.099
iencies-1.085
adoes-1.019
uilt-1.013
ortium-1.003
Token He
Token ID679
Feature activation-1.72e+00

Pos logit contributions
iencies+1.704
nces+1.687
adoes+1.595
+1.573
ortium+1.541
Neg logit contributions
 pretending-1.598
 questioning-1.598
 parting-1.595
 proceeding-1.593
 inaction-1.566
Token
Token ID447
Feature activation-7.17e-01

Pos logit contributions
iencies+9.953
nces+9.791
adoes+9.541
ortium+8.941
uilt+8.908
Neg logit contributions
 inaction-7.551
 parting-7.252
 proceeding-7.239
 bothering-7.205
 pretending-7.125
Token
Token ID247
Feature activation-6.43e-01

Pos logit contributions
nces+3.225
iencies+3.219
uilt+3.061
+3.036
ortium+2.902
Neg logit contributions
 pretending-4.498
 inaction-4.461
 proceeding-4.460
 parting-4.458
 bothering-4.383
Tokens
Token ID82
Feature activation+1.06e+00

Pos logit contributions
nces+3.526
iencies+3.496
uilt+3.248
+3.211
adoes+3.205
Neg logit contributions
 parting-3.498
 pretending-3.483
 proceeding-3.479
 inaction-3.459
 questioning-3.417
Token the
Token ID262
Feature activation-3.64e-01

Pos logit contributions
 proceeding+4.805
 pretending+4.756
 parting+4.699
 questioning+4.659
 inaction+4.647
Neg logit contributions
iencies-7.105
nces-7.099
uilt-6.651
adoes-6.627
ortium-6.559
Token most
Token ID749
Feature activation+9.97e-01

Pos logit contributions
nces+2.315
iencies+2.292
uilt+2.197
ortium+2.179
adoes+2.152
Neg logit contributions
 proceeding-1.621
 pretending-1.614
 inaction-1.580
 bothering-1.556
 parting-1.554
Token primitive
Token ID20049
Feature activation-1.72e+00

Pos logit contributions
 proceeding+4.492
 pretending+4.450
 parting+4.395
 questioning+4.346
 inaction+4.337
Neg logit contributions
iencies-6.671
nces-6.656
uilt-6.232
adoes-6.215
ortium-6.142
Token man
Token ID582
Feature activation-3.70e-01

Pos logit contributions
nces+10.415
iencies+10.130
uilt+9.928
+9.673
adoes+9.635
Neg logit contributions
 inaction-6.358
 parting-6.254
 proceeding-6.241
 pretending-6.207
 bothering-6.137
Token characteristics
Token ID9695
Feature activation-9.78e-01

Pos logit contributions
iencies+11.525
nces+11.319
uilt+10.549
+10.411
ortium+10.333
Neg logit contributions
 pretending-7.018
 proceeding-6.906
 bothering-6.772
 parting-6.744
 confirming-6.689
Token of
Token ID286
Feature activation-4.63e-01

Pos logit contributions
iencies+6.963
nces+6.881
uilt+6.530
+6.519
ortium+6.407
Neg logit contributions
 inaction-3.346
 pretending-3.306
 proceeding-3.295
 parting-3.248
 bothering-3.141
Token both
Token ID1111
Feature activation-2.20e-01

Pos logit contributions
iencies+3.063
nces+3.047
uilt+2.875
ortium+2.811
adoes+2.802
Neg logit contributions
 pretending-1.988
 proceeding-1.967
 parting-1.912
 inaction-1.907
 bothering-1.901
Token.
Token ID13
Feature activation-4.16e-01

Pos logit contributions
iencies+1.329
nces+1.318
uilt+1.238
adoes+1.219
ortium+1.204
Neg logit contributions
 pretending-1.079
 proceeding-1.078
 parting-1.063
 inaction-1.062
 questioning-1.055
Token
Token ID198
Feature activation-8.72e-01

Pos logit contributions
iencies+2.379
nces+2.374
+2.182
adoes+2.166
ortium+2.164
Neg logit contributions
 pretending-2.217
 proceeding-2.209
 questioning-2.191
 parting-2.181
 inaction-2.154
Token
Token ID198
Feature activation-5.58e-01

Pos logit contributions
iencies+4.529
nces+4.431
+4.236
adoes+4.169
ados+3.897
Neg logit contributions
 parting-4.981
 pretending-4.855
 questioning-4.804
 proceeding-4.793
 inaction-4.759
TokenAl
Token ID2348
Feature activation-7.23e-01

Pos logit contributions
iencies+2.933
nces+2.928
+2.732
uilt+2.670
adoes+2.660
Neg logit contributions
 parting-3.261
 pretending-3.243
 proceeding-3.206
 questioning-3.202
 inaction-3.130
Tokenma
Token ID2611
Feature activation-1.66e+00

Pos logit contributions
iencies+3.433
nces+3.358
uilt+3.086
adoes+3.034
ortium+3.028
Neg logit contributions
 pretending-4.590
 proceeding-4.540
 questioning-4.496
 inaction-4.495
 parting-4.482
Token from
Token ID422
Feature activation-4.89e-01

Pos logit contributions
iencies+9.536
nces+9.106
uilt+8.822
ortium+8.814
+8.678
Neg logit contributions
 pretending-7.659
 proceeding-7.631
 inaction-7.610
 questioning-7.541
 parting-7.505
Token Ex
Token ID1475
Feature activation-9.52e-01

Pos logit contributions
iencies+2.720
nces+2.716
uilt+2.492
ortium+2.468
adoes+2.462
Neg logit contributions
 pretending-2.659
 proceeding-2.635
 questioning-2.602
 parting-2.590
 inaction-2.575
Token Lib
Token ID7980
Feature activation-2.22e+00

Pos logit contributions
iencies+5.300
nces+5.267
ortium+4.867
uilt+4.810
adoes+4.608
Neg logit contributions
 proceeding-5.133
 pretending-5.130
 questioning-5.097
 parting-5.085
 inaction-5.070
Token
Token ID198
Feature activation-1.11e-01

Pos logit contributions
 proceeding+2.300
 pretending+2.272
 parting+2.256
 questioning+2.231
 inaction+2.229
Neg logit contributions
iencies-2.760
nces-2.758
uilt-2.561
adoes-2.544
ortium-2.515
Token
Token ID198
Feature activation+1.24e-01

Pos logit contributions
iencies+0.661
nces+0.653
adoes+0.611
+0.602
ortium+0.591
Neg logit contributions
 parting-0.574
 pretending-0.569
 proceeding-0.569
 questioning-0.561
 inaction-0.557
TokenAs
Token ID1722
Feature activation+2.89e-01

Pos logit contributions
 parting+0.673
 proceeding+0.666
 pretending+0.663
 questioning+0.658
 inaction+0.643
Neg logit contributions
nces-0.719
iencies-0.715
adoes-0.664
-0.664
uilt-0.651
Token a
Token ID257
Feature activation+1.64e-01

Pos logit contributions
 proceeding+1.297
 pretending+1.281
 parting+1.264
 questioning+1.247
 inaction+1.246
Neg logit contributions
nces-1.945
iencies-1.935
uilt-1.795
adoes-1.789
ortium-1.774
Token result
Token ID1255
Feature activation-7.56e-01

Pos logit contributions
 pretending+0.639
 proceeding+0.632
 inaction+0.613
 questioning+0.611
 parting+0.608
Neg logit contributions
iencies-1.166
nces-1.165
adoes-1.097
uilt-1.093
ortium-1.086
Token,
Token ID11
Feature activation-1.21e+00

Pos logit contributions
nces+4.695
iencies+4.686
adoes+4.342
uilt+4.303
ortium+4.253
Neg logit contributions
 proceeding-3.597
 pretending-3.545
 parting-3.541
 questioning-3.486
 inaction-3.423
Token a
Token ID257
Feature activation-2.28e-01

Pos logit contributions
nces+7.861
iencies+7.652
adoes+7.121
uilt+7.012
ortium+6.896
Neg logit contributions
 proceeding-4.788
 parting-4.788
 pretending-4.645
 inaction-4.572
 questioning-4.514
Token National
Token ID2351
Feature activation-1.48e+00

Pos logit contributions
nces+1.426
iencies+1.422
uilt+1.322
adoes+1.317
ortium+1.315
Neg logit contributions
 pretending-1.072
 proceeding-1.058
 parting-1.034
 inaction-1.029
 questioning-1.025
Token Sleep
Token ID17376
Feature activation-6.49e-01

Pos logit contributions
nces+5.495
iencies+5.062
ortium+4.863
 Rates+4.665
adoes+4.589
Neg logit contributions
 pretending-9.776
 proceeding-9.711
 parting-9.586
 questioning-9.562
 bothering-9.510
Token Foundation
Token ID5693
Feature activation-7.56e-01

Pos logit contributions
iencies+2.264
nces+2.068
adoes+1.799
ortium+1.784
+1.783
Neg logit contributions
 pretending-4.883
 questioning-4.861
 bothering-4.804
 parting-4.802
 proceeding-4.784
Token poll
Token ID3278
Feature activation-9.80e-01

Pos logit contributions
nces+4.670
iencies+4.599
ortium+4.481
adoes+4.332
uilt+4.283
Neg logit contributions
 pretending-3.351
 parting-3.251
 bothering-3.217
 proceeding-3.217
 inaction-3.194
Token and
Token ID290
Feature activation-3.18e-01

Pos logit contributions
nces+9.764
iencies+9.731
uilt+9.356
adoes+9.243
ortium+9.224
Neg logit contributions
 proceeding-6.205
 inaction-6.165
 parting-6.057
 questioning-5.790
 pretending-5.683
Token has
Token ID468
Feature activation-1.52e+00

Pos logit contributions
iencies+2.031
nces+2.003
uilt+1.913
adoes+1.902
ortium+1.864
Neg logit contributions
 proceeding-1.436
 parting-1.408
 inaction-1.406
 questioning-1.388
 pretending-1.386
Token no
Token ID645
Feature activation-2.27e-01

Pos logit contributions
iencies+9.266
nces+9.244
adoes+8.915
uilt+8.904
ortium+8.662
Neg logit contributions
 proceeding-6.006
 inaction-5.939
 parting-5.915
 pretending-5.796
 confirming-5.764
Token idea
Token ID2126
Feature activation-9.20e-01

Pos logit contributions
iencies+1.501
nces+1.475
adoes+1.411
uilt+1.372
ortium+1.348
Neg logit contributions
 proceeding-0.991
 pretending-0.971
 inaction-0.968
 parting-0.946
 questioning-0.942
Token of
Token ID286
Feature activation+2.47e-01

Pos logit contributions
iencies+6.652
nces+6.449
adoes+6.302
uilt+6.089
+6.003
Neg logit contributions
 proceeding-3.045
 inaction-3.004
 parting-2.990
 pretending-2.913
 questioning-2.898
Token the
Token ID262
Feature activation-4.58e-01

Pos logit contributions
 proceeding+0.910
 pretending+0.897
 parting+0.894
 questioning+0.883
 inaction+0.878
Neg logit contributions
iencies-1.822
nces-1.800
adoes-1.718
uilt-1.711
ortium-1.679
Token complexities
Token ID42292
Feature activation-9.24e-01

Pos logit contributions
iencies+2.843
nces+2.781
adoes+2.657
uilt+2.621
ortium+2.580
Neg logit contributions
 pretending-2.144
 proceeding-2.126
 parting-2.100
 inaction-2.072
 questioning-2.065
Token out
Token ID503
Feature activation+4.69e-01

Pos logit contributions
iencies+5.969
nces+5.749
adoes+5.471
uilt+5.341
ortium+5.232
Neg logit contributions
 inaction-3.848
 parting-3.843
 proceeding-3.784
 pretending-3.774
 questioning-3.736
Token there
Token ID612
Feature activation-6.31e-01

Pos logit contributions
 proceeding+1.155
 parting+1.134
 pretending+1.127
 questioning+1.099
 inaction+1.095
Neg logit contributions
iencies-4.084
nces-4.072
uilt-3.868
adoes-3.862
ortium-3.823
Token."
Token ID526
Feature activation+1.20e+00

Pos logit contributions
iencies+4.005
nces+4.000
adoes+3.723
uilt+3.702
ortium+3.675
Neg logit contributions
 inaction-2.755
 parting-2.711
 proceeding-2.699
 questioning-2.596
 pretending-2.589
Token
Token ID198
Feature activation-2.97e-01

Pos logit contributions
 proceeding+5.988
 pretending+5.923
 parting+5.853
 questioning+5.795
 inaction+5.785
Neg logit contributions
iencies-7.367
nces-7.346
uilt-6.894
adoes-6.778
ortium-6.718
Token is
Token ID318
Feature activation-1.11e+00

Pos logit contributions
iencies+2.606
nces+2.516
adoes+2.375
uilt+2.374
ortium+2.286
Neg logit contributions
 pretending-2.505
 proceeding-2.502
 inaction-2.475
 parting-2.457
 questioning-2.456
Token five
Token ID1936
Feature activation-9.67e-01

Pos logit contributions
iencies+5.031
nces+4.875
adoes+4.664
uilt+4.654
+4.444
Neg logit contributions
 inaction-6.401
 pretending-6.338
 parting-6.331
 proceeding-6.310
 bothering-6.246
Token years
Token ID812
Feature activation-9.82e-01

Pos logit contributions
iencies+5.175
nces+5.112
adoes+4.818
uilt+4.709
ortium+4.608
Neg logit contributions
 inaction-5.151
 proceeding-5.047
 parting-5.038
 pretending-4.962
 questioning-4.950
Token.
Token ID13
Feature activation-2.12e-02

Pos logit contributions
nces+5.478
iencies+5.443
adoes+5.077
uilt+5.012
ortium+4.916
Neg logit contributions
 inaction-4.815
 pretending-4.814
 proceeding-4.747
 parting-4.716
 questioning-4.686
Token
Token ID198
Feature activation-5.44e-01

Pos logit contributions
nces+0.120
iencies+0.120
adoes+0.111
ortium+0.109
uilt+0.109
Neg logit contributions
 proceeding-0.115
 pretending-0.115
 questioning-0.113
 inaction-0.113
 parting-0.113
Token
Token ID198
Feature activation-8.33e-01

Pos logit contributions
iencies+3.054
nces+3.003
adoes+2.828
+2.812
ortium+2.689
Neg logit contributions
 parting-2.929
 pretending-2.887
 proceeding-2.868
 questioning-2.863
 inaction-2.834
TokenHead
Token ID13847
Feature activation+2.40e-01

Pos logit contributions
nces+4.257
iencies+4.240
+4.016
adoes+3.967
uilt+3.887
Neg logit contributions
 parting-4.766
 questioning-4.741
 pretending-4.737
 proceeding-4.724
 inaction-4.617
Tokenwater
Token ID7050
Feature activation-1.81e+00

Pos logit contributions
 proceeding+1.256
 pretending+1.243
 parting+1.223
 inaction+1.221
 questioning+1.219
Neg logit contributions
iencies-1.436
nces-1.431
uilt-1.335
adoes-1.313
ortium-1.309
Token (
Token ID357
Feature activation-1.32e+00

Pos logit contributions
iencies+9.380
nces+9.282
ortium+8.551
adoes+8.531
uilt+8.475
Neg logit contributions
 pretending-8.924
 proceeding-8.917
 inaction-8.828
 questioning-8.725
 pleading-8.658
Token01
Token ID486
Feature activation-1.23e+00

Pos logit contributions
iencies+5.759
nces+5.703
uilt+5.303
adoes+5.233
ortium+5.028
Neg logit contributions
 proceeding-8.336
 parting-8.302
 questioning-8.241
 inaction-8.216
 pretending-8.206
Token606
Token ID33206
Feature activation-1.65e+00

Pos logit contributions
iencies+4.636
nces+4.627
adoes+4.174
+4.076
uilt+4.031
Neg logit contributions
 proceeding-8.335
 questioning-8.318
 parting-8.316
 pretending-8.293
 inaction-8.275
Token-
Token ID12
Feature activation+2.01e-01

Pos logit contributions
nces+4.048
iencies+4.020
adoes+3.844
uilt+3.841
ortium+3.825
Neg logit contributions
 pretending-1.913
 proceeding-1.832
 parting-1.815
 questioning-1.809
 confirming-1.792
Tokenold
Token ID727
Feature activation-3.61e-01

Pos logit contributions
 proceeding+1.216
 pretending+1.210
 parting+1.200
 questioning+1.190
 inaction+1.186
Neg logit contributions
iencies-1.033
nces-1.028
uilt-0.944
adoes-0.940
ortium-0.928
Token company
Token ID1664
Feature activation-6.58e-02

Pos logit contributions
iencies+2.525
nces+2.505
uilt+2.412
adoes+2.392
ortium+2.374
Neg logit contributions
 proceeding-1.381
 pretending-1.376
 inaction-1.338
 parting-1.336
 questioning-1.319
Token Path
Token ID10644
Feature activation+4.48e-01

Pos logit contributions
nces+0.498
iencies+0.497
uilt+0.481
ortium+0.473
+0.469
Neg logit contributions
 proceeding-0.222
 inaction-0.218
 parting-0.218
 pretending-0.217
 questioning-0.210
Tokenacqu
Token ID43561
Feature activation-2.31e+00

Pos logit contributions
 proceeding+2.456
 pretending+2.432
 parting+2.408
 questioning+2.392
 inaction+2.383
Neg logit contributions
iencies-2.562
nces-2.549
uilt-2.375
adoes-2.352
ortium-2.331
Tokenired
Token ID1202
Feature activation-3.07e+00

Pos logit contributions
iencies+11.402
nces+10.968
uilt+10.494
adoes+10.135
+9.924
Neg logit contributions
 proceeding-12.371
 parting-12.321
 pretending-12.097
 questioning-12.002
 inaction-11.889
Token back
Token ID736
Feature activation-4.29e-01

Pos logit contributions
nces+17.877
uilt+17.236
+17.081
ortium+17.055
iencies+17.008
Neg logit contributions
 proceeding-9.748
 inaction-9.723
 pretending-9.269
 bothering-9.178
 parting-9.029
Token in
Token ID287
Feature activation-1.61e-02

Pos logit contributions
nces+2.501
iencies+2.467
uilt+2.344
adoes+2.314
ortium+2.301
Neg logit contributions
 proceeding-2.205
 pretending-2.159
 inaction-2.136
 parting-2.131
 questioning-2.124
Token June
Token ID2795
Feature activation-8.68e-01

Pos logit contributions
iencies+0.074
nces+0.074
adoes+0.069
uilt+0.067
ortium+0.066
Neg logit contributions
 proceeding-0.102
 pretending-0.102
 parting-0.100
 inaction-0.100
 questioning-0.100
Token.
Token ID13
Feature activation+1.04e-02

Pos logit contributions
nces+5.352
iencies+5.333
uilt+5.160
+5.054
ortium+5.039
Neg logit contributions
 inaction-3.800
 proceeding-3.758
 pretending-3.737
 parting-3.693
 questioning-3.632
Token Talk
Token ID12167
Feature activation+9.54e-01

Pos logit contributions
 proceeding+0.056
 pretending+0.056
 parting+0.055
 questioning+0.055
 inaction+0.055
Neg logit contributions
iencies-0.059
nces-0.058
uilt-0.055
-0.054
adoes-0.054
Token the
Token ID262
Feature activation+4.39e-01

Pos logit contributions
 proceeding+1.976
 pretending+1.934
 inaction+1.900
 parting+1.899
 questioning+1.892
Neg logit contributions
nces-3.558
iencies-3.557
adoes-3.344
uilt-3.333
ortium-3.280
Token brush
Token ID14093
Feature activation+7.45e-01

Pos logit contributions
 proceeding+2.102
 pretending+2.085
 inaction+2.038
 questioning+2.035
 parting+2.028
Neg logit contributions
iencies-2.785
nces-2.764
adoes-2.590
uilt-2.587
ortium-2.548
Token.
Token ID13
Feature activation+6.93e-01

Pos logit contributions
 proceeding+3.609
 pretending+3.560
 inaction+3.529
 questioning+3.517
 parting+3.517
Neg logit contributions
nces-4.733
iencies-4.722
uilt-4.389
adoes-4.383
ortium-4.342
Token I
Token ID314
Feature activation-5.11e-01

Pos logit contributions
 proceeding+3.360
 pretending+3.334
 parting+3.292
 questioning+3.278
 inaction+3.260
Neg logit contributions
iencies-4.385
nces-4.370
uilt-4.074
adoes-4.064
ortium-4.023
Token personally
Token ID7620
Feature activation-3.21e-01

Pos logit contributions
nces+3.253
iencies+3.216
adoes+3.024
uilt+3.018
ortium+2.985
Neg logit contributions
 inaction-2.366
 proceeding-2.307
 pretending-2.282
 parting-2.261
 questioning-2.256
Token didn
Token ID1422
Feature activation-3.47e+00

Pos logit contributions
nces+2.016
iencies+2.010
uilt+1.878
adoes+1.877
ortium+1.849
Neg logit contributions
 inaction-1.522
 proceeding-1.503
 pretending-1.493
 parting-1.467
 questioning-1.461
Token
Token ID447
Feature activation-3.12e-01

Pos logit contributions
+10.526
iencies+10.291
adoes+9.680
ados+9.247
nces+9.147
Neg logit contributions
 parting-20.933
 inaction-20.201
 questioning-20.137
 pretending-20.033
 proceeding-19.557
Token
Token ID247
Feature activation-1.62e-02

Pos logit contributions
nces+1.612
iencies+1.588
uilt+1.475
+1.467
adoes+1.449
Neg logit contributions
 proceeding-1.861
 pretending-1.856
 inaction-1.831
 parting-1.825
 questioning-1.798
Tokent
Token ID83
Feature activation+9.39e-01

Pos logit contributions
nces+0.082
iencies+0.082
uilt+0.076
+0.075
adoes+0.074
Neg logit contributions
 proceeding-0.099
 pretending-0.098
 inaction-0.097
 parting-0.097
 questioning-0.096
Token find
Token ID1064
Feature activation-7.85e-01

Pos logit contributions
 proceeding+4.025
 pretending+3.973
 parting+3.930
 questioning+3.894
 inaction+3.817
Neg logit contributions
iencies-6.483
nces-6.459
uilt-6.067
adoes-6.024
ortium-6.002
Token painting
Token ID12036
Feature activation+7.85e-01

Pos logit contributions
iencies+5.749
nces+5.713
uilt+5.471
adoes+5.379
+5.368
Neg logit contributions
 proceeding-2.605
 inaction-2.576
 pretending-2.564
 questioning-2.448
 parting-2.424
Token plumbing
Token ID40199
Feature activation-6.33e-01

Pos logit contributions
nces+6.496
iencies+6.431
adoes+6.292
uilt+6.083
+5.994
Neg logit contributions
 pretending-4.018
 proceeding-3.999
 parting-3.885
 inaction-3.880
 confirming-3.850
Token
Token ID447
Feature activation-5.08e-01

Pos logit contributions
iencies+3.742
nces+3.690
uilt+3.444
adoes+3.434
+3.412
Neg logit contributions
 pretending-3.062
 proceeding-3.033
 parting-3.028
 questioning-2.990
 inaction-2.985
Token
Token ID247
Feature activation+9.49e-01

Pos logit contributions
iencies+2.332
nces+2.274
uilt+2.029
+2.029
adoes+2.009
Neg logit contributions
 pretending-3.344
 proceeding-3.298
 parting-3.292
 inaction-3.230
 questioning-3.230
Token culprit
Token ID26581
Feature activation+2.73e-01

Pos logit contributions
 proceeding+5.204
 pretending+5.157
 parting+5.096
 questioning+5.076
 inaction+5.048
Neg logit contributions
iencies-5.385
nces-5.364
uilt-4.977
adoes-4.945
ortium-4.902
Tokens
Token ID82
Feature activation-6.30e-03

Pos logit contributions
 proceeding+1.489
 pretending+1.474
 parting+1.467
 questioning+1.449
 inaction+1.447
Neg logit contributions
iencies-1.563
nces-1.555
adoes-1.438
uilt-1.437
ortium-1.416
Token couldn
Token ID3521
Feature activation-3.88e+00

Pos logit contributions
iencies+0.036
nces+0.036
adoes+0.033
uilt+0.033
ortium+0.032
Neg logit contributions
 proceeding-0.034
 pretending-0.033
 parting-0.033
 inaction-0.033
 questioning-0.033
Token
Token ID447
Feature activation-7.50e-01

Pos logit contributions
+14.005
iencies+12.453
adoes+11.128
nces+10.856
ados+10.778
Neg logit contributions
 parting-21.507
 pretending-20.861
 questioning-20.644
 proceeding-20.441
 inaction-20.120
Token
Token ID247
Feature activation-4.71e-01

Pos logit contributions
iencies+3.446
nces+3.329
+3.072
uilt+3.045
adoes+3.015
Neg logit contributions
 pretending-4.867
 parting-4.820
 proceeding-4.813
 inaction-4.718
 questioning-4.716
Tokent
Token ID83
Feature activation+5.81e-01

Pos logit contributions
nces+2.520
iencies+2.512
uilt+2.299
adoes+2.271
+2.258
Neg logit contributions
 proceeding-2.703
 pretending-2.696
 parting-2.657
 inaction-2.645
 questioning-2.644
Token be
Token ID307
Feature activation+2.10e+00

Pos logit contributions
 proceeding+2.629
 pretending+2.602
 parting+2.569
 questioning+2.550
 inaction+2.538
Neg logit contributions
iencies-3.869
nces-3.855
uilt-3.609
adoes-3.601
ortium-3.558
Token accused
Token ID5371
Feature activation-5.82e-02

Pos logit contributions
 proceeding+9.004
 pretending+8.930
 questioning+8.731
 parting+8.665
 hiding+8.631
Neg logit contributions
iencies-14.034
nces-13.938
uilt-13.197
ortium-13.059
adoes-12.948
Token
Token ID247
Feature activation+6.19e-01

Pos logit contributions
 pretending+1.007
 proceeding+1.007
 parting+0.993
 inaction+0.984
 questioning+0.980
Neg logit contributions
iencies-0.786
nces-0.779
uilt-0.704
adoes-0.699
-0.684
Tokens
Token ID82
Feature activation+5.05e-01

Pos logit contributions
 proceeding+3.243
 pretending+3.212
 parting+3.180
 questioning+3.150
 inaction+3.145
Neg logit contributions
iencies-3.689
nces-3.685
uilt-3.421
adoes-3.395
ortium-3.360
Token privacy
Token ID6782
Feature activation+7.87e-01

Pos logit contributions
 proceeding+2.499
 pretending+2.468
 parting+2.440
 inaction+2.409
 questioning+2.407
Neg logit contributions
nces-3.139
iencies-3.127
adoes-2.920
uilt-2.913
ortium-2.865
Token and
Token ID290
Feature activation+1.62e+00

Pos logit contributions
 proceeding+4.163
 pretending+4.127
 parting+4.082
 questioning+4.056
 inaction+4.034
Neg logit contributions
iencies-4.637
nces-4.621
uilt-4.291
adoes-4.268
ortium-4.223
Token I
Token ID314
Feature activation-8.38e-01

Pos logit contributions
 proceeding+7.500
 pretending+7.478
 questioning+7.389
 parting+7.376
 inaction+7.305
Neg logit contributions
iencies-10.370
nces-10.310
uilt-9.619
adoes-9.549
ortium-9.529
Token don
Token ID836
Feature activation-3.50e+00

Pos logit contributions
nces+4.507
iencies+4.499
+4.290
uilt+4.196
ortium+4.116
Neg logit contributions
 proceeding-4.497
 pretending-4.417
 inaction-4.385
 parting-4.379
 confirming-4.261
Token
Token ID447
Feature activation-9.60e-01

Pos logit contributions
+12.582
iencies+11.242
nces+10.089
adoes+9.956
ados+9.444
Neg logit contributions
 parting-20.288
 pretending-19.896
 proceeding-19.685
 inaction-19.371
 questioning-19.217
Token
Token ID247
Feature activation-4.32e-01

Pos logit contributions
iencies+4.063
nces+3.950
+3.682
uilt+3.609
adoes+3.448
Neg logit contributions
 pretending-6.416
 parting-6.403
 proceeding-6.336
 questioning-6.226
 inaction-6.226
Tokent
Token ID83
Feature activation+4.13e-01

Pos logit contributions
nces+2.229
iencies+2.214
+2.033
uilt+2.027
adoes+1.974
Neg logit contributions
 proceeding-2.571
 inaction-2.547
 pretending-2.544
 parting-2.542
 questioning-2.490
Token like
Token ID588
Feature activation+1.51e+00

Pos logit contributions
 proceeding+1.859
 pretending+1.834
 parting+1.816
 inaction+1.813
 questioning+1.786
Neg logit contributions
iencies-2.754
nces-2.740
uilt-2.572
adoes-2.569
ortium-2.529
Token their
Token ID511
Feature activation+3.77e-01

Pos logit contributions
 proceeding+5.973
 pretending+5.969
 parting+5.860
 questioning+5.824
 inaction+5.755
Neg logit contributions
iencies-10.865
nces-10.797
uilt-10.146
adoes-10.104
ortium-10.076
Token
Token ID198
Feature activation+3.40e-01

Pos logit contributions
 proceeding+1.903
 pretending+1.883
 parting+1.864
 questioning+1.847
 inaction+1.838
Neg logit contributions
iencies-2.538
nces-2.530
uilt-2.360
adoes-2.352
ortium-2.330
Token
Token ID198
Feature activation+9.65e-01

Pos logit contributions
 proceeding+1.597
 pretending+1.582
 parting+1.565
 questioning+1.551
 inaction+1.543
Neg logit contributions
iencies-2.205
nces-2.197
uilt-2.050
adoes-2.045
ortium-2.024
TokenBut
Token ID1537
Feature activation+5.16e-01

Pos logit contributions
 proceeding+5.973
 pretending+5.929
 parting+5.871
 questioning+5.839
 inaction+5.817
Neg logit contributions
iencies-4.812
nces-4.791
uilt-4.387
adoes-4.355
ortium-4.304
Token their
Token ID511
Feature activation+2.65e-01

Pos logit contributions
 proceeding+2.442
 pretending+2.408
 parting+2.386
 inaction+2.354
 questioning+2.350
Neg logit contributions
nces-3.332
iencies-3.322
uilt-3.108
adoes-3.106
ortium-3.072
Token tour
Token ID4205
Feature activation+3.85e-01

Pos logit contributions
 proceeding+0.992
 pretending+0.990
 inaction+0.942
 parting+0.941
 confirming+0.936
Neg logit contributions
iencies-1.955
nces-1.942
adoes-1.839
uilt-1.827
ortium-1.807
Token didn
Token ID1422
Feature activation-3.84e+00

Pos logit contributions
 proceeding+1.899
 pretending+1.889
 parting+1.857
 inaction+1.851
 questioning+1.848
Neg logit contributions
iencies-2.393
nces-2.384
uilt-2.228
adoes-2.225
ortium-2.177
Token
Token ID447
Feature activation-9.21e-01

Pos logit contributions
+14.545
iencies+12.941
adoes+11.994
ados+11.728
nces+11.712
Neg logit contributions
 parting-20.752
 pretending-20.101
 questioning-19.799
 inaction-19.261
 pleading-19.143
Token
Token ID247
Feature activation-2.15e-01

Pos logit contributions
iencies+4.368
nces+4.268
uilt+3.961
+3.933
ortium+3.834
Neg logit contributions
 pretending-5.693
 parting-5.621
 proceeding-5.601
 inaction-5.485
 bothering-5.426
Tokent
Token ID83
Feature activation+1.04e+00

Pos logit contributions
nces+1.190
iencies+1.181
uilt+1.086
ortium+1.072
adoes+1.072
Neg logit contributions
 proceeding-1.215
 pretending-1.212
 parting-1.204
 inaction-1.193
 questioning-1.185
Token mark
Token ID1317
Feature activation-2.82e-01

Pos logit contributions
 proceeding+4.462
 pretending+4.369
 parting+4.351
 questioning+4.300
 inaction+4.220
Neg logit contributions
iencies-7.116
nces-7.100
uilt-6.677
adoes-6.602
ortium-6.567
Token the
Token ID262
Feature activation-1.12e+00

Pos logit contributions
nces+2.153
iencies+2.127
adoes+2.026
uilt+2.025
ortium+2.005
Neg logit contributions
 proceeding-0.961
 pretending-0.952
 inaction-0.915
 questioning-0.907
 parting-0.904