TOP ACTIVATIONS
MAX = 1.796

or her head , and must roll to avoid following the
alive by As mode us . The player must know choose
her head , and must roll to avoid following the commands
However we would like to remind you once again that the
. Charles Burns : Yeah , it
a failed will save , your character now receives secret voices
head , and must roll to avoid following the commands in
contain the instructions to create lambda functions using the archived file
of a failed will save , your character now receives secret
file and creates corresponding templates . This templates will contain the
youtube . com / channel / UC _ K k BY
such as through a or ) must not modify the state
Board :: apply _ move () and Board :: undo _
Worth and then the Mar cell us shale of Pennsylvania ,
. Charles Burns : Yeah , I mean
. file and creates corresponding templates . This templates
usually happens at an event filled with Tor ont on ians
the headquarters of the regional government . Similar tractor
, and must roll to avoid following the commands in his
is created and the archived file is uploaded to it

MIN ACTIVATIONS
MIN = -1.642

s director of research , F am
But if we take the phone calls , repeated
quickly came to a steep incl ine at mile 16 .
xit - is - the - prime - chall enge -
: 13 PH OTOS Donald Trump 's potential running
( 107 , 226 km / h ). Sun
- still - h angs - or - does /
Rated 5 out of 5 by d anc erg ur
t be easy be ­ ing Jimmy Carter . Al
Dixon , Mozilla 's chief legal and business officer , said
a 1 , 000 ft incl ine over 1 . 3
Rated 5 out of 5 by Quake D ude from
and 16 - 35 f / 2 . 8 II ,
en inter dis ant l ' ag ression de t out
Powell : New England Patriots : QB Tom Brady
= True , help =" path to images directory ") args
( 720 , 000 km / h ). Even at this
month , a Center for Public Integrity investigation found that ,
. - December 13 , 2010 boys beware Reviewer
report , Lead ing Beyond Limits : May oral

INTERVAL 0.936 - 1.796
CONTAINS 0.167%

FIN DE TRAN SM ISSION /// <|endoftext|> Our ELE X Woo
At the tender age of 69 ,
the commands in his head ). Kid n apping or Death
. AV C : It must be fun to
McC aff rey and Brian Albert have taken on the task

INTERVAL 0.077 - 0.936
CONTAINS 38.652%

be more inclined to take a job if staff can work
Ros common and owned by Richard Siber ry ( on the
in a side that had Paul Sch oles , David Beckham
understand both sides . The mixed kids that I
Jay replied with a w ry smile :

INTERVAL -0.783 - 0.077
CONTAINS 60.657%

to NBC that he had decided to fire Comey before Attorney
dots on the upper left front corner of the new polymer
Bl acks in Philly have a different overall accent
just couldn 't really hold the wheel in the last couple
developer hand - holding could be a good thing .

INTERVAL -1.642 - -0.783
CONTAINS 0.524%

cinnamon , and 2 he aping tablespoons of peanut butter :
The company received an estimated $ 9 . 7 million
joke , and the steep incl ines can be treacherous with
ships can be sent to explore , harvest resources
's jobs . The book The Rise of the Robots highlights
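
NOTE ON THE INTERVAL BOUNDS

The four INTERVAL ranges above look like equal-width bins spanning MIN to MAX, and the CONTAINS percentages look like the share of sampled activations falling in each bin. That reading is an assumption drawn from the numbers alone, not something the dump states. The sketch below checks it against the listed values; `equal_width_edges` is a hypothetical helper name, not part of any tool that produced this output.

```python
def equal_width_edges(lo, hi, n_bins):
    """Return the n_bins + 1 edges of equal-width bins covering [lo, hi]."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]


# Values copied from the dump (MIN, MAX, interval bounds, CONTAINS shares).
MIN_ACT, MAX_ACT = -1.642, 1.796
listed_edges = [-1.642, -0.783, 0.077, 0.936, 1.796]
listed_shares = [0.524, 60.657, 38.652, 0.167]  # percent, lowest bin first

# Assumption: the dashboard split [MIN, MAX] into four equal-width bins.
edges = equal_width_edges(MIN_ACT, MAX_ACT, 4)

# Largest disagreement between computed and listed edges; the listed
# values are rounded to three decimals, so a small gap is expected.
max_edge_gap = max(abs(a - b) for a, b in zip(edges, listed_edges))

# The shares should account for all sampled activations.
total_share = sum(listed_shares)

print("computed edges:", [round(e, 4) for e in edges])
print("max gap vs. listed edges:", max_edge_gap)
print("total share (%):", total_share)
```

If the assumption holds, the computed edges agree with the listed bounds to within rounding and the shares sum to 100%. On that reading, almost all activations fall in the two middle bins, and the example sequences under the top and bottom intervals come from the sparse tails.
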
Token or
Token ID393
Feature activation+2.35e-01

Pos logit contributions
%%%%+3.037
IJ+2.893
Submit+2.826
 Hearing+2.630
NPR+2.593
Neg logit contributions
alore-4.764
liga-4.200
rade-4.130
aints-4.111
ammy-4.039
Token her
Token ID607
Feature activation+1.31e+00

Pos logit contributions
%%%%+0.778
IJ+0.752
Submit+0.731
NPR+0.669
 Hearing+0.662
Neg logit contributions
alore-1.424
rade-1.250
liga-1.241
aints-1.236
ints-1.218
Token head
Token ID1182
Feature activation+1.08e+00

Pos logit contributions
%%%%+7.292
IJ+7.050
Submit+6.922
 Hearing+6.741
NPR+6.624
Neg logit contributions
alore-4.695
liga-3.971
rade-3.821
aints-3.674
ammy-3.603
Token,
Token ID11
Feature activation+1.06e+00

Pos logit contributions
%%%%+4.226
IJ+4.028
Submit+3.984
NPR+3.624
Psy+3.581
Neg logit contributions
alore-6.006
liga-5.066
aints-5.026
ammy-5.014
rade-5.004
Token and
Token ID290
Feature activation+1.26e+00

Pos logit contributions
%%%%+4.196
IJ+4.014
Submit+3.968
 Hearing+3.598
NPR+3.569
Neg logit contributions
alore-5.824
liga-4.929
rade-4.898
ammy-4.896
aints-4.865
Token must
Token ID1276
Feature activation+1.80e+00

Pos logit contributions
%%%%+4.849
Submit+4.516
IJ+4.423
 Hearing+4.105
 Continuous+3.989
Neg logit contributions
alore-6.970
liga-5.858
ammy-5.824
rade-5.789
aints-5.734
Token roll
Token ID4836
Feature activation+1.67e+00

Pos logit contributions
%%%%+8.537
Submit+7.926
IJ+7.698
Psy+7.389
 Continuous+7.269
Neg logit contributions
alore-7.799
liga-6.470
ints-6.364
rade-6.362
aints-6.300
Token to
Token ID284
Feature activation+1.61e+00

Pos logit contributions
%%%%+5.034
Submit+4.759
IJ+4.719
 Hearing+4.209
NPR+4.045
Neg logit contributions
alore-10.116
aints-8.834
liga-8.801
ammy-8.747
ints-8.738
Token avoid
Token ID3368
Feature activation+1.42e+00

Pos logit contributions
%%%%+5.362
Submit+4.733
IJ+4.718
 Hearing+4.249
 Again+4.179
Neg logit contributions
alore-9.638
liga-8.642
rade-8.371
aints-8.357
oat-8.330
Token following
Token ID1708
Feature activation+8.10e-01

Pos logit contributions
%%%%+4.403
IJ+4.134
Submit+3.905
 Hearing+3.715
NPR+3.565
Neg logit contributions
alore-8.894
rade-7.942
aints-7.758
liga-7.737
ints-7.730
Token the
Token ID262
Feature activation+1.14e+00

Pos logit contributions
%%%%+3.023
IJ+2.926
Submit+2.823
 Hearing+2.566
NPR+2.560
Neg logit contributions
alore-4.634
rade-4.006
liga-3.994
aints-3.971
ammy-3.934
Token alive
Token ID6776
Feature activation+8.47e-01

Pos logit contributions
%%%%+1.359
IJ+1.321
Submit+1.287
NPR+1.192
 Hearing+1.191
Neg logit contributions
alore-1.726
rade-1.491
liga-1.477
aints-1.454
ints-1.441
Token by
Token ID416
Feature activation+6.81e-01

Pos logit contributions
%%%%+3.573
IJ+3.488
Submit+3.381
NPR+3.175
 Hearing+3.161
Neg logit contributions
alore-4.399
liga-3.775
rade-3.695
ammy-3.686
aints-3.665
Token As
Token ID1081
Feature activation+2.67e-01

Pos logit contributions
%%%%+1.996
IJ+1.905
Submit+1.838
 Hearing+1.718
NPR+1.628
Neg logit contributions
alore-4.393
rade-3.897
liga-3.880
aints-3.818
ints-3.786
Tokenmode
Token ID14171
Feature activation+5.03e-01

Pos logit contributions
%%%%+1.784
IJ+1.739
Submit+1.730
NPR+1.646
 Hearing+1.629
Neg logit contributions
alore-0.723
rade-0.510
liga-0.509
aints-0.501
ints-0.495
Tokenus
Token ID385
Feature activation+9.35e-01

Pos logit contributions
%%%%+2.031
IJ+1.958
Submit+1.915
NPR+1.769
 Hearing+1.738
Neg logit contributions
alore-2.706
liga-2.318
rade-2.293
aints-2.274
ints-2.243
Token.
Token ID13
Feature activation+1.74e+00

Pos logit contributions
%%%%+3.988
IJ+3.846
Submit+3.708
NPR+3.426
 Hearing+3.411
Neg logit contributions
alore-4.935
liga-4.116
rade-4.108
aints-4.077
ammy-4.055
Token The
Token ID383
Feature activation+9.25e-01

Pos logit contributions
%%%%+4.999
IJ+4.713
Submit+4.499
 Hearing+4.433
 Again+4.236
Neg logit contributions
alore-11.331
liga-9.761
ammy-9.707
aints-9.697
rade-9.660
Token player
Token ID2137
Feature activation+8.32e-01

Pos logit contributions
%%%%+5.120
IJ+4.921
Submit+4.840
 Hearing+4.571
NPR+4.564
Neg logit contributions
alore-3.573
liga-2.879
aints-2.803
rade-2.795
ammy-2.757
Token must
Token ID1276
Feature activation+1.05e+00

Pos logit contributions
%%%%+3.413
IJ+3.185
Submit+3.171
 Hearing+2.935
NPR+2.838
Neg logit contributions
alore-4.430
rade-3.755
liga-3.730
ammy-3.660
ints-3.625
Token know
Token ID760
Feature activation+8.27e-01

Pos logit contributions
%%%%+5.019
IJ+4.728
Submit+4.631
 Hearing+4.421
NPR+4.299
Neg logit contributions
alore-4.750
rade-4.030
liga-3.855
aints-3.808
ints-3.789
Token choose
Token ID3853
Feature activation+4.29e-01

Pos logit contributions
%%%%+3.985
IJ+3.832
Submit+3.798
 Hearing+3.587
NPR+3.540
Neg logit contributions
alore-3.722
rade-3.129
liga-3.042
ints-3.003
aints-2.979
Token her
Token ID607
Feature activation+1.31e+00

Pos logit contributions
%%%%+0.778
IJ+0.752
Submit+0.731
NPR+0.669
 Hearing+0.662
Neg logit contributions
alore-1.424
rade-1.250
liga-1.241
aints-1.236
ints-1.218
Token head
Token ID1182
Feature activation+1.08e+00

Pos logit contributions
%%%%+7.292
IJ+7.050
Submit+6.922
 Hearing+6.741
NPR+6.624
Neg logit contributions
alore-4.695
liga-3.971
rade-3.821
aints-3.674
ammy-3.603
Token,
Token ID11
Feature activation+1.06e+00

Pos logit contributions
%%%%+4.226
IJ+4.028
Submit+3.984
NPR+3.624
Psy+3.581
Neg logit contributions
alore-6.006
liga-5.066
aints-5.026
ammy-5.014
rade-5.004
Token and
Token ID290
Feature activation+1.26e+00

Pos logit contributions
%%%%+4.196
IJ+4.014
Submit+3.968
 Hearing+3.598
NPR+3.569
Neg logit contributions
alore-5.824
liga-4.929
rade-4.898
ammy-4.896
aints-4.865
Token must
Token ID1276
Feature activation+1.80e+00

Pos logit contributions
%%%%+4.849
Submit+4.516
IJ+4.423
 Hearing+4.105
 Continuous+3.989
Neg logit contributions
alore-6.970
liga-5.858
ammy-5.824
rade-5.789
aints-5.734
Token roll
Token ID4836
Feature activation+1.67e+00

Pos logit contributions
%%%%+8.537
Submit+7.926
IJ+7.698
Psy+7.389
 Continuous+7.269
Neg logit contributions
alore-7.799
liga-6.470
ints-6.364
rade-6.362
aints-6.300
Token to
Token ID284
Feature activation+1.61e+00

Pos logit contributions
%%%%+5.034
Submit+4.759
IJ+4.719
 Hearing+4.209
NPR+4.045
Neg logit contributions
alore-10.116
aints-8.834
liga-8.801
ammy-8.747
ints-8.738
Token avoid
Token ID3368
Feature activation+1.42e+00

Pos logit contributions
%%%%+5.362
Submit+4.733
IJ+4.718
 Hearing+4.249
 Again+4.179
Neg logit contributions
alore-9.638
liga-8.642
rade-8.371
aints-8.357
oat-8.330
Token following
Token ID1708
Feature activation+8.10e-01

Pos logit contributions
%%%%+4.403
IJ+4.134
Submit+3.905
 Hearing+3.715
NPR+3.565
Neg logit contributions
alore-8.894
rade-7.942
aints-7.758
liga-7.737
ints-7.730
Token the
Token ID262
Feature activation+1.14e+00

Pos logit contributions
%%%%+3.023
IJ+2.926
Submit+2.823
 Hearing+2.566
NPR+2.560
Neg logit contributions
alore-4.634
rade-4.006
liga-3.994
aints-3.971
ammy-3.934
Token commands
Token ID9729
Feature activation+1.39e+00

Pos logit contributions
%%%%+5.492
IJ+5.294
Submit+5.097
NPR+4.797
 Hearing+4.754
Neg logit contributions
alore-5.217
liga-4.348
rade-4.317
ammy-4.250
ints-4.098
Token However
Token ID2102
Feature activation+1.40e+00

Pos logit contributions
%%%%+2.523
IJ+2.454
Submit+2.358
 Hearing+2.149
NPR+2.027
Neg logit contributions
alore-6.514
rade-5.778
liga-5.748
aints-5.679
ammy-5.574
Token we
Token ID356
Feature activation+1.10e+00

Pos logit contributions
%%%%+3.775
IJ+3.637
Submit+3.358
 Hearing+2.942
 Again+2.923
Neg logit contributions
alore-9.089
aints-8.030
rade-7.936
liga-7.845
ammy-7.839
Token would
Token ID561
Feature activation+1.00e+00

Pos logit contributions
%%%%+3.952
IJ+3.906
Submit+3.725
 Hearing+3.477
 Again+3.408
Neg logit contributions
alore-6.063
rade-5.343
aints-5.239
liga-5.236
ammy-5.193
Token like
Token ID588
Feature activation+9.15e-01

Pos logit contributions
%%%%+3.202
Submit+2.948
IJ+2.932
 Again+2.615
 Hearing+2.614
Neg logit contributions
alore-6.178
rade-5.434
liga-5.350
aints-5.285
ammy-5.210
Token to
Token ID284
Feature activation+9.41e-01

Pos logit contributions
%%%%+3.275
Submit+3.105
IJ+3.092
 Hearing+2.764
NPR+2.751
Neg logit contributions
alore-5.266
aints-4.548
liga-4.544
rade-4.527
ammy-4.481
Token remind
Token ID7101
Feature activation+1.63e+00

Pos logit contributions
%%%%+4.345
Submit+4.117
IJ+4.075
 Hearing+3.894
NPR+3.830
Neg logit contributions
alore-4.322
liga-3.674
rade-3.651
aints-3.527
oat-3.504
Token you
Token ID345
Feature activation+1.39e+00

Pos logit contributions
%%%%+4.650
IJ+4.130
Submit+4.120
NPR+3.591
 Again+3.587
Neg logit contributions
alore-10.507
liga-9.256
aints-9.086
oat-8.965
ammy-8.922
Token once
Token ID1752
Feature activation+1.30e+00

Pos logit contributions
%%%%+3.468
IJ+3.419
Submit+3.250
 Again+2.866
 Hearing+2.828
Neg logit contributions
alore-9.383
liga-8.148
rade-8.142
ammy-8.119
aints-8.032
Token again
Token ID757
Feature activation+6.19e-01

Pos logit contributions
%%%%+2.507
IJ+2.459
Submit+2.314
 Again+2.120
NPR+1.909
Neg logit contributions
alore-9.473
rade-8.470
ints-8.369
aints-8.359
oat-8.284
Token that
Token ID326
Feature activation+1.39e+00

Pos logit contributions
%%%%+2.252
IJ+2.200
Submit+2.148
NPR+1.957
 Hearing+1.925
Neg logit contributions
alore-3.557
liga-3.067
rade-3.063
aints-3.032
ints-2.990
Token the
Token ID262
Feature activation+1.01e+00

Pos logit contributions
%%%%+4.673
IJ+4.584
Submit+4.342
 Hearing+3.936
NPR+3.778
Neg logit contributions
alore-8.097
aints-7.111
liga-7.012
rade-6.966
ammy-6.930
Token.
Token ID13
Feature activation+4.75e-01

Pos logit contributions
%%%%+1.926
IJ+1.887
Submit+1.834
 Hearing+1.720
NPR+1.709
Neg logit contributions
alore-2.159
rade-1.839
liga-1.803
aints-1.802
ints-1.761
Token
Token ID198
Feature activation+4.85e-01

Pos logit contributions
%%%%+1.634
IJ+1.594
Submit+1.543
NPR+1.405
 Hearing+1.405
Neg logit contributions
alore-2.832
rade-2.486
liga-2.450
aints-2.442
ints-2.413
Token
Token ID198
Feature activation+6.36e-01

Pos logit contributions
%%%%+1.850
IJ+1.811
Submit+1.763
NPR+1.621
 Hearing+1.604
Neg logit contributions
alore-2.732
rade-2.346
liga-2.338
aints-2.320
ammy-2.283
TokenCharles
Token ID28711
Feature activation+2.22e-01

Pos logit contributions
%%%%+2.305
IJ+2.267
Submit+2.189
NPR+2.029
Psy+1.994
Neg logit contributions
alore-3.693
liga-3.181
rade-3.181
aints-3.141
ints-3.103
Token Burns
Token ID23592
Feature activation+3.79e-01

Pos logit contributions
%%%%+0.883
IJ+0.863
Submit+0.836
 Hearing+0.781
NPR+0.770
Neg logit contributions
alore-1.199
rade-1.035
liga-1.020
aints-1.016
ints-1.005
Token :
Token ID1058
Feature activation+1.63e+00

Pos logit contributions
%%%%+1.477
IJ+1.447
Submit+1.399
 Hearing+1.303
NPR+1.287
Neg logit contributions
alore-2.079
rade-1.802
liga-1.777
aints-1.764
ints-1.743
Token Yeah
Token ID9425
Feature activation+6.03e-01

Pos logit contributions
%%%%+4.613
IJ+4.513
Submit+4.317
 Hearing+4.300
 Again+4.039
Neg logit contributions
alore-10.457
liga-9.055
rade-9.001
aints-8.955
ints-8.862
Token,
Token ID11
Feature activation+2.47e-01

Pos logit contributions
%%%%+2.656
IJ+2.614
Submit+2.538
 Hearing+2.373
NPR+2.357
Neg logit contributions
alore-2.992
rade-2.546
aints-2.502
liga-2.500
ints-2.448
Token it
Token ID340
Feature activation+6.74e-01

Pos logit contributions
%%%%+1.011
IJ+0.988
Submit+0.963
NPR+0.890
 Hearing+0.884
Neg logit contributions
alore-1.310
rade-1.134
liga-1.112
aints-1.109
ints-1.095
Token
Token ID447
Feature activation+1.38e-01

Pos logit contributions
%%%%+2.701
IJ+2.688
Submit+2.575
NPR+2.428
 Hearing+2.414
Neg logit contributions
alore-3.516
rade-3.029
liga-3.005
aints-2.964
ints-2.904
Token
Token ID247
Feature activation+2.78e-02

Pos logit contributions
%%%%+0.318
IJ+0.302
Submit+0.289
 Hearing+0.247
NPR+0.247
Neg logit contributions
alore-0.983
rade-0.880
liga-0.871
aints-0.871
ints-0.861
Token a
Token ID257
Feature activation+9.87e-01

Pos logit contributions
%%%%+4.666
IJ+4.305
Submit+4.251
 Hearing+3.817
Psy+3.758
Neg logit contributions
alore-7.912
liga-6.784
rade-6.777
aints-6.723
ammy-6.524
Token failed
Token ID4054
Feature activation+5.73e-01

Pos logit contributions
%%%%+3.693
IJ+3.483
Submit+3.419
 Hearing+3.120
Psy+3.112
Neg logit contributions
alore-5.515
rade-4.818
liga-4.769
aints-4.718
oat-4.627
Token will
Token ID481
Feature activation+1.31e+00

Pos logit contributions
%%%%+2.665
IJ+2.565
Submit+2.527
 Hearing+2.375
Psy+2.337
Neg logit contributions
alore-2.737
rade-2.275
liga-2.274
aints-2.252
ints-2.214
Token save
Token ID3613
Feature activation+7.82e-01

Pos logit contributions
%%%%+4.388
Submit+4.020
IJ+3.831
Psy+3.450
PF+3.376
Neg logit contributions
alore-7.871
liga-6.837
rade-6.695
aints-6.665
ints-6.635
Token,
Token ID11
Feature activation+1.56e+00

Pos logit contributions
%%%%+3.191
IJ+3.049
Submit+3.035
NPR+2.771
Psy+2.739
Neg logit contributions
alore-4.233
liga-3.579
aints-3.551
rade-3.537
ammy-3.519
Token your
Token ID534
Feature activation+1.63e+00

Pos logit contributions
%%%%+4.154
Submit+3.688
IJ+3.498
NPR+3.154
Psy+3.123
Neg logit contributions
alore-10.405
liga-9.394
aints-9.141
ammy-9.092
ints-9.042
Token character
Token ID2095
Feature activation+1.00e+00

Pos logit contributions
%%%%+4.428
IJ+4.092
Submit+3.995
 Hearing+3.516
 Continuous+3.370
Neg logit contributions
alore-10.379
liga-9.357
oat-9.098
aints-9.090
ammy-9.035
Token now
Token ID783
Feature activation+1.21e+00

Pos logit contributions
%%%%+3.816
IJ+3.535
Submit+3.495
NPR+3.266
 Hearing+3.207
Neg logit contributions
alore-5.459
liga-4.841
rade-4.699
ammy-4.687
oat-4.635
Token receives
Token ID11583
Feature activation+6.96e-01

Pos logit contributions
%%%%+4.566
Submit+4.167
IJ+3.922
 Hearing+3.694
NPR+3.669
Neg logit contributions
alore-6.735
liga-6.187
ammy-6.026
oat-5.824
netflix-5.723
Token secret
Token ID3200
Feature activation+5.55e-01

Pos logit contributions
%%%%+2.683
Submit+2.550
IJ+2.531
 Hearing+2.370
NPR+2.312
Neg logit contributions
alore-3.869
liga-3.332
rade-3.309
aints-3.266
oat-3.195
Token voices
Token ID10839
Feature activation+1.17e+00

Pos logit contributions
%%%%+2.258
IJ+2.182
Submit+2.121
 Hearing+1.992
NPR+1.984
Neg logit contributions
alore-2.982
rade-2.584
liga-2.540
aints-2.487
oat-2.468
Token head
Token ID1182
Feature activation+1.08e+00

Pos logit contributions
%%%%+7.292
IJ+7.050
Submit+6.922
 Hearing+6.741
NPR+6.624
Neg logit contributions
alore-4.695
liga-3.971
rade-3.821
aints-3.674
ammy-3.603
Token,
Token ID11
Feature activation+1.06e+00

Pos logit contributions
%%%%+4.226
IJ+4.028
Submit+3.984
NPR+3.624
Psy+3.581
Neg logit contributions
alore-6.006
liga-5.066
aints-5.026
ammy-5.014
rade-5.004
Token and
Token ID290
Feature activation+1.26e+00

Pos logit contributions
%%%%+4.196
IJ+4.014
Submit+3.968
 Hearing+3.598
NPR+3.569
Neg logit contributions
alore-5.824
liga-4.929
rade-4.898
ammy-4.896
aints-4.865
Token must
Token ID1276
Feature activation+1.80e+00

Pos logit contributions
%%%%+4.849
Submit+4.516
IJ+4.423
 Hearing+4.105
 Continuous+3.989
Neg logit contributions
alore-6.970
liga-5.858
ammy-5.824
rade-5.789
aints-5.734
Token roll
Token ID4836
Feature activation+1.67e+00

Pos logit contributions
%%%%+8.537
Submit+7.926
IJ+7.698
Psy+7.389
 Continuous+7.269
Neg logit contributions
alore-7.799
liga-6.470
ints-6.364
rade-6.362
aints-6.300
Token to
Token ID284
Feature activation+1.61e+00

Pos logit contributions
%%%%+5.034
Submit+4.759
IJ+4.719
 Hearing+4.209
NPR+4.045
Neg logit contributions
alore-10.116
aints-8.834
liga-8.801
ammy-8.747
ints-8.738
Token avoid
Token ID3368
Feature activation+1.42e+00

Pos logit contributions
%%%%+5.362
Submit+4.733
IJ+4.718
 Hearing+4.249
 Again+4.179
Neg logit contributions
alore-9.638
liga-8.642
rade-8.371
aints-8.357
oat-8.330
Token following
Token ID1708
Feature activation+8.10e-01

Pos logit contributions
%%%%+4.403
IJ+4.134
Submit+3.905
 Hearing+3.715
NPR+3.565
Neg logit contributions
alore-8.894
rade-7.942
aints-7.758
liga-7.737
ints-7.730
Token the
Token ID262
Feature activation+1.14e+00

Pos logit contributions
%%%%+3.023
IJ+2.926
Submit+2.823
 Hearing+2.566
NPR+2.560
Neg logit contributions
alore-4.634
rade-4.006
liga-3.994
aints-3.971
ammy-3.934
Token commands
Token ID9729
Feature activation+1.39e+00

Pos logit contributions
%%%%+5.492
IJ+5.294
Submit+5.097
NPR+4.797
 Hearing+4.754
Neg logit contributions
alore-5.217
liga-4.348
rade-4.317
ammy-4.250
ints-4.098
Token in
Token ID287
Feature activation+1.29e+00

Pos logit contributions
%%%%+5.270
IJ+5.040
Submit+4.928
NPR+4.496
 Hearing+4.398
Neg logit contributions
alore-7.726
ammy-6.521
aints-6.511
liga-6.508
rade-6.395
Token contain
Token ID3994
Feature activation+1.17e+00

Pos logit contributions
%%%%+4.054
Submit+3.797
IJ+3.708
 Hearing+3.300
 Again+3.246
Neg logit contributions
alore-7.326
rade-6.343
ints-6.267
aints-6.241
liga-6.228
Token the
Token ID262
Feature activation+8.13e-01

Pos logit contributions
%%%%+4.081
IJ+3.938
Submit+3.812
 Hearing+3.397
Psy+3.303
Neg logit contributions
alore-6.713
rade-5.918
aints-5.869
liga-5.858
ammy-5.787
Token instructions
Token ID7729
Feature activation+1.10e+00

Pos logit contributions
%%%%+4.676
IJ+4.576
Submit+4.507
 Hearing+4.244
NPR+4.182
Neg logit contributions
alore-2.891
rade-2.368
liga-2.289
aints-2.224
ints-2.214
Token to
Token ID284
Feature activation+8.58e-01

Pos logit contributions
%%%%+3.809
IJ+3.644
Submit+3.630
 Hearing+3.251
NPR+3.174
Neg logit contributions
alore-6.442
ammy-5.556
rade-5.551
aints-5.542
liga-5.517
Token create
Token ID2251
Feature activation+6.19e-01

Pos logit contributions
%%%%+2.847
IJ+2.794
Submit+2.761
 Hearing+2.398
Psy+2.393
Neg logit contributions
alore-5.131
liga-4.455
rade-4.447
ints-4.346
ammy-4.339
Token lambda
Token ID37456
Feature activation+1.58e+00

Pos logit contributions
%%%%+3.081
IJ+3.041
Submit+2.973
 Hearing+2.774
NPR+2.751
Neg logit contributions
alore-2.638
rade-2.250
liga-2.177
aints-2.168
ints-2.152
Token functions
Token ID5499
Feature activation+9.64e-01

Pos logit contributions
%%%%+6.243
IJ+6.098
Submit+5.863
NPR+5.332
Psy+5.213
Neg logit contributions
alore-8.063
liga-7.153
rade-7.056
ammy-6.951
oat-6.733
Token using
Token ID1262
Feature activation+1.04e+00

Pos logit contributions
%%%%+3.433
IJ+3.362
Submit+3.246
 Hearing+2.967
NPR+2.951
Neg logit contributions
alore-5.602
rade-4.841
liga-4.756
aints-4.715
ints-4.710
Token the
Token ID262
Feature activation+9.42e-01

Pos logit contributions
%%%%+3.696
IJ+3.635
Submit+3.502
 Hearing+3.167
Psy+3.123
Neg logit contributions
alore-5.966
aints-5.212
rade-5.196
liga-5.157
ammy-5.096
Token archived
Token ID33962
Feature activation+1.02e+00

Pos logit contributions
%%%%+3.929
IJ+3.902
Submit+3.765
 Hearing+3.586
NPR+3.469
Neg logit contributions
alore-4.677
rade-4.059
liga-3.969
ints-3.941
aints-3.890
Token file
Token ID2393
Feature activation+8.69e-01

Pos logit contributions
%%%%+4.254
IJ+4.208
Submit+4.067
 Hearing+3.786
NPR+3.655
Neg logit contributions
alore-5.199
rade-4.406
liga-4.386
ints-4.326
aints-4.312
Token of
Token ID286
Feature activation+1.34e+00

Pos logit contributions
%%%%+3.222
IJ+3.009
Submit+2.949
 Hearing+2.594
Psy+2.560
Neg logit contributions
alore-7.165
aints-6.271
rade-6.222
liga-6.207
ammy-6.170
Token a
Token ID257
Feature activation+9.87e-01

Pos logit contributions
%%%%+4.666
IJ+4.305
Submit+4.251
 Hearing+3.817
Psy+3.758
Neg logit contributions
alore-7.912
liga-6.784
rade-6.777
aints-6.723
ammy-6.524
Token failed
Token ID4054
Feature activation+5.73e-01

Pos logit contributions
%%%%+3.693
IJ+3.483
Submit+3.419
 Hearing+3.120
Psy+3.112
Neg logit contributions
alore-5.515
rade-4.818
liga-4.769
aints-4.718
oat-4.627
Token will
Token ID481
Feature activation+1.31e+00

Pos logit contributions
%%%%+2.665
IJ+2.565
Submit+2.527
 Hearing+2.375
Psy+2.337
Neg logit contributions
alore-2.737
rade-2.275
liga-2.274
aints-2.252
ints-2.214
Token save
Token ID3613
Feature activation+7.82e-01

Pos logit contributions
%%%%+4.388
Submit+4.020
IJ+3.831
Psy+3.450
PF+3.376
Neg logit contributions
alore-7.871
liga-6.837
rade-6.695
aints-6.665
ints-6.635
Token,
Token ID11
Feature activation+1.56e+00

Pos logit contributions
%%%%+3.191
IJ+3.049
Submit+3.035
NPR+2.771
Psy+2.739
Neg logit contributions
alore-4.233
liga-3.579
aints-3.551
rade-3.537
ammy-3.519
Token your
Token ID534
Feature activation+1.63e+00

Pos logit contributions
%%%%+4.154
Submit+3.688
IJ+3.498
NPR+3.154
Psy+3.123
Neg logit contributions
alore-10.405
liga-9.394
aints-9.141
ammy-9.092
ints-9.042
Token character
Token ID2095
Feature activation+1.00e+00

Pos logit contributions
%%%%+4.428
IJ+4.092
Submit+3.995
 Hearing+3.516
 Continuous+3.370
Neg logit contributions
alore-10.379
liga-9.357
oat-9.098
aints-9.090
ammy-9.035
Token now
Token ID783
Feature activation+1.21e+00

Pos logit contributions
%%%%+3.816
IJ+3.535
Submit+3.495
NPR+3.266
 Hearing+3.207
Neg logit contributions
alore-5.459
liga-4.841
rade-4.699
ammy-4.687
oat-4.635
Token receives
Token ID11583
Feature activation+6.96e-01

Pos logit contributions
%%%%+4.566
Submit+4.167
IJ+3.922
 Hearing+3.694
NPR+3.669
Neg logit contributions
alore-6.735
liga-6.187
ammy-6.026
oat-5.824
netflix-5.723
Token secret
Token ID3200
Feature activation+5.55e-01

Pos logit contributions
%%%%+2.683
Submit+2.550
IJ+2.531
 Hearing+2.370
NPR+2.312
Neg logit contributions
alore-3.869
liga-3.332
rade-3.309
aints-3.266
oat-3.195
Tokenfile
Token ID7753
Feature activation+3.47e-01

Pos logit contributions
alore+0.377
rade+0.324
liga+0.318
aints+0.317
ints+0.312
Neg logit contributions
%%%%-0.316
IJ-0.310
Submit-0.302
NPR-0.280
 Hearing-0.279
Token and
Token ID290
Feature activation+1.13e+00

Pos logit contributions
%%%%+1.528
IJ+1.492
Submit+1.466
 Hearing+1.357
NPR+1.340
Neg logit contributions
alore-1.753
rade-1.474
liga-1.446
aints-1.434
ints-1.424
Token creates
Token ID8075
Feature activation+1.43e+00

Pos logit contributions
%%%%+4.676
IJ+4.652
Submit+4.582
 Hearing+4.260
NPR+4.128
Neg logit contributions
alore-5.772
rade-4.836
liga-4.798
ammy-4.664
ints-4.617
Token corresponding
Token ID11188
Feature activation+1.01e+00

Pos logit contributions
%%%%+3.618
IJ+3.530
Submit+3.303
 Hearing+3.134
Psy+2.805
Neg logit contributions
alore-9.618
liga-8.515
rade-8.464
aints-8.396
ints-8.347
Token templates
Token ID24019
Feature activation+1.23e+00

Pos logit contributions
%%%%+4.437
IJ+4.398
Submit+4.291
 Hearing+4.106
 Continuous+3.993
Neg logit contributions
alore-4.819
rade-4.158
liga-4.043
ints-3.954
ammy-3.897
Token.
Token ID13
Feature activation+1.55e+00

Pos logit contributions
%%%%+4.865
IJ+4.799
Submit+4.641
 Hearing+4.270
NPR+4.163
Neg logit contributions
alore-6.693
rade-5.579
ammy-5.546
aints-5.521
liga-5.489
Token This
Token ID770
Feature activation+1.30e+00

Pos logit contributions
%%%%+3.081
IJ+2.996
Submit+2.966
 Hearing+2.709
 Again+2.602
Neg logit contributions
alore-11.400
ammy-9.939
liga-9.936
rade-9.901
ints-9.817
Token templates
Token ID24019
Feature activation+3.91e-01

Pos logit contributions
%%%%+6.088
IJ+5.811
Submit+5.778
 Hearing+5.267
Psy+5.243
Neg logit contributions
alore-5.872
rade-4.938
liga-4.866
ashtra-4.683
ints-4.605
Token will
Token ID481
Feature activation+1.22e+00

Pos logit contributions
%%%%+1.904
IJ+1.865
Submit+1.830
 Hearing+1.708
NPR+1.699
Neg logit contributions
alore-1.778
rade-1.473
liga-1.439
aints-1.427
ints-1.425
Token contain
Token ID3994
Feature activation+1.17e+00

Pos logit contributions
%%%%+4.054
Submit+3.797
IJ+3.708
 Hearing+3.300
 Again+3.246
Neg logit contributions
alore-7.326
rade-6.343
ints-6.267
aints-6.241
liga-6.228
Token the
Token ID262
Feature activation+8.13e-01

Pos logit contributions
%%%%+4.081
IJ+3.938
Submit+3.812
 Hearing+3.397
Psy+3.303
Neg logit contributions
alore-6.713
rade-5.918
aints-5.869
liga-5.858
ammy-5.787
Tokenyoutube
Token ID11604
Feature activation+2.38e-01

Pos logit contributions
%%%%+1.548
IJ+1.486
Submit+1.477
NPR+1.333
Psy+1.293
Neg logit contributions
alore-2.829
rade-2.477
liga-2.427
aints-2.423
ints-2.392
Token.
Token ID13
Feature activation+7.41e-01

Pos logit contributions
%%%%+1.099
IJ+1.075
Submit+1.052
NPR+0.977
 Hearing+0.973
Neg logit contributions
alore-1.152
rade-0.967
liga-0.956
aints-0.948
ints-0.933
Tokencom
Token ID785
Feature activation+6.91e-01

Pos logit contributions
%%%%+2.698
IJ+2.589
Submit+2.584
NPR+2.334
Psy+2.254
Neg logit contributions
alore-4.264
rade-3.649
aints-3.621
liga-3.617
ints-3.565
Token/
Token ID14
Feature activation+3.54e-01

Pos logit contributions
%%%%+2.697
IJ+2.645
Submit+2.562
 Hearing+2.353
NPR+2.317
Neg logit contributions
alore-3.784
rade-3.245
liga-3.210
aints-3.165
ints-3.136
Tokenchannel
Token ID17620
Feature activation+7.94e-01

Pos logit contributions
%%%%+1.927
IJ+1.906
Submit+1.875
NPR+1.762
 Hearing+1.734
Neg logit contributions
alore-1.395
rade-1.121
liga-1.099
aints-1.093
ints-1.065
Token/
Token ID14
Feature activation+1.53e+00

Pos logit contributions
%%%%+3.049
IJ+2.889
Submit+2.882
NPR+2.570
Psy+2.558
Neg logit contributions
alore-4.518
liga-3.902
rade-3.836
aints-3.777
ammy-3.774
TokenUC
Token ID9598
Feature activation+2.64e-01

Pos logit contributions
%%%%+5.346
Submit+5.346
IJ+5.302
NPR+4.771
Psy+4.655
Neg logit contributions
alore-8.514
liga-7.299
rade-7.205
aints-7.196
ashtra-7.144
Token_
Token ID62
Feature activation+3.32e-01

Pos logit contributions
%%%%+1.062
IJ+1.048
Submit+1.012
NPR+0.937
Psy+0.921
Neg logit contributions
alore-1.416
rade-1.221
liga-1.206
aints-1.199
ints-1.182
TokenK
Token ID42
Feature activation+7.16e-01

Pos logit contributions
%%%%+1.331
IJ+1.315
Submit+1.269
NPR+1.176
Psy+1.154
Neg logit contributions
alore-1.788
rade-1.542
liga-1.523
aints-1.517
ints-1.492
Tokenk
Token ID74
Feature activation+5.56e-01

Pos logit contributions
%%%%+2.842
IJ+2.841
Submit+2.716
NPR+2.526
Psy+2.475
Neg logit contributions
alore-3.837
rade-3.298
liga-3.271
aints-3.247
ints-3.183
TokenBY
Token ID17513
Feature activation+3.96e-01

Pos logit contributions
%%%%+2.160
IJ+2.149
Submit+2.060
NPR+1.905
Psy+1.871
Neg logit contributions
alore-3.039
rade-2.628
liga-2.604
aints-2.582
ints-2.538
Tokensuch
Token ID10508
Feature activation+3.15e-01

Pos logit contributions
%%%%+2.202
IJ+2.113
Submit+2.094
NPR+1.873
Psy+1.872
Neg logit contributions
alore-3.866
liga-3.315
rade-3.301
aints-3.277
ammy-3.234
Token as
Token ID355
Feature activation+6.17e-01

Pos logit contributions
%%%%+1.347
IJ+1.317
Submit+1.292
NPR+1.192
 Hearing+1.183
Neg logit contributions
alore-1.626
rade-1.381
liga-1.372
aints-1.360
ints-1.332
Token through
Token ID832
Feature activation+1.04e+00

Pos logit contributions
%%%%+2.996
IJ+2.934
Submit+2.903
 Hearing+2.719
NPR+2.686
Neg logit contributions
alore-2.800
liga-2.321
rade-2.318
aints-2.267
ammy-2.214
Token a
Token ID257
Feature activation+8.54e-01

Pos logit contributions
%%%%+3.841
IJ+3.639
Submit+3.624
 Hearing+3.269
Psy+3.212
Neg logit contributions
alore-5.942
liga-5.095
rade-5.061
aints-4.955
ints-4.874
Token or
Token ID393
Feature activation+1.31e-01

Pos logit contributions
%%%%+3.190
IJ+3.079
Submit+3.055
 Hearing+2.839
NPR+2.743
Neg logit contributions
alore-4.737
liga-4.121
rade-4.117
ints-3.974
aints-3.971
Token )
Token ID1267
Feature activation+1.51e+00

Pos logit contributions
%%%%+0.477
IJ+0.462
Submit+0.454
 Hearing+0.414
NPR+0.408
Neg logit contributions
alore-0.758
rade-0.662
liga-0.653
aints-0.645
ints-0.640
Token must
Token ID1276
Feature activation+1.32e+00

Pos logit contributions
%%%%+5.435
Submit+5.238
IJ+4.977
 Hearing+4.580
 Continuous+4.543
Neg logit contributions
alore-8.460
ammy-7.198
liga-7.155
rade-7.098
upid-7.009
Token not
Token ID407
Feature activation+9.21e-01

Pos logit contributions
%%%%+3.062
Submit+2.723
IJ+2.501
NPR+2.143
Psy+2.122
Neg logit contributions
alore-9.174
liga-8.301
aints-8.115
rade-8.109
ammy-8.038
Token modify
Token ID13096
Feature activation+1.03e+00

Pos logit contributions
%%%%+2.966
Submit+2.775
IJ+2.752
Psy+2.505
NPR+2.443
Neg logit contributions
alore-5.590
liga-4.983
rade-4.771
ints-4.769
ammy-4.729
Token the
Token ID262
Feature activation+1.02e+00

Pos logit contributions
%%%%+3.612
IJ+3.499
Submit+3.413
 Hearing+3.029
Psy+2.997
Neg logit contributions
alore-6.039
aints-5.217
liga-5.216
ammy-5.198
rade-5.189
Token state
Token ID1181
Feature activation+1.13e+00

Pos logit contributions
%%%%+3.303
IJ+3.165
Submit+3.135
 Hearing+2.713
NPR+2.712
Neg logit contributions
alore-6.133
liga-5.353
aints-5.283
rade-5.264
ints-5.223
Token Board
Token ID5926
Feature activation+4.33e-01

Pos logit contributions
%%%%+3.050
IJ+2.909
Submit+2.864
 Hearing+2.694
 Again+2.554
Neg logit contributions
alore-6.403
rade-5.598
liga-5.529
aints-5.475
ammy-5.455
Token::
Token ID3712
Feature activation+2.31e-01

Pos logit contributions
%%%%+1.548
IJ+1.495
Submit+1.475
 Hearing+1.324
NPR+1.309
Neg logit contributions
alore-2.520
rade-2.195
liga-2.171
aints-2.139
ammy-2.125
Tokenapply
Token ID39014
Feature activation+9.57e-01

Pos logit contributions
%%%%+1.123
IJ+1.098
Submit+1.090
NPR+1.009
 Hearing+0.996
Neg logit contributions
alore-1.041
rade-0.870
liga-0.859
aints-0.850
ammy-0.831
Token_
Token ID62
Feature activation-1.20e-01

Pos logit contributions
%%%%+2.611
IJ+2.503
Submit+2.493
 Hearing+2.120
NPR+2.117
Neg logit contributions
alore-6.264
liga-5.538
rade-5.533
ammy-5.476
aints-5.400
Tokenmove
Token ID21084
Feature activation+8.83e-01

Pos logit contributions
alore+0.425
rade+0.337
liga+0.329
aints+0.327
ints+0.318
Neg logit contributions
%%%%-0.700
IJ-0.688
Submit-0.678
NPR-0.641
 Hearing-0.638
Token()
Token ID3419
Feature activation+1.47e+00

Pos logit contributions
%%%%+3.122
IJ+2.973
Submit+2.970
 Hearing+2.630
NPR+2.618
Neg logit contributions
alore-5.091
rade-4.438
liga-4.418
ammy-4.373
oat-4.251
Token and
Token ID290
Feature activation+6.72e-01

Pos logit contributions
%%%%+5.753
Submit+5.506
IJ+5.500
 Hearing+5.066
NPR+4.936
Neg logit contributions
alore-7.487
liga-6.545
rade-6.469
ammy-6.416
ints-6.147
Token Board
Token ID5926
Feature activation+8.46e-01

Pos logit contributions
%%%%+2.604
Submit+2.487
IJ+2.484
 Hearing+2.306
NPR+2.233
Neg logit contributions
alore-3.682
rade-3.181
liga-3.160
aints-3.133
ammy-3.091
Token::
Token ID3712
Feature activation+6.43e-02

Pos logit contributions
%%%%+2.556
IJ+2.415
Submit+2.410
 Hearing+2.028
NPR+2.011
Neg logit contributions
alore-5.443
liga-4.728
rade-4.704
aints-4.684
ammy-4.663
Tokenundo
Token ID41204
Feature activation+2.01e-01

Pos logit contributions
%%%%+0.390
IJ+0.384
Submit+0.380
NPR+0.358
 Hearing+0.356
Neg logit contributions
alore-0.214
rade-0.165
liga-0.162
aints-0.161
ints-0.155
Token_
Token ID62
Feature activation-4.02e-01

Pos logit contributions
%%%%+0.623
IJ+0.602
Submit+0.585
NPR+0.523
 Hearing+0.518
Neg logit contributions
alore-1.275
rade-1.124
liga-1.114
aints-1.108
ints-1.096
Token Worth
Token ID22301
Feature activation-3.34e-01

Pos logit contributions
alore+0.971
rade+0.833
liga+0.821
aints+0.815
ints+0.805
Neg logit contributions
%%%%-0.813
IJ-0.797
Submit-0.778
 Hearing-0.723
NPR-0.722
Token and
Token ID290
Feature activation-1.23e-02

Pos logit contributions
alore+1.573
rade+1.333
liga+1.300
aints+1.297
ints+1.279
Neg logit contributions
%%%%-1.560
IJ-1.525
Submit-1.491
NPR-1.398
 Hearing-1.393
Token then
Token ID788
Feature activation+5.49e-01

Pos logit contributions
alore+0.059
rade+0.050
liga+0.049
aints+0.049
ints+0.048
Neg logit contributions
%%%%-0.056
IJ-0.055
Submit-0.054
NPR-0.050
 Hearing-0.050
Token the
Token ID262
Feature activation+4.17e-02

Pos logit contributions
%%%%+2.355
IJ+2.334
Submit+2.291
NPR+2.126
 Hearing+2.113
Neg logit contributions
alore-2.721
rade-2.326
aints-2.291
liga-2.289
ints-2.227
Token Mar
Token ID1526
Feature activation+5.94e-01

Pos logit contributions
%%%%+0.176
IJ+0.172
Submit+0.168
NPR+0.156
 Hearing+0.156
Neg logit contributions
alore-0.215
rade-0.185
liga-0.182
aints-0.181
ints-0.179
Tokencell
Token ID3846
Feature activation+1.47e+00

Pos logit contributions
%%%%+2.222
IJ+2.208
Submit+2.151
 Hearing+1.980
NPR+1.926
Neg logit contributions
alore-3.286
liga-2.810
rade-2.780
aints-2.774
oat-2.772
Tokenus
Token ID385
Feature activation+2.54e-01

Pos logit contributions
IJ+4.688
%%%%+4.342
Submit+4.301
 Hearing+4.152
NPR+4.077
Neg logit contributions
alore-8.408
ammy-7.511
aints-7.509
rade-7.419
liga-7.405
Token shale
Token ID30498
Feature activation+6.19e-01

Pos logit contributions
%%%%+1.046
IJ+1.030
Submit+1.007
 Hearing+0.930
NPR+0.921
Neg logit contributions
alore-1.338
rade-1.153
liga-1.132
aints-1.127
ints-1.114
Token of
Token ID286
Feature activation+2.25e-01

Pos logit contributions
%%%%+2.683
IJ+2.653
Submit+2.652
 Hearing+2.406
NPR+2.405
Neg logit contributions
alore-3.052
rade-2.581
aints-2.547
liga-2.543
ints-2.526
Token Pennsylvania
Token ID9589
Feature activation-2.23e-01

Pos logit contributions
%%%%+0.968
IJ+0.952
Submit+0.931
 Hearing+0.864
NPR+0.858
Neg logit contributions
alore-1.139
rade-0.977
liga-0.964
aints-0.957
ints-0.939
Token,
Token ID11
Feature activation-2.89e-01

Pos logit contributions
alore+1.093
rade+0.932
liga+0.915
aints+0.912
ints+0.898
Neg logit contributions
%%%%-1.001
IJ-0.977
Submit-0.953
NPR-0.890
 Hearing-0.889
Token.
Token ID13
Feature activation+9.03e-01

Pos logit contributions
%%%%+1.872
IJ+1.841
Submit+1.795
 Hearing+1.682
NPR+1.678
Neg logit contributions
alore-1.725
rade-1.443
liga-1.421
aints-1.397
ints-1.376
Token
Token ID198
Feature activation+6.06e-01

Pos logit contributions
%%%%+3.049
IJ+2.981
Submit+2.884
 Hearing+2.711
NPR+2.618
Neg logit contributions
alore-5.425
rade-4.705
liga-4.663
aints-4.625
ints-4.574
Token
Token ID198
Feature activation+6.72e-01

Pos logit contributions
%%%%+2.259
IJ+2.202
Submit+2.149
NPR+1.965
 Hearing+1.943
Neg logit contributions
alore-3.482
liga-2.986
rade-2.976
aints-2.949
ammy-2.920
TokenCharles
Token ID28711
Feature activation+3.98e-01

Pos logit contributions
%%%%+1.379
IJ+1.306
Submit+1.246
NPR+1.042
Psy+1.028
Neg logit contributions
alore-4.963
liga-4.439
rade-4.431
aints-4.365
ints-4.336
Token Burns
Token ID23592
Feature activation+6.74e-01

Pos logit contributions
%%%%+1.447
IJ+1.406
Submit+1.362
 Hearing+1.253
NPR+1.243
Neg logit contributions
alore-2.301
rade-2.009
liga-1.984
aints-1.969
ints-1.948
Token :
Token ID1058
Feature activation+1.46e+00

Pos logit contributions
%%%%+2.465
IJ+2.398
Submit+2.316
 Hearing+2.150
NPR+2.092
Neg logit contributions
alore-3.897
rade-3.365
liga-3.343
aints-3.305
ammy-3.265
Token Yeah
Token ID9425
Feature activation+7.74e-01

Pos logit contributions
%%%%+3.825
IJ+3.743
Submit+3.569
 Hearing+3.467
 Again+3.260
Neg logit contributions
alore-9.709
rade-8.482
liga-8.455
aints-8.388
ints-8.315
Token,
Token ID11
Feature activation+4.37e-01

Pos logit contributions
%%%%+3.338
IJ+3.290
Submit+3.187
 Hearing+2.979
NPR+2.955
Neg logit contributions
alore-3.886
rade-3.313
aints-3.255
liga-3.252
ints-3.190
Token I
Token ID314
Feature activation+4.92e-01

Pos logit contributions
%%%%+1.781
IJ+1.742
Submit+1.697
NPR+1.570
 Hearing+1.565
Neg logit contributions
alore-2.326
rade-2.012
liga-1.977
aints-1.971
ints-1.944
Token mean
Token ID1612
Feature activation-1.10e-01

Pos logit contributions
%%%%+2.157
IJ+2.154
Submit+2.084
NPR+1.960
 Hearing+1.949
Neg logit contributions
alore-2.408
rade-2.047
aints-2.012
liga-2.011
ints-1.965
Token
Token ID851
Feature activation+5.63e-01

Pos logit contributions
alore+0.531
rade+0.455
liga+0.446
aints+0.441
ints+0.437
Neg logit contributions
%%%%-0.502
IJ-0.491
Submit-0.481
NPR-0.450
 Hearing-0.444
Token.
Token ID13
Feature activation-1.82e-01

Pos logit contributions
alore+2.282
rade+1.945
liga+1.915
aints+1.903
ints+1.865
Neg logit contributions
%%%%-2.460
IJ-2.403
Submit-2.352
NPR-2.216
-2.195
Token
Token ID198
Feature activation-4.23e-01

Pos logit contributions
alore+0.997
rade+0.868
liga+0.854
aints+0.854
ints+0.836
Neg logit contributions
%%%%-0.713
IJ-0.694
Submit-0.675
NPR-0.624
-0.615
Token
Token ID198
Feature activation-7.37e-02

Pos logit contributions
alore+2.084
rade+1.823
aints+1.767
liga+1.754
ints+1.746
Neg logit contributions
%%%%-1.821
IJ-1.788
Submit-1.734
-1.646
 Hearing-1.630
Tokenfile
Token ID7753
Feature activation+3.47e-01

Pos logit contributions
alore+0.377
rade+0.324
liga+0.318
aints+0.317
ints+0.312
Neg logit contributions
%%%%-0.316
IJ-0.310
Submit-0.302
NPR-0.280
 Hearing-0.279
Token and
Token ID290
Feature activation+1.13e+00

Pos logit contributions
%%%%+1.528
IJ+1.492
Submit+1.466
 Hearing+1.357
NPR+1.340
Neg logit contributions
alore-1.753
rade-1.474
liga-1.446
aints-1.434
ints-1.424
Token creates
Token ID8075
Feature activation+1.43e+00

Pos logit contributions
%%%%+4.676
IJ+4.652
Submit+4.582
 Hearing+4.260
NPR+4.128
Neg logit contributions
alore-5.772
rade-4.836
liga-4.798
ammy-4.664
ints-4.617
Token corresponding
Token ID11188
Feature activation+1.01e+00

Pos logit contributions
%%%%+3.618
IJ+3.530
Submit+3.303
 Hearing+3.134
Psy+2.805
Neg logit contributions
alore-9.618
liga-8.515
rade-8.464
aints-8.396
ints-8.347
Token templates
Token ID24019
Feature activation+1.23e+00

Pos logit contributions
%%%%+4.437
IJ+4.398
Submit+4.291
 Hearing+4.106
 Continuous+3.993
Neg logit contributions
alore-4.819
rade-4.158
liga-4.043
ints-3.954
ammy-3.897
Token.
Token ID13
Feature activation+1.55e+00

Pos logit contributions
%%%%+4.865
IJ+4.799
Submit+4.641
 Hearing+4.270
NPR+4.163
Neg logit contributions
alore-6.693
rade-5.579
ammy-5.546
aints-5.521
liga-5.489
Token This
Token ID770
Feature activation+1.30e+00

Pos logit contributions
%%%%+3.081
IJ+2.996
Submit+2.966
 Hearing+2.709
 Again+2.602
Neg logit contributions
alore-11.400
ammy-9.939
liga-9.936
rade-9.901
ints-9.817
Token templates
Token ID24019
Feature activation+3.91e-01

Pos logit contributions
%%%%+6.088
IJ+5.811
Submit+5.778
 Hearing+5.267
Psy+5.243
Neg logit contributions
alore-5.872
rade-4.938
liga-4.866
ashtra-4.683
ints-4.605
Token usually
Token ID3221
Feature activation-1.30e-01

Pos logit contributions
alore+0.852
rade+0.733
liga+0.717
aints+0.714
ints+0.705
Neg logit contributions
%%%%-0.731
IJ-0.715
Submit-0.697
NPR-0.648
 Hearing-0.645
Token happens
Token ID4325
Feature activation+3.55e-01

Pos logit contributions
alore+0.631
rade+0.538
liga+0.528
aints+0.526
ints+0.518
Neg logit contributions
%%%%-0.589
IJ-0.577
Submit-0.564
NPR-0.526
 Hearing-0.525
Token at
Token ID379
Feature activation+2.36e-01

Pos logit contributions
%%%%+1.536
IJ+1.495
Submit+1.450
 Hearing+1.358
NPR+1.356
Neg logit contributions
alore-1.800
rade-1.539
liga-1.512
aints-1.508
ints-1.492
Token an
Token ID281
Feature activation+4.98e-01

Pos logit contributions
%%%%+1.061
IJ+1.036
Submit+1.012
 Hearing+0.946
NPR+0.941
Neg logit contributions
alore-1.163
rade-0.987
liga-0.970
aints-0.964
ints-0.949
Token event
Token ID1785
Feature activation+5.66e-01

Pos logit contributions
%%%%+2.121
IJ+2.089
Submit+2.035
NPR+1.902
 Hearing+1.893
Neg logit contributions
alore-2.535
rade-2.164
liga-2.143
aints-2.107
ints-2.073
Token filled
Token ID5901
Feature activation+1.43e+00

Pos logit contributions
%%%%+2.306
IJ+2.239
Submit+2.164
 Hearing+2.020
NPR+2.013
Neg logit contributions
alore-2.999
rade-2.573
aints-2.547
liga-2.523
ints-2.505
Token with
Token ID351
Feature activation+6.18e-01

Pos logit contributions
%%%%+5.458
IJ+5.448
Submit+5.207
 Hearing+4.892
NPR+4.848
Neg logit contributions
alore-7.332
rade-6.429
ints-6.277
liga-6.262
ammy-6.240
Token Tor
Token ID4022
Feature activation+5.05e-01

Pos logit contributions
%%%%+2.709
IJ+2.636
Submit+2.590
 Hearing+2.417
NPR+2.385
Neg logit contributions
alore-3.058
rade-2.603
liga-2.549
aints-2.548
ammy-2.512
Tokenont
Token ID756
Feature activation+9.30e-01

Pos logit contributions
%%%%+2.988
IJ+2.954
Submit+2.915
 Hearing+2.745
NPR+2.734
Neg logit contributions
alore-1.748
rade-1.360
liga-1.337
aints-1.308
ammy-1.292
Tokenon
Token ID261
Feature activation+2.36e-01

Pos logit contributions
%%%%+4.392
Submit+4.253
IJ+4.149
NPR+4.044
 Hearing+3.953
Neg logit contributions
alore-4.109
liga-3.501
aints-3.352
netflix-3.340
ammy-3.332
Tokenians
Token ID1547
Feature activation+4.13e-01

Pos logit contributions
%%%%+0.577
IJ+0.549
Submit+0.527
 Hearing+0.457
NPR+0.456
Neg logit contributions
alore-1.645
rade-1.473
liga-1.456
aints-1.445
ammy-1.433
Token the
Token ID262
Feature activation+7.07e-01

Pos logit contributions
%%%%+3.410
IJ+3.364
Submit+3.220
 Hearing+3.066
NPR+2.962
Neg logit contributions
alore-5.217
rade-4.552
aints-4.540
liga-4.534
ammy-4.420
Token headquarters
Token ID10043
Feature activation+1.25e+00

Pos logit contributions
%%%%+2.584
IJ+2.534
Submit+2.434
 Hearing+2.294
NPR+2.246
Neg logit contributions
alore-3.962
rade-3.497
liga-3.434
aints-3.424
ints-3.353
Token of
Token ID286
Feature activation+1.02e+00

Pos logit contributions
%%%%+3.530
IJ+3.418
Submit+3.238
 Hearing+2.948
NPR+2.927
Neg logit contributions
alore-8.053
aints-7.026
rade-6.996
liga-6.971
ammy-6.955
Token the
Token ID262
Feature activation+5.18e-01

Pos logit contributions
IJ+3.579
%%%%+3.553
Submit+3.408
 Hearing+3.195
NPR+3.112
Neg logit contributions
alore-5.692
aints-4.990
rade-4.932
liga-4.907
ammy-4.830
Token regional
Token ID7915
Feature activation+5.35e-01

Pos logit contributions
%%%%+1.319
IJ+1.288
Submit+1.237
 Hearing+1.098
NPR+1.096
Neg logit contributions
alore-3.482
rade-3.123
aints-3.095
liga-3.082
ints-3.043
Token government
Token ID1230
Feature activation+1.43e+00

Pos logit contributions
%%%%+2.007
IJ+1.988
Submit+1.906
 Hearing+1.773
NPR+1.770
Neg logit contributions
alore-2.978
rade-2.594
liga-2.548
aints-2.547
ints-2.520
Token.
Token ID13
Feature activation+9.33e-01

Pos logit contributions
IJ+5.416
%%%%+5.359
 Hearing+5.083
Submit+5.066
NPR+4.856
Neg logit contributions
alore-7.304
rade-6.236
liga-6.096
oat-6.094
ammy-6.088
Token
Token ID198
Feature activation+5.21e-01

Pos logit contributions
%%%%+3.188
IJ+3.172
Submit+3.036
 Hearing+2.972
NPR+2.815
Neg logit contributions
alore-5.347
rade-4.620
aints-4.532
liga-4.529
ammy-4.501
Token
Token ID198
Feature activation+6.23e-01

Pos logit contributions
%%%%+1.948
IJ+1.892
Submit+1.849
NPR+1.684
 Hearing+1.678
Neg logit contributions
alore-2.990
liga-2.569
rade-2.559
aints-2.532
ammy-2.506
TokenSimilar
Token ID18925
Feature activation+3.97e-01

Pos logit contributions
%%%%+2.146
IJ+2.136
Submit+2.072
NPR+1.897
Psy+1.863
Neg logit contributions
alore-3.718
rade-3.205
liga-3.196
aints-3.168
ammy-3.158
Token tractor
Token ID38278
Feature activation+3.70e-01

Pos logit contributions
%%%%+1.896
IJ+1.861
Submit+1.825
 Hearing+1.711
NPR+1.685
Neg logit contributions
alore-1.797
rade-1.528
aints-1.475
liga-1.474
ammy-1.458
Token,
Token ID11
Feature activation+1.06e+00

Pos logit contributions
%%%%+4.226
IJ+4.028
Submit+3.984
NPR+3.624
Psy+3.581
Neg logit contributions
alore-6.006
liga-5.066
aints-5.026
ammy-5.014
rade-5.004
Token and
Token ID290
Feature activation+1.26e+00

Pos logit contributions
%%%%+4.196
IJ+4.014
Submit+3.968
 Hearing+3.598
NPR+3.569
Neg logit contributions
alore-5.824
liga-4.929
rade-4.898
ammy-4.896
aints-4.865
Token must
Token ID1276
Feature activation+1.80e+00

Pos logit contributions
%%%%+4.849
Submit+4.516
IJ+4.423
 Hearing+4.105
 Continuous+3.989
Neg logit contributions
alore-6.970
liga-5.858
ammy-5.824
rade-5.789
aints-5.734
Token roll
Token ID4836
Feature activation+1.67e+00

Pos logit contributions
%%%%+8.537
Submit+7.926
IJ+7.698
Psy+7.389
 Continuous+7.269
Neg logit contributions
alore-7.799
liga-6.470
ints-6.364
rade-6.362
aints-6.300
Token to
Token ID284
Feature activation+1.61e+00

Pos logit contributions
%%%%+5.034
Submit+4.759
IJ+4.719
 Hearing+4.209
NPR+4.045
Neg logit contributions
alore-10.116
aints-8.834
liga-8.801
ammy-8.747
ints-8.738
Token avoid
Token ID3368
Feature activation+1.42e+00

Pos logit contributions
%%%%+5.362
Submit+4.733
IJ+4.718
 Hearing+4.249
 Again+4.179
Neg logit contributions
alore-9.638
liga-8.642
rade-8.371
aints-8.357
oat-8.330
Token following
Token ID1708
Feature activation+8.10e-01

Pos logit contributions
%%%%+4.403
IJ+4.134
Submit+3.905
 Hearing+3.715
NPR+3.565
Neg logit contributions
alore-8.894
rade-7.942
aints-7.758
liga-7.737
ints-7.730
Token the
Token ID262
Feature activation+1.14e+00

Pos logit contributions
%%%%+3.023
IJ+2.926
Submit+2.823
 Hearing+2.566
NPR+2.560
Neg logit contributions
alore-4.634
rade-4.006
liga-3.994
aints-3.971
ammy-3.934
Token commands
Token ID9729
Feature activation+1.39e+00

Pos logit contributions
%%%%+5.492
IJ+5.294
Submit+5.097
NPR+4.797
 Hearing+4.754
Neg logit contributions
alore-5.217
liga-4.348
rade-4.317
ammy-4.250
ints-4.098
Token in
Token ID287
Feature activation+1.29e+00

Pos logit contributions
%%%%+5.270
IJ+5.040
Submit+4.928
NPR+4.496
 Hearing+4.398
Neg logit contributions
alore-7.726
ammy-6.521
aints-6.511
liga-6.508
rade-6.395
Token his
Token ID465
Feature activation+8.22e-01

Pos logit contributions
%%%%+4.645
IJ+4.435
Submit+4.303
 Hearing+4.048
NPR+4.024
Neg logit contributions
alore-7.208
liga-6.365
aints-6.247
rade-6.210
ammy-6.131
Token
Token ID198
Feature activation-1.72e-01

Pos logit contributions
alore+1.910
rade+1.668
aints+1.619
liga+1.606
ints+1.600
Neg logit contributions
%%%%-1.656
IJ-1.625
Submit-1.578
-1.492
 Hearing-1.478
Tokenis
Token ID271
Feature activation+7.68e-01

Pos logit contributions
alore+0.930
rade+0.807
liga+0.794
aints+0.792
ints+0.781
Neg logit contributions
%%%%-0.680
IJ-0.664
Submit-0.645
 Hearing-0.597
NPR-0.597
Token created
Token ID2727
Feature activation+9.37e-01

Pos logit contributions
%%%%+3.345
IJ+3.330
Submit+3.269
 Hearing+3.017
NPR+2.989
Neg logit contributions
alore-3.779
rade-3.219
liga-3.156
ints-3.136
aints-3.099
Token and
Token ID290
Feature activation+1.39e+00

Pos logit contributions
%%%%+3.762
IJ+3.665
Submit+3.600
 Hearing+3.299
NPR+3.253
Neg logit contributions
alore-5.010
rade-4.226
liga-4.220
ammy-4.173
aints-4.171
Token the
Token ID262
Feature activation+1.00e+00

Pos logit contributions
%%%%+4.604
IJ+4.513
Submit+4.362
 Hearing+3.950
Psy+3.784
Neg logit contributions
alore-8.178
liga-7.074
aints-7.046
ammy-7.027
rade-7.026
Token archived
Token ID33962
Feature activation+1.42e+00

Pos logit contributions
%%%%+4.188
IJ+4.175
Submit+4.027
 Hearing+3.827
NPR+3.741
Neg logit contributions
alore-5.050
rade-4.337
liga-4.222
ints-4.183
aints-4.115
Token file
Token ID2393
Feature activation+8.98e-01

Pos logit contributions
%%%%+5.493
IJ+5.437
Submit+5.306
 Hearing+4.915
 Continuous+4.730
Neg logit contributions
alore-7.587
liga-6.397
rade-6.361
ammy-6.338
ints-6.247
Token is
Token ID318
Feature activation+8.01e-01

Pos logit contributions
%%%%+3.276
IJ+3.201
Submit+3.157
NPR+2.837
 Hearing+2.821
Neg logit contributions
alore-5.133
liga-4.435
rade-4.376
ammy-4.366
aints-4.295
Token uploaded
Token ID19144
Feature activation+9.92e-01

Pos logit contributions
%%%%+2.476
IJ+2.427
Submit+2.410
NPR+2.103
 Hearing+2.089
Neg logit contributions
alore-4.983
rade-4.358
liga-4.346
ints-4.322
aints-4.273
Token to
Token ID284
Feature activation+9.75e-01

Pos logit contributions
%%%%+3.387
IJ+3.273
Submit+3.222
 Hearing+2.906
NPR+2.890
Neg logit contributions
alore-5.863
liga-5.067
aints-5.045
ints-5.041
ammy-5.036
Token it
Token ID340
Feature activation+8.49e-01

Pos logit contributions
%%%%+3.162
IJ+3.120
Submit+3.026
NPR+2.704
 Hearing+2.683
Neg logit contributions
alore-5.823
rade-5.133
aints-5.062
liga-5.052
ammy-5.045
Token
Token ID447
Feature activation-1.95e-01

Pos logit contributions
alore+2.523
rade+2.168
aints+2.122
liga+2.107
ints+2.080
Neg logit contributions
%%%%-2.325
IJ-2.249
Submit-2.220
NPR-2.052
 Hearing-2.041
Token
Token ID247
Feature activation-1.32e-01

Pos logit contributions
alore+1.342
rade+1.206
liga+1.187
aints+1.182
ints+1.174
Neg logit contributions
%%%%-0.478
IJ-0.465
Submit-0.443
NPR-0.387
 Hearing-0.386
Tokens
Token ID82
Feature activation-5.84e-01

Pos logit contributions
alore+0.606
rade+0.509
liga+0.500
aints+0.497
ints+0.490
Neg logit contributions
%%%%-0.634
IJ-0.623
Submit-0.609
NPR-0.570
 Hearing-0.570
Token director
Token ID3437
Feature activation-6.89e-01

Pos logit contributions
alore+2.750
rade+2.368
aints+2.331
liga+2.283
ints+2.269
Neg logit contributions
%%%%-2.680
IJ-2.588
Submit-2.551
-2.390
NPR-2.357
Token of
Token ID286
Feature activation-8.02e-01

Pos logit contributions
alore+3.701
rade+3.223
liga+3.177
aints+3.174
ints+3.093
Neg logit contributions
%%%%-2.671
IJ-2.637
Submit-2.571
NPR-2.353
Psy-2.331
Token research
Token ID2267
Feature activation-1.64e+00

Pos logit contributions
alore+3.835
rade+3.297
aints+3.242
liga+3.226
ints+3.205
Neg logit contributions
%%%%-3.574
IJ-3.423
Submit-3.395
-3.211
NPR-3.135
Token,
Token ID11
Feature activation-6.70e-01

Pos logit contributions
alore+6.712
rade+5.780
aints+5.708
liga+5.497
oat+5.356
Neg logit contributions
%%%%-7.948
IJ-7.694
Submit-7.661
-7.222
NPR-7.060
Token
Token ID564
Feature activation-3.94e-01

Pos logit contributions
alore+3.422
rade+2.973
aints+2.927
liga+2.884
ints+2.842
Neg logit contributions
%%%%-2.819
IJ-2.739
Submit-2.682
NPR-2.478
-2.467
Token
Token ID250
Feature activation-3.10e-01

Pos logit contributions
alore+2.561
rade+2.285
liga+2.245
aints+2.237
ints+2.225
Neg logit contributions
%%%%-1.121
IJ-1.101
Submit-1.049
NPR-0.941
 Hearing-0.932
TokenF
Token ID37
Feature activation-2.01e-01

Pos logit contributions
alore+1.625
rade+1.408
aints+1.382
liga+1.378
ints+1.361
Neg logit contributions
%%%%-1.279
IJ-1.250
Submit-1.216
 Hearing-1.129
NPR-1.125
Tokenam
Token ID321
Feature activation-2.06e-01

Pos logit contributions
alore+0.987
rade+0.844
aints+0.826
liga+0.825
ints+0.814
Neg logit contributions
%%%%-0.897
IJ-0.878
Submit-0.859
 Hearing-0.799
NPR-0.799
Token
Token ID198
Feature activation+1.30e-01

Pos logit contributions
alore+0.839
rade+0.742
aints+0.724
liga+0.723
ints+0.713
Neg logit contributions
%%%%-0.574
IJ-0.558
Submit-0.544
NPR-0.497
-0.493
Token
Token ID198
Feature activation+1.43e-01

Pos logit contributions
%%%%+0.521
IJ+0.509
Submit+0.496
NPR+0.458
 Hearing+0.456
Neg logit contributions
alore-0.702
rade-0.607
liga-0.598
aints-0.596
ints-0.588
TokenBut
Token ID1537
Feature activation-1.63e-01

Pos logit contributions
%%%%+0.483
IJ+0.471
Submit+0.456
NPR+0.415
 Hearing+0.410
Neg logit contributions
alore-0.868
rade-0.762
liga-0.753
aints-0.750
ints-0.740
Token if
Token ID611
Feature activation+9.04e-02

Pos logit contributions
alore+0.827
rade+0.716
liga+0.697
aints+0.694
ints+0.686
Neg logit contributions
%%%%-0.700
IJ-0.684
Submit-0.668
NPR-0.620
 Hearing-0.616
Token we
Token ID356
Feature activation-4.40e-01

Pos logit contributions
%%%%+0.386
IJ+0.379
Submit+0.369
NPR+0.343
 Hearing+0.342
Neg logit contributions
alore-0.462
rade-0.396
liga-0.390
aints-0.389
ints-0.383
Token take
Token ID1011
Feature activation-1.55e+00

Pos logit contributions
alore+2.171
rade+1.851
liga+1.815
aints+1.805
ints+1.791
Neg logit contributions
%%%%-1.968
IJ-1.917
Submit-1.871
NPR-1.744
 Hearing-1.744
Token the
Token ID262
Feature activation-5.76e-01

Pos logit contributions
alore+6.862
rade+6.028
aints+5.754
ints+5.712
oat+5.688
Neg logit contributions
%%%%-7.266
Submit-6.859
IJ-6.855
Psy-6.348
NPR-6.328
Token phone
Token ID3072
Feature activation+5.25e-01

Pos logit contributions
alore+2.736
rade+2.329
liga+2.269
aints+2.260
ints+2.246
Neg logit contributions
%%%%-2.664
IJ-2.594
Submit-2.546
NPR-2.378
-2.356
Token calls
Token ID3848
Feature activation-1.30e-01

Pos logit contributions
%%%%+2.080
IJ+2.046
Submit+1.966
 Hearing+1.828
NPR+1.819
Neg logit contributions
alore-2.819
rade-2.454
liga-2.424
aints-2.421
ints-2.381
Token,
Token ID11
Feature activation-9.42e-02

Pos logit contributions
alore+0.662
rade+0.569
liga+0.558
aints+0.554
ints+0.547
Neg logit contributions
%%%%-0.560
IJ-0.547
Submit-0.537
NPR-0.495
 Hearing-0.494
Token repeated
Token ID5100
Feature activation-6.35e-02

Pos logit contributions
alore+0.478
rade+0.414
liga+0.402
aints+0.401
ints+0.396
Neg logit contributions
%%%%-0.407
IJ-0.397
Submit-0.389
NPR-0.361
 Hearing-0.359
Token quickly
Token ID2952
Feature activation+2.76e-01

Pos logit contributions
alore+0.597
rade+0.511
liga+0.501
aints+0.499
ints+0.493
Neg logit contributions
%%%%-0.530
IJ-0.518
Submit-0.507
NPR-0.471
 Hearing-0.469
Token came
Token ID1625
Feature activation-6.46e-01

Pos logit contributions
%%%%+1.222
IJ+1.206
Submit+1.176
 Hearing+1.103
NPR+1.096
Neg logit contributions
alore-1.353
rade-1.156
liga-1.144
aints-1.131
ints-1.115
Token to
Token ID284
Feature activation-3.90e-01

Pos logit contributions
alore+3.255
rade+2.793
aints+2.747
liga+2.705
ints+2.703
Neg logit contributions
%%%%-2.796
IJ-2.690
Submit-2.666
NPR-2.475
Psy-2.454
Token a
Token ID257
Feature activation-3.40e-01

Pos logit contributions
alore+1.943
rade+1.667
aints+1.636
liga+1.623
ints+1.612
Neg logit contributions
%%%%-1.715
IJ-1.672
Submit-1.633
NPR-1.527
 Hearing-1.509
Token steep
Token ID14559
Feature activation-7.42e-01

Pos logit contributions
alore+1.581
rade+1.351
aints+1.306
liga+1.303
ints+1.286
Neg logit contributions
%%%%-1.608
IJ-1.571
Submit-1.524
NPR-1.448
-1.426
Token incl
Token ID13358
Feature activation-1.52e+00

Pos logit contributions
alore+3.358
rade+2.888
aints+2.760
liga+2.724
ints+2.697
Neg logit contributions
%%%%-3.584
IJ-3.462
Submit-3.342
NPR-3.205
 Hearing-3.166
Tokenine
Token ID500
Feature activation-7.71e-01

Pos logit contributions
alore+4.124
rade+3.452
aints+3.176
ints+3.066
oat+3.043
Neg logit contributions
%%%%-9.511
IJ-9.361
Submit-9.106
-9.000
NPR-8.984
Token at
Token ID379
Feature activation+1.20e-02

Pos logit contributions
alore+3.556
rade+3.079
aints+2.967
liga+2.917
ints+2.897
Neg logit contributions
%%%%-3.617
IJ-3.521
Submit-3.408
NPR-3.226
 Hearing-3.203
Token mile
Token ID10591
Feature activation+8.39e-02

Pos logit contributions
%%%%+0.050
IJ+0.049
Submit+0.048
NPR+0.044
 Hearing+0.044
Neg logit contributions
alore-0.063
rade-0.054
liga-0.053
aints-0.053
ints-0.052
Token 16
Token ID1467
Feature activation-1.45e-01

Pos logit contributions
%%%%+0.344
IJ+0.337
Submit+0.328
NPR+0.303
 Hearing+0.303
Neg logit contributions
alore-0.444
rade-0.384
liga-0.377
aints-0.376
ints-0.371
Token.
Token ID13
Feature activation-7.52e-02

Pos logit contributions
alore+0.706
rade+0.605
aints+0.589
liga+0.588
ints+0.581
Neg logit contributions
%%%%-0.657
IJ-0.644
Submit-0.628
NPR-0.585
 Hearing-0.585
Tokenxit
Token ID10198
Feature activation-5.59e-01

Pos logit contributions
%%%%+0.226
IJ+0.222
Submit+0.215
 Hearing+0.201
NPR+0.201
Neg logit contributions
alore-0.232
rade-0.196
aints-0.192
liga-0.192
ints-0.188
Token-
Token ID12
Feature activation-9.90e-01

Pos logit contributions
alore+3.129
rade+2.762
aints+2.695
liga+2.669
ints+2.658
Neg logit contributions
IJ-2.042
%%%%-2.041
Submit-1.975
-1.847
NPR-1.832
Tokenis
Token ID271
Feature activation-4.70e-01

Pos logit contributions
alore+4.284
rade+3.571
aints+3.507
ints+3.500
oat+3.450
Neg logit contributions
%%%%-4.887
IJ-4.835
Submit-4.767
 Hearing-4.580
-4.555
Token-
Token ID12
Feature activation-1.08e+00

Pos logit contributions
alore+2.659
rade+2.352
liga+2.275
aints+2.273
ints+2.260
Neg logit contributions
%%%%-1.709
IJ-1.691
Submit-1.652
 Hearing-1.532
NPR-1.526
Tokenthe
Token ID1169
Feature activation-8.12e-01

Pos logit contributions
alore+4.851
rade+4.153
liga+3.986
ints+3.964
aints+3.950
Neg logit contributions
%%%%-5.146
IJ-5.060
Submit-5.023
 Hearing-4.796
-4.723
Token-
Token ID12
Feature activation-1.49e+00

Pos logit contributions
alore+4.502
rade+3.956
aints+3.809
liga+3.800
ints+3.760
Neg logit contributions
IJ-3.051
%%%%-3.041
Submit-2.937
-2.791
 Hearing-2.751
Tokenprime
Token ID35505
Feature activation-5.36e-01

Pos logit contributions
alore+5.355
rade+4.483
liga+4.161
aints+4.145
oat+4.096
Neg logit contributions
%%%%-8.326
IJ-8.250
Submit-8.127
-7.932
 Hearing-7.908
Token-
Token ID12
Feature activation-1.26e+00

Pos logit contributions
alore+2.955
rade+2.631
aints+2.537
liga+2.530
ints+2.505
Neg logit contributions
%%%%-1.991
IJ-1.960
Submit-1.911
 Hearing-1.795
-1.786
Tokenchall
Token ID36747
Feature activation-2.25e-01

Pos logit contributions
alore+4.800
rade+4.174
aints+3.968
ints+3.934
oat+3.883
Neg logit contributions
%%%%-6.578
IJ-6.401
Submit-6.323
 Hearing-6.276
-6.246
Tokenenge
Token ID3540
Feature activation-3.40e-01

Pos logit contributions
alore+0.611
rade+0.449
liga+0.432
aints+0.428
ints+0.415
Neg logit contributions
%%%%-1.505
IJ-1.485
Submit-1.462
NPR-1.396
 Hearing-1.394
Token-
Token ID12
Feature activation-5.20e-01

Pos logit contributions
alore+1.911
rade+1.673
aints+1.662
liga+1.652
ints+1.632
Neg logit contributions
%%%%-1.245
IJ-1.234
Submit-1.189
NPR-1.093
 Hearing-1.075
Token:
Token ID25
Feature activation-2.03e-01

Pos logit contributions
%%%%+0.234
IJ+0.229
Submit+0.224
NPR+0.209
 Hearing+0.209
Neg logit contributions
alore-0.255
rade-0.217
liga-0.213
aints-0.212
ints-0.209
Token
Token ID198
Feature activation-1.22e-01

Pos logit contributions
alore+1.141
rade+0.998
liga+0.979
aints+0.979
ints+0.965
Neg logit contributions
%%%%-0.761
IJ-0.743
Submit-0.718
NPR-0.657
 Hearing-0.654
Token
Token ID198
Feature activation-8.46e-03

Pos logit contributions
alore+0.635
rade+0.552
aints+0.538
liga+0.538
ints+0.532
Neg logit contributions
%%%%-0.507
IJ-0.496
Submit-0.483
 Hearing-0.448
NPR-0.448
Token13
Token ID1485
Feature activation-9.25e-02

Pos logit contributions
alore+0.048
rade+0.042
liga+0.041
aints+0.041
ints+0.040
Neg logit contributions
%%%%-0.032
IJ-0.031
Submit-0.030
NPR-0.028
 Hearing-0.028
Token PH
Token ID9370
Feature activation-4.05e-01

Pos logit contributions
alore+0.438
rade+0.372
liga+0.364
aints+0.363
ints+0.358
Neg logit contributions
%%%%-0.430
IJ-0.422
Submit-0.412
NPR-0.385
 Hearing-0.384
TokenOTOS
Token ID33291
Feature activation-1.44e+00

Pos logit contributions
alore+2.085
rade+1.793
aints+1.752
liga+1.743
ints+1.736
Neg logit contributions
%%%%-1.708
IJ-1.675
Submit-1.637
 Hearing-1.515
NPR-1.514
Token Donald
Token ID3759
Feature activation-1.32e-01

Pos logit contributions
alore+6.521
rade+5.815
aints+5.509
ints+5.453
liga+5.421
Neg logit contributions
%%%%-6.062
IJ-5.994
Submit-5.904
[undecodable bytes]-5.561
-5.479
Token Trump
Token ID1301
Feature activation-5.69e-01

Pos logit contributions
alore+0.518
rade+0.422
liga+0.412
aints+0.410
ints+0.402
Neg logit contributions
%%%%-0.724
IJ-0.712
Submit-0.698
NPR-0.659
 Hearing-0.659
Token's
Token ID338
Feature activation-8.52e-01

Pos logit contributions
alore+2.713
rade+2.320
aints+2.258
liga+2.254
ints+2.250
Neg logit contributions
%%%%-2.588
IJ-2.535
Submit-2.474
NPR-2.309
Psy-2.288
Token potential
Token ID2785
Feature activation-7.86e-01

Pos logit contributions
alore+3.172
rade+2.614
aints+2.543
ints+2.532
liga+2.529
Neg logit contributions
%%%%-4.671
IJ-4.621
Submit-4.487
-4.294
Psy-4.226
Token running
Token ID2491
Feature activation-1.52e-01

Pos logit contributions
alore+3.256
rade+2.746
aints+2.629
liga+2.619
ints+2.619
Neg logit contributions
%%%%-4.033
IJ-3.976
Submit-3.858
-3.651
Psy-3.624
Token (
Token ID357
Feature activation-8.90e-01

Pos logit contributions
alore+1.505
rade+1.307
aints+1.286
liga+1.278
ints+1.267
Neg logit contributions
%%%%-1.089
IJ-1.064
Submit-1.045
NPR-0.962
 Hearing-0.958
Token107
Token ID15982
Feature activation-1.91e-01

Pos logit contributions
alore+4.306
rade+3.739
aints+3.636
liga+3.621
ints+3.601
Neg logit contributions
%%%%-3.813
IJ-3.739
Submit-3.677
 Hearing-3.459
-3.411
Token,
Token ID11
Feature activation-3.04e-01

Pos logit contributions
alore+0.904
rade+0.769
liga+0.753
aints+0.750
ints+0.739
Neg logit contributions
%%%%-0.884
IJ-0.868
Submit-0.849
NPR-0.793
 Hearing-0.791
Token226
Token ID24909
Feature activation-9.48e-02

Pos logit contributions
alore+1.514
rade+1.300
liga+1.278
aints+1.277
ints+1.258
Neg logit contributions
%%%%-1.325
IJ-1.291
Submit-1.275
NPR-1.181
 Hearing-1.180
Token km
Token ID10571
Feature activation-1.53e-01

Pos logit contributions
alore+0.542
rade+0.472
liga+0.465
aints+0.463
ints+0.458
Neg logit contributions
%%%%-0.348
IJ-0.340
Submit-0.331
NPR-0.302
 Hearing-0.301
Token/
Token ID14
Feature activation-1.43e+00

Pos logit contributions
alore+0.786
rade+0.676
liga+0.664
aints+0.663
ints+0.653
Neg logit contributions
%%%%-0.647
IJ-0.633
Submit-0.619
NPR-0.573
 Hearing-0.572
Tokenh
Token ID71
Feature activation-2.80e-01

Pos logit contributions
alore+5.817
rade+5.027
liga+4.770
aints+4.719
oat+4.707
Neg logit contributions
%%%%-7.027
Submit-6.873
IJ-6.844
-6.570
 Hearing-6.551
Token).
Token ID737
Feature activation-3.26e-01

Pos logit contributions
alore+1.438
rade+1.239
aints+1.211
liga+1.210
ints+1.193
Neg logit contributions
%%%%-1.187
IJ-1.161
Submit-1.138
NPR-1.053
 Hearing-1.050
Token
Token ID198
Feature activation-2.04e-01

Pos logit contributions
alore+1.788
rade+1.583
aints+1.547
liga+1.537
ints+1.515
Neg logit contributions
%%%%-1.234
IJ-1.199
Submit-1.171
-1.068
NPR-1.062
Token
Token ID198
Feature activation-1.82e-01

Pos logit contributions
alore+1.039
rade+0.905
aints+0.881
liga+0.878
ints+0.870
Neg logit contributions
%%%%-0.858
IJ-0.840
Submit-0.816
 Hearing-0.761
-0.759
TokenSun
Token ID16012
Feature activation-2.75e-01

Pos logit contributions
alore+1.107
rade+0.985
aints+0.964
liga+0.963
ints+0.955
Neg logit contributions
%%%%-0.588
IJ-0.570
Submit-0.549
 Hearing-0.507
NPR-0.496
Token-
Token ID12
Feature activation-8.80e-01

Pos logit contributions
alore+1.085
rade+0.948
liga+0.933
aints+0.929
ints+0.917
Neg logit contributions
%%%%-0.714
IJ-0.694
Submit-0.674
NPR-0.619
 Hearing-0.618
Tokenstill
Token ID24219
Feature activation-4.33e-01

Pos logit contributions
alore+4.123
rade+3.533
aints+3.456
liga+3.450
ints+3.412
Neg logit contributions
%%%%-4.032
IJ-3.972
Submit-3.893
 Hearing-3.640
NPR-3.612
Token-
Token ID12
Feature activation-1.24e+00

Pos logit contributions
alore+2.523
rade+2.214
aints+2.178
liga+2.168
ints+2.151
Neg logit contributions
%%%%-1.529
IJ-1.507
Submit-1.462
NPR-1.337
 Hearing-1.327
Tokenh
Token ID71
Feature activation-1.73e-01

Pos logit contributions
alore+5.865
rade+5.029
aints+4.966
ints+4.923
liga+4.847
Neg logit contributions
%%%%-5.413
IJ-5.379
Submit-5.272
 Hearing-4.982
-4.909
Tokenangs
Token ID27725
Feature activation-5.02e-01

Pos logit contributions
alore+0.748
rade+0.624
liga+0.609
aints+0.608
ints+0.599
Neg logit contributions
%%%%-0.880
IJ-0.864
Submit-0.847
NPR-0.797
 Hearing-0.795
Token-
Token ID12
Feature activation-1.40e+00

Pos logit contributions
alore+2.890
rade+2.533
aints+2.506
liga+2.493
ints+2.470
Neg logit contributions
%%%%-1.791
IJ-1.767
Submit-1.715
NPR-1.565
 Hearing-1.556
Tokenor
Token ID273
Feature activation-7.41e-01

Pos logit contributions
alore+6.090
aints+5.121
rade+5.115
ints+5.021
liga+4.994
Neg logit contributions
%%%%-6.583
IJ-6.551
Submit-6.403
 Hearing-6.080
-6.030
Token-
Token ID12
Feature activation-4.00e-01

Pos logit contributions
alore+4.030
rade+3.456
aints+3.449
liga+3.405
ints+3.405
Neg logit contributions
IJ-2.802
Submit-2.766
%%%%-2.764
NPR-2.511
 Hearing-2.489
Tokendoes
Token ID22437
Feature activation+2.93e-01

Pos logit contributions
alore+1.971
rade+1.687
aints+1.653
liga+1.650
ints+1.631
Neg logit contributions
%%%%-1.771
IJ-1.745
Submit-1.709
 Hearing-1.589
NPR-1.585
Token/
Token ID14
Feature activation-6.61e-02

Pos logit contributions
%%%%+0.963
IJ+0.928
Submit+0.895
NPR+0.817
 Hearing+0.811
Neg logit contributions
alore-1.791
rade-1.579
liga-1.554
aints-1.549
ints-1.536
Token
Token ID198
Feature activation+5.17e-01

Pos logit contributions
alore+0.375
rade+0.328
liga+0.323
aints+0.322
ints+0.318
Neg logit contributions
%%%%-0.245
IJ-0.239
Submit-0.232
NPR-0.213
 Hearing-0.212
Token
Token ID198
Feature activation+3.00e-01

Pos logit contributions
%%%%+0.319
IJ+0.312
Submit+0.304
NPR+0.280
 Hearing+0.279
Neg logit contributions
alore-0.429
rade-0.371
liga-0.366
aints-0.364
ints-0.359
TokenRated
Token ID15322
Feature activation+5.80e-01

Pos logit contributions
%%%%+1.447
IJ+1.423
Submit+1.400
NPR+1.311
Psy+1.285
Neg logit contributions
alore-1.386
rade-1.159
liga-1.140
aints-1.133
ints-1.110
Token 5
Token ID642
Feature activation-6.60e-01

Pos logit contributions
%%%%+2.220
Submit+2.144
IJ+2.138
 Hearing+1.968
NPR+1.932
Neg logit contributions
alore-3.232
rade-2.799
liga-2.764
aints-2.710
ammy-2.678
Token out
Token ID503
Feature activation-5.47e-02

Pos logit contributions
alore+2.472
rade+2.056
aints+1.998
liga+1.980
ints+1.961
Neg logit contributions
%%%%-3.595
IJ-3.548
Submit-3.461
-3.313
 Hearing-3.296
Token of
Token ID286
Feature activation+1.25e-01

Pos logit contributions
alore+0.311
rade+0.275
liga+0.269
aints+0.268
ints+0.265
Neg logit contributions
%%%%-0.199
IJ-0.195
Submit-0.189
NPR-0.174
 Hearing-0.174
Token 5
Token ID642
Feature activation-1.39e+00

Pos logit contributions
%%%%+0.510
IJ+0.496
Submit+0.486
 Hearing+0.448
NPR+0.447
Neg logit contributions
alore-0.672
rade-0.580
liga-0.571
aints-0.569
ints-0.560
Token by
Token ID416
Feature activation-1.68e-01

Pos logit contributions
alore+4.665
rade+4.002
ints+3.744
aints+3.702
ary+3.624
Neg logit contributions
%%%%-7.511
IJ-7.252
��-7.218
Submit-7.206
CLASSIFIED-7.197
Token d
Token ID288
Feature activation-3.74e-01

Pos logit contributions
alore+0.888
rade+0.768
liga+0.754
aints+0.752
ints+0.741
Neg logit contributions
%%%%-0.687
IJ-0.671
Submit-0.655
NPR-0.605
 Hearing-0.602
Tokenanc
Token ID1192
Feature activation-7.82e-02

Pos logit contributions
alore+1.763
rade+1.494
liga+1.463
aints+1.459
ints+1.435
Neg logit contributions
%%%%-1.747
IJ-1.712
Submit-1.679
 Hearing-1.567
NPR-1.566
Tokenerg
Token ID6422
Feature activation-4.81e-01

Pos logit contributions
alore+0.373
rade+0.316
liga+0.311
aints+0.309
ints+0.305
Neg logit contributions
%%%%-0.361
IJ-0.354
Submit-0.345
NPR-0.323
 Hearing-0.322
Tokenur
Token ID333
Feature activation-5.41e-01

Pos logit contributions
alore+2.045
rade+1.710
aints+1.661
liga+1.659
ints+1.629
Neg logit contributions
%%%%-2.480
IJ-2.424
Submit-2.387
NPR-2.242
 Hearing-2.241
Token
Token ID247
Feature activation+1.90e-02

Pos logit contributions
alore+0.060
rade+0.054
liga+0.053
aints+0.053
ints+0.053
Neg logit contributions
%%%%-0.024
IJ-0.023
Submit-0.022
NPR-0.020
 Hearing-0.020
Tokent
Token ID83
Feature activation+3.85e-01

Pos logit contributions
%%%%+0.083
IJ+0.081
Submit+0.079
NPR+0.073
 Hearing+0.073
Neg logit contributions
alore-0.096
rade-0.082
liga-0.081
aints-0.080
ints-0.079
Token be
Token ID307
Feature activation+2.00e-01

Pos logit contributions
%%%%+1.500
IJ+1.467
Submit+1.433
NPR+1.314
 Hearing+1.313
Neg logit contributions
alore-2.097
rade-1.824
liga-1.797
aints-1.787
ints-1.761
Token easy
Token ID2562
Feature activation+2.10e-01

Pos logit contributions
%%%%+0.882
IJ+0.867
Submit+0.845
NPR+0.786
 Hearing+0.785
Neg logit contributions
alore-0.998
rade-0.852
aints-0.834
liga-0.833
ints-0.820
Token be
Token ID307
Feature activation-6.66e-02

Pos logit contributions
%%%%+0.859
IJ+0.841
Submit+0.819
 Hearing+0.758
NPR+0.757
Neg logit contributions
alore-1.117
rade-0.965
liga-0.948
aints-0.946
ints-0.933
Token­
Token ID3907
Feature activation-1.38e+00

Pos logit contributions
alore+0.336
rade+0.288
liga+0.283
aints+0.282
ints+0.278
Neg logit contributions
%%%%-0.289
IJ-0.283
Submit-0.276
NPR-0.257
 Hearing-0.256
Tokening
Token ID278
Feature activation-5.98e-01

Pos logit contributions
alore+6.166
rade+5.334
ints+5.179
liga+5.144
aints+5.086
Neg logit contributions
%%%%-6.481
IJ-6.401
Submit-6.302
-6.043
Psy-5.881
Token Jimmy
Token ID12963
Feature activation+1.23e-01

Pos logit contributions
alore+2.962
rade+2.562
liga+2.508
aints+2.500
ints+2.464
Neg logit contributions
%%%%-2.618
IJ-2.544
Submit-2.492
NPR-2.310
 Hearing-2.294
Token Carter
Token ID10831
Feature activation+2.33e-01

Pos logit contributions
%%%%+0.487
IJ+0.476
Submit+0.463
NPR+0.427
 Hearing+0.427
Neg logit contributions
alore-0.670
rade-0.580
liga-0.572
aints-0.569
ints-0.560
Token.
Token ID13
Feature activation+9.02e-02

Pos logit contributions
%%%%+0.994
IJ+0.975
Submit+0.948
 Hearing+0.885
NPR+0.883
Neg logit contributions
alore-1.198
rade-1.020
liga-1.011
aints-1.004
ints-0.987
Token Al
Token ID978
Feature activation+1.71e-01

Pos logit contributions
%%%%+0.322
IJ+0.314
Submit+0.305
NPR+0.278
 Hearing+0.278
Neg logit contributions
alore-0.525
rade-0.460
liga-0.453
aints-0.451
ints-0.446
Token Dixon
Token ID31178
Feature activation-2.85e-03

Pos logit contributions
alore+0.985
rade+0.855
liga+0.839
aints+0.837
ints+0.825
Neg logit contributions
%%%%-0.742
IJ-0.725
Submit-0.707
NPR-0.653
 Hearing-0.648
Token,
Token ID11
Feature activation+1.15e-01

Pos logit contributions
alore+0.014
rade+0.012
liga+0.011
aints+0.011
ints+0.011
Neg logit contributions
%%%%-0.013
IJ-0.013
Submit-0.012
NPR-0.012
 Hearing-0.012
Token Mozilla
Token ID29258
Feature activation-5.33e-01

Pos logit contributions
%%%%+0.484
IJ+0.474
Submit+0.463
NPR+0.428
 Hearing+0.426
Neg logit contributions
alore-0.597
rade-0.515
liga-0.505
aints-0.505
ints-0.497
Token's
Token ID338
Feature activation-7.50e-01

Pos logit contributions
alore+2.505
rade+2.135
aints+2.103
liga+2.081
ints+2.079
Neg logit contributions
%%%%-2.478
IJ-2.398
Submit-2.367
NPR-2.199
 Hearing-2.188
Token chief
Token ID4039
Feature activation-4.93e-01

Pos logit contributions
alore+3.214
rade+2.697
aints+2.659
ints+2.635
liga+2.606
Neg logit contributions
%%%%-3.755
IJ-3.642
Submit-3.602
-3.387
NPR-3.342
Token legal
Token ID2742
Feature activation-1.38e+00

Pos logit contributions
alore+2.315
rade+1.976
liga+1.947
aints+1.939
ints+1.904
Neg logit contributions
%%%%-2.283
IJ-2.230
Submit-2.186
NPR-2.043
-2.031
Token and
Token ID290
Feature activation+3.21e-02

Pos logit contributions
alore+5.494
rade+4.703
aints+4.622
liga+4.573
oat+4.458
Neg logit contributions
%%%%-7.004
Submit-6.667
IJ-6.651
-6.361
NPR-6.236
Token business
Token ID1597
Feature activation-1.97e-01

Pos logit contributions
%%%%+0.151
IJ+0.148
Submit+0.145
NPR+0.135
 Hearing+0.135
Neg logit contributions
alore-0.151
rade-0.128
liga-0.125
aints-0.124
ints-0.122
Token officer
Token ID3818
Feature activation-4.78e-01

Pos logit contributions
alore+0.884
rade+0.741
liga+0.725
aints+0.721
ints+0.710
Neg logit contributions
%%%%-0.968
IJ-0.953
Submit-0.930
NPR-0.873
 Hearing-0.873
Token,
Token ID11
Feature activation-6.18e-01

Pos logit contributions
alore+2.212
rade+1.877
liga+1.844
aints+1.829
ints+1.797
Neg logit contributions
%%%%-2.274
IJ-2.220
Submit-2.185
NPR-2.035
 Hearing-2.017
Token said
Token ID531
Feature activation-2.59e-01

Pos logit contributions
alore+2.956
rade+2.530
liga+2.462
aints+2.452
ints+2.419
Neg logit contributions
%%%%-2.861
IJ-2.764
Submit-2.725
NPR-2.527
 Hearing-2.497
Token a
Token ID257
Feature activation+6.94e-02

Pos logit contributions
alore+1.092
rade+0.934
aints+0.912
liga+0.910
ints+0.903
Neg logit contributions
%%%%-1.008
IJ-0.986
Submit-0.962
NPR-0.898
 Hearing-0.894
Token 1
Token ID352
Feature activation-1.34e-01

Pos logit contributions
%%%%+0.329
IJ+0.323
Submit+0.315
NPR+0.295
 Hearing+0.295
Neg logit contributions
alore-0.323
rade-0.273
liga-0.267
aints-0.266
ints-0.262
Token,
Token ID11
Feature activation+1.14e-01

Pos logit contributions
alore+0.655
rade+0.557
liga+0.547
aints+0.545
ints+0.536
Neg logit contributions
%%%%-0.603
IJ-0.591
Submit-0.577
NPR-0.538
 Hearing-0.537
Token000
Token ID830
Feature activation+4.69e-01

Pos logit contributions
%%%%+0.558
IJ+0.549
Submit+0.536
NPR+0.503
 Hearing+0.502
Neg logit contributions
alore-0.510
rade-0.427
liga-0.419
aints-0.417
ints-0.410
Token ft
Token ID10117
Feature activation+1.32e-01

Pos logit contributions
%%%%+2.339
IJ+2.285
Submit+2.253
NPR+2.109
 Hearing+2.096
Neg logit contributions
alore-2.071
rade-1.708
liga-1.705
aints-1.675
ints-1.638
Token incl
Token ID13358
Feature activation-1.36e+00

Pos logit contributions
%%%%+0.615
IJ+0.603
Submit+0.589
NPR+0.551
 Hearing+0.550
Neg logit contributions
alore-0.622
rade-0.527
liga-0.517
aints-0.515
ints-0.507
Tokenine
Token ID500
Feature activation-5.84e-01

Pos logit contributions
alore+3.705
rade+3.037
aints+2.823
oat+2.706
ints+2.691
Neg logit contributions
%%%%-8.737
IJ-8.532
Submit-8.328
-8.199
NPR-8.187
Token over
Token ID625
Feature activation+1.89e-01

Pos logit contributions
alore+2.730
rade+2.358
aints+2.281
ints+2.249
liga+2.248
Neg logit contributions
%%%%-2.698
IJ-2.636
Submit-2.581
NPR-2.415
-2.397
Token 1
Token ID352
Feature activation-4.63e-01

Pos logit contributions
%%%%+0.804
IJ+0.789
Submit+0.768
 Hearing+0.711
NPR+0.710
Neg logit contributions
alore-0.968
rade-0.831
liga-0.819
aints-0.812
ints-0.801
Token.
Token ID13
Feature activation+2.04e-01

Pos logit contributions
alore+2.148
rade+1.835
liga+1.787
aints+1.772
ints+1.752
Neg logit contributions
%%%%-2.177
IJ-2.136
Submit-2.091
NPR-1.967
 Hearing-1.959
Token3
Token ID18
Feature activation+5.32e-01

Pos logit contributions
%%%%+0.782
IJ+0.763
Submit+0.742
NPR+0.683
 Hearing+0.681
Neg logit contributions
alore-1.133
rade-0.985
liga-0.970
aints-0.966
ints-0.954
Token
Token ID198
Feature activation+3.79e-01

Pos logit contributions
%%%%+1.260
IJ+1.229
Submit+1.202
NPR+1.099
 Hearing+1.090
Neg logit contributions
alore-1.852
rade-1.593
liga-1.589
aints-1.570
ints-1.550
TokenRated
Token ID15322
Feature activation+6.89e-01

Pos logit contributions
%%%%+1.808
IJ+1.781
Submit+1.757
NPR+1.641
Psy+1.606
Neg logit contributions
alore-1.784
rade-1.489
liga-1.472
aints-1.457
ints-1.431
Token 5
Token ID642
Feature activation-6.09e-01

Pos logit contributions
%%%%+2.586
Submit+2.506
IJ+2.483
 Hearing+2.298
NPR+2.241
Neg logit contributions
alore-3.878
rade-3.365
liga-3.325
aints-3.250
ammy-3.218
Token out
Token ID503
Feature activation-2.31e-02

Pos logit contributions
alore+2.300
rade+1.913
aints+1.859
liga+1.845
ints+1.825
Neg logit contributions
%%%%-3.305
IJ-3.260
Submit-3.181
-3.038
 Hearing-3.029
Token of
Token ID286
Feature activation+1.36e-01

Pos logit contributions
alore+0.132
rade+0.117
liga+0.114
aints+0.114
ints+0.113
Neg logit contributions
%%%%-0.084
IJ-0.082
Submit-0.079
NPR-0.073
 Hearing-0.073
Token 5
Token ID642
Feature activation-1.36e+00

Pos logit contributions
%%%%+0.548
IJ+0.533
Submit+0.523
 Hearing+0.481
NPR+0.480
Neg logit contributions
alore-0.732
rade-0.633
liga-0.623
aints-0.620
ints-0.610
Token by
Token ID416
Feature activation-1.70e-01

Pos logit contributions
alore+4.592
rade+3.935
ints+3.684
aints+3.643
liga+3.564
Neg logit contributions
%%%%-7.342
IJ-7.107
Submit-7.059
��-7.054
CLASSIFIED-7.012
Token Quake
Token ID42901
Feature activation-3.78e-01

Pos logit contributions
alore+0.904
rade+0.782
liga+0.768
aints+0.766
ints+0.755
Neg logit contributions
%%%%-0.694
IJ-0.678
Submit-0.661
NPR-0.611
 Hearing-0.608
TokenD
Token ID35
Feature activation-1.55e-01

Pos logit contributions
alore+1.990
rade+1.719
aints+1.682
liga+1.682
ints+1.659
Neg logit contributions
%%%%-1.567
IJ-1.519
Submit-1.482
 Hearing-1.375
NPR-1.373
Tokenude
Token ID2507
Feature activation-7.90e-02

Pos logit contributions
alore+0.746
rade+0.635
liga+0.622
aints+0.620
ints+0.610
Neg logit contributions
%%%%-0.711
IJ-0.697
Submit-0.681
NPR-0.636
 Hearing-0.634
Token from
Token ID422
Feature activation+4.19e-02

Pos logit contributions
alore+0.416
rade+0.359
liga+0.353
aints+0.352
ints+0.347
Neg logit contributions
%%%%-0.326
IJ-0.319
Submit-0.311
NPR-0.288
 Hearing-0.287
Token and
Token ID290
Feature activation-5.45e-02

Pos logit contributions
alore+1.321
rade+1.125
liga+1.101
aints+1.099
ints+1.082
Neg logit contributions
%%%%-1.262
IJ-1.237
Submit-1.214
NPR-1.134
 Hearing-1.127
Token 16
Token ID1467
Feature activation-1.09e-01

Pos logit contributions
alore+0.251
rade+0.212
liga+0.208
aints+0.207
ints+0.204
Neg logit contributions
%%%%-0.260
IJ-0.255
Submit-0.249
 Hearing-0.233
NPR-0.233
Token-
Token ID12
Feature activation-1.33e-01

Pos logit contributions
alore+0.645
rade+0.569
liga+0.559
aints+0.558
ints+0.552
Neg logit contributions
%%%%-0.377
IJ-0.368
Submit-0.357
NPR-0.326
 Hearing-0.325
Token35
Token ID2327
Feature activation+1.54e-01

Pos logit contributions
alore+0.701
rade+0.606
liga+0.595
aints+0.594
ints+0.586
Neg logit contributions
%%%%-0.549
IJ-0.537
Submit-0.524
NPR-0.485
 Hearing-0.484
Token f
Token ID277
Feature activation-2.80e-01

Pos logit contributions
%%%%+0.694
IJ+0.680
Submit+0.661
 Hearing+0.618
NPR+0.616
Neg logit contributions
alore-0.752
rade-0.642
liga-0.630
aints-0.626
ints-0.616
Token/
Token ID14
Feature activation-1.36e+00

Pos logit contributions
alore+1.423
rade+1.228
aints+1.205
liga+1.198
ints+1.186
Neg logit contributions
%%%%-1.196
IJ-1.176
Submit-1.146
NPR-1.067
 Hearing-1.066
Token2
Token ID17
Feature activation-6.26e-01

Pos logit contributions
alore+6.115
rade+5.097
ints+5.028
aints+5.020
liga+4.914
Neg logit contributions
%%%%-6.316
IJ-6.161
Submit-6.086
NPR-5.711
 Hearing-5.708
Token.
Token ID13
Feature activation+7.81e-02

Pos logit contributions
alore+2.728
rade+2.282
aints+2.230
ints+2.213
liga+2.210
Neg logit contributions
%%%%-3.108
IJ-3.024
Submit-3.008
NPR-2.828
-2.803
Token8
Token ID23
Feature activation-2.38e-02

Pos logit contributions
%%%%+0.298
IJ+0.291
Submit+0.283
 Hearing+0.259
NPR+0.259
Neg logit contributions
alore-0.438
rade-0.381
liga-0.376
aints-0.373
ints-0.368
Token II
Token ID2873
Feature activation+7.54e-03

Pos logit contributions
alore+0.131
rade+0.114
liga+0.112
aints+0.111
ints+0.110
Neg logit contributions
%%%%-0.093
IJ-0.091
Submit-0.088
NPR-0.081
 Hearing-0.081
Token,
Token ID11
Feature activation+1.97e-02

Pos logit contributions
%%%%+0.034
IJ+0.034
Submit+0.033
NPR+0.031
 Hearing+0.031
Neg logit contributions
alore-0.037
rade-0.031
liga-0.031
aints-0.030
ints-0.030
Token en
Token ID551
Feature activation+1.46e-01

Pos logit contributions
alore+1.154
rade+0.995
liga+0.977
aints+0.970
ints+0.959
Neg logit contributions
%%%%-0.971
IJ-0.951
Submit-0.924
NPR-0.863
 Hearing-0.859
Token inter
Token ID987
Feature activation-1.52e-02

Pos logit contributions
%%%%+0.662
IJ+0.649
Submit+0.635
NPR+0.591
 Hearing+0.591
Neg logit contributions
alore-0.713
rade-0.607
liga-0.595
aints-0.593
ints-0.584
Tokendis
Token ID6381
Feature activation+6.90e-02

Pos logit contributions
alore+0.073
rade+0.062
liga+0.060
aints+0.060
ints+0.059
Neg logit contributions
%%%%-0.071
IJ-0.069
Submit-0.068
NPR-0.063
 Hearing-0.063
Tokenant
Token ID415
Feature activation-2.48e-01

Pos logit contributions
%%%%+0.315
IJ+0.309
Submit+0.302
NPR+0.282
 Hearing+0.281
Neg logit contributions
alore-0.333
rade-0.284
liga-0.279
aints-0.278
ints-0.274
Token l
Token ID300
Feature activation-1.08e-01

Pos logit contributions
alore+1.249
rade+1.075
liga+1.054
aints+1.049
ints+1.035
Neg logit contributions
%%%%-1.082
IJ-1.060
Submit-1.031
NPR-0.964
 Hearing-0.958
Token'
Token ID6
Feature activation-1.35e+00

Pos logit contributions
alore+0.582
rade+0.505
liga+0.496
aints+0.495
ints+0.489
Neg logit contributions
%%%%-0.430
IJ-0.420
Submit-0.409
NPR-0.378
 Hearing-0.377
Tokenag
Token ID363
Feature activation-5.44e-01

Pos logit contributions
alore+6.028
rade+5.155
aints+5.068
ints+5.024
liga+5.009
Neg logit contributions
%%%%-6.228
IJ-6.036
Submit-5.947
 Hearing-5.702
-5.683
Tokenression
Token ID2234
Feature activation-3.32e-01

Pos logit contributions
alore+2.586
rade+2.254
liga+2.172
aints+2.154
ints+2.127
Neg logit contributions
%%%%-2.532
IJ-2.486
Submit-2.402
-2.259
 Hearing-2.250
Token de
Token ID390
Feature activation-1.87e-01

Pos logit contributions
alore+1.755
rade+1.527
liga+1.498
aints+1.493
ints+1.474
Neg logit contributions
%%%%-1.357
IJ-1.327
Submit-1.291
NPR-1.199
 Hearing-1.193
Token t
Token ID256
Feature activation-2.72e-01

Pos logit contributions
alore+0.935
rade+0.806
liga+0.790
aints+0.785
ints+0.777
Neg logit contributions
%%%%-0.814
IJ-0.798
Submit-0.776
NPR-0.725
 Hearing-0.719
Tokenout
Token ID448
Feature activation-1.06e-01

Pos logit contributions
alore+0.998
rade+0.810
aints+0.790
liga+0.780
ints+0.770
Neg logit contributions
%%%%-1.553
IJ-1.524
Submit-1.501
 Hearing-1.419
NPR-1.416
Token Powell
Token ID19675
Feature activation-5.14e-01

Pos logit contributions
alore+1.077
rade+0.928
liga+0.909
aints+0.906
ints+0.894
Neg logit contributions
%%%%-0.912
IJ-0.894
Submit-0.872
NPR-0.810
 Hearing-0.806
Token:
Token ID25
Feature activation-1.41e-01

Pos logit contributions
alore+2.516
rade+2.163
liga+2.126
aints+2.101
ints+2.067
Neg logit contributions
%%%%-2.278
IJ-2.227
Submit-2.175
NPR-2.033
Psy-2.020
Token
Token ID198
Feature activation+2.86e-01

Pos logit contributions
alore+0.758
rade+0.659
aints+0.647
liga+0.646
ints+0.636
Neg logit contributions
%%%%-0.565
IJ-0.551
Submit-0.537
NPR-0.497
 Hearing-0.491
Token
Token ID198
Feature activation+2.75e-01

Pos logit contributions
%%%%+1.110
IJ+1.083
Submit+1.055
NPR+0.970
 Hearing+0.963
Neg logit contributions
alore-1.594
rade-1.374
liga-1.364
aints-1.352
ints-1.333
TokenNew
Token ID3791
Feature activation-4.02e-01

Pos logit contributions
%%%%+1.002
IJ+0.979
Submit+0.955
NPR+0.871
Psy+0.850
Neg logit contributions
alore-1.602
rade-1.388
liga-1.375
aints-1.364
ints-1.348
Token England
Token ID4492
Feature activation-1.33e+00

Pos logit contributions
alore+1.558
rade+1.273
aints+1.245
liga+1.244
ints+1.216
Neg logit contributions
%%%%-2.207
IJ-2.168
Submit-2.127
NPR-2.018
 Hearing-2.002
Token Patriots
Token ID13104
Feature activation-6.57e-01

Pos logit contributions
alore+4.462
aints+3.804
rade+3.743
liga+3.597
ints+3.590
Neg logit contributions
%%%%-7.515
IJ-7.277
Submit-7.277
NPR-6.943
-6.835
Token:
Token ID25
Feature activation+8.20e-02

Pos logit contributions
alore+2.954
rade+2.525
aints+2.482
liga+2.461
ints+2.406
Neg logit contributions
%%%%-3.132
IJ-3.072
Submit-3.020
NPR-2.842
Psy-2.810
Token QB
Token ID16135
Feature activation-3.96e-01

Pos logit contributions
%%%%+0.370
IJ+0.363
Submit+0.354
NPR+0.330
 Hearing+0.329
Neg logit contributions
alore-0.400
rade-0.342
liga-0.335
aints-0.334
ints-0.329
Token Tom
Token ID4186
Feature activation-1.24e-01

Pos logit contributions
alore+1.843
rade+1.564
liga+1.532
aints+1.531
ints+1.500
Neg logit contributions
%%%%-1.864
IJ-1.825
Submit-1.787
NPR-1.674
 Hearing-1.658
Token Brady
Token ID15260
Feature activation-6.63e-01

Pos logit contributions
alore+0.558
rade+0.468
liga+0.461
aints+0.460
ints+0.450
Neg logit contributions
%%%%-0.608
IJ-0.595
Submit-0.584
NPR-0.549
Psy-0.542
Token=
Token ID28
Feature activation-4.95e-01

Pos logit contributions
alore+1.315
rade+1.164
liga+1.131
aints+1.130
ints+1.119
Neg logit contributions
%%%%-0.874
IJ-0.861
Submit-0.832
 Hearing-0.780
NPR-0.772
TokenTrue
Token ID17821
Feature activation-5.45e-01

Pos logit contributions
alore+2.229
rade+1.928
aints+1.876
ints+1.865
liga+1.847
Neg logit contributions
%%%%-2.306
IJ-2.270
Submit-2.172
 Hearing-2.101
-2.068
Token,
Token ID11
Feature activation-1.98e-01

Pos logit contributions
alore+2.385
rade+2.051
aints+1.958
liga+1.948
ints+1.942
Neg logit contributions
%%%%-2.639
IJ-2.611
Submit-2.526
 Hearing-2.408
-2.407
Token help
Token ID1037
Feature activation+2.79e-02

Pos logit contributions
alore+0.935
rade+0.793
liga+0.776
aints+0.776
ints+0.766
Neg logit contributions
%%%%-0.913
IJ-0.901
Submit-0.880
 Hearing-0.823
NPR-0.822
Token="
Token ID2625
Feature activation-8.07e-01

Pos logit contributions
%%%%+0.093
IJ+0.090
Submit+0.087
 Hearing+0.079
NPR+0.079
Neg logit contributions
alore-0.168
rade-0.149
liga-0.146
aints-0.146
ints-0.145
Tokenpath
Token ID6978
Feature activation-1.32e+00

Pos logit contributions
alore+3.338
rade+2.812
aints+2.740
liga+2.724
ints+2.713
Neg logit contributions
IJ-4.086
%%%%-4.080
Submit-3.945
 Hearing-3.836
-3.798
Token to
Token ID284
Feature activation-3.79e-01

Pos logit contributions
alore+5.626
rade+4.907
liga+4.665
aints+4.618
ints+4.546
Neg logit contributions
IJ-6.158
%%%%-6.076
Submit-5.905
CLASSIFIED-5.811
-5.773
Token images
Token ID4263
Feature activation-1.29e+00

Pos logit contributions
alore+1.543
rade+1.263
aints+1.247
ints+1.246
liga+1.241
Neg logit contributions
%%%%-1.970
IJ-1.936
Submit-1.917
-1.815
NPR-1.812
Token directory
Token ID8619
Feature activation-6.53e-01

Pos logit contributions
alore+5.093
liga+4.324
rade+4.318
aints+4.217
ints+4.111
Neg logit contributions
%%%%-6.409
IJ-6.347
Submit-6.259
NPR-5.915
CLASSIFIED-5.911
Token")
Token ID4943
Feature activation+9.02e-02

Pos logit contributions
alore+2.960
rade+2.555
liga+2.481
aints+2.481
ints+2.440
Neg logit contributions
%%%%-3.039
IJ-2.992
Submit-2.910
 Hearing-2.757
NPR-2.749
Token args
Token ID26498
Feature activation-6.31e-01

Pos logit contributions
%%%%+0.421
IJ+0.411
Submit+0.400
NPR+0.376
 Hearing+0.374
Neg logit contributions
alore-0.428
rade-0.365
liga-0.357
aints-0.355
ammy-0.347
Token (
Token ID357
Feature activation-8.49e-01

Pos logit contributions
alore+1.247
rade+1.083
aints+1.064
liga+1.062
ints+1.049
Neg logit contributions
%%%%-0.879
IJ-0.859
Submit-0.838
NPR-0.772
 Hearing-0.771
Token720
Token ID23906
Feature activation-1.81e-01

Pos logit contributions
alore+4.191
rade+3.603
aints+3.553
liga+3.542
ints+3.490
Neg logit contributions
%%%%-3.633
IJ-3.541
Submit-3.477
 Hearing-3.284
NPR-3.205
Token,
Token ID11
Feature activation-3.91e-01

Pos logit contributions
alore+0.858
rade+0.726
liga+0.711
aints+0.710
ints+0.699
Neg logit contributions
%%%%-0.836
IJ-0.822
Submit-0.806
NPR-0.752
 Hearing-0.748
Token000
Token ID830
Feature activation-9.85e-02

Pos logit contributions
alore+1.636
rade+1.357
liga+1.330
aints+1.327
ints+1.303
Neg logit contributions
%%%%-2.017
IJ-1.971
Submit-1.953
 Hearing-1.828
NPR-1.825
Token km
Token ID10571
Feature activation-2.98e-01

Pos logit contributions
alore+0.565
rade+0.494
liga+0.486
aints+0.484
ints+0.479
Neg logit contributions
%%%%-0.358
IJ-0.350
Submit-0.341
NPR-0.311
 Hearing-0.310
Token/
Token ID14
Feature activation-1.32e+00

Pos logit contributions
alore+1.513
rade+1.300
aints+1.276
liga+1.274
ints+1.257
Neg logit contributions
%%%%-1.275
IJ-1.251
Submit-1.226
NPR-1.133
 Hearing-1.132
Tokenh
Token ID71
Feature activation-1.17e-01

Pos logit contributions
alore+5.456
rade+4.708
ints+4.462
liga+4.461
aints+4.457
Neg logit contributions
%%%%-6.494
IJ-6.385
Submit-6.297
-6.091
 Hearing-6.059
Token).
Token ID737
Feature activation-2.17e-01

Pos logit contributions
alore+0.618
rade+0.534
liga+0.524
aints+0.523
ints+0.516
Neg logit contributions
%%%%-0.479
IJ-0.469
Submit-0.457
NPR-0.422
 Hearing-0.421
Token Even
Token ID3412
Feature activation+2.22e-01

Pos logit contributions
alore+1.177
rade+1.039
aints+1.016
liga+1.009
ints+0.996
Neg logit contributions
%%%%-0.841
IJ-0.817
Submit-0.799
NPR-0.728
-0.725
Token at
Token ID379
Feature activation+1.44e-01

Pos logit contributions
%%%%+0.924
IJ+0.906
Submit+0.882
NPR+0.819
 Hearing+0.818
Neg logit contributions
alore-1.159
rade-0.998
liga-0.982
aints-0.977
ints-0.963
Token this
Token ID428
Feature activation+2.21e-01

Pos logit contributions
%%%%+0.592
IJ+0.580
Submit+0.564
NPR+0.522
 Hearing+0.522
Neg logit contributions
alore-0.757
rade-0.653
liga-0.643
aints-0.640
ints-0.630
Token month
Token ID1227
Feature activation+7.61e-01

Pos logit contributions
%%%%+0.721
IJ+0.703
Submit+0.688
NPR+0.639
 Hearing+0.637
Neg logit contributions
alore-0.948
rade-0.816
liga-0.805
aints-0.800
ints-0.792
Token,
Token ID11
Feature activation+4.97e-01

Pos logit contributions
%%%%+3.136
IJ+3.094
Submit+3.004
 Hearing+2.843
NPR+2.818
Neg logit contributions
alore-3.955
rade-3.388
liga-3.347
aints-3.274
ints-3.262
Token a
Token ID257
Feature activation+1.98e-01

Pos logit contributions
%%%%+1.992
IJ+1.945
Submit+1.882
 Hearing+1.776
NPR+1.756
Neg logit contributions
alore-2.679
rade-2.314
liga-2.283
aints-2.249
ints-2.239
Token Center
Token ID3337
Feature activation-2.10e-01

Pos logit contributions
%%%%+0.851
IJ+0.834
Submit+0.813
NPR+0.756
 Hearing+0.756
Neg logit contributions
alore-1.008
rade-0.866
liga-0.850
aints-0.846
ints-0.835
Token for
Token ID329
Feature activation-6.16e-01

Pos logit contributions
alore+1.098
rade+0.952
liga+0.931
aints+0.929
ints+0.913
Neg logit contributions
%%%%-0.875
IJ-0.854
Submit-0.829
NPR-0.764
 Hearing-0.760
Token Public
Token ID5094
Feature activation-1.32e+00

Pos logit contributions
alore+3.501
rade+3.121
aints+3.121
liga+3.071
ints+3.039
Neg logit contributions
%%%%-2.191
IJ-2.086
Submit-2.041
Psy-1.874
-1.873
Token Integrity
Token ID39348
Feature activation-5.68e-01

Pos logit contributions
alore+6.879
aints+6.102
rade+6.052
liga+5.847
ints+5.727
Neg logit contributions
%%%%-5.183
Submit-4.828
IJ-4.790
-4.571
NPR-4.387
Token investigation
Token ID3645
Feature activation-1.95e-01

Pos logit contributions
alore+2.592
rade+2.223
aints+2.203
liga+2.173
oat+2.124
Neg logit contributions
%%%%-2.642
IJ-2.516
Submit-2.500
-2.374
Psy-2.317
Token found
Token ID1043
Feature activation+2.73e-01

Pos logit contributions
alore+0.972
rade+0.834
liga+0.820
aints+0.818
ints+0.801
Neg logit contributions
%%%%-0.861
IJ-0.839
Submit-0.823
NPR-0.760
 Hearing-0.759
Token that
Token ID326
Feature activation+1.58e-01

Pos logit contributions
%%%%+1.092
IJ+1.068
Submit+1.038
 Hearing+0.958
NPR+0.958
Neg logit contributions
alore-1.477
rade-1.281
liga-1.264
aints-1.244
ints-1.237
Token,
Token ID11
Feature activation+2.23e-02

Pos logit contributions
%%%%+0.653
IJ+0.638
Submit+0.622
 Hearing+0.577
NPR+0.576
Neg logit contributions
alore-0.832
rade-0.718
liga-0.706
aints-0.701
ints-0.693
Token.
Token ID13
Feature activation-8.05e-03

Pos logit contributions
alore+0.397
rade+0.340
liga+0.333
aints+0.332
ints+0.327
Neg logit contributions
%%%%-0.351
IJ-0.344
Submit-0.335
 Hearing-0.312
NPR-0.312
Token -
Token ID532
Feature activation+6.80e-02

Pos logit contributions
alore+0.046
rade+0.041
aints+0.040
liga+0.040
ints+0.039
Neg logit contributions
%%%%-0.029
IJ-0.029
Submit-0.028
NPR-0.025
 Hearing-0.025
Token December
Token ID3426
Feature activation-1.55e-01

Pos logit contributions
%%%%+0.305
IJ+0.300
Submit+0.293
NPR+0.273
 Hearing+0.273
Neg logit contributions
alore-0.333
rade-0.284
liga-0.279
aints-0.278
ints-0.273
Token 13
Token ID1511
Feature activation-4.05e-01

Pos logit contributions
alore+0.769
rade+0.672
liga+0.655
aints+0.650
ints+0.647
Neg logit contributions
%%%%-0.673
IJ-0.652
Submit-0.639
-0.596
NPR-0.590
Token,
Token ID11
Feature activation-7.12e-01

Pos logit contributions
alore+1.778
rade+1.529
aints+1.467
ints+1.460
liga+1.455
Neg logit contributions
%%%%-1.954
IJ-1.921
Submit-1.878
 Hearing-1.789
-1.775
Token 2010
Token ID3050
Feature activation-1.31e+00

Pos logit contributions
alore+2.399
rade+2.005
ints+1.917
aints+1.872
liga+1.850
Neg logit contributions
%%%%-4.049
IJ-3.994
Submit-3.933
-3.761
��-3.722
Tokenboys
Token ID13202
Feature activation-8.28e-01

Pos logit contributions
alore+3.189
rade+2.452
ints+2.412
aints+2.375
liga+2.197
Neg logit contributions
%%%%-8.618
Submit-8.467
IJ-8.464
 Hearing-8.397
-8.161
Token beware
Token ID40600
Feature activation-6.06e-01

Pos logit contributions
alore+2.616
rade+2.011
aints+1.991
liga+1.988
ints+1.936
Neg logit contributions
IJ-5.011
%%%%-4.964
Submit-4.961
-4.703
Psy-4.689
Token
Token ID198
Feature activation-5.41e-01

Pos logit contributions
alore+2.997
rade+2.609
aints+2.560
liga+2.537
ints+2.523
Neg logit contributions
%%%%-2.570
IJ-2.505
Submit-2.448
-2.274
NPR-2.245
Token
Token ID198
Feature activation-3.03e-01

Pos logit contributions
alore+2.582
rade+2.269
aints+2.198
ints+2.153
liga+2.150
Neg logit contributions
%%%%-2.375
IJ-2.339
Submit-2.269
-2.176
��-2.155
TokenReviewer
Token ID35407
Feature activation+6.56e-01

Pos logit contributions
alore+1.904
rade+1.700
aints+1.671
liga+1.655
ints+1.653
Neg logit contributions
%%%%-0.915
IJ-0.889
Submit-0.848
 Hearing-0.784
NPR-0.760
Token report
Token ID989
Feature activation-2.14e-01

Pos logit contributions
%%%%+0.095
IJ+0.093
Submit+0.090
NPR+0.084
 Hearing+0.084
Neg logit contributions
alore-0.117
rade-0.101
liga-0.099
aints-0.099
ints-0.097
Token,
Token ID11
Feature activation-4.00e-02

Pos logit contributions
alore+1.050
rade+0.895
liga+0.879
aints+0.875
ints+0.862
Neg logit contributions
%%%%-0.961
IJ-0.943
Submit-0.919
NPR-0.859
 Hearing-0.856
Token
Token ID564
Feature activation-8.76e-02

Pos logit contributions
alore+0.227
rade+0.198
liga+0.195
aints+0.194
ints+0.192
Neg logit contributions
%%%%-0.148
IJ-0.145
Submit-0.141
NPR-0.129
 Hearing-0.128
Token
Token ID250
Feature activation-2.33e-01

Pos logit contributions
alore+0.619
rade+0.556
liga+0.549
aints+0.548
ints+0.543
Neg logit contributions
%%%%-0.203
IJ-0.195
Submit-0.186
NPR-0.161
 Hearing-0.160
TokenLead
Token ID20451
Feature activation-4.31e-01

Pos logit contributions
alore+1.296
rade+1.131
liga+1.112
aints+1.109
ints+1.094
Neg logit contributions
%%%%-0.887
IJ-0.865
Submit-0.842
NPR-0.772
 Hearing-0.769
Tokening
Token ID278
Feature activation-1.30e+00

Pos logit contributions
alore+1.976
rade+1.685
liga+1.652
aints+1.650
ints+1.615
Neg logit contributions
%%%%-2.034
IJ-1.999
Submit-1.953
NPR-1.832
Psy-1.816
Token Beyond
Token ID12197
Feature activation-6.07e-01

Pos logit contributions
alore+6.688
rade+5.873
liga+5.798
aints+5.794
ints+5.701
Neg logit contributions
%%%%-5.058
IJ-4.915
Submit-4.810
-4.455
Psy-4.438
Token Limits
Token ID44943
Feature activation+7.28e-02

Pos logit contributions
alore+3.268
rade+2.857
liga+2.802
aints+2.799
ints+2.763
Neg logit contributions
%%%%-2.404
IJ-2.336
Submit-2.267
NPR-2.091
-2.087
Token:
Token ID25
Feature activation-7.38e-01

Pos logit contributions
%%%%+0.315
IJ+0.309
Submit+0.300
 Hearing+0.281
NPR+0.280
Neg logit contributions
alore-0.370
rade-0.317
liga-0.311
aints-0.309
ints-0.306
Token May
Token ID1737
Feature activation-2.05e-01

Pos logit contributions
alore+4.003
rade+3.518
aints+3.451
liga+3.441
ints+3.408
Neg logit contributions
%%%%-2.800
IJ-2.745
Submit-2.684
-2.447
NPR-2.443
Tokenoral
Token ID6864
Feature activation-8.62e-02

Pos logit contributions
alore+0.803
rade+0.658
liga+0.639
aints+0.637
ints+0.625
Neg logit contributions
%%%%-1.121
IJ-1.105
Submit-1.081
NPR-1.022
 Hearing-1.016
TokenFIN
Token ID20032
Feature activation+1.80e-01

Pos logit contributions
%%%%+0.970
IJ+0.946
Submit+0.919
NPR+0.843
 Hearing+0.841
Neg logit contributions
alore-1.480
rade-1.293
liga-1.272
aints-1.268
ints-1.253
Token DE
Token ID5550
Feature activation+3.96e-01

Pos logit contributions
%%%%+0.644
IJ+0.627
Submit+0.609
NPR+0.557
 Hearing+0.555
Neg logit contributions
alore-1.044
rade-0.915
liga-0.900
aints-0.898
ints-0.887
Token TRAN
Token ID48213
Feature activation-2.04e-02

Pos logit contributions
%%%%+1.633
IJ+1.620
Submit+1.562
NPR+1.450
 Hearing+1.447
Neg logit contributions
alore-2.090
rade-1.788
liga-1.764
aints-1.763
ints-1.726
TokenSM
Token ID12310
Feature activation+3.29e-01

Pos logit contributions
alore+0.109
rade+0.095
liga+0.092
aints+0.092
ints+0.091
Neg logit contributions
%%%%-0.083
IJ-0.080
Submit-0.079
 Hearing-0.073
NPR-0.072
TokenISSION
Token ID40373
Feature activation+3.07e-01

Pos logit contributions
%%%%+0.723
IJ+0.695
Submit+0.660
NPR+0.564
 Hearing+0.561
Neg logit contributions
alore-2.365
rade-2.129
liga-2.103
aints-2.097
ints-2.077
Token ///
Token ID34013
Feature activation+1.01e+00

Pos logit contributions
%%%%+1.146
IJ+1.121
Submit+1.089
 Hearing+0.994
NPR+0.994
Neg logit contributions
alore-1.744
rade-1.516
liga-1.498
aints-1.494
ints-1.473
Token<|endoftext|>
Token ID50256
Feature activation-4.32e-01

Pos logit contributions
%%%%+2.418
IJ+2.269
Submit+2.233
NPR+1.875
 Hearing+1.871
Neg logit contributions
alore-7.038
rade-6.268
liga-6.200
aints-6.191
oat-6.104
TokenOur
Token ID5122
Feature activation-4.21e-02

Pos logit contributions
alore+2.495
rade+2.196
liga+2.157
aints+2.155
ints+2.131
Neg logit contributions
%%%%-1.528
IJ-1.485
Submit-1.434
 Hearing-1.336
NPR-1.311
Token ELE
Token ID40342
Feature activation+5.56e-01

Pos logit contributions
alore+0.213
rade+0.182
liga+0.179
aints+0.178
ints+0.176
Neg logit contributions
%%%%-0.183
IJ-0.179
Submit-0.175
NPR-0.163
 Hearing-0.162
TokenX
Token ID55
Feature activation+5.37e-01

Pos logit contributions
IJ+2.026
%%%%+2.005
Submit+1.952
NPR+1.775
 Hearing+1.771
Neg logit contributions
alore-3.174
rade-2.772
liga-2.725
aints-2.708
ammy-2.673
Token Woo
Token ID39832
Feature activation+2.04e-01

Pos logit contributions
%%%%+2.223
IJ+2.222
Submit+2.148
 Hearing+2.002
NPR+1.980
Neg logit contributions
alore-2.780
rade-2.374
liga-2.348
aints-2.334
ints-2.307
Token
Token ID198
Feature activation-4.03e-01

Pos logit contributions
alore+1.789
rade+1.562
aints+1.526
liga+1.522
ints+1.496
Neg logit contributions
%%%%-1.411
IJ-1.380
Submit-1.346
NPR-1.230
 Hearing-1.226
Token
Token ID198
Feature activation-3.56e-01

Pos logit contributions
alore+1.999
rade+1.748
aints+1.694
liga+1.684
ints+1.671
Neg logit contributions
%%%%-1.721
IJ-1.691
Submit-1.643
-1.541
 Hearing-1.537
Token
Token ID447
Feature activation+1.99e-02

Pos logit contributions
alore+1.922
rade+1.689
aints+1.647
liga+1.646
ints+1.629
Neg logit contributions
%%%%-1.376
IJ-1.345
Submit-1.304
 Hearing-1.220
NPR-1.188
Token
Token ID250
Feature activation-2.05e-01

Pos logit contributions
%%%%+0.047
IJ+0.045
Submit+0.043
NPR+0.037
 Hearing+0.037
Neg logit contributions
alore-0.140
rade-0.126
liga-0.124
aints-0.124
ints-0.123
TokenAt
Token ID2953
Feature activation+1.71e-01

Pos logit contributions
alore+1.167
rade+1.024
liga+1.004
aints+1.003
ints+0.990
Neg logit contributions
%%%%-0.753
IJ-0.734
Submit-0.713
 Hearing-0.654
NPR-0.653
Token the
Token ID262
Feature activation+9.38e-01

Pos logit contributions
%%%%+0.710
IJ+0.694
Submit+0.676
 Hearing+0.631
NPR+0.630
Neg logit contributions
alore-0.901
rade-0.767
liga-0.762
aints-0.755
ints-0.747
Token tender
Token ID15403
Feature activation+5.92e-01

Pos logit contributions
%%%%+3.729
IJ+3.689
Submit+3.571
 Hearing+3.452
NPR+3.391
Neg logit contributions
alore-4.992
rade-4.240
liga-4.225
aints-4.166
ints-4.116
Token age
Token ID2479
Feature activation+2.36e-01

Pos logit contributions
%%%%+3.193
IJ+3.088
Submit+3.023
 Hearing+2.932
NPR+2.913
Neg logit contributions
alore-2.379
rade-1.873
liga-1.856
aints-1.837
ints-1.824
Token of
Token ID286
Feature activation+2.11e-01

Pos logit contributions
%%%%+0.798
IJ+0.773
Submit+0.750
 Hearing+0.685
NPR+0.685
Neg logit contributions
alore-1.417
rade-1.240
liga-1.224
aints-1.221
ints-1.206
Token 69
Token ID8644
Feature activation+1.96e-01

Pos logit contributions
%%%%+0.835
IJ+0.817
Submit+0.799
 Hearing+0.738
NPR+0.734
Neg logit contributions
alore-1.149
rade-0.993
liga-0.982
aints-0.977
ints-0.961
Token,
Token ID11
Feature activation+1.74e-01

Pos logit contributions
%%%%+0.860
IJ+0.842
Submit+0.821
NPR+0.766
 Hearing+0.765
Neg logit contributions
alore-0.993
rade-0.838
liga-0.832
aints-0.827
ints-0.811
Token the
Token ID262
Feature activation+1.14e+00

Pos logit contributions
%%%%+3.023
IJ+2.926
Submit+2.823
 Hearing+2.566
NPR+2.560
Neg logit contributions
alore-4.634
rade-4.006
liga-3.994
aints-3.971
ammy-3.934
Token commands
Token ID9729
Feature activation+1.39e+00

Pos logit contributions
%%%%+5.492
IJ+5.294
Submit+5.097
NPR+4.797
 Hearing+4.754
Neg logit contributions
alore-5.217
liga-4.348
rade-4.317
ammy-4.250
ints-4.098
Token in
Token ID287
Feature activation+1.29e+00

Pos logit contributions
%%%%+5.270
IJ+5.040
Submit+4.928
NPR+4.496
 Hearing+4.398
Neg logit contributions
alore-7.726
ammy-6.521
aints-6.511
liga-6.508
rade-6.395
Token his
Token ID465
Feature activation+8.22e-01

Pos logit contributions
%%%%+4.645
IJ+4.435
Submit+4.303
 Hearing+4.048
NPR+4.024
Neg logit contributions
alore-7.208
liga-6.365
aints-6.247
rade-6.210
ammy-6.131
Token head
Token ID1182
Feature activation+7.89e-01

Pos logit contributions
%%%%+4.607
IJ+4.468
Submit+4.397
 Hearing+4.228
NPR+4.162
Neg logit contributions
alore-3.069
liga-2.515
rade-2.471
aints-2.427
ammy-2.336
Token).
Token ID737
Feature activation+1.03e+00

Pos logit contributions
%%%%+2.902
IJ+2.789
Submit+2.715
 Hearing+2.483
NPR+2.457
Neg logit contributions
alore-4.545
rade-3.894
liga-3.853
aints-3.848
ints-3.835
Token Kid
Token ID16828
Feature activation-2.11e-01

Pos logit contributions
%%%%+4.116
IJ+3.936
Submit+3.907
 Hearing+3.858
NPR+3.582
Neg logit contributions
alore-5.466
liga-4.683
rade-4.656
ints-4.516
aints-4.510
Tokenn
Token ID77
Feature activation+1.56e-01

Pos logit contributions
alore+1.036
rade+0.890
aints+0.877
liga+0.866
ints+0.858
Neg logit contributions
%%%%-0.930
IJ-0.920
Submit-0.894
NPR-0.835
 Hearing-0.822
Tokenapping
Token ID5912
Feature activation-6.39e-01

Pos logit contributions
%%%%+0.758
IJ+0.746
Submit+0.731
NPR+0.685
 Hearing+0.683
Neg logit contributions
alore-0.701
rade-0.590
liga-0.578
aints-0.568
ints-0.558
Token or
Token ID393
Feature activation+1.19e-01

Pos logit contributions
alore+3.166
rade+2.729
aints+2.701
liga+2.669
ints+2.658
Neg logit contributions
IJ-2.742
%%%%-2.741
Submit-2.646
NPR-2.465
-2.433
Token Death
Token ID5830
Feature activation+3.28e-01

Pos logit contributions
%%%%+0.495
IJ+0.483
Submit+0.471
 Hearing+0.438
NPR+0.436
Neg logit contributions
alore-0.620
rade-0.534
liga-0.525
aints-0.523
ints-0.516
Token.
Token ID13
Feature activation+6.64e-01

Pos logit contributions
%%%%+1.500
IJ+1.465
Submit+1.417
 Hearing+1.337
NPR+1.331
Neg logit contributions
alore-1.733
rade-1.476
liga-1.457
aints-1.452
ints-1.428
Token
Token ID198
Feature activation+9.33e-01

Pos logit contributions
%%%%+2.149
IJ+2.104
Submit+2.012
 Hearing+1.902
NPR+1.850
Neg logit contributions
alore-4.107
rade-3.579
liga-3.562
aints-3.519
ints-3.491
Token
Token ID198
Feature activation+9.56e-01

Pos logit contributions
%%%%+3.195
IJ+3.159
Submit+3.063
NPR+2.820
Psy+2.715
Neg logit contributions
alore-5.637
liga-4.854
rade-4.789
aints-4.772
ammy-4.761
TokenAV
Token ID10116
Feature activation+9.36e-01

Pos logit contributions
IJ+2.812
%%%%+2.776
Submit+2.625
NPR+2.448
Psy+2.298
Neg logit contributions
alore-6.281
liga-5.482
rade-5.479
aints-5.404
ammy-5.339
TokenC
Token ID34
Feature activation+5.49e-01

Pos logit contributions
%%%%+2.726
IJ+2.719
Submit+2.645
NPR+2.379
Psy+2.220
Neg logit contributions
alore-6.152
rade-5.391
ammy-5.342
aints-5.337
liga-5.335
Token:
Token ID25
Feature activation+1.24e+00

Pos logit contributions
%%%%+2.372
IJ+2.314
Submit+2.257
NPR+2.093
 Hearing+2.073
Neg logit contributions
alore-2.859
rade-2.398
liga-2.395
aints-2.381
ammy-2.344
Token It
Token ID632
Feature activation+7.73e-01

Pos logit contributions
%%%%+3.753
IJ+3.683
Submit+3.514
 Hearing+3.482
NPR+3.262
Neg logit contributions
alore-7.946
liga-6.843
rade-6.779
aints-6.730
ints-6.659
Token must
Token ID1276
Feature activation+8.30e-01

Pos logit contributions
%%%%+3.046
IJ+3.039
Submit+2.893
NPR+2.791
 Hearing+2.749
Neg logit contributions
alore-4.065
rade-3.502
liga-3.479
aints-3.415
ints-3.333
Token be
Token ID307
Feature activation+5.56e-01

Pos logit contributions
%%%%+3.198
IJ+3.082
Submit+2.982
NPR+2.825
 Hearing+2.776
Neg logit contributions
alore-4.487
rade-3.879
liga-3.848
aints-3.796
ints-3.727
Token fun
Token ID1257
Feature activation+4.83e-01

Pos logit contributions
%%%%+2.390
IJ+2.348
Submit+2.275
 Hearing+2.146
NPR+2.145
Neg logit contributions
alore-2.825
rade-2.400
liga-2.367
aints-2.360
ints-2.311
Token to
Token ID284
Feature activation+1.61e-01

Pos logit contributions
%%%%+1.950
IJ+1.902
Submit+1.839
 Hearing+1.738
NPR+1.716
Neg logit contributions
alore-2.575
rade-2.221
liga-2.188
aints-2.187
ints-2.151
Token McC
Token ID5108
Feature activation+2.52e-01

Pos logit contributions
%%%%+0.124
IJ+0.121
Submit+0.118
 Hearing+0.110
NPR+0.109
Neg logit contributions
alore-0.163
rade-0.141
liga-0.139
aints-0.138
ints-0.136
Tokenaff
Token ID2001
Feature activation+5.12e-01

Pos logit contributions
%%%%+1.230
IJ+1.221
Submit+1.194
NPR+1.126
 Hearing+1.113
Neg logit contributions
alore-1.115
liga-0.915
rade-0.909
aints-0.902
ints-0.881
Tokenrey
Token ID4364
Feature activation-1.31e-01

Pos logit contributions
%%%%+2.034
IJ+1.999
Submit+1.906
 Hearing+1.802
NPR+1.793
Neg logit contributions
alore-2.695
liga-2.331
rade-2.319
aints-2.293
ints-2.245
Token and
Token ID290
Feature activation+1.41e-01

Pos logit contributions
alore+0.631
rade+0.538
liga+0.527
aints+0.525
ints+0.517
Neg logit contributions
%%%%-0.596
IJ-0.585
Submit-0.571
NPR-0.533
 Hearing-0.532
Token Brian
Token ID8403
Feature activation+4.43e-01

Pos logit contributions
%%%%+0.556
IJ+0.543
Submit+0.529
NPR+0.488
 Hearing+0.487
Neg logit contributions
alore-0.766
rade-0.664
liga-0.653
aints-0.651
ints-0.642
Token Albert
Token ID9966
Feature activation+1.01e+00

Pos logit contributions
%%%%+1.787
IJ+1.754
Submit+1.698
 Hearing+1.605
NPR+1.578
Neg logit contributions
alore-2.345
rade-2.015
liga-2.003
aints-1.991
ints-1.962
Token have
Token ID423
Feature activation-3.84e-01

Pos logit contributions
IJ+4.103
%%%%+4.048
Submit+3.946
NPR+3.677
 Hearing+3.613
Neg logit contributions
alore-5.143
rade-4.346
aints-4.329
liga-4.323
ints-4.237
Token taken
Token ID2077
Feature activation-6.87e-01

Pos logit contributions
alore+1.897
rade+1.624
liga+1.605
aints+1.582
ints+1.570
Neg logit contributions
%%%%-1.714
IJ-1.669
Submit-1.619
NPR-1.509
 Hearing-1.506
Token on
Token ID319
Feature activation-1.64e-01

Pos logit contributions
alore+3.296
rade+2.845
aints+2.813
liga+2.769
ints+2.769
Neg logit contributions
%%%%-3.104
IJ-3.039
Submit-2.955
NPR-2.744
-2.733
Token the
Token ID262
Feature activation+2.31e-01

Pos logit contributions
alore+0.849
rade+0.730
liga+0.717
aints+0.714
ints+0.704
Neg logit contributions
%%%%-0.694
IJ-0.680
Submit-0.663
NPR-0.615
 Hearing-0.614
Token task
Token ID4876
Feature activation+2.13e-02

Pos logit contributions
%%%%+1.096
IJ+1.079
Submit+1.056
 Hearing+0.988
NPR+0.987
Neg logit contributions
alore-1.075
rade-0.908
liga-0.890
aints-0.881
ints-0.868
Token be
Token ID307
Feature activation+9.70e-02

Pos logit contributions
%%%%+0.678
IJ+0.664
Submit+0.648
NPR+0.602
 Hearing+0.601
Neg logit contributions
alore-0.789
rade-0.677
liga-0.665
aints-0.662
ints-0.653
Token more
Token ID517
Feature activation+2.14e-01

Pos logit contributions
%%%%+0.405
IJ+0.394
Submit+0.384
NPR+0.356
 Hearing+0.354
Neg logit contributions
alore-0.509
rade-0.438
liga-0.429
aints-0.429
ints-0.423
Token inclined
Token ID19514
Feature activation+3.23e-01

Pos logit contributions
%%%%+0.829
IJ+0.808
Submit+0.787
NPR+0.725
 Hearing+0.722
Neg logit contributions
alore-1.182
rade-1.027
liga-1.010
aints-1.007
ints-0.994
Token to
Token ID284
Feature activation-1.81e-01

Pos logit contributions
%%%%+1.292
IJ+1.265
Submit+1.230
 Hearing+1.136
NPR+1.134
Neg logit contributions
alore-1.740
rade-1.502
liga-1.487
aints-1.479
ints-1.458
Token take
Token ID1011
Feature activation-3.38e-01

Pos logit contributions
alore+0.843
rade+0.713
aints+0.699
liga+0.691
ints+0.690
Neg logit contributions
%%%%-0.851
IJ-0.839
Submit-0.814
NPR-0.764
 Hearing-0.761
Token a
Token ID257
Feature activation+2.23e-01

Pos logit contributions
alore+1.644
rade+1.395
aints+1.373
liga+1.365
ints+1.357
Neg logit contributions
%%%%-1.534
IJ-1.492
Submit-1.466
NPR-1.361
 Hearing-1.359
Token job
Token ID1693
Feature activation+1.44e-01

Pos logit contributions
%%%%+1.041
IJ+1.021
Submit+0.998
NPR+0.933
 Hearing+0.931
Neg logit contributions
alore-1.049
rade-0.890
liga-0.872
aints-0.869
ints-0.855
Token if
Token ID611
Feature activation-1.38e-01

Pos logit contributions
%%%%+0.587
IJ+0.575
Submit+0.560
NPR+0.518
 Hearing+0.517
Neg logit contributions
alore-0.767
rade-0.662
liga-0.652
aints-0.649
ints-0.641
Token staff
Token ID3085
Feature activation-2.15e-03

Pos logit contributions
alore+0.737
rade+0.641
liga+0.625
aints+0.622
ints+0.617
Neg logit contributions
%%%%-0.564
IJ-0.543
Submit-0.531
NPR-0.489
 Hearing-0.488
Token can
Token ID460
Feature activation-8.48e-03

Pos logit contributions
alore+0.011
rade+0.009
liga+0.009
aints+0.009
ints+0.009
Neg logit contributions
%%%%-0.009
IJ-0.009
Submit-0.009
NPR-0.008
 Hearing-0.008
Token work
Token ID670
Feature activation+6.88e-02

Pos logit contributions
alore+0.042
rade+0.036
liga+0.035
aints+0.035
ints+0.035
Neg logit contributions
%%%%-0.037
IJ-0.037
Submit-0.036
NPR-0.033
 Hearing-0.033
Token Ros
Token ID10018
Feature activation-2.54e-02

Pos logit contributions
%%%%+0.673
IJ+0.658
Submit+0.641
 Hearing+0.598
NPR+0.595
Neg logit contributions
alore-0.811
rade-0.696
liga-0.686
aints-0.682
ints-0.670
Tokencommon
Token ID11321
Feature activation-1.62e-01

Pos logit contributions
alore+0.118
rade+0.100
liga+0.098
aints+0.098
ints+0.096
Neg logit contributions
%%%%-0.120
IJ-0.117
Submit-0.115
 Hearing-0.107
NPR-0.107
Token and
Token ID290
Feature activation+4.05e-01

Pos logit contributions
alore+0.806
rade+0.690
liga+0.677
aints+0.675
ints+0.665
Neg logit contributions
%%%%-0.715
IJ-0.700
Submit-0.683
NPR-0.636
 Hearing-0.635
Token owned
Token ID6898
Feature activation+8.81e-01

Pos logit contributions
%%%%+1.718
IJ+1.698
Submit+1.643
 Hearing+1.548
NPR+1.532
Neg logit contributions
alore-2.056
rade-1.758
liga-1.738
aints-1.729
ints-1.695
Token by
Token ID416
Feature activation+4.91e-01

Pos logit contributions
%%%%+3.634
IJ+3.628
Submit+3.478
NPR+3.296
 Hearing+3.210
Neg logit contributions
alore-4.387
rade-3.826
liga-3.751
aints-3.750
ammy-3.679
Token Richard
Token ID6219
Feature activation+4.40e-01

Pos logit contributions
%%%%+2.039
IJ+1.990
Submit+1.920
 Hearing+1.808
NPR+1.786
Neg logit contributions
alore-2.571
rade-2.209
liga-2.188
aints-2.160
ints-2.128
Token Siber
Token ID28394
Feature activation+7.23e-01

Pos logit contributions
%%%%+1.791
IJ+1.755
Submit+1.693
 Hearing+1.596
NPR+1.562
Neg logit contributions
alore-2.325
rade-1.987
liga-1.974
aints-1.956
ints-1.933
Tokenry
Token ID563
Feature activation+3.75e-01

Pos logit contributions
%%%%+3.079
IJ+3.057
Submit+2.960
 Hearing+2.776
NPR+2.770
Neg logit contributions
alore-3.607
aints-3.050
liga-3.023
rade-3.010
ints-2.963
Token (
Token ID357
Feature activation+4.05e-01

Pos logit contributions
%%%%+1.616
IJ+1.591
Submit+1.542
 Hearing+1.441
NPR+1.433
Neg logit contributions
alore-1.883
rade-1.605
liga-1.581
aints-1.575
ints-1.556
Tokenon
Token ID261
Feature activation+6.51e-01

Pos logit contributions
%%%%+1.740
IJ+1.710
Submit+1.660
NPR+1.558
Psy+1.540
Neg logit contributions
alore-2.058
rade-1.751
liga-1.744
aints-1.719
ints-1.699
Token the
Token ID262
Feature activation+7.90e-01

Pos logit contributions
%%%%+2.692
IJ+2.620
Submit+2.585
 Hearing+2.412
NPR+2.355
Neg logit contributions
alore-3.382
liga-2.877
rade-2.869
aints-2.824
ammy-2.787
Token in
Token ID287
Feature activation+3.28e-02

Pos logit contributions
alore+0.438
rade+0.375
liga+0.368
aints+0.367
ints+0.362
Neg logit contributions
%%%%-0.379
IJ-0.371
Submit-0.362
NPR-0.337
 Hearing-0.336
Token a
Token ID257
Feature activation+1.86e-01

Pos logit contributions
%%%%+0.140
IJ+0.137
Submit+0.133
 Hearing+0.124
NPR+0.124
Neg logit contributions
alore-0.168
rade-0.144
liga-0.142
aints-0.141
ints-0.139
Token side
Token ID1735
Feature activation-1.53e-01

Pos logit contributions
%%%%+0.865
IJ+0.851
Submit+0.831
 Hearing+0.780
NPR+0.779
Neg logit contributions
alore-0.877
rade-0.740
aints-0.727
liga-0.726
ints-0.712
Token that
Token ID326
Feature activation-1.98e-01

Pos logit contributions
alore+0.764
rade+0.654
liga+0.640
aints+0.640
ints+0.630
Neg logit contributions
%%%%-0.671
IJ-0.660
Submit-0.644
NPR-0.600
 Hearing-0.598
Token had
Token ID550
Feature activation-2.50e-01

Pos logit contributions
alore+0.972
rade+0.829
aints+0.812
liga+0.812
ints+0.799
Neg logit contributions
%%%%-0.888
IJ-0.873
Submit-0.851
NPR-0.794
 Hearing-0.793
Token Paul
Token ID3362
Feature activation+2.10e-01

Pos logit contributions
alore+1.232
rade+1.053
liga+1.033
aints+1.028
ints+1.014
Neg logit contributions
%%%%-1.116
IJ-1.092
Submit-1.067
NPR-0.994
 Hearing-0.991
Token Sch
Token ID3059
Feature activation+4.31e-01

Pos logit contributions
%%%%+0.850
IJ+0.833
Submit+0.809
 Hearing+0.760
NPR+0.745
Neg logit contributions
alore-1.113
rade-0.946
liga-0.943
aints-0.931
ints-0.920
Tokenoles
Token ID4316
Feature activation+4.13e-01

Pos logit contributions
%%%%+2.359
IJ+2.324
Submit+2.293
NPR+2.163
 Hearing+2.162
Neg logit contributions
alore-1.624
aints-1.280
liga-1.265
rade-1.260
ints-1.239
Token,
Token ID11
Feature activation+3.26e-01

Pos logit contributions
%%%%+1.779
IJ+1.762
Submit+1.698
 Hearing+1.594
NPR+1.592
Neg logit contributions
alore-2.065
rade-1.752
liga-1.719
aints-1.714
ints-1.697
Token David
Token ID3271
Feature activation+2.16e-01

Pos logit contributions
%%%%+1.289
IJ+1.263
Submit+1.223
 Hearing+1.144
NPR+1.133
Neg logit contributions
alore-1.761
rade-1.509
liga-1.496
aints-1.490
ints-1.476
Token Beckham
Token ID37244
Feature activation-1.81e-01

Pos logit contributions
%%%%+0.851
IJ+0.833
Submit+0.810
NPR+0.747
 Hearing+0.747
Neg logit contributions
alore-1.173
rade-1.017
liga-1.001
aints-0.997
ints-0.984
Token understand
Token ID1833
Feature activation-4.60e-01

Pos logit contributions
alore+2.366
rade+1.918
aints+1.898
liga+1.869
ints+1.854
Neg logit contributions
%%%%-3.501
IJ-3.455
Submit-3.370
NPR-3.211
 Hearing-3.163
Token both
Token ID1111
Feature activation-3.89e-01

Pos logit contributions
alore+2.309
rade+1.992
liga+1.943
aints+1.936
ints+1.921
Neg logit contributions
%%%%-1.988
IJ-1.942
Submit-1.900
NPR-1.762
 Hearing-1.741
Token sides
Token ID5389
Feature activation-1.01e-01

Pos logit contributions
alore+1.623
rade+1.355
aints+1.326
liga+1.325
ints+1.303
Neg logit contributions
%%%%-2.009
IJ-1.968
Submit-1.935
NPR-1.822
 Hearing-1.809
Token.
Token ID13
Feature activation+2.83e-01

Pos logit contributions
alore+0.496
rade+0.424
liga+0.417
aints+0.415
ints+0.409
Neg logit contributions
%%%%-0.448
IJ-0.440
Submit-0.429
NPR-0.400
 Hearing-0.399
Token The
Token ID383
Feature activation-5.93e-02

Pos logit contributions
%%%%+0.937
IJ+0.912
Submit+0.882
NPR+0.800
 Hearing+0.797
Neg logit contributions
alore-1.720
rade-1.517
liga-1.494
aints-1.490
ints-1.473
Token mixed
Token ID7668
Feature activation+2.43e-01

Pos logit contributions
alore+0.271
rade+0.229
liga+0.224
aints+0.223
ints+0.220
Neg logit contributions
%%%%-0.286
IJ-0.280
Submit-0.274
NPR-0.257
 Hearing-0.256
Token kids
Token ID3988
Feature activation+1.33e-01

Pos logit contributions
%%%%+1.181
IJ+1.161
Submit+1.133
NPR+1.065
 Hearing+1.065
Neg logit contributions
alore-1.098
rade-0.923
liga-0.903
aints-0.898
ints-0.884
Token that
Token ID326
Feature activation-7.04e-02

Pos logit contributions
%%%%+0.578
IJ+0.565
Submit+0.551
NPR+0.515
 Hearing+0.515
Neg logit contributions
alore-0.669
rade-0.574
liga-0.563
aints-0.560
ints-0.552
Token I
Token ID314
Feature activation-8.03e-02

Pos logit contributions
alore+0.366
rade+0.316
liga+0.310
aints+0.309
ints+0.305
Neg logit contributions
%%%%-0.295
IJ-0.289
Submit-0.281
NPR-0.261
 Hearing-0.260
Token
Token ID447
Feature activation+1.86e-01

Pos logit contributions
alore+0.395
rade+0.338
liga+0.332
aints+0.330
ints+0.326
Neg logit contributions
%%%%-0.358
IJ-0.352
Submit-0.343
NPR-0.321
 Hearing-0.319
Token
Token ID247
Feature activation+1.13e-01

Pos logit contributions
%%%%+0.422
IJ+0.394
Submit+0.379
 Hearing+0.322
NPR+0.322
Neg logit contributions
alore-1.340
rade-1.197
liga-1.190
aints-1.189
ints-1.173
Token
Token ID198
Feature activation+3.95e-01

Pos logit contributions
%%%%+1.390
IJ+1.360
Submit+1.320
 Hearing+1.223
NPR+1.217
Neg logit contributions
alore-2.060
rade-1.791
liga-1.766
aints-1.760
ints-1.739
Token
Token ID198
Feature activation+5.14e-01

Pos logit contributions
%%%%+1.497
IJ+1.460
Submit+1.427
NPR+1.310
 Hearing+1.294
Neg logit contributions
alore-2.236
liga-1.922
rade-1.921
aints-1.900
ammy-1.874
TokenJay
Token ID30568
Feature activation-1.07e-02

Pos logit contributions
%%%%+1.719
IJ+1.693
Submit+1.635
NPR+1.502
Psy+1.451
Neg logit contributions
alore-3.144
rade-2.738
liga-2.728
aints-2.710
ints-2.676
Token replied
Token ID8712
Feature activation+3.94e-01

Pos logit contributions
alore+0.055
rade+0.047
liga+0.047
aints+0.047
ints+0.046
Neg logit contributions
%%%%-0.045
IJ-0.044
Submit-0.043
NPR-0.040
 Hearing-0.040
Token with
Token ID351
Feature activation-2.38e-01

Pos logit contributions
%%%%+1.670
IJ+1.643
Submit+1.599
NPR+1.500
 Hearing+1.495
Neg logit contributions
alore-1.997
rade-1.718
liga-1.699
aints-1.690
ints-1.655
Token a
Token ID257
Feature activation+1.47e-01

Pos logit contributions
alore+1.205
rade+1.034
liga+1.014
aints+1.011
ints+0.999
Neg logit contributions
%%%%-1.034
IJ-1.011
Submit-0.987
NPR-0.917
 Hearing-0.915
Token w
Token ID266
Feature activation+2.52e-01

Pos logit contributions
%%%%+0.669
IJ+0.656
Submit+0.641
NPR+0.598
 Hearing+0.597
Neg logit contributions
alore-0.711
rade-0.605
liga-0.594
aints-0.592
ints-0.582
Tokenry
Token ID563
Feature activation+3.17e-01

Pos logit contributions
%%%%+1.463
IJ+1.453
Submit+1.409
NPR+1.347
 Hearing+1.341
Neg logit contributions
alore-0.900
rade-0.709
liga-0.707
aints-0.681
ints-0.668
Token smile
Token ID8212
Feature activation+4.30e-01

Pos logit contributions
%%%%+1.327
IJ+1.300
Submit+1.269
 Hearing+1.180
NPR+1.178
Neg logit contributions
alore-1.637
rade-1.410
aints-1.390
liga-1.388
ints-1.356
Token:
Token ID25
Feature activation+6.39e-01

Pos logit contributions
%%%%+1.902
IJ+1.848
Submit+1.796
 Hearing+1.692
NPR+1.683
Neg logit contributions
alore-2.134
rade-1.822
liga-1.793
aints-1.789
ints-1.745
Token
Token ID198
Feature activation+7.38e-01

Pos logit contributions
%%%%+2.430
IJ+2.377
Submit+2.288
 Hearing+2.148
NPR+2.141
Neg logit contributions
alore-3.547
rade-3.076
liga-3.032
aints-3.021
ints-2.982
Token to
Token ID284
Feature activation+4.62e-01

Pos logit contributions
%%%%+1.116
IJ+1.091
Submit+1.059
 Hearing+0.995
NPR+0.986
Neg logit contributions
alore-1.484
rade-1.288
liga-1.267
aints-1.261
ints-1.246
Token NBC
Token ID9716
Feature activation-5.28e-02

Pos logit contributions
%%%%+1.844
IJ+1.810
Submit+1.746
NPR+1.650
 Hearing+1.641
Neg logit contributions
alore-2.491
rade-2.143
liga-2.127
aints-2.121
ints-2.072
Token that
Token ID326
Feature activation+4.90e-01

Pos logit contributions
alore+0.277
rade+0.239
liga+0.235
aints+0.234
ints+0.231
Neg logit contributions
%%%%-0.219
IJ-0.215
Submit-0.209
NPR-0.195
 Hearing-0.194
Token he
Token ID339
Feature activation+5.88e-02

Pos logit contributions
%%%%+1.989
IJ+1.960
Submit+1.895
 Hearing+1.793
NPR+1.773
Neg logit contributions
alore-2.596
rade-2.217
liga-2.198
aints-2.193
ints-2.163
Token had
Token ID550
Feature activation-2.19e-01

Pos logit contributions
%%%%+0.251
IJ+0.248
Submit+0.241
 Hearing+0.226
NPR+0.225
Neg logit contributions
alore-0.299
rade-0.255
liga-0.253
aints-0.251
ints-0.246
Token decided
Token ID3066
Feature activation-1.88e-01

Pos logit contributions
alore+1.161
rade+1.005
liga+0.987
aints+0.983
ints+0.971
Neg logit contributions
%%%%-0.894
IJ-0.872
Submit-0.849
NPR-0.786
 Hearing-0.782
Token to
Token ID284
Feature activation-2.50e-03

Pos logit contributions
alore+1.011
rade+0.876
liga+0.860
aints+0.857
ints+0.846
Neg logit contributions
%%%%-0.754
IJ-0.737
Submit-0.716
NPR-0.662
 Hearing-0.660
Token fire
Token ID2046
Feature activation+1.90e-02

Pos logit contributions
alore+0.011
rade+0.010
liga+0.009
aints+0.009
ints+0.009
Neg logit contributions
%%%%-0.012
IJ-0.012
Submit-0.012
NPR-0.011
 Hearing-0.011
Token Comey
Token ID12388
Feature activation-9.89e-03

Pos logit contributions
%%%%+0.077
IJ+0.076
Submit+0.074
NPR+0.068
 Hearing+0.068
Neg logit contributions
alore-0.101
rade-0.087
liga-0.086
aints-0.086
ints-0.084
Token before
Token ID878
Feature activation-3.76e-01

Pos logit contributions
alore+0.050
rade+0.043
liga+0.042
aints+0.042
ints+0.041
Neg logit contributions
%%%%-0.043
IJ-0.042
Submit-0.041
NPR-0.038
 Hearing-0.038
Token Attorney
Token ID8123
Feature activation+1.13e-01

Pos logit contributions
alore+1.908
rade+1.650
liga+1.614
aints+1.605
ints+1.584
Neg logit contributions
%%%%-1.612
IJ-1.574
Submit-1.537
NPR-1.427
-1.409
Token dots
Token ID22969
Feature activation-1.79e-01

Pos logit contributions
%%%%+0.682
IJ+0.669
Submit+0.653
 Hearing+0.607
NPR+0.606
Neg logit contributions
alore-0.804
rade-0.689
liga-0.677
aints-0.674
ints-0.665
Token on
Token ID319
Feature activation+2.65e-01

Pos logit contributions
alore+0.897
rade+0.768
liga+0.754
aints+0.751
ints+0.741
Neg logit contributions
%%%%-0.783
IJ-0.766
Submit-0.749
NPR-0.695
 Hearing-0.693
Token the
Token ID262
Feature activation+2.06e-01

Pos logit contributions
%%%%+1.088
IJ+1.067
Submit+1.039
 Hearing+0.964
NPR+0.956
Neg logit contributions
alore-1.395
rade-1.195
aints-1.181
liga-1.180
ints-1.156
Token upper
Token ID6727
Feature activation-2.00e-01

Pos logit contributions
%%%%+0.939
IJ+0.923
Submit+0.900
 Hearing+0.843
NPR+0.838
Neg logit contributions
alore-0.992
rade-0.840
liga-0.825
aints-0.824
ints-0.808
Token left
Token ID1364
Feature activation-9.01e-01

Pos logit contributions
alore+1.027
rade+0.894
liga+0.874
aints+0.869
ints+0.862
Neg logit contributions
%%%%-0.852
IJ-0.828
Submit-0.811
NPR-0.753
 Hearing-0.746
Token front
Token ID2166
Feature activation-6.93e-01

Pos logit contributions
alore+4.302
rade+3.735
liga+3.642
aints+3.587
ints+3.571
Neg logit contributions
%%%%-4.080
IJ-3.948
Submit-3.880
NPR-3.598
 Hearing-3.592
Token corner
Token ID5228
Feature activation-3.79e-01

Pos logit contributions
alore+3.525
rade+3.090
liga+3.012
aints+2.997
ints+2.967
Neg logit contributions
%%%%-2.953
IJ-2.860
Submit-2.805
NPR-2.607
 Hearing-2.584
Token of
Token ID286
Feature activation+7.46e-02

Pos logit contributions
alore+2.050
rade+1.777
liga+1.749
aints+1.741
ints+1.721
Neg logit contributions
%%%%-1.511
IJ-1.473
Submit-1.432
NPR-1.320
 Hearing-1.318
Token the
Token ID262
Feature activation-8.15e-02

Pos logit contributions
%%%%+0.305
IJ+0.299
Submit+0.291
 Hearing+0.269
NPR+0.269
Neg logit contributions
alore-0.395
rade-0.340
liga-0.335
aints-0.335
ints-0.329
Token new
Token ID649
Feature activation+4.47e-02

Pos logit contributions
alore+0.392
rade+0.334
liga+0.327
aints+0.326
ints+0.321
Neg logit contributions
%%%%-0.373
IJ-0.366
Submit-0.358
NPR-0.334
 Hearing-0.333
Token polymer
Token ID30528
Feature activation+5.70e-01

Pos logit contributions
%%%%+0.200
IJ+0.197
Submit+0.191
 Hearing+0.179
NPR+0.178
Neg logit contributions
alore-0.220
rade-0.187
liga-0.183
aints-0.183
ints-0.180
Token
Token ID198
Feature activation-6.27e-02

Pos logit contributions
%%%%+0.141
IJ+0.137
Submit+0.134
NPR+0.122
 Hearing+0.121
Neg logit contributions
alore-0.230
rade-0.203
liga-0.199
aints-0.199
ints-0.196
Token
Token ID198
Feature activation+9.01e-02

Pos logit contributions
alore+0.329
rade+0.285
liga+0.279
aints+0.279
ints+0.275
Neg logit contributions
%%%%-0.258
IJ-0.252
Submit-0.245
NPR-0.227
 Hearing-0.227
TokenBl
Token ID3629
Feature activation-1.53e-01

Pos logit contributions
%%%%+0.315
IJ+0.307
Submit+0.298
NPR+0.272
 Hearing+0.270
Neg logit contributions
alore-0.532
rade-0.467
liga-0.460
aints-0.458
ints-0.453
Tokenacks
Token ID4595
Feature activation-2.51e-01

Pos logit contributions
alore+0.650
rade+0.539
liga+0.527
aints+0.525
ints+0.516
Neg logit contributions
%%%%-0.790
IJ-0.777
Submit-0.761
NPR-0.716
 Hearing-0.715
Token in
Token ID287
Feature activation-1.74e-02

Pos logit contributions
alore+1.252
rade+1.072
liga+1.055
aints+1.052
ints+1.035
Neg logit contributions
%%%%-1.100
IJ-1.077
Submit-1.052
NPR-0.976
 Hearing-0.970
Token Philly
Token ID30029
Feature activation-3.90e-01

Pos logit contributions
alore+0.083
rade+0.071
liga+0.069
aints+0.069
ints+0.068
Neg logit contributions
%%%%-0.080
IJ-0.078
Submit-0.077
NPR-0.072
 Hearing-0.071
Token have
Token ID423
Feature activation+5.71e-02

Pos logit contributions
alore+1.947
rade+1.661
liga+1.637
aints+1.633
ints+1.612
Neg logit contributions
%%%%-1.708
IJ-1.665
Submit-1.631
NPR-1.511
Psy-1.498
Token a
Token ID257
Feature activation+1.08e-01

Pos logit contributions
%%%%+0.244
IJ+0.239
Submit+0.233
NPR+0.216
 Hearing+0.216
Neg logit contributions
alore-0.292
rade-0.251
liga-0.247
aints-0.246
ints-0.242
Token different
Token ID1180
Feature activation+2.24e-01

Pos logit contributions
%%%%+0.483
IJ+0.473
Submit+0.462
NPR+0.431
 Hearing+0.430
Neg logit contributions
alore-0.532
rade-0.454
liga-0.446
aints-0.444
ints-0.437
Token overall
Token ID4045
Feature activation+1.13e-01

Pos logit contributions
%%%%+1.049
IJ+1.029
Submit+1.005
 Hearing+0.946
NPR+0.943
Neg logit contributions
alore-1.051
rade-0.894
liga-0.875
aints-0.872
ints-0.857
Token accent
Token ID18702
Feature activation+6.46e-01

Pos logit contributions
%%%%+0.508
IJ+0.498
Submit+0.485
NPR+0.453
 Hearing+0.453
Neg logit contributions
alore-0.555
rade-0.474
liga-0.465
aints-0.463
ints-0.456
Token just
Token ID655
Feature activation-7.77e-02

Pos logit contributions
alore+0.480
rade+0.410
aints+0.402
liga+0.402
ints+0.395
Neg logit contributions
%%%%-0.419
IJ-0.415
Submit-0.401
NPR-0.376
 Hearing-0.375
Token couldn
Token ID3521
Feature activation+9.64e-02

Pos logit contributions
alore+0.397
rade+0.341
liga+0.335
aints+0.334
ints+0.329
Neg logit contributions
%%%%-0.331
IJ-0.326
Submit-0.316
 Hearing-0.294
NPR-0.294
Token't
Token ID470
Feature activation-4.97e-02

Pos logit contributions
%%%%+0.336
IJ+0.327
Submit+0.317
NPR+0.289
 Hearing+0.287
Neg logit contributions
alore-0.571
rade-0.501
liga-0.493
aints-0.492
ints-0.486
Token really
Token ID1107
Feature activation-3.35e-01

Pos logit contributions
alore+0.250
rade+0.214
liga+0.210
aints+0.210
ints+0.207
Neg logit contributions
%%%%-0.217
IJ-0.212
Submit-0.207
NPR-0.193
 Hearing-0.192
Token hold
Token ID1745
Feature activation-4.04e-01

Pos logit contributions
alore+1.597
rade+1.359
liga+1.331
aints+1.327
ints+1.307
Neg logit contributions
%%%%-1.541
IJ-1.512
Submit-1.478
NPR-1.386
 Hearing-1.376
Token the
Token ID262
Feature activation-2.19e-01

Pos logit contributions
alore+2.034
rade+1.756
aints+1.712
liga+1.708
ints+1.684
Neg logit contributions
%%%%-1.762
IJ-1.718
Submit-1.681
NPR-1.564
 Hearing-1.555
Token wheel
Token ID7825
Feature activation-3.96e-01

Pos logit contributions
alore+0.976
rade+0.819
liga+0.801
aints+0.798
ints+0.785
Neg logit contributions
%%%%-1.077
IJ-1.058
Submit-1.035
NPR-0.971
 Hearing-0.969
Token in
Token ID287
Feature activation-3.02e-01

Pos logit contributions
alore+1.919
rade+1.635
liga+1.603
aints+1.598
ints+1.571
Neg logit contributions
%%%%-1.797
IJ-1.761
Submit-1.723
NPR-1.608
 Hearing-1.601
Token the
Token ID262
Feature activation-3.26e-01

Pos logit contributions
alore+1.494
rade+1.274
liga+1.253
aints+1.249
ints+1.230
Neg logit contributions
%%%%-1.342
IJ-1.317
Submit-1.284
 Hearing-1.195
NPR-1.194
Token last
Token ID938
Feature activation-5.78e-01

Pos logit contributions
alore+1.609
rade+1.379
liga+1.352
aints+1.343
ints+1.327
Neg logit contributions
%%%%-1.450
IJ-1.420
Submit-1.386
NPR-1.293
 Hearing-1.286
Token couple
Token ID3155
Feature activation+3.70e-01

Pos logit contributions
alore+2.612
rade+2.217
liga+2.155
aints+2.145
ints+2.116
Neg logit contributions
%%%%-2.801
IJ-2.755
Submit-2.685
NPR-2.524
 Hearing-2.517
Token developer
Token ID8517
Feature activation+2.18e-01

Pos logit contributions
alore+1.629
rade+1.385
liga+1.360
aints+1.355
ints+1.342
Neg logit contributions
%%%%-1.587
IJ-1.550
Submit-1.519
NPR-1.422
 Hearing-1.410
Token hand
Token ID1021
Feature activation+3.32e-01

Pos logit contributions
%%%%+0.955
IJ+0.935
Submit+0.914
NPR+0.850
 Hearing+0.849
Neg logit contributions
alore-1.093
rade-0.937
liga-0.918
aints-0.916
ints-0.904
Token-
Token ID12
Feature activation-8.53e-01

Pos logit contributions
%%%%+1.416
IJ+1.383
Submit+1.345
NPR+1.253
 Hearing+1.251
Neg logit contributions
alore-1.704
rade-1.460
liga-1.434
aints-1.427
ints-1.403
Tokenholding
Token ID19216
Feature activation-3.00e-02

Pos logit contributions
alore+4.156
rade+3.592
aints+3.554
liga+3.546
oat+3.477
Neg logit contributions
%%%%-3.710
IJ-3.657
Submit-3.541
-3.413
 Hearing-3.334
Token could
Token ID714
Feature activation-3.83e-01

Pos logit contributions
alore+0.148
rade+0.126
liga+0.124
aints+0.123
ints+0.121
Neg logit contributions
%%%%-0.134
IJ-0.132
Submit-0.128
NPR-0.120
 Hearing-0.120
Token be
Token ID307
Feature activation-2.13e-02

Pos logit contributions
alore+1.856
rade+1.580
liga+1.546
aints+1.545
ints+1.524
Neg logit contributions
%%%%-1.738
IJ-1.706
Submit-1.665
NPR-1.552
 Hearing-1.547
Token a
Token ID257
Feature activation-5.80e-02

Pos logit contributions
alore+0.105
rade+0.090
aints+0.088
liga+0.088
ints+0.086
Neg logit contributions
%%%%-0.094
IJ-0.093
Submit-0.090
 Hearing-0.084
NPR-0.084
Token good
Token ID922
Feature activation-4.14e-01

Pos logit contributions
alore+0.256
rade+0.214
liga+0.209
aints+0.209
ints+0.205
Neg logit contributions
%%%%-0.288
IJ-0.283
Submit-0.277
NPR-0.261
 Hearing-0.261
Token thing
Token ID1517
Feature activation-3.54e-01

Pos logit contributions
alore+1.652
rade+1.358
liga+1.327
aints+1.318
ints+1.295
Neg logit contributions
%%%%-2.231
IJ-2.196
Submit-2.151
NPR-2.028
 Hearing-2.022
Token.
Token ID13
Feature activation-2.11e-01

Pos logit contributions
alore+1.673
rade+1.422
liga+1.394
aints+1.384
ints+1.367
Neg logit contributions
%%%%-1.648
IJ-1.615
Submit-1.584
NPR-1.472
 Hearing-1.467
Token\n
Token ID198
Feature activation-2.58e-01

Pos logit contributions
alore+1.128
rade+0.985
aints+0.969
liga+0.965
ints+0.952
Neg logit contributions
%%%%-0.844
IJ-0.818
Submit-0.798
NPR-0.730
-0.725
Token cinnamon
Token ID29181
Feature activation-3.51e-01

Pos logit contributions
%%%%+0.795
IJ+0.782
Submit+0.761
 Hearing+0.713
NPR+0.712
Neg logit contributions
alore-0.838
rade-0.713
aints-0.697
liga-0.696
ints-0.685
Token,
Token ID11
Feature activation+1.68e-01

Pos logit contributions
alore+1.669
rade+1.412
liga+1.388
aints+1.385
ints+1.363
Neg logit contributions
%%%%-1.622
IJ-1.596
Submit-1.552
 Hearing-1.451
NPR-1.450
Token and
Token ID290
Feature activation-8.84e-02

Pos logit contributions
%%%%+0.686
IJ+0.673
Submit+0.653
NPR+0.605
 Hearing+0.604
Neg logit contributions
alore-0.891
rade-0.771
liga-0.754
aints-0.754
ints-0.742
Token 2
Token ID362
Feature activation-1.71e-01

Pos logit contributions
alore+0.481
rade+0.417
liga+0.410
aints+0.409
ints+0.404
Neg logit contributions
%%%%-0.349
IJ-0.341
Submit-0.332
NPR-0.306
 Hearing-0.305
Token he
Token ID339
Feature activation+1.16e-02

Pos logit contributions
alore+0.768
rade+0.645
liga+0.632
aints+0.630
ints+0.620
Neg logit contributions
%%%%-0.840
IJ-0.824
Submit-0.807
NPR-0.757
 Hearing-0.755
Tokenaping
Token ID9269
Feature activation-8.23e-01

Pos logit contributions
%%%%+0.055
IJ+0.054
Submit+0.053
NPR+0.049
 Hearing+0.049
Neg logit contributions
alore-0.055
rade-0.046
liga-0.045
aints-0.045
ints-0.044
Token tablespoons
Token ID30064
Feature activation-5.36e-01

Pos logit contributions
alore+3.340
rade+2.762
liga+2.755
ints+2.738
aints+2.729
Neg logit contributions
%%%%-4.280
IJ-4.208
Submit-4.151
NPR-3.875
-3.866
Token of
Token ID286
Feature activation+2.82e-02

Pos logit contributions
alore+2.938
rade+2.563
liga+2.551
aints+2.510
ints+2.488
Neg logit contributions
%%%%-2.041
IJ-1.991
Submit-1.974
NPR-1.809
 Hearing-1.794
Token peanut
Token ID26636
Feature activation-3.26e-01

Pos logit contributions
%%%%+0.132
IJ+0.129
Submit+0.126
NPR+0.118
 Hearing+0.118
Neg logit contributions
alore-0.134
rade-0.113
liga-0.111
aints-0.111
ints-0.109
Token butter
Token ID9215
Feature activation-1.72e-01

Pos logit contributions
alore+1.189
rade+0.954
liga+0.929
aints+0.927
ints+0.908
Neg logit contributions
%%%%-1.868
IJ-1.831
Submit-1.805
 Hearing-1.709
NPR-1.708
Token:
Token ID25
Feature activation+1.15e-01

Pos logit contributions
alore+0.821
rade+0.699
liga+0.688
aints+0.684
ints+0.673
Neg logit contributions
%%%%-0.792
IJ-0.777
Submit-0.758
NPR-0.708
 Hearing-0.707
Token\n
Token ID198
Feature activation-4.85e-01

Pos logit contributions
alore+2.757
rade+2.420
aints+2.334
liga+2.314
ints+2.309
Neg logit contributions
%%%%-2.502
IJ-2.458
Submit-2.385
-2.272
[undecodable bytes]-2.244
TokenThe
Token ID464
Feature activation-2.17e-01

Pos logit contributions
alore+2.695
rade+2.373
aints+2.311
liga+2.302
ints+2.298
Neg logit contributions
%%%%-1.775
IJ-1.728
Submit-1.664
 Hearing-1.589
-1.533
Token company
Token ID1664
Feature activation-5.65e-01

Pos logit contributions
alore+1.111
rade+0.953
aints+0.935
liga+0.933
ints+0.921
Neg logit contributions
%%%%-0.926
IJ-0.906
Submit-0.883
NPR-0.819
Psy-0.812
Token received
Token ID2722
Feature activation-3.15e-01

Pos logit contributions
alore+2.750
rade+2.357
liga+2.293
aints+2.291
ints+2.252
Neg logit contributions
%%%%-2.568
IJ-2.492
Submit-2.425
NPR-2.265
 Hearing-2.251
Token an
Token ID281
Feature activation-4.92e-01

Pos logit contributions
alore+1.581
rade+1.349
aints+1.330
liga+1.320
ints+1.313
Neg logit contributions
%%%%-1.365
IJ-1.338
Submit-1.300
NPR-1.212
-1.208
Token estimated
Token ID6108
Feature activation-1.07e+00

Pos logit contributions
alore+2.374
rade+2.010
aints+1.991
liga+1.950
ints+1.949
Neg logit contributions
%%%%-2.238
IJ-2.176
Submit-2.111
-1.988
Psy-1.984
Token $
Token ID720
Feature activation-7.74e-01

Pos logit contributions
alore+4.849
rade+4.099
aints+4.016
oat+3.969
ints+3.921
Neg logit contributions
%%%%-4.994
IJ-4.882
Submit-4.677
-4.556
Psy-4.427
Token9
Token ID24
Feature activation-2.61e-01

Pos logit contributions
alore+3.775
rade+3.302
aints+3.245
liga+3.234
ints+3.177
Neg logit contributions
%%%%-3.291
IJ-3.184
Submit-3.128
 Hearing-2.940
-2.897
Token.
Token ID13
Feature activation-1.33e-01

Pos logit contributions
alore+1.287
rade+1.099
liga+1.079
aints+1.076
ints+1.060
Neg logit contributions
%%%%-1.164
IJ-1.141
Submit-1.115
NPR-1.037
 Hearing-1.034
Token7
Token ID22
Feature activation-6.74e-02

Pos logit contributions
alore+0.737
rade+0.644
liga+0.632
aints+0.632
ints+0.623
Neg logit contributions
%%%%-0.505
IJ-0.493
Submit-0.479
NPR-0.440
 Hearing-0.439
Token million
Token ID1510
Feature activation-3.17e-01

Pos logit contributions
alore+0.344
rade+0.296
liga+0.290
aints+0.290
ints+0.285
Neg logit contributions
%%%%-0.288
IJ-0.282
Submit-0.275
NPR-0.255
 Hearing-0.255
Token joke
Token ID9707
Feature activation+2.38e-01

Pos logit contributions
alore+0.224
rade+0.187
liga+0.183
aints+0.182
ints+0.179
Neg logit contributions
%%%%-0.261
IJ-0.257
Submit-0.251
NPR-0.236
 Hearing-0.236
Token,
Token ID11
Feature activation+2.74e-02

Pos logit contributions
%%%%+1.062
IJ+1.043
Submit+1.020
 Hearing+0.949
NPR+0.948
Neg logit contributions
alore-1.168
rade-0.994
liga-0.979
aints-0.973
ints-0.957
Token and
Token ID290
Feature activation-2.39e-01

Pos logit contributions
%%%%+0.120
IJ+0.117
Submit+0.114
NPR+0.107
 Hearing+0.106
Neg logit contributions
alore-0.138
rade-0.118
aints-0.116
liga-0.115
ints-0.114
Token the
Token ID262
Feature activation-2.02e-02

Pos logit contributions
alore+1.218
rade+1.064
aints+1.030
liga+1.022
ints+1.022
Neg logit contributions
%%%%-1.014
IJ-0.994
Submit-0.967
NPR-0.906
-0.889
Token steep
Token ID14559
Feature activation-4.18e-01

Pos logit contributions
alore+0.094
rade+0.080
liga+0.078
aints+0.078
ints+0.077
Neg logit contributions
%%%%-0.095
IJ-0.094
Submit-0.092
NPR-0.086
 Hearing-0.085
Token incl
Token ID13358
Feature activation-1.17e+00

Pos logit contributions
alore+1.957
rade+1.666
aints+1.629
liga+1.610
ints+1.595
Neg logit contributions
%%%%-1.967
IJ-1.918
Submit-1.874
NPR-1.757
 Hearing-1.756
Tokenines
Token ID1127
Feature activation-4.79e-01

Pos logit contributions
alore+3.365
aints+2.656
rade+2.644
ints+2.599
liga+2.437
Neg logit contributions
%%%%-7.388
IJ-7.200
Submit-7.123
-6.884
NPR-6.876
Token can
Token ID460
Feature activation-2.88e-01

Pos logit contributions
alore+2.331
rade+2.018
aints+1.969
liga+1.952
ints+1.938
Neg logit contributions
%%%%-2.137
IJ-2.095
Submit-2.033
NPR-1.900
 Hearing-1.895
Token be
Token ID307
Feature activation+7.91e-02

Pos logit contributions
alore+1.497
rade+1.289
aints+1.259
liga+1.256
ints+1.241
Neg logit contributions
%%%%-1.212
IJ-1.188
Submit-1.153
 Hearing-1.073
NPR-1.071
Token treacherous
Token ID45365
Feature activation-3.30e-01

Pos logit contributions
%%%%+0.336
IJ+0.329
Submit+0.321
NPR+0.298
 Hearing+0.297
Neg logit contributions
alore-0.406
rade-0.349
liga-0.343
aints-0.342
ints-0.337
Token with
Token ID351
Feature activation-2.19e-01

Pos logit contributions
alore+1.573
rade+1.350
aints+1.315
liga+1.307
ints+1.298
Neg logit contributions
%%%%-1.508
IJ-1.477
Submit-1.443
NPR-1.357
-1.340
Token ships
Token ID7937
Feature activation-5.55e-01

Pos logit contributions
alore+3.295
rade+2.837
aints+2.749
ints+2.719
liga+2.698
Neg logit contributions
%%%%-3.478
IJ-3.431
Submit-3.353
NPR-3.179
-3.163
Token can
Token ID460
Feature activation-6.34e-01

Pos logit contributions
alore+2.537
rade+2.141
aints+2.089
liga+2.085
ints+2.055
Neg logit contributions
%%%%-2.662
IJ-2.612
Submit-2.558
NPR-2.396
 Hearing-2.394
Token be
Token ID307
Feature activation-1.68e-01

Pos logit contributions
alore+3.088
rade+2.608
liga+2.546
aints+2.541
ints+2.515
Neg logit contributions
%%%%-2.885
IJ-2.829
Submit-2.756
 Hearing-2.574
NPR-2.566
Token sent
Token ID1908
Feature activation-2.03e-01

Pos logit contributions
alore+0.836
rade+0.715
liga+0.700
aints+0.697
ints+0.687
Neg logit contributions
%%%%-0.743
IJ-0.727
Submit-0.711
NPR-0.662
 Hearing-0.660
Token\n
Token ID198
Feature activation-7.81e-01

Pos logit contributions
alore+1.013
rade+0.870
aints+0.850
liga+0.848
ints+0.837
Neg logit contributions
%%%%-0.883
IJ-0.867
Submit-0.848
NPR-0.785
 Hearing-0.784
Token\n
Token ID198
Feature activation-8.29e-01

Pos logit contributions
alore+3.616
rade+3.200
aints+3.050
ints+3.038
liga+3.031
Neg logit contributions
%%%%-3.442
IJ-3.395
Submit-3.296
-3.184
[undecodable bytes]-3.164
Tokento
Token ID1462
Feature activation-1.71e-02

Pos logit contributions
alore+3.914
rade+3.346
aints+3.275
liga+3.267
ints+3.232
Neg logit contributions
%%%%-3.746
IJ-3.677
Submit-3.570
 Hearing-3.408
-3.357
Token explore
Token ID7301
Feature activation-1.27e-01

Pos logit contributions
alore+0.084
rade+0.072
liga+0.070
aints+0.070
ints+0.069
Neg logit contributions
%%%%-0.077
IJ-0.075
Submit-0.073
NPR-0.068
 Hearing-0.068
Token,
Token ID11
Feature activation-6.31e-02

Pos logit contributions
alore+0.625
rade+0.535
liga+0.524
aints+0.523
ints+0.515
Neg logit contributions
%%%%-0.563
IJ-0.552
Submit-0.539
NPR-0.502
 Hearing-0.501
Token harvest
Token ID13222
Feature activation+6.50e-02

Pos logit contributions
alore+0.289
rade+0.244
liga+0.239
aints+0.238
ints+0.234
Neg logit contributions
%%%%-0.303
IJ-0.298
Submit-0.291
NPR-0.273
 Hearing-0.272
Token resources
Token ID4133
Feature activation+2.63e-02

Pos logit contributions
%%%%+0.304
IJ+0.297
Submit+0.291
NPR+0.272
 Hearing+0.271
Neg logit contributions
alore-0.306
rade-0.259
liga-0.256
aints-0.253
ints-0.248
Token's
Token ID338
Feature activation-5.18e-02

Pos logit contributions
%%%%+0.962
IJ+0.947
Submit+0.922
NPR+0.860
 Hearing+0.859
Neg logit contributions
alore-1.019
rade-0.866
liga-0.854
aints-0.845
ints-0.834
Token jobs
Token ID3946
Feature activation+3.74e-01

Pos logit contributions
alore+0.236
rade+0.198
liga+0.193
aints+0.193
ints+0.190
Neg logit contributions
%%%%-0.251
IJ-0.246
Submit-0.241
NPR-0.226
 Hearing-0.226
Token.
Token ID13
Feature activation-1.37e-01

Pos logit contributions
%%%%+1.652
IJ+1.622
Submit+1.570
 Hearing+1.470
NPR+1.463
Neg logit contributions
alore-1.850
rade-1.580
liga-1.554
aints-1.528
ints-1.522
Token The
Token ID383
Feature activation-1.15e-01

Pos logit contributions
alore+0.771
rade+0.678
aints+0.664
liga+0.661
ints+0.654
Neg logit contributions
%%%%-0.511
IJ-0.496
Submit-0.484
NPR-0.444
-0.436
Token book
Token ID1492
Feature activation-4.35e-01

Pos logit contributions
alore+0.536
rade+0.454
liga+0.445
aints+0.443
ints+0.435
Neg logit contributions
%%%%-0.545
IJ-0.535
Submit-0.522
NPR-0.489
 Hearing-0.488
Token The
Token ID383
Feature activation-8.94e-01

Pos logit contributions
alore+1.886
rade+1.595
aints+1.545
liga+1.543
ints+1.508
Neg logit contributions
%%%%-2.203
IJ-2.149
Submit-2.103
NPR-1.983
 Hearing-1.962
Token Rise
Token ID15648
Feature activation-6.06e-01

Pos logit contributions
alore+4.727
rade+4.059
aints+3.999
liga+3.971
ints+3.956
Neg logit contributions
%%%%-3.612
IJ-3.511
Submit-3.442
NPR-3.204
-3.144
Token of
Token ID286
Feature activation-4.36e-01

Pos logit contributions
alore+3.260
rade+2.827
aints+2.798
liga+2.788
ints+2.759
Neg logit contributions
%%%%-2.375
IJ-2.322
Submit-2.280
NPR-2.107
-2.062
Token the
Token ID262
Feature activation-6.78e-01

Pos logit contributions
alore+2.242
rade+1.951
aints+1.906
liga+1.897
ints+1.887
Neg logit contributions
%%%%-1.827
IJ-1.780
Submit-1.737
NPR-1.631
-1.586
Token Robots
Token ID39096
Feature activation-1.65e-01

Pos logit contributions
alore+3.628
rade+3.176
aints+3.126
ints+3.098
liga+3.092
Neg logit contributions
%%%%-2.656
IJ-2.562
Submit-2.508
NPR-2.355
-2.292
Token highlights
Token ID11330
Feature activation+1.32e-01

Pos logit contributions
alore+0.815
rade+0.697
liga+0.683
aints+0.681
ints+0.671
Neg logit contributions
%%%%-0.738
IJ-0.723
Submit-0.706
NPR-0.658
 Hearing-0.656