TOP ACTIVATIONS
MAX = 11.732

***** When we watched this little gag with Erin and Jill
c py In Memory 4 66 284 4 64 406 Mem
It includes lots of great set photography , art works and
Far rington , D . P . ( 1997 ). The
Writ ings Related to Scripture ( Philadelphia : Jewish Publication Society
Charles B . Cha vel ( New York : Sh ilo
Se eker wrote : Cl arity Gaming will also
Dark ho arse wrote : Show nested quote +
c py In Memory 4 66 284 4 64 406 Mem
ho arse wrote : Show nested quote +
for Policy , Law , and Ethics at Silicon Valley
. x . 107 10 60 212 x . x .
a . Australian Journal of Zo ology 62 ( 5 )
pur ak ushi wrote : Happy it worked out
Hall ion , Richard P . Design ers and Test
too . In Daily provides valuable , local independent journalism in
I did give it a little thought . I started to
Red Tent : A Novel ( New York : St .
. x . 169 11 66 589 x . x .
OC R OP / h 342 / UST r ump 220

MIN ACTIVATIONS
MIN = -11.752

iz arro j à v u that rare st
become that day 's ze it geist . '
ex ­ claimed Marco Ru ­ b io on Fox News
, this j à v u isn t
ve made , we more or less know where we are
---- -+ -------------------------------- + ------- | | 1 | org .
other in a one - on - one , single -
(), half _ m oves : self . half _ m
the F . I . R .
Even more h u ­ mi ­ li ­ at
ain told a Phoenix ra ­ d io st a ­
Obama is the worst ne ­ go ­ ti ­ at
d | + -------------------- --+ | [ address of " pp
-------------------- -+ -------- -+ ------- | | org . cl ojure
arg | + -------------------- --+ | [ return address ] |
at the po ­ di ­ um and al ­ l
... | + -------------------- --+ + -------------------- --+ < -- Start
arg | + -------------------- --+ | [ return address ] |
camp during an 11 - on - 11 session . His
, once he enters the DC U , and to have

INTERVAL 5.861 - 11.732
CONTAINS 0.401%

K ada ver BB Profile Blog Joined June 2009 Germany 247
, the intersection of Y onge and E gl inton will
Men . T ER T IA P AR S
twitter . com / E q 4 R IL t 2
{ ( 55 s )} Ra 8 { ( 113 s

INTERVAL -0.010 - 5.861
CONTAINS 49.286%

each of Pittsburgh s top three lines just
being rigged to change Donald Trump votes to Hillary Clinton votes
became tainted and was eventually lost . He compared it to
This is part of academic freedom in
a difference in approaches to governing . Abrams consistently favored compromise

INTERVAL -5.881 - -0.010
CONTAINS 49.926%

328 positions in the public service to be filled by bilingual
that an early rate rise appears to be off the agenda
have Out wit , and his STOP click is pure chaos
also warns : Question marks over the rights of
Instead , he was five years away from a fateful date

INTERVAL -11.752 - -5.881
CONTAINS 0.388%

hitting a police officer several times . I van
sketch comedy and how - to guide for hosting . Ton
considered a good place to invest and the government bonds of
other and shaking with the longing of twenty years
bel itt ling African di at ribe on Lu ol Deng
Token*****
Token ID35625
Feature activation-1.17e-01

Pos logit contributions
nih+0.171
+0.166
Downloadha+0.165
ubi+0.159
+0.158
Neg logit contributions
OTE-0.214
imore-0.199
adelphia-0.192
Wars-0.190
 Rasmussen-0.188
Token When
Token ID1649
Feature activation-4.28e+00

Pos logit contributions
nih+0.073
Downloadha+0.071
+0.071
ubi+0.068
+0.068
Neg logit contributions
OTE-0.094
imore-0.087
adelphia-0.084
Wars-0.084
 Rasmussen-0.082
Token we
Token ID356
Feature activation+3.60e-01

Pos logit contributions
nih+2.634
+2.582
Downloadha+2.558
ubi+2.480
+2.475
Neg logit contributions
OTE-3.403
imore-3.169
adelphia-3.066
Wars-3.015
finger-2.974
Token watched
Token ID7342
Feature activation+3.49e+00

Pos logit contributions
OTE+0.241
imore+0.220
adelphia+0.209
Wars+0.208
 Rasmussen+0.204
Neg logit contributions
nih-0.272
Downloadha-0.266
-0.266
ubi-0.257
-0.256
Token this
Token ID428
Feature activation+3.36e+00

Pos logit contributions
OTE+2.651
imore+2.479
adelphia+2.360
Wars+2.344
 Rasmussen+2.324
Neg logit contributions
nih-2.293
Downloadha-2.248
-2.230
-2.127
ubi-2.124
Token little
Token ID1310
Feature activation+1.17e+01

Pos logit contributions
OTE+2.193
imore+2.005
adelphia+1.890
Wars+1.883
finger+1.863
Neg logit contributions
nih-2.573
Downloadha-2.543
-2.511
ubi-2.442
-2.433
Token gag
Token ID32261
Feature activation+3.50e+00

Pos logit contributions
OTE+8.006
imore+7.453
finger+7.359
 Rasmussen+7.130
adelphia+6.978
Neg logit contributions
Downloadha-7.889
-7.628
nih-7.547
ubi-7.409
ancial-7.363
Token with
Token ID351
Feature activation-1.44e+00

Pos logit contributions
OTE+2.094
imore+1.897
adelphia+1.794
Wars+1.789
 Rasmussen+1.758
Neg logit contributions
nih-2.868
Downloadha-2.819
-2.816
ubi-2.729
-2.718
Token Erin
Token ID28894
Feature activation+1.29e+00

Pos logit contributions
nih+0.980
+0.955
Downloadha+0.953
ubi+0.918
+0.914
Neg logit contributions
OTE-1.072
imore-0.988
adelphia-0.948
Wars-0.944
 Rasmussen-0.926
Token and
Token ID290
Feature activation+3.29e-01

Pos logit contributions
OTE+1.023
imore+0.948
adelphia+0.913
Wars+0.908
 Rasmussen+0.889
Neg logit contributions
nih-0.810
-0.788
Downloadha-0.784
ubi-0.754
-0.751
Token Jill
Token ID22772
Feature activation-5.64e-01

Pos logit contributions
OTE+0.317
imore+0.299
adelphia+0.289
Wars+0.289
 Rasmussen+0.285
Neg logit contributions
nih-0.151
-0.145
Downloadha-0.145
ubi-0.137
-0.136
Tokenc
Token ID66
Feature activation+3.28e+00

Pos logit contributions
OTE+4.798
imore+4.558
Wars+4.381
adelphia+4.374
finger+4.325
Neg logit contributions
nih-2.617
-2.614
Downloadha-2.610
-2.460
-2.421
Tokenpy
Token ID9078
Feature activation+1.20e+00

Pos logit contributions
OTE+1.899
imore+1.720
adelphia+1.620
Wars+1.610
finger+1.577
Neg logit contributions
nih-2.757
Downloadha-2.717
-2.703
-2.619
ubi-2.605
TokenIn
Token ID818
Feature activation-1.72e+00

Pos logit contributions
OTE+1.012
imore+0.940
Wars+0.912
adelphia+0.908
finger+0.881
Neg logit contributions
nih-0.698
Downloadha-0.677
-0.671
ubi-0.643
-0.637
TokenMemory
Token ID30871
Feature activation+2.86e+00

Pos logit contributions
nih+1.314
+1.291
Downloadha+1.262
Fit+1.247
ubi+1.233
Neg logit contributions
OTE-1.128
imore-1.045
adelphia-1.001
 Canary-0.980
 Rasmussen-0.967
Token 4
Token ID604
Feature activation+3.84e+00

Pos logit contributions
OTE+1.758
imore+1.610
adelphia+1.528
Wars+1.504
 Canary+1.496
Neg logit contributions
nih-2.272
Downloadha-2.269
-2.241
-2.161
ubi-2.140
Token66
Token ID2791
Feature activation+1.12e+01

Pos logit contributions
OTE+2.079
imore+1.824
Wars+1.733
adelphia+1.732
 Rasmussen+1.697
Neg logit contributions
nih-3.360
Downloadha-3.329
-3.279
ubi-3.197
-3.173
Token284
Token ID30336
Feature activation-2.35e+00

Pos logit contributions
OTE+4.952
imore+4.499
finger+4.304
adelphia+4.260
Wars+4.096
Neg logit contributions
nih-10.254
-9.896
Downloadha-9.812
ancial-9.523
-9.516
Token 4
Token ID604
Feature activation-3.71e+00

Pos logit contributions
nih+1.615
+1.584
ubi+1.532
Downloadha+1.517
Fit+1.516
Neg logit contributions
OTE-1.714
imore-1.580
adelphia-1.512
Wars-1.504
finger-1.461
Token64
Token ID2414
Feature activation+2.96e+00

Pos logit contributions
nih+2.573
+2.520
ubi+2.477
Fit+2.466
Downloadha+2.436
Neg logit contributions
OTE-2.580
imore-2.453
adelphia-2.361
 Canary-2.302
Wars-2.295
Token406
Token ID29703
Feature activation-2.07e-02

Pos logit contributions
OTE+1.913
imore+1.732
adelphia+1.648
Wars+1.644
finger+1.614
Neg logit contributions
nih-2.286
-2.250
Downloadha-2.245
Fit-2.159
ubi-2.157
Token Mem
Token ID4942
Feature activation+1.51e+00

Pos logit contributions
nih+0.020
+0.019
Downloadha+0.019
ubi+0.019
Fit+0.019
Neg logit contributions
OTE-0.010
imore-0.008
adelphia-0.008
Wars-0.008
finger-0.007
Token It
Token ID632
Feature activation+1.43e+00

Pos logit contributions
nih+0.684
+0.664
Downloadha+0.662
ubi+0.634
+0.631
Neg logit contributions
OTE-0.977
imore-0.910
adelphia-0.877
Wars-0.874
 Rasmussen-0.859
Token includes
Token ID3407
Feature activation+6.52e-01

Pos logit contributions
OTE+0.848
imore+0.771
Wars+0.728
adelphia+0.722
 Rasmussen+0.706
Neg logit contributions
nih-1.177
Downloadha-1.159
-1.157
ubi-1.119
-1.116
Token lots
Token ID6041
Feature activation-5.90e-02

Pos logit contributions
OTE+0.526
imore+0.490
adelphia+0.471
Wars+0.471
 Rasmussen+0.460
Neg logit contributions
nih-0.402
-0.391
Downloadha-0.391
ubi-0.374
-0.373
Token of
Token ID286
Feature activation-6.08e-01

Pos logit contributions
nih+0.037
+0.036
Downloadha+0.035
ubi+0.035
+0.034
Neg logit contributions
OTE-0.047
imore-0.044
adelphia-0.042
Wars-0.042
 Canary-0.041
Token great
Token ID1049
Feature activation-1.90e-01

Pos logit contributions
nih+0.283
+0.274
Downloadha+0.265
ubi+0.258
Fit+0.257
Neg logit contributions
OTE-0.583
imore-0.544
adelphia-0.534
Wars-0.527
 Rasmussen-0.522
Token set
Token ID900
Feature activation+1.11e+01

Pos logit contributions
nih+0.120
+0.117
Downloadha+0.114
ubi+0.112
Fit+0.112
Neg logit contributions
OTE-0.150
imore-0.139
adelphia-0.136
Wars-0.134
 Rasmussen-0.131
Token photography
Token ID16223
Feature activation+1.31e+00

Pos logit contributions
OTE+4.332
imore+3.690
 Rasmussen+3.454
finger+3.397
Wars+3.314
Neg logit contributions
Downloadha-10.794
-10.665
nih-10.526
ubi-10.402
-10.357
Token,
Token ID11
Feature activation+4.71e+00

Pos logit contributions
OTE+1.100
imore+1.024
adelphia+0.988
Wars+0.981
 Rasmussen+0.964
Neg logit contributions
nih-0.763
-0.741
Downloadha-0.735
ubi-0.706
-0.702
Token art
Token ID1242
Feature activation+3.95e+00

Pos logit contributions
OTE+2.749
imore+2.483
adelphia+2.322
Wars+2.309
finger+2.273
Neg logit contributions
nih-3.933
Downloadha-3.897
-3.854
ubi-3.738
-3.732
Tokenworks
Token ID5225
Feature activation+6.14e-01

Pos logit contributions
OTE+2.865
imore+2.695
Wars+2.617
adelphia+2.559
finger+2.531
Neg logit contributions
nih-2.654
Downloadha-2.627
-2.614
-2.510
Fit-2.500
Token and
Token ID290
Feature activation+2.00e-01

Pos logit contributions
OTE+0.492
imore+0.456
adelphia+0.440
Wars+0.437
 Rasmussen+0.428
Neg logit contributions
nih-0.384
-0.373
Downloadha-0.369
ubi-0.357
-0.355
Token Far
Token ID6755
Feature activation-9.37e-01

Pos logit contributions
OTE+2.385
imore+2.187
 Rasmussen+2.136
Wars+2.110
 Canary+2.096
Neg logit contributions
nih-2.229
Downloadha-2.216
-2.178
ubi-2.091
-2.080
Tokenrington
Token ID24833
Feature activation+2.01e+00

Pos logit contributions
nih+0.606
+0.588
Downloadha+0.586
ubi+0.567
+0.561
Neg logit contributions
OTE-0.729
imore-0.672
adelphia-0.647
Wars-0.646
 Rasmussen-0.633
Token,
Token ID11
Feature activation+4.35e+00

Pos logit contributions
OTE+1.578
imore+1.484
adelphia+1.412
Wars+1.404
 Rasmussen+1.404
Neg logit contributions
nih-1.251
-1.236
Downloadha-1.229
ubi-1.174
-1.165
Token D
Token ID360
Feature activation+4.40e+00

Pos logit contributions
OTE+3.275
imore+2.991
 Rasmussen+2.962
 Canary+2.903
Wars+2.899
Neg logit contributions
Downloadha-2.904
nih-2.871
-2.784
ubi-2.711
ancial-2.678
Token.
Token ID13
Feature activation+6.57e+00

Pos logit contributions
OTE+3.286
imore+3.069
adelphia+2.929
Wars+2.890
finger+2.835
Neg logit contributions
nih-2.925
Downloadha-2.911
-2.870
-2.729
ubi-2.705
Token P
Token ID350
Feature activation+1.07e+01

Pos logit contributions
OTE+4.657
 Rasmussen+4.211
imore+4.146
 Canary+4.105
Wars+4.066
Neg logit contributions
Downloadha-4.673
nih-4.567
-4.468
ubi-4.329
ancial-4.282
Token.
Token ID13
Feature activation+5.67e+00

Pos logit contributions
OTE+6.580
imore+6.100
adelphia+5.757
Wars+5.712
finger+5.546
Neg logit contributions
Downloadha-7.864
-7.819
nih-7.731
-7.397
Fit-7.246
Token (
Token ID357
Feature activation+6.43e+00

Pos logit contributions
OTE+3.663
imore+3.229
 Rasmussen+3.184
Wars+3.133
 Canary+3.111
Neg logit contributions
Downloadha-4.399
nih-4.328
-4.299
ubi-4.134
-4.087
Token 1997
Token ID8309
Feature activation+5.75e+00

Pos logit contributions
OTE+4.038
imore+3.613
Wars+3.527
 Rasmussen+3.490
 Canary+3.433
Neg logit contributions
Downloadha-4.995
nih-4.895
-4.871
-4.680
ubi-4.678
Token ).
Token ID6739
Feature activation+5.55e-01

Pos logit contributions
OTE+3.100
imore+2.728
Wars+2.677
adelphia+2.602
 Rasmussen+2.586
Neg logit contributions
nih-4.855
Downloadha-4.816
-4.803
ubi-4.675
-4.626
Token The
Token ID383
Feature activation+4.17e+00

Pos logit contributions
OTE+0.431
imore+0.398
adelphia+0.383
Wars+0.382
 Rasmussen+0.377
Neg logit contributions
nih-0.360
Downloadha-0.351
-0.349
ubi-0.336
-0.334
Token Writ
Token ID12257
Feature activation+1.40e+00

Pos logit contributions
OTE+2.486
imore+2.303
Wars+2.236
adelphia+2.207
 Canary+2.198
Neg logit contributions
Downloadha-2.068
nih-2.061
-2.017
ubi-1.916
-1.911
Tokenings
Token ID654
Feature activation+5.40e+00

Pos logit contributions
OTE+1.182
imore+1.107
adelphia+1.070
Wars+1.063
finger+1.046
Neg logit contributions
nih-0.807
Downloadha-0.789
-0.788
-0.744
ubi-0.742
Token Related
Token ID19809
Feature activation+5.04e+00

Pos logit contributions
OTE+3.868
imore+3.557
Wars+3.440
adelphia+3.404
 Canary+3.389
Neg logit contributions
nih-3.717
Downloadha-3.641
-3.614
ubi-3.496
-3.431
Token to
Token ID284
Feature activation+3.10e+00

Pos logit contributions
OTE+3.306
imore+3.071
Wars+3.033
adelphia+2.911
 Canary+2.903
Neg logit contributions
nih-3.616
-3.582
Downloadha-3.554
ubi-3.419
-3.404
Token Scripture
Token ID25595
Feature activation+1.90e+00

Pos logit contributions
OTE+2.171
imore+2.018
Wars+1.941
adelphia+1.911
 Canary+1.900
Neg logit contributions
nih-2.195
Downloadha-2.173
-2.165
ubi-2.068
-2.065
Token (
Token ID357
Feature activation+1.06e+01

Pos logit contributions
OTE+1.409
imore+1.298
Wars+1.251
adelphia+1.249
 Rasmussen+1.221
Neg logit contributions
nih-1.283
Downloadha-1.258
-1.255
ubi-1.202
-1.196
TokenPhiladelphia
Token ID42349
Feature activation+4.38e+00

Pos logit contributions
OTE+6.658
Wars+6.377
imore+6.116
adelphia+5.850
finger+5.774
Neg logit contributions
Downloadha-7.674
nih-7.591
-7.383
ubi-7.359
ancial-7.243
Token:
Token ID25
Feature activation+4.49e+00

Pos logit contributions
OTE+3.051
imore+2.840
adelphia+2.731
Wars+2.724
 Canary+2.652
Neg logit contributions
Downloadha-3.109
nih-3.075
-3.055
ubi-2.972
Fit-2.945
Token Jewish
Token ID5582
Feature activation-3.44e-01

Pos logit contributions
OTE+3.559
imore+3.265
Wars+3.231
adelphia+3.210
 Rasmussen+3.201
Neg logit contributions
Downloadha-2.805
nih-2.780
-2.719
ubi-2.630
-2.583
Token Publication
Token ID45065
Feature activation-9.93e-01

Pos logit contributions
nih+0.226
Downloadha+0.220
+0.220
ubi+0.211
+0.210
Neg logit contributions
OTE-0.264
imore-0.244
adelphia-0.235
Wars-0.234
 Rasmussen-0.229
Token Society
Token ID7023
Feature activation+4.79e+00

Pos logit contributions
nih+0.650
+0.633
Downloadha+0.632
ubi+0.608
+0.605
Neg logit contributions
OTE-0.764
imore-0.707
adelphia-0.679
Wars-0.676
 Rasmussen-0.664
Token Charles
Token ID7516
Feature activation+3.59e+00

Pos logit contributions
OTE+3.426
imore+3.134
Wars+3.070
 Rasmussen+3.049
 Canary+3.022
Neg logit contributions
nih-3.001
Downloadha-2.944
-2.913
ubi-2.843
Fit-2.770
Token B
Token ID347
Feature activation+3.77e+00

Pos logit contributions
OTE+2.674
imore+2.477
 Rasmussen+2.409
Wars+2.378
adelphia+2.375
Neg logit contributions
nih-2.394
Downloadha-2.383
-2.328
ubi-2.242
-2.214
Token.
Token ID13
Feature activation+3.33e+00

Pos logit contributions
OTE+2.983
imore+2.812
adelphia+2.666
 Rasmussen+2.660
Wars+2.648
Neg logit contributions
Downloadha-2.348
-2.342
nih-2.327
-2.214
Fit-2.177
Token Cha
Token ID20703
Feature activation-3.59e-01

Pos logit contributions
OTE+2.460
imore+2.248
 Rasmussen+2.197
Wars+2.184
adelphia+2.168
Neg logit contributions
nih-2.258
Downloadha-2.249
-2.201
ubi-2.135
-2.106
Tokenvel
Token ID626
Feature activation+3.14e+00

Pos logit contributions
nih+0.256
+0.249
Downloadha+0.249
ubi+0.240
+0.239
Neg logit contributions
OTE-0.255
imore-0.234
adelphia-0.224
Wars-0.223
 Rasmussen-0.219
Token (
Token ID357
Feature activation+1.06e+01

Pos logit contributions
OTE+2.361
imore+2.230
adelphia+2.116
Wars+2.095
 Rasmussen+2.078
Neg logit contributions
nih-2.063
Downloadha-2.056
-2.046
-1.942
ubi-1.926
TokenNew
Token ID3791
Feature activation+2.03e+00

Pos logit contributions
OTE+6.822
Wars+6.542
imore+6.329
adelphia+6.053
finger+6.013
Neg logit contributions
Downloadha-7.375
nih-7.274
-7.185
ubi-7.094
ancial-7.084
Token York
Token ID1971
Feature activation+1.49e+00

Pos logit contributions
OTE+1.576
imore+1.484
adelphia+1.430
Wars+1.423
 Canary+1.402
Neg logit contributions
nih-1.295
Downloadha-1.277
-1.266
ubi-1.220
-1.200
Token:
Token ID25
Feature activation+6.19e+00

Pos logit contributions
OTE+1.070
imore+0.994
adelphia+0.961
Wars+0.957
 Rasmussen+0.936
Neg logit contributions
nih-1.035
Downloadha-1.033
-1.017
ubi-0.990
Fit-0.976
Token Sh
Token ID911
Feature activation+1.73e-01

Pos logit contributions
OTE+4.588
Wars+4.186
 Rasmussen+4.184
 Canary+4.157
imore+4.154
Neg logit contributions
Downloadha-4.224
nih-4.104
-3.924
ubi-3.905
-3.716
Tokenilo
Token ID18526
Feature activation+1.17e+00

Pos logit contributions
OTE+0.148
imore+0.138
adelphia+0.133
Wars+0.132
 Rasmussen+0.130
Neg logit contributions
nih-0.098
Downloadha-0.096
-0.096
ubi-0.091
-0.091
Token Se
Token ID1001
Feature activation-1.82e+00

Pos logit contributions
OTE+0.886
imore+0.821
adelphia+0.783
Wars+0.778
 Rasmussen+0.770
Neg logit contributions
nih-0.833
Downloadha-0.820
-0.811
-0.784
Fit-0.781
Tokeneker
Token ID28233
Feature activation+5.85e+00

Pos logit contributions
nih+1.144
+1.112
Downloadha+1.108
ubi+1.066
+1.059
Neg logit contributions
OTE-1.450
imore-1.344
adelphia-1.294
Wars-1.288
 Rasmussen-1.266
Token wrote
Token ID2630
Feature activation+2.71e+00

Pos logit contributions
OTE+4.257
imore+3.827
Wars+3.804
adelphia+3.652
finger+3.612
Neg logit contributions
nih-4.012
Downloadha-3.964
-3.826
ubi-3.743
-3.732
Token:
Token ID25
Feature activation+8.52e+00

Pos logit contributions
OTE+1.986
imore+1.826
Wars+1.755
adelphia+1.753
 Rasmussen+1.724
Neg logit contributions
nih-1.855
Downloadha-1.829
-1.805
ubi-1.742
-1.736
Token
Token ID198
Feature activation+9.69e+00

Pos logit contributions
OTE+5.674
imore+5.135
Wars+5.061
finger+4.985
 Rasmussen+4.979
Neg logit contributions
Downloadha-6.110
nih-5.928
 tremend-5.886
ancial-5.739
-5.728
Token
Token ID198
Feature activation+1.05e+01

Pos logit contributions
OTE+6.043
imore+5.414
Wars+5.414
finger+5.338
 Rasmussen+5.304
Neg logit contributions
 tremend-7.577
Downloadha-7.359
 tiss-7.277
nih-7.035
ancial-6.977
TokenCl
Token ID2601
Feature activation-3.91e+00

Pos logit contributions
OTE+6.674
imore+5.965
Wars+5.912
finger+5.717
adelphia+5.523
Neg logit contributions
Downloadha-7.366
nih-7.355
osate-7.316
ancial-7.258
 Indra-7.190
Tokenarity
Token ID6806
Feature activation+2.17e+00

Pos logit contributions
nih+2.375
+2.279
ubi+2.250
Downloadha+2.246
Fit+2.192
Neg logit contributions
OTE-3.153
imore-2.901
Wars-2.803
adelphia-2.800
 Rasmussen-2.779
Token Gaming
Token ID14426
Feature activation+2.45e+00

Pos logit contributions
OTE+1.659
imore+1.524
Wars+1.470
adelphia+1.468
 Rasmussen+1.438
Neg logit contributions
nih-1.433
Downloadha-1.407
-1.383
-1.338
ubi-1.335
Token will
Token ID481
Feature activation+5.45e-01

Pos logit contributions
OTE+1.838
imore+1.695
Wars+1.634
adelphia+1.624
 Rasmussen+1.602
Neg logit contributions
nih-1.641
Downloadha-1.616
-1.589
ubi-1.526
-1.526
Token also
Token ID635
Feature activation-1.22e-01

Pos logit contributions
OTE+0.383
imore+0.350
adelphia+0.335
Wars+0.333
 Rasmussen+0.328
Neg logit contributions
nih-0.394
Downloadha-0.386
-0.384
ubi-0.369
-0.369
Token Dark
Token ID3801
Feature activation+2.09e+00

Pos logit contributions
nih+0.763
+0.744
Downloadha+0.738
ubi+0.715
+0.708
Neg logit contributions
OTE-0.864
imore-0.796
adelphia-0.765
Wars-0.764
 Rasmussen-0.745
Tokenho
Token ID8873
Feature activation+1.03e+00

Pos logit contributions
OTE+1.229
imore+1.117
adelphia+1.054
Wars+1.049
finger+1.028
Neg logit contributions
nih-1.731
Downloadha-1.708
-1.698
ubi-1.653
-1.650
Tokenarse
Token ID17208
Feature activation-2.89e+00

Pos logit contributions
OTE+0.524
imore+0.462
Wars+0.435
adelphia+0.434
finger+0.418
Neg logit contributions
nih-0.938
Downloadha-0.921
-0.920
ubi-0.894
-0.892
Token wrote
Token ID2630
Feature activation+2.91e+00

Pos logit contributions
nih+1.863
+1.827
Downloadha+1.806
ubi+1.748
+1.731
Neg logit contributions
OTE-2.235
imore-2.072
adelphia-1.998
Wars-1.975
 Rasmussen-1.954
Token:
Token ID25
Feature activation+8.15e+00

Pos logit contributions
OTE+2.112
imore+1.936
Wars+1.866
adelphia+1.861
 Rasmussen+1.832
Neg logit contributions
nih-2.019
Downloadha-2.000
-1.963
ubi-1.898
-1.893
Token
Token ID198
Feature activation+1.04e+01

Pos logit contributions
OTE+5.416
imore+4.903
Wars+4.837
finger+4.793
 Rasmussen+4.754
Neg logit contributions
Downloadha-5.893
 tremend-5.797
nih-5.701
ancial-5.536
osate-5.525
Token
Token ID198
Feature activation+1.03e+01

Pos logit contributions
OTE+6.294
Wars+5.628
imore+5.598
 Rasmussen+5.541
finger+5.530
Neg logit contributions
 tremend-8.541
 tiss-8.176
Downloadha-8.112
 exting-7.838
ailability-7.748
TokenShow
Token ID15307
Feature activation-2.65e-01

Pos logit contributions
OTE+5.893
imore+5.220
Wars+5.090
finger+4.974
adelphia+4.688
Neg logit contributions
Downloadha-7.971
nih-7.858
osate-7.817
-7.730
 Indra-7.694
Token nested
Token ID28376
Feature activation+5.56e+00

Pos logit contributions
nih+0.147
+0.146
Downloadha+0.142
ubi+0.135
+0.134
Neg logit contributions
OTE-0.230
imore-0.216
adelphia-0.208
Wars-0.207
 Rasmussen-0.203
Token quote
Token ID9577
Feature activation+2.75e-01

Pos logit contributions
OTE+4.337
imore+3.872
Wars+3.692
 Rasmussen+3.677
finger+3.659
Neg logit contributions
Downloadha-3.593
nih-3.516
Fit-3.387
ubi-3.361
-3.326
Token +
Token ID1343
Feature activation-3.26e-01

Pos logit contributions
OTE+0.158
imore+0.143
adelphia+0.135
Wars+0.134
finger+0.130
Neg logit contributions
nih-0.233
-0.227
Downloadha-0.225
ubi-0.221
-0.220
Tokenc
Token ID66
Feature activation+2.45e+00

Pos logit contributions
OTE+4.374
imore+4.151
Wars+3.988
adelphia+3.986
finger+3.936
Neg logit contributions
nih-2.320
-2.308
Downloadha-2.295
-2.169
ubi-2.119
Tokenpy
Token ID9078
Feature activation+1.02e+00

Pos logit contributions
OTE+1.457
imore+1.318
adelphia+1.247
Wars+1.240
finger+1.209
Neg logit contributions
nih-2.027
Downloadha-1.987
-1.986
-1.918
ubi-1.918
TokenIn
Token ID818
Feature activation-2.38e+00

Pos logit contributions
OTE+0.864
imore+0.806
Wars+0.778
adelphia+0.776
finger+0.754
Neg logit contributions
nih-0.590
Downloadha-0.570
-0.569
ubi-0.544
-0.541
TokenMemory
Token ID30871
Feature activation+2.33e+00

Pos logit contributions
nih+1.771
+1.745
Downloadha+1.697
Fit+1.691
ubi+1.663
Neg logit contributions
OTE-1.585
imore-1.479
adelphia-1.419
 Canary-1.396
 Rasmussen-1.375
Token 4
Token ID604
Feature activation+2.28e+00

Pos logit contributions
OTE+1.545
imore+1.409
adelphia+1.346
Wars+1.338
 Rasmussen+1.318
Neg logit contributions
nih-1.763
Downloadha-1.746
-1.728
-1.659
ubi-1.657
Token66
Token ID2791
Feature activation+1.03e+01

Pos logit contributions
OTE+1.231
imore+1.086
adelphia+1.028
Wars+1.022
finger+1.000
Neg logit contributions
nih-1.999
Downloadha-1.965
-1.955
ubi-1.907
Fit-1.894
Token284
Token ID30336
Feature activation-2.41e+00

Pos logit contributions
OTE+4.729
imore+4.306
adelphia+4.064
finger+4.055
Wars+3.868
Neg logit contributions
nih-9.281
-8.849
Downloadha-8.690
Fit-8.503
-8.447
Token 4
Token ID604
Feature activation-4.96e+00

Pos logit contributions
nih+1.671
+1.637
ubi+1.584
Downloadha+1.579
Fit+1.571
Neg logit contributions
OTE-1.748
imore-1.608
adelphia-1.543
Wars-1.533
finger-1.491
Token64
Token ID2414
Feature activation+2.25e+00

Pos logit contributions
nih+3.354
+3.291
ubi+3.235
Fit+3.218
+3.180
Neg logit contributions
OTE-3.481
imore-3.334
adelphia-3.208
 Canary-3.125
Wars-3.098
Token406
Token ID29703
Feature activation+2.72e-01

Pos logit contributions
OTE+1.476
imore+1.340
adelphia+1.278
Wars+1.274
finger+1.249
Neg logit contributions
nih-1.722
-1.691
Downloadha-1.687
ubi-1.625
Fit-1.623
Token Mem
Token ID4942
Feature activation+1.28e+00

Pos logit contributions
OTE+0.125
imore+0.110
adelphia+0.102
Wars+0.101
finger+0.096
Neg logit contributions
nih-0.262
-0.258
Downloadha-0.256
ubi-0.251
Fit-0.250
Tokenho
Token ID8873
Feature activation+1.03e+00

Pos logit contributions
OTE+1.229
imore+1.117
adelphia+1.054
Wars+1.049
finger+1.028
Neg logit contributions
nih-1.731
Downloadha-1.708
-1.698
ubi-1.653
-1.650
Tokenarse
Token ID17208
Feature activation-2.89e+00

Pos logit contributions
OTE+0.524
imore+0.462
Wars+0.435
adelphia+0.434
finger+0.418
Neg logit contributions
nih-0.938
Downloadha-0.921
-0.920
ubi-0.894
-0.892
Token wrote
Token ID2630
Feature activation+2.91e+00

Pos logit contributions
nih+1.863
+1.827
Downloadha+1.806
ubi+1.748
+1.731
Neg logit contributions
OTE-2.235
imore-2.072
adelphia-1.998
Wars-1.975
 Rasmussen-1.954
Token:
Token ID25
Feature activation+8.15e+00

Pos logit contributions
OTE+2.112
imore+1.936
Wars+1.866
adelphia+1.861
 Rasmussen+1.832
Neg logit contributions
nih-2.019
Downloadha-2.000
-1.963
ubi-1.898
-1.893
Token
Token ID198
Feature activation+1.04e+01

Pos logit contributions
OTE+5.416
imore+4.903
Wars+4.837
finger+4.793
 Rasmussen+4.754
Neg logit contributions
Downloadha-5.893
 tremend-5.797
nih-5.701
ancial-5.536
osate-5.525
Token
Token ID198
Feature activation+1.03e+01

Pos logit contributions
OTE+6.294
Wars+5.628
imore+5.598
 Rasmussen+5.541
finger+5.530
Neg logit contributions
 tremend-8.541
 tiss-8.176
Downloadha-8.112
 exting-7.838
ailability-7.748
TokenShow
Token ID15307
Feature activation-2.65e-01

Pos logit contributions
OTE+5.893
imore+5.220
Wars+5.090
finger+4.974
adelphia+4.688
Neg logit contributions
Downloadha-7.971
nih-7.858
osate-7.817
-7.730
 Indra-7.694
Token nested
Token ID28376
Feature activation+5.56e+00

Pos logit contributions
nih+0.147
+0.146
Downloadha+0.142
ubi+0.135
+0.134
Neg logit contributions
OTE-0.230
imore-0.216
adelphia-0.208
Wars-0.207
 Rasmussen-0.203
Token quote
Token ID9577
Feature activation+2.75e-01

Pos logit contributions
OTE+4.337
imore+3.872
Wars+3.692
 Rasmussen+3.677
finger+3.659
Neg logit contributions
Downloadha-3.593
nih-3.516
Fit-3.387
ubi-3.361
-3.326
Token +
Token ID1343
Feature activation-3.26e-01

Pos logit contributions
OTE+0.158
imore+0.143
adelphia+0.135
Wars+0.134
finger+0.130
Neg logit contributions
nih-0.233
-0.227
Downloadha-0.225
ubi-0.221
-0.220
Token
Token ID198
Feature activation+6.37e-01

Pos logit contributions
nih+0.187
+0.185
Downloadha+0.180
+0.174
ubi+0.173
Neg logit contributions
OTE-0.276
imore-0.260
adelphia-0.250
Wars-0.246
finger-0.242
Token for
Token ID329
Feature activation-1.29e+00

Pos logit contributions
nih+2.571
+2.499
Downloadha+2.474
+2.411
Fit+2.388
Neg logit contributions
OTE-3.252
imore-3.029
adelphia-2.897
Wars-2.877
finger-2.822
Token Policy
Token ID7820
Feature activation-2.45e-01

Pos logit contributions
nih+0.873
+0.849
Downloadha+0.846
ubi+0.817
+0.814
Neg logit contributions
OTE-0.968
imore-0.892
adelphia-0.859
Wars-0.852
finger-0.834
Token,
Token ID11
Feature activation+1.86e+00

Pos logit contributions
nih+0.163
+0.158
Downloadha+0.158
ubi+0.152
+0.151
Neg logit contributions
OTE-0.186
imore-0.172
adelphia-0.165
Wars-0.165
 Rasmussen-0.161
Token Law
Token ID3854
Feature activation-2.25e+00

Pos logit contributions
OTE+1.358
imore+1.247
Wars+1.196
adelphia+1.192
 Rasmussen+1.181
Neg logit contributions
nih-1.292
Downloadha-1.266
-1.264
ubi-1.213
-1.203
Token,
Token ID11
Feature activation+6.86e+00

Pos logit contributions
nih+1.410
+1.366
Downloadha+1.353
ubi+1.318
+1.313
Neg logit contributions
OTE-1.784
imore-1.649
adelphia-1.595
Wars-1.573
 Rasmussen-1.547
Token and
Token ID290
Feature activation+1.02e+01

Pos logit contributions
OTE+4.638
imore+4.238
 Rasmussen+4.150
Wars+4.149
 Canary+4.118
Neg logit contributions
nih-4.926
Downloadha-4.919
-4.851
ubi-4.608
-4.533
Token Ethics
Token ID25092
Feature activation+3.79e-01

Pos logit contributions
OTE+6.350
 Rasmussen+5.935
 Canary+5.865
Wars+5.822
imore+5.798
Neg logit contributions
Downloadha-7.431
nih-7.324
-7.303
ubi-6.903
-6.700
Token at
Token ID379
Feature activation-9.46e-01

Pos logit contributions
OTE+0.301
imore+0.279
Wars+0.268
adelphia+0.268
 Rasmussen+0.263
Neg logit contributions
nih-0.239
Downloadha-0.233
-0.232
ubi-0.222
-0.220
Token Silicon
Token ID18210
Feature activation-1.48e+00

Pos logit contributions
nih+0.630
+0.612
Downloadha+0.608
ubi+0.587
Fit+0.585
Neg logit contributions
OTE-0.717
imore-0.660
adelphia-0.638
Wars-0.631
finger-0.618
Token Valley
Token ID6916
Feature activation-2.61e+00

Pos logit contributions
nih+0.855
+0.829
Downloadha+0.828
ubi+0.791
+0.783
Neg logit contributions
OTE-1.258
imore-1.174
adelphia-1.134
Wars-1.131
 Rasmussen-1.111
Token
Token ID447
Feature activation+5.37e-01

Pos logit contributions
nih+1.393
+1.352
Downloadha+1.340
+1.290
Fit+1.281
Neg logit contributions
OTE-2.310
imore-2.154
adelphia-2.087
Wars-2.069
 Rasmussen-2.034
Token.
Token ID13
Feature activation-4.26e+00

Pos logit contributions
nih+0.942
+0.906
ubi+0.864
Fit+0.860
Downloadha+0.858
Neg logit contributions
OTE-1.551
imore-1.453
adelphia-1.400
Wars-1.386
 Canary-1.376
Tokenx
Token ID87
Feature activation-1.04e+00

Pos logit contributions
nih+1.925
+1.803
ubi+1.734
Fit+1.711
Downloadha+1.686
Neg logit contributions
OTE-4.182
imore-3.901
 Canary-3.806
adelphia-3.787
 Rasmussen-3.747
Token.
Token ID13
Feature activation-1.85e+00

Pos logit contributions
nih+0.579
+0.559
Downloadha+0.539
ubi+0.532
+0.527
Neg logit contributions
OTE-0.905
imore-0.844
adelphia-0.815
Wars-0.808
 Canary-0.801
Token107
Token ID15982
Feature activation+1.56e+00

Pos logit contributions
nih+0.945
Downloadha+0.914
+0.913
ubi+0.868
+0.861
Neg logit contributions
OTE-1.682
imore-1.576
adelphia-1.523
Wars-1.520
 Rasmussen-1.495
Token 10
Token ID838
Feature activation+2.23e+00

Pos logit contributions
OTE+1.127
imore+1.024
adelphia+0.988
Wars+0.986
 Rasmussen+0.973
Neg logit contributions
nih-1.084
Downloadha-1.075
-1.067
ubi-1.025
-1.020
Token60
Token ID1899
Feature activation+1.02e+01

Pos logit contributions
OTE+1.470
imore+1.311
Wars+1.271
adelphia+1.269
finger+1.242
Neg logit contributions
nih-1.716
Downloadha-1.682
-1.660
ubi-1.612
-1.593
Token212
Token ID21777
Feature activation-2.16e+00

Pos logit contributions
OTE+6.373
adelphia+5.470
imore+5.432
finger+5.421
Wars+5.379
Neg logit contributions
nih-7.566
Downloadha-7.296
-7.040
ancial-6.888
ubi-6.870
Token x
Token ID2124
Feature activation+1.38e-01

Pos logit contributions
nih+0.970
+0.944
Downloadha+0.898
ubi+0.883
Fit+0.879
Neg logit contributions
OTE-2.098
imore-1.980
adelphia-1.916
Wars-1.906
finger-1.859
Token.
Token ID13
Feature activation-4.56e+00

Pos logit contributions
OTE+0.117
imore+0.109
adelphia+0.105
Wars+0.105
 Canary+0.103
Neg logit contributions
nih-0.079
-0.076
Downloadha-0.075
ubi-0.073
Fit-0.072
Tokenx
Token ID87
Feature activation-1.28e+00

Pos logit contributions
nih+2.014
+1.906
ubi+1.830
Fit+1.815
Downloadha+1.773
Neg logit contributions
OTE-4.484
imore-4.197
 Canary-4.095
adelphia-4.055
Wars-4.027
Token.
Token ID13
Feature activation-3.79e+00

Pos logit contributions
nih+0.705
+0.680
Downloadha+0.654
ubi+0.649
Fit+0.643
Neg logit contributions
OTE-1.119
imore-1.044
adelphia-1.006
Wars-0.998
 Canary-0.990
Tokena
Token ID64
Feature activation+3.20e+00

Pos logit contributions
OTE+0.194
imore+0.179
adelphia+0.171
Wars+0.171
 Rasmussen+0.167
Neg logit contributions
nih-0.189
-0.185
Downloadha-0.184
ubi-0.178
-0.177
Token.
Token ID13
Feature activation+1.04e+00

Pos logit contributions
OTE+2.461
imore+2.244
Wars+2.171
adelphia+2.157
 Canary+2.133
Neg logit contributions
nih-2.094
Downloadha-2.042
-2.034
Fit-1.954
ubi-1.944
Token Australian
Token ID6638
Feature activation+4.07e+00

Pos logit contributions
OTE+0.750
imore+0.688
adelphia+0.660
Wars+0.657
 Rasmussen+0.645
Neg logit contributions
nih-0.734
-0.716
Downloadha-0.715
ubi-0.689
-0.686
Token Journal
Token ID4913
Feature activation+6.19e+00

Pos logit contributions
OTE+2.616
imore+2.318
Wars+2.254
adelphia+2.243
 Canary+2.233
Neg logit contributions
nih-3.161
Downloadha-3.117
-3.073
ubi-2.995
-2.983
Token of
Token ID286
Feature activation+5.65e+00

Pos logit contributions
OTE+4.155
imore+3.664
Wars+3.658
adelphia+3.574
 Canary+3.542
Neg logit contributions
Downloadha-4.545
nih-4.512
-4.493
ubi-4.282
Fit-4.272
Token Zo
Token ID31645
Feature activation+1.01e+01

Pos logit contributions
OTE+4.116
imore+3.653
Wars+3.642
 Canary+3.628
adelphia+3.570
Neg logit contributions
Downloadha-3.861
nih-3.857
-3.780
-3.622
ubi-3.621
Tokenology
Token ID1435
Feature activation+6.44e+00

Pos logit contributions
OTE+6.788
imore+6.334
adelphia+6.144
Wars+5.981
finger+5.585
Neg logit contributions
-7.359
Downloadha-7.249
nih-6.968
-6.856
-6.849
Token 62
Token ID8190
Feature activation+6.75e+00

Pos logit contributions
OTE+4.863
imore+4.421
Wars+4.331
adelphia+4.232
 Canary+4.204
Neg logit contributions
nih-4.213
Downloadha-4.180
-4.130
-3.881
ubi-3.867
Token (
Token ID357
Feature activation+6.22e+00

Pos logit contributions
OTE+4.293
imore+3.834
Wars+3.799
 Canary+3.661
adelphia+3.655
Neg logit contributions
Downloadha-5.183
nih-5.131
-4.992
ubi-4.869
-4.831
Token5
Token ID20
Feature activation+1.14e+00

Pos logit contributions
OTE+3.846
imore+3.448
Wars+3.410
finger+3.259
 Rasmussen+3.228
Neg logit contributions
nih-4.779
Downloadha-4.730
-4.687
ubi-4.559
ancial-4.517
Token)
Token ID8
Feature activation+3.29e+00

Pos logit contributions
OTE+0.693
imore+0.624
Wars+0.594
adelphia+0.593
 Rasmussen+0.580
Neg logit contributions
nih-0.929
-0.908
Downloadha-0.908
ubi-0.876
Fit-0.875
Token pur
Token ID1308
Feature activation+4.57e+00

Pos logit contributions
OTE+0.381
imore+0.352
adelphia+0.337
Wars+0.336
 Rasmussen+0.331
Neg logit contributions
nih-0.338
Downloadha-0.330
-0.330
ubi-0.316
-0.316
Tokenak
Token ID461
Feature activation+2.89e+00

Pos logit contributions
OTE+4.010
imore+3.737
adelphia+3.612
Wars+3.571
finger+3.556
Neg logit contributions
Downloadha-2.427
-2.399
nih-2.382
-2.280
ancial-2.232
Tokenushi
Token ID17731
Feature activation+6.32e-01

Pos logit contributions
OTE+2.557
imore+2.401
adelphia+2.286
Wars+2.270
finger+2.263
Neg logit contributions
nih-1.523
-1.517
Downloadha-1.512
Fit-1.428
-1.428
Token wrote
Token ID2630
Feature activation+1.84e+00

Pos logit contributions
OTE+0.461
imore+0.421
Wars+0.406
adelphia+0.401
 Rasmussen+0.393
Neg logit contributions
nih-0.440
Downloadha-0.432
-0.424
ubi-0.412
Fit-0.412
Token:
Token ID25
Feature activation+8.76e+00

Pos logit contributions
OTE+1.378
imore+1.270
adelphia+1.219
Wars+1.216
 Rasmussen+1.195
Neg logit contributions
nih-1.242
Downloadha-1.214
-1.210
ubi-1.164
-1.160
Token
Token ID198
Feature activation+1.01e+01

Pos logit contributions
OTE+5.817
imore+5.220
Wars+5.145
 Rasmussen+5.056
finger+5.031
Neg logit contributions
Downloadha-6.333
nih-6.177
 tremend-6.077
ancial-5.960
-5.930
Token
Token ID198
Feature activation+9.64e+00

Pos logit contributions
OTE+6.357
imore+5.628
Wars+5.603
 Rasmussen+5.513
finger+5.485
Neg logit contributions
 tremend-7.912
Downloadha-7.642
 tiss-7.553
nih-7.389
osate-7.273
TokenHappy
Token ID25082
Feature activation+6.19e+00

Pos logit contributions
OTE+4.880
imore+4.137
Wars+4.039
finger+3.961
adelphia+3.788
Neg logit contributions
nih-8.390
Downloadha-8.208
osate-8.129
-8.002
ancial-8.000
Token it
Token ID340
Feature activation+5.41e+00

Pos logit contributions
OTE+4.177
imore+3.789
 Rasmussen+3.606
Wars+3.592
adelphia+3.582
Neg logit contributions
Downloadha-4.616
nih-4.524
-4.453
-4.343
ubi-4.320
Token worked
Token ID3111
Feature activation-2.67e+00

Pos logit contributions
OTE+2.838
imore+2.582
Wars+2.425
 Rasmussen+2.418
adelphia+2.339
Neg logit contributions
Downloadha-4.811
nih-4.661
-4.618
Fit-4.495
ubi-4.494
Token out
Token ID503
Feature activation-3.74e+00

Pos logit contributions
nih+1.116
+1.066
Downloadha+1.007
ubi+0.994
Fit+0.978
Neg logit contributions
OTE-2.683
imore-2.515
adelphia-2.471
Wars-2.450
finger-2.402
Token
Token ID198
Feature activation-6.51e-02

Pos logit contributions
OTE+0.018
imore+0.017
adelphia+0.016
Wars+0.016
 Canary+0.016
Neg logit contributions
nih-0.012
-0.012
Downloadha-0.012
ubi-0.011
Fit-0.011
TokenHall
Token ID34194
Feature activation+1.95e+00

Pos logit contributions
nih+0.043
+0.042
Downloadha+0.042
ubi+0.040
+0.040
Neg logit contributions
OTE-0.050
imore-0.046
adelphia-0.044
Wars-0.044
 Rasmussen-0.043
Tokenion
Token ID295
Feature activation+1.55e+00

Pos logit contributions
OTE+1.499
imore+1.396
Wars+1.334
adelphia+1.329
 Rasmussen+1.312
Neg logit contributions
nih-1.263
-1.252
Downloadha-1.248
ubi-1.188
-1.185
Token,
Token ID11
Feature activation+3.80e+00

Pos logit contributions
OTE+1.216
imore+1.127
Wars+1.084
adelphia+1.081
 Rasmussen+1.069
Neg logit contributions
nih-0.980
Downloadha-0.965
-0.961
ubi-0.918
-0.915
Token Richard
Token ID6219
Feature activation+2.46e+00

Pos logit contributions
OTE+2.839
imore+2.586
Wars+2.528
 Rasmussen+2.527
adelphia+2.507
Neg logit contributions
nih-2.542
Downloadha-2.519
-2.460
ubi-2.393
Fit-2.354
Token P
Token ID350
Feature activation+1.01e+01

Pos logit contributions
OTE+1.911
imore+1.766
Wars+1.708
adelphia+1.695
 Rasmussen+1.689
Neg logit contributions
nih-1.580
Downloadha-1.556
-1.541
ubi-1.480
-1.471
Token.
Token ID13
Feature activation+5.54e+00

Pos logit contributions
OTE+6.694
imore+6.195
Wars+6.054
adelphia+5.963
finger+5.809
Neg logit contributions
Downloadha-6.890
-6.851
nih-6.740
-6.471
ubi-6.453
Token Design
Token ID8495
Feature activation+4.85e+00

Pos logit contributions
OTE+4.140
Wars+3.755
imore+3.716
 Rasmussen+3.636
adelphia+3.630
Neg logit contributions
nih-3.611
Downloadha-3.605
-3.505
ubi-3.429
-3.345
Tokeners
Token ID364
Feature activation+5.20e+00

Pos logit contributions
OTE+3.701
imore+3.382
Wars+3.359
adelphia+3.327
 Canary+3.285
Neg logit contributions
Downloadha-3.086
nih-3.060
-2.995
ubi-2.920
-2.824
Token and
Token ID290
Feature activation+5.03e+00

Pos logit contributions
OTE+3.785
imore+3.440
Wars+3.374
adelphia+3.313
 Rasmussen+3.297
Neg logit contributions
nih-3.523
Downloadha-3.489
-3.429
ubi-3.300
-3.241
Token Test
Token ID6208
Feature activation+1.39e+00

Pos logit contributions
OTE+3.716
imore+3.415
Wars+3.367
 Canary+3.344
 Rasmussen+3.329
Neg logit contributions
nih-3.261
Downloadha-3.218
-3.164
ubi-3.090
-3.005
Token too
Token ID1165
Feature activation+3.38e+00

Pos logit contributions
nih+1.456
+1.391
ubi+1.348
Downloadha+1.341
Fit+1.327
Neg logit contributions
OTE-3.381
imore-3.118
adelphia-3.065
Wars-3.021
finger-2.993
Token.
Token ID13
Feature activation+3.08e+00

Pos logit contributions
OTE+2.656
imore+2.461
Wars+2.371
 Rasmussen+2.338
adelphia+2.336
Neg logit contributions
nih-2.127
Downloadha-2.120
-2.059
-1.996
Fit-1.979
Token In
Token ID554
Feature activation+1.02e-01

Pos logit contributions
OTE+2.438
imore+2.233
 Rasmussen+2.179
adelphia+2.173
Wars+2.167
Neg logit contributions
nih-1.938
Downloadha-1.918
-1.875
ubi-1.792
-1.776
TokenDaily
Token ID28545
Feature activation+1.87e-01

Pos logit contributions
OTE+0.077
imore+0.072
adelphia+0.069
Wars+0.068
finger+0.067
Neg logit contributions
nih-0.068
-0.067
Downloadha-0.066
ubi-0.064
-0.064
Token provides
Token ID3769
Feature activation+4.25e+00

Pos logit contributions
OTE+0.163
imore+0.152
adelphia+0.148
Wars+0.145
finger+0.143
Neg logit contributions
nih-0.103
Downloadha-0.100
-0.100
ubi-0.096
Fit-0.096
Token valuable
Token ID8119
Feature activation+9.96e+00

Pos logit contributions
OTE+2.497
imore+2.254
adelphia+2.099
 Canary+2.090
 Rasmussen+2.088
Neg logit contributions
nih-3.557
Downloadha-3.498
-3.493
ubi-3.373
-3.369
Token,
Token ID11
Feature activation+8.06e+00

Pos logit contributions
OTE+6.160
 Rasmussen+5.734
 Canary+5.529
Wars+5.499
imore+5.478
Neg logit contributions
Downloadha-7.530
nih-7.239
ubi-7.238
-7.061
 Indra-6.868
Token local
Token ID1957
Feature activation+6.28e+00

Pos logit contributions
OTE+5.372
 Rasmussen+4.964
imore+4.829
Wars+4.810
 Canary+4.747
Neg logit contributions
nih-6.033
-5.671
Downloadha-5.609
ubi-5.466
-5.425
Token independent
Token ID4795
Feature activation+6.59e+00

Pos logit contributions
OTE+3.157
imore+2.778
Wars+2.630
 Rasmussen+2.567
 Canary+2.531
Neg logit contributions
nih-5.773
Downloadha-5.702
-5.684
ubi-5.520
-5.463
Token journalism
Token ID14033
Feature activation+4.07e+00

Pos logit contributions
OTE+3.077
imore+2.783
Wars+2.548
 Rasmussen+2.524
 Canary+2.495
Neg logit contributions
nih-6.214
-6.133
Downloadha-6.119
ubi-6.024
-5.898
Token in
Token ID287
Feature activation+4.91e+00

Pos logit contributions
OTE+2.949
imore+2.675
Wars+2.645
 Rasmussen+2.638
 Canary+2.589
Neg logit contributions
nih-2.756
Downloadha-2.709
-2.670
-2.538
osate-2.534
Token I
Token ID314
Feature activation+2.47e-01

Pos logit contributions
nih+0.981
+0.955
Downloadha+0.943
ubi+0.913
+0.908
Neg logit contributions
OTE-1.207
imore-1.119
adelphia-1.080
Wars-1.073
finger-1.048
Token did
Token ID750
Feature activation-1.42e-01

Pos logit contributions
OTE+0.180
imore+0.166
adelphia+0.158
Wars+0.158
 Rasmussen+0.156
Neg logit contributions
nih-0.171
Downloadha-0.168
-0.166
ubi-0.160
-0.160
Token give
Token ID1577
Feature activation+3.22e-01

Pos logit contributions
nih+0.094
+0.092
Downloadha+0.091
ubi+0.088
+0.087
Neg logit contributions
OTE-0.107
imore-0.099
adelphia-0.095
Wars-0.095
 Rasmussen-0.093
Token it
Token ID340
Feature activation+3.31e+00

Pos logit contributions
OTE+0.244
imore+0.225
adelphia+0.216
Wars+0.215
 Rasmussen+0.212
Neg logit contributions
nih-0.215
Downloadha-0.209
-0.209
ubi-0.201
-0.199
Token a
Token ID257
Feature activation+5.68e+00

Pos logit contributions
OTE+2.269
imore+2.078
adelphia+1.977
 Rasmussen+1.969
 Canary+1.964
Neg logit contributions
nih-2.419
Downloadha-2.380
-2.367
ubi-2.287
-2.261
Token little
Token ID1310
Feature activation+9.93e+00

Pos logit contributions
OTE+3.048
imore+2.737
 Rasmussen+2.578
 Canary+2.577
adelphia+2.566
Neg logit contributions
nih-4.915
Downloadha-4.857
-4.809
ubi-4.755
Fit-4.649
Token thought
Token ID1807
Feature activation+6.11e+00

Pos logit contributions
OTE+6.206
imore+5.662
finger+5.526
 Canary+5.517
 Rasmussen+5.363
Neg logit contributions
Downloadha-7.420
nih-7.376
ubi-7.216
-7.132
Fit-6.911
Token.
Token ID13
Feature activation+3.85e+00

Pos logit contributions
OTE+4.589
imore+4.284
adelphia+4.022
Wars+4.013
 Rasmussen+4.007
Neg logit contributions
nih-3.966
Downloadha-3.919
-3.894
ubi-3.783
-3.728
Token I
Token ID314
Feature activation+4.14e+00

Pos logit contributions
OTE+2.897
imore+2.636
Wars+2.569
 Rasmussen+2.552
 Canary+2.542
Neg logit contributions
nih-2.560
Downloadha-2.523
-2.457
ubi-2.414
-2.360
Token started
Token ID2067
Feature activation-2.49e-01

Pos logit contributions
OTE+2.888
imore+2.635
 Rasmussen+2.516
Wars+2.495
adelphia+2.478
Neg logit contributions
Downloadha-2.938
nih-2.924
-2.827
ubi-2.767
-2.751
Token to
Token ID284
Feature activation-1.26e+00

Pos logit contributions
nih+0.164
+0.160
Downloadha+0.159
ubi+0.153
+0.153
Neg logit contributions
OTE-0.190
imore-0.176
adelphia-0.169
Wars-0.168
 Rasmussen-0.165
Token Red
Token ID2297
Feature activation+5.26e+00

Pos logit contributions
OTE+1.539
imore+1.419
adelphia+1.369
Wars+1.367
 Canary+1.345
Neg logit contributions
nih-1.358
Downloadha-1.340
-1.318
ubi-1.274
-1.264
Token Tent
Token ID30765
Feature activation+1.26e+00

Pos logit contributions
OTE+4.045
imore+3.769
Wars+3.651
 Canary+3.650
finger+3.632
Neg logit contributions
Downloadha-3.337
nih-3.270
-3.234
-3.112
ubi-3.070
Token:
Token ID25
Feature activation+6.39e+00

Pos logit contributions
OTE+0.971
imore+0.901
Wars+0.864
adelphia+0.864
 Rasmussen+0.847
Neg logit contributions
nih-0.822
Downloadha-0.806
-0.804
-0.769
ubi-0.769
Token A
Token ID317
Feature activation+6.89e+00

Pos logit contributions
OTE+4.683
imore+4.249
Wars+4.189
 Canary+4.166
 Rasmussen+4.141
Neg logit contributions
Downloadha-4.179
nih-4.177
-4.031
ubi-3.957
Fit-3.868
Token Novel
Token ID24467
Feature activation+1.83e+00

Pos logit contributions
OTE+4.826
imore+4.382
 Rasmussen+4.275
 Canary+4.274
Wars+4.247
Neg logit contributions
Downloadha-4.837
nih-4.792
-4.591
ubi-4.484
-4.422
Token (
Token ID357
Feature activation+9.91e+00

Pos logit contributions
OTE+1.372
imore+1.266
adelphia+1.214
Wars+1.209
 Rasmussen+1.192
Neg logit contributions
nih-1.236
Downloadha-1.214
-1.212
ubi-1.161
-1.154
TokenNew
Token ID3791
Feature activation+2.67e+00

Pos logit contributions
OTE+6.415
Wars+6.087
imore+5.909
finger+5.626
adelphia+5.595
Neg logit contributions
Downloadha-6.893
nih-6.826
-6.741
ubi-6.681
osate-6.620
Token York
Token ID1971
Feature activation+2.11e+00

Pos logit contributions
OTE+2.016
imore+1.907
adelphia+1.833
Wars+1.832
 Canary+1.813
Neg logit contributions
nih-1.733
Downloadha-1.727
-1.708
ubi-1.640
Fit-1.607
Token:
Token ID25
Feature activation+6.25e+00

Pos logit contributions
OTE+1.500
imore+1.396
adelphia+1.350
Wars+1.341
 Rasmussen+1.319
Neg logit contributions
Downloadha-1.466
nih-1.461
-1.441
ubi-1.398
Fit-1.378
Token St
Token ID520
Feature activation-8.57e-02

Pos logit contributions
OTE+4.537
 Rasmussen+4.142
Wars+4.124
 Canary+4.122
imore+4.121
Neg logit contributions
Downloadha-4.346
nih-4.218
-4.037
ubi-4.007
-3.822
Token.
Token ID13
Feature activation+1.12e+00

Pos logit contributions
nih+0.051
+0.050
Downloadha+0.050
ubi+0.048
+0.048
Neg logit contributions
OTE-0.070
imore-0.066
adelphia-0.063
Wars-0.063
 Rasmussen-0.062
Token.
Token ID13
Feature activation-5.46e+00

Pos logit contributions
nih+1.673
+1.606
ubi+1.534
Fit+1.528
+1.497
Neg logit contributions
OTE-2.964
imore-2.780
adelphia-2.681
 Canary-2.648
Wars-2.645
Tokenx
Token ID87
Feature activation-2.87e+00

Pos logit contributions
nih+2.332
+2.200
ubi+2.112
Fit+2.100
Downloadha+2.017
Neg logit contributions
OTE-5.431
imore-5.067
 Canary-4.984
adelphia-4.924
 Rasmussen-4.867
Token.
Token ID13
Feature activation-3.18e+00

Pos logit contributions
nih+1.550
+1.496
ubi+1.419
Downloadha+1.417
Fit+1.414
Neg logit contributions
OTE-2.554
imore-2.382
adelphia-2.300
Wars-2.271
 Canary-2.269
Token169
Token ID22172
Feature activation+5.09e-01

Pos logit contributions
nih+1.618
+1.567
Downloadha+1.543
Fit+1.468
+1.461
Neg logit contributions
OTE-2.930
imore-2.741
adelphia-2.645
 Rasmussen-2.613
Wars-2.608
Token 11
Token ID1367
Feature activation-3.02e+00

Pos logit contributions
OTE+0.373
imore+0.342
adelphia+0.329
Wars+0.328
 Rasmussen+0.323
Neg logit contributions
nih-0.351
Downloadha-0.344
-0.343
ubi-0.330
-0.328
Token66
Token ID2791
Feature activation+9.91e+00

Pos logit contributions
nih+2.097
+2.070
Downloadha+2.024
+1.998
Fit+1.986
Neg logit contributions
OTE-2.155
imore-2.047
adelphia-1.918
Wars-1.897
 Rasmussen-1.866
Token589
Token ID44169
Feature activation-6.14e-01

Pos logit contributions
OTE+6.307
adelphia+5.461
Wars+5.383
finger+5.361
imore+5.361
Neg logit contributions
nih-7.242
Downloadha-6.916
-6.805
ancial-6.642
ubi-6.621
Token x
Token ID2124
Feature activation+3.89e-01

Pos logit contributions
nih+0.288
+0.280
Downloadha+0.271
ubi+0.263
Fit+0.261
Neg logit contributions
OTE-0.585
imore-0.552
adelphia-0.533
Wars-0.532
finger-0.520
Token.
Token ID13
Feature activation-3.25e+00

Pos logit contributions
OTE+0.330
imore+0.308
adelphia+0.297
Wars+0.295
 Rasmussen+0.290
Neg logit contributions
nih-0.224
-0.217
Downloadha-0.215
ubi-0.208
Fit-0.206
Tokenx
Token ID87
Feature activation-1.77e+00

Pos logit contributions
nih+1.507
+1.441
ubi+1.375
Downloadha+1.368
Fit+1.366
Neg logit contributions
OTE-3.133
imore-2.940
 Canary-2.843
adelphia-2.837
Wars-2.815
Token.
Token ID13
Feature activation-4.85e+00

Pos logit contributions
nih+0.952
+0.918
ubi+0.877
Downloadha+0.874
Fit+0.871
Neg logit contributions
OTE-1.574
imore-1.474
adelphia-1.420
Wars-1.404
 Canary-1.395
TokenOC
Token ID4503
Feature activation-1.30e-01

Pos logit contributions
OTE+5.600
imore+4.686
Wars+4.616
adelphia+4.449
 Rasmussen+4.196
Neg logit contributions
nih-7.417
Downloadha-7.255
-7.112
ancial-6.927
osate-6.893
TokenR
Token ID49
Feature activation+2.26e-01

Pos logit contributions
nih+0.084
+0.081
Downloadha+0.081
ubi+0.079
Fit+0.078
Neg logit contributions
OTE-0.100
imore-0.094
adelphia-0.090
Wars-0.088
 Rasmussen-0.087
TokenOP
Token ID3185
Feature activation+5.80e+00

Pos logit contributions
OTE+0.138
imore+0.125
adelphia+0.119
Wars+0.118
 Rasmussen+0.115
Neg logit contributions
nih-0.183
-0.180
Downloadha-0.179
ubi-0.174
-0.173
Token/
Token ID14
Feature activation+3.07e+00

Pos logit contributions
OTE+3.563
imore+3.239
 Rasmussen+3.072
Wars+3.005
adelphia+2.928
Neg logit contributions
nih-4.648
Downloadha-4.645
-4.607
ubi-4.427
osate-4.391
Tokenh
Token ID71
Feature activation+2.01e+00

Pos logit contributions
OTE+2.582
imore+2.417
Wars+2.334
 Rasmussen+2.258
finger+2.248
Neg logit contributions
Downloadha-1.777
nih-1.770
-1.708
ancial-1.667
-1.645
Token342
Token ID31575
Feature activation+9.85e+00

Pos logit contributions
OTE+1.240
imore+1.128
adelphia+1.078
Wars+1.066
 Rasmussen+1.042
Neg logit contributions
nih-1.615
-1.583
Downloadha-1.573
ubi-1.531
Fit-1.524
Token/
Token ID14
Feature activation+5.30e-01

Pos logit contributions
OTE+5.384
imore+4.603
Wars+4.406
finger+4.385
adelphia+4.275
Neg logit contributions
Downloadha-8.033
ancial-7.988
nih-7.958
-7.789
ubi-7.749
TokenUST
Token ID7759
Feature activation+2.65e+00

Pos logit contributions
OTE+0.388
imore+0.357
adelphia+0.342
Wars+0.342
finger+0.334
Neg logit contributions
nih-0.367
-0.358
Downloadha-0.357
ubi-0.344
-0.343
Tokenr
Token ID81
Feature activation+5.15e+00

Pos logit contributions
OTE+1.971
imore+1.802
Wars+1.736
adelphia+1.724
finger+1.676
Neg logit contributions
nih-1.810
-1.770
Downloadha-1.769
-1.698
ubi-1.687
Tokenump
Token ID931
Feature activation+5.14e+00

Pos logit contributions
OTE+3.976
imore+3.618
Wars+3.508
adelphia+3.500
finger+3.418
Neg logit contributions
nih-3.352
Downloadha-3.261
-3.202
-3.097
Fit-3.055
Token220
Token ID17572
Feature activation+6.58e+00

Pos logit contributions
OTE+3.813
imore+3.528
Wars+3.429
adelphia+3.403
finger+3.287
Neg logit contributions
nih-3.389
Downloadha-3.357
-3.357
-3.195
ubi-3.164
Tokeniz
Token ID528
Feature activation+6.97e-01

Pos logit contributions
nih+2.306
+2.214
Downloadha+2.200
ubi+2.157
Fit+2.123
Neg logit contributions
OTE-2.794
imore-2.582
adelphia-2.496
Wars-2.479
 Canary-2.436
Tokenarro
Token ID34852
Feature activation-3.60e+00

Pos logit contributions
OTE+0.488
imore+0.450
adelphia+0.431
Wars+0.429
finger+0.420
Neg logit contributions
nih-0.501
Downloadha-0.494
-0.491
ubi-0.472
-0.471
Token
Token ID39073
Feature activation+8.59e-01

Pos logit contributions
nih+2.230
+2.179
Downloadha+2.136
ubi+2.080
Fit+2.072
Neg logit contributions
OTE-2.870
imore-2.657
Wars-2.548
adelphia-2.546
 Rasmussen-2.498
Tokenj
Token ID73
Feature activation-3.19e+00

Pos logit contributions
OTE+0.632
imore+0.580
adelphia+0.557
Wars+0.554
 Rasmussen+0.544
Neg logit contributions
nih-0.594
-0.577
Downloadha-0.577
ubi-0.556
-0.553
Tokenà
Token ID24247
Feature activation-4.44e+00

Pos logit contributions
nih+1.255
+1.200
Downloadha+1.191
ubi+1.120
+1.107
Neg logit contributions
OTE-3.282
imore-3.081
adelphia-2.992
Wars-2.990
 Rasmussen-2.963
Token v
Token ID410
Feature activation-1.18e+01

Pos logit contributions
nih+2.155
+2.116
Downloadha+2.075
Fit+1.972
ubi+1.961
Neg logit contributions
OTE-4.130
imore-3.842
adelphia-3.744
Wars-3.698
 Canary-3.673
Tokenu
Token ID84
Feature activation-1.27e+00

Pos logit contributions
ubi+4.380
nih+4.346
ald+4.058
Fit+3.786
+3.770
Neg logit contributions
OTE-11.439
 Canary-10.583
Wars-10.524
imore-10.401
adelphia-10.173
Token
Token ID960
Feature activation+1.33e+00

Pos logit contributions
nih+0.779
+0.758
Downloadha+0.754
ubi+0.725
+0.721
Neg logit contributions
OTE-1.030
imore-0.957
adelphia-0.921
Wars-0.917
 Rasmussen-0.901
Tokenthat
Token ID5562
Feature activation-2.28e-01

Pos logit contributions
OTE+0.965
imore+0.889
adelphia+0.852
Wars+0.851
finger+0.838
Neg logit contributions
nih-0.926
Downloadha-0.905
-0.903
ubi-0.870
-0.868
Token rare
Token ID4071
Feature activation+5.68e+00

Pos logit contributions
nih+0.152
Downloadha+0.148
+0.148
ubi+0.142
+0.142
Neg logit contributions
OTE-0.173
imore-0.159
adelphia-0.153
Wars-0.152
 Rasmussen-0.150
Tokenst
Token ID301
Feature activation+4.49e-01

Pos logit contributions
OTE+3.966
imore+3.680
adelphia+3.546
 Rasmussen+3.476
finger+3.458
Neg logit contributions
Downloadha-3.948
nih-3.929
-3.806
-3.731
ubi-3.704
Token become
Token ID1716
Feature activation-9.87e-01

Pos logit contributions
OTE+0.786
imore+0.720
adelphia+0.689
Wars+0.683
 Rasmussen+0.675
Neg logit contributions
nih-0.842
Downloadha-0.830
-0.824
-0.793
ubi-0.791
Token that
Token ID326
Feature activation+2.08e+00

Pos logit contributions
nih+0.661
+0.644
Downloadha+0.641
ubi+0.619
+0.616
Neg logit contributions
OTE-0.743
imore-0.686
adelphia-0.658
Wars-0.655
 Rasmussen-0.642
Token day
Token ID1110
Feature activation-1.57e-02

Pos logit contributions
OTE+1.537
imore+1.412
adelphia+1.354
Wars+1.350
 Rasmussen+1.330
Neg logit contributions
nih-1.422
Downloadha-1.397
-1.384
ubi-1.323
-1.323
Token's
Token ID338
Feature activation+1.47e+00

Pos logit contributions
nih+0.011
+0.010
Downloadha+0.010
ubi+0.010
+0.010
Neg logit contributions
OTE-0.012
imore-0.011
adelphia-0.010
Wars-0.010
 Rasmussen-0.010
Token ze
Token ID41271
Feature activation+3.34e+00

Pos logit contributions
OTE+1.088
imore+1.003
adelphia+0.960
Wars+0.954
 Canary+0.951
Neg logit contributions
nih-1.012
Downloadha-0.990
-0.976
ubi-0.942
-0.933
Tokenit
Token ID270
Feature activation-1.13e+01

Pos logit contributions
OTE+2.293
imore+2.090
finger+2.017
adelphia+2.012
Wars+1.992
Neg logit contributions
nih-2.427
Downloadha-2.381
-2.351
-2.263
ubi-2.224
Tokengeist
Token ID49782
Feature activation+3.80e+00

Pos logit contributions
nih+7.465
ubi+7.409
Downloadha+7.345
ancial+7.315
+7.167
Neg logit contributions
OTE-7.763
imore-6.811
adelphia-6.541
Wars-6.491
 Rasmussen-6.423
Token.
Token ID13
Feature activation+1.17e+00

Pos logit contributions
OTE+2.745
imore+2.508
adelphia+2.398
Wars+2.392
 Rasmussen+2.359
Neg logit contributions
nih-2.677
Downloadha-2.644
-2.610
ubi-2.483
-2.466
Token
Token ID198
Feature activation+2.35e+00

Pos logit contributions
OTE+0.949
imore+0.880
adelphia+0.848
Wars+0.846
 Rasmussen+0.832
Neg logit contributions
nih-0.720
-0.700
Downloadha-0.699
ubi-0.670
-0.666
Token
Token ID198
Feature activation+3.22e-01

Pos logit contributions
OTE+1.886
imore+1.749
Wars+1.684
adelphia+1.682
 Rasmussen+1.655
Neg logit contributions
nih-1.452
Downloadha-1.420
-1.409
ubi-1.353
-1.346
Token'
Token ID6
Feature activation+9.12e-01

Pos logit contributions
OTE+0.210
imore+0.190
adelphia+0.182
Wars+0.182
finger+0.177
Neg logit contributions
nih-0.249
-0.243
Downloadha-0.243
ubi-0.235
-0.234
Token ex
Token ID409
Feature activation+2.23e+00

Pos logit contributions
nih+0.864
+0.846
Downloadha+0.835
ubi+0.803
+0.801
Neg logit contributions
OTE-1.059
imore-0.985
adelphia-0.946
Wars-0.942
finger-0.921
Token­
Token ID3907
Feature activation+1.43e-01

Pos logit contributions
OTE+2.314
imore+2.177
adelphia+2.126
Wars+2.100
 Rasmussen+2.079
Neg logit contributions
Downloadha-0.850
nih-0.831
-0.822
-0.769
ubi-0.750
Tokenclaimed
Token ID12795
Feature activation-1.18e+00

Pos logit contributions
OTE+0.095
imore+0.086
adelphia+0.083
Wars+0.082
 Rasmussen+0.080
Neg logit contributions
nih-0.108
Downloadha-0.106
-0.106
ubi-0.102
-0.102
Token Marco
Token ID16556
Feature activation+1.67e+00

Pos logit contributions
nih+0.781
+0.762
Downloadha+0.756
ubi+0.728
+0.725
Neg logit contributions
OTE-0.896
imore-0.829
adelphia-0.795
Wars-0.793
finger-0.775
Token Ru
Token ID11667
Feature activation-2.97e+00

Pos logit contributions
OTE+1.120
imore+1.019
Wars+0.975
adelphia+0.970
 Rasmussen+0.965
Neg logit contributions
nih-1.245
Downloadha-1.234
-1.217
ubi-1.186
-1.185
Token­
Token ID3907
Feature activation-1.11e+01

Pos logit contributions
nih+1.147
+1.081
Downloadha+1.081
ubi+1.026
Fit+1.005
Neg logit contributions
OTE-3.081
imore-2.902
adelphia-2.823
Wars-2.819
 Rasmussen-2.784
Tokenb
Token ID65
Feature activation-7.47e+00

Pos logit contributions
nih+8.217
ald+7.522
ubi+7.417
+7.357
Fit+7.344
Neg logit contributions
OTE-7.400
imore-6.437
 Canary-6.434
Wars-6.065
 Rasmussen-6.051
Tokenio
Token ID952
Feature activation+2.78e-01

Pos logit contributions
nih+4.732
ubi+4.398
+4.361
ald+4.269
Downloadha+4.265
Neg logit contributions
OTE-5.846
imore-5.252
 Canary-5.197
adelphia-5.166
Wars-5.164
Token on
Token ID319
Feature activation+1.84e+00

Pos logit contributions
OTE+0.225
imore+0.209
Wars+0.201
adelphia+0.201
 Rasmussen+0.198
Neg logit contributions
nih-0.171
Downloadha-0.166
-0.166
ubi-0.159
-0.158
Token Fox
Token ID5426
Feature activation+6.98e-01

Pos logit contributions
OTE+1.385
imore+1.276
adelphia+1.227
Wars+1.224
 Rasmussen+1.212
Neg logit contributions
nih-1.231
-1.201
Downloadha-1.201
ubi-1.155
-1.151
Token News
Token ID3000
Feature activation+1.18e+00

Pos logit contributions
OTE+0.623
imore+0.583
adelphia+0.564
Wars+0.563
 Rasmussen+0.554
Neg logit contributions
nih-0.370
Downloadha-0.360
-0.358
ubi-0.340
-0.338
Token,
Token ID11
Feature activation-1.07e+00

Pos logit contributions
OTE+0.444
imore+0.410
adelphia+0.395
Wars+0.395
 Rasmussen+0.390
Neg logit contributions
nih-0.331
Downloadha-0.321
-0.320
ubi-0.308
-0.306
Token this
Token ID428
Feature activation+2.14e-01

Pos logit contributions
nih+0.691
+0.674
Downloadha+0.668
ubi+0.643
+0.641
Neg logit contributions
OTE-0.826
imore-0.768
adelphia-0.739
Wars-0.732
finger-0.717
Token
Token ID39073
Feature activation+1.51e+00

Pos logit contributions
OTE+0.162
imore+0.149
adelphia+0.143
Wars+0.143
 Rasmussen+0.140
Neg logit contributions
nih-0.143
-0.139
Downloadha-0.139
ubi-0.133
-0.133
Tokenj
Token ID73
Feature activation-3.80e+00

Pos logit contributions
OTE+1.394
imore+1.310
adelphia+1.266
Wars+1.262
 Rasmussen+1.242
Neg logit contributions
nih-0.758
-0.735
Downloadha-0.732
ubi-0.693
-0.691
Tokenà
Token ID24247
Feature activation-5.24e+00

Pos logit contributions
nih+1.443
+1.372
Downloadha+1.354
ubi+1.275
Fit+1.266
Neg logit contributions
OTE-3.965
imore-3.724
Wars-3.629
adelphia-3.615
 Rasmussen-3.580
Token v
Token ID410
Feature activation-1.10e+01

Pos logit contributions
+2.426
nih+2.418
Downloadha+2.319
Fit+2.230
+2.196
Neg logit contributions
OTE-4.938
imore-4.627
adelphia-4.509
Wars-4.472
finger-4.438
Tokenu
Token ID84
Feature activation-1.02e+00

Pos logit contributions
ubi+4.064
nih+4.018
ald+3.817
+3.621
Fit+3.576
Neg logit contributions
OTE-10.648
Wars-9.892
 Canary-9.854
imore-9.796
adelphia-9.619
Token isn
Token ID2125
Feature activation-5.09e-01

Pos logit contributions
nih+0.658
+0.642
Downloadha+0.639
ubi+0.615
+0.612
Neg logit contributions
OTE-0.791
imore-0.732
adelphia-0.703
Wars-0.700
 Rasmussen-0.686
Token
Token ID447
Feature activation-1.91e+00

Pos logit contributions
nih+0.405
+0.400
Downloadha+0.392
ubi+0.383
+0.383
Neg logit contributions
OTE-0.320
imore-0.293
adelphia-0.278
Wars-0.274
 Canary-0.269
Token
Token ID247
Feature activation-1.76e+00

Pos logit contributions
nih+1.209
+1.197
Downloadha+1.158
ubi+1.134
+1.131
Neg logit contributions
OTE-1.489
imore-1.384
Wars-1.337
adelphia-1.330
 Canary-1.307
Tokent
Token ID83
Feature activation-3.17e-01

Pos logit contributions
nih+1.043
+1.011
Downloadha+1.007
ubi+0.966
+0.960
Neg logit contributions
OTE-1.468
imore-1.366
adelphia-1.317
Wars-1.313
 Rasmussen-1.289
Tokenve
Token ID303
Feature activation-1.58e+00

Pos logit contributions
nih+0.147
Downloadha+0.145
+0.145
ubi+0.139
+0.139
Neg logit contributions
OTE-0.147
imore-0.135
adelphia-0.128
Wars-0.128
 Rasmussen-0.127
Token made
Token ID925
Feature activation-1.26e+00

Pos logit contributions
nih+1.073
+1.046
Downloadha+1.044
ubi+1.006
+1.001
Neg logit contributions
OTE-1.172
imore-1.081
adelphia-1.037
Wars-1.032
 Rasmussen-1.013
Token,
Token ID11
Feature activation-2.61e+00

Pos logit contributions
nih+0.768
+0.747
Downloadha+0.744
ubi+0.713
+0.711
Neg logit contributions
OTE-1.027
imore-0.957
adelphia-0.921
Wars-0.916
finger-0.899
Token we
Token ID356
Feature activation-2.00e+00

Pos logit contributions
nih+1.670
+1.625
Downloadha+1.613
ubi+1.552
Fit+1.542
Neg logit contributions
OTE-2.023
imore-1.897
adelphia-1.819
Wars-1.785
finger-1.759
Token more
Token ID517
Feature activation-4.67e-01

Pos logit contributions
nih+1.463
+1.431
Downloadha+1.422
ubi+1.378
+1.372
Neg logit contributions
OTE-1.386
imore-1.274
adelphia-1.219
Wars-1.209
 Rasmussen-1.183
Token or
Token ID393
Feature activation-1.08e+01

Pos logit contributions
nih+0.321
+0.313
Downloadha+0.312
ubi+0.301
+0.300
Neg logit contributions
OTE-0.344
imore-0.317
adelphia-0.303
Wars-0.302
 Rasmussen-0.296
Token less
Token ID1342
Feature activation-2.19e+00

Pos logit contributions
+4.462
nih+4.220
Downloadha+4.048
Fit+3.712
ald+3.694
Neg logit contributions
OTE-10.450
adelphia-9.819
imore-9.752
Wars-9.377
 Canary-9.371
Token know
Token ID760
Feature activation-2.28e+00

Pos logit contributions
nih+1.494
+1.453
Downloadha+1.443
ubi+1.392
+1.390
Neg logit contributions
OTE-1.619
imore-1.498
adelphia-1.436
Wars-1.423
 Canary-1.389
Token where
Token ID810
Feature activation-2.82e+00

Pos logit contributions
nih+1.469
+1.431
Downloadha+1.424
ubi+1.374
Fit+1.364
Neg logit contributions
OTE-1.768
imore-1.644
adelphia-1.571
Wars-1.563
 Rasmussen-1.533
Token we
Token ID356
Feature activation+4.73e-01

Pos logit contributions
nih+1.830
+1.779
Downloadha+1.758
ubi+1.704
+1.697
Neg logit contributions
OTE-2.163
imore-2.022
adelphia-1.942
Wars-1.906
finger-1.874
Token are
Token ID389
Feature activation+1.26e+00

Pos logit contributions
OTE+0.322
imore+0.293
Wars+0.279
adelphia+0.279
 Rasmussen+0.273
Neg logit contributions
nih-0.352
Downloadha-0.345
-0.343
-0.331
ubi-0.330
Token ----
Token ID13498
Feature activation-1.68e-01

Pos logit contributions
nih+3.223
+3.107
Downloadha+3.043
ubi+2.985
+2.969
Neg logit contributions
OTE-3.393
imore-3.155
adelphia-3.075
Wars-3.041
finger-2.953
Token-+
Token ID19529
Feature activation-8.39e+00

Pos logit contributions
nih+0.144
+0.141
Downloadha+0.141
ubi+0.136
+0.136
Neg logit contributions
OTE-0.096
imore-0.086
adelphia-0.081
Wars-0.081
 Rasmussen-0.079
Token--------------------------------
Token ID3880
Feature activation-4.20e+00

Pos logit contributions
nih+5.931
+5.926
Downloadha+5.716
ubi+5.695
Fit+5.672
Neg logit contributions
OTE-5.493
imore-5.129
adelphia-4.984
Wars-4.852
 Rasmussen-4.795
Token+
Token ID10
Feature activation-9.57e+00

Pos logit contributions
nih+3.160
+3.095
Downloadha+3.020
ubi+2.992
+2.986
Neg logit contributions
OTE-2.741
imore-2.528
adelphia-2.451
Wars-2.419
 Rasmussen-2.365
Token-------
Token ID26866
Feature activation-4.89e+00

Pos logit contributions
nih+6.523
+6.460
+6.203
Downloadha+6.172
ubi+6.155
Neg logit contributions
OTE-6.457
imore-6.032
adelphia-5.897
Wars-5.772
 Rasmussen-5.615
Token|
Token ID91
Feature activation-1.07e+01

Pos logit contributions
nih+2.577
+2.490
Downloadha+2.421
ubi+2.385
+2.358
Neg logit contributions
OTE-4.318
imore-4.043
adelphia-3.951
Wars-3.930
 Rasmussen-3.861
Token |
Token ID930
Feature activation-4.13e+00

Pos logit contributions
nih+5.444
+5.206
+5.059
Downloadha+4.972
ubi+4.960
Neg logit contributions
OTE-8.971
imore-8.451
adelphia-8.412
Wars-8.301
 Rasmussen-7.979
Token 1
Token ID352
Feature activation-2.16e+00

Pos logit contributions
nih+1.836
+1.717
Downloadha+1.677
+1.618
ubi+1.614
Neg logit contributions
OTE-4.031
imore-3.793
adelphia-3.700
Wars-3.688
finger-3.613
Token |
Token ID930
Feature activation-1.68e+00

Pos logit contributions
nih+1.377
+1.336
Downloadha+1.320
ubi+1.285
+1.275
Neg logit contributions
OTE-1.699
imore-1.577
adelphia-1.517
Wars-1.512
 Rasmussen-1.479
Token org
Token ID8745
Feature activation+2.39e+00

Pos logit contributions
nih+0.928
+0.896
Downloadha+0.889
ubi+0.853
+0.849
Neg logit contributions
OTE-1.460
imore-1.364
adelphia-1.319
Wars-1.313
finger-1.287
Token.
Token ID764
Feature activation+1.60e-01

Pos logit contributions
OTE+1.741
imore+1.611
adelphia+1.536
Wars+1.535
 Rasmussen+1.505
Neg logit contributions
Downloadha-1.631
nih-1.620
-1.596
ubi-1.541
-1.523
Token other
Token ID584
Feature activation+3.80e-01

Pos logit contributions
nih+0.269
+0.264
Downloadha+0.264
ubi+0.255
+0.253
Neg logit contributions
OTE-0.202
imore-0.182
adelphia-0.173
Wars-0.172
 Rasmussen-0.168
Token in
Token ID287
Feature activation+1.55e-01

Pos logit contributions
OTE+0.300
imore+0.278
adelphia+0.268
Wars+0.266
 Rasmussen+0.262
Neg logit contributions
nih-0.240
Downloadha-0.235
-0.234
Fit-0.224
-0.223
Token a
Token ID257
Feature activation-1.68e+00

Pos logit contributions
OTE+0.113
imore+0.104
adelphia+0.100
Wars+0.100
 Rasmussen+0.098
Neg logit contributions
nih-0.107
-0.104
Downloadha-0.104
ubi-0.100
-0.100
Token one
Token ID530
Feature activation+6.75e-01

Pos logit contributions
nih+1.110
+1.082
Downloadha+1.072
ubi+1.038
+1.033
Neg logit contributions
OTE-1.284
imore-1.186
adelphia-1.134
Wars-1.132
finger-1.110
Token-
Token ID12
Feature activation-1.59e+00

Pos logit contributions
OTE+0.516
imore+0.476
adelphia+0.461
Wars+0.456
 Rasmussen+0.448
Neg logit contributions
nih-0.446
Downloadha-0.436
-0.433
ubi-0.418
-0.414
Tokenon
Token ID261
Feature activation-1.07e+01

Pos logit contributions
nih+1.148
+1.115
Downloadha+1.106
ubi+1.075
Fit+1.071
Neg logit contributions
OTE-1.112
imore-1.018
adelphia-0.975
Wars-0.969
 Canary-0.955
Token-
Token ID12
Feature activation-6.87e+00

Pos logit contributions
nih+4.581
+4.486
ald+4.192
Fit+4.092
ubi+4.063
Neg logit contributions
OTE-9.785
imore-9.022
 Canary-8.980
adelphia-8.948
Wars-8.748
Tokenone
Token ID505
Feature activation-2.70e+00

Pos logit contributions
nih+4.252
+3.993
Downloadha+3.896
ubi+3.808
Fit+3.800
Neg logit contributions
OTE-5.502
 Canary-5.052
imore-5.030
adelphia-4.970
 Rasmussen-4.964
Token,
Token ID11
Feature activation+8.13e-01

Pos logit contributions
nih+1.707
+1.661
Downloadha+1.646
Fit+1.592
ubi+1.587
Neg logit contributions
OTE-2.134
imore-1.970
adelphia-1.892
Wars-1.885
finger-1.851
Token single
Token ID2060
Feature activation-9.05e-01

Pos logit contributions
OTE+0.615
imore+0.568
adelphia+0.548
Wars+0.542
 Rasmussen+0.536
Neg logit contributions
nih-0.541
-0.528
Downloadha-0.527
ubi-0.506
-0.505
Token-
Token ID12
Feature activation-9.54e-01

Pos logit contributions
nih+0.579
+0.564
Downloadha+0.562
ubi+0.540
+0.538
Neg logit contributions
OTE-0.709
imore-0.657
adelphia-0.631
Wars-0.629
 Rasmussen-0.618
Token (),
Token ID29994
Feature activation-2.08e+00

Pos logit contributions
OTE+0.814
imore+0.730
Wars+0.694
adelphia+0.692
finger+0.671
Neg logit contributions
nih-1.142
Downloadha-1.118
-1.114
ubi-1.074
-1.070
Token half
Token ID2063
Feature activation+2.44e+00

Pos logit contributions
nih+1.260
+1.224
Downloadha+1.215
ubi+1.173
+1.163
Neg logit contributions
OTE-1.693
imore-1.573
adelphia-1.517
Wars-1.510
 Rasmussen-1.483
Token_
Token ID62
Feature activation-2.27e+00

Pos logit contributions
OTE+1.771
imore+1.631
Wars+1.562
adelphia+1.561
finger+1.529
Neg logit contributions
nih-1.699
Downloadha-1.664
-1.661
ubi-1.594
-1.585
Tokenm
Token ID76
Feature activation+5.99e-02

Pos logit contributions
nih+1.092
+1.046
Downloadha+1.045
ubi+0.993
+0.984
Neg logit contributions
OTE-2.152
imore-2.011
adelphia-1.949
Wars-1.944
 Rasmussen-1.920
Tokenoves
Token ID5241
Feature activation-4.46e+00

Pos logit contributions
OTE+0.038
imore+0.035
adelphia+0.033
Wars+0.033
 Rasmussen+0.032
Neg logit contributions
nih-0.047
-0.046
Downloadha-0.046
ubi-0.044
-0.044
Token :
Token ID1058
Feature activation-1.02e+01

Pos logit contributions
nih+3.343
+3.294
Fit+3.209
+3.180
ubi+3.153
Neg logit contributions
OTE-2.838
imore-2.634
adelphia-2.564
 Canary-2.477
Wars-2.429
Token self
Token ID2116
Feature activation-3.19e+00

Pos logit contributions
nih+4.195
+4.074
ubi+3.931
Fit+3.811
+3.771
Neg logit contributions
OTE-9.381
imore-9.030
adelphia-8.773
Wars-8.618
 Rasmussen-8.422
Token.
Token ID764
Feature activation-1.14e+00

Pos logit contributions
nih+1.915
+1.856
+1.750
Fit+1.743
ubi+1.742
Neg logit contributions
OTE-2.628
imore-2.445
adelphia-2.375
Wars-2.321
 Canary-2.309
Tokenhalf
Token ID13959
Feature activation+4.60e-01

Pos logit contributions
nih+0.822
+0.803
Downloadha+0.801
ubi+0.773
+0.770
Neg logit contributions
OTE-0.806
imore-0.741
adelphia-0.708
Wars-0.706
 Rasmussen-0.691
Token_
Token ID62
Feature activation-3.14e+00

Pos logit contributions
OTE+0.328
imore+0.303
adelphia+0.291
Wars+0.290
finger+0.285
Neg logit contributions
nih-0.325
-0.319
Downloadha-0.317
ubi-0.306
Fit-0.303
Tokenm
Token ID76
Feature activation+4.59e-01

Pos logit contributions
nih+1.525
Downloadha+1.464
+1.460
ubi+1.386
+1.380
Neg logit contributions
OTE-2.958
imore-2.758
adelphia-2.675
Wars-2.666
 Rasmussen-2.639
Token the
Token ID262
Feature activation-1.13e+00

Pos logit contributions
nih+0.186
+0.181
Downloadha+0.181
ubi+0.174
+0.173
Neg logit contributions
OTE-0.196
imore-0.180
adelphia-0.173
Wars-0.172
 Rasmussen-0.169
Token F
Token ID376
Feature activation+9.58e-01

Pos logit contributions
nih+0.755
+0.736
Downloadha+0.731
ubi+0.706
+0.703
Neg logit contributions
OTE-0.849
imore-0.784
adelphia-0.754
Wars-0.749
 Rasmussen-0.732
Token.
Token ID13
Feature activation-5.57e-02

Pos logit contributions
OTE+0.817
imore+0.763
adelphia+0.737
Wars+0.732
 Canary+0.721
Neg logit contributions
nih-0.548
-0.529
Downloadha-0.522
ubi-0.506
-0.503
TokenI
Token ID40
Feature activation-7.55e+00

Pos logit contributions
nih+0.036
+0.035
Downloadha+0.034
Fit+0.034
+0.034
Neg logit contributions
OTE-0.043
imore-0.041
adelphia-0.039
Wars-0.038
finger-0.038
Token.
Token ID13
Feature activation-5.82e+00

Pos logit contributions
nih+3.206
+3.115
ubi+2.995
+2.923
Fit+2.905
Neg logit contributions
OTE-7.089
imore-6.763
adelphia-6.615
 Canary-6.380
finger-6.346
TokenR
Token ID49
Feature activation-1.02e+01

Pos logit contributions
nih+3.186
+3.122
Fit+3.018
+2.960
ubi+2.903
Neg logit contributions
OTE-4.871
imore-4.739
adelphia-4.583
finger-4.312
 Canary-4.308
Token.
Token ID13
Feature activation-5.62e+00

Pos logit contributions
nih+5.246
+5.083
Downloadha+5.009
+4.928
ubi+4.833
Neg logit contributions
OTE-8.513
imore-7.977
adelphia-7.800
Wars-7.550
finger-7.507
Token
Token ID198
Feature activation-6.43e-01

Pos logit contributions
nih+3.295
+3.190
Downloadha+3.176
+3.036
ubi+3.003
Neg logit contributions
OTE-4.594
imore-4.339
adelphia-4.187
Wars-4.085
finger-4.004
Token
Token ID198
Feature activation-4.53e-01

Pos logit contributions
nih+0.362
+0.353
Downloadha+0.340
Fit+0.333
ubi+0.332
Neg logit contributions
OTE-0.554
imore-0.520
adelphia-0.501
Wars-0.494
 Canary-0.488
Token
Token ID447
Feature activation+2.39e+00

Pos logit contributions
nih+0.314
+0.306
Downloadha+0.306
ubi+0.295
+0.293
Neg logit contributions
OTE-0.331
imore-0.304
adelphia-0.291
Wars-0.291
 Rasmussen-0.284
Token
Token ID250
Feature activation+6.99e-01

Pos logit contributions
OTE+1.962
imore+1.812
adelphia+1.748
Wars+1.740
 Rasmussen+1.713
Neg logit contributions
nih-1.444
Downloadha-1.417
-1.389
ubi-1.348
Fit-1.324
Token
Token ID198
Feature activation+2.67e-01

Pos logit contributions
nih+0.938
+0.920
Downloadha+0.876
Fit+0.865
+0.859
Neg logit contributions
OTE-1.451
imore-1.366
adelphia-1.314
Wars-1.294
 Canary-1.285
TokenEven
Token ID6104
Feature activation+1.16e+00

Pos logit contributions
OTE+0.212
imore+0.197
adelphia+0.189
Wars+0.188
 Rasmussen+0.185
Neg logit contributions
nih-0.167
-0.163
Downloadha-0.162
ubi-0.156
-0.155
Token more
Token ID517
Feature activation-1.30e+00

Pos logit contributions
OTE+0.886
imore+0.819
adelphia+0.786
Wars+0.783
 Rasmussen+0.775
Neg logit contributions
nih-0.760
Downloadha-0.743
-0.739
ubi-0.714
-0.710
Token h
Token ID289
Feature activation+6.22e-01

Pos logit contributions
nih+0.817
+0.795
Downloadha+0.791
ubi+0.762
+0.757
Neg logit contributions
OTE-1.037
imore-0.964
adelphia-0.926
Wars-0.922
 Rasmussen-0.904
Tokenu
Token ID84
Feature activation+1.96e+00

Pos logit contributions
OTE+0.509
imore+0.473
adelphia+0.456
Wars+0.454
 Rasmussen+0.445
Neg logit contributions
nih-0.376
Downloadha-0.369
-0.369
-0.350
Fit-0.347
Token­
Token ID3907
Feature activation-1.01e+01

Pos logit contributions
OTE+2.042
imore+1.949
adelphia+1.883
Wars+1.872
finger+1.857
Neg logit contributions
nih-0.725
-0.713
Downloadha-0.711
-0.658
Fit-0.642
Tokenmi
Token ID11632
Feature activation-3.42e+00

Pos logit contributions
nih+6.988
ubi+6.472
ald+6.352
+6.191
dn+6.152
Neg logit contributions
OTE-7.454
 Canary-6.366
imore-6.238
Wars-6.192
 Rasmussen-5.956
Token­
Token ID3907
Feature activation-9.28e+00

Pos logit contributions
nih+1.333
+1.280
Downloadha+1.265
ubi+1.207
Fit+1.180
Neg logit contributions
OTE-3.513
imore-3.322
adelphia-3.220
Wars-3.215
finger-3.175
Tokenli
Token ID4528
Feature activation-5.39e+00

Pos logit contributions
nih+5.823
ubi+5.299
+5.232
Downloadha+5.165
Fit+5.135
Neg logit contributions
OTE-7.540
imore-6.843
 Canary-6.502
 Rasmussen-6.401
Wars-6.329
Token­
Token ID3907
Feature activation-8.65e+00

Pos logit contributions
nih+1.847
+1.767
Downloadha+1.747
ubi+1.666
Fit+1.627
Neg logit contributions
OTE-5.738
imore-5.444
Wars-5.284
adelphia-5.277
finger-5.224
Tokenat
Token ID265
Feature activation-5.35e+00

Pos logit contributions
nih+5.554
ubi+5.126
+5.119
Downloadha+5.064
Fit+5.027
Neg logit contributions
OTE-6.818
imore-6.207
 Canary-5.884
 Rasmussen-5.818
Wars-5.793
Tokenain
Token ID391
Feature activation-1.51e+00

Pos logit contributions
nih+4.882
ubi+4.628
ald+4.466
Fit+4.334
+4.326
Neg logit contributions
OTE-6.301
imore-5.792
 Canary-5.708
adelphia-5.648
Wars-5.613
Token told
Token ID1297
Feature activation+2.90e+00

Pos logit contributions
nih+0.917
+0.891
Downloadha+0.884
ubi+0.852
+0.847
Neg logit contributions
OTE-1.229
imore-1.142
adelphia-1.100
Wars-1.095
 Rasmussen-1.075
Token a
Token ID257
Feature activation+2.37e+00

Pos logit contributions
OTE+2.168
imore+1.987
adelphia+1.907
Wars+1.905
 Rasmussen+1.901
Neg logit contributions
nih-1.954
Downloadha-1.918
-1.903
ubi-1.831
-1.823
Token Phoenix
Token ID9643
Feature activation+6.61e+00

Pos logit contributions
OTE+1.816
imore+1.672
adelphia+1.612
 Rasmussen+1.602
Wars+1.602
Neg logit contributions
nih-1.558
Downloadha-1.530
-1.512
ubi-1.459
Fit-1.452
Token ra
Token ID2179
Feature activation+1.45e+00

Pos logit contributions
OTE+5.223
imore+4.883
adelphia+4.752
 Rasmussen+4.676
Wars+4.642
Neg logit contributions
nih-4.011
Downloadha-4.001
-3.885
ubi-3.731
Fit-3.698
Token­
Token ID3907
Feature activation-1.00e+01

Pos logit contributions
OTE+1.484
imore+1.418
adelphia+1.366
Wars+1.358
finger+1.338
Neg logit contributions
nih-0.566
-0.550
Downloadha-0.548
-0.512
Fit-0.498
Tokend
Token ID67
Feature activation-7.03e+00

Pos logit contributions
nih+7.510
+6.950
ubi+6.900
Downloadha+6.894
ald+6.883
Neg logit contributions
OTE-6.644
imore-5.831
 Canary-5.681
Wars-5.507
 Rasmussen-5.420
Tokenio
Token ID952
Feature activation+8.07e-01

Pos logit contributions
nih+4.697
+4.483
ubi+4.438
Downloadha+4.416
+4.300
Neg logit contributions
OTE-5.090
imore-4.621
Wars-4.566
 Canary-4.491
adelphia-4.461
Token st
Token ID336
Feature activation-2.63e-01

Pos logit contributions
OTE+0.651
imore+0.606
adelphia+0.583
Wars+0.581
 Rasmussen+0.571
Neg logit contributions
nih-0.498
Downloadha-0.486
-0.485
ubi-0.463
-0.462
Tokena
Token ID64
Feature activation+3.14e+00

Pos logit contributions
nih+0.172
+0.168
Downloadha+0.168
ubi+0.161
+0.160
Neg logit contributions
OTE-0.203
imore-0.188
adelphia-0.181
Wars-0.180
 Rasmussen-0.176
Token­
Token ID3907
Feature activation-7.29e+00

Pos logit contributions
OTE+3.184
imore+3.036
adelphia+2.924
Wars+2.888
 Rasmussen+2.861
Neg logit contributions
nih-1.280
Downloadha-1.247
-1.208
-1.147
ubi-1.107
Token Obama
Token ID2486
Feature activation-1.27e+00

Pos logit contributions
nih+0.022
Downloadha+0.022
+0.022
ubi+0.021
+0.021
Neg logit contributions
OTE-0.020
imore-0.018
adelphia-0.017
Wars-0.017
 Rasmussen-0.017
Token is
Token ID318
Feature activation+6.04e-01

Pos logit contributions
nih+0.806
+0.786
Downloadha+0.777
ubi+0.750
+0.746
Neg logit contributions
OTE-0.993
imore-0.924
adelphia-0.887
Wars-0.884
finger-0.865
Token the
Token ID262
Feature activation-9.63e-01

Pos logit contributions
OTE+0.468
imore+0.433
adelphia+0.416
Wars+0.414
 Rasmussen+0.407
Neg logit contributions
nih-0.391
Downloadha-0.381
-0.381
ubi-0.366
-0.365
Token worst
Token ID5290
Feature activation-1.16e+00

Pos logit contributions
nih+0.626
+0.610
Downloadha+0.606
ubi+0.584
+0.581
Neg logit contributions
OTE-0.745
imore-0.690
adelphia-0.663
Wars-0.660
 Rasmussen-0.646
Token ne
Token ID497
Feature activation-1.59e+00

Pos logit contributions
nih+0.761
+0.740
Downloadha+0.735
ubi+0.709
+0.705
Neg logit contributions
OTE-0.891
imore-0.826
adelphia-0.793
Wars-0.790
 Rasmussen-0.773
Token­
Token ID3907
Feature activation-9.99e+00

Pos logit contributions
nih+0.639
+0.612
Downloadha+0.611
ubi+0.571
+0.567
Neg logit contributions
OTE-1.618
imore-1.527
adelphia-1.482
Wars-1.477
 Rasmussen-1.458
Tokengo
Token ID2188
Feature activation-6.62e+00

Pos logit contributions
nih+6.511
ubi+6.098
Fit+5.965
Downloadha+5.930
ancial+5.870
Neg logit contributions
OTE-7.433
 Canary-6.638
imore-6.573
Wars-6.438
 Rasmussen-6.265
Token­
Token ID3907
Feature activation-6.79e+00

Pos logit contributions
nih+2.460
+2.228
Fit+2.175
ubi+2.174
Downloadha+2.156
Neg logit contributions
OTE-6.809
imore-6.432
Wars-6.277
 Canary-6.260
adelphia-6.249
Tokenti
Token ID20259
Feature activation-1.49e+00

Pos logit contributions
nih+4.351
+3.994
ubi+3.968
Fit+3.942
Downloadha+3.940
Neg logit contributions
OTE-5.318
imore-4.890
 Canary-4.759
Wars-4.584
adelphia-4.504
Token­
Token ID3907
Feature activation-8.83e+00

Pos logit contributions
nih+0.522
+0.496
Downloadha+0.496
ubi+0.456
+0.455
Neg logit contributions
OTE-1.607
imore-1.519
adelphia-1.481
Wars-1.473
 Rasmussen-1.455
Tokenat
Token ID265
Feature activation-3.21e+00

Pos logit contributions
nih+5.875
ancial+5.223
ubi+5.212
+5.197
Fit+5.179
Neg logit contributions
OTE-6.782
imore-6.266
 Canary-5.857
Wars-5.841
 Rasmussen-5.784
Tokend
Token ID67
Feature activation-1.57e+00

Pos logit contributions
nih+0.774
+0.714
Downloadha+0.699
ubi+0.675
Fit+0.663
Neg logit contributions
OTE-2.122
imore-1.991
adelphia-1.944
Wars-1.929
 Canary-1.921
Token |
Token ID930
Feature activation-6.38e+00

Pos logit contributions
nih+0.596
+0.568
Downloadha+0.544
ubi+0.526
Fit+0.526
Neg logit contributions
OTE-1.642
imore-1.551
adelphia-1.513
Wars-1.503
 Canary-1.481
Token +
Token ID1343
Feature activation-6.61e+00

Pos logit contributions
nih+4.218
+4.105
Downloadha+3.982
Fit+3.921
+3.913
Neg logit contributions
OTE-4.702
imore-4.329
adelphia-4.235
Wars-4.195
 Rasmussen-4.088
Token--------------------
Token ID19351
Feature activation+2.18e+00

Pos logit contributions
nih+4.287
ubi+4.064
+4.064
Downloadha+3.960
Fit+3.883
Neg logit contributions
OTE-4.881
imore-4.597
 Canary-4.545
adelphia-4.487
Wars-4.443
Token--+
Token ID44785
Feature activation-4.06e+00

Pos logit contributions
OTE+1.110
imore+0.984
adelphia+0.923
Wars+0.917
 Rasmussen+0.889
Neg logit contributions
nih-1.998
-1.961
Downloadha-1.957
ubi-1.904
-1.898
Token |
Token ID930
Feature activation-9.96e+00

Pos logit contributions
nih+1.443
+1.377
+1.280
Fit+1.279
ubi+1.268
Neg logit contributions
OTE-4.261
imore-4.078
adelphia-3.972
Wars-3.911
 Canary-3.824
Token [
Token ID685
Feature activation+7.34e-01

Pos logit contributions
nih+5.875
+5.571
+5.372
ubi+5.269
Fit+5.234
Neg logit contributions
OTE-7.706
imore-7.225
adelphia-7.124
Wars-7.052
 Rasmussen-6.850
Tokenaddress
Token ID21975
Feature activation-7.63e-02

Pos logit contributions
OTE+0.564
imore+0.519
adelphia+0.499
Wars+0.494
finger+0.494
Neg logit contributions
nih-0.478
Downloadha-0.474
-0.471
ubi-0.450
Fit-0.448
Token of
Token ID286
Feature activation-2.67e-01

Pos logit contributions
nih+0.052
Downloadha+0.051
+0.051
ubi+0.049
Fit+0.048
Neg logit contributions
OTE-0.057
imore-0.052
adelphia-0.050
Wars-0.050
 Rasmussen-0.049
Token "
Token ID366
Feature activation+5.35e-01

Pos logit contributions
nih+0.149
+0.145
Downloadha+0.144
ubi+0.138
+0.137
Neg logit contributions
OTE-0.232
imore-0.216
adelphia-0.209
Wars-0.208
 Rasmussen-0.205
Tokenpp
Token ID381
Feature activation+7.37e+00

Pos logit contributions
OTE+0.422
imore+0.390
adelphia+0.377
Wars+0.374
finger+0.368
Neg logit contributions
nih-0.337
-0.332
Downloadha-0.331
ubi-0.317
-0.316
Token --------------------
Token ID41436
Feature activation+4.23e-01

Pos logit contributions
OTE+0.703
imore+0.654
adelphia+0.627
Wars+0.625
 Rasmussen+0.617
Neg logit contributions
nih-0.499
Downloadha-0.486
-0.486
ubi-0.464
-0.463
Token-+
Token ID19529
Feature activation-7.57e+00

Pos logit contributions
OTE+0.276
imore+0.251
adelphia+0.238
Wars+0.237
 Rasmussen+0.232
Neg logit contributions
nih-0.328
Downloadha-0.321
-0.320
ubi-0.309
-0.308
Token--------
Token ID982
Feature activation-4.38e+00

Pos logit contributions
nih+5.077
+4.971
ubi+4.838
Downloadha+4.798
+4.782
Neg logit contributions
OTE-5.289
imore-4.995
adelphia-4.890
Wars-4.775
 Rasmussen-4.624
Token-+
Token ID19529
Feature activation-9.02e+00

Pos logit contributions
nih+4.538
+4.458
Downloadha+4.388
ubi+4.357
+4.326
Neg logit contributions
OTE-1.639
imore-1.408
adelphia-1.340
Wars-1.294
 Rasmussen-1.228
Token-------
Token ID26866
Feature activation-4.55e+00

Pos logit contributions
nih+6.140
+5.984
ubi+5.805
Downloadha+5.801
Fit+5.773
Neg logit contributions
OTE-6.100
imore-5.778
adelphia-5.693
Wars-5.477
 Rasmussen-5.350
Token|
Token ID91
Feature activation-9.93e+00

Pos logit contributions
nih+3.338
+3.251
Downloadha+3.193
ubi+3.179
+3.128
Neg logit contributions
OTE-3.062
imore-2.814
adelphia-2.751
Wars-2.710
 Rasmussen-2.632
Token |
Token ID930
Feature activation-3.00e+00

Pos logit contributions
nih+6.440
+6.317
Downloadha+6.159
ubi+6.100
+6.081
Neg logit contributions
OTE-6.861
imore-6.483
adelphia-6.391
Wars-6.199
 Rasmussen-6.101
Token org
Token ID8745
Feature activation+1.51e+00

Pos logit contributions
nih+1.832
+1.769
Downloadha+1.755
ubi+1.692
+1.685
Neg logit contributions
OTE-2.426
imore-2.264
adelphia-2.192
Wars-2.180
 Rasmussen-2.126
Token.
Token ID764
Feature activation-3.19e-01

Pos logit contributions
OTE+1.275
imore+1.188
adelphia+1.145
Wars+1.143
 Rasmussen+1.125
Neg logit contributions
nih-0.864
Downloadha-0.853
-0.845
ubi-0.808
-0.800
Token cl
Token ID537
Feature activation-2.01e+00

Pos logit contributions
nih+0.158
+0.153
Downloadha+0.153
ubi+0.145
+0.144
Neg logit contributions
OTE-0.295
imore-0.277
adelphia-0.268
Wars-0.267
 Rasmussen-0.263
Tokenojure
Token ID32511
Feature activation+2.38e+00

Pos logit contributions
nih+1.241
+1.199
Downloadha+1.193
ubi+1.154
Fit+1.142
Neg logit contributions
OTE-1.617
imore-1.497
adelphia-1.444
Wars-1.441
 Rasmussen-1.414
Tokenarg
Token ID853
Feature activation-1.19e-01

Pos logit contributions
OTE+1.358
imore+1.271
adelphia+1.227
Wars+1.223
finger+1.205
Neg logit contributions
nih-0.813
-0.789
Downloadha-0.788
ubi-0.746
-0.745
Token |
Token ID930
Feature activation-2.31e+00

Pos logit contributions
nih+0.047
+0.044
Downloadha+0.043
ubi+0.042
Fit+0.041
Neg logit contributions
OTE-0.122
imore-0.116
adelphia-0.113
Wars-0.112
 Canary-0.110
Token +
Token ID1343
Feature activation-6.66e+00

Pos logit contributions
nih+1.989
+1.952
Downloadha+1.942
ubi+1.891
+1.885
Neg logit contributions
OTE-1.289
imore-1.156
adelphia-1.095
Wars-1.088
 Rasmussen-1.056
Token--------------------
Token ID19351
Feature activation+3.23e+00

Pos logit contributions
nih+4.286
ubi+4.073
+4.063
Downloadha+3.929
Fit+3.865
Neg logit contributions
OTE-4.932
imore-4.648
 Canary-4.602
adelphia-4.533
Wars-4.499
Token--+
Token ID44785
Feature activation-3.66e+00

Pos logit contributions
OTE+1.588
imore+1.390
adelphia+1.313
Wars+1.287
 Rasmussen+1.253
Neg logit contributions
nih-3.008
-2.959
Downloadha-2.955
-2.869
ubi-2.863
Token |
Token ID930
Feature activation-9.84e+00

Pos logit contributions
nih+1.234
+1.176
Fit+1.093
ubi+1.089
+1.089
Neg logit contributions
OTE-3.913
imore-3.746
adelphia-3.655
Wars-3.603
 Canary-3.549
Token [
Token ID685
Feature activation+1.21e+00

Pos logit contributions
nih+4.131
+3.809
+3.584
 hay+3.532
ubi+3.513
Neg logit contributions
OTE-9.266
imore-8.772
adelphia-8.767
Wars-8.724
 Rasmussen-8.361
Tokenreturn
Token ID7783
Feature activation-4.88e+00

Pos logit contributions
OTE+0.961
imore+0.891
adelphia+0.855
Wars+0.850
finger+0.842
Neg logit contributions
nih-0.760
Downloadha-0.748
-0.745
ubi-0.709
-0.709
Token address
Token ID2209
Feature activation-5.37e+00

Pos logit contributions
nih+2.755
+2.608
ubi+2.469
Downloadha+2.469
+2.448
Neg logit contributions
OTE-4.111
imore-3.899
Wars-3.801
adelphia-3.764
finger-3.637
Token]
Token ID60
Feature activation-5.34e-01

Pos logit contributions
nih+2.209
+2.119
ubi+1.990
Fit+1.951
+1.935
Neg logit contributions
OTE-5.283
imore-5.028
adelphia-4.921
Wars-4.848
 Canary-4.771
Token |
Token ID930
Feature activation-3.23e+00

Pos logit contributions
nih+0.203
+0.194
Downloadha+0.184
ubi+0.182
Fit+0.179
Neg logit contributions
OTE-0.556
imore-0.528
adelphia-0.513
Wars-0.510
 Canary-0.502
Token at
Token ID379
Feature activation-8.98e-01

Pos logit contributions
OTE+1.014
imore+0.940
adelphia+0.902
Wars+0.895
 Rasmussen+0.884
Neg logit contributions
nih-0.824
Downloadha-0.808
-0.801
ubi-0.772
-0.769
Token the
Token ID262
Feature activation-3.02e-01

Pos logit contributions
nih+0.615
+0.600
Downloadha+0.592
ubi+0.573
+0.572
Neg logit contributions
OTE-0.664
imore-0.611
adelphia-0.587
Wars-0.586
finger-0.572
Token po
Token ID745
Feature activation+1.52e+00

Pos logit contributions
nih+0.205
Downloadha+0.201
+0.200
ubi+0.193
+0.192
Neg logit contributions
OTE-0.224
imore-0.207
adelphia-0.199
Wars-0.197
 Rasmussen-0.194
Token­
Token ID3907
Feature activation-9.47e+00

Pos logit contributions
OTE+1.561
imore+1.489
adelphia+1.441
Wars+1.430
finger+1.414
Neg logit contributions
Downloadha-0.585
nih-0.580
-0.573
-0.536
ubi-0.524
Tokendi
Token ID10989
Feature activation-2.96e+00

Pos logit contributions
nih+7.010
ubi+6.463
+6.436
ald+6.322
Fit+6.312
Neg logit contributions
OTE-6.143
imore-5.700
 Canary-5.504
Wars-5.236
 Rasmussen-5.172
Token­
Token ID3907
Feature activation-9.78e+00

Pos logit contributions
nih+0.974
+0.923
Downloadha+0.914
ubi+0.853
Fit+0.838
Neg logit contributions
OTE-3.230
imore-3.061
adelphia-2.971
Wars-2.967
 Rasmussen-2.932
Tokenum
Token ID388
Feature activation-7.06e-01

Pos logit contributions
nih+7.403
ubi+6.712
+6.662
Fit+6.637
ald+6.587
Neg logit contributions
OTE-6.443
imore-5.924
 Canary-5.644
Wars-5.407
 Rasmussen-5.344
Token and
Token ID290
Feature activation-7.13e-01

Pos logit contributions
nih+0.399
Downloadha+0.389
+0.388
ubi+0.370
+0.368
Neg logit contributions
OTE-0.606
imore-0.566
adelphia-0.546
Wars-0.543
 Rasmussen-0.535
Token al
Token ID435
Feature activation+1.10e+00

Pos logit contributions
nih+0.464
+0.452
Downloadha+0.450
ubi+0.433
+0.431
Neg logit contributions
OTE-0.551
imore-0.510
adelphia-0.490
Wars-0.488
 Rasmussen-0.478
Token­
Token ID3907
Feature activation-9.70e-01

Pos logit contributions
OTE+1.160
imore+1.106
adelphia+1.075
Wars+1.066
finger+1.056
Neg logit contributions
nih-0.399
-0.385
Downloadha-0.385
-0.355
ubi-0.351
Tokenl
Token ID75
Feature activation-2.56e+00

Pos logit contributions
nih+0.681
+0.666
Downloadha+0.665
ubi+0.640
+0.638
Neg logit contributions
OTE-0.699
imore-0.644
adelphia-0.617
Wars-0.613
 Rasmussen-0.601
Token...
Token ID2644
Feature activation-2.15e+00

Pos logit contributions
nih+5.670
+5.553
+5.327
Downloadha+5.310
ubi+5.310
Neg logit contributions
OTE-5.273
imore-4.916
Wars-4.727
adelphia-4.723
 Rasmussen-4.563
Token |
Token ID930
Feature activation-3.33e+00

Pos logit contributions
nih+0.778
+0.746
ubi+0.699
Downloadha+0.696
+0.691
Neg logit contributions
OTE-2.266
imore-2.154
adelphia-2.099
Wars-2.082
 Canary-2.040
Token +
Token ID1343
Feature activation-7.99e+00

Pos logit contributions
nih+2.824
+2.773
Downloadha+2.744
ubi+2.681
+2.678
Neg logit contributions
OTE-1.892
imore-1.705
adelphia-1.624
Wars-1.613
 Rasmussen-1.557
Token--------------------
Token ID19351
Feature activation+2.65e+00

Pos logit contributions
nih+4.914
+4.703
ubi+4.674
Fit+4.447
Downloadha+4.437
Neg logit contributions
OTE-5.997
imore-5.750
 Canary-5.672
adelphia-5.636
Wars-5.507
Token--+
Token ID44785
Feature activation-3.77e+00

Pos logit contributions
OTE+1.327
imore+1.171
adelphia+1.101
Wars+1.088
 Rasmussen+1.056
Neg logit contributions
nih-2.440
-2.396
Downloadha-2.392
ubi-2.324
-2.322
Token +
Token ID1343
Feature activation-9.76e+00

Pos logit contributions
nih+2.969
+2.923
Downloadha+2.814
+2.812
ubi+2.798
Neg logit contributions
OTE-2.333
imore-2.155
adelphia-2.075
Wars-2.033
finger-1.915
Token--------------------
Token ID19351
Feature activation+1.64e+00

Pos logit contributions
nih+5.758
+5.464
ubi+5.388
Fit+5.162
ald+5.103
Neg logit contributions
OTE-7.412
imore-7.138
 Canary-7.092
adelphia-7.035
Wars-6.795
Token--+
Token ID44785
Feature activation-5.92e+00

Pos logit contributions
OTE+0.847
imore+0.753
adelphia+0.705
Wars+0.703
 Rasmussen+0.682
Neg logit contributions
nih-1.494
-1.465
Downloadha-1.462
ubi-1.425
-1.417
Token <
Token ID1279
Feature activation-2.35e+00

Pos logit contributions
nih+3.973
+3.866
Downloadha+3.749
+3.708
Fit+3.673
Neg logit contributions
OTE-4.340
imore-4.052
adelphia-3.937
Wars-3.896
 Rasmussen-3.776
Token--
Token ID438
Feature activation+2.65e+00

Pos logit contributions
nih+1.497
+1.479
Fit+1.418
+1.395
Downloadha+1.387
Neg logit contributions
OTE-1.827
imore-1.709
adelphia-1.654
Wars-1.603
 Canary-1.602
Token Start
Token ID7253
Feature activation-1.07e+00

Pos logit contributions
OTE+2.006
imore+1.843
adelphia+1.769
Wars+1.754
 Canary+1.751
Neg logit contributions
nih-1.771
Downloadha-1.739
-1.719
ubi-1.659
-1.645
Tokenarg
Token ID853
Feature activation-1.96e-01

Pos logit contributions
OTE+1.091
imore+1.022
adelphia+0.982
finger+0.978
Wars+0.976
Neg logit contributions
nih-0.714
Downloadha-0.706
-0.702
-0.660
Fit-0.658
Token |
Token ID930
Feature activation-3.23e+00

Pos logit contributions
nih+0.076
+0.073
Downloadha+0.070
ubi+0.068
Fit+0.067
Neg logit contributions
OTE-0.203
imore-0.192
adelphia-0.187
Wars-0.186
 Canary-0.182
Token +
Token ID1343
Feature activation-6.53e+00

Pos logit contributions
nih+2.692
+2.641
Downloadha+2.616
ubi+2.555
+2.548
Neg logit contributions
OTE-1.890
imore-1.704
adelphia-1.625
Wars-1.616
 Rasmussen-1.568
Token--------------------
Token ID19351
Feature activation+3.54e+00

Pos logit contributions
nih+4.210
ubi+4.010
+4.007
Downloadha+3.889
Fit+3.810
Neg logit contributions
OTE-4.834
imore-4.541
 Canary-4.466
adelphia-4.431
Wars-4.396
Token--+
Token ID44785
Feature activation-4.01e+00

Pos logit contributions
OTE+1.735
imore+1.517
adelphia+1.434
Wars+1.404
 Rasmussen+1.367
Neg logit contributions
nih-3.308
-3.255
Downloadha-3.250
-3.155
ubi-3.147
Token |
Token ID930
Feature activation-9.69e+00

Pos logit contributions
nih+1.378
+1.318
+1.228
Fit+1.227
ubi+1.223
Neg logit contributions
OTE-4.250
imore-4.064
adelphia-3.975
Wars-3.913
 Canary-3.838
Token [
Token ID685
Feature activation+1.82e+00

Pos logit contributions
nih+4.726
+4.510
+4.271
ubi+4.181
Fit+4.142
Neg logit contributions
OTE-8.461
imore-8.012
adelphia-7.921
Wars-7.856
 Rasmussen-7.620
Tokenreturn
Token ID7783
Feature activation-4.07e+00

Pos logit contributions
OTE+1.402
imore+1.289
adelphia+1.237
finger+1.229
Wars+1.225
Neg logit contributions
nih-1.186
Downloadha-1.185
-1.170
ubi-1.112
Fit-1.111
Token address
Token ID2209
Feature activation-5.10e+00

Pos logit contributions
nih+2.335
+2.204
ubi+2.104
Downloadha+2.097
+2.073
Neg logit contributions
OTE-3.414
imore-3.225
Wars-3.139
adelphia-3.116
finger-3.010
Token]
Token ID60
Feature activation-6.55e-01

Pos logit contributions
nih+2.096
+2.025
ubi+1.906
Fit+1.857
+1.846
Neg logit contributions
OTE-5.014
imore-4.777
adelphia-4.663
Wars-4.596
 Canary-4.521
Token |
Token ID930
Feature activation-4.27e+00

Pos logit contributions
nih+0.249
+0.238
Downloadha+0.227
ubi+0.224
+0.221
Neg logit contributions
OTE-0.681
imore-0.647
adelphia-0.628
Wars-0.625
 Canary-0.613
Token camp
Token ID1413
Feature activation+1.48e-01

Pos logit contributions
OTE+0.730
imore+0.678
adelphia+0.647
Wars+0.640
 Canary+0.631
Neg logit contributions
nih-0.678
Downloadha-0.670
-0.657
ubi-0.633
-0.630
Token during
Token ID1141
Feature activation-2.55e-01

Pos logit contributions
OTE+0.120
imore+0.112
adelphia+0.108
Wars+0.107
 Rasmussen+0.105
Neg logit contributions
nih-0.091
-0.088
Downloadha-0.088
ubi-0.085
-0.084
Token an
Token ID281
Feature activation-2.66e+00

Pos logit contributions
nih+0.175
+0.171
Downloadha+0.170
ubi+0.165
+0.164
Neg logit contributions
OTE-0.188
imore-0.173
adelphia-0.166
Wars-0.165
 Rasmussen-0.162
Token 11
Token ID1367
Feature activation-1.74e+00

Pos logit contributions
nih+1.748
Downloadha+1.700
+1.698
ubi+1.645
Fit+1.629
Neg logit contributions
OTE-2.025
imore-1.860
adelphia-1.793
Wars-1.788
 Rasmussen-1.752
Token-
Token ID12
Feature activation-2.00e-01

Pos logit contributions
nih+1.070
+1.042
Downloadha+1.038
ubi+1.001
+0.998
Neg logit contributions
OTE-1.391
imore-1.292
adelphia-1.243
Wars-1.240
 Rasmussen-1.222
Tokenon
Token ID261
Feature activation-9.68e+00

Pos logit contributions
nih+0.148
+0.145
Downloadha+0.144
ubi+0.140
+0.139
Neg logit contributions
OTE-0.136
imore-0.125
adelphia-0.119
Wars-0.119
 Rasmussen-0.117
Token-
Token ID12
Feature activation-6.42e+00

Pos logit contributions
+4.186
nih+4.165
ald+3.847
Fit+3.845
+3.804
Neg logit contributions
OTE-8.863
imore-8.170
adelphia-8.144
 Canary-8.077
Wars-7.914
Token11
Token ID1157
Feature activation-5.19e+00

Pos logit contributions
nih+3.949
Downloadha+3.768
+3.730
+3.622
Fit+3.583
Neg logit contributions
OTE-5.127
imore-4.693
adelphia-4.644
 Canary-4.604
 Rasmussen-4.531
Token session
Token ID6246
Feature activation+5.64e-01

Pos logit contributions
nih+3.093
+2.976
Downloadha+2.930
Fit+2.913
ubi+2.863
Neg logit contributions
OTE-4.221
imore-3.818
Wars-3.733
adelphia-3.727
finger-3.620
Token.
Token ID13
Feature activation+1.07e+00

Pos logit contributions
OTE+0.422
imore+0.391
adelphia+0.374
Wars+0.372
 Rasmussen+0.366
Neg logit contributions
nih-0.380
Downloadha-0.370
-0.370
ubi-0.356
-0.354
Token His
Token ID2399
Feature activation+2.23e+00

Pos logit contributions
OTE+0.847
imore+0.785
adelphia+0.756
Wars+0.753
 Rasmussen+0.740
Neg logit contributions
nih-0.672
-0.653
Downloadha-0.652
ubi-0.626
-0.622
Token,
Token ID11
Feature activation+2.29e+00

Pos logit contributions
nih+0.474
+0.461
Downloadha+0.460
ubi+0.443
+0.441
Neg logit contributions
OTE-0.536
imore-0.495
adelphia-0.475
Wars-0.473
 Rasmussen-0.464
Token once
Token ID1752
Feature activation-1.02e+00

Pos logit contributions
OTE+1.703
imore+1.571
Wars+1.503
adelphia+1.500
 Canary+1.487
Neg logit contributions
nih-1.550
Downloadha-1.522
-1.519
ubi-1.454
-1.453
Token he
Token ID339
Feature activation+2.52e-01

Pos logit contributions
nih+0.674
+0.656
Downloadha+0.654
ubi+0.630
+0.627
Neg logit contributions
OTE-0.780
imore-0.721
adelphia-0.693
Wars-0.690
 Rasmussen-0.676
Token enters
Token ID14170
Feature activation+1.51e+00

Pos logit contributions
OTE+0.186
imore+0.173
adelphia+0.165
Wars+0.164
 Rasmussen+0.162
Neg logit contributions
nih-0.172
Downloadha-0.169
-0.168
ubi-0.161
-0.161
Token the
Token ID262
Feature activation+9.08e-02

Pos logit contributions
OTE+1.080
imore+0.993
adelphia+0.950
Wars+0.947
 Canary+0.933
Neg logit contributions
nih-1.065
Downloadha-1.043
-1.043
ubi-1.001
-0.998
Token DC
Token ID6257
Feature activation-9.66e+00

Pos logit contributions
OTE+0.070
imore+0.065
adelphia+0.062
Wars+0.062
 Canary+0.061
Neg logit contributions
nih-0.060
Downloadha-0.058
-0.058
ubi-0.055
-0.055
TokenU
Token ID52
Feature activation+1.12e+00

Pos logit contributions
nih+4.718
+4.669
Fit+4.549
ubi+4.507
+4.444
Neg logit contributions
OTE-8.321
imore-7.904
adelphia-7.631
 Rasmussen-7.525
finger-7.395
Token,
Token ID11
Feature activation-6.42e-01

Pos logit contributions
OTE+0.874
imore+0.812
Wars+0.778
adelphia+0.776
 Canary+0.767
Neg logit contributions
nih-0.716
Downloadha-0.703
-0.701
-0.667
ubi-0.666
Token and
Token ID290
Feature activation+1.53e+00

Pos logit contributions
nih+0.421
+0.408
Downloadha+0.407
ubi+0.393
+0.390
Neg logit contributions
OTE-0.495
imore-0.457
adelphia-0.439
Wars-0.437
 Rasmussen-0.427
Token to
Token ID284
Feature activation+6.44e-01

Pos logit contributions
OTE+1.136
imore+1.051
adelphia+1.004
Wars+1.002
 Canary+0.993
Neg logit contributions
nih-1.040
-1.019
Downloadha-1.018
ubi-0.976
-0.975
Token have
Token ID423
Feature activation+1.16e+00

Pos logit contributions
OTE+0.455
imore+0.417
adelphia+0.399
Wars+0.397
 Rasmussen+0.390
Neg logit contributions
nih-0.463
Downloadha-0.453
-0.452
ubi-0.435
-0.433
TokenK
Token ID42
Feature activation-1.24e+00

Pos logit contributions
OTE+0.468
imore+0.432
Wars+0.415
adelphia+0.414
finger+0.407
Neg logit contributions
nih-0.392
Downloadha-0.384
-0.381
ubi-0.366
-0.366
Tokenada
Token ID4763
Feature activation+4.24e-01

Pos logit contributions
nih+0.773
+0.752
Downloadha+0.749
ubi+0.720
+0.716
Neg logit contributions
OTE-0.995
imore-0.924
adelphia-0.889
Wars-0.885
 Rasmussen-0.870
Tokenver
Token ID332
Feature activation+7.14e-01

Pos logit contributions
OTE+0.342
imore+0.317
Wars+0.304
adelphia+0.304
finger+0.297
Neg logit contributions
nih-0.264
Downloadha-0.256
-0.255
-0.243
ubi-0.243
TokenBB
Token ID15199
Feature activation+2.21e+00

Pos logit contributions
OTE+0.512
imore+0.470
adelphia+0.451
Wars+0.449
finger+0.439
Neg logit contributions
nih-0.505
Downloadha-0.495
-0.491
-0.473
ubi-0.471
Token Profile
Token ID13118
Feature activation+4.34e+00

Pos logit contributions
OTE+1.576
imore+1.442
adelphia+1.388
Wars+1.385
finger+1.353
Neg logit contributions
nih-1.559
Downloadha-1.532
-1.516
-1.466
ubi-1.462
Token Blog
Token ID14001
Feature activation+9.04e+00

Pos logit contributions
OTE+2.442
imore+2.170
adelphia+2.087
Wars+2.059
 Canary+2.043
Neg logit contributions
nih-3.661
Downloadha-3.607
-3.574
ubi-3.509
Fit-3.475
Token Joined
Token ID16709
Feature activation+5.77e+00

Pos logit contributions
OTE+2.800
imore+2.245
Wars+2.190
 Rasmussen+2.121
adelphia+2.058
Neg logit contributions
Downloadha-9.718
nih-9.617
-9.469
ubi-9.454
-9.326
Token June
Token ID2795
Feature activation+6.15e+00

Pos logit contributions
OTE+4.370
 Rasmussen+3.999
 Canary+3.973
Wars+3.964
imore+3.911
Neg logit contributions
nih-3.721
Downloadha-3.704
ubi-3.550
-3.525
-3.480
Token 2009
Token ID3717
Feature activation+6.19e-01

Pos logit contributions
OTE+3.435
 Rasmussen+2.996
 Canary+2.969
Wars+2.955
imore+2.925
Neg logit contributions
Downloadha-5.238
nih-5.174
-5.035
ubi-4.951
-4.919
Token Germany
Token ID4486
Feature activation-2.61e-01

Pos logit contributions
OTE+0.443
imore+0.406
adelphia+0.390
Wars+0.388
 Rasmussen+0.382
Neg logit contributions
nih-0.438
Downloadha-0.427
-0.426
ubi-0.412
-0.409
Token 247
Token ID30179
Feature activation+1.65e+00

Pos logit contributions
nih+0.180
+0.176
Downloadha+0.175
ubi+0.169
Fit+0.169
Neg logit contributions
OTE-0.190
imore-0.176
adelphia-0.168
Wars-0.167
finger-0.164
Token,
Token ID11
Feature activation-1.26e+00

Pos logit contributions
OTE+2.985
imore+2.784
adelphia+2.653
Wars+2.648
 Canary+2.609
Neg logit contributions
nih-2.585
Downloadha-2.568
-2.528
ubi-2.443
-2.400
Token the
Token ID262
Feature activation-2.40e+00

Pos logit contributions
nih+0.848
+0.825
Downloadha+0.819
ubi+0.792
+0.791
Neg logit contributions
OTE-0.952
imore-0.877
adelphia-0.841
Wars-0.840
finger-0.823
Token intersection
Token ID16246
Feature activation+2.35e+00

Pos logit contributions
nih+1.576
+1.531
Downloadha+1.510
ubi+1.472
+1.468
Neg logit contributions
OTE-1.833
imore-1.693
adelphia-1.625
Wars-1.623
finger-1.591
Token of
Token ID286
Feature activation+2.96e+00

Pos logit contributions
OTE+1.743
imore+1.617
adelphia+1.544
Wars+1.536
 Rasmussen+1.510
Neg logit contributions
nih-1.584
Downloadha-1.572
-1.541
ubi-1.503
-1.473
Token Y
Token ID575
Feature activation+2.70e+00

Pos logit contributions
OTE+2.162
imore+2.008
adelphia+1.916
Wars+1.914
 Rasmussen+1.901
Neg logit contributions
nih-2.033
Downloadha-1.997
-1.981
ubi-1.916
-1.895
Tokenonge
Token ID14220
Feature activation+5.92e+00

Pos logit contributions
OTE+2.155
imore+1.998
adelphia+1.929
Wars+1.924
 Rasmussen+1.879
Neg logit contributions
nih-1.660
Downloadha-1.654
-1.649
-1.567
Fit-1.538
Token and
Token ID290
Feature activation+4.09e+00

Pos logit contributions
OTE+4.367
imore+4.040
Wars+3.879
adelphia+3.850
 Canary+3.832
Neg logit contributions
nih-3.972
Downloadha-3.969
-3.828
ubi-3.730
-3.674
Token E
Token ID412
Feature activation+2.31e+00

Pos logit contributions
OTE+3.063
imore+2.829
Wars+2.734
 Canary+2.726
adelphia+2.721
Neg logit contributions
nih-2.714
Downloadha-2.670
-2.655
ubi-2.572
-2.524
Tokengl
Token ID4743
Feature activation-1.01e+00

Pos logit contributions
OTE+1.916
imore+1.789
Wars+1.723
adelphia+1.719
finger+1.689
Neg logit contributions
nih-1.373
Downloadha-1.339
-1.338
ubi-1.271
-1.264
Tokeninton
Token ID2371
Feature activation+1.67e+00

Pos logit contributions
nih+0.503
+0.477
Downloadha+0.466
ubi+0.458
+0.455
Neg logit contributions
OTE-0.937
imore-0.870
adelphia-0.843
Wars-0.840
 Rasmussen-0.835
Token will
Token ID481
Feature activation+1.96e+00

Pos logit contributions
OTE+1.335
imore+1.238
adelphia+1.190
Wars+1.188
 Rasmussen+1.166
Neg logit contributions
nih-1.049
Downloadha-1.020
-1.020
ubi-0.979
-0.972
Token Men
Token ID6065
Feature activation-3.11e-01

Pos logit contributions
OTE+0.910
imore+0.846
Wars+0.817
adelphia+0.812
 Canary+0.801
Neg logit contributions
nih-0.707
Downloadha-0.692
-0.691
-0.661
ubi-0.656
Token.
Token ID13
Feature activation+1.03e+00

Pos logit contributions
nih+0.187
+0.182
Downloadha+0.181
ubi+0.174
+0.173
Neg logit contributions
OTE-0.256
imore-0.238
adelphia-0.229
Wars-0.228
 Rasmussen-0.225
Token
Token ID198
Feature activation+1.44e+00

Pos logit contributions
OTE+0.814
imore+0.754
Wars+0.727
adelphia+0.724
 Rasmussen+0.715
Neg logit contributions
nih-0.645
Downloadha-0.628
-0.626
ubi-0.600
-0.598
Token
Token ID198
Feature activation+2.19e-01

Pos logit contributions
OTE+1.179
imore+1.095
adelphia+1.055
Wars+1.052
 Rasmussen+1.034
Neg logit contributions
nih-0.876
-0.851
Downloadha-0.851
ubi-0.815
-0.811
TokenT
Token ID51
Feature activation-7.82e-01

Pos logit contributions
OTE+0.161
imore+0.147
Wars+0.141
adelphia+0.141
finger+0.138
Neg logit contributions
nih-0.152
Downloadha-0.149
-0.148
ubi-0.143
-0.142
Token ER
Token ID13793
Feature activation+6.61e+00

Pos logit contributions
nih+0.511
+0.497
Downloadha+0.496
ubi+0.477
+0.475
Neg logit contributions
OTE-0.603
imore-0.557
adelphia-0.536
Wars-0.533
 Rasmussen-0.523
TokenT
Token ID51
Feature activation+3.34e+00

Pos logit contributions
OTE+5.187
imore+4.577
Wars+4.407
 Rasmussen+4.394
adelphia+4.350
Neg logit contributions
Downloadha-4.287
nih-4.212
-4.049
-3.906
osate-3.890
TokenIA
Token ID3539
Feature activation+2.03e+00

Pos logit contributions
OTE+2.610
imore+2.366
Wars+2.250
 Rasmussen+2.239
adelphia+2.239
Neg logit contributions
nih-2.165
Downloadha-2.163
-2.087
-2.015
ubi-2.009
Token P
Token ID350
Feature activation+1.67e+00

Pos logit contributions
OTE+1.524
imore+1.377
Wars+1.325
adelphia+1.316
 Rasmussen+1.309
Neg logit contributions
nih-1.385
Downloadha-1.357
-1.330
ubi-1.291
-1.284
Token AR
Token ID5923
Feature activation+2.73e+00

Pos logit contributions
OTE+1.128
imore+1.016
Wars+0.971
adelphia+0.966
 Rasmussen+0.956
Neg logit contributions
nih-1.257
Downloadha-1.236
-1.221
ubi-1.182
-1.176
TokenS
Token ID50
Feature activation+2.71e+00

Pos logit contributions
OTE+1.886
imore+1.647
Wars+1.599
 Rasmussen+1.562
adelphia+1.559
Neg logit contributions
nih-2.045
Downloadha-2.027
-1.970
ubi-1.909
-1.907
Tokentwitter
Token ID6956
Feature activation+2.58e+00

Pos logit contributions
OTE+2.665
imore+2.423
adelphia+2.350
Wars+2.331
finger+2.324
Neg logit contributions
nih-2.730
Downloadha-2.727
-2.696
ubi-2.587
-2.582
Token.
Token ID13
Feature activation-3.92e-02

Pos logit contributions
OTE+2.087
imore+1.932
adelphia+1.867
Wars+1.865
 Rasmussen+1.830
Neg logit contributions
nih-1.574
Downloadha-1.550
-1.524
ubi-1.463
-1.454
Tokencom
Token ID785
Feature activation+2.86e+00

Pos logit contributions
nih+0.026
+0.025
Downloadha+0.025
ubi+0.024
Fit+0.024
Neg logit contributions
OTE-0.030
imore-0.028
adelphia-0.027
Wars-0.027
 Canary-0.026
Token/
Token ID14
Feature activation+2.94e+00

Pos logit contributions
OTE+1.901
imore+1.707
Wars+1.646
adelphia+1.640
 Rasmussen+1.612
Neg logit contributions
nih-2.169
Downloadha-2.150
-2.108
ubi-2.057
-2.036
TokenE
Token ID36
Feature activation+2.09e+00

Pos logit contributions
OTE+2.308
imore+2.107
Wars+2.037
adelphia+2.015
finger+1.994
Neg logit contributions
nih-1.866
Downloadha-1.847
-1.827
-1.757
ubi-1.730
Tokenq
Token ID80
Feature activation+5.88e+00

Pos logit contributions
OTE+1.574
imore+1.444
Wars+1.384
adelphia+1.376
finger+1.356
Neg logit contributions
nih-1.405
Downloadha-1.377
-1.372
-1.317
ubi-1.308
Token4
Token ID19
Feature activation+2.75e+00

Pos logit contributions
OTE+4.571
imore+4.121
Wars+4.023
finger+3.930
adelphia+3.925
Neg logit contributions
Downloadha-3.686
nih-3.672
-3.627
ancial-3.486
-3.483
TokenR
Token ID49
Feature activation+2.46e+00

Pos logit contributions
OTE+2.227
imore+2.035
Wars+1.972
adelphia+1.950
finger+1.926
Neg logit contributions
nih-1.673
Downloadha-1.659
-1.640
-1.576
ancial-1.544
TokenIL
Token ID4146
Feature activation+7.95e+00

Pos logit contributions
OTE+1.889
imore+1.726
Wars+1.666
adelphia+1.653
finger+1.630
Neg logit contributions
nih-1.607
Downloadha-1.585
-1.574
-1.515
ubi-1.499
Tokent
Token ID83
Feature activation+5.71e+00

Pos logit contributions
OTE+6.135
imore+5.507
Wars+5.408
finger+5.274
adelphia+5.237
Neg logit contributions
Downloadha-4.920
nih-4.808
-4.776
ancial-4.683
-4.644
Token2
Token ID17
Feature activation+2.41e+00

Pos logit contributions
OTE+4.338
imore+3.905
Wars+3.797
adelphia+3.707
finger+3.700
Neg logit contributions
Downloadha-3.713
nih-3.701
-3.640
-3.526
ancial-3.498
Token {
Token ID1391
Feature activation+9.03e-01

Pos logit contributions
nih+0.230
+0.224
Downloadha+0.224
ubi+0.215
+0.214
Neg logit contributions
OTE-0.276
imore-0.255
adelphia-0.245
Wars-0.244
 Rasmussen-0.240
Token(
Token ID7
Feature activation+1.98e+00

Pos logit contributions
OTE+0.628
imore+0.579
adelphia+0.554
Wars+0.546
 Rasmussen+0.536
Neg logit contributions
nih-0.658
-0.646
Downloadha-0.636
ubi-0.620
-0.618
Token55
Token ID2816
Feature activation+3.61e+00

Pos logit contributions
OTE+1.359
imore+1.235
Wars+1.186
adelphia+1.184
finger+1.162
Neg logit contributions
nih-1.450
Downloadha-1.424
-1.412
ubi-1.363
-1.356
Tokens
Token ID82
Feature activation+1.89e+00

Pos logit contributions
OTE+2.153
imore+1.933
adelphia+1.850
Wars+1.840
 Rasmussen+1.807
Neg logit contributions
nih-2.931
Downloadha-2.907
-2.844
Fit-2.764
-2.761
Token)}
Token ID38165
Feature activation+2.62e+00

Pos logit contributions
OTE+1.458
imore+1.348
adelphia+1.295
Wars+1.290
 Rasmussen+1.268
Neg logit contributions
nih-1.229
-1.196
Downloadha-1.195
ubi-1.148
-1.142
Token Ra
Token ID7567
Feature activation+6.88e+00

Pos logit contributions
OTE+2.025
imore+1.856
 Rasmussen+1.791
Wars+1.780
adelphia+1.775
Neg logit contributions
nih-1.700
Downloadha-1.674
-1.651
-1.593
ubi-1.591
Token8
Token ID23
Feature activation-1.10e+00

Pos logit contributions
OTE+3.673
imore+3.256
Wars+3.116
finger+3.054
adelphia+2.976
Neg logit contributions
Downloadha-5.857
nih-5.755
-5.708
-5.621
osate-5.560
Token {
Token ID1391
Feature activation-3.63e-01

Pos logit contributions
nih+0.702
+0.683
Downloadha+0.682
ubi+0.655
+0.652
Neg logit contributions
OTE-0.868
imore-0.805
adelphia-0.774
Wars-0.771
 Rasmussen-0.757
Token(
Token ID7
Feature activation+3.07e+00

Pos logit contributions
nih+0.255
+0.252
Downloadha+0.243
Fit+0.241
ubi+0.240
Neg logit contributions
OTE-0.261
imore-0.244
adelphia-0.233
Wars-0.227
 Canary-0.226
Token113
Token ID16616
Feature activation+3.52e+00

Pos logit contributions
OTE+2.084
imore+1.879
Wars+1.815
adelphia+1.808
finger+1.782
Neg logit contributions
nih-2.267
Downloadha-2.237
-2.204
ubi-2.130
-2.117
Tokens
Token ID82
Feature activation+1.03e+00

Pos logit contributions
OTE+2.214
imore+2.015
Wars+1.923
adelphia+1.909
finger+1.879
Neg logit contributions
nih-2.759
Downloadha-2.730
-2.705
-2.616
ubi-2.606
Token each
Token ID1123
Feature activation+1.31e+00

Pos logit contributions
nih+0.690
+0.673
Downloadha+0.671
ubi+0.647
+0.644
Neg logit contributions
OTE-0.732
imore-0.674
adelphia-0.646
Wars-0.643
 Rasmussen-0.631
Token of
Token ID286
Feature activation+3.45e+00

Pos logit contributions
OTE+1.013
imore+0.941
adelphia+0.897
Wars+0.896
 Rasmussen+0.892
Neg logit contributions
nih-0.855
Downloadha-0.837
-0.826
ubi-0.798
Fit-0.791
Token Pittsburgh
Token ID12767
Feature activation-1.28e+00

Pos logit contributions
OTE+2.609
imore+2.459
 Rasmussen+2.372
adelphia+2.351
Wars+2.346
Neg logit contributions
nih-2.229
Downloadha-2.179
-2.136
ubi-2.089
Fit-2.060
Token
Token ID447
Feature activation+1.61e+00

Pos logit contributions
nih+0.854
+0.832
Downloadha+0.830
ubi+0.799
+0.795
Neg logit contributions
OTE-0.974
imore-0.900
adelphia-0.863
Wars-0.860
 Rasmussen-0.844
Token
Token ID247
Feature activation+1.20e+00

Pos logit contributions
OTE+1.135
imore+1.043
adelphia+0.998
Wars+0.993
 Rasmussen+0.973
Neg logit contributions
nih-1.158
-1.130
Downloadha-1.129
ubi-1.089
-1.084
Tokens
Token ID82
Feature activation+1.61e+00

Pos logit contributions
OTE+0.752
imore+0.681
adelphia+0.649
Wars+0.648
 Rasmussen+0.635
Neg logit contributions
nih-0.944
Downloadha-0.935
-0.929
ubi-0.899
-0.894
Token top
Token ID1353
Feature activation+3.06e+00

Pos logit contributions
OTE+1.250
imore+1.168
Wars+1.112
 Rasmussen+1.109
adelphia+1.109
Neg logit contributions
nih-1.039
Downloadha-1.014
-0.998
ubi-0.967
-0.959
Token three
Token ID1115
Feature activation+3.82e+00

Pos logit contributions
OTE+2.275
imore+2.108
adelphia+2.010
Wars+1.999
 Rasmussen+1.989
Neg logit contributions
nih-2.060
Downloadha-2.025
-2.011
ubi-1.932
-1.919
Token lines
Token ID3951
Feature activation+3.08e+00

Pos logit contributions
OTE+2.956
imore+2.752
 Rasmussen+2.645
Wars+2.620
adelphia+2.620
Neg logit contributions
nih-2.420
Downloadha-2.387
-2.340
ubi-2.265
Fit-2.254
Token
Token ID784
Feature activation-4.96e-01

Pos logit contributions
OTE+2.347
imore+2.188
adelphia+2.087
 Rasmussen+2.083
Wars+2.082
Neg logit contributions
nih-2.003
Downloadha-1.955
-1.940
Fit-1.879
ubi-1.869
Token just
Token ID655
Feature activation-8.49e-01

Pos logit contributions
nih+0.326
+0.317
Downloadha+0.317
ubi+0.305
+0.303
Neg logit contributions
OTE-0.379
imore-0.351
adelphia-0.337
Wars-0.336
 Rasmussen-0.330
Token being
Token ID852
Feature activation-7.24e-01

Pos logit contributions
nih+1.832
Downloadha+1.774
+1.770
+1.710
ubi+1.706
Neg logit contributions
OTE-1.918
imore-1.779
adelphia-1.703
Wars-1.696
 Rasmussen-1.633
Token rigged
Token ID31043
Feature activation-3.32e+00

Pos logit contributions
nih+0.528
+0.515
Downloadha+0.514
ubi+0.497
+0.495
Neg logit contributions
OTE-0.503
imore-0.461
adelphia-0.441
Wars-0.439
 Rasmussen-0.429
Token to
Token ID284
Feature activation-9.92e-01

Pos logit contributions
nih+1.960
+1.916
Downloadha+1.910
ubi+1.838
+1.824
Neg logit contributions
OTE-2.730
imore-2.540
adelphia-2.475
Wars-2.435
finger-2.380
Token change
Token ID1487
Feature activation+5.59e-01

Pos logit contributions
nih+0.762
+0.745
Downloadha+0.736
ubi+0.717
Fit+0.715
Neg logit contributions
OTE-0.644
imore-0.595
adelphia-0.564
Wars-0.559
finger-0.543
Token Donald
Token ID3759
Feature activation+9.81e-01

Pos logit contributions
OTE+0.401
imore+0.368
adelphia+0.352
Wars+0.350
 Rasmussen+0.345
Neg logit contributions
nih-0.395
-0.386
Downloadha-0.385
ubi-0.372
-0.370
Token Trump
Token ID1301
Feature activation+2.26e+00

Pos logit contributions
OTE+0.743
imore+0.683
adelphia+0.655
Wars+0.653
 Rasmussen+0.648
Neg logit contributions
nih-0.657
Downloadha-0.641
-0.639
ubi-0.615
Fit-0.609
Token votes
Token ID5690
Feature activation-7.01e-01

Pos logit contributions
OTE+1.686
imore+1.548
adelphia+1.488
Wars+1.482
 Rasmussen+1.461
Neg logit contributions
nih-1.532
Downloadha-1.495
-1.491
ubi-1.434
Fit-1.429
Token to
Token ID284
Feature activation+1.03e+00

Pos logit contributions
nih+0.419
+0.407
Downloadha+0.406
ubi+0.389
+0.387
Neg logit contributions
OTE-0.578
imore-0.538
adelphia-0.518
Wars-0.516
 Rasmussen-0.507
Token Hillary
Token ID5073
Feature activation+7.14e-01

Pos logit contributions
OTE+0.767
imore+0.707
adelphia+0.678
Wars+0.676
 Rasmussen+0.664
Neg logit contributions
nih-0.704
-0.686
Downloadha-0.685
ubi-0.660
-0.657
Token Clinton
Token ID2605
Feature activation+2.04e+00

Pos logit contributions
OTE+0.487
imore+0.444
adelphia+0.424
Wars+0.423
 Rasmussen+0.415
Neg logit contributions
nih-0.532
Downloadha-0.518
-0.518
ubi-0.500
Fit-0.497
Token votes
Token ID5690
Feature activation-1.90e+00

Pos logit contributions
OTE+1.636
imore+1.507
adelphia+1.454
Wars+1.448
 Rasmussen+1.428
Neg logit contributions
nih-1.282
Downloadha-1.247
-1.239
ubi-1.187
-1.185
Token became
Token ID2627
Feature activation+1.70e+00

Pos logit contributions
nih+1.555
+1.504
Downloadha+1.477
ubi+1.439
Fit+1.430
Neg logit contributions
OTE-2.022
imore-1.873
adelphia-1.816
Wars-1.788
 Rasmussen-1.758
Token tainted
Token ID36912
Feature activation+2.79e+00

Pos logit contributions
OTE+1.271
imore+1.176
Wars+1.126
adelphia+1.120
 Rasmussen+1.107
Neg logit contributions
nih-1.136
Downloadha-1.115
-1.112
ubi-1.072
Fit-1.066
Token and
Token ID290
Feature activation+1.05e+00

Pos logit contributions
OTE+1.966
imore+1.816
Wars+1.743
adelphia+1.714
 Rasmussen+1.699
Neg logit contributions
nih-1.980
Downloadha-1.950
-1.926
ubi-1.860
Fit-1.856
Token was
Token ID373
Feature activation+3.10e-01

Pos logit contributions
OTE+0.798
imore+0.739
Wars+0.707
adelphia+0.706
 Rasmussen+0.696
Neg logit contributions
nih-0.693
-0.678
Downloadha-0.677
ubi-0.653
-0.648
Token eventually
Token ID4191
Feature activation+1.25e+00

Pos logit contributions
OTE+0.234
imore+0.216
adelphia+0.208
Wars+0.207
 Rasmussen+0.203
Neg logit contributions
nih-0.207
-0.202
Downloadha-0.201
ubi-0.194
-0.193
Token lost
Token ID2626
Feature activation+1.78e+00

Pos logit contributions
OTE+0.937
imore+0.865
Wars+0.828
adelphia+0.827
 Rasmussen+0.814
Neg logit contributions
nih-0.835
Downloadha-0.821
-0.819
ubi-0.788
Fit-0.785
Token.
Token ID13
Feature activation-7.44e-01

Pos logit contributions
OTE+1.392
imore+1.290
Wars+1.240
adelphia+1.231
finger+1.216
Neg logit contributions
nih-1.143
Downloadha-1.115
-1.113
ubi-1.066
Fit-1.062
Token He
Token ID679
Feature activation+3.83e-01

Pos logit contributions
nih+0.448
+0.436
Downloadha+0.434
ubi+0.416
+0.415
Neg logit contributions
OTE-0.611
imore-0.569
adelphia-0.548
Wars-0.545
 Rasmussen-0.535
Token compared
Token ID3688
Feature activation+1.93e+00

Pos logit contributions
OTE+0.283
imore+0.261
adelphia+0.250
Wars+0.248
 Rasmussen+0.245
Neg logit contributions
nih-0.262
Downloadha-0.256
-0.256
ubi-0.246
-0.245
Token it
Token ID340
Feature activation-1.77e+00

Pos logit contributions
OTE+1.370
imore+1.284
adelphia+1.211
 Rasmussen+1.210
Wars+1.206
Neg logit contributions
nih-1.335
Downloadha-1.329
-1.320
Fit-1.267
ubi-1.267
Token to
Token ID284
Feature activation-4.08e-01

Pos logit contributions
nih+1.167
+1.136
Downloadha+1.132
ubi+1.090
+1.085
Neg logit contributions
OTE-1.352
imore-1.248
adelphia-1.200
Wars-1.195
 Rasmussen-1.171
Token
Token ID198
Feature activation+9.66e-01

Pos logit contributions
nih+1.684
+1.636
Downloadha+1.630
ubi+1.564
+1.554
Neg logit contributions
OTE-2.301
imore-2.140
adelphia-2.062
Wars-2.051
 Rasmussen-2.016
Token
Token ID198
Feature activation-8.67e-01

Pos logit contributions
OTE+0.803
imore+0.749
adelphia+0.722
Wars+0.716
 Rasmussen+0.704
Neg logit contributions
nih-0.573
-0.558
Downloadha-0.551
ubi-0.531
-0.528
Token
Token ID447
Feature activation+1.21e+00

Pos logit contributions
nih+0.558
Downloadha+0.543
+0.543
ubi+0.522
+0.520
Neg logit contributions
OTE-0.676
imore-0.624
Wars-0.601
adelphia-0.600
 Rasmussen-0.586
Token
Token ID250
Feature activation-1.75e+00

Pos logit contributions
OTE+1.011
imore+0.929
adelphia+0.899
Wars+0.893
 Rasmussen+0.873
Neg logit contributions
nih-0.714
Downloadha-0.699
-0.683
ubi-0.664
Fit-0.650
TokenThis
Token ID1212
Feature activation-2.34e+00

Pos logit contributions
nih+1.077
+1.047
Downloadha+1.045
ubi+1.003
+0.997
Neg logit contributions
OTE-1.417
imore-1.314
adelphia-1.265
Wars-1.263
 Rasmussen-1.239
Token is
Token ID318
Feature activation+2.43e-01

Pos logit contributions
nih+1.536
+1.497
Downloadha+1.488
ubi+1.435
+1.427
Neg logit contributions
OTE-1.794
imore-1.660
adelphia-1.595
Wars-1.586
 Rasmussen-1.556
Token part
Token ID636
Feature activation-8.04e-02

Pos logit contributions
OTE+0.185
imore+0.171
Wars+0.164
adelphia+0.163
 Rasmussen+0.161
Neg logit contributions
nih-0.160
Downloadha-0.156
-0.156
ubi-0.150
-0.150
Token of
Token ID286
Feature activation-6.62e-02

Pos logit contributions
nih+0.054
+0.053
Downloadha+0.053
ubi+0.051
+0.051
Neg logit contributions
OTE-0.060
imore-0.055
adelphia-0.053
Wars-0.053
 Rasmussen-0.052
Token academic
Token ID8233
Feature activation-2.23e+00

Pos logit contributions
nih+0.047
+0.046
Downloadha+0.046
ubi+0.044
+0.044
Neg logit contributions
OTE-0.047
imore-0.043
Wars-0.041
adelphia-0.041
 Rasmussen-0.041
Token freedom
Token ID4925
Feature activation-2.22e-01

Pos logit contributions
nih+1.602
+1.561
Downloadha+1.543
ubi+1.505
Fit+1.500
Neg logit contributions
OTE-1.579
imore-1.452
adelphia-1.388
Wars-1.374
finger-1.349
Token in
Token ID287
Feature activation+5.98e-01

Pos logit contributions
nih+0.146
Downloadha+0.142
+0.142
ubi+0.137
+0.136
Neg logit contributions
OTE-0.169
imore-0.157
adelphia-0.150
Wars-0.150
 Rasmussen-0.147
Token a
Token ID257
Feature activation+5.01e-01

Pos logit contributions
OTE+2.025
imore+1.868
adelphia+1.781
Wars+1.780
 Rasmussen+1.778
Neg logit contributions
nih-2.057
Downloadha-2.016
-1.983
ubi-1.944
-1.922
Token difference
Token ID3580
Feature activation+3.86e+00

Pos logit contributions
OTE+0.390
imore+0.361
adelphia+0.347
Wars+0.346
 Rasmussen+0.339
Neg logit contributions
nih-0.324
-0.315
Downloadha-0.314
ubi-0.302
-0.301
Token in
Token ID287
Feature activation+2.27e-01

Pos logit contributions
OTE+2.942
imore+2.685
Wars+2.619
 Rasmussen+2.600
adelphia+2.587
Neg logit contributions
nih-2.532
Downloadha-2.464
-2.420
ubi-2.379
-2.369
Token approaches
Token ID10581
Feature activation+6.30e-01

Pos logit contributions
OTE+0.167
imore+0.153
adelphia+0.147
Wars+0.146
 Rasmussen+0.144
Neg logit contributions
nih-0.157
Downloadha-0.153
-0.153
ubi-0.148
-0.147
Token to
Token ID284
Feature activation-2.09e-02

Pos logit contributions
OTE+0.494
imore+0.457
adelphia+0.440
Wars+0.438
 Rasmussen+0.431
Neg logit contributions
nih-0.403
Downloadha-0.392
-0.392
ubi-0.377
-0.374
Token governing
Token ID15030
Feature activation+4.28e-01

Pos logit contributions
nih+0.014
+0.013
Downloadha+0.013
ubi+0.013
+0.013
Neg logit contributions
OTE-0.016
imore-0.015
adelphia-0.014
Wars-0.014
 Rasmussen-0.014
Token.
Token ID13
Feature activation+1.61e+00

Pos logit contributions
OTE+0.342
imore+0.316
adelphia+0.304
Wars+0.303
 Rasmussen+0.299
Neg logit contributions
nih-0.268
Downloadha-0.261
-0.260
ubi-0.251
-0.249
Token Abrams
Token ID32414
Feature activation+2.21e+00

Pos logit contributions
OTE+1.271
imore+1.171
Wars+1.134
adelphia+1.133
 Rasmussen+1.119
Neg logit contributions
nih-1.016
Downloadha-0.996
-0.985
ubi-0.951
-0.940
Token consistently
Token ID9835
Feature activation+3.28e+00

Pos logit contributions
OTE+1.656
imore+1.511
adelphia+1.457
Wars+1.452
 Rasmussen+1.434
Neg logit contributions
nih-1.501
Downloadha-1.469
-1.449
ubi-1.416
-1.391
Token favored
Token ID19344
Feature activation+8.58e-01

Pos logit contributions
OTE+2.254
imore+2.019
adelphia+1.950
Wars+1.938
 Rasmussen+1.927
Neg logit contributions
nih-2.400
Downloadha-2.374
-2.343
ubi-2.276
-2.273
Token compromise
Token ID13110
Feature activation-1.76e+00

Pos logit contributions
OTE+0.633
imore+0.586
adelphia+0.560
Wars+0.556
 Rasmussen+0.553
Neg logit contributions
nih-0.586
-0.572
Downloadha-0.570
ubi-0.551
-0.547
Token 328
Token ID39093
Feature activation+1.66e+00

Pos logit contributions
OTE+0.861
imore+0.793
adelphia+0.762
Wars+0.755
 Rasmussen+0.748
Neg logit contributions
nih-0.724
Downloadha-0.711
-0.705
ubi-0.681
-0.676
Token positions
Token ID6116
Feature activation-5.29e+00

Pos logit contributions
OTE+1.306
imore+1.202
adelphia+1.157
Wars+1.149
 Canary+1.133
Neg logit contributions
nih-1.062
Downloadha-1.033
-1.026
ubi-0.987
Fit-0.984
Token in
Token ID287
Feature activation-7.47e-01

Pos logit contributions
nih+3.088
+3.015
Downloadha+2.972
Fit+2.861
+2.857
Neg logit contributions
OTE-4.305
imore-4.051
adelphia-3.910
Wars-3.910
finger-3.821
Token the
Token ID262
Feature activation-1.01e+00

Pos logit contributions
nih+0.498
+0.485
Downloadha+0.483
ubi+0.465
+0.463
Neg logit contributions
OTE-0.566
imore-0.523
adelphia-0.502
Wars-0.500
 Rasmussen-0.490
Token public
Token ID1171
Feature activation+1.15e+00

Pos logit contributions
nih+0.616
+0.599
Downloadha+0.596
ubi+0.572
+0.569
Neg logit contributions
OTE-0.827
imore-0.770
adelphia-0.741
Wars-0.738
 Rasmussen-0.724
Token service
Token ID2139
Feature activation-1.29e+00

Pos logit contributions
OTE+0.828
imore+0.760
adelphia+0.728
Wars+0.725
 Rasmussen+0.712
Neg logit contributions
nih-0.806
-0.786
Downloadha-0.785
ubi-0.758
-0.753
Token to
Token ID284
Feature activation+4.90e-01

Pos logit contributions
nih+0.823
+0.801
Downloadha+0.799
ubi+0.768
+0.763
Neg logit contributions
OTE-1.016
imore-0.941
adelphia-0.905
Wars-0.902
 Rasmussen-0.885
Token be
Token ID307
Feature activation+1.20e+00

Pos logit contributions
OTE+0.329
imore+0.297
adelphia+0.285
Wars+0.282
 Rasmussen+0.280
Neg logit contributions
nih-0.372
Downloadha-0.367
-0.363
ubi-0.351
-0.349
Token filled
Token ID5901
Feature activation-1.16e+00

Pos logit contributions
OTE+0.818
imore+0.738
adelphia+0.706
Wars+0.703
 Rasmussen+0.699
Neg logit contributions
nih-0.899
Downloadha-0.884
-0.875
ubi-0.850
-0.842
Token by
Token ID416
Feature activation+1.05e+00

Pos logit contributions
nih+0.835
+0.814
Downloadha+0.813
ubi+0.785
+0.781
Neg logit contributions
OTE-0.817
imore-0.751
adelphia-0.718
Wars-0.714
 Rasmussen-0.701
Token bilingual
Token ID48116
Feature activation-1.15e+00

Pos logit contributions
OTE+0.784
imore+0.720
adelphia+0.687
Wars+0.686
 Rasmussen+0.682
Neg logit contributions
nih-0.717
Downloadha-0.710
-0.701
-0.674
ubi-0.673
Token that
Token ID326
Feature activation-9.28e-01

Pos logit contributions
OTE+0.641
imore+0.590
adelphia+0.565
Wars+0.564
 Rasmussen+0.554
Neg logit contributions
nih-0.610
Downloadha-0.595
-0.594
ubi-0.570
-0.569
Token an
Token ID281
Feature activation-1.29e+00

Pos logit contributions
nih+0.621
+0.604
Downloadha+0.602
ubi+0.580
+0.578
Neg logit contributions
OTE-0.701
imore-0.648
adelphia-0.622
Wars-0.618
 Rasmussen-0.606
Token early
Token ID1903
Feature activation+4.87e-01

Pos logit contributions
nih+0.862
+0.843
Downloadha+0.841
ubi+0.807
+0.805
Neg logit contributions
OTE-0.982
imore-0.906
adelphia-0.869
Wars-0.867
 Rasmussen-0.852
Token rate
Token ID2494
Feature activation-5.96e-01

Pos logit contributions
OTE+0.338
imore+0.309
adelphia+0.296
Wars+0.295
 Rasmussen+0.291
Neg logit contributions
nih-0.354
Downloadha-0.348
-0.347
ubi-0.336
-0.333
Token rise
Token ID4485
Feature activation-3.73e+00

Pos logit contributions
nih+0.442
+0.431
Downloadha+0.430
ubi+0.416
+0.415
Neg logit contributions
OTE-0.407
imore-0.373
adelphia-0.356
Wars-0.354
 Rasmussen-0.346
Token appears
Token ID3568
Feature activation-3.52e+00

Pos logit contributions
nih+2.249
+2.174
Downloadha+2.170
+2.079
ubi+2.073
Neg logit contributions
OTE-3.035
imore-2.820
adelphia-2.725
Wars-2.687
finger-2.627
Token to
Token ID284
Feature activation-6.58e-01

Pos logit contributions
nih+2.042
+1.993
Downloadha+1.942
+1.880
Fit+1.874
Neg logit contributions
OTE-2.942
imore-2.757
adelphia-2.666
Wars-2.619
finger-2.566
Token be
Token ID307
Feature activation+6.70e-01

Pos logit contributions
nih+0.497
+0.486
Downloadha+0.484
ubi+0.469
+0.467
Neg logit contributions
OTE-0.439
imore-0.402
adelphia-0.383
Wars-0.381
 Rasmussen-0.373
Token off
Token ID572
Feature activation+1.94e+00

Pos logit contributions
OTE+0.528
imore+0.489
adelphia+0.469
Wars+0.469
 Rasmussen+0.462
Neg logit contributions
nih-0.426
Downloadha-0.414
-0.414
ubi-0.399
-0.395
Token the
Token ID262
Feature activation+2.73e+00

Pos logit contributions
OTE+1.354
imore+1.233
Wars+1.195
adelphia+1.186
 Canary+1.170
Neg logit contributions
nih-1.400
-1.373
Downloadha-1.363
ubi-1.312
-1.309
Token agenda
Token ID8666
Feature activation+3.75e-01

Pos logit contributions
OTE+1.876
imore+1.693
Wars+1.640
adelphia+1.633
 Canary+1.628
Neg logit contributions
nih-2.001
Downloadha-1.961
-1.955
ubi-1.892
Fit-1.877
Token have
Token ID423
Feature activation-8.49e-01

Pos logit contributions
nih+1.313
+1.279
Downloadha+1.267
ubi+1.234
Fit+1.224
Neg logit contributions
OTE-1.356
imore-1.250
adelphia-1.197
Wars-1.191
 Rasmussen-1.165
Token Out
Token ID3806
Feature activation-5.09e+00

Pos logit contributions
nih+0.568
+0.553
Downloadha+0.551
ubi+0.532
+0.528
Neg logit contributions
OTE-0.641
imore-0.592
adelphia-0.568
Wars-0.566
 Rasmussen-0.554
Tokenwit
Token ID39289
Feature activation+9.95e-01

Pos logit contributions
nih+3.888
+3.746
Downloadha+3.746
Fit+3.744
ubi+3.716
Neg logit contributions
OTE-3.316
imore-3.026
adelphia-2.839
Wars-2.785
 Rasmussen-2.767
Token,
Token ID11
Feature activation-1.04e+00

Pos logit contributions
OTE+0.762
imore+0.705
adelphia+0.678
Wars+0.676
 Rasmussen+0.663
Neg logit contributions
nih-0.655
Downloadha-0.638
-0.637
-0.610
ubi-0.609
Token and
Token ID290
Feature activation+2.79e+00

Pos logit contributions
nih+0.638
+0.621
Downloadha+0.618
ubi+0.594
+0.590
Neg logit contributions
OTE-0.843
imore-0.782
adelphia-0.754
Wars-0.750
 Rasmussen-0.734
Token his
Token ID465
Feature activation-3.85e-01

Pos logit contributions
OTE+1.993
imore+1.835
Wars+1.770
 Rasmussen+1.762
 Canary+1.757
Neg logit contributions
nih-1.937
Downloadha-1.912
-1.892
-1.819
ubi-1.818
Token STOP
Token ID44934
Feature activation-2.60e+00

Pos logit contributions
nih+0.258
+0.251
Downloadha+0.251
ubi+0.241
+0.240
Neg logit contributions
OTE-0.291
imore-0.268
adelphia-0.258
Wars-0.257
 Rasmussen-0.252
Token click
Token ID3904
Feature activation-2.09e+00

Pos logit contributions
nih+1.764
+1.730
Downloadha+1.702
ubi+1.662
+1.655
Neg logit contributions
OTE-1.905
imore-1.783
adelphia-1.699
Wars-1.675
 Rasmussen-1.651
Token is
Token ID318
Feature activation-1.27e+00

Pos logit contributions
nih+1.365
+1.329
Downloadha+1.310
ubi+1.280
Fit+1.267
Neg logit contributions
OTE-1.601
imore-1.481
adelphia-1.424
Wars-1.410
 Rasmussen-1.379
Token pure
Token ID5899
Feature activation-1.09e+00

Pos logit contributions
nih+0.816
+0.797
Downloadha+0.791
ubi+0.762
+0.758
Neg logit contributions
OTE-0.990
imore-0.918
adelphia-0.884
Wars-0.878
 Rasmussen-0.861
Token chaos
Token ID11918
Feature activation-1.07e+00

Pos logit contributions
nih+0.736
+0.718
Downloadha+0.715
ubi+0.690
+0.687
Neg logit contributions
OTE-0.810
imore-0.748
adelphia-0.717
Wars-0.713
 Rasmussen-0.698
Token also
Token ID635
Feature activation-2.33e-01

Pos logit contributions
nih+0.957
+0.931
Downloadha+0.930
ubi+0.894
+0.889
Neg logit contributions
OTE-1.131
imore-1.046
adelphia-1.005
Wars-1.001
 Rasmussen-0.983
Token warns
Token ID22145
Feature activation-1.52e+00

Pos logit contributions
nih+0.160
+0.156
Downloadha+0.156
ubi+0.150
+0.149
Neg logit contributions
OTE-0.173
imore-0.159
adelphia-0.152
Wars-0.152
 Rasmussen-0.149
Token:
Token ID25
Feature activation+2.96e+00

Pos logit contributions
nih+1.079
+1.053
Downloadha+1.049
ubi+1.013
+1.010
Neg logit contributions
OTE-1.079
imore-0.993
adelphia-0.949
Wars-0.941
 Rasmussen-0.923
Token
Token ID198
Feature activation+2.63e+00

Pos logit contributions
OTE+2.292
imore+2.094
Wars+2.074
 Canary+2.045
 Rasmussen+2.022
Neg logit contributions
nih-1.870
Downloadha-1.812
-1.793
ubi-1.762
Fit-1.741
Token
Token ID198
Feature activation+1.73e+00

Pos logit contributions
OTE+2.074
imore+1.915
Wars+1.852
adelphia+1.842
 Rasmussen+1.821
Neg logit contributions
nih-1.661
Downloadha-1.639
-1.611
ubi-1.553
-1.544
TokenQuestion
Token ID24361
Feature activation-1.43e+00

Pos logit contributions
OTE+1.189
imore+1.076
Wars+1.057
adelphia+1.034
finger+1.017
Neg logit contributions
nih-1.267
Downloadha-1.238
-1.238
ubi-1.196
-1.189
Token marks
Token ID8849
Feature activation-1.95e+00

Pos logit contributions
nih+0.890
+0.866
Downloadha+0.863
ubi+0.828
+0.824
Neg logit contributions
OTE-1.152
imore-1.070
adelphia-1.029
Wars-1.025
 Rasmussen-1.007
Token over
Token ID625
Feature activation+4.16e+00

Pos logit contributions
nih+1.265
+1.234
Downloadha+1.229
ubi+1.182
+1.177
Neg logit contributions
OTE-1.509
imore-1.396
adelphia-1.342
Wars-1.333
 Rasmussen-1.307
Token the
Token ID262
Feature activation+1.56e+00

Pos logit contributions
OTE+2.817
imore+2.622
Wars+2.546
 Canary+2.536
adelphia+2.515
Neg logit contributions
nih-2.991
Downloadha-2.959
-2.925
ubi-2.846
Fit-2.782
Token rights
Token ID2489
Feature activation-2.67e+00

Pos logit contributions
OTE+1.170
imore+1.079
Wars+1.040
adelphia+1.040
 Canary+1.028
Neg logit contributions
nih-1.043
Downloadha-1.023
-1.015
ubi-0.982
Fit-0.965
Token of
Token ID286
Feature activation+2.61e+00

Pos logit contributions
nih+1.697
+1.657
Downloadha+1.653
+1.591
ubi+1.589
Neg logit contributions
OTE-2.085
imore-1.936
adelphia-1.860
Wars-1.838
 Rasmussen-1.817
Token Instead
Token ID5455
Feature activation+1.46e-01

Pos logit contributions
OTE+0.187
imore+0.173
adelphia+0.167
Wars+0.166
 Rasmussen+0.163
Neg logit contributions
nih-0.146
-0.142
Downloadha-0.142
ubi-0.136
-0.135
Token,
Token ID11
Feature activation-1.73e+00

Pos logit contributions
OTE+0.117
imore+0.109
adelphia+0.105
Wars+0.104
 Rasmussen+0.103
Neg logit contributions
nih-0.091
-0.088
Downloadha-0.088
ubi-0.084
-0.084
Token he
Token ID339
Feature activation-6.62e-02

Pos logit contributions
nih+1.142
+1.114
Downloadha+1.108
ubi+1.072
+1.061
Neg logit contributions
OTE-1.312
imore-1.213
adelphia-1.163
Wars-1.161
finger-1.135
Token was
Token ID373
Feature activation-2.74e-01

Pos logit contributions
nih+0.046
Downloadha+0.045
+0.045
ubi+0.043
+0.043
Neg logit contributions
OTE-0.049
imore-0.045
adelphia-0.043
Wars-0.042
 Rasmussen-0.042
Token five
Token ID1936
Feature activation-2.49e+00

Pos logit contributions
nih+0.183
+0.178
Downloadha+0.178
ubi+0.171
+0.171
Neg logit contributions
OTE-0.208
imore-0.192
adelphia-0.184
Wars-0.183
 Rasmussen-0.180
Token years
Token ID812
Feature activation-1.08e+00

Pos logit contributions
nih+1.660
+1.637
Downloadha+1.594
ubi+1.561
Fit+1.555
Neg logit contributions
OTE-1.867
imore-1.722
adelphia-1.656
Wars-1.646
 Rasmussen-1.615
Token away
Token ID1497
Feature activation-6.49e-01

Pos logit contributions
nih+0.730
+0.711
Downloadha+0.707
ubi+0.684
+0.680
Neg logit contributions
OTE-0.809
imore-0.748
adelphia-0.718
Wars-0.716
 Rasmussen-0.701
Token from
Token ID422
Feature activation+1.96e+00

Pos logit contributions
nih+0.363
Downloadha+0.352
+0.350
ubi+0.334
+0.332
Neg logit contributions
OTE-0.563
imore-0.525
adelphia-0.509
Wars-0.506
 Rasmussen-0.499
Token a
Token ID257
Feature activation+6.86e-02

Pos logit contributions
OTE+1.384
imore+1.282
adelphia+1.236
Wars+1.216
 Rasmussen+1.213
Neg logit contributions
nih-1.388
Downloadha-1.361
-1.357
ubi-1.300
-1.298
Token fateful
Token ID46873
Feature activation+2.29e+00

Pos logit contributions
OTE+0.052
imore+0.048
adelphia+0.046
Wars+0.046
 Rasmussen+0.045
Neg logit contributions
nih-0.046
Downloadha-0.045
-0.044
ubi-0.043
-0.043
Token date
Token ID3128
Feature activation+1.45e+00

Pos logit contributions
OTE+1.687
imore+1.552
adelphia+1.489
Wars+1.482
 Rasmussen+1.470
Neg logit contributions
Downloadha-1.577
nih-1.567
-1.535
ubi-1.477
-1.477
Token hitting
Token ID9008
Feature activation+1.49e+00

Pos logit contributions
OTE+1.215
imore+1.113
Wars+1.068
adelphia+1.066
 Rasmussen+1.047
Neg logit contributions
nih-1.239
-1.206
Downloadha-1.205
ubi-1.164
Fit-1.160
Token a
Token ID257
Feature activation-1.93e+00

Pos logit contributions
OTE+1.084
imore+0.997
Wars+0.953
 Rasmussen+0.945
adelphia+0.942
Neg logit contributions
nih-1.041
-1.013
Downloadha-1.012
ubi-0.974
-0.968
Token police
Token ID1644
Feature activation+1.06e+00

Pos logit contributions
nih+1.294
+1.264
Downloadha+1.248
ubi+1.214
+1.205
Neg logit contributions
OTE-1.444
imore-1.343
adelphia-1.296
Wars-1.268
finger-1.246
Token officer
Token ID3818
Feature activation-3.17e+00

Pos logit contributions
OTE+0.618
imore+0.556
adelphia+0.525
Wars+0.523
 Rasmussen+0.510
Neg logit contributions
nih-0.898
-0.880
Downloadha-0.878
ubi-0.853
-0.849
Token several
Token ID1811
Feature activation-3.19e+00

Pos logit contributions
nih+1.944
+1.903
Downloadha+1.902
ubi+1.813
+1.809
Neg logit contributions
OTE-2.546
imore-2.356
adelphia-2.287
Wars-2.260
finger-2.209
Token times
Token ID1661
Feature activation-6.50e+00

Pos logit contributions
nih+1.962
+1.921
Downloadha+1.895
+1.812
ubi+1.809
Neg logit contributions
OTE-2.573
imore-2.399
adelphia-2.321
Wars-2.279
 Rasmussen-2.231
Token.
Token ID13
Feature activation-2.84e+00

Pos logit contributions
nih+3.836
+3.793
Downloadha+3.784
+3.624
ubi+3.559
Neg logit contributions
OTE-5.254
imore-4.814
adelphia-4.718
Wars-4.633
finger-4.513
Token
Token ID198
Feature activation+1.65e-01

Pos logit contributions
nih+1.676
+1.640
Downloadha+1.617
+1.564
ubi+1.557
Neg logit contributions
OTE-2.344
imore-2.206
adelphia-2.126
Wars-2.085
finger-2.051
Token
Token ID198
Feature activation-1.78e+00

Pos logit contributions
OTE+0.139
imore+0.130
adelphia+0.125
Wars+0.124
 Canary+0.122
Neg logit contributions
nih-0.096
-0.094
Downloadha-0.092
ubi-0.089
Fit-0.089
TokenI
Token ID40
Feature activation-1.21e+00

Pos logit contributions
nih+1.115
+1.090
Downloadha+1.080
ubi+1.040
+1.039
Neg logit contributions
OTE-1.415
imore-1.320
adelphia-1.271
Wars-1.249
 Rasmussen-1.240
Tokenvan
Token ID10438
Feature activation+3.50e+00

Pos logit contributions
nih+0.847
+0.827
Downloadha+0.824
ubi+0.796
+0.792
Neg logit contributions
OTE-0.877
imore-0.808
adelphia-0.774
Wars-0.770
 Rasmussen-0.754
Token sketch
Token ID17548
Feature activation-7.86e-01

Pos logit contributions
OTE+0.114
imore+0.105
adelphia+0.101
Wars+0.101
 Rasmussen+0.099
Neg logit contributions
nih-0.102
-0.099
Downloadha-0.099
ubi-0.095
-0.095
Token comedy
Token ID10997
Feature activation-3.37e-01

Pos logit contributions
nih+0.527
+0.514
Downloadha+0.511
ubi+0.493
+0.492
Neg logit contributions
OTE-0.592
imore-0.547
adelphia-0.524
Wars-0.522
 Rasmussen-0.513
Token and
Token ID290
Feature activation+2.00e+00

Pos logit contributions
nih+0.220
+0.214
Downloadha+0.213
ubi+0.205
+0.204
Neg logit contributions
OTE-0.260
imore-0.240
adelphia-0.231
Wars-0.230
 Rasmussen-0.226
Token how
Token ID703
Feature activation-2.15e+00

Pos logit contributions
OTE+1.481
imore+1.375
adelphia+1.312
Wars+1.309
 Rasmussen+1.295
Neg logit contributions
nih-1.360
Downloadha-1.324
-1.323
ubi-1.271
Fit-1.257
Token-
Token ID12
Feature activation-5.32e+00

Pos logit contributions
nih+1.385
+1.339
Downloadha+1.323
ubi+1.292
+1.285
Neg logit contributions
OTE-1.662
imore-1.550
adelphia-1.496
Wars-1.465
 Rasmussen-1.444
Tokento
Token ID1462
Feature activation-7.16e+00

Pos logit contributions
nih+4.433
Downloadha+4.348
+4.320
ubi+4.227
Fit+4.194
Neg logit contributions
OTE-3.096
imore-2.766
adelphia-2.676
Wars-2.603
 Rasmussen-2.593
Token guide
Token ID5698
Feature activation-2.84e+00

Pos logit contributions
nih+4.459
+4.351
+4.233
Downloadha+4.231
ubi+4.192
Neg logit contributions
OTE-5.440
imore-5.023
adelphia-4.844
Wars-4.809
 Rasmussen-4.724
Token for
Token ID329
Feature activation-3.40e+00

Pos logit contributions
nih+1.731
+1.688
Downloadha+1.664
+1.621
ubi+1.619
Neg logit contributions
OTE-2.294
imore-2.131
adelphia-2.052
Wars-2.039
 Rasmussen-2.009
Token hosting
Token ID13662
Feature activation+6.77e-01

Pos logit contributions
nih+2.196
+2.142
Downloadha+2.104
ubi+2.046
Fit+2.044
Neg logit contributions
OTE-2.632
imore-2.431
adelphia-2.352
Wars-2.335
 Rasmussen-2.279
Token.
Token ID13
Feature activation+1.38e+00

Pos logit contributions
OTE+0.504
imore+0.463
adelphia+0.444
Wars+0.444
 Canary+0.435
Neg logit contributions
nih-0.459
Downloadha-0.450
-0.447
ubi-0.432
-0.428
Token Ton
Token ID16859
Feature activation-7.78e-01

Pos logit contributions
OTE+1.106
imore+1.019
Wars+0.986
adelphia+0.984
 Rasmussen+0.969
Neg logit contributions
nih-0.857
Downloadha-0.839
-0.834
ubi-0.800
-0.793
Token considered
Token ID3177
Feature activation-2.81e+00

Pos logit contributions
nih+0.590
+0.573
Downloadha+0.567
ubi+0.549
+0.545
Neg logit contributions
OTE-0.718
imore-0.666
adelphia-0.640
Wars-0.639
finger-0.622
Token a
Token ID257
Feature activation-2.01e+00

Pos logit contributions
nih+1.737
+1.688
Downloadha+1.646
ubi+1.601
Fit+1.591
Neg logit contributions
OTE-2.250
imore-2.107
adelphia-2.026
Wars-2.012
finger-1.966
Token good
Token ID922
Feature activation-1.60e+00

Pos logit contributions
nih+1.241
+1.207
Downloadha+1.181
ubi+1.147
+1.146
Neg logit contributions
OTE-1.620
imore-1.516
adelphia-1.454
Wars-1.448
finger-1.415
Token place
Token ID1295
Feature activation+7.23e-01

Pos logit contributions
nih+1.071
+1.037
Downloadha+1.025
ubi+0.994
Fit+0.993
Neg logit contributions
OTE-1.203
imore-1.113
adelphia-1.071
Wars-1.071
finger-1.040
Token to
Token ID284
Feature activation-1.16e+00

Pos logit contributions
OTE+0.543
imore+0.497
adelphia+0.477
Wars+0.476
 Rasmussen+0.470
Neg logit contributions
nih-0.487
Downloadha-0.477
-0.474
ubi-0.456
Fit-0.456
Token invest
Token ID1325
Feature activation-7.13e+00

Pos logit contributions
nih+0.816
+0.794
Downloadha+0.786
ubi+0.764
+0.761
Neg logit contributions
OTE-0.836
imore-0.774
adelphia-0.739
Wars-0.737
 Rasmussen-0.719
Token and
Token ID290
Feature activation+7.26e-01

Pos logit contributions
nih+3.897
+3.831
Downloadha+3.670
ubi+3.604
+3.602
Neg logit contributions
OTE-5.993
imore-5.655
adelphia-5.500
Wars-5.395
finger-5.266
Token the
Token ID262
Feature activation-1.85e+00

Pos logit contributions
OTE+0.543
imore+0.498
adelphia+0.478
Wars+0.477
 Rasmussen+0.471
Neg logit contributions
nih-0.492
Downloadha-0.481
-0.481
ubi-0.462
-0.460
Token government
Token ID1230
Feature activation-4.63e+00

Pos logit contributions
nih+1.200
+1.169
Downloadha+1.150
ubi+1.122
Fit+1.112
Neg logit contributions
OTE-1.431
imore-1.330
adelphia-1.279
Wars-1.272
finger-1.245
Token bonds
Token ID13100
Feature activation-4.14e+00

Pos logit contributions
nih+2.841
+2.761
Downloadha+2.687
ubi+2.617
Fit+2.615
Neg logit contributions
OTE-3.676
imore-3.486
adelphia-3.347
Wars-3.293
finger-3.246
Token of
Token ID286
Feature activation+3.08e+00

Pos logit contributions
nih+2.748
+2.693
Downloadha+2.628
Fit+2.567
ubi+2.565
Neg logit contributions
OTE-3.089
imore-2.885
adelphia-2.773
Wars-2.737
finger-2.660
Token other
Token ID584
Feature activation+2.86e+00

Pos logit contributions
OTE+1.714
imore+1.529
Wars+1.443
finger+1.436
adelphia+1.433
Neg logit contributions
nih-2.507
Downloadha-2.459
-2.443
ubi-2.391
Fit-2.356
Token and
Token ID290
Feature activation+8.59e-01

Pos logit contributions
OTE+2.163
imore+2.018
Wars+1.927
finger+1.925
adelphia+1.911
Neg logit contributions
nih-1.865
Downloadha-1.836
-1.810
-1.741
ubi-1.739
Token shaking
Token ID17275
Feature activation-1.75e+00

Pos logit contributions
OTE+0.635
imore+0.586
adelphia+0.562
Wars+0.560
 Rasmussen+0.552
Neg logit contributions
nih-0.587
Downloadha-0.573
-0.571
ubi-0.549
-0.547
Token
Token ID564
Feature activation-1.10e+00

Pos logit contributions
nih+1.119
+1.094
Downloadha+1.083
ubi+1.048
+1.039
Neg logit contributions
OTE-1.370
imore-1.270
adelphia-1.226
Wars-1.213
 Rasmussen-1.181
Token
Token ID250
Feature activation-1.59e-01

Pos logit contributions
nih+0.754
+0.737
Downloadha+0.732
ubi+0.708
+0.705
Neg logit contributions
OTE-0.809
imore-0.745
adelphia-0.716
Wars-0.712
 Rasmussen-0.698
Tokenwith
Token ID4480
Feature activation-7.87e+00

Pos logit contributions
nih+0.103
+0.100
Downloadha+0.100
ubi+0.096
+0.096
Neg logit contributions
OTE-0.124
imore-0.115
adelphia-0.110
Wars-0.110
 Rasmussen-0.108
Token the
Token ID262
Feature activation-2.37e+00

Pos logit contributions
nih+4.806
+4.782
Downloadha+4.539
ubi+4.495
+4.464
Neg logit contributions
OTE-6.058
imore-5.624
adelphia-5.458
Wars-5.380
 Rasmussen-5.177
Token longing
Token ID40314
Feature activation+1.73e+00

Pos logit contributions
nih+1.554
+1.529
Downloadha+1.486
ubi+1.452
Fit+1.441
Neg logit contributions
OTE-1.819
imore-1.688
adelphia-1.617
Wars-1.601
finger-1.554
Token of
Token ID286
Feature activation+3.40e-01

Pos logit contributions
OTE+1.283
imore+1.181
Wars+1.133
adelphia+1.130
finger+1.119
Neg logit contributions
nih-1.177
Downloadha-1.149
-1.137
-1.100
ubi-1.100
Token twenty
Token ID8208
Feature activation+2.49e+00

Pos logit contributions
OTE+0.251
imore+0.232
adelphia+0.222
Wars+0.221
 Rasmussen+0.217
Neg logit contributions
nih-0.232
-0.226
Downloadha-0.226
ubi-0.218
-0.217
Token years
Token ID812
Feature activation+9.38e-01

Pos logit contributions
OTE+1.757
imore+1.622
adelphia+1.550
Wars+1.543
finger+1.531
Neg logit contributions
nih-1.767
Downloadha-1.739
-1.709
ubi-1.656
-1.654
Token bel
Token ID894
Feature activation+1.31e+00

Pos logit contributions
nih+1.572
+1.531
Downloadha+1.513
ubi+1.467
Fit+1.460
Neg logit contributions
OTE-1.829
imore-1.692
adelphia-1.622
Wars-1.612
finger-1.575
Tokenitt
Token ID715
Feature activation+4.09e-01

Pos logit contributions
OTE+1.032
imore+0.964
adelphia+0.933
Wars+0.920
finger+0.908
Neg logit contributions
nih-0.827
Downloadha-0.809
-0.804
-0.770
ubi-0.764
Tokenling
Token ID1359
Feature activation-9.58e-01

Pos logit contributions
OTE+0.201
imore+0.178
adelphia+0.166
Wars+0.166
 Rasmussen+0.161
Neg logit contributions
nih-0.381
Downloadha-0.375
-0.375
ubi-0.363
-0.363
Token African
Token ID5510
Feature activation-8.30e-01

Pos logit contributions
nih+0.654
+0.637
Downloadha+0.636
ubi+0.613
+0.610
Neg logit contributions
OTE-0.711
imore-0.656
adelphia-0.629
Wars-0.626
 Rasmussen-0.614
Token di
Token ID2566
Feature activation-4.69e+00

Pos logit contributions
nih+0.550
+0.536
Downloadha+0.533
ubi+0.515
+0.512
Neg logit contributions
OTE-0.631
imore-0.583
adelphia-0.560
Wars-0.556
 Rasmussen-0.547
Tokenat
Token ID265
Feature activation-7.36e+00

Pos logit contributions
nih+2.813
+2.704
Downloadha+2.694
Fit+2.595
ubi+2.593
Neg logit contributions
OTE-3.867
imore-3.578
adelphia-3.440
Wars-3.438
 Rasmussen-3.392
Tokenribe
Token ID4892
Feature activation-2.35e+00

Pos logit contributions
nih+3.214
+2.872
ubi+2.826
Downloadha+2.823
Fit+2.814
Neg logit contributions
OTE-7.180
imore-6.563
Wars-6.493
adelphia-6.487
 Rasmussen-6.418
Token on
Token ID319
Feature activation-2.03e+00

Pos logit contributions
nih+1.432
+1.395
Downloadha+1.381
ubi+1.325
+1.323
Neg logit contributions
OTE-1.902
imore-1.758
adelphia-1.702
Wars-1.692
 Rasmussen-1.647
Token Lu
Token ID6026
Feature activation-2.53e+00

Pos logit contributions
nih+1.347
+1.316
Downloadha+1.314
ubi+1.259
Fit+1.255
Neg logit contributions
OTE-1.546
imore-1.425
adelphia-1.369
Wars-1.360
finger-1.331
Tokenol
Token ID349
Feature activation-4.45e+00

Pos logit contributions
nih+1.646
+1.590
Downloadha+1.579
ubi+1.551
Fit+1.531
Neg logit contributions
OTE-1.955
imore-1.803
adelphia-1.740
Wars-1.724
 Rasmussen-1.696
Token Deng
Token ID41985
Feature activation-2.99e+00

Pos logit contributions
nih+2.588
+2.457
ubi+2.421
Downloadha+2.347
Fit+2.346
Neg logit contributions
OTE-3.761
imore-3.469
adelphia-3.398
Wars-3.340
 Canary-3.245