TOP ACTIVATIONS
MAX = 5.826

effects . ( In other words it neglect s the laws
story In the capital , the police and huge
false . One photo , showing a dead Palestinian
THE BAD and THE CO OL . THE BAD
July , a fisherman went missing , and a cro c
Saudi Arabia . In other words , she threatened Swedish arms
a satire of after - school specials that ran on Comedy
A sixth firm was let off because it had blown the
infinite wisdom , tweeted the following at Ham ill ( m
in the state of the art Is ot ope Chron ology
Women for America , we pt as she told the committee
quite simple : function ob b _ sub div
Gr uden said Duke I hen acho also has gotten some
) AS count FROM ( SELECT SPL IT ( content ,
ul ik and John He yn went to a Jud as
away , and called to see if anyone there knew of
albeit operating more slowly than normal , says Timothy Cot é
than two . In other words , he wanted to increase
, you will be let off . The sentence of 20
than the standard F - 150 for improved stability off -

MIN ACTIVATIONS
MIN = -5.994

: Show nested quote + On September
TA WA ( Reuters ) - Canada s top
V ANA ( Reuters ) - Latin American and Caribbean leaders
Dark ho arse wrote : Show nested quote +
of people who make a living churn ing out dozens of
pur ak ushi wrote : Happy it worked out
ho arse wrote : Show nested quote +
Simon King , Albion 's Chief Commercial Officer , said :
ak ushi wrote : Happy it worked out .
arg uably the greatest game ever played in Cleveland , Ohio
am is . Certain d ' ent re e ux s
L ANS ING , MI -- The Michigan House is looking
. Er n est o Gal ar za ,
USS ELS ( Reuters ) - The world s
it just me , or does anyone else out there think
: 50 pur ak ushi wrote : Happy it
AD ENA , Calif . -- The first Martian rock NASA
. <|endoftext|> Dec 2 , 2015 Cycl oc ross hurts
N ATIONS ( Reuters ) - An army battalion in the
a Cohn prot é g é , was running the mortgage

INTERVAL 2.871 - 5.826
CONTAINS 0.343%

( alt itude 34 00 m ; 19 ° 30 N
, 2014 I ne v itably , with all
? Will P fl aum Bl ocked Un block
get your tickets ! Che ers ! <|endoftext|> Af ghan artist
, which for the most part is uncontroll able .

INTERVAL -0.084 - 2.871
CONTAINS 53.163%

: report Vietnamese airline takes steps to open flights to US
operating system . In April , two Korean Internet companies --
ot erie of the super - rich . And then perhaps
Libya might occur in the future , Obama launched a war
and to make every play difficult for his opponent ."

INTERVAL -3.039 - -0.084
CONTAINS 46.270%

aul i . re in ik ain en @ il media
way over to Liverpool , England actually worked ! HW ND
and culture that dominates the state . Like Islam and most
government , he says cheer fully .
en br un ner appeared in 5 64 games as a

INTERVAL -5.994 - -3.039
CONTAINS 0.224%

signal 11 , Se gment ation fault . 4 5 (
the potential or 3 har v ests a year , unlike
Share this comment on Facebook Share this comment on Twitter
say they received notices from Tinder saying the were unable to
Jen was pregnant . Flynn and his wife specifically decided to
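
NOTE ON THE INTERVALS
The four INTERVAL boundaries above are consistent with splitting the range from MIN (-5.994) to MAX (5.826) into four equal-width bins of 2.955 each, and the four CONTAINS percentages sum to 100.000%. The dump does not say how the bins were built, so the sketch below is an assumption inferred from the numbers, not the dashboard's actual code. The helper names `equal_width_edges` and `bin_shares` are hypothetical.

```python
def equal_width_edges(lo, hi, n_bins):
    """Return n_bins + 1 edges evenly spaced from lo to hi (assumed binning scheme)."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]


def bin_shares(values, edges):
    """Percentage of values per bin; values outside [edges[0], edges[-1]] are ignored."""
    n_bins = len(edges) - 1
    width = edges[1] - edges[0]
    counts = [0] * n_bins
    for v in values:
        if v < edges[0] or v > edges[-1]:
            continue
        # Clamp the index so a value equal to the top edge lands in the last bin.
        i = min(int((v - edges[0]) / width), n_bins - 1)
        counts[i] += 1
    total = sum(counts)
    if total == 0:
        return [0.0] * n_bins
    return [100.0 * c / total for c in counts]


# MIN and MAX are taken from the dump; four bins matches the four INTERVAL blocks.
edges = equal_width_edges(-5.994, 5.826, 4)
print([round(e, 3) for e in edges])
```

Passing a list of per-token activations to `bin_shares(values, edges)` would give shares comparable to the CONTAINS lines, assuming those percentages are counted over tokens.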
Token effects
Token ID3048
Feature activation-4.95e-03

Pos logit contributions
 Mir+0.084
 Citation+0.084
hots+0.080
exec+0.075
camp+0.074
Neg logit contributions
Tam-0.105
paralleled-0.103
lear-0.101
-0.100
Compare-0.100
Token.
Token ID13
Feature activation-4.56e-01

Pos logit contributions
 Mir+0.003
 Citation+0.003
hots+0.003
exec+0.003
camp+0.003
Neg logit contributions
Tam-0.004
paralleled-0.004
lear-0.004
-0.004
Compare-0.004
Token (
Token ID357
Feature activation-1.21e+00

Pos logit contributions
 Mir+0.307
 Citation+0.306
hots+0.291
exec+0.275
camp+0.273
Neg logit contributions
Tam-0.353
paralleled-0.350
lear-0.341
��-0.339
-0.338
TokenIn
Token ID818
Feature activation-5.41e-01

Pos logit contributions
 Mir+0.830
 Citation+0.830
hots+0.800
exec+0.761
camp+0.752
Neg logit contributions
Tam-0.905
paralleled-0.902
��-0.876
lear-0.875
-0.872
Token other
Token ID584
Feature activation+3.49e-01

Pos logit contributions
 Mir+0.349
 Citation+0.347
hots+0.333
exec+0.314
camp+0.310
Neg logit contributions
Tam-0.433
paralleled-0.426
lear-0.416
-0.414
Compare-0.411
Token words
Token ID2456
Feature activation+5.83e+00

Pos logit contributions
Tam+0.258
paralleled+0.252
lear+0.246
+0.245
Compare+0.245
Neg logit contributions
 Mir-0.245
 Citation-0.245
hots-0.237
exec-0.225
camp-0.223
Token it
Token ID340
Feature activation+3.39e-01

Pos logit contributions
Tam+3.984
paralleled+3.856
Compare+3.817
+3.785
��+3.779
Neg logit contributions
 Citation-4.179
 Mir-4.140
hots-4.077
exec-3.920
ancial-3.869
Token neglect
Token ID17985
Feature activation-2.05e+00

Pos logit contributions
Tam+0.263
paralleled+0.257
lear+0.251
+0.250
Compare+0.249
Neg logit contributions
 Mir-0.226
 Citation-0.225
hots-0.218
exec-0.206
camp-0.204
Tokens
Token ID82
Feature activation-4.57e-01

Pos logit contributions
 Mir+1.245
 Citation+1.233
hots+1.187
exec+1.119
camp+1.111
Neg logit contributions
Tam-1.711
paralleled-1.681
-1.644
��-1.641
lear-1.637
Token the
Token ID262
Feature activation+5.35e-01

Pos logit contributions
 Mir+0.291
 Citation+0.290
hots+0.278
exec+0.261
camp+0.258
Neg logit contributions
Tam-0.369
paralleled-0.363
lear-0.354
-0.352
Compare-0.351
Token laws
Token ID3657
Feature activation-4.56e-01

Pos logit contributions
Tam+0.420
paralleled+0.411
lear+0.401
+0.400
Compare+0.399
Neg logit contributions
 Mir-0.353
 Citation-0.352
hots-0.339
exec-0.320
camp-0.317
Token story
Token ID1621
Feature activation-3.88e-01

Pos logit contributions
Tam+0.006
paralleled+0.006
lear+0.006
+0.006
Compare+0.006
Neg logit contributions
 Mir-0.004
 Citation-0.004
hots-0.004
exec-0.004
camp-0.003
Token
Token ID198
Feature activation+1.98e-01

Pos logit contributions
 Mir+0.227
 Citation+0.226
hots+0.214
exec+0.202
camp+0.200
Neg logit contributions
Tam-0.333
paralleled-0.330
��-0.327
-0.323
lear-0.322
Token
Token ID198
Feature activation-3.31e-01

Pos logit contributions
Tam+0.166
paralleled+0.163
lear+0.159
+0.159
��+0.159
Neg logit contributions
 Mir-0.119
 Citation-0.118
hots-0.114
exec-0.107
camp-0.106
TokenIn
Token ID818
Feature activation+2.45e-01

Pos logit contributions
 Mir+0.215
 Citation+0.214
hots+0.207
exec+0.197
camp+0.195
Neg logit contributions
Tam-0.259
paralleled-0.257
lear-0.251
-0.250
��-0.249
Token the
Token ID262
Feature activation+5.62e-01

Pos logit contributions
Tam+0.200
paralleled+0.196
lear+0.191
+0.191
Compare+0.191
Neg logit contributions
 Mir-0.153
 Citation-0.153
hots-0.147
exec-0.139
camp-0.137
Token capital
Token ID3139
Feature activation+5.81e+00

Pos logit contributions
Tam+0.450
paralleled+0.443
lear+0.434
+0.432
Compare+0.430
Neg logit contributions
 Mir-0.364
 Citation-0.358
hots-0.346
exec-0.326
camp-0.322
Token,
Token ID11
Feature activation+6.33e-01

Pos logit contributions
Tam+4.092
paralleled+3.965
Compare+3.843
lear+3.796
+3.736
Neg logit contributions
 Citation-4.151
hots-4.030
 Mir-3.993
exec-3.834
camp-3.756
Token the
Token ID262
Feature activation+2.94e-01

Pos logit contributions
Tam+0.520
paralleled+0.508
Compare+0.495
lear+0.494
+0.493
Neg logit contributions
 Citation-0.389
 Mir-0.386
hots-0.377
exec-0.354
camp-0.351
Token police
Token ID1644
Feature activation+7.56e-02

Pos logit contributions
Tam+0.232
paralleled+0.227
lear+0.222
+0.222
Compare+0.221
Neg logit contributions
 Mir-0.193
 Citation-0.191
hots-0.185
exec-0.175
camp-0.173
Token and
Token ID290
Feature activation-6.19e-01

Pos logit contributions
Tam+0.065
paralleled+0.064
lear+0.062
+0.062
Compare+0.062
Neg logit contributions
 Mir-0.044
 Citation-0.044
hots-0.042
exec-0.039
camp-0.039
Token huge
Token ID3236
Feature activation-1.62e-01

Pos logit contributions
 Mir+0.384
 Citation+0.381
hots+0.366
exec+0.346
camp+0.342
Neg logit contributions
Tam-0.510
paralleled-0.500
lear-0.489
-0.488
Compare-0.487
Token false
Token ID3991
Feature activation-1.75e+00

Pos logit contributions
 Mir+0.050
 Citation+0.049
hots+0.048
exec+0.045
camp+0.044
Neg logit contributions
Tam-0.054
paralleled-0.053
lear-0.052
Compare-0.051
-0.051
Token.
Token ID13
Feature activation-5.94e-01

Pos logit contributions
 Mir+1.184
 Citation+1.164
hots+1.142
exec+1.072
camp+1.054
Neg logit contributions
Tam-1.336
paralleled-1.321
lear-1.293
-1.281
��-1.281
Token
Token ID198
Feature activation-4.05e-01

Pos logit contributions
 Mir+0.367
 Citation+0.361
hots+0.346
exec+0.326
camp+0.322
Neg logit contributions
Tam-0.491
paralleled-0.487
lear-0.477
-0.473
��-0.472
Token
Token ID198
Feature activation-3.42e-01

Pos logit contributions
 Mir+0.239
 Citation+0.237
hots+0.227
exec+0.214
camp+0.212
Neg logit contributions
Tam-0.344
paralleled-0.340
��-0.335
lear-0.332
-0.332
TokenOne
Token ID3198
Feature activation+1.16e+00

Pos logit contributions
 Mir+0.216
 Citation+0.215
hots+0.209
exec+0.197
camp+0.194
Neg logit contributions
Tam-0.274
paralleled-0.272
lear-0.265
-0.264
��-0.263
Token photo
Token ID4590
Feature activation+5.73e+00

Pos logit contributions
Tam+0.921
paralleled+0.901
lear+0.879
+0.877
Compare+0.877
Neg logit contributions
 Mir-0.750
 Citation-0.749
hots-0.724
exec-0.684
camp-0.676
Token,
Token ID11
Feature activation+8.30e-01

Pos logit contributions
Tam+4.139
paralleled+4.029
Compare+3.933
��+3.863
+3.860
Neg logit contributions
 Citation-4.004
 Mir-3.914
hots-3.822
exec-3.687
ancial-3.681
Token showing
Token ID4478
Feature activation+1.73e+00

Pos logit contributions
Tam+0.683
paralleled+0.670
lear+0.655
+0.653
Compare+0.652
Neg logit contributions
 Mir-0.513
 Citation-0.511
hots-0.494
exec-0.464
camp-0.459
Token a
Token ID257
Feature activation-8.54e-01

Pos logit contributions
Tam+1.364
paralleled+1.331
Compare+1.295
lear+1.291
+1.288
Neg logit contributions
 Citation-1.123
 Mir-1.109
hots-1.084
exec-1.025
camp-1.020
Token dead
Token ID2636
Feature activation-4.27e-01

Pos logit contributions
 Mir+0.589
 Citation+0.581
hots+0.558
exec+0.530
camp+0.523
Neg logit contributions
Tam-0.650
paralleled-0.637
lear-0.625
-0.623
Compare-0.618
Token Palestinian
Token ID8501
Feature activation-7.03e-02

Pos logit contributions
 Mir+0.335
 Citation+0.330
hots+0.319
exec+0.305
camp+0.303
Neg logit contributions
Tam-0.283
paralleled-0.278
-0.271
lear-0.271
Compare-0.268
Token THE
Token ID3336
Feature activation+3.66e-01

Pos logit contributions
 Mir+0.115
 Citation+0.114
hots+0.110
exec+0.104
camp+0.103
Neg logit contributions
Tam-0.136
paralleled-0.133
lear-0.130
-0.129
Compare-0.129
Token BAD
Token ID33934
Feature activation-2.70e-01

Pos logit contributions
Tam+0.272
paralleled+0.266
+0.259
lear+0.259
Compare+0.258
Neg logit contributions
 Mir-0.257
 Citation-0.255
hots-0.246
exec-0.234
camp-0.233
Token and
Token ID290
Feature activation+4.66e-01

Pos logit contributions
 Mir+0.192
 Citation+0.190
hots+0.185
exec+0.176
camp+0.174
Neg logit contributions
Tam-0.197
paralleled-0.194
lear-0.189
-0.188
Compare-0.187
Token THE
Token ID3336
Feature activation-6.12e-03

Pos logit contributions
Tam+0.388
paralleled+0.380
lear+0.372
+0.371
Compare+0.370
Neg logit contributions
 Mir-0.286
 Citation-0.283
hots-0.273
exec-0.257
camp-0.255
Token CO
Token ID7375
Feature activation+5.79e-01

Pos logit contributions
 Mir+0.004
 Citation+0.004
hots+0.004
exec+0.004
camp+0.004
Neg logit contributions
Tam-0.004
paralleled-0.004
lear-0.004
-0.004
Compare-0.004
TokenOL
Token ID3535
Feature activation+5.73e+00

Pos logit contributions
Tam+0.473
paralleled+0.466
lear+0.453
+0.453
��+0.452
Neg logit contributions
 Mir-0.362
 Citation-0.360
hots-0.346
exec-0.328
camp-0.323
Token.
Token ID13
Feature activation+5.97e-01

Pos logit contributions
Tam+3.756
paralleled+3.648
Compare+3.585
lear+3.570
+3.526
Neg logit contributions
 Citation-4.333
 Mir-4.269
hots-4.196
exec-4.023
camp-3.957
Token
Token ID198
Feature activation-1.52e-01

Pos logit contributions
Tam+0.477
paralleled+0.471
lear+0.459
+0.456
��+0.456
Neg logit contributions
 Mir-0.387
 Citation-0.384
hots-0.367
exec-0.347
camp-0.345
Token
Token ID198
Feature activation-2.03e-01

Pos logit contributions
 Mir+0.091
 Citation+0.091
hots+0.087
exec+0.082
camp+0.081
Neg logit contributions
Tam-0.128
paralleled-0.126
��-0.123
lear-0.123
-0.123
TokenTHE
Token ID10970
Feature activation+2.47e-01

Pos logit contributions
 Mir+0.133
 Citation+0.132
hots+0.128
exec+0.122
camp+0.121
Neg logit contributions
Tam-0.157
paralleled-0.156
��-0.152
lear-0.152
-0.152
Token BAD
Token ID33934
Feature activation-8.83e-01

Pos logit contributions
Tam+0.165
paralleled+0.161
lear+0.156
+0.156
��+0.155
Neg logit contributions
 Mir-0.192
 Citation-0.191
hots-0.184
exec-0.176
camp-0.176
Token July
Token ID2901
Feature activation+4.41e-01

Pos logit contributions
Tam+0.069
paralleled+0.068
lear+0.066
+0.066
Compare+0.066
Neg logit contributions
 Mir-0.034
 Citation-0.034
hots-0.033
exec-0.030
camp-0.030
Token,
Token ID11
Feature activation+1.89e-01

Pos logit contributions
Tam+0.337
paralleled+0.329
+0.321
lear+0.321
Compare+0.320
Neg logit contributions
 Mir-0.300
 Citation-0.299
hots-0.289
exec-0.273
camp-0.271
Token a
Token ID257
Feature activation-2.59e-01

Pos logit contributions
Tam+0.156
paralleled+0.153
lear+0.150
+0.150
Compare+0.150
Neg logit contributions
 Mir-0.116
 Citation-0.115
hots-0.111
exec-0.104
camp-0.103
Token fisherman
Token ID48750
Feature activation-8.28e-01

Pos logit contributions
 Mir+0.173
 Citation+0.171
hots+0.164
exec+0.156
camp+0.155
Neg logit contributions
Tam-0.201
paralleled-0.197
-0.194
lear-0.193
Compare-0.192
Token went
Token ID1816
Feature activation+6.62e-01

Pos logit contributions
 Mir+0.506
 Citation+0.498
hots+0.484
exec+0.454
camp+0.448
Neg logit contributions
Tam-0.692
paralleled-0.678
-0.665
��-0.663
lear-0.660
Token missing
Token ID4814
Feature activation+5.66e+00

Pos logit contributions
Tam+0.543
paralleled+0.532
lear+0.520
+0.520
Compare+0.518
Neg logit contributions
 Mir-0.412
 Citation-0.409
hots-0.396
exec-0.372
camp-0.369
Token,
Token ID11
Feature activation+3.43e-01

Pos logit contributions
Tam+4.269
paralleled+4.174
Compare+4.068
lear+4.050
 Lenovo+3.916
Neg logit contributions
 Citation-3.656
 Mir-3.618
hots-3.588
ancial-3.430
exec-3.411
Token and
Token ID290
Feature activation+3.31e-02

Pos logit contributions
Tam+0.286
paralleled+0.281
lear+0.274
+0.273
Compare+0.273
Neg logit contributions
 Mir-0.208
 Citation-0.207
hots-0.200
exec-0.188
camp-0.186
Token a
Token ID257
Feature activation-6.72e-01

Pos logit contributions
Tam+0.027
paralleled+0.027
+0.026
lear+0.026
��+0.026
Neg logit contributions
 Mir-0.021
 Citation-0.020
hots-0.020
exec-0.018
camp-0.018
Token cro
Token ID6763
Feature activation+1.43e+00

Pos logit contributions
 Mir+0.444
 Citation+0.437
hots+0.419
exec+0.397
camp+0.396
Neg logit contributions
Tam-0.528
paralleled-0.518
-0.510
lear-0.507
��-0.506
Tokenc
Token ID66
Feature activation-9.11e-01

Pos logit contributions
Tam+1.128
paralleled+1.106
lear+1.087
Compare+1.072
+1.071
Neg logit contributions
 Mir-0.929
 Citation-0.923
hots-0.890
exec-0.832
camp-0.827
Token Saudi
Token ID7420
Feature activation-6.29e-01

Pos logit contributions
 Mir+0.011
 Citation+0.011
hots+0.010
exec+0.010
camp+0.010
Neg logit contributions
Tam-0.012
paralleled-0.012
lear-0.011
-0.011
Compare-0.011
Token Arabia
Token ID9671
Feature activation-1.01e+00

Pos logit contributions
 Mir+0.550
 Citation+0.548
hots+0.532
exec+0.511
camp+0.507
Neg logit contributions
Tam-0.358
paralleled-0.349
-0.336
Compare-0.334
lear-0.334
Token.
Token ID13
Feature activation+1.72e-01

Pos logit contributions
 Mir+0.678
 Citation+0.673
hots+0.648
exec+0.613
camp+0.610
Neg logit contributions
Tam-0.778
paralleled-0.760
lear-0.744
-0.740
Compare-0.739
Token In
Token ID554
Feature activation+1.85e-01

Pos logit contributions
Tam+0.142
paralleled+0.140
lear+0.136
+0.136
Compare+0.135
Neg logit contributions
 Mir-0.106
 Citation-0.106
hots-0.101
exec-0.095
camp-0.095
Token other
Token ID584
Feature activation-2.27e-01

Pos logit contributions
Tam+0.150
paralleled+0.147
lear+0.144
+0.143
Compare+0.143
Neg logit contributions
 Mir-0.117
 Citation-0.117
hots-0.112
exec-0.106
camp-0.105
Token words
Token ID2456
Feature activation+5.55e+00

Pos logit contributions
 Mir+0.157
 Citation+0.156
hots+0.151
exec+0.143
camp+0.142
Neg logit contributions
Tam-0.170
paralleled-0.167
-0.162
lear-0.162
Compare-0.161
Token,
Token ID11
Feature activation+1.77e+00

Pos logit contributions
Tam+3.730
paralleled+3.581
+3.506
��+3.497
Compare+3.486
Neg logit contributions
 Citation-4.119
hots-4.040
 Mir-4.034
exec-3.814
ancial-3.751
Token she
Token ID673
Feature activation+1.17e+00

Pos logit contributions
Tam+1.630
paralleled+1.595
Compare+1.566
lear+1.560
+1.559
Neg logit contributions
 Citation-0.901
 Mir-0.894
hots-0.876
exec-0.808
camp-0.797
Token threatened
Token ID8556
Feature activation+1.28e+00

Pos logit contributions
Tam+0.961
paralleled+0.940
lear+0.918
+0.917
Compare+0.915
Neg logit contributions
 Mir-0.721
 Citation-0.716
hots-0.692
exec-0.651
camp-0.644
Token Swedish
Token ID14023
Feature activation+1.51e+00

Pos logit contributions
Tam+1.042
paralleled+1.021
lear+0.997
+0.995
Compare+0.994
Neg logit contributions
 Mir-0.799
 Citation-0.797
hots-0.769
exec-0.725
camp-0.716
Token arms
Token ID5101
Feature activation+3.63e-01

Pos logit contributions
Tam+1.245
paralleled+1.217
lear+1.192
+1.190
Compare+1.187
Neg logit contributions
 Mir-0.928
 Citation-0.927
hots-0.899
exec-0.843
camp-0.836
Token a
Token ID257
Feature activation+7.59e-01

Pos logit contributions
Tam+0.504
paralleled+0.494
lear+0.483
+0.481
Compare+0.480
Neg logit contributions
 Citation-0.365
 Mir-0.364
hots-0.352
exec-0.330
camp-0.327
Token satire
Token ID35704
Feature activation+7.77e-02

Pos logit contributions
Tam+0.606
paralleled+0.593
lear+0.579
+0.577
Compare+0.577
Neg logit contributions
 Mir-0.487
 Citation-0.486
hots-0.470
exec-0.443
camp-0.438
Token of
Token ID286
Feature activation-1.16e+00

Pos logit contributions
Tam+0.066
paralleled+0.064
lear+0.063
+0.063
Compare+0.063
Neg logit contributions
 Mir-0.046
 Citation-0.046
hots-0.045
exec-0.042
camp-0.041
Token after
Token ID706
Feature activation+5.09e-01

Pos logit contributions
 Mir+0.764
 Citation+0.749
hots+0.723
exec+0.687
camp+0.680
Neg logit contributions
Tam-0.916
paralleled-0.900
lear-0.878
-0.876
��-0.872
Token-
Token ID12
Feature activation+1.40e+00

Pos logit contributions
Tam+0.396
paralleled+0.388
lear+0.378
+0.377
Compare+0.377
Neg logit contributions
 Mir-0.337
 Citation-0.337
hots-0.325
exec-0.307
camp-0.304
Tokenschool
Token ID14347
Feature activation+5.50e+00

Pos logit contributions
Tam+0.807
paralleled+0.782
lear+0.756
+0.751
Compare+0.751
Neg logit contributions
 Citation-1.212
 Mir-1.209
hots-1.175
exec-1.124
camp-1.114
Token specials
Token ID38102
Feature activation-1.09e+00

Pos logit contributions
Tam+4.417
paralleled+4.272
+4.223
lear+4.215
Compare+4.174
Neg logit contributions
 Citation-3.418
 Mir-3.262
hots-3.241
exec-3.064
ancial-3.046
Token that
Token ID326
Feature activation-6.78e-01

Pos logit contributions
 Mir+0.670
 Citation+0.657
hots+0.642
exec+0.598
camp+0.592
Neg logit contributions
Tam-0.914
paralleled-0.897
lear-0.874
Compare-0.873
-0.873
Token ran
Token ID4966
Feature activation-9.76e-01

Pos logit contributions
 Mir+0.414
 Citation+0.410
hots+0.397
exec+0.373
camp+0.368
Neg logit contributions
Tam-0.565
paralleled-0.553
lear-0.541
-0.539
Compare-0.538
Token on
Token ID319
Feature activation-2.69e-01

Pos logit contributions
 Mir+0.608
 Citation+0.598
hots+0.580
exec+0.548
camp+0.540
Neg logit contributions
Tam-0.803
paralleled-0.787
lear-0.771
��-0.768
-0.768
Token Comedy
Token ID22329
Feature activation-1.25e+00

Pos logit contributions
 Mir+0.167
 Citation+0.166
hots+0.160
exec+0.151
camp+0.149
Neg logit contributions
Tam-0.221
paralleled-0.217
lear-0.212
-0.211
Compare-0.211
TokenA
Token ID32
Feature activation-9.53e-01

Pos logit contributions
Tam+0.883
paralleled+0.862
lear+0.840
Compare+0.839
+0.837
Neg logit contributions
 Mir-0.787
 Citation-0.787
hots-0.760
exec-0.719
camp-0.712
Token sixth
Token ID11695
Feature activation+8.89e-01

Pos logit contributions
 Mir+0.676
 Citation+0.668
hots+0.646
exec+0.613
camp+0.610
Neg logit contributions
Tam-0.701
paralleled-0.685
lear-0.669
-0.668
Compare-0.667
Token firm
Token ID4081
Feature activation+2.46e+00

Pos logit contributions
Tam+0.690
paralleled+0.676
lear+0.659
+0.657
Compare+0.656
Neg logit contributions
 Mir-0.589
 Citation-0.588
hots-0.569
exec-0.538
camp-0.532
Token was
Token ID373
Feature activation+1.71e+00

Pos logit contributions
Tam+1.964
paralleled+1.921
lear+1.876
Compare+1.873
+1.865
Neg logit contributions
 Citation-1.562
 Mir-1.552
hots-1.508
exec-1.417
ancial-1.406
Token let
Token ID1309
Feature activation+1.39e+00

Pos logit contributions
Tam+1.545
paralleled+1.519
lear+1.486
Compare+1.481
+1.481
Neg logit contributions
 Citation-0.909
 Mir-0.909
hots-0.873
exec-0.811
camp-0.802
Token off
Token ID572
Feature activation+5.18e+00

Pos logit contributions
Tam+1.161
paralleled+1.138
lear+1.117
Compare+1.114
+1.110
Neg logit contributions
 Mir-0.834
 Citation-0.831
hots-0.805
exec-0.754
camp-0.746
Token because
Token ID780
Feature activation+5.77e-02

Pos logit contributions
Tam+3.775
paralleled+3.748
Compare+3.616
lear+3.598
+3.560
Neg logit contributions
 Citation-3.465
 Mir-3.440
hots-3.396
exec-3.235
camp-3.185
Token it
Token ID340
Feature activation+9.17e-01

Pos logit contributions
Tam+0.047
paralleled+0.046
lear+0.045
+0.045
Compare+0.045
Neg logit contributions
 Mir-0.036
 Citation-0.036
hots-0.035
exec-0.033
camp-0.032
Token had
Token ID550
Feature activation+6.99e-01

Pos logit contributions
Tam+0.756
paralleled+0.740
lear+0.723
+0.722
Compare+0.721
Neg logit contributions
 Mir-0.566
 Citation-0.564
hots-0.545
exec-0.513
camp-0.507
Token blown
Token ID16318
Feature activation-8.17e-01

Pos logit contributions
Tam+0.570
paralleled+0.558
lear+0.545
+0.544
Compare+0.543
Neg logit contributions
 Mir-0.437
 Citation-0.436
hots-0.420
exec-0.396
camp-0.392
Token the
Token ID262
Feature activation+6.42e-02

Pos logit contributions
 Mir+0.476
 Citation+0.470
hots+0.451
exec+0.422
camp+0.420
Neg logit contributions
Tam-0.705
paralleled-0.693
-0.679
lear-0.678
��-0.675
Token infinite
Token ID15541
Feature activation-2.51e+00

Pos logit contributions
 Mir+0.816
 Citation+0.810
hots+0.769
exec+0.732
camp+0.723
Neg logit contributions
Tam-1.003
paralleled-0.982
-0.961
��-0.960
lear-0.955
Token wisdom
Token ID11501
Feature activation-3.50e-01

Pos logit contributions
 Mir+1.944
 Citation+1.934
hots+1.830
exec+1.790
 Lar+1.755
Neg logit contributions
Tam-1.657
paralleled-1.623
��-1.575
-1.564
Compare-1.558
Token,
Token ID11
Feature activation-1.01e+00

Pos logit contributions
 Mir+0.242
 Citation+0.241
hots+0.234
exec+0.222
camp+0.220
Neg logit contributions
Tam-0.263
paralleled-0.257
lear-0.250
-0.250
Compare-0.249
Token tweeted
Token ID10830
Feature activation-5.00e-01

Pos logit contributions
 Mir+0.625
 Citation+0.618
hots+0.600
exec+0.564
camp+0.556
Neg logit contributions
Tam-0.834
paralleled-0.811
��-0.800
Compare-0.793
-0.793
Token the
Token ID262
Feature activation+1.74e+00

Pos logit contributions
 Mir+0.328
 Citation+0.326
hots+0.313
exec+0.296
camp+0.293
Neg logit contributions
Tam-0.395
paralleled-0.389
lear-0.378
-0.377
Compare-0.376
Token following
Token ID1708
Feature activation+5.08e+00

Pos logit contributions
Tam+1.485
paralleled+1.454
lear+1.423
+1.421
Compare+1.420
Neg logit contributions
 Mir-1.012
 Citation-1.010
hots-0.978
exec-0.915
camp-0.905
Token at
Token ID379
Feature activation+1.97e+00

Pos logit contributions
Tam+3.745
paralleled+3.642
Compare+3.608
lear+3.561
+3.544
Neg logit contributions
 Citation-3.428
 Mir-3.415
hots-3.376
exec-3.190
ancial-3.150
Token Ham
Token ID4345
Feature activation+6.57e-01

Pos logit contributions
Tam+1.568
paralleled+1.530
lear+1.498
Compare+1.493
+1.491
Neg logit contributions
 Citation-1.257
 Mir-1.252
hots-1.221
exec-1.149
camp-1.136
Tokenill
Token ID359
Feature activation-3.96e-01

Pos logit contributions
Tam+0.653
paralleled+0.642
lear+0.630
+0.629
Compare+0.628
Neg logit contributions
 Mir-0.292
 Citation-0.291
hots-0.277
exec-0.255
camp-0.250
Token (
Token ID357
Feature activation+7.96e-01

Pos logit contributions
 Mir+0.294
 Citation+0.293
hots+0.284
exec+0.271
camp+0.269
Neg logit contributions
Tam-0.276
paralleled-0.270
lear-0.262
-0.262
Compare-0.261
Tokenm
Token ID76
Feature activation-1.34e+00

Pos logit contributions
Tam+0.623
paralleled+0.610
lear+0.595
+0.593
Compare+0.593
Neg logit contributions
 Mir-0.523
 Citation-0.522
hots-0.506
exec-0.478
camp-0.473
Token in
Token ID287
Feature activation-9.94e-01

Pos logit contributions
 Mir+0.760
 Citation+0.750
hots+0.728
exec+0.681
camp+0.674
Neg logit contributions
Tam-1.071
paralleled-1.048
-1.028
lear-1.026
Compare-1.022
Token the
Token ID262
Feature activation-3.44e-01

Pos logit contributions
 Mir+0.539
 Citation+0.533
hots+0.512
exec+0.477
camp+0.470
Neg logit contributions
Tam-0.895
paralleled-0.879
lear-0.864
-0.863
Compare-0.859
Token state
Token ID1181
Feature activation-6.84e-01

Pos logit contributions
 Mir+0.245
 Citation+0.242
hots+0.232
exec+0.221
camp+0.219
Neg logit contributions
Tam-0.253
paralleled-0.248
lear-0.243
-0.243
��-0.241
Token of
Token ID286
Feature activation-9.30e-01

Pos logit contributions
 Mir+0.437
 Citation+0.434
hots+0.418
exec+0.396
camp+0.392
Neg logit contributions
Tam-0.551
paralleled-0.539
lear-0.527
��-0.526
-0.526
Token the
Token ID262
Feature activation+8.44e-01

Pos logit contributions
 Mir+0.706
 Citation+0.692
hots+0.667
exec+0.638
camp+0.631
Neg logit contributions
Tam-0.641
paralleled-0.632
lear-0.618
-0.616
��-0.615
Token art
Token ID1242
Feature activation+5.05e+00

Pos logit contributions
Tam+0.726
paralleled+0.712
lear+0.699
+0.698
Compare+0.695
Neg logit contributions
 Mir-0.495
 Citation-0.489
hots-0.469
exec-0.441
camp-0.436
Token Is
Token ID1148
Feature activation-3.47e-02

Pos logit contributions
Tam+3.858
paralleled+3.794
lear+3.691
Compare+3.681
+3.599
Neg logit contributions
 Citation-3.262
 Mir-3.222
hots-3.207
exec-3.041
ancial-2.975
Tokenot
Token ID313
Feature activation-3.32e-02

Pos logit contributions
 Mir+0.021
 Citation+0.021
hots+0.021
exec+0.019
camp+0.019
Neg logit contributions
Tam-0.029
paralleled-0.028
-0.027
lear-0.027
Compare-0.027
Tokenope
Token ID3008
Feature activation-1.38e+00

Pos logit contributions
 Mir+0.028
 Citation+0.028
hots+0.027
exec+0.026
camp+0.026
Neg logit contributions
Tam-0.020
paralleled-0.020
-0.019
��-0.019
Compare-0.019
Token Chron
Token ID8750
Feature activation-2.05e+00

Pos logit contributions
 Mir+0.921
 Citation+0.916
hots+0.875
exec+0.826
camp+0.825
Neg logit contributions
Tam-1.078
paralleled-1.057
-1.032
lear-1.024
Compare-1.019
Tokenology
Token ID1435
Feature activation+1.27e+00

Pos logit contributions
 Mir+1.513
 Citation+1.508
hots+1.468
exec+1.398
camp+1.382
Neg logit contributions
Tam-1.442
paralleled-1.407
��-1.374
-1.369
Compare-1.359
Token Women
Token ID6926
Feature activation-1.24e+00

Pos logit contributions
 Mir+0.319
 Citation+0.316
hots+0.297
exec+0.276
camp+0.275
Neg logit contributions
Tam-0.619
paralleled-0.612
lear-0.598
Compare-0.597
-0.596
Token for
Token ID329
Feature activation-1.47e+00

Pos logit contributions
 Mir+0.762
 Citation+0.750
hots+0.718
exec+0.686
camp+0.673
Neg logit contributions
Tam-1.032
paralleled-1.009
��-0.989
lear-0.988
Compare-0.987
Token America
Token ID2253
Feature activation-3.06e+00

Pos logit contributions
 Mir+1.402
 Citation+1.363
hots+1.297
exec+1.284
 Lar+1.281
Neg logit contributions
Tam-0.740
paralleled-0.733
-0.713
��-0.702
Compare-0.702
Token,
Token ID11
Feature activation-2.50e-01

Pos logit contributions
 Mir+1.932
 Citation+1.899
hots+1.823
exec+1.764
camp+1.736
Neg logit contributions
Tam-2.461
paralleled-2.424
-2.397
lear-2.360
Compare-2.351
Token we
Token ID356
Feature activation+1.96e+00

Pos logit contributions
 Mir+0.147
 Citation+0.145
hots+0.140
exec+0.132
camp+0.129
Neg logit contributions
Tam-0.214
paralleled-0.210
-0.206
lear-0.205
Compare-0.205
Tokenpt
Token ID457
Feature activation+5.02e+00

Pos logit contributions
Tam+1.279
paralleled+1.248
lear+1.214
Compare+1.206
+1.205
Neg logit contributions
 Citation-1.535
 Mir-1.533
hots-1.488
exec-1.419
camp-1.407
Token as
Token ID355
Feature activation+1.02e+00

Pos logit contributions
Tam+3.855
paralleled+3.777
lear+3.687
Compare+3.639
+3.601
Neg logit contributions
 Citation-3.288
hots-3.133
 Mir-3.126
ancial-2.995
exec-2.938
Token she
Token ID673
Feature activation+9.80e-01

Pos logit contributions
Tam+0.989
paralleled+0.970
lear+0.953
+0.950
Compare+0.949
Neg logit contributions
 Citation-0.472
 Mir-0.471
hots-0.452
exec-0.414
camp-0.409
Token told
Token ID1297
Feature activation+2.54e-02

Pos logit contributions
Tam+0.796
paralleled+0.780
lear+0.762
+0.759
Compare+0.759
Neg logit contributions
 Citation-0.613
 Mir-0.613
hots-0.592
exec-0.557
camp-0.552
Token the
Token ID262
Feature activation-2.95e-01

Pos logit contributions
Tam+0.021
paralleled+0.021
lear+0.021
+0.021
Compare+0.021
Neg logit contributions
 Mir-0.015
 Citation-0.015
hots-0.015
exec-0.014
camp-0.014
Token committee
Token ID5583
Feature activation-2.90e-01

Pos logit contributions
 Mir+0.184
 Citation+0.183
hots+0.175
exec+0.166
camp+0.164
Neg logit contributions
Tam-0.242
paralleled-0.237
lear-0.231
-0.231
Compare-0.230
Token quite
Token ID2407
Feature activation+2.25e-01

Pos logit contributions
 Mir+0.367
 Citation+0.365
hots+0.351
exec+0.333
camp+0.328
Neg logit contributions
Tam-0.458
paralleled-0.449
lear-0.438
-0.436
Compare-0.435
Token simple
Token ID2829
Feature activation-7.30e-01

Pos logit contributions
Tam+0.178
paralleled+0.175
lear+0.170
+0.170
Compare+0.169
Neg logit contributions
 Mir-0.147
 Citation-0.146
hots-0.141
exec-0.133
camp-0.131
Token:
Token ID25
Feature activation+6.90e-01

Pos logit contributions
 Mir+0.513
 Citation+0.510
hots+0.493
exec+0.470
camp+0.464
Neg logit contributions
Tam-0.541
paralleled-0.533
lear-0.520
-0.518
��-0.513
Token
Token ID198
Feature activation-2.94e-01

Pos logit contributions
Tam+0.558
paralleled+0.546
lear+0.533
Compare+0.532
+0.532
Neg logit contributions
 Mir-0.435
 Citation-0.435
hots-0.420
exec-0.396
camp-0.392
Token
Token ID198
Feature activation-1.33e-01

Pos logit contributions
 Mir+0.175
 Citation+0.174
hots+0.167
exec+0.158
camp+0.156
Neg logit contributions
Tam-0.248
paralleled-0.244
��-0.240
-0.239
lear-0.239
Tokenfunction
Token ID8818
Feature activation+4.91e+00

Pos logit contributions
 Mir+0.089
 Citation+0.089
hots+0.087
exec+0.082
camp+0.081
Neg logit contributions
Tam-0.102
paralleled-0.100
lear-0.097
-0.097
Compare-0.096
Token ob
Token ID909
Feature activation+6.86e-01

Pos logit contributions
Tam+3.733
Compare+3.622
paralleled+3.620
lear+3.547
+3.521
Neg logit contributions
 Citation-3.227
 Mir-3.181
hots-3.151
exec-2.954
camp-2.942
Tokenb
Token ID65
Feature activation+2.80e-01

Pos logit contributions
Tam+0.462
paralleled+0.450
lear+0.436
+0.435
Compare+0.432
Neg logit contributions
 Citation-0.524
 Mir-0.523
hots-0.510
exec-0.489
camp-0.485
Token_
Token ID62
Feature activation+4.22e-01

Pos logit contributions
Tam+0.235
paralleled+0.231
+0.225
lear+0.225
��+0.223
Neg logit contributions
 Mir-0.168
 Citation-0.168
hots-0.162
exec-0.153
camp-0.151
Tokensub
Token ID7266
Feature activation-4.52e-01

Pos logit contributions
Tam+0.330
paralleled+0.323
lear+0.315
+0.314
Compare+0.314
Neg logit contributions
 Mir-0.278
 Citation-0.277
hots-0.269
exec-0.254
camp-0.252
Tokendiv
Token ID7146
Feature activation+8.20e-01

Pos logit contributions
 Mir+0.297
 Citation+0.296
hots+0.287
exec+0.274
camp+0.273
Neg logit contributions
Tam-0.354
paralleled-0.343
-0.338
Compare-0.335
��-0.334
Token Gr
Token ID1902
Feature activation+1.74e+00

Pos logit contributions
Tam+0.384
paralleled+0.378
lear+0.369
+0.367
��+0.366
Neg logit contributions
 Mir-0.309
 Citation-0.307
hots-0.293
exec-0.278
camp-0.277
Tokenuden
Token ID44452
Feature activation-4.45e-01

Pos logit contributions
Tam+0.994
paralleled+0.968
lear+0.935
+0.930
Compare+0.929
Neg logit contributions
 Mir-1.513
 Citation-1.512
hots-1.473
exec-1.408
camp-1.395
Token said
Token ID531
Feature activation-7.38e-01

Pos logit contributions
 Mir+0.277
 Citation+0.274
hots+0.266
exec+0.250
camp+0.249
Neg logit contributions
Tam-0.368
paralleled-0.357
lear-0.349
-0.349
��-0.349
Token Duke
Token ID11083
Feature activation+1.10e+00

Pos logit contributions
 Mir+0.442
 Citation+0.438
hots+0.420
exec+0.396
camp+0.394
Neg logit contributions
Tam-0.625
paralleled-0.613
lear-0.598
-0.597
Compare-0.596
Token I
Token ID314
Feature activation+3.64e-01

Pos logit contributions
Tam+0.913
paralleled+0.896
lear+0.874
+0.873
Compare+0.872
Neg logit contributions
 Mir-0.666
 Citation-0.664
hots-0.640
exec-0.602
camp-0.596
Tokenhen
Token ID831
Feature activation+4.89e+00

Pos logit contributions
Tam+0.273
paralleled+0.267
+0.262
Compare+0.261
��+0.258
Neg logit contributions
 Mir-0.250
 Citation-0.247
hots-0.243
exec-0.230
camp-0.228
Tokenacho
Token ID43703
Feature activation-4.27e-01

Pos logit contributions
Tam+2.857
paralleled+2.782
lear+2.689
��+2.644
Compare+2.639
Neg logit contributions
 Citation-4.153
 Mir-4.092
hots-4.009
exec-3.845
ancial-3.761
Token also
Token ID635
Feature activation+1.01e+00

Pos logit contributions
 Mir+0.266
 Citation+0.263
hots+0.255
exec+0.239
camp+0.239
Neg logit contributions
Tam-0.351
paralleled-0.344
-0.336
Compare-0.336
lear-0.335
Token has
Token ID468
Feature activation+1.10e+00

Pos logit contributions
Tam+0.819
paralleled+0.801
lear+0.782
+0.781
Compare+0.781
Neg logit contributions
 Mir-0.631
 Citation-0.629
hots-0.608
exec-0.573
camp-0.567
Token gotten
Token ID7891
Feature activation+1.65e+00

Pos logit contributions
Tam+0.916
paralleled+0.897
lear+0.877
+0.874
Compare+0.874
Neg logit contributions
 Mir-0.669
 Citation-0.668
hots-0.644
exec-0.605
camp-0.599
Token some
Token ID617
Feature activation+1.50e-01

Pos logit contributions
Tam+1.377
paralleled+1.351
lear+1.319
Compare+1.316
+1.311
Neg logit contributions
 Citation-0.987
 Mir-0.986
hots-0.949
exec-0.894
camp-0.883
Token )
Token ID1267
Feature activation+1.74e+00

Pos logit contributions
Tam+0.963
paralleled+0.942
lear+0.917
+0.915
Compare+0.914
Neg logit contributions
 Mir-0.893
 Citation-0.892
hots-0.863
exec-0.819
camp-0.811
Token AS
Token ID7054
Feature activation+2.35e+00

Pos logit contributions
Tam+1.107
paralleled+1.078
lear+1.045
+1.042
Compare+1.041
Neg logit contributions
 Mir-1.399
 Citation-1.397
hots-1.360
exec-1.299
camp-1.288
Token count
Token ID954
Feature activation+2.71e+00

Pos logit contributions
Tam+1.854
paralleled+1.815
lear+1.771
+1.767
Compare+1.763
Neg logit contributions
 Mir-1.532
 Citation-1.529
hots-1.475
exec-1.395
camp-1.379
Token FROM
Token ID16034
Feature activation+2.33e+00

Pos logit contributions
Tam+2.025
paralleled+1.977
Compare+1.932
lear+1.923
+1.919
Neg logit contributions
 Mir-1.874
 Citation-1.863
hots-1.811
exec-1.704
camp-1.699
Token (
Token ID357
Feature activation+2.06e+00

Pos logit contributions
Tam+1.690
paralleled+1.652
lear+1.607
Compare+1.603
+1.603
Neg logit contributions
 Mir-1.666
 Citation-1.664
hots-1.614
exec-1.532
camp-1.518
Token SELECT
Token ID33493
Feature activation+4.89e+00

Pos logit contributions
Tam+1.547
paralleled+1.514
lear+1.475
Compare+1.472
+1.470
Neg logit contributions
 Mir-1.420
 Citation-1.417
hots-1.373
exec-1.300
camp-1.289
Token SPL
Token ID46341
Feature activation+4.67e-01

Pos logit contributions
Tam+3.796
paralleled+3.726
Compare+3.659
lear+3.624
+3.586
Neg logit contributions
 Mir-3.114
 Citation-3.110
hots-3.109
exec-2.860
camp-2.851
TokenIT
Token ID2043
Feature activation-3.06e-01

Pos logit contributions
Tam+0.329
paralleled+0.324
+0.315
lear+0.314
��+0.312
Neg logit contributions
 Citation-0.343
 Mir-0.343
hots-0.331
exec-0.317
camp-0.312
Token (
Token ID357
Feature activation+1.53e+00

Pos logit contributions
 Mir+0.191
 Citation+0.190
hots+0.183
exec+0.173
camp+0.171
Neg logit contributions
Tam-0.250
paralleled-0.246
-0.240
lear-0.239
��-0.238
Token content
Token ID2695
Feature activation+1.15e+00

Pos logit contributions
Tam+1.235
paralleled+1.211
lear+1.182
Compare+1.181
+1.176
Neg logit contributions
 Mir-0.962
 Citation-0.960
hots-0.931
exec-0.872
camp-0.866
Token,
Token ID837
Feature activation+1.77e+00

Pos logit contributions
Tam+0.893
paralleled+0.874
lear+0.852
+0.850
Compare+0.849
Neg logit contributions
 Mir-0.766
 Citation-0.764
hots-0.739
exec-0.699
camp-0.692
Tokenul
Token ID377
Feature activation+9.95e-01

Pos logit contributions
Tam+0.720
paralleled+0.703
+0.684
lear+0.683
Compare+0.683
Neg logit contributions
 Mir-0.712
 Citation-0.710
hots-0.690
exec-0.655
camp-0.650
Tokenik
Token ID1134
Feature activation+1.45e+00

Pos logit contributions
Tam+0.770
paralleled+0.752
lear+0.733
Compare+0.732
+0.732
Neg logit contributions
 Mir-0.664
 Citation-0.662
hots-0.641
exec-0.607
camp-0.600
Token and
Token ID290
Feature activation-2.62e+00

Pos logit contributions
Tam+1.142
paralleled+1.118
lear+1.089
Compare+1.088
+1.087
Neg logit contributions
 Mir-0.950
 Citation-0.947
hots-0.917
exec-0.866
camp-0.857
Token John
Token ID1757
Feature activation-5.64e-02

Pos logit contributions
 Mir+1.810
 Citation+1.752
hots+1.680
exec+1.608
 Lar+1.592
Neg logit contributions
Tam-2.004
paralleled-1.978
Compare-1.936
lear-1.929
-1.921
Token He
Token ID679
Feature activation+1.90e-01

Pos logit contributions
 Mir+0.037
 Citation+0.037
hots+0.036
exec+0.034
camp+0.033
Neg logit contributions
Tam-0.044
paralleled-0.044
lear-0.042
-0.042
Compare-0.042
Tokenyn
Token ID2047
Feature activation+4.84e+00

Pos logit contributions
Tam+0.149
paralleled+0.145
+0.142
Compare+0.142
��+0.142
Neg logit contributions
 Mir-0.125
 Citation-0.124
hots-0.121
exec-0.114
camp-0.113
Token went
Token ID1816
Feature activation-1.05e+00

Pos logit contributions
Tam+3.700
paralleled+3.604
lear+3.563
+3.515
Compare+3.486
Neg logit contributions
 Citation-3.196
 Mir-3.183
hots-3.065
exec-2.907
ancial-2.844
Token to
Token ID284
Feature activation-6.38e-01

Pos logit contributions
 Mir+0.647
 Citation+0.639
hots+0.619
exec+0.582
camp+0.576
Neg logit contributions
Tam-0.879
paralleled-0.859
lear-0.840
-0.839
��-0.837
Token a
Token ID257
Feature activation-5.06e-01

Pos logit contributions
 Mir+0.416
 Citation+0.411
hots+0.398
exec+0.376
camp+0.374
Neg logit contributions
Tam-0.505
paralleled-0.496
-0.484
lear-0.483
Compare-0.481
Token Jud
Token ID4794
Feature activation+2.01e-01

Pos logit contributions
 Mir+0.311
 Citation+0.309
hots+0.298
exec+0.280
camp+0.278
Neg logit contributions
Tam-0.419
paralleled-0.411
lear-0.401
-0.401
Compare-0.400
Tokenas
Token ID292
Feature activation-9.69e-01

Pos logit contributions
Tam+0.158
paralleled+0.155
lear+0.151
+0.151
Compare+0.151
Neg logit contributions
 Mir-0.131
 Citation-0.131
hots-0.127
exec-0.120
camp-0.118
Token away
Token ID1497
Feature activation-2.47e-01

Pos logit contributions
 Mir+0.451
 Citation+0.443
hots+0.427
exec+0.398
camp+0.398
Neg logit contributions
Tam-0.652
paralleled-0.638
-0.632
��-0.630
lear-0.629
Token,
Token ID11
Feature activation+2.94e-01

Pos logit contributions
 Mir+0.165
 Citation+0.163
hots+0.158
exec+0.149
camp+0.148
Neg logit contributions
Tam-0.192
paralleled-0.189
-0.185
lear-0.184
Compare-0.183
Token and
Token ID290
Feature activation-5.05e-01

Pos logit contributions
Tam+0.247
paralleled+0.242
lear+0.237
+0.236
Compare+0.236
Neg logit contributions
 Mir-0.176
 Citation-0.176
hots-0.169
exec-0.159
camp-0.157
Token called
Token ID1444
Feature activation-4.33e-02

Pos logit contributions
 Mir+0.308
 Citation+0.303
hots+0.291
exec+0.274
camp+0.272
Neg logit contributions
Tam-0.423
paralleled-0.416
-0.406
��-0.405
lear-0.404
Token to
Token ID284
Feature activation+3.37e+00

Pos logit contributions
 Mir+0.028
 Citation+0.028
hots+0.027
exec+0.025
camp+0.025
Neg logit contributions
Tam-0.035
paralleled-0.034
lear-0.033
-0.033
��-0.033
Token see
Token ID766
Feature activation+4.80e+00

Pos logit contributions
Tam+2.699
paralleled+2.626
Compare+2.578
lear+2.575
+2.563
Neg logit contributions
 Citation-2.140
 Mir-2.126
hots-2.084
exec-1.959
ancial-1.938
Token if
Token ID611
Feature activation+8.97e-01

Pos logit contributions
Tam+3.224
paralleled+3.099
Compare+3.037
+3.022
lear+3.014
Neg logit contributions
 Citation-3.580
 Mir-3.540
hots-3.529
exec-3.354
ancial-3.328
Token anyone
Token ID2687
Feature activation-5.02e-01

Pos logit contributions
Tam+0.702
paralleled+0.688
lear+0.671
+0.669
Compare+0.669
Neg logit contributions
 Mir-0.591
 Citation-0.589
hots-0.568
exec-0.537
camp-0.532
Token there
Token ID612
Feature activation-4.15e-01

Pos logit contributions
 Mir+0.308
 Citation+0.299
hots+0.295
exec+0.275
camp+0.272
Neg logit contributions
Tam-0.419
paralleled-0.412
��-0.410
-0.404
Compare-0.402
Token knew
Token ID2993
Feature activation-6.74e-01

Pos logit contributions
 Mir+0.252
 Citation+0.245
hots+0.243
exec+0.227
camp+0.225
Neg logit contributions
Tam-0.347
paralleled-0.340
��-0.338
-0.333
Compare-0.332
Token of
Token ID286
Feature activation-4.33e-01

Pos logit contributions
 Mir+0.415
 Citation+0.410
hots+0.392
exec+0.369
camp+0.369
Neg logit contributions
Tam-0.559
paralleled-0.551
��-0.537
-0.537
lear-0.537
Token albeit
Token ID18244
Feature activation+3.33e+00

Pos logit contributions
 Mir+0.273
 Citation+0.270
hots+0.261
exec+0.247
camp+0.243
Neg logit contributions
Tam-0.341
paralleled-0.335
lear-0.327
-0.326
Compare-0.325
Token operating
Token ID5361
Feature activation+1.99e-02

Pos logit contributions
Tam+2.633
paralleled+2.581
lear+2.517
Compare+2.515
+2.504
Neg logit contributions
 Citation-2.138
 Mir-2.115
hots-2.055
exec-1.940
camp-1.924
Token more
Token ID517
Feature activation+7.36e-01

Pos logit contributions
Tam+0.017
paralleled+0.016
lear+0.016
+0.016
Compare+0.016
Neg logit contributions
 Mir-0.012
 Citation-0.012
hots-0.012
exec-0.011
camp-0.011
Token slowly
Token ID6364
Feature activation+1.08e+00

Pos logit contributions
Tam+0.538
paralleled+0.526
+0.512
lear+0.512
Compare+0.510
Neg logit contributions
 Mir-0.525
 Citation-0.520
hots-0.506
exec-0.481
camp-0.476
Token than
Token ID621
Feature activation-1.88e+00

Pos logit contributions
Tam+0.771
paralleled+0.753
lear+0.735
+0.734
Compare+0.729
Neg logit contributions
 Mir-0.791
 Citation-0.788
hots-0.764
exec-0.727
camp-0.721
Token normal
Token ID3487
Feature activation+4.79e+00

Pos logit contributions
 Mir+1.166
 Citation+1.143
hots+1.100
exec+1.027
camp+1.013
Neg logit contributions
Tam-1.562
paralleled-1.535
-1.509
lear-1.501
��-1.497
Token,
Token ID11
Feature activation+1.52e-01

Pos logit contributions
Tam+3.187
paralleled+3.101
Compare+3.015
lear+2.998
+2.986
Neg logit contributions
 Citation-3.669
 Mir-3.661
hots-3.575
exec-3.395
camp-3.363
Token says
Token ID1139
Feature activation-7.62e-01

Pos logit contributions
Tam+0.120
paralleled+0.117
lear+0.114
+0.114
Compare+0.114
Neg logit contributions
 Mir-0.100
 Citation-0.100
hots-0.096
exec-0.091
camp-0.090
Token Timothy
Token ID22283
Feature activation+6.76e-03

Pos logit contributions
 Mir+0.476
 Citation+0.467
hots+0.445
exec+0.424
camp+0.417
Neg logit contributions
Tam-0.627
paralleled-0.620
lear-0.604
-0.604
Compare-0.602
Token Cot
Token ID42266
Feature activation+6.63e-01

Pos logit contributions
Tam+0.005
paralleled+0.005
lear+0.005
+0.005
Compare+0.005
Neg logit contributions
 Mir-0.004
 Citation-0.004
hots-0.004
exec-0.004
camp-0.004
Tokené
Token ID2634
Feature activation+1.05e+00

Pos logit contributions
Tam+0.460
paralleled+0.448
+0.436
Compare+0.435
��+0.434
Neg logit contributions
 Mir-0.496
 Citation-0.495
hots-0.480
exec-0.458
camp-0.455
Token than
Token ID621
Feature activation+9.09e-01

Pos logit contributions
Tam+0.715
paralleled+0.696
+0.683
lear+0.680
��+0.680
Neg logit contributions
 Mir-0.684
 Citation-0.676
hots-0.660
exec-0.623
camp-0.619
Token two
Token ID734
Feature activation+2.40e-01

Pos logit contributions
Tam+0.764
paralleled+0.749
lear+0.732
+0.731
Compare+0.729
Neg logit contributions
 Mir-0.546
 Citation-0.545
hots-0.525
exec-0.493
camp-0.488
Token.
Token ID13
Feature activation-1.31e-01

Pos logit contributions
Tam+0.186
paralleled+0.182
+0.178
lear+0.178
��+0.177
Neg logit contributions
 Mir-0.162
 Citation-0.160
hots-0.155
exec-0.146
camp-0.144
Token In
Token ID554
Feature activation+3.11e-02

Pos logit contributions
 Mir+0.088
 Citation+0.087
hots+0.082
exec+0.078
camp+0.078
Neg logit contributions
Tam-0.103
paralleled-0.102
lear-0.099
��-0.099
-0.098
Token other
Token ID584
Feature activation-8.04e-01

Pos logit contributions
Tam+0.025
paralleled+0.024
lear+0.024
+0.024
��+0.023
Neg logit contributions
 Mir-0.020
 Citation-0.020
hots-0.019
exec-0.019
camp-0.018
Token words
Token ID2456
Feature activation+4.73e+00

Pos logit contributions
 Mir+0.545
 Citation+0.538
hots+0.520
exec+0.494
camp+0.487
Neg logit contributions
Tam-0.617
paralleled-0.606
-0.591
��-0.587
lear-0.586
Token,
Token ID11
Feature activation+1.61e+00

Pos logit contributions
Tam+3.355
paralleled+3.272
Compare+3.194
lear+3.183
��+3.160
Neg logit contributions
 Citation-3.350
 Mir-3.331
hots-3.293
exec-3.096
ancial-3.082
Token he
Token ID339
Feature activation+6.07e-02

Pos logit contributions
Tam+1.308
paralleled+1.281
lear+1.251
+1.248
Compare+1.246
Neg logit contributions
 Mir-1.008
 Citation-1.006
hots-0.970
exec-0.915
camp-0.904
Token wanted
Token ID2227
Feature activation+8.08e-01

Pos logit contributions
Tam+0.050
paralleled+0.048
+0.047
��+0.047
lear+0.047
Neg logit contributions
 Mir-0.038
 Citation-0.038
hots-0.036
exec-0.034
camp-0.034
Token to
Token ID284
Feature activation+1.00e+00

Pos logit contributions
Tam+0.671
paralleled+0.658
+0.643
lear+0.643
Compare+0.639
Neg logit contributions
 Mir-0.499
 Citation-0.497
hots-0.474
exec-0.449
camp-0.441
Token increase
Token ID2620
Feature activation-4.13e-01

Pos logit contributions
Tam+0.781
paralleled+0.765
+0.747
lear+0.745
Compare+0.742
Neg logit contributions
 Mir-0.664
 Citation-0.662
hots-0.637
exec-0.605
camp-0.599
Token,
Token ID11
Feature activation-5.28e-01

Pos logit contributions
Tam+0.906
paralleled+0.883
lear+0.861
Compare+0.860
+0.859
Neg logit contributions
 Citation-0.796
 Mir-0.792
hots-0.770
exec-0.732
camp-0.719
Token you
Token ID345
Feature activation+9.56e-02

Pos logit contributions
 Mir+0.300
 Citation+0.298
hots+0.285
exec+0.268
camp+0.265
Neg logit contributions
Tam-0.462
paralleled-0.455
lear-0.445
-0.444
Compare-0.442
Token will
Token ID481
Feature activation-5.90e-01

Pos logit contributions
Tam+0.082
paralleled+0.080
��+0.079
+0.079
Compare+0.078
Neg logit contributions
 Mir-0.056
 Citation-0.056
hots-0.054
exec-0.050
camp-0.050
Token be
Token ID307
Feature activation-3.71e-01

Pos logit contributions
 Mir+0.373
 Citation+0.370
hots+0.352
exec+0.334
camp+0.332
Neg logit contributions
Tam-0.482
paralleled-0.474
��-0.465
-0.464
lear-0.459
Token let
Token ID1309
Feature activation+1.05e+00

Pos logit contributions
 Mir+0.210
 Citation+0.208
hots+0.200
exec+0.187
camp+0.185
Neg logit contributions
Tam-0.325
paralleled-0.319
-0.312
lear-0.312
Compare-0.311
Token off
Token ID572
Feature activation+4.70e+00

Pos logit contributions
Tam+0.899
paralleled+0.879
lear+0.864
Compare+0.861
+0.858
Neg logit contributions
 Mir-0.612
 Citation-0.612
hots-0.590
exec-0.555
camp-0.543
Token.
Token ID13
Feature activation-8.93e-01

Pos logit contributions
Tam+3.627
paralleled+3.565
lear+3.450
+3.446
Compare+3.443
Neg logit contributions
 Citation-3.025
 Mir-3.017
hots-2.968
exec-2.796
camp-2.749
Token The
Token ID383
Feature activation-2.07e-01

Pos logit contributions
 Mir+0.636
 Citation+0.632
hots+0.598
camp+0.571
exec+0.570
Neg logit contributions
Tam-0.658
paralleled-0.654
lear-0.636
��-0.634
-0.628
Token sentence
Token ID6827
Feature activation+9.83e-02

Pos logit contributions
 Mir+0.139
 Citation+0.138
hots+0.133
exec+0.127
camp+0.125
Neg logit contributions
Tam-0.160
paralleled-0.157
lear-0.153
-0.153
Compare-0.152
Token of
Token ID286
Feature activation-3.85e-01

Pos logit contributions
Tam+0.079
paralleled+0.077
lear+0.075
+0.075
��+0.075
Neg logit contributions
 Mir-0.064
 Citation-0.063
hots-0.061
exec-0.057
camp-0.057
Token 20
Token ID1160
Feature activation+1.32e+00

Pos logit contributions
 Mir+0.237
 Citation+0.236
hots+0.226
exec+0.213
camp+0.211
Neg logit contributions
Tam-0.319
paralleled-0.314
-0.307
lear-0.306
Compare-0.305
Token than
Token ID621
Feature activation-7.00e-01

Pos logit contributions
Tam+0.301
paralleled+0.295
lear+0.289
+0.289
��+0.286
Neg logit contributions
 Mir-0.260
 Citation-0.258
hots-0.249
exec-0.237
camp-0.233
Token the
Token ID262
Feature activation+8.95e-01

Pos logit contributions
 Mir+0.425
 Citation+0.422
hots+0.404
exec+0.380
camp+0.372
Neg logit contributions
Tam-0.589
paralleled-0.576
lear-0.567
-0.565
��-0.560
Token standard
Token ID3210
Feature activation+4.04e+00

Pos logit contributions
Tam+0.693
paralleled+0.678
lear+0.662
+0.660
Compare+0.658
Neg logit contributions
 Mir-0.599
 Citation-0.597
hots-0.575
exec-0.545
camp-0.538
Token F
Token ID376
Feature activation+3.94e+00

Pos logit contributions
Tam+2.645
paralleled+2.590
Compare+2.509
lear+2.495
+2.472
Neg logit contributions
 Citation-3.060
hots-3.052
 Mir-3.049
ancial-2.879
exec-2.871
Token-
Token ID12
Feature activation+7.64e-01

Pos logit contributions
Tam+2.940
paralleled+2.867
Compare+2.835
lear+2.815
+2.772
Neg logit contributions
 Citation-2.604
 Mir-2.597
hots-2.597
camp-2.424
ancial-2.411
Token150
Token ID8628
Feature activation+4.66e+00

Pos logit contributions
Tam+0.643
paralleled+0.631
lear+0.617
Compare+0.616
+0.613
Neg logit contributions
 Mir-0.457
 Citation-0.455
hots-0.439
exec-0.411
camp-0.408
Token for
Token ID329
Feature activation+2.10e+00

Pos logit contributions
Tam+3.343
paralleled+3.244
Compare+3.170
lear+3.146
+3.118
Neg logit contributions
 Mir-3.261
 Citation-3.260
hots-3.246
exec-3.031
camp-3.026
Token improved
Token ID6596
Feature activation+1.02e+00

Pos logit contributions
Tam+1.671
paralleled+1.635
lear+1.601
+1.599
Compare+1.591
Neg logit contributions
 Mir-1.357
 Citation-1.349
hots-1.302
exec-1.231
camp-1.215
Token stability
Token ID10159
Feature activation+2.34e+00

Pos logit contributions
Tam+0.819
paralleled+0.801
lear+0.784
+0.783
Compare+0.778
Neg logit contributions
 Mir-0.650
 Citation-0.645
hots-0.621
exec-0.590
camp-0.581
Token off
Token ID572
Feature activation+8.93e-01

Pos logit contributions
Tam+1.845
paralleled+1.808
Compare+1.762
lear+1.760
+1.751
Neg logit contributions
 Mir-1.521
 Citation-1.521
hots-1.471
exec-1.388
camp-1.375
Token-
Token ID12
Feature activation+7.20e-01

Pos logit contributions
Tam+0.726
paralleled+0.711
lear+0.695
+0.693
Compare+0.692
Neg logit contributions
 Mir-0.560
 Citation-0.559
hots-0.540
exec-0.509
camp-0.503
Token:
Token ID25
Feature activation-3.37e+00

Pos logit contributions
 Mir+3.577
 Citation+3.498
hots+3.374
camp+3.287
exec+3.283
Neg logit contributions
 millenn-3.918
��-3.794
paralleled-3.512
Tam-3.505
lear-3.419
Token
Token ID198
Feature activation-5.51e+00

Pos logit contributions
 Mir+1.874
 Citation+1.849
hots+1.765
exec+1.669
camp+1.658
Neg logit contributions
��-2.992
Tam-2.962
paralleled-2.942
 millenn-2.923
-2.886
Token
Token ID198
Feature activation-5.29e+00

Pos logit contributions
 Mir+2.825
 Citation+2.728
hots+2.563
camp+2.453
exec+2.436
Neg logit contributions
 millenn-5.404
��-5.316
paralleled-5.037
Tam-4.996
-4.981
TokenShow
Token ID15307
Feature activation+6.79e-01

Pos logit contributions
hots+2.613
 Mir+2.572
 Citation+2.559
exec+2.411
camp+2.388
Neg logit contributions
��-4.799
paralleled-4.763
Tam-4.690
-4.690
 Lenovo-4.657
Token nested
Token ID28376
Feature activation-1.44e+00

Pos logit contributions
Tam+0.624
paralleled+0.613
+0.597
lear+0.595
Compare+0.592
Neg logit contributions
 Citation-0.344
 Mir-0.341
hots-0.334
exec-0.303
ancial-0.303
Token quote
Token ID9577
Feature activation-5.99e+00

Pos logit contributions
 Mir+0.704
 Citation+0.702
hots+0.672
exec+0.619
camp+0.612
Neg logit contributions
Tam-1.372
paralleled-1.349
lear-1.322
Compare-1.320
-1.320
Token +
Token ID1343
Feature activation-2.40e+00

Pos logit contributions
 Citation+3.239
 Mir+3.238
hots+3.077
exec+2.877
 Lar+2.843
Neg logit contributions
Tam-5.502
paralleled-5.315
��-5.291
lear-5.191
-5.188
Token
Token ID198
Feature activation-2.57e+00

Pos logit contributions
 Mir+1.359
 Citation+1.350
hots+1.252
exec+1.184
camp+1.179
Neg logit contributions
Tam-2.100
��-2.099
paralleled-2.092
lear-2.051
 millenn-2.035
Token
Token ID198
Feature activation-2.03e+00

Pos logit contributions
 Mir+1.438
 Citation+1.416
hots+1.343
exec+1.270
camp+1.266
Neg logit contributions
��-2.274
Tam-2.251
paralleled-2.241
 millenn-2.220
lear-2.201
TokenOn
Token ID2202
Feature activation-1.37e+00

Pos logit contributions
 Mir+1.270
 Citation+1.268
hots+1.239
exec+1.175
camp+1.162
Neg logit contributions
Tam-1.614
paralleled-1.611
��-1.586
-1.563
lear-1.555
Token September
Token ID2693
Feature activation-1.52e+00

Pos logit contributions
 Mir+0.805
 Citation+0.803
hots+0.773
exec+0.725
camp+0.717
Neg logit contributions
Tam-1.162
paralleled-1.139
lear-1.113
-1.111
Compare-1.110
TokenTA
Token ID5603
Feature activation+1.44e+00

Pos logit contributions
Tam+0.205
paralleled+0.200
lear+0.192
+0.192
Compare+0.191
Neg logit contributions
 Mir-0.345
 Citation-0.345
hots-0.335
exec-0.322
camp-0.320
TokenWA
Token ID15543
Feature activation-5.55e-01

Pos logit contributions
Tam+0.943
paralleled+0.930
lear+0.905
+0.900
��+0.897
Neg logit contributions
 Mir-1.135
 Citation-1.127
hots-1.091
exec-1.049
camp-1.041
Token (
Token ID357
Feature activation-3.26e+00

Pos logit contributions
 Mir+0.429
 Citation+0.424
hots+0.409
exec+0.393
camp+0.387
Neg logit contributions
Tam-0.374
paralleled-0.366
lear-0.360
-0.357
��-0.356
TokenReuters
Token ID12637
Feature activation-4.61e+00

Pos logit contributions
 Mir+1.957
 Citation+1.952
hots+1.808
exec+1.756
camp+1.716
Neg logit contributions
paralleled-2.703
Tam-2.680
��-2.649
lear-2.633
-2.608
Token)
Token ID8
Feature activation-2.32e+00

Pos logit contributions
 Citation+2.514
 Mir+2.505
hots+2.225
camp+2.154
exec+2.112
Neg logit contributions
paralleled-4.191
��-4.174
 millenn-4.123
Tam-4.094
lear-4.085
Token -
Token ID532
Feature activation-5.65e+00

Pos logit contributions
 Mir+1.548
 Citation+1.537
hots+1.431
camp+1.374
 Lar+1.354
Neg logit contributions
Tam-1.832
paralleled-1.824
��-1.818
lear-1.782
-1.781
Token Canada
Token ID3340
Feature activation+1.09e+00

Pos logit contributions
 Mir+4.095
 Citation+3.975
 Lar+3.628
hots+3.601
 JACK+3.515
Neg logit contributions
��-4.141
paralleled-4.046
Tam-4.029
lear-4.013
-4.002
Token
Token ID447
Feature activation-1.04e+00

Pos logit contributions
Tam+0.872
paralleled+0.854
lear+0.833
+0.831
Compare+0.830
Neg logit contributions
 Mir-0.694
 Citation-0.693
hots-0.668
exec-0.631
camp-0.624
Token
Token ID247
Feature activation-1.20e+00

Pos logit contributions
 Citation+0.767
 Mir+0.759
hots+0.726
exec+0.697
camp+0.681
Neg logit contributions
Tam-0.740
paralleled-0.720
lear-0.702
Compare-0.701
-0.699
Tokens
Token ID82
Feature activation+4.61e-01

Pos logit contributions
 Mir+0.756
 Citation+0.749
hots+0.731
exec+0.692
camp+0.679
Neg logit contributions
Tam-0.970
paralleled-0.950
��-0.933
Compare-0.930
-0.928
Token top
Token ID1353
Feature activation-1.62e+00

Pos logit contributions
Tam+0.361
paralleled+0.354
lear+0.345
Compare+0.344
+0.344
Neg logit contributions
 Mir-0.304
 Citation-0.303
hots-0.291
exec-0.276
camp-0.272
TokenV
Token ID53
Feature activation-4.88e-01

Pos logit contributions
 Mir+0.771
 Citation+0.763
hots+0.720
exec+0.688
camp+0.685
Neg logit contributions
Tam-0.831
paralleled-0.819
-0.805
lear-0.804
��-0.790
TokenANA
Token ID31574
Feature activation-2.02e+00

Pos logit contributions
 Mir+0.311
 Citation+0.308
hots+0.290
camp+0.278
exec+0.277
Neg logit contributions
Tam-0.390
paralleled-0.387
-0.380
lear-0.379
��-0.378
Token (
Token ID357
Feature activation-2.97e+00

Pos logit contributions
 Mir+1.444
 Citation+1.425
hots+1.360
exec+1.306
camp+1.288
Neg logit contributions
Tam-1.477
paralleled-1.462
lear-1.427
-1.417
��-1.410
TokenReuters
Token ID12637
Feature activation-4.86e+00

Pos logit contributions
 Mir+1.794
 Citation+1.781
hots+1.696
exec+1.622
camp+1.586
Neg logit contributions
paralleled-2.455
Tam-2.447
��-2.398
lear-2.394
-2.383
Token)
Token ID8
Feature activation-2.27e+00

Pos logit contributions
 Citation+2.646
 Mir+2.637
hots+2.326
camp+2.252
exec+2.215
Neg logit contributions
paralleled-4.419
��-4.417
 millenn-4.364
Tam-4.318
lear-4.318
Token -
Token ID532
Feature activation-5.55e+00

Pos logit contributions
 Mir+1.507
 Citation+1.498
hots+1.392
camp+1.339
 Lar+1.318
Neg logit contributions
Tam-1.781
paralleled-1.777
��-1.776
lear-1.737
 millenn-1.734
Token Latin
Token ID9133
Feature activation+5.10e-01

Pos logit contributions
 Mir+3.991
 Citation+3.873
hots+3.548
 Lar+3.510
exec+3.416
Neg logit contributions
��-4.090
Tam-4.055
paralleled-4.055
-4.006
lear-3.971
Token American
Token ID1605
Feature activation+3.48e+00

Pos logit contributions
Tam+0.247
paralleled+0.239
lear+0.228
+0.227
Compare+0.227
Neg logit contributions
 Mir-0.489
 Citation-0.487
hots-0.475
exec-0.459
camp-0.455
Token and
Token ID290
Feature activation+5.97e-01

Pos logit contributions
Tam+2.406
paralleled+2.329
lear+2.282
Compare+2.267
+2.257
Neg logit contributions
 Citation-2.570
 Mir-2.554
hots-2.510
camp-2.375
exec-2.374
Token Caribbean
Token ID18020
Feature activation+1.45e+00

Pos logit contributions
Tam+0.391
paralleled+0.382
lear+0.371
Compare+0.370
+0.369
Neg logit contributions
 Mir-0.470
 Citation-0.468
hots-0.455
exec-0.434
camp-0.430
Token leaders
Token ID2766
Feature activation-1.37e+00

Pos logit contributions
Tam+0.876
paralleled+0.852
lear+0.823
+0.823
Compare+0.821
Neg logit contributions
 Mir-1.219
 Citation-1.216
hots-1.184
exec-1.133
camp-1.123
Token Dark
Token ID3801
Feature activation+9.73e-01

Pos logit contributions
 Mir+1.854
 Citation+1.794
hots+1.719
 Lar+1.625
exec+1.625
Neg logit contributions
Tam-1.979
paralleled-1.943
Compare-1.898
lear-1.897
-1.890
Tokenho
Token ID8873
Feature activation+1.28e+00

Pos logit contributions
Tam+0.852
paralleled+0.836
lear+0.817
+0.816
Compare+0.816
Neg logit contributions
 Mir-0.550
 Citation-0.548
hots-0.529
exec-0.494
camp-0.489
Tokenarse
Token ID17208
Feature activation-1.21e+00

Pos logit contributions
Tam+0.873
paralleled+0.849
lear+0.827
Compare+0.824
+0.822
Neg logit contributions
 Mir-0.972
 Citation-0.971
hots-0.944
exec-0.898
camp-0.892
Token wrote
Token ID2630
Feature activation-4.97e+00

Pos logit contributions
 Mir+0.407
 Citation+0.405
hots+0.381
exec+0.337
camp+0.330
Neg logit contributions
Tam-1.335
paralleled-1.317
-1.291
lear-1.291
Compare-1.291
Token:
Token ID25
Feature activation-3.37e+00

Pos logit contributions
 Mir+3.577
 Citation+3.498
hots+3.374
camp+3.287
exec+3.283
Neg logit contributions
 millenn-3.918
��-3.794
paralleled-3.512
Tam-3.505
lear-3.419
Token
Token ID198
Feature activation-5.51e+00

Pos logit contributions
 Mir+1.874
 Citation+1.849
hots+1.765
exec+1.669
camp+1.658
Neg logit contributions
��-2.992
Tam-2.962
paralleled-2.942
 millenn-2.923
-2.886
Token
Token ID198
Feature activation-5.29e+00

Pos logit contributions
 Mir+2.825
 Citation+2.728
hots+2.563
camp+2.453
exec+2.436
Neg logit contributions
 millenn-5.404
��-5.316
paralleled-5.037
Tam-4.996
-4.981
TokenShow
Token ID15307
Feature activation+6.79e-01

Pos logit contributions
hots+2.613
 Mir+2.572
 Citation+2.559
exec+2.411
camp+2.388
Neg logit contributions
��-4.799
paralleled-4.763
Tam-4.690
-4.690
 Lenovo-4.657
Token nested
Token ID28376
Feature activation-1.44e+00

Pos logit contributions
Tam+0.624
paralleled+0.613
+0.597
lear+0.595
Compare+0.592
Neg logit contributions
 Citation-0.344
 Mir-0.341
hots-0.334
exec-0.303
ancial-0.303
Token quote
Token ID9577
Feature activation-5.99e+00

Pos logit contributions
 Mir+0.704
 Citation+0.702
hots+0.672
exec+0.619
camp+0.612
Neg logit contributions
Tam-1.372
paralleled-1.349
lear-1.322
Compare-1.320
-1.320
Token +
Token ID1343
Feature activation-2.40e+00

Pos logit contributions
 Citation+3.239
 Mir+3.238
hots+3.077
exec+2.877
 Lar+2.843
Neg logit contributions
Tam-5.502
paralleled-5.315
��-5.291
lear-5.191
-5.188
Token of
Token ID286
Feature activation-4.25e-01

Pos logit contributions
 Mir+0.529
 Citation+0.526
hots+0.510
camp+0.480
exec+0.479
Neg logit contributions
Tam-0.681
paralleled-0.668
lear-0.652
Compare-0.650
-0.650
Token people
Token ID661
Feature activation-5.18e-01

Pos logit contributions
 Mir+0.296
 Citation+0.292
hots+0.283
exec+0.268
camp+0.267
Neg logit contributions
Tam-0.319
paralleled-0.312
lear-0.303
��-0.303
Compare-0.303
Token who
Token ID508
Feature activation+9.02e-02

Pos logit contributions
 Mir+0.337
 Citation+0.334
hots+0.325
exec+0.305
camp+0.304
Neg logit contributions
Tam-0.410
paralleled-0.403
lear-0.392
Compare-0.391
-0.391
Token make
Token ID787
Feature activation-5.95e-01

Pos logit contributions
Tam+0.079
paralleled+0.078
lear+0.076
��+0.076
+0.076
Neg logit contributions
 Mir-0.051
 Citation-0.050
hots-0.049
exec-0.045
camp-0.045
Token a
Token ID257
Feature activation-3.29e-01

Pos logit contributions
 Mir+0.383
 Citation+0.380
hots+0.368
exec+0.346
camp+0.343
Neg logit contributions
Tam-0.476
paralleled-0.467
lear-0.456
-0.454
Compare-0.454
Token living
Token ID2877
Feature activation-5.44e+00

Pos logit contributions
 Mir+0.222
 Citation+0.220
hots+0.213
exec+0.201
camp+0.200
Neg logit contributions
Tam-0.253
paralleled-0.248
lear-0.242
-0.242
Compare-0.241
Token churn
Token ID28488
Feature activation+4.30e-01

Pos logit contributions
 Mir+2.995
 Citation+2.900
hots+2.854
camp+2.700
exec+2.618
Neg logit contributions
Tam-4.778
paralleled-4.762
��-4.645
-4.622
Compare-4.614
Tokening
Token ID278
Feature activation+1.97e+00

Pos logit contributions
Tam+0.371
paralleled+0.363
lear+0.356
+0.355
Compare+0.354
Neg logit contributions
 Mir-0.247
 Citation-0.247
hots-0.238
exec-0.223
camp-0.220
Token out
Token ID503
Feature activation-8.10e-01

Pos logit contributions
Tam+1.738
paralleled+1.700
lear+1.667
+1.661
Compare+1.658
Neg logit contributions
 Citation-1.087
 Mir-1.082
hots-1.044
exec-0.979
camp-0.961
Token dozens
Token ID9264
Feature activation-7.63e-02

Pos logit contributions
 Mir+0.540
 Citation+0.533
hots+0.516
exec+0.487
camp+0.484
Neg logit contributions
Tam-0.633
paralleled-0.620
-0.604
lear-0.603
Compare-0.603
Token of
Token ID286
Feature activation+3.32e-01

Pos logit contributions
 Mir+0.050
 Citation+0.050
hots+0.048
exec+0.046
camp+0.045
Neg logit contributions
Tam-0.060
paralleled-0.058
lear-0.057
-0.057
Compare-0.057
Token pur
Token ID1308
Feature activation+1.71e+00

Pos logit contributions
 Mir+1.824
 Citation+1.772
hots+1.715
exec+1.614
camp+1.609
Neg logit contributions
Tam-1.985
paralleled-1.944
Compare-1.901
lear-1.891
-1.886
Tokenak
Token ID461
Feature activation+4.93e-01

Pos logit contributions
Tam+1.270
paralleled+1.242
lear+1.212
+1.203
Compare+1.199
Neg logit contributions
 Citation-1.200
 Mir-1.198
hots-1.147
exec-1.090
ancial-1.087
Tokenushi
Token ID17731
Feature activation-8.84e-01

Pos logit contributions
Tam+0.469
paralleled+0.460
lear+0.451
+0.450
Compare+0.450
Neg logit contributions
 Mir-0.241
 Citation-0.240
hots-0.231
exec-0.214
camp-0.210
Token wrote
Token ID2630
Feature activation-5.05e+00

Pos logit contributions
 Mir+0.296
 Citation+0.296
hots+0.275
exec+0.245
camp+0.240
Neg logit contributions
Tam-0.977
paralleled-0.962
lear-0.947
-0.943
Compare-0.942
Token:
Token ID25
Feature activation-3.25e+00

Pos logit contributions
 Mir+3.653
 Citation+3.636
hots+3.472
exec+3.345
camp+3.344
Neg logit contributions
��-3.788
 millenn-3.767
Tam-3.587
paralleled-3.566
lear-3.458
Token
Token ID198
Feature activation-5.29e+00

Pos logit contributions
 Mir+1.814
 Citation+1.794
hots+1.704
exec+1.606
camp+1.590
Neg logit contributions
��-2.861
Tam-2.859
paralleled-2.846
-2.779
lear-2.775
Token\n
Token ID198
Feature activation-5.16e+00

Pos logit contributions
 Mir+2.784
 Citation+2.703
hots+2.543
camp+2.407
exec+2.406
Neg logit contributions
 millenn-5.014
��-4.990
paralleled-4.791
Tam-4.753
-4.705
TokenHappy
Token ID25082
Feature activation-1.56e+00

Pos logit contributions
 Mir+3.329
hots+3.306
 Citation+3.290
exec+3.120
camp+3.107
Neg logit contributions
��-3.905
paralleled-3.877
Tam-3.870
-3.793
 Lenovo-3.739
Token it
Token ID340
Feature activation+2.74e-02

Pos logit contributions
 Mir+0.917
 Citation+0.908
hots+0.866
exec+0.819
camp+0.809
Neg logit contributions
Tam-1.336
paralleled-1.317
��-1.295
lear-1.289
-1.281
Token worked
Token ID3111
Feature activation-7.60e-01

Pos logit contributions
Tam+0.024
paralleled+0.024
lear+0.023
+0.023
Compare+0.023
Neg logit contributions
 Mir-0.015
 Citation-0.015
hots-0.015
exec-0.014
camp-0.014
Token out
Token ID503
Feature activation-1.21e+00

Pos logit contributions
 Mir+0.382
 Citation+0.369
hots+0.356
exec+0.329
camp+0.327
Neg logit contributions
Tam-0.719
paralleled-0.711
��-0.702
-0.698
lear-0.695
Tokenho
Token ID8873
Feature activation+1.28e+00

Pos logit contributions
Tam+0.852
paralleled+0.836
lear+0.817
+0.816
Compare+0.816
Neg logit contributions
 Mir-0.550
 Citation-0.548
hots-0.529
exec-0.494
camp-0.489
Tokenarse
Token ID17208
Feature activation-1.21e+00

Pos logit contributions
Tam+0.873
paralleled+0.849
lear+0.827
Compare+0.824
+0.822
Neg logit contributions
 Mir-0.972
 Citation-0.971
hots-0.944
exec-0.898
camp-0.892
Token wrote
Token ID2630
Feature activation-4.97e+00

Pos logit contributions
 Mir+0.407
 Citation+0.405
hots+0.381
exec+0.337
camp+0.330
Neg logit contributions
Tam-1.335
paralleled-1.317
-1.291
lear-1.291
Compare-1.291
Token:
Token ID25
Feature activation-3.37e+00

Pos logit contributions
 Mir+3.577
 Citation+3.498
hots+3.374
camp+3.287
exec+3.283
Neg logit contributions
 millenn-3.918
��-3.794
paralleled-3.512
Tam-3.505
lear-3.419
Token\n
Token ID198
Feature activation-5.51e+00

Pos logit contributions
 Mir+1.874
 Citation+1.849
hots+1.765
exec+1.669
camp+1.658
Neg logit contributions
��-2.992
Tam-2.962
paralleled-2.942
 millenn-2.923
-2.886
Token\n
Token ID198
Feature activation-5.29e+00

Pos logit contributions
 Mir+2.825
 Citation+2.728
hots+2.563
camp+2.453
exec+2.436
Neg logit contributions
 millenn-5.404
��-5.316
paralleled-5.037
Tam-4.996
-4.981
TokenShow
Token ID15307
Feature activation+6.79e-01

Pos logit contributions
hots+2.613
 Mir+2.572
 Citation+2.559
exec+2.411
camp+2.388
Neg logit contributions
��-4.799
paralleled-4.763
Tam-4.690
-4.690
 Lenovo-4.657
Token nested
Token ID28376
Feature activation-1.44e+00

Pos logit contributions
Tam+0.624
paralleled+0.613
+0.597
lear+0.595
Compare+0.592
Neg logit contributions
 Citation-0.344
 Mir-0.341
hots-0.334
exec-0.303
ancial-0.303
Token quote
Token ID9577
Feature activation-5.99e+00

Pos logit contributions
 Mir+0.704
 Citation+0.702
hots+0.672
exec+0.619
camp+0.612
Neg logit contributions
Tam-1.372
paralleled-1.349
lear-1.322
Compare-1.320
-1.320
Token +
Token ID1343
Feature activation-2.40e+00

Pos logit contributions
 Citation+3.239
 Mir+3.238
hots+3.077
exec+2.877
 Lar+2.843
Neg logit contributions
Tam-5.502
paralleled-5.315
��-5.291
lear-5.191
-5.188
Token\n
Token ID198
Feature activation-2.57e+00

Pos logit contributions
 Mir+1.359
 Citation+1.350
hots+1.252
exec+1.184
camp+1.179
Neg logit contributions
Tam-2.100
��-2.099
paralleled-2.092
lear-2.051
 millenn-2.035
TokenSimon
Token ID35475
Feature activation-7.85e-01

Pos logit contributions
Tam+0.550
paralleled+0.537
lear+0.524
Compare+0.524
+0.523
Neg logit contributions
 Mir-0.451
 Citation-0.451
hots-0.435
exec-0.410
camp-0.406
Token King
Token ID2677
Feature activation+6.56e-01

Pos logit contributions
 Mir+0.525
 Citation+0.521
hots+0.497
exec+0.471
camp+0.468
Neg logit contributions
Tam-0.611
paralleled-0.602
lear-0.586
-0.584
Compare-0.583
Token,
Token ID11
Feature activation+6.15e-01

Pos logit contributions
Tam+0.503
paralleled+0.492
lear+0.480
+0.478
Compare+0.478
Neg logit contributions
 Mir-0.442
 Citation-0.441
hots-0.427
exec-0.404
camp-0.400
Token Albion
Token ID47995
Feature activation-2.36e-01

Pos logit contributions
Tam+0.529
paralleled+0.519
lear+0.507
+0.506
Compare+0.505
Neg logit contributions
 Citation-0.355
 Mir-0.355
hots-0.342
exec-0.320
camp-0.317
Token's
Token ID338
Feature activation-1.74e+00

Pos logit contributions
 Mir+0.149
 Citation+0.148
hots+0.143
exec+0.135
camp+0.134
Neg logit contributions
Tam-0.192
paralleled-0.189
lear-0.184
Compare-0.183
-0.183
Token Chief
Token ID5953
Feature activation-5.23e+00

Pos logit contributions
 Mir+1.062
 Citation+1.049
hots+1.005
exec+0.956
camp+0.935
Neg logit contributions
Tam-1.458
paralleled-1.428
lear-1.401
��-1.393
Compare-1.391
Token Commercial
Token ID22724
Feature activation-1.50e+00

Pos logit contributions
 Mir+2.863
 Citation+2.836
hots+2.634
exec+2.586
ancial+2.458
Neg logit contributions
Tam-4.674
paralleled-4.586
��-4.518
lear-4.513
Compare-4.408
Token Officer
Token ID10391
Feature activation-4.05e+00

Pos logit contributions
 Mir+0.996
 Citation+0.992
hots+0.958
exec+0.909
camp+0.895
Neg logit contributions
Tam-1.161
paralleled-1.138
lear-1.109
-1.104
Compare-1.104
Token,
Token ID11
Feature activation-4.59e-01

Pos logit contributions
 Mir+2.569
 Citation+2.520
hots+2.432
exec+2.288
camp+2.254
Neg logit contributions
Tam-3.276
paralleled-3.203
Compare-3.134
lear-3.132
-3.131
Token said
Token ID531
Feature activation-1.85e-01

Pos logit contributions
 Mir+0.285
 Citation+0.284
hots+0.273
exec+0.257
camp+0.254
Neg logit contributions
Tam-0.377
paralleled-0.369
lear-0.361
Compare-0.360
-0.360
Token:
Token ID25
Feature activation-1.62e+00

Pos logit contributions
 Mir+0.151
 Citation+0.151
hots+0.147
exec+0.140
camp+0.139
Neg logit contributions
Tam-0.116
paralleled-0.113
lear-0.109
Compare-0.109
-0.109
Tokenak
Token ID461
Feature activation+4.93e-01

Pos logit contributions
Tam+1.270
paralleled+1.242
lear+1.212
+1.203
Compare+1.199
Neg logit contributions
 Citation-1.200
 Mir-1.198
hots-1.147
exec-1.090
ancial-1.087
Tokenushi
Token ID17731
Feature activation-8.84e-01

Pos logit contributions
Tam+0.469
paralleled+0.460
lear+0.451
+0.450
Compare+0.450
Neg logit contributions
 Mir-0.241
 Citation-0.240
hots-0.231
exec-0.214
camp-0.210
Token wrote
Token ID2630
Feature activation-5.05e+00

Pos logit contributions
 Mir+0.296
 Citation+0.296
hots+0.275
exec+0.245
camp+0.240
Neg logit contributions
Tam-0.977
paralleled-0.962
lear-0.947
-0.943
Compare-0.942
Token:
Token ID25
Feature activation-3.25e+00

Pos logit contributions
 Mir+3.653
 Citation+3.636
hots+3.472
exec+3.345
camp+3.344
Neg logit contributions
��-3.788
 millenn-3.767
Tam-3.587
paralleled-3.566
lear-3.458
Token\n
Token ID198
Feature activation-5.29e+00

Pos logit contributions
 Mir+1.814
 Citation+1.794
hots+1.704
exec+1.606
camp+1.590
Neg logit contributions
��-2.861
Tam-2.859
paralleled-2.846
-2.779
lear-2.775
Token\n
Token ID198
Feature activation-5.16e+00

Pos logit contributions
 Mir+2.784
 Citation+2.703
hots+2.543
camp+2.407
exec+2.406
Neg logit contributions
 millenn-5.014
��-4.990
paralleled-4.791
Tam-4.753
-4.705
TokenHappy
Token ID25082
Feature activation-1.56e+00

Pos logit contributions
 Mir+3.329
hots+3.306
 Citation+3.290
exec+3.120
camp+3.107
Neg logit contributions
��-3.905
paralleled-3.877
Tam-3.870
-3.793
 Lenovo-3.739
Token it
Token ID340
Feature activation+2.74e-02

Pos logit contributions
 Mir+0.917
 Citation+0.908
hots+0.866
exec+0.819
camp+0.809
Neg logit contributions
Tam-1.336
paralleled-1.317
��-1.295
lear-1.289
-1.281
Token worked
Token ID3111
Feature activation-7.60e-01

Pos logit contributions
Tam+0.024
paralleled+0.024
lear+0.023
+0.023
Compare+0.023
Neg logit contributions
 Mir-0.015
 Citation-0.015
hots-0.015
exec-0.014
camp-0.014
Token out
Token ID503
Feature activation-1.21e+00

Pos logit contributions
 Mir+0.382
 Citation+0.369
hots+0.356
exec+0.329
camp+0.327
Neg logit contributions
Tam-0.719
paralleled-0.711
��-0.702
-0.698
lear-0.695
Token.
Token ID13
Feature activation-5.56e-01

Pos logit contributions
 Mir+0.875
 Citation+0.865
hots+0.839
exec+0.795
camp+0.793
Neg logit contributions
Tam-0.872
paralleled-0.861
��-0.848
lear-0.837
-0.836
Tokenarg
Token ID853
Feature activation-3.80e-01

Pos logit contributions
Tam+0.102
paralleled+0.100
lear+0.097
+0.097
Compare+0.097
Neg logit contributions
 Mir-0.076
 Citation-0.076
hots-0.073
exec-0.069
camp-0.068
Tokenuably
Token ID14632
Feature activation+9.34e-01

Pos logit contributions
 Mir+0.371
 Citation+0.368
hots+0.362
exec+0.352
camp+0.347
Neg logit contributions
Tam-0.176
paralleled-0.168
��-0.162
-0.161
Compare-0.161
Token the
Token ID262
Feature activation-2.92e-01

Pos logit contributions
Tam+0.764
paralleled+0.749
lear+0.731
+0.729
Compare+0.728
Neg logit contributions
 Citation-0.580
 Mir-0.580
hots-0.559
exec-0.528
camp-0.521
Token greatest
Token ID6000
Feature activation-5.02e-01

Pos logit contributions
 Mir+0.180
 Citation+0.179
hots+0.173
exec+0.162
camp+0.160
Neg logit contributions
Tam-0.242
paralleled-0.237
-0.232
lear-0.232
��-0.232
Token game
Token ID983
Feature activation-1.62e+00

Pos logit contributions
 Mir+0.332
 Citation+0.330
hots+0.317
exec+0.301
camp+0.297
Neg logit contributions
Tam-0.392
paralleled-0.384
lear-0.376
��-0.375
Compare-0.374
Token ever
Token ID1683
Feature activation-5.15e+00

Pos logit contributions
 Mir+1.013
 Citation+1.002
hots+0.964
exec+0.908
camp+0.903
Neg logit contributions
Tam-1.327
paralleled-1.298
lear-1.274
��-1.270
Compare-1.269
Token played
Token ID2826
Feature activation-2.68e-01

Pos logit contributions
 Mir+2.686
 Citation+2.579
hots+2.469
exec+2.334
 barric+2.264
Neg logit contributions
��-4.691
Tam-4.680
paralleled-4.542
lear-4.527
-4.527
Token in
Token ID287
Feature activation-3.99e-01

Pos logit contributions
 Mir+0.177
 Citation+0.176
hots+0.169
exec+0.160
camp+0.159
Neg logit contributions
Tam-0.210
paralleled-0.206
lear-0.202
��-0.201
-0.201
Token Cleveland
Token ID10306
Feature activation-1.09e+00

Pos logit contributions
 Mir+0.268
 Citation+0.264
hots+0.255
exec+0.241
camp+0.240
Neg logit contributions
Tam-0.308
paralleled-0.301
lear-0.296
Compare-0.293
-0.293
Token,
Token ID11
Feature activation+4.74e-01

Pos logit contributions
 Mir+0.795
 Citation+0.789
hots+0.760
camp+0.721
exec+0.720
Neg logit contributions
Tam-0.775
paralleled-0.756
lear-0.742
��-0.736
-0.732
Token Ohio
Token ID6835
Feature activation-1.29e+00

Pos logit contributions
Tam+0.386
paralleled+0.378
lear+0.370
+0.368
Compare+0.368
Neg logit contributions
 Mir-0.297
 Citation-0.296
hots-0.286
exec-0.269
camp-0.266
Token am
Token ID716
Feature activation-5.19e-01

Pos logit contributions
Tam+0.026
paralleled+0.026
lear+0.025
+0.025
Compare+0.025
Neg logit contributions
 Mir-0.021
 Citation-0.021
hots-0.020
exec-0.019
camp-0.019
Tokenis
Token ID271
Feature activation-1.31e-01

Pos logit contributions
 Citation+0.299
 Mir+0.298
hots+0.286
exec+0.268
camp+0.265
Neg logit contributions
Tam-0.448
paralleled-0.440
lear-0.430
-0.429
Compare-0.428
Token.
Token ID13
Feature activation-3.24e-01

Pos logit contributions
 Mir+0.084
 Citation+0.083
hots+0.081
exec+0.076
camp+0.075
Neg logit contributions
Tam-0.105
paralleled-0.103
lear-0.101
-0.101
Compare-0.100
Token Certain
Token ID16272
Feature activation-9.17e-01

Pos logit contributions
 Mir+0.211
 Citation+0.210
hots+0.200
exec+0.190
camp+0.189
Neg logit contributions
Tam-0.256
paralleled-0.253
lear-0.247
-0.246
��-0.245
Token d
Token ID288
Feature activation-9.33e-01

Pos logit contributions
 Mir+0.599
 Citation+0.595
hots+0.576
exec+0.545
camp+0.540
Neg logit contributions
Tam-0.723
paralleled-0.710
-0.691
lear-0.689
Compare-0.688
Token'
Token ID6
Feature activation-5.14e+00

Pos logit contributions
 Mir+0.725
 Citation+0.718
hots+0.703
exec+0.670
camp+0.664
Neg logit contributions
Tam-0.622
paralleled-0.606
-0.590
��-0.587
lear-0.587
Tokenent
Token ID298
Feature activation-1.12e+00

Pos logit contributions
 Mir+3.606
hots+3.582
 Citation+3.488
exec+3.476
camp+3.365
Neg logit contributions
Tam-3.587
��-3.568
paralleled-3.555
-3.498
lear-3.420
Tokenre
Token ID260
Feature activation-3.20e-02

Pos logit contributions
 Mir+0.722
 Citation+0.718
hots+0.698
exec+0.662
camp+0.653
Neg logit contributions
Tam-0.892
paralleled-0.871
-0.853
Compare-0.851
��-0.849
Token e
Token ID304
Feature activation-3.51e-01

Pos logit contributions
 Mir+0.021
 Citation+0.021
hots+0.020
exec+0.019
camp+0.019
Neg logit contributions
Tam-0.026
paralleled-0.025
lear-0.024
-0.024
Compare-0.024
Tokenux
Token ID2821
Feature activation-7.37e-01

Pos logit contributions
 Mir+0.237
 Citation+0.236
hots+0.229
exec+0.218
camp+0.216
Neg logit contributions
Tam-0.269
paralleled-0.263
-0.257
Compare-0.256
lear-0.256
Token s
Token ID264
Feature activation-5.46e-01

Pos logit contributions
 Mir+0.476
 Citation+0.470
hots+0.457
exec+0.431
camp+0.429
Neg logit contributions
Tam-0.588
paralleled-0.577
lear-0.561
-0.560
Compare-0.559
TokenL
Token ID43
Feature activation+2.71e-02

Pos logit contributions
 Mir+0.983
 Citation+0.982
hots+0.953
exec+0.908
camp+0.896
Neg logit contributions
Tam-1.191
paralleled-1.186
lear-1.158
��-1.157
-1.156
TokenANS
Token ID15037
Feature activation+1.59e+00

Pos logit contributions
Tam+0.018
paralleled+0.018
+0.017
lear+0.017
��+0.017
Neg logit contributions
 Mir-0.021
 Citation-0.021
hots-0.020
exec-0.019
camp-0.019
TokenING
Token ID2751
Feature activation+1.79e+00

Pos logit contributions
Tam+1.276
paralleled+1.250
lear+1.220
+1.217
Compare+1.216
Neg logit contributions
 Mir-1.008
 Citation-1.006
hots-0.971
exec-0.916
camp-0.906
Token,
Token ID11
Feature activation-7.18e-01

Pos logit contributions
Tam+1.304
paralleled+1.272
lear+1.239
Compare+1.235
+1.235
Neg logit contributions
 Mir-1.270
 Citation-1.269
hots-1.231
exec-1.168
camp-1.158
Token MI
Token ID15789
Feature activation-1.52e+00

Pos logit contributions
 Mir+0.332
 Citation+0.321
hots+0.302
exec+0.280
camp+0.271
Neg logit contributions
Tam-0.708
paralleled-0.700
-0.693
lear-0.688
��-0.683
Token --
Token ID1377
Feature activation-5.13e+00

Pos logit contributions
 Mir+1.037
 Citation+1.020
hots+0.985
exec+0.945
camp+0.922
Neg logit contributions
Tam-1.144
paralleled-1.139
-1.109
lear-1.106
��-1.103
Token The
Token ID383
Feature activation-8.85e-01

Pos logit contributions
 Mir+3.408
 Citation+3.334
hots+3.095
 Lar+2.984
exec+2.961
Neg logit contributions
paralleled-3.964
��-3.931
-3.902
Tam-3.902
lear-3.869
Token Michigan
Token ID7055
Feature activation-6.57e-02

Pos logit contributions
 Mir+0.578
 Citation+0.575
hots+0.552
exec+0.523
camp+0.514
Neg logit contributions
Tam-0.697
paralleled-0.685
-0.671
lear-0.670
Compare-0.667
Token House
Token ID2097
Feature activation-4.65e-01

Pos logit contributions
 Mir+0.042
 Citation+0.041
hots+0.040
exec+0.038
camp+0.037
Neg logit contributions
Tam-0.053
paralleled-0.052
lear-0.051
-0.051
Compare-0.051
Token is
Token ID318
Feature activation+7.04e-03

Pos logit contributions
 Mir+0.280
 Citation+0.279
hots+0.268
exec+0.253
camp+0.248
Neg logit contributions
Tam-0.391
paralleled-0.384
-0.374
lear-0.373
Compare-0.372
Token looking
Token ID2045
Feature activation-7.40e-01

Pos logit contributions
Tam+0.006
paralleled+0.006
lear+0.006
+0.006
��+0.006
Neg logit contributions
 Mir-0.004
 Citation-0.004
hots-0.004
exec-0.004
camp-0.004
Token.
Token ID13
Feature activation+2.87e-01

Pos logit contributions
 Mir+0.404
 Citation+0.401
hots+0.387
exec+0.366
camp+0.363
Neg logit contributions
Tam-0.479
paralleled-0.470
-0.459
lear-0.458
��-0.457
Token\n
Token ID198
Feature activation-8.38e-01

Pos logit contributions
Tam+0.237
paralleled+0.232
lear+0.227
+0.226
Compare+0.226
Neg logit contributions
 Mir-0.177
 Citation-0.176
hots-0.170
exec-0.160
camp-0.158
Token\n
Token ID198
Feature activation-6.91e-01

Pos logit contributions
 Mir+0.491
 Citation+0.485
hots+0.465
exec+0.440
camp+0.436
Neg logit contributions
Tam-0.715
paralleled-0.708
��-0.701
lear-0.692
-0.692
TokenEr
Token ID9139
Feature activation+1.82e+00

Pos logit contributions
 Mir+0.445
 Citation+0.442
hots+0.427
exec+0.407
camp+0.402
Neg logit contributions
Tam-0.542
paralleled-0.541
lear-0.526
-0.525
��-0.524
Tokenn
Token ID77
Feature activation-1.94e+00

Pos logit contributions
Tam+1.274
paralleled+1.244
lear+1.210
+1.206
Compare+1.205
Neg logit contributions
 Mir-1.339
 Citation-1.338
hots-1.298
exec-1.235
camp-1.223
Tokenest
Token ID395
Feature activation-5.12e+00

Pos logit contributions
 Mir+1.096
 Citation+1.074
hots+1.038
ancial+0.964
exec+0.963
Neg logit contributions
Tam-1.703
paralleled-1.677
��-1.655
-1.648
Compare-1.646
Tokeno
Token ID78
Feature activation-1.25e+00

Pos logit contributions
 Mir+3.545
 Citation+3.422
hots+3.217
 Lar+3.157
camp+3.110
Neg logit contributions
paralleled-3.828
Tam-3.825
��-3.729
Compare-3.692
-3.690
Token Gal
Token ID5027
Feature activation-4.63e-01

Pos logit contributions
 Mir+0.774
 Citation+0.758
hots+0.721
exec+0.682
camp+0.681
Neg logit contributions
Tam-1.039
paralleled-1.029
lear-1.000
��-0.998
-0.997
Tokenar
Token ID283
Feature activation-2.40e-01

Pos logit contributions
 Mir+0.304
 Citation+0.301
hots+0.292
exec+0.278
camp+0.276
Neg logit contributions
Tam-0.363
paralleled-0.356
Compare-0.348
-0.347
��-0.346
Tokenza
Token ID4496
Feature activation-1.22e+00

Pos logit contributions
 Mir+0.203
 Citation+0.202
hots+0.196
exec+0.190
camp+0.189
Neg logit contributions
Tam-0.143
paralleled-0.139
Compare-0.135
-0.135
��-0.135
Token,
Token ID11
Feature activation+6.19e-01

Pos logit contributions
 Mir+0.798
 Citation+0.787
hots+0.750
exec+0.714
camp+0.707
Neg logit contributions
Tam-0.972
paralleled-0.960
-0.932
Compare-0.929
lear-0.928
TokenUSS
Token ID32835
Feature activation+4.68e-01

Pos logit contributions
Tam+0.622
paralleled+0.610
lear+0.594
+0.590
Compare+0.588
Neg logit contributions
 Mir-0.602
 Citation-0.601
hots-0.580
exec-0.552
camp-0.547
TokenELS
Token ID37142
Feature activation-6.41e-01

Pos logit contributions
Tam+0.343
paralleled+0.337
lear+0.328
+0.326
Compare+0.324
Neg logit contributions
 Mir-0.335
 Citation-0.332
hots-0.318
exec-0.304
camp-0.303
Token (
Token ID357
Feature activation-2.10e+00

Pos logit contributions
 Mir+0.469
 Citation+0.467
hots+0.450
exec+0.428
camp+0.425
Neg logit contributions
Tam-0.459
paralleled-0.447
lear-0.440
��-0.437
-0.437
TokenReuters
Token ID12637
Feature activation-4.02e+00

Pos logit contributions
 Mir+1.267
 Citation+1.266
hots+1.202
exec+1.148
camp+1.125
Neg logit contributions
Tam-1.735
paralleled-1.729
��-1.690
lear-1.686
-1.676
Token)
Token ID8
Feature activation-2.27e+00

Pos logit contributions
 Citation+2.236
 Mir+2.234
hots+2.008
camp+1.929
exec+1.903
Neg logit contributions
paralleled-3.614
��-3.590
Tam-3.555
lear-3.522
 millenn-3.512
Token -
Token ID532
Feature activation-5.11e+00

Pos logit contributions
 Mir+1.509
 Citation+1.500
hots+1.396
camp+1.343
 Lar+1.320
Neg logit contributions
Tam-1.791
paralleled-1.786
��-1.784
lear-1.746
 millenn-1.743
Token The
Token ID383
Feature activation+1.19e+00

Pos logit contributions
 Mir+3.794
 Citation+3.722
hots+3.463
 Lar+3.363
 JACK+3.291
Neg logit contributions
��-3.635
Tam-3.619
paralleled-3.586
-3.523
lear-3.515
Token world
Token ID995
Feature activation+9.20e-01

Pos logit contributions
Tam+0.728
paralleled+0.708
lear+0.687
+0.684
Compare+0.684
Neg logit contributions
 Mir-0.987
 Citation-0.984
hots-0.957
exec-0.917
camp-0.906
Token
Token ID447
Feature activation-3.65e-01

Pos logit contributions
Tam+0.724
paralleled+0.708
lear+0.691
+0.690
Compare+0.689
Neg logit contributions
 Mir-0.603
 Citation-0.601
hots-0.581
exec-0.549
camp-0.543
Token
Token ID247
Feature activation-5.58e-01

Pos logit contributions
 Citation+0.274
 Mir+0.272
hots+0.261
exec+0.249
camp+0.246
Neg logit contributions
Tam-0.256
paralleled-0.249
lear-0.243
Compare-0.242
-0.240
Tokens
Token ID82
Feature activation+1.26e+00

Pos logit contributions
 Mir+0.384
 Citation+0.384
hots+0.375
exec+0.356
camp+0.351
Neg logit contributions
Tam-0.417
paralleled-0.406
��-0.400
Compare-0.398
-0.396
Token it
Token ID340
Feature activation-7.41e-01

Pos logit contributions
 Mir+1.395
 Citation+1.375
hots+1.317
exec+1.251
camp+1.243
Neg logit contributions
paralleled-1.756
Tam-1.746
��-1.690
Compare-1.689
-1.680
Token just
Token ID655
Feature activation-1.03e+00

Pos logit contributions
 Mir+0.485
 Citation+0.477
hots+0.461
exec+0.432
camp+0.430
Neg logit contributions
Tam-0.588
paralleled-0.579
��-0.568
-0.565
lear-0.564
Token me
Token ID502
Feature activation-1.26e+00

Pos logit contributions
 Mir+0.678
 Citation+0.664
hots+0.644
exec+0.608
camp+0.602
Neg logit contributions
Tam-0.810
paralleled-0.802
��-0.779
lear-0.779
-0.779
Token,
Token ID11
Feature activation+7.96e-01

Pos logit contributions
 Mir+0.827
 Citation+0.812
hots+0.793
exec+0.749
camp+0.741
Neg logit contributions
Tam-0.989
paralleled-0.973
-0.946
lear-0.946
Compare-0.941
Token or
Token ID393
Feature activation+1.51e+00

Pos logit contributions
Tam+0.607
paralleled+0.594
lear+0.579
+0.578
Compare+0.577
Neg logit contributions
 Mir-0.540
 Citation-0.538
hots-0.521
exec-0.494
camp-0.488
Token does
Token ID857
Feature activation-5.07e+00

Pos logit contributions
Tam+1.255
paralleled+1.221
Compare+1.199
lear+1.196
+1.189
Neg logit contributions
 Citation-0.912
 Mir-0.901
hots-0.876
exec-0.828
ancial-0.818
Token anyone
Token ID2687
Feature activation-5.90e-01

Pos logit contributions
 Mir+3.098
 Citation+2.982
hots+2.833
 Lar+2.680
exec+2.678
Neg logit contributions
paralleled-4.222
Tam-4.149
��-4.053
lear-4.039
Compare-4.038
Token else
Token ID2073
Feature activation-2.72e+00

Pos logit contributions
 Mir+0.364
 Citation+0.359
hots+0.348
exec+0.325
camp+0.323
Neg logit contributions
Tam-0.490
paralleled-0.484
-0.473
��-0.473
lear-0.467
Token out
Token ID503
Feature activation-1.41e+00

Pos logit contributions
 Mir+1.673
 Citation+1.633
hots+1.579
camp+1.474
exec+1.466
Neg logit contributions
Tam-2.261
paralleled-2.249
��-2.231
-2.211
lear-2.145
Token there
Token ID612
Feature activation+1.10e+00

Pos logit contributions
 Mir+0.907
 Citation+0.903
hots+0.865
camp+0.829
exec+0.829
Neg logit contributions
Tam-1.125
paralleled-1.111
-1.089
��-1.085
lear-1.075
Token think
Token ID892
Feature activation-1.72e+00

Pos logit contributions
Tam+0.911
paralleled+0.894
+0.872
lear+0.871
Compare+0.869
Neg logit contributions
 Mir-0.676
 Citation-0.674
hots-0.650
exec-0.611
camp-0.605
Token:
Token ID25
Feature activation-3.67e+00

Pos logit contributions
 Mir+0.690
 Citation+0.685
hots+0.664
camp+0.639
exec+0.638
Neg logit contributions
Tam-0.537
paralleled-0.527
��-0.516
lear-0.509
-0.508
Token50
Token ID1120
Feature activation-2.62e+00

Pos logit contributions
 Mir+2.422
 Citation+2.415
hots+2.305
camp+2.249
exec+2.235
Neg logit contributions
��-2.832
 millenn-2.810
paralleled-2.805
Tam-2.761
lear-2.715
Token pur
Token ID1308
Feature activation+1.71e+00

Pos logit contributions
 Mir+1.824
 Citation+1.772
hots+1.715
exec+1.614
camp+1.609
Neg logit contributions
Tam-1.985
paralleled-1.944
Compare-1.901
lear-1.891
-1.886
Tokenak
Token ID461
Feature activation+4.93e-01

Pos logit contributions
Tam+1.270
paralleled+1.242
lear+1.212
+1.203
Compare+1.199
Neg logit contributions
 Citation-1.200
 Mir-1.198
hots-1.147
exec-1.090
ancial-1.087
Tokenushi
Token ID17731
Feature activation-8.84e-01

Pos logit contributions
Tam+0.469
paralleled+0.460
lear+0.451
+0.450
Compare+0.450
Neg logit contributions
 Mir-0.241
 Citation-0.240
hots-0.231
exec-0.214
camp-0.210
Token wrote
Token ID2630
Feature activation-5.05e+00

Pos logit contributions
 Mir+0.296
 Citation+0.296
hots+0.275
exec+0.245
camp+0.240
Neg logit contributions
Tam-0.977
paralleled-0.962
lear-0.947
-0.943
Compare-0.942
Token:
Token ID25
Feature activation-3.25e+00

Pos logit contributions
 Mir+3.653
 Citation+3.636
hots+3.472
exec+3.345
camp+3.344
Neg logit contributions
��-3.788
 millenn-3.767
Tam-3.587
paralleled-3.566
lear-3.458
Token\n
Token ID198
Feature activation-5.29e+00

Pos logit contributions
 Mir+1.814
 Citation+1.794
hots+1.704
exec+1.606
camp+1.590
Neg logit contributions
��-2.861
Tam-2.859
paralleled-2.846
-2.779
lear-2.775
Token\n
Token ID198
Feature activation-5.16e+00

Pos logit contributions
 Mir+2.784
 Citation+2.703
hots+2.543
camp+2.407
exec+2.406
Neg logit contributions
 millenn-5.014
��-4.990
paralleled-4.791
Tam-4.753
-4.705
TokenHappy
Token ID25082
Feature activation-1.56e+00

Pos logit contributions
 Mir+3.329
hots+3.306
 Citation+3.290
exec+3.120
camp+3.107
Neg logit contributions
��-3.905
paralleled-3.877
Tam-3.870
-3.793
 Lenovo-3.739
Token it
Token ID340
Feature activation+2.74e-02

Pos logit contributions
 Mir+0.917
 Citation+0.908
hots+0.866
exec+0.819
camp+0.809
Neg logit contributions
Tam-1.336
paralleled-1.317
��-1.295
lear-1.289
-1.281
TokenAD
Token ID2885
Feature activation+9.93e-01

Pos logit contributions
Tam+0.766
paralleled+0.750
lear+0.728
+0.724
Compare+0.723
Neg logit contributions
 Mir-0.982
 Citation-0.976
hots-0.944
exec-0.906
camp-0.901
TokenENA
Token ID45510
Feature activation-3.22e-01

Pos logit contributions
Tam+0.800
paralleled+0.782
lear+0.764
��+0.759
+0.758
Neg logit contributions
 Mir-0.640
 Citation-0.634
hots-0.602
exec-0.577
camp-0.569
Token,
Token ID11
Feature activation-9.95e-01

Pos logit contributions
 Mir+0.217
 Citation+0.214
hots+0.205
exec+0.196
camp+0.192
Neg logit contributions
Tam-0.251
paralleled-0.247
lear-0.243
-0.240
��-0.238
Token Calif
Token ID3192
Feature activation-2.15e+00

Pos logit contributions
 Mir+0.800
 Citation+0.771
hots+0.744
exec+0.722
camp+0.708
Neg logit contributions
Tam-0.654
paralleled-0.652
lear-0.638
��-0.632
-0.625
Token.
Token ID13
Feature activation-1.69e+00

Pos logit contributions
 Mir+1.523
 Citation+1.498
hots+1.460
exec+1.374
camp+1.365
Neg logit contributions
Tam-1.577
paralleled-1.564
��-1.524
-1.521
lear-1.511
Token --
Token ID1377
Feature activation-5.04e+00

Pos logit contributions
 Mir+1.222
 Citation+1.196
hots+1.145
exec+1.095
camp+1.094
Neg logit contributions
Tam-1.227
paralleled-1.218
��-1.196
-1.192
lear-1.191
Token The
Token ID383
Feature activation-3.58e-01

Pos logit contributions
 Mir+3.368
 Citation+3.213
hots+2.984
camp+2.893
 Lar+2.869
Neg logit contributions
paralleled-3.957
��-3.936
Tam-3.931
-3.865
lear-3.861
Token first
Token ID717
Feature activation+1.07e+00

Pos logit contributions
 Mir+0.241
 Citation+0.238
hots+0.228
exec+0.216
camp+0.215
Neg logit contributions
Tam-0.279
paralleled-0.273
lear-0.267
-0.267
Compare-0.266
Token Martian
Token ID32508
Feature activation+1.34e+00

Pos logit contributions
Tam+0.855
paralleled+0.836
lear+0.817
+0.816
Compare+0.814
Neg logit contributions
 Mir-0.694
 Citation-0.688
hots-0.665
exec-0.625
camp-0.619
Token rock
Token ID3881
Feature activation+2.97e-01

Pos logit contributions
Tam+1.073
paralleled+1.050
lear+1.025
+1.024
Compare+1.022
Neg logit contributions
 Mir-0.872
 Citation-0.862
hots-0.836
exec-0.787
camp-0.780
Token NASA
Token ID8884
Feature activation-1.17e+00

Pos logit contributions
Tam+0.245
paralleled+0.240
+0.236
lear+0.235
Compare+0.234
Neg logit contributions
 Mir-0.184
 Citation-0.181
hots-0.177
exec-0.165
camp-0.164
Token.
Token ID13
Feature activation+3.75e-01

Pos logit contributions
Tam+0.088
paralleled+0.086
+0.084
lear+0.084
Compare+0.084
Neg logit contributions
 Mir-0.078
 Citation-0.078
hots-0.075
exec-0.071
camp-0.070
Token<|endoftext|>
Token ID50256
Feature activation-7.67e-01

Pos logit contributions
Tam+0.293
paralleled+0.288
lear+0.281
+0.280
��+0.279
Neg logit contributions
 Mir-0.248
 Citation-0.248
hots-0.237
exec-0.224
camp-0.222
TokenDec
Token ID10707
Feature activation+1.13e-01

Pos logit contributions
 Mir+0.510
 Citation+0.509
hots+0.496
exec+0.472
camp+0.467
Neg logit contributions
Tam-0.583
paralleled-0.579
lear-0.563
-0.563
��-0.562
Token 2
Token ID362
Feature activation-1.35e+00

Pos logit contributions
Tam+0.089
paralleled+0.087
+0.085
Compare+0.085
lear+0.085
Neg logit contributions
 Mir-0.074
 Citation-0.074
hots-0.071
exec-0.067
camp-0.067
Token,
Token ID11
Feature activation-2.60e+00

Pos logit contributions
 Mir+0.913
 Citation+0.908
hots+0.877
exec+0.832
camp+0.824
Neg logit contributions
Tam-1.034
paralleled-1.012
lear-0.988
-0.986
��-0.982
Token 2015
Token ID1853
Feature activation-5.04e+00

Pos logit contributions
 Mir+1.891
 Citation+1.868
hots+1.792
exec+1.713
camp+1.709
Neg logit contributions
Tam-1.860
paralleled-1.824
��-1.816
lear-1.785
-1.765
Token
Token ID784
Feature activation-4.75e+00

Pos logit contributions
 Citation+3.578
 Mir+3.567
hots+3.411
exec+3.257
camp+3.194
Neg logit contributions
paralleled-3.651
Tam-3.639
��-3.520
-3.488
lear-3.484
Token Cycl
Token ID28007
Feature activation+3.15e-01

Pos logit contributions
 Mir+3.263
 Citation+3.192
hots+2.987
 Lar+2.883
exec+2.871
Neg logit contributions
paralleled-3.579
Tam-3.562
��-3.531
lear-3.481
-3.464
Tokenoc
Token ID420
Feature activation+2.21e-01

Pos logit contributions
Tam+0.263
paralleled+0.258
+0.252
��+0.252
lear+0.251
Neg logit contributions
 Mir-0.190
 Citation-0.189
hots-0.183
exec-0.173
camp-0.170
Tokenross
Token ID1214
Feature activation+1.28e+00

Pos logit contributions
Tam+0.111
paralleled+0.107
+0.103
Compare+0.102
��+0.102
Neg logit contributions
 Mir-0.208
 Citation-0.207
hots-0.202
exec-0.195
camp-0.193
Token hurts
Token ID20406
Feature activation-6.42e-01

Pos logit contributions
Tam+0.981
paralleled+0.960
lear+0.935
+0.932
Compare+0.930
Neg logit contributions
 Mir-0.871
 Citation-0.868
hots-0.841
exec-0.796
camp-0.789
Token N
Token ID399
Feature activation-6.20e-02

Pos logit contributions
Tam+0.072
paralleled+0.071
lear+0.069
��+0.069
+0.069
Neg logit contributions
 Mir-0.075
 Citation-0.074
hots-0.071
exec-0.069
camp-0.067
TokenATIONS
Token ID18421
Feature activation-9.42e-01

Pos logit contributions
 Mir+0.045
 Citation+0.045
hots+0.044
exec+0.042
camp+0.041
Neg logit contributions
Tam-0.044
paralleled-0.043
lear-0.042
-0.042
Compare-0.042
Token (
Token ID357
Feature activation-2.18e+00

Pos logit contributions
 Mir+0.674
 Citation+0.669
hots+0.643
exec+0.612
camp+0.605
Neg logit contributions
Tam-0.693
paralleled-0.676
-0.662
lear-0.660
��-0.660
TokenReuters
Token ID12637
Feature activation-3.25e+00

Pos logit contributions
 Citation+1.311
 Mir+1.305
hots+1.228
exec+1.180
camp+1.149
Neg logit contributions
paralleled-1.818
Tam-1.804
��-1.778
lear-1.770
-1.760
Token)
Token ID8
Feature activation-2.13e+00

Pos logit contributions
 Citation+1.835
 Mir+1.833
hots+1.669
camp+1.592
exec+1.588
Neg logit contributions
paralleled-2.876
Tam-2.851
��-2.836
lear-2.798
-2.779
Token -
Token ID532
Feature activation-5.03e+00

Pos logit contributions
 Mir+1.410
 Citation+1.403
hots+1.299
camp+1.252
 Lar+1.230
Neg logit contributions
Tam-1.683
paralleled-1.680
��-1.675
lear-1.641
 millenn-1.638
Token An
Token ID1052
Feature activation-6.66e-01

Pos logit contributions
 Mir+3.680
 Citation+3.625
hots+3.337
 Lar+3.258
 JACK+3.182
Neg logit contributions
��-3.629
Tam-3.586
paralleled-3.576
-3.542
lear-3.528
Token army
Token ID5428
Feature activation-5.95e-01

Pos logit contributions
 Mir+0.491
 Citation+0.483
hots+0.475
exec+0.456
camp+0.441
Neg logit contributions
Tam-0.473
paralleled-0.459
lear-0.453
-0.453
Compare-0.449
Token battalion
Token ID36328
Feature activation+2.52e-04

Pos logit contributions
 Mir+0.382
 Citation+0.381
hots+0.363
exec+0.347
camp+0.344
Neg logit contributions
Tam-0.478
paralleled-0.468
lear-0.457
-0.457
Compare-0.456
Token in
Token ID287
Feature activation+7.88e-02

Pos logit contributions
Tam+0.000
paralleled+0.000
lear+0.000
+0.000
Compare+0.000
Neg logit contributions
 Citation-0.000
 Mir-0.000
hots-0.000
exec-0.000
camp-0.000
Token the
Token ID262
Feature activation+1.58e-01

Pos logit contributions
Tam+0.055
paralleled+0.054
+0.052
lear+0.052
Compare+0.052
Neg logit contributions
 Mir-0.059
 Citation-0.058
hots-0.057
exec-0.054
camp-0.053
Token a
Token ID257
Feature activation+2.13e-01

Pos logit contributions
Tam+0.628
paralleled+0.614
lear+0.599
+0.598
Compare+0.597
Neg logit contributions
 Mir-0.518
 Citation-0.517
hots-0.500
exec-0.472
camp-0.467
Token Cohn
Token ID45005
Feature activation-2.50e+00

Pos logit contributions
Tam+0.183
paralleled+0.179
lear+0.175
+0.175
Compare+0.174
Neg logit contributions
 Mir-0.125
 Citation-0.124
hots-0.119
exec-0.112
camp-0.111
Token prot
Token ID1237
Feature activation-5.37e-01

Pos logit contributions
 Mir+1.677
 Citation+1.656
hots+1.601
exec+1.527
camp+1.510
Neg logit contributions
Tam-1.942
paralleled-1.904
lear-1.844
��-1.834
-1.830
Tokené
Token ID2634
Feature activation-4.27e+00

Pos logit contributions
 Mir+0.273
 Citation+0.270
hots+0.261
exec+0.244
camp+0.238
Neg logit contributions
Tam-0.501
paralleled-0.493
lear-0.482
-0.482
��-0.481
Tokeng
Token ID70
Feature activation-2.14e+00

Pos logit contributions
 Mir+2.903
 Citation+2.826
hots+2.819
exec+2.722
camp+2.667
Neg logit contributions
Tam-3.277
paralleled-3.117
-3.090
 millenn-3.076
Compare-3.064
Tokené
Token ID2634
Feature activation-5.00e+00

Pos logit contributions
 Mir+1.065
 Citation+1.041
hots+0.995
exec+0.939
camp+0.925
Neg logit contributions
Tam-2.039
paralleled-1.997
��-1.967
-1.958
Compare-1.956
Token,
Token ID11
Feature activation-1.09e+00

Pos logit contributions
 Mir+3.110
 Citation+3.024
hots+2.968
exec+2.784
camp+2.757
Neg logit contributions
Tam-4.062
paralleled-3.981
lear-3.890
Compare-3.877
-3.877
Token was
Token ID373
Feature activation+4.46e-01

Pos logit contributions
 Mir+0.678
 Citation+0.666
hots+0.649
exec+0.610
camp+0.604
Neg logit contributions
Tam-0.899
paralleled-0.878
Compare-0.858
��-0.858
-0.857
Token running
Token ID2491
Feature activation-2.89e-02

Pos logit contributions
Tam+0.361
paralleled+0.354
lear+0.346
+0.345
Compare+0.345
Neg logit contributions
 Mir-0.282
 Citation-0.280
hots-0.271
exec-0.256
camp-0.253
Token the
Token ID262
Feature activation+8.27e-01

Pos logit contributions
 Mir+0.017
 Citation+0.017
hots+0.016
exec+0.015
camp+0.015
Neg logit contributions
Tam-0.025
paralleled-0.024
lear-0.024
-0.024
Compare-0.023
Token mortgage
Token ID13682
Feature activation+1.39e+00

Pos logit contributions
Tam+0.638
paralleled+0.624
lear+0.610
+0.607
Compare+0.606
Neg logit contributions
 Mir-0.555
 Citation-0.552
hots-0.534
exec-0.506
camp-0.501
Token (
Token ID357
Feature activation+1.26e+00

Pos logit contributions
Tam+0.343
paralleled+0.337
lear+0.330
+0.329
��+0.325
Neg logit contributions
 Mir-0.302
 Citation-0.302
hots-0.290
exec-0.273
camp-0.273
Tokenalt
Token ID2501
Feature activation+2.05e-01

Pos logit contributions
Tam+0.970
paralleled+0.949
lear+0.925
+0.923
Compare+0.922
Neg logit contributions
 Mir-0.851
 Citation-0.850
hots-0.822
exec-0.778
camp-0.770
Tokenitude
Token ID3984
Feature activation-4.65e-01

Pos logit contributions
Tam+0.180
paralleled+0.177
lear+0.174
+0.173
Compare+0.173
Neg logit contributions
 Mir-0.116
 Citation-0.114
hots-0.110
camp-0.103
exec-0.101
Token 34
Token ID4974
Feature activation+9.87e-02

Pos logit contributions
 Mir+0.317
 Citation+0.315
hots+0.302
camp+0.286
exec+0.284
Neg logit contributions
Tam-0.355
paralleled-0.348
lear-0.342
-0.340
��-0.338
Token00
Token ID405
Feature activation+4.31e-01

Pos logit contributions
Tam+0.079
paralleled+0.078
lear+0.076
+0.076
Compare+0.075
Neg logit contributions
 Mir-0.064
 Citation-0.063
hots-0.061
camp-0.057
exec-0.057
Token m
Token ID285
Feature activation+3.06e+00

Pos logit contributions
Tam+0.333
paralleled+0.327
lear+0.319
+0.318
Compare+0.317
Neg logit contributions
 Mir-0.290
 Citation-0.289
hots-0.278
exec-0.263
camp-0.262
Token;
Token ID26
Feature activation+2.38e+00

Pos logit contributions
Tam+2.410
paralleled+2.354
lear+2.297
Compare+2.292
+2.289
Neg logit contributions
 Citation-1.981
 Mir-1.981
hots-1.911
exec-1.814
camp-1.784
Token 19
Token ID678
Feature activation+5.87e-01

Pos logit contributions
Tam+1.851
paralleled+1.807
Compare+1.760
+1.757
lear+1.756
Neg logit contributions
 Citation-1.566
 Mir-1.561
hots-1.520
exec-1.438
ancial-1.419
Token°
Token ID7200
Feature activation-6.68e-01

Pos logit contributions
Tam+0.480
paralleled+0.473
lear+0.462
+0.461
Compare+0.458
Neg logit contributions
 Citation-0.367
 Mir-0.366
hots-0.351
exec-0.329
camp-0.329
Token30
Token ID1270
Feature activation-4.36e-01

Pos logit contributions
 Citation+0.458
 Mir+0.458
hots+0.440
exec+0.418
camp+0.416
Neg logit contributions
Tam-0.501
paralleled-0.498
lear-0.485
-0.483
��-0.480
Token N
Token ID399
Feature activation+1.92e+00

Pos logit contributions
 Mir+0.272
 Citation+0.271
hots+0.261
exec+0.246
camp+0.243
Neg logit contributions
Tam-0.356
paralleled-0.348
lear-0.343
-0.341
��-0.341
Token,
Token ID11
Feature activation-6.56e-01

Pos logit contributions
Tam+0.331
paralleled+0.325
lear+0.316
+0.316
��+0.315
Neg logit contributions
 Mir-0.294
 Citation-0.294
hots-0.284
exec-0.269
camp-0.267
Token 2014
Token ID1946
Feature activation-5.45e-02

Pos logit contributions
 Mir+0.387
 Citation+0.385
hots+0.370
exec+0.348
camp+0.343
Neg logit contributions
Tam-0.560
paralleled-0.550
lear-0.537
-0.536
��-0.535
Token
Token ID198
Feature activation+4.57e-02

Pos logit contributions
 Mir+0.034
 Citation+0.034
hots+0.033
exec+0.031
camp+0.030
Neg logit contributions
Tam-0.045
paralleled-0.044
lear-0.043
Compare-0.043
-0.043
Token
Token ID198
Feature activation-3.14e-01

Pos logit contributions
Tam+0.038
paralleled+0.038
��+0.037
lear+0.037
+0.037
Neg logit contributions
 Mir-0.027
 Citation-0.027
hots-0.026
exec-0.025
camp-0.024
TokenI
Token ID40
Feature activation-5.55e-01

Pos logit contributions
 Mir+0.210
 Citation+0.209
hots+0.203
exec+0.193
camp+0.192
Neg logit contributions
Tam-0.239
paralleled-0.237
��-0.231
lear-0.230
-0.230
Tokenne
Token ID710
Feature activation+3.18e+00

Pos logit contributions
 Mir+0.351
 Citation+0.346
hots+0.333
exec+0.315
camp+0.313
Neg logit contributions
Tam-0.450
paralleled-0.441
��-0.435
-0.431
Compare-0.428
Tokenv
Token ID85
Feature activation-1.53e+00

Pos logit contributions
Tam+2.099
paralleled+2.048
lear+1.994
Compare+1.986
+1.982
Neg logit contributions
 Citation-2.474
 Mir-2.473
hots-2.404
exec-2.292
camp-2.272
Tokenitably
Token ID14829
Feature activation+7.05e-01

Pos logit contributions
 Mir+1.247
 Citation+1.232
hots+1.194
exec+1.152
camp+1.141
Neg logit contributions
Tam-0.959
paralleled-0.926
��-0.908
lear-0.904
-0.902
Token,
Token ID11
Feature activation+5.41e-01

Pos logit contributions
Tam+0.541
paralleled+0.529
lear+0.516
+0.515
Compare+0.514
Neg logit contributions
 Mir-0.474
 Citation-0.474
hots-0.458
exec-0.434
camp-0.429
Token with
Token ID351
Feature activation+6.87e-01

Pos logit contributions
Tam+0.435
paralleled+0.427
lear+0.416
+0.415
Compare+0.415
Neg logit contributions
 Mir-0.345
 Citation-0.343
hots-0.331
exec-0.313
camp-0.309
Token all
Token ID477
Feature activation-2.93e-01

Pos logit contributions
Tam+0.552
paralleled+0.541
lear+0.528
+0.527
Compare+0.526
Neg logit contributions
 Mir-0.440
 Citation-0.438
hots-0.422
exec-0.398
camp-0.394
Token?
Token ID30
Feature activation+1.21e+00

Pos logit contributions
 Mir+0.536
 Citation+0.529
hots+0.511
exec+0.482
camp+0.476
Neg logit contributions
Tam-0.735
paralleled-0.719
��-0.710
lear-0.705
-0.702
Token
Token ID198
Feature activation-4.58e-01

Pos logit contributions
Tam+0.951
paralleled+0.930
lear+0.907
Compare+0.906
+0.906
Neg logit contributions
 Mir-0.787
 Citation-0.787
hots-0.762
exec-0.720
camp-0.712
Token
Token ID198
Feature activation-4.09e-01

Pos logit contributions
 Mir+0.271
 Citation+0.269
hots+0.258
exec+0.244
camp+0.242
Neg logit contributions
Tam-0.389
paralleled-0.383
��-0.378
lear-0.375
-0.375
TokenWill
Token ID8743
Feature activation-3.44e-01

Pos logit contributions
 Mir+0.271
 Citation+0.270
hots+0.262
exec+0.250
camp+0.247
Neg logit contributions
Tam-0.314
paralleled-0.311
lear-0.303
��-0.302
-0.302
Token P
Token ID350
Feature activation+9.54e-01

Pos logit contributions
 Mir+0.223
 Citation+0.220
hots+0.213
exec+0.202
camp+0.200
Neg logit contributions
Tam-0.274
paralleled-0.269
lear-0.262
-0.261
Compare-0.261
Tokenfl
Token ID2704
Feature activation+3.16e+00

Pos logit contributions
Tam+0.646
paralleled+0.631
lear+0.611
+0.610
Compare+0.609
Neg logit contributions
 Mir-0.727
 Citation-0.726
hots-0.706
exec-0.674
camp-0.669
Tokenaum
Token ID26043
Feature activation-2.60e+00

Pos logit contributions
Tam+1.632
paralleled+1.580
lear+1.520
+1.515
Compare+1.513
Neg logit contributions
 Mir-2.913
 Citation-2.909
hots-2.840
exec-2.730
camp-2.711
Token Bl
Token ID1086
Feature activation-3.27e-01

Pos logit contributions
 Mir+1.740
 Citation+1.731
hots+1.659
exec+1.587
camp+1.566
Neg logit contributions
Tam-2.018
paralleled-1.968
-1.913
lear-1.910
Compare-1.907
Tokenocked
Token ID3543
Feature activation+6.33e-01

Pos logit contributions
 Mir+0.259
 Citation+0.256
hots+0.252
exec+0.241
camp+0.239
Neg logit contributions
Tam-0.213
paralleled-0.208
��-0.203
-0.202
Compare-0.202
Token Un
Token ID791
Feature activation+1.06e+00

Pos logit contributions
Tam+0.491
paralleled+0.481
lear+0.468
+0.468
Compare+0.467
Neg logit contributions
 Mir-0.422
 Citation-0.421
hots-0.406
exec-0.384
camp-0.381
Tokenblock
Token ID9967
Feature activation-4.00e-01

Pos logit contributions
Tam+0.772
paralleled+0.752
lear+0.733
+0.732
Compare+0.732
Neg logit contributions
 Mir-0.759
 Citation-0.757
hots-0.735
exec-0.699
camp-0.692
Token get
Token ID651
Feature activation-8.23e-01

Pos logit contributions
 Mir+0.241
 Citation+0.238
hots+0.229
camp+0.217
exec+0.216
Neg logit contributions
Tam-0.284
paralleled-0.278
��-0.274
-0.270
lear-0.270
Token your
Token ID534
Feature activation-8.50e-01

Pos logit contributions
 Mir+0.523
 Citation+0.518
hots+0.498
exec+0.468
camp+0.467
Neg logit contributions
Tam-0.669
paralleled-0.657
lear-0.640
-0.639
��-0.639
Token tickets
Token ID8587
Feature activation-2.50e-01

Pos logit contributions
 Mir+0.507
 Citation+0.498
hots+0.474
camp+0.446
exec+0.443
Neg logit contributions
Tam-0.728
paralleled-0.715
��-0.698
lear-0.697
Compare-0.695
Token!
Token ID0
Feature activation-3.10e-01

Pos logit contributions
 Mir+0.157
 Citation+0.156
hots+0.151
exec+0.141
camp+0.140
Neg logit contributions
Tam-0.204
paralleled-0.200
lear-0.195
-0.195
Compare-0.195
Token Che
Token ID2580
Feature activation-8.73e-03

Pos logit contributions
 Mir+0.195
 Citation+0.194
hots+0.186
exec+0.175
camp+0.174
Neg logit contributions
Tam-0.252
paralleled-0.248
lear-0.242
-0.241
��-0.241
Tokeners
Token ID364
Feature activation+3.12e+00

Pos logit contributions
 Mir+0.007
 Citation+0.007
hots+0.006
exec+0.006
camp+0.006
Neg logit contributions
Tam-0.006
paralleled-0.006
lear-0.006
-0.006
Compare-0.006
Token!
Token ID0
Feature activation-1.19e+00

Pos logit contributions
Tam+2.223
paralleled+2.142
+2.100
lear+2.086
Compare+2.082
Neg logit contributions
 Citation-2.253
 Mir-2.236
hots-2.181
exec-2.094
camp-2.053
Token<|endoftext|>
Token ID50256
Feature activation+4.18e-02

Pos logit contributions
 Mir+0.727
 Citation+0.722
hots+0.691
exec+0.650
camp+0.646
Neg logit contributions
Tam-0.985
paralleled-0.969
lear-0.947
-0.944
Compare-0.943
TokenAf
Token ID17584
Feature activation+6.53e-02

Pos logit contributions
Tam+0.032
paralleled+0.032
lear+0.031
+0.031
��+0.031
Neg logit contributions
 Mir-0.028
 Citation-0.028
hots-0.027
exec-0.026
camp-0.025
Tokenghan
Token ID6064
Feature activation-3.76e-01

Pos logit contributions
Tam+0.054
paralleled+0.053
+0.052
lear+0.052
Compare+0.052
Neg logit contributions
 Mir-0.040
 Citation-0.040
hots-0.038
exec-0.036
camp-0.036
Token artist
Token ID6802
Feature activation+3.39e-01

Pos logit contributions
 Mir+0.277
 Citation+0.274
hots+0.265
exec+0.254
camp+0.251
Neg logit contributions
Tam-0.266
paralleled-0.262
Compare-0.254
lear-0.253
-0.252
Token,
Token ID11
Feature activation+4.62e-01

Pos logit contributions
 Mir+0.178
 Citation+0.178
hots+0.170
camp+0.163
exec+0.163
Neg logit contributions
Tam-0.174
paralleled-0.170
lear-0.168
��-0.165
Compare-0.164
Token which
Token ID543
Feature activation+2.69e-01

Pos logit contributions
Tam+0.372
paralleled+0.365
lear+0.356
Compare+0.354
+0.354
Neg logit contributions
 Mir-0.295
 Citation-0.294
hots-0.283
exec-0.268
camp-0.265
Token for
Token ID329
Feature activation-1.41e-01

Pos logit contributions
Tam+0.214
paralleled+0.209
lear+0.205
Compare+0.203
+0.203
Neg logit contributions
 Mir-0.176
 Citation-0.174
hots-0.168
exec-0.159
camp-0.158
Token the
Token ID262
Feature activation-3.55e-01

Pos logit contributions
 Mir+0.086
 Citation+0.085
hots+0.082
exec+0.078
camp+0.076
Neg logit contributions
Tam-0.117
paralleled-0.115
lear-0.112
-0.112
Compare-0.112
Token most
Token ID749
Feature activation+1.11e+00

Pos logit contributions
 Mir+0.197
 Citation+0.196
hots+0.185
exec+0.176
camp+0.171
Neg logit contributions
Tam-0.317
paralleled-0.311
lear-0.304
-0.304
Compare-0.302
Token part
Token ID636
Feature activation+3.58e+00

Pos logit contributions
Tam+1.277
paralleled+1.259
lear+1.238
+1.236
Compare+1.234
Neg logit contributions
 Mir-0.329
 Citation-0.326
hots-0.302
exec-0.263
camp-0.257
Token is
Token ID318
Feature activation+5.18e-01

Pos logit contributions
Tam+2.685
paralleled+2.610
+2.564
Compare+2.552
lear+2.546
Neg logit contributions
 Citation-2.440
 Mir-2.400
hots-2.342
exec-2.219
ancial-2.216
Token uncontroll
Token ID39483
Feature activation-1.04e+00

Pos logit contributions
Tam+0.426
paralleled+0.418
lear+0.409
Compare+0.407
+0.406
Neg logit contributions
 Mir-0.321
 Citation-0.318
hots-0.308
exec-0.290
camp-0.286
Tokenable
Token ID540
Feature activation-5.10e-02

Pos logit contributions
 Mir+0.602
 Citation+0.593
hots+0.582
exec+0.545
camp+0.541
Neg logit contributions
Tam-0.895
paralleled-0.877
��-0.871
Compare-0.861
-0.852
Token.
Token ID13
Feature activation-8.45e-02

Pos logit contributions
 Mir+0.036
 Citation+0.036
hots+0.035
exec+0.033
camp+0.033
Neg logit contributions
Tam-0.038
paralleled-0.037
lear-0.036
Compare-0.036
��-0.036
Token
Token ID447
Feature activation+7.17e-01

Pos logit contributions
 Mir+0.052
 Citation+0.051
hots+0.049
exec+0.046
camp+0.046
Neg logit contributions
Tam-0.070
paralleled-0.070
lear-0.068
��-0.068
-0.067
Token:
Token ID25
Feature activation+3.12e+00

Pos logit contributions
Tam+1.729
paralleled+1.631
Compare+1.612
+1.578
lear+1.574
Neg logit contributions
 Citation-2.394
hots-2.373
 Mir-2.372
exec-2.258
ancial-2.237
Token report
Token ID989
Feature activation+1.89e+00

Pos logit contributions
Tam+2.346
paralleled+2.303
lear+2.254
Compare+2.233
+2.224
Neg logit contributions
 Mir-2.177
 Citation-2.166
hots-2.069
exec-1.982
camp-1.953
Token Vietnamese
Token ID23618
Feature activation+2.87e+00

Pos logit contributions
Tam+1.462
paralleled+1.442
lear+1.415
+1.408
Compare+1.392
Neg logit contributions
 Mir-1.272
 Citation-1.269
hots-1.207
exec-1.150
camp-1.138
Token airline
Token ID18091
Feature activation-9.82e-02

Pos logit contributions
Tam+2.246
paralleled+2.193
Compare+2.142
lear+2.138
+2.136
Neg logit contributions
 Citation-1.882
 Mir-1.867
hots-1.820
exec-1.714
camp-1.708
Token takes
Token ID2753
Feature activation-8.60e-02

Pos logit contributions
 Mir+0.064
 Citation+0.063
hots+0.061
exec+0.057
camp+0.057
Neg logit contributions
Tam-0.078
paralleled-0.077
lear-0.075
-0.075
Compare-0.075
Token steps
Token ID4831
Feature activation+1.04e+00

Pos logit contributions
 Mir+0.038
 Citation+0.037
hots+0.035
exec+0.032
camp+0.032
Neg logit contributions
Tam-0.086
paralleled-0.086
-0.084
lear-0.084
Compare-0.084
Token to
Token ID284
Feature activation+1.20e-01

Pos logit contributions
Tam+0.861
paralleled+0.847
lear+0.828
+0.826
Compare+0.822
Neg logit contributions
 Mir-0.633
 Citation-0.629
hots-0.607
exec-0.572
camp-0.566
Token open
Token ID1280
Feature activation+1.81e+00

Pos logit contributions
Tam+0.095
paralleled+0.095
lear+0.095
��+0.094
Compare+0.091
Neg logit contributions
 Mir-0.081
 Citation-0.076
 barric-0.071
exec-0.071
camp-0.070
Token flights
Token ID13956
Feature activation+1.52e+00

Pos logit contributions
Tam+1.146
paralleled+1.114
Compare+1.081
lear+1.081
+1.077
Neg logit contributions
 Citation-1.456
 Mir-1.453
hots-1.415
exec-1.355
camp-1.341
Token to
Token ID284
Feature activation-2.55e-01

Pos logit contributions
Tam+1.156
paralleled+1.121
Compare+1.089
lear+1.079
+1.078
Neg logit contributions
 Citation-1.033
hots-0.989
 Mir-0.985
ancial-0.948
exec-0.936
Token US
Token ID1294
Feature activation-7.93e-02

Pos logit contributions
 Citation+0.182
 Mir+0.181
hots+0.176
exec+0.167
camp+0.165
Neg logit contributions
Tam-0.186
paralleled-0.182
-0.177
lear-0.176
Compare-0.176
Token operating
Token ID5361
Feature activation-8.14e-01

Pos logit contributions
Tam+0.304
paralleled+0.299
lear+0.292
+0.289
��+0.288
Neg logit contributions
 Mir-0.259
 Citation-0.256
hots-0.248
exec-0.235
camp-0.232
Token system
Token ID1080
Feature activation+3.15e-01

Pos logit contributions
 Mir+0.667
 Citation+0.658
hots+0.641
exec+0.618
camp+0.611
Neg logit contributions
Tam-0.507
paralleled-0.498
lear-0.483
Compare-0.479
��-0.479
Token.
Token ID13
Feature activation-1.63e-01

Pos logit contributions
Tam+0.234
paralleled+0.232
lear+0.225
+0.225
��+0.224
Neg logit contributions
 Mir-0.222
 Citation-0.219
hots-0.212
exec-0.200
camp-0.198
Token In
Token ID554
Feature activation+1.07e-01

Pos logit contributions
 Mir+0.106
 Citation+0.105
hots+0.101
exec+0.095
camp+0.094
Neg logit contributions
Tam-0.130
paralleled-0.129
lear-0.125
��-0.125
-0.124
Token April
Token ID3035
Feature activation+6.37e-01

Pos logit contributions
Tam+0.087
paralleled+0.086
lear+0.084
+0.083
Compare+0.083
Neg logit contributions
 Mir-0.066
 Citation-0.066
hots-0.064
exec-0.060
camp-0.059
Token,
Token ID11
Feature activation+4.37e-01

Pos logit contributions
Tam+0.488
paralleled+0.477
lear+0.465
+0.464
Compare+0.464
Neg logit contributions
 Mir-0.432
 Citation-0.430
hots-0.416
exec-0.393
camp-0.390
Token two
Token ID734
Feature activation+7.15e-02

Pos logit contributions
Tam+0.337
paralleled+0.330
lear+0.322
Compare+0.321
+0.321
Neg logit contributions
 Mir-0.293
 Citation-0.291
hots-0.282
exec-0.267
camp-0.264
Token Korean
Token ID6983
Feature activation+6.80e-01

Pos logit contributions
Tam+0.051
paralleled+0.050
lear+0.049
+0.049
Compare+0.049
Neg logit contributions
 Mir-0.052
 Citation-0.052
hots-0.050
exec-0.048
camp-0.047
Token Internet
Token ID4455
Feature activation-5.70e-01

Pos logit contributions
Tam+0.512
paralleled+0.501
lear+0.488
+0.487
Compare+0.486
Neg logit contributions
 Mir-0.467
 Citation-0.466
hots-0.451
exec-0.428
camp-0.423
Token companies
Token ID2706
Feature activation+7.63e-01

Pos logit contributions
 Mir+0.413
 Citation+0.413
hots+0.397
exec+0.376
camp+0.373
Neg logit contributions
Tam-0.412
paralleled-0.405
lear-0.394
��-0.393
Compare-0.390
Token--
Token ID438
Feature activation-1.37e+00

Pos logit contributions
Tam+0.625
paralleled+0.613
lear+0.598
+0.597
Compare+0.597
Neg logit contributions
 Mir-0.476
 Citation-0.473
hots-0.457
exec-0.430
camp-0.425
Tokenot
Token ID313
Feature activation-2.61e-01

Pos logit contributions
 Mir+0.104
 Citation+0.103
hots+0.100
exec+0.096
camp+0.095
Neg logit contributions
Tam-0.103
paralleled-0.101
-0.098
��-0.098
lear-0.098
Tokenerie
Token ID18287
Feature activation-2.31e+00

Pos logit contributions
 Mir+0.156
 Citation+0.155
hots+0.149
exec+0.141
camp+0.139
Neg logit contributions
Tam-0.219
paralleled-0.215
-0.210
lear-0.210
Compare-0.210
Token of
Token ID286
Feature activation-1.19e-01

Pos logit contributions
 Mir+1.434
 Citation+1.414
hots+1.374
exec+1.296
camp+1.285
Neg logit contributions
Tam-1.887
paralleled-1.858
lear-1.819
-1.809
��-1.808
Token the
Token ID262
Feature activation+7.58e-01

Pos logit contributions
 Mir+0.075
 Citation+0.074
hots+0.072
exec+0.068
camp+0.067
Neg logit contributions
Tam-0.097
paralleled-0.095
lear-0.093
-0.093
Compare-0.092
Token super
Token ID2208
Feature activation-3.10e-01

Pos logit contributions
Tam+0.625
paralleled+0.612
lear+0.598
+0.596
Compare+0.596
Neg logit contributions
 Mir-0.466
 Citation-0.466
hots-0.450
exec-0.423
camp-0.418
Token-
Token ID12
Feature activation+6.43e-01

Pos logit contributions
 Mir+0.158
 Citation+0.156
hots+0.150
exec+0.140
camp+0.138
Neg logit contributions
Tam-0.289
paralleled-0.284
lear-0.278
-0.277
Compare-0.277
Tokenrich
Token ID7527
Feature activation-1.55e+00

Pos logit contributions
Tam+0.713
paralleled+0.702
lear+0.690
+0.689
Compare+0.688
Neg logit contributions
 Mir-0.213
 Citation-0.212
hots-0.198
exec-0.175
camp-0.171
Token.
Token ID13
Feature activation-2.31e-01

Pos logit contributions
 Mir+1.028
 Citation+1.015
hots+0.988
exec+0.933
camp+0.929
Neg logit contributions
Tam-1.200
paralleled-1.184
lear-1.154
-1.147
��-1.145
Token And
Token ID843
Feature activation-1.51e+00

Pos logit contributions
 Mir+0.152
 Citation+0.151
hots+0.143
exec+0.136
camp+0.135
Neg logit contributions
Tam-0.183
paralleled-0.181
lear-0.176
��-0.175
-0.175
Token then
Token ID788
Feature activation-2.30e-02

Pos logit contributions
 Mir+0.935
 Citation+0.918
hots+0.884
exec+0.832
camp+0.822
Neg logit contributions
Tam-1.253
paralleled-1.239
lear-1.204
-1.203
��-1.202
Token perhaps
Token ID3737
Feature activation+8.33e-01

Pos logit contributions
 Mir+0.014
 Citation+0.014
hots+0.014
exec+0.013
camp+0.013
Neg logit contributions
Tam-0.019
paralleled-0.019
lear-0.018
-0.018
Compare-0.018
Token Libya
Token ID13465
Feature activation-4.58e-01

Pos logit contributions
 Mir+0.133
 Citation+0.131
hots+0.127
exec+0.120
camp+0.119
Neg logit contributions
Tam-0.142
paralleled-0.139
lear-0.136
-0.136
Compare-0.135
Token might
Token ID1244
Feature activation+4.89e-01

Pos logit contributions
 Mir+0.284
 Citation+0.280
hots+0.273
exec+0.255
camp+0.254
Neg logit contributions
Tam-0.378
paralleled-0.368
lear-0.361
-0.360
Compare-0.359
Token occur
Token ID3051
Feature activation-6.03e-01

Pos logit contributions
Tam+0.395
paralleled+0.387
+0.378
lear+0.378
Compare+0.377
Neg logit contributions
 Mir-0.310
 Citation-0.307
hots-0.298
exec-0.281
camp-0.278
Token in
Token ID287
Feature activation-7.42e-01

Pos logit contributions
 Mir+0.390
 Citation+0.384
hots+0.374
exec+0.351
camp+0.348
Neg logit contributions
Tam-0.483
paralleled-0.472
lear-0.463
��-0.462
-0.462
Token the
Token ID262
Feature activation-2.79e-01

Pos logit contributions
 Mir+0.488
 Citation+0.475
hots+0.462
exec+0.432
camp+0.430
Neg logit contributions
Tam-0.589
paralleled-0.581
-0.571
lear-0.570
��-0.569
Token future
Token ID2003
Feature activation+1.50e+00

Pos logit contributions
 Mir+0.183
 Citation+0.177
hots+0.172
exec+0.162
camp+0.161
Neg logit contributions
Tam-0.223
paralleled-0.222
��-0.216
-0.216
lear-0.215
Token,
Token ID11
Feature activation-4.54e-01

Pos logit contributions
Tam+1.151
paralleled+1.128
lear+1.101
+1.096
Compare+1.095
Neg logit contributions
 Mir-1.009
 Citation-1.004
hots-0.971
exec-0.918
camp-0.911
Token Obama
Token ID2486
Feature activation+2.68e-01

Pos logit contributions
 Mir+0.315
 Citation+0.313
hots+0.302
exec+0.287
camp+0.284
Neg logit contributions
Tam-0.341
paralleled-0.333
-0.325
lear-0.325
Compare-0.323
Token launched
Token ID5611
Feature activation-2.85e-02

Pos logit contributions
Tam+0.220
paralleled+0.215
+0.210
lear+0.210
Compare+0.209
Neg logit contributions
 Mir-0.167
 Citation-0.166
hots-0.160
exec-0.151
camp-0.149
Token a
Token ID257
Feature activation+1.09e+00

Pos logit contributions
 Mir+0.018
 Citation+0.018
hots+0.017
exec+0.016
camp+0.016
Neg logit contributions
Tam-0.023
paralleled-0.023
-0.022
lear-0.022
Compare-0.022
Token war
Token ID1175
Feature activation+7.78e-01

Pos logit contributions
Tam+0.876
paralleled+0.858
lear+0.837
+0.835
Compare+0.835
Neg logit contributions
 Mir-0.696
 Citation-0.695
hots-0.672
exec-0.633
camp-0.626
Token and
Token ID290
Feature activation-1.28e-01

Pos logit contributions
Tam+1.798
paralleled+1.750
Compare+1.707
lear+1.707
+1.697
Neg logit contributions
 Citation-1.622
 Mir-1.613
hots-1.563
exec-1.483
camp-1.465
Token to
Token ID284
Feature activation+4.54e-01

Pos logit contributions
 Mir+0.078
 Citation+0.077
hots+0.074
exec+0.070
camp+0.069
Neg logit contributions
Tam-0.108
paralleled-0.106
-0.103
lear-0.103
��-0.103
Token make
Token ID787
Feature activation+2.24e-01

Pos logit contributions
Tam+0.381
paralleled+0.374
+0.365
lear+0.365
Compare+0.364
Neg logit contributions
 Mir-0.274
 Citation-0.273
hots-0.263
exec-0.247
camp-0.245
Token every
Token ID790
Feature activation+1.30e+00

Pos logit contributions
Tam+0.184
paralleled+0.181
lear+0.177
+0.176
Compare+0.176
Neg logit contributions
 Mir-0.139
 Citation-0.138
hots-0.134
exec-0.125
camp-0.124
Token play
Token ID711
Feature activation+6.78e-01

Pos logit contributions
Tam+1.205
paralleled+1.182
lear+1.158
Compare+1.155
+1.151
Neg logit contributions
 Citation-0.669
 Mir-0.667
hots-0.642
exec-0.594
camp-0.587
Token difficult
Token ID2408
Feature activation+4.90e-01

Pos logit contributions
Tam+0.532
paralleled+0.520
lear+0.508
+0.506
Compare+0.506
Neg logit contributions
 Mir-0.444
 Citation-0.443
hots-0.428
exec-0.405
camp-0.401
Token for
Token ID329
Feature activation+8.84e-01

Pos logit contributions
Tam+0.369
paralleled+0.361
lear+0.351
+0.350
Compare+0.350
Neg logit contributions
 Mir-0.337
 Citation-0.337
hots-0.326
exec-0.309
camp-0.306
Token his
Token ID465
Feature activation+1.13e+00

Pos logit contributions
Tam+0.673
paralleled+0.660
lear+0.644
Compare+0.641
+0.641
Neg logit contributions
 Mir-0.603
 Citation-0.597
hots-0.579
exec-0.549
camp-0.541
Token opponent
Token ID6125
Feature activation+1.03e+00

Pos logit contributions
Tam+0.848
paralleled+0.829
lear+0.809
+0.807
Compare+0.806
Neg logit contributions
 Mir-0.779
 Citation-0.774
hots-0.749
exec-0.712
camp-0.703
Token."
Token ID526
Feature activation+7.91e-01

Pos logit contributions
Tam+0.752
paralleled+0.734
lear+0.715
Compare+0.712
+0.712
Neg logit contributions
 Mir-0.725
 Citation-0.725
hots-0.702
exec-0.666
camp-0.660
Token
Token ID198
Feature activation+1.31e+00

Pos logit contributions
Tam+0.634
paralleled+0.621
Compare+0.605
lear+0.605
+0.602
Neg logit contributions
 Citation-0.501
 Mir-0.500
hots-0.488
exec-0.457
camp-0.452
Tokenaul
Token ID2518
Feature activation+1.13e+00

Pos logit contributions
Tam+0.731
paralleled+0.717
+0.695
Compare+0.695
lear+0.695
Neg logit contributions
 Mir-0.708
 Citation-0.704
hots-0.684
exec-0.648
camp-0.642
Tokeni
Token ID72
Feature activation-4.60e-01

Pos logit contributions
Tam+0.954
paralleled+0.935
lear+0.914
+0.912
Compare+0.911
Neg logit contributions
 Mir-0.674
 Citation-0.674
hots-0.649
exec-0.609
camp-0.602
Token.
Token ID13
Feature activation+1.16e+00

Pos logit contributions
 Mir+0.305
 Citation+0.300
hots+0.291
exec+0.277
camp+0.273
Neg logit contributions
Tam-0.361
paralleled-0.351
Compare-0.343
-0.342
lear-0.341
Tokenre
Token ID260
Feature activation-4.56e-02

Pos logit contributions
Tam+0.899
paralleled+0.879
Compare+0.855
+0.854
lear+0.854
Neg logit contributions
 Mir-0.778
 Citation-0.773
hots-0.752
exec-0.712
camp-0.703
Tokenin
Token ID259
Feature activation-7.70e-01

Pos logit contributions
 Mir+0.031
 Citation+0.030
hots+0.030
exec+0.028
camp+0.028
Neg logit contributions
Tam-0.035
paralleled-0.034
-0.033
Compare-0.033
��-0.033
Tokenik
Token ID1134
Feature activation-9.56e-01

Pos logit contributions
 Mir+0.507
 Citation+0.504
hots+0.492
exec+0.463
camp+0.459
Neg logit contributions
Tam-0.605
paralleled-0.585
Compare-0.578
-0.574
��-0.572
Tokenain
Token ID391
Feature activation-1.07e+00

Pos logit contributions
 Mir+0.783
 Citation+0.773
hots+0.763
exec+0.731
camp+0.724
Neg logit contributions
Tam-0.599
paralleled-0.565
Compare-0.563
��-0.558
-0.555
Tokenen
Token ID268
Feature activation+6.21e-01

Pos logit contributions
 Mir+0.617
 Citation+0.603
hots+0.586
exec+0.552
camp+0.544
Neg logit contributions
Tam-0.925
paralleled-0.900
Compare-0.888
��-0.884
-0.884
Token @
Token ID2488
Feature activation-1.19e+00

Pos logit contributions
Tam+0.463
paralleled+0.452
lear+0.441
+0.440
Compare+0.439
Neg logit contributions
 Mir-0.432
 Citation-0.431
hots-0.418
exec-0.396
camp-0.392
Token il
Token ID4229
Feature activation+1.08e+00

Pos logit contributions
 Mir+0.794
 Citation+0.787
hots+0.767
exec+0.727
camp+0.719
Neg logit contributions
Tam-0.914
paralleled-0.903
Compare-0.874
lear-0.874
-0.873
Tokenmedia
Token ID11431
Feature activation+2.46e-01

Pos logit contributions
Tam+0.814
paralleled+0.797
lear+0.776
+0.774
Compare+0.774
Neg logit contributions
 Mir-0.742
 Citation-0.740
hots-0.717
exec-0.680
camp-0.673
Token way
Token ID835
Feature activation-2.34e-01

Pos logit contributions
 Mir+0.387
 Citation+0.381
hots+0.369
exec+0.345
camp+0.342
Neg logit contributions
Tam-0.519
paralleled-0.507
��-0.500
-0.499
lear-0.499
Token over
Token ID625
Feature activation-3.92e-01

Pos logit contributions
 Mir+0.142
 Citation+0.140
hots+0.137
exec+0.127
camp+0.126
Neg logit contributions
Tam-0.196
paralleled-0.192
-0.189
lear-0.189
��-0.187
Token to
Token ID284
Feature activation+1.07e-01

Pos logit contributions
 Mir+0.242
 Citation+0.239
hots+0.231
exec+0.218
camp+0.217
Neg logit contributions
Tam-0.324
paralleled-0.317
lear-0.312
-0.310
��-0.310
Token Liverpool
Token ID11761
Feature activation-1.06e+00

Pos logit contributions
Tam+0.085
paralleled+0.083
lear+0.081
+0.081
Compare+0.081
Neg logit contributions
 Mir-0.070
 Citation-0.069
hots-0.067
exec-0.063
camp-0.063
Token,
Token ID11
Feature activation+3.47e-01

Pos logit contributions
 Mir+0.682
 Citation+0.669
hots+0.648
exec+0.607
camp+0.605
Neg logit contributions
Tam-0.863
paralleled-0.841
lear-0.826
-0.824
��-0.822
Token England
Token ID4492
Feature activation-1.38e+00

Pos logit contributions
Tam+0.275
paralleled+0.270
lear+0.264
+0.263
Compare+0.262
Neg logit contributions
 Mir-0.227
 Citation-0.224
hots-0.217
exec-0.205
camp-0.202
Token actually
Token ID1682
Feature activation+4.58e-01

Pos logit contributions
 Mir+0.890
 Citation+0.875
hots+0.848
exec+0.796
camp+0.790
Neg logit contributions
Tam-1.106
paralleled-1.079
-1.063
��-1.062
lear-1.059
Token worked
Token ID3111
Feature activation+1.13e-01

Pos logit contributions
Tam+0.369
paralleled+0.359
+0.352
lear+0.351
Compare+0.351
Neg logit contributions
 Mir-0.294
 Citation-0.291
hots-0.283
exec-0.266
camp-0.263
Token!
Token ID0
Feature activation-2.28e-01

Pos logit contributions
Tam+0.090
paralleled+0.088
+0.086
lear+0.086
��+0.086
Neg logit contributions
 Mir-0.073
 Citation-0.072
hots-0.070
exec-0.066
camp-0.066
Token HW
Token ID44884
Feature activation+2.94e-01

Pos logit contributions
 Mir+0.150
 Citation+0.149
hots+0.143
exec+0.135
camp+0.133
Neg logit contributions
Tam-0.179
paralleled-0.177
lear-0.173
-0.172
��-0.172
TokenND
Token ID8575
Feature activation-7.58e-01

Pos logit contributions
Tam+0.225
paralleled+0.221
+0.216
lear+0.216
Compare+0.215
Neg logit contributions
 Mir-0.198
 Citation-0.197
hots-0.191
exec-0.181
camp-0.179
Token and
Token ID290
Feature activation-4.83e-01

Pos logit contributions
Tam+0.858
paralleled+0.839
lear+0.818
+0.817
Compare+0.815
Neg logit contributions
 Mir-0.713
 Citation-0.706
hots-0.683
exec-0.646
camp-0.640
Token culture
Token ID3968
Feature activation+8.17e-01

Pos logit contributions
 Mir+0.306
 Citation+0.300
hots+0.289
exec+0.275
camp+0.272
Neg logit contributions
Tam-0.394
paralleled-0.388
-0.378
��-0.376
lear-0.376
Token that
Token ID326
Feature activation-3.07e-01

Pos logit contributions
Tam+0.633
paralleled+0.618
lear+0.603
+0.602
Compare+0.600
Neg logit contributions
 Mir-0.548
 Citation-0.543
hots-0.526
exec-0.498
camp-0.494
Token dominates
Token ID38777
Feature activation-1.53e+00

Pos logit contributions
 Mir+0.200
 Citation+0.196
hots+0.190
exec+0.180
camp+0.178
Neg logit contributions
Tam-0.246
paralleled-0.239
lear-0.234
-0.233
��-0.233
Token the
Token ID262
Feature activation+4.18e-01

Pos logit contributions
 Mir+0.996
 Citation+0.970
hots+0.933
exec+0.882
 Lar+0.875
Neg logit contributions
Tam-1.222
paralleled-1.210
lear-1.181
-1.176
��-1.169
Token state
Token ID1181
Feature activation-1.11e+00

Pos logit contributions
Tam+0.298
paralleled+0.292
lear+0.285
+0.284
Compare+0.281
Neg logit contributions
 Mir-0.308
 Citation-0.304
hots-0.293
exec-0.281
camp-0.278
Token.
Token ID13
Feature activation-1.93e-01

Pos logit contributions
 Mir+0.754
 Citation+0.743
hots+0.713
exec+0.686
camp+0.681
Neg logit contributions
Tam-0.855
paralleled-0.841
��-0.823
lear-0.818
-0.814
Token Like
Token ID4525
Feature activation+5.63e-01

Pos logit contributions
 Mir+0.134
 Citation+0.132
hots+0.125
camp+0.119
exec+0.118
Neg logit contributions
Tam-0.147
paralleled-0.147
��-0.143
lear-0.142
-0.141
Token Islam
Token ID3449
Feature activation+1.18e+00

Pos logit contributions
Tam+0.476
paralleled+0.470
lear+0.459
+0.457
Compare+0.455
Neg logit contributions
 Mir-0.338
 Citation-0.333
hots-0.320
exec-0.300
camp-0.297
Token and
Token ID290
Feature activation-2.76e-01

Pos logit contributions
Tam+0.920
paralleled+0.899
lear+0.877
+0.874
Compare+0.873
Neg logit contributions
 Mir-0.781
 Citation-0.776
hots-0.751
exec-0.710
camp-0.702
Token most
Token ID749
Feature activation+9.52e-01

Pos logit contributions
 Mir+0.240
 Citation+0.234
hots+0.226
exec+0.218
camp+0.216
Neg logit contributions
paralleled-0.159
Tam-0.159
lear-0.154
-0.154
��-0.153
Token government
Token ID1230
Feature activation-1.50e+00

Pos logit contributions
 Mir+0.441
 Citation+0.436
hots+0.419
exec+0.403
camp+0.397
Neg logit contributions
Tam-0.464
paralleled-0.455
lear-0.442
Compare-0.440
-0.438
Token,
Token ID11
Feature activation+9.19e-01

Pos logit contributions
 Mir+0.979
 Citation+0.962
hots+0.923
camp+0.879
exec+0.879
Neg logit contributions
Tam-1.184
paralleled-1.167
��-1.134
lear-1.133
-1.128
Token
Token ID447
Feature activation+1.36e+00

Pos logit contributions
Tam+0.642
paralleled+0.626
lear+0.609
+0.607
Compare+0.607
Neg logit contributions
 Mir-0.682
 Citation-0.681
hots-0.661
exec-0.629
camp-0.623
Token
Token ID251
Feature activation+5.90e-01

Pos logit contributions
Tam+0.756
paralleled+0.734
lear+0.708
Compare+0.700
+0.696
Neg logit contributions
 Citation-1.219
 Mir-1.212
hots-1.175
exec-1.138
camp-1.127
Token he
Token ID339
Feature activation-6.25e-01

Pos logit contributions
Tam+0.455
paralleled+0.447
lear+0.434
Compare+0.433
��+0.431
Neg logit contributions
 Mir-0.397
 Citation-0.394
hots-0.380
exec-0.361
camp-0.358
Token says
Token ID1139
Feature activation-1.88e+00

Pos logit contributions
 Mir+0.417
 Citation+0.413
hots+0.401
exec+0.380
camp+0.375
Neg logit contributions
Tam-0.486
paralleled-0.476
��-0.462
Compare-0.462
-0.461
Token cheer
Token ID14042
Feature activation+1.96e+00

Pos logit contributions
 Mir+1.348
 Citation+1.338
hots+1.294
exec+1.232
camp+1.226
Neg logit contributions
Tam-1.354
paralleled-1.328
-1.290
lear-1.285
Compare-1.283
Tokenfully
Token ID2759
Feature activation-4.74e-01

Pos logit contributions
Tam+1.716
paralleled+1.682
lear+1.651
+1.644
Compare+1.644
Neg logit contributions
 Citation-1.097
 Mir-1.096
hots-1.053
exec-0.988
camp-0.968
Token.
Token ID13
Feature activation-3.06e-01

Pos logit contributions
 Mir+0.346
 Citation+0.344
hots+0.334
exec+0.317
camp+0.315
Neg logit contributions
Tam-0.338
paralleled-0.330
lear-0.321
-0.321
Compare-0.320
Token
Token ID564
Feature activation+4.66e-02

Pos logit contributions
 Mir+0.214
 Citation+0.213
hots+0.202
camp+0.193
exec+0.192
Neg logit contributions
Tam-0.229
paralleled-0.227
lear-0.220
��-0.219
-0.217
Token
Token ID250
Feature activation-7.66e-01

Pos logit contributions
Tam+0.018
paralleled+0.017
lear+0.016
Compare+0.016
+0.016
Neg logit contributions
 Citation-0.049
 Mir-0.049
hots-0.048
exec-0.047
camp-0.046
Tokenen
Token ID268
Feature activation-7.29e-03

Pos logit contributions
Tam+0.954
paralleled+0.935
lear+0.915
+0.910
Compare+0.910
Neg logit contributions
 Citation-0.695
 Mir-0.694
hots-0.669
exec-0.628
camp-0.621
Tokenbr
Token ID1671
Feature activation+7.60e-01

Pos logit contributions
 Mir+0.005
 Citation+0.005
hots+0.005
exec+0.005
camp+0.005
Neg logit contributions
Tam-0.005
paralleled-0.005
��-0.005
-0.005
Compare-0.005
Tokenun
Token ID403
Feature activation+3.47e-01

Pos logit contributions
Tam+0.448
��+0.433
paralleled+0.433
+0.421
Compare+0.417
Neg logit contributions
 Mir-0.647
 Citation-0.642
hots-0.633
exec-0.612
camp-0.603
Tokenner
Token ID1008
Feature activation+1.06e-01

Pos logit contributions
Tam+0.210
��+0.202
paralleled+0.202
+0.199
Compare+0.196
Neg logit contributions
 Mir-0.292
 Citation-0.291
hots-0.283
exec-0.274
camp-0.273
Token appeared
Token ID4120
Feature activation+1.68e+00

Pos logit contributions
Tam+0.087
paralleled+0.085
lear+0.083
+0.083
Compare+0.083
Neg logit contributions
 Mir-0.066
 Citation-0.066
hots-0.064
exec-0.060
camp-0.059
Token in
Token ID287
Feature activation-4.65e-01

Pos logit contributions
Tam+1.356
paralleled+1.327
lear+1.295
Compare+1.292
+1.290
Neg logit contributions
 Citation-1.055
 Mir-1.054
hots-1.017
exec-0.959
camp-0.947
Token 5
Token ID642
Feature activation+1.42e+00

Pos logit contributions
 Mir+0.291
 Citation+0.287
hots+0.276
exec+0.261
camp+0.257
Neg logit contributions
Tam-0.382
paralleled-0.374
lear-0.368
-0.367
��-0.365
Token64
Token ID2414
Feature activation+3.41e-01

Pos logit contributions
Tam+1.005
paralleled+0.983
lear+0.955
+0.952
Compare+0.950
Neg logit contributions
 Mir-1.042
 Citation-1.038
hots-1.009
exec-0.959
camp-0.950
Token games
Token ID1830
Feature activation+1.54e+00

Pos logit contributions
Tam+0.252
paralleled+0.247
lear+0.242
+0.239
Compare+0.237
Neg logit contributions
 Mir-0.242
 Citation-0.240
hots-0.233
exec-0.220
camp-0.216
Token as
Token ID355
Feature activation-4.46e-01

Pos logit contributions
Tam+1.203
paralleled+1.178
lear+1.147
Compare+1.144
+1.143
Neg logit contributions
 Citation-1.005
 Mir-1.003
hots-0.968
exec-0.918
camp-0.908
Token a
Token ID257
Feature activation-8.95e-01

Pos logit contributions
 Mir+0.276
 Citation+0.273
hots+0.264
exec+0.249
camp+0.246
Neg logit contributions
Tam-0.369
paralleled-0.361
lear-0.353
-0.353
Compare-0.352
Token signal
Token ID6737
Feature activation-6.49e-03

Pos logit contributions
Tam+0.515
paralleled+0.505
lear+0.492
+0.491
Compare+0.490
Neg logit contributions
 Mir-0.420
 Citation-0.419
hots-0.405
exec-0.383
camp-0.378
Token 11
Token ID1367
Feature activation-2.42e+00

Pos logit contributions
 Mir+0.004
 Citation+0.004
hots+0.004
exec+0.004
camp+0.004
Neg logit contributions
Tam-0.005
paralleled-0.005
-0.005
lear-0.005
Compare-0.005
Token,
Token ID837
Feature activation+1.33e+00

Pos logit contributions
 Mir+1.666
 Citation+1.655
hots+1.600
exec+1.546
camp+1.501
Neg logit contributions
Tam-1.826
paralleled-1.792
-1.751
Compare-1.730
lear-1.729
Token Se
Token ID1001
Feature activation+5.61e-02

Pos logit contributions
Tam+1.100
paralleled+1.078
lear+1.054
Compare+1.051
+1.051
Neg logit contributions
 Mir-0.808
 Citation-0.808
hots-0.780
exec-0.729
camp-0.725
Tokengment
Token ID5154
Feature activation-2.98e+00

Pos logit contributions
Tam+0.048
paralleled+0.047
lear+0.046
+0.046
Compare+0.046
Neg logit contributions
 Mir-0.033
 Citation-0.033
hots-0.031
exec-0.030
camp-0.029
Tokenation
Token ID341
Feature activation-3.40e+00

Pos logit contributions
 Mir+1.788
 Citation+1.780
hots+1.677
exec+1.605
camp+1.573
Neg logit contributions
Tam-2.506
paralleled-2.471
lear-2.412
-2.384
Compare-2.373
Token fault
Token ID8046
Feature activation-4.20e-01

Pos logit contributions
 Mir+1.858
 Citation+1.798
hots+1.686
exec+1.610
camp+1.589
Neg logit contributions
Tam-3.067
paralleled-3.042
lear-2.980
��-2.940
-2.938
Token.
Token ID13
Feature activation-1.40e-01

Pos logit contributions
 Mir+0.283
 Citation+0.282
hots+0.271
exec+0.258
camp+0.255
Neg logit contributions
Tam-0.324
paralleled-0.317
-0.311
lear-0.308
Compare-0.307
Token 4
Token ID604
Feature activation-2.32e+00

Pos logit contributions
 Mir+0.094
 Citation+0.094
hots+0.090
exec+0.086
camp+0.085
Neg logit contributions
Tam-0.108
paralleled-0.106
-0.103
lear-0.103
Compare-0.102
Token 5
Token ID642
Feature activation-2.08e+00

Pos logit contributions
 Mir+1.597
 Citation+1.587
hots+1.523
exec+1.462
camp+1.423
Neg logit contributions
Tam-1.757
paralleled-1.729
-1.684
lear-1.673
��-1.658
Token (
Token ID357
Feature activation+4.33e-01

Pos logit contributions
 Mir+1.368
 Citation+1.362
hots+1.309
exec+1.257
camp+1.218
Neg logit contributions
Tam-1.643
paralleled-1.605
-1.570
lear-1.561
��-1.557
Token the
Token ID262
Feature activation-1.68e-01

Pos logit contributions
Tam+0.488
paralleled+0.475
lear+0.467
Compare+0.464
+0.464
Neg logit contributions
 Mir-0.382
 Citation-0.378
hots-0.365
exec-0.343
camp-0.340
Token potential
Token ID2785
Feature activation+4.23e-01

Pos logit contributions
 Mir+0.097
 Citation+0.095
hots+0.091
exec+0.085
camp+0.084
Neg logit contributions
Tam-0.146
paralleled-0.143
-0.141
lear-0.141
��-0.140
Token or
Token ID393
Feature activation-1.79e+00

Pos logit contributions
Tam+0.343
paralleled+0.337
lear+0.329
+0.327
Compare+0.327
Neg logit contributions
 Mir-0.266
 Citation-0.266
hots-0.256
exec-0.242
camp-0.239
Token 3
Token ID513
Feature activation-7.44e-01

Pos logit contributions
 Mir+1.112
 Citation+1.091
hots+1.068
exec+0.991
camp+0.988
Neg logit contributions
Tam-1.480
paralleled-1.449
-1.412
Compare-1.410
��-1.410
Token har
Token ID3971
Feature activation-2.13e-01

Pos logit contributions
 Mir+0.496
 Citation+0.491
hots+0.478
camp+0.448
exec+0.446
Neg logit contributions
Tam-0.577
paralleled-0.565
lear-0.556
-0.554
Compare-0.551
Tokenv
Token ID85
Feature activation-3.05e+00

Pos logit contributions
 Mir+0.156
 Citation+0.155
hots+0.151
exec+0.144
camp+0.143
Neg logit contributions
Tam-0.150
paralleled-0.147
-0.144
��-0.143
Compare-0.142
Tokenests
Token ID3558
Feature activation-4.71e-01

Pos logit contributions
 Mir+2.386
 Citation+2.340
hots+2.294
exec+2.201
camp+2.183
Neg logit contributions
Tam-1.979
paralleled-1.940
��-1.938
-1.887
Compare-1.872
Token a
Token ID257
Feature activation+7.83e-02

Pos logit contributions
 Mir+0.294
 Citation+0.291
hots+0.284
exec+0.264
camp+0.262
Neg logit contributions
Tam-0.386
paralleled-0.378
lear-0.371
-0.370
��-0.368
Token year
Token ID614
Feature activation+1.95e+00

Pos logit contributions
Tam+0.053
paralleled+0.052
lear+0.051
+0.051
Compare+0.051
Neg logit contributions
 Mir-0.059
 Citation-0.059
hots-0.058
exec-0.055
camp-0.054
Token,
Token ID11
Feature activation-3.00e-01

Pos logit contributions
Tam+1.532
paralleled+1.501
Compare+1.460
lear+1.460
+1.457
Neg logit contributions
 Citation-1.276
 Mir-1.275
hots-1.229
exec-1.167
camp-1.152
Token unlike
Token ID5023
Feature activation-8.05e-01

Pos logit contributions
 Mir+0.193
 Citation+0.192
hots+0.185
exec+0.175
camp+0.173
Neg logit contributions
Tam-0.239
paralleled-0.235
lear-0.229
-0.228
Compare-0.228
Token Share
Token ID8734
Feature activation-1.83e+00

Pos logit contributions
Tam+1.047
paralleled+1.014
Compare+1.002
lear+0.985
+0.979
Neg logit contributions
 Mir-0.990
 Citation-0.979
hots-0.968
exec-0.918
ancial-0.916
Token this
Token ID428
Feature activation-5.73e-01

Pos logit contributions
 Mir+1.074
 Citation+1.063
hots+1.011
exec+0.961
camp+0.949
Neg logit contributions
Tam-1.572
paralleled-1.548
��-1.526
-1.519
lear-1.510
Token comment
Token ID2912
Feature activation-6.68e-01

Pos logit contributions
 Mir+0.225
 Citation+0.221
hots+0.204
exec+0.190
camp+0.185
Neg logit contributions
Tam-0.605
paralleled-0.598
��-0.588
-0.586
lear-0.585
Token on
Token ID319
Feature activation-7.24e-01

Pos logit contributions
 Mir+0.401
 Citation+0.399
hots+0.384
exec+0.361
camp+0.358
Neg logit contributions
Tam-0.563
paralleled-0.553
lear-0.539
-0.538
Compare-0.537
Token Facebook
Token ID3203
Feature activation-8.36e-01

Pos logit contributions
 Mir+0.528
 Citation+0.523
hots+0.494
exec+0.477
camp+0.475
Neg logit contributions
Tam-0.523
paralleled-0.519
��-0.511
lear-0.500
-0.497
Token Share
Token ID8734
Feature activation-3.74e+00

Pos logit contributions
 Mir+0.531
 Citation+0.529
hots+0.495
exec+0.469
camp+0.468
Neg logit contributions
Tam-0.682
paralleled-0.679
��-0.662
lear-0.661
-0.656
Token this
Token ID428
Feature activation-3.75e-01

Pos logit contributions
 Mir+2.030
 Citation+1.999
hots+1.859
exec+1.802
camp+1.775
Neg logit contributions
��-3.402
Tam-3.374
paralleled-3.340
 millenn-3.309
-3.278
Token comment
Token ID2912
Feature activation-5.97e-01

Pos logit contributions
 Mir+0.149
 Citation+0.147
hots+0.136
exec+0.126
camp+0.124
Neg logit contributions
Tam-0.394
paralleled-0.389
��-0.383
-0.382
lear-0.381
Token on
Token ID319
Feature activation-5.92e-02

Pos logit contributions
 Mir+0.357
 Citation+0.355
hots+0.342
exec+0.322
camp+0.319
Neg logit contributions
Tam-0.504
paralleled-0.495
lear-0.483
-0.483
��-0.482
Token Twitter
Token ID3009
Feature activation+3.68e-01

Pos logit contributions
 Mir+0.036
 Citation+0.035
hots+0.034
exec+0.032
camp+0.032
Neg logit contributions
Tam-0.050
paralleled-0.049
lear-0.048
-0.048
Compare-0.048
Token
Token ID198
Feature activation-1.48e+00

Pos logit contributions
Tam+0.301
paralleled+0.294
lear+0.288
Compare+0.287
+0.287
Neg logit contributions
 Mir-0.227
 Citation-0.227
hots-0.220
exec-0.207
camp-0.205
Token say
Token ID910
Feature activation-3.78e-01

Pos logit contributions
 Mir+0.301
 Citation+0.298
hots+0.289
exec+0.270
camp+0.268
Neg logit contributions
Tam-0.435
paralleled-0.426
-0.416
Compare-0.416
��-0.415
Token they
Token ID484
Feature activation+1.41e+00

Pos logit contributions
 Mir+0.203
 Citation+0.202
hots+0.194
exec+0.181
camp+0.179
Neg logit contributions
Tam-0.341
paralleled-0.335
lear-0.328
-0.327
Compare-0.327
Token received
Token ID2722
Feature activation-3.98e-01

Pos logit contributions
Tam+1.213
paralleled+1.189
lear+1.163
+1.160
Compare+1.160
Neg logit contributions
 Mir-0.813
 Citation-0.811
hots-0.780
exec-0.731
camp-0.723
Token notices
Token ID19748
Feature activation+3.28e-01

Pos logit contributions
 Mir+0.254
 Citation+0.252
hots+0.243
exec+0.228
camp+0.226
Neg logit contributions
Tam-0.320
paralleled-0.315
lear-0.307
-0.306
Compare-0.306
Token from
Token ID422
Feature activation+3.15e-01

Pos logit contributions
Tam+0.269
paralleled+0.264
lear+0.258
+0.257
Compare+0.257
Neg logit contributions
 Citation-0.202
 Mir-0.202
hots-0.195
exec-0.183
camp-0.181
Token Tinder
Token ID44043
Feature activation-3.07e+00

Pos logit contributions
Tam+0.235
paralleled+0.231
lear+0.225
+0.224
Compare+0.224
Neg logit contributions
 Mir-0.220
 Citation-0.219
hots-0.212
exec-0.201
camp-0.199
Token saying
Token ID2282
Feature activation-1.05e+00

Pos logit contributions
 Mir+1.857
 Citation+1.820
hots+1.764
exec+1.656
camp+1.644
Neg logit contributions
Tam-2.566
paralleled-2.546
lear-2.480
��-2.480
Compare-2.456
Token the
Token ID262
Feature activation-5.38e-01

Pos logit contributions
 Mir+0.575
 Citation+0.567
hots+0.544
exec+0.505
camp+0.503
Neg logit contributions
Tam-0.935
paralleled-0.925
lear-0.902
Compare-0.900
-0.899
Token were
Token ID547
Feature activation+6.87e-01

Pos logit contributions
 Mir+0.356
 Citation+0.352
hots+0.340
exec+0.321
camp+0.319
Neg logit contributions
Tam-0.422
paralleled-0.415
lear-0.405
-0.402
��-0.402
Token unable
Token ID5906
Feature activation-3.37e-01

Pos logit contributions
Tam+0.576
paralleled+0.566
lear+0.553
+0.551
Compare+0.551
Neg logit contributions
 Mir-0.413
 Citation-0.411
hots-0.397
exec-0.373
camp-0.369
Token to
Token ID284
Feature activation+9.24e-01

Pos logit contributions
 Mir+0.204
 Citation+0.203
hots+0.196
exec+0.185
camp+0.183
Neg logit contributions
Tam-0.281
paralleled-0.275
lear-0.269
-0.268
Compare-0.268
Token Jen
Token ID13364
Feature activation+2.54e-01

Pos logit contributions
 Mir+0.102
 Citation+0.101
hots+0.097
exec+0.093
camp+0.091
Neg logit contributions
Tam-0.120
paralleled-0.118
Compare-0.115
lear-0.115
-0.115
Token was
Token ID373
Feature activation+8.65e-01

Pos logit contributions
Tam+0.203
paralleled+0.199
lear+0.194
Compare+0.194
+0.194
Neg logit contributions
 Mir-0.164
 Citation-0.162
hots-0.156
exec-0.148
camp-0.146
Token pregnant
Token ID10423
Feature activation-1.95e+00

Pos logit contributions
Tam+0.830
paralleled+0.816
lear+0.799
+0.798
Compare+0.798
Neg logit contributions
 Mir-0.418
 Citation-0.415
hots-0.396
exec-0.367
camp-0.361
Token.
Token ID13
Feature activation-6.63e-01

Pos logit contributions
 Mir+1.285
 Citation+1.260
hots+1.213
camp+1.144
exec+1.143
Neg logit contributions
Tam-1.525
paralleled-1.505
lear-1.470
-1.465
Compare-1.462
Token Flynn
Token ID14654
Feature activation-2.17e+00

Pos logit contributions
 Mir+0.432
 Citation+0.424
hots+0.403
exec+0.382
camp+0.381
Neg logit contributions
Tam-0.529
paralleled-0.526
��-0.515
lear-0.514
-0.507
Token and
Token ID290
Feature activation-4.35e+00

Pos logit contributions
 Mir+1.335
 Citation+1.311
hots+1.257
exec+1.191
camp+1.182
Neg logit contributions
Tam-1.814
paralleled-1.774
��-1.740
-1.726
lear-1.725
Token his
Token ID465
Feature activation-1.46e+00

Pos logit contributions
 Mir+2.898
 Citation+2.733
hots+2.592
exec+2.542
 Lar+2.486
Neg logit contributions
Tam-3.432
paralleled-3.370
��-3.315
Compare-3.303
lear-3.296
Token wife
Token ID3656
Feature activation-3.07e+00

Pos logit contributions
 Mir+0.915
 Citation+0.901
hots+0.859
exec+0.821
camp+0.808
Neg logit contributions
Tam-1.207
paralleled-1.184
lear-1.159
Compare-1.155
-1.151
Token specifically
Token ID5734
Feature activation-1.71e+00

Pos logit contributions
 Mir+1.886
 Citation+1.818
hots+1.733
exec+1.656
camp+1.644
Neg logit contributions
Tam-2.562
paralleled-2.537
��-2.480
Compare-2.474
-2.470
Token decided
Token ID3066
Feature activation-4.95e-01

Pos logit contributions
 Mir+1.024
 Citation+1.003
hots+0.975
camp+0.917
exec+0.913
Neg logit contributions
Tam-1.470
paralleled-1.428
��-1.405
-1.398
Compare-1.394
Token to
Token ID284
Feature activation-3.18e-01

Pos logit contributions
 Mir+0.303
 Citation+0.299
hots+0.288
exec+0.271
camp+0.270
Neg logit contributions
Tam-0.412
paralleled-0.405
lear-0.395
-0.394
Compare-0.394