TOP ACTIVATIONS
MAX = 17.394

The Giant Squid <|endoftext|> Brazil ’ s largest party abandoned
! <|endoftext|> President - elect Barack Obama is currently mull ing
was not having Jeffrey Lord ’ s comparison .
amounts of snow via powerful Nor ’ e asters that
1 , it seems the United States Conference of Catholic B
The H unch back of Notre Dame , Nightmare on Elm
almost here – and we ’ ve got an exciting
the video clip above , Major Nelson recently sat down with
of snow via powerful Nor ’ e asters that make
about being a lesbian <|endoftext|> Well , the scores still aren
<|endoftext|> What Wall Street doesn ’ t want us to
The sushi you eat isn ’ t fresh
will step down , President Barack Obama announced Monday
More and more observers , however , believe that excessive speculation
Res isting violations of the Supreme Court Orders on Aadhaar
senior fellow at the leftist Southern Poverty Law Center , ass
stood for Parliament . <|endoftext|> Earlier today , the first Ebola
So , after almost a week of researching ,
<|endoftext|> Charlie Lee , the much celebrated creator of the silver
T ether s Unlimited Inc ’ s ( T UI

MIN ACTIVATIONS
MIN = -16.302

bar bar ians . It is
Commerce Control List . Yes , we
Taliban . On Sunday , I think
Action Now and we ’ ll send you three meaningful
never captured . It ’ s not real
bill last month . We certainly need to work
for Medical Progress . Pl anned Parenthood is answering
, I ’ m a product of yellow
often overlooked . She ’ s has her own main
each year ( author ’ s analysis of Current Population
ist 's disagreement with During these two ch or
. They ’ re occupying a similar space
. We ’ ve got this investment [
make projects . I ’ ve contributed regularly to Arduino
. 4 billion . M . K . and
. I ’ ve h iked in rain
352 in 1996 . What B . C .
before time zero . I cannot do it ,
ens y . I ’ m happy to say my
that category ( author ’ s analysis of Current Population

INTERVAL 8.970 - 17.394
CONTAINS 1.458%

s European edition . The Norwegian Police Security Service informed that
for Japan as he and Prime Minister Shinzo Abe issued a
UN ITED N ATIONS ( Reuters ) - An army battalion
in babys itting Nearly everyone will have to mend
. <|endoftext|> New Delhi : India ` s two

INTERVAL 0.546 - 8.970
CONTAINS 42.207%

, but also as a military av iator .
I was doing 180 knots , and I eased off on
but is sometimes also used as an umbrella term that encompasses
Read more by Justin R aim ondo <|endoftext|> lic hess .
great hike , though , and the weather was perfect but

INTERVAL -7.878 - 0.546
CONTAINS 53.901%

in such a state of dis repair , its value is
points . Co os County is not an isolated
is unstoppable , the Qu ist campaign said in
World War II veteran to death . Del bert
In the second case , the court found consultation with the

INTERVAL -16.302 - -7.878
CONTAINS 2.434%

tailored to an individual ’ s needs . One only
especially , because they ’ re black and female .
that the store properties primary use is as retail
also have allowed multiple touchdown passes in four of their past
wood ed path that ’ s home to an impressive
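
The four INTERVAL ranges above are consistent with equal-width bins between MIN and MAX: (17.394 - (-16.302)) / 4 = 8.424 per bin. The sketch below assumes that binning scheme and reproduces the printed edges. The function name is hypothetical and is not taken from the tool that produced this dump.

```python
def interval_edges(lo, hi, n_bins=4):
    """Return n_bins + 1 equally spaced edges from lo to hi, rounded to 3 decimals."""
    width = (hi - lo) / n_bins
    return [round(lo + i * width, 3) for i in range(n_bins + 1)]


# MIN and MAX as printed in the dump.
edges = interval_edges(-16.302, 17.394)

# Pair consecutive edges and list the highest interval first, as in the dump.
intervals = list(zip(edges[:-1], edges[1:]))[::-1]
for low, high in intervals:
    print(f"INTERVAL {low:.3f} - {high:.3f}")

# The CONTAINS percentages, highest interval first, should cover all samples.
shares = [1.458, 42.207, 53.901, 2.434]
print(f"total share: {sum(shares):.3f}%")
```

If the bins were chosen by quantile instead, the edges would not be evenly spaced and the shares would be roughly equal, which is not what the listing shows.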
Token The
Token ID383
Feature activation+4.01e+00

Pos logit contributions
 we+5.734
 accuses+5.234
 they+5.198
 he+5.182
 albeit+5.160
Neg logit contributions
idon-8.056
igate-8.035
agate-7.741
VERTISEMENT-7.696
OUT-7.668
Token Giant
Token ID17384
Feature activation+9.31e+00

Pos logit contributions
 Temper+1.895
 citiz+1.869
 we+1.864
 albeit+1.843
 accuses+1.820
Neg logit contributions
igate-4.020
VERTISEMENT-3.911
agate-3.853
idon-3.744
OUT-3.742
Token Squid
Token ID48799
Feature activation+7.11e+00

Pos logit contributions
 Temper+4.172
 Tw+4.156
 we+4.092
 albeit+4.012
 accuses+3.959
Neg logit contributions
igate-8.939
VERTISEMENT-8.858
agate-8.779
idon-8.647
OUT-8.597
Token<|endoftext|>
Token ID50256
Feature activation+3.74e+00

Pos logit contributions
 Temper+3.846
 accuses+3.840
 we+3.798
 rises+3.683
 albeit+3.631
Neg logit contributions
igate-6.325
VERTISEMENT-6.256
agate-6.128
OUT-5.939
TABLE-5.914
TokenBrazil
Token ID39190
Feature activation+1.35e+01

Pos logit contributions
 we+2.760
 accuses+2.743
 albeit+2.737
 Temper+2.647
 he+2.610
Neg logit contributions
igate-2.619
idon-2.545
VERTISEMENT-2.542
agate-2.423
-2.244
Token’ (bytes 1-2 of 3)
Token ID447
Feature activation+1.63e+01

Pos logit contributions
 accuses+6.765
 we+6.056
 joins+5.990
 citiz+5.874
 Temper+5.789
Neg logit contributions
VERTISEMENT-12.088
igate-11.924
tnc-11.622
idon-11.375
OUT-11.263
Token’ (byte 3 of 3)
Token ID247
Feature activation-4.12e+00

Pos logit contributions
 citiz+9.326
 we+9.135
 accuses+9.043
 he+8.726
 ›+8.692
Neg logit contributions
idon-11.664
igate-11.190
VERTISEMENT-10.775
OUT-10.277
tnc-10.025
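
The two preceding entries (Token ID447 and Token ID247) have no printable token string. One plausible explanation is that the vocabulary is a byte-level BPE in which these two IDs together encode the right single quotation mark (U+2019), so each piece is an incomplete UTF-8 sequence and cannot be rendered alone. The sketch below demonstrates only the byte-level behaviour in plain Python. The 2 + 1 byte split is an assumption, not something read from a tokenizer, and `render` is a hypothetical helper.

```python
apostrophe = "\u2019"  # right single quotation mark, as in "Brazil’s"
raw = apostrophe.encode("utf-8")
print(raw)  # three bytes: e2 80 99

# Assumed split: the first token carries two bytes, the second the last one.
first, second = raw[:2], raw[2:]


def render(piece: bytes) -> str:
    """Decode a token's bytes; return '' when they are not valid UTF-8 alone."""
    try:
        return piece.decode("utf-8")
    except UnicodeDecodeError:
        return ""


# Neither piece is printable by itself, but the pair decodes cleanly.
print(repr(render(first)), repr(render(second)))
print(render(first + second))
```

This would also explain why the example lines show a gap where a curly apostrophe belongs, while a straight apostrophe (as in "ist 's") is displayed normally.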
Tokens
Token ID82
Feature activation+2.75e+00

Pos logit contributions
igate+4.099
VERTISEMENT+3.986
idon+3.970
agate+3.909
OUT+3.851
Neg logit contributions
 accuses-2.240
 citiz-2.224
 albeit-2.127
 Temper-2.103
 we-2.055
Token largest
Token ID4387
Feature activation+1.39e+01

Pos logit contributions
 citiz+1.341
 accuses+1.286
 Temper+1.248
 albeit+1.240
 we+1.215
Neg logit contributions
igate-2.831
VERTISEMENT-2.779
idon-2.762
agate-2.691
OUT-2.669
Token party
Token ID2151
Feature activation+4.23e+00

Pos logit contributions
 sporting+6.789
 lingu+6.315
 accuses+6.273
 albeit+6.272
 we+6.270
Neg logit contributions
VERTISEMENT-12.778
igate-12.727
agate-12.425
idon-12.411
OUT-11.947
Token abandoned
Token ID9958
Feature activation+4.26e+00

Pos logit contributions
 accuses+2.122
 citiz+1.894
 we+1.853
 Temper+1.807
 albeit+1.807
Neg logit contributions
igate-4.387
VERTISEMENT-4.346
idon-4.255
agate-4.223
tnc-4.172
Token!
Token ID0
Feature activation+2.69e+00

Pos logit contributions
 citiz+0.044
 albeit+0.044
 we+0.043
 accuses+0.042
 though+0.042
Neg logit contributions
VERTISEMENT-0.091
igate-0.091
idon-0.089
agate-0.088
OUT-0.086
Token<|endoftext|>
Token ID50256
Feature activation+4.38e+00

Pos logit contributions
 we+1.443
 Temper+1.431
 albeit+1.386
 accuses+1.366
 Tw+1.341
Neg logit contributions
igate-2.562
VERTISEMENT-2.552
idon-2.550
agate-2.502
OUT-2.456
TokenPresident
Token ID10364
Feature activation+1.31e+01

Pos logit contributions
 we+3.220
 albeit+3.190
 accuses+3.187
 Temper+3.087
 he+3.045
Neg logit contributions
igate-3.042
VERTISEMENT-2.954
idon-2.952
agate-2.815
-2.598
Token-
Token ID12
Feature activation+5.28e+00

Pos logit contributions
 accuses+4.574
 we+4.218
 albeit+4.149
 Tw+3.994
 he+3.988
Neg logit contributions
VERTISEMENT-13.344
igate-13.261
tnc-12.892
idon-12.884
TABLE-12.862
Tokenelect
Token ID9509
Feature activation+7.22e+00

Pos logit contributions
 albeit+3.424
 we+3.288
 accuses+3.275
 citiz+3.223
 he+3.100
Neg logit contributions
igate-4.245
VERTISEMENT-4.160
idon-4.146
OUT-4.016
agate-3.944
Token Barack
Token ID8732
Feature activation+1.62e+01

Pos logit contributions
 accuses+1.030
 we+0.896
 albeit+0.888
 Tw+0.694
 though+0.651
Neg logit contributions
VERTISEMENT-9.579
igate-9.517
idon-9.320
agate-9.300
tnc-9.293
Token Obama
Token ID2486
Feature activation+6.51e+00

Pos logit contributions
 albeit+4.458
 we+4.281
 accuses+4.276
 Temper+3.965
 he+3.898
Neg logit contributions
VERTISEMENT-18.569
igate-17.924
idon-17.816
agate-17.345
-17.310
Token is
Token ID318
Feature activation+5.77e-01

Pos logit contributions
 accuses+3.072
 albeit+2.786
 we+2.755
 joins+2.617
 rises+2.617
Neg logit contributions
igate-6.799
VERTISEMENT-6.794
idon-6.613
agate-6.601
TABLE-6.536
Token currently
Token ID3058
Feature activation+7.86e+00

Pos logit contributions
 accuses+0.293
 albeit+0.288
 we+0.279
 citiz+0.269
 sporting+0.265
Neg logit contributions
VERTISEMENT-0.589
igate-0.583
idon-0.582
agate-0.565
OUT-0.550
Token mull
Token ID35024
Feature activation+4.95e+00

Pos logit contributions
 accuses+3.719
 albeit+3.708
 we+3.687
 sporting+3.625
 he+3.467
Neg logit contributions
idon-7.793
VERTISEMENT-7.775
igate-7.662
agate-7.539
TABLE-7.296
Tokening
Token ID278
Feature activation-1.93e+00

Pos logit contributions
 citiz+3.870
 accuses+3.853
 albeit+3.807
 Temper+3.707
 we+3.640
Neg logit contributions
igate-3.533
VERTISEMENT-3.483
idon-3.399
OUT-3.331
agate-3.329
Token was
Token ID373
Feature activation+2.30e+00

Pos logit contributions
 accuses+4.999
 we+4.488
 joins+4.421
 Temper+4.329
 albeit+4.317
Neg logit contributions
VERTISEMENT-8.382
igate-8.355
idon-8.141
agate-8.083
tnc-7.884
Token not
Token ID407
Feature activation+6.22e+00

Pos logit contributions
 accuses+1.259
 albeit+1.237
 we+1.234
 citiz+1.199
 sporting+1.182
Neg logit contributions
VERTISEMENT-2.229
idon-2.202
igate-2.192
agate-2.113
OUT-2.065
Token having
Token ID1719
Feature activation+1.53e+00

Pos logit contributions
 we+3.495
 accuses+3.437
 albeit+3.410
 sporting+3.373
 though+3.342
Neg logit contributions
VERTISEMENT-5.811
idon-5.682
agate-5.607
igate-5.601
tnc-5.356
Token Jeffrey
Token ID19627
Feature activation+1.00e+01

Pos logit contributions
 accuses+0.838
 albeit+0.837
 we+0.820
 though+0.819
 citiz+0.798
Neg logit contributions
VERTISEMENT-1.483
agate-1.464
idon-1.460
igate-1.446
OUT-1.405
Token Lord
Token ID4453
Feature activation+1.27e+01

Pos logit contributions
 we+4.567
 Temper+4.560
 accuses+4.548
 Tw+4.491
 he+4.414
Neg logit contributions
VERTISEMENT-9.721
igate-9.580
idon-9.468
tnc-9.241
agate-9.148
Token’ (bytes 1-2 of 3)
Token ID447
Feature activation+1.60e+01

Pos logit contributions
 we+6.687
 he+6.476
 albeit+6.432
 though+6.390
 Tw+6.319
Neg logit contributions
idon-10.808
VERTISEMENT-10.652
igate-10.453
tnc-10.275
OUT-10.252
Token’ (byte 3 of 3)
Token ID247
Feature activation-3.68e+00

Pos logit contributions
 citiz+8.794
 we+8.686
 Tw+8.564
 though+8.337
 they+8.334
Neg logit contributions
idon-11.904
igate-11.399
VERTISEMENT-10.989
tnc-10.939
OUT-10.718
Tokens
Token ID82
Feature activation+4.70e+00

Pos logit contributions
igate+3.501
idon+3.464
VERTISEMENT+3.451
agate+3.354
OUT+3.352
Neg logit contributions
 accuses-2.087
 citiz-2.053
 albeit-2.043
 we-1.979
 Temper-1.976
Token comparison
Token ID7208
Feature activation+8.30e+00

Pos logit contributions
 accuses+2.456
 albeit+2.426
 we+2.420
 Tw+2.371
 Temper+2.343
Neg logit contributions
VERTISEMENT-4.543
idon-4.525
igate-4.472
agate-4.329
OUT-4.307
Token.
Token ID13
Feature activation+8.37e+00

Pos logit contributions
 we+4.458
 though+4.360
 albeit+4.322
 accuses+4.317
 he+4.268
Neg logit contributions
idon-7.592
igate-7.540
VERTISEMENT-7.535
agate-7.435
OUT-7.275
Token\n
Token ID198
Feature activation+2.83e+00

Pos logit contributions
 Temper+3.901
 Tw+3.746
 we+3.640
 he+3.625
 accuses+3.557
Neg logit contributions
idon-8.281
igate-8.133
agate-8.119
VERTISEMENT-8.024
ability-7.752
Token amounts
Token ID6867
Feature activation+5.88e+00

Pos logit contributions
 we+4.518
 albeit+4.497
 Temper+4.403
 amounts+4.394
 accuses+4.371
Neg logit contributions
igate-8.374
agate-8.169
VERTISEMENT-8.112
idon-7.988
OUT-7.977
Token of
Token ID286
Feature activation+2.18e+00

Pos logit contributions
 we+1.129
 albeit+0.964
 Temper+0.950
 citiz+0.939
 accuses+0.926
Neg logit contributions
igate-7.510
VERTISEMENT-7.475
OUT-7.415
agate-7.414
TABLE-7.242
Token snow
Token ID6729
Feature activation+6.80e+00

Pos logit contributions
 albeit+1.159
 Temper+1.154
 accuses+1.153
 we+1.153
 citiz+1.133
Neg logit contributions
igate-2.115
VERTISEMENT-2.104
agate-2.044
idon-2.042
OUT-2.010
Token via
Token ID2884
Feature activation+3.25e+00

Pos logit contributions
 we+3.459
 albeit+3.396
 though+3.305
 Temper+3.256
 accuses+3.208
Neg logit contributions
VERTISEMENT-6.440
igate-6.354
agate-6.338
TABLE-6.333
OUT-6.255
Token powerful
Token ID3665
Feature activation+8.85e+00

Pos logit contributions
 we+1.860
 Temper+1.830
 albeit+1.827
 accuses+1.803
 rises+1.780
Neg logit contributions
igate-2.988
VERTISEMENT-2.909
idon-2.882
agate-2.875
TABLE-2.821
Token Nor
Token ID5414
Feature activation+1.59e+01

Pos logit contributions
 albeit+4.464
 we+4.406
 rises+4.323
 Temper+4.295
 accuses+4.274
Neg logit contributions
igate-8.671
VERTISEMENT-8.323
agate-8.266
TABLE-8.082
idon-8.075
Token’ (bytes 1-2 of 3)
Token ID447
Feature activation+1.57e+01

Pos logit contributions
 we+10.468
 Temper+10.098
 accuses+9.974
 albeit+9.905
 though+9.671
Neg logit contributions
VERTISEMENT-10.195
TABLE-9.719
OUT-9.590
-9.538
igate-9.435
Token’ (byte 3 of 3)
Token ID247
Feature activation-2.80e+00

Pos logit contributions
 we+9.988
 citiz+9.816
 they+9.592
 he+9.521
 albeit+9.393
Neg logit contributions
igate-9.720
idon-9.445
OUT-9.202
VERTISEMENT-9.098
agate-8.569
Tokene
Token ID68
Feature activation+4.43e+00

Pos logit contributions
igate+2.322
VERTISEMENT+2.272
idon+2.260
OUT+2.238
agate+2.210
Neg logit contributions
 accuses-1.888
 citiz-1.887
 albeit-1.880
 Temper-1.870
 we-1.837
Tokenasters
Token ID7060
Feature activation+7.62e+00

Pos logit contributions
 Temper+3.284
 we+3.279
 accuses+3.244
 albeit+3.229
 citiz+3.180
Neg logit contributions
igate-3.112
VERTISEMENT-3.034
OUT-3.031
TABLE-2.935
idon-2.912
Token that
Token ID326
Feature activation+8.39e+00

Pos logit contributions
 Temper+4.248
 we+4.205
 albeit+4.068
 though+3.929
 accuses+3.891
Neg logit contributions
VERTISEMENT-6.790
igate-6.713
agate-6.603
OUT-6.586
TABLE-6.546
Token 1
Token ID352
Feature activation+3.59e+00

Pos logit contributions
 accuses+5.644
 we+5.601
 albeit+5.521
 though+5.273
 they+5.094
Neg logit contributions
VERTISEMENT-12.701
igate-12.391
agate-12.364
idon-12.183
TABLE-11.835
Token,
Token ID11
Feature activation+8.89e+00

Pos logit contributions
 we+1.647
 accuses+1.589
 albeit+1.578
 citiz+1.519
 though+1.501
Neg logit contributions
igate-3.766
VERTISEMENT-3.757
agate-3.638
idon-3.632
OUT-3.593
Token it
Token ID340
Feature activation+3.49e+00

Pos logit contributions
 we+4.836
 albeit+4.584
 accuses+4.535
 though+4.469
 they+4.352
Neg logit contributions
VERTISEMENT-8.488
idon-8.182
agate-8.166
igate-8.149
OUT-7.848
Token seems
Token ID2331
Feature activation+8.08e+00

Pos logit contributions
 accuses+1.246
 we+1.143
 albeit+1.111
 rises+1.058
 joins+1.013
Neg logit contributions
VERTISEMENT-4.137
igate-4.121
idon-4.059
OUT-3.970
agate-3.961
Token the
Token ID262
Feature activation+3.71e+00

Pos logit contributions
 we+4.879
 they+4.462
 albeit+4.368
 he+4.363
 though+4.318
Neg logit contributions
VERTISEMENT-7.492
igate-7.273
idon-7.148
agate-7.147
uments-6.837
Token United
Token ID1578
Feature activation+1.58e+01

Pos logit contributions
 albeit+1.696
 accuses+1.675
 we+1.650
 Temper+1.642
 citiz+1.641
Neg logit contributions
VERTISEMENT-3.835
igate-3.829
idon-3.784
agate-3.740
OUT-3.632
Token States
Token ID1829
Feature activation+8.09e+00

Pos logit contributions
 we+4.044
 citiz+3.713
 Temper+3.605
 accuses+3.482
 Airlines+3.313
Neg logit contributions
VERTISEMENT-16.923
igate-16.438
OUT-16.434
FAULT-16.397
tnc-16.166
Token Conference
Token ID8785
Feature activation+8.16e+00

Pos logit contributions
 accuses+3.409
 we+3.273
 albeit+3.208
 citiz+3.162
 though+2.943
Neg logit contributions
VERTISEMENT-8.545
igate-8.537
idon-8.431
agate-8.365
TABLE-8.241
Token of
Token ID286
Feature activation+3.06e+00

Pos logit contributions
 accuses+2.051
 citiz+1.950
 we+1.839
 albeit+1.638
 Tw+1.582
Neg logit contributions
VERTISEMENT-10.003
igate-9.809
idon-9.636
OUT-9.526
agate-9.439
Token Catholic
Token ID7835
Feature activation+1.04e+01

Pos logit contributions
 citiz+1.089
 Temper+1.065
 accuses+1.036
 albeit+1.029
 we+1.006
Neg logit contributions
igate-3.556
idon-3.527
VERTISEMENT-3.478
OUT-3.420
agate-3.385
Token B
Token ID347
Feature activation+3.52e+00

Pos logit contributions
 Temper+3.366
 accuses+3.004
 albeit+2.963
 we+2.894
 Econom+2.680
Neg logit contributions
igate-12.341
idon-12.009
agate-11.996
VERTISEMENT-11.927
ability-11.645
Token The
Token ID383
Feature activation+3.91e+00

Pos logit contributions
 albeit+3.801
 we+3.703
 Tw+3.640
 Temper+3.598
 accuses+3.446
Neg logit contributions
VERTISEMENT-8.319
igate-8.239
idon-8.023
agate-7.979
TABLE-7.920
Token H
Token ID367
Feature activation+2.59e+00

Pos logit contributions
 Temper+1.780
 albeit+1.692
 Tw+1.657
 we+1.628
 accuses+1.607
Neg logit contributions
igate-4.153
VERTISEMENT-4.016
agate-3.891
OUT-3.883
idon-3.880
Tokenunch
Token ID3316
Feature activation+8.65e+00

Pos logit contributions
 accuses+1.828
 we+1.810
 albeit+1.781
 citiz+1.777
 Temper+1.748
Neg logit contributions
igate-1.963
VERTISEMENT-1.953
agate-1.851
tnc-1.816
OUT-1.800
Tokenback
Token ID1891
Feature activation+7.93e+00

Pos logit contributions
 we+7.545
 accuses+7.507
 albeit+7.371
 citiz+7.327
 though+7.240
Neg logit contributions
VERTISEMENT-5.392
igate-5.054
agate-4.977
VICE-4.565
OUT-4.543
Token of
Token ID286
Feature activation+3.28e+00

Pos logit contributions
 we+1.824
 accuses+1.814
 Temper+1.752
 citiz+1.679
 albeit+1.656
Neg logit contributions
VERTISEMENT-9.997
idon-9.801
agate-9.776
igate-9.728
OUT-9.559
Token Notre
Token ID23382
Feature activation+1.58e+01

Pos logit contributions
 citiz+1.533
 accuses+1.489
 albeit+1.443
 Temper+1.399
 we+1.338
Neg logit contributions
igate-3.429
VERTISEMENT-3.378
idon-3.327
agate-3.293
OUT-3.283
Token Dame
Token ID20377
Feature activation+6.59e+00

Pos logit contributions
 Tw+4.854
 Bart+4.733
 Anglic+4.559
 we+4.527
 Temper+4.494
Neg logit contributions
VERTISEMENT-17.230
tnc-16.680
ULE-16.530
-16.523
OUT-16.371
Token,
Token ID11
Feature activation+8.53e+00

Pos logit contributions
 we+2.373
 albeit+2.275
 accuses+2.273
 rises+2.197
 sporting+2.191
Neg logit contributions
VERTISEMENT-7.097
igate-7.003
idon-6.961
agate-6.952
tnc-6.733
Token Nightmare
Token ID23951
Feature activation+1.11e+01

Pos logit contributions
 albeit+3.887
 we+3.839
 Tw+3.736
 Temper+3.682
 accuses+3.533
Neg logit contributions
VERTISEMENT-8.302
igate-8.254
idon-8.056
agate-8.002
TABLE-7.931
Token on
Token ID319
Feature activation+2.66e+00

Pos logit contributions
 we+4.677
 accuses+4.453
 Temper+4.388
 Tw+4.373
 albeit+4.316
Neg logit contributions
tnc-10.856
igate-10.819
OUT-10.819
TABLE-10.726
VICE-10.663
Token Elm
Token ID35189
Feature activation+9.77e+00

Pos logit contributions
 citiz+0.185
 accuses+0.164
 albeit+0.159
 Temper+0.158
 we+0.147
Neg logit contributions
VERTISEMENT-3.760
igate-3.757
agate-3.715
idon-3.694
uments-3.679
Token almost
Token ID2048
Feature activation+1.25e+01

Pos logit contributions
 albeit+0.171
 accuses+0.169
 we+0.168
 citiz+0.167
 Temper+0.166
Neg logit contributions
igate-0.256
VERTISEMENT-0.255
idon-0.251
agate-0.247
tnc-0.238
Token here
Token ID994
Feature activation+6.50e+00

Pos logit contributions
 we+6.678
 albeit+6.306
 though+6.143
 accuses+6.088
 they+6.061
Neg logit contributions
VERTISEMENT-10.837
agate-10.685
igate-10.502
idon-10.439
tnc-10.388
Token –
Token ID784
Feature activation+8.33e+00

Pos logit contributions
 albeit+3.321
 though+3.284
 we+3.276
 they+2.941
 accuses+2.929
Neg logit contributions
VERTISEMENT-6.331
igate-6.247
tnc-6.113
agate-6.056
OUT-6.052
Token and
Token ID290
Feature activation+1.18e+01

Pos logit contributions
 albeit+4.420
 we+4.316
 though+4.038
 accuses+3.797
 they+3.794
Neg logit contributions
VERTISEMENT-8.135
igate-7.951
idon-7.831
agate-7.755
tnc-7.685
Token we
Token ID356
Feature activation+5.94e+00

Pos logit contributions
 we+6.507
 albeit+5.920
 though+5.822
 they+5.744
 accuses+5.653
Neg logit contributions
VERTISEMENT-11.012
igate-10.837
tnc-10.669
agate-10.580
idon-10.556
Token’ (bytes 1-2 of 3)
Token ID447
Feature activation+1.58e+01

Pos logit contributions
 we+3.254
 albeit+3.192
 though+3.062
 accuses+3.055
 they+2.932
Neg logit contributions
VERTISEMENT-5.768
igate-5.473
agate-5.408
idon-5.365
tnc-5.343
Token’ (byte 3 of 3)
Token ID247
Feature activation-5.49e+00

Pos logit contributions
 we+9.538
 albeit+9.320
 they+9.064
 he+9.003
 though+8.925
Neg logit contributions
idon-11.487
igate-11.128
VERTISEMENT-10.921
tnc-10.363
ability-10.265
Tokenve
Token ID303
Feature activation+1.88e+00

Pos logit contributions
igate+4.479
VERTISEMENT+4.449
idon+4.344
agate+4.265
OUT+4.195
Neg logit contributions
 citiz-3.790
 albeit-3.787
 accuses-3.784
 we-3.691
 Temper-3.638
Token got
Token ID1392
Feature activation+2.44e-01

Pos logit contributions
 we+1.093
 albeit+1.065
 accuses+1.049
 though+1.006
 citiz+1.003
Neg logit contributions
VERTISEMENT-1.748
igate-1.709
idon-1.687
agate-1.685
OUT-1.632
Token an
Token ID281
Feature activation+2.92e+00

Pos logit contributions
 we+0.135
 accuses+0.133
 albeit+0.131
 citiz+0.130
 Temper+0.128
Neg logit contributions
VERTISEMENT-0.236
igate-0.232
idon-0.228
agate-0.227
tnc-0.221
Token exciting
Token ID7895
Feature activation+8.57e+00

Pos logit contributions
 albeit+1.416
 accuses+1.366
 we+1.354
 citiz+1.319
 Temper+1.295
Neg logit contributions
igate-2.955
VERTISEMENT-2.931
idon-2.886
agate-2.846
tnc-2.801
Token the
Token ID262
Feature activation+3.44e+00

Pos logit contributions
 we+0.568
 citiz+0.550
 albeit+0.542
 accuses+0.539
 Temper+0.531
Neg logit contributions
idon-0.854
VERTISEMENT-0.852
igate-0.847
agate-0.826
OUT-0.821
Token video
Token ID2008
Feature activation+2.89e+00

Pos logit contributions
 albeit+1.702
 we+1.679
 citiz+1.671
 accuses+1.643
 Temper+1.597
Neg logit contributions
igate-3.425
agate-3.312
idon-3.311
VERTISEMENT-3.289
OUT-3.251
Token clip
Token ID10651
Feature activation+3.53e+00

Pos logit contributions
 we+1.562
 albeit+1.474
 citiz+1.455
 though+1.434
 accuses+1.419
Neg logit contributions
igate-2.839
VERTISEMENT-2.815
idon-2.805
agate-2.757
OUT-2.717
Token above
Token ID2029
Feature activation+6.30e+00

Pos logit contributions
 we+1.967
 albeit+1.860
 citiz+1.789
 though+1.789
 accuses+1.770
Neg logit contributions
igate-3.425
VERTISEMENT-3.377
idon-3.310
agate-3.292
OUT-3.241
Token,
Token ID11
Feature activation+1.06e+01

Pos logit contributions
 we+2.821
 albeit+2.500
 though+2.454
 citiz+2.359
 they+2.340
Neg logit contributions
igate-6.680
VERTISEMENT-6.673
agate-6.540
idon-6.489
OUT-6.445
Token Major
Token ID8386
Feature activation+1.57e+01

Pos logit contributions
 we+6.074
 albeit+5.471
 though+5.407
 they+5.258
 Temper+5.064
Neg logit contributions
igate-9.344
VERTISEMENT-9.342
idon-9.288
agate-9.266
OUT-9.120
Token Nelson
Token ID12996
Feature activation+1.05e+01

Pos logit contributions
 we+7.209
 accuses+6.694
 though+6.521
 citiz+6.380
 albeit+6.352
Neg logit contributions
VERTISEMENT-14.838
igate-13.778
-13.542
tnc-13.478
OUT-13.348
Token recently
Token ID2904
Feature activation+7.65e+00

Pos logit contributions
 accuses+5.226
 we+5.182
 joins+4.975
 albeit+4.974
 citiz+4.711
Neg logit contributions
igate-9.770
VERTISEMENT-9.739
agate-9.437
OUT-9.265
idon-9.248
Token sat
Token ID3332
Feature activation+3.78e+00

Pos logit contributions
 we+3.657
 accuses+3.590
 albeit+3.576
 joins+3.378
 though+3.300
Neg logit contributions
igate-7.721
VERTISEMENT-7.620
agate-7.394
OUT-7.335
idon-7.306
Token down
Token ID866
Feature activation+3.91e+00

Pos logit contributions
 we+2.412
 albeit+2.370
 citiz+2.306
 accuses+2.281
 he+2.251
Neg logit contributions
VERTISEMENT-3.241
igate-3.084
agate-3.041
idon-3.039
OUT-3.005
Token with
Token ID351
Feature activation+1.91e+00

Pos logit contributions
 albeit+1.743
 we+1.740
 citiz+1.682
 though+1.567
 accuses+1.545
Neg logit contributions
VERTISEMENT-4.227
igate-4.076
idon-4.044
OUT-3.969
agate-3.965
Token of
Token ID286
Feature activation+2.18e+00

Pos logit contributions
 we+1.129
 albeit+0.964
 Temper+0.950
 citiz+0.939
 accuses+0.926
Neg logit contributions
igate-7.510
VERTISEMENT-7.475
OUT-7.415
agate-7.414
TABLE-7.242
Token snow
Token ID6729
Feature activation+6.80e+00

Pos logit contributions
 albeit+1.159
 Temper+1.154
 accuses+1.153
 we+1.153
 citiz+1.133
Neg logit contributions
igate-2.115
VERTISEMENT-2.104
agate-2.044
idon-2.042
OUT-2.010
Token via
Token ID2884
Feature activation+3.25e+00

Pos logit contributions
 we+3.459
 albeit+3.396
 though+3.305
 Temper+3.256
 accuses+3.208
Neg logit contributions
VERTISEMENT-6.440
igate-6.354
agate-6.338
TABLE-6.333
OUT-6.255
Token powerful
Token ID3665
Feature activation+8.85e+00

Pos logit contributions
 we+1.860
 Temper+1.830
 albeit+1.827
 accuses+1.803
 rises+1.780
Neg logit contributions
igate-2.988
VERTISEMENT-2.909
idon-2.882
agate-2.875
TABLE-2.821
Token Nor
Token ID5414
Feature activation+1.59e+01

Pos logit contributions
 albeit+4.464
 we+4.406
 rises+4.323
 Temper+4.295
 accuses+4.274
Neg logit contributions
igate-8.671
VERTISEMENT-8.323
agate-8.266
TABLE-8.082
idon-8.075
Token’ (bytes 1-2 of 3)
Token ID447
Feature activation+1.57e+01

Pos logit contributions
 we+10.468
 Temper+10.098
 accuses+9.974
 albeit+9.905
 though+9.671
Neg logit contributions
VERTISEMENT-10.195
TABLE-9.719
OUT-9.590
-9.538
igate-9.435
Token’ (byte 3 of 3)
Token ID247
Feature activation-2.80e+00

Pos logit contributions
 we+9.988
 citiz+9.816
 they+9.592
 he+9.521
 albeit+9.393
Neg logit contributions
igate-9.720
idon-9.445
OUT-9.202
VERTISEMENT-9.098
agate-8.569
Tokene
Token ID68
Feature activation+4.43e+00

Pos logit contributions
igate+2.322
VERTISEMENT+2.272
idon+2.260
OUT+2.238
agate+2.210
Neg logit contributions
 accuses-1.888
 citiz-1.887
 albeit-1.880
 Temper-1.870
 we-1.837
Tokenasters
Token ID7060
Feature activation+7.62e+00

Pos logit contributions
 Temper+3.284
 we+3.279
 accuses+3.244
 albeit+3.229
 citiz+3.180
Neg logit contributions
igate-3.112
VERTISEMENT-3.034
OUT-3.031
TABLE-2.935
idon-2.912
Token that
Token ID326
Feature activation+8.39e+00

Pos logit contributions
 Temper+4.248
 we+4.205
 albeit+4.068
 though+3.929
 accuses+3.891
Neg logit contributions
VERTISEMENT-6.790
igate-6.713
agate-6.603
OUT-6.586
TABLE-6.546
Token make
Token ID787
Feature activation+2.46e+00

Pos logit contributions
 we+4.436
 Temper+4.152
 accuses+4.097
 albeit+4.087
 they+4.036
Neg logit contributions
VERTISEMENT-7.956
idon-7.853
igate-7.779
TABLE-7.619
agate-7.602
Token about
Token ID546
Feature activation+3.11e+00

Pos logit contributions
 we+1.579
 accuses+1.434
 citiz+1.426
 they+1.412
 albeit+1.400
Neg logit contributions
VERTISEMENT-3.501
igate-3.482
idon-3.479
agate-3.433
OUT-3.316
Token being
Token ID852
Feature activation-3.25e+00

Pos logit contributions
 we+1.563
 citiz+1.532
 accuses+1.527
 albeit+1.510
 Brit+1.497
Neg logit contributions
VERTISEMENT-3.085
igate-3.053
agate-2.989
idon-2.973
OUT-2.903
Token a
Token ID257
Feature activation-1.91e+00

Pos logit contributions
VERTISEMENT+3.210
igate+3.202
idon+3.163
agate+3.128
OUT+3.095
Neg logit contributions
 accuses-1.724
 citiz-1.721
 albeit-1.687
 we-1.644
 Temper-1.614
Token lesbian
Token ID17834
Feature activation+6.23e+00

Pos logit contributions
igate+2.086
VERTISEMENT+2.039
idon+2.024
agate+2.004
OUT+1.982
Neg logit contributions
 citiz-0.854
 accuses-0.818
 albeit-0.818
 we-0.789
 Temper-0.784
Token<|endoftext|>
Token ID50256
Feature activation+4.03e+00

Pos logit contributions
 teen+3.130
 we+3.120
 accuses+3.047
 albeit+3.046
 citiz+3.037
Neg logit contributions
igate-5.949
idon-5.916
VERTISEMENT-5.885
agate-5.810
tnc-5.705
TokenWell
Token ID5779
Feature activation+1.55e+01

Pos logit contributions
 we+2.974
 accuses+2.955
 albeit+2.952
 Temper+2.853
 he+2.813
Neg logit contributions
igate-2.803
idon-2.728
VERTISEMENT-2.726
agate-2.595
uments-2.395
Token,
Token ID11
Feature activation+1.23e+01

Pos logit contributions
 we+8.743
 they+8.142
 he+8.080
 though+7.893
 she+7.719
Neg logit contributions
igate-12.310
VERTISEMENT-12.221
tnc-11.881
idon-11.671
TABLE-11.506
Token the
Token ID262
Feature activation+6.92e+00

Pos logit contributions
 we+6.807
 albeit+6.314
 they+6.221
 he+6.163
 though+6.120
Neg logit contributions
VERTISEMENT-11.253
igate-11.065
idon-10.594
agate-10.526
tnc-10.499
Token scores
Token ID8198
Feature activation+7.84e+00

Pos logit contributions
 we+3.443
 Temper+3.441
 albeit+3.435
 citiz+3.351
 accuses+3.348
Neg logit contributions
VERTISEMENT-6.612
igate-6.609
idon-6.369
agate-6.315
OUT-6.176
Token still
Token ID991
Feature activation+1.11e+01

Pos logit contributions
 we+3.749
 accuses+3.543
 albeit+3.479
 they+3.456
 though+3.427
Neg logit contributions
VERTISEMENT-8.051
igate-7.722
idon-7.496
agate-7.462
tnc-7.391
Token aren
Token ID3588
Feature activation+5.32e+00

Pos logit contributions
 we+5.132
 albeit+4.925
 accuses+4.841
 though+4.824
 he+4.765
Neg logit contributions
VERTISEMENT-11.066
idon-10.744
TABLE-10.512
Deploy-10.439
tnc-10.425
Token<|endoftext|>
Token ID50256
Feature activation+2.08e+00

Pos logit contributions
 we+0.777
 Temper+0.770
 albeit+0.754
 accuses+0.746
 citiz+0.737
Neg logit contributions
igate-1.360
idon-1.343
VERTISEMENT-1.331
agate-1.315
OUT-1.276
TokenWhat
Token ID2061
Feature activation+1.19e+01

Pos logit contributions
 accuses+1.553
 we+1.550
 albeit+1.547
 Temper+1.499
 though+1.467
Neg logit contributions
igate-1.478
VERTISEMENT-1.438
idon-1.431
agate-1.369
-1.265
Token Wall
Token ID5007
Feature activation+1.07e+01

Pos logit contributions
 we+5.949
 accuses+5.388
 they+5.332
 he+5.212
 though+5.147
Neg logit contributions
idon-11.084
VERTISEMENT-11.038
igate-10.898
OUT-10.716
-10.696
Token Street
Token ID3530
Feature activation+8.08e+00

Pos logit contributions
 accuses+5.131
 we+4.808
 albeit+4.488
 he+4.487
 citiz+4.454
Neg logit contributions
VERTISEMENT-10.106
igate-9.775
idon-9.739
-9.462
tnc-9.457
Token doesn
Token ID1595
Feature activation+7.12e+00

Pos logit contributions
 accuses+3.898
 we+3.562
 albeit+3.464
 Temper+3.439
 economists+3.423
Neg logit contributions
igate-8.083
VERTISEMENT-7.961
idon-7.909
agate-7.712
OUT-7.708
Token’ (bytes 1-2 of 3)
Token ID447
Feature activation+1.54e+01

Pos logit contributions
 accuses+6.445
 albeit+6.431
 we+6.383
 Temper+6.281
 though+6.181
Neg logit contributions
VERTISEMENT-3.951
igate-3.866
idon-3.824
agate-3.645
OUT-3.534
Token’ (byte 3 of 3)
Token ID247
Feature activation-5.22e+00

Pos logit contributions
 we+9.533
 albeit+9.394
 though+9.200
 they+9.197
 he+9.178
Neg logit contributions
idon-10.834
igate-10.564
VERTISEMENT-10.341
tnc-9.678
agate-9.648
Tokent
Token ID83
Feature activation+1.54e+00

Pos logit contributions
igate+5.113
VERTISEMENT+5.039
idon+4.996
agate+4.887
OUT+4.847
Neg logit contributions
 accuses-2.858
 citiz-2.824
 albeit-2.770
 we-2.674
 Temper-2.650
Token want
Token ID765
Feature activation+2.62e+00

Pos logit contributions
 accuses+0.762
 albeit+0.759
 we+0.752
 citiz+0.734
 Temper+0.726
Neg logit contributions
igate-1.541
VERTISEMENT-1.527
idon-1.517
agate-1.480
OUT-1.471
Token us
Token ID514
Feature activation+2.54e+00

Pos logit contributions
 we+1.274
 accuses+1.265
 citiz+1.204
 albeit+1.197
 though+1.188
Neg logit contributions
VERTISEMENT-2.673
igate-2.660
idon-2.622
agate-2.610
OUT-2.523
Token to
Token ID284
Feature activation+1.64e-01

Pos logit contributions
 accuses+1.427
 citiz+1.414
 we+1.414
 albeit+1.366
 though+1.347
Neg logit contributions
VERTISEMENT-2.407
igate-2.390
idon-2.354
agate-2.270
OUT-2.257
TokenThe
Token ID464
Feature activation+4.99e+00

Pos logit contributions
 we+1.965
 accuses+1.962
 albeit+1.959
 Temper+1.894
 though+1.857
Neg logit contributions
igate-1.874
VERTISEMENT-1.826
idon-1.817
agate-1.734
-1.603
Token sushi
Token ID36324
Feature activation+3.95e+00

Pos logit contributions
 Temper+2.403
 citiz+2.380
 albeit+2.358
 we+2.338
 accuses+2.307
Neg logit contributions
VERTISEMENT-4.924
igate-4.915
idon-4.723
OUT-4.648
agate-4.648
Token you
Token ID345
Feature activation+4.58e+00

Pos logit contributions
 we+2.009
 accuses+1.930
 albeit+1.898
 he+1.858
 they+1.839
Neg logit contributions
igate-3.999
VERTISEMENT-3.933
idon-3.767
tnc-3.723
OUT-3.694
Token eat
Token ID4483
Feature activation-7.50e-01

Pos logit contributions
 we+2.396
 accuses+2.349
 albeit+2.308
 though+2.268
 he+2.198
Neg logit contributions
VERTISEMENT-4.524
igate-4.319
idon-4.284
OUT-4.268
BACK-4.224
Token isn
Token ID2125
Feature activation+4.22e+00

Pos logit contributions
igate+0.798
VERTISEMENT+0.783
idon+0.773
agate+0.766
OUT+0.760
Neg logit contributions
 accuses-0.349
 albeit-0.337
 citiz-0.329
 we-0.327
 though-0.315
Token’ (bytes 1-2 of 3)
Token ID447
Feature activation+1.54e+01

Pos logit contributions
 accuses+4.210
 albeit+4.167
 citiz+4.160
 Temper+4.100
 we+4.053
Neg logit contributions
igate-2.073
VERTISEMENT-2.036
idon-1.970
agate-1.899
OUT-1.872
Token
Token ID247
Feature activation-5.96e+00

Pos logit contributions
 we+8.982
 citiz+8.667
 they+8.586
 he+8.574
 Tw+8.416
Neg logit contributions
idon-10.753
igate-10.700
VERTISEMENT-10.263
agate-9.989
OUT-9.658
Tokent
Token ID83
Feature activation+1.44e+00

Pos logit contributions
igate+5.176
VERTISEMENT+5.052
idon+5.031
agate+4.937
OUT+4.871
Neg logit contributions
 citiz-3.888
 accuses-3.868
 albeit-3.846
 we-3.739
 Temper-3.699
Token fresh
Token ID4713
Feature activation+3.10e+00

Pos logit contributions
 we+0.899
 accuses+0.880
 albeit+0.880
 citiz+0.870
 Temper+0.847
Neg logit contributions
VERTISEMENT-1.265
igate-1.253
idon-1.237
agate-1.217
tnc-1.191
Token
Token ID198
Feature activation+7.12e-01

Pos logit contributions
 albeit+1.621
 though+1.613
 we+1.601
 accuses+1.544
 citiz+1.515
Neg logit contributions
igate-2.997
VERTISEMENT-2.946
idon-2.900
agate-2.872
BACK-2.814
Token
Token ID198
Feature activation+1.14e+00

Pos logit contributions
 albeit+0.397
 accuses+0.390
 we+0.390
 Temper+0.390
 though+0.373
Neg logit contributions
igate-0.663
VERTISEMENT-0.654
idon-0.651
agate-0.632
OUT-0.615
Token will
Token ID481
Feature activation+1.30e+00

Pos logit contributions
 accuses+1.616
 we+1.402
 albeit+1.399
 joins+1.346
 rises+1.341
Neg logit contributions
VERTISEMENT-3.394
igate-3.336
idon-3.297
agate-3.283
OUT-3.258
Token step
Token ID2239
Feature activation+2.10e+00

Pos logit contributions
 accuses+0.701
 albeit+0.694
 we+0.689
 though+0.651
 he+0.648
Neg logit contributions
VERTISEMENT-1.279
OUT-1.221
idon-1.217
igate-1.215
agate-1.196
Token down
Token ID866
Feature activation+4.07e+00

Pos logit contributions
 accuses+1.386
 albeit+1.342
 we+1.330
 though+1.315
 he+1.299
Neg logit contributions
VERTISEMENT-1.769
idon-1.685
OUT-1.649
TABLE-1.644
igate-1.641
Token,
Token ID11
Feature activation+9.92e+00

Pos logit contributions
 accuses+1.956
 albeit+1.925
 we+1.878
 though+1.824
 he+1.793
Neg logit contributions
VERTISEMENT-4.159
igate-4.085
idon-4.057
agate-3.994
OUT-3.967
Token President
Token ID1992
Feature activation+1.03e+01

Pos logit contributions
 accuses+5.169
 albeit+4.973
 we+4.922
 though+4.816
 he+4.793
Neg logit contributions
VERTISEMENT-9.623
idon-9.360
agate-9.268
igate-9.194
OUT-9.024
Token Barack
Token ID8732
Feature activation+1.53e+01

Pos logit contributions
 accuses+2.772
 we+2.239
 albeit+2.212
 he+2.165
 Bart+2.042
Neg logit contributions
VERTISEMENT-12.386
igate-12.080
idon-11.987
tnc-11.965
agate-11.783
Token Obama
Token ID2486
Feature activation+5.17e+00

Pos logit contributions
 albeit+3.856
 accuses+3.689
 we+3.659
 Temper+3.381
 he+3.326
Neg logit contributions
VERTISEMENT-18.247
igate-17.555
idon-17.402
agate-17.159
-16.821
Token announced
Token ID3414
Feature activation-2.41e-01

Pos logit contributions
 accuses+1.849
 albeit+1.516
 we+1.492
 rises+1.430
 joins+1.423
Neg logit contributions
igate-6.102
VERTISEMENT-6.098
idon-6.025
agate-6.011
tnc-5.879
Token Monday
Token ID3321
Feature activation+1.01e+01

Pos logit contributions
VERTISEMENT+0.265
igate+0.262
idon+0.261
agate+0.257
OUT+0.253
Neg logit contributions
 accuses-0.102
 albeit-0.097
 citiz-0.095
 we-0.094
 Temper-0.092
Token
Token ID198
Feature activation-4.18e-01

Pos logit contributions
 accuses+4.278
 he+4.166
 though+4.104
 we+4.102
 albeit+4.009
Neg logit contributions
VERTISEMENT-10.202
tnc-9.982
igate-9.902
idon-9.800
agate-9.724
Token
Token ID198
Feature activation-1.68e-01

Pos logit contributions
igate+0.321
OUT+0.305
VERTISEMENT+0.304
idon+0.302
agate+0.298
Neg logit contributions
 citiz-0.352
 accuses-0.309
 mathemat-0.308
 ende-0.306
etheless-0.299
Token More
Token ID3125
Feature activation+8.40e+00

Pos logit contributions
 Temper+2.923
 accuses+2.742
 we+2.727
 albeit+2.725
 Tw+2.625
Neg logit contributions
igate-6.019
VERTISEMENT-6.013
idon-5.901
agate-5.884
OUT-5.670
Token and
Token ID290
Feature activation+1.04e+01

Pos logit contributions
 we+3.935
 accuses+3.924
 importantly+3.886
 economists+3.727
 albeit+3.696
Neg logit contributions
VERTISEMENT-8.296
igate-8.100
agate-8.078
idon-7.939
OUT-7.800
Token more
Token ID517
Feature activation+7.78e+00

Pos logit contributions
 albeit+6.687
 accuses+6.683
 we+6.664
 Temper+6.422
 though+6.334
Neg logit contributions
igate-8.408
VERTISEMENT-8.386
idon-8.273
agate-8.144
OUT-8.023
Token observers
Token ID17984
Feature activation+5.86e+00

Pos logit contributions
 we+3.318
 accuses+3.041
 economists+3.006
 though+2.942
 albeit+2.921
Neg logit contributions
igate-8.412
VERTISEMENT-8.383
agate-8.302
OUT-8.144
idon-8.113
Token,
Token ID11
Feature activation+1.11e+01

Pos logit contributions
 accuses+2.534
 we+2.420
 albeit+2.329
 though+2.237
 he+2.177
Neg logit contributions
VERTISEMENT-6.490
idon-6.165
igate-6.161
OUT-6.135
tnc-6.032
Token however
Token ID2158
Feature activation+1.52e+01

Pos logit contributions
 albeit+5.807
 though+5.668
 we+5.615
 economists+5.414
 accuses+5.331
Neg logit contributions
VERTISEMENT-10.867
igate-10.538
agate-10.363
idon-10.294
OUT-10.170
Token,
Token ID11
Feature activation+9.60e+00

Pos logit contributions
 we+5.767
 accuses+5.583
 albeit+5.508
 though+5.399
 they+5.331
Neg logit contributions
VERTISEMENT-15.839
igate-15.630
idon-15.447
agate-15.305
OUT-15.173
Token believe
Token ID1975
Feature activation+5.33e+00

Pos logit contributions
 accuses+4.032
 we+3.863
 albeit+3.736
 though+3.567
 they+3.529
Neg logit contributions
VERTISEMENT-10.467
igate-9.889
OUT-9.813
idon-9.809
TABLE-9.780
Token that
Token ID326
Feature activation+5.73e+00

Pos logit contributions
 we+2.972
 they+2.755
 albeit+2.676
 he+2.654
 accuses+2.592
Neg logit contributions
VERTISEMENT-5.214
igate-5.082
idon-4.961
agate-4.958
OUT-4.864
Token excessive
Token ID13181
Feature activation+9.15e+00

Pos logit contributions
 we+3.337
 albeit+3.163
 accuses+3.109
 they+3.060
 though+3.039
Neg logit contributions
igate-5.323
VERTISEMENT-5.316
idon-5.208
agate-5.180
OUT-5.045
Token speculation
Token ID13367
Feature activation+3.20e+00

Pos logit contributions
 albeit+4.357
 we+4.326
 rises+4.324
 accuses+4.104
 though+4.048
Neg logit contributions
igate-8.950
VERTISEMENT-8.823
agate-8.648
idon-8.573
TABLE-8.419
TokenRes
Token ID4965
Feature activation+6.13e+00

Pos logit contributions
 we+3.240
 accuses+3.208
 albeit+3.208
 Temper+3.104
 he+3.062
Neg logit contributions
igate-3.051
idon-2.961
VERTISEMENT-2.959
agate-2.815
-2.595
Tokenisting
Token ID9665
Feature activation+6.08e+00

Pos logit contributions
 accuses+4.737
 Temper+4.682
 we+4.610
 albeit+4.514
 citiz+4.488
Neg logit contributions
VERTISEMENT-4.093
OUT-3.689
FAULT-3.654
igate-3.572
BACK-3.567
Token violations
Token ID11734
Feature activation+4.94e+00

Pos logit contributions
 accuses+3.375
 Temper+3.311
 we+3.232
 citiz+3.167
 albeit+3.144
Neg logit contributions
VERTISEMENT-5.459
idon-5.358
igate-5.352
agate-5.281
OUT-5.272
Token of
Token ID286
Feature activation+1.14e+00

Pos logit contributions
 accuses+2.257
 we+2.113
 albeit+2.067
 Temper+1.986
 though+1.968
Neg logit contributions
VERTISEMENT-5.229
igate-5.162
idon-5.130
agate-5.020
OUT-4.966
Token the
Token ID262
Feature activation+4.07e+00

Pos logit contributions
 citiz+0.592
 albeit+0.578
 we+0.578
 accuses+0.574
 Temper+0.574
Neg logit contributions
igate-1.128
VERTISEMENT-1.097
idon-1.094
agate-1.082
OUT-1.056
Token Supreme
Token ID5617
Feature activation+1.52e+01

Pos logit contributions
 Temper+1.915
 citiz+1.848
 albeit+1.846
 we+1.839
 accuses+1.799
Neg logit contributions
igate-4.195
VERTISEMENT-4.118
idon-4.062
agate-3.997
OUT-3.906
Token Court
Token ID3078
Feature activation+7.24e+00

Pos logit contributions
 Temper+4.540
 we+4.320
 accuses+4.121
 albeit+4.118
 Tw+4.106
Neg logit contributions
VERTISEMENT-17.121
idon-16.681
igate-16.391
OUT-16.342
agate-16.309
Token Orders
Token ID30689
Feature activation+3.33e+00

Pos logit contributions
 accuses+2.717
 we+2.687
 albeit+2.535
 Temper+2.523
 they+2.427
Neg logit contributions
idon-7.882
VERTISEMENT-7.702
igate-7.623
OUT-7.563
TABLE-7.483
Token on
Token ID319
Feature activation+3.95e+00

Pos logit contributions
 accuses+1.770
 we+1.702
 albeit+1.701
 citiz+1.663
 Temper+1.652
Neg logit contributions
VERTISEMENT-3.261
igate-3.239
idon-3.201
agate-3.137
OUT-3.110
Token Aadhaar
Token ID47727
Feature activation+8.94e+00

Pos logit contributions
 Temper+1.906
 accuses+1.803
 we+1.801
 albeit+1.780
 citiz+1.750
Neg logit contributions
idon-3.972
VERTISEMENT-3.968
igate-3.940
agate-3.879
OUT-3.747
Token
Token ID198
Feature activation+2.06e+00

Pos logit contributions
 accuses+4.447
 we+4.365
 albeit+4.214
 citiz+4.173
 though+4.074
Neg logit contributions
igate-8.546
VERTISEMENT-8.406
idon-8.387
agate-8.093
OUT-8.062
Token senior
Token ID4664
Feature activation+1.07e+01

Pos logit contributions
 citiz+0.862
 accuses+0.849
 lingu+0.833
 albeit+0.825
 Temper+0.821
Neg logit contributions
igate-2.306
VERTISEMENT-2.297
idon-2.262
agate-2.247
OUT-2.222
Token fellow
Token ID5891
Feature activation+8.87e+00

Pos logit contributions
 citiz+4.181
 accuses+4.170
 we+4.134
 lingu+4.116
 albeit+3.973
Neg logit contributions
VERTISEMENT-10.971
idon-10.919
igate-10.771
agate-10.733
OUT-10.668
Token at
Token ID379
Feature activation+4.42e+00

Pos logit contributions
 we+3.354
 accuses+3.310
 albeit+3.278
 lingu+3.124
 he+3.090
Neg logit contributions
VERTISEMENT-9.566
idon-9.218
OUT-9.173
igate-9.113
agate-9.025
Token the
Token ID262
Feature activation+3.82e+00

Pos logit contributions
 we+2.388
 accuses+2.342
 Temper+2.326
 citiz+2.320
 Tw+2.314
Neg logit contributions
VERTISEMENT-4.078
idon-4.031
igate-3.996
agate-3.903
OUT-3.854
Token leftist
Token ID28318
Feature activation+1.00e+01

Pos logit contributions
 Temper+1.423
 albeit+1.402
 accuses+1.388
 we+1.381
 Tw+1.354
Neg logit contributions
VERTISEMENT-4.279
igate-4.202
idon-4.165
OUT-4.078
agate-4.058
Token Southern
Token ID8050
Feature activation+1.52e+01

Pos logit contributions
 Temper+3.632
 we+3.578
 accuses+3.468
 Tw+3.425
 lingu+3.331
Neg logit contributions
VERTISEMENT-10.823
idon-10.635
igate-10.592
tnc-10.176
agate-10.148
Token Poverty
Token ID30627
Feature activation+7.93e+00

Pos logit contributions
 Temper+3.897
 Young+3.554
 Liter+3.523
 we+3.446
 Ari+3.262
Neg logit contributions
idon-17.088
igate-17.041
VERTISEMENT-16.852
agate-16.754
LESS-16.726
Token Law
Token ID3854
Feature activation+3.86e+00

Pos logit contributions
 accuses+3.355
 Temper+3.336
 albeit+3.329
 we+3.222
 citiz+3.163
Neg logit contributions
VERTISEMENT-8.424
igate-8.361
idon-8.166
agate-8.074
OUT-7.790
Token Center
Token ID3337
Feature activation+4.68e+00

Pos logit contributions
 citiz+1.934
 accuses+1.920
 albeit+1.864
 Temper+1.814
 we+1.746
Neg logit contributions
igate-3.853
VERTISEMENT-3.806
idon-3.753
agate-3.689
OUT-3.674
Token,
Token ID11
Feature activation+9.21e+00

Pos logit contributions
 accuses+1.973
 we+1.732
 though+1.660
 albeit+1.654
 Temper+1.578
Neg logit contributions
VERTISEMENT-5.223
igate-5.062
tnc-5.026
idon-5.019
OUT-4.959
Token ass
Token ID840
Feature activation+6.04e+00

Pos logit contributions
 accuses+4.277
 albeit+3.435
 we+3.431
 though+3.341
 joins+3.294
Neg logit contributions
VERTISEMENT-9.978
igate-9.630
idon-9.584
tnc-9.453
OUT-9.338
Token stood
Token ID6204
Feature activation-1.54e-01

Pos logit contributions
 accuses+0.820
 albeit+0.808
 we+0.794
 though+0.776
 citiz+0.771
Neg logit contributions
VERTISEMENT-1.309
igate-1.287
idon-1.248
OUT-1.247
agate-1.245
Token for
Token ID329
Feature activation-2.80e+00

Pos logit contributions
VERTISEMENT+0.139
igate+0.138
idon+0.133
OUT+0.131
TABLE+0.131
Neg logit contributions
 accuses-0.095
 albeit-0.094
 citiz-0.093
 we-0.090
 though-0.088
Token Parliament
Token ID8411
Feature activation-6.55e-01

Pos logit contributions
VERTISEMENT+2.789
igate+2.780
idon+2.728
agate+2.677
OUT+2.669
Neg logit contributions
 citiz-1.493
 albeit-1.458
 accuses-1.458
 we-1.398
 Temper-1.397
Token.
Token ID13
Feature activation+2.52e+00

Pos logit contributions
VERTISEMENT+0.671
igate+0.665
idon+0.650
OUT+0.641
agate+0.637
Neg logit contributions
 albeit-0.328
 citiz-0.322
 accuses-0.320
 though-0.313
 we-0.313
Token<|endoftext|>
Token ID50256
Feature activation+2.27e+00

Pos logit contributions
 Temper+1.197
 albeit+1.188
 we+1.185
 accuses+1.177
 Tw+1.138
Neg logit contributions
igate-2.522
VERTISEMENT-2.506
idon-2.483
agate-2.425
OUT-2.393
TokenEarlier
Token ID13689
Feature activation+1.51e+01

Pos logit contributions
 accuses+1.699
 we+1.698
 albeit+1.695
 Temper+1.643
 though+1.608
Neg logit contributions
igate-1.608
VERTISEMENT-1.565
idon-1.560
agate-1.489
uments-1.376
Token today
Token ID1909
Feature activation+1.13e+01

Pos logit contributions
 we+6.978
 he+6.063
 though+6.030
 they+6.002
 accuses+5.781
Neg logit contributions
VERTISEMENT-13.887
idon-13.710
OUT-13.508
igate-13.508
tnc-13.468
Token,
Token ID11
Feature activation+1.17e+01

Pos logit contributions
 we+5.443
 though+4.720
 they+4.599
 he+4.509
 albeit+4.438
Neg logit contributions
VERTISEMENT-11.007
igate-10.894
idon-10.686
tnc-10.594
agate-10.515
Token the
Token ID262
Feature activation+5.69e+00

Pos logit contributions
 we+6.481
 albeit+5.785
 though+5.756
 Temper+5.735
 they+5.692
Neg logit contributions
VERTISEMENT-10.669
igate-10.372
idon-10.262
agate-9.995
OUT-9.898
Token first
Token ID717
Feature activation+1.07e+01

Pos logit contributions
 Temper+2.552
 albeit+2.528
 we+2.523
 citiz+2.479
 accuses+2.451
Neg logit contributions
VERTISEMENT-5.881
igate-5.769
idon-5.632
OUT-5.504
agate-5.479
Token Ebola
Token ID15825
Feature activation+6.10e+00

Pos logit contributions
 we+5.287
 albeit+5.142
 accuses+5.128
 Temper+4.964
 teen+4.896
Neg logit contributions
VERTISEMENT-9.780
igate-9.764
idon-9.747
agate-9.588
OUT-9.269
Token
Token ID198
Feature activation+1.64e+00

Pos logit contributions
 we+2.755
 albeit+2.732
 Temper+2.684
 accuses+2.675
 though+2.577
Neg logit contributions
igate-5.187
VERTISEMENT-5.081
idon-5.072
agate-5.035
OUT-4.808
Token
Token ID198
Feature activation+2.02e+00

Pos logit contributions
 citiz+1.115
 accuses+1.083
 albeit+1.058
 Temper+1.035
 we+0.999
Neg logit contributions
igate-1.382
VERTISEMENT-1.349
idon-1.329
OUT-1.307
agate-1.306
TokenSo
Token ID2396
Feature activation+1.20e+01

Pos logit contributions
 we+1.487
 albeit+1.477
 accuses+1.474
 Temper+1.444
 though+1.394
Neg logit contributions
igate-1.466
VERTISEMENT-1.437
idon-1.429
agate-1.365
uments-1.271
Token,
Token ID11
Feature activation+1.21e+01

Pos logit contributions
 we+6.656
 they+5.929
 accuses+5.909
 albeit+5.887
 though+5.879
Neg logit contributions
VERTISEMENT-10.595
igate-10.476
idon-10.218
tnc-9.978
OUT-9.927
Token after
Token ID706
Feature activation+9.40e+00

Pos logit contributions
 we+6.622
 albeit+6.165
 though+5.946
 accuses+5.861
 they+5.852
Neg logit contributions
VERTISEMENT-11.193
igate-10.910
idon-10.545
agate-10.370
OUT-10.322
Token almost
Token ID2048
Feature activation+1.51e+01

Pos logit contributions
 we+5.436
 albeit+5.166
 they+4.926
 accuses+4.913
 he+4.888
Neg logit contributions
VERTISEMENT-8.694
igate-8.411
idon-8.129
agate-8.023
TABLE-7.977
Token a
Token ID257
Feature activation+4.43e+00

Pos logit contributions
 we+7.853
 albeit+7.534
 he+7.227
 though+7.174
 accuses+7.167
Neg logit contributions
VERTISEMENT-13.220
agate-13.045
igate-12.857
idon-12.481
tnc-12.241
Token week
Token ID1285
Feature activation+8.33e+00

Pos logit contributions
 albeit+1.199
 we+1.125
 accuses+1.103
 citiz+1.034
 though+1.005
Neg logit contributions
igate-5.449
agate-5.390
idon-5.339
VERTISEMENT-5.303
OUT-5.191
Token of
Token ID286
Feature activation+4.34e+00

Pos logit contributions
 we+3.446
 albeit+3.138
 though+3.111
 accuses+2.972
 they+2.965
Neg logit contributions
igate-8.902
VERTISEMENT-8.832
tnc-8.495
agate-8.492
idon-8.436
Token researching
Token ID24114
Feature activation+4.82e+00

Pos logit contributions
 albeit+2.229
 accuses+2.219
 we+2.214
 citiz+2.119
 Temper+2.090
Neg logit contributions
igate-4.250
VERTISEMENT-4.237
idon-4.106
agate-4.102
OUT-3.972
Token,
Token ID11
Feature activation+1.11e+01

Pos logit contributions
 we+2.594
 albeit+2.524
 accuses+2.514
 citiz+2.495
 though+2.423
Neg logit contributions
VERTISEMENT-4.642
igate-4.502
idon-4.455
agate-4.341
OUT-4.317
Token<|endoftext|>
Token ID50256
Feature activation+2.40e+00

Pos logit contributions
 accuses+2.480
 we+2.478
 Temper+2.453
 citiz+2.357
 albeit+2.335
Neg logit contributions
VERTISEMENT-4.231
idon-4.228
igate-4.224
agate-4.167
tnc-4.144
TokenCharlie
Token ID37136
Feature activation+8.66e+00

Pos logit contributions
 accuses+1.792
 we+1.790
 albeit+1.787
 Temper+1.732
 though+1.696
Neg logit contributions
igate-1.696
VERTISEMENT-1.650
idon-1.646
agate-1.573
uments-1.453
Token Lee
Token ID5741
Feature activation+4.16e+00

Pos logit contributions
 we+3.776
 Temper+3.694
 accuses+3.559
 albeit+3.511
 he+3.509
Neg logit contributions
igate-8.786
VERTISEMENT-8.588
idon-8.535
TABLE-8.205
OUT-8.173
Token,
Token ID11
Feature activation+1.01e+01

Pos logit contributions
 accuses+1.993
 we+1.824
 Temper+1.818
 albeit+1.798
 citiz+1.773
Neg logit contributions
igate-4.256
VERTISEMENT-4.221
idon-4.139
agate-4.067
OUT-4.033
Token the
Token ID262
Feature activation+6.04e+00

Pos logit contributions
 we+5.613
 accuses+5.542
 albeit+5.424
 though+5.401
 Temper+5.357
Neg logit contributions
VERTISEMENT-8.647
igate-8.520
idon-8.484
agate-8.407
TABLE-8.155
Token much
Token ID881
Feature activation+1.50e+01

Pos logit contributions
 albeit+2.666
 accuses+2.629
 teen+2.615
 citiz+2.609
 we+2.597
Neg logit contributions
VERTISEMENT-6.267
igate-6.152
idon-6.061
agate-5.940
OUT-5.848
Token celebrated
Token ID13943
Feature activation+9.82e+00

Pos logit contributions
 we+5.586
 albeit+5.325
 accuses+5.218
 though+5.039
 he+5.037
Neg logit contributions
VERTISEMENT-15.026
agate-15.012
idon-14.765
igate-14.235
ANY-14.102
Token creator
Token ID13172
Feature activation+6.49e+00

Pos logit contributions
 albeit+4.481
 lingu+4.459
 we+4.325
 teen+4.243
 sporting+4.232
Neg logit contributions
VERTISEMENT-9.836
igate-9.690
idon-9.629
agate-9.273
OUT-9.176
Token of
Token ID286
Feature activation+8.79e-01

Pos logit contributions
 we+1.583
 accuses+1.571
 albeit+1.523
 citiz+1.462
 he+1.326
Neg logit contributions
igate-8.004
VERTISEMENT-7.947
idon-7.804
agate-7.679
OUT-7.673
Token the
Token ID262
Feature activation+3.05e+00

Pos logit contributions
 albeit+0.448
 we+0.443
 citiz+0.440
 Tw+0.434
 Temper+0.432
Neg logit contributions
igate-0.880
VERTISEMENT-0.858
idon-0.852
agate-0.842
OUT-0.815
Token silver
Token ID8465
Feature activation+7.80e+00

Pos logit contributions
 albeit+1.446
 we+1.396
 Temper+1.376
 teen+1.368
 Tw+1.366
Neg logit contributions
igate-3.153
VERTISEMENT-3.091
idon-3.039
agate-3.006
OUT-2.947
TokenT
Token ID51
Feature activation+6.43e+00

Pos logit contributions
 accuses+1.477
 we+1.461
 albeit+1.451
 Temper+1.408
 they+1.388
Neg logit contributions
igate-1.579
VERTISEMENT-1.546
idon-1.510
agate-1.491
-1.401
Tokenether
Token ID6750
Feature activation+8.93e+00

Pos logit contributions
 we+5.165
 he+5.102
 Temper+5.036
 citiz+4.995
 accuses+4.994
Neg logit contributions
igate-3.926
VERTISEMENT-3.885
-3.581
agate-3.545
idon-3.526
Tokens
Token ID82
Feature activation+3.34e+00

Pos logit contributions
 we+6.059
 accuses+6.008
 Temper+5.830
 rises+5.712
 albeit+5.711
Neg logit contributions
VERTISEMENT-6.200
igate-6.158
tnc-5.901
agate-5.847
idon-5.780
Token Unlimited
Token ID26774
Feature activation+7.18e+00

Pos logit contributions
 we+1.811
 accuses+1.782
 albeit+1.749
 Temper+1.727
 citiz+1.727
Neg logit contributions
VERTISEMENT-3.118
igate-3.051
idon-2.953
agate-2.923
OUT-2.920
Token Inc
Token ID3457
Feature activation+9.38e+00

Pos logit contributions
 we+3.741
 accuses+3.736
 Temper+3.640
 albeit+3.537
 Tw+3.507
Neg logit contributions
igate-6.612
VERTISEMENT-6.572
idon-6.449
agate-6.384
tnc-6.204
Token
Token ID447
Feature activation+1.50e+01

Pos logit contributions
 accuses+5.041
 we+5.003
 joins+4.881
 Temper+4.779
 citiz+4.774
Neg logit contributions
igate-7.696
idon-7.628
agate-7.600
VERTISEMENT-7.576
tnc-7.549
Token
Token ID247
Feature activation-4.72e+00

Pos logit contributions
 citiz+8.646
 we+8.390
 accuses+8.125
 he+8.030
 they+8.030
Neg logit contributions
igate-10.701
idon-10.692
tnc-10.017
VERTISEMENT-9.916
OUT-9.695
Tokens
Token ID82
Feature activation+2.05e+00

Pos logit contributions
igate+4.572
VERTISEMENT+4.503
idon+4.451
agate+4.360
OUT+4.314
Neg logit contributions
 accuses-2.645
 citiz-2.629
 albeit-2.528
 Temper-2.484
 we-2.470
Token (
Token ID357
Feature activation+4.87e+00

Pos logit contributions
 citiz+1.078
 accuses+1.066
 we+1.046
 Temper+1.045
 albeit+1.040
Neg logit contributions
igate-1.978
VERTISEMENT-1.951
idon-1.932
agate-1.894
OUT-1.861
TokenT
Token ID51
Feature activation+3.22e+00

Pos logit contributions
 we+4.050
 accuses+4.022
 albeit+4.002
 Temper+3.996
 Tw+3.972
Neg logit contributions
igate-2.965
VERTISEMENT-2.926
idon-2.891
agate-2.723
uments-2.613
TokenUI
Token ID10080
Feature activation+4.50e+00

Pos logit contributions
 accuses+2.270
 we+2.265
 Temper+2.257
 albeit+2.242
 citiz+2.240
Neg logit contributions
VERTISEMENT-2.452
igate-2.399
agate-2.249
idon-2.186
uments-2.136
Token
Token ID246
Feature activation-1.12e+01

Pos logit contributions
igate+0.071
VERTISEMENT+0.067
idon+0.065
agate+0.062
OUT+0.062
Neg logit contributions
 citiz-0.219
 accuses-0.217
 albeit-0.214
 Temper-0.211
 we-0.208
Tokenbar
Token ID5657
Feature activation-6.26e+00

Pos logit contributions
ability+5.435
igate+5.377
idon+5.184
agate+5.129
OUT+5.073
Neg logit contributions
 citiz-12.127
 mathemat-10.815
 adolesc-10.618
 accuses-10.504
 Temper-10.350
Tokenbar
Token ID5657
Feature activation-7.87e+00

Pos logit contributions
igate+3.433
agate+3.242
ability+3.219
idon+3.188
OUT+2.976
Neg logit contributions
 citiz-6.671
 mathemat-5.956
 adolesc-5.940
 accuses-5.862
 ende-5.818
Tokenians
Token ID1547
Feature activation-3.95e+00

Pos logit contributions
igate+7.335
idon+7.198
agate+6.909
ability+6.749
uments+6.741
Neg logit contributions
 citiz-4.882
 ende-4.265
 mathemat-4.219
 accuses-4.174
 Temper-4.163
Token
Token ID447
Feature activation+3.75e-01

Pos logit contributions
igate+3.227
VERTISEMENT+3.073
idon+3.065
OUT+3.017
agate+3.014
Neg logit contributions
 citiz-2.885
 accuses-2.805
 albeit-2.781
 Temper-2.721
 we-2.687
Token
Token ID247
Feature activation-1.63e+01

Pos logit contributions
 citiz+0.373
 accuses+0.368
 albeit+0.366
 Temper+0.359
 we+0.354
Neg logit contributions
igate-0.192
VERTISEMENT-0.183
idon-0.180
OUT-0.176
agate-0.174
Token.
Token ID13
Feature activation-2.19e+00

Pos logit contributions
igate+15.709
VERTISEMENT+15.481
idon+15.374
agate+15.060
OUT+15.058
Neg logit contributions
 citiz-7.410
 accuses-7.109
 Temper-6.523
 albeit-6.517
 cannabin-6.175
Token
Token ID198
Feature activation-6.87e+00

Pos logit contributions
igate+2.148
VERTISEMENT+2.100
idon+2.081
agate+2.060
OUT+2.040
Neg logit contributions
 accuses-1.163
 albeit-1.143
 Temper-1.140
 citiz-1.133
 we-1.081
Token
Token ID198
Feature activation-6.72e+00

Pos logit contributions
igate+4.473
OUT+4.227
ability+4.205
idon+4.054
agate+4.004
Neg logit contributions
 citiz-7.077
 mathemat-6.370
 ende-6.159
 adolesc-6.018
etheless-5.992
TokenIt
Token ID1026
Feature activation-3.13e+00

Pos logit contributions
igate+4.692
VERTISEMENT+4.538
idon+4.475
agate+4.391
OUT+4.299
Neg logit contributions
 accuses-5.457
 citiz-5.440
 albeit-5.389
 Temper-5.295
 we-5.182
Token is
Token ID318
Feature activation-9.01e+00

Pos logit contributions
igate+3.683
VERTISEMENT+3.651
idon+3.585
OUT+3.549
agate+3.525
Neg logit contributions
 accuses-1.115
 citiz-1.075
 albeit-1.044
 we-0.999
 Temper-0.976
Token Commerce
Token ID16127
Feature activation-4.69e-01

Pos logit contributions
igate+1.248
VERTISEMENT+1.232
idon+1.206
agate+1.196
OUT+1.181
Neg logit contributions
 citiz-0.682
 albeit-0.667
 accuses-0.667
 Temper-0.638
 we-0.636
Token Control
Token ID6779
Feature activation-5.89e+00

Pos logit contributions
VERTISEMENT+0.472
igate+0.469
idon+0.459
agate+0.454
OUT+0.451
Neg logit contributions
 citiz-0.236
 accuses-0.230
 albeit-0.228
 we-0.222
 Temper-0.213
Token List
Token ID7343
Feature activation-3.76e+00

Pos logit contributions
igate+6.027
VERTISEMENT+5.912
idon+5.835
agate+5.762
OUT+5.750
Neg logit contributions
 citiz-2.916
 accuses-2.837
 albeit-2.759
 Temper-2.674
 we-2.563
Token.
Token ID13
Feature activation-7.17e-01

Pos logit contributions
igate+3.702
VERTISEMENT+3.643
idon+3.567
agate+3.552
OUT+3.535
Neg logit contributions
 citiz-2.054
 albeit-2.006
 accuses-1.978
 we-1.914
 Temper-1.904
Token
Token ID564
Feature activation-3.53e+00

Pos logit contributions
igate+0.675
idon+0.664
VERTISEMENT+0.661
agate+0.654
OUT+0.635
Neg logit contributions
 albeit-0.404
 Temper-0.399
 accuses-0.398
 we-0.393
 citiz-0.384
Token
Token ID250
Feature activation-1.61e+01

Pos logit contributions
igate+1.742
VERTISEMENT+1.678
idon+1.636
OUT+1.583
agate+1.577
Neg logit contributions
 citiz-3.633
 accuses-3.551
 albeit-3.478
 Temper-3.443
 we-3.346
TokenYes
Token ID5297
Feature activation+5.05e+00

Pos logit contributions
igate+10.350
OUT+10.211
VERTISEMENT+10.009
Deploy+9.678
BACK+9.662
Neg logit contributions
 citiz-13.438
 accuses-12.330
 ende-11.776
 albeit-11.711
 adolesc-11.673
Token,
Token ID11
Feature activation+4.45e+00

Pos logit contributions
 we+2.322
 they+2.058
 albeit+2.027
 citiz+2.014
 though+1.999
Neg logit contributions
VERTISEMENT-5.222
igate-5.142
OUT-5.081
idon-5.067
tnc-4.987
Token we
Token ID356
Feature activation-1.46e+00

Pos logit contributions
 we+2.336
 albeit+2.220
 citiz+2.119
 they+2.098
 though+2.084
Neg logit contributions
VERTISEMENT-4.449
igate-4.373
idon-4.308
agate-4.214
OUT-4.190
Token
Token ID447
Feature activation+3.91e+00

Pos logit contributions
VERTISEMENT+1.569
igate+1.543
idon+1.520
OUT+1.503
agate+1.497
Neg logit contributions
 albeit-0.642
 accuses-0.639
 citiz-0.637
 we-0.633
 Temper-0.597
Token
Token ID247
Feature activation-1.44e+01

Pos logit contributions
 albeit+3.473
 accuses+3.444
 citiz+3.429
 we+3.392
 Temper+3.339
Neg logit contributions
igate-2.273
VERTISEMENT-2.222
idon-2.157
OUT-2.116
agate-2.107
Token Taliban
Token ID15471
Feature activation+5.83e-01

Pos logit contributions
igate+2.363
VERTISEMENT+2.350
idon+2.297
agate+2.269
OUT+2.255
Neg logit contributions
 citiz-1.041
 accuses-1.037
 albeit-1.008
 we-0.968
 Temper-0.962
Token.
Token ID13
Feature activation+3.34e-01

Pos logit contributions
 accuses+0.331
 we+0.322
 citiz+0.321
 albeit+0.318
 though+0.304
Neg logit contributions
igate-0.559
VERTISEMENT-0.559
idon-0.544
agate-0.537
TABLE-0.529
Token
Token ID447
Feature activation+1.71e+00

Pos logit contributions
 Temper+0.198
 we+0.196
 albeit+0.195
 accuses+0.195
 Tw+0.190
Neg logit contributions
igate-0.306
idon-0.298
VERTISEMENT-0.295
agate-0.294
OUT-0.285
Token
Token ID251
Feature activation-1.05e+01

Pos logit contributions
 citiz+1.531
 we+1.487
 albeit+1.484
 accuses+1.476
 though+1.454
Neg logit contributions
igate-1.049
OUT-0.995
idon-0.980
agate-0.970
VERTISEMENT-0.951
Token
Token ID564
Feature activation-4.00e+00

Pos logit contributions
igate+10.466
VERTISEMENT+10.293
idon+10.128
OUT+9.999
agate+9.967
Neg logit contributions
 citiz-5.364
 accuses-5.143
 albeit-4.981
 Temper-4.811
 we-4.588
Token
Token ID250
Feature activation-1.60e+01

Pos logit contributions
igate+1.969
VERTISEMENT+1.900
idon+1.852
OUT+1.796
agate+1.783
Neg logit contributions
 citiz-4.135
 accuses-4.031
 albeit-3.954
 Temper-3.909
 we-3.799
TokenOn
Token ID2202
Feature activation-2.54e+00

Pos logit contributions
igate+11.749
VERTISEMENT+11.697
OUT+11.620
idon+11.243
agate+11.124
Neg logit contributions
 citiz-11.746
 accuses-10.652
 albeit-10.232
 ende-10.205
 mathemat-10.012
Token Sunday
Token ID3502
Feature activation+5.60e+00

Pos logit contributions
VERTISEMENT+2.398
igate+2.394
idon+2.352
agate+2.297
OUT+2.281
Neg logit contributions
 citiz-1.450
 accuses-1.421
 albeit-1.400
 we-1.374
 Temper-1.373
Token,
Token ID11
Feature activation+4.08e+00

Pos logit contributions
 we+2.360
 they+2.221
 though+2.156
 accuses+2.147
 he+2.121
Neg logit contributions
igate-6.116
VERTISEMENT-6.025
idon-5.916
agate-5.879
OUT-5.841
Token I
Token ID314
Feature activation-1.20e+00

Pos logit contributions
 we+2.179
 albeit+2.143
 accuses+2.118
 though+2.080
 they+2.066
Neg logit contributions
igate-3.921
VERTISEMENT-3.920
idon-3.852
agate-3.787
OUT-3.748
Token think
Token ID892
Feature activation+2.47e+00

Pos logit contributions
igate+1.259
VERTISEMENT+1.251
idon+1.231
OUT+1.219
agate+1.206
Neg logit contributions
 accuses-0.551
 citiz-0.539
 albeit-0.539
 we-0.525
 Temper-0.508
Token Action
Token ID7561
Feature activation-7.27e+00

Pos logit contributions
igate+6.951
OUT+6.552
VERTISEMENT+6.550
ability+6.410
agate+6.351
Neg logit contributions
 citiz-6.123
 mathemat-5.823
etheless-5.479
 ende-5.453
 adolesc-4.989
Token Now
Token ID2735
Feature activation-3.33e+00

Pos logit contributions
igate+5.221
OUT+5.035
VERTISEMENT+5.010
ability+4.884
idon+4.833
Neg logit contributions
 citiz-6.305
 mathemat-6.012
etheless-5.630
 ende-5.622
 adolesc-5.382
Token and
Token ID290
Feature activation+1.48e+00

Pos logit contributions
igate+3.487
OUT+3.343
ability+3.322
agate+3.305
idon+3.298
Neg logit contributions
 citiz-1.774
 mathemat-1.490
 accuses-1.447
 ende-1.408
 adolesc-1.402
Token we
Token ID356
Feature activation-3.19e+00

Pos logit contributions
 citiz+0.158
 accuses+0.117
 albeit+0.090
 Temper+0.079
soever+0.030
Neg logit contributions
igate-2.100
VERTISEMENT-2.062
idon-2.049
agate-2.033
OUT-2.032
Token
Token ID447
Feature activation+2.74e+00

Pos logit contributions
igate+2.404
VERTISEMENT+2.343
idon+2.311
agate+2.264
OUT+2.256
Neg logit contributions
 citiz-2.422
 accuses-2.386
 albeit-2.341
 Temper-2.298
 we-2.245
Token
Token ID247
Feature activation-1.59e+01

Pos logit contributions
 accuses+2.618
 citiz+2.614
 albeit+2.597
 Temper+2.537
 we+2.530
Neg logit contributions
igate-1.449
VERTISEMENT-1.407
idon-1.381
OUT-1.326
agate-1.323
Tokenll
Token ID297
Feature activation-6.60e+00

Pos logit contributions
igate+9.627
agate+8.590
OUT+8.271
idon+8.172
ability+8.157
Neg logit contributions
 citiz-14.592
 mathemat-13.551
 adolesc-13.394
 cannabin-12.638
 accuses-12.566
Token send
Token ID3758
Feature activation-1.26e+01

Pos logit contributions
igate+3.966
agate+3.668
idon+3.554
OUT+3.534
ability+3.453
Neg logit contributions
 citiz-6.279
 mathemat-5.790
 accuses-5.743
 Temper-5.717
etheless-5.655
Token you
Token ID345
Feature activation-1.05e+01

Pos logit contributions
igate+9.864
idon+8.971
ability+8.943
OUT+8.888
VICE+8.574
Neg logit contributions
 citiz-9.667
 mathemat-9.014
etheless-8.602
 ende-8.339
 adolesc-8.207
Token three
Token ID1115
Feature activation-1.54e+00

Pos logit contributions
VERTISEMENT+6.947
igate+6.887
OUT+6.540
ability+6.459
+6.266
Neg logit contributions
 citiz-8.283
 mathemat-8.107
 ende-7.716
etheless-7.483
 accuses-7.428
Token meaningful
Token ID11570
Feature activation+1.10e-01

Pos logit contributions
VERTISEMENT+1.040
igate+1.037
OUT+0.980
idon+0.977
ability+0.952
Neg logit contributions
 citiz-1.296
 accuses-1.263
 Temper-1.221
 albeit-1.201
 ende-1.173
Token never
Token ID1239
Feature activation+1.39e+00

Pos logit contributions
igate+5.293
VERTISEMENT+5.188
idon+5.069
agate+5.055
OUT+5.012
Neg logit contributions
 citiz-2.650
 accuses-2.617
 albeit-2.563
 Temper-2.444
 we-2.428
Token captured
Token ID7907
Feature activation-6.33e+00

Pos logit contributions
 accuses+0.760
 albeit+0.750
 we+0.729
 though+0.699
 citiz+0.696
Neg logit contributions
igate-1.347
VERTISEMENT-1.345
OUT-1.314
idon-1.305
agate-1.299
Token.
Token ID13
Feature activation+8.70e-02

Pos logit contributions
igate+5.971
VERTISEMENT+5.903
idon+5.792
agate+5.768
OUT+5.713
Neg logit contributions
 citiz-3.682
 accuses-3.585
 albeit-3.565
 Temper-3.462
 we-3.391
Token It
Token ID632
Feature activation-5.01e+00

Pos logit contributions
 Temper+0.044
 albeit+0.043
 accuses+0.043
 we+0.041
 Tw+0.041
Neg logit contributions
igate-0.087
VERTISEMENT-0.086
idon-0.085
agate-0.085
OUT-0.083
Token
Token ID447
Feature activation+9.14e-01

Pos logit contributions
igate+5.475
VERTISEMENT+5.386
idon+5.300
OUT+5.259
agate+5.248
Neg logit contributions
 accuses-2.204
 citiz-2.193
 albeit-2.124
 Temper-2.006
 we-1.998
Token
Token ID247
Feature activation-1.58e+01

Pos logit contributions
 citiz+0.870
 accuses+0.870
 albeit+0.866
 Temper+0.842
 we+0.837
Neg logit contributions
igate-0.499
VERTISEMENT-0.480
idon-0.470
agate-0.454
OUT-0.454
Tokens
Token ID82
Feature activation-9.40e+00

Pos logit contributions
igate+12.234
idon+11.886
VERTISEMENT+11.884
ability+11.508
OUT+11.495
Neg logit contributions
 citiz-11.044
 accuses-10.342
 albeit-10.155
 Temper-10.136
 cannabin-9.576
Token not
Token ID407
Feature activation-2.02e-01

Pos logit contributions
igate+8.499
VERTISEMENT+8.303
idon+8.190
OUT+8.058
agate+8.055
Neg logit contributions
 citiz-5.707
 accuses-5.553
 albeit-5.389
 Temper-5.284
 we-5.052
Token
Token ID564
Feature activation+7.72e-01

Pos logit contributions
igate+0.182
VERTISEMENT+0.181
idon+0.180
agate+0.176
OUT+0.170
Neg logit contributions
 albeit-0.123
 accuses-0.122
 citiz-0.122
 we-0.121
 though-0.118
Token
Token ID250
Feature activation-1.25e+01

Pos logit contributions
 albeit+0.638
 accuses+0.636
 citiz+0.636
 we+0.618
 though+0.607
Neg logit contributions
igate-0.521
VERTISEMENT-0.496
idon-0.486
OUT-0.480
agate-0.479
Tokenreal
Token ID5305
Feature activation+3.40e+00

Pos logit contributions
igate+10.889
VERTISEMENT+10.628
idon+10.484
OUT+10.368
agate+10.307
Neg logit contributions
 citiz-7.917
 accuses-7.701
 albeit-7.473
 Temper-7.340
 we-6.989
Token bill
Token ID2855
Feature activation-6.95e+00

Pos logit contributions
igate+5.726
VERTISEMENT+5.612
idon+5.539
agate+5.460
OUT+5.447
Neg logit contributions
 citiz-3.191
 accuses-3.115
 albeit-3.044
 Temper-2.944
 we-2.854
Token last
Token ID938
Feature activation+2.92e+00

Pos logit contributions
igate+6.437
VERTISEMENT+6.312
idon+6.219
agate+6.127
OUT+6.110
Neg logit contributions
 citiz-4.111
 accuses-4.014
 albeit-3.924
 Temper-3.817
 we-3.690
Token month
Token ID1227
Feature activation-1.05e-01

Pos logit contributions
 albeit+0.754
 we+0.698
 accuses+0.694
 citiz+0.657
 Fall+0.651
Neg logit contributions
VERTISEMENT-3.660
idon-3.648
igate-3.625
agate-3.574
TABLE-3.498
Token.
Token ID13
Feature activation-4.42e-01

Pos logit contributions
igate+0.094
VERTISEMENT+0.094
agate+0.090
idon+0.090
OUT+0.089
Neg logit contributions
 albeit-0.066
 accuses-0.065
 we-0.064
 citiz-0.063
 though-0.062
Token
Token ID564
Feature activation-3.25e+00

Pos logit contributions
igate+0.420
idon+0.416
agate+0.410
VERTISEMENT+0.409
OUT+0.395
Neg logit contributions
 accuses-0.242
 Temper-0.241
 albeit-0.241
 we-0.236
 citiz-0.230
Token
Token ID250
Feature activation-1.58e+01

Pos logit contributions
igate+1.579
VERTISEMENT+1.524
idon+1.479
OUT+1.433
agate+1.423
Neg logit contributions
 citiz-3.385
 accuses-3.302
 albeit-3.228
 Temper-3.201
 we-3.105
TokenWe
Token ID1135
Feature activation-3.83e+00

Pos logit contributions
igate+10.042
OUT+9.890
VERTISEMENT+9.831
idon+9.470
ability+9.399
Neg logit contributions
 citiz-13.479
 accuses-12.418
 ende-11.870
 albeit-11.839
 Temper-11.771
Token certainly
Token ID3729
Feature activation+4.18e+00

Pos logit contributions
igate+4.063
VERTISEMENT+4.055
idon+3.951
OUT+3.936
agate+3.903
Neg logit contributions
 citiz-1.718
 accuses-1.698
 albeit-1.675
 we-1.612
 Temper-1.582
Token need
Token ID761
Feature activation-5.50e+00

Pos logit contributions
 we+1.799
 albeit+1.728
 accuses+1.701
 though+1.640
 citiz+1.603
Neg logit contributions
VERTISEMENT-4.530
igate-4.395
idon-4.362
OUT-4.336
TABLE-4.278
Token to
Token ID284
Feature activation-6.29e+00

Pos logit contributions
igate+4.818
VERTISEMENT+4.777
idon+4.683
agate+4.627
OUT+4.606
Neg logit contributions
 citiz-3.570
 accuses-3.476
 albeit-3.462
 we-3.384
 Temper-3.314
Token work
Token ID670
Feature activation-6.65e+00

Pos logit contributions
igate+5.747
VERTISEMENT+5.599
idon+5.538
agate+5.445
OUT+5.428
Neg logit contributions
 citiz-3.797
 accuses-3.686
 albeit-3.564
 Temper-3.497
soever-3.310
Token for
Token ID329
Feature activation-4.81e+00

Pos logit contributions
 we+1.175
 accuses+1.169
 albeit+1.138
 citiz+1.128
 Tw+1.115
Neg logit contributions
VERTISEMENT-2.290
igate-2.207
idon-2.182
agate-2.139
ULE-2.120
Token Medical
Token ID8366
Feature activation+8.36e-01

Pos logit contributions
igate+5.492
VERTISEMENT+5.405
idon+5.388
OUT+5.308
agate+5.301
Neg logit contributions
 citiz-1.808
 accuses-1.729
 albeit-1.681
 Temper-1.621
 we-1.569
Token Progress
Token ID18387
Feature activation-7.05e-01

Pos logit contributions
 citiz+0.580
 accuses+0.566
 albeit+0.548
 Temper+0.535
 we+0.518
Neg logit contributions
igate-0.692
VERTISEMENT-0.675
idon-0.662
agate-0.654
OUT-0.653
Token.
Token ID13
Feature activation-6.48e-01

Pos logit contributions
VERTISEMENT+0.658
igate+0.646
idon+0.644
agate+0.625
OUT+0.614
Neg logit contributions
 accuses-0.420
 citiz-0.410
 albeit-0.410
 we-0.397
 Temper-0.396
Token
Token ID564
Feature activation-2.76e+00

Pos logit contributions
igate+0.638
idon+0.637
VERTISEMENT+0.631
agate+0.616
OUT+0.604
Neg logit contributions
 accuses-0.339
 Temper-0.334
 albeit-0.330
 we-0.324
 Tw-0.315
Token
Token ID250
Feature activation-1.58e+01

Pos logit contributions
igate+1.770
VERTISEMENT+1.684
idon+1.660
agate+1.633
OUT+1.628
Neg logit contributions
 citiz-2.453
 albeit-2.433
 accuses-2.424
 we-2.376
 Temper-2.333
TokenPl
Token ID3646
Feature activation-2.38e+00

Pos logit contributions
igate+12.065
OUT+11.942
VERTISEMENT+11.803
BACK+11.421
agate+11.410
Neg logit contributions
 citiz-11.263
 accuses-10.052
 albeit-9.681
 ende-9.637
 mathemat-9.614
Tokenanned
Token ID3577
Feature activation-2.23e+00

Pos logit contributions
igate+1.333
idon+1.251
VERTISEMENT+1.249
agate+1.246
OUT+1.225
Neg logit contributions
 citiz-2.320
 accuses-2.225
 albeit-2.200
 Temper-2.175
soever-2.120
Token Parenthood
Token ID18461
Feature activation-3.90e+00

Pos logit contributions
igate+1.812
VERTISEMENT+1.767
idon+1.738
agate+1.714
OUT+1.709
Neg logit contributions
 citiz-1.568
 accuses-1.521
 albeit-1.484
 Temper-1.458
 we-1.398
Token is
Token ID318
Feature activation-7.58e+00

Pos logit contributions
igate+4.477
VERTISEMENT+4.395
idon+4.353
agate+4.279
OUT+4.271
Neg logit contributions
 accuses-1.507
 citiz-1.495
 albeit-1.407
 Temper-1.344
 we-1.327
Token answering
Token ID18877
Feature activation-5.64e+00

Pos logit contributions
igate+6.692
VERTISEMENT+6.559
idon+6.464
agate+6.348
OUT+6.333
Neg logit contributions
 citiz-4.820
 accuses-4.717
 albeit-4.613
 Temper-4.494
 we-4.360
Token,
Token ID11
Feature activation+2.70e+00

Pos logit contributions
igate+7.511
VERTISEMENT+7.361
idon+7.261
agate+7.145
OUT+7.135
Neg logit contributions
 citiz-4.748
 accuses-4.635
 albeit-4.523
 Temper-4.405
 we-4.250
Token
Token ID564
Feature activation-1.30e+00

Pos logit contributions
 albeit+1.281
 we+1.255
 accuses+1.246
 though+1.190
 citiz+1.171
Neg logit contributions
VERTISEMENT-2.792
igate-2.771
idon-2.743
agate-2.703
OUT-2.659
Token
Token ID250
Feature activation-1.38e+01

Pos logit contributions
igate+0.803
VERTISEMENT+0.783
idon+0.754
OUT+0.746
agate+0.743
Neg logit contributions
 albeit-1.150
 accuses-1.144
 citiz-1.142
 we-1.119
 Temper-1.102
TokenI
Token ID40
Feature activation-3.24e+00

Pos logit contributions
igate+11.079
OUT+10.737
VERTISEMENT+10.716
idon+10.580
agate+10.477
Neg logit contributions
 citiz-9.386
 accuses-8.845
 albeit-8.460
 Temper-8.413
 ende-8.062
Token
Token ID447
Feature activation+2.86e-03

Pos logit contributions
igate+3.245
VERTISEMENT+3.232
idon+3.156
agate+3.114
OUT+3.108
Neg logit contributions
 accuses-1.650
 albeit-1.641
 citiz-1.633
 we-1.582
 Temper-1.554
Token
Token ID247
Feature activation-1.57e+01

Pos logit contributions
 accuses+0.003
 citiz+0.003
 albeit+0.003
 Temper+0.003
 we+0.003
Neg logit contributions
igate-0.002
VERTISEMENT-0.002
idon-0.001
agate-0.001
OUT-0.001
Tokenm
Token ID76
Feature activation-6.85e+00

Pos logit contributions
igate+10.292
idon+9.604
OUT+9.486
agate+9.486
VERTISEMENT+9.415
Neg logit contributions
 citiz-12.941
 accuses-12.167
 albeit-11.763
 Temper-11.727
soever-11.301
Token a
Token ID257
Feature activation-5.74e+00

Pos logit contributions
igate+6.212
VERTISEMENT+6.104
idon+6.021
agate+5.916
OUT+5.880
Neg logit contributions
 citiz-4.198
 accuses-4.124
 albeit-4.055
 Temper-3.936
 we-3.850
Token product
Token ID1720
Feature activation-2.89e+00

Pos logit contributions
igate+5.728
VERTISEMENT+5.572
idon+5.547
agate+5.468
OUT+5.429
Neg logit contributions
 citiz-3.074
 accuses-2.979
 albeit-2.943
 Temper-2.858
 we-2.782
Token of
Token ID286
Feature activation-3.87e+00

Pos logit contributions
igate+3.397
idon+3.301
VERTISEMENT+3.298
OUT+3.267
agate+3.257
Neg logit contributions
 citiz-1.073
 accuses-1.027
 albeit-1.010
 we-0.984
 though-0.922
Token yellow
Token ID7872
Feature activation+3.45e+00

Pos logit contributions
igate+3.406
VERTISEMENT+3.300
idon+3.284
agate+3.226
OUT+3.202
Neg logit contributions
 citiz-2.517
 accuses-2.454
 albeit-2.444
 Temper-2.401
 we-2.375
Token often
Token ID1690
Feature activation+2.39e+00

Pos logit contributions
 albeit+2.019
 we+1.985
 accuses+1.974
 citiz+1.943
 teen+1.915
Neg logit contributions
igate-3.535
idon-3.460
VERTISEMENT-3.409
agate-3.345
tnc-3.217
Token overlooked
Token ID21655
Feature activation-1.46e+00

Pos logit contributions
 accuses+1.385
 we+1.378
 albeit+1.343
 citiz+1.303
 though+1.297
Neg logit contributions
igate-2.224
VERTISEMENT-2.215
idon-2.194
agate-2.148
OUT-2.051
Token.
Token ID13
Feature activation+5.18e-01

Pos logit contributions
igate+1.516
VERTISEMENT+1.491
idon+1.473
agate+1.454
OUT+1.429
Neg logit contributions
 albeit-0.727
 citiz-0.706
 accuses-0.705
 we-0.691
 though-0.681
Token She
Token ID1375
Feature activation-3.51e+00

Pos logit contributions
 accuses+0.258
 Temper+0.257
 albeit+0.257
 we+0.248
 citiz+0.245
Neg logit contributions
igate-0.522
VERTISEMENT-0.509
idon-0.509
agate-0.505
OUT-0.489
Token
Token ID447
Feature activation+1.84e+00

Pos logit contributions
igate+3.897
VERTISEMENT+3.836
idon+3.766
agate+3.721
OUT+3.699
Neg logit contributions
 accuses-1.506
 citiz-1.480
 albeit-1.440
 we-1.368
 Temper-1.355
Token
Token ID247
Feature activation-1.57e+01

Pos logit contributions
 accuses+1.709
 albeit+1.707
 citiz+1.697
 Temper+1.656
 we+1.654
Neg logit contributions
igate-1.042
VERTISEMENT-1.004
idon-0.983
agate-0.955
OUT-0.955
Tokens
Token ID82
Feature activation-8.73e+00

Pos logit contributions
igate+11.730
idon+11.360
VERTISEMENT+11.190
ability+11.017
agate+11.009
Neg logit contributions
 citiz-11.383
 accuses-10.460
 Temper-10.311
 albeit-10.282
 cannabin-9.836
Token has
Token ID468
Feature activation-7.81e+00

Pos logit contributions
igate+8.033
VERTISEMENT+7.868
idon+7.762
agate+7.637
OUT+7.623
Neg logit contributions
 citiz-5.209
 accuses-5.078
 albeit-4.955
 Temper-4.833
 we-4.654
Token her
Token ID607
Feature activation-4.80e+00

Pos logit contributions
igate+7.199
VERTISEMENT+7.040
idon+6.955
agate+6.851
OUT+6.799
Neg logit contributions
 citiz-4.760
 accuses-4.638
 albeit-4.555
 Temper-4.440
 we-4.334
Token own
Token ID898
Feature activation-2.70e+00

Pos logit contributions
igate+4.476
VERTISEMENT+4.397
idon+4.333
agate+4.262
OUT+4.227
Neg logit contributions
 citiz-2.885
 accuses-2.837
 albeit-2.781
 Temper-2.715
 we-2.650
Token main
Token ID1388
Feature activation+2.84e+00

Pos logit contributions
igate+2.721
VERTISEMENT+2.655
idon+2.631
agate+2.594
OUT+2.557
Neg logit contributions
 citiz-1.418
 accuses-1.402
 albeit-1.398
 Temper-1.355
 we-1.329
Token each
Token ID1123
Feature activation+4.15e+00

Pos logit contributions
igate+2.560
VERTISEMENT+2.509
idon+2.488
agate+2.471
OUT+2.442
Neg logit contributions
 citiz-1.488
 albeit-1.470
 accuses-1.459
 we-1.422
 Temper-1.372
Token year
Token ID614
Feature activation-2.07e+00

Pos logit contributions
 albeit+0.879
 we+0.860
 citiz+0.749
 he+0.707
 they+0.706
Neg logit contributions
igate-5.383
agate-5.281
idon-5.257
VERTISEMENT-5.186
OUT-5.126
Token (
Token ID357
Feature activation-2.14e+00

Pos logit contributions
igate+1.979
VERTISEMENT+1.935
idon+1.912
agate+1.910
OUT+1.871
Neg logit contributions
 albeit-1.176
 accuses-1.157
 citiz-1.152
 we-1.139
 Temper-1.108
Tokenauthor
Token ID9800
Feature activation-2.38e+00

Pos logit contributions
VERTISEMENT+1.613
igate+1.598
idon+1.554
agate+1.532
OUT+1.466
Neg logit contributions
 albeit-1.629
 accuses-1.622
 citiz-1.570
 we-1.566
 Temper-1.540
Token
Token ID447
Feature activation+1.82e+00

Pos logit contributions
igate+1.977
VERTISEMENT+1.921
agate+1.897
idon+1.878
OUT+1.877
Neg logit contributions
 accuses-1.652
 citiz-1.636
 albeit-1.629
 Temper-1.592
 we-1.586
Token
Token ID247
Feature activation-1.57e+01

Pos logit contributions
 accuses+1.724
 citiz+1.724
 albeit+1.715
 Temper+1.678
 we+1.658
Neg logit contributions
igate-0.994
VERTISEMENT-0.955
idon-0.942
OUT-0.913
agate-0.911
Tokens
Token ID82
Feature activation-7.16e+00

Pos logit contributions
igate+9.338
ability+9.208
OUT+8.828
idon+8.820
uments+8.476
Neg logit contributions
 citiz-14.064
 mathemat-12.578
etheless-11.907
 ende-11.735
 adolesc-11.648
Token analysis
Token ID3781
Feature activation-5.39e+00

Pos logit contributions
igate+7.638
VERTISEMENT+7.561
idon+7.466
agate+7.369
OUT+7.292
Neg logit contributions
 citiz-3.310
 accuses-3.230
 albeit-3.222
 Temper-3.044
 we-2.981
Token of
Token ID286
Feature activation-6.35e+00

Pos logit contributions
igate+6.237
VERTISEMENT+6.143
idon+6.062
agate+6.005
OUT+5.988
Neg logit contributions
 citiz-1.959
 accuses-1.887
 albeit-1.823
 Temper-1.745
 we-1.650
Token Current
Token ID9236
Feature activation-9.94e-02

Pos logit contributions
igate+6.871
VERTISEMENT+6.773
idon+6.698
agate+6.634
OUT+6.573
Neg logit contributions
 citiz-2.795
 accuses-2.714
 albeit-2.650
 Temper-2.589
 we-2.476
Token Population
Token ID20133
Feature activation-1.82e+00

Pos logit contributions
igate+0.091
VERTISEMENT+0.089
idon+0.087
OUT+0.086
agate+0.085
Neg logit contributions
 citiz-0.062
 accuses-0.059
 albeit-0.057
 Temper-0.055
soever-0.053
Tokenist
Token ID396
Feature activation-6.13e+00

Pos logit contributions
igate+3.837
VERTISEMENT+3.789
idon+3.674
OUT+3.671
agate+3.648
Neg logit contributions
 accuses-1.213
 citiz-1.166
 albeit-1.149
 we-1.108
 Temper-1.098
Token's
Token ID338
Feature activation-3.20e+00

Pos logit contributions
igate+6.465
VERTISEMENT+6.351
idon+6.278
agate+6.192
OUT+6.186
Neg logit contributions
 citiz-2.849
 accuses-2.805
 albeit-2.706
 Temper-2.605
 we-2.507
Token disagreement
Token ID25800
Feature activation-5.02e+00

Pos logit contributions
igate+3.530
VERTISEMENT+3.456
idon+3.446
agate+3.421
OUT+3.375
Neg logit contributions
 accuses-1.342
 citiz-1.340
 albeit-1.325
 Temper-1.250
 we-1.231
Token with
Token ID351
Feature activation-5.70e+00

Pos logit contributions
igate+5.358
VERTISEMENT+5.261
idon+5.243
OUT+5.189
agate+5.125
Neg logit contributions
 accuses-2.373
 citiz-2.320
 albeit-2.280
 we-2.175
 Temper-2.136
Token
Token ID564
Feature activation-1.94e+00

Pos logit contributions
igate+5.048
VERTISEMENT+4.910
idon+4.864
agate+4.804
OUT+4.748
Neg logit contributions
 citiz-3.744
 accuses-3.674
 albeit-3.644
 Temper-3.560
 we-3.505
Token
Token ID250
Feature activation-1.57e+01

Pos logit contributions
igate+1.274
VERTISEMENT+1.217
idon+1.192
OUT+1.175
agate+1.168
Neg logit contributions
 citiz-1.696
 accuses-1.658
 albeit-1.657
 we-1.629
 Temper-1.609
TokenDuring
Token ID7191
Feature activation-2.23e-01

Pos logit contributions
igate+11.405
VERTISEMENT+11.136
idon+11.019
OUT+11.011
agate+10.838
Neg logit contributions
 citiz-11.470
 accuses-10.899
 albeit-10.542
 Temper-10.331
 lingu-10.002
Token these
Token ID777
Feature activation+1.44e+00

Pos logit contributions
igate+0.208
idon+0.203
VERTISEMENT+0.202
agate+0.201
OUT+0.195
Neg logit contributions
 albeit-0.129
 citiz-0.128
 accuses-0.128
 we-0.127
 Temper-0.126
Token two
Token ID734
Feature activation+2.48e+00

Pos logit contributions
 accuses+0.693
 albeit+0.692
 citiz+0.661
 we+0.654
 Temper+0.648
Neg logit contributions
igate-1.493
VERTISEMENT-1.458
agate-1.455
idon-1.452
OUT-1.408
Token ch
Token ID442
Feature activation+6.34e-01

Pos logit contributions
 accuses+1.086
 albeit+1.070
 we+1.020
 Temper+1.003
 citiz+0.991
Neg logit contributions
igate-2.691
agate-2.637
VERTISEMENT-2.627
idon-2.602
OUT-2.519
Tokenor
Token ID273
Feature activation+3.21e+00

Pos logit contributions
 accuses+0.378
 we+0.369
 albeit+0.363
 Temper+0.354
 he+0.349
Neg logit contributions
VERTISEMENT-0.581
igate-0.574
OUT-0.558
-0.553
agate-0.549
Token.
Token ID13
Feature activation-4.24e-02

Pos logit contributions
igate+1.757
VERTISEMENT+1.757
idon+1.714
agate+1.704
OUT+1.685
Neg logit contributions
 citiz-1.327
 accuses-1.319
 albeit-1.313
 we-1.276
 Temper-1.271
Token
Token ID564
Feature activation-3.45e+00

Pos logit contributions
igate+0.041
idon+0.041
agate+0.040
VERTISEMENT+0.040
OUT+0.039
Neg logit contributions
 Temper-0.023
 accuses-0.022
 albeit-0.022
 we-0.022
 Tw-0.021
Token
Token ID250
Feature activation-1.40e+01

Pos logit contributions
igate+1.853
VERTISEMENT+1.780
idon+1.745
agate+1.698
OUT+1.682
Neg logit contributions
 citiz-3.373
 accuses-3.340
 albeit-3.303
 Temper-3.252
 we-3.194
TokenThey
Token ID2990
Feature activation-5.46e+00

Pos logit contributions
igate+10.233
VERTISEMENT+9.945
OUT+9.911
idon+9.689
agate+9.594
Neg logit contributions
 citiz-10.662
 accuses-10.046
 albeit-9.707
 Temper-9.494
 ende-9.361
Token
Token ID447
Feature activation+1.02e-01

Pos logit contributions
VERTISEMENT+5.597
igate+5.590
idon+5.470
OUT+5.371
agate+5.324
Neg logit contributions
 citiz-2.715
 accuses-2.708
 albeit-2.661
 we-2.578
 Temper-2.545
Token
Token ID247
Feature activation-1.57e+01

Pos logit contributions
 citiz+0.100
 accuses+0.100
 albeit+0.099
 Temper+0.097
 we+0.096
Neg logit contributions
igate-0.054
VERTISEMENT-0.052
idon-0.051
agate-0.049
OUT-0.049
Tokenre
Token ID260
Feature activation-9.63e+00

Pos logit contributions
igate+12.755
idon+12.008
agate+11.991
ability+11.774
OUT+11.746
Neg logit contributions
 citiz-10.506
 accuses-9.713
 Temper-9.413
 albeit-9.296
 adolesc-9.097
Token occupying
Token ID30876
Feature activation-4.69e+00

Pos logit contributions
igate+8.989
VERTISEMENT+8.802
idon+8.660
OUT+8.549
agate+8.545
Neg logit contributions
 citiz-5.544
 accuses-5.373
 albeit-5.220
 Temper-5.116
 we-4.855
Token a
Token ID257
Feature activation-5.66e+00

Pos logit contributions
igate+4.220
VERTISEMENT+4.168
idon+4.136
agate+4.078
OUT+4.047
Neg logit contributions
 citiz-2.987
 accuses-2.906
 albeit-2.887
 we-2.796
 Temper-2.768
Token similar
Token ID2092
Feature activation+5.78e+00

Pos logit contributions
igate+4.788
VERTISEMENT+4.637
idon+4.624
agate+4.555
OUT+4.504
Neg logit contributions
 citiz-3.912
 accuses-3.826
 albeit-3.812
 Temper-3.693
 we-3.662
Token space
Token ID2272
Feature activation-3.37e+00

Pos logit contributions
 albeit+3.453
 we+3.316
 citiz+3.211
 though+3.159
 he+3.115
Neg logit contributions
igate-5.278
VERTISEMENT-5.095
idon-5.042
agate-4.992
OUT-4.938
Token.
Token ID13
Feature activation+6.42e-01

Pos logit contributions
igate+0.051
VERTISEMENT+0.049
agate+0.049
idon+0.049
OUT+0.048
Neg logit contributions
 albeit-0.023
 we-0.022
 accuses-0.022
 though-0.022
 citiz-0.021
Token
Token ID564
Feature activation-2.98e+00

Pos logit contributions
 Temper+0.335
 we+0.332
 albeit+0.330
 accuses+0.327
 Tw+0.317
Neg logit contributions
igate-0.631
idon-0.615
agate-0.613
VERTISEMENT-0.610
OUT-0.589
Token
Token ID250
Feature activation-1.55e+01

Pos logit contributions
igate+1.542
VERTISEMENT+1.485
idon+1.449
agate+1.407
OUT+1.403
Neg logit contributions
 citiz-2.973
 accuses-2.928
 albeit-2.886
 Temper-2.844
 we-2.782
TokenWe
Token ID1135
Feature activation-4.13e+00

Pos logit contributions
igate+10.295
OUT+10.146
VERTISEMENT+9.998
idon+9.793
ability+9.775
Neg logit contributions
 citiz-12.510
 accuses-11.596
 albeit-11.054
 ende-11.012
 Temper-10.901
Token
Token ID447
Feature activation+5.05e-01

Pos logit contributions
igate+4.351
VERTISEMENT+4.308
idon+4.229
OUT+4.189
agate+4.174
Neg logit contributions
 citiz-1.891
 accuses-1.868
 albeit-1.823
 Temper-1.739
 we-1.720
Token
Token ID247
Feature activation-1.57e+01

Pos logit contributions
 citiz+0.438
 accuses+0.434
 albeit+0.434
 we+0.428
 Temper+0.417
Neg logit contributions
igate-0.319
VERTISEMENT-0.304
idon-0.303
OUT-0.298
agate-0.295
Tokenve
Token ID303
Feature activation-4.34e+00

Pos logit contributions
igate+14.270
idon+13.730
agate+13.474
ability+13.441
VERTISEMENT+13.317
Neg logit contributions
 citiz-8.848
 accuses-8.201
 Temper-7.865
 albeit-7.729
 adolesc-7.468
Token got
Token ID1392
Feature activation-6.94e+00

Pos logit contributions
igate+3.947
VERTISEMENT+3.941
idon+3.846
agate+3.789
OUT+3.784
Neg logit contributions
 citiz-2.608
 accuses-2.592
 albeit-2.564
 we-2.548
 Temper-2.462
Token this
Token ID428
Feature activation-1.59e+00

Pos logit contributions
igate+6.058
VERTISEMENT+5.994
idon+5.874
agate+5.778
OUT+5.759
Neg logit contributions
 citiz-4.508
 accuses-4.435
 albeit-4.345
 Temper-4.230
 we-4.201
Token investment
Token ID4896
Feature activation-4.73e+00

Pos logit contributions
igate+1.520
VERTISEMENT+1.507
idon+1.486
agate+1.469
OUT+1.448
Neg logit contributions
 citiz-0.880
 albeit-0.873
 accuses-0.865
 we-0.851
 Temper-0.824
Token [
Token ID685
Feature activation-1.82e+00

Pos logit contributions
igate+4.681
VERTISEMENT+4.616
idon+4.546
agate+4.496
OUT+4.481
Neg logit contributions
 citiz-2.494
 accuses-2.441
 albeit-2.408
 Temper-2.301
 we-2.289
Token make
Token ID787
Feature activation-7.09e+00

Pos logit contributions
igate+5.798
VERTISEMENT+5.647
idon+5.577
OUT+5.481
agate+5.479
Neg logit contributions
 citiz-4.233
 accuses-4.120
 albeit-3.979
 Temper-3.914
soever-3.719
Token projects
Token ID4493
Feature activation-1.10e+01

Pos logit contributions
igate+6.580
VERTISEMENT+6.431
idon+6.340
OUT+6.246
agate+6.236
Neg logit contributions
 citiz-4.142
 accuses-4.026
 albeit-3.888
 Temper-3.815
 we-3.622
Token.
Token ID13
Feature activation-7.11e-01

Pos logit contributions
igate+11.177
VERTISEMENT+10.778
idon+10.625
ability+10.524
OUT+10.524
Neg logit contributions
 citiz-5.129
 accuses-4.955
 Temper-4.619
 albeit-4.454
soever-4.191
Token I
Token ID314
Feature activation-2.94e+00

Pos logit contributions
igate+0.741
VERTISEMENT+0.731
idon+0.721
agate+0.714
OUT+0.706
Neg logit contributions
 accuses-0.325
 albeit-0.324
 Temper-0.323
 citiz-0.321
 we-0.314
Token
Token ID447
Feature activation+1.63e+00

Pos logit contributions
igate+2.902
VERTISEMENT+2.892
idon+2.809
OUT+2.784
agate+2.769
Neg logit contributions
 albeit-1.544
 accuses-1.537
 citiz-1.534
 we-1.493
 Temper-1.454
Token
Token ID247
Feature activation-1.56e+01

Pos logit contributions
 albeit+1.502
 accuses+1.501
 citiz+1.480
 we+1.462
 Temper+1.456
Neg logit contributions
igate-0.906
VERTISEMENT-0.897
idon-0.870
OUT-0.844
agate-0.840
Tokenve
Token ID303
Feature activation-6.40e+00

Pos logit contributions
igate+10.972
idon+10.392
agate+10.154
ability+10.072
OUT+9.927
Neg logit contributions
 citiz-12.308
 accuses-11.385
 Temper-11.066
 albeit-10.851
 adolesc-10.637
Token contributed
Token ID8639
Feature activation-9.12e+00

Pos logit contributions
igate+5.521
VERTISEMENT+5.406
idon+5.323
agate+5.234
OUT+5.224
Neg logit contributions
 citiz-4.183
 accuses-4.092
 albeit-4.005
 Temper-3.913
 we-3.788
Token regularly
Token ID7987
Feature activation+4.21e-01

Pos logit contributions
igate+8.084
VERTISEMENT+7.890
idon+7.788
OUT+7.647
agate+7.646
Neg logit contributions
 citiz-5.699
 accuses-5.564
 albeit-5.404
 Temper-5.288
 we-5.070
Token to
Token ID284
Feature activation-5.93e+00

Pos logit contributions
 albeit+0.240
 though+0.231
 we+0.226
 accuses+0.223
 citiz+0.219
Neg logit contributions
VERTISEMENT-0.405
igate-0.399
idon-0.393
OUT-0.390
agate-0.390
Token Arduino
Token ID27634
Feature activation-4.26e+00

Pos logit contributions
igate+5.868
VERTISEMENT+5.763
idon+5.681
agate+5.601
OUT+5.571
Neg logit contributions
 citiz-3.165
 accuses-3.073
 albeit-3.008
 Temper-2.931
 we-2.829
Token.
Token ID13
Feature activation-2.03e+00

Pos logit contributions
igate+2.491
VERTISEMENT+2.411
agate+2.391
idon+2.370
OUT+2.355
Neg logit contributions
 citiz-1.659
 accuses-1.648
 albeit-1.623
 we-1.598
 Temper-1.583
Token4
Token ID19
Feature activation-3.20e+00

Pos logit contributions
igate+1.473
VERTISEMENT+1.448
idon+1.432
agate+1.404
OUT+1.364
Neg logit contributions
 accuses-1.582
 albeit-1.562
 Temper-1.541
 we-1.531
 citiz-1.496
Token billion
Token ID2997
Feature activation+5.70e-02

Pos logit contributions
igate+3.275
VERTISEMENT+3.210
idon+3.169
agate+3.162
OUT+3.087
Neg logit contributions
 accuses-1.619
 citiz-1.615
 albeit-1.567
 Temper-1.538
 we-1.525
Token.
Token ID13
Feature activation-5.97e-01

Pos logit contributions
 albeit+0.033
 we+0.032
 citiz+0.032
 accuses+0.031
 though+0.031
Neg logit contributions
igate-0.054
VERTISEMENT-0.053
idon-0.053
agate-0.052
OUT-0.052
Token
Token ID564
Feature activation-3.18e+00

Pos logit contributions
igate+0.558
idon+0.540
agate+0.536
VERTISEMENT+0.531
OUT+0.518
Neg logit contributions
 Temper-0.349
 accuses-0.341
 albeit-0.339
 citiz-0.338
 we-0.337
Token
Token ID250
Feature activation-1.56e+01

Pos logit contributions
igate+1.719
VERTISEMENT+1.650
idon+1.618
agate+1.574
OUT+1.563
Neg logit contributions
 citiz-3.088
 accuses-3.059
 albeit-3.029
 Temper-2.976
 we-2.928
TokenM
Token ID44
Feature activation-8.94e-01

Pos logit contributions
igate+10.085
OUT+10.060
VERTISEMENT+9.870
TABLE+9.539
BACK+9.537
Neg logit contributions
 citiz-13.055
 accuses-11.977
 ende-11.447
 albeit-11.419
etheless-11.279
Token.
Token ID13
Feature activation-9.62e-01

Pos logit contributions
VERTISEMENT+0.702
igate+0.688
idon+0.672
agate+0.654
OUT+0.649
Neg logit contributions
 accuses-0.635
 albeit-0.619
 citiz-0.618
 we-0.613
 Temper-0.605
TokenK
Token ID42
Feature activation-1.56e+00

Pos logit contributions
igate+0.703
VERTISEMENT+0.679
idon+0.675
agate+0.667
OUT+0.640
Neg logit contributions
 accuses-0.759
 citiz-0.750
 albeit-0.749
 we-0.740
 Temper-0.737
Token.
Token ID13
Feature activation-1.28e+00

Pos logit contributions
igate+1.309
VERTISEMENT+1.292
idon+1.273
agate+1.258
OUT+1.255
Neg logit contributions
 citiz-1.042
 accuses-1.028
 albeit-1.001
 Temper-0.993
 we-0.981
Token and
Token ID290
Feature activation+2.74e+00

Pos logit contributions
igate+1.329
VERTISEMENT+1.304
idon+1.304
agate+1.277
OUT+1.252
Neg logit contributions
 accuses-0.619
 citiz-0.615
 albeit-0.601
 we-0.590
 Temper-0.586
Token.
Token ID13
Feature activation+1.13e-01

Pos logit contributions
igate+6.965
VERTISEMENT+6.835
idon+6.751
agate+6.680
OUT+6.654
Neg logit contributions
 citiz-2.839
 accuses-2.742
 albeit-2.680
 Temper-2.579
 we-2.474
Token
Token ID564
Feature activation-2.06e+00

Pos logit contributions
 Temper+0.056
 albeit+0.056
 accuses+0.055
 we+0.053
 citiz+0.052
Neg logit contributions
igate-0.114
idon-0.112
VERTISEMENT-0.111
agate-0.111
OUT-0.108
Token
Token ID250
Feature activation-1.43e+01

Pos logit contributions
igate+1.392
idon+1.317
VERTISEMENT+1.310
agate+1.300
OUT+1.292
Neg logit contributions
 citiz-1.777
 albeit-1.736
 accuses-1.735
 we-1.700
 though-1.673
TokenI
Token ID40
Feature activation-3.04e+00

Pos logit contributions
igate+11.331
VERTISEMENT+11.265
OUT+11.102
idon+10.891
ability+10.663
Neg logit contributions
 citiz-9.753
 accuses-9.173
 albeit-8.698
 Temper-8.470
 ende-8.456
Token
Token ID447
Feature activation+1.32e+00

Pos logit contributions
igate+3.101
VERTISEMENT+3.069
idon+3.008
OUT+2.973
agate+2.967
Neg logit contributions
 citiz-1.493
 accuses-1.490
 albeit-1.481
 we-1.409
 Temper-1.403
Token
Token ID247
Feature activation-1.56e+01

Pos logit contributions
 accuses+1.194
 albeit+1.192
 citiz+1.191
 we+1.161
 Temper+1.158
Neg logit contributions
igate-0.756
VERTISEMENT-0.739
idon-0.730
agate-0.700
OUT-0.699
Tokenve
Token ID303
Feature activation-4.30e+00

Pos logit contributions
igate+11.946
idon+11.344
agate+11.153
VERTISEMENT+11.153
OUT+11.149
Neg logit contributions
 citiz-10.940
 accuses-10.277
 Temper-9.841
 albeit-9.701
 rises-9.333
Token h
Token ID289
Feature activation-4.34e+00

Pos logit contributions
igate+3.836
VERTISEMENT+3.799
idon+3.733
agate+3.675
OUT+3.649
Neg logit contributions
 citiz-2.663
 accuses-2.633
 albeit-2.606
 Temper-2.542
 we-2.520
Tokeniked
Token ID17951
Feature activation-7.69e+00

Pos logit contributions
VERTISEMENT+2.872
igate+2.871
idon+2.762
agate+2.725
OUT+2.708
Neg logit contributions
 citiz-3.624
 accuses-3.584
 albeit-3.579
 we-3.525
 Temper-3.512
Token in
Token ID287
Feature activation-5.73e+00

Pos logit contributions
igate+7.177
VERTISEMENT+7.007
idon+6.912
OUT+6.816
agate+6.803
Neg logit contributions
 citiz-4.443
 accuses-4.329
 albeit-4.182
 Temper-4.081
 we-3.897
Token rain
Token ID6290
Feature activation-2.42e+00

Pos logit contributions
igate+5.933
VERTISEMENT+5.843
idon+5.786
agate+5.718
OUT+5.658
Neg logit contributions
 citiz-2.829
 accuses-2.724
 albeit-2.690
 Temper-2.630
 we-2.535
Token 352
Token ID44063
Feature activation+1.30e+00

Pos logit contributions
igate+0.594
VERTISEMENT+0.584
idon+0.582
agate+0.565
OUT+0.561
Neg logit contributions
 accuses-0.434
 citiz-0.434
 albeit-0.431
 we-0.420
 Temper-0.419
Token in
Token ID287
Feature activation-4.96e+00

Pos logit contributions
 we+0.645
 albeit+0.645
 accuses+0.632
 citiz+0.629
 though+0.607
Neg logit contributions
VERTISEMENT-1.312
igate-1.311
idon-1.290
agate-1.289
OUT-1.247
Token 1996
Token ID8235
Feature activation+5.16e-01

Pos logit contributions
igate+4.363
VERTISEMENT+4.270
idon+4.208
agate+4.139
OUT+4.131
Neg logit contributions
 citiz-3.165
 accuses-3.092
 albeit-3.024
 Temper-2.955
 we-2.853
Token.
Token ID13
Feature activation-3.30e-01

Pos logit contributions
 albeit+0.305
 citiz+0.290
 accuses+0.288
 we+0.287
 though+0.285
Neg logit contributions
igate-0.487
idon-0.472
VERTISEMENT-0.470
agate-0.463
OUT-0.457
Token
Token ID564
Feature activation-2.31e+00

Pos logit contributions
igate+0.325
idon+0.322
VERTISEMENT+0.319
agate+0.318
OUT+0.306
Neg logit contributions
 Temper-0.174
 albeit-0.170
 accuses-0.168
 we-0.162
 Tw-0.162
Token
Token ID250
Feature activation-1.56e+01

Pos logit contributions
igate+1.506
idon+1.413
VERTISEMENT+1.405
OUT+1.389
agate+1.383
Neg logit contributions
 citiz-2.062
 accuses-2.005
 albeit-2.001
 we-1.951
 Temper-1.931
TokenWhat
Token ID2061
Feature activation+3.21e+00

Pos logit contributions
igate+10.763
VERTISEMENT+10.597
OUT+10.562
idon+10.174
agate+10.058
Neg logit contributions
 citiz-12.083
 accuses-11.306
 albeit-10.761
 ende-10.628
 Temper-10.560
Token B
Token ID347
Feature activation-4.80e+00

Pos logit contributions
 we+1.521
 citiz+1.457
 accuses+1.422
 albeit+1.388
 they+1.363
Neg logit contributions
igate-3.348
VERTISEMENT-3.302
idon-3.292
OUT-3.177
agate-3.160
Token.
Token ID13
Feature activation-8.64e-01

Pos logit contributions
igate+3.355
idon+3.141
OUT+3.125
VERTISEMENT+3.117
agate+3.089
Neg logit contributions
 citiz-4.128
 accuses-3.881
 albeit-3.762
 Temper-3.740
 ende-3.702
TokenC
Token ID34
Feature activation-2.58e+00

Pos logit contributions
igate+0.486
VERTISEMENT+0.461
idon+0.451
OUT+0.450
agate+0.440
Neg logit contributions
 citiz-0.858
 accuses-0.814
 albeit-0.795
 Temper-0.785
 ende-0.779
Token.
Token ID13
Feature activation-2.90e+00

Pos logit contributions
igate+2.293
VERTISEMENT+2.199
idon+2.183
agate+2.161
OUT+2.131
Neg logit contributions
 citiz-1.726
 accuses-1.682
 albeit-1.630
 we-1.626
 Temper-1.587
Token before
Token ID878
Feature activation+2.35e+00

Pos logit contributions
 albeit+0.444
 accuses+0.440
 we+0.439
 citiz+0.426
 though+0.426
Neg logit contributions
VERTISEMENT-0.907
igate-0.889
agate-0.884
idon-0.884
OUT-0.855
Token time
Token ID640
Feature activation+2.23e+00

Pos logit contributions
 we+1.350
 albeit+1.330
 accuses+1.313
 citiz+1.307
 he+1.301
Neg logit contributions
igate-2.207
VERTISEMENT-2.175
agate-2.170
idon-2.118
OUT-2.100
Token zero
Token ID6632
Feature activation+3.28e+00

Pos logit contributions
 albeit+1.293
 we+1.256
 accuses+1.243
 though+1.212
 they+1.198
Neg logit contributions
igate-2.093
VERTISEMENT-2.047
agate-2.013
idon-1.985
OUT-1.958
Token.
Token ID13
Feature activation+2.37e-01

Pos logit contributions
 albeit+1.887
 we+1.846
 accuses+1.792
 though+1.788
 Temper+1.739
Neg logit contributions
igate-3.039
VERTISEMENT-2.996
agate-2.986
OUT-2.909
idon-2.906
Token
Token ID564
Feature activation-3.05e+00

Pos logit contributions
 Temper+0.123
 albeit+0.118
 accuses+0.118
 Tw+0.114
 we+0.113
Neg logit contributions
igate-0.239
idon-0.231
agate-0.231
VERTISEMENT-0.231
OUT-0.223
Token
Token ID250
Feature activation-1.55e+01

Pos logit contributions
igate+1.784
VERTISEMENT+1.719
idon+1.687
agate+1.648
OUT+1.628
Neg logit contributions
 citiz-2.827
 accuses-2.807
 albeit-2.795
 Temper-2.730
 we-2.722
TokenI
Token ID40
Feature activation-3.38e+00

Pos logit contributions
igate+9.772
OUT+9.764
VERTISEMENT+9.592
idon+9.374
BACK+9.369
Neg logit contributions
 citiz-13.060
 accuses-12.005
 albeit-11.649
 ende-11.617
 mathemat-11.379
Token cannot
Token ID2314
Feature activation-1.26e+00

Pos logit contributions
igate+3.457
VERTISEMENT+3.421
idon+3.355
agate+3.310
OUT+3.309
Neg logit contributions
 citiz-1.654
 accuses-1.637
 albeit-1.589
 Temper-1.528
 we-1.488
Token do
Token ID466
Feature activation-6.11e+00

Pos logit contributions
igate+1.188
VERTISEMENT+1.185
idon+1.150
OUT+1.141
agate+1.137
Neg logit contributions
 accuses-0.708
 citiz-0.703
 albeit-0.702
 we-0.672
 Temper-0.672
Token it
Token ID340
Feature activation-5.02e+00

Pos logit contributions
igate+5.674
VERTISEMENT+5.617
idon+5.492
agate+5.416
OUT+5.405
Neg logit contributions
 citiz-3.626
 accuses-3.564
 albeit-3.492
 Temper-3.395
 we-3.331
Token,
Token ID11
Feature activation+3.60e+00

Pos logit contributions
igate+4.632
VERTISEMENT+4.605
idon+4.475
agate+4.427
OUT+4.426
Neg logit contributions
 citiz-3.000
 accuses-2.965
 albeit-2.928
 Temper-2.840
 we-2.813
Tokenens
Token ID641
Feature activation-7.35e+00

Pos logit contributions
idon+4.588
igate+4.539
agate+4.203
ability+4.202
OUT+4.106
Neg logit contributions
 citiz-8.385
 accuses-7.605
 ende-7.508
etheless-7.490
 mathemat-7.463
Tokeny
Token ID88
Feature activation-6.05e+00

Pos logit contributions
igate+4.357
idon+4.283
ability+4.094
agate+4.087
OUT+3.960
Neg logit contributions
 citiz-7.408
 mathemat-6.540
 accuses-6.506
 ende-6.503
 adolesc-6.405
Token.
Token ID13
Feature activation-6.93e-02

Pos logit contributions
igate+5.865
VERTISEMENT+5.742
idon+5.652
agate+5.589
OUT+5.528
Neg logit contributions
 citiz-3.392
 accuses-3.316
 albeit-3.284
 Temper-3.192
 we-3.108
Token I
Token ID314
Feature activation-3.47e+00

Pos logit contributions
igate+0.074
VERTISEMENT+0.073
idon+0.072
agate+0.071
OUT+0.070
Neg logit contributions
 Temper-0.030
 albeit-0.030
 accuses-0.030
 we-0.029
 citiz-0.029
Token
Token ID447
Feature activation+1.06e+00

Pos logit contributions
igate+3.430
VERTISEMENT+3.400
idon+3.315
OUT+3.282
agate+3.268
Neg logit contributions
 citiz-1.822
 albeit-1.822
 accuses-1.818
 we-1.754
 Temper-1.719
Token
Token ID247
Feature activation-1.55e+01

Pos logit contributions
 accuses+0.997
 albeit+0.994
 citiz+0.987
 Temper+0.968
 we+0.967
Neg logit contributions
igate-0.579
VERTISEMENT-0.570
idon-0.555
OUT-0.537
agate-0.535
Tokenm
Token ID76
Feature activation-5.33e+00

Pos logit contributions
igate+11.729
idon+11.181
agate+10.949
ability+10.884
VERTISEMENT+10.728
Neg logit contributions
 citiz-11.400
 accuses-10.421
 Temper-10.118
 albeit-9.833
 adolesc-9.823
Token happy
Token ID3772
Feature activation+2.42e+00

Pos logit contributions
igate+5.131
VERTISEMENT+5.073
idon+4.978
agate+4.902
OUT+4.865
Neg logit contributions
 citiz-2.980
 accuses-2.925
 albeit-2.906
 Temper-2.775
 we-2.753
Token to
Token ID284
Feature activation-3.36e+00

Pos logit contributions
 we+1.371
 albeit+1.335
 citiz+1.317
 though+1.286
 they+1.263
Neg logit contributions
igate-2.373
VERTISEMENT-2.366
idon-2.299
agate-2.246
OUT-2.231
Token say
Token ID910
Feature activation+3.67e-01

Pos logit contributions
igate+3.186
VERTISEMENT+3.140
idon+3.080
agate+3.045
OUT+3.044
Neg logit contributions
 citiz-1.899
 accuses-1.885
 albeit-1.866
 Temper-1.795
 we-1.777
Token my
Token ID616
Feature activation-1.57e+00

Pos logit contributions
 we+0.190
 albeit+0.181
 citiz+0.181
 Temper+0.175
 though+0.174
Neg logit contributions
igate-0.380
VERTISEMENT-0.374
agate-0.365
idon-0.361
OUT-0.356
Token that
Token ID326
Feature activation+3.15e+00

Pos logit contributions
igate+6.034
VERTISEMENT+5.903
idon+5.826
agate+5.765
OUT+5.721
Neg logit contributions
 citiz-3.770
 accuses-3.680
 albeit-3.626
 Temper-3.504
 we-3.427
Token category
Token ID6536
Feature activation-3.75e-01

Pos logit contributions
 albeit+0.998
 we+0.942
 citiz+0.921
 accuses+0.903
 teen+0.872
Neg logit contributions
agate-3.763
igate-3.759
idon-3.718
VERTISEMENT-3.700
OUT-3.572
Token (
Token ID357
Feature activation-1.53e+00

Pos logit contributions
igate+0.358
VERTISEMENT+0.347
agate+0.344
idon+0.343
OUT+0.340
Neg logit contributions
 albeit-0.217
 we-0.213
 citiz-0.207
 accuses-0.206
 though-0.204
Tokenauthor
Token ID9800
Feature activation-1.00e+00

Pos logit contributions
igate+1.139
VERTISEMENT+1.133
idon+1.115
agate+1.089
OUT+1.040
Neg logit contributions
 albeit-1.163
 accuses-1.156
 citiz-1.121
 we-1.120
 Temper-1.104
Token
Token ID447
Feature activation+1.67e+00

Pos logit contributions
igate+0.877
VERTISEMENT+0.861
agate+0.850
OUT+0.839
idon+0.839
Neg logit contributions
 accuses-0.637
 albeit-0.632
 citiz-0.629
 Temper-0.622
 we-0.619
Token
Token ID247
Feature activation-1.55e+01

Pos logit contributions
 accuses+1.539
 albeit+1.535
 citiz+1.533
 Temper+1.492
 we+1.484
Neg logit contributions
igate-0.943
VERTISEMENT-0.906
idon-0.902
OUT-0.868
agate-0.865
Tokens
Token ID82
Feature activation-6.63e+00

Pos logit contributions
igate+12.149
VERTISEMENT+11.806
idon+11.682
OUT+11.473
ability+11.400
Neg logit contributions
 citiz-10.689
 accuses-10.085
 albeit-9.875
 Temper-9.667
soever-9.279
Token analysis
Token ID3781
Feature activation-4.22e+00

Pos logit contributions
igate+6.951
VERTISEMENT+6.842
idon+6.765
agate+6.681
OUT+6.619
Neg logit contributions
 citiz-3.172
 accuses-3.098
 albeit-3.079
 Temper-2.942
 we-2.860
Token of
Token ID286
Feature activation-5.64e+00

Pos logit contributions
igate+4.990
VERTISEMENT+4.915
idon+4.840
agate+4.821
OUT+4.793
Neg logit contributions
 citiz-1.451
 accuses-1.405
 albeit-1.373
 Temper-1.306
 we-1.256
Token Current
Token ID9236
Feature activation+3.05e+00

Pos logit contributions
igate+6.720
VERTISEMENT+6.608
idon+6.584
agate+6.541
OUT+6.452
Neg logit contributions
 citiz-1.916
 accuses-1.835
 albeit-1.808
 Temper-1.762
 we-1.687
Token Population
Token ID20133
Feature activation+1.04e+00

Pos logit contributions
 albeit+1.353
 Temper+1.342
 accuses+1.309
 we+1.307
 citiz+1.295
Neg logit contributions
igate-3.170
agate-3.130
VERTISEMENT-3.123
idon-3.115
OUT-3.035
Tokens
Token ID82
Feature activation+1.84e+00

Pos logit contributions
igate+6.320
VERTISEMENT+6.186
idon+6.164
agate+6.029
OUT+6.003
Neg logit contributions
 citiz-3.200
 accuses-3.192
 albeit-3.112
 Temper-2.998
 we-2.951
Token European
Token ID3427
Feature activation+1.26e+01

Pos logit contributions
 accuses+0.864
 albeit+0.863
 citiz+0.846
 Temper+0.831
 we+0.830
Neg logit contributions
igate-1.882
idon-1.862
VERTISEMENT-1.851
agate-1.799
OUT-1.780
Token edition
Token ID8313
Feature activation+5.50e+00

Pos logit contributions
 citiz+5.950
 Temper+5.892
 albeit+5.881
 accuses+5.761
 we+5.692
Neg logit contributions
igate-11.238
VERTISEMENT-11.223
idon-11.045
OUT-10.794
BACK-10.737
Token.
Token ID13
Feature activation+4.67e+00

Pos logit contributions
 accuses+3.104
 we+3.046
 albeit+3.010
 Temper+2.933
 though+2.922
Neg logit contributions
igate-4.992
idon-4.852
VERTISEMENT-4.838
agate-4.773
TABLE-4.655
Token The
Token ID383
Feature activation+3.67e+00

Pos logit contributions
 Temper+2.434
 we+2.356
 accuses+2.355
 albeit+2.317
 Tw+2.276
Neg logit contributions
idon-4.428
igate-4.407
VERTISEMENT-4.307
agate-4.245
tnc-4.128
Token Norwegian
Token ID22158
Feature activation+1.15e+01

Pos logit contributions
 accuses+1.651
 citiz+1.639
 albeit+1.638
 Temper+1.602
 we+1.594
Neg logit contributions
VERTISEMENT-3.775
igate-3.745
idon-3.663
agate-3.605
TABLE-3.548
Token Police
Token ID4287
Feature activation+3.67e+00

Pos logit contributions
 accuses+5.371
 citiz+5.309
 albeit+5.153
 we+5.051
 Temper+5.018
Neg logit contributions
igate-10.988
idon-10.597
tnc-10.594
VERTISEMENT-10.512
agate-10.254
Token Security
Token ID4765
Feature activation+5.02e+00

Pos logit contributions
 accuses+1.920
 Temper+1.777
 albeit+1.753
 citiz+1.749
 we+1.741
Neg logit contributions
VERTISEMENT-3.678
igate-3.629
idon-3.581
OUT-3.519
TABLE-3.504
Token Service
Token ID4809
Feature activation-7.88e-01

Pos logit contributions
 accuses+2.529
 albeit+2.400
 citiz+2.400
 we+2.345
 Temper+2.343
Neg logit contributions
igate-4.978
VERTISEMENT-4.972
idon-4.788
agate-4.735
OUT-4.725
Token informed
Token ID7981
Feature activation-3.60e-01

Pos logit contributions
igate+0.843
VERTISEMENT+0.837
idon+0.828
OUT+0.814
agate+0.814
Neg logit contributions
 accuses-0.376
 citiz-0.356
 albeit-0.343
 Temper-0.330
 we-0.328
Token that
Token ID326
Feature activation+4.36e+00

Pos logit contributions
VERTISEMENT+0.361
igate+0.355
idon+0.347
agate+0.342
OUT+0.338
Neg logit contributions
 citiz-0.196
 accuses-0.190
 albeit-0.183
 we-0.182
 Temper-0.181
Token for
Token ID329
Feature activation+1.64e+00

Pos logit contributions
 albeit+1.822
 we+1.779
 accuses+1.764
 though+1.722
 he+1.718
Neg logit contributions
VERTISEMENT-3.832
agate-3.713
igate-3.707
idon-3.697
OUT-3.629
Token Japan
Token ID2869
Feature activation+8.79e+00

Pos logit contributions
 accuses+0.868
 albeit+0.855
 citiz+0.852
 Temper+0.827
 we+0.827
Neg logit contributions
VERTISEMENT-1.622
igate-1.601
idon-1.573
agate-1.559
OUT-1.538
Token as
Token ID355
Feature activation+6.85e+00

Pos logit contributions
 accuses+3.493
 albeit+3.460
 though+3.345
 citiz+3.300
 we+3.259
Neg logit contributions
igate-9.062
VERTISEMENT-9.012
tnc-8.882
idon-8.787
OUT-8.624
Token he
Token ID339
Feature activation+3.03e+00

Pos logit contributions
 he+3.734
 we+3.668
 they+3.577
 accuses+3.553
 though+3.508
Neg logit contributions
igate-6.364
VERTISEMENT-6.335
OUT-6.260
idon-6.253
agate-6.200
Token and
Token ID290
Feature activation+8.88e+00

Pos logit contributions
 accuses+1.549
 albeit+1.452
 we+1.419
 citiz+1.340
 though+1.337
Neg logit contributions
VERTISEMENT-3.176
igate-3.131
OUT-3.043
idon-3.040
agate-2.999
Token Prime
Token ID5537
Feature activation+1.30e+01

Pos logit contributions
 accuses+4.458
 albeit+4.417
 he+4.407
 we+4.290
 Temper+4.246
Neg logit contributions
VERTISEMENT-8.232
igate-8.225
agate-8.135
idon-8.076
OUT-7.856
Token Minister
Token ID4139
Feature activation+7.36e+00

Pos logit contributions
 accuses+4.561
 albeit+4.542
 Temper+4.448
 we+4.266
 Tw+4.072
Neg logit contributions
igate-14.126
VERTISEMENT-14.031
idon-13.764
agate-13.601
OUT-13.421
Token Shinzo
Token ID43531
Feature activation+1.08e+01

Pos logit contributions
 accuses+3.694
 albeit+3.692
 we+3.520
 citiz+3.418
 he+3.350
Neg logit contributions
igate-7.208
VERTISEMENT-7.197
agate-7.114
idon-6.955
OUT-6.928
Token Abe
Token ID22257
Feature activation+2.08e+00

Pos logit contributions
 accuses+4.379
 albeit+4.298
 we+4.097
 Temper+4.025
 though+3.887
Neg logit contributions
VERTISEMENT-11.230
igate-11.101
agate-10.845
idon-10.819
OUT-10.718
Token issued
Token ID4884
Feature activation-2.06e+00

Pos logit contributions
 accuses+1.012
 albeit+0.982
 we+0.973
 citiz+0.927
 though+0.908
Neg logit contributions
VERTISEMENT-2.141
igate-2.121
OUT-2.070
idon-2.066
agate-2.034
Token a
Token ID257
Feature activation-9.81e-01

Pos logit contributions
igate+1.708
VERTISEMENT+1.705
idon+1.692
agate+1.674
OUT+1.655
Neg logit contributions
 accuses-1.399
 citiz-1.389
 albeit-1.384
 Temper-1.343
 we-1.337
TokenUN
Token ID4944
Feature activation+7.52e+00

Pos logit contributions
igate+1.207
VERTISEMENT+1.176
idon+1.165
agate+1.122
OUT+1.052
Neg logit contributions
 accuses-1.288
 albeit-1.278
 we-1.262
 Temper-1.243
 citiz-1.212
TokenITED
Token ID22061
Feature activation+9.71e+00

Pos logit contributions
 accuses+6.904
 citiz+6.539
 Temper+6.467
 we+6.415
 albeit+6.329
Neg logit contributions
agate-4.103
igate-4.099
idon-3.950
VERTISEMENT-3.777
tnc-3.573
Token N
Token ID399
Feature activation+4.92e+00

Pos logit contributions
 we+4.219
 accuses+3.967
 Tw+3.875
 he+3.854
 they+3.808
Neg logit contributions
agate-9.970
igate-9.797
idon-9.556
tnc-9.418
VERTISEMENT-9.215
TokenATIONS
Token ID18421
Feature activation+4.08e+00

Pos logit contributions
 we+3.384
 accuses+3.313
 albeit+3.275
 Tw+3.222
 though+3.169
Neg logit contributions
agate-3.755
idon-3.752
ability-3.610
VERTISEMENT-3.577
tnc-3.549
Token (
Token ID357
Feature activation+2.42e+00

Pos logit contributions
 accuses+2.164
 citiz+1.982
 we+1.891
 Temper+1.886
 albeit+1.873
Neg logit contributions
igate-4.079
agate-4.059
VERTISEMENT-4.055
idon-4.045
tnc-3.792
TokenReuters
Token ID12637
Feature activation+9.72e+00

Pos logit contributions
 accuses+1.119
 we+1.080
 Temper+1.056
 albeit+1.037
 though+1.035
Neg logit contributions
igate-2.433
idon-2.422
VERTISEMENT-2.394
agate-2.375
utable-2.245
Token)
Token ID8
Feature activation+2.51e+00

Pos logit contributions
 accuses+6.112
 albeit+6.060
 Temper+5.999
 we+5.984
 Tw+5.741
Neg logit contributions
igate-8.081
VERTISEMENT-7.881
idon-7.853
agate-7.642
OUT-7.406
Token -
Token ID532
Feature activation+2.38e+00

Pos logit contributions
 citiz+1.450
 accuses+1.445
 albeit+1.410
 Temper+1.379
 we+1.332
Neg logit contributions
igate-2.328
VERTISEMENT-2.286
idon-2.257
agate-2.224
OUT-2.204
Token An
Token ID1052
Feature activation+2.67e+00

Pos logit contributions
 Temper+1.072
 accuses+1.065
 albeit+1.008
 Tw+1.002
 we+0.980
Neg logit contributions
igate-2.493
VERTISEMENT-2.485
idon-2.462
agate-2.454
OUT-2.357
Token army
Token ID5428
Feature activation+3.11e+00

Pos logit contributions
 accuses+1.088
 albeit+1.050
 citiz+1.002
 we+0.978
 Temper+0.959
Neg logit contributions
VERTISEMENT-2.914
igate-2.863
idon-2.815
-2.803
OUT-2.794
Token battalion
Token ID36328
Feature activation+1.81e+00

Pos logit contributions
 accuses+1.576
 albeit+1.465
 we+1.463
 citiz+1.404
 lingu+1.398
Neg logit contributions
VERTISEMENT-3.179
igate-3.152
idon-3.109
agate-3.070
OUT-3.030
Token in
Token ID287
Feature activation-1.26e+00

Pos logit contributions
 citiz+0.739
 accuses+0.735
 we+0.721
 albeit+0.715
 Temper+0.696
Neg logit contributions
igate-1.273
idon-1.245
VERTISEMENT-1.236
agate-1.226
uments-1.176
Token babys
Token ID46711
Feature activation+4.13e+00

Pos logit contributions
igate+1.267
VERTISEMENT+1.254
idon+1.246
agate+1.224
OUT+1.197
Neg logit contributions
 accuses-0.638
 Temper-0.626
 citiz-0.623
 albeit-0.613
 we-0.611
Tokenitting
Token ID2535
Feature activation-2.57e+00

Pos logit contributions
 we+2.878
 accuses+2.874
 albeit+2.800
 citiz+2.785
 Temper+2.765
Neg logit contributions
VERTISEMENT-3.043
igate-3.023
idon-2.975
OUT-2.926
BACK-2.882
Token
Token ID198
Feature activation-1.80e+00

Pos logit contributions
igate+2.531
VERTISEMENT+2.484
idon+2.468
agate+2.413
OUT+2.400
Neg logit contributions
 citiz-1.398
 accuses-1.378
 albeit-1.347
 we-1.335
 Temper-1.330
Token
Token ID198
Feature activation-1.48e+00

Pos logit contributions
igate+1.373
OUT+1.303
idon+1.291
VERTISEMENT+1.288
agate+1.277
Neg logit contributions
 citiz-1.543
 mathemat-1.353
 accuses-1.350
 ende-1.339
etheless-1.307
TokenNearly
Token ID27927
Feature activation+1.12e+01

Pos logit contributions
igate+1.101
VERTISEMENT+1.059
idon+1.054
agate+1.025
uments+0.966
Neg logit contributions
 accuses-1.127
 albeit-1.122
 we-1.114
 Temper-1.096
 citiz-1.089
Token everyone
Token ID2506
Feature activation+9.72e+00

Pos logit contributions
 we+5.774
 accuses+5.587
 albeit+5.485
 though+5.348
 Tw+5.305
Neg logit contributions
agate-10.338
VERTISEMENT-10.089
idon-9.913
igate-9.879
BACK-9.498
Token will
Token ID481
Feature activation+1.94e+00

Pos logit contributions
 accuses+4.401
 we+4.282
 albeit+3.809
 though+3.802
 rises+3.725
Neg logit contributions
VERTISEMENT-10.342
igate-10.318
idon-10.082
tnc-9.988
agate-9.735
Token have
Token ID423
Feature activation-3.12e+00

Pos logit contributions
 accuses+0.986
 albeit+0.977
 we+0.961
 though+0.914
 Temper+0.898
Neg logit contributions
VERTISEMENT-1.965
igate-1.944
OUT-1.897
idon-1.897
agate-1.864
Token to
Token ID284
Feature activation-1.48e+00

Pos logit contributions
igate+2.828
VERTISEMENT+2.800
idon+2.776
agate+2.730
OUT+2.712
Neg logit contributions
 accuses-1.937
 citiz-1.906
 albeit-1.881
 Temper-1.845
 we-1.833
Token mend
Token ID47618
Feature activation-2.95e+00

Pos logit contributions
VERTISEMENT+1.408
igate+1.403
idon+1.367
OUT+1.357
agate+1.349
Neg logit contributions
 accuses-0.818
 albeit-0.809
 citiz-0.797
 we-0.791
 Temper-0.777
Token.
Token ID13
Feature activation+1.48e+00

Pos logit contributions
 we+0.550
 albeit+0.538
 though+0.517
 accuses+0.517
 they+0.506
Neg logit contributions
igate-0.897
VERTISEMENT-0.896
agate-0.858
idon-0.850
OUT-0.840
Token
Token ID447
Feature activation+3.34e+00

Pos logit contributions
 Temper+0.846
 we+0.846
 accuses+0.821
 albeit+0.819
 Tw+0.794
Neg logit contributions
igate-1.380
VERTISEMENT-1.351
idon-1.347
agate-1.333
OUT-1.283
Token
Token ID251
Feature activation-8.61e+00

Pos logit contributions
 citiz+2.818
 accuses+2.748
 we+2.737
 albeit+2.710
 Temper+2.686
Neg logit contributions
igate-2.109
OUT-2.018
idon-1.984
agate-1.965
VERTISEMENT-1.928
Token<|endoftext|>
Token ID50256
Feature activation+1.87e+00

Pos logit contributions
igate+8.613
VERTISEMENT+8.451
idon+8.343
agate+8.220
OUT+8.214
Neg logit contributions
 citiz-4.439
 accuses-4.304
 albeit-4.180
 Temper-4.059
 we-3.878
TokenNew
Token ID3791
Feature activation+9.18e+00

Pos logit contributions
 accuses+1.403
 we+1.398
 albeit+1.397
 Temper+1.355
 though+1.325
Neg logit contributions
igate-1.331
VERTISEMENT-1.293
idon-1.290
agate-1.232
uments-1.139
Token Delhi
Token ID12517
Feature activation+9.92e+00

Pos logit contributions
 we+4.481
 accuses+4.451
 Temper+4.416
 citiz+4.355
 albeit+4.348
Neg logit contributions
igate-8.214
VERTISEMENT-8.201
idon-8.065
OUT-8.062
agate-7.907
Token:
Token ID25
Feature activation+6.67e+00

Pos logit contributions
 accuses+6.133
 we+5.837
 Temper+5.670
 albeit+5.553
 citiz+5.546
Neg logit contributions
VERTISEMENT-8.004
idon-7.890
igate-7.850
agate-7.662
tnc-7.438
Token India
Token ID3794
Feature activation+9.32e+00

Pos logit contributions
 Temper+3.092
 accuses+2.868
 we+2.851
 albeit+2.809
 Tw+2.800
Neg logit contributions
VERTISEMENT-6.822
igate-6.737
idon-6.711
agate-6.541
OUT-6.379
Token`
Token ID63
Feature activation+4.57e+00

Pos logit contributions
 accuses+4.050
 albeit+3.548
 we+3.478
 joins+3.364
 citiz+3.342
Neg logit contributions
VERTISEMENT-9.972
igate-9.706
idon-9.501
tnc-9.500
-9.271
Tokens
Token ID82
Feature activation+1.08e+00

Pos logit contributions
 accuses+2.137
 we+2.101
 albeit+2.064
 he+1.944
 rises+1.926
Neg logit contributions
igate-4.662
VERTISEMENT-4.551
idon-4.379
agate-4.357
OUT-4.311
Token two
Token ID734
Feature activation+8.64e+00

Pos logit contributions
 citiz+0.525
 accuses+0.508
 albeit+0.502
 Temper+0.491
 we+0.474
Neg logit contributions
igate-1.124
VERTISEMENT-1.111
idon-1.099
agate-1.068
OUT-1.068
Token,
Token ID11
Feature activation+7.10e+00

Pos logit contributions
igate+4.852
VERTISEMENT+4.776
idon+4.679
agate+4.673
OUT+4.666
Neg logit contributions
 albeit-1.391
 citiz-1.377
 accuses-1.351
 we-1.257
 Temper-1.226
Token but
Token ID475
Feature activation+9.08e+00

Pos logit contributions
 albeit+1.836
 he+1.612
 we+1.608
 though+1.600
 citiz+1.447
Neg logit contributions
igate-8.792
VERTISEMENT-8.531
agate-8.499
OUT-8.453
idon-8.448
Token also
Token ID635
Feature activation+4.25e+00

Pos logit contributions
 we+4.370
 he+4.324
 albeit+4.297
 though+4.093
 citiz+4.063
Neg logit contributions
igate-8.689
idon-8.512
agate-8.485
OUT-8.380
VERTISEMENT-8.305
Token as
Token ID355
Feature activation+2.44e+00

Pos logit contributions
 albeit+1.637
 we+1.608
 citiz+1.579
 he+1.553
 though+1.534
Neg logit contributions
igate-4.711
idon-4.595
agate-4.548
VERTISEMENT-4.545
OUT-4.501
Token a
Token ID257
Feature activation-2.38e+00

Pos logit contributions
 we+1.629
 albeit+1.610
 he+1.592
 citiz+1.586
 though+1.542
Neg logit contributions
igate-2.026
idon-1.969
VERTISEMENT-1.957
agate-1.949
OUT-1.932
Token military
Token ID2422
Feature activation+4.27e+00

Pos logit contributions
igate+2.462
idon+2.388
VERTISEMENT+2.379
agate+2.360
OUT+2.327
Neg logit contributions
 citiz-1.205
 albeit-1.183
 accuses-1.151
 Temper-1.115
 we-1.114
Token av
Token ID1196
Feature activation-7.60e-01

Pos logit contributions
 albeit+1.950
 citiz+1.876
 we+1.847
 lingu+1.824
 accuses+1.817
Neg logit contributions
igate-4.479
VERTISEMENT-4.408
idon-4.402
agate-4.360
OUT-4.254
Tokeniator
Token ID38585
Feature activation-4.31e-02

Pos logit contributions
VERTISEMENT+0.841
OUT+0.810
igate+0.809
agate+0.808
idon+0.799
Neg logit contributions
 accuses-0.302
 citiz-0.297
 albeit-0.296
 we-0.284
 Temper-0.273
Token.
Token ID13
Feature activation+2.42e+00

Pos logit contributions
igate+0.046
VERTISEMENT+0.046
idon+0.045
agate+0.045
OUT+0.045
Neg logit contributions
 albeit-0.020
 we-0.019
 accuses-0.019
 citiz-0.018
 he-0.018
Token
Token ID198
Feature activation-3.14e+00

Pos logit contributions
 Temper+1.202
 albeit+1.161
 Tw+1.135
 accuses+1.129
 we+1.124
Neg logit contributions
igate-2.431
idon-2.413
agate-2.382
VERTISEMENT-2.342
OUT-2.291
Token
Token ID198
Feature activation-2.85e+00

Pos logit contributions
igate+2.331
OUT+2.218
VERTISEMENT+2.185
idon+2.180
ability+2.172
Neg logit contributions
 citiz-2.783
 mathemat-2.470
 ende-2.426
 accuses-2.389
etheless-2.369
Token I
Token ID314
Feature activation-1.23e+00

Pos logit contributions
 we+2.892
 he+2.624
 they+2.603
 though+2.585
 albeit+2.520
Neg logit contributions
VERTISEMENT-6.382
igate-6.367
agate-6.340
idon-6.238
OUT-6.142
Token was
Token ID373
Feature activation-4.72e+00

Pos logit contributions
VERTISEMENT+1.400
igate+1.400
idon+1.375
OUT+1.362
agate+1.351
Neg logit contributions
 accuses-0.463
 albeit-0.459
 citiz-0.441
 we-0.437
 Temper-0.422
Token doing
Token ID1804
Feature activation-3.77e+00

Pos logit contributions
igate+4.616
VERTISEMENT+4.569
idon+4.506
agate+4.430
OUT+4.386
Neg logit contributions
 citiz-2.556
 accuses-2.532
 albeit-2.518
 Temper-2.391
 we-2.382
Token 180
Token ID11546
Feature activation+4.74e+00

Pos logit contributions
igate+3.341
VERTISEMENT+3.321
idon+3.256
agate+3.219
OUT+3.152
Neg logit contributions
 citiz-2.387
 albeit-2.378
 accuses-2.357
 Temper-2.312
 we-2.286
Token knots
Token ID33620
Feature activation-1.67e+00

Pos logit contributions
 albeit+2.540
 we+2.510
 Temper+2.471
 accuses+2.467
 though+2.458
Neg logit contributions
agate-4.480
igate-4.463
idon-4.423
VERTISEMENT-4.370
OUT-4.193
Token,
Token ID11
Feature activation+4.89e+00

Pos logit contributions
VERTISEMENT+1.725
igate+1.721
idon+1.695
agate+1.692
OUT+1.646
Neg logit contributions
 albeit-0.822
 accuses-0.796
 citiz-0.787
 we-0.782
 Temper-0.772
Token and
Token ID290
Feature activation+6.68e+00

Pos logit contributions
 albeit+2.578
 we+2.426
 though+2.386
 accuses+2.323
 citiz+2.266
Neg logit contributions
VERTISEMENT-4.877
igate-4.875
idon-4.791
agate-4.754
uments-4.612
Token I
Token ID314
Feature activation-1.98e+00

Pos logit contributions
 we+3.594
 albeit+3.485
 though+3.419
 he+3.366
 they+3.361
Neg logit contributions
VERTISEMENT-6.435
igate-6.383
agate-6.258
idon-6.256
uments-6.104
Token eased
Token ID44828
Feature activation-1.74e+00

Pos logit contributions
igate+2.202
VERTISEMENT+2.189
idon+2.148
agate+2.126
OUT+2.124
Neg logit contributions
 accuses-0.788
 albeit-0.778
 citiz-0.770
 we-0.727
 Temper-0.725
Token off
Token ID572
Feature activation-1.02e+00

Pos logit contributions
VERTISEMENT+1.460
igate+1.435
idon+1.424
agate+1.395
OUT+1.360
Neg logit contributions
 albeit-1.183
 accuses-1.168
 citiz-1.152
 Temper-1.145
 we-1.142
Token on
Token ID319
Feature activation-1.31e+00

Pos logit contributions
VERTISEMENT+0.948
igate+0.937
idon+0.924
agate+0.924
OUT+0.899
Neg logit contributions
 albeit-0.612
 accuses-0.591
 citiz-0.590
 we-0.585
 Temper-0.578
Token but
Token ID475
Feature activation+1.15e+01

Pos logit contributions
 albeit+3.988
 we+3.777
 though+3.700
 accuses+3.623
 citiz+3.524
Neg logit contributions
VERTISEMENT-7.385
igate-7.320
idon-7.210
agate-7.073
OUT-7.051
Token is
Token ID318
Feature activation-2.53e+00

Pos logit contributions
 we+6.282
 they+5.994
 albeit+5.952
 accuses+5.897
 he+5.679
Neg logit contributions
VERTISEMENT-10.779
igate-10.363
idon-10.207
tnc-10.013
agate-9.904
Token sometimes
Token ID3360
Feature activation+5.49e+00

Pos logit contributions
VERTISEMENT+2.302
igate+2.298
idon+2.253
agate+2.213
OUT+2.170
Neg logit contributions
 albeit-1.556
 citiz-1.553
 accuses-1.549
 we-1.502
 lingu-1.456
Token also
Token ID635
Feature activation+4.29e+00

Pos logit contributions
 we+3.061
 albeit+3.033
 accuses+3.020
 lingu+2.942
 though+2.896
Neg logit contributions
VERTISEMENT-5.265
idon-5.128
agate-5.081
igate-5.075
uments-4.849
Token used
Token ID973
Feature activation-1.53e+00

Pos logit contributions
 albeit+2.413
 accuses+2.382
 we+2.381
 lingu+2.332
 though+2.284
Neg logit contributions
VERTISEMENT-4.151
idon-4.051
igate-3.986
agate-3.967
OUT-3.790
Token as
Token ID355
Feature activation+1.16e+00

Pos logit contributions
VERTISEMENT+1.591
igate+1.575
idon+1.553
agate+1.525
OUT+1.522
Neg logit contributions
 albeit-0.768
 accuses-0.744
 we-0.731
 citiz-0.723
 though-0.706
Token an
Token ID281
Feature activation-5.04e-01

Pos logit contributions
 albeit+0.810
 accuses+0.793
 we+0.793
 though+0.790
 citiz+0.784
Neg logit contributions
idon-0.945
igate-0.935
agate-0.922
VERTISEMENT-0.916
OUT-0.894
Token umbrella
Token ID25510
Feature activation+3.92e+00

Pos logit contributions
igate+0.549
idon+0.543
VERTISEMENT+0.541
agate+0.528
OUT+0.523
Neg logit contributions
 albeit-0.215
 citiz-0.211
 accuses-0.210
 we-0.197
 Temper-0.194
Token term
Token ID3381
Feature activation-3.46e-01

Pos logit contributions
 albeit+1.529
 we+1.502
 accuses+1.417
 citiz+1.387
 though+1.356
Neg logit contributions
igate-4.461
VERTISEMENT-4.431
idon-4.430
agate-4.373
OUT-4.239
Token that
Token ID326
Feature activation+4.06e+00

Pos logit contributions
igate+0.341
VERTISEMENT+0.338
idon+0.332
agate+0.328
OUT+0.325
Neg logit contributions
 albeit-0.195
 we-0.191
 accuses-0.187
 though-0.181
 citiz-0.180
Token encompasses
Token ID38932
Feature activation-6.18e-01

Pos logit contributions
 accuses+1.674
 we+1.611
 albeit+1.549
 citiz+1.498
 they+1.445
Neg logit contributions
igate-4.597
VERTISEMENT-4.563
idon-4.521
agate-4.471
OUT-4.408
TokenRead
Token ID5569
Feature activation-2.69e+00

Pos logit contributions
igate+3.216
VERTISEMENT+3.138
OUT+3.065
idon+3.047
agate+3.011
Neg logit contributions
 citiz-4.638
 accuses-4.420
 albeit-4.392
 Temper-4.344
 we-4.193
Token more
Token ID517
Feature activation+3.71e+00

Pos logit contributions
ability+1.574
igate+1.560
OUT+1.530
VERTISEMENT+1.518
idon+1.456
Neg logit contributions
 citiz-2.507
 albeit-2.400
 ende-2.385
 cannabin-2.306
 mathemat-2.288
Token by
Token ID416
Feature activation-3.27e-02

Pos logit contributions
 citiz+2.409
 accuses+2.338
 albeit+2.289
 Temper+2.245
 we+2.145
Neg logit contributions
igate-3.224
VERTISEMENT-3.152
idon-3.113
agate-3.065
OUT-3.059
Token Justin
Token ID10799
Feature activation+7.18e+00

Pos logit contributions
igate+0.036
idon+0.036
VERTISEMENT+0.036
OUT+0.035
ability+0.034
Neg logit contributions
 citiz-0.014
 accuses-0.013
 albeit-0.013
 cannabin-0.012
 Temper-0.012
Token R
Token ID371
Feature activation+4.22e-01

Pos logit contributions
 accuses+2.734
 Tw+2.510
 albeit+2.473
 we+2.387
 citiz+2.358
Neg logit contributions
igate-7.914
idon-7.866
agate-7.824
VERTISEMENT-7.802
tnc-7.624
Tokenaim
Token ID1385
Feature activation+3.27e+00

Pos logit contributions
 citiz+0.180
 accuses+0.172
 albeit+0.167
 Temper+0.161
 we+0.151
Neg logit contributions
igate-0.462
VERTISEMENT-0.452
idon-0.448
agate-0.442
OUT-0.442
Tokenondo
Token ID22311
Feature activation+1.22e+00

Pos logit contributions
 citiz+1.784
 accuses+1.784
 albeit+1.742
 Temper+1.674
 we+1.660
Neg logit contributions
igate-3.098
VERTISEMENT-3.057
idon-2.999
agate-2.979
OUT-2.975
Token<|endoftext|>
Token ID50256
Feature activation-1.75e+00

Pos logit contributions
 accuses+0.832
 citiz+0.831
 albeit+0.813
 Temper+0.786
 we+0.776
Neg logit contributions
igate-0.999
VERTISEMENT-0.988
idon-0.980
agate-0.959
OUT-0.946
Tokenlic
Token ID677
Feature activation+4.17e+00

Pos logit contributions
igate+1.276
VERTISEMENT+1.243
idon+1.232
agate+1.185
OUT+1.110
Neg logit contributions
 accuses-1.351
 albeit-1.341
 we-1.323
 Temper-1.301
 citiz-1.269
Tokenhess
Token ID33979
Feature activation+7.77e+00

Pos logit contributions
 accuses+3.144
 citiz+3.128
 Temper+3.092
 we+3.091
 albeit+3.070
Neg logit contributions
igate-2.885
VERTISEMENT-2.863
OUT-2.782
idon-2.742
agate-2.679
Token.
Token ID13
Feature activation+2.91e+00

Pos logit contributions
 we+4.696
 accuses+4.681
 albeit+4.598
 Tw+4.537
 citiz+4.478
Neg logit contributions
VERTISEMENT-6.109
igate-5.989
idon-5.788
OUT-5.766
agate-5.666
Token great
Token ID1049
Feature activation+5.83e+00

Pos logit contributions
igate+1.668
idon+1.614
VERTISEMENT+1.613
agate+1.589
OUT+1.554
Neg logit contributions
 albeit-1.173
 citiz-1.156
 accuses-1.146
 Temper-1.129
 we-1.117
Token hike
Token ID18390
Feature activation-5.74e+00

Pos logit contributions
 albeit+2.962
 we+2.832
 though+2.678
 Temper+2.673
 he+2.639
Neg logit contributions
igate-5.788
VERTISEMENT-5.768
agate-5.700
idon-5.666
TABLE-5.427
Token,
Token ID11
Feature activation+6.23e+00

Pos logit contributions
igate+5.813
VERTISEMENT+5.710
idon+5.632
agate+5.568
OUT+5.535
Neg logit contributions
 citiz-2.899
 accuses-2.840
 albeit-2.809
 Temper-2.699
 we-2.616
Token though
Token ID996
Feature activation+7.80e+00

Pos logit contributions
 albeit+3.705
 we+3.518
 though+3.496
 Temper+3.301
 accuses+3.266
Neg logit contributions
igate-5.801
VERTISEMENT-5.682
idon-5.615
agate-5.611
OUT-5.455
Token,
Token ID11
Feature activation+6.45e+00

Pos logit contributions
 we+4.563
 albeit+4.342
 though+4.222
 they+4.218
 Temper+4.185
Neg logit contributions
igate-7.098
VERTISEMENT-7.029
agate-6.860
idon-6.857
OUT-6.672
Token and
Token ID290
Feature activation+7.30e+00

Pos logit contributions
 albeit+3.278
 we+3.127
 though+3.063
 Temper+2.923
 accuses+2.872
Neg logit contributions
igate-6.527
VERTISEMENT-6.436
idon-6.327
agate-6.230
OUT-6.143
Token the
Token ID262
Feature activation+1.24e+00

Pos logit contributions
 we+4.321
 albeit+4.207
 though+4.086
 Temper+3.980
 they+3.980
Neg logit contributions
igate-6.637
VERTISEMENT-6.542
agate-6.419
idon-6.418
OUT-6.262
Token weather
Token ID6193
Feature activation+7.04e-01

Pos logit contributions
 albeit+0.663
 citiz+0.660
 Temper+0.656
 accuses+0.644
 we+0.637
Neg logit contributions
igate-1.219
VERTISEMENT-1.202
agate-1.188
idon-1.169
OUT-1.142
Token was
Token ID373
Feature activation-3.62e+00

Pos logit contributions
 accuses+0.269
 albeit+0.267
 we+0.265
 Temper+0.260
 citiz+0.252
Neg logit contributions
igate-0.797
VERTISEMENT-0.778
idon-0.776
agate-0.772
OUT-0.751
Token perfect
Token ID2818
Feature activation+3.91e+00

Pos logit contributions
igate+3.049
VERTISEMENT+2.986
idon+2.961
agate+2.892
OUT+2.867
Neg logit contributions
 citiz-2.447
 accuses-2.409
 albeit-2.381
 Temper-2.330
 we-2.270
Token but
Token ID475
Feature activation+8.50e+00

Pos logit contributions
 albeit+2.280
 we+2.212
 Temper+2.157
 though+2.151
 accuses+2.050
Neg logit contributions
igate-3.691
VERTISEMENT-3.636
idon-3.633
agate-3.547
OUT-3.449
Token in
Token ID287
Feature activation-6.01e+00

Pos logit contributions
VERTISEMENT+0.632
igate+0.627
idon+0.620
agate+0.611
OUT+0.594
Neg logit contributions
 albeit-0.369
 accuses-0.367
 citiz-0.363
 we-0.362
 though-0.345
Token such
Token ID884
Feature activation+7.19e+00

Pos logit contributions
igate+5.533
VERTISEMENT+5.467
idon+5.323
agate+5.321
OUT+5.193
Neg logit contributions
 citiz-3.690
 albeit-3.606
 accuses-3.605
 Temper-3.479
 we-3.450
Token a
Token ID257
Feature activation-2.48e+00

Pos logit contributions
 albeit+4.160
 we+4.143
 though+3.957
 accuses+3.914
 he+3.880
Neg logit contributions
igate-6.449
VERTISEMENT-6.431
agate-6.246
idon-6.216
-5.826
Token state
Token ID1181
Feature activation+1.42e+00

Pos logit contributions
igate+2.499
VERTISEMENT+2.433
idon+2.418
agate+2.382
OUT+2.338
Neg logit contributions
 citiz-1.295
 albeit-1.288
 accuses-1.287
 we-1.235
 Temper-1.227
Token of
Token ID286
Feature activation-1.19e+00

Pos logit contributions
 we+0.442
 albeit+0.415
 they+0.408
 accuses+0.399
 he+0.392
Neg logit contributions
igate-1.742
VERTISEMENT-1.725
idon-1.700
agate-1.694
OUT-1.670
Token dis
Token ID595
Feature activation-1.45e+00

Pos logit contributions
igate+1.192
VERTISEMENT+1.167
idon+1.147
agate+1.146
OUT+1.112
Neg logit contributions
 albeit-0.623
 accuses-0.620
 citiz-0.617
 we-0.599
 Temper-0.592
Tokenrepair
Token ID49932
Feature activation-4.14e+00

Pos logit contributions
VERTISEMENT+1.209
igate+1.203
idon+1.174
agate+1.153
OUT+1.143
Neg logit contributions
 albeit-0.969
 accuses-0.963
 citiz-0.947
 we-0.932
 Temper-0.920
Token,
Token ID11
Feature activation+5.35e+00

Pos logit contributions
igate+4.060
VERTISEMENT+3.997
idon+3.928
agate+3.888
OUT+3.843
Neg logit contributions
 citiz-2.282
 accuses-2.268
 albeit-2.263
 we-2.175
 Temper-2.147
Token its
Token ID663
Feature activation-7.79e-01

Pos logit contributions
 albeit+2.955
 accuses+2.845
 we+2.791
 citiz+2.748
 though+2.729
Neg logit contributions
igate-5.184
VERTISEMENT-5.153
idon-5.038
agate-4.994
OUT-4.897
Token value
Token ID1988
Feature activation-1.22e+00

Pos logit contributions
igate+0.752
VERTISEMENT+0.737
idon+0.732
agate+0.724
OUT+0.703
Neg logit contributions
 citiz-0.438
 albeit-0.434
 accuses-0.434
 we-0.419
 Temper-0.416
Token is
Token ID318
Feature activation-5.26e+00

Pos logit contributions
igate+1.334
VERTISEMENT+1.307
idon+1.306
agate+1.275
OUT+1.266
Neg logit contributions
 accuses-0.540
 albeit-0.524
 citiz-0.522
 we-0.508
 rises-0.487
Token points
Token ID2173
Feature activation+1.65e+00

Pos logit contributions
 accuses+0.709
 albeit+0.703
 we+0.675
 citiz+0.652
 though+0.634
Neg logit contributions
idon-2.308
agate-2.300
igate-2.292
VERTISEMENT-2.264
OUT-2.217
Token.
Token ID13
Feature activation+3.19e+00

Pos logit contributions
 albeit+0.913
 we+0.870
 accuses+0.870
 though+0.856
 citiz+0.828
Neg logit contributions
igate-1.609
agate-1.590
VERTISEMENT-1.586
idon-1.573
tnc-1.549
Token
Token ID198
Feature activation-2.62e+00

Pos logit contributions
 Temper+1.338
 accuses+1.292
 albeit+1.242
 Tw+1.210
 we+1.208
Neg logit contributions
idon-3.447
igate-3.435
VERTISEMENT-3.434
agate-3.378
OUT-3.290
Token
Token ID198
Feature activation-2.82e+00

Pos logit contributions
igate+1.949
OUT+1.855
idon+1.816
VERTISEMENT+1.814
ability+1.802
Neg logit contributions
 citiz-2.341
 mathemat-2.070
 ende-2.040
 accuses-2.013
etheless-1.986
TokenCo
Token ID7222
Feature activation-1.90e+00

Pos logit contributions
igate+2.072
idon+2.027
VERTISEMENT+2.014
agate+1.949
OUT+1.843
Neg logit contributions
 accuses-2.152
 albeit-2.139
 Temper-2.102
 we-2.081
 citiz-2.016
Tokenos
Token ID418
Feature activation-3.16e+00

Pos logit contributions
igate+1.536
VERTISEMENT+1.484
idon+1.480
agate+1.462
OUT+1.436
Neg logit contributions
 citiz-1.369
 accuses-1.315
 albeit-1.293
 Temper-1.266
 we-1.213
Token County
Token ID3418
Feature activation-9.90e-01

Pos logit contributions
igate+3.328
VERTISEMENT+3.295
idon+3.229
OUT+3.214
agate+3.182
Neg logit contributions
 accuses-1.520
 citiz-1.516
 albeit-1.452
 Temper-1.408
 we-1.399
Token is
Token ID318
Feature activation-3.90e+00

Pos logit contributions
igate+1.122
VERTISEMENT+1.100
idon+1.094
agate+1.077
OUT+1.073
Neg logit contributions
 accuses-0.402
 citiz-0.395
 albeit-0.372
 we-0.365
 Temper-0.363
Token not
Token ID407
Feature activation+1.86e+00

Pos logit contributions
igate+3.685
VERTISEMENT+3.657
idon+3.624
agate+3.533
OUT+3.512
Neg logit contributions
 citiz-2.239
 albeit-2.216
 accuses-2.215
 we-2.115
 Temper-2.100
Token an
Token ID281
Feature activation-1.22e+00

Pos logit contributions
 albeit+1.162
 we+1.148
 accuses+1.128
 citiz+1.122
 though+1.113
Neg logit contributions
VERTISEMENT-1.645
igate-1.639
idon-1.612
agate-1.582
OUT-1.538
Token isolated
Token ID11557
Feature activation+5.21e+00

Pos logit contributions
igate+1.270
VERTISEMENT+1.250
idon+1.238
agate+1.216
OUT+1.208
Neg logit contributions
 citiz-0.570
 accuses-0.561
 albeit-0.554
 Temper-0.527
 we-0.513
Token is
Token ID318
Feature activation-6.14e+00

Pos logit contributions
igate+5.270
VERTISEMENT+5.171
idon+5.124
agate+5.048
OUT+5.040
Neg logit contributions
 citiz-1.865
 accuses-1.861
 albeit-1.781
 Temper-1.686
 we-1.656
Token unstoppable
Token ID40181
Feature activation+7.11e-01

Pos logit contributions
igate+5.968
VERTISEMENT+5.852
idon+5.779
agate+5.689
OUT+5.679
Neg logit contributions
 citiz-3.346
 accuses-3.256
 albeit-3.175
 Temper-3.084
 we-2.966
Token,
Token ID11
Feature activation+6.31e+00

Pos logit contributions
 albeit+0.404
 we+0.395
 accuses+0.390
 though+0.386
 citiz+0.384
Neg logit contributions
igate-0.673
VERTISEMENT-0.663
idon-0.657
OUT-0.650
agate-0.645
Token
Token ID447
Feature activation+5.41e+00

Pos logit contributions
 albeit+3.953
 we+3.916
 accuses+3.770
 though+3.732
 citiz+3.719
Neg logit contributions
igate-5.409
idon-5.334
VERTISEMENT-5.299
agate-5.133
OUT-5.073
Token
Token ID251
Feature activation-8.19e+00

Pos logit contributions
 citiz+4.396
 we+4.321
 accuses+4.297
 albeit+4.297
 though+4.243
Neg logit contributions
igate-3.514
OUT-3.300
idon-3.282
agate-3.233
VERTISEMENT-3.214
Token the
Token ID262
Feature activation+3.88e-01

Pos logit contributions
igate+9.159
VERTISEMENT+9.020
idon+8.879
OUT+8.792
agate+8.758
Neg logit contributions
 citiz-3.117
 accuses-2.924
 albeit-2.780
 Temper-2.708
 cannabin-2.473
Token Qu
Token ID2264
Feature activation-3.12e+00

Pos logit contributions
 citiz+0.167
 accuses+0.165
 albeit+0.164
 Temper+0.160
 we+0.155
Neg logit contributions
igate-0.418
VERTISEMENT-0.414
idon-0.410
agate-0.405
OUT-0.398
Tokenist
Token ID396
Feature activation-4.89e+00

Pos logit contributions
VERTISEMENT+3.602
igate+3.565
OUT+3.486
idon+3.483
+3.455
Neg logit contributions
 accuses-1.134
 citiz-1.080
 albeit-1.039
 we-1.023
 Temper-1.020
Token campaign
Token ID1923
Feature activation-5.16e+00

Pos logit contributions
igate+4.717
VERTISEMENT+4.623
idon+4.574
agate+4.489
OUT+4.486
Neg logit contributions
 citiz-2.714
 accuses-2.667
 albeit-2.585
 Temper-2.523
 we-2.438
Token said
Token ID531
Feature activation-2.64e-01

Pos logit contributions
igate+6.002
VERTISEMENT+5.906
idon+5.860
agate+5.784
OUT+5.763
Neg logit contributions
 accuses-1.863
 citiz-1.851
 albeit-1.740
 Temper-1.656
 we-1.600
Token in
Token ID287
Feature activation-4.15e+00

Pos logit contributions
idon+0.259
VERTISEMENT+0.259
igate+0.257
agate+0.253
OUT+0.251
Neg logit contributions
 accuses-0.140
 albeit-0.135
 we-0.135
 citiz-0.134
 Temper-0.131
Token World
Token ID2159
Feature activation+6.36e+00

Pos logit contributions
 accuses+1.079
 citiz+1.075
 albeit+1.050
 Temper+1.043
 we+1.031
Neg logit contributions
igate-2.811
VERTISEMENT-2.789
agate-2.724
idon-2.713
OUT-2.641
Token War
Token ID1810
Feature activation+3.02e+00

Pos logit contributions
 citiz+3.065
 accuses+2.947
 albeit+2.847
 Temper+2.753
 we+2.614
Neg logit contributions
igate-6.623
VERTISEMENT-6.492
idon-6.421
agate-6.330
OUT-6.328
Token II
Token ID2873
Feature activation+8.83e-01

Pos logit contributions
 citiz+1.359
 accuses+1.321
 albeit+1.281
 Temper+1.238
 we+1.180
Neg logit contributions
igate-3.213
VERTISEMENT-3.158
idon-3.119
agate-3.077
OUT-3.072
Token veteran
Token ID9298
Feature activation+2.06e+00

Pos logit contributions
 citiz+0.474
 albeit+0.472
 accuses+0.457
 we+0.456
 Temper+0.447
Neg logit contributions
igate-0.876
VERTISEMENT-0.857
idon-0.848
agate-0.837
OUT-0.824
Token to
Token ID284
Feature activation-1.31e+00

Pos logit contributions
 accuses+1.112
 albeit+1.083
 we+1.078
 citiz+1.067
 Temper+1.042
Neg logit contributions
VERTISEMENT-2.046
igate-2.027
agate-1.964
idon-1.961
OUT-1.913
Token death
Token ID1918
Feature activation+3.02e-01

Pos logit contributions
igate+1.170
VERTISEMENT+1.157
idon+1.138
agate+1.113
OUT+1.096
Neg logit contributions
 accuses-0.837
 citiz-0.813
 Temper-0.807
 albeit-0.802
 we-0.793
Token.
Token ID13
Feature activation+2.58e+00

Pos logit contributions
 accuses+0.166
 albeit+0.162
 we+0.158
 citiz+0.156
 Temper+0.154
Neg logit contributions
VERTISEMENT-0.299
igate-0.293
idon-0.287
agate-0.284
OUT-0.277
Token
Token ID198
Feature activation-3.13e+00

Pos logit contributions
 Temper+1.328
 accuses+1.287
 Tw+1.256
 we+1.218
 albeit+1.201
Neg logit contributions
igate-2.566
idon-2.562
VERTISEMENT-2.523
agate-2.523
ability-2.437
Token
Token ID198
Feature activation-3.05e+00

Pos logit contributions
igate+2.276
OUT+2.158
ability+2.106
idon+2.102
VERTISEMENT+2.097
Neg logit contributions
 citiz-2.874
 mathemat-2.557
 ende-2.511
etheless-2.448
 accuses-2.431
TokenDel
Token ID13856
Feature activation-2.29e+00

Pos logit contributions
igate+2.314
VERTISEMENT+2.258
idon+2.250
agate+2.189
OUT+2.048
Neg logit contributions
 accuses-2.303
 albeit-2.249
 Temper-2.231
 we-2.198
 citiz-2.151
Tokenbert
Token ID4835
Feature activation-1.56e+00

Pos logit contributions
igate+2.059
idon+1.982
VERTISEMENT+1.982
agate+1.956
OUT+1.941
Neg logit contributions
 citiz-1.505
 accuses-1.371
 albeit-1.343
soever-1.312
 Temper-1.312
TokenIn
Token ID818
Feature activation-5.64e+00

Pos logit contributions
igate+3.392
VERTISEMENT+3.321
idon+3.299
agate+3.257
OUT+2.995
Neg logit contributions
 accuses-3.875
 albeit-3.831
 Temper-3.767
 we-3.717
 citiz-3.649
Token the
Token ID262
Feature activation-2.37e-01

Pos logit contributions
VERTISEMENT+5.347
igate+5.287
idon+5.165
agate+5.153
OUT+5.113
Neg logit contributions
 accuses-3.229
 citiz-3.214
 albeit-3.156
 Temper-3.086
 we-3.013
Token second
Token ID1218
Feature activation+6.53e+00

Pos logit contributions
VERTISEMENT+0.244
igate+0.239
agate+0.234
idon+0.234
TABLE+0.230
Neg logit contributions
 albeit-0.115
 accuses-0.114
 Temper-0.112
 citiz-0.112
 we-0.108
Token case
Token ID1339
Feature activation-2.40e+00

Pos logit contributions
 albeit+2.788
 accuses+2.692
 we+2.661
 though+2.549
 Temper+2.462
Neg logit contributions
VERTISEMENT-6.967
agate-6.708
TABLE-6.688
igate-6.599
idon-6.578
Token,
Token ID11
Feature activation+4.75e+00

Pos logit contributions
VERTISEMENT+2.514
igate+2.495
idon+2.428
agate+2.427
OUT+2.401
Neg logit contributions
 accuses-1.155
 citiz-1.149
 albeit-1.118
 we-1.081
 Temper-1.062
Token the
Token ID262
Feature activation+3.51e-01

Pos logit contributions
 albeit+2.850
 accuses+2.803
 we+2.757
 though+2.718
 Temper+2.665
Neg logit contributions
VERTISEMENT-4.345
agate-4.192
igate-4.070
idon-4.063
OUT-4.025
Token court
Token ID2184
Feature activation-1.58e+00

Pos logit contributions
 citiz+0.158
 albeit+0.157
 accuses+0.157
 Temper+0.150
 we+0.149
Neg logit contributions
VERTISEMENT-0.377
agate-0.365
igate-0.364
idon-0.362
OUT-0.355
Token found
Token ID1043
Feature activation-4.83e+00

Pos logit contributions
VERTISEMENT+1.557
igate+1.548
idon+1.524
OUT+1.508
agate+1.508
Neg logit contributions
 accuses-0.871
 citiz-0.839
 albeit-0.835
 we-0.809
 Temper-0.782
Token consultation
Token ID18103
Feature activation-4.74e+00

Pos logit contributions
VERTISEMENT+4.505
igate+4.478
idon+4.408
agate+4.391
OUT+4.301
Neg logit contributions
 citiz-2.893
 accuses-2.818
 albeit-2.794
 Temper-2.686
 we-2.681
Token with
Token ID351
Feature activation-4.21e+00

Pos logit contributions
igate+4.955
VERTISEMENT+4.898
idon+4.862
agate+4.779
OUT+4.772
Neg logit contributions
 citiz-2.279
 accuses-2.276
 albeit-2.275
 we-2.141
 Temper-2.094
Token the
Token ID262
Feature activation-1.78e+00

Pos logit contributions
VERTISEMENT+4.279
igate+4.234
idon+4.172
agate+4.167
OUT+4.081
Neg logit contributions
 citiz-2.169
 accuses-2.094
 albeit-2.081
 Temper-2.007
 we-1.968
Token tailored
Token ID27571
Feature activation+4.98e-01

Pos logit contributions
igate+3.028
VERTISEMENT+2.967
agate+2.918
idon+2.901
OUT+2.867
Neg logit contributions
 citiz-1.799
 albeit-1.775
 accuses-1.734
 we-1.705
 Temper-1.661
Token to
Token ID284
Feature activation-3.81e+00

Pos logit contributions
 citiz+0.285
 albeit+0.283
 we+0.275
 though+0.271
 accuses+0.267
Neg logit contributions
idon-0.471
VERTISEMENT-0.471
igate-0.467
OUT-0.462
agate-0.455
Token an
Token ID281
Feature activation-1.81e+00

Pos logit contributions
igate+3.593
VERTISEMENT+3.537
idon+3.480
agate+3.465
OUT+3.416
Neg logit contributions
 citiz-2.238
 accuses-2.167
 albeit-2.138
 Temper-2.087
 we-2.057
Token individual
Token ID1981
Feature activation+2.55e+00

Pos logit contributions
igate+1.992
VERTISEMENT+1.960
idon+1.943
agate+1.925
OUT+1.910
Neg logit contributions
 citiz-0.752
 accuses-0.733
 albeit-0.719
 Temper-0.681
 we-0.659
Token
Token ID447
Feature activation+6.39e+00

Pos logit contributions
 citiz+1.112
 albeit+1.101
 accuses+1.077
 we+1.057
 Temper+1.015
Neg logit contributions
igate-2.744
agate-2.687
idon-2.678
VERTISEMENT-2.665
OUT-2.637
Token
Token ID247
Feature activation-1.14e+01

Pos logit contributions
 albeit+5.190
 citiz+5.028
 accuses+5.017
 we+4.982
 though+4.955
Neg logit contributions
igate-3.966
idon-3.908
OUT-3.831
VERTISEMENT-3.771
agate-3.659
Tokens
Token ID82
Feature activation-1.94e+00

Pos logit contributions
igate+10.347
VERTISEMENT+10.164
idon+9.995
agate+9.837
OUT+9.833
Neg logit contributions
 citiz-6.670
 accuses-6.468
 albeit-6.203
 Temper-6.136
 we-5.805
Token needs
Token ID2476
Feature activation-2.06e+00

Pos logit contributions
igate+2.033
VERTISEMENT+1.983
idon+1.981
agate+1.949
OUT+1.933
Neg logit contributions
 citiz-0.930
 albeit-0.909
 accuses-0.908
 Temper-0.864
 we-0.851
Token.
Token ID13
Feature activation+8.91e-01

Pos logit contributions
igate+2.108
VERTISEMENT+2.055
idon+2.032
agate+2.017
OUT+2.002
Neg logit contributions
 albeit-1.079
 citiz-1.066
 we-1.027
 accuses-1.025
 though-1.001
Token One
Token ID1881
Feature activation+3.71e+00

Pos logit contributions
 Temper+0.446
 accuses+0.442
 albeit+0.441
 citiz+0.424
 we+0.420
Neg logit contributions
igate-0.887
idon-0.868
VERTISEMENT-0.867
agate-0.865
OUT-0.848
Token only
Token ID691
Feature activation+5.96e+00

Pos logit contributions
 accuses+1.700
 albeit+1.691
 we+1.686
 citiz+1.603
 though+1.590
Neg logit contributions
igate-3.901
agate-3.833
VERTISEMENT-3.763
idon-3.702
OUT-3.632
Token especially
Token ID2592
Feature activation+7.16e+00

Pos logit contributions
 albeit+1.901
 accuses+1.862
 we+1.838
 citiz+1.811
 though+1.753
Neg logit contributions
VERTISEMENT-4.185
igate-4.177
idon-4.085
agate-4.041
OUT-3.974
Token,
Token ID11
Feature activation+4.55e+00

Pos logit contributions
 we+3.568
 accuses+3.553
 though+3.542
 albeit+3.506
 they+3.429
Neg logit contributions
VERTISEMENT-6.940
idon-6.900
igate-6.844
agate-6.812
uments-6.555
Token because
Token ID780
Feature activation+6.56e+00

Pos logit contributions
 albeit+2.016
 accuses+1.997
 we+1.967
 citiz+1.914
 though+1.874
Neg logit contributions
VERTISEMENT-4.850
igate-4.840
idon-4.745
agate-4.682
OUT-4.600
Token they
Token ID484
Feature activation-3.62e+00

Pos logit contributions
 we+2.943
 they+2.854
 accuses+2.791
 albeit+2.771
 he+2.701
Neg logit contributions
igate-6.961
VERTISEMENT-6.852
idon-6.745
agate-6.742
tnc-6.587
Token
Token ID447
Feature activation+4.83e+00

Pos logit contributions
igate+3.852
VERTISEMENT+3.842
idon+3.766
agate+3.701
OUT+3.700
Neg logit contributions
 accuses-1.649
 citiz-1.619
 albeit-1.591
 we-1.524
 Temper-1.522
Token
Token ID247
Feature activation-1.43e+01

Pos logit contributions
 albeit+4.206
 accuses+4.119
 we+4.087
 citiz+4.085
 Temper+3.999
Neg logit contributions
igate-2.909
VERTISEMENT-2.850
idon-2.778
OUT-2.699
agate-2.680
Tokenre
Token ID260
Feature activation-4.45e+00

Pos logit contributions
igate+10.899
idon+10.415
VERTISEMENT+10.408
OUT+10.207
agate+10.177
Neg logit contributions
 citiz-10.504
 accuses-10.054
 albeit-9.734
 Temper-9.671
 lingu-9.238
Token black
Token ID2042
Feature activation+1.26e+00

Pos logit contributions
igate+4.242
VERTISEMENT+4.201
idon+4.165
agate+4.068
OUT+4.021
Neg logit contributions
 citiz-2.540
 accuses-2.525
 albeit-2.445
 Temper-2.373
 we-2.368
Token and
Token ID290
Feature activation+3.66e+00

Pos logit contributions
 citiz+0.604
 albeit+0.602
 accuses+0.593
 we+0.586
 though+0.569
Neg logit contributions
VERTISEMENT-1.300
igate-1.298
idon-1.274
agate-1.266
OUT-1.235
Token female
Token ID4048
Feature activation+2.98e+00

Pos logit contributions
 we+1.793
 accuses+1.764
 citiz+1.748
 they+1.739
 albeit+1.725
Neg logit contributions
VERTISEMENT-3.784
igate-3.719
idon-3.628
agate-3.588
OUT-3.524
Token.
Token ID13
Feature activation+4.21e-01

Pos logit contributions
 citiz+1.562
 albeit+1.517
 accuses+1.513
 we+1.509
 though+1.446
Neg logit contributions
igate-2.994
VERTISEMENT-2.952
idon-2.908
agate-2.896
OUT-2.816
Token that
Token ID326
Feature activation+2.45e+00

Pos logit contributions
 we+0.216
 albeit+0.215
 accuses+0.209
 citiz+0.209
 though+0.207
Neg logit contributions
VERTISEMENT-0.366
igate-0.364
idon-0.358
agate-0.349
OUT-0.346
Token the
Token ID262
Feature activation+5.04e-01

Pos logit contributions
 we+1.246
 accuses+1.244
 albeit+1.238
 citiz+1.234
 Temper+1.231
Neg logit contributions
igate-2.424
VERTISEMENT-2.385
idon-2.382
agate-2.356
OUT-2.291
Token store
Token ID3650
Feature activation-1.94e+00

Pos logit contributions
 citiz+0.228
 accuses+0.226
 albeit+0.224
 Temper+0.223
 we+0.215
Neg logit contributions
igate-0.533
VERTISEMENT-0.526
idon-0.521
agate-0.512
OUT-0.505
Token properties
Token ID6608
Feature activation-3.23e+00

Pos logit contributions
igate+2.156
idon+2.126
VERTISEMENT+2.097
agate+2.068
OUT+2.064
Neg logit contributions
 accuses-0.818
 citiz-0.788
 albeit-0.775
 we-0.751
 Temper-0.726
Token
Token ID447
Feature activation+7.18e+00

Pos logit contributions
igate+3.472
VERTISEMENT+3.437
idon+3.419
OUT+3.353
agate+3.349
Neg logit contributions
 citiz-1.417
 accuses-1.404
 albeit-1.375
 Temper-1.326
 we-1.326
Token
Token ID247
Feature activation-1.03e+01

Pos logit contributions
 albeit+5.222
 citiz+5.216
 we+5.149
 accuses+5.085
 he+4.978
Neg logit contributions
igate-4.988
idon-4.865
OUT-4.558
VERTISEMENT-4.550
tnc-4.470
Token primary
Token ID4165
Feature activation+4.56e+00

Pos logit contributions
igate+9.119
VERTISEMENT+8.952
idon+8.797
OUT+8.656
agate+8.644
Neg logit contributions
 citiz-6.515
 accuses-6.426
 albeit-6.285
 Temper-6.099
 we-5.952
Token use
Token ID779
Feature activation-2.06e+00

Pos logit contributions
 albeit+2.214
 accuses+2.200
 citiz+2.199
 we+2.141
 though+2.037
Neg logit contributions
igate-4.573
VERTISEMENT-4.525
idon-4.521
agate-4.397
OUT-4.335
Token is
Token ID318
Feature activation-4.67e+00

Pos logit contributions
igate+2.299
idon+2.244
VERTISEMENT+2.219
agate+2.205
OUT+2.180
Neg logit contributions
 citiz-0.861
 accuses-0.859
 albeit-0.849
 we-0.806
 Temper-0.770
Token as
Token ID355
Feature activation+2.82e+00

Pos logit contributions
igate+4.565
idon+4.463
VERTISEMENT+4.449
agate+4.355
OUT+4.326
Neg logit contributions
 citiz-2.581
 albeit-2.532
 accuses-2.528
 Temper-2.447
 we-2.394
Token retail
Token ID6308
Feature activation+5.99e-01

Pos logit contributions
 albeit+1.888
 we+1.882
 accuses+1.850
 though+1.838
 citiz+1.832
Neg logit contributions
igate-2.320
idon-2.282
agate-2.226
OUT-2.216
VERTISEMENT-2.187
Token also
Token ID635
Feature activation+1.25e+00

Pos logit contributions
igate+2.199
VERTISEMENT+2.193
idon+2.168
agate+2.119
OUT+2.100
Neg logit contributions
 citiz-0.971
 accuses-0.963
 albeit-0.953
 we-0.898
 Temper-0.898
Token have
Token ID423
Feature activation-7.74e+00

Pos logit contributions
 albeit+0.595
 accuses+0.595
 we+0.574
 citiz+0.565
 though+0.549
Neg logit contributions
VERTISEMENT-1.309
igate-1.293
idon-1.281
agate-1.246
TABLE-1.239
Token allowed
Token ID3142
Feature activation-9.49e+00

Pos logit contributions
igate+7.154
VERTISEMENT+6.979
idon+6.880
OUT+6.815
agate+6.779
Neg logit contributions
 citiz-4.531
 accuses-4.380
 albeit-4.249
 Temper-4.166
 we-3.959
Token multiple
Token ID3294
Feature activation+2.73e+00

Pos logit contributions
igate+8.429
VERTISEMENT+8.163
OUT+8.020
idon+7.945
agate+7.905
Neg logit contributions
 citiz-5.731
 accuses-5.491
 albeit-5.260
 Temper-5.185
soever-5.008
Token touchdown
Token ID10242
Feature activation-1.89e+00

Pos logit contributions
 accuses+1.245
 albeit+1.243
 citiz+1.205
 we+1.201
 Temper+1.147
Neg logit contributions
idon-2.869
igate-2.863
VERTISEMENT-2.848
agate-2.805
OUT-2.719
Token passes
Token ID8318
Feature activation-8.98e+00

Pos logit contributions
igate+2.155
idon+2.141
VERTISEMENT+2.126
agate+2.079
OUT+2.040
Neg logit contributions
 albeit-0.744
 accuses-0.743
 citiz-0.734
 we-0.697
 Temper-0.672
Token in
Token ID287
Feature activation-6.17e+00

Pos logit contributions
igate+8.101
VERTISEMENT+7.786
idon+7.700
OUT+7.680
agate+7.623
Neg logit contributions
 citiz-5.439
 accuses-5.164
 Temper-4.935
 albeit-4.932
 cannabin-4.653
Token four
Token ID1440
Feature activation+2.90e+00

Pos logit contributions
igate+5.623
VERTISEMENT+5.518
idon+5.448
agate+5.352
OUT+5.328
Neg logit contributions
 citiz-3.743
 accuses-3.661
 albeit-3.585
 Temper-3.490
 we-3.378
Token of
Token ID286
Feature activation-4.26e-01

Pos logit contributions
 albeit+1.364
 accuses+1.356
 we+1.344
 citiz+1.317
 though+1.274
Neg logit contributions
igate-3.029
VERTISEMENT-3.003
idon-2.991
agate-2.976
TABLE-2.815
Token their
Token ID511
Feature activation-1.52e+00

Pos logit contributions
igate+0.381
VERTISEMENT+0.376
idon+0.371
agate+0.367
uments+0.359
Neg logit contributions
 accuses-0.263
 albeit-0.262
 citiz-0.261
 we-0.259
 Temper-0.254
Token past
Token ID1613
Feature activation+2.76e+00

Pos logit contributions
igate+1.580
VERTISEMENT+1.551
idon+1.545
agate+1.519
OUT+1.494
Neg logit contributions
 citiz-0.739
 accuses-0.725
 albeit-0.716
 Temper-0.683
 we-0.680
Token wood
Token ID4898
Feature activation+1.79e+00

Pos logit contributions
igate+1.694
VERTISEMENT+1.674
idon+1.646
agate+1.643
OUT+1.618
Neg logit contributions
 citiz-0.789
 albeit-0.766
 accuses-0.761
 Temper-0.740
 we-0.730
Tokened
Token ID276
Feature activation+1.17e+00

Pos logit contributions
 accuses+1.091
 albeit+1.088
 we+1.070
 citiz+1.047
 Temper+1.026
Neg logit contributions
igate-1.596
VERTISEMENT-1.576
agate-1.549
idon-1.526
OUT-1.492
Token path
Token ID3108
Feature activation-6.40e-01

Pos logit contributions
 citiz+0.599
 albeit+0.588
 accuses+0.574
 Temper+0.561
 we+0.558
Neg logit contributions
VERTISEMENT-1.186
igate-1.174
idon-1.144
agate-1.140
OUT-1.108
Token that
Token ID326
Feature activation+3.16e+00

Pos logit contributions
igate+0.627
VERTISEMENT+0.625
agate+0.616
idon+0.608
OUT+0.602
Neg logit contributions
 albeit-0.334
 accuses-0.330
 we-0.329
 citiz-0.325
 Temper-0.317
Token
Token ID447
Feature activation+5.83e+00

Pos logit contributions
 accuses+1.173
 we+1.110
 citiz+1.102
 albeit+1.079
 rises+1.078
Neg logit contributions
igate-3.624
agate-3.562
VERTISEMENT-3.557
idon-3.528
OUT-3.483
Token
Token ID247
Feature activation-1.23e+01

Pos logit contributions
 albeit+4.987
 accuses+4.929
 citiz+4.916
 we+4.861
 though+4.766
Neg logit contributions
igate-3.565
idon-3.397
VERTISEMENT-3.381
agate-3.304
OUT-3.257
Tokens
Token ID82
Feature activation-5.17e+00

Pos logit contributions
igate+9.887
VERTISEMENT+9.677
idon+9.633
OUT+9.327
agate+9.314
Neg logit contributions
 citiz-8.441
 accuses-8.010
 albeit-7.866
 Temper-7.811
 lingu-7.324
Token home
Token ID1363
Feature activation-1.92e+00

Pos logit contributions
igate+4.792
VERTISEMENT+4.712
idon+4.657
agate+4.594
OUT+4.559
Neg logit contributions
 citiz-3.069
 accuses-3.005
 albeit-2.961
 Temper-2.863
 we-2.804
Token to
Token ID284
Feature activation-3.94e+00

Pos logit contributions
igate+1.860
VERTISEMENT+1.838
agate+1.819
idon+1.812
OUT+1.776
Neg logit contributions
 citiz-1.088
 accuses-1.065
 we-1.042
 albeit-1.031
 Temper-0.981
Token an
Token ID281
Feature activation-2.47e+00

Pos logit contributions
igate+4.065
VERTISEMENT+4.014
idon+3.945
agate+3.924
OUT+3.873
Neg logit contributions
 citiz-1.985
 albeit-1.918
 accuses-1.910
 Temper-1.882
 we-1.845
Token impressive
Token ID8036
Feature activation+1.94e+00

Pos logit contributions
igate+2.570
VERTISEMENT+2.535
idon+2.498
agate+2.468
OUT+2.459
Neg logit contributions
 citiz-1.169
 accuses-1.149
 albeit-1.144
 Temper-1.093
 we-1.064