TOP ACTIVATIONS
MAX = 56.331

m also here as an Israeli to say :
COUR TH OU SE An Av ond ale man returned
improved efficiency . An all - new four -
a Young Girl An 1888 photo of with her
was the first monument honoring an African American or a woman
and therefore prevent able . An important contemporary challenge in injury
heart , and talent as an artist .
. It was an up - and - down
's privacy . An estimated 125 , 000 people
ler said . An important implication of the study
Every year , an estimated 460 , 000 people
there is nothing better than an NFL owner dancing .
the Democratic Party but as an up holder of the ideas
First of all , as an American , I oppose what
I , as an American , oppose what my
Dwar ves . After surviving an attack , they are pursued
and sol es ") is an autos omal dominant ker at
to sub due Greece . An ep igraph from the Greek
all passed . An all usion to fat cat
harsh , but it is an odd circumstance . With many

MIN ACTIVATIONS
MIN = -42.873

promoter who has no long term thinking and a governing body
meters per second ( 5 , 9 46 miles per hour
the Commonwealth does not have legislative power . Each
, 807 miles ( 149 , 597 , 870 kilometers ),
little sign that the two bodies are willing to compromise .
together these products . Has there been any discussion that such
who has no long term thinking and a governing body that
. Neil Farrell DT 6 ' 4 / 295 3 *
ry v nia ( about $ 115 . 3 million )
time of 1 : 54 . 234 in his No .
miles ( 149 , 597 , 870 kilometers ), according to
UF Leonard Warner LB 6 ' 4 / 226 4 *
the memory increases to 1 , 264 m hz . The
of possibilities I had never really considered before . In fact
stand at 2 , 88 , 339 tons . The move
introdu ctions describing own backgrounds and the projects we were working
at a speed of 2 , 658 meters per second (
meters per second ( 4 , 471 miles per hour ),
per cent of the 9 , 204 people employed in that
put the unity of the country at risk with his

INTERVAL 31.530 - 56.331
CONTAINS 0.136%

I 'm not talking about an animal ," she said .
President Obama issued an executive order in 2011 design
lose , but it is an " if " and people
in which Sed aris played an adult woman who returns to
I , as an American , oppose what my

INTERVAL 6.729 - 31.530
CONTAINS 16.026%

do know that it can be rough for mixed people because
une on his list of 100 Greatest African Americans .[ 58
500 million check - ins on Unt app d . Think
is the main attraction ; tourists are allowed to freely visit
being , and safety of LGBT children , youth , and

INTERVAL -18.072 - 6.729
CONTAINS 82.848%

you going to become so impatient with their process that you
you can even hear radio chatter from your fellow pilots !
other and block the tub ules . Credit : MP I
seats which represent different segments of the Mag p ies '
because beside marking the end of 2009 , in just couple

INTERVAL -42.873 - -18.072
CONTAINS 0.989%

s )} d 5 { ( 573 s )} 17 .
staff to ear mark roughly $ 2 . 12 million for
others , they said that wasn 't true . They gave
2 . 6 3 . 5 kg ), but differ
arer setting box underneath roughly ( 6 ) C S
Token
Token ID247
Feature activation+1.18e+01

Pos logit contributions
 unintentional+0.847
 epidemic+0.840
amaz+0.817
lete+0.814
 offender+0.798
Neg logit contributions
 licences-1.051
 preferences-1.040
abouts-1.039
 finances-1.013
Politics-1.010
Tokenm
Token ID76
Feature activation+8.64e+00

Pos logit contributions
amaz+8.678
lete+8.601
 epidemic+8.568
 unintentional+8.511
 aggravated+8.247
Neg logit contributions
 licences-10.034
 preferences-9.832
abouts-9.754
 Preferences-9.652
 ambitions-9.546
Token also
Token ID635
Feature activation+7.70e+00

Pos logit contributions
 unintentional+7.496
 epidemic+7.328
amaz+7.239
lete+7.144
 aggravated+7.103
Neg logit contributions
 licences-6.742
abouts-6.573
 preferences-6.486
 Preferences-6.364
 finances-6.359
Token here
Token ID994
Feature activation+1.46e+01

Pos logit contributions
 unintentional+6.784
 epidemic+6.634
amaz+6.525
 aggravated+6.426
lete+6.418
Neg logit contributions
 licences-6.040
abouts-5.916
 preferences-5.821
 Preferences-5.722
 finances-5.683
Token as
Token ID355
Feature activation+1.15e+01

Pos logit contributions
 unintentional+10.205
amaz+9.958
 epidemic+9.882
 aggravated+9.827
lete+9.809
Neg logit contributions
 licences-12.408
 preferences-11.991
 Preferences-11.896
 finances-11.553
 ambitions-11.529
Token an
Token ID281
Feature activation+5.63e+01

Pos logit contributions
 unintentional+8.042
 epidemic+7.860
lete+7.505
amaz+7.458
 offender+7.419
Neg logit contributions
 licences-10.693
abouts-10.333
 preferences-10.301
 Preferences-10.195
 finances-10.067
Token Israeli
Token ID6085
Feature activation+2.85e+01

Pos logit contributions
 American+29.706
 elderly+29.199
 unintentional+29.105
 adult+29.087
 educator+28.811
Neg logit contributions
 licences-23.532
ヴァ-22.914
 Preferences-20.613
abouts-20.577
 preferences-20.382
Token to
Token ID284
Feature activation+1.73e+01

Pos logit contributions
 adult+20.066
 unintentional+19.844
 elderly+19.753
 educator+19.726
 addict+19.391
Neg logit contributions
 licences-17.996
 Preferences-15.950
abouts-15.843
 preferences-15.808
 finances-15.749
Token say
Token ID910
Feature activation+6.79e+00

Pos logit contributions
 epidemic+14.228
 unintentional+14.000
 aggravated+13.568
 offender+13.560
 unsuspecting+13.459
Neg logit contributions
 licences-11.676
abouts-11.594
616-11.165
 Preferences-11.150
 preferences-11.012
Token:
Token ID25
Feature activation+3.68e+00

Pos logit contributions
 unintentional+6.005
 epidemic+5.931
amaz+5.760
lete+5.725
 offender+5.697
Neg logit contributions
 licences-5.201
abouts-5.140
 preferences-5.057
 Preferences-4.951
 finances-4.925
Token
Token ID564
Feature activation+1.11e+00

Pos logit contributions
 unintentional+3.336
 epidemic+3.308
amaz+3.228
lete+3.209
 offender+3.172
Neg logit contributions
 licences-2.824
abouts-2.784
 preferences-2.766
 finances-2.684
 Preferences-2.673
Token COUR
Token ID37384
Feature activation-3.75e+00

Pos logit contributions
 unintentional+3.509
 epidemic+3.487
amaz+3.386
 offender+3.383
lete+3.359
Neg logit contributions
 licences-2.774
abouts-2.743
 preferences-2.719
Politics-2.660
 finances-2.639
TokenTH
Token ID4221
Feature activation+3.55e-01

Pos logit contributions
 licences+2.642
 preferences+2.594
abouts+2.588
Politics+2.550
 Preferences+2.502
Neg logit contributions
 epidemic-3.605
 unintentional-3.601
amaz-3.481
lete-3.473
 unsuspecting-3.451
TokenOU
Token ID2606
Feature activation-6.41e+00

Pos logit contributions
 unintentional+0.313
 epidemic+0.310
amaz+0.303
lete+0.302
 offender+0.297
Neg logit contributions
 licences-0.282
 preferences-0.279
abouts-0.278
 finances-0.270
Politics-0.269
TokenSE
Token ID5188
Feature activation+3.50e-02

Pos logit contributions
abouts+4.030
 licences+3.991
 preferences+3.917
 Preferences+3.857
Politics+3.800
Neg logit contributions
amaz-6.591
 epidemic-6.588
 unintentional-6.559
 involuntary-6.379
lete-6.327
Token
Token ID851
Feature activation+6.54e+00

Pos logit contributions
 unintentional+0.029
 epidemic+0.029
amaz+0.028
lete+0.028
 involuntary+0.027
Neg logit contributions
 licences-0.030
 preferences-0.030
abouts-0.030
 finances-0.029
 Preferences-0.029
Token An
Token ID1052
Feature activation+5.34e+01

Pos logit contributions
 unintentional+4.885
 epidemic+4.842
amaz+4.765
 offender+4.663
lete+4.651
Neg logit contributions
 licences-5.918
 preferences-5.833
abouts-5.762
 finances-5.675
 ambitions-5.660
Token Av
Token ID5184
Feature activation+2.93e+00

Pos logit contributions
 elderly+33.326
 aggravated+32.535
 offender+32.507
 unintentional+32.476
 unsuspecting+32.080
Neg logit contributions
ヴァ-18.358
Relations-17.587
 Preferences-17.498
 preferences-17.228
 licences-16.681
Tokenond
Token ID623
Feature activation+4.90e+00

Pos logit contributions
 unintentional+1.678
 epidemic+1.660
amaz+1.599
lete+1.592
 offender+1.550
Neg logit contributions
 licences-3.243
 preferences-3.214
abouts-3.211
 finances-3.143
Politics-3.135
Tokenale
Token ID1000
Feature activation+2.49e+01

Pos logit contributions
 unintentional+2.158
 epidemic+2.135
lete+2.076
amaz+2.022
 offender+1.990
Neg logit contributions
 licences-5.990
abouts-5.947
 preferences-5.939
 Preferences-5.825
 finances-5.824
Token man
Token ID582
Feature activation+7.93e+00

Pos logit contributions
 offender+17.827
 elderly+17.350
 aggravated+17.333
 unintentional+17.291
 addict+17.064
Neg logit contributions
 licences-15.790
Politics-15.569
abouts-15.446
ヴァ-15.316
 preferences-15.176
Token returned
Token ID4504
Feature activation+8.47e+00

Pos logit contributions
 unintentional+6.135
 epidemic+6.035
 aggravated+5.917
 involuntary+5.843
 offender+5.833
Neg logit contributions
Politics-6.796
 licences-6.782
abouts-6.778
 Preferences-6.729
 preferences-6.681
Token improved
Token ID6596
Feature activation+4.33e+00

Pos logit contributions
 epidemic+16.103
 unintentional+15.997
 astounding+15.597
 aggravated+15.387
 addict+15.342
Neg logit contributions
 licences-13.736
Politics-13.480
abouts-13.234
 Preferences-13.157
616-13.121
Token efficiency
Token ID9332
Feature activation+1.06e+01

Pos logit contributions
 unintentional+4.316
 epidemic+4.277
amaz+4.132
lete+4.132
 offender+4.124
Neg logit contributions
 licences-2.942
abouts-2.887
 preferences-2.859
Politics-2.804
 Preferences-2.788
Token.
Token ID13
Feature activation+1.44e+01

Pos logit contributions
 unintentional+7.291
 epidemic+7.281
 offender+6.872
 aggravated+6.860
lete+6.796
Neg logit contributions
 licences-9.681
abouts-9.615
Politics-9.525
 Preferences-9.374
 finances-9.288
Token
Token ID198
Feature activation+8.18e+00

Pos logit contributions
 unintentional+9.143
lete+9.050
amaz+9.010
 epidemic+9.002
 offender+8.705
Neg logit contributions
 licences-12.719
abouts-12.401
 finances-12.351
 preferences-12.322
 ambitions-12.113
Token
Token ID198
Feature activation+1.41e+01

Pos logit contributions
 epidemic+4.682
 unintentional+4.538
lete+4.501
amaz+4.416
 addict+4.382
Neg logit contributions
 licences-8.532
abouts-8.323
 preferences-8.304
 ambitions-8.256
 finances-8.255
TokenAn
Token ID2025
Feature activation+5.21e+01

Pos logit contributions
lete+11.509
amaz+11.459
 unintentional+11.318
 epidemic+11.256
 addict+10.956
Neg logit contributions
 licences-10.215
 preferences-9.767
abouts-9.718
 finances-9.596
 Preferences-9.474
Token all
Token ID477
Feature activation+1.95e+01

Pos logit contributions
 astounding+29.715
 unintentional+29.352
 epidemic+29.135
 increase+28.832
 estimated+28.722
Neg logit contributions
 licences-19.485
ヴァ-17.670
 preferences-17.278
Relations-17.200
 Preferences-17.020
Token-
Token ID12
Feature activation+1.31e+01

Pos logit contributions
 epidemic+10.596
lete+10.353
 unintentional+10.349
 offender+10.149
 unsuspecting+9.513
Neg logit contributions
 licences-17.739
 Preferences-17.593
abouts-17.263
 preferences-17.211
 finances-16.990
Tokennew
Token ID3605
Feature activation+1.03e+01

Pos logit contributions
 epidemic+7.301
amaz+7.291
 unintentional+7.169
lete+7.137
 offender+6.759
Neg logit contributions
 licences-13.329
 Preferences-13.107
 preferences-13.019
 finances-12.778
 ambitions-12.662
Token four
Token ID1440
Feature activation+1.64e+01

Pos logit contributions
 unintentional+9.383
 epidemic+9.307
 offender+9.084
lete+9.045
 involuntary+8.854
Neg logit contributions
 licences-7.281
abouts-7.187
 preferences-6.985
Politics-6.972
 Preferences-6.842
Token-
Token ID12
Feature activation+8.85e+00

Pos logit contributions
 unintentional+10.925
 epidemic+10.868
lete+10.741
 offender+10.632
amaz+10.321
Neg logit contributions
 licences-13.537
abouts-13.399
Politics-13.280
 Preferences-13.275
 preferences-13.134
Token a
Token ID257
Feature activation-4.00e+00

Pos logit contributions
 licences+4.794
 preferences+4.779
abouts+4.778
 Preferences+4.704
 finances+4.662
Neg logit contributions
 unintentional-4.281
 epidemic-4.235
amaz-4.191
lete-4.144
 offender-4.092
Token Young
Token ID6960
Feature activation+1.97e+01

Pos logit contributions
 preferences+3.676
 licences+3.671
abouts+3.647
 Preferences+3.608
 finances+3.586
Neg logit contributions
 unintentional-2.893
 epidemic-2.866
amaz-2.834
lete-2.773
 offender-2.734
Token Girl
Token ID7430
Feature activation+5.49e+00

Pos logit contributions
lete+14.130
 epidemic+14.027
 adult+13.844
 unintentional+13.815
 offender+13.785
Neg logit contributions
 licences-15.179
 preferences-14.492
 finances-14.427
 ambitions-14.055
abouts-13.923
Token
Token ID198
Feature activation-9.92e-01

Pos logit contributions
 epidemic+4.364
 unintentional+4.360
lete+4.233
amaz+4.199
 offender+4.142
Neg logit contributions
 licences-4.757
 preferences-4.683
abouts-4.661
Politics-4.559
 finances-4.544
Token
Token ID198
Feature activation-2.42e+00

Pos logit contributions
 licences+0.856
 preferences+0.848
abouts+0.847
Politics+0.826
 finances+0.823
Neg logit contributions
 unintentional-0.804
 epidemic-0.798
amaz-0.777
lete-0.775
 offender-0.762
TokenAn
Token ID2025
Feature activation+5.18e+01

Pos logit contributions
abouts+1.756
 licences+1.744
Politics+1.731
 preferences+1.731
616+1.687
Neg logit contributions
 unintentional-2.276
 epidemic-2.254
amaz-2.183
 offender-2.177
lete-2.177
Token 1888
Token ID49584
Feature activation+1.68e+01

Pos logit contributions
 elderly+31.760
 unintentional+31.407
 epidemic+31.244
 astounding+30.639
 unsuspecting+30.354
Neg logit contributions
 licences-18.599
 preferences-18.025
 frameworks-16.859
ヴァ-16.761
abouts-16.695
Token photo
Token ID4590
Feature activation+8.00e+00

Pos logit contributions
 epidemic+13.284
 unintentional+12.913
 involuntary+12.463
 offender+12.246
 elderly+12.232
Neg logit contributions
 licences-12.370
abouts-12.338
 preferences-12.078
Politics-12.006
 finances-11.736
Token of
Token ID286
Feature activation+7.42e+00

Pos logit contributions
 epidemic+5.973
 unintentional+5.932
lete+5.712
 offender+5.610
amaz+5.576
Neg logit contributions
abouts-7.180
 licences-7.164
 preferences-7.011
Politics-6.990
 Preferences-6.935
Tokenwith
Token ID4480
Feature activation+6.25e+00

Pos logit contributions
 unintentional+5.635
 epidemic+5.610
 unsuspecting+5.364
lete+5.355
amaz+5.327
Neg logit contributions
 licences-6.630
abouts-6.575
 preferences-6.508
Politics-6.405
 Preferences-6.330
Token her
Token ID607
Feature activation+1.55e+00

Pos logit contributions
 unintentional+5.000
 epidemic+4.969
amaz+4.830
lete+4.819
 unsuspecting+4.741
Neg logit contributions
 licences-5.359
abouts-5.288
 preferences-5.266
Politics-5.162
 Preferences-5.134
Token was
Token ID373
Feature activation+8.15e+00

Pos logit contributions
 unintentional+6.894
 epidemic+6.806
lete+6.731
 aggravated+6.568
amaz+6.560
Neg logit contributions
 Preferences-7.455
 licences-7.404
abouts-7.355
 preferences-7.348
Politics-7.339
Token the
Token ID262
Feature activation+1.03e+01

Pos logit contributions
 unintentional+6.626
 epidemic+6.414
lete+6.207
 involuntary+6.175
amaz+6.150
Neg logit contributions
 licences-6.849
abouts-6.838
 Preferences-6.713
 preferences-6.713
Politics-6.594
Token first
Token ID717
Feature activation+2.10e+01

Pos logit contributions
 unintentional+7.779
 epidemic+7.485
 involuntary+7.245
 offender+7.229
 unsuspecting+7.177
Neg logit contributions
abouts-9.183
 licences-9.027
 Preferences-8.802
 preferences-8.760
Politics-8.682
Token monument
Token ID17757
Feature activation+7.34e+00

Pos logit contributions
 unintentional+16.944
 epidemic+16.327
 offender+16.063
 elderly+15.969
 involuntary+15.899
Neg logit contributions
 licences-13.874
abouts-13.707
 Preferences-13.118
Politics-12.934
 preferences-12.871
Token honoring
Token ID38276
Feature activation+1.69e+01

Pos logit contributions
 unintentional+5.563
 epidemic+5.443
lete+5.294
 offender+5.226
amaz+5.215
Neg logit contributions
 licences-6.560
abouts-6.475
 preferences-6.458
 Preferences-6.418
Politics-6.380
Token an
Token ID281
Feature activation+5.17e+01

Pos logit contributions
 unintentional+13.057
 epidemic+12.861
 unsuspecting+12.336
 elderly+12.300
 involuntary+12.294
Neg logit contributions
abouts-12.692
 licences-12.428
 Preferences-12.139
616-12.021
Politics-11.959
Token African
Token ID5510
Feature activation+2.01e+01

Pos logit contributions
 unintentional+29.388
 elderly+29.328
 adult+29.190
 epidemic+29.065
 African+28.733
Neg logit contributions
ヴァ-22.778
 licences-22.081
abouts-21.738
krit-21.543
 Preferences-20.759
Token American
Token ID1605
Feature activation+2.08e+01

Pos logit contributions
 epidemic+13.535
 unintentional+13.205
 offender+12.844
lete+12.660
 adult+12.589
Neg logit contributions
 licences-16.504
abouts-16.068
 Preferences-15.293
Politics-14.976
 preferences-14.831
Token or
Token ID393
Feature activation+1.65e+01

Pos logit contributions
 epidemic+15.394
 unintentional+15.272
 offender+15.183
 adult+14.862
 elderly+14.802
Neg logit contributions
 licences-15.086
abouts-14.876
 Preferences-14.285
Politics-13.982
 preferences-13.834
Token a
Token ID257
Feature activation+4.06e+00

Pos logit contributions
 unintentional+12.399
 epidemic+12.165
lete+11.862
 involuntary+11.717
 elderly+11.622
Neg logit contributions
 licences-13.286
 preferences-12.544
abouts-12.391
 finances-12.381
Politics-12.268
Token woman
Token ID2415
Feature activation+1.05e+01

Pos logit contributions
 unintentional+2.827
 epidemic+2.801
lete+2.709
amaz+2.653
 offender+2.647
Neg logit contributions
 licences-3.983
abouts-3.923
 preferences-3.895
Politics-3.815
 finances-3.814
Token and
Token ID290
Feature activation+1.73e+01

Pos logit contributions
 unintentional+9.466
 epidemic+9.298
 involuntary+8.893
 aggravated+8.686
 offender+8.655
Neg logit contributions
abouts-9.402
 licences-9.161
Politics-8.963
 preferences-8.885
 Preferences-8.849
Token therefore
Token ID4361
Feature activation+1.40e+01

Pos logit contributions
 unintentional+15.780
 epidemic+15.538
 involuntary+14.730
 offender+14.562
 aggravated+14.481
Neg logit contributions
abouts-11.113
Politics-10.743
 licences-10.623
 Preferences-10.423
616-10.368
Token prevent
Token ID2948
Feature activation+2.19e+01

Pos logit contributions
 unintentional+12.345
 epidemic+12.142
 involuntary+11.550
 offender+11.369
 aggravated+11.270
Neg logit contributions
abouts-9.946
Politics-9.836
 licences-9.815
 Preferences-9.676
616-9.638
Tokenable
Token ID540
Feature activation+9.93e+00

Pos logit contributions
 epidemic+14.705
 unintentional+14.676
 involuntary+13.411
 aggravated+13.187
amaz+13.186
Neg logit contributions
Politics-16.859
 licences-16.722
abouts-16.580
 Preferences-16.567
 ambitions-16.280
Token.
Token ID13
Feature activation+9.76e+00

Pos logit contributions
 unintentional+7.287
 epidemic+7.220
 offender+6.733
 involuntary+6.661
amaz+6.593
Neg logit contributions
 licences-8.878
abouts-8.817
Politics-8.621
 preferences-8.599
 Preferences-8.548
Token An
Token ID1052
Feature activation+5.17e+01

Pos logit contributions
 unintentional+7.425
 epidemic+7.396
amaz+7.066
lete+7.012
 offender+6.967
Neg logit contributions
 licences-8.406
abouts-8.300
 preferences-8.188
 ambitions-8.118
 finances-8.082
Token important
Token ID1593
Feature activation+1.79e+01

Pos logit contributions
 epidemic+33.707
 unintentional+32.420
 offender+31.014
 increase+30.998
 estimated+30.854
Neg logit contributions
abouts-18.434
 licences-18.416
 Preferences-18.257
ヴァ-18.020
 Talks-17.741
Token contemporary
Token ID11811
Feature activation+1.49e+01

Pos logit contributions
 epidemic+15.541
 unintentional+15.061
 offender+14.689
 involuntary+14.245
 addict+14.006
Neg logit contributions
abouts-12.294
 licences-11.876
 Preferences-11.535
616-11.506
Politics-11.425
Token challenge
Token ID4427
Feature activation+8.44e+00

Pos logit contributions
 epidemic+13.425
 unintentional+12.927
 offender+12.616
 involuntary+12.293
 addict+12.074
Neg logit contributions
abouts-10.714
 licences-10.257
Politics-9.962
 Preferences-9.903
616-9.814
Token in
Token ID287
Feature activation+1.84e+01

Pos logit contributions
 epidemic+6.786
 unintentional+6.754
 offender+6.455
lete+6.435
 aggravated+6.393
Neg logit contributions
abouts-6.946
 licences-6.946
 Preferences-6.773
 preferences-6.772
Politics-6.734
Token injury
Token ID5095
Feature activation+6.36e+00

Pos logit contributions
 epidemic+16.542
 unintentional+16.526
 involuntary+15.403
 offender+15.380
 aggravated+15.267
Neg logit contributions
abouts-12.019
 licences-11.582
Politics-11.429
616-11.209
 Preferences-11.161
Token heart
Token ID2612
Feature activation+1.10e+01

Pos logit contributions
 unintentional+10.566
 epidemic+10.556
lete+10.141
 offender+10.025
amaz+9.972
Neg logit contributions
 licences-7.737
abouts-7.491
 preferences-7.305
Politics-7.296
 Preferences-7.270
Token,
Token ID11
Feature activation+4.08e+00

Pos logit contributions
 epidemic+8.120
 unintentional+7.808
lete+7.672
 offender+7.495
 addict+7.486
Neg logit contributions
 licences-9.713
abouts-9.343
 preferences-9.200
 Preferences-9.178
Politics-9.135
Token and
Token ID290
Feature activation+9.99e+00

Pos logit contributions
 unintentional+3.118
 epidemic+3.076
lete+2.965
amaz+2.944
 involuntary+2.922
Neg logit contributions
 licences-3.734
abouts-3.722
 preferences-3.657
 Preferences-3.593
Politics-3.592
Token talent
Token ID7401
Feature activation+1.11e+01

Pos logit contributions
 unintentional+9.253
 epidemic+9.091
 unsuspecting+8.709
lete+8.689
 offender+8.677
Neg logit contributions
abouts-7.166
 licences-7.137
Politics-6.848
 Preferences-6.848
 preferences-6.766
Token as
Token ID355
Feature activation+5.76e+00

Pos logit contributions
 epidemic+8.576
 unintentional+8.548
lete+8.239
 offender+8.122
amaz+8.004
Neg logit contributions
 licences-9.216
abouts-8.995
 preferences-8.917
Politics-8.835
 Preferences-8.822
Token an
Token ID281
Feature activation+5.11e+01

Pos logit contributions
 unintentional+5.549
 epidemic+5.532
lete+5.368
amaz+5.304
 offender+5.270
Neg logit contributions
 licences-4.083
abouts-4.031
 preferences-4.006
 Preferences-3.884
Politics-3.872
Token artist
Token ID6802
Feature activation+1.10e+01

Pos logit contributions
 epidemic+31.413
 unintentional+31.356
 educator+31.234
 addict+31.187
 adult+31.050
Neg logit contributions
ヴァ-21.491
 licences-20.964
 Preferences-19.246
abouts-19.109
 preferences-18.911
Token.
Token ID13
Feature activation+8.32e+00

Pos logit contributions
 epidemic+7.540
 unintentional+7.493
lete+7.431
 offender+7.163
amaz+7.089
Neg logit contributions
 licences-10.158
abouts-9.845
 preferences-9.833
 Preferences-9.742
Politics-9.718
Token
Token ID447
Feature activation-1.47e+00

Pos logit contributions
 unintentional+6.408
 epidemic+6.408
lete+6.279
amaz+6.253
 offender+6.095
Neg logit contributions
 licences-7.242
abouts-7.039
 preferences-7.035
 ambitions-6.866
 finances-6.863
Token
Token ID251
Feature activation+9.67e+00

Pos logit contributions
 licences+1.108
 preferences+1.100
abouts+1.089
 finances+1.063
Politics+1.063
Neg logit contributions
 unintentional-1.345
 epidemic-1.334
amaz-1.306
lete-1.297
 offender-1.281
Token
Token ID198
Feature activation+3.19e+00

Pos logit contributions
 unintentional+6.775
 epidemic+6.760
lete+6.698
amaz+6.673
 offender+6.368
Neg logit contributions
 licences-8.929
 preferences-8.653
abouts-8.619
 ambitions-8.476
 finances-8.470
Token.
Token ID13
Feature activation+5.81e+00

Pos logit contributions
 unintentional+7.953
 epidemic+7.740
 offender+7.500
amaz+7.491
 unsuspecting+7.445
Neg logit contributions
abouts-9.006
 licences-8.977
Politics-8.843
 Preferences-8.810
 preferences-8.706
Token
Token ID198
Feature activation+1.43e+00

Pos logit contributions
 unintentional+4.345
 epidemic+4.316
amaz+4.270
lete+4.224
 offender+4.114
Neg logit contributions
 licences-5.290
abouts-5.192
 preferences-5.191
 finances-5.111
 ambitions-5.058
Token
Token ID198
Feature activation+4.43e+00

Pos logit contributions
 unintentional+1.052
 epidemic+1.051
amaz+1.017
lete+1.017
 offender+0.994
Neg logit contributions
 licences-1.341
abouts-1.320
 preferences-1.319
 finances-1.291
 ambitions-1.286
TokenIt
Token ID1026
Feature activation+1.60e+01

Pos logit contributions
 unintentional+4.008
 epidemic+3.979
amaz+3.944
lete+3.935
 offender+3.830
Neg logit contributions
 licences-3.392
abouts-3.322
 preferences-3.317
 finances-3.236
 ambitions-3.202
Token was
Token ID373
Feature activation+1.53e+01

Pos logit contributions
 aggravated+11.749
 unintentional+11.667
 epidemic+11.488
lete+11.282
amaz+11.171
Neg logit contributions
 licences-12.571
 Preferences-12.261
 preferences-12.250
abouts-12.221
Politics-11.953
Token an
Token ID281
Feature activation+5.07e+01

Pos logit contributions
 unintentional+12.026
 epidemic+11.651
 involuntary+11.071
 aggravated+11.037
amaz+10.989
Neg logit contributions
abouts-12.191
 licences-11.937
 preferences-11.393
 Preferences-11.371
Politics-11.276
Token up
Token ID510
Feature activation+1.70e+01

Pos logit contributions
 unintentional+30.446
 astounding+29.524
 epidemic+29.385
 unforgettable+28.242
 egregious+28.198
Neg logit contributions
ヴァ-21.940
 licences-20.535
abouts-20.528
krit-19.810
 Preferences-19.678
Token-
Token ID12
Feature activation+1.02e+01

Pos logit contributions
 epidemic+12.062
 unintentional+11.747
amaz+11.572
ounding+11.504
 aggravated+11.344
Neg logit contributions
 licences-14.042
 preferences-13.542
 Preferences-13.500
abouts-13.420
 finances-13.301
Tokenand
Token ID392
Feature activation+1.14e+01

Pos logit contributions
amaz+6.961
 epidemic+6.886
 unintentional+6.859
lete+6.667
 involuntary+6.517
Neg logit contributions
 licences-9.538
 preferences-9.345
 Preferences-9.256
abouts-9.214
 finances-9.116
Token-
Token ID12
Feature activation-5.12e+00

Pos logit contributions
 epidemic+4.949
 unintentional+4.926
amaz+4.735
 offender+4.656
 aggravated+4.539
Neg logit contributions
 licences-13.149
abouts-12.915
 preferences-12.871
 Preferences-12.819
Politics-12.690
Tokendown
Token ID2902
Feature activation+1.87e+01

Pos logit contributions
abouts+3.950
 licences+3.834
 preferences+3.830
Politics+3.767
 finances+3.754
Neg logit contributions
 unintentional-4.511
 epidemic-4.501
lete-4.321
 offender-4.309
 involuntary-4.295
Token's
Token ID338
Feature activation-1.22e+01

Pos logit contributions
 preferences+3.818
 licences+3.757
abouts+3.672
 finances+3.634
 ambitions+3.566
Neg logit contributions
amaz-5.258
 unintentional-5.253
 epidemic-5.201
lete-5.114
 involuntary-4.963
Token privacy
Token ID6782
Feature activation-5.31e+00

Pos logit contributions
 preferences+9.678
 licences+9.221
 finances+9.083
 ambitions+8.895
 Preferences+8.548
Neg logit contributions
amaz-10.450
lete-10.026
 unintentional-9.854
 epidemic-9.621
agically-9.588
Token.
Token ID13
Feature activation+8.96e+00

Pos logit contributions
 preferences+4.627
 licences+4.574
 ambitions+4.410
 finances+4.387
abouts+4.379
Neg logit contributions
 unintentional-4.259
amaz-4.240
lete-4.162
 epidemic-4.161
 unsuspecting-4.018
Token
Token ID198
Feature activation+5.94e+00

Pos logit contributions
 unintentional+6.091
 epidemic+6.075
amaz+6.008
lete+6.004
 offender+5.792
Neg logit contributions
 licences-8.424
 preferences-8.195
abouts-8.122
 finances-8.122
 ambitions-8.092
Token
Token ID198
Feature activation+6.01e+00

Pos logit contributions
 epidemic+3.771
 unintentional+3.698
lete+3.618
amaz+3.602
 addict+3.505
Neg logit contributions
 licences-6.012
abouts-5.889
 preferences-5.864
 finances-5.800
 ambitions-5.780
TokenAn
Token ID2025
Feature activation+5.05e+01

Pos logit contributions
 unintentional+5.342
lete+5.328
 epidemic+5.314
amaz+5.306
 offender+5.124
Neg logit contributions
 licences-4.617
 preferences-4.478
abouts-4.458
 finances-4.376
 ambitions-4.346
Token estimated
Token ID6108
Feature activation+5.84e+00

Pos logit contributions
 elderly+31.722
 estimated+31.098
 epidemic+30.783
 unintentional+30.665
 hour+30.238
Neg logit contributions
 licences-18.272
ヴァ-17.112
 Preferences-16.842
Relations-16.703
abouts-16.656
Token 125
Token ID13151
Feature activation+2.11e+01

Pos logit contributions
 epidemic+4.880
 unintentional+4.865
 offender+4.615
lete+4.606
 involuntary+4.601
Neg logit contributions
 licences-4.823
abouts-4.779
 preferences-4.726
 Preferences-4.657
Politics-4.654
Token,
Token ID11
Feature activation-1.14e+01

Pos logit contributions
 unintentional+16.747
 unsuspecting+16.488
 elderly+16.346
 epidemic+16.233
 adult+16.191
Neg logit contributions
abouts-13.133
Politics-12.794
 licences-12.431
 Preferences-12.350
 finances-12.323
Token000
Token ID830
Feature activation+1.92e+01

Pos logit contributions
616+5.231
abouts+4.883
 licences+4.664
 preferences+4.655
Politics+4.428
Neg logit contributions
 unintentional-13.169
amaz-12.909
 epidemic-12.885
 offender-12.682
 involuntary-12.586
Token people
Token ID661
Feature activation+3.70e+00

Pos logit contributions
 unintentional+15.119
 unsuspecting+14.855
 adults+14.743
 elderly+14.737
 epidemic+14.632
Neg logit contributions
Politics-12.866
abouts-12.792
 licences-12.274
 Preferences-12.169
616-12.099
Tokenler
Token ID1754
Feature activation+1.05e+01

Pos logit contributions
lete+4.540
 unintentional+4.465
 epidemic+4.445
amaz+4.405
 involuntary+4.131
Neg logit contributions
 licences-7.059
 preferences-6.974
abouts-6.888
 Preferences-6.823
Politics-6.813
Token said
Token ID531
Feature activation-1.66e-01

Pos logit contributions
 unintentional+8.055
 epidemic+8.006
amaz+7.866
lete+7.857
 aggravated+7.583
Neg logit contributions
abouts-8.923
 licences-8.835
 preferences-8.543
 finances-8.356
 Preferences-8.353
Token.
Token ID13
Feature activation+2.02e+00

Pos logit contributions
 licences+0.147
 preferences+0.145
abouts+0.145
 finances+0.141
Politics+0.141
Neg logit contributions
 unintentional-0.132
 epidemic-0.131
amaz-0.127
lete-0.127
 offender-0.124
Token
Token ID564
Feature activation+7.77e+00

Pos logit contributions
 unintentional+1.624
 epidemic+1.613
amaz+1.572
lete+1.563
 offender+1.533
Neg logit contributions
 licences-1.766
abouts-1.746
 preferences-1.739
 finances-1.695
 ambitions-1.688
Token
Token ID250
Feature activation+2.03e+01

Pos logit contributions
 epidemic+5.620
 unintentional+5.581
lete+5.485
amaz+5.352
 involuntary+5.273
Neg logit contributions
abouts-7.130
 licences-7.120
 preferences-6.915
 ambitions-6.787
 finances-6.764
TokenAn
Token ID2025
Feature activation+5.04e+01

Pos logit contributions
 epidemic+16.021
lete+15.902
amaz+15.768
 unintentional+15.758
 aggravated+14.944
Neg logit contributions
 licences-13.072
abouts-12.439
 preferences-12.352
 finances-12.230
 ambitions-12.115
Token important
Token ID1593
Feature activation+1.79e+01

Pos logit contributions
 epidemic+32.518
 unintentional+30.961
 estimated+30.468
 increase+30.401
 astounding+30.038
Neg logit contributions
 licences-17.745
 Preferences-17.437
 preferences-17.425
ヴァ-16.881
abouts-16.731
Token implication
Token ID26863
Feature activation+1.12e+00

Pos logit contributions
 epidemic+15.541
 unintentional+14.933
 offender+14.472
 involuntary+14.174
 addict+13.901
Neg logit contributions
abouts-12.236
 licences-12.016
 preferences-11.481
 Preferences-11.468
616-11.452
Token of
Token ID286
Feature activation+1.19e+01

Pos logit contributions
 unintentional+0.862
 epidemic+0.855
amaz+0.830
lete+0.828
 offender+0.813
Neg logit contributions
 licences-1.025
abouts-1.014
 preferences-1.013
 finances-0.986
Politics-0.984
Token the
Token ID262
Feature activation+8.12e+00

Pos logit contributions
 epidemic+8.564
 unintentional+8.472
amaz+8.070
 aggravated+7.923
 involuntary+7.878
Neg logit contributions
abouts-10.544
 licences-10.441
Politics-10.083
 preferences-10.020
 Preferences-9.934
Token study
Token ID2050
Feature activation-3.51e+00

Pos logit contributions
 epidemic+6.743
 unintentional+6.660
 involuntary+6.256
 offender+6.253
amaz+6.223
Neg logit contributions
abouts-6.776
 licences-6.693
 preferences-6.495
Politics-6.473
 Preferences-6.407
Token
Token ID198
Feature activation+2.97e+00

Pos logit contributions
 epidemic+2.441
 unintentional+2.437
amaz+2.363
lete+2.346
 offender+2.292
Neg logit contributions
 licences-2.955
 preferences-2.929
abouts-2.911
 finances-2.848
 ambitions-2.842
Token
Token ID198
Feature activation+2.23e+00

Pos logit contributions
 epidemic+2.091
 unintentional+2.082
lete+2.013
amaz+2.011
 offender+1.961
Neg logit contributions
 licences-2.856
abouts-2.809
 preferences-2.805
 finances-2.751
 ambitions-2.750
TokenEvery
Token ID6109
Feature activation+2.25e+01

Pos logit contributions
 unintentional+2.028
 epidemic+2.020
amaz+1.975
lete+1.969
 offender+1.931
Neg logit contributions
 licences-1.710
 preferences-1.689
abouts-1.683
 finances-1.634
 Preferences-1.626
Token year
Token ID614
Feature activation+5.79e+00

Pos logit contributions
 epidemic+16.685
 unintentional+15.603
 addict+15.474
 elderly+15.425
 adult+15.356
Neg logit contributions
abouts-15.273
 preferences-15.088
 Preferences-15.062
 licences-15.029
Politics-14.596
Token,
Token ID11
Feature activation+7.13e+00

Pos logit contributions
 epidemic+4.827
 unintentional+4.808
amaz+4.577
lete+4.525
 involuntary+4.525
Neg logit contributions
 licences-4.767
 preferences-4.761
abouts-4.715
 Preferences-4.668
Politics-4.644
Token an
Token ID281
Feature activation+5.02e+01

Pos logit contributions
 epidemic+6.219
 unintentional+6.182
 unsuspecting+5.851
amaz+5.843
 involuntary+5.839
Neg logit contributions
abouts-5.535
 preferences-5.529
 licences-5.508
 Preferences-5.431
Politics-5.405
Token estimated
Token ID6108
Feature activation+3.41e+00

Pos logit contributions
 epidemic+32.536
 estimated+30.744
 unintentional+29.609
 astounding+29.281
 elderly+29.127
Neg logit contributions
ヴァ-21.889
 Preferences-21.253
 preferences-20.259
Relations-19.622
 Talks-19.417
Token 460
Token ID34091
Feature activation+2.52e+01

Pos logit contributions
 unintentional+2.914
 epidemic+2.913
amaz+2.791
lete+2.781
 offender+2.756
Neg logit contributions
 licences-2.789
abouts-2.766
 preferences-2.759
 Preferences-2.689
Politics-2.686
Token,
Token ID11
Feature activation-1.18e+01

Pos logit contributions
 unintentional+18.516
 epidemic+18.507
 elderly+18.006
 unsuspecting+17.463
 adults+17.170
Neg logit contributions
Politics-15.641
 Preferences-15.526
abouts-15.411
 finances-15.218
 preferences-15.084
Token000
Token ID830
Feature activation+2.12e+01

Pos logit contributions
616+5.589
abouts+5.257
 licences+5.233
 preferences+5.102
 ambitions+4.896
Neg logit contributions
 unintentional-13.019
amaz-12.840
 epidemic-12.785
 offender-12.709
lete-12.584
Token people
Token ID661
Feature activation-1.78e+00

Pos logit contributions
 unintentional+16.822
 epidemic+16.672
 elderly+16.465
 unsuspecting+16.135
 involuntary+15.776
Neg logit contributions
Politics-13.806
 Preferences-13.738
abouts-13.403
 preferences-13.045
 finances-12.997
Token there
Token ID612
Feature activation-1.19e+01

Pos logit contributions
 unintentional+4.352
 epidemic+4.317
lete+4.150
 offender+4.150
amaz+4.145
Neg logit contributions
 licences-3.356
abouts-3.347
 preferences-3.297
Politics-3.220
 Preferences-3.219
Token is
Token ID318
Feature activation-5.60e-01

Pos logit contributions
abouts+8.483
 preferences+8.136
 finances+7.927
 licences+7.919
 ambitions+7.640
Neg logit contributions
amaz-10.396
 unintentional-10.228
lete-10.162
 epidemic-9.999
 involuntary-9.848
Token nothing
Token ID2147
Feature activation-1.68e-01

Pos logit contributions
 licences+0.454
 preferences+0.452
abouts+0.447
 finances+0.438
 ambitions+0.436
Neg logit contributions
 unintentional-0.478
 epidemic-0.474
amaz-0.469
lete-0.468
 offender-0.456
Token better
Token ID1365
Feature activation+6.22e+00

Pos logit contributions
 licences+0.133
 preferences+0.132
abouts+0.131
 finances+0.128
 ambitions+0.127
Neg logit contributions
 unintentional-0.146
 epidemic-0.146
amaz-0.144
lete-0.143
 offender-0.140
Token than
Token ID621
Feature activation+1.06e+01

Pos logit contributions
 unintentional+5.297
 epidemic+5.241
lete+5.046
 offender+5.031
amaz+5.007
Neg logit contributions
 licences-5.075
abouts-4.990
 preferences-4.940
 Preferences-4.887
Politics-4.821
Token an
Token ID281
Feature activation+5.02e+01

Pos logit contributions
 unintentional+9.413
 epidemic+9.101
 unsuspecting+8.831
 involuntary+8.822
amaz+8.770
Neg logit contributions
abouts-8.021
 licences-7.750
 Preferences-7.668
 preferences-7.656
Politics-7.543
Token NFL
Token ID5134
Feature activation+1.85e+01

Pos logit contributions
 unintentional+32.304
 unsuspecting+31.064
 epidemic+30.914
 addict+30.263
 elderly+29.778
Neg logit contributions
ヴァ-20.384
abouts-19.845
 Preferences-17.964
 licences-17.715
 preferences-17.635
Token owner
Token ID4870
Feature activation+2.51e+00

Pos logit contributions
 epidemic+15.227
 unintentional+14.941
 offender+14.739
 addict+14.563
lete+14.433
Neg logit contributions
abouts-12.523
 licences-12.102
 Preferences-11.814
 preferences-11.809
Politics-11.562
Token dancing
Token ID15360
Feature activation+2.07e+00

Pos logit contributions
 unintentional+2.267
 epidemic+2.251
amaz+2.190
lete+2.189
 offender+2.156
Neg logit contributions
 licences-1.941
abouts-1.920
 preferences-1.915
 Preferences-1.859
Politics-1.857
Token.
Token ID13
Feature activation+3.04e+00

Pos logit contributions
 unintentional+1.774
 epidemic+1.761
amaz+1.713
lete+1.711
 offender+1.683
Neg logit contributions
 licences-1.693
 preferences-1.672
abouts-1.670
Politics-1.622
 Preferences-1.621
Token
Token ID198
Feature activation-5.21e-01

Pos logit contributions
 unintentional+2.315
 epidemic+2.301
amaz+2.233
lete+2.232
 offender+2.188
Neg logit contributions
 licences-2.769
abouts-2.733
 preferences-2.731
 finances-2.659
 ambitions-2.656
Token the
Token ID262
Feature activation+5.43e+00

Pos logit contributions
 unintentional+8.618
 epidemic+8.594
lete+8.218
 unsuspecting+8.206
 involuntary+8.153
Neg logit contributions
 licences-7.590
abouts-7.506
 preferences-7.148
 Preferences-7.137
Politics-7.079
Token Democratic
Token ID4390
Feature activation-9.62e-01

Pos logit contributions
 epidemic+5.103
 unintentional+5.097
lete+4.883
 unsuspecting+4.858
 offender+4.851
Neg logit contributions
 licences-3.995
abouts-3.898
 preferences-3.808
Politics-3.755
 Preferences-3.747
Token Party
Token ID3615
Feature activation-1.26e+01

Pos logit contributions
 preferences+0.718
 licences+0.709
abouts+0.700
 ambitions+0.689
 finances+0.689
Neg logit contributions
 unintentional-0.894
amaz-0.887
 epidemic-0.881
lete-0.867
 involuntary-0.853
Token but
Token ID475
Feature activation+1.67e+01

Pos logit contributions
 preferences+9.331
 finances+8.991
Politics+8.959
abouts+8.912
 ambitions+8.879
Neg logit contributions
amaz-10.677
 unintentional-10.464
lete-10.186
 involuntary-10.118
 epidemic-10.047
Token as
Token ID355
Feature activation+1.07e+01

Pos logit contributions
 epidemic+11.733
 unintentional+11.593
lete+11.108
 involuntary+11.011
 offender+10.987
Neg logit contributions
 licences-13.494
abouts-12.819
616-12.720
 preferences-12.634
 Preferences-12.511
Token an
Token ID281
Feature activation+4.99e+01

Pos logit contributions
 epidemic+7.498
 unintentional+7.471
lete+7.156
amaz+7.015
 offender+7.003
Neg logit contributions
 licences-9.881
abouts-9.626
 preferences-9.481
 Preferences-9.389
 finances-9.280
Token up
Token ID510
Feature activation+1.50e+01

Pos logit contributions
 educator+30.615
 epidemic+30.292
 unintentional+29.813
 adult+29.641
 addict+29.633
Neg logit contributions
 licences-19.750
ヴァ-18.687
abouts-18.684
616-17.690
 Preferences-17.618
Tokenholder
Token ID13829
Feature activation+1.50e+01

Pos logit contributions
 epidemic+10.959
lete+10.692
 unintentional+10.495
 offender+10.461
amaz+10.269
Neg logit contributions
 licences-12.718
 preferences-12.024
 Preferences-11.965
 finances-11.937
abouts-11.849
Token of
Token ID286
Feature activation+1.07e+01

Pos logit contributions
 epidemic+10.449
 unintentional+10.254
lete+10.081
 offender+10.073
 addict+9.943
Neg logit contributions
 licences-12.882
abouts-12.295
 preferences-12.123
 finances-12.098
 Preferences-12.023
Token the
Token ID262
Feature activation-2.78e+00

Pos logit contributions
 epidemic+9.279
 unintentional+9.264
 involuntary+8.831
lete+8.823
 unsuspecting+8.801
Neg logit contributions
abouts-8.002
 licences-7.965
Politics-7.540
616-7.535
 preferences-7.488
Token ideas
Token ID4213
Feature activation-8.57e-01

Pos logit contributions
 preferences+2.134
 licences+2.088
 ambitions+2.058
 finances+2.050
abouts+2.043
Neg logit contributions
 unintentional-2.509
amaz-2.498
 epidemic-2.463
lete-2.460
 offender-2.394
TokenFirst
Token ID5962
Feature activation+7.30e+00

Pos logit contributions
 unintentional+10.277
lete+10.201
 epidemic+10.185
amaz+10.140
 offender+9.726
Neg logit contributions
 licences-7.884
 preferences-7.631
abouts-7.455
 finances-7.412
 ambitions-7.367
Token of
Token ID286
Feature activation+1.20e+01

Pos logit contributions
 unintentional+6.213
 epidemic+6.142
lete+5.965
 offender+5.961
amaz+5.948
Neg logit contributions
 licences-5.798
abouts-5.713
 preferences-5.630
 finances-5.572
 ambitions-5.507
Token all
Token ID477
Feature activation-2.30e+00

Pos logit contributions
 epidemic+3.504
 unintentional+3.327
 offender+3.168
 aggravated+3.050
lete+3.023
Neg logit contributions
abouts-15.380
 licences-15.332
 preferences-14.990
 Preferences-14.918
 finances-14.838
Token,
Token ID11
Feature activation-1.72e+00

Pos logit contributions
 licences+1.939
 preferences+1.935
abouts+1.927
 finances+1.880
Politics+1.877
Neg logit contributions
 unintentional-1.880
 epidemic-1.871
amaz-1.828
lete-1.820
 involuntary-1.785
Token as
Token ID355
Feature activation+7.67e+00

Pos logit contributions
 licences+1.236
 preferences+1.236
abouts+1.221
 finances+1.197
Politics+1.194
Neg logit contributions
 unintentional-1.624
 epidemic-1.618
amaz-1.595
lete-1.590
 involuntary-1.555
Token an
Token ID281
Feature activation+4.95e+01

Pos logit contributions
 unintentional+6.368
 epidemic+6.272
lete+6.024
amaz+5.971
 unsuspecting+5.966
Neg logit contributions
 licences-6.377
abouts-6.295
 preferences-6.206
 Preferences-6.063
 ambitions-6.009
Token American
Token ID1605
Feature activation+1.73e+01

Pos logit contributions
 adult+28.652
 elderly+28.577
 unintentional+28.220
 American+28.169
 educator+27.932
Neg logit contributions
 licences-21.854
ヴァ-21.359
abouts-19.973
 Preferences-19.541
 favourites-18.987
Token,
Token ID11
Feature activation-2.29e+00

Pos logit contributions
 unintentional+13.946
 epidemic+13.861
 offender+13.676
 addict+13.574
 elderly+13.489
Neg logit contributions
 licences-12.306
abouts-11.654
 Preferences-11.370
 preferences-11.318
 finances-11.157
Token I
Token ID314
Feature activation-2.58e-02

Pos logit contributions
 preferences+1.647
 licences+1.643
abouts+1.627
 finances+1.595
Politics+1.588
Neg logit contributions
 unintentional-2.152
 epidemic-2.150
amaz-2.119
lete-2.108
 offender-2.068
Token oppose
Token ID12856
Feature activation+4.05e+00

Pos logit contributions
 licences+0.019
 preferences+0.019
abouts+0.019
 finances+0.019
Politics+0.018
Neg logit contributions
 unintentional-0.024
 epidemic-0.024
amaz-0.023
lete-0.023
 involuntary-0.023
Token what
Token ID644
Feature activation-4.91e+00

Pos logit contributions
 unintentional+3.642
 epidemic+3.610
lete+3.500
amaz+3.489
 involuntary+3.461
Neg logit contributions
 licences-3.126
abouts-3.100
 preferences-3.071
 Preferences-2.996
 finances-2.978
Token
Token ID447
Feature activation-1.75e+00

Pos logit contributions
 licences+2.251
 preferences+2.250
abouts+2.244
Politics+2.239
 Preferences+2.196
Neg logit contributions
 unintentional-3.485
 epidemic-3.471
lete-3.372
 involuntary-3.364
amaz-3.362
Token
Token ID250
Feature activation+8.50e+00

Pos logit contributions
 licences+1.475
 preferences+1.472
abouts+1.455
Politics+1.438
 Preferences+1.429
Neg logit contributions
 unintentional-1.453
 epidemic-1.438
amaz-1.401
lete-1.385
 unsuspecting-1.376
TokenI
Token ID40
Feature activation+6.37e+00

Pos logit contributions
 unintentional+8.388
 epidemic+8.351
amaz+8.334
lete+8.324
 offender+8.020
Neg logit contributions
 licences-5.479
 preferences-5.325
abouts-5.193
 finances-5.160
 ambitions-5.124
Token,
Token ID11
Feature activation+5.62e+00

Pos logit contributions
 unintentional+5.708
 epidemic+5.665
amaz+5.579
lete+5.543
 offender+5.461
Neg logit contributions
 licences-4.798
 preferences-4.699
abouts-4.668
 Preferences-4.596
 finances-4.543
Token as
Token ID355
Feature activation+7.91e+00

Pos logit contributions
 unintentional+4.989
 epidemic+4.938
amaz+4.831
lete+4.814
 offender+4.741
Neg logit contributions
 licences-4.403
 preferences-4.279
abouts-4.276
 finances-4.167
 Preferences-4.152
Token an
Token ID281
Feature activation+4.94e+01

Pos logit contributions
 unintentional+6.441
 epidemic+6.337
lete+6.105
amaz+6.056
 unsuspecting+6.025
Neg logit contributions
 licences-6.710
abouts-6.537
 preferences-6.503
 Preferences-6.366
 finances-6.337
Token American
Token ID1605
Feature activation+2.57e+01

Pos logit contributions
 adult+29.299
 elderly+28.912
 unintentional+28.857
 educator+28.550
 American+28.370
Neg logit contributions
 licences-21.488
ヴァ-20.407
abouts-18.803
 favourites-18.567
 Preferences-18.493
Token,
Token ID11
Feature activation+2.93e+00

Pos logit contributions
 adult+18.407
 offender+18.346
 unintentional+18.341
 elderly+18.210
 epidemic+18.132
Neg logit contributions
 licences-16.313
abouts-14.868
 Preferences-14.461
 ambitions-14.404
 finances-14.357
Token oppose
Token ID12856
Feature activation+6.68e+00

Pos logit contributions
 unintentional+2.675
 epidemic+2.656
amaz+2.595
lete+2.587
 offender+2.547
Neg logit contributions
 licences-2.241
abouts-2.206
 preferences-2.202
 finances-2.132
 Preferences-2.129
Token what
Token ID644
Feature activation-1.14e+01

Pos logit contributions
 unintentional+6.027
 epidemic+5.970
lete+5.777
 involuntary+5.745
amaz+5.738
Neg logit contributions
 licences-5.012
abouts-4.997
 preferences-4.913
 Preferences-4.833
 finances-4.763
Token my
Token ID616
Feature activation-4.36e+00

Pos logit contributions
abouts+9.021
 preferences+9.011
 licences+8.845
Politics+8.793
 ambitions+8.776
Neg logit contributions
 unintentional-8.663
lete-8.596
 epidemic-8.586
amaz-8.481
 involuntary-8.307
Token Dwar
Token ID21469
Feature activation+4.28e-01

Pos logit contributions
 unintentional+0.323
 epidemic+0.321
amaz+0.315
lete+0.314
 offender+0.308
Neg logit contributions
 licences-0.296
 preferences-0.293
abouts-0.292
 finances-0.284
 ambitions-0.283
Tokenves
Token ID1158
Feature activation+3.74e-01

Pos logit contributions
 unintentional+0.316
 epidemic+0.314
amaz+0.303
lete+0.301
 offender+0.299
Neg logit contributions
 licences-0.402
abouts-0.400
 preferences-0.398
 finances-0.389
 ambitions-0.388
Token.
Token ID13
Feature activation-7.02e-02

Pos logit contributions
 unintentional+0.288
 epidemic+0.285
amaz+0.278
lete+0.277
 offender+0.272
Neg logit contributions
 licences-0.339
abouts-0.336
 preferences-0.336
 finances-0.327
 ambitions-0.326
Token After
Token ID2293
Feature activation+4.08e+00

Pos logit contributions
 licences+0.060
abouts+0.059
 preferences+0.059
 Preferences+0.058
Politics+0.057
Neg logit contributions
 unintentional-0.058
 epidemic-0.057
amaz-0.056
lete-0.056
 offender-0.055
Token surviving
Token ID16997
Feature activation+9.38e+00

Pos logit contributions
 unintentional+3.551
 epidemic+3.519
amaz+3.417
lete+3.385
 unsuspecting+3.354
Neg logit contributions
 licences-3.268
abouts-3.240
 preferences-3.224
 Preferences-3.153
Politics-3.139
Token an
Token ID281
Feature activation+4.89e+01

Pos logit contributions
 unintentional+7.270
 epidemic+7.266
 involuntary+6.805
 unsuspecting+6.759
amaz+6.747
Neg logit contributions
 licences-8.072
abouts-8.003
 preferences-7.961
Politics-7.944
 Preferences-7.867
Token attack
Token ID1368
Feature activation+5.53e+00

Pos logit contributions
 epidemic+30.785
 unintentional+29.281
 unsuccessful+28.263
 unsuspecting+27.790
 involuntary+27.752
Neg logit contributions
ヴァ-21.201
 licences-21.052
 Preferences-21.013
Politics-20.965
 preferences-20.631
Token,
Token ID11
Feature activation-4.46e+00

Pos logit contributions
 epidemic+4.270
 unintentional+4.259
lete+4.051
amaz+4.017
 involuntary+3.991
Neg logit contributions
 licences-4.978
 preferences-4.883
abouts-4.835
Politics-4.806
 Preferences-4.801
Token they
Token ID484
Feature activation+1.07e+01

Pos logit contributions
 licences+3.267
 preferences+3.233
abouts+3.192
 finances+3.147
 ambitions+3.140
Neg logit contributions
 unintentional-4.091
 epidemic-4.060
lete-4.045
amaz-4.042
 offender-3.953
Token are
Token ID389
Feature activation+6.44e+00

Pos logit contributions
 epidemic+8.532
 unintentional+8.420
 unsuspecting+8.161
 involuntary+8.093
 aggravated+8.085
Neg logit contributions
 Preferences-8.569
Politics-8.565
 licences-8.488
 preferences-8.379
abouts-8.314
Token pursued
Token ID19189
Feature activation+1.62e+00

Pos logit contributions
 unintentional+4.715
 epidemic+4.707
 unsuspecting+4.482
amaz+4.468
 involuntary+4.447
Neg logit contributions
 licences-5.931
 preferences-5.856
 Preferences-5.834
abouts-5.811
Politics-5.807
Token and
Token ID290
Feature activation-6.68e+00

Pos logit contributions
 unintentional+0.175
 epidemic+0.173
amaz+0.171
lete+0.169
 offender+0.166
Neg logit contributions
 licences-0.138
 preferences-0.136
abouts-0.136
 finances-0.132
 Preferences-0.131
Token sol
Token ID1540
Feature activation-4.25e+00

Pos logit contributions
 licences+5.426
 preferences+5.405
abouts+5.289
 finances+5.215
 ambitions+5.130
Neg logit contributions
amaz-5.606
lete-5.496
 unintentional-5.458
 epidemic-5.423
 offender-5.246
Tokenes
Token ID274
Feature activation-6.18e+00

Pos logit contributions
abouts+4.107
 licences+4.043
 preferences+3.988
 finances+3.896
Politics+3.867
Neg logit contributions
 unintentional-3.029
 epidemic-2.970
 offender-2.870
 involuntary-2.860
lete-2.838
Token")
Token ID4943
Feature activation+8.20e+00

Pos logit contributions
 preferences+4.860
 licences+4.832
abouts+4.756
 finances+4.648
 Preferences+4.602
Neg logit contributions
amaz-5.273
 unintentional-5.234
 epidemic-5.139
lete-5.059
 unsuspecting-5.014
Token is
Token ID318
Feature activation+1.27e+01

Pos logit contributions
 epidemic+6.584
 unintentional+6.487
lete+6.218
 offender+6.205
 aggravated+6.178
Neg logit contributions
abouts-6.854
 licences-6.837
Politics-6.657
 preferences-6.655
 ambitions-6.544
Token an
Token ID281
Feature activation+4.88e+01

Pos logit contributions
 epidemic+9.581
 unintentional+9.314
 aggravated+8.977
 involuntary+8.896
lete+8.866
Neg logit contributions
abouts-10.955
 licences-10.871
Politics-10.682
 preferences-10.507
 Preferences-10.464
Token autos
Token ID44619
Feature activation+1.59e+01

Pos logit contributions
 epidemic+29.862
 unintentional+28.504
 adult+28.456
 involuntary+28.218
 elderly+27.671
Neg logit contributions
Politics-20.647
 Talks-19.978
ヴァ-19.793
 licences-19.661
abouts-19.558
Tokenomal
Token ID18048
Feature activation+1.91e+01

Pos logit contributions
lete+2.525
ounding+2.258
 epidemic+2.235
amaz+1.984
agically+1.920
Neg logit contributions
 licences-22.375
 preferences-21.871
 ambitions-21.849
 finances-21.847
 Preferences-21.593
Token dominant
Token ID11410
Feature activation+1.47e+01

Pos logit contributions
 epidemic+11.623
 unintentional+11.049
 offender+10.998
 involuntary+10.681
 aggravated+10.479
Neg logit contributions
abouts-17.053
Politics-16.845
 licences-16.789
 finances-16.247
 ambitions-16.040
Token ker
Token ID41927
Feature activation+1.22e+01

Pos logit contributions
 epidemic+12.084
 unintentional+11.847
 offender+11.431
 involuntary+11.358
 adult+11.153
Neg logit contributions
Politics-11.143
abouts-11.066
 licences-10.879
 Preferences-10.592
 preferences-10.500
Tokenat
Token ID265
Feature activation+1.71e+01

Pos logit contributions
lete+7.170
amaz+6.768
 epidemic+6.737
agically+6.389
 unintentional+6.367
Neg logit contributions
 licences-12.727
 preferences-12.263
 ambitions-12.149
 Preferences-12.147
Politics-12.132
Token to
Token ID284
Feature activation+5.89e+00

Pos logit contributions
 unintentional+0.655
 epidemic+0.650
amaz+0.631
lete+0.629
 offender+0.616
Neg logit contributions
 licences-0.783
 preferences-0.774
abouts-0.772
 finances-0.752
 Preferences-0.752
Token sub
Token ID850
Feature activation+7.64e+00

Pos logit contributions
 epidemic+4.414
 unintentional+4.362
amaz+4.207
lete+4.198
 unsuspecting+4.158
Neg logit contributions
 licences-5.360
abouts-5.320
 preferences-5.272
Politics-5.185
 Preferences-5.176
Tokendue
Token ID23301
Feature activation-1.28e-01

Pos logit contributions
 unintentional+3.751
lete+3.711
 epidemic+3.624
amaz+3.519
 unsuspecting+3.346
Neg logit contributions
 licences-8.847
abouts-8.796
 preferences-8.706
Politics-8.608
 Preferences-8.589
Token Greece
Token ID10315
Feature activation-2.53e+00

Pos logit contributions
 licences+0.098
 preferences+0.098
abouts+0.097
 finances+0.095
 ambitions+0.094
Neg logit contributions
 unintentional-0.117
 epidemic-0.115
amaz-0.113
lete-0.113
 offender-0.111
Token.
Token ID13
Feature activation-1.74e+00

Pos logit contributions
abouts+2.252
 licences+2.245
 preferences+2.239
 finances+2.195
 ambitions+2.180
Neg logit contributions
 unintentional-1.956
 epidemic-1.931
amaz-1.892
lete-1.877
 offender-1.853
Token An
Token ID1052
Feature activation+4.86e+01

Pos logit contributions
 licences+1.511
abouts+1.506
 preferences+1.501
Politics+1.475
 Preferences+1.473
Neg logit contributions
 unintentional-1.395
 epidemic-1.379
lete-1.343
amaz-1.340
 involuntary-1.327
Token ep
Token ID2462
Feature activation+7.31e+00

Pos logit contributions
 epidemic+32.627
 unsuspecting+30.871
 unintentional+30.574
 astounding+30.271
 estimated+30.149
Neg logit contributions
 Preferences-18.518
 licences-18.072
 preferences-17.717
abouts-17.437
ヴァ-16.383
Tokenigraph
Token ID45920
Feature activation+9.67e+00

Pos logit contributions
 epidemic+4.776
lete+4.746
 unintentional+4.712
amaz+4.642
 offender+4.514
Neg logit contributions
 licences-7.303
 preferences-7.184
 Preferences-7.074
abouts-7.073
 ambitions-6.974
Token from
Token ID422
Feature activation-7.02e+00

Pos logit contributions
 epidemic+8.074
 unintentional+7.994
lete+7.963
amaz+7.667
 offender+7.639
Neg logit contributions
 licences-7.564
abouts-7.488
Politics-7.368
 preferences-7.327
 Preferences-7.280
Token the
Token ID262
Feature activation-5.00e+00

Pos logit contributions
 preferences+5.576
 licences+5.535
abouts+5.453
 ambitions+5.413
 finances+5.411
Neg logit contributions
 unintentional-5.809
 epidemic-5.649
amaz-5.633
lete-5.622
 involuntary-5.538
Token Greek
Token ID8312
Feature activation+2.73e+00

Pos logit contributions
 licences+4.333
 preferences+4.327
 finances+4.224
 ambitions+4.208
abouts+4.172
Neg logit contributions
 unintentional-3.887
amaz-3.838
lete-3.800
 epidemic-3.784
 involuntary-3.695
Token all
Token ID477
Feature activation-3.77e+00

Pos logit contributions
 unintentional+1.402
 epidemic+1.392
amaz+1.362
lete+1.356
 offender+1.334
Neg logit contributions
 licences-1.198
 preferences-1.183
abouts-1.181
 finances-1.145
Politics-1.139
Token passed
Token ID3804
Feature activation+3.76e+00

Pos logit contributions
 licences+2.913
 preferences+2.888
abouts+2.854
 finances+2.808
 ambitions+2.766
Neg logit contributions
amaz-3.360
 unintentional-3.342
 epidemic-3.282
lete-3.239
 unsuspecting-3.171
Token.
Token ID13
Feature activation+4.62e+00

Pos logit contributions
 unintentional+2.957
 epidemic+2.924
lete+2.848
amaz+2.811
 involuntary+2.796
Neg logit contributions
 licences-3.342
abouts-3.305
 preferences-3.290
Politics-3.214
 Preferences-3.206
Token
Token ID198
Feature activation+7.47e+00

Pos logit contributions
 unintentional+3.599
 epidemic+3.562
lete+3.483
amaz+3.455
 offender+3.378
Neg logit contributions
 licences-4.134
 preferences-4.061
abouts-4.042
 finances-3.961
 ambitions-3.941
Token
Token ID198
Feature activation+5.41e+00

Pos logit contributions
 epidemic+4.457
 unintentional+4.341
lete+4.274
amaz+4.238
 addict+4.149
Neg logit contributions
 licences-7.721
abouts-7.540
 preferences-7.501
 finances-7.468
 ambitions-7.438
TokenAn
Token ID2025
Feature activation+4.86e+01

Pos logit contributions
 unintentional+4.679
 epidemic+4.638
lete+4.626
amaz+4.558
 offender+4.433
Neg logit contributions
 licences-4.355
 preferences-4.254
abouts-4.221
 finances-4.134
 ambitions-4.105
Token all
Token ID477
Feature activation+1.61e+01

Pos logit contributions
 unintentional+32.244
 epidemic+31.821
 elderly+31.354
 unsuspecting+30.719
 adult+30.690
Neg logit contributions
 licences-17.847
ヴァ-16.532
 preferences-16.261
abouts-15.536
 Preferences-15.390
Tokenusion
Token ID4241
Feature activation+1.20e+01

Pos logit contributions
 unintentional+11.278
 epidemic+11.029
lete+10.826
 involuntary+10.600
 unsuspecting+10.459
Neg logit contributions
 licences-13.612
 preferences-13.487
 Preferences-13.392
abouts-13.307
 ambitions-13.043
Token to
Token ID284
Feature activation+1.85e+00

Pos logit contributions
 unintentional+8.452
 epidemic+8.186
lete+7.846
 offender+7.778
 aggravated+7.716
Neg logit contributions
 licences-10.959
abouts-10.941
 preferences-10.555
Politics-10.538
 Preferences-10.491
Token fat
Token ID3735
Feature activation+4.66e+00

Pos logit contributions
 unintentional+1.535
 epidemic+1.523
amaz+1.480
lete+1.479
 offender+1.453
Neg logit contributions
 licences-1.564
abouts-1.545
 preferences-1.544
 finances-1.497
Politics-1.495
Token cat
Token ID3797
Feature activation+6.31e+00

Pos logit contributions
 epidemic+3.537
 unintentional+3.525
lete+3.397
amaz+3.368
 offender+3.345
Neg logit contributions
 licences-4.245
abouts-4.214
 preferences-4.139
 Preferences-4.058
Politics-4.056
Token harsh
Token ID11859
Feature activation+9.04e+00

Pos logit contributions
 epidemic+10.714
 unintentional+10.523
 aggravated+10.176
 involuntary+10.020
 offender+9.870
Neg logit contributions
 licences-12.159
abouts-12.015
Politics-11.873
 Preferences-11.813
 preferences-11.523
Token,
Token ID11
Feature activation+3.13e+00

Pos logit contributions
 epidemic+6.578
 unintentional+6.547
 offender+6.313
lete+6.214
 aggravated+6.206
Neg logit contributions
 licences-8.174
 preferences-7.895
Politics-7.887
abouts-7.870
 Preferences-7.840
Token but
Token ID475
Feature activation+1.04e+01

Pos logit contributions
 unintentional+2.770
 epidemic+2.752
lete+2.663
amaz+2.655
 offender+2.635
Neg logit contributions
 licences-2.476
abouts-2.446
 preferences-2.437
Politics-2.368
 Preferences-2.359
Token it
Token ID340
Feature activation+1.77e+01

Pos logit contributions
 unintentional+9.263
 epidemic+9.160
 involuntary+8.859
 offender+8.851
 aggravated+8.776
Neg logit contributions
abouts-7.513
 licences-7.470
Politics-7.331
 preferences-7.256
616-7.204
Token is
Token ID318
Feature activation+1.78e+01

Pos logit contributions
 epidemic+14.229
 aggravated+14.164
 unintentional+14.127
 involuntary+13.648
 offender+13.556
Neg logit contributions
Politics-12.170
 Preferences-11.988
 licences-11.786
abouts-11.726
 preferences-11.431
Token an
Token ID281
Feature activation+4.83e+01

Pos logit contributions
 epidemic+14.259
 unintentional+14.160
 involuntary+13.653
 aggravated+13.549
 offender+13.080
Neg logit contributions
abouts-12.663
 licences-12.631
 Preferences-12.349
Politics-12.160
 preferences-12.032
Token odd
Token ID5629
Feature activation+1.38e+01

Pos logit contributions
 epidemic+31.729
 unintentional+30.347
 egregious+29.781
 involuntary+29.746
 offender+29.333
Neg logit contributions
ヴァ-19.008
abouts-17.932
 Talks-17.908
 licences-17.897
 Preferences-17.512
Token circumstance
Token ID25179
Feature activation+1.23e+01

Pos logit contributions
 epidemic+12.374
 unintentional+12.184
 offender+12.015
 involuntary+11.710
lete+11.568
Neg logit contributions
 licences-9.575
abouts-9.220
 Preferences-9.091
 preferences-9.023
Politics-8.975
Token.
Token ID13
Feature activation+6.23e+00

Pos logit contributions
 unintentional+9.245
 epidemic+9.187
 offender+8.744
lete+8.702
 involuntary+8.699
Neg logit contributions
 licences-10.199
abouts-9.939
 Preferences-9.850
Politics-9.824
 preferences-9.800
Token With
Token ID2080
Feature activation+8.52e+00

Pos logit contributions
 unintentional+4.565
 epidemic+4.529
lete+4.378
amaz+4.366
 offender+4.331
Neg logit contributions
 licences-5.745
 preferences-5.656
abouts-5.651
 ambitions-5.538
 finances-5.503
Token many
Token ID867
Feature activation+1.06e+01

Pos logit contributions
 unintentional+7.091
 epidemic+7.048
 offender+6.753
lete+6.726
 involuntary+6.690
Neg logit contributions
abouts-6.933
 licences-6.772
Politics-6.668
 preferences-6.652
616-6.534
Token promoter
Token ID32972
Feature activation-9.79e+00

Pos logit contributions
 licences+12.264
 preferences+12.185
 finances+12.021
 ambitions+11.931
abouts+11.853
Neg logit contributions
amaz-11.289
 unintentional-11.025
 epidemic-10.850
 involuntary-10.608
lete-10.599
Token who
Token ID508
Feature activation-1.69e+01

Pos logit contributions
 licences+8.041
 preferences+8.004
abouts+7.861
 ambitions+7.803
 finances+7.778
Neg logit contributions
amaz-7.658
 unintentional-7.569
 epidemic-7.367
lete-7.336
 involuntary-7.180
Token has
Token ID468
Feature activation-2.00e+01

Pos logit contributions
 finances+13.283
 licences+13.271
 preferences+12.864
abouts+12.803
 ambitions+12.775
Neg logit contributions
amaz-12.320
lete-11.862
 unintentional-11.860
 epidemic-11.769
ounding-11.363
Token no
Token ID645
Feature activation-3.05e+01

Pos logit contributions
 ambitions+17.110
 licences+17.023
 preferences+16.664
 finances+16.649
 plans+15.507
Neg logit contributions
amaz-12.327
lete-11.861
ogether-11.391
ounding-11.298
 unintentional-11.158
Token long
Token ID890
Feature activation-1.51e+01

Pos logit contributions
 ambitions+23.243
 preferences+22.839
 finances+22.828
 licences+22.475
 plans+22.002
Neg logit contributions
amaz-16.502
lete-14.955
ogether-14.949
agically-14.535
ocious-14.386
Token term
Token ID3381
Feature activation-4.29e+01

Pos logit contributions
 preferences+13.225
 ambitions+12.947
 licences+12.884
 finances+12.663
abouts+12.286
Neg logit contributions
amaz-10.404
 unintentional-10.082
lete-9.896
 epidemic-9.709
 unsuspecting-9.483
Token thinking
Token ID3612
Feature activation-3.35e+01

Pos logit contributions
 ambitions+29.512
 plans+28.437
 preferences+27.863
 finances+27.712
 interests+27.440
Neg logit contributions
amaz-19.457
lete-19.014
ogether-18.466
agically-17.526
oche-17.381
Token and
Token ID290
Feature activation-2.12e+01

Pos logit contributions
 ambitions+23.732
 preferences+23.645
 licences+22.503
 plans+22.456
 finances+22.430
Neg logit contributions
amaz-18.449
 unintentional-16.991
ogether-16.766
lete-16.639
ilant-16.259
Token a
Token ID257
Feature activation-2.02e+01

Pos logit contributions
 ambitions+17.667
 preferences+17.583
 finances+17.450
 licences+17.413
 plans+16.205
Neg logit contributions
amaz-13.367
ogether-12.315
lete-12.201
ounding-11.978
 unintentional-11.972
Token governing
Token ID15030
Feature activation-8.71e+00

Pos logit contributions
 preferences+16.332
 finances+16.252
 licences+16.177
 ambitions+16.146
 politics+15.613
Neg logit contributions
amaz-12.596
 unintentional-11.638
 epidemic-11.549
ogether-11.420
lete-11.391
Token body
Token ID1767
Feature activation-7.50e+00

Pos logit contributions
 preferences+6.811
 licences+6.776
 ambitions+6.654
 finances+6.576
 Preferences+6.319
Neg logit contributions
amaz-7.614
 unintentional-7.350
lete-7.147
 epidemic-7.098
 involuntary-7.005
Token meters
Token ID10700
Feature activation+2.92e+00

Pos logit contributions
 unintentional+4.674
 epidemic+4.625
lete+4.504
amaz+4.480
 involuntary+4.460
Neg logit contributions
 licences-3.108
abouts-3.065
 preferences-3.052
Politics-2.990
 finances-2.954
Token per
Token ID583
Feature activation+1.02e+01

Pos logit contributions
 unintentional+2.230
 epidemic+2.211
amaz+2.137
lete+2.134
 offender+2.096
Neg logit contributions
 licences-2.659
 preferences-2.628
abouts-2.626
Politics-2.567
 finances-2.565
Token second
Token ID1218
Feature activation+5.73e+00

Pos logit contributions
 epidemic+6.581
 unintentional+6.539
 involuntary+6.239
lete+6.227
 unsuspecting+6.102
Neg logit contributions
 licences-10.050
Politics-9.942
 preferences-9.893
abouts-9.878
 finances-9.739
Token (
Token ID357
Feature activation-1.60e+01

Pos logit contributions
 unintentional+4.418
 epidemic+4.380
lete+4.225
amaz+4.188
 offender+4.146
Neg logit contributions
 licences-5.109
abouts-5.024
 preferences-5.013
Politics-4.971
 finances-4.931
Token5
Token ID20
Feature activation-1.69e+00

Pos logit contributions
616+10.185
abouts+9.812
 licences+9.257
Politics+9.223
 preferences+9.214
Neg logit contributions
 unintentional-14.149
 epidemic-14.041
 offender-13.732
 involuntary-13.655
 unsuspecting-13.637
Token,
Token ID11
Feature activation-3.73e+01

Pos logit contributions
 licences+1.471
 preferences+1.457
abouts+1.453
 finances+1.412
 ambitions+1.410
Neg logit contributions
 unintentional-1.362
 epidemic-1.353
amaz-1.321
lete-1.315
 offender-1.291
Token9
Token ID24
Feature activation-5.00e+00

Pos logit contributions
616+23.912
abouts+20.760
70710+20.081
Politics+19.527
 licences+19.494
Neg logit contributions
 unintentional-18.945
 epidemic-18.676
 unsuspecting-18.015
 involuntary-18.014
 offender-17.782
Token46
Token ID3510
Feature activation+2.49e-02

Pos logit contributions
 licences+2.131
 preferences+2.123
abouts+2.117
Politics+2.056
616+2.041
Neg logit contributions
 unintentional-6.131
 epidemic-6.085
amaz-6.033
 involuntary-5.954
lete-5.950
Token miles
Token ID4608
Feature activation-1.76e+00

Pos logit contributions
 unintentional+0.023
 epidemic+0.023
amaz+0.023
lete+0.023
 offender+0.022
Neg logit contributions
 licences-0.018
abouts-0.018
 preferences-0.018
 finances-0.018
 ambitions-0.017
Token per
Token ID583
Feature activation+1.26e+01

Pos logit contributions
 licences+1.521
abouts+1.504
 preferences+1.503
 ambitions+1.460
 finances+1.455
Neg logit contributions
 unintentional-1.415
 epidemic-1.404
amaz-1.380
lete-1.374
 offender-1.344
Token hour
Token ID1711
Feature activation-6.18e+00

Pos logit contributions
 epidemic+3.759
 unintentional+3.615
 hour+3.178
 unsuspecting+3.168
 involuntary+3.142
Neg logit contributions
Politics-16.410
 licences-16.279
abouts-16.131
 preferences-16.070
 Preferences-16.042
Token the
Token ID262
Feature activation-1.35e+01

Pos logit contributions
 licences+18.692
 preferences+18.345
 finances+18.068
 boundaries+17.487
 Preferences+17.449
Neg logit contributions
amaz-13.740
lete-13.331
 unintentional-12.562
 epidemic-12.282
 addict-12.165
Token Commonwealth
Token ID17165
Feature activation-2.73e+01

Pos logit contributions
 preferences+11.049
 licences+10.977
 finances+10.636
 boundaries+10.411
 Preferences+10.388
Neg logit contributions
amaz-10.430
lete-10.194
 unintentional-10.043
 epidemic-9.876
ounding-9.486
Token does
Token ID857
Feature activation-2.28e+01

Pos logit contributions
 licences+20.138
 finances+19.976
 preferences+19.946
 Preferences+19.017
 ambitions+18.980
Neg logit contributions
amaz-16.772
lete-15.730
 unintentional-15.246
ocious-14.993
ounding-14.993
Token not
Token ID407
Feature activation-3.11e+01

Pos logit contributions
 preferences+15.059
 licences+14.899
 finances+14.820
 ambitions+14.319
abouts+13.909
Neg logit contributions
amaz-16.744
 unintentional-16.196
lete-16.129
letes-15.681
 addict-15.583
Token have
Token ID423
Feature activation-2.70e+01

Pos logit contributions
 licences+20.175
 preferences+19.769
 finances+19.425
 ambitions+18.531
 boundaries+18.298
Neg logit contributions
amaz-19.068
 unintentional-19.022
ocious-18.313
lete-17.962
ortunately-17.609
Token legislative
Token ID10828
Feature activation-3.50e+01

Pos logit contributions
 preferences+21.142
 licences+20.828
 ambitions+20.312
 finances+20.049
 boundaries+19.664
Neg logit contributions
amaz-15.502
lete-14.140
 unintentional-14.006
agically-13.823
ilant-13.626
Token power
Token ID1176
Feature activation-1.84e+01

Pos logit contributions
 preferences+24.096
 licences+23.588
 ambitions+23.470
 finances+22.444
 levers+22.361
Neg logit contributions
amaz-19.671
ilant-17.674
lete-17.416
ocious-17.176
 unintentional-17.028
Token.
Token ID13
Feature activation+1.25e-02

Pos logit contributions
 licences+13.012
 preferences+12.735
abouts+12.464
 finances+12.124
 ambitions+12.004
Neg logit contributions
amaz-14.129
 unintentional-14.104
 epidemic-13.887
 unsuspecting-13.719
lete-13.584
Token
Token ID198
Feature activation-4.83e+00

Pos logit contributions
 unintentional+0.010
 epidemic+0.010
amaz+0.009
lete+0.009
 involuntary+0.009
Neg logit contributions
 licences-0.011
 preferences-0.011
abouts-0.011
 Preferences-0.011
Politics-0.011
Token
Token ID198
Feature activation-7.28e+00

Pos logit contributions
 licences+3.837
 preferences+3.835
abouts+3.804
Politics+3.756
 Preferences+3.696
Neg logit contributions
 unintentional-4.198
 epidemic-4.122
amaz-4.048
lete-4.017
 unsuspecting-3.990
TokenEach
Token ID10871
Feature activation+7.52e+00

Pos logit contributions
Politics+5.580
 preferences+5.275
abouts+5.256
 licences+5.256
616+5.244
Neg logit contributions
 unintentional-6.622
 epidemic-6.584
 unsuspecting-6.339
 involuntary-6.293
lete-6.254
Token,
Token ID11
Feature activation-2.31e+01

Pos logit contributions
 licences+3.126
 preferences+3.092
abouts+3.076
616+2.985
 ambitions+2.967
Neg logit contributions
 unintentional-3.777
 epidemic-3.770
amaz-3.736
lete-3.677
 offender-3.668
Token807
Token ID36928
Feature activation-4.42e+00

Pos logit contributions
616+16.703
 preferences+14.901
abouts+14.542
 licences+14.525
 ambitions+14.386
Neg logit contributions
 unintentional-15.444
 epidemic-15.431
amaz-15.193
 offender-15.150
 unsuspecting-14.770
Token miles
Token ID4608
Feature activation-4.29e+00

Pos logit contributions
 licences+3.169
 preferences+3.153
abouts+3.150
616+3.039
 ambitions+3.016
Neg logit contributions
 epidemic-4.134
 unintentional-4.124
amaz-4.082
lete-4.024
 offender-4.013
Token (
Token ID357
Feature activation-7.28e+00

Pos logit contributions
abouts+3.436
 licences+3.398
 preferences+3.378
616+3.263
 ambitions+3.262
Neg logit contributions
 unintentional-3.706
 epidemic-3.652
amaz-3.585
lete-3.553
 offender-3.528
Token149
Token ID19442
Feature activation-6.56e+00

Pos logit contributions
abouts+3.790
616+3.749
 licences+3.656
 preferences+3.557
Politics+3.545
Neg logit contributions
 unintentional-8.256
 epidemic-8.204
 offender-7.954
 involuntary-7.941
 unsuspecting-7.891
Token,
Token ID11
Feature activation-3.46e+01

Pos logit contributions
 licences+5.330
 preferences+5.266
abouts+5.257
616+5.113
 ambitions+5.112
Neg logit contributions
 unintentional-5.424
 epidemic-5.424
amaz-5.369
lete-5.284
 offender-5.224
Token597
Token ID43239
Feature activation-5.10e+00

Pos logit contributions
616+23.020
abouts+19.938
70710+19.417
 licences+18.793
Politics+18.583
Neg logit contributions
 unintentional-18.851
 epidemic-18.572
 offender-18.064
 involuntary-17.959
 unsuspecting-17.875
Token,
Token ID11
Feature activation-3.31e+01

Pos logit contributions
 licences+3.426
abouts+3.375
 preferences+3.370
 ambitions+3.265
 finances+3.228
Neg logit contributions
amaz-5.002
 epidemic-4.989
 unintentional-4.967
lete-4.863
 offender-4.834
Token870
Token ID46951
Feature activation-8.45e+00

Pos logit contributions
616+22.651
abouts+20.042
 licences+19.126
70710+19.076
 preferences+18.944
Neg logit contributions
 unintentional-17.678
 epidemic-17.336
 offender-16.854
amaz-16.791
 unsuspecting-16.707
Token kilometers
Token ID18212
Feature activation-1.09e+01

Pos logit contributions
 licences+4.488
abouts+4.393
 preferences+4.349
 ambitions+4.214
616+4.204
Neg logit contributions
amaz-9.251
 epidemic-9.169
 unintentional-9.106
lete-9.002
 offender-8.964
Token),
Token ID828
Feature activation-2.79e+00

Pos logit contributions
abouts+5.390
 licences+5.174
 preferences+5.010
616+4.943
 ambitions+4.877
Neg logit contributions
 unintentional-12.139
 epidemic-12.005
amaz-11.904
 involuntary-11.773
lete-11.766
Token little
Token ID1310
Feature activation-5.57e+00

Pos logit contributions
 unintentional+3.142
 epidemic+3.121
lete+3.008
amaz+2.998
 unsuspecting+2.963
Neg logit contributions
 licences-3.251
abouts-3.235
 preferences-3.191
 Preferences-3.130
Politics-3.128
Token sign
Token ID1051
Feature activation-4.48e+00

Pos logit contributions
 preferences+4.669
 licences+4.641
 finances+4.532
 ambitions+4.522
abouts+4.451
Neg logit contributions
amaz-4.505
lete-4.472
 unintentional-4.419
 epidemic-4.403
 offender-4.223
Token that
Token ID326
Feature activation-1.39e+01

Pos logit contributions
 licences+3.970
 preferences+3.931
abouts+3.923
 finances+3.828
Politics+3.816
Neg logit contributions
 unintentional-3.459
 epidemic-3.391
amaz-3.343
lete-3.335
 unsuspecting-3.281
Token the
Token ID262
Feature activation-5.17e+00

Pos logit contributions
 preferences+11.242
 licences+11.091
 finances+11.020
 ambitions+10.955
abouts+10.847
Neg logit contributions
amaz-10.357
lete-10.134
 unintentional-10.062
 epidemic-9.701
 offender-9.502
Token two
Token ID734
Feature activation-8.75e+00

Pos logit contributions
 licences+4.366
 preferences+4.356
 finances+4.280
 ambitions+4.238
abouts+4.236
Neg logit contributions
amaz-4.208
 unintentional-4.163
lete-4.104
 epidemic-4.045
 involuntary-3.943
Token bodies
Token ID5920
Feature activation-3.44e+01

Pos logit contributions
 preferences+6.725
 licences+6.723
 ambitions+6.538
abouts+6.522
 finances+6.510
Neg logit contributions
amaz-7.557
 unintentional-7.401
lete-7.369
 epidemic-7.278
 involuntary-7.055
Token are
Token ID389
Feature activation-2.64e+01

Pos logit contributions
abouts+22.589
 licences+21.541
 ambitions+20.857
 plans+20.850
 finances+20.847
Neg logit contributions
amaz-20.114
 unintentional-18.516
lete-18.366
 unsuspecting-17.585
 epidemic-17.190
Token willing
Token ID4684
Feature activation-4.91e+00

Pos logit contributions
abouts+19.677
 preferences+19.291
 ambitions+19.105
 favourites+18.968
 finances+18.892
Neg logit contributions
amaz-15.824
lete-14.999
agically-14.903
ounding-14.307
 epidemic-14.081
Token to
Token ID284
Feature activation-1.64e+01

Pos logit contributions
 licences+4.402
 preferences+4.344
abouts+4.316
 finances+4.233
 ambitions+4.214
Neg logit contributions
 unintentional-3.784
 epidemic-3.762
amaz-3.690
lete-3.627
 offender-3.589
Token compromise
Token ID13110
Feature activation-1.79e+01

Pos logit contributions
 licences+12.992
 preferences+12.914
abouts+12.548
 finances+12.433
 ambitions+12.340
Neg logit contributions
amaz-12.076
lete-11.652
 unintentional-11.474
 epidemic-10.787
 unsuspecting-10.672
Token.
Token ID13
Feature activation-5.38e+00

Pos logit contributions
 preferences+13.114
 finances+12.856
 ambitions+12.849
 licences+12.839
 frameworks+12.061
Neg logit contributions
amaz-13.706
 unintentional-13.527
lete-13.190
 epidemic-12.863
 involuntary-12.764
Token together
Token ID1978
Feature activation-1.58e+01

Pos logit contributions
 licences+11.381
 preferences+11.228
 finances+11.081
 boundaries+10.774
 frameworks+10.743
Neg logit contributions
lete-9.584
amaz-9.533
 epidemic-9.405
 unintentional-9.381
 addict-8.996
Token these
Token ID777
Feature activation-1.39e+01

Pos logit contributions
 licences+13.167
 preferences+12.899
 frameworks+12.605
 finances+12.474
 ambitions+12.085
Neg logit contributions
lete-11.027
 epidemic-10.801
amaz-10.722
 unintentional-10.671
 unsuspecting-10.395
Token products
Token ID3186
Feature activation-1.01e+01

Pos logit contributions
 licences+12.029
 preferences+11.690
 frameworks+11.328
 finances+11.235
 ambitions+11.104
Neg logit contributions
lete-10.050
 epidemic-9.930
amaz-9.740
 unintentional-9.730
 unsuspecting-9.332
Token.
Token ID13
Feature activation-9.31e-01

Pos logit contributions
 licences+8.339
 preferences+8.165
abouts+7.933
 finances+7.843
 ambitions+7.740
Neg logit contributions
 unintentional-7.957
 epidemic-7.910
amaz-7.741
 unsuspecting-7.647
lete-7.555
Token Has
Token ID7875
Feature activation-9.16e+00

Pos logit contributions
 licences+0.801
 preferences+0.792
abouts+0.791
 Preferences+0.774
Politics+0.773
Neg logit contributions
 unintentional-0.763
 epidemic-0.756
amaz-0.734
lete-0.734
 involuntary-0.723
Token there
Token ID612
Feature activation-3.37e+01

Pos logit contributions
 licences+7.799
 preferences+7.704
abouts+7.700
 finances+7.558
Politics+7.553
Neg logit contributions
 unintentional-6.914
 epidemic-6.824
lete-6.696
amaz-6.618
 unsuspecting-6.602
Token been
Token ID587
Feature activation-1.48e+01

Pos logit contributions
 ambitions+21.843
abouts+21.834
 licences+21.606
 preferences+21.599
 politics+20.766
Neg logit contributions
lete-17.277
amaz-16.520
letes-16.354
ilant-15.727
 unsuspecting-15.677
Token any
Token ID597
Feature activation-1.08e+01

Pos logit contributions
 preferences+11.433
 licences+11.431
 ambitions+11.190
 finances+10.881
abouts+10.829
Neg logit contributions
lete-11.024
amaz-10.916
letes-10.376
 unintentional-10.346
 unsuspecting-10.289
Token discussion
Token ID5114
Feature activation-2.71e+01

Pos logit contributions
 licences+9.893
 preferences+9.788
 ambitions+9.554
 finances+9.445
abouts+9.369
Neg logit contributions
lete-7.295
amaz-7.176
 unintentional-6.938
 epidemic-6.857
 unsuspecting-6.719
Token that
Token ID326
Feature activation-1.51e+01

Pos logit contributions
 licences+18.117
 preferences+17.948
abouts+17.886
 ambitions+17.556
 finances+17.271
Neg logit contributions
lete-16.186
 unintentional-16.068
amaz-15.985
 unsuspecting-15.950
ilant-15.407
Token such
Token ID884
Feature activation-1.27e+01

Pos logit contributions
 licences+12.971
 preferences+12.589
 finances+12.552
abouts+12.210
 ambitions+12.159
Neg logit contributions
lete-10.659
amaz-10.246
 unintentional-10.186
 epidemic-10.136
ounding-9.868
Token who
Token ID508
Feature activation-1.69e+01

Pos logit contributions
 licences+8.041
 preferences+8.004
abouts+7.861
 ambitions+7.803
 finances+7.778
Neg logit contributions
amaz-7.658
 unintentional-7.569
 epidemic-7.367
lete-7.336
 involuntary-7.180
Token has
Token ID468
Feature activation-2.00e+01

Pos logit contributions
 finances+13.283
 licences+13.271
 preferences+12.864
abouts+12.803
 ambitions+12.775
Neg logit contributions
amaz-12.320
lete-11.862
 unintentional-11.860
 epidemic-11.769
ounding-11.363
Token no
Token ID645
Feature activation-3.05e+01

Pos logit contributions
 ambitions+17.110
 licences+17.023
 preferences+16.664
 finances+16.649
 plans+15.507
Neg logit contributions
amaz-12.327
lete-11.861
ogether-11.391
ounding-11.298
 unintentional-11.158
Token long
Token ID890
Feature activation-1.51e+01

Pos logit contributions
 ambitions+23.243
 preferences+22.839
 finances+22.828
 licences+22.475
 plans+22.002
Neg logit contributions
amaz-16.502
lete-14.955
ogether-14.949
agically-14.535
ocious-14.386
Token term
Token ID3381
Feature activation-4.29e+01

Pos logit contributions
 preferences+13.225
 ambitions+12.947
 licences+12.884
 finances+12.663
abouts+12.286
Neg logit contributions
amaz-10.404
 unintentional-10.082
lete-9.896
 epidemic-9.709
 unsuspecting-9.483
Token thinking
Token ID3612
Feature activation-3.35e+01

Pos logit contributions
 ambitions+29.512
 plans+28.437
 preferences+27.863
 finances+27.712
 interests+27.440
Neg logit contributions
amaz-19.457
lete-19.014
ogether-18.466
agically-17.526
oche-17.381
Token and
Token ID290
Feature activation-2.12e+01

Pos logit contributions
 ambitions+23.732
 preferences+23.645
 licences+22.503
 plans+22.456
 finances+22.430
Neg logit contributions
amaz-18.449
 unintentional-16.991
ogether-16.766
lete-16.639
ilant-16.259
Token a
Token ID257
Feature activation-2.02e+01

Pos logit contributions
 ambitions+17.667
 preferences+17.583
 finances+17.450
 licences+17.413
 plans+16.205
Neg logit contributions
amaz-13.367
ogether-12.315
lete-12.201
ounding-11.978
 unintentional-11.972
Token governing
Token ID15030
Feature activation-8.71e+00

Pos logit contributions
 preferences+16.332
 finances+16.252
 licences+16.177
 ambitions+16.146
 politics+15.613
Neg logit contributions
amaz-12.596
 unintentional-11.638
 epidemic-11.549
ogether-11.420
lete-11.391
Token body
Token ID1767
Feature activation-7.50e+00

Pos logit contributions
 preferences+6.811
 licences+6.776
 ambitions+6.654
 finances+6.576
 Preferences+6.319
Neg logit contributions
amaz-7.614
 unintentional-7.350
lete-7.147
 epidemic-7.098
 involuntary-7.005
Token that
Token ID326
Feature activation-2.01e+01

Pos logit contributions
 licences+6.612
 preferences+6.585
abouts+6.497
 finances+6.469
 ambitions+6.418
Neg logit contributions
amaz-5.630
 unintentional-5.565
 epidemic-5.465
lete-5.264
 involuntary-5.241
Token.
Token ID13
Feature activation-5.47e+00

Pos logit contributions
 licences+1.416
abouts+1.408
 preferences+1.397
 finances+1.368
Politics+1.365
Neg logit contributions
 unintentional-0.880
 epidemic-0.875
lete-0.830
amaz-0.829
 unsuspecting-0.824
Token Neil
Token ID15929
Feature activation-7.72e+00

Pos logit contributions
 licences+4.493
 preferences+4.474
abouts+4.472
Politics+4.376
 Preferences+4.356
Neg logit contributions
 unintentional-4.475
 epidemic-4.416
 involuntary-4.270
amaz-4.230
lete-4.215
Token Farrell
Token ID37568
Feature activation+3.16e+00

Pos logit contributions
abouts+6.180
 preferences+6.140
 Preferences+6.082
 licences+6.074
Politics+5.922
Neg logit contributions
 unintentional-6.187
 epidemic-6.091
amaz-5.966
 offender-5.916
 involuntary-5.880
Token DT
Token ID24311
Feature activation-6.66e+00

Pos logit contributions
 unintentional+2.588
 epidemic+2.567
amaz+2.504
lete+2.502
 offender+2.458
Neg logit contributions
 licences-2.711
 preferences-2.676
abouts-2.676
 finances-2.602
Politics-2.594
Token 6
Token ID718
Feature activation-1.13e+01

Pos logit contributions
 licences+3.861
 preferences+3.823
abouts+3.731
616+3.678
 Preferences+3.674
Neg logit contributions
 unintentional-6.968
 epidemic-6.961
amaz-6.818
 involuntary-6.744
lete-6.736
Token'
Token ID6
Feature activation-3.34e+01

Pos logit contributions
abouts+5.653
Politics+5.587
 preferences+5.570
 licences+5.519
616+5.499
Neg logit contributions
 unintentional-12.533
 epidemic-12.145
 unsuspecting-12.020
 involuntary-12.014
ortunately-11.886
Token4
Token ID19
Feature activation-9.44e+00

Pos logit contributions
616+15.986
5+14.551
Politics+14.237
7+14.185
70710+14.100
Neg logit contributions
ortunately-24.788
 unintentional-24.475
 eleph-24.353
 epidemic-24.332
 unsuspecting-24.288
Token/
Token ID14
Feature activation-2.32e+01

Pos logit contributions
abouts+5.189
 preferences+5.188
Politics+5.137
 licences+5.074
616+5.021
Neg logit contributions
 unintentional-10.101
 epidemic-9.853
amaz-9.758
 involuntary-9.690
 unsuspecting-9.640
Token295
Token ID25710
Feature activation-1.09e+01

Pos logit contributions
616+11.355
Politics+10.088
70710+9.874
abouts+9.583
 licences+8.820
Neg logit contributions
 unintentional-21.899
 epidemic-21.422
 unsuspecting-20.882
 involuntary-20.826
amaz-20.357
Token 3
Token ID513
Feature activation-5.61e+00

Pos logit contributions
 preferences+6.291
 licences+6.136
abouts+6.065
 Preferences+6.028
 ambitions+5.754
Neg logit contributions
 unintentional-10.752
 epidemic-10.737
 involuntary-10.566
amaz-10.500
lete-10.461
Token*
Token ID9
Feature activation-1.19e+01

Pos logit contributions
 licences+4.235
 preferences+4.227
abouts+4.134
Politics+4.034
 Preferences+4.034
Neg logit contributions
 unintentional-5.006
 epidemic-4.979
amaz-4.841
lete-4.814
 involuntary-4.776
Tokenry
Token ID563
Feature activation-3.69e+00

Pos logit contributions
 licences+1.730
abouts+1.716
 preferences+1.705
 finances+1.663
 ambitions+1.652
Neg logit contributions
 unintentional-1.762
 epidemic-1.744
 offender-1.675
amaz-1.670
lete-1.668
Tokenv
Token ID85
Feature activation-9.91e+00

Pos logit contributions
abouts+2.861
 licences+2.840
 preferences+2.795
 finances+2.745
 ambitions+2.694
Neg logit contributions
 unintentional-3.345
 epidemic-3.288
 offender-3.201
 involuntary-3.173
lete-3.166
Tokennia
Token ID18142
Feature activation-2.88e+00

Pos logit contributions
 licences+7.460
abouts+7.376
 finances+7.261
 preferences+7.220
 ambitions+7.005
Neg logit contributions
 unintentional-8.654
 epidemic-8.422
 offender-8.406
amaz-8.286
 involuntary-8.262
Token (
Token ID357
Feature activation-1.67e+01

Pos logit contributions
 licences+2.468
 preferences+2.433
abouts+2.394
 finances+2.362
 ambitions+2.344
Neg logit contributions
 unintentional-2.371
 epidemic-2.361
lete-2.311
amaz-2.305
 unsuspecting-2.252
Tokenabout
Token ID10755
Feature activation-6.27e+00

Pos logit contributions
abouts+11.727
616+10.979
 licences+10.588
Politics+10.421
 preferences+10.412
Neg logit contributions
 unintentional-14.108
 epidemic-13.731
 offender-13.573
 involuntary-13.267
lete-13.265
Token $
Token ID720
Feature activation-3.34e+01

Pos logit contributions
 licences+4.818
abouts+4.797
 preferences+4.777
 finances+4.706
 ambitions+4.593
Neg logit contributions
 unintentional-5.503
 epidemic-5.403
lete-5.348
amaz-5.303
 offender-5.251
Token115
Token ID15363
Feature activation+9.98e-01

Pos logit contributions
616+19.027
70710+16.004
Politics+15.852
abouts+15.304
7+15.043
Neg logit contributions
ortunately-24.499
 unintentional-23.427
 eleph-23.149
 unsuspecting-22.792
 involuntary-22.579
Token.
Token ID13
Feature activation-2.14e+01

Pos logit contributions
 unintentional+0.689
 epidemic+0.682
amaz+0.658
lete+0.655
 offender+0.644
Neg logit contributions
 licences-0.984
abouts-0.976
 preferences-0.975
 finances-0.951
 Preferences-0.950
Token3
Token ID18
Feature activation+5.91e+00

Pos logit contributions
616+10.356
 licences+8.763
Politics+8.659
abouts+8.581
70710+8.483
Neg logit contributions
 unintentional-20.336
 epidemic-20.137
amaz-20.036
 unsuspecting-19.978
 involuntary-19.729
Token million
Token ID1510
Feature activation-7.48e+00

Pos logit contributions
 epidemic+3.649
 unintentional+3.635
amaz+3.411
lete+3.389
 offender+3.373
Neg logit contributions
abouts-6.110
 licences-6.108
 preferences-6.039
Politics-5.961
 Preferences-5.950
Token)
Token ID8
Feature activation-5.57e+00

Pos logit contributions
 licences+4.174
 preferences+4.062
 finances+3.963
abouts+3.940
 ambitions+3.833
Neg logit contributions
 unintentional-8.078
 epidemic-8.029
lete-8.006
amaz-7.898
 unsuspecting-7.792
Token time
Token ID640
Feature activation+3.74e+00

Pos logit contributions
 unintentional+3.888
 epidemic+3.867
lete+3.747
amaz+3.733
 offender+3.691
Neg logit contributions
 licences-4.409
 preferences-4.319
abouts-4.308
 Preferences-4.249
Politics-4.237
Token of
Token ID286
Feature activation-8.97e+00

Pos logit contributions
 unintentional+2.814
 epidemic+2.801
amaz+2.702
lete+2.691
 offender+2.664
Neg logit contributions
 licences-3.431
abouts-3.395
 preferences-3.384
 Preferences-3.308
Politics-3.306
Token 1
Token ID352
Feature activation+6.42e+00

Pos logit contributions
 preferences+6.413
abouts+6.402
 licences+6.392
 ambitions+6.176
 finances+6.095
Neg logit contributions
 epidemic-7.832
amaz-7.830
lete-7.810
 unintentional-7.800
 offender-7.530
Token:
Token ID25
Feature activation-2.78e+01

Pos logit contributions
 unintentional+4.701
 epidemic+4.633
lete+4.539
amaz+4.524
 offender+4.424
Neg logit contributions
 licences-5.939
 preferences-5.849
abouts-5.846
 finances-5.726
Politics-5.721
Token54
Token ID4051
Feature activation-5.47e+00

Pos logit contributions
616+11.955
Politics+10.420
70710+9.970
abouts+9.373
27+9.344
Neg logit contributions
ortunately-28.591
 eleph-28.227
 earthqu-26.721
oreAnd-25.646
 unintentional-25.379
Token.
Token ID13
Feature activation-3.32e+01

Pos logit contributions
abouts+4.762
 licences+4.732
 preferences+4.725
 ambitions+4.575
 finances+4.528
Neg logit contributions
 epidemic-4.278
 unintentional-4.238
amaz-4.155
lete-4.086
 offender-4.041
Token234
Token ID24409
Feature activation-3.32e+00

Pos logit contributions
616+18.546
70710+15.921
Politics+15.616
abouts+14.744
7+14.743
Neg logit contributions
ortunately-23.554
 epidemic-23.116
 unintentional-22.980
 unsuspecting-22.897
 eleph-22.864
Token in
Token ID287
Feature activation-7.34e-01

Pos logit contributions
 licences+2.647
abouts+2.622
 preferences+2.620
 ambitions+2.532
616+2.525
Neg logit contributions
 epidemic-2.888
 unintentional-2.872
amaz-2.830
lete-2.773
 offender-2.750
Token his
Token ID465
Feature activation+4.74e+00

Pos logit contributions
 licences+0.604
 preferences+0.597
abouts+0.596
 finances+0.578
 Preferences+0.577
Neg logit contributions
 unintentional-0.626
 epidemic-0.624
amaz-0.611
lete-0.607
 offender-0.596
Token No
Token ID1400
Feature activation-5.62e+00

Pos logit contributions
 unintentional+3.928
 epidemic+3.868
amaz+3.729
lete+3.728
 offender+3.707
Neg logit contributions
 licences-3.950
abouts-3.938
 preferences-3.888
Politics-3.843
 finances-3.799
Token.
Token ID13
Feature activation-1.81e+01

Pos logit contributions
 licences+4.940
 preferences+4.842
abouts+4.831
 ambitions+4.716
 Preferences+4.695
Neg logit contributions
 unintentional-4.332
 epidemic-4.324
amaz-4.205
 involuntary-4.161
lete-4.132
Token miles
Token ID4608
Feature activation-4.29e+00

Pos logit contributions
 licences+3.169
 preferences+3.153
abouts+3.150
616+3.039
 ambitions+3.016
Neg logit contributions
 epidemic-4.134
 unintentional-4.124
amaz-4.082
lete-4.024
 offender-4.013
Token (
Token ID357
Feature activation-7.28e+00

Pos logit contributions
abouts+3.436
 licences+3.398
 preferences+3.378
616+3.263
 ambitions+3.262
Neg logit contributions
 unintentional-3.706
 epidemic-3.652
amaz-3.585
lete-3.553
 offender-3.528
Token149
Token ID19442
Feature activation-6.56e+00

Pos logit contributions
abouts+3.790
616+3.749
 licences+3.656
 preferences+3.557
Politics+3.545
Neg logit contributions
 unintentional-8.256
 epidemic-8.204
 offender-7.954
 involuntary-7.941
 unsuspecting-7.891
Token,
Token ID11
Feature activation-3.46e+01

Pos logit contributions
 licences+5.330
 preferences+5.266
abouts+5.257
616+5.113
 ambitions+5.112
Neg logit contributions
 unintentional-5.424
 epidemic-5.424
amaz-5.369
lete-5.284
 offender-5.224
Token597
Token ID43239
Feature activation-5.10e+00

Pos logit contributions
616+23.020
abouts+19.938
70710+19.417
 licences+18.793
Politics+18.583
Neg logit contributions
 unintentional-18.851
 epidemic-18.572
 offender-18.064
 involuntary-17.959
 unsuspecting-17.875
Token,
Token ID11
Feature activation-3.31e+01

Pos logit contributions
 licences+3.426
abouts+3.375
 preferences+3.370
 ambitions+3.265
 finances+3.228
Neg logit contributions
amaz-5.002
 epidemic-4.989
 unintentional-4.967
lete-4.863
 offender-4.834
Token870
Token ID46951
Feature activation-8.45e+00

Pos logit contributions
616+22.651
abouts+20.042
 licences+19.126
70710+19.076
 preferences+18.944
Neg logit contributions
 unintentional-17.678
 epidemic-17.336
 offender-16.854
amaz-16.791
 unsuspecting-16.707
Token kilometers
Token ID18212
Feature activation-1.09e+01

Pos logit contributions
 licences+4.488
abouts+4.393
 preferences+4.349
 ambitions+4.214
616+4.204
Neg logit contributions
amaz-9.251
 epidemic-9.169
 unintentional-9.106
lete-9.002
 offender-8.964
Token),
Token ID828
Feature activation-2.79e+00

Pos logit contributions
abouts+5.390
 licences+5.174
 preferences+5.010
616+4.943
 ambitions+4.877
Neg logit contributions
 unintentional-12.139
 epidemic-12.005
amaz-11.904
 involuntary-11.773
lete-11.766
Token according
Token ID1864
Feature activation+2.10e+00

Pos logit contributions
 licences+1.957
 preferences+1.954
abouts+1.913
 ambitions+1.869
 finances+1.867
Neg logit contributions
 unintentional-2.689
 epidemic-2.665
amaz-2.640
lete-2.622
 offender-2.581
Token to
Token ID284
Feature activation-9.42e+00

Pos logit contributions
 unintentional+1.432
 epidemic+1.424
lete+1.379
amaz+1.366
 offender+1.338
Neg logit contributions
 licences-2.089
abouts-2.056
 preferences-2.045
 finances-2.004
 ambitions-1.998
TokenUF
Token ID36820
Feature activation-4.40e+00

Pos logit contributions
Politics+6.967
abouts+6.915
616+6.860
 licences+6.787
 preferences+6.774
Neg logit contributions
 unintentional-8.723
 epidemic-8.661
 involuntary-8.323
 unsuspecting-8.240
 offender-8.202
Token Leonard
Token ID20131
Feature activation-1.72e+00

Pos logit contributions
 licences+3.569
 preferences+3.564
abouts+3.529
 Preferences+3.464
 finances+3.440
Neg logit contributions
 unintentional-3.682
 epidemic-3.668
amaz-3.577
lete-3.519
 involuntary-3.498
Token Warner
Token ID14469
Feature activation+1.02e+00

Pos logit contributions
 licences+1.370
 preferences+1.370
abouts+1.364
 Preferences+1.335
 ambitions+1.316
Neg logit contributions
 unintentional-1.498
 epidemic-1.483
amaz-1.449
lete-1.435
 involuntary-1.422
Token LB
Token ID22199
Feature activation-9.70e+00

Pos logit contributions
 unintentional+0.855
 epidemic+0.850
amaz+0.829
lete+0.825
 involuntary+0.810
Neg logit contributions
 licences-0.861
 preferences-0.852
abouts-0.851
 Preferences-0.826
 finances-0.826
Token 6
Token ID718
Feature activation-1.07e+01

Pos logit contributions
 licences+5.113
 preferences+5.103
 Preferences+4.943
abouts+4.884
616+4.881
Neg logit contributions
 epidemic-10.263
 unintentional-10.244
 involuntary-9.988
amaz-9.950
lete-9.867
Token'
Token ID6
Feature activation-3.26e+01

Pos logit contributions
abouts+5.462
 preferences+5.412
Politics+5.405
 licences+5.359
616+5.315
Neg logit contributions
 unintentional-11.701
 epidemic-11.390
 unsuspecting-11.244
 involuntary-11.236
 offender-11.082
Token4
Token ID19
Feature activation-9.63e+00

Pos logit contributions
616+15.448
5+13.897
Politics+13.777
70710+13.693
7+13.640
Neg logit contributions
ortunately-24.912
 eleph-24.554
 unintentional-24.546
 epidemic-24.448
 unsuspecting-24.366
Token/
Token ID14
Feature activation-2.21e+01

Pos logit contributions
abouts+5.228
 preferences+5.205
Politics+5.175
 licences+5.111
616+5.064
Neg logit contributions
 unintentional-10.350
 epidemic-10.104
amaz-10.004
 involuntary-9.935
 unsuspecting-9.899
Token226
Token ID24909
Feature activation-9.35e+00

Pos logit contributions
616+11.292
Politics+10.040
70710+9.803
abouts+9.699
 licences+8.930
Neg logit contributions
 unintentional-20.938
 epidemic-20.495
 involuntary-19.996
 unsuspecting-19.983
amaz-19.572
Token 4
Token ID604
Feature activation-5.17e+00

Pos logit contributions
 preferences+5.502
 licences+5.396
abouts+5.380
 Preferences+5.231
 ambitions+5.056
Neg logit contributions
 unintentional-9.405
 epidemic-9.371
 involuntary-9.155
amaz-9.150
lete-9.115
Token*
Token ID9
Feature activation-1.01e+01

Pos logit contributions
 preferences+3.578
abouts+3.575
 licences+3.560
Politics+3.490
 Preferences+3.459
Neg logit contributions
 unintentional-5.011
 epidemic-4.912
 involuntary-4.773
amaz-4.762
 unsuspecting-4.761
Token the
Token ID262
Feature activation-1.68e+00

Pos logit contributions
 unintentional+5.531
 epidemic+5.494
amaz+5.245
lete+5.236
 involuntary+5.235
Neg logit contributions
abouts-5.791
 licences-5.738
 preferences-5.616
Politics-5.584
 Preferences-5.511
Token memory
Token ID4088
Feature activation+5.57e-01

Pos logit contributions
 licences+1.191
 preferences+1.179
abouts+1.167
 finances+1.137
 ambitions+1.131
Neg logit contributions
 unintentional-1.616
 epidemic-1.608
amaz-1.590
lete-1.585
 offender-1.544
Token increases
Token ID5732
Feature activation+5.19e+00

Pos logit contributions
 unintentional+0.538
 epidemic+0.534
amaz+0.524
lete+0.522
 offender+0.514
Neg logit contributions
 licences-0.396
 preferences-0.391
abouts-0.388
 finances-0.377
 ambitions-0.375
Token to
Token ID284
Feature activation-5.44e+00

Pos logit contributions
 epidemic+3.440
 unintentional+3.423
lete+3.265
amaz+3.250
 offender+3.198
Neg logit contributions
 licences-5.093
abouts-5.080
 preferences-5.007
Politics-5.007
 Preferences-4.939
Token 1
Token ID352
Feature activation+1.33e+00

Pos logit contributions
 licences+3.646
 preferences+3.636
abouts+3.583
 ambitions+3.471
 Preferences+3.458
Neg logit contributions
 unintentional-5.297
 epidemic-5.232
amaz-5.194
lete-5.150
 offender-5.107
Token,
Token ID11
Feature activation-3.26e+01

Pos logit contributions
 unintentional+0.992
 epidemic+0.988
amaz+0.959
lete+0.959
 offender+0.936
Neg logit contributions
 licences-1.240
abouts-1.222
 preferences-1.222
 finances-1.195
 ambitions-1.187
Token264
Token ID18897
Feature activation+2.74e+00

Pos logit contributions
616+21.151
abouts+18.523
70710+17.563
Politics+17.437
 licences+17.026
Neg logit contributions
 unintentional-19.666
 epidemic-19.065
 involuntary-18.836
 unsuspecting-18.770
 offender-18.311
Tokenm
Token ID76
Feature activation-3.52e+00

Pos logit contributions
 unintentional+1.857
 epidemic+1.848
lete+1.802
amaz+1.785
 offender+1.733
Neg logit contributions
 licences-2.685
abouts-2.681
 preferences-2.662
Politics-2.626
 finances-2.613
Tokenhz
Token ID32179
Feature activation+9.19e+00

Pos logit contributions
 licences+2.684
abouts+2.672
 preferences+2.649
 finances+2.564
Politics+2.562
Neg logit contributions
 unintentional-3.205
 epidemic-3.155
lete-3.066
amaz-3.059
 involuntary-3.054
Token.
Token ID13
Feature activation+6.41e+00

Pos logit contributions
 unintentional+6.308
 epidemic+6.300
lete+6.083
amaz+6.013
 offender+5.982
Neg logit contributions
 licences-8.466
abouts-8.458
Politics-8.292
 preferences-8.273
 finances-8.182
Token The
Token ID383
Feature activation+4.50e+00

Pos logit contributions
 unintentional+4.507
 epidemic+4.451
amaz+4.342
lete+4.320
 offender+4.231
Neg logit contributions
 licences-6.048
abouts-5.970
 preferences-5.924
 finances-5.888
 ambitions-5.816
Token of
Token ID286
Feature activation-1.17e+01

Pos logit contributions
 preferences+8.443
abouts+8.347
 licences+8.342
 ambitions+8.135
 finances+8.036
Neg logit contributions
 epidemic-9.194
amaz-9.131
lete-9.059
 unintentional-9.041
 involuntary-8.841
Token possibilities
Token ID12779
Feature activation+1.31e+00

Pos logit contributions
 licences+9.576
 preferences+9.515
 frameworks+9.318
 ambitions+9.288
 boundaries+9.142
Neg logit contributions
lete-8.980
amaz-8.909
 epidemic-8.761
agically-8.453
 unintentional-8.426
Token I
Token ID314
Feature activation-4.83e+00

Pos logit contributions
 unintentional+0.939
 epidemic+0.931
amaz+0.904
lete+0.901
 offender+0.882
Neg logit contributions
 licences-1.252
 preferences-1.240
abouts-1.238
 finances-1.207
Politics-1.204
Token had
Token ID550
Feature activation-1.31e+01

Pos logit contributions
 licences+3.621
abouts+3.619
 preferences+3.611
 ambitions+3.490
 finances+3.477
Neg logit contributions
 epidemic-4.336
 unintentional-4.317
amaz-4.253
lete-4.193
 involuntary-4.163
Token never
Token ID1239
Feature activation-2.74e+01

Pos logit contributions
 preferences+10.046
abouts+9.736
 ambitions+9.708
 licences+9.572
 Preferences+9.346
Neg logit contributions
amaz-10.509
lete-10.387
 epidemic-10.373
 offender-10.045
 unintentional-9.913
Token really
Token ID1107
Feature activation-3.25e+01

Pos logit contributions
 ambitions+18.489
abouts+18.223
 licences+17.926
 frameworks+17.821
 preferences+17.612
Neg logit contributions
amaz-17.227
lete-16.126
 epidemic-16.044
agically-15.558
letes-15.487
Token considered
Token ID3177
Feature activation-1.27e+01

Pos logit contributions
 ambitions+19.817
abouts+19.742
 licences+19.217
 frameworks+19.092
 preferences+18.874
Neg logit contributions
amaz-19.533
lete-18.243
agically-18.021
ogether-17.879
 epidemic-17.838
Token before
Token ID878
Feature activation-2.16e+00

Pos logit contributions
 preferences+9.538
 licences+9.376
 ambitions+9.370
 finances+9.331
abouts+9.303
Neg logit contributions
 epidemic-10.134
lete-10.018
amaz-9.922
 unintentional-9.808
 involuntary-9.498
Token.
Token ID13
Feature activation+4.83e+00

Pos logit contributions
 licences+1.998
 preferences+1.993
abouts+1.978
 finances+1.935
 Preferences+1.925
Neg logit contributions
 unintentional-1.610
 epidemic-1.604
amaz-1.562
lete-1.556
 involuntary-1.524
Token In
Token ID554
Feature activation+7.42e+00

Pos logit contributions
 unintentional+3.684
 epidemic+3.630
amaz+3.552
lete+3.527
 offender+3.459
Neg logit contributions
 licences-4.353
abouts-4.283
 preferences-4.272
 finances-4.193
 ambitions-4.155
Token fact
Token ID1109
Feature activation-3.66e+00

Pos logit contributions
 unintentional+6.140
 epidemic+6.054
amaz+5.859
lete+5.792
 involuntary+5.791
Neg logit contributions
 licences-6.098
abouts-6.039
 preferences-5.907
Politics-5.827
 finances-5.826
Token stand
Token ID1302
Feature activation-8.26e+00

Pos logit contributions
 unintentional+1.879
 epidemic+1.869
amaz+1.822
lete+1.818
 offender+1.791
Neg logit contributions
 licences-1.549
 preferences-1.529
abouts-1.527
Politics-1.481
 finances-1.479
Token at
Token ID379
Feature activation-1.24e+01

Pos logit contributions
 licences+5.623
abouts+5.617
 preferences+5.589
 finances+5.377
 ambitions+5.372
Neg logit contributions
 unintentional-7.744
 epidemic-7.615
amaz-7.611
lete-7.607
 offender-7.500
Token 2
Token ID362
Feature activation+8.38e+00

Pos logit contributions
 licences+9.494
abouts+9.370
 preferences+9.311
 ambitions+8.983
 finances+8.894
Neg logit contributions
lete-9.766
 unintentional-9.654
amaz-9.560
 epidemic-9.382
 offender-9.324
Token,
Token ID11
Feature activation-2.97e+01

Pos logit contributions
 unintentional+6.045
 epidemic+6.028
amaz+5.769
lete+5.762
 offender+5.691
Neg logit contributions
abouts-7.408
 preferences-7.372
 licences-7.370
 finances-7.322
 Preferences-7.304
Token88
Token ID3459
Feature activation-3.02e+00

Pos logit contributions
616+19.654
abouts+17.188
 licences+16.739
70710+16.559
Politics+16.424
Neg logit contributions
 unintentional-18.181
 epidemic-17.941
 unsuspecting-17.594
 involuntary-17.520
amaz-17.350
Token,
Token ID11
Feature activation-3.23e+01

Pos logit contributions
 licences+1.263
 preferences+1.229
abouts+1.201
 finances+1.138
 ambitions+1.133
Neg logit contributions
 unintentional-3.778
 epidemic-3.778
amaz-3.744
lete-3.711
 involuntary-3.669
Token339
Token ID29626
Feature activation+1.08e+01

Pos logit contributions
616+21.735
abouts+19.053
 licences+18.485
70710+18.283
Politics+18.029
Neg logit contributions
 unintentional-17.841
 epidemic-17.646
 unsuspecting-17.072
 involuntary-17.025
amaz-16.911
Token tons
Token ID10860
Feature activation+1.48e+01

Pos logit contributions
 unintentional+8.113
 epidemic+7.953
lete+7.751
amaz+7.594
 unsuspecting+7.574
Neg logit contributions
 preferences-8.861
abouts-8.778
 Preferences-8.764
Politics-8.753
 finances-8.719
Token.
Token ID13
Feature activation+9.07e+00

Pos logit contributions
 unintentional+10.077
 epidemic+9.643
amaz+9.436
lete+9.345
 involuntary+9.282
Neg logit contributions
abouts-12.589
 Preferences-12.423
 preferences-12.401
 licences-12.373
 finances-12.364
Token The
Token ID383
Feature activation+1.49e+01

Pos logit contributions
 unintentional+6.268
 epidemic+6.162
amaz+6.097
lete+5.967
 aggravated+5.828
Neg logit contributions
 licences-8.297
abouts-8.258
 preferences-8.235
 finances-8.028
 ambitions-8.008
Token move
Token ID1445
Feature activation+1.01e+01

Pos logit contributions
 unintentional+12.668
 epidemic+12.546
 offender+12.032
 aggravated+12.002
 involuntary+11.997
Neg logit contributions
abouts-10.208
Politics-9.984
 licences-9.772
 Preferences-9.686
 preferences-9.669
Token introdu
Token ID3120
Feature activation-4.81e-01

Pos logit contributions
 preferences+9.988
 licences+9.706
 ambitions+9.401
 frameworks+9.371
 finances+9.365
Neg logit contributions
amaz-8.622
 epidemic-8.467
lete-8.450
 unintentional-8.331
agically-8.177
Tokenctions
Token ID2733
Feature activation-5.56e+00

Pos logit contributions
 licences+0.205
 preferences+0.201
abouts+0.200
 finances+0.188
 Preferences+0.187
Neg logit contributions
 unintentional-0.602
 epidemic-0.599
amaz-0.588
lete-0.587
 offender-0.581
Token describing
Token ID12059
Feature activation-2.13e+01

Pos logit contributions
 preferences+4.635
 licences+4.620
abouts+4.606
 finances+4.473
 ambitions+4.404
Neg logit contributions
 unintentional-4.559
 epidemic-4.498
amaz-4.454
lete-4.329
 involuntary-4.305
Token own
Token ID898
Feature activation-2.72e+01

Pos logit contributions
 preferences+17.878
 finances+17.238
 frameworks+17.234
 licences+17.172
 ambitions+17.051
Neg logit contributions
amaz-12.756
lete-12.354
 epidemic-11.881
 unintentional-11.844
agically-11.786
Token backgrounds
Token ID19063
Feature activation-1.67e+01

Pos logit contributions
 preferences+22.877
 ambitions+21.642
 frameworks+21.489
 finances+21.399
 licences+21.141
Neg logit contributions
lete-14.062
amaz-13.754
agically-12.816
 unintentional-12.696
 epidemic-12.682
Token and
Token ID290
Feature activation-3.23e+01

Pos logit contributions
 preferences+12.532
abouts+11.962
 licences+11.877
 finances+11.720
 Preferences+11.597
Neg logit contributions
amaz-13.075
 unintentional-12.630
 epidemic-12.499
 involuntary-12.095
 unsuspecting-12.091
Token the
Token ID262
Feature activation-2.32e+01

Pos logit contributions
 preferences+25.778
 ambitions+24.735
 finances+23.855
 interests+23.667
 frameworks+23.600
Neg logit contributions
amaz-15.702
lete-15.301
agically-14.814
ogether-13.999
 epidemic-13.662
Token projects
Token ID4493
Feature activation-2.52e+01

Pos logit contributions
 preferences+20.144
 ambitions+19.458
 licences+19.140
 frameworks+19.111
 finances+19.102
Neg logit contributions
amaz-12.709
lete-12.679
agically-11.689
 unintentional-11.652
 epidemic-11.630
Token we
Token ID356
Feature activation-1.77e+01

Pos logit contributions
 preferences+17.845
 licences+17.361
 ambitions+17.359
 finances+17.229
abouts+17.132
Neg logit contributions
amaz-16.106
lete-15.098
 epidemic-15.095
 unintentional-15.016
agically-14.675
Token were
Token ID547
Feature activation-1.31e+01

Pos logit contributions
 licences+12.587
abouts+12.443
 ambitions+12.111
 preferences+12.106
 finances+12.057
Neg logit contributions
amaz-13.936
 unintentional-13.593
 epidemic-13.551
 unsuspecting-13.106
lete-13.028
Token working
Token ID1762
Feature activation-7.20e+00

Pos logit contributions
abouts+9.505
 licences+9.347
 ambitions+9.244
 preferences+9.235
 finances+8.987
Neg logit contributions
amaz-11.007
 epidemic-10.868
 unintentional-10.765
lete-10.677
agically-10.388
Token at
Token ID379
Feature activation-6.99e+00

Pos logit contributions
 unintentional+2.155
 epidemic+2.142
amaz+2.075
lete+2.065
 offender+2.030
Neg logit contributions
 licences-2.647
 preferences-2.611
abouts-2.606
Politics-2.550
 finances-2.549
Token a
Token ID257
Feature activation-1.00e+01

Pos logit contributions
 preferences+5.359
abouts+5.322
 licences+5.318
 Preferences+5.164
 ambitions+5.150
Neg logit contributions
 unintentional-5.938
amaz-5.882
lete-5.851
 epidemic-5.831
 offender-5.674
Token speed
Token ID2866
Feature activation+3.36e+00

Pos logit contributions
 preferences+7.539
 licences+7.376
abouts+7.314
 ambitions+7.123
 finances+7.121
Neg logit contributions
amaz-8.347
 epidemic-8.201
lete-8.161
 unintentional-8.114
 offender-7.856
Token of
Token ID286
Feature activation-7.80e+00

Pos logit contributions
 unintentional+2.525
 epidemic+2.509
amaz+2.414
lete+2.403
 offender+2.374
Neg logit contributions
 licences-3.102
 preferences-3.059
abouts-3.050
Politics-2.996
 finances-2.992
Token 2
Token ID362
Feature activation+2.99e+00

Pos logit contributions
abouts+6.184
 preferences+6.122
 licences+6.067
 ambitions+5.912
 finances+5.882
Neg logit contributions
lete-6.349
 unintentional-6.272
 epidemic-6.230
amaz-6.202
 offender-6.070
Token,
Token ID11
Feature activation-3.22e+01

Pos logit contributions
 unintentional+2.262
 epidemic+2.238
amaz+2.164
lete+2.163
 offender+2.125
Neg logit contributions
 licences-2.736
 preferences-2.703
abouts-2.699
 finances-2.641
Politics-2.636
Token658
Token ID38431
Feature activation+4.67e+00

Pos logit contributions
616+20.935
abouts+18.338
70710+17.819
 licences+17.216
Politics+17.208
Neg logit contributions
 unintentional-18.557
 epidemic-18.376
amaz-17.890
 unsuspecting-17.817
 involuntary-17.790
Token meters
Token ID10700
Feature activation+2.92e+00

Pos logit contributions
 unintentional+4.674
 epidemic+4.625
lete+4.504
amaz+4.480
 involuntary+4.460
Neg logit contributions
 licences-3.108
abouts-3.065
 preferences-3.052
Politics-2.990
 finances-2.954
Token per
Token ID583
Feature activation+1.02e+01

Pos logit contributions
 unintentional+2.230
 epidemic+2.211
amaz+2.137
lete+2.134
 offender+2.096
Neg logit contributions
 licences-2.659
 preferences-2.628
abouts-2.626
Politics-2.567
 finances-2.565
Token second
Token ID1218
Feature activation+5.73e+00

Pos logit contributions
 epidemic+6.581
 unintentional+6.539
 involuntary+6.239
lete+6.227
 unsuspecting+6.102
Neg logit contributions
 licences-10.050
Politics-9.942
 preferences-9.893
abouts-9.878
 finances-9.739
Token (
Token ID357
Feature activation-1.60e+01

Pos logit contributions
 unintentional+4.418
 epidemic+4.380
lete+4.225
amaz+4.188
 offender+4.146
Neg logit contributions
 licences-5.109
abouts-5.024
 preferences-5.013
Politics-4.971
 finances-4.931
Token meters
Token ID10700
Feature activation-3.21e+00

Pos logit contributions
 unintentional+3.462
 epidemic+3.435
lete+3.348
amaz+3.338
 involuntary+3.304
Neg logit contributions
 licences-2.337
abouts-2.301
 preferences-2.296
Politics-2.231
 finances-2.222
Token per
Token ID583
Feature activation+1.07e+01

Pos logit contributions
 licences+2.672
 preferences+2.656
abouts+2.636
 ambitions+2.573
 finances+2.552
Neg logit contributions
 unintentional-2.646
 epidemic-2.626
amaz-2.612
lete-2.594
 offender-2.528
Token second
Token ID1218
Feature activation+2.47e-01

Pos logit contributions
 unintentional+6.849
 epidemic+6.837
 involuntary+6.512
lete+6.489
 unsuspecting+6.354
Neg logit contributions
 licences-10.493
Politics-10.378
abouts-10.340
 preferences-10.286
 finances-10.145
Token (
Token ID357
Feature activation-1.52e+01

Pos logit contributions
 unintentional+0.205
 epidemic+0.204
amaz+0.200
lete+0.199
 offender+0.195
Neg logit contributions
 licences-0.208
 preferences-0.206
abouts-0.206
 finances-0.200
 Preferences-0.199
Token4
Token ID19
Feature activation+2.31e+00

Pos logit contributions
616+10.817
abouts+10.560
 preferences+10.172
 licences+10.124
Politics+9.991
Neg logit contributions
 unintentional-12.457
 epidemic-12.410
 offender-12.069
 unsuspecting-12.006
 involuntary-11.944
Token,
Token ID11
Feature activation-3.22e+01

Pos logit contributions
 unintentional+1.739
 epidemic+1.724
amaz+1.672
lete+1.671
 offender+1.637
Neg logit contributions
 licences-2.125
abouts-2.098
 preferences-2.093
 finances-2.049
Politics-2.043
Token471
Token ID38339
Feature activation-5.14e-01

Pos logit contributions
616+21.262
abouts+18.977
70710+17.816
Politics+17.770
 licences+17.614
Neg logit contributions
 unintentional-18.394
 epidemic-18.101
 involuntary-17.544
 unsuspecting-17.458
amaz-17.432
Token miles
Token ID4608
Feature activation-6.22e+00

Pos logit contributions
 licences+0.370
 preferences+0.365
abouts+0.365
 finances+0.352
 ambitions+0.352
Neg logit contributions
 unintentional-0.490
 epidemic-0.489
amaz-0.479
lete-0.477
 offender-0.470
Token per
Token ID583
Feature activation+1.30e+01

Pos logit contributions
 licences+4.792
 preferences+4.737
abouts+4.722
 ambitions+4.614
 finances+4.511
Neg logit contributions
 unintentional-5.371
 epidemic-5.318
amaz-5.281
lete-5.253
 offender-5.118
Token hour
Token ID1711
Feature activation-1.16e+01

Pos logit contributions
 epidemic+4.765
 unintentional+4.614
 unsuspecting+4.184
 hour+4.157
 involuntary+4.119
Neg logit contributions
Politics-15.979
 licences-15.903
abouts-15.736
 preferences-15.611
 Preferences-15.549
Token),
Token ID828
Feature activation-1.57e+00

Pos logit contributions
 preferences+6.079
abouts+6.074
 licences+6.025
616+5.743
 ambitions+5.701
Neg logit contributions
 unintentional-12.043
amaz-11.920
 epidemic-11.894
lete-11.750
 involuntary-11.636
Token per
Token ID583
Feature activation+1.49e+01

Pos logit contributions
 epidemic+10.880
 unintentional+10.795
 involuntary+10.538
 offender+10.454
 aggravated+10.354
Neg logit contributions
abouts-8.611
Politics-8.330
 Preferences-8.173
 licences-7.959
 preferences-7.950
Token cent
Token ID1247
Feature activation+7.69e-01

Pos logit contributions
 epidemic+9.691
 unintentional+9.326
lete+9.202
 offender+9.148
 involuntary+8.990
Neg logit contributions
abouts-13.641
Politics-13.223
 Preferences-13.094
 licences-12.978
 preferences-12.973
Token of
Token ID286
Feature activation+6.07e+00

Pos logit contributions
 unintentional+0.580
 epidemic+0.575
lete+0.557
amaz+0.556
 offender+0.548
Neg logit contributions
 licences-0.706
abouts-0.700
 preferences-0.699
 finances-0.682
 Preferences-0.682
Token the
Token ID262
Feature activation-3.21e+00

Pos logit contributions
 unintentional+5.072
 epidemic+5.049
 involuntary+4.867
 offender+4.844
lete+4.832
Neg logit contributions
abouts-4.928
 licences-4.823
 preferences-4.816
Politics-4.795
 Preferences-4.750
Token 9
Token ID860
Feature activation+1.86e-01

Pos logit contributions
 licences+2.962
 preferences+2.920
abouts+2.874
 finances+2.845
 ambitions+2.810
Neg logit contributions
 unintentional-2.426
amaz-2.405
 epidemic-2.398
lete-2.365
 unsuspecting-2.278
Token,
Token ID11
Feature activation-3.21e+01

Pos logit contributions
 unintentional+0.180
 epidemic+0.178
amaz+0.175
lete+0.174
 offender+0.172
Neg logit contributions
 licences-0.133
 preferences-0.131
abouts-0.130
 finances-0.126
Politics-0.126
Token204
Token ID18638
Feature activation+5.62e+00

Pos logit contributions
616+21.437
abouts+18.652
 licences+18.391
 preferences+17.954
70710+17.934
Neg logit contributions
 unintentional-17.747
 epidemic-17.533
amaz-17.465
 unsuspecting-17.287
 offender-17.056
Token people
Token ID661
Feature activation-7.76e+00

Pos logit contributions
 unintentional+6.273
 epidemic+6.229
 involuntary+6.094
lete+6.054
 offender+6.042
Neg logit contributions
abouts-3.002
 licences-2.928
 preferences-2.923
Politics-2.906
 Preferences-2.874
Token employed
Token ID9322
Feature activation-7.75e+00

Pos logit contributions
 licences+6.339
 preferences+6.209
abouts+6.076
 finances+6.027
 ambitions+5.943
Neg logit contributions
amaz-6.358
 unintentional-6.320
 epidemic-6.285
lete-6.222
 unsuspecting-6.042
Token in
Token ID287
Feature activation-2.62e+00

Pos logit contributions
 licences+5.934
 preferences+5.802
abouts+5.789
 finances+5.592
 ambitions+5.542
Neg logit contributions
 unintentional-6.803
amaz-6.687
lete-6.668
 epidemic-6.662
 unsuspecting-6.466
Token that
Token ID326
Feature activation-4.74e+00

Pos logit contributions
 licences+2.276
 preferences+2.234
abouts+2.209
 finances+2.187
 ambitions+2.152
Neg logit contributions
 unintentional-2.128
 epidemic-2.108
amaz-2.087
lete-2.081
 offender-2.012
Token put
Token ID1234
Feature activation-7.78e+00

Pos logit contributions
 ambitions+17.049
 preferences+17.013
 licences+16.624
 finances+16.302
 politics+15.788
Neg logit contributions
amaz-11.704
lete-11.415
ogether-11.187
ounding-10.763
 epidemic-10.515
Token the
Token ID262
Feature activation-1.39e+01

Pos logit contributions
 preferences+6.840
 licences+6.773
 finances+6.578
abouts+6.570
 ambitions+6.557
Neg logit contributions
 unintentional-5.842
amaz-5.835
 epidemic-5.747
lete-5.652
 offender-5.509
Token unity
Token ID14111
Feature activation-2.15e+01

Pos logit contributions
 preferences+12.271
 licences+12.089
 finances+11.885
 ambitions+11.772
 politics+11.306
Neg logit contributions
amaz-9.756
 unintentional-9.291
lete-9.220
 epidemic-8.865
 involuntary-8.700
Token of
Token ID286
Feature activation-6.19e+00

Pos logit contributions
 preferences+16.081
 ambitions+15.892
 licences+15.393
 finances+15.366
 politics+15.331
Neg logit contributions
 unintentional-14.283
amaz-14.200
lete-13.895
ogether-13.444
 unsuspecting-13.274
Token the
Token ID262
Feature activation-6.72e+00

Pos logit contributions
 preferences+5.145
 licences+5.140
 finances+5.024
abouts+4.947
 ambitions+4.943
Neg logit contributions
 unintentional-5.025
 epidemic-4.978
amaz-4.894
lete-4.827
 involuntary-4.752
Token country
Token ID1499
Feature activation-3.19e+01

Pos logit contributions
 preferences+5.333
 licences+5.312
 finances+5.213
 ambitions+5.141
abouts+5.119
Neg logit contributions
 unintentional-5.661
amaz-5.660
 epidemic-5.553
lete-5.511
 involuntary-5.361
Token at
Token ID379
Feature activation-1.29e+01

Pos logit contributions
 preferences+20.668
 finances+20.605
abouts+20.401
 ambitions+20.320
 politics+20.211
Neg logit contributions
amaz-17.270
 unintentional-16.993
 epidemic-16.400
reated-15.940
 offender-15.896
Token risk
Token ID2526
Feature activation-1.35e+00

Pos logit contributions
 preferences+9.161
 licences+8.813
 finances+8.634
 ambitions+8.612
Politics+8.567
Neg logit contributions
 unintentional-10.468
amaz-10.327
lete-10.262
 unsuspecting-9.958
 epidemic-9.952
Token with
Token ID351
Feature activation-4.12e+00

Pos logit contributions
 licences+1.208
 preferences+1.203
abouts+1.202
 finances+1.170
Politics+1.165
Neg logit contributions
 unintentional-1.043
 epidemic-1.036
amaz-1.015
lete-1.007
 offender-0.988
Token his
Token ID465
Feature activation-4.49e+00

Pos logit contributions
 preferences+3.241
 licences+3.228
 ambitions+3.146
 finances+3.112
abouts+3.099
Neg logit contributions
 unintentional-3.585
 epidemic-3.563
amaz-3.503
lete-3.493
 offender-3.434
Token
Token ID564
Feature activation-1.79e+00

Pos logit contributions
 preferences+3.882
 licences+3.845
 ambitions+3.800
 finances+3.750
abouts+3.680
Neg logit contributions
 unintentional-3.555
lete-3.553
amaz-3.540
 epidemic-3.540
 offender-3.386
Token I
Token ID314
Feature activation+2.06e+00

Pos logit contributions
 licences+3.014
 preferences+3.004
abouts+2.972
 finances+2.902
 ambitions+2.889
Neg logit contributions
 unintentional-3.634
lete-3.602
 epidemic-3.597
amaz-3.591
 offender-3.466
Token'm
Token ID1101
Feature activation-5.84e+00

Pos logit contributions
 unintentional+1.767
 epidemic+1.755
lete+1.707
amaz+1.706
 offender+1.678
Neg logit contributions
 licences-1.685
 preferences-1.665
abouts-1.661
Politics-1.614
 Preferences-1.614
Token not
Token ID407
Feature activation-4.78e+00

Pos logit contributions
 preferences+4.781
 licences+4.754
abouts+4.716
 finances+4.639
 ambitions+4.638
Neg logit contributions
amaz-4.758
 epidemic-4.696
 unintentional-4.690
lete-4.658
 offender-4.554
Token talking
Token ID3375
Feature activation-8.26e+00

Pos logit contributions
 licences+3.541
 preferences+3.537
abouts+3.473
 finances+3.414
 ambitions+3.405
Neg logit contributions
amaz-4.310
 unintentional-4.306
lete-4.270
 epidemic-4.232
 offender-4.091
Token about
Token ID546
Feature activation-1.12e+00

Pos logit contributions
 licences+5.997
 preferences+5.975
 finances+5.831
 ambitions+5.817
abouts+5.812
Neg logit contributions
amaz-7.389
 unintentional-7.363
lete-7.321
 epidemic-7.142
 offender-6.963
Token an
Token ID281
Feature activation+3.45e+01

Pos logit contributions
 licences+0.976
 preferences+0.968
abouts+0.950
 finances+0.944
 ambitions+0.936
Neg logit contributions
 unintentional-0.902
 epidemic-0.891
lete-0.880
amaz-0.878
 offender-0.855
Token animal
Token ID5044
Feature activation+1.63e+00

Pos logit contributions
 epidemic+27.106
 adult+25.908
 elderly+25.781
 unintentional+25.655
 unsuspecting+25.327
Neg logit contributions
abouts-16.535
ヴァ-16.376
 Preferences-15.678
 licences-15.567
Politics-15.245
Token,"
Token ID553
Feature activation-5.75e+00

Pos logit contributions
 unintentional+1.323
 epidemic+1.311
amaz+1.280
lete+1.276
 offender+1.251
Neg logit contributions
 licences-1.412
 preferences-1.397
abouts-1.394
 finances-1.357
Politics-1.351
Token she
Token ID673
Feature activation+7.42e+00

Pos logit contributions
 licences+3.608
 preferences+3.591
abouts+3.545
 finances+3.440
 ambitions+3.429
Neg logit contributions
 unintentional-5.748
 epidemic-5.688
amaz-5.678
lete-5.667
 offender-5.537
Token said
Token ID531
Feature activation+5.57e+00

Pos logit contributions
 unintentional+4.948
 epidemic+4.908
lete+4.710
 aggravated+4.672
amaz+4.637
Neg logit contributions
 licences-7.188
 Preferences-7.172
Politics-7.160
 preferences-7.157
abouts-7.113
Token.
Token ID13
Feature activation+2.20e-01

Pos logit contributions
 unintentional+3.861
 epidemic+3.793
amaz+3.633
lete+3.619
 involuntary+3.611
Neg logit contributions
 licences-5.377
abouts-5.325
 preferences-5.315
 Preferences-5.270
Politics-5.260
Token
Token ID198
Feature activation-3.87e-01

Pos logit contributions
 unintentional+2.633
 epidemic+2.614
amaz+2.550
lete+2.519
 offender+2.496
Neg logit contributions
 licences-3.184
 preferences-3.152
abouts-3.149
 ambitions-3.062
 finances-3.062
Token
Token ID198
Feature activation+1.87e+00

Pos logit contributions
 licences+0.347
 preferences+0.343
abouts+0.343
 finances+0.334
 Preferences+0.333
Neg logit contributions
 unintentional-0.303
 epidemic-0.300
amaz-0.292
lete-0.291
 offender-0.286
TokenPresident
Token ID10364
Feature activation-1.44e-01

Pos logit contributions
 unintentional+1.737
 epidemic+1.726
amaz+1.690
lete+1.685
 offender+1.657
Neg logit contributions
 licences-1.393
 preferences-1.374
abouts-1.372
 finances-1.329
 Preferences-1.324
Token Obama
Token ID2486
Feature activation+1.30e+00

Pos logit contributions
 licences+0.122
 preferences+0.121
abouts+0.120
 Preferences+0.117
 ambitions+0.117
Neg logit contributions
 epidemic-0.118
 unintentional-0.118
amaz-0.115
lete-0.114
 offender-0.112
Token issued
Token ID4884
Feature activation+3.93e+00

Pos logit contributions
 unintentional+1.099
 epidemic+1.090
amaz+1.065
lete+1.061
 offender+1.042
Neg logit contributions
 licences-1.083
 preferences-1.071
abouts-1.068
 finances-1.039
Politics-1.034
Token an
Token ID281
Feature activation+3.61e+01

Pos logit contributions
 unintentional+3.135
 epidemic+3.117
amaz+3.019
lete+3.002
 involuntary+2.973
Neg logit contributions
abouts-3.412
 licences-3.406
 preferences-3.369
Politics-3.310
 finances-3.289
Token executive
Token ID4640
Feature activation+1.06e+01

Pos logit contributions
 epidemic+24.456
 unintentional+24.292
 involuntary+23.356
 aggravated+23.066
 offender+22.555
Neg logit contributions
abouts-20.629
616-18.714
 favourites-18.550
ヴァ-18.515
 licences-18.447
Token order
Token ID1502
Feature activation+1.19e+01

Pos logit contributions
 epidemic+6.423
 unintentional+6.343
 offender+6.064
amaz+5.984
 involuntary+5.970
Neg logit contributions
abouts-10.966
 licences-10.659
Politics-10.598
 Preferences-10.528
616-10.510
Token in
Token ID287
Feature activation+6.67e-01

Pos logit contributions
 unintentional+9.038
 epidemic+8.976
amaz+8.822
 involuntary+8.655
 aggravated+8.612
Neg logit contributions
abouts-9.783
Politics-9.647
 licences-9.639
 preferences-9.509
616-9.458
Token 2011
Token ID2813
Feature activation+2.51e+01

Pos logit contributions
 unintentional+0.470
 epidemic+0.466
amaz+0.452
lete+0.451
 offender+0.441
Neg logit contributions
 licences-0.649
 preferences-0.642
abouts-0.641
 finances-0.626
 Preferences-0.624
Token design
Token ID1486
Feature activation+1.02e+01

Pos logit contributions
 aggravated+18.005
 epidemic+17.847
amaz+17.808
 involuntary+17.792
 unintentional+17.653
Neg logit contributions
Politics-15.084
abouts-15.035
 Preferences-14.659
616-14.390
 licences-14.043
Token lose
Token ID4425
Feature activation-8.01e+00

Pos logit contributions
 preferences+13.567
 ambitions+13.475
abouts+12.997
 finances+12.862
 licences+12.807
Neg logit contributions
amaz-13.446
lete-13.384
 unintentional-12.934
 epidemic-12.800
ounding-12.356
Token,
Token ID11
Feature activation-1.52e+00

Pos logit contributions
 preferences+6.760
 licences+6.613
 ambitions+6.594
abouts+6.585
 finances+6.473
Neg logit contributions
 unintentional-6.292
 epidemic-6.229
amaz-6.225
lete-6.090
 offender-5.981
Token but
Token ID475
Feature activation+8.94e-02

Pos logit contributions
 licences+1.328
 preferences+1.324
abouts+1.312
 finances+1.288
 ambitions+1.283
Neg logit contributions
 unintentional-1.205
 epidemic-1.195
amaz-1.183
lete-1.171
 offender-1.143
Token it
Token ID340
Feature activation+6.97e+00

Pos logit contributions
 unintentional+0.081
 epidemic+0.081
amaz+0.080
lete+0.079
 offender+0.078
Neg logit contributions
 licences-0.068
 preferences-0.068
abouts-0.067
 finances-0.066
 ambitions-0.065
Token is
Token ID318
Feature activation+9.57e+00

Pos logit contributions
 unintentional+5.580
 epidemic+5.539
 aggravated+5.314
 involuntary+5.261
 offender+5.260
Neg logit contributions
 licences-5.862
abouts-5.815
 Preferences-5.784
 preferences-5.777
Politics-5.746
Token an
Token ID281
Feature activation+4.15e+01

Pos logit contributions
 unintentional+8.125
 epidemic+7.918
 involuntary+7.585
 unsuspecting+7.487
 aggravated+7.485
Neg logit contributions
abouts-7.717
 licences-7.648
 Preferences-7.478
 preferences-7.472
616-7.271
Token "
Token ID366
Feature activation+3.20e+01

Pos logit contributions
 unintentional+28.801
 epidemic+27.964
 astounding+27.180
 involuntary+26.944
 elderly+26.364
Neg logit contributions
ヴァ-20.297
abouts-19.315
 Preferences-18.206
 licences-17.969
krit-17.920
Tokenif
Token ID361
Feature activation-2.82e+00

Pos logit contributions
amaz+23.725
 epidemic+23.294
 unintentional+22.990
ounding+22.153
 involuntary+22.023
Neg logit contributions
 licences-16.370
 Preferences-15.444
 preferences-15.425
 finances-15.352
 permissions-14.532
Token"
Token ID1
Feature activation+1.53e+01

Pos logit contributions
 preferences+2.214
abouts+2.208
 licences+2.205
 finances+2.143
 ambitions+2.142
Neg logit contributions
 unintentional-2.464
 epidemic-2.461
amaz-2.421
lete-2.411
 offender-2.358
Token and
Token ID290
Feature activation+3.73e+00

Pos logit contributions
 epidemic+13.234
 unintentional+13.180
 offender+12.566
 addict+12.504
 involuntary+12.467
Neg logit contributions
 licences-10.459
abouts-10.424
 Preferences-10.100
 preferences-9.840
616-9.838
Token people
Token ID661
Feature activation-7.84e+00

Pos logit contributions
 unintentional+3.510
 epidemic+3.480
lete+3.374
amaz+3.364
 offender+3.338
Neg logit contributions
 licences-2.730
abouts-2.701
 preferences-2.674
 Preferences-2.602
Politics-2.592
Token in
Token ID287
Feature activation+4.56e+00

Pos logit contributions
 unintentional+5.556
 epidemic+5.452
lete+5.299
amaz+5.235
 involuntary+5.228
Neg logit contributions
 licences-5.372
abouts-5.302
 preferences-5.274
Politics-5.169
 Preferences-5.164
Token which
Token ID543
Feature activation-1.31e-01

Pos logit contributions
 unintentional+3.400
 epidemic+3.374
lete+3.243
amaz+3.226
 involuntary+3.206
Neg logit contributions
 licences-4.207
abouts-4.199
 preferences-4.161
Politics-4.072
 Preferences-4.054
Token Sed
Token ID22710
Feature activation+9.37e+00

Pos logit contributions
 licences+0.081
 preferences+0.080
abouts+0.080
 finances+0.077
 ambitions+0.077
Neg logit contributions
 unintentional-0.137
 epidemic-0.136
amaz-0.134
lete-0.134
 offender-0.132
Tokenaris
Token ID20066
Feature activation+8.13e+00

Pos logit contributions
 epidemic+7.160
lete+7.076
amaz+7.067
 unintentional+7.040
 addict+6.815
Neg logit contributions
 licences-7.608
 preferences-7.575
 Preferences-7.485
abouts-7.406
 finances-7.260
Token played
Token ID2826
Feature activation+2.96e+00

Pos logit contributions
 unintentional+6.996
 epidemic+6.924
lete+6.737
 aggravated+6.707
amaz+6.624
Neg logit contributions
 licences-6.282
 preferences-6.188
abouts-6.184
 Preferences-6.144
Politics-6.097
Token an
Token ID281
Feature activation+4.58e+01

Pos logit contributions
 unintentional+2.355
 epidemic+2.341
amaz+2.259
lete+2.257
 involuntary+2.224
Neg logit contributions
 licences-2.608
abouts-2.579
 preferences-2.574
Politics-2.504
 Preferences-2.503
Token adult
Token ID4044
Feature activation+1.63e+01

Pos logit contributions
 elderly+30.533
 addict+30.419
 epidemic+30.399
 unsuspecting+30.251
 unintentional+30.128
Neg logit contributions
 licences-17.274
abouts-16.746
ヴァ-16.541
 Preferences-16.466
 preferences-16.157
Token woman
Token ID2415
Feature activation+5.42e+00

Pos logit contributions
 epidemic+14.240
 addict+13.909
 offender+13.788
 unintentional+13.722
 involuntary+13.371
Neg logit contributions
 licences-11.304
abouts-10.850
 Preferences-10.557
Politics-10.552
 preferences-10.383
Token who
Token ID508
Feature activation-1.40e+00

Pos logit contributions
 epidemic+4.380
 unintentional+4.374
lete+4.222
amaz+4.172
 offender+4.166
Neg logit contributions
 licences-4.640
 preferences-4.540
abouts-4.537
Politics-4.484
 Preferences-4.483
Token returns
Token ID5860
Feature activation+6.43e+00

Pos logit contributions
 licences+1.178
 preferences+1.168
abouts+1.159
 finances+1.142
 ambitions+1.131
Neg logit contributions
 unintentional-1.163
 epidemic-1.153
amaz-1.144
lete-1.133
 offender-1.107
Token to
Token ID284
Feature activation-3.81e+00

Pos logit contributions
 unintentional+5.529
 epidemic+5.455
 unsuspecting+5.279
amaz+5.237
 offender+5.217
Neg logit contributions
 licences-5.129
abouts-5.122
 preferences-5.024
Politics-5.020
 Preferences-4.964
Token
Token ID447
Feature activation-1.75e+00

Pos logit contributions
 licences+2.251
 preferences+2.250
abouts+2.244
Politics+2.239
 Preferences+2.196
Neg logit contributions
 unintentional-3.485
 epidemic-3.471
lete-3.372
 involuntary-3.364
amaz-3.362
Token
Token ID250
Feature activation+8.50e+00

Pos logit contributions
 licences+1.475
 preferences+1.472
abouts+1.455
Politics+1.438
 Preferences+1.429
Neg logit contributions
 unintentional-1.453
 epidemic-1.438
amaz-1.401
lete-1.385
 unsuspecting-1.376
TokenI
Token ID40
Feature activation+6.37e+00

Pos logit contributions
 unintentional+8.388
 epidemic+8.351
amaz+8.334
lete+8.324
 offender+8.020
Neg logit contributions
 licences-5.479
 preferences-5.325
abouts-5.193
 finances-5.160
 ambitions-5.124
Token,
Token ID11
Feature activation+5.62e+00

Pos logit contributions
 unintentional+5.708
 epidemic+5.665
amaz+5.579
lete+5.543
 offender+5.461
Neg logit contributions
 licences-4.798
 preferences-4.699
abouts-4.668
 Preferences-4.596
 finances-4.543
Token as
Token ID355
Feature activation+7.91e+00

Pos logit contributions
 unintentional+4.989
 epidemic+4.938
amaz+4.831
lete+4.814
 offender+4.741
Neg logit contributions
 licences-4.403
 preferences-4.279
abouts-4.276
 finances-4.167
 Preferences-4.152
Token an
Token ID281
Feature activation+4.94e+01

Pos logit contributions
 unintentional+6.441
 epidemic+6.337
lete+6.105
amaz+6.056
 unsuspecting+6.025
Neg logit contributions
 licences-6.710
abouts-6.537
 preferences-6.503
 Preferences-6.366
 finances-6.337
Token American
Token ID1605
Feature activation+2.57e+01

Pos logit contributions
 adult+29.299
 elderly+28.912
 unintentional+28.857
 educator+28.550
 American+28.370
Neg logit contributions
 licences-21.488
ヴァ-20.407
abouts-18.803
 favourites-18.567
 Preferences-18.493
Token,
Token ID11
Feature activation+2.93e+00

Pos logit contributions
 adult+18.407
 offender+18.346
 unintentional+18.341
 elderly+18.210
 epidemic+18.132
Neg logit contributions
 licences-16.313
abouts-14.868
 Preferences-14.461
 ambitions-14.404
 finances-14.357
Token oppose
Token ID12856
Feature activation+6.68e+00

Pos logit contributions
 unintentional+2.675
 epidemic+2.656
amaz+2.595
lete+2.587
 offender+2.547
Neg logit contributions
 licences-2.241
abouts-2.206
 preferences-2.202
 finances-2.132
 Preferences-2.129
Token what
Token ID644
Feature activation-1.14e+01

Pos logit contributions
 unintentional+6.027
 epidemic+5.970
lete+5.777
 involuntary+5.745
amaz+5.738
Neg logit contributions
 licences-5.012
abouts-4.997
 preferences-4.913
 Preferences-4.833
 finances-4.763
Token my
Token ID616
Feature activation-4.36e+00

Pos logit contributions
abouts+9.021
 preferences+9.011
 licences+8.845
Politics+8.793
 ambitions+8.776
Neg logit contributions
 unintentional-8.663
lete-8.596
 epidemic-8.586
amaz-8.481
 involuntary-8.307
Token do
Token ID466
Feature activation-1.51e-01

Pos logit contributions
 unintentional+5.344
 epidemic+5.299
lete+5.173
amaz+5.167
 offender+5.111
Neg logit contributions
 licences-4.346
abouts-4.233
 preferences-4.226
 Preferences-4.117
 finances-4.081
Token know
Token ID760
Feature activation+3.02e+00

Pos logit contributions
 licences+0.109
 preferences+0.109
abouts+0.108
 finances+0.105
 ambitions+0.104
Neg logit contributions
 unintentional-0.143
 epidemic-0.142
amaz-0.139
lete-0.139
 offender-0.136
Token that
Token ID326
Feature activation-9.48e-01

Pos logit contributions
 unintentional+2.344
 epidemic+2.322
amaz+2.248
lete+2.241
 offender+2.206
Neg logit contributions
 licences-2.721
abouts-2.684
 preferences-2.676
 finances-2.606
 Preferences-2.604
Token it
Token ID340
Feature activation+6.13e+00

Pos logit contributions
 licences+0.778
 preferences+0.777
abouts+0.768
 finances+0.753
 ambitions+0.747
Neg logit contributions
 unintentional-0.801
 epidemic-0.799
amaz-0.786
lete-0.783
 offender-0.764
Token can
Token ID460
Feature activation+1.44e+00

Pos logit contributions
 unintentional+5.273
 epidemic+5.211
lete+5.041
amaz+5.041
 aggravated+4.998
Neg logit contributions
 licences-4.902
abouts-4.812
 preferences-4.789
 Preferences-4.694
Politics-4.670
Token be
Token ID307
Feature activation+8.13e+00

Pos logit contributions
 unintentional+1.361
 epidemic+1.352
amaz+1.323
lete+1.319
 offender+1.298
Neg logit contributions
 licences-1.053
 preferences-1.040
abouts-1.038
 finances-1.004
Politics-1.000
Token rough
Token ID5210
Feature activation-2.68e-02

Pos logit contributions
 unintentional+6.271
 epidemic+6.114
 involuntary+5.776
 addict+5.771
 aggravated+5.765
Neg logit contributions
abouts-7.300
 licences-7.292
 preferences-7.112
 Preferences-7.087
Politics-6.985
Token for
Token ID329
Feature activation+1.41e+01

Pos logit contributions
 licences+0.023
 preferences+0.023
abouts+0.023
 finances+0.023
Politics+0.022
Neg logit contributions
 unintentional-0.022
 epidemic-0.021
amaz-0.021
lete-0.021
 offender-0.020
Token mixed
Token ID7668
Feature activation+7.43e+00

Pos logit contributions
 unintentional+11.197
 epidemic+10.651
 unsuspecting+10.567
 elderly+10.339
 involuntary+10.222
Neg logit contributions
abouts-10.948
 licences-10.945
 Preferences-10.602
616-10.556
Politics-10.553
Token people
Token ID661
Feature activation-2.95e+00

Pos logit contributions
 unintentional+6.138
 epidemic+6.031
 unsuspecting+5.795
lete+5.774
 offender+5.774
Neg logit contributions
 licences-6.093
abouts-6.049
Politics-5.921
 preferences-5.916
 Preferences-5.879
Token because
Token ID780
Feature activation-2.97e+00

Pos logit contributions
 preferences+2.545
abouts+2.541
 licences+2.530
 finances+2.475
 ambitions+2.446
Neg logit contributions
 unintentional-2.346
 epidemic-2.335
amaz-2.313
lete-2.297
 involuntary-2.233
Tokenune
Token ID1726
Feature activation-8.27e+00

Pos logit contributions
lete+8.108
amaz+7.780
 epidemic+7.425
agically+7.338
 unintentional+7.235
Neg logit contributions
 licences-17.156
 levers-16.245
 Preferences-16.232
 preferences-16.105
 finances-16.045
Token on
Token ID319
Feature activation-4.14e+00

Pos logit contributions
 preferences+6.162
abouts+6.061
 licences+6.049
 finances+5.993
 Preferences+5.867
Neg logit contributions
 unintentional-7.163
amaz-7.060
 epidemic-7.054
lete-6.873
 involuntary-6.855
Token his
Token ID465
Feature activation-2.70e+00

Pos logit contributions
 preferences+3.264
 licences+3.237
abouts+3.221
 Preferences+3.155
 finances+3.146
Neg logit contributions
 unintentional-3.581
 epidemic-3.559
amaz-3.537
lete-3.464
 unsuspecting-3.390
Token list
Token ID1351
Feature activation+4.97e+00

Pos logit contributions
 preferences+2.090
 licences+2.050
abouts+2.042
 Preferences+2.005
 finances+2.003
Neg logit contributions
 unintentional-2.404
 epidemic-2.391
amaz-2.384
lete-2.339
 involuntary-2.286
Token of
Token ID286
Feature activation+8.44e+00

Pos logit contributions
 unintentional+3.525
 epidemic+3.481
lete+3.398
amaz+3.386
 offender+3.331
Neg logit contributions
 licences-4.742
abouts-4.628
 preferences-4.588
 finances-4.526
 Preferences-4.509
Token 100
Token ID1802
Feature activation+1.23e+01

Pos logit contributions
 unintentional+6.901
 epidemic+6.785
lete+6.539
amaz+6.494
 involuntary+6.444
Neg logit contributions
 licences-6.954
abouts-6.809
 preferences-6.689
Politics-6.630
 finances-6.613
Token Greatest
Token ID33575
Feature activation+1.37e+01

Pos logit contributions
 unintentional+10.276
 epidemic+10.073
lete+9.854
amaz+9.647
 unsuspecting+9.593
Neg logit contributions
 licences-9.226
abouts-8.969
 preferences-8.844
 finances-8.686
Politics-8.686
Token African
Token ID5510
Feature activation+1.90e+01

Pos logit contributions
 unintentional+12.244
 epidemic+12.012
lete+11.823
amaz+11.628
 unsuspecting+11.518
Neg logit contributions
 licences-9.256
 preferences-8.818
 finances-8.560
616-8.468
 ambitions-8.427
Token Americans
Token ID3399
Feature activation+4.40e+00

Pos logit contributions
 unintentional+11.434
lete+11.129
 epidemic+11.124
 American+10.621
 Americans+10.528
Neg logit contributions
 licences-16.912
 finances-15.769
 preferences-15.674
616-15.610
abouts-15.485
Token.[
Token ID3693
Feature activation-1.79e+01

Pos logit contributions
 unintentional+3.359
 epidemic+3.318
lete+3.219
amaz+3.198
 offender+3.168
Neg logit contributions
 licences-4.003
 preferences-3.931
abouts-3.906
 finances-3.835
 ambitions-3.821
Token58
Token ID3365
Feature activation-5.58e+00

Pos logit contributions
616+8.441
Politics+8.431
 licences+8.258
 preferences+8.102
 finances+7.748
Neg logit contributions
amaz-17.957
 epidemic-17.956
 involuntary-17.468
lete-17.442
 unintentional-17.349
Token 500
Token ID5323
Feature activation+1.11e+01

Pos logit contributions
 epidemic+7.413
 unintentional+7.386
lete+7.044
amaz+7.037
 involuntary+6.923
Neg logit contributions
abouts-7.972
 licences-7.872
 preferences-7.647
Politics-7.613
 Preferences-7.524
Token million
Token ID1510
Feature activation+1.98e+01

Pos logit contributions
 unintentional+7.475
 epidemic+7.204
amaz+7.078
lete+6.992
 unsuspecting+6.969
Neg logit contributions
abouts-10.266
 licences-9.890
Politics-9.814
 finances-9.767
 preferences-9.759
Token check
Token ID2198
Feature activation+1.18e+01

Pos logit contributions
 unintentional+18.231
 unsuspecting+17.683
 epidemic+17.279
 addict+17.245
 involuntary+17.238
Neg logit contributions
abouts-10.769
Politics-10.508
 licences-9.998
616-9.665
 Preferences-9.462
Token-
Token ID12
Feature activation+1.09e+01

Pos logit contributions
 unintentional+7.139
 epidemic+6.913
amaz+6.779
 offender+6.697
 unsuspecting+6.695
Neg logit contributions
Politics-11.110
 licences-11.084
abouts-11.031
 Preferences-10.971
616-10.925
Tokenins
Token ID1040
Feature activation+1.27e+01

Pos logit contributions
amaz+4.195
 unintentional+3.974
 epidemic+3.825
lete+3.805
 unsuspecting+3.634
Neg logit contributions
 licences-13.013
 preferences-12.870
 Preferences-12.833
 ambitions-12.757
Politics-12.714
Token on
Token ID319
Feature activation+9.40e+00

Pos logit contributions
 unintentional+8.875
 epidemic+8.707
lete+8.520
amaz+8.304
 unsuspecting+8.283
Neg logit contributions
abouts-10.887
 licences-10.822
Politics-10.699
 preferences-10.537
 Preferences-10.506
Token Unt
Token ID26970
Feature activation+3.25e+00

Pos logit contributions
 unintentional+7.533
 epidemic+7.446
amaz+7.205
lete+7.189
 unsuspecting+7.174
Neg logit contributions
abouts-7.770
 licences-7.713
Politics-7.513
 preferences-7.512
 ambitions-7.303
Tokenapp
Token ID1324
Feature activation-2.81e+00

Pos logit contributions
 unintentional+2.498
 epidemic+2.497
amaz+2.460
lete+2.444
 offender+2.366
Neg logit contributions
 licences-2.929
 preferences-2.888
abouts-2.866
 finances-2.808
 Preferences-2.801
Tokend
Token ID67
Feature activation-2.52e+00

Pos logit contributions
 licences+2.555
 preferences+2.544
abouts+2.522
 finances+2.473
 Preferences+2.461
Neg logit contributions
 unintentional-2.151
 epidemic-2.116
lete-2.043
amaz-2.039
 involuntary-2.028
Token.
Token ID13
Feature activation+1.21e+01

Pos logit contributions
 licences+2.294
 preferences+2.283
abouts+2.248
 finances+2.223
Politics+2.214
Neg logit contributions
 unintentional-1.918
 epidemic-1.908
amaz-1.848
lete-1.836
 involuntary-1.814
Token Think
Token ID11382
Feature activation-7.33e-01

Pos logit contributions
amaz+8.257
 unintentional+8.197
 epidemic+8.160
lete+8.100
 addict+7.584
Neg logit contributions
 licences-10.949
 preferences-10.608
abouts-10.607
 finances-10.444
 ambitions-10.381
Token is
Token ID318
Feature activation-5.42e+00

Pos logit contributions
 unintentional+2.744
 epidemic+2.731
lete+2.646
amaz+2.643
 offender+2.606
Neg logit contributions
 licences-2.728
abouts-2.706
 preferences-2.696
Politics-2.620
 finances-2.617
Token the
Token ID262
Feature activation+1.00e+00

Pos logit contributions
 licences+4.316
 preferences+4.263
abouts+4.250
 ambitions+4.167
 finances+4.161
Neg logit contributions
 unintentional-4.566
 epidemic-4.517
amaz-4.487
lete-4.427
 offender-4.368
Token main
Token ID1388
Feature activation+6.53e+00

Pos logit contributions
 unintentional+0.852
 epidemic+0.846
amaz+0.826
lete+0.824
 offender+0.809
Neg logit contributions
 licences-0.828
 preferences-0.818
abouts-0.817
 finances-0.794
 Preferences-0.791
Token attraction
Token ID17416
Feature activation+6.23e+00

Pos logit contributions
 unintentional+5.256
 epidemic+5.190
 offender+5.037
amaz+4.993
lete+4.992
Neg logit contributions
abouts-5.603
 licences-5.575
 preferences-5.478
Politics-5.381
 Preferences-5.348
Token;
Token ID26
Feature activation+2.51e+00

Pos logit contributions
 unintentional+4.691
 epidemic+4.657
lete+4.517
 offender+4.463
amaz+4.444
Neg logit contributions
 licences-5.577
abouts-5.553
 preferences-5.493
Politics-5.447
 Preferences-5.418
Token tourists
Token ID15930
Feature activation+7.18e+00

Pos logit contributions
 unintentional+2.176
 epidemic+2.155
amaz+2.098
lete+2.094
 offender+2.063
Neg logit contributions
 licences-2.029
abouts-2.008
 preferences-2.004
Politics-1.947
 finances-1.943
Token are
Token ID389
Feature activation+6.17e+00

Pos logit contributions
 epidemic+5.764
 unintentional+5.761
 unsuspecting+5.541
lete+5.512
 offender+5.493
Neg logit contributions
Politics-5.903
 licences-5.891
abouts-5.884
 Preferences-5.796
 preferences-5.782
Token allowed
Token ID3142
Feature activation-1.94e+00

Pos logit contributions
 unintentional+4.753
 epidemic+4.679
 unsuspecting+4.493
 involuntary+4.477
amaz+4.472
Neg logit contributions
abouts-5.483
 licences-5.454
Politics-5.407
 preferences-5.388
 Preferences-5.376
Token to
Token ID284
Feature activation-5.42e+00

Pos logit contributions
 licences+1.664
 preferences+1.646
abouts+1.622
 finances+1.588
 ambitions+1.584
Neg logit contributions
 unintentional-1.582
 epidemic-1.574
amaz-1.550
lete-1.544
 offender-1.505
Token freely
Token ID12748
Feature activation-5.98e+00

Pos logit contributions
 preferences+4.096
 licences+4.092
abouts+3.991
 finances+3.896
 ambitions+3.877
Neg logit contributions
 unintentional-4.856
 epidemic-4.797
amaz-4.790
lete-4.761
 offender-4.591
Token visit
Token ID3187
Feature activation+7.54e+00

Pos logit contributions
 licences+4.598
 preferences+4.581
abouts+4.473
 finances+4.348
 ambitions+4.332
Neg logit contributions
 unintentional-5.304
amaz-5.263
 epidemic-5.191
lete-5.126
 involuntary-5.001
Tokenbeing
Token ID11873
Feature activation+1.66e+00

Pos logit contributions
 unintentional+0.556
 epidemic+0.554
amaz+0.544
lete+0.541
 offender+0.535
Neg logit contributions
 licences-0.269
 preferences-0.266
abouts-0.265
 finances-0.254
Politics-0.251
Token,
Token ID11
Feature activation-3.29e+00

Pos logit contributions
 unintentional+1.172
 epidemic+1.168
lete+1.120
amaz+1.115
 offender+1.100
Neg logit contributions
 licences-1.610
abouts-1.599
 preferences-1.580
Politics-1.553
 Preferences-1.550
Token and
Token ID290
Feature activation-6.46e-02

Pos logit contributions
 preferences+3.027
 licences+2.992
 finances+2.952
 ambitions+2.914
abouts+2.898
Neg logit contributions
 unintentional-2.461
amaz-2.457
 epidemic-2.426
lete-2.404
 unsuspecting-2.327
Token safety
Token ID3747
Feature activation+1.61e+00

Pos logit contributions
 licences+0.051
 preferences+0.050
abouts+0.049
 finances+0.049
 ambitions+0.048
Neg logit contributions
 unintentional-0.058
 epidemic-0.057
amaz-0.057
lete-0.056
 offender-0.055
Token of
Token ID286
Feature activation+3.52e+01

Pos logit contributions
 unintentional+1.223
 epidemic+1.217
lete+1.175
amaz+1.175
 offender+1.154
Neg logit contributions
 licences-1.476
abouts-1.460
 preferences-1.454
Politics-1.419
 finances-1.419
Token LGBT
Token ID9916
Feature activation+2.43e+01

Pos logit contributions
 adult+25.294
 adults+24.992
 epidemic+24.928
 unintentional+24.833
 individuals+24.796
Neg logit contributions
Politics-16.482
 licences-15.443
ヴァ-15.401
abouts-15.347
616-15.036
Token children
Token ID1751
Feature activation+1.58e+01

Pos logit contributions
 adults+17.764
 unintentional+17.700
 epidemic+17.600
 adult+17.591
 individuals+17.495
Neg logit contributions
 licences-15.149
Politics-14.882
abouts-14.182
616-14.179
 preferences-13.959
Token,
Token ID11
Feature activation+2.32e+01

Pos logit contributions
 epidemic+11.040
 unintentional+10.901
lete+10.705
 offender+10.492
 aggravated+10.410
Neg logit contributions
 licences-13.122
Politics-12.713
abouts-12.432
 Preferences-12.431
 preferences-12.373
Token youth
Token ID6205
Feature activation+5.50e+00

Pos logit contributions
 unintentional+18.877
 epidemic+18.832
 elderly+18.702
 adults+18.635
 adult+18.503
Neg logit contributions
abouts-13.928
 licences-13.121
Politics-12.910
ヴァ-12.637
616-12.066
Token,
Token ID11
Feature activation+1.66e+01

Pos logit contributions
 unintentional+4.031
 epidemic+4.025
lete+3.889
 offender+3.855
amaz+3.817
Neg logit contributions
 licences-5.115
abouts-5.041
 preferences-4.998
Politics-4.930
 Preferences-4.917
Token and
Token ID290
Feature activation+2.46e+01

Pos logit contributions
 unintentional+12.431
 epidemic+12.185
 elderly+12.014
 adults+11.806
 offender+11.782
Neg logit contributions
abouts-12.954
 licences-12.866
Politics-12.597
 Preferences-12.370
 preferences-12.232
Token you
Token ID345
Feature activation-6.67e+00

Pos logit contributions
 preferences+10.341
 licences+9.812
abouts+9.730
 finances+9.727
 ambitions+9.653
Neg logit contributions
lete-12.341
amaz-12.296
 epidemic-12.114
 unintentional-11.992
 offender-11.858
Token going
Token ID1016
Feature activation+2.85e+00

Pos logit contributions
 preferences+5.194
 licences+5.143
abouts+5.128
 finances+5.009
 ambitions+4.952
Neg logit contributions
 epidemic-5.599
 unintentional-5.583
lete-5.563
amaz-5.529
 offender-5.348
Token to
Token ID284
Feature activation-2.01e+00

Pos logit contributions
 unintentional+2.000
 epidemic+1.982
lete+1.908
amaz+1.901
 involuntary+1.885
Neg logit contributions
 licences-2.786
abouts-2.740
 preferences-2.732
Politics-2.685
 Preferences-2.680
Token become
Token ID1716
Feature activation-4.23e+00

Pos logit contributions
 licences+1.695
 preferences+1.690
abouts+1.669
 finances+1.633
 ambitions+1.623
Neg logit contributions
 unintentional-1.651
 epidemic-1.631
amaz-1.623
lete-1.611
 offender-1.556
Token so
Token ID523
Feature activation+9.34e-01

Pos logit contributions
 preferences+3.788
 licences+3.773
abouts+3.715
 finances+3.660
 ambitions+3.637
Neg logit contributions
 unintentional-3.144
amaz-3.128
 epidemic-3.124
lete-3.094
 offender-2.984
Token impatient
Token ID33440
Feature activation+2.81e+00

Pos logit contributions
 unintentional+0.749
 epidemic+0.744
amaz+0.725
lete+0.722
 offender+0.709
Neg logit contributions
 licences-0.817
 preferences-0.808
abouts-0.807
 finances-0.785
 Preferences-0.782
Token with
Token ID351
Feature activation+1.19e+00

Pos logit contributions
 unintentional+2.127
 epidemic+2.109
amaz+2.048
lete+2.042
 offender+2.009
Neg logit contributions
 licences-2.575
abouts-2.542
 preferences-2.534
Politics-2.474
 finances-2.471
Token their
Token ID511
Feature activation-1.15e+01

Pos logit contributions
 unintentional+1.018
 epidemic+1.014
amaz+0.990
lete+0.987
 offender+0.970
Neg logit contributions
 licences-0.974
 preferences-0.966
abouts-0.958
 finances-0.937
 ambitions-0.930
Token process
Token ID1429
Feature activation-5.50e+00

Pos logit contributions
 preferences+10.127
 licences+9.684
 finances+9.660
 ambitions+9.622
 Preferences+9.412
Neg logit contributions
lete-8.410
amaz-8.310
 epidemic-8.080
 unintentional-7.966
ounding-7.681
Token that
Token ID326
Feature activation-6.32e+00

Pos logit contributions
 preferences+5.202
 licences+5.121
abouts+5.069
 finances+5.004
 Preferences+4.989
Neg logit contributions
 unintentional-3.906
 epidemic-3.866
amaz-3.768
lete-3.759
 unsuspecting-3.668
Token you
Token ID345
Feature activation+3.23e+00

Pos logit contributions
 preferences+4.392
 licences+4.284
abouts+4.217
 finances+4.206
 ambitions+4.166
Neg logit contributions
 epidemic-5.925
 unintentional-5.863
amaz-5.831
lete-5.821
 offender-5.682
Token you
Token ID345
Feature activation-8.71e-02

Pos logit contributions
 unintentional+1.010
 epidemic+1.003
amaz+0.979
lete+0.975
 offender+0.958
Neg logit contributions
 licences-1.004
 preferences-0.994
abouts-0.992
 finances-0.964
 Preferences-0.960
Token can
Token ID460
Feature activation-6.47e-01

Pos logit contributions
 licences+0.073
 preferences+0.073
abouts+0.073
 finances+0.071
 Preferences+0.070
Neg logit contributions
 unintentional-0.073
 epidemic-0.072
amaz-0.071
lete-0.070
 involuntary-0.069
Token even
Token ID772
Feature activation-5.82e+00

Pos logit contributions
 licences+0.541
 preferences+0.539
abouts+0.533
 finances+0.519
 ambitions+0.518
Neg logit contributions
 unintentional-0.543
 epidemic-0.539
amaz-0.528
lete-0.524
 involuntary-0.514
Token hear
Token ID3285
Feature activation+5.44e+00

Pos logit contributions
 preferences+4.841
 licences+4.754
abouts+4.650
 finances+4.585
 Preferences+4.578
Neg logit contributions
 unintentional-4.761
 epidemic-4.729
amaz-4.697
lete-4.624
 involuntary-4.484
Token radio
Token ID5243
Feature activation-8.28e+00

Pos logit contributions
 unintentional+4.503
 epidemic+4.411
amaz+4.289
lete+4.265
 unsuspecting+4.251
Neg logit contributions
 licences-4.584
abouts-4.475
 preferences-4.458
Politics-4.453
 finances-4.363
Token chatter
Token ID37303
Feature activation-2.45e+00

Pos logit contributions
 preferences+5.839
 licences+5.755
abouts+5.611
 ambitions+5.549
 finances+5.500
Neg logit contributions
amaz-7.528
 epidemic-7.518
 unintentional-7.489
lete-7.378
 involuntary-7.271
Token from
Token ID422
Feature activation-1.31e+00

Pos logit contributions
 preferences+2.070
 licences+2.064
abouts+2.057
 finances+1.984
 Preferences+1.980
Neg logit contributions
 unintentional-2.009
 epidemic-1.995
amaz-1.970
lete-1.954
 offender-1.913
Token your
Token ID534
Feature activation-6.83e+00

Pos logit contributions
 licences+1.014
abouts+1.007
 preferences+1.007
 finances+0.974
 ambitions+0.968
Neg logit contributions
 unintentional-1.179
 epidemic-1.173
amaz-1.155
lete-1.148
 involuntary-1.124
Token fellow
Token ID5891
Feature activation+1.09e+01

Pos logit contributions
 preferences+5.752
 licences+5.700
abouts+5.695
 finances+5.560
 Preferences+5.525
Neg logit contributions
amaz-5.479
 epidemic-5.424
 unintentional-5.401
lete-5.343
 involuntary-5.201
Token pilots
Token ID14982
Feature activation-1.89e+00

Pos logit contributions
 unintentional+8.125
 epidemic+7.996
 unsuspecting+7.936
 offender+7.744
lete+7.681
Neg logit contributions
 licences-9.226
Politics-9.222
 preferences-9.048
abouts-8.991
 Preferences-8.859
Token!
Token ID0
Feature activation+8.69e+00

Pos logit contributions
 licences+1.644
abouts+1.640
 preferences+1.637
 finances+1.582
 ambitions+1.579
Neg logit contributions
 unintentional-1.505
 epidemic-1.494
amaz-1.466
lete-1.454
 offender-1.429
Token other
Token ID584
Feature activation-6.62e+00

Pos logit contributions
 unintentional+2.906
 epidemic+2.900
lete+2.806
 offender+2.771
 involuntary+2.760
Neg logit contributions
 licences-3.326
abouts-3.308
 preferences-3.274
Politics-3.232
 Preferences-3.208
Token and
Token ID290
Feature activation-5.49e+00

Pos logit contributions
abouts+5.519
 preferences+5.471
 licences+5.442
 finances+5.274
 ambitions+5.261
Neg logit contributions
 unintentional-5.327
 epidemic-5.242
amaz-5.167
lete-5.083
 offender-5.031
Token block
Token ID2512
Feature activation-6.97e-01

Pos logit contributions
 licences+4.546
 preferences+4.535
abouts+4.473
 finances+4.392
 ambitions+4.386
Neg logit contributions
amaz-4.494
 unintentional-4.477
 epidemic-4.412
lete-4.373
 offender-4.241
Token the
Token ID262
Feature activation-5.67e+00

Pos logit contributions
 licences+0.620
 preferences+0.616
abouts+0.612
 finances+0.597
 ambitions+0.595
Neg logit contributions
 unintentional-0.546
 epidemic-0.543
amaz-0.532
lete-0.527
 offender-0.518
Token tub
Token ID12202
Feature activation+1.87e-01

Pos logit contributions
 preferences+4.773
 licences+4.737
 ambitions+4.640
abouts+4.608
 finances+4.596
Neg logit contributions
amaz-4.560
 unintentional-4.469
 epidemic-4.449
lete-4.402
 offender-4.232
Tokenules
Token ID5028
Feature activation-9.41e+00

Pos logit contributions
 unintentional+0.131
 epidemic+0.129
amaz+0.125
lete+0.124
 offender+0.123
Neg logit contributions
 licences-0.183
abouts-0.181
 preferences-0.181
 finances-0.176
 ambitions-0.176
Token.
Token ID13
Feature activation+3.87e+00

Pos logit contributions
 licences+7.531
 preferences+7.447
abouts+7.343
 ambitions+7.288
 finances+7.202
Neg logit contributions
amaz-7.461
 unintentional-7.453
 epidemic-7.285
lete-7.202
 offender-7.076
Token Credit
Token ID10504
Feature activation-1.44e+00

Pos logit contributions
 unintentional+2.908
 epidemic+2.892
amaz+2.812
lete+2.803
 offender+2.743
Neg logit contributions
 licences-3.545
abouts-3.497
 preferences-3.494
 finances-3.422
 ambitions-3.395
Token:
Token ID25
Feature activation-3.66e+00

Pos logit contributions
 licences+1.210
 preferences+1.204
abouts+1.194
 Preferences+1.167
 finances+1.166
Neg logit contributions
 unintentional-1.207
 epidemic-1.196
lete-1.166
amaz-1.166
 offender-1.143
Token MP
Token ID4904
Feature activation-2.69e+00

Pos logit contributions
 licences+3.066
 preferences+3.041
abouts+3.017
 Preferences+2.989
 finances+2.945
Neg logit contributions
 unintentional-2.992
 epidemic-2.952
amaz-2.916
lete-2.894
 offender-2.843
TokenI
Token ID40
Feature activation+6.08e-02

Pos logit contributions
 licences+1.930
 preferences+1.897
abouts+1.881
Politics+1.838
 finances+1.834
Neg logit contributions
 unintentional-2.574
 epidemic-2.551
amaz-2.507
lete-2.482
 offender-2.464
Token seats
Token ID8632
Feature activation+4.76e-01

Pos logit contributions
 licences+0.480
 preferences+0.473
abouts+0.462
 finances+0.445
 ambitions+0.444
Neg logit contributions
 unintentional-1.028
 epidemic-1.022
amaz-1.011
lete-1.004
 involuntary-0.985
Token which
Token ID543
Feature activation-5.00e+00

Pos logit contributions
 unintentional+0.372
 epidemic+0.369
amaz+0.360
lete+0.358
 offender+0.350
Neg logit contributions
 licences-0.428
 preferences-0.424
abouts-0.423
 finances-0.412
 Preferences-0.410
Token represent
Token ID2380
Feature activation-3.57e+00

Pos logit contributions
 licences+4.130
 preferences+4.041
abouts+3.992
 finances+3.926
 ambitions+3.871
Neg logit contributions
 unintentional-4.265
amaz-4.217
 epidemic-4.208
lete-4.165
 unsuspecting-4.008
Token different
Token ID1180
Feature activation-9.95e+00

Pos logit contributions
 licences+3.245
 preferences+3.222
abouts+3.168
 finances+3.115
 ambitions+3.106
Neg logit contributions
 unintentional-2.734
 epidemic-2.719
amaz-2.707
lete-2.681
 offender-2.585
Token segments
Token ID17894
Feature activation-1.06e+01

Pos logit contributions
 preferences+8.760
 licences+8.573
 ambitions+8.365
 finances+8.265
abouts+8.055
Neg logit contributions
amaz-7.827
 unintentional-7.483
lete-7.394
 epidemic-7.378
 unsuspecting-7.102
Token of
Token ID286
Feature activation+5.26e-01

Pos logit contributions
 preferences+8.309
 licences+8.228
abouts+8.149
 finances+7.854
 ambitions+7.805
Neg logit contributions
amaz-8.930
 unintentional-8.817
 epidemic-8.771
lete-8.624
 involuntary-8.444
Token the
Token ID262
Feature activation-2.29e+00

Pos logit contributions
 unintentional+0.437
 epidemic+0.433
amaz+0.423
lete+0.422
 offender+0.414
Neg logit contributions
 licences-0.446
 preferences-0.442
abouts-0.440
 finances-0.429
 Preferences-0.427
Token Mag
Token ID2944
Feature activation-4.76e-01

Pos logit contributions
 preferences+1.784
 licences+1.783
abouts+1.760
 finances+1.727
 Preferences+1.710
Neg logit contributions
 unintentional-2.045
 epidemic-2.023
amaz-2.014
lete-1.988
 involuntary-1.940
Tokenp
Token ID79
Feature activation+9.05e-01

Pos logit contributions
 licences+0.382
abouts+0.382
 preferences+0.380
Politics+0.368
 finances+0.368
Neg logit contributions
 unintentional-0.415
 epidemic-0.410
amaz-0.402
lete-0.395
 involuntary-0.395
Tokenies
Token ID444
Feature activation-6.26e+00

Pos logit contributions
 unintentional+0.650
 epidemic+0.639
amaz+0.625
lete+0.617
 offender+0.607
Neg logit contributions
 licences-0.872
 preferences-0.867
abouts-0.867
 finances-0.843
 ambitions-0.842
Token'
Token ID6
Feature activation-1.10e+01

Pos logit contributions
 preferences+5.060
 licences+4.993
abouts+4.955
 finances+4.951
 ambitions+4.883
Neg logit contributions
 unintentional-5.280
amaz-5.218
 epidemic-5.205
lete-5.143
 involuntary-5.030
Token because
Token ID780
Feature activation-8.05e+00

Pos logit contributions
 unintentional+2.541
 epidemic+2.526
lete+2.432
amaz+2.431
 offender+2.401
Neg logit contributions
 licences-3.212
abouts-3.179
 preferences-3.171
 Preferences-3.096
Politics-3.091
Token beside
Token ID13970
Feature activation-8.83e-01

Pos logit contributions
 preferences+6.722
 licences+6.698
 finances+6.554
abouts+6.534
 ambitions+6.478
Neg logit contributions
amaz-6.344
lete-6.304
 unintentional-6.285
 epidemic-6.181
 offender-5.971
Token marking
Token ID18730
Feature activation+1.80e+00

Pos logit contributions
 licences+0.774
 preferences+0.768
abouts+0.759
 finances+0.746
 ambitions+0.742
Neg logit contributions
 unintentional-0.706
 epidemic-0.700
amaz-0.686
lete-0.685
 offender-0.670
Token the
Token ID262
Feature activation+2.43e+00

Pos logit contributions
 unintentional+1.411
 epidemic+1.400
amaz+1.359
lete+1.356
 offender+1.332
Neg logit contributions
 licences-1.602
abouts-1.584
 preferences-1.584
 finances-1.540
Politics-1.538
Token end
Token ID886
Feature activation-6.96e-01

Pos logit contributions
 unintentional+1.729
 epidemic+1.717
lete+1.649
amaz+1.649
 offender+1.625
Neg logit contributions
 licences-2.334
abouts-2.318
 preferences-2.309
Politics-2.255
 Preferences-2.252
Token of
Token ID286
Feature activation+3.85e-01

Pos logit contributions
 licences+0.631
 preferences+0.624
abouts+0.623
 finances+0.607
Politics+0.605
Neg logit contributions
 unintentional-0.536
 epidemic-0.532
amaz-0.518
lete-0.516
 offender-0.506
Token 2009
Token ID3717
Feature activation-6.15e+00

Pos logit contributions
 unintentional+0.286
 epidemic+0.283
amaz+0.277
lete+0.275
 offender+0.269
Neg logit contributions
 licences-0.361
 preferences-0.357
abouts-0.356
 finances-0.348
 Preferences-0.346
Token,
Token ID11
Feature activation-6.21e+00

Pos logit contributions
 preferences+5.071
 licences+5.023
abouts+4.944
 finances+4.917
 ambitions+4.835
Neg logit contributions
 unintentional-5.033
amaz-5.025
 epidemic-4.926
lete-4.913
 offender-4.799
Token in
Token ID287
Feature activation+2.62e-01

Pos logit contributions
 licences+4.938
 preferences+4.935
abouts+4.821
 finances+4.761
 ambitions+4.740
Neg logit contributions
 unintentional-5.205
amaz-5.190
 epidemic-5.149
lete-5.098
 offender-4.990
Token just
Token ID655
Feature activation-4.64e+00

Pos logit contributions
 unintentional+0.206
 epidemic+0.204
amaz+0.199
lete+0.198
 offender+0.195
Neg logit contributions
 licences-0.233
 preferences-0.231
abouts-0.230
 finances-0.224
 Preferences-0.223
Token couple
Token ID3155
Feature activation-5.12e+00

Pos logit contributions
 licences+3.956
 preferences+3.955
abouts+3.882
 finances+3.816
 ambitions+3.804
Neg logit contributions
 unintentional-3.689
 epidemic-3.654
lete-3.638
amaz-3.613
 offender-3.488
Tokens
Token ID82
Feature activation-4.03e+00

Pos logit contributions
 licences+0.701
 preferences+0.693
abouts+0.687
 finances+0.664
 Preferences+0.662
Neg logit contributions
 unintentional-0.944
 epidemic-0.938
amaz-0.918
lete-0.916
 offender-0.901
Token)}
Token ID38165
Feature activation-9.24e+00

Pos logit contributions
abouts+1.699
 licences+1.696
 preferences+1.673
Politics+1.617
 finances+1.570
Neg logit contributions
 unintentional-5.001
 epidemic-4.980
amaz-4.899
lete-4.843
 unsuspecting-4.833
Token d
Token ID288
Feature activation+1.64e+00

Pos logit contributions
 licences+6.562
 preferences+6.428
 ambitions+6.257
abouts+6.206
 Preferences+6.134
Neg logit contributions
amaz-8.275
 unintentional-8.194
lete-8.117
 epidemic-8.067
 offender-7.849
Token5
Token ID20
Feature activation+6.12e+00

Pos logit contributions
 unintentional+2.018
 epidemic+2.007
amaz+1.976
lete+1.975
 offender+1.947
Neg logit contributions
 licences-0.735
 preferences-0.721
abouts-0.717
 Preferences-0.682
 finances-0.680
Token {
Token ID1391
Feature activation-6.09e+00

Pos logit contributions
 unintentional+4.740
 epidemic+4.730
amaz+4.618
lete+4.554
 offender+4.520
Neg logit contributions
 licences-5.335
abouts-5.190
 preferences-5.177
 finances-5.143
Politics-5.059
Token(
Token ID7
Feature activation-2.75e+01

Pos logit contributions
abouts+3.876
Politics+3.821
 preferences+3.815
 licences+3.808
616+3.801
Neg logit contributions
 unintentional-6.239
 epidemic-6.158
 involuntary-5.980
lete-5.976
amaz-5.974
Token573
Token ID48638
Feature activation-7.61e-02

Pos logit contributions
616+18.586
abouts+16.836
Politics+16.420
 preferences+16.171
 licences+16.121
Neg logit contributions
 unintentional-17.056
 epidemic-16.829
 involuntary-16.733
 unsuspecting-16.695
 offender-16.277
Tokens
Token ID82
Feature activation-1.86e+00

Pos logit contributions
 licences+0.065
 preferences+0.064
abouts+0.064
 finances+0.062
 Preferences+0.062
Neg logit contributions
 unintentional-0.063
 epidemic-0.062
amaz-0.061
lete-0.060
 offender-0.059
Token)}
Token ID38165
Feature activation+5.07e-01

Pos logit contributions
 licences+0.858
abouts+0.844
 preferences+0.843
Politics+0.799
 finances+0.797
Neg logit contributions
 unintentional-2.249
 epidemic-2.238
amaz-2.200
lete-2.188
 offender-2.169
Token 17
Token ID1596
Feature activation+6.49e+00

Pos logit contributions
 unintentional+0.459
 epidemic+0.457
amaz+0.445
lete+0.444
 offender+0.437
Neg logit contributions
 licences-0.391
 preferences-0.386
abouts-0.386
 finances-0.374
Politics-0.373
Token.
Token ID13
Feature activation-6.89e+00

Pos logit contributions
 unintentional+3.594
 epidemic+3.580
amaz+3.494
lete+3.489
 offender+3.353
Neg logit contributions
 licences-7.017
 preferences-6.836
abouts-6.812
 finances-6.764
 ambitions-6.720
Token staff
Token ID3085
Feature activation+9.92e+00

Pos logit contributions
 preferences+3.740
 licences+3.715
 finances+3.680
abouts+3.641
 ambitions+3.596
Neg logit contributions
 unintentional-3.695
 epidemic-3.632
amaz-3.621
lete-3.548
 involuntary-3.468
Token to
Token ID284
Feature activation+6.31e+00

Pos logit contributions
 unintentional+7.031
 epidemic+7.005
lete+6.949
amaz+6.774
 offender+6.674
Neg logit contributions
 licences-8.905
abouts-8.850
 preferences-8.622
Politics-8.584
 Preferences-8.558
Token ear
Token ID1027
Feature activation+1.70e+00

Pos logit contributions
 epidemic+5.054
 unintentional+5.049
amaz+4.862
lete+4.844
 offender+4.839
Neg logit contributions
 licences-5.315
abouts-5.300
 preferences-5.218
Politics-5.207
 Preferences-5.117
Tokenmark
Token ID4102
Feature activation-1.18e+00

Pos logit contributions
 unintentional+1.351
 epidemic+1.346
lete+1.319
amaz+1.309
 offender+1.288
Neg logit contributions
 licences-1.499
abouts-1.475
 preferences-1.468
Politics-1.433
 Preferences-1.432
Token roughly
Token ID7323
Feature activation-1.54e+00

Pos logit contributions
 licences+0.986
 preferences+0.977
abouts+0.971
 finances+0.951
 ambitions+0.941
Neg logit contributions
 unintentional-0.986
 epidemic-0.977
lete-0.957
amaz-0.957
 offender-0.934
Token $
Token ID720
Feature activation-2.07e+01

Pos logit contributions
 licences+1.278
 preferences+1.266
abouts+1.262
 finances+1.234
 ambitions+1.220
Neg logit contributions
 unintentional-1.300
 epidemic-1.286
amaz-1.265
lete-1.263
 offender-1.238
Token2
Token ID17
Feature activation+7.75e+00

Pos logit contributions
616+10.645
Politics+9.143
abouts+9.088
70710+8.657
 licences+7.925
Neg logit contributions
 unintentional-20.411
 epidemic-19.915
 involuntary-19.778
 offender-19.739
 unsuspecting-19.613
Token.
Token ID13
Feature activation-1.40e+01

Pos logit contributions
 unintentional+4.761
 epidemic+4.668
amaz+4.431
lete+4.419
 offender+4.417
Neg logit contributions
 licences-7.855
abouts-7.830
 preferences-7.763
Politics-7.702
 Preferences-7.689
Token12
Token ID1065
Feature activation+6.35e+00

Pos logit contributions
616+4.446
Politics+3.635
abouts+3.510
 licences+3.440
70710+3.359
Neg logit contributions
 unintentional-17.905
 epidemic-17.723
 involuntary-17.526
 unsuspecting-17.451
amaz-17.448
Token million
Token ID1510
Feature activation+1.38e-01

Pos logit contributions
 unintentional+4.045
 epidemic+3.980
amaz+3.812
lete+3.800
 offender+3.759
Neg logit contributions
 licences-6.423
abouts-6.399
 preferences-6.356
Politics-6.298
 Preferences-6.268
Token for
Token ID329
Feature activation+2.37e+00

Pos logit contributions
 unintentional+0.104
 epidemic+0.103
lete+0.101
amaz+0.101
 offender+0.098
Neg logit contributions
 licences-0.128
 preferences-0.127
abouts-0.126
 finances-0.123
 ambitions-0.122
Token others
Token ID1854
Feature activation+3.50e+00

Pos logit contributions
 unintentional+4.866
 epidemic+4.821
lete+4.647
amaz+4.641
 offender+4.639
Neg logit contributions
 licences-3.631
abouts-3.588
 preferences-3.549
Politics-3.476
 Preferences-3.456
Token,
Token ID11
Feature activation+7.87e+00

Pos logit contributions
 unintentional+2.903
 epidemic+2.885
lete+2.789
amaz+2.776
 offender+2.754
Neg logit contributions
 licences-2.946
abouts-2.903
 preferences-2.901
Politics-2.837
 Preferences-2.832
Token they
Token ID484
Feature activation+1.21e+01

Pos logit contributions
 unintentional+6.275
 epidemic+6.195
lete+5.937
amaz+5.908
 unsuspecting+5.902
Neg logit contributions
 licences-6.655
abouts-6.549
 preferences-6.481
Politics-6.405
 Preferences-6.398
Token said
Token ID531
Feature activation-4.59e+00

Pos logit contributions
 epidemic+9.140
 unintentional+9.040
 aggravated+8.961
 offender+8.729
lete+8.673
Neg logit contributions
 Preferences-9.738
 licences-9.708
 preferences-9.626
Politics-9.592
abouts-9.517
Token that
Token ID326
Feature activation-9.96e+00

Pos logit contributions
 preferences+3.705
 licences+3.694
abouts+3.629
 finances+3.602
 ambitions+3.530
Neg logit contributions
amaz-3.836
 epidemic-3.835
 unintentional-3.835
lete-3.778
 involuntary-3.661
Token wasn
Token ID2492
Feature activation-1.83e+01

Pos logit contributions
 preferences+7.931
 licences+7.925
 finances+7.802
abouts+7.694
 ambitions+7.589
Neg logit contributions
amaz-7.879
lete-7.793
 epidemic-7.720
 unintentional-7.670
ounding-7.364
Token't
Token ID470
Feature activation-1.01e+01

Pos logit contributions
616+6.599
abouts+6.323
Politics+6.248
 preferences+5.707
 licences+5.604
Neg logit contributions
ortunately-21.999
 eleph-21.554
 unintentional-20.680
 earthqu-20.553
 unsuspecting-20.238
Token true
Token ID2081
Feature activation-5.66e+00

Pos logit contributions
 licences+8.121
 preferences+8.055
 finances+7.903
 ambitions+7.730
abouts+7.670
Neg logit contributions
lete-7.649
 epidemic-7.536
amaz-7.517
 unintentional-7.378
 unsuspecting-7.319
Token.
Token ID13
Feature activation+3.48e+00

Pos logit contributions
 preferences+4.888
abouts+4.879
 licences+4.876
 finances+4.751
Politics+4.699
Neg logit contributions
 unintentional-4.386
 epidemic-4.357
amaz-4.320
lete-4.259
 unsuspecting-4.190
Token They
Token ID1119
Feature activation+1.48e+01

Pos logit contributions
 unintentional+2.805
 epidemic+2.787
amaz+2.718
lete+2.708
 offender+2.658
Neg logit contributions
 licences-3.007
 preferences-2.965
abouts-2.955
 finances-2.881
 ambitions-2.875
Token gave
Token ID2921
Feature activation+4.71e+00

Pos logit contributions
 epidemic+10.965
 aggravated+10.835
 unintentional+10.795
 offender+10.532
 addict+10.403
Neg logit contributions
 Preferences-11.552
 preferences-11.363
 licences-11.334
Politics-11.319
abouts-11.152
Token2
Token ID17
Feature activation+1.77e+00

Pos logit contributions
abouts+8.761
 preferences+8.497
 licences+8.491
616+8.480
Politics+8.367
Neg logit contributions
 unintentional-7.297
 epidemic-7.132
lete-6.891
 unsuspecting-6.861
 involuntary-6.827
Token.
Token ID13
Feature activation-2.05e+01

Pos logit contributions
 unintentional+1.291
 epidemic+1.278
amaz+1.243
lete+1.239
 offender+1.213
Neg logit contributions
 licences-1.680
 preferences-1.663
abouts-1.659
 finances-1.617
 Preferences-1.612
Token6
Token ID21
Feature activation+4.41e+00

Pos logit contributions
616+8.620
Politics+7.122
70710+7.105
 licences+6.737
abouts+6.692
Neg logit contributions
 unintentional-21.862
 involuntary-21.451
 epidemic-21.390
 unsuspecting-21.369
 offender-20.874
Token
Token ID784
Feature activation-8.52e+00

Pos logit contributions
 epidemic+3.717
 unintentional+3.713
lete+3.612
amaz+3.607
 offender+3.545
Neg logit contributions
 licences-3.599
abouts-3.579
 preferences-3.563
Politics-3.488
 finances-3.477
Token 3
Token ID513
Feature activation+6.57e+00

Pos logit contributions
 preferences+5.295
 licences+5.278
abouts+5.162
 ambitions+5.008
 finances+4.910
Neg logit contributions
 unintentional-8.469
lete-8.330
 epidemic-8.322
amaz-8.261
 offender-8.159
Token.
Token ID13
Feature activation-2.93e+01

Pos logit contributions
 epidemic+4.744
 unintentional+4.710
lete+4.575
amaz+4.487
 offender+4.479
Neg logit contributions
abouts-6.017
 licences-6.000
 preferences-5.956
Politics-5.923
 Preferences-5.861
Token5
Token ID20
Feature activation+3.40e-01

Pos logit contributions
616+16.164
 licences+15.126
7+14.650
 preferences+14.549
5+14.471
Neg logit contributions
 unintentional-19.716
 epidemic-19.560
amaz-19.482
 unsuspecting-19.149
 involuntary-18.961
Token kg
Token ID14211
Feature activation-5.65e+00

Pos logit contributions
 unintentional+0.288
 epidemic+0.285
amaz+0.278
lete+0.277
 involuntary+0.273
Neg logit contributions
 licences-0.283
 preferences-0.280
abouts-0.278
 finances-0.271
 ambitions-0.270
Token),
Token ID828
Feature activation-2.59e+00

Pos logit contributions
 licences+3.814
 preferences+3.749
abouts+3.744
 ambitions+3.574
 finances+3.537
Neg logit contributions
 unintentional-5.584
 epidemic-5.452
lete-5.384
amaz-5.377
 involuntary-5.332
Token but
Token ID475
Feature activation+7.40e+00

Pos logit contributions
 licences+1.958
 preferences+1.953
abouts+1.910
 finances+1.873
 ambitions+1.856
Neg logit contributions
 unintentional-2.359
amaz-2.343
 epidemic-2.337
lete-2.322
 offender-2.254
Token differ
Token ID13238
Feature activation-6.63e+00

Pos logit contributions
 unintentional+6.223
 epidemic+6.222
 involuntary+5.952
lete+5.946
 offender+5.923
Neg logit contributions
 licences-5.850
abouts-5.818
Politics-5.772
 preferences-5.747
 Preferences-5.723
Tokenarer
Token ID11258
Feature activation-6.77e+00

Pos logit contributions
lete+6.259
amaz+5.467
ager+4.887
 offender+4.792
 epidemic+4.515
Neg logit contributions
 licences-25.974
 Preferences-24.736
 preferences-24.726
 finances-24.705
Politics-24.634
Token setting
Token ID4634
Feature activation-7.61e+00

Pos logit contributions
 preferences+4.092
 licences+3.929
 ambitions+3.833
abouts+3.827
 finances+3.809
Neg logit contributions
amaz-7.043
 unintentional-6.949
 epidemic-6.902
lete-6.818
 involuntary-6.678
Token box
Token ID3091
Feature activation-6.72e+00

Pos logit contributions
 preferences+4.606
 licences+4.533
 finances+4.387
abouts+4.384
 ambitions+4.289
Neg logit contributions
 unintentional-7.825
amaz-7.764
 epidemic-7.658
lete-7.605
 unsuspecting-7.501
Token underneath
Token ID14638
Feature activation-9.52e+00

Pos logit contributions
abouts+3.495
 preferences+3.408
 licences+3.365
 finances+3.304
 Preferences+3.181
Neg logit contributions
amaz-7.458
 epidemic-7.393
 unintentional-7.389
lete-7.329
 offender-7.197
Token roughly
Token ID7323
Feature activation-3.99e+00

Pos logit contributions
abouts+6.894
 preferences+6.461
 licences+6.236
 finances+6.188
 Preferences+6.127
Neg logit contributions
 unintentional-8.760
amaz-8.629
lete-8.566
 epidemic-8.554
 offender-8.500
Token (
Token ID357
Feature activation-2.31e+01

Pos logit contributions
abouts+3.108
 preferences+3.064
 licences+3.050
 finances+2.953
 Preferences+2.937
Neg logit contributions
 unintentional-3.536
 epidemic-3.473
amaz-3.451
lete-3.417
 unsuspecting-3.351
Token6
Token ID21
Feature activation-1.10e+01

Pos logit contributions
616+14.266
abouts+13.400
Politics+13.002
 licences+12.662
70710+12.338
Neg logit contributions
 epidemic-18.028
 unintentional-17.732
 involuntary-17.426
 addict-16.965
 unsuspecting-16.964
Token)
Token ID8
Feature activation-1.36e+01

Pos logit contributions
 preferences+5.803
 licences+5.764
abouts+5.702
Politics+5.502
 Preferences+5.425
Neg logit contributions
 unintentional-11.502
 epidemic-11.439
lete-11.290
amaz-11.250
 involuntary-11.155
Token C
Token ID327
Feature activation-5.84e+00

Pos logit contributions
 Preferences+9.970
abouts+9.783
 preferences+9.723
 licences+9.463
 finances+9.364
Neg logit contributions
amaz-10.536
 unintentional-10.461
 epidemic-10.432
lete-10.374
 offender-10.075
Token
Token ID1906
Feature activation-1.02e+01

Pos logit contributions
 preferences+4.915
abouts+4.827
 licences+4.803
 finances+4.725
 ambitions+4.723
Neg logit contributions
 unintentional-4.577
 epidemic-4.558
amaz-4.469
lete-4.436
 offender-4.412
TokenS
Token ID50
Feature activation-7.83e+00

Pos logit contributions
abouts+6.978
 preferences+6.922
 licences+6.877
Politics+6.845
 Preferences+6.738
Neg logit contributions
 unintentional-8.986
 epidemic-8.932
lete-8.798
amaz-8.715
 involuntary-8.649