TOP ACTIVATIONS
MAX = 17.708

- party package format would compare . I could not find
, but in practice cannot compete with the speed of the
he could help pick up votes with some in the less
The EPA s civil - rights office has long
to promote the accuracy , fairness , and privacy of consumer
c v 2 . COL OR _ B GR 2 G
, O . J . stands as the cab ler
G AU Opt ronics AH VA ( IPS - type )
Wiki Leaks pr ides itself as a website devoted to
Pages looks pretty good at first glance . But once you
graffiti there the freedom to live in this country and honour
counterpart , but we can live without that . Perhaps a
Amar illo hops , it smells powerfully of passion fruit ,
Root FS , it will look for the init binary in
of a food hall that ranks among what you find in
that protect the privacy and civil rights of all Virginia residents
. For the last 15 days , boats have
said , and giving the civil - rights office more
the EPA s civil - rights office will boost
complaint with the Colorado Civil Rights Division , which spent a

MIN ACTIVATIONS
MIN = -18.022

Donna brought in her four cub s . Shortly , the
D awa along with her cub s is one of the
both of their seasons on the line . Then it went
Trump knew Stone talked with WikiLeaks about DNC emails Trump urges
Faith 's two male cub s - 8 / 10
of their own . His overhaul saw the dismissal of veterans
? and how does it look from the
guns linger Philip Rivers on the other side Sunday , Ramsey
True . Obama on a chain , C iti
s challenges are on their door steps ref
or so . How would you view the Seahawks
🙂 On what made him so successful ,
s challenges are on their door steps ref uge
1987 album Blessing was on constant rotation and received its fair
- named , who are on Facebook , did not respond
this cryptic message on her Instagram following the incident
Maria Santana will also be on the scene later Monday .
News plans to air On the Record With Van S ust
black bear and her two cub s about 15 yards in
the Packers . How do you view the Cowboys

INTERVAL 8.776 - 17.708
CONTAINS 0.742%

believe the citizens of F ayette ville would want males with
mark Time ( ns ) CPU ( ns ) Mem c
the following description of their competing technology : Distribut ing applications
desert . Obama , by contrast , could only wish for
arrested within the parameters of law ." Sources in the Delhi

INTERVAL -0.157 - 8.776
CONTAINS 52.316%

tea at night . Paul Sol man : Dec
now on display at the National Museum of the United States
and we 're an overall community now ," Cue ll ar
. Though they are incapable of real ising it ; this
. Todd C . Gilbert ( R - S hen ando

INTERVAL -9.089 - -0.157
CONTAINS 46.190%

Jersey , said on Thursday that the Food and Drug Administration
not the Syrian government , that used Sar in nerve gas
he was a big part of their hockey team . It
companies that operate Medicare Advantage plans , those privately run alternatives
, Make 6 figures in 6 months .

INTERVAL -18.022 - -9.089
CONTAINS 0.752%

O & A - Op Sick - Ant s Dad A
complete mammoth arrived in Britain on Monday to go on public
back there and be on the other side of the rink
torn down during the on - site filming of the 1979
tto & George - 8 Track Porn Pt 4 - Ott
Token-
Token ID12
Feature activation+6.18e+00

Pos logit contributions
rity+5.944
pun+5.458
 upscale+5.092
endment+5.011
geist+4.962
Neg logit contributions
ilers-9.040
erous-8.703
itored-8.641
 Consent-8.024
ilitating-7.992
Tokenparty
Token ID10608
Feature activation+3.17e+00

Pos logit contributions
rity+2.544
pun+2.201
endment+1.872
geist+1.810
 upscale+1.778
Neg logit contributions
ilers-7.256
erous-7.177
itored-7.063
 Consent-6.737
 spur-6.678
Token package
Token ID5301
Feature activation+2.62e+00

Pos logit contributions
rity+2.430
pun+2.276
 upscale+2.094
endment+2.076
geist+2.074
Neg logit contributions
ilers-2.624
erous-2.594
itored-2.465
 Consent-2.295
 spur-2.281
Token format
Token ID5794
Feature activation+9.68e-01

Pos logit contributions
rity+2.059
pun+1.922
endment+1.767
 upscale+1.760
geist+1.759
Neg logit contributions
ilers-2.134
erous-2.124
itored-1.995
 Consent-1.883
 spur-1.871
Token would
Token ID561
Feature activation+7.55e+00

Pos logit contributions
rity+0.758
pun+0.704
endment+0.650
geist+0.643
 upscale+0.641
Neg logit contributions
ilers-0.797
erous-0.794
itored-0.743
 spur-0.702
 Consent-0.701
Token compare
Token ID8996
Feature activation+1.77e+01

Pos logit contributions
rity+5.183
pun+4.862
 upscale+4.735
ior+4.494
geist+4.477
Neg logit contributions
erous-6.392
ilers-6.347
itored-6.182
 Consent-5.779
zee-5.665
Token.
Token ID13
Feature activation+2.49e+00

Pos logit contributions
rity+9.904
pun+9.819
 upscale+9.109
ior+9.076
geist+8.958
Neg logit contributions
ilers-14.296
erous-14.151
itored-13.377
 Consent-13.268
zee-12.749
Token I
Token ID314
Feature activation-1.88e+00

Pos logit contributions
rity+2.083
pun+1.958
endment+1.807
geist+1.800
 upscale+1.793
Neg logit contributions
ilers-1.906
erous-1.896
itored-1.766
 spur-1.657
 Consent-1.641
Token could
Token ID714
Feature activation+4.88e+00

Pos logit contributions
ilers+1.447
erous+1.437
itored+1.342
 spur+1.270
 Consent+1.259
Neg logit contributions
rity-1.567
pun-1.460
endment-1.370
geist-1.355
 upscale-1.333
Token not
Token ID407
Feature activation+5.38e+00

Pos logit contributions
rity+3.519
pun+3.287
 upscale+3.042
ior+2.957
endment+2.923
Neg logit contributions
erous-4.233
ilers-4.201
itored-3.993
 Consent-3.750
ilitating-3.695
Token find
Token ID1064
Feature activation+2.74e+00

Pos logit contributions
rity+3.940
pun+3.703
 upscale+3.509
ior+3.356
geist+3.295
Neg logit contributions
erous-4.505
ilers-4.497
itored-4.281
 Consent-3.989
zee-3.961
Token,
Token ID11
Feature activation+1.33e+00

Pos logit contributions
rity+1.232
pun+1.150
endment+1.063
geist+1.054
 upscale+1.054
Neg logit contributions
ilers-1.210
erous-1.205
itored-1.126
 Consent-1.061
 spur-1.059
Token but
Token ID475
Feature activation+6.40e+00

Pos logit contributions
rity+1.131
pun+1.059
endment+0.983
geist+0.975
 upscale+0.974
Neg logit contributions
ilers-1.000
erous-0.996
itored-0.927
 Consent-0.869
 spur-0.869
Token in
Token ID287
Feature activation+4.51e+00

Pos logit contributions
rity+4.721
pun+4.485
 upscale+4.273
geist+4.122
ior+4.107
Neg logit contributions
ilers-5.308
erous-5.240
itored-4.995
 Consent-4.645
zee-4.547
Token practice
Token ID3357
Feature activation+7.06e+00

Pos logit contributions
rity+3.503
pun+3.262
 upscale+3.059
endment+2.989
geist+2.981
Neg logit contributions
ilers-3.677
erous-3.626
itored-3.439
 Consent-3.209
zee-3.128
Token cannot
Token ID2314
Feature activation+5.51e+00

Pos logit contributions
rity+5.219
pun+4.902
 upscale+4.660
ior+4.520
geist+4.503
Neg logit contributions
ilers-5.833
erous-5.736
itored-5.503
 Consent-5.118
ilitating-4.997
Token compete
Token ID9320
Feature activation+1.70e+01

Pos logit contributions
rity+4.041
pun+3.825
 upscale+3.717
ior+3.556
geist+3.461
Neg logit contributions
erous-4.585
ilers-4.544
itored-4.335
 Consent-4.029
zee-3.964
Token with
Token ID351
Feature activation+1.03e+01

Pos logit contributions
pun+9.559
rity+9.492
 upscale+8.983
ior+8.795
against+8.542
Neg logit contributions
ilers-14.160
erous-13.602
itored-13.414
 Consent-13.023
zee-12.360
Token the
Token ID262
Feature activation+8.26e+00

Pos logit contributions
rity+7.295
pun+6.922
 upscale+6.868
ior+6.544
geist+6.322
Neg logit contributions
ilers-8.252
erous-8.050
itored-7.848
 Consent-7.256
zee-7.236
Token speed
Token ID2866
Feature activation+2.42e+00

Pos logit contributions
rity+6.174
pun+5.929
 upscale+5.772
ior+5.527
geist+5.457
Neg logit contributions
ilers-6.494
erous-6.441
itored-6.181
 Consent-5.659
ilitating-5.625
Token of
Token ID286
Feature activation+9.22e+00

Pos logit contributions
rity+1.982
pun+1.862
 upscale+1.716
endment+1.711
geist+1.708
Neg logit contributions
ilers-1.880
erous-1.875
itored-1.759
 Consent-1.652
 spur-1.639
Token the
Token ID262
Feature activation+8.31e+00

Pos logit contributions
rity+6.789
pun+6.419
 upscale+6.291
ior+6.013
geist+5.934
Neg logit contributions
ilers-7.262
erous-7.171
itored-6.936
 Consent-6.404
zee-6.339
Token he
Token ID339
Feature activation+8.94e-01

Pos logit contributions
rity+3.281
pun+3.081
 upscale+2.864
endment+2.848
geist+2.819
Neg logit contributions
ilers-2.935
erous-2.898
itored-2.704
 Consent-2.518
 spur-2.492
Token could
Token ID714
Feature activation+4.16e+00

Pos logit contributions
rity+0.736
pun+0.687
endment+0.636
 upscale+0.630
geist+0.630
Neg logit contributions
ilers-0.700
erous-0.697
itored-0.651
 Consent-0.612
 spur-0.610
Token help
Token ID1037
Feature activation+1.13e-01

Pos logit contributions
rity+3.398
pun+3.211
 upscale+3.023
ior+2.947
geist+2.919
Neg logit contributions
erous-3.173
ilers-3.166
itored-2.987
 Consent-2.788
zee-2.702
Token pick
Token ID2298
Feature activation+2.63e+00

Pos logit contributions
rity+0.085
pun+0.078
endment+0.072
geist+0.071
 upscale+0.071
Neg logit contributions
ilers-0.096
erous-0.096
itored-0.090
 spur-0.086
 Consent-0.085
Token up
Token ID510
Feature activation+6.24e+00

Pos logit contributions
rity+2.095
pun+1.953
endment+1.802
 upscale+1.793
geist+1.778
Neg logit contributions
ilers-2.134
erous-2.112
itored-1.981
 Consent-1.865
 spur-1.858
Token votes
Token ID5690
Feature activation+1.65e+01

Pos logit contributions
rity+4.714
pun+4.416
 upscale+4.158
endment+4.019
ior+3.962
Neg logit contributions
ilers-5.169
erous-5.062
itored-4.806
zee-4.466
 Consent-4.455
Token with
Token ID351
Feature activation+6.34e+00

Pos logit contributions
pun+10.000
rity+9.965
 upscale+9.184
ior+9.151
against+8.869
Neg logit contributions
ilers-13.745
erous-13.240
itored-12.463
 Consent-11.791
zee-11.772
Token some
Token ID617
Feature activation+5.99e+00

Pos logit contributions
rity+4.956
pun+4.622
 upscale+4.360
endment+4.231
ior+4.194
Neg logit contributions
ilers-5.107
erous-4.919
itored-4.732
 Consent-4.417
zee-4.352
Token in
Token ID287
Feature activation+3.39e+00

Pos logit contributions
rity+4.526
pun+4.195
 upscale+3.961
endment+3.837
geist+3.778
Neg logit contributions
ilers-5.048
erous-4.921
itored-4.687
 Consent-4.400
zee-4.340
Token the
Token ID262
Feature activation+3.47e+00

Pos logit contributions
rity+2.908
pun+2.710
 upscale+2.525
endment+2.520
ior+2.477
Neg logit contributions
ilers-2.521
erous-2.486
itored-2.319
 Consent-2.203
 spur-2.165
Token less
Token ID1342
Feature activation+2.13e+00

Pos logit contributions
rity+2.541
pun+2.345
endment+2.148
 upscale+2.146
geist+2.119
Neg logit contributions
ilers-3.025
erous-3.011
itored-2.833
 Consent-2.679
 spur-2.672
TokenThe
Token ID464
Feature activation+2.62e+00

Pos logit contributions
rity+0.558
pun+0.516
endment+0.474
 upscale+0.469
geist+0.469
Neg logit contributions
ilers-0.649
erous-0.647
itored-0.608
 Consent-0.576
 spur-0.575
Token EPA
Token ID10193
Feature activation+3.62e+00

Pos logit contributions
rity+2.133
pun+1.984
endment+1.848
 upscale+1.827
geist+1.815
Neg logit contributions
ilers-2.078
erous-2.056
itored-1.933
 Consent-1.797
 spur-1.795
Token
Token ID447
Feature activation-3.94e-01

Pos logit contributions
rity+2.962
pun+2.779
endment+2.580
geist+2.546
ior+2.542
Neg logit contributions
ilers-2.826
erous-2.796
itored-2.621
 Consent-2.432
 spur-2.425
Token
Token ID247
Feature activation+2.06e+00

Pos logit contributions
ilers+0.356
erous+0.354
itored+0.336
 Consent+0.320
 spur+0.318
Neg logit contributions
rity-0.272
pun-0.253
geist-0.229
 upscale-0.229
endment-0.229
Tokens
Token ID82
Feature activation+4.65e+00

Pos logit contributions
rity+2.156
pun+2.041
endment+1.928
geist+1.909
ior+1.906
Neg logit contributions
ilers-1.156
erous-1.145
itored-1.045
 spur-0.954
 Consent-0.949
Token civil
Token ID3026
Feature activation+1.64e+01

Pos logit contributions
rity+3.732
pun+3.450
endment+3.224
 upscale+3.183
ior+3.181
Neg logit contributions
ilers-3.687
erous-3.642
itored-3.440
 spur-3.159
 Consent-3.146
Token-
Token ID12
Feature activation+1.21e+01

Pos logit contributions
rity+8.199
pun+7.559
ior+6.784
endment+6.623
 upscale+6.543
Neg logit contributions
ilers-15.385
erous-14.769
itored-14.750
ilitating-13.719
 Consent-13.672
Tokenrights
Token ID28046
Feature activation+1.05e+01

Pos logit contributions
rity+5.551
pun+5.097
ior+4.327
against+4.084
endment+4.076
Neg logit contributions
erous-12.979
ilers-12.943
itored-12.751
 spur-12.258
 Consent-12.157
Token office
Token ID2607
Feature activation+2.78e+00

Pos logit contributions
rity+6.892
pun+6.126
endment+5.808
ior+5.680
 upscale+5.637
Neg logit contributions
ilers-9.175
erous-8.965
itored-8.719
 Consent-8.069
zee-8.017
Token has
Token ID468
Feature activation+2.61e+00

Pos logit contributions
rity+2.267
pun+2.122
endment+1.957
 upscale+1.943
geist+1.942
Neg logit contributions
ilers-2.186
erous-2.165
itored-2.032
 Consent-1.900
 spur-1.885
Token long
Token ID890
Feature activation+9.65e-01

Pos logit contributions
rity+2.136
pun+1.999
endment+1.842
 upscale+1.835
geist+1.827
Neg logit contributions
ilers-2.049
erous-2.030
itored-1.903
 Consent-1.782
 spur-1.765
Token to
Token ID284
Feature activation+1.13e+00

Pos logit contributions
rity+2.346
pun+2.211
endment+2.057
 upscale+2.035
ior+2.029
Neg logit contributions
ilers-2.135
erous-2.124
itored-1.976
 Consent-1.844
 spur-1.832
Token promote
Token ID7719
Feature activation+3.37e+00

Pos logit contributions
rity+0.980
pun+0.921
endment+0.854
 upscale+0.851
geist+0.845
Neg logit contributions
ilers-0.828
erous-0.824
itored-0.767
 Consent-0.716
 spur-0.710
Token the
Token ID262
Feature activation+2.87e+00

Pos logit contributions
rity+2.674
pun+2.460
 upscale+2.277
endment+2.268
geist+2.246
Neg logit contributions
ilers-2.745
erous-2.692
itored-2.580
 Consent-2.381
 spur-2.362
Token accuracy
Token ID9922
Feature activation+6.28e+00

Pos logit contributions
rity+2.240
pun+2.066
endment+1.910
 upscale+1.898
geist+1.877
Neg logit contributions
ilers-2.375
erous-2.352
itored-2.227
 Consent-2.072
 spur-2.071
Token,
Token ID11
Feature activation+6.07e+00

Pos logit contributions
rity+4.760
pun+4.359
endment+4.047
 upscale+3.932
ior+3.910
Neg logit contributions
ilers-5.229
erous-5.167
itored-4.844
 Consent-4.639
 spur-4.584
Token fairness
Token ID22692
Feature activation+1.60e+01

Pos logit contributions
rity+4.310
pun+3.915
endment+3.581
 upscale+3.561
geist+3.511
Neg logit contributions
ilers-5.415
erous-5.328
itored-5.122
 Consent-4.759
zee-4.758
Token,
Token ID11
Feature activation+7.34e+00

Pos logit contributions
rity+9.297
pun+8.236
endment+7.342
ior+7.050
 upscale+7.027
Neg logit contributions
ilers-14.230
erous-13.649
itored-13.220
 Consent-12.997
zee-12.748
Token and
Token ID290
Feature activation+9.18e+00

Pos logit contributions
rity+5.549
pun+5.077
endment+4.647
 upscale+4.637
geist+4.554
Neg logit contributions
ilers-6.105
erous-5.985
itored-5.824
 Consent-5.354
zee-5.317
Token privacy
Token ID6782
Feature activation+1.15e+01

Pos logit contributions
rity+6.709
pun+6.010
endment+5.577
 upscale+5.539
geist+5.423
Neg logit contributions
ilers-7.681
erous-7.487
itored-7.436
zee-6.830
 Consent-6.753
Token of
Token ID286
Feature activation+8.53e+00

Pos logit contributions
rity+7.674
pun+6.947
endment+6.505
ior+6.236
geist+6.138
Neg logit contributions
ilers-10.109
erous-9.844
itored-9.240
 Consent-8.819
zee-8.763
Token consumer
Token ID7172
Feature activation+5.25e+00

Pos logit contributions
rity+6.433
pun+5.818
 upscale+5.605
endment+5.372
ior+5.296
Neg logit contributions
ilers-6.885
erous-6.607
itored-6.565
zee-5.941
esses-5.895
Token c
Token ID269
Feature activation+8.12e+00

Pos logit contributions
rity+3.534
pun+3.268
 upscale+3.090
ior+2.927
geist+2.900
Neg logit contributions
ilers-5.137
erous-5.123
itored-4.835
ilitating-4.503
zee-4.480
Tokenv
Token ID85
Feature activation+2.13e+00

Pos logit contributions
rity+3.951
pun+3.748
ior+3.184
endment+3.120
geist+3.051
Neg logit contributions
erous-8.620
ilers-8.607
itored-8.237
 spur-8.034
 Consent-7.906
Token2
Token ID17
Feature activation+1.21e+01

Pos logit contributions
rity+2.132
pun+2.033
endment+1.900
ior+1.893
 upscale+1.891
Neg logit contributions
ilers-1.273
erous-1.261
itored-1.157
 spur-1.071
 Consent-1.055
Token.
Token ID764
Feature activation+7.16e+00

Pos logit contributions
rity+4.682
pun+4.399
 upscale+3.803
ior+3.795
geist+3.737
Neg logit contributions
erous-13.026
ilers-12.852
itored-12.203
ilitating-12.127
 spur-11.902
Token COL
Token ID20444
Feature activation+4.68e+00

Pos logit contributions
rity+5.027
pun+4.717
 upscale+4.513
ior+4.368
geist+4.320
Neg logit contributions
erous-6.101
ilers-6.083
itored-5.722
zee-5.332
 spur-5.315
TokenOR
Token ID1581
Feature activation+1.60e+01

Pos logit contributions
rity+3.812
pun+3.571
 upscale+3.331
ior+3.325
POL+3.294
Neg logit contributions
ilers-3.632
erous-3.600
itored-3.313
 spur-3.133
 Consent-3.117
Token_
Token ID62
Feature activation+1.03e+01

Pos logit contributions
rity+4.726
pun+4.573
POL+4.388
CVE+3.920
 upscale+3.909
Neg logit contributions
erous-17.796
ilers-17.768
ilitating-16.630
 spur-16.483
itored-16.474
TokenB
Token ID33
Feature activation+7.32e+00

Pos logit contributions
rity+6.294
pun+6.244
POL+6.072
CVE+5.718
ior+5.617
Neg logit contributions
ilers-9.364
erous-9.112
itored-8.606
esses-8.264
ilitating-8.209
TokenGR
Token ID10761
Feature activation-3.77e-01

Pos logit contributions
rity+3.958
pun+3.581
ior+3.266
POL+3.241
geist+3.208
Neg logit contributions
ilers-7.510
erous-7.476
itored-7.161
 Consent-6.816
 spur-6.799
Token2
Token ID17
Feature activation+4.36e+00

Pos logit contributions
ilers+0.208
erous+0.207
itored+0.188
 Consent+0.172
 spur+0.171
Neg logit contributions
rity-0.397
pun-0.376
endment-0.355
geist-0.352
 upscale-0.351
TokenG
Token ID38
Feature activation-1.52e+00

Pos logit contributions
rity+3.544
pun+3.343
geist+3.094
 upscale+3.093
endment+3.055
Neg logit contributions
erous-3.391
ilers-3.391
itored-3.159
 spur-2.946
 Consent-2.935
Token,
Token ID11
Feature activation+5.28e+00

Pos logit contributions
rity+5.713
pun+5.321
endment+4.877
 upscale+4.852
geist+4.848
Neg logit contributions
ilers-7.496
erous-7.439
itored-7.073
 Consent-6.686
 spur-6.630
Token O
Token ID440
Feature activation+2.06e+00

Pos logit contributions
rity+4.374
pun+4.121
 upscale+3.844
geist+3.794
endment+3.791
Neg logit contributions
ilers-3.986
erous-3.985
itored-3.718
 spur-3.406
 Consent-3.405
Token.
Token ID13
Feature activation+1.83e+00

Pos logit contributions
rity+1.537
pun+1.425
endment+1.302
ior+1.297
 upscale+1.293
Neg logit contributions
erous-1.755
ilers-1.744
itored-1.653
 Consent-1.558
 spur-1.555
TokenJ
Token ID41
Feature activation+6.09e-02

Pos logit contributions
rity+1.570
pun+1.483
endment+1.365
geist+1.365
 upscale+1.363
Neg logit contributions
ilers-1.357
erous-1.353
itored-1.255
 spur-1.179
 Consent-1.171
Token.
Token ID13
Feature activation-6.30e-01

Pos logit contributions
rity+0.047
pun+0.044
endment+0.040
geist+0.040
 upscale+0.040
Neg logit contributions
ilers-0.051
erous-0.051
itored-0.048
 Consent-0.045
 spur-0.045
Token stands
Token ID6296
Feature activation+1.59e+01

Pos logit contributions
ilers+0.488
erous+0.486
itored+0.453
 spur+0.429
 Consent+0.427
Neg logit contributions
rity-0.524
pun-0.488
endment-0.454
geist-0.449
 upscale-0.447
Token as
Token ID355
Feature activation+6.80e+00

Pos logit contributions
rity+9.852
pun+9.166
 upscale+9.156
ior+8.706
geist+8.424
Neg logit contributions
ilers-13.276
erous-12.943
itored-12.340
 Consent-11.779
ilitating-11.286
Token the
Token ID262
Feature activation+4.61e+00

Pos logit contributions
rity+5.357
pun+5.011
 upscale+4.698
endment+4.607
geist+4.603
Neg logit contributions
ilers-5.313
erous-5.194
itored-4.963
 Consent-4.558
ilitating-4.519
Token cab
Token ID16212
Feature activation-1.72e+00

Pos logit contributions
rity+3.815
pun+3.582
 upscale+3.347
geist+3.277
endment+3.273
Neg logit contributions
ilers-3.493
erous-3.477
itored-3.289
 Consent-3.033
ilitating-2.983
Tokenler
Token ID1754
Feature activation+1.16e+00

Pos logit contributions
ilers+1.140
erous+1.137
itored+1.038
 spur+0.972
 Consent+0.965
Neg logit contributions
rity-1.621
pun-1.521
endment-1.426
geist-1.425
 upscale-1.414
Token
Token ID447
Feature activation-2.07e-01

Pos logit contributions
rity+1.064
pun+1.004
endment+0.935
geist+0.933
 upscale+0.930
Neg logit contributions
ilers-0.792
erous-0.788
itored-0.730
 Consent-0.679
 spur-0.678
TokenG
Token ID38
Feature activation-1.93e+00

Pos logit contributions
rity+1.689
pun+1.591
geist+1.472
endment+1.468
 upscale+1.464
Neg logit contributions
erous-1.477
ilers-1.476
itored-1.374
 spur-1.287
 Consent-1.276
Token AU
Token ID27548
Feature activation+9.26e+00

Pos logit contributions
ilers+1.505
erous+1.497
itored+1.397
 Consent+1.321
 spur+1.317
Neg logit contributions
rity-1.604
pun-1.488
endment-1.385
geist-1.365
 upscale-1.355
Token Opt
Token ID13123
Feature activation-4.36e-01

Pos logit contributions
rity+4.438
pun+4.407
 upscale+4.093
ior+3.839
geist+3.780
Neg logit contributions
ilers-9.578
erous-9.530
itored-8.992
ilitating-8.774
esses-8.585
Tokenronics
Token ID20844
Feature activation+1.81e+00

Pos logit contributions
ilers+0.294
erous+0.290
itored+0.269
 Consent+0.248
 spur+0.247
Neg logit contributions
rity-0.406
pun-0.383
endment-0.360
geist-0.357
 upscale-0.355
Token AH
Token ID28159
Feature activation+9.34e+00

Pos logit contributions
rity+1.299
pun+1.206
 upscale+1.106
endment+1.101
geist+1.100
Neg logit contributions
ilers-1.589
erous-1.585
itored-1.491
 spur-1.412
 Consent-1.403
TokenVA
Token ID11731
Feature activation+1.56e+01

Pos logit contributions
rity+4.006
pun+3.674
CVE+3.278
ior+3.139
 upscale+3.067
Neg logit contributions
ilers-10.432
erous-10.403
itored-9.921
 spur-9.777
ilitating-9.508
Token (
Token ID357
Feature activation+3.13e+00

Pos logit contributions
rity+6.508
pun+6.410
 upscale+6.145
ior+5.936
geist+5.735
Neg logit contributions
ilers-15.371
erous-15.156
ilitating-14.446
itored-14.410
esses-13.570
TokenIPS
Token ID47643
Feature activation+6.28e+00

Pos logit contributions
rity+2.251
pun+2.112
endment+1.908
 upscale+1.898
ior+1.894
Neg logit contributions
erous-2.740
ilers-2.732
itored-2.572
 spur-2.451
 Consent-2.435
Token-
Token ID12
Feature activation+1.45e+00

Pos logit contributions
rity+4.028
pun+3.851
 upscale+3.496
ior+3.441
geist+3.420
Neg logit contributions
ilers-5.829
erous-5.695
itored-5.445
 spur-5.179
ilitating-5.134
Tokentype
Token ID4906
Feature activation-1.46e+00

Pos logit contributions
rity+1.004
pun+0.928
endment+0.843
geist+0.834
 upscale+0.833
Neg logit contributions
ilers-1.325
erous-1.320
itored-1.246
 spur-1.185
 Consent-1.184
Token)
Token ID8
Feature activation+4.60e+00

Pos logit contributions
ilers+0.864
erous+0.859
itored+0.784
 Consent+0.725
 spur+0.719
Neg logit contributions
rity-1.485
pun-1.401
endment-1.321
geist-1.307
 upscale-1.305
Token
Token ID198
Feature activation+3.57e+00

Pos logit contributions
rity+1.498
pun+1.413
 upscale+1.283
endment+1.281
geist+1.276
Neg logit contributions
ilers-1.564
erous-1.547
itored-1.454
 spur-1.371
 Consent-1.358
TokenWiki
Token ID32603
Feature activation-2.40e+00

Pos logit contributions
rity+2.593
pun+2.429
endment+2.189
ior+2.171
geist+2.169
Neg logit contributions
ilers-3.119
erous-3.086
itored-2.903
 spur-2.786
 Consent-2.759
TokenLeaks
Token ID17874
Feature activation-1.59e+00

Pos logit contributions
ilers+0.861
erous+0.823
itored+0.724
 Consent+0.614
 spur+0.604
Neg logit contributions
rity-2.995
pun-2.863
endment-2.738
 upscale-2.718
geist-2.702
Token pr
Token ID778
Feature activation+5.64e+00

Pos logit contributions
ilers+1.272
erous+1.262
itored+1.173
 spur+1.125
 Consent+1.116
Neg logit contributions
rity-1.279
pun-1.186
endment-1.113
geist-1.098
 upscale-1.087
Tokenides
Token ID1460
Feature activation+8.99e+00

Pos logit contributions
rity+3.316
pun+3.111
ior+2.675
endment+2.600
 upscale+2.570
Neg logit contributions
erous-5.598
ilers-5.597
itored-5.305
 Consent-5.192
 spur-5.127
Token itself
Token ID2346
Feature activation+1.54e+01

Pos logit contributions
rity+5.881
pun+5.189
 upscale+4.802
endment+4.739
geist+4.625
Neg logit contributions
ilers-7.839
erous-7.711
itored-7.556
 Consent-7.231
zee-7.148
Token as
Token ID355
Feature activation+9.45e+00

Pos logit contributions
rity+8.042
pun+7.143
 upscale+6.505
geist+6.236
endment+6.226
Neg logit contributions
erous-14.563
ilers-14.440
itored-13.698
 Consent-13.540
zee-12.955
Token a
Token ID257
Feature activation+3.65e+00

Pos logit contributions
rity+7.126
pun+6.574
 upscale+6.206
endment+6.061
geist+5.864
Neg logit contributions
ilers-7.581
erous-7.405
itored-7.116
 Consent-6.648
zee-6.539
Token website
Token ID3052
Feature activation+2.40e+00

Pos logit contributions
rity+3.068
pun+2.887
 upscale+2.678
endment+2.666
geist+2.627
Neg logit contributions
ilers-2.750
erous-2.734
itored-2.550
 Consent-2.406
 spur-2.347
Token devoted
Token ID13378
Feature activation+2.53e+00

Pos logit contributions
rity+1.896
pun+1.774
endment+1.629
 upscale+1.624
geist+1.616
Neg logit contributions
ilers-1.947
erous-1.939
itored-1.814
 Consent-1.716
 spur-1.699
Token to
Token ID284
Feature activation+2.62e+00

Pos logit contributions
rity+2.230
pun+2.096
endment+1.960
 upscale+1.956
geist+1.934
Neg logit contributions
erous-1.806
ilers-1.788
itored-1.653
 Consent-1.582
 spur-1.539
Token Pages
Token ID28221
Feature activation+4.10e+00

Pos logit contributions
rity+0.670
pun+0.626
endment+0.580
geist+0.577
 upscale+0.576
Neg logit contributions
ilers-0.617
erous-0.614
itored-0.574
 Consent-0.538
 spur-0.537
Token looks
Token ID3073
Feature activation+1.19e+01

Pos logit contributions
rity+3.237
pun+3.066
 upscale+2.842
geist+2.841
endment+2.807
Neg logit contributions
erous-3.240
ilers-3.229
itored-3.087
 Consent-2.851
 spur-2.797
Token pretty
Token ID2495
Feature activation+8.00e+00

Pos logit contributions
pun+7.222
rity+7.167
 upscale+7.107
ior+6.660
geist+6.503
Neg logit contributions
ilers-10.363
itored-10.015
erous-9.890
 Consent-9.143
ilitating-8.853
Token good
Token ID922
Feature activation+1.09e+01

Pos logit contributions
rity+5.284
pun+5.049
 upscale+4.915
geist+4.691
ior+4.643
Neg logit contributions
ilers-7.030
erous-6.849
itored-6.750
 Consent-6.196
ilitating-6.057
Token at
Token ID379
Feature activation+6.25e+00

Pos logit contributions
rity+7.014
pun+6.825
ior+6.330
geist+6.246
 upscale+6.200
Neg logit contributions
ilers-9.360
erous-9.327
itored-8.903
 Consent-8.520
zee-8.165
Token first
Token ID717
Feature activation+1.52e+01

Pos logit contributions
rity+5.279
pun+5.008
 upscale+4.684
ior+4.648
geist+4.630
Neg logit contributions
erous-4.532
ilers-4.526
itored-4.337
 Consent-3.952
zee-3.871
Token glance
Token ID16086
Feature activation+1.09e+01

Pos logit contributions
rity+8.185
pun+7.320
 upscale+7.117
endment+6.863
ior+6.780
Neg logit contributions
ilers-14.103
erous-13.567
itored-13.487
 Consent-12.694
zee-12.578
Token.
Token ID13
Feature activation+7.98e+00

Pos logit contributions
rity+6.788
pun+6.520
geist+6.061
endment+6.001
 upscale+5.942
Neg logit contributions
ilers-9.568
erous-9.521
itored-9.111
 Consent-8.652
zee-8.441
Token But
Token ID887
Feature activation+4.37e+00

Pos logit contributions
rity+6.398
pun+6.145
geist+5.754
 upscale+5.643
ior+5.596
Neg logit contributions
ilers-6.028
erous-5.892
itored-5.588
ilitating-5.207
 spur-5.184
Token once
Token ID1752
Feature activation+4.48e+00

Pos logit contributions
rity+3.437
pun+3.236
 upscale+3.006
geist+2.965
endment+2.961
Neg logit contributions
ilers-3.484
erous-3.479
itored-3.296
 Consent-3.070
 spur-3.026
Token you
Token ID345
Feature activation+3.09e+00

Pos logit contributions
rity+3.639
pun+3.396
 upscale+3.152
endment+3.133
geist+3.127
Neg logit contributions
ilers-3.493
erous-3.447
itored-3.250
 Consent-3.056
 spur-3.032
Token graffiti
Token ID28530
Feature activation+3.07e+00

Pos logit contributions
rity+3.725
pun+3.487
 upscale+3.181
endment+3.175
geist+3.170
Neg logit contributions
ilers-4.144
erous-4.058
itored-3.864
 Consent-3.608
 spur-3.541
Token there
Token ID612
Feature activation+3.54e+00

Pos logit contributions
rity+2.214
pun+2.050
endment+1.873
geist+1.857
 upscale+1.854
Neg logit contributions
ilers-2.716
erous-2.705
itored-2.547
 Consent-2.412
 spur-2.404
Token the
Token ID262
Feature activation+5.49e+00

Pos logit contributions
rity+2.879
pun+2.698
geist+2.474
endment+2.466
 upscale+2.440
Neg logit contributions
ilers-2.786
erous-2.762
itored-2.598
 Consent-2.410
 spur-2.403
Token freedom
Token ID4925
Feature activation+3.10e+00

Pos logit contributions
rity+4.532
pun+4.251
 upscale+3.916
geist+3.856
endment+3.847
Neg logit contributions
ilers-4.194
erous-4.131
itored-3.923
 Consent-3.557
esses-3.527
Token to
Token ID284
Feature activation+6.53e+00

Pos logit contributions
rity+2.676
pun+2.521
endment+2.352
geist+2.301
 upscale+2.301
Neg logit contributions
ilers-2.258
erous-2.246
itored-2.102
 Consent-1.960
 spur-1.936
Token live
Token ID2107
Feature activation+1.52e+01

Pos logit contributions
rity+4.788
pun+4.538
 upscale+4.207
ior+4.102
endment+4.098
Neg logit contributions
erous-5.433
ilers-5.320
itored-5.175
 Consent-4.749
esses-4.704
Token in
Token ID287
Feature activation+5.93e+00

Pos logit contributions
rity+9.940
pun+9.533
 upscale+8.927
deen+8.662
geist+8.615
Neg logit contributions
ilers-11.831
erous-11.647
itored-11.612
esses-10.294
 Consent-10.223
Token this
Token ID428
Feature activation+8.63e+00

Pos logit contributions
rity+4.471
pun+4.101
 upscale+3.780
endment+3.755
ior+3.694
Neg logit contributions
ilers-4.992
erous-4.886
itored-4.679
 Consent-4.314
 spur-4.284
Token country
Token ID1499
Feature activation-4.12e-01

Pos logit contributions
rity+6.780
pun+6.436
 upscale+6.002
geist+5.922
endment+5.858
Neg logit contributions
ilers-6.725
erous-6.292
itored-6.177
 Consent-5.636
ilitating-5.579
Token and
Token ID290
Feature activation+4.68e+00

Pos logit contributions
ilers+0.326
erous+0.325
itored+0.303
 spur+0.286
 Consent+0.286
Neg logit contributions
rity-0.336
pun-0.313
endment-0.290
 upscale-0.287
geist-0.287
Token honour
Token ID15393
Feature activation+6.50e+00

Pos logit contributions
rity+3.750
pun+3.543
 upscale+3.288
endment+3.266
ior+3.242
Neg logit contributions
erous-3.639
ilers-3.592
itored-3.436
 Consent-3.169
ilitating-3.099
Token counterpart
Token ID11283
Feature activation+3.44e+00

Pos logit contributions
erous+2.574
ilers+2.560
itored+2.265
 spur+2.173
 Consent+2.117
Neg logit contributions
rity-4.175
pun-3.917
endment-3.669
geist-3.646
ior-3.597
Token,
Token ID11
Feature activation+3.71e+00

Pos logit contributions
rity+2.646
pun+2.474
endment+2.275
geist+2.262
 upscale+2.259
Neg logit contributions
ilers-2.834
erous-2.811
itored-2.662
 Consent-2.493
 spur-2.481
Token but
Token ID475
Feature activation+6.13e+00

Pos logit contributions
rity+3.008
pun+2.847
 upscale+2.602
endment+2.595
geist+2.593
Neg logit contributions
ilers-2.904
erous-2.885
itored-2.707
 Consent-2.529
 spur-2.500
Token we
Token ID356
Feature activation+1.04e+00

Pos logit contributions
rity+4.778
pun+4.511
 upscale+4.150
geist+4.136
ior+4.088
Neg logit contributions
ilers-4.912
erous-4.829
itored-4.606
 Consent-4.256
 spur-4.235
Token can
Token ID460
Feature activation+3.61e+00

Pos logit contributions
rity+0.841
pun+0.784
endment+0.726
geist+0.719
 upscale+0.718
Neg logit contributions
ilers-0.823
erous-0.819
itored-0.765
 spur-0.721
 Consent-0.721
Token live
Token ID2107
Feature activation+1.51e+01

Pos logit contributions
rity+2.893
pun+2.717
endment+2.484
 upscale+2.482
geist+2.478
Neg logit contributions
erous-2.880
ilers-2.871
itored-2.688
 Consent-2.529
 spur-2.508
Token without
Token ID1231
Feature activation+5.46e+00

Pos logit contributions
rity+9.016
pun+8.904
ior+7.818
 upscale+7.787
endment+7.509
Neg logit contributions
erous-13.051
ilers-12.943
itored-12.303
 Consent-11.583
ilitating-11.437
Token that
Token ID326
Feature activation+2.90e+00

Pos logit contributions
rity+4.325
pun+4.039
 upscale+3.753
endment+3.698
ior+3.641
Neg logit contributions
ilers-4.377
erous-4.285
itored-4.104
 Consent-3.799
 spur-3.764
Token.
Token ID13
Feature activation+4.88e+00

Pos logit contributions
rity+2.322
pun+2.182
 upscale+2.006
endment+2.003
geist+1.990
Neg logit contributions
ilers-2.318
erous-2.284
itored-2.159
 Consent-2.022
 spur-2.011
Token Perhaps
Token ID8673
Feature activation+1.56e+00

Pos logit contributions
rity+4.036
pun+3.817
geist+3.507
endment+3.494
 upscale+3.470
Neg logit contributions
ilers-3.750
erous-3.693
itored-3.468
 spur-3.255
ilitating-3.176
Token a
Token ID257
Feature activation+2.30e+00

Pos logit contributions
rity+1.302
pun+1.217
endment+1.129
geist+1.118
 upscale+1.118
Neg logit contributions
ilers-1.203
erous-1.197
itored-1.117
 Consent-1.048
 spur-1.048
Token Amar
Token ID44291
Feature activation+3.06e+00

Pos logit contributions
rity+3.068
pun+2.839
 upscale+2.661
geist+2.642
endment+2.637
Neg logit contributions
ilers-2.908
erous-2.877
itored-2.693
 spur-2.471
 Consent-2.470
Tokenillo
Token ID16111
Feature activation-8.69e-01

Pos logit contributions
rity+1.752
pun+1.581
endment+1.420
ior+1.397
geist+1.395
Neg logit contributions
ilers-3.127
erous-3.127
itored-2.977
 spur-2.855
 Consent-2.842
Token hops
Token ID29438
Feature activation-1.63e+00

Pos logit contributions
ilers+0.668
erous+0.665
itored+0.619
 spur+0.587
 Consent+0.585
Neg logit contributions
rity-0.725
pun-0.678
endment-0.630
geist-0.621
 upscale-0.620
Token,
Token ID11
Feature activation+3.51e+00

Pos logit contributions
ilers+1.282
erous+1.275
itored+1.192
 spur+1.132
 Consent+1.122
Neg logit contributions
rity-1.327
pun-1.233
endment-1.151
 upscale-1.130
geist-1.130
Token it
Token ID340
Feature activation+4.58e+00

Pos logit contributions
rity+2.767
pun+2.567
endment+2.372
 upscale+2.368
geist+2.359
Neg logit contributions
ilers-2.860
erous-2.847
itored-2.662
 spur-2.498
 Consent-2.497
Token smells
Token ID25760
Feature activation+1.50e+01

Pos logit contributions
rity+3.919
pun+3.713
 upscale+3.496
geist+3.481
ior+3.412
Neg logit contributions
ilers-3.346
erous-3.300
itored-3.128
 Consent-2.900
ilitating-2.795
Token powerfully
Token ID48745
Feature activation+1.11e+01

Pos logit contributions
rity+9.077
pun+8.990
 upscale+8.884
geist+7.901
ior+7.790
Neg logit contributions
ilers-12.893
erous-12.282
itored-12.177
 Consent-11.044
zee-10.908
Token of
Token ID286
Feature activation+7.83e+00

Pos logit contributions
rity+8.049
pun+7.640
 upscale+7.338
geist+6.955
ior+6.936
Neg logit contributions
ilers-9.305
erous-8.979
itored-8.447
 Consent-7.786
ilitating-7.565
Token passion
Token ID7506
Feature activation+1.29e+00

Pos logit contributions
rity+6.126
pun+5.668
 upscale+5.422
geist+5.235
ior+5.184
Neg logit contributions
ilers-6.241
erous-6.147
itored-5.807
 Consent-5.288
ilitating-5.276
Token fruit
Token ID8234
Feature activation+5.62e+00

Pos logit contributions
rity+1.122
pun+1.051
endment+0.977
geist+0.969
 upscale+0.968
Neg logit contributions
ilers-0.953
erous-0.948
itored-0.882
 Consent-0.826
 spur-0.825
Token,
Token ID11
Feature activation+6.13e+00

Pos logit contributions
rity+4.317
pun+4.009
geist+3.673
 upscale+3.644
endment+3.619
Neg logit contributions
ilers-4.676
erous-4.636
itored-4.386
 Consent-4.141
 spur-3.996
Token Root
Token ID20410
Feature activation-3.72e-01

Pos logit contributions
rity+0.515
pun+0.481
endment+0.447
geist+0.440
 upscale+0.440
Neg logit contributions
ilers-0.495
erous-0.493
itored-0.459
 Consent-0.434
 spur-0.433
Token FS
Token ID23324
Feature activation+2.27e+00

Pos logit contributions
erous+0.300
ilers+0.300
itored+0.281
 Consent+0.266
 spur+0.264
Neg logit contributions
rity-0.296
pun-0.275
endment-0.255
geist-0.253
 upscale-0.252
Token,
Token ID11
Feature activation+2.56e+00

Pos logit contributions
rity+1.836
pun+1.710
endment+1.579
geist+1.569
 upscale+1.568
Neg logit contributions
ilers-1.805
erous-1.797
itored-1.679
 spur-1.579
 Consent-1.579
Token it
Token ID340
Feature activation+2.62e+00

Pos logit contributions
rity+2.157
pun+2.012
endment+1.862
 upscale+1.857
geist+1.853
Neg logit contributions
ilers-1.947
erous-1.938
itored-1.807
 Consent-1.693
 spur-1.693
Token will
Token ID481
Feature activation+3.83e+00

Pos logit contributions
rity+2.284
pun+2.139
 upscale+1.991
geist+1.982
endment+1.967
Neg logit contributions
ilers-1.914
erous-1.899
itored-1.781
 Consent-1.663
 spur-1.638
Token look
Token ID804
Feature activation+1.50e+01

Pos logit contributions
rity+3.042
pun+2.847
 upscale+2.666
geist+2.610
ior+2.586
Neg logit contributions
erous-3.045
ilers-3.041
itored-2.856
 Consent-2.678
 spur-2.622
Token for
Token ID329
Feature activation+6.51e+00

Pos logit contributions
rity+9.120
pun+8.967
 upscale+8.649
geist+8.361
ior+8.238
Neg logit contributions
ilers-12.343
erous-12.132
itored-11.667
 Consent-10.862
ilitating-10.798
Token the
Token ID262
Feature activation+4.95e+00

Pos logit contributions
rity+5.130
pun+4.757
 upscale+4.406
geist+4.354
ior+4.302
Neg logit contributions
ilers-5.186
erous-5.131
itored-4.836
 Consent-4.484
 spur-4.477
Token init
Token ID2315
Feature activation-8.05e-01

Pos logit contributions
rity+3.992
pun+3.732
 upscale+3.465
geist+3.423
ior+3.410
Neg logit contributions
ilers-3.889
erous-3.853
itored-3.596
 spur-3.332
 Consent-3.326
Token binary
Token ID13934
Feature activation-3.04e-01

Pos logit contributions
ilers+0.613
erous+0.613
itored+0.568
 spur+0.537
 Consent+0.536
Neg logit contributions
rity-0.678
pun-0.632
endment-0.590
geist-0.582
 upscale-0.580
Token in
Token ID287
Feature activation+5.60e-01

Pos logit contributions
ilers+0.239
erous+0.239
itored+0.223
 spur+0.211
 Consent+0.210
Neg logit contributions
rity-0.248
pun-0.231
endment-0.215
 upscale-0.211
geist-0.211
Token of
Token ID286
Feature activation+3.24e+00

Pos logit contributions
erous+1.408
ilers+1.407
itored+1.302
 spur+1.238
 Consent+1.225
Neg logit contributions
rity-1.575
pun-1.460
endment-1.362
geist-1.348
 upscale-1.338
Token a
Token ID257
Feature activation+4.53e+00

Pos logit contributions
rity+2.646
pun+2.474
 upscale+2.309
endment+2.284
ior+2.266
Neg logit contributions
ilers-2.534
erous-2.510
itored-2.365
 Consent-2.204
 spur-2.188
Token food
Token ID2057
Feature activation+1.06e+01

Pos logit contributions
rity+3.612
pun+3.420
 upscale+3.223
ior+3.144
endment+3.143
Neg logit contributions
ilers-3.534
erous-3.494
itored-3.292
 Consent-3.062
 spur-3.015
Token hall
Token ID6899
Feature activation+7.31e+00

Pos logit contributions
pun+7.858
rity+7.833
 upscale+7.302
geist+7.300
ior+7.176
Neg logit contributions
ilers-7.981
erous-7.860
itored-7.660
 Consent-7.153
 Fathers-6.887
Token that
Token ID326
Feature activation+5.34e+00

Pos logit contributions
rity+5.493
pun+5.324
 upscale+4.959
ior+4.858
endment+4.854
Neg logit contributions
ilers-5.902
erous-5.799
itored-5.454
 Consent-5.180
 Fathers-4.973
Token ranks
Token ID9803
Feature activation+1.50e+01

Pos logit contributions
rity+4.307
pun+4.129
 upscale+3.873
geist+3.807
ior+3.796
Neg logit contributions
ilers-4.093
erous-4.034
itored-3.787
 Consent-3.540
ilitating-3.470
Token among
Token ID1871
Feature activation+8.88e+00

Pos logit contributions
rity+9.894
pun+9.242
 upscale+8.948
ior+8.523
geist+8.412
Neg logit contributions
ilers-11.954
erous-11.664
itored-11.297
 Consent-10.747
zee-10.373
Token what
Token ID644
Feature activation+1.22e-01

Pos logit contributions
rity+6.836
pun+6.360
 upscale+6.218
ior+5.904
geist+5.851
Neg logit contributions
ilers-6.922
erous-6.618
itored-6.410
 Consent-6.008
 spur-5.928
Token you
Token ID345
Feature activation+3.46e+00

Pos logit contributions
rity+0.103
pun+0.096
endment+0.090
geist+0.089
 upscale+0.088
Neg logit contributions
ilers-0.092
erous-0.092
itored-0.085
 spur-0.080
 Consent-0.080
Token find
Token ID1064
Feature activation+2.97e+00

Pos logit contributions
rity+2.659
pun+2.469
 upscale+2.264
endment+2.260
geist+2.253
Neg logit contributions
ilers-2.876
erous-2.867
itored-2.689
 Consent-2.546
 spur-2.525
Token in
Token ID287
Feature activation+3.91e+00

Pos logit contributions
rity+2.289
pun+2.134
endment+1.960
 upscale+1.956
geist+1.952
Neg logit contributions
ilers-2.470
erous-2.459
itored-2.304
 Consent-2.183
 spur-2.167
Token that
Token ID326
Feature activation-4.46e-01

Pos logit contributions
rity+3.295
pun+3.118
endment+2.850
geist+2.821
ior+2.799
Neg logit contributions
ilers-3.685
erous-3.661
itored-3.460
 Consent-3.232
 spur-3.224
Token protect
Token ID1805
Feature activation+4.67e+00

Pos logit contributions
ilers+0.346
erous+0.345
itored+0.321
 spur+0.303
 Consent+0.303
Neg logit contributions
rity-0.371
pun-0.346
endment-0.321
geist-0.318
 upscale-0.317
Token the
Token ID262
Feature activation+6.29e+00

Pos logit contributions
rity+3.785
pun+3.532
endment+3.293
 upscale+3.270
ior+3.212
Neg logit contributions
ilers-3.640
erous-3.528
itored-3.377
 Consent-3.108
 spur-3.092
Token privacy
Token ID6782
Feature activation+9.71e+00

Pos logit contributions
rity+4.578
pun+4.181
endment+3.914
 upscale+3.880
ior+3.830
Neg logit contributions
ilers-5.366
erous-5.228
itored-5.044
ilitating-4.646
esses-4.640
Token and
Token ID290
Feature activation+9.90e+00

Pos logit contributions
rity+6.745
pun+6.235
endment+5.845
geist+5.627
ior+5.596
Neg logit contributions
ilers-8.368
erous-8.103
itored-7.785
 Consent-7.260
esses-7.226
Token civil
Token ID3026
Feature activation+1.50e+01

Pos logit contributions
rity+7.104
pun+6.508
 upscale+5.998
endment+5.984
ior+5.955
Neg logit contributions
ilers-8.075
erous-7.776
itored-7.705
esses-7.079
zee-7.053
Token rights
Token ID2489
Feature activation+1.40e+01

Pos logit contributions
rity+8.049
pun+7.411
endment+6.920
ior+6.618
 upscale+6.524
Neg logit contributions
ilers-14.330
erous-13.731
itored-13.083
zee-12.472
 Consent-12.308
Token of
Token ID286
Feature activation+4.40e+00

Pos logit contributions
rity+8.739
pun+8.003
endment+7.593
geist+6.967
ior+6.951
Neg logit contributions
ilers-12.092
erous-11.710
itored-11.020
 Consent-10.572
zee-10.553
Token all
Token ID477
Feature activation+2.91e+00

Pos logit contributions
rity+3.573
pun+3.328
endment+3.159
 upscale+3.152
geist+3.036
Neg logit contributions
ilers-3.365
erous-3.267
itored-3.129
 spur-2.854
 Consent-2.853
Token Virginia
Token ID6025
Feature activation-1.96e-01

Pos logit contributions
rity+2.563
pun+2.409
endment+2.262
 upscale+2.244
geist+2.224
Neg logit contributions
ilers-2.076
erous-2.036
itored-1.914
 spur-1.761
 Consent-1.745
Token residents
Token ID5085
Feature activation+1.79e+00

Pos logit contributions
ilers+0.147
erous+0.146
itored+0.135
 spur+0.127
 Consent+0.127
Neg logit contributions
rity-0.168
pun-0.157
endment-0.147
 upscale-0.145
geist-0.145
Token.
Token ID13
Feature activation-7.05e-02

Pos logit contributions
ilers+1.420
erous+1.409
itored+1.327
 spur+1.261
 Consent+1.251
Neg logit contributions
rity-1.376
pun-1.274
endment-1.178
geist-1.170
 upscale-1.153
Token
Token ID198
Feature activation+4.53e-01

Pos logit contributions
ilers+0.054
erous+0.054
itored+0.050
 spur+0.047
 Consent+0.047
Neg logit contributions
rity-0.059
pun-0.055
endment-0.051
geist-0.051
 upscale-0.051
Token
Token ID198
Feature activation+9.80e-01

Pos logit contributions
rity+0.372
pun+0.349
endment+0.322
geist+0.319
 upscale+0.319
Neg logit contributions
ilers-0.355
erous-0.353
itored-0.330
 spur-0.310
 Consent-0.310
TokenFor
Token ID1890
Feature activation-5.45e-01

Pos logit contributions
rity+0.721
pun+0.666
endment+0.612
geist+0.606
 upscale+0.605
Neg logit contributions
ilers-0.852
erous-0.850
itored-0.799
 Consent-0.756
 spur-0.755
Token the
Token ID262
Feature activation+1.96e+00

Pos logit contributions
ilers+0.447
erous+0.444
itored+0.416
 spur+0.394
 Consent+0.393
Neg logit contributions
rity-0.427
pun-0.397
endment-0.367
geist-0.364
 upscale-0.361
Token last
Token ID938
Feature activation+1.49e+01

Pos logit contributions
rity+1.784
pun+1.678
endment+1.562
 upscale+1.561
geist+1.550
Neg logit contributions
ilers-1.358
erous-1.353
itored-1.255
 Consent-1.165
 spur-1.161
Token 15
Token ID1315
Feature activation+6.18e+00

Pos logit contributions
rity+8.951
pun+7.983
 upscale+7.900
ior+7.332
geist+7.207
Neg logit contributions
ilers-12.812
erous-12.633
itored-12.097
 Consent-11.275
ilitating-11.198
Token days
Token ID1528
Feature activation+3.82e+00

Pos logit contributions
rity+4.598
pun+4.295
 upscale+3.910
ior+3.904
endment+3.886
Neg logit contributions
ilers-5.155
erous-5.131
itored-4.783
 Consent-4.556
 spur-4.429
Token,
Token ID11
Feature activation+2.14e+00

Pos logit contributions
rity+2.904
pun+2.686
 upscale+2.473
endment+2.471
geist+2.428
Neg logit contributions
erous-3.206
ilers-3.199
itored-2.996
 Consent-2.836
 spur-2.805
Token boats
Token ID16659
Feature activation+6.93e-01

Pos logit contributions
rity+1.795
pun+1.674
endment+1.550
 upscale+1.548
geist+1.535
Neg logit contributions
ilers-1.638
erous-1.635
itored-1.519
 Consent-1.426
 spur-1.421
Token have
Token ID423
Feature activation-7.48e-01

Pos logit contributions
rity+0.545
pun+0.507
endment+0.468
geist+0.464
 upscale+0.463
Neg logit contributions
ilers-0.567
erous-0.565
itored-0.529
 spur-0.499
 Consent-0.499
Token said
Token ID531
Feature activation-3.70e+00

Pos logit contributions
erous+1.771
ilers+1.768
itored+1.651
 spur+1.577
 Consent+1.562
Neg logit contributions
rity-1.654
pun-1.524
endment-1.408
geist-1.396
 upscale-1.391
Token,
Token ID11
Feature activation-2.36e+00

Pos logit contributions
erous+2.875
ilers+2.871
itored+2.672
 spur+2.539
 Consent+2.516
Neg logit contributions
rity-3.037
pun-2.835
endment-2.622
 upscale-2.610
geist-2.604
Token and
Token ID290
Feature activation-1.57e+00

Pos logit contributions
erous+1.757
ilers+1.754
itored+1.628
 spur+1.561
 Consent+1.538
Neg logit contributions
rity-2.025
pun-1.878
endment-1.756
geist-1.748
 upscale-1.733
Token giving
Token ID3501
Feature activation-3.44e+00

Pos logit contributions
erous+1.191
ilers+1.186
itored+1.099
 spur+1.051
 Consent+1.042
Neg logit contributions
rity-1.331
pun-1.240
endment-1.151
geist-1.143
 upscale-1.136
Token the
Token ID262
Feature activation+8.38e-01

Pos logit contributions
erous+2.631
ilers+2.593
itored+2.395
 spur+2.316
 Consent+2.296
Neg logit contributions
rity-2.899
pun-2.698
endment-2.510
geist-2.498
 upscale-2.469
Token civil
Token ID3026
Feature activation+1.48e+01

Pos logit contributions
rity+0.681
pun+0.635
endment+0.587
geist+0.583
 upscale+0.581
Neg logit contributions
ilers-0.665
erous-0.664
itored-0.618
 Consent-0.585
 spur-0.585
Token-
Token ID12
Feature activation+1.20e+01

Pos logit contributions
rity+8.637
pun+7.911
endment+7.487
ior+7.095
 upscale+6.888
Neg logit contributions
ilers-12.861
erous-12.424
itored-12.092
 Consent-11.637
zee-11.556
Tokenrights
Token ID28046
Feature activation+9.51e+00

Pos logit contributions
rity+5.688
pun+5.149
ior+4.381
endment+4.140
against+4.051
Neg logit contributions
ilers-12.675
erous-12.662
itored-12.495
 spur-11.936
 Consent-11.923
Token office
Token ID2607
Feature activation-2.13e-01

Pos logit contributions
rity+6.538
pun+5.904
endment+5.504
 upscale+5.418
ior+5.408
Neg logit contributions
ilers-8.278
erous-8.033
itored-7.901
 Consent-7.250
zee-7.240
Token more
Token ID517
Feature activation-3.07e+00

Pos logit contributions
ilers+0.163
erous+0.163
itored+0.151
 spur+0.143
 Consent+0.143
Neg logit contributions
rity-0.179
pun-0.167
endment-0.155
geist-0.154
 upscale-0.153
Token
Token ID564
Feature activation+4.88e+00

Pos logit contributions
erous+2.399
ilers+2.391
itored+2.211
 spur+2.121
 Consent+2.095
Neg logit contributions
rity-2.530
pun-2.356
endment-2.188
geist-2.167
ior-2.152
Token the
Token ID262
Feature activation+1.38e+00

Pos logit contributions
rity+1.287
pun+1.201
endment+1.119
 upscale+1.114
geist+1.104
Neg logit contributions
ilers-1.135
erous-1.128
itored-1.052
 Consent-0.979
 spur-0.977
Token EPA
Token ID10193
Feature activation+2.94e+00

Pos logit contributions
rity+1.100
pun+1.022
endment+0.944
 upscale+0.938
geist+0.933
Neg logit contributions
ilers-1.118
erous-1.112
itored-1.043
 spur-0.977
 Consent-0.976
Token
Token ID447
Feature activation-4.98e-01

Pos logit contributions
rity+2.570
pun+2.415
endment+2.241
 upscale+2.226
ior+2.225
Neg logit contributions
ilers-2.124
erous-2.108
itored-1.957
 Consent-1.821
 spur-1.820
Token
Token ID247
Feature activation+3.23e+00

Pos logit contributions
ilers+0.449
erous+0.447
itored+0.423
 Consent+0.404
 spur+0.402
Neg logit contributions
rity-0.344
pun-0.320
endment-0.290
geist-0.289
 upscale-0.289
Tokens
Token ID82
Feature activation+5.24e+00

Pos logit contributions
rity+3.303
pun+3.120
endment+2.933
ior+2.918
geist+2.894
Neg logit contributions
ilers-1.861
erous-1.840
itored-1.687
 spur-1.578
 Consent-1.553
Token civil
Token ID3026
Feature activation+1.48e+01

Pos logit contributions
rity+3.986
pun+3.657
 upscale+3.388
ior+3.386
endment+3.385
Neg logit contributions
ilers-4.325
erous-4.288
itored-4.050
 spur-3.768
zee-3.730
Token-
Token ID12
Feature activation+1.28e+01

Pos logit contributions
rity+8.250
pun+7.727
endment+7.123
ior+6.934
 upscale+6.589
Neg logit contributions
ilers-13.150
erous-12.685
itored-12.399
zee-11.990
 Consent-11.818
Tokenrights
Token ID28046
Feature activation+1.14e+01

Pos logit contributions
rity+5.764
pun+5.385
ior+4.540
against+4.262
endment+4.190
Neg logit contributions
ilers-13.644
erous-13.637
itored-13.440
 spur-12.885
 Consent-12.821
Token office
Token ID2607
Feature activation+3.06e+00

Pos logit contributions
rity+8.807
pun+8.037
endment+7.485
ior+7.477
 upscale+7.458
Neg logit contributions
ilers-8.513
erous-8.337
itored-8.005
 spur-7.332
 Consent-7.289
Token will
Token ID481
Feature activation-9.47e-01

Pos logit contributions
rity+2.682
pun+2.526
endment+2.332
 upscale+2.328
geist+2.323
Neg logit contributions
ilers-2.200
erous-2.183
itored-2.029
 Consent-1.893
 spur-1.888
Token boost
Token ID5750
Feature activation-2.34e+00

Pos logit contributions
ilers+0.764
erous+0.761
itored+0.712
 spur+0.675
 Consent+0.671
Neg logit contributions
rity-0.757
pun-0.705
endment-0.654
geist-0.647
 upscale-0.640
Token complaint
Token ID8224
Feature activation+6.21e-01

Pos logit contributions
rity+1.480
pun+1.367
endment+1.226
geist+1.212
 upscale+1.211
Neg logit contributions
ilers-2.311
erous-2.311
itored-2.183
 spur-2.068
 Consent-2.062
Token with
Token ID351
Feature activation+6.40e-01

Pos logit contributions
rity+0.476
pun+0.441
endment+0.406
geist+0.402
 upscale+0.401
Neg logit contributions
ilers-0.521
erous-0.520
itored-0.487
 Consent-0.461
 spur-0.461
Token the
Token ID262
Feature activation+2.00e+00

Pos logit contributions
rity+0.563
pun+0.527
endment+0.491
geist+0.487
 upscale+0.486
Neg logit contributions
ilers-0.465
erous-0.463
itored-0.429
 Consent-0.402
 spur-0.402
Token Colorado
Token ID7492
Feature activation+1.09e+00

Pos logit contributions
rity+1.570
pun+1.461
endment+1.344
 upscale+1.336
geist+1.331
Neg logit contributions
ilers-1.647
erous-1.639
itored-1.537
 spur-1.445
 Consent-1.442
Token Civil
Token ID7511
Feature activation+1.46e+01

Pos logit contributions
rity+0.773
pun+0.713
endment+0.652
geist+0.645
 upscale+0.644
Neg logit contributions
ilers-0.973
erous-0.970
itored-0.913
 Consent-0.867
 spur-0.866
Token Rights
Token ID6923
Feature activation+1.48e+01

Pos logit contributions
rity+8.944
pun+8.624
endment+7.945
ior+7.805
geist+6.968
Neg logit contributions
ilers-12.829
erous-12.475
itored-11.578
 spur-11.426
ilitating-11.132
Token Division
Token ID7458
Feature activation+3.33e+00

Pos logit contributions
rity+8.467
pun+7.310
endment+6.854
ior+6.598
 upscale+6.106
Neg logit contributions
ilers-14.043
erous-13.004
itored-12.866
 spur-12.238
zee-12.183
Token,
Token ID11
Feature activation+2.03e+00

Pos logit contributions
rity+2.541
pun+2.357
endment+2.185
geist+2.159
ior+2.151
Neg logit contributions
ilers-2.779
erous-2.763
itored-2.591
 spur-2.434
 Consent-2.422
Token which
Token ID543
Feature activation-1.14e+00

Pos logit contributions
rity+1.590
pun+1.479
endment+1.363
 upscale+1.351
geist+1.348
Neg logit contributions
ilers-1.678
erous-1.668
itored-1.563
 spur-1.473
 Consent-1.468
Token spent
Token ID3377
Feature activation+3.85e+00

Pos logit contributions
erous+0.858
ilers+0.855
itored+0.797
 Consent+0.752
 spur+0.750
Neg logit contributions
rity-0.967
pun-0.900
endment-0.846
geist-0.834
 upscale-0.827
Token a
Token ID257
Feature activation+4.18e+00

Pos logit contributions
rity+3.039
pun+2.852
 upscale+2.647
endment+2.614
geist+2.610
Neg logit contributions
ilers-3.075
erous-3.047
itored-2.871
 Consent-2.659
 spur-2.646
Token Donna
Token ID29216
Feature activation+2.00e+00

Pos logit contributions
rity+0.610
pun+0.573
endment+0.534
geist+0.530
 upscale+0.530
Neg logit contributions
ilers-0.467
erous-0.465
itored-0.430
 spur-0.401
 Consent-0.400
Token brought
Token ID3181
Feature activation-2.89e-02

Pos logit contributions
rity+1.704
pun+1.597
endment+1.475
geist+1.471
 upscale+1.468
Neg logit contributions
ilers-1.507
erous-1.493
itored-1.387
 Consent-1.298
 spur-1.293
Token in
Token ID287
Feature activation-2.06e+00

Pos logit contributions
ilers+0.021
erous+0.021
itored+0.020
 spur+0.018
 Consent+0.018
Neg logit contributions
rity-0.025
pun-0.024
endment-0.022
geist-0.022
 upscale-0.022
Token her
Token ID607
Feature activation-1.03e+00

Pos logit contributions
erous+1.467
ilers+1.461
itored+1.343
 Consent+1.276
 spur+1.275
Neg logit contributions
rity-1.835
pun-1.726
endment-1.612
geist-1.594
 upscale-1.582
Token four
Token ID1440
Feature activation-2.27e+00

Pos logit contributions
ilers+0.744
erous+0.743
itored+0.686
 Consent+0.648
 spur+0.647
Neg logit contributions
rity-0.907
pun-0.851
endment-0.795
geist-0.787
 upscale-0.782
Token cub
Token ID13617
Feature activation-1.80e+01

Pos logit contributions
erous+1.637
ilers+1.631
itored+1.504
 spur+1.424
 Consent+1.420
Neg logit contributions
rity-1.993
pun-1.871
endment-1.748
geist-1.738
 upscale-1.709
Tokens
Token ID82
Feature activation-2.86e+00

Pos logit contributions
ilers+9.618
erous+9.333
esses+8.392
er+8.204
 spur+8.194
Neg logit contributions
rity-16.020
pun-15.123
endment-14.422
geist-14.348
CVE-13.989
Token.
Token ID13
Feature activation-2.64e+00

Pos logit contributions
ilers+2.152
erous+2.143
itored+1.993
 spur+1.890
 Consent+1.884
Neg logit contributions
rity-2.426
pun-2.257
endment-2.116
geist-2.089
 upscale-2.078
Token Shortly
Token ID27265
Feature activation+1.78e+00

Pos logit contributions
erous+1.723
ilers+1.719
itored+1.588
 Consent+1.507
 spur+1.471
Neg logit contributions
rity-2.481
pun-2.336
endment-2.203
geist-2.170
 upscale-2.170
Token,
Token ID11
Feature activation-4.85e+00

Pos logit contributions
rity+1.532
pun+1.428
endment+1.332
 upscale+1.325
geist+1.318
Neg logit contributions
ilers-1.329
erous-1.318
itored-1.229
 Consent-1.146
 spur-1.141
Token the
Token ID262
Feature activation-1.73e+00

Pos logit contributions
erous+3.256
ilers+3.238
itored+2.972
 spur+2.844
 Consent+2.842
Neg logit contributions
rity-4.406
pun-4.186
endment-3.926
geist-3.877
ior-3.821
Token D
Token ID360
Feature activation+3.63e+00

Pos logit contributions
rity+1.786
pun+1.678
 upscale+1.552
endment+1.550
geist+1.549
Neg logit contributions
ilers-1.508
erous-1.501
itored-1.400
 Consent-1.299
 spur-1.294
Tokenawa
Token ID6909
Feature activation+1.21e+00

Pos logit contributions
rity+3.095
pun+2.892
endment+2.679
ior+2.662
geist+2.654
Neg logit contributions
erous-2.723
ilers-2.722
itored-2.538
 Consent-2.407
 spur-2.391
Token along
Token ID1863
Feature activation-1.75e+00

Pos logit contributions
rity+1.014
pun+0.948
endment+0.877
geist+0.869
 upscale+0.869
Neg logit contributions
ilers-0.936
erous-0.933
itored-0.870
 Consent-0.817
 spur-0.815
Token with
Token ID351
Feature activation-6.21e-01

Pos logit contributions
ilers+1.450
erous+1.445
itored+1.353
 spur+1.278
 Consent+1.276
Neg logit contributions
rity-1.357
pun-1.260
endment-1.162
geist-1.151
 upscale-1.147
Token her
Token ID607
Feature activation-1.29e+00

Pos logit contributions
ilers+0.483
erous+0.482
itored+0.448
 spur+0.424
 Consent+0.422
Neg logit contributions
rity-0.513
pun-0.479
endment-0.445
geist-0.440
 upscale-0.438
Token cub
Token ID13617
Feature activation-1.79e+01

Pos logit contributions
ilers+0.902
erous+0.900
itored+0.829
 spur+0.784
 Consent+0.777
Neg logit contributions
rity-1.175
pun-1.102
endment-1.035
geist-1.024
 upscale-1.012
Tokens
Token ID82
Feature activation-4.64e+00

Pos logit contributions
erous+11.270
ilers+11.081
itored+10.306
zee+10.239
er+10.223
Neg logit contributions
rity-13.374
pun-12.937
endment-12.573
geist-11.921
CVE-11.796
Token is
Token ID318
Feature activation-4.40e+00

Pos logit contributions
ilers+3.419
erous+3.415
itored+3.187
 spur+3.047
 Consent+2.992
Neg logit contributions
rity-3.907
pun-3.663
endment-3.478
geist-3.390
 upscale-3.361
Token one
Token ID530
Feature activation+3.95e-01

Pos logit contributions
ilers+3.314
erous+3.294
itored+3.104
 spur+3.015
 Consent+2.897
Neg logit contributions
rity-3.670
pun-3.355
endment-3.211
geist-3.151
ior-3.088
Token of
Token ID286
Feature activation-3.22e+00

Pos logit contributions
rity+0.327
pun+0.305
endment+0.283
 upscale+0.280
geist+0.280
Neg logit contributions
ilers-0.307
erous-0.306
itored-0.285
 Consent-0.268
 spur-0.267
Token the
Token ID262
Feature activation-5.89e+00

Pos logit contributions
ilers+2.269
erous+2.268
itored+2.098
 spur+1.995
 Consent+1.972
Neg logit contributions
rity-2.858
pun-2.693
endment-2.529
geist-2.499
ior-2.460
Token both
Token ID1111
Feature activation-2.85e+00

Pos logit contributions
ilers+1.635
erous+1.633
itored+1.501
 spur+1.435
 Consent+1.421
Neg logit contributions
rity-1.911
pun-1.779
endment-1.667
geist-1.650
 upscale-1.630
Token of
Token ID286
Feature activation-1.03e-01

Pos logit contributions
ilers+1.431
erous+1.425
itored+1.251
 spur+1.173
 Consent+1.151
Neg logit contributions
rity-3.159
pun-2.978
endment-2.837
geist-2.807
 upscale-2.782
Token their
Token ID511
Feature activation-2.83e+00

Pos logit contributions
ilers+0.080
erous+0.079
itored+0.074
 spur+0.070
 Consent+0.070
Neg logit contributions
rity-0.086
pun-0.081
endment-0.075
geist-0.074
 upscale-0.074
Token seasons
Token ID7028
Feature activation-3.68e+00

Pos logit contributions
ilers+2.027
erous+2.016
itored+1.844
 spur+1.768
 Consent+1.747
Neg logit contributions
rity-2.531
pun-2.352
endment-2.210
geist-2.182
 upscale-2.152
Token on
Token ID319
Feature activation-1.57e+01

Pos logit contributions
ilers+2.792
erous+2.784
itored+2.560
 spur+2.493
 Consent+2.418
Neg logit contributions
rity-3.141
pun-2.831
endment-2.671
geist-2.661
 upscale-2.613
Token the
Token ID262
Feature activation-1.70e+01

Pos logit contributions
erous+10.008
ilers+8.956
itored+8.277
 spur+8.089
 Consent+8.042
Neg logit contributions
rity-13.841
pun-12.841
geist-12.657
endment-12.163
ior-11.879
Token line
Token ID1627
Feature activation-1.07e+01

Pos logit contributions
erous+9.104
ilers+9.049
 spur+8.661
itored+7.743
er+7.643
Neg logit contributions
rity-15.411
pun-14.386
endment-13.749
geist-13.536
CVE-13.463
Token.
Token ID13
Feature activation+2.37e+00

Pos logit contributions
ilers+8.073
erous+8.043
itored+7.503
 spur+7.422
er+7.060
Neg logit contributions
rity-8.477
pun-7.395
endment-7.088
geist-7.064
 upscale-7.017
Token Then
Token ID3244
Feature activation+3.03e+00

Pos logit contributions
rity+2.035
pun+1.923
endment+1.772
geist+1.771
 upscale+1.758
Neg logit contributions
ilers-1.766
erous-1.753
itored-1.629
 spur-1.525
 Consent-1.497
Token it
Token ID340
Feature activation+7.49e-01

Pos logit contributions
rity+2.443
pun+2.310
endment+2.121
 upscale+2.116
geist+2.102
Neg logit contributions
ilers-2.383
erous-2.366
itored-2.223
 Consent-2.079
 spur-2.059
Token went
Token ID1816
Feature activation+4.11e+00

Pos logit contributions
rity+0.624
pun+0.584
endment+0.540
 upscale+0.537
geist+0.536
Neg logit contributions
ilers-0.578
erous-0.576
itored-0.537
 Consent-0.505
 spur-0.502
Token Trump
Token ID1301
Feature activation-7.49e+00

Pos logit contributions
ilers+4.248
erous+4.174
itored+3.853
 Consent+3.756
 spur+3.746
Neg logit contributions
rity-5.925
pun-5.566
endment-5.199
geist-5.146
 upscale-5.140
Token knew
Token ID2993
Feature activation-9.97e+00

Pos logit contributions
ilers+4.588
erous+4.421
itored+4.069
 spur+3.959
 Consent+3.958
Neg logit contributions
rity-7.117
pun-6.807
endment-6.409
CVE-6.296
geist-6.224
Token Stone
Token ID8026
Feature activation-5.45e+00

Pos logit contributions
erous+6.969
ilers+6.861
itored+6.333
 Consent+6.333
 spur+6.215
Neg logit contributions
rity-8.465
pun-8.029
endment-7.799
geist-7.421
 upscale-7.241
Token talked
Token ID6619
Feature activation-4.97e+00

Pos logit contributions
erous+3.751
ilers+3.743
itored+3.565
 spur+3.510
 Consent+3.382
Neg logit contributions
rity-4.992
pun-4.604
endment-4.297
ior-4.129
geist-4.119
Token with
Token ID351
Feature activation-1.29e+01

Pos logit contributions
erous+3.214
ilers+3.147
itored+3.024
 Consent+2.827
 spur+2.802
Neg logit contributions
rity-4.702
pun-4.326
endment-4.113
geist-4.067
ior-4.011
Token WikiLeaks
Token ID19766
Feature activation-1.69e+01

Pos logit contributions
erous+8.337
ilers+8.257
itored+7.534
 spur+7.528
 Fathers+7.266
Neg logit contributions
rity-11.336
pun-11.153
endment-10.620
geist-10.451
CVE-9.857
Token about
Token ID546
Feature activation-8.42e+00

Pos logit contributions
ilers+7.049
erous+6.381
er+5.843
 Consent+5.803
itored+5.792
Neg logit contributions
rity-17.533
pun-17.500
deen-16.990
endment-16.426
geist-16.378
Token DNC
Token ID17824
Feature activation-8.57e+00

Pos logit contributions
ilers+5.093
erous+4.725
 spur+4.529
itored+4.471
 Consent+4.339
Neg logit contributions
rity-7.667
geist-7.646
pun-7.398
CVE-7.318
 upscale-6.976
Token emails
Token ID7237
Feature activation-7.66e+00

Pos logit contributions
ilers+5.265
erous+5.203
itored+4.659
 spur+4.644
 Consent+4.625
Neg logit contributions
rity-7.877
pun-7.730
geist-7.278
deen-7.247
endment-7.016
Token Trump
Token ID1301
Feature activation-3.77e+00

Pos logit contributions
ilers+5.199
erous+5.114
itored+4.666
 Consent+4.651
 spur+4.500
Neg logit contributions
rity-6.898
pun-6.523
endment-6.150
deen-6.121
 upscale-6.074
Token urges
Token ID28804
Feature activation-7.01e+00

Pos logit contributions
erous+2.743
ilers+2.719
itored+2.530
 Consent+2.396
 spur+2.387
Neg logit contributions
rity-3.332
pun-3.110
endment-2.897
geist-2.870
 upscale-2.837
Token
Token ID198
Feature activation-2.68e+00

Pos logit contributions
erous+2.088
ilers+2.085
itored+1.941
 Consent+1.828
 spur+1.811
Neg logit contributions
rity-2.450
pun-2.283
endment-2.138
 upscale-2.122
geist-2.115
TokenFaith
Token ID45536
Feature activation-1.66e+00

Pos logit contributions
erous+2.142
ilers+2.136
itored+1.995
 Consent+1.883
 spur+1.859
Neg logit contributions
rity-2.140
pun-1.969
endment-1.848
 upscale-1.841
geist-1.817
Token's
Token ID338
Feature activation+2.15e+00

Pos logit contributions
erous+1.243
ilers+1.241
itored+1.152
 spur+1.089
 Consent+1.086
Neg logit contributions
rity-1.413
pun-1.320
endment-1.231
geist-1.218
 upscale-1.214
Token two
Token ID734
Feature activation-5.69e-02

Pos logit contributions
rity+1.965
pun+1.844
 upscale+1.726
endment+1.711
geist+1.710
Neg logit contributions
ilers-1.491
erous-1.475
itored-1.372
 spur-1.256
 Consent-1.250
Token male
Token ID4257
Feature activation-5.37e+00

Pos logit contributions
ilers+0.038
erous+0.037
itored+0.034
 spur+0.032
 Consent+0.032
Neg logit contributions
rity-0.054
pun-0.051
endment-0.047
geist-0.047
 upscale-0.047
Token cub
Token ID13617
Feature activation-1.69e+01

Pos logit contributions
ilers+3.291
erous+3.286
itored+2.955
 spur+2.842
 Consent+2.819
Neg logit contributions
rity-5.213
pun-4.932
endment-4.711
geist-4.609
 upscale-4.507
Tokens
Token ID82
Feature activation-3.88e+00

Pos logit contributions
ilers+8.750
erous+8.500
esses+7.763
itored+7.547
 spur+7.533
Neg logit contributions
rity-15.593
pun-14.939
endment-14.319
geist-13.988
CVE-13.781
Token -
Token ID532
Feature activation-3.10e+00

Pos logit contributions
erous+2.912
ilers+2.909
itored+2.701
 spur+2.579
 Consent+2.560
Neg logit contributions
rity-3.260
pun-3.049
endment-2.877
geist-2.812
 upscale-2.789
Token 8
Token ID807
Feature activation-5.90e+00

Pos logit contributions
ilers+2.122
erous+2.120
itored+1.959
 Consent+1.846
 spur+1.840
Neg logit contributions
rity-2.827
pun-2.657
endment-2.497
geist-2.462
 upscale-2.451
Token/
Token ID14
Feature activation-6.86e+00

Pos logit contributions
ilers+4.450
erous+4.412
itored+4.078
 spur+3.906
 Consent+3.890
Neg logit contributions
rity-4.907
pun-4.553
endment-4.286
geist-4.204
 upscale-4.188
Token10
Token ID940
Feature activation-4.27e+00

Pos logit contributions
ilers+4.074
erous+4.070
itored+3.680
 Consent+3.524
 spur+3.422
Neg logit contributions
rity-6.691
pun-6.117
endment-5.964
 upscale-5.862
geist-5.790
Token of
Token ID286
Feature activation+4.49e+00

Pos logit contributions
rity+0.520
pun+0.485
endment+0.449
geist+0.444
 upscale+0.444
Neg logit contributions
ilers-0.507
erous-0.505
itored-0.472
 spur-0.445
 Consent-0.444
Token their
Token ID511
Feature activation+3.32e+00

Pos logit contributions
rity+3.848
pun+3.553
 upscale+3.346
endment+3.294
geist+3.288
Neg logit contributions
ilers-3.309
erous-3.300
itored-3.082
 Consent-2.863
 spur-2.823
Token own
Token ID898
Feature activation-3.13e-01

Pos logit contributions
rity+2.726
pun+2.551
 upscale+2.369
geist+2.343
endment+2.343
Neg logit contributions
ilers-2.572
erous-2.558
itored-2.397
 Consent-2.234
 spur-2.210
Token.
Token ID13
Feature activation-4.22e+00

Pos logit contributions
ilers+0.252
erous+0.251
itored+0.235
 spur+0.221
 Consent+0.221
Neg logit contributions
rity-0.251
pun-0.234
endment-0.216
geist-0.214
 upscale-0.214
Token His
Token ID2399
Feature activation-3.36e+00

Pos logit contributions
erous+2.808
ilers+2.803
itored+2.590
 Consent+2.504
 spur+2.412
Neg logit contributions
rity-3.915
pun-3.653
endment-3.478
 upscale-3.398
geist-3.370
Token overhaul
Token ID18708
Feature activation-1.69e+01

Pos logit contributions
ilers+2.404
erous+2.391
itored+2.206
 spur+2.114
 Consent+2.083
Neg logit contributions
rity-2.950
pun-2.774
endment-2.615
geist-2.573
ior-2.538
Token saw
Token ID2497
Feature activation-6.56e+00

Pos logit contributions
erous+10.581
ilers+10.018
er+9.567
itored+9.503
 spur+9.404
Neg logit contributions
rity-13.316
pun-12.140
endment-12.004
CVE-11.646
geist-11.636
Token the
Token ID262
Feature activation-3.83e+00

Pos logit contributions
erous+4.527
ilers+4.494
itored+4.118
 spur+3.970
 Consent+3.851
Neg logit contributions
rity-5.830
pun-5.506
endment-5.187
geist-5.122
 upscale-5.004
Token dismissal
Token ID25693
Feature activation-4.88e+00

Pos logit contributions
ilers+2.623
erous+2.613
itored+2.383
 spur+2.265
 Consent+2.232
Neg logit contributions
rity-3.504
pun-3.289
endment-3.074
geist-3.069
 upscale-3.010
Token of
Token ID286
Feature activation-3.32e+00

Pos logit contributions
erous+3.489
ilers+3.475
itored+3.236
 spur+3.035
 Consent+2.989
Neg logit contributions
rity-4.257
pun-3.982
endment-3.716
 upscale-3.700
geist-3.681
Token veterans
Token ID11255
Feature activation-4.42e+00

Pos logit contributions
ilers+2.335
erous+2.333
itored+2.116
 spur+2.018
 Consent+1.997
Neg logit contributions
rity-2.981
pun-2.806
endment-2.641
geist-2.596
 upscale-2.568
Token
Token ID247
Feature activation-4.94e-01

Pos logit contributions
ilers+0.225
erous+0.224
itored+0.212
 Consent+0.201
 spur+0.201
Neg logit contributions
rity-0.170
pun-0.157
endment-0.142
geist-0.141
 upscale-0.141
Token?
Token ID30
Feature activation-3.74e+00

Pos logit contributions
ilers+0.323
erous+0.322
itored+0.296
 Consent+0.275
 spur+0.274
Neg logit contributions
rity-0.470
pun-0.442
endment-0.414
geist-0.412
 upscale-0.412
Token
Token ID198
Feature activation-5.30e+00

Pos logit contributions
ilers+2.721
erous+2.704
itored+2.485
 Consent+2.360
 spur+2.345
Neg logit contributions
rity-3.278
pun-3.052
endment-2.858
geist-2.853
 upscale-2.837
Token
Token ID198
Feature activation-5.50e+00

Pos logit contributions
erous+3.505
ilers+3.457
itored+3.176
 Consent+3.032
er+2.977
Neg logit contributions
rity-4.933
pun-4.535
endment-4.362
geist-4.296
 upscale-4.279
Tokenand
Token ID392
Feature activation-6.27e+00

Pos logit contributions
ilers+4.455
erous+4.435
itored+4.099
 Consent+3.870
er+3.815
Neg logit contributions
rity-4.267
pun-3.788
 upscale-3.708
geist-3.666
endment-3.641
Token how
Token ID703
Feature activation-1.68e+01

Pos logit contributions
erous+3.806
ilers+3.798
itored+3.377
 Consent+3.191
 spur+3.190
Neg logit contributions
rity-6.165
pun-5.748
geist-5.481
endment-5.460
 upscale-5.362
Token does
Token ID857
Feature activation-8.81e+00

Pos logit contributions
ilers+8.654
erous+8.137
 alters+6.906
 Fathers+6.736
 spur+6.634
Neg logit contributions
rity-15.564
pun-14.762
geist-14.398
endment-14.321
deen-13.928
Token it
Token ID340
Feature activation-6.25e+00

Pos logit contributions
erous+5.835
ilers+5.818
itored+5.220
 spur+5.195
 Consent+5.128
Neg logit contributions
rity-7.845
pun-7.265
geist-6.873
endment-6.869
 upscale-6.556
Token look
Token ID804
Feature activation+3.04e+00

Pos logit contributions
ilers+4.688
erous+4.683
 spur+4.201
itored+4.169
er+4.061
Neg logit contributions
rity-5.286
pun-4.841
endment-4.563
geist-4.425
deen-4.283
Token from
Token ID422
Feature activation+1.11e-02

Pos logit contributions
rity+2.280
pun+2.114
 upscale+1.961
ior+1.957
endment+1.944
Neg logit contributions
ilers-2.560
erous-2.530
itored-2.399
 Consent-2.263
 spur-2.231
Token the
Token ID262
Feature activation-5.63e+00

Pos logit contributions
rity+0.010
pun+0.009
endment+0.009
geist+0.009
 upscale+0.009
Neg logit contributions
ilers-0.008
erous-0.008
itored-0.007
 Consent-0.007
 spur-0.007
Token guns
Token ID6541
Feature activation+3.85e+00

Pos logit contributions
ilers+0.706
erous+0.704
itored+0.650
 spur+0.613
 Consent+0.610
Neg logit contributions
rity-0.888
pun-0.831
endment-0.779
geist-0.773
 upscale-0.766
Tokenlinger
Token ID33550
Feature activation-2.49e+00

Pos logit contributions
rity+2.226
pun+2.058
endment+1.795
 upscale+1.793
geist+1.778
Neg logit contributions
ilers-3.918
erous-3.915
itored-3.716
 Consent-3.560
 spur-3.535
Token Philip
Token ID14576
Feature activation-3.48e+00

Pos logit contributions
erous+1.618
ilers+1.617
itored+1.478
 spur+1.395
 Consent+1.379
Neg logit contributions
rity-2.366
pun-2.221
endment-2.099
geist-2.077
 upscale-2.059
Token Rivers
Token ID18280
Feature activation-5.06e+00

Pos logit contributions
erous+1.870
ilers+1.870
itored+1.656
 Consent+1.564
 spur+1.553
Neg logit contributions
rity-3.683
pun-3.514
endment-3.322
geist-3.285
 upscale-3.245
Token on
Token ID319
Feature activation-1.34e+01

Pos logit contributions
erous+3.796
ilers+3.765
itored+3.439
 spur+3.438
er+3.249
Neg logit contributions
rity-4.238
pun-3.809
endment-3.680
geist-3.606
ior-3.512
Token the
Token ID262
Feature activation-1.68e+01

Pos logit contributions
erous+8.474
ilers+7.536
itored+7.237
 spur+6.916
er+6.542
Neg logit contributions
rity-12.170
geist-11.193
pun-11.103
endment-10.828
deen-10.622
Token other
Token ID584
Feature activation-8.56e+00

Pos logit contributions
ilers+9.205
 spur+9.010
erous+8.864
itored+8.298
er+7.994
Neg logit contributions
rity-14.843
pun-13.284
endment-12.966
geist-12.945
ior-12.728
Token side
Token ID1735
Feature activation-3.57e+00

Pos logit contributions
erous+5.364
ilers+5.336
itored+4.865
 spur+4.857
er+4.445
Neg logit contributions
rity-7.863
pun-7.332
geist-6.980
endment-6.838
ior-6.786
Token Sunday
Token ID3502
Feature activation+4.12e+00

Pos logit contributions
ilers+2.656
erous+2.648
itored+2.465
 spur+2.340
 Consent+2.281
Neg logit contributions
rity-3.036
pun-2.803
endment-2.632
geist-2.615
 upscale-2.592
Token,
Token ID11
Feature activation-5.71e-01

Pos logit contributions
rity+3.029
pun+2.840
endment+2.603
 upscale+2.578
ior+2.558
Neg logit contributions
ilers-3.535
erous-3.504
itored-3.269
 Consent-3.139
 spur-3.079
Token Ramsey
Token ID30552
Feature activation+2.39e+00

Pos logit contributions
ilers+0.411
erous+0.409
itored+0.378
 spur+0.356
 Consent+0.354
Neg logit contributions
rity-0.506
pun-0.473
endment-0.443
geist-0.440
 upscale-0.436
Token True
Token ID6407
Feature activation+1.61e+00

Pos logit contributions
ilers+0.677
erous+0.674
itored+0.625
 Consent+0.588
 spur+0.585
Neg logit contributions
rity-0.834
pun-0.780
endment-0.729
geist-0.724
 upscale-0.722
Token.
Token ID13
Feature activation-8.49e-01

Pos logit contributions
rity+1.304
pun+1.216
endment+1.124
 upscale+1.114
geist+1.113
Neg logit contributions
erous-1.279
ilers-1.274
itored-1.189
 spur-1.116
 Consent-1.105
Token
Token ID198
Feature activation-6.58e-01

Pos logit contributions
ilers+0.613
erous+0.610
itored+0.567
 Consent+0.534
 spur+0.530
Neg logit contributions
rity-0.750
pun-0.702
endment-0.655
geist-0.650
 upscale-0.649
Token
Token ID198
Feature activation-1.71e+00

Pos logit contributions
ilers+0.497
erous+0.496
itored+0.461
 Consent+0.434
 spur+0.432
Neg logit contributions
rity-0.560
pun-0.522
endment-0.487
geist-0.482
 upscale-0.481
TokenObama
Token ID15948
Feature activation-2.72e+00

Pos logit contributions
ilers+1.433
erous+1.431
itored+1.342
 Consent+1.267
 spur+1.255
Neg logit contributions
rity-1.314
pun-1.201
endment-1.120
 upscale-1.119
geist-1.110
Token on
Token ID319
Feature activation-1.66e+01

Pos logit contributions
ilers+2.054
erous+2.049
itored+1.904
 Consent+1.809
 spur+1.806
Neg logit contributions
rity-2.299
pun-2.133
endment-1.996
geist-1.983
 upscale-1.971
Token a
Token ID257
Feature activation-1.10e+01

Pos logit contributions
erous+11.762
ilers+11.063
itored+10.609
 Consent+10.604
 spur+10.399
Neg logit contributions
rity-12.155
pun-11.269
geist-11.012
endment-10.760
ior-10.324
Token chain
Token ID6333
Feature activation-1.04e+01

Pos logit contributions
erous+8.136
ilers+7.900
 spur+7.458
itored+7.434
 Consent+7.280
Neg logit contributions
rity-8.737
pun-7.953
endment-7.650
geist-7.611
ior-7.332
Token,
Token ID11
Feature activation-7.50e+00

Pos logit contributions
erous+7.269
ilers+7.110
itored+6.836
 spur+6.504
 Consent+6.402
Neg logit contributions
rity-8.698
pun-7.783
geist-7.484
 upscale-7.392
endment-7.334
Token C
Token ID327
Feature activation-5.18e+00

Pos logit contributions
erous+5.404
ilers+5.373
itored+5.045
 Consent+4.852
 spur+4.813
Neg logit contributions
rity-6.335
pun-5.824
geist-5.563
endment-5.511
 upscale-5.357
Tokeniti
Token ID8846
Feature activation-3.83e+00

Pos logit contributions
ilers+3.957
erous+3.945
itored+3.658
 spur+3.405
 Consent+3.385
Neg logit contributions
rity-4.287
pun-3.967
geist-3.706
endment-3.705
 upscale-3.666
Token
Token ID447
Feature activation-1.24e+00

Pos logit contributions
ilers+4.488
erous+4.456
itored+4.006
 spur+3.972
 Consent+3.797
Neg logit contributions
rity-6.012
pun-5.475
endment-5.291
ior-5.098
 upscale-5.029
Token
Token ID247
Feature activation+2.85e-01

Pos logit contributions
erous+1.128
ilers+1.123
itored+1.062
 Consent+1.018
 spur+1.013
Neg logit contributions
rity-0.843
pun-0.788
endment-0.711
geist-0.710
 upscale-0.705
Tokens
Token ID82
Feature activation+8.90e-01

Pos logit contributions
rity+0.302
pun+0.286
endment+0.270
geist+0.268
 upscale+0.268
Neg logit contributions
ilers-0.157
erous-0.155
itored-0.141
 Consent-0.129
 spur-0.128
Token challenges
Token ID6459
Feature activation-4.82e+00

Pos logit contributions
rity+0.725
pun+0.676
endment+0.625
 upscale+0.622
geist+0.621
Neg logit contributions
ilers-0.702
erous-0.700
itored-0.655
 Consent-0.615
 spur-0.613
Token are
Token ID389
Feature activation-8.57e+00

Pos logit contributions
ilers+3.519
erous+3.487
itored+3.201
 spur+3.128
 Consent+3.036
Neg logit contributions
rity-4.215
pun-3.886
endment-3.686
geist-3.584
 upscale-3.568
Token on
Token ID319
Feature activation-1.65e+01

Pos logit contributions
erous+6.039
ilers+5.992
 spur+5.683
itored+5.481
 Consent+5.216
Neg logit contributions
rity-7.442
pun-6.725
endment-6.469
geist-6.295
deen-6.126
Token their
Token ID511
Feature activation-1.61e+01

Pos logit contributions
erous+10.865
ilers+9.818
itored+9.041
 spur+8.722
er+8.459
Neg logit contributions
rity-14.071
pun-12.912
geist-12.419
endment-12.248
deen-11.943
Token door
Token ID3420
Feature activation-1.17e+01

Pos logit contributions
erous+8.498
ilers+8.192
 spur+8.173
itored+7.317
er+7.173
Neg logit contributions
rity-15.151
pun-14.078
endment-13.406
geist-12.978
CVE-12.964
Tokensteps
Token ID20214
Feature activation-9.41e+00

Pos logit contributions
ilers+4.522
erous+4.242
itored+3.582
 spur+3.522
er+2.963
Neg logit contributions
rity-13.873
pun-12.301
endment-12.222
geist-12.118
 upscale-12.085
Token
Token ID960
Feature activation-2.55e+00

Pos logit contributions
erous+6.522
ilers+6.505
itored+5.897
 spur+5.863
ilitating+5.530
Neg logit contributions
rity-8.159
pun-7.370
endment-7.008
 upscale-6.849
geist-6.768
Tokenref
Token ID5420
Feature activation-4.35e+00

Pos logit contributions
ilers+2.188
erous+2.186
itored+2.035
 spur+1.942
 Consent+1.925
Neg logit contributions
rity-1.910
pun-1.731
endment-1.632
geist-1.600
 upscale-1.598
Token or
Token ID393
Feature activation+3.73e+00

Pos logit contributions
ilers+1.458
erous+1.447
itored+1.338
 spur+1.286
 Consent+1.270
Neg logit contributions
rity-1.588
pun-1.468
endment-1.369
geist-1.363
 upscale-1.345
Token so
Token ID523
Feature activation-8.91e-01

Pos logit contributions
rity+2.985
pun+2.808
 upscale+2.604
endment+2.578
ior+2.561
Neg logit contributions
erous-2.944
ilers-2.941
itored-2.785
 Consent-2.561
 spur-2.534
Token.
Token ID13
Feature activation+3.64e-02

Pos logit contributions
ilers+0.700
erous+0.695
itored+0.648
 spur+0.615
 Consent+0.611
Neg logit contributions
rity-0.734
pun-0.681
endment-0.633
geist-0.627
 upscale-0.625
Token
Token ID198
Feature activation-5.57e+00

Pos logit contributions
rity+0.031
pun+0.029
endment+0.027
geist+0.027
 upscale+0.027
Neg logit contributions
ilers-0.027
erous-0.027
itored-0.025
 Consent-0.024
 spur-0.024
Token
Token ID198
Feature activation-5.55e+00

Pos logit contributions
erous+3.622
ilers+3.585
itored+3.311
 Consent+3.171
er+3.106
Neg logit contributions
rity-5.220
pun-4.789
endment-4.629
 upscale-4.526
geist-4.514
TokenHow
Token ID2437
Feature activation-1.64e+01

Pos logit contributions
ilers+4.055
erous+4.036
itored+3.739
 Consent+3.550
er+3.509
Neg logit contributions
rity-4.750
pun-4.263
 upscale-4.113
endment-4.102
geist-3.997
Token would
Token ID561
Feature activation-5.78e+00

Pos logit contributions
ilers+10.145
erous+9.779
 spur+9.008
er+8.654
 Consent+8.405
Neg logit contributions
rity-13.454
pun-12.575
endment-12.026
CVE-11.555
geist-11.410
Token you
Token ID345
Feature activation-3.34e+00

Pos logit contributions
ilers+3.949
erous+3.908
itored+3.521
 spur+3.439
 Consent+3.378
Neg logit contributions
rity-5.274
pun-4.928
endment-4.630
geist-4.591
CVE-4.488
Token view
Token ID1570
Feature activation+1.79e+00

Pos logit contributions
ilers+3.060
erous+3.006
itored+2.823
 spur+2.757
 Consent+2.709
Neg logit contributions
rity-2.325
pun-2.102
endment-1.945
geist-1.903
ior-1.865
Token the
Token ID262
Feature activation+1.12e+00

Pos logit contributions
rity+1.512
pun+1.415
 upscale+1.318
endment+1.313
geist+1.305
Neg logit contributions
ilers-1.345
erous-1.333
itored-1.252
 Consent-1.163
 spur-1.159
Token Seahawks
Token ID16732
Feature activation+4.11e+00

Pos logit contributions
rity+0.966
pun+0.906
 upscale+0.841
endment+0.841
geist+0.835
Neg logit contributions
ilers-0.832
erous-0.827
itored-0.774
 Consent-0.720
 spur-0.716
Token
Token ID251
Feature activation+2.11e+00

Pos logit contributions
erous+1.806
ilers+1.804
itored+1.704
 Consent+1.633
 spur+1.631
Neg logit contributions
rity-1.217
pun-1.124
endment-1.014
geist-1.006
 upscale-1.002
Token 🙂
Token ID32485
Feature activation-2.62e+00

Pos logit contributions
rity+1.776
pun+1.663
geist+1.541
endment+1.541
 upscale+1.529
Neg logit contributions
ilers-1.607
erous-1.594
itored-1.490
 spur-1.397
 Consent-1.383
Token
Token ID198
Feature activation-2.42e+00

Pos logit contributions
ilers+1.905
erous+1.902
itored+1.760
 Consent+1.675
 spur+1.649
Neg logit contributions
rity-2.289
pun-2.139
endment-1.989
 upscale-1.967
geist-1.952
Token
Token ID198
Feature activation-2.10e+00

Pos logit contributions
erous+1.737
ilers+1.734
itored+1.607
 Consent+1.522
 spur+1.500
Neg logit contributions
rity-2.136
pun-1.978
endment-1.871
geist-1.844
 upscale-1.839
TokenOn
Token ID2202
Feature activation-1.25e+01

Pos logit contributions
ilers+1.749
erous+1.746
itored+1.633
 Consent+1.544
 spur+1.526
Neg logit contributions
rity-1.624
pun-1.490
endment-1.387
 upscale-1.382
geist-1.355
Token what
Token ID644
Feature activation-1.64e+01

Pos logit contributions
erous+8.139
ilers+7.799
itored+7.037
 Consent+6.930
 spur+6.853
Neg logit contributions
rity-11.060
pun-10.292
endment-9.743
geist-9.520
CVE-9.320
Token made
Token ID925
Feature activation-1.11e+01

Pos logit contributions
erous+11.364
ilers+10.760
 spur+10.609
 Consent+9.948
itored+9.914
Neg logit contributions
rity-12.203
pun-11.671
endment-11.164
CVE-10.606
geist-10.486
Token him
Token ID683
Feature activation-5.93e+00

Pos logit contributions
erous+6.939
ilers+6.784
 Consent+6.034
itored+5.970
 spur+5.941
Neg logit contributions
rity-10.229
pun-9.687
endment-9.114
CVE-8.738
geist-8.655
Token so
Token ID523
Feature activation-5.82e+00

Pos logit contributions
ilers+4.556
erous+4.506
 spur+4.027
itored+3.974
 Consent+3.959
Neg logit contributions
rity-4.911
pun-4.559
endment-4.265
geist-4.194
ior-4.115
Token successful
Token ID4388
Feature activation-2.39e+00

Pos logit contributions
erous+4.135
ilers+4.118
 spur+3.680
itored+3.633
 Consent+3.516
Neg logit contributions
rity-5.121
pun-4.720
endment-4.494
geist-4.356
ior-4.280
Token,
Token ID11
Feature activation-3.83e+00

Pos logit contributions
ilers+1.834
erous+1.824
itored+1.697
 spur+1.607
 Consent+1.593
Neg logit contributions
rity-1.999
pun-1.862
endment-1.740
 upscale-1.712
geist-1.709
Token
Token ID247
Feature activation+2.85e-01

Pos logit contributions
erous+1.128
ilers+1.123
itored+1.062
 Consent+1.018
 spur+1.013
Neg logit contributions
rity-0.843
pun-0.788
endment-0.711
geist-0.710
 upscale-0.705
Tokens
Token ID82
Feature activation+8.90e-01

Pos logit contributions
rity+0.302
pun+0.286
endment+0.270
geist+0.268
 upscale+0.268
Neg logit contributions
ilers-0.157
erous-0.155
itored-0.141
 Consent-0.129
 spur-0.128
Token challenges
Token ID6459
Feature activation-4.82e+00

Pos logit contributions
rity+0.725
pun+0.676
endment+0.625
 upscale+0.622
geist+0.621
Neg logit contributions
ilers-0.702
erous-0.700
itored-0.655
 Consent-0.615
 spur-0.613
Token are
Token ID389
Feature activation-8.57e+00

Pos logit contributions
ilers+3.519
erous+3.487
itored+3.201
 spur+3.128
 Consent+3.036
Neg logit contributions
rity-4.215
pun-3.886
endment-3.686
geist-3.584
 upscale-3.568
Token on
Token ID319
Feature activation-1.65e+01

Pos logit contributions
erous+6.039
ilers+5.992
 spur+5.683
itored+5.481
 Consent+5.216
Neg logit contributions
rity-7.442
pun-6.725
endment-6.469
geist-6.295
deen-6.126
Token their
Token ID511
Feature activation-1.61e+01

Pos logit contributions
erous+10.865
ilers+9.818
itored+9.041
 spur+8.722
er+8.459
Neg logit contributions
rity-14.071
pun-12.912
geist-12.419
endment-12.248
deen-11.943
Token door
Token ID3420
Feature activation-1.17e+01

Pos logit contributions
erous+8.498
ilers+8.192
 spur+8.173
itored+7.317
er+7.173
Neg logit contributions
rity-15.151
pun-14.078
endment-13.406
geist-12.978
CVE-12.964
Tokensteps
Token ID20214
Feature activation-9.41e+00

Pos logit contributions
ilers+4.522
erous+4.242
itored+3.582
 spur+3.522
er+2.963
Neg logit contributions
rity-13.873
pun-12.301
endment-12.222
geist-12.118
 upscale-12.085
Token
Token ID960
Feature activation-2.55e+00

Pos logit contributions
erous+6.522
ilers+6.505
itored+5.897
 spur+5.863
ilitating+5.530
Neg logit contributions
rity-8.159
pun-7.370
endment-7.008
 upscale-6.849
geist-6.768
Tokenref
Token ID5420
Feature activation-4.35e+00

Pos logit contributions
ilers+2.188
erous+2.186
itored+2.035
 spur+1.942
 Consent+1.925
Neg logit contributions
rity-1.910
pun-1.731
endment-1.632
geist-1.600
 upscale-1.598
Tokenuge
Token ID2217
Feature activation-7.22e+00

Pos logit contributions
ilers+3.328
erous+3.291
itored+3.040
 spur+2.904
 Consent+2.843
Neg logit contributions
rity-3.677
pun-3.385
endment-3.146
 upscale-3.143
geist-3.137
Token 1987
Token ID12923
Feature activation+5.49e+00

Pos logit contributions
rity+2.539
pun+2.372
 upscale+2.214
geist+2.196
endment+2.190
Neg logit contributions
ilers-2.158
erous-2.141
itored-2.002
 Consent-1.852
 spur-1.844
Token album
Token ID5062
Feature activation+3.78e-01

Pos logit contributions
rity+4.751
pun+4.470
 upscale+4.243
ior+4.153
endment+4.139
Neg logit contributions
ilers-3.844
erous-3.823
itored-3.599
 spur-3.258
 Consent-3.239
Token Blessing
Token ID49964
Feature activation-1.03e+00

Pos logit contributions
rity+0.318
pun+0.298
endment+0.276
geist+0.274
 upscale+0.274
Neg logit contributions
ilers-0.288
erous-0.287
itored-0.267
 spur-0.250
 Consent-0.250
Token was
Token ID373
Feature activation+1.89e+00

Pos logit contributions
ilers+0.769
erous+0.767
itored+0.713
 Consent+0.670
 spur+0.669
Neg logit contributions
rity-0.886
pun-0.826
endment-0.772
 upscale-0.763
geist-0.762
Token on
Token ID319
Feature activation-1.04e+01

Pos logit contributions
rity+1.561
pun+1.463
endment+1.350
 upscale+1.344
geist+1.339
Neg logit contributions
ilers-1.468
erous-1.461
itored-1.363
 Consent-1.277
 spur-1.271
Token constant
Token ID6937
Feature activation-1.59e+01

Pos logit contributions
erous+7.233
ilers+6.683
itored+6.321
 spur+6.219
 Consent+6.134
Neg logit contributions
rity-8.988
pun-8.313
geist-8.006
endment-7.877
ior-7.662
Token rotation
Token ID13179
Feature activation-1.05e+01

Pos logit contributions
erous+9.460
 spur+8.943
ilers+8.612
itored+8.381
er+8.126
Neg logit contributions
rity-13.712
geist-12.571
pun-12.430
deen-11.947
endment-11.886
Token and
Token ID290
Feature activation-3.88e+00

Pos logit contributions
erous+7.454
ilers+7.203
itored+6.955
 spur+6.848
er+6.428
Neg logit contributions
rity-8.628
pun-7.855
endment-7.566
geist-7.328
 upscale-7.288
Token received
Token ID2722
Feature activation-4.16e-01

Pos logit contributions
erous+2.919
ilers+2.886
itored+2.691
 spur+2.641
 Consent+2.560
Neg logit contributions
rity-3.264
pun-3.010
endment-2.852
geist-2.800
ior-2.749
Token its
Token ID663
Feature activation+3.77e+00

Pos logit contributions
ilers+0.320
erous+0.319
itored+0.296
 spur+0.281
 Consent+0.279
Neg logit contributions
rity-0.347
pun-0.324
endment-0.301
geist-0.299
 upscale-0.296
Token fair
Token ID3148
Feature activation+6.80e+00

Pos logit contributions
rity+3.246
pun+3.056
 upscale+2.842
endment+2.799
ior+2.786
Neg logit contributions
ilers-2.774
erous-2.744
itored-2.576
 Consent-2.378
 spur-2.342
Token-
Token ID12
Feature activation-5.23e-01

Pos logit contributions
rity+0.645
pun+0.604
endment+0.562
geist+0.557
 upscale+0.555
Neg logit contributions
ilers-0.557
erous-0.555
itored-0.515
 spur-0.484
 Consent-0.484
Tokennamed
Token ID13190
Feature activation-2.15e+00

Pos logit contributions
ilers+0.409
erous+0.408
itored+0.380
 Consent+0.356
 spur+0.356
Neg logit contributions
rity-0.430
pun-0.399
endment-0.372
geist-0.368
 upscale-0.368
Token,
Token ID11
Feature activation-1.49e+00

Pos logit contributions
erous+1.512
ilers+1.504
itored+1.370
 Consent+1.311
 spur+1.310
Neg logit contributions
rity-1.932
pun-1.818
endment-1.712
geist-1.675
ior-1.656
Token who
Token ID508
Feature activation-6.80e+00

Pos logit contributions
erous+1.122
ilers+1.121
itored+1.041
 spur+0.986
 Consent+0.983
Neg logit contributions
rity-1.265
pun-1.181
endment-1.105
geist-1.086
ior-1.077
Token are
Token ID389
Feature activation-5.08e+00

Pos logit contributions
ilers+4.888
erous+4.834
itored+4.458
 spur+4.312
 Consent+4.219
Neg logit contributions
rity-5.825
pun-5.474
endment-5.220
geist-5.078
ior-4.930
Token on
Token ID319
Feature activation-1.58e+01

Pos logit contributions
ilers+3.659
erous+3.646
itored+3.349
 spur+3.227
 Consent+3.226
Neg logit contributions
rity-4.408
pun-4.106
endment-3.887
geist-3.764
ior-3.746
Token Facebook
Token ID3203
Feature activation-1.07e+01

Pos logit contributions
erous+10.774
itored+9.649
ilers+9.612
 Consent+9.008
 spur+8.875
Neg logit contributions
rity-12.528
pun-11.722
endment-11.427
geist-11.172
CVE-11.060
Token,
Token ID11
Feature activation-3.83e+00

Pos logit contributions
ilers+7.391
erous+7.290
itored+6.877
 spur+6.433
 Consent+6.388
Neg logit contributions
rity-9.063
pun-8.548
endment-8.073
CVE-7.751
 upscale-7.719
Token did
Token ID750
Feature activation-6.09e+00

Pos logit contributions
ilers+2.836
erous+2.805
itored+2.609
 spur+2.502
 Consent+2.482
Neg logit contributions
rity-3.284
pun-3.059
endment-2.899
geist-2.840
 upscale-2.768
Token not
Token ID407
Feature activation-5.68e+00

Pos logit contributions
ilers+4.823
erous+4.767
itored+4.394
 spur+4.208
 Consent+4.156
Neg logit contributions
rity-4.901
pun-4.507
endment-4.228
geist-4.108
CVE-4.093
Token respond
Token ID3031
Feature activation-9.49e-01

Pos logit contributions
ilers+4.514
erous+4.423
itored+4.133
 spur+4.103
 Consent+4.007
Neg logit contributions
rity-4.546
pun-4.201
endment-3.938
geist-3.870
CVE-3.755
Token this
Token ID428
Feature activation-2.71e+00

Pos logit contributions
rity+1.365
pun+1.280
 upscale+1.180
endment+1.178
geist+1.173
Neg logit contributions
ilers-1.353
erous-1.342
itored-1.254
 Consent-1.179
 spur-1.173
Token cryptic
Token ID47774
Feature activation-4.94e+00

Pos logit contributions
erous+2.138
ilers+2.131
itored+1.996
 spur+1.902
 Consent+1.872
Neg logit contributions
rity-2.234
pun-2.050
endment-1.906
geist-1.881
ior-1.858
Token message
Token ID3275
Feature activation-2.50e+00

Pos logit contributions
erous+4.168
ilers+4.167
itored+3.950
 spur+3.766
 Consent+3.691
Neg logit contributions
rity-3.749
pun-3.404
endment-3.103
geist-3.060
CVE-3.037
Token on
Token ID319
Feature activation-1.11e+01

Pos logit contributions
ilers+2.281
erous+2.274
itored+2.163
 spur+2.070
 Consent+2.044
Neg logit contributions
rity-1.738
pun-1.573
endment-1.438
geist-1.413
 upscale-1.405
Token her
Token ID607
Feature activation-9.57e+00

Pos logit contributions
erous+5.387
ilers+5.158
itored+4.800
 spur+4.629
er+4.342
Neg logit contributions
rity-11.656
pun-10.796
endment-10.212
CVE-10.178
geist-10.071
Token Instagram
Token ID10767
Feature activation-1.58e+01

Pos logit contributions
erous+4.427
ilers+4.290
itored+4.024
 spur+3.657
 Consent+3.521
Neg logit contributions
rity-10.396
pun-9.798
endment-9.226
geist-9.153
CVE-9.138
Token following
Token ID1708
Feature activation+9.12e-01

Pos logit contributions
erous+8.533
ilers+8.276
itored+7.895
 spur+7.489
ilitating+7.157
Neg logit contributions
rity-14.233
pun-13.400
endment-12.419
geist-12.047
CVE-11.969
Token the
Token ID262
Feature activation-3.96e+00

Pos logit contributions
rity+0.809
pun+0.759
endment+0.706
geist+0.701
 upscale+0.701
Neg logit contributions
ilers-0.655
erous-0.652
itored-0.605
 Consent-0.565
 spur-0.565
Token incident
Token ID4519
Feature activation-4.49e+00

Pos logit contributions
erous+3.744
ilers+3.737
itored+3.507
 spur+3.480
 Consent+3.403
Neg logit contributions
rity-2.538
pun-2.248
endment-2.137
geist-2.068
ior-2.016
Token
Token ID198
Feature activation-7.06e-01

Pos logit contributions
erous+3.012
ilers+2.978
itored+2.772
 spur+2.609
 Consent+2.608
Neg logit contributions
rity-4.126
pun-3.846
endment-3.614
geist-3.564
 upscale-3.548
Token
Token ID198
Feature activation-8.16e-01

Pos logit contributions
ilers+0.534
erous+0.533
itored+0.496
 Consent+0.467
 spur+0.465
Neg logit contributions
rity-0.600
pun-0.559
endment-0.522
geist-0.516
 upscale-0.515
Token Maria
Token ID14200
Feature activation+3.91e+00

Pos logit contributions
ilers+2.200
erous+2.198
itored+1.989
 Consent+1.881
 spur+1.872
Neg logit contributions
rity-3.596
pun-3.392
endment-3.203
geist-3.186
ior-3.141
Token Santana
Token ID49067
Feature activation+7.22e-01

Pos logit contributions
rity+3.337
pun+3.106
endment+2.890
 upscale+2.881
geist+2.844
Neg logit contributions
ilers-2.923
erous-2.892
itored-2.691
 spur-2.513
 Consent-2.485
Token will
Token ID481
Feature activation-5.28e+00

Pos logit contributions
rity+0.631
pun+0.596
endment+0.550
 upscale+0.549
geist+0.547
Neg logit contributions
erous-0.525
ilers-0.525
itored-0.485
 Consent-0.455
 spur-0.453
Token also
Token ID635
Feature activation-4.40e+00

Pos logit contributions
ilers+4.126
erous+4.018
itored+3.757
 spur+3.668
 Consent+3.533
Neg logit contributions
rity-4.360
pun-3.995
endment-3.796
geist-3.741
ior-3.623
Token be
Token ID307
Feature activation-5.56e+00

Pos logit contributions
ilers+3.681
erous+3.604
itored+3.378
 spur+3.281
 Consent+3.187
Neg logit contributions
rity-3.393
pun-3.108
endment-2.924
geist-2.890
ior-2.791
Token on
Token ID319
Feature activation-1.58e+01

Pos logit contributions
ilers+4.249
erous+4.191
itored+3.916
 spur+3.846
esses+3.655
Neg logit contributions
rity-4.552
pun-4.169
endment-3.968
geist-3.925
 upscale-3.785
Token the
Token ID262
Feature activation-1.34e+01

Pos logit contributions
erous+9.883
ilers+9.351
itored+9.067
 Consent+8.431
er+8.399
Neg logit contributions
rity-13.233
pun-11.927
geist-11.556
endment-11.420
deen-11.260
Token scene
Token ID3715
Feature activation-4.62e+00

Pos logit contributions
ilers+8.377
erous+8.098
 spur+7.879
itored+7.804
er+7.391
Neg logit contributions
rity-11.372
pun-10.169
endment-9.919
geist-9.906
CVE-9.565
Token later
Token ID1568
Feature activation+2.91e+00

Pos logit contributions
ilers+3.513
erous+3.472
itored+3.275
 spur+3.117
 Consent+3.029
Neg logit contributions
rity-3.839
pun-3.509
endment-3.313
geist-3.288
 upscale-3.240
Token Monday
Token ID3321
Feature activation+2.22e+00

Pos logit contributions
rity+2.227
pun+2.084
 upscale+1.913
endment+1.904
ior+1.900
Neg logit contributions
ilers-2.413
erous-2.408
itored-2.244
 Consent-2.145
 spur-2.116
Token.
Token ID13
Feature activation-2.36e+00

Pos logit contributions
rity+1.548
pun+1.433
endment+1.307
 upscale+1.294
geist+1.288
Neg logit contributions
ilers-2.004
erous-1.999
itored-1.878
 Consent-1.785
 spur-1.779
Token News
Token ID3000
Feature activation-1.06e+00

Pos logit contributions
ilers+2.715
erous+2.692
itored+2.440
 spur+2.307
 Consent+2.299
Neg logit contributions
rity-4.696
pun-4.335
endment-4.160
 upscale-4.065
geist-4.055
Token plans
Token ID3352
Feature activation-3.73e+00

Pos logit contributions
ilers+0.822
erous+0.817
itored+0.761
 spur+0.721
 Consent+0.720
Neg logit contributions
rity-0.882
pun-0.818
endment-0.762
geist-0.752
 upscale-0.750
Token to
Token ID284
Feature activation-1.55e+00

Pos logit contributions
ilers+2.560
erous+2.548
itored+2.360
 spur+2.244
 Consent+2.210
Neg logit contributions
rity-3.401
pun-3.169
endment-2.963
geist-2.941
 upscale-2.891
Token air
Token ID1633
Feature activation+2.13e+00

Pos logit contributions
ilers+1.149
erous+1.139
itored+1.060
 spur+1.006
 Consent+0.994
Neg logit contributions
rity-1.351
pun-1.258
endment-1.178
geist-1.166
 upscale-1.149
Token On
Token ID1550
Feature activation-1.23e+01

Pos logit contributions
rity+1.753
pun+1.644
 upscale+1.520
endment+1.511
geist+1.510
Neg logit contributions
ilers-1.652
erous-1.642
itored-1.534
 Consent-1.434
 spur-1.425
Token the
Token ID262
Feature activation-1.58e+01

Pos logit contributions
erous+7.307
ilers+7.149
itored+6.795
 Consent+6.458
 Fathers+6.301
Neg logit contributions
rity-11.071
pun-10.411
endment-9.932
geist-9.836
 upscale-9.754
Token Record
Token ID13266
Feature activation-6.03e+00

Pos logit contributions
erous+9.530
itored+9.283
 spur+9.095
ilers+9.039
 Consent+9.024
Neg logit contributions
rity-12.635
pun-12.026
endment-11.777
CVE-11.624
conservancy-11.511
Token With
Token ID2080
Feature activation-4.76e+00

Pos logit contributions
erous+4.400
ilers+4.378
itored+4.073
 spur+3.923
 Consent+3.869
Neg logit contributions
rity-5.123
pun-4.701
endment-4.443
geist-4.331
 upscale-4.311
Token Van
Token ID6656
Feature activation-7.81e+00

Pos logit contributions
ilers+2.509
erous+2.505
itored+2.221
 Consent+2.114
 spur+2.099
Neg logit contributions
rity-5.017
pun-4.757
geist-4.551
endment-4.518
ior-4.456
Token S
Token ID311
Feature activation+6.45e-01

Pos logit contributions
ilers+3.373
erous+3.218
itored+2.882
 Consent+2.723
er+2.606
Neg logit contributions
rity-8.870
pun-8.316
geist-8.040
endment-8.022
CVE-7.949
Tokenust
Token ID436
Feature activation+3.43e+00

Pos logit contributions
rity+0.583
pun+0.546
endment+0.510
geist+0.506
 upscale+0.505
Neg logit contributions
ilers-0.452
erous-0.451
itored-0.418
 spur-0.392
 Consent-0.391
Token black
Token ID2042
Feature activation+4.48e+00

Pos logit contributions
rity+2.470
pun+2.323
 upscale+2.163
endment+2.135
geist+2.118
Neg logit contributions
ilers-2.629
erous-2.580
itored-2.453
 Consent-2.273
ilitating-2.241
Token bear
Token ID6842
Feature activation-4.95e-02

Pos logit contributions
rity+3.872
pun+3.630
 upscale+3.401
geist+3.390
endment+3.388
Neg logit contributions
ilers-3.258
erous-3.191
itored-2.990
 Consent-2.824
ilitating-2.727
Token and
Token ID290
Feature activation+1.84e+00

Pos logit contributions
ilers+0.040
erous+0.040
itored+0.037
 spur+0.035
 Consent+0.035
Neg logit contributions
rity-0.040
pun-0.037
endment-0.034
geist-0.034
 upscale-0.034
Token her
Token ID607
Feature activation+1.67e+00

Pos logit contributions
rity+1.544
pun+1.444
endment+1.334
 upscale+1.329
geist+1.327
Neg logit contributions
ilers-1.407
erous-1.399
itored-1.305
 Consent-1.223
 spur-1.214
Token two
Token ID734
Feature activation-3.94e-01

Pos logit contributions
rity+1.434
pun+1.344
endment+1.244
 upscale+1.242
geist+1.236
Neg logit contributions
ilers-1.239
erous-1.229
itored-1.149
 Consent-1.074
 spur-1.064
Token cub
Token ID13617
Feature activation-1.58e+01

Pos logit contributions
erous+0.280
ilers+0.280
itored+0.258
 spur+0.244
 Consent+0.242
Neg logit contributions
rity-0.352
pun-0.330
endment-0.309
geist-0.306
 upscale-0.303
Tokens
Token ID82
Feature activation-1.72e+00

Pos logit contributions
ilers+7.534
erous+7.167
 spur+6.414
itored+6.376
esses+6.303
Neg logit contributions
rity-15.630
pun-14.878
endment-14.317
geist-14.078
CVE-13.659
Token about
Token ID546
Feature activation-1.68e+00

Pos logit contributions
ilers+1.360
erous+1.355
itored+1.265
 spur+1.210
 Consent+1.192
Neg logit contributions
rity-1.386
pun-1.284
endment-1.200
geist-1.184
 upscale-1.179
Token 15
Token ID1315
Feature activation+1.96e+00

Pos logit contributions
ilers+1.267
erous+1.267
itored+1.174
 spur+1.118
 Consent+1.112
Neg logit contributions
rity-1.425
pun-1.326
endment-1.238
geist-1.227
 upscale-1.212
Token yards
Token ID5695
Feature activation+8.60e-01

Pos logit contributions
rity+1.599
pun+1.505
endment+1.380
 upscale+1.378
ior+1.372
Neg logit contributions
ilers-1.530
erous-1.529
itored-1.428
 Consent-1.337
 spur-1.321
Token in
Token ID287
Feature activation-3.10e+00

Pos logit contributions
rity+0.656
pun+0.611
endment+0.559
 upscale+0.559
geist+0.555
Neg logit contributions
ilers-0.723
erous-0.720
itored-0.677
 Consent-0.638
 spur-0.634
Token the
Token ID262
Feature activation+7.49e-01

Pos logit contributions
rity+1.623
pun+1.526
endment+1.417
 upscale+1.409
geist+1.401
Neg logit contributions
ilers-1.358
erous-1.348
itored-1.266
 Consent-1.179
 spur-1.174
Token Packers
Token ID17832
Feature activation-6.56e-01

Pos logit contributions
rity+0.734
pun+0.694
endment+0.651
geist+0.646
 upscale+0.646
Neg logit contributions
ilers-0.467
erous-0.465
itored-0.427
 Consent-0.394
 spur-0.393
Token.
Token ID13
Feature activation+5.91e-01

Pos logit contributions
ilers+0.520
erous+0.516
itored+0.481
 spur+0.455
 Consent+0.453
Neg logit contributions
rity-0.536
pun-0.496
endment-0.461
geist-0.457
 upscale-0.456
Token
Token ID198
Feature activation-5.79e+00

Pos logit contributions
rity+0.513
pun+0.481
endment+0.447
geist+0.444
 upscale+0.443
Neg logit contributions
ilers-0.436
erous-0.434
itored-0.404
 spur-0.378
 Consent-0.377
Token
Token ID198
Feature activation-5.33e+00

Pos logit contributions
erous+3.753
ilers+3.727
itored+3.438
 Consent+3.274
er+3.217
Neg logit contributions
rity-5.411
pun-4.964
endment-4.813
 upscale-4.685
geist-4.684
TokenHow
Token ID2437
Feature activation-1.57e+01

Pos logit contributions
ilers+3.930
erous+3.889
itored+3.621
 Consent+3.408
er+3.389
Neg logit contributions
rity-4.540
pun-4.052
 upscale-3.937
endment-3.921
geist-3.814
Token do
Token ID466
Feature activation-1.20e+01

Pos logit contributions
ilers+9.753
erous+9.421
 spur+8.648
er+8.248
 Consent+7.889
Neg logit contributions
rity-13.066
pun-12.207
endment-11.795
CVE-11.210
geist-11.081
Token you
Token ID345
Feature activation-7.90e+00

Pos logit contributions
ilers+7.111
erous+7.099
 spur+6.033
itored+6.017
 Consent+5.839
Neg logit contributions
rity-11.236
pun-10.410
endment-10.029
geist-9.807
CVE-9.796
Token view
Token ID1570
Feature activation-1.39e+00

Pos logit contributions
ilers+6.130
erous+5.966
 spur+5.550
itored+5.482
 Consent+5.302
Neg logit contributions
rity-6.376
pun-5.759
endment-5.444
geist-5.335
ior-5.146
Token the
Token ID262
Feature activation-1.12e+00

Pos logit contributions
ilers+0.992
erous+0.990
itored+0.912
 spur+0.858
 Consent+0.856
Neg logit contributions
rity-1.240
pun-1.163
endment-1.086
geist-1.076
 upscale-1.070
Token Cowboys
Token ID16535
Feature activation+2.07e+00

Pos logit contributions
ilers+0.766
erous+0.763
itored+0.701
 spur+0.658
 Consent+0.656
Neg logit contributions
rity-1.038
pun-0.973
endment-0.912
geist-0.904
 upscale-0.899
Token believe
Token ID1975
Feature activation+1.05e+00

Pos logit contributions
ilers+2.588
erous+2.559
itored+2.378
 Consent+2.335
 spur+2.297
Neg logit contributions
rity-2.713
pun-2.495
endment-2.336
geist-2.302
ior-2.259
Token the
Token ID262
Feature activation+2.98e+00

Pos logit contributions
rity+0.845
pun+0.790
endment+0.731
 upscale+0.724
geist+0.723
Neg logit contributions
ilers-0.838
erous-0.833
itored-0.781
 spur-0.732
 Consent-0.730
Token citizens
Token ID4290
Feature activation+8.41e+00

Pos logit contributions
rity+2.396
pun+2.237
endment+2.071
 upscale+2.051
geist+2.040
Neg logit contributions
ilers-2.392
erous-2.364
itored-2.226
 spur-2.077
 Consent-2.073
Token of
Token ID286
Feature activation+7.26e+00

Pos logit contributions
rity+6.235
pun+5.944
endment+5.476
geist+5.362
 upscale+5.287
Neg logit contributions
ilers-6.865
erous-6.815
itored-6.409
 spur-5.938
zee-5.909
Token F
Token ID376
Feature activation+3.09e+00

Pos logit contributions
rity+5.439
pun+4.992
 upscale+4.702
endment+4.679
ior+4.586
Neg logit contributions
ilers-5.963
erous-5.883
itored-5.568
ilitating-5.150
zee-5.062
Tokenayette
Token ID27067
Feature activation+8.97e+00

Pos logit contributions
rity+2.159
pun+1.976
endment+1.822
ior+1.794
 upscale+1.773
Neg logit contributions
erous-2.792
ilers-2.784
itored-2.647
 Consent-2.534
 spur-2.492
Tokenville
Token ID4244
Feature activation+8.26e+00

Pos logit contributions
rity+4.330
pun+4.184
endment+3.626
ior+3.509
 upscale+3.496
Neg logit contributions
ilers-9.631
erous-9.414
itored-9.036
 spur-8.557
ilitating-8.501
Token would
Token ID561
Feature activation+4.95e+00

Pos logit contributions
rity+5.891
pun+5.631
endment+5.153
geist+5.084
ior+5.080
Neg logit contributions
ilers-6.928
erous-6.772
itored-6.383
ilitating-5.907
 spur-5.903
Token want
Token ID765
Feature activation+4.76e+00

Pos logit contributions
rity+3.863
pun+3.694
 upscale+3.395
endment+3.394
ior+3.339
Neg logit contributions
erous-3.913
ilers-3.907
itored-3.681
ilitating-3.358
 Consent-3.355
Token males
Token ID10835
Feature activation+6.41e+00

Pos logit contributions
rity+3.938
pun+3.752
endment+3.485
 upscale+3.460
ior+3.420
Neg logit contributions
ilers-3.557
erous-3.500
itored-3.307
 spur-3.018
 Consent-3.012
Token with
Token ID351
Feature activation+6.37e-01

Pos logit contributions
rity+5.168
pun+4.950
geist+4.538
endment+4.529
 upscale+4.513
Neg logit contributions
erous-4.920
ilers-4.892
itored-4.557
 Consent-4.220
 spur-4.175
Tokenmark
Token ID4102
Feature activation-1.27e+00

Pos logit contributions
rity+3.945
pun+3.661
ior+3.271
endment+3.169
geist+3.144
Neg logit contributions
erous-7.441
ilers-7.357
itored-7.031
 Consent-6.805
 spur-6.721
Token Time
Token ID3862
Feature activation+7.71e+00

Pos logit contributions
ilers+0.722
erous+0.722
itored+0.653
 Consent+0.608
 spur+0.600
Neg logit contributions
rity-1.321
pun-1.251
endment-1.182
geist-1.170
 upscale-1.167
Token(
Token ID7
Feature activation+2.67e+00

Pos logit contributions
rity+5.256
pun+4.937
endment+4.459
ior+4.440
geist+4.440
Neg logit contributions
ilers-6.764
erous-6.732
itored-6.309
 spur-6.088
ilitating-6.023
Tokenns
Token ID5907
Feature activation+3.29e+00

Pos logit contributions
rity+2.363
pun+2.223
endment+2.058
ior+2.051
geist+2.041
Neg logit contributions
erous-1.904
ilers-1.897
itored-1.769
 Consent-1.661
 spur-1.655
Token)
Token ID8
Feature activation-7.20e-01

Pos logit contributions
rity+3.030
pun+2.892
ior+2.688
endment+2.682
geist+2.677
Neg logit contributions
ilers-2.194
erous-2.180
itored-1.996
 spur-1.897
ilitating-1.850
Token CPU
Token ID9135
Feature activation+9.77e+00

Pos logit contributions
ilers+0.539
erous+0.534
itored+0.497
 Consent+0.468
 spur+0.468
Neg logit contributions
rity-0.620
pun-0.579
endment-0.542
geist-0.532
 upscale-0.530
Token(
Token ID7
Feature activation+3.50e+00

Pos logit contributions
rity+5.846
pun+5.617
CVE+5.100
ior+5.015
POL+4.901
Neg logit contributions
ilers-8.952
erous-8.920
itored-8.314
 spur-8.287
ilitating-8.148
Tokenns
Token ID5907
Feature activation+3.09e+00

Pos logit contributions
rity+3.017
pun+2.843
ior+2.620
endment+2.608
geist+2.596
Neg logit contributions
erous-2.559
ilers-2.545
itored-2.382
 spur-2.259
 Consent-2.248
Token)
Token ID8
Feature activation+3.15e+00

Pos logit contributions
rity+2.832
pun+2.702
ior+2.504
endment+2.504
geist+2.494
Neg logit contributions
ilers-2.076
erous-2.061
itored-1.893
 spur-1.807
ilitating-1.759
Token Mem
Token ID4942
Feature activation+7.88e+00

Pos logit contributions
rity+2.662
pun+2.495
 upscale+2.331
geist+2.309
ior+2.308
Neg logit contributions
ilers-2.363
erous-2.358
itored-2.187
 spur-2.039
ilitating-2.012
Tokenc
Token ID66
Feature activation+8.68e+00

Pos logit contributions
rity+5.678
pun+5.384
ior+4.928
endment+4.810
CVE+4.766
Neg logit contributions
erous-6.570
ilers-6.451
itored-6.092
 spur-6.055
 Consent-5.881
Token the
Token ID262
Feature activation+1.46e+00

Pos logit contributions
rity+2.083
pun+1.959
endment+1.801
 upscale+1.800
geist+1.793
Neg logit contributions
ilers-2.016
erous-2.004
itored-1.886
 Consent-1.751
 spur-1.750
Token following
Token ID1708
Feature activation+3.64e+00

Pos logit contributions
rity+1.272
pun+1.195
endment+1.110
 upscale+1.102
geist+1.101
Neg logit contributions
ilers-1.076
erous-1.070
itored-0.996
 spur-0.930
 Consent-0.928
Token description
Token ID6764
Feature activation+3.22e-01

Pos logit contributions
rity+2.900
pun+2.733
 upscale+2.522
geist+2.501
endment+2.497
Neg logit contributions
ilers-2.873
erous-2.860
itored-2.705
 spur-2.480
 Consent-2.477
Token of
Token ID286
Feature activation+2.82e+00

Pos logit contributions
rity+0.270
pun+0.253
endment+0.235
geist+0.233
 upscale+0.232
Neg logit contributions
ilers-0.247
erous-0.246
itored-0.229
 Consent-0.215
 spur-0.215
Token their
Token ID511
Feature activation+3.45e+00

Pos logit contributions
rity+2.372
pun+2.233
endment+2.061
 upscale+2.060
geist+2.052
Neg logit contributions
ilers-2.128
erous-2.102
itored-1.982
 spur-1.837
 Consent-1.835
Token competing
Token ID11780
Feature activation+9.22e+00

Pos logit contributions
rity+2.724
pun+2.568
 upscale+2.364
endment+2.342
geist+2.341
Neg logit contributions
ilers-2.780
erous-2.749
itored-2.599
 spur-2.403
 Consent-2.393
Token technology
Token ID3037
Feature activation-3.12e+00

Pos logit contributions
rity+6.627
pun+6.405
 upscale+5.965
geist+5.765
endment+5.685
Neg logit contributions
ilers-7.477
erous-7.282
itored-7.197
ilitating-6.472
esses-6.378
Token:
Token ID25
Feature activation-9.49e-01

Pos logit contributions
ilers+2.186
erous+2.167
itored+1.968
 spur+1.901
 Consent+1.884
Neg logit contributions
rity-2.816
pun-2.626
endment-2.459
 upscale-2.424
ior-2.410
Token Distribut
Token ID46567
Feature activation-6.22e+00

Pos logit contributions
ilers+0.727
erous+0.725
itored+0.675
 Consent+0.638
 spur+0.635
Neg logit contributions
rity-0.795
pun-0.740
endment-0.688
geist-0.681
 upscale-0.681
Tokening
Token ID278
Feature activation+2.40e+00

Pos logit contributions
ilers+2.687
erous+2.603
itored+2.240
 Consent+1.993
 spur+1.952
Neg logit contributions
rity-7.187
pun-6.912
endment-6.603
geist-6.540
 upscale-6.524
Token applications
Token ID5479
Feature activation+8.86e+00

Pos logit contributions
rity+1.928
pun+1.800
 upscale+1.657
endment+1.652
geist+1.641
Neg logit contributions
ilers-1.918
erous-1.906
itored-1.794
 spur-1.673
 Consent-1.664
Token desert
Token ID10326
Feature activation-5.25e+00

Pos logit contributions
rity+1.474
pun+1.383
 upscale+1.295
endment+1.277
geist+1.263
Neg logit contributions
ilers-1.261
erous-1.250
itored-1.171
 Consent-1.085
 spur-1.074
Token.
Token ID13
Feature activation+2.99e+00

Pos logit contributions
ilers+4.029
erous+4.014
itored+3.741
 spur+3.623
 Consent+3.491
Neg logit contributions
rity-4.314
pun-3.958
geist-3.695
endment-3.677
 upscale-3.611
Token Obama
Token ID2486
Feature activation+1.99e+00

Pos logit contributions
rity+2.447
pun+2.297
endment+2.109
geist+2.105
 upscale+2.098
Neg logit contributions
ilers-2.346
erous-2.311
itored-2.165
 spur-2.035
ilitating-1.998
Token,
Token ID11
Feature activation+1.48e+00

Pos logit contributions
rity+1.644
pun+1.545
endment+1.424
 upscale+1.417
geist+1.415
Neg logit contributions
ilers-1.546
erous-1.534
itored-1.439
 Consent-1.352
 spur-1.337
Token by
Token ID416
Feature activation-6.11e+00

Pos logit contributions
rity+1.190
pun+1.115
endment+1.027
 upscale+1.025
geist+1.019
Neg logit contributions
ilers-1.180
erous-1.173
itored-1.097
 Consent-1.031
 spur-1.025
Token contrast
Token ID6273
Feature activation+1.14e+01

Pos logit contributions
ilers+4.360
erous+4.348
 spur+3.917
itored+3.903
 Consent+3.749
Neg logit contributions
rity-5.371
pun-4.899
geist-4.577
endment-4.550
ior-4.445
Token,
Token ID11
Feature activation+3.58e+00

Pos logit contributions
rity+6.352
pun+6.143
 upscale+5.596
ior+5.545
geist+5.447
Neg logit contributions
ilers-10.677
erous-10.286
itored-9.806
 Consent-9.511
ilitating-9.370
Token could
Token ID714
Feature activation+2.56e+00

Pos logit contributions
rity+2.793
pun+2.648
 upscale+2.424
endment+2.401
ior+2.395
Neg logit contributions
ilers-2.917
erous-2.881
itored-2.712
 Consent-2.565
 spur-2.483
Token only
Token ID691
Feature activation+3.99e+00

Pos logit contributions
rity+1.905
pun+1.778
endment+1.617
 upscale+1.617
geist+1.602
Neg logit contributions
erous-2.186
ilers-2.185
itored-2.056
 Consent-1.942
 spur-1.918
Token wish
Token ID4601
Feature activation+4.77e+00

Pos logit contributions
rity+3.159
pun+3.017
 upscale+2.797
ior+2.743
endment+2.724
Neg logit contributions
erous-3.141
ilers-3.133
itored-2.964
 Consent-2.757
ilitating-2.695
Token for
Token ID329
Feature activation-1.10e+00

Pos logit contributions
rity+3.829
pun+3.610
 upscale+3.331
endment+3.294
ior+3.291
Neg logit contributions
ilers-3.776
erous-3.687
itored-3.490
 Consent-3.248
 spur-3.240
Token arrested
Token ID5169
Feature activation+2.53e+00

Pos logit contributions
rity+1.449
pun+1.360
endment+1.253
 upscale+1.249
geist+1.238
Neg logit contributions
ilers-1.403
erous-1.393
itored-1.295
 Consent-1.222
 spur-1.209
Token within
Token ID1626
Feature activation+4.61e-01

Pos logit contributions
rity+2.064
pun+1.933
endment+1.783
 upscale+1.774
geist+1.768
Neg logit contributions
ilers-1.985
erous-1.980
itored-1.843
 Consent-1.733
 spur-1.721
Token the
Token ID262
Feature activation-2.13e+00

Pos logit contributions
rity+0.376
pun+0.351
endment+0.325
geist+0.322
 upscale+0.321
Neg logit contributions
ilers-0.363
erous-0.362
itored-0.338
 spur-0.318
 Consent-0.318
Token parameters
Token ID10007
Feature activation+5.70e-01

Pos logit contributions
ilers+1.698
erous+1.689
itored+1.584
 spur+1.525
 Consent+1.514
Neg logit contributions
rity-1.675
pun-1.581
geist-1.478
endment-1.453
 upscale-1.427
Token of
Token ID286
Feature activation+2.68e+00

Pos logit contributions
rity+0.478
pun+0.447
endment+0.414
geist+0.411
 upscale+0.410
Neg logit contributions
ilers-0.438
erous-0.436
itored-0.406
 Consent-0.381
 spur-0.381
Token law
Token ID1099
Feature activation+9.20e+00

Pos logit contributions
rity+2.272
pun+2.125
endment+1.969
 upscale+1.951
ior+1.933
Neg logit contributions
ilers-2.027
erous-2.014
itored-1.876
 spur-1.752
 Consent-1.746
Token."
Token ID526
Feature activation+3.09e+00

Pos logit contributions
rity+6.675
pun+6.290
endment+5.747
ior+5.596
 upscale+5.561
Neg logit contributions
ilers-7.708
erous-7.651
itored-7.093
 Consent-6.685
 spur-6.597
Token Sources
Token ID26406
Feature activation+3.49e+00

Pos logit contributions
rity+2.592
pun+2.422
endment+2.242
 upscale+2.220
geist+2.219
Neg logit contributions
ilers-2.365
erous-2.354
itored-2.189
 spur-2.061
 Consent-2.046
Token in
Token ID287
Feature activation+3.46e+00

Pos logit contributions
rity+2.552
pun+2.376
endment+2.156
 upscale+2.149
geist+2.145
Neg logit contributions
ilers-3.048
erous-3.041
itored-2.848
 Consent-2.703
 spur-2.691
Token the
Token ID262
Feature activation+2.01e+00

Pos logit contributions
rity+2.862
pun+2.671
 upscale+2.472
endment+2.469
ior+2.432
Neg logit contributions
ilers-2.679
erous-2.676
itored-2.480
 Consent-2.336
 spur-2.327
Token Delhi
Token ID12517
Feature activation+5.65e+00

Pos logit contributions
rity+1.583
pun+1.472
endment+1.357
 upscale+1.348
geist+1.340
Neg logit contributions
ilers-1.652
erous-1.647
itored-1.540
 spur-1.452
 Consent-1.450
Token tea
Token ID8887
Feature activation+3.45e+00

Pos logit contributions
rity+2.267
pun+2.106
endment+1.959
 upscale+1.944
ior+1.923
Neg logit contributions
ilers-2.303
erous-2.287
itored-2.149
 Consent-2.020
 spur-2.002
Token at
Token ID379
Feature activation+2.01e+00

Pos logit contributions
rity+2.730
pun+2.569
endment+2.381
geist+2.358
 upscale+2.351
Neg logit contributions
ilers-2.764
erous-2.745
itored-2.589
 Consent-2.412
 spur-2.378
Token night
Token ID1755
Feature activation+3.14e+00

Pos logit contributions
rity+1.637
pun+1.526
endment+1.413
 upscale+1.401
geist+1.400
Neg logit contributions
ilers-1.596
erous-1.588
itored-1.484
 Consent-1.395
 spur-1.394
Token.
Token ID13
Feature activation+3.96e+00

Pos logit contributions
rity+2.443
pun+2.292
endment+2.128
geist+2.091
 upscale+2.088
Neg logit contributions
ilers-2.568
erous-2.564
itored-2.398
 Consent-2.274
 spur-2.238
Token
Token ID198
Feature activation-1.65e+00

Pos logit contributions
rity+3.405
pun+3.215
endment+2.984
 upscale+2.967
geist+2.963
Neg logit contributions
ilers-2.930
erous-2.892
itored-2.698
 spur-2.517
ilitating-2.470
Token
Token ID198
Feature activation+6.42e-01

Pos logit contributions
erous+1.211
ilers+1.209
itored+1.121
 Consent+1.059
 spur+1.049
Neg logit contributions
rity-1.441
pun-1.336
endment-1.259
geist-1.242
 upscale-1.239
TokenPaul
Token ID12041
Feature activation-1.07e+00

Pos logit contributions
rity+0.504
pun+0.468
endment+0.432
geist+0.428
 upscale+0.427
Neg logit contributions
ilers-0.528
erous-0.526
itored-0.492
 Consent-0.465
 spur-0.465
Token Sol
Token ID4294
Feature activation+8.49e-02

Pos logit contributions
ilers+0.804
erous+0.802
itored+0.745
 Consent+0.702
 spur+0.700
Neg logit contributions
rity-0.917
pun-0.856
endment-0.797
geist-0.789
 upscale-0.788
Tokenman
Token ID805
Feature activation+4.04e+00

Pos logit contributions
rity+0.051
pun+0.047
endment+0.042
geist+0.041
 upscale+0.041
Neg logit contributions
ilers-0.085
erous-0.085
itored-0.080
 spur-0.077
 Consent-0.077
Token:
Token ID25
Feature activation-1.48e+00

Pos logit contributions
rity+3.275
pun+3.112
geist+2.854
ior+2.847
 upscale+2.831
Neg logit contributions
ilers-3.133
erous-3.072
itored-2.912
 spur-2.735
ilitating-2.691
Token Dec
Token ID4280
Feature activation+2.20e+00

Pos logit contributions
erous+0.943
ilers+0.943
itored+0.863
 Consent+0.819
 spur+0.804
Neg logit contributions
rity-1.436
pun-1.348
endment-1.267
geist-1.254
 upscale-1.249
Token now
Token ID783
Feature activation-1.65e-02

Pos logit contributions
ilers+1.863
erous+1.857
itored+1.744
 spur+1.665
 Consent+1.642
Neg logit contributions
rity-1.871
pun-1.731
endment-1.613
geist-1.591
ior-1.569
Token on
Token ID319
Feature activation-1.02e+01

Pos logit contributions
ilers+0.013
erous+0.013
itored+0.012
 spur+0.012
 Consent+0.012
Neg logit contributions
rity-0.013
pun-0.012
endment-0.011
geist-0.011
 upscale-0.011
Token display
Token ID3359
Feature activation+4.37e+00

Pos logit contributions
erous+7.585
ilers+7.264
itored+6.827
 spur+6.588
 Consent+6.567
Neg logit contributions
rity-8.196
pun-7.773
geist-7.429
endment-7.223
ior-7.106
Token at
Token ID379
Feature activation+1.51e+00

Pos logit contributions
rity+3.304
pun+3.084
endment+2.824
 upscale+2.817
ior+2.806
Neg logit contributions
ilers-3.667
erous-3.656
itored-3.380
 Consent-3.219
 spur-3.182
Token the
Token ID262
Feature activation+2.19e-01

Pos logit contributions
rity+1.325
pun+1.237
endment+1.152
 upscale+1.151
geist+1.142
Neg logit contributions
ilers-1.105
erous-1.097
itored-1.022
 spur-0.951
 Consent-0.947
Token National
Token ID2351
Feature activation+1.77e+00

Pos logit contributions
rity+0.180
pun+0.168
endment+0.155
 upscale+0.154
geist+0.154
Neg logit contributions
ilers-0.171
erous-0.171
itored-0.159
 spur-0.150
 Consent-0.150
Token Museum
Token ID9594
Feature activation+2.28e+00

Pos logit contributions
rity+1.501
pun+1.406
endment+1.304
 upscale+1.295
geist+1.290
Neg logit contributions
ilers-1.337
erous-1.327
itored-1.238
 spur-1.160
 Consent-1.152
Token of
Token ID286
Feature activation+7.93e+00

Pos logit contributions
rity+1.861
pun+1.736
endment+1.613
geist+1.596
 upscale+1.595
Neg logit contributions
ilers-1.793
erous-1.785
itored-1.663
 spur-1.565
 Consent-1.561
Token the
Token ID262
Feature activation+5.51e+00

Pos logit contributions
rity+5.956
pun+5.449
 upscale+5.295
ior+5.276
endment+5.034
Neg logit contributions
ilers-6.352
erous-6.284
itored-6.033
 spur-5.534
esses-5.465
Token United
Token ID1578
Feature activation+6.40e+00

Pos logit contributions
rity+4.021
pun+3.714
ior+3.496
 upscale+3.480
geist+3.444
Neg logit contributions
erous-4.648
ilers-4.641
itored-4.492
ilitating-4.071
 spur-4.070
Token States
Token ID1829
Feature activation+4.88e+00

Pos logit contributions
rity+5.786
pun+5.304
ior+5.147
 upscale+5.051
geist+5.036
Neg logit contributions
ilers-4.263
erous-4.257
itored-4.015
ilitating-3.698
 spur-3.619
Token and
Token ID290
Feature activation-3.98e+00

Pos logit contributions
erous+3.437
ilers+3.423
itored+3.183
 spur+3.072
 Consent+2.955
Neg logit contributions
rity-3.986
pun-3.636
endment-3.439
geist-3.381
 upscale-3.338
Token we
Token ID356
Feature activation-1.07e+00

Pos logit contributions
erous+2.977
ilers+2.974
itored+2.712
 spur+2.660
 Consent+2.603
Neg logit contributions
rity-3.394
pun-3.129
endment-2.952
geist-2.901
ior-2.841
Token're
Token ID821
Feature activation+2.43e+00

Pos logit contributions
ilers+0.837
erous+0.830
itored+0.774
 spur+0.737
 Consent+0.728
Neg logit contributions
rity-0.889
pun-0.827
endment-0.773
geist-0.762
 upscale-0.754
Token an
Token ID281
Feature activation+1.52e+00

Pos logit contributions
rity+1.906
pun+1.791
endment+1.645
 upscale+1.644
geist+1.634
Neg logit contributions
ilers-1.976
erous-1.975
itored-1.847
 Consent-1.739
 spur-1.718
Token overall
Token ID4045
Feature activation-1.08e-02

Pos logit contributions
rity+1.247
pun+1.163
endment+1.077
geist+1.068
 upscale+1.063
Neg logit contributions
ilers-1.193
erous-1.187
itored-1.108
 spur-1.044
 Consent-1.042
Token community
Token ID2055
Feature activation+1.28e+00

Pos logit contributions
ilers+0.008
erous+0.008
itored+0.008
 spur+0.007
 Consent+0.007
Neg logit contributions
rity-0.009
pun-0.008
endment-0.008
geist-0.008
 upscale-0.008
Token now
Token ID783
Feature activation+5.20e+00

Pos logit contributions
rity+1.015
pun+0.945
endment+0.873
geist+0.865
 upscale+0.863
Neg logit contributions
ilers-1.037
erous-1.033
itored-0.966
 Consent-0.911
 spur-0.911
Token,"
Token ID553
Feature activation-4.25e+00

Pos logit contributions
rity+3.747
pun+3.538
endment+3.216
 upscale+3.214
geist+3.206
Neg logit contributions
erous-4.483
ilers-4.476
itored-4.220
 Consent-4.001
 spur-3.916
Token Cue
Token ID40568
Feature activation+6.49e-01

Pos logit contributions
ilers+3.519
erous+3.493
itored+3.286
 Consent+3.154
 spur+3.139
Neg logit contributions
rity-3.230
pun-2.954
endment-2.780
geist-2.737
 upscale-2.710
Tokenll
Token ID297
Feature activation+2.59e+00

Pos logit contributions
rity+0.515
pun+0.481
endment+0.443
geist+0.439
 upscale+0.438
Neg logit contributions
ilers-0.526
erous-0.525
itored-0.491
 spur-0.464
 Consent-0.463
Tokenar
Token ID283
Feature activation+1.19e+00

Pos logit contributions
rity+2.135
pun+2.020
ior+1.863
endment+1.839
geist+1.822
Neg logit contributions
ilers-2.003
erous-1.968
itored-1.859
 Consent-1.759
 spur-1.758
Token.
Token ID13
Feature activation-2.63e+00

Pos logit contributions
rity+0.473
pun+0.440
endment+0.406
geist+0.401
 upscale+0.401
Neg logit contributions
ilers-0.499
erous-0.497
itored-0.465
 spur-0.441
 Consent-0.440
Token Though
Token ID7486
Feature activation+1.17e+00

Pos logit contributions
ilers+1.830
erous+1.826
itored+1.673
 Consent+1.655
 Fathers+1.574
Neg logit contributions
rity-2.378
pun-2.220
endment-2.079
 upscale-2.053
ior-2.034
Token they
Token ID484
Feature activation+2.81e+00

Pos logit contributions
rity+0.979
pun+0.914
endment+0.849
geist+0.842
 upscale+0.841
Neg logit contributions
ilers-0.894
erous-0.890
itored-0.831
 spur-0.778
 Consent-0.777
Token are
Token ID389
Feature activation-2.65e-01

Pos logit contributions
rity+2.206
pun+2.051
 upscale+1.892
endment+1.881
geist+1.878
Neg logit contributions
ilers-2.281
erous-2.275
itored-2.140
 Consent-2.002
 spur-1.980
Token incapable
Token ID23402
Feature activation-2.07e+00

Pos logit contributions
ilers+0.208
erous+0.207
itored+0.193
 spur+0.184
 Consent+0.183
Neg logit contributions
rity-0.217
pun-0.202
endment-0.187
geist-0.185
 upscale-0.184
Token of
Token ID286
Feature activation+2.90e+00

Pos logit contributions
ilers+1.563
erous+1.553
itored+1.443
 spur+1.361
 Consent+1.359
Neg logit contributions
rity-1.765
pun-1.649
endment-1.530
geist-1.521
 upscale-1.516
Token real
Token ID1103
Feature activation-6.26e-01

Pos logit contributions
rity+2.405
pun+2.251
endment+2.091
 upscale+2.085
geist+2.064
Neg logit contributions
ilers-2.229
erous-2.222
itored-2.088
 Consent-1.936
 spur-1.920
Tokenising
Token ID1710
Feature activation-4.24e+00

Pos logit contributions
ilers+0.509
erous+0.507
itored+0.473
 spur+0.449
 Consent+0.449
Neg logit contributions
rity-0.496
pun-0.461
endment-0.426
geist-0.423
 upscale-0.421
Token it
Token ID340
Feature activation-1.38e+00

Pos logit contributions
erous+3.054
ilers+3.022
itored+2.764
 Consent+2.695
 spur+2.656
Neg logit contributions
rity-3.699
pun-3.470
endment-3.238
geist-3.189
ior-3.173
Token;
Token ID26
Feature activation-2.59e+00

Pos logit contributions
erous+1.077
ilers+1.074
itored+0.996
 spur+0.948
 Consent+0.944
Neg logit contributions
rity-1.147
pun-1.065
endment-0.993
 upscale-0.975
geist-0.973
Token this
Token ID428
Feature activation+1.16e+00

Pos logit contributions
ilers+1.892
erous+1.883
itored+1.728
 Consent+1.657
 spur+1.649
Neg logit contributions
rity-2.251
pun-2.105
endment-1.963
geist-1.928
 upscale-1.925
Token.
Token ID13
Feature activation+4.33e+00

Pos logit contributions
ilers+0.778
erous+0.776
itored+0.723
 spur+0.679
 Consent+0.679
Neg logit contributions
rity-0.823
pun-0.766
endment-0.713
geist-0.707
 upscale-0.705
Token Todd
Token ID14377
Feature activation-2.28e+00

Pos logit contributions
rity+4.180
pun+3.991
 upscale+3.710
endment+3.705
geist+3.701
Neg logit contributions
ilers-2.717
erous-2.655
itored-2.447
 spur-2.242
 Consent-2.197
Token C
Token ID327
Feature activation+9.67e-01

Pos logit contributions
erous+1.604
ilers+1.597
itored+1.471
 Consent+1.393
 spur+1.379
Neg logit contributions
rity-2.054
pun-1.926
endment-1.799
geist-1.783
 upscale-1.780
Token.
Token ID13
Feature activation-6.82e-01

Pos logit contributions
rity+0.819
pun+0.767
endment+0.712
ior+0.705
geist+0.702
Neg logit contributions
ilers-0.730
erous-0.728
itored-0.684
 Consent-0.644
 spur-0.641
Token Gilbert
Token ID24023
Feature activation+4.66e+00

Pos logit contributions
ilers+0.473
erous+0.471
itored+0.435
 Consent+0.407
 spur+0.406
Neg logit contributions
rity-0.622
pun-0.584
endment-0.546
geist-0.541
 upscale-0.540
Token (
Token ID357
Feature activation+3.80e+00

Pos logit contributions
rity+3.543
pun+3.341
endment+3.052
ior+3.037
geist+3.033
Neg logit contributions
ilers-3.863
erous-3.796
itored-3.622
 Consent-3.379
 spur-3.344
TokenR
Token ID49
Feature activation+5.96e+00

Pos logit contributions
rity+2.661
pun+2.485
endment+2.230
 upscale+2.187
ior+2.184
Neg logit contributions
ilers-3.428
erous-3.406
itored-3.236
 spur-3.081
 Consent-3.075
Token-
Token ID12
Feature activation+5.90e+00

Pos logit contributions
rity+4.467
pun+4.212
endment+3.872
ior+3.860
 upscale+3.771
Neg logit contributions
ilers-4.922
erous-4.900
itored-4.627
 Consent-4.424
 spur-4.376
TokenS
Token ID50
Feature activation+4.21e+00

Pos logit contributions
rity+4.182
pun+3.974
endment+3.574
ior+3.534
geist+3.527
Neg logit contributions
ilers-5.107
erous-5.097
itored-4.810
 spur-4.597
 Consent-4.525
Tokenhen
Token ID831
Feature activation-6.94e-01

Pos logit contributions
rity+3.499
pun+3.325
endment+3.071
ior+3.047
 upscale+2.987
Neg logit contributions
erous-3.201
ilers-3.165
itored-2.996
 Consent-2.829
 spur-2.793
Tokenando
Token ID25440
Feature activation-2.05e+00

Pos logit contributions
ilers+0.663
erous+0.661
itored+0.623
 spur+0.594
 Consent+0.592
Neg logit contributions
rity-0.451
pun-0.410
endment-0.376
geist-0.370
 upscale-0.368
Token Jersey
Token ID8221
Feature activation+6.64e-02

Pos logit contributions
rity+0.954
pun+0.895
endment+0.830
 upscale+0.824
geist+0.823
Neg logit contributions
ilers-0.815
erous-0.811
itored-0.755
 Consent-0.707
 spur-0.706
Token,
Token ID11
Feature activation+2.29e+00

Pos logit contributions
rity+0.053
pun+0.049
endment+0.046
geist+0.045
 upscale+0.045
Neg logit contributions
ilers-0.053
erous-0.053
itored-0.050
 spur-0.047
 Consent-0.047
Token said
Token ID531
Feature activation-2.03e+00

Pos logit contributions
rity+1.880
pun+1.759
 upscale+1.621
endment+1.611
geist+1.604
Neg logit contributions
ilers-1.799
erous-1.795
itored-1.668
 Consent-1.567
 spur-1.560
Token on
Token ID319
Feature activation-6.57e+00

Pos logit contributions
erous+1.572
ilers+1.571
itored+1.467
 spur+1.397
 Consent+1.388
Neg logit contributions
rity-1.677
pun-1.575
endment-1.458
geist-1.446
 upscale-1.419
Token Thursday
Token ID3635
Feature activation+2.76e+00

Pos logit contributions
erous+4.852
ilers+4.700
itored+4.394
 Consent+4.163
 spur+4.154
Neg logit contributions
rity-5.637
pun-5.308
endment-4.976
geist-4.964
 upscale-4.789
Token that
Token ID326
Feature activation-2.69e+00

Pos logit contributions
rity+2.063
pun+1.903
endment+1.759
 upscale+1.755
geist+1.730
Neg logit contributions
ilers-2.358
erous-2.339
itored-2.192
 Consent-2.072
 spur-2.063
Token the
Token ID262
Feature activation+2.13e+00

Pos logit contributions
erous+2.090
ilers+2.082
itored+1.954
 spur+1.868
 Consent+1.850
Neg logit contributions
rity-2.200
pun-2.069
endment-1.912
geist-1.897
ior-1.858
Token Food
Token ID7318
Feature activation+8.30e+00

Pos logit contributions
rity+1.863
pun+1.741
 upscale+1.628
endment+1.615
ior+1.602
Neg logit contributions
ilers-1.553
erous-1.541
itored-1.432
 Consent-1.331
 spur-1.320
Token and
Token ID290
Feature activation+4.89e+00

Pos logit contributions
rity+5.842
pun+5.530
endment+5.045
ior+4.933
 upscale+4.925
Neg logit contributions
erous-7.151
ilers-7.007
itored-6.673
 spur-6.119
ilitating-6.092
Token Drug
Token ID11095
Feature activation+3.24e+00

Pos logit contributions
rity+3.890
pun+3.644
ior+3.347
 upscale+3.332
endment+3.312
Neg logit contributions
ilers-3.926
erous-3.750
itored-3.592
 spur-3.302
ilitating-3.293
Token Administration
Token ID8694
Feature activation+4.84e+00

Pos logit contributions
rity+1.193
pun+1.041
 upscale+0.849
endment+0.844
geist+0.832
Neg logit contributions
ilers-3.978
erous-3.956
itored-3.805
 spur-3.643
zee-3.601
Token not
Token ID407
Feature activation-3.42e+00

Pos logit contributions
ilers+1.023
erous+1.021
itored+0.955
 spur+0.907
 Consent+0.901
Neg logit contributions
rity-1.004
pun-0.933
endment-0.865
geist-0.856
 upscale-0.849
Token the
Token ID262
Feature activation-4.00e+00

Pos logit contributions
erous+2.407
ilers+2.391
itored+2.198
 spur+2.097
 Consent+2.073
Neg logit contributions
rity-3.074
pun-2.877
endment-2.706
geist-2.682
ior-2.627
Token Syrian
Token ID6318
Feature activation-3.53e+00

Pos logit contributions
erous+2.916
ilers+2.909
itored+2.643
 spur+2.576
 Consent+2.547
Neg logit contributions
rity-3.454
pun-3.215
endment-3.039
geist-3.014
ior-2.930
Token government
Token ID1230
Feature activation-4.66e+00

Pos logit contributions
ilers+2.514
erous+2.490
itored+2.293
 spur+2.202
 Consent+2.190
Neg logit contributions
rity-3.113
pun-2.906
endment-2.720
geist-2.710
 upscale-2.676
Token,
Token ID11
Feature activation-7.51e+00

Pos logit contributions
ilers+3.465
erous+3.438
itored+3.191
 spur+3.064
 Consent+3.009
Neg logit contributions
rity-3.933
pun-3.651
endment-3.429
geist-3.413
 upscale-3.365
Token that
Token ID326
Feature activation-8.54e+00

Pos logit contributions
ilers+5.259
erous+5.216
itored+4.854
 spur+4.779
 Consent+4.571
Neg logit contributions
rity-6.506
pun-6.039
endment-5.694
geist-5.680
 upscale-5.526
Token used
Token ID973
Feature activation-2.91e+00

Pos logit contributions
erous+5.614
ilers+5.541
 spur+5.249
itored+5.160
 Consent+4.825
Neg logit contributions
rity-7.717
pun-7.039
endment-6.911
geist-6.823
ior-6.502
Token Sar
Token ID6866
Feature activation+7.51e+00

Pos logit contributions
erous+2.347
ilers+2.341
itored+2.182
 spur+2.087
 Consent+2.060
Neg logit contributions
rity-2.329
pun-2.150
endment-2.010
geist-2.005
ior-1.958
Tokenin
Token ID259
Feature activation+2.95e+00

Pos logit contributions
rity+5.594
pun+5.257
 upscale+4.836
ior+4.781
CVE+4.607
Neg logit contributions
erous-6.259
ilers-6.249
itored-5.900
 Consent-5.556
esses-5.475
Token nerve
Token ID16384
Feature activation+1.84e+00

Pos logit contributions
rity+2.381
pun+2.228
endment+2.051
 upscale+2.038
geist+2.033
Neg logit contributions
erous-2.349
ilers-2.342
itored-2.184
 Consent-2.050
 spur-2.030
Token gas
Token ID3623
Feature activation-6.48e-01

Pos logit contributions
rity+1.485
pun+1.391
 upscale+1.281
endment+1.276
ior+1.269
Neg logit contributions
ilers-1.460
erous-1.459
itored-1.359
 Consent-1.273
 spur-1.258
Token he
Token ID339
Feature activation+8.94e-01

Pos logit contributions
rity+0.980
pun+0.916
endment+0.847
 upscale+0.842
geist+0.839
Neg logit contributions
ilers-0.932
erous-0.927
itored-0.867
 Consent-0.814
 spur-0.810
Token was
Token ID373
Feature activation+1.75e-01

Pos logit contributions
rity+0.809
pun+0.761
endment+0.708
 upscale+0.705
geist+0.703
Neg logit contributions
ilers-0.626
erous-0.621
itored-0.577
 Consent-0.538
 spur-0.534
Token a
Token ID257
Feature activation-1.88e+00

Pos logit contributions
rity+0.146
pun+0.136
endment+0.126
geist+0.125
 upscale+0.124
Neg logit contributions
ilers-0.135
erous-0.135
itored-0.125
 spur-0.119
 Consent-0.118
Token big
Token ID1263
Feature activation-7.98e+00

Pos logit contributions
ilers+1.540
erous+1.539
itored+1.435
 spur+1.371
 Consent+1.356
Neg logit contributions
rity-1.480
pun-1.371
endment-1.270
geist-1.256
 upscale-1.247
Token part
Token ID636
Feature activation-5.73e+00

Pos logit contributions
erous+5.016
ilers+4.973
 spur+4.643
itored+4.521
 Consent+4.234
Neg logit contributions
rity-7.549
pun-6.939
endment-6.686
geist-6.539
ior-6.379
Token of
Token ID286
Feature activation-5.69e+00

Pos logit contributions
ilers+4.020
erous+4.008
itored+3.735
 spur+3.556
 Consent+3.400
Neg logit contributions
rity-5.062
pun-4.668
endment-4.401
geist-4.393
 upscale-4.337
Token their
Token ID511
Feature activation-3.15e+00

Pos logit contributions
erous+4.020
ilers+3.994
 spur+3.635
itored+3.605
 Consent+3.422
Neg logit contributions
rity-5.029
pun-4.683
endment-4.447
geist-4.428
CVE-4.277
Token hockey
Token ID12217
Feature activation+6.71e-01

Pos logit contributions
ilers+2.150
erous+2.138
itored+1.942
 spur+1.906
 Consent+1.836
Neg logit contributions
rity-2.875
pun-2.707
endment-2.558
geist-2.514
CVE-2.467
Token team
Token ID1074
Feature activation-1.96e+00

Pos logit contributions
rity+0.594
pun+0.556
 upscale+0.516
endment+0.516
geist+0.515
Neg logit contributions
ilers-0.485
erous-0.482
itored-0.450
 Consent-0.419
 spur-0.414
Token.
Token ID13
Feature activation+1.34e+00

Pos logit contributions
ilers+1.533
erous+1.525
itored+1.416
 spur+1.352
 Consent+1.333
Neg logit contributions
rity-1.602
pun-1.494
endment-1.391
geist-1.371
 upscale-1.368
Token It
Token ID632
Feature activation+9.61e-01

Pos logit contributions
rity+1.217
pun+1.148
endment+1.067
geist+1.063
 upscale+1.061
Neg logit contributions
ilers-0.933
erous-0.927
itored-0.857
 spur-0.799
 Consent-0.791
Token companies
Token ID2706
Feature activation+1.53e+00

Pos logit contributions
rity+2.902
pun+2.744
 upscale+2.543
endment+2.530
geist+2.512
Neg logit contributions
ilers-2.399
erous-2.355
itored-2.229
 Consent-2.061
 spur-2.047
Token that
Token ID326
Feature activation-1.77e+00

Pos logit contributions
rity+1.241
pun+1.158
endment+1.070
geist+1.061
 upscale+1.060
Neg logit contributions
ilers-1.209
erous-1.204
itored-1.126
 Consent-1.059
 spur-1.058
Token operate
Token ID8076
Feature activation+2.64e+00

Pos logit contributions
erous+1.354
ilers+1.352
itored+1.248
 spur+1.200
 Consent+1.188
Neg logit contributions
rity-1.492
pun-1.391
endment-1.306
geist-1.286
ior-1.261
Token Medicare
Token ID13771
Feature activation+1.09e+00

Pos logit contributions
rity+2.159
pun+2.012
 upscale+1.859
endment+1.857
geist+1.844
Neg logit contributions
ilers-2.082
erous-2.064
itored-1.931
 Consent-1.813
 spur-1.808
Token Advantage
Token ID45318
Feature activation+7.85e-01

Pos logit contributions
rity+0.887
pun+0.829
endment+0.765
 upscale+0.759
geist+0.759
Neg logit contributions
ilers-0.861
erous-0.856
itored-0.801
 spur-0.752
 Consent-0.752
Token plans
Token ID3352
Feature activation-1.17e+00

Pos logit contributions
rity+0.704
pun+0.661
endment+0.616
geist+0.611
 upscale+0.610
Neg logit contributions
ilers-0.556
erous-0.554
itored-0.513
 spur-0.479
 Consent-0.479
Token,
Token ID11
Feature activation-1.39e+00

Pos logit contributions
ilers+0.932
erous+0.930
itored+0.865
 spur+0.823
 Consent+0.816
Neg logit contributions
rity-0.943
pun-0.867
endment-0.809
geist-0.802
 upscale-0.795
Token those
Token ID883
Feature activation+4.59e+00

Pos logit contributions
erous+1.063
ilers+1.063
itored+0.985
 spur+0.937
 Consent+0.932
Neg logit contributions
rity-1.171
pun-1.090
endment-1.018
geist-1.008
 upscale-0.997
Token privately
Token ID17092
Feature activation-1.68e+00

Pos logit contributions
rity+3.772
pun+3.534
 upscale+3.327
ior+3.253
endment+3.248
Neg logit contributions
ilers-3.526
erous-3.449
itored-3.274
 Consent-3.036
zee-3.016
Token run
Token ID1057
Feature activation-1.54e+00

Pos logit contributions
erous+1.183
ilers+1.170
itored+1.082
 spur+1.022
 Consent+1.014
Neg logit contributions
rity-1.534
pun-1.421
endment-1.347
geist-1.325
 upscale-1.306
Token alternatives
Token ID14693
Feature activation+6.01e+00

Pos logit contributions
erous+1.116
ilers+1.110
itored+1.025
 spur+0.977
 Consent+0.969
Neg logit contributions
rity-1.348
pun-1.261
endment-1.182
geist-1.170
ior-1.147
Token,
Token ID11
Feature activation-8.05e-01

Pos logit contributions
ilers+0.761
erous+0.760
itored+0.710
 spur+0.672
 Consent+0.672
Neg logit contributions
rity-0.719
pun-0.668
endment-0.617
geist-0.610
 upscale-0.607
Token
Token ID564
Feature activation+9.83e+00

Pos logit contributions
ilers+0.688
erous+0.685
itored+0.643
 spur+0.609
 Consent+0.609
Neg logit contributions
rity-0.606
pun-0.561
endment-0.516
geist-0.510
 upscale-0.509
Token
Token ID250
Feature activation-2.73e-01

Pos logit contributions
rity+5.538
pun+4.978
endment+4.658
 upscale+4.498
ior+4.411
Neg logit contributions
ilers-9.792
erous-9.760
itored-9.125
zee-8.653
 spur-8.626
TokenMake
Token ID12050
Feature activation-9.61e+00

Pos logit contributions
ilers+0.221
erous+0.220
itored+0.205
 Consent+0.193
 spur+0.193
Neg logit contributions
rity-0.219
pun-0.203
endment-0.188
geist-0.187
 upscale-0.186
Token 6
Token ID718
Feature activation-7.46e+00

Pos logit contributions
erous+6.748
ilers+6.657
itored+6.089
 Consent+5.967
 spur+5.844
Neg logit contributions
rity-8.131
pun-7.680
endment-7.307
geist-7.019
CVE-6.940
Token figures
Token ID5538
Feature activation-7.40e+00

Pos logit contributions
ilers+5.440
erous+5.439
itored+4.942
 Consent+4.857
 spur+4.833
Neg logit contributions
rity-6.277
pun-5.773
endment-5.540
geist-5.414
 upscale-5.304
Token in
Token ID287
Feature activation-1.76e+00

Pos logit contributions
erous+5.423
ilers+5.412
itored+4.907
 spur+4.783
 Consent+4.757
Neg logit contributions
rity-6.210
pun-5.715
endment-5.368
geist-5.298
 upscale-5.212
Token 6
Token ID718
Feature activation-8.48e+00

Pos logit contributions
erous+1.322
ilers+1.321
itored+1.221
 spur+1.162
 Consent+1.151
Neg logit contributions
rity-1.491
pun-1.397
endment-1.302
geist-1.288
 upscale-1.278
Token months
Token ID1933
Feature activation-1.04e+01

Pos logit contributions
ilers+5.741
erous+5.730
itored+5.123
 spur+5.054
 Consent+4.902
Neg logit contributions
rity-7.365
pun-6.891
endment-6.580
geist-6.526
 upscale-6.321
Token.
Token ID13
Feature activation-1.66e+00

Pos logit contributions
ilers+7.116
erous+7.090
itored+6.437
 spur+6.318
 Consent+6.282
Neg logit contributions
rity-8.799
pun-8.138
geist-7.587
endment-7.533
 upscale-7.476
Token
Token ID447
Feature activation-8.07e-01

Pos logit contributions
ilers+1.202
erous+1.200
itored+1.111
 Consent+1.050
 spur+1.040
Neg logit contributions
rity-1.458
pun-1.363
endment-1.274
geist-1.259
 upscale-1.257
Token O
Token ID440
Feature activation-1.42e+00

Pos logit contributions
ilers+1.201
erous+1.198
itored+1.117
 Consent+1.054
 spur+1.052
Neg logit contributions
rity-1.251
pun-1.165
endment-1.079
geist-1.069
 upscale-1.067
Token&
Token ID5
Feature activation-4.44e+00

Pos logit contributions
ilers+1.101
erous+1.093
itored+1.016
 Consent+0.960
 spur+0.954
Neg logit contributions
rity-1.185
pun-1.104
endment-1.027
geist-1.016
 upscale-1.015
TokenA
Token ID32
Feature activation+3.24e+00

Pos logit contributions
erous+3.300
ilers+3.282
itored+3.028
 Consent+2.951
er+2.839
Neg logit contributions
rity-3.792
pun-3.463
endment-3.292
geist-3.220
 upscale-3.183
Token-
Token ID12
Feature activation-5.86e+00

Pos logit contributions
rity+2.201
pun+2.081
geist+1.890
ior+1.862
endment+1.854
Neg logit contributions
ilers-2.945
erous-2.891
itored-2.755
 spur-2.622
ilitating-2.585
TokenOp
Token ID18257
Feature activation-8.03e+00

Pos logit contributions
erous+4.599
ilers+4.572
itored+4.266
 Consent+4.108
 spur+4.006
Neg logit contributions
rity-4.660
pun-4.171
endment-4.007
 upscale-3.934
geist-3.923
Token Sick
Token ID32181
Feature activation-1.08e+01

Pos logit contributions
ilers+5.790
erous+5.768
itored+5.312
 Consent+5.110
 spur+4.928
Neg logit contributions
rity-6.580
pun-6.249
geist-5.802
 upscale-5.799
endment-5.783
Token-
Token ID12
Feature activation-6.51e+00

Pos logit contributions
erous+7.736
ilers+7.535
itored+7.015
er+6.695
 Consent+6.690
Neg logit contributions
rity-8.969
pun-8.113
 upscale-7.785
CVE-7.776
endment-7.769
TokenAnt
Token ID13217
Feature activation-3.34e+00

Pos logit contributions
erous+5.187
ilers+5.172
itored+4.842
 Consent+4.632
er+4.516
Neg logit contributions
rity-5.031
pun-4.492
endment-4.314
 upscale-4.269
geist-4.224
Tokens
Token ID82
Feature activation-4.08e+00

Pos logit contributions
ilers+2.400
erous+2.395
itored+2.223
 Consent+2.092
 spur+2.064
Neg logit contributions
rity-2.937
pun-2.742
endment-2.563
 upscale-2.555
geist-2.527
Token Dad
Token ID17415
Feature activation-1.00e+01

Pos logit contributions
erous+2.957
ilers+2.947
itored+2.737
 Consent+2.601
 spur+2.558
Neg logit contributions
rity-3.540
pun-3.295
endment-3.106
 upscale-3.069
geist-3.053
Token A
Token ID317
Feature activation-4.59e+00

Pos logit contributions
erous+7.245
ilers+7.018
itored+6.550
 Consent+6.331
 spur+6.202
Neg logit contributions
rity-8.167
pun-7.465
 upscale-7.195
endment-7.150
CVE-7.088
Token complete
Token ID1844
Feature activation+1.40e+00

Pos logit contributions
rity+2.034
pun+1.906
 upscale+1.780
endment+1.765
geist+1.746
Neg logit contributions
ilers-1.833
erous-1.792
itored-1.688
 Consent-1.593
 spur-1.564
Token mammoth
Token ID44268
Feature activation-3.47e-02

Pos logit contributions
rity+1.104
pun+1.028
 upscale+0.946
endment+0.943
geist+0.940
Neg logit contributions
ilers-1.149
erous-1.132
itored-1.069
 Consent-1.008
 spur-0.999
Token arrived
Token ID5284
Feature activation+4.00e+00

Pos logit contributions
ilers+0.028
erous+0.028
itored+0.026
 Consent+0.024
 spur+0.024
Neg logit contributions
rity-0.028
pun-0.026
endment-0.024
geist-0.024
 upscale-0.024
Token in
Token ID287
Feature activation-1.54e+00

Pos logit contributions
rity+3.043
pun+2.870
 upscale+2.629
endment+2.606
ior+2.594
Neg logit contributions
ilers-3.311
erous-3.299
itored-3.057
 Consent-2.927
 spur-2.875
Token Britain
Token ID5491
Feature activation-2.82e+00

Pos logit contributions
ilers+1.080
erous+1.075
itored+0.993
 spur+0.940
 Consent+0.933
Neg logit contributions
rity-1.391
pun-1.299
endment-1.222
geist-1.213
ior-1.192
Token on
Token ID319
Feature activation-9.59e+00

Pos logit contributions
ilers+2.214
erous+2.208
itored+2.075
 spur+1.982
 Consent+1.952
Neg logit contributions
rity-2.298
pun-2.126
endment-1.989
geist-1.963
 upscale-1.929
Token Monday
Token ID3321
Feature activation+2.57e+00

Pos logit contributions
erous+5.978
ilers+5.612
itored+5.312
 spur+5.249
 Consent+5.099
Neg logit contributions
rity-8.992
pun-8.331
endment-8.119
geist-8.084
ior-7.663
Token to
Token ID284
Feature activation+3.06e-01

Pos logit contributions
rity+1.985
pun+1.847
endment+1.701
 upscale+1.689
geist+1.678
Neg logit contributions
ilers-2.146
erous-2.140
itored-1.998
 Consent-1.890
 spur-1.883
Token go
Token ID467
Feature activation+7.01e+00

Pos logit contributions
rity+0.251
pun+0.235
endment+0.217
 upscale+0.216
geist+0.215
Neg logit contributions
ilers-0.239
erous-0.238
itored-0.222
 Consent-0.209
 spur-0.208
Token on
Token ID319
Feature activation-7.44e+00

Pos logit contributions
rity+5.296
pun+5.044
 upscale+4.685
geist+4.531
ior+4.521
Neg logit contributions
ilers-5.724
erous-5.711
itored-5.305
 Consent-5.081
 spur-4.892
Token public
Token ID1171
Feature activation-5.40e+00

Pos logit contributions
erous+5.756
ilers+5.461
itored+5.226
 spur+5.029
 Consent+4.884
Neg logit contributions
rity-6.042
pun-5.585
endment-5.254
geist-5.247
ior-5.127
Token back
Token ID736
Feature activation+9.80e+00

Pos logit contributions
rity+5.821
pun+5.607
 upscale+5.270
ior+5.025
endment+4.996
Neg logit contributions
erous-6.367
ilers-6.357
itored-6.128
 Consent-5.728
ilitating-5.606
Token there
Token ID612
Feature activation+2.29e+00

Pos logit contributions
rity+6.934
pun+6.785
 upscale+6.266
ior+6.022
endment+6.021
Neg logit contributions
ilers-7.918
erous-7.849
itored-7.504
 Consent-7.186
ilitating-6.870
Token and
Token ID290
Feature activation+3.56e+00

Pos logit contributions
rity+1.857
pun+1.742
endment+1.603
 upscale+1.601
geist+1.595
Neg logit contributions
ilers-1.810
erous-1.796
itored-1.688
 Consent-1.588
 spur-1.571
Token be
Token ID307
Feature activation+1.58e+00

Pos logit contributions
rity+2.794
pun+2.644
 upscale+2.450
endment+2.403
ior+2.395
Neg logit contributions
erous-2.855
ilers-2.843
itored-2.698
 Consent-2.518
 spur-2.462
Token on
Token ID319
Feature activation-1.01e+01

Pos logit contributions
rity+1.269
pun+1.182
endment+1.092
geist+1.082
 upscale+1.081
Neg logit contributions
ilers-1.274
erous-1.269
itored-1.186
 spur-1.118
 Consent-1.118
Token the
Token ID262
Feature activation-1.22e+01

Pos logit contributions
erous+6.818
ilers+6.257
itored+5.741
 spur+5.620
 Consent+5.531
Neg logit contributions
rity-9.174
pun-8.453
geist-8.097
endment-8.094
CVE-7.948
Token other
Token ID584
Feature activation-4.86e+00

Pos logit contributions
ilers+6.789
erous+6.727
 spur+6.348
itored+5.863
 Fathers+5.718
Neg logit contributions
rity-11.658
pun-10.634
endment-10.318
CVE-10.244
geist-10.072
Token side
Token ID1735
Feature activation-6.35e-01

Pos logit contributions
ilers+3.016
erous+2.991
itored+2.659
 spur+2.595
 Consent+2.472
Neg logit contributions
rity-4.724
pun-4.468
geist-4.165
endment-4.164
 upscale-4.118
Token of
Token ID286
Feature activation+1.69e+00

Pos logit contributions
ilers+0.484
erous+0.484
itored+0.448
 spur+0.425
 Consent+0.421
Neg logit contributions
rity-0.536
pun-0.498
endment-0.465
geist-0.460
 upscale-0.458
Token the
Token ID262
Feature activation-1.70e+00

Pos logit contributions
rity+1.423
pun+1.331
endment+1.235
 upscale+1.228
geist+1.224
Neg logit contributions
ilers-1.286
erous-1.279
itored-1.197
 Consent-1.122
 spur-1.119
Token rink
Token ID47925
Feature activation-4.92e+00

Pos logit contributions
ilers+1.031
erous+1.030
itored+0.925
 spur+0.875
 Consent+0.859
Neg logit contributions
rity-1.694
pun-1.594
endment-1.505
geist-1.494
ior-1.479
Token torn
Token ID12445
Feature activation-2.10e+00

Pos logit contributions
erous+1.131
ilers+1.129
itored+1.050
 spur+1.007
 Consent+0.991
Neg logit contributions
rity-1.199
pun-1.109
endment-1.038
geist-1.028
 upscale-1.011
Token down
Token ID866
Feature activation-2.56e+00

Pos logit contributions
erous+1.913
ilers+1.903
itored+1.796
 spur+1.728
 Consent+1.697
Neg logit contributions
rity-1.464
pun-1.328
endment-1.235
geist-1.217
 upscale-1.186
Token during
Token ID1141
Feature activation+6.92e-02

Pos logit contributions
erous+1.930
ilers+1.922
itored+1.778
 spur+1.706
 Consent+1.673
Neg logit contributions
rity-2.183
pun-2.020
endment-1.900
geist-1.876
 upscale-1.850
Token the
Token ID262
Feature activation-2.46e+00

Pos logit contributions
rity+0.060
pun+0.056
endment+0.052
 upscale+0.052
geist+0.052
Neg logit contributions
ilers-0.051
erous-0.051
itored-0.048
 Consent-0.045
 spur-0.044
Token on
Token ID319
Feature activation-9.98e+00

Pos logit contributions
erous+1.864
ilers+1.854
itored+1.716
 spur+1.645
 Consent+1.631
Neg logit contributions
rity-2.088
pun-1.936
endment-1.811
geist-1.799
ior-1.767
Token-
Token ID12
Feature activation-1.22e+01

Pos logit contributions
erous+7.845
ilers+7.256
itored+6.757
 spur+6.381
 Consent+6.285
Neg logit contributions
rity-8.180
pun-7.503
geist-7.183
endment-7.150
 upscale-6.958
Tokensite
Token ID15654
Feature activation-1.36e+00

Pos logit contributions
erous+8.766
ilers+8.171
itored+7.389
er+7.269
 spur+7.144
Neg logit contributions
rity-9.900
pun-8.897
geist-8.892
 upscale-8.662
endment-8.641
Token filming
Token ID17691
Feature activation-6.51e+00

Pos logit contributions
erous+1.132
ilers+1.129
itored+1.051
 spur+1.005
 Consent+1.000
Neg logit contributions
rity-1.054
pun-0.974
endment-0.902
geist-0.892
 upscale-0.885
Token of
Token ID286
Feature activation-8.94e-01

Pos logit contributions
erous+4.723
ilers+4.596
itored+4.320
 spur+4.198
 Consent+4.142
Neg logit contributions
rity-5.580
pun-5.246
geist-4.898
endment-4.852
ior-4.777
Token the
Token ID262
Feature activation-2.65e-03

Pos logit contributions
ilers+0.633
erous+0.630
itored+0.583
 spur+0.546
 Consent+0.546
Neg logit contributions
rity-0.805
pun-0.754
endment-0.705
geist-0.698
 upscale-0.695
Token 1979
Token ID13521
Feature activation+3.14e+00

Pos logit contributions
ilers+0.002
erous+0.002
itored+0.002
 Consent+0.002
 spur+0.002
Neg logit contributions
rity-0.002
pun-0.002
endment-0.002
 upscale-0.002
geist-0.002
Tokentto
Token ID33955
Feature activation-6.17e+00

Pos logit contributions
ilers+4.308
erous+4.250
itored+3.879
 Consent+3.681
 spur+3.678
Neg logit contributions
rity-4.574
pun-4.237
endment-3.977
 upscale-3.955
geist-3.941
Token &
Token ID1222
Feature activation-6.32e+00

Pos logit contributions
ilers+4.340
erous+4.321
itored+4.046
 Consent+3.877
 spur+3.788
Neg logit contributions
rity-5.423
pun-5.016
endment-4.748
 upscale-4.653
geist-4.616
Token George
Token ID4502
Feature activation-9.84e+00

Pos logit contributions
erous+4.083
ilers+4.044
itored+3.697
 Consent+3.623
 spur+3.477
Neg logit contributions
rity-5.891
pun-5.490
endment-5.228
geist-5.104
 upscale-5.077
Token-
Token ID12
Feature activation-6.89e+00

Pos logit contributions
ilers+6.899
erous+6.892
itored+6.397
 Consent+6.163
 spur+6.006
Neg logit contributions
rity-8.211
pun-7.493
endment-7.234
CVE-7.096
 upscale-7.055
Token8
Token ID23
Feature activation-7.38e+00

Pos logit contributions
erous+5.597
ilers+5.569
itored+5.228
 Consent+5.027
er+4.926
Neg logit contributions
rity-5.193
pun-4.594
endment-4.428
 upscale-4.367
geist-4.310
Token Track
Token ID17762
Feature activation-9.15e+00

Pos logit contributions
erous+5.390
ilers+5.343
itored+4.959
 Consent+4.868
 spur+4.666
Neg logit contributions
rity-6.119
pun-5.646
endment-5.318
geist-5.294
 upscale-5.238
Token Porn
Token ID29629
Feature activation-8.95e+00

Pos logit contributions
erous+6.710
ilers+6.637
itored+6.188
 Consent+5.960
 spur+5.736
Neg logit contributions
rity-7.447
pun-6.802
 upscale-6.493
endment-6.486
CVE-6.444
Token Pt
Token ID34352
Feature activation-5.60e+00

Pos logit contributions
erous+6.474
ilers+6.391
itored+5.930
 Consent+5.680
 spur+5.607
Neg logit contributions
rity-7.483
pun-6.861
endment-6.568
 upscale-6.522
CVE-6.502
Token 4
Token ID604
Feature activation-3.42e+00

Pos logit contributions
ilers+3.968
erous+3.888
itored+3.630
 Consent+3.449
 spur+3.414
Neg logit contributions
rity-4.938
pun-4.597
 upscale-4.300
endment-4.299
geist-4.298
Token-
Token ID12
Feature activation-4.91e+00

Pos logit contributions
ilers+2.635
erous+2.632
itored+2.445
 spur+2.308
 Consent+2.299
Neg logit contributions
rity-2.833
pun-2.628
endment-2.464
 upscale-2.434
geist-2.429
TokenOtt
Token ID49092
Feature activation-6.77e+00

Pos logit contributions
erous+4.043
ilers+4.037
itored+3.779
 Consent+3.614
 spur+3.530
Neg logit contributions
rity-3.727
pun-3.341
endment-3.185
 upscale-3.148
geist-3.111