TOP ACTIVATIONS
MAX = 2.498

s con ­ ven ­ tion in 2008 , Steve Kor
Newsletter Sign Up Continue reading the main story Please
10 years in prison MORE and Hillary Clinton Hillary Diane Rodham
Newsletter Sign Up Continue reading the main story Please
Newsletter Sign Up Continue reading the main story Please
did not issue the total Loving v . Virginia victory that
use of its Android mobile OS . In a
. Explore further : New Insight Into How Bees
prison MORE and Hillary Clinton Hillary Diane Rodham Clinton Sanders :
's Great America in Santa Clara , California was originally called
former President Barack Obama Barack Hussein Obama With low birth rate
will be the equivalent of Loving v . Virginia , which
world with kindred spirits . Sign up for our
sleeve at basketball game MORE and Hillary Clinton Hillary Diane Rodham
10 years in prison MORE and tick ed through her usual
game MORE and Hillary Clinton Hillary Diane Rodham Clinton Sanders :
Par liament operates in both official languages
two parties . Prep aring for independence : 1961
10 years in prison MORE , in his weekly column Wednesday
University of California , Santa Barbara , and an organizer of

MIN ACTIVATIONS
MIN = -2.978

- and - a - half - deep " source rock
File ) L ANS ING , MI -- The
- 35 790 347 . html https :// www . bel
be very relieved . The BBC 's David L
would be very relieved . The BBC 's David
р а м е
their products . Shares of Alt ria Group ,
assists for the Jets / Th ras hers , helped the
make reproductive choices . Why don t you trust
T NS via Getty Images ) Indiana Gov . Mike Pence
о� р а с с л е
https :// www . eff . org / d ee
и я , т о
4 percent . Shares of Philip Morris International ,
hardware is almost 4 years old and once I turn AA
as the Caps out - shot the Blues 10 - 8
increasing claims of fraud . Full preliminary results were
к и м и р у
via Getty Images ) Indiana Gov . Mike Pence Pence is
history . <|endoftext|> About This Game Choose from one

INTERVAL 1.129 - 2.498
CONTAINS 0.299%

thread before this lesson . Show your child how to insert
Newsletter Sign Up Continue reading the main story Please
s new domain buying service Google Dom ains at 1
two years away , it seems unlikely that " King dom
ta V NS following sham ). Results After four

INTERVAL -0.240 - 1.129
CONTAINS 75.689%

ica , who suffered serious injuries in a rally crash last
1960 s when strong unions , good jobs and high taxes
best publisher in the industry . It s a
light , and then searched his vehicle for weapons . Once
for the incident , but has apparently not apologized to Crow

INTERVAL -1.609 - -0.240
CONTAINS 23.997%

the fierce crackdown that took place in June and July .
commitment to the univers ality of reason as responsiveness to reality
moved there and then upl ifted and put on a basement
2017 . [ 22 ] Literally , R ashi
countries to those same market forces . Ne oliberal analysts similarly

INTERVAL -2.978 - -1.609
CONTAINS 0.014%

others , they said that wasn 't true . They gave
https :// www . eff . org / d ee
r ump 220 22 . jpg Email
increasing claims of fraud . Full preliminary results were
assists for the Jets / Th ras hers , helped the
Tokens
Token ID82
Feature activation+4.06e-01

Pos logit contributions
-+0.175
 distance+0.168
 on+0.155
 by+0.153
 your+0.151
Neg logit contributions
 advoc-0.906
 rul-0.886
 mathemat-0.864
 Citiz-0.854
 conclud-0.830
Token con
Token ID369
Feature activation+8.54e-01

Pos logit contributions
-+0.184
PI+0.171
 on+0.167
 distance+0.167
 by+0.161
Neg logit contributions
 advoc-0.725
 rul-0.709
 mathemat-0.694
 Citiz-0.679
 conclud-0.657
Token­
Token ID3907
Feature activation+6.86e-01

Pos logit contributions
 distance+0.158
-+0.151
 on+0.140
 by+0.134
PI+0.134
Neg logit contributions
 advoc-1.705
 rul-1.696
 mathemat-1.652
 Citiz-1.623
 conclud-1.565
Tokenven
Token ID574
Feature activation+1.80e+00

Pos logit contributions
-+0.372
 on+0.361
 distance+0.354
 by+0.349
 your+0.342
Neg logit contributions
 advoc-1.144
 rul-1.122
 mathemat-1.087
 Citiz-1.064
 conclud-1.025
Token­
Token ID3907
Feature activation+3.99e-01

Pos logit contributions
 distance+0.363
-+0.326
 on+0.310
PI+0.282
 safely+0.279
Neg logit contributions
 rul-3.561
 advoc-3.528
 mathemat-3.421
 Citiz-3.403
 tyr-3.286
Tokention
Token ID5378
Feature activation+2.50e+00

Pos logit contributions
 distance+0.176
PI+0.170
-+0.168
 your+0.165
 by+0.165
Neg logit contributions
 rul-0.676
 advoc-0.665
 mathemat-0.637
 Citiz-0.636
 tyr-0.613
Token in
Token ID287
Feature activation+2.11e-01

Pos logit contributions
-+0.510
 distance+0.450
PI+0.413
 by+0.363
 your+0.363
Neg logit contributions
 advoc-5.077
 rul-5.015
 mathemat-4.856
 Citiz-4.802
 conclud-4.683
Token 2008
Token ID3648
Feature activation+8.97e-02

Pos logit contributions
-+0.100
 distance+0.093
 on+0.091
PI+0.091
 by+0.088
Neg logit contributions
 advoc-0.369
 rul-0.362
 mathemat-0.352
 Citiz-0.345
 tyr-0.333
Token,
Token ID11
Feature activation+2.20e-01

Pos logit contributions
 distance+0.016
-+0.016
PI+0.014
 on+0.013
 your+0.012
Neg logit contributions
 advoc-0.182
 rul-0.178
 mathemat-0.174
 Citiz-0.171
 conclud-0.167
Token Steve
Token ID6542
Feature activation+9.22e-01

Pos logit contributions
-+0.093
 distance+0.087
PI+0.085
 on+0.080
erm+0.077
Neg logit contributions
 advoc-0.401
 rul-0.392
 mathemat-0.381
 Citiz-0.374
 tyr-0.363
Token Kor
Token ID14769
Feature activation-6.12e-01

Pos logit contributions
-+0.476
 distance+0.466
 on+0.452
PI+0.435
 by+0.430
Neg logit contributions
 advoc-1.578
 rul-1.532
 mathemat-1.497
 Citiz-1.474
 embr-1.409
Token
Token ID198
Feature activation-2.16e-01

Pos logit contributions
-+0.014
 distance+0.013
 safely+0.012
 on+0.012
 hunting+0.012
Neg logit contributions
 advoc-0.046
 rul-0.045
 Citiz-0.044
 mathemat-0.044
 conclud-0.042
Token
Token ID198
Feature activation-1.82e-01

Pos logit contributions
 advoc+0.350
 rul+0.344
 mathemat+0.331
 Citiz+0.329
 embr+0.321
Neg logit contributions
 distance-0.096
--0.091
 hunting-0.087
 on-0.086
 tour-0.085
TokenNewsletter
Token ID33031
Feature activation+5.38e-01

Pos logit contributions
 advoc+0.265
 rul+0.251
 mathemat+0.248
 Citiz+0.246
 embr+0.228
Neg logit contributions
 distance-0.130
 hunting-0.121
 on-0.121
 tour-0.121
--0.120
Token Sign
Token ID5865
Feature activation+7.39e-01

Pos logit contributions
-+0.256
 distance+0.243
 on+0.239
 by+0.228
PI+0.227
Neg logit contributions
 advoc-0.926
 rul-0.901
 mathemat-0.890
 Citiz-0.867
 embr-0.832
Token Up
Token ID3205
Feature activation+1.30e+00

Pos logit contributions
 distance+0.358
-+0.338
 on+0.333
 safely+0.326
 tour+0.320
Neg logit contributions
 advoc-1.227
 rul-1.197
 Citiz-1.171
 mathemat-1.167
 embr-1.102
Token Continue
Token ID10054
Feature activation+2.41e+00

Pos logit contributions
-+0.759
 distance+0.745
 on+0.710
 by+0.679
 ass+0.679
Neg logit contributions
 advoc-2.085
 rul-2.057
 mathemat-2.014
 Citiz-1.954
 conclud-1.865
Token reading
Token ID3555
Feature activation-1.18e-01

Pos logit contributions
-+0.880
 on+0.763
 distance+0.758
 by+0.710
PI+0.702
Neg logit contributions
 advoc-4.525
 rul-4.434
 mathemat-4.373
 Citiz-4.241
 conclud-4.113
Token the
Token ID262
Feature activation-2.25e-01

Pos logit contributions
 advoc+0.231
 rul+0.229
 mathemat+0.225
 Citiz+0.218
 embr+0.215
Neg logit contributions
--0.021
 distance-0.017
PI-0.016
 on-0.014
 by-0.013
Token main
Token ID1388
Feature activation+1.36e+00

Pos logit contributions
 advoc+0.380
 rul+0.372
 mathemat+0.366
 Citiz+0.354
 embr+0.342
Neg logit contributions
--0.115
 distance-0.108
 on-0.104
PI-0.104
 by-0.101
Token story
Token ID1621
Feature activation+1.24e+00

Pos logit contributions
-+0.831
 distance+0.799
 on+0.783
PI+0.779
 by+0.769
Neg logit contributions
 advoc-2.161
 rul-2.108
 mathemat-2.065
 Citiz-1.986
 conclud-1.932
Token Please
Token ID4222
Feature activation+1.66e+00

Pos logit contributions
-+0.790
 distance+0.767
 on+0.731
PI+0.720
 by+0.714
Neg logit contributions
 advoc-1.976
 rul-1.888
 mathemat-1.884
 Citiz-1.821
 conclud-1.725
Token 10
Token ID838
Feature activation-3.43e-01

Pos logit contributions
 advoc+0.743
 rul+0.708
 mathemat+0.692
 Citiz+0.672
 conclud+0.669
Neg logit contributions
--0.174
PI-0.171
 distance-0.168
 solely-0.158
 by-0.157
Token years
Token ID812
Feature activation-9.10e-02

Pos logit contributions
 advoc+0.608
 rul+0.592
 mathemat+0.579
 Citiz+0.561
 conclud+0.555
Neg logit contributions
--0.148
 distance-0.139
PI-0.138
 on-0.135
 by-0.134
Token in
Token ID287
Feature activation-7.96e-02

Pos logit contributions
 advoc+0.177
 rul+0.174
 mathemat+0.170
 Citiz+0.169
 tyr+0.162
Neg logit contributions
--0.022
 distance-0.020
PI-0.017
 your-0.017
123-0.017
Token prison
Token ID3770
Feature activation-5.32e-01

Pos logit contributions
 advoc+0.135
 rul+0.131
 mathemat+0.127
 Citiz+0.123
 tyr+0.122
Neg logit contributions
--0.041
PI-0.038
 distance-0.038
 on-0.037
 specifically-0.036
Token MORE
Token ID12011
Feature activation+7.88e-01

Pos logit contributions
 advoc+0.873
 rul+0.847
 mathemat+0.837
 Citiz+0.827
 tyr+0.809
Neg logit contributions
--0.271
 distance-0.248
PI-0.247
 your-0.235
ering-0.234
Token and
Token ID290
Feature activation+2.35e+00

Pos logit contributions
-+0.168
 distance+0.162
 your+0.146
123+0.138
PI+0.136
Neg logit contributions
 advoc-1.566
 rul-1.521
 mathemat-1.512
 Citiz-1.445
 conclud-1.432
Token Hillary
Token ID5073
Feature activation+1.43e+00

Pos logit contributions
-+1.408
 on+1.300
PI+1.290
 distance+1.276
 by+1.250
Neg logit contributions
 advoc-3.885
 mathemat-3.751
 rul-3.741
 Citiz-3.567
 conclud-3.474
Token Clinton
Token ID2605
Feature activation+9.58e-01

Pos logit contributions
-+1.001
 distance+0.967
 on+0.943
PI+0.922
 your+0.912
Neg logit contributions
 advoc-2.152
 rul-2.069
 mathemat-2.027
 Citiz-1.977
 embr-1.903
Token Hillary
Token ID5073
Feature activation+2.17e+00

Pos logit contributions
-+0.615
 distance+0.595
PI+0.591
123+0.590
 your+0.558
Neg logit contributions
 advoc-1.488
 rul-1.427
 mathemat-1.404
 Citiz-1.326
 embr-1.318
Token Diane
Token ID26542
Feature activation+1.38e+00

Pos logit contributions
-+1.735
 on+1.646
 distance+1.617
 by+1.593
 your+1.570
Neg logit contributions
 advoc-3.141
 rul-3.033
 mathemat-3.010
 Citiz-2.888
 conclud-2.767
Token Rodham
Token ID41848
Feature activation+1.67e+00

Pos logit contributions
-+0.925
 distance+0.866
 on+0.864
PI+0.851
 by+0.846
Neg logit contributions
 advoc-2.132
 rul-2.033
 mathemat-2.014
 Citiz-1.931
 embr-1.870
Token
Token ID198
Feature activation+3.45e-01

Pos logit contributions
-+0.042
 distance+0.041
 safely+0.038
 your+0.038
 on+0.037
Neg logit contributions
 advoc-0.156
 rul-0.153
 Citiz-0.152
 mathemat-0.150
 conclud-0.143
Token
Token ID198
Feature activation+2.58e-02

Pos logit contributions
 distance+0.150
-+0.145
 tour+0.134
 hunting+0.134
 on+0.134
Neg logit contributions
 advoc-0.568
 rul-0.559
 mathemat-0.537
 Citiz-0.535
 embr-0.520
TokenNewsletter
Token ID33031
Feature activation+5.33e-01

Pos logit contributions
 distance+0.018
 hunting+0.017
-+0.017
 your+0.017
 on+0.017
Neg logit contributions
 advoc-0.037
 rul-0.035
 mathemat-0.035
 Citiz-0.035
 embr-0.033
Token Sign
Token ID5865
Feature activation+4.89e-01

Pos logit contributions
-+0.262
 distance+0.257
 on+0.245
PI+0.239
 by+0.234
Neg logit contributions
 advoc-0.899
 rul-0.868
 mathemat-0.861
 Citiz-0.845
 embr-0.810
Token Up
Token ID3205
Feature activation+7.34e-01

Pos logit contributions
 distance+0.236
-+0.221
 on+0.218
 safely+0.216
 hunting+0.213
Neg logit contributions
 advoc-0.817
 rul-0.793
 Citiz-0.779
 mathemat-0.777
 embr-0.733
Token Continue
Token ID10054
Feature activation+2.30e+00

Pos logit contributions
-+0.430
 distance+0.422
 on+0.400
PI+0.386
 ass+0.384
Neg logit contributions
 advoc-1.169
 rul-1.155
 mathemat-1.127
 Citiz-1.099
 conclud-1.047
Token reading
Token ID3555
Feature activation-3.47e-01

Pos logit contributions
-+0.848
 on+0.732
 distance+0.731
PI+0.699
 by+0.682
Neg logit contributions
 advoc-4.308
 rul-4.220
 mathemat-4.164
 Citiz-4.032
 conclud-3.906
Token the
Token ID262
Feature activation-3.95e-01

Pos logit contributions
 advoc+0.662
 rul+0.656
 mathemat+0.639
 Citiz+0.626
 embr+0.618
Neg logit contributions
--0.060
 distance-0.056
PI-0.053
erm-0.046
 on-0.043
Token main
Token ID1388
Feature activation+9.93e-01

Pos logit contributions
 advoc+0.652
 rul+0.638
 mathemat+0.625
 Citiz+0.602
 embr+0.585
Neg logit contributions
--0.201
PI-0.194
 distance-0.193
 on-0.185
 by-0.179
Token story
Token ID1621
Feature activation+9.91e-01

Pos logit contributions
-+0.610
 distance+0.592
PI+0.586
 on+0.576
 by+0.565
Neg logit contributions
 advoc-1.548
 rul-1.511
 mathemat-1.470
 Citiz-1.414
 embr-1.376
Token Please
Token ID4222
Feature activation+1.53e+00

Pos logit contributions
-+0.619
 distance+0.595
 on+0.573
PI+0.564
 by+0.557
Neg logit contributions
 advoc-1.588
 rul-1.520
 mathemat-1.514
 Citiz-1.472
 conclud-1.390
Token
Token ID198
Feature activation+3.76e-01

Pos logit contributions
-+0.177
 distance+0.167
 on+0.160
 safely+0.153
 ass+0.153
Neg logit contributions
 advoc-0.704
 rul-0.685
 mathemat-0.670
 Citiz-0.670
 conclud-0.634
Token
Token ID198
Feature activation+2.88e-01

Pos logit contributions
 distance+0.153
-+0.152
 on+0.143
 ass+0.137
 tour+0.136
Neg logit contributions
 advoc-0.639
 rul-0.626
 mathemat-0.608
 Citiz-0.601
 embr-0.581
TokenNewsletter
Token ID33031
Feature activation+4.64e-01

Pos logit contributions
 distance+0.181
 on+0.171
 by+0.168
-+0.167
 ass+0.164
Neg logit contributions
 advoc-0.450
 rul-0.430
 mathemat-0.426
 Citiz-0.418
 embr-0.393
Token Sign
Token ID5865
Feature activation+5.06e-01

Pos logit contributions
-+0.214
 distance+0.199
 on+0.198
 by+0.188
PI+0.183
Neg logit contributions
 advoc-0.820
 rul-0.796
 mathemat-0.788
 Citiz-0.765
 tyr-0.734
Token Up
Token ID3205
Feature activation+5.91e-01

Pos logit contributions
 distance+0.246
 on+0.230
-+0.229
 safely+0.225
PI+0.221
Neg logit contributions
 advoc-0.840
 rul-0.816
 mathemat-0.798
 Citiz-0.796
 embr-0.753
Token Continue
Token ID10054
Feature activation+2.26e+00

Pos logit contributions
-+0.346
 distance+0.341
 on+0.324
 ass+0.310
 by+0.309
Neg logit contributions
 advoc-0.942
 rul-0.930
 mathemat-0.909
 Citiz-0.882
 conclud-0.843
Token reading
Token ID3555
Feature activation+6.63e-03

Pos logit contributions
-+0.821
 on+0.711
 distance+0.704
 by+0.663
PI+0.658
Neg logit contributions
 advoc-4.241
 rul-4.155
 mathemat-4.100
 Citiz-3.976
 conclud-3.854
Token the
Token ID262
Feature activation-2.54e-01

Pos logit contributions
-+0.001
 distance+0.001
PI+0.001
 on+0.001
 by+0.001
Neg logit contributions
 advoc-0.013
 rul-0.013
 mathemat-0.013
 Citiz-0.012
 embr-0.012
Token main
Token ID1388
Feature activation+1.07e+00

Pos logit contributions
 advoc+0.432
 rul+0.423
 mathemat+0.415
 Citiz+0.402
 embr+0.388
Neg logit contributions
--0.127
 distance-0.119
 on-0.116
PI-0.114
 by-0.112
Token story
Token ID1621
Feature activation+1.03e+00

Pos logit contributions
-+0.651
 distance+0.624
 on+0.614
PI+0.607
 by+0.604
Neg logit contributions
 advoc-1.696
 rul-1.651
 mathemat-1.616
 Citiz-1.559
 conclud-1.513
Token Please
Token ID4222
Feature activation+1.54e+00

Pos logit contributions
-+0.638
 distance+0.612
 on+0.592
 by+0.577
PI+0.573
Neg logit contributions
 advoc-1.643
 rul-1.574
 mathemat-1.564
 Citiz-1.518
 embr-1.440
Token did
Token ID750
Feature activation-3.34e-02

Pos logit contributions
-+0.075
PI+0.070
 distance+0.068
123+0.064
 on+0.062
Neg logit contributions
 advoc-0.316
 rul-0.314
 mathemat-0.305
 Citiz-0.290
 conclud-0.288
Token not
Token ID407
Feature activation+4.73e-03

Pos logit contributions
 advoc+0.061
 rul+0.060
 mathemat+0.059
 conclud+0.056
 tyr+0.056
Neg logit contributions
--0.012
PI-0.012
 distance-0.011
123-0.010
 on-0.010
Token issue
Token ID2071
Feature activation-2.55e-01

Pos logit contributions
PI+0.002
-+0.002
123+0.002
 on+0.002
 distance+0.002
Neg logit contributions
 rul-0.008
 advoc-0.008
 mathemat-0.008
 tyr-0.007
 conclud-0.007
Token the
Token ID262
Feature activation-8.01e-02

Pos logit contributions
 rul+0.493
 advoc+0.492
 mathemat+0.473
 Citiz+0.461
 conclud+0.451
Neg logit contributions
--0.071
PI-0.063
 distance-0.062
 on-0.055
 by-0.053
Token total
Token ID2472
Feature activation+2.76e-01

Pos logit contributions
 advoc+0.133
 rul+0.132
 mathemat+0.128
 Citiz+0.124
 tyr+0.120
Neg logit contributions
--0.045
PI-0.042
 distance-0.042
 on-0.041
 by-0.040
Token Loving
Token ID39202
Feature activation+2.24e+00

Pos logit contributions
-+0.131
PI+0.127
 distance+0.126
 on+0.121
 by+0.118
Neg logit contributions
 advoc-0.476
 rul-0.475
 mathemat-0.457
 Citiz-0.436
 tyr-0.433
Token v
Token ID410
Feature activation+1.17e-01

Pos logit contributions
-+1.160
 distance+1.121
 on+1.106
 by+1.069
PI+1.064
Neg logit contributions
 advoc-3.828
 rul-3.762
 mathemat-3.633
 Citiz-3.539
 conclud-3.442
Token.
Token ID13
Feature activation-1.85e-01

Pos logit contributions
-+0.018
 distance+0.013
 on+0.013
PI+0.012
 by+0.011
Neg logit contributions
 advoc-0.241
 rul-0.237
 mathemat-0.232
 Citiz-0.229
 conclud-0.222
Token Virginia
Token ID6025
Feature activation-2.65e-01

Pos logit contributions
 advoc+0.275
 rul+0.270
 mathemat+0.260
 Citiz+0.254
 conclud+0.245
Neg logit contributions
--0.132
PI-0.128
 on-0.125
 distance-0.124
 by-0.122
Token victory
Token ID5373
Feature activation+7.11e-01

Pos logit contributions
 rul+0.426
 advoc+0.424
 mathemat+0.404
 Citiz+0.393
 conclud+0.387
Neg logit contributions
--0.159
 distance-0.152
PI-0.149
 on-0.146
123-0.145
Token that
Token ID326
Feature activation+9.93e-02

Pos logit contributions
-+0.198
 distance+0.194
PI+0.179
123+0.169
 your+0.164
Neg logit contributions
 advoc-1.364
 rul-1.355
 mathemat-1.308
 Citiz-1.262
 conclud-1.248
Token use
Token ID779
Feature activation-3.06e-01

Pos logit contributions
-+0.110
PI+0.104
erm+0.099
 on+0.098
 distance+0.097
Neg logit contributions
 advoc-0.468
 rul-0.456
 mathemat-0.448
 Citiz-0.434
 embr-0.418
Token of
Token ID286
Feature activation-9.11e-02

Pos logit contributions
 advoc+0.614
 rul+0.604
 mathemat+0.598
 Citiz+0.571
 conclud+0.563
Neg logit contributions
--0.048
 distance-0.044
PI-0.044
 ass-0.040
erm-0.037
Token its
Token ID663
Feature activation-5.33e-03

Pos logit contributions
 advoc+0.165
 rul+0.161
 mathemat+0.158
 Citiz+0.154
 tyr+0.148
Neg logit contributions
--0.036
PI-0.034
 distance-0.032
 on-0.032
erm-0.032
Token Android
Token ID5565
Feature activation+9.85e-01

Pos logit contributions
 advoc+0.009
 rul+0.009
 mathemat+0.009
 Citiz+0.009
 tyr+0.008
Neg logit contributions
--0.003
PI-0.002
 distance-0.002
 on-0.002
123-0.002
Token mobile
Token ID5175
Feature activation+1.04e+00

Pos logit contributions
 distance+0.454
-+0.449
PI+0.434
 on+0.433
123+0.432
Neg logit contributions
 advoc-1.698
 rul-1.664
 mathemat-1.629
 Citiz-1.583
 tyr-1.521
Token OS
Token ID7294
Feature activation+2.23e+00

Pos logit contributions
 distance+0.485
-+0.482
 on+0.466
PI+0.455
123+0.455
Neg logit contributions
 advoc-1.805
 rul-1.770
 mathemat-1.726
 Citiz-1.675
 tyr-1.613
Token.
Token ID13
Feature activation-7.91e-02

Pos logit contributions
-+0.383
 distance+0.356
PI+0.292
 ass+0.282
 your+0.281
Neg logit contributions
 advoc-4.534
 rul-4.481
 mathemat-4.401
 Citiz-4.304
 conclud-4.178
Token
Token ID198
Feature activation+2.04e-01

Pos logit contributions
 advoc+0.134
 rul+0.131
 Citiz+0.130
 mathemat+0.127
 conclud+0.120
Neg logit contributions
--0.038
 distance-0.037
 safely-0.034
 on-0.034
 ass-0.034
Token
Token ID198
Feature activation+2.19e-01

Pos logit contributions
 distance+0.090
-+0.085
 hunting+0.081
 on+0.080
 tour+0.079
Neg logit contributions
 advoc-0.332
 rul-0.326
 mathemat-0.314
 Citiz-0.312
 embr-0.304
TokenIn
Token ID818
Feature activation-1.55e-02

Pos logit contributions
 distance+0.151
 on+0.141
 by+0.139
-+0.139
 ass+0.138
Neg logit contributions
 advoc-0.327
 rul-0.315
 Citiz-0.308
 mathemat-0.308
 embr-0.284
Token a
Token ID257
Feature activation-1.35e-01

Pos logit contributions
 advoc+0.029
 rul+0.028
 mathemat+0.027
 Citiz+0.027
 conclud+0.026
Neg logit contributions
--0.005
 distance-0.005
 on-0.005
PI-0.005
 by-0.005
Token.
Token ID13
Feature activation-3.91e-02

Pos logit contributions
 advoc+0.829
 mathemat+0.816
 rul+0.807
 Citiz+0.781
 conclud+0.769
Neg logit contributions
 distance-0.092
--0.088
 tour-0.068
 stalk-0.067
 safely-0.065
Token
Token ID198
Feature activation+2.75e-01

Pos logit contributions
 advoc+0.069
 rul+0.067
 mathemat+0.066
 Citiz+0.065
 conclud+0.062
Neg logit contributions
--0.018
 distance-0.017
 on-0.016
 by-0.016
 your-0.016
Token
Token ID198
Feature activation+1.11e-01

Pos logit contributions
 distance+0.120
-+0.114
 on+0.108
 tour+0.107
 hunting+0.106
Neg logit contributions
 advoc-0.454
 rul-0.446
 mathemat-0.429
 Citiz-0.426
 embr-0.415
TokenExplore
Token ID35433
Feature activation-3.04e-04

Pos logit contributions
 distance+0.079
 on+0.077
-+0.075
 by+0.075
 auto+0.074
Neg logit contributions
 advoc-0.163
 rul-0.155
 mathemat-0.154
 Citiz-0.151
 embr-0.141
Token further
Token ID2252
Feature activation+6.41e-01

Pos logit contributions
 advoc+0.001
 rul+0.001
 mathemat+0.001
 Citiz+0.001
 conclud+0.001
Neg logit contributions
--0.000
 distance-0.000
 on-0.000
123-0.000
PI-0.000
Token:
Token ID25
Feature activation+2.20e+00

Pos logit contributions
-+0.109
 distance+0.101
PI+0.085
 safely+0.081
 tour+0.079
Neg logit contributions
 advoc-1.230
 rul-1.213
 mathemat-1.182
 Citiz-1.159
 embr-1.126
Token New
Token ID968
Feature activation+4.87e-01

Pos logit contributions
-+1.411
 distance+1.337
 on+1.323
 by+1.282
 your+1.269
Neg logit contributions
 advoc-3.464
 rul-3.364
 mathemat-3.329
 Citiz-3.294
 conclud-3.103
Token Insight
Token ID39917
Feature activation+1.99e-01

Pos logit contributions
-+0.235
 on+0.217
 distance+0.214
PI+0.213
 by+0.211
Neg logit contributions
 advoc-0.846
 rul-0.828
 mathemat-0.811
 Citiz-0.787
 conclud-0.762
Token Into
Token ID20008
Feature activation-2.31e-01

Pos logit contributions
 distance+0.063
-+0.061
PI+0.055
 your+0.053
 ass+0.053
Neg logit contributions
 advoc-0.373
 rul-0.362
 mathemat-0.361
 Citiz-0.353
 conclud-0.339
Token How
Token ID1374
Feature activation-5.83e-02

Pos logit contributions
 advoc+0.380
 rul+0.374
 mathemat+0.368
 Citiz+0.364
 conclud+0.344
Neg logit contributions
--0.125
 distance-0.118
PI-0.114
 on-0.114
 ass-0.111
Token Bees
Token ID46698
Feature activation+2.91e-01

Pos logit contributions
 advoc+0.087
 rul+0.086
 mathemat+0.084
 Citiz+0.084
 embr+0.078
Neg logit contributions
--0.039
 distance-0.038
 on-0.037
PI-0.037
 ass-0.036
Token prison
Token ID3770
Feature activation-5.32e-01

Pos logit contributions
 advoc+0.135
 rul+0.131
 mathemat+0.127
 Citiz+0.123
 tyr+0.122
Neg logit contributions
--0.041
PI-0.038
 distance-0.038
 on-0.037
 specifically-0.036
Token MORE
Token ID12011
Feature activation+7.88e-01

Pos logit contributions
 advoc+0.873
 rul+0.847
 mathemat+0.837
 Citiz+0.827
 tyr+0.809
Neg logit contributions
--0.271
 distance-0.248
PI-0.247
 your-0.235
ering-0.234
Token and
Token ID290
Feature activation+2.35e+00

Pos logit contributions
-+0.168
 distance+0.162
 your+0.146
123+0.138
PI+0.136
Neg logit contributions
 advoc-1.566
 rul-1.521
 mathemat-1.512
 Citiz-1.445
 conclud-1.432
Token Hillary
Token ID5073
Feature activation+1.43e+00

Pos logit contributions
-+1.408
 on+1.300
PI+1.290
 distance+1.276
 by+1.250
Neg logit contributions
 advoc-3.885
 mathemat-3.751
 rul-3.741
 Citiz-3.567
 conclud-3.474
Token Clinton
Token ID2605
Feature activation+9.58e-01

Pos logit contributions
-+1.001
 distance+0.967
 on+0.943
PI+0.922
 your+0.912
Neg logit contributions
 advoc-2.152
 rul-2.069
 mathemat-2.027
 Citiz-1.977
 embr-1.903
Token Hillary
Token ID5073
Feature activation+2.17e+00

Pos logit contributions
-+0.615
 distance+0.595
PI+0.591
123+0.590
 your+0.558
Neg logit contributions
 advoc-1.488
 rul-1.427
 mathemat-1.404
 Citiz-1.326
 embr-1.318
Token Diane
Token ID26542
Feature activation+1.38e+00

Pos logit contributions
-+1.735
 on+1.646
 distance+1.617
 by+1.593
 your+1.570
Neg logit contributions
 advoc-3.141
 rul-3.033
 mathemat-3.010
 Citiz-2.888
 conclud-2.767
Token Rodham
Token ID41848
Feature activation+1.67e+00

Pos logit contributions
-+0.925
 distance+0.866
 on+0.864
PI+0.851
 by+0.846
Neg logit contributions
 advoc-2.132
 rul-2.033
 mathemat-2.014
 Citiz-1.931
 embr-1.870
Token Clinton
Token ID2605
Feature activation+1.31e+00

Pos logit contributions
-+1.186
 distance+1.102
 on+1.095
PI+1.061
 by+1.061
Neg logit contributions
 advoc-2.544
 rul-2.464
 mathemat-2.399
 Citiz-2.321
 conclud-2.256
TokenSanders
Token ID26747
Feature activation+9.13e-01

Pos logit contributions
-+1.034
 distance+1.010
 on+0.990
 by+0.974
 your+0.963
Neg logit contributions
 advoc-1.886
 rul-1.794
 mathemat-1.770
 Citiz-1.702
 conclud-1.635
Token:
Token ID25
Feature activation+1.18e+00

Pos logit contributions
-+0.796
 on+0.751
 distance+0.742
 by+0.738
 your+0.730
Neg logit contributions
 advoc-1.269
 rul-1.219
 mathemat-1.201
 Citiz-1.152
 conclud-1.121
Token's
Token ID338
Feature activation+2.62e-02

Pos logit contributions
 advoc+1.208
 rul+1.181
 mathemat+1.165
 Citiz+1.149
 conclud+1.095
Neg logit contributions
--0.391
 distance-0.379
 on-0.372
 by-0.364
PI-0.363
Token Great
Token ID3878
Feature activation-4.45e-01

Pos logit contributions
-+0.014
 on+0.013
 distance+0.013
 by+0.013
PI+0.012
Neg logit contributions
 advoc-0.044
 rul-0.043
 mathemat-0.043
 Citiz-0.041
 tyr-0.040
Token America
Token ID2253
Feature activation-1.16e+00

Pos logit contributions
 advoc+0.715
 rul+0.690
 Citiz+0.684
 mathemat+0.677
 embr+0.644
Neg logit contributions
--0.239
 on-0.231
 distance-0.231
 ass-0.228
 by-0.227
Token in
Token ID287
Feature activation-6.66e-02

Pos logit contributions
 advoc+1.945
 rul+1.888
 mathemat+1.874
 Citiz+1.839
 conclud+1.774
Neg logit contributions
--0.565
 distance-0.540
PI-0.536
 ass-0.514
 your-0.511
Token Santa
Token ID8909
Feature activation+9.44e-01

Pos logit contributions
 advoc+0.116
 rul+0.112
 mathemat+0.111
 Citiz+0.110
 tyr+0.105
Neg logit contributions
--0.030
PI-0.028
 distance-0.028
 on-0.027
 by-0.027
Token Clara
Token ID27443
Feature activation+2.17e+00

Pos logit contributions
-+0.556
 distance+0.535
PI+0.518
 on+0.513
 your+0.513
Neg logit contributions
 advoc-1.518
 rul-1.478
 mathemat-1.465
 Citiz-1.430
 tyr-1.378
Token,
Token ID11
Feature activation+3.44e-01

Pos logit contributions
-+0.881
 distance+0.795
PI+0.772
 on+0.762
 your+0.760
Neg logit contributions
 advoc-3.954
 rul-3.847
 mathemat-3.798
 Citiz-3.738
 conclud-3.603
Token California
Token ID3442
Feature activation+5.73e-01

Pos logit contributions
-+0.223
 distance+0.204
PI+0.202
 on+0.198
erm+0.197
Neg logit contributions
 advoc-0.544
 rul-0.527
 mathemat-0.515
 Citiz-0.501
 conclud-0.484
Token was
Token ID373
Feature activation+5.36e-02

Pos logit contributions
-+0.160
 distance+0.140
PI+0.137
 your+0.127
erm+0.124
Neg logit contributions
 advoc-1.107
 rul-1.081
 mathemat-1.069
 Citiz-1.048
 conclud-1.012
Token originally
Token ID6198
Feature activation+4.94e-01

Pos logit contributions
-+0.025
PI+0.024
 distance+0.023
 on+0.022
123+0.022
Neg logit contributions
 advoc-0.093
 rul-0.091
 mathemat-0.089
 Citiz-0.088
 tyr-0.084
Token called
Token ID1444
Feature activation+3.46e-01

Pos logit contributions
-+0.254
PI+0.238
 distance+0.235
 on+0.227
123+0.221
Neg logit contributions
 advoc-0.841
 rul-0.818
 mathemat-0.809
 Citiz-0.783
 tyr-0.755
Token former
Token ID1966
Feature activation-2.49e-01

Pos logit contributions
-+0.047
 distance+0.045
 on+0.044
PI+0.044
123+0.043
Neg logit contributions
 advoc-0.124
 rul-0.120
 mathemat-0.118
 Citiz-0.115
 tyr-0.111
Token President
Token ID1992
Feature activation+3.82e-01

Pos logit contributions
 advoc+0.404
 rul+0.387
 mathemat+0.382
 Citiz+0.372
 tyr+0.360
Neg logit contributions
--0.145
 on-0.141
 distance-0.139
erm-0.138
PI-0.138
Token Barack
Token ID8732
Feature activation+8.84e-02

Pos logit contributions
 distance+0.221
PI+0.221
 on+0.213
-+0.212
erm+0.209
Neg logit contributions
 advoc-0.618
 rul-0.595
 mathemat-0.579
 Citiz-0.562
 tyr-0.546
Token Obama
Token ID2486
Feature activation+5.23e-01

Pos logit contributions
-+0.055
 distance+0.054
PI+0.052
 on+0.052
 by+0.051
Neg logit contributions
 advoc-0.136
 rul-0.132
 mathemat-0.129
 Citiz-0.126
 embr-0.120
Token Barack
Token ID8732
Feature activation+1.42e+00

Pos logit contributions
-+0.113
 distance+0.104
PI+0.101
 your+0.093
123+0.083
Neg logit contributions
 advoc-1.043
 rul-1.019
 mathemat-1.006
 Citiz-0.969
 conclud-0.958
Token Hussein
Token ID22983
Feature activation+2.16e+00

Pos logit contributions
-+1.100
 on+1.058
 by+1.030
 distance+1.029
 your+1.007
Neg logit contributions
 advoc-2.090
 rul-2.017
 mathemat-2.011
 Citiz-1.929
 tyr-1.822
Token Obama
Token ID2486
Feature activation+1.28e+00

Pos logit contributions
-+1.374
 distance+1.302
 on+1.297
 by+1.271
PI+1.251
Neg logit contributions
 advoc-3.507
 rul-3.358
 mathemat-3.271
 Citiz-3.209
 conclud-3.077
TokenWith
Token ID3152
Feature activation+1.59e-01

Pos logit contributions
-+0.854
 distance+0.851
 on+0.837
 by+0.833
 your+0.826
Neg logit contributions
 advoc-1.974
 rul-1.882
 mathemat-1.857
 Citiz-1.802
 conclud-1.728
Token low
Token ID1877
Feature activation+1.12e+00

Pos logit contributions
-+0.087
 distance+0.083
 on+0.083
PI+0.081
 by+0.080
Neg logit contributions
 advoc-0.268
 mathemat-0.257
 rul-0.255
 Citiz-0.250
 conclud-0.236
Token birth
Token ID4082
Feature activation+1.70e+00

Pos logit contributions
-+0.458
 on+0.438
 distance+0.437
 by+0.419
 your+0.401
Neg logit contributions
 advoc-2.034
 rul-1.979
 mathemat-1.948
 Citiz-1.895
 conclud-1.826
Token rate
Token ID2494
Feature activation+6.71e-01

Pos logit contributions
 on+0.637
 auto+0.613
-+0.612
 distance+0.597
 by+0.585
Neg logit contributions
 advoc-3.095
 rul-3.003
 mathemat-2.904
 Citiz-2.816
 tyr-2.743
Token will
Token ID481
Feature activation+4.10e-01

Pos logit contributions
-+0.170
 distance+0.163
PI+0.163
123+0.152
 your+0.146
Neg logit contributions
 advoc-0.764
 rul-0.749
 mathemat-0.735
 Citiz-0.696
 conclud-0.692
Token be
Token ID307
Feature activation+3.45e-01

Pos logit contributions
-+0.180
PI+0.176
123+0.160
 distance+0.157
 on+0.154
Neg logit contributions
 advoc-0.711
 rul-0.701
 mathemat-0.685
 tyr-0.658
 conclud-0.657
Token the
Token ID262
Feature activation+1.54e-02

Pos logit contributions
-+0.151
 distance+0.143
PI+0.132
123+0.130
erm+0.127
Neg logit contributions
 advoc-0.621
 rul-0.607
 mathemat-0.597
 Citiz-0.575
 tyr-0.560
Token equivalent
Token ID7548
Feature activation+3.49e-02

Pos logit contributions
-+0.007
PI+0.006
 on+0.006
 distance+0.006
 by+0.006
Neg logit contributions
 advoc-0.027
 rul-0.027
 mathemat-0.026
 Citiz-0.025
 tyr-0.025
Token of
Token ID286
Feature activation+4.34e-02

Pos logit contributions
-+0.005
 on+0.004
 distance+0.003
 by+0.003
 your+0.003
Neg logit contributions
 advoc-0.073
 rul-0.072
 mathemat-0.071
 Citiz-0.069
 conclud-0.068
Token Loving
Token ID39202
Feature activation+2.08e+00

Pos logit contributions
-+0.014
PI+0.013
 distance+0.012
 on+0.012
 by+0.012
Neg logit contributions
 advoc-0.082
 rul-0.081
 mathemat-0.079
 Citiz-0.077
 tyr-0.075
Token v
Token ID410
Feature activation-4.28e-01

Pos logit contributions
-+0.763
 distance+0.701
 on+0.669
PI+0.648
 your+0.641
Neg logit contributions
 advoc-3.913
 rul-3.832
 mathemat-3.769
 Citiz-3.644
 conclud-3.585
Token.
Token ID13
Feature activation-1.79e-01

Pos logit contributions
 advoc+0.876
 rul+0.863
 mathemat+0.844
 Citiz+0.837
 conclud+0.807
Neg logit contributions
--0.059
 distance-0.046
 on-0.041
PI-0.035
 by-0.033
Token Virginia
Token ID6025
Feature activation-7.61e-01

Pos logit contributions
 advoc+0.265
 rul+0.262
 mathemat+0.251
 Citiz+0.247
 embr+0.236
Neg logit contributions
--0.126
PI-0.119
 distance-0.119
 on-0.118
 by-0.115
Token,
Token ID11
Feature activation-1.39e-01

Pos logit contributions
 advoc+1.493
 rul+1.488
 mathemat+1.444
 Citiz+1.395
 conclud+1.391
Neg logit contributions
--0.161
 distance-0.149
PI-0.136
123-0.127
 tour-0.120
Token which
Token ID543
Feature activation+3.77e-02

Pos logit contributions
 advoc+0.268
 rul+0.265
 mathemat+0.260
 Citiz+0.252
 conclud+0.247
Neg logit contributions
--0.042
 distance-0.034
PI-0.033
 tour-0.030
 on-0.030
Token world
Token ID995
Feature activation+9.52e-02

Pos logit contributions
-+0.259
PI+0.258
 distance+0.247
 on+0.240
123+0.240
Neg logit contributions
 advoc-0.831
 rul-0.825
 mathemat-0.801
 Citiz-0.766
 embr-0.742
Token with
Token ID351
Feature activation+4.00e-01

Pos logit contributions
 distance+0.026
-+0.025
PI+0.023
 tour+0.021
 auto+0.021
Neg logit contributions
 advoc-0.174
 rul-0.172
 Citiz-0.165
 mathemat-0.164
 embr-0.159
Token kindred
Token ID46449
Feature activation+5.33e-01

Pos logit contributions
-+0.419
 on+0.404
 distance+0.397
 by+0.395
PI+0.393
Neg logit contributions
 advoc-0.481
 rul-0.460
 mathemat-0.450
 Citiz-0.432
 conclud-0.410
Token spirits
Token ID13747
Feature activation+1.56e-01

Pos logit contributions
-+0.329
 distance+0.309
 on+0.308
PI+0.303
 by+0.301
Neg logit contributions
 advoc-0.853
 rul-0.826
 mathemat-0.800
 Citiz-0.782
 embr-0.753
Token.
Token ID13
Feature activation+2.15e-01

Pos logit contributions
 distance+0.028
-+0.026
 hunting+0.022
 tour+0.021
 safely+0.021
Neg logit contributions
 advoc-0.292
 rul-0.289
 mathemat-0.278
 Citiz-0.276
 embr-0.271
Token
Token ID198
Feature activation+2.04e+00

Pos logit contributions
-+0.087
 distance+0.083
 on+0.079
 by+0.076
 safely+0.074
Neg logit contributions
 advoc-0.375
 rul-0.368
 Citiz-0.357
 mathemat-0.356
 embr-0.340
Token
Token ID198
Feature activation+1.78e+00

Pos logit contributions
-+0.823
 distance+0.812
 on+0.758
 by+0.719
 ass+0.718
Neg logit contributions
 advoc-3.515
 rul-3.441
 mathemat-3.345
 Citiz-3.307
 embr-3.197
TokenSign
Token ID11712
Feature activation+7.19e-01

Pos logit contributions
-+1.060
 distance+1.007
 on+0.998
 by+0.969
 your+0.948
Neg logit contributions
 advoc-2.915
 rul-2.809
 mathemat-2.774
 Citiz-2.737
 conclud-2.607
Token up
Token ID510
Feature activation+1.29e+00

Pos logit contributions
 distance+0.163
-+0.162
PI+0.143
 on+0.131
 by+0.128
Neg logit contributions
 advoc-1.369
 rul-1.349
 mathemat-1.306
 Citiz-1.294
 embr-1.251
Token for
Token ID329
Feature activation+4.45e-01

Pos logit contributions
-+0.302
 distance+0.278
PI+0.275
 ass+0.226
 on+0.212
Neg logit contributions
 advoc-2.449
 rul-2.418
 mathemat-2.330
 Citiz-2.297
 conclud-2.237
Token our
Token ID674
Feature activation+3.26e-01

Pos logit contributions
-+0.189
 distance+0.179
PI+0.176
123+0.173
erm+0.169
Neg logit contributions
 advoc-0.768
 rul-0.746
 Citiz-0.738
 mathemat-0.734
 tyr-0.693
Token sleeve
Token ID23179
Feature activation-4.60e-01

Pos logit contributions
 advoc+0.340
 rul+0.335
 mathemat+0.331
 Citiz+0.322
 tyr+0.313
Neg logit contributions
--0.051
 distance-0.041
 on-0.041
 by-0.039
PI-0.037
Token at
Token ID379
Feature activation+2.12e-01

Pos logit contributions
 advoc+0.769
 rul+0.743
 mathemat+0.737
 Citiz+0.730
 conclud+0.686
Neg logit contributions
--0.253
 distance-0.231
PI-0.228
 on-0.223
 your-0.220
Token basketball
Token ID9669
Feature activation-4.22e-01

Pos logit contributions
-+0.065
 on+0.054
 distance+0.054
 by+0.051
 in+0.046
Neg logit contributions
 advoc-0.413
 rul-0.398
 mathemat-0.397
 Citiz-0.386
 tyr-0.375
Token game
Token ID983
Feature activation+1.96e-02

Pos logit contributions
 advoc+0.703
 rul+0.688
 mathemat+0.685
 Citiz+0.660
 conclud+0.639
Neg logit contributions
--0.209
PI-0.202
 by-0.200
 on-0.198
 distance-0.197
Token MORE
Token ID12011
Feature activation+6.79e-01

Pos logit contributions
-+0.011
 distance+0.011
 on+0.010
PI+0.010
 by+0.010
Neg logit contributions
 advoc-0.032
 rul-0.031
 mathemat-0.031
 Citiz-0.030
 tyr-0.029
Token and
Token ID290
Feature activation+2.03e+00

Pos logit contributions
-+0.150
 distance+0.149
PI+0.126
123+0.124
 your+0.123
Neg logit contributions
 advoc-1.341
 rul-1.301
 mathemat-1.285
 Citiz-1.226
 conclud-1.221
Token Hillary
Token ID5073
Feature activation+1.18e+00

Pos logit contributions
-+1.012
PI+0.917
 on+0.904
 distance+0.903
 by+0.862
Neg logit contributions
 advoc-3.587
 rul-3.467
 mathemat-3.420
 Citiz-3.319
 tyr-3.225
Token Clinton
Token ID2605
Feature activation+6.38e-01

Pos logit contributions
-+0.812
 distance+0.789
 on+0.762
PI+0.759
 your+0.749
Neg logit contributions
 advoc-1.831
 rul-1.748
 mathemat-1.733
 Citiz-1.666
 conclud-1.611
Token Hillary
Token ID5073
Feature activation+1.91e+00

Pos logit contributions
-+0.391
 distance+0.374
123+0.364
PI+0.364
 on+0.355
Neg logit contributions
 advoc-1.031
 rul-0.995
 mathemat-0.981
 Citiz-0.943
 conclud-0.917
Token Diane
Token ID26542
Feature activation+1.44e+00

Pos logit contributions
-+1.520
 on+1.442
 distance+1.414
 by+1.395
 your+1.374
Neg logit contributions
 advoc-2.793
 rul-2.699
 mathemat-2.679
 Citiz-2.574
 conclud-2.466
Token Rodham
Token ID41848
Feature activation+1.71e+00

Pos logit contributions
-+0.948
 on+0.885
 distance+0.885
PI+0.864
 by+0.863
Neg logit contributions
 advoc-2.268
 rul-2.170
 mathemat-2.151
 Citiz-2.068
 conclud-1.991
Token 10
Token ID838
Feature activation-2.48e-01

Pos logit contributions
 advoc+0.766
 rul+0.732
 mathemat+0.710
 Citiz+0.692
 conclud+0.689
Neg logit contributions
--0.193
PI-0.183
 distance-0.183
 solely-0.177
 by-0.177
Token years
Token ID812
Feature activation-2.65e-02

Pos logit contributions
 advoc+0.441
 rul+0.430
 mathemat+0.421
 Citiz+0.407
 tyr+0.404
Neg logit contributions
--0.104
 distance-0.099
PI-0.098
 on-0.094
 by-0.094
Token in
Token ID287
Feature activation+2.84e-02

Pos logit contributions
 advoc+0.053
 rul+0.052
 mathemat+0.051
 Citiz+0.051
 tyr+0.049
Neg logit contributions
--0.005
 distance-0.004
PI-0.003
123-0.003
 your-0.003
Token prison
Token ID3770
Feature activation-5.25e-01

Pos logit contributions
-+0.015
 distance+0.013
PI+0.013
 on+0.013
 specifically+0.013
Neg logit contributions
 advoc-0.048
 rul-0.046
 mathemat-0.045
 Citiz-0.044
 tyr-0.044
Token MORE
Token ID12011
Feature activation+4.91e-01

Pos logit contributions
 advoc+0.911
 rul+0.885
 mathemat+0.878
 Citiz+0.865
 tyr+0.850
Neg logit contributions
--0.216
 distance-0.192
PI-0.192
ering-0.179
 your-0.179
Token and
Token ID290
Feature activation+1.96e+00

Pos logit contributions
 distance+0.103
-+0.102
PI+0.096
123+0.090
erm+0.087
Neg logit contributions
 advoc-0.975
 rul-0.940
 mathemat-0.934
 Citiz-0.888
 conclud-0.882
Token tick
Token ID4378
Feature activation+4.78e-01

Pos logit contributions
-+1.037
PI+0.967
 distance+0.935
 on+0.932
123+0.906
Neg logit contributions
 advoc-3.370
 rul-3.250
 mathemat-3.222
 Citiz-3.102
 conclud-3.042
Tokened
Token ID276
Feature activation+1.33e+00

Pos logit contributions
 distance+0.116
-+0.115
PI+0.108
 on+0.102
 by+0.099
Neg logit contributions
 advoc-0.924
 rul-0.899
 mathemat-0.892
 Citiz-0.871
 conclud-0.844
Token through
Token ID832
Feature activation-3.83e-01

Pos logit contributions
-+0.382
PI+0.376
 distance+0.368
123+0.338
 by+0.323
Neg logit contributions
 advoc-2.553
 rul-2.460
 mathemat-2.423
 Citiz-2.376
 tyr-2.296
Token her
Token ID607
Feature activation+5.34e-01

Pos logit contributions
 advoc+0.723
 rul+0.692
 mathemat+0.680
 Citiz+0.656
 conclud+0.644
Neg logit contributions
--0.128
PI-0.123
 distance-0.115
123-0.106
 on-0.103
Token usual
Token ID6678
Feature activation-4.92e-01

Pos logit contributions
-+0.253
PI+0.235
 on+0.226
 distance+0.224
123+0.223
Neg logit contributions
 advoc-0.951
 rul-0.909
 mathemat-0.894
 Citiz-0.860
 tyr-0.836
Token game
Token ID983
Feature activation+1.96e-02

Pos logit contributions
 advoc+0.703
 rul+0.688
 mathemat+0.685
 Citiz+0.660
 conclud+0.639
Neg logit contributions
--0.209
PI-0.202
 by-0.200
 on-0.198
 distance-0.197
Token MORE
Token ID12011
Feature activation+6.79e-01

Pos logit contributions
-+0.011
 distance+0.011
 on+0.010
PI+0.010
 by+0.010
Neg logit contributions
 advoc-0.032
 rul-0.031
 mathemat-0.031
 Citiz-0.030
 tyr-0.029
Token and
Token ID290
Feature activation+2.03e+00

Pos logit contributions
-+0.150
 distance+0.149
PI+0.126
123+0.124
 your+0.123
Neg logit contributions
 advoc-1.341
 rul-1.301
 mathemat-1.285
 Citiz-1.226
 conclud-1.221
Token Hillary
Token ID5073
Feature activation+1.18e+00

Pos logit contributions
-+1.012
PI+0.917
 on+0.904
 distance+0.903
 by+0.862
Neg logit contributions
 advoc-3.587
 rul-3.467
 mathemat-3.420
 Citiz-3.319
 tyr-3.225
Token Clinton
Token ID2605
Feature activation+6.38e-01

Pos logit contributions
-+0.812
 distance+0.789
 on+0.762
PI+0.759
 your+0.749
Neg logit contributions
 advoc-1.831
 rul-1.748
 mathemat-1.733
 Citiz-1.666
 conclud-1.611
Token Hillary
Token ID5073
Feature activation+1.91e+00

Pos logit contributions
-+0.391
 distance+0.374
123+0.364
PI+0.364
 on+0.355
Neg logit contributions
 advoc-1.031
 rul-0.995
 mathemat-0.981
 Citiz-0.943
 conclud-0.917
Token Diane
Token ID26542
Feature activation+1.44e+00

Pos logit contributions
-+1.520
 on+1.442
 distance+1.414
 by+1.395
 your+1.374
Neg logit contributions
 advoc-2.793
 rul-2.699
 mathemat-2.679
 Citiz-2.574
 conclud-2.466
Token Rodham
Token ID41848
Feature activation+1.71e+00

Pos logit contributions
-+0.948
 on+0.885
 distance+0.885
PI+0.864
 by+0.863
Neg logit contributions
 advoc-2.268
 rul-2.170
 mathemat-2.151
 Citiz-2.068
 conclud-1.991
Token Clinton
Token ID2605
Feature activation+1.28e+00

Pos logit contributions
-+1.216
 distance+1.139
 on+1.123
PI+1.092
 your+1.087
Neg logit contributions
 advoc-2.610
 rul-2.529
 mathemat-2.455
 Citiz-2.382
 conclud-2.311
TokenSanders
Token ID26747
Feature activation+1.07e+00

Pos logit contributions
-+1.063
 distance+1.027
 on+1.015
 by+0.995
 your+0.984
Neg logit contributions
 advoc-1.816
 rul-1.726
 mathemat-1.705
 Citiz-1.645
 conclud-1.575
Token:
Token ID25
Feature activation+1.32e+00

Pos logit contributions
-+0.969
 on+0.918
 distance+0.904
 by+0.901
 your+0.889
Neg logit contributions
 advoc-1.449
 rul-1.394
 mathemat-1.375
 Citiz-1.319
 conclud-1.277
Token
Token ID198
Feature activation-1.56e-01

Pos logit contributions
 advoc+0.706
 rul+0.688
 mathemat+0.672
 Citiz+0.669
 conclud+0.635
Neg logit contributions
--0.175
 distance-0.173
 on-0.160
 safely-0.155
 tour-0.152
Token
Token ID198
Feature activation-1.97e-01

Pos logit contributions
 advoc+0.255
 rul+0.251
 mathemat+0.242
 Citiz+0.240
 embr+0.233
Neg logit contributions
 distance-0.068
--0.065
 on-0.061
 hunting-0.061
 tour-0.061
Token
Token ID447
Feature activation-4.79e-01

Pos logit contributions
 advoc+0.309
 rul+0.294
 mathemat+0.292
 Citiz+0.285
 embr+0.270
Neg logit contributions
 distance-0.126
 on-0.117
--0.115
 tour-0.114
 by-0.114
Token
Token ID250
Feature activation-2.20e-01

Pos logit contributions
 advoc+0.621
 rul+0.603
 mathemat+0.583
 Citiz+0.557
 tyr+0.535
Neg logit contributions
--0.454
 distance-0.442
 on-0.437
 by-0.427
 tour-0.423
TokenPar
Token ID10044
Feature activation+5.66e-01

Pos logit contributions
 advoc+0.379
 rul+0.367
 mathemat+0.359
 Citiz+0.351
 conclud+0.339
Neg logit contributions
--0.112
 distance-0.107
 on-0.105
 by-0.100
 your-0.100
Tokenliament
Token ID5130
Feature activation+1.90e+00

Pos logit contributions
-+0.626
 distance+0.617
PI+0.607
 on+0.606
 your+0.596
Neg logit contributions
 advoc-0.630
 rul-0.612
 mathemat-0.593
 Citiz-0.564
 tyr-0.542
Token operates
Token ID14051
Feature activation-1.04e-01

Pos logit contributions
-+0.879
 distance+0.820
PI+0.801
 on+0.777
 your+0.768
Neg logit contributions
 advoc-3.364
 rul-3.305
 mathemat-3.230
 Citiz-3.170
 conclud-3.061
Token in
Token ID287
Feature activation+1.49e-01

Pos logit contributions
 advoc+0.202
 rul+0.201
 mathemat+0.197
 Citiz+0.192
 tyr+0.186
Neg logit contributions
--0.027
 distance-0.024
PI-0.022
erm-0.020
123-0.019
Token both
Token ID1111
Feature activation-2.12e-01

Pos logit contributions
-+0.051
PI+0.048
 distance+0.047
 on+0.047
 by+0.045
Neg logit contributions
 advoc-0.277
 rul-0.271
 mathemat-0.269
 Citiz-0.258
 tyr-0.252
Token official
Token ID1743
Feature activation-5.64e-01

Pos logit contributions
 advoc+0.346
 rul+0.337
 mathemat+0.330
 Citiz+0.316
 tyr+0.306
Neg logit contributions
--0.126
 distance-0.120
PI-0.118
 on-0.118
123-0.116
Token languages
Token ID8950
Feature activation-3.82e-01

Pos logit contributions
 advoc+0.931
 rul+0.909
 mathemat+0.897
 Citiz+0.860
 tyr+0.828
Neg logit contributions
--0.315
 distance-0.307
 on-0.302
PI-0.298
 your-0.292
Token two
Token ID734
Feature activation-8.46e-02

Pos logit contributions
 advoc+0.482
 rul+0.466
 mathemat+0.457
 Citiz+0.442
 tyr+0.425
Neg logit contributions
--0.087
 on-0.077
PI-0.076
123-0.075
 distance-0.075
Token parties
Token ID4671
Feature activation-9.42e-01

Pos logit contributions
 advoc+0.153
 rul+0.147
 mathemat+0.144
 Citiz+0.138
 tyr+0.134
Neg logit contributions
--0.037
PI-0.036
 distance-0.035
123-0.034
 on-0.033
Token.
Token ID13
Feature activation-4.58e-01

Pos logit contributions
 advoc+1.863
 rul+1.829
 mathemat+1.793
 Citiz+1.727
 conclud+1.710
Neg logit contributions
 distance-0.170
--0.168
PI-0.166
 your-0.141
123-0.137
Token
Token ID198
Feature activation-1.63e-01

Pos logit contributions
 advoc+0.785
 rul+0.770
 Citiz+0.754
 mathemat+0.746
 conclud+0.710
Neg logit contributions
--0.210
 distance-0.202
 safely-0.195
 your-0.192
 on-0.192
Token
Token ID198
Feature activation-4.72e-01

Pos logit contributions
 advoc+0.277
 rul+0.271
 Citiz+0.263
 mathemat+0.261
 conclud+0.248
Neg logit contributions
 distance-0.069
--0.067
 on-0.064
 safely-0.062
 tour-0.062
TokenPrep
Token ID37534
Feature activation+1.86e+00

Pos logit contributions
 advoc+0.737
 rul+0.706
 mathemat+0.694
 Citiz+0.688
 embr+0.640
Neg logit contributions
 distance-0.287
 on-0.271
 your-0.269
 tour-0.267
 auto-0.266
Tokenaring
Token ID1723
Feature activation+7.18e-01

Pos logit contributions
-+0.911
 distance+0.861
 on+0.860
 by+0.824
 your+0.801
Neg logit contributions
 advoc-3.242
 rul-3.184
 mathemat-3.108
 Citiz-3.050
 conclud-2.945
Token for
Token ID329
Feature activation-2.18e-02

Pos logit contributions
-+0.171
PI+0.141
 distance+0.139
 on+0.135
 ass+0.123
Neg logit contributions
 advoc-1.434
 rul-1.406
 mathemat-1.368
 Citiz-1.350
 conclud-1.300
Token independence
Token ID10404
Feature activation+3.39e-01

Pos logit contributions
 advoc+0.040
 rul+0.040
 mathemat+0.039
 Citiz+0.038
 tyr+0.037
Neg logit contributions
--0.008
PI-0.008
 on-0.007
 distance-0.007
 by-0.007
Token:
Token ID25
Feature activation+1.94e-01

Pos logit contributions
-+0.103
PI+0.097
 distance+0.096
 your+0.089
 on+0.085
Neg logit contributions
 advoc-0.648
 rul-0.639
 mathemat-0.618
 Citiz-0.607
 conclud-0.589
Token 1961
Token ID20510
Feature activation+2.29e-01

Pos logit contributions
-+0.094
 distance+0.082
PI+0.081
 on+0.081
 safely+0.078
Neg logit contributions
 advoc-0.342
 rul-0.337
 mathemat-0.330
 Citiz-0.322
 conclud-0.310
Token 10
Token ID838
Feature activation-2.09e-01

Pos logit contributions
 advoc+0.765
 rul+0.731
 mathemat+0.714
 Citiz+0.695
 conclud+0.689
Neg logit contributions
--0.189
PI-0.186
 distance-0.181
 by-0.172
 on-0.171
Token years
Token ID812
Feature activation-4.35e-02

Pos logit contributions
 advoc+0.374
 rul+0.365
 mathemat+0.358
 Citiz+0.346
 conclud+0.341
Neg logit contributions
--0.088
 distance-0.083
PI-0.083
 on-0.081
 by-0.080
Token in
Token ID287
Feature activation-1.48e-01

Pos logit contributions
 advoc+0.086
 rul+0.084
 mathemat+0.082
 Citiz+0.082
 tyr+0.078
Neg logit contributions
--0.009
 distance-0.009
PI-0.007
 your-0.007
 by-0.007
Token prison
Token ID3770
Feature activation-4.77e-01

Pos logit contributions
 advoc+0.249
 rul+0.241
 mathemat+0.235
 Citiz+0.227
 tyr+0.225
Neg logit contributions
--0.076
PI-0.071
 distance-0.071
 on-0.069
 specifically-0.068
Token MORE
Token ID12011
Feature activation+6.85e-01

Pos logit contributions
 advoc+0.809
 rul+0.788
 mathemat+0.778
 Citiz+0.768
 tyr+0.754
Neg logit contributions
--0.214
 distance-0.194
PI-0.193
 your-0.182
ering-0.181
Token,
Token ID11
Feature activation+1.84e+00

Pos logit contributions
 distance+0.168
-+0.166
PI+0.148
 your+0.140
123+0.139
Neg logit contributions
 advoc-1.334
 rul-1.295
 mathemat-1.281
 Citiz-1.234
 conclud-1.210
Token in
Token ID287
Feature activation+1.09e+00

Pos logit contributions
-+0.931
 distance+0.840
PI+0.839
 on+0.787
 by+0.786
Neg logit contributions
 advoc-3.212
 rul-3.119
 mathemat-3.086
 Citiz-2.974
 conclud-2.900
Token his
Token ID465
Feature activation+3.90e-01

Pos logit contributions
-+0.312
 distance+0.268
 on+0.260
PI+0.258
 by+0.246
Neg logit contributions
 advoc-2.128
 rul-2.072
 mathemat-2.050
 Citiz-2.000
 conclud-1.940
Token weekly
Token ID10273
Feature activation-2.07e-01

Pos logit contributions
-+0.189
PI+0.173
 distance+0.172
 on+0.169
 by+0.165
Neg logit contributions
 advoc-0.679
 rul-0.662
 mathemat-0.648
 Citiz-0.635
 conclud-0.607
Token column
Token ID5721
Feature activation+7.05e-01

Pos logit contributions
 advoc+0.377
 rul+0.366
 mathemat+0.364
 Citiz+0.351
 conclud+0.341
Neg logit contributions
--0.079
 distance-0.074
PI-0.073
 on-0.070
123-0.070
Token Wednesday
Token ID3583
Feature activation-2.58e-01

Pos logit contributions
-+0.198
 distance+0.186
PI+0.179
 your+0.158
123+0.155
Neg logit contributions
 advoc-1.369
 rul-1.326
 mathemat-1.317
 Citiz-1.282
 conclud-1.249
Token University
Token ID2059
Feature activation-1.07e-01

Pos logit contributions
 advoc+0.400
 mathemat+0.389
 rul+0.387
 Citiz+0.371
 conclud+0.355
Neg logit contributions
--0.163
 on-0.154
 distance-0.149
 by-0.149
 ass-0.148
Token of
Token ID286
Feature activation+7.88e-01

Pos logit contributions
 advoc+0.199
 rul+0.197
 Citiz+0.194
 mathemat+0.192
 embr+0.182
Neg logit contributions
 distance-0.022
--0.020
 hunting-0.018
 ass-0.018
PI-0.018
Token California
Token ID3442
Feature activation+3.31e-02

Pos logit contributions
-+0.518
 distance+0.500
 on+0.491
 by+0.477
 ass+0.477
Neg logit contributions
 advoc-1.211
 rul-1.175
 mathemat-1.153
 Citiz-1.145
 conclud-1.077
Token,
Token ID11
Feature activation+8.46e-01

Pos logit contributions
 distance+0.007
-+0.006
 on+0.006
 ass+0.005
 your+0.005
Neg logit contributions
 advoc-0.065
 mathemat-0.064
 rul-0.064
 Citiz-0.062
 conclud-0.060
Token Santa
Token ID8909
Feature activation+1.33e+00

Pos logit contributions
-+0.507
 distance+0.488
 on+0.470
 ass+0.456
 your+0.454
Neg logit contributions
 advoc-1.351
 mathemat-1.323
 rul-1.322
 Citiz-1.263
 conclud-1.224
Token Barbara
Token ID16685
Feature activation+1.84e+00

Pos logit contributions
-+0.850
 distance+0.840
 on+0.811
 your+0.803
 ass+0.802
Neg logit contributions
 advoc-2.050
 rul-1.980
 mathemat-1.980
 Citiz-1.917
 conclud-1.846
Token,
Token ID11
Feature activation+3.56e-01

Pos logit contributions
-+0.337
 distance+0.308
 on+0.262
 your+0.244
 by+0.228
Neg logit contributions
 advoc-3.713
 mathemat-3.656
 rul-3.647
 Citiz-3.522
 conclud-3.475
Token and
Token ID290
Feature activation-1.27e-01

Pos logit contributions
-+0.126
 distance+0.110
 on+0.098
PI+0.097
 your+0.095
Neg logit contributions
 advoc-0.667
 rul-0.650
 mathemat-0.648
 Citiz-0.627
 conclud-0.616
Token an
Token ID281
Feature activation-2.34e-01

Pos logit contributions
 advoc+0.248
 rul+0.241
 mathemat+0.238
 Citiz+0.232
 conclud+0.226
Neg logit contributions
--0.038
 distance-0.031
 on-0.030
PI-0.029
 by-0.028
Token organizer
Token ID26311
Feature activation+2.99e-01

Pos logit contributions
 advoc+0.399
 mathemat+0.379
 rul+0.379
 Citiz+0.367
 conclud+0.351
Neg logit contributions
--0.128
 distance-0.119
 by-0.116
 on-0.115
 your-0.113
Token of
Token ID286
Feature activation-4.14e-01

Pos logit contributions
 distance+0.054
-+0.053
PI+0.045
 your+0.043
123+0.042
Neg logit contributions
 advoc-0.599
 rul-0.581
 mathemat-0.571
 Citiz-0.563
 conclud-0.547
Token-
Token ID12
Feature activation+1.02e-01

Pos logit contributions
 advoc+0.762
 rul+0.745
 mathemat+0.732
 embr+0.711
 Citiz+0.708
Neg logit contributions
PI-0.080
123-0.068
 your-0.066
--0.064
 distance-0.063
Tokenand
Token ID392
Feature activation-3.72e-01

Pos logit contributions
-+0.065
 on+0.063
PI+0.063
 distance+0.062
 Oct+0.062
Neg logit contributions
 advoc-0.154
 rul-0.151
 mathemat-0.146
 Citiz-0.141
 conclud-0.140
Token-
Token ID12
Feature activation-5.73e-01

Pos logit contributions
 advoc+0.795
 rul+0.785
 mathemat+0.771
 Citiz+0.753
 embr+0.734
Neg logit contributions
 on-0.001
--0.000
 distance+0.002
PI+0.003
 by+0.006
Tokena
Token ID64
Feature activation-1.63e+00

Pos logit contributions
 advoc+0.996
 rul+0.990
 mathemat+0.961
 Citiz+0.954
 embr+0.920
Neg logit contributions
--0.200
 distance-0.197
 on-0.197
 solely-0.186
 Oct-0.186
Token-
Token ID12
Feature activation-3.20e-01

Pos logit contributions
 advoc+3.335
 rul+3.294
 mathemat+3.223
 Citiz+3.158
 embr+3.089
Neg logit contributions
 on-0.048
 distance-0.036
PI-0.033
 by-0.021
 your-0.012
Tokenhalf
Token ID13959
Feature activation-2.18e+00

Pos logit contributions
 advoc+0.490
 rul+0.480
 mathemat+0.466
 Citiz+0.457
 embr+0.438
Neg logit contributions
--0.190
 on-0.187
 distance-0.185
 by-0.181
 your-0.180
Token-
Token ID12
Feature activation-5.58e-02

Pos logit contributions
 advoc+4.027
 rul+3.942
 mathemat+3.850
 embr+3.776
 Citiz+3.690
Neg logit contributions
PI-0.607
123-0.517
erm-0.468
 Oct-0.438
 your-0.430
Tokendeep
Token ID22089
Feature activation+2.99e-01

Pos logit contributions
 advoc+0.082
 rul+0.080
 mathemat+0.078
 Citiz+0.075
 embr+0.074
Neg logit contributions
--0.037
 on-0.036
PI-0.036
 your-0.035
 Oct-0.035
Token "
Token ID366
Feature activation-3.60e-01

Pos logit contributions
PI+0.138
-+0.135
 on+0.127
 distance+0.126
 your+0.124
Neg logit contributions
 advoc-0.517
 rul-0.506
 mathemat-0.492
 Citiz-0.475
 embr-0.474
Tokensource
Token ID10459
Feature activation-3.68e-01

Pos logit contributions
 advoc+0.595
 rul+0.576
 mathemat+0.566
 Citiz+0.550
 embr+0.533
Neg logit contributions
--0.193
 distance-0.192
 on-0.187
 by-0.180
 your-0.179
Token rock
Token ID3881
Feature activation+5.64e-01

Pos logit contributions
 advoc+0.687
 rul+0.672
 mathemat+0.658
 Citiz+0.644
 embr+0.627
Neg logit contributions
PI-0.110
--0.103
 distance-0.100
 on-0.098
 your-0.092
Token File
Token ID9220
Feature activation-7.65e-01

Pos logit contributions
-+0.025
 on+0.023
 distance+0.023
PI+0.022
 by+0.022
Neg logit contributions
 advoc-0.086
 rul-0.084
 mathemat-0.083
 Citiz-0.081
 tyr-0.077
Token)
Token ID8
Feature activation-2.11e-01

Pos logit contributions
 advoc+1.346
 rul+1.322
 mathemat+1.313
 Citiz+1.276
 tyr+1.226
Neg logit contributions
--0.284
 distance-0.272
 ass-0.261
 safely-0.260
PI-0.259
Token
Token ID198
Feature activation+1.35e-01

Pos logit contributions
 advoc+0.364
 rul+0.356
 mathemat+0.352
 Citiz+0.344
 conclud+0.329
Neg logit contributions
--0.102
 distance-0.096
 on-0.093
 your-0.090
 ass-0.089
Token
Token ID198
Feature activation+2.14e-01

Pos logit contributions
 distance+0.059
-+0.056
 hunting+0.053
 tour+0.053
 on+0.053
Neg logit contributions
 advoc-0.221
 rul-0.217
 mathemat-0.209
 Citiz-0.208
 embr-0.202
TokenL
Token ID43
Feature activation-1.86e-01

Pos logit contributions
 distance+0.131
-+0.125
 on+0.125
 your+0.121
 by+0.121
Neg logit contributions
 advoc-0.336
 rul-0.324
 mathemat-0.322
 Citiz-0.312
 tyr-0.296
TokenANS
Token ID15037
Feature activation-2.09e+00

Pos logit contributions
 advoc+0.299
 rul+0.289
 mathemat+0.288
 Citiz+0.272
 tyr+0.268
Neg logit contributions
 distance-0.109
--0.108
 on-0.105
 by-0.104
 solely-0.102
TokenING
Token ID2751
Feature activation-5.00e-01

Pos logit contributions
 advoc+3.228
 rul+3.098
 mathemat+3.088
 Citiz+3.079
 embr+2.877
Neg logit contributions
 distance-1.123
 ass-1.030
 safely-1.020
--1.013
 stalk-0.991
Token,
Token ID11
Feature activation-8.19e-01

Pos logit contributions
 advoc+0.969
 rul+0.947
 mathemat+0.933
 Citiz+0.918
 tyr+0.885
Neg logit contributions
--0.124
 distance-0.121
 on-0.112
 your-0.104
 by-0.104
Token MI
Token ID15789
Feature activation-7.98e-01

Pos logit contributions
 advoc+1.132
 rul+1.112
 mathemat+1.079
 Citiz+1.063
 tyr+1.007
Neg logit contributions
--0.661
 distance-0.635
 on-0.616
 by-0.605
 ass-0.603
Token --
Token ID1377
Feature activation-2.58e-01

Pos logit contributions
 advoc+1.441
 rul+1.410
 mathemat+1.401
 Citiz+1.357
 tyr+1.314
Neg logit contributions
 distance-0.298
--0.276
 on-0.267
 ass-0.258
 your-0.258
Token The
Token ID383
Feature activation-4.46e-01

Pos logit contributions
 advoc+0.420
 rul+0.409
 mathemat+0.403
 Citiz+0.401
 conclud+0.377
Neg logit contributions
--0.139
 distance-0.136
 on-0.129
 ass-0.128
 safely-0.126
Token-
Token ID12
Feature activation-3.77e-01

Pos logit contributions
 distance+0.048
 on+0.039
-+0.031
 by+0.030
 your+0.028
Neg logit contributions
 advoc-1.082
 rul-1.063
 mathemat-1.055
 Citiz-1.024
 conclud-1.000
Token35
Token ID2327
Feature activation-3.49e-02

Pos logit contributions
 advoc+0.621
 rul+0.610
 mathemat+0.600
 Citiz+0.580
 conclud+0.571
Neg logit contributions
--0.196
 distance-0.192
 on-0.185
 tour-0.181
 by-0.179
Token790
Token ID37750
Feature activation-2.51e-01

Pos logit contributions
 advoc+0.056
 rul+0.056
 mathemat+0.053
 Citiz+0.053
 conclud+0.051
Neg logit contributions
 distance-0.019
--0.018
 on-0.018
 by-0.018
 ass-0.017
Token347
Token ID30995
Feature activation-5.67e-01

Pos logit contributions
 advoc+0.396
 rul+0.394
 mathemat+0.376
 Citiz+0.374
 conclud+0.361
Neg logit contributions
 distance-0.141
--0.131
 on-0.130
 ass-0.128
 tour-0.127
Token.
Token ID13
Feature activation-1.18e+00

Pos logit contributions
 rul+1.053
 advoc+1.049
 mathemat+1.011
 Citiz+0.982
 conclud+0.973
Neg logit contributions
 distance-0.139
 safely-0.115
 ass-0.109
 tour-0.107
 stalk-0.103
Tokenhtml
Token ID6494
Feature activation-2.07e+00

Pos logit contributions
 advoc+1.436
 rul+1.424
 mathemat+1.393
 Citiz+1.308
 embr+1.291
Neg logit contributions
 distance-1.053
--1.048
 on-1.027
 your-1.003
 ass-0.996
Token https
Token ID3740
Feature activation-1.47e+00

Pos logit contributions
 advoc+3.146
 rul+3.095
 mathemat+3.013
 Citiz+2.965
 embr+2.892
Neg logit contributions
 distance-1.218
--1.181
 safely-1.143
 on-1.122
 tour-1.122
Token://
Token ID1378
Feature activation-1.00e+00

Pos logit contributions
 advoc+2.145
 rul+2.112
 Citiz+2.022
 mathemat+2.011
 embr+1.979
Neg logit contributions
 distance-0.803
 hunting-0.761
 ass-0.738
 tour-0.736
 safely-0.732
Tokenwww
Token ID2503
Feature activation-1.35e+00

Pos logit contributions
 advoc+1.409
 rul+1.369
 mathemat+1.336
 Citiz+1.283
 embr+1.273
Neg logit contributions
 distance-0.690
--0.668
 on-0.656
 by-0.650
 Oct-0.650
Token.
Token ID13
Feature activation-5.15e-01

Pos logit contributions
 advoc+2.414
 rul+2.394
 mathemat+2.288
 Citiz+2.288
 embr+2.256
Neg logit contributions
 distance-0.286
 hunting-0.241
 safely-0.226
 tour-0.222
 stalk-0.222
Tokenbel
Token ID6667
Feature activation-1.40e+00

Pos logit contributions
 advoc+0.624
 rul+0.599
 mathemat+0.591
 Citiz+0.569
 tyr+0.554
Neg logit contributions
 distance-0.469
--0.456
 on-0.455
 by-0.449
 tour-0.448
Token be
Token ID307
Feature activation-1.14e-01

Pos logit contributions
PI+0.118
-+0.108
123+0.100
 Oct+0.097
 on+0.097
Neg logit contributions
 advoc-0.416
 rul-0.412
 mathemat-0.399
 tyr-0.388
 conclud-0.388
Token very
Token ID845
Feature activation-2.57e-01

Pos logit contributions
 advoc+0.192
 rul+0.187
 mathemat+0.181
 Citiz+0.176
 tyr+0.172
Neg logit contributions
--0.059
PI-0.059
 distance-0.056
123-0.054
 Oct-0.052
Token relieved
Token ID22598
Feature activation-1.79e-02

Pos logit contributions
 advoc+0.380
 rul+0.365
 mathemat+0.355
 Citiz+0.345
 embr+0.335
Neg logit contributions
PI-0.185
--0.185
 distance-0.180
123-0.175
erm-0.175
Token.
Token ID13
Feature activation-5.87e-01

Pos logit contributions
 advoc+0.034
 rul+0.033
 mathemat+0.033
 Citiz+0.032
 conclud+0.031
Neg logit contributions
--0.005
PI-0.005
 distance-0.005
 tour-0.004
123-0.004
Token
Token ID198
Feature activation-1.98e+00

Pos logit contributions
 advoc+0.958
 rul+0.931
 Citiz+0.926
 mathemat+0.912
 conclud+0.863
Neg logit contributions
--0.304
 distance-0.295
 on-0.281
 safely-0.280
 ass-0.277
Token
Token ID198
Feature activation-2.03e+00

Pos logit contributions
 advoc+3.017
 rul+2.971
 Citiz+2.858
 mathemat+2.827
 embr+2.774
Neg logit contributions
 distance-0.943
 hunting-0.865
--0.853
 tour-0.845
 stalk-0.843
TokenThe
Token ID464
Feature activation-3.27e-01

Pos logit contributions
 advoc+3.025
 rul+2.849
 mathemat+2.826
 Citiz+2.810
 embr+2.616
Neg logit contributions
 distance-1.323
 hunting-1.225
 tour-1.218
 on-1.218
 auto-1.215
Token BBC
Token ID7823
Feature activation-6.30e-01

Pos logit contributions
 advoc+0.562
 rul+0.548
 mathemat+0.542
 Citiz+0.520
 tyr+0.500
Neg logit contributions
--0.166
PI-0.154
 on-0.151
 by-0.146
 distance-0.145
Token's
Token ID338
Feature activation-3.44e-01

Pos logit contributions
 advoc+1.156
 rul+1.130
 mathemat+1.120
 Citiz+1.092
 conclud+1.065
Neg logit contributions
--0.213
 distance-0.208
PI-0.205
123-0.192
 on-0.192
Token David
Token ID3271
Feature activation-2.12e-01

Pos logit contributions
 advoc+0.575
 rul+0.562
 mathemat+0.560
 Citiz+0.543
 tyr+0.518
Neg logit contributions
--0.184
PI-0.173
 distance-0.171
 on-0.169
 by-0.168
Token L
Token ID406
Feature activation-1.38e-01

Pos logit contributions
 advoc+0.322
 rul+0.310
 Citiz+0.308
 mathemat+0.302
 tyr+0.289
Neg logit contributions
 distance-0.131
 safely-0.128
PI-0.124
 stalk-0.122
 specifically-0.122
Token would
Token ID561
Feature activation+2.46e-01

Pos logit contributions
 advoc+0.362
 rul+0.358
 mathemat+0.350
 Citiz+0.335
 conclud+0.330
Neg logit contributions
--0.096
PI-0.092
 distance-0.084
123-0.082
 on-0.082
Token be
Token ID307
Feature activation-1.14e-01

Pos logit contributions
PI+0.118
-+0.108
123+0.100
 Oct+0.097
 on+0.097
Neg logit contributions
 advoc-0.416
 rul-0.412
 mathemat-0.399
 tyr-0.388
 conclud-0.388
Token very
Token ID845
Feature activation-2.57e-01

Pos logit contributions
 advoc+0.192
 rul+0.187
 mathemat+0.181
 Citiz+0.176
 tyr+0.172
Neg logit contributions
--0.059
PI-0.059
 distance-0.056
123-0.054
 Oct-0.052
Token relieved
Token ID22598
Feature activation-1.79e-02

Pos logit contributions
 advoc+0.380
 rul+0.365
 mathemat+0.355
 Citiz+0.345
 embr+0.335
Neg logit contributions
PI-0.185
--0.185
 distance-0.180
123-0.175
erm-0.175
Token.
Token ID13
Feature activation-5.87e-01

Pos logit contributions
 advoc+0.034
 rul+0.033
 mathemat+0.033
 Citiz+0.032
 conclud+0.031
Neg logit contributions
--0.005
PI-0.005
 distance-0.005
 tour-0.004
123-0.004
Token
Token ID198
Feature activation-1.98e+00

Pos logit contributions
 advoc+0.958
 rul+0.931
 Citiz+0.926
 mathemat+0.912
 conclud+0.863
Neg logit contributions
--0.304
 distance-0.295
 on-0.281
 safely-0.280
 ass-0.277
Token
Token ID198
Feature activation-2.03e+00

Pos logit contributions
 advoc+3.017
 rul+2.971
 Citiz+2.858
 mathemat+2.827
 embr+2.774
Neg logit contributions
 distance-0.943
 hunting-0.865
--0.853
 tour-0.845
 stalk-0.843
TokenThe
Token ID464
Feature activation-3.27e-01

Pos logit contributions
 advoc+3.025
 rul+2.849
 mathemat+2.826
 Citiz+2.810
 embr+2.616
Neg logit contributions
 distance-1.323
 hunting-1.225
 tour-1.218
 on-1.218
 auto-1.215
Token BBC
Token ID7823
Feature activation-6.30e-01

Pos logit contributions
 advoc+0.562
 rul+0.548
 mathemat+0.542
 Citiz+0.520
 tyr+0.500
Neg logit contributions
--0.166
PI-0.154
 on-0.151
 by-0.146
 distance-0.145
Token's
Token ID338
Feature activation-3.44e-01

Pos logit contributions
 advoc+1.156
 rul+1.130
 mathemat+1.120
 Citiz+1.092
 conclud+1.065
Neg logit contributions
--0.213
 distance-0.208
PI-0.205
123-0.192
 on-0.192
Token David
Token ID3271
Feature activation-2.12e-01

Pos logit contributions
 advoc+0.575
 rul+0.562
 mathemat+0.560
 Citiz+0.543
 tyr+0.518
Neg logit contributions
--0.184
PI-0.173
 distance-0.171
 on-0.169
 by-0.168
Token
Token ID141
Feature activation-5.44e-01

Pos logit contributions
 advoc+0.082
 rul+0.080
 mathemat+0.078
 Citiz+0.074
 conclud+0.071
Neg logit contributions
--0.059
 distance-0.058
 on-0.057
 by-0.056
 your-0.055
Token
Token ID227
Feature activation+7.33e-01

Pos logit contributions
 advoc+0.732
 rul+0.718
 mathemat+0.688
 Citiz+0.666
 tyr+0.637
Neg logit contributions
--0.474
 distance-0.461
 on-0.458
 by-0.446
PI-0.442
Token
Token ID12466
Feature activation-6.32e-01

Pos logit contributions
-+0.515
 distance+0.497
 on+0.489
 by+0.476
 your+0.473
Neg logit contributions
 advoc-1.122
 rul-1.096
 mathemat-1.073
 Citiz-1.034
 conclud-1.000
Token
Token ID110
Feature activation-2.44e-01

Pos logit contributions
 advoc+0.792
 rul+0.765
 mathemat+0.749
 Citiz+0.714
 conclud+0.683
Neg logit contributions
--0.626
 distance-0.605
 on-0.601
 by-0.592
 tour-0.580
Token
Token ID220
Feature activation+4.56e-01

Pos logit contributions
 advoc+0.332
 rul+0.322
 mathemat+0.314
 Citiz+0.301
 conclud+0.289
Neg logit contributions
--0.209
 distance-0.205
 on-0.202
 by-0.197
 tour-0.196
Tokenр
Token ID21169
Feature activation-1.98e+00

Pos logit contributions
-+0.424
 distance+0.416
 on+0.411
 by+0.404
 your+0.399
Neg logit contributions
 advoc-0.598
 rul-0.574
 mathemat-0.565
 Citiz-0.538
 conclud-0.515
Tokenа
Token ID16142
Feature activation-8.46e-01

Pos logit contributions
 advoc+2.403
 rul+2.308
 mathemat+2.248
 Citiz+2.136
 tyr+2.041
Neg logit contributions
 distance-1.901
--1.881
 on-1.866
 tour-1.836
 by-1.819
Token
Token ID140
Feature activation-1.21e+00

Pos logit contributions
 advoc+1.126
 rul+1.099
 mathemat+1.069
 Citiz+1.026
 conclud+0.994
Neg logit contributions
--0.731
 distance-0.726
 on-0.709
 tour-0.699
 your-0.694
Token
Token ID115
Feature activation+1.37e-02

Pos logit contributions
 advoc+1.608
 rul+1.555
 mathemat+1.520
 Citiz+1.443
 conclud+1.379
Neg logit contributions
--1.086
 distance-1.051
 on-1.044
 by-1.028
 solely-1.006
Tokenм
Token ID43108
Feature activation-5.71e-01

Pos logit contributions
-+0.014
 distance+0.014
 on+0.013
 by+0.013
 your+0.013
Neg logit contributions
 advoc-0.017
 rul-0.016
 mathemat-0.016
 Citiz-0.015
 tyr-0.014
Tokenе
Token ID16843
Feature activation-1.38e-01

Pos logit contributions
 advoc+0.714
 rul+0.694
 mathemat+0.680
 Citiz+0.648
 tyr+0.615
Neg logit contributions
--0.543
 distance-0.538
 on-0.527
 by-0.517
 your-0.516
Token their
Token ID511
Feature activation+2.42e-01

Pos logit contributions
 advoc+0.157
 rul+0.151
 mathemat+0.148
 Citiz+0.143
 tyr+0.139
Neg logit contributions
--0.044
PI-0.043
 distance-0.040
123-0.040
erm-0.040
Token products
Token ID3186
Feature activation+5.21e-01

Pos logit contributions
-+0.133
PI+0.126
 distance+0.125
 on+0.123
123+0.122
Neg logit contributions
 advoc-0.408
 rul-0.393
 mathemat-0.382
 Citiz-0.370
 tyr-0.359
Token.
Token ID13
Feature activation+2.92e-01

Pos logit contributions
-+0.090
 distance+0.085
PI+0.070
 your+0.069
 on+0.060
Neg logit contributions
 advoc-1.051
 rul-1.030
 mathemat-1.021
 Citiz-0.981
 conclud-0.971
Token
Token ID198
Feature activation-7.04e-02

Pos logit contributions
 distance+0.141
-+0.141
 safely+0.132
 on+0.130
 your+0.128
Neg logit contributions
 advoc-0.490
 rul-0.475
 Citiz-0.468
 mathemat-0.458
 embr-0.439
Token
Token ID198
Feature activation-1.29e-01

Pos logit contributions
 advoc+0.112
 rul+0.110
 mathemat+0.106
 Citiz+0.106
 embr+0.103
Neg logit contributions
 distance-0.032
--0.030
 hunting-0.029
 tour-0.028
 ass-0.028
TokenShares
Token ID43566
Feature activation-1.93e+00

Pos logit contributions
 advoc+0.198
 rul+0.186
 Citiz+0.183
 mathemat+0.183
 embr+0.173
Neg logit contributions
 distance-0.083
 tour-0.076
 on-0.075
 hunting-0.075
 by-0.074
Token of
Token ID286
Feature activation-8.17e-01

Pos logit contributions
 advoc+3.782
 rul+3.691
 mathemat+3.657
 Citiz+3.518
 conclud+3.502
Neg logit contributions
--0.400
 distance-0.370
PI-0.368
 tour-0.308
123-0.305
Token Alt
Token ID12344
Feature activation-4.02e-01

Pos logit contributions
 advoc+1.421
 rul+1.382
 mathemat+1.357
 Citiz+1.318
 tyr+1.288
Neg logit contributions
--0.381
 distance-0.350
 on-0.345
PI-0.343
 by-0.331
Tokenria
Token ID7496
Feature activation+7.03e-01

Pos logit contributions
 advoc+0.531
 rul+0.516
 mathemat+0.500
 Citiz+0.488
 embr+0.459
Neg logit contributions
--0.332
 distance-0.330
 on-0.325
 by-0.320
 ass-0.319
Token Group
Token ID4912
Feature activation-1.32e-01

Pos logit contributions
-+0.319
 distance+0.306
PI+0.288
 on+0.285
 your+0.283
Neg logit contributions
 advoc-1.242
 rul-1.204
 mathemat-1.196
 Citiz-1.158
 conclud-1.121
Token,
Token ID11
Feature activation+3.07e-01

Pos logit contributions
 advoc+0.238
 rul+0.231
 mathemat+0.228
 Citiz+0.221
 conclud+0.215
Neg logit contributions
--0.053
 distance-0.052
PI-0.047
 on-0.047
 tour-0.046
Token assists
Token ID13648
Feature activation-2.64e-01

Pos logit contributions
 advoc+0.711
 rul+0.697
 mathemat+0.682
 Citiz+0.661
 tyr+0.637
Neg logit contributions
 distance-0.225
--0.213
PI-0.209
 tour-0.204
 on-0.204
Token for
Token ID329
Feature activation-5.28e-01

Pos logit contributions
 advoc+0.516
 mathemat+0.502
 rul+0.501
 Citiz+0.474
 conclud+0.468
Neg logit contributions
PI-0.052
 distance-0.050
--0.049
123-0.042
 tour-0.041
Token the
Token ID262
Feature activation-1.56e-01

Pos logit contributions
 advoc+0.919
 rul+0.905
 mathemat+0.888
 Citiz+0.855
 tyr+0.833
Neg logit contributions
--0.242
PI-0.225
 on-0.218
 distance-0.216
 by-0.209
Token Jets
Token ID14728
Feature activation-8.73e-03

Pos logit contributions
 advoc+0.249
 rul+0.242
 mathemat+0.241
 Citiz+0.231
 tyr+0.222
Neg logit contributions
--0.095
 on-0.089
PI-0.088
 distance-0.087
 by-0.086
Token/
Token ID14
Feature activation-4.17e-01

Pos logit contributions
 advoc+0.017
 rul+0.017
 mathemat+0.017
 Citiz+0.016
 conclud+0.016
Neg logit contributions
 distance-0.002
--0.002
PI-0.002
123-0.001
erm-0.001
TokenTh
Token ID817
Feature activation-1.92e+00

Pos logit contributions
 advoc+0.647
 mathemat+0.619
 rul+0.617
 Citiz+0.600
 conclud+0.575
Neg logit contributions
 distance-0.268
--0.265
 on-0.257
 by-0.249
 your-0.248
Tokenras
Token ID8847
Feature activation-1.63e-01

Pos logit contributions
 rul+2.952
 advoc+2.949
 mathemat+2.874
 Citiz+2.793
 embr+2.660
Neg logit contributions
 distance-1.115
--1.105
 on-1.067
 by-1.031
 your-1.029
Tokenhers
Token ID7084
Feature activation-1.15e+00

Pos logit contributions
 advoc+0.258
 rul+0.249
 mathemat+0.248
 Citiz+0.239
 embr+0.229
Neg logit contributions
--0.095
 distance-0.094
 on-0.090
PI-0.089
 by-0.088
Token,
Token ID11
Feature activation-2.81e-01

Pos logit contributions
 advoc+2.197
 mathemat+2.153
 rul+2.145
 Citiz+2.064
 conclud+2.016
Neg logit contributions
--0.262
 distance-0.261
PI-0.232
123-0.226
ering-0.219
Token helped
Token ID4193
Feature activation+4.45e-01

Pos logit contributions
 advoc+0.486
 rul+0.471
 mathemat+0.469
 Citiz+0.444
 tyr+0.441
Neg logit contributions
--0.133
PI-0.127
 distance-0.124
123-0.117
 on-0.116
Token the
Token ID262
Feature activation+2.25e-01

Pos logit contributions
-+0.167
PI+0.150
 distance+0.141
 on+0.139
erm+0.136
Neg logit contributions
 advoc-0.817
 mathemat-0.800
 rul-0.794
 Citiz-0.763
 tyr-0.744
Token make
Token ID787
Feature activation-4.42e-02

Pos logit contributions
 advoc+0.584
 rul+0.568
 mathemat+0.552
 Citiz+0.532
 conclud+0.521
Neg logit contributions
--0.167
PI-0.163
 distance-0.154
 on-0.154
123-0.152
Token reproductive
Token ID18391
Feature activation+4.97e-01

Pos logit contributions
 advoc+0.079
 rul+0.076
 mathemat+0.074
 Citiz+0.073
 tyr+0.071
Neg logit contributions
--0.019
PI-0.019
 distance-0.018
123-0.017
 on-0.017
Token choices
Token ID7747
Feature activation-5.41e-01

Pos logit contributions
PI+0.223
-+0.216
 on+0.215
123+0.212
 distance+0.210
Neg logit contributions
 advoc-0.876
 rul-0.831
 mathemat-0.816
 Citiz-0.792
 tyr-0.781
Token.
Token ID13
Feature activation-4.10e-01

Pos logit contributions
 advoc+1.053
 rul+1.019
 mathemat+0.995
 Citiz+0.967
 tyr+0.955
Neg logit contributions
--0.138
PI-0.126
 distance-0.125
 tour-0.111
 auto-0.105
Token Why
Token ID4162
Feature activation+4.37e-01

Pos logit contributions
 advoc+0.682
 rul+0.670
 Citiz+0.656
 mathemat+0.645
 conclud+0.618
Neg logit contributions
--0.204
 distance-0.200
 on-0.187
 safely-0.186
 hunting-0.180
Token don
Token ID836
Feature activation-1.92e+00

Pos logit contributions
-+0.193
PI+0.179
 distance+0.177
 on+0.164
 by+0.163
Neg logit contributions
 advoc-0.779
 rul-0.772
 mathemat-0.747
 Citiz-0.723
 conclud-0.709
Token
Token ID447
Feature activation-7.41e-01

Pos logit contributions
 advoc+2.425
 rul+2.383
 Citiz+2.243
 mathemat+2.240
 embr+2.112
Neg logit contributions
 distance-1.567
--1.496
 on-1.461
 auto-1.457
 tour-1.457
Token
Token ID247
Feature activation+2.00e-01

Pos logit contributions
 advoc+0.969
 rul+0.953
 mathemat+0.900
 Citiz+0.870
 conclud+0.843
Neg logit contributions
--0.671
 distance-0.648
 on-0.641
 by-0.626
PI-0.620
Tokent
Token ID83
Feature activation+1.79e-01

Pos logit contributions
-+0.083
 distance+0.080
 on+0.078
 by+0.074
PI+0.071
Neg logit contributions
 advoc-0.359
 rul-0.351
 mathemat-0.340
 Citiz-0.336
 conclud-0.325
Token you
Token ID345
Feature activation+1.64e-01

Pos logit contributions
-+0.069
 distance+0.063
 on+0.062
PI+0.062
123+0.059
Neg logit contributions
 advoc-0.328
 rul-0.322
 mathemat-0.315
 Citiz-0.306
 tyr-0.300
Token trust
Token ID3774
Feature activation+4.07e-01

Pos logit contributions
-+0.073
PI+0.068
 on+0.067
 distance+0.065
123+0.064
Neg logit contributions
 advoc-0.289
 rul-0.282
 mathemat-0.279
 Citiz-0.266
 tyr-0.263
TokenT
Token ID51
Feature activation-5.50e-01

Pos logit contributions
 advoc+0.716
 rul+0.698
 mathemat+0.682
 Citiz+0.673
 tyr+0.638
Neg logit contributions
 distance-0.258
--0.256
 on-0.243
 tour-0.238
 ass-0.236
TokenNS
Token ID8035
Feature activation-4.89e-01

Pos logit contributions
 advoc+0.825
 rul+0.795
 mathemat+0.777
 Citiz+0.756
 tyr+0.732
Neg logit contributions
 distance-0.381
--0.365
 on-0.361
 your-0.357
 ass-0.356
Token via
Token ID2884
Feature activation-1.16e+00

Pos logit contributions
 advoc+0.897
 rul+0.878
 mathemat+0.871
 Citiz+0.845
 tyr+0.822
Neg logit contributions
 distance-0.160
--0.146
 your-0.139
 safely-0.137
 on-0.132
Token Getty
Token ID7136
Feature activation-7.86e-01

Pos logit contributions
 advoc+1.996
 rul+1.980
 mathemat+1.961
 Citiz+1.891
 tyr+1.807
Neg logit contributions
--0.510
 distance-0.484
 on-0.483
erm-0.469
PI-0.468
Token Images
Token ID5382
Feature activation-6.56e-01

Pos logit contributions
 advoc+1.202
 rul+1.180
 Citiz+1.155
 mathemat+1.144
 conclud+1.076
Neg logit contributions
--0.480
 distance-0.468
 on-0.462
 your-0.458
PI-0.447
Token)
Token ID8
Feature activation-1.91e+00

Pos logit contributions
 advoc+1.171
 rul+1.148
 mathemat+1.122
 Citiz+1.115
 conclud+1.062
Neg logit contributions
--0.239
 distance-0.233
 your-0.222
 on-0.221
PI-0.215
Token Indiana
Token ID10278
Feature activation-9.24e-01

Pos logit contributions
 advoc+3.056
 rul+2.952
 Citiz+2.886
 mathemat+2.874
 conclud+2.719
Neg logit contributions
--1.091
 distance-1.037
 on-0.996
 safely-0.986
PI-0.978
Token Gov
Token ID10964
Feature activation-1.79e+00

Pos logit contributions
 advoc+1.404
 rul+1.348
 mathemat+1.332
 Citiz+1.310
 conclud+1.251
Neg logit contributions
--0.604
 distance-0.589
 by-0.582
 on-0.580
123-0.571
Token.
Token ID13
Feature activation-7.51e-01

Pos logit contributions
 advoc+3.379
 rul+3.294
 mathemat+3.212
 Citiz+3.137
 conclud+3.104
Neg logit contributions
 distance-0.360
--0.306
 solely-0.303
 your-0.299
PI-0.299
Token Mike
Token ID4995
Feature activation-8.47e-01

Pos logit contributions
 advoc+1.179
 rul+1.135
 mathemat+1.095
 Citiz+1.086
 conclud+1.065
Neg logit contributions
 distance-0.398
PI-0.394
--0.381
mus-0.376
sk-0.375
Token Pence
Token ID17339
Feature activation-5.12e-01

Pos logit contributions
 advoc+1.052
 rul+1.005
 Citiz+0.960
 mathemat+0.937
 embr+0.897
Neg logit contributions
 distance-0.745
--0.719
 hunting-0.709
 clean-0.707
 stalk-0.706
Tokenо�
Token ID25443
Feature activation+1.15e-01

Pos logit contributions
-+0.326
 distance+0.321
 on+0.315
 by+0.310
 ass+0.308
Neg logit contributions
 advoc-0.396
 rul-0.386
 mathemat-0.374
 Citiz-0.357
 tyr-0.340
Token
Token ID110
Feature activation+3.94e-01

Pos logit contributions
-+0.118
 distance+0.114
 on+0.113
 by+0.112
 your+0.111
Neg logit contributions
 advoc-0.139
 rul-0.133
 mathemat-0.131
 Citiz-0.127
 conclud-0.120
Token
Token ID12466
Feature activation-5.23e-01

Pos logit contributions
-+0.242
 distance+0.234
 on+0.228
 by+0.223
 your+0.220
Neg logit contributions
 advoc-0.631
 rul-0.616
 mathemat-0.603
 Citiz-0.586
 conclud-0.564
Token
Token ID116
Feature activation-5.65e-01

Pos logit contributions
 advoc+0.666
 rul+0.641
 mathemat+0.629
 Citiz+0.600
 conclud+0.574
Neg logit contributions
--0.508
 distance-0.490
 on-0.488
 by-0.480
 your-0.471
Token
Token ID220
Feature activation-2.01e-01

Pos logit contributions
 advoc+0.764
 rul+0.741
 mathemat+0.716
 Citiz+0.691
 conclud+0.667
Neg logit contributions
--0.485
 distance-0.476
 on-0.471
 by-0.461
 safely-0.459
Tokenр
Token ID21169
Feature activation-1.89e+00

Pos logit contributions
 advoc+0.259
 rul+0.249
 mathemat+0.244
 Citiz+0.233
 conclud+0.222
Neg logit contributions
--0.190
 distance-0.187
 on-0.184
 by-0.182
 your-0.179
Tokenа
Token ID16142
Feature activation-7.04e-01

Pos logit contributions
 advoc+2.271
 rul+2.189
 mathemat+2.121
 Citiz+2.023
 conclud+1.931
Neg logit contributions
 distance-1.827
--1.818
 on-1.798
 tour-1.773
 by-1.753
Tokenс
Token ID21727
Feature activation-5.60e-01

Pos logit contributions
 advoc+0.940
 rul+0.917
 mathemat+0.892
 Citiz+0.855
 conclud+0.826
Neg logit contributions
--0.605
 distance-0.601
 on-0.589
 your-0.576
 tour-0.576
Tokenс
Token ID21727
Feature activation-2.05e-01

Pos logit contributions
 advoc+0.699
 rul+0.683
 mathemat+0.665
 Citiz+0.633
 conclud+0.609
Neg logit contributions
--0.532
 distance-0.523
 on-0.516
 your-0.505
 tour-0.504
Tokenл
Token ID30143
Feature activation-2.04e-01

Pos logit contributions
 advoc+0.241
 rul+0.234
 mathemat+0.228
 Citiz+0.216
 conclud+0.207
Neg logit contributions
--0.212
 distance-0.207
 on-0.205
 your-0.202
 by-0.201
Tokenе
Token ID16843
Feature activation+1.12e-01

Pos logit contributions
 advoc+0.250
 rul+0.240
 mathemat+0.234
 Citiz+0.225
 conclud+0.215
Neg logit contributions
--0.203
 distance-0.200
 on-0.197
 tour-0.193
PI-0.192
Token
Token ID198
Feature activation-3.17e-01

Pos logit contributions
 advoc+0.465
 rul+0.457
 mathemat+0.438
 Citiz+0.435
 embr+0.426
Neg logit contributions
 distance-0.128
--0.121
 hunting-0.115
 tour-0.114
PI-0.114
Tokenhttps
Token ID5450
Feature activation-8.74e-01

Pos logit contributions
 advoc+0.467
 rul+0.448
 mathemat+0.439
 Citiz+0.431
 tyr+0.402
Neg logit contributions
 distance-0.229
 on-0.216
--0.212
 by-0.212
 tour-0.211
Token://
Token ID1378
Feature activation-6.34e-01

Pos logit contributions
 advoc+1.325
 rul+1.296
 Citiz+1.244
 mathemat+1.236
 embr+1.201
Neg logit contributions
 distance-0.469
 hunting-0.434
--0.429
 tour-0.428
 safely-0.424
Tokenwww
Token ID2503
Feature activation-9.48e-01

Pos logit contributions
 advoc+0.997
 rul+0.962
 mathemat+0.938
 Citiz+0.914
 tyr+0.879
Neg logit contributions
 distance-0.403
--0.397
 on-0.388
 by-0.378
 your-0.373
Token.
Token ID13
Feature activation-4.36e-01

Pos logit contributions
 advoc+1.761
 rul+1.740
 mathemat+1.679
 Citiz+1.671
 embr+1.637
Neg logit contributions
 distance-0.177
 hunting-0.141
--0.139
 tour-0.131
 safely-0.130
Tokeneff
Token ID14822
Feature activation-1.86e+00

Pos logit contributions
 advoc+0.676
 rul+0.652
 mathemat+0.632
 Citiz+0.621
 tyr+0.594
Neg logit contributions
--0.288
 distance-0.285
 on-0.280
 by-0.272
 tour-0.267
Token.
Token ID13
Feature activation-1.12e-01

Pos logit contributions
 advoc+3.595
 rul+3.540
 mathemat+3.432
 Citiz+3.398
 embr+3.309
Neg logit contributions
 distance-0.293
--0.268
 on-0.223
 safely-0.212
 tour-0.208
Tokenorg
Token ID2398
Feature activation-6.63e-01

Pos logit contributions
 advoc+0.161
 rul+0.158
 mathemat+0.149
 Citiz+0.149
 embr+0.142
Neg logit contributions
 distance-0.078
--0.078
 on-0.075
 auto-0.074
 by-0.073
Token/
Token ID14
Feature activation-7.02e-01

Pos logit contributions
 advoc+1.140
 rul+1.115
 mathemat+1.078
 Citiz+1.072
 embr+1.040
Neg logit contributions
 distance-0.256
--0.239
 hunting-0.223
 on-0.223
 safely-0.222
Tokend
Token ID67
Feature activation-8.24e-01

Pos logit contributions
 advoc+1.138
 rul+1.101
 mathemat+1.057
 Citiz+1.046
 embr+0.995
Neg logit contributions
--0.411
 distance-0.410
 on-0.390
 by-0.383
 your-0.380
Tokenee
Token ID1453
Feature activation-4.44e-01

Pos logit contributions
 advoc+1.438
 rul+1.406
 mathemat+1.367
 Citiz+1.329
 embr+1.287
Neg logit contributions
 distance-0.372
--0.346
 on-0.341
 ass-0.327
 your-0.327
Token
Token ID228
Feature activation+4.47e-01

Pos logit contributions
 advoc+0.980
 rul+0.958
 mathemat+0.936
 Citiz+0.886
 tyr+0.862
Neg logit contributions
--0.671
 distance-0.654
 on-0.647
 by-0.628
PI-0.623
Tokenи
Token ID18849
Feature activation-1.64e-01

Pos logit contributions
-+0.491
 distance+0.481
 on+0.478
 by+0.469
 your+0.467
Neg logit contributions
 advoc-0.497
 rul-0.479
 mathemat-0.466
 Citiz-0.441
 conclud-0.424
Tokenя
Token ID40623
Feature activation-1.03e+00

Pos logit contributions
 advoc+0.194
 rul+0.188
 mathemat+0.183
 Citiz+0.175
 tyr+0.165
Neg logit contributions
--0.169
 distance-0.167
 on-0.164
 by-0.161
 tour-0.160
Token,
Token ID11
Feature activation-7.79e-01

Pos logit contributions
 advoc+1.655
 rul+1.618
 mathemat+1.581
 Citiz+1.529
 conclud+1.476
Neg logit contributions
--0.618
 distance-0.600
 on-0.580
PI-0.572
 tour-0.564
Token
Token ID220
Feature activation+1.82e-01

Pos logit contributions
 advoc+1.197
 rul+1.163
 mathemat+1.140
 Citiz+1.101
 tyr+1.059
Neg logit contributions
--0.556
 distance-0.524
 on-0.516
 by-0.501
PI-0.499
Token
Token ID141
Feature activation-1.86e+00

Pos logit contributions
-+0.172
 distance+0.169
 on+0.166
 by+0.165
 your+0.162
Neg logit contributions
 advoc-0.235
 rul-0.225
 mathemat-0.221
 Citiz-0.212
 conclud-0.202
Token
Token ID229
Feature activation-8.52e-02

Pos logit contributions
 advoc+2.439
 rul+2.377
 mathemat+2.324
 Citiz+2.204
 tyr+2.152
Neg logit contributions
--1.656
 distance-1.632
 on-1.594
 by-1.560
 solely-1.547
Tokenт
Token ID20375
Feature activation-6.98e-01

Pos logit contributions
 advoc+0.104
 rul+0.100
 mathemat+0.096
 Citiz+0.093
 conclud+0.089
Neg logit contributions
--0.085
 distance-0.084
 on-0.083
 by-0.082
 safely-0.081
Tokenо
Token ID15166
Feature activation-3.84e-01

Pos logit contributions
 advoc+0.748
 rul+0.720
 mathemat+0.698
 Citiz+0.665
 embr+0.620
Neg logit contributions
--0.787
 distance-0.786
 on-0.776
 by-0.764
 tour-0.760
Token
Token ID12466
Feature activation-6.39e-01

Pos logit contributions
 advoc+0.541
 rul+0.524
 mathemat+0.509
 Citiz+0.492
 tyr+0.471
Neg logit contributions
--0.310
 distance-0.305
 on-0.303
 by-0.296
 safely-0.291
Token
Token ID110
Feature activation-3.19e-01

Pos logit contributions
 advoc+0.812
 rul+0.786
 mathemat+0.770
 Citiz+0.735
 conclud+0.703
Neg logit contributions
--0.620
 distance-0.598
 on-0.596
 by-0.587
 your-0.576
Token 4
Token ID604
Feature activation-1.63e-01

Pos logit contributions
 advoc+0.105
 rul+0.103
 mathemat+0.101
 Citiz+0.097
 embr+0.095
Neg logit contributions
--0.021
PI-0.018
 distance-0.018
 on-0.017
erm-0.017
Token percent
Token ID1411
Feature activation+2.66e-01

Pos logit contributions
 advoc+0.283
 rul+0.277
 mathemat+0.270
 Citiz+0.263
 conclud+0.256
Neg logit contributions
 distance-0.067
--0.066
 tour-0.064
 your-0.064
 on-0.064
Token.
Token ID13
Feature activation+1.32e-03

Pos logit contributions
-+0.052
PI+0.049
 distance+0.046
123+0.041
 your+0.038
Neg logit contributions
 advoc-0.521
 rul-0.515
 mathemat-0.503
 Citiz-0.488
 conclud-0.483
Token
Token ID198
Feature activation+1.78e-01

Pos logit contributions
-+0.001
 distance+0.001
 on+0.001
 safely+0.001
 your+0.001
Neg logit contributions
 advoc-0.002
 rul-0.002
 Citiz-0.002
 mathemat-0.002
 tyr-0.002
Token
Token ID198
Feature activation+1.11e-01

Pos logit contributions
 distance+0.081
-+0.075
 hunting+0.073
 tour+0.072
 ass+0.071
Neg logit contributions
 advoc-0.285
 rul-0.280
 mathemat-0.270
 Citiz-0.268
 embr-0.261
TokenShares
Token ID43566
Feature activation-1.84e+00

Pos logit contributions
 distance+0.072
 on+0.067
 tour+0.067
 hunting+0.066
 by+0.065
Neg logit contributions
 advoc-0.170
 rul-0.161
 mathemat-0.159
 Citiz-0.158
 embr-0.149
Token of
Token ID286
Feature activation-7.46e-01

Pos logit contributions
 advoc+3.612
 rul+3.528
 mathemat+3.499
 Citiz+3.367
 conclud+3.354
Neg logit contributions
--0.383
 distance-0.341
PI-0.328
 tour-0.284
 your-0.279
Token Philip
Token ID14576
Feature activation+4.54e-01

Pos logit contributions
 advoc+1.294
 rul+1.261
 mathemat+1.242
 Citiz+1.215
 tyr+1.177
Neg logit contributions
--0.354
 on-0.323
 distance-0.319
PI-0.313
 by-0.310
Token Morris
Token ID14433
Feature activation+6.56e-01

Pos logit contributions
 distance+0.223
-+0.217
 on+0.207
 tour+0.201
 by+0.199
Neg logit contributions
 advoc-0.769
 rul-0.745
 mathemat-0.736
 Citiz-0.716
 tyr-0.692
Token International
Token ID4037
Feature activation+3.09e-01

Pos logit contributions
-+0.275
 distance+0.263
 on+0.248
PI+0.244
 your+0.243
Neg logit contributions
 advoc-1.175
 rul-1.149
 mathemat-1.144
 Citiz-1.098
 conclud-1.069
Token,
Token ID11
Feature activation+4.41e-01

Pos logit contributions
-+0.126
 distance+0.118
 on+0.112
 your+0.110
PI+0.109
Neg logit contributions
 advoc-0.555
 rul-0.544
 mathemat-0.537
 Citiz-0.519
 conclud-0.506
Token hardware
Token ID6890
Feature activation-4.49e-01

Pos logit contributions
 advoc+0.504
 rul+0.503
 mathemat+0.495
 Citiz+0.467
 embr+0.461
Neg logit contributions
--0.162
 distance-0.152
 on-0.149
PI-0.146
 by-0.145
Token is
Token ID318
Feature activation-6.60e-01

Pos logit contributions
 advoc+0.736
 rul+0.726
 mathemat+0.702
 Citiz+0.682
 embr+0.673
Neg logit contributions
--0.194
 distance-0.187
PI-0.176
erm-0.171
 on-0.170
Token almost
Token ID2048
Feature activation+3.58e-01

Pos logit contributions
 advoc+1.183
 rul+1.181
 mathemat+1.156
 Citiz+1.107
 conclud+1.087
Neg logit contributions
--0.242
 distance-0.221
 on-0.208
PI-0.202
 your-0.200
Token 4
Token ID604
Feature activation-2.81e-01

Pos logit contributions
-+0.165
 distance+0.154
 on+0.152
PI+0.146
 by+0.143
Neg logit contributions
 advoc-0.609
 rul-0.599
 mathemat-0.589
 Citiz-0.565
 conclud-0.550
Token years
Token ID812
Feature activation-4.92e-01

Pos logit contributions
 advoc+0.542
 rul+0.534
 mathemat+0.518
 Citiz+0.506
 embr+0.495
Neg logit contributions
--0.059
 distance-0.054
 on-0.052
PI-0.050
 by-0.046
Token old
Token ID1468
Feature activation-1.82e+00

Pos logit contributions
 advoc+0.792
 rul+0.778
 mathemat+0.754
 Citiz+0.719
 embr+0.710
Neg logit contributions
--0.263
PI-0.261
 distance-0.248
123-0.247
 on-0.243
Token and
Token ID290
Feature activation-3.76e-01

Pos logit contributions
 rul+3.351
 advoc+3.340
 mathemat+3.215
 Citiz+3.173
 embr+3.098
Neg logit contributions
--0.419
 distance-0.397
PI-0.365
erm-0.336
 tour-0.334
Token once
Token ID1752
Feature activation-1.17e+00

Pos logit contributions
 rul+0.682
 advoc+0.680
 mathemat+0.655
 Citiz+0.641
 conclud+0.632
Neg logit contributions
--0.128
erm-0.116
 distance-0.115
PI-0.114
 Oct-0.106
Token I
Token ID314
Feature activation+3.00e-02

Pos logit contributions
 advoc+1.915
 rul+1.910
 mathemat+1.844
 Citiz+1.796
 embr+1.752
Neg logit contributions
--0.484
 distance-0.472
PI-0.462
erm-0.441
 tour-0.427
Token turn
Token ID1210
Feature activation-6.62e-03

Pos logit contributions
-+0.012
 distance+0.011
 on+0.011
PI+0.010
 by+0.010
Neg logit contributions
 advoc-0.053
 rul-0.053
 mathemat-0.051
 Citiz-0.049
 conclud-0.049
Token AA
Token ID15923
Feature activation-4.99e-01

Pos logit contributions
 advoc+0.012
 rul+0.011
 mathemat+0.011
 Citiz+0.011
 embr+0.011
Neg logit contributions
--0.003
 distance-0.003
PI-0.002
 on-0.002
 by-0.002
Token as
Token ID355
Feature activation-1.91e-01

Pos logit contributions
 advoc+0.388
 rul+0.378
 mathemat+0.373
 Citiz+0.365
 conclud+0.353
Neg logit contributions
--0.064
 distance-0.056
PI-0.053
 on-0.050
 tour-0.050
Token the
Token ID262
Feature activation-3.42e-01

Pos logit contributions
 advoc+0.351
 rul+0.343
 mathemat+0.338
 Citiz+0.330
 conclud+0.318
Neg logit contributions
--0.073
 distance-0.067
PI-0.064
 on-0.062
123-0.061
Token Caps
Token ID23534
Feature activation+2.45e-01

Pos logit contributions
 advoc+0.553
 rul+0.537
 mathemat+0.529
 Citiz+0.516
 conclud+0.493
Neg logit contributions
--0.207
 distance-0.190
 on-0.190
PI-0.189
 by-0.186
Token out
Token ID503
Feature activation-8.81e-01

Pos logit contributions
-+0.119
 distance+0.109
PI+0.107
 on+0.107
 your+0.104
Neg logit contributions
 advoc-0.422
 rul-0.418
 mathemat-0.412
 Citiz-0.396
 conclud-0.384
Token-
Token ID12
Feature activation-2.93e-01

Pos logit contributions
 advoc+1.514
 rul+1.500
 mathemat+1.469
 Citiz+1.420
 conclud+1.365
Neg logit contributions
 distance-0.372
 tour-0.365
 on-0.364
 your-0.361
--0.360
Tokenshot
Token ID9442
Feature activation-1.81e+00

Pos logit contributions
 advoc+0.497
 rul+0.483
 mathemat+0.481
 Citiz+0.462
 tyr+0.442
Neg logit contributions
 distance-0.139
--0.135
 on-0.134
 your-0.130
 tour-0.130
Token the
Token ID262
Feature activation-6.84e-02

Pos logit contributions
 advoc+3.476
 rul+3.419
 mathemat+3.367
 Citiz+3.300
 conclud+3.180
Neg logit contributions
--0.390
 distance-0.375
PI-0.351
 tour-0.337
123-0.332
Token Blues
Token ID15261
Feature activation+1.78e-01

Pos logit contributions
 advoc+0.104
 rul+0.101
 mathemat+0.100
 Citiz+0.097
 tyr+0.092
Neg logit contributions
--0.048
 on-0.045
 distance-0.045
PI-0.044
 by-0.044
Token 10
Token ID838
Feature activation+1.13e-01

Pos logit contributions
-+0.060
 distance+0.057
PI+0.055
 tour+0.050
 your+0.050
Neg logit contributions
 advoc-0.331
 rul-0.325
 mathemat-0.319
 Citiz-0.307
 tyr-0.300
Token-
Token ID12
Feature activation-2.03e-02

Pos logit contributions
 distance+0.005
PI+0.003
 your+0.002
 on+0.002
 tour+0.001
Neg logit contributions
 advoc-0.242
 rul-0.237
 mathemat-0.232
 Citiz-0.230
 conclud-0.222
Token8
Token ID23
Feature activation+3.15e-01

Pos logit contributions
 advoc+0.034
 rul+0.033
 mathemat+0.032
 Citiz+0.032
 embr+0.030
Neg logit contributions
 distance-0.010
--0.009
 tour-0.009
 your-0.009
 address-0.009
Token increasing
Token ID3649
Feature activation-2.99e-01

Pos logit contributions
 advoc+0.094
 rul+0.091
 mathemat+0.090
 Citiz+0.086
 conclud+0.084
Neg logit contributions
--0.022
PI-0.021
 distance-0.020
 on-0.019
 your-0.018
Token claims
Token ID3667
Feature activation+5.06e-02

Pos logit contributions
 advoc+0.510
 rul+0.487
 mathemat+0.484
 Citiz+0.456
 embr+0.448
Neg logit contributions
--0.156
PI-0.153
 on-0.147
 distance-0.143
 by-0.143
Token of
Token ID286
Feature activation-4.06e-01

Pos logit contributions
-+0.010
PI+0.009
 distance+0.009
erm+0.007
 tour+0.007
Neg logit contributions
 advoc-0.101
 rul-0.098
 mathemat-0.097
 Citiz-0.094
 tyr-0.092
Token fraud
Token ID7394
Feature activation-3.53e-02

Pos logit contributions
 advoc+0.682
 mathemat+0.658
 rul+0.657
 Citiz+0.622
 embr+0.611
Neg logit contributions
--0.208
PI-0.206
 on-0.191
 distance-0.190
erm-0.189
Token.
Token ID13
Feature activation-5.49e-01

Pos logit contributions
 advoc+0.070
 rul+0.069
 mathemat+0.068
 Citiz+0.066
 conclud+0.065
Neg logit contributions
 distance-0.007
--0.007
PI-0.006
 your-0.005
 tour-0.005
Token
Token ID198
Feature activation-1.81e+00

Pos logit contributions
 advoc+0.930
 rul+0.903
 Citiz+0.900
 mathemat+0.887
 conclud+0.843
Neg logit contributions
--0.246
 distance-0.244
 safely-0.227
 on-0.224
 ass-0.223
Token
Token ID198
Feature activation-1.62e+00

Pos logit contributions
 advoc+2.688
 rul+2.655
 Citiz+2.552
 mathemat+2.514
 embr+2.488
Neg logit contributions
 distance-0.881
 hunting-0.815
 stalk-0.796
 tour-0.795
 safely-0.785
TokenFull
Token ID13295
Feature activation+2.11e-01

Pos logit contributions
 advoc+2.415
 rul+2.274
 mathemat+2.258
 Citiz+2.252
 embr+2.085
Neg logit contributions
 distance-1.064
 on-0.979
 hunting-0.977
 tour-0.976
 auto-0.971
Token preliminary
Token ID15223
Feature activation-2.69e-01

Pos logit contributions
-+0.089
 distance+0.087
PI+0.085
 on+0.085
 by+0.083
Neg logit contributions
 advoc-0.372
 rul-0.366
 mathemat-0.359
 Citiz-0.348
 conclud-0.335
Token results
Token ID2482
Feature activation-6.05e-01

Pos logit contributions
 advoc+0.462
 rul+0.451
 mathemat+0.446
 Citiz+0.422
 tyr+0.411
Neg logit contributions
--0.134
PI-0.130
 distance-0.123
 on-0.123
 specifically-0.120
Token were
Token ID547
Feature activation-6.69e-01

Pos logit contributions
 advoc+1.094
 rul+1.073
 mathemat+1.062
 Citiz+1.006
 conclud+1.000
Neg logit contributions
--0.222
PI-0.215
 distance-0.209
erm-0.189
 your-0.188
Token
Token ID115
Feature activation-1.26e+00

Pos logit contributions
 advoc+0.997
 rul+0.970
 mathemat+0.956
 Citiz+0.905
 tyr+0.877
Neg logit contributions
--0.664
 distance-0.646
 on-0.640
 by-0.620
 your-0.615
Tokenк
Token ID31583
Feature activation+6.88e-01

Pos logit contributions
 advoc+2.344
 rul+2.285
 mathemat+2.269
 Citiz+2.202
 conclud+2.138
Neg logit contributions
--0.498
 on-0.449
 distance-0.427
 by-0.423
 your-0.404
Tokenи
Token ID18849
Feature activation-3.88e-01

Pos logit contributions
-+0.588
 distance+0.574
 on+0.566
 by+0.554
 tour+0.549
Neg logit contributions
 advoc-0.948
 rul-0.922
 mathemat-0.901
 Citiz-0.866
 conclud-0.833
Tokenм
Token ID43108
Feature activation-1.21e+00

Pos logit contributions
 advoc+0.540
 rul+0.531
 mathemat+0.518
 Citiz+0.495
 conclud+0.478
Neg logit contributions
--0.312
 distance-0.308
 on-0.301
 tour-0.295
 your-0.294
Tokenи
Token ID18849
Feature activation-1.99e-01

Pos logit contributions
 advoc+1.824
 rul+1.783
 mathemat+1.746
 Citiz+1.680
 conclud+1.619
Neg logit contributions
--0.826
 distance-0.820
 on-0.792
 by-0.769
 tour-0.768
Token
Token ID12466
Feature activation-1.81e+00

Pos logit contributions
 advoc+0.289
 rul+0.283
 mathemat+0.277
 Citiz+0.266
 conclud+0.256
Neg logit contributions
--0.151
 distance-0.148
 on-0.144
 by-0.140
 tour-0.140
Token
Token ID116
Feature activation-9.98e-01

Pos logit contributions
 advoc+2.310
 rul+2.243
 mathemat+2.191
 Citiz+2.081
 tyr+2.001
Neg logit contributions
--1.704
 distance-1.665
 on-1.638
 by-1.613
 tour-1.593
Token
Token ID12466
Feature activation-1.69e+00

Pos logit contributions
 advoc+1.337
 rul+1.295
 mathemat+1.258
 Citiz+1.204
 conclud+1.161
Neg logit contributions
--0.865
 distance-0.854
 on-0.844
 by-0.824
 tour-0.822
Token
Token ID112
Feature activation-6.39e-01

Pos logit contributions
 advoc+2.186
 rul+2.122
 mathemat+2.072
 Citiz+1.968
 tyr+1.893
Neg logit contributions
--1.571
 distance-1.530
 on-1.513
 by-1.489
 tour-1.472
Tokenр
Token ID21169
Feature activation-1.52e+00

Pos logit contributions
 advoc+0.774
 rul+0.749
 mathemat+0.724
 Citiz+0.693
 conclud+0.664
Neg logit contributions
--0.631
 distance-0.628
 on-0.615
 tour-0.603
 by-0.603
Tokenу
Token ID35072
Feature activation-4.55e-01

Pos logit contributions
 advoc+1.924
 rul+1.854
 mathemat+1.817
 Citiz+1.724
 tyr+1.645
Neg logit contributions
 distance-1.411
--1.403
 on-1.386
 tour-1.365
 solely-1.347
Token via
Token ID2884
Feature activation-1.16e+00

Pos logit contributions
 advoc+0.897
 rul+0.878
 mathemat+0.871
 Citiz+0.845
 tyr+0.822
Neg logit contributions
 distance-0.160
--0.146
 your-0.139
 safely-0.137
 on-0.132
Token Getty
Token ID7136
Feature activation-7.86e-01

Pos logit contributions
 advoc+1.996
 rul+1.980
 mathemat+1.961
 Citiz+1.891
 tyr+1.807
Neg logit contributions
--0.510
 distance-0.484
 on-0.483
erm-0.469
PI-0.468
Token Images
Token ID5382
Feature activation-6.56e-01

Pos logit contributions
 advoc+1.202
 rul+1.180
 Citiz+1.155
 mathemat+1.144
 conclud+1.076
Neg logit contributions
--0.480
 distance-0.468
 on-0.462
 your-0.458
PI-0.447
Token)
Token ID8
Feature activation-1.91e+00

Pos logit contributions
 advoc+1.171
 rul+1.148
 mathemat+1.122
 Citiz+1.115
 conclud+1.062
Neg logit contributions
--0.239
 distance-0.233
 your-0.222
 on-0.221
PI-0.215
Token Indiana
Token ID10278
Feature activation-9.24e-01

Pos logit contributions
 advoc+3.056
 rul+2.952
 Citiz+2.886
 mathemat+2.874
 conclud+2.719
Neg logit contributions
--1.091
 distance-1.037
 on-0.996
 safely-0.986
PI-0.978
Token Gov
Token ID10964
Feature activation-1.79e+00

Pos logit contributions
 advoc+1.404
 rul+1.348
 mathemat+1.332
 Citiz+1.310
 conclud+1.251
Neg logit contributions
--0.604
 distance-0.589
 by-0.582
 on-0.580
123-0.571
Token.
Token ID13
Feature activation-7.51e-01

Pos logit contributions
 advoc+3.379
 rul+3.294
 mathemat+3.212
 Citiz+3.137
 conclud+3.104
Neg logit contributions
 distance-0.360
--0.306
 solely-0.303
 your-0.299
PI-0.299
Token Mike
Token ID4995
Feature activation-8.47e-01

Pos logit contributions
 advoc+1.179
 rul+1.135
 mathemat+1.095
 Citiz+1.086
 conclud+1.065
Neg logit contributions
 distance-0.398
PI-0.394
--0.381
mus-0.376
sk-0.375
Token Pence
Token ID17339
Feature activation-5.12e-01

Pos logit contributions
 advoc+1.052
 rul+1.005
 Citiz+0.960
 mathemat+0.937
 embr+0.897
Neg logit contributions
 distance-0.745
--0.719
 hunting-0.709
 clean-0.707
 stalk-0.706
Token Pence
Token ID17339
Feature activation-5.99e-01

Pos logit contributions
 advoc+0.895
 rul+0.867
 mathemat+0.855
 Citiz+0.832
 tyr+0.802
Neg logit contributions
--0.236
 distance-0.223
PI-0.212
 your-0.212
 on-0.209
Token is
Token ID318
Feature activation-4.86e-01

Pos logit contributions
 advoc+1.072
 rul+1.045
 mathemat+1.016
 Citiz+0.988
 conclud+0.964
Neg logit contributions
--0.253
 distance-0.237
 your-0.224
 by-0.221
 on-0.217
Token history
Token ID2106
Feature activation-3.06e-01

Pos logit contributions
-+0.040
PI+0.036
 distance+0.035
123+0.033
 on+0.032
Neg logit contributions
 advoc-0.261
 rul-0.253
 mathemat-0.250
 Citiz-0.242
 conclud-0.237
Token.
Token ID13
Feature activation+3.13e-01

Pos logit contributions
 advoc+0.608
 rul+0.592
 mathemat+0.584
 Citiz+0.564
 conclud+0.561
Neg logit contributions
 distance-0.051
--0.051
PI-0.048
 your-0.039
123-0.038
Token<|endoftext|>
Token ID50256
Feature activation+7.22e-02

Pos logit contributions
-+0.146
 distance+0.141
PI+0.130
 safely+0.129
 ass+0.129
Neg logit contributions
 advoc-0.534
 rul-0.519
 Citiz-0.511
 mathemat-0.505
 conclud-0.479
TokenAbout
Token ID8585
Feature activation+1.14e-01

Pos logit contributions
 distance+0.044
-+0.043
 on+0.043
 by+0.041
 your+0.041
Neg logit contributions
 advoc-0.115
 rul-0.110
 mathemat-0.110
 Citiz-0.106
 tyr-0.101
Token This
Token ID770
Feature activation-9.56e-01

Pos logit contributions
-+0.043
 distance+0.039
 on+0.038
PI+0.037
 by+0.037
Neg logit contributions
 advoc-0.209
 rul-0.204
 mathemat-0.201
 Citiz-0.198
 tyr-0.190
Token Game
Token ID3776
Feature activation-1.78e+00

Pos logit contributions
 advoc+1.627
 rul+1.573
 mathemat+1.561
 Citiz+1.530
 tyr+1.454
Neg logit contributions
--0.481
 on-0.456
 distance-0.452
 by-0.444
 ass-0.432
Token
Token ID198
Feature activation-6.83e-01

Pos logit contributions
 advoc+3.112
 rul+3.040
 mathemat+2.985
 Citiz+2.936
 tyr+2.819
Neg logit contributions
--0.766
 distance-0.732
 on-0.711
 ass-0.685
 by-0.675
Token
Token ID198
Feature activation-7.99e-01

Pos logit contributions
 advoc+1.117
 rul+1.096
 mathemat+1.053
 Citiz+1.048
 embr+1.020
Neg logit contributions
 distance-0.296
--0.284
 on-0.267
 hunting-0.266
 ass-0.262
TokenChoose
Token ID31851
Feature activation+4.61e-02

Pos logit contributions
 advoc+1.232
 rul+1.176
 mathemat+1.164
 Citiz+1.130
 tyr+1.074
Neg logit contributions
 distance-0.514
 on-0.494
 by-0.485
 ass-0.482
--0.480
Token from
Token ID422
Feature activation-4.74e-01

Pos logit contributions
-+0.009
 distance+0.008
 on+0.007
PI+0.007
 ass+0.007
Neg logit contributions
 advoc-0.093
 rul-0.091
 mathemat-0.089
 Citiz-0.087
 tyr-0.084
Token one
Token ID530
Feature activation-2.19e-01

Pos logit contributions
 advoc+0.906
 rul+0.887
 mathemat+0.871
 Citiz+0.844
 conclud+0.817
Neg logit contributions
--0.136
 distance-0.119
PI-0.118
 on-0.116
erm-0.112
Token thread
Token ID4704
Feature activation+4.92e-02

Pos logit contributions
 advoc+0.238
 rul+0.234
 mathemat+0.232
 Citiz+0.221
 conclud+0.216
Neg logit contributions
--0.054
PI-0.049
 on-0.047
 distance-0.046
123-0.046
Token before
Token ID878
Feature activation-2.14e-01

Pos logit contributions
-+0.010
 distance+0.009
PI+0.009
 tour+0.009
 Oct+0.008
Neg logit contributions
 advoc-0.096
 rul-0.096
 mathemat-0.093
 Citiz-0.090
 conclud-0.089
Token this
Token ID428
Feature activation+8.14e-02

Pos logit contributions
 advoc+0.374
 rul+0.368
 mathemat+0.364
 conclud+0.340
 Citiz+0.340
Neg logit contributions
--0.092
PI-0.090
 distance-0.087
123-0.087
 on-0.083
Token lesson
Token ID11483
Feature activation+2.30e-01

Pos logit contributions
-+0.032
PI+0.031
 distance+0.028
123+0.028
 on+0.027
Neg logit contributions
 advoc-0.146
 rul-0.144
 mathemat-0.143
 conclud-0.136
 Citiz-0.135
Token.
Token ID13
Feature activation-2.23e-01

Pos logit contributions
-+0.052
PI+0.048
 distance+0.046
123+0.040
 ass+0.040
Neg logit contributions
 advoc-0.446
 rul-0.439
 mathemat-0.439
 conclud-0.421
 Citiz-0.416
Token Show
Token ID5438
Feature activation+1.16e+00

Pos logit contributions
 advoc+0.364
 rul+0.359
 mathemat+0.353
 Citiz+0.343
 conclud+0.330
Neg logit contributions
--0.119
 distance-0.114
 on-0.107
 ass-0.107
PI-0.107
Token your
Token ID534
Feature activation-5.03e-01

Pos logit contributions
-+0.291
 distance+0.252
PI+0.240
 on+0.239
 by+0.223
Neg logit contributions
 advoc-2.260
 rul-2.228
 mathemat-2.213
 Citiz-2.111
 conclud-2.089
Token child
Token ID1200
Feature activation-7.52e-01

Pos logit contributions
 advoc+0.870
 rul+0.861
 mathemat+0.849
 Citiz+0.800
 conclud+0.788
Neg logit contributions
--0.241
 on-0.220
 distance-0.219
PI-0.218
123-0.212
Token how
Token ID703
Feature activation+3.52e-01

Pos logit contributions
 advoc+1.401
 rul+1.388
 mathemat+1.375
 Citiz+1.300
 conclud+1.295
Neg logit contributions
--0.228
PI-0.208
 distance-0.208
123-0.187
 ass-0.186
Token to
Token ID284
Feature activation+7.11e-01

Pos logit contributions
-+0.068
 distance+0.060
 on+0.056
PI+0.051
 tour+0.047
Neg logit contributions
 advoc-0.707
 rul-0.699
 mathemat-0.694
 Citiz-0.661
 conclud-0.656
Token insert
Token ID7550
Feature activation+2.49e-03

Pos logit contributions
-+0.345
PI+0.331
 on+0.314
123+0.307
 distance+0.307
Neg logit contributions
 advoc-1.204
 rul-1.192
 mathemat-1.184
 conclud-1.117
 tyr-1.104
Token
Token ID198
Feature activation-3.26e-01

Pos logit contributions
-+0.029
 distance+0.029
 on+0.027
 your+0.026
 ass+0.026
Neg logit contributions
 advoc-0.102
 rul-0.099
 mathemat-0.097
 Citiz-0.097
 tyr-0.092
Token
Token ID198
Feature activation-2.29e-01

Pos logit contributions
 advoc+0.524
 rul+0.515
 mathemat+0.494
 Citiz+0.494
 embr+0.481
Neg logit contributions
 distance-0.145
--0.137
 hunting-0.131
 tour-0.130
 ass-0.129
TokenNewsletter
Token ID33031
Feature activation+4.35e-01

Pos logit contributions
 advoc+0.346
 rul+0.331
 mathemat+0.329
 Citiz+0.326
 tyr+0.304
Neg logit contributions
 distance-0.149
--0.140
 on-0.139
 your-0.137
 ass-0.136
Token Sign
Token ID5865
Feature activation+3.03e-01

Pos logit contributions
-+0.213
 distance+0.206
 on+0.200
PI+0.193
 by+0.190
Neg logit contributions
 advoc-0.741
 rul-0.716
 mathemat-0.710
 Citiz-0.694
 embr-0.666
Token Up
Token ID3205
Feature activation+5.23e-01

Pos logit contributions
 distance+0.147
-+0.137
 on+0.135
 safely+0.134
 hunting+0.131
Neg logit contributions
 advoc-0.507
 rul-0.492
 mathemat-0.483
 Citiz-0.482
 embr-0.454
Token Continue
Token ID10054
Feature activation+1.58e+00

Pos logit contributions
-+0.311
 distance+0.309
 on+0.291
PI+0.284
 ass+0.282
Neg logit contributions
 advoc-0.820
 rul-0.813
 mathemat-0.795
 Citiz-0.772
 tyr-0.734
Token reading
Token ID3555
Feature activation-3.61e-01

Pos logit contributions
-+0.596
 distance+0.523
 on+0.514
PI+0.504
 by+0.478
Neg logit contributions
 advoc-2.927
 rul-2.873
 mathemat-2.826
 Citiz-2.738
 conclud-2.650
Token the
Token ID262
Feature activation-4.43e-01

Pos logit contributions
 advoc+0.696
 rul+0.690
 mathemat+0.673
 Citiz+0.659
 embr+0.649
Neg logit contributions
--0.061
 distance-0.055
PI-0.051
 on-0.043
erm-0.042
Token main
Token ID1388
Feature activation+7.45e-01

Pos logit contributions
 advoc+0.734
 rul+0.717
 mathemat+0.703
 Citiz+0.677
 embr+0.658
Neg logit contributions
--0.226
 distance-0.217
PI-0.216
 on-0.207
 by-0.200
Token story
Token ID1621
Feature activation+8.74e-01

Pos logit contributions
-+0.461
 distance+0.450
PI+0.447
 on+0.437
 by+0.428
Neg logit contributions
 advoc-1.143
 rul-1.119
 mathemat-1.082
 Citiz-1.042
 embr-1.018
Token Please
Token ID4222
Feature activation+1.34e+00

Pos logit contributions
-+0.550
 distance+0.531
 on+0.511
PI+0.507
 by+0.496
Neg logit contributions
 advoc-1.394
 rul-1.336
 mathemat-1.326
 Citiz-1.289
 embr-1.220
Token
Token ID247
Feature activation+3.92e-01

Pos logit contributions
-+0.373
 distance+0.359
 on+0.355
 by+0.347
 your+0.341
Neg logit contributions
 advoc-0.586
 rul-0.575
 mathemat-0.551
 Citiz-0.534
 conclud-0.510
Tokens
Token ID82
Feature activation+1.26e-01

Pos logit contributions
-+0.142
 distance+0.139
 on+0.127
 by+0.122
 ass+0.121
Neg logit contributions
 advoc-0.725
 rul-0.713
 mathemat-0.695
 Citiz-0.684
 conclud-0.658
Token new
Token ID649
Feature activation+9.73e-02

Pos logit contributions
-+0.054
 distance+0.049
PI+0.049
 on+0.048
 by+0.047
Neg logit contributions
 advoc-0.227
 rul-0.225
 mathemat-0.219
 Citiz-0.214
 tyr-0.204
Token domain
Token ID7386
Feature activation+2.31e-01

Pos logit contributions
-+0.036
 on+0.033
 distance+0.033
PI+0.032
 by+0.032
Neg logit contributions
 advoc-0.179
 rul-0.178
 mathemat-0.173
 Citiz-0.170
 tyr-0.163
Token buying
Token ID7067
Feature activation+5.93e-01

Pos logit contributions
-+0.123
 distance+0.123
PI+0.118
 on+0.118
 Oct+0.115
Neg logit contributions
 advoc-0.383
 rul-0.379
 mathemat-0.372
 Citiz-0.359
 tyr-0.343
Token service
Token ID2139
Feature activation+1.14e+00

Pos logit contributions
-+0.257
 distance+0.236
PI+0.235
 on+0.234
 by+0.227
Neg logit contributions
 advoc-1.048
 rul-1.035
 mathemat-1.016
 Citiz-0.980
 tyr-0.951
Token Google
Token ID3012
Feature activation+9.76e-01

Pos logit contributions
-+0.244
 distance+0.221
PI+0.185
 your+0.172
 tour+0.169
Neg logit contributions
 advoc-2.292
 rul-2.268
 mathemat-2.224
 Citiz-2.192
 conclud-2.095
Token Dom
Token ID9666
Feature activation+2.87e-02

Pos logit contributions
-+0.449
 distance+0.447
 on+0.408
 by+0.400
PI+0.400
Neg logit contributions
 advoc-1.721
 rul-1.690
 mathemat-1.661
 Citiz-1.637
 conclud-1.547
Tokenains
Token ID1299
Feature activation+5.85e-01

Pos logit contributions
-+0.016
 distance+0.016
 on+0.014
 your+0.014
PI+0.014
Neg logit contributions
 advoc-0.047
 rul-0.047
 mathemat-0.046
 Citiz-0.046
 embr-0.043
Token at
Token ID379
Feature activation+3.63e-01

Pos logit contributions
 distance+0.138
-+0.137
PI+0.107
 your+0.106
 tour+0.103
Neg logit contributions
 advoc-1.156
 rul-1.136
 mathemat-1.119
 Citiz-1.109
 conclud-1.050
Token 1
Token ID352
Feature activation+2.27e-01

Pos logit contributions
-+0.086
 on+0.070
PI+0.068
 distance+0.067
 by+0.064
Neg logit contributions
 advoc-0.719
 rul-0.708
 mathemat-0.702
 Citiz-0.688
 conclud-0.661
Token two
Token ID734
Feature activation+8.50e-02

Pos logit contributions
 advoc+0.718
 rul+0.704
 mathemat+0.689
 Citiz+0.676
 conclud+0.662
Neg logit contributions
--0.100
PI-0.091
 distance-0.087
123-0.078
erm-0.076
Token years
Token ID812
Feature activation+2.60e-01

Pos logit contributions
PI+0.023
-+0.023
 on+0.022
 distance+0.022
 your+0.020
Neg logit contributions
 advoc-0.163
 rul-0.159
 mathemat-0.154
 Citiz-0.150
 conclud-0.147
Token away
Token ID1497
Feature activation-3.45e-01

Pos logit contributions
PI+0.104
-+0.100
123+0.096
erm+0.091
 distance+0.089
Neg logit contributions
 advoc-0.466
 rul-0.452
 mathemat-0.443
 Citiz-0.433
 embr-0.422
Token,
Token ID11
Feature activation+5.87e-01

Pos logit contributions
 advoc+0.674
 rul+0.666
 mathemat+0.653
 conclud+0.631
 Citiz+0.630
Neg logit contributions
--0.068
PI-0.065
 distance-0.061
erm-0.052
 ass-0.051
Token it
Token ID340
Feature activation+2.52e-01

Pos logit contributions
-+0.212
PI+0.180
 distance+0.175
 on+0.169
 tour+0.160
Neg logit contributions
 advoc-1.094
 rul-1.076
 mathemat-1.060
 Citiz-1.031
 conclud-1.010
Token seems
Token ID2331
Feature activation+1.35e+00

Pos logit contributions
-+0.106
PI+0.099
 distance+0.098
 on+0.095
123+0.091
Neg logit contributions
 advoc-0.445
 rul-0.441
 mathemat-0.431
 Citiz-0.414
 conclud-0.409
Token unlikely
Token ID7485
Feature activation+1.05e+00

Pos logit contributions
-+0.367
 distance+0.319
 on+0.299
PI+0.289
 by+0.276
Neg logit contributions
 advoc-2.650
 rul-2.589
 mathemat-2.550
 Citiz-2.478
 conclud-2.442
Token that
Token ID326
Feature activation+4.30e-01

Pos logit contributions
-+0.225
 distance+0.191
PI+0.175
 on+0.170
 ass+0.156
Neg logit contributions
 advoc-2.095
 rul-2.047
 mathemat-1.998
 Citiz-1.964
 conclud-1.937
Token "
Token ID366
Feature activation+3.00e-02

Pos logit contributions
-+0.178
 distance+0.159
PI+0.159
 on+0.156
 ass+0.148
Neg logit contributions
 advoc-0.779
 rul-0.759
 mathemat-0.744
 Citiz-0.725
 conclud-0.711
TokenKing
Token ID15708
Feature activation+3.45e-01

Pos logit contributions
-+0.022
 distance+0.022
 on+0.021
 by+0.021
 ass+0.020
Neg logit contributions
 advoc-0.044
 rul-0.042
 mathemat-0.041
 Citiz-0.040
 conclud-0.039
Tokendom
Token ID3438
Feature activation-2.10e-01

Pos logit contributions
-+0.151
 distance+0.139
 on+0.137
 by+0.132
PI+0.129
Neg logit contributions
 advoc-0.607
 rul-0.591
 mathemat-0.582
 Citiz-0.568
 tyr-0.547
Token ta
Token ID20486
Feature activation+6.68e-01

Pos logit contributions
-+0.232
 on+0.216
 distance+0.215
PI+0.212
 by+0.209
Neg logit contributions
 advoc-0.792
 rul-0.780
 mathemat-0.766
 Citiz-0.724
 tyr-0.718
TokenV
Token ID53
Feature activation+8.59e-01

Pos logit contributions
-+0.234
 distance+0.223
 on+0.216
 by+0.204
 ass+0.201
Neg logit contributions
 advoc-1.217
 rul-1.193
 mathemat-1.166
 Citiz-1.150
 conclud-1.102
TokenNS
Token ID8035
Feature activation+6.05e-01

Pos logit contributions
-+0.537
 distance+0.509
 on+0.507
 by+0.494
 your+0.482
Neg logit contributions
 advoc-1.362
 rul-1.329
 mathemat-1.309
 Citiz-1.273
 conclud-1.212
Token following
Token ID1708
Feature activation+4.56e-01

Pos logit contributions
-+0.191
 distance+0.179
PI+0.171
 your+0.162
 on+0.160
Neg logit contributions
 advoc-1.142
 rul-1.128
 mathemat-1.100
 Citiz-1.063
 conclud-1.053
Token sham
Token ID23565
Feature activation+2.98e-01

Pos logit contributions
-+0.195
 distance+0.181
PI+0.180
 on+0.176
erm+0.176
Neg logit contributions
 advoc-0.804
 rul-0.797
 mathemat-0.781
 conclud-0.738
 Citiz-0.734
Token).
Token ID737
Feature activation+1.44e+00

Pos logit contributions
-+0.108
 distance+0.100
 on+0.094
 your+0.094
PI+0.093
Neg logit contributions
 advoc-0.541
 rul-0.541
 mathemat-0.529
 conclud-0.502
 Citiz-0.502
Token
Token ID198
Feature activation+1.44e+00

Pos logit contributions
-+0.736
 distance+0.708
 on+0.682
 safely+0.667
 your+0.667
Neg logit contributions
 advoc-2.427
 rul-2.346
 Citiz-2.305
 mathemat-2.305
 conclud-2.201
Token
Token ID198
Feature activation+9.10e-01

Pos logit contributions
-+0.576
 distance+0.564
 on+0.533
 by+0.506
 tour+0.500
Neg logit contributions
 advoc-2.506
 rul-2.448
 mathemat-2.388
 Citiz-2.351
 embr-2.266
TokenResults
Token ID25468
Feature activation-5.53e-01

Pos logit contributions
 distance+0.694
 on+0.670
-+0.665
 tour+0.661
 by+0.661
Neg logit contributions
 advoc-1.305
 rul-1.238
 mathemat-1.237
 Citiz-1.186
 conclud-1.129
Token After
Token ID2293
Feature activation-4.83e-01

Pos logit contributions
 advoc+0.921
 rul+0.888
 mathemat+0.871
 Citiz+0.853
 tyr+0.823
Neg logit contributions
 distance-0.282
--0.280
PI-0.266
 your-0.260
 on-0.256
Token four
Token ID1440
Feature activation+1.82e-01

Pos logit contributions
 advoc+0.878
 rul+0.858
 mathemat+0.847
 Citiz+0.802
 tyr+0.801
Neg logit contributions
--0.186
PI-0.173
 distance-0.170
 on-0.165
 by-0.163
Tokenica
Token ID3970
Feature activation+6.04e-01

Pos logit contributions
 advoc+0.193
 rul+0.187
 mathemat+0.182
 Citiz+0.178
 embr+0.171
Neg logit contributions
--0.071
 distance-0.070
 on-0.067
 by-0.065
PI-0.065
Token,
Token ID11
Feature activation+1.57e-01

Pos logit contributions
-+0.149
PI+0.148
 distance+0.144
 your+0.122
123+0.119
Neg logit contributions
 advoc-1.164
 rul-1.149
 mathemat-1.128
 Citiz-1.081
 conclud-1.072
Token who
Token ID508
Feature activation+4.13e-01

Pos logit contributions
-+0.063
PI+0.055
 distance+0.054
 on+0.052
 your+0.049
Neg logit contributions
 advoc-0.287
 rul-0.281
 mathemat-0.278
 Citiz-0.267
 tyr-0.261
Token suffered
Token ID6989
Feature activation+4.91e-01

Pos logit contributions
-+0.190
PI+0.180
 distance+0.174
123+0.168
 on+0.168
Neg logit contributions
 advoc-0.724
 rul-0.707
 mathemat-0.701
 Citiz-0.668
 tyr-0.656
Token serious
Token ID2726
Feature activation+1.27e-01

Pos logit contributions
-+0.132
PI+0.123
 distance+0.108
123+0.103
 on+0.099
Neg logit contributions
 advoc-0.950
 rul-0.935
 mathemat-0.932
 tyr-0.882
 Citiz-0.881
Token injuries
Token ID6821
Feature activation+3.67e-02

Pos logit contributions
-+0.067
PI+0.067
123+0.063
 on+0.062
 distance+0.062
Neg logit contributions
 advoc-0.211
 rul-0.207
 mathemat-0.205
 Citiz-0.193
 embr-0.191
Token in
Token ID287
Feature activation-4.01e-01

Pos logit contributions
PI+0.009
-+0.009
 distance+0.008
123+0.008
erm+0.007
Neg logit contributions
 advoc-0.070
 rul-0.070
 mathemat-0.069
 tyr-0.065
 Citiz-0.065
Token a
Token ID257
Feature activation-2.02e-02

Pos logit contributions
 advoc+0.760
 rul+0.747
 mathemat+0.737
 Citiz+0.710
 tyr+0.703
Neg logit contributions
PI-0.119
--0.116
 distance-0.099
123-0.098
 on-0.096
Token rally
Token ID7903
Feature activation+1.13e+00

Pos logit contributions
 advoc+0.035
 rul+0.035
 mathemat+0.034
 Citiz+0.033
 tyr+0.033
Neg logit contributions
--0.009
PI-0.009
 on-0.008
 distance-0.008
123-0.008
Token crash
Token ID7014
Feature activation+4.71e-02

Pos logit contributions
PI+0.343
-+0.322
 distance+0.300
 ass+0.279
123+0.279
Neg logit contributions
 advoc-2.141
 rul-2.098
 mathemat-2.056
 Citiz-1.973
 tyr-1.962
Token last
Token ID938
Feature activation-1.09e-01

Pos logit contributions
PI+0.011
-+0.010
 distance+0.009
123+0.008
erm+0.008
Neg logit contributions
 advoc-0.092
 rul-0.091
 mathemat-0.089
 Citiz-0.085
 tyr-0.085
Token 1960
Token ID9507
Feature activation+2.88e-01

Pos logit contributions
-+0.024
PI+0.023
 distance+0.022
123+0.022
 on+0.021
Neg logit contributions
 advoc-0.089
 rul-0.086
 mathemat-0.085
 Citiz-0.083
 embr-0.080
Tokens
Token ID82
Feature activation+4.64e-01

Pos logit contributions
-+0.101
 distance+0.095
 on+0.093
PI+0.087
 by+0.086
Neg logit contributions
 advoc-0.527
 rul-0.517
 mathemat-0.506
 Citiz-0.495
 conclud-0.476
Token when
Token ID618
Feature activation-1.78e-01

Pos logit contributions
-+0.087
 distance+0.082
PI+0.071
 on+0.069
123+0.065
Neg logit contributions
 advoc-0.939
 rul-0.911
 mathemat-0.906
 Citiz-0.874
 conclud-0.856
Token strong
Token ID1913
Feature activation+2.10e-01

Pos logit contributions
 advoc+0.326
 rul+0.319
 mathemat+0.316
 Citiz+0.300
 tyr+0.295
Neg logit contributions
--0.067
 distance-0.062
PI-0.061
 on-0.059
erm-0.057
Token unions
Token ID11936
Feature activation+1.12e-01

Pos logit contributions
-+0.087
 distance+0.086
 on+0.083
 by+0.080
PI+0.080
Neg logit contributions
 advoc-0.378
 rul-0.366
 mathemat-0.360
 Citiz-0.349
 tyr-0.336
Token,
Token ID11
Feature activation+1.13e-01

Pos logit contributions
-+0.044
PI+0.041
 distance+0.040
 on+0.038
 your+0.037
Neg logit contributions
 advoc-0.206
 rul-0.202
 mathemat-0.198
 Citiz-0.190
 conclud-0.184
Token good
Token ID922
Feature activation+5.77e-01

Pos logit contributions
-+0.052
PI+0.048
 distance+0.047
 on+0.046
123+0.046
Neg logit contributions
 advoc-0.202
 rul-0.196
 mathemat-0.194
 Citiz-0.187
 tyr-0.180
Token jobs
Token ID3946
Feature activation+5.02e-01

Pos logit contributions
-+0.295
 distance+0.293
 on+0.287
PI+0.284
123+0.277
Neg logit contributions
 advoc-0.978
 rul-0.951
 mathemat-0.931
 Citiz-0.901
 tyr-0.864
Token and
Token ID290
Feature activation+4.34e-01

Pos logit contributions
-+0.099
 distance+0.093
PI+0.081
 on+0.077
 Oct+0.072
Neg logit contributions
 advoc-1.013
 rul-0.992
 mathemat-0.983
 Citiz-0.948
 conclud-0.925
Token high
Token ID1029
Feature activation+4.44e-01

Pos logit contributions
-+0.190
PI+0.182
 distance+0.174
 on+0.172
erm+0.171
Neg logit contributions
 advoc-0.778
 rul-0.755
 mathemat-0.749
 Citiz-0.719
 embr-0.694
Token taxes
Token ID5704
Feature activation+2.43e-01

Pos logit contributions
 distance+0.261
-+0.260
 on+0.255
erm+0.253
PI+0.250
Neg logit contributions
 advoc-0.715
 rul-0.692
 mathemat-0.681
 Citiz-0.650
 tyr-0.630
Token best
Token ID1266
Feature activation+5.75e-02

Pos logit contributions
-+0.020
 distance+0.019
 on+0.019
PI+0.018
 by+0.018
Neg logit contributions
 advoc-0.081
 rul-0.079
 mathemat-0.078
 Citiz-0.076
 tyr-0.074
Token publisher
Token ID9991
Feature activation+2.91e-01

Pos logit contributions
-+0.018
 distance+0.017
PI+0.017
 on+0.016
123+0.015
Neg logit contributions
 advoc-0.108
 rul-0.105
 mathemat-0.104
 Citiz-0.101
 conclud-0.098
Token in
Token ID287
Feature activation-8.90e-03

Pos logit contributions
-+0.057
 distance+0.055
PI+0.052
123+0.045
erm+0.042
Neg logit contributions
 advoc-0.576
 rul-0.566
 mathemat-0.560
 Citiz-0.542
 conclud-0.528
Token the
Token ID262
Feature activation+9.43e-02

Pos logit contributions
 advoc+0.017
 rul+0.017
 mathemat+0.017
 Citiz+0.016
 tyr+0.016
Neg logit contributions
--0.002
 distance-0.002
PI-0.002
 on-0.002
123-0.002
Token industry
Token ID2831
Feature activation+4.42e-02

Pos logit contributions
-+0.046
 on+0.043
 distance+0.042
PI+0.042
123+0.041
Neg logit contributions
 advoc-0.162
 rul-0.158
 mathemat-0.157
 Citiz-0.151
 tyr-0.145
Token.
Token ID13
Feature activation-9.35e-02

Pos logit contributions
-+0.009
 distance+0.009
PI+0.008
 on+0.007
123+0.007
Neg logit contributions
 advoc-0.087
 rul-0.086
 mathemat-0.084
 Citiz-0.081
 conclud-0.081
Token It
Token ID632
Feature activation-3.13e-01

Pos logit contributions
 advoc+0.157
 rul+0.155
 Citiz+0.152
 mathemat+0.151
 conclud+0.143
Neg logit contributions
--0.043
 distance-0.043
 safely-0.040
 on-0.039
 ass-0.038
Token
Token ID447
Feature activation+1.87e-01

Pos logit contributions
 advoc+0.566
 rul+0.562
 mathemat+0.554
 Citiz+0.534
 conclud+0.525
Neg logit contributions
--0.112
 distance-0.108
PI-0.106
 on-0.100
 by-0.095
Token
Token ID247
Feature activation-1.44e-01

Pos logit contributions
-+0.170
 distance+0.164
 on+0.163
 by+0.159
 tour+0.156
Neg logit contributions
 advoc-0.246
 rul-0.241
 mathemat-0.229
 Citiz-0.222
 conclud-0.213
Tokens
Token ID82
Feature activation-3.94e-01

Pos logit contributions
 advoc+0.254
 rul+0.252
 mathemat+0.244
 Citiz+0.238
 conclud+0.233
Neg logit contributions
--0.056
 distance-0.056
 on-0.053
 by-0.051
 your-0.050
Token a
Token ID257
Feature activation+2.70e-01

Pos logit contributions
 advoc+0.719
 rul+0.703
 mathemat+0.703
 Citiz+0.671
 tyr+0.658
Neg logit contributions
--0.147
 distance-0.138
PI-0.131
 on-0.124
erm-0.123
Tokenlight
Token ID2971
Feature activation+1.12e-01

Pos logit contributions
-+0.139
 distance+0.134
PI+0.133
 your+0.132
 on+0.131
Neg logit contributions
 advoc-0.431
 rul-0.420
 mathemat-0.412
 Citiz-0.400
 tyr-0.386
Token,
Token ID11
Feature activation-2.27e-03

Pos logit contributions
PI+0.022
-+0.021
 distance+0.018
erm+0.017
123+0.016
Neg logit contributions
 advoc-0.222
 rul-0.216
 mathemat-0.213
 Citiz-0.205
 embr-0.203
Token and
Token ID290
Feature activation-1.35e-01

Pos logit contributions
 advoc+0.004
 rul+0.004
 mathemat+0.004
 Citiz+0.004
 tyr+0.004
Neg logit contributions
--0.001
PI-0.001
 distance-0.001
 on-0.001
123-0.001
Token then
Token ID788
Feature activation+1.57e-02

Pos logit contributions
 advoc+0.247
 rul+0.240
 mathemat+0.239
 Citiz+0.232
 tyr+0.226
Neg logit contributions
--0.050
PI-0.046
 distance-0.045
123-0.043
 on-0.042
Token searched
Token ID16499
Feature activation+2.74e-01

Pos logit contributions
-+0.008
PI+0.007
 distance+0.007
123+0.007
 on+0.007
Neg logit contributions
 advoc-0.027
 rul-0.026
 mathemat-0.026
 Citiz-0.025
 tyr-0.024
Token his
Token ID465
Feature activation+1.94e-01

Pos logit contributions
-+0.083
PI+0.077
 distance+0.075
erm+0.072
 on+0.068
Neg logit contributions
 advoc-0.512
 rul-0.500
 mathemat-0.497
 Citiz-0.481
 tyr-0.472
Token vehicle
Token ID4038
Feature activation+2.48e-01

Pos logit contributions
-+0.084
PI+0.080
 on+0.075
erm+0.074
 distance+0.073
Neg logit contributions
 advoc-0.343
 rul-0.332
 mathemat-0.331
 Citiz-0.321
 tyr-0.311
Token for
Token ID329
Feature activation+4.72e-01

Pos logit contributions
-+0.050
PI+0.049
 distance+0.045
 your+0.038
erm+0.036
Neg logit contributions
 advoc-0.486
 rul-0.476
 mathemat-0.468
 Citiz-0.455
 conclud-0.449
Token weapons
Token ID3777
Feature activation+4.81e-01

Pos logit contributions
-+0.199
PI+0.183
 on+0.174
 distance+0.170
123+0.169
Neg logit contributions
 advoc-0.848
 rul-0.819
 mathemat-0.814
 Citiz-0.783
 embr-0.762
Token.
Token ID13
Feature activation+4.82e-01

Pos logit contributions
-+0.086
 distance+0.083
PI+0.080
 your+0.068
erm+0.065
Neg logit contributions
 advoc-0.960
 rul-0.929
 mathemat-0.927
 Citiz-0.884
 conclud-0.883
Token Once
Token ID4874
Feature activation-2.81e-01

Pos logit contributions
-+0.222
 distance+0.214
 on+0.200
 ass+0.197
 your+0.197
Neg logit contributions
 advoc-0.826
 rul-0.804
 mathemat-0.795
 Citiz-0.793
 conclud-0.752
Token for
Token ID329
Feature activation+1.45e-01

Pos logit contributions
 distance+0.003
-+0.003
PI+0.003
erm+0.002
123+0.002
Neg logit contributions
 advoc-0.023
 rul-0.023
 mathemat-0.023
 Citiz-0.022
 conclud-0.022
Token the
Token ID262
Feature activation+1.55e-01

Pos logit contributions
-+0.060
PI+0.054
 distance+0.054
 on+0.051
123+0.050
Neg logit contributions
 advoc-0.262
 rul-0.255
 mathemat-0.251
 Citiz-0.242
 tyr-0.239
Token incident
Token ID4519
Feature activation+4.62e-02

Pos logit contributions
-+0.079
PI+0.073
 distance+0.071
 on+0.071
 by+0.069
Neg logit contributions
 advoc-0.267
 rul-0.261
 mathemat-0.258
 Citiz-0.248
 tyr-0.240
Token,
Token ID11
Feature activation-1.16e-01

Pos logit contributions
-+0.009
 distance+0.009
PI+0.008
123+0.007
erm+0.007
Neg logit contributions
 advoc-0.092
 rul-0.090
 mathemat-0.089
 Citiz-0.086
 conclud-0.085
Token but
Token ID475
Feature activation+2.05e-01

Pos logit contributions
 advoc+0.216
 rul+0.212
 mathemat+0.212
 Citiz+0.202
 conclud+0.198
Neg logit contributions
--0.040
PI-0.035
 distance-0.034
erm-0.032
 on-0.032
Token has
Token ID468
Feature activation+1.93e-01

Pos logit contributions
-+0.097
PI+0.092
 distance+0.088
 on+0.084
123+0.084
Neg logit contributions
 advoc-0.361
 rul-0.350
 mathemat-0.348
 Citiz-0.332
 conclud-0.328
Token apparently
Token ID5729
Feature activation-1.66e-02

Pos logit contributions
PI+0.082
-+0.082
 distance+0.075
123+0.075
erm+0.072
Neg logit contributions
 advoc-0.343
 rul-0.333
 mathemat-0.328
 Citiz-0.310
 embr-0.310
Token not
Token ID407
Feature activation-2.56e-01

Pos logit contributions
 advoc+0.028
 rul+0.027
 mathemat+0.026
 embr+0.025
 Citiz+0.025
Neg logit contributions
--0.009
PI-0.009
123-0.008
 distance-0.008
 on-0.008
Token apologized
Token ID22893
Feature activation+4.33e-02

Pos logit contributions
 advoc+0.422
 rul+0.416
 mathemat+0.406
 embr+0.380
 conclud+0.380
Neg logit contributions
PI-0.138
--0.135
123-0.129
 distance-0.128
erm-0.125
Token to
Token ID284
Feature activation-1.04e-01

Pos logit contributions
PI+0.010
-+0.009
 distance+0.009
123+0.008
 auto+0.008
Neg logit contributions
 advoc-0.084
 rul-0.083
 mathemat-0.082
 embr-0.077
 Citiz-0.077
Token Crow
Token ID15502
Feature activation-1.57e-02

Pos logit contributions
 advoc+0.157
 rul+0.153
 mathemat+0.153
 Citiz+0.146
 conclud+0.141
Neg logit contributions
--0.073
PI-0.069
 on-0.069
 distance-0.068
 by-0.067
Token the
Token ID262
Feature activation+1.76e-01

Pos logit contributions
 advoc+0.560
 rul+0.553
 mathemat+0.539
 Citiz+0.523
 tyr+0.505
Neg logit contributions
--0.130
PI-0.123
 on-0.113
 distance-0.111
123-0.111
Token fierce
Token ID14800
Feature activation+1.65e-01

Pos logit contributions
-+0.089
PI+0.085
 on+0.082
 distance+0.080
 by+0.079
Neg logit contributions
 advoc-0.298
 rul-0.295
 mathemat-0.287
 Citiz-0.279
 tyr-0.268
Token crackdown
Token ID22671
Feature activation+1.11e-01

Pos logit contributions
-+0.079
PI+0.078
 on+0.075
123+0.074
 distance+0.074
Neg logit contributions
 advoc-0.282
 rul-0.278
 mathemat-0.270
 Citiz-0.263
 embr-0.253
Token that
Token ID326
Feature activation-3.76e-02

Pos logit contributions
-+0.021
 distance+0.020
PI+0.019
123+0.016
 your+0.015
Neg logit contributions
 advoc-0.219
 rul-0.218
 mathemat-0.213
 Citiz-0.206
 conclud-0.202
Token took
Token ID1718
Feature activation-2.86e-01

Pos logit contributions
 advoc+0.063
 rul+0.062
 mathemat+0.061
 Citiz+0.059
 tyr+0.056
Neg logit contributions
--0.020
PI-0.019
 distance-0.018
erm-0.018
123-0.018
Token place
Token ID1295
Feature activation-4.21e-01

Pos logit contributions
 advoc+0.468
 rul+0.460
 mathemat+0.444
 Citiz+0.434
 tyr+0.418
Neg logit contributions
--0.161
PI-0.156
 distance-0.147
 tour-0.142
erm-0.142
Token in
Token ID287
Feature activation-2.79e-01

Pos logit contributions
 rul+0.802
 advoc+0.801
 mathemat+0.777
 Citiz+0.751
 conclud+0.742
Neg logit contributions
--0.105
PI-0.103
 distance-0.095
123-0.088
erm-0.087
Token June
Token ID2795
Feature activation+2.09e-01

Pos logit contributions
 advoc+0.499
 rul+0.495
 mathemat+0.487
 Citiz+0.479
 tyr+0.459
Neg logit contributions
--0.102
PI-0.099
 distance-0.090
123-0.089
 on-0.086
Token and
Token ID290
Feature activation-6.79e-02

Pos logit contributions
 distance+0.040
-+0.036
PI+0.035
 your+0.030
 tour+0.028
Neg logit contributions
 advoc-0.413
 rul-0.410
 mathemat-0.401
 Citiz-0.396
 conclud-0.385
Token July
Token ID2901
Feature activation+2.59e-01

Pos logit contributions
 advoc+0.118
 mathemat+0.115
 rul+0.115
 Citiz+0.112
 conclud+0.107
Neg logit contributions
--0.030
PI-0.029
 distance-0.028
123-0.027
 auto-0.026
Token.
Token ID13
Feature activation+5.14e-01

Pos logit contributions
 distance+0.046
-+0.044
PI+0.042
 your+0.034
123+0.034
Neg logit contributions
 advoc-0.518
 rul-0.513
 mathemat-0.502
 Citiz-0.489
 conclud-0.482
Token commitment
Token ID7901
Feature activation-6.18e-01

Pos logit contributions
-+0.322
PI+0.303
 on+0.296
 by+0.293
 distance+0.292
Neg logit contributions
 advoc-0.841
 mathemat-0.801
 rul-0.801
 Citiz-0.763
 conclud-0.744
Token to
Token ID284
Feature activation+2.35e-01

Pos logit contributions
 advoc+1.263
 rul+1.238
 mathemat+1.232
 Citiz+1.188
 conclud+1.171
Neg logit contributions
--0.114
 distance-0.084
 on-0.083
PI-0.075
 your-0.075
Token the
Token ID262
Feature activation-3.60e-02

Pos logit contributions
-+0.085
PI+0.080
 on+0.075
123+0.073
 by+0.072
Neg logit contributions
 advoc-0.441
 mathemat-0.426
 rul-0.425
 Citiz-0.407
 conclud-0.401
Token univers
Token ID5820
Feature activation+7.35e-01

Pos logit contributions
 advoc+0.059
 mathemat+0.058
 rul+0.057
 Citiz+0.054
 conclud+0.053
Neg logit contributions
--0.022
PI-0.020
 on-0.020
 by-0.020
 distance-0.019
Tokenality
Token ID1483
Feature activation-3.85e-01

Pos logit contributions
-+0.581
PI+0.565
 distance+0.565
 on+0.560
 by+0.547
Neg logit contributions
 advoc-1.035
 rul-1.009
 mathemat-1.008
 Citiz-0.963
 conclud-0.925
Token of
Token ID286
Feature activation-3.45e-01

Pos logit contributions
 advoc+0.805
 rul+0.785
 mathemat+0.784
 Citiz+0.754
 conclud+0.743
Neg logit contributions
--0.057
 on-0.043
 distance-0.041
 by-0.035
 your-0.034
Token reason
Token ID1738
Feature activation-8.13e-02

Pos logit contributions
 advoc+0.617
 rul+0.593
 mathemat+0.591
 Citiz+0.565
 conclud+0.555
Neg logit contributions
--0.148
PI-0.147
123-0.136
 on-0.135
 distance-0.132
Token as
Token ID355
Feature activation+6.80e-02

Pos logit contributions
 advoc+0.163
 rul+0.161
 mathemat+0.159
 Citiz+0.153
 conclud+0.153
Neg logit contributions
--0.014
 distance-0.013
PI-0.012
 your-0.011
 tour-0.011
Token responsiveness
Token ID49815
Feature activation+8.82e-01

Pos logit contributions
-+0.021
PI+0.019
 on+0.018
 distance+0.018
123+0.017
Neg logit contributions
 advoc-0.131
 mathemat-0.127
 rul-0.126
 Citiz-0.121
 conclud-0.119
Token to
Token ID284
Feature activation+1.52e-01

Pos logit contributions
-+0.171
PI+0.149
 distance+0.148
 your+0.139
 on+0.134
Neg logit contributions
 advoc-1.782
 rul-1.737
 mathemat-1.735
 Citiz-1.645
 conclud-1.633
Token reality
Token ID3950
Feature activation+7.92e-02

Pos logit contributions
-+0.063
PI+0.059
123+0.057
 on+0.057
 distance+0.056
Neg logit contributions
 advoc-0.274
 rul-0.267
 mathemat-0.265
 Citiz-0.254
 conclud-0.249
Token moved
Token ID3888
Feature activation+1.65e-01

Pos logit contributions
 advoc+0.328
 rul+0.320
 mathemat+0.318
 Citiz+0.304
 tyr+0.299
Neg logit contributions
--0.078
 distance-0.071
PI-0.071
 on-0.066
123-0.065
Token there
Token ID612
Feature activation-2.02e-01

Pos logit contributions
-+0.040
PI+0.037
 distance+0.035
123+0.031
 on+0.029
Neg logit contributions
 advoc-0.318
 rul-0.314
 mathemat-0.310
 Citiz-0.298
 tyr-0.294
Token and
Token ID290
Feature activation+1.56e-01

Pos logit contributions
 advoc+0.390
 rul+0.387
 mathemat+0.381
 Citiz+0.368
 conclud+0.361
Neg logit contributions
--0.051
PI-0.046
 distance-0.046
 your-0.038
123-0.038
Token then
Token ID788
Feature activation+1.27e-01

Pos logit contributions
-+0.057
PI+0.051
 distance+0.049
 on+0.048
123+0.048
Neg logit contributions
 advoc-0.289
 rul-0.282
 mathemat-0.280
 Citiz-0.270
 tyr-0.265
Token upl
Token ID28996
Feature activation-6.66e-02

Pos logit contributions
-+0.034
PI+0.030
 distance+0.030
123+0.027
 on+0.027
Neg logit contributions
 advoc-0.244
 rul-0.241
 mathemat-0.239
 Citiz-0.228
 conclud-0.224
Tokenifted
Token ID21715
Feature activation-2.82e-01

Pos logit contributions
 advoc+0.097
 mathemat+0.094
 rul+0.094
 Citiz+0.090
 conclud+0.087
Neg logit contributions
--0.048
 on-0.047
 distance-0.046
PI-0.046
 by-0.045
Token and
Token ID290
Feature activation+4.23e-02

Pos logit contributions
 advoc+0.555
 rul+0.549
 mathemat+0.543
 Citiz+0.520
 conclud+0.513
Neg logit contributions
--0.059
PI-0.052
 distance-0.050
 on-0.042
123-0.041
Token put
Token ID1234
Feature activation+4.13e-01

Pos logit contributions
-+0.020
PI+0.019
 distance+0.018
 on+0.018
123+0.018
Neg logit contributions
 advoc-0.074
 rul-0.072
 mathemat-0.072
 Citiz-0.068
 tyr-0.067
Token on
Token ID319
Feature activation+6.59e-01

Pos logit contributions
-+0.090
 distance+0.073
PI+0.072
123+0.067
 tour+0.065
Neg logit contributions
 advoc-0.821
 rul-0.808
 mathemat-0.799
 Citiz-0.778
 tyr-0.756
Token a
Token ID257
Feature activation+1.21e-01

Pos logit contributions
-+0.187
 distance+0.161
 on+0.160
PI+0.158
123+0.147
Neg logit contributions
 advoc-1.283
 rul-1.251
 mathemat-1.240
 Citiz-1.201
 conclud-1.169
Token basement
Token ID17012
Feature activation+1.31e-01

Pos logit contributions
-+0.063
 on+0.059
PI+0.056
 distance+0.055
 by+0.055
Neg logit contributions
 advoc-0.206
 rul-0.201
 mathemat-0.199
 Citiz-0.192
 tyr-0.186
Token 2017
Token ID2177
Feature activation+1.11e-02

Pos logit contributions
 advoc+0.312
 rul+0.306
 mathemat+0.300
 Citiz+0.297
 conclud+0.284
Neg logit contributions
--0.102
 distance-0.096
 on-0.093
PI-0.090
erm-0.090
Token.
Token ID13
Feature activation-9.05e-02

Pos logit contributions
 distance+0.002
-+0.002
 on+0.002
 your+0.001
 by+0.001
Neg logit contributions
 advoc-0.022
 rul-0.022
 mathemat-0.022
 Citiz-0.021
 conclud-0.021
Token
Token ID198
Feature activation+6.57e-01

Pos logit contributions
 advoc+0.161
 rul+0.157
 mathemat+0.156
 Citiz+0.153
 conclud+0.147
Neg logit contributions
--0.040
 distance-0.037
 on-0.035
 your-0.034
 by-0.034
Token
Token ID198
Feature activation+2.10e-01

Pos logit contributions
 distance+0.279
-+0.273
 on+0.256
 hunting+0.251
 tour+0.250
Neg logit contributions
 advoc-1.091
 rul-1.070
 mathemat-1.034
 Citiz-1.026
 embr-0.997
Token[
Token ID58
Feature activation-4.28e-02

Pos logit contributions
 distance+0.078
 on+0.069
 tour+0.067
 by+0.067
-+0.067
Neg logit contributions
 advoc-0.370
 rul-0.358
 Citiz-0.354
 mathemat-0.352
 embr-0.334
Token22
Token ID1828
Feature activation-7.03e-01

Pos logit contributions
 advoc+0.067
 rul+0.064
 mathemat+0.063
 Citiz+0.062
 conclud+0.059
Neg logit contributions
 distance-0.024
--0.023
 on-0.022
 tour-0.022
 your-0.022
Token]
Token ID60
Feature activation+8.51e-02

Pos logit contributions
 advoc+1.112
 rul+1.100
 Citiz+1.055
 mathemat+1.053
 embr+1.016
Neg logit contributions
 distance-0.344
--0.325
 safely-0.318
 on-0.316
 hunting-0.315
Token Literally
Token ID48414
Feature activation+5.59e-01

Pos logit contributions
-+0.050
 distance+0.048
 on+0.047
 your+0.045
 safely+0.045
Neg logit contributions
 advoc-0.138
 rul-0.134
 mathemat-0.132
 Citiz-0.131
 conclud-0.124
Token,
Token ID11
Feature activation-8.62e-03

Pos logit contributions
-+0.237
 distance+0.224
 on+0.217
PI+0.211
 tour+0.211
Neg logit contributions
 advoc-1.006
 mathemat-0.983
 rul-0.979
 Citiz-0.927
 conclud-0.919
Token R
Token ID371
Feature activation+2.89e-01

Pos logit contributions
 advoc+0.015
 rul+0.014
 mathemat+0.014
 Citiz+0.014
 conclud+0.013
Neg logit contributions
--0.005
 distance-0.004
 on-0.004
PI-0.004
 tour-0.004
Tokenashi
Token ID12144
Feature activation+5.41e-01

Pos logit contributions
-+0.129
 distance+0.124
 on+0.121
 by+0.115
 your+0.114
Neg logit contributions
 advoc-0.512
 rul-0.497
 mathemat-0.496
 Citiz-0.473
 conclud-0.460
Token countries
Token ID2678
Feature activation+2.45e-02

Pos logit contributions
-+0.055
 distance+0.052
 on+0.052
PI+0.051
123+0.050
Neg logit contributions
 advoc-0.158
 rul-0.151
 mathemat-0.150
 Citiz-0.142
 tyr-0.138
Token to
Token ID284
Feature activation+1.46e-01

Pos logit contributions
-+0.005
 distance+0.004
PI+0.004
 on+0.004
 your+0.004
Neg logit contributions
 advoc-0.049
 rul-0.048
 mathemat-0.048
 Citiz-0.046
 conclud-0.045
Token those
Token ID883
Feature activation+7.16e-03

Pos logit contributions
-+0.097
 distance+0.091
PI+0.091
 on+0.090
 by+0.087
Neg logit contributions
 advoc-0.229
 rul-0.224
 mathemat-0.223
 Citiz-0.212
 tyr-0.205
Token same
Token ID976
Feature activation-5.48e-01

Pos logit contributions
-+0.004
 distance+0.004
PI+0.003
 on+0.003
 your+0.003
Neg logit contributions
 advoc-0.012
 rul-0.012
 mathemat-0.012
 Citiz-0.011
 tyr-0.011
Token market
Token ID1910
Feature activation+3.49e-01

Pos logit contributions
 advoc+0.909
 rul+0.884
 mathemat+0.883
 Citiz+0.826
 tyr+0.812
Neg logit contributions
--0.303
 distance-0.290
 on-0.284
PI-0.283
 your-0.276
Token forces
Token ID3386
Feature activation-2.78e-01

Pos logit contributions
-+0.168
 distance+0.165
 on+0.163
PI+0.158
 your+0.156
Neg logit contributions
 advoc-0.608
 rul-0.590
 mathemat-0.584
 Citiz-0.554
 conclud-0.544
Token.
Token ID13
Feature activation-2.80e-01

Pos logit contributions
 advoc+0.557
 rul+0.549
 mathemat+0.546
 Citiz+0.526
 conclud+0.517
Neg logit contributions
 distance-0.049
--0.049
PI-0.037
 your-0.033
 tour-0.032
Token Ne
Token ID3169
Feature activation+5.72e-01

Pos logit contributions
 advoc+0.464
 rul+0.452
 Citiz+0.452
 mathemat+0.444
 conclud+0.421
Neg logit contributions
--0.140
 distance-0.137
 safely-0.130
 on-0.128
 ass-0.126
Tokenoliberal
Token ID28525
Feature activation+1.03e-01

Pos logit contributions
-+0.550
 distance+0.546
 on+0.538
 by+0.531
 your+0.525
Neg logit contributions
 advoc-0.713
 rul-0.685
 mathemat-0.682
 Citiz-0.653
 tyr-0.613
Token analysts
Token ID13440
Feature activation-1.03e-02

Pos logit contributions
-+0.068
 on+0.066
 distance+0.065
 by+0.064
PI+0.064
Neg logit contributions
 advoc-0.162
 rul-0.156
 mathemat-0.155
 Citiz-0.148
 conclud-0.143
Token similarly
Token ID12470
Feature activation+2.31e-01

Pos logit contributions
 advoc+0.019
 rul+0.018
 mathemat+0.018
 Citiz+0.017
 conclud+0.017
Neg logit contributions
--0.004
PI-0.004
 distance-0.004
 on-0.004
 your-0.004
Token others
Token ID1854
Feature activation-2.10e-01

Pos logit contributions
-+0.065
 distance+0.058
PI+0.058
123+0.056
 on+0.055
Neg logit contributions
 advoc-0.273
 rul-0.263
 mathemat-0.262
 Citiz-0.254
 tyr-0.246
Token,
Token ID11
Feature activation+4.58e-01

Pos logit contributions
 advoc+0.379
 rul+0.370
 mathemat+0.366
 Citiz+0.353
 conclud+0.345
Neg logit contributions
--0.085
PI-0.078
 distance-0.077
123-0.073
 your-0.072
Token they
Token ID484
Feature activation+5.26e-01

Pos logit contributions
-+0.165
 distance+0.142
PI+0.135
 on+0.131
123+0.126
Neg logit contributions
 advoc-0.864
 rul-0.837
 mathemat-0.833
 Citiz-0.809
 tyr-0.788
Token said
Token ID531
Feature activation+7.26e-02

Pos logit contributions
-+0.284
PI+0.266
 on+0.257
 distance+0.254
123+0.252
Neg logit contributions
 advoc-0.884
 rul-0.865
 mathemat-0.856
 Citiz-0.814
 conclud-0.803
Token that
Token ID326
Feature activation+2.22e-01

Pos logit contributions
-+0.023
 distance+0.020
PI+0.020
 on+0.019
 by+0.018
Neg logit contributions
 advoc-0.139
 rul-0.135
 mathemat-0.133
 Citiz-0.130
 tyr-0.126
Token wasn
Token ID2492
Feature activation-1.67e+00

Pos logit contributions
-+0.082
 distance+0.074
PI+0.071
 on+0.070
123+0.067
Neg logit contributions
 advoc-0.413
 rul-0.402
 mathemat-0.397
 Citiz-0.385
 tyr-0.373
Token't
Token ID470
Feature activation-3.37e-01

Pos logit contributions
 advoc+2.184
 rul+2.128
 mathemat+2.044
 Citiz+2.016
 embr+1.952
Neg logit contributions
 distance-1.262
--1.223
 on-1.194
 hunting-1.193
 tour-1.187
Token true
Token ID2081
Feature activation-2.00e-01

Pos logit contributions
 advoc+0.629
 rul+0.605
 mathemat+0.599
 Citiz+0.578
 tyr+0.564
Neg logit contributions
--0.127
 distance-0.117
PI-0.114
123-0.108
 on-0.106
Token.
Token ID13
Feature activation-3.49e-01

Pos logit contributions
 advoc+0.398
 rul+0.387
 mathemat+0.386
 Citiz+0.369
 conclud+0.365
Neg logit contributions
--0.040
 distance-0.037
PI-0.033
 on-0.030
 your-0.028
Token They
Token ID1119
Feature activation+1.87e-02

Pos logit contributions
 advoc+0.592
 rul+0.578
 mathemat+0.567
 Citiz+0.566
 conclud+0.536
Neg logit contributions
--0.167
 distance-0.159
 on-0.152
 safely-0.148
 by-0.145
Token gave
Token ID2921
Feature activation-4.66e-02

Pos logit contributions
-+0.010
PI+0.009
 on+0.009
 distance+0.009
 by+0.009
Neg logit contributions
 advoc-0.032
 rul-0.031
 mathemat-0.031
 Citiz-0.029
 conclud-0.029
Token
Token ID198
Feature activation-2.31e-02

Pos logit contributions
 distance+0.023
-+0.021
 hunting+0.020
 on+0.020
 safely+0.020
Neg logit contributions
 advoc-0.081
 rul-0.080
 mathemat-0.076
 Citiz-0.076
 embr-0.074
Tokenhttps
Token ID5450
Feature activation-3.79e-01

Pos logit contributions
 advoc+0.035
 rul+0.034
 mathemat+0.033
 Citiz+0.032
 tyr+0.030
Neg logit contributions
 distance-0.016
 on-0.015
--0.015
 by-0.015
 your-0.015
Token://
Token ID1378
Feature activation-3.42e-01

Pos logit contributions
 advoc+0.590
 rul+0.577
 mathemat+0.554
 Citiz+0.551
 embr+0.534
Neg logit contributions
 distance-0.197
--0.186
 hunting-0.182
 on-0.181
 tour-0.181
Tokenwww
Token ID2503
Feature activation-7.15e-01

Pos logit contributions
 advoc+0.547
 rul+0.528
 mathemat+0.517
 Citiz+0.501
 tyr+0.482
Neg logit contributions
 distance-0.209
--0.209
 on-0.202
 by-0.198
 your-0.195
Token.
Token ID13
Feature activation-3.03e-01

Pos logit contributions
 advoc+1.337
 rul+1.321
 mathemat+1.279
 Citiz+1.268
 embr+1.241
Neg logit contributions
 distance-0.134
--0.107
 hunting-0.106
 tour-0.098
 on-0.096
Tokeneff
Token ID14822
Feature activation-1.76e+00

Pos logit contributions
 advoc+0.471
 rul+0.452
 mathemat+0.442
 Citiz+0.431
 tyr+0.412
Neg logit contributions
--0.201
 distance-0.198
 on-0.196
 by-0.189
 solely-0.186
Token.
Token ID13
Feature activation-5.54e-03

Pos logit contributions
 advoc+3.483
 rul+3.422
 mathemat+3.337
 Citiz+3.288
 embr+3.189
Neg logit contributions
 distance-0.254
--0.245
 on-0.204
 your-0.186
 by-0.184
Tokenorg
Token ID2398
Feature activation-3.82e-01

Pos logit contributions
 advoc+0.008
 rul+0.008
 mathemat+0.008
 Citiz+0.007
 embr+0.007
Neg logit contributions
--0.004
 distance-0.004
 on-0.004
 by-0.004
 auto-0.004
Token/
Token ID14
Feature activation-5.60e-01

Pos logit contributions
 advoc+0.673
 rul+0.658
 mathemat+0.639
 Citiz+0.633
 embr+0.612
Neg logit contributions
 distance-0.141
--0.137
 on-0.126
 ass-0.122
 safely-0.122
Tokend
Token ID67
Feature activation-4.23e-01

Pos logit contributions
 advoc+0.892
 rul+0.858
 mathemat+0.830
 Citiz+0.815
 embr+0.777
Neg logit contributions
--0.348
 distance-0.344
 on-0.329
 by-0.324
 your-0.322
Tokenee
Token ID1453
Feature activation-5.73e-01

Pos logit contributions
 advoc+0.696
 rul+0.676
 mathemat+0.661
 Citiz+0.638
 embr+0.617
Neg logit contributions
 distance-0.237
--0.226
 on-0.222
 by-0.217
 your-0.216
Tokenr
Token ID81
Feature activation+5.70e-01

Pos logit contributions
 advoc+1.268
 rul+1.230
 mathemat+1.211
 Citiz+1.191
 tyr+1.125
Neg logit contributions
 distance-0.332
 on-0.295
--0.285
 by-0.285
 your-0.280
Tokenump
Token ID931
Feature activation-5.80e-01

Pos logit contributions
 distance+0.275
 on+0.268
-+0.267
 by+0.261
 your+0.254
Neg logit contributions
 advoc-0.982
 rul-0.956
 mathemat-0.953
 Citiz-0.918
 conclud-0.872
Token220
Token ID17572
Feature activation+4.70e-01

Pos logit contributions
 advoc+1.043
 rul+1.020
 mathemat+1.011
 Citiz+0.981
 tyr+0.943
Neg logit contributions
 distance-0.220
 on-0.203
--0.201
 by-0.198
 ass-0.196
Token22
Token ID1828
Feature activation-3.93e-02

Pos logit contributions
 distance+0.177
 on+0.164
-+0.160
 by+0.158
 ass+0.156
Neg logit contributions
 advoc-0.855
 rul-0.839
 mathemat-0.828
 Citiz-0.803
 tyr-0.772
Token.
Token ID13
Feature activation-4.85e-01

Pos logit contributions
 advoc+0.073
 rul+0.071
 mathemat+0.070
 Citiz+0.068
 tyr+0.066
Neg logit contributions
 distance-0.014
 on-0.013
 by-0.012
--0.012
 ass-0.012
Tokenjpg
Token ID9479
Feature activation-1.66e+00

Pos logit contributions
 advoc+0.647
 rul+0.615
 mathemat+0.602
 Citiz+0.574
 tyr+0.546
Neg logit contributions
 distance-0.422
 on-0.399
 ass-0.395
--0.390
 by-0.390
Token
Token ID198
Feature activation-1.73e+00

Pos logit contributions
 advoc+2.670
 rul+2.639
 Citiz+2.566
 mathemat+2.533
 embr+2.446
Neg logit contributions
 distance-0.757
 safely-0.692
--0.682
 tour-0.667
 on-0.665
Token
Token ID198
Feature activation-1.21e+00

Pos logit contributions
 advoc+2.585
 rul+2.548
 Citiz+2.431
 mathemat+2.398
 embr+2.372
Neg logit contributions
 distance-0.839
 hunting-0.791
 tour-0.765
 safely-0.759
 stalk-0.757
TokenEmail
Token ID15333
Feature activation+1.30e-01

Pos logit contributions
 advoc+1.518
 rul+1.469
 mathemat+1.433
 Citiz+1.397
 embr+1.295
Neg logit contributions
 distance-1.090
 on-1.072
--1.071
 by-1.056
 tour-1.055
Token
Token ID198
Feature activation-1.46e+00

Pos logit contributions
 distance+0.055
-+0.054
 on+0.051
 tour+0.049
 hunting+0.048
Neg logit contributions
 advoc-0.219
 rul-0.215
 mathemat-0.209
 Citiz-0.206
 embr-0.198
Token
Token ID198
Feature activation-1.20e+00

Pos logit contributions
 advoc+2.330
 rul+2.267
 Citiz+2.197
 mathemat+2.176
 embr+2.114
Neg logit contributions
 distance-0.671
 hunting-0.629
 tour-0.614
 stalk-0.612
--0.609
Token increasing
Token ID3649
Feature activation-2.99e-01

Pos logit contributions
 advoc+0.094
 rul+0.091
 mathemat+0.090
 Citiz+0.086
 conclud+0.084
Neg logit contributions
--0.022
PI-0.021
 distance-0.020
 on-0.019
 your-0.018
Token claims
Token ID3667
Feature activation+5.06e-02

Pos logit contributions
 advoc+0.510
 rul+0.487
 mathemat+0.484
 Citiz+0.456
 embr+0.448
Neg logit contributions
--0.156
PI-0.153
 on-0.147
 distance-0.143
 by-0.143
Token of
Token ID286
Feature activation-4.06e-01

Pos logit contributions
-+0.010
PI+0.009
 distance+0.009
erm+0.007
 tour+0.007
Neg logit contributions
 advoc-0.101
 rul-0.098
 mathemat-0.097
 Citiz-0.094
 tyr-0.092
Token fraud
Token ID7394
Feature activation-3.53e-02

Pos logit contributions
 advoc+0.682
 mathemat+0.658
 rul+0.657
 Citiz+0.622
 embr+0.611
Neg logit contributions
--0.208
PI-0.206
 on-0.191
 distance-0.190
erm-0.189
Token.
Token ID13
Feature activation-5.49e-01

Pos logit contributions
 advoc+0.070
 rul+0.069
 mathemat+0.068
 Citiz+0.066
 conclud+0.065
Neg logit contributions
 distance-0.007
--0.007
PI-0.006
 your-0.005
 tour-0.005
Token
Token ID198
Feature activation-1.81e+00

Pos logit contributions
 advoc+0.930
 rul+0.903
 Citiz+0.900
 mathemat+0.887
 conclud+0.843
Neg logit contributions
--0.246
 distance-0.244
 safely-0.227
 on-0.224
 ass-0.223
Token
Token ID198
Feature activation-1.62e+00

Pos logit contributions
 advoc+2.688
 rul+2.655
 Citiz+2.552
 mathemat+2.514
 embr+2.488
Neg logit contributions
 distance-0.881
 hunting-0.815
 stalk-0.796
 tour-0.795
 safely-0.785
TokenFull
Token ID13295
Feature activation+2.11e-01

Pos logit contributions
 advoc+2.415
 rul+2.274
 mathemat+2.258
 Citiz+2.252
 embr+2.085
Neg logit contributions
 distance-1.064
 on-0.979
 hunting-0.977
 tour-0.976
 auto-0.971
Token preliminary
Token ID15223
Feature activation-2.69e-01

Pos logit contributions
-+0.089
 distance+0.087
PI+0.085
 on+0.085
 by+0.083
Neg logit contributions
 advoc-0.372
 rul-0.366
 mathemat-0.359
 Citiz-0.348
 conclud-0.335
Token results
Token ID2482
Feature activation-6.05e-01

Pos logit contributions
 advoc+0.462
 rul+0.451
 mathemat+0.446
 Citiz+0.422
 tyr+0.411
Neg logit contributions
--0.134
PI-0.130
 distance-0.123
 on-0.123
 specifically-0.120
Token were
Token ID547
Feature activation-6.69e-01

Pos logit contributions
 advoc+1.094
 rul+1.073
 mathemat+1.062
 Citiz+1.006
 conclud+1.000
Neg logit contributions
--0.222
PI-0.215
 distance-0.209
erm-0.189
 your-0.188
Token assists
Token ID13648
Feature activation-2.64e-01

Pos logit contributions
 advoc+0.711
 rul+0.697
 mathemat+0.682
 Citiz+0.661
 tyr+0.637
Neg logit contributions
 distance-0.225
--0.213
PI-0.209
 tour-0.204
 on-0.204
Token for
Token ID329
Feature activation-5.28e-01

Pos logit contributions
 advoc+0.516
 mathemat+0.502
 rul+0.501
 Citiz+0.474
 conclud+0.468
Neg logit contributions
PI-0.052
 distance-0.050
--0.049
123-0.042
 tour-0.041
Token the
Token ID262
Feature activation-1.56e-01

Pos logit contributions
 advoc+0.919
 rul+0.905
 mathemat+0.888
 Citiz+0.855
 tyr+0.833
Neg logit contributions
--0.242
PI-0.225
 on-0.218
 distance-0.216
 by-0.209
Token Jets
Token ID14728
Feature activation-8.73e-03

Pos logit contributions
 advoc+0.249
 rul+0.242
 mathemat+0.241
 Citiz+0.231
 tyr+0.222
Neg logit contributions
--0.095
 on-0.089
PI-0.088
 distance-0.087
 by-0.086
Token/
Token ID14
Feature activation-4.17e-01

Pos logit contributions
 advoc+0.017
 rul+0.017
 mathemat+0.017
 Citiz+0.016
 conclud+0.016
Neg logit contributions
 distance-0.002
--0.002
PI-0.002
123-0.001
erm-0.001
TokenTh
Token ID817
Feature activation-1.92e+00

Pos logit contributions
 advoc+0.647
 mathemat+0.619
 rul+0.617
 Citiz+0.600
 conclud+0.575
Neg logit contributions
 distance-0.268
--0.265
 on-0.257
 by-0.249
 your-0.248
Tokenras
Token ID8847
Feature activation-1.63e-01

Pos logit contributions
 rul+2.952
 advoc+2.949
 mathemat+2.874
 Citiz+2.793
 embr+2.660
Neg logit contributions
 distance-1.115
--1.105
 on-1.067
 by-1.031
 your-1.029
Tokenhers
Token ID7084
Feature activation-1.15e+00

Pos logit contributions
 advoc+0.258
 rul+0.249
 mathemat+0.248
 Citiz+0.239
 embr+0.229
Neg logit contributions
--0.095
 distance-0.094
 on-0.090
PI-0.089
 by-0.088
Token,
Token ID11
Feature activation-2.81e-01

Pos logit contributions
 advoc+2.197
 mathemat+2.153
 rul+2.145
 Citiz+2.064
 conclud+2.016
Neg logit contributions
--0.262
 distance-0.261
PI-0.232
123-0.226
ering-0.219
Token helped
Token ID4193
Feature activation+4.45e-01

Pos logit contributions
 advoc+0.486
 rul+0.471
 mathemat+0.469
 Citiz+0.444
 tyr+0.441
Neg logit contributions
--0.133
PI-0.127
 distance-0.124
123-0.117
 on-0.116
Token the
Token ID262
Feature activation+2.25e-01

Pos logit contributions
-+0.167
PI+0.150
 distance+0.141
 on+0.139
erm+0.136
Neg logit contributions
 advoc-0.817
 mathemat-0.800
 rul-0.794
 Citiz-0.763
 tyr-0.744