TOP ACTIVATIONS
MAX = 10.913

ker atin 9 and ker atin 16 .[ 22 ]
and explore the world with kindred spirits . Sign
by Chinese Agg ression // Quartz Steve M oll
over with her family : preparing the ritual foods , doing
politics ), the petty - bour ge ois intellig ents ia
roughly 20 cents per kil ow att - hour , electricity
seat is moved all the way back or you may have
was also told to " watch how I speak because this
residents to get serious about preparations . On Monday afternoon Governor
question , the petty - bour ge ois intellig ents ia
said . F ul op had hoped to remake
River ash ponds and moving ash to a lined structural fill
the threat coming . To prepare . To fight . The
yo and depending on what type of player you are that
. chest o @ gl obe . com . Follow him
like it struggles to keep up with the two top yo
Trump had fallen far behind Clinton by May ,
. When Trump named his vice presidential pick , Gov .
ities of the bourgeois type ( l abor unions
the rock , hitting different mineral grains within it .

MIN ACTIVATIONS
MIN = -11.781

ι ρ π α γ
as Goldman worked to j ett ison the billions in sub
1 AL READ Y COMP ET ED : at Arizona Cardinals
scientific community , this was NASA s seventh consecutive
larg er image T IFF original image Download : Figure
Sam , Merry , P ipp in , Ar ag orn
( S 4 ) DOI : 10 . 10 86 /
achievements . You get the idea . There was still a
se " ( note the space ) { options } a
scientific community , this was NASA s seventh consecutive
Mark ota , a P izz ag ater conspiracy theorist and
asley . At NASA , one casualty could be
again that the old address https :// . nice hash .
Power Point slide PNG larger image larg er
summer , New Castle 's waterfront at Battery Park remains the
c ps j . com / 1 BP g 8 sm
both died of cerebral ha em or rh age ," said
as school boys on a Thursday night . F
THE L AND OF EXT REM ELY EXP ENS IVE BE
2 nd Avenue on one side of the university was renamed

INTERVAL 5.239 - 10.913
CONTAINS 0.687%

the Saudi version of sh aria law . Her criticism was
that long jab to set up what s next
fraction of that time ( ch ick ens take three weeks
campaign . Another user , Haley Lent , told
:// www . bel fast te legraph . co . uk

INTERVAL -0.434 - 5.239
CONTAINS 59.265%

industry Many toy manufacturers are global giants that provide
they begin streaming . Tw itch enables video game
scan to confirm that Kell Brook s left eye
have not yet been final ised , the statement
Mr . Sanders isn t going anywhere .

INTERVAL -6.108 - -0.434
CONTAINS 39.780%

doors because it would impede movement of the wheelchair . During
lives . Government is committed to reduce the accidents and fatalities
As a ball of fire envelop ed the launch
land confisc ation and Juda isation that has be fall en
personal opinion about Islam . It den igr ates the religion

INTERVAL -11.781 - -6.108
CONTAINS 0.268%

ote ( O mega Pharma - Quick step ) s .
. A Soy uz carrying NASA astronaut Shane Kim b rough
the atmosphere different than the quiet colonial town that attracted them
Worlds . SUP ER HER O +
the successful Z Z P ops line with Zach Par ise
Token ker
Token ID41927
Feature activation-5.14e-01

Pos logit contributions
 amen+0.188
 lent+0.179
 sidel+0.175
selves+0.175
 conject+0.174
Neg logit contributions
ansas-0.246
ti-0.246
anim-0.231
velt-0.230
hor-0.223
Tokenatin
Token ID10680
Feature activation+7.85e+00

Pos logit contributions
 amen+0.468
 lent+0.454
 sidel+0.447
 conject+0.444
selves+0.444
Neg logit contributions
ansas-0.258
ti-0.256
anim-0.232
velt-0.231
hor-0.217
Token 9
Token ID860
Feature activation+2.99e+00

Pos logit contributions
ansas+5.349
ti+5.328
anim+5.009
velt+4.972
hor+4.925
Neg logit contributions
 amen-5.112
selves-5.046
 sidel-4.938
 lent-4.899
 conject-4.882
Token and
Token ID290
Feature activation-7.29e-02

Pos logit contributions
ti+2.354
ansas+2.310
anim+2.195
velt+2.155
hor+2.111
Neg logit contributions
 amen-1.848
 lent-1.742
 sidel-1.736
selves-1.727
 conject-1.711
Token ker
Token ID41927
Feature activation+8.96e-01

Pos logit contributions
 amen+0.041
 lent+0.039
selves+0.039
 sidel+0.038
 conject+0.038
Neg logit contributions
ti-0.061
ansas-0.061
anim-0.058
velt-0.057
hor-0.056
Tokenatin
Token ID10680
Feature activation+1.09e+01

Pos logit contributions
ansas+0.358
ti+0.355
anim+0.314
velt+0.309
hor+0.287
Neg logit contributions
 amen-0.909
 lent-0.884
 sidel-0.872
 conject-0.868
selves-0.866
Token 16
Token ID1467
Feature activation+2.73e+00

Pos logit contributions
ti+7.084
ansas+6.885
anim+6.458
hor+6.364
velt+6.281
Neg logit contributions
selves-6.944
 amen-6.840
 sidel-6.796
 conject-6.747
 lent-6.737
Token.[
Token ID3693
Feature activation+2.18e+00

Pos logit contributions
ti+2.224
ansas+2.202
anim+2.084
velt+2.053
hor+2.008
Neg logit contributions
 amen-1.617
 lent-1.505
selves-1.500
 sidel-1.497
 conject-1.477
Token22
Token ID1828
Feature activation+1.22e+00

Pos logit contributions
ti+1.285
ansas+1.270
anim+1.182
velt+1.153
hor+1.118
Neg logit contributions
 amen-1.801
 lent-1.737
 sidel-1.711
 conject-1.698
 helic-1.693
Token]
Token ID60
Feature activation+2.01e+00

Pos logit contributions
ansas+0.961
ti+0.954
anim+0.896
velt+0.895
hor+0.861
Neg logit contributions
 amen-0.769
 lent-0.737
 sidel-0.718
 conject-0.713
selves-0.711
Token
Token ID198
Feature activation+1.96e+00

Pos logit contributions
ti+1.653
ansas+1.646
anim+1.551
velt+1.543
hor+1.490
Neg logit contributions
 amen-1.176
 lent-1.110
 sidel-1.094
 conject-1.082
selves-1.079
Token and
Token ID290
Feature activation+4.85e-01

Pos logit contributions
 amen+0.263
 lent+0.250
 sidel+0.244
 conject+0.242
selves+0.241
Neg logit contributions
ansas-0.377
ti-0.376
anim-0.354
velt-0.352
hor-0.341
Token explore
Token ID7301
Feature activation-1.41e+00

Pos logit contributions
ansas+0.443
ti+0.441
velt+0.419
anim+0.419
hor+0.409
Neg logit contributions
 amen-0.239
 lent-0.225
 helic-0.222
 sidel-0.221
selves-0.218
Token the
Token ID262
Feature activation+2.89e-01

Pos logit contributions
 amen+0.744
 lent+0.703
 sidel+0.684
 conject+0.678
 helic+0.676
Neg logit contributions
ansas-1.243
ti-1.241
anim-1.174
velt-1.169
hor-1.133
Token world
Token ID995
Feature activation+3.12e+00

Pos logit contributions
ti+0.178
ansas+0.176
velt+0.163
anim+0.163
hor+0.156
Neg logit contributions
 amen-0.229
 lent-0.221
 helic-0.218
selves-0.218
 sidel-0.217
Token with
Token ID351
Feature activation+4.78e-01

Pos logit contributions
ti+2.541
ansas+2.537
anim+2.368
velt+2.363
hor+2.288
Neg logit contributions
 amen-1.841
 helic-1.765
 lent-1.738
 conject-1.700
 sidel-1.697
Token kindred
Token ID46449
Feature activation+1.06e+01

Pos logit contributions
ti+0.421
ansas+0.415
anim+0.396
velt+0.392
hor+0.378
Neg logit contributions
 amen-0.251
 sidel-0.234
 helic-0.234
 lent-0.232
selves-0.228
Token spirits
Token ID13747
Feature activation+7.03e+00

Pos logit contributions
ti+2.508
anim+2.242
ansas+2.062
spe+1.552
velt+1.523
Neg logit contributions
 amen-11.848
 helicop-11.514
 helic-11.244
MIT-11.162
 TRANS-11.121
Token.
Token ID13
Feature activation+2.56e+00

Pos logit contributions
ti+5.009
ansas+4.843
anim+4.605
velt+4.415
hor+4.338
Neg logit contributions
 helic-4.939
 thous-4.883
 skelet-4.848
 helicop-4.833
 amen-4.828
Token
Token ID198
Feature activation-3.08e+00

Pos logit contributions
ti+1.897
ansas+1.852
anim+1.755
velt+1.725
hor+1.672
Neg logit contributions
 amen-1.724
 helic-1.683
 lent-1.672
 conject-1.647
 sidel-1.636
Token
Token ID198
Feature activation-2.01e+00

Pos logit contributions
 amen+1.764
 lent+1.670
selves+1.629
 sidel+1.628
 conject+1.608
Neg logit contributions
ansas-2.589
ti-2.545
velt-2.420
anim-2.412
inois-2.340
TokenSign
Token ID11712
Feature activation+3.04e-01

Pos logit contributions
 amen+1.500
 lent+1.436
 conject+1.415
 sidel+1.415
 helic+1.412
Neg logit contributions
ti-1.351
ansas-1.334
anim-1.253
velt-1.236
hor-1.196
Token by
Token ID416
Feature activation-2.29e+00

Pos logit contributions
 amen+3.160
 lent+3.049
selves+2.994
 sidel+2.972
 conject+2.910
Neg logit contributions
ansas-4.322
ti-4.272
velt-4.039
anim-4.034
inois-3.895
Token Chinese
Token ID3999
Feature activation+4.31e+00

Pos logit contributions
 amen+1.493
 lent+1.432
 sidel+1.405
selves+1.397
 conject+1.384
Neg logit contributions
ansas-1.746
ti-1.730
velt-1.622
anim-1.621
hor-1.553
Token Agg
Token ID19015
Feature activation-4.14e+00

Pos logit contributions
ti+3.418
ansas+3.349
anim+3.169
velt+3.152
hor+3.067
Neg logit contributions
 amen-2.635
 lent-2.475
 conject-2.449
 helic-2.443
 sidel-2.426
Tokenression
Token ID2234
Feature activation-3.91e+00

Pos logit contributions
 amen+2.475
 lent+2.411
 sidel+2.334
selves+2.299
 conject+2.280
Neg logit contributions
ansas-3.263
ti-3.228
velt-3.064
anim-3.028
hor-2.937
Token //
Token ID3373
Feature activation-6.42e-01

Pos logit contributions
 amen+2.328
 lent+2.253
selves+2.196
 sidel+2.188
 conject+2.153
Neg logit contributions
ansas-3.126
ti-3.082
velt-2.928
anim-2.927
hor-2.818
Token Quartz
Token ID45976
Feature activation+1.06e+01

Pos logit contributions
 amen+0.412
 lent+0.393
 sidel+0.384
 conject+0.381
selves+0.381
Neg logit contributions
ansas-0.495
ti-0.493
anim-0.463
velt-0.462
hor-0.445
Token
Token ID447
Feature activation+1.36e+00

Pos logit contributions
ti+7.821
ansas+7.590
anim+7.376
hor+7.114
velt+7.054
Neg logit contributions
 amen-6.515
 lent-6.221
 sidel-6.166
 helic-6.164
selves-6.073
Token
Token ID247
Feature activation+3.05e+00

Pos logit contributions
ti+1.224
ansas+1.203
anim+1.151
velt+1.147
hor+1.114
Neg logit contributions
 amen-0.685
 lent-0.648
 conject-0.635
 sidel-0.634
selves-0.629
Token Steve
Token ID6542
Feature activation+6.97e-01

Pos logit contributions
ti+1.989
ansas+1.961
anim+1.823
velt+1.797
hor+1.757
Neg logit contributions
 amen-2.328
 lent-2.237
 sidel-2.202
 helic-2.188
 conject-2.183
Token M
Token ID337
Feature activation+2.71e+00

Pos logit contributions
ansas+0.544
ti+0.544
anim+0.510
velt+0.506
hor+0.492
Neg logit contributions
 amen-0.440
 lent-0.417
 sidel-0.411
 helic-0.408
 conject-0.406
Tokenoll
Token ID692
Feature activation-3.63e+00

Pos logit contributions
ansas+1.945
ti+1.939
anim+1.818
velt+1.800
hor+1.753
Neg logit contributions
 amen-1.897
 lent-1.800
 helic-1.767
 sidel-1.766
 conject-1.747
Tokenover
Token ID2502
Feature activation-1.79e+00

Pos logit contributions
ti+1.419
ansas+1.400
anim+1.311
velt+1.270
hor+1.246
Neg logit contributions
 amen-1.712
 sidel-1.647
 lent-1.645
 helic-1.617
selves-1.614
Token with
Token ID351
Feature activation+3.10e+00

Pos logit contributions
 amen+0.997
 lent+0.945
 sidel+0.915
 conject+0.909
selves+0.906
Neg logit contributions
ansas-1.539
ti-1.530
anim-1.447
velt-1.442
hor-1.395
Token her
Token ID607
Feature activation+4.57e-01

Pos logit contributions
ti+2.536
ansas+2.526
anim+2.390
velt+2.352
hor+2.295
Neg logit contributions
 amen-1.786
 lent-1.688
selves-1.670
 sidel-1.664
 helic-1.656
Token family
Token ID1641
Feature activation+3.13e+00

Pos logit contributions
ti+0.379
ansas+0.379
anim+0.358
velt+0.354
hor+0.344
Neg logit contributions
 amen-0.264
 lent-0.251
 sidel-0.245
 conject-0.243
selves-0.242
Token:
Token ID25
Feature activation+3.02e+00

Pos logit contributions
ti+2.501
ansas+2.496
anim+2.364
velt+2.336
hor+2.267
Neg logit contributions
 amen-1.846
 lent-1.765
 sidel-1.745
 conject-1.732
 helic-1.723
Token preparing
Token ID10629
Feature activation+1.03e+01

Pos logit contributions
ti+2.383
ansas+2.372
anim+2.237
velt+2.203
hor+2.149
Neg logit contributions
 amen-1.816
 lent-1.738
selves-1.706
 conject-1.706
 sidel-1.704
Token the
Token ID262
Feature activation+4.94e+00

Pos logit contributions
ti+6.973
ansas+6.885
anim+6.694
velt+6.295
hor+6.286
Neg logit contributions
 amen-6.525
 sidel-6.202
 helic-6.198
 conject-6.105
selves-6.014
Token ritual
Token ID12146
Feature activation+4.81e+00

Pos logit contributions
ti+3.507
ansas+3.501
anim+3.336
velt+3.222
hor+3.150
Neg logit contributions
 amen-3.259
 conject-3.121
 sidel-3.103
selves-3.095
 lent-3.089
Token foods
Token ID9013
Feature activation+1.51e+00

Pos logit contributions
ti+3.508
ansas+3.496
anim+3.293
velt+3.151
hor+3.130
Neg logit contributions
 amen-3.105
 conject-2.993
 lent-2.963
selves-2.947
 sidel-2.932
Token,
Token ID11
Feature activation-4.55e-01

Pos logit contributions
ti+1.234
ansas+1.231
anim+1.163
velt+1.154
hor+1.121
Neg logit contributions
 amen-0.890
 lent-0.837
 sidel-0.828
 conject-0.822
 helic-0.822
Token doing
Token ID1804
Feature activation+5.24e+00

Pos logit contributions
 amen+0.261
 lent+0.249
 sidel+0.242
 conject+0.240
selves+0.239
Neg logit contributions
ansas-0.382
ti-0.381
anim-0.359
velt-0.358
hor-0.346
Token politics
Token ID4819
Feature activation+4.11e-01

Pos logit contributions
ti+2.471
ansas+2.448
anim+2.294
velt+2.281
hor+2.214
Neg logit contributions
 amen-2.129
 lent-2.033
 sidel-1.978
selves-1.966
 helic-1.965
Token),
Token ID828
Feature activation+1.73e+00

Pos logit contributions
ansas+0.364
ti+0.363
anim+0.344
velt+0.341
hor+0.331
Neg logit contributions
 amen-0.218
 lent-0.204
 sidel-0.200
selves-0.197
 conject-0.197
Tokenthe
Token ID1169
Feature activation+2.10e+00

Pos logit contributions
ti+1.320
ansas+1.317
anim+1.238
velt+1.227
hor+1.186
Neg logit contributions
 amen-1.119
 lent-1.070
 sidel-1.046
 conject-1.036
 helic-1.034
Token petty
Token ID25229
Feature activation+1.40e+00

Pos logit contributions
ti+1.597
ansas+1.583
anim+1.489
velt+1.472
hor+1.430
Neg logit contributions
 amen-1.369
 lent-1.306
 sidel-1.275
 helic-1.266
 conject-1.262
Token-
Token ID12
Feature activation+1.65e+00

Pos logit contributions
ansas+1.147
ti+1.142
anim+1.076
velt+1.072
hor+1.035
Neg logit contributions
 amen-0.831
 lent-0.792
 sidel-0.772
 conject-0.765
selves-0.764
Tokenbour
Token ID6084
Feature activation+9.82e+00

Pos logit contributions
ti+1.273
ansas+1.267
anim+1.193
velt+1.177
hor+1.145
Neg logit contributions
 amen-1.070
 lent-1.022
 sidel-1.008
 conject-0.993
 helic-0.992
Tokenge
Token ID469
Feature activation+6.41e+00

Pos logit contributions
ti+3.667
ansas+3.429
anim+3.015
hor+2.984
velt+2.948
Neg logit contributions
 amen-10.116
 helic-9.881
 lent-9.866
 thous-9.714
 sidel-9.652
Tokenois
Token ID10924
Feature activation+2.85e+00

Pos logit contributions
ansas+3.599
ti+3.561
anim+3.348
velt+3.245
inois+3.157
Neg logit contributions
 amen-5.360
 helic-5.235
 lent-5.223
 sidel-5.195
 conject-5.047
Token intellig
Token ID10878
Feature activation+3.43e-01

Pos logit contributions
ti+1.578
ansas+1.578
anim+1.449
velt+1.418
hor+1.354
Neg logit contributions
 amen-2.437
 lent-2.344
 sidel-2.319
 helic-2.294
 conject-2.287
Tokenents
Token ID658
Feature activation+2.98e+00

Pos logit contributions
ansas+0.126
ti+0.126
anim+0.110
velt+0.106
hor+0.099
Neg logit contributions
 amen-0.361
 lent-0.349
 sidel-0.348
 helic-0.345
 conject-0.343
Tokenia
Token ID544
Feature activation+9.90e-01

Pos logit contributions
ti+1.740
ansas+1.726
anim+1.592
velt+1.532
hor+1.485
Neg logit contributions
 amen-2.496
 lent-2.397
 sidel-2.381
 helic-2.364
 conject-2.346
Token roughly
Token ID7323
Feature activation+8.38e-01

Pos logit contributions
ansas+1.920
ti+1.913
velt+1.783
anim+1.781
hor+1.729
Neg logit contributions
 amen-1.518
 lent-1.457
selves-1.435
 sidel-1.430
 conject-1.415
Token 20
Token ID1160
Feature activation+1.29e+00

Pos logit contributions
ansas+0.626
ti+0.624
anim+0.584
velt+0.581
hor+0.562
Neg logit contributions
 amen-0.553
 lent-0.530
 sidel-0.518
selves-0.518
 conject-0.516
Token cents
Token ID16059
Feature activation+4.70e+00

Pos logit contributions
ti+1.050
ansas+1.045
anim+0.983
velt+0.969
hor+0.951
Neg logit contributions
 amen-0.770
 lent-0.736
 sidel-0.723
selves-0.715
 conject-0.715
Token per
Token ID583
Feature activation+3.95e+00

Pos logit contributions
ti+3.963
ansas+3.919
anim+3.704
velt+3.623
hor+3.569
Neg logit contributions
 amen-2.576
 sidel-2.431
selves-2.431
 helic-2.423
 lent-2.406
Token kil
Token ID8769
Feature activation+6.16e-01

Pos logit contributions
ti+3.262
ansas+3.191
anim+2.999
velt+2.948
hor+2.920
Neg logit contributions
 amen-2.234
 conject-2.149
selves-2.141
 sidel-2.127
 lent-2.124
Tokenow
Token ID322
Feature activation+9.76e+00

Pos logit contributions
ti+0.356
ansas+0.353
anim+0.325
velt+0.320
hor+0.309
Neg logit contributions
 amen-0.515
 lent-0.497
 sidel-0.492
 conject-0.488
 helic-0.488
Tokenatt
Token ID1078
Feature activation+2.96e+00

Pos logit contributions
ti+4.357
ansas+4.059
anim+3.662
hor+3.466
inois+3.273
Neg logit contributions
 amen-9.463
 sidel-9.170
 helic-9.108
 conject-9.057
selves-9.053
Token-
Token ID12
Feature activation+3.20e+00

Pos logit contributions
ti+2.126
ansas+2.111
anim+1.984
velt+1.901
hor+1.887
Neg logit contributions
 amen-2.033
 sidel-1.947
 conject-1.936
 helic-1.931
 lent-1.931
Tokenhour
Token ID9769
Feature activation+1.51e+00

Pos logit contributions
ti+1.549
ansas+1.496
anim+1.408
hor+1.320
velt+1.303
Neg logit contributions
 amen-2.964
 sidel-2.883
 conject-2.881
 helic-2.881
 lent-2.873
Token,
Token ID11
Feature activation+1.48e+00

Pos logit contributions
ti+1.288
ansas+1.286
anim+1.210
velt+1.202
hor+1.167
Neg logit contributions
 amen-0.839
 lent-0.792
 sidel-0.778
selves-0.773
 conject-0.770
Token electricity
Token ID8744
Feature activation+2.59e+00

Pos logit contributions
ansas+1.202
ti+1.196
anim+1.124
velt+1.115
hor+1.080
Neg logit contributions
 amen-0.886
 lent-0.844
 sidel-0.827
selves-0.827
 conject-0.819
Token seat
Token ID5852
Feature activation+4.25e-01

Pos logit contributions
ansas+0.607
ti+0.606
anim+0.568
velt+0.565
hor+0.546
Neg logit contributions
 amen-0.485
 lent-0.462
 sidel-0.451
 conject-0.450
selves-0.449
Token is
Token ID318
Feature activation+1.26e+00

Pos logit contributions
ansas+0.309
ti+0.309
anim+0.288
velt+0.286
hor+0.276
Neg logit contributions
 amen-0.292
 lent-0.279
 sidel-0.274
 conject-0.272
selves-0.271
Token moved
Token ID3888
Feature activation-3.48e-01

Pos logit contributions
ansas+0.953
ti+0.951
anim+0.892
velt+0.883
hor+0.859
Neg logit contributions
 amen-0.826
 lent-0.787
 sidel-0.772
 conject-0.770
selves-0.768
Token all
Token ID477
Feature activation+1.70e+00

Pos logit contributions
 amen+0.225
 lent+0.215
 sidel+0.210
 conject+0.209
selves+0.208
Neg logit contributions
ansas-0.267
ti-0.266
anim-0.250
velt-0.248
hor-0.240
Token the
Token ID262
Feature activation+6.25e+00

Pos logit contributions
ti+1.448
ansas+1.440
anim+1.361
velt+1.345
hor+1.315
Neg logit contributions
 amen-0.948
 lent-0.886
selves-0.874
 conject-0.874
 sidel-0.866
Token way
Token ID835
Feature activation+9.67e+00

Pos logit contributions
ti+4.261
ansas+4.099
anim+3.958
velt+3.849
hor+3.774
Neg logit contributions
 amen-4.440
 conject-4.155
 lent-4.120
selves-4.115
 sidel-4.011
Token back
Token ID736
Feature activation+1.81e+00

Pos logit contributions
ti+6.311
ansas+6.169
velt+5.766
anim+5.717
hor+5.690
Neg logit contributions
 amen-6.554
 conject-6.328
 lent-6.185
selves-6.133
 helic-6.125
Token or
Token ID393
Feature activation+2.05e+00

Pos logit contributions
ti+1.411
ansas+1.411
anim+1.327
velt+1.312
hor+1.281
Neg logit contributions
 amen-1.137
 lent-1.084
 conject-1.063
 sidel-1.057
selves-1.049
Token you
Token ID345
Feature activation+1.14e+00

Pos logit contributions
ti+1.660
ansas+1.658
anim+1.560
velt+1.551
hor+1.505
Neg logit contributions
 amen-1.217
 lent-1.157
 conject-1.134
selves-1.131
 sidel-1.127
Token may
Token ID743
Feature activation+1.69e+00

Pos logit contributions
ansas+0.832
ti+0.831
anim+0.776
velt+0.769
hor+0.744
Neg logit contributions
 amen-0.773
 lent-0.739
 sidel-0.723
 conject-0.722
selves-0.717
Token have
Token ID423
Feature activation-1.65e+00

Pos logit contributions
ti+1.288
ansas+1.285
anim+1.204
velt+1.195
hor+1.161
Neg logit contributions
 amen-1.083
 lent-1.033
 conject-1.011
 sidel-1.006
selves-1.004
Token was
Token ID373
Feature activation-3.08e-01

Pos logit contributions
 amen+0.005
 lent+0.005
 sidel+0.005
selves+0.005
 conject+0.005
Neg logit contributions
ansas-0.006
ti-0.006
anim-0.005
velt-0.005
hor-0.005
Token also
Token ID635
Feature activation-2.03e-01

Pos logit contributions
 amen+0.212
 lent+0.202
 sidel+0.198
selves+0.197
 conject+0.197
Neg logit contributions
ti-0.222
ansas-0.221
anim-0.207
velt-0.205
hor-0.199
Token told
Token ID1297
Feature activation-8.70e-01

Pos logit contributions
 amen+0.142
 lent+0.136
 sidel+0.133
selves+0.133
 conject+0.133
Neg logit contributions
ti-0.144
ansas-0.144
anim-0.134
velt-0.133
hor-0.129
Token to
Token ID284
Feature activation+7.67e-01

Pos logit contributions
 amen+0.503
 lent+0.478
 sidel+0.466
 conject+0.461
selves+0.461
Neg logit contributions
ansas-0.729
ti-0.726
anim-0.685
velt-0.683
hor-0.659
Token "
Token ID366
Feature activation+1.08e+00

Pos logit contributions
ti+0.583
ansas+0.581
anim+0.545
velt+0.541
hor+0.524
Neg logit contributions
 amen-0.497
 lent-0.475
 sidel-0.464
 conject-0.464
selves-0.463
Tokenwatch
Token ID8340
Feature activation+9.61e+00

Pos logit contributions
ti+0.709
ansas+0.708
anim+0.660
velt+0.650
hor+0.629
Neg logit contributions
 amen-0.819
 lent-0.788
 sidel-0.773
 conject-0.771
 helic-0.765
Token how
Token ID703
Feature activation+5.06e+00

Pos logit contributions
ti+7.019
ansas+6.773
anim+6.500
velt+6.246
hor+6.108
Neg logit contributions
 amen-5.995
 lent-5.701
 conject-5.623
selves-5.539
 helic-5.524
Token I
Token ID314
Feature activation+3.94e+00

Pos logit contributions
ti+3.885
ansas+3.866
anim+3.670
velt+3.606
hor+3.492
Neg logit contributions
 amen-3.174
 lent-2.990
 conject-2.932
 sidel-2.927
selves-2.917
Token speak
Token ID2740
Feature activation-9.25e-01

Pos logit contributions
ti+2.772
ansas+2.755
anim+2.555
velt+2.553
hor+2.442
Neg logit contributions
 amen-2.747
 lent-2.618
selves-2.579
 conject-2.578
 sidel-2.572
Token because
Token ID780
Feature activation+2.09e-01

Pos logit contributions
 amen+0.531
 lent+0.506
 sidel+0.491
 conject+0.487
selves+0.485
Neg logit contributions
ansas-0.778
ti-0.774
anim-0.730
velt-0.729
hor-0.704
Token this
Token ID428
Feature activation+3.10e+00

Pos logit contributions
ansas+0.170
ti+0.170
anim+0.160
velt+0.159
hor+0.154
Neg logit contributions
 amen-0.125
 lent-0.119
 sidel-0.116
selves-0.115
 conject-0.115
Token residents
Token ID5085
Feature activation+9.60e-01

Pos logit contributions
ansas+1.336
ti+1.332
anim+1.257
velt+1.245
hor+1.207
Neg logit contributions
 amen-0.904
 lent-0.857
selves-0.831
 sidel-0.829
 conject-0.827
Token to
Token ID284
Feature activation+2.82e+00

Pos logit contributions
ansas+0.758
ti+0.756
anim+0.713
velt+0.705
hor+0.684
Neg logit contributions
 amen-0.588
 lent-0.560
 sidel-0.547
 conject-0.547
selves-0.545
Token get
Token ID651
Feature activation+4.10e+00

Pos logit contributions
ti+2.150
ansas+2.145
anim+2.026
velt+2.009
hor+1.965
Neg logit contributions
 amen-1.765
 lent-1.661
 conject-1.642
 helic-1.636
selves-1.630
Token serious
Token ID2726
Feature activation+2.50e+00

Pos logit contributions
ti+2.856
ansas+2.839
anim+2.688
velt+2.592
hor+2.565
Neg logit contributions
 amen-2.816
 lent-2.638
 helic-2.615
 sidel-2.608
selves-2.604
Token about
Token ID546
Feature activation+7.98e-01

Pos logit contributions
ti+2.028
ansas+2.027
anim+1.911
velt+1.881
hor+1.837
Neg logit contributions
 amen-1.462
 lent-1.382
 sidel-1.350
 helic-1.340
selves-1.332
Token preparations
Token ID21518
Feature activation+9.59e+00

Pos logit contributions
ansas+0.679
ti+0.676
anim+0.639
velt+0.636
hor+0.617
Neg logit contributions
 amen-0.445
 lent-0.420
selves-0.407
 sidel-0.405
 conject-0.404
Token.
Token ID13
Feature activation+1.86e+00

Pos logit contributions
ansas+6.341
ti+6.329
anim+5.913
velt+5.752
hor+5.714
Neg logit contributions
 amen-6.320
 helic-6.072
 lent-5.999
 sidel-5.956
selves-5.879
Token On
Token ID1550
Feature activation+1.93e+00

Pos logit contributions
ansas+1.509
ti+1.507
anim+1.411
velt+1.403
hor+1.360
Neg logit contributions
 amen-1.119
 lent-1.069
 sidel-1.043
 conject-1.038
 helic-1.030
Token Monday
Token ID3321
Feature activation-2.28e-01

Pos logit contributions
ansas+1.417
ti+1.412
anim+1.319
velt+1.310
hor+1.269
Neg logit contributions
 amen-1.302
 lent-1.241
 sidel-1.220
 conject-1.207
selves-1.204
Token afternoon
Token ID6672
Feature activation+1.27e+00

Pos logit contributions
 amen+0.133
 lent+0.127
 sidel+0.123
 conject+0.122
selves+0.122
Neg logit contributions
ansas-0.190
ti-0.189
anim-0.179
velt-0.178
hor-0.172
Token Governor
Token ID10807
Feature activation+2.54e+00

Pos logit contributions
ansas+1.111
ti+1.110
anim+1.045
velt+1.042
hor+1.011
Neg logit contributions
 amen-0.689
 lent-0.652
 sidel-0.634
 conject-0.629
selves-0.627
Token question
Token ID1808
Feature activation-4.49e+00

Pos logit contributions
 amen+2.632
 lent+2.524
 sidel+2.479
 conject+2.459
selves+2.435
Neg logit contributions
ansas-3.148
ti-3.095
velt-2.926
anim-2.922
inois-2.796
Token,
Token ID11
Feature activation-4.62e-02

Pos logit contributions
 amen+2.405
 lent+2.306
 sidel+2.233
 conject+2.203
selves+2.202
Neg logit contributions
ansas-3.884
ti-3.830
velt-3.666
anim-3.635
inois-3.528
Tokenthe
Token ID1169
Feature activation+1.44e+00

Pos logit contributions
 amen+0.030
 lent+0.029
 sidel+0.028
 conject+0.028
selves+0.028
Neg logit contributions
ansas-0.035
ti-0.035
anim-0.033
velt-0.033
hor-0.032
Token petty
Token ID25229
Feature activation+8.33e-01

Pos logit contributions
ti+1.094
ansas+1.089
anim+1.022
velt+1.012
hor+0.984
Neg logit contributions
 amen-0.945
 lent-0.902
 sidel-0.880
 helic-0.874
 conject-0.872
Token-
Token ID12
Feature activation+6.47e-01

Pos logit contributions
ansas+0.654
ti+0.652
anim+0.612
velt+0.608
hor+0.588
Neg logit contributions
 amen-0.524
 lent-0.500
 sidel-0.489
 conject-0.485
selves-0.484
Tokenbour
Token ID6084
Feature activation+9.47e+00

Pos logit contributions
ti+0.499
ansas+0.498
anim+0.468
velt+0.463
hor+0.450
Neg logit contributions
 amen-0.416
 lent-0.398
 sidel-0.391
 conject-0.386
 helic-0.385
Tokenge
Token ID469
Feature activation+6.80e+00

Pos logit contributions
ti+3.731
ansas+3.462
anim+3.096
hor+3.081
velt+3.061
Neg logit contributions
 amen-9.460
 lent-9.243
 helic-9.232
 sidel-9.172
selves-9.013
Tokenois
Token ID10924
Feature activation+2.29e+00

Pos logit contributions
ansas+3.915
ti+3.900
anim+3.659
velt+3.577
inois+3.437
Neg logit contributions
 amen-5.522
 lent-5.365
 sidel-5.332
 helic-5.321
 conject-5.218
Token intellig
Token ID10878
Feature activation-1.28e-01

Pos logit contributions
ti+1.536
ansas+1.533
anim+1.432
velt+1.410
hor+1.362
Neg logit contributions
 amen-1.685
 lent-1.615
 sidel-1.593
 helic-1.574
 conject-1.569
Tokenents
Token ID658
Feature activation+3.26e+00

Pos logit contributions
 amen+0.130
 lent+0.127
 sidel+0.125
 conject+0.124
selves+0.124
Neg logit contributions
ansas-0.051
ti-0.051
anim-0.045
velt-0.044
hor-0.041
Tokenia
Token ID544
Feature activation+1.13e+00

Pos logit contributions
ti+1.935
ansas+1.919
anim+1.778
velt+1.717
hor+1.657
Neg logit contributions
 amen-2.694
 lent-2.589
 sidel-2.572
 helic-2.543
 conject-2.537
Token said
Token ID531
Feature activation+1.02e+00

Pos logit contributions
 amen+1.116
 lent+1.068
 sidel+1.032
 conject+1.031
selves+1.011
Neg logit contributions
ansas-1.472
ti-1.450
anim-1.374
velt-1.365
hor-1.316
Token.
Token ID13
Feature activation+4.08e-01

Pos logit contributions
ansas+0.816
ti+0.816
anim+0.766
velt+0.762
hor+0.736
Neg logit contributions
 amen-0.618
 lent-0.589
 sidel-0.575
selves-0.571
 conject-0.570
Token
Token ID198
Feature activation-7.84e-02

Pos logit contributions
ansas+0.329
ti+0.328
anim+0.309
velt+0.309
inois+0.296
Neg logit contributions
 amen-0.247
 lent-0.235
 sidel-0.229
selves-0.227
 conject-0.226
Token
Token ID198
Feature activation-1.42e-01

Pos logit contributions
 amen+0.049
 lent+0.047
 sidel+0.046
 helic+0.046
 conject+0.046
Neg logit contributions
ti-0.062
ansas-0.061
anim-0.058
velt-0.057
hor-0.056
TokenF
Token ID37
Feature activation-9.27e-01

Pos logit contributions
 amen+0.093
 lent+0.089
 sidel+0.087
 conject+0.087
 helic+0.086
Neg logit contributions
ansas-0.108
ti-0.108
anim-0.101
velt-0.100
hor-0.097
Tokenul
Token ID377
Feature activation+9.41e+00

Pos logit contributions
 amen+0.684
 lent+0.653
 sidel+0.642
 conject+0.637
selves+0.636
Neg logit contributions
ansas-0.633
ti-0.630
anim-0.589
velt-0.582
hor-0.564
Tokenop
Token ID404
Feature activation-4.15e-01

Pos logit contributions
ti+4.852
ansas+4.779
velt+4.527
anim+4.466
hor+4.332
Neg logit contributions
 thous-8.211
 amen-8.011
 helic-7.999
 skelet-7.829
 helicop-7.775
Token had
Token ID550
Feature activation-9.99e-01

Pos logit contributions
 amen+0.269
 lent+0.256
 sidel+0.251
 conject+0.249
selves+0.249
Neg logit contributions
ansas-0.318
ti-0.317
anim-0.297
velt-0.296
hor-0.285
Token hoped
Token ID10719
Feature activation+1.50e+00

Pos logit contributions
 amen+0.638
 lent+0.610
 sidel+0.596
 conject+0.591
selves+0.588
Neg logit contributions
ansas-0.776
ti-0.773
anim-0.726
velt-0.722
hor-0.696
Token to
Token ID284
Feature activation+1.76e+00

Pos logit contributions
ansas+1.212
ti+1.210
anim+1.145
velt+1.137
hor+1.100
Neg logit contributions
 amen-0.889
 lent-0.850
selves-0.829
 sidel-0.827
 conject-0.826
Token remake
Token ID28763
Feature activation-2.00e-01

Pos logit contributions
ti+1.305
ansas+1.300
velt+1.221
anim+1.214
hor+1.175
Neg logit contributions
 amen-1.152
selves-1.098
 lent-1.091
 conject-1.081
 helic-1.079
Token River
Token ID5866
Feature activation-1.49e+00

Pos logit contributions
ansas+0.351
ti+0.349
anim+0.322
velt+0.320
hor+0.307
Neg logit contributions
 amen-0.444
 lent-0.428
 sidel-0.421
 conject-0.418
selves-0.418
Token ash
Token ID12530
Feature activation+6.26e+00

Pos logit contributions
 amen+0.932
 lent+0.886
 sidel+0.865
 conject+0.857
selves+0.853
Neg logit contributions
ansas-1.182
ti-1.178
anim-1.112
velt-1.104
hor-1.066
Token ponds
Token ID47339
Feature activation+2.98e+00

Pos logit contributions
ansas+5.000
ti+4.919
anim+4.527
velt+4.520
hor+4.406
Neg logit contributions
 amen-3.724
selves-3.670
 sidel-3.563
 lent-3.554
 helic-3.513
Token and
Token ID290
Feature activation-2.96e+00

Pos logit contributions
ansas+2.362
ti+2.343
anim+2.199
velt+2.180
hor+2.121
Neg logit contributions
 amen-1.827
 lent-1.744
 helic-1.709
selves-1.709
 sidel-1.704
Token moving
Token ID3867
Feature activation+2.75e+00

Pos logit contributions
 amen+1.780
 lent+1.692
 sidel+1.657
 conject+1.633
 helic+1.616
Neg logit contributions
ti-2.398
ansas-2.397
anim-2.269
velt-2.254
hor-2.176
Token ash
Token ID12530
Feature activation+9.41e+00

Pos logit contributions
ansas+2.202
ti+2.179
anim+2.044
velt+2.024
hor+1.960
Neg logit contributions
 amen-1.655
 lent-1.590
selves-1.559
 sidel-1.557
 conject-1.543
Token to
Token ID284
Feature activation+3.04e+00

Pos logit contributions
ansas+6.828
ti+6.795
velt+6.181
anim+6.139
inois+5.976
Neg logit contributions
 amen-5.896
selves-5.686
 lent-5.650
 sidel-5.593
 helic-5.556
Token a
Token ID257
Feature activation+1.36e+00

Pos logit contributions
ansas+2.404
ti+2.378
anim+2.218
velt+2.209
inois+2.140
Neg logit contributions
 amen-1.857
 lent-1.787
 sidel-1.757
selves-1.746
 conject-1.739
Token lined
Token ID16566
Feature activation-2.21e+00

Pos logit contributions
ansas+0.968
ti+0.961
anim+0.890
velt+0.886
hor+0.858
Neg logit contributions
 amen-0.939
 lent-0.900
 sidel-0.886
selves-0.885
 conject-0.880
Token structural
Token ID13204
Feature activation+4.53e+00

Pos logit contributions
 amen+1.449
 lent+1.380
 sidel+1.347
 conject+1.327
selves+1.320
Neg logit contributions
ti-1.678
ansas-1.674
anim-1.584
velt-1.575
hor-1.507
Token fill
Token ID6070
Feature activation-2.55e-01

Pos logit contributions
ansas+3.371
ti+3.317
anim+3.077
velt+3.024
hor+3.015
Neg logit contributions
 amen-2.863
selves-2.840
 lent-2.787
 conject-2.776
 helic-2.754
Token the
Token ID262
Feature activation+4.70e+00

Pos logit contributions
ti+3.610
ansas+3.585
anim+3.447
velt+3.372
hor+3.279
Neg logit contributions
 amen-2.774
 lent-2.612
 sidel-2.523
selves-2.520
 helic-2.515
Token threat
Token ID2372
Feature activation+3.66e+00

Pos logit contributions
ti+3.510
ansas+3.459
anim+3.299
velt+3.253
hor+3.168
Neg logit contributions
 amen-3.046
 lent-2.905
selves-2.810
 conject-2.798
 sidel-2.797
Token coming
Token ID2406
Feature activation+2.92e+00

Pos logit contributions
ti+2.887
ansas+2.875
anim+2.713
velt+2.671
hor+2.599
Neg logit contributions
 amen-2.244
 lent-2.108
selves-2.057
 conject-2.056
 sidel-2.051
Token.
Token ID13
Feature activation-1.33e+00

Pos logit contributions
ti+2.319
ansas+2.307
anim+2.175
velt+2.154
hor+2.086
Neg logit contributions
 amen-1.786
 lent-1.695
 conject-1.657
 helic-1.649
selves-1.649
Token To
Token ID1675
Feature activation+1.06e+00

Pos logit contributions
 amen+0.665
 lent+0.630
selves+0.611
 sidel+0.609
 conject+0.604
Neg logit contributions
ansas-1.206
ti-1.199
anim-1.139
velt-1.135
hor-1.100
Token prepare
Token ID8335
Feature activation+9.39e+00

Pos logit contributions
ti+0.798
ansas+0.798
anim+0.749
velt+0.745
hor+0.718
Neg logit contributions
 amen-0.700
 lent-0.667
 sidel-0.652
selves-0.648
 conject-0.648
Token.
Token ID13
Feature activation+1.80e+00

Pos logit contributions
ti+6.934
ansas+6.851
anim+6.660
velt+6.493
hor+6.356
Neg logit contributions
 amen-5.767
 lent-5.268
 sidel-5.213
 helic-5.094
 conject-5.015
Token To
Token ID1675
Feature activation+3.33e+00

Pos logit contributions
ansas+1.567
ti+1.563
anim+1.479
velt+1.471
hor+1.424
Neg logit contributions
 amen-0.974
 lent-0.921
 sidel-0.901
 conject-0.889
 helic-0.884
Token fight
Token ID1907
Feature activation+6.20e-01

Pos logit contributions
ti+2.419
ansas+2.407
velt+2.268
anim+2.267
hor+2.174
Neg logit contributions
 amen-2.220
 lent-2.105
selves-2.054
 helic-2.053
 sidel-2.052
Token.
Token ID13
Feature activation+1.46e+00

Pos logit contributions
ansas+0.518
ti+0.517
anim+0.488
velt+0.485
hor+0.470
Neg logit contributions
 amen-0.357
 lent-0.339
 sidel-0.330
 conject-0.328
selves-0.327
Token The
Token ID383
Feature activation+1.09e-01

Pos logit contributions
ansas+1.288
ti+1.285
anim+1.215
velt+1.209
hor+1.170
Neg logit contributions
 amen-0.781
 lent-0.738
 sidel-0.722
 conject-0.713
 helic-0.709
Tokenyo
Token ID8226
Feature activation+1.20e+00

Pos logit contributions
 amen+2.376
 lent+2.284
 sidel+2.232
selves+2.226
 conject+2.213
Neg logit contributions
ansas-1.616
ti-1.547
velt-1.456
anim-1.432
inois-1.371
Token and
Token ID290
Feature activation+1.74e+00

Pos logit contributions
ti+1.075
ansas+1.073
anim+1.016
velt+1.009
hor+0.981
Neg logit contributions
 amen-0.614
 lent-0.582
 sidel-0.566
 conject-0.562
selves-0.562
Token depending
Token ID6906
Feature activation+5.24e-01

Pos logit contributions
ti+1.362
ansas+1.358
anim+1.276
velt+1.264
hor+1.224
Neg logit contributions
 amen-1.082
 lent-1.035
 sidel-1.012
selves-1.011
 conject-1.009
Token on
Token ID319
Feature activation+2.51e+00

Pos logit contributions
ansas+0.401
ti+0.398
velt+0.374
anim+0.373
hor+0.358
Neg logit contributions
 amen-0.342
 lent-0.327
 sidel-0.319
 conject-0.317
selves-0.316
Token what
Token ID644
Feature activation-4.18e-01

Pos logit contributions
ti+2.112
ansas+2.105
anim+1.986
velt+1.968
hor+1.912
Neg logit contributions
 amen-1.409
 lent-1.347
selves-1.316
 conject-1.307
 sidel-1.306
Token type
Token ID2099
Feature activation+9.33e+00

Pos logit contributions
 amen+0.253
 lent+0.241
 sidel+0.235
 conject+0.233
selves+0.232
Neg logit contributions
ansas-0.338
ti-0.336
anim-0.317
velt-0.316
hor-0.305
Token of
Token ID286
Feature activation+4.74e+00

Pos logit contributions
ti+7.472
ansas+7.275
anim+6.896
velt+6.720
hor+6.517
Neg logit contributions
selves-5.196
 amen-5.194
 conject-5.070
 sidel-5.030
 lent-4.967
Token player
Token ID2137
Feature activation+1.95e+00

Pos logit contributions
ti+3.499
ansas+3.461
anim+3.266
velt+3.202
hor+3.109
Neg logit contributions
 amen-3.016
selves-2.938
 lent-2.924
 conject-2.903
 sidel-2.875
Token you
Token ID345
Feature activation+4.82e+00

Pos logit contributions
ti+1.590
ansas+1.576
anim+1.492
velt+1.473
hor+1.426
Neg logit contributions
 amen-1.157
 lent-1.102
selves-1.085
 sidel-1.083
 conject-1.079
Token are
Token ID389
Feature activation+3.37e+00

Pos logit contributions
ti+3.478
ansas+3.427
anim+3.169
velt+3.112
hor+3.030
Neg logit contributions
 amen-3.199
selves-3.018
 conject-3.012
 helic-2.999
 lent-2.993
Token that
Token ID326
Feature activation-2.68e-01

Pos logit contributions
ti+2.759
ansas+2.728
anim+2.583
velt+2.541
hor+2.483
Neg logit contributions
 amen-1.936
selves-1.822
 lent-1.822
 helic-1.811
 conject-1.797
Token.
Token ID13
Feature activation-5.22e-01

Pos logit contributions
 amen+1.119
 lent+1.062
 sidel+1.038
 conject+1.032
selves+1.025
Neg logit contributions
ansas-1.586
ti-1.583
anim-1.485
velt-1.481
hor-1.433
Tokenchest
Token ID46713
Feature activation+8.69e-01

Pos logit contributions
 amen+0.483
 lent+0.466
 sidel+0.461
 helic+0.458
 conject+0.456
Neg logit contributions
ansas-0.259
ti-0.257
anim-0.235
velt-0.233
hor-0.222
Tokeno
Token ID78
Feature activation-2.30e+00

Pos logit contributions
ti+0.649
ansas+0.646
anim+0.606
velt+0.602
hor+0.579
Neg logit contributions
 amen-0.586
 lent-0.563
 sidel-0.550
 helic-0.547
selves-0.547
Token@
Token ID31
Feature activation-4.31e+00

Pos logit contributions
 amen+0.974
 lent+0.907
 sidel+0.874
 conject+0.862
selves+0.852
Neg logit contributions
ansas-2.278
ti-2.269
anim-2.161
velt-2.147
hor-2.091
Tokengl
Token ID4743
Feature activation-6.30e-01

Pos logit contributions
 amen+2.979
 lent+2.920
 sidel+2.839
 conject+2.797
selves+2.794
Neg logit contributions
ansas-3.054
ti-2.984
anim-2.804
velt-2.796
inois-2.691
Tokenobe
Token ID5910
Feature activation+9.21e+00

Pos logit contributions
 amen+0.604
 lent+0.586
 sidel+0.578
 conject+0.575
selves+0.574
Neg logit contributions
ansas-0.287
ti-0.285
anim-0.255
velt-0.252
hor-0.238
Token.
Token ID13
Feature activation+4.67e-01

Pos logit contributions
ti+6.516
velt+6.230
ansas+6.183
anim+6.166
hor+6.130
Neg logit contributions
 amen-6.209
 thous-6.112
 lent-6.002
selves-5.980
 sidel-5.931
Tokencom
Token ID785
Feature activation-4.49e+00

Pos logit contributions
ti+0.243
ansas+0.243
anim+0.222
velt+0.218
hor+0.207
Neg logit contributions
 amen-0.418
 lent-0.404
 sidel-0.398
 conject-0.396
selves-0.396
Token.
Token ID764
Feature activation-3.92e+00

Pos logit contributions
 amen+2.569
 lent+2.466
 sidel+2.409
 conject+2.366
selves+2.319
Neg logit contributions
ansas-3.715
ti-3.678
anim-3.505
velt-3.494
hor-3.363
Token Follow
Token ID7281
Feature activation+1.54e+00

Pos logit contributions
 amen+1.859
 lent+1.767
 sidel+1.699
 conject+1.665
selves+1.664
Neg logit contributions
ansas-3.661
ti-3.628
velt-3.469
anim-3.463
inois-3.355
Token him
Token ID683
Feature activation+4.14e-01

Pos logit contributions
ti+1.186
ansas+1.185
anim+1.116
velt+1.100
hor+1.071
Neg logit contributions
 amen-0.975
 lent-0.925
 sidel-0.911
 helic-0.910
 conject-0.907
Token like
Token ID588
Feature activation+2.41e+00

Pos logit contributions
ansas+0.659
ti+0.658
anim+0.617
velt+0.614
hor+0.593
Neg logit contributions
 amen-0.540
 lent-0.516
 sidel-0.505
 conject-0.501
selves-0.501
Token it
Token ID340
Feature activation+2.04e+00

Pos logit contributions
ti+1.865
ansas+1.864
anim+1.741
velt+1.732
hor+1.675
Neg logit contributions
 amen-1.532
 lent-1.463
 sidel-1.433
selves-1.432
 conject-1.430
Token struggles
Token ID12766
Feature activation+2.93e+00

Pos logit contributions
ti+1.467
ansas+1.448
anim+1.346
velt+1.339
hor+1.288
Neg logit contributions
 amen-1.425
 lent-1.358
 sidel-1.342
 conject-1.333
 helic-1.328
Token to
Token ID284
Feature activation-2.21e-01

Pos logit contributions
ti+2.317
ansas+2.296
anim+2.158
velt+2.125
hor+2.078
Neg logit contributions
 amen-1.791
 lent-1.716
 sidel-1.681
selves-1.679
 conject-1.676
Token keep
Token ID1394
Feature activation+2.95e+00

Pos logit contributions
 amen+0.148
 lent+0.141
 sidel+0.138
 conject+0.137
selves+0.137
Neg logit contributions
ansas-0.166
ti-0.165
anim-0.155
velt-0.154
hor-0.148
Token up
Token ID510
Feature activation+9.13e+00

Pos logit contributions
ti+2.239
ansas+2.223
anim+2.081
velt+2.064
hor+1.998
Neg logit contributions
 amen-1.927
 lent-1.829
 sidel-1.796
 conject-1.787
selves-1.782
Token with
Token ID351
Feature activation+4.40e+00

Pos logit contributions
ti+6.999
ansas+6.942
anim+6.453
velt+6.332
hor+6.236
Neg logit contributions
 amen-5.355
selves-5.029
 lent-5.018
 sidel-4.994
 conject-4.985
Token the
Token ID262
Feature activation+3.40e+00

Pos logit contributions
ti+3.502
ansas+3.477
anim+3.257
velt+3.226
hor+3.134
Neg logit contributions
 amen-2.626
 lent-2.502
selves-2.492
 conject-2.472
 sidel-2.447
Token two
Token ID734
Feature activation+4.51e+00

Pos logit contributions
ti+2.647
ansas+2.617
anim+2.456
velt+2.439
hor+2.371
Neg logit contributions
 amen-2.105
 lent-2.014
 conject-1.981
selves-1.979
 sidel-1.974
Token top
Token ID1353
Feature activation-8.86e-01

Pos logit contributions
ti+3.523
ansas+3.457
anim+3.248
velt+3.209
hor+3.134
Neg logit contributions
 amen-2.766
 lent-2.638
 conject-2.605
 sidel-2.597
selves-2.578
Token yo
Token ID27406
Feature activation-1.42e+00

Pos logit contributions
 amen+0.615
 lent+0.589
 sidel+0.578
selves+0.571
 conject+0.570
Neg logit contributions
ansas-0.641
ti-0.634
anim-0.594
velt-0.593
hor-0.568
Token
Token ID198
Feature activation+8.14e-01

Pos logit contributions
ti+1.664
ansas+1.663
anim+1.562
velt+1.552
hor+1.503
Neg logit contributions
 amen-1.137
 lent-1.083
 sidel-1.061
 conject-1.051
 helic-1.050
Token
Token ID198
Feature activation+8.00e-01

Pos logit contributions
ti+0.628
ansas+0.623
anim+0.587
velt+0.579
hor+0.564
Neg logit contributions
 amen-0.525
 lent-0.505
 helic-0.496
 sidel-0.493
 conject-0.490
TokenTrump
Token ID6170
Feature activation+1.41e+00

Pos logit contributions
ansas+0.603
ti+0.602
anim+0.564
velt+0.559
hor+0.541
Neg logit contributions
 amen-0.529
 lent-0.507
 sidel-0.496
 conject-0.493
 helic-0.491
Token had
Token ID550
Feature activation+1.33e+00

Pos logit contributions
ansas+1.064
ti+1.064
anim+0.996
velt+0.992
hor+0.957
Neg logit contributions
 amen-0.917
 lent-0.873
 sidel-0.861
selves-0.853
 conject-0.851
Token fallen
Token ID9292
Feature activation+4.20e+00

Pos logit contributions
ansas+1.030
ti+1.028
anim+0.965
velt+0.960
hor+0.926
Neg logit contributions
 amen-0.843
 lent-0.801
 sidel-0.789
selves-0.784
 conject-0.779
Token far
Token ID1290
Feature activation+9.11e+00

Pos logit contributions
ti+3.035
ansas+3.013
anim+2.816
velt+2.783
hor+2.725
Neg logit contributions
 amen-2.814
 lent-2.700
 conject-2.643
 sidel-2.629
selves-2.621
Token behind
Token ID2157
Feature activation+4.30e+00

Pos logit contributions
ti+5.190
ansas+5.039
anim+4.705
velt+4.622
hor+4.607
Neg logit contributions
 amen-7.026
 lent-6.852
 helic-6.839
 conject-6.800
selves-6.796
Token Clinton
Token ID2605
Feature activation+1.92e+00

Pos logit contributions
ti+3.131
ansas+3.125
anim+2.906
velt+2.889
hor+2.790
Neg logit contributions
 amen-2.875
 lent-2.758
 sidel-2.702
 conject-2.700
selves-2.693
Token by
Token ID416
Feature activation+5.74e-01

Pos logit contributions
ti+1.456
ansas+1.456
anim+1.359
velt+1.357
hor+1.306
Neg logit contributions
 amen-1.252
 lent-1.198
 sidel-1.172
 conject-1.164
 helic-1.162
Token May
Token ID1737
Feature activation-8.03e-01

Pos logit contributions
ansas+0.452
ti+0.451
anim+0.424
velt+0.420
hor+0.407
Neg logit contributions
 amen-0.357
 lent-0.340
 sidel-0.331
selves-0.331
 conject-0.330
Token,
Token ID11
Feature activation+1.38e+00

Pos logit contributions
 amen+0.484
 lent+0.461
 sidel+0.450
 conject+0.446
selves+0.445
Neg logit contributions
ansas-0.652
ti-0.650
anim-0.612
velt-0.609
hor-0.588
Token.
Token ID13
Feature activation+1.72e+00

Pos logit contributions
ansas+0.913
ti+0.911
anim+0.858
velt+0.854
hor+0.827
Neg logit contributions
 amen-0.619
 lent-0.588
 sidel-0.574
 conject-0.568
selves-0.567
Token When
Token ID1649
Feature activation-1.24e+00

Pos logit contributions
ansas+1.465
ti+1.464
anim+1.376
velt+1.364
hor+1.324
Neg logit contributions
 amen-0.973
 lent-0.925
 sidel-0.902
 conject-0.894
 helic-0.891
Token Trump
Token ID1301
Feature activation-1.02e+00

Pos logit contributions
 amen+0.762
 lent+0.729
 sidel+0.710
 conject+0.704
 helic+0.697
Neg logit contributions
ansas-0.993
ti-0.988
anim-0.932
velt-0.928
hor-0.895
Token named
Token ID3706
Feature activation-1.82e+00

Pos logit contributions
 amen+0.667
 lent+0.641
 sidel+0.623
 conject+0.620
selves+0.616
Neg logit contributions
ansas-0.771
ti-0.765
anim-0.719
velt-0.716
hor-0.688
Token his
Token ID465
Feature activation+8.76e-01

Pos logit contributions
 amen+1.248
 lent+1.179
 sidel+1.162
 conject+1.150
selves+1.146
Neg logit contributions
ansas-1.342
ti-1.332
velt-1.246
anim-1.245
hor-1.192
Token vice
Token ID7927
Feature activation+9.08e+00

Pos logit contributions
ansas+0.644
ti+0.641
anim+0.600
velt+0.597
hor+0.574
Neg logit contributions
 amen-0.596
 lent-0.570
 sidel-0.558
 conject-0.554
selves-0.553
Token presidential
Token ID4787
Feature activation+4.76e+00

Pos logit contributions
ansas+5.468
ti+5.364
anim+5.041
velt+4.938
hor+4.890
Neg logit contributions
 lent-6.815
 amen-6.808
selves-6.658
 sidel-6.623
 helic-6.574
Token pick
Token ID2298
Feature activation+4.27e+00

Pos logit contributions
ti+3.085
ansas+3.026
anim+2.826
velt+2.777
hor+2.691
Neg logit contributions
 amen-3.540
 lent-3.471
selves-3.420
 sidel-3.363
 helic-3.354
Token,
Token ID11
Feature activation+3.52e+00

Pos logit contributions
ti+3.378
ansas+3.333
anim+3.170
velt+3.101
hor+3.044
Neg logit contributions
 amen-2.579
 lent-2.506
 sidel-2.451
selves-2.445
 helic-2.434
Token Gov
Token ID10964
Feature activation+4.43e-01

Pos logit contributions
ansas+2.438
ti+2.435
anim+2.271
velt+2.239
hor+2.175
Neg logit contributions
 amen-2.480
 lent-2.410
 sidel-2.362
selves-2.354
 helic-2.338
Token.
Token ID13
Feature activation+3.93e+00

Pos logit contributions
ansas+0.376
ti+0.376
anim+0.355
velt+0.352
hor+0.342
Neg logit contributions
 amen-0.250
 lent-0.238
 sidel-0.232
 conject-0.229
selves-0.229
Tokenities
Token ID871
Feature activation+5.08e-01

Pos logit contributions
ti+2.168
ansas+2.154
anim+2.044
velt+1.981
hor+1.921
Neg logit contributions
 amen-2.225
 lent-2.129
 sidel-2.099
 helic-2.084
 conject-2.070
Token
Token ID447
Feature activation-2.29e-01

Pos logit contributions
ti+0.425
ansas+0.425
anim+0.400
velt+0.398
hor+0.385
Neg logit contributions
 amen-0.293
 lent-0.277
 sidel-0.271
 conject-0.268
selves-0.267
Token
Token ID247
Feature activation-1.21e+00

Pos logit contributions
 amen+0.135
 lent+0.128
 sidel+0.126
 conject+0.125
selves+0.124
Neg logit contributions
ti-0.188
ansas-0.187
anim-0.176
velt-0.176
hor-0.170
Token of
Token ID286
Feature activation+1.80e+00

Pos logit contributions
 amen+0.764
 lent+0.729
 sidel+0.713
 conject+0.706
selves+0.706
Neg logit contributions
ansas-0.951
ti-0.947
anim-0.890
velt-0.886
hor-0.854
Token the
Token ID262
Feature activation+4.55e+00

Pos logit contributions
ti+1.506
ansas+1.504
anim+1.415
velt+1.403
hor+1.363
Neg logit contributions
 amen-1.030
 lent-0.978
 sidel-0.951
 helic-0.943
 conject-0.942
Token bourgeois
Token ID19749
Feature activation+9.06e+00

Pos logit contributions
ti+3.517
ansas+3.463
anim+3.242
velt+3.189
hor+3.153
Neg logit contributions
 amen-2.850
 lent-2.705
 helic-2.646
 sidel-2.645
 conject-2.612
Token type
Token ID2099
Feature activation+5.43e+00

Pos logit contributions
ti+6.592
ansas+6.520
anim+6.079
velt+5.993
hor+5.869
Neg logit contributions
 amen-5.673
 lent-5.397
 sidel-5.339
 helic-5.327
 conject-5.188
Token (
Token ID357
Feature activation+4.95e-01

Pos logit contributions
ti+4.245
ansas+4.203
anim+3.987
velt+3.936
hor+3.780
Neg logit contributions
 amen-3.315
 lent-3.089
 helic-3.079
 sidel-3.076
 conject-3.054
Tokenl
Token ID75
Feature activation+1.22e+00

Pos logit contributions
ti+0.362
ansas+0.362
anim+0.339
velt+0.335
hor+0.324
Neg logit contributions
 amen-0.338
 lent-0.324
 sidel-0.317
 conject-0.315
 helic-0.313
Tokenabor
Token ID4820
Feature activation+8.87e-01

Pos logit contributions
ansas+0.803
ti+0.802
anim+0.744
velt+0.737
hor+0.708
Neg logit contributions
 amen-0.925
 lent-0.889
 sidel-0.872
 conject-0.868
selves-0.863
Token unions
Token ID11936
Feature activation+5.22e-01

Pos logit contributions
ansas+0.759
ti+0.758
anim+0.714
velt+0.712
hor+0.691
Neg logit contributions
 amen-0.495
 lent-0.469
 sidel-0.457
 conject-0.453
selves-0.452
Token the
Token ID262
Feature activation-9.31e-01

Pos logit contributions
ansas+1.585
ti+1.575
anim+1.470
velt+1.460
hor+1.407
Neg logit contributions
 amen-1.489
 lent-1.418
 sidel-1.398
selves-1.390
 conject-1.388
Token rock
Token ID3881
Feature activation+1.63e+00

Pos logit contributions
 amen+0.555
 lent+0.529
 sidel+0.515
 conject+0.510
selves+0.507
Neg logit contributions
ansas-0.763
ti-0.760
anim-0.717
velt-0.713
hor-0.689
Token,
Token ID11
Feature activation+1.75e+00

Pos logit contributions
ansas+1.372
ti+1.369
anim+1.289
velt+1.280
hor+1.248
Neg logit contributions
 amen-0.937
 lent-0.878
 sidel-0.867
selves-0.855
 conject-0.852
Token hitting
Token ID9008
Feature activation+2.30e+00

Pos logit contributions
ansas+1.437
ti+1.435
anim+1.352
velt+1.346
hor+1.305
Neg logit contributions
 amen-1.025
 lent-0.968
 sidel-0.946
selves-0.945
 conject-0.936
Token different
Token ID1180
Feature activation+3.20e+00

Pos logit contributions
ansas+1.802
ti+1.792
anim+1.682
velt+1.670
hor+1.614
Neg logit contributions
 amen-1.425
 lent-1.357
selves-1.331
 sidel-1.330
 conject-1.317
Token mineral
Token ID18352
Feature activation+9.04e+00

Pos logit contributions
ansas+2.286
ti+2.276
anim+2.131
velt+2.098
hor+2.041
Neg logit contributions
 amen-2.167
 lent-2.072
selves-2.054
 sidel-2.019
 conject-2.016
Token grains
Token ID21824
Feature activation+4.83e+00

Pos logit contributions
ti+6.123
ansas+6.116
anim+5.655
hor+5.498
spe+5.481
Neg logit contributions
 amen-5.918
 lent-5.664
selves-5.617
 sidel-5.568
 conject-5.485
Token within
Token ID1626
Feature activation-5.23e-01

Pos logit contributions
ti+3.675
ansas+3.615
anim+3.414
velt+3.368
hor+3.297
Neg logit contributions
 amen-3.033
selves-2.835
 lent-2.812
 sidel-2.783
 helic-2.760
Token it
Token ID340
Feature activation-2.94e+00

Pos logit contributions
 amen+0.303
 lent+0.288
 sidel+0.280
 conject+0.278
selves+0.277
Neg logit contributions
ansas-0.438
ti-0.436
anim-0.412
velt-0.410
hor-0.396
Token.
Token ID13
Feature activation+9.74e-01

Pos logit contributions
 amen+1.605
 lent+1.542
 sidel+1.493
 conject+1.483
selves+1.482
Neg logit contributions
ansas-2.538
ti-2.521
anim-2.390
velt-2.385
hor-2.303
Token
Token ID198
Feature activation+5.24e-01

Pos logit contributions
ansas+0.838
ti+0.836
anim+0.789
velt+0.786
hor+0.761
Neg logit contributions
 amen-0.540
 lent-0.511
 sidel-0.499
 conject-0.494
selves-0.491
Tokenι
Token ID29945
Feature activation-6.17e+00

Pos logit contributions
 amen+1.863
 lent+1.784
 sidel+1.728
 conject+1.726
selves+1.720
Neg logit contributions
ansas-2.286
ti-2.239
velt-2.126
anim-2.119
inois-2.025
Token
Token ID157
Feature activation-4.08e+00

Pos logit contributions
 amen+4.552
 lent+4.356
 conject+4.244
 sidel+4.209
 helic+4.200
Neg logit contributions
ansas-4.129
ti-3.921
velt-3.830
anim-3.740
inois-3.625
Token
Token ID122
Feature activation-6.19e+00

Pos logit contributions
 amen+1.812
 lent+1.694
 sidel+1.634
 helic+1.621
selves+1.619
Neg logit contributions
ansas-3.975
ti-3.890
velt-3.730
anim-3.714
hor-3.600
Token
Token ID121
Feature activation-2.05e+00

Pos logit contributions
 amen+2.968
 lent+2.790
 conject+2.663
 helic+2.643
 sidel+2.629
Neg logit contributions
ansas-5.805
ti-5.531
velt-5.426
anim-5.308
inois-5.209
Token
Token ID28053
Feature activation-2.11e+00

Pos logit contributions
 amen+1.399
 lent+1.339
 sidel+1.307
 conject+1.301
selves+1.290
Neg logit contributions
ansas-1.508
ti-1.484
velt-1.399
anim-1.396
hor-1.332
Token
Token ID120
Feature activation-1.18e+01

Pos logit contributions
 amen+1.150
 lent+1.089
 sidel+1.062
selves+1.053
 conject+1.051
Neg logit contributions
ansas-1.836
ti-1.815
anim-1.724
velt-1.714
hor-1.654
Token
Token ID223
Feature activation-2.89e+00

Pos logit contributions
 amen+5.426
 lent+5.147
 conject+4.910
selves+4.784
 sidel+4.756
Neg logit contributions
ansas-10.570
ti-9.920
velt-9.827
anim-9.602
inois-9.550
Tokenρ
Token ID33643
Feature activation-2.84e+00

Pos logit contributions
 amen+2.012
 lent+1.932
 sidel+1.882
 conject+1.871
selves+1.863
Neg logit contributions
ansas-2.094
ti-2.043
velt-1.931
anim-1.925
inois-1.848
Tokenπ
Token ID46582
Feature activation-6.27e+00

Pos logit contributions
 amen+1.931
 lent+1.851
 sidel+1.807
 conject+1.798
selves+1.792
Neg logit contributions
ansas-2.096
ti-2.055
velt-1.941
anim-1.938
inois-1.850
Tokenα
Token ID17394
Feature activation-3.96e+00

Pos logit contributions
 amen+3.863
 lent+3.720
 conject+3.605
 helic+3.585
 sidel+3.566
Neg logit contributions
ansas-4.924
ti-4.728
velt-4.583
anim-4.514
inois-4.366
Tokenγ
Token ID42063
Feature activation-3.63e+00

Pos logit contributions
 amen+2.730
 lent+2.622
 sidel+2.544
 conject+2.541
 helic+2.517
Neg logit contributions
ansas-2.885
ti-2.807
velt-2.658
anim-2.645
inois-2.529
Token as
Token ID355
Feature activation+1.31e+00

Pos logit contributions
ti+0.669
ansas+0.667
anim+0.622
velt+0.618
hor+0.598
Neg logit contributions
 amen-0.599
 lent-0.572
 sidel-0.561
 conject-0.557
selves-0.554
Token Goldman
Token ID18740
Feature activation+2.08e+00

Pos logit contributions
ansas+0.975
ti+0.973
anim+0.912
velt+0.905
hor+0.873
Neg logit contributions
 amen-0.869
 lent-0.832
 sidel-0.813
 conject-0.809
selves-0.806
Token worked
Token ID3111
Feature activation+1.50e-01

Pos logit contributions
ti+1.473
ansas+1.471
anim+1.365
velt+1.358
hor+1.309
Neg logit contributions
 amen-1.469
 lent-1.396
 sidel-1.379
selves-1.369
 conject-1.368
Token to
Token ID284
Feature activation+1.46e+00

Pos logit contributions
ansas+0.121
ti+0.121
anim+0.114
velt+0.113
hor+0.109
Neg logit contributions
 amen-0.092
 lent-0.087
 sidel-0.085
 conject-0.085
selves-0.084
Token j
Token ID474
Feature activation-1.82e-02

Pos logit contributions
ti+1.109
ansas+1.107
velt+1.036
anim+1.036
hor+0.995
Neg logit contributions
 amen-0.949
 lent-0.901
 conject-0.881
selves-0.881
 sidel-0.880
Tokenett
Token ID3087
Feature activation-1.18e+01

Pos logit contributions
 amen+0.010
 lent+0.009
 sidel+0.009
selves+0.009
 conject+0.009
Neg logit contributions
ti-0.016
ansas-0.016
anim-0.015
velt-0.015
hor-0.015
Tokenison
Token ID1653
Feature activation+8.34e-01

Pos logit contributions
 amen+7.501
 lent+7.215
selves+7.209
 conject+7.054
 sidel+7.007
Neg logit contributions
ansas-8.091
ti-7.908
anim-7.498
velt-7.399
inois-7.277
Token the
Token ID262
Feature activation+8.08e-01

Pos logit contributions
ti+0.659
ansas+0.659
anim+0.619
velt+0.616
hor+0.596
Neg logit contributions
 amen-0.517
 lent-0.489
 sidel-0.480
 conject-0.477
selves-0.475
Token billions
Token ID13188
Feature activation-1.18e+00

Pos logit contributions
ti+0.614
ansas+0.612
anim+0.574
velt+0.571
hor+0.553
Neg logit contributions
 amen-0.524
 lent-0.500
 sidel-0.491
 conject-0.487
selves-0.486
Token in
Token ID287
Feature activation-9.21e-01

Pos logit contributions
 amen+0.684
 lent+0.652
 sidel+0.634
 conject+0.628
selves+0.626
Neg logit contributions
ansas-0.990
ti-0.985
anim-0.930
velt-0.925
hor-0.895
Token sub
Token ID850
Feature activation-7.80e-01

Pos logit contributions
 amen+0.572
 lent+0.550
 sidel+0.533
 conject+0.529
selves+0.527
Neg logit contributions
ansas-0.734
ti-0.729
anim-0.686
velt-0.683
hor-0.658
Token 1
Token ID352
Feature activation-4.94e-02

Pos logit contributions
 amen+0.170
 lent+0.162
 sidel+0.158
 conject+0.157
 helic+0.156
Neg logit contributions
ansas-0.225
ti-0.224
anim-0.210
velt-0.209
hor-0.202
Token AL
Token ID8355
Feature activation-8.61e-01

Pos logit contributions
 amen+0.025
 lent+0.023
 sidel+0.023
 conject+0.022
selves+0.022
Neg logit contributions
ansas-0.045
ti-0.045
anim-0.043
velt-0.042
hor-0.041
TokenREAD
Token ID15675
Feature activation+1.98e+00

Pos logit contributions
 amen+0.470
 lent+0.445
 sidel+0.434
 conject+0.428
selves+0.427
Neg logit contributions
ansas-0.748
ti-0.743
anim-0.706
velt-0.702
hor-0.679
TokenY
Token ID56
Feature activation-3.96e-01

Pos logit contributions
ti+1.178
ansas+1.110
anim+1.015
velt+0.980
hor+0.977
Neg logit contributions
 amen-1.653
 sidel-1.592
 lent-1.589
 conject-1.575
selves-1.560
Token COMP
Token ID24301
Feature activation-3.81e-02

Pos logit contributions
 amen+0.240
 lent+0.229
 sidel+0.224
 conject+0.222
selves+0.221
Neg logit contributions
ansas-0.320
ti-0.319
anim-0.300
velt-0.298
hor-0.289
TokenET
Token ID2767
Feature activation-1.10e+01

Pos logit contributions
 amen+0.018
 lent+0.017
 sidel+0.017
 conject+0.017
selves+0.017
Neg logit contributions
ansas-0.035
ti-0.035
anim-0.034
velt-0.033
hor-0.032
TokenED
Token ID1961
Feature activation-5.02e+00

Pos logit contributions
 amen+5.189
 lent+5.188
 TRANS+4.789
 conject+4.717
MIT+4.638
Neg logit contributions
ansas-9.569
velt-9.366
ti-9.147
inois-8.923
anim-8.890
Token:
Token ID25
Feature activation+8.72e-01

Pos logit contributions
 amen+2.750
 lent+2.614
 sidel+2.531
selves+2.496
 conject+2.475
Neg logit contributions
ansas-4.290
ti-4.262
anim-4.081
velt-4.067
inois-3.908
Token at
Token ID379
Feature activation+1.48e+00

Pos logit contributions
ansas+0.697
ti+0.694
anim+0.652
velt+0.649
hor+0.627
Neg logit contributions
 amen-0.535
 lent-0.511
 sidel-0.499
 conject-0.495
selves-0.494
Token Arizona
Token ID7943
Feature activation+2.46e+00

Pos logit contributions
ansas+1.040
ti+1.037
anim+0.965
velt+0.957
hor+0.923
Neg logit contributions
 amen-1.042
 lent-1.002
 sidel-0.982
selves-0.981
 conject-0.978
Token Cardinals
Token ID18071
Feature activation+2.52e+00

Pos logit contributions
ansas+1.982
ti+1.967
anim+1.853
velt+1.821
hor+1.771
Neg logit contributions
 amen-1.499
 lent-1.416
selves-1.393
 conject-1.390
 sidel-1.381
Token scientific
Token ID5654
Feature activation+3.74e-01

Pos logit contributions
ansas+2.107
ti+2.104
anim+1.982
velt+1.968
hor+1.895
Neg logit contributions
 amen-1.702
 lent-1.632
 sidel-1.596
selves-1.583
 conject-1.567
Token community
Token ID2055
Feature activation+1.00e+00

Pos logit contributions
ansas+0.278
ti+0.276
anim+0.260
velt+0.259
hor+0.248
Neg logit contributions
 amen-0.246
 lent-0.237
 sidel-0.233
selves-0.231
 helic-0.229
Token,
Token ID11
Feature activation+1.89e+00

Pos logit contributions
ansas+0.908
ti+0.905
anim+0.857
velt+0.854
hor+0.829
Neg logit contributions
 amen-0.511
 lent-0.482
 sidel-0.467
 conject-0.463
selves-0.460
Token this
Token ID428
Feature activation-1.85e-01

Pos logit contributions
ansas+1.688
ti+1.681
anim+1.594
velt+1.588
hor+1.536
Neg logit contributions
 amen-0.981
 lent-0.926
 sidel-0.901
selves-0.894
 conject-0.892
Token was
Token ID373
Feature activation+2.37e+00

Pos logit contributions
 amen+0.133
 lent+0.128
 sidel+0.125
 conject+0.124
 helic+0.123
Neg logit contributions
ansas-0.128
ti-0.128
anim-0.119
velt-0.118
hor-0.114
Token NASA
Token ID8884
Feature activation-1.08e+01

Pos logit contributions
ansas+2.291
ti+2.286
anim+2.182
velt+2.167
hor+2.114
Neg logit contributions
 amen-1.023
 lent-0.958
 sidel-0.931
selves-0.929
 helic-0.916
Token
Token ID447
Feature activation-2.22e+00

Pos logit contributions
 amen+5.497
 lent+5.072
 conject+5.043
selves+4.963
 sidel+4.860
Neg logit contributions
ansas-8.906
ti-8.677
velt-8.330
anim-8.195
inois-8.125
Token
Token ID247
Feature activation+1.14e+00

Pos logit contributions
 amen+1.035
 lent+0.980
 sidel+0.949
 conject+0.942
 helic+0.931
Neg logit contributions
ti-2.108
ansas-2.095
anim-2.000
velt-1.987
hor-1.930
Tokens
Token ID82
Feature activation-1.66e+00

Pos logit contributions
ti+0.737
ansas+0.720
anim+0.677
velt+0.660
hor+0.644
Neg logit contributions
 amen-0.887
 lent-0.852
 helic-0.839
 sidel-0.835
 conject-0.830
Token seventh
Token ID14024
Feature activation-2.28e+00

Pos logit contributions
 amen+0.725
 lent+0.678
 sidel+0.655
 conject+0.648
selves+0.645
Neg logit contributions
ansas-1.619
ti-1.612
anim-1.535
velt-1.529
hor-1.485
Token consecutive
Token ID12785
Feature activation+2.52e+00

Pos logit contributions
 amen+2.028
 lent+1.960
 sidel+1.933
 conject+1.921
selves+1.917
Neg logit contributions
ansas-1.198
ti-1.186
anim-1.079
velt-1.072
hor-1.016
Token
Token ID198
Feature activation-5.66e+00

Pos logit contributions
 amen+3.375
 lent+3.138
selves+3.112
 sidel+3.037
 conject+3.037
Neg logit contributions
ansas-6.225
ti-6.006
velt-5.871
anim-5.750
inois-5.737
Tokenlarg
Token ID15521
Feature activation-7.74e+00

Pos logit contributions
 amen+2.627
 lent+2.379
 sidel+2.361
 conject+2.317
selves+2.314
Neg logit contributions
ansas-5.251
ti-5.094
velt-4.956
inois-4.842
anim-4.776
Tokener
Token ID263
Feature activation-6.58e+00

Pos logit contributions
 amen+3.562
 lent+3.396
selves+3.302
 conject+3.278
 sidel+3.209
Neg logit contributions
ansas-7.170
ti-7.010
velt-6.663
anim-6.612
inois-6.421
Token image
Token ID2939
Feature activation-5.35e+00

Pos logit contributions
 amen+3.826
 lent+3.620
 sidel+3.558
selves+3.519
 conject+3.489
Neg logit contributions
ansas-5.354
ti-5.330
velt-5.081
anim-4.980
inois-4.831
Token T
Token ID309
Feature activation-2.72e+00

Pos logit contributions
 amen+2.774
 lent+2.583
 conject+2.515
 sidel+2.513
selves+2.503
Neg logit contributions
ansas-4.758
ti-4.681
anim-4.500
velt-4.494
inois-4.313
TokenIFF
Token ID29267
Feature activation-1.03e+01

Pos logit contributions
 amen+1.468
 lent+1.386
 sidel+1.346
 conject+1.342
selves+1.320
Neg logit contributions
ansas-2.363
ti-2.342
velt-2.221
anim-2.206
hor-2.158
Token original
Token ID2656
Feature activation-3.52e+00

Pos logit contributions
 amen+4.943
 lent+4.586
 conject+4.455
 sidel+4.304
selves+4.294
Neg logit contributions
ansas-9.648
ti-9.427
velt-9.154
anim-8.694
ptin-8.394
Token image
Token ID2939
Feature activation-3.84e+00

Pos logit contributions
 amen+2.337
 lent+2.231
 sidel+2.185
 conject+2.163
selves+2.159
Neg logit contributions
ansas-2.641
ti-2.631
anim-2.461
velt-2.454
hor-2.360
Token Download
Token ID10472
Feature activation-5.80e+00

Pos logit contributions
 amen+2.083
 lent+1.970
selves+1.915
 sidel+1.903
 conject+1.901
Neg logit contributions
ansas-3.343
ti-3.327
velt-3.144
anim-3.140
hor-3.031
Token:
Token ID25
Feature activation-5.62e+00

Pos logit contributions
 amen+2.411
 lent+2.166
selves+2.114
 conject+2.083
 sidel+2.082
Neg logit contributions
ti-5.802
ansas-5.758
velt-5.662
anim-5.374
hor-5.354
Token Figure
Token ID11291
Feature activation-1.04e+00

Pos logit contributions
 amen+3.406
 lent+3.169
 conject+3.092
 helic+3.035
 sidel+3.023
Neg logit contributions
ansas-4.599
ti-4.441
velt-4.313
anim-4.220
inois-4.142
Token Sam
Token ID3409
Feature activation+4.53e-01

Pos logit contributions
ti+0.939
ansas+0.936
anim+0.883
velt+0.875
hor+0.846
Neg logit contributions
 amen-0.751
 lent-0.713
 sidel-0.698
selves-0.696
 conject-0.692
Token,
Token ID11
Feature activation+2.56e+00

Pos logit contributions
ti+0.388
ansas+0.384
anim+0.363
velt+0.358
hor+0.353
Neg logit contributions
 amen-0.257
 lent-0.239
 sidel-0.235
selves-0.235
 conject-0.234
Token Merry
Token ID39001
Feature activation-1.05e-01

Pos logit contributions
ti+1.987
ansas+1.971
anim+1.869
velt+1.848
hor+1.790
Neg logit contributions
 amen-1.615
 lent-1.529
 sidel-1.507
selves-1.491
 conject-1.491
Token,
Token ID11
Feature activation+2.15e+00

Pos logit contributions
 amen+0.056
 lent+0.053
 sidel+0.052
 conject+0.051
selves+0.051
Neg logit contributions
ansas-0.092
ti-0.092
anim-0.087
velt-0.087
hor-0.084
Token P
Token ID350
Feature activation+5.00e-01

Pos logit contributions
ti+1.655
ansas+1.654
anim+1.560
velt+1.545
hor+1.488
Neg logit contributions
 amen-1.358
 lent-1.287
 sidel-1.269
selves-1.267
 helic-1.255
Tokenipp
Token ID3974
Feature activation-1.01e+01

Pos logit contributions
ansas+0.375
ti+0.374
anim+0.350
velt+0.348
hor+0.336
Neg logit contributions
 amen-0.332
 lent-0.318
 sidel-0.311
 conject-0.309
selves-0.308
Tokenin
Token ID259
Feature activation+2.77e+00

Pos logit contributions
 amen+6.847
 lent+6.727
 conject+6.303
 sidel+6.184
 helic+6.051
Neg logit contributions
ansas-6.836
ti-6.627
velt-6.601
anim-5.906
Room-5.820
Token,
Token ID11
Feature activation+2.00e+00

Pos logit contributions
ti+2.368
ansas+2.353
anim+2.239
velt+2.227
hor+2.167
Neg logit contributions
 amen-1.532
 lent-1.439
 sidel-1.414
selves-1.401
 conject-1.400
Token Ar
Token ID943
Feature activation-3.17e-01

Pos logit contributions
ti+1.574
ansas+1.564
anim+1.482
velt+1.465
hor+1.418
Neg logit contributions
 amen-1.242
 lent-1.176
 sidel-1.157
selves-1.151
 conject-1.146
Tokenag
Token ID363
Feature activation-2.82e+00

Pos logit contributions
 amen+0.227
 lent+0.218
 sidel+0.214
 conject+0.212
selves+0.212
Neg logit contributions
ansas-0.222
ti-0.221
anim-0.206
velt-0.205
hor-0.197
Tokenorn
Token ID1211
Feature activation-5.87e-01

Pos logit contributions
 amen+1.490
 lent+1.466
 conject+1.408
 sidel+1.400
selves+1.364
Neg logit contributions
ansas-2.414
ti-2.339
velt-2.297
inois-2.234
anim-2.234
Token (
Token ID357
Feature activation-2.72e-01

Pos logit contributions
 amen+0.373
 lent+0.353
 sidel+0.336
 conject+0.334
selves+0.332
Neg logit contributions
ansas-0.888
ti-0.879
velt-0.843
anim-0.840
hor-0.818
TokenS
Token ID50
Feature activation+3.75e-01

Pos logit contributions
 amen+0.219
 lent+0.211
 sidel+0.208
 conject+0.206
selves+0.206
Neg logit contributions
ansas-0.166
ti-0.165
anim-0.152
velt-0.152
hor-0.144
Token4
Token ID19
Feature activation-7.73e-01

Pos logit contributions
ansas+0.254
ti+0.253
anim+0.235
velt+0.234
hor+0.224
Neg logit contributions
 amen-0.277
 lent-0.267
 sidel-0.262
selves-0.260
 conject-0.260
Token)
Token ID8
Feature activation+1.08e+00

Pos logit contributions
 amen+0.362
 lent+0.341
 sidel+0.330
 conject+0.326
selves+0.324
Neg logit contributions
ansas-0.731
ti-0.726
anim-0.691
velt-0.690
hor-0.668
Token DOI
Token ID40722
Feature activation-7.70e+00

Pos logit contributions
ansas+0.863
ti+0.861
anim+0.810
velt+0.804
hor+0.777
Neg logit contributions
 amen-0.657
 lent-0.625
 sidel-0.612
 conject-0.605
selves-0.605
Token:
Token ID25
Feature activation-9.84e+00

Pos logit contributions
 amen+3.383
 lent+3.197
 sidel+2.986
 conject+2.977
 helic+2.911
Neg logit contributions
ansas-7.348
ti-7.168
velt-7.005
anim-6.809
inois-6.736
Token 10
Token ID838
Feature activation-4.61e+00

Pos logit contributions
 amen+4.721
 lent+4.578
 conject+4.385
 helic+4.361
 sidel+4.356
Neg logit contributions
ansas-8.645
ti-8.457
velt-8.254
anim-8.072
inois-7.934
Token.
Token ID13
Feature activation-2.53e+00

Pos logit contributions
 amen+2.331
 lent+2.196
 sidel+2.125
selves+2.124
 conject+2.112
Neg logit contributions
ansas-4.148
ti-4.072
velt-3.917
anim-3.880
inois-3.777
Token10
Token ID940
Feature activation-3.32e+00

Pos logit contributions
 amen+1.710
 lent+1.640
 sidel+1.599
selves+1.599
 conject+1.590
Neg logit contributions
ansas-1.844
ti-1.825
velt-1.714
anim-1.709
inois-1.643
Token86
Token ID4521
Feature activation-2.57e+00

Pos logit contributions
 amen+2.800
 lent+2.720
 sidel+2.643
selves+2.631
 conject+2.618
Neg logit contributions
ansas-1.892
ti-1.833
velt-1.732
anim-1.683
inois-1.616
Token/
Token ID14
Feature activation-3.33e+00

Pos logit contributions
 amen+1.581
 lent+1.502
 sidel+1.466
selves+1.464
 conject+1.455
Neg logit contributions
ansas-2.049
ti-2.025
anim-1.911
velt-1.909
inois-1.844
Token achievements
Token ID16970
Feature activation+4.11e+00

Pos logit contributions
 amen+0.709
 lent+0.673
 sidel+0.655
selves+0.647
 conject+0.646
Neg logit contributions
ansas-1.095
ti-1.087
velt-1.027
anim-1.027
hor-0.991
Token.
Token ID13
Feature activation-1.03e+00

Pos logit contributions
ti+3.325
ansas+3.287
anim+3.121
velt+3.079
hor+3.007
Neg logit contributions
 amen-2.439
 lent-2.287
 sidel-2.280
 helic-2.240
selves-2.238
Token You
Token ID921
Feature activation+1.49e+00

Pos logit contributions
 amen+0.538
 lent+0.508
 sidel+0.493
selves+0.490
 conject+0.488
Neg logit contributions
ansas-0.919
ti-0.915
anim-0.867
velt-0.865
hor-0.837
Token get
Token ID651
Feature activation+1.51e-01

Pos logit contributions
ti+1.129
ansas+1.125
anim+1.054
velt+1.047
hor+1.015
Neg logit contributions
 amen-0.974
 lent-0.926
 sidel-0.913
 helic-0.904
selves-0.903
Token the
Token ID262
Feature activation-1.67e+00

Pos logit contributions
ansas+0.123
ti+0.122
anim+0.115
velt+0.115
hor+0.111
Neg logit contributions
 amen-0.090
 lent-0.086
 sidel-0.084
selves-0.083
 conject-0.083
Token idea
Token ID2126
Feature activation-9.78e+00

Pos logit contributions
 amen+0.931
 lent+0.880
 sidel+0.860
 conject+0.853
selves+0.844
Neg logit contributions
ansas-1.436
ti-1.426
anim-1.347
velt-1.343
hor-1.298
Token.
Token ID13
Feature activation-3.04e+00

Pos logit contributions
 amen+4.470
 lent+4.018
 conject+3.927
 sidel+3.865
selves+3.834
Neg logit contributions
ansas-9.044
ti-8.678
velt-8.547
anim-8.236
inois-8.207
Token There
Token ID1318
Feature activation-3.45e-01

Pos logit contributions
 amen+1.564
 lent+1.468
selves+1.426
 sidel+1.416
 conject+1.403
Neg logit contributions
ansas-2.728
ti-2.703
velt-2.570
anim-2.566
inois-2.492
Token was
Token ID373
Feature activation+8.43e-01

Pos logit contributions
 amen+0.228
 lent+0.218
 sidel+0.214
 conject+0.212
selves+0.211
Neg logit contributions
ansas-0.259
ti-0.258
anim-0.242
velt-0.241
hor-0.232
Token still
Token ID991
Feature activation-8.59e-01

Pos logit contributions
ti+0.623
ansas+0.622
anim+0.582
velt+0.578
hor+0.558
Neg logit contributions
 amen-0.566
 lent-0.542
 sidel-0.531
selves-0.527
 conject-0.526
Token a
Token ID257
Feature activation-1.21e+00

Pos logit contributions
 amen+0.550
 lent+0.525
 sidel+0.514
 conject+0.510
selves+0.507
Neg logit contributions
ansas-0.667
ti-0.662
anim-0.621
velt-0.621
hor-0.596
Tokense
Token ID325
Feature activation-2.90e+00

Pos logit contributions
 amen+1.714
 lent+1.646
selves+1.614
 sidel+1.613
 conject+1.596
Neg logit contributions
ansas-1.705
ti-1.686
velt-1.576
anim-1.560
inois-1.496
Token "
Token ID366
Feature activation-4.02e+00

Pos logit contributions
 amen+1.739
 lent+1.661
selves+1.614
 sidel+1.612
 conject+1.604
Neg logit contributions
ansas-2.327
ti-2.315
velt-2.181
anim-2.178
hor-2.106
Token (
Token ID357
Feature activation-1.63e-01

Pos logit contributions
 amen+2.368
 lent+2.243
 sidel+2.207
selves+2.174
 conject+2.172
Neg logit contributions
ansas-3.291
ti-3.255
velt-3.112
anim-3.058
inois-2.954
Tokennote
Token ID11295
Feature activation-1.38e+00

Pos logit contributions
 amen+0.127
 lent+0.123
 sidel+0.121
 conject+0.120
selves+0.120
Neg logit contributions
ansas-0.103
ti-0.103
anim-0.095
velt-0.095
hor-0.090
Token the
Token ID262
Feature activation-2.80e+00

Pos logit contributions
 amen+0.678
 lent+0.638
 sidel+0.620
 conject+0.613
selves+0.612
Neg logit contributions
ansas-1.270
ti-1.262
velt-1.200
anim-1.196
hor-1.158
Token space
Token ID2272
Feature activation-9.78e+00

Pos logit contributions
 amen+1.616
 lent+1.537
 sidel+1.509
 conject+1.485
 helic+1.471
Neg logit contributions
ansas-2.358
ti-2.331
velt-2.212
anim-2.193
hor-2.107
Token)
Token ID8
Feature activation-9.86e-01

Pos logit contributions
 amen+5.645
 lent+5.367
 sidel+5.164
 conject+5.137
 helic+4.995
Neg logit contributions
ansas-7.645
ti-7.574
velt-7.276
anim-7.044
inois-6.885
Token {
Token ID1391
Feature activation-8.42e-01

Pos logit contributions
 amen+0.585
 lent+0.557
 sidel+0.543
 conject+0.538
selves+0.537
Neg logit contributions
ansas-0.810
ti-0.806
anim-0.760
velt-0.757
hor-0.731
Tokenoptions
Token ID25811
Feature activation+1.64e+00

Pos logit contributions
 amen+0.679
 lent+0.657
 sidel+0.643
selves+0.643
 conject+0.641
Neg logit contributions
ansas-0.508
ti-0.500
velt-0.462
anim-0.461
hor-0.437
Token}
Token ID92
Feature activation-2.25e+00

Pos logit contributions
ansas+1.064
ti+1.056
anim+0.977
velt+0.975
hor+0.931
Neg logit contributions
 amen-1.259
 lent-1.215
 sidel-1.192
selves-1.183
 conject-1.183
Token a
Token ID257
Feature activation-1.16e+00

Pos logit contributions
 amen+1.359
 lent+1.294
 sidel+1.263
 conject+1.256
selves+1.255
Neg logit contributions
ansas-1.824
ti-1.810
velt-1.712
anim-1.699
hor-1.636
Token scientific
Token ID5654
Feature activation-1.80e+00

Pos logit contributions
ansas+0.897
ti+0.896
anim+0.843
velt+0.837
hor+0.809
Neg logit contributions
 amen-0.719
 lent-0.688
 sidel-0.672
selves-0.668
 conject-0.664
Token community
Token ID2055
Feature activation-8.87e-01

Pos logit contributions
 amen+1.114
 lent+1.063
 sidel+1.038
 conject+1.031
selves+1.025
Neg logit contributions
ansas-1.432
ti-1.426
anim-1.340
velt-1.334
hor-1.288
Token,
Token ID11
Feature activation+7.89e-01

Pos logit contributions
 amen+0.458
 lent+0.435
 sidel+0.421
 conject+0.417
selves+0.412
Neg logit contributions
ansas-0.800
ti-0.796
anim-0.753
velt-0.749
hor-0.729
Token this
Token ID428
Feature activation-2.66e-01

Pos logit contributions
ansas+0.644
ti+0.642
anim+0.605
velt+0.602
hor+0.582
Neg logit contributions
 amen-0.471
 lent-0.449
 sidel-0.438
 conject-0.434
selves-0.433
Token was
Token ID373
Feature activation+2.71e+00

Pos logit contributions
 amen+0.174
 lent+0.166
 sidel+0.163
 conject+0.161
selves+0.161
Neg logit contributions
ansas-0.202
ti-0.201
anim-0.188
velt-0.187
hor-0.181
Token NASA
Token ID8884
Feature activation-9.74e+00

Pos logit contributions
ti+2.075
ansas+2.072
anim+1.948
velt+1.925
hor+1.873
Neg logit contributions
 amen-1.723
 lent-1.642
selves-1.610
 sidel-1.607
 helic-1.595
Token
Token ID447
Feature activation+1.30e-01

Pos logit contributions
 amen+4.418
 conject+4.085
 lent+4.051
 sidel+4.001
 helic+3.851
Neg logit contributions
ansas-8.972
ti-8.753
anim-8.464
velt-8.372
inois-8.231
Token
Token ID247
Feature activation+1.88e+00

Pos logit contributions
ti+0.113
ansas+0.111
velt+0.105
anim+0.105
hor+0.102
Neg logit contributions
 amen-0.071
 lent-0.068
 sidel-0.067
 conject-0.066
 helic-0.066
Tokens
Token ID82
Feature activation-2.36e+00

Pos logit contributions
ti+1.252
ansas+1.237
anim+1.149
velt+1.138
hor+1.101
Neg logit contributions
 amen-1.415
 lent-1.361
 sidel-1.338
 conject-1.321
 helic-1.320
Token seventh
Token ID14024
Feature activation-1.26e+00

Pos logit contributions
 amen+1.433
 lent+1.359
 sidel+1.328
 conject+1.320
selves+1.308
Neg logit contributions
ansas-1.911
ti-1.901
anim-1.792
velt-1.785
hor-1.717
Token consecutive
Token ID12785
Feature activation+2.46e+00

Pos logit contributions
 amen+0.845
 lent+0.807
 sidel+0.789
 conject+0.782
 helic+0.778
Neg logit contributions
ansas-0.938
ti-0.934
anim-0.874
velt-0.869
hor-0.835
Token Mark
Token ID2940
Feature activation+9.60e-01

Pos logit contributions
ti+1.672
ansas+1.662
anim+1.563
velt+1.559
hor+1.506
Neg logit contributions
 amen-1.414
 lent-1.348
 sidel-1.320
 conject-1.313
selves-1.310
Tokenota
Token ID4265
Feature activation-2.32e+00

Pos logit contributions
ansas+0.635
ti+0.635
anim+0.589
velt+0.583
hor+0.562
Neg logit contributions
 amen-0.722
 lent-0.694
 sidel-0.682
selves-0.677
 conject-0.676
Token,
Token ID11
Feature activation+1.81e-01

Pos logit contributions
 amen+1.221
 lent+1.162
 sidel+1.122
 conject+1.113
selves+1.098
Neg logit contributions
ansas-2.066
ti-2.036
anim-1.937
velt-1.921
hor-1.863
Token a
Token ID257
Feature activation-1.54e+00

Pos logit contributions
ansas+0.156
ti+0.155
anim+0.147
velt+0.146
hor+0.142
Neg logit contributions
 amen-0.101
 lent-0.095
 sidel-0.093
 conject-0.092
selves-0.091
Token P
Token ID350
Feature activation+1.39e+00

Pos logit contributions
 amen+0.986
 lent+0.942
 sidel+0.921
 conject+0.913
selves+0.911
Neg logit contributions
ansas-1.197
ti-1.193
anim-1.119
velt-1.115
hor-1.074
Tokenizz
Token ID6457
Feature activation-9.70e+00

Pos logit contributions
ti+1.011
ansas+1.008
anim+0.949
velt+0.934
hor+0.901
Neg logit contributions
 amen-0.947
 lent-0.911
 sidel-0.895
selves-0.891
 conject-0.885
Tokenag
Token ID363
Feature activation-2.11e+00

Pos logit contributions
 amen+5.727
 lent+5.460
 conject+5.269
 sidel+5.261
 helic+5.162
Neg logit contributions
ansas-7.359
ti-7.163
velt-7.088
anim-6.683
hor-6.507
Tokenater
Token ID729
Feature activation+4.50e-01

Pos logit contributions
 amen+1.647
 lent+1.590
 sidel+1.558
 conject+1.547
selves+1.542
Neg logit contributions
ansas-1.326
ti-1.319
anim-1.218
velt-1.213
hor-1.157
Token conspiracy
Token ID10086
Feature activation+2.86e-01

Pos logit contributions
ansas+0.377
ti+0.376
anim+0.355
velt+0.353
hor+0.341
Neg logit contributions
 amen-0.259
 lent-0.246
 sidel-0.240
 conject-0.238
selves-0.237
Token theorist
Token ID46410
Feature activation-7.43e-01

Pos logit contributions
ansas+0.287
ti+0.286
anim+0.273
velt+0.271
hor+0.264
Neg logit contributions
 amen-0.117
 lent-0.109
 sidel-0.105
selves-0.104
 conject-0.103
Token and
Token ID290
Feature activation-1.98e+00

Pos logit contributions
 amen+0.396
 lent+0.374
 sidel+0.364
 conject+0.361
selves+0.361
Neg logit contributions
ansas-0.655
ti-0.653
anim-0.618
velt-0.615
hor-0.596
Tokenasley
Token ID30705
Feature activation+5.78e-01

Pos logit contributions
ansas+0.917
ti+0.911
anim+0.852
velt+0.844
hor+0.810
Neg logit contributions
 amen-1.059
 lent-1.020
 sidel-1.004
 conject-0.994
 helic-0.992
Token.
Token ID13
Feature activation-1.38e+00

Pos logit contributions
ansas+0.497
ti+0.496
anim+0.468
velt+0.466
hor+0.451
Neg logit contributions
 amen-0.321
 lent-0.304
 sidel-0.297
 conject-0.294
selves-0.293
Token
Token ID198
Feature activation+8.19e-01

Pos logit contributions
 amen+0.761
 lent+0.722
 sidel+0.704
selves+0.698
 conject+0.694
Neg logit contributions
ansas-1.192
ti-1.187
anim-1.123
velt-1.120
hor-1.082
Token
Token ID198
Feature activation+1.68e+00

Pos logit contributions
ti+0.635
ansas+0.630
anim+0.594
velt+0.586
hor+0.571
Neg logit contributions
 amen-0.526
 lent-0.504
 helic-0.495
 sidel-0.494
 conject-0.490
TokenAt
Token ID2953
Feature activation+7.59e-01

Pos logit contributions
ti+1.291
ansas+1.282
anim+1.211
velt+1.190
hor+1.162
Neg logit contributions
 amen-1.088
 lent-1.045
 helic-1.016
 conject-1.016
 sidel-1.015
Token NASA
Token ID8884
Feature activation-9.65e+00

Pos logit contributions
ansas+0.622
ti+0.621
anim+0.586
velt+0.581
hor+0.563
Neg logit contributions
 amen-0.450
 lent-0.428
 sidel-0.418
selves-0.416
 conject-0.414
Token,
Token ID11
Feature activation-6.97e-01

Pos logit contributions
 amen+4.601
 lent+4.446
 sidel+4.184
 conject+4.151
 helic+4.103
Neg logit contributions
ansas-8.617
ti-8.427
anim-8.010
velt-7.925
inois-7.823
Token one
Token ID530
Feature activation+2.05e-01

Pos logit contributions
 amen+0.421
 lent+0.401
 sidel+0.392
 conject+0.388
selves+0.387
Neg logit contributions
ansas-0.565
ti-0.563
anim-0.530
velt-0.527
hor-0.510
Token casualty
Token ID39068
Feature activation+2.92e+00

Pos logit contributions
ansas+0.173
ti+0.173
anim+0.163
velt+0.162
hor+0.157
Neg logit contributions
 amen-0.115
 lent-0.110
selves-0.107
 sidel-0.107
 conject-0.106
Token could
Token ID714
Feature activation-7.53e-01

Pos logit contributions
ansas+2.312
ti+2.306
anim+2.165
velt+2.138
hor+2.096
Neg logit contributions
 amen-1.759
 lent-1.674
 helic-1.655
selves-1.650
 conject-1.635
Token be
Token ID307
Feature activation-6.73e-03

Pos logit contributions
 amen+0.434
 lent+0.413
 sidel+0.403
 conject+0.397
selves+0.397
Neg logit contributions
ansas-0.632
ti-0.630
anim-0.595
velt-0.592
hor-0.572
Token again
Token ID757
Feature activation-8.16e-01

Pos logit contributions
 amen+3.119
 lent+2.923
 sidel+2.858
selves+2.824
 conject+2.786
Neg logit contributions
ansas-5.817
ti-5.627
anim-5.336
velt-5.290
hor-5.250
Token that
Token ID326
Feature activation-5.17e+00

Pos logit contributions
 amen+0.392
 lent+0.372
 sidel+0.363
 helic+0.360
selves+0.357
Neg logit contributions
ti-0.758
ansas-0.754
anim-0.720
velt-0.710
hor-0.690
Token the
Token ID262
Feature activation-2.37e+00

Pos logit contributions
 amen+2.446
 lent+2.277
 sidel+2.217
 conject+2.179
selves+2.154
Neg logit contributions
ansas-4.846
ti-4.802
velt-4.569
anim-4.538
hor-4.426
Token old
Token ID1468
Feature activation-2.58e+00

Pos logit contributions
 amen+1.834
 lent+1.769
 sidel+1.739
 conject+1.721
 helic+1.708
Neg logit contributions
ansas-1.526
ti-1.510
velt-1.401
anim-1.400
hor-1.331
Token address
Token ID2209
Feature activation-2.71e+00

Pos logit contributions
 amen+1.855
 lent+1.772
 sidel+1.738
 conject+1.730
selves+1.715
Neg logit contributions
ansas-1.807
ti-1.791
anim-1.677
velt-1.664
hor-1.592
Token https
Token ID3740
Feature activation-9.64e+00

Pos logit contributions
 amen+1.780
 lent+1.689
 sidel+1.645
 conject+1.626
 helic+1.616
Neg logit contributions
ansas-2.044
ti-2.023
anim-1.905
velt-1.902
inois-1.823
Token://
Token ID1378
Feature activation-5.60e+00

Pos logit contributions
 amen+2.239
selves+1.998
 lent+1.875
 sidel+1.785
 conject+1.697
Neg logit contributions
ansas-10.807
ti-10.501
anim-10.286
velt-10.276
inois-10.197
Token.
Token ID764
Feature activation-4.51e+00

Pos logit contributions
 amen+3.344
 lent+3.118
 sidel+3.053
 conject+3.036
selves+3.011
Neg logit contributions
ansas-4.505
ti-4.435
velt-4.239
anim-4.197
inois-4.057
Tokennice
Token ID44460
Feature activation-3.63e+00

Pos logit contributions
 amen+2.444
 lent+2.252
selves+2.247
 conject+2.189
 sidel+2.184
Neg logit contributions
ansas-3.906
ti-3.796
anim-3.625
velt-3.624
inois-3.540
Tokenhash
Token ID17831
Feature activation+4.42e+00

Pos logit contributions
 amen+2.813
 lent+2.695
selves+2.680
 conject+2.651
 sidel+2.650
Neg logit contributions
ansas-2.265
ti-2.196
velt-2.054
anim-2.054
inois-1.978
Token.
Token ID13
Feature activation+1.09e+00

Pos logit contributions
ti+3.414
ansas+3.341
anim+3.167
velt+3.102
hor+3.024
Neg logit contributions
 amen-2.821
 lent-2.741
 helic-2.724
 sidel-2.681
 conject-2.644
Token
Token ID198
Feature activation-2.56e+00

Pos logit contributions
 amen+1.682
 lent+1.597
 sidel+1.549
selves+1.549
 conject+1.541
Neg logit contributions
ansas-2.450
ti-2.415
velt-2.292
anim-2.288
inois-2.211
TokenPower
Token ID13434
Feature activation+1.37e+00

Pos logit contributions
 amen+2.330
 lent+2.268
 sidel+2.226
selves+2.216
 conject+2.206
Neg logit contributions
ansas-1.246
ti-1.221
velt-1.114
anim-1.100
inois-1.044
TokenPoint
Token ID12727
Feature activation-1.09e+00

Pos logit contributions
ansas+0.706
ti+0.700
anim+0.640
velt+0.633
hor+0.600
Neg logit contributions
 amen-1.220
 lent-1.182
 sidel-1.167
selves-1.165
 helic-1.155
Token slide
Token ID10649
Feature activation-4.39e+00

Pos logit contributions
 amen+0.854
 lent+0.822
 sidel+0.806
 conject+0.802
selves+0.797
Neg logit contributions
ansas-0.690
ti-0.687
velt-0.635
anim-0.634
inois-0.601
Token PNG
Token ID36182
Feature activation-4.29e+00

Pos logit contributions
 amen+1.797
 lent+1.744
 helic+1.633
 conject+1.630
 sidel+1.591
Neg logit contributions
ansas-4.308
ti-4.213
velt-4.085
anim-4.024
inois-3.888
Token larger
Token ID4025
Feature activation-9.62e+00

Pos logit contributions
 amen+1.845
 lent+1.765
 conject+1.670
 helic+1.659
 sidel+1.635
Neg logit contributions
ansas-4.191
ti-4.148
velt-3.959
anim-3.913
hor-3.824
Token image
Token ID2939
Feature activation-6.94e+00

Pos logit contributions
 amen+4.760
 lent+4.625
 sidel+4.364
 conject+4.296
selves+4.268
Neg logit contributions
ti-8.457
ansas-8.398
velt-8.173
anim-7.769
inois-7.539
Token
Token ID198
Feature activation-6.91e+00

Pos logit contributions
 amen+3.522
 lent+3.269
 conject+3.168
selves+3.152
 sidel+3.137
Neg logit contributions
ansas-6.218
ti-6.045
velt-5.822
anim-5.792
inois-5.642
Token
Token ID198
Feature activation-5.66e+00

Pos logit contributions
 amen+3.375
 lent+3.138
selves+3.112
 sidel+3.037
 conject+3.037
Neg logit contributions
ansas-6.225
ti-6.006
velt-5.871
anim-5.750
inois-5.737
Tokenlarg
Token ID15521
Feature activation-7.74e+00

Pos logit contributions
 amen+2.627
 lent+2.379
 sidel+2.361
 conject+2.317
selves+2.314
Neg logit contributions
ansas-5.251
ti-5.094
velt-4.956
inois-4.842
anim-4.776
Tokener
Token ID263
Feature activation-6.58e+00

Pos logit contributions
 amen+3.562
 lent+3.396
selves+3.302
 conject+3.278
 sidel+3.209
Neg logit contributions
ansas-7.170
ti-7.010
velt-6.663
anim-6.612
inois-6.421
Token summer
Token ID3931
Feature activation+7.05e-01

Pos logit contributions
 amen+0.003
 lent+0.003
 sidel+0.003
 conject+0.003
selves+0.003
Neg logit contributions
ansas-0.003
ti-0.003
anim-0.003
velt-0.003
hor-0.003
Token,
Token ID11
Feature activation+1.58e-01

Pos logit contributions
ansas+0.630
ti+0.627
anim+0.595
velt+0.592
hor+0.573
Neg logit contributions
 amen-0.367
 lent-0.348
 sidel-0.337
 conject-0.333
selves-0.331
Token New
Token ID968
Feature activation+3.42e-01

Pos logit contributions
ansas+0.125
ti+0.124
anim+0.117
velt+0.116
hor+0.112
Neg logit contributions
 amen-0.099
 lent-0.094
 sidel-0.092
 conject-0.091
selves-0.091
Token Castle
Token ID11312
Feature activation+1.98e+00

Pos logit contributions
ansas+0.183
ti+0.182
anim+0.166
velt+0.165
hor+0.156
Neg logit contributions
 amen-0.300
 lent-0.291
 sidel-0.285
 conject-0.284
selves-0.284
Token's
Token ID338
Feature activation-1.51e+00

Pos logit contributions
ansas+1.581
ti+1.580
anim+1.486
velt+1.476
hor+1.427
Neg logit contributions
 amen-1.196
 lent-1.139
 sidel-1.120
selves-1.117
 helic-1.105
Token waterfront
Token ID40250
Feature activation-9.59e+00

Pos logit contributions
 amen+0.930
 lent+0.885
 sidel+0.864
 conject+0.856
selves+0.852
Neg logit contributions
ansas-1.215
ti-1.210
anim-1.137
velt-1.133
hor-1.093
Token at
Token ID379
Feature activation-1.50e+00

Pos logit contributions
 amen+6.126
 lent+5.715
 sidel+5.442
 conject+5.402
 helic+5.286
Neg logit contributions
ansas-7.356
ti-7.271
velt-6.829
anim-6.810
inois-6.664
Token Battery
Token ID23490
Feature activation-6.13e-01

Pos logit contributions
 amen+0.941
 lent+0.898
 sidel+0.876
 conject+0.869
selves+0.865
Neg logit contributions
ansas-1.177
ti-1.173
anim-1.102
velt-1.097
hor-1.059
Token Park
Token ID3250
Feature activation-1.73e+00

Pos logit contributions
 amen+0.338
 lent+0.320
 sidel+0.315
selves+0.313
 conject+0.310
Neg logit contributions
ansas-0.526
ti-0.526
velt-0.495
anim-0.495
hor-0.478
Token remains
Token ID3793
Feature activation-4.88e-01

Pos logit contributions
 amen+1.133
 lent+1.087
 sidel+1.056
 conject+1.049
selves+1.040
Neg logit contributions
ansas-1.325
ti-1.318
anim-1.235
velt-1.229
hor-1.183
Token the
Token ID262
Feature activation-7.50e-01

Pos logit contributions
 amen+0.300
 lent+0.287
 sidel+0.283
selves+0.281
 conject+0.279
Neg logit contributions
ansas-0.387
ti-0.386
anim-0.364
velt-0.360
hor-0.348
Tokenc
Token ID66
Feature activation-4.25e+00

Pos logit contributions
 amen+2.917
 lent+2.772
 sidel+2.693
 conject+2.685
 helic+2.654
Neg logit contributions
ansas-2.792
ti-2.765
velt-2.641
anim-2.614
inois-2.518
Tokenps
Token ID862
Feature activation-1.80e+00

Pos logit contributions
 amen+2.955
 lent+2.831
 conject+2.757
 sidel+2.755
selves+2.730
Neg logit contributions
ti-2.985
ansas-2.982
anim-2.817
velt-2.814
inois-2.713
Tokenj
Token ID73
Feature activation-4.40e+00

Pos logit contributions
 amen+1.628
 lent+1.576
 sidel+1.552
 conject+1.542
selves+1.539
Neg logit contributions
ti-0.917
ansas-0.913
anim-0.824
velt-0.815
hor-0.776
Token.
Token ID13
Feature activation-2.27e+00

Pos logit contributions
 amen+2.123
 lent+1.996
 sidel+1.933
selves+1.918
 conject+1.905
Neg logit contributions
ansas-3.990
ti-3.894
velt-3.794
anim-3.766
inois-3.642
Tokencom
Token ID785
Feature activation-7.66e+00

Pos logit contributions
 amen+1.929
 lent+1.858
 sidel+1.826
selves+1.820
 conject+1.816
Neg logit contributions
ansas-1.274
ti-1.254
velt-1.152
anim-1.151
hor-1.085
Token/
Token ID14
Feature activation-9.55e+00

Pos logit contributions
 amen+3.853
 lent+3.567
 sidel+3.487
selves+3.473
 conject+3.411
Neg logit contributions
ansas-6.644
ti-6.421
velt-6.267
anim-6.225
inois-6.191
Token1
Token ID16
Feature activation-3.72e+00

Pos logit contributions
 amen+6.040
 lent+5.596
 sidel+5.531
 conject+5.378
selves+5.332
Neg logit contributions
ansas-7.068
ti-6.611
velt-6.544
anim-6.414
inois-6.343
TokenBP
Token ID20866
Feature activation-5.09e+00

Pos logit contributions
 amen+2.664
 lent+2.540
 sidel+2.489
 conject+2.469
selves+2.467
Neg logit contributions
ansas-2.566
ti-2.471
velt-2.348
anim-2.334
inois-2.247
Tokeng
Token ID70
Feature activation-2.70e+00

Pos logit contributions
 amen+3.822
 lent+3.635
 sidel+3.566
selves+3.544
 conject+3.536
Neg logit contributions
ansas-3.257
ti-3.093
velt-2.975
anim-2.932
inois-2.831
Token8
Token ID23
Feature activation-1.14e+00

Pos logit contributions
 amen+1.985
 lent+1.901
 sidel+1.864
selves+1.848
 conject+1.846
Neg logit contributions
ansas-1.814
ti-1.774
anim-1.664
velt-1.663
inois-1.589
Tokensm
Token ID5796
Feature activation-2.28e+00

Pos logit contributions
 amen+0.826
 lent+0.792
 sidel+0.776
 conject+0.770
selves+0.770
Neg logit contributions
ansas-0.780
ti-0.770
anim-0.720
velt-0.718
hor-0.685
Token both
Token ID1111
Feature activation-6.07e-01

Pos logit contributions
ti+1.865
ansas+1.860
anim+1.758
velt+1.736
hor+1.676
Neg logit contributions
 amen-1.487
 lent-1.403
selves-1.377
 sidel-1.374
 helic-1.372
Token died
Token ID3724
Feature activation+3.43e-01

Pos logit contributions
 amen+0.387
 lent+0.370
 sidel+0.362
 conject+0.359
selves+0.358
Neg logit contributions
ansas-0.471
ti-0.470
anim-0.441
velt-0.438
hor-0.423
Token of
Token ID286
Feature activation-9.18e-01

Pos logit contributions
ti+0.281
ansas+0.281
anim+0.265
velt+0.261
hor+0.255
Neg logit contributions
 amen-0.203
 lent-0.192
 sidel-0.186
selves-0.186
 helic-0.185
Token cerebral
Token ID31169
Feature activation-3.41e+00

Pos logit contributions
 amen+0.519
 lent+0.493
 sidel+0.480
 conject+0.476
selves+0.474
Neg logit contributions
ansas-0.782
ti-0.778
anim-0.735
velt-0.735
hor-0.707
Token ha
Token ID387
Feature activation+2.17e-01

Pos logit contributions
 amen+1.705
 lent+1.601
 sidel+1.542
 conject+1.522
 helic+1.517
Neg logit contributions
ansas-3.147
ti-3.127
velt-2.996
anim-2.971
hor-2.871
Tokenem
Token ID368
Feature activation-9.47e+00

Pos logit contributions
ansas+0.178
ti+0.178
anim+0.167
velt+0.166
hor+0.161
Neg logit contributions
 amen-0.129
 lent-0.123
 sidel-0.120
 conject-0.118
selves-0.118
Tokenor
Token ID273
Feature activation-3.00e+00

Pos logit contributions
 amen+4.845
 lent+4.356
selves+4.145
 conject+4.125
 sidel+4.082
Neg logit contributions
ansas-8.296
ti-8.132
velt-7.874
inois-7.639
anim-7.586
Tokenrh
Token ID17179
Feature activation-3.56e+00

Pos logit contributions
 amen+2.532
 lent+2.443
 sidel+2.407
 conject+2.385
selves+2.381
Neg logit contributions
ansas-1.703
ti-1.688
velt-1.555
anim-1.547
inois-1.461
Tokenage
Token ID496
Feature activation-3.38e+00

Pos logit contributions
 amen+3.263
 lent+3.163
selves+3.157
 sidel+3.122
 conject+3.122
Neg logit contributions
ansas-1.690
ti-1.649
velt-1.558
anim-1.449
hor-1.404
Token,"
Token ID553
Feature activation-3.10e-01

Pos logit contributions
 amen+1.684
 lent+1.613
 sidel+1.562
 conject+1.545
selves+1.510
Neg logit contributions
ansas-3.068
ti-3.040
velt-2.907
anim-2.899
hor-2.801
Token said
Token ID531
Feature activation-6.93e-01

Pos logit contributions
 amen+0.190
 lent+0.181
 sidel+0.177
 conject+0.175
selves+0.174
Neg logit contributions
ansas-0.249
ti-0.248
anim-0.233
velt-0.232
hor-0.224
Token as
Token ID355
Feature activation+6.64e-01

Pos logit contributions
 amen+1.239
 lent+1.183
 sidel+1.152
selves+1.144
 conject+1.141
Neg logit contributions
ansas-1.696
ti-1.684
anim-1.587
velt-1.584
hor-1.526
Token school
Token ID1524
Feature activation-8.44e-01

Pos logit contributions
ansas+0.534
ti+0.533
anim+0.502
velt+0.498
hor+0.484
Neg logit contributions
 amen-0.402
 lent-0.383
 sidel-0.375
 conject-0.371
selves-0.370
Tokenboys
Token ID13202
Feature activation-2.54e+00

Pos logit contributions
 amen+0.683
 lent+0.659
 sidel+0.648
 conject+0.643
selves+0.642
Neg logit contributions
ansas-0.511
ti-0.508
anim-0.468
velt-0.465
hor-0.444
Token on
Token ID319
Feature activation+6.08e-01

Pos logit contributions
 amen+1.462
 lent+1.395
 sidel+1.352
selves+1.341
 conject+1.341
Neg logit contributions
ansas-2.133
ti-2.121
anim-2.006
velt-1.993
hor-1.929
Token a
Token ID257
Feature activation-2.14e+00

Pos logit contributions
ti+0.504
ansas+0.502
anim+0.474
velt+0.468
hor+0.457
Neg logit contributions
 amen-0.352
 lent-0.332
 sidel-0.328
 conject-0.324
selves-0.322
Token Thursday
Token ID3635
Feature activation-9.43e+00

Pos logit contributions
 amen+1.371
 lent+1.309
 sidel+1.281
 conject+1.271
selves+1.268
Neg logit contributions
ansas-1.650
ti-1.645
anim-1.544
velt-1.535
hor-1.482
Token night
Token ID1755
Feature activation-5.80e+00

Pos logit contributions
 amen+5.109
 lent+4.955
 sidel+4.702
 conject+4.592
selves+4.592
Neg logit contributions
ansas-7.890
ti-7.764
velt-7.462
anim-7.430
inois-7.079
Token.
Token ID13
Feature activation-2.16e+00

Pos logit contributions
 amen+3.128
 lent+2.988
 sidel+2.878
selves+2.851
 conject+2.819
Neg logit contributions
ansas-5.009
ti-4.953
anim-4.741
velt-4.728
inois-4.538
Token
Token ID198
Feature activation+7.80e-01

Pos logit contributions
 amen+1.162
 lent+1.097
selves+1.066
 sidel+1.052
 conject+1.048
Neg logit contributions
ansas-1.891
ti-1.877
anim-1.789
velt-1.775
inois-1.729
Token
Token ID198
Feature activation+1.07e+00

Pos logit contributions
ti+0.606
ansas+0.601
anim+0.566
velt+0.559
hor+0.544
Neg logit contributions
 amen-0.501
 lent-0.480
 helic-0.472
 sidel-0.470
 conject-0.466
TokenF
Token ID37
Feature activation-1.71e+00

Pos logit contributions
ti+0.838
ansas+0.835
anim+0.787
velt+0.778
hor+0.757
Neg logit contributions
 amen-0.678
 lent-0.650
 sidel-0.636
 helic-0.633
 conject-0.631
Token THE
Token ID3336
Feature activation-1.03e+00

Pos logit contributions
 amen+0.369
 lent+0.349
 sidel+0.339
 conject+0.336
selves+0.335
Neg logit contributions
ansas-0.611
ti-0.607
anim-0.576
velt-0.573
hor-0.555
Token L
Token ID406
Feature activation-2.00e+00

Pos logit contributions
 amen+0.665
 lent+0.636
 sidel+0.621
 conject+0.616
selves+0.615
Neg logit contributions
ansas-0.788
ti-0.785
anim-0.737
velt-0.733
hor-0.706
TokenAND
Token ID6981
Feature activation-2.64e+00

Pos logit contributions
 amen+1.309
 lent+1.247
 sidel+1.220
selves+1.211
 conject+1.210
Neg logit contributions
ansas-1.524
ti-1.512
velt-1.419
anim-1.418
hor-1.361
Token OF
Token ID3963
Feature activation+5.90e-01

Pos logit contributions
 amen+1.425
 lent+1.347
 sidel+1.296
selves+1.294
 conject+1.288
Neg logit contributions
ansas-2.313
ti-2.287
anim-2.185
velt-2.171
inois-2.093
Token EXT
Token ID27489
Feature activation-3.11e+00

Pos logit contributions
ti+0.490
ansas+0.489
anim+0.460
velt+0.457
hor+0.443
Neg logit contributions
 amen-0.343
 lent-0.326
 sidel-0.320
 conject-0.317
 helic-0.315
TokenREM
Token ID40726
Feature activation-9.41e+00

Pos logit contributions
 amen+2.097
 lent+2.034
 conject+1.964
 sidel+1.958
selves+1.955
Neg logit contributions
ansas-2.280
ti-2.245
velt-2.125
anim-2.118
hor-2.019
TokenELY
Token ID30943
Feature activation-8.92e-01

Pos logit contributions
 amen+3.535
 lent+3.121
selves+3.095
 sidel+2.990
 conject+2.946
Neg logit contributions
ansas-9.557
ti-9.197
velt-9.053
anim-8.868
inois-8.691
Token EXP
Token ID25703
Feature activation-4.34e+00

Pos logit contributions
 amen+0.562
 lent+0.537
 sidel+0.522
selves+0.520
 conject+0.519
Neg logit contributions
ansas-0.703
ti-0.693
anim-0.656
velt-0.655
hor-0.627
TokenENS
Token ID16938
Feature activation-5.25e+00

Pos logit contributions
 amen+1.943
 lent+1.833
selves+1.754
 sidel+1.746
 conject+1.743
Neg logit contributions
ansas-4.162
ti-4.134
velt-3.951
anim-3.936
inois-3.808
TokenIVE
Token ID9306
Feature activation-2.16e+00

Pos logit contributions
 amen+2.403
 lent+2.311
 conject+2.147
 sidel+2.125
 helic+2.117
Neg logit contributions
ansas-4.975
ti-4.814
velt-4.701
anim-4.596
inois-4.558
Token BE
Token ID9348
Feature activation-7.01e-01

Pos logit contributions
 amen+1.295
 lent+1.232
 sidel+1.194
selves+1.190
 conject+1.185
Neg logit contributions
ansas-1.763
ti-1.742
velt-1.651
anim-1.650
hor-1.583
Token 2
Token ID362
Feature activation-1.43e+00

Pos logit contributions
 amen+1.043
 lent+0.996
 sidel+0.973
 conject+0.966
selves+0.958
Neg logit contributions
ansas-1.208
ti-1.207
anim-1.132
velt-1.126
hor-1.083
Tokennd
Token ID358
Feature activation-4.21e+00

Pos logit contributions
 amen+0.880
 lent+0.838
 sidel+0.816
 conject+0.811
selves+0.807
Neg logit contributions
ansas-1.143
ti-1.139
anim-1.074
velt-1.067
hor-1.029
Token Avenue
Token ID8878
Feature activation-3.40e+00

Pos logit contributions
 amen+2.823
 lent+2.674
 conject+2.615
 sidel+2.599
 helic+2.593
Neg logit contributions
ansas-3.129
ti-3.114
anim-2.926
velt-2.920
hor-2.773
Token on
Token ID319
Feature activation-1.93e+00

Pos logit contributions
 amen+2.330
 lent+2.225
 sidel+2.176
 conject+2.154
 helic+2.139
Neg logit contributions
ansas-2.473
ti-2.465
anim-2.318
velt-2.292
hor-2.212
Token one
Token ID530
Feature activation-1.72e+00

Pos logit contributions
 amen+1.185
 lent+1.133
 sidel+1.101
 conject+1.093
selves+1.085
Neg logit contributions
ansas-1.543
ti-1.535
anim-1.449
velt-1.441
hor-1.391
Token side
Token ID1735
Feature activation-9.41e+00

Pos logit contributions
 amen+0.879
 lent+0.829
 sidel+0.806
 conject+0.797
selves+0.793
Neg logit contributions
ansas-1.549
ti-1.543
anim-1.464
velt-1.457
hor-1.412
Token of
Token ID286
Feature activation-3.33e+00

Pos logit contributions
 lent+4.803
 amen+4.752
 conject+4.425
 sidel+4.317
 helic+4.121
Neg logit contributions
ti-8.066
ansas-8.036
anim-7.783
velt-7.479
hor-7.273
Token the
Token ID262
Feature activation-1.39e+00

Pos logit contributions
 amen+1.724
 lent+1.631
 sidel+1.572
 conject+1.557
selves+1.542
Neg logit contributions
ansas-2.974
ti-2.967
anim-2.835
velt-2.796
hor-2.724
Token university
Token ID6403
Feature activation+2.25e+00

Pos logit contributions
 amen+0.893
 lent+0.853
 sidel+0.833
 conject+0.826
selves+0.822
Neg logit contributions
ansas-1.076
ti-1.072
anim-1.009
velt-1.001
hor-0.965
Token was
Token ID373
Feature activation-9.47e-01

Pos logit contributions
ansas+1.836
ti+1.826
velt+1.718
anim+1.711
hor+1.663
Neg logit contributions
 amen-1.315
selves-1.235
 lent-1.234
 sidel-1.231
 conject-1.209
Token renamed
Token ID25121
Feature activation-3.17e+00

Pos logit contributions
 amen+0.653
 lent+0.623
 sidel+0.614
selves+0.612
 conject+0.608
Neg logit contributions
ansas-0.684
ti-0.682
anim-0.636
velt-0.631
hor-0.608
Token the
Token ID262
Feature activation-5.62e-01

Pos logit contributions
ansas+0.546
ti+0.546
anim+0.512
velt+0.509
hor+0.492
Neg logit contributions
 amen-0.429
 lent-0.410
selves-0.400
 sidel-0.399
 conject-0.398
Token Saudi
Token ID7420
Feature activation+1.50e+00

Pos logit contributions
 amen+0.360
 lent+0.345
 sidel+0.337
 conject+0.334
selves+0.334
Neg logit contributions
ansas-0.435
ti-0.433
anim-0.406
velt-0.404
hor-0.390
Token version
Token ID2196
Feature activation-2.93e-01

Pos logit contributions
ansas+1.117
ti+1.106
anim+1.035
velt+1.022
hor+0.991
Neg logit contributions
 amen-1.004
 lent-0.956
selves-0.940
 sidel-0.939
 conject-0.931
Token of
Token ID286
Feature activation-2.32e+00

Pos logit contributions
 amen+0.147
 lent+0.138
 sidel+0.134
 conject+0.132
selves+0.132
Neg logit contributions
ansas-0.268
ti-0.267
anim-0.253
velt-0.252
hor-0.245
Token sh
Token ID427
Feature activation+5.11e+00

Pos logit contributions
 amen+1.436
 lent+1.365
 sidel+1.341
 conject+1.323
selves+1.312
Neg logit contributions
ansas-1.848
ti-1.839
anim-1.731
velt-1.727
hor-1.662
Tokenaria
Token ID10312
Feature activation+8.23e+00

Pos logit contributions
ti+2.883
ansas+2.762
anim+2.563
velt+2.457
hor+2.444
Neg logit contributions
 amen-4.490
 helic-4.243
 lent-4.233
selves-4.232
 sidel-4.169
Token law
Token ID1099
Feature activation+9.44e-01

Pos logit contributions
ti+5.625
ansas+5.503
anim+5.146
velt+5.085
hor+4.872
Neg logit contributions
 amen-5.637
selves-5.537
 helic-5.470
 lent-5.447
 sidel-5.299
Token.
Token ID13
Feature activation-7.47e-01

Pos logit contributions
ansas+0.803
ti+0.803
anim+0.758
velt+0.752
hor+0.728
Neg logit contributions
 amen-0.530
 lent-0.503
 sidel-0.489
selves-0.487
 conject-0.486
Token Her
Token ID2332
Feature activation+6.33e-01

Pos logit contributions
 amen+0.381
 lent+0.359
 sidel+0.349
selves+0.345
 conject+0.345
Neg logit contributions
ansas-0.676
ti-0.673
anim-0.639
velt-0.637
hor-0.617
Token criticism
Token ID7734
Feature activation+9.95e-01

Pos logit contributions
ansas+0.486
ti+0.484
anim+0.455
velt+0.452
hor+0.436
Neg logit contributions
 amen-0.408
 lent-0.391
 sidel-0.381
 conject-0.379
selves-0.378
Token was
Token ID373
Feature activation+1.69e+00

Pos logit contributions
ti+0.843
ansas+0.843
anim+0.796
velt+0.790
hor+0.767
Neg logit contributions
 amen-0.557
 lent-0.524
 sidel-0.514
selves-0.514
 conject-0.511
Token that
Token ID326
Feature activation-2.05e+00

Pos logit contributions
ti+1.692
ansas+1.689
anim+1.593
velt+1.575
hor+1.531
Neg logit contributions
 amen-1.258
 lent-1.199
selves-1.172
 sidel-1.164
 conject-1.162
Token long
Token ID890
Feature activation-3.91e+00

Pos logit contributions
 amen+1.396
 lent+1.331
 sidel+1.315
 conject+1.286
 helic+1.277
Neg logit contributions
ansas-1.511
ti-1.492
velt-1.403
anim-1.396
inois-1.342
Token jab
Token ID33896
Feature activation-4.80e-01

Pos logit contributions
 amen+2.532
 lent+2.399
 sidel+2.374
 helic+2.301
 conject+2.299
Neg logit contributions
ansas-3.004
ti-2.966
velt-2.822
anim-2.776
inois-2.684
Token to
Token ID284
Feature activation+1.72e+00

Pos logit contributions
 amen+0.300
 lent+0.287
 sidel+0.280
 conject+0.277
selves+0.277
Neg logit contributions
ansas-0.380
ti-0.377
anim-0.355
velt-0.354
hor-0.341
Token set
Token ID900
Feature activation+7.15e-01

Pos logit contributions
ti+1.313
ansas+1.312
anim+1.230
velt+1.222
hor+1.182
Neg logit contributions
 amen-1.108
 lent-1.061
 sidel-1.032
 conject-1.032
selves-1.030
Token up
Token ID510
Feature activation+6.43e+00

Pos logit contributions
ansas+0.569
ti+0.568
anim+0.534
velt+0.531
hor+0.513
Neg logit contributions
 amen-0.441
 lent-0.421
 sidel-0.411
 conject-0.408
selves-0.407
Token what
Token ID644
Feature activation-5.64e-01

Pos logit contributions
ti+4.757
ansas+4.751
anim+4.486
velt+4.344
hor+4.325
Neg logit contributions
 amen-4.038
 lent-3.876
selves-3.796
 conject-3.754
 helic-3.702
Token
Token ID447
Feature activation+1.81e-01

Pos logit contributions
 amen+0.338
 lent+0.322
 sidel+0.314
 conject+0.311
selves+0.311
Neg logit contributions
ansas-0.460
ti-0.458
anim-0.431
velt-0.429
hor-0.415
Token
Token ID247
Feature activation+1.17e+00

Pos logit contributions
ti+0.160
ansas+0.156
anim+0.149
velt+0.148
hor+0.144
Neg logit contributions
 amen-0.096
 lent-0.091
 sidel-0.090
 conject-0.090
 helic-0.088
Tokens
Token ID82
Feature activation+1.77e+00

Pos logit contributions
ti+0.811
ansas+0.801
anim+0.752
velt+0.739
hor+0.715
Neg logit contributions
 amen-0.848
 lent-0.815
 sidel-0.802
 conject-0.795
 helic-0.789
Token next
Token ID1306
Feature activation+2.57e+00

Pos logit contributions
ti+1.278
ansas+1.272
anim+1.193
velt+1.174
hor+1.141
Neg logit contributions
 amen-1.217
 lent-1.169
 sidel-1.141
selves-1.136
 helic-1.134
Token fraction
Token ID13390
Feature activation-2.45e+00

Pos logit contributions
 amen+0.903
 lent+0.866
 sidel+0.848
 conject+0.841
selves+0.837
Neg logit contributions
ansas-0.963
ti-0.959
anim-0.896
velt-0.892
hor-0.857
Token of
Token ID286
Feature activation-3.98e+00

Pos logit contributions
 amen+1.148
 lent+1.086
 sidel+1.033
selves+1.024
 conject+1.014
Neg logit contributions
ansas-2.296
ti-2.280
velt-2.179
anim-2.170
hor-2.108
Token that
Token ID326
Feature activation-2.23e+00

Pos logit contributions
 amen+1.969
 lent+1.862
 sidel+1.804
 conject+1.782
 helic+1.757
Neg logit contributions
ansas-3.662
ti-3.634
velt-3.461
anim-3.435
hor-3.346
Token time
Token ID640
Feature activation-6.59e-01

Pos logit contributions
 amen+1.370
 lent+1.313
 sidel+1.282
 conject+1.268
 helic+1.257
Neg logit contributions
ansas-1.782
ti-1.771
velt-1.665
anim-1.663
hor-1.599
Token (
Token ID357
Feature activation+1.30e+00

Pos logit contributions
 amen+0.357
 lent+0.339
 sidel+0.330
 conject+0.326
selves+0.325
Neg logit contributions
ansas-0.575
ti-0.573
anim-0.542
velt-0.540
hor-0.522
Tokench
Token ID354
Feature activation+5.80e+00

Pos logit contributions
ansas+0.935
ti+0.934
anim+0.878
velt+0.864
hor+0.837
Neg logit contributions
 amen-0.905
 lent-0.869
 sidel-0.852
 conject-0.844
 helic-0.840
Tokenick
Token ID624
Feature activation+7.44e-01

Pos logit contributions
ansas+3.710
ti+3.649
anim+3.434
velt+3.356
inois+3.250
Neg logit contributions
 amen-4.524
 helic-4.324
 sidel-4.324
 lent-4.288
 conject-4.249
Tokenens
Token ID641
Feature activation+1.19e+00

Pos logit contributions
ti+0.561
ansas+0.561
anim+0.529
velt+0.517
hor+0.504
Neg logit contributions
 amen-0.494
 lent-0.470
 sidel-0.464
 helic-0.460
 conject-0.458
Token take
Token ID1011
Feature activation-1.26e+00

Pos logit contributions
ansas+0.925
ti+0.922
anim+0.869
velt+0.861
hor+0.832
Neg logit contributions
 amen-0.753
 lent-0.710
 sidel-0.708
 conject-0.697
selves-0.694
Token three
Token ID1115
Feature activation-9.59e-01

Pos logit contributions
 amen+0.769
 lent+0.735
 sidel+0.716
 conject+0.710
selves+0.707
Neg logit contributions
ansas-1.022
ti-1.017
anim-0.956
velt-0.954
hor-0.919
Token weeks
Token ID2745
Feature activation-1.31e+00

Pos logit contributions
 amen+0.660
 lent+0.633
 sidel+0.619
 conject+0.614
selves+0.612
Neg logit contributions
ansas-0.697
ti-0.693
anim-0.647
velt-0.645
hor-0.620
Token campaign
Token ID1923
Feature activation+2.67e+00

Pos logit contributions
ti+2.908
ansas+2.893
anim+2.721
velt+2.661
hor+2.584
Neg logit contributions
 amen-2.378
 lent-2.271
 sidel-2.246
selves-2.235
 conject-2.220
Token.
Token ID13
Feature activation+7.41e-02

Pos logit contributions
ansas+2.235
ti+2.229
anim+2.099
velt+2.087
hor+2.023
Neg logit contributions
 amen-1.531
 lent-1.452
 sidel-1.424
 conject-1.413
selves-1.406
Token
Token ID198
Feature activation+2.11e-01

Pos logit contributions
ansas+0.061
ti+0.060
anim+0.057
velt+0.057
hor+0.055
Neg logit contributions
 amen-0.044
 lent-0.042
 sidel-0.041
selves-0.041
 conject-0.041
Token
Token ID198
Feature activation+2.63e-01

Pos logit contributions
ti+0.165
ansas+0.164
anim+0.154
velt+0.153
hor+0.148
Neg logit contributions
 amen-0.134
 lent-0.128
 sidel-0.125
 helic-0.125
 conject-0.124
TokenAnother
Token ID6610
Feature activation+2.60e+00

Pos logit contributions
ti+0.205
ansas+0.205
anim+0.192
velt+0.190
hor+0.185
Neg logit contributions
 amen-0.168
 lent-0.161
 sidel-0.157
 conject-0.156
 helic-0.155
Token user
Token ID2836
Feature activation+5.38e+00

Pos logit contributions
ti+1.825
ansas+1.811
velt+1.674
anim+1.669
hor+1.611
Neg logit contributions
 amen-1.823
 lent-1.745
selves-1.727
 conject-1.715
 helic-1.714
Token,
Token ID11
Feature activation+2.41e+00

Pos logit contributions
ti+4.179
ansas+4.087
velt+3.879
anim+3.847
hor+3.743
Neg logit contributions
 amen-3.265
selves-3.076
 helic-3.065
 lent-3.010
 sidel-2.977
Token Haley
Token ID30646
Feature activation+3.19e+00

Pos logit contributions
ti+1.864
ansas+1.858
anim+1.742
velt+1.734
hor+1.678
Neg logit contributions
 amen-1.521
 lent-1.449
 sidel-1.417
selves-1.413
 conject-1.410
Token Lent
Token ID43751
Feature activation-5.01e-02

Pos logit contributions
ti+2.457
ansas+2.417
anim+2.271
velt+2.271
hor+2.194
Neg logit contributions
 amen-2.051
 lent-1.951
 sidel-1.920
 conject-1.909
 helic-1.906
Token,
Token ID11
Feature activation-7.31e-01

Pos logit contributions
 amen+0.039
 lent+0.037
 sidel+0.037
 conject+0.036
selves+0.036
Neg logit contributions
ansas-0.032
ti-0.032
anim-0.030
velt-0.029
hor-0.028
Token told
Token ID1297
Feature activation+1.85e+00

Pos logit contributions
 amen+0.473
 lent+0.453
 sidel+0.443
 conject+0.439
selves+0.437
Neg logit contributions
ansas-0.562
ti-0.559
anim-0.525
velt-0.522
hor-0.503
Token://
Token ID1378
Feature activation+4.15e-01

Pos logit contributions
 amen+0.067
 lent+0.062
 sidel+0.060
 helic+0.059
 conject+0.059
Neg logit contributions
ti-0.189
ansas-0.188
anim-0.180
velt-0.179
hor-0.175
Tokenwww
Token ID2503
Feature activation+4.29e+00

Pos logit contributions
ti+0.288
ansas+0.283
anim+0.267
velt+0.260
hor+0.255
Neg logit contributions
 amen-0.306
 lent-0.293
 sidel-0.287
 conject-0.287
 helic-0.285
Token.
Token ID13
Feature activation-2.54e-01

Pos logit contributions
ti+3.304
ansas+3.218
anim+3.058
velt+2.961
hor+2.919
Neg logit contributions
 amen-2.781
 helic-2.732
 lent-2.700
 helicop-2.661
 thous-2.655
Tokenbel
Token ID6667
Feature activation+6.03e+00

Pos logit contributions
 amen+0.217
 lent+0.209
 sidel+0.206
 conject+0.205
 helic+0.204
Neg logit contributions
ansas-0.144
ti-0.144
anim-0.131
velt-0.129
hor-0.124
Tokenfast
Token ID7217
Feature activation+8.42e+00

Pos logit contributions
ti+2.490
ansas+2.389
velt+2.151
anim+2.127
hor+1.960
Neg logit contributions
 amen-6.099
 lent-5.859
 helic-5.838
 sidel-5.788
 conject-5.720
Tokente
Token ID660
Feature activation+8.33e+00

Pos logit contributions
ti+3.171
ansas+2.514
anim+2.427
hor+2.291
spe+2.172
Neg logit contributions
 amen-9.036
 thous-8.740
 helic-8.734
 lent-8.654
 conject-8.635
Tokenlegraph
Token ID16606
Feature activation+4.64e+00

Pos logit contributions
ti+1.970
ansas+1.736
anim+1.620
velt+1.509
spe+1.312
Neg logit contributions
 amen-9.571
 helic-9.509
 thous-9.392
 lent-9.386
 sidel-9.336
Token.
Token ID13
Feature activation+7.32e-03

Pos logit contributions
ti+3.611
ansas+3.522
anim+3.357
velt+3.299
hor+3.210
Neg logit contributions
 amen-2.990
 helic-2.888
 lent-2.867
 sidel-2.837
 conject-2.811
Tokenco
Token ID1073
Feature activation+2.90e-01

Pos logit contributions
ansas+0.004
ti+0.004
anim+0.004
velt+0.004
hor+0.004
Neg logit contributions
 amen-0.006
 lent-0.006
 sidel-0.006
 conject-0.006
selves-0.006
Token.
Token ID13
Feature activation+1.59e+00

Pos logit contributions
ansas+0.245
ti+0.245
anim+0.231
velt+0.230
hor+0.222
Neg logit contributions
 amen-0.165
 lent-0.157
 sidel-0.153
 conject-0.152
 helic-0.151
Tokenuk
Token ID2724
Feature activation-4.78e+00

Pos logit contributions
ansas+0.679
ti+0.678
anim+0.605
velt+0.597
hor+0.560
Neg logit contributions
 amen-1.581
 lent-1.538
 helic-1.524
 sidel-1.522
 conject-1.510
Token industry
Token ID2831
Feature activation+3.52e+00

Pos logit contributions
ti+1.127
ansas+1.126
anim+1.049
velt+1.038
hor+1.001
Neg logit contributions
 amen-1.258
 lent-1.207
 sidel-1.190
 conject-1.186
selves-1.184
Token
Token ID198
Feature activation+1.76e+00

Pos logit contributions
ansas+2.445
ti+2.435
anim+2.266
velt+2.258
hor+2.183
Neg logit contributions
 amen-2.479
 lent-2.360
 sidel-2.334
selves-2.328
 conject-2.325
Token
Token ID198
Feature activation+1.60e-01

Pos logit contributions
ti+1.328
ansas+1.311
anim+1.238
velt+1.212
hor+1.185
Neg logit contributions
 amen-1.174
 lent-1.129
 helic-1.124
 sidel-1.105
 conject-1.101
TokenMany
Token ID7085
Feature activation+9.79e-01

Pos logit contributions
ansas+0.121
ti+0.121
anim+0.113
velt+0.113
hor+0.108
Neg logit contributions
 amen-0.105
 lent-0.100
 sidel-0.098
 conject-0.097
selves-0.097
Token toy
Token ID13373
Feature activation+2.62e-01

Pos logit contributions
ansas+0.700
ti+0.699
anim+0.655
velt+0.645
hor+0.624
Neg logit contributions
 amen-0.681
 lent-0.650
 sidel-0.640
selves-0.634
 conject-0.634
Token manufacturers
Token ID11372
Feature activation+4.13e+00

Pos logit contributions
ansas+0.139
ti+0.138
anim+0.126
velt+0.125
hor+0.118
Neg logit contributions
 amen-0.231
 lent-0.223
 sidel-0.220
 conject-0.219
selves-0.218
Token are
Token ID389
Feature activation+9.56e-04

Pos logit contributions
ti+3.010
ansas+2.991
anim+2.791
velt+2.777
hor+2.703
Neg logit contributions
 amen-2.738
 lent-2.562
selves-2.552
 conject-2.549
 sidel-2.547
Token global
Token ID3298
Feature activation-1.80e-01

Pos logit contributions
ansas+0.001
ti+0.001
anim+0.001
velt+0.001
hor+0.001
Neg logit contributions
 amen-0.001
 lent-0.001
 sidel-0.001
selves-0.001
 conject-0.001
Token giants
Token ID20178
Feature activation+8.78e-01

Pos logit contributions
 amen+0.136
 lent+0.131
 sidel+0.129
 conject+0.128
selves+0.128
Neg logit contributions
ansas-0.119
ti-0.118
anim-0.110
velt-0.109
hor-0.104
Token that
Token ID326
Feature activation+1.48e+00

Pos logit contributions
ansas+0.725
ti+0.723
anim+0.682
velt+0.675
hor+0.655
Neg logit contributions
 amen-0.514
 lent-0.487
 sidel-0.476
 conject-0.475
 helic-0.473
Token provide
Token ID2148
Feature activation+4.84e-01

Pos logit contributions
ti+1.086
ansas+1.086
anim+1.009
velt+1.009
hor+0.971
Neg logit contributions
 amen-0.994
 lent-0.942
 sidel-0.933
 conject-0.927
selves-0.923
Token they
Token ID484
Feature activation+7.71e-01

Pos logit contributions
 amen+0.407
 lent+0.390
 sidel+0.381
 conject+0.374
selves+0.372
Neg logit contributions
ansas-0.528
ti-0.523
velt-0.491
anim-0.491
hor-0.474
Token begin
Token ID2221
Feature activation+6.83e-01

Pos logit contributions
ti+0.586
ansas+0.586
anim+0.548
velt+0.546
hor+0.526
Neg logit contributions
 amen-0.498
 lent-0.474
 conject-0.465
 sidel-0.464
selves-0.462
Token streaming
Token ID11305
Feature activation-2.98e-01

Pos logit contributions
ansas+0.532
ti+0.531
anim+0.499
velt+0.495
hor+0.479
Neg logit contributions
 amen-0.430
 lent-0.411
 sidel-0.400
 conject-0.399
selves-0.399
Token.
Token ID13
Feature activation+9.83e-01

Pos logit contributions
 amen+0.176
 lent+0.168
 sidel+0.164
 conject+0.162
selves+0.162
Neg logit contributions
ansas-0.246
ti-0.244
velt-0.230
anim-0.230
hor-0.222
Token
Token ID198
Feature activation+1.52e+00

Pos logit contributions
ansas+0.801
ti+0.800
anim+0.753
velt+0.748
hor+0.723
Neg logit contributions
 amen-0.590
 lent-0.561
 sidel-0.547
 conject-0.544
 helic-0.541
Token
Token ID198
Feature activation+1.12e+00

Pos logit contributions
ti+1.159
ansas+1.142
anim+1.080
velt+1.063
hor+1.037
Neg logit contributions
 amen-0.995
 lent-0.957
 helic-0.947
 sidel-0.936
 conject-0.930
TokenTw
Token ID5080
Feature activation+2.72e+00

Pos logit contributions
ti+0.851
ansas+0.845
anim+0.796
velt+0.785
hor+0.762
Neg logit contributions
 amen-0.746
 lent-0.713
 conject-0.694
 sidel-0.693
 helic-0.693
Tokenitch
Token ID2007
Feature activation+1.95e+00

Pos logit contributions
ti+1.365
ansas+1.325
anim+1.220
velt+1.205
inois+1.128
Neg logit contributions
 amen-2.529
 lent-2.433
 helic-2.410
 conject-2.395
selves-2.392
Token enables
Token ID13536
Feature activation+2.34e+00

Pos logit contributions
ti+1.470
ansas+1.464
anim+1.374
velt+1.362
hor+1.318
Neg logit contributions
 amen-1.284
 lent-1.219
 sidel-1.195
selves-1.193
 conject-1.191
Token video
Token ID2008
Feature activation+1.93e+00

Pos logit contributions
ti+1.568
ansas+1.558
anim+1.468
velt+1.437
hor+1.391
Neg logit contributions
 amen-1.704
 lent-1.632
selves-1.607
 helic-1.598
 conject-1.596
Token game
Token ID983
Feature activation-3.17e+00

Pos logit contributions
ti+1.326
ansas+1.318
anim+1.239
velt+1.219
hor+1.177
Neg logit contributions
 amen-1.385
 lent-1.324
selves-1.314
 conject-1.293
 sidel-1.292
Token scan
Token ID9367
Feature activation-2.45e+00

Pos logit contributions
 amen+0.755
 lent+0.720
 sidel+0.703
 conject+0.697
selves+0.695
Neg logit contributions
ansas-0.982
ti-0.978
anim-0.921
velt-0.917
hor-0.884
Token to
Token ID284
Feature activation+1.05e+00

Pos logit contributions
 amen+1.429
 lent+1.368
 sidel+1.329
 conject+1.316
selves+1.303
Neg logit contributions
ansas-2.037
ti-2.024
anim-1.913
velt-1.909
hor-1.841
Token confirm
Token ID6216
Feature activation-1.29e+00

Pos logit contributions
ti+0.759
ansas+0.758
anim+0.708
velt+0.706
hor+0.679
Neg logit contributions
 amen-0.727
 lent-0.693
selves-0.680
 sidel-0.676
 conject-0.675
Token that
Token ID326
Feature activation-2.52e-01

Pos logit contributions
 amen+0.705
 lent+0.668
 sidel+0.650
 conject+0.644
selves+0.642
Neg logit contributions
ansas-1.126
ti-1.123
anim-1.061
velt-1.057
hor-1.023
Token Kell
Token ID25043
Feature activation+5.12e+00

Pos logit contributions
 amen+0.161
 lent+0.154
 sidel+0.151
 conject+0.149
selves+0.149
Neg logit contributions
ansas-0.195
ti-0.194
anim-0.182
velt-0.181
hor-0.175
Token Brook
Token ID7590
Feature activation+1.29e+00

Pos logit contributions
ti+3.334
ansas+3.303
anim+3.131
velt+3.024
inois+2.982
Neg logit contributions
 amen-3.866
 lent-3.720
 sidel-3.649
 conject-3.638
 helic-3.635
Token
Token ID447
Feature activation-1.24e+00

Pos logit contributions
ti+0.939
ansas+0.937
velt+0.874
anim+0.873
hor+0.843
Neg logit contributions
 amen-0.876
 lent-0.837
 sidel-0.823
selves-0.819
 conject-0.817
Token
Token ID247
Feature activation+2.36e+00

Pos logit contributions
 amen+0.632
 lent+0.599
 sidel+0.586
 conject+0.581
selves+0.576
Neg logit contributions
ti-1.117
ansas-1.114
velt-1.055
anim-1.055
hor-1.023
Tokens
Token ID82
Feature activation+1.03e+00

Pos logit contributions
ti+1.799
ansas+1.795
anim+1.677
velt+1.667
hor+1.616
Neg logit contributions
 amen-1.529
 lent-1.458
 sidel-1.433
 conject-1.427
selves-1.421
Token left
Token ID1364
Feature activation-8.03e-01

Pos logit contributions
ansas+0.856
ti+0.852
anim+0.803
velt+0.798
hor+0.773
Neg logit contributions
 amen-0.594
 lent-0.566
selves-0.551
 sidel-0.550
 conject-0.550
Token eye
Token ID4151
Feature activation-4.43e+00

Pos logit contributions
 amen+0.463
 lent+0.440
 sidel+0.429
selves+0.426
 conject+0.426
Neg logit contributions
ansas-0.673
ti-0.670
anim-0.632
velt-0.629
hor-0.608
Token have
Token ID423
Feature activation-7.84e-01

Pos logit contributions
ti+1.025
ansas+1.024
anim+0.967
velt+0.949
hor+0.926
Neg logit contributions
 amen-0.750
 lent-0.713
 sidel-0.701
selves-0.697
 helic-0.695
Token not
Token ID407
Feature activation-1.02e+00

Pos logit contributions
 amen+0.515
 lent+0.492
 sidel+0.482
 conject+0.478
selves+0.477
Neg logit contributions
ansas-0.594
ti-0.592
anim-0.555
velt-0.551
hor-0.532
Token yet
Token ID1865
Feature activation+5.90e-01

Pos logit contributions
 amen+0.638
 lent+0.609
 sidel+0.595
 conject+0.590
selves+0.589
Neg logit contributions
ansas-0.806
ti-0.803
anim-0.755
velt-0.750
hor-0.725
Token been
Token ID587
Feature activation-2.25e+00

Pos logit contributions
ansas+0.469
ti+0.468
anim+0.442
velt+0.434
hor+0.425
Neg logit contributions
 amen-0.362
 lent-0.340
 sidel-0.337
selves-0.334
 helic-0.332
Token final
Token ID2457
Feature activation+4.50e-01

Pos logit contributions
 amen+1.565
 lent+1.515
 sidel+1.473
 conject+1.471
selves+1.451
Neg logit contributions
ansas-1.624
ti-1.610
velt-1.512
anim-1.506
hor-1.447
Tokenised
Token ID1417
Feature activation+1.32e+00

Pos logit contributions
ti+0.213
ansas+0.212
anim+0.193
velt+0.186
hor+0.178
Neg logit contributions
 amen-0.423
 lent-0.408
 sidel-0.405
 helic-0.402
selves-0.402
Token,
Token ID11
Feature activation+1.24e-03

Pos logit contributions
ti+1.101
ansas+1.098
anim+1.040
velt+1.023
hor+0.999
Neg logit contributions
 amen-0.755
 lent-0.721
 sidel-0.702
 helic-0.698
 conject-0.693
Token
Token ID447
Feature activation+5.42e-01

Pos logit contributions
ansas+0.001
ti+0.001
anim+0.001
velt+0.001
hor+0.001
Neg logit contributions
 amen-0.001
 lent-0.001
 sidel-0.001
 conject-0.001
selves-0.001
Token
Token ID251
Feature activation-3.81e-01

Pos logit contributions
ti+0.451
ansas+0.449
anim+0.422
velt+0.420
hor+0.408
Neg logit contributions
 amen-0.313
 lent-0.299
 sidel-0.292
 conject-0.290
 helic-0.289
Token the
Token ID262
Feature activation+2.62e-01

Pos logit contributions
 amen+0.230
 lent+0.219
 sidel+0.214
 conject+0.212
selves+0.211
Neg logit contributions
ansas-0.308
ti-0.307
anim-0.289
velt-0.288
hor-0.278
Token statement
Token ID2643
Feature activation+7.03e-01

Pos logit contributions
ansas+0.201
ti+0.200
anim+0.188
velt+0.187
hor+0.180
Neg logit contributions
 amen-0.170
 lent-0.162
 sidel-0.158
 conject-0.157
selves-0.156
Token Mr
Token ID1770
Feature activation+2.65e+00

Pos logit contributions
ansas+1.348
ti+1.344
anim+1.270
velt+1.259
hor+1.218
Neg logit contributions
 amen-0.899
 lent-0.852
 sidel-0.830
selves-0.829
 helic-0.826
Token.
Token ID13
Feature activation+1.36e+00

Pos logit contributions
ti+2.116
ansas+2.096
anim+1.985
velt+1.947
hor+1.890
Neg logit contributions
 amen-1.632
 lent-1.576
 helic-1.573
 sidel-1.542
 conject-1.517
Token Sanders
Token ID5831
Feature activation+1.80e+00

Pos logit contributions
ansas+0.800
ti+0.795
anim+0.731
velt+0.726
hor+0.692
Neg logit contributions
 amen-1.123
 lent-1.084
 sidel-1.065
 conject-1.058
selves-1.057
Token isn
Token ID2125
Feature activation+1.39e-01

Pos logit contributions
ansas+1.303
ti+1.302
anim+1.213
velt+1.204
hor+1.157
Neg logit contributions
 amen-1.241
 lent-1.190
 sidel-1.168
 helic-1.158
selves-1.158
Token
Token ID447
Feature activation+2.34e+00

Pos logit contributions
ti+0.103
ansas+0.102
anim+0.096
velt+0.095
hor+0.092
Neg logit contributions
 amen-0.094
 lent-0.090
 helic-0.088
 sidel-0.088
 conject-0.088
Token
Token ID247
Feature activation+1.78e+00

Pos logit contributions
ti+1.951
ansas+1.847
velt+1.780
anim+1.772
inois+1.730
Neg logit contributions
 amen-1.366
 conject-1.289
 sidel-1.281
 lent-1.280
 helic-1.265
Tokent
Token ID83
Feature activation+2.98e-01

Pos logit contributions
ti+0.857
ansas+0.809
anim+0.736
velt+0.717
hor+0.693
Neg logit contributions
 amen-1.698
 lent-1.630
 sidel-1.617
 helic-1.615
 conject-1.606
Token going
Token ID1016
Feature activation+3.94e+00

Pos logit contributions
ansas+0.228
ti+0.227
anim+0.213
velt+0.211
hor+0.204
Neg logit contributions
 amen-0.193
 lent-0.184
 sidel-0.180
 conject-0.179
selves-0.179
Token anywhere
Token ID6609
Feature activation+1.33e+00

Pos logit contributions
ansas+3.121
ti+3.114
anim+2.924
velt+2.882
hor+2.793
Neg logit contributions
 amen-2.426
 lent-2.316
 conject-2.260
 sidel-2.259
selves-2.257
Token.
Token ID13
Feature activation+1.44e+00

Pos logit contributions
ansas+1.063
ti+1.059
anim+0.996
velt+0.990
hor+0.957
Neg logit contributions
 amen-0.811
 lent-0.773
 sidel-0.755
 conject-0.749
selves-0.748
Token
Token ID198
Feature activation+9.54e-02

Pos logit contributions
ansas+1.223
ti+1.221
anim+1.150
velt+1.142
hor+1.106
Neg logit contributions
 amen-0.816
 lent-0.776
 sidel-0.757
 conject-0.751
 helic-0.750
Token doors
Token ID8215
Feature activation-2.27e+00

Pos logit contributions
ansas+1.147
ti+1.146
anim+1.068
velt+1.060
hor+1.025
Neg logit contributions
 amen-1.149
 lent-1.100
 sidel-1.082
selves-1.075
 conject-1.074
Token because
Token ID780
Feature activation+1.89e+00

Pos logit contributions
 amen+1.348
 lent+1.287
 sidel+1.254
selves+1.240
 conject+1.239
Neg logit contributions
ansas-1.858
ti-1.850
anim-1.741
velt-1.735
hor-1.673
Token it
Token ID340
Feature activation-5.97e-02

Pos logit contributions
ti+1.545
ansas+1.540
anim+1.462
velt+1.445
hor+1.402
Neg logit contributions
 amen-1.108
 lent-1.049
selves-1.031
 sidel-1.030
 conject-1.024
Token would
Token ID561
Feature activation-2.40e+00

Pos logit contributions
 amen+0.042
 lent+0.040
 sidel+0.040
 conject+0.039
selves+0.039
Neg logit contributions
ansas-0.042
ti-0.042
anim-0.039
velt-0.039
hor-0.038
Token impede
Token ID43195
Feature activation-1.29e+00

Pos logit contributions
 amen+1.565
 lent+1.512
 sidel+1.479
 conject+1.449
selves+1.446
Neg logit contributions
ansas-1.842
ti-1.817
anim-1.712
velt-1.708
inois-1.628
Token movement
Token ID3356
Feature activation-1.72e+00

Pos logit contributions
 amen+0.769
 lent+0.733
 sidel+0.717
 conject+0.709
selves+0.702
Neg logit contributions
ansas-1.059
ti-1.052
anim-0.988
velt-0.987
hor-0.950
Token of
Token ID286
Feature activation+9.72e-01

Pos logit contributions
 amen+0.962
 lent+0.916
 sidel+0.891
 conject+0.883
selves+0.877
Neg logit contributions
ansas-1.480
ti-1.470
anim-1.390
velt-1.385
hor-1.338
Token the
Token ID262
Feature activation+2.59e+00

Pos logit contributions
ansas+0.808
ti+0.807
anim+0.762
velt+0.756
hor+0.733
Neg logit contributions
 amen-0.564
 lent-0.534
 sidel-0.522
 conject-0.517
selves-0.517
Token wheelchair
Token ID27181
Feature activation+9.74e-01

Pos logit contributions
ti+1.692
ansas+1.685
anim+1.573
velt+1.545
hor+1.499
Neg logit contributions
 amen-1.950
 lent-1.861
 sidel-1.834
selves-1.831
 conject-1.827
Token.
Token ID13
Feature activation+4.12e-01

Pos logit contributions
ansas+0.797
ti+0.796
anim+0.750
velt+0.746
hor+0.724
Neg logit contributions
 amen-0.579
 lent-0.548
 sidel-0.537
 conject-0.534
selves-0.531
Token During
Token ID5856
Feature activation+2.30e+00

Pos logit contributions
ansas+0.351
ti+0.350
anim+0.331
velt+0.329
hor+0.318
Neg logit contributions
 amen-0.232
 lent-0.220
 sidel-0.215
 conject-0.213
selves-0.212
Token lives
Token ID3160
Feature activation-1.32e+00

Pos logit contributions
 amen+1.800
 lent+1.691
 sidel+1.639
selves+1.635
 conject+1.624
Neg logit contributions
ansas-3.118
ti-3.088
velt-2.931
anim-2.914
inois-2.848
Token.
Token ID13
Feature activation-2.72e+00

Pos logit contributions
 amen+0.750
 lent+0.713
 sidel+0.695
 conject+0.688
selves+0.687
Neg logit contributions
ansas-1.110
ti-1.106
anim-1.044
velt-1.039
hor-1.005
Token Government
Token ID5070
Feature activation-1.12e+00

Pos logit contributions
 amen+1.463
 lent+1.389
 sidel+1.350
selves+1.347
 conject+1.333
Neg logit contributions
ansas-2.377
ti-2.348
velt-2.233
anim-2.225
inois-2.157
Token is
Token ID318
Feature activation-4.22e-01

Pos logit contributions
 amen+0.733
 lent+0.702
 sidel+0.687
 conject+0.681
selves+0.679
Neg logit contributions
ansas-0.847
ti-0.843
anim-0.790
velt-0.786
hor-0.757
Token committed
Token ID5364
Feature activation+2.96e+00

Pos logit contributions
 amen+0.248
 lent+0.235
 sidel+0.229
selves+0.228
 helic+0.227
Neg logit contributions
ti-0.348
ansas-0.347
anim-0.327
velt-0.324
hor-0.315
Token to
Token ID284
Feature activation-1.07e+00

Pos logit contributions
ti+2.345
ansas+2.325
anim+2.198
velt+2.172
hor+2.103
Neg logit contributions
 amen-1.836
 lent-1.749
 sidel-1.721
 conject-1.707
selves-1.693
Token reduce
Token ID4646
Feature activation-2.29e+00

Pos logit contributions
 amen+0.643
 lent+0.612
 sidel+0.598
 conject+0.593
selves+0.592
Neg logit contributions
ansas-0.866
ti-0.864
anim-0.813
velt-0.809
hor-0.782
Token the
Token ID262
Feature activation-1.54e+00

Pos logit contributions
 amen+1.307
 lent+1.243
 sidel+1.212
 conject+1.200
selves+1.194
Neg logit contributions
ansas-1.939
ti-1.929
anim-1.820
velt-1.814
hor-1.753
Token accidents
Token ID17390
Feature activation+8.49e-01

Pos logit contributions
 amen+0.823
 lent+0.779
 sidel+0.759
 conject+0.751
selves+0.747
Neg logit contributions
ansas-1.352
ti-1.345
anim-1.272
velt-1.269
hor-1.227
Token and
Token ID290
Feature activation-3.39e+00

Pos logit contributions
ti+0.692
ansas+0.688
anim+0.650
velt+0.646
hor+0.627
Neg logit contributions
 amen-0.507
 lent-0.481
 sidel-0.468
selves-0.466
 conject-0.465
Token fatalities
Token ID30091
Feature activation+1.60e+00

Pos logit contributions
 amen+2.129
 lent+2.033
 sidel+1.993
 conject+1.964
 helic+1.953
Neg logit contributions
ansas-2.697
ti-2.655
anim-2.495
velt-2.476
hor-2.404
Token
Token ID198
Feature activation+6.86e-01

Pos logit contributions
 amen+0.229
 lent+0.219
 sidel+0.213
 conject+0.211
selves+0.211
Neg logit contributions
ansas-0.321
ti-0.320
anim-0.302
velt-0.300
hor-0.291
Token
Token ID198
Feature activation-2.73e-01

Pos logit contributions
ti+0.531
ansas+0.527
anim+0.496
velt+0.490
hor+0.476
Neg logit contributions
 amen-0.441
 lent-0.424
 helic-0.415
 sidel-0.415
 conject-0.410
TokenAs
Token ID1722
Feature activation-1.40e+00

Pos logit contributions
 amen+0.171
 lent+0.163
 sidel+0.160
selves+0.158
 conject+0.158
Neg logit contributions
ansas-0.215
ti-0.214
anim-0.201
velt-0.201
hor-0.193
Token a
Token ID257
Feature activation+5.78e-01

Pos logit contributions
 amen+0.819
 lent+0.779
 sidel+0.762
 conject+0.757
selves+0.746
Neg logit contributions
ansas-1.155
ti-1.153
anim-1.086
velt-1.079
hor-1.043
Token ball
Token ID2613
Feature activation-3.11e+00

Pos logit contributions
ti+0.472
ansas+0.469
anim+0.444
velt+0.441
hor+0.428
Neg logit contributions
 amen-0.342
 lent-0.325
selves-0.317
 conject-0.312
 helic-0.312
Token of
Token ID286
Feature activation-5.06e-01

Pos logit contributions
 amen+1.731
 lent+1.654
 sidel+1.608
 conject+1.591
selves+1.573
Neg logit contributions
ansas-2.646
ti-2.642
anim-2.489
velt-2.488
hor-2.398
Token fire
Token ID2046
Feature activation+2.55e+00

Pos logit contributions
 amen+0.399
 lent+0.385
 sidel+0.378
 conject+0.376
selves+0.374
Neg logit contributions
ansas-0.317
ti-0.316
anim-0.292
velt-0.290
hor-0.277
Token envelop
Token ID16441
Feature activation+1.90e+00

Pos logit contributions
ansas+1.946
ti+1.943
anim+1.814
velt+1.792
hor+1.736
Neg logit contributions
 amen-1.663
 lent-1.562
selves-1.541
 sidel-1.540
 helic-1.520
Tokened
Token ID276
Feature activation+1.04e+00

Pos logit contributions
ansas+1.553
ti+1.552
anim+1.474
velt+1.445
hor+1.413
Neg logit contributions
 amen-1.144
 lent-1.058
 sidel-1.052
 conject-1.036
 helic-1.029
Token the
Token ID262
Feature activation-2.90e-02

Pos logit contributions
ansas+0.900
ti+0.898
anim+0.849
velt+0.845
hor+0.819
Neg logit contributions
 amen-0.568
 lent-0.538
 sidel-0.521
 conject-0.518
selves-0.517
Token launch
Token ID4219
Feature activation-3.93e-01

Pos logit contributions
 amen+0.020
 lent+0.019
 sidel+0.018
 conject+0.018
selves+0.018
Neg logit contributions
ansas-0.021
ti-0.021
anim-0.020
velt-0.020
hor-0.019
Token land
Token ID1956
Feature activation-2.66e+00

Pos logit contributions
 amen+1.036
 lent+0.989
 sidel+0.968
 conject+0.959
selves+0.956
Neg logit contributions
ansas-1.259
ti-1.254
anim-1.177
velt-1.172
hor-1.129
Token confisc
Token ID22735
Feature activation-4.86e+00

Pos logit contributions
 amen+1.577
 lent+1.505
 sidel+1.465
 conject+1.451
selves+1.443
Neg logit contributions
ansas-2.185
ti-2.177
anim-2.049
velt-2.042
hor-1.970
Tokenation
Token ID341
Feature activation-6.18e+00

Pos logit contributions
 amen+3.489
 lent+3.398
 conject+3.307
 sidel+3.306
selves+3.252
Neg logit contributions
ansas-3.282
ti-3.226
velt-3.062
anim-2.961
hor-2.878
Token and
Token ID290
Feature activation-5.39e+00

Pos logit contributions
 amen+3.404
 lent+3.313
 sidel+3.213
 conject+3.189
selves+3.079
Neg logit contributions
ansas-5.286
ti-5.198
velt-4.915
anim-4.906
inois-4.746
Token Juda
Token ID24771
Feature activation-4.69e+00

Pos logit contributions
 amen+3.430
 lent+3.243
 sidel+3.225
 conject+3.121
 helic+3.083
Neg logit contributions
ansas-4.220
ti-4.193
velt-3.932
anim-3.919
inois-3.785
Tokenisation
Token ID5612
Feature activation-2.15e+00

Pos logit contributions
 amen+2.932
 lent+2.778
 sidel+2.709
 conject+2.684
selves+2.660
Neg logit contributions
ansas-3.668
ti-3.646
anim-3.409
velt-3.377
hor-3.278
Token that
Token ID326
Feature activation-1.08e+00

Pos logit contributions
 amen+1.248
 lent+1.189
 sidel+1.159
 conject+1.149
selves+1.142
Neg logit contributions
ansas-1.791
ti-1.784
anim-1.682
velt-1.675
hor-1.620
Token has
Token ID468
Feature activation-1.09e+00

Pos logit contributions
 amen+0.721
 lent+0.691
 sidel+0.676
 conject+0.669
selves+0.668
Neg logit contributions
ansas-0.802
ti-0.798
anim-0.747
velt-0.743
hor-0.716
Token be
Token ID307
Feature activation-8.09e-01

Pos logit contributions
 amen+0.666
 lent+0.635
 sidel+0.620
 conject+0.615
selves+0.613
Neg logit contributions
ansas-0.873
ti-0.870
anim-0.819
velt-0.815
hor-0.787
Tokenfall
Token ID7207
Feature activation+3.23e+00

Pos logit contributions
 amen+0.666
 lent+0.643
 sidel+0.632
 conject+0.628
selves+0.627
Neg logit contributions
ansas-0.478
ti-0.475
anim-0.437
velt-0.434
hor-0.413
Tokenen
Token ID268
Feature activation-4.17e-01

Pos logit contributions
ti+2.277
ansas+2.262
anim+2.163
velt+2.100
hor+2.054
Neg logit contributions
 amen-2.301
 sidel-2.156
 lent-2.153
 conject-2.134
 helic-2.129
Token personal
Token ID2614
Feature activation-2.45e-01

Pos logit contributions
 amen+0.578
 lent+0.551
 sidel+0.538
selves+0.534
 conject+0.534
Neg logit contributions
ansas-0.694
ti-0.692
anim-0.650
velt-0.645
hor-0.625
Token opinion
Token ID4459
Feature activation+5.94e-01

Pos logit contributions
 amen+0.185
 lent+0.178
 sidel+0.175
selves+0.174
 conject+0.173
Neg logit contributions
ansas-0.160
ti-0.160
anim-0.149
velt-0.147
hor-0.142
Token about
Token ID546
Feature activation-1.54e+00

Pos logit contributions
ti+0.507
ansas+0.506
anim+0.479
velt+0.474
hor+0.461
Neg logit contributions
 amen-0.331
 lent-0.313
 sidel-0.306
 helic-0.304
selves-0.303
Token Islam
Token ID3449
Feature activation+1.63e+00

Pos logit contributions
 amen+0.929
 lent+0.886
 sidel+0.866
 conject+0.858
selves+0.853
Neg logit contributions
ansas-1.254
ti-1.250
anim-1.174
velt-1.174
hor-1.130
Token.
Token ID13
Feature activation-7.04e-01

Pos logit contributions
ti+1.391
ansas+1.389
anim+1.316
velt+1.301
hor+1.262
Neg logit contributions
 amen-0.910
 lent-0.858
 sidel-0.839
 helic-0.837
 conject-0.833
Token It
Token ID632
Feature activation-2.06e+00

Pos logit contributions
 amen+0.387
 lent+0.367
 sidel+0.358
 conject+0.355
selves+0.353
Neg logit contributions
ansas-0.608
ti-0.607
anim-0.573
velt-0.570
hor-0.552
Token den
Token ID2853
Feature activation+2.50e+00

Pos logit contributions
 amen+1.350
 lent+1.301
 sidel+1.268
 conject+1.263
selves+1.255
Neg logit contributions
ansas-1.559
ti-1.537
anim-1.451
velt-1.438
hor-1.387
Tokenigr
Token ID3692
Feature activation+3.67e+00

Pos logit contributions
ti+1.320
ansas+1.302
anim+1.210
velt+1.191
hor+1.153
Neg logit contributions
 amen-2.198
 lent-2.124
 sidel-2.104
selves-2.092
 helic-2.080
Tokenates
Token ID689
Feature activation+1.59e+00

Pos logit contributions
ti+1.728
ansas+1.718
anim+1.607
velt+1.489
hor+1.467
Neg logit contributions
 amen-3.449
 lent-3.366
 sidel-3.296
selves-3.280
 helic-3.278
Token the
Token ID262
Feature activation+1.53e-01

Pos logit contributions
ansas+1.261
ti+1.257
anim+1.194
velt+1.169
hor+1.142
Neg logit contributions
 amen-0.970
 lent-0.927
selves-0.903
 sidel-0.901
 helic-0.901
Token religion
Token ID5737
Feature activation+4.36e+00

Pos logit contributions
ansas+0.123
ti+0.122
anim+0.115
velt+0.114
hor+0.111
Neg logit contributions
 amen-0.093
 lent-0.088
 sidel-0.086
 conject-0.085
selves-0.085
Tokenote
Token ID1258
Feature activation+1.58e+00

Pos logit contributions
 amen+0.893
 lent+0.867
 sidel+0.853
 conject+0.850
selves+0.849
Neg logit contributions
ansas-0.482
ti-0.479
anim-0.433
velt-0.429
hor-0.403
Token (
Token ID357
Feature activation-2.77e+00

Pos logit contributions
ansas+1.304
ti+1.301
anim+1.226
velt+1.221
hor+1.180
Neg logit contributions
 amen-0.935
 lent-0.889
 sidel-0.869
 conject-0.860
selves-0.858
TokenO
Token ID46
Feature activation-2.55e-01

Pos logit contributions
 amen+1.742
 lent+1.661
 sidel+1.617
selves+1.615
 conject+1.601
Neg logit contributions
ansas-2.149
ti-2.106
velt-1.996
anim-1.985
inois-1.923
Tokenmega
Token ID13731
Feature activation-2.28e+00

Pos logit contributions
 amen+0.167
 lent+0.161
 sidel+0.156
 conject+0.156
 helic+0.155
Neg logit contributions
ansas-0.189
ti-0.187
anim-0.177
velt-0.176
inois-0.171
Token Pharma
Token ID41845
Feature activation-5.88e-01

Pos logit contributions
 amen+1.912
 lent+1.849
 sidel+1.819
 conject+1.806
selves+1.805
Neg logit contributions
ansas-1.310
ti-1.304
anim-1.195
velt-1.186
hor-1.128
Token -
Token ID532
Feature activation-8.26e+00

Pos logit contributions
 amen+0.379
 lent+0.362
 sidel+0.355
 helic+0.352
 conject+0.352
Neg logit contributions
ti-0.452
ansas-0.450
anim-0.423
velt-0.418
hor-0.406
Token Quick
Token ID12029
Feature activation+1.25e+00

Pos logit contributions
 amen+4.662
 lent+4.541
selves+4.395
 conject+4.312
 sidel+4.261
Neg logit contributions
ansas-6.782
ti-6.510
velt-6.414
inois-6.241
hor-6.202
Tokenstep
Token ID9662
Feature activation+1.52e+00

Pos logit contributions
ti+0.378
ansas+0.373
anim+0.319
velt+0.307
hor+0.280
Neg logit contributions
 amen-1.397
 lent-1.360
 sidel-1.344
 helic-1.338
 conject-1.336
Token)
Token ID8
Feature activation+1.55e+00

Pos logit contributions
ti+1.393
ansas+1.378
anim+1.311
velt+1.284
hor+1.262
Neg logit contributions
 amen-0.767
 helic-0.731
 lent-0.726
 sidel-0.714
 conject-0.704
Token s
Token ID264
Feature activation-1.80e-03

Pos logit contributions
ansas+1.400
ti+1.396
anim+1.322
velt+1.318
hor+1.277
Neg logit contributions
 amen-0.800
 lent-0.755
 sidel-0.734
 conject-0.726
selves-0.723
Token.
Token ID13
Feature activation+8.50e-01

Pos logit contributions
 amen+0.001
 lent+0.001
 sidel+0.001
 helic+0.001
 conject+0.001
Neg logit contributions
ti-0.002
ansas-0.002
anim-0.001
velt-0.001
hor-0.001
Token.
Token ID13
Feature activation-6.05e-01

Pos logit contributions
 amen+2.341
 lent+2.224
 sidel+2.152
 conject+2.105
selves+2.074
Neg logit contributions
ansas-3.986
ti-3.951
velt-3.742
anim-3.742
inois-3.633
Token A
Token ID317
Feature activation+1.06e+00

Pos logit contributions
 amen+0.336
 lent+0.318
 sidel+0.310
selves+0.306
 conject+0.306
Neg logit contributions
ansas-0.519
ti-0.517
anim-0.489
velt-0.487
hor-0.472
Token Soy
Token ID30022
Feature activation-3.12e+00

Pos logit contributions
ti+0.821
ansas+0.819
anim+0.771
velt+0.767
hor+0.741
Neg logit contributions
 amen-0.668
 lent-0.633
selves-0.622
 sidel-0.619
 conject-0.618
Tokenuz
Token ID10277
Feature activation-1.91e+00

Pos logit contributions
 amen+2.072
 lent+2.001
 sidel+1.935
 conject+1.934
selves+1.919
Neg logit contributions
ansas-2.333
ti-2.308
anim-2.162
velt-2.158
inois-2.067
Token carrying
Token ID6872
Feature activation-4.95e-01

Pos logit contributions
 amen+1.160
 lent+1.117
 sidel+1.082
 conject+1.067
 helic+1.064
Neg logit contributions
ansas-1.545
ti-1.523
velt-1.445
anim-1.442
inois-1.389
Token NASA
Token ID8884
Feature activation-7.13e+00

Pos logit contributions
 amen+0.312
 lent+0.299
 sidel+0.291
 conject+0.288
selves+0.287
Neg logit contributions
ansas-0.389
ti-0.387
anim-0.364
velt-0.363
hor-0.350
Token astronaut
Token ID33779
Feature activation-2.16e+00

Pos logit contributions
 amen+3.731
 lent+3.647
 sidel+3.482
 conject+3.393
 helic+3.384
Neg logit contributions
ansas-6.210
ti-6.052
anim-5.782
velt-5.706
inois-5.629
Token Shane
Token ID24379
Feature activation+1.82e+00

Pos logit contributions
 amen+1.558
 lent+1.503
 sidel+1.466
 conject+1.450
selves+1.444
Neg logit contributions
ansas-1.500
ti-1.482
anim-1.385
velt-1.378
hor-1.327
Token Kim
Token ID6502
Feature activation-1.03e+00

Pos logit contributions
ti+1.373
ansas+1.360
anim+1.277
velt+1.273
hor+1.229
Neg logit contributions
 amen-1.204
 lent-1.142
 sidel-1.125
 conject-1.117
selves-1.111
Tokenb
Token ID65
Feature activation-1.74e+00

Pos logit contributions
 amen+0.698
 lent+0.669
 sidel+0.655
 conject+0.649
selves+0.648
Neg logit contributions
ansas-0.765
ti-0.762
anim-0.713
velt-0.709
hor-0.683
Tokenrough
Token ID740
Feature activation+2.19e+00

Pos logit contributions
 amen+1.421
 lent+1.382
selves+1.353
 sidel+1.352
 conject+1.348
Neg logit contributions
ansas-1.009
ti-1.005
velt-0.932
anim-0.922
hor-0.868
Token the
Token ID262
Feature activation-1.48e+00

Pos logit contributions
ansas+0.410
ti+0.410
anim+0.385
velt+0.384
hor+0.370
Neg logit contributions
 amen-0.337
 lent-0.323
 sidel-0.315
selves-0.312
 conject-0.312
Token atmosphere
Token ID8137
Feature activation-2.63e+00

Pos logit contributions
 amen+1.026
 lent+0.985
 sidel+0.964
 conject+0.957
selves+0.956
Neg logit contributions
ansas-1.071
ti-1.067
anim-0.997
velt-0.992
hor-0.954
Token different
Token ID1180
Feature activation+1.75e+00

Pos logit contributions
 amen+1.615
 lent+1.535
 sidel+1.500
 conject+1.482
selves+1.472
Neg logit contributions
ansas-2.131
ti-2.108
anim-1.984
velt-1.984
inois-1.908
Token than
Token ID621
Feature activation+1.92e-01

Pos logit contributions
ansas+1.395
ti+1.395
anim+1.313
velt+1.300
hor+1.262
Neg logit contributions
 amen-1.062
 lent-1.014
 sidel-0.990
 conject-0.982
selves-0.982
Token the
Token ID262
Feature activation+9.05e-01

Pos logit contributions
ansas+0.157
ti+0.156
anim+0.147
velt+0.147
hor+0.142
Neg logit contributions
 amen-0.115
 lent-0.109
 sidel-0.106
 conject-0.105
selves-0.105
Token quiet
Token ID5897
Feature activation-7.07e+00

Pos logit contributions
ansas+0.711
ti+0.708
anim+0.669
velt+0.665
hor+0.642
Neg logit contributions
 amen-0.558
 lent-0.536
selves-0.525
 sidel-0.523
 conject-0.520
Token colonial
Token ID17091
Feature activation-3.02e+00

Pos logit contributions
 amen+4.355
 lent+4.081
 sidel+4.011
 conject+3.947
 helic+3.902
Neg logit contributions
ansas-5.620
ti-5.587
anim-5.200
velt-5.163
inois-5.028
Token town
Token ID3240
Feature activation-2.48e+00

Pos logit contributions
 amen+2.024
 lent+1.929
 sidel+1.887
 conject+1.872
selves+1.859
Neg logit contributions
ansas-2.251
ti-2.243
anim-2.098
velt-2.087
hor-2.010
Token that
Token ID326
Feature activation-2.79e+00

Pos logit contributions
 amen+1.574
 lent+1.497
 sidel+1.457
 conject+1.445
selves+1.440
Neg logit contributions
ansas-1.942
ti-1.931
anim-1.815
velt-1.803
inois-1.735
Token attracted
Token ID12725
Feature activation-2.77e+00

Pos logit contributions
 amen+1.768
 lent+1.701
 sidel+1.643
 conject+1.623
selves+1.616
Neg logit contributions
ansas-2.184
ti-2.173
anim-2.033
velt-2.022
hor-1.955
Token them
Token ID606
Feature activation-3.71e+00

Pos logit contributions
 amen+1.651
 lent+1.565
 sidel+1.527
 conject+1.506
 helic+1.490
Neg logit contributions
ansas-2.289
ti-2.279
anim-2.128
velt-2.119
hor-2.056
Token Worlds
Token ID19946
Feature activation+3.79e+00

Pos logit contributions
ti+1.505
ansas+1.484
anim+1.401
velt+1.372
hor+1.329
Neg logit contributions
 amen-1.737
 lent-1.664
 sidel-1.644
selves-1.618
 helic-1.617
Token.
Token ID13
Feature activation-5.80e-01

Pos logit contributions
ti+2.940
ansas+2.911
velt+2.786
anim+2.774
hor+2.677
Neg logit contributions
 amen-2.362
selves-2.212
 lent-2.182
 sidel-2.178
 helic-2.141
Token
Token ID198
Feature activation-4.71e-01

Pos logit contributions
 amen+0.333
 lent+0.319
 sidel+0.307
 conject+0.307
selves+0.305
Neg logit contributions
ansas-0.484
ti-0.480
anim-0.453
velt-0.452
inois-0.438
Token
Token ID198
Feature activation+2.34e-01

Pos logit contributions
 amen+0.291
 lent+0.278
 sidel+0.272
 conject+0.270
 helic+0.269
Neg logit contributions
ansas-0.374
ti-0.374
anim-0.351
velt-0.349
hor-0.337
TokenSUP
Token ID40331
Feature activation-3.57e+00

Pos logit contributions
ansas+0.175
ti+0.175
anim+0.164
velt+0.163
hor+0.158
Neg logit contributions
 amen-0.156
 lent-0.149
 sidel-0.146
 conject-0.145
selves-0.144
TokenER
Token ID1137
Feature activation-7.59e+00

Pos logit contributions
 amen+1.738
 lent+1.640
 conject+1.574
selves+1.566
 sidel+1.566
Neg logit contributions
ansas-3.303
ti-3.256
velt-3.107
anim-3.091
inois-3.014
TokenHER
Token ID16879
Feature activation-1.98e+00

Pos logit contributions
 amen+4.666
 lent+4.526
 TRANS+4.412
selves+4.367
 helic+4.320
Neg logit contributions
ansas-5.823
ti-5.566
velt-5.373
inois-5.248
anim-5.191
TokenO
Token ID46
Feature activation-1.94e+00

Pos logit contributions
 amen+1.148
 lent+1.092
 sidel+1.065
 conject+1.055
selves+1.053
Neg logit contributions
ansas-1.646
ti-1.640
anim-1.547
velt-1.540
hor-1.489
Token
Token ID198
Feature activation+5.88e-01

Pos logit contributions
 amen+1.160
 lent+1.108
 sidel+1.076
 conject+1.068
selves+1.064
Neg logit contributions
ansas-1.589
ti-1.579
anim-1.488
velt-1.482
hor-1.431
Token
Token ID198
Feature activation+4.45e-01

Pos logit contributions
ti+0.455
ansas+0.452
anim+0.426
velt+0.420
hor+0.409
Neg logit contributions
 amen-0.378
 lent-0.363
 helic-0.356
 sidel-0.355
 conject-0.352
Token+
Token ID10
Feature activation-1.65e+00

Pos logit contributions
ansas+0.336
ti+0.334
anim+0.313
velt+0.313
hor+0.300
Neg logit contributions
 amen-0.293
 lent-0.280
 sidel-0.273
selves-0.271
 conject-0.271
Token the
Token ID262
Feature activation+1.88e+00

Pos logit contributions
ansas+0.557
ti+0.556
anim+0.525
velt+0.522
hor+0.505
Neg logit contributions
 amen-0.375
 lent-0.355
 sidel-0.346
 conject-0.343
selves-0.343
Token successful
Token ID4388
Feature activation+1.81e+00

Pos logit contributions
ti+1.590
ansas+1.578
anim+1.498
velt+1.485
hor+1.444
Neg logit contributions
 amen-1.058
 lent-0.999
 sidel-0.973
 conject-0.969
selves-0.968
Token Z
Token ID1168
Feature activation+2.20e+00

Pos logit contributions
ti+1.473
ansas+1.463
anim+1.385
velt+1.380
hor+1.333
Neg logit contributions
 amen-1.085
 lent-1.023
 sidel-1.001
selves-1.000
 conject-0.999
TokenZ
Token ID57
Feature activation-3.33e+00

Pos logit contributions
ansas+1.740
ti+1.740
anim+1.640
velt+1.609
hor+1.575
Neg logit contributions
 amen-1.383
 lent-1.338
 sidel-1.302
selves-1.298
 helic-1.293
Token P
Token ID350
Feature activation-1.20e+00

Pos logit contributions
 amen+2.189
 lent+2.091
 sidel+2.044
 conject+2.030
selves+2.022
Neg logit contributions
ansas-2.525
ti-2.508
anim-2.350
velt-2.349
hor-2.250
Tokenops
Token ID2840
Feature activation-7.94e+00

Pos logit contributions
 amen+0.811
 lent+0.775
 sidel+0.758
 conject+0.755
selves+0.747
Neg logit contributions
ansas-0.881
ti-0.874
velt-0.823
anim-0.816
inois-0.785
Token line
Token ID1627
Feature activation-6.34e+00

Pos logit contributions
 amen+4.968
 lent+4.797
 sidel+4.584
 conject+4.574
selves+4.559
Neg logit contributions
ansas-5.991
ti-5.794
anim-5.466
velt-5.452
hor-5.318
Token with
Token ID351
Feature activation+8.81e-01

Pos logit contributions
 amen+3.564
 lent+3.402
 sidel+3.283
 conject+3.195
selves+3.173
Neg logit contributions
ansas-5.201
ti-5.177
velt-4.959
anim-4.912
hor-4.745
Token Zach
Token ID18825
Feature activation+3.65e+00

Pos logit contributions
ti+0.690
ansas+0.690
anim+0.647
velt+0.642
hor+0.621
Neg logit contributions
 amen-0.554
 lent-0.527
 sidel-0.516
 conject-0.512
selves-0.510
Token Par
Token ID2547
Feature activation-5.12e-01

Pos logit contributions
ti+3.072
ansas+3.009
velt+2.836
anim+2.835
hor+2.770
Neg logit contributions
 amen-2.084
 lent-1.994
selves-1.961
 sidel-1.944
 conject-1.922
Tokenise
Token ID786
Feature activation-1.49e+00

Pos logit contributions
 amen+0.558
 lent+0.544
 sidel+0.537
 conject+0.534
selves+0.534
Neg logit contributions
ansas-0.165
ti-0.164
anim-0.140
velt-0.138
hor-0.124