TOP ACTIVATIONS
MAX = 1.934

11 , 2016 . REUTERS / Cl od agh Kil c
24 , 2016 . REUTERS / Nick Oxford / File Photo
26 , 2014 . REUTERS / En rique de la Os
28 , 2014 . REUTERS / Michael K lim enty ev
23 , 2017 . REUTERS / Joshua Roberts DE
other heavy metals . REUTERS / B obby Y ip
legraph . co . uk / business / analysis / why
legraph . co . uk / in coming / article 35
could not have been accomplished without our wonderful community enjoying many
bonded aluminum frame with an od ized tubing , features a
Memory : 2 GB GDDR 5 - 48 00 MHz
www . youtube . com / channel / UC _ K
wards . af . mil / history / docs _ html
The article was published by and translated to English by After
bring my childhood fantasy one step closer to reality .
, and the grown - ups believed what the kids were
121 months . On the flip side , there seems to
. E arl Morse K Z 8 E suggest
make the job doub ly hard for us .
a . dot . gov / pub s / 8 11

MIN ACTIVATIONS
MIN = -1.600

Bland y is an Ad j unct Professor of Economics at
of complaints . Analysis It has been clear since
. Guess what ! Ch ances are , you rarely do
for the Montreal Canadiens . Let s take a
_ P ulse Amount ]. Let s do this
revolt in the early 5 th century that was the catalyst
Most make silly jokes , drink more than they usually would
che . Thanks again for listening ! <|endoftext|> President
ograph , and so he drinks boxed wine and has these
. For example , El j ig ide y was not
their own code . So let s try to
pulls few punches , so caveat empt or .)
pat ul oy lang n atin ang at ing k amp
replace ObamaCare . He told CBS s John Dick
confident investors . Disclaimer : © 2015 Money Morning
There is no chance that you 'll be flung
would be ). Disclaimer : These tips are based
rather than a victim . Again , the United States did
the most significant show ? Again , title is important )
er , according to a CNN reporter . A

INTERVAL 1.050 - 1.934
CONTAINS 0.038%

they saw as the over polit ic isation of his role
It initially had three 90 MW t OK - 150 reactors
the Jets want to see playoff hockey this year , they
Memory : 2 GB GDDR 5 - 48 00 MHz
Mars up ial ia : Mac rop od idae ). IV

INTERVAL 0.167 - 1.050
CONTAINS 25.765%

s what this campaign is all about .
disrupt Israel s ongoing violations and increase in settlement
the 100 - player arena title made not one , but
fall ( Blu - ray / DVD + Digital Copy )
in May 2015 . According to the Orange County Register ,

INTERVAL -0.716 - 0.167
CONTAINS 73.465%

d forbid . I strongly advocate the rights of
different ways and enhance a kids imagination and problem solving skills
a paddle sweep , to a full delivery outside off stump
self has a wealth of statistics which back up the idea
each element of ` se ach Space ` to each of

INTERVAL -1.600 - -0.716
CONTAINS 0.732%

, he says j okingly . One of the
her get married ? We feel like not inviting Barbara would
oliath TBA TBA CBS All Access All
I don t feel that there was any compromise
free newsletters . Update , 3 : 01 p
Token 11
Token ID1367
Feature activation+6.55e-02

Pos logit contributions
 acquaintance+0.430
 ambush+0.404
 introdu+0.401
 equation+0.378
 jealousy+0.373
Neg logit contributions
OM-0.623
hers-0.593
yk-0.591
Mand-0.579
kefeller-0.574
Token,
Token ID11
Feature activation-2.78e-01

Pos logit contributions
OM+0.223
hers+0.210
yk+0.209
Mand+0.204
kefeller+0.201
Neg logit contributions
 acquaintance-0.181
 introdu-0.169
 ambush-0.169
 equation-0.160
 assail-0.159
Token 2016
Token ID1584
Feature activation-2.15e-01

Pos logit contributions
 acquaintance+0.825
 ambush+0.783
 introdu+0.778
 equation+0.741
 jealousy+0.734
Neg logit contributions
OM-0.878
hers-0.829
yk-0.826
Mand-0.807
kefeller-0.795
Token.
Token ID13
Feature activation+1.99e-01

Pos logit contributions
 acquaintance+0.605
 ambush+0.568
 introdu+0.567
 equation+0.536
 jealousy+0.533
Neg logit contributions
OM-0.717
hers-0.675
yk-0.672
Mand-0.658
kefeller-0.646
Token REUTERS
Token ID15862
Feature activation+2.81e-01

Pos logit contributions
OM+0.542
hers+0.500
yk+0.497
Mand+0.487
kefeller+0.472
Neg logit contributions
 acquaintance-0.688
 ambush-0.653
 introdu-0.652
 equation-0.623
 assail-0.622
Token/
Token ID14
Feature activation+1.93e+00

Pos logit contributions
OM+0.867
hers+0.818
yk+0.813
Mand+0.795
kefeller+0.783
Neg logit contributions
 acquaintance-0.855
 ambush-0.809
 introdu-0.805
 equation-0.766
 jealousy-0.762
TokenCl
Token ID2601
Feature activation+4.60e-01

Pos logit contributions
OM+5.136
Mand+4.637
hers+4.573
yk+4.502
OSH+4.351
Neg logit contributions
 acquaintance-6.832
 introdu-6.550
 ambush-6.512
 assail-6.329
 jealousy-6.167
Tokenod
Token ID375
Feature activation+6.53e-01

Pos logit contributions
OM+1.371
hers+1.287
yk+1.278
Mand+1.247
kefeller+1.224
Neg logit contributions
 acquaintance-1.456
 ambush-1.381
 introdu-1.378
 equation-1.312
 assail-1.307
Tokenagh
Token ID10471
Feature activation-9.75e-02

Pos logit contributions
OM+1.470
yk+1.382
hers+1.368
Mand+1.301
kefeller+1.265
Neg logit contributions
 acquaintance-2.530
 ambush-2.440
 introdu-2.437
 assail-2.334
 jealousy-2.328
Token Kil
Token ID19902
Feature activation-4.18e-01

Pos logit contributions
 acquaintance+0.277
 ambush+0.261
 introdu+0.260
 equation+0.246
 jealousy+0.244
Neg logit contributions
OM-0.322
hers-0.305
yk-0.301
Mand-0.296
kefeller-0.293
Tokenc
Token ID66
Feature activation-4.71e-02

Pos logit contributions
 acquaintance+1.294
 ambush+1.225
 introdu+1.212
 equation+1.168
 <-+1.163
Neg logit contributions
OM-1.252
hers-1.169
yk-1.153
Mand-1.138
kefeller-1.128
Token 24
Token ID1987
Feature activation-6.93e-02

Pos logit contributions
 acquaintance+0.116
 ambush+0.108
 introdu+0.108
 equation+0.101
 jealousy+0.100
Neg logit contributions
OM-0.172
hers-0.164
yk-0.164
Mand-0.160
kefeller-0.159
Token,
Token ID11
Feature activation+6.40e-02

Pos logit contributions
 acquaintance+0.180
 ambush+0.170
 introdu+0.169
 equation+0.159
 jealousy+0.158
Neg logit contributions
OM-0.245
hers-0.232
yk-0.231
Mand-0.226
kefeller-0.224
Token 2016
Token ID1584
Feature activation+1.64e-02

Pos logit contributions
OM+0.200
hers+0.188
yk+0.187
Mand+0.183
kefeller+0.179
Neg logit contributions
 acquaintance-0.194
 ambush-0.183
 introdu-0.182
 equation-0.173
 assail-0.173
Token.
Token ID13
Feature activation+3.64e-01

Pos logit contributions
OM+0.053
yk+0.049
hers+0.049
Mand+0.048
kefeller+0.047
Neg logit contributions
 acquaintance-0.049
 introdu-0.046
 ambush-0.046
 jealousy-0.043
 equation-0.043
Token REUTERS
Token ID15862
Feature activation+1.83e-01

Pos logit contributions
OM+0.949
hers+0.868
yk+0.863
Mand+0.850
kefeller+0.814
Neg logit contributions
 acquaintance-1.302
 ambush-1.240
 introdu-1.231
 assail-1.179
 equation-1.177
Token/
Token ID14
Feature activation+1.86e+00

Pos logit contributions
OM+0.570
hers+0.539
yk+0.536
Mand+0.524
kefeller+0.519
Neg logit contributions
 acquaintance-0.552
 ambush-0.522
 introdu-0.519
 equation-0.493
 jealousy-0.491
TokenNick
Token ID23609
Feature activation+1.53e-01

Pos logit contributions
OM+4.546
Mand+4.046
hers+4.003
yk+3.929
OSH+3.796
Neg logit contributions
 acquaintance-7.044
 introdu-6.743
 ambush-6.707
 assail-6.551
 jealousy-6.408
Token Oxford
Token ID13643
Feature activation-9.47e-02

Pos logit contributions
OM+0.500
hers+0.473
yk+0.469
Mand+0.460
kefeller+0.452
Neg logit contributions
 acquaintance-0.442
 ambush-0.417
 introdu-0.415
 equation-0.393
 assail-0.392
Token/
Token ID14
Feature activation+7.35e-01

Pos logit contributions
 acquaintance+0.288
 ambush+0.272
 introdu+0.272
 assail+0.257
 jealousy+0.257
Neg logit contributions
OM-0.297
hers-0.278
yk-0.277
Mand-0.270
kefeller-0.266
TokenFile
Token ID8979
Feature activation-2.64e-01

Pos logit contributions
OM+2.525
hers+2.339
yk+2.339
Mand+2.315
kefeller+2.250
Neg logit contributions
 acquaintance-2.022
 ambush-1.906
 introdu-1.902
 equation-1.784
 assail-1.783
Token Photo
Token ID5555
Feature activation-5.21e-01

Pos logit contributions
 acquaintance+0.815
 ambush+0.776
 introdu+0.772
 assail+0.734
 equation+0.730
Neg logit contributions
OM-0.807
hers-0.757
yk-0.755
Mand-0.735
kefeller-0.724
Token 26
Token ID2608
Feature activation-7.76e-02

Pos logit contributions
 acquaintance+0.883
 ambush+0.829
 introdu+0.818
 equation+0.773
 jealousy+0.763
Neg logit contributions
OM-1.356
hers-1.300
yk-1.299
kefeller-1.276
Mand-1.267
Token,
Token ID11
Feature activation+1.25e-01

Pos logit contributions
 acquaintance+0.199
 ambush+0.187
 introdu+0.185
 equation+0.175
 jealousy+0.173
Neg logit contributions
OM-0.277
hers-0.263
yk-0.262
Mand-0.257
kefeller-0.256
Token 2014
Token ID1946
Feature activation-2.24e-01

Pos logit contributions
OM+0.380
hers+0.357
yk+0.354
Mand+0.347
kefeller+0.340
Neg logit contributions
 acquaintance-0.386
 ambush-0.365
 introdu-0.364
 equation-0.346
 assail-0.345
Token.
Token ID13
Feature activation+1.50e-01

Pos logit contributions
 acquaintance+0.597
 ambush+0.563
 introdu+0.559
 equation+0.528
 jealousy+0.524
Neg logit contributions
OM-0.773
hers-0.735
yk-0.732
Mand-0.717
kefeller-0.711
Token REUTERS
Token ID15862
Feature activation-9.06e-02

Pos logit contributions
OM+0.398
hers+0.368
yk+0.363
Mand+0.359
kefeller+0.346
Neg logit contributions
 acquaintance-0.529
 ambush-0.503
 introdu-0.501
 equation-0.480
 assail-0.479
Token/
Token ID14
Feature activation+1.86e+00

Pos logit contributions
 acquaintance+0.261
 ambush+0.246
 introdu+0.242
 <-+0.232
 equation+0.231
Neg logit contributions
OM-0.289
hers-0.276
yk-0.276
kefeller-0.271
Mand-0.269
TokenEn
Token ID4834
Feature activation+2.06e-01

Pos logit contributions
OM+4.856
Mand+4.397
hers+4.340
yk+4.177
OSH+3.986
Neg logit contributions
 acquaintance-6.849
 ambush-6.528
 introdu-6.500
 assail-6.292
 jealousy-6.200
Tokenrique
Token ID33865
Feature activation+2.54e-01

Pos logit contributions
OM+0.544
hers+0.508
yk+0.502
Mand+0.491
kefeller+0.482
Neg logit contributions
 acquaintance-0.718
 ambush-0.683
 introdu-0.680
 equation-0.656
 jealousy-0.647
Token de
Token ID390
Feature activation-4.70e-01

Pos logit contributions
OM+0.753
hers+0.705
yk+0.697
Mand+0.687
kefeller+0.668
Neg logit contributions
 acquaintance-0.811
 introdu-0.765
 ambush-0.765
 equation-0.728
 assail-0.727
Token la
Token ID8591
Feature activation+3.15e-01

Pos logit contributions
 acquaintance+1.374
 ambush+1.311
 introdu+1.296
 equation+1.235
 jealousy+1.228
Neg logit contributions
OM-1.484
yk-1.430
hers-1.423
kefeller-1.393
Mand-1.364
Token Os
Token ID8834
Feature activation-1.98e-01

Pos logit contributions
OM+1.040
hers+0.980
yk+0.970
Mand+0.957
kefeller+0.925
Neg logit contributions
 acquaintance-0.902
 introdu-0.845
 ambush-0.845
 equation-0.796
 assail-0.796
Token 28
Token ID2579
Feature activation-1.34e-01

Pos logit contributions
 acquaintance+1.568
 ambush+1.468
 introdu+1.449
 equation+1.369
 <-+1.362
Neg logit contributions
OM-2.472
yk-2.371
hers-2.368
kefeller-2.357
Mand-2.324
Token,
Token ID11
Feature activation-2.59e-01

Pos logit contributions
 acquaintance+0.345
 ambush+0.324
 introdu+0.322
 equation+0.303
 jealousy+0.301
Neg logit contributions
OM-0.474
hers-0.452
yk-0.450
Mand-0.440
kefeller-0.438
Token 2014
Token ID1946
Feature activation-4.36e-01

Pos logit contributions
 acquaintance+0.761
 ambush+0.723
 introdu+0.716
 equation+0.682
 jealousy+0.677
Neg logit contributions
OM-0.826
hers-0.781
yk-0.778
Mand-0.761
kefeller-0.752
Token.
Token ID13
Feature activation+1.27e-01

Pos logit contributions
 acquaintance+1.168
 ambush+1.101
 introdu+1.094
 equation+1.034
 jealousy+1.025
Neg logit contributions
OM-1.507
hers-1.430
yk-1.426
Mand-1.397
kefeller-1.381
Token REUTERS
Token ID15862
Feature activation-1.49e-01

Pos logit contributions
OM+0.339
hers+0.312
yk+0.310
Mand+0.304
kefeller+0.294
Neg logit contributions
 acquaintance-0.449
 ambush-0.428
 introdu-0.426
 equation-0.408
 assail-0.407
Token/
Token ID14
Feature activation+1.81e+00

Pos logit contributions
 acquaintance+0.435
 ambush+0.409
 introdu+0.404
 equation+0.385
 <-+0.385
Neg logit contributions
OM-0.472
hers-0.451
yk-0.450
kefeller-0.440
Mand-0.440
TokenMichael
Token ID13256
Feature activation+1.01e-01

Pos logit contributions
OM+4.531
hers+4.048
Mand+3.962
yk+3.885
OSH+3.741
Neg logit contributions
 acquaintance-6.838
 ambush-6.612
 introdu-6.507
 assail-6.330
racuse-6.228
Token K
Token ID509
Feature activation-1.51e-01

Pos logit contributions
OM+0.329
hers+0.311
yk+0.308
Mand+0.304
kefeller+0.297
Neg logit contributions
 acquaintance-0.288
 ambush-0.272
 introdu-0.270
 equation-0.257
 jealousy-0.255
Tokenlim
Token ID2475
Feature activation+9.43e-01

Pos logit contributions
 acquaintance+0.606
 ambush+0.579
 introdu+0.577
 equation+0.558
 jealousy+0.553
Neg logit contributions
OM-0.317
hers-0.290
yk-0.288
Mand-0.281
kefeller-0.272
Tokenenty
Token ID3787
Feature activation-2.58e-01

Pos logit contributions
OM+1.970
hers+1.810
yk+1.802
Mand+1.651
OSH+1.602
Neg logit contributions
 acquaintance-3.867
 ambush-3.753
 introdu-3.745
 jealousy-3.621
 assail-3.610
Tokenev
Token ID1990
Feature activation+1.68e-02

Pos logit contributions
 acquaintance+0.952
 ambush+0.909
 introdu+0.907
 equation+0.870
 assail+0.866
Neg logit contributions
OM-0.635
hers-0.589
yk-0.581
Mand-0.567
kefeller-0.555
Token 23
Token ID2242
Feature activation-3.69e-01

Pos logit contributions
 acquaintance+0.840
 ambush+0.789
 introdu+0.783
 equation+0.738
 <-+0.732
Neg logit contributions
OM-1.324
yk-1.268
hers-1.267
kefeller-1.256
Mand-1.242
Token,
Token ID11
Feature activation-2.71e-01

Pos logit contributions
 acquaintance+0.942
 ambush+0.889
 introdu+0.882
 equation+0.827
 jealousy+0.820
Neg logit contributions
OM-1.313
hers-1.250
yk-1.245
Mand-1.220
kefeller-1.214
Token 2017
Token ID2177
Feature activation-4.59e-01

Pos logit contributions
 acquaintance+0.826
 ambush+0.776
 introdu+0.775
 assail+0.736
 equation+0.736
Neg logit contributions
OM-0.850
hers-0.802
yk-0.789
Mand-0.769
kefeller-0.756
Token.
Token ID13
Feature activation+8.85e-02

Pos logit contributions
 acquaintance+1.312
 ambush+1.229
 introdu+1.226
 equation+1.163
 jealousy+1.157
Neg logit contributions
OM-1.525
hers-1.434
yk-1.425
Mand-1.395
kefeller-1.371
Token REUTERS
Token ID15862
Feature activation-4.55e-03

Pos logit contributions
OM+0.237
hers+0.220
yk+0.218
Mand+0.213
kefeller+0.207
Neg logit contributions
 acquaintance-0.309
 ambush-0.294
 introdu-0.293
 equation-0.281
 assail-0.279
Token/
Token ID14
Feature activation+1.70e+00

Pos logit contributions
 acquaintance+0.013
 ambush+0.013
 introdu+0.012
 <-+0.012
 equation+0.012
Neg logit contributions
OM-0.014
hers-0.014
yk-0.014
Mand-0.013
kefeller-0.013
TokenJoshua
Token ID47740
Feature activation-1.27e-01

Pos logit contributions
OM+4.290
hers+3.828
Mand+3.827
yk+3.767
OSH+3.583
Neg logit contributions
 acquaintance-6.341
 ambush-6.001
 introdu-5.996
 assail-5.846
 equation-5.721
Token Roberts
Token ID10918
Feature activation-3.80e-01

Pos logit contributions
 acquaintance+0.433
 ambush+0.416
 introdu+0.410
 equation+0.398
 jealousy+0.394
Neg logit contributions
OM-0.346
hers-0.323
yk-0.322
Mand-0.314
kefeller-0.304
Token
Token ID198
Feature activation-8.78e-01

Pos logit contributions
 acquaintance+1.093
 ambush+1.031
 introdu+1.027
 equation+0.973
 assail+0.968
Neg logit contributions
OM-1.242
hers-1.173
yk-1.165
Mand-1.140
kefeller-1.121
Token
Token ID198
Feature activation-6.38e-01

Pos logit contributions
 acquaintance+2.087
 ambush+1.996
 introdu+1.942
 <-+1.900
 equation+1.834
Neg logit contributions
OM-3.162
kefeller-3.054
yk-3.042
hers-3.037
Mand-2.942
TokenDE
Token ID7206
Feature activation-9.28e-01

Pos logit contributions
 acquaintance+2.130
 ambush+2.039
 introdu+2.020
 <-+1.946
 equation+1.937
Neg logit contributions
OM-1.707
hers-1.618
yk-1.615
kefeller-1.570
Mand-1.538
Token other
Token ID584
Feature activation-2.77e-01

Pos logit contributions
 acquaintance+0.257
 ambush+0.243
 introdu+0.243
 equation+0.231
 assail+0.230
Neg logit contributions
OM-0.242
hers-0.227
yk-0.225
Mand-0.220
kefeller-0.216
Token heavy
Token ID4334
Feature activation-4.18e-01

Pos logit contributions
 acquaintance+0.845
 ambush+0.802
 introdu+0.798
 equation+0.760
 jealousy+0.753
Neg logit contributions
OM-0.857
hers-0.808
yk-0.805
Mand-0.785
kefeller-0.774
Token metals
Token ID21782
Feature activation-7.00e-01

Pos logit contributions
 acquaintance+1.183
 ambush+1.114
 introdu+1.110
 equation+1.053
 assail+1.043
Neg logit contributions
OM-1.390
hers-1.313
yk-1.306
Mand-1.279
kefeller-1.260
Token.
Token ID13
Feature activation+5.94e-02

Pos logit contributions
 acquaintance+1.847
 ambush+1.734
 introdu+1.726
 equation+1.634
 jealousy+1.617
Neg logit contributions
OM-2.438
yk-2.312
hers-2.307
Mand-2.269
kefeller-2.242
Token REUTERS
Token ID15862
Feature activation-5.91e-02

Pos logit contributions
OM+0.200
hers+0.189
yk+0.188
Mand+0.184
kefeller+0.181
Neg logit contributions
 acquaintance-0.165
 ambush-0.155
 introdu-0.155
 equation-0.146
 assail-0.146
Token/
Token ID14
Feature activation+1.64e+00

Pos logit contributions
 acquaintance+0.184
 ambush+0.175
 introdu+0.174
 equation+0.166
 assail+0.165
Neg logit contributions
OM-0.180
hers-0.169
yk-0.167
Mand-0.163
kefeller-0.160
TokenB
Token ID33
Feature activation+1.48e-01

Pos logit contributions
OM+4.390
hers+4.023
Mand+4.012
yk+3.906
OSH+3.751
Neg logit contributions
 acquaintance-5.696
 ambush-5.480
 introdu-5.421
 assail-5.236
 jealousy-5.162
Tokenobby
Token ID11369
Feature activation+7.70e-01

Pos logit contributions
OM+0.459
hers+0.431
yk+0.428
Mand+0.419
kefeller+0.408
Neg logit contributions
 acquaintance-0.455
 ambush-0.432
 introdu-0.431
 equation-0.409
 assail-0.408
Token Y
Token ID575
Feature activation-1.87e-02

Pos logit contributions
OM+2.081
yk+1.972
 Mills+1.911
hers+1.904
Mand+1.895
Neg logit contributions
 acquaintance-2.655
 introdu-2.534
 ambush-2.516
 assail-2.449
 appro-2.382
Tokenip
Token ID541
Feature activation-3.01e-01

Pos logit contributions
 acquaintance+0.057
 ambush+0.054
 introdu+0.054
 equation+0.051
 assail+0.051
Neg logit contributions
OM-0.058
hers-0.054
yk-0.054
Mand-0.053
kefeller-0.052
Token
Token ID198
Feature activation-4.27e-01

Pos logit contributions
 acquaintance+0.854
 ambush+0.805
 introdu+0.802
 equation+0.759
 assail+0.755
Neg logit contributions
OM-0.996
hers-0.941
yk-0.935
Mand-0.916
kefeller-0.900
Tokenlegraph
Token ID16606
Feature activation+1.45e-01

Pos logit contributions
OM+0.201
yk+0.187
kefeller+0.185
hers+0.184
Mand+0.181
Neg logit contributions
 acquaintance-0.259
 ambush-0.248
 introdu-0.244
 equation-0.235
 jealousy-0.235
Token.
Token ID13
Feature activation+2.60e-01

Pos logit contributions
OM+0.498
hers+0.472
yk+0.470
Mand+0.461
kefeller+0.457
Neg logit contributions
 acquaintance-0.389
 ambush-0.367
 introdu-0.364
 equation-0.344
 <--0.343
Tokenco
Token ID1073
Feature activation+2.31e-01

Pos logit contributions
OM+0.699
hers+0.649
yk+0.646
Mand+0.631
kefeller+0.619
Neg logit contributions
 acquaintance-0.900
 ambush-0.857
 introdu-0.854
 equation-0.820
 jealousy-0.814
Token.
Token ID13
Feature activation+3.77e-01

Pos logit contributions
OM+0.784
hers+0.742
yk+0.739
Mand+0.724
kefeller+0.714
Neg logit contributions
 acquaintance-0.634
 ambush-0.597
 introdu-0.594
 equation-0.562
 jealousy-0.558
Tokenuk
Token ID2724
Feature activation+4.74e-01

Pos logit contributions
OM+1.098
hers+1.029
yk+1.023
Mand+1.003
kefeller+0.995
Neg logit contributions
 acquaintance-1.206
 ambush-1.147
 introdu-1.141
 equation-1.087
 jealousy-1.082
Token/
Token ID14
Feature activation+1.63e+00

Pos logit contributions
OM+1.419
hers+1.328
yk+1.317
Mand+1.288
kefeller+1.246
Neg logit contributions
 acquaintance-1.515
 introdu-1.429
 ambush-1.428
 assail-1.365
 equation-1.358
Tokenbusiness
Token ID22680
Feature activation+4.63e-04

Pos logit contributions
OM+4.426
hers+4.070
yk+4.023
Mand+3.962
kefeller+3.726
Neg logit contributions
 acquaintance-5.630
 ambush-5.384
 introdu-5.288
 assail-5.139
 jealousy-5.109
Token/
Token ID14
Feature activation+1.15e+00

Pos logit contributions
OM+0.001
hers+0.001
yk+0.001
Mand+0.001
kefeller+0.001
Neg logit contributions
 acquaintance-0.001
 ambush-0.001
 introdu-0.001
 equation-0.001
 jealousy-0.001
Tokenanalysis
Token ID20930
Feature activation-2.43e-01

Pos logit contributions
OM+3.333
hers+3.138
yk+3.112
Mand+3.007
kefeller+2.879
Neg logit contributions
 acquaintance-3.765
 ambush-3.562
 introdu-3.536
 assail-3.399
 jealousy-3.374
Token/
Token ID14
Feature activation+5.51e-01

Pos logit contributions
 acquaintance+0.765
 ambush+0.721
 introdu+0.721
 equation+0.685
 jealousy+0.684
Neg logit contributions
OM-0.737
hers-0.691
yk-0.686
Mand-0.671
kefeller-0.657
Tokenwhy
Token ID22850
Feature activation-4.85e-01

Pos logit contributions
OM+1.747
hers+1.649
yk+1.639
Mand+1.591
kefeller+1.549
Neg logit contributions
 acquaintance-1.671
 ambush-1.570
 introdu-1.565
 assail-1.490
 equation-1.484
Tokenlegraph
Token ID16606
Feature activation+2.56e-01

Pos logit contributions
OM+0.342
yk+0.316
hers+0.315
Mand+0.309
kefeller+0.305
Neg logit contributions
 acquaintance-0.466
 ambush-0.446
 introdu-0.442
 equation-0.424
 jealousy-0.422
Token.
Token ID13
Feature activation+3.98e-01

Pos logit contributions
OM+0.866
hers+0.820
yk+0.815
Mand+0.798
kefeller+0.787
Neg logit contributions
 acquaintance-0.704
 ambush-0.663
 introdu-0.660
 equation-0.624
 jealousy-0.620
Tokenco
Token ID1073
Feature activation+4.04e-01

Pos logit contributions
OM+1.063
hers+0.990
yk+0.983
Mand+0.958
kefeller+0.942
Neg logit contributions
 acquaintance-1.378
 ambush-1.313
 introdu-1.309
 equation-1.254
 jealousy-1.247
Token.
Token ID13
Feature activation+5.23e-01

Pos logit contributions
OM+1.348
hers+1.271
yk+1.261
Mand+1.236
kefeller+1.209
Neg logit contributions
 acquaintance-1.145
 ambush-1.075
 introdu-1.074
 assail-1.018
 equation-1.016
Tokenuk
Token ID2724
Feature activation+6.09e-01

Pos logit contributions
OM+1.494
hers+1.398
yk+1.388
Mand+1.354
kefeller+1.328
Neg logit contributions
 acquaintance-1.723
 ambush-1.638
 introdu-1.633
 equation-1.559
 assail-1.551
Token/
Token ID14
Feature activation+1.62e+00

Pos logit contributions
OM+1.811
hers+1.688
yk+1.675
Mand+1.639
kefeller+1.576
Neg logit contributions
 acquaintance-1.967
 introdu-1.856
 ambush-1.854
 assail-1.784
 equation-1.765
Tokenin
Token ID259
Feature activation+5.41e-02

Pos logit contributions
OM+4.530
hers+4.180
yk+4.131
Mand+4.097
kefeller+3.865
Neg logit contributions
 acquaintance-5.433
 ambush-5.204
 introdu-5.125
 assail-4.959
 jealousy-4.927
Tokencoming
Token ID4976
Feature activation-4.93e-02

Pos logit contributions
OM+0.103
yk+0.094
hers+0.092
Mand+0.090
kefeller+0.089
Neg logit contributions
 acquaintance-0.228
 ambush-0.220
 introdu-0.219
 equation-0.211
bench-0.209
Token/
Token ID14
Feature activation+7.49e-01

Pos logit contributions
 acquaintance+0.152
 ambush+0.144
 introdu+0.143
 equation+0.136
 assail+0.135
Neg logit contributions
OM-0.152
hers-0.143
yk-0.142
Mand-0.138
kefeller-0.136
Tokenarticle
Token ID20205
Feature activation-3.69e-01

Pos logit contributions
OM+1.991
hers+1.845
yk+1.826
Mand+1.781
kefeller+1.723
Neg logit contributions
 acquaintance-2.641
 ambush-2.510
 introdu-2.498
 jealousy-2.391
 assail-2.391
Token35
Token ID2327
Feature activation-1.42e-01

Pos logit contributions
 acquaintance+1.233
 ambush+1.181
 introdu+1.165
 equation+1.114
 <-+1.113
Neg logit contributions
OM-1.023
hers-0.954
yk-0.947
Mand-0.925
kefeller-0.915
Token could
Token ID714
Feature activation-1.41e-01

Pos logit contributions
OM+0.059
hers+0.056
yk+0.055
Mand+0.054
kefeller+0.053
Neg logit contributions
 acquaintance-0.045
 ambush-0.042
 introdu-0.042
 equation-0.040
 assail-0.040
Token not
Token ID407
Feature activation-1.31e-01

Pos logit contributions
 acquaintance+0.376
 ambush+0.354
 introdu+0.352
 equation+0.333
 assail+0.330
Neg logit contributions
OM-0.492
hers-0.466
yk-0.464
Mand-0.455
kefeller-0.448
Token have
Token ID423
Feature activation+4.19e-01

Pos logit contributions
 acquaintance+0.332
 ambush+0.309
 introdu+0.308
 equation+0.288
 assail+0.288
Neg logit contributions
OM-0.474
hers-0.452
yk-0.446
Mand-0.437
kefeller-0.427
Token been
Token ID587
Feature activation+3.60e-01

Pos logit contributions
OM+1.356
hers+1.290
yk+1.262
Mand+1.244
kefeller+1.202
Neg logit contributions
 acquaintance-1.214
 ambush-1.148
 introdu-1.138
 assail-1.085
 equation-1.077
Token accomplished
Token ID13013
Feature activation+5.16e-01

Pos logit contributions
OM+0.949
hers+0.887
yk+0.875
Mand+0.853
kefeller+0.827
Neg logit contributions
 acquaintance-1.257
 ambush-1.202
 introdu-1.195
 assail-1.145
 equation-1.144
Token without
Token ID1231
Feature activation+1.52e+00

Pos logit contributions
OM+1.556
hers+1.463
yk+1.460
Mand+1.405
kefeller+1.366
Neg logit contributions
 acquaintance-1.597
 ambush-1.521
 introdu-1.508
 assail-1.447
 equation-1.435
Token our
Token ID674
Feature activation+4.07e-01

Pos logit contributions
OM+5.167
yk+4.820
hers+4.783
Mand+4.640
OSH+4.542
Neg logit contributions
 acquaintance-3.956
 ambush-3.801
 introdu-3.650
 assail-3.581
aign-3.560
Token wonderful
Token ID7932
Feature activation+1.26e-01

Pos logit contributions
OM+1.345
hers+1.268
yk+1.257
Mand+1.230
kefeller+1.206
Neg logit contributions
 acquaintance-1.149
 ambush-1.089
 introdu-1.080
 assail-1.026
 equation-1.025
Token community
Token ID2055
Feature activation-1.42e-01

Pos logit contributions
OM+0.400
hers+0.377
yk+0.374
Mand+0.366
kefeller+0.359
Neg logit contributions
 acquaintance-0.371
 ambush-0.352
 introdu-0.349
 equation-0.332
 assail-0.332
Token enjoying
Token ID13226
Feature activation+5.07e-02

Pos logit contributions
 acquaintance+0.385
 ambush+0.364
 introdu+0.360
 assail+0.341
 equation+0.340
Neg logit contributions
OM-0.487
hers-0.461
yk-0.456
Mand-0.447
kefeller-0.438
Token many
Token ID867
Feature activation-7.95e-02

Pos logit contributions
OM+0.170
hers+0.160
yk+0.159
Mand+0.156
kefeller+0.154
Neg logit contributions
 acquaintance-0.142
 ambush-0.134
 introdu-0.133
 equation-0.126
 assail-0.126
Token bonded
Token ID40270
Feature activation+5.13e-02

Pos logit contributions
OM+0.834
hers+0.787
yk+0.782
Mand+0.766
kefeller+0.753
Neg logit contributions
 acquaintance-0.726
 ambush-0.684
 introdu-0.683
 equation-0.646
 assail-0.644
Token aluminum
Token ID15955
Feature activation+6.93e-03

Pos logit contributions
OM+0.163
hers+0.154
yk+0.153
Mand+0.149
kefeller+0.147
Neg logit contributions
 acquaintance-0.152
 ambush-0.144
 introdu-0.144
 equation-0.136
 assail-0.136
Token frame
Token ID5739
Feature activation-2.10e-01

Pos logit contributions
OM+0.026
hers+0.025
yk+0.024
Mand+0.024
kefeller+0.024
Neg logit contributions
 acquaintance-0.017
 ambush-0.016
 introdu-0.016
 equation-0.015
 assail-0.015
Token with
Token ID351
Feature activation-1.75e-01

Pos logit contributions
 acquaintance+0.565
 ambush+0.531
 introdu+0.529
 equation+0.500
 assail+0.496
Neg logit contributions
OM-0.727
hers-0.689
yk-0.685
Mand-0.672
kefeller-0.662
Token an
Token ID281
Feature activation+1.95e-01

Pos logit contributions
 acquaintance+0.484
 ambush+0.456
 introdu+0.454
 equation+0.430
 jealousy+0.426
Neg logit contributions
OM-0.591
hers-0.559
yk-0.557
Mand-0.545
kefeller-0.538
Tokenod
Token ID375
Feature activation+1.44e+00

Pos logit contributions
OM+0.640
hers+0.603
yk+0.595
Mand+0.584
kefeller+0.572
Neg logit contributions
 acquaintance-0.556
 introdu-0.525
 ambush-0.523
 assail-0.496
 jealousy-0.495
Tokenized
Token ID1143
Feature activation+4.99e-01

Pos logit contributions
OM+4.785
hers+4.684
yk+4.417
Mand+4.271
kefeller+4.190
Neg logit contributions
 acquaintance-3.942
 introdu-3.751
 ambush-3.707
 jealousy-3.540
 <--3.501
Token tubing
Token ID43128
Feature activation+3.01e-01

Pos logit contributions
OM+1.284
hers+1.186
yk+1.160
Mand+1.132
kefeller+1.085
Neg logit contributions
 acquaintance-1.777
 introdu-1.690
 ambush-1.682
 assail-1.619
 equation-1.610
Token,
Token ID11
Feature activation+1.41e-01

Pos logit contributions
OM+1.009
hers+0.959
yk+0.943
Mand+0.922
kefeller+0.902
Neg logit contributions
 acquaintance-0.836
 ambush-0.782
 introdu-0.779
 assail-0.737
 equation-0.735
Token features
Token ID3033
Feature activation-4.16e-01

Pos logit contributions
OM+0.478
hers+0.452
yk+0.450
Mand+0.441
kefeller+0.434
Neg logit contributions
 acquaintance-0.390
 ambush-0.367
 introdu-0.366
 equation-0.346
 assail-0.344
Token a
Token ID257
Feature activation+2.76e-01

Pos logit contributions
 acquaintance+1.076
 ambush+1.014
 introdu+1.007
 equation+0.956
 jealousy+0.942
Neg logit contributions
OM-1.472
yk-1.396
hers-1.394
Mand-1.367
kefeller-1.352
Token
Token ID198
Feature activation+1.19e-01

Pos logit contributions
 acquaintance+0.187
 ambush+0.176
 introdu+0.175
 equation+0.165
 <-+0.164
Neg logit contributions
OM-0.241
hers-0.228
yk-0.228
Mand-0.222
kefeller-0.222
TokenMemory
Token ID30871
Feature activation+6.83e-01

Pos logit contributions
OM+0.325
hers+0.301
yk+0.298
Mand+0.292
kefeller+0.283
Neg logit contributions
 acquaintance-0.411
 ambush-0.391
 introdu-0.391
 equation-0.373
 assail-0.372
Token:
Token ID25
Feature activation-1.69e-01

Pos logit contributions
OM+2.077
hers+1.905
Mand+1.866
yk+1.828
OSH+1.787
Neg logit contributions
 acquaintance-2.140
 ambush-2.036
 introdu-2.017
 jealousy-1.931
 assail-1.931
Token 2
Token ID362
Feature activation+2.68e-01

Pos logit contributions
 acquaintance+0.454
 ambush+0.427
 introdu+0.425
 equation+0.401
 assail+0.399
Neg logit contributions
OM-0.585
hers-0.554
yk-0.550
Mand-0.540
kefeller-0.531
TokenGB
Token ID4579
Feature activation+7.68e-01

Pos logit contributions
OM+0.971
hers+0.916
yk+0.911
Mand+0.895
kefeller+0.870
Neg logit contributions
 acquaintance-0.684
 ambush-0.635
 introdu-0.633
 assail-0.594
 jealousy-0.594
Token GDDR
Token ID50092
Feature activation+1.39e+00

Pos logit contributions
OM+1.973
hers+1.813
Mand+1.744
yk+1.740
OSH+1.656
Neg logit contributions
 acquaintance-2.777
 ambush-2.633
 introdu-2.630
 assail-2.528
 jealousy-2.521
Token5
Token ID20
Feature activation+3.58e-02

Pos logit contributions
OM+4.127
Mand+3.730
hers+3.725
yk+3.680
OSH+3.508
Neg logit contributions
 acquaintance-4.342
 introdu-4.225
 ambush-4.187
 assail-4.098
 jealousy-4.021
Token -
Token ID532
Feature activation-1.71e-03

Pos logit contributions
OM+0.117
hers+0.110
yk+0.110
Mand+0.108
kefeller+0.105
Neg logit contributions
 acquaintance-0.103
 ambush-0.097
 introdu-0.097
 equation-0.092
 assail-0.092
Token 48
Token ID4764
Feature activation+6.04e-01

Pos logit contributions
 acquaintance+0.005
 ambush+0.004
 introdu+0.004
 equation+0.004
 assail+0.004
Neg logit contributions
OM-0.006
hers-0.006
yk-0.006
Mand-0.005
kefeller-0.005
Token00
Token ID405
Feature activation+4.20e-02

Pos logit contributions
OM+1.847
hers+1.715
yk+1.715
Mand+1.676
kefeller+1.630
Neg logit contributions
 acquaintance-1.875
 introdu-1.774
 ambush-1.771
 assail-1.679
 jealousy-1.679
Token MHz
Token ID19805
Feature activation+7.69e-01

Pos logit contributions
OM+0.108
hers+0.100
yk+0.099
Mand+0.097
kefeller+0.095
Neg logit contributions
 acquaintance-0.150
 ambush-0.143
 introdu-0.143
 equation-0.137
 assail-0.136
Tokenwww
Token ID2503
Feature activation-5.92e-02

Pos logit contributions
 acquaintance+1.199
 ambush+1.129
 introdu+1.126
 equation+1.066
 assail+1.060
Neg logit contributions
OM-1.474
hers-1.397
yk-1.380
Mand-1.360
kefeller-1.343
Token.
Token ID13
Feature activation+2.79e-01

Pos logit contributions
 acquaintance+0.162
 ambush+0.152
 introdu+0.152
 equation+0.143
 jealousy+0.142
Neg logit contributions
OM-0.201
hers-0.191
yk-0.190
Mand-0.186
kefeller-0.184
Tokenyoutube
Token ID11604
Feature activation+1.91e-02

Pos logit contributions
OM+0.772
hers+0.723
yk+0.722
Mand+0.697
kefeller+0.678
Neg logit contributions
 acquaintance-0.948
 ambush-0.901
 introdu-0.900
 equation-0.859
 jealousy-0.854
Token.
Token ID13
Feature activation+4.35e-01

Pos logit contributions
OM+0.065
hers+0.062
yk+0.061
Mand+0.060
kefeller+0.059
Neg logit contributions
 acquaintance-0.052
 ambush-0.049
 introdu-0.049
 equation-0.046
 jealousy-0.046
Tokencom
Token ID785
Feature activation+9.24e-01

Pos logit contributions
OM+1.604
hers+1.527
yk+1.518
Mand+1.487
kefeller+1.454
Neg logit contributions
 acquaintance-1.076
 ambush-1.002
 introdu-0.997
 assail-0.943
 equation-0.938
Token/
Token ID14
Feature activation+1.39e+00

Pos logit contributions
OM+2.561
yk+2.388
hers+2.311
Mand+2.273
kefeller+2.194
Neg logit contributions
 acquaintance-3.093
 ambush-2.909
 introdu-2.906
 equation-2.798
 jealousy-2.786
Tokenchannel
Token ID17620
Feature activation-2.42e-01

Pos logit contributions
OM+4.422
yk+4.214
hers+4.099
Mand+4.054
kefeller+3.841
Neg logit contributions
 acquaintance-4.060
 introdu-3.819
 ambush-3.818
 equation-3.619
 jealousy-3.614
Token/
Token ID14
Feature activation+6.27e-01

Pos logit contributions
 acquaintance+0.718
 ambush+0.684
 introdu+0.678
 <-+0.645
 equation+0.644
Neg logit contributions
OM-0.763
hers-0.721
yk-0.717
Mand-0.699
kefeller-0.697
TokenUC
Token ID9598
Feature activation+2.41e-02

Pos logit contributions
OM+1.597
hers+1.466
yk+1.459
Mand+1.418
kefeller+1.372
Neg logit contributions
 acquaintance-2.281
 ambush-2.174
 introdu-2.171
 equation-2.082
 assail-2.081
Token_
Token ID62
Feature activation-1.09e-01

Pos logit contributions
OM+0.071
hers+0.067
yk+0.066
Mand+0.065
kefeller+0.064
Neg logit contributions
 acquaintance-0.077
 ambush-0.073
 introdu-0.073
 equation-0.069
 jealousy-0.069
TokenK
Token ID42
Feature activation-3.65e-01

Pos logit contributions
 acquaintance+0.356
 ambush+0.338
 introdu+0.337
 equation+0.321
 jealousy+0.320
Neg logit contributions
OM-0.309
hers-0.291
yk-0.287
Mand-0.282
kefeller-0.278
Tokenwards
Token ID2017
Feature activation-1.50e-01

Pos logit contributions
 acquaintance+0.143
 ambush+0.136
 introdu+0.136
 equation+0.130
 assail+0.130
Neg logit contributions
OM-0.101
hers-0.093
yk-0.093
Mand-0.091
kefeller-0.089
Token.
Token ID13
Feature activation+1.78e-01

Pos logit contributions
 acquaintance+0.391
 ambush+0.366
 introdu+0.365
 equation+0.342
 jealousy+0.342
Neg logit contributions
OM-0.534
hers-0.507
yk-0.505
Mand-0.493
kefeller-0.486
Tokenaf
Token ID1878
Feature activation+5.75e-02

Pos logit contributions
OM+0.479
hers+0.445
yk+0.443
Mand+0.430
kefeller+0.421
Neg logit contributions
 acquaintance-0.617
 ambush-0.587
 introdu-0.586
 equation-0.560
 jealousy-0.558
Token.
Token ID13
Feature activation+2.57e-01

Pos logit contributions
OM+0.176
hers+0.165
yk+0.165
Mand+0.160
kefeller+0.157
Neg logit contributions
 acquaintance-0.177
 ambush-0.168
 introdu-0.167
 jealousy-0.159
 equation-0.159
Tokenmil
Token ID25433
Feature activation+5.21e-01

Pos logit contributions
OM+0.500
hers+0.452
yk+0.448
Mand+0.429
kefeller+0.415
Neg logit contributions
 acquaintance-1.080
 ambush-1.036
 introdu-1.035
 equation-0.998
 jealousy-0.996
Token/
Token ID14
Feature activation+1.38e+00

Pos logit contributions
OM+1.525
hers+1.420
yk+1.416
Mand+1.375
kefeller+1.346
Neg logit contributions
 acquaintance-1.676
 ambush-1.595
 introdu-1.582
 assail-1.514
 jealousy-1.510
Tokenhistory
Token ID23569
Feature activation+4.83e-01

Pos logit contributions
OM+4.390
yk+4.085
hers+4.038
Mand+3.957
kefeller+3.843
Neg logit contributions
 acquaintance-4.109
 ambush-3.903
 introdu-3.806
 assail-3.694
 jealousy-3.675
Token/
Token ID14
Feature activation+9.96e-01

Pos logit contributions
OM+1.474
hers+1.376
yk+1.371
Mand+1.330
kefeller+1.304
Neg logit contributions
 acquaintance-1.512
 ambush-1.437
 introdu-1.419
 assail-1.362
 jealousy-1.354
Tokendocs
Token ID31628
Feature activation+1.29e-01

Pos logit contributions
OM+3.135
yk+2.925
hers+2.897
Mand+2.833
kefeller+2.770
Neg logit contributions
 acquaintance-3.008
 ambush-2.875
 introdu-2.805
 assail-2.720
 jealousy-2.711
Token_
Token ID62
Feature activation+8.08e-02

Pos logit contributions
OM+0.387
hers+0.361
yk+0.360
Mand+0.350
kefeller+0.342
Neg logit contributions
 acquaintance-0.412
 ambush-0.390
 introdu-0.387
 assail-0.370
 equation-0.369
Tokenhtml
Token ID6494
Feature activation+7.68e-01

Pos logit contributions
OM+0.246
hers+0.231
yk+0.229
Mand+0.224
kefeller+0.219
Neg logit contributions
 acquaintance-0.252
 ambush-0.239
 introdu-0.238
 equation-0.227
 assail-0.226
Token The
Token ID383
Feature activation+8.51e-02

Pos logit contributions
OM+0.556
hers+0.525
yk+0.522
Mand+0.512
kefeller+0.502
Neg logit contributions
 acquaintance-0.460
 ambush-0.433
 introdu-0.431
 equation-0.408
 assail-0.406
Token article
Token ID2708
Feature activation-1.55e-01

Pos logit contributions
OM+0.343
hers+0.327
yk+0.325
Mand+0.320
kefeller+0.315
Neg logit contributions
 acquaintance-0.180
 ambush-0.166
 introdu-0.165
 equation-0.153
 assail-0.152
Token was
Token ID373
Feature activation-1.88e-02

Pos logit contributions
 acquaintance+0.426
 ambush+0.400
 introdu+0.399
 equation+0.377
 assail+0.375
Neg logit contributions
OM-0.528
hers-0.499
yk-0.496
Mand-0.486
kefeller-0.479
Token published
Token ID3199
Feature activation-1.39e-01

Pos logit contributions
 acquaintance+0.057
 ambush+0.054
 introdu+0.054
 equation+0.052
 assail+0.051
Neg logit contributions
OM-0.058
hers-0.055
yk-0.054
Mand-0.053
kefeller-0.052
Token by
Token ID416
Feature activation+4.80e-01

Pos logit contributions
 acquaintance+0.353
 ambush+0.330
 introdu+0.329
 equation+0.310
 assail+0.308
Neg logit contributions
OM-0.501
hers-0.475
yk-0.472
Mand-0.463
kefeller-0.456
Tokenand
Token ID392
Feature activation+1.37e+00

Pos logit contributions
OM+1.546
hers+1.453
yk+1.441
Mand+1.412
kefeller+1.372
Neg logit contributions
 acquaintance-1.401
 ambush-1.328
 introdu-1.318
 equation-1.250
 assail-1.249
Token translated
Token ID14251
Feature activation+2.34e-01

Pos logit contributions
OM+4.424
hers+4.157
yk+4.112
Mand+4.024
OSH+3.858
Neg logit contributions
 acquaintance-3.925
 ambush-3.714
 introdu-3.630
 assail-3.513
 jealousy-3.486
Token to
Token ID284
Feature activation+1.69e-01

Pos logit contributions
OM+0.810
hers+0.763
yk+0.758
Mand+0.746
kefeller+0.726
Neg logit contributions
 acquaintance-0.628
 ambush-0.592
 introdu-0.585
 assail-0.556
 equation-0.553
Token English
Token ID3594
Feature activation+8.04e-01

Pos logit contributions
OM+0.557
hers+0.526
yk+0.522
Mand+0.512
kefeller+0.502
Neg logit contributions
 acquaintance-0.481
 ambush-0.454
 introdu-0.452
 equation-0.428
 assail-0.426
Token by
Token ID416
Feature activation+2.82e-01

Pos logit contributions
OM+2.689
hers+2.516
yk+2.506
Mand+2.474
kefeller+2.365
Neg logit contributions
 acquaintance-2.251
 ambush-2.124
 introdu-2.095
 assail-2.000
 equation-1.993
TokenAfter
Token ID3260
Feature activation+1.42e-01

Pos logit contributions
OM+0.908
hers+0.854
yk+0.848
Mand+0.831
kefeller+0.811
Neg logit contributions
 acquaintance-0.825
 ambush-0.781
 introdu-0.776
 equation-0.736
 assail-0.735
Token bring
Token ID2222
Feature activation+2.99e-01

Pos logit contributions
OM+1.038
hers+0.988
yk+0.982
Mand+0.965
kefeller+0.951
Neg logit contributions
 acquaintance-0.635
 ambush-0.589
 introdu-0.587
 equation-0.549
 assail-0.545
Token my
Token ID616
Feature activation+3.24e-02

Pos logit contributions
OM+0.994
hers+0.936
yk+0.931
Mand+0.910
kefeller+0.896
Neg logit contributions
 acquaintance-0.842
 ambush-0.794
 introdu-0.789
 assail-0.749
 equation-0.748
Token childhood
Token ID9963
Feature activation+2.58e-04

Pos logit contributions
OM+0.110
hers+0.104
yk+0.103
Mand+0.101
kefeller+0.099
Neg logit contributions
 acquaintance-0.089
 ambush-0.084
 introdu-0.084
 equation-0.079
 assail-0.079
Token fantasy
Token ID8842
Feature activation+8.12e-02

Pos logit contributions
OM+0.001
hers+0.001
yk+0.001
Mand+0.001
kefeller+0.001
Neg logit contributions
 acquaintance-0.001
 ambush-0.001
 introdu-0.001
 equation-0.001
 assail-0.001
Token one
Token ID530
Feature activation+8.86e-02

Pos logit contributions
OM+0.277
hers+0.262
yk+0.260
Mand+0.255
kefeller+0.250
Neg logit contributions
 acquaintance-0.222
 ambush-0.209
 introdu-0.208
 equation-0.196
 assail-0.195
Token step
Token ID2239
Feature activation+1.36e+00

Pos logit contributions
OM+0.302
hers+0.287
yk+0.284
Mand+0.279
kefeller+0.274
Neg logit contributions
 acquaintance-0.243
 ambush-0.228
 introdu-0.227
 equation-0.215
 assail-0.213
Token closer
Token ID5699
Feature activation+1.61e-01

Pos logit contributions
OM+3.324
hers+3.043
yk+3.016
Mand+2.915
kefeller+2.825
Neg logit contributions
 acquaintance-4.807
 ambush-4.668
 introdu-4.599
 assail-4.513
 <--4.493
Token to
Token ID284
Feature activation+2.73e-01

Pos logit contributions
OM+0.537
hers+0.506
yk+0.499
Mand+0.488
kefeller+0.481
Neg logit contributions
 acquaintance-0.452
 ambush-0.431
 introdu-0.426
 assail-0.404
 jealousy-0.402
Token reality
Token ID3950
Feature activation+3.38e-01

Pos logit contributions
OM+0.783
hers+0.735
yk+0.724
Mand+0.709
kefeller+0.693
Neg logit contributions
 acquaintance-0.894
 ambush-0.850
 introdu-0.847
 equation-0.807
 assail-0.806
Token.
Token ID13
Feature activation+2.69e-01

Pos logit contributions
OM+1.125
hers+1.060
yk+1.050
Mand+1.024
kefeller+1.004
Neg logit contributions
 acquaintance-0.959
 ambush-0.905
 introdu-0.899
 assail-0.850
 equation-0.848
Token
Token ID198
Feature activation+9.03e-02

Pos logit contributions
OM+0.929
hers+0.874
yk+0.869
Mand+0.856
kefeller+0.832
Neg logit contributions
 acquaintance-0.732
 ambush-0.686
 introdu-0.683
 assail-0.644
 equation-0.644
Token,
Token ID11
Feature activation+1.56e-01

Pos logit contributions
OM+0.652
hers+0.617
yk+0.611
Mand+0.599
kefeller+0.586
Neg logit contributions
 acquaintance-0.527
 introdu-0.494
 ambush-0.493
 equation-0.465
 assail-0.463
Token and
Token ID290
Feature activation+1.73e-01

Pos logit contributions
OM+0.513
hers+0.484
yk+0.481
Mand+0.471
kefeller+0.464
Neg logit contributions
 acquaintance-0.446
 ambush-0.420
 introdu-0.419
 equation-0.397
 assail-0.395
Token the
Token ID262
Feature activation+3.75e-02

Pos logit contributions
OM+0.580
hers+0.549
yk+0.546
Mand+0.535
kefeller+0.528
Neg logit contributions
 acquaintance-0.483
 ambush-0.455
 introdu-0.453
 equation-0.428
 jealousy-0.426
Token grown
Token ID7334
Feature activation+5.50e-01

Pos logit contributions
OM+0.126
hers+0.119
yk+0.118
Mand+0.116
kefeller+0.114
Neg logit contributions
 acquaintance-0.105
 ambush-0.099
 introdu-0.098
 equation-0.093
 jealousy-0.092
Token-
Token ID12
Feature activation+2.97e-01

Pos logit contributions
OM+1.729
hers+1.635
yk+1.606
Mand+1.573
kefeller+1.532
Neg logit contributions
 acquaintance-1.650
 ambush-1.560
 introdu-1.557
 equation-1.478
 assail-1.472
Tokenups
Token ID4739
Feature activation+1.35e+00

Pos logit contributions
OM+0.951
hers+0.898
yk+0.885
Mand+0.869
kefeller+0.845
Neg logit contributions
 acquaintance-0.882
 ambush-0.836
 introdu-0.831
 equation-0.791
 assail-0.786
Token believed
Token ID4762
Feature activation+3.60e-01

Pos logit contributions
OM+4.257
hers+4.054
yk+3.953
Mand+3.866
OSH+3.710
Neg logit contributions
 acquaintance-3.879
 introdu-3.651
 ambush-3.650
 equation-3.481
 assail-3.455
Token what
Token ID644
Feature activation-1.32e-01

Pos logit contributions
OM+1.196
hers+1.131
yk+1.119
Mand+1.098
kefeller+1.074
Neg logit contributions
 acquaintance-1.015
 ambush-0.953
 introdu-0.952
 equation-0.902
 assail-0.899
Token the
Token ID262
Feature activation-1.24e-01

Pos logit contributions
 acquaintance+0.363
 ambush+0.342
 introdu+0.341
 equation+0.322
 jealousy+0.320
Neg logit contributions
OM-0.449
hers-0.425
yk-0.424
Mand-0.415
kefeller-0.411
Token kids
Token ID3988
Feature activation-4.20e-02

Pos logit contributions
 acquaintance+0.356
 ambush+0.338
 introdu+0.334
 equation+0.319
 jealousy+0.315
Neg logit contributions
OM-0.410
yk-0.386
hers-0.386
Mand-0.377
kefeller-0.374
Token were
Token ID547
Feature activation+2.42e-01

Pos logit contributions
 acquaintance+0.125
 ambush+0.119
 introdu+0.118
 equation+0.112
 assail+0.112
Neg logit contributions
OM-0.133
hers-0.125
yk-0.124
Mand-0.122
kefeller-0.120
Token 121
Token ID20416
Feature activation+8.25e-02

Pos logit contributions
 acquaintance+0.699
 ambush+0.653
 introdu+0.650
 equation+0.612
 jealousy+0.606
Neg logit contributions
OM-0.970
hers-0.920
yk-0.915
Mand-0.895
kefeller-0.890
Token months
Token ID1933
Feature activation+1.67e-01

Pos logit contributions
OM+0.258
hers+0.243
yk+0.241
Mand+0.236
kefeller+0.231
Neg logit contributions
 acquaintance-0.249
 ambush-0.236
 introdu-0.235
 equation-0.223
 assail-0.222
Token.
Token ID13
Feature activation+1.01e-03

Pos logit contributions
OM+0.596
hers+0.565
yk+0.561
Mand+0.552
kefeller+0.539
Neg logit contributions
 acquaintance-0.431
 ambush-0.404
 introdu-0.402
 equation-0.378
 assail-0.376
Token On
Token ID1550
Feature activation+1.84e-01

Pos logit contributions
OM+0.004
hers+0.003
yk+0.003
Mand+0.003
kefeller+0.003
Neg logit contributions
 acquaintance-0.003
 ambush-0.002
 introdu-0.002
 equation-0.002
 jealousy-0.002
Token the
Token ID262
Feature activation+3.26e-01

Pos logit contributions
OM+0.607
hers+0.573
yk+0.570
Mand+0.556
kefeller+0.547
Neg logit contributions
 acquaintance-0.525
 ambush-0.496
 introdu-0.494
 equation-0.467
 assail-0.467
Token flip
Token ID14283
Feature activation+1.34e+00

Pos logit contributions
OM+1.028
yk+0.968
hers+0.968
Mand+0.940
kefeller+0.921
Neg logit contributions
 acquaintance-0.962
 ambush-0.911
 introdu-0.910
 assail-0.866
 equation-0.858
Token side
Token ID1735
Feature activation+3.20e-01

Pos logit contributions
OM+2.979
yk+2.779
hers+2.724
Mand+2.586
kefeller+2.422
Neg logit contributions
 acquaintance-5.076
 introdu-4.884
 ambush-4.856
aign-4.710
 <--4.709
Token,
Token ID11
Feature activation+1.68e-01

Pos logit contributions
OM+1.056
hers+0.996
yk+0.988
Mand+0.964
kefeller+0.933
Neg logit contributions
 acquaintance-0.911
 ambush-0.861
 introdu-0.855
 assail-0.808
 jealousy-0.800
Token there
Token ID612
Feature activation+2.92e-01

Pos logit contributions
OM+0.565
hers+0.535
yk+0.531
Mand+0.521
kefeller+0.510
Neg logit contributions
 acquaintance-0.465
 ambush-0.439
 introdu-0.436
 equation-0.412
 assail-0.411
Token seems
Token ID2331
Feature activation-2.40e-02

Pos logit contributions
OM+0.986
hers+0.936
yk+0.924
Mand+0.908
kefeller+0.885
Neg logit contributions
 acquaintance-0.806
 ambush-0.760
 introdu-0.757
 assail-0.712
 equation-0.710
Token to
Token ID284
Feature activation-6.16e-02

Pos logit contributions
 acquaintance+0.068
 ambush+0.064
 introdu+0.064
 equation+0.060
 assail+0.060
Neg logit contributions
OM-0.080
hers-0.075
yk-0.074
Mand-0.073
kefeller-0.071
Token.
Token ID13
Feature activation+5.59e-02

Pos logit contributions
OM+0.691
hers+0.653
yk+0.646
Mand+0.630
kefeller+0.619
Neg logit contributions
 acquaintance-0.564
 ambush-0.531
 introdu-0.528
 assail-0.499
 jealousy-0.498
Token
Token ID198
Feature activation+1.74e-01

Pos logit contributions
OM+0.197
hers+0.186
yk+0.185
Mand+0.182
kefeller+0.179
Neg logit contributions
 acquaintance-0.147
 ambush-0.137
 introdu-0.137
 equation-0.129
 assail-0.128
Token
Token ID198
Feature activation+2.44e-01

Pos logit contributions
OM+0.585
hers+0.552
yk+0.548
Mand+0.538
kefeller+0.524
Neg logit contributions
 acquaintance-0.488
 ambush-0.458
 introdu-0.458
 assail-0.434
 equation-0.433
TokenE
Token ID36
Feature activation-1.83e-02

Pos logit contributions
OM+0.696
hers+0.648
yk+0.643
Mand+0.630
kefeller+0.612
Neg logit contributions
 acquaintance-0.806
 ambush-0.767
 introdu-0.764
 equation-0.729
 assail-0.728
Tokenarl
Token ID7063
Feature activation-1.49e-03

Pos logit contributions
 acquaintance+0.054
 ambush+0.052
 introdu+0.051
 assail+0.049
 equation+0.049
Neg logit contributions
OM-0.059
hers-0.055
yk-0.055
Mand-0.053
kefeller-0.052
Token Morse
Token ID44049
Feature activation+1.34e+00

Pos logit contributions
 acquaintance+0.004
 ambush+0.004
 introdu+0.004
 equation+0.004
 assail+0.004
Neg logit contributions
OM-0.005
hers-0.004
yk-0.004
Mand-0.004
kefeller-0.004
Token K
Token ID509
Feature activation+1.54e-04

Pos logit contributions
OM+4.430
hers+4.161
yk+4.126
Mand+4.046
OSH+3.909
Neg logit contributions
 acquaintance-3.647
 ambush-3.485
 introdu-3.429
 assail-3.306
 jealousy-3.295
TokenZ
Token ID57
Feature activation+1.53e-01

Pos logit contributions
OM+0.000
hers+0.000
yk+0.000
Mand+0.000
kefeller+0.000
Neg logit contributions
 acquaintance-0.000
 ambush-0.000
 introdu-0.000
 equation-0.000
 assail-0.000
Token8
Token ID23
Feature activation+2.11e-01

Pos logit contributions
OM+0.467
hers+0.438
yk+0.435
Mand+0.425
kefeller+0.416
Neg logit contributions
 acquaintance-0.473
 ambush-0.448
 introdu-0.446
 equation-0.424
 assail-0.423
TokenE
Token ID36
Feature activation+2.31e-02

Pos logit contributions
OM+0.644
hers+0.602
yk+0.599
Mand+0.585
kefeller+0.572
Neg logit contributions
 acquaintance-0.656
 ambush-0.621
 introdu-0.619
 equation-0.588
 assail-0.587
Token suggest
Token ID1950
Feature activation+4.83e-01

Pos logit contributions
OM+0.074
hers+0.070
yk+0.069
Mand+0.068
kefeller+0.066
Neg logit contributions
 acquaintance-0.068
 ambush-0.065
 introdu-0.064
 equation-0.061
 assail-0.061
Token make
Token ID787
Feature activation-2.68e-02

Pos logit contributions
OM+0.118
hers+0.111
yk+0.111
Mand+0.108
kefeller+0.107
Neg logit contributions
 acquaintance-0.097
 ambush-0.092
 introdu-0.091
 equation-0.086
 assail-0.086
Token the
Token ID262
Feature activation+1.99e-01

Pos logit contributions
 acquaintance+0.076
 ambush+0.072
 introdu+0.072
 equation+0.068
 jealousy+0.067
Neg logit contributions
OM-0.089
hers-0.084
yk-0.083
Mand-0.082
kefeller-0.080
Token job
Token ID1693
Feature activation+3.25e-01

Pos logit contributions
OM+0.685
hers+0.648
yk+0.644
Mand+0.632
kefeller+0.621
Neg logit contributions
 acquaintance-0.536
 ambush-0.504
 introdu-0.502
 equation-0.473
 assail-0.472
Token doub
Token ID3385
Feature activation+3.44e-01

Pos logit contributions
OM+1.105
hers+1.047
yk+1.040
Mand+1.018
kefeller+0.990
Neg logit contributions
 acquaintance-0.878
 ambush-0.828
 introdu-0.820
 assail-0.781
 jealousy-0.774
Tokenly
Token ID306
Feature activation+1.19e+00

Pos logit contributions
OM+0.953
hers+0.901
yk+0.880
Mand+0.857
kefeller+0.820
Neg logit contributions
 acquaintance-1.167
 ambush-1.104
 introdu-1.099
 assail-1.051
 jealousy-1.045
Token hard
Token ID1327
Feature activation+1.34e+00

Pos logit contributions
OM+3.660
yk+3.408
hers+3.404
Mand+3.301
OSH+3.202
Neg logit contributions
 acquaintance-3.588
 introdu-3.381
 ambush-3.356
 assail-3.215
aign-3.189
Token for
Token ID329
Feature activation+5.07e-01

Pos logit contributions
OM+4.274
hers+3.985
yk+3.919
Mand+3.804
OSH+3.717
Neg logit contributions
 acquaintance-3.881
 ambush-3.710
 introdu-3.623
 assail-3.501
 jealousy-3.474
Token us
Token ID514
Feature activation+6.83e-02

Pos logit contributions
OM+1.691
hers+1.598
yk+1.584
Mand+1.555
kefeller+1.522
Neg logit contributions
 acquaintance-1.415
 ambush-1.331
 introdu-1.329
 equation-1.258
 assail-1.255
Token.
Token ID13
Feature activation+2.52e-01

Pos logit contributions
OM+0.231
hers+0.219
yk+0.216
Mand+0.212
kefeller+0.208
Neg logit contributions
 acquaintance-0.189
 ambush-0.178
 introdu-0.177
 equation-0.167
 assail-0.167
Token
Token ID447
Feature activation+2.02e-01

Pos logit contributions
OM+0.775
hers+0.727
yk+0.720
Mand+0.708
kefeller+0.686
Neg logit contributions
 acquaintance-0.781
 ambush-0.739
 introdu-0.736
 assail-0.699
 equation-0.699
Token
Token ID251
Feature activation+1.90e-01

Pos logit contributions
OM+0.586
hers+0.552
yk+0.546
Mand+0.530
kefeller+0.515
Neg logit contributions
 acquaintance-0.657
 ambush-0.626
 introdu-0.623
 assail-0.594
 equation-0.591
Tokena
Token ID64
Feature activation+2.49e-01

Pos logit contributions
OM+0.947
yk+0.901
hers+0.900
kefeller+0.851
Mand+0.845
Neg logit contributions
 acquaintance-1.000
 introdu-0.938
 ambush-0.933
 jealousy-0.896
 assail-0.892
Token.
Token ID13
Feature activation+2.43e-01

Pos logit contributions
OM+0.826
hers+0.781
yk+0.775
Mand+0.757
kefeller+0.744
Neg logit contributions
 acquaintance-0.705
 ambush-0.664
 introdu-0.661
 assail-0.627
 equation-0.625
Tokendot
Token ID26518
Feature activation+3.53e-01

Pos logit contributions
OM+0.643
hers+0.599
yk+0.594
Mand+0.578
kefeller+0.566
Neg logit contributions
 acquaintance-0.854
 ambush-0.814
 introdu-0.812
 equation-0.777
 assail-0.774
Token.
Token ID13
Feature activation+4.47e-01

Pos logit contributions
OM+1.156
hers+1.090
yk+1.088
Mand+1.056
kefeller+1.031
Neg logit contributions
 acquaintance-1.031
 introdu-0.966
 ambush-0.962
 assail-0.925
 equation-0.912
Tokengov
Token ID9567
Feature activation+8.34e-01

Pos logit contributions
OM+1.193
hers+1.108
yk+1.106
Mand+1.068
kefeller+1.049
Neg logit contributions
 acquaintance-1.560
 ambush-1.482
 introdu-1.477
 equation-1.412
 assail-1.410
Token/
Token ID14
Feature activation+1.34e+00

Pos logit contributions
OM+2.319
yk+2.126
hers+2.085
Mand+2.053
kefeller+2.005
Neg logit contributions
 acquaintance-2.859
 ambush-2.682
 introdu-2.664
 jealousy-2.585
 assail-2.557
Tokenpub
Token ID12984
Feature activation+6.90e-01

Pos logit contributions
OM+4.155
hers+3.813
yk+3.805
Mand+3.766
OSH+3.622
Neg logit contributions
 acquaintance-4.115
 ambush-3.851
 introdu-3.832
 jealousy-3.686
 assail-3.677
Tokens
Token ID82
Feature activation+3.87e-01

Pos logit contributions
OM+1.765
hers+1.630
yk+1.610
Mand+1.577
kefeller+1.559
Neg logit contributions
 acquaintance-2.501
 ambush-2.384
 introdu-2.352
 assail-2.283
 jealousy-2.282
Token/
Token ID14
Feature activation+9.78e-01

Pos logit contributions
OM+1.118
hers+1.040
yk+1.031
Mand+1.007
kefeller+0.992
Neg logit contributions
 acquaintance-1.291
 ambush-1.216
 introdu-1.207
 jealousy-1.166
 assail-1.155
Token8
Token ID23
Feature activation-2.22e-02

Pos logit contributions
OM+2.956
hers+2.741
yk+2.717
Mand+2.678
kefeller+2.567
Neg logit contributions
 acquaintance-3.106
 ambush-2.927
 introdu-2.917
 jealousy-2.799
 assail-2.789
Token11
Token ID1157
Feature activation-3.80e-01

Pos logit contributions
 acquaintance+0.070
 ambush+0.067
 introdu+0.066
 equation+0.063
 assail+0.063
Neg logit contributions
OM-0.067
hers-0.062
yk-0.062
Mand-0.061
kefeller-0.059
Token Bland
Token ID49524
Feature activation+5.40e-02

Pos logit contributions
OM+0.608
hers+0.573
yk+0.568
Mand+0.556
kefeller+0.545
Neg logit contributions
 acquaintance-0.605
 ambush-0.574
 introdu-0.572
 equation-0.542
 assail-0.541
Tokeny
Token ID88
Feature activation+2.47e-02

Pos logit contributions
OM+0.179
hers+0.169
yk+0.169
Mand+0.165
kefeller+0.161
Neg logit contributions
 acquaintance-0.153
 ambush-0.144
 introdu-0.144
 equation-0.136
 assail-0.136
Token is
Token ID318
Feature activation+1.09e-01

Pos logit contributions
OM+0.084
hers+0.079
yk+0.079
Mand+0.077
kefeller+0.076
Neg logit contributions
 acquaintance-0.068
 ambush-0.064
 introdu-0.063
 equation-0.060
 assail-0.060
Token an
Token ID281
Feature activation+5.80e-02

Pos logit contributions
OM+0.375
hers+0.355
yk+0.353
Mand+0.346
kefeller+0.340
Neg logit contributions
 acquaintance-0.295
 ambush-0.278
 introdu-0.277
 equation-0.261
 assail-0.260
Token Ad
Token ID1215
Feature activation-3.16e-01

Pos logit contributions
OM+0.197
hers+0.186
yk+0.184
Mand+0.181
kefeller+0.177
Neg logit contributions
 acquaintance-0.157
 introdu-0.148
 ambush-0.148
 jealousy-0.141
 assail-0.140
Tokenj
Token ID73
Feature activation-1.60e+00

Pos logit contributions
 acquaintance+0.959
 ambush+0.906
 introdu+0.902
 equation+0.860
 assail+0.854
Neg logit contributions
OM-0.985
hers-0.926
yk-0.919
Mand-0.900
kefeller-0.887
Tokenunct
Token ID16260
Feature activation+3.33e-01

Pos logit contributions
 acquaintance+5.220
 ambush+4.920
 introdu+4.828
 equation+4.665
 assail+4.659
Neg logit contributions
OM-4.441
hers-4.220
yk-4.143
Mand-4.020
kefeller-4.019
Token Professor
Token ID8129
Feature activation-5.30e-01

Pos logit contributions
OM+1.069
hers+1.006
yk+1.000
Mand+0.978
kefeller+0.959
Neg logit contributions
 acquaintance-0.974
 ambush-0.925
 introdu-0.919
 equation-0.874
 assail-0.871
Token of
Token ID286
Feature activation+2.50e-01

Pos logit contributions
 acquaintance+1.339
 ambush+1.250
 introdu+1.247
 equation+1.171
 assail+1.162
Neg logit contributions
OM-1.921
hers-1.824
yk-1.814
Mand-1.779
kefeller-1.755
Token Economics
Token ID18963
Feature activation+2.74e-01

Pos logit contributions
OM+0.741
hers+0.694
yk+0.690
Mand+0.674
kefeller+0.661
Neg logit contributions
 acquaintance-0.795
 ambush-0.754
 introdu-0.751
 equation-0.715
 assail-0.714
Token at
Token ID379
Feature activation+2.71e-01

Pos logit contributions
OM+0.970
hers+0.919
yk+0.914
Mand+0.892
kefeller+0.870
Neg logit contributions
 acquaintance-0.710
 ambush-0.676
 introdu-0.665
 assail-0.631
 equation-0.626
Token of
Token ID286
Feature activation-1.51e-01

Pos logit contributions
 acquaintance+0.557
 ambush+0.523
 introdu+0.521
 equation+0.491
 assail+0.488
Neg logit contributions
OM-0.727
hers-0.689
yk-0.685
Mand-0.672
kefeller-0.663
Token complaints
Token ID9687
Feature activation+4.43e-01

Pos logit contributions
 acquaintance+0.427
 ambush+0.402
 introdu+0.401
 equation+0.379
 assail+0.377
Neg logit contributions
OM-0.503
hers-0.475
yk-0.472
Mand-0.462
kefeller-0.454
Token.
Token ID13
Feature activation+1.23e-03

Pos logit contributions
OM+1.479
hers+1.398
yk+1.386
Mand+1.357
kefeller+1.319
Neg logit contributions
 acquaintance-1.235
 ambush-1.164
 introdu-1.153
 equation-1.098
 assail-1.095
Token
Token ID198
Feature activation+1.35e-02

Pos logit contributions
OM+0.004
hers+0.004
yk+0.004
Mand+0.004
kefeller+0.003
Neg logit contributions
 acquaintance-0.004
 ambush-0.003
 introdu-0.003
 equation-0.003
 assail-0.003
Token
Token ID198
Feature activation+1.24e-01

Pos logit contributions
OM+0.046
hers+0.044
yk+0.044
Mand+0.043
kefeller+0.043
Neg logit contributions
 acquaintance-0.036
 ambush-0.034
 introdu-0.034
 equation-0.032
 jealousy-0.032
TokenAnalysis
Token ID32750
Feature activation-1.54e+00

Pos logit contributions
OM+0.354
hers+0.332
yk+0.329
Mand+0.321
kefeller+0.315
Neg logit contributions
 acquaintance-0.410
 ambush-0.389
 introdu-0.388
 equation-0.371
 assail-0.369
Token It
Token ID632
Feature activation+9.43e-02

Pos logit contributions
 acquaintance+3.950
 ambush+3.754
 introdu+3.736
 equation+3.576
 <-+3.575
Neg logit contributions
OM-5.237
hers-5.022
yk-4.951
kefeller-4.844
Mand-4.826
Token has
Token ID468
Feature activation-2.74e-01

Pos logit contributions
OM+0.326
hers+0.310
yk+0.307
Mand+0.301
kefeller+0.295
Neg logit contributions
 acquaintance-0.253
 ambush-0.238
 introdu-0.237
 assail-0.224
 equation-0.223
Token been
Token ID587
Feature activation-6.74e-01

Pos logit contributions
 acquaintance+0.721
 ambush+0.678
 introdu+0.675
 equation+0.636
 jealousy+0.632
Neg logit contributions
OM-0.964
yk-0.914
hers-0.908
Mand-0.892
kefeller-0.882
Token clear
Token ID1598
Feature activation-3.63e-01

Pos logit contributions
 acquaintance+1.739
 ambush+1.641
 introdu+1.626
 equation+1.525
 jealousy+1.515
Neg logit contributions
OM-2.399
yk-2.267
hers-2.265
Mand-2.220
kefeller-2.207
Token since
Token ID1201
Feature activation+2.08e-01

Pos logit contributions
 acquaintance+0.891
 ambush+0.832
 introdu+0.829
 equation+0.778
 jealousy+0.771
Neg logit contributions
OM-1.340
hers-1.274
yk-1.268
Mand-1.244
kefeller-1.226
Token.
Token ID13
Feature activation-3.76e-02

Pos logit contributions
 acquaintance+0.126
 ambush+0.119
 introdu+0.119
 equation+0.113
 assail+0.112
Neg logit contributions
OM-0.136
hers-0.128
yk-0.127
Mand-0.125
kefeller-0.122
Token Guess
Token ID37571
Feature activation-5.24e-01

Pos logit contributions
 acquaintance+0.097
 ambush+0.091
 introdu+0.090
 equation+0.085
 jealousy+0.085
Neg logit contributions
OM-0.133
hers-0.127
yk-0.126
Mand-0.124
kefeller-0.123
Token what
Token ID644
Feature activation-8.18e-02

Pos logit contributions
 acquaintance+1.452
 ambush+1.369
 introdu+1.365
 equation+1.294
 jealousy+1.288
Neg logit contributions
OM-1.738
yk-1.652
hers-1.631
kefeller-1.610
Mand-1.602
Token!
Token ID0
Feature activation-2.09e-01

Pos logit contributions
 acquaintance+0.243
 ambush+0.230
 introdu+0.229
 equation+0.217
 assail+0.217
Neg logit contributions
OM-0.260
hers-0.245
yk-0.243
Mand-0.237
kefeller-0.232
Token Ch
Token ID609
Feature activation-1.10e-01

Pos logit contributions
 acquaintance+0.524
 ambush+0.491
 introdu+0.490
 equation+0.460
 jealousy+0.457
Neg logit contributions
OM-0.754
hers-0.719
yk-0.716
Mand-0.702
kefeller-0.699
Tokenances
Token ID1817
Feature activation-1.50e+00

Pos logit contributions
 acquaintance+0.382
 ambush+0.366
 introdu+0.364
 equation+0.348
 assail+0.346
Neg logit contributions
OM-0.291
hers-0.271
yk-0.267
Mand-0.263
kefeller-0.259
Token are
Token ID389
Feature activation+1.59e-01

Pos logit contributions
 acquaintance+3.534
 ambush+3.353
 introdu+3.311
 equation+3.129
 assail+3.124
Neg logit contributions
OM-5.383
hers-5.224
yk-5.161
Mand-5.132
kefeller-5.114
Token,
Token ID11
Feature activation-2.18e-01

Pos logit contributions
OM+0.584
hers+0.554
yk+0.550
Mand+0.538
kefeller+0.529
Neg logit contributions
 acquaintance-0.393
 ambush-0.367
 introdu-0.365
 equation-0.343
 assail-0.343
Token you
Token ID345
Feature activation+1.37e-01

Pos logit contributions
 acquaintance+0.520
 ambush+0.484
 introdu+0.482
 equation+0.451
 jealousy+0.448
Neg logit contributions
OM-0.821
hers-0.782
yk-0.777
Mand-0.764
kefeller-0.754
Token rarely
Token ID8365
Feature activation-2.96e-01

Pos logit contributions
OM+0.509
hers+0.484
yk+0.481
Mand+0.472
kefeller+0.465
Neg logit contributions
 acquaintance-0.331
 ambush-0.308
 introdu-0.307
 equation-0.288
 assail-0.286
Token do
Token ID466
Feature activation-1.34e-01

Pos logit contributions
 acquaintance+0.702
 ambush+0.656
 introdu+0.647
 equation+0.606
 assail+0.603
Neg logit contributions
OM-1.123
hers-1.066
yk-1.061
Mand-1.048
kefeller-1.034
Token for
Token ID329
Feature activation+7.98e-02

Pos logit contributions
 acquaintance+0.474
 ambush+0.443
 introdu+0.442
 equation+0.415
 assail+0.412
Neg logit contributions
OM-0.684
hers-0.650
yk-0.646
Mand-0.634
kefeller-0.625
Token the
Token ID262
Feature activation-5.05e-02

Pos logit contributions
OM+0.261
hers+0.246
yk+0.245
Mand+0.240
kefeller+0.236
Neg logit contributions
 acquaintance-0.230
 ambush-0.217
 introdu-0.216
 equation-0.205
 assail-0.203
Token Montreal
Token ID12871
Feature activation-4.13e-01

Pos logit contributions
 acquaintance+0.153
 ambush+0.145
 introdu+0.145
 equation+0.138
 assail+0.137
Neg logit contributions
OM-0.157
hers-0.148
yk-0.147
Mand-0.144
kefeller-0.142
Token Canadiens
Token ID33664
Feature activation-1.84e-01

Pos logit contributions
 acquaintance+1.614
 ambush+1.550
 introdu+1.543
 equation+1.490
 assail+1.476
Neg logit contributions
OM-0.919
hers-0.838
yk-0.837
Mand-0.808
kefeller-0.797
Token.
Token ID13
Feature activation-2.36e-01

Pos logit contributions
 acquaintance+0.512
 ambush+0.481
 introdu+0.480
 equation+0.453
 assail+0.451
Neg logit contributions
OM-0.619
hers-0.586
yk-0.581
Mand-0.570
kefeller-0.560
Token Let
Token ID3914
Feature activation-1.46e+00

Pos logit contributions
 acquaintance+0.618
 ambush+0.581
 introdu+0.580
 equation+0.546
 <-+0.542
Neg logit contributions
OM-0.827
hers-0.786
yk-0.782
Mand-0.764
kefeller-0.763
Token
Token ID447
Feature activation+7.80e-02

Pos logit contributions
 acquaintance+3.175
 introdu+3.005
 ambush+2.920
 equation+2.781
 jealousy+2.743
Neg logit contributions
OM-5.651
yk-5.359
hers-5.327
Mand-5.298
kefeller-5.246
Token
Token ID247
Feature activation-3.51e-02

Pos logit contributions
OM+0.259
hers+0.246
yk+0.243
Mand+0.238
kefeller+0.232
Neg logit contributions
 acquaintance-0.221
 ambush-0.208
 introdu-0.208
 equation-0.196
 assail-0.196
Tokens
Token ID82
Feature activation+1.68e-01

Pos logit contributions
 acquaintance+0.124
 ambush+0.118
 introdu+0.117
 equation+0.112
 assail+0.112
Neg logit contributions
OM-0.093
hers-0.086
yk-0.085
Mand-0.083
kefeller-0.081
Token take
Token ID1011
Feature activation-9.14e-02

Pos logit contributions
OM+0.621
hers+0.592
yk+0.586
Mand+0.576
kefeller+0.565
Neg logit contributions
 acquaintance-0.407
 ambush-0.378
 introdu-0.375
 equation-0.353
 jealousy-0.351
Token a
Token ID257
Feature activation-7.62e-02

Pos logit contributions
 acquaintance+0.228
 ambush+0.214
 introdu+0.211
 equation+0.199
 assail+0.199
Neg logit contributions
OM-0.334
hers-0.318
yk-0.314
Mand-0.309
kefeller-0.303
Token _
Token ID4808
Feature activation-3.15e-01

Pos logit contributions
OM+0.020
hers+0.019
yk+0.019
Mand+0.018
kefeller+0.018
Neg logit contributions
 acquaintance-0.021
 ambush-0.020
 introdu-0.020
 equation-0.019
 assail-0.019
TokenP
Token ID47
Feature activation-2.88e-01

Pos logit contributions
 acquaintance+1.095
 ambush+1.044
 introdu+1.040
 equation+0.997
 assail+0.991
Neg logit contributions
OM-0.840
hers-0.783
yk-0.777
Mand-0.756
kefeller-0.742
Tokenulse
Token ID9615
Feature activation-4.63e-01

Pos logit contributions
 acquaintance+1.040
 ambush+1.000
 introdu+0.983
 <-+0.956
 equation+0.953
Neg logit contributions
OM-0.707
hers-0.664
yk-0.660
kefeller-0.644
Mand-0.640
TokenAmount
Token ID31264
Feature activation-8.86e-03

Pos logit contributions
 acquaintance+1.746
 ambush+1.670
 introdu+1.666
 equation+1.610
 assail+1.594
Neg logit contributions
OM-1.080
hers-1.001
yk-0.998
Mand-0.959
kefeller-0.948
Token].
Token ID4083
Feature activation-6.00e-01

Pos logit contributions
 acquaintance+0.027
 ambush+0.025
 introdu+0.025
 equation+0.024
 assail+0.024
Neg logit contributions
OM-0.028
hers-0.026
yk-0.026
Mand-0.026
kefeller-0.025
Token Let
Token ID3914
Feature activation-1.39e+00

Pos logit contributions
 acquaintance+1.572
 ambush+1.477
 introdu+1.474
 equation+1.409
 <-+1.387
Neg logit contributions
OM-2.078
hers-1.978
yk-1.963
kefeller-1.928
Mand-1.928
Token
Token ID447
Feature activation-1.76e-02

Pos logit contributions
 acquaintance+3.613
 introdu+3.417
 ambush+3.385
 equation+3.296
 <-+3.213
Neg logit contributions
OM-4.804
yk-4.542
hers-4.474
Mand-4.440
kefeller-4.438
Token
Token ID247
Feature activation-1.08e-01

Pos logit contributions
 acquaintance+0.062
 ambush+0.059
 introdu+0.059
 equation+0.057
 <-+0.057
Neg logit contributions
OM-0.045
yk-0.042
hers-0.042
kefeller-0.042
Mand-0.040
Tokens
Token ID82
Feature activation-9.88e-04

Pos logit contributions
 acquaintance+0.382
 ambush+0.364
 introdu+0.363
 equation+0.347
 assail+0.346
Neg logit contributions
OM-0.285
hers-0.265
yk-0.263
Mand-0.256
kefeller-0.250
Token do
Token ID466
Feature activation-2.24e-01

Pos logit contributions
 acquaintance+0.002
 ambush+0.002
 introdu+0.002
 equation+0.002
 assail+0.002
Neg logit contributions
OM-0.004
hers-0.004
yk-0.004
Mand-0.003
kefeller-0.003
Token this
Token ID428
Feature activation-1.73e-01

Pos logit contributions
 acquaintance+0.644
 ambush+0.607
 introdu+0.605
 equation+0.574
 assail+0.570
Neg logit contributions
OM-0.730
hers-0.690
yk-0.685
Mand-0.671
kefeller-0.660
Token revolt
Token ID27360
Feature activation-3.59e-01

Pos logit contributions
 acquaintance+0.327
 ambush+0.305
 introdu+0.303
 equation+0.283
 assail+0.281
Neg logit contributions
OM-0.550
hers-0.524
yk-0.522
Mand-0.512
kefeller-0.506
Token in
Token ID287
Feature activation-2.38e-02

Pos logit contributions
 acquaintance+0.904
 ambush+0.848
 introdu+0.844
 equation+0.794
 assail+0.788
Neg logit contributions
OM-1.298
hers-1.233
yk-1.228
Mand-1.204
kefeller-1.192
Token the
Token ID262
Feature activation-3.27e-01

Pos logit contributions
 acquaintance+0.066
 ambush+0.062
 introdu+0.062
 equation+0.059
 jealousy+0.058
Neg logit contributions
OM-0.080
hers-0.076
yk-0.076
Mand-0.074
kefeller-0.073
Token early
Token ID1903
Feature activation-1.75e-01

Pos logit contributions
 acquaintance+0.839
 ambush+0.791
 introdu+0.785
 equation+0.741
 jealousy+0.735
Neg logit contributions
OM-1.166
hers-1.113
yk-1.107
Mand-1.080
kefeller-1.077
Token 5
Token ID642
Feature activation-1.30e-01

Pos logit contributions
 acquaintance+0.460
 ambush+0.432
 introdu+0.430
 equation+0.405
 assail+0.402
Neg logit contributions
OM-0.616
hers-0.584
yk-0.581
Mand-0.570
kefeller-0.562
Tokenth
Token ID400
Feature activation-1.39e+00

Pos logit contributions
 acquaintance+0.464
 ambush+0.443
 introdu+0.441
 equation+0.423
 assail+0.421
Neg logit contributions
OM-0.335
hers-0.311
yk-0.309
Mand-0.301
kefeller-0.295
Token century
Token ID4289
Feature activation-7.48e-01

Pos logit contributions
 acquaintance+4.288
 ambush+4.075
 introdu+4.065
 equation+3.894
 assail+3.876
Neg logit contributions
OM-4.064
yk-3.912
hers-3.823
kefeller-3.791
Mand-3.707
Token that
Token ID326
Feature activation-1.20e-01

Pos logit contributions
 acquaintance+1.935
 ambush+1.817
 introdu+1.811
 equation+1.709
 assail+1.693
Neg logit contributions
OM-2.648
hers-2.522
yk-2.503
Mand-2.453
kefeller-2.436
Token was
Token ID373
Feature activation-3.36e-01

Pos logit contributions
 acquaintance+0.358
 introdu+0.340
 ambush+0.339
 equation+0.322
 jealousy+0.319
Neg logit contributions
OM-0.378
hers-0.356
yk-0.354
Mand-0.345
kefeller-0.342
Token the
Token ID262
Feature activation-2.24e-01

Pos logit contributions
 acquaintance+0.965
 introdu+0.915
 ambush+0.912
 jealousy+0.864
 equation+0.863
Neg logit contributions
OM-1.095
yk-1.031
hers-1.028
Mand-1.006
kefeller-1.004
Token catalyst
Token ID31357
Feature activation+6.75e-02

Pos logit contributions
 acquaintance+0.624
 ambush+0.590
 introdu+0.588
 equation+0.558
 jealousy+0.552
Neg logit contributions
OM-0.757
hers-0.717
yk-0.711
Mand-0.696
kefeller-0.692
Token Most
Token ID4042
Feature activation-2.25e-01

Pos logit contributions
 acquaintance+0.035
 ambush+0.033
 introdu+0.033
 equation+0.031
 assail+0.031
Neg logit contributions
OM-0.047
hers-0.044
yk-0.044
Mand-0.043
kefeller-0.042
Token make
Token ID787
Feature activation+3.81e-02

Pos logit contributions
 acquaintance+0.564
 ambush+0.527
 introdu+0.525
 equation+0.494
 assail+0.490
Neg logit contributions
OM-0.818
hers-0.777
yk-0.772
Mand-0.758
kefeller-0.745
Token silly
Token ID14397
Feature activation-1.59e-01

Pos logit contributions
OM+0.130
hers+0.123
yk+0.122
Mand+0.119
kefeller+0.117
Neg logit contributions
 acquaintance-0.103
 ambush-0.097
 introdu-0.096
 assail-0.092
 equation-0.091
Token jokes
Token ID14532
Feature activation-3.08e-01

Pos logit contributions
 acquaintance+0.375
 ambush+0.351
 introdu+0.347
 assail+0.327
 equation+0.326
Neg logit contributions
OM-0.603
hers-0.572
yk-0.568
Mand-0.558
kefeller-0.548
Token,
Token ID11
Feature activation-1.96e-01

Pos logit contributions
 acquaintance+0.712
 ambush+0.662
 introdu+0.658
 equation+0.617
 assail+0.614
Neg logit contributions
OM-1.179
hers-1.127
yk-1.116
Mand-1.095
kefeller-1.074
Token drink
Token ID4144
Feature activation-1.39e+00

Pos logit contributions
 acquaintance+0.505
 ambush+0.472
 introdu+0.471
 equation+0.444
 assail+0.440
Neg logit contributions
OM-0.699
hers-0.663
yk-0.658
Mand-0.646
kefeller-0.634
Token more
Token ID517
Feature activation-3.06e-01

Pos logit contributions
 acquaintance+3.757
 introdu+3.539
 ambush+3.530
 jealousy+3.353
 equation+3.325
Neg logit contributions
OM-4.678
hers-4.432
yk-4.429
kefeller-4.329
Mand-4.322
Token than
Token ID621
Feature activation-3.43e-01

Pos logit contributions
 acquaintance+0.853
 ambush+0.804
 introdu+0.800
 equation+0.757
 assail+0.755
Neg logit contributions
OM-1.025
hers-0.968
yk-0.961
Mand-0.943
kefeller-0.925
Token they
Token ID484
Feature activation-2.99e-01

Pos logit contributions
 acquaintance+0.918
 ambush+0.862
 introdu+0.858
 equation+0.810
 assail+0.806
Neg logit contributions
OM-1.194
hers-1.131
yk-1.123
Mand-1.102
kefeller-1.083
Token usually
Token ID3221
Feature activation-1.26e-01

Pos logit contributions
 acquaintance+0.783
 ambush+0.734
 introdu+0.731
 equation+0.689
 assail+0.684
Neg logit contributions
OM-1.054
hers-1.000
yk-0.993
Mand-0.974
kefeller-0.957
Token would
Token ID561
Feature activation-4.62e-01

Pos logit contributions
 acquaintance+0.309
 ambush+0.288
 introdu+0.287
 equation+0.269
 assail+0.267
Neg logit contributions
OM-0.468
hers-0.446
yk-0.443
Mand-0.435
kefeller-0.428
Tokenche
Token ID2395
Feature activation-3.41e-01

Pos logit contributions
OM+0.545
hers+0.517
yk+0.513
Mand+0.502
kefeller+0.495
Neg logit contributions
 acquaintance-0.435
 ambush-0.410
 introdu-0.408
 equation-0.385
 assail-0.385
Token.
Token ID13
Feature activation-3.31e-01

Pos logit contributions
 acquaintance+0.941
 ambush+0.885
 introdu+0.882
 equation+0.834
 assail+0.829
Neg logit contributions
OM-1.153
hers-1.091
yk-1.083
Mand-1.063
kefeller-1.047
Token
Token ID198
Feature activation-3.28e-01

Pos logit contributions
 acquaintance+0.889
 ambush+0.836
 introdu+0.835
 equation+0.789
 jealousy+0.782
Neg logit contributions
OM-1.141
hers-1.083
yk-1.076
Mand-1.054
kefeller-1.045
Token
Token ID198
Feature activation-1.51e-01

Pos logit contributions
 acquaintance+0.859
 ambush+0.811
 introdu+0.803
 equation+0.760
 <-+0.759
Neg logit contributions
OM-1.145
hers-1.089
yk-1.085
kefeller-1.064
Mand-1.060
TokenThanks
Token ID9690
Feature activation-6.60e-01

Pos logit contributions
 acquaintance+0.465
 ambush+0.441
 introdu+0.439
 equation+0.418
 jealousy+0.416
Neg logit contributions
OM-0.459
hers-0.434
yk-0.430
Mand-0.419
kefeller-0.416
Token again
Token ID757
Feature activation-1.39e+00

Pos logit contributions
 acquaintance+1.741
 introdu+1.637
 ambush+1.630
 equation+1.539
 jealousy+1.520
Neg logit contributions
OM-2.292
yk-2.171
hers-2.170
Mand-2.132
kefeller-2.120
Token for
Token ID329
Feature activation-5.55e-01

Pos logit contributions
 acquaintance+3.347
 introdu+3.141
 ambush+3.105
 equation+2.928
 <-+2.896
Neg logit contributions
OM-4.988
yk-4.783
hers-4.755
kefeller-4.716
Mand-4.708
Token listening
Token ID8680
Feature activation-7.66e-01

Pos logit contributions
 acquaintance+1.580
 introdu+1.489
 ambush+1.479
 equation+1.399
 assail+1.388
Neg logit contributions
OM-1.836
yk-1.730
hers-1.725
Mand-1.696
kefeller-1.695
Token!
Token ID0
Feature activation-1.90e-01

Pos logit contributions
 acquaintance+2.104
 ambush+1.979
 introdu+1.971
 equation+1.863
 assail+1.853
Neg logit contributions
OM-2.606
hers-2.464
yk-2.449
Mand-2.400
kefeller-2.360
Token<|endoftext|>
Token ID50256
Feature activation+1.27e-01

Pos logit contributions
 acquaintance+0.536
 ambush+0.505
 introdu+0.502
 equation+0.475
 assail+0.474
Neg logit contributions
OM-0.637
hers-0.601
yk-0.597
Mand-0.584
kefeller-0.572
TokenPresident
Token ID10364
Feature activation-2.77e-01

Pos logit contributions
OM+0.361
hers+0.336
yk+0.333
Mand+0.327
kefeller+0.316
Neg logit contributions
 acquaintance-0.426
 ambush-0.405
 introdu-0.404
 equation-0.386
 assail-0.385
Tokenograph
Token ID2384
Feature activation+4.90e-02

Pos logit contributions
OM+0.081
hers+0.077
yk+0.076
Mand+0.075
kefeller+0.074
Neg logit contributions
 acquaintance-0.054
 ambush-0.050
 introdu-0.050
 equation-0.047
 assail-0.047
Token,
Token ID11
Feature activation+2.47e-01

Pos logit contributions
OM+0.167
hers+0.158
yk+0.157
Mand+0.154
kefeller+0.152
Neg logit contributions
 acquaintance-0.135
 ambush-0.127
 introdu-0.127
 equation-0.120
 assail-0.118
Token and
Token ID290
Feature activation+1.26e-01

Pos logit contributions
OM+0.823
hers+0.778
yk+0.767
Mand+0.754
kefeller+0.737
Neg logit contributions
 acquaintance-0.692
 ambush-0.650
 introdu-0.647
 equation-0.616
 assail-0.613
Token so
Token ID523
Feature activation+7.48e-02

Pos logit contributions
OM+0.439
hers+0.415
yk+0.412
Mand+0.404
kefeller+0.397
Neg logit contributions
 acquaintance-0.338
 ambush-0.317
 introdu-0.316
 equation-0.298
 assail-0.296
Token he
Token ID339
Feature activation-3.58e-01

Pos logit contributions
OM+0.262
hers+0.249
yk+0.247
Mand+0.243
kefeller+0.239
Neg logit contributions
 acquaintance-0.197
 ambush-0.185
 introdu-0.184
 equation-0.174
 assail-0.173
Token drinks
Token ID11758
Feature activation-1.38e+00

Pos logit contributions
 acquaintance+0.941
 ambush+0.889
 introdu+0.884
 equation+0.833
 jealousy+0.823
Neg logit contributions
OM-1.256
yk-1.185
hers-1.179
Mand-1.160
kefeller-1.152
Token boxed
Token ID46069
Feature activation+5.56e-02

Pos logit contributions
 acquaintance+3.700
 introdu+3.495
 ambush+3.477
 jealousy+3.291
 equation+3.266
Neg logit contributions
OM-4.680
yk-4.467
hers-4.450
kefeller-4.321
Mand-4.308
Token wine
Token ID8237
Feature activation-2.01e-01

Pos logit contributions
OM+0.197
hers+0.187
yk+0.185
Mand+0.182
kefeller+0.179
Neg logit contributions
 acquaintance-0.145
 ambush-0.136
 introdu-0.135
 equation-0.128
 assail-0.127
Token and
Token ID290
Feature activation-2.25e-01

Pos logit contributions
 acquaintance+0.548
 ambush+0.515
 introdu+0.513
 equation+0.485
 assail+0.483
Neg logit contributions
OM-0.688
hers-0.652
yk-0.647
Mand-0.634
kefeller-0.623
Token has
Token ID468
Feature activation-9.67e-02

Pos logit contributions
 acquaintance+0.570
 ambush+0.534
 introdu+0.532
 equation+0.499
 jealousy+0.496
Neg logit contributions
OM-0.811
hers-0.769
yk-0.767
Mand-0.752
kefeller-0.744
Token these
Token ID777
Feature activation+2.54e-01

Pos logit contributions
 acquaintance+0.253
 ambush+0.238
 introdu+0.237
 equation+0.223
 assail+0.222
Neg logit contributions
OM-0.341
hers-0.323
yk-0.321
Mand-0.315
kefeller-0.310
Token.
Token ID13
Feature activation-1.93e-01

Pos logit contributions
 acquaintance+1.181
 ambush+1.106
 introdu+1.106
 equation+1.045
 jealousy+1.035
Neg logit contributions
OM-1.497
hers-1.420
yk-1.412
Mand-1.384
kefeller-1.365
Token For
Token ID1114
Feature activation+6.20e-02

Pos logit contributions
 acquaintance+0.512
 ambush+0.481
 introdu+0.480
 equation+0.452
 jealousy+0.449
Neg logit contributions
OM-0.671
hers-0.637
yk-0.633
Mand-0.618
kefeller-0.613
Token example
Token ID1672
Feature activation-3.03e-02

Pos logit contributions
OM+0.223
hers+0.212
yk+0.210
Mand+0.207
kefeller+0.203
Neg logit contributions
 acquaintance-0.157
 ambush-0.147
 introdu-0.146
 equation-0.138
 assail-0.138
Token,
Token ID11
Feature activation-1.49e-01

Pos logit contributions
 acquaintance+0.085
 ambush+0.080
 introdu+0.079
 equation+0.075
 assail+0.075
Neg logit contributions
OM-0.101
hers-0.096
yk-0.095
Mand-0.094
kefeller-0.091
Token El
Token ID2574
Feature activation-4.63e-01

Pos logit contributions
 acquaintance+0.413
 ambush+0.388
 introdu+0.386
 equation+0.366
 assail+0.364
Neg logit contributions
OM-0.502
hers-0.475
yk-0.472
Mand-0.463
kefeller-0.454
Tokenj
Token ID73
Feature activation-1.36e+00

Pos logit contributions
 acquaintance+1.435
 ambush+1.359
 introdu+1.355
 equation+1.290
 assail+1.281
Neg logit contributions
OM-1.406
hers-1.321
yk-1.311
Mand-1.283
kefeller-1.263
Tokenig
Token ID328
Feature activation+3.50e-01

Pos logit contributions
 acquaintance+4.499
 ambush+4.260
 introdu+4.246
aign+4.087
 equation+4.071
Neg logit contributions
OM-3.665
hers-3.415
yk-3.362
Mand-3.341
kefeller-3.326
Tokenide
Token ID485
Feature activation-4.53e-01

Pos logit contributions
OM+1.113
hers+1.053
yk+1.047
Mand+1.022
kefeller+0.992
Neg logit contributions
 acquaintance-1.039
 ambush-0.982
 introdu-0.977
 assail-0.929
 equation-0.927
Tokeny
Token ID88
Feature activation+6.02e-02

Pos logit contributions
 acquaintance+1.312
 introdu+1.240
 ambush+1.240
 equation+1.173
 jealousy+1.163
Neg logit contributions
OM-1.463
hers-1.387
yk-1.366
Mand-1.338
kefeller-1.333
Token was
Token ID373
Feature activation-7.32e-03

Pos logit contributions
OM+0.197
hers+0.186
yk+0.186
Mand+0.182
kefeller+0.178
Neg logit contributions
 acquaintance-0.173
 ambush-0.163
 introdu-0.162
 equation-0.153
 assail-0.153
Token not
Token ID407
Feature activation-3.88e-01

Pos logit contributions
 acquaintance+0.020
 ambush+0.019
 introdu+0.018
 equation+0.017
 assail+0.017
Neg logit contributions
OM-0.025
hers-0.024
yk-0.024
Mand-0.023
kefeller-0.023
Token their
Token ID511
Feature activation+1.01e-01

Pos logit contributions
OM+0.057
hers+0.054
yk+0.053
Mand+0.052
kefeller+0.051
Neg logit contributions
 acquaintance-0.045
 ambush-0.042
 introdu-0.042
 assail-0.040
 equation-0.040
Token own
Token ID898
Feature activation+1.95e-01

Pos logit contributions
OM+0.340
hers+0.321
yk+0.319
Mand+0.312
kefeller+0.306
Neg logit contributions
 acquaintance-0.278
 ambush-0.262
 introdu-0.261
 assail-0.247
 equation-0.246
Token code
Token ID2438
Feature activation+4.14e-01

Pos logit contributions
OM+0.680
hers+0.642
yk+0.638
Mand+0.624
kefeller+0.612
Neg logit contributions
 acquaintance-0.518
 ambush-0.486
 introdu-0.482
 assail-0.457
 equation-0.452
Token.
Token ID13
Feature activation-2.16e-01

Pos logit contributions
OM+1.412
hers+1.330
yk+1.322
Mand+1.281
kefeller+1.253
Neg logit contributions
 acquaintance-1.138
 ambush-1.070
 introdu-1.057
 assail-1.010
 jealousy-1.001
Token So
Token ID1406
Feature activation-1.57e-01

Pos logit contributions
 acquaintance+0.559
 ambush+0.524
 introdu+0.522
 equation+0.493
 jealousy+0.488
Neg logit contributions
OM-0.766
hers-0.728
yk-0.724
Mand-0.710
kefeller-0.704
Token let
Token ID1309
Feature activation-1.36e+00

Pos logit contributions
 acquaintance+0.406
 ambush+0.381
 introdu+0.379
 equation+0.358
 jealousy+0.354
Neg logit contributions
OM-0.557
hers-0.529
yk-0.526
Mand-0.516
kefeller-0.510
Token
Token ID447
Feature activation-5.48e-02

Pos logit contributions
 acquaintance+2.975
 introdu+2.788
 ambush+2.753
 equation+2.609
 jealousy+2.570
Neg logit contributions
OM-5.264
yk-5.011
hers-4.978
Mand-4.921
kefeller-4.882
Token
Token ID247
Feature activation-2.42e-01

Pos logit contributions
 acquaintance+0.191
 ambush+0.182
 introdu+0.181
 equation+0.174
 <-+0.173
Neg logit contributions
OM-0.145
yk-0.134
hers-0.134
kefeller-0.131
Mand-0.130
Tokens
Token ID82
Feature activation+1.12e-01

Pos logit contributions
 acquaintance+0.842
 ambush+0.803
 introdu+0.801
 equation+0.767
 assail+0.762
Neg logit contributions
OM-0.641
hers-0.596
yk-0.592
Mand-0.577
kefeller-0.567
Token try
Token ID1949
Feature activation+3.73e-01

Pos logit contributions
OM+0.416
hers+0.396
yk+0.393
Mand+0.386
kefeller+0.380
Neg logit contributions
 acquaintance-0.272
 ambush-0.253
 introdu-0.252
 equation-0.236
 assail-0.235
Token to
Token ID284
Feature activation-4.37e-01

Pos logit contributions
OM+1.216
hers+1.148
yk+1.137
Mand+1.115
kefeller+1.089
Neg logit contributions
 acquaintance-1.074
 ambush-1.012
 introdu-1.006
 equation-0.953
 assail-0.952
Token pulls
Token ID16194
Feature activation+1.37e-01

Pos logit contributions
OM+0.364
hers+0.346
yk+0.344
Mand+0.338
kefeller+0.333
Neg logit contributions
 acquaintance-0.250
 ambush-0.233
 introdu-0.232
 equation-0.218
 assail-0.217
Token few
Token ID1178
Feature activation-1.27e-01

Pos logit contributions
OM+0.465
hers+0.440
yk+0.437
Mand+0.429
kefeller+0.422
Neg logit contributions
 acquaintance-0.376
 ambush-0.354
 introdu-0.353
 equation-0.333
 assail-0.331
Token punches
Token ID25495
Feature activation-5.57e-02

Pos logit contributions
 acquaintance+0.329
 ambush+0.308
 introdu+0.308
 equation+0.290
 jealousy+0.288
Neg logit contributions
OM-0.451
hers-0.427
yk-0.426
Mand-0.418
kefeller-0.412
Token,
Token ID11
Feature activation-7.37e-02

Pos logit contributions
 acquaintance+0.157
 ambush+0.148
 introdu+0.147
 equation+0.139
 assail+0.138
Neg logit contributions
OM-0.186
hers-0.176
yk-0.174
Mand-0.171
kefeller-0.168
Token so
Token ID523
Feature activation-1.30e-01

Pos logit contributions
 acquaintance+0.200
 ambush+0.188
 introdu+0.188
 equation+0.177
 assail+0.176
Neg logit contributions
OM-0.253
hers-0.239
yk-0.238
Mand-0.233
kefeller-0.229
Token caveat
Token ID36531
Feature activation-1.35e+00

Pos logit contributions
 acquaintance+0.351
 ambush+0.329
 introdu+0.328
 equation+0.310
 assail+0.308
Neg logit contributions
OM-0.450
hers-0.426
yk-0.422
Mand-0.415
kefeller-0.407
Token empt
Token ID23909
Feature activation+1.17e-01

Pos logit contributions
 acquaintance+5.091
 introdu+4.843
 ambush+4.813
 equation+4.639
 <-+4.630
Neg logit contributions
OM-3.131
kefeller-2.860
yk-2.854
hers-2.811
Mand-2.761
Tokenor
Token ID273
Feature activation-1.92e-01

Pos logit contributions
OM+0.294
hers+0.273
yk+0.270
Mand+0.262
kefeller+0.256
Neg logit contributions
 acquaintance-0.424
 ambush-0.404
 introdu-0.403
 equation-0.387
 assail-0.386
Token.)
Token ID2014
Feature activation-2.24e-01

Pos logit contributions
 acquaintance+0.573
 ambush+0.541
 introdu+0.540
 equation+0.513
 jealousy+0.510
Neg logit contributions
OM-0.604
hers-0.569
yk-0.566
Mand-0.553
kefeller-0.547
Token
Token ID198
Feature activation-8.80e-02

Pos logit contributions
 acquaintance+0.586
 ambush+0.550
 introdu+0.549
 equation+0.517
 jealousy+0.515
Neg logit contributions
OM-0.785
hers-0.744
yk-0.744
Mand-0.724
kefeller-0.723
Token
Token ID198
Feature activation-4.86e-02

Pos logit contributions
 acquaintance+0.237
 ambush+0.223
 introdu+0.222
 equation+0.210
 jealousy+0.208
Neg logit contributions
OM-0.302
hers-0.287
yk-0.285
Mand-0.279
kefeller-0.277
Tokenpat
Token ID8071
Feature activation+2.97e-02

Pos logit contributions
OM+1.810
hers+1.712
yk+1.693
Mand+1.649
kefeller+1.605
Neg logit contributions
 acquaintance-1.851
 introdu-1.757
 ambush-1.748
 equation-1.667
 jealousy-1.662
Tokenul
Token ID377
Feature activation-2.38e-01

Pos logit contributions
OM+0.094
hers+0.088
yk+0.088
Mand+0.086
kefeller+0.084
Neg logit contributions
 acquaintance-0.089
 ambush-0.084
 introdu-0.084
 equation-0.080
 assail-0.079
Tokenoy
Token ID726
Feature activation+1.70e-01

Pos logit contributions
 acquaintance+0.710
 ambush+0.671
 introdu+0.668
 equation+0.634
 assail+0.631
Neg logit contributions
OM-0.754
hers-0.710
yk-0.705
Mand-0.690
kefeller-0.679
Token lang
Token ID42392
Feature activation-2.90e-01

Pos logit contributions
OM+0.567
hers+0.535
yk+0.533
Mand+0.522
kefeller+0.510
Neg logit contributions
 acquaintance-0.478
 ambush-0.450
 introdu-0.448
 equation-0.425
 assail-0.422
Token n
Token ID299
Feature activation-9.75e-01

Pos logit contributions
 acquaintance+0.801
 ambush+0.753
 introdu+0.752
 equation+0.709
 assail+0.706
Neg logit contributions
OM-0.985
hers-0.931
yk-0.925
Mand-0.906
kefeller-0.895
Tokenatin
Token ID10680
Feature activation-1.35e+00

Pos logit contributions
 acquaintance+3.041
 ambush+2.898
 introdu+2.859
 assail+2.714
 <-+2.712
Neg logit contributions
OM-2.886
hers-2.677
Mand-2.624
kefeller-2.615
yk-2.611
Token ang
Token ID3550
Feature activation-6.83e-01

Pos logit contributions
 acquaintance+3.760
 ambush+3.567
 introdu+3.556
 assail+3.348
 equation+3.340
Neg logit contributions
OM-4.406
hers-4.141
yk-4.083
kefeller-4.054
Mand-4.011
Token at
Token ID379
Feature activation-3.76e-01

Pos logit contributions
 acquaintance+1.915
 introdu+1.802
 ambush+1.802
 equation+1.700
 assail+1.690
Neg logit contributions
OM-2.281
hers-2.153
yk-2.132
kefeller-2.084
Mand-2.084
Tokening
Token ID278
Feature activation+3.15e-02

Pos logit contributions
 acquaintance+1.201
 ambush+1.139
 introdu+1.137
 equation+1.082
 assail+1.076
Neg logit contributions
OM-1.107
hers-1.038
yk-1.026
Mand-1.005
kefeller-0.992
Token k
Token ID479
Feature activation-3.67e-01

Pos logit contributions
OM+0.111
hers+0.105
yk+0.105
Mand+0.103
kefeller+0.101
Neg logit contributions
 acquaintance-0.082
 ambush-0.077
 introdu-0.077
 equation-0.072
 assail-0.072
Tokenamp
Token ID696
Feature activation-3.28e-01

Pos logit contributions
 acquaintance+1.161
 ambush+1.102
 introdu+1.095
 equation+1.044
 assail+1.039
Neg logit contributions
OM-1.088
hers-1.020
yk-1.009
Mand-0.991
kefeller-0.977
Token replace
Token ID6330
Feature activation+3.28e-01

Pos logit contributions
 acquaintance+0.045
 ambush+0.041
 introdu+0.041
 equation+0.038
 assail+0.038
Neg logit contributions
OM-0.093
hers-0.089
yk-0.089
Mand-0.087
kefeller-0.086
Token ObamaCare
Token ID38195
Feature activation-1.43e-01

Pos logit contributions
OM+0.965
hers+0.905
yk+0.894
Mand+0.880
kefeller+0.855
Neg logit contributions
 acquaintance-1.048
 ambush-0.995
 introdu-0.991
 equation-0.944
 assail-0.941
Token.
Token ID13
Feature activation-1.50e-01

Pos logit contributions
 acquaintance+0.389
 ambush+0.366
 introdu+0.365
 equation+0.345
 assail+0.342
Neg logit contributions
OM-0.486
hers-0.461
yk-0.459
Mand-0.449
kefeller-0.445
Token He
Token ID679
Feature activation+6.48e-02

Pos logit contributions
 acquaintance+0.397
 ambush+0.374
 introdu+0.372
 equation+0.351
 assail+0.349
Neg logit contributions
OM-0.520
hers-0.494
yk-0.492
Mand-0.481
kefeller-0.478
Token told
Token ID1297
Feature activation-3.42e-01

Pos logit contributions
OM+0.216
hers+0.204
yk+0.203
Mand+0.199
kefeller+0.196
Neg logit contributions
 acquaintance-0.183
 ambush-0.172
 introdu-0.171
 equation-0.162
 assail-0.161
Token CBS
Token ID11133
Feature activation-1.35e+00

Pos logit contributions
 acquaintance+0.984
 ambush+0.927
 introdu+0.924
 equation+0.876
 assail+0.870
Neg logit contributions
OM-1.117
hers-1.054
yk-1.048
Mand-1.027
kefeller-1.009
Token
Token ID447
Feature activation-1.38e-01

Pos logit contributions
 acquaintance+3.542
 ambush+3.311
 introdu+3.282
 equation+3.081
 assail+3.074
Neg logit contributions
OM-4.626
yk-4.438
hers-4.436
Mand-4.339
kefeller-4.279
Token
Token ID247
Feature activation-2.16e-01

Pos logit contributions
 acquaintance+0.486
 ambush+0.463
 introdu+0.461
 equation+0.443
 <-+0.440
Neg logit contributions
OM-0.358
yk-0.333
hers-0.330
kefeller-0.325
Mand-0.322
Tokens
Token ID82
Feature activation-2.20e-01

Pos logit contributions
 acquaintance+0.758
 ambush+0.722
 introdu+0.720
 equation+0.690
 assail+0.687
Neg logit contributions
OM-0.567
hers-0.528
yk-0.524
Mand-0.510
kefeller-0.500
Token John
Token ID1757
Feature activation+3.49e-01

Pos logit contributions
 acquaintance+0.677
 ambush+0.640
 introdu+0.638
 equation+0.608
 jealousy+0.603
Neg logit contributions
OM-0.677
hers-0.638
yk-0.634
Mand-0.620
kefeller-0.611
Token Dick
Token ID11740
Feature activation-3.31e-01

Pos logit contributions
OM+0.897
hers+0.831
yk+0.827
Mand+0.803
kefeller+0.799
Neg logit contributions
 acquaintance-1.252
 ambush-1.190
 introdu-1.186
 equation-1.141
 jealousy-1.127
Token confident
Token ID6563
Feature activation-2.59e-01

Pos logit contributions
 acquaintance+0.279
 ambush+0.263
 introdu+0.262
 equation+0.248
 assail+0.247
Neg logit contributions
OM-0.339
hers-0.321
yk-0.318
Mand-0.311
kefeller-0.305
Token investors
Token ID7713
Feature activation-3.68e-01

Pos logit contributions
 acquaintance+0.696
 ambush+0.653
 introdu+0.645
 assail+0.612
 equation+0.611
Neg logit contributions
OM-0.899
hers-0.857
yk-0.842
Mand-0.825
kefeller-0.810
Token.
Token ID13
Feature activation-1.30e-01

Pos logit contributions
 acquaintance+1.000
 ambush+0.940
 introdu+0.937
 equation+0.885
 assail+0.879
Neg logit contributions
OM-1.261
hers-1.194
yk-1.187
Mand-1.163
kefeller-1.145
Token
Token ID198
Feature activation+2.62e-02

Pos logit contributions
 acquaintance+0.344
 ambush+0.323
 introdu+0.322
 equation+0.304
 assail+0.302
Neg logit contributions
OM-0.453
hers-0.429
yk-0.426
Mand-0.418
kefeller-0.411
Token
Token ID198
Feature activation+1.40e-01

Pos logit contributions
OM+0.089
hers+0.084
yk+0.083
Mand+0.082
kefeller+0.080
Neg logit contributions
 acquaintance-0.072
 ambush-0.068
 introdu-0.068
 equation-0.064
 assail-0.064
TokenDisclaimer
Token ID19618
Feature activation-1.32e+00

Pos logit contributions
OM+0.415
hers+0.388
yk+0.385
Mand+0.377
kefeller+0.368
Neg logit contributions
 acquaintance-0.446
 ambush-0.423
 introdu-0.422
 equation-0.402
 assail-0.401
Token:
Token ID25
Feature activation+1.20e-01

Pos logit contributions
 acquaintance+3.557
 introdu+3.353
 ambush+3.305
 equation+3.182
 <-+3.166
Neg logit contributions
OM-4.438
hers-4.226
yk-4.186
Mand-4.129
kefeller-4.093
Token ©
Token ID10673
Feature activation+1.98e-01

Pos logit contributions
OM+0.431
hers+0.408
yk+0.405
Mand+0.398
kefeller+0.389
Neg logit contributions
 acquaintance-0.311
 ambush-0.291
 introdu-0.288
 equation-0.272
 assail-0.271
Token 2015
Token ID1853
Feature activation+9.44e-02

Pos logit contributions
OM+0.667
hers+0.628
yk+0.624
Mand+0.612
kefeller+0.601
Neg logit contributions
 acquaintance-0.554
 ambush-0.521
 introdu-0.519
 equation-0.491
 assail-0.489
Token Money
Token ID12911
Feature activation+2.77e-01

Pos logit contributions
OM+0.336
hers+0.316
yk+0.314
Mand+0.307
kefeller+0.302
Neg logit contributions
 acquaintance-0.248
 ambush-0.231
 introdu-0.230
 equation-0.216
 assail-0.216
Token Morning
Token ID14410
Feature activation-7.07e-01

Pos logit contributions
OM+1.161
hers+1.107
yk+1.104
Mand+1.087
kefeller+1.065
Neg logit contributions
 acquaintance-0.549
 ambush-0.502
 introdu-0.500
 assail-0.458
 equation-0.457
Token
Token ID198
Feature activation-4.06e-02

Pos logit contributions
OM+1.336
hers+1.259
yk+1.247
Mand+1.226
kefeller+1.195
Neg logit contributions
 acquaintance-1.111
 ambush-1.044
 introdu-1.040
 assail-0.981
 equation-0.979
Token
Token ID198
Feature activation-7.10e-02

Pos logit contributions
 acquaintance+0.109
 ambush+0.103
 introdu+0.102
 equation+0.097
 jealousy+0.096
Neg logit contributions
OM-0.140
hers-0.132
yk-0.132
Mand-0.129
kefeller-0.128
TokenThere
Token ID1858
Feature activation+8.37e-02

Pos logit contributions
 acquaintance+0.228
 ambush+0.216
 introdu+0.215
 equation+0.206
 jealousy+0.205
Neg logit contributions
OM-0.205
hers-0.194
yk-0.192
kefeller-0.187
Mand-0.186
Token is
Token ID318
Feature activation-1.56e-02

Pos logit contributions
OM+0.283
hers+0.268
yk+0.265
Mand+0.259
kefeller+0.254
Neg logit contributions
 acquaintance-0.231
 ambush-0.217
 introdu-0.217
 assail-0.204
 jealousy-0.204
Token no
Token ID645
Feature activation-2.76e-01

Pos logit contributions
 acquaintance+0.041
 ambush+0.038
 introdu+0.038
 assail+0.036
 equation+0.036
Neg logit contributions
OM-0.055
hers-0.052
yk-0.052
Mand-0.051
kefeller-0.050
Token chance
Token ID2863
Feature activation-1.32e+00

Pos logit contributions
 acquaintance+0.741
 ambush+0.695
 introdu+0.690
 equation+0.663
 jealousy+0.647
Neg logit contributions
OM-0.961
hers-0.914
yk-0.907
Mand-0.888
kefeller-0.879
Token that
Token ID326
Feature activation+7.66e-02

Pos logit contributions
 acquaintance+3.105
 introdu+2.879
 ambush+2.860
 equation+2.762
 jealousy+2.655
Neg logit contributions
OM-4.926
yk-4.755
hers-4.718
kefeller-4.643
Mand-4.634
Token you
Token ID345
Feature activation-1.47e-02

Pos logit contributions
OM+0.256
hers+0.242
yk+0.240
Mand+0.235
kefeller+0.230
Neg logit contributions
 acquaintance-0.214
 ambush-0.202
 introdu-0.201
 assail-0.190
 equation-0.189
Token'll
Token ID1183
Feature activation-2.66e-01

Pos logit contributions
 acquaintance+0.042
 ambush+0.040
 introdu+0.040
 equation+0.038
 assail+0.038
Neg logit contributions
OM-0.048
hers-0.045
yk-0.045
Mand-0.044
kefeller-0.043
Token be
Token ID307
Feature activation+1.07e-01

Pos logit contributions
 acquaintance+0.656
 ambush+0.613
 introdu+0.609
 equation+0.573
 assail+0.569
Neg logit contributions
OM-0.979
hers-0.930
yk-0.925
Mand-0.908
kefeller-0.895
Token flung
Token ID45111
Feature activation+1.55e-01

Pos logit contributions
OM+0.372
hers+0.352
yk+0.350
Mand+0.343
kefeller+0.337
Neg logit contributions
 acquaintance-0.283
 ambush-0.266
 introdu-0.265
 equation-0.250
 assail-0.249
Token would
Token ID561
Feature activation+4.27e-02

Pos logit contributions
 acquaintance+0.041
 ambush+0.039
 introdu+0.039
 equation+0.037
 assail+0.037
Neg logit contributions
OM-0.052
hers-0.049
yk-0.049
Mand-0.048
kefeller-0.047
Token be
Token ID307
Feature activation+1.52e-01

Pos logit contributions
OM+0.151
hers+0.142
yk+0.141
Mand+0.139
kefeller+0.136
Neg logit contributions
 acquaintance-0.112
 ambush-0.105
 introdu-0.105
 assail-0.099
 equation-0.099
Token).
Token ID737
Feature activation-2.79e-01

Pos logit contributions
OM+0.515
hers+0.487
yk+0.484
Mand+0.474
kefeller+0.465
Neg logit contributions
 acquaintance-0.421
 ambush-0.397
 introdu-0.395
 equation-0.374
 assail-0.372
Token
Token ID198
Feature activation-2.44e-01

Pos logit contributions
 acquaintance+0.803
 ambush+0.758
 introdu+0.754
 equation+0.713
 assail+0.713
Neg logit contributions
OM-0.914
hers-0.862
yk-0.856
Mand-0.839
kefeller-0.820
Token
Token ID198
Feature activation-1.71e-01

Pos logit contributions
 acquaintance+0.650
 ambush+0.612
 introdu+0.608
 equation+0.575
 <-+0.571
Neg logit contributions
OM-0.844
hers-0.801
yk-0.797
Mand-0.780
kefeller-0.777
TokenDisclaimer
Token ID19618
Feature activation-1.32e+00

Pos logit contributions
 acquaintance+0.538
 ambush+0.510
 introdu+0.509
 equation+0.484
 jealousy+0.482
Neg logit contributions
OM-0.509
hers-0.481
yk-0.477
kefeller-0.463
Mand-0.463
Token:
Token ID25
Feature activation+1.54e-02

Pos logit contributions
 acquaintance+3.568
 introdu+3.354
 ambush+3.314
 equation+3.185
 <-+3.161
Neg logit contributions
OM-4.421
hers-4.194
yk-4.174
Mand-4.102
kefeller-4.084
Token These
Token ID2312
Feature activation+1.49e-01

Pos logit contributions
OM+0.055
hers+0.052
yk+0.052
Mand+0.051
kefeller+0.050
Neg logit contributions
 acquaintance-0.039
 ambush-0.037
 introdu-0.037
 equation-0.035
 assail-0.034
Token tips
Token ID9040
Feature activation+1.16e-01

Pos logit contributions
OM+0.525
hers+0.497
yk+0.494
Mand+0.485
kefeller+0.476
Neg logit contributions
 acquaintance-0.392
 ambush-0.368
 introdu-0.365
 equation-0.345
 assail-0.343
Token are
Token ID389
Feature activation+1.39e-01

Pos logit contributions
OM+0.394
hers+0.373
yk+0.370
Mand+0.362
kefeller+0.356
Neg logit contributions
 acquaintance-0.320
 ambush-0.300
 introdu-0.299
 equation-0.282
 assail-0.281
Token based
Token ID1912
Feature activation-2.01e-01

Pos logit contributions
OM+0.486
hers+0.461
yk+0.457
Mand+0.449
kefeller+0.441
Neg logit contributions
 acquaintance-0.371
 ambush-0.348
 introdu-0.346
 equation-0.327
 assail-0.325
Token rather
Token ID2138
Feature activation-4.81e-01

Pos logit contributions
 acquaintance+0.018
 ambush+0.017
 introdu+0.017
 equation+0.016
 assail+0.016
Neg logit contributions
OM-0.023
hers-0.021
yk-0.021
Mand-0.021
kefeller-0.020
Token than
Token ID621
Feature activation-3.08e-01

Pos logit contributions
 acquaintance+1.199
 introdu+1.119
 ambush+1.117
 equation+1.050
 assail+1.048
Neg logit contributions
OM-1.755
hers-1.668
yk-1.657
Mand-1.625
kefeller-1.593
Token a
Token ID257
Feature activation-1.66e-01

Pos logit contributions
 acquaintance+0.793
 ambush+0.742
 introdu+0.739
 equation+0.696
 assail+0.692
Neg logit contributions
OM-1.101
hers-1.044
yk-1.038
Mand-1.018
kefeller-1.003
Token victim
Token ID3117
Feature activation+2.33e-01

Pos logit contributions
 acquaintance+0.404
 ambush+0.377
 introdu+0.375
 equation+0.352
 assail+0.349
Neg logit contributions
OM-0.616
hers-0.586
yk-0.582
Mand-0.571
kefeller-0.564
Token.
Token ID13
Feature activation-1.95e-01

Pos logit contributions
OM+0.759
hers+0.709
yk+0.704
Mand+0.691
kefeller+0.671
Neg logit contributions
 acquaintance-0.672
 ambush-0.637
 introdu-0.636
 jealousy-0.601
 assail-0.600
Token Again
Token ID6521
Feature activation-1.32e+00

Pos logit contributions
 acquaintance+0.498
 ambush+0.466
 introdu+0.465
 equation+0.439
 jealousy+0.435
Neg logit contributions
OM-0.694
hers-0.660
yk-0.657
Mand-0.642
kefeller-0.639
Token,
Token ID11
Feature activation-4.53e-01

Pos logit contributions
 acquaintance+3.352
 introdu+3.125
 ambush+3.113
 equation+2.954
 assail+2.921
Neg logit contributions
OM-4.656
yk-4.418
hers-4.402
Mand-4.314
kefeller-4.302
Token the
Token ID262
Feature activation-7.96e-02

Pos logit contributions
 acquaintance+1.242
 introdu+1.165
 ambush+1.164
 equation+1.102
 jealousy+1.094
Neg logit contributions
OM-1.539
hers-1.456
yk-1.452
Mand-1.421
kefeller-1.403
Token United
Token ID1578
Feature activation-1.25e-01

Pos logit contributions
 acquaintance+0.220
 ambush+0.207
 introdu+0.207
 equation+0.195
 assail+0.194
Neg logit contributions
OM-0.269
hers-0.254
yk-0.253
Mand-0.248
kefeller-0.244
Token States
Token ID1829
Feature activation-5.13e-04

Pos logit contributions
 acquaintance+0.345
 ambush+0.328
 introdu+0.325
 assail+0.306
 jealousy+0.305
Neg logit contributions
OM-0.423
hers-0.400
yk-0.397
Mand-0.387
kefeller-0.382
Token did
Token ID750
Feature activation+3.13e-01

Pos logit contributions
 acquaintance+0.001
 ambush+0.001
 introdu+0.001
 equation+0.001
 assail+0.001
Neg logit contributions
OM-0.002
hers-0.002
yk-0.002
Mand-0.002
kefeller-0.002
Token the
Token ID262
Feature activation-2.01e-01

Pos logit contributions
 acquaintance+0.948
 ambush+0.894
 introdu+0.890
 equation+0.843
 jealousy+0.838
Neg logit contributions
OM-1.157
hers-1.094
yk-1.091
Mand-1.066
kefeller-1.057
Token most
Token ID749
Feature activation-1.30e-01

Pos logit contributions
 acquaintance+0.532
 ambush+0.499
 introdu+0.497
 equation+0.469
 jealousy+0.465
Neg logit contributions
OM-0.704
hers-0.667
yk-0.663
Mand-0.650
kefeller-0.641
Token significant
Token ID2383
Feature activation-1.74e-01

Pos logit contributions
 acquaintance+0.344
 ambush+0.323
 introdu+0.322
 equation+0.303
 assail+0.301
Neg logit contributions
OM-0.458
hers-0.434
yk-0.431
Mand-0.423
kefeller-0.416
Token show
Token ID905
Feature activation-6.10e-01

Pos logit contributions
 acquaintance+0.481
 ambush+0.453
 introdu+0.451
 equation+0.427
 assail+0.424
Neg logit contributions
OM-0.587
hers-0.556
yk-0.552
Mand-0.541
kefeller-0.533
Token?
Token ID30
Feature activation+9.98e-02

Pos logit contributions
 acquaintance+1.562
 ambush+1.464
 introdu+1.464
 equation+1.382
 jealousy+1.358
Neg logit contributions
OM-2.169
hers-2.065
yk-2.048
Mand-2.017
kefeller-2.005
Token Again
Token ID6521
Feature activation-1.31e+00

Pos logit contributions
OM+0.332
hers+0.314
yk+0.312
Mand+0.306
kefeller+0.300
Neg logit contributions
 acquaintance-0.281
 ambush-0.265
 introdu-0.264
 equation-0.250
 assail-0.249
Token,
Token ID11
Feature activation-5.26e-01

Pos logit contributions
 acquaintance+3.302
 ambush+3.072
 introdu+3.048
 equation+2.893
 jealousy+2.866
Neg logit contributions
OM-4.670
yk-4.455
kefeller-4.427
hers-4.427
Mand-4.352
Token title
Token ID3670
Feature activation-6.06e-01

Pos logit contributions
 acquaintance+1.393
 ambush+1.302
 introdu+1.298
 equation+1.228
 jealousy+1.219
Neg logit contributions
OM-1.838
hers-1.743
yk-1.739
Mand-1.704
kefeller-1.700
Token is
Token ID318
Feature activation-1.48e-01

Pos logit contributions
 acquaintance+1.630
 introdu+1.532
 ambush+1.529
 equation+1.456
 jealousy+1.445
Neg logit contributions
OM-2.072
hers-1.974
yk-1.955
Mand-1.931
kefeller-1.920
Token important
Token ID1593
Feature activation-1.35e-01

Pos logit contributions
 acquaintance+0.427
 ambush+0.403
 introdu+0.401
 equation+0.381
 jealousy+0.379
Neg logit contributions
OM-0.484
hers-0.456
yk-0.456
Mand-0.445
kefeller-0.441
Token)
Token ID8
Feature activation-1.86e-01

Pos logit contributions
 acquaintance+0.385
 ambush+0.363
 introdu+0.362
 equation+0.343
 assail+0.341
Neg logit contributions
OM-0.442
hers-0.418
yk-0.415
Mand-0.407
kefeller-0.400
Tokener
Token ID263
Feature activation+2.63e-01

Pos logit contributions
 acquaintance+0.548
 ambush+0.507
 introdu+0.503
 equation+0.472
 assail+0.466
Neg logit contributions
OM-0.844
hers-0.801
yk-0.799
Mand-0.786
kefeller-0.783
Token,
Token ID11
Feature activation+2.74e-01

Pos logit contributions
OM+0.916
hers+0.876
yk+0.860
Mand+0.839
kefeller+0.814
Neg logit contributions
 acquaintance-0.691
 ambush-0.650
 introdu-0.650
 equation-0.618
 assail-0.617
Token according
Token ID1864
Feature activation+4.27e-01

Pos logit contributions
OM+0.899
hers+0.849
yk+0.842
Mand+0.825
kefeller+0.808
Neg logit contributions
 acquaintance-0.782
 ambush-0.737
 introdu-0.735
 equation-0.698
 assail-0.693
Token to
Token ID284
Feature activation+4.28e-01

Pos logit contributions
OM+1.459
hers+1.381
yk+1.370
Mand+1.344
kefeller+1.320
Neg logit contributions
 acquaintance-1.166
 ambush-1.096
 introdu-1.092
 equation-1.033
 assail-1.029
Token a
Token ID257
Feature activation+2.01e-03

Pos logit contributions
OM+1.393
hers+1.323
yk+1.299
Mand+1.275
kefeller+1.238
Neg logit contributions
 acquaintance-1.220
 introdu-1.151
 ambush-1.149
 equation-1.097
 assail-1.093
Token CNN
Token ID8100
Feature activation-1.31e+00

Pos logit contributions
OM+0.007
hers+0.007
yk+0.006
Mand+0.006
kefeller+0.006
Neg logit contributions
 acquaintance-0.005
 ambush-0.005
 introdu-0.005
 equation-0.005
 assail-0.005
Token reporter
Token ID9095
Feature activation-6.78e-01

Pos logit contributions
 acquaintance+2.884
 ambush+2.669
 introdu+2.625
 equation+2.435
 assail+2.380
Neg logit contributions
OM-5.078
hers-4.932
yk-4.872
kefeller-4.842
Mand-4.807
Token.
Token ID13
Feature activation-3.59e-01

Pos logit contributions
 acquaintance+1.855
 ambush+1.743
 introdu+1.737
 equation+1.637
 jealousy+1.627
Neg logit contributions
OM-2.313
hers-2.190
yk-2.176
Mand-2.137
kefeller-2.110
Token
Token ID198
Feature activation-3.08e-01

Pos logit contributions
 acquaintance+0.971
 ambush+0.914
 introdu+0.909
 equation+0.859
 assail+0.854
Neg logit contributions
OM-1.231
hers-1.165
yk-1.160
Mand-1.135
kefeller-1.120
Token
Token ID198
Feature activation-1.66e-01

Pos logit contributions
 acquaintance+0.801
 ambush+0.758
 introdu+0.749
 <-+0.708
 equation+0.708
Neg logit contributions
OM-1.075
hers-1.022
yk-1.019
kefeller-1.001
Mand-0.995
TokenA
Token ID32
Feature activation-8.77e-02

Pos logit contributions
 acquaintance+0.565
 ambush+0.539
 introdu+0.536
 equation+0.512
 jealousy+0.511
Neg logit contributions
OM-0.445
hers-0.417
yk-0.415
Mand-0.401
kefeller-0.401
Token they
Token ID484
Feature activation+1.65e-02

Pos logit contributions
 acquaintance+0.289
 introdu+0.271
 ambush+0.271
 equation+0.256
 jealousy+0.254
Neg logit contributions
OM-0.360
hers-0.341
yk-0.339
Mand-0.332
kefeller-0.329
Token saw
Token ID2497
Feature activation-2.60e-01

Pos logit contributions
OM+0.059
hers+0.056
yk+0.056
Mand+0.055
kefeller+0.054
Neg logit contributions
 acquaintance-0.042
 ambush-0.039
 introdu-0.039
 equation-0.037
 assail-0.037
Token as
Token ID355
Feature activation-2.34e-01

Pos logit contributions
 acquaintance+0.612
 ambush+0.567
 introdu+0.563
 equation+0.527
 assail+0.523
Neg logit contributions
OM-0.990
hers-0.941
yk-0.937
Mand-0.920
kefeller-0.907
Token the
Token ID262
Feature activation+3.03e-02

Pos logit contributions
 acquaintance+0.640
 ambush+0.603
 introdu+0.601
 equation+0.567
 jealousy+0.564
Neg logit contributions
OM-0.798
hers-0.756
yk-0.752
Mand-0.736
kefeller-0.725
Token over
Token ID625
Feature activation-6.08e-01

Pos logit contributions
OM+0.104
hers+0.098
yk+0.097
Mand+0.095
kefeller+0.094
Neg logit contributions
 acquaintance-0.083
 ambush-0.078
 introdu-0.077
 equation-0.073
 assail-0.073
Tokenpolit
Token ID34470
Feature activation+1.06e+00

Pos logit contributions
 acquaintance+1.862
 ambush+1.763
 introdu+1.759
 equation+1.674
 jealousy+1.664
Neg logit contributions
OM-1.867
hers-1.758
yk-1.749
Mand-1.702
kefeller-1.683
Tokenic
Token ID291
Feature activation+4.24e-01

Pos logit contributions
OM+2.570
hers+2.429
yk+2.407
Mand+2.317
kefeller+2.235
Neg logit contributions
 acquaintance-3.910
 introdu-3.744
 ambush-3.707
 assail-3.600
 equation-3.549
Tokenisation
Token ID5612
Feature activation+9.08e-02

Pos logit contributions
OM+1.556
hers+1.489
yk+1.472
Mand+1.440
kefeller+1.409
Neg logit contributions
 acquaintance-1.052
 introdu-0.974
 ambush-0.971
 assail-0.913
 equation-0.905
Token of
Token ID286
Feature activation+3.19e-01

Pos logit contributions
OM+0.314
hers+0.298
yk+0.295
Mand+0.290
kefeller+0.284
Neg logit contributions
 acquaintance-0.244
 ambush-0.229
 introdu-0.227
 assail-0.215
 equation-0.214
Token his
Token ID465
Feature activation+2.37e-01

Pos logit contributions
OM+1.026
hers+0.969
yk+0.960
Mand+0.941
kefeller+0.924
Neg logit contributions
 acquaintance-0.936
 ambush-0.884
 introdu-0.879
 equation-0.834
 assail-0.832
Token role
Token ID2597
Feature activation+4.72e-01

Pos logit contributions
OM+0.803
hers+0.757
yk+0.754
Mand+0.740
kefeller+0.723
Neg logit contributions
 acquaintance-0.646
 ambush-0.608
 introdu-0.605
 assail-0.573
 jealousy-0.569
Token It
Token ID632
Feature activation+2.20e-01

Pos logit contributions
OM+0.745
hers+0.705
yk+0.700
Mand+0.687
kefeller+0.675
Neg logit contributions
 acquaintance-0.561
 ambush-0.526
 introdu-0.523
 equation-0.493
 assail-0.492
Token initially
Token ID7317
Feature activation+3.40e-02

Pos logit contributions
OM+0.726
hers+0.688
yk+0.681
Mand+0.666
kefeller+0.654
Neg logit contributions
 acquaintance-0.628
 ambush-0.592
 introdu-0.589
 assail-0.557
 equation-0.557
Token had
Token ID550
Feature activation+2.67e-04

Pos logit contributions
OM+0.097
hers+0.091
yk+0.090
Mand+0.088
kefeller+0.086
Neg logit contributions
 acquaintance-0.112
 ambush-0.106
 introdu-0.106
 equation-0.101
 assail-0.101
Token three
Token ID1115
Feature activation+6.49e-01

Pos logit contributions
OM+0.001
hers+0.001
yk+0.001
Mand+0.001
kefeller+0.001
Neg logit contributions
 acquaintance-0.001
 ambush-0.001
 introdu-0.001
 equation-0.001
 assail-0.001
Token 90
Token ID4101
Feature activation+3.83e-01

Pos logit contributions
OM+2.158
hers+2.047
yk+2.016
Mand+1.985
kefeller+1.935
Neg logit contributions
 acquaintance-1.808
 ambush-1.696
 introdu-1.687
 assail-1.604
 equation-1.600
Token MW
Token ID29961
Feature activation+1.06e+00

Pos logit contributions
OM+1.123
hers+1.051
yk+1.040
Mand+1.013
kefeller+0.987
Neg logit contributions
 acquaintance-1.234
 ambush-1.172
 introdu-1.169
 jealousy-1.110
 assail-1.110
Tokent
Token ID83
Feature activation+1.85e-01

Pos logit contributions
OM+3.225
hers+3.040
yk+3.000
Mand+2.933
kefeller+2.813
Neg logit contributions
 acquaintance-3.220
 ambush-3.016
 introdu-3.003
 assail-2.888
aign-2.888
Token OK
Token ID7477
Feature activation-3.98e-01

Pos logit contributions
OM+0.589
hers+0.556
yk+0.550
Mand+0.539
kefeller+0.528
Neg logit contributions
 acquaintance-0.551
 ambush-0.520
 introdu-0.518
 equation-0.492
 assail-0.490
Token-
Token ID12
Feature activation+2.83e-01

Pos logit contributions
 acquaintance+1.317
 introdu+1.251
 ambush+1.249
 equation+1.195
 assail+1.181
Neg logit contributions
OM-1.105
hers-1.041
yk-1.038
Mand-1.008
kefeller-1.005
Token150
Token ID8628
Feature activation+6.83e-02

Pos logit contributions
OM+0.840
hers+0.787
yk+0.781
Mand+0.764
kefeller+0.745
Neg logit contributions
 acquaintance-0.904
 ambush-0.858
 introdu-0.855
 equation-0.815
 assail-0.812
Token reactors
Token ID28502
Feature activation+3.37e-01

Pos logit contributions
OM+0.205
hers+0.192
yk+0.191
Mand+0.186
kefeller+0.183
Neg logit contributions
 acquaintance-0.215
 ambush-0.204
 introdu-0.203
 equation-0.193
 assail-0.192
Token the
Token ID262
Feature activation-7.95e-02

Pos logit contributions
 acquaintance+0.399
 introdu+0.375
 ambush+0.374
 equation+0.354
 jealousy+0.350
Neg logit contributions
OM-0.484
hers-0.460
yk-0.459
Mand-0.449
kefeller-0.446
Token Jets
Token ID14728
Feature activation-6.12e-01

Pos logit contributions
 acquaintance+0.238
 ambush+0.225
 introdu+0.224
 equation+0.213
 assail+0.212
Neg logit contributions
OM-0.251
hers-0.236
yk-0.235
Mand-0.230
kefeller-0.226
Token want
Token ID765
Feature activation+4.67e-02

Pos logit contributions
 acquaintance+1.447
 ambush+1.372
 introdu+1.342
 equation+1.267
 assail+1.265
Neg logit contributions
OM-2.293
yk-2.191
hers-2.186
Mand-2.144
kefeller-2.133
Token to
Token ID284
Feature activation+2.52e-03

Pos logit contributions
OM+0.157
hers+0.149
yk+0.147
Mand+0.144
kefeller+0.142
Neg logit contributions
 acquaintance-0.131
 ambush-0.123
 introdu-0.122
 equation-0.115
 assail-0.115
Token see
Token ID766
Feature activation-1.28e-01

Pos logit contributions
OM+0.010
hers+0.009
yk+0.009
Mand+0.009
kefeller+0.009
Neg logit contributions
 acquaintance-0.006
 ambush-0.006
 introdu-0.006
 equation-0.005
 assail-0.005
Token playoff
Token ID13119
Feature activation+1.11e+00

Pos logit contributions
 acquaintance+0.348
 introdu+0.328
 ambush+0.327
 equation+0.310
 jealousy+0.308
Neg logit contributions
OM-0.440
hers-0.417
yk-0.417
Mand-0.408
kefeller-0.405
Token hockey
Token ID12217
Feature activation+5.64e-01

Pos logit contributions
OM+3.758
hers+3.553
yk+3.512
Mand+3.456
kefeller+3.377
Neg logit contributions
 acquaintance-3.011
 ambush-2.825
 introdu-2.811
 assail-2.660
 equation-2.653
Token this
Token ID428
Feature activation+2.77e-01

Pos logit contributions
OM+1.946
hers+1.844
yk+1.828
Mand+1.791
kefeller+1.758
Neg logit contributions
 acquaintance-1.521
 ambush-1.428
 introdu-1.420
 equation-1.339
 assail-1.337
Token year
Token ID614
Feature activation+4.75e-01

Pos logit contributions
OM+1.019
hers+0.969
yk+0.962
Mand+0.945
kefeller+0.930
Neg logit contributions
 acquaintance-0.686
 ambush-0.641
 introdu-0.638
 equation-0.598
 assail-0.595
Token,
Token ID11
Feature activation+8.88e-02

Pos logit contributions
OM+1.592
hers+1.505
yk+1.485
Mand+1.460
kefeller+1.412
Neg logit contributions
 acquaintance-1.330
 ambush-1.254
 introdu-1.242
 assail-1.176
 equation-1.170
Token they
Token ID484
Feature activation+1.61e-01

Pos logit contributions
OM+0.300
hers+0.284
yk+0.283
Mand+0.277
kefeller+0.273
Neg logit contributions
 acquaintance-0.245
 ambush-0.231
 introdu-0.230
 equation-0.218
 jealousy-0.216
Token
Token ID198
Feature activation+1.19e-01

Pos logit contributions
 acquaintance+0.187
 ambush+0.176
 introdu+0.175
 equation+0.165
 <-+0.164
Neg logit contributions
OM-0.241
hers-0.228
yk-0.228
Mand-0.222
kefeller-0.222
TokenMemory
Token ID30871
Feature activation+6.83e-01

Pos logit contributions
OM+0.325
hers+0.301
yk+0.298
Mand+0.292
kefeller+0.283
Neg logit contributions
 acquaintance-0.411
 ambush-0.391
 introdu-0.391
 equation-0.373
 assail-0.372
Token:
Token ID25
Feature activation-1.69e-01

Pos logit contributions
OM+2.077
hers+1.905
Mand+1.866
yk+1.828
OSH+1.787
Neg logit contributions
 acquaintance-2.140
 ambush-2.036
 introdu-2.017
 jealousy-1.931
 assail-1.931
Token 2
Token ID362
Feature activation+2.68e-01

Pos logit contributions
 acquaintance+0.454
 ambush+0.427
 introdu+0.425
 equation+0.401
 assail+0.399
Neg logit contributions
OM-0.585
hers-0.554
yk-0.550
Mand-0.540
kefeller-0.531
TokenGB
Token ID4579
Feature activation+7.68e-01

Pos logit contributions
OM+0.971
hers+0.916
yk+0.911
Mand+0.895
kefeller+0.870
Neg logit contributions
 acquaintance-0.684
 ambush-0.635
 introdu-0.633
 assail-0.594
 jealousy-0.594
Token GDDR
Token ID50092
Feature activation+1.39e+00

Pos logit contributions
OM+1.973
hers+1.813
Mand+1.744
yk+1.740
OSH+1.656
Neg logit contributions
 acquaintance-2.777
 ambush-2.633
 introdu-2.630
 assail-2.528
 jealousy-2.521
Token5
Token ID20
Feature activation+3.58e-02

Pos logit contributions
OM+4.127
Mand+3.730
hers+3.725
yk+3.680
OSH+3.508
Neg logit contributions
 acquaintance-4.342
 introdu-4.225
 ambush-4.187
 assail-4.098
 jealousy-4.021
Token -
Token ID532
Feature activation-1.71e-03

Pos logit contributions
OM+0.117
hers+0.110
yk+0.110
Mand+0.108
kefeller+0.105
Neg logit contributions
 acquaintance-0.103
 ambush-0.097
 introdu-0.097
 equation-0.092
 assail-0.092
Token 48
Token ID4764
Feature activation+6.04e-01

Pos logit contributions
 acquaintance+0.005
 ambush+0.004
 introdu+0.004
 equation+0.004
 assail+0.004
Neg logit contributions
OM-0.006
hers-0.006
yk-0.006
Mand-0.005
kefeller-0.005
Token00
Token ID405
Feature activation+4.20e-02

Pos logit contributions
OM+1.847
hers+1.715
yk+1.715
Mand+1.676
kefeller+1.630
Neg logit contributions
 acquaintance-1.875
 introdu-1.774
 ambush-1.771
 assail-1.679
 jealousy-1.679
Token MHz
Token ID19805
Feature activation+7.69e-01

Pos logit contributions
OM+0.108
hers+0.100
yk+0.099
Mand+0.097
kefeller+0.095
Neg logit contributions
 acquaintance-0.150
 ambush-0.143
 introdu-0.143
 equation-0.137
 assail-0.136
TokenMars
Token ID43725
Feature activation-1.92e-01

Pos logit contributions
OM+0.481
hers+0.449
yk+0.445
Mand+0.436
kefeller+0.424
Neg logit contributions
 acquaintance-0.577
 ambush-0.548
 introdu-0.547
 equation-0.523
 assail-0.521
Tokenup
Token ID929
Feature activation+2.89e-01

Pos logit contributions
 acquaintance+0.585
 ambush+0.553
 introdu+0.552
 equation+0.524
 assail+0.523
Neg logit contributions
OM-0.593
hers-0.558
yk-0.555
Mand-0.541
kefeller-0.530
Tokenial
Token ID498
Feature activation+8.55e-02

Pos logit contributions
OM+1.191
hers+1.143
yk+1.136
Mand+1.111
kefeller+1.096
Neg logit contributions
 acquaintance-0.587
 ambush-0.541
 introdu-0.539
 assail-0.495
 equation-0.494
Tokenia
Token ID544
Feature activation+5.17e-01

Pos logit contributions
OM+0.252
hers+0.236
yk+0.235
Mand+0.229
kefeller+0.225
Neg logit contributions
 acquaintance-0.274
 ambush-0.260
 introdu-0.259
 equation-0.247
 assail-0.245
Token:
Token ID25
Feature activation+1.05e-01

Pos logit contributions
OM+1.742
hers+1.646
yk+1.642
Mand+1.604
kefeller+1.565
Neg logit contributions
 acquaintance-1.418
 ambush-1.334
 introdu-1.320
 equation-1.253
 jealousy-1.250
Token Mac
Token ID4100
Feature activation+1.17e+00

Pos logit contributions
OM+0.335
hers+0.315
yk+0.313
Mand+0.307
kefeller+0.300
Neg logit contributions
 acquaintance-0.309
 ambush-0.291
 introdu-0.290
 equation-0.276
 assail-0.275
Tokenrop
Token ID1773
Feature activation+9.12e-01

Pos logit contributions
OM+3.464
hers+3.280
yk+3.245
Mand+3.158
kefeller+3.098
Neg logit contributions
 acquaintance-3.666
 ambush-3.486
 introdu-3.462
 assail-3.343
 jealousy-3.333
Tokenod
Token ID375
Feature activation+8.73e-01

Pos logit contributions
OM+2.606
hers+2.475
yk+2.447
Mand+2.343
kefeller+2.294
Neg logit contributions
 acquaintance-3.001
 introdu-2.844
 ambush-2.841
 jealousy-2.713
 assail-2.706
Tokenidae
Token ID31718
Feature activation+1.55e-01

Pos logit contributions
OM+2.565
hers+2.410
yk+2.409
Mand+2.300
kefeller+2.241
Neg logit contributions
 acquaintance-2.769
 ambush-2.627
 introdu-2.604
 jealousy-2.514
 <--2.502
Token).
Token ID737
Feature activation-3.85e-02

Pos logit contributions
OM+0.450
hers+0.421
yk+0.419
Mand+0.407
kefeller+0.397
Neg logit contributions
 acquaintance-0.504
 ambush-0.478
 introdu-0.475
 equation-0.454
 assail-0.453
Token IV
Token ID8363
Feature activation-9.30e-02

Pos logit contributions
 acquaintance+0.110
 ambush+0.103
 introdu+0.103
 equation+0.097
 assail+0.097
Neg logit contributions
OM-0.127
hers-0.119
yk-0.118
Mand-0.117
kefeller-0.113
Token
Token ID247
Feature activation+6.59e-02

Pos logit contributions
 acquaintance+0.287
 ambush+0.270
 introdu+0.269
 equation+0.257
 <-+0.256
Neg logit contributions
OM-0.303
yk-0.286
hers-0.285
kefeller-0.281
Mand-0.280
Tokens
Token ID82
Feature activation+1.15e-02

Pos logit contributions
OM+0.171
hers+0.159
yk+0.157
Mand+0.153
kefeller+0.150
Neg logit contributions
 acquaintance-0.234
 ambush-0.224
 introdu-0.223
 equation-0.213
 assail-0.213
Token what
Token ID644
Feature activation-3.70e-02

Pos logit contributions
OM+0.041
hers+0.039
yk+0.039
Mand+0.038
kefeller+0.038
Neg logit contributions
 acquaintance-0.030
 ambush-0.028
 introdu-0.028
 equation-0.026
 assail-0.026
Token this
Token ID428
Feature activation+1.12e-01

Pos logit contributions
 acquaintance+0.099
 ambush+0.092
 introdu+0.092
 equation+0.087
 assail+0.086
Neg logit contributions
OM-0.129
hers-0.122
yk-0.122
Mand-0.119
kefeller-0.118
Token campaign
Token ID1923
Feature activation+4.88e-02

Pos logit contributions
OM+0.375
hers+0.354
yk+0.352
Mand+0.345
kefeller+0.339
Neg logit contributions
 acquaintance-0.314
 ambush-0.295
 introdu-0.295
 equation-0.278
 assail-0.277
Token is
Token ID318
Feature activation+2.33e-01

Pos logit contributions
OM+0.169
hers+0.160
yk+0.159
Mand+0.156
kefeller+0.153
Neg logit contributions
 acquaintance-0.131
 ambush-0.123
 introdu-0.123
 equation-0.116
 assail-0.115
Token all
Token ID477
Feature activation-1.26e-02

Pos logit contributions
OM+0.950
hers+0.910
yk+0.901
Mand+0.889
kefeller+0.873
Neg logit contributions
 acquaintance-0.481
 ambush-0.442
 introdu-0.439
 equation-0.407
 assail-0.406
Token about
Token ID546
Feature activation+4.97e-02

Pos logit contributions
 acquaintance+0.021
 ambush+0.019
 introdu+0.019
 equation+0.017
 assail+0.017
Neg logit contributions
OM-0.057
hers-0.054
yk-0.054
Mand-0.053
kefeller-0.053
Token.
Token ID13
Feature activation+6.13e-02

Pos logit contributions
OM+0.165
hers+0.156
yk+0.155
Mand+0.151
kefeller+0.148
Neg logit contributions
 acquaintance-0.141
 ambush-0.132
 introdu-0.132
 equation-0.124
 assail-0.124
Token
Token ID447
Feature activation+1.92e-02

Pos logit contributions
OM+0.198
hers+0.186
yk+0.185
Mand+0.181
kefeller+0.177
Neg logit contributions
 acquaintance-0.180
 ambush-0.170
 introdu-0.169
 equation-0.160
 assail-0.160
Token
Token ID251
Feature activation+1.04e-01

Pos logit contributions
OM+0.056
hers+0.052
yk+0.052
Mand+0.050
kefeller+0.049
Neg logit contributions
 acquaintance-0.062
 ambush-0.059
 introdu-0.059
 equation-0.056
 assail-0.056
Token disrupt
Token ID10114
Feature activation-2.10e-02

Pos logit contributions
OM+0.189
hers+0.179
yk+0.178
Mand+0.175
kefeller+0.173
Neg logit contributions
 acquaintance-0.126
 ambush-0.118
 introdu-0.117
 equation-0.110
 assail-0.109
Token Israel
Token ID2692
Feature activation-1.21e-01

Pos logit contributions
 acquaintance+0.062
 ambush+0.058
 introdu+0.058
 equation+0.055
 jealousy+0.055
Neg logit contributions
OM-0.067
hers-0.063
yk-0.063
Mand-0.062
kefeller-0.061
Token
Token ID447
Feature activation+1.63e-01

Pos logit contributions
 acquaintance+0.375
 ambush+0.355
 introdu+0.353
 equation+0.336
 assail+0.335
Neg logit contributions
OM-0.370
hers-0.347
yk-0.345
Mand-0.338
kefeller-0.331
Token
Token ID247
Feature activation+1.54e-01

Pos logit contributions
OM+0.413
yk+0.380
hers+0.379
kefeller+0.368
Mand+0.367
Neg logit contributions
 acquaintance-0.591
 ambush-0.563
 introdu-0.561
 equation-0.540
 <--0.536
Tokens
Token ID82
Feature activation+1.55e-02

Pos logit contributions
OM+0.415
yk+0.382
hers+0.382
Mand+0.370
kefeller+0.369
Neg logit contributions
 acquaintance-0.530
 introdu-0.504
 ambush-0.504
 equation-0.483
 <--0.478
Token ongoing
Token ID7044
Feature activation+2.04e-01

Pos logit contributions
OM+0.053
hers+0.050
yk+0.049
Mand+0.048
kefeller+0.048
Neg logit contributions
 acquaintance-0.043
 ambush-0.040
 introdu-0.040
 equation-0.038
 jealousy-0.038
Token violations
Token ID11734
Feature activation-8.07e-02

Pos logit contributions
OM+0.761
hers+0.722
yk+0.720
Mand+0.708
kefeller+0.695
Neg logit contributions
 acquaintance-0.489
 ambush-0.456
 introdu-0.453
 assail-0.425
 equation-0.422
Token and
Token ID290
Feature activation-1.48e-01

Pos logit contributions
 acquaintance+0.214
 ambush+0.201
 introdu+0.200
 equation+0.189
 assail+0.188
Neg logit contributions
OM-0.282
hers-0.267
yk-0.266
Mand-0.260
kefeller-0.255
Token increase
Token ID2620
Feature activation+1.93e-01

Pos logit contributions
 acquaintance+0.405
 ambush+0.381
 introdu+0.379
 equation+0.358
 assail+0.356
Neg logit contributions
OM-0.502
hers-0.475
yk-0.472
Mand-0.463
kefeller-0.455
Token in
Token ID287
Feature activation-4.40e-01

Pos logit contributions
OM+0.599
hers+0.560
yk+0.557
Mand+0.546
kefeller+0.538
Neg logit contributions
 acquaintance-0.587
 introdu-0.555
 ambush-0.555
 assail-0.527
 <--0.524
Token settlement
Token ID9443
Feature activation+1.51e-02

Pos logit contributions
 acquaintance+1.240
 ambush+1.173
 introdu+1.160
 equation+1.105
 jealousy+1.099
Neg logit contributions
OM-1.467
hers-1.395
yk-1.380
Mand-1.351
kefeller-1.333
Token the
Token ID262
Feature activation-6.70e-02

Pos logit contributions
OM+1.511
hers+1.429
yk+1.412
Mand+1.385
kefeller+1.353
Neg logit contributions
 acquaintance-1.279
 ambush-1.209
 introdu-1.203
 equation-1.144
 assail-1.138
Token 100
Token ID1802
Feature activation+5.30e-02

Pos logit contributions
 acquaintance+0.189
 ambush+0.178
 introdu+0.178
 equation+0.168
 assail+0.168
Neg logit contributions
OM-0.222
hers-0.210
yk-0.208
Mand-0.204
kefeller-0.201
Token-
Token ID12
Feature activation+6.87e-03

Pos logit contributions
OM+0.164
hers+0.155
yk+0.153
Mand+0.150
kefeller+0.147
Neg logit contributions
 acquaintance-0.161
 ambush-0.153
 introdu-0.152
 equation-0.145
 assail-0.144
Tokenplayer
Token ID7829
Feature activation-4.54e-02

Pos logit contributions
OM+0.023
hers+0.022
yk+0.022
Mand+0.022
kefeller+0.021
Neg logit contributions
 acquaintance-0.019
 ambush-0.018
 introdu-0.018
 equation-0.017
 assail-0.017
Token arena
Token ID13478
Feature activation+1.66e-01

Pos logit contributions
 acquaintance+0.127
 ambush+0.119
 introdu+0.119
 equation+0.113
 jealousy+0.111
Neg logit contributions
OM-0.153
hers-0.145
yk-0.144
Mand-0.141
kefeller-0.139
Token title
Token ID3670
Feature activation+3.04e-01

Pos logit contributions
OM+0.527
hers+0.497
yk+0.493
Mand+0.483
kefeller+0.475
Neg logit contributions
 acquaintance-0.491
 ambush-0.464
 introdu-0.462
 equation-0.439
 assail-0.437
Token made
Token ID925
Feature activation+3.29e-01

Pos logit contributions
OM+0.964
hers+0.908
yk+0.900
Mand+0.881
kefeller+0.865
Neg logit contributions
 acquaintance-0.906
 ambush-0.855
 introdu-0.852
 equation-0.808
 assail-0.807
Token not
Token ID407
Feature activation+3.95e-01

Pos logit contributions
OM+1.023
hers+0.961
yk+0.947
Mand+0.927
kefeller+0.908
Neg logit contributions
 acquaintance-0.992
 ambush-0.943
 introdu-0.934
 equation-0.895
 assail-0.893
Token one
Token ID530
Feature activation-1.70e-01

Pos logit contributions
OM+1.324
hers+1.259
yk+1.241
Mand+1.209
kefeller+1.178
Neg logit contributions
 acquaintance-1.101
 ambush-1.038
 introdu-1.028
 assail-0.987
 equation-0.982
Token,
Token ID11
Feature activation+3.92e-01

Pos logit contributions
 acquaintance+0.464
 ambush+0.437
 introdu+0.437
 equation+0.414
 assail+0.414
Neg logit contributions
OM-0.577
hers-0.543
yk-0.541
Mand-0.532
kefeller-0.516
Token but
Token ID475
Feature activation+3.62e-01

Pos logit contributions
OM+1.269
hers+1.200
yk+1.175
Mand+1.160
kefeller+1.130
Neg logit contributions
 acquaintance-1.131
 ambush-1.066
 introdu-1.065
 equation-1.011
 assail-1.007
Tokenfall
Token ID7207
Feature activation-1.30e-01

Pos logit contributions
OM+1.055
hers+0.990
yk+0.984
Mand+0.962
kefeller+0.929
Neg logit contributions
 acquaintance-0.947
 introdu-0.889
 ambush-0.888
 assail-0.840
 jealousy-0.840
Token (
Token ID357
Feature activation+3.78e-01

Pos logit contributions
 acquaintance+0.365
 ambush+0.344
 introdu+0.343
 equation+0.324
 assail+0.322
Neg logit contributions
OM-0.435
hers-0.412
yk-0.409
Mand-0.400
kefeller-0.395
TokenBlu
Token ID38676
Feature activation-2.06e-02

Pos logit contributions
OM+1.103
hers+1.027
yk+1.020
Mand+1.001
kefeller+0.971
Neg logit contributions
 acquaintance-1.230
 ambush-1.165
 introdu-1.161
 equation-1.108
 assail-1.107
Token-
Token ID12
Feature activation+3.58e-01

Pos logit contributions
 acquaintance+0.067
 ambush+0.063
 introdu+0.063
 equation+0.060
 assail+0.060
Neg logit contributions
OM-0.061
hers-0.057
yk-0.056
Mand-0.056
kefeller-0.054
Tokenray
Token ID2433
Feature activation+2.84e-01

Pos logit contributions
OM+1.179
hers+1.106
yk+1.090
Mand+1.089
kefeller+1.042
Neg logit contributions
 acquaintance-1.046
 ambush-0.981
 introdu-0.974
 equation-0.925
 assail-0.923
Token/
Token ID14
Feature activation+6.04e-01

Pos logit contributions
OM+0.940
hers+0.878
yk+0.871
Mand+0.863
kefeller+0.836
Neg logit contributions
 acquaintance-0.810
 ambush-0.762
 introdu-0.758
 assail-0.718
 equation-0.716
Token DVD
Token ID12490
Feature activation+2.79e-01

Pos logit contributions
OM+1.411
yk+1.266
hers+1.260
Mand+1.244
kefeller+1.153
Neg logit contributions
 acquaintance-2.339
 ambush-2.234
 introdu-2.219
 assail-2.153
 equation-2.139
Token +
Token ID1343
Feature activation+1.83e-01

Pos logit contributions
OM+0.858
hers+0.798
yk+0.794
Mand+0.781
kefeller+0.756
Neg logit contributions
 acquaintance-0.859
 ambush-0.812
 introdu-0.808
 equation-0.768
 assail-0.768
Token Digital
Token ID10231
Feature activation+9.51e-01

Pos logit contributions
OM+0.534
hers+0.500
yk+0.496
Mand+0.485
kefeller+0.473
Neg logit contributions
 acquaintance-0.592
 ambush-0.562
 introdu-0.559
 equation-0.534
 assail-0.533
Token Copy
Token ID17393
Feature activation+3.61e-01

Pos logit contributions
OM+3.117
hers+2.882
yk+2.857
Mand+2.843
kefeller+2.711
Neg logit contributions
 acquaintance-2.760
 ambush-2.578
 introdu-2.560
 jealousy-2.452
 assail-2.450
Token)
Token ID8
Feature activation+4.52e-01

Pos logit contributions
OM+1.110
hers+1.039
yk+1.030
Mand+1.014
kefeller+0.985
Neg logit contributions
 acquaintance-1.110
 ambush-1.051
 introdu-1.043
 assail-0.993
 equation-0.992
Token in
Token ID287
Feature activation+3.52e-02

Pos logit contributions
OM+1.007
hers+0.949
yk+0.937
Mand+0.918
kefeller+0.897
Neg logit contributions
 acquaintance-0.939
 ambush-0.884
 introdu-0.882
 equation-0.837
 assail-0.836
Token May
Token ID1737
Feature activation-2.55e-01

Pos logit contributions
OM+0.107
hers+0.101
yk+0.100
Mand+0.098
kefeller+0.096
Neg logit contributions
 acquaintance-0.110
 ambush-0.104
 introdu-0.103
 equation-0.098
 assail-0.098
Token 2015
Token ID1853
Feature activation-4.98e-01

Pos logit contributions
 acquaintance+0.717
 ambush+0.677
 introdu+0.673
 equation+0.639
 jealousy+0.633
Neg logit contributions
OM-0.845
hers-0.801
yk-0.797
Mand-0.780
kefeller-0.770
Token.
Token ID13
Feature activation+1.02e-01

Pos logit contributions
 acquaintance+1.382
 ambush+1.299
 introdu+1.295
 equation+1.224
 assail+1.218
Neg logit contributions
OM-1.684
hers-1.592
yk-1.582
Mand-1.550
kefeller-1.523
Token According
Token ID4784
Feature activation+4.87e-01

Pos logit contributions
OM+0.349
hers+0.329
yk+0.326
Mand+0.321
kefeller+0.313
Neg logit contributions
 acquaintance-0.283
 ambush-0.265
 introdu-0.264
 equation-0.250
 assail-0.249
Token to
Token ID284
Feature activation+6.97e-01

Pos logit contributions
OM+1.651
hers+1.563
yk+1.551
Mand+1.519
kefeller+1.490
Neg logit contributions
 acquaintance-1.340
 ambush-1.258
 introdu-1.255
 equation-1.187
 assail-1.185
Token the
Token ID262
Feature activation+1.44e-01

Pos logit contributions
OM+2.280
hers+2.146
yk+2.115
Mand+2.082
OSH+2.024
Neg logit contributions
 acquaintance-1.978
 ambush-1.873
 introdu-1.857
 assail-1.774
aign-1.765
Token Orange
Token ID11942
Feature activation-2.97e-01

Pos logit contributions
OM+0.460
hers+0.434
yk+0.431
Mand+0.422
kefeller+0.414
Neg logit contributions
 acquaintance-0.424
 ambush-0.400
 introdu-0.399
 equation-0.378
 assail-0.377
Token County
Token ID3418
Feature activation-4.18e-01

Pos logit contributions
 acquaintance+0.817
 ambush+0.768
 introdu+0.765
 equation+0.722
 assail+0.720
Neg logit contributions
OM-1.010
hers-0.957
yk-0.950
Mand-0.932
kefeller-0.912
Token Register
Token ID17296
Feature activation-3.61e-01

Pos logit contributions
 acquaintance+1.080
 ambush+1.010
 introdu+1.007
 equation+0.950
 jealousy+0.943
Neg logit contributions
OM-1.491
yk-1.417
hers-1.415
Mand-1.377
kefeller-1.372
Token,
Token ID11
Feature activation+1.41e-01

Pos logit contributions
 acquaintance+0.981
 ambush+0.922
 introdu+0.918
 equation+0.866
 assail+0.862
Neg logit contributions
OM-1.242
hers-1.173
yk-1.166
Mand-1.144
kefeller-1.121
Tokend
Token ID67
Feature activation-2.52e-01

Pos logit contributions
OM+0.940
hers+0.877
yk+0.868
Mand+0.848
kefeller+0.826
Neg logit contributions
 acquaintance-1.138
 ambush-1.081
 introdu-1.079
 equation-1.030
 assail-1.027
Token forbid
Token ID33277
Feature activation-1.60e-02

Pos logit contributions
 acquaintance+0.687
 ambush+0.646
 introdu+0.643
 equation+0.608
 assail+0.605
Neg logit contributions
OM-0.860
hers-0.814
yk-0.810
Mand-0.794
kefeller-0.785
Token.
Token ID13
Feature activation+1.28e-01

Pos logit contributions
 acquaintance+0.040
 ambush+0.037
 introdu+0.037
 equation+0.035
 assail+0.035
Neg logit contributions
OM-0.058
hers-0.055
yk-0.055
Mand-0.054
kefeller-0.053
Token
Token ID198
Feature activation-5.22e-02

Pos logit contributions
OM+0.433
hers+0.409
yk+0.406
Mand+0.399
kefeller+0.388
Neg logit contributions
 acquaintance-0.355
 ambush-0.333
 introdu-0.332
 equation-0.313
 assail-0.313
Token
Token ID198
Feature activation-9.53e-02

Pos logit contributions
 acquaintance+0.142
 ambush+0.133
 introdu+0.133
 equation+0.125
 jealousy+0.124
Neg logit contributions
OM-0.178
hers-0.169
yk-0.168
Mand-0.165
kefeller-0.163
TokenI
Token ID40
Feature activation-2.49e-01

Pos logit contributions
 acquaintance+0.302
 ambush+0.287
 introdu+0.285
 equation+0.272
 jealousy+0.271
Neg logit contributions
OM-0.279
hers-0.264
yk-0.262
kefeller-0.255
Mand-0.253
Token strongly
Token ID7634
Feature activation+1.03e-02

Pos logit contributions
 acquaintance+0.634
 ambush+0.594
 introdu+0.590
 equation+0.557
 assail+0.554
Neg logit contributions
OM-0.896
hers-0.852
yk-0.846
Mand-0.832
kefeller-0.822
Token advocate
Token ID12811
Feature activation-1.71e-01

Pos logit contributions
OM+0.042
hers+0.040
yk+0.040
Mand+0.039
kefeller+0.038
Neg logit contributions
 acquaintance-0.022
 ambush-0.020
 introdu-0.020
 equation-0.018
 jealousy-0.018
Token the
Token ID262
Feature activation+1.02e-01

Pos logit contributions
 acquaintance+0.452
 ambush+0.425
 introdu+0.423
 equation+0.399
 assail+0.397
Neg logit contributions
OM-0.599
hers-0.568
yk-0.564
Mand-0.553
kefeller-0.543
Token rights
Token ID2489
Feature activation+3.36e-01

Pos logit contributions
OM+0.349
hers+0.330
yk+0.328
Mand+0.321
kefeller+0.316
Neg logit contributions
 acquaintance-0.278
 ambush-0.261
 introdu-0.260
 equation-0.246
 assail-0.244
Token of
Token ID286
Feature activation+3.18e-01

Pos logit contributions
OM+1.139
hers+1.078
yk+1.073
Mand+1.047
kefeller+1.014
Neg logit contributions
 acquaintance-0.917
 ambush-0.873
 introdu-0.859
 assail-0.812
 <--0.809
Token different
Token ID1180
Feature activation-9.30e-03

Pos logit contributions
 acquaintance+0.183
 ambush+0.174
 introdu+0.174
 equation+0.166
 assail+0.165
Neg logit contributions
OM-0.151
hers-0.141
yk-0.140
Mand-0.136
kefeller-0.134
Token ways
Token ID2842
Feature activation+8.88e-02

Pos logit contributions
 acquaintance+0.030
 ambush+0.028
 introdu+0.028
 equation+0.027
 assail+0.027
Neg logit contributions
OM-0.028
hers-0.026
yk-0.026
Mand-0.025
kefeller-0.025
Token and
Token ID290
Feature activation-1.27e-01

Pos logit contributions
OM+0.299
hers+0.283
yk+0.280
Mand+0.275
kefeller+0.270
Neg logit contributions
 acquaintance-0.248
 ambush-0.233
 introdu-0.232
 equation-0.219
 assail-0.218
Token enhance
Token ID9494
Feature activation-1.47e-01

Pos logit contributions
 acquaintance+0.334
 ambush+0.313
 introdu+0.312
 equation+0.294
 jealousy+0.292
Neg logit contributions
OM-0.450
hers-0.426
yk-0.424
Mand-0.416
kefeller-0.412
Token a
Token ID257
Feature activation-8.90e-02

Pos logit contributions
 acquaintance+0.428
 ambush+0.405
 introdu+0.403
 equation+0.382
 assail+0.381
Neg logit contributions
OM-0.473
hers-0.446
yk-0.443
Mand-0.433
kefeller-0.426
Token kids
Token ID3988
Feature activation+4.16e-02

Pos logit contributions
 acquaintance+0.270
 ambush+0.255
 introdu+0.254
 equation+0.242
 assail+0.240
Neg logit contributions
OM-0.277
hers-0.261
yk-0.259
Mand-0.254
kefeller-0.249
Token imagination
Token ID13843
Feature activation-2.90e-02

Pos logit contributions
OM+0.146
hers+0.139
yk+0.138
Mand+0.135
kefeller+0.132
Neg logit contributions
 acquaintance-0.108
 ambush-0.102
 introdu-0.102
 assail-0.096
 equation-0.095
Token and
Token ID290
Feature activation-3.43e-01

Pos logit contributions
 acquaintance+0.081
 ambush+0.077
 introdu+0.077
 equation+0.072
 assail+0.072
Neg logit contributions
OM-0.097
hers-0.092
yk-0.091
Mand-0.089
kefeller-0.087
Token problem
Token ID1917
Feature activation-1.88e-01

Pos logit contributions
 acquaintance+0.949
 ambush+0.889
 introdu+0.887
 equation+0.841
 jealousy+0.834
Neg logit contributions
OM-1.161
hers-1.099
yk-1.094
Mand-1.073
kefeller-1.062
Token solving
Token ID18120
Feature activation+3.26e-01

Pos logit contributions
 acquaintance+0.633
 ambush+0.602
 introdu+0.601
 equation+0.574
 assail+0.571
Neg logit contributions
OM-0.525
hers-0.490
yk-0.487
Mand-0.474
kefeller-0.465
Token skills
Token ID4678
Feature activation-1.61e-01

Pos logit contributions
OM+1.132
hers+1.068
yk+1.063
Mand+1.042
kefeller+1.013
Neg logit contributions
 acquaintance-0.868
 ambush-0.820
 introdu-0.819
 assail-0.769
 jealousy-0.763
Token a
Token ID257
Feature activation-4.56e-02

Pos logit contributions
 acquaintance+0.479
 ambush+0.451
 introdu+0.449
 equation+0.425
 assail+0.423
Neg logit contributions
OM-0.572
hers-0.541
yk-0.536
Mand-0.526
kefeller-0.514
Token paddle
Token ID39517
Feature activation+1.91e-01

Pos logit contributions
 acquaintance+0.127
 ambush+0.120
 introdu+0.119
 equation+0.113
 assail+0.112
Neg logit contributions
OM-0.153
hers-0.145
yk-0.144
Mand-0.141
kefeller-0.139
Token sweep
Token ID16085
Feature activation+4.02e-01

Pos logit contributions
OM+0.569
hers+0.534
yk+0.530
Mand+0.517
kefeller+0.506
Neg logit contributions
 acquaintance-0.603
 introdu-0.570
 ambush-0.570
 equation-0.542
 jealousy-0.540
Token,
Token ID11
Feature activation-1.64e-01

Pos logit contributions
OM+1.344
hers+1.272
yk+1.260
Mand+1.234
kefeller+1.202
Neg logit contributions
 acquaintance-1.127
 ambush-1.059
 introdu-1.056
 equation-0.996
 assail-0.996
Token to
Token ID284
Feature activation-3.80e-02

Pos logit contributions
 acquaintance+0.448
 ambush+0.421
 introdu+0.419
 equation+0.396
 assail+0.394
Neg logit contributions
OM-0.561
hers-0.531
yk-0.528
Mand-0.517
kefeller-0.509
Token a
Token ID257
Feature activation+1.61e-01

Pos logit contributions
 acquaintance+0.097
 ambush+0.091
 introdu+0.090
 equation+0.085
 assail+0.085
Neg logit contributions
OM-0.137
hers-0.130
yk-0.129
Mand-0.126
kefeller-0.124
Token full
Token ID1336
Feature activation+1.20e-01

Pos logit contributions
OM+0.552
hers+0.523
yk+0.520
Mand+0.509
kefeller+0.499
Neg logit contributions
 acquaintance-0.436
 ambush-0.409
 introdu-0.408
 equation-0.385
 assail-0.384
Token delivery
Token ID7585
Feature activation+1.41e-01

Pos logit contributions
OM+0.409
hers+0.388
yk+0.386
Mand+0.377
kefeller+0.368
Neg logit contributions
 acquaintance-0.329
 introdu-0.308
 ambush-0.308
 assail-0.290
 equation-0.290
Token outside
Token ID2354
Feature activation-5.76e-02

Pos logit contributions
OM+0.482
hers+0.456
yk+0.453
Mand+0.444
kefeller+0.434
Neg logit contributions
 acquaintance-0.383
 ambush-0.359
 introdu-0.359
 equation-0.338
 assail-0.337
Token off
Token ID572
Feature activation-4.13e-01

Pos logit contributions
 acquaintance+0.163
 ambush+0.154
 introdu+0.153
 equation+0.145
 assail+0.144
Neg logit contributions
OM-0.191
hers-0.180
yk-0.179
Mand-0.175
kefeller-0.172
Token stump
Token ID42497
Feature activation-3.05e-01

Pos logit contributions
 acquaintance+1.153
 ambush+1.087
 introdu+1.083
 equation+1.025
 assail+1.018
Neg logit contributions
OM-1.381
hers-1.305
yk-1.298
Mand-1.271
kefeller-1.255
Token self
Token ID2116
Feature activation-1.51e-01

Pos logit contributions
 acquaintance+1.237
 ambush+1.151
 introdu+1.144
 equation+1.076
 jealousy+1.068
Neg logit contributions
OM-1.982
hers-1.888
yk-1.881
Mand-1.845
kefeller-1.832
Token has
Token ID468
Feature activation-7.51e-02

Pos logit contributions
 acquaintance+0.423
 ambush+0.398
 introdu+0.397
 assail+0.373
 equation+0.373
Neg logit contributions
OM-0.506
hers-0.478
yk-0.471
Mand-0.465
kefeller-0.453
Token a
Token ID257
Feature activation+4.11e-01

Pos logit contributions
 acquaintance+0.210
 ambush+0.198
 introdu+0.197
 equation+0.187
 assail+0.185
Neg logit contributions
OM-0.251
hers-0.238
yk-0.237
Mand-0.232
kefeller-0.228
Token wealth
Token ID5129
Feature activation+7.71e-01

Pos logit contributions
OM+1.399
hers+1.321
yk+1.313
Mand+1.288
kefeller+1.261
Neg logit contributions
 acquaintance-1.127
 ambush-1.057
 introdu-1.056
 assail-0.994
 equation-0.991
Token of
Token ID286
Feature activation+4.02e-01

Pos logit contributions
OM+2.503
yk+2.357
hers+2.347
Mand+2.268
kefeller+2.204
Neg logit contributions
 acquaintance-2.176
 introdu-2.057
 ambush-2.045
 assail-1.955
 <--1.927
Token statistics
Token ID7869
Feature activation-1.06e-01

Pos logit contributions
OM+1.364
hers+1.287
yk+1.281
Mand+1.257
kefeller+1.224
Neg logit contributions
 acquaintance-1.090
 ambush-1.024
 introdu-1.021
 assail-0.969
 equation-0.962
Token which
Token ID543
Feature activation-5.60e-02

Pos logit contributions
 acquaintance+0.277
 ambush+0.260
 introdu+0.259
 equation+0.245
 assail+0.243
Neg logit contributions
OM-0.373
hers-0.354
yk-0.352
Mand-0.346
kefeller-0.341
Token back
Token ID736
Feature activation-2.69e-01

Pos logit contributions
 acquaintance+0.140
 ambush+0.131
 introdu+0.130
 equation+0.122
 assail+0.121
Neg logit contributions
OM-0.204
hers-0.194
yk-0.193
Mand-0.189
kefeller-0.187
Token up
Token ID510
Feature activation-6.15e-02

Pos logit contributions
 acquaintance+0.687
 ambush+0.644
 introdu+0.641
 equation+0.605
 assail+0.599
Neg logit contributions
OM-0.962
hers-0.914
yk-0.910
Mand-0.894
kefeller-0.881
Token the
Token ID262
Feature activation-1.70e-01

Pos logit contributions
 acquaintance+0.159
 ambush+0.149
 introdu+0.148
 equation+0.139
 assail+0.139
Neg logit contributions
OM-0.219
hers-0.208
yk-0.207
Mand-0.203
kefeller-0.200
Token idea
Token ID2126
Feature activation+1.10e-01

Pos logit contributions
 acquaintance+0.435
 ambush+0.407
 introdu+0.405
 equation+0.382
 jealousy+0.378
Neg logit contributions
OM-0.613
hers-0.582
yk-0.579
Mand-0.568
kefeller-0.560
Token each
Token ID1123
Feature activation+5.50e-02

Pos logit contributions
 acquaintance+0.790
 ambush+0.744
 introdu+0.741
 equation+0.700
 assail+0.698
Neg logit contributions
OM-0.951
hers-0.899
yk-0.892
Mand-0.875
kefeller-0.859
Token element
Token ID5002
Feature activation-1.17e-01

Pos logit contributions
OM+0.187
hers+0.177
yk+0.175
Mand+0.172
kefeller+0.168
Neg logit contributions
 acquaintance-0.150
 ambush-0.141
 introdu-0.141
 assail-0.133
 jealousy-0.133
Token of
Token ID286
Feature activation-4.17e-01

Pos logit contributions
 acquaintance+0.304
 ambush+0.285
 introdu+0.283
 equation+0.266
 assail+0.266
Neg logit contributions
OM-0.414
hers-0.393
yk-0.390
Mand-0.383
kefeller-0.376
Token `
Token ID4600
Feature activation-4.33e-02

Pos logit contributions
 acquaintance+1.149
 ambush+1.082
 introdu+1.075
 assail+1.014
 jealousy+1.012
Neg logit contributions
OM-1.415
hers-1.339
yk-1.326
Mand-1.302
kefeller-1.276
Tokense
Token ID325
Feature activation+3.08e-01

Pos logit contributions
 acquaintance+0.122
 ambush+0.115
 introdu+0.114
 assail+0.108
 jealousy+0.108
Neg logit contributions
OM-0.145
hers-0.137
yk-0.136
Mand-0.133
kefeller-0.130
Tokenach
Token ID620
Feature activation-3.25e-01

Pos logit contributions
OM+1.109
hers+1.057
yk+1.048
Mand+1.022
kefeller+0.982
Neg logit contributions
 acquaintance-0.785
 introdu-0.749
 ambush-0.736
 jealousy-0.685
 equation-0.685
TokenSpace
Token ID14106
Feature activation-3.52e-01

Pos logit contributions
 acquaintance+1.376
 introdu+1.318
 ambush+1.317
 equation+1.265
 assail+1.264
Neg logit contributions
OM-0.636
hers-0.575
yk-0.561
Mand-0.547
kefeller-0.523
Token`
Token ID63
Feature activation-3.07e-01

Pos logit contributions
 acquaintance+0.607
 ambush+0.548
 introdu+0.544
 assail+0.491
 equation+0.490
Neg logit contributions
OM-1.563
hers-1.499
yk-1.488
Mand-1.469
kefeller-1.449
Token to
Token ID284
Feature activation-1.63e-01

Pos logit contributions
 acquaintance+0.779
 ambush+0.728
 introdu+0.726
 equation+0.683
 assail+0.678
Neg logit contributions
OM-1.105
hers-1.049
yk-1.043
Mand-1.023
kefeller-1.009
Token each
Token ID1123
Feature activation+3.26e-01

Pos logit contributions
 acquaintance+0.428
 ambush+0.401
 introdu+0.399
 equation+0.375
 assail+0.374
Neg logit contributions
OM-0.576
hers-0.547
yk-0.542
Mand-0.532
kefeller-0.523
Token of
Token ID286
Feature activation+3.47e-02

Pos logit contributions
OM+1.169
hers+1.110
yk+1.101
Mand+1.084
kefeller+1.055
Neg logit contributions
 acquaintance-0.822
 ambush-0.771
 introdu-0.767
 assail-0.721
 jealousy-0.719
Token,
Token ID11
Feature activation+5.20e-02

Pos logit contributions
 acquaintance+0.443
 ambush+0.417
 introdu+0.415
 equation+0.393
 assail+0.390
Neg logit contributions
OM-0.536
hers-0.507
yk-0.504
Mand-0.494
kefeller-0.486
Token
Token ID447
Feature activation-5.84e-02

Pos logit contributions
OM+0.162
hers+0.152
yk+0.151
Mand+0.148
kefeller+0.145
Neg logit contributions
 acquaintance-0.158
 ambush-0.149
 introdu-0.149
 equation-0.141
 assail-0.141
Token
Token ID251
Feature activation+8.20e-02

Pos logit contributions
 acquaintance+0.182
 introdu+0.171
 ambush+0.171
 equation+0.163
 <-+0.162
Neg logit contributions
OM-0.178
hers-0.166
yk-0.166
Mand-0.163
kefeller-0.162
Token he
Token ID339
Feature activation-1.70e-01

Pos logit contributions
OM+0.279
hers+0.264
yk+0.262
Mand+0.257
kefeller+0.253
Neg logit contributions
 acquaintance-0.226
 ambush-0.212
 introdu-0.211
 equation-0.200
 assail-0.199
Token says
Token ID1139
Feature activation-6.69e-01

Pos logit contributions
 acquaintance+0.452
 introdu+0.424
 ambush+0.423
 equation+0.398
 assail+0.395
Neg logit contributions
OM-0.591
hers-0.557
yk-0.555
Mand-0.545
kefeller-0.539
Token j
Token ID474
Feature activation-8.77e-01

Pos logit contributions
 acquaintance+1.785
 introdu+1.678
 ambush+1.673
 equation+1.571
 jealousy+1.566
Neg logit contributions
OM-2.320
hers-2.201
yk-2.187
Mand-2.145
kefeller-2.126
Tokenokingly
Token ID48343
Feature activation+3.10e-02

Pos logit contributions
 acquaintance+2.268
 ambush+2.111
 introdu+2.088
 <-+1.989
 equation+1.978
Neg logit contributions
OM-3.079
hers-2.916
yk-2.890
Mand-2.866
kefeller-2.816
Token.
Token ID13
Feature activation-1.27e-01

Pos logit contributions
OM+0.102
hers+0.096
yk+0.096
Mand+0.094
kefeller+0.091
Neg logit contributions
 acquaintance-0.088
 ambush-0.083
 introdu-0.083
 equation-0.079
 assail-0.079
Token One
Token ID1881
Feature activation-4.52e-01

Pos logit contributions
 acquaintance+0.339
 ambush+0.318
 introdu+0.316
 equation+0.298
 assail+0.297
Neg logit contributions
OM-0.445
hers-0.422
yk-0.418
Mand-0.411
kefeller-0.403
Token of
Token ID286
Feature activation+4.27e-01

Pos logit contributions
 acquaintance+1.194
 ambush+1.112
 introdu+1.107
 equation+1.047
 jealousy+1.039
Neg logit contributions
OM-1.591
hers-1.513
yk-1.500
Mand-1.472
kefeller-1.456
Token the
Token ID262
Feature activation+3.60e-01

Pos logit contributions
OM+1.376
hers+1.300
yk+1.287
Mand+1.261
kefeller+1.229
Neg logit contributions
 acquaintance-1.242
 ambush-1.173
 introdu-1.169
 assail-1.113
 equation-1.111
Token her
Token ID607
Feature activation+2.42e-02

Pos logit contributions
 acquaintance+0.243
 ambush+0.227
 introdu+0.227
 equation+0.213
 jealousy+0.213
Neg logit contributions
OM-0.328
hers-0.311
yk-0.310
Mand-0.304
kefeller-0.300
Token get
Token ID651
Feature activation+4.50e-02

Pos logit contributions
OM+0.085
hers+0.080
yk+0.080
Mand+0.078
kefeller+0.077
Neg logit contributions
 acquaintance-0.064
 ambush-0.060
 introdu-0.060
 equation-0.057
 assail-0.056
Token married
Token ID6405
Feature activation-9.99e-02

Pos logit contributions
OM+0.165
hers+0.157
yk+0.156
Mand+0.154
kefeller+0.152
Neg logit contributions
 acquaintance-0.112
 ambush-0.104
 introdu-0.104
 equation-0.097
 jealousy-0.097
Token?
Token ID30
Feature activation+4.36e-02

Pos logit contributions
 acquaintance+0.291
 ambush+0.275
 introdu+0.274
 equation+0.260
 assail+0.259
Neg logit contributions
OM-0.323
hers-0.305
yk-0.302
Mand-0.295
kefeller-0.289
Token We
Token ID775
Feature activation-1.53e-01

Pos logit contributions
OM+0.149
hers+0.141
yk+0.140
Mand+0.137
kefeller+0.135
Neg logit contributions
 acquaintance-0.119
 ambush-0.112
 introdu-0.112
 equation-0.105
 assail-0.105
Token feel
Token ID1254
Feature activation-7.76e-01

Pos logit contributions
 acquaintance+0.384
 ambush+0.360
 introdu+0.358
 equation+0.336
 assail+0.334
Neg logit contributions
OM-0.555
hers-0.527
yk-0.524
Mand-0.516
kefeller-0.509
Token like
Token ID588
Feature activation+7.23e-01

Pos logit contributions
 acquaintance+1.840
 ambush+1.714
 introdu+1.714
 jealousy+1.624
 equation+1.595
Neg logit contributions
OM-2.902
yk-2.758
hers-2.755
kefeller-2.713
Mand-2.704
Token not
Token ID407
Feature activation+8.57e-02

Pos logit contributions
OM+2.477
hers+2.364
yk+2.314
Mand+2.274
 Mills+2.188
Neg logit contributions
 acquaintance-1.928
 ambush-1.819
 introdu-1.788
 assail-1.720
 equation-1.706
Token inviting
Token ID23310
Feature activation-4.69e-01

Pos logit contributions
OM+0.277
hers+0.262
yk+0.260
Mand+0.254
kefeller+0.249
Neg logit contributions
 acquaintance-0.249
 ambush-0.235
 introdu-0.234
 equation-0.222
 assail-0.221
Token Barbara
Token ID16685
Feature activation-4.32e-01

Pos logit contributions
 acquaintance+1.142
 introdu+1.061
 ambush+1.058
 jealousy+0.990
 equation+0.989
Neg logit contributions
OM-1.742
yk-1.652
hers-1.651
Mand-1.617
kefeller-1.605
Token would
Token ID561
Feature activation-4.49e-01

Pos logit contributions
 acquaintance+1.149
 introdu+1.072
 ambush+1.066
 equation+1.007
 jealousy+1.005
Neg logit contributions
OM-1.514
hers-1.430
yk-1.428
Mand-1.395
kefeller-1.386
Tokenoliath
Token ID43009
Feature activation+4.65e-02

Pos logit contributions
OM+0.225
hers+0.205
yk+0.202
Mand+0.195
kefeller+0.188
Neg logit contributions
 acquaintance-0.437
 introdu-0.419
 ambush-0.419
 equation-0.403
 assail-0.402
Token TBA
Token ID48591
Feature activation+2.06e-01

Pos logit contributions
OM+0.121
hers+0.112
yk+0.110
Mand+0.108
kefeller+0.104
Neg logit contributions
 acquaintance-0.166
 ambush-0.157
 introdu-0.157
 equation-0.151
 assail-0.150
Token TBA
Token ID48591
Feature activation+4.18e-01

Pos logit contributions
OM+0.555
hers+0.512
yk+0.508
Mand+0.497
kefeller+0.479
Neg logit contributions
 acquaintance-0.716
 introdu-0.678
 ambush-0.677
 equation-0.649
 jealousy-0.646
Token
Token ID198
Feature activation+3.55e-01

Pos logit contributions
OM+1.434
hers+1.346
yk+1.343
Mand+1.320
kefeller+1.278
Neg logit contributions
 acquaintance-1.148
 ambush-1.069
 introdu-1.067
 equation-1.011
 assail-1.006
Token
Token ID198
Feature activation+4.31e-01

Pos logit contributions
OM+1.188
hers+1.116
yk+1.107
Mand+1.091
kefeller+1.053
Neg logit contributions
 acquaintance-1.008
 ambush-0.945
 introdu-0.944
 assail-0.901
 equation-0.894
TokenCBS
Token ID22923
Feature activation-8.19e-01

Pos logit contributions
OM+1.255
hers+1.162
yk+1.159
Mand+1.139
kefeller+1.091
Neg logit contributions
 acquaintance-1.413
 ambush-1.339
 introdu-1.336
 assail-1.277
 equation-1.277
Token All
Token ID1439
Feature activation+2.70e-01

Pos logit contributions
 acquaintance+2.267
 ambush+2.152
 introdu+2.139
 equation+2.018
 <-+2.010
Neg logit contributions
OM-2.700
hers-2.588
yk-2.572
Mand-2.515
kefeller-2.491
Token Access
Token ID8798
Feature activation-1.70e-01

Pos logit contributions
OM+0.759
hers+0.707
yk+0.702
Mand+0.686
kefeller+0.662
Neg logit contributions
 acquaintance-0.904
 ambush-0.861
 introdu-0.848
 equation-0.821
 assail-0.819
Token
Token ID198
Feature activation+2.24e-01

Pos logit contributions
 acquaintance+0.473
 ambush+0.445
 introdu+0.444
 equation+0.420
 assail+0.417
Neg logit contributions
OM-0.571
hers-0.541
yk-0.538
Mand-0.526
kefeller-0.519
Token
Token ID198
Feature activation+3.22e-01

Pos logit contributions
OM+0.746
hers+0.703
yk+0.696
Mand+0.685
kefeller+0.662
Neg logit contributions
 acquaintance-0.639
 introdu-0.600
 ambush-0.599
 assail-0.573
 equation-0.567
TokenAll
Token ID3237
Feature activation+2.54e-01

Pos logit contributions
OM+0.922
hers+0.853
yk+0.847
Mand+0.836
kefeller+0.799
Neg logit contributions
 acquaintance-1.075
 ambush-1.020
 introdu-1.017
 assail-0.974
 equation-0.974
TokenI
Token ID40
Feature activation-3.42e-01

Pos logit contributions
 acquaintance+0.548
 ambush+0.521
 introdu+0.518
 equation+0.496
 <-+0.493
Neg logit contributions
OM-0.480
hers-0.453
yk-0.450
kefeller-0.438
Mand-0.434
Token don
Token ID836
Feature activation+1.69e-01

Pos logit contributions
 acquaintance+0.913
 ambush+0.855
 introdu+0.853
 equation+0.804
 assail+0.799
Neg logit contributions
OM-1.179
hers-1.125
yk-1.115
Mand-1.098
kefeller-1.086
Token
Token ID447
Feature activation+4.52e-01

Pos logit contributions
OM+0.671
hers+0.640
yk+0.636
Mand+0.626
kefeller+0.615
Neg logit contributions
 acquaintance-0.370
 ambush-0.341
 introdu-0.340
 equation-0.316
 assail-0.315
Token
Token ID247
Feature activation+1.55e-01

Pos logit contributions
OM+1.438
hers+1.355
yk+1.345
Mand+1.315
kefeller+1.289
Neg logit contributions
 acquaintance-1.339
 ambush-1.267
 introdu-1.262
 equation-1.198
 assail-1.195
Tokent
Token ID83
Feature activation+1.13e-01

Pos logit contributions
OM+0.484
hers+0.456
yk+0.453
Mand+0.443
kefeller+0.435
Neg logit contributions
 acquaintance-0.467
 ambush-0.441
 introdu-0.440
 equation-0.418
 assail-0.416
Token feel
Token ID1254
Feature activation-8.77e-01

Pos logit contributions
OM+0.434
hers+0.413
yk+0.409
Mand+0.403
kefeller+0.395
Neg logit contributions
 acquaintance-0.257
 introdu-0.239
 ambush-0.239
 equation-0.222
 assail-0.221
Token that
Token ID326
Feature activation+2.69e-01

Pos logit contributions
 acquaintance+2.137
 ambush+2.001
 introdu+1.983
 jealousy+1.893
 equation+1.872
Neg logit contributions
OM-3.199
yk-3.056
hers-3.051
kefeller-2.995
Mand-2.977
Token there
Token ID612
Feature activation+1.63e-01

Pos logit contributions
OM+0.936
hers+0.888
yk+0.880
Mand+0.864
kefeller+0.846
Neg logit contributions
 acquaintance-0.715
 ambush-0.672
 introdu-0.669
 assail-0.629
 equation-0.627
Token was
Token ID373
Feature activation+1.13e-01

Pos logit contributions
OM+0.552
hers+0.523
yk+0.517
Mand+0.509
kefeller+0.496
Neg logit contributions
 acquaintance-0.451
 ambush-0.425
 introdu-0.423
 assail-0.399
 equation-0.398
Token any
Token ID597
Feature activation-2.90e-01

Pos logit contributions
OM+0.416
hers+0.395
yk+0.392
Mand+0.386
kefeller+0.379
Neg logit contributions
 acquaintance-0.280
 ambush-0.261
 introdu-0.260
 equation-0.244
 assail-0.243
Token compromise
Token ID13110
Feature activation-4.36e-02

Pos logit contributions
 acquaintance+0.764
 ambush+0.720
 introdu+0.711
 equation+0.676
 jealousy+0.672
Neg logit contributions
OM-1.020
hers-0.967
yk-0.963
Mand-0.941
kefeller-0.934
Token free
Token ID1479
Feature activation-2.03e-01

Pos logit contributions
 acquaintance+1.100
 ambush+1.058
 introdu+1.052
 equation+1.006
 jealousy+1.000
Neg logit contributions
OM-0.918
yk-0.860
hers-0.860
Mand-0.832
kefeller-0.825
Token newsletters
Token ID16983
Feature activation-4.03e-01

Pos logit contributions
 acquaintance+0.662
 introdu+0.632
 ambush+0.628
 equation+0.601
 jealousy+0.598
Neg logit contributions
OM-0.572
hers-0.544
yk-0.543
kefeller-0.535
Mand-0.524
Token.
Token ID13
Feature activation-1.17e-02

Pos logit contributions
 acquaintance+0.997
 ambush+0.948
 introdu+0.937
 equation+0.894
 <-+0.881
Neg logit contributions
OM-1.461
hers-1.382
yk-1.372
Mand-1.363
kefeller-1.352
Token
Token ID198
Feature activation-2.15e-01

Pos logit contributions
 acquaintance+0.035
 introdu+0.033
 ambush+0.033
 equation+0.032
 <-+0.032
Neg logit contributions
OM-0.036
hers-0.034
yk-0.034
kefeller-0.033
Mand-0.033
Token
Token ID198
Feature activation-5.89e-02

Pos logit contributions
 acquaintance+0.570
 ambush+0.537
 introdu+0.533
 equation+0.503
 <-+0.501
Neg logit contributions
OM-0.743
hers-0.707
yk-0.705
kefeller-0.687
Mand-0.687
TokenUpdate
Token ID10260
Feature activation-8.02e-01

Pos logit contributions
 acquaintance+0.192
 ambush+0.183
 introdu+0.182
 equation+0.173
 jealousy+0.173
Neg logit contributions
OM-0.169
hers-0.159
yk-0.158
Mand-0.153
kefeller-0.152
Token,
Token ID11
Feature activation+1.03e-01

Pos logit contributions
 acquaintance+2.108
 ambush+1.982
 introdu+1.966
 equation+1.861
 <-+1.840
Neg logit contributions
OM-2.796
hers-2.658
yk-2.650
kefeller-2.598
Mand-2.592
Token 3
Token ID513
Feature activation+1.54e-01

Pos logit contributions
OM+0.375
hers+0.356
yk+0.354
Mand+0.348
kefeller+0.342
Neg logit contributions
 acquaintance-0.257
 ambush-0.240
 introdu-0.239
 equation-0.224
 assail-0.223
Token:
Token ID25
Feature activation+1.61e-01

Pos logit contributions
OM+0.493
hers+0.463
yk+0.458
Mand+0.450
kefeller+0.439
Neg logit contributions
 acquaintance-0.459
 ambush-0.433
 introdu-0.431
 equation-0.410
 assail-0.408
Token01
Token ID486
Feature activation-6.25e-02

Pos logit contributions
OM+0.450
hers+0.421
yk+0.417
Mand+0.407
kefeller+0.399
Neg logit contributions
 acquaintance-0.537
 ambush-0.511
 introdu-0.510
 equation-0.487
 assail-0.485
Token p
Token ID279
Feature activation-2.44e-01

Pos logit contributions
 acquaintance+0.164
 ambush+0.153
 introdu+0.152
 equation+0.143
 assail+0.143
Neg logit contributions
OM-0.222
hers-0.209
yk-0.208
Mand-0.204
kefeller-0.199